[2023-03-06 14:27:12,401][03942] Saving configuration to /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/config.json... [2023-03-06 14:27:12,415][03942] Rollout worker 0 uses device cpu [2023-03-06 14:27:12,416][03942] Rollout worker 1 uses device cpu [2023-03-06 14:27:12,416][03942] Rollout worker 2 uses device cpu [2023-03-06 14:27:12,416][03942] Rollout worker 3 uses device cpu [2023-03-06 14:27:12,416][03942] Rollout worker 4 uses device cpu [2023-03-06 14:27:12,416][03942] Rollout worker 5 uses device cpu [2023-03-06 14:27:12,416][03942] Rollout worker 6 uses device cpu [2023-03-06 14:27:12,416][03942] Rollout worker 7 uses device cpu [2023-03-06 14:27:12,417][03942] Rollout worker 8 uses device cpu [2023-03-06 14:27:12,417][03942] Rollout worker 9 uses device cpu [2023-03-06 14:27:12,417][03942] Rollout worker 10 uses device cpu [2023-03-06 14:27:12,417][03942] Rollout worker 11 uses device cpu [2023-03-06 14:27:12,417][03942] Rollout worker 12 uses device cpu [2023-03-06 14:27:12,417][03942] Rollout worker 13 uses device cpu [2023-03-06 14:27:12,417][03942] Rollout worker 14 uses device cpu [2023-03-06 14:27:12,417][03942] Rollout worker 15 uses device cpu [2023-03-06 14:27:12,417][03942] Rollout worker 16 uses device cpu [2023-03-06 14:27:12,417][03942] Rollout worker 17 uses device cpu [2023-03-06 14:27:12,418][03942] Rollout worker 18 uses device cpu [2023-03-06 14:27:12,418][03942] Rollout worker 19 uses device cpu [2023-03-06 14:27:12,418][03942] Rollout worker 20 uses device cpu [2023-03-06 14:27:12,418][03942] Rollout worker 21 uses device cpu [2023-03-06 14:27:12,418][03942] Rollout worker 22 uses device cpu [2023-03-06 14:27:12,418][03942] Rollout worker 23 uses device cpu [2023-03-06 14:27:12,418][03942] Rollout worker 24 uses device cpu [2023-03-06 14:27:12,418][03942] Rollout worker 25 uses device cpu [2023-03-06 14:27:12,418][03942] Rollout worker 26 uses device cpu [2023-03-06 14:27:12,419][03942] Rollout worker 27 uses device 
cpu [2023-03-06 14:27:12,419][03942] Rollout worker 28 uses device cpu [2023-03-06 14:27:12,419][03942] Rollout worker 29 uses device cpu [2023-03-06 14:27:12,419][03942] Rollout worker 30 uses device cpu [2023-03-06 14:27:12,419][03942] Rollout worker 31 uses device cpu [2023-03-06 14:27:12,443][03942] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-03-06 14:27:12,444][03942] InferenceWorker_p0-w0: min num requests: 10 [2023-03-06 14:27:12,522][03942] Starting all processes... [2023-03-06 14:27:12,523][03942] Starting process learner_proc0 [2023-03-06 14:27:12,572][03942] Starting all processes... [2023-03-06 14:27:12,637][03942] Starting process inference_proc0-0 [2023-03-06 14:27:12,637][03942] Starting process rollout_proc0 [2023-03-06 14:27:12,637][03942] Starting process rollout_proc1 [2023-03-06 14:27:12,637][03942] Starting process rollout_proc2 [2023-03-06 14:27:12,637][03942] Starting process rollout_proc3 [2023-03-06 14:27:12,639][03942] Starting process rollout_proc4 [2023-03-06 14:27:12,639][03942] Starting process rollout_proc5 [2023-03-06 14:27:12,642][03942] Starting process rollout_proc6 [2023-03-06 14:27:12,643][03942] Starting process rollout_proc7 [2023-03-06 14:27:12,645][03942] Starting process rollout_proc8 [2023-03-06 14:27:12,649][03942] Starting process rollout_proc9 [2023-03-06 14:27:12,654][03942] Starting process rollout_proc10 [2023-03-06 14:27:12,654][03942] Starting process rollout_proc11 [2023-03-06 14:27:12,658][03942] Starting process rollout_proc13 [2023-03-06 14:27:12,657][03942] Starting process rollout_proc12 [2023-03-06 14:27:12,660][03942] Starting process rollout_proc14 [2023-03-06 14:27:12,663][03942] Starting process rollout_proc15 [2023-03-06 14:27:12,666][03942] Starting process rollout_proc16 [2023-03-06 14:27:12,667][03942] Starting process rollout_proc17 [2023-03-06 14:27:12,667][03942] Starting process rollout_proc18 [2023-03-06 14:27:12,672][03942] Starting process rollout_proc19 [2023-03-06 
14:27:12,672][03942] Starting process rollout_proc20 [2023-03-06 14:27:12,731][03942] Starting process rollout_proc21 [2023-03-06 14:27:12,735][03942] Starting process rollout_proc22 [2023-03-06 14:27:12,740][03942] Starting process rollout_proc23 [2023-03-06 14:27:12,756][03942] Starting process rollout_proc24 [2023-03-06 14:27:12,775][03942] Starting process rollout_proc25 [2023-03-06 14:27:12,775][03942] Starting process rollout_proc26 [2023-03-06 14:27:12,780][03942] Starting process rollout_proc27 [2023-03-06 14:27:12,785][03942] Starting process rollout_proc28 [2023-03-06 14:27:12,790][03942] Starting process rollout_proc29 [2023-03-06 14:27:12,795][03942] Starting process rollout_proc30 [2023-03-06 14:27:12,796][03942] Starting process rollout_proc31 [2023-03-06 14:27:14,532][04221] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-03-06 14:27:14,532][04221] Set environment var CUDA_VISIBLE_DEVICES to '0' (GPU indices [0]) for learning process 0 [2023-03-06 14:27:14,542][04221] Num visible devices: 1 [2023-03-06 14:27:14,568][04221] WARNING! It is generally recommended to enable Fixed KL loss (https://arxiv.org/pdf/1707.06347.pdf) for continuous action tasks to avoid potential numerical issues. I.e. 
set --kl_loss_coeff=0.1 [2023-03-06 14:27:14,569][04221] Starting seed is not provided [2023-03-06 14:27:14,569][04221] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-03-06 14:27:14,569][04221] Initializing actor-critic model on device cuda:0 [2023-03-06 14:27:14,569][04221] RunningMeanStd input shape: (39,) [2023-03-06 14:27:14,570][04221] RunningMeanStd input shape: (1,) [2023-03-06 14:27:14,577][04272] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-03-06 14:27:14,578][04272] Set environment var CUDA_VISIBLE_DEVICES to '0' (GPU indices [0]) for inference process 0 [2023-03-06 14:27:14,588][04272] Num visible devices: 1 [2023-03-06 14:27:14,699][04437] Worker 18 uses CPU cores [18] [2023-03-06 14:27:14,721][04674] Worker 28 uses CPU cores [28] [2023-03-06 14:27:14,739][04221] Created Actor Critic model with architecture:
[2023-03-06 14:27:14,739][04221] ActorCriticSharedWeights(
  (obs_normalizer): ObservationNormalizer(
    (running_mean_std): RunningMeanStdDictInPlace(
      (running_mean_std): ModuleDict(
        (obs): RunningMeanStdInPlace()
      )
    )
  )
  (returns_normalizer): RecursiveScriptModule(original_name=RunningMeanStdInPlace)
  (encoder): MultiInputEncoder(
    (encoders): ModuleDict(
      (obs): MlpEncoder(
        (mlp_head): RecursiveScriptModule(
          original_name=Sequential
          (0): RecursiveScriptModule(original_name=Linear)
          (1): RecursiveScriptModule(original_name=ELU)
          (2): RecursiveScriptModule(original_name=Linear)
          (3): RecursiveScriptModule(original_name=ELU)
        )
      )
    )
  )
  (core): ModelCoreRNN(
    (core): GRU(512, 512)
  )
  (decoder): MlpDecoder(
    (mlp): Identity()
  )
  (critic_linear): Linear(in_features=512, out_features=1, bias=True)
  (action_parameterization): ActionParameterizationDefault(
    (distribution_linear): Linear(in_features=512, out_features=8, bias=True)
  )
)
[2023-03-06 14:27:14,907][04475] Worker 13 uses CPU cores [13] [2023-03-06 14:27:14,971][04709] Worker 31 uses CPU cores [31] [2023-03-06 14:27:15,122][04480] Worker 21 uses CPU cores [21] [2023-03-06 
14:27:15,191][04434] Worker 10 uses CPU cores [10] [2023-03-06 14:27:15,275][04274] Worker 0 uses CPU cores [0] [2023-03-06 14:27:15,387][04479] Worker 14 uses CPU cores [14] [2023-03-06 14:27:15,418][04478] Worker 17 uses CPU cores [17] [2023-03-06 14:27:15,715][04676] Worker 29 uses CPU cores [29] [2023-03-06 14:27:15,721][04433] Worker 5 uses CPU cores [5] [2023-03-06 14:27:15,899][04545] Worker 25 uses CPU cores [25] [2023-03-06 14:27:15,919][04275] Worker 2 uses CPU cores [2] [2023-03-06 14:27:16,068][04481] Worker 22 uses CPU cores [22] [2023-03-06 14:27:16,192][04476] Worker 19 uses CPU cores [19] [2023-03-06 14:27:16,347][04474] Worker 8 uses CPU cores [8] [2023-03-06 14:27:16,391][04276] Worker 3 uses CPU cores [3] [2023-03-06 14:27:16,515][04470] Worker 6 uses CPU cores [6] [2023-03-06 14:27:16,683][04471] Worker 9 uses CPU cores [9] [2023-03-06 14:27:16,802][04472] Worker 11 uses CPU cores [11] [2023-03-06 14:27:16,855][04436] Worker 20 uses CPU cores [20] [2023-03-06 14:27:16,995][04708] Worker 30 uses CPU cores [30] [2023-03-06 14:27:17,069][04438] Worker 16 uses CPU cores [16] [2023-03-06 14:27:17,199][04577] Worker 26 uses CPU cores [26] [2023-03-06 14:27:17,298][04642] Worker 27 uses CPU cores [27] [2023-03-06 14:27:17,403][04473] Worker 7 uses CPU cores [7] [2023-03-06 14:27:17,471][04277] Worker 4 uses CPU cores [4] [2023-03-06 14:27:17,521][04221] Using optimizer [2023-03-06 14:27:17,521][04221] No checkpoints found [2023-03-06 14:27:17,521][04221] Did not load from checkpoint, starting from scratch! [2023-03-06 14:27:17,521][04221] Initialized policy 0 weights for model version 0 [2023-03-06 14:27:17,523][04221] LearnerWorker_p0 finished initialization! 
[2023-03-06 14:27:17,523][04221] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-03-06 14:27:17,536][04609] Worker 24 uses CPU cores [24] [2023-03-06 14:27:17,575][04272] RunningMeanStd input shape: (39,) [2023-03-06 14:27:17,576][04272] RunningMeanStd input shape: (1,) [2023-03-06 14:27:17,610][04513] Worker 23 uses CPU cores [23] [2023-03-06 14:27:17,763][04435] Worker 15 uses CPU cores [15] [2023-03-06 14:27:17,903][04477] Worker 12 uses CPU cores [12] [2023-03-06 14:27:17,949][04273] Worker 1 uses CPU cores [1] [2023-03-06 14:27:18,176][03942] Inference worker 0-0 is ready! [2023-03-06 14:27:18,176][03942] All inference workers are ready! Signal rollout workers to start! [2023-03-06 14:27:18,941][03942] Fps is (10 sec: nan, 60 sec: nan, 300 sec: nan). Total num frames: 0. Throughput: 0: nan. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0) [2023-03-06 14:27:20,416][04478] Decorrelating experience for 0 frames... [2023-03-06 14:27:20,435][04674] Decorrelating experience for 0 frames... [2023-03-06 14:27:20,466][04277] Decorrelating experience for 0 frames... [2023-03-06 14:27:20,524][04436] Decorrelating experience for 0 frames... [2023-03-06 14:27:20,595][04435] Decorrelating experience for 0 frames... [2023-03-06 14:27:20,595][04709] Decorrelating experience for 0 frames... [2023-03-06 14:27:20,601][04477] Decorrelating experience for 0 frames... [2023-03-06 14:27:20,608][04472] Decorrelating experience for 0 frames... [2023-03-06 14:27:20,616][04475] Decorrelating experience for 0 frames... [2023-03-06 14:27:20,619][04676] Decorrelating experience for 0 frames... [2023-03-06 14:27:20,623][04577] Decorrelating experience for 0 frames... [2023-03-06 14:27:20,623][04480] Decorrelating experience for 0 frames... [2023-03-06 14:27:20,625][04276] Decorrelating experience for 0 frames... [2023-03-06 14:27:20,626][04434] Decorrelating experience for 0 frames... [2023-03-06 14:27:20,626][04513] Decorrelating experience for 0 frames... 
[2023-03-06 14:27:20,627][04433] Decorrelating experience for 0 frames... [2023-03-06 14:27:20,630][04476] Decorrelating experience for 0 frames... [2023-03-06 14:27:20,641][04273] Decorrelating experience for 0 frames... [2023-03-06 14:27:20,647][04275] Decorrelating experience for 0 frames... [2023-03-06 14:27:20,648][04642] Decorrelating experience for 0 frames... [2023-03-06 14:27:20,652][04708] Decorrelating experience for 0 frames... [2023-03-06 14:27:20,673][04438] Decorrelating experience for 0 frames... [2023-03-06 14:27:20,676][04470] Decorrelating experience for 0 frames... [2023-03-06 14:27:20,678][04274] Decorrelating experience for 0 frames... [2023-03-06 14:27:20,678][04481] Decorrelating experience for 0 frames... [2023-03-06 14:27:20,679][04473] Decorrelating experience for 0 frames... [2023-03-06 14:27:20,683][04609] Decorrelating experience for 0 frames... [2023-03-06 14:27:20,683][04474] Decorrelating experience for 0 frames... [2023-03-06 14:27:20,683][04471] Decorrelating experience for 0 frames... [2023-03-06 14:27:20,684][04545] Decorrelating experience for 0 frames... [2023-03-06 14:27:20,685][04437] Decorrelating experience for 0 frames... [2023-03-06 14:27:20,713][04479] Decorrelating experience for 0 frames... [2023-03-06 14:27:22,720][04674] Decorrelating experience for 32 frames... [2023-03-06 14:27:22,726][04478] Decorrelating experience for 32 frames... [2023-03-06 14:27:22,760][04277] Decorrelating experience for 32 frames... [2023-03-06 14:27:22,807][04436] Decorrelating experience for 32 frames... [2023-03-06 14:27:22,834][04477] Decorrelating experience for 32 frames... [2023-03-06 14:27:22,861][04273] Decorrelating experience for 32 frames... [2023-03-06 14:27:22,883][04475] Decorrelating experience for 32 frames... [2023-03-06 14:27:22,891][04676] Decorrelating experience for 32 frames... [2023-03-06 14:27:22,921][04577] Decorrelating experience for 32 frames... 
[2023-03-06 14:27:22,926][04434] Decorrelating experience for 32 frames... [2023-03-06 14:27:22,927][04476] Decorrelating experience for 32 frames... [2023-03-06 14:27:22,927][04276] Decorrelating experience for 32 frames... [2023-03-06 14:27:22,938][04472] Decorrelating experience for 32 frames... [2023-03-06 14:27:22,948][04513] Decorrelating experience for 32 frames... [2023-03-06 14:27:22,979][04275] Decorrelating experience for 32 frames... [2023-03-06 14:27:22,981][04435] Decorrelating experience for 32 frames... [2023-03-06 14:27:22,983][04709] Decorrelating experience for 32 frames... [2023-03-06 14:27:22,986][04480] Decorrelating experience for 32 frames... [2023-03-06 14:27:22,988][04433] Decorrelating experience for 32 frames... [2023-03-06 14:27:22,992][04708] Decorrelating experience for 32 frames... [2023-03-06 14:27:22,995][04642] Decorrelating experience for 32 frames... [2023-03-06 14:27:22,997][04437] Decorrelating experience for 32 frames... [2023-03-06 14:27:23,010][04473] Decorrelating experience for 32 frames... [2023-03-06 14:27:23,015][04470] Decorrelating experience for 32 frames... [2023-03-06 14:27:23,020][04481] Decorrelating experience for 32 frames... [2023-03-06 14:27:23,025][04479] Decorrelating experience for 32 frames... [2023-03-06 14:27:23,028][04471] Decorrelating experience for 32 frames... [2023-03-06 14:27:23,029][04474] Decorrelating experience for 32 frames... [2023-03-06 14:27:23,032][04438] Decorrelating experience for 32 frames... [2023-03-06 14:27:23,033][04274] Decorrelating experience for 32 frames... [2023-03-06 14:27:23,033][04545] Decorrelating experience for 32 frames... [2023-03-06 14:27:23,079][04609] Decorrelating experience for 32 frames... [2023-03-06 14:27:23,503][04221] Signal inference workers to stop experience collection... [2023-03-06 14:27:23,507][04272] InferenceWorker_p0-w0: stopping experience collection [2023-03-06 14:27:23,937][04221] Signal inference workers to resume experience collection... 
[2023-03-06 14:27:23,938][04272] InferenceWorker_p0-w0: resuming experience collection [2023-03-06 14:27:23,944][03942] Fps is (10 sec: 204.6, 60 sec: 204.6, 300 sec: 204.6). Total num frames: 1024. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-06 14:27:25,093][04272] Updated weights for policy 0, policy_version 10 (0.0225) [2023-03-06 14:27:25,896][04272] Updated weights for policy 0, policy_version 20 (0.0008) [2023-03-06 14:27:26,717][04272] Updated weights for policy 0, policy_version 30 (0.0006) [2023-03-06 14:27:27,522][04272] Updated weights for policy 0, policy_version 40 (0.0007) [2023-03-06 14:27:28,324][04272] Updated weights for policy 0, policy_version 50 (0.0006) [2023-03-06 14:27:28,941][03942] Fps is (10 sec: 5836.9, 60 sec: 5836.9, 300 sec: 5836.9). Total num frames: 58368. Throughput: 0: 3777.9. Samples: 37778. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:27:28,941][03942] Avg episode reward: [(0, '205.089')] [2023-03-06 14:27:29,110][04272] Updated weights for policy 0, policy_version 60 (0.0006) [2023-03-06 14:27:29,917][04272] Updated weights for policy 0, policy_version 70 (0.0007) [2023-03-06 14:27:30,713][04272] Updated weights for policy 0, policy_version 80 (0.0007) [2023-03-06 14:27:31,514][04272] Updated weights for policy 0, policy_version 90 (0.0006) [2023-03-06 14:27:32,313][04272] Updated weights for policy 0, policy_version 100 (0.0007) [2023-03-06 14:27:32,439][03942] Heartbeat connected on Batcher_0 [2023-03-06 14:27:32,441][03942] Heartbeat connected on LearnerWorker_p0 [2023-03-06 14:27:32,446][03942] Heartbeat connected on RolloutWorker_w0 [2023-03-06 14:27:32,447][03942] Heartbeat connected on InferenceWorker_p0-w0 [2023-03-06 14:27:32,448][03942] Heartbeat connected on RolloutWorker_w1 [2023-03-06 14:27:32,450][03942] Heartbeat connected on RolloutWorker_w2 [2023-03-06 14:27:32,452][03942] Heartbeat connected on RolloutWorker_w3 [2023-03-06 14:27:32,453][03942] 
Heartbeat connected on RolloutWorker_w4 [2023-03-06 14:27:32,457][03942] Heartbeat connected on RolloutWorker_w6 [2023-03-06 14:27:32,457][03942] Heartbeat connected on RolloutWorker_w5 [2023-03-06 14:27:32,459][03942] Heartbeat connected on RolloutWorker_w7 [2023-03-06 14:27:32,462][03942] Heartbeat connected on RolloutWorker_w9 [2023-03-06 14:27:32,464][03942] Heartbeat connected on RolloutWorker_w10 [2023-03-06 14:27:32,469][03942] Heartbeat connected on RolloutWorker_w8 [2023-03-06 14:27:32,485][03942] Heartbeat connected on RolloutWorker_w11 [2023-03-06 14:27:32,487][03942] Heartbeat connected on RolloutWorker_w12 [2023-03-06 14:27:32,490][03942] Heartbeat connected on RolloutWorker_w13 [2023-03-06 14:27:32,490][03942] Heartbeat connected on RolloutWorker_w14 [2023-03-06 14:27:32,492][03942] Heartbeat connected on RolloutWorker_w15 [2023-03-06 14:27:32,495][03942] Heartbeat connected on RolloutWorker_w16 [2023-03-06 14:27:32,496][03942] Heartbeat connected on RolloutWorker_w17 [2023-03-06 14:27:32,497][03942] Heartbeat connected on RolloutWorker_w18 [2023-03-06 14:27:32,500][03942] Heartbeat connected on RolloutWorker_w19 [2023-03-06 14:27:32,502][03942] Heartbeat connected on RolloutWorker_w20 [2023-03-06 14:27:32,504][03942] Heartbeat connected on RolloutWorker_w21 [2023-03-06 14:27:32,505][03942] Heartbeat connected on RolloutWorker_w22 [2023-03-06 14:27:32,507][03942] Heartbeat connected on RolloutWorker_w23 [2023-03-06 14:27:32,509][03942] Heartbeat connected on RolloutWorker_w24 [2023-03-06 14:27:32,510][03942] Heartbeat connected on RolloutWorker_w25 [2023-03-06 14:27:32,512][03942] Heartbeat connected on RolloutWorker_w26 [2023-03-06 14:27:32,514][03942] Heartbeat connected on RolloutWorker_w27 [2023-03-06 14:27:32,517][03942] Heartbeat connected on RolloutWorker_w28 [2023-03-06 14:27:32,517][03942] Heartbeat connected on RolloutWorker_w29 [2023-03-06 14:27:32,521][03942] Heartbeat connected on RolloutWorker_w31 [2023-03-06 14:27:32,525][03942] 
Heartbeat connected on RolloutWorker_w30 [2023-03-06 14:27:33,095][04272] Updated weights for policy 0, policy_version 110 (0.0006) [2023-03-06 14:27:33,885][04272] Updated weights for policy 0, policy_version 120 (0.0006) [2023-03-06 14:27:33,940][03942] Fps is (10 sec: 12190.4, 60 sec: 8192.1, 300 sec: 8192.1). Total num frames: 122880. Throughput: 0: 7674.0. Samples: 115108. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:27:33,941][03942] Avg episode reward: [(0, '212.602')] [2023-03-06 14:27:33,950][04221] Saving new best policy, reward=212.602! [2023-03-06 14:27:34,697][04272] Updated weights for policy 0, policy_version 130 (0.0006) [2023-03-06 14:27:35,485][04272] Updated weights for policy 0, policy_version 140 (0.0007) [2023-03-06 14:27:36,284][04272] Updated weights for policy 0, policy_version 150 (0.0007) [2023-03-06 14:27:37,100][04272] Updated weights for policy 0, policy_version 160 (0.0006) [2023-03-06 14:27:37,896][04272] Updated weights for policy 0, policy_version 170 (0.0006) [2023-03-06 14:27:38,694][04272] Updated weights for policy 0, policy_version 180 (0.0006) [2023-03-06 14:27:38,941][03942] Fps is (10 sec: 12902.4, 60 sec: 9369.7, 300 sec: 9369.7). Total num frames: 187392. Throughput: 0: 7675.8. Samples: 153515. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:27:38,941][03942] Avg episode reward: [(0, '203.005')] [2023-03-06 14:27:39,528][04272] Updated weights for policy 0, policy_version 190 (0.0006) [2023-03-06 14:27:40,308][04272] Updated weights for policy 0, policy_version 200 (0.0008) [2023-03-06 14:27:41,089][04272] Updated weights for policy 0, policy_version 210 (0.0006) [2023-03-06 14:27:41,913][04272] Updated weights for policy 0, policy_version 220 (0.0006) [2023-03-06 14:27:42,699][04272] Updated weights for policy 0, policy_version 230 (0.0007) [2023-03-06 14:27:43,491][04272] Updated weights for policy 0, policy_version 240 (0.0007) [2023-03-06 14:27:43,941][03942] Fps is (10 sec: 12799.9, 60 sec: 10035.3, 300 sec: 10035.3). Total num frames: 250880. Throughput: 0: 9198.7. Samples: 229967. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:27:43,941][03942] Avg episode reward: [(0, '191.731')] [2023-03-06 14:27:44,328][04272] Updated weights for policy 0, policy_version 250 (0.0006) [2023-03-06 14:27:45,113][04272] Updated weights for policy 0, policy_version 260 (0.0007) [2023-03-06 14:27:45,905][04272] Updated weights for policy 0, policy_version 270 (0.0008) [2023-03-06 14:27:46,744][04272] Updated weights for policy 0, policy_version 280 (0.0006) [2023-03-06 14:27:47,534][04272] Updated weights for policy 0, policy_version 290 (0.0006) [2023-03-06 14:27:48,340][04272] Updated weights for policy 0, policy_version 300 (0.0006) [2023-03-06 14:27:48,941][03942] Fps is (10 sec: 12697.5, 60 sec: 10479.0, 300 sec: 10479.0). Total num frames: 314368. Throughput: 0: 10215.5. Samples: 306463. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:27:48,941][03942] Avg episode reward: [(0, '187.223')] [2023-03-06 14:27:49,133][04272] Updated weights for policy 0, policy_version 310 (0.0006) [2023-03-06 14:27:49,950][04272] Updated weights for policy 0, policy_version 320 (0.0006) [2023-03-06 14:27:50,742][04272] Updated weights for policy 0, policy_version 330 (0.0006) [2023-03-06 14:27:51,554][04272] Updated weights for policy 0, policy_version 340 (0.0006) [2023-03-06 14:27:52,363][04272] Updated weights for policy 0, policy_version 350 (0.0006) [2023-03-06 14:27:53,148][04272] Updated weights for policy 0, policy_version 360 (0.0006) [2023-03-06 14:27:53,941][03942] Fps is (10 sec: 12697.6, 60 sec: 10795.9, 300 sec: 10795.9). Total num frames: 377856. Throughput: 0: 9853.5. Samples: 344871. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:27:53,941][03942] Avg episode reward: [(0, '175.926')] [2023-03-06 14:27:53,946][04272] Updated weights for policy 0, policy_version 370 (0.0006) [2023-03-06 14:27:54,771][04272] Updated weights for policy 0, policy_version 380 (0.0007) [2023-03-06 14:27:55,563][04272] Updated weights for policy 0, policy_version 390 (0.0007) [2023-03-06 14:27:56,377][04272] Updated weights for policy 0, policy_version 400 (0.0006) [2023-03-06 14:27:57,191][04272] Updated weights for policy 0, policy_version 410 (0.0006) [2023-03-06 14:27:57,998][04272] Updated weights for policy 0, policy_version 420 (0.0006) [2023-03-06 14:27:58,792][04272] Updated weights for policy 0, policy_version 430 (0.0006) [2023-03-06 14:27:58,941][03942] Fps is (10 sec: 12697.6, 60 sec: 11033.6, 300 sec: 11033.6). Total num frames: 441344. Throughput: 0: 10520.6. Samples: 420823. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:27:58,941][03942] Avg episode reward: [(0, '190.688')] [2023-03-06 14:27:59,622][04272] Updated weights for policy 0, policy_version 440 (0.0006) [2023-03-06 14:28:00,412][04272] Updated weights for policy 0, policy_version 450 (0.0007) [2023-03-06 14:28:01,234][04272] Updated weights for policy 0, policy_version 460 (0.0007) [2023-03-06 14:28:02,049][04272] Updated weights for policy 0, policy_version 470 (0.0006) [2023-03-06 14:28:02,851][04272] Updated weights for policy 0, policy_version 480 (0.0006) [2023-03-06 14:28:03,645][04272] Updated weights for policy 0, policy_version 490 (0.0006) [2023-03-06 14:28:03,940][03942] Fps is (10 sec: 12697.8, 60 sec: 11218.6, 300 sec: 11218.6). Total num frames: 504832. Throughput: 0: 11045.8. Samples: 497060. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:28:03,941][03942] Avg episode reward: [(0, '214.278')] [2023-03-06 14:28:03,941][04221] Saving new best policy, reward=214.278! [2023-03-06 14:28:04,474][04272] Updated weights for policy 0, policy_version 500 (0.0006) [2023-03-06 14:28:05,269][04272] Updated weights for policy 0, policy_version 510 (0.0006) [2023-03-06 14:28:06,061][04272] Updated weights for policy 0, policy_version 520 (0.0007) [2023-03-06 14:28:06,880][04272] Updated weights for policy 0, policy_version 530 (0.0007) [2023-03-06 14:28:07,691][04272] Updated weights for policy 0, policy_version 540 (0.0006) [2023-03-06 14:28:08,491][04272] Updated weights for policy 0, policy_version 550 (0.0006) [2023-03-06 14:28:08,941][03942] Fps is (10 sec: 12697.6, 60 sec: 11366.4, 300 sec: 11366.4). Total num frames: 568320. Throughput: 0: 11893.7. Samples: 535171. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:28:08,941][03942] Avg episode reward: [(0, '194.379')] [2023-03-06 14:28:09,297][04272] Updated weights for policy 0, policy_version 560 (0.0006) [2023-03-06 14:28:10,114][04272] Updated weights for policy 0, policy_version 570 (0.0007) [2023-03-06 14:28:10,923][04272] Updated weights for policy 0, policy_version 580 (0.0007) [2023-03-06 14:28:11,741][04272] Updated weights for policy 0, policy_version 590 (0.0006) [2023-03-06 14:28:12,540][04272] Updated weights for policy 0, policy_version 600 (0.0006) [2023-03-06 14:28:13,355][04272] Updated weights for policy 0, policy_version 610 (0.0006) [2023-03-06 14:28:13,941][03942] Fps is (10 sec: 12697.4, 60 sec: 11487.4, 300 sec: 11487.4). Total num frames: 631808. Throughput: 0: 12735.2. Samples: 610861. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-06 14:28:13,941][03942] Avg episode reward: [(0, '235.860')] [2023-03-06 14:28:13,942][04221] Saving new best policy, reward=235.860! [2023-03-06 14:28:14,172][04272] Updated weights for policy 0, policy_version 620 (0.0006) [2023-03-06 14:28:14,974][04272] Updated weights for policy 0, policy_version 630 (0.0006) [2023-03-06 14:28:15,790][04272] Updated weights for policy 0, policy_version 640 (0.0006) [2023-03-06 14:28:16,577][04272] Updated weights for policy 0, policy_version 650 (0.0006) [2023-03-06 14:28:17,397][04272] Updated weights for policy 0, policy_version 660 (0.0006) [2023-03-06 14:28:18,189][04272] Updated weights for policy 0, policy_version 670 (0.0006) [2023-03-06 14:28:18,941][03942] Fps is (10 sec: 12697.6, 60 sec: 11588.3, 300 sec: 11588.3). Total num frames: 695296. Throughput: 0: 12712.1. Samples: 687152. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:28:18,941][03942] Avg episode reward: [(0, '235.128')] [2023-03-06 14:28:18,993][04272] Updated weights for policy 0, policy_version 680 (0.0006) [2023-03-06 14:28:19,823][04272] Updated weights for policy 0, policy_version 690 (0.0006) [2023-03-06 14:28:20,623][04272] Updated weights for policy 0, policy_version 700 (0.0006) [2023-03-06 14:28:21,417][04272] Updated weights for policy 0, policy_version 710 (0.0006) [2023-03-06 14:28:22,238][04272] Updated weights for policy 0, policy_version 720 (0.0007) [2023-03-06 14:28:23,037][04272] Updated weights for policy 0, policy_version 730 (0.0006) [2023-03-06 14:28:23,847][04272] Updated weights for policy 0, policy_version 740 (0.0007) [2023-03-06 14:28:23,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12630.2, 300 sec: 11673.6). Total num frames: 758784. Throughput: 0: 12705.4. Samples: 725257. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:28:23,941][03942] Avg episode reward: [(0, '216.085')] [2023-03-06 14:28:24,685][04272] Updated weights for policy 0, policy_version 750 (0.0007) [2023-03-06 14:28:25,482][04272] Updated weights for policy 0, policy_version 760 (0.0006) [2023-03-06 14:28:26,284][04272] Updated weights for policy 0, policy_version 770 (0.0006) [2023-03-06 14:28:27,096][04272] Updated weights for policy 0, policy_version 780 (0.0007) [2023-03-06 14:28:27,907][04272] Updated weights for policy 0, policy_version 790 (0.0007) [2023-03-06 14:28:28,698][04272] Updated weights for policy 0, policy_version 800 (0.0006) [2023-03-06 14:28:28,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12731.7, 300 sec: 11746.8). Total num frames: 822272. Throughput: 0: 12694.1. Samples: 801201. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:28:28,941][03942] Avg episode reward: [(0, '207.168')] [2023-03-06 14:28:29,491][04272] Updated weights for policy 0, policy_version 810 (0.0007) [2023-03-06 14:28:30,297][04272] Updated weights for policy 0, policy_version 820 (0.0006) [2023-03-06 14:28:31,131][04272] Updated weights for policy 0, policy_version 830 (0.0007) [2023-03-06 14:28:31,925][04272] Updated weights for policy 0, policy_version 840 (0.0006) [2023-03-06 14:28:32,744][04272] Updated weights for policy 0, policy_version 850 (0.0006) [2023-03-06 14:28:33,553][04272] Updated weights for policy 0, policy_version 860 (0.0006) [2023-03-06 14:28:33,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 11810.2). Total num frames: 885760. Throughput: 0: 12684.6. Samples: 877268. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 14:28:33,941][03942] Avg episode reward: [(0, '263.134')] [2023-03-06 14:28:33,942][04221] Saving new best policy, reward=263.134! [2023-03-06 14:28:34,366][04272] Updated weights for policy 0, policy_version 870 (0.0007) [2023-03-06 14:28:35,167][04272] Updated weights for policy 0, policy_version 880 (0.0006) [2023-03-06 14:28:35,969][04272] Updated weights for policy 0, policy_version 890 (0.0006) [2023-03-06 14:28:36,777][04272] Updated weights for policy 0, policy_version 900 (0.0006) [2023-03-06 14:28:37,595][04272] Updated weights for policy 0, policy_version 910 (0.0006) [2023-03-06 14:28:38,408][04272] Updated weights for policy 0, policy_version 920 (0.0006) [2023-03-06 14:28:38,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12680.5, 300 sec: 11852.8). Total num frames: 948224. Throughput: 0: 12677.2. Samples: 915346. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:28:38,941][03942] Avg episode reward: [(0, '223.557')] [2023-03-06 14:28:39,203][04272] Updated weights for policy 0, policy_version 930 (0.0006) [2023-03-06 14:28:40,022][04272] Updated weights for policy 0, policy_version 940 (0.0006) [2023-03-06 14:28:40,830][04272] Updated weights for policy 0, policy_version 950 (0.0006) [2023-03-06 14:28:41,637][04272] Updated weights for policy 0, policy_version 960 (0.0006) [2023-03-06 14:28:42,438][04272] Updated weights for policy 0, policy_version 970 (0.0006) [2023-03-06 14:28:43,233][04272] Updated weights for policy 0, policy_version 980 (0.0006) [2023-03-06 14:28:43,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12680.5, 300 sec: 11902.5). Total num frames: 1011712. Throughput: 0: 12677.5. Samples: 991310. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:28:43,941][03942] Avg episode reward: [(0, '225.323')] [2023-03-06 14:28:44,032][04272] Updated weights for policy 0, policy_version 990 (0.0007) [2023-03-06 14:28:44,846][04272] Updated weights for policy 0, policy_version 1000 (0.0006) [2023-03-06 14:28:45,669][04272] Updated weights for policy 0, policy_version 1010 (0.0006) [2023-03-06 14:28:46,458][04272] Updated weights for policy 0, policy_version 1020 (0.0006) [2023-03-06 14:28:47,274][04272] Updated weights for policy 0, policy_version 1030 (0.0007) [2023-03-06 14:28:48,082][04272] Updated weights for policy 0, policy_version 1040 (0.0006) [2023-03-06 14:28:48,882][04272] Updated weights for policy 0, policy_version 1050 (0.0006) [2023-03-06 14:28:48,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 11946.7). Total num frames: 1075200. Throughput: 0: 12677.2. Samples: 1067537. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:28:48,941][03942] Avg episode reward: [(0, '278.224')] [2023-03-06 14:28:48,944][04221] Saving new best policy, reward=278.224! 
[2023-03-06 14:28:49,693][04272] Updated weights for policy 0, policy_version 1060 (0.0007) [2023-03-06 14:28:50,502][04272] Updated weights for policy 0, policy_version 1070 (0.0006) [2023-03-06 14:28:51,293][04272] Updated weights for policy 0, policy_version 1080 (0.0006) [2023-03-06 14:28:52,129][04272] Updated weights for policy 0, policy_version 1090 (0.0006) [2023-03-06 14:28:52,928][04272] Updated weights for policy 0, policy_version 1100 (0.0006) [2023-03-06 14:28:53,716][04272] Updated weights for policy 0, policy_version 1110 (0.0006) [2023-03-06 14:28:53,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 11986.2). Total num frames: 1138688. Throughput: 0: 12676.5. Samples: 1105613. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:28:53,941][03942] Avg episode reward: [(0, '276.542')] [2023-03-06 14:28:54,537][04272] Updated weights for policy 0, policy_version 1120 (0.0006) [2023-03-06 14:28:55,349][04272] Updated weights for policy 0, policy_version 1130 (0.0006) [2023-03-06 14:28:56,153][04272] Updated weights for policy 0, policy_version 1140 (0.0007) [2023-03-06 14:28:56,977][04272] Updated weights for policy 0, policy_version 1150 (0.0006) [2023-03-06 14:28:57,778][04272] Updated weights for policy 0, policy_version 1160 (0.0006) [2023-03-06 14:28:58,611][04272] Updated weights for policy 0, policy_version 1170 (0.0006) [2023-03-06 14:28:58,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12021.8). Total num frames: 1202176. Throughput: 0: 12680.4. Samples: 1181480. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:28:58,941][03942] Avg episode reward: [(0, '241.037')] [2023-03-06 14:28:59,406][04272] Updated weights for policy 0, policy_version 1180 (0.0007) [2023-03-06 14:29:00,221][04272] Updated weights for policy 0, policy_version 1190 (0.0006) [2023-03-06 14:29:01,021][04272] Updated weights for policy 0, policy_version 1200 (0.0007) [2023-03-06 14:29:01,826][04272] Updated weights for policy 0, policy_version 1210 (0.0006) [2023-03-06 14:29:02,633][04272] Updated weights for policy 0, policy_version 1220 (0.0006) [2023-03-06 14:29:03,453][04272] Updated weights for policy 0, policy_version 1230 (0.0006) [2023-03-06 14:29:03,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12680.5, 300 sec: 12054.0). Total num frames: 1265664. Throughput: 0: 12673.5. Samples: 1257458. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:29:03,941][03942] Avg episode reward: [(0, '319.884')] [2023-03-06 14:29:03,941][04221] Saving new best policy, reward=319.884! [2023-03-06 14:29:04,271][04272] Updated weights for policy 0, policy_version 1240 (0.0006) [2023-03-06 14:29:05,061][04272] Updated weights for policy 0, policy_version 1250 (0.0006) [2023-03-06 14:29:05,878][04272] Updated weights for policy 0, policy_version 1260 (0.0006) [2023-03-06 14:29:06,698][04272] Updated weights for policy 0, policy_version 1270 (0.0006) [2023-03-06 14:29:07,516][04272] Updated weights for policy 0, policy_version 1280 (0.0007) [2023-03-06 14:29:08,302][04272] Updated weights for policy 0, policy_version 1290 (0.0006) [2023-03-06 14:29:08,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12663.5, 300 sec: 12073.9). Total num frames: 1328128. Throughput: 0: 12668.3. Samples: 1295332. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 14:29:08,941][03942] Avg episode reward: [(0, '365.968')] [2023-03-06 14:29:08,945][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000001297_1328128.pth... [2023-03-06 14:29:08,975][04221] Saving new best policy, reward=365.968! [2023-03-06 14:29:09,138][04272] Updated weights for policy 0, policy_version 1300 (0.0007) [2023-03-06 14:29:09,934][04272] Updated weights for policy 0, policy_version 1310 (0.0007) [2023-03-06 14:29:10,741][04272] Updated weights for policy 0, policy_version 1320 (0.0006) [2023-03-06 14:29:11,556][04272] Updated weights for policy 0, policy_version 1330 (0.0006) [2023-03-06 14:29:12,359][04272] Updated weights for policy 0, policy_version 1340 (0.0006) [2023-03-06 14:29:13,175][04272] Updated weights for policy 0, policy_version 1350 (0.0006) [2023-03-06 14:29:13,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12663.5, 300 sec: 12101.0). Total num frames: 1391616. Throughput: 0: 12665.0. Samples: 1371127. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:29:13,941][03942] Avg episode reward: [(0, '400.479')] [2023-03-06 14:29:13,942][04221] Saving new best policy, reward=400.479! [2023-03-06 14:29:13,998][04272] Updated weights for policy 0, policy_version 1360 (0.0007) [2023-03-06 14:29:14,780][04272] Updated weights for policy 0, policy_version 1370 (0.0006) [2023-03-06 14:29:15,609][04272] Updated weights for policy 0, policy_version 1380 (0.0006) [2023-03-06 14:29:16,414][04272] Updated weights for policy 0, policy_version 1390 (0.0007) [2023-03-06 14:29:17,212][04272] Updated weights for policy 0, policy_version 1400 (0.0007) [2023-03-06 14:29:18,025][04272] Updated weights for policy 0, policy_version 1410 (0.0006) [2023-03-06 14:29:18,833][04272] Updated weights for policy 0, policy_version 1420 (0.0006) [2023-03-06 14:29:18,941][03942] Fps is (10 sec: 12697.7, 60 sec: 12663.5, 300 sec: 12125.9). 
Total num frames: 1455104. Throughput: 0: 12667.6. Samples: 1447310. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:29:18,941][03942] Avg episode reward: [(0, '405.417')] [2023-03-06 14:29:18,944][04221] Saving new best policy, reward=405.417! [2023-03-06 14:29:19,645][04272] Updated weights for policy 0, policy_version 1430 (0.0006) [2023-03-06 14:29:20,434][04272] Updated weights for policy 0, policy_version 1440 (0.0007) [2023-03-06 14:29:21,246][04272] Updated weights for policy 0, policy_version 1450 (0.0007) [2023-03-06 14:29:22,050][04272] Updated weights for policy 0, policy_version 1460 (0.0007) [2023-03-06 14:29:22,844][04272] Updated weights for policy 0, policy_version 1470 (0.0007) [2023-03-06 14:29:23,638][04272] Updated weights for policy 0, policy_version 1480 (0.0006) [2023-03-06 14:29:23,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12663.5, 300 sec: 12148.7). Total num frames: 1518592. Throughput: 0: 12667.5. Samples: 1485383. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:29:23,941][03942] Avg episode reward: [(0, '485.556')] [2023-03-06 14:29:23,942][04221] Saving new best policy, reward=485.556! [2023-03-06 14:29:24,462][04272] Updated weights for policy 0, policy_version 1490 (0.0007) [2023-03-06 14:29:25,271][04272] Updated weights for policy 0, policy_version 1500 (0.0007) [2023-03-06 14:29:26,078][04272] Updated weights for policy 0, policy_version 1510 (0.0006) [2023-03-06 14:29:26,890][04272] Updated weights for policy 0, policy_version 1520 (0.0006) [2023-03-06 14:29:27,712][04272] Updated weights for policy 0, policy_version 1530 (0.0006) [2023-03-06 14:29:28,509][04272] Updated weights for policy 0, policy_version 1540 (0.0006) [2023-03-06 14:29:28,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12663.5, 300 sec: 12169.9). Total num frames: 1582080. Throughput: 0: 12669.1. Samples: 1561421. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:29:28,941][03942] Avg episode reward: [(0, '504.382')] [2023-03-06 14:29:28,945][04221] Saving new best policy, reward=504.382! [2023-03-06 14:29:29,321][04272] Updated weights for policy 0, policy_version 1550 (0.0007) [2023-03-06 14:29:30,136][04272] Updated weights for policy 0, policy_version 1560 (0.0007) [2023-03-06 14:29:30,921][04272] Updated weights for policy 0, policy_version 1570 (0.0007) [2023-03-06 14:29:31,729][04272] Updated weights for policy 0, policy_version 1580 (0.0007) [2023-03-06 14:29:32,542][04272] Updated weights for policy 0, policy_version 1590 (0.0006) [2023-03-06 14:29:33,345][04272] Updated weights for policy 0, policy_version 1600 (0.0006) [2023-03-06 14:29:33,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12663.5, 300 sec: 12189.4). Total num frames: 1645568. Throughput: 0: 12663.8. Samples: 1637407. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:29:33,941][03942] Avg episode reward: [(0, '536.411')] [2023-03-06 14:29:33,942][04221] Saving new best policy, reward=536.411! [2023-03-06 14:29:34,166][04272] Updated weights for policy 0, policy_version 1610 (0.0007) [2023-03-06 14:29:34,978][04272] Updated weights for policy 0, policy_version 1620 (0.0006) [2023-03-06 14:29:35,783][04272] Updated weights for policy 0, policy_version 1630 (0.0006) [2023-03-06 14:29:36,586][04272] Updated weights for policy 0, policy_version 1640 (0.0007) [2023-03-06 14:29:37,410][04272] Updated weights for policy 0, policy_version 1650 (0.0006) [2023-03-06 14:29:38,217][04272] Updated weights for policy 0, policy_version 1660 (0.0006) [2023-03-06 14:29:38,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12207.6). Total num frames: 1709056. Throughput: 0: 12663.8. Samples: 1675482. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:29:38,941][03942] Avg episode reward: [(0, '553.346')] [2023-03-06 14:29:38,946][04221] Saving new best policy, reward=553.346! 
[2023-03-06 14:29:39,002][04272] Updated weights for policy 0, policy_version 1670 (0.0006) [2023-03-06 14:29:39,845][04272] Updated weights for policy 0, policy_version 1680 (0.0006) [2023-03-06 14:29:40,657][04272] Updated weights for policy 0, policy_version 1690 (0.0006) [2023-03-06 14:29:41,448][04272] Updated weights for policy 0, policy_version 1700 (0.0006) [2023-03-06 14:29:42,280][04272] Updated weights for policy 0, policy_version 1710 (0.0006) [2023-03-06 14:29:43,093][04272] Updated weights for policy 0, policy_version 1720 (0.0006) [2023-03-06 14:29:43,892][04272] Updated weights for policy 0, policy_version 1730 (0.0006) [2023-03-06 14:29:43,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12663.5, 300 sec: 12217.4). Total num frames: 1771520. Throughput: 0: 12658.6. Samples: 1751119. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:29:43,941][03942] Avg episode reward: [(0, '588.488')] [2023-03-06 14:29:43,942][04221] Saving new best policy, reward=588.488! [2023-03-06 14:29:44,712][04272] Updated weights for policy 0, policy_version 1740 (0.0007) [2023-03-06 14:29:45,534][04272] Updated weights for policy 0, policy_version 1750 (0.0007) [2023-03-06 14:29:46,351][04272] Updated weights for policy 0, policy_version 1760 (0.0007) [2023-03-06 14:29:47,157][04272] Updated weights for policy 0, policy_version 1770 (0.0006) [2023-03-06 14:29:47,984][04272] Updated weights for policy 0, policy_version 1780 (0.0006) [2023-03-06 14:29:48,789][04272] Updated weights for policy 0, policy_version 1790 (0.0006) [2023-03-06 14:29:48,941][03942] Fps is (10 sec: 12492.8, 60 sec: 12646.4, 300 sec: 12226.6). Total num frames: 1833984. Throughput: 0: 12642.7. Samples: 1826379. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:29:48,941][03942] Avg episode reward: [(0, '602.319')] [2023-03-06 14:29:48,947][04221] Saving new best policy, reward=602.319! 
[2023-03-06 14:29:49,584][04272] Updated weights for policy 0, policy_version 1800 (0.0006) [2023-03-06 14:29:50,416][04272] Updated weights for policy 0, policy_version 1810 (0.0006) [2023-03-06 14:29:51,218][04272] Updated weights for policy 0, policy_version 1820 (0.0007) [2023-03-06 14:29:52,050][04272] Updated weights for policy 0, policy_version 1830 (0.0006) [2023-03-06 14:29:52,876][04272] Updated weights for policy 0, policy_version 1840 (0.0006) [2023-03-06 14:29:53,699][04272] Updated weights for policy 0, policy_version 1850 (0.0007) [2023-03-06 14:29:53,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12646.4, 300 sec: 12241.8). Total num frames: 1897472. Throughput: 0: 12645.4. Samples: 1864374. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:29:53,941][03942] Avg episode reward: [(0, '576.658')] [2023-03-06 14:29:54,498][04272] Updated weights for policy 0, policy_version 1860 (0.0007) [2023-03-06 14:29:55,313][04272] Updated weights for policy 0, policy_version 1870 (0.0006) [2023-03-06 14:29:56,119][04272] Updated weights for policy 0, policy_version 1880 (0.0006) [2023-03-06 14:29:56,922][04272] Updated weights for policy 0, policy_version 1890 (0.0007) [2023-03-06 14:29:57,720][04272] Updated weights for policy 0, policy_version 1900 (0.0006) [2023-03-06 14:29:58,532][04272] Updated weights for policy 0, policy_version 1910 (0.0006) [2023-03-06 14:29:58,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12646.4, 300 sec: 12256.0). Total num frames: 1960960. Throughput: 0: 12639.6. Samples: 1939910. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:29:58,941][03942] Avg episode reward: [(0, '549.063')] [2023-03-06 14:29:59,335][04272] Updated weights for policy 0, policy_version 1920 (0.0006) [2023-03-06 14:30:00,149][04272] Updated weights for policy 0, policy_version 1930 (0.0006) [2023-03-06 14:30:00,966][04272] Updated weights for policy 0, policy_version 1940 (0.0006) [2023-03-06 14:30:01,772][04272] Updated weights for policy 0, policy_version 1950 (0.0006) [2023-03-06 14:30:02,586][04272] Updated weights for policy 0, policy_version 1960 (0.0006) [2023-03-06 14:30:03,387][04272] Updated weights for policy 0, policy_version 1970 (0.0007) [2023-03-06 14:30:03,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12263.2). Total num frames: 2023424. Throughput: 0: 12634.0. Samples: 2015841. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:30:03,941][03942] Avg episode reward: [(0, '554.816')] [2023-03-06 14:30:04,195][04272] Updated weights for policy 0, policy_version 1980 (0.0006) [2023-03-06 14:30:05,022][04272] Updated weights for policy 0, policy_version 1990 (0.0006) [2023-03-06 14:30:05,818][04272] Updated weights for policy 0, policy_version 2000 (0.0007) [2023-03-06 14:30:06,625][04272] Updated weights for policy 0, policy_version 2010 (0.0006) [2023-03-06 14:30:07,433][04272] Updated weights for policy 0, policy_version 2020 (0.0006) [2023-03-06 14:30:08,247][04272] Updated weights for policy 0, policy_version 2030 (0.0007) [2023-03-06 14:30:08,941][03942] Fps is (10 sec: 12595.0, 60 sec: 12646.4, 300 sec: 12276.0). Total num frames: 2086912. Throughput: 0: 12634.1. Samples: 2053919. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:30:08,941][03942] Avg episode reward: [(0, '558.870')] [2023-03-06 14:30:09,066][04272] Updated weights for policy 0, policy_version 2040 (0.0006) [2023-03-06 14:30:09,882][04272] Updated weights for policy 0, policy_version 2050 (0.0006) [2023-03-06 14:30:10,692][04272] Updated weights for policy 0, policy_version 2060 (0.0007) [2023-03-06 14:30:11,499][04272] Updated weights for policy 0, policy_version 2070 (0.0007) [2023-03-06 14:30:12,309][04272] Updated weights for policy 0, policy_version 2080 (0.0006) [2023-03-06 14:30:13,099][04272] Updated weights for policy 0, policy_version 2090 (0.0006) [2023-03-06 14:30:13,906][04272] Updated weights for policy 0, policy_version 2100 (0.0007) [2023-03-06 14:30:13,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12646.4, 300 sec: 12288.0). Total num frames: 2150400. Throughput: 0: 12628.7. Samples: 2129712. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:30:13,941][03942] Avg episode reward: [(0, '543.871')] [2023-03-06 14:30:14,712][04272] Updated weights for policy 0, policy_version 2110 (0.0006) [2023-03-06 14:30:15,531][04272] Updated weights for policy 0, policy_version 2120 (0.0007) [2023-03-06 14:30:16,337][04272] Updated weights for policy 0, policy_version 2130 (0.0006) [2023-03-06 14:30:17,126][04272] Updated weights for policy 0, policy_version 2140 (0.0006) [2023-03-06 14:30:17,922][04272] Updated weights for policy 0, policy_version 2150 (0.0006) [2023-03-06 14:30:18,741][04272] Updated weights for policy 0, policy_version 2160 (0.0006) [2023-03-06 14:30:18,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12646.4, 300 sec: 12299.4). Total num frames: 2213888. Throughput: 0: 12635.1. Samples: 2205989. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:30:18,941][03942] Avg episode reward: [(0, '566.643')] [2023-03-06 14:30:19,550][04272] Updated weights for policy 0, policy_version 2170 (0.0007) [2023-03-06 14:30:20,367][04272] Updated weights for policy 0, policy_version 2180 (0.0007) [2023-03-06 14:30:21,173][04272] Updated weights for policy 0, policy_version 2190 (0.0006) [2023-03-06 14:30:21,990][04272] Updated weights for policy 0, policy_version 2200 (0.0006) [2023-03-06 14:30:22,805][04272] Updated weights for policy 0, policy_version 2210 (0.0007) [2023-03-06 14:30:23,623][04272] Updated weights for policy 0, policy_version 2220 (0.0006) [2023-03-06 14:30:23,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12629.3, 300 sec: 12304.6). Total num frames: 2276352. Throughput: 0: 12632.2. Samples: 2243933. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:30:23,941][03942] Avg episode reward: [(0, '526.488')] [2023-03-06 14:30:24,434][04272] Updated weights for policy 0, policy_version 2230 (0.0006) [2023-03-06 14:30:25,241][04272] Updated weights for policy 0, policy_version 2240 (0.0006) [2023-03-06 14:30:26,043][04272] Updated weights for policy 0, policy_version 2250 (0.0006) [2023-03-06 14:30:26,846][04272] Updated weights for policy 0, policy_version 2260 (0.0006) [2023-03-06 14:30:27,665][04272] Updated weights for policy 0, policy_version 2270 (0.0007) [2023-03-06 14:30:28,453][04272] Updated weights for policy 0, policy_version 2280 (0.0007) [2023-03-06 14:30:28,941][03942] Fps is (10 sec: 12595.4, 60 sec: 12629.3, 300 sec: 12315.0). Total num frames: 2339840. Throughput: 0: 12635.8. Samples: 2319730. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:30:28,941][03942] Avg episode reward: [(0, '521.201')] [2023-03-06 14:30:29,268][04272] Updated weights for policy 0, policy_version 2290 (0.0007) [2023-03-06 14:30:30,105][04272] Updated weights for policy 0, policy_version 2300 (0.0007) [2023-03-06 14:30:30,898][04272] Updated weights for policy 0, policy_version 2310 (0.0006) [2023-03-06 14:30:31,707][04272] Updated weights for policy 0, policy_version 2320 (0.0007) [2023-03-06 14:30:32,524][04272] Updated weights for policy 0, policy_version 2330 (0.0006) [2023-03-06 14:30:33,310][04272] Updated weights for policy 0, policy_version 2340 (0.0006) [2023-03-06 14:30:33,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12629.3, 300 sec: 12324.8). Total num frames: 2403328. Throughput: 0: 12650.2. Samples: 2395638. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 14:30:33,941][03942] Avg episode reward: [(0, '510.170')] [2023-03-06 14:30:34,127][04272] Updated weights for policy 0, policy_version 2350 (0.0007) [2023-03-06 14:30:34,961][04272] Updated weights for policy 0, policy_version 2360 (0.0006) [2023-03-06 14:30:35,772][04272] Updated weights for policy 0, policy_version 2370 (0.0007) [2023-03-06 14:30:36,581][04272] Updated weights for policy 0, policy_version 2380 (0.0006) [2023-03-06 14:30:37,395][04272] Updated weights for policy 0, policy_version 2390 (0.0006) [2023-03-06 14:30:38,197][04272] Updated weights for policy 0, policy_version 2400 (0.0007) [2023-03-06 14:30:38,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12629.3, 300 sec: 12334.1). Total num frames: 2466816. Throughput: 0: 12640.6. Samples: 2433200. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:30:38,941][03942] Avg episode reward: [(0, '545.550')] [2023-03-06 14:30:39,012][04272] Updated weights for policy 0, policy_version 2410 (0.0006) [2023-03-06 14:30:39,822][04272] Updated weights for policy 0, policy_version 2420 (0.0006) [2023-03-06 14:30:40,621][04272] Updated weights for policy 0, policy_version 2430 (0.0006) [2023-03-06 14:30:41,444][04272] Updated weights for policy 0, policy_version 2440 (0.0006) [2023-03-06 14:30:42,249][04272] Updated weights for policy 0, policy_version 2450 (0.0006) [2023-03-06 14:30:43,051][04272] Updated weights for policy 0, policy_version 2460 (0.0006) [2023-03-06 14:30:43,836][04272] Updated weights for policy 0, policy_version 2470 (0.0006) [2023-03-06 14:30:43,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12646.4, 300 sec: 12343.0). Total num frames: 2530304. Throughput: 0: 12649.8. Samples: 2509152. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:30:43,941][03942] Avg episode reward: [(0, '581.760')] [2023-03-06 14:30:44,654][04272] Updated weights for policy 0, policy_version 2480 (0.0007) [2023-03-06 14:30:45,459][04272] Updated weights for policy 0, policy_version 2490 (0.0006) [2023-03-06 14:30:46,259][04272] Updated weights for policy 0, policy_version 2500 (0.0007) [2023-03-06 14:30:47,089][04272] Updated weights for policy 0, policy_version 2510 (0.0006) [2023-03-06 14:30:47,906][04272] Updated weights for policy 0, policy_version 2520 (0.0008) [2023-03-06 14:30:48,727][04272] Updated weights for policy 0, policy_version 2530 (0.0006) [2023-03-06 14:30:48,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12646.4, 300 sec: 12346.5). Total num frames: 2592768. Throughput: 0: 12650.1. Samples: 2585094. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:30:48,941][03942] Avg episode reward: [(0, '570.521')] [2023-03-06 14:30:49,550][04272] Updated weights for policy 0, policy_version 2540 (0.0006) [2023-03-06 14:30:50,351][04272] Updated weights for policy 0, policy_version 2550 (0.0006) [2023-03-06 14:30:51,149][04272] Updated weights for policy 0, policy_version 2560 (0.0006) [2023-03-06 14:30:51,962][04272] Updated weights for policy 0, policy_version 2570 (0.0006) [2023-03-06 14:30:52,771][04272] Updated weights for policy 0, policy_version 2580 (0.0006) [2023-03-06 14:30:53,582][04272] Updated weights for policy 0, policy_version 2590 (0.0006) [2023-03-06 14:30:53,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12646.4, 300 sec: 12354.7). Total num frames: 2656256. Throughput: 0: 12646.2. Samples: 2622996. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-06 14:30:53,941][03942] Avg episode reward: [(0, '565.459')] [2023-03-06 14:30:54,405][04272] Updated weights for policy 0, policy_version 2600 (0.0007) [2023-03-06 14:30:55,235][04272] Updated weights for policy 0, policy_version 2610 (0.0007) [2023-03-06 14:30:56,048][04272] Updated weights for policy 0, policy_version 2620 (0.0006) [2023-03-06 14:30:56,859][04272] Updated weights for policy 0, policy_version 2630 (0.0006) [2023-03-06 14:30:57,681][04272] Updated weights for policy 0, policy_version 2640 (0.0006) [2023-03-06 14:30:58,491][04272] Updated weights for policy 0, policy_version 2650 (0.0006) [2023-03-06 14:30:58,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12357.8). Total num frames: 2718720. Throughput: 0: 12636.7. Samples: 2698364. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 14:30:58,941][03942] Avg episode reward: [(0, '553.337')] [2023-03-06 14:30:59,303][04272] Updated weights for policy 0, policy_version 2660 (0.0006) [2023-03-06 14:31:00,105][04272] Updated weights for policy 0, policy_version 2670 (0.0006) [2023-03-06 14:31:00,925][04272] Updated weights for policy 0, policy_version 2680 (0.0007) [2023-03-06 14:31:01,738][04272] Updated weights for policy 0, policy_version 2690 (0.0006) [2023-03-06 14:31:02,561][04272] Updated weights for policy 0, policy_version 2700 (0.0006) [2023-03-06 14:31:03,365][04272] Updated weights for policy 0, policy_version 2710 (0.0006) [2023-03-06 14:31:03,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12646.4, 300 sec: 12365.4). Total num frames: 2782208. Throughput: 0: 12620.5. Samples: 2773908. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:31:03,941][03942] Avg episode reward: [(0, '575.973')] [2023-03-06 14:31:04,197][04272] Updated weights for policy 0, policy_version 2720 (0.0007) [2023-03-06 14:31:05,014][04272] Updated weights for policy 0, policy_version 2730 (0.0007) [2023-03-06 14:31:05,816][04272] Updated weights for policy 0, policy_version 2740 (0.0006) [2023-03-06 14:31:06,634][04272] Updated weights for policy 0, policy_version 2750 (0.0006) [2023-03-06 14:31:07,447][04272] Updated weights for policy 0, policy_version 2760 (0.0006) [2023-03-06 14:31:08,256][04272] Updated weights for policy 0, policy_version 2770 (0.0006) [2023-03-06 14:31:08,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12629.4, 300 sec: 12368.1). Total num frames: 2844672. Throughput: 0: 12615.0. Samples: 2811608. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:31:08,941][03942] Avg episode reward: [(0, '620.339')] [2023-03-06 14:31:08,945][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000002778_2844672.pth... 
[2023-03-06 14:31:08,974][04221] Saving new best policy, reward=620.339! [2023-03-06 14:31:09,072][04272] Updated weights for policy 0, policy_version 2780 (0.0006) [2023-03-06 14:31:09,891][04272] Updated weights for policy 0, policy_version 2790 (0.0006) [2023-03-06 14:31:10,731][04272] Updated weights for policy 0, policy_version 2800 (0.0007) [2023-03-06 14:31:11,562][04272] Updated weights for policy 0, policy_version 2810 (0.0006) [2023-03-06 14:31:12,409][04272] Updated weights for policy 0, policy_version 2820 (0.0007) [2023-03-06 14:31:13,222][04272] Updated weights for policy 0, policy_version 2830 (0.0006) [2023-03-06 14:31:13,941][03942] Fps is (10 sec: 12390.3, 60 sec: 12595.2, 300 sec: 12366.4). Total num frames: 2906112. Throughput: 0: 12591.1. Samples: 2886331. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:31:13,942][03942] Avg episode reward: [(0, '567.986')] [2023-03-06 14:31:14,080][04272] Updated weights for policy 0, policy_version 2840 (0.0008) [2023-03-06 14:31:14,960][04272] Updated weights for policy 0, policy_version 2850 (0.0007) [2023-03-06 14:31:15,840][04272] Updated weights for policy 0, policy_version 2860 (0.0007) [2023-03-06 14:31:16,682][04272] Updated weights for policy 0, policy_version 2870 (0.0006) [2023-03-06 14:31:17,495][04272] Updated weights for policy 0, policy_version 2880 (0.0006) [2023-03-06 14:31:18,326][04272] Updated weights for policy 0, policy_version 2890 (0.0006) [2023-03-06 14:31:18,941][03942] Fps is (10 sec: 12185.5, 60 sec: 12544.0, 300 sec: 12360.5). Total num frames: 2966528. Throughput: 0: 12515.9. Samples: 2958853. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:31:18,941][03942] Avg episode reward: [(0, '563.033')] [2023-03-06 14:31:19,140][04272] Updated weights for policy 0, policy_version 2900 (0.0006) [2023-03-06 14:31:19,944][04272] Updated weights for policy 0, policy_version 2910 (0.0006) [2023-03-06 14:31:20,770][04272] Updated weights for policy 0, policy_version 2920 (0.0007) [2023-03-06 14:31:21,573][04272] Updated weights for policy 0, policy_version 2930 (0.0006) [2023-03-06 14:31:22,381][04272] Updated weights for policy 0, policy_version 2940 (0.0007) [2023-03-06 14:31:23,198][04272] Updated weights for policy 0, policy_version 2950 (0.0006) [2023-03-06 14:31:23,941][03942] Fps is (10 sec: 12390.5, 60 sec: 12561.1, 300 sec: 12367.4). Total num frames: 3030016. Throughput: 0: 12519.9. Samples: 2996594. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:31:23,941][03942] Avg episode reward: [(0, '590.216')] [2023-03-06 14:31:24,019][04272] Updated weights for policy 0, policy_version 2960 (0.0007) [2023-03-06 14:31:24,831][04272] Updated weights for policy 0, policy_version 2970 (0.0006) [2023-03-06 14:31:25,659][04272] Updated weights for policy 0, policy_version 2980 (0.0008) [2023-03-06 14:31:26,448][04272] Updated weights for policy 0, policy_version 2990 (0.0006) [2023-03-06 14:31:27,259][04272] Updated weights for policy 0, policy_version 3000 (0.0006) [2023-03-06 14:31:28,054][04272] Updated weights for policy 0, policy_version 3010 (0.0007) [2023-03-06 14:31:28,875][04272] Updated weights for policy 0, policy_version 3020 (0.0006) [2023-03-06 14:31:28,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12544.0, 300 sec: 12369.9). Total num frames: 3092480. Throughput: 0: 12514.2. Samples: 3072292. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:31:28,941][03942] Avg episode reward: [(0, '603.671')] [2023-03-06 14:31:29,685][04272] Updated weights for policy 0, policy_version 3030 (0.0006) [2023-03-06 14:31:30,511][04272] Updated weights for policy 0, policy_version 3040 (0.0007) [2023-03-06 14:31:31,294][04272] Updated weights for policy 0, policy_version 3050 (0.0006) [2023-03-06 14:31:32,114][04272] Updated weights for policy 0, policy_version 3060 (0.0007) [2023-03-06 14:31:32,931][04272] Updated weights for policy 0, policy_version 3070 (0.0006) [2023-03-06 14:31:33,750][04272] Updated weights for policy 0, policy_version 3080 (0.0006) [2023-03-06 14:31:33,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12544.0, 300 sec: 12376.4). Total num frames: 3155968. Throughput: 0: 12510.1. Samples: 3148047. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 14:31:33,941][03942] Avg episode reward: [(0, '606.698')] [2023-03-06 14:31:34,530][04272] Updated weights for policy 0, policy_version 3090 (0.0007) [2023-03-06 14:31:35,372][04272] Updated weights for policy 0, policy_version 3100 (0.0007) [2023-03-06 14:31:36,188][04272] Updated weights for policy 0, policy_version 3110 (0.0007) [2023-03-06 14:31:37,010][04272] Updated weights for policy 0, policy_version 3120 (0.0006) [2023-03-06 14:31:37,804][04272] Updated weights for policy 0, policy_version 3130 (0.0005) [2023-03-06 14:31:38,642][04272] Updated weights for policy 0, policy_version 3140 (0.0006) [2023-03-06 14:31:38,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12526.9, 300 sec: 12378.6). Total num frames: 3218432. Throughput: 0: 12509.3. Samples: 3185915. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:31:38,941][03942] Avg episode reward: [(0, '602.376')] [2023-03-06 14:31:39,438][04272] Updated weights for policy 0, policy_version 3150 (0.0006) [2023-03-06 14:31:40,249][04272] Updated weights for policy 0, policy_version 3160 (0.0007) [2023-03-06 14:31:41,095][04272] Updated weights for policy 0, policy_version 3170 (0.0007) [2023-03-06 14:31:41,901][04272] Updated weights for policy 0, policy_version 3180 (0.0007) [2023-03-06 14:31:42,705][04272] Updated weights for policy 0, policy_version 3190 (0.0006) [2023-03-06 14:31:43,537][04272] Updated weights for policy 0, policy_version 3200 (0.0006) [2023-03-06 14:31:43,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12526.9, 300 sec: 12384.6). Total num frames: 3281920. Throughput: 0: 12506.3. Samples: 3261147. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:31:43,941][03942] Avg episode reward: [(0, '580.842')] [2023-03-06 14:31:44,334][04272] Updated weights for policy 0, policy_version 3210 (0.0007) [2023-03-06 14:31:45,135][04272] Updated weights for policy 0, policy_version 3220 (0.0006) [2023-03-06 14:31:45,966][04272] Updated weights for policy 0, policy_version 3230 (0.0007) [2023-03-06 14:31:46,781][04272] Updated weights for policy 0, policy_version 3240 (0.0006) [2023-03-06 14:31:47,584][04272] Updated weights for policy 0, policy_version 3250 (0.0007) [2023-03-06 14:31:48,410][04272] Updated weights for policy 0, policy_version 3260 (0.0006) [2023-03-06 14:31:48,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12526.9, 300 sec: 12386.6). Total num frames: 3344384. Throughput: 0: 12508.4. Samples: 3336785. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 14:31:48,941][03942] Avg episode reward: [(0, '587.479')] [2023-03-06 14:31:49,200][04272] Updated weights for policy 0, policy_version 3270 (0.0006) [2023-03-06 14:31:50,030][04272] Updated weights for policy 0, policy_version 3280 (0.0006) [2023-03-06 14:31:50,847][04272] Updated weights for policy 0, policy_version 3290 (0.0006) [2023-03-06 14:31:51,643][04272] Updated weights for policy 0, policy_version 3300 (0.0007) [2023-03-06 14:31:52,449][04272] Updated weights for policy 0, policy_version 3310 (0.0008) [2023-03-06 14:31:53,264][04272] Updated weights for policy 0, policy_version 3320 (0.0006) [2023-03-06 14:31:53,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12526.9, 300 sec: 12392.3). Total num frames: 3407872. Throughput: 0: 12508.9. Samples: 3374510. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:31:53,941][03942] Avg episode reward: [(0, '609.375')] [2023-03-06 14:31:54,073][04272] Updated weights for policy 0, policy_version 3330 (0.0006) [2023-03-06 14:31:54,886][04272] Updated weights for policy 0, policy_version 3340 (0.0006) [2023-03-06 14:31:55,712][04272] Updated weights for policy 0, policy_version 3350 (0.0006) [2023-03-06 14:31:56,526][04272] Updated weights for policy 0, policy_version 3360 (0.0006) [2023-03-06 14:31:57,337][04272] Updated weights for policy 0, policy_version 3370 (0.0006) [2023-03-06 14:31:58,142][04272] Updated weights for policy 0, policy_version 3380 (0.0007) [2023-03-06 14:31:58,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12526.9, 300 sec: 12394.1). Total num frames: 3470336. Throughput: 0: 12530.8. Samples: 3450217. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:31:58,941][03942] Avg episode reward: [(0, '573.618')] [2023-03-06 14:31:58,949][04272] Updated weights for policy 0, policy_version 3390 (0.0007) [2023-03-06 14:31:59,746][04272] Updated weights for policy 0, policy_version 3400 (0.0006) [2023-03-06 14:32:00,572][04272] Updated weights for policy 0, policy_version 3410 (0.0007) [2023-03-06 14:32:01,371][04272] Updated weights for policy 0, policy_version 3420 (0.0006) [2023-03-06 14:32:02,176][04272] Updated weights for policy 0, policy_version 3430 (0.0006) [2023-03-06 14:32:02,992][04272] Updated weights for policy 0, policy_version 3440 (0.0006) [2023-03-06 14:32:03,820][04272] Updated weights for policy 0, policy_version 3450 (0.0006) [2023-03-06 14:32:03,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12526.9, 300 sec: 12399.4). Total num frames: 3533824. Throughput: 0: 12603.4. Samples: 3526007. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:32:03,941][03942] Avg episode reward: [(0, '581.798')] [2023-03-06 14:32:04,629][04272] Updated weights for policy 0, policy_version 3460 (0.0006) [2023-03-06 14:32:05,461][04272] Updated weights for policy 0, policy_version 3470 (0.0006) [2023-03-06 14:32:06,281][04272] Updated weights for policy 0, policy_version 3480 (0.0007) [2023-03-06 14:32:07,098][04272] Updated weights for policy 0, policy_version 3490 (0.0007) [2023-03-06 14:32:07,897][04272] Updated weights for policy 0, policy_version 3500 (0.0007) [2023-03-06 14:32:08,722][04272] Updated weights for policy 0, policy_version 3510 (0.0006) [2023-03-06 14:32:08,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12526.9, 300 sec: 12401.0). Total num frames: 3596288. Throughput: 0: 12599.1. Samples: 3563554. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:32:08,941][03942] Avg episode reward: [(0, '485.859')] [2023-03-06 14:32:09,575][04272] Updated weights for policy 0, policy_version 3520 (0.0007) [2023-03-06 14:32:10,390][04272] Updated weights for policy 0, policy_version 3530 (0.0008) [2023-03-06 14:32:11,222][04272] Updated weights for policy 0, policy_version 3540 (0.0007) [2023-03-06 14:32:12,028][04272] Updated weights for policy 0, policy_version 3550 (0.0007) [2023-03-06 14:32:12,833][04272] Updated weights for policy 0, policy_version 3560 (0.0007) [2023-03-06 14:32:13,653][04272] Updated weights for policy 0, policy_version 3570 (0.0006) [2023-03-06 14:32:13,940][03942] Fps is (10 sec: 12492.8, 60 sec: 12544.0, 300 sec: 12402.6). Total num frames: 3658752. Throughput: 0: 12582.7. Samples: 3638514. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:32:13,941][03942] Avg episode reward: [(0, '500.997')] [2023-03-06 14:32:14,454][04272] Updated weights for policy 0, policy_version 3580 (0.0008) [2023-03-06 14:32:15,284][04272] Updated weights for policy 0, policy_version 3590 (0.0006) [2023-03-06 14:32:16,106][04272] Updated weights for policy 0, policy_version 3600 (0.0006) [2023-03-06 14:32:16,910][04272] Updated weights for policy 0, policy_version 3610 (0.0007) [2023-03-06 14:32:17,714][04272] Updated weights for policy 0, policy_version 3620 (0.0006) [2023-03-06 14:32:18,537][04272] Updated weights for policy 0, policy_version 3630 (0.0007) [2023-03-06 14:32:18,941][03942] Fps is (10 sec: 12492.8, 60 sec: 12578.1, 300 sec: 12611.0). Total num frames: 3721216. Throughput: 0: 12576.0. Samples: 3713968. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:32:18,941][03942] Avg episode reward: [(0, '583.850')] [2023-03-06 14:32:19,336][04272] Updated weights for policy 0, policy_version 3640 (0.0006) [2023-03-06 14:32:20,166][04272] Updated weights for policy 0, policy_version 3650 (0.0007) [2023-03-06 14:32:20,965][04272] Updated weights for policy 0, policy_version 3660 (0.0006) [2023-03-06 14:32:21,791][04272] Updated weights for policy 0, policy_version 3670 (0.0006) [2023-03-06 14:32:22,612][04272] Updated weights for policy 0, policy_version 3680 (0.0006) [2023-03-06 14:32:23,394][04272] Updated weights for policy 0, policy_version 3690 (0.0007) [2023-03-06 14:32:23,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12578.2, 300 sec: 12631.7). Total num frames: 3784704. Throughput: 0: 12571.7. Samples: 3751639. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:32:23,941][03942] Avg episode reward: [(0, '562.542')] [2023-03-06 14:32:24,226][04272] Updated weights for policy 0, policy_version 3700 (0.0006) [2023-03-06 14:32:25,027][04272] Updated weights for policy 0, policy_version 3710 (0.0006) [2023-03-06 14:32:25,858][04272] Updated weights for policy 0, policy_version 3720 (0.0006) [2023-03-06 14:32:26,663][04272] Updated weights for policy 0, policy_version 3730 (0.0006) [2023-03-06 14:32:27,485][04272] Updated weights for policy 0, policy_version 3740 (0.0006) [2023-03-06 14:32:28,296][04272] Updated weights for policy 0, policy_version 3750 (0.0006) [2023-03-06 14:32:28,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12578.1, 300 sec: 12624.7). Total num frames: 3847168. Throughput: 0: 12581.1. Samples: 3827295. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:32:28,941][03942] Avg episode reward: [(0, '570.119')] [2023-03-06 14:32:29,126][04272] Updated weights for policy 0, policy_version 3760 (0.0006) [2023-03-06 14:32:29,921][04272] Updated weights for policy 0, policy_version 3770 (0.0007) [2023-03-06 14:32:30,709][04272] Updated weights for policy 0, policy_version 3780 (0.0006) [2023-03-06 14:32:31,547][04272] Updated weights for policy 0, policy_version 3790 (0.0008) [2023-03-06 14:32:32,349][04272] Updated weights for policy 0, policy_version 3800 (0.0007) [2023-03-06 14:32:33,150][04272] Updated weights for policy 0, policy_version 3810 (0.0006) [2023-03-06 14:32:33,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12578.1, 300 sec: 12621.2). Total num frames: 3910656. Throughput: 0: 12582.0. Samples: 3902977. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:32:33,941][03942] Avg episode reward: [(0, '554.861')] [2023-03-06 14:32:33,991][04272] Updated weights for policy 0, policy_version 3820 (0.0008) [2023-03-06 14:32:34,791][04272] Updated weights for policy 0, policy_version 3830 (0.0006) [2023-03-06 14:32:35,601][04272] Updated weights for policy 0, policy_version 3840 (0.0007) [2023-03-06 14:32:36,428][04272] Updated weights for policy 0, policy_version 3850 (0.0007) [2023-03-06 14:32:37,230][04272] Updated weights for policy 0, policy_version 3860 (0.0006) [2023-03-06 14:32:38,067][04272] Updated weights for policy 0, policy_version 3870 (0.0006) [2023-03-06 14:32:38,876][04272] Updated weights for policy 0, policy_version 3880 (0.0007) [2023-03-06 14:32:38,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12578.1, 300 sec: 12617.8). Total num frames: 3973120. Throughput: 0: 12578.8. Samples: 3940558. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:32:38,941][03942] Avg episode reward: [(0, '588.878')] [2023-03-06 14:32:39,683][04272] Updated weights for policy 0, policy_version 3890 (0.0006) [2023-03-06 14:32:40,512][04272] Updated weights for policy 0, policy_version 3900 (0.0006) [2023-03-06 14:32:41,334][04272] Updated weights for policy 0, policy_version 3910 (0.0007) [2023-03-06 14:32:42,130][04272] Updated weights for policy 0, policy_version 3920 (0.0006) [2023-03-06 14:32:42,938][04272] Updated weights for policy 0, policy_version 3930 (0.0007) [2023-03-06 14:32:43,762][04272] Updated weights for policy 0, policy_version 3940 (0.0006) [2023-03-06 14:32:43,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12578.2, 300 sec: 12617.8). Total num frames: 4036608. Throughput: 0: 12568.0. Samples: 4015775. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:32:43,941][03942] Avg episode reward: [(0, '598.108')] [2023-03-06 14:32:44,578][04272] Updated weights for policy 0, policy_version 3950 (0.0006) [2023-03-06 14:32:45,409][04272] Updated weights for policy 0, policy_version 3960 (0.0007) [2023-03-06 14:32:46,226][04272] Updated weights for policy 0, policy_version 3970 (0.0006) [2023-03-06 14:32:47,057][04272] Updated weights for policy 0, policy_version 3980 (0.0006) [2023-03-06 14:32:47,854][04272] Updated weights for policy 0, policy_version 3990 (0.0007) [2023-03-06 14:32:48,663][04272] Updated weights for policy 0, policy_version 4000 (0.0006) [2023-03-06 14:32:48,940][03942] Fps is (10 sec: 12595.4, 60 sec: 12578.1, 300 sec: 12614.3). Total num frames: 4099072. Throughput: 0: 12562.9. Samples: 4091339. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:32:48,941][03942] Avg episode reward: [(0, '665.399')] [2023-03-06 14:32:48,944][04221] Saving new best policy, reward=665.399! 
[2023-03-06 14:32:49,497][04272] Updated weights for policy 0, policy_version 4010 (0.0006) [2023-03-06 14:32:50,316][04272] Updated weights for policy 0, policy_version 4020 (0.0006) [2023-03-06 14:32:51,134][04272] Updated weights for policy 0, policy_version 4030 (0.0007) [2023-03-06 14:32:51,961][04272] Updated weights for policy 0, policy_version 4040 (0.0006) [2023-03-06 14:32:52,762][04272] Updated weights for policy 0, policy_version 4050 (0.0006) [2023-03-06 14:32:53,580][04272] Updated weights for policy 0, policy_version 4060 (0.0006) [2023-03-06 14:32:53,940][03942] Fps is (10 sec: 12492.8, 60 sec: 12561.1, 300 sec: 12610.8). Total num frames: 4161536. Throughput: 0: 12554.2. Samples: 4128491. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:32:53,941][03942] Avg episode reward: [(0, '617.740')] [2023-03-06 14:32:54,407][04272] Updated weights for policy 0, policy_version 4070 (0.0006) [2023-03-06 14:32:55,202][04272] Updated weights for policy 0, policy_version 4080 (0.0006) [2023-03-06 14:32:56,019][04272] Updated weights for policy 0, policy_version 4090 (0.0007) [2023-03-06 14:32:56,830][04272] Updated weights for policy 0, policy_version 4100 (0.0006) [2023-03-06 14:32:57,635][04272] Updated weights for policy 0, policy_version 4110 (0.0007) [2023-03-06 14:32:58,456][04272] Updated weights for policy 0, policy_version 4120 (0.0006) [2023-03-06 14:32:58,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12578.2, 300 sec: 12610.8). Total num frames: 4225024. Throughput: 0: 12565.8. Samples: 4203973. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:32:58,941][03942] Avg episode reward: [(0, '650.389')] [2023-03-06 14:32:59,293][04272] Updated weights for policy 0, policy_version 4130 (0.0006) [2023-03-06 14:33:00,097][04272] Updated weights for policy 0, policy_version 4140 (0.0006) [2023-03-06 14:33:00,906][04272] Updated weights for policy 0, policy_version 4150 (0.0006) [2023-03-06 14:33:01,738][04272] Updated weights for policy 0, policy_version 4160 (0.0006) [2023-03-06 14:33:02,541][04272] Updated weights for policy 0, policy_version 4170 (0.0007) [2023-03-06 14:33:03,378][04272] Updated weights for policy 0, policy_version 4180 (0.0007) [2023-03-06 14:33:03,941][03942] Fps is (10 sec: 12492.7, 60 sec: 12544.0, 300 sec: 12603.9). Total num frames: 4286464. Throughput: 0: 12561.9. Samples: 4279252. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:33:03,941][03942] Avg episode reward: [(0, '660.226')] [2023-03-06 14:33:04,195][04272] Updated weights for policy 0, policy_version 4190 (0.0006) [2023-03-06 14:33:05,005][04272] Updated weights for policy 0, policy_version 4200 (0.0006) [2023-03-06 14:33:05,838][04272] Updated weights for policy 0, policy_version 4210 (0.0006) [2023-03-06 14:33:06,653][04272] Updated weights for policy 0, policy_version 4220 (0.0007) [2023-03-06 14:33:07,469][04272] Updated weights for policy 0, policy_version 4230 (0.0006) [2023-03-06 14:33:08,279][04272] Updated weights for policy 0, policy_version 4240 (0.0006) [2023-03-06 14:33:08,941][03942] Fps is (10 sec: 12390.4, 60 sec: 12544.0, 300 sec: 12600.4). Total num frames: 4348928. Throughput: 0: 12555.3. Samples: 4316627. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:33:08,941][03942] Avg episode reward: [(0, '687.632')] [2023-03-06 14:33:08,944][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000004248_4349952.pth... 
[2023-03-06 14:33:08,976][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000001297_1328128.pth [2023-03-06 14:33:08,979][04221] Saving new best policy, reward=687.632! [2023-03-06 14:33:09,105][04272] Updated weights for policy 0, policy_version 4250 (0.0006) [2023-03-06 14:33:09,947][04272] Updated weights for policy 0, policy_version 4260 (0.0006) [2023-03-06 14:33:10,747][04272] Updated weights for policy 0, policy_version 4270 (0.0007) [2023-03-06 14:33:11,557][04272] Updated weights for policy 0, policy_version 4280 (0.0006) [2023-03-06 14:33:12,385][04272] Updated weights for policy 0, policy_version 4290 (0.0006) [2023-03-06 14:33:13,194][04272] Updated weights for policy 0, policy_version 4300 (0.0006) [2023-03-06 14:33:13,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12561.1, 300 sec: 12600.4). Total num frames: 4412416. Throughput: 0: 12541.2. Samples: 4391647. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:33:13,941][03942] Avg episode reward: [(0, '675.647')] [2023-03-06 14:33:14,038][04272] Updated weights for policy 0, policy_version 4310 (0.0006) [2023-03-06 14:33:14,848][04272] Updated weights for policy 0, policy_version 4320 (0.0006) [2023-03-06 14:33:15,650][04272] Updated weights for policy 0, policy_version 4330 (0.0006) [2023-03-06 14:33:16,466][04272] Updated weights for policy 0, policy_version 4340 (0.0006) [2023-03-06 14:33:17,293][04272] Updated weights for policy 0, policy_version 4350 (0.0007) [2023-03-06 14:33:18,110][04272] Updated weights for policy 0, policy_version 4360 (0.0007) [2023-03-06 14:33:18,913][04272] Updated weights for policy 0, policy_version 4370 (0.0006) [2023-03-06 14:33:18,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12561.1, 300 sec: 12596.9). Total num frames: 4474880. Throughput: 0: 12530.5. Samples: 4466851. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:33:18,941][03942] Avg episode reward: [(0, '612.337')] [2023-03-06 14:33:19,761][04272] Updated weights for policy 0, policy_version 4380 (0.0006) [2023-03-06 14:33:20,566][04272] Updated weights for policy 0, policy_version 4390 (0.0006) [2023-03-06 14:33:21,377][04272] Updated weights for policy 0, policy_version 4400 (0.0006) [2023-03-06 14:33:22,212][04272] Updated weights for policy 0, policy_version 4410 (0.0006) [2023-03-06 14:33:23,018][04272] Updated weights for policy 0, policy_version 4420 (0.0006) [2023-03-06 14:33:23,822][04272] Updated weights for policy 0, policy_version 4430 (0.0006) [2023-03-06 14:33:23,941][03942] Fps is (10 sec: 12492.8, 60 sec: 12544.0, 300 sec: 12593.5). Total num frames: 4537344. Throughput: 0: 12529.2. Samples: 4504371. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:33:23,941][03942] Avg episode reward: [(0, '621.752')] [2023-03-06 14:33:24,640][04272] Updated weights for policy 0, policy_version 4440 (0.0006) [2023-03-06 14:33:25,443][04272] Updated weights for policy 0, policy_version 4450 (0.0007) [2023-03-06 14:33:26,252][04272] Updated weights for policy 0, policy_version 4460 (0.0007) [2023-03-06 14:33:27,093][04272] Updated weights for policy 0, policy_version 4470 (0.0007) [2023-03-06 14:33:27,907][04272] Updated weights for policy 0, policy_version 4480 (0.0007) [2023-03-06 14:33:28,717][04272] Updated weights for policy 0, policy_version 4490 (0.0006) [2023-03-06 14:33:28,941][03942] Fps is (10 sec: 12492.7, 60 sec: 12544.0, 300 sec: 12590.0). Total num frames: 4599808. Throughput: 0: 12529.3. Samples: 4579596. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:33:28,941][03942] Avg episode reward: [(0, '611.405')] [2023-03-06 14:33:29,554][04272] Updated weights for policy 0, policy_version 4500 (0.0006) [2023-03-06 14:33:30,370][04272] Updated weights for policy 0, policy_version 4510 (0.0006) [2023-03-06 14:33:31,185][04272] Updated weights for policy 0, policy_version 4520 (0.0007) [2023-03-06 14:33:32,010][04272] Updated weights for policy 0, policy_version 4530 (0.0006) [2023-03-06 14:33:32,819][04272] Updated weights for policy 0, policy_version 4540 (0.0006) [2023-03-06 14:33:33,646][04272] Updated weights for policy 0, policy_version 4550 (0.0006) [2023-03-06 14:33:33,940][03942] Fps is (10 sec: 12492.9, 60 sec: 12526.9, 300 sec: 12590.0). Total num frames: 4662272. Throughput: 0: 12515.5. Samples: 4654537. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:33:33,941][03942] Avg episode reward: [(0, '655.055')] [2023-03-06 14:33:34,462][04272] Updated weights for policy 0, policy_version 4560 (0.0007) [2023-03-06 14:33:35,292][04272] Updated weights for policy 0, policy_version 4570 (0.0007) [2023-03-06 14:33:36,105][04272] Updated weights for policy 0, policy_version 4580 (0.0007) [2023-03-06 14:33:36,914][04272] Updated weights for policy 0, policy_version 4590 (0.0008) [2023-03-06 14:33:37,742][04272] Updated weights for policy 0, policy_version 4600 (0.0006) [2023-03-06 14:33:38,549][04272] Updated weights for policy 0, policy_version 4610 (0.0006) [2023-03-06 14:33:38,941][03942] Fps is (10 sec: 12492.9, 60 sec: 12526.9, 300 sec: 12586.5). Total num frames: 4724736. Throughput: 0: 12527.3. Samples: 4692222. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:33:38,941][03942] Avg episode reward: [(0, '676.212')] [2023-03-06 14:33:39,381][04272] Updated weights for policy 0, policy_version 4620 (0.0007) [2023-03-06 14:33:40,184][04272] Updated weights for policy 0, policy_version 4630 (0.0006) [2023-03-06 14:33:40,999][04272] Updated weights for policy 0, policy_version 4640 (0.0007) [2023-03-06 14:33:41,818][04272] Updated weights for policy 0, policy_version 4650 (0.0006) [2023-03-06 14:33:42,633][04272] Updated weights for policy 0, policy_version 4660 (0.0007) [2023-03-06 14:33:43,438][04272] Updated weights for policy 0, policy_version 4670 (0.0006) [2023-03-06 14:33:43,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12526.9, 300 sec: 12586.5). Total num frames: 4788224. Throughput: 0: 12521.0. Samples: 4767418. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:33:43,941][03942] Avg episode reward: [(0, '673.159')] [2023-03-06 14:33:44,261][04272] Updated weights for policy 0, policy_version 4680 (0.0006) [2023-03-06 14:33:45,072][04272] Updated weights for policy 0, policy_version 4690 (0.0007) [2023-03-06 14:33:45,897][04272] Updated weights for policy 0, policy_version 4700 (0.0006) [2023-03-06 14:33:46,710][04272] Updated weights for policy 0, policy_version 4710 (0.0006) [2023-03-06 14:33:47,531][04272] Updated weights for policy 0, policy_version 4720 (0.0006) [2023-03-06 14:33:48,354][04272] Updated weights for policy 0, policy_version 4730 (0.0006) [2023-03-06 14:33:48,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12526.9, 300 sec: 12583.1). Total num frames: 4850688. Throughput: 0: 12521.6. Samples: 4842724. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:33:48,941][03942] Avg episode reward: [(0, '645.507')] [2023-03-06 14:33:49,156][04272] Updated weights for policy 0, policy_version 4740 (0.0006) [2023-03-06 14:33:49,994][04272] Updated weights for policy 0, policy_version 4750 (0.0006) [2023-03-06 14:33:50,804][04272] Updated weights for policy 0, policy_version 4760 (0.0007) [2023-03-06 14:33:51,619][04272] Updated weights for policy 0, policy_version 4770 (0.0006) [2023-03-06 14:33:52,455][04272] Updated weights for policy 0, policy_version 4780 (0.0006) [2023-03-06 14:33:53,265][04272] Updated weights for policy 0, policy_version 4790 (0.0007) [2023-03-06 14:33:53,940][03942] Fps is (10 sec: 12492.8, 60 sec: 12526.9, 300 sec: 12579.6). Total num frames: 4913152. Throughput: 0: 12525.8. Samples: 4880289. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:33:53,941][03942] Avg episode reward: [(0, '590.300')] [2023-03-06 14:33:54,074][04272] Updated weights for policy 0, policy_version 4800 (0.0006) [2023-03-06 14:33:54,916][04272] Updated weights for policy 0, policy_version 4810 (0.0006) [2023-03-06 14:33:55,718][04272] Updated weights for policy 0, policy_version 4820 (0.0006) [2023-03-06 14:33:56,541][04272] Updated weights for policy 0, policy_version 4830 (0.0006) [2023-03-06 14:33:57,387][04272] Updated weights for policy 0, policy_version 4840 (0.0007) [2023-03-06 14:33:58,208][04272] Updated weights for policy 0, policy_version 4850 (0.0006) [2023-03-06 14:33:58,941][03942] Fps is (10 sec: 12390.3, 60 sec: 12492.8, 300 sec: 12572.6). Total num frames: 4974592. Throughput: 0: 12518.5. Samples: 4954981. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:33:58,941][03942] Avg episode reward: [(0, '660.950')] [2023-03-06 14:33:59,017][04272] Updated weights for policy 0, policy_version 4860 (0.0006) [2023-03-06 14:33:59,843][04272] Updated weights for policy 0, policy_version 4870 (0.0007) [2023-03-06 14:34:00,650][04272] Updated weights for policy 0, policy_version 4880 (0.0007) [2023-03-06 14:34:01,476][04272] Updated weights for policy 0, policy_version 4890 (0.0007) [2023-03-06 14:34:02,295][04272] Updated weights for policy 0, policy_version 4900 (0.0006) [2023-03-06 14:34:03,102][04272] Updated weights for policy 0, policy_version 4910 (0.0006) [2023-03-06 14:34:03,915][04272] Updated weights for policy 0, policy_version 4920 (0.0006) [2023-03-06 14:34:03,941][03942] Fps is (10 sec: 12492.7, 60 sec: 12526.9, 300 sec: 12576.1). Total num frames: 5038080. Throughput: 0: 12514.1. Samples: 5029986. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:34:03,941][03942] Avg episode reward: [(0, '625.346')] [2023-03-06 14:34:04,732][04272] Updated weights for policy 0, policy_version 4930 (0.0006) [2023-03-06 14:34:05,562][04272] Updated weights for policy 0, policy_version 4940 (0.0006) [2023-03-06 14:34:06,396][04272] Updated weights for policy 0, policy_version 4950 (0.0006) [2023-03-06 14:34:07,222][04272] Updated weights for policy 0, policy_version 4960 (0.0006) [2023-03-06 14:34:08,037][04272] Updated weights for policy 0, policy_version 4970 (0.0006) [2023-03-06 14:34:08,845][04272] Updated weights for policy 0, policy_version 4980 (0.0006) [2023-03-06 14:34:08,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12526.9, 300 sec: 12572.6). Total num frames: 5100544. Throughput: 0: 12511.3. Samples: 5067379. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:34:08,941][03942] Avg episode reward: [(0, '686.647')] [2023-03-06 14:34:09,669][04272] Updated weights for policy 0, policy_version 4990 (0.0006) [2023-03-06 14:34:10,509][04272] Updated weights for policy 0, policy_version 5000 (0.0006) [2023-03-06 14:34:11,321][04272] Updated weights for policy 0, policy_version 5010 (0.0006) [2023-03-06 14:34:12,148][04272] Updated weights for policy 0, policy_version 5020 (0.0006) [2023-03-06 14:34:12,986][04272] Updated weights for policy 0, policy_version 5030 (0.0006) [2023-03-06 14:34:13,804][04272] Updated weights for policy 0, policy_version 5040 (0.0006) [2023-03-06 14:34:13,940][03942] Fps is (10 sec: 12390.6, 60 sec: 12492.8, 300 sec: 12565.7). Total num frames: 5161984. Throughput: 0: 12501.6. Samples: 5142167. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:34:13,941][03942] Avg episode reward: [(0, '645.124')] [2023-03-06 14:34:14,626][04272] Updated weights for policy 0, policy_version 5050 (0.0006) [2023-03-06 14:34:15,436][04272] Updated weights for policy 0, policy_version 5060 (0.0006) [2023-03-06 14:34:16,268][04272] Updated weights for policy 0, policy_version 5070 (0.0006) [2023-03-06 14:34:17,085][04272] Updated weights for policy 0, policy_version 5080 (0.0007) [2023-03-06 14:34:17,913][04272] Updated weights for policy 0, policy_version 5090 (0.0007) [2023-03-06 14:34:18,725][04272] Updated weights for policy 0, policy_version 5100 (0.0006) [2023-03-06 14:34:18,941][03942] Fps is (10 sec: 12390.4, 60 sec: 12492.8, 300 sec: 12562.2). Total num frames: 5224448. Throughput: 0: 12496.4. Samples: 5216876. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:34:18,941][03942] Avg episode reward: [(0, '674.851')] [2023-03-06 14:34:19,532][04272] Updated weights for policy 0, policy_version 5110 (0.0007) [2023-03-06 14:34:20,358][04272] Updated weights for policy 0, policy_version 5120 (0.0007) [2023-03-06 14:34:21,173][04272] Updated weights for policy 0, policy_version 5130 (0.0007) [2023-03-06 14:34:22,004][04272] Updated weights for policy 0, policy_version 5140 (0.0006) [2023-03-06 14:34:22,814][04272] Updated weights for policy 0, policy_version 5150 (0.0007) [2023-03-06 14:34:23,635][04272] Updated weights for policy 0, policy_version 5160 (0.0006) [2023-03-06 14:34:23,941][03942] Fps is (10 sec: 12492.7, 60 sec: 12492.8, 300 sec: 12558.8). Total num frames: 5286912. Throughput: 0: 12492.2. Samples: 5254372. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:34:23,941][03942] Avg episode reward: [(0, '627.508')] [2023-03-06 14:34:24,454][04272] Updated weights for policy 0, policy_version 5170 (0.0006) [2023-03-06 14:34:25,277][04272] Updated weights for policy 0, policy_version 5180 (0.0006) [2023-03-06 14:34:26,093][04272] Updated weights for policy 0, policy_version 5190 (0.0006) [2023-03-06 14:34:26,912][04272] Updated weights for policy 0, policy_version 5200 (0.0006) [2023-03-06 14:34:27,733][04272] Updated weights for policy 0, policy_version 5210 (0.0007) [2023-03-06 14:34:28,540][04272] Updated weights for policy 0, policy_version 5220 (0.0006) [2023-03-06 14:34:28,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12509.9, 300 sec: 12558.8). Total num frames: 5350400. Throughput: 0: 12490.0. Samples: 5329466. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:34:28,941][03942] Avg episode reward: [(0, '594.845')] [2023-03-06 14:34:29,367][04272] Updated weights for policy 0, policy_version 5230 (0.0006) [2023-03-06 14:34:30,178][04272] Updated weights for policy 0, policy_version 5240 (0.0006) [2023-03-06 14:34:30,998][04272] Updated weights for policy 0, policy_version 5250 (0.0006) [2023-03-06 14:34:31,812][04272] Updated weights for policy 0, policy_version 5260 (0.0007) [2023-03-06 14:34:32,631][04272] Updated weights for policy 0, policy_version 5270 (0.0006) [2023-03-06 14:34:33,457][04272] Updated weights for policy 0, policy_version 5280 (0.0006) [2023-03-06 14:34:33,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12509.9, 300 sec: 12555.3). Total num frames: 5412864. Throughput: 0: 12486.5. Samples: 5404617. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:34:33,941][03942] Avg episode reward: [(0, '673.026')] [2023-03-06 14:34:34,252][04272] Updated weights for policy 0, policy_version 5290 (0.0007) [2023-03-06 14:34:35,099][04272] Updated weights for policy 0, policy_version 5300 (0.0006) [2023-03-06 14:34:35,917][04272] Updated weights for policy 0, policy_version 5310 (0.0006) [2023-03-06 14:34:36,715][04272] Updated weights for policy 0, policy_version 5320 (0.0006) [2023-03-06 14:34:37,553][04272] Updated weights for policy 0, policy_version 5330 (0.0007) [2023-03-06 14:34:38,394][04272] Updated weights for policy 0, policy_version 5340 (0.0008) [2023-03-06 14:34:38,941][03942] Fps is (10 sec: 12492.7, 60 sec: 12509.9, 300 sec: 12555.3). Total num frames: 5475328. Throughput: 0: 12483.3. Samples: 5442037. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:34:38,941][03942] Avg episode reward: [(0, '640.104')] [2023-03-06 14:34:39,192][04272] Updated weights for policy 0, policy_version 5350 (0.0007) [2023-03-06 14:34:39,995][04272] Updated weights for policy 0, policy_version 5360 (0.0006) [2023-03-06 14:34:40,826][04272] Updated weights for policy 0, policy_version 5370 (0.0007) [2023-03-06 14:34:41,633][04272] Updated weights for policy 0, policy_version 5380 (0.0006) [2023-03-06 14:34:42,477][04272] Updated weights for policy 0, policy_version 5390 (0.0007) [2023-03-06 14:34:43,263][04272] Updated weights for policy 0, policy_version 5400 (0.0006) [2023-03-06 14:34:43,941][03942] Fps is (10 sec: 12492.8, 60 sec: 12492.8, 300 sec: 12555.3). Total num frames: 5537792. Throughput: 0: 12490.0. Samples: 5517029. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:34:43,952][03942] Avg episode reward: [(0, '641.228')] [2023-03-06 14:34:44,092][04272] Updated weights for policy 0, policy_version 5410 (0.0007) [2023-03-06 14:34:44,919][04272] Updated weights for policy 0, policy_version 5420 (0.0006) [2023-03-06 14:34:45,740][04272] Updated weights for policy 0, policy_version 5430 (0.0006) [2023-03-06 14:34:46,539][04272] Updated weights for policy 0, policy_version 5440 (0.0006) [2023-03-06 14:34:47,354][04272] Updated weights for policy 0, policy_version 5450 (0.0007) [2023-03-06 14:34:48,178][04272] Updated weights for policy 0, policy_version 5460 (0.0006) [2023-03-06 14:34:48,941][03942] Fps is (10 sec: 12492.8, 60 sec: 12492.8, 300 sec: 12551.8). Total num frames: 5600256. Throughput: 0: 12497.4. Samples: 5592367. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:34:48,952][03942] Avg episode reward: [(0, '648.979')] [2023-03-06 14:34:48,987][04272] Updated weights for policy 0, policy_version 5470 (0.0005) [2023-03-06 14:34:49,796][04272] Updated weights for policy 0, policy_version 5480 (0.0006) [2023-03-06 14:34:50,620][04272] Updated weights for policy 0, policy_version 5490 (0.0006) [2023-03-06 14:34:51,439][04272] Updated weights for policy 0, policy_version 5500 (0.0006) [2023-03-06 14:34:52,246][04272] Updated weights for policy 0, policy_version 5510 (0.0007) [2023-03-06 14:34:53,081][04272] Updated weights for policy 0, policy_version 5520 (0.0006) [2023-03-06 14:34:53,882][04272] Updated weights for policy 0, policy_version 5530 (0.0006) [2023-03-06 14:34:53,941][03942] Fps is (10 sec: 12492.8, 60 sec: 12492.8, 300 sec: 12548.3). Total num frames: 5662720. Throughput: 0: 12504.1. Samples: 5630063. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:34:53,952][03942] Avg episode reward: [(0, '637.116')] [2023-03-06 14:34:54,699][04272] Updated weights for policy 0, policy_version 5540 (0.0006) [2023-03-06 14:34:55,537][04272] Updated weights for policy 0, policy_version 5550 (0.0006) [2023-03-06 14:34:56,337][04272] Updated weights for policy 0, policy_version 5560 (0.0006) [2023-03-06 14:34:57,157][04272] Updated weights for policy 0, policy_version 5570 (0.0006) [2023-03-06 14:34:57,982][04272] Updated weights for policy 0, policy_version 5580 (0.0006) [2023-03-06 14:34:58,800][04272] Updated weights for policy 0, policy_version 5590 (0.0006) [2023-03-06 14:34:58,941][03942] Fps is (10 sec: 12492.7, 60 sec: 12509.9, 300 sec: 12548.3). Total num frames: 5725184. Throughput: 0: 12514.1. Samples: 5705303. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:34:58,952][03942] Avg episode reward: [(0, '628.300')] [2023-03-06 14:34:59,614][04272] Updated weights for policy 0, policy_version 5600 (0.0007) [2023-03-06 14:35:00,423][04272] Updated weights for policy 0, policy_version 5610 (0.0006) [2023-03-06 14:35:01,241][04272] Updated weights for policy 0, policy_version 5620 (0.0006) [2023-03-06 14:35:02,043][04272] Updated weights for policy 0, policy_version 5630 (0.0006) [2023-03-06 14:35:02,865][04272] Updated weights for policy 0, policy_version 5640 (0.0006) [2023-03-06 14:35:03,662][04272] Updated weights for policy 0, policy_version 5650 (0.0007) [2023-03-06 14:35:03,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12509.9, 300 sec: 12548.3). Total num frames: 5788672. Throughput: 0: 12529.2. Samples: 5780691. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:35:03,952][03942] Avg episode reward: [(0, '599.313')] [2023-03-06 14:35:04,482][04272] Updated weights for policy 0, policy_version 5660 (0.0006) [2023-03-06 14:35:05,307][04272] Updated weights for policy 0, policy_version 5670 (0.0006) [2023-03-06 14:35:06,127][04272] Updated weights for policy 0, policy_version 5680 (0.0006) [2023-03-06 14:35:06,934][04272] Updated weights for policy 0, policy_version 5690 (0.0006) [2023-03-06 14:35:07,740][04272] Updated weights for policy 0, policy_version 5700 (0.0006) [2023-03-06 14:35:08,531][04272] Updated weights for policy 0, policy_version 5710 (0.0006) [2023-03-06 14:35:08,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12526.9, 300 sec: 12548.3). Total num frames: 5852160. Throughput: 0: 12533.7. Samples: 5818386. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:35:08,951][03942] Avg episode reward: [(0, '597.821')] [2023-03-06 14:35:08,956][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000005715_5852160.pth... 
[2023-03-06 14:35:08,986][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000002778_2844672.pth [2023-03-06 14:35:09,342][04272] Updated weights for policy 0, policy_version 5720 (0.0006) [2023-03-06 14:35:10,157][04272] Updated weights for policy 0, policy_version 5730 (0.0006) [2023-03-06 14:35:10,960][04272] Updated weights for policy 0, policy_version 5740 (0.0006) [2023-03-06 14:35:11,785][04272] Updated weights for policy 0, policy_version 5750 (0.0007) [2023-03-06 14:35:12,594][04272] Updated weights for policy 0, policy_version 5760 (0.0007) [2023-03-06 14:35:13,414][04272] Updated weights for policy 0, policy_version 5770 (0.0006) [2023-03-06 14:35:13,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12544.0, 300 sec: 12544.9). Total num frames: 5914624. Throughput: 0: 12552.9. Samples: 5894346. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:35:13,951][03942] Avg episode reward: [(0, '609.212')] [2023-03-06 14:35:14,217][04272] Updated weights for policy 0, policy_version 5780 (0.0006) [2023-03-06 14:35:15,014][04272] Updated weights for policy 0, policy_version 5790 (0.0006) [2023-03-06 14:35:15,844][04272] Updated weights for policy 0, policy_version 5800 (0.0006) [2023-03-06 14:35:16,643][04272] Updated weights for policy 0, policy_version 5810 (0.0007) [2023-03-06 14:35:17,459][04272] Updated weights for policy 0, policy_version 5820 (0.0006) [2023-03-06 14:35:18,285][04272] Updated weights for policy 0, policy_version 5830 (0.0006) [2023-03-06 14:35:18,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12561.1, 300 sec: 12548.3). Total num frames: 5978112. Throughput: 0: 12559.5. Samples: 5969796. 
Policy #0 lag: (min: 0.0, avg: 0.9, max: 2.0) [2023-03-06 14:35:18,951][03942] Avg episode reward: [(0, '658.725')] [2023-03-06 14:35:19,094][04272] Updated weights for policy 0, policy_version 5840 (0.0006) [2023-03-06 14:35:19,907][04272] Updated weights for policy 0, policy_version 5850 (0.0006) [2023-03-06 14:35:20,739][04272] Updated weights for policy 0, policy_version 5860 (0.0006) [2023-03-06 14:35:21,538][04272] Updated weights for policy 0, policy_version 5870 (0.0006) [2023-03-06 14:35:22,349][04272] Updated weights for policy 0, policy_version 5880 (0.0007) [2023-03-06 14:35:23,177][04272] Updated weights for policy 0, policy_version 5890 (0.0006) [2023-03-06 14:35:23,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12561.1, 300 sec: 12544.9). Total num frames: 6040576. Throughput: 0: 12566.6. Samples: 6007535. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:35:23,952][03942] Avg episode reward: [(0, '663.697')] [2023-03-06 14:35:23,995][04272] Updated weights for policy 0, policy_version 5900 (0.0006) [2023-03-06 14:35:24,799][04272] Updated weights for policy 0, policy_version 5910 (0.0006) [2023-03-06 14:35:25,611][04272] Updated weights for policy 0, policy_version 5920 (0.0006) [2023-03-06 14:35:26,458][04272] Updated weights for policy 0, policy_version 5930 (0.0006) [2023-03-06 14:35:27,254][04272] Updated weights for policy 0, policy_version 5940 (0.0006) [2023-03-06 14:35:28,077][04272] Updated weights for policy 0, policy_version 5950 (0.0006) [2023-03-06 14:35:28,901][04272] Updated weights for policy 0, policy_version 5960 (0.0006) [2023-03-06 14:35:28,941][03942] Fps is (10 sec: 12492.7, 60 sec: 12544.0, 300 sec: 12541.4). Total num frames: 6103040. Throughput: 0: 12570.5. Samples: 6082703. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:35:28,941][03942] Avg episode reward: [(0, '658.783')] [2023-03-06 14:35:29,694][04272] Updated weights for policy 0, policy_version 5970 (0.0006) [2023-03-06 14:35:30,529][04272] Updated weights for policy 0, policy_version 5980 (0.0006) [2023-03-06 14:35:31,347][04272] Updated weights for policy 0, policy_version 5990 (0.0006) [2023-03-06 14:35:32,147][04272] Updated weights for policy 0, policy_version 6000 (0.0006) [2023-03-06 14:35:32,977][04272] Updated weights for policy 0, policy_version 6010 (0.0007) [2023-03-06 14:35:33,799][04272] Updated weights for policy 0, policy_version 6020 (0.0006) [2023-03-06 14:35:33,940][03942] Fps is (10 sec: 12492.9, 60 sec: 12544.0, 300 sec: 12537.9). Total num frames: 6165504. Throughput: 0: 12568.6. Samples: 6157952. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:35:33,941][03942] Avg episode reward: [(0, '714.020')] [2023-03-06 14:35:33,951][04221] Saving new best policy, reward=714.020! [2023-03-06 14:35:34,606][04272] Updated weights for policy 0, policy_version 6030 (0.0006) [2023-03-06 14:35:35,416][04272] Updated weights for policy 0, policy_version 6040 (0.0006) [2023-03-06 14:35:36,239][04272] Updated weights for policy 0, policy_version 6050 (0.0006) [2023-03-06 14:35:37,061][04272] Updated weights for policy 0, policy_version 6060 (0.0006) [2023-03-06 14:35:37,879][04272] Updated weights for policy 0, policy_version 6070 (0.0007) [2023-03-06 14:35:38,712][04272] Updated weights for policy 0, policy_version 6080 (0.0007) [2023-03-06 14:35:38,941][03942] Fps is (10 sec: 12492.8, 60 sec: 12544.0, 300 sec: 12534.5). Total num frames: 6227968. Throughput: 0: 12566.6. Samples: 6195560. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-06 14:35:38,941][03942] Avg episode reward: [(0, '676.202')] [2023-03-06 14:35:39,526][04272] Updated weights for policy 0, policy_version 6090 (0.0006) [2023-03-06 14:35:40,343][04272] Updated weights for policy 0, policy_version 6100 (0.0006) [2023-03-06 14:35:41,159][04272] Updated weights for policy 0, policy_version 6110 (0.0006) [2023-03-06 14:35:41,959][04272] Updated weights for policy 0, policy_version 6120 (0.0007) [2023-03-06 14:35:42,771][04272] Updated weights for policy 0, policy_version 6130 (0.0006) [2023-03-06 14:35:43,610][04272] Updated weights for policy 0, policy_version 6140 (0.0006) [2023-03-06 14:35:43,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12561.1, 300 sec: 12537.9). Total num frames: 6291456. Throughput: 0: 12569.8. Samples: 6270943. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-06 14:35:43,952][03942] Avg episode reward: [(0, '602.126')] [2023-03-06 14:35:44,419][04272] Updated weights for policy 0, policy_version 6150 (0.0006) [2023-03-06 14:35:45,236][04272] Updated weights for policy 0, policy_version 6160 (0.0006) [2023-03-06 14:35:46,041][04272] Updated weights for policy 0, policy_version 6170 (0.0006) [2023-03-06 14:35:46,860][04272] Updated weights for policy 0, policy_version 6180 (0.0006) [2023-03-06 14:35:47,676][04272] Updated weights for policy 0, policy_version 6190 (0.0006) [2023-03-06 14:35:48,498][04272] Updated weights for policy 0, policy_version 6200 (0.0006) [2023-03-06 14:35:48,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12561.1, 300 sec: 12534.5). Total num frames: 6353920. Throughput: 0: 12566.4. Samples: 6346178. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:35:48,941][03942] Avg episode reward: [(0, '654.622')] [2023-03-06 14:35:49,313][04272] Updated weights for policy 0, policy_version 6210 (0.0006) [2023-03-06 14:35:50,122][04272] Updated weights for policy 0, policy_version 6220 (0.0006) [2023-03-06 14:35:50,951][04272] Updated weights for policy 0, policy_version 6230 (0.0007) [2023-03-06 14:35:51,778][04272] Updated weights for policy 0, policy_version 6240 (0.0007) [2023-03-06 14:35:52,599][04272] Updated weights for policy 0, policy_version 6250 (0.0007) [2023-03-06 14:35:53,402][04272] Updated weights for policy 0, policy_version 6260 (0.0006) [2023-03-06 14:35:53,941][03942] Fps is (10 sec: 12492.7, 60 sec: 12561.1, 300 sec: 12534.5). Total num frames: 6416384. Throughput: 0: 12561.9. Samples: 6383673. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:35:53,941][03942] Avg episode reward: [(0, '677.637')] [2023-03-06 14:35:54,236][04272] Updated weights for policy 0, policy_version 6270 (0.0006) [2023-03-06 14:35:55,058][04272] Updated weights for policy 0, policy_version 6280 (0.0007) [2023-03-06 14:35:55,868][04272] Updated weights for policy 0, policy_version 6290 (0.0006) [2023-03-06 14:35:56,692][04272] Updated weights for policy 0, policy_version 6300 (0.0006) [2023-03-06 14:35:57,493][04272] Updated weights for policy 0, policy_version 6310 (0.0007) [2023-03-06 14:35:58,292][04272] Updated weights for policy 0, policy_version 6320 (0.0006) [2023-03-06 14:35:58,941][03942] Fps is (10 sec: 12492.7, 60 sec: 12561.1, 300 sec: 12531.0). Total num frames: 6478848. Throughput: 0: 12542.5. Samples: 6458759. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 14:35:58,952][03942] Avg episode reward: [(0, '626.182')] [2023-03-06 14:35:59,125][04272] Updated weights for policy 0, policy_version 6330 (0.0006) [2023-03-06 14:35:59,936][04272] Updated weights for policy 0, policy_version 6340 (0.0006) [2023-03-06 14:36:00,757][04272] Updated weights for policy 0, policy_version 6350 (0.0006) [2023-03-06 14:36:01,585][04272] Updated weights for policy 0, policy_version 6360 (0.0006) [2023-03-06 14:36:02,395][04272] Updated weights for policy 0, policy_version 6370 (0.0006) [2023-03-06 14:36:03,200][04272] Updated weights for policy 0, policy_version 6380 (0.0006) [2023-03-06 14:36:03,940][03942] Fps is (10 sec: 12492.9, 60 sec: 12544.0, 300 sec: 12531.0). Total num frames: 6541312. Throughput: 0: 12539.1. Samples: 6534055. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:36:03,951][03942] Avg episode reward: [(0, '657.523')] [2023-03-06 14:36:04,033][04272] Updated weights for policy 0, policy_version 6390 (0.0007) [2023-03-06 14:36:04,831][04272] Updated weights for policy 0, policy_version 6400 (0.0006) [2023-03-06 14:36:05,633][04272] Updated weights for policy 0, policy_version 6410 (0.0006) [2023-03-06 14:36:06,454][04272] Updated weights for policy 0, policy_version 6420 (0.0006) [2023-03-06 14:36:07,264][04272] Updated weights for policy 0, policy_version 6430 (0.0006) [2023-03-06 14:36:08,076][04272] Updated weights for policy 0, policy_version 6440 (0.0007) [2023-03-06 14:36:08,912][04272] Updated weights for policy 0, policy_version 6450 (0.0007) [2023-03-06 14:36:08,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12544.0, 300 sec: 12537.9). Total num frames: 6604800. Throughput: 0: 12541.4. Samples: 6571895. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:36:08,941][03942] Avg episode reward: [(0, '683.064')] [2023-03-06 14:36:09,713][04272] Updated weights for policy 0, policy_version 6460 (0.0006) [2023-03-06 14:36:10,526][04272] Updated weights for policy 0, policy_version 6470 (0.0007) [2023-03-06 14:36:11,363][04272] Updated weights for policy 0, policy_version 6480 (0.0006) [2023-03-06 14:36:12,181][04272] Updated weights for policy 0, policy_version 6490 (0.0006) [2023-03-06 14:36:12,999][04272] Updated weights for policy 0, policy_version 6500 (0.0007) [2023-03-06 14:36:13,815][04272] Updated weights for policy 0, policy_version 6510 (0.0006) [2023-03-06 14:36:13,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12544.0, 300 sec: 12544.9). Total num frames: 6667264. Throughput: 0: 12536.0. Samples: 6646823. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:36:13,941][03942] Avg episode reward: [(0, '644.334')] [2023-03-06 14:36:14,620][04272] Updated weights for policy 0, policy_version 6520 (0.0006) [2023-03-06 14:36:15,439][04272] Updated weights for policy 0, policy_version 6530 (0.0006) [2023-03-06 14:36:16,253][04272] Updated weights for policy 0, policy_version 6540 (0.0007) [2023-03-06 14:36:17,081][04272] Updated weights for policy 0, policy_version 6550 (0.0007) [2023-03-06 14:36:17,895][04272] Updated weights for policy 0, policy_version 6560 (0.0006) [2023-03-06 14:36:18,716][04272] Updated weights for policy 0, policy_version 6570 (0.0006) [2023-03-06 14:36:18,941][03942] Fps is (10 sec: 12492.7, 60 sec: 12526.9, 300 sec: 12541.4). Total num frames: 6729728. Throughput: 0: 12539.4. Samples: 6722227. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:36:18,941][03942] Avg episode reward: [(0, '683.297')] [2023-03-06 14:36:19,530][04272] Updated weights for policy 0, policy_version 6580 (0.0007) [2023-03-06 14:36:20,366][04272] Updated weights for policy 0, policy_version 6590 (0.0007) [2023-03-06 14:36:21,182][04272] Updated weights for policy 0, policy_version 6600 (0.0007) [2023-03-06 14:36:22,015][04272] Updated weights for policy 0, policy_version 6610 (0.0007) [2023-03-06 14:36:22,820][04272] Updated weights for policy 0, policy_version 6620 (0.0006) [2023-03-06 14:36:23,616][04272] Updated weights for policy 0, policy_version 6630 (0.0006) [2023-03-06 14:36:23,941][03942] Fps is (10 sec: 12492.8, 60 sec: 12527.0, 300 sec: 12541.4). Total num frames: 6792192. Throughput: 0: 12535.7. Samples: 6759666. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:36:23,941][03942] Avg episode reward: [(0, '626.863')] [2023-03-06 14:36:24,461][04272] Updated weights for policy 0, policy_version 6640 (0.0007) [2023-03-06 14:36:25,265][04272] Updated weights for policy 0, policy_version 6650 (0.0007) [2023-03-06 14:36:26,090][04272] Updated weights for policy 0, policy_version 6660 (0.0008) [2023-03-06 14:36:26,905][04272] Updated weights for policy 0, policy_version 6670 (0.0005) [2023-03-06 14:36:27,719][04272] Updated weights for policy 0, policy_version 6680 (0.0006) [2023-03-06 14:36:28,550][04272] Updated weights for policy 0, policy_version 6690 (0.0006) [2023-03-06 14:36:28,940][03942] Fps is (10 sec: 12492.9, 60 sec: 12526.9, 300 sec: 12537.9). Total num frames: 6854656. Throughput: 0: 12529.8. Samples: 6834784. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:36:28,941][03942] Avg episode reward: [(0, '649.002')] [2023-03-06 14:36:29,383][04272] Updated weights for policy 0, policy_version 6700 (0.0006) [2023-03-06 14:36:30,184][04272] Updated weights for policy 0, policy_version 6710 (0.0006) [2023-03-06 14:36:30,999][04272] Updated weights for policy 0, policy_version 6720 (0.0006) [2023-03-06 14:36:31,811][04272] Updated weights for policy 0, policy_version 6730 (0.0006) [2023-03-06 14:36:32,642][04272] Updated weights for policy 0, policy_version 6740 (0.0007) [2023-03-06 14:36:33,446][04272] Updated weights for policy 0, policy_version 6750 (0.0007) [2023-03-06 14:36:33,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12544.0, 300 sec: 12541.4). Total num frames: 6918144. Throughput: 0: 12529.0. Samples: 6909981. Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) [2023-03-06 14:36:33,941][03942] Avg episode reward: [(0, '674.363')] [2023-03-06 14:36:34,265][04272] Updated weights for policy 0, policy_version 6760 (0.0006) [2023-03-06 14:36:35,094][04272] Updated weights for policy 0, policy_version 6770 (0.0006) [2023-03-06 14:36:35,911][04272] Updated weights for policy 0, policy_version 6780 (0.0007) [2023-03-06 14:36:36,737][04272] Updated weights for policy 0, policy_version 6790 (0.0007) [2023-03-06 14:36:37,533][04272] Updated weights for policy 0, policy_version 6800 (0.0007) [2023-03-06 14:36:38,321][04272] Updated weights for policy 0, policy_version 6810 (0.0006) [2023-03-06 14:36:38,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12544.0, 300 sec: 12537.9). Total num frames: 6980608. Throughput: 0: 12527.2. Samples: 6947395. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 14:36:38,941][03942] Avg episode reward: [(0, '701.951')] [2023-03-06 14:36:39,170][04272] Updated weights for policy 0, policy_version 6820 (0.0007) [2023-03-06 14:36:39,974][04272] Updated weights for policy 0, policy_version 6830 (0.0006) [2023-03-06 14:36:40,802][04272] Updated weights for policy 0, policy_version 6840 (0.0006) [2023-03-06 14:36:41,631][04272] Updated weights for policy 0, policy_version 6850 (0.0007) [2023-03-06 14:36:42,466][04272] Updated weights for policy 0, policy_version 6860 (0.0007) [2023-03-06 14:36:43,271][04272] Updated weights for policy 0, policy_version 6870 (0.0006) [2023-03-06 14:36:43,940][03942] Fps is (10 sec: 12492.8, 60 sec: 12526.9, 300 sec: 12537.9). Total num frames: 7043072. Throughput: 0: 12529.4. Samples: 7022580. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:36:43,941][03942] Avg episode reward: [(0, '698.928')] [2023-03-06 14:36:44,076][04272] Updated weights for policy 0, policy_version 6880 (0.0007) [2023-03-06 14:36:44,889][04272] Updated weights for policy 0, policy_version 6890 (0.0007) [2023-03-06 14:36:45,690][04272] Updated weights for policy 0, policy_version 6900 (0.0007) [2023-03-06 14:36:46,512][04272] Updated weights for policy 0, policy_version 6910 (0.0006) [2023-03-06 14:36:47,337][04272] Updated weights for policy 0, policy_version 6920 (0.0007) [2023-03-06 14:36:48,136][04272] Updated weights for policy 0, policy_version 6930 (0.0006) [2023-03-06 14:36:48,941][03942] Fps is (10 sec: 12492.7, 60 sec: 12526.9, 300 sec: 12534.4). Total num frames: 7105536. Throughput: 0: 12530.1. Samples: 7097911. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:36:48,941][03942] Avg episode reward: [(0, '659.375')] [2023-03-06 14:36:48,957][04272] Updated weights for policy 0, policy_version 6940 (0.0006) [2023-03-06 14:36:49,795][04272] Updated weights for policy 0, policy_version 6950 (0.0007) [2023-03-06 14:36:50,622][04272] Updated weights for policy 0, policy_version 6960 (0.0007) [2023-03-06 14:36:51,449][04272] Updated weights for policy 0, policy_version 6970 (0.0006) [2023-03-06 14:36:52,248][04272] Updated weights for policy 0, policy_version 6980 (0.0007) [2023-03-06 14:36:53,053][04272] Updated weights for policy 0, policy_version 6990 (0.0006) [2023-03-06 14:36:53,905][04272] Updated weights for policy 0, policy_version 7000 (0.0006) [2023-03-06 14:36:53,940][03942] Fps is (10 sec: 12492.8, 60 sec: 12526.9, 300 sec: 12534.5). Total num frames: 7168000. Throughput: 0: 12521.1. Samples: 7135345. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:36:53,941][03942] Avg episode reward: [(0, '687.303')] [2023-03-06 14:36:54,705][04272] Updated weights for policy 0, policy_version 7010 (0.0007) [2023-03-06 14:36:55,507][04272] Updated weights for policy 0, policy_version 7020 (0.0006) [2023-03-06 14:36:56,346][04272] Updated weights for policy 0, policy_version 7030 (0.0007) [2023-03-06 14:36:57,169][04272] Updated weights for policy 0, policy_version 7040 (0.0006) [2023-03-06 14:36:57,985][04272] Updated weights for policy 0, policy_version 7050 (0.0006) [2023-03-06 14:36:58,817][04272] Updated weights for policy 0, policy_version 7060 (0.0006) [2023-03-06 14:36:58,941][03942] Fps is (10 sec: 12492.8, 60 sec: 12526.9, 300 sec: 12531.0). Total num frames: 7230464. Throughput: 0: 12521.9. Samples: 7210311. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:36:58,941][03942] Avg episode reward: [(0, '656.958')] [2023-03-06 14:36:59,623][04272] Updated weights for policy 0, policy_version 7070 (0.0006) [2023-03-06 14:37:00,457][04272] Updated weights for policy 0, policy_version 7080 (0.0006) [2023-03-06 14:37:01,269][04272] Updated weights for policy 0, policy_version 7090 (0.0007) [2023-03-06 14:37:02,097][04272] Updated weights for policy 0, policy_version 7100 (0.0007) [2023-03-06 14:37:02,909][04272] Updated weights for policy 0, policy_version 7110 (0.0007) [2023-03-06 14:37:03,753][04272] Updated weights for policy 0, policy_version 7120 (0.0006) [2023-03-06 14:37:03,941][03942] Fps is (10 sec: 12492.7, 60 sec: 12526.9, 300 sec: 12531.0). Total num frames: 7292928. Throughput: 0: 12511.1. Samples: 7285228. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:37:03,941][03942] Avg episode reward: [(0, '682.921')] [2023-03-06 14:37:04,562][04272] Updated weights for policy 0, policy_version 7130 (0.0006) [2023-03-06 14:37:05,381][04272] Updated weights for policy 0, policy_version 7140 (0.0007) [2023-03-06 14:37:06,204][04272] Updated weights for policy 0, policy_version 7150 (0.0007) [2023-03-06 14:37:07,020][04272] Updated weights for policy 0, policy_version 7160 (0.0007) [2023-03-06 14:37:07,842][04272] Updated weights for policy 0, policy_version 7170 (0.0007) [2023-03-06 14:37:08,665][04272] Updated weights for policy 0, policy_version 7180 (0.0007) [2023-03-06 14:37:08,941][03942] Fps is (10 sec: 12492.8, 60 sec: 12509.8, 300 sec: 12531.0). Total num frames: 7355392. Throughput: 0: 12510.5. Samples: 7322641. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:37:08,941][03942] Avg episode reward: [(0, '623.592')] [2023-03-06 14:37:08,945][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000007183_7355392.pth... 
[2023-03-06 14:37:08,974][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000004248_4349952.pth [2023-03-06 14:37:09,474][04272] Updated weights for policy 0, policy_version 7190 (0.0006) [2023-03-06 14:37:10,297][04272] Updated weights for policy 0, policy_version 7200 (0.0008) [2023-03-06 14:37:11,122][04272] Updated weights for policy 0, policy_version 7210 (0.0006) [2023-03-06 14:37:11,946][04272] Updated weights for policy 0, policy_version 7220 (0.0006) [2023-03-06 14:37:12,771][04272] Updated weights for policy 0, policy_version 7230 (0.0006) [2023-03-06 14:37:13,585][04272] Updated weights for policy 0, policy_version 7240 (0.0006) [2023-03-06 14:37:13,940][03942] Fps is (10 sec: 12492.9, 60 sec: 12509.9, 300 sec: 12531.0). Total num frames: 7417856. Throughput: 0: 12503.3. Samples: 7397434. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:37:13,941][03942] Avg episode reward: [(0, '599.948')] [2023-03-06 14:37:14,401][04272] Updated weights for policy 0, policy_version 7250 (0.0006) [2023-03-06 14:37:15,228][04272] Updated weights for policy 0, policy_version 7260 (0.0006) [2023-03-06 14:37:16,053][04272] Updated weights for policy 0, policy_version 7270 (0.0007) [2023-03-06 14:37:16,850][04272] Updated weights for policy 0, policy_version 7280 (0.0006) [2023-03-06 14:37:17,658][04272] Updated weights for policy 0, policy_version 7290 (0.0006) [2023-03-06 14:37:18,498][04272] Updated weights for policy 0, policy_version 7300 (0.0006) [2023-03-06 14:37:18,941][03942] Fps is (10 sec: 12492.5, 60 sec: 12509.8, 300 sec: 12527.5). Total num frames: 7480320. Throughput: 0: 12494.8. Samples: 7472250. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:37:18,941][03942] Avg episode reward: [(0, '619.423')] [2023-03-06 14:37:19,345][04272] Updated weights for policy 0, policy_version 7310 (0.0006) [2023-03-06 14:37:20,146][04272] Updated weights for policy 0, policy_version 7320 (0.0006) [2023-03-06 14:37:20,983][04272] Updated weights for policy 0, policy_version 7330 (0.0006) [2023-03-06 14:37:21,816][04272] Updated weights for policy 0, policy_version 7340 (0.0007) [2023-03-06 14:37:22,644][04272] Updated weights for policy 0, policy_version 7350 (0.0006) [2023-03-06 14:37:23,457][04272] Updated weights for policy 0, policy_version 7360 (0.0006) [2023-03-06 14:37:23,941][03942] Fps is (10 sec: 12390.3, 60 sec: 12492.8, 300 sec: 12524.0). Total num frames: 7541760. Throughput: 0: 12493.7. Samples: 7509612. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:37:23,941][03942] Avg episode reward: [(0, '640.446')] [2023-03-06 14:37:24,273][04272] Updated weights for policy 0, policy_version 7370 (0.0006) [2023-03-06 14:37:25,086][04272] Updated weights for policy 0, policy_version 7380 (0.0006) [2023-03-06 14:37:25,909][04272] Updated weights for policy 0, policy_version 7390 (0.0006) [2023-03-06 14:37:26,729][04272] Updated weights for policy 0, policy_version 7400 (0.0006) [2023-03-06 14:37:27,550][04272] Updated weights for policy 0, policy_version 7410 (0.0007) [2023-03-06 14:37:28,359][04272] Updated weights for policy 0, policy_version 7420 (0.0006) [2023-03-06 14:37:28,941][03942] Fps is (10 sec: 12493.2, 60 sec: 12509.9, 300 sec: 12524.0). Total num frames: 7605248. Throughput: 0: 12487.8. Samples: 7584532. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:37:28,941][03942] Avg episode reward: [(0, '610.925')] [2023-03-06 14:37:29,177][04272] Updated weights for policy 0, policy_version 7430 (0.0006) [2023-03-06 14:37:30,004][04272] Updated weights for policy 0, policy_version 7440 (0.0006) [2023-03-06 14:37:30,814][04272] Updated weights for policy 0, policy_version 7450 (0.0006) [2023-03-06 14:37:31,632][04272] Updated weights for policy 0, policy_version 7460 (0.0006) [2023-03-06 14:37:32,467][04272] Updated weights for policy 0, policy_version 7470 (0.0006) [2023-03-06 14:37:33,276][04272] Updated weights for policy 0, policy_version 7480 (0.0006) [2023-03-06 14:37:33,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12492.8, 300 sec: 12524.0). Total num frames: 7667712. Throughput: 0: 12480.5. Samples: 7659530. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 14:37:33,941][03942] Avg episode reward: [(0, '693.785')] [2023-03-06 14:37:34,096][04272] Updated weights for policy 0, policy_version 7490 (0.0006) [2023-03-06 14:37:34,920][04272] Updated weights for policy 0, policy_version 7500 (0.0006) [2023-03-06 14:37:35,723][04272] Updated weights for policy 0, policy_version 7510 (0.0006) [2023-03-06 14:37:36,546][04272] Updated weights for policy 0, policy_version 7520 (0.0006) [2023-03-06 14:37:37,366][04272] Updated weights for policy 0, policy_version 7530 (0.0006) [2023-03-06 14:37:38,194][04272] Updated weights for policy 0, policy_version 7540 (0.0006) [2023-03-06 14:37:38,941][03942] Fps is (10 sec: 12492.8, 60 sec: 12492.8, 300 sec: 12520.6). Total num frames: 7730176. Throughput: 0: 12482.7. Samples: 7697068. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 14:37:38,941][03942] Avg episode reward: [(0, '675.857')] [2023-03-06 14:37:39,021][04272] Updated weights for policy 0, policy_version 7550 (0.0006) [2023-03-06 14:37:39,829][04272] Updated weights for policy 0, policy_version 7560 (0.0006) [2023-03-06 14:37:40,653][04272] Updated weights for policy 0, policy_version 7570 (0.0007) [2023-03-06 14:37:41,487][04272] Updated weights for policy 0, policy_version 7580 (0.0006) [2023-03-06 14:37:42,299][04272] Updated weights for policy 0, policy_version 7590 (0.0007) [2023-03-06 14:37:43,128][04272] Updated weights for policy 0, policy_version 7600 (0.0006) [2023-03-06 14:37:43,941][03942] Fps is (10 sec: 12389.5, 60 sec: 12475.6, 300 sec: 12517.1). Total num frames: 7791616. Throughput: 0: 12482.1. Samples: 7772013. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:37:43,942][03942] Avg episode reward: [(0, '684.035')] [2023-03-06 14:37:43,954][04272] Updated weights for policy 0, policy_version 7610 (0.0007) [2023-03-06 14:37:44,786][04272] Updated weights for policy 0, policy_version 7620 (0.0006) [2023-03-06 14:37:45,602][04272] Updated weights for policy 0, policy_version 7630 (0.0006) [2023-03-06 14:37:46,416][04272] Updated weights for policy 0, policy_version 7640 (0.0006) [2023-03-06 14:37:47,237][04272] Updated weights for policy 0, policy_version 7650 (0.0006) [2023-03-06 14:37:48,055][04272] Updated weights for policy 0, policy_version 7660 (0.0007) [2023-03-06 14:37:48,867][04272] Updated weights for policy 0, policy_version 7670 (0.0007) [2023-03-06 14:37:48,941][03942] Fps is (10 sec: 12390.4, 60 sec: 12475.8, 300 sec: 12517.1). Total num frames: 7854080. Throughput: 0: 12479.3. Samples: 7846794. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:37:48,941][03942] Avg episode reward: [(0, '670.970')] [2023-03-06 14:37:49,692][04272] Updated weights for policy 0, policy_version 7680 (0.0007) [2023-03-06 14:37:50,519][04272] Updated weights for policy 0, policy_version 7690 (0.0006) [2023-03-06 14:37:51,327][04272] Updated weights for policy 0, policy_version 7700 (0.0007) [2023-03-06 14:37:52,139][04272] Updated weights for policy 0, policy_version 7710 (0.0006) [2023-03-06 14:37:52,941][04272] Updated weights for policy 0, policy_version 7720 (0.0006) [2023-03-06 14:37:53,766][04272] Updated weights for policy 0, policy_version 7730 (0.0006) [2023-03-06 14:37:53,941][03942] Fps is (10 sec: 12596.1, 60 sec: 12492.8, 300 sec: 12517.1). Total num frames: 7917568. Throughput: 0: 12477.4. Samples: 7884124. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:37:53,941][03942] Avg episode reward: [(0, '677.764')] [2023-03-06 14:37:54,582][04272] Updated weights for policy 0, policy_version 7740 (0.0006) [2023-03-06 14:37:55,400][04272] Updated weights for policy 0, policy_version 7750 (0.0006) [2023-03-06 14:37:56,241][04272] Updated weights for policy 0, policy_version 7760 (0.0007) [2023-03-06 14:37:57,052][04272] Updated weights for policy 0, policy_version 7770 (0.0006) [2023-03-06 14:37:57,867][04272] Updated weights for policy 0, policy_version 7780 (0.0006) [2023-03-06 14:37:58,684][04272] Updated weights for policy 0, policy_version 7790 (0.0007) [2023-03-06 14:37:58,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12492.8, 300 sec: 12520.6). Total num frames: 7980032. Throughput: 0: 12489.2. Samples: 7959446. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:37:58,941][03942] Avg episode reward: [(0, '712.231')] [2023-03-06 14:37:59,498][04272] Updated weights for policy 0, policy_version 7800 (0.0007) [2023-03-06 14:38:00,323][04272] Updated weights for policy 0, policy_version 7810 (0.0006) [2023-03-06 14:38:01,133][04272] Updated weights for policy 0, policy_version 7820 (0.0006) [2023-03-06 14:38:01,956][04272] Updated weights for policy 0, policy_version 7830 (0.0006) [2023-03-06 14:38:02,765][04272] Updated weights for policy 0, policy_version 7840 (0.0006) [2023-03-06 14:38:03,588][04272] Updated weights for policy 0, policy_version 7850 (0.0007) [2023-03-06 14:38:03,941][03942] Fps is (10 sec: 12492.8, 60 sec: 12492.8, 300 sec: 12520.6). Total num frames: 8042496. Throughput: 0: 12495.3. Samples: 8034535. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:38:03,941][03942] Avg episode reward: [(0, '723.857')] [2023-03-06 14:38:03,941][04221] Saving new best policy, reward=723.857! [2023-03-06 14:38:04,407][04272] Updated weights for policy 0, policy_version 7860 (0.0006) [2023-03-06 14:38:05,208][04272] Updated weights for policy 0, policy_version 7870 (0.0007) [2023-03-06 14:38:06,033][04272] Updated weights for policy 0, policy_version 7880 (0.0007) [2023-03-06 14:38:06,845][04272] Updated weights for policy 0, policy_version 7890 (0.0007) [2023-03-06 14:38:07,657][04272] Updated weights for policy 0, policy_version 7900 (0.0006) [2023-03-06 14:38:08,481][04272] Updated weights for policy 0, policy_version 7910 (0.0006) [2023-03-06 14:38:08,941][03942] Fps is (10 sec: 12492.7, 60 sec: 12492.8, 300 sec: 12517.1). Total num frames: 8104960. Throughput: 0: 12502.7. Samples: 8072231. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:38:08,941][03942] Avg episode reward: [(0, '695.740')] [2023-03-06 14:38:09,286][04272] Updated weights for policy 0, policy_version 7920 (0.0006) [2023-03-06 14:38:10,102][04272] Updated weights for policy 0, policy_version 7930 (0.0006) [2023-03-06 14:38:10,941][04272] Updated weights for policy 0, policy_version 7940 (0.0007) [2023-03-06 14:38:11,752][04272] Updated weights for policy 0, policy_version 7950 (0.0006) [2023-03-06 14:38:12,560][04272] Updated weights for policy 0, policy_version 7960 (0.0007) [2023-03-06 14:38:13,403][04272] Updated weights for policy 0, policy_version 7970 (0.0006) [2023-03-06 14:38:13,941][03942] Fps is (10 sec: 12492.7, 60 sec: 12492.8, 300 sec: 12517.1). Total num frames: 8167424. Throughput: 0: 12510.5. Samples: 8147503. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:38:13,941][03942] Avg episode reward: [(0, '624.738')] [2023-03-06 14:38:14,213][04272] Updated weights for policy 0, policy_version 7980 (0.0006) [2023-03-06 14:38:15,021][04272] Updated weights for policy 0, policy_version 7990 (0.0006) [2023-03-06 14:38:15,831][04272] Updated weights for policy 0, policy_version 8000 (0.0006) [2023-03-06 14:38:16,657][04272] Updated weights for policy 0, policy_version 8010 (0.0006) [2023-03-06 14:38:17,455][04272] Updated weights for policy 0, policy_version 8020 (0.0007) [2023-03-06 14:38:18,282][04272] Updated weights for policy 0, policy_version 8030 (0.0006) [2023-03-06 14:38:18,941][03942] Fps is (10 sec: 12492.8, 60 sec: 12492.9, 300 sec: 12517.1). Total num frames: 8229888. Throughput: 0: 12514.0. Samples: 8222663. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:38:18,941][03942] Avg episode reward: [(0, '669.835')] [2023-03-06 14:38:19,110][04272] Updated weights for policy 0, policy_version 8040 (0.0007) [2023-03-06 14:38:19,926][04272] Updated weights for policy 0, policy_version 8050 (0.0006) [2023-03-06 14:38:20,751][04272] Updated weights for policy 0, policy_version 8060 (0.0007) [2023-03-06 14:38:21,558][04272] Updated weights for policy 0, policy_version 8070 (0.0006) [2023-03-06 14:38:22,388][04272] Updated weights for policy 0, policy_version 8080 (0.0006) [2023-03-06 14:38:23,210][04272] Updated weights for policy 0, policy_version 8090 (0.0007) [2023-03-06 14:38:23,941][03942] Fps is (10 sec: 12492.9, 60 sec: 12509.9, 300 sec: 12517.1). Total num frames: 8292352. Throughput: 0: 12510.4. Samples: 8260035. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:38:23,941][03942] Avg episode reward: [(0, '711.633')] [2023-03-06 14:38:24,040][04272] Updated weights for policy 0, policy_version 8100 (0.0006) [2023-03-06 14:38:24,854][04272] Updated weights for policy 0, policy_version 8110 (0.0006) [2023-03-06 14:38:25,678][04272] Updated weights for policy 0, policy_version 8120 (0.0007) [2023-03-06 14:38:26,497][04272] Updated weights for policy 0, policy_version 8130 (0.0006) [2023-03-06 14:38:27,336][04272] Updated weights for policy 0, policy_version 8140 (0.0007) [2023-03-06 14:38:28,145][04272] Updated weights for policy 0, policy_version 8150 (0.0005) [2023-03-06 14:38:28,941][03942] Fps is (10 sec: 12492.8, 60 sec: 12492.8, 300 sec: 12517.1). Total num frames: 8354816. Throughput: 0: 12508.3. Samples: 8334878. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:38:28,941][03942] Avg episode reward: [(0, '677.832')] [2023-03-06 14:38:28,979][04272] Updated weights for policy 0, policy_version 8160 (0.0006) [2023-03-06 14:38:29,798][04272] Updated weights for policy 0, policy_version 8170 (0.0007) [2023-03-06 14:38:30,594][04272] Updated weights for policy 0, policy_version 8180 (0.0006) [2023-03-06 14:38:31,422][04272] Updated weights for policy 0, policy_version 8190 (0.0006) [2023-03-06 14:38:32,237][04272] Updated weights for policy 0, policy_version 8200 (0.0007) [2023-03-06 14:38:33,035][04272] Updated weights for policy 0, policy_version 8210 (0.0007) [2023-03-06 14:38:33,876][04272] Updated weights for policy 0, policy_version 8220 (0.0006) [2023-03-06 14:38:33,940][03942] Fps is (10 sec: 12492.8, 60 sec: 12492.8, 300 sec: 12517.1). Total num frames: 8417280. Throughput: 0: 12515.5. Samples: 8409993. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:38:33,941][03942] Avg episode reward: [(0, '639.504')] [2023-03-06 14:38:34,689][04272] Updated weights for policy 0, policy_version 8230 (0.0006) [2023-03-06 14:38:35,494][04272] Updated weights for policy 0, policy_version 8240 (0.0007) [2023-03-06 14:38:36,326][04272] Updated weights for policy 0, policy_version 8250 (0.0007) [2023-03-06 14:38:37,129][04272] Updated weights for policy 0, policy_version 8260 (0.0007) [2023-03-06 14:38:37,946][04272] Updated weights for policy 0, policy_version 8270 (0.0007) [2023-03-06 14:38:38,784][04272] Updated weights for policy 0, policy_version 8280 (0.0007) [2023-03-06 14:38:38,940][03942] Fps is (10 sec: 12595.4, 60 sec: 12509.9, 300 sec: 12517.1). Total num frames: 8480768. Throughput: 0: 12518.8. Samples: 8447470. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:38:38,941][03942] Avg episode reward: [(0, '731.469')] [2023-03-06 14:38:38,945][04221] Saving new best policy, reward=731.469! 
[2023-03-06 14:38:39,583][04272] Updated weights for policy 0, policy_version 8290 (0.0006) [2023-03-06 14:38:40,397][04272] Updated weights for policy 0, policy_version 8300 (0.0007) [2023-03-06 14:38:41,236][04272] Updated weights for policy 0, policy_version 8310 (0.0007) [2023-03-06 14:38:42,074][04272] Updated weights for policy 0, policy_version 8320 (0.0007) [2023-03-06 14:38:42,863][04272] Updated weights for policy 0, policy_version 8330 (0.0006) [2023-03-06 14:38:43,689][04272] Updated weights for policy 0, policy_version 8340 (0.0006) [2023-03-06 14:38:43,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12527.1, 300 sec: 12517.1). Total num frames: 8543232. Throughput: 0: 12511.4. Samples: 8522460. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:38:43,941][03942] Avg episode reward: [(0, '714.982')] [2023-03-06 14:38:44,512][04272] Updated weights for policy 0, policy_version 8350 (0.0007) [2023-03-06 14:38:45,311][04272] Updated weights for policy 0, policy_version 8360 (0.0007) [2023-03-06 14:38:46,140][04272] Updated weights for policy 0, policy_version 8370 (0.0006) [2023-03-06 14:38:46,966][04272] Updated weights for policy 0, policy_version 8380 (0.0006) [2023-03-06 14:38:47,764][04272] Updated weights for policy 0, policy_version 8390 (0.0006) [2023-03-06 14:38:48,593][04272] Updated weights for policy 0, policy_version 8400 (0.0006) [2023-03-06 14:38:48,941][03942] Fps is (10 sec: 12492.6, 60 sec: 12526.9, 300 sec: 12517.1). Total num frames: 8605696. Throughput: 0: 12514.6. Samples: 8597695. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:38:48,941][03942] Avg episode reward: [(0, '698.213')] [2023-03-06 14:38:49,411][04272] Updated weights for policy 0, policy_version 8410 (0.0006) [2023-03-06 14:38:50,206][04272] Updated weights for policy 0, policy_version 8420 (0.0006) [2023-03-06 14:38:51,056][04272] Updated weights for policy 0, policy_version 8430 (0.0007) [2023-03-06 14:38:51,870][04272] Updated weights for policy 0, policy_version 8440 (0.0006) [2023-03-06 14:38:52,677][04272] Updated weights for policy 0, policy_version 8450 (0.0006) [2023-03-06 14:38:53,504][04272] Updated weights for policy 0, policy_version 8460 (0.0006) [2023-03-06 14:38:53,941][03942] Fps is (10 sec: 12492.7, 60 sec: 12509.9, 300 sec: 12520.6). Total num frames: 8668160. Throughput: 0: 12511.8. Samples: 8635261. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:38:53,941][03942] Avg episode reward: [(0, '667.991')] [2023-03-06 14:38:54,314][04272] Updated weights for policy 0, policy_version 8470 (0.0007) [2023-03-06 14:38:55,137][04272] Updated weights for policy 0, policy_version 8480 (0.0007) [2023-03-06 14:38:55,943][04272] Updated weights for policy 0, policy_version 8490 (0.0007) [2023-03-06 14:38:56,752][04272] Updated weights for policy 0, policy_version 8500 (0.0006) [2023-03-06 14:38:57,577][04272] Updated weights for policy 0, policy_version 8510 (0.0006) [2023-03-06 14:38:58,374][04272] Updated weights for policy 0, policy_version 8520 (0.0006) [2023-03-06 14:38:58,940][03942] Fps is (10 sec: 12493.0, 60 sec: 12509.9, 300 sec: 12517.1). Total num frames: 8730624. Throughput: 0: 12512.9. Samples: 8710580. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:38:58,941][03942] Avg episode reward: [(0, '621.755')] [2023-03-06 14:38:59,186][04272] Updated weights for policy 0, policy_version 8530 (0.0006) [2023-03-06 14:39:00,000][04272] Updated weights for policy 0, policy_version 8540 (0.0006) [2023-03-06 14:39:00,823][04272] Updated weights for policy 0, policy_version 8550 (0.0006) [2023-03-06 14:39:01,639][04272] Updated weights for policy 0, policy_version 8560 (0.0006) [2023-03-06 14:39:02,452][04272] Updated weights for policy 0, policy_version 8570 (0.0007) [2023-03-06 14:39:03,246][04272] Updated weights for policy 0, policy_version 8580 (0.0007) [2023-03-06 14:39:03,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12526.9, 300 sec: 12520.6). Total num frames: 8794112. Throughput: 0: 12520.4. Samples: 8786082. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:39:03,941][03942] Avg episode reward: [(0, '509.120')] [2023-03-06 14:39:04,067][04272] Updated weights for policy 0, policy_version 8590 (0.0006) [2023-03-06 14:39:04,869][04272] Updated weights for policy 0, policy_version 8600 (0.0006) [2023-03-06 14:39:05,674][04272] Updated weights for policy 0, policy_version 8610 (0.0007) [2023-03-06 14:39:06,486][04272] Updated weights for policy 0, policy_version 8620 (0.0007) [2023-03-06 14:39:07,303][04272] Updated weights for policy 0, policy_version 8630 (0.0007) [2023-03-06 14:39:08,113][04272] Updated weights for policy 0, policy_version 8640 (0.0006) [2023-03-06 14:39:08,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12526.9, 300 sec: 12524.0). Total num frames: 8856576. Throughput: 0: 12535.5. Samples: 8824132. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:39:08,941][03942] Avg episode reward: [(0, '549.580')] [2023-03-06 14:39:08,944][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000008650_8857600.pth... 
[2023-03-06 14:39:08,946][04272] Updated weights for policy 0, policy_version 8650 (0.0007) [2023-03-06 14:39:08,975][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000005715_5852160.pth [2023-03-06 14:39:09,747][04272] Updated weights for policy 0, policy_version 8660 (0.0006) [2023-03-06 14:39:10,554][04272] Updated weights for policy 0, policy_version 8670 (0.0006) [2023-03-06 14:39:11,366][04272] Updated weights for policy 0, policy_version 8680 (0.0007) [2023-03-06 14:39:12,186][04272] Updated weights for policy 0, policy_version 8690 (0.0007) [2023-03-06 14:39:12,998][04272] Updated weights for policy 0, policy_version 8700 (0.0006) [2023-03-06 14:39:13,824][04272] Updated weights for policy 0, policy_version 8710 (0.0007) [2023-03-06 14:39:13,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12544.0, 300 sec: 12527.5). Total num frames: 8920064. Throughput: 0: 12552.1. Samples: 8899721. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:39:13,941][03942] Avg episode reward: [(0, '598.551')] [2023-03-06 14:39:14,637][04272] Updated weights for policy 0, policy_version 8720 (0.0006) [2023-03-06 14:39:15,460][04272] Updated weights for policy 0, policy_version 8730 (0.0007) [2023-03-06 14:39:16,279][04272] Updated weights for policy 0, policy_version 8740 (0.0006) [2023-03-06 14:39:17,089][04272] Updated weights for policy 0, policy_version 8750 (0.0007) [2023-03-06 14:39:17,899][04272] Updated weights for policy 0, policy_version 8760 (0.0007) [2023-03-06 14:39:18,706][04272] Updated weights for policy 0, policy_version 8770 (0.0006) [2023-03-06 14:39:18,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12544.0, 300 sec: 12527.5). Total num frames: 8982528. Throughput: 0: 12559.5. Samples: 8975172. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:39:18,941][03942] Avg episode reward: [(0, '499.253')] [2023-03-06 14:39:19,520][04272] Updated weights for policy 0, policy_version 8780 (0.0006) [2023-03-06 14:39:20,338][04272] Updated weights for policy 0, policy_version 8790 (0.0006) [2023-03-06 14:39:21,136][04272] Updated weights for policy 0, policy_version 8800 (0.0006) [2023-03-06 14:39:21,957][04272] Updated weights for policy 0, policy_version 8810 (0.0006) [2023-03-06 14:39:22,765][04272] Updated weights for policy 0, policy_version 8820 (0.0006) [2023-03-06 14:39:23,577][04272] Updated weights for policy 0, policy_version 8830 (0.0007) [2023-03-06 14:39:23,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12561.1, 300 sec: 12527.5). Total num frames: 9046016. Throughput: 0: 12563.5. Samples: 9012830. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:39:23,941][03942] Avg episode reward: [(0, '512.816')] [2023-03-06 14:39:24,395][04272] Updated weights for policy 0, policy_version 8840 (0.0007) [2023-03-06 14:39:25,228][04272] Updated weights for policy 0, policy_version 8850 (0.0006) [2023-03-06 14:39:26,029][04272] Updated weights for policy 0, policy_version 8860 (0.0006) [2023-03-06 14:39:26,862][04272] Updated weights for policy 0, policy_version 8870 (0.0006) [2023-03-06 14:39:27,658][04272] Updated weights for policy 0, policy_version 8880 (0.0006) [2023-03-06 14:39:28,464][04272] Updated weights for policy 0, policy_version 8890 (0.0007) [2023-03-06 14:39:28,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12561.1, 300 sec: 12527.5). Total num frames: 9108480. Throughput: 0: 12572.3. Samples: 9088215. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:39:28,941][03942] Avg episode reward: [(0, '650.967')] [2023-03-06 14:39:29,298][04272] Updated weights for policy 0, policy_version 8900 (0.0006) [2023-03-06 14:39:30,098][04272] Updated weights for policy 0, policy_version 8910 (0.0006) [2023-03-06 14:39:30,911][04272] Updated weights for policy 0, policy_version 8920 (0.0007) [2023-03-06 14:39:31,737][04272] Updated weights for policy 0, policy_version 8930 (0.0006) [2023-03-06 14:39:32,536][04272] Updated weights for policy 0, policy_version 8940 (0.0006) [2023-03-06 14:39:33,338][04272] Updated weights for policy 0, policy_version 8950 (0.0006) [2023-03-06 14:39:33,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12578.1, 300 sec: 12531.0). Total num frames: 9171968. Throughput: 0: 12587.2. Samples: 9164116. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:39:33,941][03942] Avg episode reward: [(0, '570.336')] [2023-03-06 14:39:34,163][04272] Updated weights for policy 0, policy_version 8960 (0.0006) [2023-03-06 14:39:34,969][04272] Updated weights for policy 0, policy_version 8970 (0.0006) [2023-03-06 14:39:35,789][04272] Updated weights for policy 0, policy_version 8980 (0.0006) [2023-03-06 14:39:36,588][04272] Updated weights for policy 0, policy_version 8990 (0.0006) [2023-03-06 14:39:37,422][04272] Updated weights for policy 0, policy_version 9000 (0.0007) [2023-03-06 14:39:38,228][04272] Updated weights for policy 0, policy_version 9010 (0.0007) [2023-03-06 14:39:38,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12561.0, 300 sec: 12531.0). Total num frames: 9234432. Throughput: 0: 12590.7. Samples: 9201842. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:39:38,941][03942] Avg episode reward: [(0, '630.463')] [2023-03-06 14:39:39,039][04272] Updated weights for policy 0, policy_version 9020 (0.0007) [2023-03-06 14:39:39,850][04272] Updated weights for policy 0, policy_version 9030 (0.0006) [2023-03-06 14:39:40,675][04272] Updated weights for policy 0, policy_version 9040 (0.0006) [2023-03-06 14:39:41,486][04272] Updated weights for policy 0, policy_version 9050 (0.0008) [2023-03-06 14:39:42,294][04272] Updated weights for policy 0, policy_version 9060 (0.0006) [2023-03-06 14:39:43,115][04272] Updated weights for policy 0, policy_version 9070 (0.0006) [2023-03-06 14:39:43,928][04272] Updated weights for policy 0, policy_version 9080 (0.0007) [2023-03-06 14:39:43,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12578.1, 300 sec: 12534.5). Total num frames: 9297920. Throughput: 0: 12591.4. Samples: 9277195. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:39:43,941][03942] Avg episode reward: [(0, '678.230')] [2023-03-06 14:39:44,751][04272] Updated weights for policy 0, policy_version 9090 (0.0006) [2023-03-06 14:39:45,558][04272] Updated weights for policy 0, policy_version 9100 (0.0007) [2023-03-06 14:39:46,375][04272] Updated weights for policy 0, policy_version 9110 (0.0006) [2023-03-06 14:39:47,202][04272] Updated weights for policy 0, policy_version 9120 (0.0006) [2023-03-06 14:39:48,004][04272] Updated weights for policy 0, policy_version 9130 (0.0006) [2023-03-06 14:39:48,821][04272] Updated weights for policy 0, policy_version 9140 (0.0007) [2023-03-06 14:39:48,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12578.1, 300 sec: 12534.5). Total num frames: 9360384. Throughput: 0: 12589.7. Samples: 9352621. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:39:48,941][03942] Avg episode reward: [(0, '647.258')] [2023-03-06 14:39:49,629][04272] Updated weights for policy 0, policy_version 9150 (0.0007) [2023-03-06 14:39:50,447][04272] Updated weights for policy 0, policy_version 9160 (0.0007) [2023-03-06 14:39:51,263][04272] Updated weights for policy 0, policy_version 9170 (0.0006) [2023-03-06 14:39:52,071][04272] Updated weights for policy 0, policy_version 9180 (0.0007) [2023-03-06 14:39:52,905][04272] Updated weights for policy 0, policy_version 9190 (0.0007) [2023-03-06 14:39:53,701][04272] Updated weights for policy 0, policy_version 9200 (0.0006) [2023-03-06 14:39:53,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12537.9). Total num frames: 9423872. Throughput: 0: 12582.0. Samples: 9390322. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:39:53,941][03942] Avg episode reward: [(0, '751.635')] [2023-03-06 14:39:53,941][04221] Saving new best policy, reward=751.635! [2023-03-06 14:39:54,522][04272] Updated weights for policy 0, policy_version 9210 (0.0006) [2023-03-06 14:39:55,355][04272] Updated weights for policy 0, policy_version 9220 (0.0006) [2023-03-06 14:39:56,195][04272] Updated weights for policy 0, policy_version 9230 (0.0006) [2023-03-06 14:39:56,994][04272] Updated weights for policy 0, policy_version 9240 (0.0006) [2023-03-06 14:39:57,822][04272] Updated weights for policy 0, policy_version 9250 (0.0006) [2023-03-06 14:39:58,618][04272] Updated weights for policy 0, policy_version 9260 (0.0006) [2023-03-06 14:39:58,941][03942] Fps is (10 sec: 12492.9, 60 sec: 12578.1, 300 sec: 12531.0). Total num frames: 9485312. Throughput: 0: 12567.8. Samples: 9465271. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:39:58,941][03942] Avg episode reward: [(0, '760.715')] [2023-03-06 14:39:58,956][04221] Saving new best policy, reward=760.715! 
[2023-03-06 14:39:59,444][04272] Updated weights for policy 0, policy_version 9270 (0.0006) [2023-03-06 14:40:00,269][04272] Updated weights for policy 0, policy_version 9280 (0.0006) [2023-03-06 14:40:01,080][04272] Updated weights for policy 0, policy_version 9290 (0.0006) [2023-03-06 14:40:01,898][04272] Updated weights for policy 0, policy_version 9300 (0.0006) [2023-03-06 14:40:02,719][04272] Updated weights for policy 0, policy_version 9310 (0.0006) [2023-03-06 14:40:03,538][04272] Updated weights for policy 0, policy_version 9320 (0.0006) [2023-03-06 14:40:03,940][03942] Fps is (10 sec: 12390.4, 60 sec: 12561.1, 300 sec: 12527.5). Total num frames: 9547776. Throughput: 0: 12561.6. Samples: 9540441. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:40:03,941][03942] Avg episode reward: [(0, '728.482')] [2023-03-06 14:40:04,345][04272] Updated weights for policy 0, policy_version 9330 (0.0007) [2023-03-06 14:40:05,173][04272] Updated weights for policy 0, policy_version 9340 (0.0006) [2023-03-06 14:40:06,000][04272] Updated weights for policy 0, policy_version 9350 (0.0006) [2023-03-06 14:40:06,801][04272] Updated weights for policy 0, policy_version 9360 (0.0006) [2023-03-06 14:40:07,636][04272] Updated weights for policy 0, policy_version 9370 (0.0006) [2023-03-06 14:40:08,466][04272] Updated weights for policy 0, policy_version 9380 (0.0006) [2023-03-06 14:40:08,940][03942] Fps is (10 sec: 12492.9, 60 sec: 12561.1, 300 sec: 12527.5). Total num frames: 9610240. Throughput: 0: 12555.1. Samples: 9577808. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:40:08,941][03942] Avg episode reward: [(0, '754.178')] [2023-03-06 14:40:09,279][04272] Updated weights for policy 0, policy_version 9390 (0.0006) [2023-03-06 14:40:10,120][04272] Updated weights for policy 0, policy_version 9400 (0.0006) [2023-03-06 14:40:10,930][04272] Updated weights for policy 0, policy_version 9410 (0.0006) [2023-03-06 14:40:11,745][04272] Updated weights for policy 0, policy_version 9420 (0.0006) [2023-03-06 14:40:12,562][04272] Updated weights for policy 0, policy_version 9430 (0.0006) [2023-03-06 14:40:13,382][04272] Updated weights for policy 0, policy_version 9440 (0.0007) [2023-03-06 14:40:13,941][03942] Fps is (10 sec: 12492.7, 60 sec: 12544.0, 300 sec: 12524.0). Total num frames: 9672704. Throughput: 0: 12547.0. Samples: 9652830. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:40:13,941][03942] Avg episode reward: [(0, '676.152')] [2023-03-06 14:40:14,183][04272] Updated weights for policy 0, policy_version 9450 (0.0006) [2023-03-06 14:40:15,024][04272] Updated weights for policy 0, policy_version 9460 (0.0008) [2023-03-06 14:40:15,836][04272] Updated weights for policy 0, policy_version 9470 (0.0006) [2023-03-06 14:40:16,667][04272] Updated weights for policy 0, policy_version 9480 (0.0007) [2023-03-06 14:40:17,482][04272] Updated weights for policy 0, policy_version 9490 (0.0006) [2023-03-06 14:40:18,316][04272] Updated weights for policy 0, policy_version 9500 (0.0006) [2023-03-06 14:40:18,941][03942] Fps is (10 sec: 12492.7, 60 sec: 12544.0, 300 sec: 12524.0). Total num frames: 9735168. Throughput: 0: 12518.3. Samples: 9727438. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:40:18,941][03942] Avg episode reward: [(0, '725.953')] [2023-03-06 14:40:19,143][04272] Updated weights for policy 0, policy_version 9510 (0.0006) [2023-03-06 14:40:19,959][04272] Updated weights for policy 0, policy_version 9520 (0.0006) [2023-03-06 14:40:20,775][04272] Updated weights for policy 0, policy_version 9530 (0.0006) [2023-03-06 14:40:21,592][04272] Updated weights for policy 0, policy_version 9540 (0.0006) [2023-03-06 14:40:22,400][04272] Updated weights for policy 0, policy_version 9550 (0.0006) [2023-03-06 14:40:23,203][04272] Updated weights for policy 0, policy_version 9560 (0.0006) [2023-03-06 14:40:23,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12544.0, 300 sec: 12527.5). Total num frames: 9798656. Throughput: 0: 12516.0. Samples: 9765062. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:40:23,941][03942] Avg episode reward: [(0, '700.654')] [2023-03-06 14:40:24,014][04272] Updated weights for policy 0, policy_version 9570 (0.0006) [2023-03-06 14:40:24,830][04272] Updated weights for policy 0, policy_version 9580 (0.0006) [2023-03-06 14:40:25,649][04272] Updated weights for policy 0, policy_version 9590 (0.0007) [2023-03-06 14:40:26,473][04272] Updated weights for policy 0, policy_version 9600 (0.0006) [2023-03-06 14:40:27,298][04272] Updated weights for policy 0, policy_version 9610 (0.0006) [2023-03-06 14:40:28,103][04272] Updated weights for policy 0, policy_version 9620 (0.0006) [2023-03-06 14:40:28,940][03942] Fps is (10 sec: 12492.9, 60 sec: 12526.9, 300 sec: 12524.0). Total num frames: 9860096. Throughput: 0: 12520.4. Samples: 9840613. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:40:28,941][03942] Avg episode reward: [(0, '694.683')] [2023-03-06 14:40:28,942][04272] Updated weights for policy 0, policy_version 9630 (0.0006) [2023-03-06 14:40:29,744][04272] Updated weights for policy 0, policy_version 9640 (0.0007) [2023-03-06 14:40:30,566][04272] Updated weights for policy 0, policy_version 9650 (0.0006) [2023-03-06 14:40:31,376][04272] Updated weights for policy 0, policy_version 9660 (0.0007) [2023-03-06 14:40:32,187][04272] Updated weights for policy 0, policy_version 9670 (0.0006) [2023-03-06 14:40:33,008][04272] Updated weights for policy 0, policy_version 9680 (0.0006) [2023-03-06 14:40:33,829][04272] Updated weights for policy 0, policy_version 9690 (0.0007) [2023-03-06 14:40:33,940][03942] Fps is (10 sec: 12492.9, 60 sec: 12526.9, 300 sec: 12527.5). Total num frames: 9923584. Throughput: 0: 12507.9. Samples: 9915475. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:40:33,941][03942] Avg episode reward: [(0, '735.844')] [2023-03-06 14:40:34,645][04272] Updated weights for policy 0, policy_version 9700 (0.0006) [2023-03-06 14:40:35,462][04272] Updated weights for policy 0, policy_version 9710 (0.0006) [2023-03-06 14:40:36,284][04272] Updated weights for policy 0, policy_version 9720 (0.0007) [2023-03-06 14:40:37,077][04272] Updated weights for policy 0, policy_version 9730 (0.0007) [2023-03-06 14:40:37,896][04272] Updated weights for policy 0, policy_version 9740 (0.0006) [2023-03-06 14:40:38,743][04272] Updated weights for policy 0, policy_version 9750 (0.0006) [2023-03-06 14:40:38,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12526.9, 300 sec: 12524.0). Total num frames: 9986048. Throughput: 0: 12506.5. Samples: 9953114. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:40:38,941][03942] Avg episode reward: [(0, '692.698')] [2023-03-06 14:40:39,565][04272] Updated weights for policy 0, policy_version 9760 (0.0006) [2023-03-06 14:40:40,386][04272] Updated weights for policy 0, policy_version 9770 (0.0006) [2023-03-06 14:40:41,202][04272] Updated weights for policy 0, policy_version 9780 (0.0006) [2023-03-06 14:40:42,015][04272] Updated weights for policy 0, policy_version 9790 (0.0005) [2023-03-06 14:40:42,835][04272] Updated weights for policy 0, policy_version 9800 (0.0006) [2023-03-06 14:40:43,652][04272] Updated weights for policy 0, policy_version 9810 (0.0007) [2023-03-06 14:40:43,941][03942] Fps is (10 sec: 12492.7, 60 sec: 12509.9, 300 sec: 12524.0). Total num frames: 10048512. Throughput: 0: 12509.6. Samples: 10028204. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:40:43,941][03942] Avg episode reward: [(0, '729.382')] [2023-03-06 14:40:44,472][04272] Updated weights for policy 0, policy_version 9820 (0.0007) [2023-03-06 14:40:45,293][04272] Updated weights for policy 0, policy_version 9830 (0.0006) [2023-03-06 14:40:46,115][04272] Updated weights for policy 0, policy_version 9840 (0.0006) [2023-03-06 14:40:46,923][04272] Updated weights for policy 0, policy_version 9850 (0.0007) [2023-03-06 14:40:47,758][04272] Updated weights for policy 0, policy_version 9860 (0.0006) [2023-03-06 14:40:48,574][04272] Updated weights for policy 0, policy_version 9870 (0.0006) [2023-03-06 14:40:48,941][03942] Fps is (10 sec: 12492.9, 60 sec: 12509.9, 300 sec: 12524.0). Total num frames: 10110976. Throughput: 0: 12507.9. Samples: 10103297. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:40:48,941][03942] Avg episode reward: [(0, '761.812')] [2023-03-06 14:40:48,944][04221] Saving new best policy, reward=761.812! 
[2023-03-06 14:40:49,381][04272] Updated weights for policy 0, policy_version 9880 (0.0006) [2023-03-06 14:40:50,215][04272] Updated weights for policy 0, policy_version 9890 (0.0007) [2023-03-06 14:40:51,016][04272] Updated weights for policy 0, policy_version 9900 (0.0006) [2023-03-06 14:40:51,849][04272] Updated weights for policy 0, policy_version 9910 (0.0006) [2023-03-06 14:40:52,664][04272] Updated weights for policy 0, policy_version 9920 (0.0006) [2023-03-06 14:40:53,489][04272] Updated weights for policy 0, policy_version 9930 (0.0006) [2023-03-06 14:40:53,940][03942] Fps is (10 sec: 12492.9, 60 sec: 12492.8, 300 sec: 12524.0). Total num frames: 10173440. Throughput: 0: 12512.8. Samples: 10140885. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:40:53,941][03942] Avg episode reward: [(0, '753.310')] [2023-03-06 14:40:54,321][04272] Updated weights for policy 0, policy_version 9940 (0.0006) [2023-03-06 14:40:55,170][04272] Updated weights for policy 0, policy_version 9950 (0.0007) [2023-03-06 14:40:55,978][04272] Updated weights for policy 0, policy_version 9960 (0.0006) [2023-03-06 14:40:56,809][04272] Updated weights for policy 0, policy_version 9970 (0.0006) [2023-03-06 14:40:57,630][04272] Updated weights for policy 0, policy_version 9980 (0.0007) [2023-03-06 14:40:58,462][04272] Updated weights for policy 0, policy_version 9990 (0.0006) [2023-03-06 14:40:58,940][03942] Fps is (10 sec: 12492.9, 60 sec: 12509.9, 300 sec: 12524.0). Total num frames: 10235904. Throughput: 0: 12497.6. Samples: 10215218. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:40:58,941][03942] Avg episode reward: [(0, '777.580')] [2023-03-06 14:40:58,944][04221] Saving new best policy, reward=777.580! 
[2023-03-06 14:40:59,272][04272] Updated weights for policy 0, policy_version 10000 (0.0007) [2023-03-06 14:41:00,095][04272] Updated weights for policy 0, policy_version 10010 (0.0006) [2023-03-06 14:41:00,923][04272] Updated weights for policy 0, policy_version 10020 (0.0006) [2023-03-06 14:41:01,729][04272] Updated weights for policy 0, policy_version 10030 (0.0006) [2023-03-06 14:41:02,546][04272] Updated weights for policy 0, policy_version 10040 (0.0006) [2023-03-06 14:41:03,364][04272] Updated weights for policy 0, policy_version 10050 (0.0006) [2023-03-06 14:41:03,941][03942] Fps is (10 sec: 12492.7, 60 sec: 12509.9, 300 sec: 12520.6). Total num frames: 10298368. Throughput: 0: 12506.5. Samples: 10290229. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:41:03,941][03942] Avg episode reward: [(0, '753.665')] [2023-03-06 14:41:04,188][04272] Updated weights for policy 0, policy_version 10060 (0.0006) [2023-03-06 14:41:05,019][04272] Updated weights for policy 0, policy_version 10070 (0.0006) [2023-03-06 14:41:05,870][04272] Updated weights for policy 0, policy_version 10080 (0.0006) [2023-03-06 14:41:06,702][04272] Updated weights for policy 0, policy_version 10090 (0.0006) [2023-03-06 14:41:07,542][04272] Updated weights for policy 0, policy_version 10100 (0.0007) [2023-03-06 14:41:08,354][04272] Updated weights for policy 0, policy_version 10110 (0.0006) [2023-03-06 14:41:08,941][03942] Fps is (10 sec: 12287.8, 60 sec: 12475.7, 300 sec: 12513.6). Total num frames: 10358784. Throughput: 0: 12488.3. Samples: 10327038. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:41:08,941][03942] Avg episode reward: [(0, '673.868')] [2023-03-06 14:41:08,944][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000010117_10359808.pth... 
[2023-03-06 14:41:08,974][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000007183_7355392.pth [2023-03-06 14:41:09,169][04272] Updated weights for policy 0, policy_version 10120 (0.0006) [2023-03-06 14:41:10,042][04272] Updated weights for policy 0, policy_version 10130 (0.0007) [2023-03-06 14:41:10,869][04272] Updated weights for policy 0, policy_version 10140 (0.0007) [2023-03-06 14:41:11,696][04272] Updated weights for policy 0, policy_version 10150 (0.0006) [2023-03-06 14:41:12,537][04272] Updated weights for policy 0, policy_version 10160 (0.0005) [2023-03-06 14:41:13,341][04272] Updated weights for policy 0, policy_version 10170 (0.0006) [2023-03-06 14:41:13,941][03942] Fps is (10 sec: 12288.0, 60 sec: 12475.7, 300 sec: 12513.6). Total num frames: 10421248. Throughput: 0: 12450.8. Samples: 10400898. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:41:13,941][03942] Avg episode reward: [(0, '735.606')] [2023-03-06 14:41:14,179][04272] Updated weights for policy 0, policy_version 10180 (0.0006) [2023-03-06 14:41:14,988][04272] Updated weights for policy 0, policy_version 10190 (0.0006) [2023-03-06 14:41:15,797][04272] Updated weights for policy 0, policy_version 10200 (0.0007) [2023-03-06 14:41:16,613][04272] Updated weights for policy 0, policy_version 10210 (0.0006) [2023-03-06 14:41:17,421][04272] Updated weights for policy 0, policy_version 10220 (0.0006) [2023-03-06 14:41:18,258][04272] Updated weights for policy 0, policy_version 10230 (0.0006) [2023-03-06 14:41:18,941][03942] Fps is (10 sec: 12492.7, 60 sec: 12475.7, 300 sec: 12513.6). Total num frames: 10483712. Throughput: 0: 12455.5. Samples: 10475975. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:41:18,941][03942] Avg episode reward: [(0, '706.174')] [2023-03-06 14:41:19,059][04272] Updated weights for policy 0, policy_version 10240 (0.0007) [2023-03-06 14:41:19,845][04272] Updated weights for policy 0, policy_version 10250 (0.0006) [2023-03-06 14:41:20,659][04272] Updated weights for policy 0, policy_version 10260 (0.0006) [2023-03-06 14:41:21,466][04272] Updated weights for policy 0, policy_version 10270 (0.0006) [2023-03-06 14:41:22,286][04272] Updated weights for policy 0, policy_version 10280 (0.0006) [2023-03-06 14:41:23,095][04272] Updated weights for policy 0, policy_version 10290 (0.0007) [2023-03-06 14:41:23,907][04272] Updated weights for policy 0, policy_version 10300 (0.0006) [2023-03-06 14:41:23,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12475.7, 300 sec: 12517.1). Total num frames: 10547200. Throughput: 0: 12465.9. Samples: 10514079. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:41:23,941][03942] Avg episode reward: [(0, '657.913')] [2023-03-06 14:41:24,730][04272] Updated weights for policy 0, policy_version 10310 (0.0006) [2023-03-06 14:41:25,552][04272] Updated weights for policy 0, policy_version 10320 (0.0006) [2023-03-06 14:41:26,357][04272] Updated weights for policy 0, policy_version 10330 (0.0006) [2023-03-06 14:41:27,166][04272] Updated weights for policy 0, policy_version 10340 (0.0006) [2023-03-06 14:41:27,976][04272] Updated weights for policy 0, policy_version 10350 (0.0006) [2023-03-06 14:41:28,805][04272] Updated weights for policy 0, policy_version 10360 (0.0007) [2023-03-06 14:41:28,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12492.8, 300 sec: 12513.6). Total num frames: 10609664. Throughput: 0: 12470.3. Samples: 10589367. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:41:28,941][03942] Avg episode reward: [(0, '678.305')] [2023-03-06 14:41:29,619][04272] Updated weights for policy 0, policy_version 10370 (0.0007) [2023-03-06 14:41:30,415][04272] Updated weights for policy 0, policy_version 10380 (0.0007) [2023-03-06 14:41:31,253][04272] Updated weights for policy 0, policy_version 10390 (0.0007) [2023-03-06 14:41:32,067][04272] Updated weights for policy 0, policy_version 10400 (0.0007) [2023-03-06 14:41:32,887][04272] Updated weights for policy 0, policy_version 10410 (0.0006) [2023-03-06 14:41:33,679][04272] Updated weights for policy 0, policy_version 10420 (0.0006) [2023-03-06 14:41:33,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12492.8, 300 sec: 12517.1). Total num frames: 10673152. Throughput: 0: 12480.4. Samples: 10664914. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 14:41:33,941][03942] Avg episode reward: [(0, '616.644')] [2023-03-06 14:41:34,494][04272] Updated weights for policy 0, policy_version 10430 (0.0007) [2023-03-06 14:41:35,307][04272] Updated weights for policy 0, policy_version 10440 (0.0006) [2023-03-06 14:41:36,141][04272] Updated weights for policy 0, policy_version 10450 (0.0006) [2023-03-06 14:41:36,954][04272] Updated weights for policy 0, policy_version 10460 (0.0006) [2023-03-06 14:41:37,761][04272] Updated weights for policy 0, policy_version 10470 (0.0006) [2023-03-06 14:41:38,566][04272] Updated weights for policy 0, policy_version 10480 (0.0007) [2023-03-06 14:41:38,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12492.8, 300 sec: 12517.1). Total num frames: 10735616. Throughput: 0: 12484.1. Samples: 10702670. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 14:41:38,941][03942] Avg episode reward: [(0, '640.406')] [2023-03-06 14:41:39,386][04272] Updated weights for policy 0, policy_version 10490 (0.0007) [2023-03-06 14:41:40,208][04272] Updated weights for policy 0, policy_version 10500 (0.0007) [2023-03-06 14:41:41,006][04272] Updated weights for policy 0, policy_version 10510 (0.0006) [2023-03-06 14:41:41,814][04272] Updated weights for policy 0, policy_version 10520 (0.0006) [2023-03-06 14:41:42,634][04272] Updated weights for policy 0, policy_version 10530 (0.0007) [2023-03-06 14:41:43,441][04272] Updated weights for policy 0, policy_version 10540 (0.0006) [2023-03-06 14:41:43,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12509.9, 300 sec: 12520.6). Total num frames: 10799104. Throughput: 0: 12510.6. Samples: 10778195. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:41:43,941][03942] Avg episode reward: [(0, '710.590')] [2023-03-06 14:41:44,239][04272] Updated weights for policy 0, policy_version 10550 (0.0006) [2023-03-06 14:41:45,052][04272] Updated weights for policy 0, policy_version 10560 (0.0006) [2023-03-06 14:41:45,859][04272] Updated weights for policy 0, policy_version 10570 (0.0007) [2023-03-06 14:41:46,681][04272] Updated weights for policy 0, policy_version 10580 (0.0007) [2023-03-06 14:41:47,504][04272] Updated weights for policy 0, policy_version 10590 (0.0006) [2023-03-06 14:41:48,324][04272] Updated weights for policy 0, policy_version 10600 (0.0006) [2023-03-06 14:41:48,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12509.9, 300 sec: 12520.6). Total num frames: 10861568. Throughput: 0: 12525.4. Samples: 10853871. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:41:48,941][03942] Avg episode reward: [(0, '706.278')] [2023-03-06 14:41:49,136][04272] Updated weights for policy 0, policy_version 10610 (0.0007) [2023-03-06 14:41:49,959][04272] Updated weights for policy 0, policy_version 10620 (0.0007) [2023-03-06 14:41:50,775][04272] Updated weights for policy 0, policy_version 10630 (0.0006) [2023-03-06 14:41:51,584][04272] Updated weights for policy 0, policy_version 10640 (0.0006) [2023-03-06 14:41:52,419][04272] Updated weights for policy 0, policy_version 10650 (0.0006) [2023-03-06 14:41:53,219][04272] Updated weights for policy 0, policy_version 10660 (0.0006) [2023-03-06 14:41:53,940][03942] Fps is (10 sec: 12492.8, 60 sec: 12509.9, 300 sec: 12520.6). Total num frames: 10924032. Throughput: 0: 12543.6. Samples: 10891498. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:41:53,941][03942] Avg episode reward: [(0, '730.450')] [2023-03-06 14:41:54,025][04272] Updated weights for policy 0, policy_version 10670 (0.0006) [2023-03-06 14:41:54,826][04272] Updated weights for policy 0, policy_version 10680 (0.0006) [2023-03-06 14:41:55,636][04272] Updated weights for policy 0, policy_version 10690 (0.0006) [2023-03-06 14:41:56,438][04272] Updated weights for policy 0, policy_version 10700 (0.0006) [2023-03-06 14:41:57,242][04272] Updated weights for policy 0, policy_version 10710 (0.0006) [2023-03-06 14:41:58,070][04272] Updated weights for policy 0, policy_version 10720 (0.0007) [2023-03-06 14:41:58,873][04272] Updated weights for policy 0, policy_version 10730 (0.0006) [2023-03-06 14:41:58,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12526.9, 300 sec: 12524.0). Total num frames: 10987520. Throughput: 0: 12589.0. Samples: 10967403. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:41:58,941][03942] Avg episode reward: [(0, '677.858')] [2023-03-06 14:41:59,669][04272] Updated weights for policy 0, policy_version 10740 (0.0006) [2023-03-06 14:42:00,493][04272] Updated weights for policy 0, policy_version 10750 (0.0006) [2023-03-06 14:42:01,306][04272] Updated weights for policy 0, policy_version 10760 (0.0007) [2023-03-06 14:42:02,134][04272] Updated weights for policy 0, policy_version 10770 (0.0007) [2023-03-06 14:42:02,951][04272] Updated weights for policy 0, policy_version 10780 (0.0006) [2023-03-06 14:42:03,771][04272] Updated weights for policy 0, policy_version 10790 (0.0006) [2023-03-06 14:42:03,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12544.0, 300 sec: 12527.5). Total num frames: 11051008. Throughput: 0: 12599.7. Samples: 11042959. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:42:03,941][03942] Avg episode reward: [(0, '693.424')] [2023-03-06 14:42:04,583][04272] Updated weights for policy 0, policy_version 10800 (0.0007) [2023-03-06 14:42:05,426][04272] Updated weights for policy 0, policy_version 10810 (0.0006) [2023-03-06 14:42:06,206][04272] Updated weights for policy 0, policy_version 10820 (0.0006) [2023-03-06 14:42:07,060][04272] Updated weights for policy 0, policy_version 10830 (0.0007) [2023-03-06 14:42:07,863][04272] Updated weights for policy 0, policy_version 10840 (0.0006) [2023-03-06 14:42:08,666][04272] Updated weights for policy 0, policy_version 10850 (0.0006) [2023-03-06 14:42:08,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12578.1, 300 sec: 12527.5). Total num frames: 11113472. Throughput: 0: 12585.9. Samples: 11080447. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:42:08,941][03942] Avg episode reward: [(0, '782.986')] [2023-03-06 14:42:08,944][04221] Saving new best policy, reward=782.986! 
[2023-03-06 14:42:09,498][04272] Updated weights for policy 0, policy_version 10860 (0.0007) [2023-03-06 14:42:10,311][04272] Updated weights for policy 0, policy_version 10870 (0.0007) [2023-03-06 14:42:11,125][04272] Updated weights for policy 0, policy_version 10880 (0.0007) [2023-03-06 14:42:11,947][04272] Updated weights for policy 0, policy_version 10890 (0.0007) [2023-03-06 14:42:12,764][04272] Updated weights for policy 0, policy_version 10900 (0.0006) [2023-03-06 14:42:13,583][04272] Updated weights for policy 0, policy_version 10910 (0.0007) [2023-03-06 14:42:13,941][03942] Fps is (10 sec: 12492.8, 60 sec: 12578.1, 300 sec: 12527.5). Total num frames: 11175936. Throughput: 0: 12582.6. Samples: 11155583. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:42:13,941][03942] Avg episode reward: [(0, '761.589')] [2023-03-06 14:42:14,398][04272] Updated weights for policy 0, policy_version 10920 (0.0006) [2023-03-06 14:42:15,207][04272] Updated weights for policy 0, policy_version 10930 (0.0007) [2023-03-06 14:42:16,015][04272] Updated weights for policy 0, policy_version 10940 (0.0007) [2023-03-06 14:42:16,837][04272] Updated weights for policy 0, policy_version 10950 (0.0007) [2023-03-06 14:42:17,647][04272] Updated weights for policy 0, policy_version 10960 (0.0007) [2023-03-06 14:42:18,473][04272] Updated weights for policy 0, policy_version 10970 (0.0006) [2023-03-06 14:42:18,941][03942] Fps is (10 sec: 12492.8, 60 sec: 12578.2, 300 sec: 12531.0). Total num frames: 11238400. Throughput: 0: 12577.2. Samples: 11230887. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:42:18,951][03942] Avg episode reward: [(0, '755.986')] [2023-03-06 14:42:19,285][04272] Updated weights for policy 0, policy_version 10980 (0.0006) [2023-03-06 14:42:20,101][04272] Updated weights for policy 0, policy_version 10990 (0.0006) [2023-03-06 14:42:20,923][04272] Updated weights for policy 0, policy_version 11000 (0.0007) [2023-03-06 14:42:21,756][04272] Updated weights for policy 0, policy_version 11010 (0.0006) [2023-03-06 14:42:22,567][04272] Updated weights for policy 0, policy_version 11020 (0.0006) [2023-03-06 14:42:23,377][04272] Updated weights for policy 0, policy_version 11030 (0.0006) [2023-03-06 14:42:23,940][03942] Fps is (10 sec: 12492.9, 60 sec: 12561.1, 300 sec: 12527.5). Total num frames: 11300864. Throughput: 0: 12571.3. Samples: 11268379. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:42:23,941][03942] Avg episode reward: [(0, '728.277')] [2023-03-06 14:42:24,196][04272] Updated weights for policy 0, policy_version 11040 (0.0006) [2023-03-06 14:42:25,020][04272] Updated weights for policy 0, policy_version 11050 (0.0006) [2023-03-06 14:42:25,814][04272] Updated weights for policy 0, policy_version 11060 (0.0006) [2023-03-06 14:42:26,637][04272] Updated weights for policy 0, policy_version 11070 (0.0006) [2023-03-06 14:42:27,441][04272] Updated weights for policy 0, policy_version 11080 (0.0006) [2023-03-06 14:42:28,250][04272] Updated weights for policy 0, policy_version 11090 (0.0007) [2023-03-06 14:42:28,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12578.1, 300 sec: 12531.0). Total num frames: 11364352. Throughput: 0: 12571.5. Samples: 11343915. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:42:28,941][03942] Avg episode reward: [(0, '730.916')] [2023-03-06 14:42:29,073][04272] Updated weights for policy 0, policy_version 11100 (0.0006) [2023-03-06 14:42:29,883][04272] Updated weights for policy 0, policy_version 11110 (0.0006) [2023-03-06 14:42:30,712][04272] Updated weights for policy 0, policy_version 11120 (0.0006) [2023-03-06 14:42:31,521][04272] Updated weights for policy 0, policy_version 11130 (0.0007) [2023-03-06 14:42:32,346][04272] Updated weights for policy 0, policy_version 11140 (0.0007) [2023-03-06 14:42:33,177][04272] Updated weights for policy 0, policy_version 11150 (0.0006) [2023-03-06 14:42:33,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12561.1, 300 sec: 12531.0). Total num frames: 11426816. Throughput: 0: 12558.3. Samples: 11418995. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:42:33,941][03942] Avg episode reward: [(0, '773.755')] [2023-03-06 14:42:33,964][04272] Updated weights for policy 0, policy_version 11160 (0.0007) [2023-03-06 14:42:34,797][04272] Updated weights for policy 0, policy_version 11170 (0.0006) [2023-03-06 14:42:35,604][04272] Updated weights for policy 0, policy_version 11180 (0.0007) [2023-03-06 14:42:36,411][04272] Updated weights for policy 0, policy_version 11190 (0.0006) [2023-03-06 14:42:37,238][04272] Updated weights for policy 0, policy_version 11200 (0.0007) [2023-03-06 14:42:38,048][04272] Updated weights for policy 0, policy_version 11210 (0.0007) [2023-03-06 14:42:38,865][04272] Updated weights for policy 0, policy_version 11220 (0.0006) [2023-03-06 14:42:38,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12578.1, 300 sec: 12538.0). Total num frames: 11490304. Throughput: 0: 12563.2. Samples: 11456842. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:42:38,941][03942] Avg episode reward: [(0, '697.212')] [2023-03-06 14:42:39,670][04272] Updated weights for policy 0, policy_version 11230 (0.0007) [2023-03-06 14:42:40,497][04272] Updated weights for policy 0, policy_version 11240 (0.0006) [2023-03-06 14:42:41,295][04272] Updated weights for policy 0, policy_version 11250 (0.0006) [2023-03-06 14:42:42,112][04272] Updated weights for policy 0, policy_version 11260 (0.0006) [2023-03-06 14:42:42,912][04272] Updated weights for policy 0, policy_version 11270 (0.0007) [2023-03-06 14:42:43,738][04272] Updated weights for policy 0, policy_version 11280 (0.0006) [2023-03-06 14:42:43,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12561.1, 300 sec: 12537.9). Total num frames: 11552768. Throughput: 0: 12554.4. Samples: 11532352. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:42:43,941][03942] Avg episode reward: [(0, '750.427')] [2023-03-06 14:42:44,579][04272] Updated weights for policy 0, policy_version 11290 (0.0006) [2023-03-06 14:42:45,398][04272] Updated weights for policy 0, policy_version 11300 (0.0006) [2023-03-06 14:42:46,225][04272] Updated weights for policy 0, policy_version 11310 (0.0006) [2023-03-06 14:42:47,031][04272] Updated weights for policy 0, policy_version 11320 (0.0006) [2023-03-06 14:42:47,853][04272] Updated weights for policy 0, policy_version 11330 (0.0006) [2023-03-06 14:42:48,666][04272] Updated weights for policy 0, policy_version 11340 (0.0007) [2023-03-06 14:42:48,940][03942] Fps is (10 sec: 12492.8, 60 sec: 12561.1, 300 sec: 12534.5). Total num frames: 11615232. Throughput: 0: 12543.7. Samples: 11607425. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:42:48,941][03942] Avg episode reward: [(0, '691.227')] [2023-03-06 14:42:49,475][04272] Updated weights for policy 0, policy_version 11350 (0.0007) [2023-03-06 14:42:50,287][04272] Updated weights for policy 0, policy_version 11360 (0.0006) [2023-03-06 14:42:51,089][04272] Updated weights for policy 0, policy_version 11370 (0.0007) [2023-03-06 14:42:51,909][04272] Updated weights for policy 0, policy_version 11380 (0.0006) [2023-03-06 14:42:52,749][04272] Updated weights for policy 0, policy_version 11390 (0.0007) [2023-03-06 14:42:53,563][04272] Updated weights for policy 0, policy_version 11400 (0.0006) [2023-03-06 14:42:53,941][03942] Fps is (10 sec: 12492.8, 60 sec: 12561.1, 300 sec: 12534.5). Total num frames: 11677696. Throughput: 0: 12548.0. Samples: 11645106. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:42:53,941][03942] Avg episode reward: [(0, '728.039')] [2023-03-06 14:42:54,368][04272] Updated weights for policy 0, policy_version 11410 (0.0006) [2023-03-06 14:42:55,218][04272] Updated weights for policy 0, policy_version 11420 (0.0007) [2023-03-06 14:42:56,019][04272] Updated weights for policy 0, policy_version 11430 (0.0006) [2023-03-06 14:42:56,837][04272] Updated weights for policy 0, policy_version 11440 (0.0008) [2023-03-06 14:42:57,657][04272] Updated weights for policy 0, policy_version 11450 (0.0007) [2023-03-06 14:42:58,440][04272] Updated weights for policy 0, policy_version 11460 (0.0006) [2023-03-06 14:42:58,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12561.1, 300 sec: 12537.9). Total num frames: 11741184. Throughput: 0: 12544.8. Samples: 11720098. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:42:58,941][03942] Avg episode reward: [(0, '775.948')] [2023-03-06 14:42:59,281][04272] Updated weights for policy 0, policy_version 11470 (0.0006) [2023-03-06 14:43:00,114][04272] Updated weights for policy 0, policy_version 11480 (0.0007) [2023-03-06 14:43:00,925][04272] Updated weights for policy 0, policy_version 11490 (0.0007) [2023-03-06 14:43:01,735][04272] Updated weights for policy 0, policy_version 11500 (0.0006) [2023-03-06 14:43:02,570][04272] Updated weights for policy 0, policy_version 11510 (0.0007) [2023-03-06 14:43:03,389][04272] Updated weights for policy 0, policy_version 11520 (0.0007) [2023-03-06 14:43:03,940][03942] Fps is (10 sec: 12492.8, 60 sec: 12526.9, 300 sec: 12534.5). Total num frames: 11802624. Throughput: 0: 12539.8. Samples: 11795177. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:43:03,941][03942] Avg episode reward: [(0, '763.041')] [2023-03-06 14:43:04,190][04272] Updated weights for policy 0, policy_version 11530 (0.0007) [2023-03-06 14:43:05,042][04272] Updated weights for policy 0, policy_version 11540 (0.0006) [2023-03-06 14:43:05,854][04272] Updated weights for policy 0, policy_version 11550 (0.0007) [2023-03-06 14:43:06,670][04272] Updated weights for policy 0, policy_version 11560 (0.0007) [2023-03-06 14:43:07,481][04272] Updated weights for policy 0, policy_version 11570 (0.0006) [2023-03-06 14:43:08,304][04272] Updated weights for policy 0, policy_version 11580 (0.0007) [2023-03-06 14:43:08,940][03942] Fps is (10 sec: 12390.4, 60 sec: 12526.9, 300 sec: 12534.5). Total num frames: 11865088. Throughput: 0: 12540.5. Samples: 11832700. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:43:08,941][03942] Avg episode reward: [(0, '760.465')] [2023-03-06 14:43:08,944][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000011587_11865088.pth... 
[2023-03-06 14:43:08,974][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000008650_8857600.pth [2023-03-06 14:43:09,118][04272] Updated weights for policy 0, policy_version 11590 (0.0006) [2023-03-06 14:43:09,937][04272] Updated weights for policy 0, policy_version 11600 (0.0006) [2023-03-06 14:43:10,767][04272] Updated weights for policy 0, policy_version 11610 (0.0007) [2023-03-06 14:43:11,582][04272] Updated weights for policy 0, policy_version 11620 (0.0006) [2023-03-06 14:43:12,397][04272] Updated weights for policy 0, policy_version 11630 (0.0006) [2023-03-06 14:43:13,229][04272] Updated weights for policy 0, policy_version 11640 (0.0007) [2023-03-06 14:43:13,941][03942] Fps is (10 sec: 12492.7, 60 sec: 12526.9, 300 sec: 12534.5). Total num frames: 11927552. Throughput: 0: 12525.6. Samples: 11907568. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:43:13,941][03942] Avg episode reward: [(0, '766.347')] [2023-03-06 14:43:14,041][04272] Updated weights for policy 0, policy_version 11650 (0.0006) [2023-03-06 14:43:14,863][04272] Updated weights for policy 0, policy_version 11660 (0.0007) [2023-03-06 14:43:15,693][04272] Updated weights for policy 0, policy_version 11670 (0.0006) [2023-03-06 14:43:16,505][04272] Updated weights for policy 0, policy_version 11680 (0.0006) [2023-03-06 14:43:17,327][04272] Updated weights for policy 0, policy_version 11690 (0.0006) [2023-03-06 14:43:18,121][04272] Updated weights for policy 0, policy_version 11700 (0.0007) [2023-03-06 14:43:18,941][03942] Fps is (10 sec: 12595.0, 60 sec: 12544.0, 300 sec: 12537.9). Total num frames: 11991040. Throughput: 0: 12524.9. Samples: 11982619. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:43:18,941][03942] Avg episode reward: [(0, '794.870')] [2023-03-06 14:43:18,942][04272] Updated weights for policy 0, policy_version 11710 (0.0007) [2023-03-06 14:43:18,944][04221] Saving new best policy, reward=794.870! 
[2023-03-06 14:43:19,772][04272] Updated weights for policy 0, policy_version 11720 (0.0007) [2023-03-06 14:43:20,590][04272] Updated weights for policy 0, policy_version 11730 (0.0006) [2023-03-06 14:43:21,403][04272] Updated weights for policy 0, policy_version 11740 (0.0006) [2023-03-06 14:43:22,229][04272] Updated weights for policy 0, policy_version 11750 (0.0007) [2023-03-06 14:43:23,048][04272] Updated weights for policy 0, policy_version 11760 (0.0006) [2023-03-06 14:43:23,871][04272] Updated weights for policy 0, policy_version 11770 (0.0006) [2023-03-06 14:43:23,941][03942] Fps is (10 sec: 12492.8, 60 sec: 12526.9, 300 sec: 12534.5). Total num frames: 12052480. Throughput: 0: 12516.5. Samples: 12020085. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:43:23,941][03942] Avg episode reward: [(0, '732.242')] [2023-03-06 14:43:24,685][04272] Updated weights for policy 0, policy_version 11780 (0.0006) [2023-03-06 14:43:25,507][04272] Updated weights for policy 0, policy_version 11790 (0.0006) [2023-03-06 14:43:26,326][04272] Updated weights for policy 0, policy_version 11800 (0.0006) [2023-03-06 14:43:27,135][04272] Updated weights for policy 0, policy_version 11810 (0.0006) [2023-03-06 14:43:27,966][04272] Updated weights for policy 0, policy_version 11820 (0.0007) [2023-03-06 14:43:28,773][04272] Updated weights for policy 0, policy_version 11830 (0.0007) [2023-03-06 14:43:28,941][03942] Fps is (10 sec: 12492.8, 60 sec: 12526.9, 300 sec: 12537.9). Total num frames: 12115968. Throughput: 0: 12508.5. Samples: 12095234. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:43:28,941][03942] Avg episode reward: [(0, '688.102')] [2023-03-06 14:43:29,588][04272] Updated weights for policy 0, policy_version 11840 (0.0007) [2023-03-06 14:43:30,413][04272] Updated weights for policy 0, policy_version 11850 (0.0006) [2023-03-06 14:43:31,228][04272] Updated weights for policy 0, policy_version 11860 (0.0008) [2023-03-06 14:43:32,038][04272] Updated weights for policy 0, policy_version 11870 (0.0006) [2023-03-06 14:43:32,864][04272] Updated weights for policy 0, policy_version 11880 (0.0007) [2023-03-06 14:43:33,683][04272] Updated weights for policy 0, policy_version 11890 (0.0007) [2023-03-06 14:43:33,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12526.9, 300 sec: 12534.5). Total num frames: 12178432. Throughput: 0: 12508.8. Samples: 12170323. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:43:33,941][03942] Avg episode reward: [(0, '711.720')] [2023-03-06 14:43:34,495][04272] Updated weights for policy 0, policy_version 11900 (0.0006) [2023-03-06 14:43:35,288][04272] Updated weights for policy 0, policy_version 11910 (0.0006) [2023-03-06 14:43:36,107][04272] Updated weights for policy 0, policy_version 11920 (0.0006) [2023-03-06 14:43:36,912][04272] Updated weights for policy 0, policy_version 11930 (0.0006) [2023-03-06 14:43:37,735][04272] Updated weights for policy 0, policy_version 11940 (0.0006) [2023-03-06 14:43:38,548][04272] Updated weights for policy 0, policy_version 11950 (0.0007) [2023-03-06 14:43:38,941][03942] Fps is (10 sec: 12492.9, 60 sec: 12509.9, 300 sec: 12534.5). Total num frames: 12240896. Throughput: 0: 12512.8. Samples: 12208184. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:43:38,941][03942] Avg episode reward: [(0, '781.596')] [2023-03-06 14:43:39,373][04272] Updated weights for policy 0, policy_version 11960 (0.0006) [2023-03-06 14:43:40,179][04272] Updated weights for policy 0, policy_version 11970 (0.0007) [2023-03-06 14:43:40,994][04272] Updated weights for policy 0, policy_version 11980 (0.0006) [2023-03-06 14:43:41,820][04272] Updated weights for policy 0, policy_version 11990 (0.0006) [2023-03-06 14:43:42,629][04272] Updated weights for policy 0, policy_version 12000 (0.0007) [2023-03-06 14:43:43,456][04272] Updated weights for policy 0, policy_version 12010 (0.0007) [2023-03-06 14:43:43,941][03942] Fps is (10 sec: 12492.8, 60 sec: 12509.9, 300 sec: 12534.5). Total num frames: 12303360. Throughput: 0: 12524.0. Samples: 12283676. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:43:43,941][03942] Avg episode reward: [(0, '752.769')] [2023-03-06 14:43:44,266][04272] Updated weights for policy 0, policy_version 12020 (0.0006) [2023-03-06 14:43:45,073][04272] Updated weights for policy 0, policy_version 12030 (0.0006) [2023-03-06 14:43:45,906][04272] Updated weights for policy 0, policy_version 12040 (0.0007) [2023-03-06 14:43:46,704][04272] Updated weights for policy 0, policy_version 12050 (0.0006) [2023-03-06 14:43:47,528][04272] Updated weights for policy 0, policy_version 12060 (0.0007) [2023-03-06 14:43:48,356][04272] Updated weights for policy 0, policy_version 12070 (0.0006) [2023-03-06 14:43:48,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12526.9, 300 sec: 12537.9). Total num frames: 12366848. Throughput: 0: 12525.3. Samples: 12358816. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-06 14:43:48,941][03942] Avg episode reward: [(0, '773.666')] [2023-03-06 14:43:49,169][04272] Updated weights for policy 0, policy_version 12080 (0.0006) [2023-03-06 14:43:49,985][04272] Updated weights for policy 0, policy_version 12090 (0.0006) [2023-03-06 14:43:50,813][04272] Updated weights for policy 0, policy_version 12100 (0.0006) [2023-03-06 14:43:51,621][04272] Updated weights for policy 0, policy_version 12110 (0.0007) [2023-03-06 14:43:52,439][04272] Updated weights for policy 0, policy_version 12120 (0.0006) [2023-03-06 14:43:53,230][04272] Updated weights for policy 0, policy_version 12130 (0.0006) [2023-03-06 14:43:53,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12526.9, 300 sec: 12537.9). Total num frames: 12429312. Throughput: 0: 12525.1. Samples: 12396330. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-06 14:43:53,941][03942] Avg episode reward: [(0, '772.393')] [2023-03-06 14:43:54,059][04272] Updated weights for policy 0, policy_version 12140 (0.0006) [2023-03-06 14:43:54,857][04272] Updated weights for policy 0, policy_version 12150 (0.0006) [2023-03-06 14:43:55,690][04272] Updated weights for policy 0, policy_version 12160 (0.0006) [2023-03-06 14:43:56,491][04272] Updated weights for policy 0, policy_version 12170 (0.0006) [2023-03-06 14:43:57,301][04272] Updated weights for policy 0, policy_version 12180 (0.0006) [2023-03-06 14:43:58,121][04272] Updated weights for policy 0, policy_version 12190 (0.0006) [2023-03-06 14:43:58,938][04272] Updated weights for policy 0, policy_version 12200 (0.0008) [2023-03-06 14:43:58,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12526.9, 300 sec: 12537.9). Total num frames: 12492800. Throughput: 0: 12542.8. Samples: 12471992. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:43:58,941][03942] Avg episode reward: [(0, '760.540')] [2023-03-06 14:43:59,755][04272] Updated weights for policy 0, policy_version 12210 (0.0006) [2023-03-06 14:44:00,566][04272] Updated weights for policy 0, policy_version 12220 (0.0006) [2023-03-06 14:44:01,381][04272] Updated weights for policy 0, policy_version 12230 (0.0006) [2023-03-06 14:44:02,204][04272] Updated weights for policy 0, policy_version 12240 (0.0007) [2023-03-06 14:44:03,005][04272] Updated weights for policy 0, policy_version 12250 (0.0006) [2023-03-06 14:44:03,833][04272] Updated weights for policy 0, policy_version 12260 (0.0006) [2023-03-06 14:44:03,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12544.0, 300 sec: 12537.9). Total num frames: 12555264. Throughput: 0: 12546.9. Samples: 12547225. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:44:03,941][03942] Avg episode reward: [(0, '781.092')] [2023-03-06 14:44:04,645][04272] Updated weights for policy 0, policy_version 12270 (0.0007) [2023-03-06 14:44:05,446][04272] Updated weights for policy 0, policy_version 12280 (0.0007) [2023-03-06 14:44:06,276][04272] Updated weights for policy 0, policy_version 12290 (0.0007) [2023-03-06 14:44:07,115][04272] Updated weights for policy 0, policy_version 12300 (0.0006) [2023-03-06 14:44:07,945][04272] Updated weights for policy 0, policy_version 12310 (0.0006) [2023-03-06 14:44:08,739][04272] Updated weights for policy 0, policy_version 12320 (0.0007) [2023-03-06 14:44:08,941][03942] Fps is (10 sec: 12492.6, 60 sec: 12544.0, 300 sec: 12534.5). Total num frames: 12617728. Throughput: 0: 12549.6. Samples: 12584817. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:44:08,941][03942] Avg episode reward: [(0, '761.634')] [2023-03-06 14:44:09,576][04272] Updated weights for policy 0, policy_version 12330 (0.0006) [2023-03-06 14:44:10,383][04272] Updated weights for policy 0, policy_version 12340 (0.0007) [2023-03-06 14:44:11,194][04272] Updated weights for policy 0, policy_version 12350 (0.0006) [2023-03-06 14:44:12,015][04272] Updated weights for policy 0, policy_version 12360 (0.0007) [2023-03-06 14:44:12,839][04272] Updated weights for policy 0, policy_version 12370 (0.0006) [2023-03-06 14:44:13,663][04272] Updated weights for policy 0, policy_version 12380 (0.0006) [2023-03-06 14:44:13,941][03942] Fps is (10 sec: 12492.7, 60 sec: 12544.0, 300 sec: 12534.5). Total num frames: 12680192. Throughput: 0: 12548.9. Samples: 12659934. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:44:13,941][03942] Avg episode reward: [(0, '765.428')] [2023-03-06 14:44:14,472][04272] Updated weights for policy 0, policy_version 12390 (0.0006) [2023-03-06 14:44:15,289][04272] Updated weights for policy 0, policy_version 12400 (0.0006) [2023-03-06 14:44:16,097][04272] Updated weights for policy 0, policy_version 12410 (0.0006) [2023-03-06 14:44:16,926][04272] Updated weights for policy 0, policy_version 12420 (0.0006) [2023-03-06 14:44:17,731][04272] Updated weights for policy 0, policy_version 12430 (0.0007) [2023-03-06 14:44:18,540][04272] Updated weights for policy 0, policy_version 12440 (0.0007) [2023-03-06 14:44:18,940][03942] Fps is (10 sec: 12595.4, 60 sec: 12544.0, 300 sec: 12534.5). Total num frames: 12743680. Throughput: 0: 12557.1. Samples: 12735391. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:44:18,941][03942] Avg episode reward: [(0, '766.949')] [2023-03-06 14:44:19,362][04272] Updated weights for policy 0, policy_version 12450 (0.0006) [2023-03-06 14:44:20,175][04272] Updated weights for policy 0, policy_version 12460 (0.0007) [2023-03-06 14:44:20,994][04272] Updated weights for policy 0, policy_version 12470 (0.0006) [2023-03-06 14:44:21,821][04272] Updated weights for policy 0, policy_version 12480 (0.0007) [2023-03-06 14:44:22,631][04272] Updated weights for policy 0, policy_version 12490 (0.0007) [2023-03-06 14:44:23,434][04272] Updated weights for policy 0, policy_version 12500 (0.0006) [2023-03-06 14:44:23,940][03942] Fps is (10 sec: 12492.9, 60 sec: 12544.0, 300 sec: 12531.0). Total num frames: 12805120. Throughput: 0: 12547.0. Samples: 12772798. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:44:23,941][03942] Avg episode reward: [(0, '786.812')] [2023-03-06 14:44:24,268][04272] Updated weights for policy 0, policy_version 12510 (0.0007) [2023-03-06 14:44:25,076][04272] Updated weights for policy 0, policy_version 12520 (0.0007) [2023-03-06 14:44:25,902][04272] Updated weights for policy 0, policy_version 12530 (0.0006) [2023-03-06 14:44:26,727][04272] Updated weights for policy 0, policy_version 12540 (0.0007) [2023-03-06 14:44:27,534][04272] Updated weights for policy 0, policy_version 12550 (0.0006) [2023-03-06 14:44:28,341][04272] Updated weights for policy 0, policy_version 12560 (0.0006) [2023-03-06 14:44:28,941][03942] Fps is (10 sec: 12492.6, 60 sec: 12544.0, 300 sec: 12531.0). Total num frames: 12868608. Throughput: 0: 12541.9. Samples: 12848064. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:44:28,941][03942] Avg episode reward: [(0, '767.771')] [2023-03-06 14:44:29,167][04272] Updated weights for policy 0, policy_version 12570 (0.0006) [2023-03-06 14:44:29,987][04272] Updated weights for policy 0, policy_version 12580 (0.0006) [2023-03-06 14:44:30,785][04272] Updated weights for policy 0, policy_version 12590 (0.0007) [2023-03-06 14:44:31,599][04272] Updated weights for policy 0, policy_version 12600 (0.0006) [2023-03-06 14:44:32,406][04272] Updated weights for policy 0, policy_version 12610 (0.0005) [2023-03-06 14:44:33,224][04272] Updated weights for policy 0, policy_version 12620 (0.0007) [2023-03-06 14:44:33,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12561.1, 300 sec: 12534.5). Total num frames: 12932096. Throughput: 0: 12550.1. Samples: 12923569. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:44:33,941][03942] Avg episode reward: [(0, '775.435')] [2023-03-06 14:44:34,036][04272] Updated weights for policy 0, policy_version 12630 (0.0007) [2023-03-06 14:44:34,856][04272] Updated weights for policy 0, policy_version 12640 (0.0007) [2023-03-06 14:44:35,673][04272] Updated weights for policy 0, policy_version 12650 (0.0006) [2023-03-06 14:44:36,482][04272] Updated weights for policy 0, policy_version 12660 (0.0007) [2023-03-06 14:44:37,298][04272] Updated weights for policy 0, policy_version 12670 (0.0006) [2023-03-06 14:44:38,104][04272] Updated weights for policy 0, policy_version 12680 (0.0006) [2023-03-06 14:44:38,932][04272] Updated weights for policy 0, policy_version 12690 (0.0007) [2023-03-06 14:44:38,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12561.1, 300 sec: 12531.0). Total num frames: 12994560. Throughput: 0: 12554.0. Samples: 12961260. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:44:38,941][03942] Avg episode reward: [(0, '734.228')] [2023-03-06 14:44:39,714][04272] Updated weights for policy 0, policy_version 12700 (0.0007) [2023-03-06 14:44:40,559][04272] Updated weights for policy 0, policy_version 12710 (0.0007) [2023-03-06 14:44:41,364][04272] Updated weights for policy 0, policy_version 12720 (0.0006) [2023-03-06 14:44:42,176][04272] Updated weights for policy 0, policy_version 12730 (0.0007) [2023-03-06 14:44:42,989][04272] Updated weights for policy 0, policy_version 12740 (0.0006) [2023-03-06 14:44:43,806][04272] Updated weights for policy 0, policy_version 12750 (0.0006) [2023-03-06 14:44:43,940][03942] Fps is (10 sec: 12492.8, 60 sec: 12561.1, 300 sec: 12531.0). Total num frames: 13057024. Throughput: 0: 12548.8. Samples: 13036689. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:44:43,941][03942] Avg episode reward: [(0, '642.120')] [2023-03-06 14:44:44,634][04272] Updated weights for policy 0, policy_version 12760 (0.0006) [2023-03-06 14:44:45,441][04272] Updated weights for policy 0, policy_version 12770 (0.0006) [2023-03-06 14:44:46,262][04272] Updated weights for policy 0, policy_version 12780 (0.0006) [2023-03-06 14:44:47,086][04272] Updated weights for policy 0, policy_version 12790 (0.0007) [2023-03-06 14:44:47,891][04272] Updated weights for policy 0, policy_version 12800 (0.0007) [2023-03-06 14:44:48,714][04272] Updated weights for policy 0, policy_version 12810 (0.0006) [2023-03-06 14:44:48,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12561.1, 300 sec: 12531.0). Total num frames: 13120512. Throughput: 0: 12551.4. Samples: 13112039. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:44:48,941][03942] Avg episode reward: [(0, '734.349')] [2023-03-06 14:44:49,535][04272] Updated weights for policy 0, policy_version 12820 (0.0006) [2023-03-06 14:44:50,346][04272] Updated weights for policy 0, policy_version 12830 (0.0006) [2023-03-06 14:44:51,138][04272] Updated weights for policy 0, policy_version 12840 (0.0006) [2023-03-06 14:44:51,954][04272] Updated weights for policy 0, policy_version 12850 (0.0006) [2023-03-06 14:44:52,769][04272] Updated weights for policy 0, policy_version 12860 (0.0007) [2023-03-06 14:44:53,581][04272] Updated weights for policy 0, policy_version 12870 (0.0006) [2023-03-06 14:44:53,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12561.1, 300 sec: 12534.5). Total num frames: 13182976. Throughput: 0: 12554.8. Samples: 13149782. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:44:53,941][03942] Avg episode reward: [(0, '738.166')] [2023-03-06 14:44:54,390][04272] Updated weights for policy 0, policy_version 12880 (0.0006) [2023-03-06 14:44:55,221][04272] Updated weights for policy 0, policy_version 12890 (0.0006) [2023-03-06 14:44:56,039][04272] Updated weights for policy 0, policy_version 12900 (0.0006) [2023-03-06 14:44:56,853][04272] Updated weights for policy 0, policy_version 12910 (0.0007) [2023-03-06 14:44:57,666][04272] Updated weights for policy 0, policy_version 12920 (0.0007) [2023-03-06 14:44:58,474][04272] Updated weights for policy 0, policy_version 12930 (0.0006) [2023-03-06 14:44:58,941][03942] Fps is (10 sec: 12492.6, 60 sec: 12544.0, 300 sec: 12534.5). Total num frames: 13245440. Throughput: 0: 12561.0. Samples: 13225179. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:44:58,941][03942] Avg episode reward: [(0, '703.438')] [2023-03-06 14:44:59,303][04272] Updated weights for policy 0, policy_version 12940 (0.0007) [2023-03-06 14:45:00,125][04272] Updated weights for policy 0, policy_version 12950 (0.0006) [2023-03-06 14:45:00,922][04272] Updated weights for policy 0, policy_version 12960 (0.0007) [2023-03-06 14:45:01,736][04272] Updated weights for policy 0, policy_version 12970 (0.0006) [2023-03-06 14:45:02,577][04272] Updated weights for policy 0, policy_version 12980 (0.0007) [2023-03-06 14:45:03,372][04272] Updated weights for policy 0, policy_version 12990 (0.0007) [2023-03-06 14:45:03,941][03942] Fps is (10 sec: 12492.7, 60 sec: 12544.0, 300 sec: 12534.5). Total num frames: 13307904. Throughput: 0: 12558.6. Samples: 13300532. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:45:03,941][03942] Avg episode reward: [(0, '769.110')] [2023-03-06 14:45:04,175][04272] Updated weights for policy 0, policy_version 13000 (0.0007) [2023-03-06 14:45:04,988][04272] Updated weights for policy 0, policy_version 13010 (0.0006) [2023-03-06 14:45:05,815][04272] Updated weights for policy 0, policy_version 13020 (0.0007) [2023-03-06 14:45:06,639][04272] Updated weights for policy 0, policy_version 13030 (0.0006) [2023-03-06 14:45:07,456][04272] Updated weights for policy 0, policy_version 13040 (0.0007) [2023-03-06 14:45:08,282][04272] Updated weights for policy 0, policy_version 13050 (0.0007) [2023-03-06 14:45:08,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12561.1, 300 sec: 12537.9). Total num frames: 13371392. Throughput: 0: 12568.2. Samples: 13338369. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:45:08,941][03942] Avg episode reward: [(0, '737.217')] [2023-03-06 14:45:08,945][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000013058_13371392.pth... 
[2023-03-06 14:45:08,975][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000010117_10359808.pth [2023-03-06 14:45:09,090][04272] Updated weights for policy 0, policy_version 13060 (0.0007) [2023-03-06 14:45:09,898][04272] Updated weights for policy 0, policy_version 13070 (0.0005) [2023-03-06 14:45:10,720][04272] Updated weights for policy 0, policy_version 13080 (0.0006) [2023-03-06 14:45:11,522][04272] Updated weights for policy 0, policy_version 13090 (0.0007) [2023-03-06 14:45:12,347][04272] Updated weights for policy 0, policy_version 13100 (0.0007) [2023-03-06 14:45:13,142][04272] Updated weights for policy 0, policy_version 13110 (0.0006) [2023-03-06 14:45:13,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12561.1, 300 sec: 12537.9). Total num frames: 13433856. Throughput: 0: 12567.1. Samples: 13413584. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:45:13,941][03942] Avg episode reward: [(0, '747.984')] [2023-03-06 14:45:13,950][04272] Updated weights for policy 0, policy_version 13120 (0.0006) [2023-03-06 14:45:14,774][04272] Updated weights for policy 0, policy_version 13130 (0.0006) [2023-03-06 14:45:15,585][04272] Updated weights for policy 0, policy_version 13140 (0.0007) [2023-03-06 14:45:16,399][04272] Updated weights for policy 0, policy_version 13150 (0.0006) [2023-03-06 14:45:17,218][04272] Updated weights for policy 0, policy_version 13160 (0.0007) [2023-03-06 14:45:18,039][04272] Updated weights for policy 0, policy_version 13170 (0.0006) [2023-03-06 14:45:18,838][04272] Updated weights for policy 0, policy_version 13180 (0.0006) [2023-03-06 14:45:18,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12561.0, 300 sec: 12537.9). Total num frames: 13497344. Throughput: 0: 12568.6. Samples: 13489156. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:45:18,941][03942] Avg episode reward: [(0, '747.266')] [2023-03-06 14:45:19,668][04272] Updated weights for policy 0, policy_version 13190 (0.0006) [2023-03-06 14:45:20,486][04272] Updated weights for policy 0, policy_version 13200 (0.0006) [2023-03-06 14:45:21,321][04272] Updated weights for policy 0, policy_version 13210 (0.0006) [2023-03-06 14:45:22,126][04272] Updated weights for policy 0, policy_version 13220 (0.0006) [2023-03-06 14:45:22,937][04272] Updated weights for policy 0, policy_version 13230 (0.0006) [2023-03-06 14:45:23,755][04272] Updated weights for policy 0, policy_version 13240 (0.0006) [2023-03-06 14:45:23,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12578.1, 300 sec: 12541.4). Total num frames: 13559808. Throughput: 0: 12563.2. Samples: 13526602. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:45:23,941][03942] Avg episode reward: [(0, '740.857')] [2023-03-06 14:45:24,565][04272] Updated weights for policy 0, policy_version 13250 (0.0007) [2023-03-06 14:45:25,384][04272] Updated weights for policy 0, policy_version 13260 (0.0007) [2023-03-06 14:45:26,190][04272] Updated weights for policy 0, policy_version 13270 (0.0006) [2023-03-06 14:45:26,995][04272] Updated weights for policy 0, policy_version 13280 (0.0006) [2023-03-06 14:45:27,821][04272] Updated weights for policy 0, policy_version 13290 (0.0006) [2023-03-06 14:45:28,629][04272] Updated weights for policy 0, policy_version 13300 (0.0007) [2023-03-06 14:45:28,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12578.1, 300 sec: 12541.4). Total num frames: 13623296. Throughput: 0: 12565.3. Samples: 13602127. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:45:28,941][03942] Avg episode reward: [(0, '645.237')] [2023-03-06 14:45:29,428][04272] Updated weights for policy 0, policy_version 13310 (0.0006) [2023-03-06 14:45:30,254][04272] Updated weights for policy 0, policy_version 13320 (0.0006) [2023-03-06 14:45:31,081][04272] Updated weights for policy 0, policy_version 13330 (0.0007) [2023-03-06 14:45:31,894][04272] Updated weights for policy 0, policy_version 13340 (0.0006) [2023-03-06 14:45:32,690][04272] Updated weights for policy 0, policy_version 13350 (0.0007) [2023-03-06 14:45:33,522][04272] Updated weights for policy 0, policy_version 13360 (0.0006) [2023-03-06 14:45:33,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12561.1, 300 sec: 12541.4). Total num frames: 13685760. Throughput: 0: 12569.5. Samples: 13677670. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:45:33,941][03942] Avg episode reward: [(0, '662.910')] [2023-03-06 14:45:34,322][04272] Updated weights for policy 0, policy_version 13370 (0.0006) [2023-03-06 14:45:35,130][04272] Updated weights for policy 0, policy_version 13380 (0.0006) [2023-03-06 14:45:35,949][04272] Updated weights for policy 0, policy_version 13390 (0.0006) [2023-03-06 14:45:36,759][04272] Updated weights for policy 0, policy_version 13400 (0.0007) [2023-03-06 14:45:37,560][04272] Updated weights for policy 0, policy_version 13410 (0.0007) [2023-03-06 14:45:38,377][04272] Updated weights for policy 0, policy_version 13420 (0.0006) [2023-03-06 14:45:38,941][03942] Fps is (10 sec: 12492.8, 60 sec: 12561.1, 300 sec: 12541.4). Total num frames: 13748224. Throughput: 0: 12574.9. Samples: 13715654. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:45:38,941][03942] Avg episode reward: [(0, '723.990')] [2023-03-06 14:45:39,205][04272] Updated weights for policy 0, policy_version 13430 (0.0006) [2023-03-06 14:45:40,015][04272] Updated weights for policy 0, policy_version 13440 (0.0006) [2023-03-06 14:45:40,833][04272] Updated weights for policy 0, policy_version 13450 (0.0006) [2023-03-06 14:45:41,638][04272] Updated weights for policy 0, policy_version 13460 (0.0007) [2023-03-06 14:45:42,454][04272] Updated weights for policy 0, policy_version 13470 (0.0006) [2023-03-06 14:45:43,261][04272] Updated weights for policy 0, policy_version 13480 (0.0006) [2023-03-06 14:45:43,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12578.1, 300 sec: 12544.9). Total num frames: 13811712. Throughput: 0: 12576.4. Samples: 13791114. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:45:43,941][03942] Avg episode reward: [(0, '709.635')] [2023-03-06 14:45:44,082][04272] Updated weights for policy 0, policy_version 13490 (0.0007) [2023-03-06 14:45:44,917][04272] Updated weights for policy 0, policy_version 13500 (0.0006) [2023-03-06 14:45:45,715][04272] Updated weights for policy 0, policy_version 13510 (0.0006) [2023-03-06 14:45:46,535][04272] Updated weights for policy 0, policy_version 13520 (0.0007) [2023-03-06 14:45:47,347][04272] Updated weights for policy 0, policy_version 13530 (0.0006) [2023-03-06 14:45:48,165][04272] Updated weights for policy 0, policy_version 13540 (0.0007) [2023-03-06 14:45:48,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12561.1, 300 sec: 12544.9). Total num frames: 13874176. Throughput: 0: 12574.3. Samples: 13866376. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:45:48,941][03942] Avg episode reward: [(0, '742.429')] [2023-03-06 14:45:48,990][04272] Updated weights for policy 0, policy_version 13550 (0.0006) [2023-03-06 14:45:49,774][04272] Updated weights for policy 0, policy_version 13560 (0.0007) [2023-03-06 14:45:50,594][04272] Updated weights for policy 0, policy_version 13570 (0.0006) [2023-03-06 14:45:51,406][04272] Updated weights for policy 0, policy_version 13580 (0.0006) [2023-03-06 14:45:52,205][04272] Updated weights for policy 0, policy_version 13590 (0.0007) [2023-03-06 14:45:53,040][04272] Updated weights for policy 0, policy_version 13600 (0.0007) [2023-03-06 14:45:53,844][04272] Updated weights for policy 0, policy_version 13610 (0.0007) [2023-03-06 14:45:53,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12578.1, 300 sec: 12548.3). Total num frames: 13937664. Throughput: 0: 12576.4. Samples: 13904307. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:45:53,941][03942] Avg episode reward: [(0, '717.861')] [2023-03-06 14:45:54,633][04272] Updated weights for policy 0, policy_version 13620 (0.0006) [2023-03-06 14:45:55,478][04272] Updated weights for policy 0, policy_version 13630 (0.0006) [2023-03-06 14:45:56,307][04272] Updated weights for policy 0, policy_version 13640 (0.0006) [2023-03-06 14:45:57,118][04272] Updated weights for policy 0, policy_version 13650 (0.0007) [2023-03-06 14:45:57,938][04272] Updated weights for policy 0, policy_version 13660 (0.0006) [2023-03-06 14:45:58,747][04272] Updated weights for policy 0, policy_version 13670 (0.0007) [2023-03-06 14:45:58,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12578.1, 300 sec: 12548.3). Total num frames: 14000128. Throughput: 0: 12579.1. Samples: 13979643. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:45:58,941][03942] Avg episode reward: [(0, '732.555')] [2023-03-06 14:45:59,548][04272] Updated weights for policy 0, policy_version 13680 (0.0007) [2023-03-06 14:46:00,359][04272] Updated weights for policy 0, policy_version 13690 (0.0007) [2023-03-06 14:46:01,181][04272] Updated weights for policy 0, policy_version 13700 (0.0007) [2023-03-06 14:46:01,984][04272] Updated weights for policy 0, policy_version 13710 (0.0007) [2023-03-06 14:46:02,791][04272] Updated weights for policy 0, policy_version 13720 (0.0007) [2023-03-06 14:46:03,612][04272] Updated weights for policy 0, policy_version 13730 (0.0007) [2023-03-06 14:46:03,941][03942] Fps is (10 sec: 12492.7, 60 sec: 12578.1, 300 sec: 12555.3). Total num frames: 14062592. Throughput: 0: 12580.3. Samples: 14055270. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:46:03,941][03942] Avg episode reward: [(0, '663.957')] [2023-03-06 14:46:04,439][04272] Updated weights for policy 0, policy_version 13740 (0.0006) [2023-03-06 14:46:05,251][04272] Updated weights for policy 0, policy_version 13750 (0.0007) [2023-03-06 14:46:06,073][04272] Updated weights for policy 0, policy_version 13760 (0.0006) [2023-03-06 14:46:06,882][04272] Updated weights for policy 0, policy_version 13770 (0.0007) [2023-03-06 14:46:07,685][04272] Updated weights for policy 0, policy_version 13780 (0.0006) [2023-03-06 14:46:08,519][04272] Updated weights for policy 0, policy_version 13790 (0.0006) [2023-03-06 14:46:08,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12578.1, 300 sec: 12558.7). Total num frames: 14126080. Throughput: 0: 12581.5. Samples: 14092772. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:46:08,941][03942] Avg episode reward: [(0, '736.082')] [2023-03-06 14:46:09,346][04272] Updated weights for policy 0, policy_version 13800 (0.0007) [2023-03-06 14:46:10,179][04272] Updated weights for policy 0, policy_version 13810 (0.0006) [2023-03-06 14:46:10,997][04272] Updated weights for policy 0, policy_version 13820 (0.0006) [2023-03-06 14:46:11,813][04272] Updated weights for policy 0, policy_version 13830 (0.0007) [2023-03-06 14:46:12,631][04272] Updated weights for policy 0, policy_version 13840 (0.0008) [2023-03-06 14:46:13,458][04272] Updated weights for policy 0, policy_version 13850 (0.0006) [2023-03-06 14:46:13,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12578.1, 300 sec: 12558.8). Total num frames: 14188544. Throughput: 0: 12571.7. Samples: 14167851. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:46:13,941][03942] Avg episode reward: [(0, '748.547')] [2023-03-06 14:46:14,261][04272] Updated weights for policy 0, policy_version 13860 (0.0006) [2023-03-06 14:46:15,085][04272] Updated weights for policy 0, policy_version 13870 (0.0006) [2023-03-06 14:46:15,883][04272] Updated weights for policy 0, policy_version 13880 (0.0006) [2023-03-06 14:46:16,698][04272] Updated weights for policy 0, policy_version 13890 (0.0006) [2023-03-06 14:46:17,521][04272] Updated weights for policy 0, policy_version 13900 (0.0006) [2023-03-06 14:46:18,340][04272] Updated weights for policy 0, policy_version 13910 (0.0006) [2023-03-06 14:46:18,940][03942] Fps is (10 sec: 12493.0, 60 sec: 12561.1, 300 sec: 12555.3). Total num frames: 14251008. Throughput: 0: 12562.3. Samples: 14242974. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:46:18,941][03942] Avg episode reward: [(0, '767.472')] [2023-03-06 14:46:19,153][04272] Updated weights for policy 0, policy_version 13920 (0.0006) [2023-03-06 14:46:19,990][04272] Updated weights for policy 0, policy_version 13930 (0.0007) [2023-03-06 14:46:20,795][04272] Updated weights for policy 0, policy_version 13940 (0.0007) [2023-03-06 14:46:21,614][04272] Updated weights for policy 0, policy_version 13950 (0.0007) [2023-03-06 14:46:22,447][04272] Updated weights for policy 0, policy_version 13960 (0.0007) [2023-03-06 14:46:23,254][04272] Updated weights for policy 0, policy_version 13970 (0.0006) [2023-03-06 14:46:23,941][03942] Fps is (10 sec: 12492.7, 60 sec: 12561.1, 300 sec: 12555.3). Total num frames: 14313472. Throughput: 0: 12553.3. Samples: 14280552. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:46:23,941][03942] Avg episode reward: [(0, '750.610')] [2023-03-06 14:46:24,065][04272] Updated weights for policy 0, policy_version 13980 (0.0006) [2023-03-06 14:46:24,861][04272] Updated weights for policy 0, policy_version 13990 (0.0007) [2023-03-06 14:46:25,683][04272] Updated weights for policy 0, policy_version 14000 (0.0007) [2023-03-06 14:46:26,507][04272] Updated weights for policy 0, policy_version 14010 (0.0006) [2023-03-06 14:46:27,317][04272] Updated weights for policy 0, policy_version 14020 (0.0006) [2023-03-06 14:46:28,130][04272] Updated weights for policy 0, policy_version 14030 (0.0006) [2023-03-06 14:46:28,940][03942] Fps is (10 sec: 12492.8, 60 sec: 12544.0, 300 sec: 12551.8). Total num frames: 14375936. Throughput: 0: 12550.5. Samples: 14355885. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:46:28,941][03942] Avg episode reward: [(0, '717.188')] [2023-03-06 14:46:28,943][04272] Updated weights for policy 0, policy_version 14040 (0.0005) [2023-03-06 14:46:29,767][04272] Updated weights for policy 0, policy_version 14050 (0.0006) [2023-03-06 14:46:30,563][04272] Updated weights for policy 0, policy_version 14060 (0.0006) [2023-03-06 14:46:31,377][04272] Updated weights for policy 0, policy_version 14070 (0.0006) [2023-03-06 14:46:32,198][04272] Updated weights for policy 0, policy_version 14080 (0.0006) [2023-03-06 14:46:33,029][04272] Updated weights for policy 0, policy_version 14090 (0.0006) [2023-03-06 14:46:33,838][04272] Updated weights for policy 0, policy_version 14100 (0.0006) [2023-03-06 14:46:33,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12561.1, 300 sec: 12555.3). Total num frames: 14439424. Throughput: 0: 12560.0. Samples: 14431579. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:46:33,952][03942] Avg episode reward: [(0, '655.753')] [2023-03-06 14:46:34,650][04272] Updated weights for policy 0, policy_version 14110 (0.0007) [2023-03-06 14:46:35,460][04272] Updated weights for policy 0, policy_version 14120 (0.0006) [2023-03-06 14:46:36,267][04272] Updated weights for policy 0, policy_version 14130 (0.0006) [2023-03-06 14:46:37,086][04272] Updated weights for policy 0, policy_version 14140 (0.0006) [2023-03-06 14:46:37,906][04272] Updated weights for policy 0, policy_version 14150 (0.0006) [2023-03-06 14:46:38,731][04272] Updated weights for policy 0, policy_version 14160 (0.0007) [2023-03-06 14:46:38,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12561.1, 300 sec: 12551.8). Total num frames: 14501888. Throughput: 0: 12551.2. Samples: 14469111. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:46:38,952][03942] Avg episode reward: [(0, '723.795')] [2023-03-06 14:46:39,549][04272] Updated weights for policy 0, policy_version 14170 (0.0007) [2023-03-06 14:46:40,361][04272] Updated weights for policy 0, policy_version 14180 (0.0007) [2023-03-06 14:46:41,186][04272] Updated weights for policy 0, policy_version 14190 (0.0007) [2023-03-06 14:46:41,996][04272] Updated weights for policy 0, policy_version 14200 (0.0006) [2023-03-06 14:46:42,820][04272] Updated weights for policy 0, policy_version 14210 (0.0006) [2023-03-06 14:46:43,633][04272] Updated weights for policy 0, policy_version 14220 (0.0006) [2023-03-06 14:46:43,941][03942] Fps is (10 sec: 12492.8, 60 sec: 12544.0, 300 sec: 12551.8). Total num frames: 14564352. Throughput: 0: 12548.2. Samples: 14544315. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:46:43,952][03942] Avg episode reward: [(0, '760.549')] [2023-03-06 14:46:44,460][04272] Updated weights for policy 0, policy_version 14230 (0.0006) [2023-03-06 14:46:45,265][04272] Updated weights for policy 0, policy_version 14240 (0.0007) [2023-03-06 14:46:46,082][04272] Updated weights for policy 0, policy_version 14250 (0.0006) [2023-03-06 14:46:46,909][04272] Updated weights for policy 0, policy_version 14260 (0.0006) [2023-03-06 14:46:47,731][04272] Updated weights for policy 0, policy_version 14270 (0.0006) [2023-03-06 14:46:48,543][04272] Updated weights for policy 0, policy_version 14280 (0.0006) [2023-03-06 14:46:48,940][03942] Fps is (10 sec: 12492.9, 60 sec: 12544.0, 300 sec: 12551.8). Total num frames: 14626816. Throughput: 0: 12538.0. Samples: 14619478. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:46:48,951][03942] Avg episode reward: [(0, '799.695')] [2023-03-06 14:46:48,955][04221] Saving new best policy, reward=799.695! 
[2023-03-06 14:46:49,354][04272] Updated weights for policy 0, policy_version 14290 (0.0006) [2023-03-06 14:46:50,182][04272] Updated weights for policy 0, policy_version 14300 (0.0006) [2023-03-06 14:46:50,997][04272] Updated weights for policy 0, policy_version 14310 (0.0006) [2023-03-06 14:46:51,808][04272] Updated weights for policy 0, policy_version 14320 (0.0006) [2023-03-06 14:46:52,623][04272] Updated weights for policy 0, policy_version 14330 (0.0006) [2023-03-06 14:46:53,446][04272] Updated weights for policy 0, policy_version 14340 (0.0006) [2023-03-06 14:46:53,940][03942] Fps is (10 sec: 12595.4, 60 sec: 12544.0, 300 sec: 12551.8). Total num frames: 14690304. Throughput: 0: 12536.3. Samples: 14656903. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:46:53,952][03942] Avg episode reward: [(0, '771.090')] [2023-03-06 14:46:54,262][04272] Updated weights for policy 0, policy_version 14350 (0.0006) [2023-03-06 14:46:55,099][04272] Updated weights for policy 0, policy_version 14360 (0.0007) [2023-03-06 14:46:55,940][04272] Updated weights for policy 0, policy_version 14370 (0.0007) [2023-03-06 14:46:56,735][04272] Updated weights for policy 0, policy_version 14380 (0.0007) [2023-03-06 14:46:57,571][04272] Updated weights for policy 0, policy_version 14390 (0.0006) [2023-03-06 14:46:58,389][04272] Updated weights for policy 0, policy_version 14400 (0.0007) [2023-03-06 14:46:58,941][03942] Fps is (10 sec: 12492.7, 60 sec: 12526.9, 300 sec: 12544.9). Total num frames: 14751744. Throughput: 0: 12531.2. Samples: 14731756. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:46:58,952][03942] Avg episode reward: [(0, '792.224')] [2023-03-06 14:46:59,206][04272] Updated weights for policy 0, policy_version 14410 (0.0006) [2023-03-06 14:47:00,018][04272] Updated weights for policy 0, policy_version 14420 (0.0006) [2023-03-06 14:47:00,864][04272] Updated weights for policy 0, policy_version 14430 (0.0006) [2023-03-06 14:47:01,679][04272] Updated weights for policy 0, policy_version 14440 (0.0007) [2023-03-06 14:47:02,482][04272] Updated weights for policy 0, policy_version 14450 (0.0007) [2023-03-06 14:47:03,323][04272] Updated weights for policy 0, policy_version 14460 (0.0006) [2023-03-06 14:47:03,941][03942] Fps is (10 sec: 12390.2, 60 sec: 12526.9, 300 sec: 12544.9). Total num frames: 14814208. Throughput: 0: 12526.2. Samples: 14806655. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:47:03,952][03942] Avg episode reward: [(0, '788.391')] [2023-03-06 14:47:04,145][04272] Updated weights for policy 0, policy_version 14470 (0.0006) [2023-03-06 14:47:04,945][04272] Updated weights for policy 0, policy_version 14480 (0.0007) [2023-03-06 14:47:05,789][04272] Updated weights for policy 0, policy_version 14490 (0.0007) [2023-03-06 14:47:06,593][04272] Updated weights for policy 0, policy_version 14500 (0.0006) [2023-03-06 14:47:07,380][04272] Updated weights for policy 0, policy_version 14510 (0.0007) [2023-03-06 14:47:08,203][04272] Updated weights for policy 0, policy_version 14520 (0.0006) [2023-03-06 14:47:08,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12526.9, 300 sec: 12548.3). Total num frames: 14877696. Throughput: 0: 12526.1. Samples: 14844228. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:47:08,952][03942] Avg episode reward: [(0, '756.303')] [2023-03-06 14:47:08,956][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000014529_14877696.pth... 
[2023-03-06 14:47:08,986][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000011587_11865088.pth [2023-03-06 14:47:09,025][04272] Updated weights for policy 0, policy_version 14530 (0.0006) [2023-03-06 14:47:09,861][04272] Updated weights for policy 0, policy_version 14540 (0.0006) [2023-03-06 14:47:10,659][04272] Updated weights for policy 0, policy_version 14550 (0.0005) [2023-03-06 14:47:11,469][04272] Updated weights for policy 0, policy_version 14560 (0.0007) [2023-03-06 14:47:12,291][04272] Updated weights for policy 0, policy_version 14570 (0.0006) [2023-03-06 14:47:13,113][04272] Updated weights for policy 0, policy_version 14580 (0.0006) [2023-03-06 14:47:13,906][04272] Updated weights for policy 0, policy_version 14590 (0.0007) [2023-03-06 14:47:13,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12526.9, 300 sec: 12548.3). Total num frames: 14940160. Throughput: 0: 12524.2. Samples: 14919475. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:47:13,941][03942] Avg episode reward: [(0, '770.515')] [2023-03-06 14:47:14,730][04272] Updated weights for policy 0, policy_version 14600 (0.0006) [2023-03-06 14:47:15,566][04272] Updated weights for policy 0, policy_version 14610 (0.0007) [2023-03-06 14:47:16,378][04272] Updated weights for policy 0, policy_version 14620 (0.0006) [2023-03-06 14:47:17,192][04272] Updated weights for policy 0, policy_version 14630 (0.0007) [2023-03-06 14:47:18,021][04272] Updated weights for policy 0, policy_version 14640 (0.0006) [2023-03-06 14:47:18,818][04272] Updated weights for policy 0, policy_version 14650 (0.0006) [2023-03-06 14:47:18,941][03942] Fps is (10 sec: 12492.8, 60 sec: 12526.9, 300 sec: 12548.3). Total num frames: 15002624. Throughput: 0: 12514.5. Samples: 14994730. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:47:18,941][03942] Avg episode reward: [(0, '712.790')] [2023-03-06 14:47:19,630][04272] Updated weights for policy 0, policy_version 14660 (0.0006) [2023-03-06 14:47:20,440][04272] Updated weights for policy 0, policy_version 14670 (0.0006) [2023-03-06 14:47:21,264][04272] Updated weights for policy 0, policy_version 14680 (0.0006) [2023-03-06 14:47:22,065][04272] Updated weights for policy 0, policy_version 14690 (0.0006) [2023-03-06 14:47:22,894][04272] Updated weights for policy 0, policy_version 14700 (0.0006) [2023-03-06 14:47:23,721][04272] Updated weights for policy 0, policy_version 14710 (0.0006) [2023-03-06 14:47:23,940][03942] Fps is (10 sec: 12492.8, 60 sec: 12527.0, 300 sec: 12544.9). Total num frames: 15065088. Throughput: 0: 12522.8. Samples: 15032638. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:47:23,941][03942] Avg episode reward: [(0, '629.924')] [2023-03-06 14:47:24,526][04272] Updated weights for policy 0, policy_version 14720 (0.0006) [2023-03-06 14:47:25,337][04272] Updated weights for policy 0, policy_version 14730 (0.0006) [2023-03-06 14:47:26,151][04272] Updated weights for policy 0, policy_version 14740 (0.0007) [2023-03-06 14:47:26,975][04272] Updated weights for policy 0, policy_version 14750 (0.0006) [2023-03-06 14:47:27,773][04272] Updated weights for policy 0, policy_version 14760 (0.0006) [2023-03-06 14:47:28,573][04272] Updated weights for policy 0, policy_version 14770 (0.0006) [2023-03-06 14:47:28,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12544.0, 300 sec: 12548.3). Total num frames: 15128576. Throughput: 0: 12525.4. Samples: 15107956. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 14:47:28,941][03942] Avg episode reward: [(0, '721.885')] [2023-03-06 14:47:29,413][04272] Updated weights for policy 0, policy_version 14780 (0.0007) [2023-03-06 14:47:30,218][04272] Updated weights for policy 0, policy_version 14790 (0.0006) [2023-03-06 14:47:31,032][04272] Updated weights for policy 0, policy_version 14800 (0.0007) [2023-03-06 14:47:31,835][04272] Updated weights for policy 0, policy_version 14810 (0.0006) [2023-03-06 14:47:32,655][04272] Updated weights for policy 0, policy_version 14820 (0.0006) [2023-03-06 14:47:33,476][04272] Updated weights for policy 0, policy_version 14830 (0.0007) [2023-03-06 14:47:33,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12526.9, 300 sec: 12544.9). Total num frames: 15191040. Throughput: 0: 12533.7. Samples: 15183494. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 14:47:33,941][03942] Avg episode reward: [(0, '748.767')] [2023-03-06 14:47:34,304][04272] Updated weights for policy 0, policy_version 14840 (0.0006) [2023-03-06 14:47:35,107][04272] Updated weights for policy 0, policy_version 14850 (0.0007) [2023-03-06 14:47:35,926][04272] Updated weights for policy 0, policy_version 14860 (0.0006) [2023-03-06 14:47:36,739][04272] Updated weights for policy 0, policy_version 14870 (0.0006) [2023-03-06 14:47:37,555][04272] Updated weights for policy 0, policy_version 14880 (0.0006) [2023-03-06 14:47:38,348][04272] Updated weights for policy 0, policy_version 14890 (0.0006) [2023-03-06 14:47:38,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12544.0, 300 sec: 12548.3). Total num frames: 15254528. Throughput: 0: 12536.7. Samples: 15221054. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:47:38,941][03942] Avg episode reward: [(0, '723.573')] [2023-03-06 14:47:39,169][04272] Updated weights for policy 0, policy_version 14900 (0.0006) [2023-03-06 14:47:39,987][04272] Updated weights for policy 0, policy_version 14910 (0.0008) [2023-03-06 14:47:40,795][04272] Updated weights for policy 0, policy_version 14920 (0.0006) [2023-03-06 14:47:41,627][04272] Updated weights for policy 0, policy_version 14930 (0.0006) [2023-03-06 14:47:42,426][04272] Updated weights for policy 0, policy_version 14940 (0.0007) [2023-03-06 14:47:43,237][04272] Updated weights for policy 0, policy_version 14950 (0.0006) [2023-03-06 14:47:43,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12544.0, 300 sec: 12548.3). Total num frames: 15316992. Throughput: 0: 12552.6. Samples: 15296620. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:47:43,941][03942] Avg episode reward: [(0, '672.858')] [2023-03-06 14:47:44,069][04272] Updated weights for policy 0, policy_version 14960 (0.0006) [2023-03-06 14:47:44,877][04272] Updated weights for policy 0, policy_version 14970 (0.0006) [2023-03-06 14:47:45,696][04272] Updated weights for policy 0, policy_version 14980 (0.0007) [2023-03-06 14:47:46,517][04272] Updated weights for policy 0, policy_version 14990 (0.0006) [2023-03-06 14:47:47,324][04272] Updated weights for policy 0, policy_version 15000 (0.0007) [2023-03-06 14:47:48,125][04272] Updated weights for policy 0, policy_version 15010 (0.0006) [2023-03-06 14:47:48,941][03942] Fps is (10 sec: 12492.8, 60 sec: 12544.0, 300 sec: 12548.3). Total num frames: 15379456. Throughput: 0: 12561.1. Samples: 15371904. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:47:48,941][03942] Avg episode reward: [(0, '654.323')] [2023-03-06 14:47:48,946][04272] Updated weights for policy 0, policy_version 15020 (0.0006) [2023-03-06 14:47:49,769][04272] Updated weights for policy 0, policy_version 15030 (0.0006) [2023-03-06 14:47:50,586][04272] Updated weights for policy 0, policy_version 15040 (0.0006) [2023-03-06 14:47:51,409][04272] Updated weights for policy 0, policy_version 15050 (0.0006) [2023-03-06 14:47:52,219][04272] Updated weights for policy 0, policy_version 15060 (0.0006) [2023-03-06 14:47:53,042][04272] Updated weights for policy 0, policy_version 15070 (0.0007) [2023-03-06 14:47:53,845][04272] Updated weights for policy 0, policy_version 15080 (0.0007) [2023-03-06 14:47:53,941][03942] Fps is (10 sec: 12595.0, 60 sec: 12544.0, 300 sec: 12548.3). Total num frames: 15442944. Throughput: 0: 12562.0. Samples: 15409517. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:47:53,941][03942] Avg episode reward: [(0, '686.699')] [2023-03-06 14:47:54,669][04272] Updated weights for policy 0, policy_version 15090 (0.0006) [2023-03-06 14:47:55,494][04272] Updated weights for policy 0, policy_version 15100 (0.0006) [2023-03-06 14:47:56,301][04272] Updated weights for policy 0, policy_version 15110 (0.0006) [2023-03-06 14:47:57,129][04272] Updated weights for policy 0, policy_version 15120 (0.0006) [2023-03-06 14:47:57,932][04272] Updated weights for policy 0, policy_version 15130 (0.0006) [2023-03-06 14:47:58,732][04272] Updated weights for policy 0, policy_version 15140 (0.0006) [2023-03-06 14:47:58,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12561.1, 300 sec: 12551.8). Total num frames: 15505408. Throughput: 0: 12565.6. Samples: 15484930. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:47:58,941][03942] Avg episode reward: [(0, '575.088')] [2023-03-06 14:47:59,556][04272] Updated weights for policy 0, policy_version 15150 (0.0006) [2023-03-06 14:48:00,365][04272] Updated weights for policy 0, policy_version 15160 (0.0006) [2023-03-06 14:48:01,165][04272] Updated weights for policy 0, policy_version 15170 (0.0006) [2023-03-06 14:48:01,995][04272] Updated weights for policy 0, policy_version 15180 (0.0006) [2023-03-06 14:48:02,818][04272] Updated weights for policy 0, policy_version 15190 (0.0006) [2023-03-06 14:48:03,617][04272] Updated weights for policy 0, policy_version 15200 (0.0007) [2023-03-06 14:48:03,940][03942] Fps is (10 sec: 12493.0, 60 sec: 12561.1, 300 sec: 12551.8). Total num frames: 15567872. Throughput: 0: 12571.8. Samples: 15560458. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:48:03,941][03942] Avg episode reward: [(0, '711.433')] [2023-03-06 14:48:04,437][04272] Updated weights for policy 0, policy_version 15210 (0.0006) [2023-03-06 14:48:05,251][04272] Updated weights for policy 0, policy_version 15220 (0.0006) [2023-03-06 14:48:06,054][04272] Updated weights for policy 0, policy_version 15230 (0.0006) [2023-03-06 14:48:06,880][04272] Updated weights for policy 0, policy_version 15240 (0.0006) [2023-03-06 14:48:07,693][04272] Updated weights for policy 0, policy_version 15250 (0.0006) [2023-03-06 14:48:08,505][04272] Updated weights for policy 0, policy_version 15260 (0.0007) [2023-03-06 14:48:08,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12561.1, 300 sec: 12555.3). Total num frames: 15631360. Throughput: 0: 12569.0. Samples: 15598242. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:48:08,941][03942] Avg episode reward: [(0, '731.528')] [2023-03-06 14:48:09,316][04272] Updated weights for policy 0, policy_version 15270 (0.0006) [2023-03-06 14:48:10,135][04272] Updated weights for policy 0, policy_version 15280 (0.0007) [2023-03-06 14:48:10,942][04272] Updated weights for policy 0, policy_version 15290 (0.0006) [2023-03-06 14:48:11,762][04272] Updated weights for policy 0, policy_version 15300 (0.0006) [2023-03-06 14:48:12,571][04272] Updated weights for policy 0, policy_version 15310 (0.0007) [2023-03-06 14:48:13,389][04272] Updated weights for policy 0, policy_version 15320 (0.0006) [2023-03-06 14:48:13,941][03942] Fps is (10 sec: 12595.0, 60 sec: 12561.0, 300 sec: 12551.8). Total num frames: 15693824. Throughput: 0: 12571.2. Samples: 15673658. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:48:13,941][03942] Avg episode reward: [(0, '743.477')] [2023-03-06 14:48:14,201][04272] Updated weights for policy 0, policy_version 15330 (0.0006) [2023-03-06 14:48:15,021][04272] Updated weights for policy 0, policy_version 15340 (0.0007) [2023-03-06 14:48:15,838][04272] Updated weights for policy 0, policy_version 15350 (0.0007) [2023-03-06 14:48:16,649][04272] Updated weights for policy 0, policy_version 15360 (0.0006) [2023-03-06 14:48:17,460][04272] Updated weights for policy 0, policy_version 15370 (0.0006) [2023-03-06 14:48:18,287][04272] Updated weights for policy 0, policy_version 15380 (0.0006) [2023-03-06 14:48:18,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12578.2, 300 sec: 12558.8). Total num frames: 15757312. Throughput: 0: 12567.4. Samples: 15749026. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:48:18,941][03942] Avg episode reward: [(0, '779.733')] [2023-03-06 14:48:19,097][04272] Updated weights for policy 0, policy_version 15390 (0.0006) [2023-03-06 14:48:19,917][04272] Updated weights for policy 0, policy_version 15400 (0.0006) [2023-03-06 14:48:20,716][04272] Updated weights for policy 0, policy_version 15410 (0.0007) [2023-03-06 14:48:21,541][04272] Updated weights for policy 0, policy_version 15420 (0.0007) [2023-03-06 14:48:22,346][04272] Updated weights for policy 0, policy_version 15430 (0.0006) [2023-03-06 14:48:23,162][04272] Updated weights for policy 0, policy_version 15440 (0.0006) [2023-03-06 14:48:23,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12578.1, 300 sec: 12555.3). Total num frames: 15819776. Throughput: 0: 12573.2. Samples: 15786848. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:48:23,941][03942] Avg episode reward: [(0, '793.211')] [2023-03-06 14:48:23,975][04272] Updated weights for policy 0, policy_version 15450 (0.0006) [2023-03-06 14:48:24,798][04272] Updated weights for policy 0, policy_version 15460 (0.0006) [2023-03-06 14:48:25,601][04272] Updated weights for policy 0, policy_version 15470 (0.0006) [2023-03-06 14:48:26,417][04272] Updated weights for policy 0, policy_version 15480 (0.0007) [2023-03-06 14:48:27,258][04272] Updated weights for policy 0, policy_version 15490 (0.0006) [2023-03-06 14:48:28,085][04272] Updated weights for policy 0, policy_version 15500 (0.0007) [2023-03-06 14:48:28,874][04272] Updated weights for policy 0, policy_version 15510 (0.0006) [2023-03-06 14:48:28,941][03942] Fps is (10 sec: 12492.7, 60 sec: 12561.1, 300 sec: 12555.3). Total num frames: 15882240. Throughput: 0: 12566.9. Samples: 15862132. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:48:28,941][03942] Avg episode reward: [(0, '806.797')] [2023-03-06 14:48:28,945][04221] Saving new best policy, reward=806.797! 
[2023-03-06 14:48:29,702][04272] Updated weights for policy 0, policy_version 15520 (0.0006) [2023-03-06 14:48:30,519][04272] Updated weights for policy 0, policy_version 15530 (0.0006) [2023-03-06 14:48:31,335][04272] Updated weights for policy 0, policy_version 15540 (0.0007) [2023-03-06 14:48:32,158][04272] Updated weights for policy 0, policy_version 15550 (0.0007) [2023-03-06 14:48:32,971][04272] Updated weights for policy 0, policy_version 15560 (0.0007) [2023-03-06 14:48:33,771][04272] Updated weights for policy 0, policy_version 15570 (0.0006) [2023-03-06 14:48:33,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12578.1, 300 sec: 12558.8). Total num frames: 15945728. Throughput: 0: 12568.6. Samples: 15937492. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:48:33,941][03942] Avg episode reward: [(0, '775.816')] [2023-03-06 14:48:34,586][04272] Updated weights for policy 0, policy_version 15580 (0.0007) [2023-03-06 14:48:35,390][04272] Updated weights for policy 0, policy_version 15590 (0.0007) [2023-03-06 14:48:36,197][04272] Updated weights for policy 0, policy_version 15600 (0.0006) [2023-03-06 14:48:37,014][04272] Updated weights for policy 0, policy_version 15610 (0.0006) [2023-03-06 14:48:37,826][04272] Updated weights for policy 0, policy_version 15620 (0.0006) [2023-03-06 14:48:38,635][04272] Updated weights for policy 0, policy_version 15630 (0.0006) [2023-03-06 14:48:38,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12561.1, 300 sec: 12558.8). Total num frames: 16008192. Throughput: 0: 12573.1. Samples: 15975304. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:48:38,941][03942] Avg episode reward: [(0, '755.387')] [2023-03-06 14:48:39,457][04272] Updated weights for policy 0, policy_version 15640 (0.0006) [2023-03-06 14:48:40,284][04272] Updated weights for policy 0, policy_version 15650 (0.0006) [2023-03-06 14:48:41,093][04272] Updated weights for policy 0, policy_version 15660 (0.0007) [2023-03-06 14:48:41,917][04272] Updated weights for policy 0, policy_version 15670 (0.0007) [2023-03-06 14:48:42,713][04272] Updated weights for policy 0, policy_version 15680 (0.0006) [2023-03-06 14:48:43,520][04272] Updated weights for policy 0, policy_version 15690 (0.0006) [2023-03-06 14:48:43,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12578.1, 300 sec: 12558.8). Total num frames: 16071680. Throughput: 0: 12573.8. Samples: 16050749. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:48:43,941][03942] Avg episode reward: [(0, '753.858')] [2023-03-06 14:48:44,363][04272] Updated weights for policy 0, policy_version 15700 (0.0006) [2023-03-06 14:48:45,182][04272] Updated weights for policy 0, policy_version 15710 (0.0006) [2023-03-06 14:48:46,001][04272] Updated weights for policy 0, policy_version 15720 (0.0007) [2023-03-06 14:48:46,805][04272] Updated weights for policy 0, policy_version 15730 (0.0007) [2023-03-06 14:48:47,616][04272] Updated weights for policy 0, policy_version 15740 (0.0005) [2023-03-06 14:48:48,421][04272] Updated weights for policy 0, policy_version 15750 (0.0005) [2023-03-06 14:48:48,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12578.1, 300 sec: 12558.7). Total num frames: 16134144. Throughput: 0: 12570.7. Samples: 16126142. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:48:48,941][03942] Avg episode reward: [(0, '733.243')] [2023-03-06 14:48:49,226][04272] Updated weights for policy 0, policy_version 15760 (0.0006) [2023-03-06 14:48:50,042][04272] Updated weights for policy 0, policy_version 15770 (0.0006) [2023-03-06 14:48:50,862][04272] Updated weights for policy 0, policy_version 15780 (0.0006) [2023-03-06 14:48:51,672][04272] Updated weights for policy 0, policy_version 15790 (0.0006) [2023-03-06 14:48:52,509][04272] Updated weights for policy 0, policy_version 15800 (0.0006) [2023-03-06 14:48:53,312][04272] Updated weights for policy 0, policy_version 15810 (0.0006) [2023-03-06 14:48:53,941][03942] Fps is (10 sec: 12492.7, 60 sec: 12561.1, 300 sec: 12555.3). Total num frames: 16196608. Throughput: 0: 12573.2. Samples: 16164038. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:48:53,941][03942] Avg episode reward: [(0, '741.009')] [2023-03-06 14:48:54,121][04272] Updated weights for policy 0, policy_version 15820 (0.0007) [2023-03-06 14:48:54,946][04272] Updated weights for policy 0, policy_version 15830 (0.0006) [2023-03-06 14:48:55,766][04272] Updated weights for policy 0, policy_version 15840 (0.0006) [2023-03-06 14:48:56,583][04272] Updated weights for policy 0, policy_version 15850 (0.0007) [2023-03-06 14:48:57,398][04272] Updated weights for policy 0, policy_version 15860 (0.0006) [2023-03-06 14:48:58,219][04272] Updated weights for policy 0, policy_version 15870 (0.0006) [2023-03-06 14:48:58,941][03942] Fps is (10 sec: 12492.8, 60 sec: 12561.1, 300 sec: 12555.3). Total num frames: 16259072. Throughput: 0: 12565.1. Samples: 16239087. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:48:58,941][03942] Avg episode reward: [(0, '743.813')] [2023-03-06 14:48:59,031][04272] Updated weights for policy 0, policy_version 15880 (0.0007) [2023-03-06 14:48:59,862][04272] Updated weights for policy 0, policy_version 15890 (0.0006) [2023-03-06 14:49:00,666][04272] Updated weights for policy 0, policy_version 15900 (0.0007) [2023-03-06 14:49:01,483][04272] Updated weights for policy 0, policy_version 15910 (0.0007) [2023-03-06 14:49:02,289][04272] Updated weights for policy 0, policy_version 15920 (0.0006) [2023-03-06 14:49:03,110][04272] Updated weights for policy 0, policy_version 15930 (0.0006) [2023-03-06 14:49:03,922][04272] Updated weights for policy 0, policy_version 15940 (0.0006) [2023-03-06 14:49:03,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12578.1, 300 sec: 12558.8). Total num frames: 16322560. Throughput: 0: 12564.9. Samples: 16314445. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:49:03,941][03942] Avg episode reward: [(0, '727.883')] [2023-03-06 14:49:04,734][04272] Updated weights for policy 0, policy_version 15950 (0.0006) [2023-03-06 14:49:05,545][04272] Updated weights for policy 0, policy_version 15960 (0.0006) [2023-03-06 14:49:06,356][04272] Updated weights for policy 0, policy_version 15970 (0.0006) [2023-03-06 14:49:07,175][04272] Updated weights for policy 0, policy_version 15980 (0.0006) [2023-03-06 14:49:07,995][04272] Updated weights for policy 0, policy_version 15990 (0.0006) [2023-03-06 14:49:08,797][04272] Updated weights for policy 0, policy_version 16000 (0.0007) [2023-03-06 14:49:08,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12561.1, 300 sec: 12558.8). Total num frames: 16385024. Throughput: 0: 12563.7. Samples: 16352215. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:49:08,941][03942] Avg episode reward: [(0, '739.594')] [2023-03-06 14:49:08,952][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000016002_16386048.pth... [2023-03-06 14:49:08,981][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000013058_13371392.pth [2023-03-06 14:49:09,617][04272] Updated weights for policy 0, policy_version 16010 (0.0007) [2023-03-06 14:49:10,434][04272] Updated weights for policy 0, policy_version 16020 (0.0006) [2023-03-06 14:49:11,249][04272] Updated weights for policy 0, policy_version 16030 (0.0006) [2023-03-06 14:49:12,055][04272] Updated weights for policy 0, policy_version 16040 (0.0006) [2023-03-06 14:49:12,872][04272] Updated weights for policy 0, policy_version 16050 (0.0006) [2023-03-06 14:49:13,685][04272] Updated weights for policy 0, policy_version 16060 (0.0006) [2023-03-06 14:49:13,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12578.1, 300 sec: 12558.7). Total num frames: 16448512. Throughput: 0: 12569.5. Samples: 16427758. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:49:13,941][03942] Avg episode reward: [(0, '689.664')] [2023-03-06 14:49:14,497][04272] Updated weights for policy 0, policy_version 16070 (0.0006) [2023-03-06 14:49:15,311][04272] Updated weights for policy 0, policy_version 16080 (0.0006) [2023-03-06 14:49:16,127][04272] Updated weights for policy 0, policy_version 16090 (0.0007) [2023-03-06 14:49:16,935][04272] Updated weights for policy 0, policy_version 16100 (0.0006) [2023-03-06 14:49:17,741][04272] Updated weights for policy 0, policy_version 16110 (0.0006) [2023-03-06 14:49:18,572][04272] Updated weights for policy 0, policy_version 16120 (0.0006) [2023-03-06 14:49:18,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12561.1, 300 sec: 12562.2). Total num frames: 16510976. Throughput: 0: 12575.8. Samples: 16503402. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-06 14:49:18,941][03942] Avg episode reward: [(0, '690.991')] [2023-03-06 14:49:19,369][04272] Updated weights for policy 0, policy_version 16130 (0.0006) [2023-03-06 14:49:20,179][04272] Updated weights for policy 0, policy_version 16140 (0.0006) [2023-03-06 14:49:21,005][04272] Updated weights for policy 0, policy_version 16150 (0.0006) [2023-03-06 14:49:21,802][04272] Updated weights for policy 0, policy_version 16160 (0.0006) [2023-03-06 14:49:22,621][04272] Updated weights for policy 0, policy_version 16170 (0.0006) [2023-03-06 14:49:23,434][04272] Updated weights for policy 0, policy_version 16180 (0.0006) [2023-03-06 14:49:23,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12578.1, 300 sec: 12562.2). Total num frames: 16574464. Throughput: 0: 12573.1. Samples: 16541095. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-06 14:49:23,941][03942] Avg episode reward: [(0, '671.854')] [2023-03-06 14:49:24,247][04272] Updated weights for policy 0, policy_version 16190 (0.0006) [2023-03-06 14:49:25,047][04272] Updated weights for policy 0, policy_version 16200 (0.0006) [2023-03-06 14:49:25,882][04272] Updated weights for policy 0, policy_version 16210 (0.0007) [2023-03-06 14:49:26,698][04272] Updated weights for policy 0, policy_version 16220 (0.0006) [2023-03-06 14:49:27,505][04272] Updated weights for policy 0, policy_version 16230 (0.0007) [2023-03-06 14:49:28,336][04272] Updated weights for policy 0, policy_version 16240 (0.0006) [2023-03-06 14:49:28,941][03942] Fps is (10 sec: 12595.0, 60 sec: 12578.1, 300 sec: 12558.7). Total num frames: 16636928. Throughput: 0: 12576.0. Samples: 16616669. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:49:28,941][03942] Avg episode reward: [(0, '706.481')] [2023-03-06 14:49:29,137][04272] Updated weights for policy 0, policy_version 16250 (0.0006) [2023-03-06 14:49:29,950][04272] Updated weights for policy 0, policy_version 16260 (0.0006) [2023-03-06 14:49:30,779][04272] Updated weights for policy 0, policy_version 16270 (0.0006) [2023-03-06 14:49:31,579][04272] Updated weights for policy 0, policy_version 16280 (0.0007) [2023-03-06 14:49:32,386][04272] Updated weights for policy 0, policy_version 16290 (0.0006) [2023-03-06 14:49:33,222][04272] Updated weights for policy 0, policy_version 16300 (0.0006) [2023-03-06 14:49:33,940][03942] Fps is (10 sec: 12492.9, 60 sec: 12561.1, 300 sec: 12558.8). Total num frames: 16699392. Throughput: 0: 12574.2. Samples: 16691980. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:49:33,941][03942] Avg episode reward: [(0, '721.530')] [2023-03-06 14:49:34,025][04272] Updated weights for policy 0, policy_version 16310 (0.0007) [2023-03-06 14:49:34,827][04272] Updated weights for policy 0, policy_version 16320 (0.0006) [2023-03-06 14:49:35,641][04272] Updated weights for policy 0, policy_version 16330 (0.0006) [2023-03-06 14:49:36,453][04272] Updated weights for policy 0, policy_version 16340 (0.0006) [2023-03-06 14:49:37,260][04272] Updated weights for policy 0, policy_version 16350 (0.0006) [2023-03-06 14:49:38,078][04272] Updated weights for policy 0, policy_version 16360 (0.0006) [2023-03-06 14:49:38,914][04272] Updated weights for policy 0, policy_version 16370 (0.0006) [2023-03-06 14:49:38,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12578.1, 300 sec: 12562.2). Total num frames: 16762880. Throughput: 0: 12574.7. Samples: 16729901. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:49:38,941][03942] Avg episode reward: [(0, '626.494')] [2023-03-06 14:49:39,709][04272] Updated weights for policy 0, policy_version 16380 (0.0006) [2023-03-06 14:49:40,541][04272] Updated weights for policy 0, policy_version 16390 (0.0006) [2023-03-06 14:49:41,363][04272] Updated weights for policy 0, policy_version 16400 (0.0006) [2023-03-06 14:49:42,169][04272] Updated weights for policy 0, policy_version 16410 (0.0006) [2023-03-06 14:49:42,967][04272] Updated weights for policy 0, policy_version 16420 (0.0006) [2023-03-06 14:49:43,801][04272] Updated weights for policy 0, policy_version 16430 (0.0006) [2023-03-06 14:49:43,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12561.1, 300 sec: 12558.7). Total num frames: 16825344. Throughput: 0: 12580.7. Samples: 16805216. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:49:43,941][03942] Avg episode reward: [(0, '589.112')] [2023-03-06 14:49:44,616][04272] Updated weights for policy 0, policy_version 16440 (0.0007) [2023-03-06 14:49:45,419][04272] Updated weights for policy 0, policy_version 16450 (0.0006) [2023-03-06 14:49:46,238][04272] Updated weights for policy 0, policy_version 16460 (0.0006) [2023-03-06 14:49:47,047][04272] Updated weights for policy 0, policy_version 16470 (0.0006) [2023-03-06 14:49:47,850][04272] Updated weights for policy 0, policy_version 16480 (0.0007) [2023-03-06 14:49:48,665][04272] Updated weights for policy 0, policy_version 16490 (0.0006) [2023-03-06 14:49:48,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12578.1, 300 sec: 12562.2). Total num frames: 16888832. Throughput: 0: 12587.6. Samples: 16880887. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:49:48,941][03942] Avg episode reward: [(0, '701.075')] [2023-03-06 14:49:49,484][04272] Updated weights for policy 0, policy_version 16500 (0.0006) [2023-03-06 14:49:50,304][04272] Updated weights for policy 0, policy_version 16510 (0.0007) [2023-03-06 14:49:51,120][04272] Updated weights for policy 0, policy_version 16520 (0.0006) [2023-03-06 14:49:51,931][04272] Updated weights for policy 0, policy_version 16530 (0.0006) [2023-03-06 14:49:52,737][04272] Updated weights for policy 0, policy_version 16540 (0.0007) [2023-03-06 14:49:53,546][04272] Updated weights for policy 0, policy_version 16550 (0.0006) [2023-03-06 14:49:53,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12578.1, 300 sec: 12562.2). Total num frames: 16951296. Throughput: 0: 12584.4. Samples: 16918514. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 14:49:53,941][03942] Avg episode reward: [(0, '747.900')] [2023-03-06 14:49:54,356][04272] Updated weights for policy 0, policy_version 16560 (0.0006) [2023-03-06 14:49:55,177][04272] Updated weights for policy 0, policy_version 16570 (0.0006) [2023-03-06 14:49:55,981][04272] Updated weights for policy 0, policy_version 16580 (0.0007) [2023-03-06 14:49:56,791][04272] Updated weights for policy 0, policy_version 16590 (0.0006) [2023-03-06 14:49:57,622][04272] Updated weights for policy 0, policy_version 16600 (0.0007) [2023-03-06 14:49:58,430][04272] Updated weights for policy 0, policy_version 16610 (0.0006) [2023-03-06 14:49:58,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12565.7). Total num frames: 17014784. Throughput: 0: 12587.4. Samples: 16994190. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 14:49:58,941][03942] Avg episode reward: [(0, '748.438')] [2023-03-06 14:49:59,235][04272] Updated weights for policy 0, policy_version 16620 (0.0006) [2023-03-06 14:50:00,048][04272] Updated weights for policy 0, policy_version 16630 (0.0006) [2023-03-06 14:50:00,862][04272] Updated weights for policy 0, policy_version 16640 (0.0006) [2023-03-06 14:50:01,687][04272] Updated weights for policy 0, policy_version 16650 (0.0007) [2023-03-06 14:50:02,490][04272] Updated weights for policy 0, policy_version 16660 (0.0006) [2023-03-06 14:50:03,300][04272] Updated weights for policy 0, policy_version 16670 (0.0006) [2023-03-06 14:50:03,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12595.2, 300 sec: 12565.7). Total num frames: 17078272. Throughput: 0: 12586.6. Samples: 17069801. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:50:03,941][03942] Avg episode reward: [(0, '758.597')] [2023-03-06 14:50:04,095][04272] Updated weights for policy 0, policy_version 16680 (0.0007) [2023-03-06 14:50:04,942][04272] Updated weights for policy 0, policy_version 16690 (0.0006) [2023-03-06 14:50:05,726][04272] Updated weights for policy 0, policy_version 16700 (0.0006) [2023-03-06 14:50:06,518][04272] Updated weights for policy 0, policy_version 16710 (0.0006) [2023-03-06 14:50:07,363][04272] Updated weights for policy 0, policy_version 16720 (0.0006) [2023-03-06 14:50:08,173][04272] Updated weights for policy 0, policy_version 16730 (0.0007) [2023-03-06 14:50:08,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12565.7). Total num frames: 17140736. Throughput: 0: 12594.4. Samples: 17107845. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:50:08,941][03942] Avg episode reward: [(0, '701.014')] [2023-03-06 14:50:08,992][04272] Updated weights for policy 0, policy_version 16740 (0.0007) [2023-03-06 14:50:09,792][04272] Updated weights for policy 0, policy_version 16750 (0.0007) [2023-03-06 14:50:10,615][04272] Updated weights for policy 0, policy_version 16760 (0.0007) [2023-03-06 14:50:11,442][04272] Updated weights for policy 0, policy_version 16770 (0.0007) [2023-03-06 14:50:12,247][04272] Updated weights for policy 0, policy_version 16780 (0.0007) [2023-03-06 14:50:13,061][04272] Updated weights for policy 0, policy_version 16790 (0.0006) [2023-03-06 14:50:13,884][04272] Updated weights for policy 0, policy_version 16800 (0.0007) [2023-03-06 14:50:13,940][03942] Fps is (10 sec: 12492.8, 60 sec: 12578.1, 300 sec: 12562.2). Total num frames: 17203200. Throughput: 0: 12588.0. Samples: 17183129. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:50:13,941][03942] Avg episode reward: [(0, '743.104')] [2023-03-06 14:50:14,694][04272] Updated weights for policy 0, policy_version 16810 (0.0007) [2023-03-06 14:50:15,508][04272] Updated weights for policy 0, policy_version 16820 (0.0006) [2023-03-06 14:50:16,318][04272] Updated weights for policy 0, policy_version 16830 (0.0006) [2023-03-06 14:50:17,119][04272] Updated weights for policy 0, policy_version 16840 (0.0007) [2023-03-06 14:50:17,942][04272] Updated weights for policy 0, policy_version 16850 (0.0006) [2023-03-06 14:50:18,756][04272] Updated weights for policy 0, policy_version 16860 (0.0007) [2023-03-06 14:50:18,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12565.7). Total num frames: 17266688. Throughput: 0: 12588.7. Samples: 17258470. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:50:18,941][03942] Avg episode reward: [(0, '710.759')] [2023-03-06 14:50:19,587][04272] Updated weights for policy 0, policy_version 16870 (0.0006) [2023-03-06 14:50:20,397][04272] Updated weights for policy 0, policy_version 16880 (0.0006) [2023-03-06 14:50:21,221][04272] Updated weights for policy 0, policy_version 16890 (0.0007) [2023-03-06 14:50:22,045][04272] Updated weights for policy 0, policy_version 16900 (0.0006) [2023-03-06 14:50:22,861][04272] Updated weights for policy 0, policy_version 16910 (0.0007) [2023-03-06 14:50:23,682][04272] Updated weights for policy 0, policy_version 16920 (0.0006) [2023-03-06 14:50:23,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12578.1, 300 sec: 12562.2). Total num frames: 17329152. Throughput: 0: 12582.0. Samples: 17296089. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:50:23,941][03942] Avg episode reward: [(0, '678.131')] [2023-03-06 14:50:24,507][04272] Updated weights for policy 0, policy_version 16930 (0.0006) [2023-03-06 14:50:25,327][04272] Updated weights for policy 0, policy_version 16940 (0.0006) [2023-03-06 14:50:26,139][04272] Updated weights for policy 0, policy_version 16950 (0.0007) [2023-03-06 14:50:26,946][04272] Updated weights for policy 0, policy_version 16960 (0.0007) [2023-03-06 14:50:27,786][04272] Updated weights for policy 0, policy_version 16970 (0.0006) [2023-03-06 14:50:28,588][04272] Updated weights for policy 0, policy_version 16980 (0.0007) [2023-03-06 14:50:28,940][03942] Fps is (10 sec: 12492.8, 60 sec: 12578.2, 300 sec: 12562.2). Total num frames: 17391616. Throughput: 0: 12574.9. Samples: 17371088. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:50:28,941][03942] Avg episode reward: [(0, '722.935')] [2023-03-06 14:50:29,392][04272] Updated weights for policy 0, policy_version 16990 (0.0007) [2023-03-06 14:50:30,214][04272] Updated weights for policy 0, policy_version 17000 (0.0006) [2023-03-06 14:50:31,039][04272] Updated weights for policy 0, policy_version 17010 (0.0006) [2023-03-06 14:50:31,859][04272] Updated weights for policy 0, policy_version 17020 (0.0006) [2023-03-06 14:50:32,665][04272] Updated weights for policy 0, policy_version 17030 (0.0006) [2023-03-06 14:50:33,479][04272] Updated weights for policy 0, policy_version 17040 (0.0006) [2023-03-06 14:50:33,940][03942] Fps is (10 sec: 12492.9, 60 sec: 12578.1, 300 sec: 12562.2). Total num frames: 17454080. Throughput: 0: 12566.7. Samples: 17446386. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:50:33,941][03942] Avg episode reward: [(0, '736.642')] [2023-03-06 14:50:34,303][04272] Updated weights for policy 0, policy_version 17050 (0.0006) [2023-03-06 14:50:35,113][04272] Updated weights for policy 0, policy_version 17060 (0.0006) [2023-03-06 14:50:35,933][04272] Updated weights for policy 0, policy_version 17070 (0.0006) [2023-03-06 14:50:36,747][04272] Updated weights for policy 0, policy_version 17080 (0.0007) [2023-03-06 14:50:37,552][04272] Updated weights for policy 0, policy_version 17090 (0.0006) [2023-03-06 14:50:38,382][04272] Updated weights for policy 0, policy_version 17100 (0.0006) [2023-03-06 14:50:38,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12578.1, 300 sec: 12562.2). Total num frames: 17517568. Throughput: 0: 12569.0. Samples: 17484118. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:50:38,941][03942] Avg episode reward: [(0, '714.715')] [2023-03-06 14:50:39,174][04272] Updated weights for policy 0, policy_version 17110 (0.0006) [2023-03-06 14:50:39,989][04272] Updated weights for policy 0, policy_version 17120 (0.0006) [2023-03-06 14:50:40,797][04272] Updated weights for policy 0, policy_version 17130 (0.0006) [2023-03-06 14:50:41,607][04272] Updated weights for policy 0, policy_version 17140 (0.0007) [2023-03-06 14:50:42,436][04272] Updated weights for policy 0, policy_version 17150 (0.0007) [2023-03-06 14:50:43,253][04272] Updated weights for policy 0, policy_version 17160 (0.0006) [2023-03-06 14:50:43,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12578.1, 300 sec: 12562.2). Total num frames: 17580032. Throughput: 0: 12566.8. Samples: 17559696. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:50:43,941][03942] Avg episode reward: [(0, '779.710')] [2023-03-06 14:50:44,055][04272] Updated weights for policy 0, policy_version 17170 (0.0007) [2023-03-06 14:50:44,878][04272] Updated weights for policy 0, policy_version 17180 (0.0006) [2023-03-06 14:50:45,707][04272] Updated weights for policy 0, policy_version 17190 (0.0006) [2023-03-06 14:50:46,505][04272] Updated weights for policy 0, policy_version 17200 (0.0007) [2023-03-06 14:50:47,305][04272] Updated weights for policy 0, policy_version 17210 (0.0006) [2023-03-06 14:50:48,125][04272] Updated weights for policy 0, policy_version 17220 (0.0007) [2023-03-06 14:50:48,931][04272] Updated weights for policy 0, policy_version 17230 (0.0006) [2023-03-06 14:50:48,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12578.1, 300 sec: 12562.2). Total num frames: 17643520. Throughput: 0: 12565.9. Samples: 17635268. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:50:48,941][03942] Avg episode reward: [(0, '658.144')] [2023-03-06 14:50:49,751][04272] Updated weights for policy 0, policy_version 17240 (0.0006) [2023-03-06 14:50:50,570][04272] Updated weights for policy 0, policy_version 17250 (0.0006) [2023-03-06 14:50:51,383][04272] Updated weights for policy 0, policy_version 17260 (0.0007) [2023-03-06 14:50:52,182][04272] Updated weights for policy 0, policy_version 17270 (0.0007) [2023-03-06 14:50:53,012][04272] Updated weights for policy 0, policy_version 17280 (0.0007) [2023-03-06 14:50:53,810][04272] Updated weights for policy 0, policy_version 17290 (0.0006) [2023-03-06 14:50:53,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12578.1, 300 sec: 12562.2). Total num frames: 17705984. Throughput: 0: 12556.1. Samples: 17672868. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:50:53,941][03942] Avg episode reward: [(0, '743.334')] [2023-03-06 14:50:54,623][04272] Updated weights for policy 0, policy_version 17300 (0.0007) [2023-03-06 14:50:55,427][04272] Updated weights for policy 0, policy_version 17310 (0.0007) [2023-03-06 14:50:56,271][04272] Updated weights for policy 0, policy_version 17320 (0.0006) [2023-03-06 14:50:57,063][04272] Updated weights for policy 0, policy_version 17330 (0.0006) [2023-03-06 14:50:57,905][04272] Updated weights for policy 0, policy_version 17340 (0.0006) [2023-03-06 14:50:58,687][04272] Updated weights for policy 0, policy_version 17350 (0.0006) [2023-03-06 14:50:58,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12578.1, 300 sec: 12565.7). Total num frames: 17769472. Throughput: 0: 12569.0. Samples: 17748733. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:50:58,941][03942] Avg episode reward: [(0, '783.554')] [2023-03-06 14:50:59,510][04272] Updated weights for policy 0, policy_version 17360 (0.0007) [2023-03-06 14:51:00,335][04272] Updated weights for policy 0, policy_version 17370 (0.0006) [2023-03-06 14:51:01,138][04272] Updated weights for policy 0, policy_version 17380 (0.0006) [2023-03-06 14:51:01,954][04272] Updated weights for policy 0, policy_version 17390 (0.0006) [2023-03-06 14:51:02,774][04272] Updated weights for policy 0, policy_version 17400 (0.0007) [2023-03-06 14:51:03,579][04272] Updated weights for policy 0, policy_version 17410 (0.0006) [2023-03-06 14:51:03,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12561.1, 300 sec: 12562.2). Total num frames: 17831936. Throughput: 0: 12566.5. Samples: 17823963. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:51:03,941][03942] Avg episode reward: [(0, '746.566')] [2023-03-06 14:51:04,389][04272] Updated weights for policy 0, policy_version 17420 (0.0007) [2023-03-06 14:51:05,201][04272] Updated weights for policy 0, policy_version 17430 (0.0006) [2023-03-06 14:51:06,002][04272] Updated weights for policy 0, policy_version 17440 (0.0007) [2023-03-06 14:51:06,835][04272] Updated weights for policy 0, policy_version 17450 (0.0006) [2023-03-06 14:51:07,637][04272] Updated weights for policy 0, policy_version 17460 (0.0006) [2023-03-06 14:51:08,430][04272] Updated weights for policy 0, policy_version 17470 (0.0006) [2023-03-06 14:51:08,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12578.2, 300 sec: 12565.7). Total num frames: 17895424. Throughput: 0: 12572.4. Samples: 17861845. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:51:08,941][03942] Avg episode reward: [(0, '763.069')] [2023-03-06 14:51:08,944][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000017476_17895424.pth... 
[2023-03-06 14:51:08,975][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000014529_14877696.pth [2023-03-06 14:51:09,254][04272] Updated weights for policy 0, policy_version 17480 (0.0006) [2023-03-06 14:51:10,059][04272] Updated weights for policy 0, policy_version 17490 (0.0006) [2023-03-06 14:51:10,903][04272] Updated weights for policy 0, policy_version 17500 (0.0007) [2023-03-06 14:51:11,716][04272] Updated weights for policy 0, policy_version 17510 (0.0007) [2023-03-06 14:51:12,517][04272] Updated weights for policy 0, policy_version 17520 (0.0006) [2023-03-06 14:51:13,347][04272] Updated weights for policy 0, policy_version 17530 (0.0006) [2023-03-06 14:51:13,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12578.1, 300 sec: 12565.7). Total num frames: 17957888. Throughput: 0: 12586.3. Samples: 17937471. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:51:13,941][03942] Avg episode reward: [(0, '826.147')] [2023-03-06 14:51:13,942][04221] Saving new best policy, reward=826.147! [2023-03-06 14:51:14,162][04272] Updated weights for policy 0, policy_version 17540 (0.0006) [2023-03-06 14:51:14,964][04272] Updated weights for policy 0, policy_version 17550 (0.0006) [2023-03-06 14:51:15,790][04272] Updated weights for policy 0, policy_version 17560 (0.0007) [2023-03-06 14:51:16,598][04272] Updated weights for policy 0, policy_version 17570 (0.0006) [2023-03-06 14:51:17,417][04272] Updated weights for policy 0, policy_version 17580 (0.0006) [2023-03-06 14:51:18,238][04272] Updated weights for policy 0, policy_version 17590 (0.0007) [2023-03-06 14:51:18,940][03942] Fps is (10 sec: 12492.8, 60 sec: 12561.1, 300 sec: 12565.7). Total num frames: 18020352. Throughput: 0: 12587.6. Samples: 18012827. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:51:18,941][03942] Avg episode reward: [(0, '761.970')] [2023-03-06 14:51:19,038][04272] Updated weights for policy 0, policy_version 17600 (0.0006) [2023-03-06 14:51:19,836][04272] Updated weights for policy 0, policy_version 17610 (0.0007) [2023-03-06 14:51:20,659][04272] Updated weights for policy 0, policy_version 17620 (0.0006) [2023-03-06 14:51:21,495][04272] Updated weights for policy 0, policy_version 17630 (0.0007) [2023-03-06 14:51:22,309][04272] Updated weights for policy 0, policy_version 17640 (0.0006) [2023-03-06 14:51:23,113][04272] Updated weights for policy 0, policy_version 17650 (0.0006) [2023-03-06 14:51:23,916][04272] Updated weights for policy 0, policy_version 17660 (0.0006) [2023-03-06 14:51:23,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12578.1, 300 sec: 12569.2). Total num frames: 18083840. Throughput: 0: 12586.5. Samples: 18050509. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:51:23,941][03942] Avg episode reward: [(0, '769.114')] [2023-03-06 14:51:24,731][04272] Updated weights for policy 0, policy_version 17670 (0.0006) [2023-03-06 14:51:25,557][04272] Updated weights for policy 0, policy_version 17680 (0.0006) [2023-03-06 14:51:26,378][04272] Updated weights for policy 0, policy_version 17690 (0.0006) [2023-03-06 14:51:27,182][04272] Updated weights for policy 0, policy_version 17700 (0.0006) [2023-03-06 14:51:27,997][04272] Updated weights for policy 0, policy_version 17710 (0.0006) [2023-03-06 14:51:28,826][04272] Updated weights for policy 0, policy_version 17720 (0.0006) [2023-03-06 14:51:28,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12578.1, 300 sec: 12565.7). Total num frames: 18146304. Throughput: 0: 12584.2. Samples: 18125986. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:51:28,941][03942] Avg episode reward: [(0, '751.756')] [2023-03-06 14:51:29,636][04272] Updated weights for policy 0, policy_version 17730 (0.0006) [2023-03-06 14:51:30,445][04272] Updated weights for policy 0, policy_version 17740 (0.0006) [2023-03-06 14:51:31,262][04272] Updated weights for policy 0, policy_version 17750 (0.0006) [2023-03-06 14:51:32,075][04272] Updated weights for policy 0, policy_version 17760 (0.0007) [2023-03-06 14:51:32,872][04272] Updated weights for policy 0, policy_version 17770 (0.0006) [2023-03-06 14:51:33,704][04272] Updated weights for policy 0, policy_version 17780 (0.0007) [2023-03-06 14:51:33,941][03942] Fps is (10 sec: 12492.9, 60 sec: 12578.1, 300 sec: 12565.7). Total num frames: 18208768. Throughput: 0: 12581.6. Samples: 18201440. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:51:33,941][03942] Avg episode reward: [(0, '682.884')] [2023-03-06 14:51:34,525][04272] Updated weights for policy 0, policy_version 17790 (0.0006) [2023-03-06 14:51:35,327][04272] Updated weights for policy 0, policy_version 17800 (0.0006) [2023-03-06 14:51:36,146][04272] Updated weights for policy 0, policy_version 17810 (0.0006) [2023-03-06 14:51:36,957][04272] Updated weights for policy 0, policy_version 17820 (0.0006) [2023-03-06 14:51:37,771][04272] Updated weights for policy 0, policy_version 17830 (0.0006) [2023-03-06 14:51:38,595][04272] Updated weights for policy 0, policy_version 17840 (0.0006) [2023-03-06 14:51:38,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12578.1, 300 sec: 12569.2). Total num frames: 18272256. Throughput: 0: 12585.9. Samples: 18239233. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:51:38,941][03942] Avg episode reward: [(0, '713.230')] [2023-03-06 14:51:39,406][04272] Updated weights for policy 0, policy_version 17850 (0.0007) [2023-03-06 14:51:40,190][04272] Updated weights for policy 0, policy_version 17860 (0.0006) [2023-03-06 14:51:41,010][04272] Updated weights for policy 0, policy_version 17870 (0.0007) [2023-03-06 14:51:41,851][04272] Updated weights for policy 0, policy_version 17880 (0.0008) [2023-03-06 14:51:42,653][04272] Updated weights for policy 0, policy_version 17890 (0.0006) [2023-03-06 14:51:43,453][04272] Updated weights for policy 0, policy_version 17900 (0.0006) [2023-03-06 14:51:43,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12578.1, 300 sec: 12569.2). Total num frames: 18334720. Throughput: 0: 12577.1. Samples: 18314702. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:51:43,941][03942] Avg episode reward: [(0, '779.383')] [2023-03-06 14:51:44,270][04272] Updated weights for policy 0, policy_version 17910 (0.0006) [2023-03-06 14:51:45,065][04272] Updated weights for policy 0, policy_version 17920 (0.0006) [2023-03-06 14:51:45,898][04272] Updated weights for policy 0, policy_version 17930 (0.0006) [2023-03-06 14:51:46,721][04272] Updated weights for policy 0, policy_version 17940 (0.0006) [2023-03-06 14:51:47,533][04272] Updated weights for policy 0, policy_version 17950 (0.0007) [2023-03-06 14:51:48,345][04272] Updated weights for policy 0, policy_version 17960 (0.0007) [2023-03-06 14:51:48,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12578.1, 300 sec: 12569.2). Total num frames: 18398208. Throughput: 0: 12583.6. Samples: 18390227. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:51:48,941][03942] Avg episode reward: [(0, '886.349')] [2023-03-06 14:51:48,945][04221] Saving new best policy, reward=886.349! 
[2023-03-06 14:51:49,171][04272] Updated weights for policy 0, policy_version 17970 (0.0007) [2023-03-06 14:51:49,967][04272] Updated weights for policy 0, policy_version 17980 (0.0006) [2023-03-06 14:51:50,781][04272] Updated weights for policy 0, policy_version 17990 (0.0007) [2023-03-06 14:51:51,594][04272] Updated weights for policy 0, policy_version 18000 (0.0006) [2023-03-06 14:51:52,394][04272] Updated weights for policy 0, policy_version 18010 (0.0006) [2023-03-06 14:51:53,218][04272] Updated weights for policy 0, policy_version 18020 (0.0007) [2023-03-06 14:51:53,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12578.1, 300 sec: 12572.6). Total num frames: 18460672. Throughput: 0: 12582.1. Samples: 18428042. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:51:53,941][03942] Avg episode reward: [(0, '816.827')] [2023-03-06 14:51:54,021][04272] Updated weights for policy 0, policy_version 18030 (0.0006) [2023-03-06 14:51:54,837][04272] Updated weights for policy 0, policy_version 18040 (0.0006) [2023-03-06 14:51:55,634][04272] Updated weights for policy 0, policy_version 18050 (0.0006) [2023-03-06 14:51:56,467][04272] Updated weights for policy 0, policy_version 18060 (0.0006) [2023-03-06 14:51:57,272][04272] Updated weights for policy 0, policy_version 18070 (0.0006) [2023-03-06 14:51:58,056][04272] Updated weights for policy 0, policy_version 18080 (0.0007) [2023-03-06 14:51:58,894][04272] Updated weights for policy 0, policy_version 18090 (0.0006) [2023-03-06 14:51:58,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12578.1, 300 sec: 12576.1). Total num frames: 18524160. Throughput: 0: 12585.0. Samples: 18503796. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:51:58,941][03942] Avg episode reward: [(0, '807.412')] [2023-03-06 14:51:59,711][04272] Updated weights for policy 0, policy_version 18100 (0.0006) [2023-03-06 14:52:00,505][04272] Updated weights for policy 0, policy_version 18110 (0.0005) [2023-03-06 14:52:01,329][04272] Updated weights for policy 0, policy_version 18120 (0.0007) [2023-03-06 14:52:02,139][04272] Updated weights for policy 0, policy_version 18130 (0.0007) [2023-03-06 14:52:02,957][04272] Updated weights for policy 0, policy_version 18140 (0.0007) [2023-03-06 14:52:03,782][04272] Updated weights for policy 0, policy_version 18150 (0.0007) [2023-03-06 14:52:03,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12578.1, 300 sec: 12572.6). Total num frames: 18586624. Throughput: 0: 12591.5. Samples: 18579445. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:52:03,941][03942] Avg episode reward: [(0, '779.159')] [2023-03-06 14:52:04,587][04272] Updated weights for policy 0, policy_version 18160 (0.0006) [2023-03-06 14:52:05,406][04272] Updated weights for policy 0, policy_version 18170 (0.0007) [2023-03-06 14:52:06,222][04272] Updated weights for policy 0, policy_version 18180 (0.0008) [2023-03-06 14:52:07,047][04272] Updated weights for policy 0, policy_version 18190 (0.0006) [2023-03-06 14:52:07,859][04272] Updated weights for policy 0, policy_version 18200 (0.0007) [2023-03-06 14:52:08,662][04272] Updated weights for policy 0, policy_version 18210 (0.0006) [2023-03-06 14:52:08,941][03942] Fps is (10 sec: 12595.0, 60 sec: 12578.1, 300 sec: 12576.1). Total num frames: 18650112. Throughput: 0: 12591.2. Samples: 18617113. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:52:08,941][03942] Avg episode reward: [(0, '715.304')] [2023-03-06 14:52:09,486][04272] Updated weights for policy 0, policy_version 18220 (0.0006) [2023-03-06 14:52:10,313][04272] Updated weights for policy 0, policy_version 18230 (0.0007) [2023-03-06 14:52:11,118][04272] Updated weights for policy 0, policy_version 18240 (0.0006) [2023-03-06 14:52:11,924][04272] Updated weights for policy 0, policy_version 18250 (0.0006) [2023-03-06 14:52:12,724][04272] Updated weights for policy 0, policy_version 18260 (0.0006) [2023-03-06 14:52:13,537][04272] Updated weights for policy 0, policy_version 18270 (0.0006) [2023-03-06 14:52:13,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12595.2, 300 sec: 12579.6). Total num frames: 18713600. Throughput: 0: 12588.8. Samples: 18692480. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:52:13,941][03942] Avg episode reward: [(0, '699.499')] [2023-03-06 14:52:14,347][04272] Updated weights for policy 0, policy_version 18280 (0.0006) [2023-03-06 14:52:15,165][04272] Updated weights for policy 0, policy_version 18290 (0.0006) [2023-03-06 14:52:15,971][04272] Updated weights for policy 0, policy_version 18300 (0.0006) [2023-03-06 14:52:16,797][04272] Updated weights for policy 0, policy_version 18310 (0.0006) [2023-03-06 14:52:17,603][04272] Updated weights for policy 0, policy_version 18320 (0.0006) [2023-03-06 14:52:18,429][04272] Updated weights for policy 0, policy_version 18330 (0.0006) [2023-03-06 14:52:18,940][03942] Fps is (10 sec: 12595.4, 60 sec: 12595.2, 300 sec: 12579.6). Total num frames: 18776064. Throughput: 0: 12595.4. Samples: 18768233. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:52:18,941][03942] Avg episode reward: [(0, '748.303')] [2023-03-06 14:52:19,215][04272] Updated weights for policy 0, policy_version 18340 (0.0006) [2023-03-06 14:52:20,039][04272] Updated weights for policy 0, policy_version 18350 (0.0006) [2023-03-06 14:52:20,836][04272] Updated weights for policy 0, policy_version 18360 (0.0007) [2023-03-06 14:52:21,663][04272] Updated weights for policy 0, policy_version 18370 (0.0006) [2023-03-06 14:52:22,492][04272] Updated weights for policy 0, policy_version 18380 (0.0008) [2023-03-06 14:52:23,305][04272] Updated weights for policy 0, policy_version 18390 (0.0008) [2023-03-06 14:52:23,940][03942] Fps is (10 sec: 12492.9, 60 sec: 12578.2, 300 sec: 12576.1). Total num frames: 18838528. Throughput: 0: 12594.7. Samples: 18805994. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:52:23,941][03942] Avg episode reward: [(0, '752.480')] [2023-03-06 14:52:24,114][04272] Updated weights for policy 0, policy_version 18400 (0.0006) [2023-03-06 14:52:24,930][04272] Updated weights for policy 0, policy_version 18410 (0.0007) [2023-03-06 14:52:25,735][04272] Updated weights for policy 0, policy_version 18420 (0.0007) [2023-03-06 14:52:26,554][04272] Updated weights for policy 0, policy_version 18430 (0.0007) [2023-03-06 14:52:27,366][04272] Updated weights for policy 0, policy_version 18440 (0.0006) [2023-03-06 14:52:28,179][04272] Updated weights for policy 0, policy_version 18450 (0.0006) [2023-03-06 14:52:28,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12579.6). Total num frames: 18902016. Throughput: 0: 12593.9. Samples: 18881427. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:52:28,941][03942] Avg episode reward: [(0, '757.279')] [2023-03-06 14:52:28,985][04272] Updated weights for policy 0, policy_version 18460 (0.0006) [2023-03-06 14:52:29,807][04272] Updated weights for policy 0, policy_version 18470 (0.0007) [2023-03-06 14:52:30,619][04272] Updated weights for policy 0, policy_version 18480 (0.0006) [2023-03-06 14:52:31,422][04272] Updated weights for policy 0, policy_version 18490 (0.0006) [2023-03-06 14:52:32,250][04272] Updated weights for policy 0, policy_version 18500 (0.0006) [2023-03-06 14:52:33,056][04272] Updated weights for policy 0, policy_version 18510 (0.0006) [2023-03-06 14:52:33,865][04272] Updated weights for policy 0, policy_version 18520 (0.0006) [2023-03-06 14:52:33,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12612.3, 300 sec: 12579.6). Total num frames: 18965504. Throughput: 0: 12596.7. Samples: 18957077. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:52:33,941][03942] Avg episode reward: [(0, '808.003')] [2023-03-06 14:52:34,678][04272] Updated weights for policy 0, policy_version 18530 (0.0006) [2023-03-06 14:52:35,491][04272] Updated weights for policy 0, policy_version 18540 (0.0006) [2023-03-06 14:52:36,307][04272] Updated weights for policy 0, policy_version 18550 (0.0006) [2023-03-06 14:52:37,134][04272] Updated weights for policy 0, policy_version 18560 (0.0006) [2023-03-06 14:52:37,927][04272] Updated weights for policy 0, policy_version 18570 (0.0006) [2023-03-06 14:52:38,740][04272] Updated weights for policy 0, policy_version 18580 (0.0007) [2023-03-06 14:52:38,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12579.6). Total num frames: 19027968. Throughput: 0: 12596.5. Samples: 18994884. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:52:38,941][03942] Avg episode reward: [(0, '824.218')] [2023-03-06 14:52:39,577][04272] Updated weights for policy 0, policy_version 18590 (0.0006) [2023-03-06 14:52:40,374][04272] Updated weights for policy 0, policy_version 18600 (0.0006) [2023-03-06 14:52:41,167][04272] Updated weights for policy 0, policy_version 18610 (0.0006) [2023-03-06 14:52:41,994][04272] Updated weights for policy 0, policy_version 18620 (0.0006) [2023-03-06 14:52:42,807][04272] Updated weights for policy 0, policy_version 18630 (0.0006) [2023-03-06 14:52:43,617][04272] Updated weights for policy 0, policy_version 18640 (0.0006) [2023-03-06 14:52:43,941][03942] Fps is (10 sec: 12492.7, 60 sec: 12595.2, 300 sec: 12579.6). Total num frames: 19090432. Throughput: 0: 12598.2. Samples: 19070714. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:52:43,941][03942] Avg episode reward: [(0, '774.593')] [2023-03-06 14:52:44,441][04272] Updated weights for policy 0, policy_version 18650 (0.0006) [2023-03-06 14:52:45,233][04272] Updated weights for policy 0, policy_version 18660 (0.0007) [2023-03-06 14:52:46,053][04272] Updated weights for policy 0, policy_version 18670 (0.0006) [2023-03-06 14:52:46,870][04272] Updated weights for policy 0, policy_version 18680 (0.0007) [2023-03-06 14:52:47,700][04272] Updated weights for policy 0, policy_version 18690 (0.0006) [2023-03-06 14:52:48,512][04272] Updated weights for policy 0, policy_version 18700 (0.0006) [2023-03-06 14:52:48,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12579.6). Total num frames: 19153920. Throughput: 0: 12588.4. Samples: 19145923. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:52:48,941][03942] Avg episode reward: [(0, '843.248')] [2023-03-06 14:52:49,334][04272] Updated weights for policy 0, policy_version 18710 (0.0006) [2023-03-06 14:52:50,160][04272] Updated weights for policy 0, policy_version 18720 (0.0006) [2023-03-06 14:52:50,958][04272] Updated weights for policy 0, policy_version 18730 (0.0006) [2023-03-06 14:52:51,787][04272] Updated weights for policy 0, policy_version 18740 (0.0007) [2023-03-06 14:52:52,605][04272] Updated weights for policy 0, policy_version 18750 (0.0006) [2023-03-06 14:52:53,411][04272] Updated weights for policy 0, policy_version 18760 (0.0007) [2023-03-06 14:52:53,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12579.6). Total num frames: 19216384. Throughput: 0: 12582.9. Samples: 19183340. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:52:53,941][03942] Avg episode reward: [(0, '965.198')] [2023-03-06 14:52:53,941][04221] Saving new best policy, reward=965.198! [2023-03-06 14:52:54,215][04272] Updated weights for policy 0, policy_version 18770 (0.0006) [2023-03-06 14:52:55,025][04272] Updated weights for policy 0, policy_version 18780 (0.0005) [2023-03-06 14:52:55,829][04272] Updated weights for policy 0, policy_version 18790 (0.0006) [2023-03-06 14:52:56,631][04272] Updated weights for policy 0, policy_version 18800 (0.0006) [2023-03-06 14:52:57,440][04272] Updated weights for policy 0, policy_version 18810 (0.0007) [2023-03-06 14:52:58,254][04272] Updated weights for policy 0, policy_version 18820 (0.0007) [2023-03-06 14:52:58,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12583.0). Total num frames: 19279872. Throughput: 0: 12595.9. Samples: 19259296. 
Policy #0 lag: (min: 0.0, avg: 1.5, max: 3.0) [2023-03-06 14:52:58,941][03942] Avg episode reward: [(0, '941.065')] [2023-03-06 14:52:59,048][04272] Updated weights for policy 0, policy_version 18830 (0.0006) [2023-03-06 14:52:59,880][04272] Updated weights for policy 0, policy_version 18840 (0.0006) [2023-03-06 14:53:00,676][04272] Updated weights for policy 0, policy_version 18850 (0.0005) [2023-03-06 14:53:01,485][04272] Updated weights for policy 0, policy_version 18860 (0.0007) [2023-03-06 14:53:02,302][04272] Updated weights for policy 0, policy_version 18870 (0.0006) [2023-03-06 14:53:03,116][04272] Updated weights for policy 0, policy_version 18880 (0.0007) [2023-03-06 14:53:03,934][04272] Updated weights for policy 0, policy_version 18890 (0.0006) [2023-03-06 14:53:03,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12612.3, 300 sec: 12583.1). Total num frames: 19343360. Throughput: 0: 12599.1. Samples: 19335192. Policy #0 lag: (min: 0.0, avg: 1.5, max: 3.0) [2023-03-06 14:53:03,941][03942] Avg episode reward: [(0, '961.185')] [2023-03-06 14:53:04,752][04272] Updated weights for policy 0, policy_version 18900 (0.0006) [2023-03-06 14:53:05,567][04272] Updated weights for policy 0, policy_version 18910 (0.0006) [2023-03-06 14:53:06,389][04272] Updated weights for policy 0, policy_version 18920 (0.0006) [2023-03-06 14:53:07,187][04272] Updated weights for policy 0, policy_version 18930 (0.0006) [2023-03-06 14:53:07,997][04272] Updated weights for policy 0, policy_version 18940 (0.0005) [2023-03-06 14:53:08,810][04272] Updated weights for policy 0, policy_version 18950 (0.0006) [2023-03-06 14:53:08,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12583.0). Total num frames: 19405824. Throughput: 0: 12595.3. Samples: 19372785. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:53:08,941][03942] Avg episode reward: [(0, '921.019')] [2023-03-06 14:53:08,944][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000018951_19405824.pth... [2023-03-06 14:53:08,974][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000016002_16386048.pth [2023-03-06 14:53:09,625][04272] Updated weights for policy 0, policy_version 18960 (0.0006) [2023-03-06 14:53:10,446][04272] Updated weights for policy 0, policy_version 18970 (0.0006) [2023-03-06 14:53:11,250][04272] Updated weights for policy 0, policy_version 18980 (0.0006) [2023-03-06 14:53:12,071][04272] Updated weights for policy 0, policy_version 18990 (0.0007) [2023-03-06 14:53:12,887][04272] Updated weights for policy 0, policy_version 19000 (0.0007) [2023-03-06 14:53:13,703][04272] Updated weights for policy 0, policy_version 19010 (0.0006) [2023-03-06 14:53:13,941][03942] Fps is (10 sec: 12492.8, 60 sec: 12578.1, 300 sec: 12579.6). Total num frames: 19468288. Throughput: 0: 12597.0. Samples: 19448293. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:53:13,941][03942] Avg episode reward: [(0, '1010.104')] [2023-03-06 14:53:13,950][04221] Saving new best policy, reward=1010.104! [2023-03-06 14:53:14,504][04272] Updated weights for policy 0, policy_version 19020 (0.0006) [2023-03-06 14:53:15,341][04272] Updated weights for policy 0, policy_version 19030 (0.0007) [2023-03-06 14:53:16,140][04272] Updated weights for policy 0, policy_version 19040 (0.0006) [2023-03-06 14:53:16,962][04272] Updated weights for policy 0, policy_version 19050 (0.0006) [2023-03-06 14:53:17,767][04272] Updated weights for policy 0, policy_version 19060 (0.0007) [2023-03-06 14:53:18,584][04272] Updated weights for policy 0, policy_version 19070 (0.0007) [2023-03-06 14:53:18,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12583.1). 
Total num frames: 19531776. Throughput: 0: 12595.1. Samples: 19523857. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:53:18,941][03942] Avg episode reward: [(0, '1046.138')] [2023-03-06 14:53:18,944][04221] Saving new best policy, reward=1046.138! [2023-03-06 14:53:19,388][04272] Updated weights for policy 0, policy_version 19080 (0.0006) [2023-03-06 14:53:20,209][04272] Updated weights for policy 0, policy_version 19090 (0.0006) [2023-03-06 14:53:21,012][04272] Updated weights for policy 0, policy_version 19100 (0.0006) [2023-03-06 14:53:21,821][04272] Updated weights for policy 0, policy_version 19110 (0.0006) [2023-03-06 14:53:22,647][04272] Updated weights for policy 0, policy_version 19120 (0.0006) [2023-03-06 14:53:23,463][04272] Updated weights for policy 0, policy_version 19130 (0.0006) [2023-03-06 14:53:23,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12583.1). Total num frames: 19594240. Throughput: 0: 12598.0. Samples: 19561795. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:53:23,941][03942] Avg episode reward: [(0, '935.421')] [2023-03-06 14:53:24,281][04272] Updated weights for policy 0, policy_version 19140 (0.0006) [2023-03-06 14:53:25,093][04272] Updated weights for policy 0, policy_version 19150 (0.0007) [2023-03-06 14:53:25,903][04272] Updated weights for policy 0, policy_version 19160 (0.0007) [2023-03-06 14:53:26,725][04272] Updated weights for policy 0, policy_version 19170 (0.0006) [2023-03-06 14:53:27,522][04272] Updated weights for policy 0, policy_version 19180 (0.0006) [2023-03-06 14:53:28,351][04272] Updated weights for policy 0, policy_version 19190 (0.0007) [2023-03-06 14:53:28,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12583.1). Total num frames: 19657728. Throughput: 0: 12591.1. Samples: 19637316. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:53:28,941][03942] Avg episode reward: [(0, '1076.678')] [2023-03-06 14:53:28,944][04221] Saving new best policy, reward=1076.678! [2023-03-06 14:53:29,158][04272] Updated weights for policy 0, policy_version 19200 (0.0006) [2023-03-06 14:53:29,969][04272] Updated weights for policy 0, policy_version 19210 (0.0007) [2023-03-06 14:53:30,770][04272] Updated weights for policy 0, policy_version 19220 (0.0006) [2023-03-06 14:53:31,608][04272] Updated weights for policy 0, policy_version 19230 (0.0006) [2023-03-06 14:53:32,407][04272] Updated weights for policy 0, policy_version 19240 (0.0006) [2023-03-06 14:53:33,218][04272] Updated weights for policy 0, policy_version 19250 (0.0006) [2023-03-06 14:53:33,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12578.1, 300 sec: 12583.1). Total num frames: 19720192. Throughput: 0: 12597.1. Samples: 19712794. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:53:33,941][03942] Avg episode reward: [(0, '1031.587')] [2023-03-06 14:53:34,032][04272] Updated weights for policy 0, policy_version 19260 (0.0006) [2023-03-06 14:53:34,850][04272] Updated weights for policy 0, policy_version 19270 (0.0006) [2023-03-06 14:53:35,666][04272] Updated weights for policy 0, policy_version 19280 (0.0007) [2023-03-06 14:53:36,490][04272] Updated weights for policy 0, policy_version 19290 (0.0006) [2023-03-06 14:53:37,295][04272] Updated weights for policy 0, policy_version 19300 (0.0007) [2023-03-06 14:53:38,112][04272] Updated weights for policy 0, policy_version 19310 (0.0007) [2023-03-06 14:53:38,938][04272] Updated weights for policy 0, policy_version 19320 (0.0007) [2023-03-06 14:53:38,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12583.1). Total num frames: 19783680. Throughput: 0: 12601.5. Samples: 19750407. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:53:38,941][03942] Avg episode reward: [(0, '903.538')] [2023-03-06 14:53:39,751][04272] Updated weights for policy 0, policy_version 19330 (0.0007) [2023-03-06 14:53:40,567][04272] Updated weights for policy 0, policy_version 19340 (0.0006) [2023-03-06 14:53:41,400][04272] Updated weights for policy 0, policy_version 19350 (0.0006) [2023-03-06 14:53:42,197][04272] Updated weights for policy 0, policy_version 19360 (0.0007) [2023-03-06 14:53:43,020][04272] Updated weights for policy 0, policy_version 19370 (0.0006) [2023-03-06 14:53:43,845][04272] Updated weights for policy 0, policy_version 19380 (0.0006) [2023-03-06 14:53:43,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12583.1). Total num frames: 19846144. Throughput: 0: 12587.1. Samples: 19825717. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:53:43,952][03942] Avg episode reward: [(0, '989.794')] [2023-03-06 14:53:44,667][04272] Updated weights for policy 0, policy_version 19390 (0.0006) [2023-03-06 14:53:45,479][04272] Updated weights for policy 0, policy_version 19400 (0.0006) [2023-03-06 14:53:46,308][04272] Updated weights for policy 0, policy_version 19410 (0.0008) [2023-03-06 14:53:47,132][04272] Updated weights for policy 0, policy_version 19420 (0.0006) [2023-03-06 14:53:47,949][04272] Updated weights for policy 0, policy_version 19430 (0.0006) [2023-03-06 14:53:48,755][04272] Updated weights for policy 0, policy_version 19440 (0.0007) [2023-03-06 14:53:48,941][03942] Fps is (10 sec: 12492.7, 60 sec: 12578.1, 300 sec: 12583.1). Total num frames: 19908608. Throughput: 0: 12564.3. Samples: 19900584. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:53:48,952][03942] Avg episode reward: [(0, '1025.049')] [2023-03-06 14:53:49,573][04272] Updated weights for policy 0, policy_version 19450 (0.0006) [2023-03-06 14:53:50,406][04272] Updated weights for policy 0, policy_version 19460 (0.0007) [2023-03-06 14:53:51,206][04272] Updated weights for policy 0, policy_version 19470 (0.0006) [2023-03-06 14:53:52,037][04272] Updated weights for policy 0, policy_version 19480 (0.0007) [2023-03-06 14:53:52,852][04272] Updated weights for policy 0, policy_version 19490 (0.0006) [2023-03-06 14:53:53,646][04272] Updated weights for policy 0, policy_version 19500 (0.0006) [2023-03-06 14:53:53,940][03942] Fps is (10 sec: 12492.8, 60 sec: 12578.1, 300 sec: 12583.1). Total num frames: 19971072. Throughput: 0: 12566.9. Samples: 19938294. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:53:53,951][03942] Avg episode reward: [(0, '927.671')] [2023-03-06 14:53:54,461][04272] Updated weights for policy 0, policy_version 19510 (0.0007) [2023-03-06 14:53:55,276][04272] Updated weights for policy 0, policy_version 19520 (0.0006) [2023-03-06 14:53:56,081][04272] Updated weights for policy 0, policy_version 19530 (0.0007) [2023-03-06 14:53:56,896][04272] Updated weights for policy 0, policy_version 19540 (0.0007) [2023-03-06 14:53:57,713][04272] Updated weights for policy 0, policy_version 19550 (0.0007) [2023-03-06 14:53:58,549][04272] Updated weights for policy 0, policy_version 19560 (0.0007) [2023-03-06 14:53:58,940][03942] Fps is (10 sec: 12492.9, 60 sec: 12561.1, 300 sec: 12579.6). Total num frames: 20033536. Throughput: 0: 12567.1. Samples: 20013810. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:53:58,951][03942] Avg episode reward: [(0, '1031.084')] [2023-03-06 14:53:59,367][04272] Updated weights for policy 0, policy_version 19570 (0.0006) [2023-03-06 14:54:00,166][04272] Updated weights for policy 0, policy_version 19580 (0.0006) [2023-03-06 14:54:00,987][04272] Updated weights for policy 0, policy_version 19590 (0.0006) [2023-03-06 14:54:01,816][04272] Updated weights for policy 0, policy_version 19600 (0.0007) [2023-03-06 14:54:02,648][04272] Updated weights for policy 0, policy_version 19610 (0.0007) [2023-03-06 14:54:03,449][04272] Updated weights for policy 0, policy_version 19620 (0.0006) [2023-03-06 14:54:03,940][03942] Fps is (10 sec: 12492.8, 60 sec: 12544.0, 300 sec: 12579.6). Total num frames: 20096000. Throughput: 0: 12551.4. Samples: 20088670. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:54:03,951][03942] Avg episode reward: [(0, '1006.140')] [2023-03-06 14:54:04,254][04272] Updated weights for policy 0, policy_version 19630 (0.0006) [2023-03-06 14:54:05,070][04272] Updated weights for policy 0, policy_version 19640 (0.0006) [2023-03-06 14:54:05,882][04272] Updated weights for policy 0, policy_version 19650 (0.0007) [2023-03-06 14:54:06,699][04272] Updated weights for policy 0, policy_version 19660 (0.0006) [2023-03-06 14:54:07,505][04272] Updated weights for policy 0, policy_version 19670 (0.0006) [2023-03-06 14:54:08,337][04272] Updated weights for policy 0, policy_version 19680 (0.0008) [2023-03-06 14:54:08,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12561.1, 300 sec: 12579.6). Total num frames: 20159488. Throughput: 0: 12548.5. Samples: 20126477. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:54:08,951][03942] Avg episode reward: [(0, '1134.570')] [2023-03-06 14:54:08,955][04221] Saving new best policy, reward=1134.570! 
[2023-03-06 14:54:09,141][04272] Updated weights for policy 0, policy_version 19690 (0.0006) [2023-03-06 14:54:09,949][04272] Updated weights for policy 0, policy_version 19700 (0.0007) [2023-03-06 14:54:10,779][04272] Updated weights for policy 0, policy_version 19710 (0.0006) [2023-03-06 14:54:11,572][04272] Updated weights for policy 0, policy_version 19720 (0.0006) [2023-03-06 14:54:12,380][04272] Updated weights for policy 0, policy_version 19730 (0.0006) [2023-03-06 14:54:13,202][04272] Updated weights for policy 0, policy_version 19740 (0.0006) [2023-03-06 14:54:13,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12578.1, 300 sec: 12583.1). Total num frames: 20222976. Throughput: 0: 12552.8. Samples: 20202193. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:54:13,952][03942] Avg episode reward: [(0, '1119.597')] [2023-03-06 14:54:13,991][04272] Updated weights for policy 0, policy_version 19750 (0.0006) [2023-03-06 14:54:14,805][04272] Updated weights for policy 0, policy_version 19760 (0.0006) [2023-03-06 14:54:15,607][04272] Updated weights for policy 0, policy_version 19770 (0.0006) [2023-03-06 14:54:16,421][04272] Updated weights for policy 0, policy_version 19780 (0.0007) [2023-03-06 14:54:17,221][04272] Updated weights for policy 0, policy_version 19790 (0.0007) [2023-03-06 14:54:18,028][04272] Updated weights for policy 0, policy_version 19800 (0.0007) [2023-03-06 14:54:18,841][04272] Updated weights for policy 0, policy_version 19810 (0.0007) [2023-03-06 14:54:18,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12561.1, 300 sec: 12579.6). Total num frames: 20285440. Throughput: 0: 12566.0. Samples: 20278264. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:54:18,951][03942] Avg episode reward: [(0, '1001.566')] [2023-03-06 14:54:19,656][04272] Updated weights for policy 0, policy_version 19820 (0.0006) [2023-03-06 14:54:20,467][04272] Updated weights for policy 0, policy_version 19830 (0.0007) [2023-03-06 14:54:21,282][04272] Updated weights for policy 0, policy_version 19840 (0.0006) [2023-03-06 14:54:22,103][04272] Updated weights for policy 0, policy_version 19850 (0.0006) [2023-03-06 14:54:22,927][04272] Updated weights for policy 0, policy_version 19860 (0.0006) [2023-03-06 14:54:23,725][04272] Updated weights for policy 0, policy_version 19870 (0.0006) [2023-03-06 14:54:23,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12578.1, 300 sec: 12583.1). Total num frames: 20348928. Throughput: 0: 12567.6. Samples: 20315950. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:54:23,951][03942] Avg episode reward: [(0, '819.553')] [2023-03-06 14:54:24,546][04272] Updated weights for policy 0, policy_version 19880 (0.0007) [2023-03-06 14:54:25,370][04272] Updated weights for policy 0, policy_version 19890 (0.0006) [2023-03-06 14:54:26,193][04272] Updated weights for policy 0, policy_version 19900 (0.0007) [2023-03-06 14:54:26,996][04272] Updated weights for policy 0, policy_version 19910 (0.0007) [2023-03-06 14:54:27,799][04272] Updated weights for policy 0, policy_version 19920 (0.0007) [2023-03-06 14:54:28,624][04272] Updated weights for policy 0, policy_version 19930 (0.0006) [2023-03-06 14:54:28,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12561.1, 300 sec: 12583.1). Total num frames: 20411392. Throughput: 0: 12568.3. Samples: 20391292. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:54:28,951][03942] Avg episode reward: [(0, '826.645')] [2023-03-06 14:54:29,423][04272] Updated weights for policy 0, policy_version 19940 (0.0006) [2023-03-06 14:54:30,247][04272] Updated weights for policy 0, policy_version 19950 (0.0007) [2023-03-06 14:54:31,049][04272] Updated weights for policy 0, policy_version 19960 (0.0006) [2023-03-06 14:54:31,866][04272] Updated weights for policy 0, policy_version 19970 (0.0006) [2023-03-06 14:54:32,688][04272] Updated weights for policy 0, policy_version 19980 (0.0006) [2023-03-06 14:54:33,490][04272] Updated weights for policy 0, policy_version 19990 (0.0007) [2023-03-06 14:54:33,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12578.1, 300 sec: 12583.1). Total num frames: 20474880. Throughput: 0: 12589.5. Samples: 20467111. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:54:33,952][03942] Avg episode reward: [(0, '966.021')] [2023-03-06 14:54:34,309][04272] Updated weights for policy 0, policy_version 20000 (0.0007) [2023-03-06 14:54:35,115][04272] Updated weights for policy 0, policy_version 20010 (0.0006) [2023-03-06 14:54:35,943][04272] Updated weights for policy 0, policy_version 20020 (0.0009) [2023-03-06 14:54:36,747][04272] Updated weights for policy 0, policy_version 20030 (0.0006) [2023-03-06 14:54:37,543][04272] Updated weights for policy 0, policy_version 20040 (0.0006) [2023-03-06 14:54:38,358][04272] Updated weights for policy 0, policy_version 20050 (0.0007) [2023-03-06 14:54:38,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12578.1, 300 sec: 12586.5). Total num frames: 20538368. Throughput: 0: 12588.7. Samples: 20504788. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:54:38,952][03942] Avg episode reward: [(0, '1226.586')] [2023-03-06 14:54:38,956][04221] Saving new best policy, reward=1226.586! 
[2023-03-06 14:54:39,186][04272] Updated weights for policy 0, policy_version 20060 (0.0007) [2023-03-06 14:54:40,006][04272] Updated weights for policy 0, policy_version 20070 (0.0006) [2023-03-06 14:54:40,829][04272] Updated weights for policy 0, policy_version 20080 (0.0006) [2023-03-06 14:54:41,644][04272] Updated weights for policy 0, policy_version 20090 (0.0006) [2023-03-06 14:54:42,468][04272] Updated weights for policy 0, policy_version 20100 (0.0006) [2023-03-06 14:54:43,271][04272] Updated weights for policy 0, policy_version 20110 (0.0006) [2023-03-06 14:54:43,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12578.1, 300 sec: 12583.1). Total num frames: 20600832. Throughput: 0: 12585.3. Samples: 20580150. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:54:43,951][03942] Avg episode reward: [(0, '1105.791')] [2023-03-06 14:54:44,087][04272] Updated weights for policy 0, policy_version 20120 (0.0006) [2023-03-06 14:54:44,910][04272] Updated weights for policy 0, policy_version 20130 (0.0006) [2023-03-06 14:54:45,709][04272] Updated weights for policy 0, policy_version 20140 (0.0006) [2023-03-06 14:54:46,516][04272] Updated weights for policy 0, policy_version 20150 (0.0006) [2023-03-06 14:54:47,316][04272] Updated weights for policy 0, policy_version 20160 (0.0006) [2023-03-06 14:54:48,129][04272] Updated weights for policy 0, policy_version 20170 (0.0006) [2023-03-06 14:54:48,923][04272] Updated weights for policy 0, policy_version 20180 (0.0006) [2023-03-06 14:54:48,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12586.5). Total num frames: 20664320. Throughput: 0: 12606.7. Samples: 20655974. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:54:48,952][03942] Avg episode reward: [(0, '1073.065')] [2023-03-06 14:54:49,740][04272] Updated weights for policy 0, policy_version 20190 (0.0007) [2023-03-06 14:54:50,555][04272] Updated weights for policy 0, policy_version 20200 (0.0006) [2023-03-06 14:54:51,352][04272] Updated weights for policy 0, policy_version 20210 (0.0007) [2023-03-06 14:54:52,156][04272] Updated weights for policy 0, policy_version 20220 (0.0007) [2023-03-06 14:54:52,971][04272] Updated weights for policy 0, policy_version 20230 (0.0006) [2023-03-06 14:54:53,783][04272] Updated weights for policy 0, policy_version 20240 (0.0007) [2023-03-06 14:54:53,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12583.1). Total num frames: 20726784. Throughput: 0: 12613.0. Samples: 20694063. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:54:53,951][03942] Avg episode reward: [(0, '1088.970')] [2023-03-06 14:54:54,599][04272] Updated weights for policy 0, policy_version 20250 (0.0007) [2023-03-06 14:54:55,403][04272] Updated weights for policy 0, policy_version 20260 (0.0006) [2023-03-06 14:54:56,217][04272] Updated weights for policy 0, policy_version 20270 (0.0007) [2023-03-06 14:54:57,059][04272] Updated weights for policy 0, policy_version 20280 (0.0006) [2023-03-06 14:54:57,865][04272] Updated weights for policy 0, policy_version 20290 (0.0006) [2023-03-06 14:54:58,650][04272] Updated weights for policy 0, policy_version 20300 (0.0007) [2023-03-06 14:54:58,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12583.0). Total num frames: 20790272. Throughput: 0: 12608.3. Samples: 20769568. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:54:58,952][03942] Avg episode reward: [(0, '998.358')] [2023-03-06 14:54:59,480][04272] Updated weights for policy 0, policy_version 20310 (0.0006) [2023-03-06 14:55:00,284][04272] Updated weights for policy 0, policy_version 20320 (0.0006) [2023-03-06 14:55:01,094][04272] Updated weights for policy 0, policy_version 20330 (0.0007) [2023-03-06 14:55:01,918][04272] Updated weights for policy 0, policy_version 20340 (0.0006) [2023-03-06 14:55:02,732][04272] Updated weights for policy 0, policy_version 20350 (0.0006) [2023-03-06 14:55:03,542][04272] Updated weights for policy 0, policy_version 20360 (0.0006) [2023-03-06 14:55:03,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12586.5). Total num frames: 20853760. Throughput: 0: 12606.3. Samples: 20845547. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:55:03,941][03942] Avg episode reward: [(0, '964.696')] [2023-03-06 14:55:04,354][04272] Updated weights for policy 0, policy_version 20370 (0.0006) [2023-03-06 14:55:05,177][04272] Updated weights for policy 0, policy_version 20380 (0.0006) [2023-03-06 14:55:05,969][04272] Updated weights for policy 0, policy_version 20390 (0.0007) [2023-03-06 14:55:06,775][04272] Updated weights for policy 0, policy_version 20400 (0.0007) [2023-03-06 14:55:07,605][04272] Updated weights for policy 0, policy_version 20410 (0.0007) [2023-03-06 14:55:08,416][04272] Updated weights for policy 0, policy_version 20420 (0.0006) [2023-03-06 14:55:08,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12586.5). Total num frames: 20916224. Throughput: 0: 12608.2. Samples: 20883318. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:55:08,941][03942] Avg episode reward: [(0, '939.631')] [2023-03-06 14:55:08,953][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000020427_20917248.pth... 
[2023-03-06 14:55:08,984][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000017476_17895424.pth [2023-03-06 14:55:09,221][04272] Updated weights for policy 0, policy_version 20430 (0.0007) [2023-03-06 14:55:10,031][04272] Updated weights for policy 0, policy_version 20440 (0.0007) [2023-03-06 14:55:10,869][04272] Updated weights for policy 0, policy_version 20450 (0.0007) [2023-03-06 14:55:11,664][04272] Updated weights for policy 0, policy_version 20460 (0.0007) [2023-03-06 14:55:12,481][04272] Updated weights for policy 0, policy_version 20470 (0.0006) [2023-03-06 14:55:13,294][04272] Updated weights for policy 0, policy_version 20480 (0.0006) [2023-03-06 14:55:13,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12586.5). Total num frames: 20979712. Throughput: 0: 12608.8. Samples: 20958686. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:55:13,941][03942] Avg episode reward: [(0, '964.725')] [2023-03-06 14:55:14,098][04272] Updated weights for policy 0, policy_version 20490 (0.0006) [2023-03-06 14:55:14,914][04272] Updated weights for policy 0, policy_version 20500 (0.0006) [2023-03-06 14:55:15,750][04272] Updated weights for policy 0, policy_version 20510 (0.0007) [2023-03-06 14:55:16,576][04272] Updated weights for policy 0, policy_version 20520 (0.0007) [2023-03-06 14:55:17,404][04272] Updated weights for policy 0, policy_version 20530 (0.0007) [2023-03-06 14:55:18,222][04272] Updated weights for policy 0, policy_version 20540 (0.0006) [2023-03-06 14:55:18,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12586.5). Total num frames: 21042176. Throughput: 0: 12595.6. Samples: 21033914. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:55:18,941][03942] Avg episode reward: [(0, '1079.908')] [2023-03-06 14:55:19,026][04272] Updated weights for policy 0, policy_version 20550 (0.0006) [2023-03-06 14:55:19,844][04272] Updated weights for policy 0, policy_version 20560 (0.0007) [2023-03-06 14:55:20,650][04272] Updated weights for policy 0, policy_version 20570 (0.0006) [2023-03-06 14:55:21,447][04272] Updated weights for policy 0, policy_version 20580 (0.0006) [2023-03-06 14:55:22,269][04272] Updated weights for policy 0, policy_version 20590 (0.0007) [2023-03-06 14:55:23,083][04272] Updated weights for policy 0, policy_version 20600 (0.0006) [2023-03-06 14:55:23,889][04272] Updated weights for policy 0, policy_version 20610 (0.0006) [2023-03-06 14:55:23,941][03942] Fps is (10 sec: 12492.7, 60 sec: 12595.2, 300 sec: 12586.5). Total num frames: 21104640. Throughput: 0: 12600.1. Samples: 21071792. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:55:23,941][03942] Avg episode reward: [(0, '908.705')] [2023-03-06 14:55:24,722][04272] Updated weights for policy 0, policy_version 20620 (0.0007) [2023-03-06 14:55:25,521][04272] Updated weights for policy 0, policy_version 20630 (0.0007) [2023-03-06 14:55:26,341][04272] Updated weights for policy 0, policy_version 20640 (0.0007) [2023-03-06 14:55:27,150][04272] Updated weights for policy 0, policy_version 20650 (0.0006) [2023-03-06 14:55:27,966][04272] Updated weights for policy 0, policy_version 20660 (0.0006) [2023-03-06 14:55:28,780][04272] Updated weights for policy 0, policy_version 20670 (0.0007) [2023-03-06 14:55:28,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12590.0). Total num frames: 21168128. Throughput: 0: 12602.5. Samples: 21147261. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:55:28,941][03942] Avg episode reward: [(0, '818.022')] [2023-03-06 14:55:29,580][04272] Updated weights for policy 0, policy_version 20680 (0.0006) [2023-03-06 14:55:30,405][04272] Updated weights for policy 0, policy_version 20690 (0.0006) [2023-03-06 14:55:31,222][04272] Updated weights for policy 0, policy_version 20700 (0.0006) [2023-03-06 14:55:32,036][04272] Updated weights for policy 0, policy_version 20710 (0.0006) [2023-03-06 14:55:32,834][04272] Updated weights for policy 0, policy_version 20720 (0.0007) [2023-03-06 14:55:33,661][04272] Updated weights for policy 0, policy_version 20730 (0.0007) [2023-03-06 14:55:33,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12586.5). Total num frames: 21230592. Throughput: 0: 12591.6. Samples: 21222594. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:55:33,941][03942] Avg episode reward: [(0, '921.646')] [2023-03-06 14:55:34,455][04272] Updated weights for policy 0, policy_version 20740 (0.0006) [2023-03-06 14:55:35,285][04272] Updated weights for policy 0, policy_version 20750 (0.0008) [2023-03-06 14:55:36,099][04272] Updated weights for policy 0, policy_version 20760 (0.0006) [2023-03-06 14:55:36,907][04272] Updated weights for policy 0, policy_version 20770 (0.0006) [2023-03-06 14:55:37,727][04272] Updated weights for policy 0, policy_version 20780 (0.0007) [2023-03-06 14:55:38,538][04272] Updated weights for policy 0, policy_version 20790 (0.0006) [2023-03-06 14:55:38,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12590.0). Total num frames: 21294080. Throughput: 0: 12586.1. Samples: 21260439. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:55:38,941][03942] Avg episode reward: [(0, '820.702')] [2023-03-06 14:55:39,346][04272] Updated weights for policy 0, policy_version 20800 (0.0006) [2023-03-06 14:55:40,173][04272] Updated weights for policy 0, policy_version 20810 (0.0007) [2023-03-06 14:55:40,975][04272] Updated weights for policy 0, policy_version 20820 (0.0007) [2023-03-06 14:55:41,789][04272] Updated weights for policy 0, policy_version 20830 (0.0006) [2023-03-06 14:55:42,609][04272] Updated weights for policy 0, policy_version 20840 (0.0007) [2023-03-06 14:55:43,425][04272] Updated weights for policy 0, policy_version 20850 (0.0006) [2023-03-06 14:55:43,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12586.5). Total num frames: 21356544. Throughput: 0: 12586.8. Samples: 21335973. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:55:43,941][03942] Avg episode reward: [(0, '892.502')] [2023-03-06 14:55:44,237][04272] Updated weights for policy 0, policy_version 20860 (0.0006) [2023-03-06 14:55:45,053][04272] Updated weights for policy 0, policy_version 20870 (0.0006) [2023-03-06 14:55:45,864][04272] Updated weights for policy 0, policy_version 20880 (0.0006) [2023-03-06 14:55:46,669][04272] Updated weights for policy 0, policy_version 20890 (0.0006) [2023-03-06 14:55:47,485][04272] Updated weights for policy 0, policy_version 20900 (0.0006) [2023-03-06 14:55:48,309][04272] Updated weights for policy 0, policy_version 20910 (0.0006) [2023-03-06 14:55:48,940][03942] Fps is (10 sec: 12492.7, 60 sec: 12578.1, 300 sec: 12586.5). Total num frames: 21419008. Throughput: 0: 12579.1. Samples: 21411607. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:55:48,941][03942] Avg episode reward: [(0, '827.741')] [2023-03-06 14:55:49,107][04272] Updated weights for policy 0, policy_version 20920 (0.0008) [2023-03-06 14:55:49,909][04272] Updated weights for policy 0, policy_version 20930 (0.0006) [2023-03-06 14:55:50,736][04272] Updated weights for policy 0, policy_version 20940 (0.0007) [2023-03-06 14:55:51,547][04272] Updated weights for policy 0, policy_version 20950 (0.0007) [2023-03-06 14:55:52,356][04272] Updated weights for policy 0, policy_version 20960 (0.0008) [2023-03-06 14:55:53,165][04272] Updated weights for policy 0, policy_version 20970 (0.0005) [2023-03-06 14:55:53,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12586.5). Total num frames: 21482496. Throughput: 0: 12577.7. Samples: 21449313. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:55:53,941][03942] Avg episode reward: [(0, '974.149')] [2023-03-06 14:55:54,004][04272] Updated weights for policy 0, policy_version 20980 (0.0006) [2023-03-06 14:55:54,813][04272] Updated weights for policy 0, policy_version 20990 (0.0006) [2023-03-06 14:55:55,613][04272] Updated weights for policy 0, policy_version 21000 (0.0006) [2023-03-06 14:55:56,428][04272] Updated weights for policy 0, policy_version 21010 (0.0007) [2023-03-06 14:55:57,229][04272] Updated weights for policy 0, policy_version 21020 (0.0006) [2023-03-06 14:55:58,034][04272] Updated weights for policy 0, policy_version 21030 (0.0006) [2023-03-06 14:55:58,845][04272] Updated weights for policy 0, policy_version 21040 (0.0006) [2023-03-06 14:55:58,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12595.2, 300 sec: 12590.0). Total num frames: 21545984. Throughput: 0: 12589.5. Samples: 21525216. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:55:58,941][03942] Avg episode reward: [(0, '948.480')] [2023-03-06 14:55:59,635][04272] Updated weights for policy 0, policy_version 21050 (0.0006) [2023-03-06 14:56:00,446][04272] Updated weights for policy 0, policy_version 21060 (0.0006) [2023-03-06 14:56:01,262][04272] Updated weights for policy 0, policy_version 21070 (0.0006) [2023-03-06 14:56:02,073][04272] Updated weights for policy 0, policy_version 21080 (0.0006) [2023-03-06 14:56:02,892][04272] Updated weights for policy 0, policy_version 21090 (0.0007) [2023-03-06 14:56:03,702][04272] Updated weights for policy 0, policy_version 21100 (0.0006) [2023-03-06 14:56:03,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12595.2, 300 sec: 12590.0). Total num frames: 21609472. Throughput: 0: 12604.3. Samples: 21601110. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:56:03,941][03942] Avg episode reward: [(0, '1321.990')] [2023-03-06 14:56:03,941][04221] Saving new best policy, reward=1321.990! [2023-03-06 14:56:04,514][04272] Updated weights for policy 0, policy_version 21110 (0.0007) [2023-03-06 14:56:05,335][04272] Updated weights for policy 0, policy_version 21120 (0.0007) [2023-03-06 14:56:06,146][04272] Updated weights for policy 0, policy_version 21130 (0.0006) [2023-03-06 14:56:06,960][04272] Updated weights for policy 0, policy_version 21140 (0.0006) [2023-03-06 14:56:07,774][04272] Updated weights for policy 0, policy_version 21150 (0.0006) [2023-03-06 14:56:08,567][04272] Updated weights for policy 0, policy_version 21160 (0.0007) [2023-03-06 14:56:08,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12590.0). Total num frames: 21671936. Throughput: 0: 12604.8. Samples: 21639005. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:56:08,941][03942] Avg episode reward: [(0, '1182.753')] [2023-03-06 14:56:09,389][04272] Updated weights for policy 0, policy_version 21170 (0.0006) [2023-03-06 14:56:10,191][04272] Updated weights for policy 0, policy_version 21180 (0.0006) [2023-03-06 14:56:10,991][04272] Updated weights for policy 0, policy_version 21190 (0.0006) [2023-03-06 14:56:11,797][04272] Updated weights for policy 0, policy_version 21200 (0.0006) [2023-03-06 14:56:12,609][04272] Updated weights for policy 0, policy_version 21210 (0.0005) [2023-03-06 14:56:13,422][04272] Updated weights for policy 0, policy_version 21220 (0.0007) [2023-03-06 14:56:13,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12593.5). Total num frames: 21735424. Throughput: 0: 12613.7. Samples: 21714880. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:56:13,952][03942] Avg episode reward: [(0, '1006.610')] [2023-03-06 14:56:14,227][04272] Updated weights for policy 0, policy_version 21230 (0.0007) [2023-03-06 14:56:15,034][04272] Updated weights for policy 0, policy_version 21240 (0.0006) [2023-03-06 14:56:15,872][04272] Updated weights for policy 0, policy_version 21250 (0.0006) [2023-03-06 14:56:16,678][04272] Updated weights for policy 0, policy_version 21260 (0.0006) [2023-03-06 14:56:17,490][04272] Updated weights for policy 0, policy_version 21270 (0.0007) [2023-03-06 14:56:18,302][04272] Updated weights for policy 0, policy_version 21280 (0.0006) [2023-03-06 14:56:18,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12590.0). Total num frames: 21797888. Throughput: 0: 12619.3. Samples: 21790461. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:56:18,941][03942] Avg episode reward: [(0, '1090.813')] [2023-03-06 14:56:19,115][04272] Updated weights for policy 0, policy_version 21290 (0.0006) [2023-03-06 14:56:19,944][04272] Updated weights for policy 0, policy_version 21300 (0.0006) [2023-03-06 14:56:20,755][04272] Updated weights for policy 0, policy_version 21310 (0.0007) [2023-03-06 14:56:21,564][04272] Updated weights for policy 0, policy_version 21320 (0.0006) [2023-03-06 14:56:22,393][04272] Updated weights for policy 0, policy_version 21330 (0.0007) [2023-03-06 14:56:23,189][04272] Updated weights for policy 0, policy_version 21340 (0.0007) [2023-03-06 14:56:23,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12593.5). Total num frames: 21861376. Throughput: 0: 12616.6. Samples: 21828188. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:56:23,941][03942] Avg episode reward: [(0, '1238.437')] [2023-03-06 14:56:23,995][04272] Updated weights for policy 0, policy_version 21350 (0.0007) [2023-03-06 14:56:24,815][04272] Updated weights for policy 0, policy_version 21360 (0.0006) [2023-03-06 14:56:25,631][04272] Updated weights for policy 0, policy_version 21370 (0.0006) [2023-03-06 14:56:26,442][04272] Updated weights for policy 0, policy_version 21380 (0.0006) [2023-03-06 14:56:27,261][04272] Updated weights for policy 0, policy_version 21390 (0.0006) [2023-03-06 14:56:28,081][04272] Updated weights for policy 0, policy_version 21400 (0.0006) [2023-03-06 14:56:28,894][04272] Updated weights for policy 0, policy_version 21410 (0.0006) [2023-03-06 14:56:28,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12593.5). Total num frames: 21923840. Throughput: 0: 12612.7. Samples: 21903542. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:56:28,941][03942] Avg episode reward: [(0, '1189.698')] [2023-03-06 14:56:29,695][04272] Updated weights for policy 0, policy_version 21420 (0.0006) [2023-03-06 14:56:30,504][04272] Updated weights for policy 0, policy_version 21430 (0.0006) [2023-03-06 14:56:31,315][04272] Updated weights for policy 0, policy_version 21440 (0.0006) [2023-03-06 14:56:32,119][04272] Updated weights for policy 0, policy_version 21450 (0.0006) [2023-03-06 14:56:32,948][04272] Updated weights for policy 0, policy_version 21460 (0.0006) [2023-03-06 14:56:33,753][04272] Updated weights for policy 0, policy_version 21470 (0.0006) [2023-03-06 14:56:33,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12593.5). Total num frames: 21987328. Throughput: 0: 12614.3. Samples: 21979249. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:56:33,941][03942] Avg episode reward: [(0, '1302.517')] [2023-03-06 14:56:34,570][04272] Updated weights for policy 0, policy_version 21480 (0.0006) [2023-03-06 14:56:35,405][04272] Updated weights for policy 0, policy_version 21490 (0.0006) [2023-03-06 14:56:36,221][04272] Updated weights for policy 0, policy_version 21500 (0.0006) [2023-03-06 14:56:37,007][04272] Updated weights for policy 0, policy_version 21510 (0.0007) [2023-03-06 14:56:37,838][04272] Updated weights for policy 0, policy_version 21520 (0.0006) [2023-03-06 14:56:38,654][04272] Updated weights for policy 0, policy_version 21530 (0.0006) [2023-03-06 14:56:38,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12593.5). Total num frames: 22049792. Throughput: 0: 12610.5. Samples: 22016784. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:56:38,941][03942] Avg episode reward: [(0, '1303.849')] [2023-03-06 14:56:39,463][04272] Updated weights for policy 0, policy_version 21540 (0.0006) [2023-03-06 14:56:40,265][04272] Updated weights for policy 0, policy_version 21550 (0.0006) [2023-03-06 14:56:41,090][04272] Updated weights for policy 0, policy_version 21560 (0.0007) [2023-03-06 14:56:41,893][04272] Updated weights for policy 0, policy_version 21570 (0.0006) [2023-03-06 14:56:42,689][04272] Updated weights for policy 0, policy_version 21580 (0.0007) [2023-03-06 14:56:43,503][04272] Updated weights for policy 0, policy_version 21590 (0.0006) [2023-03-06 14:56:43,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12593.5). Total num frames: 22113280. Throughput: 0: 12605.8. Samples: 22092477. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:56:43,941][03942] Avg episode reward: [(0, '1226.517')] [2023-03-06 14:56:44,330][04272] Updated weights for policy 0, policy_version 21600 (0.0006) [2023-03-06 14:56:45,134][04272] Updated weights for policy 0, policy_version 21610 (0.0006) [2023-03-06 14:56:45,941][04272] Updated weights for policy 0, policy_version 21620 (0.0006) [2023-03-06 14:56:46,753][04272] Updated weights for policy 0, policy_version 21630 (0.0007) [2023-03-06 14:56:47,566][04272] Updated weights for policy 0, policy_version 21640 (0.0007) [2023-03-06 14:56:48,381][04272] Updated weights for policy 0, policy_version 21650 (0.0006) [2023-03-06 14:56:48,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.2, 300 sec: 12593.5). Total num frames: 22175744. Throughput: 0: 12603.1. Samples: 22168251. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:56:48,941][03942] Avg episode reward: [(0, '1350.144')] [2023-03-06 14:56:48,944][04221] Saving new best policy, reward=1350.144! 
[2023-03-06 14:56:49,196][04272] Updated weights for policy 0, policy_version 21660 (0.0006) [2023-03-06 14:56:50,010][04272] Updated weights for policy 0, policy_version 21670 (0.0006) [2023-03-06 14:56:50,825][04272] Updated weights for policy 0, policy_version 21680 (0.0006) [2023-03-06 14:56:51,643][04272] Updated weights for policy 0, policy_version 21690 (0.0006) [2023-03-06 14:56:52,456][04272] Updated weights for policy 0, policy_version 21700 (0.0007) [2023-03-06 14:56:53,257][04272] Updated weights for policy 0, policy_version 21710 (0.0006) [2023-03-06 14:56:53,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12593.5). Total num frames: 22239232. Throughput: 0: 12599.1. Samples: 22205965. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:56:53,941][03942] Avg episode reward: [(0, '1370.579')] [2023-03-06 14:56:53,941][04221] Saving new best policy, reward=1370.579! [2023-03-06 14:56:54,064][04272] Updated weights for policy 0, policy_version 21720 (0.0006) [2023-03-06 14:56:54,890][04272] Updated weights for policy 0, policy_version 21730 (0.0005) [2023-03-06 14:56:55,689][04272] Updated weights for policy 0, policy_version 21740 (0.0006) [2023-03-06 14:56:56,510][04272] Updated weights for policy 0, policy_version 21750 (0.0006) [2023-03-06 14:56:57,329][04272] Updated weights for policy 0, policy_version 21760 (0.0006) [2023-03-06 14:56:58,148][04272] Updated weights for policy 0, policy_version 21770 (0.0006) [2023-03-06 14:56:58,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12593.5). Total num frames: 22301696. Throughput: 0: 12592.9. Samples: 22281559. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:56:58,941][03942] Avg episode reward: [(0, '1388.980')] [2023-03-06 14:56:58,944][04221] Saving new best policy, reward=1388.980! 
[2023-03-06 14:56:58,994][04272] Updated weights for policy 0, policy_version 21780 (0.0006) [2023-03-06 14:56:59,767][04272] Updated weights for policy 0, policy_version 21790 (0.0006) [2023-03-06 14:57:00,590][04272] Updated weights for policy 0, policy_version 21800 (0.0006) [2023-03-06 14:57:01,410][04272] Updated weights for policy 0, policy_version 21810 (0.0006) [2023-03-06 14:57:02,208][04272] Updated weights for policy 0, policy_version 21820 (0.0007) [2023-03-06 14:57:03,049][04272] Updated weights for policy 0, policy_version 21830 (0.0008) [2023-03-06 14:57:03,857][04272] Updated weights for policy 0, policy_version 21840 (0.0006) [2023-03-06 14:57:03,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12593.5). Total num frames: 22365184. Throughput: 0: 12590.4. Samples: 22357029. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:57:03,941][03942] Avg episode reward: [(0, '1364.515')] [2023-03-06 14:57:04,669][04272] Updated weights for policy 0, policy_version 21850 (0.0006) [2023-03-06 14:57:05,474][04272] Updated weights for policy 0, policy_version 21860 (0.0006) [2023-03-06 14:57:06,294][04272] Updated weights for policy 0, policy_version 21870 (0.0006) [2023-03-06 14:57:07,109][04272] Updated weights for policy 0, policy_version 21880 (0.0006) [2023-03-06 14:57:07,913][04272] Updated weights for policy 0, policy_version 21890 (0.0006) [2023-03-06 14:57:08,738][04272] Updated weights for policy 0, policy_version 21900 (0.0007) [2023-03-06 14:57:08,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12590.0). Total num frames: 22427648. Throughput: 0: 12592.6. Samples: 22394856. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:57:08,941][03942] Avg episode reward: [(0, '1365.729')] [2023-03-06 14:57:08,945][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000021902_22427648.pth... 
[2023-03-06 14:57:08,974][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000018951_19405824.pth [2023-03-06 14:57:09,561][04272] Updated weights for policy 0, policy_version 21910 (0.0006) [2023-03-06 14:57:10,369][04272] Updated weights for policy 0, policy_version 21920 (0.0007) [2023-03-06 14:57:11,183][04272] Updated weights for policy 0, policy_version 21930 (0.0006) [2023-03-06 14:57:11,985][04272] Updated weights for policy 0, policy_version 21940 (0.0006) [2023-03-06 14:57:12,807][04272] Updated weights for policy 0, policy_version 21950 (0.0007) [2023-03-06 14:57:13,631][04272] Updated weights for policy 0, policy_version 21960 (0.0006) [2023-03-06 14:57:13,941][03942] Fps is (10 sec: 12492.7, 60 sec: 12578.1, 300 sec: 12590.0). Total num frames: 22490112. Throughput: 0: 12590.6. Samples: 22470119. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:57:13,941][03942] Avg episode reward: [(0, '1351.023')] [2023-03-06 14:57:14,443][04272] Updated weights for policy 0, policy_version 21970 (0.0006) [2023-03-06 14:57:15,257][04272] Updated weights for policy 0, policy_version 21980 (0.0006) [2023-03-06 14:57:16,072][04272] Updated weights for policy 0, policy_version 21990 (0.0006) [2023-03-06 14:57:16,876][04272] Updated weights for policy 0, policy_version 22000 (0.0006) [2023-03-06 14:57:17,709][04272] Updated weights for policy 0, policy_version 22010 (0.0007) [2023-03-06 14:57:18,525][04272] Updated weights for policy 0, policy_version 22020 (0.0006) [2023-03-06 14:57:18,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12593.5). Total num frames: 22553600. Throughput: 0: 12580.3. Samples: 22545364. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-06 14:57:18,941][03942] Avg episode reward: [(0, '1119.837')] [2023-03-06 14:57:19,346][04272] Updated weights for policy 0, policy_version 22030 (0.0006) [2023-03-06 14:57:20,159][04272] Updated weights for policy 0, policy_version 22040 (0.0007) [2023-03-06 14:57:20,977][04272] Updated weights for policy 0, policy_version 22050 (0.0006) [2023-03-06 14:57:21,782][04272] Updated weights for policy 0, policy_version 22060 (0.0006) [2023-03-06 14:57:22,584][04272] Updated weights for policy 0, policy_version 22070 (0.0007) [2023-03-06 14:57:23,406][04272] Updated weights for policy 0, policy_version 22080 (0.0007) [2023-03-06 14:57:23,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12578.1, 300 sec: 12590.0). Total num frames: 22616064. Throughput: 0: 12584.9. Samples: 22583102. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-06 14:57:23,941][03942] Avg episode reward: [(0, '1209.423')] [2023-03-06 14:57:24,212][04272] Updated weights for policy 0, policy_version 22090 (0.0006) [2023-03-06 14:57:25,022][04272] Updated weights for policy 0, policy_version 22100 (0.0006) [2023-03-06 14:57:25,838][04272] Updated weights for policy 0, policy_version 22110 (0.0006) [2023-03-06 14:57:26,646][04272] Updated weights for policy 0, policy_version 22120 (0.0006) [2023-03-06 14:57:27,475][04272] Updated weights for policy 0, policy_version 22130 (0.0006) [2023-03-06 14:57:28,280][04272] Updated weights for policy 0, policy_version 22140 (0.0006) [2023-03-06 14:57:28,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12590.0). Total num frames: 22679552. Throughput: 0: 12582.9. Samples: 22658709. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-06 14:57:28,941][03942] Avg episode reward: [(0, '1197.822')] [2023-03-06 14:57:29,087][04272] Updated weights for policy 0, policy_version 22150 (0.0006) [2023-03-06 14:57:29,923][04272] Updated weights for policy 0, policy_version 22160 (0.0006) [2023-03-06 14:57:30,740][04272] Updated weights for policy 0, policy_version 22170 (0.0006) [2023-03-06 14:57:31,548][04272] Updated weights for policy 0, policy_version 22180 (0.0006) [2023-03-06 14:57:32,370][04272] Updated weights for policy 0, policy_version 22190 (0.0007) [2023-03-06 14:57:33,183][04272] Updated weights for policy 0, policy_version 22200 (0.0006) [2023-03-06 14:57:33,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12578.1, 300 sec: 12590.0). Total num frames: 22742016. Throughput: 0: 12573.1. Samples: 22734039. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:57:33,941][03942] Avg episode reward: [(0, '1267.578')] [2023-03-06 14:57:33,991][04272] Updated weights for policy 0, policy_version 22210 (0.0006) [2023-03-06 14:57:34,819][04272] Updated weights for policy 0, policy_version 22220 (0.0007) [2023-03-06 14:57:35,618][04272] Updated weights for policy 0, policy_version 22230 (0.0007) [2023-03-06 14:57:36,442][04272] Updated weights for policy 0, policy_version 22240 (0.0007) [2023-03-06 14:57:37,245][04272] Updated weights for policy 0, policy_version 22250 (0.0006) [2023-03-06 14:57:38,054][04272] Updated weights for policy 0, policy_version 22260 (0.0006) [2023-03-06 14:57:38,865][04272] Updated weights for policy 0, policy_version 22270 (0.0007) [2023-03-06 14:57:38,940][03942] Fps is (10 sec: 12492.9, 60 sec: 12578.2, 300 sec: 12590.0). Total num frames: 22804480. Throughput: 0: 12574.8. Samples: 22771831. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:57:38,941][03942] Avg episode reward: [(0, '1339.095')] [2023-03-06 14:57:39,689][04272] Updated weights for policy 0, policy_version 22280 (0.0006) [2023-03-06 14:57:40,497][04272] Updated weights for policy 0, policy_version 22290 (0.0006) [2023-03-06 14:57:41,317][04272] Updated weights for policy 0, policy_version 22300 (0.0006) [2023-03-06 14:57:42,128][04272] Updated weights for policy 0, policy_version 22310 (0.0005) [2023-03-06 14:57:42,917][04272] Updated weights for policy 0, policy_version 22320 (0.0006) [2023-03-06 14:57:43,766][04272] Updated weights for policy 0, policy_version 22330 (0.0006) [2023-03-06 14:57:43,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12578.1, 300 sec: 12590.0). Total num frames: 22867968. Throughput: 0: 12574.9. Samples: 22847428. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:57:43,941][03942] Avg episode reward: [(0, '1168.428')] [2023-03-06 14:57:44,573][04272] Updated weights for policy 0, policy_version 22340 (0.0006) [2023-03-06 14:57:45,378][04272] Updated weights for policy 0, policy_version 22350 (0.0006) [2023-03-06 14:57:46,203][04272] Updated weights for policy 0, policy_version 22360 (0.0008) [2023-03-06 14:57:47,012][04272] Updated weights for policy 0, policy_version 22370 (0.0006) [2023-03-06 14:57:47,835][04272] Updated weights for policy 0, policy_version 22380 (0.0006) [2023-03-06 14:57:48,645][04272] Updated weights for policy 0, policy_version 22390 (0.0007) [2023-03-06 14:57:48,940][03942] Fps is (10 sec: 12595.1, 60 sec: 12578.2, 300 sec: 12590.0). Total num frames: 22930432. Throughput: 0: 12572.5. Samples: 22922790. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:57:48,941][03942] Avg episode reward: [(0, '1349.998')] [2023-03-06 14:57:49,441][04272] Updated weights for policy 0, policy_version 22400 (0.0006) [2023-03-06 14:57:50,271][04272] Updated weights for policy 0, policy_version 22410 (0.0007) [2023-03-06 14:57:51,071][04272] Updated weights for policy 0, policy_version 22420 (0.0007) [2023-03-06 14:57:51,874][04272] Updated weights for policy 0, policy_version 22430 (0.0007) [2023-03-06 14:57:52,677][04272] Updated weights for policy 0, policy_version 22440 (0.0006) [2023-03-06 14:57:53,491][04272] Updated weights for policy 0, policy_version 22450 (0.0006) [2023-03-06 14:57:53,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12578.1, 300 sec: 12590.0). Total num frames: 22993920. Throughput: 0: 12576.5. Samples: 22960796. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:57:53,941][03942] Avg episode reward: [(0, '1392.875')] [2023-03-06 14:57:53,941][04221] Saving new best policy, reward=1392.875! [2023-03-06 14:57:54,292][04272] Updated weights for policy 0, policy_version 22460 (0.0006) [2023-03-06 14:57:55,114][04272] Updated weights for policy 0, policy_version 22470 (0.0006) [2023-03-06 14:57:55,929][04272] Updated weights for policy 0, policy_version 22480 (0.0007) [2023-03-06 14:57:56,747][04272] Updated weights for policy 0, policy_version 22490 (0.0007) [2023-03-06 14:57:57,559][04272] Updated weights for policy 0, policy_version 22500 (0.0007) [2023-03-06 14:57:58,399][04272] Updated weights for policy 0, policy_version 22510 (0.0006) [2023-03-06 14:57:58,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12578.1, 300 sec: 12586.5). Total num frames: 23056384. Throughput: 0: 12583.9. Samples: 23036395. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:57:58,941][03942] Avg episode reward: [(0, '1475.023')] [2023-03-06 14:57:58,956][04221] Saving new best policy, reward=1475.023! 
[2023-03-06 14:57:59,193][04272] Updated weights for policy 0, policy_version 22520 (0.0006) [2023-03-06 14:58:00,001][04272] Updated weights for policy 0, policy_version 22530 (0.0006) [2023-03-06 14:58:00,827][04272] Updated weights for policy 0, policy_version 22540 (0.0006) [2023-03-06 14:58:01,627][04272] Updated weights for policy 0, policy_version 22550 (0.0006) [2023-03-06 14:58:02,435][04272] Updated weights for policy 0, policy_version 22560 (0.0006) [2023-03-06 14:58:03,243][04272] Updated weights for policy 0, policy_version 22570 (0.0006) [2023-03-06 14:58:03,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12578.1, 300 sec: 12590.0). Total num frames: 23119872. Throughput: 0: 12591.9. Samples: 23112002. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:58:03,941][03942] Avg episode reward: [(0, '1350.262')] [2023-03-06 14:58:04,080][04272] Updated weights for policy 0, policy_version 22580 (0.0007) [2023-03-06 14:58:04,873][04272] Updated weights for policy 0, policy_version 22590 (0.0006) [2023-03-06 14:58:05,689][04272] Updated weights for policy 0, policy_version 22600 (0.0006) [2023-03-06 14:58:06,509][04272] Updated weights for policy 0, policy_version 22610 (0.0006) [2023-03-06 14:58:07,320][04272] Updated weights for policy 0, policy_version 22620 (0.0006) [2023-03-06 14:58:08,130][04272] Updated weights for policy 0, policy_version 22630 (0.0006) [2023-03-06 14:58:08,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12578.1, 300 sec: 12590.0). Total num frames: 23182336. Throughput: 0: 12590.6. Samples: 23149677. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:58:08,941][03942] Avg episode reward: [(0, '1445.937')] [2023-03-06 14:58:08,970][04272] Updated weights for policy 0, policy_version 22640 (0.0007) [2023-03-06 14:58:09,764][04272] Updated weights for policy 0, policy_version 22650 (0.0007) [2023-03-06 14:58:10,599][04272] Updated weights for policy 0, policy_version 22660 (0.0006) [2023-03-06 14:58:11,398][04272] Updated weights for policy 0, policy_version 22670 (0.0006) [2023-03-06 14:58:12,195][04272] Updated weights for policy 0, policy_version 22680 (0.0007) [2023-03-06 14:58:13,028][04272] Updated weights for policy 0, policy_version 22690 (0.0007) [2023-03-06 14:58:13,831][04272] Updated weights for policy 0, policy_version 22700 (0.0006) [2023-03-06 14:58:13,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12590.0). Total num frames: 23245824. Throughput: 0: 12589.6. Samples: 23225241. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:58:13,941][03942] Avg episode reward: [(0, '1335.619')] [2023-03-06 14:58:14,637][04272] Updated weights for policy 0, policy_version 22710 (0.0006) [2023-03-06 14:58:15,470][04272] Updated weights for policy 0, policy_version 22720 (0.0008) [2023-03-06 14:58:16,275][04272] Updated weights for policy 0, policy_version 22730 (0.0006) [2023-03-06 14:58:17,088][04272] Updated weights for policy 0, policy_version 22740 (0.0006) [2023-03-06 14:58:17,889][04272] Updated weights for policy 0, policy_version 22750 (0.0007) [2023-03-06 14:58:18,705][04272] Updated weights for policy 0, policy_version 22760 (0.0006) [2023-03-06 14:58:18,940][03942] Fps is (10 sec: 12595.4, 60 sec: 12578.1, 300 sec: 12590.0). Total num frames: 23308288. Throughput: 0: 12596.7. Samples: 23300887. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:58:18,941][03942] Avg episode reward: [(0, '1316.850')] [2023-03-06 14:58:19,509][04272] Updated weights for policy 0, policy_version 22770 (0.0007) [2023-03-06 14:58:20,325][04272] Updated weights for policy 0, policy_version 22780 (0.0007) [2023-03-06 14:58:21,141][04272] Updated weights for policy 0, policy_version 22790 (0.0008) [2023-03-06 14:58:21,949][04272] Updated weights for policy 0, policy_version 22800 (0.0006) [2023-03-06 14:58:22,761][04272] Updated weights for policy 0, policy_version 22810 (0.0006) [2023-03-06 14:58:23,587][04272] Updated weights for policy 0, policy_version 22820 (0.0006) [2023-03-06 14:58:23,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12590.0). Total num frames: 23371776. Throughput: 0: 12594.8. Samples: 23338596. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:58:23,941][03942] Avg episode reward: [(0, '1358.535')] [2023-03-06 14:58:24,406][04272] Updated weights for policy 0, policy_version 22830 (0.0008) [2023-03-06 14:58:25,206][04272] Updated weights for policy 0, policy_version 22840 (0.0006) [2023-03-06 14:58:26,029][04272] Updated weights for policy 0, policy_version 22850 (0.0006) [2023-03-06 14:58:26,845][04272] Updated weights for policy 0, policy_version 22860 (0.0006) [2023-03-06 14:58:27,644][04272] Updated weights for policy 0, policy_version 22870 (0.0006) [2023-03-06 14:58:28,468][04272] Updated weights for policy 0, policy_version 22880 (0.0006) [2023-03-06 14:58:28,941][03942] Fps is (10 sec: 12595.0, 60 sec: 12578.1, 300 sec: 12590.0). Total num frames: 23434240. Throughput: 0: 12598.8. Samples: 23414376. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:58:28,941][03942] Avg episode reward: [(0, '1220.396')] [2023-03-06 14:58:29,293][04272] Updated weights for policy 0, policy_version 22890 (0.0006) [2023-03-06 14:58:30,078][04272] Updated weights for policy 0, policy_version 22900 (0.0007) [2023-03-06 14:58:30,898][04272] Updated weights for policy 0, policy_version 22910 (0.0006) [2023-03-06 14:58:31,701][04272] Updated weights for policy 0, policy_version 22920 (0.0006) [2023-03-06 14:58:32,518][04272] Updated weights for policy 0, policy_version 22930 (0.0006) [2023-03-06 14:58:33,341][04272] Updated weights for policy 0, policy_version 22940 (0.0005) [2023-03-06 14:58:33,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12590.0). Total num frames: 23497728. Throughput: 0: 12602.8. Samples: 23489918. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:58:33,941][03942] Avg episode reward: [(0, '1275.111')] [2023-03-06 14:58:34,143][04272] Updated weights for policy 0, policy_version 22950 (0.0007) [2023-03-06 14:58:34,957][04272] Updated weights for policy 0, policy_version 22960 (0.0007) [2023-03-06 14:58:35,782][04272] Updated weights for policy 0, policy_version 22970 (0.0006) [2023-03-06 14:58:36,608][04272] Updated weights for policy 0, policy_version 22980 (0.0006) [2023-03-06 14:58:37,413][04272] Updated weights for policy 0, policy_version 22990 (0.0007) [2023-03-06 14:58:38,213][04272] Updated weights for policy 0, policy_version 23000 (0.0006) [2023-03-06 14:58:38,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12590.0). Total num frames: 23560192. Throughput: 0: 12596.2. Samples: 23527625. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:58:38,941][03942] Avg episode reward: [(0, '1247.037')] [2023-03-06 14:58:39,043][04272] Updated weights for policy 0, policy_version 23010 (0.0007) [2023-03-06 14:58:39,843][04272] Updated weights for policy 0, policy_version 23020 (0.0006) [2023-03-06 14:58:40,661][04272] Updated weights for policy 0, policy_version 23030 (0.0006) [2023-03-06 14:58:41,477][04272] Updated weights for policy 0, policy_version 23040 (0.0006) [2023-03-06 14:58:42,301][04272] Updated weights for policy 0, policy_version 23050 (0.0006) [2023-03-06 14:58:43,103][04272] Updated weights for policy 0, policy_version 23060 (0.0006) [2023-03-06 14:58:43,933][04272] Updated weights for policy 0, policy_version 23070 (0.0006) [2023-03-06 14:58:43,940][03942] Fps is (10 sec: 12595.4, 60 sec: 12595.2, 300 sec: 12593.5). Total num frames: 23623680. Throughput: 0: 12590.0. Samples: 23602943. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:58:43,941][03942] Avg episode reward: [(0, '1201.151')] [2023-03-06 14:58:44,729][04272] Updated weights for policy 0, policy_version 23080 (0.0007) [2023-03-06 14:58:45,533][04272] Updated weights for policy 0, policy_version 23090 (0.0006) [2023-03-06 14:58:46,348][04272] Updated weights for policy 0, policy_version 23100 (0.0006) [2023-03-06 14:58:47,165][04272] Updated weights for policy 0, policy_version 23110 (0.0006) [2023-03-06 14:58:47,974][04272] Updated weights for policy 0, policy_version 23120 (0.0006) [2023-03-06 14:58:48,792][04272] Updated weights for policy 0, policy_version 23130 (0.0006) [2023-03-06 14:58:48,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12593.5). Total num frames: 23686144. Throughput: 0: 12593.7. Samples: 23678719. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:58:48,941][03942] Avg episode reward: [(0, '1187.170')] [2023-03-06 14:58:49,610][04272] Updated weights for policy 0, policy_version 23140 (0.0006) [2023-03-06 14:58:50,404][04272] Updated weights for policy 0, policy_version 23150 (0.0006) [2023-03-06 14:58:51,223][04272] Updated weights for policy 0, policy_version 23160 (0.0007) [2023-03-06 14:58:52,025][04272] Updated weights for policy 0, policy_version 23170 (0.0006) [2023-03-06 14:58:52,841][04272] Updated weights for policy 0, policy_version 23180 (0.0006) [2023-03-06 14:58:53,652][04272] Updated weights for policy 0, policy_version 23190 (0.0006) [2023-03-06 14:58:53,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12596.9). Total num frames: 23749632. Throughput: 0: 12599.2. Samples: 23716642. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:58:53,952][03942] Avg episode reward: [(0, '957.017')] [2023-03-06 14:58:54,479][04272] Updated weights for policy 0, policy_version 23200 (0.0006) [2023-03-06 14:58:55,278][04272] Updated weights for policy 0, policy_version 23210 (0.0008) [2023-03-06 14:58:56,078][04272] Updated weights for policy 0, policy_version 23220 (0.0006) [2023-03-06 14:58:56,905][04272] Updated weights for policy 0, policy_version 23230 (0.0006) [2023-03-06 14:58:57,709][04272] Updated weights for policy 0, policy_version 23240 (0.0006) [2023-03-06 14:58:58,533][04272] Updated weights for policy 0, policy_version 23250 (0.0006) [2023-03-06 14:58:58,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12612.3, 300 sec: 12600.4). Total num frames: 23813120. Throughput: 0: 12601.9. Samples: 23792326. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:58:58,951][03942] Avg episode reward: [(0, '964.826')] [2023-03-06 14:58:59,332][04272] Updated weights for policy 0, policy_version 23260 (0.0007) [2023-03-06 14:59:00,154][04272] Updated weights for policy 0, policy_version 23270 (0.0006) [2023-03-06 14:59:00,954][04272] Updated weights for policy 0, policy_version 23280 (0.0006) [2023-03-06 14:59:01,774][04272] Updated weights for policy 0, policy_version 23290 (0.0006) [2023-03-06 14:59:02,579][04272] Updated weights for policy 0, policy_version 23300 (0.0007) [2023-03-06 14:59:03,390][04272] Updated weights for policy 0, policy_version 23310 (0.0006) [2023-03-06 14:59:03,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12596.9). Total num frames: 23875584. Throughput: 0: 12603.7. Samples: 23868057. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:59:03,952][03942] Avg episode reward: [(0, '1106.293')] [2023-03-06 14:59:04,196][04272] Updated weights for policy 0, policy_version 23320 (0.0006) [2023-03-06 14:59:05,024][04272] Updated weights for policy 0, policy_version 23330 (0.0006) [2023-03-06 14:59:05,818][04272] Updated weights for policy 0, policy_version 23340 (0.0007) [2023-03-06 14:59:06,622][04272] Updated weights for policy 0, policy_version 23350 (0.0006) [2023-03-06 14:59:07,453][04272] Updated weights for policy 0, policy_version 23360 (0.0006) [2023-03-06 14:59:08,251][04272] Updated weights for policy 0, policy_version 23370 (0.0006) [2023-03-06 14:59:08,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12596.9). Total num frames: 23939072. Throughput: 0: 12609.4. Samples: 23906021. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:59:08,952][03942] Avg episode reward: [(0, '1119.756')] [2023-03-06 14:59:08,955][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000023378_23939072.pth... 
[2023-03-06 14:59:08,986][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000020427_20917248.pth [2023-03-06 14:59:09,063][04272] Updated weights for policy 0, policy_version 23380 (0.0008) [2023-03-06 14:59:09,876][04272] Updated weights for policy 0, policy_version 23390 (0.0007) [2023-03-06 14:59:10,685][04272] Updated weights for policy 0, policy_version 23400 (0.0006) [2023-03-06 14:59:11,507][04272] Updated weights for policy 0, policy_version 23410 (0.0006) [2023-03-06 14:59:12,314][04272] Updated weights for policy 0, policy_version 23420 (0.0006) [2023-03-06 14:59:13,117][04272] Updated weights for policy 0, policy_version 23430 (0.0006) [2023-03-06 14:59:13,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12596.9). Total num frames: 24001536. Throughput: 0: 12603.6. Samples: 23981536. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:59:13,941][03942] Avg episode reward: [(0, '1329.369')] [2023-03-06 14:59:13,946][04272] Updated weights for policy 0, policy_version 23440 (0.0006) [2023-03-06 14:59:14,750][04272] Updated weights for policy 0, policy_version 23450 (0.0006) [2023-03-06 14:59:15,555][04272] Updated weights for policy 0, policy_version 23460 (0.0006) [2023-03-06 14:59:16,371][04272] Updated weights for policy 0, policy_version 23470 (0.0006) [2023-03-06 14:59:17,197][04272] Updated weights for policy 0, policy_version 23480 (0.0006) [2023-03-06 14:59:18,011][04272] Updated weights for policy 0, policy_version 23490 (0.0006) [2023-03-06 14:59:18,823][04272] Updated weights for policy 0, policy_version 23500 (0.0006) [2023-03-06 14:59:18,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12596.9). Total num frames: 24065024. Throughput: 0: 12605.2. Samples: 24057151. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 14:59:18,941][03942] Avg episode reward: [(0, '1252.748')] [2023-03-06 14:59:19,656][04272] Updated weights for policy 0, policy_version 23510 (0.0006) [2023-03-06 14:59:20,475][04272] Updated weights for policy 0, policy_version 23520 (0.0006) [2023-03-06 14:59:21,291][04272] Updated weights for policy 0, policy_version 23530 (0.0006) [2023-03-06 14:59:22,095][04272] Updated weights for policy 0, policy_version 23540 (0.0006) [2023-03-06 14:59:22,917][04272] Updated weights for policy 0, policy_version 23550 (0.0006) [2023-03-06 14:59:23,733][04272] Updated weights for policy 0, policy_version 23560 (0.0006) [2023-03-06 14:59:23,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12596.9). Total num frames: 24127488. Throughput: 0: 12604.6. Samples: 24094829. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:59:23,941][03942] Avg episode reward: [(0, '1370.154')] [2023-03-06 14:59:24,538][04272] Updated weights for policy 0, policy_version 23570 (0.0006) [2023-03-06 14:59:25,360][04272] Updated weights for policy 0, policy_version 23580 (0.0007) [2023-03-06 14:59:26,170][04272] Updated weights for policy 0, policy_version 23590 (0.0006) [2023-03-06 14:59:26,975][04272] Updated weights for policy 0, policy_version 23600 (0.0006) [2023-03-06 14:59:27,786][04272] Updated weights for policy 0, policy_version 23610 (0.0006) [2023-03-06 14:59:28,602][04272] Updated weights for policy 0, policy_version 23620 (0.0006) [2023-03-06 14:59:28,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12596.9). Total num frames: 24190976. Throughput: 0: 12607.7. Samples: 24170292. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:59:28,941][03942] Avg episode reward: [(0, '1289.419')] [2023-03-06 14:59:29,405][04272] Updated weights for policy 0, policy_version 23630 (0.0006) [2023-03-06 14:59:30,229][04272] Updated weights for policy 0, policy_version 23640 (0.0006) [2023-03-06 14:59:31,036][04272] Updated weights for policy 0, policy_version 23650 (0.0006) [2023-03-06 14:59:31,849][04272] Updated weights for policy 0, policy_version 23660 (0.0006) [2023-03-06 14:59:32,669][04272] Updated weights for policy 0, policy_version 23670 (0.0007) [2023-03-06 14:59:33,471][04272] Updated weights for policy 0, policy_version 23680 (0.0008) [2023-03-06 14:59:33,941][03942] Fps is (10 sec: 12595.0, 60 sec: 12595.2, 300 sec: 12593.5). Total num frames: 24253440. Throughput: 0: 12604.2. Samples: 24245907. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:59:33,941][03942] Avg episode reward: [(0, '1245.680')] [2023-03-06 14:59:34,288][04272] Updated weights for policy 0, policy_version 23690 (0.0007) [2023-03-06 14:59:35,099][04272] Updated weights for policy 0, policy_version 23700 (0.0007) [2023-03-06 14:59:35,894][04272] Updated weights for policy 0, policy_version 23710 (0.0006) [2023-03-06 14:59:36,709][04272] Updated weights for policy 0, policy_version 23720 (0.0008) [2023-03-06 14:59:37,511][04272] Updated weights for policy 0, policy_version 23730 (0.0007) [2023-03-06 14:59:38,321][04272] Updated weights for policy 0, policy_version 23740 (0.0006) [2023-03-06 14:59:38,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12596.9). Total num frames: 24316928. Throughput: 0: 12609.1. Samples: 24284052. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:59:38,941][03942] Avg episode reward: [(0, '1271.892')] [2023-03-06 14:59:39,121][04272] Updated weights for policy 0, policy_version 23750 (0.0006) [2023-03-06 14:59:39,924][04272] Updated weights for policy 0, policy_version 23760 (0.0006) [2023-03-06 14:59:40,761][04272] Updated weights for policy 0, policy_version 23770 (0.0006) [2023-03-06 14:59:41,561][04272] Updated weights for policy 0, policy_version 23780 (0.0006) [2023-03-06 14:59:42,371][04272] Updated weights for policy 0, policy_version 23790 (0.0006) [2023-03-06 14:59:43,190][04272] Updated weights for policy 0, policy_version 23800 (0.0006) [2023-03-06 14:59:43,941][03942] Fps is (10 sec: 12697.7, 60 sec: 12612.2, 300 sec: 12596.9). Total num frames: 24380416. Throughput: 0: 12610.1. Samples: 24359782. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:59:43,941][03942] Avg episode reward: [(0, '1423.648')] [2023-03-06 14:59:43,997][04272] Updated weights for policy 0, policy_version 23810 (0.0006) [2023-03-06 14:59:44,807][04272] Updated weights for policy 0, policy_version 23820 (0.0007) [2023-03-06 14:59:45,618][04272] Updated weights for policy 0, policy_version 23830 (0.0006) [2023-03-06 14:59:46,423][04272] Updated weights for policy 0, policy_version 23840 (0.0006) [2023-03-06 14:59:47,233][04272] Updated weights for policy 0, policy_version 23850 (0.0006) [2023-03-06 14:59:48,050][04272] Updated weights for policy 0, policy_version 23860 (0.0005) [2023-03-06 14:59:48,845][04272] Updated weights for policy 0, policy_version 23870 (0.0006) [2023-03-06 14:59:48,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12600.4). Total num frames: 24443904. Throughput: 0: 12612.3. Samples: 24435609. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 14:59:48,941][03942] Avg episode reward: [(0, '1484.361')] [2023-03-06 14:59:48,945][04221] Saving new best policy, reward=1484.361! 
[2023-03-06 14:59:49,654][04272] Updated weights for policy 0, policy_version 23880 (0.0006) [2023-03-06 14:59:50,490][04272] Updated weights for policy 0, policy_version 23890 (0.0007) [2023-03-06 14:59:51,298][04272] Updated weights for policy 0, policy_version 23900 (0.0008) [2023-03-06 14:59:52,109][04272] Updated weights for policy 0, policy_version 23910 (0.0006) [2023-03-06 14:59:52,935][04272] Updated weights for policy 0, policy_version 23920 (0.0007) [2023-03-06 14:59:53,744][04272] Updated weights for policy 0, policy_version 23930 (0.0006) [2023-03-06 14:59:53,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12596.9). Total num frames: 24506368. Throughput: 0: 12608.7. Samples: 24473415. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:59:53,941][03942] Avg episode reward: [(0, '1461.508')] [2023-03-06 14:59:54,549][04272] Updated weights for policy 0, policy_version 23940 (0.0007) [2023-03-06 14:59:55,381][04272] Updated weights for policy 0, policy_version 23950 (0.0007) [2023-03-06 14:59:56,200][04272] Updated weights for policy 0, policy_version 23960 (0.0007) [2023-03-06 14:59:57,027][04272] Updated weights for policy 0, policy_version 23970 (0.0006) [2023-03-06 14:59:57,838][04272] Updated weights for policy 0, policy_version 23980 (0.0006) [2023-03-06 14:59:58,638][04272] Updated weights for policy 0, policy_version 23990 (0.0006) [2023-03-06 14:59:58,941][03942] Fps is (10 sec: 12492.7, 60 sec: 12595.2, 300 sec: 12593.5). Total num frames: 24568832. Throughput: 0: 12600.3. Samples: 24548550. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 14:59:58,941][03942] Avg episode reward: [(0, '1428.557')] [2023-03-06 14:59:59,447][04272] Updated weights for policy 0, policy_version 24000 (0.0007) [2023-03-06 15:00:00,261][04272] Updated weights for policy 0, policy_version 24010 (0.0006) [2023-03-06 15:00:01,069][04272] Updated weights for policy 0, policy_version 24020 (0.0006) [2023-03-06 15:00:01,874][04272] Updated weights for policy 0, policy_version 24030 (0.0007) [2023-03-06 15:00:02,681][04272] Updated weights for policy 0, policy_version 24040 (0.0006) [2023-03-06 15:00:03,504][04272] Updated weights for policy 0, policy_version 24050 (0.0006) [2023-03-06 15:00:03,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12596.9). Total num frames: 24632320. Throughput: 0: 12604.1. Samples: 24624335. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:00:03,941][03942] Avg episode reward: [(0, '1582.871')] [2023-03-06 15:00:03,942][04221] Saving new best policy, reward=1582.871! [2023-03-06 15:00:04,325][04272] Updated weights for policy 0, policy_version 24060 (0.0006) [2023-03-06 15:00:05,148][04272] Updated weights for policy 0, policy_version 24070 (0.0006) [2023-03-06 15:00:05,957][04272] Updated weights for policy 0, policy_version 24080 (0.0006) [2023-03-06 15:00:06,756][04272] Updated weights for policy 0, policy_version 24090 (0.0006) [2023-03-06 15:00:07,572][04272] Updated weights for policy 0, policy_version 24100 (0.0006) [2023-03-06 15:00:08,385][04272] Updated weights for policy 0, policy_version 24110 (0.0007) [2023-03-06 15:00:08,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12593.5). Total num frames: 24694784. Throughput: 0: 12603.8. Samples: 24661999. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:00:08,941][03942] Avg episode reward: [(0, '1511.694')] [2023-03-06 15:00:09,183][04272] Updated weights for policy 0, policy_version 24120 (0.0006) [2023-03-06 15:00:10,023][04272] Updated weights for policy 0, policy_version 24130 (0.0006) [2023-03-06 15:00:10,830][04272] Updated weights for policy 0, policy_version 24140 (0.0007) [2023-03-06 15:00:11,632][04272] Updated weights for policy 0, policy_version 24150 (0.0006) [2023-03-06 15:00:12,452][04272] Updated weights for policy 0, policy_version 24160 (0.0007) [2023-03-06 15:00:13,280][04272] Updated weights for policy 0, policy_version 24170 (0.0006) [2023-03-06 15:00:13,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12596.9). Total num frames: 24758272. Throughput: 0: 12607.9. Samples: 24737649. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:00:13,941][03942] Avg episode reward: [(0, '1344.520')] [2023-03-06 15:00:14,085][04272] Updated weights for policy 0, policy_version 24180 (0.0006) [2023-03-06 15:00:14,895][04272] Updated weights for policy 0, policy_version 24190 (0.0006) [2023-03-06 15:00:15,692][04272] Updated weights for policy 0, policy_version 24200 (0.0006) [2023-03-06 15:00:16,501][04272] Updated weights for policy 0, policy_version 24210 (0.0006) [2023-03-06 15:00:17,310][04272] Updated weights for policy 0, policy_version 24220 (0.0007) [2023-03-06 15:00:18,120][04272] Updated weights for policy 0, policy_version 24230 (0.0006) [2023-03-06 15:00:18,924][04272] Updated weights for policy 0, policy_version 24240 (0.0006) [2023-03-06 15:00:18,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12612.2, 300 sec: 12600.4). Total num frames: 24821760. Throughput: 0: 12613.3. Samples: 24813504. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:00:18,941][03942] Avg episode reward: [(0, '1336.223')] [2023-03-06 15:00:19,725][04272] Updated weights for policy 0, policy_version 24250 (0.0006) [2023-03-06 15:00:20,555][04272] Updated weights for policy 0, policy_version 24260 (0.0006) [2023-03-06 15:00:21,378][04272] Updated weights for policy 0, policy_version 24270 (0.0007) [2023-03-06 15:00:22,184][04272] Updated weights for policy 0, policy_version 24280 (0.0007) [2023-03-06 15:00:22,997][04272] Updated weights for policy 0, policy_version 24290 (0.0007) [2023-03-06 15:00:23,812][04272] Updated weights for policy 0, policy_version 24300 (0.0006) [2023-03-06 15:00:23,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.2, 300 sec: 12596.9). Total num frames: 24884224. Throughput: 0: 12603.5. Samples: 24851210. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:00:23,941][03942] Avg episode reward: [(0, '1284.679')] [2023-03-06 15:00:24,611][04272] Updated weights for policy 0, policy_version 24310 (0.0006) [2023-03-06 15:00:25,432][04272] Updated weights for policy 0, policy_version 24320 (0.0006) [2023-03-06 15:00:26,261][04272] Updated weights for policy 0, policy_version 24330 (0.0006) [2023-03-06 15:00:27,062][04272] Updated weights for policy 0, policy_version 24340 (0.0006) [2023-03-06 15:00:27,871][04272] Updated weights for policy 0, policy_version 24350 (0.0006) [2023-03-06 15:00:28,705][04272] Updated weights for policy 0, policy_version 24360 (0.0006) [2023-03-06 15:00:28,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12600.4). Total num frames: 24947712. Throughput: 0: 12603.4. Samples: 24926935. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:00:28,941][03942] Avg episode reward: [(0, '1427.744')] [2023-03-06 15:00:29,515][04272] Updated weights for policy 0, policy_version 24370 (0.0006) [2023-03-06 15:00:30,339][04272] Updated weights for policy 0, policy_version 24380 (0.0007) [2023-03-06 15:00:31,154][04272] Updated weights for policy 0, policy_version 24390 (0.0006) [2023-03-06 15:00:31,957][04272] Updated weights for policy 0, policy_version 24400 (0.0006) [2023-03-06 15:00:32,766][04272] Updated weights for policy 0, policy_version 24410 (0.0006) [2023-03-06 15:00:33,586][04272] Updated weights for policy 0, policy_version 24420 (0.0006) [2023-03-06 15:00:33,940][03942] Fps is (10 sec: 12595.4, 60 sec: 12612.3, 300 sec: 12596.9). Total num frames: 25010176. Throughput: 0: 12594.5. Samples: 25002362. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:00:33,941][03942] Avg episode reward: [(0, '1372.133')] [2023-03-06 15:00:34,398][04272] Updated weights for policy 0, policy_version 24430 (0.0006) [2023-03-06 15:00:35,220][04272] Updated weights for policy 0, policy_version 24440 (0.0007) [2023-03-06 15:00:36,033][04272] Updated weights for policy 0, policy_version 24450 (0.0006) [2023-03-06 15:00:36,847][04272] Updated weights for policy 0, policy_version 24460 (0.0006) [2023-03-06 15:00:37,656][04272] Updated weights for policy 0, policy_version 24470 (0.0006) [2023-03-06 15:00:38,472][04272] Updated weights for policy 0, policy_version 24480 (0.0006) [2023-03-06 15:00:38,941][03942] Fps is (10 sec: 12492.9, 60 sec: 12595.2, 300 sec: 12596.9). Total num frames: 25072640. Throughput: 0: 12589.6. Samples: 25039949. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:00:38,941][03942] Avg episode reward: [(0, '1451.996')] [2023-03-06 15:00:39,285][04272] Updated weights for policy 0, policy_version 24490 (0.0006) [2023-03-06 15:00:40,099][04272] Updated weights for policy 0, policy_version 24500 (0.0006) [2023-03-06 15:00:40,897][04272] Updated weights for policy 0, policy_version 24510 (0.0006) [2023-03-06 15:00:41,726][04272] Updated weights for policy 0, policy_version 24520 (0.0006) [2023-03-06 15:00:42,534][04272] Updated weights for policy 0, policy_version 24530 (0.0006) [2023-03-06 15:00:43,340][04272] Updated weights for policy 0, policy_version 24540 (0.0006) [2023-03-06 15:00:43,941][03942] Fps is (10 sec: 12595.0, 60 sec: 12595.2, 300 sec: 12600.4). Total num frames: 25136128. Throughput: 0: 12599.5. Samples: 25115530. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 15:00:43,941][03942] Avg episode reward: [(0, '1510.864')] [2023-03-06 15:00:44,153][04272] Updated weights for policy 0, policy_version 24550 (0.0006) [2023-03-06 15:00:44,977][04272] Updated weights for policy 0, policy_version 24560 (0.0006) [2023-03-06 15:00:45,773][04272] Updated weights for policy 0, policy_version 24570 (0.0006) [2023-03-06 15:00:46,596][04272] Updated weights for policy 0, policy_version 24580 (0.0007) [2023-03-06 15:00:47,405][04272] Updated weights for policy 0, policy_version 24590 (0.0006) [2023-03-06 15:00:48,225][04272] Updated weights for policy 0, policy_version 24600 (0.0007) [2023-03-06 15:00:48,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12578.1, 300 sec: 12596.9). Total num frames: 25198592. Throughput: 0: 12594.2. Samples: 25191073. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 15:00:48,941][03942] Avg episode reward: [(0, '1359.513')] [2023-03-06 15:00:49,046][04272] Updated weights for policy 0, policy_version 24610 (0.0006) [2023-03-06 15:00:49,844][04272] Updated weights for policy 0, policy_version 24620 (0.0006) [2023-03-06 15:00:50,653][04272] Updated weights for policy 0, policy_version 24630 (0.0006) [2023-03-06 15:00:51,482][04272] Updated weights for policy 0, policy_version 24640 (0.0006) [2023-03-06 15:00:52,280][04272] Updated weights for policy 0, policy_version 24650 (0.0006) [2023-03-06 15:00:53,118][04272] Updated weights for policy 0, policy_version 24660 (0.0006) [2023-03-06 15:00:53,937][04272] Updated weights for policy 0, policy_version 24670 (0.0006) [2023-03-06 15:00:53,940][03942] Fps is (10 sec: 12595.4, 60 sec: 12595.2, 300 sec: 12596.9). Total num frames: 25262080. Throughput: 0: 12596.6. Samples: 25228846. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 15:00:53,941][03942] Avg episode reward: [(0, '1357.843')] [2023-03-06 15:00:54,738][04272] Updated weights for policy 0, policy_version 24680 (0.0006) [2023-03-06 15:00:55,565][04272] Updated weights for policy 0, policy_version 24690 (0.0006) [2023-03-06 15:00:56,369][04272] Updated weights for policy 0, policy_version 24700 (0.0006) [2023-03-06 15:00:57,165][04272] Updated weights for policy 0, policy_version 24710 (0.0006) [2023-03-06 15:00:57,990][04272] Updated weights for policy 0, policy_version 24720 (0.0006) [2023-03-06 15:00:58,808][04272] Updated weights for policy 0, policy_version 24730 (0.0007) [2023-03-06 15:00:58,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12593.5). Total num frames: 25324544. Throughput: 0: 12591.7. Samples: 25304277. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:00:58,941][03942] Avg episode reward: [(0, '1341.730')] [2023-03-06 15:00:59,617][04272] Updated weights for policy 0, policy_version 24740 (0.0006) [2023-03-06 15:01:00,437][04272] Updated weights for policy 0, policy_version 24750 (0.0007) [2023-03-06 15:01:01,254][04272] Updated weights for policy 0, policy_version 24760 (0.0007) [2023-03-06 15:01:02,064][04272] Updated weights for policy 0, policy_version 24770 (0.0007) [2023-03-06 15:01:02,882][04272] Updated weights for policy 0, policy_version 24780 (0.0006) [2023-03-06 15:01:03,705][04272] Updated weights for policy 0, policy_version 24790 (0.0006) [2023-03-06 15:01:03,941][03942] Fps is (10 sec: 12492.7, 60 sec: 12578.1, 300 sec: 12593.5). Total num frames: 25387008. Throughput: 0: 12582.6. Samples: 25379720. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:01:03,952][03942] Avg episode reward: [(0, '1403.626')] [2023-03-06 15:01:04,506][04272] Updated weights for policy 0, policy_version 24800 (0.0006) [2023-03-06 15:01:05,320][04272] Updated weights for policy 0, policy_version 24810 (0.0007) [2023-03-06 15:01:06,138][04272] Updated weights for policy 0, policy_version 24820 (0.0007) [2023-03-06 15:01:06,954][04272] Updated weights for policy 0, policy_version 24830 (0.0007) [2023-03-06 15:01:07,757][04272] Updated weights for policy 0, policy_version 24840 (0.0007) [2023-03-06 15:01:08,574][04272] Updated weights for policy 0, policy_version 24850 (0.0006) [2023-03-06 15:01:08,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12593.5). Total num frames: 25450496. Throughput: 0: 12580.6. Samples: 25417337. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:01:08,950][03942] Avg episode reward: [(0, '1378.679')] [2023-03-06 15:01:08,954][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000024854_25450496.pth... 
[2023-03-06 15:01:08,984][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000021902_22427648.pth [2023-03-06 15:01:09,399][04272] Updated weights for policy 0, policy_version 24860 (0.0006) [2023-03-06 15:01:10,202][04272] Updated weights for policy 0, policy_version 24870 (0.0006) [2023-03-06 15:01:11,017][04272] Updated weights for policy 0, policy_version 24880 (0.0006) [2023-03-06 15:01:11,819][04272] Updated weights for policy 0, policy_version 24890 (0.0006) [2023-03-06 15:01:12,607][04272] Updated weights for policy 0, policy_version 24900 (0.0006) [2023-03-06 15:01:13,447][04272] Updated weights for policy 0, policy_version 24910 (0.0007) [2023-03-06 15:01:13,759][04221] KL-divergence is very high: 785.8881 [2023-03-06 15:01:13,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12595.2, 300 sec: 12596.9). Total num frames: 25513984. Throughput: 0: 12580.0. Samples: 25493033. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:01:13,951][03942] Avg episode reward: [(0, '1261.041')] [2023-03-06 15:01:14,253][04272] Updated weights for policy 0, policy_version 24920 (0.0006) [2023-03-06 15:01:15,065][04272] Updated weights for policy 0, policy_version 24930 (0.0006) [2023-03-06 15:01:15,873][04272] Updated weights for policy 0, policy_version 24940 (0.0006) [2023-03-06 15:01:16,702][04272] Updated weights for policy 0, policy_version 24950 (0.0006) [2023-03-06 15:01:17,504][04272] Updated weights for policy 0, policy_version 24960 (0.0006) [2023-03-06 15:01:18,321][04272] Updated weights for policy 0, policy_version 24970 (0.0006) [2023-03-06 15:01:18,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12578.1, 300 sec: 12593.5). Total num frames: 25576448. Throughput: 0: 12586.9. Samples: 25568773. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:01:18,952][03942] Avg episode reward: [(0, '1405.537')] [2023-03-06 15:01:19,125][04272] Updated weights for policy 0, policy_version 24980 (0.0007) [2023-03-06 15:01:19,927][04272] Updated weights for policy 0, policy_version 24990 (0.0006) [2023-03-06 15:01:20,724][04272] Updated weights for policy 0, policy_version 25000 (0.0007) [2023-03-06 15:01:21,535][04272] Updated weights for policy 0, policy_version 25010 (0.0006) [2023-03-06 15:01:22,333][04272] Updated weights for policy 0, policy_version 25020 (0.0006) [2023-03-06 15:01:23,150][04272] Updated weights for policy 0, policy_version 25030 (0.0006) [2023-03-06 15:01:23,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12596.9). Total num frames: 25639936. Throughput: 0: 12597.4. Samples: 25606831. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:01:23,941][03942] Avg episode reward: [(0, '1299.643')] [2023-03-06 15:01:23,967][04272] Updated weights for policy 0, policy_version 25040 (0.0007) [2023-03-06 15:01:24,775][04272] Updated weights for policy 0, policy_version 25050 (0.0006) [2023-03-06 15:01:25,583][04272] Updated weights for policy 0, policy_version 25060 (0.0006) [2023-03-06 15:01:26,409][04272] Updated weights for policy 0, policy_version 25070 (0.0007) [2023-03-06 15:01:27,213][04272] Updated weights for policy 0, policy_version 25080 (0.0007) [2023-03-06 15:01:28,014][04272] Updated weights for policy 0, policy_version 25090 (0.0007) [2023-03-06 15:01:28,845][04272] Updated weights for policy 0, policy_version 25100 (0.0006) [2023-03-06 15:01:28,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12595.2, 300 sec: 12596.9). Total num frames: 25703424. Throughput: 0: 12607.9. Samples: 25682884. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:01:28,941][03942] Avg episode reward: [(0, '1242.901')] [2023-03-06 15:01:29,645][04272] Updated weights for policy 0, policy_version 25110 (0.0006) [2023-03-06 15:01:30,454][04272] Updated weights for policy 0, policy_version 25120 (0.0006) [2023-03-06 15:01:31,256][04272] Updated weights for policy 0, policy_version 25130 (0.0006) [2023-03-06 15:01:32,089][04272] Updated weights for policy 0, policy_version 25140 (0.0007) [2023-03-06 15:01:32,908][04272] Updated weights for policy 0, policy_version 25150 (0.0006) [2023-03-06 15:01:33,730][04272] Updated weights for policy 0, policy_version 25160 (0.0007) [2023-03-06 15:01:33,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12596.9). Total num frames: 25765888. Throughput: 0: 12604.4. Samples: 25758272. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:01:33,941][03942] Avg episode reward: [(0, '1379.268')] [2023-03-06 15:01:34,535][04272] Updated weights for policy 0, policy_version 25170 (0.0006) [2023-03-06 15:01:35,354][04272] Updated weights for policy 0, policy_version 25180 (0.0006) [2023-03-06 15:01:36,180][04272] Updated weights for policy 0, policy_version 25190 (0.0006) [2023-03-06 15:01:36,990][04272] Updated weights for policy 0, policy_version 25200 (0.0006) [2023-03-06 15:01:37,797][04272] Updated weights for policy 0, policy_version 25210 (0.0006) [2023-03-06 15:01:38,598][04272] Updated weights for policy 0, policy_version 25220 (0.0007) [2023-03-06 15:01:38,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12596.9). Total num frames: 25829376. Throughput: 0: 12600.1. Samples: 25795853. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:01:38,941][03942] Avg episode reward: [(0, '1313.183')] [2023-03-06 15:01:39,408][04272] Updated weights for policy 0, policy_version 25230 (0.0007) [2023-03-06 15:01:40,214][04272] Updated weights for policy 0, policy_version 25240 (0.0007) [2023-03-06 15:01:41,034][04272] Updated weights for policy 0, policy_version 25250 (0.0006) [2023-03-06 15:01:41,837][04272] Updated weights for policy 0, policy_version 25260 (0.0007) [2023-03-06 15:01:42,666][04272] Updated weights for policy 0, policy_version 25270 (0.0007) [2023-03-06 15:01:43,496][04272] Updated weights for policy 0, policy_version 25280 (0.0007) [2023-03-06 15:01:43,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12596.9). Total num frames: 25891840. Throughput: 0: 12606.9. Samples: 25871587. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:01:43,941][03942] Avg episode reward: [(0, '1170.442')] [2023-03-06 15:01:44,298][04272] Updated weights for policy 0, policy_version 25290 (0.0007) [2023-03-06 15:01:45,111][04272] Updated weights for policy 0, policy_version 25300 (0.0006) [2023-03-06 15:01:45,910][04272] Updated weights for policy 0, policy_version 25310 (0.0006) [2023-03-06 15:01:46,724][04272] Updated weights for policy 0, policy_version 25320 (0.0007) [2023-03-06 15:01:47,536][04272] Updated weights for policy 0, policy_version 25330 (0.0006) [2023-03-06 15:01:48,338][04272] Updated weights for policy 0, policy_version 25340 (0.0006) [2023-03-06 15:01:48,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.2, 300 sec: 12596.9). Total num frames: 25955328. Throughput: 0: 12613.1. Samples: 25947310. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:01:48,941][03942] Avg episode reward: [(0, '1177.852')] [2023-03-06 15:01:49,168][04272] Updated weights for policy 0, policy_version 25350 (0.0007) [2023-03-06 15:01:49,969][04272] Updated weights for policy 0, policy_version 25360 (0.0007) [2023-03-06 15:01:50,801][04272] Updated weights for policy 0, policy_version 25370 (0.0007) [2023-03-06 15:01:51,611][04272] Updated weights for policy 0, policy_version 25380 (0.0007) [2023-03-06 15:01:52,405][04272] Updated weights for policy 0, policy_version 25390 (0.0006) [2023-03-06 15:01:53,218][04272] Updated weights for policy 0, policy_version 25400 (0.0007) [2023-03-06 15:01:53,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12596.9). Total num frames: 26017792. Throughput: 0: 12615.5. Samples: 25985035. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:01:53,941][03942] Avg episode reward: [(0, '1296.228')] [2023-03-06 15:01:54,034][04272] Updated weights for policy 0, policy_version 25410 (0.0006) [2023-03-06 15:01:54,829][04272] Updated weights for policy 0, policy_version 25420 (0.0006) [2023-03-06 15:01:55,647][04272] Updated weights for policy 0, policy_version 25430 (0.0006) [2023-03-06 15:01:56,450][04272] Updated weights for policy 0, policy_version 25440 (0.0006) [2023-03-06 15:01:57,262][04272] Updated weights for policy 0, policy_version 25450 (0.0006) [2023-03-06 15:01:58,075][04272] Updated weights for policy 0, policy_version 25460 (0.0007) [2023-03-06 15:01:58,891][04272] Updated weights for policy 0, policy_version 25470 (0.0006) [2023-03-06 15:01:58,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12596.9). Total num frames: 26081280. Throughput: 0: 12622.0. Samples: 26061021. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:01:58,941][03942] Avg episode reward: [(0, '1337.127')] [2023-03-06 15:01:59,685][04272] Updated weights for policy 0, policy_version 25480 (0.0006) [2023-03-06 15:02:00,488][04272] Updated weights for policy 0, policy_version 25490 (0.0006) [2023-03-06 15:02:01,302][04272] Updated weights for policy 0, policy_version 25500 (0.0006) [2023-03-06 15:02:02,138][04272] Updated weights for policy 0, policy_version 25510 (0.0006) [2023-03-06 15:02:02,952][04272] Updated weights for policy 0, policy_version 25520 (0.0006) [2023-03-06 15:02:03,759][04272] Updated weights for policy 0, policy_version 25530 (0.0007) [2023-03-06 15:02:03,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12600.4). Total num frames: 26144768. Throughput: 0: 12620.9. Samples: 26136712. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:02:03,941][03942] Avg episode reward: [(0, '1184.550')] [2023-03-06 15:02:04,574][04272] Updated weights for policy 0, policy_version 25540 (0.0006) [2023-03-06 15:02:05,394][04272] Updated weights for policy 0, policy_version 25550 (0.0006) [2023-03-06 15:02:06,221][04272] Updated weights for policy 0, policy_version 25560 (0.0006) [2023-03-06 15:02:07,015][04272] Updated weights for policy 0, policy_version 25570 (0.0006) [2023-03-06 15:02:07,834][04272] Updated weights for policy 0, policy_version 25580 (0.0007) [2023-03-06 15:02:08,645][04272] Updated weights for policy 0, policy_version 25590 (0.0006) [2023-03-06 15:02:08,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.2, 300 sec: 12600.4). Total num frames: 26207232. Throughput: 0: 12609.7. Samples: 26174269. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:02:08,941][03942] Avg episode reward: [(0, '1273.259')] [2023-03-06 15:02:09,446][04272] Updated weights for policy 0, policy_version 25600 (0.0007) [2023-03-06 15:02:10,263][04272] Updated weights for policy 0, policy_version 25610 (0.0006) [2023-03-06 15:02:11,075][04272] Updated weights for policy 0, policy_version 25620 (0.0006) [2023-03-06 15:02:11,878][04272] Updated weights for policy 0, policy_version 25630 (0.0006) [2023-03-06 15:02:12,699][04272] Updated weights for policy 0, policy_version 25640 (0.0007) [2023-03-06 15:02:13,526][04272] Updated weights for policy 0, policy_version 25650 (0.0006) [2023-03-06 15:02:13,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12600.4). Total num frames: 26270720. Throughput: 0: 12603.4. Samples: 26250038. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:02:13,941][03942] Avg episode reward: [(0, '1225.234')] [2023-03-06 15:02:14,340][04272] Updated weights for policy 0, policy_version 25660 (0.0006) [2023-03-06 15:02:15,158][04272] Updated weights for policy 0, policy_version 25670 (0.0007) [2023-03-06 15:02:15,966][04272] Updated weights for policy 0, policy_version 25680 (0.0007) [2023-03-06 15:02:16,785][04272] Updated weights for policy 0, policy_version 25690 (0.0006) [2023-03-06 15:02:17,599][04272] Updated weights for policy 0, policy_version 25700 (0.0006) [2023-03-06 15:02:18,402][04272] Updated weights for policy 0, policy_version 25710 (0.0006) [2023-03-06 15:02:18,940][03942] Fps is (10 sec: 12595.4, 60 sec: 12612.3, 300 sec: 12600.4). Total num frames: 26333184. Throughput: 0: 12603.4. Samples: 26325422. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:02:18,941][03942] Avg episode reward: [(0, '1174.361')] [2023-03-06 15:02:19,225][04272] Updated weights for policy 0, policy_version 25720 (0.0007) [2023-03-06 15:02:20,031][04272] Updated weights for policy 0, policy_version 25730 (0.0006) [2023-03-06 15:02:20,840][04272] Updated weights for policy 0, policy_version 25740 (0.0007) [2023-03-06 15:02:21,649][04272] Updated weights for policy 0, policy_version 25750 (0.0006) [2023-03-06 15:02:22,475][04272] Updated weights for policy 0, policy_version 25760 (0.0006) [2023-03-06 15:02:23,277][04272] Updated weights for policy 0, policy_version 25770 (0.0007) [2023-03-06 15:02:23,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12600.4). Total num frames: 26396672. Throughput: 0: 12609.6. Samples: 26363282. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:02:23,941][03942] Avg episode reward: [(0, '1201.121')] [2023-03-06 15:02:24,088][04272] Updated weights for policy 0, policy_version 25780 (0.0007) [2023-03-06 15:02:24,915][04272] Updated weights for policy 0, policy_version 25790 (0.0006) [2023-03-06 15:02:25,714][04272] Updated weights for policy 0, policy_version 25800 (0.0006) [2023-03-06 15:02:26,502][04272] Updated weights for policy 0, policy_version 25810 (0.0006) [2023-03-06 15:02:27,342][04272] Updated weights for policy 0, policy_version 25820 (0.0006) [2023-03-06 15:02:28,141][04272] Updated weights for policy 0, policy_version 25830 (0.0007) [2023-03-06 15:02:28,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12600.4). Total num frames: 26459136. Throughput: 0: 12609.3. Samples: 26439005. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:02:28,941][03942] Avg episode reward: [(0, '1125.461')] [2023-03-06 15:02:28,962][04272] Updated weights for policy 0, policy_version 25840 (0.0006) [2023-03-06 15:02:29,762][04272] Updated weights for policy 0, policy_version 25850 (0.0006) [2023-03-06 15:02:30,576][04272] Updated weights for policy 0, policy_version 25860 (0.0006) [2023-03-06 15:02:31,380][04272] Updated weights for policy 0, policy_version 25870 (0.0007) [2023-03-06 15:02:32,189][04272] Updated weights for policy 0, policy_version 25880 (0.0006) [2023-03-06 15:02:33,002][04272] Updated weights for policy 0, policy_version 25890 (0.0006) [2023-03-06 15:02:33,813][04272] Updated weights for policy 0, policy_version 25900 (0.0006) [2023-03-06 15:02:33,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12603.9). Total num frames: 26522624. Throughput: 0: 12611.1. Samples: 26514807. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:02:33,941][03942] Avg episode reward: [(0, '1183.968')] [2023-03-06 15:02:34,616][04272] Updated weights for policy 0, policy_version 25910 (0.0007) [2023-03-06 15:02:35,427][04272] Updated weights for policy 0, policy_version 25920 (0.0006) [2023-03-06 15:02:36,222][04272] Updated weights for policy 0, policy_version 25930 (0.0006) [2023-03-06 15:02:37,032][04272] Updated weights for policy 0, policy_version 25940 (0.0007) [2023-03-06 15:02:37,846][04272] Updated weights for policy 0, policy_version 25950 (0.0006) [2023-03-06 15:02:38,666][04272] Updated weights for policy 0, policy_version 25960 (0.0006) [2023-03-06 15:02:38,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12612.3, 300 sec: 12603.9). Total num frames: 26586112. Throughput: 0: 12619.0. Samples: 26552892. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:02:38,941][03942] Avg episode reward: [(0, '1110.593')] [2023-03-06 15:02:39,493][04272] Updated weights for policy 0, policy_version 25970 (0.0006) [2023-03-06 15:02:40,309][04272] Updated weights for policy 0, policy_version 25980 (0.0006) [2023-03-06 15:02:41,117][04272] Updated weights for policy 0, policy_version 25990 (0.0007) [2023-03-06 15:02:41,945][04272] Updated weights for policy 0, policy_version 26000 (0.0006) [2023-03-06 15:02:42,753][04272] Updated weights for policy 0, policy_version 26010 (0.0006) [2023-03-06 15:02:43,564][04272] Updated weights for policy 0, policy_version 26020 (0.0007) [2023-03-06 15:02:43,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12603.9). Total num frames: 26648576. Throughput: 0: 12604.9. Samples: 26628244. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:02:43,941][03942] Avg episode reward: [(0, '703.342')] [2023-03-06 15:02:44,370][04272] Updated weights for policy 0, policy_version 26030 (0.0006) [2023-03-06 15:02:45,175][04272] Updated weights for policy 0, policy_version 26040 (0.0007) [2023-03-06 15:02:45,984][04272] Updated weights for policy 0, policy_version 26050 (0.0007) [2023-03-06 15:02:46,798][04272] Updated weights for policy 0, policy_version 26060 (0.0006) [2023-03-06 15:02:47,590][04272] Updated weights for policy 0, policy_version 26070 (0.0006) [2023-03-06 15:02:48,410][04272] Updated weights for policy 0, policy_version 26080 (0.0006) [2023-03-06 15:02:48,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12603.9). Total num frames: 26712064. Throughput: 0: 12613.0. Samples: 26704297. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:02:48,941][03942] Avg episode reward: [(0, '302.778')] [2023-03-06 15:02:49,213][04272] Updated weights for policy 0, policy_version 26090 (0.0006) [2023-03-06 15:02:50,021][04272] Updated weights for policy 0, policy_version 26100 (0.0006) [2023-03-06 15:02:50,831][04272] Updated weights for policy 0, policy_version 26110 (0.0007) [2023-03-06 15:02:51,638][04272] Updated weights for policy 0, policy_version 26120 (0.0007) [2023-03-06 15:02:52,460][04272] Updated weights for policy 0, policy_version 26130 (0.0007) [2023-03-06 15:02:53,271][04272] Updated weights for policy 0, policy_version 26140 (0.0007) [2023-03-06 15:02:53,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12607.4). Total num frames: 26775552. Throughput: 0: 12624.7. Samples: 26742380. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:02:53,941][03942] Avg episode reward: [(0, '518.869')] [2023-03-06 15:02:54,077][04272] Updated weights for policy 0, policy_version 26150 (0.0006) [2023-03-06 15:02:54,892][04272] Updated weights for policy 0, policy_version 26160 (0.0007) [2023-03-06 15:02:55,702][04272] Updated weights for policy 0, policy_version 26170 (0.0006) [2023-03-06 15:02:56,515][04272] Updated weights for policy 0, policy_version 26180 (0.0006) [2023-03-06 15:02:57,329][04272] Updated weights for policy 0, policy_version 26190 (0.0006) [2023-03-06 15:02:58,137][04272] Updated weights for policy 0, policy_version 26200 (0.0006) [2023-03-06 15:02:58,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12603.9). Total num frames: 26838016. Throughput: 0: 12622.0. Samples: 26818025. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:02:58,941][03942] Avg episode reward: [(0, '900.281')] [2023-03-06 15:02:58,944][04272] Updated weights for policy 0, policy_version 26210 (0.0006) [2023-03-06 15:02:59,765][04272] Updated weights for policy 0, policy_version 26220 (0.0007) [2023-03-06 15:03:00,578][04272] Updated weights for policy 0, policy_version 26230 (0.0006) [2023-03-06 15:03:01,397][04272] Updated weights for policy 0, policy_version 26240 (0.0006) [2023-03-06 15:03:02,194][04272] Updated weights for policy 0, policy_version 26250 (0.0007) [2023-03-06 15:03:03,011][04272] Updated weights for policy 0, policy_version 26260 (0.0007) [2023-03-06 15:03:03,838][04272] Updated weights for policy 0, policy_version 26270 (0.0006) [2023-03-06 15:03:03,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12607.3). Total num frames: 26901504. Throughput: 0: 12624.8. Samples: 26893539. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:03:03,941][03942] Avg episode reward: [(0, '1170.693')] [2023-03-06 15:03:04,659][04272] Updated weights for policy 0, policy_version 26280 (0.0006) [2023-03-06 15:03:05,474][04272] Updated weights for policy 0, policy_version 26290 (0.0007) [2023-03-06 15:03:06,284][04272] Updated weights for policy 0, policy_version 26300 (0.0006) [2023-03-06 15:03:07,101][04272] Updated weights for policy 0, policy_version 26310 (0.0006) [2023-03-06 15:03:07,908][04272] Updated weights for policy 0, policy_version 26320 (0.0006) [2023-03-06 15:03:08,717][04272] Updated weights for policy 0, policy_version 26330 (0.0007) [2023-03-06 15:03:08,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12603.9). Total num frames: 26963968. Throughput: 0: 12621.7. Samples: 26931259. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:03:08,941][03942] Avg episode reward: [(0, '1166.678')] [2023-03-06 15:03:08,944][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000026332_26963968.pth... [2023-03-06 15:03:08,975][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000023378_23939072.pth [2023-03-06 15:03:09,531][04272] Updated weights for policy 0, policy_version 26340 (0.0006) [2023-03-06 15:03:10,335][04272] Updated weights for policy 0, policy_version 26350 (0.0006) [2023-03-06 15:03:11,146][04272] Updated weights for policy 0, policy_version 26360 (0.0006) [2023-03-06 15:03:11,964][04272] Updated weights for policy 0, policy_version 26370 (0.0007) [2023-03-06 15:03:12,782][04272] Updated weights for policy 0, policy_version 26380 (0.0007) [2023-03-06 15:03:13,598][04272] Updated weights for policy 0, policy_version 26390 (0.0005) [2023-03-06 15:03:13,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12607.3). Total num frames: 27027456. Throughput: 0: 12618.3. Samples: 27006830. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:03:13,941][03942] Avg episode reward: [(0, '1287.596')] [2023-03-06 15:03:14,402][04272] Updated weights for policy 0, policy_version 26400 (0.0007) [2023-03-06 15:03:15,219][04272] Updated weights for policy 0, policy_version 26410 (0.0007) [2023-03-06 15:03:16,006][04272] Updated weights for policy 0, policy_version 26420 (0.0005) [2023-03-06 15:03:16,829][04272] Updated weights for policy 0, policy_version 26430 (0.0007) [2023-03-06 15:03:17,626][04272] Updated weights for policy 0, policy_version 26440 (0.0007) [2023-03-06 15:03:18,440][04272] Updated weights for policy 0, policy_version 26450 (0.0007) [2023-03-06 15:03:18,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12629.3, 300 sec: 12607.3). Total num frames: 27090944. Throughput: 0: 12619.9. Samples: 27082705. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:03:18,941][03942] Avg episode reward: [(0, '1335.450')] [2023-03-06 15:03:19,260][04272] Updated weights for policy 0, policy_version 26460 (0.0007) [2023-03-06 15:03:20,074][04272] Updated weights for policy 0, policy_version 26470 (0.0006) [2023-03-06 15:03:20,883][04272] Updated weights for policy 0, policy_version 26480 (0.0006) [2023-03-06 15:03:21,699][04272] Updated weights for policy 0, policy_version 26490 (0.0006) [2023-03-06 15:03:22,531][04272] Updated weights for policy 0, policy_version 26500 (0.0007) [2023-03-06 15:03:23,344][04272] Updated weights for policy 0, policy_version 26510 (0.0007) [2023-03-06 15:03:23,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12607.4). Total num frames: 27153408. Throughput: 0: 12613.6. Samples: 27120500. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:03:23,941][03942] Avg episode reward: [(0, '1107.839')] [2023-03-06 15:03:24,151][04272] Updated weights for policy 0, policy_version 26520 (0.0006) [2023-03-06 15:03:24,973][04272] Updated weights for policy 0, policy_version 26530 (0.0007) [2023-03-06 15:03:25,775][04272] Updated weights for policy 0, policy_version 26540 (0.0006) [2023-03-06 15:03:26,577][04272] Updated weights for policy 0, policy_version 26550 (0.0006) [2023-03-06 15:03:27,394][04272] Updated weights for policy 0, policy_version 26560 (0.0006) [2023-03-06 15:03:28,202][04272] Updated weights for policy 0, policy_version 26570 (0.0007) [2023-03-06 15:03:28,941][03942] Fps is (10 sec: 12492.8, 60 sec: 12612.3, 300 sec: 12603.9). Total num frames: 27215872. Throughput: 0: 12618.3. Samples: 27196066. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:03:28,941][03942] Avg episode reward: [(0, '1287.554')] [2023-03-06 15:03:29,014][04272] Updated weights for policy 0, policy_version 26580 (0.0007) [2023-03-06 15:03:29,840][04272] Updated weights for policy 0, policy_version 26590 (0.0006) [2023-03-06 15:03:30,638][04272] Updated weights for policy 0, policy_version 26600 (0.0006) [2023-03-06 15:03:31,459][04272] Updated weights for policy 0, policy_version 26610 (0.0005) [2023-03-06 15:03:32,266][04272] Updated weights for policy 0, policy_version 26620 (0.0007) [2023-03-06 15:03:33,068][04272] Updated weights for policy 0, policy_version 26630 (0.0007) [2023-03-06 15:03:33,891][04272] Updated weights for policy 0, policy_version 26640 (0.0006) [2023-03-06 15:03:33,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12607.3). Total num frames: 27279360. Throughput: 0: 12606.8. Samples: 27271601. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:03:33,941][03942] Avg episode reward: [(0, '1342.747')] [2023-03-06 15:03:34,703][04272] Updated weights for policy 0, policy_version 26650 (0.0007) [2023-03-06 15:03:35,509][04272] Updated weights for policy 0, policy_version 26660 (0.0006) [2023-03-06 15:03:36,328][04272] Updated weights for policy 0, policy_version 26670 (0.0006) [2023-03-06 15:03:37,133][04272] Updated weights for policy 0, policy_version 26680 (0.0007) [2023-03-06 15:03:37,929][04272] Updated weights for policy 0, policy_version 26690 (0.0006) [2023-03-06 15:03:38,739][04272] Updated weights for policy 0, policy_version 26700 (0.0006) [2023-03-06 15:03:38,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12612.3, 300 sec: 12607.3). Total num frames: 27342848. Throughput: 0: 12601.5. Samples: 27309447. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:03:38,941][03942] Avg episode reward: [(0, '1365.051')] [2023-03-06 15:03:39,557][04272] Updated weights for policy 0, policy_version 26710 (0.0006) [2023-03-06 15:03:40,367][04272] Updated weights for policy 0, policy_version 26720 (0.0006) [2023-03-06 15:03:41,164][04272] Updated weights for policy 0, policy_version 26730 (0.0006) [2023-03-06 15:03:41,976][04272] Updated weights for policy 0, policy_version 26740 (0.0007) [2023-03-06 15:03:42,772][04272] Updated weights for policy 0, policy_version 26750 (0.0007) [2023-03-06 15:03:43,581][04272] Updated weights for policy 0, policy_version 26760 (0.0007) [2023-03-06 15:03:43,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12610.8). Total num frames: 27406336. Throughput: 0: 12611.1. Samples: 27385527. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:03:43,941][03942] Avg episode reward: [(0, '1397.095')] [2023-03-06 15:03:44,392][04272] Updated weights for policy 0, policy_version 26770 (0.0005) [2023-03-06 15:03:45,183][04272] Updated weights for policy 0, policy_version 26780 (0.0006) [2023-03-06 15:03:46,003][04272] Updated weights for policy 0, policy_version 26790 (0.0006) [2023-03-06 15:03:46,821][04272] Updated weights for policy 0, policy_version 26800 (0.0006) [2023-03-06 15:03:47,633][04272] Updated weights for policy 0, policy_version 26810 (0.0007) [2023-03-06 15:03:48,461][04272] Updated weights for policy 0, policy_version 26820 (0.0006) [2023-03-06 15:03:48,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12629.3, 300 sec: 12610.8). Total num frames: 27469824. Throughput: 0: 12624.5. Samples: 27461642. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:03:48,941][03942] Avg episode reward: [(0, '1363.806')] [2023-03-06 15:03:49,258][04272] Updated weights for policy 0, policy_version 26830 (0.0006) [2023-03-06 15:03:50,075][04272] Updated weights for policy 0, policy_version 26840 (0.0008) [2023-03-06 15:03:50,895][04272] Updated weights for policy 0, policy_version 26850 (0.0007) [2023-03-06 15:03:51,692][04272] Updated weights for policy 0, policy_version 26860 (0.0006) [2023-03-06 15:03:52,499][04272] Updated weights for policy 0, policy_version 26870 (0.0007) [2023-03-06 15:03:53,301][04272] Updated weights for policy 0, policy_version 26880 (0.0006) [2023-03-06 15:03:53,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12607.4). Total num frames: 27532288. Throughput: 0: 12629.8. Samples: 27499601. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:03:53,941][03942] Avg episode reward: [(0, '1335.481')] [2023-03-06 15:03:54,110][04272] Updated weights for policy 0, policy_version 26890 (0.0006) [2023-03-06 15:03:54,917][04272] Updated weights for policy 0, policy_version 26900 (0.0006) [2023-03-06 15:03:55,718][04272] Updated weights for policy 0, policy_version 26910 (0.0006) [2023-03-06 15:03:56,541][04272] Updated weights for policy 0, policy_version 26920 (0.0007) [2023-03-06 15:03:57,351][04272] Updated weights for policy 0, policy_version 26930 (0.0006) [2023-03-06 15:03:58,156][04272] Updated weights for policy 0, policy_version 26940 (0.0006) [2023-03-06 15:03:58,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12629.3, 300 sec: 12610.8). Total num frames: 27595776. Throughput: 0: 12633.7. Samples: 27575347. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:03:58,941][03942] Avg episode reward: [(0, '1443.489')] [2023-03-06 15:03:58,985][04272] Updated weights for policy 0, policy_version 26950 (0.0006) [2023-03-06 15:03:59,774][04272] Updated weights for policy 0, policy_version 26960 (0.0006) [2023-03-06 15:04:00,594][04272] Updated weights for policy 0, policy_version 26970 (0.0006) [2023-03-06 15:04:01,408][04272] Updated weights for policy 0, policy_version 26980 (0.0007) [2023-03-06 15:04:02,217][04272] Updated weights for policy 0, policy_version 26990 (0.0006) [2023-03-06 15:04:03,045][04272] Updated weights for policy 0, policy_version 27000 (0.0006) [2023-03-06 15:04:03,835][04272] Updated weights for policy 0, policy_version 27010 (0.0006) [2023-03-06 15:04:03,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12610.8). Total num frames: 27659264. Throughput: 0: 12632.9. Samples: 27651183. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:04:03,941][03942] Avg episode reward: [(0, '1228.371')] [2023-03-06 15:04:04,638][04272] Updated weights for policy 0, policy_version 27020 (0.0007) [2023-03-06 15:04:05,461][04272] Updated weights for policy 0, policy_version 27030 (0.0006) [2023-03-06 15:04:06,258][04272] Updated weights for policy 0, policy_version 27040 (0.0006) [2023-03-06 15:04:07,067][04272] Updated weights for policy 0, policy_version 27050 (0.0007) [2023-03-06 15:04:07,882][04272] Updated weights for policy 0, policy_version 27060 (0.0007) [2023-03-06 15:04:08,677][04272] Updated weights for policy 0, policy_version 27070 (0.0006) [2023-03-06 15:04:08,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12629.3, 300 sec: 12610.8). Total num frames: 27721728. Throughput: 0: 12636.6. Samples: 27689148. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:04:08,941][03942] Avg episode reward: [(0, '1310.864')] [2023-03-06 15:04:09,496][04272] Updated weights for policy 0, policy_version 27080 (0.0007) [2023-03-06 15:04:10,285][04272] Updated weights for policy 0, policy_version 27090 (0.0006) [2023-03-06 15:04:11,106][04272] Updated weights for policy 0, policy_version 27100 (0.0007) [2023-03-06 15:04:11,926][04272] Updated weights for policy 0, policy_version 27110 (0.0006) [2023-03-06 15:04:12,725][04272] Updated weights for policy 0, policy_version 27120 (0.0006) [2023-03-06 15:04:13,550][04272] Updated weights for policy 0, policy_version 27130 (0.0006) [2023-03-06 15:04:13,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12629.3, 300 sec: 12610.8). Total num frames: 27785216. Throughput: 0: 12644.8. Samples: 27765083. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:04:13,941][03942] Avg episode reward: [(0, '1283.185')] [2023-03-06 15:04:14,366][04272] Updated weights for policy 0, policy_version 27140 (0.0007) [2023-03-06 15:04:15,166][04272] Updated weights for policy 0, policy_version 27150 (0.0006) [2023-03-06 15:04:15,987][04272] Updated weights for policy 0, policy_version 27160 (0.0006) [2023-03-06 15:04:16,796][04272] Updated weights for policy 0, policy_version 27170 (0.0007) [2023-03-06 15:04:17,607][04272] Updated weights for policy 0, policy_version 27180 (0.0006) [2023-03-06 15:04:18,420][04272] Updated weights for policy 0, policy_version 27190 (0.0006) [2023-03-06 15:04:18,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12614.3). Total num frames: 27848704. Throughput: 0: 12647.3. Samples: 27840726. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:04:18,941][03942] Avg episode reward: [(0, '1392.705')] [2023-03-06 15:04:19,235][04272] Updated weights for policy 0, policy_version 27200 (0.0007) [2023-03-06 15:04:20,021][04272] Updated weights for policy 0, policy_version 27210 (0.0007) [2023-03-06 15:04:20,826][04272] Updated weights for policy 0, policy_version 27220 (0.0007) [2023-03-06 15:04:21,654][04272] Updated weights for policy 0, policy_version 27230 (0.0006) [2023-03-06 15:04:22,458][04272] Updated weights for policy 0, policy_version 27240 (0.0006) [2023-03-06 15:04:23,263][04272] Updated weights for policy 0, policy_version 27250 (0.0006) [2023-03-06 15:04:23,941][03942] Fps is (10 sec: 12697.7, 60 sec: 12646.4, 300 sec: 12614.3). Total num frames: 27912192. Throughput: 0: 12654.2. Samples: 27878886. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:04:23,941][03942] Avg episode reward: [(0, '1289.889')] [2023-03-06 15:04:24,082][04272] Updated weights for policy 0, policy_version 27260 (0.0007) [2023-03-06 15:04:24,874][04272] Updated weights for policy 0, policy_version 27270 (0.0006) [2023-03-06 15:04:25,663][04272] Updated weights for policy 0, policy_version 27280 (0.0006) [2023-03-06 15:04:26,479][04272] Updated weights for policy 0, policy_version 27290 (0.0007) [2023-03-06 15:04:27,281][04272] Updated weights for policy 0, policy_version 27300 (0.0006) [2023-03-06 15:04:28,109][04272] Updated weights for policy 0, policy_version 27310 (0.0006) [2023-03-06 15:04:28,914][04272] Updated weights for policy 0, policy_version 27320 (0.0007) [2023-03-06 15:04:28,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12663.5, 300 sec: 12617.8). Total num frames: 27975680. Throughput: 0: 12657.3. Samples: 27955104. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:04:28,941][03942] Avg episode reward: [(0, '1275.310')] [2023-03-06 15:04:29,710][04272] Updated weights for policy 0, policy_version 27330 (0.0006) [2023-03-06 15:04:30,527][04272] Updated weights for policy 0, policy_version 27340 (0.0007) [2023-03-06 15:04:31,343][04272] Updated weights for policy 0, policy_version 27350 (0.0006) [2023-03-06 15:04:32,148][04272] Updated weights for policy 0, policy_version 27360 (0.0007) [2023-03-06 15:04:32,957][04272] Updated weights for policy 0, policy_version 27370 (0.0007) [2023-03-06 15:04:33,756][04272] Updated weights for policy 0, policy_version 27380 (0.0007) [2023-03-06 15:04:33,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12663.5, 300 sec: 12617.8). Total num frames: 28039168. Throughput: 0: 12651.8. Samples: 28030972. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:04:33,941][03942] Avg episode reward: [(0, '1222.386')] [2023-03-06 15:04:34,561][04272] Updated weights for policy 0, policy_version 27390 (0.0006) [2023-03-06 15:04:35,388][04272] Updated weights for policy 0, policy_version 27400 (0.0007) [2023-03-06 15:04:36,191][04272] Updated weights for policy 0, policy_version 27410 (0.0006) [2023-03-06 15:04:37,005][04272] Updated weights for policy 0, policy_version 27420 (0.0006) [2023-03-06 15:04:37,815][04272] Updated weights for policy 0, policy_version 27430 (0.0007) [2023-03-06 15:04:38,613][04272] Updated weights for policy 0, policy_version 27440 (0.0006) [2023-03-06 15:04:38,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12663.5, 300 sec: 12617.8). Total num frames: 28102656. Throughput: 0: 12650.1. Samples: 28068858. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:04:38,941][03942] Avg episode reward: [(0, '1245.925')] [2023-03-06 15:04:39,438][04272] Updated weights for policy 0, policy_version 27450 (0.0006) [2023-03-06 15:04:40,241][04272] Updated weights for policy 0, policy_version 27460 (0.0007) [2023-03-06 15:04:41,058][04272] Updated weights for policy 0, policy_version 27470 (0.0007) [2023-03-06 15:04:41,869][04272] Updated weights for policy 0, policy_version 27480 (0.0006) [2023-03-06 15:04:42,679][04272] Updated weights for policy 0, policy_version 27490 (0.0006) [2023-03-06 15:04:43,497][04272] Updated weights for policy 0, policy_version 27500 (0.0006) [2023-03-06 15:04:43,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12646.4, 300 sec: 12614.3). Total num frames: 28165120. Throughput: 0: 12651.5. Samples: 28144666. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:04:43,941][03942] Avg episode reward: [(0, '1406.013')] [2023-03-06 15:04:44,305][04272] Updated weights for policy 0, policy_version 27510 (0.0006) [2023-03-06 15:04:45,123][04272] Updated weights for policy 0, policy_version 27520 (0.0006) [2023-03-06 15:04:45,945][04272] Updated weights for policy 0, policy_version 27530 (0.0006) [2023-03-06 15:04:46,742][04272] Updated weights for policy 0, policy_version 27540 (0.0006) [2023-03-06 15:04:47,561][04272] Updated weights for policy 0, policy_version 27550 (0.0006) [2023-03-06 15:04:48,382][04272] Updated weights for policy 0, policy_version 27560 (0.0006) [2023-03-06 15:04:48,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12646.4, 300 sec: 12617.8). Total num frames: 28228608. Throughput: 0: 12647.6. Samples: 28220323. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:04:48,941][03942] Avg episode reward: [(0, '1403.687')] [2023-03-06 15:04:49,177][04272] Updated weights for policy 0, policy_version 27570 (0.0006) [2023-03-06 15:04:50,014][04272] Updated weights for policy 0, policy_version 27580 (0.0006) [2023-03-06 15:04:50,813][04272] Updated weights for policy 0, policy_version 27590 (0.0007) [2023-03-06 15:04:51,625][04272] Updated weights for policy 0, policy_version 27600 (0.0006) [2023-03-06 15:04:52,451][04272] Updated weights for policy 0, policy_version 27610 (0.0007) [2023-03-06 15:04:53,257][04272] Updated weights for policy 0, policy_version 27620 (0.0006) [2023-03-06 15:04:53,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12646.4, 300 sec: 12617.8). Total num frames: 28291072. Throughput: 0: 12639.9. Samples: 28257944. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:04:53,941][03942] Avg episode reward: [(0, '1338.904')] [2023-03-06 15:04:54,079][04272] Updated weights for policy 0, policy_version 27630 (0.0006) [2023-03-06 15:04:54,886][04272] Updated weights for policy 0, policy_version 27640 (0.0006) [2023-03-06 15:04:55,685][04272] Updated weights for policy 0, policy_version 27650 (0.0006) [2023-03-06 15:04:56,490][04272] Updated weights for policy 0, policy_version 27660 (0.0006) [2023-03-06 15:04:57,310][04272] Updated weights for policy 0, policy_version 27670 (0.0007) [2023-03-06 15:04:58,124][04272] Updated weights for policy 0, policy_version 27680 (0.0006) [2023-03-06 15:04:58,924][04272] Updated weights for policy 0, policy_version 27690 (0.0007) [2023-03-06 15:04:58,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12646.4, 300 sec: 12617.8). Total num frames: 28354560. Throughput: 0: 12637.2. Samples: 28333758. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:04:58,941][03942] Avg episode reward: [(0, '1333.433')] [2023-03-06 15:04:59,739][04272] Updated weights for policy 0, policy_version 27700 (0.0006) [2023-03-06 15:05:00,551][04272] Updated weights for policy 0, policy_version 27710 (0.0006) [2023-03-06 15:05:01,349][04272] Updated weights for policy 0, policy_version 27720 (0.0006) [2023-03-06 15:05:02,174][04272] Updated weights for policy 0, policy_version 27730 (0.0006) [2023-03-06 15:05:02,973][04272] Updated weights for policy 0, policy_version 27740 (0.0006) [2023-03-06 15:05:03,798][04272] Updated weights for policy 0, policy_version 27750 (0.0006) [2023-03-06 15:05:03,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12629.3, 300 sec: 12617.8). Total num frames: 28417024. Throughput: 0: 12643.0. Samples: 28409660. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:05:03,941][03942] Avg episode reward: [(0, '1333.649')] [2023-03-06 15:05:04,613][04272] Updated weights for policy 0, policy_version 27760 (0.0006) [2023-03-06 15:05:05,409][04272] Updated weights for policy 0, policy_version 27770 (0.0006) [2023-03-06 15:05:06,225][04272] Updated weights for policy 0, policy_version 27780 (0.0006) [2023-03-06 15:05:07,041][04272] Updated weights for policy 0, policy_version 27790 (0.0007) [2023-03-06 15:05:07,849][04272] Updated weights for policy 0, policy_version 27800 (0.0006) [2023-03-06 15:05:08,654][04272] Updated weights for policy 0, policy_version 27810 (0.0006) [2023-03-06 15:05:08,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12646.4, 300 sec: 12617.8). Total num frames: 28480512. Throughput: 0: 12636.7. Samples: 28447540. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:05:08,941][03942] Avg episode reward: [(0, '1344.812')] [2023-03-06 15:05:08,944][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000027813_28480512.pth... 
[2023-03-06 15:05:08,977][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000024854_25450496.pth [2023-03-06 15:05:09,478][04272] Updated weights for policy 0, policy_version 27820 (0.0007) [2023-03-06 15:05:10,283][04272] Updated weights for policy 0, policy_version 27830 (0.0007) [2023-03-06 15:05:11,104][04272] Updated weights for policy 0, policy_version 27840 (0.0008) [2023-03-06 15:05:11,913][04272] Updated weights for policy 0, policy_version 27850 (0.0006) [2023-03-06 15:05:12,737][04272] Updated weights for policy 0, policy_version 27860 (0.0006) [2023-03-06 15:05:13,570][04272] Updated weights for policy 0, policy_version 27870 (0.0007) [2023-03-06 15:05:13,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12629.3, 300 sec: 12614.3). Total num frames: 28542976. Throughput: 0: 12618.0. Samples: 28522913. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:05:13,941][03942] Avg episode reward: [(0, '1489.305')] [2023-03-06 15:05:14,369][04272] Updated weights for policy 0, policy_version 27880 (0.0006) [2023-03-06 15:05:15,182][04272] Updated weights for policy 0, policy_version 27890 (0.0006) [2023-03-06 15:05:15,988][04272] Updated weights for policy 0, policy_version 27900 (0.0006) [2023-03-06 15:05:16,795][04272] Updated weights for policy 0, policy_version 27910 (0.0006) [2023-03-06 15:05:17,620][04272] Updated weights for policy 0, policy_version 27920 (0.0007) [2023-03-06 15:05:18,430][04272] Updated weights for policy 0, policy_version 27930 (0.0006) [2023-03-06 15:05:18,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12629.3, 300 sec: 12617.8). Total num frames: 28606464. Throughput: 0: 12610.3. Samples: 28598437. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:05:18,941][03942] Avg episode reward: [(0, '1437.266')] [2023-03-06 15:05:19,246][04272] Updated weights for policy 0, policy_version 27940 (0.0007) [2023-03-06 15:05:20,059][04272] Updated weights for policy 0, policy_version 27950 (0.0006) [2023-03-06 15:05:20,883][04272] Updated weights for policy 0, policy_version 27960 (0.0006) [2023-03-06 15:05:21,691][04272] Updated weights for policy 0, policy_version 27970 (0.0006) [2023-03-06 15:05:22,520][04272] Updated weights for policy 0, policy_version 27980 (0.0007) [2023-03-06 15:05:23,314][04272] Updated weights for policy 0, policy_version 27990 (0.0006) [2023-03-06 15:05:23,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12614.3). Total num frames: 28668928. Throughput: 0: 12607.4. Samples: 28636190. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:05:23,941][03942] Avg episode reward: [(0, '1353.963')] [2023-03-06 15:05:24,145][04272] Updated weights for policy 0, policy_version 28000 (0.0006) [2023-03-06 15:05:24,957][04272] Updated weights for policy 0, policy_version 28010 (0.0006) [2023-03-06 15:05:25,752][04272] Updated weights for policy 0, policy_version 28020 (0.0007) [2023-03-06 15:05:26,565][04272] Updated weights for policy 0, policy_version 28030 (0.0007) [2023-03-06 15:05:27,378][04272] Updated weights for policy 0, policy_version 28040 (0.0006) [2023-03-06 15:05:28,185][04272] Updated weights for policy 0, policy_version 28050 (0.0007) [2023-03-06 15:05:28,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.2, 300 sec: 12617.8). Total num frames: 28732416. Throughput: 0: 12603.2. Samples: 28711810. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:05:28,941][03942] Avg episode reward: [(0, '1322.815')] [2023-03-06 15:05:28,985][04272] Updated weights for policy 0, policy_version 28060 (0.0006) [2023-03-06 15:05:29,798][04272] Updated weights for policy 0, policy_version 28070 (0.0006) [2023-03-06 15:05:30,608][04272] Updated weights for policy 0, policy_version 28080 (0.0007) [2023-03-06 15:05:31,414][04272] Updated weights for policy 0, policy_version 28090 (0.0007) [2023-03-06 15:05:32,220][04272] Updated weights for policy 0, policy_version 28100 (0.0005) [2023-03-06 15:05:33,029][04272] Updated weights for policy 0, policy_version 28110 (0.0007) [2023-03-06 15:05:33,835][04272] Updated weights for policy 0, policy_version 28120 (0.0006) [2023-03-06 15:05:33,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12612.3, 300 sec: 12621.2). Total num frames: 28795904. Throughput: 0: 12612.3. Samples: 28787877. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:05:33,941][03942] Avg episode reward: [(0, '1062.312')] [2023-03-06 15:05:34,643][04272] Updated weights for policy 0, policy_version 28130 (0.0006) [2023-03-06 15:05:35,476][04272] Updated weights for policy 0, policy_version 28140 (0.0006) [2023-03-06 15:05:36,282][04272] Updated weights for policy 0, policy_version 28150 (0.0006) [2023-03-06 15:05:37,111][04272] Updated weights for policy 0, policy_version 28160 (0.0007) [2023-03-06 15:05:37,922][04272] Updated weights for policy 0, policy_version 28170 (0.0006) [2023-03-06 15:05:38,730][04272] Updated weights for policy 0, policy_version 28180 (0.0006) [2023-03-06 15:05:38,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12617.8). Total num frames: 28858368. Throughput: 0: 12612.4. Samples: 28825504. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:05:38,941][03942] Avg episode reward: [(0, '1254.563')] [2023-03-06 15:05:39,538][04272] Updated weights for policy 0, policy_version 28190 (0.0006) [2023-03-06 15:05:40,344][04272] Updated weights for policy 0, policy_version 28200 (0.0007) [2023-03-06 15:05:41,142][04272] Updated weights for policy 0, policy_version 28210 (0.0006) [2023-03-06 15:05:41,954][04272] Updated weights for policy 0, policy_version 28220 (0.0007) [2023-03-06 15:05:42,765][04272] Updated weights for policy 0, policy_version 28230 (0.0006) [2023-03-06 15:05:43,576][04272] Updated weights for policy 0, policy_version 28240 (0.0006) [2023-03-06 15:05:43,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12621.2). Total num frames: 28921856. Throughput: 0: 12614.0. Samples: 28901389. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:05:43,941][03942] Avg episode reward: [(0, '1356.735')] [2023-03-06 15:05:44,395][04272] Updated weights for policy 0, policy_version 28250 (0.0006) [2023-03-06 15:05:45,202][04272] Updated weights for policy 0, policy_version 28260 (0.0006) [2023-03-06 15:05:46,007][04272] Updated weights for policy 0, policy_version 28270 (0.0006) [2023-03-06 15:05:46,829][04272] Updated weights for policy 0, policy_version 28280 (0.0006) [2023-03-06 15:05:47,642][04272] Updated weights for policy 0, policy_version 28290 (0.0006) [2023-03-06 15:05:48,443][04272] Updated weights for policy 0, policy_version 28300 (0.0006) [2023-03-06 15:05:48,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12612.2, 300 sec: 12621.2). Total num frames: 28985344. Throughput: 0: 12609.9. Samples: 28977105. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:05:48,941][03942] Avg episode reward: [(0, '1334.354')] [2023-03-06 15:05:49,270][04272] Updated weights for policy 0, policy_version 28310 (0.0006) [2023-03-06 15:05:50,084][04272] Updated weights for policy 0, policy_version 28320 (0.0006) [2023-03-06 15:05:50,889][04272] Updated weights for policy 0, policy_version 28330 (0.0006) [2023-03-06 15:05:51,676][04272] Updated weights for policy 0, policy_version 28340 (0.0006) [2023-03-06 15:05:52,491][04272] Updated weights for policy 0, policy_version 28350 (0.0006) [2023-03-06 15:05:53,304][04272] Updated weights for policy 0, policy_version 28360 (0.0007) [2023-03-06 15:05:53,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12621.2). Total num frames: 29047808. Throughput: 0: 12609.9. Samples: 29014984. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:05:53,941][03942] Avg episode reward: [(0, '1427.628')] [2023-03-06 15:05:54,110][04272] Updated weights for policy 0, policy_version 28370 (0.0006) [2023-03-06 15:05:54,922][04272] Updated weights for policy 0, policy_version 28380 (0.0006) [2023-03-06 15:05:55,739][04272] Updated weights for policy 0, policy_version 28390 (0.0007) [2023-03-06 15:05:56,573][04272] Updated weights for policy 0, policy_version 28400 (0.0007) [2023-03-06 15:05:57,361][04272] Updated weights for policy 0, policy_version 28410 (0.0006) [2023-03-06 15:05:58,179][04272] Updated weights for policy 0, policy_version 28420 (0.0006) [2023-03-06 15:05:58,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12624.7). Total num frames: 29111296. Throughput: 0: 12617.5. Samples: 29090701. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:05:58,941][03942] Avg episode reward: [(0, '1355.640')] [2023-03-06 15:05:58,996][04272] Updated weights for policy 0, policy_version 28430 (0.0006) [2023-03-06 15:05:59,810][04272] Updated weights for policy 0, policy_version 28440 (0.0006) [2023-03-06 15:06:00,614][04272] Updated weights for policy 0, policy_version 28450 (0.0006) [2023-03-06 15:06:01,431][04272] Updated weights for policy 0, policy_version 28460 (0.0006) [2023-03-06 15:06:02,233][04272] Updated weights for policy 0, policy_version 28470 (0.0006) [2023-03-06 15:06:03,057][04272] Updated weights for policy 0, policy_version 28480 (0.0006) [2023-03-06 15:06:03,843][04272] Updated weights for policy 0, policy_version 28490 (0.0006) [2023-03-06 15:06:03,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12629.3, 300 sec: 12624.7). Total num frames: 29174784. Throughput: 0: 12626.3. Samples: 29166621. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:06:03,941][03942] Avg episode reward: [(0, '1315.724')] [2023-03-06 15:06:04,673][04272] Updated weights for policy 0, policy_version 28500 (0.0006) [2023-03-06 15:06:05,478][04272] Updated weights for policy 0, policy_version 28510 (0.0007) [2023-03-06 15:06:06,288][04272] Updated weights for policy 0, policy_version 28520 (0.0007) [2023-03-06 15:06:07,097][04272] Updated weights for policy 0, policy_version 28530 (0.0006) [2023-03-06 15:06:07,907][04272] Updated weights for policy 0, policy_version 28540 (0.0007) [2023-03-06 15:06:08,721][04272] Updated weights for policy 0, policy_version 28550 (0.0006) [2023-03-06 15:06:08,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12621.2). Total num frames: 29237248. Throughput: 0: 12626.5. Samples: 29204384. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:06:08,952][03942] Avg episode reward: [(0, '1311.291')] [2023-03-06 15:06:09,541][04272] Updated weights for policy 0, policy_version 28560 (0.0006) [2023-03-06 15:06:10,345][04272] Updated weights for policy 0, policy_version 28570 (0.0007) [2023-03-06 15:06:11,152][04272] Updated weights for policy 0, policy_version 28580 (0.0006) [2023-03-06 15:06:11,965][04272] Updated weights for policy 0, policy_version 28590 (0.0006) [2023-03-06 15:06:12,778][04272] Updated weights for policy 0, policy_version 28600 (0.0006) [2023-03-06 15:06:13,583][04272] Updated weights for policy 0, policy_version 28610 (0.0006) [2023-03-06 15:06:13,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12629.3, 300 sec: 12624.7). Total num frames: 29300736. Throughput: 0: 12628.9. Samples: 29280111. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:06:13,952][03942] Avg episode reward: [(0, '1264.465')] [2023-03-06 15:06:14,382][04272] Updated weights for policy 0, policy_version 28620 (0.0006) [2023-03-06 15:06:15,202][04272] Updated weights for policy 0, policy_version 28630 (0.0006) [2023-03-06 15:06:16,006][04272] Updated weights for policy 0, policy_version 28640 (0.0007) [2023-03-06 15:06:16,811][04272] Updated weights for policy 0, policy_version 28650 (0.0006) [2023-03-06 15:06:17,626][04272] Updated weights for policy 0, policy_version 28660 (0.0006) [2023-03-06 15:06:18,454][04272] Updated weights for policy 0, policy_version 28670 (0.0006) [2023-03-06 15:06:18,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12629.3, 300 sec: 12624.7). Total num frames: 29364224. Throughput: 0: 12626.6. Samples: 29356072. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:06:18,951][03942] Avg episode reward: [(0, '1182.605')] [2023-03-06 15:06:19,243][04272] Updated weights for policy 0, policy_version 28680 (0.0006) [2023-03-06 15:06:20,057][04272] Updated weights for policy 0, policy_version 28690 (0.0007) [2023-03-06 15:06:20,875][04272] Updated weights for policy 0, policy_version 28700 (0.0006) [2023-03-06 15:06:21,686][04272] Updated weights for policy 0, policy_version 28710 (0.0006) [2023-03-06 15:06:22,485][04272] Updated weights for policy 0, policy_version 28720 (0.0006) [2023-03-06 15:06:23,301][04272] Updated weights for policy 0, policy_version 28730 (0.0006) [2023-03-06 15:06:23,940][03942] Fps is (10 sec: 12595.4, 60 sec: 12629.4, 300 sec: 12621.2). Total num frames: 29426688. Throughput: 0: 12633.5. Samples: 29394011. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:06:23,941][03942] Avg episode reward: [(0, '1142.762')] [2023-03-06 15:06:24,105][04272] Updated weights for policy 0, policy_version 28740 (0.0007) [2023-03-06 15:06:24,907][04272] Updated weights for policy 0, policy_version 28750 (0.0006) [2023-03-06 15:06:25,715][04272] Updated weights for policy 0, policy_version 28760 (0.0007) [2023-03-06 15:06:26,539][04272] Updated weights for policy 0, policy_version 28770 (0.0006) [2023-03-06 15:06:27,337][04272] Updated weights for policy 0, policy_version 28780 (0.0007) [2023-03-06 15:06:28,136][04272] Updated weights for policy 0, policy_version 28790 (0.0006) [2023-03-06 15:06:28,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12629.3, 300 sec: 12624.7). Total num frames: 29490176. Throughput: 0: 12634.1. Samples: 29469922. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:06:28,941][03942] Avg episode reward: [(0, '869.245')] [2023-03-06 15:06:28,945][04272] Updated weights for policy 0, policy_version 28800 (0.0006) [2023-03-06 15:06:29,766][04272] Updated weights for policy 0, policy_version 28810 (0.0006) [2023-03-06 15:06:30,598][04272] Updated weights for policy 0, policy_version 28820 (0.0007) [2023-03-06 15:06:31,405][04272] Updated weights for policy 0, policy_version 28830 (0.0006) [2023-03-06 15:06:32,213][04272] Updated weights for policy 0, policy_version 28840 (0.0006) [2023-03-06 15:06:33,046][04272] Updated weights for policy 0, policy_version 28850 (0.0006) [2023-03-06 15:06:33,840][04272] Updated weights for policy 0, policy_version 28860 (0.0006) [2023-03-06 15:06:33,941][03942] Fps is (10 sec: 12697.4, 60 sec: 12629.3, 300 sec: 12624.7). Total num frames: 29553664. Throughput: 0: 12632.4. Samples: 29545563. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:06:33,941][03942] Avg episode reward: [(0, '1152.991')] [2023-03-06 15:06:34,644][04272] Updated weights for policy 0, policy_version 28870 (0.0007) [2023-03-06 15:06:35,461][04272] Updated weights for policy 0, policy_version 28880 (0.0006) [2023-03-06 15:06:36,262][04272] Updated weights for policy 0, policy_version 28890 (0.0006) [2023-03-06 15:06:37,092][04272] Updated weights for policy 0, policy_version 28900 (0.0006) [2023-03-06 15:06:37,884][04272] Updated weights for policy 0, policy_version 28910 (0.0006) [2023-03-06 15:06:38,717][04272] Updated weights for policy 0, policy_version 28920 (0.0006) [2023-03-06 15:06:38,941][03942] Fps is (10 sec: 12697.7, 60 sec: 12646.4, 300 sec: 12628.2). Total num frames: 29617152. Throughput: 0: 12634.4. Samples: 29583534. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:06:38,941][03942] Avg episode reward: [(0, '1171.628')] [2023-03-06 15:06:39,529][04272] Updated weights for policy 0, policy_version 28930 (0.0006) [2023-03-06 15:06:40,326][04272] Updated weights for policy 0, policy_version 28940 (0.0006) [2023-03-06 15:06:41,133][04272] Updated weights for policy 0, policy_version 28950 (0.0005) [2023-03-06 15:06:41,948][04272] Updated weights for policy 0, policy_version 28960 (0.0006) [2023-03-06 15:06:42,768][04272] Updated weights for policy 0, policy_version 28970 (0.0006) [2023-03-06 15:06:43,562][04272] Updated weights for policy 0, policy_version 28980 (0.0006) [2023-03-06 15:06:43,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12629.3, 300 sec: 12624.7). Total num frames: 29679616. Throughput: 0: 12633.7. Samples: 29659218. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:06:43,941][03942] Avg episode reward: [(0, '1266.190')] [2023-03-06 15:06:44,392][04272] Updated weights for policy 0, policy_version 28990 (0.0007) [2023-03-06 15:06:45,224][04272] Updated weights for policy 0, policy_version 29000 (0.0006) [2023-03-06 15:06:46,037][04272] Updated weights for policy 0, policy_version 29010 (0.0007) [2023-03-06 15:06:46,842][04272] Updated weights for policy 0, policy_version 29020 (0.0006) [2023-03-06 15:06:47,663][04272] Updated weights for policy 0, policy_version 29030 (0.0007) [2023-03-06 15:06:48,466][04272] Updated weights for policy 0, policy_version 29040 (0.0006) [2023-03-06 15:06:48,940][03942] Fps is (10 sec: 12492.8, 60 sec: 12612.3, 300 sec: 12624.7). Total num frames: 29742080. Throughput: 0: 12621.5. Samples: 29734587. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:06:48,941][03942] Avg episode reward: [(0, '1268.401')] [2023-03-06 15:06:49,296][04272] Updated weights for policy 0, policy_version 29050 (0.0007) [2023-03-06 15:06:50,098][04272] Updated weights for policy 0, policy_version 29060 (0.0006) [2023-03-06 15:06:50,919][04272] Updated weights for policy 0, policy_version 29070 (0.0006) [2023-03-06 15:06:51,743][04272] Updated weights for policy 0, policy_version 29080 (0.0006) [2023-03-06 15:06:52,565][04272] Updated weights for policy 0, policy_version 29090 (0.0006) [2023-03-06 15:06:53,373][04272] Updated weights for policy 0, policy_version 29100 (0.0006) [2023-03-06 15:06:53,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12629.3, 300 sec: 12624.7). Total num frames: 29805568. Throughput: 0: 12618.7. Samples: 29772223. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:06:53,941][03942] Avg episode reward: [(0, '1404.194')] [2023-03-06 15:06:54,185][04272] Updated weights for policy 0, policy_version 29110 (0.0006) [2023-03-06 15:06:54,996][04272] Updated weights for policy 0, policy_version 29120 (0.0007) [2023-03-06 15:06:55,806][04272] Updated weights for policy 0, policy_version 29130 (0.0007) [2023-03-06 15:06:56,625][04272] Updated weights for policy 0, policy_version 29140 (0.0006) [2023-03-06 15:06:57,425][04272] Updated weights for policy 0, policy_version 29150 (0.0006) [2023-03-06 15:06:58,247][04272] Updated weights for policy 0, policy_version 29160 (0.0006) [2023-03-06 15:06:58,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12621.2). Total num frames: 29868032. Throughput: 0: 12612.7. Samples: 29847681. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:06:58,941][03942] Avg episode reward: [(0, '1313.274')] [2023-03-06 15:06:59,053][04272] Updated weights for policy 0, policy_version 29170 (0.0006) [2023-03-06 15:06:59,866][04272] Updated weights for policy 0, policy_version 29180 (0.0006) [2023-03-06 15:07:00,682][04272] Updated weights for policy 0, policy_version 29190 (0.0006) [2023-03-06 15:07:01,488][04272] Updated weights for policy 0, policy_version 29200 (0.0006) [2023-03-06 15:07:02,302][04272] Updated weights for policy 0, policy_version 29210 (0.0006) [2023-03-06 15:07:03,114][04272] Updated weights for policy 0, policy_version 29220 (0.0006) [2023-03-06 15:07:03,923][04272] Updated weights for policy 0, policy_version 29230 (0.0007) [2023-03-06 15:07:03,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12624.7). Total num frames: 29931520. Throughput: 0: 12608.8. Samples: 29923469. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:07:03,941][03942] Avg episode reward: [(0, '1366.791')] [2023-03-06 15:07:04,718][04272] Updated weights for policy 0, policy_version 29240 (0.0006) [2023-03-06 15:07:05,526][04272] Updated weights for policy 0, policy_version 29250 (0.0006) [2023-03-06 15:07:06,345][04272] Updated weights for policy 0, policy_version 29260 (0.0007) [2023-03-06 15:07:07,149][04272] Updated weights for policy 0, policy_version 29270 (0.0006) [2023-03-06 15:07:07,946][04272] Updated weights for policy 0, policy_version 29280 (0.0006) [2023-03-06 15:07:08,764][04272] Updated weights for policy 0, policy_version 29290 (0.0006) [2023-03-06 15:07:08,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12629.3, 300 sec: 12624.7). Total num frames: 29995008. Throughput: 0: 12610.5. Samples: 29961483. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:07:08,941][03942] Avg episode reward: [(0, '1285.557')] [2023-03-06 15:07:08,945][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000029292_29995008.pth... [2023-03-06 15:07:08,977][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000026332_26963968.pth [2023-03-06 15:07:09,582][04272] Updated weights for policy 0, policy_version 29300 (0.0006) [2023-03-06 15:07:10,377][04272] Updated weights for policy 0, policy_version 29310 (0.0006) [2023-03-06 15:07:11,216][04272] Updated weights for policy 0, policy_version 29320 (0.0006) [2023-03-06 15:07:12,012][04272] Updated weights for policy 0, policy_version 29330 (0.0006) [2023-03-06 15:07:12,827][04272] Updated weights for policy 0, policy_version 29340 (0.0006) [2023-03-06 15:07:13,631][04272] Updated weights for policy 0, policy_version 29350 (0.0006) [2023-03-06 15:07:13,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12624.7). Total num frames: 30057472. Throughput: 0: 12606.3. Samples: 30037204. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:07:13,941][03942] Avg episode reward: [(0, '1312.362')] [2023-03-06 15:07:14,460][04272] Updated weights for policy 0, policy_version 29360 (0.0007) [2023-03-06 15:07:15,269][04272] Updated weights for policy 0, policy_version 29370 (0.0007) [2023-03-06 15:07:16,072][04272] Updated weights for policy 0, policy_version 29380 (0.0006) [2023-03-06 15:07:16,894][04272] Updated weights for policy 0, policy_version 29390 (0.0006) [2023-03-06 15:07:17,718][04272] Updated weights for policy 0, policy_version 29400 (0.0006) [2023-03-06 15:07:18,532][04272] Updated weights for policy 0, policy_version 29410 (0.0007) [2023-03-06 15:07:18,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12624.7). Total num frames: 30120960. Throughput: 0: 12603.2. Samples: 30112707. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:07:18,941][03942] Avg episode reward: [(0, '1293.051')] [2023-03-06 15:07:19,352][04272] Updated weights for policy 0, policy_version 29420 (0.0007) [2023-03-06 15:07:20,158][04272] Updated weights for policy 0, policy_version 29430 (0.0007) [2023-03-06 15:07:20,974][04272] Updated weights for policy 0, policy_version 29440 (0.0007) [2023-03-06 15:07:21,781][04272] Updated weights for policy 0, policy_version 29450 (0.0006) [2023-03-06 15:07:22,579][04272] Updated weights for policy 0, policy_version 29460 (0.0006) [2023-03-06 15:07:23,394][04272] Updated weights for policy 0, policy_version 29470 (0.0006) [2023-03-06 15:07:23,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.2, 300 sec: 12624.7). Total num frames: 30183424. Throughput: 0: 12599.8. Samples: 30150526. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:07:23,941][03942] Avg episode reward: [(0, '1332.413')] [2023-03-06 15:07:24,213][04272] Updated weights for policy 0, policy_version 29480 (0.0006) [2023-03-06 15:07:25,016][04272] Updated weights for policy 0, policy_version 29490 (0.0006) [2023-03-06 15:07:25,845][04272] Updated weights for policy 0, policy_version 29500 (0.0006) [2023-03-06 15:07:26,649][04272] Updated weights for policy 0, policy_version 29510 (0.0006) [2023-03-06 15:07:27,467][04272] Updated weights for policy 0, policy_version 29520 (0.0006) [2023-03-06 15:07:28,280][04272] Updated weights for policy 0, policy_version 29530 (0.0007) [2023-03-06 15:07:28,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12624.7). Total num frames: 30246912. Throughput: 0: 12601.4. Samples: 30226281. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:07:28,941][03942] Avg episode reward: [(0, '1336.100')] [2023-03-06 15:07:29,077][04272] Updated weights for policy 0, policy_version 29540 (0.0006) [2023-03-06 15:07:29,904][04272] Updated weights for policy 0, policy_version 29550 (0.0006) [2023-03-06 15:07:30,721][04272] Updated weights for policy 0, policy_version 29560 (0.0006) [2023-03-06 15:07:31,530][04272] Updated weights for policy 0, policy_version 29570 (0.0007) [2023-03-06 15:07:32,352][04272] Updated weights for policy 0, policy_version 29580 (0.0006) [2023-03-06 15:07:33,170][04272] Updated weights for policy 0, policy_version 29590 (0.0006) [2023-03-06 15:07:33,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12621.2). Total num frames: 30309376. Throughput: 0: 12596.7. Samples: 30301438. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:07:33,941][03942] Avg episode reward: [(0, '1358.279')] [2023-03-06 15:07:33,989][04272] Updated weights for policy 0, policy_version 29600 (0.0006) [2023-03-06 15:07:34,812][04272] Updated weights for policy 0, policy_version 29610 (0.0006) [2023-03-06 15:07:35,627][04272] Updated weights for policy 0, policy_version 29620 (0.0007) [2023-03-06 15:07:36,410][04272] Updated weights for policy 0, policy_version 29630 (0.0007) [2023-03-06 15:07:37,249][04272] Updated weights for policy 0, policy_version 29640 (0.0006) [2023-03-06 15:07:38,061][04272] Updated weights for policy 0, policy_version 29650 (0.0006) [2023-03-06 15:07:38,884][04272] Updated weights for policy 0, policy_version 29660 (0.0007) [2023-03-06 15:07:38,941][03942] Fps is (10 sec: 12492.8, 60 sec: 12578.1, 300 sec: 12621.2). Total num frames: 30371840. Throughput: 0: 12598.0. Samples: 30339134. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:07:38,941][03942] Avg episode reward: [(0, '1466.939')] [2023-03-06 15:07:39,709][04272] Updated weights for policy 0, policy_version 29670 (0.0007) [2023-03-06 15:07:40,525][04272] Updated weights for policy 0, policy_version 29680 (0.0007) [2023-03-06 15:07:41,328][04272] Updated weights for policy 0, policy_version 29690 (0.0006) [2023-03-06 15:07:42,126][04272] Updated weights for policy 0, policy_version 29700 (0.0006) [2023-03-06 15:07:42,958][04272] Updated weights for policy 0, policy_version 29710 (0.0005) [2023-03-06 15:07:43,770][04272] Updated weights for policy 0, policy_version 29720 (0.0006) [2023-03-06 15:07:43,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12621.2). Total num frames: 30435328. Throughput: 0: 12596.6. Samples: 30414529. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:07:43,941][03942] Avg episode reward: [(0, '1252.793')] [2023-03-06 15:07:44,574][04272] Updated weights for policy 0, policy_version 29730 (0.0007) [2023-03-06 15:07:45,399][04272] Updated weights for policy 0, policy_version 29740 (0.0008) [2023-03-06 15:07:46,209][04272] Updated weights for policy 0, policy_version 29750 (0.0006) [2023-03-06 15:07:46,996][04272] Updated weights for policy 0, policy_version 29760 (0.0007) [2023-03-06 15:07:47,845][04272] Updated weights for policy 0, policy_version 29770 (0.0007) [2023-03-06 15:07:48,644][04272] Updated weights for policy 0, policy_version 29780 (0.0006) [2023-03-06 15:07:48,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12617.8). Total num frames: 30497792. Throughput: 0: 12591.1. Samples: 30490070. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:07:48,941][03942] Avg episode reward: [(0, '1295.484')] [2023-03-06 15:07:49,458][04272] Updated weights for policy 0, policy_version 29790 (0.0006) [2023-03-06 15:07:50,281][04272] Updated weights for policy 0, policy_version 29800 (0.0006) [2023-03-06 15:07:51,093][04272] Updated weights for policy 0, policy_version 29810 (0.0006) [2023-03-06 15:07:51,892][04272] Updated weights for policy 0, policy_version 29820 (0.0007) [2023-03-06 15:07:52,721][04272] Updated weights for policy 0, policy_version 29830 (0.0006) [2023-03-06 15:07:53,521][04272] Updated weights for policy 0, policy_version 29840 (0.0006) [2023-03-06 15:07:53,940][03942] Fps is (10 sec: 12492.8, 60 sec: 12578.1, 300 sec: 12617.8). Total num frames: 30560256. Throughput: 0: 12582.6. Samples: 30527701. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:07:53,941][03942] Avg episode reward: [(0, '1304.771')] [2023-03-06 15:07:54,330][04272] Updated weights for policy 0, policy_version 29850 (0.0006) [2023-03-06 15:07:55,145][04272] Updated weights for policy 0, policy_version 29860 (0.0006) [2023-03-06 15:07:55,958][04272] Updated weights for policy 0, policy_version 29870 (0.0006) [2023-03-06 15:07:56,774][04272] Updated weights for policy 0, policy_version 29880 (0.0006) [2023-03-06 15:07:57,588][04272] Updated weights for policy 0, policy_version 29890 (0.0007) [2023-03-06 15:07:58,394][04272] Updated weights for policy 0, policy_version 29900 (0.0006) [2023-03-06 15:07:58,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12617.8). Total num frames: 30623744. Throughput: 0: 12584.1. Samples: 30603487. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:07:58,941][03942] Avg episode reward: [(0, '1316.498')] [2023-03-06 15:07:59,214][04272] Updated weights for policy 0, policy_version 29910 (0.0006) [2023-03-06 15:08:00,009][04272] Updated weights for policy 0, policy_version 29920 (0.0007) [2023-03-06 15:08:00,833][04272] Updated weights for policy 0, policy_version 29930 (0.0007) [2023-03-06 15:08:01,649][04272] Updated weights for policy 0, policy_version 29940 (0.0006) [2023-03-06 15:08:02,465][04272] Updated weights for policy 0, policy_version 29950 (0.0006) [2023-03-06 15:08:03,290][04272] Updated weights for policy 0, policy_version 29960 (0.0006) [2023-03-06 15:08:03,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12595.2, 300 sec: 12621.2). Total num frames: 30687232. Throughput: 0: 12586.0. Samples: 30679075. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:08:03,941][03942] Avg episode reward: [(0, '1419.494')] [2023-03-06 15:08:04,096][04272] Updated weights for policy 0, policy_version 29970 (0.0006) [2023-03-06 15:08:04,888][04272] Updated weights for policy 0, policy_version 29980 (0.0008) [2023-03-06 15:08:05,699][04272] Updated weights for policy 0, policy_version 29990 (0.0007) [2023-03-06 15:08:06,518][04272] Updated weights for policy 0, policy_version 30000 (0.0006) [2023-03-06 15:08:07,327][04272] Updated weights for policy 0, policy_version 30010 (0.0006) [2023-03-06 15:08:08,140][04272] Updated weights for policy 0, policy_version 30020 (0.0006) [2023-03-06 15:08:08,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12578.1, 300 sec: 12617.8). Total num frames: 30749696. Throughput: 0: 12588.7. Samples: 30717019. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:08:08,941][03942] Avg episode reward: [(0, '1360.001')] [2023-03-06 15:08:08,965][04272] Updated weights for policy 0, policy_version 30030 (0.0006) [2023-03-06 15:08:09,763][04272] Updated weights for policy 0, policy_version 30040 (0.0006) [2023-03-06 15:08:10,578][04272] Updated weights for policy 0, policy_version 30050 (0.0006) [2023-03-06 15:08:11,412][04272] Updated weights for policy 0, policy_version 30060 (0.0008) [2023-03-06 15:08:12,223][04272] Updated weights for policy 0, policy_version 30070 (0.0006) [2023-03-06 15:08:13,038][04272] Updated weights for policy 0, policy_version 30080 (0.0006) [2023-03-06 15:08:13,846][04272] Updated weights for policy 0, policy_version 30090 (0.0006) [2023-03-06 15:08:13,940][03942] Fps is (10 sec: 12492.8, 60 sec: 12578.1, 300 sec: 12614.3). Total num frames: 30812160. Throughput: 0: 12580.1. Samples: 30792386. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:08:13,941][03942] Avg episode reward: [(0, '1420.262')] [2023-03-06 15:08:14,667][04272] Updated weights for policy 0, policy_version 30100 (0.0007) [2023-03-06 15:08:15,470][04272] Updated weights for policy 0, policy_version 30110 (0.0006) [2023-03-06 15:08:16,287][04272] Updated weights for policy 0, policy_version 30120 (0.0006) [2023-03-06 15:08:17,101][04272] Updated weights for policy 0, policy_version 30130 (0.0006) [2023-03-06 15:08:17,909][04272] Updated weights for policy 0, policy_version 30140 (0.0006) [2023-03-06 15:08:18,725][04272] Updated weights for policy 0, policy_version 30150 (0.0008) [2023-03-06 15:08:18,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12578.1, 300 sec: 12617.8). Total num frames: 30875648. Throughput: 0: 12591.2. Samples: 30868040. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:08:18,941][03942] Avg episode reward: [(0, '1160.958')] [2023-03-06 15:08:19,546][04272] Updated weights for policy 0, policy_version 30160 (0.0007) [2023-03-06 15:08:20,350][04272] Updated weights for policy 0, policy_version 30170 (0.0006) [2023-03-06 15:08:21,162][04272] Updated weights for policy 0, policy_version 30180 (0.0006) [2023-03-06 15:08:21,974][04272] Updated weights for policy 0, policy_version 30190 (0.0006) [2023-03-06 15:08:22,780][04272] Updated weights for policy 0, policy_version 30200 (0.0006) [2023-03-06 15:08:23,593][04272] Updated weights for policy 0, policy_version 30210 (0.0007) [2023-03-06 15:08:23,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12595.2, 300 sec: 12621.2). Total num frames: 30939136. Throughput: 0: 12588.9. Samples: 30905634. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:08:23,941][03942] Avg episode reward: [(0, '1262.281')] [2023-03-06 15:08:24,406][04272] Updated weights for policy 0, policy_version 30220 (0.0006) [2023-03-06 15:08:25,245][04272] Updated weights for policy 0, policy_version 30230 (0.0007) [2023-03-06 15:08:26,043][04272] Updated weights for policy 0, policy_version 30240 (0.0006) [2023-03-06 15:08:26,847][04272] Updated weights for policy 0, policy_version 30250 (0.0006) [2023-03-06 15:08:27,660][04272] Updated weights for policy 0, policy_version 30260 (0.0008) [2023-03-06 15:08:28,469][04272] Updated weights for policy 0, policy_version 30270 (0.0008) [2023-03-06 15:08:28,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12578.1, 300 sec: 12617.8). Total num frames: 31001600. Throughput: 0: 12597.1. Samples: 30981400. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:08:28,941][03942] Avg episode reward: [(0, '1242.066')] [2023-03-06 15:08:29,278][04272] Updated weights for policy 0, policy_version 30280 (0.0007) [2023-03-06 15:08:30,103][04272] Updated weights for policy 0, policy_version 30290 (0.0007) [2023-03-06 15:08:30,913][04272] Updated weights for policy 0, policy_version 30300 (0.0007) [2023-03-06 15:08:31,739][04272] Updated weights for policy 0, policy_version 30310 (0.0007) [2023-03-06 15:08:32,539][04272] Updated weights for policy 0, policy_version 30320 (0.0006) [2023-03-06 15:08:33,354][04272] Updated weights for policy 0, policy_version 30330 (0.0006) [2023-03-06 15:08:33,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12617.8). Total num frames: 31065088. Throughput: 0: 12597.5. Samples: 31056959. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:08:33,941][03942] Avg episode reward: [(0, '1348.183')] [2023-03-06 15:08:34,167][04272] Updated weights for policy 0, policy_version 30340 (0.0006) [2023-03-06 15:08:34,969][04272] Updated weights for policy 0, policy_version 30350 (0.0007) [2023-03-06 15:08:35,786][04272] Updated weights for policy 0, policy_version 30360 (0.0007) [2023-03-06 15:08:36,616][04272] Updated weights for policy 0, policy_version 30370 (0.0008) [2023-03-06 15:08:37,410][04272] Updated weights for policy 0, policy_version 30380 (0.0006) [2023-03-06 15:08:38,226][04272] Updated weights for policy 0, policy_version 30390 (0.0006) [2023-03-06 15:08:38,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12614.3). Total num frames: 31127552. Throughput: 0: 12599.8. Samples: 31094693. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:08:38,941][03942] Avg episode reward: [(0, '1429.461')] [2023-03-06 15:08:39,039][04272] Updated weights for policy 0, policy_version 30400 (0.0006) [2023-03-06 15:08:39,835][04272] Updated weights for policy 0, policy_version 30410 (0.0006) [2023-03-06 15:08:40,654][04272] Updated weights for policy 0, policy_version 30420 (0.0006) [2023-03-06 15:08:41,460][04272] Updated weights for policy 0, policy_version 30430 (0.0006) [2023-03-06 15:08:42,281][04272] Updated weights for policy 0, policy_version 30440 (0.0006) [2023-03-06 15:08:43,083][04272] Updated weights for policy 0, policy_version 30450 (0.0006) [2023-03-06 15:08:43,889][04272] Updated weights for policy 0, policy_version 30460 (0.0006) [2023-03-06 15:08:43,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12614.3). Total num frames: 31191040. Throughput: 0: 12603.7. Samples: 31170652. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:08:43,941][03942] Avg episode reward: [(0, '1372.686')] [2023-03-06 15:08:44,718][04272] Updated weights for policy 0, policy_version 30470 (0.0006) [2023-03-06 15:08:45,517][04272] Updated weights for policy 0, policy_version 30480 (0.0007) [2023-03-06 15:08:46,326][04272] Updated weights for policy 0, policy_version 30490 (0.0006) [2023-03-06 15:08:47,144][04272] Updated weights for policy 0, policy_version 30500 (0.0006) [2023-03-06 15:08:47,978][04272] Updated weights for policy 0, policy_version 30510 (0.0007) [2023-03-06 15:08:48,769][04272] Updated weights for policy 0, policy_version 30520 (0.0006) [2023-03-06 15:08:48,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12614.3). Total num frames: 31253504. Throughput: 0: 12604.4. Samples: 31246273. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:08:48,941][03942] Avg episode reward: [(0, '1237.487')] [2023-03-06 15:08:49,599][04272] Updated weights for policy 0, policy_version 30530 (0.0006) [2023-03-06 15:08:50,408][04272] Updated weights for policy 0, policy_version 30540 (0.0006) [2023-03-06 15:08:51,228][04272] Updated weights for policy 0, policy_version 30550 (0.0006) [2023-03-06 15:08:52,042][04272] Updated weights for policy 0, policy_version 30560 (0.0006) [2023-03-06 15:08:52,849][04272] Updated weights for policy 0, policy_version 30570 (0.0006) [2023-03-06 15:08:53,666][04272] Updated weights for policy 0, policy_version 30580 (0.0007) [2023-03-06 15:08:53,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12614.3). Total num frames: 31316992. Throughput: 0: 12598.3. Samples: 31283945. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:08:53,941][03942] Avg episode reward: [(0, '1371.193')] [2023-03-06 15:08:54,485][04272] Updated weights for policy 0, policy_version 30590 (0.0007) [2023-03-06 15:08:55,293][04272] Updated weights for policy 0, policy_version 30600 (0.0006) [2023-03-06 15:08:56,110][04272] Updated weights for policy 0, policy_version 30610 (0.0007) [2023-03-06 15:08:56,928][04272] Updated weights for policy 0, policy_version 30620 (0.0006) [2023-03-06 15:08:57,723][04272] Updated weights for policy 0, policy_version 30630 (0.0007) [2023-03-06 15:08:58,537][04272] Updated weights for policy 0, policy_version 30640 (0.0007) [2023-03-06 15:08:58,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12610.8). Total num frames: 31379456. Throughput: 0: 12602.6. Samples: 31359504. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:08:58,941][03942] Avg episode reward: [(0, '1365.942')] [2023-03-06 15:08:59,357][04272] Updated weights for policy 0, policy_version 30650 (0.0007) [2023-03-06 15:09:00,152][04272] Updated weights for policy 0, policy_version 30660 (0.0007) [2023-03-06 15:09:00,963][04272] Updated weights for policy 0, policy_version 30670 (0.0007) [2023-03-06 15:09:01,789][04272] Updated weights for policy 0, policy_version 30680 (0.0006) [2023-03-06 15:09:02,631][04272] Updated weights for policy 0, policy_version 30690 (0.0006) [2023-03-06 15:09:03,433][04272] Updated weights for policy 0, policy_version 30700 (0.0007) [2023-03-06 15:09:03,940][03942] Fps is (10 sec: 12595.4, 60 sec: 12595.2, 300 sec: 12614.3). Total num frames: 31442944. Throughput: 0: 12594.0. Samples: 31434768. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:09:03,941][03942] Avg episode reward: [(0, '1455.770')] [2023-03-06 15:09:04,276][04272] Updated weights for policy 0, policy_version 30710 (0.0007) [2023-03-06 15:09:05,075][04272] Updated weights for policy 0, policy_version 30720 (0.0007) [2023-03-06 15:09:05,877][04272] Updated weights for policy 0, policy_version 30730 (0.0006) [2023-03-06 15:09:06,701][04272] Updated weights for policy 0, policy_version 30740 (0.0006) [2023-03-06 15:09:07,518][04272] Updated weights for policy 0, policy_version 30750 (0.0006) [2023-03-06 15:09:08,341][04272] Updated weights for policy 0, policy_version 30760 (0.0006) [2023-03-06 15:09:08,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12610.8). Total num frames: 31505408. Throughput: 0: 12595.2. Samples: 31472418. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:09:08,941][03942] Avg episode reward: [(0, '1460.086')] [2023-03-06 15:09:08,945][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000030767_31505408.pth... 
[2023-03-06 15:09:08,975][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000027813_28480512.pth [2023-03-06 15:09:09,145][04272] Updated weights for policy 0, policy_version 30770 (0.0007) [2023-03-06 15:09:09,972][04272] Updated weights for policy 0, policy_version 30780 (0.0006) [2023-03-06 15:09:10,761][04272] Updated weights for policy 0, policy_version 30790 (0.0006) [2023-03-06 15:09:11,570][04272] Updated weights for policy 0, policy_version 30800 (0.0006) [2023-03-06 15:09:12,399][04272] Updated weights for policy 0, policy_version 30810 (0.0007) [2023-03-06 15:09:13,211][04272] Updated weights for policy 0, policy_version 30820 (0.0006) [2023-03-06 15:09:13,940][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12610.8). Total num frames: 31568896. Throughput: 0: 12590.0. Samples: 31547951. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:09:13,941][03942] Avg episode reward: [(0, '1334.277')] [2023-03-06 15:09:14,002][04272] Updated weights for policy 0, policy_version 30830 (0.0007) [2023-03-06 15:09:14,835][04272] Updated weights for policy 0, policy_version 30840 (0.0006) [2023-03-06 15:09:15,644][04272] Updated weights for policy 0, policy_version 30850 (0.0007) [2023-03-06 15:09:16,471][04272] Updated weights for policy 0, policy_version 30860 (0.0006) [2023-03-06 15:09:17,270][04272] Updated weights for policy 0, policy_version 30870 (0.0006) [2023-03-06 15:09:18,083][04272] Updated weights for policy 0, policy_version 30880 (0.0007) [2023-03-06 15:09:18,899][04272] Updated weights for policy 0, policy_version 30890 (0.0006) [2023-03-06 15:09:18,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12607.3). Total num frames: 31631360. Throughput: 0: 12591.1. Samples: 31623558. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:09:18,941][03942] Avg episode reward: [(0, '1368.908')] [2023-03-06 15:09:19,722][04272] Updated weights for policy 0, policy_version 30900 (0.0006) [2023-03-06 15:09:20,519][04272] Updated weights for policy 0, policy_version 30910 (0.0006) [2023-03-06 15:09:21,341][04272] Updated weights for policy 0, policy_version 30920 (0.0006) [2023-03-06 15:09:22,138][04272] Updated weights for policy 0, policy_version 30930 (0.0006) [2023-03-06 15:09:22,961][04272] Updated weights for policy 0, policy_version 30940 (0.0007) [2023-03-06 15:09:23,762][04272] Updated weights for policy 0, policy_version 30950 (0.0006) [2023-03-06 15:09:23,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12607.3). Total num frames: 31694848. Throughput: 0: 12589.4. Samples: 31661219. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:09:23,941][03942] Avg episode reward: [(0, '1400.636')] [2023-03-06 15:09:24,571][04272] Updated weights for policy 0, policy_version 30960 (0.0006) [2023-03-06 15:09:25,385][04272] Updated weights for policy 0, policy_version 30970 (0.0006) [2023-03-06 15:09:26,197][04272] Updated weights for policy 0, policy_version 30980 (0.0006) [2023-03-06 15:09:27,005][04272] Updated weights for policy 0, policy_version 30990 (0.0006) [2023-03-06 15:09:27,835][04272] Updated weights for policy 0, policy_version 31000 (0.0006) [2023-03-06 15:09:28,644][04272] Updated weights for policy 0, policy_version 31010 (0.0006) [2023-03-06 15:09:28,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12603.9). Total num frames: 31757312. Throughput: 0: 12582.7. Samples: 31736873. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:09:28,941][03942] Avg episode reward: [(0, '1301.484')] [2023-03-06 15:09:29,476][04272] Updated weights for policy 0, policy_version 31020 (0.0006) [2023-03-06 15:09:30,274][04272] Updated weights for policy 0, policy_version 31030 (0.0006) [2023-03-06 15:09:31,089][04272] Updated weights for policy 0, policy_version 31040 (0.0006) [2023-03-06 15:09:31,907][04272] Updated weights for policy 0, policy_version 31050 (0.0006) [2023-03-06 15:09:32,709][04272] Updated weights for policy 0, policy_version 31060 (0.0006) [2023-03-06 15:09:33,523][04272] Updated weights for policy 0, policy_version 31070 (0.0006) [2023-03-06 15:09:33,941][03942] Fps is (10 sec: 12492.8, 60 sec: 12578.1, 300 sec: 12600.4). Total num frames: 31819776. Throughput: 0: 12584.6. Samples: 31812579. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:09:33,941][03942] Avg episode reward: [(0, '1355.985')] [2023-03-06 15:09:34,331][04272] Updated weights for policy 0, policy_version 31080 (0.0006) [2023-03-06 15:09:35,146][04272] Updated weights for policy 0, policy_version 31090 (0.0007) [2023-03-06 15:09:35,965][04272] Updated weights for policy 0, policy_version 31100 (0.0006) [2023-03-06 15:09:36,758][04272] Updated weights for policy 0, policy_version 31110 (0.0007) [2023-03-06 15:09:37,572][04272] Updated weights for policy 0, policy_version 31120 (0.0006) [2023-03-06 15:09:38,395][04272] Updated weights for policy 0, policy_version 31130 (0.0006) [2023-03-06 15:09:38,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12603.9). Total num frames: 31883264. Throughput: 0: 12590.1. Samples: 31850501. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:09:38,941][03942] Avg episode reward: [(0, '1321.895')] [2023-03-06 15:09:39,209][04272] Updated weights for policy 0, policy_version 31140 (0.0006) [2023-03-06 15:09:40,017][04272] Updated weights for policy 0, policy_version 31150 (0.0007) [2023-03-06 15:09:40,834][04272] Updated weights for policy 0, policy_version 31160 (0.0006) [2023-03-06 15:09:41,646][04272] Updated weights for policy 0, policy_version 31170 (0.0006) [2023-03-06 15:09:42,446][04272] Updated weights for policy 0, policy_version 31180 (0.0006) [2023-03-06 15:09:43,237][04272] Updated weights for policy 0, policy_version 31190 (0.0005) [2023-03-06 15:09:43,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12595.2, 300 sec: 12603.9). Total num frames: 31946752. Throughput: 0: 12591.7. Samples: 31926131. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:09:43,941][03942] Avg episode reward: [(0, '1238.973')] [2023-03-06 15:09:44,062][04272] Updated weights for policy 0, policy_version 31200 (0.0006) [2023-03-06 15:09:44,873][04272] Updated weights for policy 0, policy_version 31210 (0.0006) [2023-03-06 15:09:45,685][04272] Updated weights for policy 0, policy_version 31220 (0.0006) [2023-03-06 15:09:46,505][04272] Updated weights for policy 0, policy_version 31230 (0.0006) [2023-03-06 15:09:47,308][04272] Updated weights for policy 0, policy_version 31240 (0.0006) [2023-03-06 15:09:48,118][04272] Updated weights for policy 0, policy_version 31250 (0.0006) [2023-03-06 15:09:48,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12612.3, 300 sec: 12607.3). Total num frames: 32010240. Throughput: 0: 12603.4. Samples: 32001922. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:09:48,941][04272] Updated weights for policy 0, policy_version 31260 (0.0006) [2023-03-06 15:09:48,941][03942] Avg episode reward: [(0, '1372.209')] [2023-03-06 15:09:49,749][04272] Updated weights for policy 0, policy_version 31270 (0.0007) [2023-03-06 15:09:50,561][04272] Updated weights for policy 0, policy_version 31280 (0.0007) [2023-03-06 15:09:51,385][04272] Updated weights for policy 0, policy_version 31290 (0.0006) [2023-03-06 15:09:52,216][04272] Updated weights for policy 0, policy_version 31300 (0.0006) [2023-03-06 15:09:53,037][04272] Updated weights for policy 0, policy_version 31310 (0.0006) [2023-03-06 15:09:53,832][04272] Updated weights for policy 0, policy_version 31320 (0.0006) [2023-03-06 15:09:53,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12603.9). Total num frames: 32072704. Throughput: 0: 12604.7. Samples: 32039632. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:09:53,941][03942] Avg episode reward: [(0, '1385.902')] [2023-03-06 15:09:54,643][04272] Updated weights for policy 0, policy_version 31330 (0.0006) [2023-03-06 15:09:55,463][04272] Updated weights for policy 0, policy_version 31340 (0.0006) [2023-03-06 15:09:56,265][04272] Updated weights for policy 0, policy_version 31350 (0.0006) [2023-03-06 15:09:57,095][04272] Updated weights for policy 0, policy_version 31360 (0.0006) [2023-03-06 15:09:57,887][04272] Updated weights for policy 0, policy_version 31370 (0.0007) [2023-03-06 15:09:58,701][04272] Updated weights for policy 0, policy_version 31380 (0.0006) [2023-03-06 15:09:58,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12607.4). Total num frames: 32136192. Throughput: 0: 12602.2. Samples: 32115049. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:09:58,941][03942] Avg episode reward: [(0, '1373.044')] [2023-03-06 15:09:59,521][04272] Updated weights for policy 0, policy_version 31390 (0.0006) [2023-03-06 15:10:00,332][04272] Updated weights for policy 0, policy_version 31400 (0.0007) [2023-03-06 15:10:01,135][04272] Updated weights for policy 0, policy_version 31410 (0.0006) [2023-03-06 15:10:01,944][04272] Updated weights for policy 0, policy_version 31420 (0.0006) [2023-03-06 15:10:02,764][04272] Updated weights for policy 0, policy_version 31430 (0.0007) [2023-03-06 15:10:03,569][04272] Updated weights for policy 0, policy_version 31440 (0.0006) [2023-03-06 15:10:03,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12603.9). Total num frames: 32198656. Throughput: 0: 12604.7. Samples: 32190767. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:10:03,941][03942] Avg episode reward: [(0, '1005.636')] [2023-03-06 15:10:04,400][04272] Updated weights for policy 0, policy_version 31450 (0.0006) [2023-03-06 15:10:05,200][04272] Updated weights for policy 0, policy_version 31460 (0.0007) [2023-03-06 15:10:06,028][04272] Updated weights for policy 0, policy_version 31470 (0.0006) [2023-03-06 15:10:06,828][04272] Updated weights for policy 0, policy_version 31480 (0.0006) [2023-03-06 15:10:07,632][04272] Updated weights for policy 0, policy_version 31490 (0.0006) [2023-03-06 15:10:08,458][04272] Updated weights for policy 0, policy_version 31500 (0.0006) [2023-03-06 15:10:08,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.2, 300 sec: 12607.3). Total num frames: 32262144. Throughput: 0: 12608.4. Samples: 32228597. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:10:08,941][03942] Avg episode reward: [(0, '1121.958')] [2023-03-06 15:10:09,258][04272] Updated weights for policy 0, policy_version 31510 (0.0007) [2023-03-06 15:10:10,079][04272] Updated weights for policy 0, policy_version 31520 (0.0006) [2023-03-06 15:10:10,863][04272] Updated weights for policy 0, policy_version 31530 (0.0006) [2023-03-06 15:10:11,680][04272] Updated weights for policy 0, policy_version 31540 (0.0007) [2023-03-06 15:10:12,498][04272] Updated weights for policy 0, policy_version 31550 (0.0006) [2023-03-06 15:10:13,310][04272] Updated weights for policy 0, policy_version 31560 (0.0006) [2023-03-06 15:10:13,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12603.9). Total num frames: 32324608. Throughput: 0: 12612.2. Samples: 32304422. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:10:13,941][03942] Avg episode reward: [(0, '1322.581')] [2023-03-06 15:10:14,118][04272] Updated weights for policy 0, policy_version 31570 (0.0007) [2023-03-06 15:10:14,926][04272] Updated weights for policy 0, policy_version 31580 (0.0006) [2023-03-06 15:10:15,730][04272] Updated weights for policy 0, policy_version 31590 (0.0006) [2023-03-06 15:10:16,556][04272] Updated weights for policy 0, policy_version 31600 (0.0007) [2023-03-06 15:10:17,363][04272] Updated weights for policy 0, policy_version 31610 (0.0006) [2023-03-06 15:10:18,169][04272] Updated weights for policy 0, policy_version 31620 (0.0007) [2023-03-06 15:10:18,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12607.3). Total num frames: 32388096. Throughput: 0: 12614.9. Samples: 32380248. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:10:18,941][03942] Avg episode reward: [(0, '1351.489')] [2023-03-06 15:10:18,961][04272] Updated weights for policy 0, policy_version 31630 (0.0006) [2023-03-06 15:10:19,790][04272] Updated weights for policy 0, policy_version 31640 (0.0007) [2023-03-06 15:10:20,606][04272] Updated weights for policy 0, policy_version 31650 (0.0006) [2023-03-06 15:10:21,407][04272] Updated weights for policy 0, policy_version 31660 (0.0006) [2023-03-06 15:10:22,221][04272] Updated weights for policy 0, policy_version 31670 (0.0006) [2023-03-06 15:10:23,045][04272] Updated weights for policy 0, policy_version 31680 (0.0007) [2023-03-06 15:10:23,851][04272] Updated weights for policy 0, policy_version 31690 (0.0006) [2023-03-06 15:10:23,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12612.3, 300 sec: 12607.4). Total num frames: 32451584. Throughput: 0: 12613.6. Samples: 32418112. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:10:23,941][03942] Avg episode reward: [(0, '1283.956')] [2023-03-06 15:10:24,662][04272] Updated weights for policy 0, policy_version 31700 (0.0006) [2023-03-06 15:10:25,462][04272] Updated weights for policy 0, policy_version 31710 (0.0006) [2023-03-06 15:10:26,274][04272] Updated weights for policy 0, policy_version 31720 (0.0006) [2023-03-06 15:10:27,081][04272] Updated weights for policy 0, policy_version 31730 (0.0006) [2023-03-06 15:10:27,890][04272] Updated weights for policy 0, policy_version 31740 (0.0006) [2023-03-06 15:10:28,710][04272] Updated weights for policy 0, policy_version 31750 (0.0006) [2023-03-06 15:10:28,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12603.9). Total num frames: 32514048. Throughput: 0: 12618.4. Samples: 32493959. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:10:28,941][03942] Avg episode reward: [(0, '1428.497')] [2023-03-06 15:10:29,505][04272] Updated weights for policy 0, policy_version 31760 (0.0006) [2023-03-06 15:10:30,302][04272] Updated weights for policy 0, policy_version 31770 (0.0006) [2023-03-06 15:10:31,133][04272] Updated weights for policy 0, policy_version 31780 (0.0006) [2023-03-06 15:10:31,920][04272] Updated weights for policy 0, policy_version 31790 (0.0006) [2023-03-06 15:10:32,733][04272] Updated weights for policy 0, policy_version 31800 (0.0007) [2023-03-06 15:10:33,534][04272] Updated weights for policy 0, policy_version 31810 (0.0007) [2023-03-06 15:10:33,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12629.3, 300 sec: 12607.3). Total num frames: 32577536. Throughput: 0: 12621.7. Samples: 32569898. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:10:33,941][03942] Avg episode reward: [(0, '1295.784')] [2023-03-06 15:10:34,361][04272] Updated weights for policy 0, policy_version 31820 (0.0006) [2023-03-06 15:10:35,167][04272] Updated weights for policy 0, policy_version 31830 (0.0007) [2023-03-06 15:10:35,978][04272] Updated weights for policy 0, policy_version 31840 (0.0007) [2023-03-06 15:10:36,817][04272] Updated weights for policy 0, policy_version 31850 (0.0006) [2023-03-06 15:10:37,624][04272] Updated weights for policy 0, policy_version 31860 (0.0006) [2023-03-06 15:10:38,406][04272] Updated weights for policy 0, policy_version 31870 (0.0007) [2023-03-06 15:10:38,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12607.4). Total num frames: 32641024. Throughput: 0: 12624.2. Samples: 32607720. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:10:38,941][03942] Avg episode reward: [(0, '1307.905')] [2023-03-06 15:10:39,234][04272] Updated weights for policy 0, policy_version 31880 (0.0006) [2023-03-06 15:10:40,056][04272] Updated weights for policy 0, policy_version 31890 (0.0007) [2023-03-06 15:10:40,854][04272] Updated weights for policy 0, policy_version 31900 (0.0007) [2023-03-06 15:10:41,656][04272] Updated weights for policy 0, policy_version 31910 (0.0006) [2023-03-06 15:10:42,461][04272] Updated weights for policy 0, policy_version 31920 (0.0006) [2023-03-06 15:10:43,288][04272] Updated weights for policy 0, policy_version 31930 (0.0006) [2023-03-06 15:10:43,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12629.3, 300 sec: 12607.4). Total num frames: 32704512. Throughput: 0: 12636.2. Samples: 32683677. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:10:43,941][03942] Avg episode reward: [(0, '1289.609')] [2023-03-06 15:10:44,096][04272] Updated weights for policy 0, policy_version 31940 (0.0006) [2023-03-06 15:10:44,902][04272] Updated weights for policy 0, policy_version 31950 (0.0006) [2023-03-06 15:10:45,689][04272] Updated weights for policy 0, policy_version 31960 (0.0007) [2023-03-06 15:10:46,530][04272] Updated weights for policy 0, policy_version 31970 (0.0007) [2023-03-06 15:10:47,333][04272] Updated weights for policy 0, policy_version 31980 (0.0007) [2023-03-06 15:10:48,144][04272] Updated weights for policy 0, policy_version 31990 (0.0006) [2023-03-06 15:10:48,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12607.3). Total num frames: 32766976. Throughput: 0: 12636.0. Samples: 32759388. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:10:48,941][03942] Avg episode reward: [(0, '1189.524')] [2023-03-06 15:10:48,973][04272] Updated weights for policy 0, policy_version 32000 (0.0007) [2023-03-06 15:10:49,779][04272] Updated weights for policy 0, policy_version 32010 (0.0006) [2023-03-06 15:10:50,613][04272] Updated weights for policy 0, policy_version 32020 (0.0006) [2023-03-06 15:10:51,438][04272] Updated weights for policy 0, policy_version 32030 (0.0007) [2023-03-06 15:10:52,234][04272] Updated weights for policy 0, policy_version 32040 (0.0006) [2023-03-06 15:10:53,034][04272] Updated weights for policy 0, policy_version 32050 (0.0006) [2023-03-06 15:10:53,834][04272] Updated weights for policy 0, policy_version 32060 (0.0006) [2023-03-06 15:10:53,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.4, 300 sec: 12607.4). Total num frames: 32830464. Throughput: 0: 12629.1. Samples: 32796903. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:10:53,941][03942] Avg episode reward: [(0, '1197.534')] [2023-03-06 15:10:54,647][04272] Updated weights for policy 0, policy_version 32070 (0.0007) [2023-03-06 15:10:55,425][04272] Updated weights for policy 0, policy_version 32080 (0.0006) [2023-03-06 15:10:56,250][04272] Updated weights for policy 0, policy_version 32090 (0.0007) [2023-03-06 15:10:57,063][04272] Updated weights for policy 0, policy_version 32100 (0.0006) [2023-03-06 15:10:57,869][04272] Updated weights for policy 0, policy_version 32110 (0.0006) [2023-03-06 15:10:58,696][04272] Updated weights for policy 0, policy_version 32120 (0.0006) [2023-03-06 15:10:58,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12603.9). Total num frames: 32892928. Throughput: 0: 12638.1. Samples: 32873139. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:10:58,941][03942] Avg episode reward: [(0, '1243.878')] [2023-03-06 15:10:59,493][04272] Updated weights for policy 0, policy_version 32130 (0.0006) [2023-03-06 15:11:00,310][04272] Updated weights for policy 0, policy_version 32140 (0.0006) [2023-03-06 15:11:01,113][04272] Updated weights for policy 0, policy_version 32150 (0.0006) [2023-03-06 15:11:01,908][04272] Updated weights for policy 0, policy_version 32160 (0.0007) [2023-03-06 15:11:02,708][04272] Updated weights for policy 0, policy_version 32170 (0.0006) [2023-03-06 15:11:03,558][04272] Updated weights for policy 0, policy_version 32180 (0.0006) [2023-03-06 15:11:03,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12607.4). Total num frames: 32956416. Throughput: 0: 12638.4. Samples: 32948975. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:11:03,941][03942] Avg episode reward: [(0, '1288.432')] [2023-03-06 15:11:04,346][04272] Updated weights for policy 0, policy_version 32190 (0.0006) [2023-03-06 15:11:05,154][04272] Updated weights for policy 0, policy_version 32200 (0.0006) [2023-03-06 15:11:05,975][04272] Updated weights for policy 0, policy_version 32210 (0.0006) [2023-03-06 15:11:06,776][04272] Updated weights for policy 0, policy_version 32220 (0.0006) [2023-03-06 15:11:07,605][04272] Updated weights for policy 0, policy_version 32230 (0.0006) [2023-03-06 15:11:08,395][04272] Updated weights for policy 0, policy_version 32240 (0.0006) [2023-03-06 15:11:08,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12607.3). Total num frames: 33019904. Throughput: 0: 12638.4. Samples: 32986842. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:11:08,941][03942] Avg episode reward: [(0, '1211.268')] [2023-03-06 15:11:08,944][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000032246_33019904.pth... 
[2023-03-06 15:11:08,974][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000029292_29995008.pth [2023-03-06 15:11:09,206][04272] Updated weights for policy 0, policy_version 32250 (0.0006) [2023-03-06 15:11:10,033][04272] Updated weights for policy 0, policy_version 32260 (0.0007) [2023-03-06 15:11:10,830][04272] Updated weights for policy 0, policy_version 32270 (0.0006) [2023-03-06 15:11:11,666][04272] Updated weights for policy 0, policy_version 32280 (0.0006) [2023-03-06 15:11:12,468][04272] Updated weights for policy 0, policy_version 32290 (0.0006) [2023-03-06 15:11:13,268][04272] Updated weights for policy 0, policy_version 32300 (0.0006) [2023-03-06 15:11:13,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12646.4, 300 sec: 12607.3). Total num frames: 33083392. Throughput: 0: 12639.3. Samples: 33062729. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:11:13,941][03942] Avg episode reward: [(0, '1157.740')] [2023-03-06 15:11:14,059][04272] Updated weights for policy 0, policy_version 32310 (0.0006) [2023-03-06 15:11:14,891][04272] Updated weights for policy 0, policy_version 32320 (0.0006) [2023-03-06 15:11:15,690][04272] Updated weights for policy 0, policy_version 32330 (0.0006) [2023-03-06 15:11:16,521][04272] Updated weights for policy 0, policy_version 32340 (0.0007) [2023-03-06 15:11:17,334][04272] Updated weights for policy 0, policy_version 32350 (0.0006) [2023-03-06 15:11:18,143][04272] Updated weights for policy 0, policy_version 32360 (0.0007) [2023-03-06 15:11:18,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12629.4, 300 sec: 12607.3). Total num frames: 33145856. Throughput: 0: 12630.6. Samples: 33138272. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:11:18,941][03942] Avg episode reward: [(0, '1249.574')] [2023-03-06 15:11:18,981][04272] Updated weights for policy 0, policy_version 32370 (0.0006) [2023-03-06 15:11:19,773][04272] Updated weights for policy 0, policy_version 32380 (0.0006) [2023-03-06 15:11:20,595][04272] Updated weights for policy 0, policy_version 32390 (0.0006) [2023-03-06 15:11:21,424][04272] Updated weights for policy 0, policy_version 32400 (0.0007) [2023-03-06 15:11:22,247][04272] Updated weights for policy 0, policy_version 32410 (0.0006) [2023-03-06 15:11:23,043][04272] Updated weights for policy 0, policy_version 32420 (0.0006) [2023-03-06 15:11:23,869][04272] Updated weights for policy 0, policy_version 32430 (0.0007) [2023-03-06 15:11:23,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12607.4). Total num frames: 33209344. Throughput: 0: 12628.8. Samples: 33176015. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:11:23,941][03942] Avg episode reward: [(0, '1329.659')] [2023-03-06 15:11:24,690][04272] Updated weights for policy 0, policy_version 32440 (0.0006) [2023-03-06 15:11:25,511][04272] Updated weights for policy 0, policy_version 32450 (0.0006) [2023-03-06 15:11:26,317][04272] Updated weights for policy 0, policy_version 32460 (0.0006) [2023-03-06 15:11:27,120][04272] Updated weights for policy 0, policy_version 32470 (0.0006) [2023-03-06 15:11:27,940][04272] Updated weights for policy 0, policy_version 32480 (0.0007) [2023-03-06 15:11:28,773][04272] Updated weights for policy 0, policy_version 32490 (0.0007) [2023-03-06 15:11:28,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12603.9). Total num frames: 33271808. Throughput: 0: 12609.8. Samples: 33251116. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:11:28,941][03942] Avg episode reward: [(0, '1335.287')] [2023-03-06 15:11:29,573][04272] Updated weights for policy 0, policy_version 32500 (0.0006) [2023-03-06 15:11:30,379][04272] Updated weights for policy 0, policy_version 32510 (0.0006) [2023-03-06 15:11:31,206][04272] Updated weights for policy 0, policy_version 32520 (0.0006) [2023-03-06 15:11:32,009][04272] Updated weights for policy 0, policy_version 32530 (0.0006) [2023-03-06 15:11:32,799][04272] Updated weights for policy 0, policy_version 32540 (0.0006) [2023-03-06 15:11:33,622][04272] Updated weights for policy 0, policy_version 32550 (0.0006) [2023-03-06 15:11:33,940][03942] Fps is (10 sec: 12492.8, 60 sec: 12612.3, 300 sec: 12600.4). Total num frames: 33334272. Throughput: 0: 12612.0. Samples: 33326926. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:11:33,941][03942] Avg episode reward: [(0, '1261.798')] [2023-03-06 15:11:34,436][04272] Updated weights for policy 0, policy_version 32560 (0.0006) [2023-03-06 15:11:35,246][04272] Updated weights for policy 0, policy_version 32570 (0.0007) [2023-03-06 15:11:36,074][04272] Updated weights for policy 0, policy_version 32580 (0.0006) [2023-03-06 15:11:36,886][04272] Updated weights for policy 0, policy_version 32590 (0.0006) [2023-03-06 15:11:37,713][04272] Updated weights for policy 0, policy_version 32600 (0.0006) [2023-03-06 15:11:38,530][04272] Updated weights for policy 0, policy_version 32610 (0.0006) [2023-03-06 15:11:38,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12603.9). Total num frames: 33397760. Throughput: 0: 12611.7. Samples: 33364433. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:11:38,941][03942] Avg episode reward: [(0, '1271.373')] [2023-03-06 15:11:39,333][04272] Updated weights for policy 0, policy_version 32620 (0.0007) [2023-03-06 15:11:40,159][04272] Updated weights for policy 0, policy_version 32630 (0.0006) [2023-03-06 15:11:40,963][04272] Updated weights for policy 0, policy_version 32640 (0.0006) [2023-03-06 15:11:41,769][04272] Updated weights for policy 0, policy_version 32650 (0.0006) [2023-03-06 15:11:42,612][04272] Updated weights for policy 0, policy_version 32660 (0.0006) [2023-03-06 15:11:43,431][04272] Updated weights for policy 0, policy_version 32670 (0.0006) [2023-03-06 15:11:43,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12603.9). Total num frames: 33460224. Throughput: 0: 12594.9. Samples: 33439911. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:11:43,941][03942] Avg episode reward: [(0, '1238.005')] [2023-03-06 15:11:44,229][04272] Updated weights for policy 0, policy_version 32680 (0.0006) [2023-03-06 15:11:45,035][04272] Updated weights for policy 0, policy_version 32690 (0.0007) [2023-03-06 15:11:45,855][04272] Updated weights for policy 0, policy_version 32700 (0.0006) [2023-03-06 15:11:46,672][04272] Updated weights for policy 0, policy_version 32710 (0.0006) [2023-03-06 15:11:47,489][04272] Updated weights for policy 0, policy_version 32720 (0.0007) [2023-03-06 15:11:48,291][04272] Updated weights for policy 0, policy_version 32730 (0.0006) [2023-03-06 15:11:48,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12603.9). Total num frames: 33523712. Throughput: 0: 12588.4. Samples: 33515452. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:11:48,941][03942] Avg episode reward: [(0, '1232.445')] [2023-03-06 15:11:49,099][04272] Updated weights for policy 0, policy_version 32740 (0.0007) [2023-03-06 15:11:49,905][04272] Updated weights for policy 0, policy_version 32750 (0.0007) [2023-03-06 15:11:50,717][04272] Updated weights for policy 0, policy_version 32760 (0.0006) [2023-03-06 15:11:51,549][04272] Updated weights for policy 0, policy_version 32770 (0.0006) [2023-03-06 15:11:52,354][04272] Updated weights for policy 0, policy_version 32780 (0.0007) [2023-03-06 15:11:53,177][04272] Updated weights for policy 0, policy_version 32790 (0.0007) [2023-03-06 15:11:53,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12603.9). Total num frames: 33586176. Throughput: 0: 12584.3. Samples: 33553132. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:11:53,941][03942] Avg episode reward: [(0, '1280.396')] [2023-03-06 15:11:53,991][04272] Updated weights for policy 0, policy_version 32800 (0.0006) [2023-03-06 15:11:54,813][04272] Updated weights for policy 0, policy_version 32810 (0.0007) [2023-03-06 15:11:55,616][04272] Updated weights for policy 0, policy_version 32820 (0.0006) [2023-03-06 15:11:56,437][04272] Updated weights for policy 0, policy_version 32830 (0.0006) [2023-03-06 15:11:57,234][04272] Updated weights for policy 0, policy_version 32840 (0.0006) [2023-03-06 15:11:58,053][04272] Updated weights for policy 0, policy_version 32850 (0.0006) [2023-03-06 15:11:58,859][04272] Updated weights for policy 0, policy_version 32860 (0.0006) [2023-03-06 15:11:58,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12603.9). Total num frames: 33649664. Throughput: 0: 12577.5. Samples: 33628717. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:11:58,941][03942] Avg episode reward: [(0, '1374.330')] [2023-03-06 15:11:59,693][04272] Updated weights for policy 0, policy_version 32870 (0.0007) [2023-03-06 15:12:00,491][04272] Updated weights for policy 0, policy_version 32880 (0.0007) [2023-03-06 15:12:01,298][04272] Updated weights for policy 0, policy_version 32890 (0.0006) [2023-03-06 15:12:02,093][04272] Updated weights for policy 0, policy_version 32900 (0.0006) [2023-03-06 15:12:02,921][04272] Updated weights for policy 0, policy_version 32910 (0.0006) [2023-03-06 15:12:03,745][04272] Updated weights for policy 0, policy_version 32920 (0.0006) [2023-03-06 15:12:03,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12600.4). Total num frames: 33712128. Throughput: 0: 12578.4. Samples: 33704298. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:12:03,941][03942] Avg episode reward: [(0, '1359.139')] [2023-03-06 15:12:04,545][04272] Updated weights for policy 0, policy_version 32930 (0.0005) [2023-03-06 15:12:05,360][04272] Updated weights for policy 0, policy_version 32940 (0.0007) [2023-03-06 15:12:06,164][04272] Updated weights for policy 0, policy_version 32950 (0.0006) [2023-03-06 15:12:06,968][04272] Updated weights for policy 0, policy_version 32960 (0.0007) [2023-03-06 15:12:07,781][04272] Updated weights for policy 0, policy_version 32970 (0.0006) [2023-03-06 15:12:08,576][04272] Updated weights for policy 0, policy_version 32980 (0.0006) [2023-03-06 15:12:08,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12603.9). Total num frames: 33775616. Throughput: 0: 12581.4. Samples: 33742177. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:12:08,941][03942] Avg episode reward: [(0, '1249.361')] [2023-03-06 15:12:09,403][04272] Updated weights for policy 0, policy_version 32990 (0.0006) [2023-03-06 15:12:10,211][04272] Updated weights for policy 0, policy_version 33000 (0.0006) [2023-03-06 15:12:11,023][04272] Updated weights for policy 0, policy_version 33010 (0.0006) [2023-03-06 15:12:11,818][04272] Updated weights for policy 0, policy_version 33020 (0.0006) [2023-03-06 15:12:12,633][04272] Updated weights for policy 0, policy_version 33030 (0.0006) [2023-03-06 15:12:13,445][04272] Updated weights for policy 0, policy_version 33040 (0.0006) [2023-03-06 15:12:13,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12595.2, 300 sec: 12603.9). Total num frames: 33839104. Throughput: 0: 12603.8. Samples: 33818289. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:12:13,941][03942] Avg episode reward: [(0, '1108.585')] [2023-03-06 15:12:14,248][04272] Updated weights for policy 0, policy_version 33050 (0.0006) [2023-03-06 15:12:15,071][04272] Updated weights for policy 0, policy_version 33060 (0.0006) [2023-03-06 15:12:15,892][04272] Updated weights for policy 0, policy_version 33070 (0.0007) [2023-03-06 15:12:16,687][04272] Updated weights for policy 0, policy_version 33080 (0.0006) [2023-03-06 15:12:17,520][04272] Updated weights for policy 0, policy_version 33090 (0.0006) [2023-03-06 15:12:18,332][04272] Updated weights for policy 0, policy_version 33100 (0.0006) [2023-03-06 15:12:18,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12603.9). Total num frames: 33901568. Throughput: 0: 12595.7. Samples: 33893734. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:12:18,941][03942] Avg episode reward: [(0, '1238.405')] [2023-03-06 15:12:19,134][04272] Updated weights for policy 0, policy_version 33110 (0.0006) [2023-03-06 15:12:19,952][04272] Updated weights for policy 0, policy_version 33120 (0.0007) [2023-03-06 15:12:20,771][04272] Updated weights for policy 0, policy_version 33130 (0.0007) [2023-03-06 15:12:21,582][04272] Updated weights for policy 0, policy_version 33140 (0.0006) [2023-03-06 15:12:22,384][04272] Updated weights for policy 0, policy_version 33150 (0.0006) [2023-03-06 15:12:23,207][04272] Updated weights for policy 0, policy_version 33160 (0.0006) [2023-03-06 15:12:23,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12603.9). Total num frames: 33965056. Throughput: 0: 12601.6. Samples: 33931502. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:12:23,941][03942] Avg episode reward: [(0, '1269.025')] [2023-03-06 15:12:24,012][04272] Updated weights for policy 0, policy_version 33170 (0.0007) [2023-03-06 15:12:24,828][04272] Updated weights for policy 0, policy_version 33180 (0.0006) [2023-03-06 15:12:25,633][04272] Updated weights for policy 0, policy_version 33190 (0.0006) [2023-03-06 15:12:26,429][04272] Updated weights for policy 0, policy_version 33200 (0.0006) [2023-03-06 15:12:27,239][04272] Updated weights for policy 0, policy_version 33210 (0.0006) [2023-03-06 15:12:28,038][04272] Updated weights for policy 0, policy_version 33220 (0.0006) [2023-03-06 15:12:28,863][04272] Updated weights for policy 0, policy_version 33230 (0.0007) [2023-03-06 15:12:28,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12603.9). Total num frames: 34027520. Throughput: 0: 12616.4. Samples: 34007646. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:12:28,941][03942] Avg episode reward: [(0, '1093.974')] [2023-03-06 15:12:29,683][04272] Updated weights for policy 0, policy_version 33240 (0.0006) [2023-03-06 15:12:30,486][04272] Updated weights for policy 0, policy_version 33250 (0.0006) [2023-03-06 15:12:31,301][04272] Updated weights for policy 0, policy_version 33260 (0.0006) [2023-03-06 15:12:32,097][04272] Updated weights for policy 0, policy_version 33270 (0.0006) [2023-03-06 15:12:32,923][04272] Updated weights for policy 0, policy_version 33280 (0.0007) [2023-03-06 15:12:33,727][04272] Updated weights for policy 0, policy_version 33290 (0.0006) [2023-03-06 15:12:33,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12607.4). Total num frames: 34091008. Throughput: 0: 12620.9. Samples: 34083391. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:12:33,941][03942] Avg episode reward: [(0, '1013.773')] [2023-03-06 15:12:34,527][04272] Updated weights for policy 0, policy_version 33300 (0.0006) [2023-03-06 15:12:35,345][04272] Updated weights for policy 0, policy_version 33310 (0.0006) [2023-03-06 15:12:36,141][04272] Updated weights for policy 0, policy_version 33320 (0.0006) [2023-03-06 15:12:36,926][04272] Updated weights for policy 0, policy_version 33330 (0.0007) [2023-03-06 15:12:37,736][04272] Updated weights for policy 0, policy_version 33340 (0.0008) [2023-03-06 15:12:38,544][04272] Updated weights for policy 0, policy_version 33350 (0.0006) [2023-03-06 15:12:38,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12612.3, 300 sec: 12607.3). Total num frames: 34154496. Throughput: 0: 12628.4. Samples: 34121409. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:12:38,952][03942] Avg episode reward: [(0, '1139.724')] [2023-03-06 15:12:39,357][04272] Updated weights for policy 0, policy_version 33360 (0.0006) [2023-03-06 15:12:40,182][04272] Updated weights for policy 0, policy_version 33370 (0.0006) [2023-03-06 15:12:41,009][04272] Updated weights for policy 0, policy_version 33380 (0.0007) [2023-03-06 15:12:41,804][04272] Updated weights for policy 0, policy_version 33390 (0.0006) [2023-03-06 15:12:42,618][04272] Updated weights for policy 0, policy_version 33400 (0.0006) [2023-03-06 15:12:43,453][04272] Updated weights for policy 0, policy_version 33410 (0.0006) [2023-03-06 15:12:43,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12629.3, 300 sec: 12610.8). Total num frames: 34217984. Throughput: 0: 12633.1. Samples: 34197205. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:12:43,941][03942] Avg episode reward: [(0, '1231.072')] [2023-03-06 15:12:44,262][04272] Updated weights for policy 0, policy_version 33420 (0.0006) [2023-03-06 15:12:45,071][04272] Updated weights for policy 0, policy_version 33430 (0.0006) [2023-03-06 15:12:45,870][04272] Updated weights for policy 0, policy_version 33440 (0.0007) [2023-03-06 15:12:46,670][04272] Updated weights for policy 0, policy_version 33450 (0.0006) [2023-03-06 15:12:47,490][04272] Updated weights for policy 0, policy_version 33460 (0.0006) [2023-03-06 15:12:48,297][04272] Updated weights for policy 0, policy_version 33470 (0.0006) [2023-03-06 15:12:48,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12610.8). Total num frames: 34280448. Throughput: 0: 12640.4. Samples: 34273115. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:12:48,941][03942] Avg episode reward: [(0, '1091.592')] [2023-03-06 15:12:49,108][04272] Updated weights for policy 0, policy_version 33480 (0.0007) [2023-03-06 15:12:49,917][04272] Updated weights for policy 0, policy_version 33490 (0.0006) [2023-03-06 15:12:50,737][04272] Updated weights for policy 0, policy_version 33500 (0.0007) [2023-03-06 15:12:51,557][04272] Updated weights for policy 0, policy_version 33510 (0.0006) [2023-03-06 15:12:52,359][04272] Updated weights for policy 0, policy_version 33520 (0.0006) [2023-03-06 15:12:53,156][04272] Updated weights for policy 0, policy_version 33530 (0.0006) [2023-03-06 15:12:53,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12610.8). Total num frames: 34343936. Throughput: 0: 12640.4. Samples: 34310997. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:12:53,941][03942] Avg episode reward: [(0, '1026.739')] [2023-03-06 15:12:53,975][04272] Updated weights for policy 0, policy_version 33540 (0.0006) [2023-03-06 15:12:54,775][04272] Updated weights for policy 0, policy_version 33550 (0.0006) [2023-03-06 15:12:55,598][04272] Updated weights for policy 0, policy_version 33560 (0.0006) [2023-03-06 15:12:56,397][04272] Updated weights for policy 0, policy_version 33570 (0.0006) [2023-03-06 15:12:57,220][04272] Updated weights for policy 0, policy_version 33580 (0.0006) [2023-03-06 15:12:58,046][04272] Updated weights for policy 0, policy_version 33590 (0.0008) [2023-03-06 15:12:58,854][04272] Updated weights for policy 0, policy_version 33600 (0.0006) [2023-03-06 15:12:58,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.2, 300 sec: 12607.3). Total num frames: 34406400. Throughput: 0: 12627.7. Samples: 34386535. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:12:58,941][03942] Avg episode reward: [(0, '1075.029')] [2023-03-06 15:12:59,656][04272] Updated weights for policy 0, policy_version 33610 (0.0007) [2023-03-06 15:13:00,489][04272] Updated weights for policy 0, policy_version 33620 (0.0007) [2023-03-06 15:13:01,305][04272] Updated weights for policy 0, policy_version 33630 (0.0006) [2023-03-06 15:13:02,122][04272] Updated weights for policy 0, policy_version 33640 (0.0007) [2023-03-06 15:13:02,933][04272] Updated weights for policy 0, policy_version 33650 (0.0006) [2023-03-06 15:13:03,742][04272] Updated weights for policy 0, policy_version 33660 (0.0006) [2023-03-06 15:13:03,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12610.8). Total num frames: 34469888. Throughput: 0: 12626.4. Samples: 34461922. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:13:03,941][03942] Avg episode reward: [(0, '1120.713')] [2023-03-06 15:13:04,562][04272] Updated weights for policy 0, policy_version 33670 (0.0006) [2023-03-06 15:13:05,373][04272] Updated weights for policy 0, policy_version 33680 (0.0006) [2023-03-06 15:13:06,174][04272] Updated weights for policy 0, policy_version 33690 (0.0006) [2023-03-06 15:13:06,993][04272] Updated weights for policy 0, policy_version 33700 (0.0006) [2023-03-06 15:13:07,793][04272] Updated weights for policy 0, policy_version 33710 (0.0006) [2023-03-06 15:13:08,601][04272] Updated weights for policy 0, policy_version 33720 (0.0006) [2023-03-06 15:13:08,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12614.3). Total num frames: 34533376. Throughput: 0: 12628.4. Samples: 34499779. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:13:08,941][03942] Avg episode reward: [(0, '1142.364')] [2023-03-06 15:13:08,944][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000033724_34533376.pth... 
[2023-03-06 15:13:08,976][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000030767_31505408.pth [2023-03-06 15:13:09,416][04272] Updated weights for policy 0, policy_version 33730 (0.0007) [2023-03-06 15:13:10,214][04272] Updated weights for policy 0, policy_version 33740 (0.0006) [2023-03-06 15:13:11,042][04272] Updated weights for policy 0, policy_version 33750 (0.0007) [2023-03-06 15:13:11,845][04272] Updated weights for policy 0, policy_version 33760 (0.0006) [2023-03-06 15:13:12,652][04272] Updated weights for policy 0, policy_version 33770 (0.0007) [2023-03-06 15:13:13,481][04272] Updated weights for policy 0, policy_version 33780 (0.0007) [2023-03-06 15:13:13,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12610.8). Total num frames: 34595840. Throughput: 0: 12625.8. Samples: 34575807. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:13:13,941][03942] Avg episode reward: [(0, '1235.992')] [2023-03-06 15:13:14,277][04272] Updated weights for policy 0, policy_version 33790 (0.0006) [2023-03-06 15:13:15,087][04272] Updated weights for policy 0, policy_version 33800 (0.0007) [2023-03-06 15:13:15,912][04272] Updated weights for policy 0, policy_version 33810 (0.0007) [2023-03-06 15:13:16,713][04272] Updated weights for policy 0, policy_version 33820 (0.0006) [2023-03-06 15:13:17,533][04272] Updated weights for policy 0, policy_version 33830 (0.0006) [2023-03-06 15:13:18,340][04272] Updated weights for policy 0, policy_version 33840 (0.0007) [2023-03-06 15:13:18,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12629.3, 300 sec: 12610.8). Total num frames: 34659328. Throughput: 0: 12622.4. Samples: 34651398. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:13:18,941][03942] Avg episode reward: [(0, '1210.000')] [2023-03-06 15:13:19,144][04272] Updated weights for policy 0, policy_version 33850 (0.0006) [2023-03-06 15:13:19,962][04272] Updated weights for policy 0, policy_version 33860 (0.0006) [2023-03-06 15:13:20,789][04272] Updated weights for policy 0, policy_version 33870 (0.0006) [2023-03-06 15:13:21,571][04272] Updated weights for policy 0, policy_version 33880 (0.0006) [2023-03-06 15:13:22,388][04272] Updated weights for policy 0, policy_version 33890 (0.0006) [2023-03-06 15:13:23,198][04272] Updated weights for policy 0, policy_version 33900 (0.0006) [2023-03-06 15:13:23,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12629.3, 300 sec: 12614.3). Total num frames: 34722816. Throughput: 0: 12618.0. Samples: 34689220. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:13:23,941][03942] Avg episode reward: [(0, '1158.466')] [2023-03-06 15:13:23,999][04272] Updated weights for policy 0, policy_version 33910 (0.0006) [2023-03-06 15:13:24,822][04272] Updated weights for policy 0, policy_version 33920 (0.0007) [2023-03-06 15:13:25,633][04272] Updated weights for policy 0, policy_version 33930 (0.0006) [2023-03-06 15:13:26,440][04272] Updated weights for policy 0, policy_version 33940 (0.0006) [2023-03-06 15:13:27,258][04272] Updated weights for policy 0, policy_version 33950 (0.0006) [2023-03-06 15:13:28,062][04272] Updated weights for policy 0, policy_version 33960 (0.0006) [2023-03-06 15:13:28,864][04272] Updated weights for policy 0, policy_version 33970 (0.0006) [2023-03-06 15:13:28,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12646.4, 300 sec: 12614.3). Total num frames: 34786304. Throughput: 0: 12621.1. Samples: 34765155. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:13:28,941][03942] Avg episode reward: [(0, '1213.015')] [2023-03-06 15:13:29,668][04272] Updated weights for policy 0, policy_version 33980 (0.0007) [2023-03-06 15:13:30,486][04272] Updated weights for policy 0, policy_version 33990 (0.0007) [2023-03-06 15:13:31,289][04272] Updated weights for policy 0, policy_version 34000 (0.0006) [2023-03-06 15:13:32,110][04272] Updated weights for policy 0, policy_version 34010 (0.0006) [2023-03-06 15:13:32,922][04272] Updated weights for policy 0, policy_version 34020 (0.0006) [2023-03-06 15:13:33,714][04272] Updated weights for policy 0, policy_version 34030 (0.0006) [2023-03-06 15:13:33,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12614.3). Total num frames: 34848768. Throughput: 0: 12619.3. Samples: 34840986. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:13:33,941][03942] Avg episode reward: [(0, '1200.645')] [2023-03-06 15:13:34,546][04272] Updated weights for policy 0, policy_version 34040 (0.0006) [2023-03-06 15:13:35,335][04272] Updated weights for policy 0, policy_version 34050 (0.0007) [2023-03-06 15:13:36,157][04272] Updated weights for policy 0, policy_version 34060 (0.0007) [2023-03-06 15:13:36,954][04272] Updated weights for policy 0, policy_version 34070 (0.0006) [2023-03-06 15:13:37,799][04272] Updated weights for policy 0, policy_version 34080 (0.0006) [2023-03-06 15:13:38,596][04272] Updated weights for policy 0, policy_version 34090 (0.0006) [2023-03-06 15:13:38,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12614.3). Total num frames: 34912256. Throughput: 0: 12621.8. Samples: 34878977. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:13:38,941][03942] Avg episode reward: [(0, '1205.350')] [2023-03-06 15:13:39,419][04272] Updated weights for policy 0, policy_version 34100 (0.0006) [2023-03-06 15:13:40,224][04272] Updated weights for policy 0, policy_version 34110 (0.0007) [2023-03-06 15:13:41,022][04272] Updated weights for policy 0, policy_version 34120 (0.0006) [2023-03-06 15:13:41,829][04272] Updated weights for policy 0, policy_version 34130 (0.0006) [2023-03-06 15:13:42,628][04272] Updated weights for policy 0, policy_version 34140 (0.0006) [2023-03-06 15:13:43,423][04272] Updated weights for policy 0, policy_version 34150 (0.0006) [2023-03-06 15:13:43,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12617.8). Total num frames: 34975744. Throughput: 0: 12628.8. Samples: 34954829. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:13:43,941][03942] Avg episode reward: [(0, '1124.831')] [2023-03-06 15:13:44,225][04272] Updated weights for policy 0, policy_version 34160 (0.0006) [2023-03-06 15:13:45,038][04272] Updated weights for policy 0, policy_version 34170 (0.0006) [2023-03-06 15:13:45,839][04272] Updated weights for policy 0, policy_version 34180 (0.0007) [2023-03-06 15:13:46,655][04272] Updated weights for policy 0, policy_version 34190 (0.0007) [2023-03-06 15:13:47,465][04272] Updated weights for policy 0, policy_version 34200 (0.0006) [2023-03-06 15:13:48,263][04272] Updated weights for policy 0, policy_version 34210 (0.0006) [2023-03-06 15:13:48,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12646.4, 300 sec: 12617.8). Total num frames: 35039232. Throughput: 0: 12648.8. Samples: 35031118. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:13:48,941][03942] Avg episode reward: [(0, '1241.476')] [2023-03-06 15:13:49,089][04272] Updated weights for policy 0, policy_version 34220 (0.0006) [2023-03-06 15:13:49,885][04272] Updated weights for policy 0, policy_version 34230 (0.0006) [2023-03-06 15:13:50,696][04272] Updated weights for policy 0, policy_version 34240 (0.0006) [2023-03-06 15:13:51,540][04272] Updated weights for policy 0, policy_version 34250 (0.0006) [2023-03-06 15:13:52,339][04272] Updated weights for policy 0, policy_version 34260 (0.0006) [2023-03-06 15:13:53,152][04272] Updated weights for policy 0, policy_version 34270 (0.0007) [2023-03-06 15:13:53,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12617.8). Total num frames: 35101696. Throughput: 0: 12645.2. Samples: 35068812. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:13:53,941][03942] Avg episode reward: [(0, '1208.987')] [2023-03-06 15:13:53,974][04272] Updated weights for policy 0, policy_version 34280 (0.0006) [2023-03-06 15:13:54,764][04272] Updated weights for policy 0, policy_version 34290 (0.0007) [2023-03-06 15:13:55,587][04272] Updated weights for policy 0, policy_version 34300 (0.0006) [2023-03-06 15:13:56,413][04272] Updated weights for policy 0, policy_version 34310 (0.0006) [2023-03-06 15:13:57,220][04272] Updated weights for policy 0, policy_version 34320 (0.0006) [2023-03-06 15:13:58,028][04272] Updated weights for policy 0, policy_version 34330 (0.0006) [2023-03-06 15:13:58,854][04272] Updated weights for policy 0, policy_version 34340 (0.0006) [2023-03-06 15:13:58,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12646.4, 300 sec: 12617.8). Total num frames: 35165184. Throughput: 0: 12634.7. Samples: 35144367. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:13:58,941][03942] Avg episode reward: [(0, '1332.146')] [2023-03-06 15:13:59,674][04272] Updated weights for policy 0, policy_version 34350 (0.0006) [2023-03-06 15:14:00,478][04272] Updated weights for policy 0, policy_version 34360 (0.0006) [2023-03-06 15:14:01,299][04272] Updated weights for policy 0, policy_version 34370 (0.0005) [2023-03-06 15:14:02,098][04272] Updated weights for policy 0, policy_version 34380 (0.0007) [2023-03-06 15:14:02,921][04272] Updated weights for policy 0, policy_version 34390 (0.0007) [2023-03-06 15:14:03,712][04272] Updated weights for policy 0, policy_version 34400 (0.0006) [2023-03-06 15:14:03,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12617.8). Total num frames: 35227648. Throughput: 0: 12633.9. Samples: 35219925. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:14:03,941][03942] Avg episode reward: [(0, '1086.165')] [2023-03-06 15:14:04,523][04272] Updated weights for policy 0, policy_version 34410 (0.0007) [2023-03-06 15:14:05,337][04272] Updated weights for policy 0, policy_version 34420 (0.0006) [2023-03-06 15:14:06,131][04272] Updated weights for policy 0, policy_version 34430 (0.0007) [2023-03-06 15:14:06,942][04272] Updated weights for policy 0, policy_version 34440 (0.0006) [2023-03-06 15:14:07,767][04272] Updated weights for policy 0, policy_version 34450 (0.0006) [2023-03-06 15:14:08,570][04272] Updated weights for policy 0, policy_version 34460 (0.0006) [2023-03-06 15:14:08,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12629.3, 300 sec: 12617.8). Total num frames: 35291136. Throughput: 0: 12640.3. Samples: 35258034. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:14:08,941][03942] Avg episode reward: [(0, '1178.433')] [2023-03-06 15:14:09,381][04272] Updated weights for policy 0, policy_version 34470 (0.0006) [2023-03-06 15:14:10,196][04272] Updated weights for policy 0, policy_version 34480 (0.0006) [2023-03-06 15:14:11,016][04272] Updated weights for policy 0, policy_version 34490 (0.0006) [2023-03-06 15:14:11,825][04272] Updated weights for policy 0, policy_version 34500 (0.0006) [2023-03-06 15:14:12,643][04272] Updated weights for policy 0, policy_version 34510 (0.0006) [2023-03-06 15:14:13,479][04272] Updated weights for policy 0, policy_version 34520 (0.0008) [2023-03-06 15:14:13,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12617.8). Total num frames: 35353600. Throughput: 0: 12635.2. Samples: 35333738. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:14:13,941][03942] Avg episode reward: [(0, '1195.590')] [2023-03-06 15:14:14,283][04272] Updated weights for policy 0, policy_version 34530 (0.0006) [2023-03-06 15:14:15,089][04272] Updated weights for policy 0, policy_version 34540 (0.0006) [2023-03-06 15:14:15,892][04272] Updated weights for policy 0, policy_version 34550 (0.0006) [2023-03-06 15:14:16,711][04272] Updated weights for policy 0, policy_version 34560 (0.0006) [2023-03-06 15:14:17,533][04272] Updated weights for policy 0, policy_version 34570 (0.0006) [2023-03-06 15:14:18,353][04272] Updated weights for policy 0, policy_version 34580 (0.0006) [2023-03-06 15:14:18,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12629.3, 300 sec: 12617.8). Total num frames: 35417088. Throughput: 0: 12627.1. Samples: 35409207. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:14:18,941][03942] Avg episode reward: [(0, '1095.285')] [2023-03-06 15:14:19,157][04272] Updated weights for policy 0, policy_version 34590 (0.0006) [2023-03-06 15:14:19,953][04272] Updated weights for policy 0, policy_version 34600 (0.0006) [2023-03-06 15:14:20,764][04272] Updated weights for policy 0, policy_version 34610 (0.0007) [2023-03-06 15:14:21,577][04272] Updated weights for policy 0, policy_version 34620 (0.0006) [2023-03-06 15:14:22,389][04272] Updated weights for policy 0, policy_version 34630 (0.0006) [2023-03-06 15:14:23,203][04272] Updated weights for policy 0, policy_version 34640 (0.0006) [2023-03-06 15:14:23,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12629.3, 300 sec: 12621.2). Total num frames: 35480576. Throughput: 0: 12627.7. Samples: 35447224. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:14:23,941][03942] Avg episode reward: [(0, '1165.369')] [2023-03-06 15:14:24,025][04272] Updated weights for policy 0, policy_version 34650 (0.0007) [2023-03-06 15:14:24,830][04272] Updated weights for policy 0, policy_version 34660 (0.0006) [2023-03-06 15:14:25,647][04272] Updated weights for policy 0, policy_version 34670 (0.0006) [2023-03-06 15:14:26,452][04272] Updated weights for policy 0, policy_version 34680 (0.0005) [2023-03-06 15:14:27,261][04272] Updated weights for policy 0, policy_version 34690 (0.0007) [2023-03-06 15:14:28,077][04272] Updated weights for policy 0, policy_version 34700 (0.0006) [2023-03-06 15:14:28,890][04272] Updated weights for policy 0, policy_version 34710 (0.0006) [2023-03-06 15:14:28,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.2, 300 sec: 12621.2). Total num frames: 35543040. Throughput: 0: 12619.7. Samples: 35522716. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:14:28,941][03942] Avg episode reward: [(0, '1192.818')] [2023-03-06 15:14:29,694][04272] Updated weights for policy 0, policy_version 34720 (0.0007) [2023-03-06 15:14:30,517][04272] Updated weights for policy 0, policy_version 34730 (0.0006) [2023-03-06 15:14:31,324][04272] Updated weights for policy 0, policy_version 34740 (0.0006) [2023-03-06 15:14:32,135][04272] Updated weights for policy 0, policy_version 34750 (0.0006) [2023-03-06 15:14:32,940][04272] Updated weights for policy 0, policy_version 34760 (0.0006) [2023-03-06 15:14:33,769][04272] Updated weights for policy 0, policy_version 34770 (0.0006) [2023-03-06 15:14:33,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12621.2). Total num frames: 35606528. Throughput: 0: 12606.3. Samples: 35598400. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:14:33,941][03942] Avg episode reward: [(0, '1175.457')] [2023-03-06 15:14:34,576][04272] Updated weights for policy 0, policy_version 34780 (0.0006) [2023-03-06 15:14:35,382][04272] Updated weights for policy 0, policy_version 34790 (0.0006) [2023-03-06 15:14:36,205][04272] Updated weights for policy 0, policy_version 34800 (0.0007) [2023-03-06 15:14:37,033][04272] Updated weights for policy 0, policy_version 34810 (0.0007) [2023-03-06 15:14:37,841][04272] Updated weights for policy 0, policy_version 34820 (0.0006) [2023-03-06 15:14:38,627][04272] Updated weights for policy 0, policy_version 34830 (0.0006) [2023-03-06 15:14:38,940][03942] Fps is (10 sec: 12697.8, 60 sec: 12629.3, 300 sec: 12621.2). Total num frames: 35670016. Throughput: 0: 12606.8. Samples: 35636118. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:14:38,941][03942] Avg episode reward: [(0, '1104.832')] [2023-03-06 15:14:39,419][04272] Updated weights for policy 0, policy_version 34840 (0.0007) [2023-03-06 15:14:40,247][04272] Updated weights for policy 0, policy_version 34850 (0.0006) [2023-03-06 15:14:41,058][04272] Updated weights for policy 0, policy_version 34860 (0.0005) [2023-03-06 15:14:41,886][04272] Updated weights for policy 0, policy_version 34870 (0.0007) [2023-03-06 15:14:42,696][04272] Updated weights for policy 0, policy_version 34880 (0.0006) [2023-03-06 15:14:43,501][04272] Updated weights for policy 0, policy_version 34890 (0.0007) [2023-03-06 15:14:43,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12617.8). Total num frames: 35732480. Throughput: 0: 12611.8. Samples: 35711898. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:14:43,941][03942] Avg episode reward: [(0, '1151.129')] [2023-03-06 15:14:44,312][04272] Updated weights for policy 0, policy_version 34900 (0.0007) [2023-03-06 15:14:45,128][04272] Updated weights for policy 0, policy_version 34910 (0.0006) [2023-03-06 15:14:45,918][04272] Updated weights for policy 0, policy_version 34920 (0.0006) [2023-03-06 15:14:46,739][04272] Updated weights for policy 0, policy_version 34930 (0.0006) [2023-03-06 15:14:47,547][04272] Updated weights for policy 0, policy_version 34940 (0.0005) [2023-03-06 15:14:48,358][04272] Updated weights for policy 0, policy_version 34950 (0.0006) [2023-03-06 15:14:48,941][03942] Fps is (10 sec: 12595.0, 60 sec: 12612.3, 300 sec: 12621.2). Total num frames: 35795968. Throughput: 0: 12617.8. Samples: 35787727. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:14:48,941][03942] Avg episode reward: [(0, '1111.192')] [2023-03-06 15:14:49,180][04272] Updated weights for policy 0, policy_version 34960 (0.0006) [2023-03-06 15:14:49,992][04272] Updated weights for policy 0, policy_version 34970 (0.0006) [2023-03-06 15:14:50,794][04272] Updated weights for policy 0, policy_version 34980 (0.0006) [2023-03-06 15:14:51,611][04272] Updated weights for policy 0, policy_version 34990 (0.0006) [2023-03-06 15:14:52,436][04272] Updated weights for policy 0, policy_version 35000 (0.0007) [2023-03-06 15:14:53,253][04272] Updated weights for policy 0, policy_version 35010 (0.0006) [2023-03-06 15:14:53,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12617.8). Total num frames: 35858432. Throughput: 0: 12609.9. Samples: 35825478. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:14:53,941][03942] Avg episode reward: [(0, '990.763')] [2023-03-06 15:14:54,064][04272] Updated weights for policy 0, policy_version 35020 (0.0007) [2023-03-06 15:14:54,871][04272] Updated weights for policy 0, policy_version 35030 (0.0006) [2023-03-06 15:14:55,693][04272] Updated weights for policy 0, policy_version 35040 (0.0006) [2023-03-06 15:14:56,511][04272] Updated weights for policy 0, policy_version 35050 (0.0006) [2023-03-06 15:14:57,295][04272] Updated weights for policy 0, policy_version 35060 (0.0006) [2023-03-06 15:14:58,115][04272] Updated weights for policy 0, policy_version 35070 (0.0007) [2023-03-06 15:14:58,926][04272] Updated weights for policy 0, policy_version 35080 (0.0007) [2023-03-06 15:14:58,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.2, 300 sec: 12621.2). Total num frames: 35921920. Throughput: 0: 12608.3. Samples: 35901110. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:14:58,941][03942] Avg episode reward: [(0, '1215.205')] [2023-03-06 15:14:59,741][04272] Updated weights for policy 0, policy_version 35090 (0.0006) [2023-03-06 15:15:00,557][04272] Updated weights for policy 0, policy_version 35100 (0.0006) [2023-03-06 15:15:01,356][04272] Updated weights for policy 0, policy_version 35110 (0.0006) [2023-03-06 15:15:02,192][04272] Updated weights for policy 0, policy_version 35120 (0.0007) [2023-03-06 15:15:02,993][04272] Updated weights for policy 0, policy_version 35130 (0.0007) [2023-03-06 15:15:03,799][04272] Updated weights for policy 0, policy_version 35140 (0.0006) [2023-03-06 15:15:03,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12617.8). Total num frames: 35984384. Throughput: 0: 12611.1. Samples: 35976704. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:15:03,941][03942] Avg episode reward: [(0, '1125.856')] [2023-03-06 15:15:04,611][04272] Updated weights for policy 0, policy_version 35150 (0.0006) [2023-03-06 15:15:05,417][04272] Updated weights for policy 0, policy_version 35160 (0.0006) [2023-03-06 15:15:06,222][04272] Updated weights for policy 0, policy_version 35170 (0.0006) [2023-03-06 15:15:07,034][04272] Updated weights for policy 0, policy_version 35180 (0.0006) [2023-03-06 15:15:07,873][04272] Updated weights for policy 0, policy_version 35190 (0.0006) [2023-03-06 15:15:08,679][04272] Updated weights for policy 0, policy_version 35200 (0.0008) [2023-03-06 15:15:08,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.2, 300 sec: 12621.2). Total num frames: 36047872. Throughput: 0: 12611.8. Samples: 36014756. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:15:08,941][03942] Avg episode reward: [(0, '1168.886')] [2023-03-06 15:15:08,945][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000035203_36047872.pth... 
[2023-03-06 15:15:08,977][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000032246_33019904.pth [2023-03-06 15:15:09,498][04272] Updated weights for policy 0, policy_version 35210 (0.0006) [2023-03-06 15:15:10,318][04272] Updated weights for policy 0, policy_version 35220 (0.0006) [2023-03-06 15:15:11,128][04272] Updated weights for policy 0, policy_version 35230 (0.0007) [2023-03-06 15:15:11,935][04272] Updated weights for policy 0, policy_version 35240 (0.0006) [2023-03-06 15:15:12,773][04272] Updated weights for policy 0, policy_version 35250 (0.0006) [2023-03-06 15:15:13,567][04272] Updated weights for policy 0, policy_version 35260 (0.0007) [2023-03-06 15:15:13,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12617.8). Total num frames: 36110336. Throughput: 0: 12606.5. Samples: 36090008. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:15:13,941][03942] Avg episode reward: [(0, '1263.794')] [2023-03-06 15:15:14,397][04272] Updated weights for policy 0, policy_version 35270 (0.0007) [2023-03-06 15:15:15,199][04272] Updated weights for policy 0, policy_version 35280 (0.0007) [2023-03-06 15:15:15,994][04272] Updated weights for policy 0, policy_version 35290 (0.0007) [2023-03-06 15:15:16,805][04272] Updated weights for policy 0, policy_version 35300 (0.0007) [2023-03-06 15:15:17,634][04272] Updated weights for policy 0, policy_version 35310 (0.0006) [2023-03-06 15:15:18,440][04272] Updated weights for policy 0, policy_version 35320 (0.0007) [2023-03-06 15:15:18,940][03942] Fps is (10 sec: 12595.4, 60 sec: 12612.3, 300 sec: 12617.8). Total num frames: 36173824. Throughput: 0: 12603.0. Samples: 36165532. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:15:18,941][03942] Avg episode reward: [(0, '1219.222')] [2023-03-06 15:15:19,254][04272] Updated weights for policy 0, policy_version 35330 (0.0006) [2023-03-06 15:15:20,087][04272] Updated weights for policy 0, policy_version 35340 (0.0007) [2023-03-06 15:15:20,901][04272] Updated weights for policy 0, policy_version 35350 (0.0006) [2023-03-06 15:15:21,724][04272] Updated weights for policy 0, policy_version 35360 (0.0006) [2023-03-06 15:15:22,541][04272] Updated weights for policy 0, policy_version 35370 (0.0006) [2023-03-06 15:15:23,335][04272] Updated weights for policy 0, policy_version 35380 (0.0006) [2023-03-06 15:15:23,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12617.8). Total num frames: 36236288. Throughput: 0: 12600.1. Samples: 36203123. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:15:23,941][03942] Avg episode reward: [(0, '1096.815')] [2023-03-06 15:15:24,149][04272] Updated weights for policy 0, policy_version 35390 (0.0006) [2023-03-06 15:15:24,973][04272] Updated weights for policy 0, policy_version 35400 (0.0006) [2023-03-06 15:15:25,786][04221] KL-divergence is very high: 18168.8223 [2023-03-06 15:15:25,794][04272] Updated weights for policy 0, policy_version 35410 (0.0006) [2023-03-06 15:15:26,595][04272] Updated weights for policy 0, policy_version 35420 (0.0006) [2023-03-06 15:15:27,424][04272] Updated weights for policy 0, policy_version 35430 (0.0006) [2023-03-06 15:15:28,233][04272] Updated weights for policy 0, policy_version 35440 (0.0006) [2023-03-06 15:15:28,941][03942] Fps is (10 sec: 12492.7, 60 sec: 12595.2, 300 sec: 12614.3). Total num frames: 36298752. Throughput: 0: 12593.1. Samples: 36278587. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:15:28,941][03942] Avg episode reward: [(0, '1169.972')] [2023-03-06 15:15:29,037][04272] Updated weights for policy 0, policy_version 35450 (0.0007) [2023-03-06 15:15:29,863][04272] Updated weights for policy 0, policy_version 35460 (0.0006) [2023-03-06 15:15:30,674][04272] Updated weights for policy 0, policy_version 35470 (0.0006) [2023-03-06 15:15:31,477][04272] Updated weights for policy 0, policy_version 35480 (0.0006) [2023-03-06 15:15:32,297][04272] Updated weights for policy 0, policy_version 35490 (0.0006) [2023-03-06 15:15:33,109][04272] Updated weights for policy 0, policy_version 35500 (0.0006) [2023-03-06 15:15:33,907][04272] Updated weights for policy 0, policy_version 35510 (0.0006) [2023-03-06 15:15:33,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12614.3). Total num frames: 36362240. Throughput: 0: 12589.6. Samples: 36354258. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:15:33,941][03942] Avg episode reward: [(0, '1155.401')] [2023-03-06 15:15:34,714][04272] Updated weights for policy 0, policy_version 35520 (0.0007) [2023-03-06 15:15:35,538][04272] Updated weights for policy 0, policy_version 35530 (0.0006) [2023-03-06 15:15:36,360][04272] Updated weights for policy 0, policy_version 35540 (0.0006) [2023-03-06 15:15:37,165][04272] Updated weights for policy 0, policy_version 35550 (0.0006) [2023-03-06 15:15:37,970][04272] Updated weights for policy 0, policy_version 35560 (0.0007) [2023-03-06 15:15:38,805][04272] Updated weights for policy 0, policy_version 35570 (0.0007) [2023-03-06 15:15:38,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12578.1, 300 sec: 12610.8). Total num frames: 36424704. Throughput: 0: 12590.8. Samples: 36392061. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:15:38,951][03942] Avg episode reward: [(0, '1046.295')] [2023-03-06 15:15:39,612][04272] Updated weights for policy 0, policy_version 35580 (0.0007) [2023-03-06 15:15:40,416][04272] Updated weights for policy 0, policy_version 35590 (0.0007) [2023-03-06 15:15:41,228][04272] Updated weights for policy 0, policy_version 35600 (0.0007) [2023-03-06 15:15:42,053][04272] Updated weights for policy 0, policy_version 35610 (0.0006) [2023-03-06 15:15:42,856][04272] Updated weights for policy 0, policy_version 35620 (0.0007) [2023-03-06 15:15:43,684][04272] Updated weights for policy 0, policy_version 35630 (0.0007) [2023-03-06 15:15:43,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12614.3). Total num frames: 36488192. Throughput: 0: 12585.1. Samples: 36467441. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:15:43,951][03942] Avg episode reward: [(0, '1170.670')] [2023-03-06 15:15:44,485][04272] Updated weights for policy 0, policy_version 35640 (0.0007) [2023-03-06 15:15:45,310][04272] Updated weights for policy 0, policy_version 35650 (0.0007) [2023-03-06 15:15:46,118][04272] Updated weights for policy 0, policy_version 35660 (0.0007) [2023-03-06 15:15:46,933][04272] Updated weights for policy 0, policy_version 35670 (0.0006) [2023-03-06 15:15:47,745][04272] Updated weights for policy 0, policy_version 35680 (0.0006) [2023-03-06 15:15:47,826][04221] KL-divergence is very high: 2670.7698 [2023-03-06 15:15:48,543][04272] Updated weights for policy 0, policy_version 35690 (0.0006) [2023-03-06 15:15:48,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12578.1, 300 sec: 12610.8). Total num frames: 36550656. Throughput: 0: 12585.3. Samples: 36543042. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:15:48,941][03942] Avg episode reward: [(0, '1177.544')] [2023-03-06 15:15:49,196][04221] KL-divergence is very high: 151.9791 [2023-03-06 15:15:49,358][04221] KL-divergence is very high: 613.2791 [2023-03-06 15:15:49,365][04272] Updated weights for policy 0, policy_version 35700 (0.0006) [2023-03-06 15:15:49,607][04221] KL-divergence is very high: 133.9475 [2023-03-06 15:15:50,092][04221] KL-divergence is very high: 132.8026 [2023-03-06 15:15:50,166][04272] Updated weights for policy 0, policy_version 35710 (0.0006) [2023-03-06 15:15:50,485][04221] KL-divergence is very high: 993.2189 [2023-03-06 15:15:50,997][04272] Updated weights for policy 0, policy_version 35720 (0.0006) [2023-03-06 15:15:51,801][04272] Updated weights for policy 0, policy_version 35730 (0.0007) [2023-03-06 15:15:51,884][04221] KL-divergence is very high: 1438.2692 [2023-03-06 15:15:52,275][04221] KL-divergence is very high: 103.7412 [2023-03-06 15:15:52,598][04272] Updated weights for policy 0, policy_version 35740 (0.0006) [2023-03-06 15:15:53,002][04221] KL-divergence is very high: 388.7158 [2023-03-06 15:15:53,070][04221] KL-divergence is very high: 760.6716 [2023-03-06 15:15:53,332][04221] KL-divergence is very high: 7596.0864 [2023-03-06 15:15:53,411][04272] Updated weights for policy 0, policy_version 35750 (0.0007) [2023-03-06 15:15:53,499][04221] KL-divergence is very high: 237.3691 [2023-03-06 15:15:53,891][04221] KL-divergence is very high: 4221.4434 [2023-03-06 15:15:53,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12614.3). Total num frames: 36614144. Throughput: 0: 12580.8. Samples: 36580889. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:15:53,941][03942] Avg episode reward: [(0, '832.114')] [2023-03-06 15:15:53,975][04221] KL-divergence is very high: 170.4007 [2023-03-06 15:15:54,134][04221] KL-divergence is very high: 153.4961 [2023-03-06 15:15:54,229][04272] Updated weights for policy 0, policy_version 35760 (0.0006) [2023-03-06 15:15:54,475][04221] KL-divergence is very high: 456.5338 [2023-03-06 15:15:54,560][04221] KL-divergence is very high: 7507.3584 [2023-03-06 15:15:54,810][04221] KL-divergence is very high: 867.4406 [2023-03-06 15:15:55,049][04272] Updated weights for policy 0, policy_version 35770 (0.0007) [2023-03-06 15:15:55,136][04221] KL-divergence is very high: 117.5645 [2023-03-06 15:15:55,790][04221] KL-divergence is very high: 208.0078 [2023-03-06 15:15:55,862][04272] Updated weights for policy 0, policy_version 35780 (0.0006) [2023-03-06 15:15:56,673][04272] Updated weights for policy 0, policy_version 35790 (0.0007) [2023-03-06 15:15:57,086][04221] KL-divergence is very high: 1099.3552 [2023-03-06 15:15:57,311][04221] KL-divergence is very high: 107.0989 [2023-03-06 15:15:57,482][04272] Updated weights for policy 0, policy_version 35800 (0.0007) [2023-03-06 15:15:57,631][04221] KL-divergence is very high: 176.2724 [2023-03-06 15:15:57,909][04221] KL-divergence is very high: 2805.9158 [2023-03-06 15:15:58,069][04221] KL-divergence is very high: 4406.5996 [2023-03-06 15:15:58,124][04221] KL-divergence is very high: 204.1719 [2023-03-06 15:15:58,297][04272] Updated weights for policy 0, policy_version 35810 (0.0007) [2023-03-06 15:15:58,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12578.1, 300 sec: 12610.8). Total num frames: 36676608. Throughput: 0: 12588.4. Samples: 36656487. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:15:58,941][03942] Avg episode reward: [(0, '727.313')] [2023-03-06 15:15:59,122][04272] Updated weights for policy 0, policy_version 35820 (0.0007) [2023-03-06 15:15:59,940][04272] Updated weights for policy 0, policy_version 35830 (0.0007) [2023-03-06 15:16:00,088][04221] KL-divergence is very high: 117.4032 [2023-03-06 15:16:00,751][04272] Updated weights for policy 0, policy_version 35840 (0.0007) [2023-03-06 15:16:00,897][04221] KL-divergence is very high: 125.6710 [2023-03-06 15:16:01,053][04221] KL-divergence is very high: 277.0238 [2023-03-06 15:16:01,223][04221] KL-divergence is very high: 101.1254 [2023-03-06 15:16:01,566][04272] Updated weights for policy 0, policy_version 35850 (0.0006) [2023-03-06 15:16:01,611][04221] KL-divergence is very high: 311.5786 [2023-03-06 15:16:02,381][04272] Updated weights for policy 0, policy_version 35860 (0.0007) [2023-03-06 15:16:02,531][04221] KL-divergence is very high: 108.9649 [2023-03-06 15:16:03,197][04272] Updated weights for policy 0, policy_version 35870 (0.0007) [2023-03-06 15:16:03,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12610.8). Total num frames: 36740096. Throughput: 0: 12589.2. Samples: 36732047. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:16:03,941][03942] Avg episode reward: [(0, '538.442')] [2023-03-06 15:16:04,013][04272] Updated weights for policy 0, policy_version 35880 (0.0006) [2023-03-06 15:16:04,823][04272] Updated weights for policy 0, policy_version 35890 (0.0006) [2023-03-06 15:16:04,888][04221] KL-divergence is very high: 227.5872 [2023-03-06 15:16:05,137][04221] KL-divergence is very high: 1567.5470 [2023-03-06 15:16:05,474][04221] KL-divergence is very high: 165.0758 [2023-03-06 15:16:05,640][04272] Updated weights for policy 0, policy_version 35900 (0.0006) [2023-03-06 15:16:06,446][04272] Updated weights for policy 0, policy_version 35910 (0.0007) [2023-03-06 15:16:07,192][04221] KL-divergence is very high: 480.3168 [2023-03-06 15:16:07,266][04272] Updated weights for policy 0, policy_version 35920 (0.0007) [2023-03-06 15:16:07,993][04221] KL-divergence is very high: 221.8762 [2023-03-06 15:16:08,057][04221] KL-divergence is very high: 115.8430 [2023-03-06 15:16:08,066][04272] Updated weights for policy 0, policy_version 35930 (0.0006) [2023-03-06 15:16:08,215][04221] KL-divergence is very high: 305.2457 [2023-03-06 15:16:08,877][04272] Updated weights for policy 0, policy_version 35940 (0.0006) [2023-03-06 15:16:08,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12578.1, 300 sec: 12607.3). Total num frames: 36802560. Throughput: 0: 12590.2. Samples: 36769681. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:16:08,941][03942] Avg episode reward: [(0, '643.062')] [2023-03-06 15:16:09,357][04221] KL-divergence is very high: 305.7667 [2023-03-06 15:16:09,686][04272] Updated weights for policy 0, policy_version 35950 (0.0006) [2023-03-06 15:16:10,523][04272] Updated weights for policy 0, policy_version 35960 (0.0006) [2023-03-06 15:16:11,337][04272] Updated weights for policy 0, policy_version 35970 (0.0006) [2023-03-06 15:16:12,152][04272] Updated weights for policy 0, policy_version 35980 (0.0007) [2023-03-06 15:16:12,969][04272] Updated weights for policy 0, policy_version 35990 (0.0007) [2023-03-06 15:16:13,800][04272] Updated weights for policy 0, policy_version 36000 (0.0006) [2023-03-06 15:16:13,864][04221] KL-divergence is very high: 2883.6531 [2023-03-06 15:16:13,941][03942] Fps is (10 sec: 12492.7, 60 sec: 12578.1, 300 sec: 12607.3). Total num frames: 36865024. Throughput: 0: 12588.2. Samples: 36845058. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:16:13,941][03942] Avg episode reward: [(0, '726.779')] [2023-03-06 15:16:14,019][04221] KL-divergence is very high: 1376.8833 [2023-03-06 15:16:14,623][04272] Updated weights for policy 0, policy_version 36010 (0.0006) [2023-03-06 15:16:14,932][04221] KL-divergence is very high: 213.7544 [2023-03-06 15:16:15,091][04221] KL-divergence is very high: 316.3470 [2023-03-06 15:16:15,323][04221] KL-divergence is very high: 388.9896 [2023-03-06 15:16:15,418][04272] Updated weights for policy 0, policy_version 36020 (0.0006) [2023-03-06 15:16:15,567][04221] KL-divergence is very high: 239.9895 [2023-03-06 15:16:16,131][04221] KL-divergence is very high: 477.3546 [2023-03-06 15:16:16,237][04272] Updated weights for policy 0, policy_version 36030 (0.0006) [2023-03-06 15:16:17,049][04272] Updated weights for policy 0, policy_version 36040 (0.0007) [2023-03-06 15:16:17,858][04272] Updated weights for policy 0, policy_version 36050 (0.0006) [2023-03-06 
15:16:18,503][04221] KL-divergence is very high: 695.0270 [2023-03-06 15:16:18,662][04221] KL-divergence is very high: 146.9874 [2023-03-06 15:16:18,669][04272] Updated weights for policy 0, policy_version 36060 (0.0007) [2023-03-06 15:16:18,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12578.1, 300 sec: 12607.3). Total num frames: 36928512. Throughput: 0: 12583.3. Samples: 36920506. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:16:18,941][03942] Avg episode reward: [(0, '709.866')] [2023-03-06 15:16:19,229][04221] KL-divergence is very high: 353.5042 [2023-03-06 15:16:19,299][04221] KL-divergence is very high: 203.3094 [2023-03-06 15:16:19,401][04221] KL-divergence is very high: 1523.5846 [2023-03-06 15:16:19,467][04272] Updated weights for policy 0, policy_version 36070 (0.0006) [2023-03-06 15:16:19,878][04221] KL-divergence is very high: 421.5301 [2023-03-06 15:16:19,951][04221] KL-divergence is very high: 1660.6261 [2023-03-06 15:16:20,268][04272] Updated weights for policy 0, policy_version 36080 (0.0007) [2023-03-06 15:16:20,357][04221] KL-divergence is very high: 244.3212 [2023-03-06 15:16:20,515][04221] KL-divergence is very high: 703.2317 [2023-03-06 15:16:20,668][04221] KL-divergence is very high: 5719.4375 [2023-03-06 15:16:20,747][04221] KL-divergence is very high: 366.7299 [2023-03-06 15:16:20,842][04221] KL-divergence is very high: 569.6819 [2023-03-06 15:16:20,905][04221] KL-divergence is very high: 274.9664 [2023-03-06 15:16:21,001][04221] KL-divergence is very high: 15381.8086 [2023-03-06 15:16:21,064][04221] KL-divergence is very high: 11562.1162 [2023-03-06 15:16:21,070][04272] Updated weights for policy 0, policy_version 36090 (0.0005) [2023-03-06 15:16:21,164][04221] KL-divergence is very high: 14971.4170 [2023-03-06 15:16:21,238][04221] KL-divergence is very high: 1654.2108 [2023-03-06 15:16:21,325][04221] KL-divergence is very high: 5097.3062 [2023-03-06 15:16:21,393][04221] KL-divergence is very high: 7819.2871 [2023-03-06 
15:16:21,551][04221] KL-divergence is very high: 1727.9921 [2023-03-06 15:16:21,717][04221] KL-divergence is very high: 45211.4336 [2023-03-06 15:16:21,885][04221] KL-divergence is very high: 22905.7773 [2023-03-06 15:16:21,892][04272] Updated weights for policy 0, policy_version 36100 (0.0006) [2023-03-06 15:16:22,051][04221] KL-divergence is very high: 5893.9517 [2023-03-06 15:16:22,370][04221] KL-divergence is very high: 3032.1519 [2023-03-06 15:16:22,687][04221] KL-divergence is very high: 393.2677 [2023-03-06 15:16:22,695][04272] Updated weights for policy 0, policy_version 36110 (0.0007) [2023-03-06 15:16:22,789][04221] KL-divergence is very high: 116.5200 [2023-03-06 15:16:22,853][04221] KL-divergence is very high: 8842.9180 [2023-03-06 15:16:22,947][04221] KL-divergence is very high: 710.2408 [2023-03-06 15:16:23,012][04221] KL-divergence is very high: 212191.7500 [2023-03-06 15:16:23,116][04221] KL-divergence is very high: 955.7813 [2023-03-06 15:16:23,180][04221] KL-divergence is very high: 18792.5430 [2023-03-06 15:16:23,284][04221] KL-divergence is very high: 8257.4648 [2023-03-06 15:16:23,343][04221] KL-divergence is very high: 147.9298 [2023-03-06 15:16:23,444][04221] KL-divergence is very high: 4583.7319 [2023-03-06 15:16:23,521][04272] Updated weights for policy 0, policy_version 36120 (0.0006) [2023-03-06 15:16:23,593][04221] KL-divergence is very high: 1577.0471 [2023-03-06 15:16:23,681][04221] KL-divergence is very high: 151.0831 [2023-03-06 15:16:23,758][04221] KL-divergence is very high: 606.3046 [2023-03-06 15:16:23,922][04221] KL-divergence is very high: 4546.5977 [2023-03-06 15:16:23,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12595.2, 300 sec: 12610.8). Total num frames: 36992000. Throughput: 0: 12590.2. Samples: 36958621. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:16:23,941][03942] Avg episode reward: [(0, '705.132')] [2023-03-06 15:16:24,077][04221] KL-divergence is very high: 5342.8164 [2023-03-06 15:16:24,166][04221] KL-divergence is very high: 712.1511 [2023-03-06 15:16:24,231][04221] KL-divergence is very high: 16532.7090 [2023-03-06 15:16:24,323][04221] KL-divergence is very high: 3051.7600 [2023-03-06 15:16:24,329][04272] Updated weights for policy 0, policy_version 36130 (0.0006) [2023-03-06 15:16:24,394][04221] KL-divergence is very high: 176966.7031 [2023-03-06 15:16:24,560][04221] KL-divergence is very high: 25059.2480 [2023-03-06 15:16:24,726][04221] KL-divergence is very high: 21777.1953 [2023-03-06 15:16:24,804][04221] KL-divergence is very high: 1612.9590 [2023-03-06 15:16:24,887][04221] KL-divergence is very high: 1090.3035 [2023-03-06 15:16:24,964][04221] KL-divergence is very high: 3764.8005 [2023-03-06 15:16:25,052][04221] KL-divergence is very high: 170.5718 [2023-03-06 15:16:25,123][04221] KL-divergence is very high: 10921.7754 [2023-03-06 15:16:25,130][04272] Updated weights for policy 0, policy_version 36140 (0.0006) [2023-03-06 15:16:25,221][04221] KL-divergence is very high: 1543.6194 [2023-03-06 15:16:25,288][04221] KL-divergence is very high: 81267.2656 [2023-03-06 15:16:25,452][04221] KL-divergence is very high: 7372.5186 [2023-03-06 15:16:25,708][04221] KL-divergence is very high: 4245.6401 [2023-03-06 15:16:25,775][04221] KL-divergence is very high: 1138.7225 [2023-03-06 15:16:25,933][04221] KL-divergence is very high: 641.5809 [2023-03-06 15:16:25,940][04272] Updated weights for policy 0, policy_version 36150 (0.0007) [2023-03-06 15:16:26,028][04221] KL-divergence is very high: 740.0845 [2023-03-06 15:16:26,206][04221] KL-divergence is very high: 2350.9736 [2023-03-06 15:16:26,269][04221] KL-divergence is very high: 233.2724 [2023-03-06 15:16:26,369][04221] KL-divergence is very high: 1098.6864 [2023-03-06 15:16:26,430][04221] 
KL-divergence is very high: 4723795.0000 [2023-03-06 15:16:26,595][04221] KL-divergence is very high: 4236.0581 [2023-03-06 15:16:26,680][04221] KL-divergence is very high: 3718.3096 [2023-03-06 15:16:26,759][04221] KL-divergence is very high: 36692.7578 [2023-03-06 15:16:26,766][04272] Updated weights for policy 0, policy_version 36160 (0.0006) [2023-03-06 15:16:26,912][04221] KL-divergence is very high: 674.8839 [2023-03-06 15:16:26,994][04221] KL-divergence is very high: 109.5999 [2023-03-06 15:16:27,077][04221] KL-divergence is very high: 11232.6729 [2023-03-06 15:16:27,159][04221] KL-divergence is very high: 283.3683 [2023-03-06 15:16:27,237][04221] KL-divergence is very high: 31781.7988 [2023-03-06 15:16:27,319][04221] KL-divergence is very high: 8257.3223 [2023-03-06 15:16:27,398][04221] KL-divergence is very high: 67103.0781 [2023-03-06 15:16:27,488][04221] KL-divergence is very high: 911.1516 [2023-03-06 15:16:27,557][04221] KL-divergence is very high: 18628.1562 [2023-03-06 15:16:27,565][04272] Updated weights for policy 0, policy_version 36170 (0.0006) [2023-03-06 15:16:27,648][04221] KL-divergence is very high: 10862.6035 [2023-03-06 15:16:27,717][04221] KL-divergence is very high: 3854.5894 [2023-03-06 15:16:27,810][04221] KL-divergence is very high: 18182.8086 [2023-03-06 15:16:27,869][04221] KL-divergence is very high: 86824.8750 [2023-03-06 15:16:27,974][04221] KL-divergence is very high: 66291.3438 [2023-03-06 15:16:28,032][04221] KL-divergence is very high: 86016.9219 [2023-03-06 15:16:28,132][04221] KL-divergence is very high: 8356.4092 [2023-03-06 15:16:28,201][04221] KL-divergence is very high: 7312.5439 [2023-03-06 15:16:28,297][04221] KL-divergence is very high: 4892.3608 [2023-03-06 15:16:28,360][04221] KL-divergence is very high: 9238.1895 [2023-03-06 15:16:28,368][04272] Updated weights for policy 0, policy_version 36180 (0.0007) [2023-03-06 15:16:28,454][04221] KL-divergence is very high: 88882.6250 [2023-03-06 15:16:28,525][04221] 
KL-divergence is very high: 11303.2617 [2023-03-06 15:16:28,619][04221] KL-divergence is very high: 12943.5576 [2023-03-06 15:16:28,687][04221] KL-divergence is very high: 18636.8477 [2023-03-06 15:16:28,768][04221] KL-divergence is very high: 37885.0977 [2023-03-06 15:16:28,854][04221] KL-divergence is very high: 26830.4395 [2023-03-06 15:16:28,931][04221] KL-divergence is very high: 15129.6172 [2023-03-06 15:16:28,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12612.3, 300 sec: 12614.3). Total num frames: 37055488. Throughput: 0: 12601.3. Samples: 37034497. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:16:28,941][03942] Avg episode reward: [(0, '497.085')] [2023-03-06 15:16:29,032][04221] KL-divergence is very high: 1773.7079 [2023-03-06 15:16:29,086][04221] KL-divergence is very high: 18825.9863 [2023-03-06 15:16:29,184][04221] KL-divergence is very high: 12929.5820 [2023-03-06 15:16:29,193][04272] Updated weights for policy 0, policy_version 36190 (0.0006) [2023-03-06 15:16:29,265][04221] KL-divergence is very high: 16608.6055 [2023-03-06 15:16:29,339][04221] KL-divergence is very high: 75231.9844 [2023-03-06 15:16:29,423][04221] KL-divergence is very high: 1313.4414 [2023-03-06 15:16:29,505][04221] KL-divergence is very high: 3133.5381 [2023-03-06 15:16:29,582][04221] KL-divergence is very high: 11045.0596 [2023-03-06 15:16:29,667][04221] KL-divergence is very high: 14730.9336 [2023-03-06 15:16:29,750][04221] KL-divergence is very high: 7312.2090 [2023-03-06 15:16:29,839][04221] KL-divergence is very high: 1537.1694 [2023-03-06 15:16:29,911][04221] KL-divergence is very high: 18547.0234 [2023-03-06 15:16:29,999][04221] KL-divergence is very high: 10868.5771 [2023-03-06 15:16:30,006][04272] Updated weights for policy 0, policy_version 36200 (0.0006) [2023-03-06 15:16:30,061][04221] KL-divergence is very high: 1226.0923 [2023-03-06 15:16:30,164][04221] KL-divergence is very high: 7778.4517 [2023-03-06 15:16:30,230][04221] KL-divergence is very high: 
19620.5449 [2023-03-06 15:16:30,318][04221] KL-divergence is very high: 6551.0708 [2023-03-06 15:16:30,472][04221] KL-divergence is very high: 4730.4077 [2023-03-06 15:16:30,634][04221] KL-divergence is very high: 9027.6270 [2023-03-06 15:16:30,727][04221] KL-divergence is very high: 135258.5156 [2023-03-06 15:16:30,793][04221] KL-divergence is very high: 484908.9688 [2023-03-06 15:16:30,801][04272] Updated weights for policy 0, policy_version 36210 (0.0006) [2023-03-06 15:16:30,888][04221] KL-divergence is very high: 9634.2432 [2023-03-06 15:16:30,962][04221] KL-divergence is very high: 2863.5571 [2023-03-06 15:16:31,046][04221] KL-divergence is very high: 55797.5586 [2023-03-06 15:16:31,138][04221] KL-divergence is very high: 10412.6670 [2023-03-06 15:16:31,207][04221] KL-divergence is very high: 6005.9287 [2023-03-06 15:16:31,297][04221] KL-divergence is very high: 12217.3486 [2023-03-06 15:16:31,372][04221] KL-divergence is very high: 13846.0605 [2023-03-06 15:16:31,462][04221] KL-divergence is very high: 1812.0948 [2023-03-06 15:16:31,542][04221] KL-divergence is very high: 20592.8867 [2023-03-06 15:16:31,625][04221] KL-divergence is very high: 3451.0916 [2023-03-06 15:16:31,632][04272] Updated weights for policy 0, policy_version 36220 (0.0007) [2023-03-06 15:16:31,705][04221] KL-divergence is very high: 2073.0217 [2023-03-06 15:16:31,789][04221] KL-divergence is very high: 5267.4507 [2023-03-06 15:16:31,945][04221] KL-divergence is very high: 11368.8223 [2023-03-06 15:16:32,028][04221] KL-divergence is very high: 16860.6562 [2023-03-06 15:16:32,115][04221] KL-divergence is very high: 933.3149 [2023-03-06 15:16:32,191][04221] KL-divergence is very high: 2571.6938 [2023-03-06 15:16:32,275][04221] KL-divergence is very high: 2452.9995 [2023-03-06 15:16:32,352][04221] KL-divergence is very high: 284.6484 [2023-03-06 15:16:32,437][04221] KL-divergence is very high: 1673.1792 [2023-03-06 15:16:32,445][04272] Updated weights for policy 0, policy_version 36230 
(0.0006) [2023-03-06 15:16:32,509][04221] KL-divergence is very high: 889.9160 [2023-03-06 15:16:32,597][04221] KL-divergence is very high: 5629.2485 [2023-03-06 15:16:32,672][04221] KL-divergence is very high: 368.1420 [2023-03-06 15:16:32,764][04221] KL-divergence is very high: 2278.5837 [2023-03-06 15:16:32,839][04221] KL-divergence is very high: 5822.8081 [2023-03-06 15:16:32,921][04221] KL-divergence is very high: 1141.2065 [2023-03-06 15:16:33,006][04221] KL-divergence is very high: 6905.0405 [2023-03-06 15:16:33,084][04221] KL-divergence is very high: 13911.9268 [2023-03-06 15:16:33,175][04221] KL-divergence is very high: 43603.3242 [2023-03-06 15:16:33,250][04221] KL-divergence is very high: 106253.9922 [2023-03-06 15:16:33,258][04272] Updated weights for policy 0, policy_version 36240 (0.0007) [2023-03-06 15:16:33,332][04221] KL-divergence is very high: 23895.3750 [2023-03-06 15:16:33,409][04221] KL-divergence is very high: 29956.5488 [2023-03-06 15:16:33,479][04221] KL-divergence is very high: 1983.9917 [2023-03-06 15:16:33,565][04221] KL-divergence is very high: 43102.9648 [2023-03-06 15:16:33,638][04221] KL-divergence is very high: 10579.2529 [2023-03-06 15:16:33,735][04221] KL-divergence is very high: 7560.5244 [2023-03-06 15:16:33,804][04221] KL-divergence is very high: 1065.5763 [2023-03-06 15:16:33,888][04221] KL-divergence is very high: 777.8987 [2023-03-06 15:16:33,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12610.8). Total num frames: 37117952. Throughput: 0: 12600.6. Samples: 37110066. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:16:33,941][03942] Avg episode reward: [(0, '470.362')] [2023-03-06 15:16:33,969][04221] KL-divergence is very high: 4690.5483 [2023-03-06 15:16:34,046][04221] KL-divergence is very high: 3073.0620 [2023-03-06 15:16:34,053][04272] Updated weights for policy 0, policy_version 36250 (0.0006) [2023-03-06 15:16:34,137][04221] KL-divergence is very high: 7871.0317 [2023-03-06 15:16:34,204][04221] KL-divergence is very high: 4817.2666 [2023-03-06 15:16:34,294][04221] KL-divergence is very high: 13715.6172 [2023-03-06 15:16:34,364][04221] KL-divergence is very high: 948.2152 [2023-03-06 15:16:34,444][04221] KL-divergence is very high: 7613.9805 [2023-03-06 15:16:34,516][04221] KL-divergence is very high: 4983.6826 [2023-03-06 15:16:34,606][04221] KL-divergence is very high: 1649.4084 [2023-03-06 15:16:34,691][04221] KL-divergence is very high: 26068.9355 [2023-03-06 15:16:34,764][04221] KL-divergence is very high: 8942.6357 [2023-03-06 15:16:34,857][04221] KL-divergence is very high: 4527.5308 [2023-03-06 15:16:34,865][04272] Updated weights for policy 0, policy_version 36260 (0.0006) [2023-03-06 15:16:34,927][04221] KL-divergence is very high: 12295.3359 [2023-03-06 15:16:35,028][04221] KL-divergence is very high: 6027.9907 [2023-03-06 15:16:35,084][04221] KL-divergence is very high: 5637.9575 [2023-03-06 15:16:35,185][04221] KL-divergence is very high: 1079.3621 [2023-03-06 15:16:35,249][04221] KL-divergence is very high: 693.1069 [2023-03-06 15:16:35,351][04221] KL-divergence is very high: 939.5167 [2023-03-06 15:16:35,420][04221] KL-divergence is very high: 1139.8145 [2023-03-06 15:16:35,503][04221] KL-divergence is very high: 4608.6533 [2023-03-06 15:16:35,569][04221] KL-divergence is very high: 502.4474 [2023-03-06 15:16:35,660][04221] KL-divergence is very high: 1531.0430 [2023-03-06 15:16:35,666][04272] Updated weights for policy 0, policy_version 36270 (0.0006) [2023-03-06 15:16:35,727][04221] 
KL-divergence is very high: 3161.4548 [2023-03-06 15:16:35,825][04221] KL-divergence is very high: 2988.3403 [2023-03-06 15:16:35,905][04221] KL-divergence is very high: 3201.5806 [2023-03-06 15:16:35,988][04221] KL-divergence is very high: 2992.2144 [2023-03-06 15:16:36,063][04221] KL-divergence is very high: 1045.7506 [2023-03-06 15:16:36,139][04221] KL-divergence is very high: 1047.1925 [2023-03-06 15:16:36,229][04221] KL-divergence is very high: 43236.7656 [2023-03-06 15:16:36,310][04221] KL-divergence is very high: 892.9310 [2023-03-06 15:16:36,385][04221] KL-divergence is very high: 1693.1880 [2023-03-06 15:16:36,466][04221] KL-divergence is very high: 1490.4244 [2023-03-06 15:16:36,473][04272] Updated weights for policy 0, policy_version 36280 (0.0007) [2023-03-06 15:16:36,549][04221] KL-divergence is very high: 1516.3597 [2023-03-06 15:16:36,627][04221] KL-divergence is very high: 2982.5068 [2023-03-06 15:16:36,707][04221] KL-divergence is very high: 882.8840 [2023-03-06 15:16:36,789][04221] KL-divergence is very high: 302.7091 [2023-03-06 15:16:36,869][04221] KL-divergence is very high: 572.6888 [2023-03-06 15:16:36,949][04221] KL-divergence is very high: 1757.7272 [2023-03-06 15:16:37,026][04221] KL-divergence is very high: 517.9850 [2023-03-06 15:16:37,106][04221] KL-divergence is very high: 1306.5381 [2023-03-06 15:16:37,192][04221] KL-divergence is very high: 778.0542 [2023-03-06 15:16:37,264][04221] KL-divergence is very high: 2699.0237 [2023-03-06 15:16:37,270][04272] Updated weights for policy 0, policy_version 36290 (0.0006) [2023-03-06 15:16:37,353][04221] KL-divergence is very high: 978.5546 [2023-03-06 15:16:37,418][04221] KL-divergence is very high: 2111.0637 [2023-03-06 15:16:37,505][04221] KL-divergence is very high: 6892.3779 [2023-03-06 15:16:37,575][04221] KL-divergence is very high: 420.6126 [2023-03-06 15:16:37,662][04221] KL-divergence is very high: 177.5532 [2023-03-06 15:16:37,732][04221] KL-divergence is very high: 780.5383 
[2023-03-06 15:16:37,813][04221] KL-divergence is very high: 2644.3916 [2023-03-06 15:16:37,890][04221] KL-divergence is very high: 1121.6256 [2023-03-06 15:16:37,972][04221] KL-divergence is very high: 775.4503 [2023-03-06 15:16:38,061][04221] KL-divergence is very high: 4269.8008 [2023-03-06 15:16:38,069][04272] Updated weights for policy 0, policy_version 36300 (0.0006) [2023-03-06 15:16:38,147][04221] KL-divergence is very high: 1041.3215 [2023-03-06 15:16:38,238][04221] KL-divergence is very high: 2084.3999 [2023-03-06 15:16:38,316][04221] KL-divergence is very high: 3956.9785 [2023-03-06 15:16:38,415][04221] KL-divergence is very high: 1620.0233 [2023-03-06 15:16:38,483][04221] KL-divergence is very high: 806.4322 [2023-03-06 15:16:38,575][04221] KL-divergence is very high: 1430.4506 [2023-03-06 15:16:38,654][04221] KL-divergence is very high: 970.7991 [2023-03-06 15:16:38,740][04221] KL-divergence is very high: 1255.6917 [2023-03-06 15:16:38,822][04221] KL-divergence is very high: 29468.2285 [2023-03-06 15:16:38,893][04221] KL-divergence is very high: 2897.5854 [2023-03-06 15:16:38,899][04272] Updated weights for policy 0, policy_version 36310 (0.0007) [2023-03-06 15:16:38,941][03942] Fps is (10 sec: 12595.0, 60 sec: 12612.2, 300 sec: 12614.3). Total num frames: 37181440. Throughput: 0: 12609.0. Samples: 37148295. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:16:38,941][03942] Avg episode reward: [(0, '301.188')] [2023-03-06 15:16:38,975][04221] KL-divergence is very high: 498.0213 [2023-03-06 15:16:39,067][04221] KL-divergence is very high: 3434.5486 [2023-03-06 15:16:39,146][04221] KL-divergence is very high: 1083.0386 [2023-03-06 15:16:39,222][04221] KL-divergence is very high: 1904.2465 [2023-03-06 15:16:39,307][04221] KL-divergence is very high: 232.3831 [2023-03-06 15:16:39,376][04221] KL-divergence is very high: 6496.2563 [2023-03-06 15:16:39,464][04221] KL-divergence is very high: 794.2767 [2023-03-06 15:16:39,527][04221] KL-divergence is very high: 5827.6235 [2023-03-06 15:16:39,627][04221] KL-divergence is very high: 822.9086 [2023-03-06 15:16:39,701][04221] KL-divergence is very high: 3912.7751 [2023-03-06 15:16:39,707][04272] Updated weights for policy 0, policy_version 36320 (0.0006) [2023-03-06 15:16:39,785][04221] KL-divergence is very high: 869.3207 [2023-03-06 15:16:39,861][04221] KL-divergence is very high: 612.9102 [2023-03-06 15:16:40,017][04221] KL-divergence is very high: 233.2454 [2023-03-06 15:16:40,529][04221] KL-divergence is very high: 420.7125 [2023-03-06 15:16:40,536][04272] Updated weights for policy 0, policy_version 36330 (0.0006) [2023-03-06 15:16:40,613][04221] KL-divergence is very high: 154.6899 [2023-03-06 15:16:40,683][04221] KL-divergence is very high: 105.3098 [2023-03-06 15:16:40,770][04221] KL-divergence is very high: 435.8299 [2023-03-06 15:16:40,845][04221] KL-divergence is very high: 203.9268 [2023-03-06 15:16:40,936][04221] KL-divergence is very high: 142.3659 [2023-03-06 15:16:41,100][04221] KL-divergence is very high: 168.0155 [2023-03-06 15:16:41,168][04221] KL-divergence is very high: 301.4181 [2023-03-06 15:16:41,262][04221] KL-divergence is very high: 140.9241 [2023-03-06 15:16:41,340][04221] KL-divergence is very high: 263.8633 [2023-03-06 15:16:41,347][04272] Updated weights for policy 0, policy_version 
36340 (0.0006) [2023-03-06 15:16:41,420][04221] KL-divergence is very high: 131.0074 [2023-03-06 15:16:41,576][04221] KL-divergence is very high: 105.5777 [2023-03-06 15:16:41,667][04221] KL-divergence is very high: 406.7606 [2023-03-06 15:16:41,986][04221] KL-divergence is very high: 192.8930 [2023-03-06 15:16:42,078][04221] KL-divergence is very high: 219.9424 [2023-03-06 15:16:42,146][04272] Updated weights for policy 0, policy_version 36350 (0.0007) [2023-03-06 15:16:42,240][04221] KL-divergence is very high: 2298.9688 [2023-03-06 15:16:42,319][04221] KL-divergence is very high: 25051.5703 [2023-03-06 15:16:42,412][04221] KL-divergence is very high: 305.3925 [2023-03-06 15:16:42,487][04221] KL-divergence is very high: 352.4695 [2023-03-06 15:16:42,647][04221] KL-divergence is very high: 107.4020 [2023-03-06 15:16:42,797][04221] KL-divergence is very high: 143.8826 [2023-03-06 15:16:42,970][04272] Updated weights for policy 0, policy_version 36360 (0.0007) [2023-03-06 15:16:43,052][04221] KL-divergence is very high: 912.0643 [2023-03-06 15:16:43,213][04221] KL-divergence is very high: 1842.5599 [2023-03-06 15:16:43,450][04221] KL-divergence is very high: 656.7870 [2023-03-06 15:16:43,526][04221] KL-divergence is very high: 2832.5042 [2023-03-06 15:16:43,687][04221] KL-divergence is very high: 388.5712 [2023-03-06 15:16:43,786][04272] Updated weights for policy 0, policy_version 36370 (0.0006) [2023-03-06 15:16:43,853][04221] KL-divergence is very high: 290.8778 [2023-03-06 15:16:43,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12610.8). Total num frames: 37243904. Throughput: 0: 12607.8. Samples: 37223839. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:16:43,941][03942] Avg episode reward: [(0, '351.461')] [2023-03-06 15:16:44,014][04221] KL-divergence is very high: 155.7780 [2023-03-06 15:16:44,176][04221] KL-divergence is very high: 782.7578 [2023-03-06 15:16:44,264][04221] KL-divergence is very high: 274.6335 [2023-03-06 15:16:44,339][04221] KL-divergence is very high: 495.1157 [2023-03-06 15:16:44,499][04221] KL-divergence is very high: 508.7589 [2023-03-06 15:16:44,596][04272] Updated weights for policy 0, policy_version 36380 (0.0008) [2023-03-06 15:16:44,662][04221] KL-divergence is very high: 489.7711 [2023-03-06 15:16:44,762][04221] KL-divergence is very high: 850.0688 [2023-03-06 15:16:44,819][04221] KL-divergence is very high: 165.3121 [2023-03-06 15:16:44,914][04221] KL-divergence is very high: 2662.1802 [2023-03-06 15:16:44,990][04221] KL-divergence is very high: 8406.0850 [2023-03-06 15:16:45,150][04221] KL-divergence is very high: 103.7289 [2023-03-06 15:16:45,234][04221] KL-divergence is very high: 1327.9313 [2023-03-06 15:16:45,403][04272] Updated weights for policy 0, policy_version 36390 (0.0006) [2023-03-06 15:16:45,803][04221] KL-divergence is very high: 613.7491 [2023-03-06 15:16:45,963][04221] KL-divergence is very high: 495.9028 [2023-03-06 15:16:46,120][04221] KL-divergence is very high: 130.9857 [2023-03-06 15:16:46,205][04221] KL-divergence is very high: 387.8292 [2023-03-06 15:16:46,213][04272] Updated weights for policy 0, policy_version 36400 (0.0006) [2023-03-06 15:16:46,287][04221] KL-divergence is very high: 227.6333 [2023-03-06 15:16:46,378][04221] KL-divergence is very high: 163.4903 [2023-03-06 15:16:46,463][04221] KL-divergence is very high: 352.6751 [2023-03-06 15:16:46,539][04221] KL-divergence is very high: 2109.9014 [2023-03-06 15:16:46,618][04221] KL-divergence is very high: 685.5219 [2023-03-06 15:16:46,720][04221] KL-divergence is very high: 748.3895 [2023-03-06 15:16:46,879][04221] KL-divergence is very high: 
110.1828 [2023-03-06 15:16:47,041][04221] KL-divergence is very high: 3662.1018 [2023-03-06 15:16:47,049][04272] Updated weights for policy 0, policy_version 36410 (0.0006) [2023-03-06 15:16:47,092][04221] KL-divergence is very high: 331.0796 [2023-03-06 15:16:47,200][04221] KL-divergence is very high: 135.1067 [2023-03-06 15:16:47,365][04221] KL-divergence is very high: 110.1395 [2023-03-06 15:16:47,842][04272] Updated weights for policy 0, policy_version 36420 (0.0006) [2023-03-06 15:16:48,169][04221] KL-divergence is very high: 2021.7926 [2023-03-06 15:16:48,330][04221] KL-divergence is very high: 118.6829 [2023-03-06 15:16:48,663][04221] KL-divergence is very high: 149.1502 [2023-03-06 15:16:48,670][04272] Updated weights for policy 0, policy_version 36430 (0.0007) [2023-03-06 15:16:48,748][04221] KL-divergence is very high: 271.2299 [2023-03-06 15:16:48,908][04221] KL-divergence is very high: 187.7435 [2023-03-06 15:16:48,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12614.3). Total num frames: 37307392. Throughput: 0: 12609.6. Samples: 37299479. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:16:48,941][03942] Avg episode reward: [(0, '435.591')] [2023-03-06 15:16:48,985][04221] KL-divergence is very high: 179.7858 [2023-03-06 15:16:49,075][04221] KL-divergence is very high: 216.6439 [2023-03-06 15:16:49,146][04221] KL-divergence is very high: 231.5527 [2023-03-06 15:16:49,478][04221] KL-divergence is very high: 174.0125 [2023-03-06 15:16:49,486][04272] Updated weights for policy 0, policy_version 36440 (0.0006) [2023-03-06 15:16:49,887][04221] KL-divergence is very high: 1462.3724 [2023-03-06 15:16:49,961][04221] KL-divergence is very high: 298.2152 [2023-03-06 15:16:50,129][04221] KL-divergence is very high: 818.3335 [2023-03-06 15:16:50,211][04221] KL-divergence is very high: 427.5954 [2023-03-06 15:16:50,286][04272] Updated weights for policy 0, policy_version 36450 (0.0006) [2023-03-06 15:16:50,758][04221] KL-divergence is very high: 248.0256 [2023-03-06 15:16:51,079][04221] KL-divergence is very high: 632.8887 [2023-03-06 15:16:51,087][04272] Updated weights for policy 0, policy_version 36460 (0.0006) [2023-03-06 15:16:51,252][04221] KL-divergence is very high: 1814.9741 [2023-03-06 15:16:51,347][04221] KL-divergence is very high: 209.3688 [2023-03-06 15:16:51,414][04221] KL-divergence is very high: 346.6256 [2023-03-06 15:16:51,512][04221] KL-divergence is very high: 133.5834 [2023-03-06 15:16:51,579][04221] KL-divergence is very high: 201.3613 [2023-03-06 15:16:51,667][04221] KL-divergence is very high: 267.8542 [2023-03-06 15:16:51,745][04221] KL-divergence is very high: 116.0705 [2023-03-06 15:16:51,836][04221] KL-divergence is very high: 831.8550 [2023-03-06 15:16:51,926][04272] Updated weights for policy 0, policy_version 36470 (0.0006) [2023-03-06 15:16:51,993][04221] KL-divergence is very high: 981.5623 [2023-03-06 15:16:52,150][04221] KL-divergence is very high: 204.1729 [2023-03-06 15:16:52,492][04221] KL-divergence is very high: 459.9932 [2023-03-06 15:16:52,565][04221] 
KL-divergence is very high: 1774.0787 [2023-03-06 15:16:52,649][04221] KL-divergence is very high: 166.2018 [2023-03-06 15:16:52,733][04272] Updated weights for policy 0, policy_version 36480 (0.0006) [2023-03-06 15:16:52,904][04221] KL-divergence is very high: 324.9781 [2023-03-06 15:16:52,973][04221] KL-divergence is very high: 113.8763 [2023-03-06 15:16:53,135][04221] KL-divergence is very high: 182.7481 [2023-03-06 15:16:53,553][04272] Updated weights for policy 0, policy_version 36490 (0.0006) [2023-03-06 15:16:53,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12610.8). Total num frames: 37369856. Throughput: 0: 12614.2. Samples: 37337322. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:16:53,941][03942] Avg episode reward: [(0, '400.509')] [2023-03-06 15:16:54,178][04221] KL-divergence is very high: 111.1666 [2023-03-06 15:16:54,336][04221] KL-divergence is very high: 140.4039 [2023-03-06 15:16:54,345][04272] Updated weights for policy 0, policy_version 36500 (0.0006) [2023-03-06 15:16:54,815][04221] KL-divergence is very high: 131.5684 [2023-03-06 15:16:54,903][04221] KL-divergence is very high: 171.9462 [2023-03-06 15:16:55,071][04221] KL-divergence is very high: 338.5374 [2023-03-06 15:16:55,155][04272] Updated weights for policy 0, policy_version 36510 (0.0007) [2023-03-06 15:16:55,239][04221] KL-divergence is very high: 2158.2917 [2023-03-06 15:16:55,486][04221] KL-divergence is very high: 221.5567 [2023-03-06 15:16:55,715][04221] KL-divergence is very high: 248.7065 [2023-03-06 15:16:55,805][04221] KL-divergence is very high: 179.4427 [2023-03-06 15:16:55,877][04221] KL-divergence is very high: 169.7240 [2023-03-06 15:16:55,956][04221] KL-divergence is very high: 514.3583 [2023-03-06 15:16:55,962][04272] Updated weights for policy 0, policy_version 36520 (0.0006) [2023-03-06 15:16:56,119][04221] KL-divergence is very high: 150.7067 [2023-03-06 15:16:56,192][04221] KL-divergence is very high: 1481.1354 [2023-03-06 
15:16:56,282][04221] KL-divergence is very high: 324.3826 [2023-03-06 15:16:56,372][04221] KL-divergence is very high: 631.8408 [2023-03-06 15:16:56,444][04221] KL-divergence is very high: 365.8738 [2023-03-06 15:16:56,765][04272] Updated weights for policy 0, policy_version 36530 (0.0007) [2023-03-06 15:16:56,860][04221] KL-divergence is very high: 5157.4951 [2023-03-06 15:16:56,917][04221] KL-divergence is very high: 1245.9613 [2023-03-06 15:16:57,017][04221] KL-divergence is very high: 337.9434 [2023-03-06 15:16:57,585][04272] Updated weights for policy 0, policy_version 36540 (0.0006) [2023-03-06 15:16:57,819][04221] KL-divergence is very high: 104.9187 [2023-03-06 15:16:57,971][04221] KL-divergence is very high: 410.5308 [2023-03-06 15:16:58,390][04272] Updated weights for policy 0, policy_version 36550 (0.0006) [2023-03-06 15:16:58,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12614.3). Total num frames: 37433344. Throughput: 0: 12623.7. Samples: 37413126. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:16:58,941][03942] Avg episode reward: [(0, '424.590')] [2023-03-06 15:16:59,200][04272] Updated weights for policy 0, policy_version 36560 (0.0006) [2023-03-06 15:17:00,029][04272] Updated weights for policy 0, policy_version 36570 (0.0007) [2023-03-06 15:17:00,843][04272] Updated weights for policy 0, policy_version 36580 (0.0006) [2023-03-06 15:17:01,661][04272] Updated weights for policy 0, policy_version 36590 (0.0006) [2023-03-06 15:17:02,462][04272] Updated weights for policy 0, policy_version 36600 (0.0006) [2023-03-06 15:17:03,167][04221] KL-divergence is very high: 125.1597 [2023-03-06 15:17:03,267][04272] Updated weights for policy 0, policy_version 36610 (0.0006) [2023-03-06 15:17:03,941][03942] Fps is (10 sec: 12697.7, 60 sec: 12612.3, 300 sec: 12614.3). Total num frames: 37496832. Throughput: 0: 12629.4. Samples: 37488829. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:17:03,941][03942] Avg episode reward: [(0, '577.097')] [2023-03-06 15:17:04,070][04272] Updated weights for policy 0, policy_version 36620 (0.0006) [2023-03-06 15:17:04,862][04272] Updated weights for policy 0, policy_version 36630 (0.0006) [2023-03-06 15:17:05,524][04221] KL-divergence is very high: 144.8688 [2023-03-06 15:17:05,686][04272] Updated weights for policy 0, policy_version 36640 (0.0007) [2023-03-06 15:17:05,921][04221] KL-divergence is very high: 74706760.0000 [2023-03-06 15:17:06,495][04272] Updated weights for policy 0, policy_version 36650 (0.0008) [2023-03-06 15:17:07,137][04221] KL-divergence is very high: 121.0968 [2023-03-06 15:17:07,302][04272] Updated weights for policy 0, policy_version 36660 (0.0006) [2023-03-06 15:17:08,115][04272] Updated weights for policy 0, policy_version 36670 (0.0007) [2023-03-06 15:17:08,928][04272] Updated weights for policy 0, policy_version 36680 (0.0005) [2023-03-06 15:17:08,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12614.3). Total num frames: 37560320. Throughput: 0: 12627.4. Samples: 37526852. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:17:08,941][03942] Avg episode reward: [(0, '603.545')] [2023-03-06 15:17:08,944][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000036680_37560320.pth... 
[2023-03-06 15:17:08,977][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000033724_34533376.pth [2023-03-06 15:17:09,765][04272] Updated weights for policy 0, policy_version 36690 (0.0007) [2023-03-06 15:17:10,566][04221] KL-divergence is very high: 161.1063 [2023-03-06 15:17:10,575][04272] Updated weights for policy 0, policy_version 36700 (0.0007) [2023-03-06 15:17:11,368][04272] Updated weights for policy 0, policy_version 36710 (0.0006) [2023-03-06 15:17:12,192][04272] Updated weights for policy 0, policy_version 36720 (0.0007) [2023-03-06 15:17:13,017][04272] Updated weights for policy 0, policy_version 36730 (0.0006) [2023-03-06 15:17:13,325][04221] KL-divergence is very high: 208.8736 [2023-03-06 15:17:13,810][04272] Updated weights for policy 0, policy_version 36740 (0.0007) [2023-03-06 15:17:13,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12629.3, 300 sec: 12614.3). Total num frames: 37622784. Throughput: 0: 12620.8. Samples: 37602434. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:17:13,941][03942] Avg episode reward: [(0, '571.224')] [2023-03-06 15:17:14,638][04272] Updated weights for policy 0, policy_version 36750 (0.0008) [2023-03-06 15:17:15,445][04272] Updated weights for policy 0, policy_version 36760 (0.0006) [2023-03-06 15:17:16,072][04221] KL-divergence is very high: 516981600.0000 [2023-03-06 15:17:16,238][04272] Updated weights for policy 0, policy_version 36770 (0.0006) [2023-03-06 15:17:17,057][04272] Updated weights for policy 0, policy_version 36780 (0.0007) [2023-03-06 15:17:17,875][04272] Updated weights for policy 0, policy_version 36790 (0.0008) [2023-03-06 15:17:18,666][04272] Updated weights for policy 0, policy_version 36800 (0.0006) [2023-03-06 15:17:18,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12629.3, 300 sec: 12614.3). Total num frames: 37686272. Throughput: 0: 12625.3. Samples: 37678203. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:17:18,941][03942] Avg episode reward: [(0, '518.071')] [2023-03-06 15:17:19,473][04272] Updated weights for policy 0, policy_version 36810 (0.0005) [2023-03-06 15:17:20,272][04272] Updated weights for policy 0, policy_version 36820 (0.0006) [2023-03-06 15:17:21,091][04272] Updated weights for policy 0, policy_version 36830 (0.0006) [2023-03-06 15:17:21,893][04272] Updated weights for policy 0, policy_version 36840 (0.0006) [2023-03-06 15:17:22,708][04272] Updated weights for policy 0, policy_version 36850 (0.0007) [2023-03-06 15:17:23,520][04272] Updated weights for policy 0, policy_version 36860 (0.0007) [2023-03-06 15:17:23,941][03942] Fps is (10 sec: 12697.7, 60 sec: 12629.3, 300 sec: 12617.8). Total num frames: 37749760. Throughput: 0: 12621.9. Samples: 37716280. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:17:23,941][03942] Avg episode reward: [(0, '742.833')] [2023-03-06 15:17:24,324][04272] Updated weights for policy 0, policy_version 36870 (0.0007) [2023-03-06 15:17:25,154][04272] Updated weights for policy 0, policy_version 36880 (0.0007) [2023-03-06 15:17:25,957][04272] Updated weights for policy 0, policy_version 36890 (0.0006) [2023-03-06 15:17:26,767][04272] Updated weights for policy 0, policy_version 36900 (0.0006) [2023-03-06 15:17:27,573][04272] Updated weights for policy 0, policy_version 36910 (0.0007) [2023-03-06 15:17:28,390][04272] Updated weights for policy 0, policy_version 36920 (0.0006) [2023-03-06 15:17:28,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12614.3). Total num frames: 37812224. Throughput: 0: 12624.9. Samples: 37791959. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:17:28,941][03942] Avg episode reward: [(0, '731.319')] [2023-03-06 15:17:28,956][04221] KL-divergence is very high: 138.3642 [2023-03-06 15:17:29,213][04272] Updated weights for policy 0, policy_version 36930 (0.0006) [2023-03-06 15:17:30,025][04272] Updated weights for policy 0, policy_version 36940 (0.0006) [2023-03-06 15:17:30,848][04272] Updated weights for policy 0, policy_version 36950 (0.0006) [2023-03-06 15:17:30,924][04221] KL-divergence is very high: 189.8371 [2023-03-06 15:17:31,482][04221] KL-divergence is very high: 494.8389 [2023-03-06 15:17:31,650][04272] Updated weights for policy 0, policy_version 36960 (0.0006) [2023-03-06 15:17:32,450][04221] KL-divergence is very high: 2382.4417 [2023-03-06 15:17:32,458][04272] Updated weights for policy 0, policy_version 36970 (0.0007) [2023-03-06 15:17:32,925][04221] KL-divergence is very high: 473.9758 [2023-03-06 15:17:33,083][04221] KL-divergence is very high: 293.3133 [2023-03-06 15:17:33,257][04272] Updated weights for policy 0, policy_version 36980 (0.0006) [2023-03-06 15:17:33,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12614.3). Total num frames: 37875712. Throughput: 0: 12629.4. Samples: 37867802. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:17:33,941][03942] Avg episode reward: [(0, '782.679')] [2023-03-06 15:17:34,054][04272] Updated weights for policy 0, policy_version 36990 (0.0006) [2023-03-06 15:17:34,873][04272] Updated weights for policy 0, policy_version 37000 (0.0006) [2023-03-06 15:17:35,682][04272] Updated weights for policy 0, policy_version 37010 (0.0006) [2023-03-06 15:17:36,494][04272] Updated weights for policy 0, policy_version 37020 (0.0007) [2023-03-06 15:17:36,570][04221] KL-divergence is very high: 188.0941 [2023-03-06 15:17:37,295][04272] Updated weights for policy 0, policy_version 37030 (0.0006) [2023-03-06 15:17:37,609][04221] KL-divergence is very high: 489.3328 [2023-03-06 15:17:38,096][04221] KL-divergence is very high: 334.2836 [2023-03-06 15:17:38,104][04272] Updated weights for policy 0, policy_version 37040 (0.0006) [2023-03-06 15:17:38,330][04221] KL-divergence is very high: 264.6406 [2023-03-06 15:17:38,905][04272] Updated weights for policy 0, policy_version 37050 (0.0007) [2023-03-06 15:17:38,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.4, 300 sec: 12614.3). Total num frames: 37939200. Throughput: 0: 12632.2. Samples: 37905768. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:17:38,941][03942] Avg episode reward: [(0, '708.426')] [2023-03-06 15:17:39,696][04221] KL-divergence is very high: 258.1661 [2023-03-06 15:17:39,702][04272] Updated weights for policy 0, policy_version 37060 (0.0006) [2023-03-06 15:17:40,523][04272] Updated weights for policy 0, policy_version 37070 (0.0006) [2023-03-06 15:17:41,326][04272] Updated weights for policy 0, policy_version 37080 (0.0006) [2023-03-06 15:17:42,132][04272] Updated weights for policy 0, policy_version 37090 (0.0006) [2023-03-06 15:17:42,941][04272] Updated weights for policy 0, policy_version 37100 (0.0007) [2023-03-06 15:17:43,757][04272] Updated weights for policy 0, policy_version 37110 (0.0007) [2023-03-06 15:17:43,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12646.4, 300 sec: 12617.8). Total num frames: 38002688. Throughput: 0: 12642.1. Samples: 37982020. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:17:43,941][03942] Avg episode reward: [(0, '748.604')] [2023-03-06 15:17:44,574][04272] Updated weights for policy 0, policy_version 37120 (0.0006) [2023-03-06 15:17:45,407][04272] Updated weights for policy 0, policy_version 37130 (0.0006) [2023-03-06 15:17:46,213][04272] Updated weights for policy 0, policy_version 37140 (0.0006) [2023-03-06 15:17:47,001][04272] Updated weights for policy 0, policy_version 37150 (0.0006) [2023-03-06 15:17:47,801][04272] Updated weights for policy 0, policy_version 37160 (0.0006) [2023-03-06 15:17:48,614][04272] Updated weights for policy 0, policy_version 37170 (0.0007) [2023-03-06 15:17:48,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12646.4, 300 sec: 12617.8). Total num frames: 38066176. Throughput: 0: 12644.8. Samples: 38057845. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:17:48,941][03942] Avg episode reward: [(0, '758.048')] [2023-03-06 15:17:49,423][04272] Updated weights for policy 0, policy_version 37180 (0.0006) [2023-03-06 15:17:50,215][04272] Updated weights for policy 0, policy_version 37190 (0.0007) [2023-03-06 15:17:51,050][04272] Updated weights for policy 0, policy_version 37200 (0.0007) [2023-03-06 15:17:51,856][04272] Updated weights for policy 0, policy_version 37210 (0.0006) [2023-03-06 15:17:52,678][04272] Updated weights for policy 0, policy_version 37220 (0.0005) [2023-03-06 15:17:53,490][04272] Updated weights for policy 0, policy_version 37230 (0.0006) [2023-03-06 15:17:53,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12646.4, 300 sec: 12617.8). Total num frames: 38128640. Throughput: 0: 12644.6. Samples: 38095858. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:17:53,941][03942] Avg episode reward: [(0, '733.669')] [2023-03-06 15:17:54,292][04272] Updated weights for policy 0, policy_version 37240 (0.0007) [2023-03-06 15:17:55,098][04272] Updated weights for policy 0, policy_version 37250 (0.0006) [2023-03-06 15:17:55,884][04272] Updated weights for policy 0, policy_version 37260 (0.0006) [2023-03-06 15:17:56,701][04272] Updated weights for policy 0, policy_version 37270 (0.0006) [2023-03-06 15:17:57,505][04272] Updated weights for policy 0, policy_version 37280 (0.0006) [2023-03-06 15:17:58,325][04272] Updated weights for policy 0, policy_version 37290 (0.0006) [2023-03-06 15:17:58,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12646.4, 300 sec: 12617.8). Total num frames: 38192128. Throughput: 0: 12650.7. Samples: 38171714. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:17:58,941][03942] Avg episode reward: [(0, '659.026')] [2023-03-06 15:17:59,144][04272] Updated weights for policy 0, policy_version 37300 (0.0007) [2023-03-06 15:17:59,975][04272] Updated weights for policy 0, policy_version 37310 (0.0006) [2023-03-06 15:18:00,785][04272] Updated weights for policy 0, policy_version 37320 (0.0006) [2023-03-06 15:18:01,591][04272] Updated weights for policy 0, policy_version 37330 (0.0006) [2023-03-06 15:18:02,392][04272] Updated weights for policy 0, policy_version 37340 (0.0006) [2023-03-06 15:18:03,205][04272] Updated weights for policy 0, policy_version 37350 (0.0006) [2023-03-06 15:18:03,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12629.3, 300 sec: 12614.3). Total num frames: 38254592. Throughput: 0: 12646.9. Samples: 38247313. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:18:03,941][03942] Avg episode reward: [(0, '823.277')] [2023-03-06 15:18:04,006][04272] Updated weights for policy 0, policy_version 37360 (0.0007) [2023-03-06 15:18:04,836][04272] Updated weights for policy 0, policy_version 37370 (0.0007) [2023-03-06 15:18:05,669][04272] Updated weights for policy 0, policy_version 37380 (0.0006) [2023-03-06 15:18:06,467][04272] Updated weights for policy 0, policy_version 37390 (0.0007) [2023-03-06 15:18:07,282][04272] Updated weights for policy 0, policy_version 37400 (0.0006) [2023-03-06 15:18:08,098][04272] Updated weights for policy 0, policy_version 37410 (0.0007) [2023-03-06 15:18:08,902][04272] Updated weights for policy 0, policy_version 37420 (0.0006) [2023-03-06 15:18:08,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12629.3, 300 sec: 12617.8). Total num frames: 38318080. Throughput: 0: 12637.5. Samples: 38284970. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:18:08,941][03942] Avg episode reward: [(0, '754.163')] [2023-03-06 15:18:09,724][04272] Updated weights for policy 0, policy_version 37430 (0.0006) [2023-03-06 15:18:10,523][04272] Updated weights for policy 0, policy_version 37440 (0.0006) [2023-03-06 15:18:11,324][04272] Updated weights for policy 0, policy_version 37450 (0.0006) [2023-03-06 15:18:12,141][04272] Updated weights for policy 0, policy_version 37460 (0.0008) [2023-03-06 15:18:12,631][04221] KL-divergence is very high: 109.3629 [2023-03-06 15:18:12,954][04272] Updated weights for policy 0, policy_version 37470 (0.0007) [2023-03-06 15:18:13,753][04272] Updated weights for policy 0, policy_version 37480 (0.0006) [2023-03-06 15:18:13,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12646.4, 300 sec: 12617.8). Total num frames: 38381568. Throughput: 0: 12640.6. Samples: 38360784. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:18:13,941][03942] Avg episode reward: [(0, '724.294')] [2023-03-06 15:18:14,553][04272] Updated weights for policy 0, policy_version 37490 (0.0006) [2023-03-06 15:18:15,371][04272] Updated weights for policy 0, policy_version 37500 (0.0006) [2023-03-06 15:18:16,168][04272] Updated weights for policy 0, policy_version 37510 (0.0006) [2023-03-06 15:18:16,995][04272] Updated weights for policy 0, policy_version 37520 (0.0006) [2023-03-06 15:18:17,797][04272] Updated weights for policy 0, policy_version 37530 (0.0007) [2023-03-06 15:18:18,602][04272] Updated weights for policy 0, policy_version 37540 (0.0007) [2023-03-06 15:18:18,941][03942] Fps is (10 sec: 12697.7, 60 sec: 12646.4, 300 sec: 12617.8). Total num frames: 38445056. Throughput: 0: 12645.4. Samples: 38436844. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:18:18,941][03942] Avg episode reward: [(0, '809.315')] [2023-03-06 15:18:19,405][04272] Updated weights for policy 0, policy_version 37550 (0.0007) [2023-03-06 15:18:20,220][04272] Updated weights for policy 0, policy_version 37560 (0.0006) [2023-03-06 15:18:21,030][04272] Updated weights for policy 0, policy_version 37570 (0.0007) [2023-03-06 15:18:21,836][04272] Updated weights for policy 0, policy_version 37580 (0.0006) [2023-03-06 15:18:22,636][04272] Updated weights for policy 0, policy_version 37590 (0.0005) [2023-03-06 15:18:23,455][04272] Updated weights for policy 0, policy_version 37600 (0.0006) [2023-03-06 15:18:23,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12646.4, 300 sec: 12617.8). Total num frames: 38508544. Throughput: 0: 12648.3. Samples: 38474943. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:18:23,941][03942] Avg episode reward: [(0, '785.799')] [2023-03-06 15:18:24,265][04272] Updated weights for policy 0, policy_version 37610 (0.0007) [2023-03-06 15:18:25,065][04272] Updated weights for policy 0, policy_version 37620 (0.0006) [2023-03-06 15:18:25,883][04272] Updated weights for policy 0, policy_version 37630 (0.0007) [2023-03-06 15:18:26,696][04272] Updated weights for policy 0, policy_version 37640 (0.0006) [2023-03-06 15:18:27,527][04272] Updated weights for policy 0, policy_version 37650 (0.0007) [2023-03-06 15:18:28,326][04272] Updated weights for policy 0, policy_version 37660 (0.0007) [2023-03-06 15:18:28,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12646.4, 300 sec: 12617.8). Total num frames: 38571008. Throughput: 0: 12634.7. Samples: 38550581. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:18:28,941][03942] Avg episode reward: [(0, '924.273')] [2023-03-06 15:18:29,147][04272] Updated weights for policy 0, policy_version 37670 (0.0007) [2023-03-06 15:18:29,960][04272] Updated weights for policy 0, policy_version 37680 (0.0006) [2023-03-06 15:18:30,768][04272] Updated weights for policy 0, policy_version 37690 (0.0006) [2023-03-06 15:18:31,566][04272] Updated weights for policy 0, policy_version 37700 (0.0005) [2023-03-06 15:18:32,389][04272] Updated weights for policy 0, policy_version 37710 (0.0006) [2023-03-06 15:18:33,203][04272] Updated weights for policy 0, policy_version 37720 (0.0006) [2023-03-06 15:18:33,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12646.4, 300 sec: 12617.8). Total num frames: 38634496. Throughput: 0: 12634.0. Samples: 38626375. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:18:33,941][03942] Avg episode reward: [(0, '847.658')] [2023-03-06 15:18:34,006][04272] Updated weights for policy 0, policy_version 37730 (0.0007) [2023-03-06 15:18:34,830][04272] Updated weights for policy 0, policy_version 37740 (0.0006) [2023-03-06 15:18:35,618][04272] Updated weights for policy 0, policy_version 37750 (0.0006) [2023-03-06 15:18:36,430][04272] Updated weights for policy 0, policy_version 37760 (0.0006) [2023-03-06 15:18:37,247][04272] Updated weights for policy 0, policy_version 37770 (0.0006) [2023-03-06 15:18:38,067][04272] Updated weights for policy 0, policy_version 37780 (0.0006) [2023-03-06 15:18:38,875][04272] Updated weights for policy 0, policy_version 37790 (0.0007) [2023-03-06 15:18:38,940][03942] Fps is (10 sec: 12595.4, 60 sec: 12629.3, 300 sec: 12614.3). Total num frames: 38696960. Throughput: 0: 12632.9. Samples: 38664336. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:18:38,941][03942] Avg episode reward: [(0, '853.567')] [2023-03-06 15:18:39,029][04221] KL-divergence is very high: 193.1759 [2023-03-06 15:18:39,701][04272] Updated weights for policy 0, policy_version 37800 (0.0007) [2023-03-06 15:18:40,501][04272] Updated weights for policy 0, policy_version 37810 (0.0006) [2023-03-06 15:18:41,301][04272] Updated weights for policy 0, policy_version 37820 (0.0006) [2023-03-06 15:18:42,098][04272] Updated weights for policy 0, policy_version 37830 (0.0006) [2023-03-06 15:18:42,915][04272] Updated weights for policy 0, policy_version 37840 (0.0006) [2023-03-06 15:18:43,717][04272] Updated weights for policy 0, policy_version 37850 (0.0007) [2023-03-06 15:18:43,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12629.4, 300 sec: 12614.3). Total num frames: 38760448. Throughput: 0: 12630.8. Samples: 38740100. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:18:43,941][03942] Avg episode reward: [(0, '746.753')] [2023-03-06 15:18:44,529][04272] Updated weights for policy 0, policy_version 37860 (0.0006) [2023-03-06 15:18:45,341][04272] Updated weights for policy 0, policy_version 37870 (0.0006) [2023-03-06 15:18:45,653][04221] KL-divergence is very high: 299.5289 [2023-03-06 15:18:46,150][04272] Updated weights for policy 0, policy_version 37880 (0.0006) [2023-03-06 15:18:46,979][04272] Updated weights for policy 0, policy_version 37890 (0.0006) [2023-03-06 15:18:47,759][04272] Updated weights for policy 0, policy_version 37900 (0.0007) [2023-03-06 15:18:47,858][04221] KL-divergence is very high: 1778.2095 [2023-03-06 15:18:48,576][04272] Updated weights for policy 0, policy_version 37910 (0.0006) [2023-03-06 15:18:48,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12617.8). Total num frames: 38823936. Throughput: 0: 12637.6. Samples: 38816004. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:18:48,941][03942] Avg episode reward: [(0, '710.634')] [2023-03-06 15:18:49,398][04272] Updated weights for policy 0, policy_version 37920 (0.0006) [2023-03-06 15:18:50,211][04272] Updated weights for policy 0, policy_version 37930 (0.0006) [2023-03-06 15:18:51,035][04272] Updated weights for policy 0, policy_version 37940 (0.0007) [2023-03-06 15:18:51,836][04272] Updated weights for policy 0, policy_version 37950 (0.0007) [2023-03-06 15:18:52,636][04272] Updated weights for policy 0, policy_version 37960 (0.0007) [2023-03-06 15:18:53,448][04272] Updated weights for policy 0, policy_version 37970 (0.0007) [2023-03-06 15:18:53,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12629.3, 300 sec: 12614.3). Total num frames: 38886400. Throughput: 0: 12639.6. Samples: 38853751. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:18:53,941][03942] Avg episode reward: [(0, '690.717')] [2023-03-06 15:18:54,031][04221] KL-divergence is very high: 188.1727 [2023-03-06 15:18:54,289][04272] Updated weights for policy 0, policy_version 37980 (0.0006) [2023-03-06 15:18:55,113][04272] Updated weights for policy 0, policy_version 37990 (0.0007) [2023-03-06 15:18:55,908][04272] Updated weights for policy 0, policy_version 38000 (0.0008) [2023-03-06 15:18:56,751][04272] Updated weights for policy 0, policy_version 38010 (0.0006) [2023-03-06 15:18:57,553][04272] Updated weights for policy 0, policy_version 38020 (0.0006) [2023-03-06 15:18:58,355][04272] Updated weights for policy 0, policy_version 38030 (0.0007) [2023-03-06 15:18:58,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12629.3, 300 sec: 12617.8). Total num frames: 38949888. Throughput: 0: 12628.7. Samples: 38929077. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:18:58,941][03942] Avg episode reward: [(0, '646.924')] [2023-03-06 15:18:59,152][04272] Updated weights for policy 0, policy_version 38040 (0.0006) [2023-03-06 15:18:59,956][04272] Updated weights for policy 0, policy_version 38050 (0.0007) [2023-03-06 15:19:00,780][04272] Updated weights for policy 0, policy_version 38060 (0.0006) [2023-03-06 15:19:01,594][04272] Updated weights for policy 0, policy_version 38070 (0.0006) [2023-03-06 15:19:02,399][04272] Updated weights for policy 0, policy_version 38080 (0.0007) [2023-03-06 15:19:03,211][04272] Updated weights for policy 0, policy_version 38090 (0.0006) [2023-03-06 15:19:03,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12646.4, 300 sec: 12617.8). Total num frames: 39013376. Throughput: 0: 12626.4. Samples: 39005032. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:19:03,941][03942] Avg episode reward: [(0, '618.438')] [2023-03-06 15:19:04,018][04272] Updated weights for policy 0, policy_version 38100 (0.0006) [2023-03-06 15:19:04,831][04272] Updated weights for policy 0, policy_version 38110 (0.0006) [2023-03-06 15:19:05,650][04272] Updated weights for policy 0, policy_version 38120 (0.0006) [2023-03-06 15:19:06,465][04272] Updated weights for policy 0, policy_version 38130 (0.0007) [2023-03-06 15:19:07,296][04272] Updated weights for policy 0, policy_version 38140 (0.0006) [2023-03-06 15:19:08,099][04272] Updated weights for policy 0, policy_version 38150 (0.0006) [2023-03-06 15:19:08,895][04272] Updated weights for policy 0, policy_version 38160 (0.0006) [2023-03-06 15:19:08,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12617.8). Total num frames: 39075840. Throughput: 0: 12618.4. Samples: 39042774. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:19:08,941][03942] Avg episode reward: [(0, '647.583')] [2023-03-06 15:19:08,945][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000038160_39075840.pth... [2023-03-06 15:19:08,977][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000035203_36047872.pth [2023-03-06 15:19:09,730][04272] Updated weights for policy 0, policy_version 38170 (0.0006) [2023-03-06 15:19:10,542][04272] Updated weights for policy 0, policy_version 38180 (0.0006) [2023-03-06 15:19:11,364][04272] Updated weights for policy 0, policy_version 38190 (0.0006) [2023-03-06 15:19:12,167][04272] Updated weights for policy 0, policy_version 38200 (0.0007) [2023-03-06 15:19:12,994][04272] Updated weights for policy 0, policy_version 38210 (0.0006) [2023-03-06 15:19:13,797][04272] Updated weights for policy 0, policy_version 38220 (0.0006) [2023-03-06 15:19:13,941][03942] Fps is (10 sec: 12492.7, 60 sec: 12612.2, 300 sec: 12614.3). Total num frames: 39138304. Throughput: 0: 12610.5. Samples: 39118054. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:19:13,941][03942] Avg episode reward: [(0, '801.713')] [2023-03-06 15:19:14,623][04272] Updated weights for policy 0, policy_version 38230 (0.0007) [2023-03-06 15:19:15,451][04272] Updated weights for policy 0, policy_version 38240 (0.0007) [2023-03-06 15:19:16,259][04272] Updated weights for policy 0, policy_version 38250 (0.0005) [2023-03-06 15:19:17,051][04272] Updated weights for policy 0, policy_version 38260 (0.0008) [2023-03-06 15:19:17,886][04272] Updated weights for policy 0, policy_version 38270 (0.0006) [2023-03-06 15:19:18,701][04272] Updated weights for policy 0, policy_version 38280 (0.0006) [2023-03-06 15:19:18,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12614.3). Total num frames: 39201792. Throughput: 0: 12599.7. Samples: 39193363. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:19:18,941][03942] Avg episode reward: [(0, '874.168')] [2023-03-06 15:19:19,485][04272] Updated weights for policy 0, policy_version 38290 (0.0006) [2023-03-06 15:19:20,310][04272] Updated weights for policy 0, policy_version 38300 (0.0006) [2023-03-06 15:19:21,115][04272] Updated weights for policy 0, policy_version 38310 (0.0006) [2023-03-06 15:19:21,922][04272] Updated weights for policy 0, policy_version 38320 (0.0006) [2023-03-06 15:19:22,757][04272] Updated weights for policy 0, policy_version 38330 (0.0007) [2023-03-06 15:19:23,575][04272] Updated weights for policy 0, policy_version 38340 (0.0006) [2023-03-06 15:19:23,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12614.3). Total num frames: 39264256. Throughput: 0: 12604.7. Samples: 39231546. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:19:23,941][03942] Avg episode reward: [(0, '823.916')] [2023-03-06 15:19:24,406][04272] Updated weights for policy 0, policy_version 38350 (0.0006) [2023-03-06 15:19:25,204][04272] Updated weights for policy 0, policy_version 38360 (0.0006) [2023-03-06 15:19:26,009][04272] Updated weights for policy 0, policy_version 38370 (0.0007) [2023-03-06 15:19:26,830][04272] Updated weights for policy 0, policy_version 38380 (0.0006) [2023-03-06 15:19:27,637][04272] Updated weights for policy 0, policy_version 38390 (0.0007) [2023-03-06 15:19:28,442][04272] Updated weights for policy 0, policy_version 38400 (0.0006) [2023-03-06 15:19:28,941][03942] Fps is (10 sec: 12492.8, 60 sec: 12595.2, 300 sec: 12610.8). Total num frames: 39326720. Throughput: 0: 12596.3. Samples: 39306937. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:19:28,941][03942] Avg episode reward: [(0, '784.402')] [2023-03-06 15:19:29,256][04272] Updated weights for policy 0, policy_version 38410 (0.0008) [2023-03-06 15:19:30,068][04272] Updated weights for policy 0, policy_version 38420 (0.0007) [2023-03-06 15:19:30,883][04272] Updated weights for policy 0, policy_version 38430 (0.0006) [2023-03-06 15:19:31,672][04272] Updated weights for policy 0, policy_version 38440 (0.0005) [2023-03-06 15:19:32,509][04272] Updated weights for policy 0, policy_version 38450 (0.0006) [2023-03-06 15:19:33,327][04272] Updated weights for policy 0, policy_version 38460 (0.0006) [2023-03-06 15:19:33,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12610.8). Total num frames: 39390208. Throughput: 0: 12583.5. Samples: 39382261. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:19:33,941][03942] Avg episode reward: [(0, '929.106')] [2023-03-06 15:19:34,141][04272] Updated weights for policy 0, policy_version 38470 (0.0006) [2023-03-06 15:19:34,937][04272] Updated weights for policy 0, policy_version 38480 (0.0007) [2023-03-06 15:19:35,781][04272] Updated weights for policy 0, policy_version 38490 (0.0007) [2023-03-06 15:19:36,573][04272] Updated weights for policy 0, policy_version 38500 (0.0006) [2023-03-06 15:19:37,376][04272] Updated weights for policy 0, policy_version 38510 (0.0006) [2023-03-06 15:19:38,203][04272] Updated weights for policy 0, policy_version 38520 (0.0006) [2023-03-06 15:19:38,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12612.2, 300 sec: 12614.3). Total num frames: 39453696. Throughput: 0: 12586.6. Samples: 39420150. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:19:38,941][03942] Avg episode reward: [(0, '971.249')] [2023-03-06 15:19:39,006][04272] Updated weights for policy 0, policy_version 38530 (0.0007) [2023-03-06 15:19:39,082][04221] KL-divergence is very high: 1595.2343 [2023-03-06 15:19:39,837][04272] Updated weights for policy 0, policy_version 38540 (0.0006) [2023-03-06 15:19:40,640][04272] Updated weights for policy 0, policy_version 38550 (0.0007) [2023-03-06 15:19:41,466][04272] Updated weights for policy 0, policy_version 38560 (0.0006) [2023-03-06 15:19:42,271][04272] Updated weights for policy 0, policy_version 38570 (0.0006) [2023-03-06 15:19:43,084][04272] Updated weights for policy 0, policy_version 38580 (0.0007) [2023-03-06 15:19:43,907][04272] Updated weights for policy 0, policy_version 38590 (0.0006) [2023-03-06 15:19:43,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12610.8). Total num frames: 39516160. Throughput: 0: 12593.6. Samples: 39495787. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:19:43,941][03942] Avg episode reward: [(0, '733.365')] [2023-03-06 15:19:44,711][04272] Updated weights for policy 0, policy_version 38600 (0.0007) [2023-03-06 15:19:45,529][04272] Updated weights for policy 0, policy_version 38610 (0.0006) [2023-03-06 15:19:46,357][04272] Updated weights for policy 0, policy_version 38620 (0.0007) [2023-03-06 15:19:47,159][04272] Updated weights for policy 0, policy_version 38630 (0.0006) [2023-03-06 15:19:47,968][04272] Updated weights for policy 0, policy_version 38640 (0.0007) [2023-03-06 15:19:48,787][04272] Updated weights for policy 0, policy_version 38650 (0.0006) [2023-03-06 15:19:48,941][03942] Fps is (10 sec: 12492.9, 60 sec: 12578.1, 300 sec: 12610.8). Total num frames: 39578624. Throughput: 0: 12582.4. Samples: 39571242. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:19:48,941][03942] Avg episode reward: [(0, '866.923')] [2023-03-06 15:19:49,604][04272] Updated weights for policy 0, policy_version 38660 (0.0007) [2023-03-06 15:19:50,416][04272] Updated weights for policy 0, policy_version 38670 (0.0007) [2023-03-06 15:19:51,230][04272] Updated weights for policy 0, policy_version 38680 (0.0006) [2023-03-06 15:19:52,039][04272] Updated weights for policy 0, policy_version 38690 (0.0006) [2023-03-06 15:19:52,850][04272] Updated weights for policy 0, policy_version 38700 (0.0006) [2023-03-06 15:19:53,663][04272] Updated weights for policy 0, policy_version 38710 (0.0006) [2023-03-06 15:19:53,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12610.8). Total num frames: 39642112. Throughput: 0: 12581.3. Samples: 39608931. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:19:53,941][03942] Avg episode reward: [(0, '814.765')] [2023-03-06 15:19:54,472][04272] Updated weights for policy 0, policy_version 38720 (0.0006) [2023-03-06 15:19:55,270][04272] Updated weights for policy 0, policy_version 38730 (0.0006) [2023-03-06 15:19:56,079][04272] Updated weights for policy 0, policy_version 38740 (0.0007) [2023-03-06 15:19:56,886][04272] Updated weights for policy 0, policy_version 38750 (0.0006) [2023-03-06 15:19:57,689][04272] Updated weights for policy 0, policy_version 38760 (0.0006) [2023-03-06 15:19:58,502][04272] Updated weights for policy 0, policy_version 38770 (0.0006) [2023-03-06 15:19:58,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12595.2, 300 sec: 12614.3). Total num frames: 39705600. Throughput: 0: 12599.4. Samples: 39685026. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:19:58,941][03942] Avg episode reward: [(0, '882.201')] [2023-03-06 15:19:59,321][04272] Updated weights for policy 0, policy_version 38780 (0.0006) [2023-03-06 15:20:00,128][04272] Updated weights for policy 0, policy_version 38790 (0.0006) [2023-03-06 15:20:00,948][04272] Updated weights for policy 0, policy_version 38800 (0.0007) [2023-03-06 15:20:01,742][04272] Updated weights for policy 0, policy_version 38810 (0.0006) [2023-03-06 15:20:02,545][04272] Updated weights for policy 0, policy_version 38820 (0.0008) [2023-03-06 15:20:03,359][04272] Updated weights for policy 0, policy_version 38830 (0.0006) [2023-03-06 15:20:03,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12578.1, 300 sec: 12610.8). Total num frames: 39768064. Throughput: 0: 12611.0. Samples: 39760858. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:20:03,941][03942] Avg episode reward: [(0, '838.202')] [2023-03-06 15:20:04,186][04272] Updated weights for policy 0, policy_version 38840 (0.0006) [2023-03-06 15:20:04,993][04272] Updated weights for policy 0, policy_version 38850 (0.0007) [2023-03-06 15:20:05,805][04272] Updated weights for policy 0, policy_version 38860 (0.0007) [2023-03-06 15:20:06,623][04272] Updated weights for policy 0, policy_version 38870 (0.0006) [2023-03-06 15:20:07,432][04272] Updated weights for policy 0, policy_version 38880 (0.0006) [2023-03-06 15:20:08,247][04272] Updated weights for policy 0, policy_version 38890 (0.0006) [2023-03-06 15:20:08,941][03942] Fps is (10 sec: 12595.0, 60 sec: 12595.2, 300 sec: 12614.3). Total num frames: 39831552. Throughput: 0: 12599.4. Samples: 39798519. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:20:08,941][03942] Avg episode reward: [(0, '885.572')] [2023-03-06 15:20:09,064][04272] Updated weights for policy 0, policy_version 38900 (0.0006) [2023-03-06 15:20:09,878][04272] Updated weights for policy 0, policy_version 38910 (0.0007) [2023-03-06 15:20:10,692][04272] Updated weights for policy 0, policy_version 38920 (0.0006) [2023-03-06 15:20:11,508][04272] Updated weights for policy 0, policy_version 38930 (0.0006) [2023-03-06 15:20:12,325][04272] Updated weights for policy 0, policy_version 38940 (0.0006) [2023-03-06 15:20:13,128][04272] Updated weights for policy 0, policy_version 38950 (0.0007) [2023-03-06 15:20:13,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12612.3, 300 sec: 12614.3). Total num frames: 39895040. Throughput: 0: 12601.6. Samples: 39874009. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:20:13,941][03942] Avg episode reward: [(0, '844.089')] [2023-03-06 15:20:13,941][04272] Updated weights for policy 0, policy_version 38960 (0.0007) [2023-03-06 15:20:14,755][04272] Updated weights for policy 0, policy_version 38970 (0.0007) [2023-03-06 15:20:15,547][04272] Updated weights for policy 0, policy_version 38980 (0.0007) [2023-03-06 15:20:16,370][04272] Updated weights for policy 0, policy_version 38990 (0.0006) [2023-03-06 15:20:17,199][04272] Updated weights for policy 0, policy_version 39000 (0.0006) [2023-03-06 15:20:17,997][04272] Updated weights for policy 0, policy_version 39010 (0.0006) [2023-03-06 15:20:18,806][04272] Updated weights for policy 0, policy_version 39020 (0.0006) [2023-03-06 15:20:18,941][03942] Fps is (10 sec: 12595.4, 60 sec: 12595.2, 300 sec: 12614.3). Total num frames: 39957504. Throughput: 0: 12609.1. Samples: 39949669. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:20:18,941][03942] Avg episode reward: [(0, '795.198')] [2023-03-06 15:20:19,616][04272] Updated weights for policy 0, policy_version 39030 (0.0007) [2023-03-06 15:20:20,428][04272] Updated weights for policy 0, policy_version 39040 (0.0006) [2023-03-06 15:20:21,242][04272] Updated weights for policy 0, policy_version 39050 (0.0006) [2023-03-06 15:20:22,061][04272] Updated weights for policy 0, policy_version 39060 (0.0007) [2023-03-06 15:20:22,858][04272] Updated weights for policy 0, policy_version 39070 (0.0006) [2023-03-06 15:20:23,672][04272] Updated weights for policy 0, policy_version 39080 (0.0006) [2023-03-06 15:20:23,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12617.8). Total num frames: 40020992. Throughput: 0: 12612.1. Samples: 39987692. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:20:23,941][03942] Avg episode reward: [(0, '902.913')] [2023-03-06 15:20:24,482][04272] Updated weights for policy 0, policy_version 39090 (0.0006) [2023-03-06 15:20:25,274][04272] Updated weights for policy 0, policy_version 39100 (0.0006) [2023-03-06 15:20:26,085][04272] Updated weights for policy 0, policy_version 39110 (0.0007) [2023-03-06 15:20:26,909][04272] Updated weights for policy 0, policy_version 39120 (0.0006) [2023-03-06 15:20:27,724][04272] Updated weights for policy 0, policy_version 39130 (0.0006) [2023-03-06 15:20:28,525][04272] Updated weights for policy 0, policy_version 39140 (0.0006) [2023-03-06 15:20:28,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12614.3). Total num frames: 40083456. Throughput: 0: 12617.4. Samples: 40063572. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:20:28,941][03942] Avg episode reward: [(0, '827.989')] [2023-03-06 15:20:29,351][04272] Updated weights for policy 0, policy_version 39150 (0.0006) [2023-03-06 15:20:30,148][04272] Updated weights for policy 0, policy_version 39160 (0.0006) [2023-03-06 15:20:30,949][04272] Updated weights for policy 0, policy_version 39170 (0.0006) [2023-03-06 15:20:31,762][04272] Updated weights for policy 0, policy_version 39180 (0.0006) [2023-03-06 15:20:32,592][04272] Updated weights for policy 0, policy_version 39190 (0.0006) [2023-03-06 15:20:33,399][04272] Updated weights for policy 0, policy_version 39200 (0.0006) [2023-03-06 15:20:33,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12617.8). Total num frames: 40146944. Throughput: 0: 12620.8. Samples: 40139176. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:20:33,941][03942] Avg episode reward: [(0, '970.333')] [2023-03-06 15:20:34,211][04272] Updated weights for policy 0, policy_version 39210 (0.0006) [2023-03-06 15:20:35,041][04272] Updated weights for policy 0, policy_version 39220 (0.0006) [2023-03-06 15:20:35,856][04272] Updated weights for policy 0, policy_version 39230 (0.0006) [2023-03-06 15:20:36,649][04272] Updated weights for policy 0, policy_version 39240 (0.0006) [2023-03-06 15:20:37,469][04272] Updated weights for policy 0, policy_version 39250 (0.0006) [2023-03-06 15:20:38,256][04272] Updated weights for policy 0, policy_version 39260 (0.0006) [2023-03-06 15:20:38,941][03942] Fps is (10 sec: 12697.7, 60 sec: 12612.3, 300 sec: 12617.8). Total num frames: 40210432. Throughput: 0: 12624.0. Samples: 40177009. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:20:38,941][03942] Avg episode reward: [(0, '962.781')] [2023-03-06 15:20:39,067][04272] Updated weights for policy 0, policy_version 39270 (0.0006) [2023-03-06 15:20:39,886][04272] Updated weights for policy 0, policy_version 39280 (0.0006) [2023-03-06 15:20:40,688][04272] Updated weights for policy 0, policy_version 39290 (0.0006) [2023-03-06 15:20:41,513][04272] Updated weights for policy 0, policy_version 39300 (0.0006) [2023-03-06 15:20:42,318][04272] Updated weights for policy 0, policy_version 39310 (0.0007) [2023-03-06 15:20:43,119][04272] Updated weights for policy 0, policy_version 39320 (0.0006) [2023-03-06 15:20:43,933][04272] Updated weights for policy 0, policy_version 39330 (0.0007) [2023-03-06 15:20:43,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12629.3, 300 sec: 12621.2). Total num frames: 40273920. Throughput: 0: 12617.8. Samples: 40252826. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:20:43,941][03942] Avg episode reward: [(0, '877.369')] [2023-03-06 15:20:44,762][04272] Updated weights for policy 0, policy_version 39340 (0.0006) [2023-03-06 15:20:45,571][04272] Updated weights for policy 0, policy_version 39350 (0.0006) [2023-03-06 15:20:46,372][04272] Updated weights for policy 0, policy_version 39360 (0.0006) [2023-03-06 15:20:47,180][04272] Updated weights for policy 0, policy_version 39370 (0.0006) [2023-03-06 15:20:47,997][04272] Updated weights for policy 0, policy_version 39380 (0.0006) [2023-03-06 15:20:48,805][04272] Updated weights for policy 0, policy_version 39390 (0.0006) [2023-03-06 15:20:48,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12617.8). Total num frames: 40336384. Throughput: 0: 12621.2. Samples: 40328813. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:20:48,941][03942] Avg episode reward: [(0, '890.142')] [2023-03-06 15:20:49,605][04272] Updated weights for policy 0, policy_version 39400 (0.0007) [2023-03-06 15:20:50,432][04272] Updated weights for policy 0, policy_version 39410 (0.0006) [2023-03-06 15:20:51,223][04272] Updated weights for policy 0, policy_version 39420 (0.0007) [2023-03-06 15:20:52,046][04272] Updated weights for policy 0, policy_version 39430 (0.0006) [2023-03-06 15:20:52,856][04272] Updated weights for policy 0, policy_version 39440 (0.0006) [2023-03-06 15:20:53,667][04272] Updated weights for policy 0, policy_version 39450 (0.0007) [2023-03-06 15:20:53,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12621.2). Total num frames: 40399872. Throughput: 0: 12627.1. Samples: 40366736. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:20:53,941][03942] Avg episode reward: [(0, '818.266')] [2023-03-06 15:20:54,482][04272] Updated weights for policy 0, policy_version 39460 (0.0006) [2023-03-06 15:20:55,294][04272] Updated weights for policy 0, policy_version 39470 (0.0006) [2023-03-06 15:20:56,078][04272] Updated weights for policy 0, policy_version 39480 (0.0006) [2023-03-06 15:20:56,906][04272] Updated weights for policy 0, policy_version 39490 (0.0006) [2023-03-06 15:20:57,705][04272] Updated weights for policy 0, policy_version 39500 (0.0006) [2023-03-06 15:20:58,501][04272] Updated weights for policy 0, policy_version 39510 (0.0006) [2023-03-06 15:20:58,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12629.3, 300 sec: 12621.2). Total num frames: 40463360. Throughput: 0: 12637.0. Samples: 40442677. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:20:58,952][03942] Avg episode reward: [(0, '876.636')] [2023-03-06 15:20:59,312][04272] Updated weights for policy 0, policy_version 39520 (0.0006) [2023-03-06 15:21:00,121][04272] Updated weights for policy 0, policy_version 39530 (0.0006) [2023-03-06 15:21:00,938][04272] Updated weights for policy 0, policy_version 39540 (0.0006) [2023-03-06 15:21:01,744][04272] Updated weights for policy 0, policy_version 39550 (0.0006) [2023-03-06 15:21:02,556][04272] Updated weights for policy 0, policy_version 39560 (0.0007) [2023-03-06 15:21:03,354][04272] Updated weights for policy 0, policy_version 39570 (0.0006) [2023-03-06 15:21:03,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12646.4, 300 sec: 12624.7). Total num frames: 40526848. Throughput: 0: 12644.8. Samples: 40518686. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:21:03,951][03942] Avg episode reward: [(0, '1057.545')] [2023-03-06 15:21:04,165][04272] Updated weights for policy 0, policy_version 39580 (0.0006) [2023-03-06 15:21:04,967][04272] Updated weights for policy 0, policy_version 39590 (0.0006) [2023-03-06 15:21:05,767][04272] Updated weights for policy 0, policy_version 39600 (0.0006) [2023-03-06 15:21:06,585][04272] Updated weights for policy 0, policy_version 39610 (0.0006) [2023-03-06 15:21:07,410][04272] Updated weights for policy 0, policy_version 39620 (0.0006) [2023-03-06 15:21:08,198][04272] Updated weights for policy 0, policy_version 39630 (0.0006) [2023-03-06 15:21:08,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.4, 300 sec: 12624.7). Total num frames: 40589312. Throughput: 0: 12648.0. Samples: 40556851. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:21:08,952][03942] Avg episode reward: [(0, '849.006')] [2023-03-06 15:21:08,964][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000039639_40590336.pth... 
[2023-03-06 15:21:08,995][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000036680_37560320.pth [2023-03-06 15:21:09,028][04272] Updated weights for policy 0, policy_version 39640 (0.0006) [2023-03-06 15:21:09,833][04272] Updated weights for policy 0, policy_version 39650 (0.0006) [2023-03-06 15:21:10,656][04272] Updated weights for policy 0, policy_version 39660 (0.0006) [2023-03-06 15:21:11,477][04272] Updated weights for policy 0, policy_version 39670 (0.0006) [2023-03-06 15:21:12,272][04272] Updated weights for policy 0, policy_version 39680 (0.0006) [2023-03-06 15:21:13,079][04272] Updated weights for policy 0, policy_version 39690 (0.0006) [2023-03-06 15:21:13,900][04272] Updated weights for policy 0, policy_version 39700 (0.0006) [2023-03-06 15:21:13,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12624.7). Total num frames: 40652800. Throughput: 0: 12639.1. Samples: 40632331. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:21:13,951][03942] Avg episode reward: [(0, '953.917')] [2023-03-06 15:21:14,706][04272] Updated weights for policy 0, policy_version 39710 (0.0006) [2023-03-06 15:21:15,534][04272] Updated weights for policy 0, policy_version 39720 (0.0007) [2023-03-06 15:21:16,330][04272] Updated weights for policy 0, policy_version 39730 (0.0007) [2023-03-06 15:21:17,136][04272] Updated weights for policy 0, policy_version 39740 (0.0006) [2023-03-06 15:21:17,958][04272] Updated weights for policy 0, policy_version 39750 (0.0006) [2023-03-06 15:21:18,785][04272] Updated weights for policy 0, policy_version 39760 (0.0007) [2023-03-06 15:21:18,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12646.4, 300 sec: 12624.7). Total num frames: 40716288. Throughput: 0: 12645.1. Samples: 40708204. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:21:18,951][03942] Avg episode reward: [(0, '1136.969')] [2023-03-06 15:21:19,593][04272] Updated weights for policy 0, policy_version 39770 (0.0007) [2023-03-06 15:21:20,423][04272] Updated weights for policy 0, policy_version 39780 (0.0006) [2023-03-06 15:21:21,230][04272] Updated weights for policy 0, policy_version 39790 (0.0007) [2023-03-06 15:21:22,042][04272] Updated weights for policy 0, policy_version 39800 (0.0007) [2023-03-06 15:21:22,838][04272] Updated weights for policy 0, policy_version 39810 (0.0006) [2023-03-06 15:21:23,642][04272] Updated weights for policy 0, policy_version 39820 (0.0008) [2023-03-06 15:21:23,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12629.3, 300 sec: 12621.2). Total num frames: 40778752. Throughput: 0: 12639.5. Samples: 40745786. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:21:23,952][03942] Avg episode reward: [(0, '1021.125')] [2023-03-06 15:21:24,467][04272] Updated weights for policy 0, policy_version 39830 (0.0006) [2023-03-06 15:21:25,265][04272] Updated weights for policy 0, policy_version 39840 (0.0006) [2023-03-06 15:21:26,080][04272] Updated weights for policy 0, policy_version 39850 (0.0007) [2023-03-06 15:21:26,895][04272] Updated weights for policy 0, policy_version 39860 (0.0007) [2023-03-06 15:21:27,703][04272] Updated weights for policy 0, policy_version 39870 (0.0006) [2023-03-06 15:21:28,510][04272] Updated weights for policy 0, policy_version 39880 (0.0006) [2023-03-06 15:21:28,940][03942] Fps is (10 sec: 12595.1, 60 sec: 12646.4, 300 sec: 12624.7). Total num frames: 40842240. Throughput: 0: 12638.9. Samples: 40821578. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:21:28,951][03942] Avg episode reward: [(0, '1134.102')] [2023-03-06 15:21:29,346][04272] Updated weights for policy 0, policy_version 39890 (0.0008) [2023-03-06 15:21:30,167][04272] Updated weights for policy 0, policy_version 39900 (0.0006) [2023-03-06 15:21:30,985][04272] Updated weights for policy 0, policy_version 39910 (0.0006) [2023-03-06 15:21:31,783][04272] Updated weights for policy 0, policy_version 39920 (0.0006) [2023-03-06 15:21:32,595][04272] Updated weights for policy 0, policy_version 39930 (0.0006) [2023-03-06 15:21:33,406][04272] Updated weights for policy 0, policy_version 39940 (0.0006) [2023-03-06 15:21:33,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12629.3, 300 sec: 12621.2). Total num frames: 40904704. Throughput: 0: 12625.9. Samples: 40896979. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:21:33,941][03942] Avg episode reward: [(0, '1027.704')] [2023-03-06 15:21:34,190][04272] Updated weights for policy 0, policy_version 39950 (0.0006) [2023-03-06 15:21:35,013][04272] Updated weights for policy 0, policy_version 39960 (0.0006) [2023-03-06 15:21:35,834][04272] Updated weights for policy 0, policy_version 39970 (0.0006) [2023-03-06 15:21:36,647][04272] Updated weights for policy 0, policy_version 39980 (0.0007) [2023-03-06 15:21:37,470][04272] Updated weights for policy 0, policy_version 39990 (0.0006) [2023-03-06 15:21:38,281][04272] Updated weights for policy 0, policy_version 40000 (0.0006) [2023-03-06 15:21:38,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12629.3, 300 sec: 12624.7). Total num frames: 40968192. Throughput: 0: 12625.5. Samples: 40934884. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:21:38,941][03942] Avg episode reward: [(0, '1139.424')] [2023-03-06 15:21:39,091][04272] Updated weights for policy 0, policy_version 40010 (0.0006) [2023-03-06 15:21:39,891][04272] Updated weights for policy 0, policy_version 40020 (0.0006) [2023-03-06 15:21:40,706][04272] Updated weights for policy 0, policy_version 40030 (0.0007) [2023-03-06 15:21:41,526][04272] Updated weights for policy 0, policy_version 40040 (0.0006) [2023-03-06 15:21:42,314][04272] Updated weights for policy 0, policy_version 40050 (0.0006) [2023-03-06 15:21:43,123][04272] Updated weights for policy 0, policy_version 40060 (0.0006) [2023-03-06 15:21:43,934][04272] Updated weights for policy 0, policy_version 40070 (0.0007) [2023-03-06 15:21:43,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12624.7). Total num frames: 41031680. Throughput: 0: 12621.4. Samples: 41010641. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:21:43,941][03942] Avg episode reward: [(0, '969.408')] [2023-03-06 15:21:44,758][04272] Updated weights for policy 0, policy_version 40080 (0.0006) [2023-03-06 15:21:45,573][04272] Updated weights for policy 0, policy_version 40090 (0.0006) [2023-03-06 15:21:46,352][04272] Updated weights for policy 0, policy_version 40100 (0.0006) [2023-03-06 15:21:47,204][04272] Updated weights for policy 0, policy_version 40110 (0.0007) [2023-03-06 15:21:47,998][04272] Updated weights for policy 0, policy_version 40120 (0.0007) [2023-03-06 15:21:48,807][04272] Updated weights for policy 0, policy_version 40130 (0.0006) [2023-03-06 15:21:48,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12624.7). Total num frames: 41094144. Throughput: 0: 12617.7. Samples: 41086485. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:21:48,941][03942] Avg episode reward: [(0, '1082.471')] [2023-03-06 15:21:49,616][04272] Updated weights for policy 0, policy_version 40140 (0.0006) [2023-03-06 15:21:50,426][04272] Updated weights for policy 0, policy_version 40150 (0.0006) [2023-03-06 15:21:51,236][04272] Updated weights for policy 0, policy_version 40160 (0.0007) [2023-03-06 15:21:52,053][04272] Updated weights for policy 0, policy_version 40170 (0.0006) [2023-03-06 15:21:52,857][04272] Updated weights for policy 0, policy_version 40180 (0.0006) [2023-03-06 15:21:53,667][04272] Updated weights for policy 0, policy_version 40190 (0.0007) [2023-03-06 15:21:53,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12624.7). Total num frames: 41157632. Throughput: 0: 12612.2. Samples: 41124401. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:21:53,941][03942] Avg episode reward: [(0, '1012.875')] [2023-03-06 15:21:54,485][04272] Updated weights for policy 0, policy_version 40200 (0.0006) [2023-03-06 15:21:55,302][04272] Updated weights for policy 0, policy_version 40210 (0.0007) [2023-03-06 15:21:56,110][04272] Updated weights for policy 0, policy_version 40220 (0.0006) [2023-03-06 15:21:56,931][04272] Updated weights for policy 0, policy_version 40230 (0.0006) [2023-03-06 15:21:57,754][04272] Updated weights for policy 0, policy_version 40240 (0.0006) [2023-03-06 15:21:58,555][04272] Updated weights for policy 0, policy_version 40250 (0.0006) [2023-03-06 15:21:58,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12621.2). Total num frames: 41220096. Throughput: 0: 12611.2. Samples: 41199835. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 15:21:58,941][03942] Avg episode reward: [(0, '1184.065')] [2023-03-06 15:21:59,355][04272] Updated weights for policy 0, policy_version 40260 (0.0006) [2023-03-06 15:22:00,187][04272] Updated weights for policy 0, policy_version 40270 (0.0006) [2023-03-06 15:22:01,002][04272] Updated weights for policy 0, policy_version 40280 (0.0006) [2023-03-06 15:22:01,797][04272] Updated weights for policy 0, policy_version 40290 (0.0006) [2023-03-06 15:22:02,631][04272] Updated weights for policy 0, policy_version 40300 (0.0007) [2023-03-06 15:22:03,438][04272] Updated weights for policy 0, policy_version 40310 (0.0007) [2023-03-06 15:22:03,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12621.2). Total num frames: 41283584. Throughput: 0: 12605.5. Samples: 41275452. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 15:22:03,941][03942] Avg episode reward: [(0, '1086.083')] [2023-03-06 15:22:04,235][04272] Updated weights for policy 0, policy_version 40320 (0.0006) [2023-03-06 15:22:05,055][04272] Updated weights for policy 0, policy_version 40330 (0.0006) [2023-03-06 15:22:05,865][04272] Updated weights for policy 0, policy_version 40340 (0.0006) [2023-03-06 15:22:06,676][04272] Updated weights for policy 0, policy_version 40350 (0.0006) [2023-03-06 15:22:07,487][04272] Updated weights for policy 0, policy_version 40360 (0.0006) [2023-03-06 15:22:08,297][04272] Updated weights for policy 0, policy_version 40370 (0.0006) [2023-03-06 15:22:08,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12621.2). Total num frames: 41346048. Throughput: 0: 12613.0. Samples: 41313371. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 15:22:08,941][03942] Avg episode reward: [(0, '1038.060')] [2023-03-06 15:22:09,124][04272] Updated weights for policy 0, policy_version 40380 (0.0006) [2023-03-06 15:22:09,918][04272] Updated weights for policy 0, policy_version 40390 (0.0006) [2023-03-06 15:22:10,748][04272] Updated weights for policy 0, policy_version 40400 (0.0007) [2023-03-06 15:22:11,546][04272] Updated weights for policy 0, policy_version 40410 (0.0006) [2023-03-06 15:22:12,346][04272] Updated weights for policy 0, policy_version 40420 (0.0006) [2023-03-06 15:22:13,172][04272] Updated weights for policy 0, policy_version 40430 (0.0006) [2023-03-06 15:22:13,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.2, 300 sec: 12621.2). Total num frames: 41409536. Throughput: 0: 12614.4. Samples: 41389229. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 15:22:13,941][03942] Avg episode reward: [(0, '1124.504')] [2023-03-06 15:22:13,970][04272] Updated weights for policy 0, policy_version 40440 (0.0006) [2023-03-06 15:22:14,793][04272] Updated weights for policy 0, policy_version 40450 (0.0006) [2023-03-06 15:22:15,596][04272] Updated weights for policy 0, policy_version 40460 (0.0006) [2023-03-06 15:22:16,419][04272] Updated weights for policy 0, policy_version 40470 (0.0007) [2023-03-06 15:22:17,218][04272] Updated weights for policy 0, policy_version 40480 (0.0006) [2023-03-06 15:22:18,037][04272] Updated weights for policy 0, policy_version 40490 (0.0006) [2023-03-06 15:22:18,856][04272] Updated weights for policy 0, policy_version 40500 (0.0006) [2023-03-06 15:22:18,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12617.8). Total num frames: 41472000. Throughput: 0: 12619.2. Samples: 41464844. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 15:22:18,941][03942] Avg episode reward: [(0, '1143.973')] [2023-03-06 15:22:19,656][04272] Updated weights for policy 0, policy_version 40510 (0.0007) [2023-03-06 15:22:20,471][04272] Updated weights for policy 0, policy_version 40520 (0.0006) [2023-03-06 15:22:21,284][04272] Updated weights for policy 0, policy_version 40530 (0.0006) [2023-03-06 15:22:22,102][04272] Updated weights for policy 0, policy_version 40540 (0.0006) [2023-03-06 15:22:22,905][04272] Updated weights for policy 0, policy_version 40550 (0.0006) [2023-03-06 15:22:23,710][04272] Updated weights for policy 0, policy_version 40560 (0.0007) [2023-03-06 15:22:23,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12621.2). Total num frames: 41535488. Throughput: 0: 12618.2. Samples: 41502701. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:22:23,941][03942] Avg episode reward: [(0, '1181.565')] [2023-03-06 15:22:24,536][04272] Updated weights for policy 0, policy_version 40570 (0.0007) [2023-03-06 15:22:25,366][04272] Updated weights for policy 0, policy_version 40580 (0.0007) [2023-03-06 15:22:26,169][04272] Updated weights for policy 0, policy_version 40590 (0.0006) [2023-03-06 15:22:26,975][04272] Updated weights for policy 0, policy_version 40600 (0.0006) [2023-03-06 15:22:27,802][04272] Updated weights for policy 0, policy_version 40610 (0.0006) [2023-03-06 15:22:28,616][04272] Updated weights for policy 0, policy_version 40620 (0.0006) [2023-03-06 15:22:28,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12612.3, 300 sec: 12621.2). Total num frames: 41598976. Throughput: 0: 12612.1. Samples: 41578184. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:22:28,941][03942] Avg episode reward: [(0, '1131.652')] [2023-03-06 15:22:29,428][04272] Updated weights for policy 0, policy_version 40630 (0.0006) [2023-03-06 15:22:30,270][04272] Updated weights for policy 0, policy_version 40640 (0.0006) [2023-03-06 15:22:31,082][04272] Updated weights for policy 0, policy_version 40650 (0.0006) [2023-03-06 15:22:31,881][04272] Updated weights for policy 0, policy_version 40660 (0.0007) [2023-03-06 15:22:32,689][04272] Updated weights for policy 0, policy_version 40670 (0.0006) [2023-03-06 15:22:33,498][04272] Updated weights for policy 0, policy_version 40680 (0.0005) [2023-03-06 15:22:33,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12617.8). Total num frames: 41661440. Throughput: 0: 12601.5. Samples: 41653554. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:22:33,941][03942] Avg episode reward: [(0, '1226.209')] [2023-03-06 15:22:34,312][04272] Updated weights for policy 0, policy_version 40690 (0.0006) [2023-03-06 15:22:35,116][04272] Updated weights for policy 0, policy_version 40700 (0.0007) [2023-03-06 15:22:35,922][04272] Updated weights for policy 0, policy_version 40710 (0.0006) [2023-03-06 15:22:36,749][04272] Updated weights for policy 0, policy_version 40720 (0.0007) [2023-03-06 15:22:37,567][04272] Updated weights for policy 0, policy_version 40730 (0.0007) [2023-03-06 15:22:38,371][04272] Updated weights for policy 0, policy_version 40740 (0.0006) [2023-03-06 15:22:38,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12617.8). Total num frames: 41724928. Throughput: 0: 12601.9. Samples: 41691486. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:22:38,941][03942] Avg episode reward: [(0, '1161.795')] [2023-03-06 15:22:39,165][04272] Updated weights for policy 0, policy_version 40750 (0.0007) [2023-03-06 15:22:39,978][04272] Updated weights for policy 0, policy_version 40760 (0.0006) [2023-03-06 15:22:40,798][04272] Updated weights for policy 0, policy_version 40770 (0.0006) [2023-03-06 15:22:41,587][04272] Updated weights for policy 0, policy_version 40780 (0.0006) [2023-03-06 15:22:42,401][04272] Updated weights for policy 0, policy_version 40790 (0.0006) [2023-03-06 15:22:43,238][04272] Updated weights for policy 0, policy_version 40800 (0.0007) [2023-03-06 15:22:43,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12614.3). Total num frames: 41787392. Throughput: 0: 12611.2. Samples: 41767336. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:22:43,941][03942] Avg episode reward: [(0, '1106.136')] [2023-03-06 15:22:44,045][04272] Updated weights for policy 0, policy_version 40810 (0.0006) [2023-03-06 15:22:44,859][04272] Updated weights for policy 0, policy_version 40820 (0.0007) [2023-03-06 15:22:45,674][04272] Updated weights for policy 0, policy_version 40830 (0.0005) [2023-03-06 15:22:46,477][04272] Updated weights for policy 0, policy_version 40840 (0.0006) [2023-03-06 15:22:47,291][04272] Updated weights for policy 0, policy_version 40850 (0.0006) [2023-03-06 15:22:48,102][04272] Updated weights for policy 0, policy_version 40860 (0.0006) [2023-03-06 15:22:48,912][04272] Updated weights for policy 0, policy_version 40870 (0.0006) [2023-03-06 15:22:48,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12617.8). Total num frames: 41850880. Throughput: 0: 12608.7. Samples: 41842843. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:22:48,941][03942] Avg episode reward: [(0, '1250.035')] [2023-03-06 15:22:49,713][04272] Updated weights for policy 0, policy_version 40880 (0.0006) [2023-03-06 15:22:50,527][04272] Updated weights for policy 0, policy_version 40890 (0.0007) [2023-03-06 15:22:51,369][04272] Updated weights for policy 0, policy_version 40900 (0.0006) [2023-03-06 15:22:52,173][04272] Updated weights for policy 0, policy_version 40910 (0.0006) [2023-03-06 15:22:52,979][04272] Updated weights for policy 0, policy_version 40920 (0.0006) [2023-03-06 15:22:53,811][04272] Updated weights for policy 0, policy_version 40930 (0.0007) [2023-03-06 15:22:53,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12614.3). Total num frames: 41913344. Throughput: 0: 12604.9. Samples: 41880592. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:22:53,941][03942] Avg episode reward: [(0, '1181.077')] [2023-03-06 15:22:54,630][04272] Updated weights for policy 0, policy_version 40940 (0.0007) [2023-03-06 15:22:55,430][04272] Updated weights for policy 0, policy_version 40950 (0.0006) [2023-03-06 15:22:56,262][04272] Updated weights for policy 0, policy_version 40960 (0.0006) [2023-03-06 15:22:57,074][04272] Updated weights for policy 0, policy_version 40970 (0.0007) [2023-03-06 15:22:57,885][04272] Updated weights for policy 0, policy_version 40980 (0.0006) [2023-03-06 15:22:58,705][04272] Updated weights for policy 0, policy_version 40990 (0.0008) [2023-03-06 15:22:58,941][03942] Fps is (10 sec: 12492.7, 60 sec: 12595.2, 300 sec: 12614.3). Total num frames: 41975808. Throughput: 0: 12592.5. Samples: 41955893. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:22:58,941][03942] Avg episode reward: [(0, '1250.824')] [2023-03-06 15:22:59,528][04272] Updated weights for policy 0, policy_version 41000 (0.0007) [2023-03-06 15:23:00,339][04272] Updated weights for policy 0, policy_version 41010 (0.0006) [2023-03-06 15:23:01,148][04272] Updated weights for policy 0, policy_version 41020 (0.0006) [2023-03-06 15:23:01,968][04272] Updated weights for policy 0, policy_version 41030 (0.0007) [2023-03-06 15:23:02,776][04272] Updated weights for policy 0, policy_version 41040 (0.0006) [2023-03-06 15:23:03,587][04272] Updated weights for policy 0, policy_version 41050 (0.0006) [2023-03-06 15:23:03,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12614.3). Total num frames: 42039296. Throughput: 0: 12586.9. Samples: 42031256. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:23:03,941][03942] Avg episode reward: [(0, '1222.745')] [2023-03-06 15:23:04,413][04272] Updated weights for policy 0, policy_version 41060 (0.0006) [2023-03-06 15:23:05,220][04272] Updated weights for policy 0, policy_version 41070 (0.0007) [2023-03-06 15:23:06,011][04272] Updated weights for policy 0, policy_version 41080 (0.0006) [2023-03-06 15:23:06,846][04272] Updated weights for policy 0, policy_version 41090 (0.0006) [2023-03-06 15:23:07,631][04272] Updated weights for policy 0, policy_version 41100 (0.0006) [2023-03-06 15:23:08,464][04272] Updated weights for policy 0, policy_version 41110 (0.0006) [2023-03-06 15:23:08,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12612.3, 300 sec: 12614.3). Total num frames: 42102784. Throughput: 0: 12586.3. Samples: 42069082. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:23:08,941][03942] Avg episode reward: [(0, '1274.296')] [2023-03-06 15:23:08,945][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000041116_42102784.pth... 
[2023-03-06 15:23:08,975][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000038160_39075840.pth [2023-03-06 15:23:09,271][04272] Updated weights for policy 0, policy_version 41120 (0.0006) [2023-03-06 15:23:10,088][04272] Updated weights for policy 0, policy_version 41130 (0.0006) [2023-03-06 15:23:10,902][04272] Updated weights for policy 0, policy_version 41140 (0.0006) [2023-03-06 15:23:11,708][04272] Updated weights for policy 0, policy_version 41150 (0.0006) [2023-03-06 15:23:12,518][04272] Updated weights for policy 0, policy_version 41160 (0.0006) [2023-03-06 15:23:13,344][04272] Updated weights for policy 0, policy_version 41170 (0.0006) [2023-03-06 15:23:13,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12610.8). Total num frames: 42165248. Throughput: 0: 12592.9. Samples: 42144863. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:23:13,941][03942] Avg episode reward: [(0, '1235.660')] [2023-03-06 15:23:14,152][04272] Updated weights for policy 0, policy_version 41180 (0.0006) [2023-03-06 15:23:14,944][04272] Updated weights for policy 0, policy_version 41190 (0.0007) [2023-03-06 15:23:15,752][04272] Updated weights for policy 0, policy_version 41200 (0.0006) [2023-03-06 15:23:16,575][04272] Updated weights for policy 0, policy_version 41210 (0.0006) [2023-03-06 15:23:17,398][04272] Updated weights for policy 0, policy_version 41220 (0.0007) [2023-03-06 15:23:18,209][04272] Updated weights for policy 0, policy_version 41230 (0.0006) [2023-03-06 15:23:18,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12610.8). Total num frames: 42228736. Throughput: 0: 12602.0. Samples: 42220643. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:23:18,941][03942] Avg episode reward: [(0, '1167.454')] [2023-03-06 15:23:19,027][04272] Updated weights for policy 0, policy_version 41240 (0.0006) [2023-03-06 15:23:19,842][04272] Updated weights for policy 0, policy_version 41250 (0.0006) [2023-03-06 15:23:20,658][04272] Updated weights for policy 0, policy_version 41260 (0.0006) [2023-03-06 15:23:21,466][04272] Updated weights for policy 0, policy_version 41270 (0.0007) [2023-03-06 15:23:22,275][04272] Updated weights for policy 0, policy_version 41280 (0.0006) [2023-03-06 15:23:23,064][04272] Updated weights for policy 0, policy_version 41290 (0.0007) [2023-03-06 15:23:23,882][04272] Updated weights for policy 0, policy_version 41300 (0.0006) [2023-03-06 15:23:23,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12610.8). Total num frames: 42291200. Throughput: 0: 12594.7. Samples: 42258249. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:23:23,941][03942] Avg episode reward: [(0, '1205.844')] [2023-03-06 15:23:24,683][04272] Updated weights for policy 0, policy_version 41310 (0.0008) [2023-03-06 15:23:25,477][04272] Updated weights for policy 0, policy_version 41320 (0.0006) [2023-03-06 15:23:26,311][04272] Updated weights for policy 0, policy_version 41330 (0.0006) [2023-03-06 15:23:27,125][04272] Updated weights for policy 0, policy_version 41340 (0.0006) [2023-03-06 15:23:27,933][04272] Updated weights for policy 0, policy_version 41350 (0.0006) [2023-03-06 15:23:28,749][04272] Updated weights for policy 0, policy_version 41360 (0.0006) [2023-03-06 15:23:28,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12610.8). Total num frames: 42354688. Throughput: 0: 12593.7. Samples: 42334052. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:23:28,941][03942] Avg episode reward: [(0, '1310.885')] [2023-03-06 15:23:29,550][04272] Updated weights for policy 0, policy_version 41370 (0.0007) [2023-03-06 15:23:30,368][04272] Updated weights for policy 0, policy_version 41380 (0.0006) [2023-03-06 15:23:31,178][04272] Updated weights for policy 0, policy_version 41390 (0.0006) [2023-03-06 15:23:31,997][04272] Updated weights for policy 0, policy_version 41400 (0.0007) [2023-03-06 15:23:32,798][04272] Updated weights for policy 0, policy_version 41410 (0.0007) [2023-03-06 15:23:33,616][04272] Updated weights for policy 0, policy_version 41420 (0.0006) [2023-03-06 15:23:33,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12612.3, 300 sec: 12614.3). Total num frames: 42418176. Throughput: 0: 12603.2. Samples: 42409989. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:23:33,941][03942] Avg episode reward: [(0, '1321.616')] [2023-03-06 15:23:34,416][04272] Updated weights for policy 0, policy_version 41430 (0.0006) [2023-03-06 15:23:35,221][04272] Updated weights for policy 0, policy_version 41440 (0.0008) [2023-03-06 15:23:36,023][04272] Updated weights for policy 0, policy_version 41450 (0.0006) [2023-03-06 15:23:36,826][04272] Updated weights for policy 0, policy_version 41460 (0.0006) [2023-03-06 15:23:37,630][04272] Updated weights for policy 0, policy_version 41470 (0.0007) [2023-03-06 15:23:38,447][04272] Updated weights for policy 0, policy_version 41480 (0.0007) [2023-03-06 15:23:38,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12612.3, 300 sec: 12614.3). Total num frames: 42481664. Throughput: 0: 12609.7. Samples: 42448030. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:23:38,941][03942] Avg episode reward: [(0, '1342.472')] [2023-03-06 15:23:39,247][04272] Updated weights for policy 0, policy_version 41490 (0.0006) [2023-03-06 15:23:40,091][04272] Updated weights for policy 0, policy_version 41500 (0.0006) [2023-03-06 15:23:40,887][04272] Updated weights for policy 0, policy_version 41510 (0.0007) [2023-03-06 15:23:41,705][04272] Updated weights for policy 0, policy_version 41520 (0.0006) [2023-03-06 15:23:42,524][04272] Updated weights for policy 0, policy_version 41530 (0.0006) [2023-03-06 15:23:43,346][04272] Updated weights for policy 0, policy_version 41540 (0.0006) [2023-03-06 15:23:43,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12610.8). Total num frames: 42544128. Throughput: 0: 12615.9. Samples: 42523610. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:23:43,941][03942] Avg episode reward: [(0, '1302.930')] [2023-03-06 15:23:44,170][04272] Updated weights for policy 0, policy_version 41550 (0.0006) [2023-03-06 15:23:44,995][04272] Updated weights for policy 0, policy_version 41560 (0.0006) [2023-03-06 15:23:45,818][04272] Updated weights for policy 0, policy_version 41570 (0.0006) [2023-03-06 15:23:46,615][04272] Updated weights for policy 0, policy_version 41580 (0.0006) [2023-03-06 15:23:47,442][04272] Updated weights for policy 0, policy_version 41590 (0.0006) [2023-03-06 15:23:48,245][04272] Updated weights for policy 0, policy_version 41600 (0.0006) [2023-03-06 15:23:48,940][03942] Fps is (10 sec: 12492.9, 60 sec: 12595.2, 300 sec: 12610.8). Total num frames: 42606592. Throughput: 0: 12613.3. Samples: 42598852. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:23:48,941][03942] Avg episode reward: [(0, '1274.896')] [2023-03-06 15:23:49,038][04272] Updated weights for policy 0, policy_version 41610 (0.0006) [2023-03-06 15:23:49,868][04272] Updated weights for policy 0, policy_version 41620 (0.0007) [2023-03-06 15:23:50,679][04272] Updated weights for policy 0, policy_version 41630 (0.0007) [2023-03-06 15:23:51,502][04272] Updated weights for policy 0, policy_version 41640 (0.0006) [2023-03-06 15:23:52,306][04272] Updated weights for policy 0, policy_version 41650 (0.0007) [2023-03-06 15:23:53,112][04272] Updated weights for policy 0, policy_version 41660 (0.0007) [2023-03-06 15:23:53,912][04272] Updated weights for policy 0, policy_version 41670 (0.0006) [2023-03-06 15:23:53,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12610.8). Total num frames: 42670080. Throughput: 0: 12611.6. Samples: 42636605. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:23:53,941][03942] Avg episode reward: [(0, '1231.081')] [2023-03-06 15:23:54,715][04272] Updated weights for policy 0, policy_version 41680 (0.0006) [2023-03-06 15:23:55,530][04272] Updated weights for policy 0, policy_version 41690 (0.0007) [2023-03-06 15:23:56,328][04272] Updated weights for policy 0, policy_version 41700 (0.0006) [2023-03-06 15:23:57,143][04272] Updated weights for policy 0, policy_version 41710 (0.0006) [2023-03-06 15:23:57,963][04272] Updated weights for policy 0, policy_version 41720 (0.0007) [2023-03-06 15:23:58,767][04272] Updated weights for policy 0, policy_version 41730 (0.0006) [2023-03-06 15:23:58,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12610.8). Total num frames: 42733568. Throughput: 0: 12616.5. Samples: 42712607. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:23:58,941][03942] Avg episode reward: [(0, '1257.944')] [2023-03-06 15:23:59,578][04272] Updated weights for policy 0, policy_version 41740 (0.0006) [2023-03-06 15:24:00,390][04272] Updated weights for policy 0, policy_version 41750 (0.0007) [2023-03-06 15:24:01,209][04272] Updated weights for policy 0, policy_version 41760 (0.0007) [2023-03-06 15:24:02,041][04272] Updated weights for policy 0, policy_version 41770 (0.0007) [2023-03-06 15:24:02,852][04272] Updated weights for policy 0, policy_version 41780 (0.0007) [2023-03-06 15:24:03,646][04272] Updated weights for policy 0, policy_version 41790 (0.0006) [2023-03-06 15:24:03,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12610.8). Total num frames: 42796032. Throughput: 0: 12610.3. Samples: 42788107. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:24:03,941][03942] Avg episode reward: [(0, '1249.162')] [2023-03-06 15:24:04,482][04272] Updated weights for policy 0, policy_version 41800 (0.0006) [2023-03-06 15:24:05,285][04272] Updated weights for policy 0, policy_version 41810 (0.0006) [2023-03-06 15:24:06,122][04272] Updated weights for policy 0, policy_version 41820 (0.0006) [2023-03-06 15:24:06,937][04272] Updated weights for policy 0, policy_version 41830 (0.0006) [2023-03-06 15:24:07,718][04272] Updated weights for policy 0, policy_version 41840 (0.0006) [2023-03-06 15:24:08,533][04272] Updated weights for policy 0, policy_version 41850 (0.0007) [2023-03-06 15:24:08,940][03942] Fps is (10 sec: 12492.8, 60 sec: 12595.2, 300 sec: 12610.8). Total num frames: 42858496. Throughput: 0: 12612.4. Samples: 42825806. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:24:08,941][03942] Avg episode reward: [(0, '1276.007')] [2023-03-06 15:24:09,363][04272] Updated weights for policy 0, policy_version 41860 (0.0006) [2023-03-06 15:24:10,171][04272] Updated weights for policy 0, policy_version 41870 (0.0007) [2023-03-06 15:24:10,979][04272] Updated weights for policy 0, policy_version 41880 (0.0006) [2023-03-06 15:24:11,801][04272] Updated weights for policy 0, policy_version 41890 (0.0006) [2023-03-06 15:24:12,625][04272] Updated weights for policy 0, policy_version 41900 (0.0006) [2023-03-06 15:24:13,432][04272] Updated weights for policy 0, policy_version 41910 (0.0007) [2023-03-06 15:24:13,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.2, 300 sec: 12610.8). Total num frames: 42921984. Throughput: 0: 12607.1. Samples: 42901371. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:24:13,941][03942] Avg episode reward: [(0, '1243.070')] [2023-03-06 15:24:14,246][04272] Updated weights for policy 0, policy_version 41920 (0.0007) [2023-03-06 15:24:15,053][04272] Updated weights for policy 0, policy_version 41930 (0.0006) [2023-03-06 15:24:15,863][04272] Updated weights for policy 0, policy_version 41940 (0.0006) [2023-03-06 15:24:16,681][04272] Updated weights for policy 0, policy_version 41950 (0.0006) [2023-03-06 15:24:17,495][04272] Updated weights for policy 0, policy_version 41960 (0.0007) [2023-03-06 15:24:18,293][04272] Updated weights for policy 0, policy_version 41970 (0.0006) [2023-03-06 15:24:18,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12612.2, 300 sec: 12614.3). Total num frames: 42985472. Throughput: 0: 12600.6. Samples: 42977016. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:24:18,941][03942] Avg episode reward: [(0, '1077.485')] [2023-03-06 15:24:19,114][04272] Updated weights for policy 0, policy_version 41980 (0.0007) [2023-03-06 15:24:19,933][04272] Updated weights for policy 0, policy_version 41990 (0.0006) [2023-03-06 15:24:20,751][04272] Updated weights for policy 0, policy_version 42000 (0.0006) [2023-03-06 15:24:21,559][04272] Updated weights for policy 0, policy_version 42010 (0.0005) [2023-03-06 15:24:22,361][04272] Updated weights for policy 0, policy_version 42020 (0.0006) [2023-03-06 15:24:23,180][04272] Updated weights for policy 0, policy_version 42030 (0.0006) [2023-03-06 15:24:23,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12614.3). Total num frames: 43047936. Throughput: 0: 12593.6. Samples: 43014742. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:24:23,941][03942] Avg episode reward: [(0, '934.925')] [2023-03-06 15:24:23,995][04272] Updated weights for policy 0, policy_version 42040 (0.0006) [2023-03-06 15:24:24,805][04272] Updated weights for policy 0, policy_version 42050 (0.0006) [2023-03-06 15:24:25,628][04272] Updated weights for policy 0, policy_version 42060 (0.0006) [2023-03-06 15:24:26,437][04272] Updated weights for policy 0, policy_version 42070 (0.0007) [2023-03-06 15:24:27,254][04272] Updated weights for policy 0, policy_version 42080 (0.0006) [2023-03-06 15:24:28,067][04272] Updated weights for policy 0, policy_version 42090 (0.0007) [2023-03-06 15:24:28,858][04272] Updated weights for policy 0, policy_version 42100 (0.0006) [2023-03-06 15:24:28,941][03942] Fps is (10 sec: 12492.9, 60 sec: 12595.2, 300 sec: 12610.8). Total num frames: 43110400. Throughput: 0: 12593.9. Samples: 43090337. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:24:28,941][03942] Avg episode reward: [(0, '1205.161')] [2023-03-06 15:24:29,686][04272] Updated weights for policy 0, policy_version 42110 (0.0006) [2023-03-06 15:24:30,498][04272] Updated weights for policy 0, policy_version 42120 (0.0007) [2023-03-06 15:24:31,307][04272] Updated weights for policy 0, policy_version 42130 (0.0006) [2023-03-06 15:24:32,139][04272] Updated weights for policy 0, policy_version 42140 (0.0006) [2023-03-06 15:24:32,947][04272] Updated weights for policy 0, policy_version 42150 (0.0006) [2023-03-06 15:24:33,756][04272] Updated weights for policy 0, policy_version 42160 (0.0007) [2023-03-06 15:24:33,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12610.8). Total num frames: 43173888. Throughput: 0: 12599.0. Samples: 43165806. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:24:33,941][03942] Avg episode reward: [(0, '1275.694')] [2023-03-06 15:24:34,567][04272] Updated weights for policy 0, policy_version 42170 (0.0008) [2023-03-06 15:24:35,398][04272] Updated weights for policy 0, policy_version 42180 (0.0007) [2023-03-06 15:24:36,202][04272] Updated weights for policy 0, policy_version 42190 (0.0006) [2023-03-06 15:24:37,025][04272] Updated weights for policy 0, policy_version 42200 (0.0007) [2023-03-06 15:24:37,854][04272] Updated weights for policy 0, policy_version 42210 (0.0006) [2023-03-06 15:24:38,664][04272] Updated weights for policy 0, policy_version 42220 (0.0006) [2023-03-06 15:24:38,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12578.1, 300 sec: 12610.8). Total num frames: 43236352. Throughput: 0: 12595.8. Samples: 43203417. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:24:38,941][03942] Avg episode reward: [(0, '1194.133')] [2023-03-06 15:24:39,474][04272] Updated weights for policy 0, policy_version 42230 (0.0006) [2023-03-06 15:24:40,299][04272] Updated weights for policy 0, policy_version 42240 (0.0006) [2023-03-06 15:24:41,099][04272] Updated weights for policy 0, policy_version 42250 (0.0006) [2023-03-06 15:24:41,929][04272] Updated weights for policy 0, policy_version 42260 (0.0008) [2023-03-06 15:24:42,730][04272] Updated weights for policy 0, policy_version 42270 (0.0006) [2023-03-06 15:24:43,549][04272] Updated weights for policy 0, policy_version 42280 (0.0006) [2023-03-06 15:24:43,941][03942] Fps is (10 sec: 12492.8, 60 sec: 12578.1, 300 sec: 12610.8). Total num frames: 43298816. Throughput: 0: 12578.7. Samples: 43278649. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:24:43,941][03942] Avg episode reward: [(0, '1189.764')] [2023-03-06 15:24:44,360][04272] Updated weights for policy 0, policy_version 42290 (0.0006) [2023-03-06 15:24:45,177][04272] Updated weights for policy 0, policy_version 42300 (0.0007) [2023-03-06 15:24:45,988][04272] Updated weights for policy 0, policy_version 42310 (0.0007) [2023-03-06 15:24:46,804][04272] Updated weights for policy 0, policy_version 42320 (0.0008) [2023-03-06 15:24:47,614][04272] Updated weights for policy 0, policy_version 42330 (0.0006) [2023-03-06 15:24:48,428][04272] Updated weights for policy 0, policy_version 42340 (0.0006) [2023-03-06 15:24:48,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12610.8). Total num frames: 43362304. Throughput: 0: 12579.5. Samples: 43354184. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:24:48,941][03942] Avg episode reward: [(0, '1016.792')] [2023-03-06 15:24:49,231][04272] Updated weights for policy 0, policy_version 42350 (0.0006) [2023-03-06 15:24:50,052][04272] Updated weights for policy 0, policy_version 42360 (0.0006) [2023-03-06 15:24:50,870][04272] Updated weights for policy 0, policy_version 42370 (0.0007) [2023-03-06 15:24:51,664][04272] Updated weights for policy 0, policy_version 42380 (0.0006) [2023-03-06 15:24:52,497][04272] Updated weights for policy 0, policy_version 42390 (0.0006) [2023-03-06 15:24:53,309][04272] Updated weights for policy 0, policy_version 42400 (0.0007) [2023-03-06 15:24:53,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12578.1, 300 sec: 12607.3). Total num frames: 43424768. Throughput: 0: 12584.3. Samples: 43392098. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:24:53,941][03942] Avg episode reward: [(0, '1197.193')] [2023-03-06 15:24:54,119][04272] Updated weights for policy 0, policy_version 42410 (0.0006) [2023-03-06 15:24:54,941][04272] Updated weights for policy 0, policy_version 42420 (0.0007) [2023-03-06 15:24:55,749][04272] Updated weights for policy 0, policy_version 42430 (0.0007) [2023-03-06 15:24:56,552][04272] Updated weights for policy 0, policy_version 42440 (0.0006) [2023-03-06 15:24:57,375][04272] Updated weights for policy 0, policy_version 42450 (0.0007) [2023-03-06 15:24:58,176][04272] Updated weights for policy 0, policy_version 42460 (0.0007) [2023-03-06 15:24:58,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12578.1, 300 sec: 12610.8). Total num frames: 43488256. Throughput: 0: 12583.8. Samples: 43467643. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:24:58,941][03942] Avg episode reward: [(0, '1153.687')] [2023-03-06 15:24:58,987][04272] Updated weights for policy 0, policy_version 42470 (0.0007) [2023-03-06 15:24:59,792][04272] Updated weights for policy 0, policy_version 42480 (0.0006) [2023-03-06 15:25:00,621][04272] Updated weights for policy 0, policy_version 42490 (0.0006) [2023-03-06 15:25:01,432][04272] Updated weights for policy 0, policy_version 42500 (0.0006) [2023-03-06 15:25:02,259][04272] Updated weights for policy 0, policy_version 42510 (0.0006) [2023-03-06 15:25:03,059][04272] Updated weights for policy 0, policy_version 42520 (0.0007) [2023-03-06 15:25:03,886][04272] Updated weights for policy 0, policy_version 42530 (0.0007) [2023-03-06 15:25:03,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12578.1, 300 sec: 12607.4). Total num frames: 43550720. Throughput: 0: 12580.3. Samples: 43543128. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:25:03,941][03942] Avg episode reward: [(0, '1295.758')] [2023-03-06 15:25:04,702][04272] Updated weights for policy 0, policy_version 42540 (0.0006) [2023-03-06 15:25:05,501][04272] Updated weights for policy 0, policy_version 42550 (0.0006) [2023-03-06 15:25:06,322][04272] Updated weights for policy 0, policy_version 42560 (0.0006) [2023-03-06 15:25:07,111][04272] Updated weights for policy 0, policy_version 42570 (0.0005) [2023-03-06 15:25:07,934][04272] Updated weights for policy 0, policy_version 42580 (0.0007) [2023-03-06 15:25:08,744][04272] Updated weights for policy 0, policy_version 42590 (0.0007) [2023-03-06 15:25:08,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12607.3). Total num frames: 43614208. Throughput: 0: 12579.8. Samples: 43580835. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:25:08,941][03942] Avg episode reward: [(0, '1260.308')] [2023-03-06 15:25:08,944][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000042592_43614208.pth... [2023-03-06 15:25:08,975][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000039639_40590336.pth [2023-03-06 15:25:09,546][04272] Updated weights for policy 0, policy_version 42600 (0.0006) [2023-03-06 15:25:10,353][04272] Updated weights for policy 0, policy_version 42610 (0.0006) [2023-03-06 15:25:11,169][04272] Updated weights for policy 0, policy_version 42620 (0.0006) [2023-03-06 15:25:11,995][04272] Updated weights for policy 0, policy_version 42630 (0.0006) [2023-03-06 15:25:12,794][04272] Updated weights for policy 0, policy_version 42640 (0.0006) [2023-03-06 15:25:13,613][04272] Updated weights for policy 0, policy_version 42650 (0.0006) [2023-03-06 15:25:13,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12595.2, 300 sec: 12610.8). Total num frames: 43677696. Throughput: 0: 12587.5. Samples: 43656773. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:25:13,941][03942] Avg episode reward: [(0, '1197.583')] [2023-03-06 15:25:14,425][04272] Updated weights for policy 0, policy_version 42660 (0.0007) [2023-03-06 15:25:15,226][04272] Updated weights for policy 0, policy_version 42670 (0.0007) [2023-03-06 15:25:16,065][04272] Updated weights for policy 0, policy_version 42680 (0.0007) [2023-03-06 15:25:16,858][04272] Updated weights for policy 0, policy_version 42690 (0.0007) [2023-03-06 15:25:17,663][04272] Updated weights for policy 0, policy_version 42700 (0.0006) [2023-03-06 15:25:18,490][04272] Updated weights for policy 0, policy_version 42710 (0.0006) [2023-03-06 15:25:18,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12578.2, 300 sec: 12607.4). Total num frames: 43740160. Throughput: 0: 12593.4. Samples: 43732509. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:25:18,941][03942] Avg episode reward: [(0, '1227.205')] [2023-03-06 15:25:19,294][04272] Updated weights for policy 0, policy_version 42720 (0.0006) [2023-03-06 15:25:20,090][04272] Updated weights for policy 0, policy_version 42730 (0.0006) [2023-03-06 15:25:20,901][04272] Updated weights for policy 0, policy_version 42740 (0.0006) [2023-03-06 15:25:21,718][04272] Updated weights for policy 0, policy_version 42750 (0.0006) [2023-03-06 15:25:22,526][04272] Updated weights for policy 0, policy_version 42760 (0.0006) [2023-03-06 15:25:23,328][04272] Updated weights for policy 0, policy_version 42770 (0.0006) [2023-03-06 15:25:23,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12610.8). Total num frames: 43803648. Throughput: 0: 12599.9. Samples: 43770413. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:25:23,941][03942] Avg episode reward: [(0, '1208.629')] [2023-03-06 15:25:24,130][04272] Updated weights for policy 0, policy_version 42780 (0.0006) [2023-03-06 15:25:24,952][04272] Updated weights for policy 0, policy_version 42790 (0.0006) [2023-03-06 15:25:25,746][04272] Updated weights for policy 0, policy_version 42800 (0.0006) [2023-03-06 15:25:26,571][04272] Updated weights for policy 0, policy_version 42810 (0.0007) [2023-03-06 15:25:27,368][04272] Updated weights for policy 0, policy_version 42820 (0.0005) [2023-03-06 15:25:28,173][04272] Updated weights for policy 0, policy_version 42830 (0.0007) [2023-03-06 15:25:28,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12612.3, 300 sec: 12610.8). Total num frames: 43867136. Throughput: 0: 12620.0. Samples: 43846550. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:25:28,941][03942] Avg episode reward: [(0, '1244.860')] [2023-03-06 15:25:28,990][04272] Updated weights for policy 0, policy_version 42840 (0.0006) [2023-03-06 15:25:29,775][04272] Updated weights for policy 0, policy_version 42850 (0.0006) [2023-03-06 15:25:30,597][04272] Updated weights for policy 0, policy_version 42860 (0.0006) [2023-03-06 15:25:31,404][04272] Updated weights for policy 0, policy_version 42870 (0.0006) [2023-03-06 15:25:32,202][04272] Updated weights for policy 0, policy_version 42880 (0.0007) [2023-03-06 15:25:33,001][04272] Updated weights for policy 0, policy_version 42890 (0.0006) [2023-03-06 15:25:33,821][04272] Updated weights for policy 0, policy_version 42900 (0.0006) [2023-03-06 15:25:33,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12612.3, 300 sec: 12610.8). Total num frames: 43930624. Throughput: 0: 12635.4. Samples: 43922776. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:25:33,941][03942] Avg episode reward: [(0, '1079.627')] [2023-03-06 15:25:34,644][04272] Updated weights for policy 0, policy_version 42910 (0.0006) [2023-03-06 15:25:35,459][04272] Updated weights for policy 0, policy_version 42920 (0.0005) [2023-03-06 15:25:36,277][04272] Updated weights for policy 0, policy_version 42930 (0.0006) [2023-03-06 15:25:37,082][04272] Updated weights for policy 0, policy_version 42940 (0.0007) [2023-03-06 15:25:37,897][04272] Updated weights for policy 0, policy_version 42950 (0.0006) [2023-03-06 15:25:38,700][04272] Updated weights for policy 0, policy_version 42960 (0.0006) [2023-03-06 15:25:38,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12607.3). Total num frames: 43993088. Throughput: 0: 12628.8. Samples: 43960392. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:25:38,941][03942] Avg episode reward: [(0, '1140.602')] [2023-03-06 15:25:39,495][04272] Updated weights for policy 0, policy_version 42970 (0.0007) [2023-03-06 15:25:40,305][04272] Updated weights for policy 0, policy_version 42980 (0.0006) [2023-03-06 15:25:41,117][04272] Updated weights for policy 0, policy_version 42990 (0.0007) [2023-03-06 15:25:41,943][04272] Updated weights for policy 0, policy_version 43000 (0.0006) [2023-03-06 15:25:42,750][04272] Updated weights for policy 0, policy_version 43010 (0.0006) [2023-03-06 15:25:43,566][04272] Updated weights for policy 0, policy_version 43020 (0.0006) [2023-03-06 15:25:43,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12629.3, 300 sec: 12610.8). Total num frames: 44056576. Throughput: 0: 12633.3. Samples: 44036140. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:25:43,941][03942] Avg episode reward: [(0, '1070.461')] [2023-03-06 15:25:44,394][04272] Updated weights for policy 0, policy_version 43030 (0.0006) [2023-03-06 15:25:45,201][04272] Updated weights for policy 0, policy_version 43040 (0.0006) [2023-03-06 15:25:46,015][04272] Updated weights for policy 0, policy_version 43050 (0.0006) [2023-03-06 15:25:46,343][04221] KL-divergence is very high: 156.5010 [2023-03-06 15:25:46,833][04272] Updated weights for policy 0, policy_version 43060 (0.0006) [2023-03-06 15:25:46,908][04221] KL-divergence is very high: 128.0037 [2023-03-06 15:25:47,644][04272] Updated weights for policy 0, policy_version 43070 (0.0007) [2023-03-06 15:25:48,452][04272] Updated weights for policy 0, policy_version 43080 (0.0007) [2023-03-06 15:25:48,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12607.3). Total num frames: 44119040. Throughput: 0: 12633.9. Samples: 44111656. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:25:48,941][03942] Avg episode reward: [(0, '1010.464')] [2023-03-06 15:25:49,262][04272] Updated weights for policy 0, policy_version 43090 (0.0006) [2023-03-06 15:25:50,073][04272] Updated weights for policy 0, policy_version 43100 (0.0006) [2023-03-06 15:25:50,902][04272] Updated weights for policy 0, policy_version 43110 (0.0006) [2023-03-06 15:25:51,702][04272] Updated weights for policy 0, policy_version 43120 (0.0006) [2023-03-06 15:25:52,507][04272] Updated weights for policy 0, policy_version 43130 (0.0007) [2023-03-06 15:25:53,331][04272] Updated weights for policy 0, policy_version 43140 (0.0006) [2023-03-06 15:25:53,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12607.4). Total num frames: 44182528. Throughput: 0: 12638.4. Samples: 44149562. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:25:53,941][03942] Avg episode reward: [(0, '910.428')] [2023-03-06 15:25:54,121][04272] Updated weights for policy 0, policy_version 43150 (0.0006) [2023-03-06 15:25:54,938][04272] Updated weights for policy 0, policy_version 43160 (0.0006) [2023-03-06 15:25:55,573][04221] KL-divergence is very high: 115.9679 [2023-03-06 15:25:55,734][04272] Updated weights for policy 0, policy_version 43170 (0.0006) [2023-03-06 15:25:56,380][04221] KL-divergence is very high: 578.5638 [2023-03-06 15:25:56,467][04221] KL-divergence is very high: 177.0767 [2023-03-06 15:25:56,554][04272] Updated weights for policy 0, policy_version 43180 (0.0006) [2023-03-06 15:25:57,378][04272] Updated weights for policy 0, policy_version 43190 (0.0007) [2023-03-06 15:25:58,181][04272] Updated weights for policy 0, policy_version 43200 (0.0006) [2023-03-06 15:25:58,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12607.3). Total num frames: 44246016. Throughput: 0: 12636.3. Samples: 44225408. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:25:58,941][03942] Avg episode reward: [(0, '1010.534')] [2023-03-06 15:25:58,991][04272] Updated weights for policy 0, policy_version 43210 (0.0006) [2023-03-06 15:25:59,081][04221] KL-divergence is very high: 424.3940 [2023-03-06 15:25:59,813][04272] Updated weights for policy 0, policy_version 43220 (0.0006) [2023-03-06 15:26:00,624][04272] Updated weights for policy 0, policy_version 43230 (0.0007) [2023-03-06 15:26:01,434][04272] Updated weights for policy 0, policy_version 43240 (0.0007) [2023-03-06 15:26:01,902][04221] KL-divergence is very high: 476.2168 [2023-03-06 15:26:02,232][04272] Updated weights for policy 0, policy_version 43250 (0.0006) [2023-03-06 15:26:03,054][04272] Updated weights for policy 0, policy_version 43260 (0.0008) [2023-03-06 15:26:03,131][04221] KL-divergence is very high: 101.9627 [2023-03-06 15:26:03,877][04272] Updated weights for policy 0, policy_version 43270 (0.0006) [2023-03-06 15:26:03,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12629.3, 300 sec: 12607.4). Total num frames: 44308480. Throughput: 0: 12632.5. Samples: 44300969. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:26:03,941][03942] Avg episode reward: [(0, '909.463')] [2023-03-06 15:26:04,685][04272] Updated weights for policy 0, policy_version 43280 (0.0006) [2023-03-06 15:26:05,501][04272] Updated weights for policy 0, policy_version 43290 (0.0007) [2023-03-06 15:26:06,302][04272] Updated weights for policy 0, policy_version 43300 (0.0006) [2023-03-06 15:26:07,101][04272] Updated weights for policy 0, policy_version 43310 (0.0006) [2023-03-06 15:26:07,918][04272] Updated weights for policy 0, policy_version 43320 (0.0006) [2023-03-06 15:26:08,712][04272] Updated weights for policy 0, policy_version 43330 (0.0007) [2023-03-06 15:26:08,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12629.3, 300 sec: 12607.3). Total num frames: 44371968. Throughput: 0: 12635.1. Samples: 44338992. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:26:08,941][03942] Avg episode reward: [(0, '1033.118')] [2023-03-06 15:26:09,532][04272] Updated weights for policy 0, policy_version 43340 (0.0007) [2023-03-06 15:26:10,340][04272] Updated weights for policy 0, policy_version 43350 (0.0007) [2023-03-06 15:26:11,154][04272] Updated weights for policy 0, policy_version 43360 (0.0006) [2023-03-06 15:26:11,208][04221] KL-divergence is very high: 119.1546 [2023-03-06 15:26:11,889][04221] KL-divergence is very high: 215.7650 [2023-03-06 15:26:11,961][04221] KL-divergence is very high: 1275.7450 [2023-03-06 15:26:11,967][04272] Updated weights for policy 0, policy_version 43370 (0.0006) [2023-03-06 15:26:12,372][04221] KL-divergence is very high: 331.8473 [2023-03-06 15:26:12,785][04272] Updated weights for policy 0, policy_version 43380 (0.0006) [2023-03-06 15:26:13,610][04272] Updated weights for policy 0, policy_version 43390 (0.0006) [2023-03-06 15:26:13,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12629.3, 300 sec: 12607.3). Total num frames: 44435456. Throughput: 0: 12627.5. Samples: 44414789. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:26:13,941][03942] Avg episode reward: [(0, '1110.231')] [2023-03-06 15:26:14,091][04221] KL-divergence is very high: 119.0555 [2023-03-06 15:26:14,419][04272] Updated weights for policy 0, policy_version 43400 (0.0006) [2023-03-06 15:26:14,835][04221] KL-divergence is very high: 109.6302 [2023-03-06 15:26:15,233][04272] Updated weights for policy 0, policy_version 43410 (0.0007) [2023-03-06 15:26:16,053][04272] Updated weights for policy 0, policy_version 43420 (0.0006) [2023-03-06 15:26:16,864][04272] Updated weights for policy 0, policy_version 43430 (0.0007) [2023-03-06 15:26:17,671][04272] Updated weights for policy 0, policy_version 43440 (0.0006) [2023-03-06 15:26:17,990][04221] KL-divergence is very high: 254.6980 [2023-03-06 15:26:18,469][04272] Updated weights for policy 0, policy_version 43450 (0.0006) [2023-03-06 15:26:18,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12607.3). Total num frames: 44497920. Throughput: 0: 12608.0. Samples: 44490134. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:26:18,941][03942] Avg episode reward: [(0, '959.771')] [2023-03-06 15:26:19,303][04272] Updated weights for policy 0, policy_version 43460 (0.0006) [2023-03-06 15:26:20,101][04272] Updated weights for policy 0, policy_version 43470 (0.0006) [2023-03-06 15:26:20,918][04272] Updated weights for policy 0, policy_version 43480 (0.0006) [2023-03-06 15:26:21,738][04272] Updated weights for policy 0, policy_version 43490 (0.0006) [2023-03-06 15:26:21,982][04221] KL-divergence is very high: 1080.0468 [2023-03-06 15:26:22,299][04221] KL-divergence is very high: 526.0292 [2023-03-06 15:26:22,538][04272] Updated weights for policy 0, policy_version 43500 (0.0006) [2023-03-06 15:26:23,357][04272] Updated weights for policy 0, policy_version 43510 (0.0006) [2023-03-06 15:26:23,666][04221] KL-divergence is very high: 288.1663 [2023-03-06 15:26:23,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12607.3). Total num frames: 44561408. Throughput: 0: 12614.6. Samples: 44528049. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:26:23,941][03942] Avg episode reward: [(0, '896.470')] [2023-03-06 15:26:24,162][04272] Updated weights for policy 0, policy_version 43520 (0.0006) [2023-03-06 15:26:24,476][04221] KL-divergence is very high: 121.7142 [2023-03-06 15:26:24,963][04272] Updated weights for policy 0, policy_version 43530 (0.0006) [2023-03-06 15:26:25,787][04272] Updated weights for policy 0, policy_version 43540 (0.0006) [2023-03-06 15:26:26,592][04272] Updated weights for policy 0, policy_version 43550 (0.0007) [2023-03-06 15:26:27,393][04272] Updated weights for policy 0, policy_version 43560 (0.0006) [2023-03-06 15:26:27,954][04221] KL-divergence is very high: 694.0330 [2023-03-06 15:26:28,199][04272] Updated weights for policy 0, policy_version 43570 (0.0007) [2023-03-06 15:26:28,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12629.4, 300 sec: 12610.8). Total num frames: 44624896. 
Throughput: 0: 12617.7. Samples: 44603936. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:26:28,941][03942] Avg episode reward: [(0, '835.166')] [2023-03-06 15:26:29,017][04272] Updated weights for policy 0, policy_version 43580 (0.0006) [2023-03-06 15:26:29,823][04272] Updated weights for policy 0, policy_version 43590 (0.0007) [2023-03-06 15:26:30,656][04272] Updated weights for policy 0, policy_version 43600 (0.0007) [2023-03-06 15:26:31,472][04272] Updated weights for policy 0, policy_version 43610 (0.0006) [2023-03-06 15:26:32,275][04272] Updated weights for policy 0, policy_version 43620 (0.0008) [2023-03-06 15:26:33,093][04272] Updated weights for policy 0, policy_version 43630 (0.0007) [2023-03-06 15:26:33,900][04272] Updated weights for policy 0, policy_version 43640 (0.0006) [2023-03-06 15:26:33,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12607.4). Total num frames: 44687360. Throughput: 0: 12619.3. Samples: 44679525. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:26:33,941][03942] Avg episode reward: [(0, '864.585')] [2023-03-06 15:26:34,715][04272] Updated weights for policy 0, policy_version 43650 (0.0006) [2023-03-06 15:26:35,516][04272] Updated weights for policy 0, policy_version 43660 (0.0007) [2023-03-06 15:26:36,335][04272] Updated weights for policy 0, policy_version 43670 (0.0006) [2023-03-06 15:26:37,146][04272] Updated weights for policy 0, policy_version 43680 (0.0007) [2023-03-06 15:26:37,948][04272] Updated weights for policy 0, policy_version 43690 (0.0007) [2023-03-06 15:26:38,757][04272] Updated weights for policy 0, policy_version 43700 (0.0007) [2023-03-06 15:26:38,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12629.3, 300 sec: 12607.3). Total num frames: 44750848. Throughput: 0: 12621.5. Samples: 44717529. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:26:38,941][03942] Avg episode reward: [(0, '1029.210')] [2023-03-06 15:26:39,574][04272] Updated weights for policy 0, policy_version 43710 (0.0006) [2023-03-06 15:26:40,380][04272] Updated weights for policy 0, policy_version 43720 (0.0006) [2023-03-06 15:26:41,205][04272] Updated weights for policy 0, policy_version 43730 (0.0009) [2023-03-06 15:26:41,279][04221] KL-divergence is very high: 111.7043 [2023-03-06 15:26:42,005][04272] Updated weights for policy 0, policy_version 43740 (0.0006) [2023-03-06 15:26:42,817][04272] Updated weights for policy 0, policy_version 43750 (0.0006) [2023-03-06 15:26:43,657][04272] Updated weights for policy 0, policy_version 43760 (0.0006) [2023-03-06 15:26:43,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12607.3). Total num frames: 44813312. Throughput: 0: 12611.5. Samples: 44792924. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:26:43,952][03942] Avg episode reward: [(0, '981.791')] [2023-03-06 15:26:44,460][04272] Updated weights for policy 0, policy_version 43770 (0.0007) [2023-03-06 15:26:45,276][04272] Updated weights for policy 0, policy_version 43780 (0.0007) [2023-03-06 15:26:46,085][04272] Updated weights for policy 0, policy_version 43790 (0.0006) [2023-03-06 15:26:46,890][04272] Updated weights for policy 0, policy_version 43800 (0.0006) [2023-03-06 15:26:47,706][04272] Updated weights for policy 0, policy_version 43810 (0.0006) [2023-03-06 15:26:47,867][04221] KL-divergence is very high: 3143.0947 [2023-03-06 15:26:48,514][04272] Updated weights for policy 0, policy_version 43820 (0.0006) [2023-03-06 15:26:48,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12629.3, 300 sec: 12607.3). Total num frames: 44876800. Throughput: 0: 12614.0. Samples: 44868602. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:26:48,952][03942] Avg episode reward: [(0, '990.771')] [2023-03-06 15:26:49,347][04272] Updated weights for policy 0, policy_version 43830 (0.0006) [2023-03-06 15:26:49,499][04221] KL-divergence is very high: 1356.3346 [2023-03-06 15:26:50,154][04272] Updated weights for policy 0, policy_version 43840 (0.0007) [2023-03-06 15:26:50,969][04272] Updated weights for policy 0, policy_version 43850 (0.0006) [2023-03-06 15:26:51,781][04272] Updated weights for policy 0, policy_version 43860 (0.0006) [2023-03-06 15:26:52,577][04272] Updated weights for policy 0, policy_version 43870 (0.0006) [2023-03-06 15:26:53,402][04272] Updated weights for policy 0, policy_version 43880 (0.0008) [2023-03-06 15:26:53,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12607.4). Total num frames: 44939264. Throughput: 0: 12609.0. Samples: 44906395. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:26:53,952][03942] Avg episode reward: [(0, '985.315')] [2023-03-06 15:26:54,218][04272] Updated weights for policy 0, policy_version 43890 (0.0006) [2023-03-06 15:26:55,019][04272] Updated weights for policy 0, policy_version 43900 (0.0007) [2023-03-06 15:26:55,824][04272] Updated weights for policy 0, policy_version 43910 (0.0006) [2023-03-06 15:26:56,645][04272] Updated weights for policy 0, policy_version 43920 (0.0006) [2023-03-06 15:26:56,901][04221] KL-divergence is very high: 342.2658 [2023-03-06 15:26:57,453][04272] Updated weights for policy 0, policy_version 43930 (0.0007) [2023-03-06 15:26:58,269][04272] Updated weights for policy 0, policy_version 43940 (0.0006) [2023-03-06 15:26:58,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12607.3). Total num frames: 45002752. Throughput: 0: 12605.8. Samples: 44982051. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:26:58,952][03942] Avg episode reward: [(0, '893.233')] [2023-03-06 15:26:59,089][04272] Updated weights for policy 0, policy_version 43950 (0.0006) [2023-03-06 15:26:59,884][04272] Updated weights for policy 0, policy_version 43960 (0.0007) [2023-03-06 15:27:00,692][04272] Updated weights for policy 0, policy_version 43970 (0.0006) [2023-03-06 15:27:01,511][04272] Updated weights for policy 0, policy_version 43980 (0.0006) [2023-03-06 15:27:02,316][04272] Updated weights for policy 0, policy_version 43990 (0.0006) [2023-03-06 15:27:03,136][04272] Updated weights for policy 0, policy_version 44000 (0.0007) [2023-03-06 15:27:03,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12629.3, 300 sec: 12610.8). Total num frames: 45066240. Throughput: 0: 12612.8. Samples: 45057706. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:27:03,942][04272] Updated weights for policy 0, policy_version 44010 (0.0006) [2023-03-06 15:27:03,951][03942] Avg episode reward: [(0, '913.745')] [2023-03-06 15:27:04,765][04272] Updated weights for policy 0, policy_version 44020 (0.0007) [2023-03-06 15:27:05,576][04272] Updated weights for policy 0, policy_version 44030 (0.0006) [2023-03-06 15:27:06,366][04272] Updated weights for policy 0, policy_version 44040 (0.0006) [2023-03-06 15:27:07,196][04272] Updated weights for policy 0, policy_version 44050 (0.0006) [2023-03-06 15:27:07,971][04272] Updated weights for policy 0, policy_version 44060 (0.0006) [2023-03-06 15:27:08,811][04272] Updated weights for policy 0, policy_version 44070 (0.0006) [2023-03-06 15:27:08,941][03942] Fps is (10 sec: 12595.0, 60 sec: 12612.2, 300 sec: 12607.3). Total num frames: 45128704. Throughput: 0: 12613.3. Samples: 45095651. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:27:08,952][03942] Avg episode reward: [(0, '985.044')] [2023-03-06 15:27:08,956][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000044071_45128704.pth... [2023-03-06 15:27:08,987][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000041116_42102784.pth [2023-03-06 15:27:09,652][04272] Updated weights for policy 0, policy_version 44080 (0.0006) [2023-03-06 15:27:10,437][04272] Updated weights for policy 0, policy_version 44090 (0.0006) [2023-03-06 15:27:11,253][04272] Updated weights for policy 0, policy_version 44100 (0.0006) [2023-03-06 15:27:12,059][04272] Updated weights for policy 0, policy_version 44110 (0.0006) [2023-03-06 15:27:12,877][04272] Updated weights for policy 0, policy_version 44120 (0.0006) [2023-03-06 15:27:13,689][04272] Updated weights for policy 0, policy_version 44130 (0.0006) [2023-03-06 15:27:13,941][03942] Fps is (10 sec: 12595.0, 60 sec: 12612.3, 300 sec: 12610.8). Total num frames: 45192192. Throughput: 0: 12609.5. Samples: 45171363. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:27:13,952][03942] Avg episode reward: [(0, '1181.366')] [2023-03-06 15:27:14,493][04272] Updated weights for policy 0, policy_version 44140 (0.0006) [2023-03-06 15:27:15,307][04272] Updated weights for policy 0, policy_version 44150 (0.0006) [2023-03-06 15:27:16,107][04272] Updated weights for policy 0, policy_version 44160 (0.0006) [2023-03-06 15:27:16,926][04272] Updated weights for policy 0, policy_version 44170 (0.0006) [2023-03-06 15:27:17,719][04272] Updated weights for policy 0, policy_version 44180 (0.0007) [2023-03-06 15:27:18,531][04272] Updated weights for policy 0, policy_version 44190 (0.0006) [2023-03-06 15:27:18,940][03942] Fps is (10 sec: 12697.9, 60 sec: 12629.4, 300 sec: 12610.8). Total num frames: 45255680. Throughput: 0: 12617.7. Samples: 45247323. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:27:18,941][03942] Avg episode reward: [(0, '1154.449')] [2023-03-06 15:27:19,322][04272] Updated weights for policy 0, policy_version 44200 (0.0006) [2023-03-06 15:27:20,140][04272] Updated weights for policy 0, policy_version 44210 (0.0007) [2023-03-06 15:27:20,929][04272] Updated weights for policy 0, policy_version 44220 (0.0006) [2023-03-06 15:27:21,739][04272] Updated weights for policy 0, policy_version 44230 (0.0006) [2023-03-06 15:27:22,578][04272] Updated weights for policy 0, policy_version 44240 (0.0006) [2023-03-06 15:27:23,390][04272] Updated weights for policy 0, policy_version 44250 (0.0006) [2023-03-06 15:27:23,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12607.3). Total num frames: 45318144. Throughput: 0: 12622.1. Samples: 45285524. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:27:23,941][03942] Avg episode reward: [(0, '1136.074')] [2023-03-06 15:27:24,186][04272] Updated weights for policy 0, policy_version 44260 (0.0006) [2023-03-06 15:27:25,001][04272] Updated weights for policy 0, policy_version 44270 (0.0006) [2023-03-06 15:27:25,817][04272] Updated weights for policy 0, policy_version 44280 (0.0006) [2023-03-06 15:27:26,647][04272] Updated weights for policy 0, policy_version 44290 (0.0006) [2023-03-06 15:27:27,445][04272] Updated weights for policy 0, policy_version 44300 (0.0005) [2023-03-06 15:27:28,268][04272] Updated weights for policy 0, policy_version 44310 (0.0007) [2023-03-06 15:27:28,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12610.8). Total num frames: 45381632. Throughput: 0: 12625.5. Samples: 45361072. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:27:28,941][03942] Avg episode reward: [(0, '892.152')] [2023-03-06 15:27:29,079][04272] Updated weights for policy 0, policy_version 44320 (0.0007) [2023-03-06 15:27:29,894][04272] Updated weights for policy 0, policy_version 44330 (0.0006) [2023-03-06 15:27:30,697][04272] Updated weights for policy 0, policy_version 44340 (0.0006) [2023-03-06 15:27:31,495][04272] Updated weights for policy 0, policy_version 44350 (0.0006) [2023-03-06 15:27:32,304][04272] Updated weights for policy 0, policy_version 44360 (0.0007) [2023-03-06 15:27:33,110][04272] Updated weights for policy 0, policy_version 44370 (0.0006) [2023-03-06 15:27:33,923][04272] Updated weights for policy 0, policy_version 44380 (0.0006) [2023-03-06 15:27:33,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12610.8). Total num frames: 45445120. Throughput: 0: 12629.8. Samples: 45436945. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:27:33,941][03942] Avg episode reward: [(0, '1081.718')] [2023-03-06 15:27:34,745][04272] Updated weights for policy 0, policy_version 44390 (0.0006) [2023-03-06 15:27:35,547][04272] Updated weights for policy 0, policy_version 44400 (0.0006) [2023-03-06 15:27:36,357][04272] Updated weights for policy 0, policy_version 44410 (0.0007) [2023-03-06 15:27:37,170][04272] Updated weights for policy 0, policy_version 44420 (0.0006) [2023-03-06 15:27:37,994][04272] Updated weights for policy 0, policy_version 44430 (0.0006) [2023-03-06 15:27:38,802][04272] Updated weights for policy 0, policy_version 44440 (0.0006) [2023-03-06 15:27:38,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12610.8). Total num frames: 45507584. Throughput: 0: 12636.9. Samples: 45475054. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:27:38,941][03942] Avg episode reward: [(0, '962.481')] [2023-03-06 15:27:39,598][04272] Updated weights for policy 0, policy_version 44450 (0.0006) [2023-03-06 15:27:40,404][04272] Updated weights for policy 0, policy_version 44460 (0.0006) [2023-03-06 15:27:41,221][04272] Updated weights for policy 0, policy_version 44470 (0.0006) [2023-03-06 15:27:42,054][04272] Updated weights for policy 0, policy_version 44480 (0.0006) [2023-03-06 15:27:42,861][04272] Updated weights for policy 0, policy_version 44490 (0.0006) [2023-03-06 15:27:43,673][04272] Updated weights for policy 0, policy_version 44500 (0.0006) [2023-03-06 15:27:43,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12610.8). Total num frames: 45571072. Throughput: 0: 12630.5. Samples: 45550422. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:27:43,941][03942] Avg episode reward: [(0, '892.918')] [2023-03-06 15:27:44,493][04272] Updated weights for policy 0, policy_version 44510 (0.0006) [2023-03-06 15:27:45,296][04272] Updated weights for policy 0, policy_version 44520 (0.0007) [2023-03-06 15:27:46,102][04272] Updated weights for policy 0, policy_version 44530 (0.0007) [2023-03-06 15:27:46,922][04272] Updated weights for policy 0, policy_version 44540 (0.0007) [2023-03-06 15:27:47,724][04272] Updated weights for policy 0, policy_version 44550 (0.0006) [2023-03-06 15:27:48,534][04272] Updated weights for policy 0, policy_version 44560 (0.0005) [2023-03-06 15:27:48,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12629.3, 300 sec: 12614.3). Total num frames: 45634560. Throughput: 0: 12634.1. Samples: 45626241. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:27:48,941][03942] Avg episode reward: [(0, '1055.112')] [2023-03-06 15:27:49,320][04272] Updated weights for policy 0, policy_version 44570 (0.0007) [2023-03-06 15:27:50,133][04272] Updated weights for policy 0, policy_version 44580 (0.0007) [2023-03-06 15:27:50,958][04272] Updated weights for policy 0, policy_version 44590 (0.0007) [2023-03-06 15:27:51,761][04272] Updated weights for policy 0, policy_version 44600 (0.0006) [2023-03-06 15:27:52,552][04272] Updated weights for policy 0, policy_version 44610 (0.0006) [2023-03-06 15:27:53,366][04272] Updated weights for policy 0, policy_version 44620 (0.0006) [2023-03-06 15:27:53,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12646.4, 300 sec: 12617.8). Total num frames: 45698048. Throughput: 0: 12637.2. Samples: 45664322. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:27:53,941][03942] Avg episode reward: [(0, '1024.119')] [2023-03-06 15:27:54,170][04272] Updated weights for policy 0, policy_version 44630 (0.0007) [2023-03-06 15:27:54,991][04272] Updated weights for policy 0, policy_version 44640 (0.0007) [2023-03-06 15:27:55,810][04272] Updated weights for policy 0, policy_version 44650 (0.0007) [2023-03-06 15:27:56,621][04272] Updated weights for policy 0, policy_version 44660 (0.0006) [2023-03-06 15:27:57,414][04272] Updated weights for policy 0, policy_version 44670 (0.0007) [2023-03-06 15:27:58,224][04272] Updated weights for policy 0, policy_version 44680 (0.0006) [2023-03-06 15:27:58,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12614.3). Total num frames: 45760512. Throughput: 0: 12643.1. Samples: 45740302. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:27:58,941][03942] Avg episode reward: [(0, '1077.520')] [2023-03-06 15:27:59,029][04272] Updated weights for policy 0, policy_version 44690 (0.0007) [2023-03-06 15:27:59,833][04272] Updated weights for policy 0, policy_version 44700 (0.0007) [2023-03-06 15:28:00,657][04272] Updated weights for policy 0, policy_version 44710 (0.0006) [2023-03-06 15:28:01,460][04272] Updated weights for policy 0, policy_version 44720 (0.0006) [2023-03-06 15:28:02,282][04272] Updated weights for policy 0, policy_version 44730 (0.0006) [2023-03-06 15:28:03,091][04272] Updated weights for policy 0, policy_version 44740 (0.0006) [2023-03-06 15:28:03,890][04272] Updated weights for policy 0, policy_version 44750 (0.0006) [2023-03-06 15:28:03,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12614.3). Total num frames: 45824000. Throughput: 0: 12640.1. Samples: 45816128. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:28:03,941][03942] Avg episode reward: [(0, '1061.547')] [2023-03-06 15:28:04,711][04272] Updated weights for policy 0, policy_version 44760 (0.0006) [2023-03-06 15:28:05,505][04272] Updated weights for policy 0, policy_version 44770 (0.0006) [2023-03-06 15:28:06,305][04272] Updated weights for policy 0, policy_version 44780 (0.0006) [2023-03-06 15:28:07,127][04272] Updated weights for policy 0, policy_version 44790 (0.0007) [2023-03-06 15:28:07,934][04272] Updated weights for policy 0, policy_version 44800 (0.0007) [2023-03-06 15:28:08,743][04272] Updated weights for policy 0, policy_version 44810 (0.0006) [2023-03-06 15:28:08,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12646.4, 300 sec: 12617.8). Total num frames: 45887488. Throughput: 0: 12638.1. Samples: 45854239. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:28:08,941][03942] Avg episode reward: [(0, '900.388')] [2023-03-06 15:28:09,558][04272] Updated weights for policy 0, policy_version 44820 (0.0006) [2023-03-06 15:28:10,370][04221] KL-divergence is very high: 109.5101 [2023-03-06 15:28:10,389][04272] Updated weights for policy 0, policy_version 44830 (0.0006) [2023-03-06 15:28:10,856][04221] KL-divergence is very high: 115.1938 [2023-03-06 15:28:11,196][04272] Updated weights for policy 0, policy_version 44840 (0.0006) [2023-03-06 15:28:11,822][04221] KL-divergence is very high: 150.5173 [2023-03-06 15:28:12,001][04272] Updated weights for policy 0, policy_version 44850 (0.0007) [2023-03-06 15:28:12,809][04272] Updated weights for policy 0, policy_version 44860 (0.0006) [2023-03-06 15:28:13,281][04221] KL-divergence is very high: 243.1249 [2023-03-06 15:28:13,638][04272] Updated weights for policy 0, policy_version 44870 (0.0006) [2023-03-06 15:28:13,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12629.3, 300 sec: 12614.3). Total num frames: 45949952. Throughput: 0: 12643.4. Samples: 45930024. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:28:13,941][03942] Avg episode reward: [(0, '990.085')] [2023-03-06 15:28:14,431][04272] Updated weights for policy 0, policy_version 44880 (0.0006) [2023-03-06 15:28:15,238][04272] Updated weights for policy 0, policy_version 44890 (0.0006) [2023-03-06 15:28:16,050][04272] Updated weights for policy 0, policy_version 44900 (0.0006) [2023-03-06 15:28:16,705][04221] KL-divergence is very high: 2434.5879 [2023-03-06 15:28:16,868][04221] KL-divergence is very high: 169.4566 [2023-03-06 15:28:16,875][04272] Updated weights for policy 0, policy_version 44910 (0.0006) [2023-03-06 15:28:17,680][04272] Updated weights for policy 0, policy_version 44920 (0.0006) [2023-03-06 15:28:18,481][04272] Updated weights for policy 0, policy_version 44930 (0.0006) [2023-03-06 15:28:18,630][04221] KL-divergence is very high: 217.6656 [2023-03-06 15:28:18,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12629.3, 300 sec: 12617.8). Total num frames: 46013440. Throughput: 0: 12639.2. Samples: 46005709. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:28:18,941][03942] Avg episode reward: [(0, '944.574')] [2023-03-06 15:28:19,300][04272] Updated weights for policy 0, policy_version 44940 (0.0007) [2023-03-06 15:28:20,105][04272] Updated weights for policy 0, policy_version 44950 (0.0006) [2023-03-06 15:28:20,506][04221] KL-divergence is very high: 276.5358 [2023-03-06 15:28:20,912][04272] Updated weights for policy 0, policy_version 44960 (0.0006) [2023-03-06 15:28:21,073][04221] KL-divergence is very high: 126.0044 [2023-03-06 15:28:21,747][04272] Updated weights for policy 0, policy_version 44970 (0.0006) [2023-03-06 15:28:22,561][04272] Updated weights for policy 0, policy_version 44980 (0.0006) [2023-03-06 15:28:23,366][04272] Updated weights for policy 0, policy_version 44990 (0.0005) [2023-03-06 15:28:23,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12646.4, 300 sec: 12617.8). Total num frames: 46076928. 
Throughput: 0: 12631.8. Samples: 46043484. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:28:23,941][03942] Avg episode reward: [(0, '933.452')] [2023-03-06 15:28:24,182][04272] Updated weights for policy 0, policy_version 45000 (0.0007) [2023-03-06 15:28:24,809][04221] KL-divergence is very high: 313.1248 [2023-03-06 15:28:24,980][04272] Updated weights for policy 0, policy_version 45010 (0.0006) [2023-03-06 15:28:25,796][04272] Updated weights for policy 0, policy_version 45020 (0.0006) [2023-03-06 15:28:26,610][04272] Updated weights for policy 0, policy_version 45030 (0.0006) [2023-03-06 15:28:27,416][04272] Updated weights for policy 0, policy_version 45040 (0.0007) [2023-03-06 15:28:28,209][04272] Updated weights for policy 0, policy_version 45050 (0.0006) [2023-03-06 15:28:28,941][03942] Fps is (10 sec: 12595.0, 60 sec: 12629.3, 300 sec: 12614.3). Total num frames: 46139392. Throughput: 0: 12639.7. Samples: 46119210. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:28:28,941][03942] Avg episode reward: [(0, '1054.572')] [2023-03-06 15:28:29,031][04272] Updated weights for policy 0, policy_version 45060 (0.0006) [2023-03-06 15:28:29,833][04272] Updated weights for policy 0, policy_version 45070 (0.0006) [2023-03-06 15:28:30,646][04272] Updated weights for policy 0, policy_version 45080 (0.0006) [2023-03-06 15:28:31,464][04272] Updated weights for policy 0, policy_version 45090 (0.0006) [2023-03-06 15:28:32,256][04272] Updated weights for policy 0, policy_version 45100 (0.0006) [2023-03-06 15:28:32,346][04221] KL-divergence is very high: 1315.4783 [2023-03-06 15:28:33,093][04272] Updated weights for policy 0, policy_version 45110 (0.0007) [2023-03-06 15:28:33,483][04221] KL-divergence is very high: 139.6965 [2023-03-06 15:28:33,885][04272] Updated weights for policy 0, policy_version 45120 (0.0006) [2023-03-06 15:28:33,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12629.3, 300 sec: 12614.3). Total num frames: 46202880. 
Throughput: 0: 12642.1. Samples: 46195136. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:28:33,941][03942] Avg episode reward: [(0, '802.483')] [2023-03-06 15:28:34,671][04221] KL-divergence is very high: 1502.9922 [2023-03-06 15:28:34,679][04272] Updated weights for policy 0, policy_version 45130 (0.0006) [2023-03-06 15:28:34,759][04221] KL-divergence is very high: 1241.4470 [2023-03-06 15:28:35,383][04221] KL-divergence is very high: 178.2046 [2023-03-06 15:28:35,494][04272] Updated weights for policy 0, policy_version 45140 (0.0007) [2023-03-06 15:28:36,299][04272] Updated weights for policy 0, policy_version 45150 (0.0006) [2023-03-06 15:28:36,709][04221] KL-divergence is very high: 126.2891 [2023-03-06 15:28:37,051][04221] KL-divergence is very high: 559.7687 [2023-03-06 15:28:37,115][04272] Updated weights for policy 0, policy_version 45160 (0.0007) [2023-03-06 15:28:37,375][04221] KL-divergence is very high: 362.6405 [2023-03-06 15:28:37,925][04272] Updated weights for policy 0, policy_version 45170 (0.0007) [2023-03-06 15:28:38,719][04272] Updated weights for policy 0, policy_version 45180 (0.0006) [2023-03-06 15:28:38,940][03942] Fps is (10 sec: 12697.8, 60 sec: 12646.4, 300 sec: 12617.8). Total num frames: 46266368. Throughput: 0: 12645.9. Samples: 46233388. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:28:38,941][03942] Avg episode reward: [(0, '913.323')] [2023-03-06 15:28:39,539][04272] Updated weights for policy 0, policy_version 45190 (0.0007) [2023-03-06 15:28:40,338][04272] Updated weights for policy 0, policy_version 45200 (0.0007) [2023-03-06 15:28:41,146][04272] Updated weights for policy 0, policy_version 45210 (0.0007) [2023-03-06 15:28:41,965][04272] Updated weights for policy 0, policy_version 45220 (0.0006) [2023-03-06 15:28:42,763][04272] Updated weights for policy 0, policy_version 45230 (0.0007) [2023-03-06 15:28:43,081][04221] KL-divergence is very high: 350.2178 [2023-03-06 15:28:43,584][04272] Updated weights for policy 0, policy_version 45240 (0.0006) [2023-03-06 15:28:43,941][03942] Fps is (10 sec: 12697.7, 60 sec: 12646.4, 300 sec: 12621.2). Total num frames: 46329856. Throughput: 0: 12642.4. Samples: 46309210. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:28:43,941][03942] Avg episode reward: [(0, '1046.875')] [2023-03-06 15:28:44,385][04272] Updated weights for policy 0, policy_version 45250 (0.0006) [2023-03-06 15:28:45,191][04272] Updated weights for policy 0, policy_version 45260 (0.0007) [2023-03-06 15:28:45,998][04272] Updated weights for policy 0, policy_version 45270 (0.0006) [2023-03-06 15:28:46,503][04221] KL-divergence is very high: 2095.1313 [2023-03-06 15:28:46,831][04272] Updated weights for policy 0, policy_version 45280 (0.0006) [2023-03-06 15:28:47,640][04272] Updated weights for policy 0, policy_version 45290 (0.0005) [2023-03-06 15:28:48,437][04272] Updated weights for policy 0, policy_version 45300 (0.0006) [2023-03-06 15:28:48,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12646.4, 300 sec: 12621.2). Total num frames: 46393344. Throughput: 0: 12647.4. Samples: 46385262. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:28:48,941][03942] Avg episode reward: [(0, '1087.383')] [2023-03-06 15:28:49,250][04272] Updated weights for policy 0, policy_version 45310 (0.0007) [2023-03-06 15:28:49,894][04221] KL-divergence is very high: 112.8059 [2023-03-06 15:28:50,069][04272] Updated weights for policy 0, policy_version 45320 (0.0007) [2023-03-06 15:28:50,843][04272] Updated weights for policy 0, policy_version 45330 (0.0007) [2023-03-06 15:28:51,100][04221] KL-divergence is very high: 104.5670 [2023-03-06 15:28:51,173][04221] KL-divergence is very high: 1195.6136 [2023-03-06 15:28:51,431][04221] KL-divergence is very high: 6305.0977 [2023-03-06 15:28:51,599][04221] KL-divergence is very high: 758.9954 [2023-03-06 15:28:51,668][04272] Updated weights for policy 0, policy_version 45340 (0.0007) [2023-03-06 15:28:52,085][04221] KL-divergence is very high: 173.2414 [2023-03-06 15:28:52,481][04272] Updated weights for policy 0, policy_version 45350 (0.0006) [2023-03-06 15:28:53,282][04272] Updated weights for policy 0, policy_version 45360 (0.0006) [2023-03-06 15:28:53,914][04221] KL-divergence is very high: 259.0471 [2023-03-06 15:28:53,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12646.4, 300 sec: 12621.2). Total num frames: 46456832. Throughput: 0: 12642.6. Samples: 46423158. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:28:53,941][03942] Avg episode reward: [(0, '1015.641')] [2023-03-06 15:28:54,083][04272] Updated weights for policy 0, policy_version 45370 (0.0006) [2023-03-06 15:28:54,906][04272] Updated weights for policy 0, policy_version 45380 (0.0006) [2023-03-06 15:28:55,618][04221] KL-divergence is very high: 434.4685 [2023-03-06 15:28:55,721][04272] Updated weights for policy 0, policy_version 45390 (0.0006) [2023-03-06 15:28:56,195][04221] KL-divergence is very high: 759.2377 [2023-03-06 15:28:56,525][04272] Updated weights for policy 0, policy_version 45400 (0.0006) [2023-03-06 15:28:56,628][04221] KL-divergence is very high: 2293655.0000 [2023-03-06 15:28:57,346][04272] Updated weights for policy 0, policy_version 45410 (0.0006) [2023-03-06 15:28:58,155][04272] Updated weights for policy 0, policy_version 45420 (0.0006) [2023-03-06 15:28:58,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12646.4, 300 sec: 12621.2). Total num frames: 46519296. Throughput: 0: 12643.7. Samples: 46498990. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:28:58,941][03942] Avg episode reward: [(0, '840.362')] [2023-03-06 15:28:58,953][04272] Updated weights for policy 0, policy_version 45430 (0.0006) [2023-03-06 15:28:59,759][04272] Updated weights for policy 0, policy_version 45440 (0.0006) [2023-03-06 15:29:00,591][04272] Updated weights for policy 0, policy_version 45450 (0.0006) [2023-03-06 15:29:01,395][04272] Updated weights for policy 0, policy_version 45460 (0.0007) [2023-03-06 15:29:02,192][04272] Updated weights for policy 0, policy_version 45470 (0.0006) [2023-03-06 15:29:03,008][04272] Updated weights for policy 0, policy_version 45480 (0.0006) [2023-03-06 15:29:03,807][04272] Updated weights for policy 0, policy_version 45490 (0.0006) [2023-03-06 15:29:03,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12646.4, 300 sec: 12624.7). Total num frames: 46582784. Throughput: 0: 12651.2. Samples: 46575013. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:29:03,941][03942] Avg episode reward: [(0, '1015.133')] [2023-03-06 15:29:04,640][04272] Updated weights for policy 0, policy_version 45500 (0.0007) [2023-03-06 15:29:05,436][04272] Updated weights for policy 0, policy_version 45510 (0.0007) [2023-03-06 15:29:06,233][04272] Updated weights for policy 0, policy_version 45520 (0.0006) [2023-03-06 15:29:07,046][04272] Updated weights for policy 0, policy_version 45530 (0.0007) [2023-03-06 15:29:07,853][04272] Updated weights for policy 0, policy_version 45540 (0.0006) [2023-03-06 15:29:08,666][04272] Updated weights for policy 0, policy_version 45550 (0.0006) [2023-03-06 15:29:08,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12646.4, 300 sec: 12624.7). Total num frames: 46646272. Throughput: 0: 12655.9. Samples: 46612999. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:29:08,941][03942] Avg episode reward: [(0, '1190.351')] [2023-03-06 15:29:08,945][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000045553_46646272.pth... [2023-03-06 15:29:08,976][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000042592_43614208.pth [2023-03-06 15:29:09,482][04272] Updated weights for policy 0, policy_version 45560 (0.0006) [2023-03-06 15:29:10,295][04272] Updated weights for policy 0, policy_version 45570 (0.0006) [2023-03-06 15:29:11,111][04272] Updated weights for policy 0, policy_version 45580 (0.0006) [2023-03-06 15:29:11,923][04272] Updated weights for policy 0, policy_version 45590 (0.0006) [2023-03-06 15:29:12,722][04272] Updated weights for policy 0, policy_version 45600 (0.0007) [2023-03-06 15:29:13,554][04272] Updated weights for policy 0, policy_version 45610 (0.0007) [2023-03-06 15:29:13,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12646.4, 300 sec: 12621.2). Total num frames: 46708736. Throughput: 0: 12654.2. Samples: 46688645. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:29:13,941][03942] Avg episode reward: [(0, '1098.454')] [2023-03-06 15:29:14,354][04272] Updated weights for policy 0, policy_version 45620 (0.0007) [2023-03-06 15:29:15,168][04272] Updated weights for policy 0, policy_version 45630 (0.0007) [2023-03-06 15:29:15,967][04272] Updated weights for policy 0, policy_version 45640 (0.0007) [2023-03-06 15:29:16,770][04272] Updated weights for policy 0, policy_version 45650 (0.0006) [2023-03-06 15:29:17,493][04221] KL-divergence is very high: 786.3524 [2023-03-06 15:29:17,595][04272] Updated weights for policy 0, policy_version 45660 (0.0007) [2023-03-06 15:29:18,402][04272] Updated weights for policy 0, policy_version 45670 (0.0006) [2023-03-06 15:29:18,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12646.4, 300 sec: 12624.7). Total num frames: 46772224. Throughput: 0: 12656.3. Samples: 46764668. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:29:18,941][03942] Avg episode reward: [(0, '1022.023')] [2023-03-06 15:29:19,199][04272] Updated weights for policy 0, policy_version 45680 (0.0006) [2023-03-06 15:29:20,017][04272] Updated weights for policy 0, policy_version 45690 (0.0006) [2023-03-06 15:29:20,818][04272] Updated weights for policy 0, policy_version 45700 (0.0006) [2023-03-06 15:29:21,615][04272] Updated weights for policy 0, policy_version 45710 (0.0006) [2023-03-06 15:29:22,421][04272] Updated weights for policy 0, policy_version 45720 (0.0006) [2023-03-06 15:29:23,218][04272] Updated weights for policy 0, policy_version 45730 (0.0007) [2023-03-06 15:29:23,941][03942] Fps is (10 sec: 12697.4, 60 sec: 12646.4, 300 sec: 12628.2). Total num frames: 46835712. Throughput: 0: 12653.2. Samples: 46802784. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:29:23,941][03942] Avg episode reward: [(0, '994.337')] [2023-03-06 15:29:24,050][04272] Updated weights for policy 0, policy_version 45740 (0.0006) [2023-03-06 15:29:24,854][04272] Updated weights for policy 0, policy_version 45750 (0.0006) [2023-03-06 15:29:25,658][04272] Updated weights for policy 0, policy_version 45760 (0.0006) [2023-03-06 15:29:26,459][04272] Updated weights for policy 0, policy_version 45770 (0.0007) [2023-03-06 15:29:27,298][04272] Updated weights for policy 0, policy_version 45780 (0.0006) [2023-03-06 15:29:28,106][04272] Updated weights for policy 0, policy_version 45790 (0.0006) [2023-03-06 15:29:28,910][04272] Updated weights for policy 0, policy_version 45800 (0.0006) [2023-03-06 15:29:28,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12663.5, 300 sec: 12628.2). Total num frames: 46899200. Throughput: 0: 12650.9. Samples: 46878501. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:29:28,941][03942] Avg episode reward: [(0, '1120.984')] [2023-03-06 15:29:29,724][04272] Updated weights for policy 0, policy_version 45810 (0.0007) [2023-03-06 15:29:30,531][04272] Updated weights for policy 0, policy_version 45820 (0.0007) [2023-03-06 15:29:31,360][04272] Updated weights for policy 0, policy_version 45830 (0.0007) [2023-03-06 15:29:32,190][04272] Updated weights for policy 0, policy_version 45840 (0.0006) [2023-03-06 15:29:32,994][04272] Updated weights for policy 0, policy_version 45850 (0.0006) [2023-03-06 15:29:33,810][04272] Updated weights for policy 0, policy_version 45860 (0.0006) [2023-03-06 15:29:33,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12646.4, 300 sec: 12628.2). Total num frames: 46961664. Throughput: 0: 12642.3. Samples: 46954165. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:29:33,941][03942] Avg episode reward: [(0, '1169.238')] [2023-03-06 15:29:34,629][04272] Updated weights for policy 0, policy_version 45870 (0.0006) [2023-03-06 15:29:35,434][04272] Updated weights for policy 0, policy_version 45880 (0.0006) [2023-03-06 15:29:36,234][04272] Updated weights for policy 0, policy_version 45890 (0.0006) [2023-03-06 15:29:37,049][04272] Updated weights for policy 0, policy_version 45900 (0.0006) [2023-03-06 15:29:37,854][04272] Updated weights for policy 0, policy_version 45910 (0.0006) [2023-03-06 15:29:38,646][04272] Updated weights for policy 0, policy_version 45920 (0.0006) [2023-03-06 15:29:38,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12646.4, 300 sec: 12631.6). Total num frames: 47025152. Throughput: 0: 12641.8. Samples: 46992041. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:29:38,941][03942] Avg episode reward: [(0, '1078.133')] [2023-03-06 15:29:39,448][04272] Updated weights for policy 0, policy_version 45930 (0.0006) [2023-03-06 15:29:40,261][04272] Updated weights for policy 0, policy_version 45940 (0.0006) [2023-03-06 15:29:41,070][04272] Updated weights for policy 0, policy_version 45950 (0.0006) [2023-03-06 15:29:41,894][04272] Updated weights for policy 0, policy_version 45960 (0.0007) [2023-03-06 15:29:42,707][04272] Updated weights for policy 0, policy_version 45970 (0.0007) [2023-03-06 15:29:43,524][04272] Updated weights for policy 0, policy_version 45980 (0.0007) [2023-03-06 15:29:43,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12646.4, 300 sec: 12631.6). Total num frames: 47088640. Throughput: 0: 12643.0. Samples: 47067922. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:29:43,941][03942] Avg episode reward: [(0, '1096.510')] [2023-03-06 15:29:44,334][04272] Updated weights for policy 0, policy_version 45990 (0.0006) [2023-03-06 15:29:45,151][04272] Updated weights for policy 0, policy_version 46000 (0.0007) [2023-03-06 15:29:45,950][04272] Updated weights for policy 0, policy_version 46010 (0.0006) [2023-03-06 15:29:46,777][04272] Updated weights for policy 0, policy_version 46020 (0.0006) [2023-03-06 15:29:47,581][04272] Updated weights for policy 0, policy_version 46030 (0.0005) [2023-03-06 15:29:48,392][04272] Updated weights for policy 0, policy_version 46040 (0.0006) [2023-03-06 15:29:48,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12631.6). Total num frames: 47151104. Throughput: 0: 12634.2. Samples: 47143550. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:29:48,941][03942] Avg episode reward: [(0, '1148.451')] [2023-03-06 15:29:49,207][04272] Updated weights for policy 0, policy_version 46050 (0.0006) [2023-03-06 15:29:50,033][04272] Updated weights for policy 0, policy_version 46060 (0.0006) [2023-03-06 15:29:50,826][04272] Updated weights for policy 0, policy_version 46070 (0.0007) [2023-03-06 15:29:51,644][04272] Updated weights for policy 0, policy_version 46080 (0.0006) [2023-03-06 15:29:52,455][04272] Updated weights for policy 0, policy_version 46090 (0.0007) [2023-03-06 15:29:53,256][04272] Updated weights for policy 0, policy_version 46100 (0.0007) [2023-03-06 15:29:53,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12631.7). Total num frames: 47214592. Throughput: 0: 12631.8. Samples: 47181427. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:29:53,941][03942] Avg episode reward: [(0, '1215.981')] [2023-03-06 15:29:54,071][04272] Updated weights for policy 0, policy_version 46110 (0.0006) [2023-03-06 15:29:54,887][04272] Updated weights for policy 0, policy_version 46120 (0.0006) [2023-03-06 15:29:55,704][04272] Updated weights for policy 0, policy_version 46130 (0.0006) [2023-03-06 15:29:56,508][04272] Updated weights for policy 0, policy_version 46140 (0.0007) [2023-03-06 15:29:57,312][04272] Updated weights for policy 0, policy_version 46150 (0.0007) [2023-03-06 15:29:58,131][04272] Updated weights for policy 0, policy_version 46160 (0.0006) [2023-03-06 15:29:58,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12631.6). Total num frames: 47277056. Throughput: 0: 12630.7. Samples: 47257027. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:29:58,941][03942] Avg episode reward: [(0, '1298.982')] [2023-03-06 15:29:58,946][04272] Updated weights for policy 0, policy_version 46170 (0.0006) [2023-03-06 15:29:59,769][04272] Updated weights for policy 0, policy_version 46180 (0.0006) [2023-03-06 15:30:00,585][04272] Updated weights for policy 0, policy_version 46190 (0.0006) [2023-03-06 15:30:01,385][04272] Updated weights for policy 0, policy_version 46200 (0.0006) [2023-03-06 15:30:02,220][04272] Updated weights for policy 0, policy_version 46210 (0.0007) [2023-03-06 15:30:03,030][04272] Updated weights for policy 0, policy_version 46220 (0.0006) [2023-03-06 15:30:03,837][04272] Updated weights for policy 0, policy_version 46230 (0.0007) [2023-03-06 15:30:03,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12629.3, 300 sec: 12631.6). Total num frames: 47340544. Throughput: 0: 12619.2. Samples: 47332534. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:30:03,941][03942] Avg episode reward: [(0, '1251.097')] [2023-03-06 15:30:04,634][04272] Updated weights for policy 0, policy_version 46240 (0.0006) [2023-03-06 15:30:05,459][04272] Updated weights for policy 0, policy_version 46250 (0.0007) [2023-03-06 15:30:06,263][04272] Updated weights for policy 0, policy_version 46260 (0.0006) [2023-03-06 15:30:07,067][04272] Updated weights for policy 0, policy_version 46270 (0.0007) [2023-03-06 15:30:07,869][04272] Updated weights for policy 0, policy_version 46280 (0.0007) [2023-03-06 15:30:08,685][04272] Updated weights for policy 0, policy_version 46290 (0.0007) [2023-03-06 15:30:08,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.4, 300 sec: 12631.7). Total num frames: 47404032. Throughput: 0: 12616.3. Samples: 47370515. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:30:08,941][03942] Avg episode reward: [(0, '1079.615')] [2023-03-06 15:30:09,498][04272] Updated weights for policy 0, policy_version 46300 (0.0006) [2023-03-06 15:30:10,301][04272] Updated weights for policy 0, policy_version 46310 (0.0006) [2023-03-06 15:30:11,115][04272] Updated weights for policy 0, policy_version 46320 (0.0006) [2023-03-06 15:30:11,944][04272] Updated weights for policy 0, policy_version 46330 (0.0006) [2023-03-06 15:30:12,749][04272] Updated weights for policy 0, policy_version 46340 (0.0006) [2023-03-06 15:30:13,565][04272] Updated weights for policy 0, policy_version 46350 (0.0006) [2023-03-06 15:30:13,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12631.6). Total num frames: 47466496. Throughput: 0: 12614.3. Samples: 47446143. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:30:13,941][03942] Avg episode reward: [(0, '1115.944')] [2023-03-06 15:30:14,395][04272] Updated weights for policy 0, policy_version 46360 (0.0007) [2023-03-06 15:30:15,209][04272] Updated weights for policy 0, policy_version 46370 (0.0006) [2023-03-06 15:30:16,009][04272] Updated weights for policy 0, policy_version 46380 (0.0006) [2023-03-06 15:30:16,825][04272] Updated weights for policy 0, policy_version 46390 (0.0006) [2023-03-06 15:30:17,638][04272] Updated weights for policy 0, policy_version 46400 (0.0006) [2023-03-06 15:30:18,439][04272] Updated weights for policy 0, policy_version 46410 (0.0007) [2023-03-06 15:30:18,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12631.6). Total num frames: 47529984. Throughput: 0: 12612.6. Samples: 47521730. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:30:18,941][03942] Avg episode reward: [(0, '1112.496')] [2023-03-06 15:30:19,253][04272] Updated weights for policy 0, policy_version 46420 (0.0006) [2023-03-06 15:30:20,074][04272] Updated weights for policy 0, policy_version 46430 (0.0006) [2023-03-06 15:30:20,890][04272] Updated weights for policy 0, policy_version 46440 (0.0006) [2023-03-06 15:30:21,696][04272] Updated weights for policy 0, policy_version 46450 (0.0006) [2023-03-06 15:30:22,505][04272] Updated weights for policy 0, policy_version 46460 (0.0007) [2023-03-06 15:30:23,303][04272] Updated weights for policy 0, policy_version 46470 (0.0007) [2023-03-06 15:30:23,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12628.2). Total num frames: 47592448. Throughput: 0: 12607.9. Samples: 47559398. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:30:23,950][03942] Avg episode reward: [(0, '1201.385')] [2023-03-06 15:30:24,134][04272] Updated weights for policy 0, policy_version 46480 (0.0006) [2023-03-06 15:30:24,940][04272] Updated weights for policy 0, policy_version 46490 (0.0006) [2023-03-06 15:30:25,769][04272] Updated weights for policy 0, policy_version 46500 (0.0006) [2023-03-06 15:30:26,563][04272] Updated weights for policy 0, policy_version 46510 (0.0006) [2023-03-06 15:30:27,374][04272] Updated weights for policy 0, policy_version 46520 (0.0006) [2023-03-06 15:30:28,217][04272] Updated weights for policy 0, policy_version 46530 (0.0006) [2023-03-06 15:30:28,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12628.2). Total num frames: 47655936. Throughput: 0: 12610.0. Samples: 47635373. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:30:28,951][03942] Avg episode reward: [(0, '1048.378')] [2023-03-06 15:30:29,001][04272] Updated weights for policy 0, policy_version 46540 (0.0007) [2023-03-06 15:30:29,819][04272] Updated weights for policy 0, policy_version 46550 (0.0007) [2023-03-06 15:30:30,626][04272] Updated weights for policy 0, policy_version 46560 (0.0007) [2023-03-06 15:30:31,432][04272] Updated weights for policy 0, policy_version 46570 (0.0006) [2023-03-06 15:30:32,251][04272] Updated weights for policy 0, policy_version 46580 (0.0006) [2023-03-06 15:30:33,095][04272] Updated weights for policy 0, policy_version 46590 (0.0006) [2023-03-06 15:30:33,882][04272] Updated weights for policy 0, policy_version 46600 (0.0006) [2023-03-06 15:30:33,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12628.2). Total num frames: 47718400. Throughput: 0: 12600.3. Samples: 47710564. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:30:33,952][03942] Avg episode reward: [(0, '1208.361')] [2023-03-06 15:30:34,675][04272] Updated weights for policy 0, policy_version 46610 (0.0006) [2023-03-06 15:30:35,498][04272] Updated weights for policy 0, policy_version 46620 (0.0006) [2023-03-06 15:30:36,305][04272] Updated weights for policy 0, policy_version 46630 (0.0007) [2023-03-06 15:30:37,122][04272] Updated weights for policy 0, policy_version 46640 (0.0006) [2023-03-06 15:30:37,935][04272] Updated weights for policy 0, policy_version 46650 (0.0006) [2023-03-06 15:30:38,733][04221] KL-divergence is very high: 1735.7113 [2023-03-06 15:30:38,739][04272] Updated weights for policy 0, policy_version 46660 (0.0006) [2023-03-06 15:30:38,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12628.2). Total num frames: 47781888. Throughput: 0: 12604.3. Samples: 47748619. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:30:38,952][03942] Avg episode reward: [(0, '1177.028')] [2023-03-06 15:30:39,550][04272] Updated weights for policy 0, policy_version 46670 (0.0007) [2023-03-06 15:30:40,381][04272] Updated weights for policy 0, policy_version 46680 (0.0007) [2023-03-06 15:30:41,182][04272] Updated weights for policy 0, policy_version 46690 (0.0006) [2023-03-06 15:30:42,006][04272] Updated weights for policy 0, policy_version 46700 (0.0006) [2023-03-06 15:30:42,806][04272] Updated weights for policy 0, policy_version 46710 (0.0006) [2023-03-06 15:30:43,624][04272] Updated weights for policy 0, policy_version 46720 (0.0007) [2023-03-06 15:30:43,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12612.3, 300 sec: 12631.7). Total num frames: 47845376. Throughput: 0: 12607.0. Samples: 47824343. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:30:43,941][03942] Avg episode reward: [(0, '1066.401')] [2023-03-06 15:30:44,430][04272] Updated weights for policy 0, policy_version 46730 (0.0007) [2023-03-06 15:30:45,244][04272] Updated weights for policy 0, policy_version 46740 (0.0007) [2023-03-06 15:30:46,050][04272] Updated weights for policy 0, policy_version 46750 (0.0007) [2023-03-06 15:30:46,867][04272] Updated weights for policy 0, policy_version 46760 (0.0006) [2023-03-06 15:30:47,675][04272] Updated weights for policy 0, policy_version 46770 (0.0007) [2023-03-06 15:30:48,495][04272] Updated weights for policy 0, policy_version 46780 (0.0006) [2023-03-06 15:30:48,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12628.2). Total num frames: 47907840. Throughput: 0: 12609.9. Samples: 47899979. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:30:48,941][03942] Avg episode reward: [(0, '1098.239')] [2023-03-06 15:30:49,298][04272] Updated weights for policy 0, policy_version 46790 (0.0006) [2023-03-06 15:30:50,119][04272] Updated weights for policy 0, policy_version 46800 (0.0006) [2023-03-06 15:30:50,933][04272] Updated weights for policy 0, policy_version 46810 (0.0007) [2023-03-06 15:30:51,747][04272] Updated weights for policy 0, policy_version 46820 (0.0006) [2023-03-06 15:30:52,564][04272] Updated weights for policy 0, policy_version 46830 (0.0006) [2023-03-06 15:30:53,384][04272] Updated weights for policy 0, policy_version 46840 (0.0007) [2023-03-06 15:30:53,941][03942] Fps is (10 sec: 12492.7, 60 sec: 12595.2, 300 sec: 12624.7). Total num frames: 47970304. Throughput: 0: 12604.8. Samples: 47937730. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:30:53,941][03942] Avg episode reward: [(0, '1052.515')] [2023-03-06 15:30:54,189][04272] Updated weights for policy 0, policy_version 46850 (0.0006) [2023-03-06 15:30:54,997][04272] Updated weights for policy 0, policy_version 46860 (0.0006) [2023-03-06 15:30:55,813][04272] Updated weights for policy 0, policy_version 46870 (0.0006) [2023-03-06 15:30:56,633][04272] Updated weights for policy 0, policy_version 46880 (0.0006) [2023-03-06 15:30:57,462][04272] Updated weights for policy 0, policy_version 46890 (0.0006) [2023-03-06 15:30:58,258][04272] Updated weights for policy 0, policy_version 46900 (0.0007) [2023-03-06 15:30:58,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12628.2). Total num frames: 48033792. Throughput: 0: 12602.6. Samples: 48013261. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:30:58,941][03942] Avg episode reward: [(0, '1297.968')] [2023-03-06 15:30:59,089][04272] Updated weights for policy 0, policy_version 46910 (0.0006) [2023-03-06 15:30:59,894][04272] Updated weights for policy 0, policy_version 46920 (0.0006) [2023-03-06 15:31:00,697][04272] Updated weights for policy 0, policy_version 46930 (0.0005) [2023-03-06 15:31:01,506][04272] Updated weights for policy 0, policy_version 46940 (0.0007) [2023-03-06 15:31:02,327][04272] Updated weights for policy 0, policy_version 46950 (0.0007) [2023-03-06 15:31:03,113][04272] Updated weights for policy 0, policy_version 46960 (0.0007) [2023-03-06 15:31:03,938][04272] Updated weights for policy 0, policy_version 46970 (0.0006) [2023-03-06 15:31:03,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12612.3, 300 sec: 12628.2). Total num frames: 48097280. Throughput: 0: 12607.0. Samples: 48089048. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:31:03,942][03942] Avg episode reward: [(0, '1168.360')] [2023-03-06 15:31:04,749][04272] Updated weights for policy 0, policy_version 46980 (0.0006) [2023-03-06 15:31:05,581][04272] Updated weights for policy 0, policy_version 46990 (0.0006) [2023-03-06 15:31:06,386][04272] Updated weights for policy 0, policy_version 47000 (0.0007) [2023-03-06 15:31:07,186][04272] Updated weights for policy 0, policy_version 47010 (0.0007) [2023-03-06 15:31:08,006][04272] Updated weights for policy 0, policy_version 47020 (0.0007) [2023-03-06 15:31:08,817][04272] Updated weights for policy 0, policy_version 47030 (0.0006) [2023-03-06 15:31:08,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12624.7). Total num frames: 48159744. Throughput: 0: 12605.2. Samples: 48126634. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:31:08,941][03942] Avg episode reward: [(0, '1243.846')] [2023-03-06 15:31:08,945][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000047031_48159744.pth... [2023-03-06 15:31:08,975][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000044071_45128704.pth [2023-03-06 15:31:09,647][04272] Updated weights for policy 0, policy_version 47040 (0.0006) [2023-03-06 15:31:10,473][04272] Updated weights for policy 0, policy_version 47050 (0.0007) [2023-03-06 15:31:11,275][04272] Updated weights for policy 0, policy_version 47060 (0.0007) [2023-03-06 15:31:12,073][04272] Updated weights for policy 0, policy_version 47070 (0.0006) [2023-03-06 15:31:12,913][04272] Updated weights for policy 0, policy_version 47080 (0.0006) [2023-03-06 15:31:13,720][04272] Updated weights for policy 0, policy_version 47090 (0.0007) [2023-03-06 15:31:13,940][03942] Fps is (10 sec: 12492.9, 60 sec: 12595.2, 300 sec: 12624.7). Total num frames: 48222208. Throughput: 0: 12593.2. Samples: 48202067. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:31:13,941][03942] Avg episode reward: [(0, '1319.275')] [2023-03-06 15:31:14,538][04272] Updated weights for policy 0, policy_version 47100 (0.0006) [2023-03-06 15:31:15,360][04272] Updated weights for policy 0, policy_version 47110 (0.0007) [2023-03-06 15:31:16,169][04272] Updated weights for policy 0, policy_version 47120 (0.0006) [2023-03-06 15:31:16,964][04272] Updated weights for policy 0, policy_version 47130 (0.0006) [2023-03-06 15:31:17,790][04272] Updated weights for policy 0, policy_version 47140 (0.0006) [2023-03-06 15:31:18,598][04272] Updated weights for policy 0, policy_version 47150 (0.0006) [2023-03-06 15:31:18,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12624.7). Total num frames: 48285696. Throughput: 0: 12601.5. Samples: 48277634. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:31:18,941][03942] Avg episode reward: [(0, '1245.962')] [2023-03-06 15:31:19,406][04272] Updated weights for policy 0, policy_version 47160 (0.0006) [2023-03-06 15:31:20,227][04272] Updated weights for policy 0, policy_version 47170 (0.0007) [2023-03-06 15:31:21,030][04272] Updated weights for policy 0, policy_version 47180 (0.0006) [2023-03-06 15:31:21,841][04272] Updated weights for policy 0, policy_version 47190 (0.0006) [2023-03-06 15:31:22,658][04272] Updated weights for policy 0, policy_version 47200 (0.0007) [2023-03-06 15:31:23,469][04272] Updated weights for policy 0, policy_version 47210 (0.0006) [2023-03-06 15:31:23,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12621.2). Total num frames: 48348160. Throughput: 0: 12594.4. Samples: 48315366. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:31:23,941][03942] Avg episode reward: [(0, '1265.005')] [2023-03-06 15:31:24,269][04272] Updated weights for policy 0, policy_version 47220 (0.0007) [2023-03-06 15:31:25,098][04272] Updated weights for policy 0, policy_version 47230 (0.0006) [2023-03-06 15:31:25,920][04272] Updated weights for policy 0, policy_version 47240 (0.0007) [2023-03-06 15:31:26,705][04272] Updated weights for policy 0, policy_version 47250 (0.0006) [2023-03-06 15:31:27,532][04272] Updated weights for policy 0, policy_version 47260 (0.0006) [2023-03-06 15:31:28,338][04272] Updated weights for policy 0, policy_version 47270 (0.0007) [2023-03-06 15:31:28,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12624.7). Total num frames: 48411648. Throughput: 0: 12594.3. Samples: 48391086. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:31:28,941][03942] Avg episode reward: [(0, '1203.002')] [2023-03-06 15:31:29,176][04272] Updated weights for policy 0, policy_version 47280 (0.0006) [2023-03-06 15:31:29,965][04272] Updated weights for policy 0, policy_version 47290 (0.0006) [2023-03-06 15:31:30,782][04272] Updated weights for policy 0, policy_version 47300 (0.0006) [2023-03-06 15:31:31,604][04272] Updated weights for policy 0, policy_version 47310 (0.0007) [2023-03-06 15:31:32,418][04272] Updated weights for policy 0, policy_version 47320 (0.0006) [2023-03-06 15:31:33,229][04272] Updated weights for policy 0, policy_version 47330 (0.0006) [2023-03-06 15:31:33,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12621.2). Total num frames: 48474112. Throughput: 0: 12589.5. Samples: 48466506. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:31:33,952][03942] Avg episode reward: [(0, '1122.684')] [2023-03-06 15:31:34,065][04272] Updated weights for policy 0, policy_version 47340 (0.0006) [2023-03-06 15:31:34,876][04272] Updated weights for policy 0, policy_version 47350 (0.0007) [2023-03-06 15:31:35,679][04272] Updated weights for policy 0, policy_version 47360 (0.0006) [2023-03-06 15:31:36,498][04272] Updated weights for policy 0, policy_version 47370 (0.0006) [2023-03-06 15:31:37,320][04272] Updated weights for policy 0, policy_version 47380 (0.0006) [2023-03-06 15:31:38,136][04272] Updated weights for policy 0, policy_version 47390 (0.0007) [2023-03-06 15:31:38,941][03942] Fps is (10 sec: 12492.8, 60 sec: 12578.1, 300 sec: 12621.2). Total num frames: 48536576. Throughput: 0: 12586.5. Samples: 48504123. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:31:38,941][03942] Avg episode reward: [(0, '1249.927')] [2023-03-06 15:31:38,950][04272] Updated weights for policy 0, policy_version 47400 (0.0006) [2023-03-06 15:31:39,762][04272] Updated weights for policy 0, policy_version 47410 (0.0007) [2023-03-06 15:31:40,585][04272] Updated weights for policy 0, policy_version 47420 (0.0006) [2023-03-06 15:31:41,402][04272] Updated weights for policy 0, policy_version 47430 (0.0006) [2023-03-06 15:31:42,219][04272] Updated weights for policy 0, policy_version 47440 (0.0006) [2023-03-06 15:31:43,030][04272] Updated weights for policy 0, policy_version 47450 (0.0007) [2023-03-06 15:31:43,847][04272] Updated weights for policy 0, policy_version 47460 (0.0007) [2023-03-06 15:31:43,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12578.1, 300 sec: 12621.2). Total num frames: 48600064. Throughput: 0: 12582.2. Samples: 48579459. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:31:43,941][03942] Avg episode reward: [(0, '1249.050')] [2023-03-06 15:31:44,656][04272] Updated weights for policy 0, policy_version 47470 (0.0007) [2023-03-06 15:31:45,481][04272] Updated weights for policy 0, policy_version 47480 (0.0008) [2023-03-06 15:31:46,279][04272] Updated weights for policy 0, policy_version 47490 (0.0006) [2023-03-06 15:31:47,097][04272] Updated weights for policy 0, policy_version 47500 (0.0007) [2023-03-06 15:31:47,907][04272] Updated weights for policy 0, policy_version 47510 (0.0007) [2023-03-06 15:31:48,711][04272] Updated weights for policy 0, policy_version 47520 (0.0006) [2023-03-06 15:31:48,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12578.1, 300 sec: 12621.2). Total num frames: 48662528. Throughput: 0: 12577.5. Samples: 48655035. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:31:48,941][03942] Avg episode reward: [(0, '1107.080')] [2023-03-06 15:31:49,533][04272] Updated weights for policy 0, policy_version 47530 (0.0006) [2023-03-06 15:31:50,345][04272] Updated weights for policy 0, policy_version 47540 (0.0007) [2023-03-06 15:31:50,834][04221] KL-divergence is very high: 128754.6719 [2023-03-06 15:31:51,174][04272] Updated weights for policy 0, policy_version 47550 (0.0006) [2023-03-06 15:31:51,978][04272] Updated weights for policy 0, policy_version 47560 (0.0006) [2023-03-06 15:31:52,781][04272] Updated weights for policy 0, policy_version 47570 (0.0006) [2023-03-06 15:31:53,592][04272] Updated weights for policy 0, policy_version 47580 (0.0006) [2023-03-06 15:31:53,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12621.2). Total num frames: 48726016. Throughput: 0: 12578.4. Samples: 48692662. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:31:53,941][03942] Avg episode reward: [(0, '1213.494')] [2023-03-06 15:31:53,987][04221] KL-divergence is very high: 575.4880 [2023-03-06 15:31:54,411][04272] Updated weights for policy 0, policy_version 47590 (0.0006) [2023-03-06 15:31:54,566][04221] KL-divergence is very high: 1136.2384 [2023-03-06 15:31:55,231][04272] Updated weights for policy 0, policy_version 47600 (0.0007) [2023-03-06 15:31:55,719][04221] KL-divergence is very high: 376486.1250 [2023-03-06 15:31:55,874][04221] KL-divergence is very high: 54254.8867 [2023-03-06 15:31:56,040][04272] Updated weights for policy 0, policy_version 47610 (0.0006) [2023-03-06 15:31:56,849][04272] Updated weights for policy 0, policy_version 47620 (0.0006) [2023-03-06 15:31:57,660][04272] Updated weights for policy 0, policy_version 47630 (0.0007) [2023-03-06 15:31:58,457][04272] Updated weights for policy 0, policy_version 47640 (0.0007) [2023-03-06 15:31:58,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12578.1, 300 sec: 12617.8). Total num frames: 48788480. Throughput: 0: 12582.2. Samples: 48768268. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:31:58,941][03942] Avg episode reward: [(0, '1171.164')] [2023-03-06 15:31:59,289][04272] Updated weights for policy 0, policy_version 47650 (0.0006) [2023-03-06 15:32:00,085][04272] Updated weights for policy 0, policy_version 47660 (0.0007) [2023-03-06 15:32:00,894][04272] Updated weights for policy 0, policy_version 47670 (0.0006) [2023-03-06 15:32:01,690][04272] Updated weights for policy 0, policy_version 47680 (0.0006) [2023-03-06 15:32:02,513][04272] Updated weights for policy 0, policy_version 47690 (0.0006) [2023-03-06 15:32:03,322][04272] Updated weights for policy 0, policy_version 47700 (0.0008) [2023-03-06 15:32:03,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12578.1, 300 sec: 12621.2). Total num frames: 48851968. Throughput: 0: 12588.2. Samples: 48844101. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:32:03,941][03942] Avg episode reward: [(0, '1202.723')] [2023-03-06 15:32:04,129][04272] Updated weights for policy 0, policy_version 47710 (0.0007) [2023-03-06 15:32:04,947][04272] Updated weights for policy 0, policy_version 47720 (0.0006) [2023-03-06 15:32:05,194][04221] KL-divergence is very high: 341.8444 [2023-03-06 15:32:05,753][04272] Updated weights for policy 0, policy_version 47730 (0.0006) [2023-03-06 15:32:06,579][04272] Updated weights for policy 0, policy_version 47740 (0.0006) [2023-03-06 15:32:07,397][04272] Updated weights for policy 0, policy_version 47750 (0.0006) [2023-03-06 15:32:08,208][04272] Updated weights for policy 0, policy_version 47760 (0.0006) [2023-03-06 15:32:08,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12578.1, 300 sec: 12617.8). Total num frames: 48914432. Throughput: 0: 12589.8. Samples: 48881909. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:32:08,941][03942] Avg episode reward: [(0, '1245.450')] [2023-03-06 15:32:09,025][04272] Updated weights for policy 0, policy_version 47770 (0.0006) [2023-03-06 15:32:09,837][04272] Updated weights for policy 0, policy_version 47780 (0.0006) [2023-03-06 15:32:10,653][04272] Updated weights for policy 0, policy_version 47790 (0.0006) [2023-03-06 15:32:11,468][04272] Updated weights for policy 0, policy_version 47800 (0.0006) [2023-03-06 15:32:12,276][04272] Updated weights for policy 0, policy_version 47810 (0.0008) [2023-03-06 15:32:13,093][04272] Updated weights for policy 0, policy_version 47820 (0.0007) [2023-03-06 15:32:13,916][04272] Updated weights for policy 0, policy_version 47830 (0.0007) [2023-03-06 15:32:13,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12617.8). Total num frames: 48977920. Throughput: 0: 12584.0. Samples: 48957365. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:32:13,941][03942] Avg episode reward: [(0, '905.281')] [2023-03-06 15:32:14,720][04272] Updated weights for policy 0, policy_version 47840 (0.0007) [2023-03-06 15:32:15,535][04272] Updated weights for policy 0, policy_version 47850 (0.0006) [2023-03-06 15:32:16,356][04272] Updated weights for policy 0, policy_version 47860 (0.0006) [2023-03-06 15:32:17,166][04272] Updated weights for policy 0, policy_version 47870 (0.0006) [2023-03-06 15:32:17,966][04272] Updated weights for policy 0, policy_version 47880 (0.0006) [2023-03-06 15:32:18,780][04272] Updated weights for policy 0, policy_version 47890 (0.0006) [2023-03-06 15:32:18,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12595.2, 300 sec: 12621.2). Total num frames: 49041408. Throughput: 0: 12590.2. Samples: 49033066. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:32:18,941][03942] Avg episode reward: [(0, '629.490')] [2023-03-06 15:32:19,557][04272] Updated weights for policy 0, policy_version 47900 (0.0006) [2023-03-06 15:32:20,387][04272] Updated weights for policy 0, policy_version 47910 (0.0006) [2023-03-06 15:32:21,206][04272] Updated weights for policy 0, policy_version 47920 (0.0006) [2023-03-06 15:32:22,024][04272] Updated weights for policy 0, policy_version 47930 (0.0007) [2023-03-06 15:32:22,835][04272] Updated weights for policy 0, policy_version 47940 (0.0006) [2023-03-06 15:32:23,639][04272] Updated weights for policy 0, policy_version 47950 (0.0006) [2023-03-06 15:32:23,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12617.8). Total num frames: 49103872. Throughput: 0: 12597.9. Samples: 49071027. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:32:23,941][03942] Avg episode reward: [(0, '1055.027')] [2023-03-06 15:32:24,465][04272] Updated weights for policy 0, policy_version 47960 (0.0006) [2023-03-06 15:32:25,290][04272] Updated weights for policy 0, policy_version 47970 (0.0007) [2023-03-06 15:32:26,088][04272] Updated weights for policy 0, policy_version 47980 (0.0006) [2023-03-06 15:32:26,907][04272] Updated weights for policy 0, policy_version 47990 (0.0006) [2023-03-06 15:32:27,707][04272] Updated weights for policy 0, policy_version 48000 (0.0006) [2023-03-06 15:32:28,497][04272] Updated weights for policy 0, policy_version 48010 (0.0007) [2023-03-06 15:32:28,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12617.8). Total num frames: 49167360. Throughput: 0: 12602.7. Samples: 49146580. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:32:28,941][03942] Avg episode reward: [(0, '1201.953')] [2023-03-06 15:32:29,326][04272] Updated weights for policy 0, policy_version 48020 (0.0006) [2023-03-06 15:32:30,143][04272] Updated weights for policy 0, policy_version 48030 (0.0006) [2023-03-06 15:32:30,934][04272] Updated weights for policy 0, policy_version 48040 (0.0007) [2023-03-06 15:32:31,754][04272] Updated weights for policy 0, policy_version 48050 (0.0007) [2023-03-06 15:32:32,581][04272] Updated weights for policy 0, policy_version 48060 (0.0007) [2023-03-06 15:32:33,381][04272] Updated weights for policy 0, policy_version 48070 (0.0007) [2023-03-06 15:32:33,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12612.3, 300 sec: 12621.2). Total num frames: 49230848. Throughput: 0: 12610.1. Samples: 49222491. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:32:33,941][03942] Avg episode reward: [(0, '1163.553')] [2023-03-06 15:32:34,197][04272] Updated weights for policy 0, policy_version 48080 (0.0006) [2023-03-06 15:32:34,996][04272] Updated weights for policy 0, policy_version 48090 (0.0007) [2023-03-06 15:32:35,801][04272] Updated weights for policy 0, policy_version 48100 (0.0006) [2023-03-06 15:32:36,615][04272] Updated weights for policy 0, policy_version 48110 (0.0006) [2023-03-06 15:32:37,435][04272] Updated weights for policy 0, policy_version 48120 (0.0006) [2023-03-06 15:32:38,247][04272] Updated weights for policy 0, policy_version 48130 (0.0006) [2023-03-06 15:32:38,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12617.8). Total num frames: 49293312. Throughput: 0: 12618.3. Samples: 49260487. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:32:38,941][03942] Avg episode reward: [(0, '1136.193')] [2023-03-06 15:32:39,054][04272] Updated weights for policy 0, policy_version 48140 (0.0006) [2023-03-06 15:32:39,880][04272] Updated weights for policy 0, policy_version 48150 (0.0007) [2023-03-06 15:32:40,693][04272] Updated weights for policy 0, policy_version 48160 (0.0007) [2023-03-06 15:32:41,512][04272] Updated weights for policy 0, policy_version 48170 (0.0006) [2023-03-06 15:32:42,306][04272] Updated weights for policy 0, policy_version 48180 (0.0006) [2023-03-06 15:32:43,118][04272] Updated weights for policy 0, policy_version 48190 (0.0007) [2023-03-06 15:32:43,910][04272] Updated weights for policy 0, policy_version 48200 (0.0006) [2023-03-06 15:32:43,941][03942] Fps is (10 sec: 12595.0, 60 sec: 12612.3, 300 sec: 12617.8). Total num frames: 49356800. Throughput: 0: 12615.6. Samples: 49335971. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:32:43,941][03942] Avg episode reward: [(0, '1165.194')] [2023-03-06 15:32:44,733][04272] Updated weights for policy 0, policy_version 48210 (0.0007) [2023-03-06 15:32:45,558][04272] Updated weights for policy 0, policy_version 48220 (0.0007) [2023-03-06 15:32:46,347][04272] Updated weights for policy 0, policy_version 48230 (0.0006) [2023-03-06 15:32:47,155][04272] Updated weights for policy 0, policy_version 48240 (0.0007) [2023-03-06 15:32:47,963][04272] Updated weights for policy 0, policy_version 48250 (0.0006) [2023-03-06 15:32:48,759][04272] Updated weights for policy 0, policy_version 48260 (0.0006) [2023-03-06 15:32:48,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12617.8). Total num frames: 49420288. Throughput: 0: 12620.6. Samples: 49412029. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:32:48,941][03942] Avg episode reward: [(0, '1167.256')] [2023-03-06 15:32:49,586][04272] Updated weights for policy 0, policy_version 48270 (0.0006) [2023-03-06 15:32:50,402][04272] Updated weights for policy 0, policy_version 48280 (0.0007) [2023-03-06 15:32:51,200][04272] Updated weights for policy 0, policy_version 48290 (0.0006) [2023-03-06 15:32:52,046][04272] Updated weights for policy 0, policy_version 48300 (0.0006) [2023-03-06 15:32:52,853][04272] Updated weights for policy 0, policy_version 48310 (0.0006) [2023-03-06 15:32:53,658][04272] Updated weights for policy 0, policy_version 48320 (0.0006) [2023-03-06 15:32:53,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12617.8). Total num frames: 49482752. Throughput: 0: 12619.4. Samples: 49449783. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:32:53,941][03942] Avg episode reward: [(0, '1265.854')] [2023-03-06 15:32:54,470][04272] Updated weights for policy 0, policy_version 48330 (0.0006) [2023-03-06 15:32:55,284][04272] Updated weights for policy 0, policy_version 48340 (0.0007) [2023-03-06 15:32:56,090][04272] Updated weights for policy 0, policy_version 48350 (0.0006) [2023-03-06 15:32:56,917][04272] Updated weights for policy 0, policy_version 48360 (0.0006) [2023-03-06 15:32:57,720][04272] Updated weights for policy 0, policy_version 48370 (0.0006) [2023-03-06 15:32:58,534][04272] Updated weights for policy 0, policy_version 48380 (0.0006) [2023-03-06 15:32:58,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12629.3, 300 sec: 12617.8). Total num frames: 49546240. Throughput: 0: 12620.9. Samples: 49525303. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:32:58,941][03942] Avg episode reward: [(0, '1230.120')] [2023-03-06 15:32:59,353][04272] Updated weights for policy 0, policy_version 48390 (0.0006) [2023-03-06 15:33:00,155][04272] Updated weights for policy 0, policy_version 48400 (0.0006) [2023-03-06 15:33:00,970][04272] Updated weights for policy 0, policy_version 48410 (0.0006) [2023-03-06 15:33:01,773][04272] Updated weights for policy 0, policy_version 48420 (0.0006) [2023-03-06 15:33:02,565][04272] Updated weights for policy 0, policy_version 48430 (0.0006) [2023-03-06 15:33:03,386][04272] Updated weights for policy 0, policy_version 48440 (0.0006) [2023-03-06 15:33:03,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12614.3). Total num frames: 49608704. Throughput: 0: 12626.3. Samples: 49601250. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:33:03,941][03942] Avg episode reward: [(0, '1202.605')] [2023-03-06 15:33:04,181][04272] Updated weights for policy 0, policy_version 48450 (0.0006) [2023-03-06 15:33:04,993][04272] Updated weights for policy 0, policy_version 48460 (0.0006) [2023-03-06 15:33:05,807][04272] Updated weights for policy 0, policy_version 48470 (0.0006) [2023-03-06 15:33:06,631][04272] Updated weights for policy 0, policy_version 48480 (0.0006) [2023-03-06 15:33:07,439][04272] Updated weights for policy 0, policy_version 48490 (0.0007) [2023-03-06 15:33:08,262][04272] Updated weights for policy 0, policy_version 48500 (0.0007) [2023-03-06 15:33:08,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12629.3, 300 sec: 12617.8). Total num frames: 49672192. Throughput: 0: 12626.0. Samples: 49639196. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:33:08,941][03942] Avg episode reward: [(0, '1115.899')] [2023-03-06 15:33:08,944][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000048508_49672192.pth... 
[2023-03-06 15:33:08,975][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000045553_46646272.pth [2023-03-06 15:33:09,053][04272] Updated weights for policy 0, policy_version 48510 (0.0006) [2023-03-06 15:33:09,871][04272] Updated weights for policy 0, policy_version 48520 (0.0006) [2023-03-06 15:33:10,686][04272] Updated weights for policy 0, policy_version 48530 (0.0006) [2023-03-06 15:33:11,482][04272] Updated weights for policy 0, policy_version 48540 (0.0006) [2023-03-06 15:33:12,308][04272] Updated weights for policy 0, policy_version 48550 (0.0007) [2023-03-06 15:33:13,118][04272] Updated weights for policy 0, policy_version 48560 (0.0006) [2023-03-06 15:33:13,925][04272] Updated weights for policy 0, policy_version 48570 (0.0006) [2023-03-06 15:33:13,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12617.8). Total num frames: 49735680. Throughput: 0: 12627.6. Samples: 49714822. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:33:13,941][03942] Avg episode reward: [(0, '1227.049')] [2023-03-06 15:33:14,728][04272] Updated weights for policy 0, policy_version 48580 (0.0006) [2023-03-06 15:33:15,540][04272] Updated weights for policy 0, policy_version 48590 (0.0006) [2023-03-06 15:33:16,369][04272] Updated weights for policy 0, policy_version 48600 (0.0007) [2023-03-06 15:33:17,164][04272] Updated weights for policy 0, policy_version 48610 (0.0007) [2023-03-06 15:33:17,963][04272] Updated weights for policy 0, policy_version 48620 (0.0006) [2023-03-06 15:33:18,792][04272] Updated weights for policy 0, policy_version 48630 (0.0007) [2023-03-06 15:33:18,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12614.3). Total num frames: 49798144. Throughput: 0: 12625.8. Samples: 49790651. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:33:18,941][03942] Avg episode reward: [(0, '1273.170')] [2023-03-06 15:33:19,606][04272] Updated weights for policy 0, policy_version 48640 (0.0006) [2023-03-06 15:33:20,404][04272] Updated weights for policy 0, policy_version 48650 (0.0006) [2023-03-06 15:33:21,204][04272] Updated weights for policy 0, policy_version 48660 (0.0006) [2023-03-06 15:33:22,010][04272] Updated weights for policy 0, policy_version 48670 (0.0006) [2023-03-06 15:33:22,816][04272] Updated weights for policy 0, policy_version 48680 (0.0006) [2023-03-06 15:33:23,609][04272] Updated weights for policy 0, policy_version 48690 (0.0006) [2023-03-06 15:33:23,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12617.8). Total num frames: 49861632. Throughput: 0: 12629.0. Samples: 49828793. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:33:23,941][03942] Avg episode reward: [(0, '1248.487')] [2023-03-06 15:33:24,429][04272] Updated weights for policy 0, policy_version 48700 (0.0007) [2023-03-06 15:33:25,252][04272] Updated weights for policy 0, policy_version 48710 (0.0008) [2023-03-06 15:33:26,051][04272] Updated weights for policy 0, policy_version 48720 (0.0007) [2023-03-06 15:33:26,880][04272] Updated weights for policy 0, policy_version 48730 (0.0006) [2023-03-06 15:33:27,677][04272] Updated weights for policy 0, policy_version 48740 (0.0006) [2023-03-06 15:33:28,500][04272] Updated weights for policy 0, policy_version 48750 (0.0007) [2023-03-06 15:33:28,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12629.3, 300 sec: 12617.8). Total num frames: 49925120. Throughput: 0: 12638.0. Samples: 49904679. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:33:28,941][03942] Avg episode reward: [(0, '1270.928')] [2023-03-06 15:33:29,306][04272] Updated weights for policy 0, policy_version 48760 (0.0006) [2023-03-06 15:33:30,108][04272] Updated weights for policy 0, policy_version 48770 (0.0007) [2023-03-06 15:33:30,944][04272] Updated weights for policy 0, policy_version 48780 (0.0007) [2023-03-06 15:33:31,755][04272] Updated weights for policy 0, policy_version 48790 (0.0007) [2023-03-06 15:33:32,557][04272] Updated weights for policy 0, policy_version 48800 (0.0007) [2023-03-06 15:33:33,352][04272] Updated weights for policy 0, policy_version 48810 (0.0006) [2023-03-06 15:33:33,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12614.3). Total num frames: 49987584. Throughput: 0: 12629.7. Samples: 49980363. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:33:33,941][03942] Avg episode reward: [(0, '1258.613')] [2023-03-06 15:33:34,144][04272] Updated weights for policy 0, policy_version 48820 (0.0006) [2023-03-06 15:33:34,956][04272] Updated weights for policy 0, policy_version 48830 (0.0006) [2023-03-06 15:33:35,798][04272] Updated weights for policy 0, policy_version 48840 (0.0007) [2023-03-06 15:33:36,602][04272] Updated weights for policy 0, policy_version 48850 (0.0007) [2023-03-06 15:33:37,405][04272] Updated weights for policy 0, policy_version 48860 (0.0006) [2023-03-06 15:33:38,218][04272] Updated weights for policy 0, policy_version 48870 (0.0007) [2023-03-06 15:33:38,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12629.3, 300 sec: 12614.3). Total num frames: 50051072. Throughput: 0: 12635.9. Samples: 50018398. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:33:38,941][03942] Avg episode reward: [(0, '1054.205')] [2023-03-06 15:33:39,030][04272] Updated weights for policy 0, policy_version 48880 (0.0006) [2023-03-06 15:33:39,829][04272] Updated weights for policy 0, policy_version 48890 (0.0007) [2023-03-06 15:33:40,636][04272] Updated weights for policy 0, policy_version 48900 (0.0007) [2023-03-06 15:33:41,458][04272] Updated weights for policy 0, policy_version 48910 (0.0007) [2023-03-06 15:33:42,275][04272] Updated weights for policy 0, policy_version 48920 (0.0006) [2023-03-06 15:33:43,082][04272] Updated weights for policy 0, policy_version 48930 (0.0007) [2023-03-06 15:33:43,906][04272] Updated weights for policy 0, policy_version 48940 (0.0006) [2023-03-06 15:33:43,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.4, 300 sec: 12614.3). Total num frames: 50114560. Throughput: 0: 12643.2. Samples: 50094249. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:33:43,941][03942] Avg episode reward: [(0, '1201.124')] [2023-03-06 15:33:44,733][04272] Updated weights for policy 0, policy_version 48950 (0.0006) [2023-03-06 15:33:45,518][04272] Updated weights for policy 0, policy_version 48960 (0.0006) [2023-03-06 15:33:46,326][04272] Updated weights for policy 0, policy_version 48970 (0.0006) [2023-03-06 15:33:47,134][04272] Updated weights for policy 0, policy_version 48980 (0.0007) [2023-03-06 15:33:47,946][04272] Updated weights for policy 0, policy_version 48990 (0.0006) [2023-03-06 15:33:48,749][04272] Updated weights for policy 0, policy_version 49000 (0.0006) [2023-03-06 15:33:48,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12614.3). Total num frames: 50178048. Throughput: 0: 12635.7. Samples: 50169856. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:33:48,941][03942] Avg episode reward: [(0, '1038.893')] [2023-03-06 15:33:49,577][04272] Updated weights for policy 0, policy_version 49010 (0.0006) [2023-03-06 15:33:50,377][04272] Updated weights for policy 0, policy_version 49020 (0.0007) [2023-03-06 15:33:51,175][04272] Updated weights for policy 0, policy_version 49030 (0.0007) [2023-03-06 15:33:51,990][04272] Updated weights for policy 0, policy_version 49040 (0.0006) [2023-03-06 15:33:52,805][04272] Updated weights for policy 0, policy_version 49050 (0.0006) [2023-03-06 15:33:53,610][04272] Updated weights for policy 0, policy_version 49060 (0.0006) [2023-03-06 15:33:53,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12646.4, 300 sec: 12617.8). Total num frames: 50241536. Throughput: 0: 12635.9. Samples: 50207811. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:33:53,941][03942] Avg episode reward: [(0, '990.876')] [2023-03-06 15:33:54,413][04272] Updated weights for policy 0, policy_version 49070 (0.0006) [2023-03-06 15:33:55,229][04272] Updated weights for policy 0, policy_version 49080 (0.0007) [2023-03-06 15:33:56,039][04272] Updated weights for policy 0, policy_version 49090 (0.0006) [2023-03-06 15:33:56,861][04272] Updated weights for policy 0, policy_version 49100 (0.0006) [2023-03-06 15:33:57,665][04272] Updated weights for policy 0, policy_version 49110 (0.0008) [2023-03-06 15:33:58,474][04272] Updated weights for policy 0, policy_version 49120 (0.0006) [2023-03-06 15:33:58,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12614.3). Total num frames: 50304000. Throughput: 0: 12641.2. Samples: 50283676. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:33:58,941][03942] Avg episode reward: [(0, '924.704')] [2023-03-06 15:33:59,288][04272] Updated weights for policy 0, policy_version 49130 (0.0006) [2023-03-06 15:34:00,099][04272] Updated weights for policy 0, policy_version 49140 (0.0007) [2023-03-06 15:34:00,909][04272] Updated weights for policy 0, policy_version 49150 (0.0006) [2023-03-06 15:34:01,719][04272] Updated weights for policy 0, policy_version 49160 (0.0006) [2023-03-06 15:34:02,534][04272] Updated weights for policy 0, policy_version 49170 (0.0006) [2023-03-06 15:34:03,332][04272] Updated weights for policy 0, policy_version 49180 (0.0006) [2023-03-06 15:34:03,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12646.4, 300 sec: 12614.3). Total num frames: 50367488. Throughput: 0: 12641.9. Samples: 50359535. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:34:03,941][03942] Avg episode reward: [(0, '991.807')] [2023-03-06 15:34:04,137][04272] Updated weights for policy 0, policy_version 49190 (0.0005) [2023-03-06 15:34:04,941][04272] Updated weights for policy 0, policy_version 49200 (0.0006) [2023-03-06 15:34:05,762][04272] Updated weights for policy 0, policy_version 49210 (0.0007) [2023-03-06 15:34:06,580][04272] Updated weights for policy 0, policy_version 49220 (0.0006) [2023-03-06 15:34:07,390][04272] Updated weights for policy 0, policy_version 49230 (0.0006) [2023-03-06 15:34:08,217][04272] Updated weights for policy 0, policy_version 49240 (0.0007) [2023-03-06 15:34:08,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12646.4, 300 sec: 12617.8). Total num frames: 50430976. Throughput: 0: 12638.5. Samples: 50397527. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:34:08,941][03942] Avg episode reward: [(0, '1094.567')] [2023-03-06 15:34:09,019][04272] Updated weights for policy 0, policy_version 49250 (0.0006) [2023-03-06 15:34:09,838][04272] Updated weights for policy 0, policy_version 49260 (0.0006) [2023-03-06 15:34:10,651][04272] Updated weights for policy 0, policy_version 49270 (0.0006) [2023-03-06 15:34:11,464][04272] Updated weights for policy 0, policy_version 49280 (0.0007) [2023-03-06 15:34:12,257][04272] Updated weights for policy 0, policy_version 49290 (0.0006) [2023-03-06 15:34:13,066][04272] Updated weights for policy 0, policy_version 49300 (0.0006) [2023-03-06 15:34:13,896][04272] Updated weights for policy 0, policy_version 49310 (0.0007) [2023-03-06 15:34:13,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12614.3). Total num frames: 50493440. Throughput: 0: 12632.5. Samples: 50473141. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:34:13,941][03942] Avg episode reward: [(0, '1106.869')] [2023-03-06 15:34:14,690][04272] Updated weights for policy 0, policy_version 49320 (0.0006) [2023-03-06 15:34:15,497][04272] Updated weights for policy 0, policy_version 49330 (0.0006) [2023-03-06 15:34:16,322][04272] Updated weights for policy 0, policy_version 49340 (0.0006) [2023-03-06 15:34:17,115][04272] Updated weights for policy 0, policy_version 49350 (0.0006) [2023-03-06 15:34:17,937][04272] Updated weights for policy 0, policy_version 49360 (0.0006) [2023-03-06 15:34:18,745][04272] Updated weights for policy 0, policy_version 49370 (0.0006) [2023-03-06 15:34:18,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12646.4, 300 sec: 12614.3). Total num frames: 50556928. Throughput: 0: 12636.0. Samples: 50548984. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:34:18,941][03942] Avg episode reward: [(0, '1199.837')] [2023-03-06 15:34:19,558][04272] Updated weights for policy 0, policy_version 49380 (0.0006) [2023-03-06 15:34:20,367][04272] Updated weights for policy 0, policy_version 49390 (0.0007) [2023-03-06 15:34:21,189][04272] Updated weights for policy 0, policy_version 49400 (0.0007) [2023-03-06 15:34:21,980][04272] Updated weights for policy 0, policy_version 49410 (0.0006) [2023-03-06 15:34:22,797][04272] Updated weights for policy 0, policy_version 49420 (0.0007) [2023-03-06 15:34:23,609][04272] Updated weights for policy 0, policy_version 49430 (0.0007) [2023-03-06 15:34:23,941][03942] Fps is (10 sec: 12697.7, 60 sec: 12646.4, 300 sec: 12614.3). Total num frames: 50620416. Throughput: 0: 12632.7. Samples: 50586868. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:34:23,941][03942] Avg episode reward: [(0, '829.878')] [2023-03-06 15:34:24,425][04272] Updated weights for policy 0, policy_version 49440 (0.0007) [2023-03-06 15:34:25,230][04272] Updated weights for policy 0, policy_version 49450 (0.0007) [2023-03-06 15:34:26,038][04272] Updated weights for policy 0, policy_version 49460 (0.0006) [2023-03-06 15:34:26,855][04272] Updated weights for policy 0, policy_version 49470 (0.0006) [2023-03-06 15:34:27,656][04272] Updated weights for policy 0, policy_version 49480 (0.0007) [2023-03-06 15:34:28,462][04272] Updated weights for policy 0, policy_version 49490 (0.0007) [2023-03-06 15:34:28,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12614.3). Total num frames: 50682880. Throughput: 0: 12633.5. Samples: 50662756. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:34:28,941][03942] Avg episode reward: [(0, '1183.478')] [2023-03-06 15:34:29,282][04272] Updated weights for policy 0, policy_version 49500 (0.0006) [2023-03-06 15:34:30,086][04272] Updated weights for policy 0, policy_version 49510 (0.0006) [2023-03-06 15:34:30,893][04272] Updated weights for policy 0, policy_version 49520 (0.0006) [2023-03-06 15:34:31,697][04272] Updated weights for policy 0, policy_version 49530 (0.0006) [2023-03-06 15:34:32,511][04272] Updated weights for policy 0, policy_version 49540 (0.0007) [2023-03-06 15:34:33,313][04272] Updated weights for policy 0, policy_version 49550 (0.0007) [2023-03-06 15:34:33,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12646.4, 300 sec: 12614.3). Total num frames: 50746368. Throughput: 0: 12638.7. Samples: 50738597. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:34:33,941][03942] Avg episode reward: [(0, '1195.767')] [2023-03-06 15:34:34,127][04272] Updated weights for policy 0, policy_version 49560 (0.0006) [2023-03-06 15:34:34,926][04272] Updated weights for policy 0, policy_version 49570 (0.0006) [2023-03-06 15:34:35,727][04272] Updated weights for policy 0, policy_version 49580 (0.0006) [2023-03-06 15:34:36,555][04272] Updated weights for policy 0, policy_version 49590 (0.0006) [2023-03-06 15:34:37,366][04272] Updated weights for policy 0, policy_version 49600 (0.0006) [2023-03-06 15:34:38,169][04272] Updated weights for policy 0, policy_version 49610 (0.0006) [2023-03-06 15:34:38,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12646.4, 300 sec: 12614.3). Total num frames: 50809856. Throughput: 0: 12640.4. Samples: 50776628. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:34:38,941][03942] Avg episode reward: [(0, '1062.406')] [2023-03-06 15:34:38,984][04272] Updated weights for policy 0, policy_version 49620 (0.0007) [2023-03-06 15:34:39,791][04272] Updated weights for policy 0, policy_version 49630 (0.0006) [2023-03-06 15:34:40,589][04272] Updated weights for policy 0, policy_version 49640 (0.0006) [2023-03-06 15:34:41,385][04272] Updated weights for policy 0, policy_version 49650 (0.0007) [2023-03-06 15:34:42,196][04272] Updated weights for policy 0, policy_version 49660 (0.0006) [2023-03-06 15:34:43,003][04272] Updated weights for policy 0, policy_version 49670 (0.0007) [2023-03-06 15:34:43,827][04272] Updated weights for policy 0, policy_version 49680 (0.0006) [2023-03-06 15:34:43,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12646.4, 300 sec: 12617.8). Total num frames: 50873344. Throughput: 0: 12646.7. Samples: 50852776. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:34:43,941][03942] Avg episode reward: [(0, '1186.059')] [2023-03-06 15:34:44,629][04272] Updated weights for policy 0, policy_version 49690 (0.0006) [2023-03-06 15:34:45,429][04272] Updated weights for policy 0, policy_version 49700 (0.0006) [2023-03-06 15:34:46,223][04272] Updated weights for policy 0, policy_version 49710 (0.0006) [2023-03-06 15:34:46,630][04221] KL-divergence is very high: 184.2555 [2023-03-06 15:34:47,043][04272] Updated weights for policy 0, policy_version 49720 (0.0006) [2023-03-06 15:34:47,824][04272] Updated weights for policy 0, policy_version 49730 (0.0006) [2023-03-06 15:34:48,649][04272] Updated weights for policy 0, policy_version 49740 (0.0007) [2023-03-06 15:34:48,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12646.4, 300 sec: 12617.8). Total num frames: 50936832. Throughput: 0: 12657.0. Samples: 50929101. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:34:48,941][03942] Avg episode reward: [(0, '1131.762')] [2023-03-06 15:34:49,449][04272] Updated weights for policy 0, policy_version 49750 (0.0007) [2023-03-06 15:34:50,263][04272] Updated weights for policy 0, policy_version 49760 (0.0006) [2023-03-06 15:34:51,066][04272] Updated weights for policy 0, policy_version 49770 (0.0006) [2023-03-06 15:34:51,873][04272] Updated weights for policy 0, policy_version 49780 (0.0007) [2023-03-06 15:34:52,685][04272] Updated weights for policy 0, policy_version 49790 (0.0006) [2023-03-06 15:34:53,489][04272] Updated weights for policy 0, policy_version 49800 (0.0006) [2023-03-06 15:34:53,872][04221] KL-divergence is very high: 165.7026 [2023-03-06 15:34:53,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12646.4, 300 sec: 12621.2). Total num frames: 51000320. Throughput: 0: 12657.5. Samples: 50967113. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:34:53,941][03942] Avg episode reward: [(0, '1149.640')] [2023-03-06 15:34:54,296][04272] Updated weights for policy 0, policy_version 49810 (0.0006) [2023-03-06 15:34:55,098][04272] Updated weights for policy 0, policy_version 49820 (0.0006) [2023-03-06 15:34:55,905][04272] Updated weights for policy 0, policy_version 49830 (0.0006) [2023-03-06 15:34:56,707][04272] Updated weights for policy 0, policy_version 49840 (0.0006) [2023-03-06 15:34:56,870][04221] KL-divergence is very high: 143.9739 [2023-03-06 15:34:57,531][04272] Updated weights for policy 0, policy_version 49850 (0.0006) [2023-03-06 15:34:57,606][04221] KL-divergence is very high: 163.7479 [2023-03-06 15:34:58,319][04272] Updated weights for policy 0, policy_version 49860 (0.0007) [2023-03-06 15:34:58,405][04221] KL-divergence is very high: 156958.1094 [2023-03-06 15:34:58,486][04221] KL-divergence is very high: 17715.6230 [2023-03-06 15:34:58,647][04221] KL-divergence is very high: 592.4879 [2023-03-06 15:34:58,820][04221] KL-divergence is very 
high: 97933.1875 [2023-03-06 15:34:58,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12663.5, 300 sec: 12621.2). Total num frames: 51063808. Throughput: 0: 12672.0. Samples: 51043380. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:34:58,941][03942] Avg episode reward: [(0, '1292.217')] [2023-03-06 15:34:58,976][04221] KL-divergence is very high: 219.5310 [2023-03-06 15:34:59,033][04221] KL-divergence is very high: 79175.7344 [2023-03-06 15:34:59,141][04272] Updated weights for policy 0, policy_version 49870 (0.0006) [2023-03-06 15:34:59,941][04272] Updated weights for policy 0, policy_version 49880 (0.0006) [2023-03-06 15:35:00,749][04272] Updated weights for policy 0, policy_version 49890 (0.0007) [2023-03-06 15:35:01,550][04272] Updated weights for policy 0, policy_version 49900 (0.0006) [2023-03-06 15:35:01,636][04221] KL-divergence is very high: 144.1357 [2023-03-06 15:35:02,358][04272] Updated weights for policy 0, policy_version 49910 (0.0007) [2023-03-06 15:35:03,169][04272] Updated weights for policy 0, policy_version 49920 (0.0006) [2023-03-06 15:35:03,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12663.5, 300 sec: 12621.2). Total num frames: 51127296. Throughput: 0: 12673.7. Samples: 51119299. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:35:03,941][03942] Avg episode reward: [(0, '1216.901')] [2023-03-06 15:35:03,985][04272] Updated weights for policy 0, policy_version 49930 (0.0006) [2023-03-06 15:35:04,802][04272] Updated weights for policy 0, policy_version 49940 (0.0007) [2023-03-06 15:35:05,599][04272] Updated weights for policy 0, policy_version 49950 (0.0006) [2023-03-06 15:35:06,415][04272] Updated weights for policy 0, policy_version 49960 (0.0006) [2023-03-06 15:35:07,224][04272] Updated weights for policy 0, policy_version 49970 (0.0006) [2023-03-06 15:35:08,032][04272] Updated weights for policy 0, policy_version 49980 (0.0007) [2023-03-06 15:35:08,849][04272] Updated weights for policy 0, policy_version 49990 (0.0007) [2023-03-06 15:35:08,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12663.5, 300 sec: 12624.7). Total num frames: 51190784. Throughput: 0: 12677.2. Samples: 51157342. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:35:08,941][03942] Avg episode reward: [(0, '1228.331')] [2023-03-06 15:35:08,945][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000049991_51190784.pth... 
[2023-03-06 15:35:08,977][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000047031_48159744.pth [2023-03-06 15:35:09,660][04272] Updated weights for policy 0, policy_version 50000 (0.0006) [2023-03-06 15:35:10,462][04272] Updated weights for policy 0, policy_version 50010 (0.0006) [2023-03-06 15:35:11,270][04272] Updated weights for policy 0, policy_version 50020 (0.0006) [2023-03-06 15:35:12,097][04272] Updated weights for policy 0, policy_version 50030 (0.0007) [2023-03-06 15:35:12,897][04272] Updated weights for policy 0, policy_version 50040 (0.0006) [2023-03-06 15:35:13,699][04272] Updated weights for policy 0, policy_version 50050 (0.0006) [2023-03-06 15:35:13,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12663.5, 300 sec: 12621.2). Total num frames: 51253248. Throughput: 0: 12673.8. Samples: 51233076. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:35:13,941][03942] Avg episode reward: [(0, '1192.049')] [2023-03-06 15:35:14,506][04272] Updated weights for policy 0, policy_version 50060 (0.0007) [2023-03-06 15:35:15,311][04272] Updated weights for policy 0, policy_version 50070 (0.0007) [2023-03-06 15:35:16,133][04272] Updated weights for policy 0, policy_version 50080 (0.0005) [2023-03-06 15:35:16,914][04272] Updated weights for policy 0, policy_version 50090 (0.0007) [2023-03-06 15:35:17,740][04272] Updated weights for policy 0, policy_version 50100 (0.0006) [2023-03-06 15:35:18,539][04272] Updated weights for policy 0, policy_version 50110 (0.0006) [2023-03-06 15:35:18,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12663.5, 300 sec: 12624.7). Total num frames: 51316736. Throughput: 0: 12684.8. Samples: 51309411. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:35:18,941][03942] Avg episode reward: [(0, '1279.000')] [2023-03-06 15:35:19,342][04272] Updated weights for policy 0, policy_version 50120 (0.0006) [2023-03-06 15:35:20,150][04272] Updated weights for policy 0, policy_version 50130 (0.0006) [2023-03-06 15:35:20,962][04272] Updated weights for policy 0, policy_version 50140 (0.0007) [2023-03-06 15:35:21,765][04272] Updated weights for policy 0, policy_version 50150 (0.0007) [2023-03-06 15:35:22,590][04272] Updated weights for policy 0, policy_version 50160 (0.0007) [2023-03-06 15:35:23,385][04272] Updated weights for policy 0, policy_version 50170 (0.0006) [2023-03-06 15:35:23,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12663.5, 300 sec: 12624.7). Total num frames: 51380224. Throughput: 0: 12685.1. Samples: 51347459. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:35:23,941][03942] Avg episode reward: [(0, '1283.216')] [2023-03-06 15:35:24,229][04272] Updated weights for policy 0, policy_version 50180 (0.0007) [2023-03-06 15:35:25,039][04272] Updated weights for policy 0, policy_version 50190 (0.0008) [2023-03-06 15:35:25,849][04272] Updated weights for policy 0, policy_version 50200 (0.0006) [2023-03-06 15:35:26,675][04272] Updated weights for policy 0, policy_version 50210 (0.0006) [2023-03-06 15:35:27,470][04272] Updated weights for policy 0, policy_version 50220 (0.0007) [2023-03-06 15:35:28,275][04272] Updated weights for policy 0, policy_version 50230 (0.0006) [2023-03-06 15:35:28,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12680.5, 300 sec: 12628.2). Total num frames: 51443712. Throughput: 0: 12667.9. Samples: 51422834. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:35:28,941][03942] Avg episode reward: [(0, '1276.679')] [2023-03-06 15:35:29,080][04272] Updated weights for policy 0, policy_version 50240 (0.0007) [2023-03-06 15:35:29,898][04272] Updated weights for policy 0, policy_version 50250 (0.0006) [2023-03-06 15:35:30,701][04272] Updated weights for policy 0, policy_version 50260 (0.0006) [2023-03-06 15:35:31,511][04272] Updated weights for policy 0, policy_version 50270 (0.0007) [2023-03-06 15:35:32,318][04272] Updated weights for policy 0, policy_version 50280 (0.0007) [2023-03-06 15:35:33,126][04272] Updated weights for policy 0, policy_version 50290 (0.0006) [2023-03-06 15:35:33,933][04272] Updated weights for policy 0, policy_version 50300 (0.0005) [2023-03-06 15:35:33,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12628.2). Total num frames: 51507200. Throughput: 0: 12662.7. Samples: 51498921. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:35:33,941][03942] Avg episode reward: [(0, '1368.611')] [2023-03-06 15:35:34,762][04272] Updated weights for policy 0, policy_version 50310 (0.0006) [2023-03-06 15:35:35,558][04272] Updated weights for policy 0, policy_version 50320 (0.0006) [2023-03-06 15:35:36,358][04272] Updated weights for policy 0, policy_version 50330 (0.0007) [2023-03-06 15:35:37,166][04272] Updated weights for policy 0, policy_version 50340 (0.0006) [2023-03-06 15:35:38,005][04272] Updated weights for policy 0, policy_version 50350 (0.0006) [2023-03-06 15:35:38,789][04272] Updated weights for policy 0, policy_version 50360 (0.0006) [2023-03-06 15:35:38,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12663.5, 300 sec: 12624.7). Total num frames: 51569664. Throughput: 0: 12661.9. Samples: 51536899. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:35:38,941][03942] Avg episode reward: [(0, '1325.432')] [2023-03-06 15:35:39,599][04272] Updated weights for policy 0, policy_version 50370 (0.0006) [2023-03-06 15:35:40,403][04272] Updated weights for policy 0, policy_version 50380 (0.0006) [2023-03-06 15:35:41,210][04272] Updated weights for policy 0, policy_version 50390 (0.0006) [2023-03-06 15:35:42,010][04272] Updated weights for policy 0, policy_version 50400 (0.0006) [2023-03-06 15:35:42,816][04272] Updated weights for policy 0, policy_version 50410 (0.0006) [2023-03-06 15:35:43,638][04272] Updated weights for policy 0, policy_version 50420 (0.0006) [2023-03-06 15:35:43,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12663.4, 300 sec: 12628.2). Total num frames: 51633152. Throughput: 0: 12658.1. Samples: 51612995. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:35:43,941][03942] Avg episode reward: [(0, '1154.008')] [2023-03-06 15:35:44,445][04272] Updated weights for policy 0, policy_version 50430 (0.0005) [2023-03-06 15:35:45,275][04272] Updated weights for policy 0, policy_version 50440 (0.0006) [2023-03-06 15:35:46,082][04272] Updated weights for policy 0, policy_version 50450 (0.0006) [2023-03-06 15:35:46,894][04272] Updated weights for policy 0, policy_version 50460 (0.0006) [2023-03-06 15:35:47,709][04272] Updated weights for policy 0, policy_version 50470 (0.0006) [2023-03-06 15:35:48,502][04272] Updated weights for policy 0, policy_version 50480 (0.0007) [2023-03-06 15:35:48,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12663.5, 300 sec: 12631.7). Total num frames: 51696640. Throughput: 0: 12653.0. Samples: 51688685. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:35:48,941][03942] Avg episode reward: [(0, '1294.876')] [2023-03-06 15:35:49,321][04272] Updated weights for policy 0, policy_version 50490 (0.0006) [2023-03-06 15:35:50,134][04272] Updated weights for policy 0, policy_version 50500 (0.0006) [2023-03-06 15:35:50,934][04272] Updated weights for policy 0, policy_version 50510 (0.0007) [2023-03-06 15:35:51,752][04272] Updated weights for policy 0, policy_version 50520 (0.0005) [2023-03-06 15:35:52,545][04272] Updated weights for policy 0, policy_version 50530 (0.0006) [2023-03-06 15:35:53,340][04272] Updated weights for policy 0, policy_version 50540 (0.0006) [2023-03-06 15:35:53,941][03942] Fps is (10 sec: 12697.7, 60 sec: 12663.5, 300 sec: 12631.6). Total num frames: 51760128. Throughput: 0: 12651.4. Samples: 51726654. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:35:53,941][03942] Avg episode reward: [(0, '899.564')] [2023-03-06 15:35:54,188][04272] Updated weights for policy 0, policy_version 50550 (0.0005) [2023-03-06 15:35:54,983][04272] Updated weights for policy 0, policy_version 50560 (0.0007) [2023-03-06 15:35:55,774][04272] Updated weights for policy 0, policy_version 50570 (0.0006) [2023-03-06 15:35:56,596][04272] Updated weights for policy 0, policy_version 50580 (0.0006) [2023-03-06 15:35:57,414][04272] Updated weights for policy 0, policy_version 50590 (0.0006) [2023-03-06 15:35:58,210][04272] Updated weights for policy 0, policy_version 50600 (0.0006) [2023-03-06 15:35:58,941][04221] KL-divergence is very high: 729.5499 [2023-03-06 15:35:58,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12646.4, 300 sec: 12628.2). Total num frames: 51822592. Throughput: 0: 12654.9. Samples: 51802546. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:35:58,941][03942] Avg episode reward: [(0, '1080.019')] [2023-03-06 15:35:59,025][04272] Updated weights for policy 0, policy_version 50610 (0.0006) [2023-03-06 15:35:59,259][04221] KL-divergence is very high: 323.3760 [2023-03-06 15:35:59,842][04272] Updated weights for policy 0, policy_version 50620 (0.0006) [2023-03-06 15:36:00,646][04272] Updated weights for policy 0, policy_version 50630 (0.0008) [2023-03-06 15:36:01,455][04272] Updated weights for policy 0, policy_version 50640 (0.0007) [2023-03-06 15:36:02,261][04272] Updated weights for policy 0, policy_version 50650 (0.0006) [2023-03-06 15:36:03,078][04272] Updated weights for policy 0, policy_version 50660 (0.0007) [2023-03-06 15:36:03,897][04272] Updated weights for policy 0, policy_version 50670 (0.0006) [2023-03-06 15:36:03,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12646.4, 300 sec: 12631.6). Total num frames: 51886080. Throughput: 0: 12640.3. Samples: 51878224. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:36:03,952][03942] Avg episode reward: [(0, '723.859')] [2023-03-06 15:36:04,717][04272] Updated weights for policy 0, policy_version 50680 (0.0006) [2023-03-06 15:36:05,521][04272] Updated weights for policy 0, policy_version 50690 (0.0007) [2023-03-06 15:36:06,304][04272] Updated weights for policy 0, policy_version 50700 (0.0007) [2023-03-06 15:36:07,123][04272] Updated weights for policy 0, policy_version 50710 (0.0007) [2023-03-06 15:36:07,942][04272] Updated weights for policy 0, policy_version 50720 (0.0006) [2023-03-06 15:36:08,725][04272] Updated weights for policy 0, policy_version 50730 (0.0006) [2023-03-06 15:36:08,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12646.4, 300 sec: 12635.1). Total num frames: 51949568. Throughput: 0: 12643.5. Samples: 51916416. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:36:08,952][03942] Avg episode reward: [(0, '1229.268')] [2023-03-06 15:36:09,552][04272] Updated weights for policy 0, policy_version 50740 (0.0007) [2023-03-06 15:36:10,369][04272] Updated weights for policy 0, policy_version 50750 (0.0006) [2023-03-06 15:36:11,175][04272] Updated weights for policy 0, policy_version 50760 (0.0006) [2023-03-06 15:36:11,988][04272] Updated weights for policy 0, policy_version 50770 (0.0007) [2023-03-06 15:36:12,788][04272] Updated weights for policy 0, policy_version 50780 (0.0007) [2023-03-06 15:36:13,597][04272] Updated weights for policy 0, policy_version 50790 (0.0006) [2023-03-06 15:36:13,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12663.5, 300 sec: 12635.1). Total num frames: 52013056. Throughput: 0: 12652.3. Samples: 51992188. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:36:13,951][03942] Avg episode reward: [(0, '1225.429')] [2023-03-06 15:36:14,426][04272] Updated weights for policy 0, policy_version 50800 (0.0006) [2023-03-06 15:36:15,237][04272] Updated weights for policy 0, policy_version 50810 (0.0006) [2023-03-06 15:36:16,038][04272] Updated weights for policy 0, policy_version 50820 (0.0006) [2023-03-06 15:36:16,865][04272] Updated weights for policy 0, policy_version 50830 (0.0006) [2023-03-06 15:36:17,679][04272] Updated weights for policy 0, policy_version 50840 (0.0007) [2023-03-06 15:36:18,473][04272] Updated weights for policy 0, policy_version 50850 (0.0006) [2023-03-06 15:36:18,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12646.4, 300 sec: 12635.1). Total num frames: 52075520. Throughput: 0: 12644.4. Samples: 52067918. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:36:18,951][03942] Avg episode reward: [(0, '1288.449')] [2023-03-06 15:36:19,309][04272] Updated weights for policy 0, policy_version 50860 (0.0007) [2023-03-06 15:36:20,128][04272] Updated weights for policy 0, policy_version 50870 (0.0007) [2023-03-06 15:36:20,938][04272] Updated weights for policy 0, policy_version 50880 (0.0007) [2023-03-06 15:36:21,766][04272] Updated weights for policy 0, policy_version 50890 (0.0006) [2023-03-06 15:36:22,569][04272] Updated weights for policy 0, policy_version 50900 (0.0006) [2023-03-06 15:36:23,365][04272] Updated weights for policy 0, policy_version 50910 (0.0007) [2023-03-06 15:36:23,941][03942] Fps is (10 sec: 12492.7, 60 sec: 12629.3, 300 sec: 12631.6). Total num frames: 52137984. Throughput: 0: 12636.8. Samples: 52105555. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:36:23,952][03942] Avg episode reward: [(0, '1306.146')] [2023-03-06 15:36:24,189][04272] Updated weights for policy 0, policy_version 50920 (0.0006) [2023-03-06 15:36:25,002][04272] Updated weights for policy 0, policy_version 50930 (0.0006) [2023-03-06 15:36:25,800][04272] Updated weights for policy 0, policy_version 50940 (0.0006) [2023-03-06 15:36:26,616][04272] Updated weights for policy 0, policy_version 50950 (0.0006) [2023-03-06 15:36:27,423][04272] Updated weights for policy 0, policy_version 50960 (0.0006) [2023-03-06 15:36:28,241][04272] Updated weights for policy 0, policy_version 50970 (0.0006) [2023-03-06 15:36:28,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12629.4, 300 sec: 12635.1). Total num frames: 52201472. Throughput: 0: 12625.8. Samples: 52181154. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:36:28,951][03942] Avg episode reward: [(0, '1317.369')] [2023-03-06 15:36:29,039][04272] Updated weights for policy 0, policy_version 50980 (0.0006) [2023-03-06 15:36:29,854][04272] Updated weights for policy 0, policy_version 50990 (0.0007) [2023-03-06 15:36:30,665][04272] Updated weights for policy 0, policy_version 51000 (0.0006) [2023-03-06 15:36:31,487][04272] Updated weights for policy 0, policy_version 51010 (0.0007) [2023-03-06 15:36:32,295][04272] Updated weights for policy 0, policy_version 51020 (0.0007) [2023-03-06 15:36:33,095][04272] Updated weights for policy 0, policy_version 51030 (0.0007) [2023-03-06 15:36:33,899][04272] Updated weights for policy 0, policy_version 51040 (0.0006) [2023-03-06 15:36:33,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12638.6). Total num frames: 52264960. Throughput: 0: 12628.8. Samples: 52256981. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:36:33,952][03942] Avg episode reward: [(0, '1285.071')] [2023-03-06 15:36:34,732][04272] Updated weights for policy 0, policy_version 51050 (0.0006) [2023-03-06 15:36:35,530][04272] Updated weights for policy 0, policy_version 51060 (0.0007) [2023-03-06 15:36:36,346][04272] Updated weights for policy 0, policy_version 51070 (0.0007) [2023-03-06 15:36:37,171][04272] Updated weights for policy 0, policy_version 51080 (0.0006) [2023-03-06 15:36:37,972][04272] Updated weights for policy 0, policy_version 51090 (0.0007) [2023-03-06 15:36:38,768][04272] Updated weights for policy 0, policy_version 51100 (0.0007) [2023-03-06 15:36:38,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12646.4, 300 sec: 12638.6). Total num frames: 52328448. Throughput: 0: 12627.5. Samples: 52294893. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:36:38,941][03942] Avg episode reward: [(0, '1266.540')] [2023-03-06 15:36:39,591][04272] Updated weights for policy 0, policy_version 51110 (0.0007) [2023-03-06 15:36:40,403][04272] Updated weights for policy 0, policy_version 51120 (0.0006) [2023-03-06 15:36:41,204][04272] Updated weights for policy 0, policy_version 51130 (0.0006) [2023-03-06 15:36:42,028][04272] Updated weights for policy 0, policy_version 51140 (0.0006) [2023-03-06 15:36:42,831][04272] Updated weights for policy 0, policy_version 51150 (0.0006) [2023-03-06 15:36:43,645][04272] Updated weights for policy 0, policy_version 51160 (0.0006) [2023-03-06 15:36:43,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12638.6). Total num frames: 52390912. Throughput: 0: 12624.6. Samples: 52370653. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:36:43,952][03942] Avg episode reward: [(0, '1323.863')] [2023-03-06 15:36:44,450][04272] Updated weights for policy 0, policy_version 51170 (0.0006) [2023-03-06 15:36:45,269][04272] Updated weights for policy 0, policy_version 51180 (0.0006) [2023-03-06 15:36:46,063][04272] Updated weights for policy 0, policy_version 51190 (0.0006) [2023-03-06 15:36:46,895][04272] Updated weights for policy 0, policy_version 51200 (0.0006) [2023-03-06 15:36:47,704][04272] Updated weights for policy 0, policy_version 51210 (0.0007) [2023-03-06 15:36:48,502][04272] Updated weights for policy 0, policy_version 51220 (0.0007) [2023-03-06 15:36:48,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12638.6). Total num frames: 52454400. Throughput: 0: 12624.6. Samples: 52446330. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:36:48,951][03942] Avg episode reward: [(0, '1251.654')] [2023-03-06 15:36:49,317][04272] Updated weights for policy 0, policy_version 51230 (0.0007) [2023-03-06 15:36:50,137][04272] Updated weights for policy 0, policy_version 51240 (0.0007) [2023-03-06 15:36:50,944][04272] Updated weights for policy 0, policy_version 51250 (0.0007) [2023-03-06 15:36:51,758][04272] Updated weights for policy 0, policy_version 51260 (0.0006) [2023-03-06 15:36:52,565][04272] Updated weights for policy 0, policy_version 51270 (0.0006) [2023-03-06 15:36:53,376][04272] Updated weights for policy 0, policy_version 51280 (0.0006) [2023-03-06 15:36:53,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12629.3, 300 sec: 12642.1). Total num frames: 52517888. Throughput: 0: 12618.7. Samples: 52484257. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:36:53,952][03942] Avg episode reward: [(0, '1333.853')] [2023-03-06 15:36:54,189][04272] Updated weights for policy 0, policy_version 51290 (0.0007) [2023-03-06 15:36:55,010][04272] Updated weights for policy 0, policy_version 51300 (0.0006) [2023-03-06 15:36:55,835][04272] Updated weights for policy 0, policy_version 51310 (0.0006) [2023-03-06 15:36:56,638][04272] Updated weights for policy 0, policy_version 51320 (0.0007) [2023-03-06 15:36:57,461][04272] Updated weights for policy 0, policy_version 51330 (0.0006) [2023-03-06 15:36:58,259][04272] Updated weights for policy 0, policy_version 51340 (0.0006) [2023-03-06 15:36:58,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12638.6). Total num frames: 52580352. Throughput: 0: 12611.5. Samples: 52559707. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:36:58,952][03942] Avg episode reward: [(0, '1193.613')] [2023-03-06 15:36:59,072][04272] Updated weights for policy 0, policy_version 51350 (0.0007) [2023-03-06 15:36:59,877][04272] Updated weights for policy 0, policy_version 51360 (0.0007) [2023-03-06 15:37:00,713][04272] Updated weights for policy 0, policy_version 51370 (0.0006) [2023-03-06 15:37:01,512][04272] Updated weights for policy 0, policy_version 51380 (0.0006) [2023-03-06 15:37:02,343][04272] Updated weights for policy 0, policy_version 51390 (0.0006) [2023-03-06 15:37:03,154][04272] Updated weights for policy 0, policy_version 51400 (0.0007) [2023-03-06 15:37:03,940][03942] Fps is (10 sec: 12492.8, 60 sec: 12612.3, 300 sec: 12638.6). Total num frames: 52642816. Throughput: 0: 12607.3. Samples: 52635244. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:37:03,951][03942] Avg episode reward: [(0, '1306.110')] [2023-03-06 15:37:03,954][04272] Updated weights for policy 0, policy_version 51410 (0.0007) [2023-03-06 15:37:04,772][04272] Updated weights for policy 0, policy_version 51420 (0.0007) [2023-03-06 15:37:05,574][04272] Updated weights for policy 0, policy_version 51430 (0.0006) [2023-03-06 15:37:06,378][04272] Updated weights for policy 0, policy_version 51440 (0.0007) [2023-03-06 15:37:07,196][04272] Updated weights for policy 0, policy_version 51450 (0.0007) [2023-03-06 15:37:08,001][04272] Updated weights for policy 0, policy_version 51460 (0.0006) [2023-03-06 15:37:08,819][04272] Updated weights for policy 0, policy_version 51470 (0.0006) [2023-03-06 15:37:08,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.2, 300 sec: 12638.6). Total num frames: 52706304. Throughput: 0: 12615.1. Samples: 52673236. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:37:08,952][03942] Avg episode reward: [(0, '1283.533')] [2023-03-06 15:37:08,955][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000051471_52706304.pth... [2023-03-06 15:37:08,986][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000048508_49672192.pth [2023-03-06 15:37:09,631][04272] Updated weights for policy 0, policy_version 51480 (0.0006) [2023-03-06 15:37:10,448][04272] Updated weights for policy 0, policy_version 51490 (0.0006) [2023-03-06 15:37:11,250][04272] Updated weights for policy 0, policy_version 51500 (0.0006) [2023-03-06 15:37:11,403][04221] KL-divergence is very high: 170.9823 [2023-03-06 15:37:12,049][04272] Updated weights for policy 0, policy_version 51510 (0.0006) [2023-03-06 15:37:12,203][04221] KL-divergence is very high: 325.1392 [2023-03-06 15:37:12,868][04221] KL-divergence is very high: 789.4005 [2023-03-06 15:37:12,874][04272] Updated weights for policy 0, policy_version 51520 (0.0006) [2023-03-06 15:37:13,682][04272] Updated weights for policy 0, policy_version 51530 (0.0006) [2023-03-06 15:37:13,746][04221] KL-divergence is very high: 686.3275 [2023-03-06 15:37:13,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12612.2, 300 sec: 12638.6). Total num frames: 52769792. Throughput: 0: 12623.7. Samples: 52749221. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:37:13,952][03942] Avg episode reward: [(0, '1230.305')] [2023-03-06 15:37:14,317][04221] KL-divergence is very high: 4606.1201 [2023-03-06 15:37:14,486][04272] Updated weights for policy 0, policy_version 51540 (0.0007) [2023-03-06 15:37:14,566][04221] KL-divergence is very high: 400.0823 [2023-03-06 15:37:14,651][04221] KL-divergence is very high: 8012.4722 [2023-03-06 15:37:15,051][04221] KL-divergence is very high: 15960.6240 [2023-03-06 15:37:15,211][04221] KL-divergence is very high: 2214.4580 [2023-03-06 15:37:15,315][04272] Updated weights for policy 0, policy_version 51550 (0.0006) [2023-03-06 15:37:15,538][04221] KL-divergence is very high: 235.3206 [2023-03-06 15:37:15,697][04221] KL-divergence is very high: 3599.5437 [2023-03-06 15:37:15,865][04221] KL-divergence is very high: 4531.2529 [2023-03-06 15:37:16,019][04221] KL-divergence is very high: 3457.8428 [2023-03-06 15:37:16,117][04272] Updated weights for policy 0, policy_version 51560 (0.0007) [2023-03-06 15:37:16,177][04221] KL-divergence is very high: 19881.4414 [2023-03-06 15:37:16,349][04221] KL-divergence is very high: 4483.4756 [2023-03-06 15:37:16,509][04221] KL-divergence is very high: 4477.4502 [2023-03-06 15:37:16,666][04221] KL-divergence is very high: 907.6780 [2023-03-06 15:37:16,822][04221] KL-divergence is very high: 375.6542 [2023-03-06 15:37:16,912][04272] Updated weights for policy 0, policy_version 51570 (0.0006) [2023-03-06 15:37:16,988][04221] KL-divergence is very high: 126.1902 [2023-03-06 15:37:17,079][04221] KL-divergence is very high: 212.3690 [2023-03-06 15:37:17,323][04221] KL-divergence is very high: 1617.7903 [2023-03-06 15:37:17,473][04221] KL-divergence is very high: 4436.3867 [2023-03-06 15:37:17,724][04221] KL-divergence is very high: 1215.6949 [2023-03-06 15:37:17,732][04272] Updated weights for policy 0, policy_version 51580 (0.0006) [2023-03-06 15:37:18,542][04272] Updated weights for policy 0, 
policy_version 51590 (0.0006) [2023-03-06 15:37:18,940][03942] Fps is (10 sec: 12595.4, 60 sec: 12612.3, 300 sec: 12638.6). Total num frames: 52832256. Throughput: 0: 12619.1. Samples: 52824839. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:37:18,951][03942] Avg episode reward: [(0, '1084.160')] [2023-03-06 15:37:19,345][04272] Updated weights for policy 0, policy_version 51600 (0.0007) [2023-03-06 15:37:20,179][04272] Updated weights for policy 0, policy_version 51610 (0.0007) [2023-03-06 15:37:20,969][04272] Updated weights for policy 0, policy_version 51620 (0.0006) [2023-03-06 15:37:21,777][04272] Updated weights for policy 0, policy_version 51630 (0.0006) [2023-03-06 15:37:22,594][04272] Updated weights for policy 0, policy_version 51640 (0.0006) [2023-03-06 15:37:23,253][04221] KL-divergence is very high: 106.2435 [2023-03-06 15:37:23,417][04272] Updated weights for policy 0, policy_version 51650 (0.0006) [2023-03-06 15:37:23,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12638.6). Total num frames: 52895744. Throughput: 0: 12618.7. Samples: 52862736. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:37:23,941][03942] Avg episode reward: [(0, '1108.002')] [2023-03-06 15:37:24,208][04272] Updated weights for policy 0, policy_version 51660 (0.0007) [2023-03-06 15:37:24,841][04221] KL-divergence is very high: 1222.3151 [2023-03-06 15:37:25,019][04221] KL-divergence is very high: 181.9583 [2023-03-06 15:37:25,027][04272] Updated weights for policy 0, policy_version 51670 (0.0006) [2023-03-06 15:37:25,097][04221] KL-divergence is very high: 412.7121 [2023-03-06 15:37:25,187][04221] KL-divergence is very high: 3934.2515 [2023-03-06 15:37:25,260][04221] KL-divergence is very high: 180.4308 [2023-03-06 15:37:25,414][04221] KL-divergence is very high: 7933.7544 [2023-03-06 15:37:25,810][04272] Updated weights for policy 0, policy_version 51680 (0.0006) [2023-03-06 15:37:26,615][04272] Updated weights for policy 0, policy_version 51690 (0.0006) [2023-03-06 15:37:26,860][04221] KL-divergence is very high: 728.5692 [2023-03-06 15:37:27,091][04221] KL-divergence is very high: 278.0139 [2023-03-06 15:37:27,260][04221] KL-divergence is very high: 265.1059 [2023-03-06 15:37:27,433][04272] Updated weights for policy 0, policy_version 51700 (0.0007) [2023-03-06 15:37:27,912][04221] KL-divergence is very high: 418.8973 [2023-03-06 15:37:28,235][04272] Updated weights for policy 0, policy_version 51710 (0.0007) [2023-03-06 15:37:28,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12638.6). Total num frames: 52959232. Throughput: 0: 12628.8. Samples: 52938947. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:37:28,941][03942] Avg episode reward: [(0, '1004.044')] [2023-03-06 15:37:29,044][04272] Updated weights for policy 0, policy_version 51720 (0.0007) [2023-03-06 15:37:29,853][04221] KL-divergence is very high: 14502.2861 [2023-03-06 15:37:29,860][04272] Updated weights for policy 0, policy_version 51730 (0.0007) [2023-03-06 15:37:29,915][04221] KL-divergence is very high: 299.1747 [2023-03-06 15:37:30,005][04221] KL-divergence is very high: 1098.3274 [2023-03-06 15:37:30,413][04221] KL-divergence is very high: 1505.4399 [2023-03-06 15:37:30,567][04221] KL-divergence is very high: 352.7566 [2023-03-06 15:37:30,687][04272] Updated weights for policy 0, policy_version 51740 (0.0006) [2023-03-06 15:37:31,478][04272] Updated weights for policy 0, policy_version 51750 (0.0007) [2023-03-06 15:37:31,785][04221] KL-divergence is very high: 570.8815 [2023-03-06 15:37:32,024][04221] KL-divergence is very high: 1019.3687 [2023-03-06 15:37:32,287][04272] Updated weights for policy 0, policy_version 51760 (0.0006) [2023-03-06 15:37:32,444][04221] KL-divergence is very high: 350.9006 [2023-03-06 15:37:33,079][04272] Updated weights for policy 0, policy_version 51770 (0.0007) [2023-03-06 15:37:33,887][04221] KL-divergence is very high: 4709.4370 [2023-03-06 15:37:33,895][04272] Updated weights for policy 0, policy_version 51780 (0.0007) [2023-03-06 15:37:33,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12642.1). Total num frames: 53022720. Throughput: 0: 12634.5. Samples: 53014882. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:37:33,941][03942] Avg episode reward: [(0, '958.427')] [2023-03-06 15:37:34,698][04272] Updated weights for policy 0, policy_version 51790 (0.0006) [2023-03-06 15:37:35,523][04272] Updated weights for policy 0, policy_version 51800 (0.0007) [2023-03-06 15:37:36,317][04272] Updated weights for policy 0, policy_version 51810 (0.0006) [2023-03-06 15:37:37,136][04272] Updated weights for policy 0, policy_version 51820 (0.0007) [2023-03-06 15:37:37,945][04272] Updated weights for policy 0, policy_version 51830 (0.0006) [2023-03-06 15:37:38,757][04272] Updated weights for policy 0, policy_version 51840 (0.0007) [2023-03-06 15:37:38,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12629.3, 300 sec: 12642.1). Total num frames: 53086208. Throughput: 0: 12636.1. Samples: 53052884. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:37:38,941][03942] Avg episode reward: [(0, '966.824')] [2023-03-06 15:37:39,546][04272] Updated weights for policy 0, policy_version 51850 (0.0006) [2023-03-06 15:37:40,039][04221] KL-divergence is very high: 1677.7915 [2023-03-06 15:37:40,364][04272] Updated weights for policy 0, policy_version 51860 (0.0006) [2023-03-06 15:37:40,860][04221] KL-divergence is very high: 1324.6774 [2023-03-06 15:37:40,937][04221] KL-divergence is very high: 768.9167 [2023-03-06 15:37:41,178][04272] Updated weights for policy 0, policy_version 51870 (0.0007) [2023-03-06 15:37:41,990][04272] Updated weights for policy 0, policy_version 51880 (0.0007) [2023-03-06 15:37:42,308][04221] KL-divergence is very high: 737.4923 [2023-03-06 15:37:42,380][04221] KL-divergence is very high: 3919.0408 [2023-03-06 15:37:42,798][04272] Updated weights for policy 0, policy_version 51890 (0.0006) [2023-03-06 15:37:43,207][04221] KL-divergence is very high: 868.4467 [2023-03-06 15:37:43,287][04221] KL-divergence is very high: 1858.1525 [2023-03-06 15:37:43,365][04221] KL-divergence is very high: 5785.8018 [2023-03-06 
15:37:43,605][04272] Updated weights for policy 0, policy_version 51900 (0.0006) [2023-03-06 15:37:43,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12629.3, 300 sec: 12638.6). Total num frames: 53148672. Throughput: 0: 12645.5. Samples: 53128754. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:37:43,941][03942] Avg episode reward: [(0, '1154.099')] [2023-03-06 15:37:44,412][04272] Updated weights for policy 0, policy_version 51910 (0.0006) [2023-03-06 15:37:45,226][04272] Updated weights for policy 0, policy_version 51920 (0.0007) [2023-03-06 15:37:46,029][04272] Updated weights for policy 0, policy_version 51930 (0.0005) [2023-03-06 15:37:46,857][04272] Updated weights for policy 0, policy_version 51940 (0.0007) [2023-03-06 15:37:47,166][04221] KL-divergence is very high: 259.8060 [2023-03-06 15:37:47,647][04272] Updated weights for policy 0, policy_version 51950 (0.0006) [2023-03-06 15:37:48,314][04221] KL-divergence is very high: 1073.4976 [2023-03-06 15:37:48,391][04221] KL-divergence is very high: 1326.7649 [2023-03-06 15:37:48,487][04272] Updated weights for policy 0, policy_version 51960 (0.0007) [2023-03-06 15:37:48,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12629.3, 300 sec: 12642.1). Total num frames: 53212160. Throughput: 0: 12650.8. Samples: 53204528. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:37:48,941][03942] Avg episode reward: [(0, '1212.805')] [2023-03-06 15:37:49,275][04272] Updated weights for policy 0, policy_version 51970 (0.0006) [2023-03-06 15:37:50,085][04272] Updated weights for policy 0, policy_version 51980 (0.0007) [2023-03-06 15:37:50,912][04272] Updated weights for policy 0, policy_version 51990 (0.0006) [2023-03-06 15:37:51,713][04272] Updated weights for policy 0, policy_version 52000 (0.0006) [2023-03-06 15:37:52,513][04272] Updated weights for policy 0, policy_version 52010 (0.0007) [2023-03-06 15:37:53,342][04272] Updated weights for policy 0, policy_version 52020 (0.0006) [2023-03-06 15:37:53,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12629.3, 300 sec: 12642.1). Total num frames: 53275648. Throughput: 0: 12650.1. Samples: 53242488. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:37:53,941][03942] Avg episode reward: [(0, '1203.726')] [2023-03-06 15:37:54,128][04272] Updated weights for policy 0, policy_version 52030 (0.0006) [2023-03-06 15:37:54,947][04272] Updated weights for policy 0, policy_version 52040 (0.0007) [2023-03-06 15:37:55,753][04272] Updated weights for policy 0, policy_version 52050 (0.0006) [2023-03-06 15:37:56,538][04272] Updated weights for policy 0, policy_version 52060 (0.0006) [2023-03-06 15:37:57,377][04272] Updated weights for policy 0, policy_version 52070 (0.0006) [2023-03-06 15:37:58,190][04272] Updated weights for policy 0, policy_version 52080 (0.0006) [2023-03-06 15:37:58,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12646.4, 300 sec: 12645.5). Total num frames: 53339136. Throughput: 0: 12650.6. Samples: 53318498. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:37:58,941][03942] Avg episode reward: [(0, '1118.435')] [2023-03-06 15:37:58,994][04272] Updated weights for policy 0, policy_version 52090 (0.0007) [2023-03-06 15:37:59,805][04272] Updated weights for policy 0, policy_version 52100 (0.0006) [2023-03-06 15:38:00,613][04272] Updated weights for policy 0, policy_version 52110 (0.0006) [2023-03-06 15:38:01,406][04272] Updated weights for policy 0, policy_version 52120 (0.0006) [2023-03-06 15:38:02,222][04272] Updated weights for policy 0, policy_version 52130 (0.0006) [2023-03-06 15:38:03,050][04272] Updated weights for policy 0, policy_version 52140 (0.0007) [2023-03-06 15:38:03,842][04272] Updated weights for policy 0, policy_version 52150 (0.0007) [2023-03-06 15:38:03,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12663.5, 300 sec: 12645.5). Total num frames: 53402624. Throughput: 0: 12656.1. Samples: 53394363. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:38:03,941][03942] Avg episode reward: [(0, '1081.575')] [2023-03-06 15:38:04,670][04272] Updated weights for policy 0, policy_version 52160 (0.0006) [2023-03-06 15:38:05,476][04272] Updated weights for policy 0, policy_version 52170 (0.0007) [2023-03-06 15:38:06,261][04272] Updated weights for policy 0, policy_version 52180 (0.0006) [2023-03-06 15:38:07,086][04272] Updated weights for policy 0, policy_version 52190 (0.0006) [2023-03-06 15:38:07,886][04272] Updated weights for policy 0, policy_version 52200 (0.0006) [2023-03-06 15:38:08,710][04272] Updated weights for policy 0, policy_version 52210 (0.0006) [2023-03-06 15:38:08,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12646.4, 300 sec: 12642.1). Total num frames: 53465088. Throughput: 0: 12658.2. Samples: 53432353. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:38:08,941][03942] Avg episode reward: [(0, '1119.613')] [2023-03-06 15:38:09,510][04272] Updated weights for policy 0, policy_version 52220 (0.0006) [2023-03-06 15:38:10,328][04272] Updated weights for policy 0, policy_version 52230 (0.0006) [2023-03-06 15:38:11,153][04272] Updated weights for policy 0, policy_version 52240 (0.0007) [2023-03-06 15:38:11,960][04272] Updated weights for policy 0, policy_version 52250 (0.0006) [2023-03-06 15:38:12,779][04272] Updated weights for policy 0, policy_version 52260 (0.0007) [2023-03-06 15:38:13,596][04272] Updated weights for policy 0, policy_version 52270 (0.0006) [2023-03-06 15:38:13,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12646.4, 300 sec: 12645.5). Total num frames: 53528576. Throughput: 0: 12645.4. Samples: 53507988. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:38:13,941][03942] Avg episode reward: [(0, '1135.300')] [2023-03-06 15:38:14,399][04272] Updated weights for policy 0, policy_version 52280 (0.0007) [2023-03-06 15:38:15,219][04272] Updated weights for policy 0, policy_version 52290 (0.0006) [2023-03-06 15:38:16,036][04272] Updated weights for policy 0, policy_version 52300 (0.0006) [2023-03-06 15:38:16,849][04272] Updated weights for policy 0, policy_version 52310 (0.0006) [2023-03-06 15:38:17,656][04272] Updated weights for policy 0, policy_version 52320 (0.0007) [2023-03-06 15:38:18,473][04272] Updated weights for policy 0, policy_version 52330 (0.0006) [2023-03-06 15:38:18,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12646.4, 300 sec: 12642.1). Total num frames: 53591040. Throughput: 0: 12635.3. Samples: 53583473. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:38:18,941][03942] Avg episode reward: [(0, '1317.957')] [2023-03-06 15:38:19,269][04272] Updated weights for policy 0, policy_version 52340 (0.0006) [2023-03-06 15:38:20,075][04272] Updated weights for policy 0, policy_version 52350 (0.0007) [2023-03-06 15:38:20,892][04272] Updated weights for policy 0, policy_version 52360 (0.0006) [2023-03-06 15:38:21,700][04272] Updated weights for policy 0, policy_version 52370 (0.0006) [2023-03-06 15:38:22,518][04272] Updated weights for policy 0, policy_version 52380 (0.0006) [2023-03-06 15:38:23,328][04272] Updated weights for policy 0, policy_version 52390 (0.0006) [2023-03-06 15:38:23,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12646.4, 300 sec: 12642.1). Total num frames: 53654528. Throughput: 0: 12634.6. Samples: 53621442. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:38:23,941][03942] Avg episode reward: [(0, '1197.019')] [2023-03-06 15:38:24,133][04272] Updated weights for policy 0, policy_version 52400 (0.0007) [2023-03-06 15:38:24,223][04221] KL-divergence is very high: 138.8728 [2023-03-06 15:38:24,932][04272] Updated weights for policy 0, policy_version 52410 (0.0006) [2023-03-06 15:38:25,758][04272] Updated weights for policy 0, policy_version 52420 (0.0006) [2023-03-06 15:38:26,575][04272] Updated weights for policy 0, policy_version 52430 (0.0006) [2023-03-06 15:38:27,395][04272] Updated weights for policy 0, policy_version 52440 (0.0007) [2023-03-06 15:38:28,219][04272] Updated weights for policy 0, policy_version 52450 (0.0007) [2023-03-06 15:38:28,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12629.3, 300 sec: 12642.1). Total num frames: 53716992. Throughput: 0: 12627.3. Samples: 53696983. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:38:28,941][03942] Avg episode reward: [(0, '1195.626')] [2023-03-06 15:38:29,026][04272] Updated weights for policy 0, policy_version 52460 (0.0007) [2023-03-06 15:38:29,846][04272] Updated weights for policy 0, policy_version 52470 (0.0006) [2023-03-06 15:38:30,664][04272] Updated weights for policy 0, policy_version 52480 (0.0006) [2023-03-06 15:38:31,458][04272] Updated weights for policy 0, policy_version 52490 (0.0006) [2023-03-06 15:38:32,278][04272] Updated weights for policy 0, policy_version 52500 (0.0006) [2023-03-06 15:38:33,079][04272] Updated weights for policy 0, policy_version 52510 (0.0006) [2023-03-06 15:38:33,887][04272] Updated weights for policy 0, policy_version 52520 (0.0007) [2023-03-06 15:38:33,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12642.1). Total num frames: 53780480. Throughput: 0: 12626.9. Samples: 53772737. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:38:33,941][03942] Avg episode reward: [(0, '1111.242')] [2023-03-06 15:38:34,694][04272] Updated weights for policy 0, policy_version 52530 (0.0006) [2023-03-06 15:38:35,495][04272] Updated weights for policy 0, policy_version 52540 (0.0007) [2023-03-06 15:38:36,341][04272] Updated weights for policy 0, policy_version 52550 (0.0006) [2023-03-06 15:38:37,118][04272] Updated weights for policy 0, policy_version 52560 (0.0006) [2023-03-06 15:38:37,936][04272] Updated weights for policy 0, policy_version 52570 (0.0006) [2023-03-06 15:38:38,741][04272] Updated weights for policy 0, policy_version 52580 (0.0006) [2023-03-06 15:38:38,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12629.3, 300 sec: 12642.1). Total num frames: 53843968. Throughput: 0: 12626.1. Samples: 53810664. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:38:38,941][03942] Avg episode reward: [(0, '1166.147')] [2023-03-06 15:38:39,537][04272] Updated weights for policy 0, policy_version 52590 (0.0006) [2023-03-06 15:38:40,339][04272] Updated weights for policy 0, policy_version 52600 (0.0006) [2023-03-06 15:38:41,171][04272] Updated weights for policy 0, policy_version 52610 (0.0007) [2023-03-06 15:38:41,977][04272] Updated weights for policy 0, policy_version 52620 (0.0006) [2023-03-06 15:38:42,465][04221] KL-divergence is very high: 590.6167 [2023-03-06 15:38:42,627][04221] KL-divergence is very high: 223.6291 [2023-03-06 15:38:42,791][04272] Updated weights for policy 0, policy_version 52630 (0.0006) [2023-03-06 15:38:43,355][04221] KL-divergence is very high: 406.0641 [2023-03-06 15:38:43,610][04272] Updated weights for policy 0, policy_version 52640 (0.0007) [2023-03-06 15:38:43,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12646.4, 300 sec: 12642.1). Total num frames: 53907456. Throughput: 0: 12624.7. Samples: 53886607. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:38:43,941][03942] Avg episode reward: [(0, '840.034')] [2023-03-06 15:38:44,068][04221] KL-divergence is very high: 131.7888 [2023-03-06 15:38:44,406][04221] KL-divergence is very high: 218.9465 [2023-03-06 15:38:44,414][04272] Updated weights for policy 0, policy_version 52650 (0.0006) [2023-03-06 15:38:45,112][04221] KL-divergence is very high: 361.1465 [2023-03-06 15:38:45,213][04272] Updated weights for policy 0, policy_version 52660 (0.0006) [2023-03-06 15:38:45,281][04221] KL-divergence is very high: 148.6331 [2023-03-06 15:38:45,438][04221] KL-divergence is very high: 211.9220 [2023-03-06 15:38:45,610][04221] KL-divergence is very high: 129.0442 [2023-03-06 15:38:45,775][04221] KL-divergence is very high: 153.1218 [2023-03-06 15:38:46,026][04272] Updated weights for policy 0, policy_version 52670 (0.0006) [2023-03-06 15:38:46,837][04272] Updated weights for policy 0, policy_version 52680 (0.0007) [2023-03-06 15:38:47,648][04272] Updated weights for policy 0, policy_version 52690 (0.0006) [2023-03-06 15:38:48,456][04272] Updated weights for policy 0, policy_version 52700 (0.0006) [2023-03-06 15:38:48,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12638.6). Total num frames: 53969920. Throughput: 0: 12626.6. Samples: 53962560. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:38:48,941][03942] Avg episode reward: [(0, '971.284')] [2023-03-06 15:38:49,275][04272] Updated weights for policy 0, policy_version 52710 (0.0006) [2023-03-06 15:38:50,062][04272] Updated weights for policy 0, policy_version 52720 (0.0007) [2023-03-06 15:38:50,882][04272] Updated weights for policy 0, policy_version 52730 (0.0006) [2023-03-06 15:38:51,696][04272] Updated weights for policy 0, policy_version 52740 (0.0006) [2023-03-06 15:38:52,505][04272] Updated weights for policy 0, policy_version 52750 (0.0006) [2023-03-06 15:38:53,335][04272] Updated weights for policy 0, policy_version 52760 (0.0007) [2023-03-06 15:38:53,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12642.1). Total num frames: 54033408. Throughput: 0: 12624.3. Samples: 54000447. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:38:53,941][03942] Avg episode reward: [(0, '1149.775')] [2023-03-06 15:38:54,141][04272] Updated weights for policy 0, policy_version 52770 (0.0006) [2023-03-06 15:38:54,946][04272] Updated weights for policy 0, policy_version 52780 (0.0006) [2023-03-06 15:38:55,752][04272] Updated weights for policy 0, policy_version 52790 (0.0006) [2023-03-06 15:38:56,562][04272] Updated weights for policy 0, policy_version 52800 (0.0006) [2023-03-06 15:38:57,366][04272] Updated weights for policy 0, policy_version 52810 (0.0006) [2023-03-06 15:38:58,157][04272] Updated weights for policy 0, policy_version 52820 (0.0006) [2023-03-06 15:38:58,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12642.1). Total num frames: 54096896. Throughput: 0: 12631.6. Samples: 54076412. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:38:58,941][03942] Avg episode reward: [(0, '1168.100')] [2023-03-06 15:38:58,968][04272] Updated weights for policy 0, policy_version 52830 (0.0006) [2023-03-06 15:38:59,213][04221] KL-divergence is very high: 540.8591 [2023-03-06 15:38:59,532][04221] KL-divergence is very high: 1217.9807 [2023-03-06 15:38:59,769][04272] Updated weights for policy 0, policy_version 52840 (0.0006) [2023-03-06 15:39:00,605][04272] Updated weights for policy 0, policy_version 52850 (0.0006) [2023-03-06 15:39:01,424][04272] Updated weights for policy 0, policy_version 52860 (0.0007) [2023-03-06 15:39:02,209][04272] Updated weights for policy 0, policy_version 52870 (0.0006) [2023-03-06 15:39:02,695][04221] KL-divergence is very high: 675.2507 [2023-03-06 15:39:03,019][04272] Updated weights for policy 0, policy_version 52880 (0.0006) [2023-03-06 15:39:03,268][04221] KL-divergence is very high: 353.3527 [2023-03-06 15:39:03,834][04221] KL-divergence is very high: 244.0636 [2023-03-06 15:39:03,841][04272] Updated weights for policy 0, policy_version 52890 (0.0006) [2023-03-06 15:39:03,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12642.1). Total num frames: 54160384. Throughput: 0: 12639.3. Samples: 54152242. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:39:03,941][03942] Avg episode reward: [(0, '1027.441')] [2023-03-06 15:39:04,638][04221] KL-divergence is very high: 694.0128 [2023-03-06 15:39:04,645][04272] Updated weights for policy 0, policy_version 52900 (0.0007) [2023-03-06 15:39:05,444][04272] Updated weights for policy 0, policy_version 52910 (0.0006) [2023-03-06 15:39:06,273][04272] Updated weights for policy 0, policy_version 52920 (0.0006) [2023-03-06 15:39:07,085][04272] Updated weights for policy 0, policy_version 52930 (0.0006) [2023-03-06 15:39:07,900][04272] Updated weights for policy 0, policy_version 52940 (0.0006) [2023-03-06 15:39:08,722][04272] Updated weights for policy 0, policy_version 52950 (0.0007) [2023-03-06 15:39:08,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12629.3, 300 sec: 12642.1). Total num frames: 54222848. Throughput: 0: 12640.5. Samples: 54190266. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:39:08,941][03942] Avg episode reward: [(0, '1026.262')] [2023-03-06 15:39:08,944][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000052952_54222848.pth... 
[2023-03-06 15:39:08,974][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000049991_51190784.pth [2023-03-06 15:39:09,533][04272] Updated weights for policy 0, policy_version 52960 (0.0006) [2023-03-06 15:39:10,331][04272] Updated weights for policy 0, policy_version 52970 (0.0006) [2023-03-06 15:39:11,140][04272] Updated weights for policy 0, policy_version 52980 (0.0007) [2023-03-06 15:39:11,770][04221] KL-divergence is very high: 298.2114 [2023-03-06 15:39:11,941][04272] Updated weights for policy 0, policy_version 52990 (0.0007) [2023-03-06 15:39:12,766][04272] Updated weights for policy 0, policy_version 53000 (0.0006) [2023-03-06 15:39:13,152][04221] KL-divergence is very high: 125.8026 [2023-03-06 15:39:13,573][04221] KL-divergence is very high: 1044.2744 [2023-03-06 15:39:13,582][04272] Updated weights for policy 0, policy_version 53010 (0.0006) [2023-03-06 15:39:13,740][04221] KL-divergence is very high: 1060.1812 [2023-03-06 15:39:13,899][04221] KL-divergence is very high: 3444.7261 [2023-03-06 15:39:13,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12642.1). Total num frames: 54286336. Throughput: 0: 12645.3. Samples: 54266023. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:39:13,941][03942] Avg episode reward: [(0, '1085.288')] [2023-03-06 15:39:14,390][04272] Updated weights for policy 0, policy_version 53020 (0.0006) [2023-03-06 15:39:14,939][04221] KL-divergence is very high: 1406.2885 [2023-03-06 15:39:15,217][04272] Updated weights for policy 0, policy_version 53030 (0.0006) [2023-03-06 15:39:16,011][04272] Updated weights for policy 0, policy_version 53040 (0.0006) [2023-03-06 15:39:16,816][04272] Updated weights for policy 0, policy_version 53050 (0.0007) [2023-03-06 15:39:17,621][04272] Updated weights for policy 0, policy_version 53060 (0.0007) [2023-03-06 15:39:18,422][04272] Updated weights for policy 0, policy_version 53070 (0.0006) [2023-03-06 15:39:18,941][03942] Fps is (10 sec: 12697.7, 60 sec: 12646.4, 300 sec: 12642.1). Total num frames: 54349824. Throughput: 0: 12646.8. Samples: 54341841. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:39:18,941][03942] Avg episode reward: [(0, '1061.744')] [2023-03-06 15:39:18,981][04221] KL-divergence is very high: 159.7250 [2023-03-06 15:39:19,221][04272] Updated weights for policy 0, policy_version 53080 (0.0006) [2023-03-06 15:39:19,297][04221] KL-divergence is very high: 1476.3428 [2023-03-06 15:39:20,030][04272] Updated weights for policy 0, policy_version 53090 (0.0006) [2023-03-06 15:39:20,178][04221] KL-divergence is very high: 424.1071 [2023-03-06 15:39:20,832][04272] Updated weights for policy 0, policy_version 53100 (0.0006) [2023-03-06 15:39:21,654][04272] Updated weights for policy 0, policy_version 53110 (0.0006) [2023-03-06 15:39:22,439][04272] Updated weights for policy 0, policy_version 53120 (0.0006) [2023-03-06 15:39:23,259][04272] Updated weights for policy 0, policy_version 53130 (0.0006) [2023-03-06 15:39:23,411][04221] KL-divergence is very high: 12657.5566 [2023-03-06 15:39:23,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12646.4, 300 sec: 12645.5). Total num frames: 54413312. 
Throughput: 0: 12651.8. Samples: 54379993. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:39:23,941][03942] Avg episode reward: [(0, '862.932')] [2023-03-06 15:39:24,070][04272] Updated weights for policy 0, policy_version 53140 (0.0006) [2023-03-06 15:39:24,850][04272] Updated weights for policy 0, policy_version 53150 (0.0006) [2023-03-06 15:39:25,678][04272] Updated weights for policy 0, policy_version 53160 (0.0007) [2023-03-06 15:39:26,484][04272] Updated weights for policy 0, policy_version 53170 (0.0007) [2023-03-06 15:39:27,286][04272] Updated weights for policy 0, policy_version 53180 (0.0006) [2023-03-06 15:39:28,119][04272] Updated weights for policy 0, policy_version 53190 (0.0006) [2023-03-06 15:39:28,930][04272] Updated weights for policy 0, policy_version 53200 (0.0006) [2023-03-06 15:39:28,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12663.5, 300 sec: 12645.5). Total num frames: 54476800. Throughput: 0: 12655.6. Samples: 54456109. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:39:28,941][03942] Avg episode reward: [(0, '946.317')] [2023-03-06 15:39:29,737][04272] Updated weights for policy 0, policy_version 53210 (0.0006) [2023-03-06 15:39:30,535][04272] Updated weights for policy 0, policy_version 53220 (0.0006) [2023-03-06 15:39:31,370][04272] Updated weights for policy 0, policy_version 53230 (0.0006) [2023-03-06 15:39:32,164][04272] Updated weights for policy 0, policy_version 53240 (0.0006) [2023-03-06 15:39:32,966][04272] Updated weights for policy 0, policy_version 53250 (0.0006) [2023-03-06 15:39:33,798][04272] Updated weights for policy 0, policy_version 53260 (0.0006) [2023-03-06 15:39:33,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12646.4, 300 sec: 12642.1). Total num frames: 54539264. Throughput: 0: 12650.9. Samples: 54531851. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:39:33,941][03942] Avg episode reward: [(0, '1173.773')] [2023-03-06 15:39:34,591][04272] Updated weights for policy 0, policy_version 53270 (0.0006) [2023-03-06 15:39:35,386][04272] Updated weights for policy 0, policy_version 53280 (0.0007) [2023-03-06 15:39:36,206][04272] Updated weights for policy 0, policy_version 53290 (0.0006) [2023-03-06 15:39:37,017][04272] Updated weights for policy 0, policy_version 53300 (0.0006) [2023-03-06 15:39:37,829][04272] Updated weights for policy 0, policy_version 53310 (0.0006) [2023-03-06 15:39:38,638][04272] Updated weights for policy 0, policy_version 53320 (0.0006) [2023-03-06 15:39:38,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12646.4, 300 sec: 12642.1). Total num frames: 54602752. Throughput: 0: 12656.6. Samples: 54569996. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:39:38,941][03942] Avg episode reward: [(0, '1093.489')] [2023-03-06 15:39:39,446][04272] Updated weights for policy 0, policy_version 53330 (0.0006) [2023-03-06 15:39:40,246][04272] Updated weights for policy 0, policy_version 53340 (0.0006) [2023-03-06 15:39:41,063][04272] Updated weights for policy 0, policy_version 53350 (0.0007) [2023-03-06 15:39:41,874][04272] Updated weights for policy 0, policy_version 53360 (0.0006) [2023-03-06 15:39:42,697][04272] Updated weights for policy 0, policy_version 53370 (0.0006) [2023-03-06 15:39:43,494][04272] Updated weights for policy 0, policy_version 53380 (0.0006) [2023-03-06 15:39:43,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12646.4, 300 sec: 12642.1). Total num frames: 54666240. Throughput: 0: 12653.5. Samples: 54645819. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:39:43,941][03942] Avg episode reward: [(0, '1017.502')] [2023-03-06 15:39:44,308][04272] Updated weights for policy 0, policy_version 53390 (0.0006) [2023-03-06 15:39:45,123][04272] Updated weights for policy 0, policy_version 53400 (0.0007) [2023-03-06 15:39:45,929][04272] Updated weights for policy 0, policy_version 53410 (0.0006) [2023-03-06 15:39:46,732][04272] Updated weights for policy 0, policy_version 53420 (0.0006) [2023-03-06 15:39:47,559][04272] Updated weights for policy 0, policy_version 53430 (0.0007) [2023-03-06 15:39:48,368][04272] Updated weights for policy 0, policy_version 53440 (0.0007) [2023-03-06 15:39:48,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12663.5, 300 sec: 12642.1). Total num frames: 54729728. Throughput: 0: 12648.6. Samples: 54721430. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:39:48,941][03942] Avg episode reward: [(0, '1068.416')] [2023-03-06 15:39:49,161][04272] Updated weights for policy 0, policy_version 53450 (0.0007) [2023-03-06 15:39:49,966][04272] Updated weights for policy 0, policy_version 53460 (0.0006) [2023-03-06 15:39:50,799][04272] Updated weights for policy 0, policy_version 53470 (0.0006) [2023-03-06 15:39:51,592][04272] Updated weights for policy 0, policy_version 53480 (0.0007) [2023-03-06 15:39:52,399][04272] Updated weights for policy 0, policy_version 53490 (0.0006) [2023-03-06 15:39:53,210][04272] Updated weights for policy 0, policy_version 53500 (0.0006) [2023-03-06 15:39:53,454][04221] KL-divergence is very high: 1231397.5000 [2023-03-06 15:39:53,940][03942] Fps is (10 sec: 12697.8, 60 sec: 12663.5, 300 sec: 12642.1). Total num frames: 54793216. Throughput: 0: 12649.0. Samples: 54759467. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:39:53,941][03942] Avg episode reward: [(0, '1090.566')] [2023-03-06 15:39:54,022][04272] Updated weights for policy 0, policy_version 53510 (0.0006) [2023-03-06 15:39:54,850][04272] Updated weights for policy 0, policy_version 53520 (0.0006) [2023-03-06 15:39:55,638][04272] Updated weights for policy 0, policy_version 53530 (0.0006) [2023-03-06 15:39:56,458][04272] Updated weights for policy 0, policy_version 53540 (0.0006) [2023-03-06 15:39:57,262][04272] Updated weights for policy 0, policy_version 53550 (0.0006) [2023-03-06 15:39:58,074][04272] Updated weights for policy 0, policy_version 53560 (0.0006) [2023-03-06 15:39:58,881][04272] Updated weights for policy 0, policy_version 53570 (0.0006) [2023-03-06 15:39:58,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12646.4, 300 sec: 12638.6). Total num frames: 54855680. Throughput: 0: 12651.7. Samples: 54835350. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:39:58,941][03942] Avg episode reward: [(0, '1209.063')] [2023-03-06 15:39:59,687][04272] Updated weights for policy 0, policy_version 53580 (0.0006) [2023-03-06 15:40:00,489][04272] Updated weights for policy 0, policy_version 53590 (0.0006) [2023-03-06 15:40:01,313][04272] Updated weights for policy 0, policy_version 53600 (0.0007) [2023-03-06 15:40:02,105][04272] Updated weights for policy 0, policy_version 53610 (0.0006) [2023-03-06 15:40:02,904][04272] Updated weights for policy 0, policy_version 53620 (0.0006) [2023-03-06 15:40:03,714][04272] Updated weights for policy 0, policy_version 53630 (0.0006) [2023-03-06 15:40:03,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12646.4, 300 sec: 12638.6). Total num frames: 54919168. Throughput: 0: 12663.0. Samples: 54911674. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:40:03,941][03942] Avg episode reward: [(0, '1158.106')] [2023-03-06 15:40:04,520][04272] Updated weights for policy 0, policy_version 53640 (0.0006) [2023-03-06 15:40:05,349][04272] Updated weights for policy 0, policy_version 53650 (0.0006) [2023-03-06 15:40:05,573][04221] KL-divergence is very high: 3541.1819 [2023-03-06 15:40:05,902][04221] KL-divergence is very high: 257.7433 [2023-03-06 15:40:06,066][04221] KL-divergence is very high: 204.3225 [2023-03-06 15:40:06,151][04272] Updated weights for policy 0, policy_version 53660 (0.0006) [2023-03-06 15:40:06,471][04221] KL-divergence is very high: 306.4073 [2023-03-06 15:40:06,639][04221] KL-divergence is very high: 136.0686 [2023-03-06 15:40:06,793][04221] KL-divergence is very high: 160.2368 [2023-03-06 15:40:06,961][04272] Updated weights for policy 0, policy_version 53670 (0.0006) [2023-03-06 15:40:07,603][04221] KL-divergence is very high: 482.8806 [2023-03-06 15:40:07,755][04221] KL-divergence is very high: 118.6264 [2023-03-06 15:40:07,763][04272] Updated weights for policy 0, policy_version 53680 (0.0006) [2023-03-06 15:40:08,579][04272] Updated weights for policy 0, policy_version 53690 (0.0006) [2023-03-06 15:40:08,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12663.5, 300 sec: 12642.1). Total num frames: 54982656. Throughput: 0: 12654.0. Samples: 54949424. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:40:08,941][03942] Avg episode reward: [(0, '1169.454')] [2023-03-06 15:40:09,388][04272] Updated weights for policy 0, policy_version 53700 (0.0007) [2023-03-06 15:40:10,183][04272] Updated weights for policy 0, policy_version 53710 (0.0006) [2023-03-06 15:40:10,977][04272] Updated weights for policy 0, policy_version 53720 (0.0006) [2023-03-06 15:40:11,798][04272] Updated weights for policy 0, policy_version 53730 (0.0006) [2023-03-06 15:40:12,619][04272] Updated weights for policy 0, policy_version 53740 (0.0007) [2023-03-06 15:40:13,424][04272] Updated weights for policy 0, policy_version 53750 (0.0006) [2023-03-06 15:40:13,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12663.5, 300 sec: 12642.1). Total num frames: 55046144. Throughput: 0: 12650.6. Samples: 55025385. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:40:13,941][03942] Avg episode reward: [(0, '1065.674')] [2023-03-06 15:40:14,253][04272] Updated weights for policy 0, policy_version 53760 (0.0007) [2023-03-06 15:40:15,061][04272] Updated weights for policy 0, policy_version 53770 (0.0006) [2023-03-06 15:40:15,875][04272] Updated weights for policy 0, policy_version 53780 (0.0006) [2023-03-06 15:40:16,685][04272] Updated weights for policy 0, policy_version 53790 (0.0007) [2023-03-06 15:40:17,491][04272] Updated weights for policy 0, policy_version 53800 (0.0007) [2023-03-06 15:40:18,304][04272] Updated weights for policy 0, policy_version 53810 (0.0007) [2023-03-06 15:40:18,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12646.4, 300 sec: 12638.6). Total num frames: 55108608. Throughput: 0: 12651.1. Samples: 55101148. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:40:18,941][03942] Avg episode reward: [(0, '1080.565')] [2023-03-06 15:40:19,114][04272] Updated weights for policy 0, policy_version 53820 (0.0006) [2023-03-06 15:40:19,908][04272] Updated weights for policy 0, policy_version 53830 (0.0006) [2023-03-06 15:40:20,730][04272] Updated weights for policy 0, policy_version 53840 (0.0006) [2023-03-06 15:40:21,557][04272] Updated weights for policy 0, policy_version 53850 (0.0006) [2023-03-06 15:40:22,366][04272] Updated weights for policy 0, policy_version 53860 (0.0007) [2023-03-06 15:40:23,159][04272] Updated weights for policy 0, policy_version 53870 (0.0006) [2023-03-06 15:40:23,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12646.4, 300 sec: 12638.6). Total num frames: 55172096. Throughput: 0: 12644.6. Samples: 55139003. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:40:23,941][03942] Avg episode reward: [(0, '1222.467')] [2023-03-06 15:40:23,977][04272] Updated weights for policy 0, policy_version 53880 (0.0007) [2023-03-06 15:40:24,789][04272] Updated weights for policy 0, policy_version 53890 (0.0006) [2023-03-06 15:40:25,601][04272] Updated weights for policy 0, policy_version 53900 (0.0006) [2023-03-06 15:40:26,404][04272] Updated weights for policy 0, policy_version 53910 (0.0006) [2023-03-06 15:40:27,218][04272] Updated weights for policy 0, policy_version 53920 (0.0006) [2023-03-06 15:40:28,024][04272] Updated weights for policy 0, policy_version 53930 (0.0006) [2023-03-06 15:40:28,819][04272] Updated weights for policy 0, policy_version 53940 (0.0006) [2023-03-06 15:40:28,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12646.4, 300 sec: 12638.6). Total num frames: 55235584. Throughput: 0: 12647.8. Samples: 55214970. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:40:28,941][03942] Avg episode reward: [(0, '1249.662')] [2023-03-06 15:40:29,651][04272] Updated weights for policy 0, policy_version 53950 (0.0006) [2023-03-06 15:40:30,470][04272] Updated weights for policy 0, policy_version 53960 (0.0006) [2023-03-06 15:40:31,274][04272] Updated weights for policy 0, policy_version 53970 (0.0007) [2023-03-06 15:40:32,078][04272] Updated weights for policy 0, policy_version 53980 (0.0006) [2023-03-06 15:40:32,899][04272] Updated weights for policy 0, policy_version 53990 (0.0006) [2023-03-06 15:40:33,695][04272] Updated weights for policy 0, policy_version 54000 (0.0007) [2023-03-06 15:40:33,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12663.5, 300 sec: 12642.1). Total num frames: 55299072. Throughput: 0: 12654.8. Samples: 55290897. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:40:33,941][03942] Avg episode reward: [(0, '1194.900')] [2023-03-06 15:40:34,492][04272] Updated weights for policy 0, policy_version 54010 (0.0006) [2023-03-06 15:40:35,307][04272] Updated weights for policy 0, policy_version 54020 (0.0007) [2023-03-06 15:40:36,114][04272] Updated weights for policy 0, policy_version 54030 (0.0006) [2023-03-06 15:40:36,930][04272] Updated weights for policy 0, policy_version 54040 (0.0006) [2023-03-06 15:40:37,737][04272] Updated weights for policy 0, policy_version 54050 (0.0007) [2023-03-06 15:40:38,567][04272] Updated weights for policy 0, policy_version 54060 (0.0007) [2023-03-06 15:40:38,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12646.4, 300 sec: 12638.6). Total num frames: 55361536. Throughput: 0: 12651.6. Samples: 55328793. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:40:38,941][03942] Avg episode reward: [(0, '1257.810')] [2023-03-06 15:40:39,388][04272] Updated weights for policy 0, policy_version 54070 (0.0007) [2023-03-06 15:40:40,191][04272] Updated weights for policy 0, policy_version 54080 (0.0007) [2023-03-06 15:40:41,001][04272] Updated weights for policy 0, policy_version 54090 (0.0007) [2023-03-06 15:40:41,809][04272] Updated weights for policy 0, policy_version 54100 (0.0007) [2023-03-06 15:40:42,620][04272] Updated weights for policy 0, policy_version 54110 (0.0006) [2023-03-06 15:40:43,427][04272] Updated weights for policy 0, policy_version 54120 (0.0006) [2023-03-06 15:40:43,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12646.4, 300 sec: 12638.6). Total num frames: 55425024. Throughput: 0: 12644.5. Samples: 55404352. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:40:43,941][03942] Avg episode reward: [(0, '1186.613')] [2023-03-06 15:40:44,252][04272] Updated weights for policy 0, policy_version 54130 (0.0006) [2023-03-06 15:40:45,042][04272] Updated weights for policy 0, policy_version 54140 (0.0006) [2023-03-06 15:40:45,857][04272] Updated weights for policy 0, policy_version 54150 (0.0007) [2023-03-06 15:40:46,669][04272] Updated weights for policy 0, policy_version 54160 (0.0006) [2023-03-06 15:40:47,466][04272] Updated weights for policy 0, policy_version 54170 (0.0006) [2023-03-06 15:40:48,265][04272] Updated weights for policy 0, policy_version 54180 (0.0006) [2023-03-06 15:40:48,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12646.4, 300 sec: 12638.6). Total num frames: 55488512. Throughput: 0: 12636.6. Samples: 55480321. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:40:48,941][03942] Avg episode reward: [(0, '1212.116')] [2023-03-06 15:40:49,083][04272] Updated weights for policy 0, policy_version 54190 (0.0007) [2023-03-06 15:40:49,877][04272] Updated weights for policy 0, policy_version 54200 (0.0006) [2023-03-06 15:40:50,689][04272] Updated weights for policy 0, policy_version 54210 (0.0006) [2023-03-06 15:40:51,502][04272] Updated weights for policy 0, policy_version 54220 (0.0006) [2023-03-06 15:40:52,298][04272] Updated weights for policy 0, policy_version 54230 (0.0006) [2023-03-06 15:40:53,122][04272] Updated weights for policy 0, policy_version 54240 (0.0007) [2023-03-06 15:40:53,931][04272] Updated weights for policy 0, policy_version 54250 (0.0007) [2023-03-06 15:40:53,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12646.4, 300 sec: 12642.1). Total num frames: 55552000. Throughput: 0: 12641.7. Samples: 55518301. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:40:53,941][03942] Avg episode reward: [(0, '1186.149')] [2023-03-06 15:40:54,734][04272] Updated weights for policy 0, policy_version 54260 (0.0006) [2023-03-06 15:40:55,545][04272] Updated weights for policy 0, policy_version 54270 (0.0006) [2023-03-06 15:40:56,343][04272] Updated weights for policy 0, policy_version 54280 (0.0006) [2023-03-06 15:40:57,159][04272] Updated weights for policy 0, policy_version 54290 (0.0006) [2023-03-06 15:40:57,968][04272] Updated weights for policy 0, policy_version 54300 (0.0006) [2023-03-06 15:40:58,783][04272] Updated weights for policy 0, policy_version 54310 (0.0006) [2023-03-06 15:40:58,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12646.4, 300 sec: 12638.6). Total num frames: 55614464. Throughput: 0: 12642.9. Samples: 55594316. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:40:58,941][03942] Avg episode reward: [(0, '1225.630')] [2023-03-06 15:40:59,609][04272] Updated weights for policy 0, policy_version 54320 (0.0006) [2023-03-06 15:41:00,428][04272] Updated weights for policy 0, policy_version 54330 (0.0007) [2023-03-06 15:41:01,247][04272] Updated weights for policy 0, policy_version 54340 (0.0006) [2023-03-06 15:41:02,048][04272] Updated weights for policy 0, policy_version 54350 (0.0006) [2023-03-06 15:41:02,865][04272] Updated weights for policy 0, policy_version 54360 (0.0006) [2023-03-06 15:41:03,668][04272] Updated weights for policy 0, policy_version 54370 (0.0006) [2023-03-06 15:41:03,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12646.4, 300 sec: 12638.6). Total num frames: 55677952. Throughput: 0: 12640.6. Samples: 55669974. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:41:03,941][03942] Avg episode reward: [(0, '1248.516')] [2023-03-06 15:41:04,465][04272] Updated weights for policy 0, policy_version 54380 (0.0006) [2023-03-06 15:41:05,271][04272] Updated weights for policy 0, policy_version 54390 (0.0007) [2023-03-06 15:41:06,092][04272] Updated weights for policy 0, policy_version 54400 (0.0006) [2023-03-06 15:41:06,902][04272] Updated weights for policy 0, policy_version 54410 (0.0007) [2023-03-06 15:41:07,702][04272] Updated weights for policy 0, policy_version 54420 (0.0006) [2023-03-06 15:41:08,522][04272] Updated weights for policy 0, policy_version 54430 (0.0006) [2023-03-06 15:41:08,941][03942] Fps is (10 sec: 12697.7, 60 sec: 12646.4, 300 sec: 12638.6). Total num frames: 55741440. Throughput: 0: 12642.9. Samples: 55707933. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:41:08,941][03942] Avg episode reward: [(0, '1130.395')] [2023-03-06 15:41:08,944][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000054435_55741440.pth... 
[2023-03-06 15:41:08,976][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000051471_52706304.pth [2023-03-06 15:41:09,349][04272] Updated weights for policy 0, policy_version 54440 (0.0007) [2023-03-06 15:41:10,142][04272] Updated weights for policy 0, policy_version 54450 (0.0006) [2023-03-06 15:41:10,961][04272] Updated weights for policy 0, policy_version 54460 (0.0006) [2023-03-06 15:41:11,784][04272] Updated weights for policy 0, policy_version 54470 (0.0006) [2023-03-06 15:41:12,586][04272] Updated weights for policy 0, policy_version 54480 (0.0006) [2023-03-06 15:41:13,411][04272] Updated weights for policy 0, policy_version 54490 (0.0006) [2023-03-06 15:41:13,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12638.6). Total num frames: 55803904. Throughput: 0: 12634.4. Samples: 55783517. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:41:13,941][03942] Avg episode reward: [(0, '1249.962')] [2023-03-06 15:41:14,229][04272] Updated weights for policy 0, policy_version 54500 (0.0007) [2023-03-06 15:41:15,033][04272] Updated weights for policy 0, policy_version 54510 (0.0007) [2023-03-06 15:41:15,850][04272] Updated weights for policy 0, policy_version 54520 (0.0006) [2023-03-06 15:41:16,654][04272] Updated weights for policy 0, policy_version 54530 (0.0006) [2023-03-06 15:41:17,465][04272] Updated weights for policy 0, policy_version 54540 (0.0006) [2023-03-06 15:41:18,279][04272] Updated weights for policy 0, policy_version 54550 (0.0007) [2023-03-06 15:41:18,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12646.4, 300 sec: 12642.1). Total num frames: 55867392. Throughput: 0: 12628.4. Samples: 55859174. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:41:18,941][03942] Avg episode reward: [(0, '1247.464')] [2023-03-06 15:41:19,097][04272] Updated weights for policy 0, policy_version 54560 (0.0006) [2023-03-06 15:41:19,898][04272] Updated weights for policy 0, policy_version 54570 (0.0006) [2023-03-06 15:41:20,716][04272] Updated weights for policy 0, policy_version 54580 (0.0006) [2023-03-06 15:41:21,537][04272] Updated weights for policy 0, policy_version 54590 (0.0006) [2023-03-06 15:41:22,342][04272] Updated weights for policy 0, policy_version 54600 (0.0007) [2023-03-06 15:41:23,147][04272] Updated weights for policy 0, policy_version 54610 (0.0008) [2023-03-06 15:41:23,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12638.6). Total num frames: 55929856. Throughput: 0: 12627.6. Samples: 55897035. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:41:23,941][03942] Avg episode reward: [(0, '1211.941')] [2023-03-06 15:41:23,969][04272] Updated weights for policy 0, policy_version 54620 (0.0006) [2023-03-06 15:41:24,771][04272] Updated weights for policy 0, policy_version 54630 (0.0007) [2023-03-06 15:41:25,567][04272] Updated weights for policy 0, policy_version 54640 (0.0006) [2023-03-06 15:41:26,382][04272] Updated weights for policy 0, policy_version 54650 (0.0006) [2023-03-06 15:41:27,182][04272] Updated weights for policy 0, policy_version 54660 (0.0006) [2023-03-06 15:41:27,987][04272] Updated weights for policy 0, policy_version 54670 (0.0006) [2023-03-06 15:41:28,816][04272] Updated weights for policy 0, policy_version 54680 (0.0006) [2023-03-06 15:41:28,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12629.4, 300 sec: 12638.6). Total num frames: 55993344. Throughput: 0: 12635.0. Samples: 55972928. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:41:28,941][03942] Avg episode reward: [(0, '1213.287')] [2023-03-06 15:41:29,638][04272] Updated weights for policy 0, policy_version 54690 (0.0007) [2023-03-06 15:41:30,434][04272] Updated weights for policy 0, policy_version 54700 (0.0006) [2023-03-06 15:41:31,250][04272] Updated weights for policy 0, policy_version 54710 (0.0006) [2023-03-06 15:41:32,048][04272] Updated weights for policy 0, policy_version 54720 (0.0007) [2023-03-06 15:41:32,865][04272] Updated weights for policy 0, policy_version 54730 (0.0006) [2023-03-06 15:41:33,672][04272] Updated weights for policy 0, policy_version 54740 (0.0006) [2023-03-06 15:41:33,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12629.3, 300 sec: 12638.6). Total num frames: 56056832. Throughput: 0: 12633.3. Samples: 56048822. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:41:33,941][03942] Avg episode reward: [(0, '1185.574')] [2023-03-06 15:41:34,495][04272] Updated weights for policy 0, policy_version 54750 (0.0006) [2023-03-06 15:41:35,297][04272] Updated weights for policy 0, policy_version 54760 (0.0006) [2023-03-06 15:41:36,119][04272] Updated weights for policy 0, policy_version 54770 (0.0006) [2023-03-06 15:41:36,933][04272] Updated weights for policy 0, policy_version 54780 (0.0006) [2023-03-06 15:41:37,743][04272] Updated weights for policy 0, policy_version 54790 (0.0007) [2023-03-06 15:41:38,561][04272] Updated weights for policy 0, policy_version 54800 (0.0006) [2023-03-06 15:41:38,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12638.6). Total num frames: 56119296. Throughput: 0: 12628.2. Samples: 56086571. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:41:38,941][03942] Avg episode reward: [(0, '1271.737')] [2023-03-06 15:41:39,381][04272] Updated weights for policy 0, policy_version 54810 (0.0006) [2023-03-06 15:41:40,198][04272] Updated weights for policy 0, policy_version 54820 (0.0007) [2023-03-06 15:41:40,988][04272] Updated weights for policy 0, policy_version 54830 (0.0006) [2023-03-06 15:41:41,809][04272] Updated weights for policy 0, policy_version 54840 (0.0006) [2023-03-06 15:41:42,632][04272] Updated weights for policy 0, policy_version 54850 (0.0006) [2023-03-06 15:41:43,442][04272] Updated weights for policy 0, policy_version 54860 (0.0006) [2023-03-06 15:41:43,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12629.3, 300 sec: 12638.6). Total num frames: 56182784. Throughput: 0: 12615.1. Samples: 56161993. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:41:43,941][03942] Avg episode reward: [(0, '1289.844')] [2023-03-06 15:41:44,256][04272] Updated weights for policy 0, policy_version 54870 (0.0006) [2023-03-06 15:41:45,055][04272] Updated weights for policy 0, policy_version 54880 (0.0006) [2023-03-06 15:41:45,885][04272] Updated weights for policy 0, policy_version 54890 (0.0007) [2023-03-06 15:41:46,702][04272] Updated weights for policy 0, policy_version 54900 (0.0007) [2023-03-06 15:41:47,514][04272] Updated weights for policy 0, policy_version 54910 (0.0006) [2023-03-06 15:41:48,345][04272] Updated weights for policy 0, policy_version 54920 (0.0006) [2023-03-06 15:41:48,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12635.1). Total num frames: 56245248. Throughput: 0: 12608.0. Samples: 56237334. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:41:48,941][03942] Avg episode reward: [(0, '1238.699')] [2023-03-06 15:41:49,158][04272] Updated weights for policy 0, policy_version 54930 (0.0006) [2023-03-06 15:41:49,964][04272] Updated weights for policy 0, policy_version 54940 (0.0006) [2023-03-06 15:41:50,773][04272] Updated weights for policy 0, policy_version 54950 (0.0006) [2023-03-06 15:41:51,578][04272] Updated weights for policy 0, policy_version 54960 (0.0006) [2023-03-06 15:41:52,399][04272] Updated weights for policy 0, policy_version 54970 (0.0007) [2023-03-06 15:41:53,201][04272] Updated weights for policy 0, policy_version 54980 (0.0006) [2023-03-06 15:41:53,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12638.6). Total num frames: 56308736. Throughput: 0: 12606.8. Samples: 56275238. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:41:53,941][03942] Avg episode reward: [(0, '1087.511')] [2023-03-06 15:41:54,008][04272] Updated weights for policy 0, policy_version 54990 (0.0006) [2023-03-06 15:41:54,821][04272] Updated weights for policy 0, policy_version 55000 (0.0006) [2023-03-06 15:41:55,626][04272] Updated weights for policy 0, policy_version 55010 (0.0006) [2023-03-06 15:41:56,465][04272] Updated weights for policy 0, policy_version 55020 (0.0007) [2023-03-06 15:41:57,270][04272] Updated weights for policy 0, policy_version 55030 (0.0006) [2023-03-06 15:41:58,077][04272] Updated weights for policy 0, policy_version 55040 (0.0006) [2023-03-06 15:41:58,903][04272] Updated weights for policy 0, policy_version 55050 (0.0007) [2023-03-06 15:41:58,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12638.6). Total num frames: 56371200. Throughput: 0: 12606.1. Samples: 56350792. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:41:58,941][03942] Avg episode reward: [(0, '1155.464')] [2023-03-06 15:41:59,713][04272] Updated weights for policy 0, policy_version 55060 (0.0006) [2023-03-06 15:42:00,504][04272] Updated weights for policy 0, policy_version 55070 (0.0007) [2023-03-06 15:42:01,337][04272] Updated weights for policy 0, policy_version 55080 (0.0007) [2023-03-06 15:42:02,153][04272] Updated weights for policy 0, policy_version 55090 (0.0006) [2023-03-06 15:42:02,953][04272] Updated weights for policy 0, policy_version 55100 (0.0006) [2023-03-06 15:42:03,771][04272] Updated weights for policy 0, policy_version 55110 (0.0006) [2023-03-06 15:42:03,941][03942] Fps is (10 sec: 12492.9, 60 sec: 12595.2, 300 sec: 12635.1). Total num frames: 56433664. Throughput: 0: 12606.4. Samples: 56426461. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:42:03,941][03942] Avg episode reward: [(0, '1223.875')] [2023-03-06 15:42:04,579][04272] Updated weights for policy 0, policy_version 55120 (0.0006) [2023-03-06 15:42:05,395][04272] Updated weights for policy 0, policy_version 55130 (0.0006) [2023-03-06 15:42:06,190][04272] Updated weights for policy 0, policy_version 55140 (0.0006) [2023-03-06 15:42:07,004][04272] Updated weights for policy 0, policy_version 55150 (0.0006) [2023-03-06 15:42:07,818][04272] Updated weights for policy 0, policy_version 55160 (0.0006) [2023-03-06 15:42:08,646][04272] Updated weights for policy 0, policy_version 55170 (0.0006) [2023-03-06 15:42:08,941][03942] Fps is (10 sec: 12595.0, 60 sec: 12595.2, 300 sec: 12635.1). Total num frames: 56497152. Throughput: 0: 12608.4. Samples: 56464413. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:42:08,941][03942] Avg episode reward: [(0, '1213.526')] [2023-03-06 15:42:09,444][04272] Updated weights for policy 0, policy_version 55180 (0.0007) [2023-03-06 15:42:10,269][04272] Updated weights for policy 0, policy_version 55190 (0.0006) [2023-03-06 15:42:11,078][04272] Updated weights for policy 0, policy_version 55200 (0.0007) [2023-03-06 15:42:11,878][04272] Updated weights for policy 0, policy_version 55210 (0.0006) [2023-03-06 15:42:12,711][04272] Updated weights for policy 0, policy_version 55220 (0.0006) [2023-03-06 15:42:13,510][04272] Updated weights for policy 0, policy_version 55230 (0.0008) [2023-03-06 15:42:13,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12612.3, 300 sec: 12638.6). Total num frames: 56560640. Throughput: 0: 12600.0. Samples: 56539930. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 15:42:13,941][03942] Avg episode reward: [(0, '1162.517')] [2023-03-06 15:42:14,318][04272] Updated weights for policy 0, policy_version 55240 (0.0006) [2023-03-06 15:42:15,125][04272] Updated weights for policy 0, policy_version 55250 (0.0005) [2023-03-06 15:42:15,943][04272] Updated weights for policy 0, policy_version 55260 (0.0007) [2023-03-06 15:42:16,761][04272] Updated weights for policy 0, policy_version 55270 (0.0006) [2023-03-06 15:42:17,586][04272] Updated weights for policy 0, policy_version 55280 (0.0006) [2023-03-06 15:42:18,405][04272] Updated weights for policy 0, policy_version 55290 (0.0006) [2023-03-06 15:42:18,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12635.1). Total num frames: 56623104. Throughput: 0: 12594.1. Samples: 56615557. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 15:42:18,941][03942] Avg episode reward: [(0, '1286.781')] [2023-03-06 15:42:19,221][04272] Updated weights for policy 0, policy_version 55300 (0.0006) [2023-03-06 15:42:20,031][04272] Updated weights for policy 0, policy_version 55310 (0.0007) [2023-03-06 15:42:20,866][04272] Updated weights for policy 0, policy_version 55320 (0.0007) [2023-03-06 15:42:21,667][04272] Updated weights for policy 0, policy_version 55330 (0.0007) [2023-03-06 15:42:22,481][04272] Updated weights for policy 0, policy_version 55340 (0.0007) [2023-03-06 15:42:23,294][04272] Updated weights for policy 0, policy_version 55350 (0.0006) [2023-03-06 15:42:23,940][03942] Fps is (10 sec: 12492.8, 60 sec: 12595.2, 300 sec: 12631.6). Total num frames: 56685568. Throughput: 0: 12588.6. Samples: 56653056. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 15:42:23,941][03942] Avg episode reward: [(0, '1106.312')] [2023-03-06 15:42:24,108][04272] Updated weights for policy 0, policy_version 55360 (0.0006) [2023-03-06 15:42:24,929][04272] Updated weights for policy 0, policy_version 55370 (0.0006) [2023-03-06 15:42:25,737][04272] Updated weights for policy 0, policy_version 55380 (0.0006) [2023-03-06 15:42:26,556][04272] Updated weights for policy 0, policy_version 55390 (0.0007) [2023-03-06 15:42:27,360][04272] Updated weights for policy 0, policy_version 55400 (0.0006) [2023-03-06 15:42:28,161][04272] Updated weights for policy 0, policy_version 55410 (0.0006) [2023-03-06 15:42:28,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12631.6). Total num frames: 56749056. Throughput: 0: 12594.8. Samples: 56728759. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 15:42:28,951][03942] Avg episode reward: [(0, '1222.526')] [2023-03-06 15:42:28,979][04272] Updated weights for policy 0, policy_version 55420 (0.0006) [2023-03-06 15:42:29,785][04272] Updated weights for policy 0, policy_version 55430 (0.0007) [2023-03-06 15:42:30,601][04272] Updated weights for policy 0, policy_version 55440 (0.0006) [2023-03-06 15:42:31,409][04272] Updated weights for policy 0, policy_version 55450 (0.0006) [2023-03-06 15:42:32,209][04272] Updated weights for policy 0, policy_version 55460 (0.0006) [2023-03-06 15:42:33,030][04272] Updated weights for policy 0, policy_version 55470 (0.0007) [2023-03-06 15:42:33,854][04272] Updated weights for policy 0, policy_version 55480 (0.0008) [2023-03-06 15:42:33,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12595.2, 300 sec: 12631.6). Total num frames: 56812544. Throughput: 0: 12602.2. Samples: 56804435. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 15:42:33,952][03942] Avg episode reward: [(0, '1340.758')] [2023-03-06 15:42:34,663][04272] Updated weights for policy 0, policy_version 55490 (0.0007) [2023-03-06 15:42:35,482][04272] Updated weights for policy 0, policy_version 55500 (0.0006) [2023-03-06 15:42:36,291][04272] Updated weights for policy 0, policy_version 55510 (0.0006) [2023-03-06 15:42:37,109][04272] Updated weights for policy 0, policy_version 55520 (0.0006) [2023-03-06 15:42:37,897][04272] Updated weights for policy 0, policy_version 55530 (0.0006) [2023-03-06 15:42:38,726][04272] Updated weights for policy 0, policy_version 55540 (0.0006) [2023-03-06 15:42:38,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12631.6). Total num frames: 56875008. Throughput: 0: 12600.3. Samples: 56842253. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 15:42:38,952][03942] Avg episode reward: [(0, '1214.212')] [2023-03-06 15:42:39,561][04272] Updated weights for policy 0, policy_version 55550 (0.0006) [2023-03-06 15:42:40,374][04272] Updated weights for policy 0, policy_version 55560 (0.0006) [2023-03-06 15:42:41,165][04272] Updated weights for policy 0, policy_version 55570 (0.0006) [2023-03-06 15:42:41,975][04272] Updated weights for policy 0, policy_version 55580 (0.0007) [2023-03-06 15:42:42,778][04272] Updated weights for policy 0, policy_version 55590 (0.0007) [2023-03-06 15:42:43,577][04272] Updated weights for policy 0, policy_version 55600 (0.0007) [2023-03-06 15:42:43,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12631.6). Total num frames: 56938496. Throughput: 0: 12599.9. Samples: 56917786. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 15:42:43,941][03942] Avg episode reward: [(0, '1261.665')] [2023-03-06 15:42:44,404][04272] Updated weights for policy 0, policy_version 55610 (0.0006) [2023-03-06 15:42:45,217][04272] Updated weights for policy 0, policy_version 55620 (0.0006) [2023-03-06 15:42:46,024][04272] Updated weights for policy 0, policy_version 55630 (0.0007) [2023-03-06 15:42:46,827][04272] Updated weights for policy 0, policy_version 55640 (0.0007) [2023-03-06 15:42:47,653][04272] Updated weights for policy 0, policy_version 55650 (0.0007) [2023-03-06 15:42:48,433][04272] Updated weights for policy 0, policy_version 55660 (0.0007) [2023-03-06 15:42:48,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12612.3, 300 sec: 12631.6). Total num frames: 57001984. Throughput: 0: 12606.2. Samples: 56993742. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:42:48,941][03942] Avg episode reward: [(0, '1137.801')] [2023-03-06 15:42:49,248][04272] Updated weights for policy 0, policy_version 55670 (0.0006) [2023-03-06 15:42:50,082][04272] Updated weights for policy 0, policy_version 55680 (0.0006) [2023-03-06 15:42:50,878][04272] Updated weights for policy 0, policy_version 55690 (0.0007) [2023-03-06 15:42:51,692][04272] Updated weights for policy 0, policy_version 55700 (0.0007) [2023-03-06 15:42:52,497][04272] Updated weights for policy 0, policy_version 55710 (0.0006) [2023-03-06 15:42:53,323][04272] Updated weights for policy 0, policy_version 55720 (0.0006) [2023-03-06 15:42:53,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12628.2). Total num frames: 57064448. Throughput: 0: 12604.9. Samples: 57031633. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:42:53,941][03942] Avg episode reward: [(0, '940.896')] [2023-03-06 15:42:54,134][04272] Updated weights for policy 0, policy_version 55730 (0.0007) [2023-03-06 15:42:54,949][04272] Updated weights for policy 0, policy_version 55740 (0.0007) [2023-03-06 15:42:55,745][04272] Updated weights for policy 0, policy_version 55750 (0.0006) [2023-03-06 15:42:56,568][04272] Updated weights for policy 0, policy_version 55760 (0.0006) [2023-03-06 15:42:57,373][04272] Updated weights for policy 0, policy_version 55770 (0.0006) [2023-03-06 15:42:58,183][04272] Updated weights for policy 0, policy_version 55780 (0.0007) [2023-03-06 15:42:58,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12628.2). Total num frames: 57127936. Throughput: 0: 12608.0. Samples: 57107288. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:42:58,941][03942] Avg episode reward: [(0, '1205.902')] [2023-03-06 15:42:58,986][04272] Updated weights for policy 0, policy_version 55790 (0.0009) [2023-03-06 15:42:59,797][04272] Updated weights for policy 0, policy_version 55800 (0.0007) [2023-03-06 15:43:00,635][04272] Updated weights for policy 0, policy_version 55810 (0.0006) [2023-03-06 15:43:01,449][04272] Updated weights for policy 0, policy_version 55820 (0.0006) [2023-03-06 15:43:02,265][04272] Updated weights for policy 0, policy_version 55830 (0.0006) [2023-03-06 15:43:03,082][04272] Updated weights for policy 0, policy_version 55840 (0.0006) [2023-03-06 15:43:03,890][04272] Updated weights for policy 0, policy_version 55850 (0.0006) [2023-03-06 15:43:03,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12628.2). Total num frames: 57190400. Throughput: 0: 12603.4. Samples: 57182709. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:43:03,941][03942] Avg episode reward: [(0, '1177.993')] [2023-03-06 15:43:04,697][04272] Updated weights for policy 0, policy_version 55860 (0.0006) [2023-03-06 15:43:05,501][04272] Updated weights for policy 0, policy_version 55870 (0.0006) [2023-03-06 15:43:06,306][04272] Updated weights for policy 0, policy_version 55880 (0.0006) [2023-03-06 15:43:07,117][04272] Updated weights for policy 0, policy_version 55890 (0.0006) [2023-03-06 15:43:07,941][04272] Updated weights for policy 0, policy_version 55900 (0.0007) [2023-03-06 15:43:08,758][04272] Updated weights for policy 0, policy_version 55910 (0.0006) [2023-03-06 15:43:08,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12628.2). Total num frames: 57253888. Throughput: 0: 12617.9. Samples: 57220862. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:43:08,941][03942] Avg episode reward: [(0, '1207.756')] [2023-03-06 15:43:08,944][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000055912_57253888.pth... [2023-03-06 15:43:08,975][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000052952_54222848.pth [2023-03-06 15:43:09,577][04272] Updated weights for policy 0, policy_version 55920 (0.0007) [2023-03-06 15:43:10,376][04272] Updated weights for policy 0, policy_version 55930 (0.0006) [2023-03-06 15:43:11,197][04272] Updated weights for policy 0, policy_version 55940 (0.0006) [2023-03-06 15:43:11,999][04272] Updated weights for policy 0, policy_version 55950 (0.0006) [2023-03-06 15:43:12,819][04272] Updated weights for policy 0, policy_version 55960 (0.0007) [2023-03-06 15:43:13,615][04272] Updated weights for policy 0, policy_version 55970 (0.0006) [2023-03-06 15:43:13,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12628.2). Total num frames: 57316352. Throughput: 0: 12612.2. Samples: 57296308. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:43:13,941][03942] Avg episode reward: [(0, '1058.295')] [2023-03-06 15:43:14,423][04272] Updated weights for policy 0, policy_version 55980 (0.0006) [2023-03-06 15:43:15,253][04272] Updated weights for policy 0, policy_version 55990 (0.0006) [2023-03-06 15:43:16,076][04272] Updated weights for policy 0, policy_version 56000 (0.0006) [2023-03-06 15:43:16,875][04272] Updated weights for policy 0, policy_version 56010 (0.0005) [2023-03-06 15:43:17,698][04272] Updated weights for policy 0, policy_version 56020 (0.0007) [2023-03-06 15:43:18,496][04272] Updated weights for policy 0, policy_version 56030 (0.0006) [2023-03-06 15:43:18,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12628.2). Total num frames: 57379840. Throughput: 0: 12611.0. Samples: 57371931. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:43:18,941][03942] Avg episode reward: [(0, '941.740')] [2023-03-06 15:43:19,306][04272] Updated weights for policy 0, policy_version 56040 (0.0006) [2023-03-06 15:43:20,115][04272] Updated weights for policy 0, policy_version 56050 (0.0006) [2023-03-06 15:43:20,934][04272] Updated weights for policy 0, policy_version 56060 (0.0006) [2023-03-06 15:43:21,749][04272] Updated weights for policy 0, policy_version 56070 (0.0005) [2023-03-06 15:43:22,551][04272] Updated weights for policy 0, policy_version 56080 (0.0006) [2023-03-06 15:43:23,374][04272] Updated weights for policy 0, policy_version 56090 (0.0007) [2023-03-06 15:43:23,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12631.6). Total num frames: 57443328. Throughput: 0: 12615.1. Samples: 57409931. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:43:23,941][03942] Avg episode reward: [(0, '1134.271')] [2023-03-06 15:43:24,195][04272] Updated weights for policy 0, policy_version 56100 (0.0006) [2023-03-06 15:43:25,004][04272] Updated weights for policy 0, policy_version 56110 (0.0006) [2023-03-06 15:43:25,807][04272] Updated weights for policy 0, policy_version 56120 (0.0006) [2023-03-06 15:43:26,605][04272] Updated weights for policy 0, policy_version 56130 (0.0006) [2023-03-06 15:43:27,417][04272] Updated weights for policy 0, policy_version 56140 (0.0006) [2023-03-06 15:43:28,227][04272] Updated weights for policy 0, policy_version 56150 (0.0007) [2023-03-06 15:43:28,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12628.2). Total num frames: 57505792. Throughput: 0: 12619.5. Samples: 57485663. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:43:28,941][03942] Avg episode reward: [(0, '1153.548')] [2023-03-06 15:43:29,053][04272] Updated weights for policy 0, policy_version 56160 (0.0005) [2023-03-06 15:43:29,862][04272] Updated weights for policy 0, policy_version 56170 (0.0007) [2023-03-06 15:43:30,670][04272] Updated weights for policy 0, policy_version 56180 (0.0006) [2023-03-06 15:43:31,484][04272] Updated weights for policy 0, policy_version 56190 (0.0007) [2023-03-06 15:43:32,292][04272] Updated weights for policy 0, policy_version 56200 (0.0007) [2023-03-06 15:43:33,126][04272] Updated weights for policy 0, policy_version 56210 (0.0006) [2023-03-06 15:43:33,932][04272] Updated weights for policy 0, policy_version 56220 (0.0006) [2023-03-06 15:43:33,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12628.2). Total num frames: 57569280. Throughput: 0: 12602.4. Samples: 57560850. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:43:33,941][03942] Avg episode reward: [(0, '1226.788')] [2023-03-06 15:43:34,750][04272] Updated weights for policy 0, policy_version 56230 (0.0007) [2023-03-06 15:43:35,565][04272] Updated weights for policy 0, policy_version 56240 (0.0006) [2023-03-06 15:43:36,362][04272] Updated weights for policy 0, policy_version 56250 (0.0007) [2023-03-06 15:43:37,178][04272] Updated weights for policy 0, policy_version 56260 (0.0006) [2023-03-06 15:43:37,992][04272] Updated weights for policy 0, policy_version 56270 (0.0006) [2023-03-06 15:43:38,805][04272] Updated weights for policy 0, policy_version 56280 (0.0006) [2023-03-06 15:43:38,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12624.7). Total num frames: 57631744. Throughput: 0: 12602.7. Samples: 57598754. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:43:38,941][03942] Avg episode reward: [(0, '1242.294')] [2023-03-06 15:43:39,610][04272] Updated weights for policy 0, policy_version 56290 (0.0006) [2023-03-06 15:43:40,413][04272] Updated weights for policy 0, policy_version 56300 (0.0007) [2023-03-06 15:43:41,223][04272] Updated weights for policy 0, policy_version 56310 (0.0006) [2023-03-06 15:43:42,042][04272] Updated weights for policy 0, policy_version 56320 (0.0007) [2023-03-06 15:43:42,838][04272] Updated weights for policy 0, policy_version 56330 (0.0006) [2023-03-06 15:43:43,646][04272] Updated weights for policy 0, policy_version 56340 (0.0006) [2023-03-06 15:43:43,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12628.2). Total num frames: 57695232. Throughput: 0: 12610.2. Samples: 57674746. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:43:43,941][03942] Avg episode reward: [(0, '1269.326')] [2023-03-06 15:43:44,468][04272] Updated weights for policy 0, policy_version 56350 (0.0006) [2023-03-06 15:43:45,278][04272] Updated weights for policy 0, policy_version 56360 (0.0007) [2023-03-06 15:43:46,084][04272] Updated weights for policy 0, policy_version 56370 (0.0006) [2023-03-06 15:43:46,902][04272] Updated weights for policy 0, policy_version 56380 (0.0007) [2023-03-06 15:43:47,709][04272] Updated weights for policy 0, policy_version 56390 (0.0006) [2023-03-06 15:43:48,530][04272] Updated weights for policy 0, policy_version 56400 (0.0006) [2023-03-06 15:43:48,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12612.3, 300 sec: 12628.2). Total num frames: 57758720. Throughput: 0: 12618.0. Samples: 57750518. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:43:48,941][03942] Avg episode reward: [(0, '1393.941')] [2023-03-06 15:43:49,334][04272] Updated weights for policy 0, policy_version 56410 (0.0007) [2023-03-06 15:43:50,134][04272] Updated weights for policy 0, policy_version 56420 (0.0006) [2023-03-06 15:43:50,968][04272] Updated weights for policy 0, policy_version 56430 (0.0007) [2023-03-06 15:43:51,754][04272] Updated weights for policy 0, policy_version 56440 (0.0006) [2023-03-06 15:43:52,574][04272] Updated weights for policy 0, policy_version 56450 (0.0006) [2023-03-06 15:43:53,379][04272] Updated weights for policy 0, policy_version 56460 (0.0006) [2023-03-06 15:43:53,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12624.7). Total num frames: 57821184. Throughput: 0: 12609.1. Samples: 57788270. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:43:53,941][03942] Avg episode reward: [(0, '1330.940')] [2023-03-06 15:43:54,176][04272] Updated weights for policy 0, policy_version 56470 (0.0007) [2023-03-06 15:43:54,977][04272] Updated weights for policy 0, policy_version 56480 (0.0006) [2023-03-06 15:43:55,807][04272] Updated weights for policy 0, policy_version 56490 (0.0007) [2023-03-06 15:43:56,641][04272] Updated weights for policy 0, policy_version 56500 (0.0006) [2023-03-06 15:43:57,448][04272] Updated weights for policy 0, policy_version 56510 (0.0006) [2023-03-06 15:43:58,275][04272] Updated weights for policy 0, policy_version 56520 (0.0006) [2023-03-06 15:43:58,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12624.7). Total num frames: 57884672. Throughput: 0: 12616.5. Samples: 57864053. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:43:58,941][03942] Avg episode reward: [(0, '686.875')] [2023-03-06 15:43:59,092][04272] Updated weights for policy 0, policy_version 56530 (0.0006) [2023-03-06 15:43:59,911][04272] Updated weights for policy 0, policy_version 56540 (0.0006) [2023-03-06 15:44:00,730][04272] Updated weights for policy 0, policy_version 56550 (0.0006) [2023-03-06 15:44:01,528][04272] Updated weights for policy 0, policy_version 56560 (0.0006) [2023-03-06 15:44:02,346][04272] Updated weights for policy 0, policy_version 56570 (0.0006) [2023-03-06 15:44:03,156][04272] Updated weights for policy 0, policy_version 56580 (0.0006) [2023-03-06 15:44:03,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12624.7). Total num frames: 57947136. Throughput: 0: 12614.4. Samples: 57939578. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:44:03,941][03942] Avg episode reward: [(0, '432.108')] [2023-03-06 15:44:03,959][04272] Updated weights for policy 0, policy_version 56590 (0.0006) [2023-03-06 15:44:04,768][04272] Updated weights for policy 0, policy_version 56600 (0.0006) [2023-03-06 15:44:05,582][04272] Updated weights for policy 0, policy_version 56610 (0.0006) [2023-03-06 15:44:06,391][04272] Updated weights for policy 0, policy_version 56620 (0.0006) [2023-03-06 15:44:07,210][04272] Updated weights for policy 0, policy_version 56630 (0.0006) [2023-03-06 15:44:08,021][04272] Updated weights for policy 0, policy_version 56640 (0.0006) [2023-03-06 15:44:08,850][04272] Updated weights for policy 0, policy_version 56650 (0.0006) [2023-03-06 15:44:08,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.2, 300 sec: 12624.7). Total num frames: 58010624. Throughput: 0: 12613.6. Samples: 57977543. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:44:08,941][03942] Avg episode reward: [(0, '330.414')] [2023-03-06 15:44:09,653][04272] Updated weights for policy 0, policy_version 56660 (0.0007) [2023-03-06 15:44:10,459][04272] Updated weights for policy 0, policy_version 56670 (0.0006) [2023-03-06 15:44:11,278][04272] Updated weights for policy 0, policy_version 56680 (0.0007) [2023-03-06 15:44:12,083][04272] Updated weights for policy 0, policy_version 56690 (0.0006) [2023-03-06 15:44:12,885][04272] Updated weights for policy 0, policy_version 56700 (0.0006) [2023-03-06 15:44:13,513][04221] KL-divergence is very high: 524.9973 [2023-03-06 15:44:13,680][04272] Updated weights for policy 0, policy_version 56710 (0.0006) [2023-03-06 15:44:13,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12629.3, 300 sec: 12624.7). Total num frames: 58074112. Throughput: 0: 12610.9. Samples: 58053153. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:44:13,941][03942] Avg episode reward: [(0, '624.551')] [2023-03-06 15:44:14,477][04272] Updated weights for policy 0, policy_version 56720 (0.0007) [2023-03-06 15:44:15,293][04272] Updated weights for policy 0, policy_version 56730 (0.0007) [2023-03-06 15:44:16,101][04272] Updated weights for policy 0, policy_version 56740 (0.0006) [2023-03-06 15:44:16,913][04272] Updated weights for policy 0, policy_version 56750 (0.0006) [2023-03-06 15:44:17,733][04272] Updated weights for policy 0, policy_version 56760 (0.0006) [2023-03-06 15:44:18,534][04272] Updated weights for policy 0, policy_version 56770 (0.0006) [2023-03-06 15:44:18,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12624.7). Total num frames: 58137600. Throughput: 0: 12630.6. Samples: 58129229. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:44:18,941][03942] Avg episode reward: [(0, '880.009')] [2023-03-06 15:44:19,333][04272] Updated weights for policy 0, policy_version 56780 (0.0006) [2023-03-06 15:44:20,148][04272] Updated weights for policy 0, policy_version 56790 (0.0007) [2023-03-06 15:44:20,954][04272] Updated weights for policy 0, policy_version 56800 (0.0006) [2023-03-06 15:44:21,761][04272] Updated weights for policy 0, policy_version 56810 (0.0006) [2023-03-06 15:44:22,584][04272] Updated weights for policy 0, policy_version 56820 (0.0006) [2023-03-06 15:44:23,371][04272] Updated weights for policy 0, policy_version 56830 (0.0006) [2023-03-06 15:44:23,614][04221] KL-divergence is very high: 649.5172 [2023-03-06 15:44:23,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12621.2). Total num frames: 58200064. Throughput: 0: 12637.5. Samples: 58167440. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:44:23,941][03942] Avg episode reward: [(0, '731.852')] [2023-03-06 15:44:24,211][04272] Updated weights for policy 0, policy_version 56840 (0.0007) [2023-03-06 15:44:25,010][04272] Updated weights for policy 0, policy_version 56850 (0.0006) [2023-03-06 15:44:25,810][04272] Updated weights for policy 0, policy_version 56860 (0.0005) [2023-03-06 15:44:26,616][04272] Updated weights for policy 0, policy_version 56870 (0.0006) [2023-03-06 15:44:27,414][04272] Updated weights for policy 0, policy_version 56880 (0.0006) [2023-03-06 15:44:28,218][04272] Updated weights for policy 0, policy_version 56890 (0.0007) [2023-03-06 15:44:28,940][03942] Fps is (10 sec: 12595.4, 60 sec: 12629.3, 300 sec: 12624.7). Total num frames: 58263552. Throughput: 0: 12635.5. Samples: 58243342. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:44:28,941][03942] Avg episode reward: [(0, '907.664')] [2023-03-06 15:44:29,035][04272] Updated weights for policy 0, policy_version 56900 (0.0006) [2023-03-06 15:44:29,852][04272] Updated weights for policy 0, policy_version 56910 (0.0006) [2023-03-06 15:44:30,645][04272] Updated weights for policy 0, policy_version 56920 (0.0007) [2023-03-06 15:44:31,463][04272] Updated weights for policy 0, policy_version 56930 (0.0007) [2023-03-06 15:44:32,261][04272] Updated weights for policy 0, policy_version 56940 (0.0006) [2023-03-06 15:44:33,073][04272] Updated weights for policy 0, policy_version 56950 (0.0007) [2023-03-06 15:44:33,866][04272] Updated weights for policy 0, policy_version 56960 (0.0006) [2023-03-06 15:44:33,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12624.7). Total num frames: 58327040. Throughput: 0: 12645.8. Samples: 58319579. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:44:33,941][03942] Avg episode reward: [(0, '739.628')] [2023-03-06 15:44:34,672][04272] Updated weights for policy 0, policy_version 56970 (0.0007) [2023-03-06 15:44:35,481][04272] Updated weights for policy 0, policy_version 56980 (0.0006) [2023-03-06 15:44:36,307][04272] Updated weights for policy 0, policy_version 56990 (0.0006) [2023-03-06 15:44:37,101][04272] Updated weights for policy 0, policy_version 57000 (0.0007) [2023-03-06 15:44:37,902][04272] Updated weights for policy 0, policy_version 57010 (0.0006) [2023-03-06 15:44:38,718][04272] Updated weights for policy 0, policy_version 57020 (0.0007) [2023-03-06 15:44:38,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12646.4, 300 sec: 12624.7). Total num frames: 58390528. Throughput: 0: 12649.9. Samples: 58357516. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:44:38,941][03942] Avg episode reward: [(0, '684.522')] [2023-03-06 15:44:39,517][04272] Updated weights for policy 0, policy_version 57030 (0.0006) [2023-03-06 15:44:40,330][04272] Updated weights for policy 0, policy_version 57040 (0.0006) [2023-03-06 15:44:41,130][04272] Updated weights for policy 0, policy_version 57050 (0.0006) [2023-03-06 15:44:41,936][04272] Updated weights for policy 0, policy_version 57060 (0.0006) [2023-03-06 15:44:42,747][04272] Updated weights for policy 0, policy_version 57070 (0.0006) [2023-03-06 15:44:43,565][04272] Updated weights for policy 0, policy_version 57080 (0.0006) [2023-03-06 15:44:43,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12646.4, 300 sec: 12624.7). Total num frames: 58454016. Throughput: 0: 12658.7. Samples: 58433692. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:44:43,941][03942] Avg episode reward: [(0, '1040.077')] [2023-03-06 15:44:44,357][04272] Updated weights for policy 0, policy_version 57090 (0.0006) [2023-03-06 15:44:45,182][04272] Updated weights for policy 0, policy_version 57100 (0.0006) [2023-03-06 15:44:46,002][04272] Updated weights for policy 0, policy_version 57110 (0.0006) [2023-03-06 15:44:46,797][04272] Updated weights for policy 0, policy_version 57120 (0.0007) [2023-03-06 15:44:47,600][04272] Updated weights for policy 0, policy_version 57130 (0.0006) [2023-03-06 15:44:48,415][04272] Updated weights for policy 0, policy_version 57140 (0.0006) [2023-03-06 15:44:48,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12646.4, 300 sec: 12624.7). Total num frames: 58517504. Throughput: 0: 12665.0. Samples: 58509504. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:44:48,941][03942] Avg episode reward: [(0, '944.402')] [2023-03-06 15:44:49,225][04272] Updated weights for policy 0, policy_version 57150 (0.0006) [2023-03-06 15:44:50,046][04272] Updated weights for policy 0, policy_version 57160 (0.0007) [2023-03-06 15:44:50,854][04272] Updated weights for policy 0, policy_version 57170 (0.0006) [2023-03-06 15:44:51,674][04272] Updated weights for policy 0, policy_version 57180 (0.0006) [2023-03-06 15:44:52,469][04272] Updated weights for policy 0, policy_version 57190 (0.0007) [2023-03-06 15:44:53,277][04272] Updated weights for policy 0, policy_version 57200 (0.0006) [2023-03-06 15:44:53,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12663.5, 300 sec: 12628.2). Total num frames: 58580992. Throughput: 0: 12664.8. Samples: 58547457. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:44:53,941][03942] Avg episode reward: [(0, '1167.829')] [2023-03-06 15:44:54,084][04272] Updated weights for policy 0, policy_version 57210 (0.0007) [2023-03-06 15:44:54,902][04272] Updated weights for policy 0, policy_version 57220 (0.0006) [2023-03-06 15:44:55,709][04272] Updated weights for policy 0, policy_version 57230 (0.0006) [2023-03-06 15:44:56,518][04272] Updated weights for policy 0, policy_version 57240 (0.0006) [2023-03-06 15:44:57,330][04272] Updated weights for policy 0, policy_version 57250 (0.0006) [2023-03-06 15:44:58,156][04272] Updated weights for policy 0, policy_version 57260 (0.0006) [2023-03-06 15:44:58,941][03942] Fps is (10 sec: 12595.4, 60 sec: 12646.4, 300 sec: 12624.7). Total num frames: 58643456. Throughput: 0: 12671.9. Samples: 58623389. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:44:58,941][03942] Avg episode reward: [(0, '1267.670')] [2023-03-06 15:44:58,957][04272] Updated weights for policy 0, policy_version 57270 (0.0006) [2023-03-06 15:44:59,750][04272] Updated weights for policy 0, policy_version 57280 (0.0006) [2023-03-06 15:45:00,583][04272] Updated weights for policy 0, policy_version 57290 (0.0006) [2023-03-06 15:45:01,391][04272] Updated weights for policy 0, policy_version 57300 (0.0007) [2023-03-06 15:45:02,198][04272] Updated weights for policy 0, policy_version 57310 (0.0006) [2023-03-06 15:45:03,005][04272] Updated weights for policy 0, policy_version 57320 (0.0006) [2023-03-06 15:45:03,814][04272] Updated weights for policy 0, policy_version 57330 (0.0006) [2023-03-06 15:45:03,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12663.5, 300 sec: 12624.7). Total num frames: 58706944. Throughput: 0: 12662.3. Samples: 58699033. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:45:03,941][03942] Avg episode reward: [(0, '1251.170')] [2023-03-06 15:45:04,615][04272] Updated weights for policy 0, policy_version 57340 (0.0006) [2023-03-06 15:45:05,440][04272] Updated weights for policy 0, policy_version 57350 (0.0006) [2023-03-06 15:45:06,237][04272] Updated weights for policy 0, policy_version 57360 (0.0007) [2023-03-06 15:45:07,055][04272] Updated weights for policy 0, policy_version 57370 (0.0006) [2023-03-06 15:45:07,862][04272] Updated weights for policy 0, policy_version 57380 (0.0006) [2023-03-06 15:45:08,654][04272] Updated weights for policy 0, policy_version 57390 (0.0006) [2023-03-06 15:45:08,941][03942] Fps is (10 sec: 12697.4, 60 sec: 12663.5, 300 sec: 12624.7). Total num frames: 58770432. Throughput: 0: 12655.9. Samples: 58736957. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:45:08,941][03942] Avg episode reward: [(0, '1174.921')] [2023-03-06 15:45:08,945][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000057393_58770432.pth... [2023-03-06 15:45:08,976][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000054435_55741440.pth [2023-03-06 15:45:09,466][04272] Updated weights for policy 0, policy_version 57400 (0.0006) [2023-03-06 15:45:10,275][04272] Updated weights for policy 0, policy_version 57410 (0.0005) [2023-03-06 15:45:11,101][04272] Updated weights for policy 0, policy_version 57420 (0.0006) [2023-03-06 15:45:11,913][04272] Updated weights for policy 0, policy_version 57430 (0.0007) [2023-03-06 15:45:12,724][04272] Updated weights for policy 0, policy_version 57440 (0.0007) [2023-03-06 15:45:13,549][04272] Updated weights for policy 0, policy_version 57450 (0.0007) [2023-03-06 15:45:13,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12646.4, 300 sec: 12624.7). Total num frames: 58832896. Throughput: 0: 12655.4. Samples: 58812836. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:45:13,941][03942] Avg episode reward: [(0, '1092.066')] [2023-03-06 15:45:14,349][04272] Updated weights for policy 0, policy_version 57460 (0.0006) [2023-03-06 15:45:15,168][04272] Updated weights for policy 0, policy_version 57470 (0.0007) [2023-03-06 15:45:16,000][04272] Updated weights for policy 0, policy_version 57480 (0.0006) [2023-03-06 15:45:16,795][04272] Updated weights for policy 0, policy_version 57490 (0.0006) [2023-03-06 15:45:17,621][04272] Updated weights for policy 0, policy_version 57500 (0.0006) [2023-03-06 15:45:18,432][04272] Updated weights for policy 0, policy_version 57510 (0.0006) [2023-03-06 15:45:18,940][03942] Fps is (10 sec: 12595.5, 60 sec: 12646.4, 300 sec: 12624.7). Total num frames: 58896384. Throughput: 0: 12641.1. Samples: 58888428. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:45:18,941][03942] Avg episode reward: [(0, '1241.148')] [2023-03-06 15:45:19,246][04272] Updated weights for policy 0, policy_version 57520 (0.0006) [2023-03-06 15:45:20,053][04272] Updated weights for policy 0, policy_version 57530 (0.0006) [2023-03-06 15:45:20,861][04272] Updated weights for policy 0, policy_version 57540 (0.0006) [2023-03-06 15:45:21,676][04272] Updated weights for policy 0, policy_version 57550 (0.0006) [2023-03-06 15:45:22,493][04272] Updated weights for policy 0, policy_version 57560 (0.0006) [2023-03-06 15:45:23,289][04272] Updated weights for policy 0, policy_version 57570 (0.0006) [2023-03-06 15:45:23,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12646.4, 300 sec: 12621.2). Total num frames: 58958848. Throughput: 0: 12636.6. Samples: 58926163. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:45:23,941][03942] Avg episode reward: [(0, '1186.754')] [2023-03-06 15:45:24,134][04272] Updated weights for policy 0, policy_version 57580 (0.0006) [2023-03-06 15:45:24,942][04272] Updated weights for policy 0, policy_version 57590 (0.0006) [2023-03-06 15:45:25,747][04272] Updated weights for policy 0, policy_version 57600 (0.0007) [2023-03-06 15:45:26,553][04272] Updated weights for policy 0, policy_version 57610 (0.0007) [2023-03-06 15:45:27,368][04272] Updated weights for policy 0, policy_version 57620 (0.0007) [2023-03-06 15:45:28,174][04272] Updated weights for policy 0, policy_version 57630 (0.0006) [2023-03-06 15:45:28,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12646.4, 300 sec: 12621.2). Total num frames: 59022336. Throughput: 0: 12623.9. Samples: 59001766. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:45:28,941][03942] Avg episode reward: [(0, '1179.971')] [2023-03-06 15:45:28,983][04272] Updated weights for policy 0, policy_version 57640 (0.0007) [2023-03-06 15:45:29,809][04272] Updated weights for policy 0, policy_version 57650 (0.0006) [2023-03-06 15:45:30,605][04272] Updated weights for policy 0, policy_version 57660 (0.0006) [2023-03-06 15:45:31,430][04272] Updated weights for policy 0, policy_version 57670 (0.0006) [2023-03-06 15:45:32,237][04272] Updated weights for policy 0, policy_version 57680 (0.0006) [2023-03-06 15:45:33,075][04272] Updated weights for policy 0, policy_version 57690 (0.0006) [2023-03-06 15:45:33,871][04272] Updated weights for policy 0, policy_version 57700 (0.0006) [2023-03-06 15:45:33,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12629.3, 300 sec: 12621.2). Total num frames: 59084800. Throughput: 0: 12616.4. Samples: 59077243. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:45:33,941][03942] Avg episode reward: [(0, '1303.216')] [2023-03-06 15:45:34,681][04272] Updated weights for policy 0, policy_version 57710 (0.0007) [2023-03-06 15:45:35,491][04272] Updated weights for policy 0, policy_version 57720 (0.0007) [2023-03-06 15:45:36,313][04272] Updated weights for policy 0, policy_version 57730 (0.0006) [2023-03-06 15:45:37,105][04272] Updated weights for policy 0, policy_version 57740 (0.0006) [2023-03-06 15:45:37,917][04272] Updated weights for policy 0, policy_version 57750 (0.0006) [2023-03-06 15:45:38,734][04272] Updated weights for policy 0, policy_version 57760 (0.0006) [2023-03-06 15:45:38,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.4, 300 sec: 12621.2). Total num frames: 59148288. Throughput: 0: 12617.0. Samples: 59115222. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:45:38,951][03942] Avg episode reward: [(0, '1389.330')] [2023-03-06 15:45:39,540][04272] Updated weights for policy 0, policy_version 57770 (0.0007) [2023-03-06 15:45:40,366][04272] Updated weights for policy 0, policy_version 57780 (0.0006) [2023-03-06 15:45:41,183][04272] Updated weights for policy 0, policy_version 57790 (0.0007) [2023-03-06 15:45:41,989][04272] Updated weights for policy 0, policy_version 57800 (0.0006) [2023-03-06 15:45:42,801][04272] Updated weights for policy 0, policy_version 57810 (0.0006) [2023-03-06 15:45:43,614][04272] Updated weights for policy 0, policy_version 57820 (0.0007) [2023-03-06 15:45:43,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12617.8). Total num frames: 59210752. Throughput: 0: 12610.7. Samples: 59190871. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:45:43,952][03942] Avg episode reward: [(0, '1366.657')] [2023-03-06 15:45:44,424][04272] Updated weights for policy 0, policy_version 57830 (0.0006) [2023-03-06 15:45:45,223][04272] Updated weights for policy 0, policy_version 57840 (0.0005) [2023-03-06 15:45:46,037][04272] Updated weights for policy 0, policy_version 57850 (0.0007) [2023-03-06 15:45:46,858][04272] Updated weights for policy 0, policy_version 57860 (0.0006) [2023-03-06 15:45:47,638][04272] Updated weights for policy 0, policy_version 57870 (0.0006) [2023-03-06 15:45:48,437][04272] Updated weights for policy 0, policy_version 57880 (0.0008) [2023-03-06 15:45:48,941][03942] Fps is (10 sec: 12595.0, 60 sec: 12612.3, 300 sec: 12617.8). Total num frames: 59274240. Throughput: 0: 12619.7. Samples: 59266919. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:45:48,941][03942] Avg episode reward: [(0, '1413.753')] [2023-03-06 15:45:49,262][04272] Updated weights for policy 0, policy_version 57890 (0.0006) [2023-03-06 15:45:50,071][04272] Updated weights for policy 0, policy_version 57900 (0.0006) [2023-03-06 15:45:50,888][04272] Updated weights for policy 0, policy_version 57910 (0.0006) [2023-03-06 15:45:51,697][04272] Updated weights for policy 0, policy_version 57920 (0.0006) [2023-03-06 15:45:52,509][04272] Updated weights for policy 0, policy_version 57930 (0.0006) [2023-03-06 15:45:53,314][04272] Updated weights for policy 0, policy_version 57940 (0.0007) [2023-03-06 15:45:53,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12612.3, 300 sec: 12621.2). Total num frames: 59337728. Throughput: 0: 12620.5. Samples: 59304878. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:45:53,941][03942] Avg episode reward: [(0, '1356.882')] [2023-03-06 15:45:54,119][04272] Updated weights for policy 0, policy_version 57950 (0.0006) [2023-03-06 15:45:54,926][04272] Updated weights for policy 0, policy_version 57960 (0.0006) [2023-03-06 15:45:55,738][04272] Updated weights for policy 0, policy_version 57970 (0.0007) [2023-03-06 15:45:56,543][04272] Updated weights for policy 0, policy_version 57980 (0.0006) [2023-03-06 15:45:57,362][04272] Updated weights for policy 0, policy_version 57990 (0.0006) [2023-03-06 15:45:58,174][04272] Updated weights for policy 0, policy_version 58000 (0.0005) [2023-03-06 15:45:58,940][03942] Fps is (10 sec: 12697.8, 60 sec: 12629.3, 300 sec: 12621.2). Total num frames: 59401216. Throughput: 0: 12625.8. Samples: 59380996. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:45:58,941][03942] Avg episode reward: [(0, '1259.472')] [2023-03-06 15:45:58,979][04272] Updated weights for policy 0, policy_version 58010 (0.0006) [2023-03-06 15:45:59,775][04272] Updated weights for policy 0, policy_version 58020 (0.0006) [2023-03-06 15:46:00,594][04272] Updated weights for policy 0, policy_version 58030 (0.0006) [2023-03-06 15:46:01,409][04272] Updated weights for policy 0, policy_version 58040 (0.0006) [2023-03-06 15:46:02,219][04272] Updated weights for policy 0, policy_version 58050 (0.0006) [2023-03-06 15:46:03,021][04272] Updated weights for policy 0, policy_version 58060 (0.0007) [2023-03-06 15:46:03,831][04272] Updated weights for policy 0, policy_version 58070 (0.0006) [2023-03-06 15:46:03,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12621.2). Total num frames: 59464704. Throughput: 0: 12630.9. Samples: 59456820. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:46:03,941][03942] Avg episode reward: [(0, '1250.662')] [2023-03-06 15:46:04,632][04272] Updated weights for policy 0, policy_version 58080 (0.0006) [2023-03-06 15:46:05,443][04272] Updated weights for policy 0, policy_version 58090 (0.0007) [2023-03-06 15:46:06,263][04272] Updated weights for policy 0, policy_version 58100 (0.0006) [2023-03-06 15:46:07,060][04272] Updated weights for policy 0, policy_version 58110 (0.0006) [2023-03-06 15:46:07,889][04272] Updated weights for policy 0, policy_version 58120 (0.0006) [2023-03-06 15:46:08,707][04272] Updated weights for policy 0, policy_version 58130 (0.0006) [2023-03-06 15:46:08,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12621.2). Total num frames: 59527168. Throughput: 0: 12632.7. Samples: 59494634. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:46:08,941][03942] Avg episode reward: [(0, '1371.590')] [2023-03-06 15:46:09,506][04272] Updated weights for policy 0, policy_version 58140 (0.0006) [2023-03-06 15:46:10,321][04272] Updated weights for policy 0, policy_version 58150 (0.0007) [2023-03-06 15:46:11,129][04272] Updated weights for policy 0, policy_version 58160 (0.0006) [2023-03-06 15:46:11,954][04272] Updated weights for policy 0, policy_version 58170 (0.0007) [2023-03-06 15:46:12,755][04272] Updated weights for policy 0, policy_version 58180 (0.0007) [2023-03-06 15:46:13,566][04272] Updated weights for policy 0, policy_version 58190 (0.0007) [2023-03-06 15:46:13,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.4, 300 sec: 12621.2). Total num frames: 59590656. Throughput: 0: 12636.6. Samples: 59570411. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:46:13,941][03942] Avg episode reward: [(0, '1394.852')] [2023-03-06 15:46:14,381][04272] Updated weights for policy 0, policy_version 58200 (0.0006) [2023-03-06 15:46:15,196][04272] Updated weights for policy 0, policy_version 58210 (0.0007) [2023-03-06 15:46:16,014][04272] Updated weights for policy 0, policy_version 58220 (0.0006) [2023-03-06 15:46:16,826][04272] Updated weights for policy 0, policy_version 58230 (0.0006) [2023-03-06 15:46:17,619][04272] Updated weights for policy 0, policy_version 58240 (0.0006) [2023-03-06 15:46:18,425][04272] Updated weights for policy 0, policy_version 58250 (0.0006) [2023-03-06 15:46:18,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12629.3, 300 sec: 12624.7). Total num frames: 59654144. Throughput: 0: 12642.7. Samples: 59646162. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:46:18,941][03942] Avg episode reward: [(0, '1342.919')] [2023-03-06 15:46:19,257][04272] Updated weights for policy 0, policy_version 58260 (0.0006) [2023-03-06 15:46:20,049][04272] Updated weights for policy 0, policy_version 58270 (0.0006) [2023-03-06 15:46:20,856][04272] Updated weights for policy 0, policy_version 58280 (0.0006) [2023-03-06 15:46:21,678][04272] Updated weights for policy 0, policy_version 58290 (0.0006) [2023-03-06 15:46:22,485][04272] Updated weights for policy 0, policy_version 58300 (0.0006) [2023-03-06 15:46:23,298][04272] Updated weights for policy 0, policy_version 58310 (0.0006) [2023-03-06 15:46:23,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12621.2). Total num frames: 59716608. Throughput: 0: 12642.6. Samples: 59684141. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:46:23,941][03942] Avg episode reward: [(0, '1276.950')] [2023-03-06 15:46:24,110][04272] Updated weights for policy 0, policy_version 58320 (0.0006) [2023-03-06 15:46:24,915][04272] Updated weights for policy 0, policy_version 58330 (0.0007) [2023-03-06 15:46:25,753][04272] Updated weights for policy 0, policy_version 58340 (0.0006) [2023-03-06 15:46:26,550][04272] Updated weights for policy 0, policy_version 58350 (0.0006) [2023-03-06 15:46:27,357][04272] Updated weights for policy 0, policy_version 58360 (0.0007) [2023-03-06 15:46:28,166][04272] Updated weights for policy 0, policy_version 58370 (0.0006) [2023-03-06 15:46:28,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12629.3, 300 sec: 12621.2). Total num frames: 59780096. Throughput: 0: 12640.6. Samples: 59759699. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:46:28,941][03942] Avg episode reward: [(0, '1291.742')] [2023-03-06 15:46:28,972][04272] Updated weights for policy 0, policy_version 58380 (0.0006) [2023-03-06 15:46:29,793][04272] Updated weights for policy 0, policy_version 58390 (0.0006) [2023-03-06 15:46:30,621][04272] Updated weights for policy 0, policy_version 58400 (0.0006) [2023-03-06 15:46:31,427][04272] Updated weights for policy 0, policy_version 58410 (0.0006) [2023-03-06 15:46:32,229][04272] Updated weights for policy 0, policy_version 58420 (0.0007) [2023-03-06 15:46:33,041][04272] Updated weights for policy 0, policy_version 58430 (0.0006) [2023-03-06 15:46:33,859][04272] Updated weights for policy 0, policy_version 58440 (0.0006) [2023-03-06 15:46:33,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12629.3, 300 sec: 12621.2). Total num frames: 59842560. Throughput: 0: 12630.3. Samples: 59835280. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:46:33,941][03942] Avg episode reward: [(0, '900.395')] [2023-03-06 15:46:34,654][04272] Updated weights for policy 0, policy_version 58450 (0.0007) [2023-03-06 15:46:35,455][04272] Updated weights for policy 0, policy_version 58460 (0.0006) [2023-03-06 15:46:36,263][04272] Updated weights for policy 0, policy_version 58470 (0.0006) [2023-03-06 15:46:37,088][04272] Updated weights for policy 0, policy_version 58480 (0.0006) [2023-03-06 15:46:37,891][04272] Updated weights for policy 0, policy_version 58490 (0.0006) [2023-03-06 15:46:38,692][04272] Updated weights for policy 0, policy_version 58500 (0.0006) [2023-03-06 15:46:38,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12646.4, 300 sec: 12624.7). Total num frames: 59907072. Throughput: 0: 12633.4. Samples: 59873379. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:46:38,941][03942] Avg episode reward: [(0, '944.611')] [2023-03-06 15:46:39,489][04272] Updated weights for policy 0, policy_version 58510 (0.0006) [2023-03-06 15:46:40,307][04272] Updated weights for policy 0, policy_version 58520 (0.0006) [2023-03-06 15:46:41,121][04272] Updated weights for policy 0, policy_version 58530 (0.0006) [2023-03-06 15:46:41,925][04272] Updated weights for policy 0, policy_version 58540 (0.0007) [2023-03-06 15:46:42,717][04272] Updated weights for policy 0, policy_version 58550 (0.0007) [2023-03-06 15:46:43,541][04272] Updated weights for policy 0, policy_version 58560 (0.0006) [2023-03-06 15:46:43,940][03942] Fps is (10 sec: 12800.2, 60 sec: 12663.5, 300 sec: 12628.2). Total num frames: 59970560. Throughput: 0: 12632.9. Samples: 59949476. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:46:43,941][03942] Avg episode reward: [(0, '1113.646')] [2023-03-06 15:46:44,344][04272] Updated weights for policy 0, policy_version 58570 (0.0006) [2023-03-06 15:46:45,142][04272] Updated weights for policy 0, policy_version 58580 (0.0006) [2023-03-06 15:46:45,957][04272] Updated weights for policy 0, policy_version 58590 (0.0006) [2023-03-06 15:46:46,778][04272] Updated weights for policy 0, policy_version 58600 (0.0007) [2023-03-06 15:46:47,583][04272] Updated weights for policy 0, policy_version 58610 (0.0006) [2023-03-06 15:46:48,398][04272] Updated weights for policy 0, policy_version 58620 (0.0006) [2023-03-06 15:46:48,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12646.4, 300 sec: 12624.7). Total num frames: 60033024. Throughput: 0: 12634.8. Samples: 60025389. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:46:48,941][03942] Avg episode reward: [(0, '1179.338')] [2023-03-06 15:46:49,219][04272] Updated weights for policy 0, policy_version 58630 (0.0005) [2023-03-06 15:46:50,042][04272] Updated weights for policy 0, policy_version 58640 (0.0006) [2023-03-06 15:46:50,855][04272] Updated weights for policy 0, policy_version 58650 (0.0006) [2023-03-06 15:46:51,650][04272] Updated weights for policy 0, policy_version 58660 (0.0006) [2023-03-06 15:46:52,473][04272] Updated weights for policy 0, policy_version 58670 (0.0006) [2023-03-06 15:46:53,274][04272] Updated weights for policy 0, policy_version 58680 (0.0006) [2023-03-06 15:46:53,941][03942] Fps is (10 sec: 12595.0, 60 sec: 12646.4, 300 sec: 12628.2). Total num frames: 60096512. Throughput: 0: 12630.2. Samples: 60062993. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:46:53,952][03942] Avg episode reward: [(0, '1186.538')] [2023-03-06 15:46:54,072][04272] Updated weights for policy 0, policy_version 58690 (0.0007) [2023-03-06 15:46:54,896][04272] Updated weights for policy 0, policy_version 58700 (0.0006) [2023-03-06 15:46:55,694][04272] Updated weights for policy 0, policy_version 58710 (0.0006) [2023-03-06 15:46:56,499][04272] Updated weights for policy 0, policy_version 58720 (0.0006) [2023-03-06 15:46:57,301][04272] Updated weights for policy 0, policy_version 58730 (0.0006) [2023-03-06 15:46:58,122][04272] Updated weights for policy 0, policy_version 58740 (0.0007) [2023-03-06 15:46:58,924][04272] Updated weights for policy 0, policy_version 58750 (0.0007) [2023-03-06 15:46:58,940][03942] Fps is (10 sec: 12697.8, 60 sec: 12646.4, 300 sec: 12631.6). Total num frames: 60160000. Throughput: 0: 12638.8. Samples: 60139157. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:46:58,951][03942] Avg episode reward: [(0, '1263.830')] [2023-03-06 15:46:59,741][04272] Updated weights for policy 0, policy_version 58760 (0.0006) [2023-03-06 15:47:00,543][04272] Updated weights for policy 0, policy_version 58770 (0.0006) [2023-03-06 15:47:01,350][04272] Updated weights for policy 0, policy_version 58780 (0.0006) [2023-03-06 15:47:02,172][04272] Updated weights for policy 0, policy_version 58790 (0.0007) [2023-03-06 15:47:02,968][04272] Updated weights for policy 0, policy_version 58800 (0.0006) [2023-03-06 15:47:03,780][04272] Updated weights for policy 0, policy_version 58810 (0.0006) [2023-03-06 15:47:03,940][03942] Fps is (10 sec: 12697.8, 60 sec: 12646.4, 300 sec: 12631.7). Total num frames: 60223488. Throughput: 0: 12646.0. Samples: 60215233. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:47:03,951][03942] Avg episode reward: [(0, '1224.003')] [2023-03-06 15:47:04,586][04272] Updated weights for policy 0, policy_version 58820 (0.0007) [2023-03-06 15:47:05,384][04272] Updated weights for policy 0, policy_version 58830 (0.0006) [2023-03-06 15:47:06,193][04272] Updated weights for policy 0, policy_version 58840 (0.0006) [2023-03-06 15:47:07,009][04272] Updated weights for policy 0, policy_version 58850 (0.0007) [2023-03-06 15:47:07,821][04272] Updated weights for policy 0, policy_version 58860 (0.0006) [2023-03-06 15:47:08,638][04272] Updated weights for policy 0, policy_version 58870 (0.0006) [2023-03-06 15:47:08,941][03942] Fps is (10 sec: 12595.0, 60 sec: 12646.4, 300 sec: 12628.2). Total num frames: 60285952. Throughput: 0: 12648.6. Samples: 60253328. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:47:08,952][03942] Avg episode reward: [(0, '1217.053')] [2023-03-06 15:47:08,955][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000058874_60286976.pth... 
[2023-03-06 15:47:08,988][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000055912_57253888.pth [2023-03-06 15:47:09,458][04272] Updated weights for policy 0, policy_version 58880 (0.0007) [2023-03-06 15:47:10,245][04272] Updated weights for policy 0, policy_version 58890 (0.0006) [2023-03-06 15:47:11,072][04272] Updated weights for policy 0, policy_version 58900 (0.0007) [2023-03-06 15:47:11,869][04272] Updated weights for policy 0, policy_version 58910 (0.0005) [2023-03-06 15:47:12,682][04272] Updated weights for policy 0, policy_version 58920 (0.0006) [2023-03-06 15:47:13,504][04272] Updated weights for policy 0, policy_version 58930 (0.0007) [2023-03-06 15:47:13,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12646.4, 300 sec: 12631.6). Total num frames: 60349440. Throughput: 0: 12650.7. Samples: 60328980. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:47:13,952][03942] Avg episode reward: [(0, '959.746')] [2023-03-06 15:47:14,330][04272] Updated weights for policy 0, policy_version 58940 (0.0006) [2023-03-06 15:47:15,134][04272] Updated weights for policy 0, policy_version 58950 (0.0006) [2023-03-06 15:47:15,937][04272] Updated weights for policy 0, policy_version 58960 (0.0007) [2023-03-06 15:47:16,759][04272] Updated weights for policy 0, policy_version 58970 (0.0006) [2023-03-06 15:47:17,558][04272] Updated weights for policy 0, policy_version 58980 (0.0006) [2023-03-06 15:47:18,379][04272] Updated weights for policy 0, policy_version 58990 (0.0006) [2023-03-06 15:47:18,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12629.3, 300 sec: 12631.6). Total num frames: 60411904. Throughput: 0: 12650.4. Samples: 60404547. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:47:18,951][03942] Avg episode reward: [(0, '1081.527')] [2023-03-06 15:47:19,197][04272] Updated weights for policy 0, policy_version 59000 (0.0006) [2023-03-06 15:47:19,978][04272] Updated weights for policy 0, policy_version 59010 (0.0007) [2023-03-06 15:47:20,799][04272] Updated weights for policy 0, policy_version 59020 (0.0006) [2023-03-06 15:47:21,607][04272] Updated weights for policy 0, policy_version 59030 (0.0006) [2023-03-06 15:47:22,410][04272] Updated weights for policy 0, policy_version 59040 (0.0007) [2023-03-06 15:47:23,226][04272] Updated weights for policy 0, policy_version 59050 (0.0007) [2023-03-06 15:47:23,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12646.4, 300 sec: 12631.6). Total num frames: 60475392. Throughput: 0: 12644.8. Samples: 60442397. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:47:23,952][03942] Avg episode reward: [(0, '868.864')] [2023-03-06 15:47:24,037][04272] Updated weights for policy 0, policy_version 59060 (0.0007) [2023-03-06 15:47:24,851][04272] Updated weights for policy 0, policy_version 59070 (0.0006) [2023-03-06 15:47:25,641][04272] Updated weights for policy 0, policy_version 59080 (0.0006) [2023-03-06 15:47:26,448][04272] Updated weights for policy 0, policy_version 59090 (0.0007) [2023-03-06 15:47:27,274][04272] Updated weights for policy 0, policy_version 59100 (0.0006) [2023-03-06 15:47:28,070][04272] Updated weights for policy 0, policy_version 59110 (0.0006) [2023-03-06 15:47:28,875][04272] Updated weights for policy 0, policy_version 59120 (0.0007) [2023-03-06 15:47:28,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12646.4, 300 sec: 12631.6). Total num frames: 60538880. Throughput: 0: 12646.1. Samples: 60518553. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:47:28,951][03942] Avg episode reward: [(0, '863.656')] [2023-03-06 15:47:29,713][04272] Updated weights for policy 0, policy_version 59130 (0.0006) [2023-03-06 15:47:30,510][04272] Updated weights for policy 0, policy_version 59140 (0.0006) [2023-03-06 15:47:31,322][04272] Updated weights for policy 0, policy_version 59150 (0.0006) [2023-03-06 15:47:32,127][04272] Updated weights for policy 0, policy_version 59160 (0.0006) [2023-03-06 15:47:32,934][04272] Updated weights for policy 0, policy_version 59170 (0.0006) [2023-03-06 15:47:33,766][04272] Updated weights for policy 0, policy_version 59180 (0.0007) [2023-03-06 15:47:33,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12663.5, 300 sec: 12635.1). Total num frames: 60602368. Throughput: 0: 12643.1. Samples: 60594329. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:47:33,952][03942] Avg episode reward: [(0, '958.525')] [2023-03-06 15:47:34,582][04272] Updated weights for policy 0, policy_version 59190 (0.0007) [2023-03-06 15:47:35,398][04272] Updated weights for policy 0, policy_version 59200 (0.0006) [2023-03-06 15:47:36,220][04272] Updated weights for policy 0, policy_version 59210 (0.0006) [2023-03-06 15:47:37,034][04272] Updated weights for policy 0, policy_version 59220 (0.0006) [2023-03-06 15:47:37,838][04272] Updated weights for policy 0, policy_version 59230 (0.0006) [2023-03-06 15:47:38,653][04272] Updated weights for policy 0, policy_version 59240 (0.0007) [2023-03-06 15:47:38,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12629.3, 300 sec: 12631.6). Total num frames: 60664832. Throughput: 0: 12638.1. Samples: 60631705. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:47:38,952][03942] Avg episode reward: [(0, '800.350')] [2023-03-06 15:47:39,481][04272] Updated weights for policy 0, policy_version 59250 (0.0007) [2023-03-06 15:47:40,288][04272] Updated weights for policy 0, policy_version 59260 (0.0006) [2023-03-06 15:47:41,091][04272] Updated weights for policy 0, policy_version 59270 (0.0007) [2023-03-06 15:47:41,893][04272] Updated weights for policy 0, policy_version 59280 (0.0006) [2023-03-06 15:47:42,696][04272] Updated weights for policy 0, policy_version 59290 (0.0006) [2023-03-06 15:47:43,502][04272] Updated weights for policy 0, policy_version 59300 (0.0006) [2023-03-06 15:47:43,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12629.3, 300 sec: 12631.6). Total num frames: 60728320. Throughput: 0: 12631.1. Samples: 60707556. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:47:43,952][03942] Avg episode reward: [(0, '991.064')] [2023-03-06 15:47:44,321][04272] Updated weights for policy 0, policy_version 59310 (0.0006) [2023-03-06 15:47:45,138][04272] Updated weights for policy 0, policy_version 59320 (0.0006) [2023-03-06 15:47:45,916][04272] Updated weights for policy 0, policy_version 59330 (0.0006) [2023-03-06 15:47:46,724][04272] Updated weights for policy 0, policy_version 59340 (0.0006) [2023-03-06 15:47:47,537][04272] Updated weights for policy 0, policy_version 59350 (0.0006) [2023-03-06 15:47:48,349][04272] Updated weights for policy 0, policy_version 59360 (0.0006) [2023-03-06 15:47:48,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12646.4, 300 sec: 12635.1). Total num frames: 60791808. Throughput: 0: 12632.7. Samples: 60783708. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:47:48,952][03942] Avg episode reward: [(0, '1093.161')] [2023-03-06 15:47:49,173][04272] Updated weights for policy 0, policy_version 59370 (0.0006) [2023-03-06 15:47:49,977][04272] Updated weights for policy 0, policy_version 59380 (0.0007) [2023-03-06 15:47:50,788][04272] Updated weights for policy 0, policy_version 59390 (0.0007) [2023-03-06 15:47:51,610][04272] Updated weights for policy 0, policy_version 59400 (0.0007) [2023-03-06 15:47:52,412][04272] Updated weights for policy 0, policy_version 59410 (0.0006) [2023-03-06 15:47:53,231][04272] Updated weights for policy 0, policy_version 59420 (0.0007) [2023-03-06 15:47:53,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.4, 300 sec: 12631.6). Total num frames: 60854272. Throughput: 0: 12628.4. Samples: 60821607. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:47:53,941][03942] Avg episode reward: [(0, '1045.777')] [2023-03-06 15:47:54,037][04272] Updated weights for policy 0, policy_version 59430 (0.0007) [2023-03-06 15:47:54,857][04272] Updated weights for policy 0, policy_version 59440 (0.0006) [2023-03-06 15:47:55,657][04272] Updated weights for policy 0, policy_version 59450 (0.0006) [2023-03-06 15:47:56,461][04272] Updated weights for policy 0, policy_version 59460 (0.0006) [2023-03-06 15:47:57,277][04272] Updated weights for policy 0, policy_version 59470 (0.0007) [2023-03-06 15:47:58,079][04272] Updated weights for policy 0, policy_version 59480 (0.0007) [2023-03-06 15:47:58,889][04272] Updated weights for policy 0, policy_version 59490 (0.0006) [2023-03-06 15:47:58,940][03942] Fps is (10 sec: 12595.4, 60 sec: 12629.3, 300 sec: 12635.1). Total num frames: 60917760. Throughput: 0: 12628.6. Samples: 60897269. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:47:58,941][03942] Avg episode reward: [(0, '979.462')] [2023-03-06 15:47:59,711][04272] Updated weights for policy 0, policy_version 59500 (0.0007) [2023-03-06 15:48:00,521][04272] Updated weights for policy 0, policy_version 59510 (0.0006) [2023-03-06 15:48:01,333][04272] Updated weights for policy 0, policy_version 59520 (0.0006) [2023-03-06 15:48:02,127][04272] Updated weights for policy 0, policy_version 59530 (0.0007) [2023-03-06 15:48:02,940][04272] Updated weights for policy 0, policy_version 59540 (0.0006) [2023-03-06 15:48:03,754][04272] Updated weights for policy 0, policy_version 59550 (0.0006) [2023-03-06 15:48:03,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12629.3, 300 sec: 12635.1). Total num frames: 60981248. Throughput: 0: 12636.4. Samples: 60973184. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:48:03,941][03942] Avg episode reward: [(0, '1092.544')] [2023-03-06 15:48:04,586][04272] Updated weights for policy 0, policy_version 59560 (0.0006) [2023-03-06 15:48:05,383][04272] Updated weights for policy 0, policy_version 59570 (0.0006) [2023-03-06 15:48:06,179][04272] Updated weights for policy 0, policy_version 59580 (0.0007) [2023-03-06 15:48:06,998][04272] Updated weights for policy 0, policy_version 59590 (0.0006) [2023-03-06 15:48:07,789][04272] Updated weights for policy 0, policy_version 59600 (0.0006) [2023-03-06 15:48:08,614][04272] Updated weights for policy 0, policy_version 59610 (0.0005) [2023-03-06 15:48:08,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12629.3, 300 sec: 12635.1). Total num frames: 61043712. Throughput: 0: 12637.0. Samples: 61011064. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:48:08,941][03942] Avg episode reward: [(0, '1224.467')] [2023-03-06 15:48:09,441][04272] Updated weights for policy 0, policy_version 59620 (0.0006) [2023-03-06 15:48:10,238][04272] Updated weights for policy 0, policy_version 59630 (0.0006) [2023-03-06 15:48:11,063][04272] Updated weights for policy 0, policy_version 59640 (0.0006) [2023-03-06 15:48:11,865][04272] Updated weights for policy 0, policy_version 59650 (0.0007) [2023-03-06 15:48:12,688][04272] Updated weights for policy 0, policy_version 59660 (0.0006) [2023-03-06 15:48:13,494][04272] Updated weights for policy 0, policy_version 59670 (0.0006) [2023-03-06 15:48:13,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12629.3, 300 sec: 12635.1). Total num frames: 61107200. Throughput: 0: 12624.7. Samples: 61086663. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:48:13,941][03942] Avg episode reward: [(0, '1319.397')] [2023-03-06 15:48:14,289][04272] Updated weights for policy 0, policy_version 59680 (0.0006) [2023-03-06 15:48:15,111][04272] Updated weights for policy 0, policy_version 59690 (0.0006) [2023-03-06 15:48:15,923][04272] Updated weights for policy 0, policy_version 59700 (0.0007) [2023-03-06 15:48:16,740][04272] Updated weights for policy 0, policy_version 59710 (0.0006) [2023-03-06 15:48:17,552][04272] Updated weights for policy 0, policy_version 59720 (0.0007) [2023-03-06 15:48:18,361][04272] Updated weights for policy 0, policy_version 59730 (0.0006) [2023-03-06 15:48:18,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12646.4, 300 sec: 12635.1). Total num frames: 61170688. Throughput: 0: 12628.0. Samples: 61162588. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:48:18,941][03942] Avg episode reward: [(0, '1308.473')] [2023-03-06 15:48:19,177][04272] Updated weights for policy 0, policy_version 59740 (0.0006) [2023-03-06 15:48:19,983][04272] Updated weights for policy 0, policy_version 59750 (0.0006) [2023-03-06 15:48:20,801][04272] Updated weights for policy 0, policy_version 59760 (0.0006) [2023-03-06 15:48:21,621][04272] Updated weights for policy 0, policy_version 59770 (0.0006) [2023-03-06 15:48:22,424][04272] Updated weights for policy 0, policy_version 59780 (0.0006) [2023-03-06 15:48:23,248][04272] Updated weights for policy 0, policy_version 59790 (0.0006) [2023-03-06 15:48:23,941][03942] Fps is (10 sec: 12595.0, 60 sec: 12629.3, 300 sec: 12635.1). Total num frames: 61233152. Throughput: 0: 12636.2. Samples: 61200334. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:48:23,941][03942] Avg episode reward: [(0, '1197.827')] [2023-03-06 15:48:24,045][04272] Updated weights for policy 0, policy_version 59800 (0.0006) [2023-03-06 15:48:24,870][04272] Updated weights for policy 0, policy_version 59810 (0.0007) [2023-03-06 15:48:25,682][04272] Updated weights for policy 0, policy_version 59820 (0.0006) [2023-03-06 15:48:26,470][04272] Updated weights for policy 0, policy_version 59830 (0.0007) [2023-03-06 15:48:27,289][04272] Updated weights for policy 0, policy_version 59840 (0.0006) [2023-03-06 15:48:28,111][04272] Updated weights for policy 0, policy_version 59850 (0.0006) [2023-03-06 15:48:28,919][04272] Updated weights for policy 0, policy_version 59860 (0.0006) [2023-03-06 15:48:28,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12629.3, 300 sec: 12635.1). Total num frames: 61296640. Throughput: 0: 12632.9. Samples: 61276037. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:48:28,941][03942] Avg episode reward: [(0, '1263.229')] [2023-03-06 15:48:29,728][04272] Updated weights for policy 0, policy_version 59870 (0.0007) [2023-03-06 15:48:30,530][04272] Updated weights for policy 0, policy_version 59880 (0.0006) [2023-03-06 15:48:31,359][04272] Updated weights for policy 0, policy_version 59890 (0.0006) [2023-03-06 15:48:32,160][04272] Updated weights for policy 0, policy_version 59900 (0.0007) [2023-03-06 15:48:32,978][04272] Updated weights for policy 0, policy_version 59910 (0.0006) [2023-03-06 15:48:33,784][04272] Updated weights for policy 0, policy_version 59920 (0.0007) [2023-03-06 15:48:33,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12635.1). Total num frames: 61359104. Throughput: 0: 12618.1. Samples: 61351523. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:48:33,941][03942] Avg episode reward: [(0, '1324.811')] [2023-03-06 15:48:34,605][04272] Updated weights for policy 0, policy_version 59930 (0.0006) [2023-03-06 15:48:35,406][04272] Updated weights for policy 0, policy_version 59940 (0.0007) [2023-03-06 15:48:36,214][04272] Updated weights for policy 0, policy_version 59950 (0.0007) [2023-03-06 15:48:37,034][04272] Updated weights for policy 0, policy_version 59960 (0.0007) [2023-03-06 15:48:37,850][04272] Updated weights for policy 0, policy_version 59970 (0.0007) [2023-03-06 15:48:38,676][04272] Updated weights for policy 0, policy_version 59980 (0.0006) [2023-03-06 15:48:38,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12629.3, 300 sec: 12635.1). Total num frames: 61422592. Throughput: 0: 12619.6. Samples: 61389492. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:48:38,941][03942] Avg episode reward: [(0, '1221.574')] [2023-03-06 15:48:39,485][04272] Updated weights for policy 0, policy_version 59990 (0.0006) [2023-03-06 15:48:40,298][04272] Updated weights for policy 0, policy_version 60000 (0.0006) [2023-03-06 15:48:41,115][04272] Updated weights for policy 0, policy_version 60010 (0.0007) [2023-03-06 15:48:41,922][04272] Updated weights for policy 0, policy_version 60020 (0.0007) [2023-03-06 15:48:42,737][04272] Updated weights for policy 0, policy_version 60030 (0.0006) [2023-03-06 15:48:43,559][04272] Updated weights for policy 0, policy_version 60040 (0.0007) [2023-03-06 15:48:43,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12631.6). Total num frames: 61485056. Throughput: 0: 12613.4. Samples: 61464872. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:48:43,941][03942] Avg episode reward: [(0, '1197.716')] [2023-03-06 15:48:44,365][04272] Updated weights for policy 0, policy_version 60050 (0.0006) [2023-03-06 15:48:45,158][04272] Updated weights for policy 0, policy_version 60060 (0.0006) [2023-03-06 15:48:45,971][04272] Updated weights for policy 0, policy_version 60070 (0.0007) [2023-03-06 15:48:46,819][04272] Updated weights for policy 0, policy_version 60080 (0.0007) [2023-03-06 15:48:47,618][04272] Updated weights for policy 0, policy_version 60090 (0.0006) [2023-03-06 15:48:48,425][04272] Updated weights for policy 0, policy_version 60100 (0.0006) [2023-03-06 15:48:48,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12635.1). Total num frames: 61548544. Throughput: 0: 12607.9. Samples: 61540540. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:48:48,941][03942] Avg episode reward: [(0, '1252.939')] [2023-03-06 15:48:49,218][04272] Updated weights for policy 0, policy_version 60110 (0.0007) [2023-03-06 15:48:50,033][04272] Updated weights for policy 0, policy_version 60120 (0.0007) [2023-03-06 15:48:50,838][04272] Updated weights for policy 0, policy_version 60130 (0.0006) [2023-03-06 15:48:51,659][04272] Updated weights for policy 0, policy_version 60140 (0.0007) [2023-03-06 15:48:52,474][04272] Updated weights for policy 0, policy_version 60150 (0.0007) [2023-03-06 15:48:53,282][04272] Updated weights for policy 0, policy_version 60160 (0.0006) [2023-03-06 15:48:53,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12635.1). Total num frames: 61612032. Throughput: 0: 12612.2. Samples: 61578612. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:48:53,941][03942] Avg episode reward: [(0, '1250.509')] [2023-03-06 15:48:54,102][04272] Updated weights for policy 0, policy_version 60170 (0.0006) [2023-03-06 15:48:54,905][04272] Updated weights for policy 0, policy_version 60180 (0.0006) [2023-03-06 15:48:55,720][04272] Updated weights for policy 0, policy_version 60190 (0.0006) [2023-03-06 15:48:56,530][04272] Updated weights for policy 0, policy_version 60200 (0.0006) [2023-03-06 15:48:57,338][04272] Updated weights for policy 0, policy_version 60210 (0.0006) [2023-03-06 15:48:58,145][04272] Updated weights for policy 0, policy_version 60220 (0.0006) [2023-03-06 15:48:58,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12635.1). Total num frames: 61674496. Throughput: 0: 12614.3. Samples: 61654306. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:48:58,941][03942] Avg episode reward: [(0, '1272.212')] [2023-03-06 15:48:58,953][04272] Updated weights for policy 0, policy_version 60230 (0.0007) [2023-03-06 15:48:59,767][04272] Updated weights for policy 0, policy_version 60240 (0.0006) [2023-03-06 15:49:00,572][04272] Updated weights for policy 0, policy_version 60250 (0.0007) [2023-03-06 15:49:01,391][04272] Updated weights for policy 0, policy_version 60260 (0.0007) [2023-03-06 15:49:02,203][04272] Updated weights for policy 0, policy_version 60270 (0.0006) [2023-03-06 15:49:03,035][04272] Updated weights for policy 0, policy_version 60280 (0.0006) [2023-03-06 15:49:03,855][04272] Updated weights for policy 0, policy_version 60290 (0.0006) [2023-03-06 15:49:03,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12635.1). Total num frames: 61737984. Throughput: 0: 12609.2. Samples: 61729999. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:49:03,941][03942] Avg episode reward: [(0, '1328.482')] [2023-03-06 15:49:04,658][04272] Updated weights for policy 0, policy_version 60300 (0.0007) [2023-03-06 15:49:05,465][04272] Updated weights for policy 0, policy_version 60310 (0.0006) [2023-03-06 15:49:06,282][04272] Updated weights for policy 0, policy_version 60320 (0.0006) [2023-03-06 15:49:07,099][04272] Updated weights for policy 0, policy_version 60330 (0.0007) [2023-03-06 15:49:07,917][04272] Updated weights for policy 0, policy_version 60340 (0.0007) [2023-03-06 15:49:08,715][04272] Updated weights for policy 0, policy_version 60350 (0.0006) [2023-03-06 15:49:08,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12631.7). Total num frames: 61800448. Throughput: 0: 12607.7. Samples: 61767680. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:49:08,941][03942] Avg episode reward: [(0, '1202.143')] [2023-03-06 15:49:08,948][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000060353_61801472.pth... [2023-03-06 15:49:08,979][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000057393_58770432.pth [2023-03-06 15:49:09,538][04272] Updated weights for policy 0, policy_version 60360 (0.0006) [2023-03-06 15:49:10,346][04272] Updated weights for policy 0, policy_version 60370 (0.0006) [2023-03-06 15:49:11,153][04272] Updated weights for policy 0, policy_version 60380 (0.0006) [2023-03-06 15:49:11,970][04272] Updated weights for policy 0, policy_version 60390 (0.0007) [2023-03-06 15:49:12,783][04272] Updated weights for policy 0, policy_version 60400 (0.0006) [2023-03-06 15:49:13,596][04272] Updated weights for policy 0, policy_version 60410 (0.0009) [2023-03-06 15:49:13,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12631.7). Total num frames: 61863936. Throughput: 0: 12608.1. Samples: 61843400. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:49:13,941][03942] Avg episode reward: [(0, '1306.771')] [2023-03-06 15:49:14,413][04272] Updated weights for policy 0, policy_version 60420 (0.0007) [2023-03-06 15:49:15,221][04272] Updated weights for policy 0, policy_version 60430 (0.0007) [2023-03-06 15:49:16,029][04272] Updated weights for policy 0, policy_version 60440 (0.0006) [2023-03-06 15:49:16,851][04272] Updated weights for policy 0, policy_version 60450 (0.0006) [2023-03-06 15:49:17,662][04272] Updated weights for policy 0, policy_version 60460 (0.0007) [2023-03-06 15:49:18,464][04272] Updated weights for policy 0, policy_version 60470 (0.0007) [2023-03-06 15:49:18,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12631.6). Total num frames: 61926400. Throughput: 0: 12607.4. Samples: 61918857. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:49:18,941][03942] Avg episode reward: [(0, '1236.172')] [2023-03-06 15:49:19,277][04272] Updated weights for policy 0, policy_version 60480 (0.0006) [2023-03-06 15:49:20,086][04272] Updated weights for policy 0, policy_version 60490 (0.0007) [2023-03-06 15:49:20,907][04272] Updated weights for policy 0, policy_version 60500 (0.0006) [2023-03-06 15:49:21,715][04272] Updated weights for policy 0, policy_version 60510 (0.0006) [2023-03-06 15:49:22,528][04272] Updated weights for policy 0, policy_version 60520 (0.0006) [2023-03-06 15:49:23,359][04272] Updated weights for policy 0, policy_version 60530 (0.0006) [2023-03-06 15:49:23,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12631.6). Total num frames: 61989888. Throughput: 0: 12604.7. Samples: 61956702. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:49:23,952][03942] Avg episode reward: [(0, '1270.721')] [2023-03-06 15:49:24,169][04272] Updated weights for policy 0, policy_version 60540 (0.0007) [2023-03-06 15:49:24,962][04272] Updated weights for policy 0, policy_version 60550 (0.0006) [2023-03-06 15:49:25,776][04272] Updated weights for policy 0, policy_version 60560 (0.0006) [2023-03-06 15:49:26,582][04272] Updated weights for policy 0, policy_version 60570 (0.0006) [2023-03-06 15:49:27,398][04272] Updated weights for policy 0, policy_version 60580 (0.0007) [2023-03-06 15:49:28,230][04272] Updated weights for policy 0, policy_version 60590 (0.0006) [2023-03-06 15:49:28,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12628.2). Total num frames: 62052352. Throughput: 0: 12610.9. Samples: 62032363. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:49:28,951][03942] Avg episode reward: [(0, '1196.880')] [2023-03-06 15:49:29,043][04272] Updated weights for policy 0, policy_version 60600 (0.0006) [2023-03-06 15:49:29,857][04272] Updated weights for policy 0, policy_version 60610 (0.0006) [2023-03-06 15:49:30,677][04272] Updated weights for policy 0, policy_version 60620 (0.0006) [2023-03-06 15:49:31,480][04272] Updated weights for policy 0, policy_version 60630 (0.0007) [2023-03-06 15:49:32,301][04272] Updated weights for policy 0, policy_version 60640 (0.0006) [2023-03-06 15:49:33,095][04272] Updated weights for policy 0, policy_version 60650 (0.0006) [2023-03-06 15:49:33,931][04272] Updated weights for policy 0, policy_version 60660 (0.0007) [2023-03-06 15:49:33,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12628.2). Total num frames: 62115840. Throughput: 0: 12605.0. Samples: 62107764. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:49:33,952][03942] Avg episode reward: [(0, '1222.750')] [2023-03-06 15:49:34,730][04272] Updated weights for policy 0, policy_version 60670 (0.0006) [2023-03-06 15:49:35,545][04272] Updated weights for policy 0, policy_version 60680 (0.0007) [2023-03-06 15:49:36,370][04272] Updated weights for policy 0, policy_version 60690 (0.0006) [2023-03-06 15:49:37,176][04272] Updated weights for policy 0, policy_version 60700 (0.0006) [2023-03-06 15:49:37,984][04272] Updated weights for policy 0, policy_version 60710 (0.0006) [2023-03-06 15:49:38,788][04272] Updated weights for policy 0, policy_version 60720 (0.0007) [2023-03-06 15:49:38,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12624.7). Total num frames: 62178304. Throughput: 0: 12599.3. Samples: 62145582. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:49:38,952][03942] Avg episode reward: [(0, '1197.959')] [2023-03-06 15:49:39,614][04272] Updated weights for policy 0, policy_version 60730 (0.0006) [2023-03-06 15:49:40,428][04272] Updated weights for policy 0, policy_version 60740 (0.0007) [2023-03-06 15:49:41,234][04272] Updated weights for policy 0, policy_version 60750 (0.0006) [2023-03-06 15:49:42,062][04272] Updated weights for policy 0, policy_version 60760 (0.0007) [2023-03-06 15:49:42,877][04272] Updated weights for policy 0, policy_version 60770 (0.0006) [2023-03-06 15:49:43,686][04272] Updated weights for policy 0, policy_version 60780 (0.0007) [2023-03-06 15:49:43,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12624.7). Total num frames: 62241792. Throughput: 0: 12592.3. Samples: 62220961. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:49:43,941][03942] Avg episode reward: [(0, '1326.329')] [2023-03-06 15:49:44,494][04272] Updated weights for policy 0, policy_version 60790 (0.0006) [2023-03-06 15:49:45,300][04272] Updated weights for policy 0, policy_version 60800 (0.0006) [2023-03-06 15:49:46,106][04272] Updated weights for policy 0, policy_version 60810 (0.0006) [2023-03-06 15:49:46,927][04272] Updated weights for policy 0, policy_version 60820 (0.0006) [2023-03-06 15:49:47,732][04272] Updated weights for policy 0, policy_version 60830 (0.0006) [2023-03-06 15:49:48,547][04272] Updated weights for policy 0, policy_version 60840 (0.0006) [2023-03-06 15:49:48,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12621.2). Total num frames: 62304256. Throughput: 0: 12594.0. Samples: 62296730. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:49:48,941][03942] Avg episode reward: [(0, '1132.304')] [2023-03-06 15:49:49,357][04272] Updated weights for policy 0, policy_version 60850 (0.0006) [2023-03-06 15:49:50,165][04272] Updated weights for policy 0, policy_version 60860 (0.0006) [2023-03-06 15:49:50,986][04272] Updated weights for policy 0, policy_version 60870 (0.0006) [2023-03-06 15:49:51,788][04272] Updated weights for policy 0, policy_version 60880 (0.0007) [2023-03-06 15:49:52,608][04272] Updated weights for policy 0, policy_version 60890 (0.0006) [2023-03-06 15:49:53,427][04272] Updated weights for policy 0, policy_version 60900 (0.0007) [2023-03-06 15:49:53,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12624.7). Total num frames: 62367744. Throughput: 0: 12596.6. Samples: 62334529. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:49:53,941][03942] Avg episode reward: [(0, '1302.859')] [2023-03-06 15:49:54,254][04272] Updated weights for policy 0, policy_version 60910 (0.0006) [2023-03-06 15:49:55,052][04272] Updated weights for policy 0, policy_version 60920 (0.0006) [2023-03-06 15:49:55,869][04272] Updated weights for policy 0, policy_version 60930 (0.0007) [2023-03-06 15:49:56,681][04272] Updated weights for policy 0, policy_version 60940 (0.0007) [2023-03-06 15:49:57,489][04272] Updated weights for policy 0, policy_version 60950 (0.0006) [2023-03-06 15:49:58,289][04272] Updated weights for policy 0, policy_version 60960 (0.0007) [2023-03-06 15:49:58,941][03942] Fps is (10 sec: 12697.7, 60 sec: 12612.3, 300 sec: 12624.7). Total num frames: 62431232. Throughput: 0: 12592.1. Samples: 62410044. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:49:58,941][03942] Avg episode reward: [(0, '1294.729')] [2023-03-06 15:49:59,110][04272] Updated weights for policy 0, policy_version 60970 (0.0006) [2023-03-06 15:49:59,928][04272] Updated weights for policy 0, policy_version 60980 (0.0007) [2023-03-06 15:50:00,735][04272] Updated weights for policy 0, policy_version 60990 (0.0007) [2023-03-06 15:50:01,556][04272] Updated weights for policy 0, policy_version 61000 (0.0007) [2023-03-06 15:50:02,367][04272] Updated weights for policy 0, policy_version 61010 (0.0006) [2023-03-06 15:50:03,185][04272] Updated weights for policy 0, policy_version 61020 (0.0006) [2023-03-06 15:50:03,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12621.2). Total num frames: 62493696. Throughput: 0: 12599.3. Samples: 62485826. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:50:03,941][03942] Avg episode reward: [(0, '1187.881')] [2023-03-06 15:50:04,005][04272] Updated weights for policy 0, policy_version 61030 (0.0007) [2023-03-06 15:50:04,799][04272] Updated weights for policy 0, policy_version 61040 (0.0006) [2023-03-06 15:50:05,619][04272] Updated weights for policy 0, policy_version 61050 (0.0005) [2023-03-06 15:50:06,430][04272] Updated weights for policy 0, policy_version 61060 (0.0007) [2023-03-06 15:50:07,247][04272] Updated weights for policy 0, policy_version 61070 (0.0006) [2023-03-06 15:50:08,048][04272] Updated weights for policy 0, policy_version 61080 (0.0007) [2023-03-06 15:50:08,862][04272] Updated weights for policy 0, policy_version 61090 (0.0006) [2023-03-06 15:50:08,941][03942] Fps is (10 sec: 12492.8, 60 sec: 12595.2, 300 sec: 12621.2). Total num frames: 62556160. Throughput: 0: 12599.2. Samples: 62523666. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:50:08,941][03942] Avg episode reward: [(0, '1345.217')] [2023-03-06 15:50:09,676][04272] Updated weights for policy 0, policy_version 61100 (0.0006) [2023-03-06 15:50:10,490][04272] Updated weights for policy 0, policy_version 61110 (0.0006) [2023-03-06 15:50:11,327][04272] Updated weights for policy 0, policy_version 61120 (0.0006) [2023-03-06 15:50:12,144][04272] Updated weights for policy 0, policy_version 61130 (0.0006) [2023-03-06 15:50:12,945][04272] Updated weights for policy 0, policy_version 61140 (0.0005) [2023-03-06 15:50:13,755][04272] Updated weights for policy 0, policy_version 61150 (0.0006) [2023-03-06 15:50:13,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12621.2). Total num frames: 62619648. Throughput: 0: 12589.1. Samples: 62598873. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:50:13,941][03942] Avg episode reward: [(0, '1290.087')] [2023-03-06 15:50:14,582][04272] Updated weights for policy 0, policy_version 61160 (0.0006) [2023-03-06 15:50:15,404][04272] Updated weights for policy 0, policy_version 61170 (0.0006) [2023-03-06 15:50:16,229][04272] Updated weights for policy 0, policy_version 61180 (0.0007) [2023-03-06 15:50:17,045][04272] Updated weights for policy 0, policy_version 61190 (0.0006) [2023-03-06 15:50:17,866][04272] Updated weights for policy 0, policy_version 61200 (0.0006) [2023-03-06 15:50:18,693][04272] Updated weights for policy 0, policy_version 61210 (0.0006) [2023-03-06 15:50:18,940][03942] Fps is (10 sec: 12492.8, 60 sec: 12578.2, 300 sec: 12617.8). Total num frames: 62681088. Throughput: 0: 12581.3. Samples: 62673923. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:50:18,941][03942] Avg episode reward: [(0, '1325.664')] [2023-03-06 15:50:19,498][04272] Updated weights for policy 0, policy_version 61220 (0.0006) [2023-03-06 15:50:20,309][04272] Updated weights for policy 0, policy_version 61230 (0.0005) [2023-03-06 15:50:21,122][04272] Updated weights for policy 0, policy_version 61240 (0.0006) [2023-03-06 15:50:21,953][04272] Updated weights for policy 0, policy_version 61250 (0.0006) [2023-03-06 15:50:22,766][04272] Updated weights for policy 0, policy_version 61260 (0.0007) [2023-03-06 15:50:23,590][04272] Updated weights for policy 0, policy_version 61270 (0.0006) [2023-03-06 15:50:23,941][03942] Fps is (10 sec: 12492.8, 60 sec: 12578.1, 300 sec: 12617.8). Total num frames: 62744576. Throughput: 0: 12578.8. Samples: 62711630. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:50:23,941][03942] Avg episode reward: [(0, '1363.457')] [2023-03-06 15:50:24,417][04272] Updated weights for policy 0, policy_version 61280 (0.0006) [2023-03-06 15:50:25,221][04272] Updated weights for policy 0, policy_version 61290 (0.0007) [2023-03-06 15:50:26,027][04272] Updated weights for policy 0, policy_version 61300 (0.0007) [2023-03-06 15:50:26,856][04272] Updated weights for policy 0, policy_version 61310 (0.0007) [2023-03-06 15:50:27,673][04272] Updated weights for policy 0, policy_version 61320 (0.0007) [2023-03-06 15:50:28,499][04272] Updated weights for policy 0, policy_version 61330 (0.0007) [2023-03-06 15:50:28,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12578.1, 300 sec: 12617.8). Total num frames: 62807040. Throughput: 0: 12572.3. Samples: 62786715. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:50:28,941][03942] Avg episode reward: [(0, '1366.711')] [2023-03-06 15:50:29,286][04272] Updated weights for policy 0, policy_version 61340 (0.0006) [2023-03-06 15:50:30,096][04272] Updated weights for policy 0, policy_version 61350 (0.0006) [2023-03-06 15:50:30,905][04272] Updated weights for policy 0, policy_version 61360 (0.0006) [2023-03-06 15:50:31,735][04272] Updated weights for policy 0, policy_version 61370 (0.0007) [2023-03-06 15:50:32,539][04272] Updated weights for policy 0, policy_version 61380 (0.0006) [2023-03-06 15:50:33,369][04272] Updated weights for policy 0, policy_version 61390 (0.0006) [2023-03-06 15:50:33,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12578.1, 300 sec: 12617.8). Total num frames: 62870528. Throughput: 0: 12569.3. Samples: 62862347. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:50:33,941][03942] Avg episode reward: [(0, '1268.461')] [2023-03-06 15:50:34,169][04272] Updated weights for policy 0, policy_version 61400 (0.0007) [2023-03-06 15:50:34,977][04272] Updated weights for policy 0, policy_version 61410 (0.0007) [2023-03-06 15:50:35,793][04272] Updated weights for policy 0, policy_version 61420 (0.0006) [2023-03-06 15:50:36,581][04272] Updated weights for policy 0, policy_version 61430 (0.0007) [2023-03-06 15:50:37,422][04272] Updated weights for policy 0, policy_version 61440 (0.0006) [2023-03-06 15:50:38,212][04272] Updated weights for policy 0, policy_version 61450 (0.0006) [2023-03-06 15:50:38,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12578.1, 300 sec: 12617.8). Total num frames: 62932992. Throughput: 0: 12570.8. Samples: 62900217. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:50:38,941][03942] Avg episode reward: [(0, '1261.373')] [2023-03-06 15:50:39,037][04272] Updated weights for policy 0, policy_version 61460 (0.0007) [2023-03-06 15:50:39,864][04272] Updated weights for policy 0, policy_version 61470 (0.0006) [2023-03-06 15:50:40,655][04272] Updated weights for policy 0, policy_version 61480 (0.0006) [2023-03-06 15:50:41,462][04272] Updated weights for policy 0, policy_version 61490 (0.0006) [2023-03-06 15:50:42,300][04272] Updated weights for policy 0, policy_version 61500 (0.0006) [2023-03-06 15:50:43,099][04272] Updated weights for policy 0, policy_version 61510 (0.0007) [2023-03-06 15:50:43,914][04272] Updated weights for policy 0, policy_version 61520 (0.0006) [2023-03-06 15:50:43,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12578.1, 300 sec: 12617.8). Total num frames: 62996480. Throughput: 0: 12575.1. Samples: 62975922. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:50:43,941][03942] Avg episode reward: [(0, '1270.943')] [2023-03-06 15:50:44,711][04272] Updated weights for policy 0, policy_version 61530 (0.0006) [2023-03-06 15:50:45,538][04272] Updated weights for policy 0, policy_version 61540 (0.0005) [2023-03-06 15:50:46,334][04272] Updated weights for policy 0, policy_version 61550 (0.0006) [2023-03-06 15:50:47,158][04272] Updated weights for policy 0, policy_version 61560 (0.0006) [2023-03-06 15:50:47,984][04272] Updated weights for policy 0, policy_version 61570 (0.0007) [2023-03-06 15:50:48,789][04272] Updated weights for policy 0, policy_version 61580 (0.0006) [2023-03-06 15:50:48,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12578.2, 300 sec: 12614.3). Total num frames: 63058944. Throughput: 0: 12567.7. Samples: 63051373. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:50:48,941][03942] Avg episode reward: [(0, '1227.208')] [2023-03-06 15:50:49,594][04272] Updated weights for policy 0, policy_version 61590 (0.0006) [2023-03-06 15:50:50,401][04272] Updated weights for policy 0, policy_version 61600 (0.0007) [2023-03-06 15:50:51,213][04272] Updated weights for policy 0, policy_version 61610 (0.0006) [2023-03-06 15:50:52,029][04272] Updated weights for policy 0, policy_version 61620 (0.0006) [2023-03-06 15:50:52,853][04272] Updated weights for policy 0, policy_version 61630 (0.0006) [2023-03-06 15:50:53,663][04272] Updated weights for policy 0, policy_version 61640 (0.0006) [2023-03-06 15:50:53,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12578.1, 300 sec: 12614.3). Total num frames: 63122432. Throughput: 0: 12570.2. Samples: 63089324. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:50:53,941][03942] Avg episode reward: [(0, '1213.156')] [2023-03-06 15:50:54,473][04272] Updated weights for policy 0, policy_version 61650 (0.0006) [2023-03-06 15:50:55,280][04272] Updated weights for policy 0, policy_version 61660 (0.0007) [2023-03-06 15:50:56,101][04272] Updated weights for policy 0, policy_version 61670 (0.0007) [2023-03-06 15:50:56,909][04272] Updated weights for policy 0, policy_version 61680 (0.0007) [2023-03-06 15:50:57,726][04272] Updated weights for policy 0, policy_version 61690 (0.0006) [2023-03-06 15:50:58,534][04272] Updated weights for policy 0, policy_version 61700 (0.0006) [2023-03-06 15:50:58,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12578.1, 300 sec: 12614.3). Total num frames: 63185920. Throughput: 0: 12581.3. Samples: 63165033. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:50:58,952][03942] Avg episode reward: [(0, '1175.694')] [2023-03-06 15:50:59,350][04272] Updated weights for policy 0, policy_version 61710 (0.0006) [2023-03-06 15:51:00,170][04272] Updated weights for policy 0, policy_version 61720 (0.0006) [2023-03-06 15:51:00,989][04272] Updated weights for policy 0, policy_version 61730 (0.0006) [2023-03-06 15:51:01,781][04272] Updated weights for policy 0, policy_version 61740 (0.0006) [2023-03-06 15:51:02,590][04272] Updated weights for policy 0, policy_version 61750 (0.0006) [2023-03-06 15:51:03,406][04272] Updated weights for policy 0, policy_version 61760 (0.0006) [2023-03-06 15:51:03,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12578.1, 300 sec: 12614.3). Total num frames: 63248384. Throughput: 0: 12596.0. Samples: 63240741. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:51:03,951][03942] Avg episode reward: [(0, '1299.272')] [2023-03-06 15:51:04,203][04272] Updated weights for policy 0, policy_version 61770 (0.0006) [2023-03-06 15:51:04,998][04272] Updated weights for policy 0, policy_version 61780 (0.0006) [2023-03-06 15:51:05,825][04272] Updated weights for policy 0, policy_version 61790 (0.0006) [2023-03-06 15:51:06,611][04272] Updated weights for policy 0, policy_version 61800 (0.0006) [2023-03-06 15:51:07,450][04272] Updated weights for policy 0, policy_version 61810 (0.0007) [2023-03-06 15:51:08,270][04272] Updated weights for policy 0, policy_version 61820 (0.0008) [2023-03-06 15:51:08,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12614.3). Total num frames: 63311872. Throughput: 0: 12604.0. Samples: 63278808. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:51:08,952][03942] Avg episode reward: [(0, '1184.575')] [2023-03-06 15:51:08,955][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000061828_63311872.pth... 
[2023-03-06 15:51:08,987][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000058874_60286976.pth [2023-03-06 15:51:09,079][04272] Updated weights for policy 0, policy_version 61830 (0.0006) [2023-03-06 15:51:09,895][04272] Updated weights for policy 0, policy_version 61840 (0.0006) [2023-03-06 15:51:10,710][04272] Updated weights for policy 0, policy_version 61850 (0.0006) [2023-03-06 15:51:11,528][04272] Updated weights for policy 0, policy_version 61860 (0.0008) [2023-03-06 15:51:12,321][04272] Updated weights for policy 0, policy_version 61870 (0.0006) [2023-03-06 15:51:13,134][04272] Updated weights for policy 0, policy_version 61880 (0.0006) [2023-03-06 15:51:13,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12578.1, 300 sec: 12610.8). Total num frames: 63374336. Throughput: 0: 12611.7. Samples: 63354242. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:51:13,952][03942] Avg episode reward: [(0, '1215.220')] [2023-03-06 15:51:13,957][04272] Updated weights for policy 0, policy_version 61890 (0.0006) [2023-03-06 15:51:14,758][04272] Updated weights for policy 0, policy_version 61900 (0.0007) [2023-03-06 15:51:15,576][04272] Updated weights for policy 0, policy_version 61910 (0.0006) [2023-03-06 15:51:16,375][04272] Updated weights for policy 0, policy_version 61920 (0.0006) [2023-03-06 15:51:17,185][04272] Updated weights for policy 0, policy_version 61930 (0.0006) [2023-03-06 15:51:18,012][04272] Updated weights for policy 0, policy_version 61940 (0.0006) [2023-03-06 15:51:18,813][04272] Updated weights for policy 0, policy_version 61950 (0.0006) [2023-03-06 15:51:18,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12614.3). Total num frames: 63437824. Throughput: 0: 12614.4. Samples: 63429992. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:51:18,952][03942] Avg episode reward: [(0, '1232.141')] [2023-03-06 15:51:19,610][04272] Updated weights for policy 0, policy_version 61960 (0.0007) [2023-03-06 15:51:20,416][04272] Updated weights for policy 0, policy_version 61970 (0.0007) [2023-03-06 15:51:21,238][04272] Updated weights for policy 0, policy_version 61980 (0.0006) [2023-03-06 15:51:22,055][04272] Updated weights for policy 0, policy_version 61990 (0.0006) [2023-03-06 15:51:22,874][04272] Updated weights for policy 0, policy_version 62000 (0.0006) [2023-03-06 15:51:23,697][04272] Updated weights for policy 0, policy_version 62010 (0.0006) [2023-03-06 15:51:23,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12610.8). Total num frames: 63500288. Throughput: 0: 12619.6. Samples: 63468097. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:51:23,941][03942] Avg episode reward: [(0, '1178.549')] [2023-03-06 15:51:24,502][04272] Updated weights for policy 0, policy_version 62020 (0.0007) [2023-03-06 15:51:25,328][04272] Updated weights for policy 0, policy_version 62030 (0.0006) [2023-03-06 15:51:26,141][04272] Updated weights for policy 0, policy_version 62040 (0.0006) [2023-03-06 15:51:26,945][04272] Updated weights for policy 0, policy_version 62050 (0.0007) [2023-03-06 15:51:27,763][04272] Updated weights for policy 0, policy_version 62060 (0.0006) [2023-03-06 15:51:28,582][04272] Updated weights for policy 0, policy_version 62070 (0.0007) [2023-03-06 15:51:28,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12614.3). Total num frames: 63563776. Throughput: 0: 12612.1. Samples: 63543467. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:51:28,941][03942] Avg episode reward: [(0, '1307.157')] [2023-03-06 15:51:29,395][04272] Updated weights for policy 0, policy_version 62080 (0.0006) [2023-03-06 15:51:30,210][04272] Updated weights for policy 0, policy_version 62090 (0.0008) [2023-03-06 15:51:31,018][04272] Updated weights for policy 0, policy_version 62100 (0.0006) [2023-03-06 15:51:31,826][04272] Updated weights for policy 0, policy_version 62110 (0.0006) [2023-03-06 15:51:32,629][04272] Updated weights for policy 0, policy_version 62120 (0.0006) [2023-03-06 15:51:33,455][04272] Updated weights for policy 0, policy_version 62130 (0.0006) [2023-03-06 15:51:33,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12612.3, 300 sec: 12610.8). Total num frames: 63627264. Throughput: 0: 12613.8. Samples: 63618994. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:51:33,941][03942] Avg episode reward: [(0, '1341.529')] [2023-03-06 15:51:34,256][04272] Updated weights for policy 0, policy_version 62140 (0.0006) [2023-03-06 15:51:35,068][04272] Updated weights for policy 0, policy_version 62150 (0.0006) [2023-03-06 15:51:35,885][04272] Updated weights for policy 0, policy_version 62160 (0.0006) [2023-03-06 15:51:36,716][04272] Updated weights for policy 0, policy_version 62170 (0.0008) [2023-03-06 15:51:37,514][04272] Updated weights for policy 0, policy_version 62180 (0.0007) [2023-03-06 15:51:38,334][04272] Updated weights for policy 0, policy_version 62190 (0.0006) [2023-03-06 15:51:38,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12607.3). Total num frames: 63689728. Throughput: 0: 12609.8. Samples: 63656763. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:51:38,941][03942] Avg episode reward: [(0, '1337.367')] [2023-03-06 15:51:39,146][04272] Updated weights for policy 0, policy_version 62200 (0.0006) [2023-03-06 15:51:39,970][04272] Updated weights for policy 0, policy_version 62210 (0.0006) [2023-03-06 15:51:40,797][04272] Updated weights for policy 0, policy_version 62220 (0.0007) [2023-03-06 15:51:41,626][04272] Updated weights for policy 0, policy_version 62230 (0.0006) [2023-03-06 15:51:42,427][04272] Updated weights for policy 0, policy_version 62240 (0.0007) [2023-03-06 15:51:43,249][04272] Updated weights for policy 0, policy_version 62250 (0.0006) [2023-03-06 15:51:43,941][03942] Fps is (10 sec: 12492.8, 60 sec: 12595.2, 300 sec: 12607.3). Total num frames: 63752192. Throughput: 0: 12600.6. Samples: 63732060. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:51:43,941][03942] Avg episode reward: [(0, '1332.752')] [2023-03-06 15:51:44,070][04272] Updated weights for policy 0, policy_version 62260 (0.0006) [2023-03-06 15:51:44,875][04272] Updated weights for policy 0, policy_version 62270 (0.0006) [2023-03-06 15:51:45,683][04272] Updated weights for policy 0, policy_version 62280 (0.0006) [2023-03-06 15:51:46,487][04272] Updated weights for policy 0, policy_version 62290 (0.0006) [2023-03-06 15:51:47,284][04272] Updated weights for policy 0, policy_version 62300 (0.0007) [2023-03-06 15:51:48,102][04272] Updated weights for policy 0, policy_version 62310 (0.0006) [2023-03-06 15:51:48,902][04272] Updated weights for policy 0, policy_version 62320 (0.0007) [2023-03-06 15:51:48,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12607.4). Total num frames: 63815680. Throughput: 0: 12597.3. Samples: 63807621. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:51:48,941][03942] Avg episode reward: [(0, '1350.492')] [2023-03-06 15:51:49,723][04272] Updated weights for policy 0, policy_version 62330 (0.0007) [2023-03-06 15:51:50,544][04272] Updated weights for policy 0, policy_version 62340 (0.0006) [2023-03-06 15:51:51,354][04272] Updated weights for policy 0, policy_version 62350 (0.0007) [2023-03-06 15:51:52,151][04272] Updated weights for policy 0, policy_version 62360 (0.0007) [2023-03-06 15:51:52,981][04272] Updated weights for policy 0, policy_version 62370 (0.0006) [2023-03-06 15:51:53,803][04272] Updated weights for policy 0, policy_version 62380 (0.0006) [2023-03-06 15:51:53,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12603.9). Total num frames: 63878144. Throughput: 0: 12593.2. Samples: 63845503. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:51:53,941][03942] Avg episode reward: [(0, '1309.617')] [2023-03-06 15:51:54,615][04272] Updated weights for policy 0, policy_version 62390 (0.0006) [2023-03-06 15:51:55,434][04272] Updated weights for policy 0, policy_version 62400 (0.0006) [2023-03-06 15:51:56,251][04272] Updated weights for policy 0, policy_version 62410 (0.0007) [2023-03-06 15:51:57,063][04272] Updated weights for policy 0, policy_version 62420 (0.0006) [2023-03-06 15:51:57,878][04272] Updated weights for policy 0, policy_version 62430 (0.0007) [2023-03-06 15:51:58,695][04272] Updated weights for policy 0, policy_version 62440 (0.0007) [2023-03-06 15:51:58,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12603.9). Total num frames: 63941632. Throughput: 0: 12590.0. Samples: 63920789. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:51:58,941][03942] Avg episode reward: [(0, '1266.518')] [2023-03-06 15:51:59,506][04272] Updated weights for policy 0, policy_version 62450 (0.0007) [2023-03-06 15:52:00,307][04272] Updated weights for policy 0, policy_version 62460 (0.0006) [2023-03-06 15:52:01,141][04272] Updated weights for policy 0, policy_version 62470 (0.0006) [2023-03-06 15:52:01,945][04272] Updated weights for policy 0, policy_version 62480 (0.0006) [2023-03-06 15:52:02,760][04272] Updated weights for policy 0, policy_version 62490 (0.0006) [2023-03-06 15:52:03,582][04272] Updated weights for policy 0, policy_version 62500 (0.0007) [2023-03-06 15:52:03,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12603.9). Total num frames: 64004096. Throughput: 0: 12582.1. Samples: 63996188. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:52:03,941][03942] Avg episode reward: [(0, '1260.567')] [2023-03-06 15:52:04,395][04272] Updated weights for policy 0, policy_version 62510 (0.0006) [2023-03-06 15:52:05,193][04272] Updated weights for policy 0, policy_version 62520 (0.0006) [2023-03-06 15:52:06,015][04272] Updated weights for policy 0, policy_version 62530 (0.0007) [2023-03-06 15:52:06,817][04272] Updated weights for policy 0, policy_version 62540 (0.0006) [2023-03-06 15:52:07,612][04272] Updated weights for policy 0, policy_version 62550 (0.0006) [2023-03-06 15:52:08,417][04272] Updated weights for policy 0, policy_version 62560 (0.0006) [2023-03-06 15:52:08,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12603.9). Total num frames: 64067584. Throughput: 0: 12581.4. Samples: 64034260. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:52:08,941][03942] Avg episode reward: [(0, '1293.561')] [2023-03-06 15:52:09,230][04272] Updated weights for policy 0, policy_version 62570 (0.0006) [2023-03-06 15:52:10,018][04272] Updated weights for policy 0, policy_version 62580 (0.0006) [2023-03-06 15:52:10,837][04272] Updated weights for policy 0, policy_version 62590 (0.0007) [2023-03-06 15:52:11,666][04272] Updated weights for policy 0, policy_version 62600 (0.0007) [2023-03-06 15:52:12,467][04272] Updated weights for policy 0, policy_version 62610 (0.0007) [2023-03-06 15:52:13,314][04272] Updated weights for policy 0, policy_version 62620 (0.0006) [2023-03-06 15:52:13,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12612.3, 300 sec: 12607.4). Total num frames: 64131072. Throughput: 0: 12592.2. Samples: 64110113. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:52:13,941][03942] Avg episode reward: [(0, '1301.939')] [2023-03-06 15:52:14,105][04272] Updated weights for policy 0, policy_version 62630 (0.0006) [2023-03-06 15:52:14,898][04272] Updated weights for policy 0, policy_version 62640 (0.0006) [2023-03-06 15:52:15,721][04272] Updated weights for policy 0, policy_version 62650 (0.0006) [2023-03-06 15:52:16,509][04272] Updated weights for policy 0, policy_version 62660 (0.0007) [2023-03-06 15:52:17,331][04272] Updated weights for policy 0, policy_version 62670 (0.0006) [2023-03-06 15:52:18,131][04272] Updated weights for policy 0, policy_version 62680 (0.0006) [2023-03-06 15:52:18,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12603.9). Total num frames: 64193536. Throughput: 0: 12602.9. Samples: 64186125. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:52:18,941][03942] Avg episode reward: [(0, '1275.344')] [2023-03-06 15:52:18,958][04272] Updated weights for policy 0, policy_version 62690 (0.0006) [2023-03-06 15:52:19,757][04272] Updated weights for policy 0, policy_version 62700 (0.0006) [2023-03-06 15:52:20,564][04272] Updated weights for policy 0, policy_version 62710 (0.0007) [2023-03-06 15:52:21,385][04272] Updated weights for policy 0, policy_version 62720 (0.0007) [2023-03-06 15:52:22,205][04272] Updated weights for policy 0, policy_version 62730 (0.0006) [2023-03-06 15:52:22,997][04272] Updated weights for policy 0, policy_version 62740 (0.0006) [2023-03-06 15:52:23,836][04272] Updated weights for policy 0, policy_version 62750 (0.0007) [2023-03-06 15:52:23,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12603.9). Total num frames: 64257024. Throughput: 0: 12605.1. Samples: 64223992. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:52:23,941][03942] Avg episode reward: [(0, '1337.489')] [2023-03-06 15:52:24,640][04272] Updated weights for policy 0, policy_version 62760 (0.0006) [2023-03-06 15:52:25,442][04272] Updated weights for policy 0, policy_version 62770 (0.0006) [2023-03-06 15:52:26,258][04272] Updated weights for policy 0, policy_version 62780 (0.0006) [2023-03-06 15:52:27,070][04272] Updated weights for policy 0, policy_version 62790 (0.0006) [2023-03-06 15:52:27,885][04272] Updated weights for policy 0, policy_version 62800 (0.0005) [2023-03-06 15:52:28,683][04272] Updated weights for policy 0, policy_version 62810 (0.0006) [2023-03-06 15:52:28,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12612.3, 300 sec: 12603.9). Total num frames: 64320512. Throughput: 0: 12612.7. Samples: 64299628. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:52:28,941][03942] Avg episode reward: [(0, '1234.389')] [2023-03-06 15:52:29,510][04272] Updated weights for policy 0, policy_version 62820 (0.0006) [2023-03-06 15:52:30,315][04272] Updated weights for policy 0, policy_version 62830 (0.0006) [2023-03-06 15:52:31,149][04272] Updated weights for policy 0, policy_version 62840 (0.0006) [2023-03-06 15:52:31,959][04272] Updated weights for policy 0, policy_version 62850 (0.0007) [2023-03-06 15:52:32,765][04272] Updated weights for policy 0, policy_version 62860 (0.0006) [2023-03-06 15:52:33,586][04272] Updated weights for policy 0, policy_version 62870 (0.0006) [2023-03-06 15:52:33,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12603.9). Total num frames: 64382976. Throughput: 0: 12611.7. Samples: 64375148. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:52:33,941][03942] Avg episode reward: [(0, '1227.448')] [2023-03-06 15:52:34,394][04272] Updated weights for policy 0, policy_version 62880 (0.0007) [2023-03-06 15:52:35,206][04272] Updated weights for policy 0, policy_version 62890 (0.0006) [2023-03-06 15:52:36,008][04272] Updated weights for policy 0, policy_version 62900 (0.0006) [2023-03-06 15:52:36,815][04272] Updated weights for policy 0, policy_version 62910 (0.0006) [2023-03-06 15:52:37,615][04272] Updated weights for policy 0, policy_version 62920 (0.0007) [2023-03-06 15:52:38,421][04272] Updated weights for policy 0, policy_version 62930 (0.0007) [2023-03-06 15:52:38,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12603.9). Total num frames: 64446464. Throughput: 0: 12610.3. Samples: 64412964. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:52:38,941][03942] Avg episode reward: [(0, '1293.142')] [2023-03-06 15:52:39,253][04272] Updated weights for policy 0, policy_version 62940 (0.0006) [2023-03-06 15:52:40,058][04272] Updated weights for policy 0, policy_version 62950 (0.0007) [2023-03-06 15:52:40,879][04272] Updated weights for policy 0, policy_version 62960 (0.0007) [2023-03-06 15:52:41,706][04272] Updated weights for policy 0, policy_version 62970 (0.0006) [2023-03-06 15:52:42,507][04272] Updated weights for policy 0, policy_version 62980 (0.0007) [2023-03-06 15:52:43,327][04272] Updated weights for policy 0, policy_version 62990 (0.0006) [2023-03-06 15:52:43,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12600.4). Total num frames: 64508928. Throughput: 0: 12620.5. Samples: 64488711. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:52:43,941][03942] Avg episode reward: [(0, '1349.179')] [2023-03-06 15:52:44,142][04272] Updated weights for policy 0, policy_version 63000 (0.0007) [2023-03-06 15:52:44,945][04272] Updated weights for policy 0, policy_version 63010 (0.0007) [2023-03-06 15:52:45,761][04272] Updated weights for policy 0, policy_version 63020 (0.0006) [2023-03-06 15:52:46,567][04272] Updated weights for policy 0, policy_version 63030 (0.0007) [2023-03-06 15:52:47,378][04272] Updated weights for policy 0, policy_version 63040 (0.0007) [2023-03-06 15:52:48,202][04272] Updated weights for policy 0, policy_version 63050 (0.0006) [2023-03-06 15:52:48,941][03942] Fps is (10 sec: 12492.7, 60 sec: 12595.2, 300 sec: 12600.4). Total num frames: 64571392. Throughput: 0: 12625.3. Samples: 64564325. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:52:48,941][03942] Avg episode reward: [(0, '1348.641')] [2023-03-06 15:52:49,011][04272] Updated weights for policy 0, policy_version 63060 (0.0006) [2023-03-06 15:52:49,838][04272] Updated weights for policy 0, policy_version 63070 (0.0006) [2023-03-06 15:52:50,658][04272] Updated weights for policy 0, policy_version 63080 (0.0006) [2023-03-06 15:52:51,466][04272] Updated weights for policy 0, policy_version 63090 (0.0006) [2023-03-06 15:52:52,294][04272] Updated weights for policy 0, policy_version 63100 (0.0006) [2023-03-06 15:52:53,107][04272] Updated weights for policy 0, policy_version 63110 (0.0006) [2023-03-06 15:52:53,926][04272] Updated weights for policy 0, policy_version 63120 (0.0006) [2023-03-06 15:52:53,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12600.4). Total num frames: 64634880. Throughput: 0: 12609.3. Samples: 64601678. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:52:53,941][03942] Avg episode reward: [(0, '1129.870')] [2023-03-06 15:52:54,753][04272] Updated weights for policy 0, policy_version 63130 (0.0006) [2023-03-06 15:52:55,551][04272] Updated weights for policy 0, policy_version 63140 (0.0007) [2023-03-06 15:52:56,376][04272] Updated weights for policy 0, policy_version 63150 (0.0007) [2023-03-06 15:52:57,190][04272] Updated weights for policy 0, policy_version 63160 (0.0006) [2023-03-06 15:52:58,007][04272] Updated weights for policy 0, policy_version 63170 (0.0006) [2023-03-06 15:52:58,825][04272] Updated weights for policy 0, policy_version 63180 (0.0006) [2023-03-06 15:52:58,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12596.9). Total num frames: 64697344. Throughput: 0: 12594.3. Samples: 64676856. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:52:58,952][03942] Avg episode reward: [(0, '1238.309')] [2023-03-06 15:52:59,631][04272] Updated weights for policy 0, policy_version 63190 (0.0007) [2023-03-06 15:53:00,429][04272] Updated weights for policy 0, policy_version 63200 (0.0006) [2023-03-06 15:53:01,243][04272] Updated weights for policy 0, policy_version 63210 (0.0007) [2023-03-06 15:53:02,066][04272] Updated weights for policy 0, policy_version 63220 (0.0006) [2023-03-06 15:53:02,858][04272] Updated weights for policy 0, policy_version 63230 (0.0008) [2023-03-06 15:53:03,679][04272] Updated weights for policy 0, policy_version 63240 (0.0006) [2023-03-06 15:53:03,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12600.4). Total num frames: 64760832. Throughput: 0: 12590.2. Samples: 64752682. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:53:03,951][03942] Avg episode reward: [(0, '1255.570')] [2023-03-06 15:53:04,505][04272] Updated weights for policy 0, policy_version 63250 (0.0007) [2023-03-06 15:53:05,318][04272] Updated weights for policy 0, policy_version 63260 (0.0006) [2023-03-06 15:53:06,144][04272] Updated weights for policy 0, policy_version 63270 (0.0007) [2023-03-06 15:53:06,941][04272] Updated weights for policy 0, policy_version 63280 (0.0007) [2023-03-06 15:53:07,761][04272] Updated weights for policy 0, policy_version 63290 (0.0006) [2023-03-06 15:53:08,592][04272] Updated weights for policy 0, policy_version 63300 (0.0006) [2023-03-06 15:53:08,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12596.9). Total num frames: 64823296. Throughput: 0: 12582.3. Samples: 64790198. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:53:08,952][03942] Avg episode reward: [(0, '1291.004')] [2023-03-06 15:53:08,957][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000063304_64823296.pth... 
[2023-03-06 15:53:08,987][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000060353_61801472.pth [2023-03-06 15:53:09,402][04272] Updated weights for policy 0, policy_version 63310 (0.0006) [2023-03-06 15:53:10,205][04272] Updated weights for policy 0, policy_version 63320 (0.0006) [2023-03-06 15:53:11,013][04272] Updated weights for policy 0, policy_version 63330 (0.0006) [2023-03-06 15:53:11,824][04272] Updated weights for policy 0, policy_version 63340 (0.0007) [2023-03-06 15:53:12,644][04272] Updated weights for policy 0, policy_version 63350 (0.0006) [2023-03-06 15:53:13,441][04272] Updated weights for policy 0, policy_version 63360 (0.0006) [2023-03-06 15:53:13,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12596.9). Total num frames: 64886784. Throughput: 0: 12581.4. Samples: 64865793. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:53:13,941][03942] Avg episode reward: [(0, '1125.897')] [2023-03-06 15:53:14,266][04272] Updated weights for policy 0, policy_version 63370 (0.0006) [2023-03-06 15:53:15,074][04272] Updated weights for policy 0, policy_version 63380 (0.0006) [2023-03-06 15:53:15,892][04272] Updated weights for policy 0, policy_version 63390 (0.0006) [2023-03-06 15:53:16,699][04272] Updated weights for policy 0, policy_version 63400 (0.0006) [2023-03-06 15:53:17,509][04272] Updated weights for policy 0, policy_version 63410 (0.0006) [2023-03-06 15:53:18,314][04272] Updated weights for policy 0, policy_version 63420 (0.0006) [2023-03-06 15:53:18,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12596.9). Total num frames: 64949248. Throughput: 0: 12587.5. Samples: 64941587. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:53:18,941][03942] Avg episode reward: [(0, '1112.232')] [2023-03-06 15:53:19,137][04272] Updated weights for policy 0, policy_version 63430 (0.0006) [2023-03-06 15:53:19,937][04272] Updated weights for policy 0, policy_version 63440 (0.0006) [2023-03-06 15:53:20,747][04272] Updated weights for policy 0, policy_version 63450 (0.0006) [2023-03-06 15:53:21,552][04272] Updated weights for policy 0, policy_version 63460 (0.0006) [2023-03-06 15:53:22,356][04272] Updated weights for policy 0, policy_version 63470 (0.0006) [2023-03-06 15:53:23,181][04272] Updated weights for policy 0, policy_version 63480 (0.0007) [2023-03-06 15:53:23,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12596.9). Total num frames: 65012736. Throughput: 0: 12588.1. Samples: 64979429. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:53:23,941][03942] Avg episode reward: [(0, '1229.573')] [2023-03-06 15:53:24,006][04272] Updated weights for policy 0, policy_version 63490 (0.0006) [2023-03-06 15:53:24,808][04272] Updated weights for policy 0, policy_version 63500 (0.0007) [2023-03-06 15:53:25,608][04272] Updated weights for policy 0, policy_version 63510 (0.0006) [2023-03-06 15:53:26,421][04272] Updated weights for policy 0, policy_version 63520 (0.0007) [2023-03-06 15:53:27,234][04272] Updated weights for policy 0, policy_version 63530 (0.0006) [2023-03-06 15:53:28,032][04272] Updated weights for policy 0, policy_version 63540 (0.0007) [2023-03-06 15:53:28,857][04272] Updated weights for policy 0, policy_version 63550 (0.0007) [2023-03-06 15:53:28,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12595.2, 300 sec: 12600.4). Total num frames: 65076224. Throughput: 0: 12591.2. Samples: 65055316. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:53:28,941][03942] Avg episode reward: [(0, '1171.450')] [2023-03-06 15:53:29,681][04272] Updated weights for policy 0, policy_version 63560 (0.0007) [2023-03-06 15:53:30,501][04272] Updated weights for policy 0, policy_version 63570 (0.0007) [2023-03-06 15:53:31,333][04272] Updated weights for policy 0, policy_version 63580 (0.0006) [2023-03-06 15:53:32,150][04272] Updated weights for policy 0, policy_version 63590 (0.0006) [2023-03-06 15:53:32,963][04272] Updated weights for policy 0, policy_version 63600 (0.0007) [2023-03-06 15:53:33,782][04272] Updated weights for policy 0, policy_version 63610 (0.0006) [2023-03-06 15:53:33,940][03942] Fps is (10 sec: 12492.8, 60 sec: 12578.1, 300 sec: 12593.5). Total num frames: 65137664. Throughput: 0: 12580.9. Samples: 65130465. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:53:33,941][03942] Avg episode reward: [(0, '1242.798')] [2023-03-06 15:53:34,609][04272] Updated weights for policy 0, policy_version 63620 (0.0007) [2023-03-06 15:53:35,413][04272] Updated weights for policy 0, policy_version 63630 (0.0006) [2023-03-06 15:53:36,238][04272] Updated weights for policy 0, policy_version 63640 (0.0007) [2023-03-06 15:53:37,054][04272] Updated weights for policy 0, policy_version 63650 (0.0006) [2023-03-06 15:53:37,876][04272] Updated weights for policy 0, policy_version 63660 (0.0006) [2023-03-06 15:53:38,700][04272] Updated weights for policy 0, policy_version 63670 (0.0006) [2023-03-06 15:53:38,940][03942] Fps is (10 sec: 12492.7, 60 sec: 12578.1, 300 sec: 12596.9). Total num frames: 65201152. Throughput: 0: 12584.7. Samples: 65167989. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:53:38,941][03942] Avg episode reward: [(0, '1279.594')] [2023-03-06 15:53:39,496][04272] Updated weights for policy 0, policy_version 63680 (0.0007) [2023-03-06 15:53:40,317][04272] Updated weights for policy 0, policy_version 63690 (0.0007) [2023-03-06 15:53:41,130][04272] Updated weights for policy 0, policy_version 63700 (0.0006) [2023-03-06 15:53:41,952][04272] Updated weights for policy 0, policy_version 63710 (0.0006) [2023-03-06 15:53:42,767][04272] Updated weights for policy 0, policy_version 63720 (0.0007) [2023-03-06 15:53:43,578][04272] Updated weights for policy 0, policy_version 63730 (0.0006) [2023-03-06 15:53:43,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12578.1, 300 sec: 12593.5). Total num frames: 65263616. Throughput: 0: 12585.8. Samples: 65243219. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:53:43,941][03942] Avg episode reward: [(0, '1226.650')] [2023-03-06 15:53:44,404][04272] Updated weights for policy 0, policy_version 63740 (0.0006) [2023-03-06 15:53:45,226][04272] Updated weights for policy 0, policy_version 63750 (0.0006) [2023-03-06 15:53:46,011][04272] Updated weights for policy 0, policy_version 63760 (0.0006) [2023-03-06 15:53:46,832][04272] Updated weights for policy 0, policy_version 63770 (0.0006) [2023-03-06 15:53:47,635][04272] Updated weights for policy 0, policy_version 63780 (0.0006) [2023-03-06 15:53:48,442][04272] Updated weights for policy 0, policy_version 63790 (0.0006) [2023-03-06 15:53:48,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12593.5). Total num frames: 65327104. Throughput: 0: 12581.4. Samples: 65318844. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:53:48,941][03942] Avg episode reward: [(0, '1145.492')] [2023-03-06 15:53:49,262][04272] Updated weights for policy 0, policy_version 63800 (0.0007) [2023-03-06 15:53:50,068][04272] Updated weights for policy 0, policy_version 63810 (0.0006) [2023-03-06 15:53:50,854][04272] Updated weights for policy 0, policy_version 63820 (0.0007) [2023-03-06 15:53:51,685][04272] Updated weights for policy 0, policy_version 63830 (0.0006) [2023-03-06 15:53:52,497][04272] Updated weights for policy 0, policy_version 63840 (0.0006) [2023-03-06 15:53:53,298][04272] Updated weights for policy 0, policy_version 63850 (0.0007) [2023-03-06 15:53:53,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12578.1, 300 sec: 12593.5). Total num frames: 65389568. Throughput: 0: 12589.9. Samples: 65356741. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:53:53,941][03942] Avg episode reward: [(0, '1189.771')] [2023-03-06 15:53:54,118][04272] Updated weights for policy 0, policy_version 63860 (0.0006) [2023-03-06 15:53:54,953][04272] Updated weights for policy 0, policy_version 63870 (0.0007) [2023-03-06 15:53:55,749][04272] Updated weights for policy 0, policy_version 63880 (0.0007) [2023-03-06 15:53:56,564][04272] Updated weights for policy 0, policy_version 63890 (0.0006) [2023-03-06 15:53:57,357][04272] Updated weights for policy 0, policy_version 63900 (0.0006) [2023-03-06 15:53:58,166][04272] Updated weights for policy 0, policy_version 63910 (0.0006) [2023-03-06 15:53:58,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12593.5). Total num frames: 65453056. Throughput: 0: 12593.0. Samples: 65432477. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:53:58,941][03942] Avg episode reward: [(0, '1227.419')] [2023-03-06 15:53:58,995][04272] Updated weights for policy 0, policy_version 63920 (0.0006) [2023-03-06 15:53:59,798][04272] Updated weights for policy 0, policy_version 63930 (0.0006) [2023-03-06 15:54:00,604][04272] Updated weights for policy 0, policy_version 63940 (0.0006) [2023-03-06 15:54:01,434][04272] Updated weights for policy 0, policy_version 63950 (0.0006) [2023-03-06 15:54:02,248][04272] Updated weights for policy 0, policy_version 63960 (0.0006) [2023-03-06 15:54:03,046][04272] Updated weights for policy 0, policy_version 63970 (0.0006) [2023-03-06 15:54:03,871][04272] Updated weights for policy 0, policy_version 63980 (0.0006) [2023-03-06 15:54:03,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12595.2, 300 sec: 12596.9). Total num frames: 65516544. Throughput: 0: 12592.0. Samples: 65508229. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:54:03,941][03942] Avg episode reward: [(0, '1116.494')] [2023-03-06 15:54:04,666][04272] Updated weights for policy 0, policy_version 63990 (0.0006) [2023-03-06 15:54:05,459][04272] Updated weights for policy 0, policy_version 64000 (0.0006) [2023-03-06 15:54:06,270][04272] Updated weights for policy 0, policy_version 64010 (0.0006) [2023-03-06 15:54:07,088][04272] Updated weights for policy 0, policy_version 64020 (0.0006) [2023-03-06 15:54:07,881][04272] Updated weights for policy 0, policy_version 64030 (0.0007) [2023-03-06 15:54:08,704][04272] Updated weights for policy 0, policy_version 64040 (0.0006) [2023-03-06 15:54:08,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12593.5). Total num frames: 65579008. Throughput: 0: 12597.9. Samples: 65546336. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:54:08,941][03942] Avg episode reward: [(0, '1312.884')] [2023-03-06 15:54:09,518][04272] Updated weights for policy 0, policy_version 64050 (0.0007) [2023-03-06 15:54:10,337][04272] Updated weights for policy 0, policy_version 64060 (0.0006) [2023-03-06 15:54:11,145][04272] Updated weights for policy 0, policy_version 64070 (0.0006) [2023-03-06 15:54:11,964][04272] Updated weights for policy 0, policy_version 64080 (0.0007) [2023-03-06 15:54:12,762][04272] Updated weights for policy 0, policy_version 64090 (0.0006) [2023-03-06 15:54:13,576][04272] Updated weights for policy 0, policy_version 64100 (0.0007) [2023-03-06 15:54:13,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12596.9). Total num frames: 65642496. Throughput: 0: 12593.0. Samples: 65622003. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:54:13,941][03942] Avg episode reward: [(0, '1297.850')] [2023-03-06 15:54:14,387][04272] Updated weights for policy 0, policy_version 64110 (0.0007) [2023-03-06 15:54:15,176][04272] Updated weights for policy 0, policy_version 64120 (0.0005) [2023-03-06 15:54:15,982][04272] Updated weights for policy 0, policy_version 64130 (0.0007) [2023-03-06 15:54:16,799][04272] Updated weights for policy 0, policy_version 64140 (0.0006) [2023-03-06 15:54:17,610][04272] Updated weights for policy 0, policy_version 64150 (0.0007) [2023-03-06 15:54:18,434][04272] Updated weights for policy 0, policy_version 64160 (0.0007) [2023-03-06 15:54:18,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12612.3, 300 sec: 12596.9). Total num frames: 65705984. Throughput: 0: 12610.8. Samples: 65697949. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:54:18,951][03942] Avg episode reward: [(0, '1202.669')] [2023-03-06 15:54:19,248][04272] Updated weights for policy 0, policy_version 64170 (0.0007) [2023-03-06 15:54:20,071][04272] Updated weights for policy 0, policy_version 64180 (0.0006) [2023-03-06 15:54:20,892][04272] Updated weights for policy 0, policy_version 64190 (0.0006) [2023-03-06 15:54:21,706][04272] Updated weights for policy 0, policy_version 64200 (0.0007) [2023-03-06 15:54:22,528][04272] Updated weights for policy 0, policy_version 64210 (0.0006) [2023-03-06 15:54:23,331][04272] Updated weights for policy 0, policy_version 64220 (0.0006) [2023-03-06 15:54:23,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12596.9). Total num frames: 65768448. Throughput: 0: 12608.9. Samples: 65735388. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:54:23,941][03942] Avg episode reward: [(0, '1276.848')] [2023-03-06 15:54:24,134][04272] Updated weights for policy 0, policy_version 64230 (0.0007) [2023-03-06 15:54:24,970][04272] Updated weights for policy 0, policy_version 64240 (0.0006) [2023-03-06 15:54:25,764][04272] Updated weights for policy 0, policy_version 64250 (0.0007) [2023-03-06 15:54:26,588][04272] Updated weights for policy 0, policy_version 64260 (0.0005) [2023-03-06 15:54:27,385][04272] Updated weights for policy 0, policy_version 64270 (0.0007) [2023-03-06 15:54:28,206][04272] Updated weights for policy 0, policy_version 64280 (0.0006) [2023-03-06 15:54:28,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12596.9). Total num frames: 65831936. Throughput: 0: 12620.6. Samples: 65811145. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:54:28,951][03942] Avg episode reward: [(0, '1160.078')] [2023-03-06 15:54:29,027][04272] Updated weights for policy 0, policy_version 64290 (0.0007) [2023-03-06 15:54:29,848][04272] Updated weights for policy 0, policy_version 64300 (0.0006) [2023-03-06 15:54:30,649][04272] Updated weights for policy 0, policy_version 64310 (0.0007) [2023-03-06 15:54:31,466][04272] Updated weights for policy 0, policy_version 64320 (0.0006) [2023-03-06 15:54:32,281][04272] Updated weights for policy 0, policy_version 64330 (0.0006) [2023-03-06 15:54:33,095][04272] Updated weights for policy 0, policy_version 64340 (0.0006) [2023-03-06 15:54:33,916][04272] Updated weights for policy 0, policy_version 64350 (0.0006) [2023-03-06 15:54:33,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12596.9). Total num frames: 65894400. Throughput: 0: 12614.6. Samples: 65886501. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:54:33,952][03942] Avg episode reward: [(0, '1247.111')] [2023-03-06 15:54:34,725][04272] Updated weights for policy 0, policy_version 64360 (0.0006) [2023-03-06 15:54:35,550][04272] Updated weights for policy 0, policy_version 64370 (0.0006) [2023-03-06 15:54:36,354][04272] Updated weights for policy 0, policy_version 64380 (0.0006) [2023-03-06 15:54:37,176][04272] Updated weights for policy 0, policy_version 64390 (0.0006) [2023-03-06 15:54:37,990][04272] Updated weights for policy 0, policy_version 64400 (0.0006) [2023-03-06 15:54:38,817][04272] Updated weights for policy 0, policy_version 64410 (0.0006) [2023-03-06 15:54:38,940][03942] Fps is (10 sec: 12492.7, 60 sec: 12595.2, 300 sec: 12593.5). Total num frames: 65956864. Throughput: 0: 12610.3. Samples: 65924206. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:54:38,951][03942] Avg episode reward: [(0, '1154.137')] [2023-03-06 15:54:39,642][04272] Updated weights for policy 0, policy_version 64420 (0.0006) [2023-03-06 15:54:40,433][04272] Updated weights for policy 0, policy_version 64430 (0.0005) [2023-03-06 15:54:41,270][04272] Updated weights for policy 0, policy_version 64440 (0.0007) [2023-03-06 15:54:42,085][04272] Updated weights for policy 0, policy_version 64450 (0.0007) [2023-03-06 15:54:42,709][04221] KL-divergence is very high: 298.7138 [2023-03-06 15:54:42,807][04221] KL-divergence is very high: 984.5442 [2023-03-06 15:54:42,879][04272] Updated weights for policy 0, policy_version 64460 (0.0006) [2023-03-06 15:54:43,690][04272] Updated weights for policy 0, policy_version 64470 (0.0006) [2023-03-06 15:54:43,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12596.9). Total num frames: 66020352. Throughput: 0: 12598.8. Samples: 65999424. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:54:43,951][03942] Avg episode reward: [(0, '1241.328')] [2023-03-06 15:54:44,002][04221] KL-divergence is very high: 190.9275 [2023-03-06 15:54:44,282][04221] KL-divergence is very high: 285.6480 [2023-03-06 15:54:44,519][04272] Updated weights for policy 0, policy_version 64480 (0.0007) [2023-03-06 15:54:45,322][04272] Updated weights for policy 0, policy_version 64490 (0.0006) [2023-03-06 15:54:46,126][04272] Updated weights for policy 0, policy_version 64500 (0.0006) [2023-03-06 15:54:46,949][04272] Updated weights for policy 0, policy_version 64510 (0.0007) [2023-03-06 15:54:47,751][04272] Updated weights for policy 0, policy_version 64520 (0.0006) [2023-03-06 15:54:48,573][04272] Updated weights for policy 0, policy_version 64530 (0.0006) [2023-03-06 15:54:48,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12593.5). Total num frames: 66082816. Throughput: 0: 12595.5. Samples: 66075028. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:54:48,951][03942] Avg episode reward: [(0, '1134.716')] [2023-03-06 15:54:49,397][04272] Updated weights for policy 0, policy_version 64540 (0.0006) [2023-03-06 15:54:50,191][04272] Updated weights for policy 0, policy_version 64550 (0.0006) [2023-03-06 15:54:51,017][04272] Updated weights for policy 0, policy_version 64560 (0.0007) [2023-03-06 15:54:51,832][04272] Updated weights for policy 0, policy_version 64570 (0.0007) [2023-03-06 15:54:52,641][04272] Updated weights for policy 0, policy_version 64580 (0.0006) [2023-03-06 15:54:52,744][04221] KL-divergence is very high: 638.6639 [2023-03-06 15:54:52,806][04221] KL-divergence is very high: 285.0575 [2023-03-06 15:54:53,391][04221] KL-divergence is very high: 379.6931 [2023-03-06 15:54:53,455][04272] Updated weights for policy 0, policy_version 64590 (0.0006) [2023-03-06 15:54:53,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12593.5). Total num frames: 66146304. Throughput: 0: 12588.2. Samples: 66112806. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:54:53,951][03942] Avg episode reward: [(0, '1152.122')] [2023-03-06 15:54:54,277][04272] Updated weights for policy 0, policy_version 64600 (0.0007) [2023-03-06 15:54:55,091][04272] Updated weights for policy 0, policy_version 64610 (0.0006) [2023-03-06 15:54:55,902][04272] Updated weights for policy 0, policy_version 64620 (0.0007) [2023-03-06 15:54:56,722][04272] Updated weights for policy 0, policy_version 64630 (0.0006) [2023-03-06 15:54:57,520][04272] Updated weights for policy 0, policy_version 64640 (0.0006) [2023-03-06 15:54:58,329][04272] Updated weights for policy 0, policy_version 64650 (0.0006) [2023-03-06 15:54:58,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12593.5). Total num frames: 66208768. Throughput: 0: 12582.1. Samples: 66188198. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:54:58,951][03942] Avg episode reward: [(0, '1134.481')] [2023-03-06 15:54:59,155][04272] Updated weights for policy 0, policy_version 64660 (0.0006) [2023-03-06 15:54:59,958][04272] Updated weights for policy 0, policy_version 64670 (0.0006) [2023-03-06 15:55:00,781][04272] Updated weights for policy 0, policy_version 64680 (0.0006) [2023-03-06 15:55:01,595][04272] Updated weights for policy 0, policy_version 64690 (0.0006) [2023-03-06 15:55:02,394][04272] Updated weights for policy 0, policy_version 64700 (0.0006) [2023-03-06 15:55:03,209][04272] Updated weights for policy 0, policy_version 64710 (0.0006) [2023-03-06 15:55:03,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12596.9). Total num frames: 66272256. Throughput: 0: 12576.6. Samples: 66263895. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:55:03,941][03942] Avg episode reward: [(0, '1163.831')] [2023-03-06 15:55:04,010][04272] Updated weights for policy 0, policy_version 64720 (0.0007) [2023-03-06 15:55:04,837][04272] Updated weights for policy 0, policy_version 64730 (0.0006) [2023-03-06 15:55:05,659][04272] Updated weights for policy 0, policy_version 64740 (0.0006) [2023-03-06 15:55:06,475][04272] Updated weights for policy 0, policy_version 64750 (0.0006) [2023-03-06 15:55:07,286][04272] Updated weights for policy 0, policy_version 64760 (0.0006) [2023-03-06 15:55:08,107][04272] Updated weights for policy 0, policy_version 64770 (0.0006) [2023-03-06 15:55:08,906][04272] Updated weights for policy 0, policy_version 64780 (0.0006) [2023-03-06 15:55:08,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12593.5). Total num frames: 66334720. Throughput: 0: 12581.8. Samples: 66301567. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:55:08,941][03942] Avg episode reward: [(0, '1086.139')] [2023-03-06 15:55:08,945][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000064780_66334720.pth... [2023-03-06 15:55:08,978][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000061828_63311872.pth [2023-03-06 15:55:09,721][04272] Updated weights for policy 0, policy_version 64790 (0.0007) [2023-03-06 15:55:09,773][04221] KL-divergence is very high: 241.6165 [2023-03-06 15:55:10,528][04272] Updated weights for policy 0, policy_version 64800 (0.0006) [2023-03-06 15:55:11,348][04272] Updated weights for policy 0, policy_version 64810 (0.0007) [2023-03-06 15:55:12,158][04272] Updated weights for policy 0, policy_version 64820 (0.0006) [2023-03-06 15:55:12,979][04272] Updated weights for policy 0, policy_version 64830 (0.0007) [2023-03-06 15:55:13,788][04272] Updated weights for policy 0, policy_version 64840 (0.0006) [2023-03-06 15:55:13,940][03942] Fps is (10 sec: 12492.8, 60 sec: 12578.1, 300 sec: 12596.9). Total num frames: 66397184. Throughput: 0: 12579.6. Samples: 66377229. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:55:13,941][03942] Avg episode reward: [(0, '1218.977')] [2023-03-06 15:55:14,591][04272] Updated weights for policy 0, policy_version 64850 (0.0006) [2023-03-06 15:55:15,415][04272] Updated weights for policy 0, policy_version 64860 (0.0006) [2023-03-06 15:55:16,205][04272] Updated weights for policy 0, policy_version 64870 (0.0006) [2023-03-06 15:55:17,023][04272] Updated weights for policy 0, policy_version 64880 (0.0007) [2023-03-06 15:55:17,838][04272] Updated weights for policy 0, policy_version 64890 (0.0006) [2023-03-06 15:55:18,640][04272] Updated weights for policy 0, policy_version 64900 (0.0006) [2023-03-06 15:55:18,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12578.1, 300 sec: 12596.9). 
Total num frames: 66460672. Throughput: 0: 12588.0. Samples: 66452962. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:55:18,941][03942] Avg episode reward: [(0, '1274.886')] [2023-03-06 15:55:19,459][04272] Updated weights for policy 0, policy_version 64910 (0.0007) [2023-03-06 15:55:20,259][04272] Updated weights for policy 0, policy_version 64920 (0.0006) [2023-03-06 15:55:21,079][04272] Updated weights for policy 0, policy_version 64930 (0.0006) [2023-03-06 15:55:21,885][04272] Updated weights for policy 0, policy_version 64940 (0.0006) [2023-03-06 15:55:22,688][04272] Updated weights for policy 0, policy_version 64950 (0.0007) [2023-03-06 15:55:23,490][04272] Updated weights for policy 0, policy_version 64960 (0.0006) [2023-03-06 15:55:23,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12595.2, 300 sec: 12600.4). Total num frames: 66524160. Throughput: 0: 12591.6. Samples: 66490827. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:55:23,941][03942] Avg episode reward: [(0, '1261.235')] [2023-03-06 15:55:24,310][04272] Updated weights for policy 0, policy_version 64970 (0.0007) [2023-03-06 15:55:25,123][04272] Updated weights for policy 0, policy_version 64980 (0.0006) [2023-03-06 15:55:25,943][04272] Updated weights for policy 0, policy_version 64990 (0.0006) [2023-03-06 15:55:26,748][04272] Updated weights for policy 0, policy_version 65000 (0.0007) [2023-03-06 15:55:27,565][04272] Updated weights for policy 0, policy_version 65010 (0.0006) [2023-03-06 15:55:28,380][04272] Updated weights for policy 0, policy_version 65020 (0.0007) [2023-03-06 15:55:28,940][03942] Fps is (10 sec: 12697.8, 60 sec: 12595.2, 300 sec: 12600.4). Total num frames: 66587648. Throughput: 0: 12600.2. Samples: 66566430. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:55:28,941][03942] Avg episode reward: [(0, '1266.570')] [2023-03-06 15:55:29,190][04272] Updated weights for policy 0, policy_version 65030 (0.0006) [2023-03-06 15:55:30,005][04272] Updated weights for policy 0, policy_version 65040 (0.0006) [2023-03-06 15:55:30,397][04221] KL-divergence is very high: 133.0618 [2023-03-06 15:55:30,804][04272] Updated weights for policy 0, policy_version 65050 (0.0006) [2023-03-06 15:55:31,625][04272] Updated weights for policy 0, policy_version 65060 (0.0006) [2023-03-06 15:55:32,426][04272] Updated weights for policy 0, policy_version 65070 (0.0006) [2023-03-06 15:55:33,238][04272] Updated weights for policy 0, policy_version 65080 (0.0006) [2023-03-06 15:55:33,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12600.4). Total num frames: 66650112. Throughput: 0: 12609.9. Samples: 66642474. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:55:33,941][03942] Avg episode reward: [(0, '1114.333')] [2023-03-06 15:55:34,046][04272] Updated weights for policy 0, policy_version 65090 (0.0007) [2023-03-06 15:55:34,851][04272] Updated weights for policy 0, policy_version 65100 (0.0006) [2023-03-06 15:55:35,665][04272] Updated weights for policy 0, policy_version 65110 (0.0006) [2023-03-06 15:55:36,477][04272] Updated weights for policy 0, policy_version 65120 (0.0006) [2023-03-06 15:55:37,286][04272] Updated weights for policy 0, policy_version 65130 (0.0006) [2023-03-06 15:55:38,088][04272] Updated weights for policy 0, policy_version 65140 (0.0006) [2023-03-06 15:55:38,916][04272] Updated weights for policy 0, policy_version 65150 (0.0006) [2023-03-06 15:55:38,940][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12600.4). Total num frames: 66713600. Throughput: 0: 12613.7. Samples: 66680423. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:55:38,941][03942] Avg episode reward: [(0, '1162.062')] [2023-03-06 15:55:39,742][04272] Updated weights for policy 0, policy_version 65160 (0.0007) [2023-03-06 15:55:40,570][04272] Updated weights for policy 0, policy_version 65170 (0.0006) [2023-03-06 15:55:41,377][04272] Updated weights for policy 0, policy_version 65180 (0.0006) [2023-03-06 15:55:42,187][04272] Updated weights for policy 0, policy_version 65190 (0.0006) [2023-03-06 15:55:42,992][04272] Updated weights for policy 0, policy_version 65200 (0.0007) [2023-03-06 15:55:43,814][04272] Updated weights for policy 0, policy_version 65210 (0.0007) [2023-03-06 15:55:43,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12600.4). Total num frames: 66776064. Throughput: 0: 12612.8. Samples: 66755777. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:55:43,941][03942] Avg episode reward: [(0, '1082.929')] [2023-03-06 15:55:44,624][04272] Updated weights for policy 0, policy_version 65220 (0.0007) [2023-03-06 15:55:45,420][04272] Updated weights for policy 0, policy_version 65230 (0.0006) [2023-03-06 15:55:46,250][04272] Updated weights for policy 0, policy_version 65240 (0.0006) [2023-03-06 15:55:47,057][04272] Updated weights for policy 0, policy_version 65250 (0.0006) [2023-03-06 15:55:47,886][04272] Updated weights for policy 0, policy_version 65260 (0.0007) [2023-03-06 15:55:48,687][04272] Updated weights for policy 0, policy_version 65270 (0.0007) [2023-03-06 15:55:48,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12600.4). Total num frames: 66839552. Throughput: 0: 12609.7. Samples: 66831333. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:55:48,941][03942] Avg episode reward: [(0, '1145.354')] [2023-03-06 15:55:49,492][04272] Updated weights for policy 0, policy_version 65280 (0.0006) [2023-03-06 15:55:50,315][04272] Updated weights for policy 0, policy_version 65290 (0.0007) [2023-03-06 15:55:51,133][04272] Updated weights for policy 0, policy_version 65300 (0.0006) [2023-03-06 15:55:51,942][04272] Updated weights for policy 0, policy_version 65310 (0.0006) [2023-03-06 15:55:52,761][04272] Updated weights for policy 0, policy_version 65320 (0.0007) [2023-03-06 15:55:53,570][04272] Updated weights for policy 0, policy_version 65330 (0.0006) [2023-03-06 15:55:53,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12596.9). Total num frames: 66902016. Throughput: 0: 12610.9. Samples: 66869059. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:55:53,941][03942] Avg episode reward: [(0, '1189.650')] [2023-03-06 15:55:54,368][04272] Updated weights for policy 0, policy_version 65340 (0.0007) [2023-03-06 15:55:55,172][04272] Updated weights for policy 0, policy_version 65350 (0.0007) [2023-03-06 15:55:55,977][04272] Updated weights for policy 0, policy_version 65360 (0.0006) [2023-03-06 15:55:56,792][04272] Updated weights for policy 0, policy_version 65370 (0.0006) [2023-03-06 15:55:57,610][04272] Updated weights for policy 0, policy_version 65380 (0.0006) [2023-03-06 15:55:58,444][04272] Updated weights for policy 0, policy_version 65390 (0.0007) [2023-03-06 15:55:58,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12600.4). Total num frames: 66965504. Throughput: 0: 12616.6. Samples: 66944974. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:55:58,941][03942] Avg episode reward: [(0, '1165.100')] [2023-03-06 15:55:59,247][04272] Updated weights for policy 0, policy_version 65400 (0.0007) [2023-03-06 15:56:00,060][04272] Updated weights for policy 0, policy_version 65410 (0.0006) [2023-03-06 15:56:00,844][04272] Updated weights for policy 0, policy_version 65420 (0.0006) [2023-03-06 15:56:01,678][04272] Updated weights for policy 0, policy_version 65430 (0.0006) [2023-03-06 15:56:02,486][04272] Updated weights for policy 0, policy_version 65440 (0.0006) [2023-03-06 15:56:03,285][04272] Updated weights for policy 0, policy_version 65450 (0.0006) [2023-03-06 15:56:03,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12612.3, 300 sec: 12600.4). Total num frames: 67028992. Throughput: 0: 12613.2. Samples: 67020556. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:56:03,941][03942] Avg episode reward: [(0, '1247.407')] [2023-03-06 15:56:04,097][04272] Updated weights for policy 0, policy_version 65460 (0.0006) [2023-03-06 15:56:04,911][04272] Updated weights for policy 0, policy_version 65470 (0.0006) [2023-03-06 15:56:05,730][04272] Updated weights for policy 0, policy_version 65480 (0.0006) [2023-03-06 15:56:06,540][04272] Updated weights for policy 0, policy_version 65490 (0.0006) [2023-03-06 15:56:07,354][04272] Updated weights for policy 0, policy_version 65500 (0.0006) [2023-03-06 15:56:08,162][04272] Updated weights for policy 0, policy_version 65510 (0.0007) [2023-03-06 15:56:08,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12600.4). Total num frames: 67091456. Throughput: 0: 12615.2. Samples: 67058513. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:56:08,952][03942] Avg episode reward: [(0, '1192.741')] [2023-03-06 15:56:08,978][04272] Updated weights for policy 0, policy_version 65520 (0.0006) [2023-03-06 15:56:09,791][04272] Updated weights for policy 0, policy_version 65530 (0.0006) [2023-03-06 15:56:10,602][04272] Updated weights for policy 0, policy_version 65540 (0.0006) [2023-03-06 15:56:11,406][04272] Updated weights for policy 0, policy_version 65550 (0.0007) [2023-03-06 15:56:12,210][04272] Updated weights for policy 0, policy_version 65560 (0.0006) [2023-03-06 15:56:13,014][04272] Updated weights for policy 0, policy_version 65570 (0.0006) [2023-03-06 15:56:13,843][04272] Updated weights for policy 0, policy_version 65580 (0.0006) [2023-03-06 15:56:13,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12629.3, 300 sec: 12600.4). Total num frames: 67154944. Throughput: 0: 12616.9. Samples: 67134190. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:56:13,951][03942] Avg episode reward: [(0, '1249.510')] [2023-03-06 15:56:14,670][04272] Updated weights for policy 0, policy_version 65590 (0.0006) [2023-03-06 15:56:15,458][04272] Updated weights for policy 0, policy_version 65600 (0.0006) [2023-03-06 15:56:16,271][04272] Updated weights for policy 0, policy_version 65610 (0.0007) [2023-03-06 15:56:17,074][04272] Updated weights for policy 0, policy_version 65620 (0.0006) [2023-03-06 15:56:17,893][04272] Updated weights for policy 0, policy_version 65630 (0.0006) [2023-03-06 15:56:18,718][04272] Updated weights for policy 0, policy_version 65640 (0.0006) [2023-03-06 15:56:18,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12600.4). Total num frames: 67217408. Throughput: 0: 12609.9. Samples: 67209919. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:56:18,952][03942] Avg episode reward: [(0, '1211.403')] [2023-03-06 15:56:19,532][04272] Updated weights for policy 0, policy_version 65650 (0.0007) [2023-03-06 15:56:20,334][04272] Updated weights for policy 0, policy_version 65660 (0.0007) [2023-03-06 15:56:21,154][04272] Updated weights for policy 0, policy_version 65670 (0.0006) [2023-03-06 15:56:21,943][04272] Updated weights for policy 0, policy_version 65680 (0.0007) [2023-03-06 15:56:22,751][04272] Updated weights for policy 0, policy_version 65690 (0.0007) [2023-03-06 15:56:23,566][04272] Updated weights for policy 0, policy_version 65700 (0.0006) [2023-03-06 15:56:23,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12600.4). Total num frames: 67280896. Throughput: 0: 12611.8. Samples: 67247954. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:56:23,952][03942] Avg episode reward: [(0, '998.801')] [2023-03-06 15:56:24,382][04272] Updated weights for policy 0, policy_version 65710 (0.0006) [2023-03-06 15:56:25,198][04272] Updated weights for policy 0, policy_version 65720 (0.0006) [2023-03-06 15:56:26,011][04272] Updated weights for policy 0, policy_version 65730 (0.0006) [2023-03-06 15:56:26,828][04272] Updated weights for policy 0, policy_version 65740 (0.0006) [2023-03-06 15:56:27,658][04272] Updated weights for policy 0, policy_version 65750 (0.0007) [2023-03-06 15:56:28,466][04272] Updated weights for policy 0, policy_version 65760 (0.0007) [2023-03-06 15:56:28,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12596.9). Total num frames: 67343360. Throughput: 0: 12617.3. Samples: 67323555. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:56:28,951][03942] Avg episode reward: [(0, '905.732')] [2023-03-06 15:56:29,266][04272] Updated weights for policy 0, policy_version 65770 (0.0006) [2023-03-06 15:56:30,086][04272] Updated weights for policy 0, policy_version 65780 (0.0006) [2023-03-06 15:56:30,873][04272] Updated weights for policy 0, policy_version 65790 (0.0006) [2023-03-06 15:56:31,685][04272] Updated weights for policy 0, policy_version 65800 (0.0006) [2023-03-06 15:56:32,484][04272] Updated weights for policy 0, policy_version 65810 (0.0007) [2023-03-06 15:56:33,316][04272] Updated weights for policy 0, policy_version 65820 (0.0007) [2023-03-06 15:56:33,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12600.4). Total num frames: 67406848. Throughput: 0: 12620.9. Samples: 67399272. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:56:33,951][03942] Avg episode reward: [(0, '1178.962')] [2023-03-06 15:56:34,130][04272] Updated weights for policy 0, policy_version 65830 (0.0007) [2023-03-06 15:56:34,926][04272] Updated weights for policy 0, policy_version 65840 (0.0006) [2023-03-06 15:56:35,761][04272] Updated weights for policy 0, policy_version 65850 (0.0006) [2023-03-06 15:56:36,557][04272] Updated weights for policy 0, policy_version 65860 (0.0006) [2023-03-06 15:56:37,373][04272] Updated weights for policy 0, policy_version 65870 (0.0006) [2023-03-06 15:56:38,188][04272] Updated weights for policy 0, policy_version 65880 (0.0007) [2023-03-06 15:56:38,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12612.3, 300 sec: 12603.9). Total num frames: 67470336. Throughput: 0: 12618.9. Samples: 67436908. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:56:38,951][03942] Avg episode reward: [(0, '1185.208')] [2023-03-06 15:56:38,985][04272] Updated weights for policy 0, policy_version 65890 (0.0006) [2023-03-06 15:56:39,793][04272] Updated weights for policy 0, policy_version 65900 (0.0007) [2023-03-06 15:56:40,609][04272] Updated weights for policy 0, policy_version 65910 (0.0006) [2023-03-06 15:56:41,422][04272] Updated weights for policy 0, policy_version 65920 (0.0007) [2023-03-06 15:56:42,239][04272] Updated weights for policy 0, policy_version 65930 (0.0006) [2023-03-06 15:56:43,059][04272] Updated weights for policy 0, policy_version 65940 (0.0006) [2023-03-06 15:56:43,875][04272] Updated weights for policy 0, policy_version 65950 (0.0006) [2023-03-06 15:56:43,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12600.4). Total num frames: 67532800. Throughput: 0: 12616.0. Samples: 67512694. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:56:43,941][03942] Avg episode reward: [(0, '1153.227')] [2023-03-06 15:56:44,694][04272] Updated weights for policy 0, policy_version 65960 (0.0006) [2023-03-06 15:56:45,513][04272] Updated weights for policy 0, policy_version 65970 (0.0006) [2023-03-06 15:56:46,333][04272] Updated weights for policy 0, policy_version 65980 (0.0006) [2023-03-06 15:56:47,124][04272] Updated weights for policy 0, policy_version 65990 (0.0006) [2023-03-06 15:56:47,951][04272] Updated weights for policy 0, policy_version 66000 (0.0006) [2023-03-06 15:56:48,756][04272] Updated weights for policy 0, policy_version 66010 (0.0007) [2023-03-06 15:56:48,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12603.9). Total num frames: 67596288. Throughput: 0: 12613.6. Samples: 67588170. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:56:48,941][03942] Avg episode reward: [(0, '1158.703')] [2023-03-06 15:56:49,558][04272] Updated weights for policy 0, policy_version 66020 (0.0006) [2023-03-06 15:56:50,366][04272] Updated weights for policy 0, policy_version 66030 (0.0007) [2023-03-06 15:56:51,184][04272] Updated weights for policy 0, policy_version 66040 (0.0007) [2023-03-06 15:56:51,985][04272] Updated weights for policy 0, policy_version 66050 (0.0006) [2023-03-06 15:56:52,799][04272] Updated weights for policy 0, policy_version 66060 (0.0007) [2023-03-06 15:56:53,614][04272] Updated weights for policy 0, policy_version 66070 (0.0006) [2023-03-06 15:56:53,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12629.3, 300 sec: 12603.9). Total num frames: 67659776. Throughput: 0: 12613.0. Samples: 67626096. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:56:53,951][03942] Avg episode reward: [(0, '1081.729')] [2023-03-06 15:56:54,427][04272] Updated weights for policy 0, policy_version 66080 (0.0006) [2023-03-06 15:56:55,229][04272] Updated weights for policy 0, policy_version 66090 (0.0006) [2023-03-06 15:56:56,050][04272] Updated weights for policy 0, policy_version 66100 (0.0007) [2023-03-06 15:56:56,857][04272] Updated weights for policy 0, policy_version 66110 (0.0006) [2023-03-06 15:56:57,671][04272] Updated weights for policy 0, policy_version 66120 (0.0006) [2023-03-06 15:56:58,492][04272] Updated weights for policy 0, policy_version 66130 (0.0006) [2023-03-06 15:56:58,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.2, 300 sec: 12603.9). Total num frames: 67722240. Throughput: 0: 12614.9. Samples: 67701862. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:56:58,952][03942] Avg episode reward: [(0, '965.149')] [2023-03-06 15:56:59,299][04272] Updated weights for policy 0, policy_version 66140 (0.0007) [2023-03-06 15:57:00,110][04272] Updated weights for policy 0, policy_version 66150 (0.0006) [2023-03-06 15:57:00,933][04272] Updated weights for policy 0, policy_version 66160 (0.0006) [2023-03-06 15:57:01,737][04272] Updated weights for policy 0, policy_version 66170 (0.0006) [2023-03-06 15:57:02,557][04272] Updated weights for policy 0, policy_version 66180 (0.0007) [2023-03-06 15:57:03,379][04272] Updated weights for policy 0, policy_version 66190 (0.0007) [2023-03-06 15:57:03,941][03942] Fps is (10 sec: 12492.7, 60 sec: 12595.2, 300 sec: 12600.4). Total num frames: 67784704. Throughput: 0: 12608.0. Samples: 67777277. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 15:57:03,952][03942] Avg episode reward: [(0, '1134.973')] [2023-03-06 15:57:04,185][04272] Updated weights for policy 0, policy_version 66200 (0.0007) [2023-03-06 15:57:05,005][04272] Updated weights for policy 0, policy_version 66210 (0.0006) [2023-03-06 15:57:05,819][04272] Updated weights for policy 0, policy_version 66220 (0.0006) [2023-03-06 15:57:06,631][04272] Updated weights for policy 0, policy_version 66230 (0.0007) [2023-03-06 15:57:07,445][04272] Updated weights for policy 0, policy_version 66240 (0.0006) [2023-03-06 15:57:08,261][04272] Updated weights for policy 0, policy_version 66250 (0.0006) [2023-03-06 15:57:08,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12600.4). Total num frames: 67848192. Throughput: 0: 12601.3. Samples: 67815012. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:57:08,952][03942] Avg episode reward: [(0, '1202.063')] [2023-03-06 15:57:08,955][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000066258_67848192.pth... 
[2023-03-06 15:57:08,986][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000063304_64823296.pth [2023-03-06 15:57:09,055][04272] Updated weights for policy 0, policy_version 66260 (0.0006) [2023-03-06 15:57:09,878][04272] Updated weights for policy 0, policy_version 66270 (0.0008) [2023-03-06 15:57:10,669][04272] Updated weights for policy 0, policy_version 66280 (0.0007) [2023-03-06 15:57:11,502][04272] Updated weights for policy 0, policy_version 66290 (0.0006) [2023-03-06 15:57:12,298][04272] Updated weights for policy 0, policy_version 66300 (0.0006) [2023-03-06 15:57:13,114][04272] Updated weights for policy 0, policy_version 66310 (0.0006) [2023-03-06 15:57:13,925][04272] Updated weights for policy 0, policy_version 66320 (0.0006) [2023-03-06 15:57:13,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12612.2, 300 sec: 12603.9). Total num frames: 67911680. Throughput: 0: 12609.2. Samples: 67890972. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:57:13,952][03942] Avg episode reward: [(0, '1240.024')] [2023-03-06 15:57:14,708][04272] Updated weights for policy 0, policy_version 66330 (0.0006) [2023-03-06 15:57:15,506][04272] Updated weights for policy 0, policy_version 66340 (0.0006) [2023-03-06 15:57:16,321][04272] Updated weights for policy 0, policy_version 66350 (0.0006) [2023-03-06 15:57:17,118][04272] Updated weights for policy 0, policy_version 66360 (0.0006) [2023-03-06 15:57:17,925][04272] Updated weights for policy 0, policy_version 66370 (0.0006) [2023-03-06 15:57:18,748][04272] Updated weights for policy 0, policy_version 66380 (0.0007) [2023-03-06 15:57:18,940][03942] Fps is (10 sec: 12697.8, 60 sec: 12629.4, 300 sec: 12603.9). Total num frames: 67975168. Throughput: 0: 12622.8. Samples: 67967299. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:57:18,951][03942] Avg episode reward: [(0, '1254.508')] [2023-03-06 15:57:19,551][04272] Updated weights for policy 0, policy_version 66390 (0.0007) [2023-03-06 15:57:20,373][04272] Updated weights for policy 0, policy_version 66400 (0.0007) [2023-03-06 15:57:21,197][04272] Updated weights for policy 0, policy_version 66410 (0.0005) [2023-03-06 15:57:22,007][04272] Updated weights for policy 0, policy_version 66420 (0.0006) [2023-03-06 15:57:22,813][04272] Updated weights for policy 0, policy_version 66430 (0.0007) [2023-03-06 15:57:23,607][04272] Updated weights for policy 0, policy_version 66440 (0.0006) [2023-03-06 15:57:23,940][03942] Fps is (10 sec: 12697.8, 60 sec: 12629.3, 300 sec: 12603.9). Total num frames: 68038656. Throughput: 0: 12625.3. Samples: 68005049. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:57:23,952][03942] Avg episode reward: [(0, '1325.737')] [2023-03-06 15:57:24,425][04272] Updated weights for policy 0, policy_version 66450 (0.0005) [2023-03-06 15:57:25,226][04272] Updated weights for policy 0, policy_version 66460 (0.0007) [2023-03-06 15:57:26,040][04272] Updated weights for policy 0, policy_version 66470 (0.0006) [2023-03-06 15:57:26,848][04272] Updated weights for policy 0, policy_version 66480 (0.0006) [2023-03-06 15:57:27,665][04272] Updated weights for policy 0, policy_version 66490 (0.0007) [2023-03-06 15:57:28,490][04272] Updated weights for policy 0, policy_version 66500 (0.0006) [2023-03-06 15:57:28,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12629.3, 300 sec: 12603.9). Total num frames: 68101120. Throughput: 0: 12624.4. Samples: 68080790. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:57:28,951][03942] Avg episode reward: [(0, '1302.441')] [2023-03-06 15:57:29,303][04272] Updated weights for policy 0, policy_version 66510 (0.0006) [2023-03-06 15:57:30,099][04272] Updated weights for policy 0, policy_version 66520 (0.0007) [2023-03-06 15:57:30,934][04272] Updated weights for policy 0, policy_version 66530 (0.0007) [2023-03-06 15:57:31,737][04272] Updated weights for policy 0, policy_version 66540 (0.0006) [2023-03-06 15:57:32,539][04272] Updated weights for policy 0, policy_version 66550 (0.0006) [2023-03-06 15:57:33,366][04272] Updated weights for policy 0, policy_version 66560 (0.0007) [2023-03-06 15:57:33,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12603.9). Total num frames: 68164608. Throughput: 0: 12629.6. Samples: 68156501. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:57:33,951][03942] Avg episode reward: [(0, '1345.005')] [2023-03-06 15:57:34,161][04272] Updated weights for policy 0, policy_version 66570 (0.0007) [2023-03-06 15:57:34,970][04272] Updated weights for policy 0, policy_version 66580 (0.0006) [2023-03-06 15:57:35,782][04272] Updated weights for policy 0, policy_version 66590 (0.0006) [2023-03-06 15:57:36,610][04272] Updated weights for policy 0, policy_version 66600 (0.0006) [2023-03-06 15:57:37,405][04272] Updated weights for policy 0, policy_version 66610 (0.0007) [2023-03-06 15:57:38,230][04272] Updated weights for policy 0, policy_version 66620 (0.0007) [2023-03-06 15:57:38,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12603.9). Total num frames: 68227072. Throughput: 0: 12625.9. Samples: 68194264. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:57:38,951][03942] Avg episode reward: [(0, '1302.117')] [2023-03-06 15:57:39,057][04272] Updated weights for policy 0, policy_version 66630 (0.0007) [2023-03-06 15:57:39,851][04272] Updated weights for policy 0, policy_version 66640 (0.0007) [2023-03-06 15:57:40,673][04272] Updated weights for policy 0, policy_version 66650 (0.0007) [2023-03-06 15:57:41,505][04272] Updated weights for policy 0, policy_version 66660 (0.0006) [2023-03-06 15:57:42,296][04272] Updated weights for policy 0, policy_version 66670 (0.0007) [2023-03-06 15:57:43,100][04272] Updated weights for policy 0, policy_version 66680 (0.0006) [2023-03-06 15:57:43,899][04272] Updated weights for policy 0, policy_version 66690 (0.0007) [2023-03-06 15:57:43,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12607.4). Total num frames: 68290560. Throughput: 0: 12619.6. Samples: 68269743. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:57:43,951][03942] Avg episode reward: [(0, '1295.094')] [2023-03-06 15:57:44,718][04272] Updated weights for policy 0, policy_version 66700 (0.0007) [2023-03-06 15:57:45,536][04272] Updated weights for policy 0, policy_version 66710 (0.0006) [2023-03-06 15:57:46,338][04272] Updated weights for policy 0, policy_version 66720 (0.0006) [2023-03-06 15:57:47,156][04272] Updated weights for policy 0, policy_version 66730 (0.0006) [2023-03-06 15:57:47,983][04272] Updated weights for policy 0, policy_version 66740 (0.0006) [2023-03-06 15:57:48,801][04272] Updated weights for policy 0, policy_version 66750 (0.0006) [2023-03-06 15:57:48,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12603.9). Total num frames: 68353024. Throughput: 0: 12623.0. Samples: 68345313. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:57:48,952][03942] Avg episode reward: [(0, '1220.130')] [2023-03-06 15:57:49,612][04272] Updated weights for policy 0, policy_version 66760 (0.0006) [2023-03-06 15:57:50,428][04272] Updated weights for policy 0, policy_version 66770 (0.0007) [2023-03-06 15:57:51,246][04272] Updated weights for policy 0, policy_version 66780 (0.0007) [2023-03-06 15:57:52,058][04272] Updated weights for policy 0, policy_version 66790 (0.0006) [2023-03-06 15:57:52,865][04272] Updated weights for policy 0, policy_version 66800 (0.0006) [2023-03-06 15:57:53,659][04272] Updated weights for policy 0, policy_version 66810 (0.0006) [2023-03-06 15:57:53,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.2, 300 sec: 12607.3). Total num frames: 68416512. Throughput: 0: 12627.9. Samples: 68383268. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:57:53,952][03942] Avg episode reward: [(0, '1228.812')] [2023-03-06 15:57:54,487][04272] Updated weights for policy 0, policy_version 66820 (0.0006) [2023-03-06 15:57:55,293][04272] Updated weights for policy 0, policy_version 66830 (0.0007) [2023-03-06 15:57:56,118][04272] Updated weights for policy 0, policy_version 66840 (0.0006) [2023-03-06 15:57:56,930][04272] Updated weights for policy 0, policy_version 66850 (0.0006) [2023-03-06 15:57:57,735][04272] Updated weights for policy 0, policy_version 66860 (0.0006) [2023-03-06 15:57:58,570][04272] Updated weights for policy 0, policy_version 66870 (0.0007) [2023-03-06 15:57:58,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12603.9). Total num frames: 68478976. Throughput: 0: 12618.2. Samples: 68458791. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:57:58,952][03942] Avg episode reward: [(0, '1251.550')] [2023-03-06 15:57:59,370][04272] Updated weights for policy 0, policy_version 66880 (0.0006) [2023-03-06 15:58:00,177][04272] Updated weights for policy 0, policy_version 66890 (0.0006) [2023-03-06 15:58:01,006][04272] Updated weights for policy 0, policy_version 66900 (0.0006) [2023-03-06 15:58:01,813][04272] Updated weights for policy 0, policy_version 66910 (0.0006) [2023-03-06 15:58:02,613][04272] Updated weights for policy 0, policy_version 66920 (0.0008) [2023-03-06 15:58:03,470][04272] Updated weights for policy 0, policy_version 66930 (0.0007) [2023-03-06 15:58:03,941][03942] Fps is (10 sec: 12492.8, 60 sec: 12612.3, 300 sec: 12603.9). Total num frames: 68541440. Throughput: 0: 12598.3. Samples: 68534221. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:58:03,951][03942] Avg episode reward: [(0, '1213.944')] [2023-03-06 15:58:04,266][04272] Updated weights for policy 0, policy_version 66940 (0.0007) [2023-03-06 15:58:05,092][04272] Updated weights for policy 0, policy_version 66950 (0.0006) [2023-03-06 15:58:05,924][04272] Updated weights for policy 0, policy_version 66960 (0.0006) [2023-03-06 15:58:06,732][04272] Updated weights for policy 0, policy_version 66970 (0.0006) [2023-03-06 15:58:07,537][04272] Updated weights for policy 0, policy_version 66980 (0.0006) [2023-03-06 15:58:08,349][04272] Updated weights for policy 0, policy_version 66990 (0.0006) [2023-03-06 15:58:08,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12603.9). Total num frames: 68604928. Throughput: 0: 12593.6. Samples: 68571761. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:58:08,952][03942] Avg episode reward: [(0, '1251.744')] [2023-03-06 15:58:09,148][04272] Updated weights for policy 0, policy_version 67000 (0.0007) [2023-03-06 15:58:09,949][04272] Updated weights for policy 0, policy_version 67010 (0.0006) [2023-03-06 15:58:10,756][04272] Updated weights for policy 0, policy_version 67020 (0.0006) [2023-03-06 15:58:11,575][04272] Updated weights for policy 0, policy_version 67030 (0.0006) [2023-03-06 15:58:12,373][04272] Updated weights for policy 0, policy_version 67040 (0.0007) [2023-03-06 15:58:13,194][04272] Updated weights for policy 0, policy_version 67050 (0.0006) [2023-03-06 15:58:13,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12603.9). Total num frames: 68667392. Throughput: 0: 12594.8. Samples: 68647555. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:58:13,952][03942] Avg episode reward: [(0, '1266.279')] [2023-03-06 15:58:14,004][04272] Updated weights for policy 0, policy_version 67060 (0.0006) [2023-03-06 15:58:14,819][04272] Updated weights for policy 0, policy_version 67070 (0.0008) [2023-03-06 15:58:15,627][04272] Updated weights for policy 0, policy_version 67080 (0.0006) [2023-03-06 15:58:16,426][04272] Updated weights for policy 0, policy_version 67090 (0.0006) [2023-03-06 15:58:17,226][04272] Updated weights for policy 0, policy_version 67100 (0.0007) [2023-03-06 15:58:18,056][04272] Updated weights for policy 0, policy_version 67110 (0.0006) [2023-03-06 15:58:18,864][04272] Updated weights for policy 0, policy_version 67120 (0.0007) [2023-03-06 15:58:18,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12612.3, 300 sec: 12607.3). Total num frames: 68731904. Throughput: 0: 12598.7. Samples: 68723441. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:58:18,951][03942] Avg episode reward: [(0, '1282.099')] [2023-03-06 15:58:19,676][04272] Updated weights for policy 0, policy_version 67130 (0.0006) [2023-03-06 15:58:20,498][04272] Updated weights for policy 0, policy_version 67140 (0.0006) [2023-03-06 15:58:21,291][04272] Updated weights for policy 0, policy_version 67150 (0.0007) [2023-03-06 15:58:22,132][04272] Updated weights for policy 0, policy_version 67160 (0.0007) [2023-03-06 15:58:22,929][04272] Updated weights for policy 0, policy_version 67170 (0.0006) [2023-03-06 15:58:23,734][04272] Updated weights for policy 0, policy_version 67180 (0.0006) [2023-03-06 15:58:23,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12595.2, 300 sec: 12603.9). Total num frames: 68794368. Throughput: 0: 12604.3. Samples: 68761458. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:58:23,952][03942] Avg episode reward: [(0, '1266.892')] [2023-03-06 15:58:24,551][04272] Updated weights for policy 0, policy_version 67190 (0.0006) [2023-03-06 15:58:25,358][04272] Updated weights for policy 0, policy_version 67200 (0.0006) [2023-03-06 15:58:26,175][04272] Updated weights for policy 0, policy_version 67210 (0.0006) [2023-03-06 15:58:26,992][04272] Updated weights for policy 0, policy_version 67220 (0.0008) [2023-03-06 15:58:27,783][04272] Updated weights for policy 0, policy_version 67230 (0.0007) [2023-03-06 15:58:28,604][04272] Updated weights for policy 0, policy_version 67240 (0.0007) [2023-03-06 15:58:28,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12610.8). Total num frames: 68857856. Throughput: 0: 12606.7. Samples: 68837043. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:58:28,951][03942] Avg episode reward: [(0, '1265.077')] [2023-03-06 15:58:29,421][04272] Updated weights for policy 0, policy_version 67250 (0.0007) [2023-03-06 15:58:30,226][04272] Updated weights for policy 0, policy_version 67260 (0.0005) [2023-03-06 15:58:31,034][04272] Updated weights for policy 0, policy_version 67270 (0.0006) [2023-03-06 15:58:31,849][04272] Updated weights for policy 0, policy_version 67280 (0.0006) [2023-03-06 15:58:32,665][04272] Updated weights for policy 0, policy_version 67290 (0.0007) [2023-03-06 15:58:33,451][04272] Updated weights for policy 0, policy_version 67300 (0.0007) [2023-03-06 15:58:33,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12612.3, 300 sec: 12610.8). Total num frames: 68921344. Throughput: 0: 12614.8. Samples: 68912980. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:58:33,941][03942] Avg episode reward: [(0, '1342.651')] [2023-03-06 15:58:34,259][04272] Updated weights for policy 0, policy_version 67310 (0.0006) [2023-03-06 15:58:35,055][04272] Updated weights for policy 0, policy_version 67320 (0.0007) [2023-03-06 15:58:35,880][04272] Updated weights for policy 0, policy_version 67330 (0.0007) [2023-03-06 15:58:36,693][04272] Updated weights for policy 0, policy_version 67340 (0.0006) [2023-03-06 15:58:37,505][04272] Updated weights for policy 0, policy_version 67350 (0.0007) [2023-03-06 15:58:38,323][04272] Updated weights for policy 0, policy_version 67360 (0.0007) [2023-03-06 15:58:38,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12610.8). Total num frames: 68983808. Throughput: 0: 12616.7. Samples: 68951020. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:58:38,941][03942] Avg episode reward: [(0, '1196.509')] [2023-03-06 15:58:39,133][04272] Updated weights for policy 0, policy_version 67370 (0.0007) [2023-03-06 15:58:39,945][04272] Updated weights for policy 0, policy_version 67380 (0.0006) [2023-03-06 15:58:40,762][04272] Updated weights for policy 0, policy_version 67390 (0.0006) [2023-03-06 15:58:41,570][04272] Updated weights for policy 0, policy_version 67400 (0.0006) [2023-03-06 15:58:42,394][04272] Updated weights for policy 0, policy_version 67410 (0.0006) [2023-03-06 15:58:43,186][04272] Updated weights for policy 0, policy_version 67420 (0.0007) [2023-03-06 15:58:43,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12610.8). Total num frames: 69047296. Throughput: 0: 12615.9. Samples: 69026504. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:58:43,941][03942] Avg episode reward: [(0, '1234.391')] [2023-03-06 15:58:43,998][04272] Updated weights for policy 0, policy_version 67430 (0.0006) [2023-03-06 15:58:44,823][04272] Updated weights for policy 0, policy_version 67440 (0.0005) [2023-03-06 15:58:45,611][04272] Updated weights for policy 0, policy_version 67450 (0.0006) [2023-03-06 15:58:46,438][04272] Updated weights for policy 0, policy_version 67460 (0.0007) [2023-03-06 15:58:47,253][04272] Updated weights for policy 0, policy_version 67470 (0.0007) [2023-03-06 15:58:48,066][04272] Updated weights for policy 0, policy_version 67480 (0.0007) [2023-03-06 15:58:48,882][04272] Updated weights for policy 0, policy_version 67490 (0.0006) [2023-03-06 15:58:48,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12610.8). Total num frames: 69109760. Throughput: 0: 12621.7. Samples: 69102197. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:58:48,941][03942] Avg episode reward: [(0, '1161.889')] [2023-03-06 15:58:49,705][04272] Updated weights for policy 0, policy_version 67500 (0.0006) [2023-03-06 15:58:50,495][04272] Updated weights for policy 0, policy_version 67510 (0.0006) [2023-03-06 15:58:51,312][04272] Updated weights for policy 0, policy_version 67520 (0.0006) [2023-03-06 15:58:52,134][04272] Updated weights for policy 0, policy_version 67530 (0.0006) [2023-03-06 15:58:52,939][04272] Updated weights for policy 0, policy_version 67540 (0.0007) [2023-03-06 15:58:53,755][04272] Updated weights for policy 0, policy_version 67550 (0.0006) [2023-03-06 15:58:53,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12610.8). Total num frames: 69173248. Throughput: 0: 12630.1. Samples: 69140117. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:58:53,941][03942] Avg episode reward: [(0, '1310.017')] [2023-03-06 15:58:54,573][04272] Updated weights for policy 0, policy_version 67560 (0.0006) [2023-03-06 15:58:55,394][04272] Updated weights for policy 0, policy_version 67570 (0.0006) [2023-03-06 15:58:56,205][04272] Updated weights for policy 0, policy_version 67580 (0.0006) [2023-03-06 15:58:57,003][04272] Updated weights for policy 0, policy_version 67590 (0.0007) [2023-03-06 15:58:57,814][04272] Updated weights for policy 0, policy_version 67600 (0.0006) [2023-03-06 15:58:58,618][04272] Updated weights for policy 0, policy_version 67610 (0.0006) [2023-03-06 15:58:58,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12629.4, 300 sec: 12610.8). Total num frames: 69236736. Throughput: 0: 12626.3. Samples: 69215736. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:58:58,941][03942] Avg episode reward: [(0, '1162.445')] [2023-03-06 15:58:59,442][04272] Updated weights for policy 0, policy_version 67620 (0.0007) [2023-03-06 15:59:00,281][04272] Updated weights for policy 0, policy_version 67630 (0.0007) [2023-03-06 15:59:01,085][04272] Updated weights for policy 0, policy_version 67640 (0.0006) [2023-03-06 15:59:01,883][04272] Updated weights for policy 0, policy_version 67650 (0.0006) [2023-03-06 15:59:02,704][04272] Updated weights for policy 0, policy_version 67660 (0.0006) [2023-03-06 15:59:03,530][04272] Updated weights for policy 0, policy_version 67670 (0.0007) [2023-03-06 15:59:03,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12610.8). Total num frames: 69299200. Throughput: 0: 12615.1. Samples: 69291121. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 15:59:03,941][03942] Avg episode reward: [(0, '1297.342')] [2023-03-06 15:59:04,342][04272] Updated weights for policy 0, policy_version 67680 (0.0006) [2023-03-06 15:59:05,139][04272] Updated weights for policy 0, policy_version 67690 (0.0006) [2023-03-06 15:59:05,937][04272] Updated weights for policy 0, policy_version 67700 (0.0007) [2023-03-06 15:59:06,752][04272] Updated weights for policy 0, policy_version 67710 (0.0007) [2023-03-06 15:59:07,549][04272] Updated weights for policy 0, policy_version 67720 (0.0006) [2023-03-06 15:59:08,349][04272] Updated weights for policy 0, policy_version 67730 (0.0006) [2023-03-06 15:59:08,941][03942] Fps is (10 sec: 12492.7, 60 sec: 12612.3, 300 sec: 12607.3). Total num frames: 69361664. Throughput: 0: 12612.8. Samples: 69329036. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:59:08,941][03942] Avg episode reward: [(0, '1382.747')] [2023-03-06 15:59:08,948][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000067737_69362688.pth... 
[2023-03-06 15:59:08,980][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000064780_66334720.pth [2023-03-06 15:59:09,178][04272] Updated weights for policy 0, policy_version 67740 (0.0006) [2023-03-06 15:59:09,993][04272] Updated weights for policy 0, policy_version 67750 (0.0007) [2023-03-06 15:59:10,808][04272] Updated weights for policy 0, policy_version 67760 (0.0006) [2023-03-06 15:59:11,629][04272] Updated weights for policy 0, policy_version 67770 (0.0007) [2023-03-06 15:59:12,433][04272] Updated weights for policy 0, policy_version 67780 (0.0006) [2023-03-06 15:59:13,253][04272] Updated weights for policy 0, policy_version 67790 (0.0006) [2023-03-06 15:59:13,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12629.3, 300 sec: 12607.3). Total num frames: 69425152. Throughput: 0: 12612.3. Samples: 69404598. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:59:13,941][03942] Avg episode reward: [(0, '1311.362')] [2023-03-06 15:59:14,061][04272] Updated weights for policy 0, policy_version 67800 (0.0006) [2023-03-06 15:59:14,867][04272] Updated weights for policy 0, policy_version 67810 (0.0006) [2023-03-06 15:59:15,702][04272] Updated weights for policy 0, policy_version 67820 (0.0007) [2023-03-06 15:59:16,510][04272] Updated weights for policy 0, policy_version 67830 (0.0006) [2023-03-06 15:59:17,342][04272] Updated weights for policy 0, policy_version 67840 (0.0006) [2023-03-06 15:59:18,129][04272] Updated weights for policy 0, policy_version 67850 (0.0007) [2023-03-06 15:59:18,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12607.3). Total num frames: 69487616. Throughput: 0: 12607.1. Samples: 69480300. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:59:18,941][03942] Avg episode reward: [(0, '1257.076')] [2023-03-06 15:59:18,949][04272] Updated weights for policy 0, policy_version 67860 (0.0006) [2023-03-06 15:59:19,764][04272] Updated weights for policy 0, policy_version 67870 (0.0007) [2023-03-06 15:59:20,555][04272] Updated weights for policy 0, policy_version 67880 (0.0006) [2023-03-06 15:59:21,382][04272] Updated weights for policy 0, policy_version 67890 (0.0007) [2023-03-06 15:59:22,192][04272] Updated weights for policy 0, policy_version 67900 (0.0007) [2023-03-06 15:59:22,987][04272] Updated weights for policy 0, policy_version 67910 (0.0007) [2023-03-06 15:59:23,818][04272] Updated weights for policy 0, policy_version 67920 (0.0006) [2023-03-06 15:59:23,940][03942] Fps is (10 sec: 12595.4, 60 sec: 12612.3, 300 sec: 12607.4). Total num frames: 69551104. Throughput: 0: 12602.2. Samples: 69518119. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:59:23,941][03942] Avg episode reward: [(0, '1267.597')] [2023-03-06 15:59:24,633][04272] Updated weights for policy 0, policy_version 67930 (0.0007) [2023-03-06 15:59:25,435][04272] Updated weights for policy 0, policy_version 67940 (0.0006) [2023-03-06 15:59:26,267][04272] Updated weights for policy 0, policy_version 67950 (0.0006) [2023-03-06 15:59:27,087][04272] Updated weights for policy 0, policy_version 67960 (0.0006) [2023-03-06 15:59:27,898][04272] Updated weights for policy 0, policy_version 67970 (0.0006) [2023-03-06 15:59:28,713][04272] Updated weights for policy 0, policy_version 67980 (0.0006) [2023-03-06 15:59:28,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12607.4). Total num frames: 69613568. Throughput: 0: 12599.0. Samples: 69593460. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:59:28,941][03942] Avg episode reward: [(0, '1362.777')] [2023-03-06 15:59:29,514][04272] Updated weights for policy 0, policy_version 67990 (0.0006) [2023-03-06 15:59:30,329][04272] Updated weights for policy 0, policy_version 68000 (0.0006) [2023-03-06 15:59:31,132][04272] Updated weights for policy 0, policy_version 68010 (0.0007) [2023-03-06 15:59:31,933][04272] Updated weights for policy 0, policy_version 68020 (0.0006) [2023-03-06 15:59:32,766][04272] Updated weights for policy 0, policy_version 68030 (0.0007) [2023-03-06 15:59:33,553][04272] Updated weights for policy 0, policy_version 68040 (0.0006) [2023-03-06 15:59:33,941][03942] Fps is (10 sec: 12594.9, 60 sec: 12595.2, 300 sec: 12610.8). Total num frames: 69677056. Throughput: 0: 12603.2. Samples: 69669341. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:59:33,941][03942] Avg episode reward: [(0, '1304.325')] [2023-03-06 15:59:34,371][04272] Updated weights for policy 0, policy_version 68050 (0.0006) [2023-03-06 15:59:35,174][04272] Updated weights for policy 0, policy_version 68060 (0.0006) [2023-03-06 15:59:35,995][04272] Updated weights for policy 0, policy_version 68070 (0.0006) [2023-03-06 15:59:36,819][04272] Updated weights for policy 0, policy_version 68080 (0.0007) [2023-03-06 15:59:37,621][04272] Updated weights for policy 0, policy_version 68090 (0.0006) [2023-03-06 15:59:38,437][04272] Updated weights for policy 0, policy_version 68100 (0.0007) [2023-03-06 15:59:38,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12612.3, 300 sec: 12610.8). Total num frames: 69740544. Throughput: 0: 12602.7. Samples: 69707240. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:59:38,941][03942] Avg episode reward: [(0, '1196.526')] [2023-03-06 15:59:39,250][04272] Updated weights for policy 0, policy_version 68110 (0.0006) [2023-03-06 15:59:40,059][04272] Updated weights for policy 0, policy_version 68120 (0.0006) [2023-03-06 15:59:40,862][04272] Updated weights for policy 0, policy_version 68130 (0.0006) [2023-03-06 15:59:41,693][04272] Updated weights for policy 0, policy_version 68140 (0.0006) [2023-03-06 15:59:42,499][04272] Updated weights for policy 0, policy_version 68150 (0.0006) [2023-03-06 15:59:43,325][04272] Updated weights for policy 0, policy_version 68160 (0.0006) [2023-03-06 15:59:43,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12610.8). Total num frames: 69803008. Throughput: 0: 12598.6. Samples: 69782673. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:59:43,941][03942] Avg episode reward: [(0, '1248.292')] [2023-03-06 15:59:44,138][04272] Updated weights for policy 0, policy_version 68170 (0.0006) [2023-03-06 15:59:44,946][04272] Updated weights for policy 0, policy_version 68180 (0.0006) [2023-03-06 15:59:45,766][04272] Updated weights for policy 0, policy_version 68190 (0.0006) [2023-03-06 15:59:46,574][04272] Updated weights for policy 0, policy_version 68200 (0.0007) [2023-03-06 15:59:47,387][04272] Updated weights for policy 0, policy_version 68210 (0.0006) [2023-03-06 15:59:48,202][04272] Updated weights for policy 0, policy_version 68220 (0.0006) [2023-03-06 15:59:48,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12610.8). Total num frames: 69866496. Throughput: 0: 12605.1. Samples: 69858352. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:59:48,941][03942] Avg episode reward: [(0, '1335.307')] [2023-03-06 15:59:49,007][04272] Updated weights for policy 0, policy_version 68230 (0.0006) [2023-03-06 15:59:49,813][04272] Updated weights for policy 0, policy_version 68240 (0.0006) [2023-03-06 15:59:50,639][04272] Updated weights for policy 0, policy_version 68250 (0.0007) [2023-03-06 15:59:51,442][04272] Updated weights for policy 0, policy_version 68260 (0.0006) [2023-03-06 15:59:52,266][04272] Updated weights for policy 0, policy_version 68270 (0.0006) [2023-03-06 15:59:53,066][04272] Updated weights for policy 0, policy_version 68280 (0.0006) [2023-03-06 15:59:53,869][04272] Updated weights for policy 0, policy_version 68290 (0.0006) [2023-03-06 15:59:53,941][03942] Fps is (10 sec: 12595.0, 60 sec: 12595.2, 300 sec: 12610.8). Total num frames: 69928960. Throughput: 0: 12602.5. Samples: 69896151. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:59:53,941][03942] Avg episode reward: [(0, '1318.409')] [2023-03-06 15:59:54,693][04272] Updated weights for policy 0, policy_version 68300 (0.0007) [2023-03-06 15:59:55,492][04272] Updated weights for policy 0, policy_version 68310 (0.0006) [2023-03-06 15:59:56,297][04272] Updated weights for policy 0, policy_version 68320 (0.0006) [2023-03-06 15:59:57,117][04272] Updated weights for policy 0, policy_version 68330 (0.0006) [2023-03-06 15:59:57,931][04272] Updated weights for policy 0, policy_version 68340 (0.0006) [2023-03-06 15:59:58,745][04272] Updated weights for policy 0, policy_version 68350 (0.0006) [2023-03-06 15:59:58,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12610.8). Total num frames: 69992448. Throughput: 0: 12607.5. Samples: 69971936. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 15:59:58,941][03942] Avg episode reward: [(0, '1353.401')] [2023-03-06 15:59:59,542][04272] Updated weights for policy 0, policy_version 68360 (0.0006) [2023-03-06 16:00:00,372][04272] Updated weights for policy 0, policy_version 68370 (0.0006) [2023-03-06 16:00:01,177][04272] Updated weights for policy 0, policy_version 68380 (0.0006) [2023-03-06 16:00:01,985][04272] Updated weights for policy 0, policy_version 68390 (0.0007) [2023-03-06 16:00:02,809][04272] Updated weights for policy 0, policy_version 68400 (0.0007) [2023-03-06 16:00:03,633][04272] Updated weights for policy 0, policy_version 68410 (0.0006) [2023-03-06 16:00:03,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12610.8). Total num frames: 70054912. Throughput: 0: 12602.8. Samples: 70047426. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:00:03,941][03942] Avg episode reward: [(0, '1301.671')] [2023-03-06 16:00:04,442][04272] Updated weights for policy 0, policy_version 68420 (0.0005) [2023-03-06 16:00:05,241][04272] Updated weights for policy 0, policy_version 68430 (0.0007) [2023-03-06 16:00:06,062][04272] Updated weights for policy 0, policy_version 68440 (0.0006) [2023-03-06 16:00:06,869][04272] Updated weights for policy 0, policy_version 68450 (0.0007) [2023-03-06 16:00:07,685][04272] Updated weights for policy 0, policy_version 68460 (0.0007) [2023-03-06 16:00:08,505][04272] Updated weights for policy 0, policy_version 68470 (0.0007) [2023-03-06 16:00:08,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12614.3). Total num frames: 70118400. Throughput: 0: 12602.8. Samples: 70085247. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:00:08,941][03942] Avg episode reward: [(0, '1183.475')] [2023-03-06 16:00:09,319][04272] Updated weights for policy 0, policy_version 68480 (0.0006) [2023-03-06 16:00:10,117][04272] Updated weights for policy 0, policy_version 68490 (0.0007) [2023-03-06 16:00:10,922][04272] Updated weights for policy 0, policy_version 68500 (0.0006) [2023-03-06 16:00:11,738][04272] Updated weights for policy 0, policy_version 68510 (0.0006) [2023-03-06 16:00:12,559][04272] Updated weights for policy 0, policy_version 68520 (0.0006) [2023-03-06 16:00:13,361][04272] Updated weights for policy 0, policy_version 68530 (0.0006) [2023-03-06 16:00:13,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12612.3, 300 sec: 12614.3). Total num frames: 70181888. Throughput: 0: 12613.2. Samples: 70161057. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:00:13,952][03942] Avg episode reward: [(0, '1289.833')] [2023-03-06 16:00:14,172][04272] Updated weights for policy 0, policy_version 68540 (0.0006) [2023-03-06 16:00:14,986][04272] Updated weights for policy 0, policy_version 68550 (0.0007) [2023-03-06 16:00:15,797][04272] Updated weights for policy 0, policy_version 68560 (0.0006) [2023-03-06 16:00:16,608][04272] Updated weights for policy 0, policy_version 68570 (0.0006) [2023-03-06 16:00:17,399][04272] Updated weights for policy 0, policy_version 68580 (0.0007) [2023-03-06 16:00:18,210][04272] Updated weights for policy 0, policy_version 68590 (0.0006) [2023-03-06 16:00:18,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12629.3, 300 sec: 12614.3). Total num frames: 70245376. Throughput: 0: 12617.3. Samples: 70237120. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:00:18,941][03942] Avg episode reward: [(0, '1305.435')] [2023-03-06 16:00:19,013][04272] Updated weights for policy 0, policy_version 68600 (0.0006) [2023-03-06 16:00:19,818][04272] Updated weights for policy 0, policy_version 68610 (0.0006) [2023-03-06 16:00:20,617][04272] Updated weights for policy 0, policy_version 68620 (0.0007) [2023-03-06 16:00:21,424][04272] Updated weights for policy 0, policy_version 68630 (0.0007) [2023-03-06 16:00:22,233][04272] Updated weights for policy 0, policy_version 68640 (0.0006) [2023-03-06 16:00:23,065][04272] Updated weights for policy 0, policy_version 68650 (0.0006) [2023-03-06 16:00:23,848][04272] Updated weights for policy 0, policy_version 68660 (0.0007) [2023-03-06 16:00:23,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.2, 300 sec: 12610.8). Total num frames: 70307840. Throughput: 0: 12619.3. Samples: 70275109. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:00:23,941][03942] Avg episode reward: [(0, '1263.255')] [2023-03-06 16:00:24,683][04272] Updated weights for policy 0, policy_version 68670 (0.0006) [2023-03-06 16:00:25,514][04272] Updated weights for policy 0, policy_version 68680 (0.0006) [2023-03-06 16:00:26,318][04272] Updated weights for policy 0, policy_version 68690 (0.0006) [2023-03-06 16:00:27,104][04272] Updated weights for policy 0, policy_version 68700 (0.0007) [2023-03-06 16:00:27,935][04272] Updated weights for policy 0, policy_version 68710 (0.0006) [2023-03-06 16:00:28,770][04272] Updated weights for policy 0, policy_version 68720 (0.0006) [2023-03-06 16:00:28,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12614.3). Total num frames: 70371328. Throughput: 0: 12621.0. Samples: 70350617. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:00:28,941][03942] Avg episode reward: [(0, '1239.523')] [2023-03-06 16:00:29,572][04272] Updated weights for policy 0, policy_version 68730 (0.0007) [2023-03-06 16:00:30,418][04272] Updated weights for policy 0, policy_version 68740 (0.0006) [2023-03-06 16:00:31,223][04272] Updated weights for policy 0, policy_version 68750 (0.0006) [2023-03-06 16:00:32,034][04272] Updated weights for policy 0, policy_version 68760 (0.0007) [2023-03-06 16:00:32,849][04272] Updated weights for policy 0, policy_version 68770 (0.0007) [2023-03-06 16:00:33,642][04272] Updated weights for policy 0, policy_version 68780 (0.0005) [2023-03-06 16:00:33,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12610.8). Total num frames: 70433792. Throughput: 0: 12609.6. Samples: 70425782. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:00:33,941][03942] Avg episode reward: [(0, '1307.852')] [2023-03-06 16:00:34,459][04272] Updated weights for policy 0, policy_version 68790 (0.0006) [2023-03-06 16:00:35,290][04272] Updated weights for policy 0, policy_version 68800 (0.0006) [2023-03-06 16:00:36,096][04272] Updated weights for policy 0, policy_version 68810 (0.0007) [2023-03-06 16:00:36,902][04272] Updated weights for policy 0, policy_version 68820 (0.0006) [2023-03-06 16:00:37,737][04272] Updated weights for policy 0, policy_version 68830 (0.0005) [2023-03-06 16:00:38,556][04272] Updated weights for policy 0, policy_version 68840 (0.0006) [2023-03-06 16:00:38,941][03942] Fps is (10 sec: 12492.7, 60 sec: 12595.2, 300 sec: 12610.8). Total num frames: 70496256. Throughput: 0: 12612.6. Samples: 70463715. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:00:38,941][03942] Avg episode reward: [(0, '1263.176')] [2023-03-06 16:00:39,355][04272] Updated weights for policy 0, policy_version 68850 (0.0007) [2023-03-06 16:00:40,171][04272] Updated weights for policy 0, policy_version 68860 (0.0006) [2023-03-06 16:00:40,977][04272] Updated weights for policy 0, policy_version 68870 (0.0006) [2023-03-06 16:00:41,777][04272] Updated weights for policy 0, policy_version 68880 (0.0007) [2023-03-06 16:00:42,588][04272] Updated weights for policy 0, policy_version 68890 (0.0006) [2023-03-06 16:00:43,393][04272] Updated weights for policy 0, policy_version 68900 (0.0007) [2023-03-06 16:00:43,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12610.8). Total num frames: 70559744. Throughput: 0: 12612.1. Samples: 70539479. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:00:43,941][03942] Avg episode reward: [(0, '1271.473')] [2023-03-06 16:00:44,210][04272] Updated weights for policy 0, policy_version 68910 (0.0006) [2023-03-06 16:00:45,006][04272] Updated weights for policy 0, policy_version 68920 (0.0007) [2023-03-06 16:00:45,832][04272] Updated weights for policy 0, policy_version 68930 (0.0007) [2023-03-06 16:00:46,626][04272] Updated weights for policy 0, policy_version 68940 (0.0006) [2023-03-06 16:00:47,443][04272] Updated weights for policy 0, policy_version 68950 (0.0008) [2023-03-06 16:00:48,246][04272] Updated weights for policy 0, policy_version 68960 (0.0006) [2023-03-06 16:00:48,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12612.3, 300 sec: 12614.3). Total num frames: 70623232. Throughput: 0: 12621.8. Samples: 70615404. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:00:48,941][03942] Avg episode reward: [(0, '1102.196')] [2023-03-06 16:00:49,044][04272] Updated weights for policy 0, policy_version 68970 (0.0006) [2023-03-06 16:00:49,853][04272] Updated weights for policy 0, policy_version 68980 (0.0006) [2023-03-06 16:00:50,676][04272] Updated weights for policy 0, policy_version 68990 (0.0007) [2023-03-06 16:00:51,486][04272] Updated weights for policy 0, policy_version 69000 (0.0006) [2023-03-06 16:00:52,290][04272] Updated weights for policy 0, policy_version 69010 (0.0005) [2023-03-06 16:00:53,101][04272] Updated weights for policy 0, policy_version 69020 (0.0007) [2023-03-06 16:00:53,929][04272] Updated weights for policy 0, policy_version 69030 (0.0006) [2023-03-06 16:00:53,941][03942] Fps is (10 sec: 12697.4, 60 sec: 12629.3, 300 sec: 12614.3). Total num frames: 70686720. Throughput: 0: 12621.8. Samples: 70653226. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:00:53,941][03942] Avg episode reward: [(0, '1181.009')] [2023-03-06 16:00:54,729][04272] Updated weights for policy 0, policy_version 69040 (0.0006) [2023-03-06 16:00:55,550][04272] Updated weights for policy 0, policy_version 69050 (0.0006) [2023-03-06 16:00:56,367][04272] Updated weights for policy 0, policy_version 69060 (0.0006) [2023-03-06 16:00:57,161][04272] Updated weights for policy 0, policy_version 69070 (0.0007) [2023-03-06 16:00:57,988][04272] Updated weights for policy 0, policy_version 69080 (0.0007) [2023-03-06 16:00:58,797][04272] Updated weights for policy 0, policy_version 69090 (0.0006) [2023-03-06 16:00:58,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12610.8). Total num frames: 70749184. Throughput: 0: 12620.7. Samples: 70728986. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:00:58,941][03942] Avg episode reward: [(0, '1251.365')] [2023-03-06 16:00:59,622][04272] Updated weights for policy 0, policy_version 69100 (0.0006) [2023-03-06 16:01:00,423][04272] Updated weights for policy 0, policy_version 69110 (0.0006) [2023-03-06 16:01:01,230][04272] Updated weights for policy 0, policy_version 69120 (0.0006) [2023-03-06 16:01:02,039][04272] Updated weights for policy 0, policy_version 69130 (0.0006) [2023-03-06 16:01:02,874][04272] Updated weights for policy 0, policy_version 69140 (0.0006) [2023-03-06 16:01:03,685][04272] Updated weights for policy 0, policy_version 69150 (0.0007) [2023-03-06 16:01:03,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12629.3, 300 sec: 12614.3). Total num frames: 70812672. Throughput: 0: 12605.2. Samples: 70804354. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:01:03,941][03942] Avg episode reward: [(0, '1205.660')] [2023-03-06 16:01:04,515][04272] Updated weights for policy 0, policy_version 69160 (0.0005) [2023-03-06 16:01:05,329][04272] Updated weights for policy 0, policy_version 69170 (0.0006) [2023-03-06 16:01:06,122][04272] Updated weights for policy 0, policy_version 69180 (0.0007) [2023-03-06 16:01:06,939][04272] Updated weights for policy 0, policy_version 69190 (0.0006) [2023-03-06 16:01:07,756][04272] Updated weights for policy 0, policy_version 69200 (0.0007) [2023-03-06 16:01:08,546][04272] Updated weights for policy 0, policy_version 69210 (0.0007) [2023-03-06 16:01:08,941][03942] Fps is (10 sec: 12595.0, 60 sec: 12612.2, 300 sec: 12610.8). Total num frames: 70875136. Throughput: 0: 12600.7. Samples: 70842140. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:01:08,941][03942] Avg episode reward: [(0, '1102.678')] [2023-03-06 16:01:08,945][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000069214_70875136.pth... 
[2023-03-06 16:01:08,975][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000066258_67848192.pth [2023-03-06 16:01:09,366][04272] Updated weights for policy 0, policy_version 69220 (0.0006) [2023-03-06 16:01:10,199][04272] Updated weights for policy 0, policy_version 69230 (0.0007) [2023-03-06 16:01:11,010][04272] Updated weights for policy 0, policy_version 69240 (0.0006) [2023-03-06 16:01:11,810][04272] Updated weights for policy 0, policy_version 69250 (0.0007) [2023-03-06 16:01:12,624][04272] Updated weights for policy 0, policy_version 69260 (0.0006) [2023-03-06 16:01:13,424][04272] Updated weights for policy 0, policy_version 69270 (0.0006) [2023-03-06 16:01:13,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12614.3). Total num frames: 70938624. Throughput: 0: 12604.8. Samples: 70917834. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 16:01:13,952][03942] Avg episode reward: [(0, '1194.875')] [2023-03-06 16:01:14,223][04272] Updated weights for policy 0, policy_version 69280 (0.0007) [2023-03-06 16:01:15,053][04272] Updated weights for policy 0, policy_version 69290 (0.0006) [2023-03-06 16:01:15,867][04272] Updated weights for policy 0, policy_version 69300 (0.0006) [2023-03-06 16:01:16,676][04272] Updated weights for policy 0, policy_version 69310 (0.0006) [2023-03-06 16:01:17,486][04272] Updated weights for policy 0, policy_version 69320 (0.0006) [2023-03-06 16:01:18,300][04272] Updated weights for policy 0, policy_version 69330 (0.0007) [2023-03-06 16:01:18,940][03942] Fps is (10 sec: 12595.4, 60 sec: 12595.2, 300 sec: 12610.8). Total num frames: 71001088. Throughput: 0: 12616.5. Samples: 70993526. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 16:01:18,951][03942] Avg episode reward: [(0, '1223.295')] [2023-03-06 16:01:19,123][04272] Updated weights for policy 0, policy_version 69340 (0.0006) [2023-03-06 16:01:19,931][04272] Updated weights for policy 0, policy_version 69350 (0.0007) [2023-03-06 16:01:20,742][04272] Updated weights for policy 0, policy_version 69360 (0.0006) [2023-03-06 16:01:21,551][04272] Updated weights for policy 0, policy_version 69370 (0.0006) [2023-03-06 16:01:22,368][04272] Updated weights for policy 0, policy_version 69380 (0.0006) [2023-03-06 16:01:23,189][04272] Updated weights for policy 0, policy_version 69390 (0.0006) [2023-03-06 16:01:23,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12614.3). Total num frames: 71064576. Throughput: 0: 12613.3. Samples: 71031311. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 16:01:23,952][03942] Avg episode reward: [(0, '1211.744')] [2023-03-06 16:01:23,997][04272] Updated weights for policy 0, policy_version 69400 (0.0006) [2023-03-06 16:01:24,819][04272] Updated weights for policy 0, policy_version 69410 (0.0007) [2023-03-06 16:01:25,620][04272] Updated weights for policy 0, policy_version 69420 (0.0008) [2023-03-06 16:01:26,422][04272] Updated weights for policy 0, policy_version 69430 (0.0007) [2023-03-06 16:01:27,215][04272] Updated weights for policy 0, policy_version 69440 (0.0006) [2023-03-06 16:01:28,046][04272] Updated weights for policy 0, policy_version 69450 (0.0007) [2023-03-06 16:01:28,857][04272] Updated weights for policy 0, policy_version 69460 (0.0007) [2023-03-06 16:01:28,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12610.8). Total num frames: 71127040. Throughput: 0: 12620.0. Samples: 71107380. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 16:01:28,951][03942] Avg episode reward: [(0, '1227.106')] [2023-03-06 16:01:29,657][04272] Updated weights for policy 0, policy_version 69470 (0.0005) [2023-03-06 16:01:30,473][04272] Updated weights for policy 0, policy_version 69480 (0.0006) [2023-03-06 16:01:31,272][04272] Updated weights for policy 0, policy_version 69490 (0.0006) [2023-03-06 16:01:32,099][04272] Updated weights for policy 0, policy_version 69500 (0.0006) [2023-03-06 16:01:32,923][04272] Updated weights for policy 0, policy_version 69510 (0.0007) [2023-03-06 16:01:33,744][04272] Updated weights for policy 0, policy_version 69520 (0.0006) [2023-03-06 16:01:33,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12610.8). Total num frames: 71190528. Throughput: 0: 12608.2. Samples: 71182773. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 16:01:33,951][03942] Avg episode reward: [(0, '1285.702')] [2023-03-06 16:01:34,531][04272] Updated weights for policy 0, policy_version 69530 (0.0005) [2023-03-06 16:01:35,345][04272] Updated weights for policy 0, policy_version 69540 (0.0006) [2023-03-06 16:01:36,155][04272] Updated weights for policy 0, policy_version 69550 (0.0006) [2023-03-06 16:01:36,939][04272] Updated weights for policy 0, policy_version 69560 (0.0006) [2023-03-06 16:01:37,756][04272] Updated weights for policy 0, policy_version 69570 (0.0006) [2023-03-06 16:01:38,585][04272] Updated weights for policy 0, policy_version 69580 (0.0006) [2023-03-06 16:01:38,941][03942] Fps is (10 sec: 12697.4, 60 sec: 12629.3, 300 sec: 12614.3). Total num frames: 71254016. Throughput: 0: 12617.7. Samples: 71221022. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 16:01:38,952][03942] Avg episode reward: [(0, '1273.887')] [2023-03-06 16:01:39,396][04272] Updated weights for policy 0, policy_version 69590 (0.0007) [2023-03-06 16:01:40,197][04272] Updated weights for policy 0, policy_version 69600 (0.0006) [2023-03-06 16:01:41,016][04272] Updated weights for policy 0, policy_version 69610 (0.0007) [2023-03-06 16:01:41,829][04272] Updated weights for policy 0, policy_version 69620 (0.0006) [2023-03-06 16:01:42,631][04272] Updated weights for policy 0, policy_version 69630 (0.0007) [2023-03-06 16:01:43,450][04272] Updated weights for policy 0, policy_version 69640 (0.0006) [2023-03-06 16:01:43,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12629.3, 300 sec: 12614.3). Total num frames: 71317504. Throughput: 0: 12615.6. Samples: 71296688. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 16:01:43,952][03942] Avg episode reward: [(0, '1273.147')] [2023-03-06 16:01:44,253][04272] Updated weights for policy 0, policy_version 69650 (0.0006) [2023-03-06 16:01:45,048][04272] Updated weights for policy 0, policy_version 69660 (0.0007) [2023-03-06 16:01:45,863][04272] Updated weights for policy 0, policy_version 69670 (0.0006) [2023-03-06 16:01:46,679][04272] Updated weights for policy 0, policy_version 69680 (0.0006) [2023-03-06 16:01:47,479][04272] Updated weights for policy 0, policy_version 69690 (0.0006) [2023-03-06 16:01:48,305][04272] Updated weights for policy 0, policy_version 69700 (0.0007) [2023-03-06 16:01:48,941][03942] Fps is (10 sec: 12595.4, 60 sec: 12612.3, 300 sec: 12610.8). Total num frames: 71379968. Throughput: 0: 12626.7. Samples: 71372556. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 16:01:48,941][03942] Avg episode reward: [(0, '1242.938')] [2023-03-06 16:01:49,110][04272] Updated weights for policy 0, policy_version 69710 (0.0006) [2023-03-06 16:01:49,910][04272] Updated weights for policy 0, policy_version 69720 (0.0006) [2023-03-06 16:01:50,717][04272] Updated weights for policy 0, policy_version 69730 (0.0006) [2023-03-06 16:01:51,537][04272] Updated weights for policy 0, policy_version 69740 (0.0006) [2023-03-06 16:01:52,342][04272] Updated weights for policy 0, policy_version 69750 (0.0007) [2023-03-06 16:01:53,149][04272] Updated weights for policy 0, policy_version 69760 (0.0006) [2023-03-06 16:01:53,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12614.3). Total num frames: 71443456. Throughput: 0: 12631.0. Samples: 71410535. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 16:01:53,941][03942] Avg episode reward: [(0, '1216.070')] [2023-03-06 16:01:53,963][04272] Updated weights for policy 0, policy_version 69770 (0.0006) [2023-03-06 16:01:54,767][04272] Updated weights for policy 0, policy_version 69780 (0.0006) [2023-03-06 16:01:55,583][04272] Updated weights for policy 0, policy_version 69790 (0.0006) [2023-03-06 16:01:56,390][04272] Updated weights for policy 0, policy_version 69800 (0.0007) [2023-03-06 16:01:57,211][04272] Updated weights for policy 0, policy_version 69810 (0.0007) [2023-03-06 16:01:58,041][04272] Updated weights for policy 0, policy_version 69820 (0.0006) [2023-03-06 16:01:58,826][04272] Updated weights for policy 0, policy_version 69830 (0.0006) [2023-03-06 16:01:58,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12617.8). Total num frames: 71506944. Throughput: 0: 12633.3. Samples: 71486333. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:01:58,941][03942] Avg episode reward: [(0, '1260.005')] [2023-03-06 16:01:59,634][04272] Updated weights for policy 0, policy_version 69840 (0.0006) [2023-03-06 16:02:00,452][04272] Updated weights for policy 0, policy_version 69850 (0.0006) [2023-03-06 16:02:01,252][04272] Updated weights for policy 0, policy_version 69860 (0.0006) [2023-03-06 16:02:02,071][04272] Updated weights for policy 0, policy_version 69870 (0.0006) [2023-03-06 16:02:02,865][04272] Updated weights for policy 0, policy_version 69880 (0.0006) [2023-03-06 16:02:03,700][04272] Updated weights for policy 0, policy_version 69890 (0.0007) [2023-03-06 16:02:03,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12617.8). Total num frames: 71570432. Throughput: 0: 12639.8. Samples: 71562319. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:02:03,941][03942] Avg episode reward: [(0, '1201.917')] [2023-03-06 16:02:04,501][04272] Updated weights for policy 0, policy_version 69900 (0.0007) [2023-03-06 16:02:05,307][04272] Updated weights for policy 0, policy_version 69910 (0.0007) [2023-03-06 16:02:06,118][04272] Updated weights for policy 0, policy_version 69920 (0.0006) [2023-03-06 16:02:06,930][04272] Updated weights for policy 0, policy_version 69930 (0.0006) [2023-03-06 16:02:07,747][04272] Updated weights for policy 0, policy_version 69940 (0.0007) [2023-03-06 16:02:08,565][04272] Updated weights for policy 0, policy_version 69950 (0.0006) [2023-03-06 16:02:08,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12629.3, 300 sec: 12614.3). Total num frames: 71632896. Throughput: 0: 12638.0. Samples: 71600023. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:02:08,941][03942] Avg episode reward: [(0, '1263.840')] [2023-03-06 16:02:09,391][04272] Updated weights for policy 0, policy_version 69960 (0.0006) [2023-03-06 16:02:10,194][04272] Updated weights for policy 0, policy_version 69970 (0.0006) [2023-03-06 16:02:11,003][04272] Updated weights for policy 0, policy_version 69980 (0.0007) [2023-03-06 16:02:11,797][04272] Updated weights for policy 0, policy_version 69990 (0.0006) [2023-03-06 16:02:12,610][04272] Updated weights for policy 0, policy_version 70000 (0.0006) [2023-03-06 16:02:13,422][04272] Updated weights for policy 0, policy_version 70010 (0.0006) [2023-03-06 16:02:13,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.4, 300 sec: 12614.3). Total num frames: 71696384. Throughput: 0: 12631.9. Samples: 71675814. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:02:13,941][03942] Avg episode reward: [(0, '1245.956')] [2023-03-06 16:02:14,249][04272] Updated weights for policy 0, policy_version 70020 (0.0006) [2023-03-06 16:02:15,049][04272] Updated weights for policy 0, policy_version 70030 (0.0007) [2023-03-06 16:02:15,870][04272] Updated weights for policy 0, policy_version 70040 (0.0007) [2023-03-06 16:02:16,689][04272] Updated weights for policy 0, policy_version 70050 (0.0006) [2023-03-06 16:02:17,495][04272] Updated weights for policy 0, policy_version 70060 (0.0007) [2023-03-06 16:02:18,317][04272] Updated weights for policy 0, policy_version 70070 (0.0008) [2023-03-06 16:02:18,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12629.3, 300 sec: 12610.8). Total num frames: 71758848. Throughput: 0: 12629.9. Samples: 71751118. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:02:18,941][03942] Avg episode reward: [(0, '1265.941')] [2023-03-06 16:02:19,134][04272] Updated weights for policy 0, policy_version 70080 (0.0007) [2023-03-06 16:02:19,937][04272] Updated weights for policy 0, policy_version 70090 (0.0006) [2023-03-06 16:02:20,760][04272] Updated weights for policy 0, policy_version 70100 (0.0006) [2023-03-06 16:02:21,575][04272] Updated weights for policy 0, policy_version 70110 (0.0007) [2023-03-06 16:02:22,378][04272] Updated weights for policy 0, policy_version 70120 (0.0006) [2023-03-06 16:02:23,195][04272] Updated weights for policy 0, policy_version 70130 (0.0007) [2023-03-06 16:02:23,941][03942] Fps is (10 sec: 12492.8, 60 sec: 12612.3, 300 sec: 12610.8). Total num frames: 71821312. Throughput: 0: 12618.7. Samples: 71788861. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:02:23,941][03942] Avg episode reward: [(0, '1318.343')] [2023-03-06 16:02:24,014][04272] Updated weights for policy 0, policy_version 70140 (0.0006) [2023-03-06 16:02:24,822][04272] Updated weights for policy 0, policy_version 70150 (0.0006) [2023-03-06 16:02:25,659][04272] Updated weights for policy 0, policy_version 70160 (0.0006) [2023-03-06 16:02:26,466][04272] Updated weights for policy 0, policy_version 70170 (0.0006) [2023-03-06 16:02:27,286][04272] Updated weights for policy 0, policy_version 70180 (0.0006) [2023-03-06 16:02:28,083][04272] Updated weights for policy 0, policy_version 70190 (0.0006) [2023-03-06 16:02:28,895][04272] Updated weights for policy 0, policy_version 70200 (0.0007) [2023-03-06 16:02:28,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12610.8). Total num frames: 71884800. Throughput: 0: 12611.8. Samples: 71864219. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:02:28,941][03942] Avg episode reward: [(0, '1305.911')] [2023-03-06 16:02:29,722][04272] Updated weights for policy 0, policy_version 70210 (0.0006) [2023-03-06 16:02:30,519][04272] Updated weights for policy 0, policy_version 70220 (0.0007) [2023-03-06 16:02:31,329][04272] Updated weights for policy 0, policy_version 70230 (0.0006) [2023-03-06 16:02:32,159][04272] Updated weights for policy 0, policy_version 70240 (0.0006) [2023-03-06 16:02:32,967][04272] Updated weights for policy 0, policy_version 70250 (0.0006) [2023-03-06 16:02:33,785][04272] Updated weights for policy 0, policy_version 70260 (0.0006) [2023-03-06 16:02:33,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12610.8). Total num frames: 71947264. Throughput: 0: 12607.4. Samples: 71939891. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:02:33,941][03942] Avg episode reward: [(0, '1231.722')] [2023-03-06 16:02:34,607][04272] Updated weights for policy 0, policy_version 70270 (0.0006) [2023-03-06 16:02:35,422][04272] Updated weights for policy 0, policy_version 70280 (0.0007) [2023-03-06 16:02:36,231][04272] Updated weights for policy 0, policy_version 70290 (0.0006) [2023-03-06 16:02:37,053][04272] Updated weights for policy 0, policy_version 70300 (0.0008) [2023-03-06 16:02:37,874][04272] Updated weights for policy 0, policy_version 70310 (0.0007) [2023-03-06 16:02:38,678][04272] Updated weights for policy 0, policy_version 70320 (0.0006) [2023-03-06 16:02:38,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12610.8). Total num frames: 72010752. Throughput: 0: 12598.8. Samples: 71977482. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:02:38,941][03942] Avg episode reward: [(0, '1278.425')] [2023-03-06 16:02:39,489][04272] Updated weights for policy 0, policy_version 70330 (0.0006) [2023-03-06 16:02:40,292][04272] Updated weights for policy 0, policy_version 70340 (0.0006) [2023-03-06 16:02:41,098][04272] Updated weights for policy 0, policy_version 70350 (0.0006) [2023-03-06 16:02:41,910][04272] Updated weights for policy 0, policy_version 70360 (0.0007) [2023-03-06 16:02:42,718][04272] Updated weights for policy 0, policy_version 70370 (0.0006) [2023-03-06 16:02:43,543][04272] Updated weights for policy 0, policy_version 70380 (0.0006) [2023-03-06 16:02:43,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12610.8). Total num frames: 72073216. Throughput: 0: 12598.8. Samples: 72053281. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:02:43,941][03942] Avg episode reward: [(0, '1238.249')] [2023-03-06 16:02:44,359][04272] Updated weights for policy 0, policy_version 70390 (0.0006) [2023-03-06 16:02:45,178][04272] Updated weights for policy 0, policy_version 70400 (0.0006) [2023-03-06 16:02:45,980][04272] Updated weights for policy 0, policy_version 70410 (0.0006) [2023-03-06 16:02:46,793][04272] Updated weights for policy 0, policy_version 70420 (0.0006) [2023-03-06 16:02:47,602][04272] Updated weights for policy 0, policy_version 70430 (0.0006) [2023-03-06 16:02:48,398][04272] Updated weights for policy 0, policy_version 70440 (0.0006) [2023-03-06 16:02:48,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.2, 300 sec: 12610.8). Total num frames: 72136704. Throughput: 0: 12594.9. Samples: 72129088. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:02:48,941][03942] Avg episode reward: [(0, '1255.664')] [2023-03-06 16:02:49,215][04272] Updated weights for policy 0, policy_version 70450 (0.0007) [2023-03-06 16:02:50,039][04272] Updated weights for policy 0, policy_version 70460 (0.0006) [2023-03-06 16:02:50,843][04272] Updated weights for policy 0, policy_version 70470 (0.0007) [2023-03-06 16:02:51,657][04272] Updated weights for policy 0, policy_version 70480 (0.0007) [2023-03-06 16:02:52,468][04272] Updated weights for policy 0, policy_version 70490 (0.0006) [2023-03-06 16:02:53,288][04272] Updated weights for policy 0, policy_version 70500 (0.0007) [2023-03-06 16:02:53,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12612.3, 300 sec: 12614.3). Total num frames: 72200192. Throughput: 0: 12595.1. Samples: 72166802. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:02:53,941][03942] Avg episode reward: [(0, '1272.891')] [2023-03-06 16:02:54,079][04272] Updated weights for policy 0, policy_version 70510 (0.0006) [2023-03-06 16:02:54,898][04272] Updated weights for policy 0, policy_version 70520 (0.0006) [2023-03-06 16:02:55,697][04272] Updated weights for policy 0, policy_version 70530 (0.0007) [2023-03-06 16:02:56,506][04272] Updated weights for policy 0, policy_version 70540 (0.0006) [2023-03-06 16:02:57,334][04272] Updated weights for policy 0, policy_version 70550 (0.0006) [2023-03-06 16:02:58,133][04272] Updated weights for policy 0, policy_version 70560 (0.0006) [2023-03-06 16:02:58,930][04272] Updated weights for policy 0, policy_version 70570 (0.0006) [2023-03-06 16:02:58,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12612.3, 300 sec: 12617.8). Total num frames: 72263680. Throughput: 0: 12596.3. Samples: 72242649. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:02:58,941][03942] Avg episode reward: [(0, '1247.858')] [2023-03-06 16:02:59,746][04272] Updated weights for policy 0, policy_version 70580 (0.0006) [2023-03-06 16:03:00,560][04272] Updated weights for policy 0, policy_version 70590 (0.0007) [2023-03-06 16:03:01,372][04272] Updated weights for policy 0, policy_version 70600 (0.0006) [2023-03-06 16:03:02,189][04272] Updated weights for policy 0, policy_version 70610 (0.0006) [2023-03-06 16:03:03,005][04272] Updated weights for policy 0, policy_version 70620 (0.0006) [2023-03-06 16:03:03,821][04272] Updated weights for policy 0, policy_version 70630 (0.0008) [2023-03-06 16:03:03,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12614.3). Total num frames: 72326144. Throughput: 0: 12602.1. Samples: 72318213. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:03:03,941][03942] Avg episode reward: [(0, '1227.026')] [2023-03-06 16:03:04,623][04272] Updated weights for policy 0, policy_version 70640 (0.0006) [2023-03-06 16:03:05,455][04272] Updated weights for policy 0, policy_version 70650 (0.0006) [2023-03-06 16:03:06,253][04272] Updated weights for policy 0, policy_version 70660 (0.0006) [2023-03-06 16:03:07,046][04272] Updated weights for policy 0, policy_version 70670 (0.0007) [2023-03-06 16:03:07,906][04272] Updated weights for policy 0, policy_version 70680 (0.0006) [2023-03-06 16:03:08,703][04272] Updated weights for policy 0, policy_version 70690 (0.0006) [2023-03-06 16:03:08,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12617.8). Total num frames: 72389632. Throughput: 0: 12605.7. Samples: 72356117. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:03:08,941][03942] Avg episode reward: [(0, '948.074')] [2023-03-06 16:03:08,944][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000070693_72389632.pth... 
[2023-03-06 16:03:08,975][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000067737_69362688.pth [2023-03-06 16:03:09,488][04272] Updated weights for policy 0, policy_version 70700 (0.0006) [2023-03-06 16:03:10,336][04272] Updated weights for policy 0, policy_version 70710 (0.0006) [2023-03-06 16:03:11,141][04272] Updated weights for policy 0, policy_version 70720 (0.0007) [2023-03-06 16:03:11,957][04272] Updated weights for policy 0, policy_version 70730 (0.0006) [2023-03-06 16:03:12,777][04272] Updated weights for policy 0, policy_version 70740 (0.0006) [2023-03-06 16:03:13,568][04272] Updated weights for policy 0, policy_version 70750 (0.0006) [2023-03-06 16:03:13,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12610.8). Total num frames: 72452096. Throughput: 0: 12607.7. Samples: 72431567. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:03:13,941][03942] Avg episode reward: [(0, '891.530')] [2023-03-06 16:03:14,377][04272] Updated weights for policy 0, policy_version 70760 (0.0006) [2023-03-06 16:03:15,196][04272] Updated weights for policy 0, policy_version 70770 (0.0006) [2023-03-06 16:03:16,017][04272] Updated weights for policy 0, policy_version 70780 (0.0006) [2023-03-06 16:03:16,809][04221] KL-divergence is very high: 104.6279 [2023-03-06 16:03:16,817][04272] Updated weights for policy 0, policy_version 70790 (0.0006) [2023-03-06 16:03:17,610][04272] Updated weights for policy 0, policy_version 70800 (0.0007) [2023-03-06 16:03:18,433][04272] Updated weights for policy 0, policy_version 70810 (0.0007) [2023-03-06 16:03:18,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12614.3). Total num frames: 72515584. Throughput: 0: 12616.2. Samples: 72507620. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:03:18,941][03942] Avg episode reward: [(0, '979.801')] [2023-03-06 16:03:19,241][04272] Updated weights for policy 0, policy_version 70820 (0.0006) [2023-03-06 16:03:20,049][04272] Updated weights for policy 0, policy_version 70830 (0.0006) [2023-03-06 16:03:20,861][04272] Updated weights for policy 0, policy_version 70840 (0.0006) [2023-03-06 16:03:21,175][04221] KL-divergence is very high: 128.3665 [2023-03-06 16:03:21,675][04272] Updated weights for policy 0, policy_version 70850 (0.0007) [2023-03-06 16:03:22,471][04272] Updated weights for policy 0, policy_version 70860 (0.0006) [2023-03-06 16:03:23,294][04272] Updated weights for policy 0, policy_version 70870 (0.0006) [2023-03-06 16:03:23,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12610.8). Total num frames: 72578048. Throughput: 0: 12624.2. Samples: 72545572. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:03:23,941][03942] Avg episode reward: [(0, '797.882')] [2023-03-06 16:03:24,107][04272] Updated weights for policy 0, policy_version 70880 (0.0006) [2023-03-06 16:03:24,919][04272] Updated weights for policy 0, policy_version 70890 (0.0006) [2023-03-06 16:03:25,749][04272] Updated weights for policy 0, policy_version 70900 (0.0006) [2023-03-06 16:03:26,561][04272] Updated weights for policy 0, policy_version 70910 (0.0006) [2023-03-06 16:03:27,362][04272] Updated weights for policy 0, policy_version 70920 (0.0007) [2023-03-06 16:03:28,187][04272] Updated weights for policy 0, policy_version 70930 (0.0006) [2023-03-06 16:03:28,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12610.8). Total num frames: 72641536. Throughput: 0: 12613.8. Samples: 72620901. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:03:28,941][03942] Avg episode reward: [(0, '897.955')] [2023-03-06 16:03:29,022][04272] Updated weights for policy 0, policy_version 70940 (0.0007) [2023-03-06 16:03:29,825][04272] Updated weights for policy 0, policy_version 70950 (0.0007) [2023-03-06 16:03:30,632][04272] Updated weights for policy 0, policy_version 70960 (0.0006) [2023-03-06 16:03:31,441][04272] Updated weights for policy 0, policy_version 70970 (0.0006) [2023-03-06 16:03:32,244][04272] Updated weights for policy 0, policy_version 70980 (0.0006) [2023-03-06 16:03:33,060][04272] Updated weights for policy 0, policy_version 70990 (0.0006) [2023-03-06 16:03:33,894][04272] Updated weights for policy 0, policy_version 71000 (0.0006) [2023-03-06 16:03:33,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12610.8). Total num frames: 72704000. Throughput: 0: 12608.1. Samples: 72696454. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:03:33,941][03942] Avg episode reward: [(0, '772.669')] [2023-03-06 16:03:34,686][04272] Updated weights for policy 0, policy_version 71010 (0.0006) [2023-03-06 16:03:35,512][04272] Updated weights for policy 0, policy_version 71020 (0.0007) [2023-03-06 16:03:36,337][04272] Updated weights for policy 0, policy_version 71030 (0.0007) [2023-03-06 16:03:37,136][04272] Updated weights for policy 0, policy_version 71040 (0.0006) [2023-03-06 16:03:37,928][04272] Updated weights for policy 0, policy_version 71050 (0.0006) [2023-03-06 16:03:38,733][04272] Updated weights for policy 0, policy_version 71060 (0.0006) [2023-03-06 16:03:38,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.2, 300 sec: 12610.8). Total num frames: 72767488. Throughput: 0: 12607.1. Samples: 72734123. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:03:38,952][03942] Avg episode reward: [(0, '838.139')] [2023-03-06 16:03:39,561][04272] Updated weights for policy 0, policy_version 71070 (0.0007) [2023-03-06 16:03:40,368][04272] Updated weights for policy 0, policy_version 71080 (0.0007) [2023-03-06 16:03:41,177][04272] Updated weights for policy 0, policy_version 71090 (0.0006) [2023-03-06 16:03:41,974][04272] Updated weights for policy 0, policy_version 71100 (0.0007) [2023-03-06 16:03:42,789][04272] Updated weights for policy 0, policy_version 71110 (0.0006) [2023-03-06 16:03:43,601][04272] Updated weights for policy 0, policy_version 71120 (0.0006) [2023-03-06 16:03:43,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12629.3, 300 sec: 12614.3). Total num frames: 72830976. Throughput: 0: 12610.4. Samples: 72810116. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:03:43,952][03942] Avg episode reward: [(0, '771.643')] [2023-03-06 16:03:44,417][04272] Updated weights for policy 0, policy_version 71130 (0.0006) [2023-03-06 16:03:45,218][04272] Updated weights for policy 0, policy_version 71140 (0.0006) [2023-03-06 16:03:46,034][04272] Updated weights for policy 0, policy_version 71150 (0.0007) [2023-03-06 16:03:46,847][04272] Updated weights for policy 0, policy_version 71160 (0.0007) [2023-03-06 16:03:47,646][04272] Updated weights for policy 0, policy_version 71170 (0.0006) [2023-03-06 16:03:48,453][04272] Updated weights for policy 0, policy_version 71180 (0.0006) [2023-03-06 16:03:48,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12610.8). Total num frames: 72893440. Throughput: 0: 12615.6. Samples: 72885916. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:03:48,952][03942] Avg episode reward: [(0, '803.271')] [2023-03-06 16:03:49,266][04272] Updated weights for policy 0, policy_version 71190 (0.0006) [2023-03-06 16:03:50,075][04272] Updated weights for policy 0, policy_version 71200 (0.0006) [2023-03-06 16:03:50,882][04272] Updated weights for policy 0, policy_version 71210 (0.0006) [2023-03-06 16:03:51,702][04272] Updated weights for policy 0, policy_version 71220 (0.0006) [2023-03-06 16:03:52,518][04272] Updated weights for policy 0, policy_version 71230 (0.0007) [2023-03-06 16:03:53,324][04272] Updated weights for policy 0, policy_version 71240 (0.0007) [2023-03-06 16:03:53,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12610.8). Total num frames: 72956928. Throughput: 0: 12618.7. Samples: 72923958. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:03:53,952][03942] Avg episode reward: [(0, '918.848')] [2023-03-06 16:03:54,144][04272] Updated weights for policy 0, policy_version 71250 (0.0007) [2023-03-06 16:03:54,954][04272] Updated weights for policy 0, policy_version 71260 (0.0006) [2023-03-06 16:03:55,758][04272] Updated weights for policy 0, policy_version 71270 (0.0006) [2023-03-06 16:03:56,573][04272] Updated weights for policy 0, policy_version 71280 (0.0006) [2023-03-06 16:03:57,395][04272] Updated weights for policy 0, policy_version 71290 (0.0007) [2023-03-06 16:03:58,210][04272] Updated weights for policy 0, policy_version 71300 (0.0007) [2023-03-06 16:03:58,941][03942] Fps is (10 sec: 12697.7, 60 sec: 12612.3, 300 sec: 12614.3). Total num frames: 73020416. Throughput: 0: 12625.8. Samples: 72999727. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:03:58,952][03942] Avg episode reward: [(0, '845.916')] [2023-03-06 16:03:59,009][04272] Updated weights for policy 0, policy_version 71310 (0.0007) [2023-03-06 16:03:59,821][04272] Updated weights for policy 0, policy_version 71320 (0.0006) [2023-03-06 16:04:00,624][04272] Updated weights for policy 0, policy_version 71330 (0.0007) [2023-03-06 16:04:01,428][04272] Updated weights for policy 0, policy_version 71340 (0.0006) [2023-03-06 16:04:02,228][04272] Updated weights for policy 0, policy_version 71350 (0.0007) [2023-03-06 16:04:03,043][04272] Updated weights for policy 0, policy_version 71360 (0.0007) [2023-03-06 16:04:03,865][04272] Updated weights for policy 0, policy_version 71370 (0.0006) [2023-03-06 16:04:03,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12629.3, 300 sec: 12617.8). Total num frames: 73083904. Throughput: 0: 12623.1. Samples: 73075658. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:04:03,941][03942] Avg episode reward: [(0, '932.306')] [2023-03-06 16:04:04,670][04272] Updated weights for policy 0, policy_version 71380 (0.0006) [2023-03-06 16:04:05,486][04272] Updated weights for policy 0, policy_version 71390 (0.0006) [2023-03-06 16:04:06,294][04272] Updated weights for policy 0, policy_version 71400 (0.0006) [2023-03-06 16:04:07,104][04272] Updated weights for policy 0, policy_version 71410 (0.0006) [2023-03-06 16:04:07,893][04272] Updated weights for policy 0, policy_version 71420 (0.0006) [2023-03-06 16:04:08,713][04272] Updated weights for policy 0, policy_version 71430 (0.0007) [2023-03-06 16:04:08,941][03942] Fps is (10 sec: 12595.0, 60 sec: 12612.2, 300 sec: 12614.3). Total num frames: 73146368. Throughput: 0: 12619.7. Samples: 73113458. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:04:08,941][03942] Avg episode reward: [(0, '853.233')] [2023-03-06 16:04:09,540][04272] Updated weights for policy 0, policy_version 71440 (0.0007) [2023-03-06 16:04:10,332][04272] Updated weights for policy 0, policy_version 71450 (0.0007) [2023-03-06 16:04:11,163][04272] Updated weights for policy 0, policy_version 71460 (0.0006) [2023-03-06 16:04:11,977][04272] Updated weights for policy 0, policy_version 71470 (0.0008) [2023-03-06 16:04:12,786][04272] Updated weights for policy 0, policy_version 71480 (0.0006) [2023-03-06 16:04:13,588][04272] Updated weights for policy 0, policy_version 71490 (0.0006) [2023-03-06 16:04:13,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12629.4, 300 sec: 12617.8). Total num frames: 73209856. Throughput: 0: 12627.6. Samples: 73189141. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:04:13,941][03942] Avg episode reward: [(0, '946.507')] [2023-03-06 16:04:14,397][04272] Updated weights for policy 0, policy_version 71500 (0.0006) [2023-03-06 16:04:15,198][04272] Updated weights for policy 0, policy_version 71510 (0.0007) [2023-03-06 16:04:16,004][04272] Updated weights for policy 0, policy_version 71520 (0.0006) [2023-03-06 16:04:16,831][04272] Updated weights for policy 0, policy_version 71530 (0.0006) [2023-03-06 16:04:17,630][04272] Updated weights for policy 0, policy_version 71540 (0.0007) [2023-03-06 16:04:18,445][04272] Updated weights for policy 0, policy_version 71550 (0.0006) [2023-03-06 16:04:18,940][03942] Fps is (10 sec: 12697.8, 60 sec: 12629.4, 300 sec: 12617.8). Total num frames: 73273344. Throughput: 0: 12636.3. Samples: 73265084. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:04:18,941][03942] Avg episode reward: [(0, '980.928')] [2023-03-06 16:04:19,267][04272] Updated weights for policy 0, policy_version 71560 (0.0006) [2023-03-06 16:04:20,087][04272] Updated weights for policy 0, policy_version 71570 (0.0006) [2023-03-06 16:04:20,892][04272] Updated weights for policy 0, policy_version 71580 (0.0007) [2023-03-06 16:04:21,690][04272] Updated weights for policy 0, policy_version 71590 (0.0006) [2023-03-06 16:04:22,522][04272] Updated weights for policy 0, policy_version 71600 (0.0006) [2023-03-06 16:04:23,318][04272] Updated weights for policy 0, policy_version 71610 (0.0006) [2023-03-06 16:04:23,941][03942] Fps is (10 sec: 12595.0, 60 sec: 12629.3, 300 sec: 12617.8). Total num frames: 73335808. Throughput: 0: 12638.7. Samples: 73302866. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:04:23,941][03942] Avg episode reward: [(0, '902.125')] [2023-03-06 16:04:24,124][04272] Updated weights for policy 0, policy_version 71620 (0.0006) [2023-03-06 16:04:24,969][04272] Updated weights for policy 0, policy_version 71630 (0.0007) [2023-03-06 16:04:25,770][04272] Updated weights for policy 0, policy_version 71640 (0.0005) [2023-03-06 16:04:26,570][04272] Updated weights for policy 0, policy_version 71650 (0.0006) [2023-03-06 16:04:27,392][04272] Updated weights for policy 0, policy_version 71660 (0.0006) [2023-03-06 16:04:28,189][04272] Updated weights for policy 0, policy_version 71670 (0.0007) [2023-03-06 16:04:28,941][03942] Fps is (10 sec: 12595.0, 60 sec: 12629.3, 300 sec: 12617.8). Total num frames: 73399296. Throughput: 0: 12630.0. Samples: 73378469. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:04:28,941][03942] Avg episode reward: [(0, '921.013')] [2023-03-06 16:04:29,022][04272] Updated weights for policy 0, policy_version 71680 (0.0006) [2023-03-06 16:04:29,835][04272] Updated weights for policy 0, policy_version 71690 (0.0006) [2023-03-06 16:04:30,648][04272] Updated weights for policy 0, policy_version 71700 (0.0007) [2023-03-06 16:04:31,453][04272] Updated weights for policy 0, policy_version 71710 (0.0006) [2023-03-06 16:04:32,267][04272] Updated weights for policy 0, policy_version 71720 (0.0006) [2023-03-06 16:04:33,066][04272] Updated weights for policy 0, policy_version 71730 (0.0006) [2023-03-06 16:04:33,867][04272] Updated weights for policy 0, policy_version 71740 (0.0006) [2023-03-06 16:04:33,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12629.3, 300 sec: 12614.3). Total num frames: 73461760. Throughput: 0: 12624.0. Samples: 73453993. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:04:33,941][03942] Avg episode reward: [(0, '875.944')] [2023-03-06 16:04:34,673][04272] Updated weights for policy 0, policy_version 71750 (0.0006) [2023-03-06 16:04:35,487][04272] Updated weights for policy 0, policy_version 71760 (0.0007) [2023-03-06 16:04:36,306][04272] Updated weights for policy 0, policy_version 71770 (0.0006) [2023-03-06 16:04:37,123][04272] Updated weights for policy 0, policy_version 71780 (0.0006) [2023-03-06 16:04:37,929][04272] Updated weights for policy 0, policy_version 71790 (0.0007) [2023-03-06 16:04:38,753][04272] Updated weights for policy 0, policy_version 71800 (0.0006) [2023-03-06 16:04:38,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12629.4, 300 sec: 12617.8). Total num frames: 73525248. Throughput: 0: 12624.6. Samples: 73492065. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:04:38,941][03942] Avg episode reward: [(0, '923.719')] [2023-03-06 16:04:39,561][04272] Updated weights for policy 0, policy_version 71810 (0.0007) [2023-03-06 16:04:40,356][04272] Updated weights for policy 0, policy_version 71820 (0.0008) [2023-03-06 16:04:41,176][04272] Updated weights for policy 0, policy_version 71830 (0.0005) [2023-03-06 16:04:41,973][04272] Updated weights for policy 0, policy_version 71840 (0.0006) [2023-03-06 16:04:42,786][04272] Updated weights for policy 0, policy_version 71850 (0.0006) [2023-03-06 16:04:43,596][04272] Updated weights for policy 0, policy_version 71860 (0.0006) [2023-03-06 16:04:43,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12629.3, 300 sec: 12617.8). Total num frames: 73588736. Throughput: 0: 12627.0. Samples: 73567941. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:04:43,941][03942] Avg episode reward: [(0, '944.293')] [2023-03-06 16:04:44,403][04272] Updated weights for policy 0, policy_version 71870 (0.0006) [2023-03-06 16:04:45,210][04272] Updated weights for policy 0, policy_version 71880 (0.0007) [2023-03-06 16:04:46,023][04272] Updated weights for policy 0, policy_version 71890 (0.0007) [2023-03-06 16:04:46,841][04272] Updated weights for policy 0, policy_version 71900 (0.0006) [2023-03-06 16:04:47,634][04272] Updated weights for policy 0, policy_version 71910 (0.0006) [2023-03-06 16:04:48,454][04272] Updated weights for policy 0, policy_version 71920 (0.0006) [2023-03-06 16:04:48,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.4, 300 sec: 12617.8). Total num frames: 73651200. Throughput: 0: 12628.8. Samples: 73643954. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:04:48,941][03942] Avg episode reward: [(0, '993.597')] [2023-03-06 16:04:49,281][04272] Updated weights for policy 0, policy_version 71930 (0.0006) [2023-03-06 16:04:50,091][04272] Updated weights for policy 0, policy_version 71940 (0.0008) [2023-03-06 16:04:50,905][04272] Updated weights for policy 0, policy_version 71950 (0.0007) [2023-03-06 16:04:51,706][04272] Updated weights for policy 0, policy_version 71960 (0.0007) [2023-03-06 16:04:52,521][04272] Updated weights for policy 0, policy_version 71970 (0.0007) [2023-03-06 16:04:53,358][04272] Updated weights for policy 0, policy_version 71980 (0.0007) [2023-03-06 16:04:53,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12629.3, 300 sec: 12617.8). Total num frames: 73714688. Throughput: 0: 12623.9. Samples: 73681534. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:04:53,941][03942] Avg episode reward: [(0, '981.260')] [2023-03-06 16:04:54,166][04272] Updated weights for policy 0, policy_version 71990 (0.0006) [2023-03-06 16:04:54,965][04272] Updated weights for policy 0, policy_version 72000 (0.0006) [2023-03-06 16:04:55,784][04272] Updated weights for policy 0, policy_version 72010 (0.0008) [2023-03-06 16:04:56,600][04272] Updated weights for policy 0, policy_version 72020 (0.0007) [2023-03-06 16:04:57,398][04272] Updated weights for policy 0, policy_version 72030 (0.0006) [2023-03-06 16:04:58,214][04272] Updated weights for policy 0, policy_version 72040 (0.0006) [2023-03-06 16:04:58,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12621.2). Total num frames: 73778176. Throughput: 0: 12624.6. Samples: 73757251. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:04:58,941][03942] Avg episode reward: [(0, '1003.507')] [2023-03-06 16:04:59,033][04272] Updated weights for policy 0, policy_version 72050 (0.0006) [2023-03-06 16:04:59,836][04272] Updated weights for policy 0, policy_version 72060 (0.0006) [2023-03-06 16:05:00,653][04272] Updated weights for policy 0, policy_version 72070 (0.0006) [2023-03-06 16:05:01,453][04272] Updated weights for policy 0, policy_version 72080 (0.0006) [2023-03-06 16:05:02,270][04272] Updated weights for policy 0, policy_version 72090 (0.0006) [2023-03-06 16:05:03,103][04272] Updated weights for policy 0, policy_version 72100 (0.0007) [2023-03-06 16:05:03,895][04272] Updated weights for policy 0, policy_version 72110 (0.0006) [2023-03-06 16:05:03,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12617.8). Total num frames: 73840640. Throughput: 0: 12614.2. Samples: 73832725. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:05:03,941][03942] Avg episode reward: [(0, '1066.161')] [2023-03-06 16:05:04,697][04272] Updated weights for policy 0, policy_version 72120 (0.0006) [2023-03-06 16:05:05,527][04272] Updated weights for policy 0, policy_version 72130 (0.0006) [2023-03-06 16:05:06,317][04272] Updated weights for policy 0, policy_version 72140 (0.0006) [2023-03-06 16:05:07,138][04272] Updated weights for policy 0, policy_version 72150 (0.0006) [2023-03-06 16:05:07,971][04272] Updated weights for policy 0, policy_version 72160 (0.0006) [2023-03-06 16:05:08,750][04272] Updated weights for policy 0, policy_version 72170 (0.0007) [2023-03-06 16:05:08,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12617.8). Total num frames: 73904128. Throughput: 0: 12614.9. Samples: 73870537. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:05:08,941][03942] Avg episode reward: [(0, '1143.135')] [2023-03-06 16:05:08,944][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000072172_73904128.pth... [2023-03-06 16:05:08,975][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000069214_70875136.pth [2023-03-06 16:05:09,574][04272] Updated weights for policy 0, policy_version 72180 (0.0006) [2023-03-06 16:05:10,393][04272] Updated weights for policy 0, policy_version 72190 (0.0007) [2023-03-06 16:05:11,201][04272] Updated weights for policy 0, policy_version 72200 (0.0007) [2023-03-06 16:05:12,006][04272] Updated weights for policy 0, policy_version 72210 (0.0006) [2023-03-06 16:05:12,843][04272] Updated weights for policy 0, policy_version 72220 (0.0006) [2023-03-06 16:05:13,645][04272] Updated weights for policy 0, policy_version 72230 (0.0007) [2023-03-06 16:05:13,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12614.3). Total num frames: 73966592. Throughput: 0: 12618.2. Samples: 73946285. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:05:13,941][03942] Avg episode reward: [(0, '1128.465')] [2023-03-06 16:05:14,461][04272] Updated weights for policy 0, policy_version 72240 (0.0006) [2023-03-06 16:05:15,278][04272] Updated weights for policy 0, policy_version 72250 (0.0007) [2023-03-06 16:05:16,091][04272] Updated weights for policy 0, policy_version 72260 (0.0006) [2023-03-06 16:05:16,900][04272] Updated weights for policy 0, policy_version 72270 (0.0006) [2023-03-06 16:05:17,705][04272] Updated weights for policy 0, policy_version 72280 (0.0006) [2023-03-06 16:05:18,526][04272] Updated weights for policy 0, policy_version 72290 (0.0006) [2023-03-06 16:05:18,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.2, 300 sec: 12617.8). Total num frames: 74030080. Throughput: 0: 12619.2. Samples: 74021859. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:05:18,941][03942] Avg episode reward: [(0, '1140.291')] [2023-03-06 16:05:19,356][04272] Updated weights for policy 0, policy_version 72300 (0.0007) [2023-03-06 16:05:20,152][04272] Updated weights for policy 0, policy_version 72310 (0.0006) [2023-03-06 16:05:20,956][04272] Updated weights for policy 0, policy_version 72320 (0.0006) [2023-03-06 16:05:21,755][04272] Updated weights for policy 0, policy_version 72330 (0.0006) [2023-03-06 16:05:22,571][04272] Updated weights for policy 0, policy_version 72340 (0.0007) [2023-03-06 16:05:23,409][04272] Updated weights for policy 0, policy_version 72350 (0.0006) [2023-03-06 16:05:23,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12614.3). Total num frames: 74092544. Throughput: 0: 12616.6. Samples: 74059812. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:05:23,941][03942] Avg episode reward: [(0, '1169.593')] [2023-03-06 16:05:24,208][04272] Updated weights for policy 0, policy_version 72360 (0.0006) [2023-03-06 16:05:25,021][04272] Updated weights for policy 0, policy_version 72370 (0.0006) [2023-03-06 16:05:25,834][04272] Updated weights for policy 0, policy_version 72380 (0.0007) [2023-03-06 16:05:26,654][04272] Updated weights for policy 0, policy_version 72390 (0.0006) [2023-03-06 16:05:27,449][04272] Updated weights for policy 0, policy_version 72400 (0.0006) [2023-03-06 16:05:28,261][04272] Updated weights for policy 0, policy_version 72410 (0.0007) [2023-03-06 16:05:28,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12617.8). Total num frames: 74156032. Throughput: 0: 12610.9. Samples: 74135432. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:05:28,941][03942] Avg episode reward: [(0, '1213.360')] [2023-03-06 16:05:29,078][04272] Updated weights for policy 0, policy_version 72420 (0.0007) [2023-03-06 16:05:29,888][04272] Updated weights for policy 0, policy_version 72430 (0.0007) [2023-03-06 16:05:30,717][04272] Updated weights for policy 0, policy_version 72440 (0.0006) [2023-03-06 16:05:31,529][04272] Updated weights for policy 0, policy_version 72450 (0.0006) [2023-03-06 16:05:32,334][04272] Updated weights for policy 0, policy_version 72460 (0.0006) [2023-03-06 16:05:33,145][04272] Updated weights for policy 0, policy_version 72470 (0.0006) [2023-03-06 16:05:33,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12617.8). Total num frames: 74218496. Throughput: 0: 12597.2. Samples: 74210829. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:05:33,941][03942] Avg episode reward: [(0, '1103.673')] [2023-03-06 16:05:33,952][04272] Updated weights for policy 0, policy_version 72480 (0.0006) [2023-03-06 16:05:34,767][04272] Updated weights for policy 0, policy_version 72490 (0.0007) [2023-03-06 16:05:35,601][04272] Updated weights for policy 0, policy_version 72500 (0.0006) [2023-03-06 16:05:36,410][04272] Updated weights for policy 0, policy_version 72510 (0.0007) [2023-03-06 16:05:37,205][04272] Updated weights for policy 0, policy_version 72520 (0.0008) [2023-03-06 16:05:38,006][04272] Updated weights for policy 0, policy_version 72530 (0.0006) [2023-03-06 16:05:38,817][04272] Updated weights for policy 0, policy_version 72540 (0.0006) [2023-03-06 16:05:38,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12617.8). Total num frames: 74281984. Throughput: 0: 12599.5. Samples: 74248509. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:05:38,952][03942] Avg episode reward: [(0, '1087.755')] [2023-03-06 16:05:39,618][04272] Updated weights for policy 0, policy_version 72550 (0.0006) [2023-03-06 16:05:40,432][04272] Updated weights for policy 0, policy_version 72560 (0.0007) [2023-03-06 16:05:41,262][04272] Updated weights for policy 0, policy_version 72570 (0.0006) [2023-03-06 16:05:42,053][04272] Updated weights for policy 0, policy_version 72580 (0.0006) [2023-03-06 16:05:42,871][04272] Updated weights for policy 0, policy_version 72590 (0.0006) [2023-03-06 16:05:43,680][04272] Updated weights for policy 0, policy_version 72600 (0.0006) [2023-03-06 16:05:43,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12612.3, 300 sec: 12617.8). Total num frames: 74345472. Throughput: 0: 12611.0. Samples: 74324743. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:05:43,941][03942] Avg episode reward: [(0, '1150.502')] [2023-03-06 16:05:44,502][04272] Updated weights for policy 0, policy_version 72610 (0.0006) [2023-03-06 16:05:45,321][04272] Updated weights for policy 0, policy_version 72620 (0.0006) [2023-03-06 16:05:46,130][04272] Updated weights for policy 0, policy_version 72630 (0.0006) [2023-03-06 16:05:46,949][04272] Updated weights for policy 0, policy_version 72640 (0.0006) [2023-03-06 16:05:47,746][04272] Updated weights for policy 0, policy_version 72650 (0.0006) [2023-03-06 16:05:48,566][04272] Updated weights for policy 0, policy_version 72660 (0.0007) [2023-03-06 16:05:48,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12614.3). Total num frames: 74407936. Throughput: 0: 12612.4. Samples: 74400283. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:05:48,941][03942] Avg episode reward: [(0, '1151.779')] [2023-03-06 16:05:49,389][04272] Updated weights for policy 0, policy_version 72670 (0.0007) [2023-03-06 16:05:50,203][04272] Updated weights for policy 0, policy_version 72680 (0.0006) [2023-03-06 16:05:51,021][04272] Updated weights for policy 0, policy_version 72690 (0.0006) [2023-03-06 16:05:51,827][04272] Updated weights for policy 0, policy_version 72700 (0.0006) [2023-03-06 16:05:52,626][04272] Updated weights for policy 0, policy_version 72710 (0.0006) [2023-03-06 16:05:53,443][04272] Updated weights for policy 0, policy_version 72720 (0.0006) [2023-03-06 16:05:53,941][03942] Fps is (10 sec: 12492.7, 60 sec: 12595.2, 300 sec: 12614.3). Total num frames: 74470400. Throughput: 0: 12608.4. Samples: 74437915. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:05:53,941][03942] Avg episode reward: [(0, '1269.011')] [2023-03-06 16:05:54,267][04272] Updated weights for policy 0, policy_version 72730 (0.0007) [2023-03-06 16:05:55,056][04272] Updated weights for policy 0, policy_version 72740 (0.0007) [2023-03-06 16:05:55,882][04272] Updated weights for policy 0, policy_version 72750 (0.0006) [2023-03-06 16:05:56,693][04272] Updated weights for policy 0, policy_version 72760 (0.0006) [2023-03-06 16:05:57,498][04272] Updated weights for policy 0, policy_version 72770 (0.0006) [2023-03-06 16:05:58,306][04272] Updated weights for policy 0, policy_version 72780 (0.0006) [2023-03-06 16:05:58,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12614.3). Total num frames: 74533888. Throughput: 0: 12607.4. Samples: 74513619. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:05:58,941][03942] Avg episode reward: [(0, '1220.762')] [2023-03-06 16:05:59,118][04272] Updated weights for policy 0, policy_version 72790 (0.0006) [2023-03-06 16:05:59,911][04272] Updated weights for policy 0, policy_version 72800 (0.0006) [2023-03-06 16:06:00,743][04272] Updated weights for policy 0, policy_version 72810 (0.0007) [2023-03-06 16:06:01,538][04272] Updated weights for policy 0, policy_version 72820 (0.0006) [2023-03-06 16:06:02,362][04272] Updated weights for policy 0, policy_version 72830 (0.0007) [2023-03-06 16:06:03,170][04272] Updated weights for policy 0, policy_version 72840 (0.0006) [2023-03-06 16:06:03,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12612.3, 300 sec: 12617.8). Total num frames: 74597376. Throughput: 0: 12612.8. Samples: 74589434. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:06:03,952][03942] Avg episode reward: [(0, '1212.325')] [2023-03-06 16:06:03,976][04272] Updated weights for policy 0, policy_version 72850 (0.0006) [2023-03-06 16:06:04,794][04272] Updated weights for policy 0, policy_version 72860 (0.0006) [2023-03-06 16:06:05,615][04272] Updated weights for policy 0, policy_version 72870 (0.0006) [2023-03-06 16:06:06,407][04272] Updated weights for policy 0, policy_version 72880 (0.0006) [2023-03-06 16:06:07,231][04272] Updated weights for policy 0, policy_version 72890 (0.0007) [2023-03-06 16:06:08,051][04272] Updated weights for policy 0, policy_version 72900 (0.0006) [2023-03-06 16:06:08,869][04272] Updated weights for policy 0, policy_version 72910 (0.0007) [2023-03-06 16:06:08,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12612.3, 300 sec: 12617.8). Total num frames: 74660864. Throughput: 0: 12611.5. Samples: 74627331. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:06:08,952][03942] Avg episode reward: [(0, '1146.748')] [2023-03-06 16:06:09,680][04272] Updated weights for policy 0, policy_version 72920 (0.0006) [2023-03-06 16:06:10,483][04272] Updated weights for policy 0, policy_version 72930 (0.0006) [2023-03-06 16:06:11,307][04272] Updated weights for policy 0, policy_version 72940 (0.0007) [2023-03-06 16:06:12,113][04272] Updated weights for policy 0, policy_version 72950 (0.0005) [2023-03-06 16:06:12,947][04272] Updated weights for policy 0, policy_version 72960 (0.0006) [2023-03-06 16:06:13,754][04272] Updated weights for policy 0, policy_version 72970 (0.0006) [2023-03-06 16:06:13,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.2, 300 sec: 12617.8). Total num frames: 74723328. Throughput: 0: 12608.1. Samples: 74702797. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:06:13,952][03942] Avg episode reward: [(0, '1291.642')] [2023-03-06 16:06:14,566][04272] Updated weights for policy 0, policy_version 72980 (0.0006) [2023-03-06 16:06:15,396][04272] Updated weights for policy 0, policy_version 72990 (0.0008) [2023-03-06 16:06:16,194][04272] Updated weights for policy 0, policy_version 73000 (0.0006) [2023-03-06 16:06:17,007][04272] Updated weights for policy 0, policy_version 73010 (0.0006) [2023-03-06 16:06:17,834][04272] Updated weights for policy 0, policy_version 73020 (0.0007) [2023-03-06 16:06:18,631][04272] Updated weights for policy 0, policy_version 73030 (0.0008) [2023-03-06 16:06:18,941][03942] Fps is (10 sec: 12492.8, 60 sec: 12595.2, 300 sec: 12614.3). Total num frames: 74785792. Throughput: 0: 12606.9. Samples: 74778140. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:06:18,952][03942] Avg episode reward: [(0, '1284.977')] [2023-03-06 16:06:19,441][04272] Updated weights for policy 0, policy_version 73040 (0.0007) [2023-03-06 16:06:20,265][04272] Updated weights for policy 0, policy_version 73050 (0.0007) [2023-03-06 16:06:21,062][04272] Updated weights for policy 0, policy_version 73060 (0.0006) [2023-03-06 16:06:21,866][04272] Updated weights for policy 0, policy_version 73070 (0.0006) [2023-03-06 16:06:22,671][04272] Updated weights for policy 0, policy_version 73080 (0.0006) [2023-03-06 16:06:23,497][04272] Updated weights for policy 0, policy_version 73090 (0.0007) [2023-03-06 16:06:23,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12617.8). Total num frames: 74849280. Throughput: 0: 12613.6. Samples: 74816123. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:06:23,952][03942] Avg episode reward: [(0, '1106.395')] [2023-03-06 16:06:24,304][04272] Updated weights for policy 0, policy_version 73100 (0.0006) [2023-03-06 16:06:25,110][04272] Updated weights for policy 0, policy_version 73110 (0.0007) [2023-03-06 16:06:25,934][04272] Updated weights for policy 0, policy_version 73120 (0.0006) [2023-03-06 16:06:26,734][04272] Updated weights for policy 0, policy_version 73130 (0.0006) [2023-03-06 16:06:27,554][04272] Updated weights for policy 0, policy_version 73140 (0.0006) [2023-03-06 16:06:28,374][04272] Updated weights for policy 0, policy_version 73150 (0.0007) [2023-03-06 16:06:28,941][03942] Fps is (10 sec: 12697.7, 60 sec: 12612.3, 300 sec: 12617.8). Total num frames: 74912768. Throughput: 0: 12605.1. Samples: 74891975. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:06:28,952][03942] Avg episode reward: [(0, '1150.629')] [2023-03-06 16:06:29,168][04272] Updated weights for policy 0, policy_version 73160 (0.0006) [2023-03-06 16:06:29,982][04272] Updated weights for policy 0, policy_version 73170 (0.0007) [2023-03-06 16:06:30,803][04272] Updated weights for policy 0, policy_version 73180 (0.0006) [2023-03-06 16:06:31,604][04272] Updated weights for policy 0, policy_version 73190 (0.0007) [2023-03-06 16:06:32,417][04272] Updated weights for policy 0, policy_version 73200 (0.0006) [2023-03-06 16:06:33,214][04272] Updated weights for policy 0, policy_version 73210 (0.0007) [2023-03-06 16:06:33,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12614.3). Total num frames: 74975232. Throughput: 0: 12612.5. Samples: 74967843. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:06:33,951][03942] Avg episode reward: [(0, '1151.455')] [2023-03-06 16:06:34,021][04272] Updated weights for policy 0, policy_version 73220 (0.0007) [2023-03-06 16:06:34,825][04272] Updated weights for policy 0, policy_version 73230 (0.0006) [2023-03-06 16:06:35,625][04272] Updated weights for policy 0, policy_version 73240 (0.0007) [2023-03-06 16:06:36,434][04272] Updated weights for policy 0, policy_version 73250 (0.0008) [2023-03-06 16:06:37,257][04272] Updated weights for policy 0, policy_version 73260 (0.0007) [2023-03-06 16:06:38,065][04272] Updated weights for policy 0, policy_version 73270 (0.0007) [2023-03-06 16:06:38,873][04272] Updated weights for policy 0, policy_version 73280 (0.0006) [2023-03-06 16:06:38,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12617.8). Total num frames: 75039744. Throughput: 0: 12623.8. Samples: 75005985. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:06:38,952][03942] Avg episode reward: [(0, '1247.013')] [2023-03-06 16:06:39,685][04272] Updated weights for policy 0, policy_version 73290 (0.0006) [2023-03-06 16:06:40,486][04272] Updated weights for policy 0, policy_version 73300 (0.0006) [2023-03-06 16:06:41,290][04272] Updated weights for policy 0, policy_version 73310 (0.0006) [2023-03-06 16:06:42,103][04272] Updated weights for policy 0, policy_version 73320 (0.0007) [2023-03-06 16:06:42,914][04272] Updated weights for policy 0, policy_version 73330 (0.0006) [2023-03-06 16:06:43,725][04272] Updated weights for policy 0, policy_version 73340 (0.0006) [2023-03-06 16:06:43,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12612.3, 300 sec: 12617.8). Total num frames: 75102208. Throughput: 0: 12630.2. Samples: 75081976. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:06:43,951][03942] Avg episode reward: [(0, '1185.890')] [2023-03-06 16:06:44,531][04272] Updated weights for policy 0, policy_version 73350 (0.0006) [2023-03-06 16:06:45,345][04272] Updated weights for policy 0, policy_version 73360 (0.0006) [2023-03-06 16:06:46,133][04272] Updated weights for policy 0, policy_version 73370 (0.0006) [2023-03-06 16:06:46,961][04272] Updated weights for policy 0, policy_version 73380 (0.0006) [2023-03-06 16:06:47,770][04272] Updated weights for policy 0, policy_version 73390 (0.0006) [2023-03-06 16:06:48,570][04272] Updated weights for policy 0, policy_version 73400 (0.0006) [2023-03-06 16:06:48,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.4, 300 sec: 12617.8). Total num frames: 75165696. Throughput: 0: 12632.4. Samples: 75157894. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:06:48,951][03942] Avg episode reward: [(0, '1189.639')] [2023-03-06 16:06:49,373][04272] Updated weights for policy 0, policy_version 73410 (0.0006) [2023-03-06 16:06:50,202][04272] Updated weights for policy 0, policy_version 73420 (0.0007) [2023-03-06 16:06:51,013][04272] Updated weights for policy 0, policy_version 73430 (0.0006) [2023-03-06 16:06:51,836][04272] Updated weights for policy 0, policy_version 73440 (0.0006) [2023-03-06 16:06:52,650][04272] Updated weights for policy 0, policy_version 73450 (0.0006) [2023-03-06 16:06:53,465][04272] Updated weights for policy 0, policy_version 73460 (0.0006) [2023-03-06 16:06:53,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12629.3, 300 sec: 12614.3). Total num frames: 75228160. Throughput: 0: 12634.0. Samples: 75195861. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:06:53,941][03942] Avg episode reward: [(0, '1144.222')] [2023-03-06 16:06:54,270][04272] Updated weights for policy 0, policy_version 73470 (0.0006) [2023-03-06 16:06:55,087][04272] Updated weights for policy 0, policy_version 73480 (0.0006) [2023-03-06 16:06:55,892][04272] Updated weights for policy 0, policy_version 73490 (0.0006) [2023-03-06 16:06:56,703][04272] Updated weights for policy 0, policy_version 73500 (0.0006) [2023-03-06 16:06:57,510][04272] Updated weights for policy 0, policy_version 73510 (0.0006) [2023-03-06 16:06:58,318][04272] Updated weights for policy 0, policy_version 73520 (0.0006) [2023-03-06 16:06:58,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12614.3). Total num frames: 75291648. Throughput: 0: 12634.7. Samples: 75271356. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:06:58,951][03942] Avg episode reward: [(0, '1105.823')] [2023-03-06 16:06:59,127][04272] Updated weights for policy 0, policy_version 73530 (0.0006) [2023-03-06 16:06:59,921][04272] Updated weights for policy 0, policy_version 73540 (0.0006) [2023-03-06 16:07:00,749][04272] Updated weights for policy 0, policy_version 73550 (0.0007) [2023-03-06 16:07:01,557][04272] Updated weights for policy 0, policy_version 73560 (0.0006) [2023-03-06 16:07:02,352][04272] Updated weights for policy 0, policy_version 73570 (0.0006) [2023-03-06 16:07:03,178][04272] Updated weights for policy 0, policy_version 73580 (0.0006) [2023-03-06 16:07:03,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12629.3, 300 sec: 12617.8). Total num frames: 75355136. Throughput: 0: 12647.5. Samples: 75347276. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:07:03,951][03942] Avg episode reward: [(0, '1198.841')] [2023-03-06 16:07:03,966][04272] Updated weights for policy 0, policy_version 73590 (0.0007) [2023-03-06 16:07:04,796][04272] Updated weights for policy 0, policy_version 73600 (0.0007) [2023-03-06 16:07:05,607][04272] Updated weights for policy 0, policy_version 73610 (0.0006) [2023-03-06 16:07:06,426][04272] Updated weights for policy 0, policy_version 73620 (0.0007) [2023-03-06 16:07:07,228][04272] Updated weights for policy 0, policy_version 73630 (0.0006) [2023-03-06 16:07:08,014][04272] Updated weights for policy 0, policy_version 73640 (0.0006) [2023-03-06 16:07:08,812][04272] Updated weights for policy 0, policy_version 73650 (0.0006) [2023-03-06 16:07:08,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12629.3, 300 sec: 12617.8). Total num frames: 75418624. Throughput: 0: 12644.7. Samples: 75385135. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:07:08,952][03942] Avg episode reward: [(0, '1203.134')] [2023-03-06 16:07:08,965][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000073652_75419648.pth... [2023-03-06 16:07:08,996][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000070693_72389632.pth [2023-03-06 16:07:09,626][04272] Updated weights for policy 0, policy_version 73660 (0.0006) [2023-03-06 16:07:10,426][04272] Updated weights for policy 0, policy_version 73670 (0.0006) [2023-03-06 16:07:11,246][04272] Updated weights for policy 0, policy_version 73680 (0.0006) [2023-03-06 16:07:12,061][04272] Updated weights for policy 0, policy_version 73690 (0.0007) [2023-03-06 16:07:12,873][04272] Updated weights for policy 0, policy_version 73700 (0.0006) [2023-03-06 16:07:13,669][04272] Updated weights for policy 0, policy_version 73710 (0.0007) [2023-03-06 16:07:13,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12646.4, 300 sec: 12621.2). Total num frames: 75482112. Throughput: 0: 12653.0. Samples: 75461359. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:07:13,952][03942] Avg episode reward: [(0, '1204.626')] [2023-03-06 16:07:14,484][04272] Updated weights for policy 0, policy_version 73720 (0.0006) [2023-03-06 16:07:15,295][04272] Updated weights for policy 0, policy_version 73730 (0.0007) [2023-03-06 16:07:16,085][04272] Updated weights for policy 0, policy_version 73740 (0.0006) [2023-03-06 16:07:16,872][04272] Updated weights for policy 0, policy_version 73750 (0.0006) [2023-03-06 16:07:17,695][04272] Updated weights for policy 0, policy_version 73760 (0.0006) [2023-03-06 16:07:18,524][04272] Updated weights for policy 0, policy_version 73770 (0.0006) [2023-03-06 16:07:18,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12663.5, 300 sec: 12624.7). Total num frames: 75545600. Throughput: 0: 12662.9. Samples: 75537673. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:07:18,952][03942] Avg episode reward: [(0, '1153.292')] [2023-03-06 16:07:19,315][04272] Updated weights for policy 0, policy_version 73780 (0.0007) [2023-03-06 16:07:20,125][04272] Updated weights for policy 0, policy_version 73790 (0.0006) [2023-03-06 16:07:20,933][04272] Updated weights for policy 0, policy_version 73800 (0.0005) [2023-03-06 16:07:21,754][04272] Updated weights for policy 0, policy_version 73810 (0.0007) [2023-03-06 16:07:22,562][04272] Updated weights for policy 0, policy_version 73820 (0.0006) [2023-03-06 16:07:23,373][04272] Updated weights for policy 0, policy_version 73830 (0.0006) [2023-03-06 16:07:23,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12663.5, 300 sec: 12624.7). Total num frames: 75609088. Throughput: 0: 12656.7. Samples: 75575536. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:07:23,952][03942] Avg episode reward: [(0, '1082.542')] [2023-03-06 16:07:24,176][04272] Updated weights for policy 0, policy_version 73840 (0.0007) [2023-03-06 16:07:24,984][04272] Updated weights for policy 0, policy_version 73850 (0.0006) [2023-03-06 16:07:25,801][04272] Updated weights for policy 0, policy_version 73860 (0.0006) [2023-03-06 16:07:26,610][04272] Updated weights for policy 0, policy_version 73870 (0.0006) [2023-03-06 16:07:27,405][04272] Updated weights for policy 0, policy_version 73880 (0.0006) [2023-03-06 16:07:28,218][04272] Updated weights for policy 0, policy_version 73890 (0.0006) [2023-03-06 16:07:28,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12646.4, 300 sec: 12624.7). Total num frames: 75671552. Throughput: 0: 12654.0. Samples: 75651405. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:07:28,951][03942] Avg episode reward: [(0, '1054.845')] [2023-03-06 16:07:29,014][04272] Updated weights for policy 0, policy_version 73900 (0.0006) [2023-03-06 16:07:29,831][04272] Updated weights for policy 0, policy_version 73910 (0.0006) [2023-03-06 16:07:30,623][04272] Updated weights for policy 0, policy_version 73920 (0.0006) [2023-03-06 16:07:31,453][04272] Updated weights for policy 0, policy_version 73930 (0.0006) [2023-03-06 16:07:32,274][04272] Updated weights for policy 0, policy_version 73940 (0.0006) [2023-03-06 16:07:33,082][04272] Updated weights for policy 0, policy_version 73950 (0.0006) [2023-03-06 16:07:33,893][04272] Updated weights for policy 0, policy_version 73960 (0.0006) [2023-03-06 16:07:33,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12663.5, 300 sec: 12624.7). Total num frames: 75735040. Throughput: 0: 12651.3. Samples: 75727203. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:07:33,941][03942] Avg episode reward: [(0, '1142.028')] [2023-03-06 16:07:34,716][04272] Updated weights for policy 0, policy_version 73970 (0.0006) [2023-03-06 16:07:35,510][04272] Updated weights for policy 0, policy_version 73980 (0.0006) [2023-03-06 16:07:36,329][04272] Updated weights for policy 0, policy_version 73990 (0.0006) [2023-03-06 16:07:37,142][04272] Updated weights for policy 0, policy_version 74000 (0.0007) [2023-03-06 16:07:37,944][04272] Updated weights for policy 0, policy_version 74010 (0.0006) [2023-03-06 16:07:38,761][04272] Updated weights for policy 0, policy_version 74020 (0.0007) [2023-03-06 16:07:38,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12646.4, 300 sec: 12628.2). Total num frames: 75798528. Throughput: 0: 12653.7. Samples: 75765277. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:07:38,952][03942] Avg episode reward: [(0, '1081.092')] [2023-03-06 16:07:39,570][04272] Updated weights for policy 0, policy_version 74030 (0.0006) [2023-03-06 16:07:40,370][04272] Updated weights for policy 0, policy_version 74040 (0.0007) [2023-03-06 16:07:41,190][04272] Updated weights for policy 0, policy_version 74050 (0.0006) [2023-03-06 16:07:42,013][04272] Updated weights for policy 0, policy_version 74060 (0.0007) [2023-03-06 16:07:42,829][04272] Updated weights for policy 0, policy_version 74070 (0.0006) [2023-03-06 16:07:43,630][04272] Updated weights for policy 0, policy_version 74080 (0.0006) [2023-03-06 16:07:43,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12646.4, 300 sec: 12624.7). Total num frames: 75860992. Throughput: 0: 12653.8. Samples: 75840777. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:07:43,941][03942] Avg episode reward: [(0, '1003.553')] [2023-03-06 16:07:44,456][04272] Updated weights for policy 0, policy_version 74090 (0.0008) [2023-03-06 16:07:45,250][04272] Updated weights for policy 0, policy_version 74100 (0.0006) [2023-03-06 16:07:46,053][04272] Updated weights for policy 0, policy_version 74110 (0.0006) [2023-03-06 16:07:46,868][04272] Updated weights for policy 0, policy_version 74120 (0.0007) [2023-03-06 16:07:47,693][04272] Updated weights for policy 0, policy_version 74130 (0.0006) [2023-03-06 16:07:48,484][04272] Updated weights for policy 0, policy_version 74140 (0.0005) [2023-03-06 16:07:48,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12646.4, 300 sec: 12624.7). Total num frames: 75924480. Throughput: 0: 12656.3. Samples: 75916811. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 4.0) [2023-03-06 16:07:48,941][03942] Avg episode reward: [(0, '1096.399')] [2023-03-06 16:07:49,301][04272] Updated weights for policy 0, policy_version 74150 (0.0006) [2023-03-06 16:07:50,116][04272] Updated weights for policy 0, policy_version 74160 (0.0006) [2023-03-06 16:07:50,918][04272] Updated weights for policy 0, policy_version 74170 (0.0006) [2023-03-06 16:07:51,740][04272] Updated weights for policy 0, policy_version 74180 (0.0008) [2023-03-06 16:07:52,556][04272] Updated weights for policy 0, policy_version 74190 (0.0006) [2023-03-06 16:07:53,367][04272] Updated weights for policy 0, policy_version 74200 (0.0006) [2023-03-06 16:07:53,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12663.5, 300 sec: 12624.7). Total num frames: 75987968. Throughput: 0: 12655.3. Samples: 75954622. Policy #0 lag: (min: 0.0, avg: 1.4, max: 4.0) [2023-03-06 16:07:53,941][03942] Avg episode reward: [(0, '983.487')] [2023-03-06 16:07:54,155][04272] Updated weights for policy 0, policy_version 74210 (0.0007) [2023-03-06 16:07:54,976][04272] Updated weights for policy 0, policy_version 74220 (0.0007) [2023-03-06 16:07:55,789][04272] Updated weights for policy 0, policy_version 74230 (0.0006) [2023-03-06 16:07:56,581][04272] Updated weights for policy 0, policy_version 74240 (0.0006) [2023-03-06 16:07:57,397][04272] Updated weights for policy 0, policy_version 74250 (0.0006) [2023-03-06 16:07:58,221][04272] Updated weights for policy 0, policy_version 74260 (0.0006) [2023-03-06 16:07:58,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12663.5, 300 sec: 12628.2). Total num frames: 76051456. Throughput: 0: 12645.1. Samples: 76030387. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 4.0) [2023-03-06 16:07:58,941][03942] Avg episode reward: [(0, '1054.205')] [2023-03-06 16:07:59,008][04272] Updated weights for policy 0, policy_version 74270 (0.0006) [2023-03-06 16:07:59,864][04272] Updated weights for policy 0, policy_version 74280 (0.0007) [2023-03-06 16:08:00,653][04272] Updated weights for policy 0, policy_version 74290 (0.0006) [2023-03-06 16:08:01,454][04272] Updated weights for policy 0, policy_version 74300 (0.0007) [2023-03-06 16:08:02,281][04272] Updated weights for policy 0, policy_version 74310 (0.0007) [2023-03-06 16:08:03,085][04272] Updated weights for policy 0, policy_version 74320 (0.0007) [2023-03-06 16:08:03,882][04272] Updated weights for policy 0, policy_version 74330 (0.0006) [2023-03-06 16:08:03,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12646.4, 300 sec: 12624.7). Total num frames: 76113920. Throughput: 0: 12632.4. Samples: 76106131. Policy #0 lag: (min: 0.0, avg: 1.4, max: 4.0) [2023-03-06 16:08:03,941][03942] Avg episode reward: [(0, '1203.956')] [2023-03-06 16:08:04,698][04272] Updated weights for policy 0, policy_version 74340 (0.0006) [2023-03-06 16:08:05,515][04272] Updated weights for policy 0, policy_version 74350 (0.0006) [2023-03-06 16:08:06,349][04272] Updated weights for policy 0, policy_version 74360 (0.0007) [2023-03-06 16:08:07,158][04272] Updated weights for policy 0, policy_version 74370 (0.0006) [2023-03-06 16:08:07,970][04272] Updated weights for policy 0, policy_version 74380 (0.0006) [2023-03-06 16:08:08,802][04272] Updated weights for policy 0, policy_version 74390 (0.0006) [2023-03-06 16:08:08,940][03942] Fps is (10 sec: 12492.8, 60 sec: 12629.3, 300 sec: 12624.7). Total num frames: 76176384. Throughput: 0: 12632.4. Samples: 76143995. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 4.0) [2023-03-06 16:08:08,941][03942] Avg episode reward: [(0, '887.532')] [2023-03-06 16:08:09,599][04272] Updated weights for policy 0, policy_version 74400 (0.0007) [2023-03-06 16:08:10,419][04272] Updated weights for policy 0, policy_version 74410 (0.0006) [2023-03-06 16:08:11,227][04272] Updated weights for policy 0, policy_version 74420 (0.0006) [2023-03-06 16:08:12,031][04272] Updated weights for policy 0, policy_version 74430 (0.0006) [2023-03-06 16:08:12,850][04272] Updated weights for policy 0, policy_version 74440 (0.0007) [2023-03-06 16:08:13,649][04272] Updated weights for policy 0, policy_version 74450 (0.0006) [2023-03-06 16:08:13,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12629.3, 300 sec: 12624.7). Total num frames: 76239872. Throughput: 0: 12625.4. Samples: 76219546. Policy #0 lag: (min: 0.0, avg: 1.4, max: 4.0) [2023-03-06 16:08:13,941][03942] Avg episode reward: [(0, '1053.757')] [2023-03-06 16:08:14,458][04272] Updated weights for policy 0, policy_version 74460 (0.0007) [2023-03-06 16:08:15,270][04272] Updated weights for policy 0, policy_version 74470 (0.0006) [2023-03-06 16:08:16,088][04272] Updated weights for policy 0, policy_version 74480 (0.0006) [2023-03-06 16:08:16,905][04272] Updated weights for policy 0, policy_version 74490 (0.0006) [2023-03-06 16:08:17,714][04272] Updated weights for policy 0, policy_version 74500 (0.0006) [2023-03-06 16:08:18,516][04272] Updated weights for policy 0, policy_version 74510 (0.0006) [2023-03-06 16:08:18,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12628.2). Total num frames: 76303360. Throughput: 0: 12621.2. Samples: 76295158. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 4.0) [2023-03-06 16:08:18,941][03942] Avg episode reward: [(0, '1041.183')] [2023-03-06 16:08:19,326][04272] Updated weights for policy 0, policy_version 74520 (0.0007) [2023-03-06 16:08:20,146][04272] Updated weights for policy 0, policy_version 74530 (0.0007) [2023-03-06 16:08:20,974][04272] Updated weights for policy 0, policy_version 74540 (0.0006) [2023-03-06 16:08:21,778][04272] Updated weights for policy 0, policy_version 74550 (0.0005) [2023-03-06 16:08:22,582][04272] Updated weights for policy 0, policy_version 74560 (0.0007) [2023-03-06 16:08:23,380][04272] Updated weights for policy 0, policy_version 74570 (0.0007) [2023-03-06 16:08:23,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12624.7). Total num frames: 76365824. Throughput: 0: 12613.7. Samples: 76332890. Policy #0 lag: (min: 0.0, avg: 1.4, max: 4.0) [2023-03-06 16:08:23,941][03942] Avg episode reward: [(0, '1056.977')] [2023-03-06 16:08:24,191][04272] Updated weights for policy 0, policy_version 74580 (0.0006) [2023-03-06 16:08:25,007][04272] Updated weights for policy 0, policy_version 74590 (0.0006) [2023-03-06 16:08:25,830][04272] Updated weights for policy 0, policy_version 74600 (0.0007) [2023-03-06 16:08:26,647][04272] Updated weights for policy 0, policy_version 74610 (0.0007) [2023-03-06 16:08:27,467][04272] Updated weights for policy 0, policy_version 74620 (0.0007) [2023-03-06 16:08:28,272][04272] Updated weights for policy 0, policy_version 74630 (0.0007) [2023-03-06 16:08:28,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12628.2). Total num frames: 76429312. Throughput: 0: 12620.1. Samples: 76408684. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 4.0) [2023-03-06 16:08:28,941][03942] Avg episode reward: [(0, '1058.044')] [2023-03-06 16:08:29,093][04272] Updated weights for policy 0, policy_version 74640 (0.0007) [2023-03-06 16:08:29,903][04272] Updated weights for policy 0, policy_version 74650 (0.0006) [2023-03-06 16:08:30,713][04272] Updated weights for policy 0, policy_version 74660 (0.0006) [2023-03-06 16:08:31,549][04272] Updated weights for policy 0, policy_version 74670 (0.0006) [2023-03-06 16:08:32,356][04272] Updated weights for policy 0, policy_version 74680 (0.0006) [2023-03-06 16:08:33,163][04272] Updated weights for policy 0, policy_version 74690 (0.0007) [2023-03-06 16:08:33,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12624.7). Total num frames: 76491776. Throughput: 0: 12605.3. Samples: 76484049. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:08:33,941][03942] Avg episode reward: [(0, '1070.867')] [2023-03-06 16:08:33,965][04272] Updated weights for policy 0, policy_version 74700 (0.0006) [2023-03-06 16:08:34,778][04272] Updated weights for policy 0, policy_version 74710 (0.0006) [2023-03-06 16:08:35,577][04272] Updated weights for policy 0, policy_version 74720 (0.0006) [2023-03-06 16:08:36,396][04272] Updated weights for policy 0, policy_version 74730 (0.0007) [2023-03-06 16:08:37,224][04272] Updated weights for policy 0, policy_version 74740 (0.0006) [2023-03-06 16:08:38,033][04272] Updated weights for policy 0, policy_version 74750 (0.0007) [2023-03-06 16:08:38,837][04272] Updated weights for policy 0, policy_version 74760 (0.0006) [2023-03-06 16:08:38,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12624.7). Total num frames: 76555264. Throughput: 0: 12608.2. Samples: 76521991. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:08:38,941][03942] Avg episode reward: [(0, '1131.995')] [2023-03-06 16:08:39,645][04272] Updated weights for policy 0, policy_version 74770 (0.0006) [2023-03-06 16:08:40,449][04272] Updated weights for policy 0, policy_version 74780 (0.0006) [2023-03-06 16:08:41,272][04272] Updated weights for policy 0, policy_version 74790 (0.0007) [2023-03-06 16:08:42,062][04272] Updated weights for policy 0, policy_version 74800 (0.0006) [2023-03-06 16:08:42,877][04272] Updated weights for policy 0, policy_version 74810 (0.0006) [2023-03-06 16:08:43,687][04272] Updated weights for policy 0, policy_version 74820 (0.0007) [2023-03-06 16:08:43,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12629.3, 300 sec: 12628.2). Total num frames: 76618752. Throughput: 0: 12610.9. Samples: 76597875. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:08:43,941][03942] Avg episode reward: [(0, '1149.461')] [2023-03-06 16:08:44,497][04272] Updated weights for policy 0, policy_version 74830 (0.0007) [2023-03-06 16:08:45,302][04272] Updated weights for policy 0, policy_version 74840 (0.0006) [2023-03-06 16:08:46,116][04272] Updated weights for policy 0, policy_version 74850 (0.0006) [2023-03-06 16:08:46,919][04272] Updated weights for policy 0, policy_version 74860 (0.0006) [2023-03-06 16:08:47,725][04272] Updated weights for policy 0, policy_version 74870 (0.0006) [2023-03-06 16:08:48,520][04272] Updated weights for policy 0, policy_version 74880 (0.0006) [2023-03-06 16:08:48,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12624.7). Total num frames: 76681216. Throughput: 0: 12618.0. Samples: 76673940. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:08:48,941][03942] Avg episode reward: [(0, '1186.447')] [2023-03-06 16:08:49,353][04272] Updated weights for policy 0, policy_version 74890 (0.0006) [2023-03-06 16:08:50,146][04272] Updated weights for policy 0, policy_version 74900 (0.0006) [2023-03-06 16:08:50,975][04272] Updated weights for policy 0, policy_version 74910 (0.0006) [2023-03-06 16:08:51,782][04272] Updated weights for policy 0, policy_version 74920 (0.0006) [2023-03-06 16:08:52,605][04272] Updated weights for policy 0, policy_version 74930 (0.0008) [2023-03-06 16:08:53,414][04272] Updated weights for policy 0, policy_version 74940 (0.0006) [2023-03-06 16:08:53,941][03942] Fps is (10 sec: 12595.0, 60 sec: 12612.3, 300 sec: 12624.7). Total num frames: 76744704. Throughput: 0: 12617.4. Samples: 76711779. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:08:53,941][03942] Avg episode reward: [(0, '1262.313')] [2023-03-06 16:08:54,222][04272] Updated weights for policy 0, policy_version 74950 (0.0006) [2023-03-06 16:08:55,044][04272] Updated weights for policy 0, policy_version 74960 (0.0006) [2023-03-06 16:08:55,853][04272] Updated weights for policy 0, policy_version 74970 (0.0007) [2023-03-06 16:08:56,656][04272] Updated weights for policy 0, policy_version 74980 (0.0006) [2023-03-06 16:08:57,478][04272] Updated weights for policy 0, policy_version 74990 (0.0006) [2023-03-06 16:08:58,269][04272] Updated weights for policy 0, policy_version 75000 (0.0007) [2023-03-06 16:08:58,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12612.3, 300 sec: 12624.7). Total num frames: 76808192. Throughput: 0: 12618.8. Samples: 76787392. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:08:58,941][03942] Avg episode reward: [(0, '1277.628')] [2023-03-06 16:08:59,077][04272] Updated weights for policy 0, policy_version 75010 (0.0007) [2023-03-06 16:08:59,901][04272] Updated weights for policy 0, policy_version 75020 (0.0006) [2023-03-06 16:09:00,691][04272] Updated weights for policy 0, policy_version 75030 (0.0006) [2023-03-06 16:09:01,520][04272] Updated weights for policy 0, policy_version 75040 (0.0007) [2023-03-06 16:09:02,338][04272] Updated weights for policy 0, policy_version 75050 (0.0006) [2023-03-06 16:09:03,142][04272] Updated weights for policy 0, policy_version 75060 (0.0007) [2023-03-06 16:09:03,940][03942] Fps is (10 sec: 12595.4, 60 sec: 12612.3, 300 sec: 12624.7). Total num frames: 76870656. Throughput: 0: 12619.0. Samples: 76863011. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:09:03,941][03942] Avg episode reward: [(0, '1092.132')] [2023-03-06 16:09:03,973][04272] Updated weights for policy 0, policy_version 75070 (0.0006) [2023-03-06 16:09:04,782][04272] Updated weights for policy 0, policy_version 75080 (0.0006) [2023-03-06 16:09:05,575][04272] Updated weights for policy 0, policy_version 75090 (0.0006) [2023-03-06 16:09:06,401][04272] Updated weights for policy 0, policy_version 75100 (0.0006) [2023-03-06 16:09:07,215][04272] Updated weights for policy 0, policy_version 75110 (0.0006) [2023-03-06 16:09:08,029][04272] Updated weights for policy 0, policy_version 75120 (0.0006) [2023-03-06 16:09:08,843][04272] Updated weights for policy 0, policy_version 75130 (0.0006) [2023-03-06 16:09:08,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12624.7). Total num frames: 76934144. Throughput: 0: 12620.6. Samples: 76900819. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:09:08,941][03942] Avg episode reward: [(0, '1231.827')] [2023-03-06 16:09:08,945][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000075131_76934144.pth... [2023-03-06 16:09:08,976][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000072172_73904128.pth [2023-03-06 16:09:09,641][04272] Updated weights for policy 0, policy_version 75140 (0.0007) [2023-03-06 16:09:10,469][04272] Updated weights for policy 0, policy_version 75150 (0.0006) [2023-03-06 16:09:11,281][04272] Updated weights for policy 0, policy_version 75160 (0.0006) [2023-03-06 16:09:12,085][04272] Updated weights for policy 0, policy_version 75170 (0.0006) [2023-03-06 16:09:12,903][04272] Updated weights for policy 0, policy_version 75180 (0.0006) [2023-03-06 16:09:13,722][04272] Updated weights for policy 0, policy_version 75190 (0.0006) [2023-03-06 16:09:13,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12621.2). Total num frames: 76996608. Throughput: 0: 12612.6. Samples: 76976251. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:09:13,941][03942] Avg episode reward: [(0, '1270.976')] [2023-03-06 16:09:14,519][04272] Updated weights for policy 0, policy_version 75200 (0.0007) [2023-03-06 16:09:15,335][04272] Updated weights for policy 0, policy_version 75210 (0.0006) [2023-03-06 16:09:16,132][04272] Updated weights for policy 0, policy_version 75220 (0.0006) [2023-03-06 16:09:16,945][04272] Updated weights for policy 0, policy_version 75230 (0.0006) [2023-03-06 16:09:17,757][04272] Updated weights for policy 0, policy_version 75240 (0.0006) [2023-03-06 16:09:18,585][04272] Updated weights for policy 0, policy_version 75250 (0.0007) [2023-03-06 16:09:18,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12624.7). Total num frames: 77060096. Throughput: 0: 12627.2. Samples: 77052273. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:09:18,941][03942] Avg episode reward: [(0, '1224.573')] [2023-03-06 16:09:19,397][04272] Updated weights for policy 0, policy_version 75260 (0.0006) [2023-03-06 16:09:20,202][04272] Updated weights for policy 0, policy_version 75270 (0.0005) [2023-03-06 16:09:20,999][04272] Updated weights for policy 0, policy_version 75280 (0.0007) [2023-03-06 16:09:21,824][04272] Updated weights for policy 0, policy_version 75290 (0.0006) [2023-03-06 16:09:22,622][04272] Updated weights for policy 0, policy_version 75300 (0.0006) [2023-03-06 16:09:23,432][04272] Updated weights for policy 0, policy_version 75310 (0.0006) [2023-03-06 16:09:23,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12629.3, 300 sec: 12624.7). Total num frames: 77123584. Throughput: 0: 12623.4. Samples: 77090045. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:09:23,941][03942] Avg episode reward: [(0, '1250.601')] [2023-03-06 16:09:24,257][04272] Updated weights for policy 0, policy_version 75320 (0.0007) [2023-03-06 16:09:25,066][04272] Updated weights for policy 0, policy_version 75330 (0.0006) [2023-03-06 16:09:25,878][04272] Updated weights for policy 0, policy_version 75340 (0.0007) [2023-03-06 16:09:26,692][04272] Updated weights for policy 0, policy_version 75350 (0.0006) [2023-03-06 16:09:27,497][04272] Updated weights for policy 0, policy_version 75360 (0.0007) [2023-03-06 16:09:28,309][04272] Updated weights for policy 0, policy_version 75370 (0.0006) [2023-03-06 16:09:28,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12624.7). Total num frames: 77186048. Throughput: 0: 12622.0. Samples: 77165867. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:09:28,941][03942] Avg episode reward: [(0, '1191.486')] [2023-03-06 16:09:29,109][04272] Updated weights for policy 0, policy_version 75380 (0.0006) [2023-03-06 16:09:29,937][04272] Updated weights for policy 0, policy_version 75390 (0.0006) [2023-03-06 16:09:30,731][04272] Updated weights for policy 0, policy_version 75400 (0.0007) [2023-03-06 16:09:31,527][04272] Updated weights for policy 0, policy_version 75410 (0.0006) [2023-03-06 16:09:32,353][04272] Updated weights for policy 0, policy_version 75420 (0.0007) [2023-03-06 16:09:33,174][04272] Updated weights for policy 0, policy_version 75430 (0.0006) [2023-03-06 16:09:33,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12629.3, 300 sec: 12624.7). Total num frames: 77249536. Throughput: 0: 12616.7. Samples: 77241693. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:09:33,941][03942] Avg episode reward: [(0, '1170.657')] [2023-03-06 16:09:33,988][04272] Updated weights for policy 0, policy_version 75440 (0.0007) [2023-03-06 16:09:34,803][04272] Updated weights for policy 0, policy_version 75450 (0.0006) [2023-03-06 16:09:35,618][04272] Updated weights for policy 0, policy_version 75460 (0.0007) [2023-03-06 16:09:36,432][04272] Updated weights for policy 0, policy_version 75470 (0.0006) [2023-03-06 16:09:37,243][04272] Updated weights for policy 0, policy_version 75480 (0.0006) [2023-03-06 16:09:38,049][04272] Updated weights for policy 0, policy_version 75490 (0.0008) [2023-03-06 16:09:38,862][04272] Updated weights for policy 0, policy_version 75500 (0.0007) [2023-03-06 16:09:38,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12621.2). Total num frames: 77312000. Throughput: 0: 12613.8. Samples: 77279396. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:09:38,941][03942] Avg episode reward: [(0, '1206.619')] [2023-03-06 16:09:39,679][04272] Updated weights for policy 0, policy_version 75510 (0.0007) [2023-03-06 16:09:40,491][04272] Updated weights for policy 0, policy_version 75520 (0.0006) [2023-03-06 16:09:41,301][04272] Updated weights for policy 0, policy_version 75530 (0.0006) [2023-03-06 16:09:42,114][04272] Updated weights for policy 0, policy_version 75540 (0.0007) [2023-03-06 16:09:42,937][04272] Updated weights for policy 0, policy_version 75550 (0.0006) [2023-03-06 16:09:43,738][04272] Updated weights for policy 0, policy_version 75560 (0.0006) [2023-03-06 16:09:43,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12624.7). Total num frames: 77375488. Throughput: 0: 12615.6. Samples: 77355092. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:09:43,941][03942] Avg episode reward: [(0, '976.262')] [2023-03-06 16:09:44,546][04272] Updated weights for policy 0, policy_version 75570 (0.0006) [2023-03-06 16:09:45,350][04272] Updated weights for policy 0, policy_version 75580 (0.0006) [2023-03-06 16:09:46,150][04272] Updated weights for policy 0, policy_version 75590 (0.0006) [2023-03-06 16:09:46,967][04272] Updated weights for policy 0, policy_version 75600 (0.0006) [2023-03-06 16:09:47,764][04272] Updated weights for policy 0, policy_version 75610 (0.0007) [2023-03-06 16:09:48,570][04272] Updated weights for policy 0, policy_version 75620 (0.0006) [2023-03-06 16:09:48,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12629.3, 300 sec: 12624.7). Total num frames: 77438976. Throughput: 0: 12624.1. Samples: 77431097. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:09:48,941][03942] Avg episode reward: [(0, '1184.215')] [2023-03-06 16:09:49,366][04272] Updated weights for policy 0, policy_version 75630 (0.0006) [2023-03-06 16:09:50,174][04272] Updated weights for policy 0, policy_version 75640 (0.0006) [2023-03-06 16:09:51,009][04272] Updated weights for policy 0, policy_version 75650 (0.0006) [2023-03-06 16:09:51,822][04272] Updated weights for policy 0, policy_version 75660 (0.0006) [2023-03-06 16:09:52,614][04272] Updated weights for policy 0, policy_version 75670 (0.0006) [2023-03-06 16:09:53,437][04272] Updated weights for policy 0, policy_version 75680 (0.0007) [2023-03-06 16:09:53,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.4, 300 sec: 12624.7). Total num frames: 77502464. Throughput: 0: 12626.0. Samples: 77468991. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:09:53,941][03942] Avg episode reward: [(0, '1248.773')] [2023-03-06 16:09:54,260][04272] Updated weights for policy 0, policy_version 75690 (0.0006) [2023-03-06 16:09:55,059][04272] Updated weights for policy 0, policy_version 75700 (0.0006) [2023-03-06 16:09:55,870][04272] Updated weights for policy 0, policy_version 75710 (0.0006) [2023-03-06 16:09:56,674][04272] Updated weights for policy 0, policy_version 75720 (0.0007) [2023-03-06 16:09:57,501][04272] Updated weights for policy 0, policy_version 75730 (0.0006) [2023-03-06 16:09:58,308][04272] Updated weights for policy 0, policy_version 75740 (0.0006) [2023-03-06 16:09:58,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12624.7). Total num frames: 77564928. Throughput: 0: 12633.9. Samples: 77544777. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:09:58,941][03942] Avg episode reward: [(0, '1318.912')] [2023-03-06 16:09:59,105][04272] Updated weights for policy 0, policy_version 75750 (0.0005) [2023-03-06 16:09:59,926][04272] Updated weights for policy 0, policy_version 75760 (0.0007) [2023-03-06 16:10:00,722][04272] Updated weights for policy 0, policy_version 75770 (0.0007) [2023-03-06 16:10:01,540][04272] Updated weights for policy 0, policy_version 75780 (0.0006) [2023-03-06 16:10:02,360][04272] Updated weights for policy 0, policy_version 75790 (0.0007) [2023-03-06 16:10:03,158][04272] Updated weights for policy 0, policy_version 75800 (0.0006) [2023-03-06 16:10:03,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12624.7). Total num frames: 77628416. Throughput: 0: 12630.5. Samples: 77620647. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:10:03,941][03942] Avg episode reward: [(0, '1276.483')] [2023-03-06 16:10:03,971][04272] Updated weights for policy 0, policy_version 75810 (0.0006) [2023-03-06 16:10:04,789][04272] Updated weights for policy 0, policy_version 75820 (0.0006) [2023-03-06 16:10:05,611][04272] Updated weights for policy 0, policy_version 75830 (0.0006) [2023-03-06 16:10:06,415][04272] Updated weights for policy 0, policy_version 75840 (0.0006) [2023-03-06 16:10:07,231][04272] Updated weights for policy 0, policy_version 75850 (0.0007) [2023-03-06 16:10:08,033][04272] Updated weights for policy 0, policy_version 75860 (0.0006) [2023-03-06 16:10:08,835][04272] Updated weights for policy 0, policy_version 75870 (0.0006) [2023-03-06 16:10:08,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12628.2). Total num frames: 77691904. Throughput: 0: 12629.5. Samples: 77658373. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:10:08,941][03942] Avg episode reward: [(0, '1295.614')] [2023-03-06 16:10:09,641][04272] Updated weights for policy 0, policy_version 75880 (0.0006) [2023-03-06 16:10:10,474][04272] Updated weights for policy 0, policy_version 75890 (0.0007) [2023-03-06 16:10:11,280][04272] Updated weights for policy 0, policy_version 75900 (0.0006) [2023-03-06 16:10:12,088][04272] Updated weights for policy 0, policy_version 75910 (0.0006) [2023-03-06 16:10:12,908][04272] Updated weights for policy 0, policy_version 75920 (0.0007) [2023-03-06 16:10:13,694][04272] Updated weights for policy 0, policy_version 75930 (0.0006) [2023-03-06 16:10:13,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12624.7). Total num frames: 77754368. Throughput: 0: 12628.3. Samples: 77734141. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:10:13,941][03942] Avg episode reward: [(0, '1233.087')] [2023-03-06 16:10:14,518][04272] Updated weights for policy 0, policy_version 75940 (0.0007) [2023-03-06 16:10:15,326][04272] Updated weights for policy 0, policy_version 75950 (0.0006) [2023-03-06 16:10:16,138][04272] Updated weights for policy 0, policy_version 75960 (0.0007) [2023-03-06 16:10:16,934][04272] Updated weights for policy 0, policy_version 75970 (0.0006) [2023-03-06 16:10:17,742][04272] Updated weights for policy 0, policy_version 75980 (0.0007) [2023-03-06 16:10:18,548][04272] Updated weights for policy 0, policy_version 75990 (0.0006) [2023-03-06 16:10:18,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12628.2). Total num frames: 77817856. Throughput: 0: 12635.3. Samples: 77810281. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:10:18,941][03942] Avg episode reward: [(0, '1275.586')] [2023-03-06 16:10:19,354][04272] Updated weights for policy 0, policy_version 76000 (0.0006) [2023-03-06 16:10:20,169][04272] Updated weights for policy 0, policy_version 76010 (0.0006) [2023-03-06 16:10:20,991][04272] Updated weights for policy 0, policy_version 76020 (0.0007) [2023-03-06 16:10:21,786][04272] Updated weights for policy 0, policy_version 76030 (0.0006) [2023-03-06 16:10:22,588][04272] Updated weights for policy 0, policy_version 76040 (0.0007) [2023-03-06 16:10:23,403][04272] Updated weights for policy 0, policy_version 76050 (0.0007) [2023-03-06 16:10:23,941][03942] Fps is (10 sec: 12697.4, 60 sec: 12629.3, 300 sec: 12628.2). Total num frames: 77881344. Throughput: 0: 12641.6. Samples: 77848268. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:10:23,941][03942] Avg episode reward: [(0, '1245.284')] [2023-03-06 16:10:24,209][04272] Updated weights for policy 0, policy_version 76060 (0.0006) [2023-03-06 16:10:25,024][04272] Updated weights for policy 0, policy_version 76070 (0.0006) [2023-03-06 16:10:25,831][04272] Updated weights for policy 0, policy_version 76080 (0.0006) [2023-03-06 16:10:26,633][04272] Updated weights for policy 0, policy_version 76090 (0.0006) [2023-03-06 16:10:27,449][04272] Updated weights for policy 0, policy_version 76100 (0.0006) [2023-03-06 16:10:28,258][04272] Updated weights for policy 0, policy_version 76110 (0.0006) [2023-03-06 16:10:28,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12646.4, 300 sec: 12631.6). Total num frames: 77944832. Throughput: 0: 12645.5. Samples: 77924138. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:10:28,941][03942] Avg episode reward: [(0, '1286.458')] [2023-03-06 16:10:29,078][04272] Updated weights for policy 0, policy_version 76120 (0.0006) [2023-03-06 16:10:29,890][04272] Updated weights for policy 0, policy_version 76130 (0.0006) [2023-03-06 16:10:30,714][04272] Updated weights for policy 0, policy_version 76140 (0.0006) [2023-03-06 16:10:31,508][04272] Updated weights for policy 0, policy_version 76150 (0.0006) [2023-03-06 16:10:32,334][04272] Updated weights for policy 0, policy_version 76160 (0.0006) [2023-03-06 16:10:33,145][04272] Updated weights for policy 0, policy_version 76170 (0.0007) [2023-03-06 16:10:33,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12628.2). Total num frames: 78007296. Throughput: 0: 12636.8. Samples: 77999753. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:10:33,941][03942] Avg episode reward: [(0, '1215.468')] [2023-03-06 16:10:33,947][04272] Updated weights for policy 0, policy_version 76180 (0.0006) [2023-03-06 16:10:34,785][04272] Updated weights for policy 0, policy_version 76190 (0.0006) [2023-03-06 16:10:35,576][04272] Updated weights for policy 0, policy_version 76200 (0.0006) [2023-03-06 16:10:36,379][04272] Updated weights for policy 0, policy_version 76210 (0.0006) [2023-03-06 16:10:37,197][04272] Updated weights for policy 0, policy_version 76220 (0.0006) [2023-03-06 16:10:38,009][04272] Updated weights for policy 0, policy_version 76230 (0.0006) [2023-03-06 16:10:38,824][04272] Updated weights for policy 0, policy_version 76240 (0.0006) [2023-03-06 16:10:38,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12646.4, 300 sec: 12628.2). Total num frames: 78070784. Throughput: 0: 12638.4. Samples: 78037720. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:10:38,941][03942] Avg episode reward: [(0, '1266.904')] [2023-03-06 16:10:39,638][04272] Updated weights for policy 0, policy_version 76250 (0.0007) [2023-03-06 16:10:40,443][04272] Updated weights for policy 0, policy_version 76260 (0.0006) [2023-03-06 16:10:41,267][04272] Updated weights for policy 0, policy_version 76270 (0.0007) [2023-03-06 16:10:42,082][04272] Updated weights for policy 0, policy_version 76280 (0.0006) [2023-03-06 16:10:42,887][04272] Updated weights for policy 0, policy_version 76290 (0.0006) [2023-03-06 16:10:43,694][04272] Updated weights for policy 0, policy_version 76300 (0.0006) [2023-03-06 16:10:43,940][03942] Fps is (10 sec: 12697.8, 60 sec: 12646.4, 300 sec: 12631.7). Total num frames: 78134272. Throughput: 0: 12632.6. Samples: 78113245. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:10:43,941][03942] Avg episode reward: [(0, '1206.480')] [2023-03-06 16:10:44,506][04272] Updated weights for policy 0, policy_version 76310 (0.0006) [2023-03-06 16:10:45,313][04272] Updated weights for policy 0, policy_version 76320 (0.0006) [2023-03-06 16:10:46,121][04272] Updated weights for policy 0, policy_version 76330 (0.0006) [2023-03-06 16:10:46,916][04272] Updated weights for policy 0, policy_version 76340 (0.0007) [2023-03-06 16:10:47,748][04272] Updated weights for policy 0, policy_version 76350 (0.0006) [2023-03-06 16:10:48,557][04272] Updated weights for policy 0, policy_version 76360 (0.0006) [2023-03-06 16:10:48,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12646.4, 300 sec: 12635.1). Total num frames: 78197760. Throughput: 0: 12636.3. Samples: 78189280. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:10:48,941][03942] Avg episode reward: [(0, '1260.449')] [2023-03-06 16:10:49,358][04272] Updated weights for policy 0, policy_version 76370 (0.0006) [2023-03-06 16:10:50,178][04272] Updated weights for policy 0, policy_version 76380 (0.0007) [2023-03-06 16:10:50,989][04272] Updated weights for policy 0, policy_version 76390 (0.0007) [2023-03-06 16:10:51,816][04272] Updated weights for policy 0, policy_version 76400 (0.0007) [2023-03-06 16:10:52,623][04272] Updated weights for policy 0, policy_version 76410 (0.0008) [2023-03-06 16:10:53,434][04272] Updated weights for policy 0, policy_version 76420 (0.0006) [2023-03-06 16:10:53,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12631.7). Total num frames: 78260224. Throughput: 0: 12639.8. Samples: 78227165. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:10:53,941][03942] Avg episode reward: [(0, '1262.767')] [2023-03-06 16:10:54,244][04272] Updated weights for policy 0, policy_version 76430 (0.0006) [2023-03-06 16:10:55,050][04272] Updated weights for policy 0, policy_version 76440 (0.0006) [2023-03-06 16:10:55,863][04272] Updated weights for policy 0, policy_version 76450 (0.0007) [2023-03-06 16:10:56,678][04272] Updated weights for policy 0, policy_version 76460 (0.0006) [2023-03-06 16:10:57,510][04272] Updated weights for policy 0, policy_version 76470 (0.0006) [2023-03-06 16:10:58,306][04272] Updated weights for policy 0, policy_version 76480 (0.0006) [2023-03-06 16:10:58,941][03942] Fps is (10 sec: 12492.7, 60 sec: 12629.3, 300 sec: 12628.2). Total num frames: 78322688. Throughput: 0: 12631.0. Samples: 78302537. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:10:58,941][03942] Avg episode reward: [(0, '1170.169')] [2023-03-06 16:10:59,114][04272] Updated weights for policy 0, policy_version 76490 (0.0006) [2023-03-06 16:10:59,936][04272] Updated weights for policy 0, policy_version 76500 (0.0007) [2023-03-06 16:11:00,756][04272] Updated weights for policy 0, policy_version 76510 (0.0006) [2023-03-06 16:11:01,549][04272] Updated weights for policy 0, policy_version 76520 (0.0006) [2023-03-06 16:11:02,362][04272] Updated weights for policy 0, policy_version 76530 (0.0006) [2023-03-06 16:11:03,177][04272] Updated weights for policy 0, policy_version 76540 (0.0006) [2023-03-06 16:11:03,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12629.3, 300 sec: 12628.2). Total num frames: 78386176. Throughput: 0: 12623.4. Samples: 78378333. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:11:03,941][03942] Avg episode reward: [(0, '1272.213')] [2023-03-06 16:11:03,972][04272] Updated weights for policy 0, policy_version 76550 (0.0006) [2023-03-06 16:11:04,787][04272] Updated weights for policy 0, policy_version 76560 (0.0008) [2023-03-06 16:11:05,622][04272] Updated weights for policy 0, policy_version 76570 (0.0006) [2023-03-06 16:11:06,414][04272] Updated weights for policy 0, policy_version 76580 (0.0006) [2023-03-06 16:11:07,213][04272] Updated weights for policy 0, policy_version 76590 (0.0006) [2023-03-06 16:11:08,046][04272] Updated weights for policy 0, policy_version 76600 (0.0007) [2023-03-06 16:11:08,845][04272] Updated weights for policy 0, policy_version 76610 (0.0007) [2023-03-06 16:11:08,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12631.6). Total num frames: 78449664. Throughput: 0: 12621.7. Samples: 78416246. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:11:08,941][03942] Avg episode reward: [(0, '1192.051')] [2023-03-06 16:11:08,945][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000076611_78449664.pth... [2023-03-06 16:11:08,977][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000073652_75419648.pth [2023-03-06 16:11:09,664][04272] Updated weights for policy 0, policy_version 76620 (0.0007) [2023-03-06 16:11:10,469][04272] Updated weights for policy 0, policy_version 76630 (0.0007) [2023-03-06 16:11:11,271][04272] Updated weights for policy 0, policy_version 76640 (0.0007) [2023-03-06 16:11:12,078][04272] Updated weights for policy 0, policy_version 76650 (0.0007) [2023-03-06 16:11:12,914][04272] Updated weights for policy 0, policy_version 76660 (0.0006) [2023-03-06 16:11:13,706][04272] Updated weights for policy 0, policy_version 76670 (0.0006) [2023-03-06 16:11:13,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12646.4, 300 sec: 12635.1). Total num frames: 78513152. Throughput: 0: 12623.2. Samples: 78492181. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:11:13,941][03942] Avg episode reward: [(0, '1146.197')] [2023-03-06 16:11:14,528][04272] Updated weights for policy 0, policy_version 76680 (0.0007) [2023-03-06 16:11:15,344][04272] Updated weights for policy 0, policy_version 76690 (0.0006) [2023-03-06 16:11:16,147][04272] Updated weights for policy 0, policy_version 76700 (0.0006) [2023-03-06 16:11:16,963][04272] Updated weights for policy 0, policy_version 76710 (0.0005) [2023-03-06 16:11:17,771][04272] Updated weights for policy 0, policy_version 76720 (0.0007) [2023-03-06 16:11:18,579][04272] Updated weights for policy 0, policy_version 76730 (0.0007) [2023-03-06 16:11:18,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12631.6). Total num frames: 78575616. Throughput: 0: 12621.5. Samples: 78567721. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:11:18,941][03942] Avg episode reward: [(0, '1125.371')] [2023-03-06 16:11:19,371][04272] Updated weights for policy 0, policy_version 76740 (0.0006) [2023-03-06 16:11:20,174][04272] Updated weights for policy 0, policy_version 76750 (0.0006) [2023-03-06 16:11:20,989][04272] Updated weights for policy 0, policy_version 76760 (0.0007) [2023-03-06 16:11:21,781][04272] Updated weights for policy 0, policy_version 76770 (0.0006) [2023-03-06 16:11:22,591][04272] Updated weights for policy 0, policy_version 76780 (0.0007) [2023-03-06 16:11:23,401][04272] Updated weights for policy 0, policy_version 76790 (0.0006) [2023-03-06 16:11:23,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12631.6). Total num frames: 78639104. Throughput: 0: 12625.8. Samples: 78605881. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:11:23,941][03942] Avg episode reward: [(0, '1172.659')] [2023-03-06 16:11:24,217][04272] Updated weights for policy 0, policy_version 76800 (0.0006) [2023-03-06 16:11:25,037][04272] Updated weights for policy 0, policy_version 76810 (0.0007) [2023-03-06 16:11:25,853][04272] Updated weights for policy 0, policy_version 76820 (0.0007) [2023-03-06 16:11:26,674][04272] Updated weights for policy 0, policy_version 76830 (0.0006) [2023-03-06 16:11:27,487][04272] Updated weights for policy 0, policy_version 76840 (0.0006) [2023-03-06 16:11:28,293][04272] Updated weights for policy 0, policy_version 76850 (0.0006) [2023-03-06 16:11:28,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12635.1). Total num frames: 78702592. Throughput: 0: 12631.0. Samples: 78681640. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:11:28,941][03942] Avg episode reward: [(0, '1021.524')] [2023-03-06 16:11:29,091][04272] Updated weights for policy 0, policy_version 76860 (0.0007) [2023-03-06 16:11:29,911][04272] Updated weights for policy 0, policy_version 76870 (0.0006) [2023-03-06 16:11:30,723][04272] Updated weights for policy 0, policy_version 76880 (0.0006) [2023-03-06 16:11:31,543][04272] Updated weights for policy 0, policy_version 76890 (0.0006) [2023-03-06 16:11:32,349][04272] Updated weights for policy 0, policy_version 76900 (0.0006) [2023-03-06 16:11:33,157][04272] Updated weights for policy 0, policy_version 76910 (0.0006) [2023-03-06 16:11:33,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.4, 300 sec: 12628.2). Total num frames: 78765056. Throughput: 0: 12621.8. Samples: 78757259. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:11:33,941][03942] Avg episode reward: [(0, '1152.913')] [2023-03-06 16:11:33,985][04272] Updated weights for policy 0, policy_version 76920 (0.0007) [2023-03-06 16:11:34,797][04272] Updated weights for policy 0, policy_version 76930 (0.0006) [2023-03-06 16:11:35,621][04272] Updated weights for policy 0, policy_version 76940 (0.0007) [2023-03-06 16:11:36,444][04272] Updated weights for policy 0, policy_version 76950 (0.0006) [2023-03-06 16:11:37,243][04272] Updated weights for policy 0, policy_version 76960 (0.0006) [2023-03-06 16:11:38,055][04272] Updated weights for policy 0, policy_version 76970 (0.0006) [2023-03-06 16:11:38,857][04272] Updated weights for policy 0, policy_version 76980 (0.0006) [2023-03-06 16:11:38,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12629.3, 300 sec: 12631.6). Total num frames: 78828544. Throughput: 0: 12614.4. Samples: 78794813. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:11:38,941][03942] Avg episode reward: [(0, '1155.929')] [2023-03-06 16:11:39,655][04272] Updated weights for policy 0, policy_version 76990 (0.0006) [2023-03-06 16:11:40,484][04272] Updated weights for policy 0, policy_version 77000 (0.0007) [2023-03-06 16:11:41,275][04272] Updated weights for policy 0, policy_version 77010 (0.0006) [2023-03-06 16:11:42,071][04272] Updated weights for policy 0, policy_version 77020 (0.0006) [2023-03-06 16:11:42,892][04272] Updated weights for policy 0, policy_version 77030 (0.0007) [2023-03-06 16:11:43,701][04272] Updated weights for policy 0, policy_version 77040 (0.0006) [2023-03-06 16:11:43,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12628.2). Total num frames: 78891008. Throughput: 0: 12633.2. Samples: 78871032. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:11:43,941][03942] Avg episode reward: [(0, '1190.188')] [2023-03-06 16:11:44,498][04272] Updated weights for policy 0, policy_version 77050 (0.0005) [2023-03-06 16:11:45,329][04272] Updated weights for policy 0, policy_version 77060 (0.0006) [2023-03-06 16:11:46,151][04272] Updated weights for policy 0, policy_version 77070 (0.0006) [2023-03-06 16:11:46,955][04272] Updated weights for policy 0, policy_version 77080 (0.0006) [2023-03-06 16:11:47,763][04272] Updated weights for policy 0, policy_version 77090 (0.0006) [2023-03-06 16:11:48,572][04272] Updated weights for policy 0, policy_version 77100 (0.0006) [2023-03-06 16:11:48,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12631.6). Total num frames: 78954496. Throughput: 0: 12629.2. Samples: 78946646. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:11:48,941][03942] Avg episode reward: [(0, '1292.264')] [2023-03-06 16:11:49,368][04272] Updated weights for policy 0, policy_version 77110 (0.0006) [2023-03-06 16:11:50,186][04272] Updated weights for policy 0, policy_version 77120 (0.0006) [2023-03-06 16:11:50,998][04272] Updated weights for policy 0, policy_version 77130 (0.0006) [2023-03-06 16:11:51,798][04272] Updated weights for policy 0, policy_version 77140 (0.0006) [2023-03-06 16:11:52,633][04272] Updated weights for policy 0, policy_version 77150 (0.0007) [2023-03-06 16:11:53,445][04272] Updated weights for policy 0, policy_version 77160 (0.0007) [2023-03-06 16:11:53,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12631.6). Total num frames: 79017984. Throughput: 0: 12631.8. Samples: 78984677. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:11:53,941][03942] Avg episode reward: [(0, '1209.833')] [2023-03-06 16:11:54,248][04272] Updated weights for policy 0, policy_version 77170 (0.0006) [2023-03-06 16:11:55,073][04272] Updated weights for policy 0, policy_version 77180 (0.0007) [2023-03-06 16:11:55,868][04272] Updated weights for policy 0, policy_version 77190 (0.0007) [2023-03-06 16:11:56,677][04272] Updated weights for policy 0, policy_version 77200 (0.0007) [2023-03-06 16:11:57,484][04272] Updated weights for policy 0, policy_version 77210 (0.0007) [2023-03-06 16:11:58,292][04272] Updated weights for policy 0, policy_version 77220 (0.0007) [2023-03-06 16:11:58,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12629.3, 300 sec: 12628.2). Total num frames: 79080448. Throughput: 0: 12625.0. Samples: 79060309. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:11:58,941][03942] Avg episode reward: [(0, '1248.644')] [2023-03-06 16:11:59,117][04272] Updated weights for policy 0, policy_version 77230 (0.0006) [2023-03-06 16:11:59,920][04272] Updated weights for policy 0, policy_version 77240 (0.0006) [2023-03-06 16:12:00,741][04272] Updated weights for policy 0, policy_version 77250 (0.0007) [2023-03-06 16:12:01,549][04272] Updated weights for policy 0, policy_version 77260 (0.0007) [2023-03-06 16:12:02,382][04272] Updated weights for policy 0, policy_version 77270 (0.0006) [2023-03-06 16:12:03,207][04272] Updated weights for policy 0, policy_version 77280 (0.0006) [2023-03-06 16:12:03,940][03942] Fps is (10 sec: 12492.8, 60 sec: 12612.3, 300 sec: 12624.7). Total num frames: 79142912. Throughput: 0: 12621.7. Samples: 79135697. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:12:03,941][03942] Avg episode reward: [(0, '1192.841')] [2023-03-06 16:12:04,018][04272] Updated weights for policy 0, policy_version 77290 (0.0006) [2023-03-06 16:12:04,811][04272] Updated weights for policy 0, policy_version 77300 (0.0006) [2023-03-06 16:12:05,636][04272] Updated weights for policy 0, policy_version 77310 (0.0007) [2023-03-06 16:12:06,449][04272] Updated weights for policy 0, policy_version 77320 (0.0007) [2023-03-06 16:12:07,245][04272] Updated weights for policy 0, policy_version 77330 (0.0006) [2023-03-06 16:12:08,066][04272] Updated weights for policy 0, policy_version 77340 (0.0006) [2023-03-06 16:12:08,868][04272] Updated weights for policy 0, policy_version 77350 (0.0006) [2023-03-06 16:12:08,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12624.7). Total num frames: 79206400. Throughput: 0: 12616.3. Samples: 79173616. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:12:08,941][03942] Avg episode reward: [(0, '1264.333')] [2023-03-06 16:12:09,680][04272] Updated weights for policy 0, policy_version 77360 (0.0007) [2023-03-06 16:12:10,490][04272] Updated weights for policy 0, policy_version 77370 (0.0008) [2023-03-06 16:12:11,301][04272] Updated weights for policy 0, policy_version 77380 (0.0006) [2023-03-06 16:12:12,112][04272] Updated weights for policy 0, policy_version 77390 (0.0006) [2023-03-06 16:12:12,933][04272] Updated weights for policy 0, policy_version 77400 (0.0006) [2023-03-06 16:12:13,745][04272] Updated weights for policy 0, policy_version 77410 (0.0008) [2023-03-06 16:12:13,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12612.3, 300 sec: 12624.7). Total num frames: 79269888. Throughput: 0: 12617.0. Samples: 79249405. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:12:13,941][03942] Avg episode reward: [(0, '1082.185')] [2023-03-06 16:12:14,538][04272] Updated weights for policy 0, policy_version 77420 (0.0006) [2023-03-06 16:12:15,346][04272] Updated weights for policy 0, policy_version 77430 (0.0007) [2023-03-06 16:12:16,149][04272] Updated weights for policy 0, policy_version 77440 (0.0006) [2023-03-06 16:12:16,938][04272] Updated weights for policy 0, policy_version 77450 (0.0006) [2023-03-06 16:12:17,766][04272] Updated weights for policy 0, policy_version 77460 (0.0006) [2023-03-06 16:12:18,577][04272] Updated weights for policy 0, policy_version 77470 (0.0006) [2023-03-06 16:12:18,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12624.7). Total num frames: 79333376. Throughput: 0: 12627.8. Samples: 79325509. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:12:18,941][03942] Avg episode reward: [(0, '1165.493')] [2023-03-06 16:12:19,383][04272] Updated weights for policy 0, policy_version 77480 (0.0006) [2023-03-06 16:12:20,186][04272] Updated weights for policy 0, policy_version 77490 (0.0007) [2023-03-06 16:12:20,997][04272] Updated weights for policy 0, policy_version 77500 (0.0006) [2023-03-06 16:12:21,818][04272] Updated weights for policy 0, policy_version 77510 (0.0006) [2023-03-06 16:12:22,629][04272] Updated weights for policy 0, policy_version 77520 (0.0007) [2023-03-06 16:12:23,427][04272] Updated weights for policy 0, policy_version 77530 (0.0006) [2023-03-06 16:12:23,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12628.2). Total num frames: 79396864. Throughput: 0: 12637.8. Samples: 79363512. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:12:23,941][03942] Avg episode reward: [(0, '1210.441')] [2023-03-06 16:12:24,248][04272] Updated weights for policy 0, policy_version 77540 (0.0007) [2023-03-06 16:12:25,069][04272] Updated weights for policy 0, policy_version 77550 (0.0007) [2023-03-06 16:12:25,882][04272] Updated weights for policy 0, policy_version 77560 (0.0007) [2023-03-06 16:12:26,678][04272] Updated weights for policy 0, policy_version 77570 (0.0006) [2023-03-06 16:12:27,477][04272] Updated weights for policy 0, policy_version 77580 (0.0006) [2023-03-06 16:12:28,295][04272] Updated weights for policy 0, policy_version 77590 (0.0007) [2023-03-06 16:12:28,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12628.2). Total num frames: 79460352. Throughput: 0: 12625.3. Samples: 79439171. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:12:28,951][03942] Avg episode reward: [(0, '1213.924')] [2023-03-06 16:12:29,091][04272] Updated weights for policy 0, policy_version 77600 (0.0006) [2023-03-06 16:12:29,905][04272] Updated weights for policy 0, policy_version 77610 (0.0006) [2023-03-06 16:12:30,741][04272] Updated weights for policy 0, policy_version 77620 (0.0006) [2023-03-06 16:12:31,548][04272] Updated weights for policy 0, policy_version 77630 (0.0006) [2023-03-06 16:12:32,361][04272] Updated weights for policy 0, policy_version 77640 (0.0006) [2023-03-06 16:12:33,149][04272] Updated weights for policy 0, policy_version 77650 (0.0006) [2023-03-06 16:12:33,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12624.7). Total num frames: 79522816. Throughput: 0: 12631.4. Samples: 79515059. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:12:33,951][03942] Avg episode reward: [(0, '1261.910')] [2023-03-06 16:12:33,965][04272] Updated weights for policy 0, policy_version 77660 (0.0007) [2023-03-06 16:12:34,764][04272] Updated weights for policy 0, policy_version 77670 (0.0006) [2023-03-06 16:12:35,584][04272] Updated weights for policy 0, policy_version 77680 (0.0007) [2023-03-06 16:12:36,394][04272] Updated weights for policy 0, policy_version 77690 (0.0007) [2023-03-06 16:12:37,182][04272] Updated weights for policy 0, policy_version 77700 (0.0006) [2023-03-06 16:12:38,010][04272] Updated weights for policy 0, policy_version 77710 (0.0006) [2023-03-06 16:12:38,827][04272] Updated weights for policy 0, policy_version 77720 (0.0006) [2023-03-06 16:12:38,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12628.2). Total num frames: 79586304. Throughput: 0: 12634.9. Samples: 79553246. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:12:38,951][03942] Avg episode reward: [(0, '1260.654')] [2023-03-06 16:12:39,624][04272] Updated weights for policy 0, policy_version 77730 (0.0006) [2023-03-06 16:12:40,435][04272] Updated weights for policy 0, policy_version 77740 (0.0006) [2023-03-06 16:12:41,238][04272] Updated weights for policy 0, policy_version 77750 (0.0006) [2023-03-06 16:12:42,044][04272] Updated weights for policy 0, policy_version 77760 (0.0007) [2023-03-06 16:12:42,838][04272] Updated weights for policy 0, policy_version 77770 (0.0007) [2023-03-06 16:12:43,665][04272] Updated weights for policy 0, policy_version 77780 (0.0007) [2023-03-06 16:12:43,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12646.4, 300 sec: 12628.2). Total num frames: 79649792. Throughput: 0: 12644.1. Samples: 79629292. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:12:43,952][03942] Avg episode reward: [(0, '1203.483')] [2023-03-06 16:12:44,461][04272] Updated weights for policy 0, policy_version 77790 (0.0006) [2023-03-06 16:12:45,271][04272] Updated weights for policy 0, policy_version 77800 (0.0006) [2023-03-06 16:12:46,103][04272] Updated weights for policy 0, policy_version 77810 (0.0006) [2023-03-06 16:12:46,893][04272] Updated weights for policy 0, policy_version 77820 (0.0007) [2023-03-06 16:12:47,706][04272] Updated weights for policy 0, policy_version 77830 (0.0007) [2023-03-06 16:12:48,500][04272] Updated weights for policy 0, policy_version 77840 (0.0007) [2023-03-06 16:12:48,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12646.4, 300 sec: 12628.2). Total num frames: 79713280. Throughput: 0: 12656.0. Samples: 79705218. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:12:48,951][03942] Avg episode reward: [(0, '1150.108')] [2023-03-06 16:12:49,309][04272] Updated weights for policy 0, policy_version 77850 (0.0007) [2023-03-06 16:12:50,114][04272] Updated weights for policy 0, policy_version 77860 (0.0006) [2023-03-06 16:12:50,950][04272] Updated weights for policy 0, policy_version 77870 (0.0006) [2023-03-06 16:12:51,752][04272] Updated weights for policy 0, policy_version 77880 (0.0006) [2023-03-06 16:12:52,578][04272] Updated weights for policy 0, policy_version 77890 (0.0006) [2023-03-06 16:12:53,409][04272] Updated weights for policy 0, policy_version 77900 (0.0006) [2023-03-06 16:12:53,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12624.7). Total num frames: 79775744. Throughput: 0: 12654.5. Samples: 79743067. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:12:53,952][03942] Avg episode reward: [(0, '1206.904')] [2023-03-06 16:12:54,213][04272] Updated weights for policy 0, policy_version 77910 (0.0006) [2023-03-06 16:12:55,012][04272] Updated weights for policy 0, policy_version 77920 (0.0006) [2023-03-06 16:12:55,826][04272] Updated weights for policy 0, policy_version 77930 (0.0006) [2023-03-06 16:12:56,636][04272] Updated weights for policy 0, policy_version 77940 (0.0006) [2023-03-06 16:12:57,429][04272] Updated weights for policy 0, policy_version 77950 (0.0006) [2023-03-06 16:12:58,233][04272] Updated weights for policy 0, policy_version 77960 (0.0006) [2023-03-06 16:12:58,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12646.4, 300 sec: 12628.2). Total num frames: 79839232. Throughput: 0: 12652.9. Samples: 79818785. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:12:58,952][03942] Avg episode reward: [(0, '1262.470')] [2023-03-06 16:12:59,050][04272] Updated weights for policy 0, policy_version 77970 (0.0006) [2023-03-06 16:12:59,871][04272] Updated weights for policy 0, policy_version 77980 (0.0007) [2023-03-06 16:13:00,675][04272] Updated weights for policy 0, policy_version 77990 (0.0006) [2023-03-06 16:13:01,497][04272] Updated weights for policy 0, policy_version 78000 (0.0006) [2023-03-06 16:13:02,295][04272] Updated weights for policy 0, policy_version 78010 (0.0007) [2023-03-06 16:13:03,102][04272] Updated weights for policy 0, policy_version 78020 (0.0007) [2023-03-06 16:13:03,914][04272] Updated weights for policy 0, policy_version 78030 (0.0006) [2023-03-06 16:13:03,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12663.5, 300 sec: 12631.6). Total num frames: 79902720. Throughput: 0: 12647.3. Samples: 79894637. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:13:03,952][03942] Avg episode reward: [(0, '1201.568')] [2023-03-06 16:13:04,738][04272] Updated weights for policy 0, policy_version 78040 (0.0006) [2023-03-06 16:13:05,540][04272] Updated weights for policy 0, policy_version 78050 (0.0006) [2023-03-06 16:13:06,349][04272] Updated weights for policy 0, policy_version 78060 (0.0006) [2023-03-06 16:13:07,180][04272] Updated weights for policy 0, policy_version 78070 (0.0006) [2023-03-06 16:13:07,972][04272] Updated weights for policy 0, policy_version 78080 (0.0006) [2023-03-06 16:13:08,794][04272] Updated weights for policy 0, policy_version 78090 (0.0006) [2023-03-06 16:13:08,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12646.4, 300 sec: 12628.2). Total num frames: 79965184. Throughput: 0: 12642.9. Samples: 79932442. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:13:08,952][03942] Avg episode reward: [(0, '1133.945')] [2023-03-06 16:13:08,955][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000078092_79966208.pth... [2023-03-06 16:13:08,985][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000075131_76934144.pth [2023-03-06 16:13:09,610][04272] Updated weights for policy 0, policy_version 78100 (0.0005) [2023-03-06 16:13:10,418][04272] Updated weights for policy 0, policy_version 78110 (0.0006) [2023-03-06 16:13:11,224][04272] Updated weights for policy 0, policy_version 78120 (0.0006) [2023-03-06 16:13:12,034][04272] Updated weights for policy 0, policy_version 78130 (0.0006) [2023-03-06 16:13:12,848][04272] Updated weights for policy 0, policy_version 78140 (0.0006) [2023-03-06 16:13:13,656][04272] Updated weights for policy 0, policy_version 78150 (0.0006) [2023-03-06 16:13:13,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12646.4, 300 sec: 12628.2). Total num frames: 80028672. Throughput: 0: 12646.2. Samples: 80008251. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-06 16:13:13,941][03942] Avg episode reward: [(0, '1042.608')] [2023-03-06 16:13:14,471][04272] Updated weights for policy 0, policy_version 78160 (0.0006) [2023-03-06 16:13:15,278][04272] Updated weights for policy 0, policy_version 78170 (0.0006) [2023-03-06 16:13:16,107][04272] Updated weights for policy 0, policy_version 78180 (0.0007) [2023-03-06 16:13:16,917][04272] Updated weights for policy 0, policy_version 78190 (0.0006) [2023-03-06 16:13:17,727][04272] Updated weights for policy 0, policy_version 78200 (0.0006) [2023-03-06 16:13:18,529][04272] Updated weights for policy 0, policy_version 78210 (0.0007) [2023-03-06 16:13:18,941][03942] Fps is (10 sec: 12697.7, 60 sec: 12646.4, 300 sec: 12631.6). Total num frames: 80092160. Throughput: 0: 12641.3. Samples: 80083919. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-06 16:13:18,941][03942] Avg episode reward: [(0, '1231.498')] [2023-03-06 16:13:19,339][04272] Updated weights for policy 0, policy_version 78220 (0.0006) [2023-03-06 16:13:20,143][04272] Updated weights for policy 0, policy_version 78230 (0.0007) [2023-03-06 16:13:20,961][04272] Updated weights for policy 0, policy_version 78240 (0.0006) [2023-03-06 16:13:21,768][04272] Updated weights for policy 0, policy_version 78250 (0.0006) [2023-03-06 16:13:22,565][04272] Updated weights for policy 0, policy_version 78260 (0.0006) [2023-03-06 16:13:23,401][04272] Updated weights for policy 0, policy_version 78270 (0.0006) [2023-03-06 16:13:23,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12628.2). Total num frames: 80154624. Throughput: 0: 12636.9. Samples: 80121906. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-06 16:13:23,941][03942] Avg episode reward: [(0, '1261.150')] [2023-03-06 16:13:24,223][04272] Updated weights for policy 0, policy_version 78280 (0.0007) [2023-03-06 16:13:25,034][04272] Updated weights for policy 0, policy_version 78290 (0.0006) [2023-03-06 16:13:25,833][04272] Updated weights for policy 0, policy_version 78300 (0.0006) [2023-03-06 16:13:26,626][04272] Updated weights for policy 0, policy_version 78310 (0.0006) [2023-03-06 16:13:27,446][04272] Updated weights for policy 0, policy_version 78320 (0.0007) [2023-03-06 16:13:28,268][04272] Updated weights for policy 0, policy_version 78330 (0.0007) [2023-03-06 16:13:28,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12629.3, 300 sec: 12631.6). Total num frames: 80218112. Throughput: 0: 12627.7. Samples: 80197538. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-06 16:13:28,941][03942] Avg episode reward: [(0, '1167.041')] [2023-03-06 16:13:29,086][04272] Updated weights for policy 0, policy_version 78340 (0.0007) [2023-03-06 16:13:29,882][04272] Updated weights for policy 0, policy_version 78350 (0.0006) [2023-03-06 16:13:30,683][04272] Updated weights for policy 0, policy_version 78360 (0.0006) [2023-03-06 16:13:31,495][04272] Updated weights for policy 0, policy_version 78370 (0.0006) [2023-03-06 16:13:32,310][04272] Updated weights for policy 0, policy_version 78380 (0.0006) [2023-03-06 16:13:33,149][04272] Updated weights for policy 0, policy_version 78390 (0.0006) [2023-03-06 16:13:33,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12646.4, 300 sec: 12631.6). Total num frames: 80281600. Throughput: 0: 12623.7. Samples: 80273284. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-06 16:13:33,941][03942] Avg episode reward: [(0, '1185.526')] [2023-03-06 16:13:33,941][04272] Updated weights for policy 0, policy_version 78400 (0.0006) [2023-03-06 16:13:34,752][04272] Updated weights for policy 0, policy_version 78410 (0.0006) [2023-03-06 16:13:35,556][04272] Updated weights for policy 0, policy_version 78420 (0.0007) [2023-03-06 16:13:36,377][04272] Updated weights for policy 0, policy_version 78430 (0.0007) [2023-03-06 16:13:37,186][04272] Updated weights for policy 0, policy_version 78440 (0.0006) [2023-03-06 16:13:37,969][04272] Updated weights for policy 0, policy_version 78450 (0.0007) [2023-03-06 16:13:38,783][04272] Updated weights for policy 0, policy_version 78460 (0.0007) [2023-03-06 16:13:38,940][03942] Fps is (10 sec: 12697.8, 60 sec: 12646.4, 300 sec: 12631.6). Total num frames: 80345088. Throughput: 0: 12627.6. Samples: 80311310. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-06 16:13:38,941][03942] Avg episode reward: [(0, '1285.578')] [2023-03-06 16:13:39,583][04272] Updated weights for policy 0, policy_version 78470 (0.0006) [2023-03-06 16:13:40,384][04272] Updated weights for policy 0, policy_version 78480 (0.0007) [2023-03-06 16:13:41,204][04272] Updated weights for policy 0, policy_version 78490 (0.0006) [2023-03-06 16:13:42,026][04272] Updated weights for policy 0, policy_version 78500 (0.0006) [2023-03-06 16:13:42,815][04272] Updated weights for policy 0, policy_version 78510 (0.0006) [2023-03-06 16:13:43,628][04272] Updated weights for policy 0, policy_version 78520 (0.0006) [2023-03-06 16:13:43,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12631.6). Total num frames: 80407552. Throughput: 0: 12635.9. Samples: 80387399. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-06 16:13:43,941][03942] Avg episode reward: [(0, '1257.313')] [2023-03-06 16:13:44,450][04272] Updated weights for policy 0, policy_version 78530 (0.0006) [2023-03-06 16:13:45,278][04272] Updated weights for policy 0, policy_version 78540 (0.0006) [2023-03-06 16:13:46,076][04272] Updated weights for policy 0, policy_version 78550 (0.0006) [2023-03-06 16:13:46,895][04272] Updated weights for policy 0, policy_version 78560 (0.0006) [2023-03-06 16:13:47,697][04272] Updated weights for policy 0, policy_version 78570 (0.0007) [2023-03-06 16:13:48,518][04272] Updated weights for policy 0, policy_version 78580 (0.0006) [2023-03-06 16:13:48,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12629.3, 300 sec: 12631.7). Total num frames: 80471040. Throughput: 0: 12630.8. Samples: 80463024. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-06 16:13:48,941][03942] Avg episode reward: [(0, '1330.057')] [2023-03-06 16:13:49,338][04272] Updated weights for policy 0, policy_version 78590 (0.0006) [2023-03-06 16:13:50,148][04272] Updated weights for policy 0, policy_version 78600 (0.0008) [2023-03-06 16:13:50,966][04272] Updated weights for policy 0, policy_version 78610 (0.0008) [2023-03-06 16:13:51,772][04272] Updated weights for policy 0, policy_version 78620 (0.0006) [2023-03-06 16:13:52,578][04272] Updated weights for policy 0, policy_version 78630 (0.0006) [2023-03-06 16:13:53,373][04272] Updated weights for policy 0, policy_version 78640 (0.0006) [2023-03-06 16:13:53,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12646.4, 300 sec: 12631.6). Total num frames: 80534528. Throughput: 0: 12629.4. Samples: 80500764. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-06 16:13:53,941][03942] Avg episode reward: [(0, '1141.438')] [2023-03-06 16:13:54,182][04272] Updated weights for policy 0, policy_version 78650 (0.0007) [2023-03-06 16:13:54,995][04272] Updated weights for policy 0, policy_version 78660 (0.0006) [2023-03-06 16:13:55,826][04272] Updated weights for policy 0, policy_version 78670 (0.0007) [2023-03-06 16:13:56,618][04272] Updated weights for policy 0, policy_version 78680 (0.0006) [2023-03-06 16:13:57,431][04272] Updated weights for policy 0, policy_version 78690 (0.0006) [2023-03-06 16:13:58,246][04272] Updated weights for policy 0, policy_version 78700 (0.0006) [2023-03-06 16:13:58,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12629.3, 300 sec: 12631.6). Total num frames: 80596992. Throughput: 0: 12630.1. Samples: 80576604. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-06 16:13:58,941][03942] Avg episode reward: [(0, '933.457')] [2023-03-06 16:13:59,040][04272] Updated weights for policy 0, policy_version 78710 (0.0006) [2023-03-06 16:13:59,871][04272] Updated weights for policy 0, policy_version 78720 (0.0007) [2023-03-06 16:14:00,691][04272] Updated weights for policy 0, policy_version 78730 (0.0006) [2023-03-06 16:14:01,486][04272] Updated weights for policy 0, policy_version 78740 (0.0006) [2023-03-06 16:14:02,307][04272] Updated weights for policy 0, policy_version 78750 (0.0006) [2023-03-06 16:14:03,119][04272] Updated weights for policy 0, policy_version 78760 (0.0006) [2023-03-06 16:14:03,898][04272] Updated weights for policy 0, policy_version 78770 (0.0007) [2023-03-06 16:14:03,941][03942] Fps is (10 sec: 12595.0, 60 sec: 12629.3, 300 sec: 12631.6). Total num frames: 80660480. Throughput: 0: 12634.0. Samples: 80652448. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:14:03,941][03942] Avg episode reward: [(0, '956.527')] [2023-03-06 16:14:04,718][04272] Updated weights for policy 0, policy_version 78780 (0.0006) [2023-03-06 16:14:05,524][04272] Updated weights for policy 0, policy_version 78790 (0.0007) [2023-03-06 16:14:06,333][04272] Updated weights for policy 0, policy_version 78800 (0.0006) [2023-03-06 16:14:07,145][04272] Updated weights for policy 0, policy_version 78810 (0.0006) [2023-03-06 16:14:07,946][04272] Updated weights for policy 0, policy_version 78820 (0.0007) [2023-03-06 16:14:08,759][04272] Updated weights for policy 0, policy_version 78830 (0.0006) [2023-03-06 16:14:08,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12646.4, 300 sec: 12635.1). Total num frames: 80723968. Throughput: 0: 12639.8. Samples: 80690696. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:14:08,941][03942] Avg episode reward: [(0, '871.142')] [2023-03-06 16:14:09,585][04272] Updated weights for policy 0, policy_version 78840 (0.0006) [2023-03-06 16:14:10,384][04272] Updated weights for policy 0, policy_version 78850 (0.0007) [2023-03-06 16:14:11,206][04272] Updated weights for policy 0, policy_version 78860 (0.0006) [2023-03-06 16:14:12,014][04272] Updated weights for policy 0, policy_version 78870 (0.0006) [2023-03-06 16:14:12,815][04272] Updated weights for policy 0, policy_version 78880 (0.0006) [2023-03-06 16:14:13,627][04272] Updated weights for policy 0, policy_version 78890 (0.0006) [2023-03-06 16:14:13,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12629.3, 300 sec: 12631.6). Total num frames: 80786432. Throughput: 0: 12639.3. Samples: 80766305. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:14:13,941][03942] Avg episode reward: [(0, '1059.212')] [2023-03-06 16:14:14,433][04272] Updated weights for policy 0, policy_version 78900 (0.0006) [2023-03-06 16:14:15,231][04272] Updated weights for policy 0, policy_version 78910 (0.0006) [2023-03-06 16:14:16,051][04272] Updated weights for policy 0, policy_version 78920 (0.0005) [2023-03-06 16:14:16,843][04272] Updated weights for policy 0, policy_version 78930 (0.0006) [2023-03-06 16:14:17,651][04272] Updated weights for policy 0, policy_version 78940 (0.0007) [2023-03-06 16:14:18,454][04272] Updated weights for policy 0, policy_version 78950 (0.0007) [2023-03-06 16:14:18,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12631.6). Total num frames: 80849920. Throughput: 0: 12651.1. Samples: 80842585. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:14:18,941][03942] Avg episode reward: [(0, '1141.598')] [2023-03-06 16:14:19,280][04272] Updated weights for policy 0, policy_version 78960 (0.0006) [2023-03-06 16:14:20,082][04272] Updated weights for policy 0, policy_version 78970 (0.0006) [2023-03-06 16:14:20,883][04272] Updated weights for policy 0, policy_version 78980 (0.0006) [2023-03-06 16:14:21,708][04272] Updated weights for policy 0, policy_version 78990 (0.0007) [2023-03-06 16:14:22,523][04272] Updated weights for policy 0, policy_version 79000 (0.0006) [2023-03-06 16:14:23,345][04272] Updated weights for policy 0, policy_version 79010 (0.0005) [2023-03-06 16:14:23,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12646.4, 300 sec: 12635.1). Total num frames: 80913408. Throughput: 0: 12646.2. Samples: 80880388. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:14:23,941][03942] Avg episode reward: [(0, '1175.830')] [2023-03-06 16:14:24,148][04272] Updated weights for policy 0, policy_version 79020 (0.0006) [2023-03-06 16:14:24,973][04272] Updated weights for policy 0, policy_version 79030 (0.0006) [2023-03-06 16:14:25,772][04272] Updated weights for policy 0, policy_version 79040 (0.0006) [2023-03-06 16:14:26,573][04272] Updated weights for policy 0, policy_version 79050 (0.0007) [2023-03-06 16:14:27,366][04272] Updated weights for policy 0, policy_version 79060 (0.0006) [2023-03-06 16:14:28,207][04272] Updated weights for policy 0, policy_version 79070 (0.0007) [2023-03-06 16:14:28,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12646.4, 300 sec: 12635.1). Total num frames: 80976896. Throughput: 0: 12638.7. Samples: 80956139. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:14:28,941][03942] Avg episode reward: [(0, '1174.337')] [2023-03-06 16:14:29,001][04272] Updated weights for policy 0, policy_version 79080 (0.0007) [2023-03-06 16:14:29,814][04272] Updated weights for policy 0, policy_version 79090 (0.0006) [2023-03-06 16:14:30,629][04272] Updated weights for policy 0, policy_version 79100 (0.0006) [2023-03-06 16:14:31,438][04272] Updated weights for policy 0, policy_version 79110 (0.0008) [2023-03-06 16:14:32,265][04272] Updated weights for policy 0, policy_version 79120 (0.0006) [2023-03-06 16:14:33,069][04272] Updated weights for policy 0, policy_version 79130 (0.0007) [2023-03-06 16:14:33,888][04272] Updated weights for policy 0, policy_version 79140 (0.0007) [2023-03-06 16:14:33,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12629.3, 300 sec: 12635.1). Total num frames: 81039360. Throughput: 0: 12640.4. Samples: 81031841. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:14:33,952][03942] Avg episode reward: [(0, '1114.642')] [2023-03-06 16:14:34,697][04272] Updated weights for policy 0, policy_version 79150 (0.0007) [2023-03-06 16:14:35,489][04272] Updated weights for policy 0, policy_version 79160 (0.0006) [2023-03-06 16:14:36,298][04272] Updated weights for policy 0, policy_version 79170 (0.0006) [2023-03-06 16:14:37,106][04272] Updated weights for policy 0, policy_version 79180 (0.0006) [2023-03-06 16:14:37,910][04272] Updated weights for policy 0, policy_version 79190 (0.0007) [2023-03-06 16:14:38,738][04272] Updated weights for policy 0, policy_version 79200 (0.0006) [2023-03-06 16:14:38,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12635.1). Total num frames: 81102848. Throughput: 0: 12646.4. Samples: 81069854. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:14:38,952][03942] Avg episode reward: [(0, '1230.575')] [2023-03-06 16:14:39,541][04272] Updated weights for policy 0, policy_version 79210 (0.0006) [2023-03-06 16:14:40,349][04272] Updated weights for policy 0, policy_version 79220 (0.0006) [2023-03-06 16:14:41,173][04272] Updated weights for policy 0, policy_version 79230 (0.0006) [2023-03-06 16:14:41,980][04272] Updated weights for policy 0, policy_version 79240 (0.0006) [2023-03-06 16:14:42,790][04272] Updated weights for policy 0, policy_version 79250 (0.0007) [2023-03-06 16:14:43,579][04272] Updated weights for policy 0, policy_version 79260 (0.0007) [2023-03-06 16:14:43,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12646.4, 300 sec: 12635.1). Total num frames: 81166336. Throughput: 0: 12643.8. Samples: 81145576. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:14:43,952][03942] Avg episode reward: [(0, '1177.060')] [2023-03-06 16:14:44,412][04272] Updated weights for policy 0, policy_version 79270 (0.0006) [2023-03-06 16:14:45,219][04272] Updated weights for policy 0, policy_version 79280 (0.0007) [2023-03-06 16:14:46,025][04272] Updated weights for policy 0, policy_version 79290 (0.0006) [2023-03-06 16:14:46,840][04272] Updated weights for policy 0, policy_version 79300 (0.0007) [2023-03-06 16:14:47,658][04272] Updated weights for policy 0, policy_version 79310 (0.0006) [2023-03-06 16:14:48,461][04272] Updated weights for policy 0, policy_version 79320 (0.0007) [2023-03-06 16:14:48,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12631.6). Total num frames: 81228800. Throughput: 0: 12642.0. Samples: 81221339. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:14:48,951][03942] Avg episode reward: [(0, '1255.617')] [2023-03-06 16:14:49,266][04272] Updated weights for policy 0, policy_version 79330 (0.0006) [2023-03-06 16:14:50,098][04272] Updated weights for policy 0, policy_version 79340 (0.0007) [2023-03-06 16:14:50,894][04272] Updated weights for policy 0, policy_version 79350 (0.0006) [2023-03-06 16:14:51,697][04272] Updated weights for policy 0, policy_version 79360 (0.0006) [2023-03-06 16:14:52,522][04272] Updated weights for policy 0, policy_version 79370 (0.0007) [2023-03-06 16:14:53,329][04272] Updated weights for policy 0, policy_version 79380 (0.0006) [2023-03-06 16:14:53,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12629.3, 300 sec: 12635.1). Total num frames: 81292288. Throughput: 0: 12634.7. Samples: 81259258. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:14:53,941][03942] Avg episode reward: [(0, '1242.275')] [2023-03-06 16:14:54,149][04272] Updated weights for policy 0, policy_version 79390 (0.0006) [2023-03-06 16:14:54,945][04272] Updated weights for policy 0, policy_version 79400 (0.0006) [2023-03-06 16:14:55,747][04272] Updated weights for policy 0, policy_version 79410 (0.0007) [2023-03-06 16:14:56,548][04272] Updated weights for policy 0, policy_version 79420 (0.0007) [2023-03-06 16:14:57,381][04272] Updated weights for policy 0, policy_version 79430 (0.0005) [2023-03-06 16:14:58,193][04272] Updated weights for policy 0, policy_version 79440 (0.0006) [2023-03-06 16:14:58,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12646.4, 300 sec: 12635.1). Total num frames: 81355776. Throughput: 0: 12640.2. Samples: 81335114. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:14:58,941][03942] Avg episode reward: [(0, '1257.008')] [2023-03-06 16:14:58,995][04272] Updated weights for policy 0, policy_version 79450 (0.0006) [2023-03-06 16:14:59,804][04272] Updated weights for policy 0, policy_version 79460 (0.0007) [2023-03-06 16:15:00,633][04272] Updated weights for policy 0, policy_version 79470 (0.0007) [2023-03-06 16:15:01,450][04272] Updated weights for policy 0, policy_version 79480 (0.0006) [2023-03-06 16:15:02,270][04272] Updated weights for policy 0, policy_version 79490 (0.0006) [2023-03-06 16:15:03,078][04272] Updated weights for policy 0, policy_version 79500 (0.0007) [2023-03-06 16:15:03,893][04272] Updated weights for policy 0, policy_version 79510 (0.0006) [2023-03-06 16:15:03,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12629.4, 300 sec: 12631.6). Total num frames: 81418240. Throughput: 0: 12620.5. Samples: 81410506. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:15:03,941][03942] Avg episode reward: [(0, '1249.107')] [2023-03-06 16:15:04,697][04272] Updated weights for policy 0, policy_version 79520 (0.0006) [2023-03-06 16:15:05,507][04272] Updated weights for policy 0, policy_version 79530 (0.0006) [2023-03-06 16:15:06,315][04272] Updated weights for policy 0, policy_version 79540 (0.0007) [2023-03-06 16:15:07,143][04272] Updated weights for policy 0, policy_version 79550 (0.0006) [2023-03-06 16:15:07,949][04272] Updated weights for policy 0, policy_version 79560 (0.0006) [2023-03-06 16:15:08,767][04272] Updated weights for policy 0, policy_version 79570 (0.0006) [2023-03-06 16:15:08,941][03942] Fps is (10 sec: 12595.0, 60 sec: 12629.3, 300 sec: 12635.1). Total num frames: 81481728. Throughput: 0: 12622.1. Samples: 81448383. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:15:08,941][03942] Avg episode reward: [(0, '1297.053')] [2023-03-06 16:15:08,945][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000079572_81481728.pth... [2023-03-06 16:15:08,976][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000076611_78449664.pth [2023-03-06 16:15:09,582][04272] Updated weights for policy 0, policy_version 79580 (0.0007) [2023-03-06 16:15:10,399][04272] Updated weights for policy 0, policy_version 79590 (0.0006) [2023-03-06 16:15:11,195][04272] Updated weights for policy 0, policy_version 79600 (0.0006) [2023-03-06 16:15:12,025][04272] Updated weights for policy 0, policy_version 79610 (0.0007) [2023-03-06 16:15:12,831][04272] Updated weights for policy 0, policy_version 79620 (0.0007) [2023-03-06 16:15:13,646][04272] Updated weights for policy 0, policy_version 79630 (0.0006) [2023-03-06 16:15:13,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12631.6). Total num frames: 81544192. Throughput: 0: 12619.8. Samples: 81524032. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:15:13,941][03942] Avg episode reward: [(0, '1235.366')] [2023-03-06 16:15:14,434][04272] Updated weights for policy 0, policy_version 79640 (0.0006) [2023-03-06 16:15:15,257][04272] Updated weights for policy 0, policy_version 79650 (0.0006) [2023-03-06 16:15:16,076][04272] Updated weights for policy 0, policy_version 79660 (0.0007) [2023-03-06 16:15:16,878][04272] Updated weights for policy 0, policy_version 79670 (0.0006) [2023-03-06 16:15:17,691][04272] Updated weights for policy 0, policy_version 79680 (0.0006) [2023-03-06 16:15:18,504][04272] Updated weights for policy 0, policy_version 79690 (0.0006) [2023-03-06 16:15:18,940][03942] Fps is (10 sec: 12595.4, 60 sec: 12629.3, 300 sec: 12631.7). Total num frames: 81607680. Throughput: 0: 12619.5. Samples: 81599718. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:15:18,941][03942] Avg episode reward: [(0, '1269.214')] [2023-03-06 16:15:19,316][04272] Updated weights for policy 0, policy_version 79700 (0.0006) [2023-03-06 16:15:20,121][04272] Updated weights for policy 0, policy_version 79710 (0.0007) [2023-03-06 16:15:20,933][04272] Updated weights for policy 0, policy_version 79720 (0.0006) [2023-03-06 16:15:21,761][04272] Updated weights for policy 0, policy_version 79730 (0.0007) [2023-03-06 16:15:22,567][04272] Updated weights for policy 0, policy_version 79740 (0.0007) [2023-03-06 16:15:23,378][04272] Updated weights for policy 0, policy_version 79750 (0.0006) [2023-03-06 16:15:23,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12629.3, 300 sec: 12631.6). Total num frames: 81671168. Throughput: 0: 12615.6. Samples: 81637557. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:15:23,941][03942] Avg episode reward: [(0, '1236.085')] [2023-03-06 16:15:24,183][04272] Updated weights for policy 0, policy_version 79760 (0.0006) [2023-03-06 16:15:24,981][04272] Updated weights for policy 0, policy_version 79770 (0.0006) [2023-03-06 16:15:25,798][04272] Updated weights for policy 0, policy_version 79780 (0.0006) [2023-03-06 16:15:26,597][04272] Updated weights for policy 0, policy_version 79790 (0.0006) [2023-03-06 16:15:27,402][04272] Updated weights for policy 0, policy_version 79800 (0.0006) [2023-03-06 16:15:28,213][04272] Updated weights for policy 0, policy_version 79810 (0.0006) [2023-03-06 16:15:28,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12631.7). Total num frames: 81733632. Throughput: 0: 12623.7. Samples: 81713642. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:15:28,941][03942] Avg episode reward: [(0, '1166.863')] [2023-03-06 16:15:29,028][04272] Updated weights for policy 0, policy_version 79820 (0.0006) [2023-03-06 16:15:29,857][04272] Updated weights for policy 0, policy_version 79830 (0.0006) [2023-03-06 16:15:30,657][04272] Updated weights for policy 0, policy_version 79840 (0.0006) [2023-03-06 16:15:31,473][04272] Updated weights for policy 0, policy_version 79850 (0.0006) [2023-03-06 16:15:32,272][04272] Updated weights for policy 0, policy_version 79860 (0.0005) [2023-03-06 16:15:33,091][04272] Updated weights for policy 0, policy_version 79870 (0.0006) [2023-03-06 16:15:33,901][04272] Updated weights for policy 0, policy_version 79880 (0.0007) [2023-03-06 16:15:33,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12629.4, 300 sec: 12631.6). Total num frames: 81797120. Throughput: 0: 12617.3. Samples: 81789118. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:15:33,941][03942] Avg episode reward: [(0, '1300.597')] [2023-03-06 16:15:34,714][04272] Updated weights for policy 0, policy_version 79890 (0.0007) [2023-03-06 16:15:35,530][04272] Updated weights for policy 0, policy_version 79900 (0.0007) [2023-03-06 16:15:36,346][04272] Updated weights for policy 0, policy_version 79910 (0.0006) [2023-03-06 16:15:37,159][04272] Updated weights for policy 0, policy_version 79920 (0.0006) [2023-03-06 16:15:37,968][04272] Updated weights for policy 0, policy_version 79930 (0.0006) [2023-03-06 16:15:38,805][04272] Updated weights for policy 0, policy_version 79940 (0.0006) [2023-03-06 16:15:38,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12628.2). Total num frames: 81859584. Throughput: 0: 12616.1. Samples: 81826982. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:15:38,941][03942] Avg episode reward: [(0, '1265.370')] [2023-03-06 16:15:39,593][04272] Updated weights for policy 0, policy_version 79950 (0.0006) [2023-03-06 16:15:40,420][04272] Updated weights for policy 0, policy_version 79960 (0.0006) [2023-03-06 16:15:41,236][04272] Updated weights for policy 0, policy_version 79970 (0.0006) [2023-03-06 16:15:42,037][04272] Updated weights for policy 0, policy_version 79980 (0.0006) [2023-03-06 16:15:42,836][04272] Updated weights for policy 0, policy_version 79990 (0.0006) [2023-03-06 16:15:43,657][04272] Updated weights for policy 0, policy_version 80000 (0.0007) [2023-03-06 16:15:43,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12628.2). Total num frames: 81923072. Throughput: 0: 12613.8. Samples: 81902736. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:15:43,941][03942] Avg episode reward: [(0, '1143.748')] [2023-03-06 16:15:44,466][04272] Updated weights for policy 0, policy_version 80010 (0.0007) [2023-03-06 16:15:45,265][04272] Updated weights for policy 0, policy_version 80020 (0.0006) [2023-03-06 16:15:46,078][04272] Updated weights for policy 0, policy_version 80030 (0.0006) [2023-03-06 16:15:46,892][04272] Updated weights for policy 0, policy_version 80040 (0.0007) [2023-03-06 16:15:47,704][04272] Updated weights for policy 0, policy_version 80050 (0.0006) [2023-03-06 16:15:48,512][04272] Updated weights for policy 0, policy_version 80060 (0.0006) [2023-03-06 16:15:48,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12631.6). Total num frames: 81986560. Throughput: 0: 12624.9. Samples: 81978628. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:15:48,941][03942] Avg episode reward: [(0, '1260.118')] [2023-03-06 16:15:49,338][04272] Updated weights for policy 0, policy_version 80070 (0.0007) [2023-03-06 16:15:50,141][04272] Updated weights for policy 0, policy_version 80080 (0.0006) [2023-03-06 16:15:50,937][04272] Updated weights for policy 0, policy_version 80090 (0.0006) [2023-03-06 16:15:51,745][04272] Updated weights for policy 0, policy_version 80100 (0.0006) [2023-03-06 16:15:52,555][04272] Updated weights for policy 0, policy_version 80110 (0.0006) [2023-03-06 16:15:53,377][04272] Updated weights for policy 0, policy_version 80120 (0.0006) [2023-03-06 16:15:53,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12635.1). Total num frames: 82050048. Throughput: 0: 12623.4. Samples: 82016433. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:15:53,941][03942] Avg episode reward: [(0, '1308.264')] [2023-03-06 16:15:54,184][04272] Updated weights for policy 0, policy_version 80130 (0.0006) [2023-03-06 16:15:54,990][04272] Updated weights for policy 0, policy_version 80140 (0.0006) [2023-03-06 16:15:55,797][04272] Updated weights for policy 0, policy_version 80150 (0.0006) [2023-03-06 16:15:56,629][04272] Updated weights for policy 0, policy_version 80160 (0.0007) [2023-03-06 16:15:57,432][04272] Updated weights for policy 0, policy_version 80170 (0.0006) [2023-03-06 16:15:58,246][04272] Updated weights for policy 0, policy_version 80180 (0.0006) [2023-03-06 16:15:58,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12631.7). Total num frames: 82112512. Throughput: 0: 12625.4. Samples: 82092174. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:15:58,941][03942] Avg episode reward: [(0, '1324.025')] [2023-03-06 16:15:59,053][04272] Updated weights for policy 0, policy_version 80190 (0.0007) [2023-03-06 16:15:59,870][04272] Updated weights for policy 0, policy_version 80200 (0.0006) [2023-03-06 16:16:00,681][04272] Updated weights for policy 0, policy_version 80210 (0.0006) [2023-03-06 16:16:01,497][04272] Updated weights for policy 0, policy_version 80220 (0.0008) [2023-03-06 16:16:02,299][04272] Updated weights for policy 0, policy_version 80230 (0.0007) [2023-03-06 16:16:03,102][04272] Updated weights for policy 0, policy_version 80240 (0.0006) [2023-03-06 16:16:03,937][04272] Updated weights for policy 0, policy_version 80250 (0.0006) [2023-03-06 16:16:03,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12629.3, 300 sec: 12631.6). Total num frames: 82176000. Throughput: 0: 12628.5. Samples: 82168000. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:16:03,941][03942] Avg episode reward: [(0, '1224.221')] [2023-03-06 16:16:04,753][04272] Updated weights for policy 0, policy_version 80260 (0.0006) [2023-03-06 16:16:05,557][04272] Updated weights for policy 0, policy_version 80270 (0.0006) [2023-03-06 16:16:06,357][04272] Updated weights for policy 0, policy_version 80280 (0.0006) [2023-03-06 16:16:07,171][04272] Updated weights for policy 0, policy_version 80290 (0.0006) [2023-03-06 16:16:07,963][04272] Updated weights for policy 0, policy_version 80300 (0.0007) [2023-03-06 16:16:08,783][04272] Updated weights for policy 0, policy_version 80310 (0.0006) [2023-03-06 16:16:08,940][03942] Fps is (10 sec: 12697.5, 60 sec: 12629.4, 300 sec: 12631.6). Total num frames: 82239488. Throughput: 0: 12626.2. Samples: 82205733. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:16:08,941][03942] Avg episode reward: [(0, '1311.438')] [2023-03-06 16:16:09,590][04272] Updated weights for policy 0, policy_version 80320 (0.0007) [2023-03-06 16:16:10,402][04272] Updated weights for policy 0, policy_version 80330 (0.0006) [2023-03-06 16:16:11,211][04272] Updated weights for policy 0, policy_version 80340 (0.0006) [2023-03-06 16:16:12,025][04272] Updated weights for policy 0, policy_version 80350 (0.0006) [2023-03-06 16:16:12,807][04272] Updated weights for policy 0, policy_version 80360 (0.0007) [2023-03-06 16:16:13,630][04272] Updated weights for policy 0, policy_version 80370 (0.0006) [2023-03-06 16:16:13,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12631.6). Total num frames: 82301952. Throughput: 0: 12623.8. Samples: 82281715. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:16:13,941][03942] Avg episode reward: [(0, '1282.863')] [2023-03-06 16:16:14,455][04272] Updated weights for policy 0, policy_version 80380 (0.0006) [2023-03-06 16:16:15,255][04272] Updated weights for policy 0, policy_version 80390 (0.0006) [2023-03-06 16:16:16,069][04272] Updated weights for policy 0, policy_version 80400 (0.0006) [2023-03-06 16:16:16,872][04272] Updated weights for policy 0, policy_version 80410 (0.0007) [2023-03-06 16:16:17,670][04272] Updated weights for policy 0, policy_version 80420 (0.0006) [2023-03-06 16:16:18,482][04272] Updated weights for policy 0, policy_version 80430 (0.0006) [2023-03-06 16:16:18,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12629.3, 300 sec: 12631.6). Total num frames: 82365440. Throughput: 0: 12634.1. Samples: 82357652. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:16:18,941][03942] Avg episode reward: [(0, '1353.170')] [2023-03-06 16:16:19,285][04272] Updated weights for policy 0, policy_version 80440 (0.0006) [2023-03-06 16:16:20,090][04272] Updated weights for policy 0, policy_version 80450 (0.0006) [2023-03-06 16:16:20,914][04272] Updated weights for policy 0, policy_version 80460 (0.0006) [2023-03-06 16:16:21,730][04272] Updated weights for policy 0, policy_version 80470 (0.0007) [2023-03-06 16:16:22,551][04272] Updated weights for policy 0, policy_version 80480 (0.0006) [2023-03-06 16:16:23,373][04272] Updated weights for policy 0, policy_version 80490 (0.0006) [2023-03-06 16:16:23,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12631.7). Total num frames: 82428928. Throughput: 0: 12636.1. Samples: 82395605. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:16:23,941][03942] Avg episode reward: [(0, '1167.041')] [2023-03-06 16:16:24,174][04272] Updated weights for policy 0, policy_version 80500 (0.0006) [2023-03-06 16:16:24,981][04272] Updated weights for policy 0, policy_version 80510 (0.0007) [2023-03-06 16:16:25,802][04272] Updated weights for policy 0, policy_version 80520 (0.0006) [2023-03-06 16:16:26,606][04272] Updated weights for policy 0, policy_version 80530 (0.0006) [2023-03-06 16:16:27,420][04272] Updated weights for policy 0, policy_version 80540 (0.0006) [2023-03-06 16:16:28,242][04272] Updated weights for policy 0, policy_version 80550 (0.0006) [2023-03-06 16:16:28,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12646.4, 300 sec: 12635.1). Total num frames: 82492416. Throughput: 0: 12633.5. Samples: 82471243. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:16:28,941][03942] Avg episode reward: [(0, '1229.741')] [2023-03-06 16:16:29,058][04272] Updated weights for policy 0, policy_version 80560 (0.0006) [2023-03-06 16:16:29,861][04272] Updated weights for policy 0, policy_version 80570 (0.0006) [2023-03-06 16:16:30,652][04272] Updated weights for policy 0, policy_version 80580 (0.0006) [2023-03-06 16:16:31,469][04272] Updated weights for policy 0, policy_version 80590 (0.0006) [2023-03-06 16:16:32,282][04272] Updated weights for policy 0, policy_version 80600 (0.0006) [2023-03-06 16:16:33,078][04272] Updated weights for policy 0, policy_version 80610 (0.0007) [2023-03-06 16:16:33,903][04272] Updated weights for policy 0, policy_version 80620 (0.0006) [2023-03-06 16:16:33,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12631.6). Total num frames: 82554880. Throughput: 0: 12632.0. Samples: 82547070. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:16:33,941][03942] Avg episode reward: [(0, '1158.711')] [2023-03-06 16:16:34,708][04272] Updated weights for policy 0, policy_version 80630 (0.0006) [2023-03-06 16:16:35,521][04272] Updated weights for policy 0, policy_version 80640 (0.0006) [2023-03-06 16:16:36,330][04272] Updated weights for policy 0, policy_version 80650 (0.0007) [2023-03-06 16:16:37,130][04272] Updated weights for policy 0, policy_version 80660 (0.0006) [2023-03-06 16:16:37,953][04272] Updated weights for policy 0, policy_version 80670 (0.0007) [2023-03-06 16:16:38,759][04272] Updated weights for policy 0, policy_version 80680 (0.0006) [2023-03-06 16:16:38,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12646.4, 300 sec: 12635.1). Total num frames: 82618368. Throughput: 0: 12633.6. Samples: 82584945. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:16:38,941][03942] Avg episode reward: [(0, '887.410')] [2023-03-06 16:16:39,580][04272] Updated weights for policy 0, policy_version 80690 (0.0007) [2023-03-06 16:16:40,411][04272] Updated weights for policy 0, policy_version 80700 (0.0006) [2023-03-06 16:16:41,212][04272] Updated weights for policy 0, policy_version 80710 (0.0007) [2023-03-06 16:16:42,036][04272] Updated weights for policy 0, policy_version 80720 (0.0007) [2023-03-06 16:16:42,851][04272] Updated weights for policy 0, policy_version 80730 (0.0007) [2023-03-06 16:16:43,673][04272] Updated weights for policy 0, policy_version 80740 (0.0006) [2023-03-06 16:16:43,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12631.6). Total num frames: 82680832. Throughput: 0: 12626.2. Samples: 82660353. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:16:43,941][03942] Avg episode reward: [(0, '1160.740')] [2023-03-06 16:16:44,501][04272] Updated weights for policy 0, policy_version 80750 (0.0006) [2023-03-06 16:16:45,305][04272] Updated weights for policy 0, policy_version 80760 (0.0006) [2023-03-06 16:16:46,089][04272] Updated weights for policy 0, policy_version 80770 (0.0006) [2023-03-06 16:16:46,911][04272] Updated weights for policy 0, policy_version 80780 (0.0006) [2023-03-06 16:16:47,721][04272] Updated weights for policy 0, policy_version 80790 (0.0006) [2023-03-06 16:16:48,535][04272] Updated weights for policy 0, policy_version 80800 (0.0006) [2023-03-06 16:16:48,941][03942] Fps is (10 sec: 12492.8, 60 sec: 12612.2, 300 sec: 12628.2). Total num frames: 82743296. Throughput: 0: 12620.2. Samples: 82735910. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:16:48,941][03942] Avg episode reward: [(0, '1140.422')] [2023-03-06 16:16:49,356][04272] Updated weights for policy 0, policy_version 80810 (0.0006) [2023-03-06 16:16:50,175][04272] Updated weights for policy 0, policy_version 80820 (0.0005) [2023-03-06 16:16:50,981][04272] Updated weights for policy 0, policy_version 80830 (0.0007) [2023-03-06 16:16:51,804][04272] Updated weights for policy 0, policy_version 80840 (0.0006) [2023-03-06 16:16:52,606][04272] Updated weights for policy 0, policy_version 80850 (0.0006) [2023-03-06 16:16:53,430][04272] Updated weights for policy 0, policy_version 80860 (0.0006) [2023-03-06 16:16:53,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12631.7). Total num frames: 82806784. Throughput: 0: 12620.3. Samples: 82773646. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:16:53,941][03942] Avg episode reward: [(0, '1374.510')] [2023-03-06 16:16:54,239][04272] Updated weights for policy 0, policy_version 80870 (0.0007) [2023-03-06 16:16:55,046][04272] Updated weights for policy 0, policy_version 80880 (0.0006) [2023-03-06 16:16:55,867][04272] Updated weights for policy 0, policy_version 80890 (0.0007) [2023-03-06 16:16:56,670][04272] Updated weights for policy 0, policy_version 80900 (0.0006) [2023-03-06 16:16:57,493][04272] Updated weights for policy 0, policy_version 80910 (0.0006) [2023-03-06 16:16:58,309][04272] Updated weights for policy 0, policy_version 80920 (0.0006) [2023-03-06 16:16:58,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.2, 300 sec: 12631.6). Total num frames: 82869248. Throughput: 0: 12606.3. Samples: 82848997. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:16:58,941][03942] Avg episode reward: [(0, '1251.961')] [2023-03-06 16:16:59,126][04272] Updated weights for policy 0, policy_version 80930 (0.0006) [2023-03-06 16:16:59,916][04272] Updated weights for policy 0, policy_version 80940 (0.0006) [2023-03-06 16:17:00,713][04272] Updated weights for policy 0, policy_version 80950 (0.0006) [2023-03-06 16:17:01,560][04272] Updated weights for policy 0, policy_version 80960 (0.0006) [2023-03-06 16:17:02,369][04272] Updated weights for policy 0, policy_version 80970 (0.0007) [2023-03-06 16:17:03,190][04272] Updated weights for policy 0, policy_version 80980 (0.0007) [2023-03-06 16:17:03,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12631.6). Total num frames: 82932736. Throughput: 0: 12601.6. Samples: 82924726. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:17:03,941][03942] Avg episode reward: [(0, '1277.019')] [2023-03-06 16:17:04,017][04272] Updated weights for policy 0, policy_version 80990 (0.0007) [2023-03-06 16:17:04,825][04272] Updated weights for policy 0, policy_version 81000 (0.0006) [2023-03-06 16:17:05,640][04272] Updated weights for policy 0, policy_version 81010 (0.0006) [2023-03-06 16:17:06,453][04272] Updated weights for policy 0, policy_version 81020 (0.0007) [2023-03-06 16:17:07,258][04272] Updated weights for policy 0, policy_version 81030 (0.0006) [2023-03-06 16:17:08,062][04272] Updated weights for policy 0, policy_version 81040 (0.0006) [2023-03-06 16:17:08,877][04272] Updated weights for policy 0, policy_version 81050 (0.0007) [2023-03-06 16:17:08,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12628.2). Total num frames: 82995200. Throughput: 0: 12595.8. Samples: 82962414. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:17:08,941][03942] Avg episode reward: [(0, '1268.408')] [2023-03-06 16:17:08,944][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000081050_82995200.pth... [2023-03-06 16:17:08,976][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000078092_79966208.pth [2023-03-06 16:17:09,699][04272] Updated weights for policy 0, policy_version 81060 (0.0006) [2023-03-06 16:17:10,507][04272] Updated weights for policy 0, policy_version 81070 (0.0007) [2023-03-06 16:17:11,312][04272] Updated weights for policy 0, policy_version 81080 (0.0006) [2023-03-06 16:17:12,121][04272] Updated weights for policy 0, policy_version 81090 (0.0006) [2023-03-06 16:17:12,933][04272] Updated weights for policy 0, policy_version 81100 (0.0007) [2023-03-06 16:17:13,740][04272] Updated weights for policy 0, policy_version 81110 (0.0006) [2023-03-06 16:17:13,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12628.2). Total num frames: 83058688. Throughput: 0: 12595.8. Samples: 83038057. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:17:13,941][03942] Avg episode reward: [(0, '1234.352')] [2023-03-06 16:17:14,563][04272] Updated weights for policy 0, policy_version 81120 (0.0006) [2023-03-06 16:17:15,376][04272] Updated weights for policy 0, policy_version 81130 (0.0006) [2023-03-06 16:17:16,167][04272] Updated weights for policy 0, policy_version 81140 (0.0006) [2023-03-06 16:17:16,988][04272] Updated weights for policy 0, policy_version 81150 (0.0007) [2023-03-06 16:17:17,806][04272] Updated weights for policy 0, policy_version 81160 (0.0007) [2023-03-06 16:17:18,627][04272] Updated weights for policy 0, policy_version 81170 (0.0007) [2023-03-06 16:17:18,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12624.7). Total num frames: 83121152. Throughput: 0: 12591.3. Samples: 83113680. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:17:18,941][03942] Avg episode reward: [(0, '1190.420')] [2023-03-06 16:17:19,450][04272] Updated weights for policy 0, policy_version 81180 (0.0005) [2023-03-06 16:17:20,272][04272] Updated weights for policy 0, policy_version 81190 (0.0007) [2023-03-06 16:17:21,072][04272] Updated weights for policy 0, policy_version 81200 (0.0007) [2023-03-06 16:17:21,893][04272] Updated weights for policy 0, policy_version 81210 (0.0006) [2023-03-06 16:17:22,701][04272] Updated weights for policy 0, policy_version 81220 (0.0006) [2023-03-06 16:17:23,509][04272] Updated weights for policy 0, policy_version 81230 (0.0006) [2023-03-06 16:17:23,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12624.7). Total num frames: 83184640. Throughput: 0: 12585.4. Samples: 83151286. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:17:23,941][03942] Avg episode reward: [(0, '1227.085')] [2023-03-06 16:17:24,334][04272] Updated weights for policy 0, policy_version 81240 (0.0006) [2023-03-06 16:17:25,129][04272] Updated weights for policy 0, policy_version 81250 (0.0007) [2023-03-06 16:17:25,933][04272] Updated weights for policy 0, policy_version 81260 (0.0006) [2023-03-06 16:17:26,745][04272] Updated weights for policy 0, policy_version 81270 (0.0006) [2023-03-06 16:17:27,571][04272] Updated weights for policy 0, policy_version 81280 (0.0006) [2023-03-06 16:17:28,361][04272] Updated weights for policy 0, policy_version 81290 (0.0006) [2023-03-06 16:17:28,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12595.2, 300 sec: 12628.2). Total num frames: 83248128. Throughput: 0: 12591.6. Samples: 83226973. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:17:28,941][03942] Avg episode reward: [(0, '1226.188')] [2023-03-06 16:17:29,157][04272] Updated weights for policy 0, policy_version 81300 (0.0007) [2023-03-06 16:17:29,977][04272] Updated weights for policy 0, policy_version 81310 (0.0005) [2023-03-06 16:17:30,789][04272] Updated weights for policy 0, policy_version 81320 (0.0007) [2023-03-06 16:17:31,599][04272] Updated weights for policy 0, policy_version 81330 (0.0006) [2023-03-06 16:17:32,411][04272] Updated weights for policy 0, policy_version 81340 (0.0006) [2023-03-06 16:17:33,220][04272] Updated weights for policy 0, policy_version 81350 (0.0006) [2023-03-06 16:17:33,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12624.7). Total num frames: 83310592. Throughput: 0: 12603.6. Samples: 83303069. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:17:33,941][03942] Avg episode reward: [(0, '1121.192')] [2023-03-06 16:17:34,028][04272] Updated weights for policy 0, policy_version 81360 (0.0007) [2023-03-06 16:17:34,858][04272] Updated weights for policy 0, policy_version 81370 (0.0006) [2023-03-06 16:17:35,667][04272] Updated weights for policy 0, policy_version 81380 (0.0006) [2023-03-06 16:17:36,474][04272] Updated weights for policy 0, policy_version 81390 (0.0006) [2023-03-06 16:17:37,287][04272] Updated weights for policy 0, policy_version 81400 (0.0006) [2023-03-06 16:17:38,095][04272] Updated weights for policy 0, policy_version 81410 (0.0006) [2023-03-06 16:17:38,908][04272] Updated weights for policy 0, policy_version 81420 (0.0006) [2023-03-06 16:17:38,941][03942] Fps is (10 sec: 12595.0, 60 sec: 12595.2, 300 sec: 12624.7). Total num frames: 83374080. Throughput: 0: 12608.6. Samples: 83341033. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:17:38,941][03942] Avg episode reward: [(0, '1197.790')] [2023-03-06 16:17:39,723][04272] Updated weights for policy 0, policy_version 81430 (0.0007) [2023-03-06 16:17:40,539][04272] Updated weights for policy 0, policy_version 81440 (0.0007) [2023-03-06 16:17:41,337][04272] Updated weights for policy 0, policy_version 81450 (0.0006) [2023-03-06 16:17:42,152][04272] Updated weights for policy 0, policy_version 81460 (0.0008) [2023-03-06 16:17:42,955][04272] Updated weights for policy 0, policy_version 81470 (0.0006) [2023-03-06 16:17:43,754][04272] Updated weights for policy 0, policy_version 81480 (0.0006) [2023-03-06 16:17:43,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12612.3, 300 sec: 12624.7). Total num frames: 83437568. Throughput: 0: 12614.8. Samples: 83416662. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:17:43,941][03942] Avg episode reward: [(0, '1260.348')] [2023-03-06 16:17:44,597][04272] Updated weights for policy 0, policy_version 81490 (0.0006) [2023-03-06 16:17:45,413][04272] Updated weights for policy 0, policy_version 81500 (0.0007) [2023-03-06 16:17:46,216][04272] Updated weights for policy 0, policy_version 81510 (0.0007) [2023-03-06 16:17:47,037][04272] Updated weights for policy 0, policy_version 81520 (0.0006) [2023-03-06 16:17:47,857][04272] Updated weights for policy 0, policy_version 81530 (0.0006) [2023-03-06 16:17:48,671][04272] Updated weights for policy 0, policy_version 81540 (0.0006) [2023-03-06 16:17:48,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12624.7). Total num frames: 83500032. Throughput: 0: 12610.0. Samples: 83492176. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:17:48,951][03942] Avg episode reward: [(0, '1332.736')] [2023-03-06 16:17:49,469][04272] Updated weights for policy 0, policy_version 81550 (0.0006) [2023-03-06 16:17:50,279][04272] Updated weights for policy 0, policy_version 81560 (0.0007) [2023-03-06 16:17:51,071][04272] Updated weights for policy 0, policy_version 81570 (0.0006) [2023-03-06 16:17:51,880][04272] Updated weights for policy 0, policy_version 81580 (0.0006) [2023-03-06 16:17:52,699][04272] Updated weights for policy 0, policy_version 81590 (0.0006) [2023-03-06 16:17:53,495][04272] Updated weights for policy 0, policy_version 81600 (0.0006) [2023-03-06 16:17:53,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12624.7). Total num frames: 83563520. Throughput: 0: 12618.5. Samples: 83530248. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:17:53,952][03942] Avg episode reward: [(0, '1171.267')] [2023-03-06 16:17:54,317][04272] Updated weights for policy 0, policy_version 81610 (0.0005) [2023-03-06 16:17:55,130][04272] Updated weights for policy 0, policy_version 81620 (0.0006) [2023-03-06 16:17:55,924][04272] Updated weights for policy 0, policy_version 81630 (0.0007) [2023-03-06 16:17:56,745][04272] Updated weights for policy 0, policy_version 81640 (0.0006) [2023-03-06 16:17:57,567][04272] Updated weights for policy 0, policy_version 81650 (0.0007) [2023-03-06 16:17:58,366][04272] Updated weights for policy 0, policy_version 81660 (0.0006) [2023-03-06 16:17:58,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12624.7). Total num frames: 83627008. Throughput: 0: 12622.1. Samples: 83606052. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:17:58,941][03942] Avg episode reward: [(0, '1150.578')] [2023-03-06 16:17:59,181][04272] Updated weights for policy 0, policy_version 81670 (0.0006) [2023-03-06 16:18:00,010][04272] Updated weights for policy 0, policy_version 81680 (0.0006) [2023-03-06 16:18:00,797][04272] Updated weights for policy 0, policy_version 81690 (0.0006) [2023-03-06 16:18:01,605][04272] Updated weights for policy 0, policy_version 81700 (0.0006) [2023-03-06 16:18:02,437][04272] Updated weights for policy 0, policy_version 81710 (0.0006) [2023-03-06 16:18:03,230][04272] Updated weights for policy 0, policy_version 81720 (0.0006) [2023-03-06 16:18:03,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12624.7). Total num frames: 83689472. Throughput: 0: 12626.4. Samples: 83681870. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:18:03,951][03942] Avg episode reward: [(0, '1124.295')] [2023-03-06 16:18:04,034][04272] Updated weights for policy 0, policy_version 81730 (0.0006) [2023-03-06 16:18:04,853][04272] Updated weights for policy 0, policy_version 81740 (0.0006) [2023-03-06 16:18:05,650][04272] Updated weights for policy 0, policy_version 81750 (0.0006) [2023-03-06 16:18:06,482][04272] Updated weights for policy 0, policy_version 81760 (0.0006) [2023-03-06 16:18:07,289][04272] Updated weights for policy 0, policy_version 81770 (0.0007) [2023-03-06 16:18:08,108][04272] Updated weights for policy 0, policy_version 81780 (0.0007) [2023-03-06 16:18:08,936][04272] Updated weights for policy 0, policy_version 81790 (0.0007) [2023-03-06 16:18:08,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12629.3, 300 sec: 12624.7). Total num frames: 83752960. Throughput: 0: 12633.3. Samples: 83719787. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:18:08,952][03942] Avg episode reward: [(0, '1168.430')] [2023-03-06 16:18:09,748][04272] Updated weights for policy 0, policy_version 81800 (0.0007) [2023-03-06 16:18:10,563][04272] Updated weights for policy 0, policy_version 81810 (0.0006) [2023-03-06 16:18:11,367][04272] Updated weights for policy 0, policy_version 81820 (0.0006) [2023-03-06 16:18:12,173][04272] Updated weights for policy 0, policy_version 81830 (0.0006) [2023-03-06 16:18:12,994][04272] Updated weights for policy 0, policy_version 81840 (0.0007) [2023-03-06 16:18:13,795][04272] Updated weights for policy 0, policy_version 81850 (0.0006) [2023-03-06 16:18:13,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12621.2). Total num frames: 83815424. Throughput: 0: 12626.7. Samples: 83795176. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:18:13,952][03942] Avg episode reward: [(0, '1225.803')] [2023-03-06 16:18:14,597][04272] Updated weights for policy 0, policy_version 81860 (0.0007) [2023-03-06 16:18:15,422][04272] Updated weights for policy 0, policy_version 81870 (0.0007) [2023-03-06 16:18:16,241][04272] Updated weights for policy 0, policy_version 81880 (0.0006) [2023-03-06 16:18:17,044][04272] Updated weights for policy 0, policy_version 81890 (0.0007) [2023-03-06 16:18:17,852][04272] Updated weights for policy 0, policy_version 81900 (0.0007) [2023-03-06 16:18:18,663][04272] Updated weights for policy 0, policy_version 81910 (0.0007) [2023-03-06 16:18:18,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12629.3, 300 sec: 12624.7). Total num frames: 83878912. Throughput: 0: 12621.4. Samples: 83871031. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:18:18,951][03942] Avg episode reward: [(0, '1039.229')] [2023-03-06 16:18:19,467][04272] Updated weights for policy 0, policy_version 81920 (0.0006) [2023-03-06 16:18:20,266][04272] Updated weights for policy 0, policy_version 81930 (0.0006) [2023-03-06 16:18:21,074][04272] Updated weights for policy 0, policy_version 81940 (0.0006) [2023-03-06 16:18:21,883][04272] Updated weights for policy 0, policy_version 81950 (0.0006) [2023-03-06 16:18:22,709][04272] Updated weights for policy 0, policy_version 81960 (0.0007) [2023-03-06 16:18:23,522][04272] Updated weights for policy 0, policy_version 81970 (0.0006) [2023-03-06 16:18:23,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12624.7). Total num frames: 83942400. Throughput: 0: 12620.4. Samples: 83908948. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:18:23,951][03942] Avg episode reward: [(0, '1093.126')] [2023-03-06 16:18:24,334][04272] Updated weights for policy 0, policy_version 81980 (0.0007) [2023-03-06 16:18:25,154][04272] Updated weights for policy 0, policy_version 81990 (0.0006) [2023-03-06 16:18:25,974][04272] Updated weights for policy 0, policy_version 82000 (0.0006) [2023-03-06 16:18:26,780][04272] Updated weights for policy 0, policy_version 82010 (0.0006) [2023-03-06 16:18:27,597][04272] Updated weights for policy 0, policy_version 82020 (0.0007) [2023-03-06 16:18:28,383][04272] Updated weights for policy 0, policy_version 82030 (0.0006) [2023-03-06 16:18:28,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.2, 300 sec: 12621.2). Total num frames: 84004864. Throughput: 0: 12618.8. Samples: 83984511. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:18:28,952][03942] Avg episode reward: [(0, '1226.705')] [2023-03-06 16:18:29,206][04272] Updated weights for policy 0, policy_version 82040 (0.0007) [2023-03-06 16:18:30,029][04272] Updated weights for policy 0, policy_version 82050 (0.0007) [2023-03-06 16:18:30,844][04272] Updated weights for policy 0, policy_version 82060 (0.0006) [2023-03-06 16:18:31,648][04272] Updated weights for policy 0, policy_version 82070 (0.0006) [2023-03-06 16:18:32,463][04272] Updated weights for policy 0, policy_version 82080 (0.0006) [2023-03-06 16:18:33,271][04272] Updated weights for policy 0, policy_version 82090 (0.0006) [2023-03-06 16:18:33,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12621.2). Total num frames: 84068352. Throughput: 0: 12625.0. Samples: 84060299. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:18:33,951][03942] Avg episode reward: [(0, '1175.170')] [2023-03-06 16:18:34,076][04272] Updated weights for policy 0, policy_version 82100 (0.0007) [2023-03-06 16:18:34,898][04272] Updated weights for policy 0, policy_version 82110 (0.0006) [2023-03-06 16:18:35,734][04272] Updated weights for policy 0, policy_version 82120 (0.0007) [2023-03-06 16:18:36,534][04272] Updated weights for policy 0, policy_version 82130 (0.0007) [2023-03-06 16:18:37,357][04272] Updated weights for policy 0, policy_version 82140 (0.0006) [2023-03-06 16:18:38,146][04272] Updated weights for policy 0, policy_version 82150 (0.0006) [2023-03-06 16:18:38,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12621.2). Total num frames: 84130816. Throughput: 0: 12616.4. Samples: 84097988. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:18:38,947][04272] Updated weights for policy 0, policy_version 82160 (0.0007) [2023-03-06 16:18:38,952][03942] Avg episode reward: [(0, '1154.499')] [2023-03-06 16:18:39,777][04272] Updated weights for policy 0, policy_version 82170 (0.0006) [2023-03-06 16:18:40,577][04272] Updated weights for policy 0, policy_version 82180 (0.0006) [2023-03-06 16:18:41,389][04272] Updated weights for policy 0, policy_version 82190 (0.0006) [2023-03-06 16:18:42,185][04272] Updated weights for policy 0, policy_version 82200 (0.0006) [2023-03-06 16:18:43,010][04272] Updated weights for policy 0, policy_version 82210 (0.0007) [2023-03-06 16:18:43,824][04272] Updated weights for policy 0, policy_version 82220 (0.0006) [2023-03-06 16:18:43,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12621.2). Total num frames: 84194304. Throughput: 0: 12618.7. Samples: 84173892. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:18:43,941][03942] Avg episode reward: [(0, '1192.357')] [2023-03-06 16:18:44,647][04272] Updated weights for policy 0, policy_version 82230 (0.0006) [2023-03-06 16:18:45,453][04272] Updated weights for policy 0, policy_version 82240 (0.0006) [2023-03-06 16:18:46,265][04272] Updated weights for policy 0, policy_version 82250 (0.0006) [2023-03-06 16:18:47,070][04272] Updated weights for policy 0, policy_version 82260 (0.0006) [2023-03-06 16:18:47,878][04272] Updated weights for policy 0, policy_version 82270 (0.0007) [2023-03-06 16:18:48,693][04272] Updated weights for policy 0, policy_version 82280 (0.0006) [2023-03-06 16:18:48,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12621.2). Total num frames: 84257792. Throughput: 0: 12616.1. Samples: 84249594. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:18:48,941][03942] Avg episode reward: [(0, '1219.934')] [2023-03-06 16:18:49,490][04272] Updated weights for policy 0, policy_version 82290 (0.0007) [2023-03-06 16:18:50,303][04272] Updated weights for policy 0, policy_version 82300 (0.0007) [2023-03-06 16:18:51,106][04272] Updated weights for policy 0, policy_version 82310 (0.0006) [2023-03-06 16:18:51,905][04272] Updated weights for policy 0, policy_version 82320 (0.0007) [2023-03-06 16:18:52,727][04272] Updated weights for policy 0, policy_version 82330 (0.0006) [2023-03-06 16:18:53,529][04272] Updated weights for policy 0, policy_version 82340 (0.0007) [2023-03-06 16:18:53,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12624.7). Total num frames: 84321280. Throughput: 0: 12617.9. Samples: 84287593. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:18:53,941][03942] Avg episode reward: [(0, '1244.913')] [2023-03-06 16:18:54,334][04272] Updated weights for policy 0, policy_version 82350 (0.0006) [2023-03-06 16:18:55,158][04272] Updated weights for policy 0, policy_version 82360 (0.0007) [2023-03-06 16:18:55,979][04272] Updated weights for policy 0, policy_version 82370 (0.0006) [2023-03-06 16:18:56,774][04272] Updated weights for policy 0, policy_version 82380 (0.0006) [2023-03-06 16:18:57,585][04272] Updated weights for policy 0, policy_version 82390 (0.0006) [2023-03-06 16:18:58,392][04272] Updated weights for policy 0, policy_version 82400 (0.0006) [2023-03-06 16:18:58,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12621.2). Total num frames: 84383744. Throughput: 0: 12628.4. Samples: 84363452. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:18:58,941][03942] Avg episode reward: [(0, '1320.921')] [2023-03-06 16:18:59,206][04272] Updated weights for policy 0, policy_version 82410 (0.0006) [2023-03-06 16:19:00,015][04272] Updated weights for policy 0, policy_version 82420 (0.0006) [2023-03-06 16:19:00,833][04272] Updated weights for policy 0, policy_version 82430 (0.0006) [2023-03-06 16:19:01,626][04272] Updated weights for policy 0, policy_version 82440 (0.0006) [2023-03-06 16:19:02,440][04272] Updated weights for policy 0, policy_version 82450 (0.0006) [2023-03-06 16:19:03,274][04272] Updated weights for policy 0, policy_version 82460 (0.0007) [2023-03-06 16:19:03,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12629.3, 300 sec: 12621.2). Total num frames: 84447232. Throughput: 0: 12626.2. Samples: 84439211. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:19:03,941][03942] Avg episode reward: [(0, '1292.844')] [2023-03-06 16:19:04,064][04272] Updated weights for policy 0, policy_version 82470 (0.0006) [2023-03-06 16:19:04,889][04272] Updated weights for policy 0, policy_version 82480 (0.0006) [2023-03-06 16:19:05,698][04272] Updated weights for policy 0, policy_version 82490 (0.0006) [2023-03-06 16:19:06,515][04272] Updated weights for policy 0, policy_version 82500 (0.0006) [2023-03-06 16:19:07,334][04272] Updated weights for policy 0, policy_version 82510 (0.0006) [2023-03-06 16:19:08,150][04272] Updated weights for policy 0, policy_version 82520 (0.0008) [2023-03-06 16:19:08,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12621.2). Total num frames: 84509696. Throughput: 0: 12622.6. Samples: 84476968. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:19:08,941][03942] Avg episode reward: [(0, '1359.283')] [2023-03-06 16:19:08,957][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000082530_84510720.pth... 
[2023-03-06 16:19:08,958][04272] Updated weights for policy 0, policy_version 82530 (0.0007) [2023-03-06 16:19:08,987][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000079572_81481728.pth [2023-03-06 16:19:09,788][04272] Updated weights for policy 0, policy_version 82540 (0.0007) [2023-03-06 16:19:10,612][04272] Updated weights for policy 0, policy_version 82550 (0.0007) [2023-03-06 16:19:11,412][04272] Updated weights for policy 0, policy_version 82560 (0.0006) [2023-03-06 16:19:12,229][04272] Updated weights for policy 0, policy_version 82570 (0.0006) [2023-03-06 16:19:13,062][04272] Updated weights for policy 0, policy_version 82580 (0.0007) [2023-03-06 16:19:13,863][04272] Updated weights for policy 0, policy_version 82590 (0.0006) [2023-03-06 16:19:13,941][03942] Fps is (10 sec: 12492.8, 60 sec: 12612.3, 300 sec: 12617.8). Total num frames: 84572160. Throughput: 0: 12618.4. Samples: 84552339. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:19:13,941][03942] Avg episode reward: [(0, '1200.429')] [2023-03-06 16:19:14,694][04272] Updated weights for policy 0, policy_version 82600 (0.0006) [2023-03-06 16:19:15,501][04272] Updated weights for policy 0, policy_version 82610 (0.0006) [2023-03-06 16:19:16,316][04272] Updated weights for policy 0, policy_version 82620 (0.0006) [2023-03-06 16:19:17,114][04272] Updated weights for policy 0, policy_version 82630 (0.0007) [2023-03-06 16:19:17,938][04272] Updated weights for policy 0, policy_version 82640 (0.0007) [2023-03-06 16:19:18,753][04272] Updated weights for policy 0, policy_version 82650 (0.0006) [2023-03-06 16:19:18,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12617.8). Total num frames: 84635648. Throughput: 0: 12608.9. Samples: 84627701. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:19:18,941][03942] Avg episode reward: [(0, '1226.488')] [2023-03-06 16:19:19,540][04272] Updated weights for policy 0, policy_version 82660 (0.0007) [2023-03-06 16:19:20,349][04272] Updated weights for policy 0, policy_version 82670 (0.0006) [2023-03-06 16:19:21,166][04272] Updated weights for policy 0, policy_version 82680 (0.0008) [2023-03-06 16:19:21,967][04272] Updated weights for policy 0, policy_version 82690 (0.0006) [2023-03-06 16:19:22,781][04272] Updated weights for policy 0, policy_version 82700 (0.0006) [2023-03-06 16:19:23,601][04272] Updated weights for policy 0, policy_version 82710 (0.0007) [2023-03-06 16:19:23,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12612.3, 300 sec: 12617.8). Total num frames: 84699136. Throughput: 0: 12617.2. Samples: 84665762. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:19:23,941][03942] Avg episode reward: [(0, '1182.877')] [2023-03-06 16:19:24,419][04272] Updated weights for policy 0, policy_version 82720 (0.0006) [2023-03-06 16:19:25,224][04272] Updated weights for policy 0, policy_version 82730 (0.0006) [2023-03-06 16:19:26,052][04272] Updated weights for policy 0, policy_version 82740 (0.0007) [2023-03-06 16:19:26,853][04272] Updated weights for policy 0, policy_version 82750 (0.0006) [2023-03-06 16:19:27,659][04272] Updated weights for policy 0, policy_version 82760 (0.0006) [2023-03-06 16:19:28,461][04272] Updated weights for policy 0, policy_version 82770 (0.0006) [2023-03-06 16:19:28,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12617.8). Total num frames: 84761600. Throughput: 0: 12612.6. Samples: 84741461. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:19:28,941][03942] Avg episode reward: [(0, '1193.650')] [2023-03-06 16:19:29,282][04272] Updated weights for policy 0, policy_version 82780 (0.0006) [2023-03-06 16:19:30,072][04272] Updated weights for policy 0, policy_version 82790 (0.0007) [2023-03-06 16:19:30,901][04272] Updated weights for policy 0, policy_version 82800 (0.0006) [2023-03-06 16:19:31,690][04272] Updated weights for policy 0, policy_version 82810 (0.0006) [2023-03-06 16:19:32,522][04272] Updated weights for policy 0, policy_version 82820 (0.0006) [2023-03-06 16:19:33,329][04272] Updated weights for policy 0, policy_version 82830 (0.0006) [2023-03-06 16:19:33,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12617.8). Total num frames: 84825088. Throughput: 0: 12613.0. Samples: 84817177. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:19:33,941][03942] Avg episode reward: [(0, '1254.343')] [2023-03-06 16:19:34,146][04272] Updated weights for policy 0, policy_version 82840 (0.0007) [2023-03-06 16:19:34,953][04272] Updated weights for policy 0, policy_version 82850 (0.0006) [2023-03-06 16:19:35,761][04272] Updated weights for policy 0, policy_version 82860 (0.0007) [2023-03-06 16:19:36,572][04272] Updated weights for policy 0, policy_version 82870 (0.0006) [2023-03-06 16:19:37,382][04272] Updated weights for policy 0, policy_version 82880 (0.0006) [2023-03-06 16:19:38,202][04272] Updated weights for policy 0, policy_version 82890 (0.0006) [2023-03-06 16:19:38,941][03942] Fps is (10 sec: 12697.7, 60 sec: 12629.3, 300 sec: 12617.8). Total num frames: 84888576. Throughput: 0: 12612.9. Samples: 84855172. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:19:38,941][03942] Avg episode reward: [(0, '1190.200')] [2023-03-06 16:19:39,016][04272] Updated weights for policy 0, policy_version 82900 (0.0006) [2023-03-06 16:19:39,834][04272] Updated weights for policy 0, policy_version 82910 (0.0006) [2023-03-06 16:19:40,643][04272] Updated weights for policy 0, policy_version 82920 (0.0006) [2023-03-06 16:19:41,456][04272] Updated weights for policy 0, policy_version 82930 (0.0006) [2023-03-06 16:19:42,265][04272] Updated weights for policy 0, policy_version 82940 (0.0007) [2023-03-06 16:19:43,092][04272] Updated weights for policy 0, policy_version 82950 (0.0006) [2023-03-06 16:19:43,904][04272] Updated weights for policy 0, policy_version 82960 (0.0006) [2023-03-06 16:19:43,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12617.8). Total num frames: 84951040. Throughput: 0: 12603.9. Samples: 84930629. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 16:19:43,941][03942] Avg episode reward: [(0, '1129.220')] [2023-03-06 16:19:44,728][04272] Updated weights for policy 0, policy_version 82970 (0.0006) [2023-03-06 16:19:45,536][04272] Updated weights for policy 0, policy_version 82980 (0.0007) [2023-03-06 16:19:46,355][04272] Updated weights for policy 0, policy_version 82990 (0.0006) [2023-03-06 16:19:47,170][04272] Updated weights for policy 0, policy_version 83000 (0.0006) [2023-03-06 16:19:47,993][04272] Updated weights for policy 0, policy_version 83010 (0.0007) [2023-03-06 16:19:48,790][04272] Updated weights for policy 0, policy_version 83020 (0.0006) [2023-03-06 16:19:48,941][03942] Fps is (10 sec: 12492.8, 60 sec: 12595.2, 300 sec: 12614.3). Total num frames: 85013504. Throughput: 0: 12596.4. Samples: 85006051. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 16:19:48,941][03942] Avg episode reward: [(0, '1167.152')] [2023-03-06 16:19:49,601][04272] Updated weights for policy 0, policy_version 83030 (0.0006) [2023-03-06 16:19:50,415][04272] Updated weights for policy 0, policy_version 83040 (0.0006) [2023-03-06 16:19:51,226][04272] Updated weights for policy 0, policy_version 83050 (0.0006) [2023-03-06 16:19:52,025][04272] Updated weights for policy 0, policy_version 83060 (0.0006) [2023-03-06 16:19:52,848][04272] Updated weights for policy 0, policy_version 83070 (0.0006) [2023-03-06 16:19:53,661][04272] Updated weights for policy 0, policy_version 83080 (0.0006) [2023-03-06 16:19:53,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12614.3). Total num frames: 85076992. Throughput: 0: 12599.9. Samples: 85043962. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 16:19:53,941][03942] Avg episode reward: [(0, '1146.887')] [2023-03-06 16:19:54,485][04272] Updated weights for policy 0, policy_version 83090 (0.0006) [2023-03-06 16:19:55,298][04272] Updated weights for policy 0, policy_version 83100 (0.0006) [2023-03-06 16:19:56,106][04272] Updated weights for policy 0, policy_version 83110 (0.0006) [2023-03-06 16:19:56,916][04272] Updated weights for policy 0, policy_version 83120 (0.0007) [2023-03-06 16:19:57,714][04272] Updated weights for policy 0, policy_version 83130 (0.0006) [2023-03-06 16:19:58,547][04272] Updated weights for policy 0, policy_version 83140 (0.0006) [2023-03-06 16:19:58,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12612.3, 300 sec: 12617.8). Total num frames: 85140480. Throughput: 0: 12603.7. Samples: 85119505. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 16:19:58,941][03942] Avg episode reward: [(0, '1284.769')] [2023-03-06 16:19:59,359][04272] Updated weights for policy 0, policy_version 83150 (0.0006) [2023-03-06 16:20:00,137][04272] Updated weights for policy 0, policy_version 83160 (0.0007) [2023-03-06 16:20:00,965][04272] Updated weights for policy 0, policy_version 83170 (0.0007) [2023-03-06 16:20:01,783][04272] Updated weights for policy 0, policy_version 83180 (0.0006) [2023-03-06 16:20:02,602][04272] Updated weights for policy 0, policy_version 83190 (0.0006) [2023-03-06 16:20:03,421][04272] Updated weights for policy 0, policy_version 83200 (0.0006) [2023-03-06 16:20:03,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12614.3). Total num frames: 85202944. Throughput: 0: 12607.5. Samples: 85195040. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 16:20:03,941][03942] Avg episode reward: [(0, '1288.909')] [2023-03-06 16:20:04,219][04272] Updated weights for policy 0, policy_version 83210 (0.0007) [2023-03-06 16:20:05,040][04272] Updated weights for policy 0, policy_version 83220 (0.0006) [2023-03-06 16:20:05,850][04272] Updated weights for policy 0, policy_version 83230 (0.0007) [2023-03-06 16:20:06,649][04272] Updated weights for policy 0, policy_version 83240 (0.0006) [2023-03-06 16:20:07,473][04272] Updated weights for policy 0, policy_version 83250 (0.0006) [2023-03-06 16:20:08,284][04272] Updated weights for policy 0, policy_version 83260 (0.0007) [2023-03-06 16:20:08,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12617.8). Total num frames: 85266432. Throughput: 0: 12605.5. Samples: 85233010. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 16:20:08,941][03942] Avg episode reward: [(0, '1320.206')] [2023-03-06 16:20:09,105][04272] Updated weights for policy 0, policy_version 83270 (0.0007) [2023-03-06 16:20:09,927][04272] Updated weights for policy 0, policy_version 83280 (0.0006) [2023-03-06 16:20:10,738][04272] Updated weights for policy 0, policy_version 83290 (0.0006) [2023-03-06 16:20:11,549][04272] Updated weights for policy 0, policy_version 83300 (0.0006) [2023-03-06 16:20:12,366][04272] Updated weights for policy 0, policy_version 83310 (0.0006) [2023-03-06 16:20:13,164][04272] Updated weights for policy 0, policy_version 83320 (0.0007) [2023-03-06 16:20:13,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12614.3). Total num frames: 85328896. Throughput: 0: 12599.7. Samples: 85308447. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 16:20:13,952][03942] Avg episode reward: [(0, '1226.449')] [2023-03-06 16:20:13,985][04272] Updated weights for policy 0, policy_version 83330 (0.0006) [2023-03-06 16:20:14,818][04272] Updated weights for policy 0, policy_version 83340 (0.0006) [2023-03-06 16:20:15,637][04272] Updated weights for policy 0, policy_version 83350 (0.0007) [2023-03-06 16:20:16,450][04272] Updated weights for policy 0, policy_version 83360 (0.0007) [2023-03-06 16:20:17,253][04272] Updated weights for policy 0, policy_version 83370 (0.0007) [2023-03-06 16:20:18,070][04272] Updated weights for policy 0, policy_version 83380 (0.0006) [2023-03-06 16:20:18,880][04272] Updated weights for policy 0, policy_version 83390 (0.0006) [2023-03-06 16:20:18,941][03942] Fps is (10 sec: 12492.8, 60 sec: 12595.2, 300 sec: 12610.8). Total num frames: 85391360. Throughput: 0: 12592.5. Samples: 85383839. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 16:20:18,951][03942] Avg episode reward: [(0, '1237.256')] [2023-03-06 16:20:19,690][04272] Updated weights for policy 0, policy_version 83400 (0.0006) [2023-03-06 16:20:20,511][04272] Updated weights for policy 0, policy_version 83410 (0.0006) [2023-03-06 16:20:21,325][04272] Updated weights for policy 0, policy_version 83420 (0.0006) [2023-03-06 16:20:22,125][04272] Updated weights for policy 0, policy_version 83430 (0.0006) [2023-03-06 16:20:22,930][04272] Updated weights for policy 0, policy_version 83440 (0.0006) [2023-03-06 16:20:23,734][04272] Updated weights for policy 0, policy_version 83450 (0.0006) [2023-03-06 16:20:23,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12614.3). Total num frames: 85454848. Throughput: 0: 12586.5. Samples: 85421566. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 16:20:23,951][03942] Avg episode reward: [(0, '1190.531')] [2023-03-06 16:20:24,528][04272] Updated weights for policy 0, policy_version 83460 (0.0006) [2023-03-06 16:20:25,351][04272] Updated weights for policy 0, policy_version 83470 (0.0006) [2023-03-06 16:20:26,175][04272] Updated weights for policy 0, policy_version 83480 (0.0006) [2023-03-06 16:20:26,985][04272] Updated weights for policy 0, policy_version 83490 (0.0006) [2023-03-06 16:20:27,784][04272] Updated weights for policy 0, policy_version 83500 (0.0006) [2023-03-06 16:20:28,606][04272] Updated weights for policy 0, policy_version 83510 (0.0006) [2023-03-06 16:20:28,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12612.3, 300 sec: 12614.3). Total num frames: 85518336. Throughput: 0: 12596.9. Samples: 85497491. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 16:20:28,952][03942] Avg episode reward: [(0, '1246.340')] [2023-03-06 16:20:29,394][04272] Updated weights for policy 0, policy_version 83520 (0.0006) [2023-03-06 16:20:30,204][04272] Updated weights for policy 0, policy_version 83530 (0.0006) [2023-03-06 16:20:31,026][04272] Updated weights for policy 0, policy_version 83540 (0.0006) [2023-03-06 16:20:31,826][04272] Updated weights for policy 0, policy_version 83550 (0.0007) [2023-03-06 16:20:32,648][04272] Updated weights for policy 0, policy_version 83560 (0.0007) [2023-03-06 16:20:33,464][04272] Updated weights for policy 0, policy_version 83570 (0.0006) [2023-03-06 16:20:33,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12614.3). Total num frames: 85580800. Throughput: 0: 12605.8. Samples: 85573313. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 16:20:33,941][03942] Avg episode reward: [(0, '1147.316')] [2023-03-06 16:20:34,274][04272] Updated weights for policy 0, policy_version 83580 (0.0007) [2023-03-06 16:20:35,086][04272] Updated weights for policy 0, policy_version 83590 (0.0006) [2023-03-06 16:20:35,893][04272] Updated weights for policy 0, policy_version 83600 (0.0007) [2023-03-06 16:20:36,698][04272] Updated weights for policy 0, policy_version 83610 (0.0007) [2023-03-06 16:20:37,533][04272] Updated weights for policy 0, policy_version 83620 (0.0006) [2023-03-06 16:20:38,331][04272] Updated weights for policy 0, policy_version 83630 (0.0006) [2023-03-06 16:20:38,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12614.3). Total num frames: 85644288. Throughput: 0: 12602.4. Samples: 85611069. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:20:38,941][03942] Avg episode reward: [(0, '1229.348')] [2023-03-06 16:20:39,125][04272] Updated weights for policy 0, policy_version 83640 (0.0007) [2023-03-06 16:20:39,938][04272] Updated weights for policy 0, policy_version 83650 (0.0007) [2023-03-06 16:20:40,757][04272] Updated weights for policy 0, policy_version 83660 (0.0007) [2023-03-06 16:20:41,570][04272] Updated weights for policy 0, policy_version 83670 (0.0006) [2023-03-06 16:20:42,364][04272] Updated weights for policy 0, policy_version 83680 (0.0007) [2023-03-06 16:20:43,167][04272] Updated weights for policy 0, policy_version 83690 (0.0006) [2023-03-06 16:20:43,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12612.3, 300 sec: 12614.3). Total num frames: 85707776. Throughput: 0: 12615.4. Samples: 85687198. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:20:43,941][03942] Avg episode reward: [(0, '1208.250')] [2023-03-06 16:20:43,993][04272] Updated weights for policy 0, policy_version 83700 (0.0006) [2023-03-06 16:20:44,797][04272] Updated weights for policy 0, policy_version 83710 (0.0006) [2023-03-06 16:20:45,613][04272] Updated weights for policy 0, policy_version 83720 (0.0007) [2023-03-06 16:20:46,431][04272] Updated weights for policy 0, policy_version 83730 (0.0005) [2023-03-06 16:20:47,226][04272] Updated weights for policy 0, policy_version 83740 (0.0006) [2023-03-06 16:20:48,029][04272] Updated weights for policy 0, policy_version 83750 (0.0006) [2023-03-06 16:20:48,853][04272] Updated weights for policy 0, policy_version 83760 (0.0006) [2023-03-06 16:20:48,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12614.3). Total num frames: 85771264. Throughput: 0: 12619.6. Samples: 85762921. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:20:48,941][03942] Avg episode reward: [(0, '1183.157')] [2023-03-06 16:20:49,674][04272] Updated weights for policy 0, policy_version 83770 (0.0006) [2023-03-06 16:20:50,459][04272] Updated weights for policy 0, policy_version 83780 (0.0006) [2023-03-06 16:20:51,299][04272] Updated weights for policy 0, policy_version 83790 (0.0005) [2023-03-06 16:20:52,102][04272] Updated weights for policy 0, policy_version 83800 (0.0007) [2023-03-06 16:20:52,921][04272] Updated weights for policy 0, policy_version 83810 (0.0006) [2023-03-06 16:20:53,738][04272] Updated weights for policy 0, policy_version 83820 (0.0007) [2023-03-06 16:20:53,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12614.3). Total num frames: 85833728. Throughput: 0: 12617.9. Samples: 85800815. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:20:53,941][03942] Avg episode reward: [(0, '1236.310')] [2023-03-06 16:20:54,535][04272] Updated weights for policy 0, policy_version 83830 (0.0006) [2023-03-06 16:20:55,345][04272] Updated weights for policy 0, policy_version 83840 (0.0006) [2023-03-06 16:20:56,177][04272] Updated weights for policy 0, policy_version 83850 (0.0006) [2023-03-06 16:20:56,965][04272] Updated weights for policy 0, policy_version 83860 (0.0006) [2023-03-06 16:20:57,764][04272] Updated weights for policy 0, policy_version 83870 (0.0006) [2023-03-06 16:20:58,588][04272] Updated weights for policy 0, policy_version 83880 (0.0006) [2023-03-06 16:20:58,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.2, 300 sec: 12614.3). Total num frames: 85897216. Throughput: 0: 12625.8. Samples: 85876607. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:20:58,941][03942] Avg episode reward: [(0, '1124.669')] [2023-03-06 16:20:59,386][04272] Updated weights for policy 0, policy_version 83890 (0.0006) [2023-03-06 16:21:00,190][04272] Updated weights for policy 0, policy_version 83900 (0.0006) [2023-03-06 16:21:01,017][04272] Updated weights for policy 0, policy_version 83910 (0.0006) [2023-03-06 16:21:01,823][04272] Updated weights for policy 0, policy_version 83920 (0.0006) [2023-03-06 16:21:02,631][04272] Updated weights for policy 0, policy_version 83930 (0.0006) [2023-03-06 16:21:03,443][04272] Updated weights for policy 0, policy_version 83940 (0.0007) [2023-03-06 16:21:03,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12610.8). Total num frames: 85959680. Throughput: 0: 12636.7. Samples: 85952490. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:21:03,941][03942] Avg episode reward: [(0, '1001.164')] [2023-03-06 16:21:04,267][04272] Updated weights for policy 0, policy_version 83950 (0.0006) [2023-03-06 16:21:05,053][04272] Updated weights for policy 0, policy_version 83960 (0.0007) [2023-03-06 16:21:05,856][04272] Updated weights for policy 0, policy_version 83970 (0.0007) [2023-03-06 16:21:06,669][04272] Updated weights for policy 0, policy_version 83980 (0.0006) [2023-03-06 16:21:07,481][04272] Updated weights for policy 0, policy_version 83990 (0.0006) [2023-03-06 16:21:08,301][04272] Updated weights for policy 0, policy_version 84000 (0.0007) [2023-03-06 16:21:08,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12614.3). Total num frames: 86023168. Throughput: 0: 12642.1. Samples: 85990463. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:21:08,952][03942] Avg episode reward: [(0, '1083.938')] [2023-03-06 16:21:08,954][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000084008_86024192.pth... 
[2023-03-06 16:21:08,986][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000081050_82995200.pth [2023-03-06 16:21:09,113][04272] Updated weights for policy 0, policy_version 84010 (0.0007) [2023-03-06 16:21:09,910][04272] Updated weights for policy 0, policy_version 84020 (0.0005) [2023-03-06 16:21:10,723][04272] Updated weights for policy 0, policy_version 84030 (0.0006) [2023-03-06 16:21:11,530][04272] Updated weights for policy 0, policy_version 84040 (0.0007) [2023-03-06 16:21:12,345][04272] Updated weights for policy 0, policy_version 84050 (0.0006) [2023-03-06 16:21:13,155][04272] Updated weights for policy 0, policy_version 84060 (0.0007) [2023-03-06 16:21:13,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12629.3, 300 sec: 12614.3). Total num frames: 86086656. Throughput: 0: 12642.7. Samples: 86066412. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:21:13,952][03942] Avg episode reward: [(0, '1182.940')] [2023-03-06 16:21:13,963][04272] Updated weights for policy 0, policy_version 84070 (0.0006) [2023-03-06 16:21:14,782][04272] Updated weights for policy 0, policy_version 84080 (0.0006) [2023-03-06 16:21:15,588][04272] Updated weights for policy 0, policy_version 84090 (0.0006) [2023-03-06 16:21:16,406][04272] Updated weights for policy 0, policy_version 84100 (0.0006) [2023-03-06 16:21:17,225][04272] Updated weights for policy 0, policy_version 84110 (0.0006) [2023-03-06 16:21:18,027][04272] Updated weights for policy 0, policy_version 84120 (0.0007) [2023-03-06 16:21:18,840][04272] Updated weights for policy 0, policy_version 84130 (0.0007) [2023-03-06 16:21:18,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12646.4, 300 sec: 12614.3). Total num frames: 86150144. Throughput: 0: 12638.2. Samples: 86142031. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:21:18,952][03942] Avg episode reward: [(0, '1151.093')] [2023-03-06 16:21:19,650][04272] Updated weights for policy 0, policy_version 84140 (0.0006) [2023-03-06 16:21:20,465][04272] Updated weights for policy 0, policy_version 84150 (0.0008) [2023-03-06 16:21:21,271][04272] Updated weights for policy 0, policy_version 84160 (0.0006) [2023-03-06 16:21:22,087][04272] Updated weights for policy 0, policy_version 84170 (0.0007) [2023-03-06 16:21:22,899][04272] Updated weights for policy 0, policy_version 84180 (0.0008) [2023-03-06 16:21:23,722][04272] Updated weights for policy 0, policy_version 84190 (0.0006) [2023-03-06 16:21:23,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12629.3, 300 sec: 12610.8). Total num frames: 86212608. Throughput: 0: 12641.5. Samples: 86179936. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:21:23,952][03942] Avg episode reward: [(0, '1191.853')] [2023-03-06 16:21:24,528][04272] Updated weights for policy 0, policy_version 84200 (0.0008) [2023-03-06 16:21:25,347][04272] Updated weights for policy 0, policy_version 84210 (0.0007) [2023-03-06 16:21:26,166][04272] Updated weights for policy 0, policy_version 84220 (0.0006) [2023-03-06 16:21:27,000][04272] Updated weights for policy 0, policy_version 84230 (0.0007) [2023-03-06 16:21:27,817][04272] Updated weights for policy 0, policy_version 84240 (0.0006) [2023-03-06 16:21:28,598][04272] Updated weights for policy 0, policy_version 84250 (0.0006) [2023-03-06 16:21:28,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12614.3). Total num frames: 86276096. Throughput: 0: 12624.0. Samples: 86255279. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:21:28,952][03942] Avg episode reward: [(0, '1131.686')] [2023-03-06 16:21:29,409][04272] Updated weights for policy 0, policy_version 84260 (0.0006) [2023-03-06 16:21:30,220][04272] Updated weights for policy 0, policy_version 84270 (0.0006) [2023-03-06 16:21:31,037][04272] Updated weights for policy 0, policy_version 84280 (0.0006) [2023-03-06 16:21:31,854][04272] Updated weights for policy 0, policy_version 84290 (0.0006) [2023-03-06 16:21:32,661][04272] Updated weights for policy 0, policy_version 84300 (0.0006) [2023-03-06 16:21:33,473][04272] Updated weights for policy 0, policy_version 84310 (0.0007) [2023-03-06 16:21:33,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12629.3, 300 sec: 12610.8). Total num frames: 86338560. Throughput: 0: 12622.0. Samples: 86330910. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:21:33,951][03942] Avg episode reward: [(0, '1265.155')] [2023-03-06 16:21:34,280][04272] Updated weights for policy 0, policy_version 84320 (0.0006) [2023-03-06 16:21:35,115][04272] Updated weights for policy 0, policy_version 84330 (0.0006) [2023-03-06 16:21:35,924][04272] Updated weights for policy 0, policy_version 84340 (0.0006) [2023-03-06 16:21:36,742][04272] Updated weights for policy 0, policy_version 84350 (0.0006) [2023-03-06 16:21:37,559][04272] Updated weights for policy 0, policy_version 84360 (0.0007) [2023-03-06 16:21:38,370][04272] Updated weights for policy 0, policy_version 84370 (0.0007) [2023-03-06 16:21:38,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12629.3, 300 sec: 12614.3). Total num frames: 86402048. Throughput: 0: 12618.2. Samples: 86368633. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:21:38,951][03942] Avg episode reward: [(0, '1236.338')] [2023-03-06 16:21:39,209][04272] Updated weights for policy 0, policy_version 84380 (0.0007) [2023-03-06 16:21:40,022][04272] Updated weights for policy 0, policy_version 84390 (0.0007) [2023-03-06 16:21:40,838][04272] Updated weights for policy 0, policy_version 84400 (0.0007) [2023-03-06 16:21:41,645][04272] Updated weights for policy 0, policy_version 84410 (0.0006) [2023-03-06 16:21:42,445][04272] Updated weights for policy 0, policy_version 84420 (0.0006) [2023-03-06 16:21:43,257][04272] Updated weights for policy 0, policy_version 84430 (0.0007) [2023-03-06 16:21:43,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12614.3). Total num frames: 86464512. Throughput: 0: 12606.6. Samples: 86443903. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:21:43,951][03942] Avg episode reward: [(0, '1224.078')] [2023-03-06 16:21:44,073][04272] Updated weights for policy 0, policy_version 84440 (0.0006) [2023-03-06 16:21:44,881][04272] Updated weights for policy 0, policy_version 84450 (0.0006) [2023-03-06 16:21:45,685][04272] Updated weights for policy 0, policy_version 84460 (0.0006) [2023-03-06 16:21:46,494][04272] Updated weights for policy 0, policy_version 84470 (0.0006) [2023-03-06 16:21:47,316][04272] Updated weights for policy 0, policy_version 84480 (0.0007) [2023-03-06 16:21:48,127][04272] Updated weights for policy 0, policy_version 84490 (0.0007) [2023-03-06 16:21:48,928][04272] Updated weights for policy 0, policy_version 84500 (0.0007) [2023-03-06 16:21:48,940][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12614.3). Total num frames: 86528000. Throughput: 0: 12606.1. Samples: 86519766. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:21:48,951][03942] Avg episode reward: [(0, '1282.146')] [2023-03-06 16:21:49,730][04272] Updated weights for policy 0, policy_version 84510 (0.0006) [2023-03-06 16:21:50,566][04272] Updated weights for policy 0, policy_version 84520 (0.0006) [2023-03-06 16:21:51,371][04272] Updated weights for policy 0, policy_version 84530 (0.0008) [2023-03-06 16:21:52,176][04272] Updated weights for policy 0, policy_version 84540 (0.0006) [2023-03-06 16:21:52,984][04272] Updated weights for policy 0, policy_version 84550 (0.0006) [2023-03-06 16:21:53,793][04272] Updated weights for policy 0, policy_version 84560 (0.0006) [2023-03-06 16:21:53,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12614.3). Total num frames: 86590464. Throughput: 0: 12602.9. Samples: 86557591. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:21:53,952][03942] Avg episode reward: [(0, '1171.751')] [2023-03-06 16:21:54,619][04272] Updated weights for policy 0, policy_version 84570 (0.0007) [2023-03-06 16:21:55,424][04272] Updated weights for policy 0, policy_version 84580 (0.0007) [2023-03-06 16:21:56,253][04272] Updated weights for policy 0, policy_version 84590 (0.0006) [2023-03-06 16:21:57,044][04272] Updated weights for policy 0, policy_version 84600 (0.0006) [2023-03-06 16:21:57,852][04272] Updated weights for policy 0, policy_version 84610 (0.0007) [2023-03-06 16:21:58,669][04272] Updated weights for policy 0, policy_version 84620 (0.0007) [2023-03-06 16:21:58,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12614.3). Total num frames: 86653952. Throughput: 0: 12598.8. Samples: 86633357. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:21:58,952][03942] Avg episode reward: [(0, '1246.541')] [2023-03-06 16:21:59,476][04272] Updated weights for policy 0, policy_version 84630 (0.0006) [2023-03-06 16:22:00,290][04272] Updated weights for policy 0, policy_version 84640 (0.0007) [2023-03-06 16:22:01,092][04272] Updated weights for policy 0, policy_version 84650 (0.0008) [2023-03-06 16:22:01,897][04272] Updated weights for policy 0, policy_version 84660 (0.0007) [2023-03-06 16:22:02,706][04272] Updated weights for policy 0, policy_version 84670 (0.0006) [2023-03-06 16:22:03,505][04272] Updated weights for policy 0, policy_version 84680 (0.0006) [2023-03-06 16:22:03,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12617.8). Total num frames: 86717440. Throughput: 0: 12605.4. Samples: 86709273. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:22:03,941][03942] Avg episode reward: [(0, '1320.175')] [2023-03-06 16:22:04,318][04272] Updated weights for policy 0, policy_version 84690 (0.0007) [2023-03-06 16:22:05,142][04272] Updated weights for policy 0, policy_version 84700 (0.0006) [2023-03-06 16:22:05,965][04272] Updated weights for policy 0, policy_version 84710 (0.0006) [2023-03-06 16:22:06,783][04272] Updated weights for policy 0, policy_version 84720 (0.0006) [2023-03-06 16:22:07,574][04272] Updated weights for policy 0, policy_version 84730 (0.0007) [2023-03-06 16:22:08,403][04272] Updated weights for policy 0, policy_version 84740 (0.0007) [2023-03-06 16:22:08,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12614.3). Total num frames: 86779904. Throughput: 0: 12603.6. Samples: 86747097. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:22:08,941][03942] Avg episode reward: [(0, '1101.699')] [2023-03-06 16:22:09,199][04272] Updated weights for policy 0, policy_version 84750 (0.0006) [2023-03-06 16:22:09,994][04272] Updated weights for policy 0, policy_version 84760 (0.0006) [2023-03-06 16:22:10,812][04272] Updated weights for policy 0, policy_version 84770 (0.0006) [2023-03-06 16:22:11,619][04272] Updated weights for policy 0, policy_version 84780 (0.0007) [2023-03-06 16:22:12,422][04272] Updated weights for policy 0, policy_version 84790 (0.0007) [2023-03-06 16:22:13,214][04272] Updated weights for policy 0, policy_version 84800 (0.0007) [2023-03-06 16:22:13,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12617.8). Total num frames: 86843392. Throughput: 0: 12618.3. Samples: 86823103. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:22:13,941][03942] Avg episode reward: [(0, '1111.066')] [2023-03-06 16:22:14,045][04272] Updated weights for policy 0, policy_version 84810 (0.0006) [2023-03-06 16:22:14,853][04272] Updated weights for policy 0, policy_version 84820 (0.0006) [2023-03-06 16:22:15,658][04272] Updated weights for policy 0, policy_version 84830 (0.0006) [2023-03-06 16:22:16,470][04272] Updated weights for policy 0, policy_version 84840 (0.0006) [2023-03-06 16:22:17,290][04272] Updated weights for policy 0, policy_version 84850 (0.0008) [2023-03-06 16:22:18,107][04272] Updated weights for policy 0, policy_version 84860 (0.0006) [2023-03-06 16:22:18,921][04272] Updated weights for policy 0, policy_version 84870 (0.0006) [2023-03-06 16:22:18,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12612.2, 300 sec: 12617.8). Total num frames: 86906880. Throughput: 0: 12621.1. Samples: 86898862. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:22:18,941][03942] Avg episode reward: [(0, '1249.955')] [2023-03-06 16:22:19,734][04272] Updated weights for policy 0, policy_version 84880 (0.0006) [2023-03-06 16:22:20,538][04272] Updated weights for policy 0, policy_version 84890 (0.0007) [2023-03-06 16:22:21,016][04221] KL-divergence is very high: 170.3874 [2023-03-06 16:22:21,106][04221] KL-divergence is very high: 105.3975 [2023-03-06 16:22:21,350][04272] Updated weights for policy 0, policy_version 84900 (0.0007) [2023-03-06 16:22:22,187][04272] Updated weights for policy 0, policy_version 84910 (0.0006) [2023-03-06 16:22:22,984][04272] Updated weights for policy 0, policy_version 84920 (0.0006) [2023-03-06 16:22:23,786][04272] Updated weights for policy 0, policy_version 84930 (0.0006) [2023-03-06 16:22:23,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12614.3). Total num frames: 86969344. Throughput: 0: 12622.7. Samples: 86936654. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:22:23,941][03942] Avg episode reward: [(0, '1207.230')] [2023-03-06 16:22:24,605][04272] Updated weights for policy 0, policy_version 84940 (0.0007) [2023-03-06 16:22:25,415][04272] Updated weights for policy 0, policy_version 84950 (0.0006) [2023-03-06 16:22:26,244][04272] Updated weights for policy 0, policy_version 84960 (0.0006) [2023-03-06 16:22:27,044][04272] Updated weights for policy 0, policy_version 84970 (0.0006) [2023-03-06 16:22:27,860][04272] Updated weights for policy 0, policy_version 84980 (0.0006) [2023-03-06 16:22:28,663][04272] Updated weights for policy 0, policy_version 84990 (0.0006) [2023-03-06 16:22:28,940][03942] Fps is (10 sec: 12595.4, 60 sec: 12612.3, 300 sec: 12617.8). Total num frames: 87032832. Throughput: 0: 12625.6. Samples: 87012053. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:22:28,941][03942] Avg episode reward: [(0, '1199.356')] [2023-03-06 16:22:29,496][04272] Updated weights for policy 0, policy_version 85000 (0.0006) [2023-03-06 16:22:30,303][04272] Updated weights for policy 0, policy_version 85010 (0.0006) [2023-03-06 16:22:31,108][04272] Updated weights for policy 0, policy_version 85020 (0.0007) [2023-03-06 16:22:31,937][04272] Updated weights for policy 0, policy_version 85030 (0.0006) [2023-03-06 16:22:32,738][04272] Updated weights for policy 0, policy_version 85040 (0.0006) [2023-03-06 16:22:33,554][04272] Updated weights for policy 0, policy_version 85050 (0.0006) [2023-03-06 16:22:33,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12614.3). Total num frames: 87095296. Throughput: 0: 12618.6. Samples: 87087603. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:22:33,941][03942] Avg episode reward: [(0, '1220.740')] [2023-03-06 16:22:34,373][04272] Updated weights for policy 0, policy_version 85060 (0.0006) [2023-03-06 16:22:35,168][04272] Updated weights for policy 0, policy_version 85070 (0.0006) [2023-03-06 16:22:35,956][04272] Updated weights for policy 0, policy_version 85080 (0.0006) [2023-03-06 16:22:36,793][04272] Updated weights for policy 0, policy_version 85090 (0.0006) [2023-03-06 16:22:37,603][04272] Updated weights for policy 0, policy_version 85100 (0.0006) [2023-03-06 16:22:38,422][04272] Updated weights for policy 0, policy_version 85110 (0.0006) [2023-03-06 16:22:38,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.2, 300 sec: 12614.3). Total num frames: 87158784. Throughput: 0: 12625.6. Samples: 87125742. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:22:38,941][03942] Avg episode reward: [(0, '1176.933')] [2023-03-06 16:22:39,245][04272] Updated weights for policy 0, policy_version 85120 (0.0007) [2023-03-06 16:22:40,067][04272] Updated weights for policy 0, policy_version 85130 (0.0006) [2023-03-06 16:22:40,861][04272] Updated weights for policy 0, policy_version 85140 (0.0006) [2023-03-06 16:22:41,674][04272] Updated weights for policy 0, policy_version 85150 (0.0006) [2023-03-06 16:22:42,490][04272] Updated weights for policy 0, policy_version 85160 (0.0006) [2023-03-06 16:22:43,283][04272] Updated weights for policy 0, policy_version 85170 (0.0005) [2023-03-06 16:22:43,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12617.8). Total num frames: 87222272. Throughput: 0: 12619.4. Samples: 87201229. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:22:43,941][03942] Avg episode reward: [(0, '1189.922')] [2023-03-06 16:22:44,104][04272] Updated weights for policy 0, policy_version 85180 (0.0007) [2023-03-06 16:22:44,935][04272] Updated weights for policy 0, policy_version 85190 (0.0006) [2023-03-06 16:22:45,734][04272] Updated weights for policy 0, policy_version 85200 (0.0006) [2023-03-06 16:22:46,543][04272] Updated weights for policy 0, policy_version 85210 (0.0006) [2023-03-06 16:22:47,351][04272] Updated weights for policy 0, policy_version 85220 (0.0006) [2023-03-06 16:22:48,148][04272] Updated weights for policy 0, policy_version 85230 (0.0007) [2023-03-06 16:22:48,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12614.3). Total num frames: 87284736. Throughput: 0: 12618.6. Samples: 87277112. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:22:48,941][03942] Avg episode reward: [(0, '1286.086')] [2023-03-06 16:22:48,945][04272] Updated weights for policy 0, policy_version 85240 (0.0006) [2023-03-06 16:22:49,763][04272] Updated weights for policy 0, policy_version 85250 (0.0006) [2023-03-06 16:22:50,563][04272] Updated weights for policy 0, policy_version 85260 (0.0006) [2023-03-06 16:22:51,390][04272] Updated weights for policy 0, policy_version 85270 (0.0006) [2023-03-06 16:22:52,182][04272] Updated weights for policy 0, policy_version 85280 (0.0006) [2023-03-06 16:22:53,010][04272] Updated weights for policy 0, policy_version 85290 (0.0006) [2023-03-06 16:22:53,820][04272] Updated weights for policy 0, policy_version 85300 (0.0006) [2023-03-06 16:22:53,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12614.3). Total num frames: 87348224. Throughput: 0: 12623.2. Samples: 87315139. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:22:53,941][03942] Avg episode reward: [(0, '1347.452')] [2023-03-06 16:22:54,646][04272] Updated weights for policy 0, policy_version 85310 (0.0007) [2023-03-06 16:22:55,445][04272] Updated weights for policy 0, policy_version 85320 (0.0006) [2023-03-06 16:22:56,254][04272] Updated weights for policy 0, policy_version 85330 (0.0006) [2023-03-06 16:22:57,080][04272] Updated weights for policy 0, policy_version 85340 (0.0007) [2023-03-06 16:22:57,883][04272] Updated weights for policy 0, policy_version 85350 (0.0006) [2023-03-06 16:22:58,681][04272] Updated weights for policy 0, policy_version 85360 (0.0006) [2023-03-06 16:22:58,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12629.3, 300 sec: 12617.8). Total num frames: 87411712. Throughput: 0: 12614.8. Samples: 87390772. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:22:58,941][03942] Avg episode reward: [(0, '1283.523')] [2023-03-06 16:22:59,499][04272] Updated weights for policy 0, policy_version 85370 (0.0007) [2023-03-06 16:23:00,304][04272] Updated weights for policy 0, policy_version 85380 (0.0007) [2023-03-06 16:23:01,125][04272] Updated weights for policy 0, policy_version 85390 (0.0006) [2023-03-06 16:23:01,935][04272] Updated weights for policy 0, policy_version 85400 (0.0006) [2023-03-06 16:23:02,745][04272] Updated weights for policy 0, policy_version 85410 (0.0006) [2023-03-06 16:23:03,550][04272] Updated weights for policy 0, policy_version 85420 (0.0006) [2023-03-06 16:23:03,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.2, 300 sec: 12614.3). Total num frames: 87474176. Throughput: 0: 12615.0. Samples: 87466537. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:23:03,942][03942] Avg episode reward: [(0, '1318.929')] [2023-03-06 16:23:04,359][04272] Updated weights for policy 0, policy_version 85430 (0.0006) [2023-03-06 16:23:05,178][04272] Updated weights for policy 0, policy_version 85440 (0.0006) [2023-03-06 16:23:05,989][04272] Updated weights for policy 0, policy_version 85450 (0.0007) [2023-03-06 16:23:06,798][04272] Updated weights for policy 0, policy_version 85460 (0.0006) [2023-03-06 16:23:07,613][04272] Updated weights for policy 0, policy_version 85470 (0.0007) [2023-03-06 16:23:08,423][04272] Updated weights for policy 0, policy_version 85480 (0.0007) [2023-03-06 16:23:08,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12629.4, 300 sec: 12617.8). Total num frames: 87537664. Throughput: 0: 12614.8. Samples: 87504321. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:23:08,941][03942] Avg episode reward: [(0, '1217.616')] [2023-03-06 16:23:08,945][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000085486_87537664.pth... 
[2023-03-06 16:23:08,976][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000082530_84510720.pth [2023-03-06 16:23:09,234][04272] Updated weights for policy 0, policy_version 85490 (0.0006) [2023-03-06 16:23:10,051][04272] Updated weights for policy 0, policy_version 85500 (0.0006) [2023-03-06 16:23:10,873][04272] Updated weights for policy 0, policy_version 85510 (0.0006) [2023-03-06 16:23:11,658][04272] Updated weights for policy 0, policy_version 85520 (0.0007) [2023-03-06 16:23:12,467][04272] Updated weights for policy 0, policy_version 85530 (0.0006) [2023-03-06 16:23:13,273][04272] Updated weights for policy 0, policy_version 85540 (0.0007) [2023-03-06 16:23:13,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.2, 300 sec: 12614.3). Total num frames: 87600128. Throughput: 0: 12628.0. Samples: 87580316. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:23:13,941][03942] Avg episode reward: [(0, '1332.009')] [2023-03-06 16:23:14,106][04272] Updated weights for policy 0, policy_version 85550 (0.0007) [2023-03-06 16:23:14,909][04272] Updated weights for policy 0, policy_version 85560 (0.0006) [2023-03-06 16:23:15,701][04272] Updated weights for policy 0, policy_version 85570 (0.0006) [2023-03-06 16:23:16,516][04272] Updated weights for policy 0, policy_version 85580 (0.0006) [2023-03-06 16:23:17,335][04272] Updated weights for policy 0, policy_version 85590 (0.0007) [2023-03-06 16:23:18,161][04272] Updated weights for policy 0, policy_version 85600 (0.0006) [2023-03-06 16:23:18,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12614.3). Total num frames: 87663616. Throughput: 0: 12629.6. Samples: 87655935. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:23:18,941][03942] Avg episode reward: [(0, '1227.384')] [2023-03-06 16:23:18,958][04272] Updated weights for policy 0, policy_version 85610 (0.0007) [2023-03-06 16:23:19,792][04272] Updated weights for policy 0, policy_version 85620 (0.0007) [2023-03-06 16:23:20,593][04272] Updated weights for policy 0, policy_version 85630 (0.0006) [2023-03-06 16:23:21,401][04272] Updated weights for policy 0, policy_version 85640 (0.0007) [2023-03-06 16:23:22,224][04272] Updated weights for policy 0, policy_version 85650 (0.0006) [2023-03-06 16:23:23,029][04272] Updated weights for policy 0, policy_version 85660 (0.0006) [2023-03-06 16:23:23,839][04272] Updated weights for policy 0, policy_version 85670 (0.0006) [2023-03-06 16:23:23,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12629.3, 300 sec: 12617.8). Total num frames: 87727104. Throughput: 0: 12623.7. Samples: 87693809. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:23:23,941][03942] Avg episode reward: [(0, '1342.003')] [2023-03-06 16:23:24,644][04272] Updated weights for policy 0, policy_version 85680 (0.0006) [2023-03-06 16:23:25,445][04272] Updated weights for policy 0, policy_version 85690 (0.0006) [2023-03-06 16:23:26,257][04272] Updated weights for policy 0, policy_version 85700 (0.0007) [2023-03-06 16:23:27,071][04272] Updated weights for policy 0, policy_version 85710 (0.0006) [2023-03-06 16:23:27,872][04272] Updated weights for policy 0, policy_version 85720 (0.0007) [2023-03-06 16:23:28,667][04272] Updated weights for policy 0, policy_version 85730 (0.0006) [2023-03-06 16:23:28,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12617.8). Total num frames: 87790592. Throughput: 0: 12633.9. Samples: 87769756. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:23:28,941][03942] Avg episode reward: [(0, '1306.598')] [2023-03-06 16:23:29,476][04272] Updated weights for policy 0, policy_version 85740 (0.0006) [2023-03-06 16:23:30,287][04272] Updated weights for policy 0, policy_version 85750 (0.0007) [2023-03-06 16:23:31,099][04272] Updated weights for policy 0, policy_version 85760 (0.0006) [2023-03-06 16:23:31,917][04272] Updated weights for policy 0, policy_version 85770 (0.0006) [2023-03-06 16:23:32,718][04272] Updated weights for policy 0, policy_version 85780 (0.0006) [2023-03-06 16:23:33,544][04272] Updated weights for policy 0, policy_version 85790 (0.0007) [2023-03-06 16:23:33,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12646.4, 300 sec: 12621.2). Total num frames: 87854080. Throughput: 0: 12636.7. Samples: 87845762. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:23:33,941][03942] Avg episode reward: [(0, '1264.941')] [2023-03-06 16:23:34,349][04272] Updated weights for policy 0, policy_version 85800 (0.0006) [2023-03-06 16:23:35,142][04272] Updated weights for policy 0, policy_version 85810 (0.0007) [2023-03-06 16:23:35,958][04272] Updated weights for policy 0, policy_version 85820 (0.0006) [2023-03-06 16:23:36,761][04272] Updated weights for policy 0, policy_version 85830 (0.0007) [2023-03-06 16:23:37,584][04272] Updated weights for policy 0, policy_version 85840 (0.0007) [2023-03-06 16:23:38,406][04272] Updated weights for policy 0, policy_version 85850 (0.0007) [2023-03-06 16:23:38,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12617.8). Total num frames: 87916544. Throughput: 0: 12636.6. Samples: 87883784. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:23:38,941][03942] Avg episode reward: [(0, '1211.875')] [2023-03-06 16:23:39,198][04272] Updated weights for policy 0, policy_version 85860 (0.0006) [2023-03-06 16:23:40,008][04272] Updated weights for policy 0, policy_version 85870 (0.0007) [2023-03-06 16:23:40,833][04272] Updated weights for policy 0, policy_version 85880 (0.0007) [2023-03-06 16:23:41,637][04272] Updated weights for policy 0, policy_version 85890 (0.0006) [2023-03-06 16:23:42,461][04272] Updated weights for policy 0, policy_version 85900 (0.0007) [2023-03-06 16:23:43,269][04272] Updated weights for policy 0, policy_version 85910 (0.0006) [2023-03-06 16:23:43,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12629.3, 300 sec: 12617.8). Total num frames: 87980032. Throughput: 0: 12635.8. Samples: 87959383. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:23:43,941][03942] Avg episode reward: [(0, '1225.725')] [2023-03-06 16:23:44,079][04272] Updated weights for policy 0, policy_version 85920 (0.0006) [2023-03-06 16:23:44,898][04272] Updated weights for policy 0, policy_version 85930 (0.0006) [2023-03-06 16:23:45,706][04272] Updated weights for policy 0, policy_version 85940 (0.0006) [2023-03-06 16:23:46,514][04272] Updated weights for policy 0, policy_version 85950 (0.0007) [2023-03-06 16:23:47,308][04272] Updated weights for policy 0, policy_version 85960 (0.0007) [2023-03-06 16:23:48,119][04272] Updated weights for policy 0, policy_version 85970 (0.0006) [2023-03-06 16:23:48,933][04272] Updated weights for policy 0, policy_version 85980 (0.0007) [2023-03-06 16:23:48,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12646.4, 300 sec: 12617.8). Total num frames: 88043520. Throughput: 0: 12639.2. Samples: 88035299. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:23:48,941][03942] Avg episode reward: [(0, '1051.310')] [2023-03-06 16:23:49,732][04272] Updated weights for policy 0, policy_version 85990 (0.0006) [2023-03-06 16:23:50,547][04272] Updated weights for policy 0, policy_version 86000 (0.0006) [2023-03-06 16:23:51,357][04272] Updated weights for policy 0, policy_version 86010 (0.0007) [2023-03-06 16:23:52,158][04272] Updated weights for policy 0, policy_version 86020 (0.0007) [2023-03-06 16:23:52,972][04272] Updated weights for policy 0, policy_version 86030 (0.0006) [2023-03-06 16:23:53,790][04272] Updated weights for policy 0, policy_version 86040 (0.0007) [2023-03-06 16:23:53,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12617.8). Total num frames: 88105984. Throughput: 0: 12642.1. Samples: 88073217. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:23:53,941][03942] Avg episode reward: [(0, '1357.980')] [2023-03-06 16:23:54,612][04272] Updated weights for policy 0, policy_version 86050 (0.0006) [2023-03-06 16:23:55,410][04272] Updated weights for policy 0, policy_version 86060 (0.0007) [2023-03-06 16:23:56,221][04272] Updated weights for policy 0, policy_version 86070 (0.0006) [2023-03-06 16:23:57,046][04272] Updated weights for policy 0, policy_version 86080 (0.0006) [2023-03-06 16:23:57,842][04272] Updated weights for policy 0, policy_version 86090 (0.0006) [2023-03-06 16:23:58,660][04272] Updated weights for policy 0, policy_version 86100 (0.0007) [2023-03-06 16:23:58,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12629.3, 300 sec: 12617.8). Total num frames: 88169472. Throughput: 0: 12638.7. Samples: 88149057. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:23:58,941][03942] Avg episode reward: [(0, '1185.084')] [2023-03-06 16:23:59,482][04272] Updated weights for policy 0, policy_version 86110 (0.0006) [2023-03-06 16:24:00,302][04272] Updated weights for policy 0, policy_version 86120 (0.0006) [2023-03-06 16:24:01,108][04272] Updated weights for policy 0, policy_version 86130 (0.0005) [2023-03-06 16:24:01,914][04272] Updated weights for policy 0, policy_version 86140 (0.0006) [2023-03-06 16:24:02,742][04272] Updated weights for policy 0, policy_version 86150 (0.0007) [2023-03-06 16:24:03,553][04272] Updated weights for policy 0, policy_version 86160 (0.0007) [2023-03-06 16:24:03,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.4, 300 sec: 12617.8). Total num frames: 88231936. Throughput: 0: 12635.2. Samples: 88224521. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:24:03,941][03942] Avg episode reward: [(0, '1230.749')] [2023-03-06 16:24:04,365][04272] Updated weights for policy 0, policy_version 86170 (0.0006) [2023-03-06 16:24:05,193][04272] Updated weights for policy 0, policy_version 86180 (0.0006) [2023-03-06 16:24:05,980][04272] Updated weights for policy 0, policy_version 86190 (0.0006) [2023-03-06 16:24:06,788][04272] Updated weights for policy 0, policy_version 86200 (0.0006) [2023-03-06 16:24:07,592][04272] Updated weights for policy 0, policy_version 86210 (0.0007) [2023-03-06 16:24:08,406][04272] Updated weights for policy 0, policy_version 86220 (0.0007) [2023-03-06 16:24:08,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12621.2). Total num frames: 88295424. Throughput: 0: 12633.8. Samples: 88262331. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:24:08,952][03942] Avg episode reward: [(0, '1332.639')] [2023-03-06 16:24:09,208][04272] Updated weights for policy 0, policy_version 86230 (0.0006) [2023-03-06 16:24:10,028][04272] Updated weights for policy 0, policy_version 86240 (0.0006) [2023-03-06 16:24:10,847][04272] Updated weights for policy 0, policy_version 86250 (0.0006) [2023-03-06 16:24:11,655][04272] Updated weights for policy 0, policy_version 86260 (0.0006) [2023-03-06 16:24:12,473][04272] Updated weights for policy 0, policy_version 86270 (0.0007) [2023-03-06 16:24:13,270][04272] Updated weights for policy 0, policy_version 86280 (0.0005) [2023-03-06 16:24:13,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12646.4, 300 sec: 12621.2). Total num frames: 88358912. Throughput: 0: 12630.8. Samples: 88338144. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:24:13,941][03942] Avg episode reward: [(0, '1251.859')] [2023-03-06 16:24:14,091][04272] Updated weights for policy 0, policy_version 86290 (0.0006) [2023-03-06 16:24:14,909][04272] Updated weights for policy 0, policy_version 86300 (0.0007) [2023-03-06 16:24:15,712][04272] Updated weights for policy 0, policy_version 86310 (0.0006) [2023-03-06 16:24:16,528][04272] Updated weights for policy 0, policy_version 86320 (0.0007) [2023-03-06 16:24:17,336][04272] Updated weights for policy 0, policy_version 86330 (0.0006) [2023-03-06 16:24:18,146][04272] Updated weights for policy 0, policy_version 86340 (0.0006) [2023-03-06 16:24:18,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12629.3, 300 sec: 12617.8). Total num frames: 88421376. Throughput: 0: 12623.6. Samples: 88413824. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:24:18,941][03942] Avg episode reward: [(0, '1224.130')] [2023-03-06 16:24:18,949][04272] Updated weights for policy 0, policy_version 86350 (0.0006) [2023-03-06 16:24:19,776][04272] Updated weights for policy 0, policy_version 86360 (0.0006) [2023-03-06 16:24:20,563][04272] Updated weights for policy 0, policy_version 86370 (0.0006) [2023-03-06 16:24:21,384][04272] Updated weights for policy 0, policy_version 86380 (0.0007) [2023-03-06 16:24:22,219][04272] Updated weights for policy 0, policy_version 86390 (0.0006) [2023-03-06 16:24:23,034][04272] Updated weights for policy 0, policy_version 86400 (0.0007) [2023-03-06 16:24:23,847][04272] Updated weights for policy 0, policy_version 86410 (0.0006) [2023-03-06 16:24:23,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12621.2). Total num frames: 88484864. Throughput: 0: 12623.9. Samples: 88451859. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:24:23,941][03942] Avg episode reward: [(0, '1040.048')] [2023-03-06 16:24:24,659][04272] Updated weights for policy 0, policy_version 86420 (0.0006) [2023-03-06 16:24:25,461][04272] Updated weights for policy 0, policy_version 86430 (0.0006) [2023-03-06 16:24:26,271][04272] Updated weights for policy 0, policy_version 86440 (0.0006) [2023-03-06 16:24:27,091][04272] Updated weights for policy 0, policy_version 86450 (0.0006) [2023-03-06 16:24:27,912][04272] Updated weights for policy 0, policy_version 86460 (0.0007) [2023-03-06 16:24:28,717][04272] Updated weights for policy 0, policy_version 86470 (0.0006) [2023-03-06 16:24:28,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12617.8). Total num frames: 88547328. Throughput: 0: 12616.6. Samples: 88527130. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:24:28,941][03942] Avg episode reward: [(0, '1064.435')] [2023-03-06 16:24:29,543][04272] Updated weights for policy 0, policy_version 86480 (0.0006) [2023-03-06 16:24:30,360][04272] Updated weights for policy 0, policy_version 86490 (0.0007) [2023-03-06 16:24:31,156][04272] Updated weights for policy 0, policy_version 86500 (0.0006) [2023-03-06 16:24:31,975][04272] Updated weights for policy 0, policy_version 86510 (0.0006) [2023-03-06 16:24:32,782][04272] Updated weights for policy 0, policy_version 86520 (0.0006) [2023-03-06 16:24:33,593][04272] Updated weights for policy 0, policy_version 86530 (0.0007) [2023-03-06 16:24:33,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12617.8). Total num frames: 88610816. Throughput: 0: 12609.0. Samples: 88602704. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:24:33,941][03942] Avg episode reward: [(0, '945.206')] [2023-03-06 16:24:34,414][04272] Updated weights for policy 0, policy_version 86540 (0.0006) [2023-03-06 16:24:35,224][04272] Updated weights for policy 0, policy_version 86550 (0.0006) [2023-03-06 16:24:36,045][04272] Updated weights for policy 0, policy_version 86560 (0.0008) [2023-03-06 16:24:36,851][04272] Updated weights for policy 0, policy_version 86570 (0.0006) [2023-03-06 16:24:37,653][04272] Updated weights for policy 0, policy_version 86580 (0.0007) [2023-03-06 16:24:38,454][04272] Updated weights for policy 0, policy_version 86590 (0.0006) [2023-03-06 16:24:38,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12617.8). Total num frames: 88673280. Throughput: 0: 12603.1. Samples: 88640356. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:24:38,941][03942] Avg episode reward: [(0, '976.451')] [2023-03-06 16:24:39,273][04272] Updated weights for policy 0, policy_version 86600 (0.0006) [2023-03-06 16:24:40,076][04272] Updated weights for policy 0, policy_version 86610 (0.0006) [2023-03-06 16:24:40,897][04272] Updated weights for policy 0, policy_version 86620 (0.0007) [2023-03-06 16:24:41,706][04272] Updated weights for policy 0, policy_version 86630 (0.0006) [2023-03-06 16:24:42,522][04272] Updated weights for policy 0, policy_version 86640 (0.0006) [2023-03-06 16:24:43,316][04272] Updated weights for policy 0, policy_version 86650 (0.0007) [2023-03-06 16:24:43,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12621.2). Total num frames: 88736768. Throughput: 0: 12608.0. Samples: 88716420. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:24:43,941][03942] Avg episode reward: [(0, '1206.993')] [2023-03-06 16:24:44,115][04272] Updated weights for policy 0, policy_version 86660 (0.0006) [2023-03-06 16:24:44,947][04272] Updated weights for policy 0, policy_version 86670 (0.0006) [2023-03-06 16:24:45,755][04272] Updated weights for policy 0, policy_version 86680 (0.0007) [2023-03-06 16:24:46,568][04272] Updated weights for policy 0, policy_version 86690 (0.0006) [2023-03-06 16:24:47,388][04272] Updated weights for policy 0, policy_version 86700 (0.0007) [2023-03-06 16:24:48,184][04272] Updated weights for policy 0, policy_version 86710 (0.0006) [2023-03-06 16:24:48,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12612.3, 300 sec: 12621.2). Total num frames: 88800256. Throughput: 0: 12616.2. Samples: 88792249. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:24:48,941][03942] Avg episode reward: [(0, '1224.932')] [2023-03-06 16:24:48,990][04272] Updated weights for policy 0, policy_version 86720 (0.0006) [2023-03-06 16:24:49,797][04272] Updated weights for policy 0, policy_version 86730 (0.0006) [2023-03-06 16:24:50,623][04272] Updated weights for policy 0, policy_version 86740 (0.0006) [2023-03-06 16:24:51,407][04272] Updated weights for policy 0, policy_version 86750 (0.0006) [2023-03-06 16:24:52,246][04272] Updated weights for policy 0, policy_version 86760 (0.0007) [2023-03-06 16:24:53,047][04272] Updated weights for policy 0, policy_version 86770 (0.0007) [2023-03-06 16:24:53,869][04272] Updated weights for policy 0, policy_version 86780 (0.0006) [2023-03-06 16:24:53,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12617.8). Total num frames: 88862720. Throughput: 0: 12619.6. Samples: 88830215. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:24:53,941][03942] Avg episode reward: [(0, '1229.169')] [2023-03-06 16:24:54,675][04272] Updated weights for policy 0, policy_version 86790 (0.0006) [2023-03-06 16:24:55,499][04272] Updated weights for policy 0, policy_version 86800 (0.0007) [2023-03-06 16:24:56,304][04272] Updated weights for policy 0, policy_version 86810 (0.0006) [2023-03-06 16:24:57,128][04272] Updated weights for policy 0, policy_version 86820 (0.0006) [2023-03-06 16:24:57,942][04272] Updated weights for policy 0, policy_version 86830 (0.0006) [2023-03-06 16:24:58,737][04272] Updated weights for policy 0, policy_version 86840 (0.0007) [2023-03-06 16:24:58,941][03942] Fps is (10 sec: 12595.0, 60 sec: 12612.2, 300 sec: 12621.2). Total num frames: 88926208. Throughput: 0: 12610.2. Samples: 88905604. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:24:58,941][03942] Avg episode reward: [(0, '1211.906')] [2023-03-06 16:24:59,551][04272] Updated weights for policy 0, policy_version 86850 (0.0006) [2023-03-06 16:25:00,369][04272] Updated weights for policy 0, policy_version 86860 (0.0005) [2023-03-06 16:25:01,161][04272] Updated weights for policy 0, policy_version 86870 (0.0007) [2023-03-06 16:25:01,982][04272] Updated weights for policy 0, policy_version 86880 (0.0006) [2023-03-06 16:25:02,801][04272] Updated weights for policy 0, policy_version 86890 (0.0007) [2023-03-06 16:25:03,607][04272] Updated weights for policy 0, policy_version 86900 (0.0006) [2023-03-06 16:25:03,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12629.3, 300 sec: 12621.2). Total num frames: 88989696. Throughput: 0: 12613.6. Samples: 88981434. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:25:03,941][03942] Avg episode reward: [(0, '1218.667')] [2023-03-06 16:25:04,430][04272] Updated weights for policy 0, policy_version 86910 (0.0006) [2023-03-06 16:25:05,245][04272] Updated weights for policy 0, policy_version 86920 (0.0005) [2023-03-06 16:25:06,059][04272] Updated weights for policy 0, policy_version 86930 (0.0006) [2023-03-06 16:25:06,877][04272] Updated weights for policy 0, policy_version 86940 (0.0007) [2023-03-06 16:25:07,687][04272] Updated weights for policy 0, policy_version 86950 (0.0006) [2023-03-06 16:25:08,498][04272] Updated weights for policy 0, policy_version 86960 (0.0006) [2023-03-06 16:25:08,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12621.2). Total num frames: 89052160. Throughput: 0: 12604.7. Samples: 89019068. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:25:08,941][03942] Avg episode reward: [(0, '1304.468')] [2023-03-06 16:25:08,944][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000086965_89052160.pth... 
[2023-03-06 16:25:08,975][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000084008_86024192.pth [2023-03-06 16:25:09,315][04272] Updated weights for policy 0, policy_version 86970 (0.0006) [2023-03-06 16:25:10,147][04272] Updated weights for policy 0, policy_version 86980 (0.0006) [2023-03-06 16:25:10,942][04272] Updated weights for policy 0, policy_version 86990 (0.0006) [2023-03-06 16:25:11,749][04272] Updated weights for policy 0, policy_version 87000 (0.0007) [2023-03-06 16:25:12,571][04272] Updated weights for policy 0, policy_version 87010 (0.0006) [2023-03-06 16:25:13,390][04272] Updated weights for policy 0, policy_version 87020 (0.0007) [2023-03-06 16:25:13,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12624.7). Total num frames: 89115648. Throughput: 0: 12614.8. Samples: 89094796. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:25:13,941][03942] Avg episode reward: [(0, '1190.816')] [2023-03-06 16:25:14,186][04272] Updated weights for policy 0, policy_version 87030 (0.0006) [2023-03-06 16:25:15,021][04272] Updated weights for policy 0, policy_version 87040 (0.0007) [2023-03-06 16:25:15,819][04272] Updated weights for policy 0, policy_version 87050 (0.0006) [2023-03-06 16:25:16,622][04272] Updated weights for policy 0, policy_version 87060 (0.0007) [2023-03-06 16:25:17,410][04272] Updated weights for policy 0, policy_version 87070 (0.0006) [2023-03-06 16:25:18,215][04272] Updated weights for policy 0, policy_version 87080 (0.0006) [2023-03-06 16:25:18,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12621.2). Total num frames: 89178112. Throughput: 0: 12619.7. Samples: 89170589. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:25:18,941][03942] Avg episode reward: [(0, '1163.300')] [2023-03-06 16:25:19,049][04272] Updated weights for policy 0, policy_version 87090 (0.0007) [2023-03-06 16:25:19,858][04272] Updated weights for policy 0, policy_version 87100 (0.0007) [2023-03-06 16:25:20,678][04272] Updated weights for policy 0, policy_version 87110 (0.0006) [2023-03-06 16:25:21,484][04272] Updated weights for policy 0, policy_version 87120 (0.0008) [2023-03-06 16:25:22,289][04272] Updated weights for policy 0, policy_version 87130 (0.0006) [2023-03-06 16:25:23,114][04272] Updated weights for policy 0, policy_version 87140 (0.0006) [2023-03-06 16:25:23,909][04272] Updated weights for policy 0, policy_version 87150 (0.0006) [2023-03-06 16:25:23,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12621.2). Total num frames: 89241600. Throughput: 0: 12622.4. Samples: 89208362. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:25:23,941][03942] Avg episode reward: [(0, '1128.244')] [2023-03-06 16:25:24,729][04272] Updated weights for policy 0, policy_version 87160 (0.0006) [2023-03-06 16:25:25,533][04272] Updated weights for policy 0, policy_version 87170 (0.0006) [2023-03-06 16:25:26,346][04272] Updated weights for policy 0, policy_version 87180 (0.0006) [2023-03-06 16:25:27,158][04272] Updated weights for policy 0, policy_version 87190 (0.0007) [2023-03-06 16:25:27,963][04272] Updated weights for policy 0, policy_version 87200 (0.0006) [2023-03-06 16:25:28,785][04272] Updated weights for policy 0, policy_version 87210 (0.0006) [2023-03-06 16:25:28,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12621.2). Total num frames: 89304064. Throughput: 0: 12610.6. Samples: 89283899. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:25:28,941][03942] Avg episode reward: [(0, '1150.829')] [2023-03-06 16:25:29,584][04272] Updated weights for policy 0, policy_version 87220 (0.0006) [2023-03-06 16:25:30,394][04272] Updated weights for policy 0, policy_version 87230 (0.0007) [2023-03-06 16:25:31,228][04272] Updated weights for policy 0, policy_version 87240 (0.0006) [2023-03-06 16:25:32,039][04272] Updated weights for policy 0, policy_version 87250 (0.0008) [2023-03-06 16:25:32,839][04272] Updated weights for policy 0, policy_version 87260 (0.0007) [2023-03-06 16:25:33,658][04272] Updated weights for policy 0, policy_version 87270 (0.0006) [2023-03-06 16:25:33,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12621.2). Total num frames: 89367552. Throughput: 0: 12611.3. Samples: 89359757. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:25:33,941][03942] Avg episode reward: [(0, '1167.924')] [2023-03-06 16:25:34,457][04272] Updated weights for policy 0, policy_version 87280 (0.0007) [2023-03-06 16:25:35,267][04272] Updated weights for policy 0, policy_version 87290 (0.0006) [2023-03-06 16:25:36,078][04272] Updated weights for policy 0, policy_version 87300 (0.0006) [2023-03-06 16:25:36,880][04272] Updated weights for policy 0, policy_version 87310 (0.0007) [2023-03-06 16:25:37,698][04272] Updated weights for policy 0, policy_version 87320 (0.0006) [2023-03-06 16:25:38,521][04272] Updated weights for policy 0, policy_version 87330 (0.0006) [2023-03-06 16:25:38,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12629.3, 300 sec: 12621.2). Total num frames: 89431040. Throughput: 0: 12610.9. Samples: 89397706. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:25:38,941][03942] Avg episode reward: [(0, '1225.021')] [2023-03-06 16:25:39,338][04272] Updated weights for policy 0, policy_version 87340 (0.0007) [2023-03-06 16:25:40,159][04272] Updated weights for policy 0, policy_version 87350 (0.0006) [2023-03-06 16:25:40,977][04272] Updated weights for policy 0, policy_version 87360 (0.0007) [2023-03-06 16:25:41,777][04272] Updated weights for policy 0, policy_version 87370 (0.0007) [2023-03-06 16:25:42,597][04272] Updated weights for policy 0, policy_version 87380 (0.0008) [2023-03-06 16:25:43,410][04272] Updated weights for policy 0, policy_version 87390 (0.0006) [2023-03-06 16:25:43,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12617.8). Total num frames: 89493504. Throughput: 0: 12610.2. Samples: 89473062. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:25:43,941][03942] Avg episode reward: [(0, '1198.557')] [2023-03-06 16:25:44,233][04272] Updated weights for policy 0, policy_version 87400 (0.0006) [2023-03-06 16:25:45,044][04272] Updated weights for policy 0, policy_version 87410 (0.0006) [2023-03-06 16:25:45,854][04272] Updated weights for policy 0, policy_version 87420 (0.0006) [2023-03-06 16:25:46,665][04272] Updated weights for policy 0, policy_version 87430 (0.0006) [2023-03-06 16:25:47,466][04272] Updated weights for policy 0, policy_version 87440 (0.0006) [2023-03-06 16:25:48,282][04272] Updated weights for policy 0, policy_version 87450 (0.0006) [2023-03-06 16:25:48,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12621.2). Total num frames: 89556992. Throughput: 0: 12605.6. Samples: 89548684. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:25:48,941][03942] Avg episode reward: [(0, '1203.339')] [2023-03-06 16:25:49,094][04272] Updated weights for policy 0, policy_version 87460 (0.0006) [2023-03-06 16:25:49,890][04272] Updated weights for policy 0, policy_version 87470 (0.0006) [2023-03-06 16:25:50,707][04272] Updated weights for policy 0, policy_version 87480 (0.0006) [2023-03-06 16:25:51,501][04272] Updated weights for policy 0, policy_version 87490 (0.0007) [2023-03-06 16:25:52,313][04272] Updated weights for policy 0, policy_version 87500 (0.0007) [2023-03-06 16:25:53,134][04272] Updated weights for policy 0, policy_version 87510 (0.0006) [2023-03-06 16:25:53,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12617.8). Total num frames: 89619456. Throughput: 0: 12618.0. Samples: 89586877. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:25:53,941][03942] Avg episode reward: [(0, '909.409')] [2023-03-06 16:25:53,951][04272] Updated weights for policy 0, policy_version 87520 (0.0006) [2023-03-06 16:25:54,766][04272] Updated weights for policy 0, policy_version 87530 (0.0006) [2023-03-06 16:25:55,582][04272] Updated weights for policy 0, policy_version 87540 (0.0006) [2023-03-06 16:25:56,390][04272] Updated weights for policy 0, policy_version 87550 (0.0006) [2023-03-06 16:25:57,209][04272] Updated weights for policy 0, policy_version 87560 (0.0006) [2023-03-06 16:25:58,007][04272] Updated weights for policy 0, policy_version 87570 (0.0006) [2023-03-06 16:25:58,814][04272] Updated weights for policy 0, policy_version 87580 (0.0006) [2023-03-06 16:25:58,940][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12621.2). Total num frames: 89682944. Throughput: 0: 12612.2. Samples: 89662346. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:25:58,941][03942] Avg episode reward: [(0, '768.954')] [2023-03-06 16:25:59,637][04272] Updated weights for policy 0, policy_version 87590 (0.0005) [2023-03-06 16:26:00,446][04272] Updated weights for policy 0, policy_version 87600 (0.0006) [2023-03-06 16:26:01,250][04272] Updated weights for policy 0, policy_version 87610 (0.0006) [2023-03-06 16:26:02,070][04272] Updated weights for policy 0, policy_version 87620 (0.0006) [2023-03-06 16:26:02,891][04272] Updated weights for policy 0, policy_version 87630 (0.0006) [2023-03-06 16:26:03,694][04272] Updated weights for policy 0, policy_version 87640 (0.0007) [2023-03-06 16:26:03,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12617.8). Total num frames: 89745408. Throughput: 0: 12608.5. Samples: 89737973. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:26:03,941][03942] Avg episode reward: [(0, '924.285')] [2023-03-06 16:26:04,509][04272] Updated weights for policy 0, policy_version 87650 (0.0006) [2023-03-06 16:26:05,349][04272] Updated weights for policy 0, policy_version 87660 (0.0006) [2023-03-06 16:26:06,157][04272] Updated weights for policy 0, policy_version 87670 (0.0006) [2023-03-06 16:26:06,972][04272] Updated weights for policy 0, policy_version 87680 (0.0006) [2023-03-06 16:26:07,789][04272] Updated weights for policy 0, policy_version 87690 (0.0006) [2023-03-06 16:26:08,601][04272] Updated weights for policy 0, policy_version 87700 (0.0006) [2023-03-06 16:26:08,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12617.8). Total num frames: 89808896. Throughput: 0: 12604.2. Samples: 89775552. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:26:08,941][03942] Avg episode reward: [(0, '942.361')] [2023-03-06 16:26:09,412][04272] Updated weights for policy 0, policy_version 87710 (0.0006) [2023-03-06 16:26:10,230][04272] Updated weights for policy 0, policy_version 87720 (0.0007) [2023-03-06 16:26:11,048][04272] Updated weights for policy 0, policy_version 87730 (0.0006) [2023-03-06 16:26:11,863][04272] Updated weights for policy 0, policy_version 87740 (0.0006) [2023-03-06 16:26:12,659][04272] Updated weights for policy 0, policy_version 87750 (0.0007) [2023-03-06 16:26:13,461][04272] Updated weights for policy 0, policy_version 87760 (0.0007) [2023-03-06 16:26:13,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12614.3). Total num frames: 89871360. Throughput: 0: 12606.2. Samples: 89851177. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:26:13,941][03942] Avg episode reward: [(0, '1040.709')] [2023-03-06 16:26:14,285][04272] Updated weights for policy 0, policy_version 87770 (0.0005) [2023-03-06 16:26:15,104][04272] Updated weights for policy 0, policy_version 87780 (0.0007) [2023-03-06 16:26:15,919][04272] Updated weights for policy 0, policy_version 87790 (0.0006) [2023-03-06 16:26:16,750][04272] Updated weights for policy 0, policy_version 87800 (0.0007) [2023-03-06 16:26:17,557][04272] Updated weights for policy 0, policy_version 87810 (0.0006) [2023-03-06 16:26:18,374][04272] Updated weights for policy 0, policy_version 87820 (0.0006) [2023-03-06 16:26:18,941][03942] Fps is (10 sec: 12492.7, 60 sec: 12595.2, 300 sec: 12614.3). Total num frames: 89933824. Throughput: 0: 12593.9. Samples: 89926482. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:26:18,941][03942] Avg episode reward: [(0, '1092.945')] [2023-03-06 16:26:19,191][04272] Updated weights for policy 0, policy_version 87830 (0.0006) [2023-03-06 16:26:19,985][04272] Updated weights for policy 0, policy_version 87840 (0.0006) [2023-03-06 16:26:20,802][04272] Updated weights for policy 0, policy_version 87850 (0.0006) [2023-03-06 16:26:21,594][04272] Updated weights for policy 0, policy_version 87860 (0.0007) [2023-03-06 16:26:22,408][04272] Updated weights for policy 0, policy_version 87870 (0.0006) [2023-03-06 16:26:23,236][04272] Updated weights for policy 0, policy_version 87880 (0.0006) [2023-03-06 16:26:23,941][03942] Fps is (10 sec: 12595.0, 60 sec: 12595.2, 300 sec: 12614.3). Total num frames: 89997312. Throughput: 0: 12598.2. Samples: 89964626. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:26:23,941][03942] Avg episode reward: [(0, '1199.559')] [2023-03-06 16:26:24,038][04272] Updated weights for policy 0, policy_version 87890 (0.0007) [2023-03-06 16:26:24,840][04272] Updated weights for policy 0, policy_version 87900 (0.0006) [2023-03-06 16:26:25,652][04272] Updated weights for policy 0, policy_version 87910 (0.0006) [2023-03-06 16:26:26,481][04272] Updated weights for policy 0, policy_version 87920 (0.0007) [2023-03-06 16:26:27,290][04272] Updated weights for policy 0, policy_version 87930 (0.0009) [2023-03-06 16:26:28,101][04272] Updated weights for policy 0, policy_version 87940 (0.0006) [2023-03-06 16:26:28,917][04272] Updated weights for policy 0, policy_version 87950 (0.0006) [2023-03-06 16:26:28,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12612.3, 300 sec: 12617.8). Total num frames: 90060800. Throughput: 0: 12602.4. Samples: 90040170. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:26:28,941][03942] Avg episode reward: [(0, '1146.110')] [2023-03-06 16:26:29,718][04272] Updated weights for policy 0, policy_version 87960 (0.0006) [2023-03-06 16:26:30,540][04272] Updated weights for policy 0, policy_version 87970 (0.0007) [2023-03-06 16:26:31,371][04272] Updated weights for policy 0, policy_version 87980 (0.0006) [2023-03-06 16:26:32,170][04272] Updated weights for policy 0, policy_version 87990 (0.0006) [2023-03-06 16:26:32,970][04272] Updated weights for policy 0, policy_version 88000 (0.0006) [2023-03-06 16:26:33,786][04272] Updated weights for policy 0, policy_version 88010 (0.0006) [2023-03-06 16:26:33,940][03942] Fps is (10 sec: 12697.8, 60 sec: 12612.3, 300 sec: 12617.8). Total num frames: 90124288. Throughput: 0: 12603.3. Samples: 90115832. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:26:33,941][03942] Avg episode reward: [(0, '1118.194')] [2023-03-06 16:26:34,585][04272] Updated weights for policy 0, policy_version 88020 (0.0007) [2023-03-06 16:26:35,392][04272] Updated weights for policy 0, policy_version 88030 (0.0006) [2023-03-06 16:26:36,206][04272] Updated weights for policy 0, policy_version 88040 (0.0006) [2023-03-06 16:26:37,006][04272] Updated weights for policy 0, policy_version 88050 (0.0006) [2023-03-06 16:26:37,819][04272] Updated weights for policy 0, policy_version 88060 (0.0007) [2023-03-06 16:26:38,646][04272] Updated weights for policy 0, policy_version 88070 (0.0006) [2023-03-06 16:26:38,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12617.8). Total num frames: 90186752. Throughput: 0: 12601.0. Samples: 90153924. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:26:38,952][03942] Avg episode reward: [(0, '1222.332')] [2023-03-06 16:26:39,450][04272] Updated weights for policy 0, policy_version 88080 (0.0007) [2023-03-06 16:26:40,263][04272] Updated weights for policy 0, policy_version 88090 (0.0006) [2023-03-06 16:26:41,069][04272] Updated weights for policy 0, policy_version 88100 (0.0007) [2023-03-06 16:26:41,874][04272] Updated weights for policy 0, policy_version 88110 (0.0006) [2023-03-06 16:26:42,682][04272] Updated weights for policy 0, policy_version 88120 (0.0007) [2023-03-06 16:26:43,507][04272] Updated weights for policy 0, policy_version 88130 (0.0006) [2023-03-06 16:26:43,941][03942] Fps is (10 sec: 12595.0, 60 sec: 12612.3, 300 sec: 12617.8). Total num frames: 90250240. Throughput: 0: 12606.5. Samples: 90229637. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:26:43,952][03942] Avg episode reward: [(0, '1164.383')] [2023-03-06 16:26:44,322][04272] Updated weights for policy 0, policy_version 88140 (0.0006) [2023-03-06 16:26:45,122][04272] Updated weights for policy 0, policy_version 88150 (0.0007) [2023-03-06 16:26:45,943][04272] Updated weights for policy 0, policy_version 88160 (0.0006) [2023-03-06 16:26:46,772][04272] Updated weights for policy 0, policy_version 88170 (0.0006) [2023-03-06 16:26:47,569][04272] Updated weights for policy 0, policy_version 88180 (0.0006) [2023-03-06 16:26:48,388][04272] Updated weights for policy 0, policy_version 88190 (0.0006) [2023-03-06 16:26:48,940][03942] Fps is (10 sec: 12595.5, 60 sec: 12595.2, 300 sec: 12617.8). Total num frames: 90312704. Throughput: 0: 12603.1. Samples: 90305111. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:26:48,951][03942] Avg episode reward: [(0, '1253.749')] [2023-03-06 16:26:49,210][04272] Updated weights for policy 0, policy_version 88200 (0.0007) [2023-03-06 16:26:50,014][04272] Updated weights for policy 0, policy_version 88210 (0.0006) [2023-03-06 16:26:50,840][04272] Updated weights for policy 0, policy_version 88220 (0.0007) [2023-03-06 16:26:51,649][04272] Updated weights for policy 0, policy_version 88230 (0.0006) [2023-03-06 16:26:52,472][04272] Updated weights for policy 0, policy_version 88240 (0.0006) [2023-03-06 16:26:53,287][04272] Updated weights for policy 0, policy_version 88250 (0.0007) [2023-03-06 16:26:53,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12617.8). Total num frames: 90376192. Throughput: 0: 12603.3. Samples: 90342703. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:26:53,952][03942] Avg episode reward: [(0, '1219.255')] [2023-03-06 16:26:54,109][04272] Updated weights for policy 0, policy_version 88260 (0.0007) [2023-03-06 16:26:54,945][04272] Updated weights for policy 0, policy_version 88270 (0.0006) [2023-03-06 16:26:55,747][04272] Updated weights for policy 0, policy_version 88280 (0.0006) [2023-03-06 16:26:56,573][04272] Updated weights for policy 0, policy_version 88290 (0.0006) [2023-03-06 16:26:57,376][04272] Updated weights for policy 0, policy_version 88300 (0.0007) [2023-03-06 16:26:58,178][04272] Updated weights for policy 0, policy_version 88310 (0.0007) [2023-03-06 16:26:58,941][03942] Fps is (10 sec: 12595.0, 60 sec: 12595.2, 300 sec: 12614.3). Total num frames: 90438656. Throughput: 0: 12601.6. Samples: 90418252. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:26:58,952][03942] Avg episode reward: [(0, '1228.346')] [2023-03-06 16:26:59,023][04272] Updated weights for policy 0, policy_version 88320 (0.0006) [2023-03-06 16:26:59,818][04272] Updated weights for policy 0, policy_version 88330 (0.0006) [2023-03-06 16:27:00,615][04272] Updated weights for policy 0, policy_version 88340 (0.0007) [2023-03-06 16:27:01,446][04272] Updated weights for policy 0, policy_version 88350 (0.0006) [2023-03-06 16:27:02,251][04272] Updated weights for policy 0, policy_version 88360 (0.0007) [2023-03-06 16:27:03,055][04272] Updated weights for policy 0, policy_version 88370 (0.0006) [2023-03-06 16:27:03,871][04272] Updated weights for policy 0, policy_version 88380 (0.0006) [2023-03-06 16:27:03,940][03942] Fps is (10 sec: 12492.8, 60 sec: 12595.2, 300 sec: 12614.3). Total num frames: 90501120. Throughput: 0: 12604.5. Samples: 90493683. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:27:03,941][03942] Avg episode reward: [(0, '1231.000')] [2023-03-06 16:27:04,683][04272] Updated weights for policy 0, policy_version 88390 (0.0007) [2023-03-06 16:27:05,494][04272] Updated weights for policy 0, policy_version 88400 (0.0006) [2023-03-06 16:27:06,314][04272] Updated weights for policy 0, policy_version 88410 (0.0007) [2023-03-06 16:27:07,129][04272] Updated weights for policy 0, policy_version 88420 (0.0006) [2023-03-06 16:27:07,950][04272] Updated weights for policy 0, policy_version 88430 (0.0007) [2023-03-06 16:27:08,768][04272] Updated weights for policy 0, policy_version 88440 (0.0007) [2023-03-06 16:27:08,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12614.3). Total num frames: 90564608. Throughput: 0: 12593.8. Samples: 90531348. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:27:08,941][03942] Avg episode reward: [(0, '1217.371')] [2023-03-06 16:27:08,944][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000088442_90564608.pth... [2023-03-06 16:27:08,977][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000085486_87537664.pth [2023-03-06 16:27:09,581][04272] Updated weights for policy 0, policy_version 88450 (0.0006) [2023-03-06 16:27:10,374][04272] Updated weights for policy 0, policy_version 88460 (0.0007) [2023-03-06 16:27:11,178][04272] Updated weights for policy 0, policy_version 88470 (0.0006) [2023-03-06 16:27:12,010][04272] Updated weights for policy 0, policy_version 88480 (0.0006) [2023-03-06 16:27:12,820][04272] Updated weights for policy 0, policy_version 88490 (0.0007) [2023-03-06 16:27:13,618][04272] Updated weights for policy 0, policy_version 88500 (0.0006) [2023-03-06 16:27:13,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12610.8). Total num frames: 90627072. Throughput: 0: 12593.3. Samples: 90606867. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:27:13,941][03942] Avg episode reward: [(0, '1203.151')] [2023-03-06 16:27:14,436][04272] Updated weights for policy 0, policy_version 88510 (0.0006) [2023-03-06 16:27:15,230][04272] Updated weights for policy 0, policy_version 88520 (0.0006) [2023-03-06 16:27:16,044][04272] Updated weights for policy 0, policy_version 88530 (0.0007) [2023-03-06 16:27:16,871][04272] Updated weights for policy 0, policy_version 88540 (0.0007) [2023-03-06 16:27:17,681][04272] Updated weights for policy 0, policy_version 88550 (0.0006) [2023-03-06 16:27:18,514][04272] Updated weights for policy 0, policy_version 88560 (0.0006) [2023-03-06 16:27:18,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12614.3). Total num frames: 90690560. Throughput: 0: 12594.4. Samples: 90682581. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:27:18,941][03942] Avg episode reward: [(0, '1230.927')] [2023-03-06 16:27:19,322][04272] Updated weights for policy 0, policy_version 88570 (0.0007) [2023-03-06 16:27:20,149][04272] Updated weights for policy 0, policy_version 88580 (0.0006) [2023-03-06 16:27:20,938][04272] Updated weights for policy 0, policy_version 88590 (0.0006) [2023-03-06 16:27:21,759][04272] Updated weights for policy 0, policy_version 88600 (0.0007) [2023-03-06 16:27:22,571][04272] Updated weights for policy 0, policy_version 88610 (0.0006) [2023-03-06 16:27:23,403][04272] Updated weights for policy 0, policy_version 88620 (0.0006) [2023-03-06 16:27:23,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12610.8). Total num frames: 90753024. Throughput: 0: 12587.6. Samples: 90720363. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:27:23,941][03942] Avg episode reward: [(0, '1147.680')] [2023-03-06 16:27:24,221][04272] Updated weights for policy 0, policy_version 88630 (0.0006) [2023-03-06 16:27:25,028][04272] Updated weights for policy 0, policy_version 88640 (0.0007) [2023-03-06 16:27:25,840][04272] Updated weights for policy 0, policy_version 88650 (0.0006) [2023-03-06 16:27:26,647][04272] Updated weights for policy 0, policy_version 88660 (0.0006) [2023-03-06 16:27:27,450][04272] Updated weights for policy 0, policy_version 88670 (0.0006) [2023-03-06 16:27:28,249][04272] Updated weights for policy 0, policy_version 88680 (0.0006) [2023-03-06 16:27:28,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12614.3). Total num frames: 90816512. Throughput: 0: 12586.4. Samples: 90796024. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:27:28,952][03942] Avg episode reward: [(0, '1187.748')] [2023-03-06 16:27:29,079][04272] Updated weights for policy 0, policy_version 88690 (0.0007) [2023-03-06 16:27:29,882][04272] Updated weights for policy 0, policy_version 88700 (0.0006) [2023-03-06 16:27:30,706][04272] Updated weights for policy 0, policy_version 88710 (0.0006) [2023-03-06 16:27:31,518][04272] Updated weights for policy 0, policy_version 88720 (0.0007) [2023-03-06 16:27:32,322][04272] Updated weights for policy 0, policy_version 88730 (0.0006) [2023-03-06 16:27:33,138][04272] Updated weights for policy 0, policy_version 88740 (0.0006) [2023-03-06 16:27:33,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12578.1, 300 sec: 12610.8). Total num frames: 90878976. Throughput: 0: 12585.3. Samples: 90871449. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:27:33,952][03942] Avg episode reward: [(0, '1249.231')] [2023-03-06 16:27:33,985][04272] Updated weights for policy 0, policy_version 88750 (0.0007) [2023-03-06 16:27:34,783][04272] Updated weights for policy 0, policy_version 88760 (0.0007) [2023-03-06 16:27:35,585][04272] Updated weights for policy 0, policy_version 88770 (0.0006) [2023-03-06 16:27:36,403][04272] Updated weights for policy 0, policy_version 88780 (0.0006) [2023-03-06 16:27:37,202][04272] Updated weights for policy 0, policy_version 88790 (0.0007) [2023-03-06 16:27:37,988][04272] Updated weights for policy 0, policy_version 88800 (0.0006) [2023-03-06 16:27:38,820][04272] Updated weights for policy 0, policy_version 88810 (0.0006) [2023-03-06 16:27:38,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12610.8). Total num frames: 90942464. Throughput: 0: 12589.4. Samples: 90909227. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:27:38,951][03942] Avg episode reward: [(0, '1177.624')] [2023-03-06 16:27:39,627][04272] Updated weights for policy 0, policy_version 88820 (0.0007) [2023-03-06 16:27:40,426][04272] Updated weights for policy 0, policy_version 88830 (0.0006) [2023-03-06 16:27:41,250][04272] Updated weights for policy 0, policy_version 88840 (0.0007) [2023-03-06 16:27:42,062][04272] Updated weights for policy 0, policy_version 88850 (0.0006) [2023-03-06 16:27:42,872][04272] Updated weights for policy 0, policy_version 88860 (0.0007) [2023-03-06 16:27:43,702][04272] Updated weights for policy 0, policy_version 88870 (0.0006) [2023-03-06 16:27:43,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12595.2, 300 sec: 12614.3). Total num frames: 91005952. Throughput: 0: 12597.2. Samples: 90985127. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:27:43,952][03942] Avg episode reward: [(0, '1236.904')] [2023-03-06 16:27:44,513][04272] Updated weights for policy 0, policy_version 88880 (0.0006) [2023-03-06 16:27:45,318][04272] Updated weights for policy 0, policy_version 88890 (0.0007) [2023-03-06 16:27:46,139][04272] Updated weights for policy 0, policy_version 88900 (0.0006) [2023-03-06 16:27:46,936][04272] Updated weights for policy 0, policy_version 88910 (0.0006) [2023-03-06 16:27:47,745][04272] Updated weights for policy 0, policy_version 88920 (0.0006) [2023-03-06 16:27:48,542][04272] Updated weights for policy 0, policy_version 88930 (0.0007) [2023-03-06 16:27:48,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12610.8). Total num frames: 91068416. Throughput: 0: 12605.0. Samples: 91060907. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:27:48,951][03942] Avg episode reward: [(0, '1254.795')] [2023-03-06 16:27:49,358][04272] Updated weights for policy 0, policy_version 88940 (0.0007) [2023-03-06 16:27:50,158][04272] Updated weights for policy 0, policy_version 88950 (0.0006) [2023-03-06 16:27:50,990][04272] Updated weights for policy 0, policy_version 88960 (0.0006) [2023-03-06 16:27:51,809][04272] Updated weights for policy 0, policy_version 88970 (0.0006) [2023-03-06 16:27:52,619][04272] Updated weights for policy 0, policy_version 88980 (0.0006) [2023-03-06 16:27:53,410][04272] Updated weights for policy 0, policy_version 88990 (0.0006) [2023-03-06 16:27:53,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12610.8). Total num frames: 91131904. Throughput: 0: 12607.0. Samples: 91098663. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:27:53,941][03942] Avg episode reward: [(0, '1163.759')] [2023-03-06 16:27:54,254][04272] Updated weights for policy 0, policy_version 89000 (0.0006) [2023-03-06 16:27:55,066][04272] Updated weights for policy 0, policy_version 89010 (0.0006) [2023-03-06 16:27:55,863][04272] Updated weights for policy 0, policy_version 89020 (0.0006) [2023-03-06 16:27:56,685][04272] Updated weights for policy 0, policy_version 89030 (0.0007) [2023-03-06 16:27:57,481][04272] Updated weights for policy 0, policy_version 89040 (0.0006) [2023-03-06 16:27:58,298][04272] Updated weights for policy 0, policy_version 89050 (0.0006) [2023-03-06 16:27:58,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12610.8). Total num frames: 91194368. Throughput: 0: 12606.9. Samples: 91174178. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:27:58,941][03942] Avg episode reward: [(0, '1148.548')] [2023-03-06 16:27:59,110][04272] Updated weights for policy 0, policy_version 89060 (0.0007) [2023-03-06 16:27:59,925][04272] Updated weights for policy 0, policy_version 89070 (0.0007) [2023-03-06 16:28:00,747][04272] Updated weights for policy 0, policy_version 89080 (0.0006) [2023-03-06 16:28:01,559][04272] Updated weights for policy 0, policy_version 89090 (0.0006) [2023-03-06 16:28:02,374][04272] Updated weights for policy 0, policy_version 89100 (0.0007) [2023-03-06 16:28:03,190][04272] Updated weights for policy 0, policy_version 89110 (0.0006) [2023-03-06 16:28:03,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12610.8). Total num frames: 91257856. Throughput: 0: 12606.3. Samples: 91249865. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:28:03,941][03942] Avg episode reward: [(0, '1205.820')] [2023-03-06 16:28:04,001][04272] Updated weights for policy 0, policy_version 89120 (0.0006) [2023-03-06 16:28:04,809][04272] Updated weights for policy 0, policy_version 89130 (0.0006) [2023-03-06 16:28:05,601][04272] Updated weights for policy 0, policy_version 89140 (0.0007) [2023-03-06 16:28:06,422][04272] Updated weights for policy 0, policy_version 89150 (0.0007) [2023-03-06 16:28:07,241][04272] Updated weights for policy 0, policy_version 89160 (0.0006) [2023-03-06 16:28:08,052][04272] Updated weights for policy 0, policy_version 89170 (0.0007) [2023-03-06 16:28:08,875][04272] Updated weights for policy 0, policy_version 89180 (0.0007) [2023-03-06 16:28:08,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12610.8). Total num frames: 91320320. Throughput: 0: 12610.7. Samples: 91287845. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:28:08,941][03942] Avg episode reward: [(0, '1259.502')] [2023-03-06 16:28:09,704][04272] Updated weights for policy 0, policy_version 89190 (0.0006) [2023-03-06 16:28:10,506][04272] Updated weights for policy 0, policy_version 89200 (0.0007) [2023-03-06 16:28:11,329][04272] Updated weights for policy 0, policy_version 89210 (0.0006) [2023-03-06 16:28:12,158][04272] Updated weights for policy 0, policy_version 89220 (0.0007) [2023-03-06 16:28:12,979][04272] Updated weights for policy 0, policy_version 89230 (0.0006) [2023-03-06 16:28:13,781][04272] Updated weights for policy 0, policy_version 89240 (0.0006) [2023-03-06 16:28:13,940][03942] Fps is (10 sec: 12492.8, 60 sec: 12595.2, 300 sec: 12607.3). Total num frames: 91382784. Throughput: 0: 12594.6. Samples: 91362778. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:28:13,941][03942] Avg episode reward: [(0, '1299.793')] [2023-03-06 16:28:14,594][04272] Updated weights for policy 0, policy_version 89250 (0.0006) [2023-03-06 16:28:15,401][04272] Updated weights for policy 0, policy_version 89260 (0.0007) [2023-03-06 16:28:16,211][04272] Updated weights for policy 0, policy_version 89270 (0.0006) [2023-03-06 16:28:17,017][04272] Updated weights for policy 0, policy_version 89280 (0.0006) [2023-03-06 16:28:17,837][04272] Updated weights for policy 0, policy_version 89290 (0.0006) [2023-03-06 16:28:18,643][04272] Updated weights for policy 0, policy_version 89300 (0.0006) [2023-03-06 16:28:18,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12607.4). Total num frames: 91446272. Throughput: 0: 12603.0. Samples: 91438582. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:28:18,941][03942] Avg episode reward: [(0, '1257.179')] [2023-03-06 16:28:19,457][04272] Updated weights for policy 0, policy_version 89310 (0.0006) [2023-03-06 16:28:20,273][04272] Updated weights for policy 0, policy_version 89320 (0.0006) [2023-03-06 16:28:21,082][04272] Updated weights for policy 0, policy_version 89330 (0.0007) [2023-03-06 16:28:21,906][04272] Updated weights for policy 0, policy_version 89340 (0.0006) [2023-03-06 16:28:22,714][04272] Updated weights for policy 0, policy_version 89350 (0.0006) [2023-03-06 16:28:23,523][04272] Updated weights for policy 0, policy_version 89360 (0.0006) [2023-03-06 16:28:23,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12603.9). Total num frames: 91508736. Throughput: 0: 12603.5. Samples: 91476385. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:28:23,941][03942] Avg episode reward: [(0, '1224.651')] [2023-03-06 16:28:24,358][04272] Updated weights for policy 0, policy_version 89370 (0.0006) [2023-03-06 16:28:25,168][04272] Updated weights for policy 0, policy_version 89380 (0.0007) [2023-03-06 16:28:25,978][04272] Updated weights for policy 0, policy_version 89390 (0.0006) [2023-03-06 16:28:26,789][04272] Updated weights for policy 0, policy_version 89400 (0.0006) [2023-03-06 16:28:27,616][04272] Updated weights for policy 0, policy_version 89410 (0.0008) [2023-03-06 16:28:28,424][04272] Updated weights for policy 0, policy_version 89420 (0.0006) [2023-03-06 16:28:28,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12603.9). Total num frames: 91572224. Throughput: 0: 12590.2. Samples: 91551687. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:28:28,941][03942] Avg episode reward: [(0, '1208.952')] [2023-03-06 16:28:29,228][04272] Updated weights for policy 0, policy_version 89430 (0.0005) [2023-03-06 16:28:30,043][04272] Updated weights for policy 0, policy_version 89440 (0.0007) [2023-03-06 16:28:30,846][04272] Updated weights for policy 0, policy_version 89450 (0.0006) [2023-03-06 16:28:31,661][04272] Updated weights for policy 0, policy_version 89460 (0.0006) [2023-03-06 16:28:32,489][04272] Updated weights for policy 0, policy_version 89470 (0.0006) [2023-03-06 16:28:33,310][04272] Updated weights for policy 0, policy_version 89480 (0.0007) [2023-03-06 16:28:33,940][03942] Fps is (10 sec: 12595.5, 60 sec: 12595.2, 300 sec: 12603.9). Total num frames: 91634688. Throughput: 0: 12585.3. Samples: 91627244. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:28:33,941][03942] Avg episode reward: [(0, '1290.346')] [2023-03-06 16:28:34,120][04272] Updated weights for policy 0, policy_version 89490 (0.0006) [2023-03-06 16:28:34,926][04272] Updated weights for policy 0, policy_version 89500 (0.0006) [2023-03-06 16:28:35,736][04272] Updated weights for policy 0, policy_version 89510 (0.0007) [2023-03-06 16:28:36,548][04272] Updated weights for policy 0, policy_version 89520 (0.0006) [2023-03-06 16:28:37,346][04272] Updated weights for policy 0, policy_version 89530 (0.0006) [2023-03-06 16:28:38,153][04272] Updated weights for policy 0, policy_version 89540 (0.0006) [2023-03-06 16:28:38,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12603.9). Total num frames: 91698176. Throughput: 0: 12590.6. Samples: 91665238. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:28:38,941][03942] Avg episode reward: [(0, '1250.866')] [2023-03-06 16:28:38,965][04272] Updated weights for policy 0, policy_version 89550 (0.0006) [2023-03-06 16:28:39,769][04272] Updated weights for policy 0, policy_version 89560 (0.0006) [2023-03-06 16:28:40,585][04272] Updated weights for policy 0, policy_version 89570 (0.0006) [2023-03-06 16:28:41,395][04272] Updated weights for policy 0, policy_version 89580 (0.0006) [2023-03-06 16:28:42,219][04272] Updated weights for policy 0, policy_version 89590 (0.0007) [2023-03-06 16:28:43,030][04272] Updated weights for policy 0, policy_version 89600 (0.0006) [2023-03-06 16:28:43,824][04272] Updated weights for policy 0, policy_version 89610 (0.0006) [2023-03-06 16:28:43,941][03942] Fps is (10 sec: 12697.4, 60 sec: 12595.2, 300 sec: 12603.9). Total num frames: 91761664. Throughput: 0: 12595.0. Samples: 91740954. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:28:43,941][03942] Avg episode reward: [(0, '1196.580')] [2023-03-06 16:28:44,642][04272] Updated weights for policy 0, policy_version 89620 (0.0007) [2023-03-06 16:28:45,457][04272] Updated weights for policy 0, policy_version 89630 (0.0006) [2023-03-06 16:28:46,276][04272] Updated weights for policy 0, policy_version 89640 (0.0007) [2023-03-06 16:28:47,094][04272] Updated weights for policy 0, policy_version 89650 (0.0006) [2023-03-06 16:28:47,918][04272] Updated weights for policy 0, policy_version 89660 (0.0007) [2023-03-06 16:28:48,702][04272] Updated weights for policy 0, policy_version 89670 (0.0006) [2023-03-06 16:28:48,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12603.9). Total num frames: 91824128. Throughput: 0: 12591.3. Samples: 91816474. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:28:48,941][03942] Avg episode reward: [(0, '1319.591')] [2023-03-06 16:28:49,535][04272] Updated weights for policy 0, policy_version 89680 (0.0007) [2023-03-06 16:28:50,331][04272] Updated weights for policy 0, policy_version 89690 (0.0006) [2023-03-06 16:28:51,146][04272] Updated weights for policy 0, policy_version 89700 (0.0006) [2023-03-06 16:28:51,968][04272] Updated weights for policy 0, policy_version 89710 (0.0007) [2023-03-06 16:28:52,769][04272] Updated weights for policy 0, policy_version 89720 (0.0006) [2023-03-06 16:28:53,573][04272] Updated weights for policy 0, policy_version 89730 (0.0007) [2023-03-06 16:28:53,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12603.9). Total num frames: 91887616. Throughput: 0: 12593.2. Samples: 91854540. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:28:53,941][03942] Avg episode reward: [(0, '1250.592')] [2023-03-06 16:28:54,397][04272] Updated weights for policy 0, policy_version 89740 (0.0006) [2023-03-06 16:28:55,212][04272] Updated weights for policy 0, policy_version 89750 (0.0007) [2023-03-06 16:28:56,025][04272] Updated weights for policy 0, policy_version 89760 (0.0007) [2023-03-06 16:28:56,849][04272] Updated weights for policy 0, policy_version 89770 (0.0007) [2023-03-06 16:28:57,674][04272] Updated weights for policy 0, policy_version 89780 (0.0006) [2023-03-06 16:28:58,493][04272] Updated weights for policy 0, policy_version 89790 (0.0006) [2023-03-06 16:28:58,940][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12603.9). Total num frames: 91950080. Throughput: 0: 12598.5. Samples: 91929712. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:28:58,941][03942] Avg episode reward: [(0, '1293.234')] [2023-03-06 16:28:59,302][04272] Updated weights for policy 0, policy_version 89800 (0.0006) [2023-03-06 16:29:00,090][04272] Updated weights for policy 0, policy_version 89810 (0.0006) [2023-03-06 16:29:00,902][04272] Updated weights for policy 0, policy_version 89820 (0.0006) [2023-03-06 16:29:01,709][04272] Updated weights for policy 0, policy_version 89830 (0.0006) [2023-03-06 16:29:02,529][04272] Updated weights for policy 0, policy_version 89840 (0.0006) [2023-03-06 16:29:03,350][04272] Updated weights for policy 0, policy_version 89850 (0.0006) [2023-03-06 16:29:03,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12603.9). Total num frames: 92013568. Throughput: 0: 12597.5. Samples: 92005470. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:29:03,941][03942] Avg episode reward: [(0, '1165.090')] [2023-03-06 16:29:04,171][04272] Updated weights for policy 0, policy_version 89860 (0.0006) [2023-03-06 16:29:04,982][04272] Updated weights for policy 0, policy_version 89870 (0.0006) [2023-03-06 16:29:05,787][04272] Updated weights for policy 0, policy_version 89880 (0.0006) [2023-03-06 16:29:06,616][04272] Updated weights for policy 0, policy_version 89890 (0.0006) [2023-03-06 16:29:07,429][04272] Updated weights for policy 0, policy_version 89900 (0.0006) [2023-03-06 16:29:08,230][04272] Updated weights for policy 0, policy_version 89910 (0.0006) [2023-03-06 16:29:08,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12600.4). Total num frames: 92076032. Throughput: 0: 12594.7. Samples: 92043147. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:29:08,941][03942] Avg episode reward: [(0, '1273.737')] [2023-03-06 16:29:08,945][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000089918_92076032.pth... 
[2023-03-06 16:29:08,977][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000086965_89052160.pth [2023-03-06 16:29:09,052][04272] Updated weights for policy 0, policy_version 89920 (0.0006) [2023-03-06 16:29:09,865][04272] Updated weights for policy 0, policy_version 89930 (0.0006) [2023-03-06 16:29:10,675][04272] Updated weights for policy 0, policy_version 89940 (0.0006) [2023-03-06 16:29:11,481][04272] Updated weights for policy 0, policy_version 89950 (0.0006) [2023-03-06 16:29:12,297][04272] Updated weights for policy 0, policy_version 89960 (0.0006) [2023-03-06 16:29:13,114][04272] Updated weights for policy 0, policy_version 89970 (0.0007) [2023-03-06 16:29:13,929][04272] Updated weights for policy 0, policy_version 89980 (0.0007) [2023-03-06 16:29:13,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12603.9). Total num frames: 92139520. Throughput: 0: 12599.9. Samples: 92118684. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:29:13,941][03942] Avg episode reward: [(0, '1260.563')] [2023-03-06 16:29:14,741][04272] Updated weights for policy 0, policy_version 89990 (0.0007) [2023-03-06 16:29:15,556][04272] Updated weights for policy 0, policy_version 90000 (0.0007) [2023-03-06 16:29:16,365][04272] Updated weights for policy 0, policy_version 90010 (0.0006) [2023-03-06 16:29:17,178][04272] Updated weights for policy 0, policy_version 90020 (0.0006) [2023-03-06 16:29:18,005][04272] Updated weights for policy 0, policy_version 90030 (0.0006) [2023-03-06 16:29:18,818][04272] Updated weights for policy 0, policy_version 90040 (0.0006) [2023-03-06 16:29:18,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12600.4). Total num frames: 92201984. Throughput: 0: 12596.4. Samples: 92194084. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:29:18,941][03942] Avg episode reward: [(0, '1257.127')] [2023-03-06 16:29:19,641][04272] Updated weights for policy 0, policy_version 90050 (0.0007) [2023-03-06 16:29:20,450][04272] Updated weights for policy 0, policy_version 90060 (0.0006) [2023-03-06 16:29:21,241][04272] Updated weights for policy 0, policy_version 90070 (0.0007) [2023-03-06 16:29:22,057][04272] Updated weights for policy 0, policy_version 90080 (0.0007) [2023-03-06 16:29:22,880][04272] Updated weights for policy 0, policy_version 90090 (0.0006) [2023-03-06 16:29:23,676][04272] Updated weights for policy 0, policy_version 90100 (0.0007) [2023-03-06 16:29:23,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12603.9). Total num frames: 92265472. Throughput: 0: 12599.1. Samples: 92232198. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:29:23,952][03942] Avg episode reward: [(0, '1290.402')] [2023-03-06 16:29:24,485][04272] Updated weights for policy 0, policy_version 90110 (0.0006) [2023-03-06 16:29:25,293][04272] Updated weights for policy 0, policy_version 90120 (0.0006) [2023-03-06 16:29:26,114][04272] Updated weights for policy 0, policy_version 90130 (0.0006) [2023-03-06 16:29:26,929][04272] Updated weights for policy 0, policy_version 90140 (0.0006) [2023-03-06 16:29:27,758][04272] Updated weights for policy 0, policy_version 90150 (0.0006) [2023-03-06 16:29:28,555][04272] Updated weights for policy 0, policy_version 90160 (0.0006) [2023-03-06 16:29:28,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12600.4). Total num frames: 92327936. Throughput: 0: 12595.5. Samples: 92307752. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:29:28,952][03942] Avg episode reward: [(0, '1217.796')] [2023-03-06 16:29:29,363][04272] Updated weights for policy 0, policy_version 90170 (0.0006) [2023-03-06 16:29:30,178][04272] Updated weights for policy 0, policy_version 90180 (0.0007) [2023-03-06 16:29:30,980][04272] Updated weights for policy 0, policy_version 90190 (0.0006) [2023-03-06 16:29:31,787][04272] Updated weights for policy 0, policy_version 90200 (0.0007) [2023-03-06 16:29:32,597][04272] Updated weights for policy 0, policy_version 90210 (0.0006) [2023-03-06 16:29:33,410][04272] Updated weights for policy 0, policy_version 90220 (0.0007) [2023-03-06 16:29:33,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.2, 300 sec: 12603.9). Total num frames: 92391424. Throughput: 0: 12604.8. Samples: 92383693. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:29:33,952][03942] Avg episode reward: [(0, '1277.837')] [2023-03-06 16:29:34,234][04272] Updated weights for policy 0, policy_version 90230 (0.0006) [2023-03-06 16:29:35,049][04272] Updated weights for policy 0, policy_version 90240 (0.0006) [2023-03-06 16:29:35,876][04272] Updated weights for policy 0, policy_version 90250 (0.0007) [2023-03-06 16:29:36,688][04272] Updated weights for policy 0, policy_version 90260 (0.0006) [2023-03-06 16:29:37,506][04272] Updated weights for policy 0, policy_version 90270 (0.0006) [2023-03-06 16:29:38,281][04272] Updated weights for policy 0, policy_version 90280 (0.0006) [2023-03-06 16:29:38,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12612.3, 300 sec: 12603.9). Total num frames: 92454912. Throughput: 0: 12593.6. Samples: 92421251. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:29:38,951][03942] Avg episode reward: [(0, '1200.187')] [2023-03-06 16:29:39,097][04272] Updated weights for policy 0, policy_version 90290 (0.0006) [2023-03-06 16:29:39,892][04272] Updated weights for policy 0, policy_version 90300 (0.0006) [2023-03-06 16:29:40,727][04272] Updated weights for policy 0, policy_version 90310 (0.0006) [2023-03-06 16:29:41,533][04272] Updated weights for policy 0, policy_version 90320 (0.0006) [2023-03-06 16:29:42,358][04272] Updated weights for policy 0, policy_version 90330 (0.0007) [2023-03-06 16:29:43,166][04272] Updated weights for policy 0, policy_version 90340 (0.0007) [2023-03-06 16:29:43,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12600.4). Total num frames: 92517376. Throughput: 0: 12606.6. Samples: 92497010. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:29:43,941][03942] Avg episode reward: [(0, '1310.971')] [2023-03-06 16:29:43,961][04272] Updated weights for policy 0, policy_version 90350 (0.0006) [2023-03-06 16:29:44,793][04272] Updated weights for policy 0, policy_version 90360 (0.0007) [2023-03-06 16:29:45,610][04272] Updated weights for policy 0, policy_version 90370 (0.0008) [2023-03-06 16:29:46,417][04272] Updated weights for policy 0, policy_version 90380 (0.0006) [2023-03-06 16:29:47,236][04272] Updated weights for policy 0, policy_version 90390 (0.0007) [2023-03-06 16:29:48,043][04272] Updated weights for policy 0, policy_version 90400 (0.0006) [2023-03-06 16:29:48,857][04272] Updated weights for policy 0, policy_version 90410 (0.0006) [2023-03-06 16:29:48,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12603.9). Total num frames: 92580864. Throughput: 0: 12602.2. Samples: 92572566. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:29:48,941][03942] Avg episode reward: [(0, '1250.486')] [2023-03-06 16:29:49,666][04272] Updated weights for policy 0, policy_version 90420 (0.0007) [2023-03-06 16:29:50,485][04272] Updated weights for policy 0, policy_version 90430 (0.0006) [2023-03-06 16:29:51,302][04272] Updated weights for policy 0, policy_version 90440 (0.0006) [2023-03-06 16:29:52,118][04272] Updated weights for policy 0, policy_version 90450 (0.0007) [2023-03-06 16:29:52,940][04272] Updated weights for policy 0, policy_version 90460 (0.0006) [2023-03-06 16:29:53,739][04272] Updated weights for policy 0, policy_version 90470 (0.0007) [2023-03-06 16:29:53,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12600.4). Total num frames: 92643328. Throughput: 0: 12601.2. Samples: 92610199. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:29:53,941][03942] Avg episode reward: [(0, '1250.739')] [2023-03-06 16:29:54,561][04272] Updated weights for policy 0, policy_version 90480 (0.0006) [2023-03-06 16:29:55,376][04272] Updated weights for policy 0, policy_version 90490 (0.0007) [2023-03-06 16:29:56,181][04272] Updated weights for policy 0, policy_version 90500 (0.0006) [2023-03-06 16:29:56,982][04272] Updated weights for policy 0, policy_version 90510 (0.0006) [2023-03-06 16:29:57,806][04272] Updated weights for policy 0, policy_version 90520 (0.0006) [2023-03-06 16:29:58,625][04272] Updated weights for policy 0, policy_version 90530 (0.0007) [2023-03-06 16:29:58,940][03942] Fps is (10 sec: 12492.8, 60 sec: 12595.2, 300 sec: 12596.9). Total num frames: 92705792. Throughput: 0: 12602.0. Samples: 92685773. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:29:58,941][03942] Avg episode reward: [(0, '1202.972')] [2023-03-06 16:29:59,444][04272] Updated weights for policy 0, policy_version 90540 (0.0007) [2023-03-06 16:30:00,242][04272] Updated weights for policy 0, policy_version 90550 (0.0007) [2023-03-06 16:30:01,058][04272] Updated weights for policy 0, policy_version 90560 (0.0006) [2023-03-06 16:30:01,851][04272] Updated weights for policy 0, policy_version 90570 (0.0006) [2023-03-06 16:30:02,669][04272] Updated weights for policy 0, policy_version 90580 (0.0007) [2023-03-06 16:30:03,490][04272] Updated weights for policy 0, policy_version 90590 (0.0006) [2023-03-06 16:30:03,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12600.4). Total num frames: 92769280. Throughput: 0: 12610.8. Samples: 92761571. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:30:03,941][03942] Avg episode reward: [(0, '1218.105')] [2023-03-06 16:30:04,305][04272] Updated weights for policy 0, policy_version 90600 (0.0006) [2023-03-06 16:30:05,109][04272] Updated weights for policy 0, policy_version 90610 (0.0006) [2023-03-06 16:30:05,928][04272] Updated weights for policy 0, policy_version 90620 (0.0007) [2023-03-06 16:30:06,716][04272] Updated weights for policy 0, policy_version 90630 (0.0006) [2023-03-06 16:30:07,549][04272] Updated weights for policy 0, policy_version 90640 (0.0006) [2023-03-06 16:30:08,352][04272] Updated weights for policy 0, policy_version 90650 (0.0007) [2023-03-06 16:30:08,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12612.3, 300 sec: 12600.4). Total num frames: 92832768. Throughput: 0: 12603.4. Samples: 92799350. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:30:08,941][03942] Avg episode reward: [(0, '1235.953')] [2023-03-06 16:30:09,166][04272] Updated weights for policy 0, policy_version 90660 (0.0006) [2023-03-06 16:30:09,993][04272] Updated weights for policy 0, policy_version 90670 (0.0006) [2023-03-06 16:30:10,808][04272] Updated weights for policy 0, policy_version 90680 (0.0007) [2023-03-06 16:30:11,613][04272] Updated weights for policy 0, policy_version 90690 (0.0006) [2023-03-06 16:30:12,437][04272] Updated weights for policy 0, policy_version 90700 (0.0006) [2023-03-06 16:30:13,238][04272] Updated weights for policy 0, policy_version 90710 (0.0007) [2023-03-06 16:30:13,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12600.4). Total num frames: 92895232. Throughput: 0: 12604.5. Samples: 92874952. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:30:13,941][03942] Avg episode reward: [(0, '1220.996')] [2023-03-06 16:30:14,049][04272] Updated weights for policy 0, policy_version 90720 (0.0006) [2023-03-06 16:30:14,872][04272] Updated weights for policy 0, policy_version 90730 (0.0006) [2023-03-06 16:30:15,682][04272] Updated weights for policy 0, policy_version 90740 (0.0006) [2023-03-06 16:30:16,505][04272] Updated weights for policy 0, policy_version 90750 (0.0006) [2023-03-06 16:30:17,327][04272] Updated weights for policy 0, policy_version 90760 (0.0008) [2023-03-06 16:30:18,127][04272] Updated weights for policy 0, policy_version 90770 (0.0006) [2023-03-06 16:30:18,933][04272] Updated weights for policy 0, policy_version 90780 (0.0006) [2023-03-06 16:30:18,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12600.4). Total num frames: 92958720. Throughput: 0: 12596.3. Samples: 92950527. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:30:18,941][03942] Avg episode reward: [(0, '1126.945')] [2023-03-06 16:30:19,750][04272] Updated weights for policy 0, policy_version 90790 (0.0006) [2023-03-06 16:30:20,570][04272] Updated weights for policy 0, policy_version 90800 (0.0006) [2023-03-06 16:30:21,374][04272] Updated weights for policy 0, policy_version 90810 (0.0007) [2023-03-06 16:30:22,173][04272] Updated weights for policy 0, policy_version 90820 (0.0006) [2023-03-06 16:30:22,990][04272] Updated weights for policy 0, policy_version 90830 (0.0006) [2023-03-06 16:30:23,802][04272] Updated weights for policy 0, policy_version 90840 (0.0007) [2023-03-06 16:30:23,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12600.4). Total num frames: 93021184. Throughput: 0: 12600.5. Samples: 92988273. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:30:23,941][03942] Avg episode reward: [(0, '1265.383')] [2023-03-06 16:30:24,594][04272] Updated weights for policy 0, policy_version 90850 (0.0006) [2023-03-06 16:30:25,407][04272] Updated weights for policy 0, policy_version 90860 (0.0006) [2023-03-06 16:30:26,223][04272] Updated weights for policy 0, policy_version 90870 (0.0006) [2023-03-06 16:30:27,042][04272] Updated weights for policy 0, policy_version 90880 (0.0006) [2023-03-06 16:30:27,868][04272] Updated weights for policy 0, policy_version 90890 (0.0006) [2023-03-06 16:30:28,684][04272] Updated weights for policy 0, policy_version 90900 (0.0007) [2023-03-06 16:30:28,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12600.4). Total num frames: 93084672. Throughput: 0: 12605.1. Samples: 93064239. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:30:28,941][03942] Avg episode reward: [(0, '1161.813')] [2023-03-06 16:30:29,507][04272] Updated weights for policy 0, policy_version 90910 (0.0007) [2023-03-06 16:30:30,337][04272] Updated weights for policy 0, policy_version 90920 (0.0006) [2023-03-06 16:30:31,155][04272] Updated weights for policy 0, policy_version 90930 (0.0007) [2023-03-06 16:30:31,947][04272] Updated weights for policy 0, policy_version 90940 (0.0007) [2023-03-06 16:30:32,781][04272] Updated weights for policy 0, policy_version 90950 (0.0007) [2023-03-06 16:30:33,583][04272] Updated weights for policy 0, policy_version 90960 (0.0007) [2023-03-06 16:30:33,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12596.9). Total num frames: 93147136. Throughput: 0: 12596.9. Samples: 93139428. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:30:33,941][03942] Avg episode reward: [(0, '1132.676')] [2023-03-06 16:30:34,384][04272] Updated weights for policy 0, policy_version 90970 (0.0007) [2023-03-06 16:30:35,221][04272] Updated weights for policy 0, policy_version 90980 (0.0006) [2023-03-06 16:30:36,028][04272] Updated weights for policy 0, policy_version 90990 (0.0007) [2023-03-06 16:30:36,835][04272] Updated weights for policy 0, policy_version 91000 (0.0008) [2023-03-06 16:30:37,639][04272] Updated weights for policy 0, policy_version 91010 (0.0007) [2023-03-06 16:30:38,446][04272] Updated weights for policy 0, policy_version 91020 (0.0006) [2023-03-06 16:30:38,940][03942] Fps is (10 sec: 12492.8, 60 sec: 12578.1, 300 sec: 12596.9). Total num frames: 93209600. Throughput: 0: 12597.1. Samples: 93177067. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:30:38,941][03942] Avg episode reward: [(0, '1296.207')] [2023-03-06 16:30:39,281][04272] Updated weights for policy 0, policy_version 91030 (0.0007) [2023-03-06 16:30:40,099][04272] Updated weights for policy 0, policy_version 91040 (0.0006) [2023-03-06 16:30:40,896][04272] Updated weights for policy 0, policy_version 91050 (0.0007) [2023-03-06 16:30:41,712][04272] Updated weights for policy 0, policy_version 91060 (0.0006) [2023-03-06 16:30:42,504][04272] Updated weights for policy 0, policy_version 91070 (0.0006) [2023-03-06 16:30:43,318][04272] Updated weights for policy 0, policy_version 91080 (0.0007) [2023-03-06 16:30:43,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12596.9). Total num frames: 93273088. Throughput: 0: 12601.7. Samples: 93252848. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:30:43,941][03942] Avg episode reward: [(0, '1242.982')] [2023-03-06 16:30:44,118][04272] Updated weights for policy 0, policy_version 91090 (0.0007) [2023-03-06 16:30:44,938][04272] Updated weights for policy 0, policy_version 91100 (0.0006) [2023-03-06 16:30:45,745][04272] Updated weights for policy 0, policy_version 91110 (0.0006) [2023-03-06 16:30:46,541][04272] Updated weights for policy 0, policy_version 91120 (0.0007) [2023-03-06 16:30:47,361][04272] Updated weights for policy 0, policy_version 91130 (0.0007) [2023-03-06 16:30:48,174][04272] Updated weights for policy 0, policy_version 91140 (0.0006) [2023-03-06 16:30:48,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12595.2, 300 sec: 12600.4). Total num frames: 93336576. Throughput: 0: 12602.4. Samples: 93328677. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:30:48,941][03942] Avg episode reward: [(0, '1246.640')] [2023-03-06 16:30:49,011][04272] Updated weights for policy 0, policy_version 91150 (0.0007) [2023-03-06 16:30:49,814][04272] Updated weights for policy 0, policy_version 91160 (0.0005) [2023-03-06 16:30:50,634][04272] Updated weights for policy 0, policy_version 91170 (0.0006) [2023-03-06 16:30:51,437][04272] Updated weights for policy 0, policy_version 91180 (0.0006) [2023-03-06 16:30:52,239][04272] Updated weights for policy 0, policy_version 91190 (0.0007) [2023-03-06 16:30:53,063][04272] Updated weights for policy 0, policy_version 91200 (0.0006) [2023-03-06 16:30:53,873][04272] Updated weights for policy 0, policy_version 91210 (0.0006) [2023-03-06 16:30:53,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12596.9). Total num frames: 93399040. Throughput: 0: 12599.9. Samples: 93366347. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:30:53,941][03942] Avg episode reward: [(0, '1284.479')] [2023-03-06 16:30:54,660][04272] Updated weights for policy 0, policy_version 91220 (0.0006) [2023-03-06 16:30:55,500][04272] Updated weights for policy 0, policy_version 91230 (0.0007) [2023-03-06 16:30:56,314][04272] Updated weights for policy 0, policy_version 91240 (0.0007) [2023-03-06 16:30:57,108][04272] Updated weights for policy 0, policy_version 91250 (0.0006) [2023-03-06 16:30:57,934][04272] Updated weights for policy 0, policy_version 91260 (0.0006) [2023-03-06 16:30:58,739][04272] Updated weights for policy 0, policy_version 91270 (0.0006) [2023-03-06 16:30:58,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12600.4). Total num frames: 93462528. Throughput: 0: 12603.5. Samples: 93442108. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:30:58,941][03942] Avg episode reward: [(0, '1208.661')] [2023-03-06 16:30:59,558][04272] Updated weights for policy 0, policy_version 91280 (0.0006) [2023-03-06 16:31:00,367][04272] Updated weights for policy 0, policy_version 91290 (0.0006) [2023-03-06 16:31:01,179][04272] Updated weights for policy 0, policy_version 91300 (0.0006) [2023-03-06 16:31:01,984][04272] Updated weights for policy 0, policy_version 91310 (0.0007) [2023-03-06 16:31:02,810][04272] Updated weights for policy 0, policy_version 91320 (0.0006) [2023-03-06 16:31:03,613][04272] Updated weights for policy 0, policy_version 91330 (0.0006) [2023-03-06 16:31:03,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12596.9). Total num frames: 93524992. Throughput: 0: 12604.2. Samples: 93517715. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:31:03,941][03942] Avg episode reward: [(0, '1217.498')] [2023-03-06 16:31:04,424][04272] Updated weights for policy 0, policy_version 91340 (0.0005) [2023-03-06 16:31:05,233][04272] Updated weights for policy 0, policy_version 91350 (0.0006) [2023-03-06 16:31:06,046][04272] Updated weights for policy 0, policy_version 91360 (0.0006) [2023-03-06 16:31:06,867][04272] Updated weights for policy 0, policy_version 91370 (0.0006) [2023-03-06 16:31:07,665][04272] Updated weights for policy 0, policy_version 91380 (0.0006) [2023-03-06 16:31:08,488][04272] Updated weights for policy 0, policy_version 91390 (0.0007) [2023-03-06 16:31:08,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12600.4). Total num frames: 93588480. Throughput: 0: 12605.7. Samples: 93555528. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:31:08,941][03942] Avg episode reward: [(0, '1266.639')] [2023-03-06 16:31:08,944][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000091395_93588480.pth... 
[2023-03-06 16:31:08,975][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000088442_90564608.pth [2023-03-06 16:31:09,304][04272] Updated weights for policy 0, policy_version 91400 (0.0007) [2023-03-06 16:31:10,117][04272] Updated weights for policy 0, policy_version 91410 (0.0006) [2023-03-06 16:31:10,953][04272] Updated weights for policy 0, policy_version 91420 (0.0007) [2023-03-06 16:31:11,756][04272] Updated weights for policy 0, policy_version 91430 (0.0007) [2023-03-06 16:31:12,578][04272] Updated weights for policy 0, policy_version 91440 (0.0006) [2023-03-06 16:31:13,398][04272] Updated weights for policy 0, policy_version 91450 (0.0007) [2023-03-06 16:31:13,941][03942] Fps is (10 sec: 12595.0, 60 sec: 12595.2, 300 sec: 12600.4). Total num frames: 93650944. Throughput: 0: 12590.5. Samples: 93630812. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:31:13,941][03942] Avg episode reward: [(0, '1264.855')] [2023-03-06 16:31:14,211][04272] Updated weights for policy 0, policy_version 91460 (0.0006) [2023-03-06 16:31:15,009][04272] Updated weights for policy 0, policy_version 91470 (0.0006) [2023-03-06 16:31:15,840][04272] Updated weights for policy 0, policy_version 91480 (0.0007) [2023-03-06 16:31:16,638][04272] Updated weights for policy 0, policy_version 91490 (0.0007) [2023-03-06 16:31:17,454][04272] Updated weights for policy 0, policy_version 91500 (0.0006) [2023-03-06 16:31:18,265][04272] Updated weights for policy 0, policy_version 91510 (0.0006) [2023-03-06 16:31:18,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12600.4). Total num frames: 93714432. Throughput: 0: 12598.7. Samples: 93706369. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:31:18,941][03942] Avg episode reward: [(0, '1170.918')] [2023-03-06 16:31:19,085][04272] Updated weights for policy 0, policy_version 91520 (0.0007) [2023-03-06 16:31:19,885][04272] Updated weights for policy 0, policy_version 91530 (0.0006) [2023-03-06 16:31:20,693][04272] Updated weights for policy 0, policy_version 91540 (0.0006) [2023-03-06 16:31:21,500][04272] Updated weights for policy 0, policy_version 91550 (0.0006) [2023-03-06 16:31:22,309][04272] Updated weights for policy 0, policy_version 91560 (0.0006) [2023-03-06 16:31:23,112][04272] Updated weights for policy 0, policy_version 91570 (0.0006) [2023-03-06 16:31:23,925][04272] Updated weights for policy 0, policy_version 91580 (0.0006) [2023-03-06 16:31:23,941][03942] Fps is (10 sec: 12697.7, 60 sec: 12612.3, 300 sec: 12600.4). Total num frames: 93777920. Throughput: 0: 12605.3. Samples: 93744304. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:31:23,941][03942] Avg episode reward: [(0, '1165.254')] [2023-03-06 16:31:24,721][04272] Updated weights for policy 0, policy_version 91590 (0.0006) [2023-03-06 16:31:25,536][04272] Updated weights for policy 0, policy_version 91600 (0.0006) [2023-03-06 16:31:26,364][04272] Updated weights for policy 0, policy_version 91610 (0.0007) [2023-03-06 16:31:27,159][04272] Updated weights for policy 0, policy_version 91620 (0.0006) [2023-03-06 16:31:27,974][04272] Updated weights for policy 0, policy_version 91630 (0.0006) [2023-03-06 16:31:28,781][04272] Updated weights for policy 0, policy_version 91640 (0.0007) [2023-03-06 16:31:28,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12596.9). Total num frames: 93840384. Throughput: 0: 12609.9. Samples: 93820294. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:31:28,941][03942] Avg episode reward: [(0, '1224.753')] [2023-03-06 16:31:29,621][04272] Updated weights for policy 0, policy_version 91650 (0.0006) [2023-03-06 16:31:30,409][04272] Updated weights for policy 0, policy_version 91660 (0.0007) [2023-03-06 16:31:31,218][04272] Updated weights for policy 0, policy_version 91670 (0.0007) [2023-03-06 16:31:32,053][04272] Updated weights for policy 0, policy_version 91680 (0.0007) [2023-03-06 16:31:32,871][04272] Updated weights for policy 0, policy_version 91690 (0.0006) [2023-03-06 16:31:33,691][04272] Updated weights for policy 0, policy_version 91700 (0.0006) [2023-03-06 16:31:33,941][03942] Fps is (10 sec: 12492.8, 60 sec: 12595.2, 300 sec: 12596.9). Total num frames: 93902848. Throughput: 0: 12597.0. Samples: 93895541. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:31:33,941][03942] Avg episode reward: [(0, '1217.398')] [2023-03-06 16:31:34,510][04272] Updated weights for policy 0, policy_version 91710 (0.0006) [2023-03-06 16:31:35,310][04272] Updated weights for policy 0, policy_version 91720 (0.0006) [2023-03-06 16:31:36,129][04272] Updated weights for policy 0, policy_version 91730 (0.0006) [2023-03-06 16:31:36,939][04272] Updated weights for policy 0, policy_version 91740 (0.0006) [2023-03-06 16:31:37,763][04272] Updated weights for policy 0, policy_version 91750 (0.0006) [2023-03-06 16:31:38,569][04272] Updated weights for policy 0, policy_version 91760 (0.0006) [2023-03-06 16:31:38,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12596.9). Total num frames: 93966336. Throughput: 0: 12597.7. Samples: 93933241. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:31:38,941][03942] Avg episode reward: [(0, '1208.848')] [2023-03-06 16:31:39,369][04272] Updated weights for policy 0, policy_version 91770 (0.0006) [2023-03-06 16:31:40,194][04272] Updated weights for policy 0, policy_version 91780 (0.0006) [2023-03-06 16:31:41,007][04272] Updated weights for policy 0, policy_version 91790 (0.0006) [2023-03-06 16:31:41,806][04272] Updated weights for policy 0, policy_version 91800 (0.0006) [2023-03-06 16:31:42,643][04272] Updated weights for policy 0, policy_version 91810 (0.0006) [2023-03-06 16:31:43,434][04272] Updated weights for policy 0, policy_version 91820 (0.0006) [2023-03-06 16:31:43,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12612.3, 300 sec: 12600.4). Total num frames: 94029824. Throughput: 0: 12596.9. Samples: 94008966. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:31:43,941][03942] Avg episode reward: [(0, '1202.734')] [2023-03-06 16:31:44,261][04272] Updated weights for policy 0, policy_version 91830 (0.0006) [2023-03-06 16:31:45,070][04272] Updated weights for policy 0, policy_version 91840 (0.0007) [2023-03-06 16:31:45,877][04272] Updated weights for policy 0, policy_version 91850 (0.0006) [2023-03-06 16:31:46,705][04272] Updated weights for policy 0, policy_version 91860 (0.0006) [2023-03-06 16:31:47,510][04272] Updated weights for policy 0, policy_version 91870 (0.0006) [2023-03-06 16:31:48,323][04272] Updated weights for policy 0, policy_version 91880 (0.0006) [2023-03-06 16:31:48,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12596.9). Total num frames: 94092288. Throughput: 0: 12597.3. Samples: 94084593. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:31:48,941][03942] Avg episode reward: [(0, '1153.834')] [2023-03-06 16:31:49,147][04272] Updated weights for policy 0, policy_version 91890 (0.0006) [2023-03-06 16:31:49,965][04272] Updated weights for policy 0, policy_version 91900 (0.0006) [2023-03-06 16:31:50,775][04272] Updated weights for policy 0, policy_version 91910 (0.0007) [2023-03-06 16:31:51,595][04272] Updated weights for policy 0, policy_version 91920 (0.0007) [2023-03-06 16:31:52,408][04272] Updated weights for policy 0, policy_version 91930 (0.0006) [2023-03-06 16:31:53,223][04272] Updated weights for policy 0, policy_version 91940 (0.0007) [2023-03-06 16:31:53,941][03942] Fps is (10 sec: 12492.7, 60 sec: 12595.2, 300 sec: 12596.9). Total num frames: 94154752. Throughput: 0: 12591.4. Samples: 94122141. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:31:53,941][03942] Avg episode reward: [(0, '1302.710')] [2023-03-06 16:31:54,028][04272] Updated weights for policy 0, policy_version 91950 (0.0006) [2023-03-06 16:31:54,839][04272] Updated weights for policy 0, policy_version 91960 (0.0006) [2023-03-06 16:31:55,658][04272] Updated weights for policy 0, policy_version 91970 (0.0007) [2023-03-06 16:31:56,477][04272] Updated weights for policy 0, policy_version 91980 (0.0006) [2023-03-06 16:31:57,286][04272] Updated weights for policy 0, policy_version 91990 (0.0006) [2023-03-06 16:31:58,090][04272] Updated weights for policy 0, policy_version 92000 (0.0005) [2023-03-06 16:31:58,906][04272] Updated weights for policy 0, policy_version 92010 (0.0006) [2023-03-06 16:31:58,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12600.4). Total num frames: 94218240. Throughput: 0: 12596.3. Samples: 94197646. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:31:58,941][03942] Avg episode reward: [(0, '1229.719')] [2023-03-06 16:31:59,723][04272] Updated weights for policy 0, policy_version 92020 (0.0006) [2023-03-06 16:32:00,537][04272] Updated weights for policy 0, policy_version 92030 (0.0006) [2023-03-06 16:32:01,356][04272] Updated weights for policy 0, policy_version 92040 (0.0008) [2023-03-06 16:32:02,176][04272] Updated weights for policy 0, policy_version 92050 (0.0006) [2023-03-06 16:32:02,998][04272] Updated weights for policy 0, policy_version 92060 (0.0006) [2023-03-06 16:32:03,803][04272] Updated weights for policy 0, policy_version 92070 (0.0007) [2023-03-06 16:32:03,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12596.9). Total num frames: 94280704. Throughput: 0: 12595.6. Samples: 94273171. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:32:03,941][03942] Avg episode reward: [(0, '1208.935')] [2023-03-06 16:32:04,606][04272] Updated weights for policy 0, policy_version 92080 (0.0006) [2023-03-06 16:32:05,421][04272] Updated weights for policy 0, policy_version 92090 (0.0006) [2023-03-06 16:32:06,236][04272] Updated weights for policy 0, policy_version 92100 (0.0007) [2023-03-06 16:32:07,050][04272] Updated weights for policy 0, policy_version 92110 (0.0007) [2023-03-06 16:32:07,851][04272] Updated weights for policy 0, policy_version 92120 (0.0007) [2023-03-06 16:32:08,666][04272] Updated weights for policy 0, policy_version 92130 (0.0006) [2023-03-06 16:32:08,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12600.4). Total num frames: 94344192. Throughput: 0: 12591.8. Samples: 94310936. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:32:08,941][03942] Avg episode reward: [(0, '1086.709')] [2023-03-06 16:32:09,465][04272] Updated weights for policy 0, policy_version 92140 (0.0006) [2023-03-06 16:32:10,293][04272] Updated weights for policy 0, policy_version 92150 (0.0006) [2023-03-06 16:32:11,102][04272] Updated weights for policy 0, policy_version 92160 (0.0006) [2023-03-06 16:32:11,930][04272] Updated weights for policy 0, policy_version 92170 (0.0005) [2023-03-06 16:32:12,745][04272] Updated weights for policy 0, policy_version 92180 (0.0006) [2023-03-06 16:32:13,566][04272] Updated weights for policy 0, policy_version 92190 (0.0006) [2023-03-06 16:32:13,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12596.9). Total num frames: 94406656. Throughput: 0: 12582.6. Samples: 94386512. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:32:13,941][03942] Avg episode reward: [(0, '1226.791')] [2023-03-06 16:32:14,369][04272] Updated weights for policy 0, policy_version 92200 (0.0006) [2023-03-06 16:32:15,181][04272] Updated weights for policy 0, policy_version 92210 (0.0006) [2023-03-06 16:32:16,002][04272] Updated weights for policy 0, policy_version 92220 (0.0006) [2023-03-06 16:32:16,821][04272] Updated weights for policy 0, policy_version 92230 (0.0006) [2023-03-06 16:32:17,621][04272] Updated weights for policy 0, policy_version 92240 (0.0006) [2023-03-06 16:32:18,422][04272] Updated weights for policy 0, policy_version 92250 (0.0007) [2023-03-06 16:32:18,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12600.4). Total num frames: 94470144. Throughput: 0: 12590.0. Samples: 94462092. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:32:18,941][03942] Avg episode reward: [(0, '1244.689')] [2023-03-06 16:32:19,249][04272] Updated weights for policy 0, policy_version 92260 (0.0008) [2023-03-06 16:32:20,055][04272] Updated weights for policy 0, policy_version 92270 (0.0007) [2023-03-06 16:32:20,886][04272] Updated weights for policy 0, policy_version 92280 (0.0006) [2023-03-06 16:32:21,694][04272] Updated weights for policy 0, policy_version 92290 (0.0006) [2023-03-06 16:32:22,510][04272] Updated weights for policy 0, policy_version 92300 (0.0006) [2023-03-06 16:32:23,318][04272] Updated weights for policy 0, policy_version 92310 (0.0006) [2023-03-06 16:32:23,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12578.1, 300 sec: 12596.9). Total num frames: 94532608. Throughput: 0: 12590.3. Samples: 94499804. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:32:23,941][03942] Avg episode reward: [(0, '1286.251')] [2023-03-06 16:32:24,134][04272] Updated weights for policy 0, policy_version 92320 (0.0008) [2023-03-06 16:32:24,945][04272] Updated weights for policy 0, policy_version 92330 (0.0007) [2023-03-06 16:32:25,761][04272] Updated weights for policy 0, policy_version 92340 (0.0006) [2023-03-06 16:32:26,573][04272] Updated weights for policy 0, policy_version 92350 (0.0006) [2023-03-06 16:32:27,365][04272] Updated weights for policy 0, policy_version 92360 (0.0006) [2023-03-06 16:32:28,175][04272] Updated weights for policy 0, policy_version 92370 (0.0007) [2023-03-06 16:32:28,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12600.4). Total num frames: 94596096. Throughput: 0: 12589.5. Samples: 94575492. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:32:28,941][03942] Avg episode reward: [(0, '981.499')] [2023-03-06 16:32:28,989][04272] Updated weights for policy 0, policy_version 92380 (0.0006) [2023-03-06 16:32:29,795][04272] Updated weights for policy 0, policy_version 92390 (0.0006) [2023-03-06 16:32:30,611][04272] Updated weights for policy 0, policy_version 92400 (0.0007) [2023-03-06 16:32:31,430][04272] Updated weights for policy 0, policy_version 92410 (0.0006) [2023-03-06 16:32:32,247][04272] Updated weights for policy 0, policy_version 92420 (0.0007) [2023-03-06 16:32:33,066][04272] Updated weights for policy 0, policy_version 92430 (0.0006) [2023-03-06 16:32:33,890][04272] Updated weights for policy 0, policy_version 92440 (0.0006) [2023-03-06 16:32:33,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12596.9). Total num frames: 94658560. Throughput: 0: 12587.6. Samples: 94651035. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:32:33,941][03942] Avg episode reward: [(0, '1053.719')] [2023-03-06 16:32:34,716][04272] Updated weights for policy 0, policy_version 92450 (0.0006) [2023-03-06 16:32:35,524][04272] Updated weights for policy 0, policy_version 92460 (0.0007) [2023-03-06 16:32:36,331][04272] Updated weights for policy 0, policy_version 92470 (0.0007) [2023-03-06 16:32:37,151][04272] Updated weights for policy 0, policy_version 92480 (0.0006) [2023-03-06 16:32:37,978][04272] Updated weights for policy 0, policy_version 92490 (0.0007) [2023-03-06 16:32:38,772][04272] Updated weights for policy 0, policy_version 92500 (0.0006) [2023-03-06 16:32:38,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12596.9). Total num frames: 94722048. Throughput: 0: 12586.4. Samples: 94688529. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:32:38,941][03942] Avg episode reward: [(0, '1112.970')] [2023-03-06 16:32:39,588][04272] Updated weights for policy 0, policy_version 92510 (0.0006) [2023-03-06 16:32:40,403][04272] Updated weights for policy 0, policy_version 92520 (0.0006) [2023-03-06 16:32:41,196][04272] Updated weights for policy 0, policy_version 92530 (0.0007) [2023-03-06 16:32:42,026][04272] Updated weights for policy 0, policy_version 92540 (0.0007) [2023-03-06 16:32:42,831][04272] Updated weights for policy 0, policy_version 92550 (0.0006) [2023-03-06 16:32:43,637][04272] Updated weights for policy 0, policy_version 92560 (0.0006) [2023-03-06 16:32:43,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12578.1, 300 sec: 12596.9). Total num frames: 94784512. Throughput: 0: 12591.2. Samples: 94764249. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:32:43,941][03942] Avg episode reward: [(0, '1233.703')] [2023-03-06 16:32:44,454][04272] Updated weights for policy 0, policy_version 92570 (0.0007) [2023-03-06 16:32:45,258][04272] Updated weights for policy 0, policy_version 92580 (0.0007) [2023-03-06 16:32:46,069][04272] Updated weights for policy 0, policy_version 92590 (0.0006) [2023-03-06 16:32:46,890][04272] Updated weights for policy 0, policy_version 92600 (0.0006) [2023-03-06 16:32:47,700][04272] Updated weights for policy 0, policy_version 92610 (0.0006) [2023-03-06 16:32:48,528][04272] Updated weights for policy 0, policy_version 92620 (0.0007) [2023-03-06 16:32:48,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12596.9). Total num frames: 94848000. Throughput: 0: 12596.0. Samples: 94839991. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:32:48,941][03942] Avg episode reward: [(0, '1210.442')] [2023-03-06 16:32:49,330][04272] Updated weights for policy 0, policy_version 92630 (0.0007) [2023-03-06 16:32:50,124][04272] Updated weights for policy 0, policy_version 92640 (0.0006) [2023-03-06 16:32:50,952][04272] Updated weights for policy 0, policy_version 92650 (0.0006) [2023-03-06 16:32:51,755][04272] Updated weights for policy 0, policy_version 92660 (0.0007) [2023-03-06 16:32:52,552][04272] Updated weights for policy 0, policy_version 92670 (0.0006) [2023-03-06 16:32:53,380][04272] Updated weights for policy 0, policy_version 92680 (0.0007) [2023-03-06 16:32:53,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12612.3, 300 sec: 12600.4). Total num frames: 94911488. Throughput: 0: 12599.0. Samples: 94877891. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:32:53,941][03942] Avg episode reward: [(0, '1264.443')] [2023-03-06 16:32:54,196][04272] Updated weights for policy 0, policy_version 92690 (0.0007) [2023-03-06 16:32:54,994][04272] Updated weights for policy 0, policy_version 92700 (0.0007) [2023-03-06 16:32:55,809][04272] Updated weights for policy 0, policy_version 92710 (0.0007) [2023-03-06 16:32:56,626][04272] Updated weights for policy 0, policy_version 92720 (0.0006) [2023-03-06 16:32:57,431][04272] Updated weights for policy 0, policy_version 92730 (0.0006) [2023-03-06 16:32:58,249][04272] Updated weights for policy 0, policy_version 92740 (0.0006) [2023-03-06 16:32:58,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12596.9). Total num frames: 94973952. Throughput: 0: 12601.1. Samples: 94953564. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:32:58,941][03942] Avg episode reward: [(0, '1283.848')] [2023-03-06 16:32:59,053][04272] Updated weights for policy 0, policy_version 92750 (0.0007) [2023-03-06 16:32:59,867][04272] Updated weights for policy 0, policy_version 92760 (0.0007) [2023-03-06 16:33:00,689][04272] Updated weights for policy 0, policy_version 92770 (0.0006) [2023-03-06 16:33:01,498][04272] Updated weights for policy 0, policy_version 92780 (0.0007) [2023-03-06 16:33:02,298][04272] Updated weights for policy 0, policy_version 92790 (0.0006) [2023-03-06 16:33:03,117][04272] Updated weights for policy 0, policy_version 92800 (0.0006) [2023-03-06 16:33:03,935][04272] Updated weights for policy 0, policy_version 92810 (0.0006) [2023-03-06 16:33:03,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12600.4). Total num frames: 95037440. Throughput: 0: 12605.1. Samples: 95029321. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:33:03,941][03942] Avg episode reward: [(0, '1269.113')] [2023-03-06 16:33:04,745][04272] Updated weights for policy 0, policy_version 92820 (0.0006) [2023-03-06 16:33:05,573][04272] Updated weights for policy 0, policy_version 92830 (0.0006) [2023-03-06 16:33:06,398][04272] Updated weights for policy 0, policy_version 92840 (0.0006) [2023-03-06 16:33:07,195][04272] Updated weights for policy 0, policy_version 92850 (0.0006) [2023-03-06 16:33:08,017][04272] Updated weights for policy 0, policy_version 92860 (0.0006) [2023-03-06 16:33:08,813][04272] Updated weights for policy 0, policy_version 92870 (0.0006) [2023-03-06 16:33:08,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12600.4). Total num frames: 95099904. Throughput: 0: 12599.1. Samples: 95066765. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:33:08,941][03942] Avg episode reward: [(0, '1247.495')] [2023-03-06 16:33:08,945][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000092871_95099904.pth... [2023-03-06 16:33:08,978][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000089918_92076032.pth [2023-03-06 16:33:09,634][04272] Updated weights for policy 0, policy_version 92880 (0.0007) [2023-03-06 16:33:10,446][04272] Updated weights for policy 0, policy_version 92890 (0.0006) [2023-03-06 16:33:11,260][04272] Updated weights for policy 0, policy_version 92900 (0.0006) [2023-03-06 16:33:12,095][04272] Updated weights for policy 0, policy_version 92910 (0.0006) [2023-03-06 16:33:12,893][04272] Updated weights for policy 0, policy_version 92920 (0.0006) [2023-03-06 16:33:13,712][04272] Updated weights for policy 0, policy_version 92930 (0.0006) [2023-03-06 16:33:13,941][03942] Fps is (10 sec: 12492.7, 60 sec: 12595.2, 300 sec: 12596.9). Total num frames: 95162368. Throughput: 0: 12594.1. Samples: 95142227. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:33:13,941][03942] Avg episode reward: [(0, '1209.714')] [2023-03-06 16:33:14,536][04272] Updated weights for policy 0, policy_version 92940 (0.0007) [2023-03-06 16:33:15,345][04272] Updated weights for policy 0, policy_version 92950 (0.0006) [2023-03-06 16:33:16,160][04272] Updated weights for policy 0, policy_version 92960 (0.0006) [2023-03-06 16:33:16,973][04272] Updated weights for policy 0, policy_version 92970 (0.0006) [2023-03-06 16:33:17,771][04272] Updated weights for policy 0, policy_version 92980 (0.0006) [2023-03-06 16:33:18,608][04272] Updated weights for policy 0, policy_version 92990 (0.0007) [2023-03-06 16:33:18,940][03942] Fps is (10 sec: 12595.4, 60 sec: 12595.2, 300 sec: 12600.4). Total num frames: 95225856. Throughput: 0: 12597.1. Samples: 95217903. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:33:18,941][03942] Avg episode reward: [(0, '1159.770')] [2023-03-06 16:33:19,417][04272] Updated weights for policy 0, policy_version 93000 (0.0006) [2023-03-06 16:33:20,219][04272] Updated weights for policy 0, policy_version 93010 (0.0006) [2023-03-06 16:33:21,047][04272] Updated weights for policy 0, policy_version 93020 (0.0007) [2023-03-06 16:33:21,867][04272] Updated weights for policy 0, policy_version 93030 (0.0006) [2023-03-06 16:33:22,678][04272] Updated weights for policy 0, policy_version 93040 (0.0006) [2023-03-06 16:33:23,481][04272] Updated weights for policy 0, policy_version 93050 (0.0007) [2023-03-06 16:33:23,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12596.9). Total num frames: 95288320. Throughput: 0: 12597.6. Samples: 95255420. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:33:23,941][03942] Avg episode reward: [(0, '1016.082')] [2023-03-06 16:33:24,303][04272] Updated weights for policy 0, policy_version 93060 (0.0007) [2023-03-06 16:33:25,097][04272] Updated weights for policy 0, policy_version 93070 (0.0006) [2023-03-06 16:33:25,914][04272] Updated weights for policy 0, policy_version 93080 (0.0007) [2023-03-06 16:33:26,730][04272] Updated weights for policy 0, policy_version 93090 (0.0006) [2023-03-06 16:33:27,520][04272] Updated weights for policy 0, policy_version 93100 (0.0006) [2023-03-06 16:33:28,341][04272] Updated weights for policy 0, policy_version 93110 (0.0006) [2023-03-06 16:33:28,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12600.4). Total num frames: 95351808. Throughput: 0: 12601.2. Samples: 95331304. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:33:28,941][03942] Avg episode reward: [(0, '1238.509')] [2023-03-06 16:33:29,155][04272] Updated weights for policy 0, policy_version 93120 (0.0006) [2023-03-06 16:33:29,961][04272] Updated weights for policy 0, policy_version 93130 (0.0007) [2023-03-06 16:33:30,758][04272] Updated weights for policy 0, policy_version 93140 (0.0006) [2023-03-06 16:33:31,591][04272] Updated weights for policy 0, policy_version 93150 (0.0006) [2023-03-06 16:33:32,395][04272] Updated weights for policy 0, policy_version 93160 (0.0006) [2023-03-06 16:33:33,215][04272] Updated weights for policy 0, policy_version 93170 (0.0006) [2023-03-06 16:33:33,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12612.3, 300 sec: 12600.4). Total num frames: 95415296. Throughput: 0: 12600.7. Samples: 95407023. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:33:33,941][03942] Avg episode reward: [(0, '1174.333')] [2023-03-06 16:33:33,995][04272] Updated weights for policy 0, policy_version 93180 (0.0006) [2023-03-06 16:33:34,807][04272] Updated weights for policy 0, policy_version 93190 (0.0007) [2023-03-06 16:33:35,623][04272] Updated weights for policy 0, policy_version 93200 (0.0006) [2023-03-06 16:33:36,429][04272] Updated weights for policy 0, policy_version 93210 (0.0006) [2023-03-06 16:33:37,262][04272] Updated weights for policy 0, policy_version 93220 (0.0006) [2023-03-06 16:33:38,078][04272] Updated weights for policy 0, policy_version 93230 (0.0007) [2023-03-06 16:33:38,879][04272] Updated weights for policy 0, policy_version 93240 (0.0007) [2023-03-06 16:33:38,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12596.9). Total num frames: 95477760. Throughput: 0: 12603.0. Samples: 95445027. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:33:38,941][03942] Avg episode reward: [(0, '1283.632')] [2023-03-06 16:33:39,678][04272] Updated weights for policy 0, policy_version 93250 (0.0006) [2023-03-06 16:33:40,521][04272] Updated weights for policy 0, policy_version 93260 (0.0007) [2023-03-06 16:33:41,326][04272] Updated weights for policy 0, policy_version 93270 (0.0006) [2023-03-06 16:33:42,145][04272] Updated weights for policy 0, policy_version 93280 (0.0006) [2023-03-06 16:33:42,954][04272] Updated weights for policy 0, policy_version 93290 (0.0006) [2023-03-06 16:33:43,749][04272] Updated weights for policy 0, policy_version 93300 (0.0007) [2023-03-06 16:33:43,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12600.4). Total num frames: 95541248. Throughput: 0: 12596.7. Samples: 95520417. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:33:43,941][03942] Avg episode reward: [(0, '1199.227')] [2023-03-06 16:33:44,576][04272] Updated weights for policy 0, policy_version 93310 (0.0007) [2023-03-06 16:33:45,385][04272] Updated weights for policy 0, policy_version 93320 (0.0007) [2023-03-06 16:33:46,209][04272] Updated weights for policy 0, policy_version 93330 (0.0006) [2023-03-06 16:33:47,008][04272] Updated weights for policy 0, policy_version 93340 (0.0006) [2023-03-06 16:33:47,828][04272] Updated weights for policy 0, policy_version 93350 (0.0006) [2023-03-06 16:33:48,649][04272] Updated weights for policy 0, policy_version 93360 (0.0006) [2023-03-06 16:33:48,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12596.9). Total num frames: 95603712. Throughput: 0: 12594.4. Samples: 95596070. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:33:48,941][03942] Avg episode reward: [(0, '1145.128')] [2023-03-06 16:33:49,461][04272] Updated weights for policy 0, policy_version 93370 (0.0006) [2023-03-06 16:33:50,291][04272] Updated weights for policy 0, policy_version 93380 (0.0006) [2023-03-06 16:33:51,102][04272] Updated weights for policy 0, policy_version 93390 (0.0007) [2023-03-06 16:33:51,895][04272] Updated weights for policy 0, policy_version 93400 (0.0006) [2023-03-06 16:33:52,719][04272] Updated weights for policy 0, policy_version 93410 (0.0007) [2023-03-06 16:33:53,504][04272] Updated weights for policy 0, policy_version 93420 (0.0006) [2023-03-06 16:33:53,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12600.4). Total num frames: 95667200. Throughput: 0: 12600.5. Samples: 95633787. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:33:53,941][03942] Avg episode reward: [(0, '1232.496')] [2023-03-06 16:33:54,307][04272] Updated weights for policy 0, policy_version 93430 (0.0006) [2023-03-06 16:33:55,117][04272] Updated weights for policy 0, policy_version 93440 (0.0007) [2023-03-06 16:33:55,924][04272] Updated weights for policy 0, policy_version 93450 (0.0007) [2023-03-06 16:33:56,740][04272] Updated weights for policy 0, policy_version 93460 (0.0006) [2023-03-06 16:33:57,563][04272] Updated weights for policy 0, policy_version 93470 (0.0007) [2023-03-06 16:33:58,371][04272] Updated weights for policy 0, policy_version 93480 (0.0006) [2023-03-06 16:33:58,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12596.9). Total num frames: 95729664. Throughput: 0: 12609.9. Samples: 95709670. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:33:58,941][03942] Avg episode reward: [(0, '1183.660')] [2023-03-06 16:33:59,195][04272] Updated weights for policy 0, policy_version 93490 (0.0007) [2023-03-06 16:33:59,994][04272] Updated weights for policy 0, policy_version 93500 (0.0007) [2023-03-06 16:34:00,825][04272] Updated weights for policy 0, policy_version 93510 (0.0007) [2023-03-06 16:34:01,634][04272] Updated weights for policy 0, policy_version 93520 (0.0007) [2023-03-06 16:34:02,453][04272] Updated weights for policy 0, policy_version 93530 (0.0007) [2023-03-06 16:34:03,273][04272] Updated weights for policy 0, policy_version 93540 (0.0006) [2023-03-06 16:34:03,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12600.4). Total num frames: 95793152. Throughput: 0: 12602.1. Samples: 95784998. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:34:03,941][03942] Avg episode reward: [(0, '1215.061')] [2023-03-06 16:34:04,096][04272] Updated weights for policy 0, policy_version 93550 (0.0006) [2023-03-06 16:34:04,903][04272] Updated weights for policy 0, policy_version 93560 (0.0006) [2023-03-06 16:34:05,709][04272] Updated weights for policy 0, policy_version 93570 (0.0007) [2023-03-06 16:34:06,523][04272] Updated weights for policy 0, policy_version 93580 (0.0006) [2023-03-06 16:34:07,337][04272] Updated weights for policy 0, policy_version 93590 (0.0006) [2023-03-06 16:34:08,135][04272] Updated weights for policy 0, policy_version 93600 (0.0006) [2023-03-06 16:34:08,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12596.9). Total num frames: 95855616. Throughput: 0: 12610.2. Samples: 95822879. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:34:08,941][03942] Avg episode reward: [(0, '1268.103')] [2023-03-06 16:34:08,945][04272] Updated weights for policy 0, policy_version 93610 (0.0006) [2023-03-06 16:34:09,757][04272] Updated weights for policy 0, policy_version 93620 (0.0007) [2023-03-06 16:34:10,553][04272] Updated weights for policy 0, policy_version 93630 (0.0006) [2023-03-06 16:34:11,369][04272] Updated weights for policy 0, policy_version 93640 (0.0006) [2023-03-06 16:34:12,193][04272] Updated weights for policy 0, policy_version 93650 (0.0007) [2023-03-06 16:34:12,984][04272] Updated weights for policy 0, policy_version 93660 (0.0006) [2023-03-06 16:34:13,813][04272] Updated weights for policy 0, policy_version 93670 (0.0006) [2023-03-06 16:34:13,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12600.4). Total num frames: 95919104. Throughput: 0: 12610.1. Samples: 95898757. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:34:13,941][03942] Avg episode reward: [(0, '1209.108')] [2023-03-06 16:34:14,631][04272] Updated weights for policy 0, policy_version 93680 (0.0006) [2023-03-06 16:34:15,426][04272] Updated weights for policy 0, policy_version 93690 (0.0006) [2023-03-06 16:34:16,250][04272] Updated weights for policy 0, policy_version 93700 (0.0006) [2023-03-06 16:34:17,057][04272] Updated weights for policy 0, policy_version 93710 (0.0006) [2023-03-06 16:34:17,870][04272] Updated weights for policy 0, policy_version 93720 (0.0006) [2023-03-06 16:34:18,694][04272] Updated weights for policy 0, policy_version 93730 (0.0006) [2023-03-06 16:34:18,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12612.3, 300 sec: 12600.4). Total num frames: 95982592. Throughput: 0: 12607.3. Samples: 95974351. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:34:18,941][03942] Avg episode reward: [(0, '1143.823')] [2023-03-06 16:34:19,500][04272] Updated weights for policy 0, policy_version 93740 (0.0008) [2023-03-06 16:34:20,320][04272] Updated weights for policy 0, policy_version 93750 (0.0007) [2023-03-06 16:34:21,124][04272] Updated weights for policy 0, policy_version 93760 (0.0006) [2023-03-06 16:34:21,938][04272] Updated weights for policy 0, policy_version 93770 (0.0006) [2023-03-06 16:34:22,759][04272] Updated weights for policy 0, policy_version 93780 (0.0006) [2023-03-06 16:34:23,565][04272] Updated weights for policy 0, policy_version 93790 (0.0006) [2023-03-06 16:34:23,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12600.4). Total num frames: 96045056. Throughput: 0: 12605.4. Samples: 96012271. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:34:23,941][03942] Avg episode reward: [(0, '1221.239')] [2023-03-06 16:34:24,377][04272] Updated weights for policy 0, policy_version 93800 (0.0006) [2023-03-06 16:34:25,187][04272] Updated weights for policy 0, policy_version 93810 (0.0007) [2023-03-06 16:34:25,998][04272] Updated weights for policy 0, policy_version 93820 (0.0006) [2023-03-06 16:34:26,796][04272] Updated weights for policy 0, policy_version 93830 (0.0007) [2023-03-06 16:34:27,622][04272] Updated weights for policy 0, policy_version 93840 (0.0006) [2023-03-06 16:34:28,438][04272] Updated weights for policy 0, policy_version 93850 (0.0007) [2023-03-06 16:34:28,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12600.4). Total num frames: 96108544. Throughput: 0: 12610.0. Samples: 96087869. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:34:28,941][03942] Avg episode reward: [(0, '1264.660')] [2023-03-06 16:34:29,242][04272] Updated weights for policy 0, policy_version 93860 (0.0007) [2023-03-06 16:34:30,065][04272] Updated weights for policy 0, policy_version 93870 (0.0006) [2023-03-06 16:34:30,881][04272] Updated weights for policy 0, policy_version 93880 (0.0007) [2023-03-06 16:34:31,687][04272] Updated weights for policy 0, policy_version 93890 (0.0007) [2023-03-06 16:34:32,502][04272] Updated weights for policy 0, policy_version 93900 (0.0006) [2023-03-06 16:34:33,312][04272] Updated weights for policy 0, policy_version 93910 (0.0008) [2023-03-06 16:34:33,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12596.9). Total num frames: 96171008. Throughput: 0: 12608.7. Samples: 96163459. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:34:33,941][03942] Avg episode reward: [(0, '1213.973')] [2023-03-06 16:34:34,117][04272] Updated weights for policy 0, policy_version 93920 (0.0007) [2023-03-06 16:34:34,924][04272] Updated weights for policy 0, policy_version 93930 (0.0007) [2023-03-06 16:34:35,747][04272] Updated weights for policy 0, policy_version 93940 (0.0006) [2023-03-06 16:34:36,554][04272] Updated weights for policy 0, policy_version 93950 (0.0007) [2023-03-06 16:34:37,365][04272] Updated weights for policy 0, policy_version 93960 (0.0007) [2023-03-06 16:34:38,178][04272] Updated weights for policy 0, policy_version 93970 (0.0006) [2023-03-06 16:34:38,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12600.4). Total num frames: 96234496. Throughput: 0: 12614.7. Samples: 96201449. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:34:38,941][03942] Avg episode reward: [(0, '1132.753')] [2023-03-06 16:34:39,002][04272] Updated weights for policy 0, policy_version 93980 (0.0006) [2023-03-06 16:34:39,810][04272] Updated weights for policy 0, policy_version 93990 (0.0006) [2023-03-06 16:34:40,606][04272] Updated weights for policy 0, policy_version 94000 (0.0006) [2023-03-06 16:34:41,429][04272] Updated weights for policy 0, policy_version 94010 (0.0006) [2023-03-06 16:34:42,249][04272] Updated weights for policy 0, policy_version 94020 (0.0006) [2023-03-06 16:34:43,058][04272] Updated weights for policy 0, policy_version 94030 (0.0006) [2023-03-06 16:34:43,866][04272] Updated weights for policy 0, policy_version 94040 (0.0006) [2023-03-06 16:34:43,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12596.9). Total num frames: 96296960. Throughput: 0: 12603.3. Samples: 96276818. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:34:43,941][03942] Avg episode reward: [(0, '1236.047')] [2023-03-06 16:34:44,669][04272] Updated weights for policy 0, policy_version 94050 (0.0006) [2023-03-06 16:34:45,493][04272] Updated weights for policy 0, policy_version 94060 (0.0006) [2023-03-06 16:34:46,299][04272] Updated weights for policy 0, policy_version 94070 (0.0007) [2023-03-06 16:34:47,122][04272] Updated weights for policy 0, policy_version 94080 (0.0006) [2023-03-06 16:34:47,936][04272] Updated weights for policy 0, policy_version 94090 (0.0006) [2023-03-06 16:34:48,741][04272] Updated weights for policy 0, policy_version 94100 (0.0006) [2023-03-06 16:34:48,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12600.4). Total num frames: 96360448. Throughput: 0: 12611.2. Samples: 96352504. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:34:48,941][03942] Avg episode reward: [(0, '1200.674')] [2023-03-06 16:34:49,553][04272] Updated weights for policy 0, policy_version 94110 (0.0006) [2023-03-06 16:34:50,346][04272] Updated weights for policy 0, policy_version 94120 (0.0006) [2023-03-06 16:34:51,174][04272] Updated weights for policy 0, policy_version 94130 (0.0006) [2023-03-06 16:34:51,960][04272] Updated weights for policy 0, policy_version 94140 (0.0007) [2023-03-06 16:34:52,774][04272] Updated weights for policy 0, policy_version 94150 (0.0007) [2023-03-06 16:34:53,600][04272] Updated weights for policy 0, policy_version 94160 (0.0006) [2023-03-06 16:34:53,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12612.2, 300 sec: 12603.9). Total num frames: 96423936. Throughput: 0: 12615.2. Samples: 96390564. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:34:53,941][03942] Avg episode reward: [(0, '1232.510')] [2023-03-06 16:34:54,417][04272] Updated weights for policy 0, policy_version 94170 (0.0006) [2023-03-06 16:34:55,221][04272] Updated weights for policy 0, policy_version 94180 (0.0006) [2023-03-06 16:34:56,017][04272] Updated weights for policy 0, policy_version 94190 (0.0006) [2023-03-06 16:34:56,837][04272] Updated weights for policy 0, policy_version 94200 (0.0006) [2023-03-06 16:34:57,634][04272] Updated weights for policy 0, policy_version 94210 (0.0006) [2023-03-06 16:34:58,441][04272] Updated weights for policy 0, policy_version 94220 (0.0006) [2023-03-06 16:34:58,940][03942] Fps is (10 sec: 12697.8, 60 sec: 12629.4, 300 sec: 12603.9). Total num frames: 96487424. Throughput: 0: 12614.5. Samples: 96466409. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:34:58,941][03942] Avg episode reward: [(0, '1386.924')] [2023-03-06 16:34:59,270][04272] Updated weights for policy 0, policy_version 94230 (0.0006) [2023-03-06 16:35:00,066][04272] Updated weights for policy 0, policy_version 94240 (0.0006) [2023-03-06 16:35:00,880][04272] Updated weights for policy 0, policy_version 94250 (0.0006) [2023-03-06 16:35:01,694][04272] Updated weights for policy 0, policy_version 94260 (0.0006) [2023-03-06 16:35:02,501][04272] Updated weights for policy 0, policy_version 94270 (0.0006) [2023-03-06 16:35:03,323][04272] Updated weights for policy 0, policy_version 94280 (0.0006) [2023-03-06 16:35:03,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12600.4). Total num frames: 96549888. Throughput: 0: 12619.1. Samples: 96542210. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:35:03,951][03942] Avg episode reward: [(0, '1316.753')] [2023-03-06 16:35:04,111][04272] Updated weights for policy 0, policy_version 94290 (0.0006) [2023-03-06 16:35:04,942][04272] Updated weights for policy 0, policy_version 94300 (0.0006) [2023-03-06 16:35:05,758][04272] Updated weights for policy 0, policy_version 94310 (0.0006) [2023-03-06 16:35:06,596][04272] Updated weights for policy 0, policy_version 94320 (0.0006) [2023-03-06 16:35:07,398][04272] Updated weights for policy 0, policy_version 94330 (0.0006) [2023-03-06 16:35:08,217][04272] Updated weights for policy 0, policy_version 94340 (0.0006) [2023-03-06 16:35:08,941][03942] Fps is (10 sec: 12595.0, 60 sec: 12629.3, 300 sec: 12603.9). Total num frames: 96613376. Throughput: 0: 12613.9. Samples: 96579895. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:35:08,952][03942] Avg episode reward: [(0, '1375.537')] [2023-03-06 16:35:08,956][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000094349_96613376.pth... 
[2023-03-06 16:35:08,987][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000091395_93588480.pth [2023-03-06 16:35:09,047][04272] Updated weights for policy 0, policy_version 94350 (0.0007) [2023-03-06 16:35:09,834][04272] Updated weights for policy 0, policy_version 94360 (0.0006) [2023-03-06 16:35:10,653][04272] Updated weights for policy 0, policy_version 94370 (0.0006) [2023-03-06 16:35:11,466][04272] Updated weights for policy 0, policy_version 94380 (0.0006) [2023-03-06 16:35:12,278][04272] Updated weights for policy 0, policy_version 94390 (0.0006) [2023-03-06 16:35:13,101][04272] Updated weights for policy 0, policy_version 94400 (0.0006) [2023-03-06 16:35:13,912][04272] Updated weights for policy 0, policy_version 94410 (0.0006) [2023-03-06 16:35:13,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12600.4). Total num frames: 96675840. Throughput: 0: 12612.2. Samples: 96655416. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:35:13,952][03942] Avg episode reward: [(0, '1293.391')] [2023-03-06 16:35:14,730][04272] Updated weights for policy 0, policy_version 94420 (0.0006) [2023-03-06 16:35:15,557][04272] Updated weights for policy 0, policy_version 94430 (0.0006) [2023-03-06 16:35:16,358][04272] Updated weights for policy 0, policy_version 94440 (0.0005) [2023-03-06 16:35:17,165][04272] Updated weights for policy 0, policy_version 94450 (0.0006) [2023-03-06 16:35:17,989][04272] Updated weights for policy 0, policy_version 94460 (0.0006) [2023-03-06 16:35:18,807][04272] Updated weights for policy 0, policy_version 94470 (0.0007) [2023-03-06 16:35:18,941][03942] Fps is (10 sec: 12492.7, 60 sec: 12595.2, 300 sec: 12600.4). Total num frames: 96738304. Throughput: 0: 12603.1. Samples: 96730600. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:35:18,952][03942] Avg episode reward: [(0, '1237.871')] [2023-03-06 16:35:19,621][04272] Updated weights for policy 0, policy_version 94480 (0.0006) [2023-03-06 16:35:20,426][04272] Updated weights for policy 0, policy_version 94490 (0.0006) [2023-03-06 16:35:21,233][04272] Updated weights for policy 0, policy_version 94500 (0.0007) [2023-03-06 16:35:22,054][04272] Updated weights for policy 0, policy_version 94510 (0.0006) [2023-03-06 16:35:22,859][04272] Updated weights for policy 0, policy_version 94520 (0.0006) [2023-03-06 16:35:23,668][04272] Updated weights for policy 0, policy_version 94530 (0.0008) [2023-03-06 16:35:23,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12600.4). Total num frames: 96801792. Throughput: 0: 12600.1. Samples: 96768456. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:35:23,941][03942] Avg episode reward: [(0, '1137.786')] [2023-03-06 16:35:24,485][04272] Updated weights for policy 0, policy_version 94540 (0.0007) [2023-03-06 16:35:25,288][04272] Updated weights for policy 0, policy_version 94550 (0.0006) [2023-03-06 16:35:26,103][04272] Updated weights for policy 0, policy_version 94560 (0.0006) [2023-03-06 16:35:26,914][04272] Updated weights for policy 0, policy_version 94570 (0.0007) [2023-03-06 16:35:27,722][04272] Updated weights for policy 0, policy_version 94580 (0.0006) [2023-03-06 16:35:28,548][04272] Updated weights for policy 0, policy_version 94590 (0.0006) [2023-03-06 16:35:28,941][03942] Fps is (10 sec: 12697.7, 60 sec: 12612.3, 300 sec: 12603.9). Total num frames: 96865280. Throughput: 0: 12612.0. Samples: 96844358. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:35:28,941][03942] Avg episode reward: [(0, '1160.672')] [2023-03-06 16:35:29,357][04272] Updated weights for policy 0, policy_version 94600 (0.0006) [2023-03-06 16:35:30,167][04272] Updated weights for policy 0, policy_version 94610 (0.0006) [2023-03-06 16:35:30,969][04272] Updated weights for policy 0, policy_version 94620 (0.0006) [2023-03-06 16:35:31,797][04272] Updated weights for policy 0, policy_version 94630 (0.0007) [2023-03-06 16:35:32,591][04272] Updated weights for policy 0, policy_version 94640 (0.0006) [2023-03-06 16:35:33,377][04272] Updated weights for policy 0, policy_version 94650 (0.0006) [2023-03-06 16:35:33,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.2, 300 sec: 12603.9). Total num frames: 96927744. Throughput: 0: 12613.3. Samples: 96920102. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:35:33,941][03942] Avg episode reward: [(0, '1146.366')] [2023-03-06 16:35:34,194][04272] Updated weights for policy 0, policy_version 94660 (0.0006) [2023-03-06 16:35:35,006][04272] Updated weights for policy 0, policy_version 94670 (0.0007) [2023-03-06 16:35:35,813][04272] Updated weights for policy 0, policy_version 94680 (0.0006) [2023-03-06 16:35:36,649][04272] Updated weights for policy 0, policy_version 94690 (0.0006) [2023-03-06 16:35:37,460][04272] Updated weights for policy 0, policy_version 94700 (0.0007) [2023-03-06 16:35:38,280][04272] Updated weights for policy 0, policy_version 94710 (0.0007) [2023-03-06 16:35:38,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12603.9). Total num frames: 96991232. Throughput: 0: 12608.6. Samples: 96957952. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:35:38,941][03942] Avg episode reward: [(0, '1199.854')] [2023-03-06 16:35:39,097][04272] Updated weights for policy 0, policy_version 94720 (0.0006) [2023-03-06 16:35:39,882][04272] Updated weights for policy 0, policy_version 94730 (0.0006) [2023-03-06 16:35:40,701][04272] Updated weights for policy 0, policy_version 94740 (0.0006) [2023-03-06 16:35:41,517][04272] Updated weights for policy 0, policy_version 94750 (0.0006) [2023-03-06 16:35:42,304][04272] Updated weights for policy 0, policy_version 94760 (0.0006) [2023-03-06 16:35:43,139][04272] Updated weights for policy 0, policy_version 94770 (0.0007) [2023-03-06 16:35:43,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12600.4). Total num frames: 97053696. Throughput: 0: 12604.5. Samples: 97033612. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:35:43,941][03942] Avg episode reward: [(0, '1253.078')] [2023-03-06 16:35:43,962][04272] Updated weights for policy 0, policy_version 94780 (0.0007) [2023-03-06 16:35:44,762][04272] Updated weights for policy 0, policy_version 94790 (0.0006) [2023-03-06 16:35:45,579][04272] Updated weights for policy 0, policy_version 94800 (0.0006) [2023-03-06 16:35:46,403][04272] Updated weights for policy 0, policy_version 94810 (0.0007) [2023-03-06 16:35:47,199][04272] Updated weights for policy 0, policy_version 94820 (0.0006) [2023-03-06 16:35:48,021][04272] Updated weights for policy 0, policy_version 94830 (0.0006) [2023-03-06 16:35:48,833][04272] Updated weights for policy 0, policy_version 94840 (0.0007) [2023-03-06 16:35:48,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12603.9). Total num frames: 97117184. Throughput: 0: 12599.5. Samples: 97109187. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:35:48,941][03942] Avg episode reward: [(0, '1293.613')] [2023-03-06 16:35:49,648][04272] Updated weights for policy 0, policy_version 94850 (0.0007) [2023-03-06 16:35:50,456][04272] Updated weights for policy 0, policy_version 94860 (0.0006) [2023-03-06 16:35:51,260][04272] Updated weights for policy 0, policy_version 94870 (0.0006) [2023-03-06 16:35:52,058][04272] Updated weights for policy 0, policy_version 94880 (0.0006) [2023-03-06 16:35:52,875][04272] Updated weights for policy 0, policy_version 94890 (0.0006) [2023-03-06 16:35:53,692][04272] Updated weights for policy 0, policy_version 94900 (0.0006) [2023-03-06 16:35:53,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12600.4). Total num frames: 97179648. Throughput: 0: 12602.9. Samples: 97147027. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 16:35:53,941][03942] Avg episode reward: [(0, '1111.059')] [2023-03-06 16:35:54,509][04272] Updated weights for policy 0, policy_version 94910 (0.0006) [2023-03-06 16:35:55,330][04272] Updated weights for policy 0, policy_version 94920 (0.0006) [2023-03-06 16:35:56,154][04272] Updated weights for policy 0, policy_version 94930 (0.0006) [2023-03-06 16:35:56,945][04272] Updated weights for policy 0, policy_version 94940 (0.0006) [2023-03-06 16:35:57,760][04272] Updated weights for policy 0, policy_version 94950 (0.0007) [2023-03-06 16:35:58,572][04272] Updated weights for policy 0, policy_version 94960 (0.0008) [2023-03-06 16:35:58,940][03942] Fps is (10 sec: 12595.4, 60 sec: 12595.2, 300 sec: 12603.9). Total num frames: 97243136. Throughput: 0: 12606.9. Samples: 97222724. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 16:35:58,941][03942] Avg episode reward: [(0, '1216.328')] [2023-03-06 16:35:59,389][04272] Updated weights for policy 0, policy_version 94970 (0.0006) [2023-03-06 16:36:00,196][04272] Updated weights for policy 0, policy_version 94980 (0.0006) [2023-03-06 16:36:01,014][04272] Updated weights for policy 0, policy_version 94990 (0.0007) [2023-03-06 16:36:01,833][04272] Updated weights for policy 0, policy_version 95000 (0.0007) [2023-03-06 16:36:02,635][04272] Updated weights for policy 0, policy_version 95010 (0.0007) [2023-03-06 16:36:03,452][04272] Updated weights for policy 0, policy_version 95020 (0.0006) [2023-03-06 16:36:03,940][03942] Fps is (10 sec: 12697.7, 60 sec: 12612.3, 300 sec: 12603.9). Total num frames: 97306624. Throughput: 0: 12616.9. Samples: 97298357. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 16:36:03,941][03942] Avg episode reward: [(0, '1096.577')] [2023-03-06 16:36:04,266][04272] Updated weights for policy 0, policy_version 95030 (0.0006) [2023-03-06 16:36:05,080][04272] Updated weights for policy 0, policy_version 95040 (0.0007) [2023-03-06 16:36:05,886][04272] Updated weights for policy 0, policy_version 95050 (0.0006) [2023-03-06 16:36:06,706][04272] Updated weights for policy 0, policy_version 95060 (0.0006) [2023-03-06 16:36:07,509][04272] Updated weights for policy 0, policy_version 95070 (0.0007) [2023-03-06 16:36:08,337][04272] Updated weights for policy 0, policy_version 95080 (0.0006) [2023-03-06 16:36:08,941][03942] Fps is (10 sec: 12595.0, 60 sec: 12595.2, 300 sec: 12603.9). Total num frames: 97369088. Throughput: 0: 12614.0. Samples: 97336084. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 16:36:08,941][03942] Avg episode reward: [(0, '1271.068')] [2023-03-06 16:36:09,134][04272] Updated weights for policy 0, policy_version 95090 (0.0006) [2023-03-06 16:36:09,953][04272] Updated weights for policy 0, policy_version 95100 (0.0006) [2023-03-06 16:36:10,751][04272] Updated weights for policy 0, policy_version 95110 (0.0006) [2023-03-06 16:36:11,568][04272] Updated weights for policy 0, policy_version 95120 (0.0007) [2023-03-06 16:36:12,396][04272] Updated weights for policy 0, policy_version 95130 (0.0007) [2023-03-06 16:36:13,201][04272] Updated weights for policy 0, policy_version 95140 (0.0006) [2023-03-06 16:36:13,941][03942] Fps is (10 sec: 12595.0, 60 sec: 12612.3, 300 sec: 12603.9). Total num frames: 97432576. Throughput: 0: 12608.7. Samples: 97411750. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 16:36:13,941][03942] Avg episode reward: [(0, '1272.577')] [2023-03-06 16:36:14,019][04272] Updated weights for policy 0, policy_version 95150 (0.0006) [2023-03-06 16:36:14,828][04272] Updated weights for policy 0, policy_version 95160 (0.0007) [2023-03-06 16:36:15,628][04272] Updated weights for policy 0, policy_version 95170 (0.0007) [2023-03-06 16:36:16,438][04272] Updated weights for policy 0, policy_version 95180 (0.0006) [2023-03-06 16:36:17,258][04272] Updated weights for policy 0, policy_version 95190 (0.0007) [2023-03-06 16:36:18,054][04272] Updated weights for policy 0, policy_version 95200 (0.0006) [2023-03-06 16:36:18,880][04272] Updated weights for policy 0, policy_version 95210 (0.0006) [2023-03-06 16:36:18,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12600.4). Total num frames: 97495040. Throughput: 0: 12609.3. Samples: 97487520. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 16:36:18,951][03942] Avg episode reward: [(0, '1221.283')] [2023-03-06 16:36:19,693][04272] Updated weights for policy 0, policy_version 95220 (0.0007) [2023-03-06 16:36:20,500][04272] Updated weights for policy 0, policy_version 95230 (0.0007) [2023-03-06 16:36:21,323][04272] Updated weights for policy 0, policy_version 95240 (0.0007) [2023-03-06 16:36:22,137][04272] Updated weights for policy 0, policy_version 95250 (0.0007) [2023-03-06 16:36:22,929][04272] Updated weights for policy 0, policy_version 95260 (0.0007) [2023-03-06 16:36:23,761][04272] Updated weights for policy 0, policy_version 95270 (0.0006) [2023-03-06 16:36:23,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12603.9). Total num frames: 97558528. Throughput: 0: 12609.7. Samples: 97525386. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 16:36:23,951][03942] Avg episode reward: [(0, '1288.374')] [2023-03-06 16:36:24,561][04272] Updated weights for policy 0, policy_version 95280 (0.0006) [2023-03-06 16:36:25,377][04272] Updated weights for policy 0, policy_version 95290 (0.0006) [2023-03-06 16:36:26,187][04272] Updated weights for policy 0, policy_version 95300 (0.0007) [2023-03-06 16:36:26,991][04272] Updated weights for policy 0, policy_version 95310 (0.0006) [2023-03-06 16:36:27,800][04272] Updated weights for policy 0, policy_version 95320 (0.0006) [2023-03-06 16:36:28,634][04272] Updated weights for policy 0, policy_version 95330 (0.0006) [2023-03-06 16:36:28,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12603.9). Total num frames: 97620992. Throughput: 0: 12611.5. Samples: 97601128. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 16:36:28,951][03942] Avg episode reward: [(0, '1270.457')] [2023-03-06 16:36:29,427][04272] Updated weights for policy 0, policy_version 95340 (0.0007) [2023-03-06 16:36:30,250][04272] Updated weights for policy 0, policy_version 95350 (0.0006) [2023-03-06 16:36:31,050][04272] Updated weights for policy 0, policy_version 95360 (0.0006) [2023-03-06 16:36:31,883][04272] Updated weights for policy 0, policy_version 95370 (0.0007) [2023-03-06 16:36:32,691][04272] Updated weights for policy 0, policy_version 95380 (0.0006) [2023-03-06 16:36:33,509][04272] Updated weights for policy 0, policy_version 95390 (0.0007) [2023-03-06 16:36:33,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12603.9). Total num frames: 97684480. Throughput: 0: 12612.0. Samples: 97676727. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 16:36:33,952][03942] Avg episode reward: [(0, '1212.872')] [2023-03-06 16:36:34,306][04272] Updated weights for policy 0, policy_version 95400 (0.0006) [2023-03-06 16:36:35,135][04272] Updated weights for policy 0, policy_version 95410 (0.0006) [2023-03-06 16:36:35,940][04272] Updated weights for policy 0, policy_version 95420 (0.0006) [2023-03-06 16:36:36,748][04272] Updated weights for policy 0, policy_version 95430 (0.0007) [2023-03-06 16:36:37,565][04272] Updated weights for policy 0, policy_version 95440 (0.0007) [2023-03-06 16:36:38,366][04272] Updated weights for policy 0, policy_version 95450 (0.0006) [2023-03-06 16:36:38,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12612.3, 300 sec: 12603.9). Total num frames: 97747968. Throughput: 0: 12608.0. Samples: 97714387. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 16:36:38,952][03942] Avg episode reward: [(0, '1026.626')] [2023-03-06 16:36:39,178][04272] Updated weights for policy 0, policy_version 95460 (0.0006) [2023-03-06 16:36:39,982][04272] Updated weights for policy 0, policy_version 95470 (0.0006) [2023-03-06 16:36:40,803][04272] Updated weights for policy 0, policy_version 95480 (0.0007) [2023-03-06 16:36:41,595][04272] Updated weights for policy 0, policy_version 95490 (0.0006) [2023-03-06 16:36:42,409][04272] Updated weights for policy 0, policy_version 95500 (0.0006) [2023-03-06 16:36:43,228][04272] Updated weights for policy 0, policy_version 95510 (0.0007) [2023-03-06 16:36:43,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12603.9). Total num frames: 97810432. Throughput: 0: 12614.0. Samples: 97790354. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 16:36:43,951][03942] Avg episode reward: [(0, '1086.684')] [2023-03-06 16:36:44,034][04272] Updated weights for policy 0, policy_version 95520 (0.0006) [2023-03-06 16:36:44,852][04272] Updated weights for policy 0, policy_version 95530 (0.0007) [2023-03-06 16:36:45,681][04272] Updated weights for policy 0, policy_version 95540 (0.0006) [2023-03-06 16:36:46,479][04272] Updated weights for policy 0, policy_version 95550 (0.0006) [2023-03-06 16:36:47,281][04272] Updated weights for policy 0, policy_version 95560 (0.0007) [2023-03-06 16:36:48,110][04272] Updated weights for policy 0, policy_version 95570 (0.0006) [2023-03-06 16:36:48,905][04272] Updated weights for policy 0, policy_version 95580 (0.0006) [2023-03-06 16:36:48,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12612.3, 300 sec: 12607.3). Total num frames: 97873920. Throughput: 0: 12611.4. Samples: 97865874. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:36:48,952][03942] Avg episode reward: [(0, '1274.218')] [2023-03-06 16:36:49,719][04272] Updated weights for policy 0, policy_version 95590 (0.0006) [2023-03-06 16:36:50,534][04272] Updated weights for policy 0, policy_version 95600 (0.0006) [2023-03-06 16:36:51,348][04272] Updated weights for policy 0, policy_version 95610 (0.0006) [2023-03-06 16:36:52,168][04272] Updated weights for policy 0, policy_version 95620 (0.0007) [2023-03-06 16:36:53,003][04272] Updated weights for policy 0, policy_version 95630 (0.0007) [2023-03-06 16:36:53,804][04272] Updated weights for policy 0, policy_version 95640 (0.0006) [2023-03-06 16:36:53,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12603.9). Total num frames: 97936384. Throughput: 0: 12613.7. Samples: 97903700. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:36:53,952][03942] Avg episode reward: [(0, '1370.147')] [2023-03-06 16:36:54,589][04272] Updated weights for policy 0, policy_version 95650 (0.0006) [2023-03-06 16:36:55,418][04272] Updated weights for policy 0, policy_version 95660 (0.0006) [2023-03-06 16:36:56,204][04272] Updated weights for policy 0, policy_version 95670 (0.0006) [2023-03-06 16:36:57,017][04272] Updated weights for policy 0, policy_version 95680 (0.0006) [2023-03-06 16:36:57,829][04272] Updated weights for policy 0, policy_version 95690 (0.0007) [2023-03-06 16:36:58,629][04272] Updated weights for policy 0, policy_version 95700 (0.0006) [2023-03-06 16:36:58,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.2, 300 sec: 12607.3). Total num frames: 97999872. Throughput: 0: 12620.8. Samples: 97979687. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:36:58,952][03942] Avg episode reward: [(0, '715.184')] [2023-03-06 16:36:59,445][04272] Updated weights for policy 0, policy_version 95710 (0.0006) [2023-03-06 16:37:00,254][04272] Updated weights for policy 0, policy_version 95720 (0.0007) [2023-03-06 16:37:01,059][04272] Updated weights for policy 0, policy_version 95730 (0.0007) [2023-03-06 16:37:01,862][04272] Updated weights for policy 0, policy_version 95740 (0.0006) [2023-03-06 16:37:02,671][04272] Updated weights for policy 0, policy_version 95750 (0.0007) [2023-03-06 16:37:03,498][04272] Updated weights for policy 0, policy_version 95760 (0.0006) [2023-03-06 16:37:03,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12612.2, 300 sec: 12607.3). Total num frames: 98063360. Throughput: 0: 12621.5. Samples: 98055489. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:37:03,952][03942] Avg episode reward: [(0, '1171.858')] [2023-03-06 16:37:04,300][04272] Updated weights for policy 0, policy_version 95770 (0.0007) [2023-03-06 16:37:05,112][04272] Updated weights for policy 0, policy_version 95780 (0.0006) [2023-03-06 16:37:05,928][04272] Updated weights for policy 0, policy_version 95790 (0.0007) [2023-03-06 16:37:06,755][04272] Updated weights for policy 0, policy_version 95800 (0.0006) [2023-03-06 16:37:07,558][04272] Updated weights for policy 0, policy_version 95810 (0.0006) [2023-03-06 16:37:08,367][04272] Updated weights for policy 0, policy_version 95820 (0.0006) [2023-03-06 16:37:08,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12610.8). Total num frames: 98126848. Throughput: 0: 12619.1. Samples: 98093246. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:37:08,952][03942] Avg episode reward: [(0, '1168.177')] [2023-03-06 16:37:08,955][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000095827_98126848.pth... 
[2023-03-06 16:37:08,985][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000092871_95099904.pth [2023-03-06 16:37:09,195][04272] Updated weights for policy 0, policy_version 95830 (0.0006) [2023-03-06 16:37:09,996][04272] Updated weights for policy 0, policy_version 95840 (0.0007) [2023-03-06 16:37:10,808][04272] Updated weights for policy 0, policy_version 95850 (0.0006) [2023-03-06 16:37:11,632][04272] Updated weights for policy 0, policy_version 95860 (0.0006) [2023-03-06 16:37:12,441][04272] Updated weights for policy 0, policy_version 95870 (0.0006) [2023-03-06 16:37:13,261][04272] Updated weights for policy 0, policy_version 95880 (0.0006) [2023-03-06 16:37:13,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12607.4). Total num frames: 98189312. Throughput: 0: 12614.9. Samples: 98168798. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:37:13,941][03942] Avg episode reward: [(0, '1215.929')] [2023-03-06 16:37:14,062][04272] Updated weights for policy 0, policy_version 95890 (0.0006) [2023-03-06 16:37:14,873][04272] Updated weights for policy 0, policy_version 95900 (0.0006) [2023-03-06 16:37:15,690][04272] Updated weights for policy 0, policy_version 95910 (0.0007) [2023-03-06 16:37:16,498][04272] Updated weights for policy 0, policy_version 95920 (0.0007) [2023-03-06 16:37:17,322][04272] Updated weights for policy 0, policy_version 95930 (0.0006) [2023-03-06 16:37:18,157][04272] Updated weights for policy 0, policy_version 95940 (0.0006) [2023-03-06 16:37:18,940][03942] Fps is (10 sec: 12492.9, 60 sec: 12612.3, 300 sec: 12607.3). Total num frames: 98251776. Throughput: 0: 12612.0. Samples: 98244266. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:37:18,941][03942] Avg episode reward: [(0, '1245.712')] [2023-03-06 16:37:18,945][04272] Updated weights for policy 0, policy_version 95950 (0.0007) [2023-03-06 16:37:19,757][04272] Updated weights for policy 0, policy_version 95960 (0.0007) [2023-03-06 16:37:20,573][04272] Updated weights for policy 0, policy_version 95970 (0.0007) [2023-03-06 16:37:21,379][04272] Updated weights for policy 0, policy_version 95980 (0.0006) [2023-03-06 16:37:22,197][04272] Updated weights for policy 0, policy_version 95990 (0.0006) [2023-03-06 16:37:23,013][04272] Updated weights for policy 0, policy_version 96000 (0.0006) [2023-03-06 16:37:23,832][04272] Updated weights for policy 0, policy_version 96010 (0.0007) [2023-03-06 16:37:23,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12607.3). Total num frames: 98315264. Throughput: 0: 12616.5. Samples: 98282129. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:37:23,941][03942] Avg episode reward: [(0, '1148.994')] [2023-03-06 16:37:24,625][04272] Updated weights for policy 0, policy_version 96020 (0.0006) [2023-03-06 16:37:25,420][04272] Updated weights for policy 0, policy_version 96030 (0.0006) [2023-03-06 16:37:26,240][04272] Updated weights for policy 0, policy_version 96040 (0.0006) [2023-03-06 16:37:27,034][04272] Updated weights for policy 0, policy_version 96050 (0.0006) [2023-03-06 16:37:27,850][04272] Updated weights for policy 0, policy_version 96060 (0.0006) [2023-03-06 16:37:28,652][04272] Updated weights for policy 0, policy_version 96070 (0.0006) [2023-03-06 16:37:28,941][03942] Fps is (10 sec: 12697.4, 60 sec: 12629.3, 300 sec: 12610.8). Total num frames: 98378752. Throughput: 0: 12622.1. Samples: 98358352. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:37:28,941][03942] Avg episode reward: [(0, '218.100')] [2023-03-06 16:37:29,474][04272] Updated weights for policy 0, policy_version 96080 (0.0007) [2023-03-06 16:37:30,286][04272] Updated weights for policy 0, policy_version 96090 (0.0006) [2023-03-06 16:37:31,103][04272] Updated weights for policy 0, policy_version 96100 (0.0007) [2023-03-06 16:37:31,898][04272] Updated weights for policy 0, policy_version 96110 (0.0006) [2023-03-06 16:37:32,714][04272] Updated weights for policy 0, policy_version 96120 (0.0006) [2023-03-06 16:37:33,555][04272] Updated weights for policy 0, policy_version 96130 (0.0006) [2023-03-06 16:37:33,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12629.3, 300 sec: 12610.8). Total num frames: 98442240. Throughput: 0: 12623.2. Samples: 98433919. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:37:33,941][03942] Avg episode reward: [(0, '196.351')] [2023-03-06 16:37:34,334][04272] Updated weights for policy 0, policy_version 96140 (0.0007) [2023-03-06 16:37:35,137][04272] Updated weights for policy 0, policy_version 96150 (0.0007) [2023-03-06 16:37:35,951][04272] Updated weights for policy 0, policy_version 96160 (0.0007) [2023-03-06 16:37:36,758][04272] Updated weights for policy 0, policy_version 96170 (0.0007) [2023-03-06 16:37:37,559][04272] Updated weights for policy 0, policy_version 96180 (0.0006) [2023-03-06 16:37:38,358][04272] Updated weights for policy 0, policy_version 96190 (0.0006) [2023-03-06 16:37:38,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12610.8). Total num frames: 98504704. Throughput: 0: 12627.2. Samples: 98471923. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:37:38,952][03942] Avg episode reward: [(0, '201.810')] [2023-03-06 16:37:39,201][04272] Updated weights for policy 0, policy_version 96200 (0.0006) [2023-03-06 16:37:39,995][04272] Updated weights for policy 0, policy_version 96210 (0.0007) [2023-03-06 16:37:40,793][04272] Updated weights for policy 0, policy_version 96220 (0.0006) [2023-03-06 16:37:41,604][04272] Updated weights for policy 0, policy_version 96230 (0.0007) [2023-03-06 16:37:42,416][04272] Updated weights for policy 0, policy_version 96240 (0.0006) [2023-03-06 16:37:43,212][04272] Updated weights for policy 0, policy_version 96250 (0.0007) [2023-03-06 16:37:43,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12629.3, 300 sec: 12610.8). Total num frames: 98568192. Throughput: 0: 12626.9. Samples: 98547898. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:37:43,952][03942] Avg episode reward: [(0, '193.551')] [2023-03-06 16:37:44,023][04272] Updated weights for policy 0, policy_version 96260 (0.0007) [2023-03-06 16:37:44,833][04272] Updated weights for policy 0, policy_version 96270 (0.0006) [2023-03-06 16:37:45,634][04272] Updated weights for policy 0, policy_version 96280 (0.0007) [2023-03-06 16:37:46,447][04272] Updated weights for policy 0, policy_version 96290 (0.0007) [2023-03-06 16:37:47,263][04272] Updated weights for policy 0, policy_version 96300 (0.0006) [2023-03-06 16:37:48,062][04272] Updated weights for policy 0, policy_version 96310 (0.0006) [2023-03-06 16:37:48,867][04272] Updated weights for policy 0, policy_version 96320 (0.0006) [2023-03-06 16:37:48,940][03942] Fps is (10 sec: 12800.0, 60 sec: 12646.4, 300 sec: 12614.3). Total num frames: 98632704. Throughput: 0: 12637.8. Samples: 98624189. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:37:48,951][03942] Avg episode reward: [(0, '268.316')] [2023-03-06 16:37:49,681][04272] Updated weights for policy 0, policy_version 96330 (0.0006) [2023-03-06 16:37:50,485][04272] Updated weights for policy 0, policy_version 96340 (0.0006) [2023-03-06 16:37:51,305][04272] Updated weights for policy 0, policy_version 96350 (0.0007) [2023-03-06 16:37:52,121][04272] Updated weights for policy 0, policy_version 96360 (0.0007) [2023-03-06 16:37:52,916][04272] Updated weights for policy 0, policy_version 96370 (0.0006) [2023-03-06 16:37:53,746][04272] Updated weights for policy 0, policy_version 96380 (0.0006) [2023-03-06 16:37:53,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12646.4, 300 sec: 12614.3). Total num frames: 98695168. Throughput: 0: 12642.7. Samples: 98662167. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:37:53,952][03942] Avg episode reward: [(0, '793.960')] [2023-03-06 16:37:54,553][04272] Updated weights for policy 0, policy_version 96390 (0.0005) [2023-03-06 16:37:55,362][04272] Updated weights for policy 0, policy_version 96400 (0.0006) [2023-03-06 16:37:56,178][04272] Updated weights for policy 0, policy_version 96410 (0.0006) [2023-03-06 16:37:56,983][04272] Updated weights for policy 0, policy_version 96420 (0.0006) [2023-03-06 16:37:57,798][04272] Updated weights for policy 0, policy_version 96430 (0.0007) [2023-03-06 16:37:58,610][04272] Updated weights for policy 0, policy_version 96440 (0.0006) [2023-03-06 16:37:58,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12646.4, 300 sec: 12614.3). Total num frames: 98758656. Throughput: 0: 12643.3. Samples: 98737747. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:37:58,941][03942] Avg episode reward: [(0, '865.182')] [2023-03-06 16:37:59,428][04272] Updated weights for policy 0, policy_version 96450 (0.0006) [2023-03-06 16:38:00,237][04272] Updated weights for policy 0, policy_version 96460 (0.0006) [2023-03-06 16:38:01,055][04272] Updated weights for policy 0, policy_version 96470 (0.0006) [2023-03-06 16:38:01,857][04272] Updated weights for policy 0, policy_version 96480 (0.0006) [2023-03-06 16:38:02,674][04272] Updated weights for policy 0, policy_version 96490 (0.0006) [2023-03-06 16:38:03,470][04272] Updated weights for policy 0, policy_version 96500 (0.0006) [2023-03-06 16:38:03,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12614.3). Total num frames: 98821120. Throughput: 0: 12648.0. Samples: 98813428. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:38:03,941][03942] Avg episode reward: [(0, '1020.622')] [2023-03-06 16:38:04,297][04272] Updated weights for policy 0, policy_version 96510 (0.0006) [2023-03-06 16:38:05,106][04272] Updated weights for policy 0, policy_version 96520 (0.0006) [2023-03-06 16:38:05,923][04272] Updated weights for policy 0, policy_version 96530 (0.0007) [2023-03-06 16:38:06,744][04272] Updated weights for policy 0, policy_version 96540 (0.0007) [2023-03-06 16:38:07,567][04272] Updated weights for policy 0, policy_version 96550 (0.0007) [2023-03-06 16:38:08,386][04272] Updated weights for policy 0, policy_version 96560 (0.0007) [2023-03-06 16:38:08,940][03942] Fps is (10 sec: 12492.8, 60 sec: 12612.3, 300 sec: 12614.3). Total num frames: 98883584. Throughput: 0: 12644.8. Samples: 98851143. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:38:08,941][03942] Avg episode reward: [(0, '1071.411')] [2023-03-06 16:38:09,184][04272] Updated weights for policy 0, policy_version 96570 (0.0006) [2023-03-06 16:38:09,986][04272] Updated weights for policy 0, policy_version 96580 (0.0007) [2023-03-06 16:38:10,802][04272] Updated weights for policy 0, policy_version 96590 (0.0006) [2023-03-06 16:38:11,609][04272] Updated weights for policy 0, policy_version 96600 (0.0006) [2023-03-06 16:38:12,416][04272] Updated weights for policy 0, policy_version 96610 (0.0007) [2023-03-06 16:38:13,241][04272] Updated weights for policy 0, policy_version 96620 (0.0008) [2023-03-06 16:38:13,940][03942] Fps is (10 sec: 12595.4, 60 sec: 12629.4, 300 sec: 12614.3). Total num frames: 98947072. Throughput: 0: 12634.3. Samples: 98926891. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:38:13,941][03942] Avg episode reward: [(0, '1173.168')] [2023-03-06 16:38:14,057][04272] Updated weights for policy 0, policy_version 96630 (0.0006) [2023-03-06 16:38:14,879][04272] Updated weights for policy 0, policy_version 96640 (0.0007) [2023-03-06 16:38:15,671][04272] Updated weights for policy 0, policy_version 96650 (0.0006) [2023-03-06 16:38:16,482][04272] Updated weights for policy 0, policy_version 96660 (0.0006) [2023-03-06 16:38:17,303][04272] Updated weights for policy 0, policy_version 96670 (0.0007) [2023-03-06 16:38:18,106][04272] Updated weights for policy 0, policy_version 96680 (0.0006) [2023-03-06 16:38:18,927][04272] Updated weights for policy 0, policy_version 96690 (0.0006) [2023-03-06 16:38:18,941][03942] Fps is (10 sec: 12697.5, 60 sec: 12646.4, 300 sec: 12617.8). Total num frames: 99010560. Throughput: 0: 12633.2. Samples: 99002413. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:38:18,941][03942] Avg episode reward: [(0, '1216.703')] [2023-03-06 16:38:19,726][04272] Updated weights for policy 0, policy_version 96700 (0.0006) [2023-03-06 16:38:20,532][04272] Updated weights for policy 0, policy_version 96710 (0.0006) [2023-03-06 16:38:21,346][04272] Updated weights for policy 0, policy_version 96720 (0.0006) [2023-03-06 16:38:22,150][04272] Updated weights for policy 0, policy_version 96730 (0.0007) [2023-03-06 16:38:22,964][04272] Updated weights for policy 0, policy_version 96740 (0.0007) [2023-03-06 16:38:23,774][04272] Updated weights for policy 0, policy_version 96750 (0.0006) [2023-03-06 16:38:23,940][03942] Fps is (10 sec: 12697.6, 60 sec: 12646.4, 300 sec: 12617.8). Total num frames: 99074048. Throughput: 0: 12635.3. Samples: 99040510. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:38:23,941][03942] Avg episode reward: [(0, '1239.799')] [2023-03-06 16:38:24,603][04272] Updated weights for policy 0, policy_version 96760 (0.0006) [2023-03-06 16:38:25,404][04272] Updated weights for policy 0, policy_version 96770 (0.0006) [2023-03-06 16:38:26,227][04272] Updated weights for policy 0, policy_version 96780 (0.0006) [2023-03-06 16:38:27,049][04272] Updated weights for policy 0, policy_version 96790 (0.0006) [2023-03-06 16:38:27,835][04272] Updated weights for policy 0, policy_version 96800 (0.0006) [2023-03-06 16:38:28,646][04272] Updated weights for policy 0, policy_version 96810 (0.0007) [2023-03-06 16:38:28,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12629.3, 300 sec: 12614.3). Total num frames: 99136512. Throughput: 0: 12627.9. Samples: 99116156. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:38:28,941][03942] Avg episode reward: [(0, '1166.431')] [2023-03-06 16:38:29,455][04272] Updated weights for policy 0, policy_version 96820 (0.0006) [2023-03-06 16:38:30,274][04272] Updated weights for policy 0, policy_version 96830 (0.0006) [2023-03-06 16:38:31,068][04272] Updated weights for policy 0, policy_version 96840 (0.0006) [2023-03-06 16:38:31,880][04272] Updated weights for policy 0, policy_version 96850 (0.0007) [2023-03-06 16:38:32,686][04272] Updated weights for policy 0, policy_version 96860 (0.0007) [2023-03-06 16:38:33,497][04272] Updated weights for policy 0, policy_version 96870 (0.0005) [2023-03-06 16:38:33,940][03942] Fps is (10 sec: 12595.1, 60 sec: 12629.4, 300 sec: 12617.8). Total num frames: 99200000. Throughput: 0: 12619.5. Samples: 99192065. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:38:33,941][03942] Avg episode reward: [(0, '1139.894')] [2023-03-06 16:38:34,318][04272] Updated weights for policy 0, policy_version 96880 (0.0007) [2023-03-06 16:38:35,129][04272] Updated weights for policy 0, policy_version 96890 (0.0006) [2023-03-06 16:38:35,925][04272] Updated weights for policy 0, policy_version 96900 (0.0006) [2023-03-06 16:38:36,750][04272] Updated weights for policy 0, policy_version 96910 (0.0006) [2023-03-06 16:38:37,555][04272] Updated weights for policy 0, policy_version 96920 (0.0007) [2023-03-06 16:38:38,365][04272] Updated weights for policy 0, policy_version 96930 (0.0007) [2023-03-06 16:38:38,940][03942] Fps is (10 sec: 12697.8, 60 sec: 12646.4, 300 sec: 12617.8). Total num frames: 99263488. Throughput: 0: 12616.8. Samples: 99229921. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:38:38,941][03942] Avg episode reward: [(0, '1288.520')] [2023-03-06 16:38:39,183][04272] Updated weights for policy 0, policy_version 96940 (0.0006) [2023-03-06 16:38:39,978][04272] Updated weights for policy 0, policy_version 96950 (0.0006) [2023-03-06 16:38:40,800][04272] Updated weights for policy 0, policy_version 96960 (0.0007) [2023-03-06 16:38:41,598][04272] Updated weights for policy 0, policy_version 96970 (0.0006) [2023-03-06 16:38:42,434][04272] Updated weights for policy 0, policy_version 96980 (0.0006) [2023-03-06 16:38:43,223][04272] Updated weights for policy 0, policy_version 96990 (0.0006) [2023-03-06 16:38:43,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12629.3, 300 sec: 12617.8). Total num frames: 99325952. Throughput: 0: 12622.7. Samples: 99305771. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:38:43,941][03942] Avg episode reward: [(0, '1279.280')] [2023-03-06 16:38:44,039][04272] Updated weights for policy 0, policy_version 97000 (0.0007) [2023-03-06 16:38:44,855][04272] Updated weights for policy 0, policy_version 97010 (0.0006) [2023-03-06 16:38:45,681][04272] Updated weights for policy 0, policy_version 97020 (0.0006) [2023-03-06 16:38:46,481][04272] Updated weights for policy 0, policy_version 97030 (0.0006) [2023-03-06 16:38:47,298][04272] Updated weights for policy 0, policy_version 97040 (0.0006) [2023-03-06 16:38:48,097][04272] Updated weights for policy 0, policy_version 97050 (0.0007) [2023-03-06 16:38:48,893][04272] Updated weights for policy 0, policy_version 97060 (0.0006) [2023-03-06 16:38:48,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12617.8). Total num frames: 99389440. Throughput: 0: 12624.2. Samples: 99381518. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:38:48,941][03942] Avg episode reward: [(0, '1302.968')] [2023-03-06 16:38:49,714][04272] Updated weights for policy 0, policy_version 97070 (0.0006) [2023-03-06 16:38:50,509][04272] Updated weights for policy 0, policy_version 97080 (0.0006) [2023-03-06 16:38:51,347][04272] Updated weights for policy 0, policy_version 97090 (0.0006) [2023-03-06 16:38:52,144][04272] Updated weights for policy 0, policy_version 97100 (0.0007) [2023-03-06 16:38:52,965][04272] Updated weights for policy 0, policy_version 97110 (0.0006) [2023-03-06 16:38:53,759][04272] Updated weights for policy 0, policy_version 97120 (0.0007) [2023-03-06 16:38:53,941][03942] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12621.2). Total num frames: 99452928. Throughput: 0: 12628.2. Samples: 99419411. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:38:53,941][03942] Avg episode reward: [(0, '1180.178')] [2023-03-06 16:38:54,584][04272] Updated weights for policy 0, policy_version 97130 (0.0007) [2023-03-06 16:38:55,402][04272] Updated weights for policy 0, policy_version 97140 (0.0006) [2023-03-06 16:38:56,214][04272] Updated weights for policy 0, policy_version 97150 (0.0006) [2023-03-06 16:38:57,028][04272] Updated weights for policy 0, policy_version 97160 (0.0007) [2023-03-06 16:38:57,846][04272] Updated weights for policy 0, policy_version 97170 (0.0006) [2023-03-06 16:38:58,676][04272] Updated weights for policy 0, policy_version 97180 (0.0006) [2023-03-06 16:38:58,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12612.3, 300 sec: 12617.8). Total num frames: 99515392. Throughput: 0: 12624.6. Samples: 99495000. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:38:58,941][03942] Avg episode reward: [(0, '1214.023')] [2023-03-06 16:38:59,507][04272] Updated weights for policy 0, policy_version 97190 (0.0006) [2023-03-06 16:39:00,315][04272] Updated weights for policy 0, policy_version 97200 (0.0006) [2023-03-06 16:39:01,117][04272] Updated weights for policy 0, policy_version 97210 (0.0007) [2023-03-06 16:39:01,932][04272] Updated weights for policy 0, policy_version 97220 (0.0007) [2023-03-06 16:39:02,757][04272] Updated weights for policy 0, policy_version 97230 (0.0006) [2023-03-06 16:39:03,569][04272] Updated weights for policy 0, policy_version 97240 (0.0007) [2023-03-06 16:39:03,941][03942] Fps is (10 sec: 12492.8, 60 sec: 12612.3, 300 sec: 12617.8). Total num frames: 99577856. Throughput: 0: 12616.7. Samples: 99570164. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:39:03,941][03942] Avg episode reward: [(0, '1242.995')] [2023-03-06 16:39:04,402][04272] Updated weights for policy 0, policy_version 97250 (0.0006) [2023-03-06 16:39:05,200][04272] Updated weights for policy 0, policy_version 97260 (0.0006) [2023-03-06 16:39:06,014][04272] Updated weights for policy 0, policy_version 97270 (0.0007) [2023-03-06 16:39:06,813][04272] Updated weights for policy 0, policy_version 97280 (0.0006) [2023-03-06 16:39:07,629][04272] Updated weights for policy 0, policy_version 97290 (0.0006) [2023-03-06 16:39:08,448][04272] Updated weights for policy 0, policy_version 97300 (0.0008) [2023-03-06 16:39:08,941][03942] Fps is (10 sec: 12492.7, 60 sec: 12612.2, 300 sec: 12614.3). Total num frames: 99640320. Throughput: 0: 12608.8. Samples: 99607908. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:39:08,941][03942] Avg episode reward: [(0, '1097.907')] [2023-03-06 16:39:08,944][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000097306_99641344.pth... 
[2023-03-06 16:39:08,977][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000094349_96613376.pth [2023-03-06 16:39:09,260][04272] Updated weights for policy 0, policy_version 97310 (0.0006) [2023-03-06 16:39:10,081][04272] Updated weights for policy 0, policy_version 97320 (0.0007) [2023-03-06 16:39:10,897][04272] Updated weights for policy 0, policy_version 97330 (0.0007) [2023-03-06 16:39:11,708][04272] Updated weights for policy 0, policy_version 97340 (0.0006) [2023-03-06 16:39:12,527][04272] Updated weights for policy 0, policy_version 97350 (0.0006) [2023-03-06 16:39:13,335][04272] Updated weights for policy 0, policy_version 97360 (0.0006) [2023-03-06 16:39:13,940][03942] Fps is (10 sec: 12595.3, 60 sec: 12612.3, 300 sec: 12614.3). Total num frames: 99703808. Throughput: 0: 12607.3. Samples: 99683484. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:39:13,941][03942] Avg episode reward: [(0, '1255.654')] [2023-03-06 16:39:14,146][04272] Updated weights for policy 0, policy_version 97370 (0.0006) [2023-03-06 16:39:14,974][04272] Updated weights for policy 0, policy_version 97380 (0.0006) [2023-03-06 16:39:15,778][04272] Updated weights for policy 0, policy_version 97390 (0.0006) [2023-03-06 16:39:16,589][04272] Updated weights for policy 0, policy_version 97400 (0.0007) [2023-03-06 16:39:17,415][04272] Updated weights for policy 0, policy_version 97410 (0.0006) [2023-03-06 16:39:18,227][04272] Updated weights for policy 0, policy_version 97420 (0.0006) [2023-03-06 16:39:18,941][03942] Fps is (10 sec: 12595.3, 60 sec: 12595.2, 300 sec: 12614.3). Total num frames: 99766272. Throughput: 0: 12594.5. Samples: 99758817. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:39:18,941][03942] Avg episode reward: [(0, '1285.878')] [2023-03-06 16:39:19,040][04272] Updated weights for policy 0, policy_version 97430 (0.0006) [2023-03-06 16:39:19,855][04272] Updated weights for policy 0, policy_version 97440 (0.0006) [2023-03-06 16:39:20,653][04272] Updated weights for policy 0, policy_version 97450 (0.0006) [2023-03-06 16:39:21,472][04272] Updated weights for policy 0, policy_version 97460 (0.0006) [2023-03-06 16:39:22,282][04272] Updated weights for policy 0, policy_version 97470 (0.0007) [2023-03-06 16:39:23,097][04272] Updated weights for policy 0, policy_version 97480 (0.0006) [2023-03-06 16:39:23,894][04272] Updated weights for policy 0, policy_version 97490 (0.0006) [2023-03-06 16:39:23,940][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12614.3). Total num frames: 99829760. Throughput: 0: 12595.4. Samples: 99796714. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:39:23,941][03942] Avg episode reward: [(0, '1255.369')] [2023-03-06 16:39:24,706][04272] Updated weights for policy 0, policy_version 97500 (0.0007) [2023-03-06 16:39:25,545][04272] Updated weights for policy 0, policy_version 97510 (0.0006) [2023-03-06 16:39:26,360][04272] Updated weights for policy 0, policy_version 97520 (0.0006) [2023-03-06 16:39:27,166][04272] Updated weights for policy 0, policy_version 97530 (0.0006) [2023-03-06 16:39:27,977][04272] Updated weights for policy 0, policy_version 97540 (0.0007) [2023-03-06 16:39:28,784][04272] Updated weights for policy 0, policy_version 97550 (0.0006) [2023-03-06 16:39:28,941][03942] Fps is (10 sec: 12595.2, 60 sec: 12595.2, 300 sec: 12614.3). Total num frames: 99892224. Throughput: 0: 12587.2. Samples: 99872196. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:39:28,941][03942] Avg episode reward: [(0, '1231.797')] [2023-03-06 16:39:29,598][04272] Updated weights for policy 0, policy_version 97560 (0.0006) [2023-03-06 16:39:30,418][04272] Updated weights for policy 0, policy_version 97570 (0.0006) [2023-03-06 16:39:31,230][04272] Updated weights for policy 0, policy_version 97580 (0.0006) [2023-03-06 16:39:32,040][04272] Updated weights for policy 0, policy_version 97590 (0.0006) [2023-03-06 16:39:32,849][04272] Updated weights for policy 0, policy_version 97600 (0.0006) [2023-03-06 16:39:33,664][04272] Updated weights for policy 0, policy_version 97610 (0.0007) [2023-03-06 16:39:33,941][03942] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 12614.3). Total num frames: 99955712. Throughput: 0: 12584.0. Samples: 99947797. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:39:33,941][03942] Avg episode reward: [(0, '1252.215')] [2023-03-06 16:39:34,482][04272] Updated weights for policy 0, policy_version 97620 (0.0007) [2023-03-06 16:39:35,295][04272] Updated weights for policy 0, policy_version 97630 (0.0006) [2023-03-06 16:39:36,119][04272] Updated weights for policy 0, policy_version 97640 (0.0007) [2023-03-06 16:39:36,911][04272] Updated weights for policy 0, policy_version 97650 (0.0006) [2023-03-06 16:39:37,570][04609] Stopping RolloutWorker_w24... [2023-03-06 16:39:37,570][04437] Stopping RolloutWorker_w18... [2023-03-06 16:39:37,570][04513] Stopping RolloutWorker_w23... [2023-03-06 16:39:37,570][04433] Stopping RolloutWorker_w5... [2023-03-06 16:39:37,570][04274] Stopping RolloutWorker_w0... [2023-03-06 16:39:37,570][04478] Stopping RolloutWorker_w17... [2023-03-06 16:39:37,570][04609] Loop rollout_proc24_evt_loop terminating... [2023-03-06 16:39:37,570][04481] Stopping RolloutWorker_w22... [2023-03-06 16:39:37,570][04437] Loop rollout_proc18_evt_loop terminating... [2023-03-06 16:39:37,570][04513] Loop rollout_proc23_evt_loop terminating... 
[2023-03-06 16:39:37,570][04277] Stopping RolloutWorker_w4... [2023-03-06 16:39:37,570][04276] Stopping RolloutWorker_w3... [2023-03-06 16:39:37,570][04274] Loop rollout_proc0_evt_loop terminating... [2023-03-06 16:39:37,570][04433] Loop rollout_proc5_evt_loop terminating... [2023-03-06 16:39:37,570][04435] Stopping RolloutWorker_w15... [2023-03-06 16:39:37,570][04478] Loop rollout_proc17_evt_loop terminating... [2023-03-06 16:39:37,570][04438] Stopping RolloutWorker_w16... [2023-03-06 16:39:37,570][04480] Stopping RolloutWorker_w21... [2023-03-06 16:39:37,570][04436] Stopping RolloutWorker_w20... [2023-03-06 16:39:37,570][04471] Stopping RolloutWorker_w9... [2023-03-06 16:39:37,570][04474] Stopping RolloutWorker_w8... [2023-03-06 16:39:37,570][04275] Stopping RolloutWorker_w2... [2023-03-06 16:39:37,570][04481] Loop rollout_proc22_evt_loop terminating... [2023-03-06 16:39:37,570][04479] Stopping RolloutWorker_w14... [2023-03-06 16:39:37,570][04476] Stopping RolloutWorker_w19... [2023-03-06 16:39:37,570][04577] Stopping RolloutWorker_w26... [2023-03-06 16:39:37,570][04477] Stopping RolloutWorker_w12... [2023-03-06 16:39:37,570][04475] Stopping RolloutWorker_w13... [2023-03-06 16:39:37,570][04277] Loop rollout_proc4_evt_loop terminating... [2023-03-06 16:39:37,570][04674] Stopping RolloutWorker_w28... [2023-03-06 16:39:37,570][04276] Loop rollout_proc3_evt_loop terminating... [2023-03-06 16:39:37,570][04473] Stopping RolloutWorker_w7... [2023-03-06 16:39:37,570][04435] Loop rollout_proc15_evt_loop terminating... [2023-03-06 16:39:37,570][04273] Stopping RolloutWorker_w1... [2023-03-06 16:39:37,570][04438] Loop rollout_proc16_evt_loop terminating... [2023-03-06 16:39:37,570][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000097658_100001792.pth... [2023-03-06 16:39:37,570][04480] Loop rollout_proc21_evt_loop terminating... [2023-03-06 16:39:37,570][04709] Stopping RolloutWorker_w31... 
[2023-03-06 16:39:37,570][04436] Loop rollout_proc20_evt_loop terminating... [2023-03-06 16:39:37,570][04471] Loop rollout_proc9_evt_loop terminating... [2023-03-06 16:39:37,570][04474] Loop rollout_proc8_evt_loop terminating... [2023-03-06 16:39:37,570][04275] Loop rollout_proc2_evt_loop terminating... [2023-03-06 16:39:37,570][04479] Loop rollout_proc14_evt_loop terminating... [2023-03-06 16:39:37,570][04476] Loop rollout_proc19_evt_loop terminating... [2023-03-06 16:39:37,570][04577] Loop rollout_proc26_evt_loop terminating... [2023-03-06 16:39:37,570][04475] Loop rollout_proc13_evt_loop terminating... [2023-03-06 16:39:37,571][04477] Loop rollout_proc12_evt_loop terminating... [2023-03-06 16:39:37,571][04709] Loop rollout_proc31_evt_loop terminating... [2023-03-06 16:39:37,571][04473] Loop rollout_proc7_evt_loop terminating... [2023-03-06 16:39:37,571][04273] Loop rollout_proc1_evt_loop terminating... [2023-03-06 16:39:37,571][04674] Loop rollout_proc28_evt_loop terminating... [2023-03-06 16:39:37,571][04545] Stopping RolloutWorker_w25... [2023-03-06 16:39:37,570][03942] Component RolloutWorker_w24 stopped! [2023-03-06 16:39:37,571][04472] Stopping RolloutWorker_w11... [2023-03-06 16:39:37,571][04545] Loop rollout_proc25_evt_loop terminating... [2023-03-06 16:39:37,571][04472] Loop rollout_proc11_evt_loop terminating... [2023-03-06 16:39:37,571][04642] Stopping RolloutWorker_w27... [2023-03-06 16:39:37,571][03942] Component RolloutWorker_w18 stopped! [2023-03-06 16:39:37,572][04642] Loop rollout_proc27_evt_loop terminating... [2023-03-06 16:39:37,572][03942] Component RolloutWorker_w5 stopped! [2023-03-06 16:39:37,572][03942] Component RolloutWorker_w23 stopped! [2023-03-06 16:39:37,572][03942] Component RolloutWorker_w0 stopped! [2023-03-06 16:39:37,572][03942] Component RolloutWorker_w17 stopped! [2023-03-06 16:39:37,573][03942] Component RolloutWorker_w22 stopped! [2023-03-06 16:39:37,573][03942] Component RolloutWorker_w4 stopped! 
[2023-03-06 16:39:37,573][03942] Component RolloutWorker_w3 stopped! [2023-03-06 16:39:37,573][03942] Component RolloutWorker_w15 stopped! [2023-03-06 16:39:37,574][03942] Component RolloutWorker_w16 stopped! [2023-03-06 16:39:37,574][03942] Component RolloutWorker_w21 stopped! [2023-03-06 16:39:37,574][03942] Component RolloutWorker_w20 stopped! [2023-03-06 16:39:37,575][03942] Component RolloutWorker_w9 stopped! [2023-03-06 16:39:37,575][03942] Component RolloutWorker_w8 stopped! [2023-03-06 16:39:37,575][03942] Component RolloutWorker_w2 stopped! [2023-03-06 16:39:37,576][03942] Component RolloutWorker_w14 stopped! [2023-03-06 16:39:37,576][03942] Component RolloutWorker_w12 stopped! [2023-03-06 16:39:37,577][03942] Component RolloutWorker_w28 stopped! [2023-03-06 16:39:37,577][03942] Component RolloutWorker_w26 stopped! [2023-03-06 16:39:37,577][03942] Component RolloutWorker_w19 stopped! [2023-03-06 16:39:37,578][03942] Component RolloutWorker_w13 stopped! [2023-03-06 16:39:37,578][03942] Component RolloutWorker_w7 stopped! [2023-03-06 16:39:37,578][03942] Component RolloutWorker_w1 stopped! [2023-03-06 16:39:37,579][03942] Component RolloutWorker_w31 stopped! [2023-03-06 16:39:37,570][04221] Stopping Batcher_0... [2023-03-06 16:39:37,579][03942] Component Batcher_0 stopped! [2023-03-06 16:39:37,579][04470] Stopping RolloutWorker_w6... [2023-03-06 16:39:37,579][03942] Component RolloutWorker_w25 stopped! [2023-03-06 16:39:37,579][04470] Loop rollout_proc6_evt_loop terminating... [2023-03-06 16:39:37,579][03942] Component RolloutWorker_w11 stopped! [2023-03-06 16:39:37,580][03942] Component RolloutWorker_w27 stopped! [2023-03-06 16:39:37,580][03942] Component RolloutWorker_w6 stopped! [2023-03-06 16:39:37,586][03942] Component RolloutWorker_w30 stopped! [2023-03-06 16:39:37,587][04708] Stopping RolloutWorker_w30... [2023-03-06 16:39:37,588][04708] Loop rollout_proc30_evt_loop terminating... [2023-03-06 16:39:37,596][04221] Loop batcher_evt_loop terminating... 
[2023-03-06 16:39:37,597][04676] Stopping RolloutWorker_w29... [2023-03-06 16:39:37,598][04676] Loop rollout_proc29_evt_loop terminating... [2023-03-06 16:39:37,598][03942] Component RolloutWorker_w29 stopped! [2023-03-06 16:39:37,624][04434] Stopping RolloutWorker_w10... [2023-03-06 16:39:37,625][04434] Loop rollout_proc10_evt_loop terminating... [2023-03-06 16:39:37,624][03942] Component RolloutWorker_w10 stopped! [2023-03-06 16:39:37,640][04272] Weights refcount: 2 0 [2023-03-06 16:39:37,642][04272] Stopping InferenceWorker_p0-w0... [2023-03-06 16:39:37,642][04272] Loop inference_proc0-0_evt_loop terminating... [2023-03-06 16:39:37,643][03942] Component InferenceWorker_p0-w0 stopped! [2023-03-06 16:39:37,682][04221] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000095827_98126848.pth [2023-03-06 16:39:37,691][04221] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000097658_100001792.pth... [2023-03-06 16:39:37,782][04221] Stopping LearnerWorker_p0... [2023-03-06 16:39:37,783][04221] Loop learner_proc0_evt_loop terminating... [2023-03-06 16:39:37,783][03942] Component LearnerWorker_p0 stopped! [2023-03-06 16:39:37,784][03942] Waiting for process learner_proc0 to stop... [2023-03-06 16:39:38,971][03942] Waiting for process inference_proc0-0 to join... [2023-03-06 16:39:38,972][03942] Waiting for process rollout_proc0 to join... [2023-03-06 16:39:38,972][03942] Waiting for process rollout_proc1 to join... [2023-03-06 16:39:38,972][03942] Waiting for process rollout_proc2 to join... [2023-03-06 16:39:38,972][03942] Waiting for process rollout_proc3 to join... [2023-03-06 16:39:38,973][03942] Waiting for process rollout_proc4 to join... [2023-03-06 16:39:38,973][03942] Waiting for process rollout_proc5 to join... [2023-03-06 16:39:38,973][03942] Waiting for process rollout_proc6 to join... [2023-03-06 16:39:38,973][03942] Waiting for process rollout_proc7 to join... 
[2023-03-06 16:39:38,974][03942] Waiting for process rollout_proc8 to join... [2023-03-06 16:39:38,974][03942] Waiting for process rollout_proc9 to join... [2023-03-06 16:39:38,974][03942] Waiting for process rollout_proc10 to join... [2023-03-06 16:39:38,974][03942] Waiting for process rollout_proc11 to join... [2023-03-06 16:39:38,974][03942] Waiting for process rollout_proc12 to join... [2023-03-06 16:39:38,975][03942] Waiting for process rollout_proc13 to join... [2023-03-06 16:39:38,975][03942] Waiting for process rollout_proc14 to join... [2023-03-06 16:39:38,975][03942] Waiting for process rollout_proc15 to join... [2023-03-06 16:39:38,975][03942] Waiting for process rollout_proc16 to join... [2023-03-06 16:39:38,976][03942] Waiting for process rollout_proc17 to join... [2023-03-06 16:39:38,976][03942] Waiting for process rollout_proc18 to join... [2023-03-06 16:39:38,976][03942] Waiting for process rollout_proc19 to join... [2023-03-06 16:39:38,976][03942] Waiting for process rollout_proc20 to join... [2023-03-06 16:39:38,977][03942] Waiting for process rollout_proc21 to join... [2023-03-06 16:39:38,977][03942] Waiting for process rollout_proc22 to join... [2023-03-06 16:39:38,977][03942] Waiting for process rollout_proc23 to join... [2023-03-06 16:39:38,977][03942] Waiting for process rollout_proc24 to join... [2023-03-06 16:39:38,978][03942] Waiting for process rollout_proc25 to join... [2023-03-06 16:39:38,978][03942] Waiting for process rollout_proc26 to join... [2023-03-06 16:39:38,978][03942] Waiting for process rollout_proc27 to join... [2023-03-06 16:39:38,978][03942] Waiting for process rollout_proc28 to join... [2023-03-06 16:39:38,979][03942] Waiting for process rollout_proc29 to join... [2023-03-06 16:39:38,979][03942] Waiting for process rollout_proc30 to join... [2023-03-06 16:39:38,979][03942] Waiting for process rollout_proc31 to join... 
[2023-03-06 16:39:38,979][03942] Batcher 0 profile tree view:
batching: 791.7678, releasing_batches: 1.6486
[2023-03-06 16:39:38,979][03942] InferenceWorker_p0-w0 profile tree view:
wait_policy: 0.0001
  wait_policy_total: 237.0456
update_model: 136.9497
  weight_update: 0.0007
one_step: 0.0098
  handle_policy_step: 7170.8511
    deserialize: 215.3171, stack: 37.2172, obs_to_device_normalize: 1226.6384, forward: 3289.7030, send_messages: 1388.8977
    prepare_outputs: 734.7773
      to_cpu: 366.1117
[2023-03-06 16:39:38,980][03942] Learner 0 profile tree view:
misc: 0.4297, prepare_batch: 379.8086
train: 887.1025
  epoch_init: 0.3808, minibatch_init: 0.3704, losses_postprocess: 32.1455, kl_divergence: 35.0721, after_optimizer: 123.9243
  calculate_losses: 291.1051
    losses_init: 0.1946, forward_head: 15.7076, bptt_initial: 106.2852, tail: 58.7599, advantages_returns: 7.3131, losses: 27.3594
    bptt: 66.7864
      bptt_forward_core: 64.4641
  update: 382.0613
    clip: 54.1955
[2023-03-06 16:39:38,980][03942] RolloutWorker_w0 profile tree view:
wait_for_trajectories: 3.1935, enqueue_policy_requests: 155.5981, env_step: 3152.0450, overhead: 131.2746, complete_rollouts: 7.7265
save_policy_outputs: 177.9378
  split_output_tensors: 86.9606
[2023-03-06 16:39:38,980][03942] RolloutWorker_w31 profile tree view:
wait_for_trajectories: 3.2845, enqueue_policy_requests: 158.8641, env_step: 3209.8810, overhead: 131.7818, complete_rollouts: 8.0789
save_policy_outputs: 180.2440
  split_output_tensors: 89.1498
[2023-03-06 16:39:38,980][03942] Loop Runner_EvtLoop terminating...
[2023-03-06 16:39:38,981][03942] Runner profile tree view:
main_loop: 7946.4586
[2023-03-06 16:39:38,981][03942] Collected {0: 100001792}, FPS: 12584.4