diff --git "a/sf_log.txt" "b/sf_log.txt" new file mode 100644--- /dev/null +++ "b/sf_log.txt" @@ -0,0 +1,2108 @@ +[2023-03-06 11:16:58,587][1834018] Saving configuration to /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/config.json... +[2023-03-06 11:16:58,601][1834018] Rollout worker 0 uses device cpu +[2023-03-06 11:16:58,601][1834018] Rollout worker 1 uses device cpu +[2023-03-06 11:16:58,601][1834018] Rollout worker 2 uses device cpu +[2023-03-06 11:16:58,601][1834018] Rollout worker 3 uses device cpu +[2023-03-06 11:16:58,602][1834018] Rollout worker 4 uses device cpu +[2023-03-06 11:16:58,602][1834018] Rollout worker 5 uses device cpu +[2023-03-06 11:16:58,602][1834018] Rollout worker 6 uses device cpu +[2023-03-06 11:16:58,602][1834018] Rollout worker 7 uses device cpu +[2023-03-06 11:16:58,602][1834018] Rollout worker 8 uses device cpu +[2023-03-06 11:16:58,602][1834018] Rollout worker 9 uses device cpu +[2023-03-06 11:16:58,602][1834018] Rollout worker 10 uses device cpu +[2023-03-06 11:16:58,603][1834018] Rollout worker 11 uses device cpu +[2023-03-06 11:16:58,603][1834018] Rollout worker 12 uses device cpu +[2023-03-06 11:16:58,603][1834018] Rollout worker 13 uses device cpu +[2023-03-06 11:16:58,603][1834018] Rollout worker 14 uses device cpu +[2023-03-06 11:16:58,603][1834018] Rollout worker 15 uses device cpu +[2023-03-06 11:16:58,603][1834018] Rollout worker 16 uses device cpu +[2023-03-06 11:16:58,603][1834018] Rollout worker 17 uses device cpu +[2023-03-06 11:16:58,604][1834018] Rollout worker 18 uses device cpu +[2023-03-06 11:16:58,604][1834018] Rollout worker 19 uses device cpu +[2023-03-06 11:16:58,604][1834018] Rollout worker 20 uses device cpu +[2023-03-06 11:16:58,604][1834018] Rollout worker 21 uses device cpu +[2023-03-06 11:16:58,604][1834018] Rollout worker 22 uses device cpu +[2023-03-06 11:16:58,604][1834018] Rollout worker 23 uses device cpu +[2023-03-06 11:16:58,604][1834018] Rollout worker 24 uses device cpu +[2023-03-06 11:16:58,605][1834018] Rollout worker 25 uses device cpu +[2023-03-06 11:16:58,605][1834018] Rollout worker 26 uses device cpu +[2023-03-06 11:16:58,605][1834018] Rollout worker 27 uses device cpu +[2023-03-06 11:16:58,605][1834018] Rollout worker 28 uses device cpu +[2023-03-06 11:16:58,605][1834018] Rollout worker 29 uses device cpu +[2023-03-06 11:16:58,605][1834018] Rollout worker 30 uses device cpu +[2023-03-06 11:16:58,605][1834018] Rollout worker 31 uses device cpu +[2023-03-06 11:16:58,621][1834018] Using GPUs [0] for process 0 (actually maps to GPUs [0]) +[2023-03-06 11:16:58,621][1834018] InferenceWorker_p0-w0: min num requests: 10 +[2023-03-06 11:16:58,686][1834018] Starting all processes... +[2023-03-06 11:16:58,686][1834018] Starting process learner_proc0 +[2023-03-06 11:16:58,736][1834018] Starting all processes... +[2023-03-06 11:16:58,781][1834018] Starting process inference_proc0-0 +[2023-03-06 11:16:58,789][1834018] Starting process rollout_proc0 +[2023-03-06 11:16:58,790][1834018] Starting process rollout_proc1 +[2023-03-06 11:16:58,790][1834018] Starting process rollout_proc2 +[2023-03-06 11:16:58,790][1834018] Starting process rollout_proc3 +[2023-03-06 11:16:58,790][1834018] Starting process rollout_proc4 +[2023-03-06 11:16:58,791][1834018] Starting process rollout_proc5 +[2023-03-06 11:16:58,791][1834018] Starting process rollout_proc6 +[2023-03-06 11:16:58,791][1834018] Starting process rollout_proc7 +[2023-03-06 11:16:58,791][1834018] Starting process rollout_proc8 +[2023-03-06 11:16:58,791][1834018] Starting process rollout_proc9 +[2023-03-06 11:16:58,793][1834018] Starting process rollout_proc10 +[2023-03-06 11:16:58,800][1834018] Starting process rollout_proc11 +[2023-03-06 11:16:58,803][1834018] Starting process rollout_proc12 +[2023-03-06 11:16:58,808][1834018] Starting process rollout_proc13 +[2023-03-06 11:16:58,813][1834018] Starting process rollout_proc14 +[2023-03-06 11:16:58,813][1834018] Starting process rollout_proc15 +[2023-03-06 11:16:58,813][1834018] Starting process rollout_proc16 +[2023-03-06 11:16:58,814][1834018] Starting process rollout_proc17 +[2023-03-06 11:16:58,816][1834018] Starting process rollout_proc18 +[2023-03-06 11:16:58,821][1834018] Starting process rollout_proc19 +[2023-03-06 11:16:58,827][1834018] Starting process rollout_proc20 +[2023-03-06 11:16:58,832][1834018] Starting process rollout_proc21 +[2023-03-06 11:16:58,849][1834018] Starting process rollout_proc22 +[2023-03-06 11:16:58,860][1834018] Starting process rollout_proc23 +[2023-03-06 11:16:58,879][1834018] Starting process rollout_proc24 +[2023-03-06 11:16:58,899][1834018] Starting process rollout_proc25 +[2023-03-06 11:16:58,908][1834018] Starting process rollout_proc26 +[2023-03-06 11:16:58,920][1834018] Starting process rollout_proc27 +[2023-03-06 11:16:58,927][1834018] Starting process rollout_proc28 +[2023-03-06 11:16:58,929][1834018] Starting process rollout_proc29 +[2023-03-06 11:16:58,931][1834018] Starting process rollout_proc30 +[2023-03-06 11:16:58,949][1834018] Starting process rollout_proc31 +[2023-03-06 11:17:00,592][1834298] Using GPUs [0] for process 0 (actually maps to GPUs [0]) +[2023-03-06 11:17:00,592][1834298] Set environment var CUDA_VISIBLE_DEVICES to '0' (GPU indices [0]) for learning process 0 +[2023-03-06 11:17:00,601][1834298] Num visible devices: 1 +[2023-03-06 11:17:00,655][1834298] WARNING! It is generally recommended to enable Fixed KL loss (https://arxiv.org/pdf/1707.06347.pdf) for continuous action tasks to avoid potential numerical issues. I.e. set --kl_loss_coeff=0.1 +[2023-03-06 11:17:00,655][1834298] Starting seed is not provided +[2023-03-06 11:17:00,655][1834298] Using GPUs [0] for process 0 (actually maps to GPUs [0]) +[2023-03-06 11:17:00,656][1834298] Initializing actor-critic model on device cuda:0 +[2023-03-06 11:17:00,656][1834298] RunningMeanStd input shape: (39,) +[2023-03-06 11:17:00,656][1834298] RunningMeanStd input shape: (1,) +[2023-03-06 11:17:00,797][1834298] Created Actor Critic model with architecture: +[2023-03-06 11:17:00,797][1834298] ActorCriticSharedWeights( + (obs_normalizer): ObservationNormalizer( + (running_mean_std): RunningMeanStdDictInPlace( + (running_mean_std): ModuleDict( + (obs): RunningMeanStdInPlace() + ) + ) + ) + (returns_normalizer): RecursiveScriptModule(original_name=RunningMeanStdInPlace) + (encoder): MultiInputEncoder( + (encoders): ModuleDict( + (obs): MlpEncoder( + (mlp_head): RecursiveScriptModule( + original_name=Sequential + (0): RecursiveScriptModule(original_name=Linear) + (1): RecursiveScriptModule(original_name=ELU) + (2): RecursiveScriptModule(original_name=Linear) + (3): RecursiveScriptModule(original_name=ELU) + ) + ) + ) + ) + (core): ModelCoreRNN( + (core): GRU(512, 512) + ) + (decoder): MlpDecoder( + (mlp): Identity() + ) + (critic_linear): Linear(in_features=512, out_features=1, bias=True) + (action_parameterization): ActionParameterizationDefault( + (distribution_linear): Linear(in_features=512, out_features=8, bias=True) + ) +) +[2023-03-06 11:17:00,810][1834350] Worker 1 uses CPU cores [1] +[2023-03-06 11:17:00,922][1834355] Worker 5 uses CPU cores [5] +[2023-03-06 11:17:00,926][1834812] Worker 26 uses CPU cores [26] +[2023-03-06 11:17:01,099][1834482] Worker 8 uses CPU cores [8] +[2023-03-06 11:17:01,266][1834813] Worker 29 uses CPU cores [29] +[2023-03-06 11:17:01,379][1834744] Worker 22 uses CPU cores [22] +[2023-03-06 11:17:01,420][1834533] Worker 16 uses CPU cores [16] +[2023-03-06 11:17:01,577][1834354] Worker 4 uses CPU cores [4] +[2023-03-06 11:17:01,698][1834351] Worker 0 uses CPU cores [0] +[2023-03-06 11:17:01,768][1834387] Worker 6 uses CPU cores [6] +[2023-03-06 11:17:01,946][1834554] Worker 18 uses CPU cores [18] +[2023-03-06 11:17:01,970][1834517] Worker 13 uses CPU cores [13] +[2023-03-06 11:17:02,074][1834617] Worker 19 uses CPU cores [19] +[2023-03-06 11:17:02,150][1834518] Worker 12 uses CPU cores [12] +[2023-03-06 11:17:02,317][1834352] Worker 2 uses CPU cores [2] +[2023-03-06 11:17:02,387][1834419] Worker 7 uses CPU cores [7] +[2023-03-06 11:17:02,474][1834298] Using optimizer +[2023-03-06 11:17:02,475][1834298] No checkpoints found +[2023-03-06 11:17:02,475][1834298] Did not load from checkpoint, starting from scratch! +[2023-03-06 11:17:02,475][1834298] Initialized policy 0 weights for model version 0 +[2023-03-06 11:17:02,477][1834298] LearnerWorker_p0 finished initialization! +[2023-03-06 11:17:02,477][1834298] Using GPUs [0] for process 0 (actually maps to GPUs [0]) +[2023-03-06 11:17:02,546][1834745] Worker 24 uses CPU cores [24] +[2023-03-06 11:17:02,606][1834680] Worker 20 uses CPU cores [20] +[2023-03-06 11:17:02,694][1834521] Worker 17 uses CPU cores [17] +[2023-03-06 11:17:02,758][1834748] Worker 27 uses CPU cores [27] +[2023-03-06 11:17:02,958][1834780] Worker 28 uses CPU cores [28] +[2023-03-06 11:17:02,988][1834349] Using GPUs [0] for process 0 (actually maps to GPUs [0]) +[2023-03-06 11:17:02,988][1834349] Set environment var CUDA_VISIBLE_DEVICES to '0' (GPU indices [0]) for inference process 0 +[2023-03-06 11:17:02,998][1834349] Num visible devices: 1 +[2023-03-06 11:17:03,116][1834349] RunningMeanStd input shape: (39,) +[2023-03-06 11:17:03,117][1834349] RunningMeanStd input shape: (1,) +[2023-03-06 11:17:03,204][1834846] Worker 31 uses CPU cores [31] +[2023-03-06 11:17:03,215][1834519] Worker 15 uses CPU cores [15] +[2023-03-06 11:17:03,312][1834845] Worker 30 uses CPU cores [30] +[2023-03-06 11:17:03,430][1834746] Worker 25 uses CPU cores [25] +[2023-03-06 11:17:03,574][1834747] Worker 23 uses CPU cores [23] +[2023-03-06 11:17:03,585][1834353] Worker 3 uses CPU cores [3] +[2023-03-06 11:17:03,760][1834516] Worker 11 uses CPU cores [11] +[2023-03-06 11:17:03,802][1834520] Worker 14 uses CPU cores [14] +[2023-03-06 11:17:03,833][1834515] Worker 10 uses CPU cores [10] +[2023-03-06 11:17:03,840][1834018] Inference worker 0-0 is ready! +[2023-03-06 11:17:03,840][1834018] All inference workers are ready! Signal rollout workers to start! +[2023-03-06 11:17:04,073][1834514] Worker 9 uses CPU cores [9] +[2023-03-06 11:17:04,359][1834743] Worker 21 uses CPU cores [21] +[2023-03-06 11:17:05,317][1834018] Fps is (10 sec: nan, 60 sec: nan, 300 sec: nan). Total num frames: 0. Throughput: 0: nan. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0) +[2023-03-06 11:17:05,769][1834355] Decorrelating experience for 0 frames... +[2023-03-06 11:17:05,923][1834746] Decorrelating experience for 0 frames... +[2023-03-06 11:17:06,002][1834845] Decorrelating experience for 0 frames... +[2023-03-06 11:17:06,011][1834748] Decorrelating experience for 0 frames... +[2023-03-06 11:17:06,036][1834519] Decorrelating experience for 0 frames... +[2023-03-06 11:17:06,052][1834812] Decorrelating experience for 0 frames... +[2023-03-06 11:17:06,064][1834419] Decorrelating experience for 0 frames... +[2023-03-06 11:17:06,064][1834617] Decorrelating experience for 0 frames... +[2023-03-06 11:17:06,128][1834518] Decorrelating experience for 0 frames... +[2023-03-06 11:17:06,207][1834351] Decorrelating experience for 0 frames... +[2023-03-06 11:17:06,207][1834533] Decorrelating experience for 0 frames... +[2023-03-06 11:17:06,210][1834744] Decorrelating experience for 0 frames... +[2023-03-06 11:17:06,215][1834745] Decorrelating experience for 0 frames... +[2023-03-06 11:17:06,216][1834482] Decorrelating experience for 0 frames... +[2023-03-06 11:17:06,219][1834780] Decorrelating experience for 0 frames... +[2023-03-06 11:17:06,229][1834680] Decorrelating experience for 0 frames... +[2023-03-06 11:17:06,233][1834354] Decorrelating experience for 0 frames... +[2023-03-06 11:17:06,240][1834747] Decorrelating experience for 0 frames... +[2023-03-06 11:17:06,249][1834350] Decorrelating experience for 0 frames... +[2023-03-06 11:17:06,250][1834521] Decorrelating experience for 0 frames... +[2023-03-06 11:17:06,252][1834352] Decorrelating experience for 0 frames... +[2023-03-06 11:17:06,252][1834554] Decorrelating experience for 0 frames... +[2023-03-06 11:17:06,269][1834813] Decorrelating experience for 0 frames... +[2023-03-06 11:17:06,271][1834387] Decorrelating experience for 0 frames... +[2023-03-06 11:17:06,274][1834846] Decorrelating experience for 0 frames... +[2023-03-06 11:17:06,292][1834353] Decorrelating experience for 0 frames... +[2023-03-06 11:17:06,318][1834517] Decorrelating experience for 0 frames... +[2023-03-06 11:17:06,380][1834520] Decorrelating experience for 0 frames... +[2023-03-06 11:17:06,388][1834516] Decorrelating experience for 0 frames... +[2023-03-06 11:17:06,460][1834515] Decorrelating experience for 0 frames... +[2023-03-06 11:17:06,738][1834514] Decorrelating experience for 0 frames... +[2023-03-06 11:17:07,062][1834743] Decorrelating experience for 0 frames... +[2023-03-06 11:17:08,107][1834355] Decorrelating experience for 32 frames... +[2023-03-06 11:17:08,178][1834746] Decorrelating experience for 32 frames... +[2023-03-06 11:17:08,268][1834748] Decorrelating experience for 32 frames... +[2023-03-06 11:17:08,270][1834845] Decorrelating experience for 32 frames... +[2023-03-06 11:17:08,280][1834519] Decorrelating experience for 32 frames... +[2023-03-06 11:17:08,305][1834812] Decorrelating experience for 32 frames... +[2023-03-06 11:17:08,343][1834419] Decorrelating experience for 32 frames... +[2023-03-06 11:17:08,356][1834617] Decorrelating experience for 32 frames... +[2023-03-06 11:17:08,420][1834518] Decorrelating experience for 32 frames... +[2023-03-06 11:17:08,438][1834846] Decorrelating experience for 32 frames... +[2023-03-06 11:17:08,456][1834533] Decorrelating experience for 32 frames... +[2023-03-06 11:17:08,458][1834351] Decorrelating experience for 32 frames... +[2023-03-06 11:17:08,470][1834747] Decorrelating experience for 32 frames... +[2023-03-06 11:17:08,472][1834780] Decorrelating experience for 32 frames... +[2023-03-06 11:17:08,503][1834353] Decorrelating experience for 32 frames... +[2023-03-06 11:17:08,508][1834520] Decorrelating experience for 32 frames... +[2023-03-06 11:17:08,509][1834482] Decorrelating experience for 32 frames... +[2023-03-06 11:17:08,524][1834516] Decorrelating experience for 32 frames... +[2023-03-06 11:17:08,534][1834680] Decorrelating experience for 32 frames... +[2023-03-06 11:17:08,540][1834354] Decorrelating experience for 32 frames... +[2023-03-06 11:17:08,553][1834521] Decorrelating experience for 32 frames... +[2023-03-06 11:17:08,554][1834813] Decorrelating experience for 32 frames... +[2023-03-06 11:17:08,557][1834745] Decorrelating experience for 32 frames... +[2023-03-06 11:17:08,568][1834744] Decorrelating experience for 32 frames... +[2023-03-06 11:17:08,570][1834515] Decorrelating experience for 32 frames... +[2023-03-06 11:17:08,573][1834554] Decorrelating experience for 32 frames... +[2023-03-06 11:17:08,575][1834352] Decorrelating experience for 32 frames... +[2023-03-06 11:17:08,577][1834350] Decorrelating experience for 32 frames... +[2023-03-06 11:17:08,582][1834517] Decorrelating experience for 32 frames... +[2023-03-06 11:17:08,605][1834387] Decorrelating experience for 32 frames... +[2023-03-06 11:17:08,689][1834514] Decorrelating experience for 32 frames... +[2023-03-06 11:17:08,939][1834743] Decorrelating experience for 32 frames... +[2023-03-06 11:17:09,060][1834298] Signal inference workers to stop experience collection... +[2023-03-06 11:17:09,064][1834349] InferenceWorker_p0-w0: stopping experience collection +[2023-03-06 11:17:09,443][1834298] Signal inference workers to resume experience collection... +[2023-03-06 11:17:09,444][1834349] InferenceWorker_p0-w0: resuming experience collection +[2023-03-06 11:17:10,317][1834018] Fps is (10 sec: 1228.8, 60 sec: 1228.8, 300 sec: 1228.8). Total num frames: 6144. Throughput: 0: 836.6. Samples: 4183. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-06 11:17:10,617][1834349] Updated weights for policy 0, policy_version 10 (0.0205) +[2023-03-06 11:17:11,400][1834349] Updated weights for policy 0, policy_version 20 (0.0007) +[2023-03-06 11:17:12,234][1834349] Updated weights for policy 0, policy_version 30 (0.0005) +[2023-03-06 11:17:13,019][1834349] Updated weights for policy 0, policy_version 40 (0.0007) +[2023-03-06 11:17:13,800][1834349] Updated weights for policy 0, policy_version 50 (0.0006) +[2023-03-06 11:17:14,582][1834349] Updated weights for policy 0, policy_version 60 (0.0007) +[2023-03-06 11:17:15,317][1834018] Fps is (10 sec: 7065.7, 60 sec: 7065.7, 300 sec: 7065.7). Total num frames: 70656. Throughput: 0: 4027.7. Samples: 40277. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) +[2023-03-06 11:17:15,324][1834018] Avg episode reward: [(0, '211.109')] +[2023-03-06 11:17:15,403][1834349] Updated weights for policy 0, policy_version 70 (0.0006) +[2023-03-06 11:17:16,181][1834349] Updated weights for policy 0, policy_version 80 (0.0007) +[2023-03-06 11:17:16,986][1834349] Updated weights for policy 0, policy_version 90 (0.0007) +[2023-03-06 11:17:17,792][1834349] Updated weights for policy 0, policy_version 100 (0.0006) +[2023-03-06 11:17:18,567][1834349] Updated weights for policy 0, policy_version 110 (0.0006) +[2023-03-06 11:17:18,616][1834018] Heartbeat connected on Batcher_0 +[2023-03-06 11:17:18,624][1834018] Heartbeat connected on RolloutWorker_w0 +[2023-03-06 11:17:18,625][1834018] Heartbeat connected on RolloutWorker_w1 +[2023-03-06 11:17:18,627][1834018] Heartbeat connected on InferenceWorker_p0-w0 +[2023-03-06 11:17:18,628][1834018] Heartbeat connected on LearnerWorker_p0 +[2023-03-06 11:17:18,628][1834018] Heartbeat connected on RolloutWorker_w2 +[2023-03-06 11:17:18,629][1834018] Heartbeat connected on RolloutWorker_w3 +[2023-03-06 11:17:18,632][1834018] Heartbeat connected on RolloutWorker_w4 +[2023-03-06 11:17:18,634][1834018] Heartbeat connected on RolloutWorker_w5 +[2023-03-06 11:17:18,636][1834018] Heartbeat connected on RolloutWorker_w6 +[2023-03-06 11:17:18,639][1834018] Heartbeat connected on RolloutWorker_w7 +[2023-03-06 11:17:18,639][1834018] Heartbeat connected on RolloutWorker_w8 +[2023-03-06 11:17:18,642][1834018] Heartbeat connected on RolloutWorker_w9 +[2023-03-06 11:17:18,643][1834018] Heartbeat connected on RolloutWorker_w10 +[2023-03-06 11:17:18,646][1834018] Heartbeat connected on RolloutWorker_w11 +[2023-03-06 11:17:18,647][1834018] Heartbeat connected on RolloutWorker_w12 +[2023-03-06 11:17:18,649][1834018] Heartbeat connected on RolloutWorker_w13 +[2023-03-06 11:17:18,652][1834018] Heartbeat connected on RolloutWorker_w14 +[2023-03-06 11:17:18,653][1834018] Heartbeat connected on RolloutWorker_w15 +[2023-03-06 11:17:18,655][1834018] Heartbeat connected on RolloutWorker_w16 +[2023-03-06 11:17:18,657][1834018] Heartbeat connected on RolloutWorker_w17 +[2023-03-06 11:17:18,659][1834018] Heartbeat connected on RolloutWorker_w18 +[2023-03-06 11:17:18,661][1834018] Heartbeat connected on RolloutWorker_w19 +[2023-03-06 11:17:18,663][1834018] Heartbeat connected on RolloutWorker_w20 +[2023-03-06 11:17:18,664][1834018] Heartbeat connected on RolloutWorker_w21 +[2023-03-06 11:17:18,666][1834018] Heartbeat connected on RolloutWorker_w22 +[2023-03-06 11:17:18,669][1834018] Heartbeat connected on RolloutWorker_w23 +[2023-03-06 11:17:18,671][1834018] Heartbeat connected on RolloutWorker_w24 +[2023-03-06 11:17:18,673][1834018] Heartbeat connected on RolloutWorker_w25 +[2023-03-06 11:17:18,674][1834018] Heartbeat connected on RolloutWorker_w26 +[2023-03-06 11:17:18,676][1834018] Heartbeat connected on RolloutWorker_w27 +[2023-03-06 11:17:18,678][1834018] Heartbeat connected on RolloutWorker_w28 +[2023-03-06 11:17:18,681][1834018] Heartbeat connected on RolloutWorker_w29 +[2023-03-06 11:17:18,682][1834018] Heartbeat connected on RolloutWorker_w30 +[2023-03-06 11:17:18,684][1834018] Heartbeat connected on RolloutWorker_w31 +[2023-03-06 11:17:19,345][1834349] Updated weights for policy 0, policy_version 120 (0.0007) +[2023-03-06 11:17:20,165][1834349] Updated weights for policy 0, policy_version 130 (0.0007) +[2023-03-06 11:17:20,317][1834018] Fps is (10 sec: 12800.0, 60 sec: 8943.0, 300 sec: 8943.0). Total num frames: 134144. Throughput: 0: 7836.5. Samples: 117547. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 11:17:20,317][1834018] Avg episode reward: [(0, '197.787')] +[2023-03-06 11:17:20,321][1834298] Saving new best policy, reward=197.787! +[2023-03-06 11:17:20,951][1834349] Updated weights for policy 0, policy_version 140 (0.0007) +[2023-03-06 11:17:21,769][1834349] Updated weights for policy 0, policy_version 150 (0.0007) +[2023-03-06 11:17:22,585][1834349] Updated weights for policy 0, policy_version 160 (0.0006) +[2023-03-06 11:17:23,376][1834349] Updated weights for policy 0, policy_version 170 (0.0007) +[2023-03-06 11:17:24,189][1834349] Updated weights for policy 0, policy_version 180 (0.0006) +[2023-03-06 11:17:25,002][1834349] Updated weights for policy 0, policy_version 190 (0.0006) +[2023-03-06 11:17:25,317][1834018] Fps is (10 sec: 12800.0, 60 sec: 9932.9, 300 sec: 9932.9). Total num frames: 198656. Throughput: 0: 9684.2. Samples: 193682. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) +[2023-03-06 11:17:25,317][1834018] Avg episode reward: [(0, '191.561')] +[2023-03-06 11:17:25,793][1834349] Updated weights for policy 0, policy_version 200 (0.0007) +[2023-03-06 11:17:26,592][1834349] Updated weights for policy 0, policy_version 210 (0.0006) +[2023-03-06 11:17:27,409][1834349] Updated weights for policy 0, policy_version 220 (0.0007) +[2023-03-06 11:17:28,197][1834349] Updated weights for policy 0, policy_version 230 (0.0007) +[2023-03-06 11:17:28,998][1834349] Updated weights for policy 0, policy_version 240 (0.0007) +[2023-03-06 11:17:29,821][1834349] Updated weights for policy 0, policy_version 250 (0.0007) +[2023-03-06 11:17:30,317][1834018] Fps is (10 sec: 12800.0, 60 sec: 10485.8, 300 sec: 10485.8). Total num frames: 262144. Throughput: 0: 9290.5. Samples: 232262. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) +[2023-03-06 11:17:30,317][1834018] Avg episode reward: [(0, '180.363')] +[2023-03-06 11:17:30,612][1834349] Updated weights for policy 0, policy_version 260 (0.0007) +[2023-03-06 11:17:31,405][1834349] Updated weights for policy 0, policy_version 270 (0.0006) +[2023-03-06 11:17:32,225][1834349] Updated weights for policy 0, policy_version 280 (0.0006) +[2023-03-06 11:17:33,016][1834349] Updated weights for policy 0, policy_version 290 (0.0006) +[2023-03-06 11:17:33,812][1834349] Updated weights for policy 0, policy_version 300 (0.0006) +[2023-03-06 11:17:34,616][1834349] Updated weights for policy 0, policy_version 310 (0.0006) +[2023-03-06 11:17:35,317][1834018] Fps is (10 sec: 12697.5, 60 sec: 10854.4, 300 sec: 10854.4). Total num frames: 325632. Throughput: 0: 10296.8. Samples: 308904. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) +[2023-03-06 11:17:35,317][1834018] Avg episode reward: [(0, '213.396')] +[2023-03-06 11:17:35,321][1834298] Saving new best policy, reward=213.396! +[2023-03-06 11:17:35,413][1834349] Updated weights for policy 0, policy_version 320 (0.0007) +[2023-03-06 11:17:36,205][1834349] Updated weights for policy 0, policy_version 330 (0.0007) +[2023-03-06 11:17:37,028][1834349] Updated weights for policy 0, policy_version 340 (0.0007) +[2023-03-06 11:17:37,812][1834349] Updated weights for policy 0, policy_version 350 (0.0007) +[2023-03-06 11:17:38,586][1834349] Updated weights for policy 0, policy_version 360 (0.0006) +[2023-03-06 11:17:39,398][1834349] Updated weights for policy 0, policy_version 370 (0.0006) +[2023-03-06 11:17:40,224][1834349] Updated weights for policy 0, policy_version 380 (0.0007) +[2023-03-06 11:17:40,317][1834018] Fps is (10 sec: 12799.8, 60 sec: 11147.0, 300 sec: 11147.0). Total num frames: 390144. Throughput: 0: 11012.4. Samples: 385433. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) +[2023-03-06 11:17:40,317][1834018] Avg episode reward: [(0, '215.275')] +[2023-03-06 11:17:40,318][1834298] Saving new best policy, reward=215.275! +[2023-03-06 11:17:41,014][1834349] Updated weights for policy 0, policy_version 390 (0.0006) +[2023-03-06 11:17:41,808][1834349] Updated weights for policy 0, policy_version 400 (0.0006) +[2023-03-06 11:17:42,618][1834349] Updated weights for policy 0, policy_version 410 (0.0006) +[2023-03-06 11:17:43,413][1834349] Updated weights for policy 0, policy_version 420 (0.0007) +[2023-03-06 11:17:44,219][1834349] Updated weights for policy 0, policy_version 430 (0.0006) +[2023-03-06 11:17:45,046][1834349] Updated weights for policy 0, policy_version 440 (0.0007) +[2023-03-06 11:17:45,317][1834018] Fps is (10 sec: 12800.0, 60 sec: 11340.8, 300 sec: 11340.8). Total num frames: 453632. Throughput: 0: 10597.7. Samples: 423909. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 11:17:45,317][1834018] Avg episode reward: [(0, '275.411')] +[2023-03-06 11:17:45,321][1834298] Saving new best policy, reward=275.411! +[2023-03-06 11:17:45,847][1834349] Updated weights for policy 0, policy_version 450 (0.0006) +[2023-03-06 11:17:46,645][1834349] Updated weights for policy 0, policy_version 460 (0.0006) +[2023-03-06 11:17:47,476][1834349] Updated weights for policy 0, policy_version 470 (0.0006) +[2023-03-06 11:17:48,279][1834349] Updated weights for policy 0, policy_version 480 (0.0007) +[2023-03-06 11:17:49,068][1834349] Updated weights for policy 0, policy_version 490 (0.0006) +[2023-03-06 11:17:49,889][1834349] Updated weights for policy 0, policy_version 500 (0.0007) +[2023-03-06 11:17:50,317][1834018] Fps is (10 sec: 12697.8, 60 sec: 11491.6, 300 sec: 11491.6). Total num frames: 517120. Throughput: 0: 11113.8. Samples: 500118. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 11:17:50,317][1834018] Avg episode reward: [(0, '319.853')] +[2023-03-06 11:17:50,318][1834298] Saving new best policy, reward=319.853! +[2023-03-06 11:17:50,688][1834349] Updated weights for policy 0, policy_version 510 (0.0007) +[2023-03-06 11:17:51,483][1834349] Updated weights for policy 0, policy_version 520 (0.0006) +[2023-03-06 11:17:52,297][1834349] Updated weights for policy 0, policy_version 530 (0.0006) +[2023-03-06 11:17:53,123][1834349] Updated weights for policy 0, policy_version 540 (0.0006) +[2023-03-06 11:17:53,915][1834349] Updated weights for policy 0, policy_version 550 (0.0006) +[2023-03-06 11:17:54,719][1834349] Updated weights for policy 0, policy_version 560 (0.0006) +[2023-03-06 11:17:55,317][1834018] Fps is (10 sec: 12697.6, 60 sec: 11612.2, 300 sec: 11612.2). Total num frames: 580608. Throughput: 0: 12711.9. Samples: 576217. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 11:17:55,317][1834018] Avg episode reward: [(0, '279.825')] +[2023-03-06 11:17:55,542][1834349] Updated weights for policy 0, policy_version 570 (0.0006) +[2023-03-06 11:17:56,343][1834349] Updated weights for policy 0, policy_version 580 (0.0006) +[2023-03-06 11:17:57,153][1834349] Updated weights for policy 0, policy_version 590 (0.0007) +[2023-03-06 11:17:57,972][1834349] Updated weights for policy 0, policy_version 600 (0.0006) +[2023-03-06 11:17:58,777][1834349] Updated weights for policy 0, policy_version 610 (0.0006) +[2023-03-06 11:17:59,586][1834349] Updated weights for policy 0, policy_version 620 (0.0006) +[2023-03-06 11:18:00,317][1834018] Fps is (10 sec: 12697.5, 60 sec: 11710.8, 300 sec: 11710.8). Total num frames: 644096. Throughput: 0: 12751.0. Samples: 614072. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 11:18:00,317][1834018] Avg episode reward: [(0, '277.740')] +[2023-03-06 11:18:00,394][1834349] Updated weights for policy 0, policy_version 630 (0.0007) +[2023-03-06 11:18:01,202][1834349] Updated weights for policy 0, policy_version 640 (0.0006) +[2023-03-06 11:18:01,988][1834349] Updated weights for policy 0, policy_version 650 (0.0007) +[2023-03-06 11:18:02,811][1834349] Updated weights for policy 0, policy_version 660 (0.0007) +[2023-03-06 11:18:03,635][1834349] Updated weights for policy 0, policy_version 670 (0.0006) +[2023-03-06 11:18:04,439][1834349] Updated weights for policy 0, policy_version 680 (0.0007) +[2023-03-06 11:18:05,260][1834349] Updated weights for policy 0, policy_version 690 (0.0006) +[2023-03-06 11:18:05,317][1834018] Fps is (10 sec: 12595.3, 60 sec: 11776.0, 300 sec: 11776.0). Total num frames: 706560. Throughput: 0: 12723.9. Samples: 690121. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 11:18:05,317][1834018] Avg episode reward: [(0, '369.632')] +[2023-03-06 11:18:05,327][1834298] Saving new best policy, reward=369.632! +[2023-03-06 11:18:06,060][1834349] Updated weights for policy 0, policy_version 700 (0.0006) +[2023-03-06 11:18:06,880][1834349] Updated weights for policy 0, policy_version 710 (0.0007) +[2023-03-06 11:18:07,713][1834349] Updated weights for policy 0, policy_version 720 (0.0006) +[2023-03-06 11:18:08,496][1834349] Updated weights for policy 0, policy_version 730 (0.0006) +[2023-03-06 11:18:09,285][1834349] Updated weights for policy 0, policy_version 740 (0.0006) +[2023-03-06 11:18:10,105][1834349] Updated weights for policy 0, policy_version 750 (0.0006) +[2023-03-06 11:18:10,317][1834018] Fps is (10 sec: 12595.2, 60 sec: 12731.7, 300 sec: 11846.9). Total num frames: 770048. Throughput: 0: 12716.3. Samples: 765917. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) +[2023-03-06 11:18:10,317][1834018] Avg episode reward: [(0, '339.455')] +[2023-03-06 11:18:10,914][1834349] Updated weights for policy 0, policy_version 760 (0.0006) +[2023-03-06 11:18:11,728][1834349] Updated weights for policy 0, policy_version 770 (0.0006) +[2023-03-06 11:18:12,531][1834349] Updated weights for policy 0, policy_version 780 (0.0007) +[2023-03-06 11:18:13,374][1834349] Updated weights for policy 0, policy_version 790 (0.0007) +[2023-03-06 11:18:14,169][1834349] Updated weights for policy 0, policy_version 800 (0.0007) +[2023-03-06 11:18:14,994][1834349] Updated weights for policy 0, policy_version 810 (0.0006) +[2023-03-06 11:18:15,317][1834018] Fps is (10 sec: 12595.2, 60 sec: 12697.6, 300 sec: 11893.0). Total num frames: 832512. Throughput: 0: 12694.3. Samples: 803508. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) +[2023-03-06 11:18:15,317][1834018] Avg episode reward: [(0, '428.337')] +[2023-03-06 11:18:15,324][1834298] Saving new best policy, reward=428.337! +[2023-03-06 11:18:15,816][1834349] Updated weights for policy 0, policy_version 820 (0.0006) +[2023-03-06 11:18:16,600][1834349] Updated weights for policy 0, policy_version 830 (0.0006) +[2023-03-06 11:18:17,423][1834349] Updated weights for policy 0, policy_version 840 (0.0006) +[2023-03-06 11:18:18,227][1834349] Updated weights for policy 0, policy_version 850 (0.0007) +[2023-03-06 11:18:19,044][1834349] Updated weights for policy 0, policy_version 860 (0.0006) +[2023-03-06 11:18:19,485][1834018] Keyboard interrupt detected in the event loop EvtLoop [Runner_EvtLoop, process=main process 1834018], exiting... +[2023-03-06 11:18:19,486][1834355] Stopping RolloutWorker_w5... +[2023-03-06 11:18:19,486][1834521] Stopping RolloutWorker_w17... +[2023-03-06 11:18:19,486][1834554] Stopping RolloutWorker_w18... +[2023-03-06 11:18:19,486][1834617] Stopping RolloutWorker_w19... +[2023-03-06 11:18:19,486][1834350] Stopping RolloutWorker_w1... +[2023-03-06 11:18:19,486][1834743] Stopping RolloutWorker_w21... +[2023-03-06 11:18:19,486][1834745] Stopping RolloutWorker_w24... +[2023-03-06 11:18:19,486][1834298] Stopping Batcher_0... +[2023-03-06 11:18:19,486][1834353] Stopping RolloutWorker_w3... +[2023-03-06 11:18:19,486][1834018] Runner profile tree view: +main_loop: 80.8005 +[2023-03-06 11:18:19,486][1834846] Stopping RolloutWorker_w31... +[2023-03-06 11:18:19,486][1834554] Loop rollout_proc18_evt_loop terminating... +[2023-03-06 11:18:19,486][1834521] Loop rollout_proc17_evt_loop terminating... +[2023-03-06 11:18:19,486][1834355] Loop rollout_proc5_evt_loop terminating... +[2023-03-06 11:18:19,486][1834680] Stopping RolloutWorker_w20... +[2023-03-06 11:18:19,486][1834745] Loop rollout_proc24_evt_loop terminating... +[2023-03-06 11:18:19,486][1834617] Loop rollout_proc19_evt_loop terminating... +[2023-03-06 11:18:19,486][1834354] Stopping RolloutWorker_w4... +[2023-03-06 11:18:19,486][1834353] Loop rollout_proc3_evt_loop terminating... +[2023-03-06 11:18:19,486][1834298] Loop batcher_evt_loop terminating... +[2023-03-06 11:18:19,486][1834846] Loop rollout_proc31_evt_loop terminating... +[2023-03-06 11:18:19,486][1834743] Loop rollout_proc21_evt_loop terminating... +[2023-03-06 11:18:19,486][1834515] Stopping RolloutWorker_w10... +[2023-03-06 11:18:19,486][1834018] Collected {0: 885760}, FPS: 10962.3 +[2023-03-06 11:18:19,486][1834350] Loop rollout_proc1_evt_loop terminating... +[2023-03-06 11:18:19,486][1834482] Stopping RolloutWorker_w8... +[2023-03-06 11:18:19,487][1834680] Loop rollout_proc20_evt_loop terminating... +[2023-03-06 11:18:19,486][1834748] Stopping RolloutWorker_w27... +[2023-03-06 11:18:19,486][1834419] Stopping RolloutWorker_w7... +[2023-03-06 11:18:19,487][1834354] Loop rollout_proc4_evt_loop terminating... +[2023-03-06 11:18:19,486][1834387] Stopping RolloutWorker_w6... +[2023-03-06 11:18:19,487][1834515] Loop rollout_proc10_evt_loop terminating... +[2023-03-06 11:18:19,487][1834780] Stopping RolloutWorker_w28... +[2023-03-06 11:18:19,487][1834482] Loop rollout_proc8_evt_loop terminating... +[2023-03-06 11:18:19,487][1834520] Stopping RolloutWorker_w14... +[2023-03-06 11:18:19,487][1834517] Stopping RolloutWorker_w13... +[2023-03-06 11:18:19,487][1834748] Loop rollout_proc27_evt_loop terminating... +[2023-03-06 11:18:19,487][1834419] Loop rollout_proc7_evt_loop terminating... +[2023-03-06 11:18:19,487][1834780] Loop rollout_proc28_evt_loop terminating... +[2023-03-06 11:18:19,487][1834387] Loop rollout_proc6_evt_loop terminating... +[2023-03-06 11:18:19,487][1834514] Stopping RolloutWorker_w9... +[2023-03-06 11:18:19,487][1834516] Stopping RolloutWorker_w11... +[2023-03-06 11:18:19,487][1834520] Loop rollout_proc14_evt_loop terminating... +[2023-03-06 11:18:19,487][1834517] Loop rollout_proc13_evt_loop terminating... +[2023-03-06 11:18:19,487][1834813] Stopping RolloutWorker_w29... +[2023-03-06 11:18:19,487][1834518] Stopping RolloutWorker_w12... +[2023-03-06 11:18:19,487][1834352] Stopping RolloutWorker_w2... +[2023-03-06 11:18:19,487][1834514] Loop rollout_proc9_evt_loop terminating... +[2023-03-06 11:18:19,487][1834516] Loop rollout_proc11_evt_loop terminating... +[2023-03-06 11:18:19,487][1834813] Loop rollout_proc29_evt_loop terminating... +[2023-03-06 11:18:19,487][1834746] Stopping RolloutWorker_w25... +[2023-03-06 11:18:19,487][1834352] Loop rollout_proc2_evt_loop terminating... +[2023-03-06 11:18:19,487][1834298] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000000865_885760.pth... +[2023-03-06 11:18:19,487][1834518] Loop rollout_proc12_evt_loop terminating... +[2023-03-06 11:18:19,488][1834746] Loop rollout_proc25_evt_loop terminating... +[2023-03-06 11:18:19,488][1834519] Stopping RolloutWorker_w15... +[2023-03-06 11:18:19,488][1834519] Loop rollout_proc15_evt_loop terminating... +[2023-03-06 11:18:19,488][1834533] Stopping RolloutWorker_w16... +[2023-03-06 11:18:19,489][1834533] Loop rollout_proc16_evt_loop terminating... +[2023-03-06 11:18:19,490][1834845] Stopping RolloutWorker_w30... +[2023-03-06 11:18:19,491][1834845] Loop rollout_proc30_evt_loop terminating... +[2023-03-06 11:18:19,491][1834351] Stopping RolloutWorker_w0... +[2023-03-06 11:18:19,492][1834351] Loop rollout_proc0_evt_loop terminating... +[2023-03-06 11:18:19,495][1834812] Stopping RolloutWorker_w26... +[2023-03-06 11:18:19,496][1834812] Loop rollout_proc26_evt_loop terminating... +[2023-03-06 11:18:19,511][1834747] Stopping RolloutWorker_w23... +[2023-03-06 11:18:19,512][1834747] Loop rollout_proc23_evt_loop terminating... +[2023-03-06 11:18:19,560][1834349] Weights refcount: 2 0 +[2023-03-06 11:18:19,562][1834349] Stopping InferenceWorker_p0-w0... +[2023-03-06 11:18:19,567][1834349] Loop inference_proc0-0_evt_loop terminating... +[2023-03-06 11:18:19,594][1834298] Stopping LearnerWorker_p0... +[2023-03-06 11:18:19,595][1834298] Loop learner_proc0_evt_loop terminating... +[2023-03-06 11:18:19,645][1834744] Stopping RolloutWorker_w22... +[2023-03-06 11:18:19,646][1834744] Loop rollout_proc22_evt_loop terminating... +[2023-03-06 12:53:56,968][1853846] Saving configuration to /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/config.json... +[2023-03-06 12:53:56,982][1853846] Rollout worker 0 uses device cpu +[2023-03-06 12:53:56,983][1853846] Rollout worker 1 uses device cpu +[2023-03-06 12:53:56,983][1853846] Rollout worker 2 uses device cpu +[2023-03-06 12:53:56,983][1853846] Rollout worker 3 uses device cpu +[2023-03-06 12:53:56,983][1853846] Rollout worker 4 uses device cpu +[2023-03-06 12:53:56,983][1853846] Rollout worker 5 uses device cpu +[2023-03-06 12:53:56,983][1853846] Rollout worker 6 uses device cpu +[2023-03-06 12:53:56,984][1853846] Rollout worker 7 uses device cpu +[2023-03-06 12:53:56,984][1853846] Rollout worker 8 uses device cpu +[2023-03-06 12:53:56,984][1853846] Rollout worker 9 uses device cpu +[2023-03-06 12:53:56,984][1853846] Rollout worker 10 uses device cpu +[2023-03-06 12:53:56,984][1853846] Rollout worker 11 uses device cpu +[2023-03-06 12:53:56,984][1853846] Rollout worker 12 uses device cpu +[2023-03-06 12:53:56,984][1853846] Rollout worker 13 uses device cpu +[2023-03-06 12:53:56,984][1853846] Rollout worker 14 uses device cpu +[2023-03-06 12:53:56,984][1853846] Rollout worker 15 uses device cpu +[2023-03-06 12:53:56,985][1853846] Rollout worker 16 uses device cpu +[2023-03-06 12:53:56,985][1853846] Rollout worker 17 uses device cpu +[2023-03-06 12:53:56,985][1853846] Rollout worker 18 uses device cpu +[2023-03-06 12:53:56,985][1853846] Rollout worker 19 uses device cpu +[2023-03-06 12:53:56,985][1853846] Rollout worker 20 uses device cpu +[2023-03-06 12:53:56,985][1853846] Rollout worker 21 uses device cpu +[2023-03-06 12:53:56,985][1853846] Rollout worker 22 uses device cpu +[2023-03-06 12:53:56,985][1853846] Rollout worker 23 uses device cpu +[2023-03-06 12:53:56,985][1853846] Rollout worker 24 uses device cpu +[2023-03-06 12:53:56,985][1853846] Rollout worker 25 uses device cpu +[2023-03-06 12:53:56,986][1853846] Rollout worker 26 uses device cpu +[2023-03-06 12:53:56,986][1853846] Rollout worker 27 uses device cpu +[2023-03-06 12:53:56,986][1853846] Rollout worker 28 uses device cpu +[2023-03-06 12:53:56,986][1853846] Rollout worker 29 uses device cpu +[2023-03-06 12:53:56,986][1853846] Rollout worker 30 uses device cpu +[2023-03-06 12:53:56,986][1853846] Rollout worker 31 uses device cpu +[2023-03-06 12:53:56,999][1853846] Using GPUs [0] for process 0 (actually maps to GPUs [0]) +[2023-03-06 12:53:57,000][1853846] InferenceWorker_p0-w0: min num requests: 10 +[2023-03-06 12:53:57,096][1853846] Starting all processes... +[2023-03-06 12:53:57,096][1853846] Starting process learner_proc0 +[2023-03-06 12:53:57,146][1853846] Starting all processes... +[2023-03-06 12:53:57,213][1853846] Starting process inference_proc0-0 +[2023-03-06 12:53:57,213][1853846] Starting process rollout_proc0 +[2023-03-06 12:53:57,213][1853846] Starting process rollout_proc1 +[2023-03-06 12:53:57,214][1853846] Starting process rollout_proc2 +[2023-03-06 12:53:57,214][1853846] Starting process rollout_proc3 +[2023-03-06 12:53:57,214][1853846] Starting process rollout_proc4 +[2023-03-06 12:53:57,214][1853846] Starting process rollout_proc5 +[2023-03-06 12:53:57,216][1853846] Starting process rollout_proc6 +[2023-03-06 12:53:57,221][1853846] Starting process rollout_proc7 +[2023-03-06 12:53:57,225][1853846] Starting process rollout_proc8 +[2023-03-06 12:53:57,226][1853846] Starting process rollout_proc9 +[2023-03-06 12:53:57,226][1853846] Starting process rollout_proc10 +[2023-03-06 12:53:57,226][1853846] Starting process rollout_proc11 +[2023-03-06 12:53:57,226][1853846] Starting process rollout_proc12 +[2023-03-06 12:53:57,227][1853846] Starting process rollout_proc13 +[2023-03-06 12:53:57,227][1853846] Starting process rollout_proc14 +[2023-03-06 12:53:57,231][1853846] Starting process rollout_proc15 +[2023-03-06 12:53:57,234][1853846] Starting process rollout_proc16 +[2023-03-06 12:53:57,235][1853846] Starting process rollout_proc17 +[2023-03-06 12:53:57,239][1853846] Starting process rollout_proc18 +[2023-03-06 12:53:57,246][1853846] Starting process rollout_proc19 +[2023-03-06 12:53:57,319][1853846] Starting process rollout_proc20 +[2023-03-06 12:53:57,320][1853846] Starting process rollout_proc21 +[2023-03-06 12:53:57,324][1853846] Starting process rollout_proc22 +[2023-03-06 12:53:57,355][1853846] Starting process rollout_proc23 +[2023-03-06 12:53:57,355][1853846] Starting process rollout_proc24 +[2023-03-06 12:53:57,386][1853846] Starting process rollout_proc25 +[2023-03-06 12:53:57,386][1853846] Starting process rollout_proc26 +[2023-03-06 12:53:57,386][1853846] Starting process rollout_proc27 +[2023-03-06 12:53:57,387][1853846] Starting process rollout_proc28 +[2023-03-06 12:53:57,387][1853846] Starting process rollout_proc29 +[2023-03-06 12:53:57,387][1853846] Starting process rollout_proc30 +[2023-03-06 12:53:57,387][1853846] Starting process rollout_proc31 +[2023-03-06 12:53:58,983][1854119] Using GPUs [0] for process 0 (actually maps to GPUs [0]) +[2023-03-06 12:53:58,983][1854119] Set environment var CUDA_VISIBLE_DEVICES to '0' (GPU indices [0]) for learning process 0 +[2023-03-06 12:53:58,993][1854119] Num visible devices: 1 +[2023-03-06 12:53:59,025][1854119] WARNING! It is generally recommended to enable Fixed KL loss (https://arxiv.org/pdf/1707.06347.pdf) for continuous action tasks to avoid potential numerical issues. I.e. set --kl_loss_coeff=0.1 +[2023-03-06 12:53:59,026][1854119] Starting seed is not provided +[2023-03-06 12:53:59,026][1854119] Using GPUs [0] for process 0 (actually maps to GPUs [0]) +[2023-03-06 12:53:59,026][1854119] Initializing actor-critic model on device cuda:0 +[2023-03-06 12:53:59,026][1854119] RunningMeanStd input shape: (39,) +[2023-03-06 12:53:59,026][1854119] RunningMeanStd input shape: (1,) +[2023-03-06 12:53:59,127][1854119] Created Actor Critic model with architecture: +[2023-03-06 12:53:59,127][1854119] ActorCriticSharedWeights( + (obs_normalizer): ObservationNormalizer( + (running_mean_std): RunningMeanStdDictInPlace( + (running_mean_std): ModuleDict( + (obs): RunningMeanStdInPlace() + ) + ) + ) + (returns_normalizer): RecursiveScriptModule(original_name=RunningMeanStdInPlace) + (encoder): MultiInputEncoder( + (encoders): ModuleDict( + (obs): MlpEncoder( + (mlp_head): RecursiveScriptModule( + original_name=Sequential + (0): RecursiveScriptModule(original_name=Linear) + (1): RecursiveScriptModule(original_name=ELU) + (2): RecursiveScriptModule(original_name=Linear) + (3): RecursiveScriptModule(original_name=ELU) + ) + ) + ) + ) + (core): ModelCoreRNN( + (core): GRU(512, 512) + ) + (decoder): MlpDecoder( + (mlp): Identity() + ) + (critic_linear): Linear(in_features=512, out_features=1, bias=True) + (action_parameterization): ActionParameterizationDefault( + (distribution_linear): Linear(in_features=512, out_features=8, bias=True) + ) +) +[2023-03-06 12:53:59,238][1854171] Worker 1 uses CPU cores [1] +[2023-03-06 12:53:59,390][1854174] Worker 4 uses CPU cores [4] +[2023-03-06 12:53:59,396][1854504] Worker 21 uses CPU cores [21] +[2023-03-06 12:53:59,531][1854334] Worker 9 uses CPU cores [9] +[2023-03-06 12:53:59,650][1854337] Worker 16 uses CPU cores [16] +[2023-03-06 12:53:59,860][1854635] Worker 28 uses CPU cores [28] +[2023-03-06 12:53:59,902][1854301] Worker 3 uses CPU cores [3] +[2023-03-06 12:54:00,003][1854322] Worker 14 uses CPU cores [14] +[2023-03-06 12:54:00,046][1854731] Worker 31 uses CPU cores [31] +[2023-03-06 12:54:00,190][1854344] Worker 11 uses CPU cores [11] +[2023-03-06 12:54:00,286][1854667] Worker 29 uses CPU cores [29] +[2023-03-06 12:54:00,389][1854170] Using GPUs [0] for process 0 (actually maps to GPUs [0]) +[2023-03-06 12:54:00,389][1854170] Set environment var CUDA_VISIBLE_DEVICES to '0' (GPU indices [0]) for inference process 0 +[2023-03-06 12:54:00,399][1854170] Num visible devices: 1 +[2023-03-06 12:54:00,530][1854599] Worker 24 uses CPU cores [24] +[2023-03-06 12:54:00,581][1854634] Worker 27 uses CPU cores [27] +[2023-03-06 12:54:00,720][1854119] Using optimizer +[2023-03-06 12:54:00,720][1854119] Loading state from checkpoint /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000000865_885760.pth... +[2023-03-06 12:54:00,731][1854338] Worker 10 uses CPU cores [10] +[2023-03-06 12:54:00,735][1854119] Loading model from checkpoint +[2023-03-06 12:54:00,739][1854119] Loaded experiment state at self.train_step=865, self.env_steps=885760 +[2023-03-06 12:54:00,739][1854119] Initialized policy 0 weights for model version 865 +[2023-03-06 12:54:00,751][1854119] Using GPUs [0] for process 0 (actually maps to GPUs [0]) +[2023-03-06 12:54:00,754][1854119] LearnerWorker_p0 finished initialization! +[2023-03-06 12:54:00,804][1854170] RunningMeanStd input shape: (39,) +[2023-03-06 12:54:00,804][1854170] RunningMeanStd input shape: (1,) +[2023-03-06 12:54:00,817][1854597] Worker 25 uses CPU cores [25] +[2023-03-06 12:54:01,036][1854336] Worker 7 uses CPU cores [7] +[2023-03-06 12:54:01,065][1854275] Worker 8 uses CPU cores [8] +[2023-03-06 12:54:01,102][1854343] Worker 15 uses CPU cores [15] +[2023-03-06 12:54:01,336][1854441] Worker 22 uses CPU cores [22] +[2023-03-06 12:54:01,358][1854172] Worker 0 uses CPU cores [0] +[2023-03-06 12:54:01,512][1854668] Worker 30 uses CPU cores [30] +[2023-03-06 12:54:01,543][1854335] Worker 12 uses CPU cores [12] +[2023-03-06 12:54:01,618][1854340] Worker 13 uses CPU cores [13] +[2023-03-06 12:54:01,632][1853846] Inference worker 0-0 is ready! +[2023-03-06 12:54:01,634][1853846] All inference workers are ready! Signal rollout workers to start! +[2023-03-06 12:54:01,728][1854339] Worker 18 uses CPU cores [18] +[2023-03-06 12:54:01,962][1854581] Worker 23 uses CPU cores [23] +[2023-03-06 12:54:02,067][1854173] Worker 2 uses CPU cores [2] +[2023-03-06 12:54:02,246][1854300] Worker 5 uses CPU cores [5] +[2023-03-06 12:54:02,477][1854346] Worker 20 uses CPU cores [20] +[2023-03-06 12:54:02,540][1854633] Worker 26 uses CPU cores [26] +[2023-03-06 12:54:02,646][1854341] Worker 19 uses CPU cores [19] +[2023-03-06 12:54:02,912][1854342] Worker 6 uses CPU cores [6] +[2023-03-06 12:54:03,028][1854345] Worker 17 uses CPU cores [17] +[2023-03-06 12:54:03,395][1854441] Decorrelating experience for 0 frames... +[2023-03-06 12:54:03,678][1854171] Decorrelating experience for 0 frames... +[2023-03-06 12:54:03,690][1854174] Decorrelating experience for 0 frames... +[2023-03-06 12:54:03,701][1853846] Fps is (10 sec: nan, 60 sec: nan, 300 sec: nan). Total num frames: 885760. Throughput: 0: nan. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0) +[2023-03-06 12:54:03,736][1854301] Decorrelating experience for 0 frames... +[2023-03-06 12:54:03,747][1854504] Decorrelating experience for 0 frames... +[2023-03-06 12:54:03,793][1854338] Decorrelating experience for 0 frames... +[2023-03-06 12:54:03,794][1854336] Decorrelating experience for 0 frames... +[2023-03-06 12:54:03,853][1854667] Decorrelating experience for 0 frames... +[2023-03-06 12:54:03,910][1854337] Decorrelating experience for 0 frames... +[2023-03-06 12:54:03,919][1854172] Decorrelating experience for 0 frames... +[2023-03-06 12:54:03,946][1854635] Decorrelating experience for 0 frames... +[2023-03-06 12:54:03,949][1854334] Decorrelating experience for 0 frames... +[2023-03-06 12:54:03,967][1854668] Decorrelating experience for 0 frames... +[2023-03-06 12:54:03,977][1854275] Decorrelating experience for 0 frames... +[2023-03-06 12:54:03,980][1854340] Decorrelating experience for 0 frames... +[2023-03-06 12:54:03,983][1854344] Decorrelating experience for 0 frames... +[2023-03-06 12:54:03,993][1854597] Decorrelating experience for 0 frames... +[2023-03-06 12:54:03,999][1854599] Decorrelating experience for 0 frames... +[2023-03-06 12:54:04,034][1854322] Decorrelating experience for 0 frames... +[2023-03-06 12:54:04,041][1854339] Decorrelating experience for 0 frames... +[2023-03-06 12:54:04,072][1854335] Decorrelating experience for 0 frames... +[2023-03-06 12:54:04,180][1854634] Decorrelating experience for 0 frames... +[2023-03-06 12:54:04,335][1854581] Decorrelating experience for 0 frames... +[2023-03-06 12:54:04,477][1854173] Decorrelating experience for 0 frames... +[2023-03-06 12:54:04,501][1854343] Decorrelating experience for 0 frames... +[2023-03-06 12:54:04,631][1854300] Decorrelating experience for 0 frames... +[2023-03-06 12:54:04,653][1854731] Decorrelating experience for 0 frames... +[2023-03-06 12:54:05,068][1854346] Decorrelating experience for 0 frames... +[2023-03-06 12:54:05,077][1854341] Decorrelating experience for 0 frames... +[2023-03-06 12:54:05,117][1854633] Decorrelating experience for 0 frames... +[2023-03-06 12:54:05,517][1854342] Decorrelating experience for 0 frames... +[2023-03-06 12:54:05,584][1854441] Decorrelating experience for 32 frames... +[2023-03-06 12:54:05,602][1854345] Decorrelating experience for 0 frames... +[2023-03-06 12:54:05,869][1854171] Decorrelating experience for 32 frames... +[2023-03-06 12:54:05,906][1854174] Decorrelating experience for 32 frames... +[2023-03-06 12:54:05,967][1854301] Decorrelating experience for 32 frames... +[2023-03-06 12:54:05,991][1854504] Decorrelating experience for 32 frames... +[2023-03-06 12:54:06,006][1854338] Decorrelating experience for 32 frames... +[2023-03-06 12:54:06,022][1854336] Decorrelating experience for 32 frames... +[2023-03-06 12:54:06,159][1854635] Decorrelating experience for 32 frames... +[2023-03-06 12:54:06,174][1854334] Decorrelating experience for 32 frames... +[2023-03-06 12:54:06,177][1854667] Decorrelating experience for 32 frames... +[2023-03-06 12:54:06,186][1854172] Decorrelating experience for 32 frames... +[2023-03-06 12:54:06,193][1854275] Decorrelating experience for 32 frames... +[2023-03-06 12:54:06,201][1854340] Decorrelating experience for 32 frames... +[2023-03-06 12:54:06,205][1854597] Decorrelating experience for 32 frames... +[2023-03-06 12:54:06,218][1854668] Decorrelating experience for 32 frames... +[2023-03-06 12:54:06,232][1854337] Decorrelating experience for 32 frames... +[2023-03-06 12:54:06,237][1854344] Decorrelating experience for 32 frames... +[2023-03-06 12:54:06,263][1854339] Decorrelating experience for 32 frames... +[2023-03-06 12:54:06,266][1854599] Decorrelating experience for 32 frames... +[2023-03-06 12:54:06,279][1854322] Decorrelating experience for 32 frames... +[2023-03-06 12:54:06,327][1854335] Decorrelating experience for 32 frames... +[2023-03-06 12:54:06,388][1854634] Decorrelating experience for 32 frames... +[2023-03-06 12:54:06,444][1854581] Decorrelating experience for 32 frames... +[2023-03-06 12:54:06,505][1854119] Signal inference workers to stop experience collection... +[2023-03-06 12:54:06,508][1854170] InferenceWorker_p0-w0: stopping experience collection +[2023-03-06 12:54:06,619][1854173] Decorrelating experience for 32 frames... +[2023-03-06 12:54:06,644][1854300] Decorrelating experience for 32 frames... +[2023-03-06 12:54:06,681][1854343] Decorrelating experience for 32 frames... +[2023-03-06 12:54:06,779][1854731] Decorrelating experience for 32 frames... +[2023-03-06 12:54:06,833][1854119] Signal inference workers to resume experience collection... +[2023-03-06 12:54:06,834][1854170] InferenceWorker_p0-w0: resuming experience collection +[2023-03-06 12:54:06,875][1854346] Decorrelating experience for 32 frames... +[2023-03-06 12:54:06,895][1854341] Decorrelating experience for 32 frames... +[2023-03-06 12:54:06,912][1854633] Decorrelating experience for 32 frames... +[2023-03-06 12:54:07,159][1854345] Decorrelating experience for 32 frames... +[2023-03-06 12:54:07,165][1854342] Decorrelating experience for 32 frames... +[2023-03-06 12:54:08,036][1854170] Updated weights for policy 0, policy_version 875 (0.0218) +[2023-03-06 12:54:08,701][1853846] Fps is (10 sec: 3686.5, 60 sec: 3686.5, 300 sec: 3686.5). Total num frames: 904192. Throughput: 0: 1280.0. Samples: 6400. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-06 12:54:08,859][1854170] Updated weights for policy 0, policy_version 885 (0.0007) +[2023-03-06 12:54:09,689][1854170] Updated weights for policy 0, policy_version 895 (0.0006) +[2023-03-06 12:54:10,484][1854170] Updated weights for policy 0, policy_version 905 (0.0006) +[2023-03-06 12:54:11,294][1854170] Updated weights for policy 0, policy_version 915 (0.0006) +[2023-03-06 12:54:12,096][1854170] Updated weights for policy 0, policy_version 925 (0.0006) +[2023-03-06 12:54:12,913][1854170] Updated weights for policy 0, policy_version 935 (0.0007) +[2023-03-06 12:54:13,701][1853846] Fps is (10 sec: 8089.7, 60 sec: 8089.7, 300 sec: 8089.7). Total num frames: 966656. Throughput: 0: 8242.7. Samples: 82426. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) +[2023-03-06 12:54:13,701][1853846] Avg episode reward: [(0, '494.362')] +[2023-03-06 12:54:13,705][1854170] Updated weights for policy 0, policy_version 945 (0.0007) +[2023-03-06 12:54:13,707][1854119] Saving new best policy, reward=494.362! +[2023-03-06 12:54:14,505][1854170] Updated weights for policy 0, policy_version 955 (0.0006) +[2023-03-06 12:54:15,333][1854170] Updated weights for policy 0, policy_version 965 (0.0006) +[2023-03-06 12:54:16,125][1854170] Updated weights for policy 0, policy_version 975 (0.0006) +[2023-03-06 12:54:16,922][1854170] Updated weights for policy 0, policy_version 985 (0.0008) +[2023-03-06 12:54:16,996][1853846] Heartbeat connected on Batcher_0 +[2023-03-06 12:54:16,997][1853846] Heartbeat connected on LearnerWorker_p0 +[2023-03-06 12:54:17,002][1853846] Heartbeat connected on InferenceWorker_p0-w0 +[2023-03-06 12:54:17,005][1853846] Heartbeat connected on RolloutWorker_w0 +[2023-03-06 12:54:17,005][1853846] Heartbeat connected on RolloutWorker_w1 +[2023-03-06 12:54:17,006][1853846] Heartbeat connected on RolloutWorker_w2 +[2023-03-06 12:54:17,008][1853846] Heartbeat connected on RolloutWorker_w3 +[2023-03-06 12:54:17,011][1853846] Heartbeat connected on RolloutWorker_w4 +[2023-03-06 12:54:17,012][1853846] Heartbeat connected on RolloutWorker_w5 +[2023-03-06 12:54:17,014][1853846] Heartbeat connected on RolloutWorker_w6 +[2023-03-06 12:54:17,016][1853846] Heartbeat connected on RolloutWorker_w7 +[2023-03-06 12:54:17,018][1853846] Heartbeat connected on RolloutWorker_w8 +[2023-03-06 12:54:17,021][1853846] Heartbeat connected on RolloutWorker_w10 +[2023-03-06 12:54:17,029][1853846] Heartbeat connected on RolloutWorker_w9 +[2023-03-06 12:54:17,056][1853846] Heartbeat connected on RolloutWorker_w11 +[2023-03-06 12:54:17,058][1853846] Heartbeat connected on RolloutWorker_w12 +[2023-03-06 12:54:17,061][1853846] Heartbeat connected on RolloutWorker_w13 +[2023-03-06 12:54:17,066][1853846] Heartbeat connected on RolloutWorker_w16 +[2023-03-06 12:54:17,068][1853846] Heartbeat connected on RolloutWorker_w17 +[2023-03-06 12:54:17,068][1853846] Heartbeat connected on RolloutWorker_w14 +[2023-03-06 12:54:17,069][1853846] Heartbeat connected on RolloutWorker_w18 +[2023-03-06 12:54:17,071][1853846] Heartbeat connected on RolloutWorker_w19 +[2023-03-06 12:54:17,073][1853846] Heartbeat connected on RolloutWorker_w20 +[2023-03-06 12:54:17,077][1853846] Heartbeat connected on RolloutWorker_w22 +[2023-03-06 12:54:17,077][1853846] Heartbeat connected on RolloutWorker_w15 +[2023-03-06 12:54:17,078][1853846] Heartbeat connected on RolloutWorker_w21 +[2023-03-06 12:54:17,079][1853846] Heartbeat connected on RolloutWorker_w23 +[2023-03-06 12:54:17,081][1853846] Heartbeat connected on RolloutWorker_w24 +[2023-03-06 12:54:17,083][1853846] Heartbeat connected on RolloutWorker_w25 +[2023-03-06 12:54:17,084][1853846] Heartbeat connected on RolloutWorker_w26 +[2023-03-06 12:54:17,086][1853846] Heartbeat connected on RolloutWorker_w27 +[2023-03-06 12:54:17,088][1853846] Heartbeat connected on RolloutWorker_w28 +[2023-03-06 12:54:17,091][1853846] Heartbeat connected on RolloutWorker_w29 +[2023-03-06 12:54:17,093][1853846] Heartbeat connected on RolloutWorker_w30 +[2023-03-06 12:54:17,094][1853846] Heartbeat connected on RolloutWorker_w31 +[2023-03-06 12:54:17,753][1854170] Updated weights for policy 0, policy_version 995 (0.0006) +[2023-03-06 12:54:18,563][1854170] Updated weights for policy 0, policy_version 1005 (0.0006) +[2023-03-06 12:54:18,701][1853846] Fps is (10 sec: 12595.2, 60 sec: 9625.6, 300 sec: 9625.6). Total num frames: 1030144. Throughput: 0: 8044.1. Samples: 120661. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 12:54:18,701][1853846] Avg episode reward: [(0, '505.186')] +[2023-03-06 12:54:18,715][1854119] Saving new best policy, reward=505.186! +[2023-03-06 12:54:19,365][1854170] Updated weights for policy 0, policy_version 1015 (0.0006) +[2023-03-06 12:54:20,219][1854170] Updated weights for policy 0, policy_version 1025 (0.0007) +[2023-03-06 12:54:21,028][1854170] Updated weights for policy 0, policy_version 1035 (0.0006) +[2023-03-06 12:54:21,824][1854170] Updated weights for policy 0, policy_version 1045 (0.0006) +[2023-03-06 12:54:22,645][1854170] Updated weights for policy 0, policy_version 1055 (0.0006) +[2023-03-06 12:54:23,458][1854170] Updated weights for policy 0, policy_version 1065 (0.0007) +[2023-03-06 12:54:23,701][1853846] Fps is (10 sec: 12697.6, 60 sec: 10393.7, 300 sec: 10393.7). Total num frames: 1093632. Throughput: 0: 9776.3. Samples: 195525. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) +[2023-03-06 12:54:23,701][1853846] Avg episode reward: [(0, '564.623')] +[2023-03-06 12:54:23,704][1854119] Saving new best policy, reward=564.623! +[2023-03-06 12:54:24,274][1854170] Updated weights for policy 0, policy_version 1075 (0.0007) +[2023-03-06 12:54:25,105][1854170] Updated weights for policy 0, policy_version 1085 (0.0006) +[2023-03-06 12:54:25,927][1854170] Updated weights for policy 0, policy_version 1095 (0.0007) +[2023-03-06 12:54:26,711][1854170] Updated weights for policy 0, policy_version 1105 (0.0006) +[2023-03-06 12:54:27,544][1854170] Updated weights for policy 0, policy_version 1115 (0.0006) +[2023-03-06 12:54:28,362][1854170] Updated weights for policy 0, policy_version 1125 (0.0006) +[2023-03-06 12:54:28,701][1853846] Fps is (10 sec: 12595.2, 60 sec: 10813.5, 300 sec: 10813.5). Total num frames: 1156096. Throughput: 0: 10844.3. Samples: 271106. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) +[2023-03-06 12:54:28,701][1853846] Avg episode reward: [(0, '543.574')] +[2023-03-06 12:54:29,163][1854170] Updated weights for policy 0, policy_version 1135 (0.0007) +[2023-03-06 12:54:29,990][1854170] Updated weights for policy 0, policy_version 1145 (0.0006) +[2023-03-06 12:54:30,804][1854170] Updated weights for policy 0, policy_version 1155 (0.0006) +[2023-03-06 12:54:31,602][1854170] Updated weights for policy 0, policy_version 1165 (0.0006) +[2023-03-06 12:54:32,414][1854170] Updated weights for policy 0, policy_version 1175 (0.0006) +[2023-03-06 12:54:33,220][1854170] Updated weights for policy 0, policy_version 1185 (0.0006) +[2023-03-06 12:54:33,701][1853846] Fps is (10 sec: 12595.2, 60 sec: 11127.5, 300 sec: 11127.5). Total num frames: 1219584. Throughput: 0: 10297.4. Samples: 308921. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 12:54:33,701][1853846] Avg episode reward: [(0, '565.412')] +[2023-03-06 12:54:33,704][1854119] Saving new best policy, reward=565.412! +[2023-03-06 12:54:34,025][1854170] Updated weights for policy 0, policy_version 1195 (0.0005) +[2023-03-06 12:54:34,835][1854170] Updated weights for policy 0, policy_version 1205 (0.0007) +[2023-03-06 12:54:35,677][1854170] Updated weights for policy 0, policy_version 1215 (0.0007) +[2023-03-06 12:54:36,469][1854170] Updated weights for policy 0, policy_version 1225 (0.0006) +[2023-03-06 12:54:37,269][1854170] Updated weights for policy 0, policy_version 1235 (0.0006) +[2023-03-06 12:54:38,111][1854170] Updated weights for policy 0, policy_version 1245 (0.0006) +[2023-03-06 12:54:38,700][1853846] Fps is (10 sec: 12595.2, 60 sec: 11322.6, 300 sec: 11322.6). Total num frames: 1282048. Throughput: 0: 10985.4. Samples: 384486. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) +[2023-03-06 12:54:38,701][1853846] Avg episode reward: [(0, '584.541')] +[2023-03-06 12:54:38,702][1854119] Saving new best policy, reward=584.541! +[2023-03-06 12:54:38,919][1854170] Updated weights for policy 0, policy_version 1255 (0.0006) +[2023-03-06 12:54:39,700][1854170] Updated weights for policy 0, policy_version 1265 (0.0006) +[2023-03-06 12:54:40,533][1854170] Updated weights for policy 0, policy_version 1275 (0.0006) +[2023-03-06 12:54:41,353][1854170] Updated weights for policy 0, policy_version 1285 (0.0006) +[2023-03-06 12:54:42,150][1854170] Updated weights for policy 0, policy_version 1295 (0.0006) +[2023-03-06 12:54:42,986][1854170] Updated weights for policy 0, policy_version 1305 (0.0006) +[2023-03-06 12:54:43,701][1853846] Fps is (10 sec: 12492.7, 60 sec: 11468.8, 300 sec: 11468.8). Total num frames: 1344512. Throughput: 0: 11503.0. Samples: 460119. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 12:54:43,710][1853846] Avg episode reward: [(0, '561.019')] +[2023-03-06 12:54:43,802][1854170] Updated weights for policy 0, policy_version 1315 (0.0006) +[2023-03-06 12:54:44,595][1854170] Updated weights for policy 0, policy_version 1325 (0.0006) +[2023-03-06 12:54:45,425][1854170] Updated weights for policy 0, policy_version 1335 (0.0006) +[2023-03-06 12:54:46,229][1854170] Updated weights for policy 0, policy_version 1345 (0.0006) +[2023-03-06 12:54:47,046][1854170] Updated weights for policy 0, policy_version 1355 (0.0006) +[2023-03-06 12:54:47,881][1854170] Updated weights for policy 0, policy_version 1365 (0.0006) +[2023-03-06 12:54:48,680][1854170] Updated weights for policy 0, policy_version 1375 (0.0006) +[2023-03-06 12:54:48,700][1853846] Fps is (10 sec: 12595.2, 60 sec: 11605.4, 300 sec: 11605.4). Total num frames: 1408000. Throughput: 0: 11064.5. Samples: 497903. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 12:54:48,701][1853846] Avg episode reward: [(0, '559.506')] +[2023-03-06 12:54:49,491][1854170] Updated weights for policy 0, policy_version 1385 (0.0006) +[2023-03-06 12:54:50,308][1854170] Updated weights for policy 0, policy_version 1395 (0.0006) +[2023-03-06 12:54:51,123][1854170] Updated weights for policy 0, policy_version 1405 (0.0006) +[2023-03-06 12:54:51,933][1854170] Updated weights for policy 0, policy_version 1415 (0.0006) +[2023-03-06 12:54:52,727][1854170] Updated weights for policy 0, policy_version 1425 (0.0007) +[2023-03-06 12:54:53,561][1854170] Updated weights for policy 0, policy_version 1435 (0.0006) +[2023-03-06 12:54:53,701][1853846] Fps is (10 sec: 12595.2, 60 sec: 11694.1, 300 sec: 11694.1). Total num frames: 1470464. Throughput: 0: 12599.4. Samples: 573371. Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) +[2023-03-06 12:54:53,701][1853846] Avg episode reward: [(0, '580.511')] +[2023-03-06 12:54:54,358][1854170] Updated weights for policy 0, policy_version 1445 (0.0006) +[2023-03-06 12:54:55,182][1854170] Updated weights for policy 0, policy_version 1455 (0.0007) +[2023-03-06 12:54:55,996][1854170] Updated weights for policy 0, policy_version 1465 (0.0007) +[2023-03-06 12:54:56,802][1854170] Updated weights for policy 0, policy_version 1475 (0.0007) +[2023-03-06 12:54:57,614][1854170] Updated weights for policy 0, policy_version 1485 (0.0006) +[2023-03-06 12:54:58,442][1854170] Updated weights for policy 0, policy_version 1495 (0.0006) +[2023-03-06 12:54:58,700][1853846] Fps is (10 sec: 12595.2, 60 sec: 11785.3, 300 sec: 11785.3). Total num frames: 1533952. Throughput: 0: 12588.3. Samples: 648900. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 12:54:58,701][1853846] Avg episode reward: [(0, '500.226')] +[2023-03-06 12:54:59,255][1854170] Updated weights for policy 0, policy_version 1505 (0.0007) +[2023-03-06 12:55:00,043][1854170] Updated weights for policy 0, policy_version 1515 (0.0007) +[2023-03-06 12:55:00,870][1854170] Updated weights for policy 0, policy_version 1525 (0.0008) +[2023-03-06 12:55:01,682][1854170] Updated weights for policy 0, policy_version 1535 (0.0007) +[2023-03-06 12:55:02,491][1854170] Updated weights for policy 0, policy_version 1545 (0.0006) +[2023-03-06 12:55:03,312][1854170] Updated weights for policy 0, policy_version 1555 (0.0007) +[2023-03-06 12:55:03,701][1853846] Fps is (10 sec: 12697.5, 60 sec: 11861.3, 300 sec: 11861.3). Total num frames: 1597440. Throughput: 0: 12583.2. Samples: 686907. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) +[2023-03-06 12:55:03,701][1853846] Avg episode reward: [(0, '517.693')] +[2023-03-06 12:55:04,132][1854170] Updated weights for policy 0, policy_version 1565 (0.0007) +[2023-03-06 12:55:04,934][1854170] Updated weights for policy 0, policy_version 1575 (0.0006) +[2023-03-06 12:55:05,764][1854170] Updated weights for policy 0, policy_version 1585 (0.0007) +[2023-03-06 12:55:06,580][1854170] Updated weights for policy 0, policy_version 1595 (0.0006) +[2023-03-06 12:55:07,388][1854170] Updated weights for policy 0, policy_version 1605 (0.0006) +[2023-03-06 12:55:08,214][1854170] Updated weights for policy 0, policy_version 1615 (0.0006) +[2023-03-06 12:55:08,701][1853846] Fps is (10 sec: 12595.1, 60 sec: 12595.2, 300 sec: 11909.9). Total num frames: 1659904. Throughput: 0: 12592.6. Samples: 762193. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 12:55:08,712][1853846] Avg episode reward: [(0, '572.544')] +[2023-03-06 12:55:09,017][1854170] Updated weights for policy 0, policy_version 1625 (0.0006) +[2023-03-06 12:55:09,816][1854170] Updated weights for policy 0, policy_version 1635 (0.0006) +[2023-03-06 12:55:10,636][1854170] Updated weights for policy 0, policy_version 1645 (0.0006) +[2023-03-06 12:55:11,453][1854170] Updated weights for policy 0, policy_version 1655 (0.0006) +[2023-03-06 12:55:12,246][1854170] Updated weights for policy 0, policy_version 1665 (0.0006) +[2023-03-06 12:55:13,071][1854170] Updated weights for policy 0, policy_version 1675 (0.0008) +[2023-03-06 12:55:13,701][1853846] Fps is (10 sec: 12492.9, 60 sec: 12595.2, 300 sec: 11951.6). Total num frames: 1722368. Throughput: 0: 12589.3. Samples: 837625. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 12:55:13,712][1853846] Avg episode reward: [(0, '605.342')] +[2023-03-06 12:55:13,730][1854119] Saving new best policy, reward=605.342! +[2023-03-06 12:55:13,884][1854170] Updated weights for policy 0, policy_version 1685 (0.0007) +[2023-03-06 12:55:14,693][1854170] Updated weights for policy 0, policy_version 1695 (0.0007) +[2023-03-06 12:55:15,527][1854170] Updated weights for policy 0, policy_version 1705 (0.0006) +[2023-03-06 12:55:16,343][1854170] Updated weights for policy 0, policy_version 1715 (0.0006) +[2023-03-06 12:55:17,156][1854170] Updated weights for policy 0, policy_version 1725 (0.0006) +[2023-03-06 12:55:17,969][1854170] Updated weights for policy 0, policy_version 1735 (0.0007) +[2023-03-06 12:55:18,701][1853846] Fps is (10 sec: 12492.8, 60 sec: 12578.1, 300 sec: 11987.6). Total num frames: 1784832. Throughput: 0: 12586.9. Samples: 875330. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) +[2023-03-06 12:55:18,701][1853846] Avg episode reward: [(0, '646.889')] +[2023-03-06 12:55:18,701][1854119] Saving new best policy, reward=646.889! +[2023-03-06 12:55:18,796][1854170] Updated weights for policy 0, policy_version 1745 (0.0006) +[2023-03-06 12:55:19,608][1854170] Updated weights for policy 0, policy_version 1755 (0.0008) +[2023-03-06 12:55:20,405][1854170] Updated weights for policy 0, policy_version 1765 (0.0006) +[2023-03-06 12:55:21,240][1854170] Updated weights for policy 0, policy_version 1775 (0.0006) +[2023-03-06 12:55:22,050][1854170] Updated weights for policy 0, policy_version 1785 (0.0007) +[2023-03-06 12:55:22,859][1854170] Updated weights for policy 0, policy_version 1795 (0.0007) +[2023-03-06 12:55:23,703][1854170] Updated weights for policy 0, policy_version 1805 (0.0006) +[2023-03-06 12:55:23,701][1853846] Fps is (10 sec: 12492.8, 60 sec: 12561.1, 300 sec: 12019.2). Total num frames: 1847296. Throughput: 0: 12581.4. Samples: 950650. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) +[2023-03-06 12:55:23,712][1853846] Avg episode reward: [(0, '693.274')] +[2023-03-06 12:55:23,714][1854119] Saving new best policy, reward=693.274! +[2023-03-06 12:55:24,538][1854170] Updated weights for policy 0, policy_version 1815 (0.0006) +[2023-03-06 12:55:25,338][1854170] Updated weights for policy 0, policy_version 1825 (0.0007) +[2023-03-06 12:55:26,180][1854170] Updated weights for policy 0, policy_version 1835 (0.0007) +[2023-03-06 12:55:26,996][1854170] Updated weights for policy 0, policy_version 1845 (0.0006) +[2023-03-06 12:55:27,810][1854170] Updated weights for policy 0, policy_version 1855 (0.0006) +[2023-03-06 12:55:28,640][1854170] Updated weights for policy 0, policy_version 1865 (0.0006) +[2023-03-06 12:55:28,701][1853846] Fps is (10 sec: 12492.8, 60 sec: 12561.1, 300 sec: 12047.1). Total num frames: 1909760. Throughput: 0: 12560.8. Samples: 1025353. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) +[2023-03-06 12:55:28,701][1853846] Avg episode reward: [(0, '713.674')] +[2023-03-06 12:55:28,705][1854119] Saving new best policy, reward=713.674! +[2023-03-06 12:55:29,445][1854170] Updated weights for policy 0, policy_version 1875 (0.0006) +[2023-03-06 12:55:30,260][1854170] Updated weights for policy 0, policy_version 1885 (0.0007) +[2023-03-06 12:55:31,096][1854170] Updated weights for policy 0, policy_version 1895 (0.0006) +[2023-03-06 12:55:31,928][1854170] Updated weights for policy 0, policy_version 1905 (0.0006) +[2023-03-06 12:55:32,735][1854170] Updated weights for policy 0, policy_version 1915 (0.0006) +[2023-03-06 12:55:33,562][1854170] Updated weights for policy 0, policy_version 1925 (0.0006) +[2023-03-06 12:55:33,700][1853846] Fps is (10 sec: 12492.9, 60 sec: 12544.0, 300 sec: 12071.8). Total num frames: 1972224. Throughput: 0: 12551.1. Samples: 1062703. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) +[2023-03-06 12:55:33,701][1853846] Avg episode reward: [(0, '711.724')] +[2023-03-06 12:55:34,384][1854170] Updated weights for policy 0, policy_version 1935 (0.0006) +[2023-03-06 12:55:35,189][1854170] Updated weights for policy 0, policy_version 1945 (0.0006) +[2023-03-06 12:55:36,030][1854170] Updated weights for policy 0, policy_version 1955 (0.0006) +[2023-03-06 12:55:36,840][1854170] Updated weights for policy 0, policy_version 1965 (0.0006) +[2023-03-06 12:55:37,652][1854170] Updated weights for policy 0, policy_version 1975 (0.0007) +[2023-03-06 12:55:38,468][1854170] Updated weights for policy 0, policy_version 1985 (0.0007) +[2023-03-06 12:55:38,701][1853846] Fps is (10 sec: 12492.8, 60 sec: 12544.0, 300 sec: 12094.0). Total num frames: 2034688. Throughput: 0: 12542.5. Samples: 1137783. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 12:55:38,701][1853846] Avg episode reward: [(0, '707.560')] +[2023-03-06 12:55:39,281][1854170] Updated weights for policy 0, policy_version 1995 (0.0007) +[2023-03-06 12:55:40,071][1854170] Updated weights for policy 0, policy_version 2005 (0.0006) +[2023-03-06 12:55:40,900][1854170] Updated weights for policy 0, policy_version 2015 (0.0006) +[2023-03-06 12:55:41,714][1854170] Updated weights for policy 0, policy_version 2025 (0.0007) +[2023-03-06 12:55:42,516][1854170] Updated weights for policy 0, policy_version 2035 (0.0007) +[2023-03-06 12:55:43,347][1854170] Updated weights for policy 0, policy_version 2045 (0.0006) +[2023-03-06 12:55:43,700][1853846] Fps is (10 sec: 12595.2, 60 sec: 12561.1, 300 sec: 12124.2). Total num frames: 2098176. Throughput: 0: 12539.5. Samples: 1213179. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 12:55:43,701][1853846] Avg episode reward: [(0, '670.970')] +[2023-03-06 12:55:44,168][1854170] Updated weights for policy 0, policy_version 2055 (0.0005) +[2023-03-06 12:55:44,992][1854170] Updated weights for policy 0, policy_version 2065 (0.0007) +[2023-03-06 12:55:45,787][1854170] Updated weights for policy 0, policy_version 2075 (0.0007) +[2023-03-06 12:55:46,606][1854170] Updated weights for policy 0, policy_version 2085 (0.0006) +[2023-03-06 12:55:47,446][1854170] Updated weights for policy 0, policy_version 2095 (0.0006) +[2023-03-06 12:55:48,227][1854170] Updated weights for policy 0, policy_version 2105 (0.0007) +[2023-03-06 12:55:48,700][1853846] Fps is (10 sec: 12595.2, 60 sec: 12544.0, 300 sec: 12141.7). Total num frames: 2160640. Throughput: 0: 12535.4. Samples: 1250999. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 12:55:48,701][1853846] Avg episode reward: [(0, '663.153')] +[2023-03-06 12:55:49,053][1854170] Updated weights for policy 0, policy_version 2115 (0.0006) +[2023-03-06 12:55:49,866][1854170] Updated weights for policy 0, policy_version 2125 (0.0005) +[2023-03-06 12:55:50,656][1854170] Updated weights for policy 0, policy_version 2135 (0.0007) +[2023-03-06 12:55:51,477][1854170] Updated weights for policy 0, policy_version 2145 (0.0006) +[2023-03-06 12:55:52,284][1854170] Updated weights for policy 0, policy_version 2155 (0.0007) +[2023-03-06 12:55:53,081][1854170] Updated weights for policy 0, policy_version 2165 (0.0006) +[2023-03-06 12:55:53,701][1853846] Fps is (10 sec: 12595.1, 60 sec: 12561.1, 300 sec: 12167.0). Total num frames: 2224128. Throughput: 0: 12548.3. Samples: 1326865. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) +[2023-03-06 12:55:53,701][1853846] Avg episode reward: [(0, '539.310')] +[2023-03-06 12:55:53,704][1854119] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000002172_2224128.pth... +[2023-03-06 12:55:53,915][1854170] Updated weights for policy 0, policy_version 2175 (0.0007) +[2023-03-06 12:55:54,726][1854170] Updated weights for policy 0, policy_version 2185 (0.0007) +[2023-03-06 12:55:55,522][1854170] Updated weights for policy 0, policy_version 2195 (0.0007) +[2023-03-06 12:55:56,354][1854170] Updated weights for policy 0, policy_version 2205 (0.0006) +[2023-03-06 12:55:57,155][1854170] Updated weights for policy 0, policy_version 2215 (0.0006) +[2023-03-06 12:55:57,963][1854170] Updated weights for policy 0, policy_version 2225 (0.0006) +[2023-03-06 12:55:58,700][1853846] Fps is (10 sec: 12595.2, 60 sec: 12544.0, 300 sec: 12181.2). Total num frames: 2286592. Throughput: 0: 12546.2. Samples: 1402204. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 12:55:58,701][1853846] Avg episode reward: [(0, '630.346')] +[2023-03-06 12:55:58,804][1854170] Updated weights for policy 0, policy_version 2235 (0.0007) +[2023-03-06 12:55:59,597][1854170] Updated weights for policy 0, policy_version 2245 (0.0007) +[2023-03-06 12:56:00,394][1854170] Updated weights for policy 0, policy_version 2255 (0.0006) +[2023-03-06 12:56:01,228][1854170] Updated weights for policy 0, policy_version 2265 (0.0006) +[2023-03-06 12:56:02,046][1854170] Updated weights for policy 0, policy_version 2275 (0.0006) +[2023-03-06 12:56:02,841][1854170] Updated weights for policy 0, policy_version 2285 (0.0007) +[2023-03-06 12:56:03,662][1854170] Updated weights for policy 0, policy_version 2295 (0.0007) +[2023-03-06 12:56:03,701][1853846] Fps is (10 sec: 12595.1, 60 sec: 12544.0, 300 sec: 12202.7). Total num frames: 2350080. Throughput: 0: 12547.6. Samples: 1439971. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) +[2023-03-06 12:56:03,701][1853846] Avg episode reward: [(0, '690.768')] +[2023-03-06 12:56:04,465][1854170] Updated weights for policy 0, policy_version 2305 (0.0006) +[2023-03-06 12:56:05,290][1854170] Updated weights for policy 0, policy_version 2315 (0.0007) +[2023-03-06 12:56:06,119][1854170] Updated weights for policy 0, policy_version 2325 (0.0007) +[2023-03-06 12:56:06,956][1854170] Updated weights for policy 0, policy_version 2335 (0.0006) +[2023-03-06 12:56:07,775][1854170] Updated weights for policy 0, policy_version 2345 (0.0006) +[2023-03-06 12:56:08,604][1854170] Updated weights for policy 0, policy_version 2355 (0.0006) +[2023-03-06 12:56:08,701][1853846] Fps is (10 sec: 12595.1, 60 sec: 12544.0, 300 sec: 12214.3). Total num frames: 2412544. Throughput: 0: 12542.4. Samples: 1515059. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 12:56:08,701][1853846] Avg episode reward: [(0, '658.474')] +[2023-03-06 12:56:09,429][1854170] Updated weights for policy 0, policy_version 2365 (0.0007) +[2023-03-06 12:56:10,239][1854170] Updated weights for policy 0, policy_version 2375 (0.0006) +[2023-03-06 12:56:11,066][1854170] Updated weights for policy 0, policy_version 2385 (0.0006) +[2023-03-06 12:56:11,875][1854170] Updated weights for policy 0, policy_version 2395 (0.0007) +[2023-03-06 12:56:12,697][1854170] Updated weights for policy 0, policy_version 2405 (0.0006) +[2023-03-06 12:56:13,508][1854170] Updated weights for policy 0, policy_version 2415 (0.0006) +[2023-03-06 12:56:13,701][1853846] Fps is (10 sec: 12492.8, 60 sec: 12544.0, 300 sec: 12225.0). Total num frames: 2475008. Throughput: 0: 12550.2. Samples: 1590112. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) +[2023-03-06 12:56:13,701][1853846] Avg episode reward: [(0, '699.042')] +[2023-03-06 12:56:14,339][1854170] Updated weights for policy 0, policy_version 2425 (0.0006) +[2023-03-06 12:56:15,152][1854170] Updated weights for policy 0, policy_version 2435 (0.0007) +[2023-03-06 12:56:15,971][1854170] Updated weights for policy 0, policy_version 2445 (0.0006) +[2023-03-06 12:56:16,799][1854170] Updated weights for policy 0, policy_version 2455 (0.0007) +[2023-03-06 12:56:17,613][1854170] Updated weights for policy 0, policy_version 2465 (0.0006) +[2023-03-06 12:56:18,418][1854170] Updated weights for policy 0, policy_version 2475 (0.0006) +[2023-03-06 12:56:18,700][1853846] Fps is (10 sec: 12492.9, 60 sec: 12544.0, 300 sec: 12234.9). Total num frames: 2537472. Throughput: 0: 12549.8. Samples: 1627444. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) +[2023-03-06 12:56:18,701][1853846] Avg episode reward: [(0, '622.134')] +[2023-03-06 12:56:19,252][1854170] Updated weights for policy 0, policy_version 2485 (0.0007) +[2023-03-06 12:56:20,083][1854170] Updated weights for policy 0, policy_version 2495 (0.0007) +[2023-03-06 12:56:20,882][1854170] Updated weights for policy 0, policy_version 2505 (0.0006) +[2023-03-06 12:56:21,706][1854170] Updated weights for policy 0, policy_version 2515 (0.0006) +[2023-03-06 12:56:22,515][1854170] Updated weights for policy 0, policy_version 2525 (0.0006) +[2023-03-06 12:56:23,317][1854170] Updated weights for policy 0, policy_version 2535 (0.0006) +[2023-03-06 12:56:23,701][1853846] Fps is (10 sec: 12492.9, 60 sec: 12544.0, 300 sec: 12244.1). Total num frames: 2599936. Throughput: 0: 12553.9. Samples: 1702708. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) +[2023-03-06 12:56:23,701][1853846] Avg episode reward: [(0, '558.606')] +[2023-03-06 12:56:24,107][1854170] Updated weights for policy 0, policy_version 2545 (0.0006) +[2023-03-06 12:56:24,949][1854170] Updated weights for policy 0, policy_version 2555 (0.0006) +[2023-03-06 12:56:25,763][1854170] Updated weights for policy 0, policy_version 2565 (0.0007) +[2023-03-06 12:56:26,572][1854170] Updated weights for policy 0, policy_version 2575 (0.0006) +[2023-03-06 12:56:27,414][1854170] Updated weights for policy 0, policy_version 2585 (0.0007) +[2023-03-06 12:56:28,205][1854170] Updated weights for policy 0, policy_version 2595 (0.0007) +[2023-03-06 12:56:28,701][1853846] Fps is (10 sec: 12492.7, 60 sec: 12544.0, 300 sec: 12252.7). Total num frames: 2662400. Throughput: 0: 12554.4. Samples: 1778126. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 12:56:28,701][1853846] Avg episode reward: [(0, '595.657')] +[2023-03-06 12:56:29,016][1854170] Updated weights for policy 0, policy_version 2605 (0.0006) +[2023-03-06 12:56:29,849][1854170] Updated weights for policy 0, policy_version 2615 (0.0006) +[2023-03-06 12:56:30,661][1854170] Updated weights for policy 0, policy_version 2625 (0.0007) +[2023-03-06 12:56:31,476][1854170] Updated weights for policy 0, policy_version 2635 (0.0006) +[2023-03-06 12:56:32,320][1854170] Updated weights for policy 0, policy_version 2645 (0.0007) +[2023-03-06 12:56:33,119][1854170] Updated weights for policy 0, policy_version 2655 (0.0007) +[2023-03-06 12:56:33,700][1853846] Fps is (10 sec: 12595.2, 60 sec: 12561.1, 300 sec: 12267.5). Total num frames: 2725888. Throughput: 0: 12549.1. Samples: 1815710. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) +[2023-03-06 12:56:33,701][1853846] Avg episode reward: [(0, '566.095')] +[2023-03-06 12:56:33,940][1854170] Updated weights for policy 0, policy_version 2665 (0.0006) +[2023-03-06 12:56:34,766][1854170] Updated weights for policy 0, policy_version 2675 (0.0007) +[2023-03-06 12:56:35,576][1854170] Updated weights for policy 0, policy_version 2685 (0.0006) +[2023-03-06 12:56:36,408][1854170] Updated weights for policy 0, policy_version 2695 (0.0006) +[2023-03-06 12:56:37,221][1854170] Updated weights for policy 0, policy_version 2705 (0.0006) +[2023-03-06 12:56:38,033][1854170] Updated weights for policy 0, policy_version 2715 (0.0006) +[2023-03-06 12:56:38,700][1853846] Fps is (10 sec: 12595.3, 60 sec: 12561.1, 300 sec: 12274.8). Total num frames: 2788352. Throughput: 0: 12528.0. Samples: 1890624. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 12:56:38,701][1853846] Avg episode reward: [(0, '583.616')] +[2023-03-06 12:56:38,848][1854170] Updated weights for policy 0, policy_version 2725 (0.0007) +[2023-03-06 12:56:39,645][1854170] Updated weights for policy 0, policy_version 2735 (0.0006) +[2023-03-06 12:56:40,468][1854170] Updated weights for policy 0, policy_version 2745 (0.0007) +[2023-03-06 12:56:41,263][1854170] Updated weights for policy 0, policy_version 2755 (0.0007) +[2023-03-06 12:56:42,073][1854170] Updated weights for policy 0, policy_version 2765 (0.0006) +[2023-03-06 12:56:42,897][1854170] Updated weights for policy 0, policy_version 2775 (0.0006) +[2023-03-06 12:56:43,701][1853846] Fps is (10 sec: 12492.8, 60 sec: 12544.0, 300 sec: 12281.6). Total num frames: 2850816. Throughput: 0: 12535.2. Samples: 1966288. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) +[2023-03-06 12:56:43,701][1853846] Avg episode reward: [(0, '570.534')] +[2023-03-06 12:56:43,726][1854170] Updated weights for policy 0, policy_version 2785 (0.0006) +[2023-03-06 12:56:44,540][1854170] Updated weights for policy 0, policy_version 2795 (0.0006) +[2023-03-06 12:56:45,367][1854170] Updated weights for policy 0, policy_version 2805 (0.0007) +[2023-03-06 12:56:46,189][1854170] Updated weights for policy 0, policy_version 2815 (0.0006) +[2023-03-06 12:56:47,000][1854170] Updated weights for policy 0, policy_version 2825 (0.0006) +[2023-03-06 12:56:47,814][1854170] Updated weights for policy 0, policy_version 2835 (0.0007) +[2023-03-06 12:56:48,644][1854170] Updated weights for policy 0, policy_version 2845 (0.0006) +[2023-03-06 12:56:48,701][1853846] Fps is (10 sec: 12492.7, 60 sec: 12544.0, 300 sec: 12288.0). Total num frames: 2913280. Throughput: 0: 12525.2. Samples: 2003605. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) +[2023-03-06 12:56:48,701][1853846] Avg episode reward: [(0, '621.915')] +[2023-03-06 12:56:49,457][1854170] Updated weights for policy 0, policy_version 2855 (0.0007) +[2023-03-06 12:56:50,287][1854170] Updated weights for policy 0, policy_version 2865 (0.0006) +[2023-03-06 12:56:51,110][1854170] Updated weights for policy 0, policy_version 2875 (0.0007) +[2023-03-06 12:56:51,924][1854170] Updated weights for policy 0, policy_version 2885 (0.0006) +[2023-03-06 12:56:52,749][1854170] Updated weights for policy 0, policy_version 2895 (0.0007) +[2023-03-06 12:56:53,561][1854170] Updated weights for policy 0, policy_version 2905 (0.0007) +[2023-03-06 12:56:53,700][1853846] Fps is (10 sec: 12492.8, 60 sec: 12526.9, 300 sec: 12294.0). Total num frames: 2975744. Throughput: 0: 12525.6. Samples: 2078709. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 12:56:53,701][1853846] Avg episode reward: [(0, '611.028')] +[2023-03-06 12:56:54,361][1854170] Updated weights for policy 0, policy_version 2915 (0.0006) +[2023-03-06 12:56:55,197][1854170] Updated weights for policy 0, policy_version 2925 (0.0007) +[2023-03-06 12:56:56,001][1854170] Updated weights for policy 0, policy_version 2935 (0.0007) +[2023-03-06 12:56:56,831][1854170] Updated weights for policy 0, policy_version 2945 (0.0007) +[2023-03-06 12:56:57,633][1854170] Updated weights for policy 0, policy_version 2955 (0.0006) +[2023-03-06 12:56:58,459][1854170] Updated weights for policy 0, policy_version 2965 (0.0006) +[2023-03-06 12:56:58,700][1853846] Fps is (10 sec: 12492.9, 60 sec: 12526.9, 300 sec: 12299.7). Total num frames: 3038208. Throughput: 0: 12531.3. Samples: 2154017. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) +[2023-03-06 12:56:58,701][1853846] Avg episode reward: [(0, '671.051')] +[2023-03-06 12:56:59,279][1854170] Updated weights for policy 0, policy_version 2975 (0.0007) +[2023-03-06 12:57:00,088][1854170] Updated weights for policy 0, policy_version 2985 (0.0006) +[2023-03-06 12:57:00,896][1854170] Updated weights for policy 0, policy_version 2995 (0.0007) +[2023-03-06 12:57:01,730][1854170] Updated weights for policy 0, policy_version 3005 (0.0006) +[2023-03-06 12:57:02,548][1854170] Updated weights for policy 0, policy_version 3015 (0.0007) +[2023-03-06 12:57:03,374][1854170] Updated weights for policy 0, policy_version 3025 (0.0007) +[2023-03-06 12:57:03,700][1853846] Fps is (10 sec: 12492.8, 60 sec: 12509.9, 300 sec: 12305.1). Total num frames: 3100672. Throughput: 0: 12533.1. Samples: 2191435. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 12:57:03,711][1853846] Avg episode reward: [(0, '663.145')] +[2023-03-06 12:57:04,201][1854170] Updated weights for policy 0, policy_version 3035 (0.0007) +[2023-03-06 12:57:05,024][1854170] Updated weights for policy 0, policy_version 3045 (0.0007) +[2023-03-06 12:57:05,838][1854170] Updated weights for policy 0, policy_version 3055 (0.0007) +[2023-03-06 12:57:06,656][1854170] Updated weights for policy 0, policy_version 3065 (0.0006) +[2023-03-06 12:57:07,470][1854170] Updated weights for policy 0, policy_version 3075 (0.0006) +[2023-03-06 12:57:08,277][1854170] Updated weights for policy 0, policy_version 3085 (0.0006) +[2023-03-06 12:57:08,700][1853846] Fps is (10 sec: 12595.2, 60 sec: 12527.0, 300 sec: 12315.7). Total num frames: 3164160. Throughput: 0: 12530.8. Samples: 2266591. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 12:57:08,711][1853846] Avg episode reward: [(0, '670.392')] +[2023-03-06 12:57:09,099][1854170] Updated weights for policy 0, policy_version 3095 (0.0007) +[2023-03-06 12:57:09,905][1854170] Updated weights for policy 0, policy_version 3105 (0.0008) +[2023-03-06 12:57:10,706][1854170] Updated weights for policy 0, policy_version 3115 (0.0006) +[2023-03-06 12:57:11,518][1854170] Updated weights for policy 0, policy_version 3125 (0.0006) +[2023-03-06 12:57:12,345][1854170] Updated weights for policy 0, policy_version 3135 (0.0006) +[2023-03-06 12:57:13,149][1854170] Updated weights for policy 0, policy_version 3145 (0.0006) +[2023-03-06 12:57:13,701][1853846] Fps is (10 sec: 12595.2, 60 sec: 12526.9, 300 sec: 12320.3). Total num frames: 3226624. Throughput: 0: 12532.1. Samples: 2342068. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 12:57:13,711][1853846] Avg episode reward: [(0, '620.756')] +[2023-03-06 12:57:13,978][1854170] Updated weights for policy 0, policy_version 3155 (0.0007) +[2023-03-06 12:57:14,797][1854170] Updated weights for policy 0, policy_version 3165 (0.0006) +[2023-03-06 12:57:15,602][1854170] Updated weights for policy 0, policy_version 3175 (0.0006) +[2023-03-06 12:57:16,430][1854170] Updated weights for policy 0, policy_version 3185 (0.0008) +[2023-03-06 12:57:17,234][1854170] Updated weights for policy 0, policy_version 3195 (0.0006) +[2023-03-06 12:57:18,030][1854170] Updated weights for policy 0, policy_version 3205 (0.0007) +[2023-03-06 12:57:18,701][1853846] Fps is (10 sec: 12492.7, 60 sec: 12526.9, 300 sec: 12324.8). Total num frames: 3289088. Throughput: 0: 12533.2. Samples: 2379706. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 12:57:18,701][1853846] Avg episode reward: [(0, '669.571')] +[2023-03-06 12:57:18,854][1854170] Updated weights for policy 0, policy_version 3215 (0.0011) +[2023-03-06 12:57:19,685][1854170] Updated weights for policy 0, policy_version 3225 (0.0006) +[2023-03-06 12:57:20,502][1854170] Updated weights for policy 0, policy_version 3235 (0.0006) +[2023-03-06 12:57:21,313][1854170] Updated weights for policy 0, policy_version 3245 (0.0007) +[2023-03-06 12:57:22,154][1854170] Updated weights for policy 0, policy_version 3255 (0.0007) +[2023-03-06 12:57:22,972][1854170] Updated weights for policy 0, policy_version 3265 (0.0006) +[2023-03-06 12:57:23,701][1853846] Fps is (10 sec: 12595.1, 60 sec: 12544.0, 300 sec: 12334.1). Total num frames: 3352576. Throughput: 0: 12534.5. Samples: 2454677. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) +[2023-03-06 12:57:23,701][1853846] Avg episode reward: [(0, '670.430')] +[2023-03-06 12:57:23,773][1854170] Updated weights for policy 0, policy_version 3275 (0.0007) +[2023-03-06 12:57:24,624][1854170] Updated weights for policy 0, policy_version 3285 (0.0006) +[2023-03-06 12:57:25,432][1854170] Updated weights for policy 0, policy_version 3295 (0.0007) +[2023-03-06 12:57:26,235][1854170] Updated weights for policy 0, policy_version 3305 (0.0006) +[2023-03-06 12:57:27,071][1854170] Updated weights for policy 0, policy_version 3315 (0.0007) +[2023-03-06 12:57:27,873][1854170] Updated weights for policy 0, policy_version 3325 (0.0007) +[2023-03-06 12:57:28,701][1854170] Updated weights for policy 0, policy_version 3335 (0.0006) +[2023-03-06 12:57:28,701][1853846] Fps is (10 sec: 12595.2, 60 sec: 12544.0, 300 sec: 12338.0). Total num frames: 3415040. Throughput: 0: 12524.2. Samples: 2529878. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 12:57:28,701][1853846] Avg episode reward: [(0, '564.999')] +[2023-03-06 12:57:29,527][1854170] Updated weights for policy 0, policy_version 3345 (0.0007) +[2023-03-06 12:57:30,349][1854170] Updated weights for policy 0, policy_version 3355 (0.0007) +[2023-03-06 12:57:31,142][1854170] Updated weights for policy 0, policy_version 3365 (0.0006) +[2023-03-06 12:57:31,975][1854170] Updated weights for policy 0, policy_version 3375 (0.0007) +[2023-03-06 12:57:32,788][1854170] Updated weights for policy 0, policy_version 3385 (0.0006) +[2023-03-06 12:57:33,593][1854170] Updated weights for policy 0, policy_version 3395 (0.0006) +[2023-03-06 12:57:33,701][1853846] Fps is (10 sec: 12492.9, 60 sec: 12526.9, 300 sec: 12341.6). Total num frames: 3477504. Throughput: 0: 12527.7. Samples: 2567351. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) +[2023-03-06 12:57:33,701][1853846] Avg episode reward: [(0, '560.606')] +[2023-03-06 12:57:34,423][1854170] Updated weights for policy 0, policy_version 3405 (0.0007) +[2023-03-06 12:57:35,234][1854170] Updated weights for policy 0, policy_version 3415 (0.0006) +[2023-03-06 12:57:36,054][1854170] Updated weights for policy 0, policy_version 3425 (0.0006) +[2023-03-06 12:57:36,862][1854170] Updated weights for policy 0, policy_version 3435 (0.0007) +[2023-03-06 12:57:37,703][1854170] Updated weights for policy 0, policy_version 3445 (0.0006) +[2023-03-06 12:57:38,505][1854170] Updated weights for policy 0, policy_version 3455 (0.0007) +[2023-03-06 12:57:38,700][1853846] Fps is (10 sec: 12492.8, 60 sec: 12526.9, 300 sec: 12345.2). Total num frames: 3539968. Throughput: 0: 12525.5. Samples: 2642356. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 12:57:38,701][1853846] Avg episode reward: [(0, '663.607')] +[2023-03-06 12:57:39,343][1854170] Updated weights for policy 0, policy_version 3465 (0.0007) +[2023-03-06 12:57:40,165][1854170] Updated weights for policy 0, policy_version 3475 (0.0006) +[2023-03-06 12:57:40,966][1854170] Updated weights for policy 0, policy_version 3485 (0.0006) +[2023-03-06 12:57:41,800][1854170] Updated weights for policy 0, policy_version 3495 (0.0008) +[2023-03-06 12:57:42,611][1854170] Updated weights for policy 0, policy_version 3505 (0.0007) +[2023-03-06 12:57:43,427][1854170] Updated weights for policy 0, policy_version 3515 (0.0007) +[2023-03-06 12:57:43,701][1853846] Fps is (10 sec: 12492.7, 60 sec: 12526.9, 300 sec: 12348.5). Total num frames: 3602432. Throughput: 0: 12519.8. Samples: 2717410. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) +[2023-03-06 12:57:43,701][1853846] Avg episode reward: [(0, '669.217')] +[2023-03-06 12:57:44,260][1854170] Updated weights for policy 0, policy_version 3525 (0.0006) +[2023-03-06 12:57:45,077][1854170] Updated weights for policy 0, policy_version 3535 (0.0006) +[2023-03-06 12:57:45,885][1854170] Updated weights for policy 0, policy_version 3545 (0.0007) +[2023-03-06 12:57:46,711][1854170] Updated weights for policy 0, policy_version 3555 (0.0007) +[2023-03-06 12:57:47,525][1854170] Updated weights for policy 0, policy_version 3565 (0.0006) +[2023-03-06 12:57:48,344][1854170] Updated weights for policy 0, policy_version 3575 (0.0007) +[2023-03-06 12:57:48,701][1853846] Fps is (10 sec: 12492.7, 60 sec: 12526.9, 300 sec: 12351.7). Total num frames: 3664896. Throughput: 0: 12523.0. Samples: 2754969. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 12:57:48,701][1853846] Avg episode reward: [(0, '740.778')] +[2023-03-06 12:57:48,702][1854119] Saving new best policy, reward=740.778! +[2023-03-06 12:57:49,144][1854170] Updated weights for policy 0, policy_version 3585 (0.0006) +[2023-03-06 12:57:50,000][1854170] Updated weights for policy 0, policy_version 3595 (0.0006) +[2023-03-06 12:57:50,818][1854170] Updated weights for policy 0, policy_version 3605 (0.0006) +[2023-03-06 12:57:51,616][1854170] Updated weights for policy 0, policy_version 3615 (0.0007) +[2023-03-06 12:57:52,434][1854170] Updated weights for policy 0, policy_version 3625 (0.0006) +[2023-03-06 12:57:53,261][1854170] Updated weights for policy 0, policy_version 3635 (0.0007) +[2023-03-06 12:57:53,701][1853846] Fps is (10 sec: 12492.9, 60 sec: 12526.9, 300 sec: 12354.8). Total num frames: 3727360. Throughput: 0: 12520.3. Samples: 2830003. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) +[2023-03-06 12:57:53,701][1853846] Avg episode reward: [(0, '678.310')] +[2023-03-06 12:57:53,704][1854119] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000003640_3727360.pth... +[2023-03-06 12:57:53,737][1854119] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000000865_885760.pth +[2023-03-06 12:57:54,060][1854170] Updated weights for policy 0, policy_version 3645 (0.0007) +[2023-03-06 12:57:54,892][1854170] Updated weights for policy 0, policy_version 3655 (0.0007) +[2023-03-06 12:57:55,706][1854170] Updated weights for policy 0, policy_version 3665 (0.0007) +[2023-03-06 12:57:56,520][1854170] Updated weights for policy 0, policy_version 3675 (0.0007) +[2023-03-06 12:57:57,338][1854170] Updated weights for policy 0, policy_version 3685 (0.0007) +[2023-03-06 12:57:58,164][1854170] Updated weights for policy 0, policy_version 3695 (0.0007) +[2023-03-06 12:57:58,701][1853846] Fps is (10 sec: 12492.9, 60 sec: 12526.9, 300 sec: 12357.7). Total num frames: 3789824. Throughput: 0: 12511.2. Samples: 2905071. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) +[2023-03-06 12:57:58,701][1853846] Avg episode reward: [(0, '700.615')] +[2023-03-06 12:57:58,974][1854170] Updated weights for policy 0, policy_version 3705 (0.0007) +[2023-03-06 12:57:59,791][1854170] Updated weights for policy 0, policy_version 3715 (0.0006) +[2023-03-06 12:58:00,621][1854170] Updated weights for policy 0, policy_version 3725 (0.0007) +[2023-03-06 12:58:01,454][1854170] Updated weights for policy 0, policy_version 3735 (0.0007) +[2023-03-06 12:58:02,282][1854170] Updated weights for policy 0, policy_version 3745 (0.0006) +[2023-03-06 12:58:03,098][1854170] Updated weights for policy 0, policy_version 3755 (0.0006) +[2023-03-06 12:58:03,701][1853846] Fps is (10 sec: 12492.8, 60 sec: 12526.9, 300 sec: 12360.5). Total num frames: 3852288. Throughput: 0: 12503.2. Samples: 2942349. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) +[2023-03-06 12:58:03,701][1853846] Avg episode reward: [(0, '683.341')] +[2023-03-06 12:58:03,926][1854170] Updated weights for policy 0, policy_version 3765 (0.0006) +[2023-03-06 12:58:04,747][1854170] Updated weights for policy 0, policy_version 3775 (0.0006) +[2023-03-06 12:58:05,569][1854170] Updated weights for policy 0, policy_version 3785 (0.0007) +[2023-03-06 12:58:06,398][1854170] Updated weights for policy 0, policy_version 3795 (0.0006) +[2023-03-06 12:58:07,209][1854170] Updated weights for policy 0, policy_version 3805 (0.0007) +[2023-03-06 12:58:08,051][1854170] Updated weights for policy 0, policy_version 3815 (0.0006) +[2023-03-06 12:58:08,701][1853846] Fps is (10 sec: 12492.7, 60 sec: 12509.8, 300 sec: 12363.2). Total num frames: 3914752. Throughput: 0: 12495.6. Samples: 3016980. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) +[2023-03-06 12:58:08,701][1853846] Avg episode reward: [(0, '639.643')] +[2023-03-06 12:58:08,868][1854170] Updated weights for policy 0, policy_version 3825 (0.0006) +[2023-03-06 12:58:09,673][1854170] Updated weights for policy 0, policy_version 3835 (0.0007) +[2023-03-06 12:58:10,502][1854170] Updated weights for policy 0, policy_version 3845 (0.0007) +[2023-03-06 12:58:11,313][1854170] Updated weights for policy 0, policy_version 3855 (0.0007) +[2023-03-06 12:58:12,126][1854170] Updated weights for policy 0, policy_version 3865 (0.0007) +[2023-03-06 12:58:12,945][1854170] Updated weights for policy 0, policy_version 3875 (0.0006) +[2023-03-06 12:58:13,701][1853846] Fps is (10 sec: 12492.7, 60 sec: 12509.9, 300 sec: 12365.8). Total num frames: 3977216. Throughput: 0: 12496.0. Samples: 3092200. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 12:58:13,701][1853846] Avg episode reward: [(0, '649.718')] +[2023-03-06 12:58:13,762][1854170] Updated weights for policy 0, policy_version 3885 (0.0006) +[2023-03-06 12:58:14,563][1854170] Updated weights for policy 0, policy_version 3895 (0.0006) +[2023-03-06 12:58:15,388][1854170] Updated weights for policy 0, policy_version 3905 (0.0007) +[2023-03-06 12:58:16,228][1854170] Updated weights for policy 0, policy_version 3915 (0.0007) +[2023-03-06 12:58:17,052][1854170] Updated weights for policy 0, policy_version 3925 (0.0007) +[2023-03-06 12:58:17,850][1854170] Updated weights for policy 0, policy_version 3935 (0.0007) +[2023-03-06 12:58:18,662][1854170] Updated weights for policy 0, policy_version 3945 (0.0007) +[2023-03-06 12:58:18,700][1853846] Fps is (10 sec: 12492.9, 60 sec: 12509.9, 300 sec: 12368.3). Total num frames: 4039680. Throughput: 0: 12496.1. Samples: 3129677. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) +[2023-03-06 12:58:18,701][1853846] Avg episode reward: [(0, '647.881')] +[2023-03-06 12:58:19,485][1854170] Updated weights for policy 0, policy_version 3955 (0.0007) +[2023-03-06 12:58:20,311][1854170] Updated weights for policy 0, policy_version 3965 (0.0008) +[2023-03-06 12:58:21,115][1854170] Updated weights for policy 0, policy_version 3975 (0.0007) +[2023-03-06 12:58:21,940][1854170] Updated weights for policy 0, policy_version 3985 (0.0007) +[2023-03-06 12:58:22,747][1854170] Updated weights for policy 0, policy_version 3995 (0.0006) +[2023-03-06 12:58:23,563][1854170] Updated weights for policy 0, policy_version 4005 (0.0006) +[2023-03-06 12:58:23,701][1853846] Fps is (10 sec: 12492.8, 60 sec: 12492.8, 300 sec: 12370.7). Total num frames: 4102144. Throughput: 0: 12506.4. Samples: 3205143. Policy #0 lag: (min: 0.0, avg: 1.2, max: 4.0) +[2023-03-06 12:58:23,701][1853846] Avg episode reward: [(0, '632.355')] +[2023-03-06 12:58:24,375][1854170] Updated weights for policy 0, policy_version 4015 (0.0006) +[2023-03-06 12:58:25,210][1854170] Updated weights for policy 0, policy_version 4025 (0.0006) +[2023-03-06 12:58:26,029][1854170] Updated weights for policy 0, policy_version 4035 (0.0006) +[2023-03-06 12:58:26,837][1854170] Updated weights for policy 0, policy_version 4045 (0.0007) +[2023-03-06 12:58:27,650][1854170] Updated weights for policy 0, policy_version 4055 (0.0006) +[2023-03-06 12:58:28,476][1854170] Updated weights for policy 0, policy_version 4065 (0.0006) +[2023-03-06 12:58:28,701][1853846] Fps is (10 sec: 12595.1, 60 sec: 12509.9, 300 sec: 12376.9). Total num frames: 4165632. Throughput: 0: 12507.8. Samples: 3280260. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 12:58:28,701][1853846] Avg episode reward: [(0, '624.311')] +[2023-03-06 12:58:29,286][1854170] Updated weights for policy 0, policy_version 4075 (0.0006) +[2023-03-06 12:58:30,106][1854170] Updated weights for policy 0, policy_version 4085 (0.0007) +[2023-03-06 12:58:30,944][1854170] Updated weights for policy 0, policy_version 4095 (0.0007) +[2023-03-06 12:58:31,763][1854170] Updated weights for policy 0, policy_version 4105 (0.0007) +[2023-03-06 12:58:32,581][1854170] Updated weights for policy 0, policy_version 4115 (0.0007) +[2023-03-06 12:58:33,413][1854170] Updated weights for policy 0, policy_version 4125 (0.0007) +[2023-03-06 12:58:33,701][1853846] Fps is (10 sec: 12492.7, 60 sec: 12492.8, 300 sec: 12375.2). Total num frames: 4227072. Throughput: 0: 12499.7. Samples: 3317454. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) +[2023-03-06 12:58:33,701][1853846] Avg episode reward: [(0, '557.825')] +[2023-03-06 12:58:34,238][1854170] Updated weights for policy 0, policy_version 4135 (0.0006) +[2023-03-06 12:58:35,043][1854170] Updated weights for policy 0, policy_version 4145 (0.0007) +[2023-03-06 12:58:35,867][1854170] Updated weights for policy 0, policy_version 4155 (0.0006) +[2023-03-06 12:58:36,686][1854170] Updated weights for policy 0, policy_version 4165 (0.0007) +[2023-03-06 12:58:37,511][1854170] Updated weights for policy 0, policy_version 4175 (0.0007) +[2023-03-06 12:58:38,324][1854170] Updated weights for policy 0, policy_version 4185 (0.0007) +[2023-03-06 12:58:38,701][1853846] Fps is (10 sec: 12390.4, 60 sec: 12492.8, 300 sec: 12377.4). Total num frames: 4289536. Throughput: 0: 12497.7. Samples: 3392400. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 12:58:38,701][1853846] Avg episode reward: [(0, '688.406')] +[2023-03-06 12:58:39,161][1854170] Updated weights for policy 0, policy_version 4195 (0.0007) +[2023-03-06 12:58:39,977][1854170] Updated weights for policy 0, policy_version 4205 (0.0006) +[2023-03-06 12:58:40,803][1854170] Updated weights for policy 0, policy_version 4215 (0.0007) +[2023-03-06 12:58:41,615][1854170] Updated weights for policy 0, policy_version 4225 (0.0006) +[2023-03-06 12:58:42,445][1854170] Updated weights for policy 0, policy_version 4235 (0.0006) +[2023-03-06 12:58:43,264][1854170] Updated weights for policy 0, policy_version 4245 (0.0006) +[2023-03-06 12:58:43,701][1853846] Fps is (10 sec: 12492.9, 60 sec: 12492.8, 300 sec: 12379.4). Total num frames: 4352000. Throughput: 0: 12487.4. Samples: 3467006. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) +[2023-03-06 12:58:43,701][1853846] Avg episode reward: [(0, '631.653')] +[2023-03-06 12:58:44,090][1854170] Updated weights for policy 0, policy_version 4255 (0.0006) +[2023-03-06 12:58:44,877][1854170] Updated weights for policy 0, policy_version 4265 (0.0007) +[2023-03-06 12:58:45,687][1854170] Updated weights for policy 0, policy_version 4275 (0.0006) +[2023-03-06 12:58:46,525][1854170] Updated weights for policy 0, policy_version 4285 (0.0006) +[2023-03-06 12:58:47,341][1854170] Updated weights for policy 0, policy_version 4295 (0.0007) +[2023-03-06 12:58:48,166][1854170] Updated weights for policy 0, policy_version 4305 (0.0006) +[2023-03-06 12:58:48,701][1853846] Fps is (10 sec: 12492.8, 60 sec: 12492.8, 300 sec: 12381.4). Total num frames: 4414464. Throughput: 0: 12495.6. Samples: 3504651. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) +[2023-03-06 12:58:48,701][1853846] Avg episode reward: [(0, '577.702')] +[2023-03-06 12:58:48,966][1854170] Updated weights for policy 0, policy_version 4315 (0.0007) +[2023-03-06 12:58:49,776][1854170] Updated weights for policy 0, policy_version 4325 (0.0007) +[2023-03-06 12:58:50,612][1854170] Updated weights for policy 0, policy_version 4335 (0.0006) +[2023-03-06 12:58:51,422][1854170] Updated weights for policy 0, policy_version 4345 (0.0006) +[2023-03-06 12:58:52,240][1854170] Updated weights for policy 0, policy_version 4355 (0.0007) +[2023-03-06 12:58:53,108][1854170] Updated weights for policy 0, policy_version 4365 (0.0006) +[2023-03-06 12:58:53,701][1853846] Fps is (10 sec: 12492.7, 60 sec: 12492.8, 300 sec: 12383.3). Total num frames: 4476928. Throughput: 0: 12511.4. Samples: 3579995. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) +[2023-03-06 12:58:53,701][1853846] Avg episode reward: [(0, '541.348')] +[2023-03-06 12:58:53,932][1854170] Updated weights for policy 0, policy_version 4375 (0.0007) +[2023-03-06 12:58:54,746][1854170] Updated weights for policy 0, policy_version 4385 (0.0006) +[2023-03-06 12:58:55,562][1854170] Updated weights for policy 0, policy_version 4395 (0.0007) +[2023-03-06 12:58:56,371][1854170] Updated weights for policy 0, policy_version 4405 (0.0007) +[2023-03-06 12:58:57,196][1854170] Updated weights for policy 0, policy_version 4415 (0.0006) +[2023-03-06 12:58:58,032][1854170] Updated weights for policy 0, policy_version 4425 (0.0006) +[2023-03-06 12:58:58,700][1853846] Fps is (10 sec: 12492.9, 60 sec: 12492.8, 300 sec: 12385.2). Total num frames: 4539392. Throughput: 0: 12488.0. Samples: 3654157. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 12:58:58,701][1853846] Avg episode reward: [(0, '634.944')] +[2023-03-06 12:58:58,862][1854170] Updated weights for policy 0, policy_version 4435 (0.0007) +[2023-03-06 12:58:59,684][1854170] Updated weights for policy 0, policy_version 4445 (0.0006) +[2023-03-06 12:59:00,501][1854170] Updated weights for policy 0, policy_version 4455 (0.0006) +[2023-03-06 12:59:01,317][1854170] Updated weights for policy 0, policy_version 4465 (0.0006) +[2023-03-06 12:59:02,126][1854170] Updated weights for policy 0, policy_version 4475 (0.0007) +[2023-03-06 12:59:02,943][1854170] Updated weights for policy 0, policy_version 4485 (0.0006) +[2023-03-06 12:59:03,700][1853846] Fps is (10 sec: 12492.9, 60 sec: 12492.8, 300 sec: 12534.5). Total num frames: 4601856. Throughput: 0: 12491.0. Samples: 3691772. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) +[2023-03-06 12:59:03,701][1853846] Avg episode reward: [(0, '547.010')] +[2023-03-06 12:59:03,769][1854170] Updated weights for policy 0, policy_version 4495 (0.0006) +[2023-03-06 12:59:04,586][1854170] Updated weights for policy 0, policy_version 4505 (0.0007) +[2023-03-06 12:59:05,404][1854170] Updated weights for policy 0, policy_version 4515 (0.0007) +[2023-03-06 12:59:06,240][1854170] Updated weights for policy 0, policy_version 4525 (0.0006) +[2023-03-06 12:59:07,050][1854170] Updated weights for policy 0, policy_version 4535 (0.0008) +[2023-03-06 12:59:07,878][1854170] Updated weights for policy 0, policy_version 4545 (0.0007) +[2023-03-06 12:59:08,699][1854170] Updated weights for policy 0, policy_version 4555 (0.0007) +[2023-03-06 12:59:08,700][1853846] Fps is (10 sec: 12492.8, 60 sec: 12492.8, 300 sec: 12534.5). Total num frames: 4664320. Throughput: 0: 12475.8. Samples: 3766551. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) +[2023-03-06 12:59:08,701][1853846] Avg episode reward: [(0, '576.562')] +[2023-03-06 12:59:09,510][1854170] Updated weights for policy 0, policy_version 4565 (0.0006) +[2023-03-06 12:59:10,338][1854170] Updated weights for policy 0, policy_version 4575 (0.0008) +[2023-03-06 12:59:11,159][1854170] Updated weights for policy 0, policy_version 4585 (0.0007) +[2023-03-06 12:59:11,988][1854170] Updated weights for policy 0, policy_version 4595 (0.0006) +[2023-03-06 12:59:12,812][1854170] Updated weights for policy 0, policy_version 4605 (0.0006) +[2023-03-06 12:59:13,645][1854170] Updated weights for policy 0, policy_version 4615 (0.0006) +[2023-03-06 12:59:13,700][1853846] Fps is (10 sec: 12390.4, 60 sec: 12475.8, 300 sec: 12527.5). Total num frames: 4725760. Throughput: 0: 12466.3. Samples: 3841241. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 12:59:13,701][1853846] Avg episode reward: [(0, '528.511')] +[2023-03-06 12:59:14,470][1854170] Updated weights for policy 0, policy_version 4625 (0.0006) +[2023-03-06 12:59:15,293][1854170] Updated weights for policy 0, policy_version 4635 (0.0007) +[2023-03-06 12:59:16,107][1854170] Updated weights for policy 0, policy_version 4645 (0.0007) +[2023-03-06 12:59:16,930][1854170] Updated weights for policy 0, policy_version 4655 (0.0006) +[2023-03-06 12:59:17,748][1854170] Updated weights for policy 0, policy_version 4665 (0.0006) +[2023-03-06 12:59:18,577][1854170] Updated weights for policy 0, policy_version 4675 (0.0006) +[2023-03-06 12:59:18,700][1853846] Fps is (10 sec: 12390.4, 60 sec: 12475.7, 300 sec: 12524.0). Total num frames: 4788224. Throughput: 0: 12467.3. Samples: 3878481. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 12:59:18,701][1853846] Avg episode reward: [(0, '557.104')] +[2023-03-06 12:59:19,405][1854170] Updated weights for policy 0, policy_version 4685 (0.0006) +[2023-03-06 12:59:20,232][1854170] Updated weights for policy 0, policy_version 4695 (0.0006) +[2023-03-06 12:59:21,035][1854170] Updated weights for policy 0, policy_version 4705 (0.0006) +[2023-03-06 12:59:21,868][1854170] Updated weights for policy 0, policy_version 4715 (0.0007) +[2023-03-06 12:59:22,688][1854170] Updated weights for policy 0, policy_version 4725 (0.0007) +[2023-03-06 12:59:23,494][1854170] Updated weights for policy 0, policy_version 4735 (0.0007) +[2023-03-06 12:59:23,700][1853846] Fps is (10 sec: 12492.8, 60 sec: 12475.7, 300 sec: 12524.0). Total num frames: 4850688. Throughput: 0: 12463.1. Samples: 3953240. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) +[2023-03-06 12:59:23,704][1853846] Avg episode reward: [(0, '591.774')] +[2023-03-06 12:59:24,318][1854170] Updated weights for policy 0, policy_version 4745 (0.0008) +[2023-03-06 12:59:25,152][1854170] Updated weights for policy 0, policy_version 4755 (0.0007) +[2023-03-06 12:59:25,963][1854170] Updated weights for policy 0, policy_version 4765 (0.0007) +[2023-03-06 12:59:26,786][1854170] Updated weights for policy 0, policy_version 4775 (0.0006) +[2023-03-06 12:59:27,621][1854170] Updated weights for policy 0, policy_version 4785 (0.0006) +[2023-03-06 12:59:28,426][1854170] Updated weights for policy 0, policy_version 4795 (0.0006) +[2023-03-06 12:59:28,701][1853846] Fps is (10 sec: 12492.8, 60 sec: 12458.7, 300 sec: 12520.6). Total num frames: 4913152. Throughput: 0: 12467.3. Samples: 4028034. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) +[2023-03-06 12:59:28,712][1853846] Avg episode reward: [(0, '530.705')] +[2023-03-06 12:59:29,269][1854170] Updated weights for policy 0, policy_version 4805 (0.0007) +[2023-03-06 12:59:30,090][1854170] Updated weights for policy 0, policy_version 4815 (0.0006) +[2023-03-06 12:59:30,905][1854170] Updated weights for policy 0, policy_version 4825 (0.0007) +[2023-03-06 12:59:31,736][1854170] Updated weights for policy 0, policy_version 4835 (0.0006) +[2023-03-06 12:59:32,562][1854170] Updated weights for policy 0, policy_version 4845 (0.0006) +[2023-03-06 12:59:33,378][1854170] Updated weights for policy 0, policy_version 4855 (0.0007) +[2023-03-06 12:59:33,701][1853846] Fps is (10 sec: 12492.7, 60 sec: 12475.8, 300 sec: 12520.6). Total num frames: 4975616. Throughput: 0: 12461.1. Samples: 4065398. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 12:59:33,712][1853846] Avg episode reward: [(0, '620.004')] +[2023-03-06 12:59:34,190][1854170] Updated weights for policy 0, policy_version 4865 (0.0007) +[2023-03-06 12:59:35,027][1854170] Updated weights for policy 0, policy_version 4875 (0.0006) +[2023-03-06 12:59:35,841][1854170] Updated weights for policy 0, policy_version 4885 (0.0006) +[2023-03-06 12:59:36,639][1854170] Updated weights for policy 0, policy_version 4895 (0.0007) +[2023-03-06 12:59:37,491][1854170] Updated weights for policy 0, policy_version 4905 (0.0007) +[2023-03-06 12:59:38,290][1854170] Updated weights for policy 0, policy_version 4915 (0.0007) +[2023-03-06 12:59:38,701][1853846] Fps is (10 sec: 12390.3, 60 sec: 12458.7, 300 sec: 12517.1). Total num frames: 5037056. Throughput: 0: 12447.8. Samples: 4140146. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) +[2023-03-06 12:59:38,712][1853846] Avg episode reward: [(0, '544.470')] +[2023-03-06 12:59:39,111][1854170] Updated weights for policy 0, policy_version 4925 (0.0006) +[2023-03-06 12:59:39,959][1854170] Updated weights for policy 0, policy_version 4935 (0.0006) +[2023-03-06 12:59:40,772][1854170] Updated weights for policy 0, policy_version 4945 (0.0006) +[2023-03-06 12:59:41,585][1854170] Updated weights for policy 0, policy_version 4955 (0.0007) +[2023-03-06 12:59:42,408][1854170] Updated weights for policy 0, policy_version 4965 (0.0006) +[2023-03-06 12:59:43,232][1854170] Updated weights for policy 0, policy_version 4975 (0.0006) +[2023-03-06 12:59:43,701][1853846] Fps is (10 sec: 12390.4, 60 sec: 12458.7, 300 sec: 12513.6). Total num frames: 5099520. Throughput: 0: 12463.8. Samples: 4215029. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) +[2023-03-06 12:59:43,701][1853846] Avg episode reward: [(0, '495.889')] +[2023-03-06 12:59:44,067][1854170] Updated weights for policy 0, policy_version 4985 (0.0006) +[2023-03-06 12:59:44,866][1854170] Updated weights for policy 0, policy_version 4995 (0.0006) +[2023-03-06 12:59:45,693][1854170] Updated weights for policy 0, policy_version 5005 (0.0006) +[2023-03-06 12:59:46,514][1854170] Updated weights for policy 0, policy_version 5015 (0.0006) +[2023-03-06 12:59:47,319][1854170] Updated weights for policy 0, policy_version 5025 (0.0006) +[2023-03-06 12:59:48,134][1854170] Updated weights for policy 0, policy_version 5035 (0.0006) +[2023-03-06 12:59:48,701][1853846] Fps is (10 sec: 12492.8, 60 sec: 12458.7, 300 sec: 12513.6). Total num frames: 5161984. Throughput: 0: 12462.4. Samples: 4252581. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 12:59:48,701][1853846] Avg episode reward: [(0, '468.238')] +[2023-03-06 12:59:48,960][1854170] Updated weights for policy 0, policy_version 5045 (0.0007) +[2023-03-06 12:59:49,797][1854170] Updated weights for policy 0, policy_version 5055 (0.0006) +[2023-03-06 12:59:50,625][1854170] Updated weights for policy 0, policy_version 5065 (0.0007) +[2023-03-06 12:59:51,444][1854170] Updated weights for policy 0, policy_version 5075 (0.0007) +[2023-03-06 12:59:52,276][1854170] Updated weights for policy 0, policy_version 5085 (0.0006) +[2023-03-06 12:59:53,094][1854170] Updated weights for policy 0, policy_version 5095 (0.0006) +[2023-03-06 12:59:53,701][1853846] Fps is (10 sec: 12492.7, 60 sec: 12458.7, 300 sec: 12510.1). Total num frames: 5224448. Throughput: 0: 12455.5. Samples: 4327052. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) +[2023-03-06 12:59:53,701][1853846] Avg episode reward: [(0, '527.898')] +[2023-03-06 12:59:53,705][1854119] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000005102_5224448.pth... +[2023-03-06 12:59:53,736][1854119] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000002172_2224128.pth +[2023-03-06 12:59:53,905][1854170] Updated weights for policy 0, policy_version 5105 (0.0006) +[2023-03-06 12:59:54,738][1854170] Updated weights for policy 0, policy_version 5115 (0.0006) +[2023-03-06 12:59:55,578][1854170] Updated weights for policy 0, policy_version 5125 (0.0007) +[2023-03-06 12:59:56,391][1854170] Updated weights for policy 0, policy_version 5135 (0.0007) +[2023-03-06 12:59:57,206][1854170] Updated weights for policy 0, policy_version 5145 (0.0007) +[2023-03-06 12:59:58,032][1854170] Updated weights for policy 0, policy_version 5155 (0.0007) +[2023-03-06 12:59:58,700][1853846] Fps is (10 sec: 12492.9, 60 sec: 12458.7, 300 sec: 12506.7). Total num frames: 5286912. Throughput: 0: 12457.2. Samples: 4401814. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 12:59:58,701][1853846] Avg episode reward: [(0, '437.085')] +[2023-03-06 12:59:58,858][1854170] Updated weights for policy 0, policy_version 5165 (0.0007) +[2023-03-06 12:59:59,674][1854170] Updated weights for policy 0, policy_version 5175 (0.0006) +[2023-03-06 13:00:00,504][1854170] Updated weights for policy 0, policy_version 5185 (0.0007) +[2023-03-06 13:00:01,322][1854170] Updated weights for policy 0, policy_version 5195 (0.0007) +[2023-03-06 13:00:02,138][1854170] Updated weights for policy 0, policy_version 5205 (0.0006) +[2023-03-06 13:00:02,957][1854170] Updated weights for policy 0, policy_version 5215 (0.0007) +[2023-03-06 13:00:03,700][1853846] Fps is (10 sec: 12492.9, 60 sec: 12458.7, 300 sec: 12506.7). Total num frames: 5349376. Throughput: 0: 12460.2. Samples: 4439188. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) +[2023-03-06 13:00:03,701][1853846] Avg episode reward: [(0, '436.118')] +[2023-03-06 13:00:03,774][1854170] Updated weights for policy 0, policy_version 5225 (0.0007) +[2023-03-06 13:00:04,581][1854170] Updated weights for policy 0, policy_version 5235 (0.0007) +[2023-03-06 13:00:05,396][1854170] Updated weights for policy 0, policy_version 5245 (0.0006) +[2023-03-06 13:00:06,211][1854170] Updated weights for policy 0, policy_version 5255 (0.0006) +[2023-03-06 13:00:07,041][1854170] Updated weights for policy 0, policy_version 5265 (0.0007) +[2023-03-06 13:00:07,847][1854170] Updated weights for policy 0, policy_version 5275 (0.0006) +[2023-03-06 13:00:08,656][1854170] Updated weights for policy 0, policy_version 5285 (0.0007) +[2023-03-06 13:00:08,700][1853846] Fps is (10 sec: 12492.8, 60 sec: 12458.7, 300 sec: 12506.7). Total num frames: 5411840. Throughput: 0: 12469.9. Samples: 4514385. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) +[2023-03-06 13:00:08,701][1853846] Avg episode reward: [(0, '429.408')] +[2023-03-06 13:00:09,475][1854170] Updated weights for policy 0, policy_version 5295 (0.0006) +[2023-03-06 13:00:10,290][1854170] Updated weights for policy 0, policy_version 5305 (0.0006) +[2023-03-06 13:00:11,111][1854170] Updated weights for policy 0, policy_version 5315 (0.0006) +[2023-03-06 13:00:11,930][1854170] Updated weights for policy 0, policy_version 5325 (0.0007) +[2023-03-06 13:00:12,750][1854170] Updated weights for policy 0, policy_version 5335 (0.0006) +[2023-03-06 13:00:13,590][1854170] Updated weights for policy 0, policy_version 5345 (0.0007) +[2023-03-06 13:00:13,700][1853846] Fps is (10 sec: 12492.8, 60 sec: 12475.7, 300 sec: 12506.7). Total num frames: 5474304. Throughput: 0: 12474.2. Samples: 4589371. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) +[2023-03-06 13:00:13,701][1853846] Avg episode reward: [(0, '469.715')] +[2023-03-06 13:00:14,403][1854170] Updated weights for policy 0, policy_version 5355 (0.0007) +[2023-03-06 13:00:15,225][1854170] Updated weights for policy 0, policy_version 5365 (0.0006) +[2023-03-06 13:00:16,056][1854170] Updated weights for policy 0, policy_version 5375 (0.0007) +[2023-03-06 13:00:16,888][1854170] Updated weights for policy 0, policy_version 5385 (0.0006) +[2023-03-06 13:00:17,686][1854170] Updated weights for policy 0, policy_version 5395 (0.0006) +[2023-03-06 13:00:18,523][1854170] Updated weights for policy 0, policy_version 5405 (0.0007) +[2023-03-06 13:00:18,700][1853846] Fps is (10 sec: 12492.8, 60 sec: 12475.7, 300 sec: 12506.7). Total num frames: 5536768. Throughput: 0: 12474.2. Samples: 4626737. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 13:00:18,701][1853846] Avg episode reward: [(0, '491.870')] +[2023-03-06 13:00:19,334][1854170] Updated weights for policy 0, policy_version 5415 (0.0008) +[2023-03-06 13:00:20,140][1854170] Updated weights for policy 0, policy_version 5425 (0.0005) +[2023-03-06 13:00:20,981][1854170] Updated weights for policy 0, policy_version 5435 (0.0006) +[2023-03-06 13:00:21,799][1854170] Updated weights for policy 0, policy_version 5445 (0.0007) +[2023-03-06 13:00:22,606][1854170] Updated weights for policy 0, policy_version 5455 (0.0006) +[2023-03-06 13:00:23,452][1854170] Updated weights for policy 0, policy_version 5465 (0.0007) +[2023-03-06 13:00:23,701][1853846] Fps is (10 sec: 12492.7, 60 sec: 12475.7, 300 sec: 12506.7). Total num frames: 5599232. Throughput: 0: 12477.6. Samples: 4701637. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 13:00:23,701][1853846] Avg episode reward: [(0, '538.850')] +[2023-03-06 13:00:24,272][1854170] Updated weights for policy 0, policy_version 5475 (0.0007) +[2023-03-06 13:00:25,083][1854170] Updated weights for policy 0, policy_version 5485 (0.0006) +[2023-03-06 13:00:25,895][1854170] Updated weights for policy 0, policy_version 5495 (0.0006) +[2023-03-06 13:00:26,708][1854170] Updated weights for policy 0, policy_version 5505 (0.0006) +[2023-03-06 13:00:27,538][1854170] Updated weights for policy 0, policy_version 5515 (0.0006) +[2023-03-06 13:00:28,354][1854170] Updated weights for policy 0, policy_version 5525 (0.0007) +[2023-03-06 13:00:28,701][1853846] Fps is (10 sec: 12492.7, 60 sec: 12475.7, 300 sec: 12506.7). Total num frames: 5661696. Throughput: 0: 12481.5. Samples: 4776697. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) +[2023-03-06 13:00:28,701][1853846] Avg episode reward: [(0, '569.469')] +[2023-03-06 13:00:29,176][1854170] Updated weights for policy 0, policy_version 5535 (0.0007) +[2023-03-06 13:00:29,999][1854170] Updated weights for policy 0, policy_version 5545 (0.0006) +[2023-03-06 13:00:30,814][1854170] Updated weights for policy 0, policy_version 5555 (0.0006) +[2023-03-06 13:00:31,633][1854170] Updated weights for policy 0, policy_version 5565 (0.0006) +[2023-03-06 13:00:32,478][1854170] Updated weights for policy 0, policy_version 5575 (0.0007) +[2023-03-06 13:00:33,291][1854170] Updated weights for policy 0, policy_version 5585 (0.0006) +[2023-03-06 13:00:33,700][1853846] Fps is (10 sec: 12492.9, 60 sec: 12475.7, 300 sec: 12506.7). Total num frames: 5724160. Throughput: 0: 12476.7. Samples: 4814030. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) +[2023-03-06 13:00:33,701][1853846] Avg episode reward: [(0, '587.234')] +[2023-03-06 13:00:34,116][1854170] Updated weights for policy 0, policy_version 5595 (0.0006) +[2023-03-06 13:00:34,958][1854170] Updated weights for policy 0, policy_version 5605 (0.0007) +[2023-03-06 13:00:35,778][1854170] Updated weights for policy 0, policy_version 5615 (0.0007) +[2023-03-06 13:00:36,586][1854170] Updated weights for policy 0, policy_version 5625 (0.0006) +[2023-03-06 13:00:37,414][1854170] Updated weights for policy 0, policy_version 5635 (0.0006) +[2023-03-06 13:00:38,259][1854170] Updated weights for policy 0, policy_version 5645 (0.0006) +[2023-03-06 13:00:38,700][1853846] Fps is (10 sec: 12390.4, 60 sec: 12475.7, 300 sec: 12499.7). Total num frames: 5785600. Throughput: 0: 12473.9. Samples: 4888374. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 13:00:38,701][1853846] Avg episode reward: [(0, '686.266')] +[2023-03-06 13:00:39,078][1854170] Updated weights for policy 0, policy_version 5655 (0.0006) +[2023-03-06 13:00:39,892][1854170] Updated weights for policy 0, policy_version 5665 (0.0006) +[2023-03-06 13:00:40,710][1854170] Updated weights for policy 0, policy_version 5675 (0.0006) +[2023-03-06 13:00:41,535][1854170] Updated weights for policy 0, policy_version 5685 (0.0007) +[2023-03-06 13:00:42,361][1854170] Updated weights for policy 0, policy_version 5695 (0.0006) +[2023-03-06 13:00:43,211][1854170] Updated weights for policy 0, policy_version 5705 (0.0007) +[2023-03-06 13:00:43,701][1853846] Fps is (10 sec: 12390.3, 60 sec: 12475.7, 300 sec: 12499.7). Total num frames: 5848064. Throughput: 0: 12470.0. Samples: 4962964. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) +[2023-03-06 13:00:43,701][1853846] Avg episode reward: [(0, '669.399')] +[2023-03-06 13:00:44,018][1854170] Updated weights for policy 0, policy_version 5715 (0.0007) +[2023-03-06 13:00:44,830][1854170] Updated weights for policy 0, policy_version 5725 (0.0006) +[2023-03-06 13:00:45,665][1854170] Updated weights for policy 0, policy_version 5735 (0.0007) +[2023-03-06 13:00:46,478][1854170] Updated weights for policy 0, policy_version 5745 (0.0007) +[2023-03-06 13:00:47,303][1854170] Updated weights for policy 0, policy_version 5755 (0.0007) +[2023-03-06 13:00:48,133][1854170] Updated weights for policy 0, policy_version 5765 (0.0006) +[2023-03-06 13:00:48,701][1853846] Fps is (10 sec: 12492.8, 60 sec: 12475.7, 300 sec: 12496.3). Total num frames: 5910528. Throughput: 0: 12469.4. Samples: 5000313. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 13:00:48,701][1853846] Avg episode reward: [(0, '691.026')] +[2023-03-06 13:00:48,948][1854170] Updated weights for policy 0, policy_version 5775 (0.0007) +[2023-03-06 13:00:49,765][1854170] Updated weights for policy 0, policy_version 5785 (0.0007) +[2023-03-06 13:00:50,602][1854170] Updated weights for policy 0, policy_version 5795 (0.0006) +[2023-03-06 13:00:51,410][1854170] Updated weights for policy 0, policy_version 5805 (0.0006) +[2023-03-06 13:00:52,233][1854170] Updated weights for policy 0, policy_version 5815 (0.0007) +[2023-03-06 13:00:53,060][1854170] Updated weights for policy 0, policy_version 5825 (0.0007) +[2023-03-06 13:00:53,701][1853846] Fps is (10 sec: 12390.4, 60 sec: 12458.7, 300 sec: 12492.8). Total num frames: 5971968. Throughput: 0: 12458.7. Samples: 5075027. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) +[2023-03-06 13:00:53,701][1853846] Avg episode reward: [(0, '708.837')] +[2023-03-06 13:00:53,885][1854170] Updated weights for policy 0, policy_version 5835 (0.0007) +[2023-03-06 13:00:54,705][1854170] Updated weights for policy 0, policy_version 5845 (0.0006) +[2023-03-06 13:00:55,538][1854170] Updated weights for policy 0, policy_version 5855 (0.0007) +[2023-03-06 13:00:56,382][1854170] Updated weights for policy 0, policy_version 5865 (0.0007) +[2023-03-06 13:00:57,213][1854170] Updated weights for policy 0, policy_version 5875 (0.0006) +[2023-03-06 13:00:58,050][1854170] Updated weights for policy 0, policy_version 5885 (0.0007) +[2023-03-06 13:00:58,700][1853846] Fps is (10 sec: 12390.4, 60 sec: 12458.7, 300 sec: 12489.3). Total num frames: 6034432. Throughput: 0: 12438.2. Samples: 5149088. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 13:00:58,701][1853846] Avg episode reward: [(0, '769.343')] +[2023-03-06 13:00:58,702][1854119] Saving new best policy, reward=769.343! +[2023-03-06 13:00:58,875][1854170] Updated weights for policy 0, policy_version 5895 (0.0007) +[2023-03-06 13:00:59,682][1854170] Updated weights for policy 0, policy_version 5905 (0.0006) +[2023-03-06 13:01:00,526][1854170] Updated weights for policy 0, policy_version 5915 (0.0007) +[2023-03-06 13:01:01,346][1854170] Updated weights for policy 0, policy_version 5925 (0.0006) +[2023-03-06 13:01:02,180][1854170] Updated weights for policy 0, policy_version 5935 (0.0006) +[2023-03-06 13:01:03,008][1854170] Updated weights for policy 0, policy_version 5945 (0.0006) +[2023-03-06 13:01:03,701][1853846] Fps is (10 sec: 12390.5, 60 sec: 12441.6, 300 sec: 12485.9). Total num frames: 6095872. Throughput: 0: 12434.0. Samples: 5186267. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) +[2023-03-06 13:01:03,701][1853846] Avg episode reward: [(0, '763.939')] +[2023-03-06 13:01:03,833][1854170] Updated weights for policy 0, policy_version 5955 (0.0006) +[2023-03-06 13:01:04,648][1854170] Updated weights for policy 0, policy_version 5965 (0.0006) +[2023-03-06 13:01:05,467][1854170] Updated weights for policy 0, policy_version 5975 (0.0007) +[2023-03-06 13:01:06,285][1854170] Updated weights for policy 0, policy_version 5985 (0.0006) +[2023-03-06 13:01:07,089][1854170] Updated weights for policy 0, policy_version 5995 (0.0006) +[2023-03-06 13:01:07,898][1854170] Updated weights for policy 0, policy_version 6005 (0.0006) +[2023-03-06 13:01:08,701][1853846] Fps is (10 sec: 12390.3, 60 sec: 12441.6, 300 sec: 12485.9). Total num frames: 6158336. Throughput: 0: 12436.6. Samples: 5261282. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) +[2023-03-06 13:01:08,701][1853846] Avg episode reward: [(0, '726.516')] +[2023-03-06 13:01:08,747][1854170] Updated weights for policy 0, policy_version 6015 (0.0007) +[2023-03-06 13:01:09,550][1854170] Updated weights for policy 0, policy_version 6025 (0.0007) +[2023-03-06 13:01:10,364][1854170] Updated weights for policy 0, policy_version 6035 (0.0006) +[2023-03-06 13:01:11,198][1854170] Updated weights for policy 0, policy_version 6045 (0.0006) +[2023-03-06 13:01:12,014][1854170] Updated weights for policy 0, policy_version 6055 (0.0007) +[2023-03-06 13:01:12,812][1854170] Updated weights for policy 0, policy_version 6065 (0.0006) +[2023-03-06 13:01:13,669][1854170] Updated weights for policy 0, policy_version 6075 (0.0007) +[2023-03-06 13:01:13,701][1853846] Fps is (10 sec: 12492.8, 60 sec: 12441.6, 300 sec: 12485.9). Total num frames: 6220800. Throughput: 0: 12428.7. Samples: 5335987. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 13:01:13,701][1853846] Avg episode reward: [(0, '779.010')] +[2023-03-06 13:01:13,704][1854119] Saving new best policy, reward=779.010! +[2023-03-06 13:01:14,484][1854170] Updated weights for policy 0, policy_version 6085 (0.0007) +[2023-03-06 13:01:15,308][1854170] Updated weights for policy 0, policy_version 6095 (0.0007) +[2023-03-06 13:01:16,148][1854170] Updated weights for policy 0, policy_version 6105 (0.0006) +[2023-03-06 13:01:16,960][1854170] Updated weights for policy 0, policy_version 6115 (0.0006) +[2023-03-06 13:01:17,782][1854170] Updated weights for policy 0, policy_version 6125 (0.0007) +[2023-03-06 13:01:18,604][1854170] Updated weights for policy 0, policy_version 6135 (0.0007) +[2023-03-06 13:01:18,701][1853846] Fps is (10 sec: 12492.8, 60 sec: 12441.6, 300 sec: 12485.9). Total num frames: 6283264. Throughput: 0: 12426.3. Samples: 5373212. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 13:01:18,701][1853846] Avg episode reward: [(0, '753.006')] +[2023-03-06 13:01:19,426][1854170] Updated weights for policy 0, policy_version 6145 (0.0006) +[2023-03-06 13:01:20,253][1854170] Updated weights for policy 0, policy_version 6155 (0.0007) +[2023-03-06 13:01:21,082][1854170] Updated weights for policy 0, policy_version 6165 (0.0006) +[2023-03-06 13:01:21,915][1854170] Updated weights for policy 0, policy_version 6175 (0.0007) +[2023-03-06 13:01:22,725][1854170] Updated weights for policy 0, policy_version 6185 (0.0007) +[2023-03-06 13:01:23,554][1854170] Updated weights for policy 0, policy_version 6195 (0.0006) +[2023-03-06 13:01:23,700][1853846] Fps is (10 sec: 12390.5, 60 sec: 12424.6, 300 sec: 12482.4). Total num frames: 6344704. Throughput: 0: 12434.6. Samples: 5447929. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 13:01:23,701][1853846] Avg episode reward: [(0, '687.125')] +[2023-03-06 13:01:24,370][1854170] Updated weights for policy 0, policy_version 6205 (0.0007) +[2023-03-06 13:01:25,188][1854170] Updated weights for policy 0, policy_version 6215 (0.0006) +[2023-03-06 13:01:26,004][1854170] Updated weights for policy 0, policy_version 6225 (0.0007) +[2023-03-06 13:01:26,835][1854170] Updated weights for policy 0, policy_version 6235 (0.0007) +[2023-03-06 13:01:27,634][1854170] Updated weights for policy 0, policy_version 6245 (0.0006) +[2023-03-06 13:01:28,474][1854170] Updated weights for policy 0, policy_version 6255 (0.0006) +[2023-03-06 13:01:28,701][1853846] Fps is (10 sec: 12390.4, 60 sec: 12424.5, 300 sec: 12478.9). Total num frames: 6407168. Throughput: 0: 12436.7. Samples: 5522617. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) +[2023-03-06 13:01:28,701][1853846] Avg episode reward: [(0, '653.179')] +[2023-03-06 13:01:29,297][1854170] Updated weights for policy 0, policy_version 6265 (0.0007) +[2023-03-06 13:01:30,119][1854170] Updated weights for policy 0, policy_version 6275 (0.0007) +[2023-03-06 13:01:30,934][1854170] Updated weights for policy 0, policy_version 6285 (0.0007) +[2023-03-06 13:01:31,754][1854170] Updated weights for policy 0, policy_version 6295 (0.0007) +[2023-03-06 13:01:32,600][1854170] Updated weights for policy 0, policy_version 6305 (0.0007) +[2023-03-06 13:01:33,418][1854170] Updated weights for policy 0, policy_version 6315 (0.0006) +[2023-03-06 13:01:33,701][1853846] Fps is (10 sec: 12492.7, 60 sec: 12424.5, 300 sec: 12478.9). Total num frames: 6469632. Throughput: 0: 12437.9. Samples: 5560017. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) +[2023-03-06 13:01:33,701][1853846] Avg episode reward: [(0, '640.027')] +[2023-03-06 13:01:34,237][1854170] Updated weights for policy 0, policy_version 6325 (0.0007) +[2023-03-06 13:01:35,065][1854170] Updated weights for policy 0, policy_version 6335 (0.0006) +[2023-03-06 13:01:35,872][1854170] Updated weights for policy 0, policy_version 6345 (0.0007) +[2023-03-06 13:01:36,697][1854170] Updated weights for policy 0, policy_version 6355 (0.0007) +[2023-03-06 13:01:37,521][1854170] Updated weights for policy 0, policy_version 6365 (0.0007) +[2023-03-06 13:01:38,340][1854170] Updated weights for policy 0, policy_version 6375 (0.0006) +[2023-03-06 13:01:38,701][1853846] Fps is (10 sec: 12492.8, 60 sec: 12441.6, 300 sec: 12478.9). Total num frames: 6532096. Throughput: 0: 12437.3. Samples: 5634704. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) +[2023-03-06 13:01:38,701][1853846] Avg episode reward: [(0, '743.556')] +[2023-03-06 13:01:39,170][1854170] Updated weights for policy 0, policy_version 6385 (0.0007) +[2023-03-06 13:01:39,997][1854170] Updated weights for policy 0, policy_version 6395 (0.0006) +[2023-03-06 13:01:40,824][1854170] Updated weights for policy 0, policy_version 6405 (0.0006) +[2023-03-06 13:01:41,639][1854170] Updated weights for policy 0, policy_version 6415 (0.0006) +[2023-03-06 13:01:42,459][1854170] Updated weights for policy 0, policy_version 6425 (0.0006) +[2023-03-06 13:01:43,282][1854170] Updated weights for policy 0, policy_version 6435 (0.0006) +[2023-03-06 13:01:43,700][1853846] Fps is (10 sec: 12492.9, 60 sec: 12441.6, 300 sec: 12478.9). Total num frames: 6594560. Throughput: 0: 12450.7. Samples: 5709367. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) +[2023-03-06 13:01:43,701][1853846] Avg episode reward: [(0, '737.253')] +[2023-03-06 13:01:44,110][1854170] Updated weights for policy 0, policy_version 6445 (0.0007) +[2023-03-06 13:01:44,936][1854170] Updated weights for policy 0, policy_version 6455 (0.0006) +[2023-03-06 13:01:45,751][1854170] Updated weights for policy 0, policy_version 6465 (0.0007) +[2023-03-06 13:01:46,555][1854170] Updated weights for policy 0, policy_version 6475 (0.0006) +[2023-03-06 13:01:47,393][1854170] Updated weights for policy 0, policy_version 6485 (0.0006) +[2023-03-06 13:01:48,196][1854170] Updated weights for policy 0, policy_version 6495 (0.0006) +[2023-03-06 13:01:48,701][1853846] Fps is (10 sec: 12390.3, 60 sec: 12424.5, 300 sec: 12475.4). Total num frames: 6656000. Throughput: 0: 12456.5. Samples: 5746810. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 13:01:48,701][1853846] Avg episode reward: [(0, '755.031')] +[2023-03-06 13:01:49,035][1854170] Updated weights for policy 0, policy_version 6505 (0.0007) +[2023-03-06 13:01:49,867][1854170] Updated weights for policy 0, policy_version 6515 (0.0006) +[2023-03-06 13:01:50,684][1854170] Updated weights for policy 0, policy_version 6525 (0.0006) +[2023-03-06 13:01:51,493][1854170] Updated weights for policy 0, policy_version 6535 (0.0007) +[2023-03-06 13:01:52,324][1854170] Updated weights for policy 0, policy_version 6545 (0.0008) +[2023-03-06 13:01:53,138][1854170] Updated weights for policy 0, policy_version 6555 (0.0006) +[2023-03-06 13:01:53,701][1853846] Fps is (10 sec: 12390.2, 60 sec: 12441.6, 300 sec: 12475.4). Total num frames: 6718464. Throughput: 0: 12452.3. Samples: 5821635. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 13:01:53,701][1853846] Avg episode reward: [(0, '732.804')] +[2023-03-06 13:01:53,709][1854119] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000006562_6719488.pth... +[2023-03-06 13:01:53,740][1854119] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000003640_3727360.pth +[2023-03-06 13:01:53,957][1854170] Updated weights for policy 0, policy_version 6565 (0.0006) +[2023-03-06 13:01:54,791][1854170] Updated weights for policy 0, policy_version 6575 (0.0007) +[2023-03-06 13:01:55,611][1854170] Updated weights for policy 0, policy_version 6585 (0.0008) +[2023-03-06 13:01:56,424][1854170] Updated weights for policy 0, policy_version 6595 (0.0007) +[2023-03-06 13:01:57,257][1854170] Updated weights for policy 0, policy_version 6605 (0.0008) +[2023-03-06 13:01:58,085][1854170] Updated weights for policy 0, policy_version 6615 (0.0006) +[2023-03-06 13:01:58,701][1853846] Fps is (10 sec: 12492.9, 60 sec: 12441.6, 300 sec: 12475.4). Total num frames: 6780928. Throughput: 0: 12448.1. Samples: 5896150. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 13:01:58,701][1853846] Avg episode reward: [(0, '737.615')] +[2023-03-06 13:01:58,907][1854170] Updated weights for policy 0, policy_version 6625 (0.0007) +[2023-03-06 13:01:59,731][1854170] Updated weights for policy 0, policy_version 6635 (0.0007) +[2023-03-06 13:02:00,554][1854170] Updated weights for policy 0, policy_version 6645 (0.0006) +[2023-03-06 13:02:01,362][1854170] Updated weights for policy 0, policy_version 6655 (0.0006) +[2023-03-06 13:02:02,192][1854170] Updated weights for policy 0, policy_version 6665 (0.0007) +[2023-03-06 13:02:03,022][1854170] Updated weights for policy 0, policy_version 6675 (0.0007) +[2023-03-06 13:02:03,701][1853846] Fps is (10 sec: 12492.9, 60 sec: 12458.7, 300 sec: 12472.0). Total num frames: 6843392. Throughput: 0: 12452.0. Samples: 5933552. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) +[2023-03-06 13:02:03,701][1853846] Avg episode reward: [(0, '743.622')] +[2023-03-06 13:02:03,824][1854170] Updated weights for policy 0, policy_version 6685 (0.0006) +[2023-03-06 13:02:04,674][1854170] Updated weights for policy 0, policy_version 6695 (0.0006) +[2023-03-06 13:02:05,482][1854170] Updated weights for policy 0, policy_version 6705 (0.0006) +[2023-03-06 13:02:06,308][1854170] Updated weights for policy 0, policy_version 6715 (0.0007) +[2023-03-06 13:02:07,137][1854170] Updated weights for policy 0, policy_version 6725 (0.0006) +[2023-03-06 13:02:07,943][1854170] Updated weights for policy 0, policy_version 6735 (0.0006) +[2023-03-06 13:02:08,701][1853846] Fps is (10 sec: 12390.4, 60 sec: 12441.6, 300 sec: 12468.5). Total num frames: 6904832. Throughput: 0: 12449.0. Samples: 6008136. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 13:02:08,701][1853846] Avg episode reward: [(0, '604.676')] +[2023-03-06 13:02:08,790][1854170] Updated weights for policy 0, policy_version 6745 (0.0007) +[2023-03-06 13:02:09,613][1854170] Updated weights for policy 0, policy_version 6755 (0.0006) +[2023-03-06 13:02:10,432][1854170] Updated weights for policy 0, policy_version 6765 (0.0007) +[2023-03-06 13:02:11,244][1854170] Updated weights for policy 0, policy_version 6775 (0.0006) +[2023-03-06 13:02:12,067][1854170] Updated weights for policy 0, policy_version 6785 (0.0006) +[2023-03-06 13:02:12,900][1854170] Updated weights for policy 0, policy_version 6795 (0.0007) +[2023-03-06 13:02:13,701][1853846] Fps is (10 sec: 12390.4, 60 sec: 12441.6, 300 sec: 12468.5). Total num frames: 6967296. Throughput: 0: 12449.8. Samples: 6082859. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) +[2023-03-06 13:02:13,701][1853846] Avg episode reward: [(0, '674.428')] +[2023-03-06 13:02:13,713][1854170] Updated weights for policy 0, policy_version 6805 (0.0006) +[2023-03-06 13:02:14,534][1854170] Updated weights for policy 0, policy_version 6815 (0.0006) +[2023-03-06 13:02:15,386][1854170] Updated weights for policy 0, policy_version 6825 (0.0007) +[2023-03-06 13:02:16,205][1854170] Updated weights for policy 0, policy_version 6835 (0.0006) +[2023-03-06 13:02:17,033][1854170] Updated weights for policy 0, policy_version 6845 (0.0006) +[2023-03-06 13:02:17,863][1854170] Updated weights for policy 0, policy_version 6855 (0.0007) +[2023-03-06 13:02:18,671][1854170] Updated weights for policy 0, policy_version 6865 (0.0007) +[2023-03-06 13:02:18,701][1853846] Fps is (10 sec: 12492.8, 60 sec: 12441.6, 300 sec: 12465.0). Total num frames: 7029760. Throughput: 0: 12443.5. Samples: 6119977. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) +[2023-03-06 13:02:18,701][1853846] Avg episode reward: [(0, '759.707')] +[2023-03-06 13:02:19,502][1854170] Updated weights for policy 0, policy_version 6875 (0.0006) +[2023-03-06 13:02:20,312][1854170] Updated weights for policy 0, policy_version 6885 (0.0007) +[2023-03-06 13:02:21,127][1854170] Updated weights for policy 0, policy_version 6895 (0.0006) +[2023-03-06 13:02:21,963][1854170] Updated weights for policy 0, policy_version 6905 (0.0006) +[2023-03-06 13:02:22,795][1854170] Updated weights for policy 0, policy_version 6915 (0.0007) +[2023-03-06 13:02:23,628][1854170] Updated weights for policy 0, policy_version 6925 (0.0007) +[2023-03-06 13:02:23,701][1853846] Fps is (10 sec: 12390.4, 60 sec: 12441.6, 300 sec: 12461.6). Total num frames: 7091200. Throughput: 0: 12441.6. Samples: 6194578. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) +[2023-03-06 13:02:23,701][1853846] Avg episode reward: [(0, '740.226')] +[2023-03-06 13:02:24,449][1854170] Updated weights for policy 0, policy_version 6935 (0.0006) +[2023-03-06 13:02:25,273][1854170] Updated weights for policy 0, policy_version 6945 (0.0007) +[2023-03-06 13:02:26,106][1854170] Updated weights for policy 0, policy_version 6955 (0.0006) +[2023-03-06 13:02:26,935][1854170] Updated weights for policy 0, policy_version 6965 (0.0006) +[2023-03-06 13:02:27,755][1854170] Updated weights for policy 0, policy_version 6975 (0.0007) +[2023-03-06 13:02:28,580][1854170] Updated weights for policy 0, policy_version 6985 (0.0006) +[2023-03-06 13:02:28,701][1853846] Fps is (10 sec: 12390.4, 60 sec: 12441.6, 300 sec: 12461.6). Total num frames: 7153664. Throughput: 0: 12433.8. Samples: 6268890. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 13:02:28,701][1853846] Avg episode reward: [(0, '807.209')] +[2023-03-06 13:02:28,702][1854119] Saving new best policy, reward=807.209! +[2023-03-06 13:02:29,401][1854170] Updated weights for policy 0, policy_version 6995 (0.0007) +[2023-03-06 13:02:30,216][1854170] Updated weights for policy 0, policy_version 7005 (0.0007) +[2023-03-06 13:02:31,049][1854170] Updated weights for policy 0, policy_version 7015 (0.0006) +[2023-03-06 13:02:31,874][1854170] Updated weights for policy 0, policy_version 7025 (0.0006) +[2023-03-06 13:02:32,685][1854170] Updated weights for policy 0, policy_version 7035 (0.0006) +[2023-03-06 13:02:33,530][1854170] Updated weights for policy 0, policy_version 7045 (0.0007) +[2023-03-06 13:02:33,701][1853846] Fps is (10 sec: 12492.8, 60 sec: 12441.6, 300 sec: 12461.6). Total num frames: 7216128. Throughput: 0: 12432.6. Samples: 6306278. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 13:02:33,701][1853846] Avg episode reward: [(0, '775.517')] +[2023-03-06 13:02:34,336][1854170] Updated weights for policy 0, policy_version 7055 (0.0006) +[2023-03-06 13:02:35,148][1854170] Updated weights for policy 0, policy_version 7065 (0.0006) +[2023-03-06 13:02:35,982][1854170] Updated weights for policy 0, policy_version 7075 (0.0007) +[2023-03-06 13:02:36,809][1854170] Updated weights for policy 0, policy_version 7085 (0.0007) +[2023-03-06 13:02:37,636][1854170] Updated weights for policy 0, policy_version 7095 (0.0006) +[2023-03-06 13:02:38,462][1854170] Updated weights for policy 0, policy_version 7105 (0.0006) +[2023-03-06 13:02:38,701][1853846] Fps is (10 sec: 12390.4, 60 sec: 12424.5, 300 sec: 12458.1). Total num frames: 7277568. Throughput: 0: 12425.2. Samples: 6380770. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 13:02:38,701][1853846] Avg episode reward: [(0, '773.751')] +[2023-03-06 13:02:39,292][1854170] Updated weights for policy 0, policy_version 7115 (0.0006) +[2023-03-06 13:02:40,128][1854170] Updated weights for policy 0, policy_version 7125 (0.0009) +[2023-03-06 13:02:40,945][1854170] Updated weights for policy 0, policy_version 7135 (0.0008) +[2023-03-06 13:02:41,769][1854170] Updated weights for policy 0, policy_version 7145 (0.0007) +[2023-03-06 13:02:42,588][1854170] Updated weights for policy 0, policy_version 7155 (0.0007) +[2023-03-06 13:02:43,418][1854170] Updated weights for policy 0, policy_version 7165 (0.0006) +[2023-03-06 13:02:43,701][1853846] Fps is (10 sec: 12390.4, 60 sec: 12424.5, 300 sec: 12458.1). Total num frames: 7340032. Throughput: 0: 12422.6. Samples: 6455167. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) +[2023-03-06 13:02:43,701][1853846] Avg episode reward: [(0, '809.353')] +[2023-03-06 13:02:43,704][1854119] Saving new best policy, reward=809.353! +[2023-03-06 13:02:44,236][1854170] Updated weights for policy 0, policy_version 7175 (0.0007) +[2023-03-06 13:02:45,068][1854170] Updated weights for policy 0, policy_version 7185 (0.0006) +[2023-03-06 13:02:45,890][1854170] Updated weights for policy 0, policy_version 7195 (0.0007) +[2023-03-06 13:02:46,724][1854170] Updated weights for policy 0, policy_version 7205 (0.0006) +[2023-03-06 13:02:47,544][1854170] Updated weights for policy 0, policy_version 7215 (0.0006) +[2023-03-06 13:02:48,370][1854170] Updated weights for policy 0, policy_version 7225 (0.0006) +[2023-03-06 13:02:48,700][1853846] Fps is (10 sec: 12390.5, 60 sec: 12424.6, 300 sec: 12454.6). Total num frames: 7401472. Throughput: 0: 12418.9. Samples: 6492403. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) +[2023-03-06 13:02:48,701][1853846] Avg episode reward: [(0, '809.628')] +[2023-03-06 13:02:48,708][1854119] Saving new best policy, reward=809.628! +[2023-03-06 13:02:49,206][1854170] Updated weights for policy 0, policy_version 7235 (0.0006) +[2023-03-06 13:02:50,027][1854170] Updated weights for policy 0, policy_version 7245 (0.0006) +[2023-03-06 13:02:50,849][1854170] Updated weights for policy 0, policy_version 7255 (0.0006) +[2023-03-06 13:02:51,670][1854170] Updated weights for policy 0, policy_version 7265 (0.0006) +[2023-03-06 13:02:52,503][1854170] Updated weights for policy 0, policy_version 7275 (0.0007) +[2023-03-06 13:02:53,339][1854170] Updated weights for policy 0, policy_version 7285 (0.0007) +[2023-03-06 13:02:53,701][1853846] Fps is (10 sec: 12390.4, 60 sec: 12424.5, 300 sec: 12454.6). Total num frames: 7463936. Throughput: 0: 12413.8. Samples: 6566759. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) +[2023-03-06 13:02:53,701][1853846] Avg episode reward: [(0, '776.539')] +[2023-03-06 13:02:54,189][1854170] Updated weights for policy 0, policy_version 7295 (0.0006) +[2023-03-06 13:02:55,008][1854170] Updated weights for policy 0, policy_version 7305 (0.0006) +[2023-03-06 13:02:55,841][1854170] Updated weights for policy 0, policy_version 7315 (0.0007) +[2023-03-06 13:02:56,687][1854170] Updated weights for policy 0, policy_version 7325 (0.0007) +[2023-03-06 13:02:57,493][1854170] Updated weights for policy 0, policy_version 7335 (0.0006) +[2023-03-06 13:02:58,330][1854170] Updated weights for policy 0, policy_version 7345 (0.0006) +[2023-03-06 13:02:58,701][1853846] Fps is (10 sec: 12390.3, 60 sec: 12407.5, 300 sec: 12451.1). Total num frames: 7525376. Throughput: 0: 12395.9. Samples: 6640675. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 13:02:58,701][1853846] Avg episode reward: [(0, '781.792')] +[2023-03-06 13:02:59,150][1854170] Updated weights for policy 0, policy_version 7355 (0.0006) +[2023-03-06 13:02:59,975][1854170] Updated weights for policy 0, policy_version 7365 (0.0006) +[2023-03-06 13:03:00,802][1854170] Updated weights for policy 0, policy_version 7375 (0.0007) +[2023-03-06 13:03:01,630][1854170] Updated weights for policy 0, policy_version 7385 (0.0006) +[2023-03-06 13:03:02,461][1854170] Updated weights for policy 0, policy_version 7395 (0.0006) +[2023-03-06 13:03:03,283][1854170] Updated weights for policy 0, policy_version 7405 (0.0007) +[2023-03-06 13:03:03,701][1853846] Fps is (10 sec: 12390.4, 60 sec: 12407.5, 300 sec: 12451.1). Total num frames: 7587840. Throughput: 0: 12395.5. Samples: 6677773. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) +[2023-03-06 13:03:03,701][1853846] Avg episode reward: [(0, '793.148')] +[2023-03-06 13:03:04,116][1854170] Updated weights for policy 0, policy_version 7415 (0.0007) +[2023-03-06 13:03:04,936][1854170] Updated weights for policy 0, policy_version 7425 (0.0006) +[2023-03-06 13:03:05,743][1854170] Updated weights for policy 0, policy_version 7435 (0.0006) +[2023-03-06 13:03:06,586][1854170] Updated weights for policy 0, policy_version 7445 (0.0006) +[2023-03-06 13:03:07,388][1854170] Updated weights for policy 0, policy_version 7455 (0.0007) +[2023-03-06 13:03:08,207][1854170] Updated weights for policy 0, policy_version 7465 (0.0006) +[2023-03-06 13:03:08,701][1853846] Fps is (10 sec: 12390.4, 60 sec: 12407.5, 300 sec: 12447.7). Total num frames: 7649280. Throughput: 0: 12396.8. Samples: 6752434. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 13:03:08,701][1853846] Avg episode reward: [(0, '790.756')] +[2023-03-06 13:03:09,037][1854170] Updated weights for policy 0, policy_version 7475 (0.0008) +[2023-03-06 13:03:09,857][1854170] Updated weights for policy 0, policy_version 7485 (0.0007) +[2023-03-06 13:03:10,681][1854170] Updated weights for policy 0, policy_version 7495 (0.0007) +[2023-03-06 13:03:11,510][1854170] Updated weights for policy 0, policy_version 7505 (0.0006) +[2023-03-06 13:03:12,342][1854170] Updated weights for policy 0, policy_version 7515 (0.0006) +[2023-03-06 13:03:13,161][1854170] Updated weights for policy 0, policy_version 7525 (0.0006) +[2023-03-06 13:03:13,700][1853846] Fps is (10 sec: 12390.5, 60 sec: 12407.5, 300 sec: 12447.7). Total num frames: 7711744. Throughput: 0: 12405.0. Samples: 6827114. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 13:03:13,701][1853846] Avg episode reward: [(0, '751.684')] +[2023-03-06 13:03:13,988][1854170] Updated weights for policy 0, policy_version 7535 (0.0007) +[2023-03-06 13:03:14,824][1854170] Updated weights for policy 0, policy_version 7545 (0.0007) +[2023-03-06 13:03:15,639][1854170] Updated weights for policy 0, policy_version 7555 (0.0006) +[2023-03-06 13:03:16,447][1854170] Updated weights for policy 0, policy_version 7565 (0.0007) +[2023-03-06 13:03:17,278][1854170] Updated weights for policy 0, policy_version 7575 (0.0007) +[2023-03-06 13:03:18,102][1854170] Updated weights for policy 0, policy_version 7585 (0.0006) +[2023-03-06 13:03:18,700][1853846] Fps is (10 sec: 12492.9, 60 sec: 12407.5, 300 sec: 12447.7). Total num frames: 7774208. Throughput: 0: 12403.0. Samples: 6864414. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) +[2023-03-06 13:03:18,701][1853846] Avg episode reward: [(0, '784.211')] +[2023-03-06 13:03:18,912][1854170] Updated weights for policy 0, policy_version 7595 (0.0006) +[2023-03-06 13:03:19,757][1854170] Updated weights for policy 0, policy_version 7605 (0.0007) +[2023-03-06 13:03:20,595][1854170] Updated weights for policy 0, policy_version 7615 (0.0007) +[2023-03-06 13:03:21,445][1854170] Updated weights for policy 0, policy_version 7625 (0.0006) +[2023-03-06 13:03:22,268][1854170] Updated weights for policy 0, policy_version 7635 (0.0007) +[2023-03-06 13:03:23,077][1854170] Updated weights for policy 0, policy_version 7645 (0.0006) +[2023-03-06 13:03:23,701][1853846] Fps is (10 sec: 12390.3, 60 sec: 12407.5, 300 sec: 12440.7). Total num frames: 7835648. Throughput: 0: 12393.5. Samples: 6938476. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) +[2023-03-06 13:03:23,701][1853846] Avg episode reward: [(0, '737.747')] +[2023-03-06 13:03:23,929][1854170] Updated weights for policy 0, policy_version 7655 (0.0006) +[2023-03-06 13:03:24,731][1854170] Updated weights for policy 0, policy_version 7665 (0.0007) +[2023-03-06 13:03:25,560][1854170] Updated weights for policy 0, policy_version 7675 (0.0006) +[2023-03-06 13:03:26,391][1854170] Updated weights for policy 0, policy_version 7685 (0.0006) +[2023-03-06 13:03:27,212][1854170] Updated weights for policy 0, policy_version 7695 (0.0007) +[2023-03-06 13:03:28,049][1854170] Updated weights for policy 0, policy_version 7705 (0.0006) +[2023-03-06 13:03:28,701][1853846] Fps is (10 sec: 12288.0, 60 sec: 12390.4, 300 sec: 12440.7). Total num frames: 7897088. Throughput: 0: 12393.6. Samples: 7012878. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) +[2023-03-06 13:03:28,701][1853846] Avg episode reward: [(0, '691.066')] +[2023-03-06 13:03:28,872][1854170] Updated weights for policy 0, policy_version 7715 (0.0007) +[2023-03-06 13:03:29,699][1854170] Updated weights for policy 0, policy_version 7725 (0.0007) +[2023-03-06 13:03:30,535][1854170] Updated weights for policy 0, policy_version 7735 (0.0007) +[2023-03-06 13:03:31,352][1854170] Updated weights for policy 0, policy_version 7745 (0.0006) +[2023-03-06 13:03:32,185][1854170] Updated weights for policy 0, policy_version 7755 (0.0006) +[2023-03-06 13:03:33,025][1854170] Updated weights for policy 0, policy_version 7765 (0.0007) +[2023-03-06 13:03:33,700][1853846] Fps is (10 sec: 12390.5, 60 sec: 12390.4, 300 sec: 12440.7). Total num frames: 7959552. Throughput: 0: 12387.1. Samples: 7049822. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) +[2023-03-06 13:03:33,701][1853846] Avg episode reward: [(0, '605.470')] +[2023-03-06 13:03:33,842][1854170] Updated weights for policy 0, policy_version 7775 (0.0007) +[2023-03-06 13:03:34,677][1854170] Updated weights for policy 0, policy_version 7785 (0.0006) +[2023-03-06 13:03:35,502][1854170] Updated weights for policy 0, policy_version 7795 (0.0007) +[2023-03-06 13:03:36,317][1854170] Updated weights for policy 0, policy_version 7805 (0.0006) +[2023-03-06 13:03:37,159][1854170] Updated weights for policy 0, policy_version 7815 (0.0006) +[2023-03-06 13:03:37,969][1854170] Updated weights for policy 0, policy_version 7825 (0.0006) +[2023-03-06 13:03:38,700][1853846] Fps is (10 sec: 12390.4, 60 sec: 12390.4, 300 sec: 12437.3). Total num frames: 8020992. Throughput: 0: 12384.2. Samples: 7124046. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) +[2023-03-06 13:03:38,701][1853846] Avg episode reward: [(0, '688.392')] +[2023-03-06 13:03:38,799][1854170] Updated weights for policy 0, policy_version 7835 (0.0007) +[2023-03-06 13:03:39,648][1854170] Updated weights for policy 0, policy_version 7845 (0.0007) +[2023-03-06 13:03:40,472][1854170] Updated weights for policy 0, policy_version 7855 (0.0006) +[2023-03-06 13:03:41,285][1854170] Updated weights for policy 0, policy_version 7865 (0.0007) +[2023-03-06 13:03:42,126][1854170] Updated weights for policy 0, policy_version 7875 (0.0006) +[2023-03-06 13:03:42,964][1854170] Updated weights for policy 0, policy_version 7885 (0.0006) +[2023-03-06 13:03:43,701][1853846] Fps is (10 sec: 12288.0, 60 sec: 12373.3, 300 sec: 12433.8). Total num frames: 8082432. Throughput: 0: 12387.4. Samples: 7198107. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) +[2023-03-06 13:03:43,701][1853846] Avg episode reward: [(0, '646.532')] +[2023-03-06 13:03:43,781][1854170] Updated weights for policy 0, policy_version 7895 (0.0006) +[2023-03-06 13:03:44,604][1854170] Updated weights for policy 0, policy_version 7905 (0.0006) +[2023-03-06 13:03:45,440][1854170] Updated weights for policy 0, policy_version 7915 (0.0007) +[2023-03-06 13:03:46,271][1854170] Updated weights for policy 0, policy_version 7925 (0.0006) +[2023-03-06 13:03:47,088][1854170] Updated weights for policy 0, policy_version 7935 (0.0007) +[2023-03-06 13:03:47,901][1854170] Updated weights for policy 0, policy_version 7945 (0.0006) +[2023-03-06 13:03:48,701][1853846] Fps is (10 sec: 12390.3, 60 sec: 12390.4, 300 sec: 12433.8). Total num frames: 8144896. Throughput: 0: 12393.5. Samples: 7235479. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) +[2023-03-06 13:03:48,701][1853846] Avg episode reward: [(0, '692.402')] +[2023-03-06 13:03:48,742][1854170] Updated weights for policy 0, policy_version 7955 (0.0006) +[2023-03-06 13:03:49,566][1854170] Updated weights for policy 0, policy_version 7965 (0.0006) +[2023-03-06 13:03:50,381][1854170] Updated weights for policy 0, policy_version 7975 (0.0006) +[2023-03-06 13:03:51,237][1854170] Updated weights for policy 0, policy_version 7985 (0.0006) +[2023-03-06 13:03:52,052][1854170] Updated weights for policy 0, policy_version 7995 (0.0007) +[2023-03-06 13:03:52,870][1854170] Updated weights for policy 0, policy_version 8005 (0.0006) +[2023-03-06 13:03:53,701][1853846] Fps is (10 sec: 12390.4, 60 sec: 12373.3, 300 sec: 12430.3). Total num frames: 8206336. Throughput: 0: 12384.0. Samples: 7309711. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) +[2023-03-06 13:03:53,701][1853846] Avg episode reward: [(0, '642.737')] +[2023-03-06 13:03:53,708][1854119] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000008015_8207360.pth... +[2023-03-06 13:03:53,710][1854170] Updated weights for policy 0, policy_version 8015 (0.0006) +[2023-03-06 13:03:53,742][1854119] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000005102_5224448.pth +[2023-03-06 13:03:54,522][1854170] Updated weights for policy 0, policy_version 8025 (0.0006) +[2023-03-06 13:03:55,352][1854170] Updated weights for policy 0, policy_version 8035 (0.0006) +[2023-03-06 13:03:56,193][1854170] Updated weights for policy 0, policy_version 8045 (0.0006) +[2023-03-06 13:03:56,996][1854170] Updated weights for policy 0, policy_version 8055 (0.0006) +[2023-03-06 13:03:57,818][1854170] Updated weights for policy 0, policy_version 8065 (0.0006) +[2023-03-06 13:03:58,654][1854170] Updated weights for policy 0, policy_version 8075 (0.0007) +[2023-03-06 13:03:58,701][1853846] Fps is (10 sec: 12390.4, 60 sec: 12390.4, 300 sec: 12430.3). Total num frames: 8268800. Throughput: 0: 12376.6. Samples: 7384062. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 13:03:58,701][1853846] Avg episode reward: [(0, '666.435')] +[2023-03-06 13:03:59,474][1854170] Updated weights for policy 0, policy_version 8085 (0.0007) +[2023-03-06 13:04:00,297][1854170] Updated weights for policy 0, policy_version 8095 (0.0006) +[2023-03-06 13:04:01,130][1854170] Updated weights for policy 0, policy_version 8105 (0.0006) +[2023-03-06 13:04:01,957][1854170] Updated weights for policy 0, policy_version 8115 (0.0007) +[2023-03-06 13:04:02,786][1854170] Updated weights for policy 0, policy_version 8125 (0.0006) +[2023-03-06 13:04:03,622][1854170] Updated weights for policy 0, policy_version 8135 (0.0006) +[2023-03-06 13:04:03,701][1853846] Fps is (10 sec: 12390.4, 60 sec: 12373.3, 300 sec: 12426.8). Total num frames: 8330240. Throughput: 0: 12371.2. Samples: 7421119. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) +[2023-03-06 13:04:03,701][1853846] Avg episode reward: [(0, '694.249')] +[2023-03-06 13:04:04,441][1854170] Updated weights for policy 0, policy_version 8145 (0.0006) +[2023-03-06 13:04:05,283][1854170] Updated weights for policy 0, policy_version 8155 (0.0006) +[2023-03-06 13:04:06,127][1854170] Updated weights for policy 0, policy_version 8165 (0.0007) +[2023-03-06 13:04:06,940][1854170] Updated weights for policy 0, policy_version 8175 (0.0006) +[2023-03-06 13:04:07,765][1854170] Updated weights for policy 0, policy_version 8185 (0.0007) +[2023-03-06 13:04:08,595][1854170] Updated weights for policy 0, policy_version 8195 (0.0007) +[2023-03-06 13:04:08,701][1853846] Fps is (10 sec: 12390.4, 60 sec: 12390.4, 300 sec: 12430.3). Total num frames: 8392704. Throughput: 0: 12374.7. Samples: 7495336. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) +[2023-03-06 13:04:08,701][1853846] Avg episode reward: [(0, '724.925')] +[2023-03-06 13:04:09,428][1854170] Updated weights for policy 0, policy_version 8205 (0.0007) +[2023-03-06 13:04:10,253][1854170] Updated weights for policy 0, policy_version 8215 (0.0006) +[2023-03-06 13:04:11,083][1854170] Updated weights for policy 0, policy_version 8225 (0.0007) +[2023-03-06 13:04:11,913][1854170] Updated weights for policy 0, policy_version 8235 (0.0007) +[2023-03-06 13:04:12,751][1854170] Updated weights for policy 0, policy_version 8245 (0.0006) +[2023-03-06 13:04:13,564][1854170] Updated weights for policy 0, policy_version 8255 (0.0006) +[2023-03-06 13:04:13,700][1853846] Fps is (10 sec: 12390.4, 60 sec: 12373.3, 300 sec: 12426.8). Total num frames: 8454144. Throughput: 0: 12370.7. Samples: 7569559. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) +[2023-03-06 13:04:13,701][1853846] Avg episode reward: [(0, '741.363')] +[2023-03-06 13:04:14,396][1854170] Updated weights for policy 0, policy_version 8265 (0.0006) +[2023-03-06 13:04:15,224][1854170] Updated weights for policy 0, policy_version 8275 (0.0006) +[2023-03-06 13:04:16,040][1854170] Updated weights for policy 0, policy_version 8285 (0.0007) +[2023-03-06 13:04:16,889][1854170] Updated weights for policy 0, policy_version 8295 (0.0006) +[2023-03-06 13:04:17,703][1854170] Updated weights for policy 0, policy_version 8305 (0.0007) +[2023-03-06 13:04:18,516][1854170] Updated weights for policy 0, policy_version 8315 (0.0007) +[2023-03-06 13:04:18,701][1853846] Fps is (10 sec: 12390.4, 60 sec: 12373.3, 300 sec: 12426.8). Total num frames: 8516608. Throughput: 0: 12374.2. Samples: 7606660. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 13:04:18,701][1853846] Avg episode reward: [(0, '785.877')] +[2023-03-06 13:04:19,364][1854170] Updated weights for policy 0, policy_version 8325 (0.0007) +[2023-03-06 13:04:20,188][1854170] Updated weights for policy 0, policy_version 8335 (0.0006) +[2023-03-06 13:04:21,013][1854170] Updated weights for policy 0, policy_version 8345 (0.0006) +[2023-03-06 13:04:21,845][1854170] Updated weights for policy 0, policy_version 8355 (0.0006) +[2023-03-06 13:04:22,657][1854170] Updated weights for policy 0, policy_version 8365 (0.0006) +[2023-03-06 13:04:23,478][1854170] Updated weights for policy 0, policy_version 8375 (0.0006) +[2023-03-06 13:04:23,701][1853846] Fps is (10 sec: 12390.3, 60 sec: 12373.3, 300 sec: 12423.4). Total num frames: 8578048. Throughput: 0: 12378.0. Samples: 7681056. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 13:04:23,701][1853846] Avg episode reward: [(0, '822.972')] +[2023-03-06 13:04:23,707][1854119] Saving new best policy, reward=822.972! +[2023-03-06 13:04:24,299][1854170] Updated weights for policy 0, policy_version 8385 (0.0006) +[2023-03-06 13:04:25,132][1854170] Updated weights for policy 0, policy_version 8395 (0.0006) +[2023-03-06 13:04:25,953][1854170] Updated weights for policy 0, policy_version 8405 (0.0007) +[2023-03-06 13:04:26,781][1854170] Updated weights for policy 0, policy_version 8415 (0.0006) +[2023-03-06 13:04:27,599][1854170] Updated weights for policy 0, policy_version 8425 (0.0007) +[2023-03-06 13:04:28,434][1854170] Updated weights for policy 0, policy_version 8435 (0.0007) +[2023-03-06 13:04:28,701][1853846] Fps is (10 sec: 12390.5, 60 sec: 12390.4, 300 sec: 12423.4). Total num frames: 8640512. Throughput: 0: 12385.5. Samples: 7755454. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) +[2023-03-06 13:04:28,701][1853846] Avg episode reward: [(0, '763.006')] +[2023-03-06 13:04:29,253][1854170] Updated weights for policy 0, policy_version 8445 (0.0006) +[2023-03-06 13:04:30,093][1854170] Updated weights for policy 0, policy_version 8455 (0.0006) +[2023-03-06 13:04:30,906][1854170] Updated weights for policy 0, policy_version 8465 (0.0006) +[2023-03-06 13:04:31,735][1854170] Updated weights for policy 0, policy_version 8475 (0.0006) +[2023-03-06 13:04:32,556][1854170] Updated weights for policy 0, policy_version 8485 (0.0006) +[2023-03-06 13:04:33,400][1854170] Updated weights for policy 0, policy_version 8495 (0.0006) +[2023-03-06 13:04:33,700][1853846] Fps is (10 sec: 12390.5, 60 sec: 12373.3, 300 sec: 12423.4). Total num frames: 8701952. Throughput: 0: 12378.9. Samples: 7792527. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) +[2023-03-06 13:04:33,701][1853846] Avg episode reward: [(0, '762.429')] +[2023-03-06 13:04:34,203][1854170] Updated weights for policy 0, policy_version 8505 (0.0007) +[2023-03-06 13:04:35,034][1854170] Updated weights for policy 0, policy_version 8515 (0.0007) +[2023-03-06 13:04:35,859][1854170] Updated weights for policy 0, policy_version 8525 (0.0007) +[2023-03-06 13:04:36,678][1854170] Updated weights for policy 0, policy_version 8535 (0.0007) +[2023-03-06 13:04:37,508][1854170] Updated weights for policy 0, policy_version 8545 (0.0006) +[2023-03-06 13:04:38,312][1854170] Updated weights for policy 0, policy_version 8555 (0.0006) +[2023-03-06 13:04:38,701][1853846] Fps is (10 sec: 12390.4, 60 sec: 12390.4, 300 sec: 12423.4). Total num frames: 8764416. Throughput: 0: 12394.8. Samples: 7867475. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) +[2023-03-06 13:04:38,701][1853846] Avg episode reward: [(0, '740.542')] +[2023-03-06 13:04:39,124][1854170] Updated weights for policy 0, policy_version 8565 (0.0006) +[2023-03-06 13:04:39,973][1854170] Updated weights for policy 0, policy_version 8575 (0.0006) +[2023-03-06 13:04:40,781][1854170] Updated weights for policy 0, policy_version 8585 (0.0006) +[2023-03-06 13:04:41,580][1854170] Updated weights for policy 0, policy_version 8595 (0.0007) +[2023-03-06 13:04:42,417][1854170] Updated weights for policy 0, policy_version 8605 (0.0007) +[2023-03-06 13:04:43,228][1854170] Updated weights for policy 0, policy_version 8615 (0.0006) +[2023-03-06 13:04:43,700][1853846] Fps is (10 sec: 12492.8, 60 sec: 12407.5, 300 sec: 12423.4). Total num frames: 8826880. Throughput: 0: 12407.4. Samples: 7942395. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 13:04:43,701][1853846] Avg episode reward: [(0, '706.017')] +[2023-03-06 13:04:44,042][1854170] Updated weights for policy 0, policy_version 8625 (0.0006) +[2023-03-06 13:04:44,866][1854170] Updated weights for policy 0, policy_version 8635 (0.0007) +[2023-03-06 13:04:45,678][1854170] Updated weights for policy 0, policy_version 8645 (0.0006) +[2023-03-06 13:04:46,514][1854170] Updated weights for policy 0, policy_version 8655 (0.0006) +[2023-03-06 13:04:47,339][1854170] Updated weights for policy 0, policy_version 8665 (0.0006) +[2023-03-06 13:04:48,161][1854170] Updated weights for policy 0, policy_version 8675 (0.0007) +[2023-03-06 13:04:48,700][1853846] Fps is (10 sec: 12492.8, 60 sec: 12407.5, 300 sec: 12423.4). Total num frames: 8889344. Throughput: 0: 12411.1. Samples: 7979620. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 13:04:48,701][1853846] Avg episode reward: [(0, '750.298')] +[2023-03-06 13:04:48,988][1854170] Updated weights for policy 0, policy_version 8685 (0.0007) +[2023-03-06 13:04:49,844][1854170] Updated weights for policy 0, policy_version 8695 (0.0007) +[2023-03-06 13:04:50,661][1854170] Updated weights for policy 0, policy_version 8705 (0.0007) +[2023-03-06 13:04:51,497][1854170] Updated weights for policy 0, policy_version 8715 (0.0006) +[2023-03-06 13:04:52,324][1854170] Updated weights for policy 0, policy_version 8725 (0.0007) +[2023-03-06 13:04:53,161][1854170] Updated weights for policy 0, policy_version 8735 (0.0006) +[2023-03-06 13:04:53,701][1853846] Fps is (10 sec: 12390.3, 60 sec: 12407.5, 300 sec: 12419.9). Total num frames: 8950784. Throughput: 0: 12412.4. Samples: 8053895. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 13:04:53,701][1853846] Avg episode reward: [(0, '719.109')] +[2023-03-06 13:04:53,966][1854170] Updated weights for policy 0, policy_version 8745 (0.0006) +[2023-03-06 13:04:54,783][1854170] Updated weights for policy 0, policy_version 8755 (0.0006) +[2023-03-06 13:04:55,604][1854170] Updated weights for policy 0, policy_version 8765 (0.0007) +[2023-03-06 13:04:56,433][1854170] Updated weights for policy 0, policy_version 8775 (0.0007) +[2023-03-06 13:04:57,289][1854170] Updated weights for policy 0, policy_version 8785 (0.0006) +[2023-03-06 13:04:58,104][1854170] Updated weights for policy 0, policy_version 8795 (0.0007) +[2023-03-06 13:04:58,700][1853846] Fps is (10 sec: 12390.4, 60 sec: 12407.5, 300 sec: 12419.9). Total num frames: 9013248. Throughput: 0: 12413.7. Samples: 8128175. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) +[2023-03-06 13:04:58,701][1853846] Avg episode reward: [(0, '695.350')] +[2023-03-06 13:04:58,928][1854170] Updated weights for policy 0, policy_version 8805 (0.0007) +[2023-03-06 13:04:59,768][1854170] Updated weights for policy 0, policy_version 8815 (0.0006) +[2023-03-06 13:05:00,572][1854170] Updated weights for policy 0, policy_version 8825 (0.0006) +[2023-03-06 13:05:01,393][1854170] Updated weights for policy 0, policy_version 8835 (0.0006) +[2023-03-06 13:05:02,206][1854170] Updated weights for policy 0, policy_version 8845 (0.0006) +[2023-03-06 13:05:03,037][1854170] Updated weights for policy 0, policy_version 8855 (0.0007) +[2023-03-06 13:05:03,700][1853846] Fps is (10 sec: 12492.9, 60 sec: 12424.5, 300 sec: 12419.9). Total num frames: 9075712. Throughput: 0: 12422.1. Samples: 8165655. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) +[2023-03-06 13:05:03,701][1853846] Avg episode reward: [(0, '650.019')] +[2023-03-06 13:05:03,852][1854170] Updated weights for policy 0, policy_version 8865 (0.0006) +[2023-03-06 13:05:04,674][1854170] Updated weights for policy 0, policy_version 8875 (0.0006) +[2023-03-06 13:05:05,500][1854170] Updated weights for policy 0, policy_version 8885 (0.0006) +[2023-03-06 13:05:06,300][1854170] Updated weights for policy 0, policy_version 8895 (0.0006) +[2023-03-06 13:05:07,140][1854170] Updated weights for policy 0, policy_version 8905 (0.0006) +[2023-03-06 13:05:07,969][1854170] Updated weights for policy 0, policy_version 8915 (0.0007) +[2023-03-06 13:05:08,701][1853846] Fps is (10 sec: 12492.7, 60 sec: 12424.5, 300 sec: 12419.9). Total num frames: 9138176. Throughput: 0: 12428.0. Samples: 8240314. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) +[2023-03-06 13:05:08,701][1853846] Avg episode reward: [(0, '668.493')] +[2023-03-06 13:05:08,793][1854170] Updated weights for policy 0, policy_version 8925 (0.0006) +[2023-03-06 13:05:09,608][1854170] Updated weights for policy 0, policy_version 8935 (0.0006) +[2023-03-06 13:05:10,427][1854170] Updated weights for policy 0, policy_version 8945 (0.0007) +[2023-03-06 13:05:11,244][1854170] Updated weights for policy 0, policy_version 8955 (0.0006) +[2023-03-06 13:05:12,058][1854170] Updated weights for policy 0, policy_version 8965 (0.0006) +[2023-03-06 13:05:12,888][1854170] Updated weights for policy 0, policy_version 8975 (0.0006) +[2023-03-06 13:05:13,700][1854170] Updated weights for policy 0, policy_version 8985 (0.0006) +[2023-03-06 13:05:13,701][1853846] Fps is (10 sec: 12492.8, 60 sec: 12441.6, 300 sec: 12419.9). Total num frames: 9200640. Throughput: 0: 12440.6. Samples: 8315279. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 13:05:13,701][1853846] Avg episode reward: [(0, '668.494')] +[2023-03-06 13:05:14,539][1854170] Updated weights for policy 0, policy_version 8995 (0.0006) +[2023-03-06 13:05:15,354][1854170] Updated weights for policy 0, policy_version 9005 (0.0006) +[2023-03-06 13:05:16,180][1854170] Updated weights for policy 0, policy_version 9015 (0.0006) +[2023-03-06 13:05:17,009][1854170] Updated weights for policy 0, policy_version 9025 (0.0007) +[2023-03-06 13:05:17,831][1854170] Updated weights for policy 0, policy_version 9035 (0.0007) +[2023-03-06 13:05:18,641][1854170] Updated weights for policy 0, policy_version 9045 (0.0006) +[2023-03-06 13:05:18,700][1853846] Fps is (10 sec: 12390.4, 60 sec: 12424.5, 300 sec: 12416.4). Total num frames: 9262080. Throughput: 0: 12444.7. Samples: 8352540. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 13:05:18,701][1853846] Avg episode reward: [(0, '676.439')] +[2023-03-06 13:05:19,480][1854170] Updated weights for policy 0, policy_version 9055 (0.0006) +[2023-03-06 13:05:20,297][1854170] Updated weights for policy 0, policy_version 9065 (0.0006) +[2023-03-06 13:05:21,117][1854170] Updated weights for policy 0, policy_version 9075 (0.0007) +[2023-03-06 13:05:21,944][1854170] Updated weights for policy 0, policy_version 9085 (0.0006) +[2023-03-06 13:05:22,763][1854170] Updated weights for policy 0, policy_version 9095 (0.0007) +[2023-03-06 13:05:23,584][1854170] Updated weights for policy 0, policy_version 9105 (0.0007) +[2023-03-06 13:05:23,701][1853846] Fps is (10 sec: 12390.4, 60 sec: 12441.6, 300 sec: 12416.4). Total num frames: 9324544. Throughput: 0: 12437.2. Samples: 8427148. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 13:05:23,701][1853846] Avg episode reward: [(0, '753.667')] +[2023-03-06 13:05:24,411][1854170] Updated weights for policy 0, policy_version 9115 (0.0006) +[2023-03-06 13:05:25,238][1854170] Updated weights for policy 0, policy_version 9125 (0.0006) +[2023-03-06 13:05:26,077][1854170] Updated weights for policy 0, policy_version 9135 (0.0007) +[2023-03-06 13:05:26,886][1854170] Updated weights for policy 0, policy_version 9145 (0.0006) +[2023-03-06 13:05:27,703][1854170] Updated weights for policy 0, policy_version 9155 (0.0006) +[2023-03-06 13:05:28,527][1854170] Updated weights for policy 0, policy_version 9165 (0.0006) +[2023-03-06 13:05:28,700][1853846] Fps is (10 sec: 12492.8, 60 sec: 12441.6, 300 sec: 12416.4). Total num frames: 9387008. Throughput: 0: 12429.3. Samples: 8501712. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 13:05:28,701][1853846] Avg episode reward: [(0, '770.181')] +[2023-03-06 13:05:29,372][1854170] Updated weights for policy 0, policy_version 9175 (0.0007) +[2023-03-06 13:05:30,161][1854170] Updated weights for policy 0, policy_version 9185 (0.0006) +[2023-03-06 13:05:31,010][1854170] Updated weights for policy 0, policy_version 9195 (0.0007) +[2023-03-06 13:05:31,811][1854170] Updated weights for policy 0, policy_version 9205 (0.0007) +[2023-03-06 13:05:32,620][1854170] Updated weights for policy 0, policy_version 9215 (0.0007) +[2023-03-06 13:05:33,462][1854170] Updated weights for policy 0, policy_version 9225 (0.0006) +[2023-03-06 13:05:33,701][1853846] Fps is (10 sec: 12390.4, 60 sec: 12441.6, 300 sec: 12416.4). Total num frames: 9448448. Throughput: 0: 12434.2. Samples: 8539159. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 13:05:33,701][1853846] Avg episode reward: [(0, '756.992')] +[2023-03-06 13:05:34,282][1854170] Updated weights for policy 0, policy_version 9235 (0.0008) +[2023-03-06 13:05:35,116][1854170] Updated weights for policy 0, policy_version 9245 (0.0007) +[2023-03-06 13:05:35,938][1854170] Updated weights for policy 0, policy_version 9255 (0.0007) +[2023-03-06 13:05:36,743][1854170] Updated weights for policy 0, policy_version 9265 (0.0007) +[2023-03-06 13:05:37,571][1854170] Updated weights for policy 0, policy_version 9275 (0.0007) +[2023-03-06 13:05:38,413][1854170] Updated weights for policy 0, policy_version 9285 (0.0007) +[2023-03-06 13:05:38,700][1853846] Fps is (10 sec: 12390.4, 60 sec: 12441.6, 300 sec: 12416.4). Total num frames: 9510912. Throughput: 0: 12443.1. Samples: 8613835. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 13:05:38,701][1853846] Avg episode reward: [(0, '795.097')] +[2023-03-06 13:05:39,228][1854170] Updated weights for policy 0, policy_version 9295 (0.0006) +[2023-03-06 13:05:40,052][1854170] Updated weights for policy 0, policy_version 9305 (0.0007) +[2023-03-06 13:05:40,899][1854170] Updated weights for policy 0, policy_version 9315 (0.0007) +[2023-03-06 13:05:41,704][1854170] Updated weights for policy 0, policy_version 9325 (0.0006) +[2023-03-06 13:05:42,537][1854170] Updated weights for policy 0, policy_version 9335 (0.0007) +[2023-03-06 13:05:43,370][1854170] Updated weights for policy 0, policy_version 9345 (0.0006) +[2023-03-06 13:05:43,701][1853846] Fps is (10 sec: 12492.8, 60 sec: 12441.6, 300 sec: 12416.4). Total num frames: 9573376. Throughput: 0: 12444.0. Samples: 8688156. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 13:05:43,701][1853846] Avg episode reward: [(0, '809.620')] +[2023-03-06 13:05:44,183][1854170] Updated weights for policy 0, policy_version 9355 (0.0006) +[2023-03-06 13:05:45,008][1854170] Updated weights for policy 0, policy_version 9365 (0.0006) +[2023-03-06 13:05:45,830][1854170] Updated weights for policy 0, policy_version 9375 (0.0007) +[2023-03-06 13:05:46,658][1854170] Updated weights for policy 0, policy_version 9385 (0.0006) +[2023-03-06 13:05:47,483][1854170] Updated weights for policy 0, policy_version 9395 (0.0006) +[2023-03-06 13:05:48,326][1854170] Updated weights for policy 0, policy_version 9405 (0.0007) +[2023-03-06 13:05:48,701][1853846] Fps is (10 sec: 12390.3, 60 sec: 12424.5, 300 sec: 12416.4). Total num frames: 9634816. Throughput: 0: 12439.9. Samples: 8725451. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 13:05:48,701][1853846] Avg episode reward: [(0, '769.819')] +[2023-03-06 13:05:49,145][1854170] Updated weights for policy 0, policy_version 9415 (0.0006) +[2023-03-06 13:05:49,965][1854170] Updated weights for policy 0, policy_version 9425 (0.0008) +[2023-03-06 13:05:50,774][1854170] Updated weights for policy 0, policy_version 9435 (0.0007) +[2023-03-06 13:05:51,603][1854170] Updated weights for policy 0, policy_version 9445 (0.0007) +[2023-03-06 13:05:52,427][1854170] Updated weights for policy 0, policy_version 9455 (0.0006) +[2023-03-06 13:05:53,231][1854170] Updated weights for policy 0, policy_version 9465 (0.0006) +[2023-03-06 13:05:53,701][1853846] Fps is (10 sec: 12390.3, 60 sec: 12441.6, 300 sec: 12416.4). Total num frames: 9697280. Throughput: 0: 12440.1. Samples: 8800121. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 13:05:53,701][1853846] Avg episode reward: [(0, '704.446')] +[2023-03-06 13:05:53,717][1854119] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000009471_9698304.pth... +[2023-03-06 13:05:53,749][1854119] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000006562_6719488.pth +[2023-03-06 13:05:54,057][1854170] Updated weights for policy 0, policy_version 9475 (0.0007) +[2023-03-06 13:05:54,906][1854170] Updated weights for policy 0, policy_version 9485 (0.0007) +[2023-03-06 13:05:55,712][1854170] Updated weights for policy 0, policy_version 9495 (0.0007) +[2023-03-06 13:05:56,527][1854170] Updated weights for policy 0, policy_version 9505 (0.0006) +[2023-03-06 13:05:57,345][1854170] Updated weights for policy 0, policy_version 9515 (0.0006) +[2023-03-06 13:05:58,173][1854170] Updated weights for policy 0, policy_version 9525 (0.0007) +[2023-03-06 13:05:58,700][1853846] Fps is (10 sec: 12492.9, 60 sec: 12441.6, 300 sec: 12419.9). Total num frames: 9759744. Throughput: 0: 12437.7. Samples: 8874975. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 13:05:58,701][1853846] Avg episode reward: [(0, '703.278')] +[2023-03-06 13:05:59,002][1854170] Updated weights for policy 0, policy_version 9535 (0.0006) +[2023-03-06 13:05:59,824][1854170] Updated weights for policy 0, policy_version 9545 (0.0006) +[2023-03-06 13:06:00,640][1854170] Updated weights for policy 0, policy_version 9555 (0.0007) +[2023-03-06 13:06:01,478][1854170] Updated weights for policy 0, policy_version 9565 (0.0006) +[2023-03-06 13:06:02,297][1854170] Updated weights for policy 0, policy_version 9575 (0.0007) +[2023-03-06 13:06:03,106][1854170] Updated weights for policy 0, policy_version 9585 (0.0006) +[2023-03-06 13:06:03,701][1853846] Fps is (10 sec: 12492.8, 60 sec: 12441.6, 300 sec: 12419.9). Total num frames: 9822208. Throughput: 0: 12436.3. Samples: 8912174. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 13:06:03,701][1853846] Avg episode reward: [(0, '720.490')] +[2023-03-06 13:06:03,929][1854170] Updated weights for policy 0, policy_version 9595 (0.0006) +[2023-03-06 13:06:04,748][1854170] Updated weights for policy 0, policy_version 9605 (0.0007) +[2023-03-06 13:06:05,580][1854170] Updated weights for policy 0, policy_version 9615 (0.0006) +[2023-03-06 13:06:06,406][1854170] Updated weights for policy 0, policy_version 9625 (0.0006) +[2023-03-06 13:06:07,212][1854170] Updated weights for policy 0, policy_version 9635 (0.0007) +[2023-03-06 13:06:08,042][1854170] Updated weights for policy 0, policy_version 9645 (0.0006) +[2023-03-06 13:06:08,701][1853846] Fps is (10 sec: 12390.3, 60 sec: 12424.5, 300 sec: 12416.4). Total num frames: 9883648. Throughput: 0: 12438.5. Samples: 8986882. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 13:06:08,701][1853846] Avg episode reward: [(0, '658.348')] +[2023-03-06 13:06:08,885][1854170] Updated weights for policy 0, policy_version 9655 (0.0006) +[2023-03-06 13:06:09,717][1854170] Updated weights for policy 0, policy_version 9665 (0.0006) +[2023-03-06 13:06:10,553][1854170] Updated weights for policy 0, policy_version 9675 (0.0007) +[2023-03-06 13:06:11,370][1854170] Updated weights for policy 0, policy_version 9685 (0.0007) +[2023-03-06 13:06:12,186][1854170] Updated weights for policy 0, policy_version 9695 (0.0006) +[2023-03-06 13:06:13,024][1854170] Updated weights for policy 0, policy_version 9705 (0.0007) +[2023-03-06 13:06:13,701][1853846] Fps is (10 sec: 12390.5, 60 sec: 12424.5, 300 sec: 12416.4). Total num frames: 9946112. Throughput: 0: 12429.8. Samples: 9061055. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) +[2023-03-06 13:06:13,701][1853846] Avg episode reward: [(0, '766.409')] +[2023-03-06 13:06:13,851][1854170] Updated weights for policy 0, policy_version 9715 (0.0007) +[2023-03-06 13:06:14,704][1854170] Updated weights for policy 0, policy_version 9725 (0.0007) +[2023-03-06 13:06:15,515][1854170] Updated weights for policy 0, policy_version 9735 (0.0007) +[2023-03-06 13:06:16,338][1854170] Updated weights for policy 0, policy_version 9745 (0.0007) +[2023-03-06 13:06:17,157][1854170] Updated weights for policy 0, policy_version 9755 (0.0006) +[2023-03-06 13:06:17,975][1854170] Updated weights for policy 0, policy_version 9765 (0.0006) +[2023-03-06 13:06:18,147][1854344] Stopping RolloutWorker_w11... +[2023-03-06 13:06:18,147][1854581] Stopping RolloutWorker_w23... +[2023-03-06 13:06:18,147][1854338] Stopping RolloutWorker_w10... +[2023-03-06 13:06:18,147][1854300] Stopping RolloutWorker_w5... +[2023-03-06 13:06:18,147][1854174] Stopping RolloutWorker_w4... +[2023-03-06 13:06:18,146][1854667] Stopping RolloutWorker_w29... +[2023-03-06 13:06:18,147][1854634] Stopping RolloutWorker_w27... +[2023-03-06 13:06:18,147][1854119] Stopping Batcher_0... +[2023-03-06 13:06:18,147][1854301] Stopping RolloutWorker_w3... +[2023-03-06 13:06:18,147][1854275] Stopping RolloutWorker_w8... +[2023-03-06 13:06:18,147][1854342] Stopping RolloutWorker_w6... +[2023-03-06 13:06:18,147][1854344] Loop rollout_proc11_evt_loop terminating... +[2023-03-06 13:06:18,147][1854346] Stopping RolloutWorker_w20... +[2023-03-06 13:06:18,147][1854581] Loop rollout_proc23_evt_loop terminating... +[2023-03-06 13:06:18,147][1854335] Stopping RolloutWorker_w12... +[2023-03-06 13:06:18,147][1854338] Loop rollout_proc10_evt_loop terminating... +[2023-03-06 13:06:18,147][1854633] Stopping RolloutWorker_w26... +[2023-03-06 13:06:18,147][1854336] Stopping RolloutWorker_w7... +[2023-03-06 13:06:18,147][1854174] Loop rollout_proc4_evt_loop terminating... +[2023-03-06 13:06:18,147][1854340] Stopping RolloutWorker_w13... +[2023-03-06 13:06:18,147][1854634] Loop rollout_proc27_evt_loop terminating... +[2023-03-06 13:06:18,147][1854341] Stopping RolloutWorker_w19... +[2023-03-06 13:06:18,147][1854172] Stopping RolloutWorker_w0... +[2023-03-06 13:06:18,147][1854635] Stopping RolloutWorker_w28... +[2023-03-06 13:06:18,147][1854599] Stopping RolloutWorker_w24... +[2023-03-06 13:06:18,147][1854300] Loop rollout_proc5_evt_loop terminating... +[2023-03-06 13:06:18,147][1854667] Loop rollout_proc29_evt_loop terminating... +[2023-03-06 13:06:18,147][1854597] Stopping RolloutWorker_w25... +[2023-03-06 13:06:18,147][1854668] Stopping RolloutWorker_w30... +[2023-03-06 13:06:18,147][1854119] Loop batcher_evt_loop terminating... +[2023-03-06 13:06:18,147][1854301] Loop rollout_proc3_evt_loop terminating... +[2023-03-06 13:06:18,147][1854275] Loop rollout_proc8_evt_loop terminating... +[2023-03-06 13:06:18,147][1854342] Loop rollout_proc6_evt_loop terminating... +[2023-03-06 13:06:18,147][1854346] Loop rollout_proc20_evt_loop terminating... +[2023-03-06 13:06:18,147][1854731] Stopping RolloutWorker_w31... +[2023-03-06 13:06:18,147][1854341] Loop rollout_proc19_evt_loop terminating... +[2023-03-06 13:06:18,147][1854339] Stopping RolloutWorker_w18... +[2023-03-06 13:06:18,147][1854335] Loop rollout_proc12_evt_loop terminating... +[2023-03-06 13:06:18,147][1854172] Loop rollout_proc0_evt_loop terminating... +[2023-03-06 13:06:18,147][1854340] Loop rollout_proc13_evt_loop terminating... +[2023-03-06 13:06:18,147][1854599] Loop rollout_proc24_evt_loop terminating... +[2023-03-06 13:06:18,147][1854441] Stopping RolloutWorker_w22... +[2023-03-06 13:06:18,147][1854635] Loop rollout_proc28_evt_loop terminating... +[2023-03-06 13:06:18,147][1854668] Loop rollout_proc30_evt_loop terminating... +[2023-03-06 13:06:18,147][1854597] Loop rollout_proc25_evt_loop terminating... +[2023-03-06 13:06:18,147][1854731] Loop rollout_proc31_evt_loop terminating... +[2023-03-06 13:06:18,147][1854339] Loop rollout_proc18_evt_loop terminating... +[2023-03-06 13:06:18,147][1854334] Stopping RolloutWorker_w9... +[2023-03-06 13:06:18,148][1854441] Loop rollout_proc22_evt_loop terminating... +[2023-03-06 13:06:18,147][1854322] Stopping RolloutWorker_w14... +[2023-03-06 13:06:18,147][1853846] Component RolloutWorker_w29 stopped! +[2023-03-06 13:06:18,148][1854334] Loop rollout_proc9_evt_loop terminating... +[2023-03-06 13:06:18,148][1854345] Stopping RolloutWorker_w17... +[2023-03-06 13:06:18,148][1854119] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000009767_10001408.pth... +[2023-03-06 13:06:18,148][1854343] Stopping RolloutWorker_w15... +[2023-03-06 13:06:18,148][1854337] Stopping RolloutWorker_w16... +[2023-03-06 13:06:18,148][1854322] Loop rollout_proc14_evt_loop terminating... +[2023-03-06 13:06:18,148][1854345] Loop rollout_proc17_evt_loop terminating... +[2023-03-06 13:06:18,148][1853846] Component RolloutWorker_w11 stopped! +[2023-03-06 13:06:18,148][1854343] Loop rollout_proc15_evt_loop terminating... +[2023-03-06 13:06:18,148][1854337] Loop rollout_proc16_evt_loop terminating... +[2023-03-06 13:06:18,148][1853846] Component RolloutWorker_w23 stopped! +[2023-03-06 13:06:18,148][1853846] Component RolloutWorker_w5 stopped! +[2023-03-06 13:06:18,149][1853846] Component RolloutWorker_w10 stopped! +[2023-03-06 13:06:18,149][1853846] Component RolloutWorker_w4 stopped! +[2023-03-06 13:06:18,149][1853846] Component RolloutWorker_w27 stopped! +[2023-03-06 13:06:18,149][1853846] Component Batcher_0 stopped! +[2023-03-06 13:06:18,150][1853846] Component RolloutWorker_w28 stopped! +[2023-03-06 13:06:18,150][1853846] Component RolloutWorker_w3 stopped! +[2023-03-06 13:06:18,150][1853846] Component RolloutWorker_w6 stopped! +[2023-03-06 13:06:18,150][1853846] Component RolloutWorker_w8 stopped! +[2023-03-06 13:06:18,150][1853846] Component RolloutWorker_w12 stopped! +[2023-03-06 13:06:18,150][1854173] Stopping RolloutWorker_w2... +[2023-03-06 13:06:18,151][1853846] Component RolloutWorker_w20 stopped! +[2023-03-06 13:06:18,151][1854173] Loop rollout_proc2_evt_loop terminating... +[2023-03-06 13:06:18,151][1853846] Component RolloutWorker_w22 stopped! +[2023-03-06 13:06:18,151][1853846] Component RolloutWorker_w26 stopped! +[2023-03-06 13:06:18,151][1853846] Component RolloutWorker_w13 stopped! +[2023-03-06 13:06:18,152][1853846] Component RolloutWorker_w7 stopped! +[2023-03-06 13:06:18,152][1853846] Component RolloutWorker_w19 stopped! +[2023-03-06 13:06:18,152][1853846] Component RolloutWorker_w18 stopped! +[2023-03-06 13:06:18,153][1853846] Component RolloutWorker_w25 stopped! +[2023-03-06 13:06:18,153][1853846] Component RolloutWorker_w0 stopped! +[2023-03-06 13:06:18,153][1853846] Component RolloutWorker_w30 stopped! +[2023-03-06 13:06:18,153][1853846] Component RolloutWorker_w24 stopped! +[2023-03-06 13:06:18,153][1853846] Component RolloutWorker_w31 stopped! +[2023-03-06 13:06:18,154][1853846] Component RolloutWorker_w9 stopped! +[2023-03-06 13:06:18,154][1853846] Component RolloutWorker_w14 stopped! +[2023-03-06 13:06:18,154][1853846] Component RolloutWorker_w17 stopped! +[2023-03-06 13:06:18,154][1853846] Component RolloutWorker_w15 stopped! +[2023-03-06 13:06:18,155][1853846] Component RolloutWorker_w16 stopped! +[2023-03-06 13:06:18,155][1853846] Component RolloutWorker_w2 stopped! +[2023-03-06 13:06:18,155][1853846] Component RolloutWorker_w21 stopped! +[2023-03-06 13:06:18,155][1854504] Stopping RolloutWorker_w21... +[2023-03-06 13:06:18,156][1854504] Loop rollout_proc21_evt_loop terminating... +[2023-03-06 13:06:18,163][1854171] Stopping RolloutWorker_w1... +[2023-03-06 13:06:18,164][1854171] Loop rollout_proc1_evt_loop terminating... +[2023-03-06 13:06:18,166][1853846] Component RolloutWorker_w1 stopped! +[2023-03-06 13:06:18,147][1854336] Loop rollout_proc7_evt_loop terminating... +[2023-03-06 13:06:18,147][1854633] Loop rollout_proc26_evt_loop terminating... +[2023-03-06 13:06:18,215][1854170] Weights refcount: 2 0 +[2023-03-06 13:06:18,217][1854170] Stopping InferenceWorker_p0-w0... +[2023-03-06 13:06:18,218][1854170] Loop inference_proc0-0_evt_loop terminating... +[2023-03-06 13:06:18,218][1853846] Component InferenceWorker_p0-w0 stopped! +[2023-03-06 13:06:18,274][1854119] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000008015_8207360.pth +[2023-03-06 13:06:18,283][1854119] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/assembly-v2/checkpoint_p0/checkpoint_000009767_10001408.pth... +[2023-03-06 13:06:18,359][1854119] Stopping LearnerWorker_p0... +[2023-03-06 13:06:18,360][1854119] Loop learner_proc0_evt_loop terminating... +[2023-03-06 13:06:18,359][1853846] Component LearnerWorker_p0 stopped! +[2023-03-06 13:06:18,360][1853846] Waiting for process learner_proc0 to stop... +[2023-03-06 13:06:19,552][1853846] Waiting for process inference_proc0-0 to join... +[2023-03-06 13:06:19,553][1853846] Waiting for process rollout_proc0 to join... +[2023-03-06 13:06:19,553][1853846] Waiting for process rollout_proc1 to join... +[2023-03-06 13:06:19,553][1853846] Waiting for process rollout_proc2 to join... +[2023-03-06 13:06:19,554][1853846] Waiting for process rollout_proc3 to join... +[2023-03-06 13:06:19,554][1853846] Waiting for process rollout_proc4 to join... +[2023-03-06 13:06:19,555][1853846] Waiting for process rollout_proc5 to join... +[2023-03-06 13:06:19,555][1853846] Waiting for process rollout_proc6 to join... +[2023-03-06 13:06:19,555][1853846] Waiting for process rollout_proc7 to join... +[2023-03-06 13:06:19,556][1853846] Waiting for process rollout_proc8 to join... +[2023-03-06 13:06:19,556][1853846] Waiting for process rollout_proc9 to join... +[2023-03-06 13:06:19,556][1853846] Waiting for process rollout_proc10 to join... +[2023-03-06 13:06:19,557][1853846] Waiting for process rollout_proc11 to join... +[2023-03-06 13:06:19,557][1853846] Waiting for process rollout_proc12 to join... +[2023-03-06 13:06:19,557][1853846] Waiting for process rollout_proc13 to join... +[2023-03-06 13:06:19,558][1853846] Waiting for process rollout_proc14 to join... +[2023-03-06 13:06:19,558][1853846] Waiting for process rollout_proc15 to join... +[2023-03-06 13:06:19,559][1853846] Waiting for process rollout_proc16 to join... +[2023-03-06 13:06:19,559][1853846] Waiting for process rollout_proc17 to join... +[2023-03-06 13:06:19,559][1853846] Waiting for process rollout_proc18 to join... +[2023-03-06 13:06:19,560][1853846] Waiting for process rollout_proc19 to join... +[2023-03-06 13:06:19,560][1853846] Waiting for process rollout_proc20 to join... +[2023-03-06 13:06:19,560][1853846] Waiting for process rollout_proc21 to join... +[2023-03-06 13:06:19,561][1853846] Waiting for process rollout_proc22 to join... +[2023-03-06 13:06:19,561][1853846] Waiting for process rollout_proc23 to join... +[2023-03-06 13:06:19,561][1853846] Waiting for process rollout_proc24 to join... +[2023-03-06 13:06:19,562][1853846] Waiting for process rollout_proc25 to join... +[2023-03-06 13:06:19,562][1853846] Waiting for process rollout_proc26 to join... +[2023-03-06 13:06:19,563][1853846] Waiting for process rollout_proc27 to join... +[2023-03-06 13:06:19,563][1853846] Waiting for process rollout_proc28 to join... +[2023-03-06 13:06:19,563][1853846] Waiting for process rollout_proc29 to join... +[2023-03-06 13:06:19,564][1853846] Waiting for process rollout_proc30 to join... +[2023-03-06 13:06:19,564][1853846] Waiting for process rollout_proc31 to join... +[2023-03-06 13:06:19,564][1853846] Batcher 0 profile tree view: +batching: 73.5971, releasing_batches: 0.1355 +[2023-03-06 13:06:19,565][1853846] InferenceWorker_p0-w0 profile tree view: +wait_policy: 0.0001 + wait_policy_total: 26.0628 +update_model: 13.1989 + weight_update: 0.0007 +one_step: 0.0052 + handle_policy_step: 659.7235 + deserialize: 19.6112, stack: 3.3167, obs_to_device_normalize: 112.8014, forward: 304.7839, send_messages: 126.9188 + prepare_outputs: 66.4959 + to_cpu: 33.0655 +[2023-03-06 13:06:19,565][1853846] Learner 0 profile tree view: +misc: 0.0421, prepare_batch: 36.6257 +train: 90.7562 + epoch_init: 0.0357, minibatch_init: 0.0355, losses_postprocess: 2.7080, kl_divergence: 3.2012, after_optimizer: 1.2951 + calculate_losses: 26.6530 + losses_init: 0.0189, forward_head: 1.4804, bptt_initial: 9.5192, tail: 5.4685, advantages_returns: 0.6752, losses: 2.5860 + bptt: 6.1178 + bptt_forward_core: 5.8966 + update: 54.6037 + clip: 5.0431 +[2023-03-06 13:06:19,565][1853846] RolloutWorker_w0 profile tree view: +wait_for_trajectories: 0.2849, enqueue_policy_requests: 13.8574, env_step: 290.8410, overhead: 11.2040, complete_rollouts: 0.6842 +save_policy_outputs: 15.5832 + split_output_tensors: 7.7051 +[2023-03-06 13:06:19,566][1853846] RolloutWorker_w31 profile tree view: +wait_for_trajectories: 0.2907, enqueue_policy_requests: 13.7370, env_step: 294.3572, overhead: 11.7352, complete_rollouts: 0.7030 +save_policy_outputs: 15.8549 + split_output_tensors: 7.7470 +[2023-03-06 13:06:19,566][1853846] Loop Runner_EvtLoop terminating... +[2023-03-06 13:06:19,567][1853846] Runner profile tree view: +main_loop: 742.4714 +[2023-03-06 13:06:19,567][1853846] Collected {0: 10001408}, FPS: 12277.4