[2023-03-06 16:39:59,690][23556] Saving configuration to /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/config.json... [2023-03-06 16:39:59,703][23556] Rollout worker 0 uses device cpu [2023-03-06 16:39:59,704][23556] Rollout worker 1 uses device cpu [2023-03-06 16:39:59,704][23556] Rollout worker 2 uses device cpu [2023-03-06 16:39:59,704][23556] Rollout worker 3 uses device cpu [2023-03-06 16:39:59,704][23556] Rollout worker 4 uses device cpu [2023-03-06 16:39:59,704][23556] Rollout worker 5 uses device cpu [2023-03-06 16:39:59,704][23556] Rollout worker 6 uses device cpu [2023-03-06 16:39:59,704][23556] Rollout worker 7 uses device cpu [2023-03-06 16:39:59,705][23556] Rollout worker 8 uses device cpu [2023-03-06 16:39:59,705][23556] Rollout worker 9 uses device cpu [2023-03-06 16:39:59,705][23556] Rollout worker 10 uses device cpu [2023-03-06 16:39:59,705][23556] Rollout worker 11 uses device cpu [2023-03-06 16:39:59,705][23556] Rollout worker 12 uses device cpu [2023-03-06 16:39:59,705][23556] Rollout worker 13 uses device cpu [2023-03-06 16:39:59,705][23556] Rollout worker 14 uses device cpu [2023-03-06 16:39:59,705][23556] Rollout worker 15 uses device cpu [2023-03-06 16:39:59,705][23556] Rollout worker 16 uses device cpu [2023-03-06 16:39:59,706][23556] Rollout worker 17 uses device cpu [2023-03-06 16:39:59,706][23556] Rollout worker 18 uses device cpu [2023-03-06 16:39:59,706][23556] Rollout worker 19 uses device cpu [2023-03-06 16:39:59,706][23556] Rollout worker 20 uses device cpu [2023-03-06 16:39:59,706][23556] Rollout worker 21 uses device cpu [2023-03-06 16:39:59,706][23556] Rollout worker 22 uses device cpu [2023-03-06 16:39:59,706][23556] Rollout worker 23 uses device cpu [2023-03-06 16:39:59,706][23556] Rollout worker 24 uses device cpu [2023-03-06 16:39:59,706][23556] Rollout worker 25 uses device cpu [2023-03-06 16:39:59,707][23556] Rollout worker 26 uses device cpu [2023-03-06 16:39:59,707][23556] Rollout worker 27 uses device cpu [2023-03-06 16:39:59,707][23556] Rollout worker 28 uses device cpu [2023-03-06 16:39:59,707][23556] Rollout worker 29 uses device cpu [2023-03-06 16:39:59,707][23556] Rollout worker 30 uses device cpu [2023-03-06 16:39:59,707][23556] Rollout worker 31 uses device cpu [2023-03-06 16:39:59,720][23556] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-03-06 16:39:59,720][23556] InferenceWorker_p0-w0: min num requests: 10 [2023-03-06 16:39:59,803][23556] Starting all processes... [2023-03-06 16:39:59,803][23556] Starting process learner_proc0 [2023-03-06 16:39:59,853][23556] Starting all processes... [2023-03-06 16:39:59,920][23556] Starting process inference_proc0-0 [2023-03-06 16:39:59,920][23556] Starting process rollout_proc0 [2023-03-06 16:39:59,920][23556] Starting process rollout_proc1 [2023-03-06 16:39:59,920][23556] Starting process rollout_proc2 [2023-03-06 16:39:59,920][23556] Starting process rollout_proc3 [2023-03-06 16:39:59,921][23556] Starting process rollout_proc4 [2023-03-06 16:39:59,922][23556] Starting process rollout_proc5 [2023-03-06 16:39:59,923][23556] Starting process rollout_proc6 [2023-03-06 16:39:59,923][23556] Starting process rollout_proc7 [2023-03-06 16:39:59,931][23556] Starting process rollout_proc8 [2023-03-06 16:39:59,931][23556] Starting process rollout_proc9 [2023-03-06 16:39:59,936][23556] Starting process rollout_proc10 [2023-03-06 16:39:59,936][23556] Starting process rollout_proc11 [2023-03-06 16:39:59,936][23556] Starting process rollout_proc12 [2023-03-06 16:39:59,937][23556] Starting process rollout_proc13 [2023-03-06 16:39:59,938][23556] Starting process rollout_proc14 [2023-03-06 16:39:59,939][23556] Starting process rollout_proc15 [2023-03-06 16:39:59,944][23556] Starting process rollout_proc16 [2023-03-06 16:39:59,944][23556] Starting process rollout_proc17 [2023-03-06 16:39:59,945][23556] Starting process rollout_proc18 [2023-03-06 16:39:59,953][23556] Starting process rollout_proc19 [2023-03-06 16:39:59,958][23556] Starting process rollout_proc20 [2023-03-06 16:39:59,966][23556] Starting process rollout_proc21 [2023-03-06 16:40:00,089][23556] Starting process rollout_proc22 [2023-03-06 16:40:00,090][23556] Starting process rollout_proc23 [2023-03-06 16:40:00,103][23556] Starting process rollout_proc24 [2023-03-06 16:40:00,118][23556] Starting process rollout_proc25 [2023-03-06 16:40:00,127][23556] Starting process rollout_proc26 [2023-03-06 16:40:00,127][23556] Starting process rollout_proc27 [2023-03-06 16:40:00,135][23556] Starting process rollout_proc28 [2023-03-06 16:40:00,135][23556] Starting process rollout_proc29 [2023-03-06 16:40:00,143][23556] Starting process rollout_proc30 [2023-03-06 16:40:00,153][23556] Starting process rollout_proc31 [2023-03-06 16:40:01,779][23831] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-03-06 16:40:01,779][23831] Set environment var CUDA_VISIBLE_DEVICES to '0' (GPU indices [0]) for learning process 0 [2023-03-06 16:40:01,789][23831] Num visible devices: 1 [2023-03-06 16:40:01,829][23831] WARNING! It is generally recommended to enable Fixed KL loss (https://arxiv.org/pdf/1707.06347.pdf) for continuous action tasks to avoid potential numerical issues. I.e. set --kl_loss_coeff=0.1 [2023-03-06 16:40:01,829][23831] Starting seed is not provided [2023-03-06 16:40:01,829][23831] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-03-06 16:40:01,830][23831] Initializing actor-critic model on device cuda:0 [2023-03-06 16:40:01,830][23831] RunningMeanStd input shape: (39,) [2023-03-06 16:40:01,830][23831] RunningMeanStd input shape: (1,) [2023-03-06 16:40:01,928][23831] Created Actor Critic model with architecture: [2023-03-06 16:40:01,928][23831] ActorCriticSharedWeights( (obs_normalizer): ObservationNormalizer( (running_mean_std): RunningMeanStdDictInPlace( (running_mean_std): ModuleDict( (obs): RunningMeanStdInPlace() ) ) ) (returns_normalizer): RecursiveScriptModule(original_name=RunningMeanStdInPlace) (encoder): MultiInputEncoder( (encoders): ModuleDict( (obs): MlpEncoder( (mlp_head): RecursiveScriptModule( original_name=Sequential (0): RecursiveScriptModule(original_name=Linear) (1): RecursiveScriptModule(original_name=ELU) (2): RecursiveScriptModule(original_name=Linear) (3): RecursiveScriptModule(original_name=ELU) ) ) ) ) (core): ModelCoreRNN( (core): GRU(512, 512) ) (decoder): MlpDecoder( (mlp): Identity() ) (critic_linear): Linear(in_features=512, out_features=1, bias=True) (action_parameterization): ActionParameterizationDefault( (distribution_linear): Linear(in_features=512, out_features=8, bias=True) ) ) [2023-03-06 16:40:02,027][23884] Worker 1 uses CPU cores [1] [2023-03-06 16:40:02,032][23919] Worker 5 uses CPU cores [5] [2023-03-06 16:40:02,217][24150] Worker 21 uses CPU cores [21] [2023-03-06 16:40:02,363][24297] Worker 31 uses CPU cores [31] [2023-03-06 16:40:02,395][24044] Worker 6 uses CPU cores [6] [2023-03-06 16:40:02,446][24231] Worker 27 uses CPU cores [27] [2023-03-06 16:40:02,638][24147] Worker 11 uses CPU cores [11] [2023-03-06 16:40:02,770][23885] Worker 2 uses CPU cores [2] [2023-03-06 16:40:02,847][24080] Worker 20 uses CPU cores [20] [2023-03-06 16:40:02,948][24153] Worker 9 uses CPU cores [9] [2023-03-06 16:40:03,027][24296] Worker 29 uses CPU cores [29] [2023-03-06 16:40:03,104][24263] Worker 28 uses CPU cores [28] [2023-03-06 16:40:03,211][24149] Worker 16 uses CPU cores [16] [2023-03-06 16:40:03,265][24114] Worker 10 uses CPU cores [10] [2023-03-06 16:40:03,486][24148] Worker 15 uses CPU cores [15] [2023-03-06 16:40:03,495][24198] Worker 26 uses CPU cores [26] [2023-03-06 16:40:03,549][23831] Using optimizer [2023-03-06 16:40:03,550][23831] No checkpoints found [2023-03-06 16:40:03,550][23831] Did not load from checkpoint, starting from scratch! [2023-03-06 16:40:03,550][23831] Initialized policy 0 weights for model version 0 [2023-03-06 16:40:03,553][23831] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-03-06 16:40:03,556][23831] LearnerWorker_p0 finished initialization! [2023-03-06 16:40:03,735][24188] Worker 25 uses CPU cores [25] [2023-03-06 16:40:03,735][24146] Worker 17 uses CPU cores [17] [2023-03-06 16:40:03,797][23886] Worker 3 uses CPU cores [3] [2023-03-06 16:40:03,912][24151] Worker 8 uses CPU cores [8] [2023-03-06 16:40:04,089][24078] Worker 18 uses CPU cores [18] [2023-03-06 16:40:04,160][24264] Worker 30 uses CPU cores [30] [2023-03-06 16:40:04,323][24046] Worker 13 uses CPU cores [13] [2023-03-06 16:40:04,415][23883] Worker 0 uses CPU cores [0] [2023-03-06 16:40:04,536][24197] Worker 24 uses CPU cores [24] [2023-03-06 16:40:04,571][24112] Worker 12 uses CPU cores [12] [2023-03-06 16:40:04,607][24079] Worker 14 uses CPU cores [14] [2023-03-06 16:40:04,804][23882] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-03-06 16:40:04,804][23882] Set environment var CUDA_VISIBLE_DEVICES to '0' (GPU indices [0]) for inference process 0 [2023-03-06 16:40:04,813][23882] Num visible devices: 1 [2023-03-06 16:40:04,887][23882] RunningMeanStd input shape: (39,) [2023-03-06 16:40:04,887][23882] RunningMeanStd input shape: (1,) [2023-03-06 16:40:04,905][24113] Worker 7 uses CPU cores [7] [2023-03-06 16:40:05,048][23887] Worker 4 uses CPU cores [4] [2023-03-06 16:40:05,185][24152] Worker 22 uses CPU cores [22] [2023-03-06 16:40:05,228][24045] Worker 19 uses CPU cores [19] [2023-03-06 16:40:05,393][24186] Worker 23 uses CPU cores [23] [2023-03-06 16:40:05,488][23556] Inference worker 0-0 is ready! [2023-03-06 16:40:05,489][23556] All inference workers are ready! Signal rollout workers to start! [2023-03-06 16:40:06,748][23556] Fps is (10 sec: nan, 60 sec: nan, 300 sec: nan). Total num frames: 0. Throughput: 0: nan. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0) [2023-03-06 16:40:07,096][24113] Decorrelating experience for 0 frames... [2023-03-06 16:40:07,199][23886] Decorrelating experience for 0 frames... [2023-03-06 16:40:07,233][24044] Decorrelating experience for 0 frames... [2023-03-06 16:40:07,272][24197] Decorrelating experience for 0 frames... [2023-03-06 16:40:07,279][24151] Decorrelating experience for 0 frames... [2023-03-06 16:40:07,305][23883] Decorrelating experience for 0 frames... [2023-03-06 16:40:07,308][24149] Decorrelating experience for 0 frames... [2023-03-06 16:40:07,342][24152] Decorrelating experience for 0 frames... [2023-03-06 16:40:07,360][23884] Decorrelating experience for 0 frames... [2023-03-06 16:40:07,360][24146] Decorrelating experience for 0 frames... [2023-03-06 16:40:07,364][24045] Decorrelating experience for 0 frames... [2023-03-06 16:40:07,368][24147] Decorrelating experience for 0 frames... [2023-03-06 16:40:07,373][24231] Decorrelating experience for 0 frames... [2023-03-06 16:40:07,387][23887] Decorrelating experience for 0 frames... [2023-03-06 16:40:07,390][24080] Decorrelating experience for 0 frames... [2023-03-06 16:40:07,397][24148] Decorrelating experience for 0 frames... [2023-03-06 16:40:07,398][23885] Decorrelating experience for 0 frames... [2023-03-06 16:40:07,398][24078] Decorrelating experience for 0 frames... [2023-03-06 16:40:07,398][24150] Decorrelating experience for 0 frames... [2023-03-06 16:40:07,399][24079] Decorrelating experience for 0 frames... [2023-03-06 16:40:07,399][23919] Decorrelating experience for 0 frames... [2023-03-06 16:40:07,399][24264] Decorrelating experience for 0 frames... [2023-03-06 16:40:07,404][24112] Decorrelating experience for 0 frames... [2023-03-06 16:40:07,405][24046] Decorrelating experience for 0 frames... [2023-03-06 16:40:07,405][24296] Decorrelating experience for 0 frames... [2023-03-06 16:40:07,406][24263] Decorrelating experience for 0 frames... [2023-03-06 16:40:07,409][24188] Decorrelating experience for 0 frames... [2023-03-06 16:40:07,410][24198] Decorrelating experience for 0 frames... [2023-03-06 16:40:07,412][24114] Decorrelating experience for 0 frames... [2023-03-06 16:40:07,413][24153] Decorrelating experience for 0 frames... [2023-03-06 16:40:07,423][24186] Decorrelating experience for 0 frames... [2023-03-06 16:40:07,423][24297] Decorrelating experience for 0 frames... [2023-03-06 16:40:08,839][24113] Decorrelating experience for 32 frames... [2023-03-06 16:40:08,975][24044] Decorrelating experience for 32 frames... [2023-03-06 16:40:08,976][23886] Decorrelating experience for 32 frames... [2023-03-06 16:40:09,006][24151] Decorrelating experience for 32 frames... [2023-03-06 16:40:09,036][23883] Decorrelating experience for 32 frames... [2023-03-06 16:40:09,036][24149] Decorrelating experience for 32 frames... [2023-03-06 16:40:09,039][24152] Decorrelating experience for 32 frames... [2023-03-06 16:40:09,043][24186] Decorrelating experience for 32 frames... [2023-03-06 16:40:09,068][24045] Decorrelating experience for 32 frames... [2023-03-06 16:40:09,120][23884] Decorrelating experience for 32 frames... [2023-03-06 16:40:09,122][24146] Decorrelating experience for 32 frames... [2023-03-06 16:40:09,133][24147] Decorrelating experience for 32 frames... [2023-03-06 16:40:09,135][24231] Decorrelating experience for 32 frames... [2023-03-06 16:40:09,144][24197] Decorrelating experience for 32 frames... [2023-03-06 16:40:09,148][24080] Decorrelating experience for 32 frames... [2023-03-06 16:40:09,148][23887] Decorrelating experience for 32 frames... [2023-03-06 16:40:09,168][24114] Decorrelating experience for 32 frames... [2023-03-06 16:40:09,179][24150] Decorrelating experience for 32 frames... [2023-03-06 16:40:09,187][24079] Decorrelating experience for 32 frames... [2023-03-06 16:40:09,194][24264] Decorrelating experience for 32 frames... [2023-03-06 16:40:09,197][23919] Decorrelating experience for 32 frames... [2023-03-06 16:40:09,198][23885] Decorrelating experience for 32 frames... [2023-03-06 16:40:09,200][24078] Decorrelating experience for 32 frames... [2023-03-06 16:40:09,201][24148] Decorrelating experience for 32 frames... [2023-03-06 16:40:09,202][24112] Decorrelating experience for 32 frames... [2023-03-06 16:40:09,203][24297] Decorrelating experience for 32 frames... [2023-03-06 16:40:09,203][24263] Decorrelating experience for 32 frames... [2023-03-06 16:40:09,204][24198] Decorrelating experience for 32 frames... [2023-03-06 16:40:09,205][24188] Decorrelating experience for 32 frames... [2023-03-06 16:40:09,209][24153] Decorrelating experience for 32 frames... [2023-03-06 16:40:09,212][24046] Decorrelating experience for 32 frames... [2023-03-06 16:40:09,213][24296] Decorrelating experience for 32 frames... [2023-03-06 16:40:09,488][23831] Signal inference workers to stop experience collection... [2023-03-06 16:40:09,492][23882] InferenceWorker_p0-w0: stopping experience collection [2023-03-06 16:40:09,858][23831] Signal inference workers to resume experience collection... [2023-03-06 16:40:09,859][23882] InferenceWorker_p0-w0: resuming experience collection [2023-03-06 16:40:11,006][23882] Updated weights for policy 0, policy_version 10 (0.0216) [2023-03-06 16:40:11,748][23556] Fps is (10 sec: 3891.2, 60 sec: 3891.2, 300 sec: 3891.2). Total num frames: 19456. Throughput: 0: 3501.8. Samples: 17509. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-06 16:40:11,809][23882] Updated weights for policy 0, policy_version 20 (0.0007) [2023-03-06 16:40:12,622][23882] Updated weights for policy 0, policy_version 30 (0.0006) [2023-03-06 16:40:13,386][23882] Updated weights for policy 0, policy_version 40 (0.0007) [2023-03-06 16:40:14,152][23882] Updated weights for policy 0, policy_version 50 (0.0006) [2023-03-06 16:40:14,921][23882] Updated weights for policy 0, policy_version 60 (0.0008) [2023-03-06 16:40:15,695][23882] Updated weights for policy 0, policy_version 70 (0.0006) [2023-03-06 16:40:16,458][23882] Updated weights for policy 0, policy_version 80 (0.0006) [2023-03-06 16:40:16,748][23556] Fps is (10 sec: 8499.3, 60 sec: 8499.3, 300 sec: 8499.3). Total num frames: 84992. Throughput: 0: 5706.5. Samples: 57064. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:40:16,748][23556] Avg episode reward: [(0, '6.616')] [2023-03-06 16:40:17,242][23882] Updated weights for policy 0, policy_version 90 (0.0006) [2023-03-06 16:40:18,028][23882] Updated weights for policy 0, policy_version 100 (0.0007) [2023-03-06 16:40:18,794][23882] Updated weights for policy 0, policy_version 110 (0.0006) [2023-03-06 16:40:19,582][23882] Updated weights for policy 0, policy_version 120 (0.0007) [2023-03-06 16:40:19,716][23556] Heartbeat connected on Batcher_0 [2023-03-06 16:40:19,723][23556] Heartbeat connected on RolloutWorker_w0 [2023-03-06 16:40:19,724][23556] Heartbeat connected on InferenceWorker_p0-w0 [2023-03-06 16:40:19,725][23556] Heartbeat connected on RolloutWorker_w1 [2023-03-06 16:40:19,728][23556] Heartbeat connected on RolloutWorker_w3 [2023-03-06 16:40:19,728][23556] Heartbeat connected on RolloutWorker_w2 [2023-03-06 16:40:19,730][23556] Heartbeat connected on LearnerWorker_p0 [2023-03-06 16:40:19,730][23556] Heartbeat connected on RolloutWorker_w4 [2023-03-06 16:40:19,732][23556] Heartbeat connected on RolloutWorker_w5 [2023-03-06 16:40:19,737][23556] Heartbeat connected on RolloutWorker_w8 [2023-03-06 16:40:19,739][23556] Heartbeat connected on RolloutWorker_w9 [2023-03-06 16:40:19,741][23556] Heartbeat connected on RolloutWorker_w10 [2023-03-06 16:40:19,743][23556] Heartbeat connected on RolloutWorker_w6 [2023-03-06 16:40:19,753][23556] Heartbeat connected on RolloutWorker_w7 [2023-03-06 16:40:19,765][23556] Heartbeat connected on RolloutWorker_w11 [2023-03-06 16:40:19,767][23556] Heartbeat connected on RolloutWorker_w12 [2023-03-06 16:40:19,770][23556] Heartbeat connected on RolloutWorker_w13 [2023-03-06 16:40:19,771][23556] Heartbeat connected on RolloutWorker_w14 [2023-03-06 16:40:19,773][23556] Heartbeat connected on RolloutWorker_w15 [2023-03-06 16:40:19,776][23556] Heartbeat connected on RolloutWorker_w17 [2023-03-06 16:40:19,778][23556] Heartbeat connected on RolloutWorker_w18 [2023-03-06 16:40:19,780][23556] Heartbeat connected on RolloutWorker_w19 [2023-03-06 16:40:19,781][23556] Heartbeat connected on RolloutWorker_w20 [2023-03-06 16:40:19,783][23556] Heartbeat connected on RolloutWorker_w21 [2023-03-06 16:40:19,786][23556] Heartbeat connected on RolloutWorker_w22 [2023-03-06 16:40:19,792][23556] Heartbeat connected on RolloutWorker_w24 [2023-03-06 16:40:19,792][23556] Heartbeat connected on RolloutWorker_w26 [2023-03-06 16:40:19,793][23556] Heartbeat connected on RolloutWorker_w23 [2023-03-06 16:40:19,794][23556] Heartbeat connected on RolloutWorker_w27 [2023-03-06 16:40:19,795][23556] Heartbeat connected on RolloutWorker_w16 [2023-03-06 16:40:19,796][23556] Heartbeat connected on RolloutWorker_w28 [2023-03-06 16:40:19,797][23556] Heartbeat connected on RolloutWorker_w29 [2023-03-06 16:40:19,799][23556] Heartbeat connected on RolloutWorker_w25 [2023-03-06 16:40:19,799][23556] Heartbeat connected on RolloutWorker_w30 [2023-03-06 16:40:19,801][23556] Heartbeat connected on RolloutWorker_w31 [2023-03-06 16:40:20,373][23882] Updated weights for policy 0, policy_version 130 (0.0007) [2023-03-06 16:40:21,141][23882] Updated weights for policy 0, policy_version 140 (0.0007) [2023-03-06 16:40:21,748][23556] Fps is (10 sec: 13107.4, 60 sec: 10035.3, 300 sec: 10035.3). Total num frames: 150528. Throughput: 0: 9068.9. Samples: 136032. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:40:21,748][23556] Avg episode reward: [(0, '7.581')] [2023-03-06 16:40:21,750][23831] Saving new best policy, reward=7.581! [2023-03-06 16:40:21,909][23882] Updated weights for policy 0, policy_version 150 (0.0006) [2023-03-06 16:40:22,689][23882] Updated weights for policy 0, policy_version 160 (0.0006) [2023-03-06 16:40:23,473][23882] Updated weights for policy 0, policy_version 170 (0.0006) [2023-03-06 16:40:24,241][23882] Updated weights for policy 0, policy_version 180 (0.0006) [2023-03-06 16:40:25,041][23882] Updated weights for policy 0, policy_version 190 (0.0006) [2023-03-06 16:40:25,817][23882] Updated weights for policy 0, policy_version 200 (0.0006) [2023-03-06 16:40:26,590][23882] Updated weights for policy 0, policy_version 210 (0.0006) [2023-03-06 16:40:26,748][23556] Fps is (10 sec: 13107.1, 60 sec: 10803.2, 300 sec: 10803.2). Total num frames: 216064. Throughput: 0: 10759.5. Samples: 215189. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:40:26,748][23556] Avg episode reward: [(0, '9.406')] [2023-03-06 16:40:26,755][23831] Saving new best policy, reward=9.406! [2023-03-06 16:40:27,376][23882] Updated weights for policy 0, policy_version 220 (0.0007) [2023-03-06 16:40:28,144][23882] Updated weights for policy 0, policy_version 230 (0.0007) [2023-03-06 16:40:28,933][23882] Updated weights for policy 0, policy_version 240 (0.0005) [2023-03-06 16:40:29,725][23882] Updated weights for policy 0, policy_version 250 (0.0007) [2023-03-06 16:40:30,502][23882] Updated weights for policy 0, policy_version 260 (0.0006) [2023-03-06 16:40:31,290][23882] Updated weights for policy 0, policy_version 270 (0.0006) [2023-03-06 16:40:31,748][23556] Fps is (10 sec: 13107.0, 60 sec: 11264.0, 300 sec: 11264.0). Total num frames: 281600. Throughput: 0: 10176.1. Samples: 254404. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:40:31,748][23556] Avg episode reward: [(0, '11.116')] [2023-03-06 16:40:31,762][23831] Saving new best policy, reward=11.116! [2023-03-06 16:40:32,091][23882] Updated weights for policy 0, policy_version 280 (0.0006) [2023-03-06 16:40:32,871][23882] Updated weights for policy 0, policy_version 290 (0.0007) [2023-03-06 16:40:33,632][23882] Updated weights for policy 0, policy_version 300 (0.0005) [2023-03-06 16:40:34,419][23882] Updated weights for policy 0, policy_version 310 (0.0006) [2023-03-06 16:40:35,202][23882] Updated weights for policy 0, policy_version 320 (0.0006) [2023-03-06 16:40:35,974][23882] Updated weights for policy 0, policy_version 330 (0.0005) [2023-03-06 16:40:36,748][23556] Fps is (10 sec: 13107.3, 60 sec: 11571.2, 300 sec: 11571.2). Total num frames: 347136. Throughput: 0: 11090.1. Samples: 332702. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 16:40:36,748][23556] Avg episode reward: [(0, '10.683')] [2023-03-06 16:40:36,766][23882] Updated weights for policy 0, policy_version 340 (0.0006) [2023-03-06 16:40:37,550][23882] Updated weights for policy 0, policy_version 350 (0.0007) [2023-03-06 16:40:38,321][23882] Updated weights for policy 0, policy_version 360 (0.0006) [2023-03-06 16:40:39,107][23882] Updated weights for policy 0, policy_version 370 (0.0005) [2023-03-06 16:40:39,922][23882] Updated weights for policy 0, policy_version 380 (0.0006) [2023-03-06 16:40:40,685][23882] Updated weights for policy 0, policy_version 390 (0.0007) [2023-03-06 16:40:41,457][23882] Updated weights for policy 0, policy_version 400 (0.0006) [2023-03-06 16:40:41,748][23556] Fps is (10 sec: 13107.4, 60 sec: 11790.7, 300 sec: 11790.7). Total num frames: 412672. Throughput: 0: 11752.3. Samples: 411329. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:40:41,748][23556] Avg episode reward: [(0, '12.141')] [2023-03-06 16:40:41,749][23831] Saving new best policy, reward=12.141! [2023-03-06 16:40:42,262][23882] Updated weights for policy 0, policy_version 410 (0.0006) [2023-03-06 16:40:43,049][23882] Updated weights for policy 0, policy_version 420 (0.0007) [2023-03-06 16:40:43,830][23882] Updated weights for policy 0, policy_version 430 (0.0006) [2023-03-06 16:40:44,625][23882] Updated weights for policy 0, policy_version 440 (0.0006) [2023-03-06 16:40:45,388][23882] Updated weights for policy 0, policy_version 450 (0.0006) [2023-03-06 16:40:46,183][23882] Updated weights for policy 0, policy_version 460 (0.0006) [2023-03-06 16:40:46,748][23556] Fps is (10 sec: 13107.2, 60 sec: 11955.2, 300 sec: 11955.2). Total num frames: 478208. Throughput: 0: 11251.5. Samples: 450058. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:40:46,748][23556] Avg episode reward: [(0, '13.657')] [2023-03-06 16:40:46,752][23831] Saving new best policy, reward=13.657! [2023-03-06 16:40:46,963][23882] Updated weights for policy 0, policy_version 470 (0.0006) [2023-03-06 16:40:47,732][23882] Updated weights for policy 0, policy_version 480 (0.0006) [2023-03-06 16:40:48,497][23882] Updated weights for policy 0, policy_version 490 (0.0006) [2023-03-06 16:40:49,295][23882] Updated weights for policy 0, policy_version 500 (0.0006) [2023-03-06 16:40:50,070][23882] Updated weights for policy 0, policy_version 510 (0.0007) [2023-03-06 16:40:50,860][23882] Updated weights for policy 0, policy_version 520 (0.0007) [2023-03-06 16:40:51,649][23882] Updated weights for policy 0, policy_version 530 (0.0006) [2023-03-06 16:40:51,748][23556] Fps is (10 sec: 13107.3, 60 sec: 12083.3, 300 sec: 12083.3). Total num frames: 543744. Throughput: 0: 11750.9. Samples: 528786. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-06 16:40:51,748][23556] Avg episode reward: [(0, '13.575')] [2023-03-06 16:40:52,431][23882] Updated weights for policy 0, policy_version 540 (0.0006) [2023-03-06 16:40:53,212][23882] Updated weights for policy 0, policy_version 550 (0.0007) [2023-03-06 16:40:53,998][23882] Updated weights for policy 0, policy_version 560 (0.0007) [2023-03-06 16:40:54,790][23882] Updated weights for policy 0, policy_version 570 (0.0006) [2023-03-06 16:40:55,567][23882] Updated weights for policy 0, policy_version 580 (0.0006) [2023-03-06 16:40:56,358][23882] Updated weights for policy 0, policy_version 590 (0.0006) [2023-03-06 16:40:56,748][23556] Fps is (10 sec: 13107.2, 60 sec: 12185.6, 300 sec: 12185.6). Total num frames: 609280. Throughput: 0: 13106.1. Samples: 607281. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:40:56,748][23556] Avg episode reward: [(0, '13.620')] [2023-03-06 16:40:57,140][23882] Updated weights for policy 0, policy_version 600 (0.0006) [2023-03-06 16:40:57,918][23882] Updated weights for policy 0, policy_version 610 (0.0007) [2023-03-06 16:40:58,704][23882] Updated weights for policy 0, policy_version 620 (0.0007) [2023-03-06 16:40:59,500][23882] Updated weights for policy 0, policy_version 630 (0.0006) [2023-03-06 16:41:00,275][23882] Updated weights for policy 0, policy_version 640 (0.0007) [2023-03-06 16:41:01,062][23882] Updated weights for policy 0, policy_version 650 (0.0007) [2023-03-06 16:41:01,748][23556] Fps is (10 sec: 13004.5, 60 sec: 12250.8, 300 sec: 12250.8). Total num frames: 673792. Throughput: 0: 13096.0. Samples: 646387. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:41:01,748][23556] Avg episode reward: [(0, '13.907')] [2023-03-06 16:41:01,749][23831] Saving new best policy, reward=13.907! [2023-03-06 16:41:01,850][23882] Updated weights for policy 0, policy_version 660 (0.0006) [2023-03-06 16:41:02,632][23882] Updated weights for policy 0, policy_version 670 (0.0007) [2023-03-06 16:41:03,419][23882] Updated weights for policy 0, policy_version 680 (0.0007) [2023-03-06 16:41:04,201][23882] Updated weights for policy 0, policy_version 690 (0.0006) [2023-03-06 16:41:04,969][23882] Updated weights for policy 0, policy_version 700 (0.0006) [2023-03-06 16:41:05,744][23882] Updated weights for policy 0, policy_version 710 (0.0006) [2023-03-06 16:41:06,535][23882] Updated weights for policy 0, policy_version 720 (0.0006) [2023-03-06 16:41:06,748][23556] Fps is (10 sec: 13004.8, 60 sec: 12322.2, 300 sec: 12322.2). Total num frames: 739328. Throughput: 0: 13089.4. Samples: 725054. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:41:06,748][23556] Avg episode reward: [(0, '14.595')] [2023-03-06 16:41:06,753][23831] Saving new best policy, reward=14.595! [2023-03-06 16:41:07,340][23882] Updated weights for policy 0, policy_version 730 (0.0007) [2023-03-06 16:41:08,123][23882] Updated weights for policy 0, policy_version 740 (0.0006) [2023-03-06 16:41:08,914][23882] Updated weights for policy 0, policy_version 750 (0.0007) [2023-03-06 16:41:09,694][23882] Updated weights for policy 0, policy_version 760 (0.0007) [2023-03-06 16:41:10,469][23882] Updated weights for policy 0, policy_version 770 (0.0006) [2023-03-06 16:41:11,272][23882] Updated weights for policy 0, policy_version 780 (0.0007) [2023-03-06 16:41:11,748][23556] Fps is (10 sec: 13107.4, 60 sec: 13090.2, 300 sec: 12382.5). Total num frames: 804864. Throughput: 0: 13059.2. Samples: 802854. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:41:11,748][23556] Avg episode reward: [(0, '14.157')] [2023-03-06 16:41:12,054][23882] Updated weights for policy 0, policy_version 790 (0.0006) [2023-03-06 16:41:12,819][23882] Updated weights for policy 0, policy_version 800 (0.0007) [2023-03-06 16:41:13,581][23882] Updated weights for policy 0, policy_version 810 (0.0005) [2023-03-06 16:41:14,379][23882] Updated weights for policy 0, policy_version 820 (0.0007) [2023-03-06 16:41:15,146][23882] Updated weights for policy 0, policy_version 830 (0.0006) [2023-03-06 16:41:15,929][23882] Updated weights for policy 0, policy_version 840 (0.0006) [2023-03-06 16:41:16,690][23882] Updated weights for policy 0, policy_version 850 (0.0007) [2023-03-06 16:41:16,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13090.1, 300 sec: 12434.3). Total num frames: 870400. Throughput: 0: 13069.2. Samples: 842518. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 16:41:16,748][23556] Avg episode reward: [(0, '14.389')] [2023-03-06 16:41:17,465][23882] Updated weights for policy 0, policy_version 860 (0.0007) [2023-03-06 16:41:18,269][23882] Updated weights for policy 0, policy_version 870 (0.0006) [2023-03-06 16:41:19,066][23882] Updated weights for policy 0, policy_version 880 (0.0006) [2023-03-06 16:41:19,840][23882] Updated weights for policy 0, policy_version 890 (0.0006) [2023-03-06 16:41:20,617][23882] Updated weights for policy 0, policy_version 900 (0.0006) [2023-03-06 16:41:21,397][23882] Updated weights for policy 0, policy_version 910 (0.0007) [2023-03-06 16:41:21,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13090.1, 300 sec: 12479.2). Total num frames: 935936. Throughput: 0: 13078.2. Samples: 921220. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:41:21,748][23556] Avg episode reward: [(0, '12.966')] [2023-03-06 16:41:22,181][23882] Updated weights for policy 0, policy_version 920 (0.0007) [2023-03-06 16:41:22,950][23882] Updated weights for policy 0, policy_version 930 (0.0006) [2023-03-06 16:41:23,751][23882] Updated weights for policy 0, policy_version 940 (0.0006) [2023-03-06 16:41:24,518][23882] Updated weights for policy 0, policy_version 950 (0.0007) [2023-03-06 16:41:25,296][23882] Updated weights for policy 0, policy_version 960 (0.0006) [2023-03-06 16:41:26,081][23882] Updated weights for policy 0, policy_version 970 (0.0006) [2023-03-06 16:41:26,748][23556] Fps is (10 sec: 13107.0, 60 sec: 13090.1, 300 sec: 12518.4). Total num frames: 1001472. Throughput: 0: 13078.7. Samples: 999874. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:41:26,749][23556] Avg episode reward: [(0, '13.848')] [2023-03-06 16:41:26,871][23882] Updated weights for policy 0, policy_version 980 (0.0005) [2023-03-06 16:41:27,647][23882] Updated weights for policy 0, policy_version 990 (0.0007) [2023-03-06 16:41:28,438][23882] Updated weights for policy 0, policy_version 1000 (0.0007) [2023-03-06 16:41:29,215][23882] Updated weights for policy 0, policy_version 1010 (0.0007) [2023-03-06 16:41:29,998][23882] Updated weights for policy 0, policy_version 1020 (0.0006) [2023-03-06 16:41:30,794][23882] Updated weights for policy 0, policy_version 1030 (0.0007) [2023-03-06 16:41:31,571][23882] Updated weights for policy 0, policy_version 1040 (0.0007) [2023-03-06 16:41:31,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13090.2, 300 sec: 12553.1). Total num frames: 1067008. Throughput: 0: 13094.6. Samples: 1039316. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:41:31,748][23556] Avg episode reward: [(0, '13.672')] [2023-03-06 16:41:32,349][23882] Updated weights for policy 0, policy_version 1050 (0.0007) [2023-03-06 16:41:33,157][23882] Updated weights for policy 0, policy_version 1060 (0.0006) [2023-03-06 16:41:33,947][23882] Updated weights for policy 0, policy_version 1070 (0.0007) [2023-03-06 16:41:34,734][23882] Updated weights for policy 0, policy_version 1080 (0.0006) [2023-03-06 16:41:35,541][23882] Updated weights for policy 0, policy_version 1090 (0.0006) [2023-03-06 16:41:36,338][23882] Updated weights for policy 0, policy_version 1100 (0.0007) [2023-03-06 16:41:36,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13073.0, 300 sec: 12572.4). Total num frames: 1131520. Throughput: 0: 13070.5. Samples: 1116962. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:41:36,748][23556] Avg episode reward: [(0, '12.590')] [2023-03-06 16:41:37,107][23882] Updated weights for policy 0, policy_version 1110 (0.0007) [2023-03-06 16:41:37,907][23882] Updated weights for policy 0, policy_version 1120 (0.0007) [2023-03-06 16:41:38,685][23882] Updated weights for policy 0, policy_version 1130 (0.0006) [2023-03-06 16:41:39,478][23882] Updated weights for policy 0, policy_version 1140 (0.0006) [2023-03-06 16:41:40,270][23882] Updated weights for policy 0, policy_version 1150 (0.0006) [2023-03-06 16:41:41,053][23882] Updated weights for policy 0, policy_version 1160 (0.0006) [2023-03-06 16:41:41,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13073.1, 300 sec: 12600.6). Total num frames: 1197056. Throughput: 0: 13059.0. Samples: 1194938. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:41:41,748][23556] Avg episode reward: [(0, '13.294')] [2023-03-06 16:41:41,822][23882] Updated weights for policy 0, policy_version 1170 (0.0006) [2023-03-06 16:41:42,604][23882] Updated weights for policy 0, policy_version 1180 (0.0006) [2023-03-06 16:41:43,401][23882] Updated weights for policy 0, policy_version 1190 (0.0006) [2023-03-06 16:41:44,190][23882] Updated weights for policy 0, policy_version 1200 (0.0006) [2023-03-06 16:41:44,988][23882] Updated weights for policy 0, policy_version 1210 (0.0006) [2023-03-06 16:41:45,770][23882] Updated weights for policy 0, policy_version 1220 (0.0006) [2023-03-06 16:41:46,543][23882] Updated weights for policy 0, policy_version 1230 (0.0006) [2023-03-06 16:41:46,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 12615.7). Total num frames: 1261568. Throughput: 0: 13060.0. Samples: 1234088. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:41:46,748][23556] Avg episode reward: [(0, '14.261')] [2023-03-06 16:41:47,323][23882] Updated weights for policy 0, policy_version 1240 (0.0006) [2023-03-06 16:41:48,130][23882] Updated weights for policy 0, policy_version 1250 (0.0007) [2023-03-06 16:41:48,915][23882] Updated weights for policy 0, policy_version 1260 (0.0007) [2023-03-06 16:41:49,682][23882] Updated weights for policy 0, policy_version 1270 (0.0007) [2023-03-06 16:41:50,469][23882] Updated weights for policy 0, policy_version 1280 (0.0006) [2023-03-06 16:41:51,269][23882] Updated weights for policy 0, policy_version 1290 (0.0006) [2023-03-06 16:41:51,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 12639.1). Total num frames: 1327104. Throughput: 0: 13051.3. Samples: 1312365. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:41:51,748][23556] Avg episode reward: [(0, '15.415')] [2023-03-06 16:41:51,749][23831] Saving new best policy, reward=15.415! [2023-03-06 16:41:52,029][23882] Updated weights for policy 0, policy_version 1300 (0.0006) [2023-03-06 16:41:52,841][23882] Updated weights for policy 0, policy_version 1310 (0.0007) [2023-03-06 16:41:53,621][23882] Updated weights for policy 0, policy_version 1320 (0.0007) [2023-03-06 16:41:54,408][23882] Updated weights for policy 0, policy_version 1330 (0.0006) [2023-03-06 16:41:55,214][23882] Updated weights for policy 0, policy_version 1340 (0.0007) [2023-03-06 16:41:55,997][23882] Updated weights for policy 0, policy_version 1350 (0.0006) [2023-03-06 16:41:56,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 12651.1). Total num frames: 1391616. Throughput: 0: 13052.9. Samples: 1390237. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:41:56,748][23556] Avg episode reward: [(0, '16.721')] [2023-03-06 16:41:56,752][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000001359_1391616.pth... [2023-03-06 16:41:56,784][23831] Saving new best policy, reward=16.721! [2023-03-06 16:41:56,839][23882] Updated weights for policy 0, policy_version 1360 (0.0007) [2023-03-06 16:41:57,576][23882] Updated weights for policy 0, policy_version 1370 (0.0005) [2023-03-06 16:41:58,347][23882] Updated weights for policy 0, policy_version 1380 (0.0006) [2023-03-06 16:41:59,129][23882] Updated weights for policy 0, policy_version 1390 (0.0006) [2023-03-06 16:41:59,911][23882] Updated weights for policy 0, policy_version 1400 (0.0006) [2023-03-06 16:42:00,711][23882] Updated weights for policy 0, policy_version 1410 (0.0006) [2023-03-06 16:42:01,474][23882] Updated weights for policy 0, policy_version 1420 (0.0006) [2023-03-06 16:42:01,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 12670.9). Total num frames: 1457152. Throughput: 0: 13045.1. Samples: 1429547. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:42:01,748][23556] Avg episode reward: [(0, '18.038')] [2023-03-06 16:42:01,749][23831] Saving new best policy, reward=18.038! [2023-03-06 16:42:02,254][23882] Updated weights for policy 0, policy_version 1430 (0.0006) [2023-03-06 16:42:03,052][23882] Updated weights for policy 0, policy_version 1440 (0.0007) [2023-03-06 16:42:03,837][23882] Updated weights for policy 0, policy_version 1450 (0.0005) [2023-03-06 16:42:04,629][23882] Updated weights for policy 0, policy_version 1460 (0.0006) [2023-03-06 16:42:05,399][23882] Updated weights for policy 0, policy_version 1470 (0.0006) [2023-03-06 16:42:06,181][23882] Updated weights for policy 0, policy_version 1480 (0.0006) [2023-03-06 16:42:06,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 12689.1). Total num frames: 1522688. Throughput: 0: 13033.9. Samples: 1507747. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:42:06,748][23556] Avg episode reward: [(0, '18.058')] [2023-03-06 16:42:06,752][23831] Saving new best policy, reward=18.058! [2023-03-06 16:42:06,983][23882] Updated weights for policy 0, policy_version 1490 (0.0006) [2023-03-06 16:42:07,779][23882] Updated weights for policy 0, policy_version 1500 (0.0007) [2023-03-06 16:42:08,555][23882] Updated weights for policy 0, policy_version 1510 (0.0006) [2023-03-06 16:42:09,346][23882] Updated weights for policy 0, policy_version 1520 (0.0006) [2023-03-06 16:42:10,121][23882] Updated weights for policy 0, policy_version 1530 (0.0006) [2023-03-06 16:42:10,884][23882] Updated weights for policy 0, policy_version 1540 (0.0007) [2023-03-06 16:42:11,673][23882] Updated weights for policy 0, policy_version 1550 (0.0007) [2023-03-06 16:42:11,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 12697.6). Total num frames: 1587200. Throughput: 0: 13025.6. Samples: 1586022. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:42:11,748][23556] Avg episode reward: [(0, '19.995')] [2023-03-06 16:42:11,752][23831] Saving new best policy, reward=19.995! [2023-03-06 16:42:12,477][23882] Updated weights for policy 0, policy_version 1560 (0.0006) [2023-03-06 16:42:13,262][23882] Updated weights for policy 0, policy_version 1570 (0.0005) [2023-03-06 16:42:14,040][23882] Updated weights for policy 0, policy_version 1580 (0.0006) [2023-03-06 16:42:14,825][23882] Updated weights for policy 0, policy_version 1590 (0.0006) [2023-03-06 16:42:15,593][23882] Updated weights for policy 0, policy_version 1600 (0.0007) [2023-03-06 16:42:16,390][23882] Updated weights for policy 0, policy_version 1610 (0.0006) [2023-03-06 16:42:16,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 12713.4). Total num frames: 1652736. Throughput: 0: 13015.5. Samples: 1625014. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:42:16,749][23556] Avg episode reward: [(0, '20.879')] [2023-03-06 16:42:16,753][23831] Saving new best policy, reward=20.879! [2023-03-06 16:42:17,176][23882] Updated weights for policy 0, policy_version 1620 (0.0007) [2023-03-06 16:42:17,963][23882] Updated weights for policy 0, policy_version 1630 (0.0007) [2023-03-06 16:42:18,743][23882] Updated weights for policy 0, policy_version 1640 (0.0007) [2023-03-06 16:42:19,535][23882] Updated weights for policy 0, policy_version 1650 (0.0006) [2023-03-06 16:42:20,318][23882] Updated weights for policy 0, policy_version 1660 (0.0007) [2023-03-06 16:42:21,105][23882] Updated weights for policy 0, policy_version 1670 (0.0007) [2023-03-06 16:42:21,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 12728.0). Total num frames: 1718272. Throughput: 0: 13029.4. Samples: 1703285. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:42:21,748][23556] Avg episode reward: [(0, '21.094')] [2023-03-06 16:42:21,749][23831] Saving new best policy, reward=21.094! [2023-03-06 16:42:21,898][23882] Updated weights for policy 0, policy_version 1680 (0.0007) [2023-03-06 16:42:22,694][23882] Updated weights for policy 0, policy_version 1690 (0.0006) [2023-03-06 16:42:23,481][23882] Updated weights for policy 0, policy_version 1700 (0.0006) [2023-03-06 16:42:24,280][23882] Updated weights for policy 0, policy_version 1710 (0.0006) [2023-03-06 16:42:25,052][23882] Updated weights for policy 0, policy_version 1720 (0.0006) [2023-03-06 16:42:25,833][23882] Updated weights for policy 0, policy_version 1730 (0.0006) [2023-03-06 16:42:26,629][23882] Updated weights for policy 0, policy_version 1740 (0.0006) [2023-03-06 16:42:26,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 12734.2). Total num frames: 1782784. Throughput: 0: 13030.4. Samples: 1781308. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:42:26,748][23556] Avg episode reward: [(0, '24.495')] [2023-03-06 16:42:26,753][23831] Saving new best policy, reward=24.495! [2023-03-06 16:42:27,405][23882] Updated weights for policy 0, policy_version 1750 (0.0006) [2023-03-06 16:42:28,200][23882] Updated weights for policy 0, policy_version 1760 (0.0006) [2023-03-06 16:42:28,976][23882] Updated weights for policy 0, policy_version 1770 (0.0006) [2023-03-06 16:42:29,770][23882] Updated weights for policy 0, policy_version 1780 (0.0006) [2023-03-06 16:42:30,558][23882] Updated weights for policy 0, policy_version 1790 (0.0006) [2023-03-06 16:42:31,354][23882] Updated weights for policy 0, policy_version 1800 (0.0007) [2023-03-06 16:42:31,748][23556] Fps is (10 sec: 12902.4, 60 sec: 13004.8, 300 sec: 12740.0). Total num frames: 1847296. Throughput: 0: 13025.8. Samples: 1820248. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:42:31,748][23556] Avg episode reward: [(0, '25.433')] [2023-03-06 16:42:31,752][23831] Saving new best policy, reward=25.433! [2023-03-06 16:42:32,118][23882] Updated weights for policy 0, policy_version 1810 (0.0007) [2023-03-06 16:42:32,926][23882] Updated weights for policy 0, policy_version 1820 (0.0008) [2023-03-06 16:42:33,696][23882] Updated weights for policy 0, policy_version 1830 (0.0006) [2023-03-06 16:42:34,474][23882] Updated weights for policy 0, policy_version 1840 (0.0006) [2023-03-06 16:42:35,261][23882] Updated weights for policy 0, policy_version 1850 (0.0006) [2023-03-06 16:42:36,045][23882] Updated weights for policy 0, policy_version 1860 (0.0006) [2023-03-06 16:42:36,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 12752.2). Total num frames: 1912832. Throughput: 0: 13027.5. Samples: 1898603. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:42:36,748][23556] Avg episode reward: [(0, '28.043')] [2023-03-06 16:42:36,751][23831] Saving new best policy, reward=28.043! [2023-03-06 16:42:36,842][23882] Updated weights for policy 0, policy_version 1870 (0.0007) [2023-03-06 16:42:37,633][23882] Updated weights for policy 0, policy_version 1880 (0.0007) [2023-03-06 16:42:38,417][23882] Updated weights for policy 0, policy_version 1890 (0.0006) [2023-03-06 16:42:39,203][23882] Updated weights for policy 0, policy_version 1900 (0.0006) [2023-03-06 16:42:39,980][23882] Updated weights for policy 0, policy_version 1910 (0.0007) [2023-03-06 16:42:40,772][23882] Updated weights for policy 0, policy_version 1920 (0.0007) [2023-03-06 16:42:41,566][23882] Updated weights for policy 0, policy_version 1930 (0.0007) [2023-03-06 16:42:41,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13021.9, 300 sec: 12763.7). Total num frames: 1978368. Throughput: 0: 13030.2. Samples: 1976595. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:42:41,748][23556] Avg episode reward: [(0, '27.225')] [2023-03-06 16:42:42,353][23882] Updated weights for policy 0, policy_version 1940 (0.0007) [2023-03-06 16:42:43,131][23882] Updated weights for policy 0, policy_version 1950 (0.0006) [2023-03-06 16:42:43,922][23882] Updated weights for policy 0, policy_version 1960 (0.0006) [2023-03-06 16:42:44,692][23882] Updated weights for policy 0, policy_version 1970 (0.0006) [2023-03-06 16:42:45,478][23882] Updated weights for policy 0, policy_version 1980 (0.0006) [2023-03-06 16:42:46,267][23882] Updated weights for policy 0, policy_version 1990 (0.0007) [2023-03-06 16:42:46,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 12774.4). Total num frames: 2043904. Throughput: 0: 13031.0. Samples: 2015943. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:42:46,748][23556] Avg episode reward: [(0, '27.384')] [2023-03-06 16:42:47,060][23882] Updated weights for policy 0, policy_version 2000 (0.0006) [2023-03-06 16:42:47,833][23882] Updated weights for policy 0, policy_version 2010 (0.0007) [2023-03-06 16:42:48,627][23882] Updated weights for policy 0, policy_version 2020 (0.0007) [2023-03-06 16:42:49,397][23882] Updated weights for policy 0, policy_version 2030 (0.0006) [2023-03-06 16:42:50,193][23882] Updated weights for policy 0, policy_version 2040 (0.0006) [2023-03-06 16:42:50,971][23882] Updated weights for policy 0, policy_version 2050 (0.0006) [2023-03-06 16:42:51,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 12778.3). Total num frames: 2108416. Throughput: 0: 13030.7. Samples: 2094129. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:42:51,748][23556] Avg episode reward: [(0, '26.886')] [2023-03-06 16:42:51,762][23882] Updated weights for policy 0, policy_version 2060 (0.0006) [2023-03-06 16:42:52,535][23882] Updated weights for policy 0, policy_version 2070 (0.0006) [2023-03-06 16:42:53,324][23882] Updated weights for policy 0, policy_version 2080 (0.0006) [2023-03-06 16:42:54,120][23882] Updated weights for policy 0, policy_version 2090 (0.0005) [2023-03-06 16:42:54,911][23882] Updated weights for policy 0, policy_version 2100 (0.0007) [2023-03-06 16:42:55,691][23882] Updated weights for policy 0, policy_version 2110 (0.0006) [2023-03-06 16:42:56,472][23882] Updated weights for policy 0, policy_version 2120 (0.0006) [2023-03-06 16:42:56,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 12788.0). Total num frames: 2173952. Throughput: 0: 13029.2. Samples: 2172339. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:42:56,748][23556] Avg episode reward: [(0, '25.668')] [2023-03-06 16:42:57,249][23882] Updated weights for policy 0, policy_version 2130 (0.0006) [2023-03-06 16:42:58,041][23882] Updated weights for policy 0, policy_version 2140 (0.0006) [2023-03-06 16:42:58,821][23882] Updated weights for policy 0, policy_version 2150 (0.0007) [2023-03-06 16:42:59,613][23882] Updated weights for policy 0, policy_version 2160 (0.0007) [2023-03-06 16:43:00,383][23882] Updated weights for policy 0, policy_version 2170 (0.0007) [2023-03-06 16:43:01,152][23882] Updated weights for policy 0, policy_version 2180 (0.0007) [2023-03-06 16:43:01,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 12797.1). Total num frames: 2239488. Throughput: 0: 13032.1. Samples: 2211461. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:43:01,748][23556] Avg episode reward: [(0, '27.748')] [2023-03-06 16:43:01,943][23882] Updated weights for policy 0, policy_version 2190 (0.0006) [2023-03-06 16:43:02,709][23882] Updated weights for policy 0, policy_version 2200 (0.0006) [2023-03-06 16:43:03,507][23882] Updated weights for policy 0, policy_version 2210 (0.0006) [2023-03-06 16:43:04,286][23882] Updated weights for policy 0, policy_version 2220 (0.0007) [2023-03-06 16:43:05,071][23882] Updated weights for policy 0, policy_version 2230 (0.0006) [2023-03-06 16:43:05,858][23882] Updated weights for policy 0, policy_version 2240 (0.0006) [2023-03-06 16:43:06,648][23882] Updated weights for policy 0, policy_version 2250 (0.0006) [2023-03-06 16:43:06,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13038.9, 300 sec: 12805.7). Total num frames: 2305024. Throughput: 0: 13042.5. Samples: 2290197. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:43:06,748][23556] Avg episode reward: [(0, '31.733')] [2023-03-06 16:43:06,753][23831] Saving new best policy, reward=31.733! [2023-03-06 16:43:07,430][23882] Updated weights for policy 0, policy_version 2260 (0.0007) [2023-03-06 16:43:08,202][23882] Updated weights for policy 0, policy_version 2270 (0.0005) [2023-03-06 16:43:09,015][23882] Updated weights for policy 0, policy_version 2280 (0.0007) [2023-03-06 16:43:09,786][23882] Updated weights for policy 0, policy_version 2290 (0.0006) [2023-03-06 16:43:10,572][23882] Updated weights for policy 0, policy_version 2300 (0.0006) [2023-03-06 16:43:11,363][23882] Updated weights for policy 0, policy_version 2310 (0.0007) [2023-03-06 16:43:11,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 12813.8). Total num frames: 2370560. Throughput: 0: 13047.4. Samples: 2368439. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:43:11,748][23556] Avg episode reward: [(0, '31.126')] [2023-03-06 16:43:12,149][23882] Updated weights for policy 0, policy_version 2320 (0.0007) [2023-03-06 16:43:12,936][23882] Updated weights for policy 0, policy_version 2330 (0.0006) [2023-03-06 16:43:13,730][23882] Updated weights for policy 0, policy_version 2340 (0.0006) [2023-03-06 16:43:14,510][23882] Updated weights for policy 0, policy_version 2350 (0.0007) [2023-03-06 16:43:15,281][23882] Updated weights for policy 0, policy_version 2360 (0.0007) [2023-03-06 16:43:16,069][23882] Updated weights for policy 0, policy_version 2370 (0.0007) [2023-03-06 16:43:16,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 12816.2). Total num frames: 2435072. Throughput: 0: 13051.9. Samples: 2407583. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 16:43:16,748][23556] Avg episode reward: [(0, '35.528')] [2023-03-06 16:43:16,752][23831] Saving new best policy, reward=35.528! [2023-03-06 16:43:16,859][23882] Updated weights for policy 0, policy_version 2380 (0.0006) [2023-03-06 16:43:17,633][23882] Updated weights for policy 0, policy_version 2390 (0.0006) [2023-03-06 16:43:18,411][23882] Updated weights for policy 0, policy_version 2400 (0.0006) [2023-03-06 16:43:19,189][23882] Updated weights for policy 0, policy_version 2410 (0.0006) [2023-03-06 16:43:19,953][23882] Updated weights for policy 0, policy_version 2420 (0.0006) [2023-03-06 16:43:20,750][23882] Updated weights for policy 0, policy_version 2430 (0.0006) [2023-03-06 16:43:21,540][23882] Updated weights for policy 0, policy_version 2440 (0.0007) [2023-03-06 16:43:21,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 12823.6). Total num frames: 2500608. Throughput: 0: 13058.4. Samples: 2486233. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:43:21,748][23556] Avg episode reward: [(0, '34.481')] [2023-03-06 16:43:22,323][23882] Updated weights for policy 0, policy_version 2450 (0.0007) [2023-03-06 16:43:23,114][23882] Updated weights for policy 0, policy_version 2460 (0.0006) [2023-03-06 16:43:23,883][23882] Updated weights for policy 0, policy_version 2470 (0.0006) [2023-03-06 16:43:24,657][23882] Updated weights for policy 0, policy_version 2480 (0.0006) [2023-03-06 16:43:25,433][23882] Updated weights for policy 0, policy_version 2490 (0.0005) [2023-03-06 16:43:26,206][23882] Updated weights for policy 0, policy_version 2500 (0.0006) [2023-03-06 16:43:26,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 12830.7). Total num frames: 2566144. Throughput: 0: 13075.0. Samples: 2564971. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:43:26,748][23556] Avg episode reward: [(0, '36.751')] [2023-03-06 16:43:26,752][23831] Saving new best policy, reward=36.751! [2023-03-06 16:43:26,977][23882] Updated weights for policy 0, policy_version 2510 (0.0006) [2023-03-06 16:43:27,778][23882] Updated weights for policy 0, policy_version 2520 (0.0007) [2023-03-06 16:43:28,552][23882] Updated weights for policy 0, policy_version 2530 (0.0006) [2023-03-06 16:43:29,341][23882] Updated weights for policy 0, policy_version 2540 (0.0006) [2023-03-06 16:43:30,137][23882] Updated weights for policy 0, policy_version 2550 (0.0006) [2023-03-06 16:43:30,910][23882] Updated weights for policy 0, policy_version 2560 (0.0006) [2023-03-06 16:43:31,682][23882] Updated weights for policy 0, policy_version 2570 (0.0007) [2023-03-06 16:43:31,748][23556] Fps is (10 sec: 13107.4, 60 sec: 13073.1, 300 sec: 12837.5). Total num frames: 2631680. Throughput: 0: 13069.3. Samples: 2604059. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:43:31,748][23556] Avg episode reward: [(0, '34.530')] [2023-03-06 16:43:32,489][23882] Updated weights for policy 0, policy_version 2580 (0.0006) [2023-03-06 16:43:33,266][23882] Updated weights for policy 0, policy_version 2590 (0.0007) [2023-03-06 16:43:34,044][23882] Updated weights for policy 0, policy_version 2600 (0.0007) [2023-03-06 16:43:34,815][23882] Updated weights for policy 0, policy_version 2610 (0.0006) [2023-03-06 16:43:35,605][23882] Updated weights for policy 0, policy_version 2620 (0.0006) [2023-03-06 16:43:36,394][23882] Updated weights for policy 0, policy_version 2630 (0.0006) [2023-03-06 16:43:36,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 12843.9). Total num frames: 2697216. Throughput: 0: 13080.0. Samples: 2682730. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:43:36,748][23556] Avg episode reward: [(0, '35.321')] [2023-03-06 16:43:37,184][23882] Updated weights for policy 0, policy_version 2640 (0.0006) [2023-03-06 16:43:37,948][23882] Updated weights for policy 0, policy_version 2650 (0.0005) [2023-03-06 16:43:38,745][23882] Updated weights for policy 0, policy_version 2660 (0.0007) [2023-03-06 16:43:39,540][23882] Updated weights for policy 0, policy_version 2670 (0.0007) [2023-03-06 16:43:40,310][23882] Updated weights for policy 0, policy_version 2680 (0.0008) [2023-03-06 16:43:41,102][23882] Updated weights for policy 0, policy_version 2690 (0.0006) [2023-03-06 16:43:41,748][23556] Fps is (10 sec: 13107.0, 60 sec: 13073.1, 300 sec: 12850.0). Total num frames: 2762752. Throughput: 0: 13080.5. Samples: 2760963. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:43:41,748][23556] Avg episode reward: [(0, '37.235')] [2023-03-06 16:43:41,749][23831] Saving new best policy, reward=37.235! [2023-03-06 16:43:41,884][23882] Updated weights for policy 0, policy_version 2700 (0.0006) [2023-03-06 16:43:42,665][23882] Updated weights for policy 0, policy_version 2710 (0.0006) [2023-03-06 16:43:43,459][23882] Updated weights for policy 0, policy_version 2720 (0.0006) [2023-03-06 16:43:44,238][23882] Updated weights for policy 0, policy_version 2730 (0.0007) [2023-03-06 16:43:45,036][23882] Updated weights for policy 0, policy_version 2740 (0.0006) [2023-03-06 16:43:45,810][23882] Updated weights for policy 0, policy_version 2750 (0.0007) [2023-03-06 16:43:46,617][23882] Updated weights for policy 0, policy_version 2760 (0.0006) [2023-03-06 16:43:46,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 12851.2). Total num frames: 2827264. Throughput: 0: 13076.7. Samples: 2799915. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:43:46,748][23556] Avg episode reward: [(0, '38.458')] [2023-03-06 16:43:46,761][23831] Saving new best policy, reward=38.458! [2023-03-06 16:43:47,414][23882] Updated weights for policy 0, policy_version 2770 (0.0007) [2023-03-06 16:43:48,189][23882] Updated weights for policy 0, policy_version 2780 (0.0006) [2023-03-06 16:43:48,960][23882] Updated weights for policy 0, policy_version 2790 (0.0006) [2023-03-06 16:43:49,752][23882] Updated weights for policy 0, policy_version 2800 (0.0007) [2023-03-06 16:43:50,529][23882] Updated weights for policy 0, policy_version 2810 (0.0007) [2023-03-06 16:43:51,318][23882] Updated weights for policy 0, policy_version 2820 (0.0006) [2023-03-06 16:43:51,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13073.1, 300 sec: 12856.9). Total num frames: 2892800. Throughput: 0: 13068.8. Samples: 2878291. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:43:51,748][23556] Avg episode reward: [(0, '37.557')] [2023-03-06 16:43:52,110][23882] Updated weights for policy 0, policy_version 2830 (0.0006) [2023-03-06 16:43:52,889][23882] Updated weights for policy 0, policy_version 2840 (0.0007) [2023-03-06 16:43:53,674][23882] Updated weights for policy 0, policy_version 2850 (0.0006) [2023-03-06 16:43:54,462][23882] Updated weights for policy 0, policy_version 2860 (0.0006) [2023-03-06 16:43:55,247][23882] Updated weights for policy 0, policy_version 2870 (0.0006) [2023-03-06 16:43:56,015][23882] Updated weights for policy 0, policy_version 2880 (0.0006) [2023-03-06 16:43:56,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13073.1, 300 sec: 12862.3). Total num frames: 2958336. Throughput: 0: 13066.7. Samples: 2956443. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:43:56,749][23556] Avg episode reward: [(0, '36.662')] [2023-03-06 16:43:56,754][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000002889_2958336.pth... [2023-03-06 16:43:56,805][23882] Updated weights for policy 0, policy_version 2890 (0.0006) [2023-03-06 16:43:57,579][23882] Updated weights for policy 0, policy_version 2900 (0.0006) [2023-03-06 16:43:58,379][23882] Updated weights for policy 0, policy_version 2910 (0.0006) [2023-03-06 16:43:59,158][23882] Updated weights for policy 0, policy_version 2920 (0.0006) [2023-03-06 16:43:59,942][23882] Updated weights for policy 0, policy_version 2930 (0.0006) [2023-03-06 16:44:00,742][23882] Updated weights for policy 0, policy_version 2940 (0.0006) [2023-03-06 16:44:01,538][23882] Updated weights for policy 0, policy_version 2950 (0.0006) [2023-03-06 16:44:01,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 12863.2). Total num frames: 3022848. Throughput: 0: 13067.5. Samples: 2995619. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:44:01,748][23556] Avg episode reward: [(0, '36.607')] [2023-03-06 16:44:02,332][23882] Updated weights for policy 0, policy_version 2960 (0.0006) [2023-03-06 16:44:03,104][23882] Updated weights for policy 0, policy_version 2970 (0.0006) [2023-03-06 16:44:03,906][23882] Updated weights for policy 0, policy_version 2980 (0.0006) [2023-03-06 16:44:04,687][23882] Updated weights for policy 0, policy_version 2990 (0.0007) [2023-03-06 16:44:05,451][23882] Updated weights for policy 0, policy_version 3000 (0.0006) [2023-03-06 16:44:06,242][23882] Updated weights for policy 0, policy_version 3010 (0.0006) [2023-03-06 16:44:06,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 12868.3). Total num frames: 3088384. Throughput: 0: 13060.7. Samples: 3073964. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:44:06,759][23556] Avg episode reward: [(0, '38.590')] [2023-03-06 16:44:06,764][23831] Saving new best policy, reward=38.590! [2023-03-06 16:44:07,041][23882] Updated weights for policy 0, policy_version 3020 (0.0007) [2023-03-06 16:44:07,822][23882] Updated weights for policy 0, policy_version 3030 (0.0006) [2023-03-06 16:44:08,602][23882] Updated weights for policy 0, policy_version 3040 (0.0006) [2023-03-06 16:44:09,374][23882] Updated weights for policy 0, policy_version 3050 (0.0006) [2023-03-06 16:44:10,167][23882] Updated weights for policy 0, policy_version 3060 (0.0006) [2023-03-06 16:44:10,948][23882] Updated weights for policy 0, policy_version 3070 (0.0006) [2023-03-06 16:44:11,736][23882] Updated weights for policy 0, policy_version 3080 (0.0008) [2023-03-06 16:44:11,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 12873.1). Total num frames: 3153920. Throughput: 0: 13045.9. Samples: 3152035. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:44:11,754][23556] Avg episode reward: [(0, '39.056')] [2023-03-06 16:44:11,755][23831] Saving new best policy, reward=39.056! [2023-03-06 16:44:12,522][23882] Updated weights for policy 0, policy_version 3090 (0.0006) [2023-03-06 16:44:13,319][23882] Updated weights for policy 0, policy_version 3100 (0.0006) [2023-03-06 16:44:14,091][23882] Updated weights for policy 0, policy_version 3110 (0.0006) [2023-03-06 16:44:14,877][23882] Updated weights for policy 0, policy_version 3120 (0.0006) [2023-03-06 16:44:15,667][23882] Updated weights for policy 0, policy_version 3130 (0.0006) [2023-03-06 16:44:16,445][23882] Updated weights for policy 0, policy_version 3140 (0.0006) [2023-03-06 16:44:16,748][23556] Fps is (10 sec: 13005.0, 60 sec: 13056.0, 300 sec: 12873.7). Total num frames: 3218432. Throughput: 0: 13048.3. Samples: 3191233. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:44:16,759][23556] Avg episode reward: [(0, '39.737')] [2023-03-06 16:44:16,762][23831] Saving new best policy, reward=39.737! [2023-03-06 16:44:17,218][23882] Updated weights for policy 0, policy_version 3150 (0.0007) [2023-03-06 16:44:18,009][23882] Updated weights for policy 0, policy_version 3160 (0.0006) [2023-03-06 16:44:18,788][23882] Updated weights for policy 0, policy_version 3170 (0.0006) [2023-03-06 16:44:19,578][23882] Updated weights for policy 0, policy_version 3180 (0.0006) [2023-03-06 16:44:20,341][23882] Updated weights for policy 0, policy_version 3190 (0.0006) [2023-03-06 16:44:21,129][23882] Updated weights for policy 0, policy_version 3200 (0.0006) [2023-03-06 16:44:21,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 12878.3). Total num frames: 3283968. Throughput: 0: 13044.4. Samples: 3269726. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:44:21,748][23556] Avg episode reward: [(0, '41.760')] [2023-03-06 16:44:21,753][23831] Saving new best policy, reward=41.760! [2023-03-06 16:44:21,909][23882] Updated weights for policy 0, policy_version 3210 (0.0006) [2023-03-06 16:44:22,668][23882] Updated weights for policy 0, policy_version 3220 (0.0007) [2023-03-06 16:44:23,457][23882] Updated weights for policy 0, policy_version 3230 (0.0006) [2023-03-06 16:44:24,235][23882] Updated weights for policy 0, policy_version 3240 (0.0007) [2023-03-06 16:44:25,016][23882] Updated weights for policy 0, policy_version 3250 (0.0006) [2023-03-06 16:44:25,796][23882] Updated weights for policy 0, policy_version 3260 (0.0007) [2023-03-06 16:44:26,589][23882] Updated weights for policy 0, policy_version 3270 (0.0006) [2023-03-06 16:44:26,748][23556] Fps is (10 sec: 13209.7, 60 sec: 13073.1, 300 sec: 12886.7). Total num frames: 3350528. Throughput: 0: 13058.1. Samples: 3348577. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:44:26,748][23556] Avg episode reward: [(0, '38.254')] [2023-03-06 16:44:27,368][23882] Updated weights for policy 0, policy_version 3280 (0.0006) [2023-03-06 16:44:28,136][23882] Updated weights for policy 0, policy_version 3290 (0.0006) [2023-03-06 16:44:28,933][23882] Updated weights for policy 0, policy_version 3300 (0.0007) [2023-03-06 16:44:29,718][23882] Updated weights for policy 0, policy_version 3310 (0.0006) [2023-03-06 16:44:30,498][23882] Updated weights for policy 0, policy_version 3320 (0.0006) [2023-03-06 16:44:31,283][23882] Updated weights for policy 0, policy_version 3330 (0.0006) [2023-03-06 16:44:31,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 12887.0). Total num frames: 3415040. Throughput: 0: 13062.7. Samples: 3387737. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:44:31,748][23556] Avg episode reward: [(0, '37.262')] [2023-03-06 16:44:32,070][23882] Updated weights for policy 0, policy_version 3340 (0.0007) [2023-03-06 16:44:32,849][23882] Updated weights for policy 0, policy_version 3350 (0.0007) [2023-03-06 16:44:33,653][23882] Updated weights for policy 0, policy_version 3360 (0.0007) [2023-03-06 16:44:34,425][23882] Updated weights for policy 0, policy_version 3370 (0.0007) [2023-03-06 16:44:35,206][23882] Updated weights for policy 0, policy_version 3380 (0.0005) [2023-03-06 16:44:35,984][23882] Updated weights for policy 0, policy_version 3390 (0.0007) [2023-03-06 16:44:36,748][23556] Fps is (10 sec: 13004.6, 60 sec: 13056.0, 300 sec: 12891.0). Total num frames: 3480576. Throughput: 0: 13061.9. Samples: 3466076. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:44:36,748][23556] Avg episode reward: [(0, '38.977')] [2023-03-06 16:44:36,761][23882] Updated weights for policy 0, policy_version 3400 (0.0005) [2023-03-06 16:44:37,532][23882] Updated weights for policy 0, policy_version 3410 (0.0006) [2023-03-06 16:44:38,328][23882] Updated weights for policy 0, policy_version 3420 (0.0007) [2023-03-06 16:44:39,125][23882] Updated weights for policy 0, policy_version 3430 (0.0006) [2023-03-06 16:44:39,891][23882] Updated weights for policy 0, policy_version 3440 (0.0006) [2023-03-06 16:44:40,686][23882] Updated weights for policy 0, policy_version 3450 (0.0007) [2023-03-06 16:44:41,464][23882] Updated weights for policy 0, policy_version 3460 (0.0006) [2023-03-06 16:44:41,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 12895.0). Total num frames: 3546112. Throughput: 0: 13071.0. Samples: 3544636. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 16:44:41,748][23556] Avg episode reward: [(0, '40.774')] [2023-03-06 16:44:42,246][23882] Updated weights for policy 0, policy_version 3470 (0.0006) [2023-03-06 16:44:43,060][23882] Updated weights for policy 0, policy_version 3480 (0.0007) [2023-03-06 16:44:43,828][23882] Updated weights for policy 0, policy_version 3490 (0.0007) [2023-03-06 16:44:44,610][23882] Updated weights for policy 0, policy_version 3500 (0.0006) [2023-03-06 16:44:45,408][23882] Updated weights for policy 0, policy_version 3510 (0.0006) [2023-03-06 16:44:46,199][23882] Updated weights for policy 0, policy_version 3520 (0.0007) [2023-03-06 16:44:46,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 12898.7). Total num frames: 3611648. Throughput: 0: 13067.7. Samples: 3583666. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:44:46,748][23556] Avg episode reward: [(0, '39.761')] [2023-03-06 16:44:46,970][23882] Updated weights for policy 0, policy_version 3530 (0.0007) [2023-03-06 16:44:47,754][23882] Updated weights for policy 0, policy_version 3540 (0.0006) [2023-03-06 16:44:48,536][23882] Updated weights for policy 0, policy_version 3550 (0.0006) [2023-03-06 16:44:49,299][23882] Updated weights for policy 0, policy_version 3560 (0.0007) [2023-03-06 16:44:50,097][23882] Updated weights for policy 0, policy_version 3570 (0.0007) [2023-03-06 16:44:50,881][23882] Updated weights for policy 0, policy_version 3580 (0.0007) [2023-03-06 16:44:51,650][23882] Updated weights for policy 0, policy_version 3590 (0.0006) [2023-03-06 16:44:51,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 12902.4). Total num frames: 3677184. Throughput: 0: 13069.6. Samples: 3662094. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:44:51,748][23556] Avg episode reward: [(0, '39.479')] [2023-03-06 16:44:52,437][23882] Updated weights for policy 0, policy_version 3600 (0.0006) [2023-03-06 16:44:53,226][23882] Updated weights for policy 0, policy_version 3610 (0.0006) [2023-03-06 16:44:53,994][23882] Updated weights for policy 0, policy_version 3620 (0.0006) [2023-03-06 16:44:54,777][23882] Updated weights for policy 0, policy_version 3630 (0.0006) [2023-03-06 16:44:55,557][23882] Updated weights for policy 0, policy_version 3640 (0.0006) [2023-03-06 16:44:56,346][23882] Updated weights for policy 0, policy_version 3650 (0.0006) [2023-03-06 16:44:56,748][23556] Fps is (10 sec: 13004.6, 60 sec: 13056.0, 300 sec: 12902.4). Total num frames: 3741696. Throughput: 0: 13084.6. Samples: 3740843. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:44:56,748][23556] Avg episode reward: [(0, '40.378')] [2023-03-06 16:44:57,172][23882] Updated weights for policy 0, policy_version 3660 (0.0007) [2023-03-06 16:44:57,960][23882] Updated weights for policy 0, policy_version 3670 (0.0006) [2023-03-06 16:44:58,744][23882] Updated weights for policy 0, policy_version 3680 (0.0008) [2023-03-06 16:44:59,506][23882] Updated weights for policy 0, policy_version 3690 (0.0006) [2023-03-06 16:45:00,302][23882] Updated weights for policy 0, policy_version 3700 (0.0006) [2023-03-06 16:45:01,088][23882] Updated weights for policy 0, policy_version 3710 (0.0007) [2023-03-06 16:45:01,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13073.1, 300 sec: 12905.9). Total num frames: 3807232. Throughput: 0: 13075.1. Samples: 3779611. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:45:01,748][23556] Avg episode reward: [(0, '41.660')] [2023-03-06 16:45:01,886][23882] Updated weights for policy 0, policy_version 3720 (0.0006) [2023-03-06 16:45:02,674][23882] Updated weights for policy 0, policy_version 3730 (0.0006) [2023-03-06 16:45:03,444][23882] Updated weights for policy 0, policy_version 3740 (0.0006) [2023-03-06 16:45:04,232][23882] Updated weights for policy 0, policy_version 3750 (0.0006) [2023-03-06 16:45:05,004][23882] Updated weights for policy 0, policy_version 3760 (0.0006) [2023-03-06 16:45:05,781][23882] Updated weights for policy 0, policy_version 3770 (0.0006) [2023-03-06 16:45:06,565][23882] Updated weights for policy 0, policy_version 3780 (0.0006) [2023-03-06 16:45:06,748][23556] Fps is (10 sec: 13107.4, 60 sec: 13073.1, 300 sec: 13062.1). Total num frames: 3872768. Throughput: 0: 13070.8. Samples: 3857912. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:45:06,748][23556] Avg episode reward: [(0, '42.668')] [2023-03-06 16:45:06,752][23831] Saving new best policy, reward=42.668! [2023-03-06 16:45:07,351][23882] Updated weights for policy 0, policy_version 3790 (0.0007) [2023-03-06 16:45:08,134][23882] Updated weights for policy 0, policy_version 3800 (0.0006) [2023-03-06 16:45:08,927][23882] Updated weights for policy 0, policy_version 3810 (0.0007) [2023-03-06 16:45:09,698][23882] Updated weights for policy 0, policy_version 3820 (0.0007) [2023-03-06 16:45:10,473][23882] Updated weights for policy 0, policy_version 3830 (0.0006) [2023-03-06 16:45:11,270][23882] Updated weights for policy 0, policy_version 3840 (0.0006) [2023-03-06 16:45:11,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 3937280. Throughput: 0: 13060.2. Samples: 3936286. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:45:11,748][23556] Avg episode reward: [(0, '41.918')] [2023-03-06 16:45:12,077][23882] Updated weights for policy 0, policy_version 3850 (0.0006) [2023-03-06 16:45:12,862][23882] Updated weights for policy 0, policy_version 3860 (0.0007) [2023-03-06 16:45:13,642][23882] Updated weights for policy 0, policy_version 3870 (0.0006) [2023-03-06 16:45:14,437][23882] Updated weights for policy 0, policy_version 3880 (0.0006) [2023-03-06 16:45:15,227][23882] Updated weights for policy 0, policy_version 3890 (0.0007) [2023-03-06 16:45:15,991][23882] Updated weights for policy 0, policy_version 3900 (0.0006) [2023-03-06 16:45:16,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13073.0, 300 sec: 13058.6). Total num frames: 4002816. Throughput: 0: 13055.0. Samples: 3975213. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 16:45:16,748][23556] Avg episode reward: [(0, '41.626')] [2023-03-06 16:45:16,760][23882] Updated weights for policy 0, policy_version 3910 (0.0006) [2023-03-06 16:45:17,544][23882] Updated weights for policy 0, policy_version 3920 (0.0006) [2023-03-06 16:45:18,326][23882] Updated weights for policy 0, policy_version 3930 (0.0006) [2023-03-06 16:45:19,098][23882] Updated weights for policy 0, policy_version 3940 (0.0006) [2023-03-06 16:45:19,893][23882] Updated weights for policy 0, policy_version 3950 (0.0006) [2023-03-06 16:45:20,668][23882] Updated weights for policy 0, policy_version 3960 (0.0007) [2023-03-06 16:45:21,439][23882] Updated weights for policy 0, policy_version 3970 (0.0006) [2023-03-06 16:45:21,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13058.6). Total num frames: 4068352. Throughput: 0: 13063.7. Samples: 4053941. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:45:21,748][23556] Avg episode reward: [(0, '44.219')] [2023-03-06 16:45:21,755][23831] Saving new best policy, reward=44.219! [2023-03-06 16:45:22,225][23882] Updated weights for policy 0, policy_version 3980 (0.0007) [2023-03-06 16:45:23,007][23882] Updated weights for policy 0, policy_version 3990 (0.0006) [2023-03-06 16:45:23,806][23882] Updated weights for policy 0, policy_version 4000 (0.0006) [2023-03-06 16:45:24,588][23882] Updated weights for policy 0, policy_version 4010 (0.0007) [2023-03-06 16:45:25,372][23882] Updated weights for policy 0, policy_version 4020 (0.0007) [2023-03-06 16:45:26,151][23882] Updated weights for policy 0, policy_version 4030 (0.0006) [2023-03-06 16:45:26,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 4133888. Throughput: 0: 13062.1. Samples: 4132433. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:45:26,748][23556] Avg episode reward: [(0, '45.294')] [2023-03-06 16:45:26,752][23831] Saving new best policy, reward=45.294! [2023-03-06 16:45:26,956][23882] Updated weights for policy 0, policy_version 4040 (0.0007) [2023-03-06 16:45:27,744][23882] Updated weights for policy 0, policy_version 4050 (0.0006) [2023-03-06 16:45:28,525][23882] Updated weights for policy 0, policy_version 4060 (0.0006) [2023-03-06 16:45:29,322][23882] Updated weights for policy 0, policy_version 4070 (0.0006) [2023-03-06 16:45:30,101][23882] Updated weights for policy 0, policy_version 4080 (0.0006) [2023-03-06 16:45:30,882][23882] Updated weights for policy 0, policy_version 4090 (0.0006) [2023-03-06 16:45:31,678][23882] Updated weights for policy 0, policy_version 4100 (0.0006) [2023-03-06 16:45:31,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13058.6). Total num frames: 4199424. Throughput: 0: 13056.4. Samples: 4171202. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:45:31,748][23556] Avg episode reward: [(0, '42.862')] [2023-03-06 16:45:32,459][23882] Updated weights for policy 0, policy_version 4110 (0.0006) [2023-03-06 16:45:33,250][23882] Updated weights for policy 0, policy_version 4120 (0.0006) [2023-03-06 16:45:34,021][23882] Updated weights for policy 0, policy_version 4130 (0.0006) [2023-03-06 16:45:34,817][23882] Updated weights for policy 0, policy_version 4140 (0.0006) [2023-03-06 16:45:35,587][23882] Updated weights for policy 0, policy_version 4150 (0.0006) [2023-03-06 16:45:36,382][23882] Updated weights for policy 0, policy_version 4160 (0.0007) [2023-03-06 16:45:36,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13055.1). Total num frames: 4263936. Throughput: 0: 13055.8. Samples: 4249605. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:45:36,748][23556] Avg episode reward: [(0, '43.889')] [2023-03-06 16:45:37,165][23882] Updated weights for policy 0, policy_version 4170 (0.0006) [2023-03-06 16:45:37,942][23882] Updated weights for policy 0, policy_version 4180 (0.0006) [2023-03-06 16:45:38,715][23882] Updated weights for policy 0, policy_version 4190 (0.0007) [2023-03-06 16:45:39,504][23882] Updated weights for policy 0, policy_version 4200 (0.0007) [2023-03-06 16:45:40,286][23882] Updated weights for policy 0, policy_version 4210 (0.0007) [2023-03-06 16:45:41,075][23882] Updated weights for policy 0, policy_version 4220 (0.0007) [2023-03-06 16:45:41,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13055.1). Total num frames: 4329472. Throughput: 0: 13047.7. Samples: 4327989. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:45:41,748][23556] Avg episode reward: [(0, '45.319')] [2023-03-06 16:45:41,749][23831] Saving new best policy, reward=45.319! [2023-03-06 16:45:41,874][23882] Updated weights for policy 0, policy_version 4230 (0.0006) [2023-03-06 16:45:42,655][23882] Updated weights for policy 0, policy_version 4240 (0.0006) [2023-03-06 16:45:43,433][23882] Updated weights for policy 0, policy_version 4250 (0.0006) [2023-03-06 16:45:44,225][23882] Updated weights for policy 0, policy_version 4260 (0.0007) [2023-03-06 16:45:45,002][23882] Updated weights for policy 0, policy_version 4270 (0.0006) [2023-03-06 16:45:45,794][23882] Updated weights for policy 0, policy_version 4280 (0.0005) [2023-03-06 16:45:46,594][23882] Updated weights for policy 0, policy_version 4290 (0.0007) [2023-03-06 16:45:46,748][23556] Fps is (10 sec: 13107.4, 60 sec: 13056.0, 300 sec: 13055.1). Total num frames: 4395008. Throughput: 0: 13050.1. Samples: 4366866. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:45:46,748][23556] Avg episode reward: [(0, '44.940')] [2023-03-06 16:45:47,367][23882] Updated weights for policy 0, policy_version 4300 (0.0005) [2023-03-06 16:45:48,161][23882] Updated weights for policy 0, policy_version 4310 (0.0007) [2023-03-06 16:45:48,947][23882] Updated weights for policy 0, policy_version 4320 (0.0007) [2023-03-06 16:45:49,751][23882] Updated weights for policy 0, policy_version 4330 (0.0007) [2023-03-06 16:45:50,534][23882] Updated weights for policy 0, policy_version 4340 (0.0006) [2023-03-06 16:45:51,315][23882] Updated weights for policy 0, policy_version 4350 (0.0006) [2023-03-06 16:45:51,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 4459520. Throughput: 0: 13050.0. Samples: 4445161. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:45:51,748][23556] Avg episode reward: [(0, '46.526')] [2023-03-06 16:45:51,749][23831] Saving new best policy, reward=46.526! [2023-03-06 16:45:52,094][23882] Updated weights for policy 0, policy_version 4360 (0.0007) [2023-03-06 16:45:52,877][23882] Updated weights for policy 0, policy_version 4370 (0.0006) [2023-03-06 16:45:53,673][23882] Updated weights for policy 0, policy_version 4380 (0.0006) [2023-03-06 16:45:54,463][23882] Updated weights for policy 0, policy_version 4390 (0.0006) [2023-03-06 16:45:55,234][23882] Updated weights for policy 0, policy_version 4400 (0.0005) [2023-03-06 16:45:56,007][23882] Updated weights for policy 0, policy_version 4410 (0.0006) [2023-03-06 16:45:56,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13055.1). Total num frames: 4525056. Throughput: 0: 13042.6. Samples: 4523203. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:45:56,748][23556] Avg episode reward: [(0, '46.879')] [2023-03-06 16:45:56,752][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000004419_4525056.pth... [2023-03-06 16:45:56,780][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000001359_1391616.pth [2023-03-06 16:45:56,783][23831] Saving new best policy, reward=46.879! [2023-03-06 16:45:56,835][23882] Updated weights for policy 0, policy_version 4420 (0.0006) [2023-03-06 16:45:57,588][23882] Updated weights for policy 0, policy_version 4430 (0.0006) [2023-03-06 16:45:58,387][23882] Updated weights for policy 0, policy_version 4440 (0.0006) [2023-03-06 16:45:59,173][23882] Updated weights for policy 0, policy_version 4450 (0.0006) [2023-03-06 16:45:59,947][23882] Updated weights for policy 0, policy_version 4460 (0.0005) [2023-03-06 16:46:00,710][23882] Updated weights for policy 0, policy_version 4470 (0.0006) [2023-03-06 16:46:01,500][23882] Updated weights for policy 0, policy_version 4480 (0.0006) [2023-03-06 16:46:01,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13055.1). Total num frames: 4590592. Throughput: 0: 13049.5. Samples: 4562441. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:46:01,748][23556] Avg episode reward: [(0, '48.596')] [2023-03-06 16:46:01,749][23831] Saving new best policy, reward=48.596! [2023-03-06 16:46:02,287][23882] Updated weights for policy 0, policy_version 4490 (0.0006) [2023-03-06 16:46:03,066][23882] Updated weights for policy 0, policy_version 4500 (0.0006) [2023-03-06 16:46:03,852][23882] Updated weights for policy 0, policy_version 4510 (0.0007) [2023-03-06 16:46:04,631][23882] Updated weights for policy 0, policy_version 4520 (0.0006) [2023-03-06 16:46:05,398][23882] Updated weights for policy 0, policy_version 4530 (0.0006) [2023-03-06 16:46:06,179][23882] Updated weights for policy 0, policy_version 4540 (0.0007) [2023-03-06 16:46:06,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 4655104. Throughput: 0: 13045.5. Samples: 4640989. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:46:06,748][23556] Avg episode reward: [(0, '43.437')] [2023-03-06 16:46:06,969][23882] Updated weights for policy 0, policy_version 4550 (0.0006) [2023-03-06 16:46:07,768][23882] Updated weights for policy 0, policy_version 4560 (0.0006) [2023-03-06 16:46:08,558][23882] Updated weights for policy 0, policy_version 4570 (0.0006) [2023-03-06 16:46:09,351][23882] Updated weights for policy 0, policy_version 4580 (0.0006) [2023-03-06 16:46:10,140][23882] Updated weights for policy 0, policy_version 4590 (0.0006) [2023-03-06 16:46:10,907][23882] Updated weights for policy 0, policy_version 4600 (0.0007) [2023-03-06 16:46:11,702][23882] Updated weights for policy 0, policy_version 4610 (0.0006) [2023-03-06 16:46:11,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 4720640. Throughput: 0: 13041.7. Samples: 4719307. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:46:11,748][23556] Avg episode reward: [(0, '45.761')] [2023-03-06 16:46:12,477][23882] Updated weights for policy 0, policy_version 4620 (0.0006) [2023-03-06 16:46:13,252][23882] Updated weights for policy 0, policy_version 4630 (0.0006) [2023-03-06 16:46:14,045][23882] Updated weights for policy 0, policy_version 4640 (0.0006) [2023-03-06 16:46:14,813][23882] Updated weights for policy 0, policy_version 4650 (0.0007) [2023-03-06 16:46:15,584][23882] Updated weights for policy 0, policy_version 4660 (0.0006) [2023-03-06 16:46:16,391][23882] Updated weights for policy 0, policy_version 4670 (0.0007) [2023-03-06 16:46:16,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 4786176. Throughput: 0: 13053.9. Samples: 4758626. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:46:16,748][23556] Avg episode reward: [(0, '45.795')] [2023-03-06 16:46:17,166][23882] Updated weights for policy 0, policy_version 4680 (0.0006) [2023-03-06 16:46:17,938][23882] Updated weights for policy 0, policy_version 4690 (0.0007) [2023-03-06 16:46:18,730][23882] Updated weights for policy 0, policy_version 4700 (0.0006) [2023-03-06 16:46:19,517][23882] Updated weights for policy 0, policy_version 4710 (0.0006) [2023-03-06 16:46:20,315][23882] Updated weights for policy 0, policy_version 4720 (0.0006) [2023-03-06 16:46:21,103][23882] Updated weights for policy 0, policy_version 4730 (0.0006) [2023-03-06 16:46:21,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13048.2). Total num frames: 4850688. Throughput: 0: 13050.0. Samples: 4836855. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 16:46:21,748][23556] Avg episode reward: [(0, '46.104')] [2023-03-06 16:46:21,907][23882] Updated weights for policy 0, policy_version 4740 (0.0007) [2023-03-06 16:46:22,686][23882] Updated weights for policy 0, policy_version 4750 (0.0007) [2023-03-06 16:46:23,486][23882] Updated weights for policy 0, policy_version 4760 (0.0006) [2023-03-06 16:46:24,275][23882] Updated weights for policy 0, policy_version 4770 (0.0007) [2023-03-06 16:46:25,050][23882] Updated weights for policy 0, policy_version 4780 (0.0006) [2023-03-06 16:46:25,826][23882] Updated weights for policy 0, policy_version 4790 (0.0006) [2023-03-06 16:46:26,597][23882] Updated weights for policy 0, policy_version 4800 (0.0006) [2023-03-06 16:46:26,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13048.2). Total num frames: 4916224. Throughput: 0: 13046.6. Samples: 4915087. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:46:26,748][23556] Avg episode reward: [(0, '47.038')] [2023-03-06 16:46:27,366][23882] Updated weights for policy 0, policy_version 4810 (0.0006) [2023-03-06 16:46:28,158][23882] Updated weights for policy 0, policy_version 4820 (0.0006) [2023-03-06 16:46:28,956][23882] Updated weights for policy 0, policy_version 4830 (0.0007) [2023-03-06 16:46:29,729][23882] Updated weights for policy 0, policy_version 4840 (0.0007) [2023-03-06 16:46:30,515][23882] Updated weights for policy 0, policy_version 4850 (0.0006) [2023-03-06 16:46:31,299][23882] Updated weights for policy 0, policy_version 4860 (0.0006) [2023-03-06 16:46:31,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 4981760. Throughput: 0: 13053.1. Samples: 4954257. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:46:31,748][23556] Avg episode reward: [(0, '48.174')] [2023-03-06 16:46:32,095][23882] Updated weights for policy 0, policy_version 4870 (0.0006) [2023-03-06 16:46:32,894][23882] Updated weights for policy 0, policy_version 4880 (0.0006) [2023-03-06 16:46:33,689][23882] Updated weights for policy 0, policy_version 4890 (0.0006) [2023-03-06 16:46:34,457][23882] Updated weights for policy 0, policy_version 4900 (0.0006) [2023-03-06 16:46:35,267][23882] Updated weights for policy 0, policy_version 4910 (0.0006) [2023-03-06 16:46:36,052][23882] Updated weights for policy 0, policy_version 4920 (0.0008) [2023-03-06 16:46:36,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 5047296. Throughput: 0: 13044.2. Samples: 5032150. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:46:36,749][23556] Avg episode reward: [(0, '48.740')] [2023-03-06 16:46:36,753][23831] Saving new best policy, reward=48.740! [2023-03-06 16:46:36,837][23882] Updated weights for policy 0, policy_version 4930 (0.0006) [2023-03-06 16:46:37,617][23882] Updated weights for policy 0, policy_version 4940 (0.0007) [2023-03-06 16:46:38,404][23882] Updated weights for policy 0, policy_version 4950 (0.0008) [2023-03-06 16:46:39,182][23882] Updated weights for policy 0, policy_version 4960 (0.0007) [2023-03-06 16:46:39,979][23882] Updated weights for policy 0, policy_version 4970 (0.0005) [2023-03-06 16:46:40,757][23882] Updated weights for policy 0, policy_version 4980 (0.0006) [2023-03-06 16:46:41,547][23882] Updated weights for policy 0, policy_version 4990 (0.0007) [2023-03-06 16:46:41,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 5111808. Throughput: 0: 13048.6. Samples: 5110392. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:46:41,748][23556] Avg episode reward: [(0, '47.480')] [2023-03-06 16:46:42,338][23882] Updated weights for policy 0, policy_version 5000 (0.0007) [2023-03-06 16:46:43,127][23882] Updated weights for policy 0, policy_version 5010 (0.0006) [2023-03-06 16:46:43,904][23882] Updated weights for policy 0, policy_version 5020 (0.0006) [2023-03-06 16:46:44,690][23882] Updated weights for policy 0, policy_version 5030 (0.0006) [2023-03-06 16:46:45,477][23882] Updated weights for policy 0, policy_version 5040 (0.0006) [2023-03-06 16:46:46,269][23882] Updated weights for policy 0, policy_version 5050 (0.0006) [2023-03-06 16:46:46,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 5177344. Throughput: 0: 13047.1. Samples: 5149561. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:46:46,749][23556] Avg episode reward: [(0, '47.655')] [2023-03-06 16:46:47,038][23882] Updated weights for policy 0, policy_version 5060 (0.0007) [2023-03-06 16:46:47,830][23882] Updated weights for policy 0, policy_version 5070 (0.0006) [2023-03-06 16:46:48,609][23882] Updated weights for policy 0, policy_version 5080 (0.0005) [2023-03-06 16:46:49,389][23882] Updated weights for policy 0, policy_version 5090 (0.0006) [2023-03-06 16:46:50,163][23882] Updated weights for policy 0, policy_version 5100 (0.0006) [2023-03-06 16:46:50,952][23882] Updated weights for policy 0, policy_version 5110 (0.0007) [2023-03-06 16:46:51,746][23882] Updated weights for policy 0, policy_version 5120 (0.0007) [2023-03-06 16:46:51,748][23556] Fps is (10 sec: 13107.4, 60 sec: 13056.0, 300 sec: 13055.1). Total num frames: 5242880. Throughput: 0: 13042.5. Samples: 5227899. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:46:51,748][23556] Avg episode reward: [(0, '49.683')] [2023-03-06 16:46:51,749][23831] Saving new best policy, reward=49.683! [2023-03-06 16:46:52,536][23882] Updated weights for policy 0, policy_version 5130 (0.0006) [2023-03-06 16:46:53,319][23882] Updated weights for policy 0, policy_version 5140 (0.0006) [2023-03-06 16:46:54,112][23882] Updated weights for policy 0, policy_version 5150 (0.0007) [2023-03-06 16:46:54,894][23882] Updated weights for policy 0, policy_version 5160 (0.0006) [2023-03-06 16:46:55,675][23882] Updated weights for policy 0, policy_version 5170 (0.0006) [2023-03-06 16:46:56,465][23882] Updated weights for policy 0, policy_version 5180 (0.0006) [2023-03-06 16:46:56,748][23556] Fps is (10 sec: 13005.0, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 5307392. Throughput: 0: 13038.0. Samples: 5306016. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:46:56,748][23556] Avg episode reward: [(0, '48.443')] [2023-03-06 16:46:57,268][23882] Updated weights for policy 0, policy_version 5190 (0.0007) [2023-03-06 16:46:58,021][23882] Updated weights for policy 0, policy_version 5200 (0.0006) [2023-03-06 16:46:58,825][23882] Updated weights for policy 0, policy_version 5210 (0.0006) [2023-03-06 16:46:59,599][23882] Updated weights for policy 0, policy_version 5220 (0.0007) [2023-03-06 16:47:00,378][23882] Updated weights for policy 0, policy_version 5230 (0.0006) [2023-03-06 16:47:01,174][23882] Updated weights for policy 0, policy_version 5240 (0.0006) [2023-03-06 16:47:01,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 5372928. Throughput: 0: 13033.7. Samples: 5345142. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:47:01,748][23556] Avg episode reward: [(0, '49.616')] [2023-03-06 16:47:01,946][23882] Updated weights for policy 0, policy_version 5250 (0.0007) [2023-03-06 16:47:02,720][23882] Updated weights for policy 0, policy_version 5260 (0.0007) [2023-03-06 16:47:03,499][23882] Updated weights for policy 0, policy_version 5270 (0.0007) [2023-03-06 16:47:04,278][23882] Updated weights for policy 0, policy_version 5280 (0.0006) [2023-03-06 16:47:05,083][23882] Updated weights for policy 0, policy_version 5290 (0.0006) [2023-03-06 16:47:05,858][23882] Updated weights for policy 0, policy_version 5300 (0.0007) [2023-03-06 16:47:06,662][23882] Updated weights for policy 0, policy_version 5310 (0.0007) [2023-03-06 16:47:06,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13055.1). Total num frames: 5438464. Throughput: 0: 13038.6. Samples: 5423592. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:47:06,748][23556] Avg episode reward: [(0, '49.259')] [2023-03-06 16:47:07,448][23882] Updated weights for policy 0, policy_version 5320 (0.0007) [2023-03-06 16:47:08,230][23882] Updated weights for policy 0, policy_version 5330 (0.0006) [2023-03-06 16:47:09,011][23882] Updated weights for policy 0, policy_version 5340 (0.0006) [2023-03-06 16:47:09,809][23882] Updated weights for policy 0, policy_version 5350 (0.0007) [2023-03-06 16:47:10,594][23882] Updated weights for policy 0, policy_version 5360 (0.0006) [2023-03-06 16:47:11,384][23882] Updated weights for policy 0, policy_version 5370 (0.0007) [2023-03-06 16:47:11,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 5502976. Throughput: 0: 13031.4. Samples: 5501500. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:47:11,748][23556] Avg episode reward: [(0, '49.454')] [2023-03-06 16:47:12,166][23882] Updated weights for policy 0, policy_version 5380 (0.0006) [2023-03-06 16:47:12,926][23882] Updated weights for policy 0, policy_version 5390 (0.0006) [2023-03-06 16:47:13,739][23882] Updated weights for policy 0, policy_version 5400 (0.0006) [2023-03-06 16:47:14,513][23882] Updated weights for policy 0, policy_version 5410 (0.0007) [2023-03-06 16:47:15,294][23882] Updated weights for policy 0, policy_version 5420 (0.0006) [2023-03-06 16:47:16,085][23882] Updated weights for policy 0, policy_version 5430 (0.0007) [2023-03-06 16:47:16,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 5568512. Throughput: 0: 13033.8. Samples: 5540777. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:47:16,748][23556] Avg episode reward: [(0, '51.157')] [2023-03-06 16:47:16,752][23831] Saving new best policy, reward=51.157! [2023-03-06 16:47:16,902][23882] Updated weights for policy 0, policy_version 5440 (0.0007) [2023-03-06 16:47:17,681][23882] Updated weights for policy 0, policy_version 5450 (0.0006) [2023-03-06 16:47:18,457][23882] Updated weights for policy 0, policy_version 5460 (0.0006) [2023-03-06 16:47:19,230][23882] Updated weights for policy 0, policy_version 5470 (0.0006) [2023-03-06 16:47:20,008][23882] Updated weights for policy 0, policy_version 5480 (0.0006) [2023-03-06 16:47:20,805][23882] Updated weights for policy 0, policy_version 5490 (0.0007) [2023-03-06 16:47:21,621][23882] Updated weights for policy 0, policy_version 5500 (0.0006) [2023-03-06 16:47:21,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 5633024. Throughput: 0: 13042.7. Samples: 5619070. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:47:21,748][23556] Avg episode reward: [(0, '50.580')] [2023-03-06 16:47:22,427][23882] Updated weights for policy 0, policy_version 5510 (0.0006) [2023-03-06 16:47:23,230][23882] Updated weights for policy 0, policy_version 5520 (0.0007) [2023-03-06 16:47:24,032][23882] Updated weights for policy 0, policy_version 5530 (0.0007) [2023-03-06 16:47:24,847][23882] Updated weights for policy 0, policy_version 5540 (0.0007) [2023-03-06 16:47:25,693][23882] Updated weights for policy 0, policy_version 5550 (0.0006) [2023-03-06 16:47:26,496][23882] Updated weights for policy 0, policy_version 5560 (0.0006) [2023-03-06 16:47:26,748][23556] Fps is (10 sec: 12800.0, 60 sec: 13004.8, 300 sec: 13048.2). Total num frames: 5696512. Throughput: 0: 12984.7. Samples: 5694700. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:47:26,748][23556] Avg episode reward: [(0, '50.314')] [2023-03-06 16:47:27,289][23882] Updated weights for policy 0, policy_version 5570 (0.0006) [2023-03-06 16:47:28,069][23882] Updated weights for policy 0, policy_version 5580 (0.0008) [2023-03-06 16:47:28,847][23882] Updated weights for policy 0, policy_version 5590 (0.0006) [2023-03-06 16:47:29,635][23882] Updated weights for policy 0, policy_version 5600 (0.0007) [2023-03-06 16:47:30,421][23882] Updated weights for policy 0, policy_version 5610 (0.0006) [2023-03-06 16:47:31,197][23882] Updated weights for policy 0, policy_version 5620 (0.0007) [2023-03-06 16:47:31,748][23556] Fps is (10 sec: 12800.0, 60 sec: 12987.7, 300 sec: 13044.7). Total num frames: 5761024. Throughput: 0: 12982.3. Samples: 5733761. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:47:31,759][23556] Avg episode reward: [(0, '48.940')] [2023-03-06 16:47:31,987][23882] Updated weights for policy 0, policy_version 5630 (0.0006) [2023-03-06 16:47:32,804][23882] Updated weights for policy 0, policy_version 5640 (0.0006) [2023-03-06 16:47:33,604][23882] Updated weights for policy 0, policy_version 5650 (0.0006) [2023-03-06 16:47:34,415][23882] Updated weights for policy 0, policy_version 5660 (0.0007) [2023-03-06 16:47:35,229][23882] Updated weights for policy 0, policy_version 5670 (0.0006) [2023-03-06 16:47:36,068][23882] Updated weights for policy 0, policy_version 5680 (0.0007) [2023-03-06 16:47:36,748][23556] Fps is (10 sec: 12799.8, 60 sec: 12953.6, 300 sec: 13037.8). Total num frames: 5824512. Throughput: 0: 12951.2. Samples: 5810706. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:47:36,759][23556] Avg episode reward: [(0, '49.849')] [2023-03-06 16:47:36,898][23882] Updated weights for policy 0, policy_version 5690 (0.0006) [2023-03-06 16:47:37,735][23882] Updated weights for policy 0, policy_version 5700 (0.0006) [2023-03-06 16:47:38,561][23882] Updated weights for policy 0, policy_version 5710 (0.0007) [2023-03-06 16:47:39,385][23882] Updated weights for policy 0, policy_version 5720 (0.0005) [2023-03-06 16:47:40,202][23882] Updated weights for policy 0, policy_version 5730 (0.0007) [2023-03-06 16:47:41,070][23882] Updated weights for policy 0, policy_version 5740 (0.0006) [2023-03-06 16:47:41,748][23556] Fps is (10 sec: 12492.8, 60 sec: 12902.4, 300 sec: 13023.9). Total num frames: 5885952. Throughput: 0: 12849.5. Samples: 5884245. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:47:41,758][23556] Avg episode reward: [(0, '49.970')] [2023-03-06 16:47:41,906][23882] Updated weights for policy 0, policy_version 5750 (0.0006) [2023-03-06 16:47:42,745][23882] Updated weights for policy 0, policy_version 5760 (0.0007) [2023-03-06 16:47:43,569][23882] Updated weights for policy 0, policy_version 5770 (0.0006) [2023-03-06 16:47:44,415][23882] Updated weights for policy 0, policy_version 5780 (0.0007) [2023-03-06 16:47:45,256][23882] Updated weights for policy 0, policy_version 5790 (0.0007) [2023-03-06 16:47:46,090][23882] Updated weights for policy 0, policy_version 5800 (0.0006) [2023-03-06 16:47:46,748][23556] Fps is (10 sec: 12185.7, 60 sec: 12817.1, 300 sec: 13010.0). Total num frames: 5946368. Throughput: 0: 12799.0. Samples: 5921096. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:47:46,759][23556] Avg episode reward: [(0, '50.719')] [2023-03-06 16:47:46,936][23882] Updated weights for policy 0, policy_version 5810 (0.0006) [2023-03-06 16:47:47,755][23882] Updated weights for policy 0, policy_version 5820 (0.0006) [2023-03-06 16:47:48,586][23882] Updated weights for policy 0, policy_version 5830 (0.0006) [2023-03-06 16:47:49,433][23882] Updated weights for policy 0, policy_version 5840 (0.0007) [2023-03-06 16:47:50,265][23882] Updated weights for policy 0, policy_version 5850 (0.0006) [2023-03-06 16:47:51,099][23882] Updated weights for policy 0, policy_version 5860 (0.0006) [2023-03-06 16:47:51,748][23556] Fps is (10 sec: 12185.6, 60 sec: 12748.8, 300 sec: 12996.1). Total num frames: 6007808. Throughput: 0: 12686.0. Samples: 5994461. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:47:51,759][23556] Avg episode reward: [(0, '51.496')] [2023-03-06 16:47:51,759][23831] Saving new best policy, reward=51.496! [2023-03-06 16:47:51,921][23882] Updated weights for policy 0, policy_version 5870 (0.0007) [2023-03-06 16:47:52,778][23882] Updated weights for policy 0, policy_version 5880 (0.0007) [2023-03-06 16:47:53,613][23882] Updated weights for policy 0, policy_version 5890 (0.0006) [2023-03-06 16:47:54,446][23882] Updated weights for policy 0, policy_version 5900 (0.0006) [2023-03-06 16:47:55,298][23882] Updated weights for policy 0, policy_version 5910 (0.0007) [2023-03-06 16:47:56,122][23882] Updated weights for policy 0, policy_version 5920 (0.0007) [2023-03-06 16:47:56,748][23556] Fps is (10 sec: 12287.9, 60 sec: 12697.6, 300 sec: 12982.2). Total num frames: 6069248. Throughput: 0: 12587.3. Samples: 6067930. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:47:56,754][23556] Avg episode reward: [(0, '51.499')] [2023-03-06 16:47:56,758][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000005927_6069248.pth... [2023-03-06 16:47:56,787][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000002889_2958336.pth [2023-03-06 16:47:56,790][23831] Saving new best policy, reward=51.499! [2023-03-06 16:47:56,970][23882] Updated weights for policy 0, policy_version 5930 (0.0006) [2023-03-06 16:47:57,801][23882] Updated weights for policy 0, policy_version 5940 (0.0006) [2023-03-06 16:47:58,630][23882] Updated weights for policy 0, policy_version 5950 (0.0006) [2023-03-06 16:47:59,462][23882] Updated weights for policy 0, policy_version 5960 (0.0006) [2023-03-06 16:48:00,306][23882] Updated weights for policy 0, policy_version 5970 (0.0007) [2023-03-06 16:48:01,118][23882] Updated weights for policy 0, policy_version 5980 (0.0006) [2023-03-06 16:48:01,748][23556] Fps is (10 sec: 12288.0, 60 sec: 12629.3, 300 sec: 12968.4). Total num frames: 6130688. Throughput: 0: 12529.4. Samples: 6104601. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:48:01,759][23556] Avg episode reward: [(0, '52.715')] [2023-03-06 16:48:01,760][23831] Saving new best policy, reward=52.715! [2023-03-06 16:48:01,925][23882] Updated weights for policy 0, policy_version 5990 (0.0006) [2023-03-06 16:48:02,729][23882] Updated weights for policy 0, policy_version 6000 (0.0007) [2023-03-06 16:48:03,509][23882] Updated weights for policy 0, policy_version 6010 (0.0007) [2023-03-06 16:48:04,294][23882] Updated weights for policy 0, policy_version 6020 (0.0007) [2023-03-06 16:48:05,090][23882] Updated weights for policy 0, policy_version 6030 (0.0006) [2023-03-06 16:48:05,869][23882] Updated weights for policy 0, policy_version 6040 (0.0006) [2023-03-06 16:48:06,651][23882] Updated weights for policy 0, policy_version 6050 (0.0006) [2023-03-06 16:48:06,748][23556] Fps is (10 sec: 12697.7, 60 sec: 12629.3, 300 sec: 12968.3). Total num frames: 6196224. Throughput: 0: 12495.5. Samples: 6181370. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:48:06,759][23556] Avg episode reward: [(0, '51.856')] [2023-03-06 16:48:07,450][23882] Updated weights for policy 0, policy_version 6060 (0.0006) [2023-03-06 16:48:08,232][23882] Updated weights for policy 0, policy_version 6070 (0.0008) [2023-03-06 16:48:09,014][23882] Updated weights for policy 0, policy_version 6080 (0.0007) [2023-03-06 16:48:09,792][23882] Updated weights for policy 0, policy_version 6090 (0.0006) [2023-03-06 16:48:10,681][23882] Updated weights for policy 0, policy_version 6100 (0.0007) [2023-03-06 16:48:11,543][23882] Updated weights for policy 0, policy_version 6110 (0.0006) [2023-03-06 16:48:11,748][23556] Fps is (10 sec: 12799.8, 60 sec: 12595.2, 300 sec: 12961.4). Total num frames: 6258688. Throughput: 0: 12500.6. Samples: 6257227. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:48:11,757][23556] Avg episode reward: [(0, '52.638')] [2023-03-06 16:48:12,437][23882] Updated weights for policy 0, policy_version 6120 (0.0007) [2023-03-06 16:48:13,308][23882] Updated weights for policy 0, policy_version 6130 (0.0007) [2023-03-06 16:48:14,193][23882] Updated weights for policy 0, policy_version 6140 (0.0007) [2023-03-06 16:48:14,951][23882] Updated weights for policy 0, policy_version 6150 (0.0006) [2023-03-06 16:48:15,727][23882] Updated weights for policy 0, policy_version 6160 (0.0006) [2023-03-06 16:48:16,486][23882] Updated weights for policy 0, policy_version 6170 (0.0006) [2023-03-06 16:48:16,748][23556] Fps is (10 sec: 12492.8, 60 sec: 12544.0, 300 sec: 12951.0). Total num frames: 6321152. Throughput: 0: 12422.3. Samples: 6292764. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:48:16,759][23556] Avg episode reward: [(0, '51.155')] [2023-03-06 16:48:17,254][23882] Updated weights for policy 0, policy_version 6180 (0.0007) [2023-03-06 16:48:18,035][23882] Updated weights for policy 0, policy_version 6190 (0.0006) [2023-03-06 16:48:18,782][23882] Updated weights for policy 0, policy_version 6200 (0.0006) [2023-03-06 16:48:19,555][23882] Updated weights for policy 0, policy_version 6210 (0.0007) [2023-03-06 16:48:20,336][23882] Updated weights for policy 0, policy_version 6220 (0.0006) [2023-03-06 16:48:21,133][23882] Updated weights for policy 0, policy_version 6230 (0.0007) [2023-03-06 16:48:21,748][23556] Fps is (10 sec: 12800.2, 60 sec: 12561.1, 300 sec: 12951.0). Total num frames: 6386688. Throughput: 0: 12483.1. Samples: 6372444. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:48:21,748][23556] Avg episode reward: [(0, '51.000')] [2023-03-06 16:48:21,923][23882] Updated weights for policy 0, policy_version 6240 (0.0006) [2023-03-06 16:48:22,705][23882] Updated weights for policy 0, policy_version 6250 (0.0006) [2023-03-06 16:48:23,486][23882] Updated weights for policy 0, policy_version 6260 (0.0008) [2023-03-06 16:48:24,272][23882] Updated weights for policy 0, policy_version 6270 (0.0006) [2023-03-06 16:48:25,062][23882] Updated weights for policy 0, policy_version 6280 (0.0006) [2023-03-06 16:48:25,880][23882] Updated weights for policy 0, policy_version 6290 (0.0006) [2023-03-06 16:48:26,676][23882] Updated weights for policy 0, policy_version 6300 (0.0006) [2023-03-06 16:48:26,748][23556] Fps is (10 sec: 13004.9, 60 sec: 12578.1, 300 sec: 12947.5). Total num frames: 6451200. Throughput: 0: 12574.8. Samples: 6450111. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:48:26,748][23556] Avg episode reward: [(0, '51.661')] [2023-03-06 16:48:27,455][23882] Updated weights for policy 0, policy_version 6310 (0.0007) [2023-03-06 16:48:28,244][23882] Updated weights for policy 0, policy_version 6320 (0.0006) [2023-03-06 16:48:29,025][23882] Updated weights for policy 0, policy_version 6330 (0.0005) [2023-03-06 16:48:29,813][23882] Updated weights for policy 0, policy_version 6340 (0.0007) [2023-03-06 16:48:30,594][23882] Updated weights for policy 0, policy_version 6350 (0.0006) [2023-03-06 16:48:31,362][23882] Updated weights for policy 0, policy_version 6360 (0.0007) [2023-03-06 16:48:31,748][23556] Fps is (10 sec: 13004.6, 60 sec: 12595.2, 300 sec: 12947.5). Total num frames: 6516736. Throughput: 0: 12625.5. Samples: 6489246. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:48:31,749][23556] Avg episode reward: [(0, '50.050')] [2023-03-06 16:48:32,169][23882] Updated weights for policy 0, policy_version 6370 (0.0007) [2023-03-06 16:48:32,943][23882] Updated weights for policy 0, policy_version 6380 (0.0006) [2023-03-06 16:48:33,723][23882] Updated weights for policy 0, policy_version 6390 (0.0006) [2023-03-06 16:48:34,519][23882] Updated weights for policy 0, policy_version 6400 (0.0006) [2023-03-06 16:48:35,298][23882] Updated weights for policy 0, policy_version 6410 (0.0006) [2023-03-06 16:48:36,111][23882] Updated weights for policy 0, policy_version 6420 (0.0005) [2023-03-06 16:48:36,748][23556] Fps is (10 sec: 13107.2, 60 sec: 12629.4, 300 sec: 12947.5). Total num frames: 6582272. Throughput: 0: 12736.4. Samples: 6567598. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:48:36,748][23556] Avg episode reward: [(0, '49.673')] [2023-03-06 16:48:36,888][23882] Updated weights for policy 0, policy_version 6430 (0.0006) [2023-03-06 16:48:37,683][23882] Updated weights for policy 0, policy_version 6440 (0.0006) [2023-03-06 16:48:38,447][23882] Updated weights for policy 0, policy_version 6450 (0.0007) [2023-03-06 16:48:39,211][23882] Updated weights for policy 0, policy_version 6460 (0.0006) [2023-03-06 16:48:39,986][23882] Updated weights for policy 0, policy_version 6470 (0.0006) [2023-03-06 16:48:40,767][23882] Updated weights for policy 0, policy_version 6480 (0.0006) [2023-03-06 16:48:41,570][23882] Updated weights for policy 0, policy_version 6490 (0.0006) [2023-03-06 16:48:41,748][23556] Fps is (10 sec: 13107.5, 60 sec: 12697.6, 300 sec: 12951.0). Total num frames: 6647808. Throughput: 0: 12846.2. Samples: 6646005. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:48:41,748][23556] Avg episode reward: [(0, '51.004')] [2023-03-06 16:48:42,348][23882] Updated weights for policy 0, policy_version 6500 (0.0006) [2023-03-06 16:48:43,124][23882] Updated weights for policy 0, policy_version 6510 (0.0006) [2023-03-06 16:48:43,901][23882] Updated weights for policy 0, policy_version 6520 (0.0006) [2023-03-06 16:48:44,670][23882] Updated weights for policy 0, policy_version 6530 (0.0007) [2023-03-06 16:48:45,478][23882] Updated weights for policy 0, policy_version 6540 (0.0006) [2023-03-06 16:48:46,250][23882] Updated weights for policy 0, policy_version 6550 (0.0007) [2023-03-06 16:48:46,748][23556] Fps is (10 sec: 13106.9, 60 sec: 12782.9, 300 sec: 12951.0). Total num frames: 6713344. Throughput: 0: 12907.9. Samples: 6685458. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:48:46,749][23556] Avg episode reward: [(0, '50.413')] [2023-03-06 16:48:47,042][23882] Updated weights for policy 0, policy_version 6560 (0.0007) [2023-03-06 16:48:47,827][23882] Updated weights for policy 0, policy_version 6570 (0.0007) [2023-03-06 16:48:48,608][23882] Updated weights for policy 0, policy_version 6580 (0.0007) [2023-03-06 16:48:49,395][23882] Updated weights for policy 0, policy_version 6590 (0.0007) [2023-03-06 16:48:50,190][23882] Updated weights for policy 0, policy_version 6600 (0.0007) [2023-03-06 16:48:50,977][23882] Updated weights for policy 0, policy_version 6610 (0.0006) [2023-03-06 16:48:51,748][23556] Fps is (10 sec: 13004.7, 60 sec: 12834.1, 300 sec: 12947.5). Total num frames: 6777856. Throughput: 0: 12940.5. Samples: 6763694. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:48:51,748][23556] Avg episode reward: [(0, '51.582')] [2023-03-06 16:48:51,781][23882] Updated weights for policy 0, policy_version 6620 (0.0006) [2023-03-06 16:48:52,561][23882] Updated weights for policy 0, policy_version 6630 (0.0006) [2023-03-06 16:48:53,337][23882] Updated weights for policy 0, policy_version 6640 (0.0006) [2023-03-06 16:48:54,145][23882] Updated weights for policy 0, policy_version 6650 (0.0006) [2023-03-06 16:48:54,929][23882] Updated weights for policy 0, policy_version 6660 (0.0007) [2023-03-06 16:48:55,703][23882] Updated weights for policy 0, policy_version 6670 (0.0007) [2023-03-06 16:48:56,495][23882] Updated weights for policy 0, policy_version 6680 (0.0007) [2023-03-06 16:48:56,748][23556] Fps is (10 sec: 13005.0, 60 sec: 12902.4, 300 sec: 12951.0). Total num frames: 6843392. Throughput: 0: 12985.6. Samples: 6841576. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:48:56,748][23556] Avg episode reward: [(0, '50.308')] [2023-03-06 16:48:57,283][23882] Updated weights for policy 0, policy_version 6690 (0.0006) [2023-03-06 16:48:58,062][23882] Updated weights for policy 0, policy_version 6700 (0.0008) [2023-03-06 16:48:58,839][23882] Updated weights for policy 0, policy_version 6710 (0.0006) [2023-03-06 16:48:59,631][23882] Updated weights for policy 0, policy_version 6720 (0.0005) [2023-03-06 16:49:00,424][23882] Updated weights for policy 0, policy_version 6730 (0.0006) [2023-03-06 16:49:01,198][23882] Updated weights for policy 0, policy_version 6740 (0.0007) [2023-03-06 16:49:01,748][23556] Fps is (10 sec: 13004.8, 60 sec: 12953.6, 300 sec: 12947.5). Total num frames: 6907904. Throughput: 0: 13064.5. Samples: 6880667. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:49:01,748][23556] Avg episode reward: [(0, '50.746')] [2023-03-06 16:49:01,990][23882] Updated weights for policy 0, policy_version 6750 (0.0007) [2023-03-06 16:49:02,772][23882] Updated weights for policy 0, policy_version 6760 (0.0005) [2023-03-06 16:49:03,561][23882] Updated weights for policy 0, policy_version 6770 (0.0006) [2023-03-06 16:49:04,344][23882] Updated weights for policy 0, policy_version 6780 (0.0006) [2023-03-06 16:49:05,120][23882] Updated weights for policy 0, policy_version 6790 (0.0006) [2023-03-06 16:49:05,905][23882] Updated weights for policy 0, policy_version 6800 (0.0007) [2023-03-06 16:49:06,711][23882] Updated weights for policy 0, policy_version 6810 (0.0007) [2023-03-06 16:49:06,748][23556] Fps is (10 sec: 13004.8, 60 sec: 12953.6, 300 sec: 12947.5). Total num frames: 6973440. Throughput: 0: 13037.9. Samples: 6959148. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:49:06,749][23556] Avg episode reward: [(0, '52.543')] [2023-03-06 16:49:07,484][23882] Updated weights for policy 0, policy_version 6820 (0.0007) [2023-03-06 16:49:08,266][23882] Updated weights for policy 0, policy_version 6830 (0.0007) [2023-03-06 16:49:09,051][23882] Updated weights for policy 0, policy_version 6840 (0.0006) [2023-03-06 16:49:09,836][23882] Updated weights for policy 0, policy_version 6850 (0.0007) [2023-03-06 16:49:10,636][23882] Updated weights for policy 0, policy_version 6860 (0.0006) [2023-03-06 16:49:11,404][23882] Updated weights for policy 0, policy_version 6870 (0.0006) [2023-03-06 16:49:11,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13004.8, 300 sec: 12951.0). Total num frames: 7038976. Throughput: 0: 13045.6. Samples: 7037163. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-06 16:49:11,748][23556] Avg episode reward: [(0, '51.033')] [2023-03-06 16:49:12,202][23882] Updated weights for policy 0, policy_version 6880 (0.0006) [2023-03-06 16:49:12,989][23882] Updated weights for policy 0, policy_version 6890 (0.0006) [2023-03-06 16:49:13,793][23882] Updated weights for policy 0, policy_version 6900 (0.0006) [2023-03-06 16:49:14,572][23882] Updated weights for policy 0, policy_version 6910 (0.0007) [2023-03-06 16:49:15,346][23882] Updated weights for policy 0, policy_version 6920 (0.0006) [2023-03-06 16:49:16,133][23882] Updated weights for policy 0, policy_version 6930 (0.0007) [2023-03-06 16:49:16,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 12947.5). Total num frames: 7103488. Throughput: 0: 13043.0. Samples: 7076181. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-06 16:49:16,749][23556] Avg episode reward: [(0, '51.662')] [2023-03-06 16:49:16,916][23882] Updated weights for policy 0, policy_version 6940 (0.0006) [2023-03-06 16:49:17,691][23882] Updated weights for policy 0, policy_version 6950 (0.0006) [2023-03-06 16:49:18,476][23882] Updated weights for policy 0, policy_version 6960 (0.0006) [2023-03-06 16:49:19,262][23882] Updated weights for policy 0, policy_version 6970 (0.0006) [2023-03-06 16:49:20,046][23882] Updated weights for policy 0, policy_version 6980 (0.0005) [2023-03-06 16:49:20,826][23882] Updated weights for policy 0, policy_version 6990 (0.0006) [2023-03-06 16:49:21,605][23882] Updated weights for policy 0, policy_version 7000 (0.0007) [2023-03-06 16:49:21,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 12944.0). Total num frames: 7169024. Throughput: 0: 13048.0. Samples: 7154758. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:49:21,749][23556] Avg episode reward: [(0, '52.380')] [2023-03-06 16:49:22,383][23882] Updated weights for policy 0, policy_version 7010 (0.0007) [2023-03-06 16:49:23,159][23882] Updated weights for policy 0, policy_version 7020 (0.0006) [2023-03-06 16:49:23,942][23882] Updated weights for policy 0, policy_version 7030 (0.0008) [2023-03-06 16:49:24,751][23882] Updated weights for policy 0, policy_version 7040 (0.0006) [2023-03-06 16:49:25,522][23882] Updated weights for policy 0, policy_version 7050 (0.0006) [2023-03-06 16:49:26,313][23882] Updated weights for policy 0, policy_version 7060 (0.0006) [2023-03-06 16:49:26,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 12947.5). Total num frames: 7234560. Throughput: 0: 13049.0. Samples: 7233210. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:49:26,748][23556] Avg episode reward: [(0, '51.220')] [2023-03-06 16:49:27,090][23882] Updated weights for policy 0, policy_version 7070 (0.0006) [2023-03-06 16:49:27,864][23882] Updated weights for policy 0, policy_version 7080 (0.0006) [2023-03-06 16:49:28,653][23882] Updated weights for policy 0, policy_version 7090 (0.0007) [2023-03-06 16:49:29,439][23882] Updated weights for policy 0, policy_version 7100 (0.0006) [2023-03-06 16:49:30,211][23882] Updated weights for policy 0, policy_version 7110 (0.0006) [2023-03-06 16:49:31,003][23882] Updated weights for policy 0, policy_version 7120 (0.0006) [2023-03-06 16:49:31,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 12947.5). Total num frames: 7300096. Throughput: 0: 13043.5. Samples: 7272412. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:49:31,748][23556] Avg episode reward: [(0, '50.078')] [2023-03-06 16:49:31,798][23882] Updated weights for policy 0, policy_version 7130 (0.0006) [2023-03-06 16:49:32,601][23882] Updated weights for policy 0, policy_version 7140 (0.0005) [2023-03-06 16:49:33,388][23882] Updated weights for policy 0, policy_version 7150 (0.0006) [2023-03-06 16:49:34,184][23882] Updated weights for policy 0, policy_version 7160 (0.0007) [2023-03-06 16:49:34,970][23882] Updated weights for policy 0, policy_version 7170 (0.0006) [2023-03-06 16:49:35,755][23882] Updated weights for policy 0, policy_version 7180 (0.0006) [2023-03-06 16:49:36,548][23882] Updated weights for policy 0, policy_version 7190 (0.0007) [2023-03-06 16:49:36,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 12944.0). Total num frames: 7364608. Throughput: 0: 13031.1. Samples: 7350096. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:49:36,748][23556] Avg episode reward: [(0, '53.884')] [2023-03-06 16:49:36,753][23831] Saving new best policy, reward=53.884! [2023-03-06 16:49:37,338][23882] Updated weights for policy 0, policy_version 7200 (0.0007) [2023-03-06 16:49:38,120][23882] Updated weights for policy 0, policy_version 7210 (0.0006) [2023-03-06 16:49:38,918][23882] Updated weights for policy 0, policy_version 7220 (0.0007) [2023-03-06 16:49:39,687][23882] Updated weights for policy 0, policy_version 7230 (0.0007) [2023-03-06 16:49:40,486][23882] Updated weights for policy 0, policy_version 7240 (0.0006) [2023-03-06 16:49:41,285][23882] Updated weights for policy 0, policy_version 7250 (0.0006) [2023-03-06 16:49:41,748][23556] Fps is (10 sec: 12902.3, 60 sec: 13021.8, 300 sec: 12940.6). Total num frames: 7429120. Throughput: 0: 13034.5. Samples: 7428130. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:49:41,748][23556] Avg episode reward: [(0, '50.303')] [2023-03-06 16:49:42,081][23882] Updated weights for policy 0, policy_version 7260 (0.0007) [2023-03-06 16:49:42,849][23882] Updated weights for policy 0, policy_version 7270 (0.0006) [2023-03-06 16:49:43,635][23882] Updated weights for policy 0, policy_version 7280 (0.0006) [2023-03-06 16:49:44,422][23882] Updated weights for policy 0, policy_version 7290 (0.0006) [2023-03-06 16:49:45,190][23882] Updated weights for policy 0, policy_version 7300 (0.0006) [2023-03-06 16:49:45,995][23882] Updated weights for policy 0, policy_version 7310 (0.0006) [2023-03-06 16:49:46,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 12940.6). Total num frames: 7494656. Throughput: 0: 13035.3. Samples: 7467255. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:49:46,748][23556] Avg episode reward: [(0, '50.057')] [2023-03-06 16:49:46,800][23882] Updated weights for policy 0, policy_version 7320 (0.0006) [2023-03-06 16:49:47,569][23882] Updated weights for policy 0, policy_version 7330 (0.0007) [2023-03-06 16:49:48,126][23831] KL-divergence is very high: 128.8753 [2023-03-06 16:49:48,369][23882] Updated weights for policy 0, policy_version 7340 (0.0007) [2023-03-06 16:49:49,145][23882] Updated weights for policy 0, policy_version 7350 (0.0006) [2023-03-06 16:49:49,917][23882] Updated weights for policy 0, policy_version 7360 (0.0006) [2023-03-06 16:49:50,727][23882] Updated weights for policy 0, policy_version 7370 (0.0006) [2023-03-06 16:49:51,503][23882] Updated weights for policy 0, policy_version 7380 (0.0007) [2023-03-06 16:49:51,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 12940.6). Total num frames: 7559168. Throughput: 0: 13021.0. Samples: 7545093. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:49:51,748][23556] Avg episode reward: [(0, '51.392')] [2023-03-06 16:49:52,297][23882] Updated weights for policy 0, policy_version 7390 (0.0005) [2023-03-06 16:49:53,099][23882] Updated weights for policy 0, policy_version 7400 (0.0006) [2023-03-06 16:49:53,888][23882] Updated weights for policy 0, policy_version 7410 (0.0006) [2023-03-06 16:49:54,660][23882] Updated weights for policy 0, policy_version 7420 (0.0006) [2023-03-06 16:49:55,462][23882] Updated weights for policy 0, policy_version 7430 (0.0007) [2023-03-06 16:49:56,239][23882] Updated weights for policy 0, policy_version 7440 (0.0007) [2023-03-06 16:49:56,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13021.8, 300 sec: 12940.6). Total num frames: 7624704. Throughput: 0: 13022.2. Samples: 7623163. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:49:56,748][23556] Avg episode reward: [(0, '50.230')] [2023-03-06 16:49:56,753][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000007446_7624704.pth... [2023-03-06 16:49:56,782][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000004419_4525056.pth [2023-03-06 16:49:57,021][23882] Updated weights for policy 0, policy_version 7450 (0.0008) [2023-03-06 16:49:57,808][23882] Updated weights for policy 0, policy_version 7460 (0.0007) [2023-03-06 16:49:58,586][23882] Updated weights for policy 0, policy_version 7470 (0.0007) [2023-03-06 16:49:59,386][23882] Updated weights for policy 0, policy_version 7480 (0.0006) [2023-03-06 16:50:00,173][23882] Updated weights for policy 0, policy_version 7490 (0.0006) [2023-03-06 16:50:00,955][23882] Updated weights for policy 0, policy_version 7500 (0.0006) [2023-03-06 16:50:01,732][23882] Updated weights for policy 0, policy_version 7510 (0.0006) [2023-03-06 16:50:01,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 12940.6). Total num frames: 7690240. Throughput: 0: 13022.9. Samples: 7662212. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:50:01,748][23556] Avg episode reward: [(0, '50.377')] [2023-03-06 16:50:02,526][23882] Updated weights for policy 0, policy_version 7520 (0.0006) [2023-03-06 16:50:03,324][23882] Updated weights for policy 0, policy_version 7530 (0.0006) [2023-03-06 16:50:04,087][23882] Updated weights for policy 0, policy_version 7540 (0.0006) [2023-03-06 16:50:04,871][23882] Updated weights for policy 0, policy_version 7550 (0.0006) [2023-03-06 16:50:05,658][23882] Updated weights for policy 0, policy_version 7560 (0.0006) [2023-03-06 16:50:06,436][23882] Updated weights for policy 0, policy_version 7570 (0.0007) [2023-03-06 16:50:06,748][23556] Fps is (10 sec: 13005.0, 60 sec: 13021.9, 300 sec: 12940.6). Total num frames: 7754752. Throughput: 0: 13015.9. Samples: 7740471. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:50:06,748][23556] Avg episode reward: [(0, '52.509')] [2023-03-06 16:50:07,218][23882] Updated weights for policy 0, policy_version 7580 (0.0006) [2023-03-06 16:50:08,015][23882] Updated weights for policy 0, policy_version 7590 (0.0006) [2023-03-06 16:50:08,800][23882] Updated weights for policy 0, policy_version 7600 (0.0006) [2023-03-06 16:50:09,575][23882] Updated weights for policy 0, policy_version 7610 (0.0006) [2023-03-06 16:50:10,366][23882] Updated weights for policy 0, policy_version 7620 (0.0007) [2023-03-06 16:50:11,154][23882] Updated weights for policy 0, policy_version 7630 (0.0006) [2023-03-06 16:50:11,748][23556] Fps is (10 sec: 13005.0, 60 sec: 13021.9, 300 sec: 12940.6). Total num frames: 7820288. Throughput: 0: 13009.2. Samples: 7818620. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:50:11,748][23556] Avg episode reward: [(0, '51.199')] [2023-03-06 16:50:11,936][23882] Updated weights for policy 0, policy_version 7640 (0.0006) [2023-03-06 16:50:12,737][23882] Updated weights for policy 0, policy_version 7650 (0.0007) [2023-03-06 16:50:13,508][23882] Updated weights for policy 0, policy_version 7660 (0.0006) [2023-03-06 16:50:14,301][23882] Updated weights for policy 0, policy_version 7670 (0.0006) [2023-03-06 16:50:15,097][23882] Updated weights for policy 0, policy_version 7680 (0.0006) [2023-03-06 16:50:15,882][23882] Updated weights for policy 0, policy_version 7690 (0.0006) [2023-03-06 16:50:16,670][23882] Updated weights for policy 0, policy_version 7700 (0.0006) [2023-03-06 16:50:16,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13039.0, 300 sec: 12940.6). Total num frames: 7885824. Throughput: 0: 13008.9. Samples: 7857813. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 16:50:16,748][23556] Avg episode reward: [(0, '48.368')] [2023-03-06 16:50:17,456][23882] Updated weights for policy 0, policy_version 7710 (0.0006) [2023-03-06 16:50:18,238][23882] Updated weights for policy 0, policy_version 7720 (0.0006) [2023-03-06 16:50:19,029][23882] Updated weights for policy 0, policy_version 7730 (0.0006) [2023-03-06 16:50:19,803][23882] Updated weights for policy 0, policy_version 7740 (0.0006) [2023-03-06 16:50:20,613][23882] Updated weights for policy 0, policy_version 7750 (0.0006) [2023-03-06 16:50:21,388][23882] Updated weights for policy 0, policy_version 7760 (0.0008) [2023-03-06 16:50:21,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 12937.1). Total num frames: 7950336. Throughput: 0: 13016.5. Samples: 7935838. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:50:21,748][23556] Avg episode reward: [(0, '48.526')] [2023-03-06 16:50:22,184][23882] Updated weights for policy 0, policy_version 7770 (0.0007) [2023-03-06 16:50:22,982][23882] Updated weights for policy 0, policy_version 7780 (0.0006) [2023-03-06 16:50:23,760][23882] Updated weights for policy 0, policy_version 7790 (0.0007) [2023-03-06 16:50:24,560][23882] Updated weights for policy 0, policy_version 7800 (0.0007) [2023-03-06 16:50:25,346][23882] Updated weights for policy 0, policy_version 7810 (0.0006) [2023-03-06 16:50:26,133][23882] Updated weights for policy 0, policy_version 7820 (0.0007) [2023-03-06 16:50:26,748][23556] Fps is (10 sec: 12902.3, 60 sec: 13004.8, 300 sec: 12933.6). Total num frames: 8014848. Throughput: 0: 13011.4. Samples: 8013640. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:50:26,748][23556] Avg episode reward: [(0, '56.030')] [2023-03-06 16:50:26,761][23831] Saving new best policy, reward=56.030! [2023-03-06 16:50:26,917][23882] Updated weights for policy 0, policy_version 7830 (0.0006) [2023-03-06 16:50:27,725][23882] Updated weights for policy 0, policy_version 7840 (0.0006) [2023-03-06 16:50:28,506][23882] Updated weights for policy 0, policy_version 7850 (0.0006) [2023-03-06 16:50:29,287][23882] Updated weights for policy 0, policy_version 7860 (0.0006) [2023-03-06 16:50:30,101][23882] Updated weights for policy 0, policy_version 7870 (0.0007) [2023-03-06 16:50:30,889][23882] Updated weights for policy 0, policy_version 7880 (0.0006) [2023-03-06 16:50:31,662][23882] Updated weights for policy 0, policy_version 7890 (0.0007) [2023-03-06 16:50:31,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13004.8, 300 sec: 12937.1). Total num frames: 8080384. Throughput: 0: 12999.8. Samples: 8052247. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:50:31,749][23556] Avg episode reward: [(0, '52.533')] [2023-03-06 16:50:32,446][23882] Updated weights for policy 0, policy_version 7900 (0.0008) [2023-03-06 16:50:33,228][23882] Updated weights for policy 0, policy_version 7910 (0.0007) [2023-03-06 16:50:34,027][23882] Updated weights for policy 0, policy_version 7920 (0.0007) [2023-03-06 16:50:34,827][23882] Updated weights for policy 0, policy_version 7930 (0.0006) [2023-03-06 16:50:35,604][23882] Updated weights for policy 0, policy_version 7940 (0.0007) [2023-03-06 16:50:36,376][23882] Updated weights for policy 0, policy_version 7950 (0.0006) [2023-03-06 16:50:36,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 12933.6). Total num frames: 8144896. Throughput: 0: 13009.0. Samples: 8130498. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:50:36,748][23556] Avg episode reward: [(0, '51.498')] [2023-03-06 16:50:37,167][23882] Updated weights for policy 0, policy_version 7960 (0.0006) [2023-03-06 16:50:37,940][23882] Updated weights for policy 0, policy_version 7970 (0.0006) [2023-03-06 16:50:38,716][23882] Updated weights for policy 0, policy_version 7980 (0.0006) [2023-03-06 16:50:39,506][23882] Updated weights for policy 0, policy_version 7990 (0.0006) [2023-03-06 16:50:40,297][23882] Updated weights for policy 0, policy_version 8000 (0.0006) [2023-03-06 16:50:41,078][23882] Updated weights for policy 0, policy_version 8010 (0.0006) [2023-03-06 16:50:41,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 12933.6). Total num frames: 8210432. Throughput: 0: 13014.2. Samples: 8208801. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:50:41,748][23556] Avg episode reward: [(0, '52.645')] [2023-03-06 16:50:41,869][23882] Updated weights for policy 0, policy_version 8020 (0.0007) [2023-03-06 16:50:42,658][23882] Updated weights for policy 0, policy_version 8030 (0.0007) [2023-03-06 16:50:43,443][23882] Updated weights for policy 0, policy_version 8040 (0.0006) [2023-03-06 16:50:44,231][23882] Updated weights for policy 0, policy_version 8050 (0.0006) [2023-03-06 16:50:45,002][23882] Updated weights for policy 0, policy_version 8060 (0.0007) [2023-03-06 16:50:45,796][23882] Updated weights for policy 0, policy_version 8070 (0.0007) [2023-03-06 16:50:46,590][23882] Updated weights for policy 0, policy_version 8080 (0.0005) [2023-03-06 16:50:46,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13021.9, 300 sec: 12937.1). Total num frames: 8275968. Throughput: 0: 13013.3. Samples: 8247810. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:50:46,748][23556] Avg episode reward: [(0, '59.524')] [2023-03-06 16:50:46,753][23831] Saving new best policy, reward=59.524! [2023-03-06 16:50:47,356][23882] Updated weights for policy 0, policy_version 8090 (0.0007) [2023-03-06 16:50:48,161][23882] Updated weights for policy 0, policy_version 8100 (0.0006) [2023-03-06 16:50:48,945][23882] Updated weights for policy 0, policy_version 8110 (0.0006) [2023-03-06 16:50:49,714][23882] Updated weights for policy 0, policy_version 8120 (0.0006) [2023-03-06 16:50:50,503][23882] Updated weights for policy 0, policy_version 8130 (0.0007) [2023-03-06 16:50:51,289][23882] Updated weights for policy 0, policy_version 8140 (0.0006) [2023-03-06 16:50:51,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13021.8, 300 sec: 12933.6). Total num frames: 8340480. Throughput: 0: 13018.2. Samples: 8326291. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-06 16:50:51,748][23556] Avg episode reward: [(0, '58.516')] [2023-03-06 16:50:52,083][23882] Updated weights for policy 0, policy_version 8150 (0.0007) [2023-03-06 16:50:52,873][23882] Updated weights for policy 0, policy_version 8160 (0.0007) [2023-03-06 16:50:53,649][23882] Updated weights for policy 0, policy_version 8170 (0.0007) [2023-03-06 16:50:54,417][23882] Updated weights for policy 0, policy_version 8180 (0.0006) [2023-03-06 16:50:55,206][23882] Updated weights for policy 0, policy_version 8190 (0.0006) [2023-03-06 16:50:56,001][23882] Updated weights for policy 0, policy_version 8200 (0.0006) [2023-03-06 16:50:56,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 12933.6). Total num frames: 8406016. Throughput: 0: 13021.7. Samples: 8404598. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:50:56,748][23556] Avg episode reward: [(0, '53.696')] [2023-03-06 16:50:56,788][23882] Updated weights for policy 0, policy_version 8210 (0.0006) [2023-03-06 16:50:57,561][23882] Updated weights for policy 0, policy_version 8220 (0.0006) [2023-03-06 16:50:58,344][23882] Updated weights for policy 0, policy_version 8230 (0.0006) [2023-03-06 16:50:59,153][23882] Updated weights for policy 0, policy_version 8240 (0.0006) [2023-03-06 16:50:59,945][23882] Updated weights for policy 0, policy_version 8250 (0.0006) [2023-03-06 16:51:00,722][23882] Updated weights for policy 0, policy_version 8260 (0.0006) [2023-03-06 16:51:01,537][23882] Updated weights for policy 0, policy_version 8270 (0.0007) [2023-03-06 16:51:01,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13004.8, 300 sec: 12933.6). Total num frames: 8470528. Throughput: 0: 13014.1. Samples: 8443449. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:51:01,748][23556] Avg episode reward: [(0, '60.807')] [2023-03-06 16:51:01,752][23831] Saving new best policy, reward=60.807! [2023-03-06 16:51:02,322][23882] Updated weights for policy 0, policy_version 8280 (0.0006) [2023-03-06 16:51:03,099][23882] Updated weights for policy 0, policy_version 8290 (0.0006) [2023-03-06 16:51:03,892][23882] Updated weights for policy 0, policy_version 8300 (0.0007) [2023-03-06 16:51:04,678][23882] Updated weights for policy 0, policy_version 8310 (0.0006) [2023-03-06 16:51:05,461][23882] Updated weights for policy 0, policy_version 8320 (0.0006) [2023-03-06 16:51:06,247][23882] Updated weights for policy 0, policy_version 8330 (0.0007) [2023-03-06 16:51:06,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 12933.6). Total num frames: 8536064. Throughput: 0: 13013.3. Samples: 8521435. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:51:06,748][23556] Avg episode reward: [(0, '67.149')] [2023-03-06 16:51:06,751][23831] Saving new best policy, reward=67.149! [2023-03-06 16:51:07,046][23882] Updated weights for policy 0, policy_version 8340 (0.0007) [2023-03-06 16:51:07,829][23882] Updated weights for policy 0, policy_version 8350 (0.0006) [2023-03-06 16:51:08,617][23882] Updated weights for policy 0, policy_version 8360 (0.0006) [2023-03-06 16:51:09,400][23882] Updated weights for policy 0, policy_version 8370 (0.0006) [2023-03-06 16:51:10,205][23882] Updated weights for policy 0, policy_version 8380 (0.0006) [2023-03-06 16:51:10,994][23882] Updated weights for policy 0, policy_version 8390 (0.0007) [2023-03-06 16:51:11,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 12930.2). Total num frames: 8600576. Throughput: 0: 13008.2. Samples: 8599012. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:51:11,748][23556] Avg episode reward: [(0, '70.892')] [2023-03-06 16:51:11,749][23831] Saving new best policy, reward=70.892! [2023-03-06 16:51:11,797][23882] Updated weights for policy 0, policy_version 8400 (0.0007) [2023-03-06 16:51:12,580][23882] Updated weights for policy 0, policy_version 8410 (0.0006) [2023-03-06 16:51:13,343][23882] Updated weights for policy 0, policy_version 8420 (0.0006) [2023-03-06 16:51:14,135][23882] Updated weights for policy 0, policy_version 8430 (0.0006) [2023-03-06 16:51:14,934][23882] Updated weights for policy 0, policy_version 8440 (0.0006) [2023-03-06 16:51:15,708][23882] Updated weights for policy 0, policy_version 8450 (0.0007) [2023-03-06 16:51:16,500][23882] Updated weights for policy 0, policy_version 8460 (0.0006) [2023-03-06 16:51:16,748][23556] Fps is (10 sec: 13004.6, 60 sec: 13004.8, 300 sec: 12933.6). Total num frames: 8666112. Throughput: 0: 13019.4. Samples: 8638122. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:51:16,749][23556] Avg episode reward: [(0, '67.962')] [2023-03-06 16:51:17,285][23882] Updated weights for policy 0, policy_version 8470 (0.0007) [2023-03-06 16:51:18,088][23882] Updated weights for policy 0, policy_version 8480 (0.0007) [2023-03-06 16:51:18,866][23882] Updated weights for policy 0, policy_version 8490 (0.0006) [2023-03-06 16:51:19,639][23882] Updated weights for policy 0, policy_version 8500 (0.0006) [2023-03-06 16:51:20,415][23882] Updated weights for policy 0, policy_version 8510 (0.0007) [2023-03-06 16:51:21,216][23882] Updated weights for policy 0, policy_version 8520 (0.0006) [2023-03-06 16:51:21,748][23556] Fps is (10 sec: 13005.0, 60 sec: 13004.8, 300 sec: 12930.2). Total num frames: 8730624. Throughput: 0: 13023.2. Samples: 8716543. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:51:21,748][23556] Avg episode reward: [(0, '75.003')] [2023-03-06 16:51:21,757][23831] Saving new best policy, reward=75.003! [2023-03-06 16:51:22,014][23882] Updated weights for policy 0, policy_version 8530 (0.0006) [2023-03-06 16:51:22,817][23882] Updated weights for policy 0, policy_version 8540 (0.0006) [2023-03-06 16:51:23,593][23882] Updated weights for policy 0, policy_version 8550 (0.0006) [2023-03-06 16:51:24,365][23882] Updated weights for policy 0, policy_version 8560 (0.0006) [2023-03-06 16:51:25,162][23882] Updated weights for policy 0, policy_version 8570 (0.0006) [2023-03-06 16:51:25,951][23882] Updated weights for policy 0, policy_version 8580 (0.0006) [2023-03-06 16:51:26,737][23882] Updated weights for policy 0, policy_version 8590 (0.0006) [2023-03-06 16:51:26,748][23556] Fps is (10 sec: 13005.0, 60 sec: 13021.9, 300 sec: 12930.2). Total num frames: 8796160. Throughput: 0: 13010.9. Samples: 8794292. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:51:26,748][23556] Avg episode reward: [(0, '77.523')] [2023-03-06 16:51:26,751][23831] Saving new best policy, reward=77.523! [2023-03-06 16:51:27,497][23882] Updated weights for policy 0, policy_version 8600 (0.0006) [2023-03-06 16:51:28,289][23882] Updated weights for policy 0, policy_version 8610 (0.0006) [2023-03-06 16:51:29,072][23882] Updated weights for policy 0, policy_version 8620 (0.0006) [2023-03-06 16:51:29,863][23882] Updated weights for policy 0, policy_version 8630 (0.0006) [2023-03-06 16:51:30,652][23882] Updated weights for policy 0, policy_version 8640 (0.0006) [2023-03-06 16:51:31,427][23882] Updated weights for policy 0, policy_version 8650 (0.0006) [2023-03-06 16:51:31,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13021.9, 300 sec: 12930.2). Total num frames: 8861696. Throughput: 0: 13018.9. Samples: 8833659. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:51:31,748][23556] Avg episode reward: [(0, '71.928')] [2023-03-06 16:51:32,204][23882] Updated weights for policy 0, policy_version 8660 (0.0007) [2023-03-06 16:51:33,002][23882] Updated weights for policy 0, policy_version 8670 (0.0006) [2023-03-06 16:51:33,787][23882] Updated weights for policy 0, policy_version 8680 (0.0006) [2023-03-06 16:51:34,574][23882] Updated weights for policy 0, policy_version 8690 (0.0006) [2023-03-06 16:51:35,376][23882] Updated weights for policy 0, policy_version 8700 (0.0007) [2023-03-06 16:51:36,156][23882] Updated weights for policy 0, policy_version 8710 (0.0006) [2023-03-06 16:51:36,531][23831] KL-divergence is very high: 225.7810 [2023-03-06 16:51:36,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 12930.2). Total num frames: 8926208. Throughput: 0: 13007.0. Samples: 8911605. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:51:36,748][23556] Avg episode reward: [(0, '72.765')] [2023-03-06 16:51:36,941][23882] Updated weights for policy 0, policy_version 8720 (0.0005) [2023-03-06 16:51:37,722][23882] Updated weights for policy 0, policy_version 8730 (0.0006) [2023-03-06 16:51:38,511][23882] Updated weights for policy 0, policy_version 8740 (0.0006) [2023-03-06 16:51:39,289][23882] Updated weights for policy 0, policy_version 8750 (0.0006) [2023-03-06 16:51:40,067][23882] Updated weights for policy 0, policy_version 8760 (0.0006) [2023-03-06 16:51:40,833][23882] Updated weights for policy 0, policy_version 8770 (0.0006) [2023-03-06 16:51:41,623][23882] Updated weights for policy 0, policy_version 8780 (0.0006) [2023-03-06 16:51:41,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 12930.2). Total num frames: 8991744. Throughput: 0: 13016.7. Samples: 8990349. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:51:41,748][23556] Avg episode reward: [(0, '64.559')] [2023-03-06 16:51:42,400][23882] Updated weights for policy 0, policy_version 8790 (0.0006) [2023-03-06 16:51:43,185][23882] Updated weights for policy 0, policy_version 8800 (0.0006) [2023-03-06 16:51:43,976][23882] Updated weights for policy 0, policy_version 8810 (0.0006) [2023-03-06 16:51:44,771][23882] Updated weights for policy 0, policy_version 8820 (0.0006) [2023-03-06 16:51:45,550][23882] Updated weights for policy 0, policy_version 8830 (0.0006) [2023-03-06 16:51:46,343][23882] Updated weights for policy 0, policy_version 8840 (0.0007) [2023-03-06 16:51:46,748][23556] Fps is (10 sec: 13107.0, 60 sec: 13021.9, 300 sec: 12930.2). Total num frames: 9057280. Throughput: 0: 13018.9. Samples: 9029299. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:51:46,748][23556] Avg episode reward: [(0, '61.167')] [2023-03-06 16:51:47,108][23882] Updated weights for policy 0, policy_version 8850 (0.0006) [2023-03-06 16:51:47,921][23882] Updated weights for policy 0, policy_version 8860 (0.0007) [2023-03-06 16:51:48,686][23882] Updated weights for policy 0, policy_version 8870 (0.0006) [2023-03-06 16:51:49,476][23882] Updated weights for policy 0, policy_version 8880 (0.0006) [2023-03-06 16:51:50,278][23882] Updated weights for policy 0, policy_version 8890 (0.0007) [2023-03-06 16:51:51,060][23882] Updated weights for policy 0, policy_version 8900 (0.0007) [2023-03-06 16:51:51,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 12930.2). Total num frames: 9121792. Throughput: 0: 13021.9. Samples: 9107419. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:51:51,759][23556] Avg episode reward: [(0, '61.940')] [2023-03-06 16:51:51,831][23882] Updated weights for policy 0, policy_version 8910 (0.0006) [2023-03-06 16:51:52,621][23882] Updated weights for policy 0, policy_version 8920 (0.0007) [2023-03-06 16:51:53,384][23882] Updated weights for policy 0, policy_version 8930 (0.0006) [2023-03-06 16:51:54,172][23882] Updated weights for policy 0, policy_version 8940 (0.0007) [2023-03-06 16:51:54,972][23882] Updated weights for policy 0, policy_version 8950 (0.0006) [2023-03-06 16:51:55,733][23882] Updated weights for policy 0, policy_version 8960 (0.0006) [2023-03-06 16:51:56,520][23882] Updated weights for policy 0, policy_version 8970 (0.0007) [2023-03-06 16:51:56,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 12930.2). Total num frames: 9187328. Throughput: 0: 13049.0. Samples: 9186218. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:51:56,754][23556] Avg episode reward: [(0, '64.089')] [2023-03-06 16:51:56,758][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000008973_9188352.pth... [2023-03-06 16:51:56,787][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000005927_6069248.pth [2023-03-06 16:51:57,306][23882] Updated weights for policy 0, policy_version 8980 (0.0006) [2023-03-06 16:51:58,094][23882] Updated weights for policy 0, policy_version 8990 (0.0006) [2023-03-06 16:51:58,880][23882] Updated weights for policy 0, policy_version 9000 (0.0006) [2023-03-06 16:51:59,670][23882] Updated weights for policy 0, policy_version 9010 (0.0006) [2023-03-06 16:52:00,453][23882] Updated weights for policy 0, policy_version 9020 (0.0006) [2023-03-06 16:52:01,209][23882] Updated weights for policy 0, policy_version 9030 (0.0006) [2023-03-06 16:52:01,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13039.0, 300 sec: 12930.2). Total num frames: 9252864. Throughput: 0: 13048.7. Samples: 9225313. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:52:01,759][23556] Avg episode reward: [(0, '75.158')] [2023-03-06 16:52:02,013][23882] Updated weights for policy 0, policy_version 9040 (0.0006) [2023-03-06 16:52:02,802][23882] Updated weights for policy 0, policy_version 9050 (0.0007) [2023-03-06 16:52:03,596][23882] Updated weights for policy 0, policy_version 9060 (0.0006) [2023-03-06 16:52:04,379][23882] Updated weights for policy 0, policy_version 9070 (0.0007) [2023-03-06 16:52:05,142][23882] Updated weights for policy 0, policy_version 9080 (0.0006) [2023-03-06 16:52:05,937][23882] Updated weights for policy 0, policy_version 9090 (0.0007) [2023-03-06 16:52:06,703][23882] Updated weights for policy 0, policy_version 9100 (0.0005) [2023-03-06 16:52:06,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 12933.6). Total num frames: 9318400. Throughput: 0: 13045.1. Samples: 9303576. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:52:06,759][23556] Avg episode reward: [(0, '77.142')] [2023-03-06 16:52:07,489][23882] Updated weights for policy 0, policy_version 9110 (0.0006) [2023-03-06 16:52:08,261][23882] Updated weights for policy 0, policy_version 9120 (0.0007) [2023-03-06 16:52:09,071][23882] Updated weights for policy 0, policy_version 9130 (0.0006) [2023-03-06 16:52:09,852][23882] Updated weights for policy 0, policy_version 9140 (0.0006) [2023-03-06 16:52:10,642][23882] Updated weights for policy 0, policy_version 9150 (0.0006) [2023-03-06 16:52:11,424][23882] Updated weights for policy 0, policy_version 9160 (0.0007) [2023-03-06 16:52:11,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 12933.6). Total num frames: 9383936. Throughput: 0: 13062.2. Samples: 9382090. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:52:11,748][23556] Avg episode reward: [(0, '66.222')] [2023-03-06 16:52:12,178][23882] Updated weights for policy 0, policy_version 9170 (0.0007) [2023-03-06 16:52:12,986][23882] Updated weights for policy 0, policy_version 9180 (0.0006) [2023-03-06 16:52:13,768][23882] Updated weights for policy 0, policy_version 9190 (0.0006) [2023-03-06 16:52:14,551][23882] Updated weights for policy 0, policy_version 9200 (0.0007) [2023-03-06 16:52:15,352][23882] Updated weights for policy 0, policy_version 9210 (0.0007) [2023-03-06 16:52:16,113][23882] Updated weights for policy 0, policy_version 9220 (0.0007) [2023-03-06 16:52:16,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 12933.6). Total num frames: 9448448. Throughput: 0: 13060.0. Samples: 9421362. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:52:16,748][23556] Avg episode reward: [(0, '66.826')] [2023-03-06 16:52:16,909][23882] Updated weights for policy 0, policy_version 9230 (0.0007) [2023-03-06 16:52:17,685][23882] Updated weights for policy 0, policy_version 9240 (0.0006) [2023-03-06 16:52:18,483][23882] Updated weights for policy 0, policy_version 9250 (0.0008) [2023-03-06 16:52:19,262][23882] Updated weights for policy 0, policy_version 9260 (0.0006) [2023-03-06 16:52:20,032][23882] Updated weights for policy 0, policy_version 9270 (0.0006) [2023-03-06 16:52:20,830][23882] Updated weights for policy 0, policy_version 9280 (0.0007) [2023-03-06 16:52:21,627][23882] Updated weights for policy 0, policy_version 9290 (0.0007) [2023-03-06 16:52:21,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 12940.6). Total num frames: 9513984. Throughput: 0: 13064.6. Samples: 9499514. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:52:21,748][23556] Avg episode reward: [(0, '67.256')] [2023-03-06 16:52:22,412][23882] Updated weights for policy 0, policy_version 9300 (0.0006) [2023-03-06 16:52:23,203][23882] Updated weights for policy 0, policy_version 9310 (0.0006) [2023-03-06 16:52:24,006][23882] Updated weights for policy 0, policy_version 9320 (0.0007) [2023-03-06 16:52:24,793][23882] Updated weights for policy 0, policy_version 9330 (0.0006) [2023-03-06 16:52:25,557][23882] Updated weights for policy 0, policy_version 9340 (0.0006) [2023-03-06 16:52:26,333][23882] Updated weights for policy 0, policy_version 9350 (0.0007) [2023-03-06 16:52:26,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 12944.1). Total num frames: 9579520. Throughput: 0: 13054.3. Samples: 9577793. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:52:26,749][23556] Avg episode reward: [(0, '70.256')] [2023-03-06 16:52:27,134][23882] Updated weights for policy 0, policy_version 9360 (0.0007) [2023-03-06 16:52:27,929][23882] Updated weights for policy 0, policy_version 9370 (0.0007) [2023-03-06 16:52:28,700][23882] Updated weights for policy 0, policy_version 9380 (0.0006) [2023-03-06 16:52:29,487][23882] Updated weights for policy 0, policy_version 9390 (0.0006) [2023-03-06 16:52:30,273][23882] Updated weights for policy 0, policy_version 9400 (0.0007) [2023-03-06 16:52:31,057][23882] Updated weights for policy 0, policy_version 9410 (0.0006) [2023-03-06 16:52:31,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 12947.5). Total num frames: 9644032. Throughput: 0: 13054.6. Samples: 9616754. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:52:31,748][23556] Avg episode reward: [(0, '60.297')] [2023-03-06 16:52:31,845][23882] Updated weights for policy 0, policy_version 9420 (0.0007) [2023-03-06 16:52:32,630][23882] Updated weights for policy 0, policy_version 9430 (0.0006) [2023-03-06 16:52:33,425][23882] Updated weights for policy 0, policy_version 9440 (0.0006) [2023-03-06 16:52:34,216][23882] Updated weights for policy 0, policy_version 9450 (0.0007) [2023-03-06 16:52:34,985][23882] Updated weights for policy 0, policy_version 9460 (0.0006) [2023-03-06 16:52:35,786][23882] Updated weights for policy 0, policy_version 9470 (0.0007) [2023-03-06 16:52:36,560][23882] Updated weights for policy 0, policy_version 9480 (0.0007) [2023-03-06 16:52:36,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 12961.4). Total num frames: 9709568. Throughput: 0: 13052.3. Samples: 9694774. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:52:36,748][23556] Avg episode reward: [(0, '65.569')] [2023-03-06 16:52:37,322][23882] Updated weights for policy 0, policy_version 9490 (0.0007) [2023-03-06 16:52:38,122][23882] Updated weights for policy 0, policy_version 9500 (0.0006) [2023-03-06 16:52:38,890][23882] Updated weights for policy 0, policy_version 9510 (0.0006) [2023-03-06 16:52:39,684][23882] Updated weights for policy 0, policy_version 9520 (0.0007) [2023-03-06 16:52:40,473][23882] Updated weights for policy 0, policy_version 9530 (0.0007) [2023-03-06 16:52:41,245][23882] Updated weights for policy 0, policy_version 9540 (0.0006) [2023-03-06 16:52:41,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 12978.8). Total num frames: 9775104. Throughput: 0: 13048.4. Samples: 9773396. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:52:41,748][23556] Avg episode reward: [(0, '70.333')] [2023-03-06 16:52:42,032][23882] Updated weights for policy 0, policy_version 9550 (0.0006) [2023-03-06 16:52:42,830][23882] Updated weights for policy 0, policy_version 9560 (0.0007) [2023-03-06 16:52:43,609][23882] Updated weights for policy 0, policy_version 9570 (0.0006) [2023-03-06 16:52:44,387][23882] Updated weights for policy 0, policy_version 9580 (0.0006) [2023-03-06 16:52:45,180][23882] Updated weights for policy 0, policy_version 9590 (0.0006) [2023-03-06 16:52:45,966][23882] Updated weights for policy 0, policy_version 9600 (0.0007) [2023-03-06 16:52:46,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 12989.2). Total num frames: 9839616. Throughput: 0: 13049.0. Samples: 9812521. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:52:46,748][23556] Avg episode reward: [(0, '64.281')] [2023-03-06 16:52:46,768][23882] Updated weights for policy 0, policy_version 9610 (0.0006) [2023-03-06 16:52:47,549][23882] Updated weights for policy 0, policy_version 9620 (0.0006) [2023-03-06 16:52:48,332][23882] Updated weights for policy 0, policy_version 9630 (0.0007) [2023-03-06 16:52:49,117][23882] Updated weights for policy 0, policy_version 9640 (0.0006) [2023-03-06 16:52:49,901][23882] Updated weights for policy 0, policy_version 9650 (0.0006) [2023-03-06 16:52:50,684][23882] Updated weights for policy 0, policy_version 9660 (0.0006) [2023-03-06 16:52:51,465][23882] Updated weights for policy 0, policy_version 9670 (0.0006) [2023-03-06 16:52:51,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13003.1). Total num frames: 9905152. Throughput: 0: 13045.1. Samples: 9890604. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:52:51,759][23556] Avg episode reward: [(0, '64.611')] [2023-03-06 16:52:52,255][23882] Updated weights for policy 0, policy_version 9680 (0.0006) [2023-03-06 16:52:53,051][23882] Updated weights for policy 0, policy_version 9690 (0.0006) [2023-03-06 16:52:53,816][23882] Updated weights for policy 0, policy_version 9700 (0.0007) [2023-03-06 16:52:54,637][23882] Updated weights for policy 0, policy_version 9710 (0.0006) [2023-03-06 16:52:55,416][23882] Updated weights for policy 0, policy_version 9720 (0.0007) [2023-03-06 16:52:56,201][23882] Updated weights for policy 0, policy_version 9730 (0.0007) [2023-03-06 16:52:56,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13013.5). Total num frames: 9969664. Throughput: 0: 13031.7. Samples: 9968519. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:52:56,749][23556] Avg episode reward: [(0, '53.274')] [2023-03-06 16:52:56,993][23882] Updated weights for policy 0, policy_version 9740 (0.0007) [2023-03-06 16:52:57,786][23882] Updated weights for policy 0, policy_version 9750 (0.0007) [2023-03-06 16:52:58,566][23882] Updated weights for policy 0, policy_version 9760 (0.0007) [2023-03-06 16:52:59,391][23882] Updated weights for policy 0, policy_version 9770 (0.0007) [2023-03-06 16:53:00,152][23882] Updated weights for policy 0, policy_version 9780 (0.0006) [2023-03-06 16:53:00,922][23882] Updated weights for policy 0, policy_version 9790 (0.0007) [2023-03-06 16:53:01,729][23882] Updated weights for policy 0, policy_version 9800 (0.0006) [2023-03-06 16:53:01,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13013.5). Total num frames: 10035200. Throughput: 0: 13025.3. Samples: 10007501. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:53:01,759][23556] Avg episode reward: [(0, '54.262')] [2023-03-06 16:53:02,510][23882] Updated weights for policy 0, policy_version 9810 (0.0006) [2023-03-06 16:53:03,294][23882] Updated weights for policy 0, policy_version 9820 (0.0006) [2023-03-06 16:53:04,078][23882] Updated weights for policy 0, policy_version 9830 (0.0007) [2023-03-06 16:53:04,858][23882] Updated weights for policy 0, policy_version 9840 (0.0007) [2023-03-06 16:53:05,625][23882] Updated weights for policy 0, policy_version 9850 (0.0006) [2023-03-06 16:53:06,413][23882] Updated weights for policy 0, policy_version 9860 (0.0006) [2023-03-06 16:53:06,748][23556] Fps is (10 sec: 13107.4, 60 sec: 13039.0, 300 sec: 13023.9). Total num frames: 10100736. Throughput: 0: 13030.5. Samples: 10085886. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:53:06,750][23556] Avg episode reward: [(0, '58.271')] [2023-03-06 16:53:07,193][23882] Updated weights for policy 0, policy_version 9870 (0.0006) [2023-03-06 16:53:07,964][23882] Updated weights for policy 0, policy_version 9880 (0.0007) [2023-03-06 16:53:08,739][23882] Updated weights for policy 0, policy_version 9890 (0.0006) [2023-03-06 16:53:09,512][23882] Updated weights for policy 0, policy_version 9900 (0.0006) [2023-03-06 16:53:10,302][23882] Updated weights for policy 0, policy_version 9910 (0.0006) [2023-03-06 16:53:11,067][23882] Updated weights for policy 0, policy_version 9920 (0.0006) [2023-03-06 16:53:11,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 10166272. Throughput: 0: 13046.9. Samples: 10164905. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:53:11,748][23556] Avg episode reward: [(0, '51.688')] [2023-03-06 16:53:11,850][23882] Updated weights for policy 0, policy_version 9930 (0.0006) [2023-03-06 16:53:12,626][23882] Updated weights for policy 0, policy_version 9940 (0.0007) [2023-03-06 16:53:13,410][23882] Updated weights for policy 0, policy_version 9950 (0.0005) [2023-03-06 16:53:14,191][23882] Updated weights for policy 0, policy_version 9960 (0.0006) [2023-03-06 16:53:14,961][23882] Updated weights for policy 0, policy_version 9970 (0.0006) [2023-03-06 16:53:15,757][23882] Updated weights for policy 0, policy_version 9980 (0.0007) [2023-03-06 16:53:16,541][23882] Updated weights for policy 0, policy_version 9990 (0.0007) [2023-03-06 16:53:16,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13034.3). Total num frames: 10231808. Throughput: 0: 13060.0. Samples: 10204451. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:53:16,748][23556] Avg episode reward: [(0, '54.717')] [2023-03-06 16:53:17,321][23882] Updated weights for policy 0, policy_version 10000 (0.0006) [2023-03-06 16:53:18,095][23882] Updated weights for policy 0, policy_version 10010 (0.0006) [2023-03-06 16:53:18,902][23882] Updated weights for policy 0, policy_version 10020 (0.0006) [2023-03-06 16:53:19,683][23882] Updated weights for policy 0, policy_version 10030 (0.0006) [2023-03-06 16:53:20,461][23882] Updated weights for policy 0, policy_version 10040 (0.0005) [2023-03-06 16:53:21,249][23882] Updated weights for policy 0, policy_version 10050 (0.0006) [2023-03-06 16:53:21,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 10297344. Throughput: 0: 13063.7. Samples: 10282641. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:53:21,748][23556] Avg episode reward: [(0, '53.683')] [2023-03-06 16:53:22,036][23882] Updated weights for policy 0, policy_version 10060 (0.0006) [2023-03-06 16:53:22,816][23882] Updated weights for policy 0, policy_version 10070 (0.0006) [2023-03-06 16:53:23,598][23882] Updated weights for policy 0, policy_version 10080 (0.0008) [2023-03-06 16:53:24,400][23882] Updated weights for policy 0, policy_version 10090 (0.0006) [2023-03-06 16:53:25,177][23882] Updated weights for policy 0, policy_version 10100 (0.0006) [2023-03-06 16:53:25,959][23882] Updated weights for policy 0, policy_version 10110 (0.0007) [2023-03-06 16:53:26,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13039.0, 300 sec: 13034.3). Total num frames: 10361856. Throughput: 0: 13054.7. Samples: 10360854. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:53:26,748][23556] Avg episode reward: [(0, '67.557')] [2023-03-06 16:53:26,758][23882] Updated weights for policy 0, policy_version 10120 (0.0006) [2023-03-06 16:53:27,537][23882] Updated weights for policy 0, policy_version 10130 (0.0007) [2023-03-06 16:53:28,321][23882] Updated weights for policy 0, policy_version 10140 (0.0006) [2023-03-06 16:53:29,116][23882] Updated weights for policy 0, policy_version 10150 (0.0006) [2023-03-06 16:53:29,881][23882] Updated weights for policy 0, policy_version 10160 (0.0006) [2023-03-06 16:53:30,674][23882] Updated weights for policy 0, policy_version 10170 (0.0007) [2023-03-06 16:53:31,459][23882] Updated weights for policy 0, policy_version 10180 (0.0007) [2023-03-06 16:53:31,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13034.3). Total num frames: 10427392. Throughput: 0: 13053.8. Samples: 10399942. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:53:31,748][23556] Avg episode reward: [(0, '74.925')] [2023-03-06 16:53:32,238][23882] Updated weights for policy 0, policy_version 10190 (0.0006) [2023-03-06 16:53:33,019][23882] Updated weights for policy 0, policy_version 10200 (0.0007) [2023-03-06 16:53:33,820][23882] Updated weights for policy 0, policy_version 10210 (0.0007) [2023-03-06 16:53:34,605][23882] Updated weights for policy 0, policy_version 10220 (0.0007) [2023-03-06 16:53:35,392][23882] Updated weights for policy 0, policy_version 10230 (0.0006) [2023-03-06 16:53:36,170][23882] Updated weights for policy 0, policy_version 10240 (0.0007) [2023-03-06 16:53:36,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13034.3). Total num frames: 10492928. Throughput: 0: 13057.5. Samples: 10478190. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:53:36,748][23556] Avg episode reward: [(0, '82.303')] [2023-03-06 16:53:36,753][23831] Saving new best policy, reward=82.303! [2023-03-06 16:53:36,949][23882] Updated weights for policy 0, policy_version 10250 (0.0006) [2023-03-06 16:53:37,741][23882] Updated weights for policy 0, policy_version 10260 (0.0006) [2023-03-06 16:53:38,520][23882] Updated weights for policy 0, policy_version 10270 (0.0007) [2023-03-06 16:53:39,297][23882] Updated weights for policy 0, policy_version 10280 (0.0007) [2023-03-06 16:53:40,083][23882] Updated weights for policy 0, policy_version 10290 (0.0006) [2023-03-06 16:53:40,873][23882] Updated weights for policy 0, policy_version 10300 (0.0006) [2023-03-06 16:53:41,661][23882] Updated weights for policy 0, policy_version 10310 (0.0006) [2023-03-06 16:53:41,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13034.3). Total num frames: 10558464. Throughput: 0: 13070.4. Samples: 10556684. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:53:41,748][23556] Avg episode reward: [(0, '89.147')] [2023-03-06 16:53:41,749][23831] Saving new best policy, reward=89.147! [2023-03-06 16:53:42,460][23882] Updated weights for policy 0, policy_version 10320 (0.0007) [2023-03-06 16:53:43,228][23882] Updated weights for policy 0, policy_version 10330 (0.0006) [2023-03-06 16:53:44,020][23882] Updated weights for policy 0, policy_version 10340 (0.0007) [2023-03-06 16:53:44,805][23882] Updated weights for policy 0, policy_version 10350 (0.0007) [2023-03-06 16:53:45,597][23882] Updated weights for policy 0, policy_version 10360 (0.0006) [2023-03-06 16:53:46,372][23882] Updated weights for policy 0, policy_version 10370 (0.0007) [2023-03-06 16:53:46,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13037.8). Total num frames: 10624000. Throughput: 0: 13072.3. Samples: 10595753. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:53:46,748][23556] Avg episode reward: [(0, '121.531')] [2023-03-06 16:53:46,753][23831] Saving new best policy, reward=121.531! [2023-03-06 16:53:47,157][23882] Updated weights for policy 0, policy_version 10380 (0.0007) [2023-03-06 16:53:47,953][23882] Updated weights for policy 0, policy_version 10390 (0.0007) [2023-03-06 16:53:48,724][23882] Updated weights for policy 0, policy_version 10400 (0.0006) [2023-03-06 16:53:49,520][23882] Updated weights for policy 0, policy_version 10410 (0.0006) [2023-03-06 16:53:50,295][23882] Updated weights for policy 0, policy_version 10420 (0.0006) [2023-03-06 16:53:51,084][23882] Updated weights for policy 0, policy_version 10430 (0.0006) [2023-03-06 16:53:51,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13034.3). Total num frames: 10688512. Throughput: 0: 13068.7. Samples: 10673977. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:53:51,748][23556] Avg episode reward: [(0, '106.888')] [2023-03-06 16:53:51,870][23882] Updated weights for policy 0, policy_version 10440 (0.0006) [2023-03-06 16:53:52,653][23882] Updated weights for policy 0, policy_version 10450 (0.0006) [2023-03-06 16:53:53,431][23882] Updated weights for policy 0, policy_version 10460 (0.0006) [2023-03-06 16:53:54,214][23882] Updated weights for policy 0, policy_version 10470 (0.0007) [2023-03-06 16:53:54,993][23882] Updated weights for policy 0, policy_version 10480 (0.0005) [2023-03-06 16:53:55,795][23882] Updated weights for policy 0, policy_version 10490 (0.0006) [2023-03-06 16:53:56,566][23882] Updated weights for policy 0, policy_version 10500 (0.0006) [2023-03-06 16:53:56,748][23556] Fps is (10 sec: 13004.5, 60 sec: 13073.1, 300 sec: 13037.8). Total num frames: 10754048. Throughput: 0: 13051.8. Samples: 10752235. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:53:56,748][23556] Avg episode reward: [(0, '98.215')] [2023-03-06 16:53:56,752][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000010502_10754048.pth... [2023-03-06 16:53:56,780][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000007446_7624704.pth [2023-03-06 16:53:57,365][23882] Updated weights for policy 0, policy_version 10510 (0.0006) [2023-03-06 16:53:58,152][23882] Updated weights for policy 0, policy_version 10520 (0.0006) [2023-03-06 16:53:58,931][23882] Updated weights for policy 0, policy_version 10530 (0.0006) [2023-03-06 16:53:59,728][23882] Updated weights for policy 0, policy_version 10540 (0.0006) [2023-03-06 16:54:00,510][23882] Updated weights for policy 0, policy_version 10550 (0.0006) [2023-03-06 16:54:01,287][23882] Updated weights for policy 0, policy_version 10560 (0.0006) [2023-03-06 16:54:01,748][23556] Fps is (10 sec: 13004.6, 60 sec: 13056.0, 300 sec: 13034.3). Total num frames: 10818560. Throughput: 0: 13038.7. Samples: 10791196. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:54:01,749][23556] Avg episode reward: [(0, '105.330')] [2023-03-06 16:54:02,082][23882] Updated weights for policy 0, policy_version 10570 (0.0005) [2023-03-06 16:54:02,859][23882] Updated weights for policy 0, policy_version 10580 (0.0006) [2023-03-06 16:54:03,653][23882] Updated weights for policy 0, policy_version 10590 (0.0006) [2023-03-06 16:54:04,421][23882] Updated weights for policy 0, policy_version 10600 (0.0006) [2023-03-06 16:54:05,198][23882] Updated weights for policy 0, policy_version 10610 (0.0007) [2023-03-06 16:54:05,976][23882] Updated weights for policy 0, policy_version 10620 (0.0007) [2023-03-06 16:54:06,748][23556] Fps is (10 sec: 13005.0, 60 sec: 13056.0, 300 sec: 13034.3). Total num frames: 10884096. Throughput: 0: 13048.3. Samples: 10869815. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:54:06,748][23556] Avg episode reward: [(0, '103.965')] [2023-03-06 16:54:06,753][23882] Updated weights for policy 0, policy_version 10630 (0.0006) [2023-03-06 16:54:07,522][23882] Updated weights for policy 0, policy_version 10640 (0.0006) [2023-03-06 16:54:08,319][23882] Updated weights for policy 0, policy_version 10650 (0.0006) [2023-03-06 16:54:09,096][23882] Updated weights for policy 0, policy_version 10660 (0.0007) [2023-03-06 16:54:09,879][23882] Updated weights for policy 0, policy_version 10670 (0.0006) [2023-03-06 16:54:10,668][23882] Updated weights for policy 0, policy_version 10680 (0.0006) [2023-03-06 16:54:11,444][23882] Updated weights for policy 0, policy_version 10690 (0.0006) [2023-03-06 16:54:11,748][23556] Fps is (10 sec: 13107.4, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 10949632. Throughput: 0: 13059.5. Samples: 10948532. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:54:11,748][23556] Avg episode reward: [(0, '105.268')] [2023-03-06 16:54:12,222][23882] Updated weights for policy 0, policy_version 10700 (0.0007) [2023-03-06 16:54:13,009][23882] Updated weights for policy 0, policy_version 10710 (0.0006) [2023-03-06 16:54:13,778][23882] Updated weights for policy 0, policy_version 10720 (0.0006) [2023-03-06 16:54:14,577][23882] Updated weights for policy 0, policy_version 10730 (0.0006) [2023-03-06 16:54:15,353][23882] Updated weights for policy 0, policy_version 10740 (0.0006) [2023-03-06 16:54:16,129][23882] Updated weights for policy 0, policy_version 10750 (0.0006) [2023-03-06 16:54:16,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 11015168. Throughput: 0: 13061.8. Samples: 10987721. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:54:16,748][23556] Avg episode reward: [(0, '115.742')] [2023-03-06 16:54:16,917][23882] Updated weights for policy 0, policy_version 10760 (0.0008) [2023-03-06 16:54:17,713][23882] Updated weights for policy 0, policy_version 10770 (0.0007) [2023-03-06 16:54:18,483][23882] Updated weights for policy 0, policy_version 10780 (0.0006) [2023-03-06 16:54:19,276][23882] Updated weights for policy 0, policy_version 10790 (0.0006) [2023-03-06 16:54:20,067][23882] Updated weights for policy 0, policy_version 10800 (0.0006) [2023-03-06 16:54:20,853][23882] Updated weights for policy 0, policy_version 10810 (0.0007) [2023-03-06 16:54:21,623][23882] Updated weights for policy 0, policy_version 10820 (0.0006) [2023-03-06 16:54:21,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 11080704. Throughput: 0: 13063.1. Samples: 11066031. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:54:21,748][23556] Avg episode reward: [(0, '97.758')] [2023-03-06 16:54:22,419][23882] Updated weights for policy 0, policy_version 10830 (0.0007) [2023-03-06 16:54:23,215][23882] Updated weights for policy 0, policy_version 10840 (0.0006) [2023-03-06 16:54:24,001][23882] Updated weights for policy 0, policy_version 10850 (0.0006) [2023-03-06 16:54:24,769][23882] Updated weights for policy 0, policy_version 10860 (0.0006) [2023-03-06 16:54:25,549][23882] Updated weights for policy 0, policy_version 10870 (0.0007) [2023-03-06 16:54:26,318][23882] Updated weights for policy 0, policy_version 10880 (0.0006) [2023-03-06 16:54:26,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.0, 300 sec: 13037.8). Total num frames: 11146240. Throughput: 0: 13069.4. Samples: 11144810. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:54:26,748][23556] Avg episode reward: [(0, '107.783')] [2023-03-06 16:54:27,090][23882] Updated weights for policy 0, policy_version 10890 (0.0006) [2023-03-06 16:54:27,869][23882] Updated weights for policy 0, policy_version 10900 (0.0006) [2023-03-06 16:54:28,656][23882] Updated weights for policy 0, policy_version 10910 (0.0006) [2023-03-06 16:54:29,430][23882] Updated weights for policy 0, policy_version 10920 (0.0006) [2023-03-06 16:54:30,210][23882] Updated weights for policy 0, policy_version 10930 (0.0006) [2023-03-06 16:54:30,998][23882] Updated weights for policy 0, policy_version 10940 (0.0007) [2023-03-06 16:54:31,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13041.3). Total num frames: 11211776. Throughput: 0: 13073.3. Samples: 11184055. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:54:31,748][23556] Avg episode reward: [(0, '95.804')] [2023-03-06 16:54:31,769][23882] Updated weights for policy 0, policy_version 10950 (0.0007) [2023-03-06 16:54:32,551][23882] Updated weights for policy 0, policy_version 10960 (0.0006) [2023-03-06 16:54:33,332][23882] Updated weights for policy 0, policy_version 10970 (0.0006) [2023-03-06 16:54:34,129][23882] Updated weights for policy 0, policy_version 10980 (0.0006) [2023-03-06 16:54:34,907][23882] Updated weights for policy 0, policy_version 10990 (0.0007) [2023-03-06 16:54:35,681][23882] Updated weights for policy 0, policy_version 11000 (0.0007) [2023-03-06 16:54:36,472][23882] Updated weights for policy 0, policy_version 11010 (0.0007) [2023-03-06 16:54:36,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.0, 300 sec: 13044.7). Total num frames: 11277312. Throughput: 0: 13084.4. Samples: 11262778. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:54:36,748][23556] Avg episode reward: [(0, '112.584')] [2023-03-06 16:54:37,259][23882] Updated weights for policy 0, policy_version 11020 (0.0006) [2023-03-06 16:54:38,049][23882] Updated weights for policy 0, policy_version 11030 (0.0006) [2023-03-06 16:54:38,837][23882] Updated weights for policy 0, policy_version 11040 (0.0006) [2023-03-06 16:54:39,628][23882] Updated weights for policy 0, policy_version 11050 (0.0007) [2023-03-06 16:54:40,401][23882] Updated weights for policy 0, policy_version 11060 (0.0006) [2023-03-06 16:54:41,184][23882] Updated weights for policy 0, policy_version 11070 (0.0007) [2023-03-06 16:54:41,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13044.7). Total num frames: 11342848. Throughput: 0: 13082.2. Samples: 11340932. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:54:41,748][23556] Avg episode reward: [(0, '105.249')] [2023-03-06 16:54:41,988][23882] Updated weights for policy 0, policy_version 11080 (0.0007) [2023-03-06 16:54:42,767][23882] Updated weights for policy 0, policy_version 11090 (0.0006) [2023-03-06 16:54:43,550][23882] Updated weights for policy 0, policy_version 11100 (0.0007) [2023-03-06 16:54:44,346][23882] Updated weights for policy 0, policy_version 11110 (0.0006) [2023-03-06 16:54:45,133][23882] Updated weights for policy 0, policy_version 11120 (0.0006) [2023-03-06 16:54:45,898][23882] Updated weights for policy 0, policy_version 11130 (0.0006) [2023-03-06 16:54:46,700][23882] Updated weights for policy 0, policy_version 11140 (0.0007) [2023-03-06 16:54:46,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13055.9, 300 sec: 13044.7). Total num frames: 11407360. Throughput: 0: 13083.6. Samples: 11379959. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:54:46,749][23556] Avg episode reward: [(0, '109.732')] [2023-03-06 16:54:47,493][23882] Updated weights for policy 0, policy_version 11150 (0.0007) [2023-03-06 16:54:48,266][23882] Updated weights for policy 0, policy_version 11160 (0.0006) [2023-03-06 16:54:49,054][23882] Updated weights for policy 0, policy_version 11170 (0.0007) [2023-03-06 16:54:49,824][23882] Updated weights for policy 0, policy_version 11180 (0.0006) [2023-03-06 16:54:50,623][23882] Updated weights for policy 0, policy_version 11190 (0.0006) [2023-03-06 16:54:51,409][23882] Updated weights for policy 0, policy_version 11200 (0.0007) [2023-03-06 16:54:51,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13073.0, 300 sec: 13044.7). Total num frames: 11472896. Throughput: 0: 13076.8. Samples: 11458274. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:54:51,748][23556] Avg episode reward: [(0, '126.182')] [2023-03-06 16:54:51,749][23831] Saving new best policy, reward=126.182! [2023-03-06 16:54:52,193][23882] Updated weights for policy 0, policy_version 11210 (0.0006) [2023-03-06 16:54:52,981][23882] Updated weights for policy 0, policy_version 11220 (0.0006) [2023-03-06 16:54:53,750][23882] Updated weights for policy 0, policy_version 11230 (0.0007) [2023-03-06 16:54:54,552][23882] Updated weights for policy 0, policy_version 11240 (0.0007) [2023-03-06 16:54:55,337][23882] Updated weights for policy 0, policy_version 11250 (0.0006) [2023-03-06 16:54:56,126][23882] Updated weights for policy 0, policy_version 11260 (0.0007) [2023-03-06 16:54:56,748][23556] Fps is (10 sec: 13107.4, 60 sec: 13073.1, 300 sec: 13044.7). Total num frames: 11538432. Throughput: 0: 13061.2. Samples: 11536284. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:54:56,748][23556] Avg episode reward: [(0, '96.651')] [2023-03-06 16:54:56,917][23882] Updated weights for policy 0, policy_version 11270 (0.0006) [2023-03-06 16:54:57,701][23882] Updated weights for policy 0, policy_version 11280 (0.0007) [2023-03-06 16:54:58,483][23882] Updated weights for policy 0, policy_version 11290 (0.0007) [2023-03-06 16:54:59,257][23882] Updated weights for policy 0, policy_version 11300 (0.0006) [2023-03-06 16:55:00,021][23882] Updated weights for policy 0, policy_version 11310 (0.0006) [2023-03-06 16:55:00,804][23882] Updated weights for policy 0, policy_version 11320 (0.0006) [2023-03-06 16:55:01,596][23882] Updated weights for policy 0, policy_version 11330 (0.0007) [2023-03-06 16:55:01,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13073.1, 300 sec: 13044.7). Total num frames: 11602944. Throughput: 0: 13064.9. Samples: 11575640. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:55:01,748][23556] Avg episode reward: [(0, '123.505')] [2023-03-06 16:55:02,382][23882] Updated weights for policy 0, policy_version 11340 (0.0006) [2023-03-06 16:55:03,166][23882] Updated weights for policy 0, policy_version 11350 (0.0006) [2023-03-06 16:55:03,948][23882] Updated weights for policy 0, policy_version 11360 (0.0006) [2023-03-06 16:55:04,724][23882] Updated weights for policy 0, policy_version 11370 (0.0006) [2023-03-06 16:55:05,495][23882] Updated weights for policy 0, policy_version 11380 (0.0006) [2023-03-06 16:55:06,281][23882] Updated weights for policy 0, policy_version 11390 (0.0007) [2023-03-06 16:55:06,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13073.1, 300 sec: 13044.7). Total num frames: 11668480. Throughput: 0: 13071.0. Samples: 11654226. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:55:06,748][23556] Avg episode reward: [(0, '107.770')] [2023-03-06 16:55:07,074][23882] Updated weights for policy 0, policy_version 11400 (0.0006) [2023-03-06 16:55:07,834][23882] Updated weights for policy 0, policy_version 11410 (0.0007) [2023-03-06 16:55:08,625][23882] Updated weights for policy 0, policy_version 11420 (0.0008) [2023-03-06 16:55:09,407][23882] Updated weights for policy 0, policy_version 11430 (0.0006) [2023-03-06 16:55:10,189][23882] Updated weights for policy 0, policy_version 11440 (0.0006) [2023-03-06 16:55:10,998][23882] Updated weights for policy 0, policy_version 11450 (0.0006) [2023-03-06 16:55:11,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13073.1, 300 sec: 13044.7). Total num frames: 11734016. Throughput: 0: 13057.9. Samples: 11732415. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:55:11,748][23556] Avg episode reward: [(0, '130.863')] [2023-03-06 16:55:11,749][23831] Saving new best policy, reward=130.863! [2023-03-06 16:55:11,802][23882] Updated weights for policy 0, policy_version 11460 (0.0007) [2023-03-06 16:55:12,561][23882] Updated weights for policy 0, policy_version 11470 (0.0006) [2023-03-06 16:55:13,359][23882] Updated weights for policy 0, policy_version 11480 (0.0006) [2023-03-06 16:55:14,138][23882] Updated weights for policy 0, policy_version 11490 (0.0006) [2023-03-06 16:55:14,906][23882] Updated weights for policy 0, policy_version 11500 (0.0006) [2023-03-06 16:55:15,683][23882] Updated weights for policy 0, policy_version 11510 (0.0005) [2023-03-06 16:55:16,465][23882] Updated weights for policy 0, policy_version 11520 (0.0006) [2023-03-06 16:55:16,748][23556] Fps is (10 sec: 13107.0, 60 sec: 13073.0, 300 sec: 13048.2). Total num frames: 11799552. Throughput: 0: 13062.0. Samples: 11771846. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:55:16,749][23556] Avg episode reward: [(0, '104.660')] [2023-03-06 16:55:17,236][23882] Updated weights for policy 0, policy_version 11530 (0.0006) [2023-03-06 16:55:18,017][23882] Updated weights for policy 0, policy_version 11540 (0.0006) [2023-03-06 16:55:18,798][23882] Updated weights for policy 0, policy_version 11550 (0.0006) [2023-03-06 16:55:19,600][23882] Updated weights for policy 0, policy_version 11560 (0.0007) [2023-03-06 16:55:20,381][23882] Updated weights for policy 0, policy_version 11570 (0.0006) [2023-03-06 16:55:21,170][23882] Updated weights for policy 0, policy_version 11580 (0.0006) [2023-03-06 16:55:21,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13051.7). Total num frames: 11865088. Throughput: 0: 13056.3. Samples: 11850309. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:55:21,748][23556] Avg episode reward: [(0, '99.914')] [2023-03-06 16:55:21,962][23882] Updated weights for policy 0, policy_version 11590 (0.0006) [2023-03-06 16:55:22,733][23882] Updated weights for policy 0, policy_version 11600 (0.0006) [2023-03-06 16:55:23,518][23882] Updated weights for policy 0, policy_version 11610 (0.0006) [2023-03-06 16:55:24,309][23882] Updated weights for policy 0, policy_version 11620 (0.0006) [2023-03-06 16:55:25,079][23882] Updated weights for policy 0, policy_version 11630 (0.0007) [2023-03-06 16:55:25,868][23882] Updated weights for policy 0, policy_version 11640 (0.0006) [2023-03-06 16:55:26,644][23882] Updated weights for policy 0, policy_version 11650 (0.0006) [2023-03-06 16:55:26,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13051.7). Total num frames: 11930624. Throughput: 0: 13064.8. Samples: 11928850. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:55:26,748][23556] Avg episode reward: [(0, '92.809')] [2023-03-06 16:55:27,420][23882] Updated weights for policy 0, policy_version 11660 (0.0007) [2023-03-06 16:55:28,214][23882] Updated weights for policy 0, policy_version 11670 (0.0007) [2023-03-06 16:55:28,996][23882] Updated weights for policy 0, policy_version 11680 (0.0006) [2023-03-06 16:55:29,771][23882] Updated weights for policy 0, policy_version 11690 (0.0007) [2023-03-06 16:55:30,557][23882] Updated weights for policy 0, policy_version 11700 (0.0007) [2023-03-06 16:55:31,345][23882] Updated weights for policy 0, policy_version 11710 (0.0006) [2023-03-06 16:55:31,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13055.1). Total num frames: 11996160. Throughput: 0: 13071.7. Samples: 11968184. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:55:31,748][23556] Avg episode reward: [(0, '108.659')] [2023-03-06 16:55:32,107][23882] Updated weights for policy 0, policy_version 11720 (0.0007) [2023-03-06 16:55:32,886][23882] Updated weights for policy 0, policy_version 11730 (0.0005) [2023-03-06 16:55:33,675][23882] Updated weights for policy 0, policy_version 11740 (0.0006) [2023-03-06 16:55:34,457][23882] Updated weights for policy 0, policy_version 11750 (0.0005) [2023-03-06 16:55:35,246][23882] Updated weights for policy 0, policy_version 11760 (0.0006) [2023-03-06 16:55:36,019][23882] Updated weights for policy 0, policy_version 11770 (0.0006) [2023-03-06 16:55:36,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13073.0, 300 sec: 13055.1). Total num frames: 12061696. Throughput: 0: 13078.0. Samples: 12046785. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:55:36,749][23556] Avg episode reward: [(0, '87.639')] [2023-03-06 16:55:36,801][23882] Updated weights for policy 0, policy_version 11780 (0.0005) [2023-03-06 16:55:37,604][23882] Updated weights for policy 0, policy_version 11790 (0.0006) [2023-03-06 16:55:38,373][23882] Updated weights for policy 0, policy_version 11800 (0.0006) [2023-03-06 16:55:39,158][23882] Updated weights for policy 0, policy_version 11810 (0.0006) [2023-03-06 16:55:39,937][23882] Updated weights for policy 0, policy_version 11820 (0.0006) [2023-03-06 16:55:40,706][23882] Updated weights for policy 0, policy_version 11830 (0.0006) [2023-03-06 16:55:41,501][23882] Updated weights for policy 0, policy_version 11840 (0.0006) [2023-03-06 16:55:41,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13073.1, 300 sec: 13055.1). Total num frames: 12127232. Throughput: 0: 13091.6. Samples: 12125409. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:55:41,748][23556] Avg episode reward: [(0, '113.244')] [2023-03-06 16:55:42,276][23882] Updated weights for policy 0, policy_version 11850 (0.0007) [2023-03-06 16:55:43,052][23882] Updated weights for policy 0, policy_version 11860 (0.0007) [2023-03-06 16:55:43,844][23882] Updated weights for policy 0, policy_version 11870 (0.0005) [2023-03-06 16:55:44,637][23882] Updated weights for policy 0, policy_version 11880 (0.0006) [2023-03-06 16:55:45,397][23882] Updated weights for policy 0, policy_version 11890 (0.0006) [2023-03-06 16:55:46,193][23882] Updated weights for policy 0, policy_version 11900 (0.0007) [2023-03-06 16:55:46,748][23556] Fps is (10 sec: 13107.5, 60 sec: 13090.2, 300 sec: 13058.6). Total num frames: 12192768. Throughput: 0: 13088.5. Samples: 12164622. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:55:46,748][23556] Avg episode reward: [(0, '92.146')] [2023-03-06 16:55:46,956][23882] Updated weights for policy 0, policy_version 11910 (0.0007) [2023-03-06 16:55:47,747][23882] Updated weights for policy 0, policy_version 11920 (0.0006) [2023-03-06 16:55:48,526][23882] Updated weights for policy 0, policy_version 11930 (0.0006) [2023-03-06 16:55:49,297][23882] Updated weights for policy 0, policy_version 11940 (0.0007) [2023-03-06 16:55:50,081][23882] Updated weights for policy 0, policy_version 11950 (0.0006) [2023-03-06 16:55:50,879][23882] Updated weights for policy 0, policy_version 11960 (0.0006) [2023-03-06 16:55:51,648][23882] Updated weights for policy 0, policy_version 11970 (0.0006) [2023-03-06 16:55:51,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13090.1, 300 sec: 13058.6). Total num frames: 12258304. Throughput: 0: 13091.4. Samples: 12243340. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:55:51,748][23556] Avg episode reward: [(0, '120.252')] [2023-03-06 16:55:52,418][23882] Updated weights for policy 0, policy_version 11980 (0.0006) [2023-03-06 16:55:53,185][23882] Updated weights for policy 0, policy_version 11990 (0.0007) [2023-03-06 16:55:53,973][23882] Updated weights for policy 0, policy_version 12000 (0.0007) [2023-03-06 16:55:54,746][23882] Updated weights for policy 0, policy_version 12010 (0.0007) [2023-03-06 16:55:55,526][23882] Updated weights for policy 0, policy_version 12020 (0.0005) [2023-03-06 16:55:56,305][23882] Updated weights for policy 0, policy_version 12030 (0.0007) [2023-03-06 16:55:56,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13090.1, 300 sec: 13062.1). Total num frames: 12323840. Throughput: 0: 13107.2. Samples: 12322240. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:55:56,748][23556] Avg episode reward: [(0, '120.096')] [2023-03-06 16:55:56,752][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000012035_12323840.pth... [2023-03-06 16:55:56,781][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000008973_9188352.pth [2023-03-06 16:55:57,082][23882] Updated weights for policy 0, policy_version 12040 (0.0006) [2023-03-06 16:55:57,886][23882] Updated weights for policy 0, policy_version 12050 (0.0006) [2023-03-06 16:55:58,659][23882] Updated weights for policy 0, policy_version 12060 (0.0007) [2023-03-06 16:55:59,436][23882] Updated weights for policy 0, policy_version 12070 (0.0006) [2023-03-06 16:56:00,223][23882] Updated weights for policy 0, policy_version 12080 (0.0006) [2023-03-06 16:56:01,012][23882] Updated weights for policy 0, policy_version 12090 (0.0006) [2023-03-06 16:56:01,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13062.1). Total num frames: 12389376. Throughput: 0: 13109.2. Samples: 12361758. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:56:01,748][23556] Avg episode reward: [(0, '125.676')] [2023-03-06 16:56:01,785][23882] Updated weights for policy 0, policy_version 12100 (0.0006) [2023-03-06 16:56:02,559][23882] Updated weights for policy 0, policy_version 12110 (0.0005) [2023-03-06 16:56:03,341][23882] Updated weights for policy 0, policy_version 12120 (0.0006) [2023-03-06 16:56:04,125][23882] Updated weights for policy 0, policy_version 12130 (0.0006) [2023-03-06 16:56:04,909][23882] Updated weights for policy 0, policy_version 12140 (0.0006) [2023-03-06 16:56:05,695][23882] Updated weights for policy 0, policy_version 12150 (0.0005) [2023-03-06 16:56:06,471][23882] Updated weights for policy 0, policy_version 12160 (0.0007) [2023-03-06 16:56:06,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13065.5). Total num frames: 12454912. Throughput: 0: 13107.4. Samples: 12440143. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:56:06,748][23556] Avg episode reward: [(0, '94.096')] [2023-03-06 16:56:07,245][23882] Updated weights for policy 0, policy_version 12170 (0.0006) [2023-03-06 16:56:08,025][23882] Updated weights for policy 0, policy_version 12180 (0.0007) [2023-03-06 16:56:08,802][23882] Updated weights for policy 0, policy_version 12190 (0.0006) [2023-03-06 16:56:09,569][23882] Updated weights for policy 0, policy_version 12200 (0.0006) [2023-03-06 16:56:10,354][23882] Updated weights for policy 0, policy_version 12210 (0.0006) [2023-03-06 16:56:11,126][23882] Updated weights for policy 0, policy_version 12220 (0.0007) [2023-03-06 16:56:11,748][23556] Fps is (10 sec: 13209.8, 60 sec: 13124.3, 300 sec: 13069.0). Total num frames: 12521472. Throughput: 0: 13124.1. Samples: 12519432. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:56:11,748][23556] Avg episode reward: [(0, '90.270')] [2023-03-06 16:56:11,910][23882] Updated weights for policy 0, policy_version 12230 (0.0006) [2023-03-06 16:56:12,692][23882] Updated weights for policy 0, policy_version 12240 (0.0006) [2023-03-06 16:56:13,470][23882] Updated weights for policy 0, policy_version 12250 (0.0006) [2023-03-06 16:56:14,240][23882] Updated weights for policy 0, policy_version 12260 (0.0006) [2023-03-06 16:56:15,023][23882] Updated weights for policy 0, policy_version 12270 (0.0006) [2023-03-06 16:56:15,803][23882] Updated weights for policy 0, policy_version 12280 (0.0006) [2023-03-06 16:56:16,590][23882] Updated weights for policy 0, policy_version 12290 (0.0007) [2023-03-06 16:56:16,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13069.0). Total num frames: 12585984. Throughput: 0: 13125.3. Samples: 12558824. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:56:16,748][23556] Avg episode reward: [(0, '94.711')] [2023-03-06 16:56:17,351][23882] Updated weights for policy 0, policy_version 12300 (0.0006) [2023-03-06 16:56:18,141][23882] Updated weights for policy 0, policy_version 12310 (0.0006) [2023-03-06 16:56:18,902][23882] Updated weights for policy 0, policy_version 12320 (0.0007) [2023-03-06 16:56:19,677][23882] Updated weights for policy 0, policy_version 12330 (0.0006) [2023-03-06 16:56:20,461][23882] Updated weights for policy 0, policy_version 12340 (0.0006) [2023-03-06 16:56:21,254][23882] Updated weights for policy 0, policy_version 12350 (0.0006) [2023-03-06 16:56:21,748][23556] Fps is (10 sec: 13107.0, 60 sec: 13124.2, 300 sec: 13072.5). Total num frames: 12652544. Throughput: 0: 13132.4. Samples: 12637744. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:56:21,748][23556] Avg episode reward: [(0, '95.074')] [2023-03-06 16:56:22,025][23882] Updated weights for policy 0, policy_version 12360 (0.0006) [2023-03-06 16:56:22,806][23882] Updated weights for policy 0, policy_version 12370 (0.0006) [2023-03-06 16:56:23,573][23882] Updated weights for policy 0, policy_version 12380 (0.0006) [2023-03-06 16:56:24,358][23882] Updated weights for policy 0, policy_version 12390 (0.0006) [2023-03-06 16:56:25,129][23882] Updated weights for policy 0, policy_version 12400 (0.0006) [2023-03-06 16:56:25,920][23882] Updated weights for policy 0, policy_version 12410 (0.0006) [2023-03-06 16:56:26,693][23882] Updated weights for policy 0, policy_version 12420 (0.0006) [2023-03-06 16:56:26,748][23556] Fps is (10 sec: 13209.6, 60 sec: 13124.3, 300 sec: 13072.5). Total num frames: 12718080. Throughput: 0: 13140.0. Samples: 12716707. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:56:26,748][23556] Avg episode reward: [(0, '90.457')] [2023-03-06 16:56:27,471][23882] Updated weights for policy 0, policy_version 12430 (0.0007) [2023-03-06 16:56:28,255][23882] Updated weights for policy 0, policy_version 12440 (0.0006) [2023-03-06 16:56:29,017][23882] Updated weights for policy 0, policy_version 12450 (0.0006) [2023-03-06 16:56:29,796][23882] Updated weights for policy 0, policy_version 12460 (0.0006) [2023-03-06 16:56:30,578][23882] Updated weights for policy 0, policy_version 12470 (0.0007) [2023-03-06 16:56:31,357][23882] Updated weights for policy 0, policy_version 12480 (0.0007) [2023-03-06 16:56:31,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13124.2, 300 sec: 13076.0). Total num frames: 12783616. Throughput: 0: 13142.7. Samples: 12756047. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:56:31,748][23556] Avg episode reward: [(0, '113.956')] [2023-03-06 16:56:32,143][23882] Updated weights for policy 0, policy_version 12490 (0.0006) [2023-03-06 16:56:32,938][23882] Updated weights for policy 0, policy_version 12500 (0.0006) [2023-03-06 16:56:33,709][23882] Updated weights for policy 0, policy_version 12510 (0.0007) [2023-03-06 16:56:34,510][23882] Updated weights for policy 0, policy_version 12520 (0.0007) [2023-03-06 16:56:35,300][23882] Updated weights for policy 0, policy_version 12530 (0.0006) [2023-03-06 16:56:36,083][23882] Updated weights for policy 0, policy_version 12540 (0.0006) [2023-03-06 16:56:36,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13124.3, 300 sec: 13076.0). Total num frames: 12849152. Throughput: 0: 13137.4. Samples: 12834525. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:56:36,748][23556] Avg episode reward: [(0, '99.652')] [2023-03-06 16:56:36,877][23882] Updated weights for policy 0, policy_version 12550 (0.0006) [2023-03-06 16:56:37,665][23882] Updated weights for policy 0, policy_version 12560 (0.0006) [2023-03-06 16:56:38,453][23882] Updated weights for policy 0, policy_version 12570 (0.0007) [2023-03-06 16:56:39,236][23882] Updated weights for policy 0, policy_version 12580 (0.0006) [2023-03-06 16:56:40,014][23882] Updated weights for policy 0, policy_version 12590 (0.0007) [2023-03-06 16:56:40,785][23882] Updated weights for policy 0, policy_version 12600 (0.0007) [2023-03-06 16:56:41,562][23882] Updated weights for policy 0, policy_version 12610 (0.0006) [2023-03-06 16:56:41,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13076.0). Total num frames: 12914688. Throughput: 0: 13126.5. Samples: 12912934. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:56:41,748][23556] Avg episode reward: [(0, '100.281')] [2023-03-06 16:56:42,356][23882] Updated weights for policy 0, policy_version 12620 (0.0007) [2023-03-06 16:56:43,137][23882] Updated weights for policy 0, policy_version 12630 (0.0007) [2023-03-06 16:56:43,906][23882] Updated weights for policy 0, policy_version 12640 (0.0007) [2023-03-06 16:56:44,697][23882] Updated weights for policy 0, policy_version 12650 (0.0007) [2023-03-06 16:56:45,455][23882] Updated weights for policy 0, policy_version 12660 (0.0006) [2023-03-06 16:56:46,232][23882] Updated weights for policy 0, policy_version 12670 (0.0006) [2023-03-06 16:56:46,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13124.2, 300 sec: 13079.4). Total num frames: 12980224. Throughput: 0: 13119.7. Samples: 12952145. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:56:46,748][23556] Avg episode reward: [(0, '114.921')] [2023-03-06 16:56:47,042][23882] Updated weights for policy 0, policy_version 12680 (0.0006) [2023-03-06 16:56:47,826][23882] Updated weights for policy 0, policy_version 12690 (0.0006) [2023-03-06 16:56:48,601][23882] Updated weights for policy 0, policy_version 12700 (0.0006) [2023-03-06 16:56:49,391][23882] Updated weights for policy 0, policy_version 12710 (0.0007) [2023-03-06 16:56:50,165][23882] Updated weights for policy 0, policy_version 12720 (0.0006) [2023-03-06 16:56:50,949][23882] Updated weights for policy 0, policy_version 12730 (0.0006) [2023-03-06 16:56:51,730][23882] Updated weights for policy 0, policy_version 12740 (0.0006) [2023-03-06 16:56:51,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13079.4). Total num frames: 13045760. Throughput: 0: 13123.1. Samples: 13030683. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:56:51,748][23556] Avg episode reward: [(0, '136.203')] [2023-03-06 16:56:51,749][23831] Saving new best policy, reward=136.203! [2023-03-06 16:56:52,510][23882] Updated weights for policy 0, policy_version 12750 (0.0006) [2023-03-06 16:56:53,293][23882] Updated weights for policy 0, policy_version 12760 (0.0006) [2023-03-06 16:56:54,089][23882] Updated weights for policy 0, policy_version 12770 (0.0007) [2023-03-06 16:56:54,870][23882] Updated weights for policy 0, policy_version 12780 (0.0007) [2023-03-06 16:56:55,641][23882] Updated weights for policy 0, policy_version 12790 (0.0006) [2023-03-06 16:56:56,435][23882] Updated weights for policy 0, policy_version 12800 (0.0007) [2023-03-06 16:56:56,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13124.2, 300 sec: 13079.4). Total num frames: 13111296. Throughput: 0: 13108.9. Samples: 13109336. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:56:56,748][23556] Avg episode reward: [(0, '113.471')] [2023-03-06 16:56:57,210][23882] Updated weights for policy 0, policy_version 12810 (0.0006) [2023-03-06 16:56:57,988][23882] Updated weights for policy 0, policy_version 12820 (0.0006) [2023-03-06 16:56:58,758][23882] Updated weights for policy 0, policy_version 12830 (0.0006) [2023-03-06 16:56:59,546][23882] Updated weights for policy 0, policy_version 12840 (0.0006) [2023-03-06 16:57:00,321][23882] Updated weights for policy 0, policy_version 12850 (0.0007) [2023-03-06 16:57:01,107][23882] Updated weights for policy 0, policy_version 12860 (0.0006) [2023-03-06 16:57:01,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13079.4). Total num frames: 13176832. Throughput: 0: 13109.9. Samples: 13148768. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:57:01,748][23556] Avg episode reward: [(0, '106.585')] [2023-03-06 16:57:01,874][23882] Updated weights for policy 0, policy_version 12870 (0.0006) [2023-03-06 16:57:02,659][23882] Updated weights for policy 0, policy_version 12880 (0.0007) [2023-03-06 16:57:03,437][23882] Updated weights for policy 0, policy_version 12890 (0.0007) [2023-03-06 16:57:04,218][23882] Updated weights for policy 0, policy_version 12900 (0.0006) [2023-03-06 16:57:04,999][23882] Updated weights for policy 0, policy_version 12910 (0.0006) [2023-03-06 16:57:05,790][23882] Updated weights for policy 0, policy_version 12920 (0.0006) [2023-03-06 16:57:06,571][23882] Updated weights for policy 0, policy_version 12930 (0.0007) [2023-03-06 16:57:06,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13124.2, 300 sec: 13079.4). Total num frames: 13242368. Throughput: 0: 13107.3. Samples: 13227572. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:57:06,748][23556] Avg episode reward: [(0, '88.260')] [2023-03-06 16:57:07,346][23882] Updated weights for policy 0, policy_version 12940 (0.0007) [2023-03-06 16:57:08,142][23882] Updated weights for policy 0, policy_version 12950 (0.0007) [2023-03-06 16:57:08,930][23882] Updated weights for policy 0, policy_version 12960 (0.0006) [2023-03-06 16:57:09,708][23882] Updated weights for policy 0, policy_version 12970 (0.0006) [2023-03-06 16:57:10,497][23882] Updated weights for policy 0, policy_version 12980 (0.0006) [2023-03-06 16:57:11,277][23882] Updated weights for policy 0, policy_version 12990 (0.0007) [2023-03-06 16:57:11,748][23556] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13082.9). Total num frames: 13307904. Throughput: 0: 13092.2. Samples: 13305857. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:57:11,748][23556] Avg episode reward: [(0, '93.327')] [2023-03-06 16:57:12,050][23882] Updated weights for policy 0, policy_version 13000 (0.0006) [2023-03-06 16:57:12,829][23882] Updated weights for policy 0, policy_version 13010 (0.0007) [2023-03-06 16:57:13,602][23882] Updated weights for policy 0, policy_version 13020 (0.0006) [2023-03-06 16:57:14,392][23882] Updated weights for policy 0, policy_version 13030 (0.0007) [2023-03-06 16:57:15,168][23882] Updated weights for policy 0, policy_version 13040 (0.0006) [2023-03-06 16:57:15,958][23882] Updated weights for policy 0, policy_version 13050 (0.0007) [2023-03-06 16:57:16,748][23556] Fps is (10 sec: 13107.4, 60 sec: 13124.3, 300 sec: 13082.9). Total num frames: 13373440. Throughput: 0: 13096.8. Samples: 13345402. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:57:16,748][23556] Avg episode reward: [(0, '109.345')] [2023-03-06 16:57:16,748][23882] Updated weights for policy 0, policy_version 13060 (0.0007) [2023-03-06 16:57:17,518][23882] Updated weights for policy 0, policy_version 13070 (0.0006) [2023-03-06 16:57:18,314][23882] Updated weights for policy 0, policy_version 13080 (0.0006) [2023-03-06 16:57:19,101][23882] Updated weights for policy 0, policy_version 13090 (0.0006) [2023-03-06 16:57:19,881][23882] Updated weights for policy 0, policy_version 13100 (0.0006) [2023-03-06 16:57:20,657][23882] Updated weights for policy 0, policy_version 13110 (0.0005) [2023-03-06 16:57:21,445][23882] Updated weights for policy 0, policy_version 13120 (0.0007) [2023-03-06 16:57:21,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13082.9). Total num frames: 13438976. Throughput: 0: 13092.0. Samples: 13423666. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:57:21,748][23556] Avg episode reward: [(0, '92.364')] [2023-03-06 16:57:22,213][23882] Updated weights for policy 0, policy_version 13130 (0.0006) [2023-03-06 16:57:23,001][23882] Updated weights for policy 0, policy_version 13140 (0.0006) [2023-03-06 16:57:23,786][23882] Updated weights for policy 0, policy_version 13150 (0.0006) [2023-03-06 16:57:24,556][23882] Updated weights for policy 0, policy_version 13160 (0.0007) [2023-03-06 16:57:25,362][23882] Updated weights for policy 0, policy_version 13170 (0.0006) [2023-03-06 16:57:26,142][23882] Updated weights for policy 0, policy_version 13180 (0.0006) [2023-03-06 16:57:26,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13090.1, 300 sec: 13082.9). Total num frames: 13503488. Throughput: 0: 13095.8. Samples: 13502243. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:57:26,748][23556] Avg episode reward: [(0, '77.466')] [2023-03-06 16:57:26,913][23882] Updated weights for policy 0, policy_version 13190 (0.0006) [2023-03-06 16:57:27,696][23882] Updated weights for policy 0, policy_version 13200 (0.0007) [2023-03-06 16:57:28,481][23882] Updated weights for policy 0, policy_version 13210 (0.0005) [2023-03-06 16:57:29,281][23882] Updated weights for policy 0, policy_version 13220 (0.0006) [2023-03-06 16:57:30,064][23882] Updated weights for policy 0, policy_version 13230 (0.0006) [2023-03-06 16:57:30,848][23882] Updated weights for policy 0, policy_version 13240 (0.0007) [2023-03-06 16:57:31,626][23882] Updated weights for policy 0, policy_version 13250 (0.0006) [2023-03-06 16:57:31,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13090.1, 300 sec: 13082.9). Total num frames: 13569024. Throughput: 0: 13091.3. Samples: 13541253. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:57:31,748][23556] Avg episode reward: [(0, '94.153')] [2023-03-06 16:57:32,396][23882] Updated weights for policy 0, policy_version 13260 (0.0006) [2023-03-06 16:57:33,203][23882] Updated weights for policy 0, policy_version 13270 (0.0007) [2023-03-06 16:57:33,977][23882] Updated weights for policy 0, policy_version 13280 (0.0007) [2023-03-06 16:57:34,742][23882] Updated weights for policy 0, policy_version 13290 (0.0006) [2023-03-06 16:57:35,526][23882] Updated weights for policy 0, policy_version 13300 (0.0006) [2023-03-06 16:57:36,322][23882] Updated weights for policy 0, policy_version 13310 (0.0007) [2023-03-06 16:57:36,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13090.2, 300 sec: 13082.9). Total num frames: 13634560. Throughput: 0: 13093.3. Samples: 13619880. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 16:57:36,748][23556] Avg episode reward: [(0, '75.110')] [2023-03-06 16:57:37,091][23882] Updated weights for policy 0, policy_version 13320 (0.0006) [2023-03-06 16:57:37,892][23882] Updated weights for policy 0, policy_version 13330 (0.0009) [2023-03-06 16:57:38,658][23882] Updated weights for policy 0, policy_version 13340 (0.0007) [2023-03-06 16:57:39,435][23882] Updated weights for policy 0, policy_version 13350 (0.0006) [2023-03-06 16:57:40,209][23882] Updated weights for policy 0, policy_version 13360 (0.0007) [2023-03-06 16:57:40,980][23882] Updated weights for policy 0, policy_version 13370 (0.0006) [2023-03-06 16:57:41,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13090.1, 300 sec: 13086.4). Total num frames: 13700096. Throughput: 0: 13102.0. Samples: 13698926. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 16:57:41,748][23556] Avg episode reward: [(0, '82.329')] [2023-03-06 16:57:41,755][23882] Updated weights for policy 0, policy_version 13380 (0.0006) [2023-03-06 16:57:42,538][23882] Updated weights for policy 0, policy_version 13390 (0.0006) [2023-03-06 16:57:43,334][23882] Updated weights for policy 0, policy_version 13400 (0.0005) [2023-03-06 16:57:44,104][23882] Updated weights for policy 0, policy_version 13410 (0.0008) [2023-03-06 16:57:44,896][23882] Updated weights for policy 0, policy_version 13420 (0.0006) [2023-03-06 16:57:45,696][23882] Updated weights for policy 0, policy_version 13430 (0.0007) [2023-03-06 16:57:46,474][23882] Updated weights for policy 0, policy_version 13440 (0.0006) [2023-03-06 16:57:46,748][23556] Fps is (10 sec: 13107.0, 60 sec: 13090.1, 300 sec: 13086.4). Total num frames: 13765632. Throughput: 0: 13095.9. Samples: 13738082. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 16:57:46,748][23556] Avg episode reward: [(0, '77.215')] [2023-03-06 16:57:47,234][23882] Updated weights for policy 0, policy_version 13450 (0.0006) [2023-03-06 16:57:48,021][23882] Updated weights for policy 0, policy_version 13460 (0.0006) [2023-03-06 16:57:48,811][23882] Updated weights for policy 0, policy_version 13470 (0.0007) [2023-03-06 16:57:49,600][23882] Updated weights for policy 0, policy_version 13480 (0.0006) [2023-03-06 16:57:50,399][23882] Updated weights for policy 0, policy_version 13490 (0.0006) [2023-03-06 16:57:51,176][23882] Updated weights for policy 0, policy_version 13500 (0.0006) [2023-03-06 16:57:51,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13090.1, 300 sec: 13089.8). Total num frames: 13831168. Throughput: 0: 13084.4. Samples: 13816368. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:57:51,748][23556] Avg episode reward: [(0, '70.533')] [2023-03-06 16:57:51,942][23882] Updated weights for policy 0, policy_version 13510 (0.0006) [2023-03-06 16:57:52,727][23882] Updated weights for policy 0, policy_version 13520 (0.0006) [2023-03-06 16:57:53,516][23882] Updated weights for policy 0, policy_version 13530 (0.0007) [2023-03-06 16:57:54,284][23882] Updated weights for policy 0, policy_version 13540 (0.0007) [2023-03-06 16:57:55,073][23882] Updated weights for policy 0, policy_version 13550 (0.0005) [2023-03-06 16:57:55,839][23882] Updated weights for policy 0, policy_version 13560 (0.0005) [2023-03-06 16:57:56,609][23882] Updated weights for policy 0, policy_version 13570 (0.0006) [2023-03-06 16:57:56,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13090.1, 300 sec: 13089.8). Total num frames: 13896704. Throughput: 0: 13103.5. Samples: 13895516. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:57:56,748][23556] Avg episode reward: [(0, '77.011')] [2023-03-06 16:57:56,752][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000013571_13896704.pth... [2023-03-06 16:57:56,782][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000010502_10754048.pth [2023-03-06 16:57:57,397][23882] Updated weights for policy 0, policy_version 13580 (0.0006) [2023-03-06 16:57:58,163][23882] Updated weights for policy 0, policy_version 13590 (0.0006) [2023-03-06 16:57:58,954][23882] Updated weights for policy 0, policy_version 13600 (0.0006) [2023-03-06 16:57:59,758][23882] Updated weights for policy 0, policy_version 13610 (0.0006) [2023-03-06 16:58:00,526][23882] Updated weights for policy 0, policy_version 13620 (0.0006) [2023-03-06 16:58:01,288][23882] Updated weights for policy 0, policy_version 13630 (0.0006) [2023-03-06 16:58:01,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13090.1, 300 sec: 13089.8). Total num frames: 13962240. Throughput: 0: 13094.2. Samples: 13934645. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:58:01,749][23556] Avg episode reward: [(0, '71.935')] [2023-03-06 16:58:02,085][23882] Updated weights for policy 0, policy_version 13640 (0.0006) [2023-03-06 16:58:02,859][23882] Updated weights for policy 0, policy_version 13650 (0.0006) [2023-03-06 16:58:03,636][23882] Updated weights for policy 0, policy_version 13660 (0.0006) [2023-03-06 16:58:04,421][23882] Updated weights for policy 0, policy_version 13670 (0.0007) [2023-03-06 16:58:05,201][23882] Updated weights for policy 0, policy_version 13680 (0.0007) [2023-03-06 16:58:05,982][23882] Updated weights for policy 0, policy_version 13690 (0.0006) [2023-03-06 16:58:06,748][23556] Fps is (10 sec: 13107.4, 60 sec: 13090.1, 300 sec: 13089.8). Total num frames: 14027776. Throughput: 0: 13104.4. Samples: 14013362. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:58:06,748][23556] Avg episode reward: [(0, '85.959')] [2023-03-06 16:58:06,766][23882] Updated weights for policy 0, policy_version 13700 (0.0006) [2023-03-06 16:58:07,551][23882] Updated weights for policy 0, policy_version 13710 (0.0006) [2023-03-06 16:58:08,328][23882] Updated weights for policy 0, policy_version 13720 (0.0007) [2023-03-06 16:58:09,121][23882] Updated weights for policy 0, policy_version 13730 (0.0006) [2023-03-06 16:58:09,891][23882] Updated weights for policy 0, policy_version 13740 (0.0006) [2023-03-06 16:58:10,689][23882] Updated weights for policy 0, policy_version 13750 (0.0006) [2023-03-06 16:58:11,468][23882] Updated weights for policy 0, policy_version 13760 (0.0006) [2023-03-06 16:58:11,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13090.1, 300 sec: 13089.8). Total num frames: 14093312. Throughput: 0: 13101.9. Samples: 14091829. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:58:11,748][23556] Avg episode reward: [(0, '58.061')] [2023-03-06 16:58:12,257][23882] Updated weights for policy 0, policy_version 13770 (0.0006) [2023-03-06 16:58:13,049][23882] Updated weights for policy 0, policy_version 13780 (0.0006) [2023-03-06 16:58:13,814][23882] Updated weights for policy 0, policy_version 13790 (0.0006) [2023-03-06 16:58:14,587][23882] Updated weights for policy 0, policy_version 13800 (0.0006) [2023-03-06 16:58:15,393][23882] Updated weights for policy 0, policy_version 13810 (0.0005) [2023-03-06 16:58:16,148][23882] Updated weights for policy 0, policy_version 13820 (0.0007) [2023-03-06 16:58:16,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13090.1, 300 sec: 13089.8). Total num frames: 14158848. Throughput: 0: 13107.9. Samples: 14131110. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:58:16,748][23556] Avg episode reward: [(0, '82.904')] [2023-03-06 16:58:16,946][23882] Updated weights for policy 0, policy_version 13830 (0.0006) [2023-03-06 16:58:17,734][23882] Updated weights for policy 0, policy_version 13840 (0.0006) [2023-03-06 16:58:18,510][23882] Updated weights for policy 0, policy_version 13850 (0.0006) [2023-03-06 16:58:19,299][23882] Updated weights for policy 0, policy_version 13860 (0.0006) [2023-03-06 16:58:20,088][23882] Updated weights for policy 0, policy_version 13870 (0.0006) [2023-03-06 16:58:20,861][23882] Updated weights for policy 0, policy_version 13880 (0.0006) [2023-03-06 16:58:21,637][23882] Updated weights for policy 0, policy_version 13890 (0.0006) [2023-03-06 16:58:21,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13090.1, 300 sec: 13093.3). Total num frames: 14224384. Throughput: 0: 13105.2. Samples: 14209617. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:58:21,748][23556] Avg episode reward: [(0, '97.644')] [2023-03-06 16:58:22,416][23882] Updated weights for policy 0, policy_version 13900 (0.0007) [2023-03-06 16:58:23,198][23882] Updated weights for policy 0, policy_version 13910 (0.0006) [2023-03-06 16:58:23,962][23882] Updated weights for policy 0, policy_version 13920 (0.0006) [2023-03-06 16:58:24,750][23882] Updated weights for policy 0, policy_version 13930 (0.0006) [2023-03-06 16:58:25,537][23882] Updated weights for policy 0, policy_version 13940 (0.0007) [2023-03-06 16:58:26,304][23882] Updated weights for policy 0, policy_version 13950 (0.0006) [2023-03-06 16:58:26,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13093.3). Total num frames: 14289920. Throughput: 0: 13101.8. Samples: 14288508. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:58:26,748][23556] Avg episode reward: [(0, '75.622')] [2023-03-06 16:58:27,103][23882] Updated weights for policy 0, policy_version 13960 (0.0007) [2023-03-06 16:58:27,872][23882] Updated weights for policy 0, policy_version 13970 (0.0006) [2023-03-06 16:58:28,656][23882] Updated weights for policy 0, policy_version 13980 (0.0007) [2023-03-06 16:58:29,443][23882] Updated weights for policy 0, policy_version 13990 (0.0006) [2023-03-06 16:58:30,218][23882] Updated weights for policy 0, policy_version 14000 (0.0007) [2023-03-06 16:58:31,006][23882] Updated weights for policy 0, policy_version 14010 (0.0006) [2023-03-06 16:58:31,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13093.3). Total num frames: 14355456. Throughput: 0: 13101.8. Samples: 14327660. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:58:31,748][23556] Avg episode reward: [(0, '91.489')] [2023-03-06 16:58:31,789][23882] Updated weights for policy 0, policy_version 14020 (0.0007) [2023-03-06 16:58:32,565][23882] Updated weights for policy 0, policy_version 14030 (0.0006) [2023-03-06 16:58:33,348][23882] Updated weights for policy 0, policy_version 14040 (0.0006) [2023-03-06 16:58:34,113][23882] Updated weights for policy 0, policy_version 14050 (0.0006) [2023-03-06 16:58:34,894][23882] Updated weights for policy 0, policy_version 14060 (0.0006) [2023-03-06 16:58:35,679][23882] Updated weights for policy 0, policy_version 14070 (0.0006) [2023-03-06 16:58:36,457][23882] Updated weights for policy 0, policy_version 14080 (0.0006) [2023-03-06 16:58:36,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13093.3). Total num frames: 14420992. Throughput: 0: 13117.0. Samples: 14406633. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:58:36,748][23556] Avg episode reward: [(0, '71.190')] [2023-03-06 16:58:37,262][23882] Updated weights for policy 0, policy_version 14090 (0.0007) [2023-03-06 16:58:38,033][23882] Updated weights for policy 0, policy_version 14100 (0.0007) [2023-03-06 16:58:38,821][23882] Updated weights for policy 0, policy_version 14110 (0.0006) [2023-03-06 16:58:39,601][23882] Updated weights for policy 0, policy_version 14120 (0.0006) [2023-03-06 16:58:40,378][23882] Updated weights for policy 0, policy_version 14130 (0.0006) [2023-03-06 16:58:41,161][23882] Updated weights for policy 0, policy_version 14140 (0.0007) [2023-03-06 16:58:41,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13093.3). Total num frames: 14486528. Throughput: 0: 13098.9. Samples: 14484965. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:58:41,748][23556] Avg episode reward: [(0, '72.047')] [2023-03-06 16:58:41,946][23882] Updated weights for policy 0, policy_version 14150 (0.0007) [2023-03-06 16:58:42,718][23882] Updated weights for policy 0, policy_version 14160 (0.0006) [2023-03-06 16:58:43,496][23882] Updated weights for policy 0, policy_version 14170 (0.0007) [2023-03-06 16:58:44,284][23882] Updated weights for policy 0, policy_version 14180 (0.0006) [2023-03-06 16:58:45,062][23882] Updated weights for policy 0, policy_version 14190 (0.0007) [2023-03-06 16:58:45,846][23882] Updated weights for policy 0, policy_version 14200 (0.0007) [2023-03-06 16:58:46,632][23882] Updated weights for policy 0, policy_version 14210 (0.0006) [2023-03-06 16:58:46,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13096.8). Total num frames: 14552064. Throughput: 0: 13106.5. Samples: 14524436. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:58:46,748][23556] Avg episode reward: [(0, '84.503')] [2023-03-06 16:58:47,410][23882] Updated weights for policy 0, policy_version 14220 (0.0006) [2023-03-06 16:58:48,181][23882] Updated weights for policy 0, policy_version 14230 (0.0006) [2023-03-06 16:58:48,976][23882] Updated weights for policy 0, policy_version 14240 (0.0006) [2023-03-06 16:58:49,752][23882] Updated weights for policy 0, policy_version 14250 (0.0007) [2023-03-06 16:58:50,541][23882] Updated weights for policy 0, policy_version 14260 (0.0006) [2023-03-06 16:58:51,320][23882] Updated weights for policy 0, policy_version 14270 (0.0006) [2023-03-06 16:58:51,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13096.8). Total num frames: 14617600. Throughput: 0: 13104.0. Samples: 14603041. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:58:51,748][23556] Avg episode reward: [(0, '72.701')] [2023-03-06 16:58:52,090][23882] Updated weights for policy 0, policy_version 14280 (0.0007) [2023-03-06 16:58:52,869][23882] Updated weights for policy 0, policy_version 14290 (0.0007) [2023-03-06 16:58:53,668][23882] Updated weights for policy 0, policy_version 14300 (0.0006) [2023-03-06 16:58:54,442][23882] Updated weights for policy 0, policy_version 14310 (0.0007) [2023-03-06 16:58:55,227][23882] Updated weights for policy 0, policy_version 14320 (0.0005) [2023-03-06 16:58:56,004][23882] Updated weights for policy 0, policy_version 14330 (0.0006) [2023-03-06 16:58:56,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13100.3). Total num frames: 14683136. Throughput: 0: 13104.8. Samples: 14681544. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:58:56,748][23556] Avg episode reward: [(0, '84.841')] [2023-03-06 16:58:56,778][23882] Updated weights for policy 0, policy_version 14340 (0.0006) [2023-03-06 16:58:57,548][23882] Updated weights for policy 0, policy_version 14350 (0.0006) [2023-03-06 16:58:58,339][23882] Updated weights for policy 0, policy_version 14360 (0.0006) [2023-03-06 16:58:59,115][23882] Updated weights for policy 0, policy_version 14370 (0.0006) [2023-03-06 16:58:59,897][23882] Updated weights for policy 0, policy_version 14380 (0.0006) [2023-03-06 16:59:00,687][23882] Updated weights for policy 0, policy_version 14390 (0.0007) [2023-03-06 16:59:01,472][23882] Updated weights for policy 0, policy_version 14400 (0.0007) [2023-03-06 16:59:01,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13100.3). Total num frames: 14748672. Throughput: 0: 13109.5. Samples: 14721040. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:59:01,748][23556] Avg episode reward: [(0, '66.596')] [2023-03-06 16:59:02,259][23882] Updated weights for policy 0, policy_version 14410 (0.0006) [2023-03-06 16:59:03,046][23882] Updated weights for policy 0, policy_version 14420 (0.0007) [2023-03-06 16:59:03,838][23882] Updated weights for policy 0, policy_version 14430 (0.0007) [2023-03-06 16:59:04,602][23882] Updated weights for policy 0, policy_version 14440 (0.0007) [2023-03-06 16:59:05,382][23882] Updated weights for policy 0, policy_version 14450 (0.0007) [2023-03-06 16:59:06,148][23882] Updated weights for policy 0, policy_version 14460 (0.0006) [2023-03-06 16:59:06,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13100.3). Total num frames: 14814208. Throughput: 0: 13111.6. Samples: 14799639. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:59:06,748][23556] Avg episode reward: [(0, '89.890')] [2023-03-06 16:59:06,956][23882] Updated weights for policy 0, policy_version 14470 (0.0006) [2023-03-06 16:59:07,735][23882] Updated weights for policy 0, policy_version 14480 (0.0006) [2023-03-06 16:59:08,531][23882] Updated weights for policy 0, policy_version 14490 (0.0006) [2023-03-06 16:59:09,318][23882] Updated weights for policy 0, policy_version 14500 (0.0007) [2023-03-06 16:59:10,094][23882] Updated weights for policy 0, policy_version 14510 (0.0007) [2023-03-06 16:59:10,893][23882] Updated weights for policy 0, policy_version 14520 (0.0006) [2023-03-06 16:59:11,684][23882] Updated weights for policy 0, policy_version 14530 (0.0007) [2023-03-06 16:59:11,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13090.1, 300 sec: 13096.8). Total num frames: 14878720. Throughput: 0: 13092.0. Samples: 14877650. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:59:11,748][23556] Avg episode reward: [(0, '101.658')] [2023-03-06 16:59:12,448][23882] Updated weights for policy 0, policy_version 14540 (0.0006) [2023-03-06 16:59:13,235][23882] Updated weights for policy 0, policy_version 14550 (0.0006) [2023-03-06 16:59:14,032][23882] Updated weights for policy 0, policy_version 14560 (0.0007) [2023-03-06 16:59:14,801][23882] Updated weights for policy 0, policy_version 14570 (0.0006) [2023-03-06 16:59:15,579][23882] Updated weights for policy 0, policy_version 14580 (0.0006) [2023-03-06 16:59:16,354][23882] Updated weights for policy 0, policy_version 14590 (0.0006) [2023-03-06 16:59:16,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13100.3). Total num frames: 14945280. Throughput: 0: 13096.3. Samples: 14916996. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:59:16,748][23556] Avg episode reward: [(0, '97.185')] [2023-03-06 16:59:17,138][23882] Updated weights for policy 0, policy_version 14600 (0.0006) [2023-03-06 16:59:17,918][23882] Updated weights for policy 0, policy_version 14610 (0.0006) [2023-03-06 16:59:18,709][23882] Updated weights for policy 0, policy_version 14620 (0.0007) [2023-03-06 16:59:19,492][23882] Updated weights for policy 0, policy_version 14630 (0.0005) [2023-03-06 16:59:20,270][23882] Updated weights for policy 0, policy_version 14640 (0.0007) [2023-03-06 16:59:21,055][23882] Updated weights for policy 0, policy_version 14650 (0.0006) [2023-03-06 16:59:21,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13090.1, 300 sec: 13096.8). Total num frames: 15009792. Throughput: 0: 13087.1. Samples: 14995555. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:59:21,748][23556] Avg episode reward: [(0, '90.719')] [2023-03-06 16:59:21,842][23882] Updated weights for policy 0, policy_version 14660 (0.0007) [2023-03-06 16:59:22,640][23882] Updated weights for policy 0, policy_version 14670 (0.0006) [2023-03-06 16:59:23,421][23882] Updated weights for policy 0, policy_version 14680 (0.0006) [2023-03-06 16:59:24,206][23882] Updated weights for policy 0, policy_version 14690 (0.0006) [2023-03-06 16:59:24,983][23882] Updated weights for policy 0, policy_version 14700 (0.0007) [2023-03-06 16:59:25,769][23882] Updated weights for policy 0, policy_version 14710 (0.0006) [2023-03-06 16:59:26,551][23882] Updated weights for policy 0, policy_version 14720 (0.0007) [2023-03-06 16:59:26,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13090.1, 300 sec: 13096.8). Total num frames: 15075328. Throughput: 0: 13088.1. Samples: 15073929. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 16:59:26,748][23556] Avg episode reward: [(0, '72.787')] [2023-03-06 16:59:27,319][23882] Updated weights for policy 0, policy_version 14730 (0.0006) [2023-03-06 16:59:28,115][23882] Updated weights for policy 0, policy_version 14740 (0.0006) [2023-03-06 16:59:28,906][23882] Updated weights for policy 0, policy_version 14750 (0.0007) [2023-03-06 16:59:29,685][23882] Updated weights for policy 0, policy_version 14760 (0.0007) [2023-03-06 16:59:30,459][23882] Updated weights for policy 0, policy_version 14770 (0.0006) [2023-03-06 16:59:31,260][23882] Updated weights for policy 0, policy_version 14780 (0.0007) [2023-03-06 16:59:31,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13090.1, 300 sec: 13096.8). Total num frames: 15140864. Throughput: 0: 13080.3. Samples: 15113048. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:59:31,748][23556] Avg episode reward: [(0, '64.938')] [2023-03-06 16:59:32,044][23882] Updated weights for policy 0, policy_version 14790 (0.0007) [2023-03-06 16:59:32,840][23882] Updated weights for policy 0, policy_version 14800 (0.0006) [2023-03-06 16:59:33,634][23882] Updated weights for policy 0, policy_version 14810 (0.0006) [2023-03-06 16:59:34,398][23882] Updated weights for policy 0, policy_version 14820 (0.0006) [2023-03-06 16:59:35,176][23882] Updated weights for policy 0, policy_version 14830 (0.0006) [2023-03-06 16:59:35,991][23882] Updated weights for policy 0, policy_version 14840 (0.0005) [2023-03-06 16:59:36,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13073.1, 300 sec: 13093.3). Total num frames: 15205376. Throughput: 0: 13073.1. Samples: 15191330. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:59:36,748][23556] Avg episode reward: [(0, '51.599')] [2023-03-06 16:59:36,758][23882] Updated weights for policy 0, policy_version 14850 (0.0006) [2023-03-06 16:59:37,526][23882] Updated weights for policy 0, policy_version 14860 (0.0006) [2023-03-06 16:59:38,323][23882] Updated weights for policy 0, policy_version 14870 (0.0006) [2023-03-06 16:59:39,121][23882] Updated weights for policy 0, policy_version 14880 (0.0006) [2023-03-06 16:59:39,909][23882] Updated weights for policy 0, policy_version 14890 (0.0006) [2023-03-06 16:59:40,686][23882] Updated weights for policy 0, policy_version 14900 (0.0007) [2023-03-06 16:59:41,458][23882] Updated weights for policy 0, policy_version 14910 (0.0006) [2023-03-06 16:59:41,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13073.1, 300 sec: 13096.8). Total num frames: 15270912. Throughput: 0: 13068.9. Samples: 15269645. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:59:41,749][23556] Avg episode reward: [(0, '67.729')] [2023-03-06 16:59:42,233][23882] Updated weights for policy 0, policy_version 14920 (0.0006) [2023-03-06 16:59:43,020][23882] Updated weights for policy 0, policy_version 14930 (0.0006) [2023-03-06 16:59:43,794][23882] Updated weights for policy 0, policy_version 14940 (0.0006) [2023-03-06 16:59:44,567][23882] Updated weights for policy 0, policy_version 14950 (0.0006) [2023-03-06 16:59:45,357][23882] Updated weights for policy 0, policy_version 14960 (0.0006) [2023-03-06 16:59:46,162][23882] Updated weights for policy 0, policy_version 14970 (0.0006) [2023-03-06 16:59:46,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13096.8). Total num frames: 15336448. Throughput: 0: 13065.8. Samples: 15309001. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:59:46,748][23556] Avg episode reward: [(0, '67.698')] [2023-03-06 16:59:46,941][23882] Updated weights for policy 0, policy_version 14980 (0.0006) [2023-03-06 16:59:47,730][23882] Updated weights for policy 0, policy_version 14990 (0.0006) [2023-03-06 16:59:48,526][23882] Updated weights for policy 0, policy_version 15000 (0.0006) [2023-03-06 16:59:49,301][23882] Updated weights for policy 0, policy_version 15010 (0.0006) [2023-03-06 16:59:50,086][23882] Updated weights for policy 0, policy_version 15020 (0.0006) [2023-03-06 16:59:50,872][23882] Updated weights for policy 0, policy_version 15030 (0.0007) [2023-03-06 16:59:51,660][23882] Updated weights for policy 0, policy_version 15040 (0.0006) [2023-03-06 16:59:51,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13096.8). Total num frames: 15401984. Throughput: 0: 13050.9. Samples: 15386927. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:59:51,748][23556] Avg episode reward: [(0, '66.170')] [2023-03-06 16:59:52,437][23882] Updated weights for policy 0, policy_version 15050 (0.0007) [2023-03-06 16:59:53,244][23882] Updated weights for policy 0, policy_version 15060 (0.0006) [2023-03-06 16:59:54,025][23882] Updated weights for policy 0, policy_version 15070 (0.0006) [2023-03-06 16:59:54,805][23882] Updated weights for policy 0, policy_version 15080 (0.0006) [2023-03-06 16:59:55,582][23882] Updated weights for policy 0, policy_version 15090 (0.0006) [2023-03-06 16:59:56,377][23882] Updated weights for policy 0, policy_version 15100 (0.0006) [2023-03-06 16:59:56,748][23556] Fps is (10 sec: 13004.5, 60 sec: 13056.0, 300 sec: 13096.8). Total num frames: 15466496. Throughput: 0: 13056.1. Samples: 15465176. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 16:59:56,749][23556] Avg episode reward: [(0, '50.930')] [2023-03-06 16:59:56,763][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000015105_15467520.pth... [2023-03-06 16:59:56,794][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000012035_12323840.pth [2023-03-06 16:59:57,161][23882] Updated weights for policy 0, policy_version 15110 (0.0007) [2023-03-06 16:59:57,950][23882] Updated weights for policy 0, policy_version 15120 (0.0006) [2023-03-06 16:59:58,736][23882] Updated weights for policy 0, policy_version 15130 (0.0006) [2023-03-06 16:59:59,526][23882] Updated weights for policy 0, policy_version 15140 (0.0007) [2023-03-06 17:00:00,310][23882] Updated weights for policy 0, policy_version 15150 (0.0007) [2023-03-06 17:00:01,093][23882] Updated weights for policy 0, policy_version 15160 (0.0007) [2023-03-06 17:00:01,748][23556] Fps is (10 sec: 13005.0, 60 sec: 13056.0, 300 sec: 13096.8). Total num frames: 15532032. Throughput: 0: 13051.1. Samples: 15504293. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:00:01,748][23556] Avg episode reward: [(0, '53.062')] [2023-03-06 17:00:01,892][23882] Updated weights for policy 0, policy_version 15170 (0.0006) [2023-03-06 17:00:02,658][23882] Updated weights for policy 0, policy_version 15180 (0.0007) [2023-03-06 17:00:03,445][23882] Updated weights for policy 0, policy_version 15190 (0.0006) [2023-03-06 17:00:04,223][23882] Updated weights for policy 0, policy_version 15200 (0.0006) [2023-03-06 17:00:05,015][23882] Updated weights for policy 0, policy_version 15210 (0.0006) [2023-03-06 17:00:05,795][23882] Updated weights for policy 0, policy_version 15220 (0.0007) [2023-03-06 17:00:06,580][23882] Updated weights for policy 0, policy_version 15230 (0.0006) [2023-03-06 17:00:06,748][23556] Fps is (10 sec: 13107.5, 60 sec: 13056.0, 300 sec: 13096.8). Total num frames: 15597568. Throughput: 0: 13044.0. Samples: 15582531. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:00:06,748][23556] Avg episode reward: [(0, '80.749')] [2023-03-06 17:00:07,366][23882] Updated weights for policy 0, policy_version 15240 (0.0006) [2023-03-06 17:00:08,137][23882] Updated weights for policy 0, policy_version 15250 (0.0006) [2023-03-06 17:00:08,921][23882] Updated weights for policy 0, policy_version 15260 (0.0007) [2023-03-06 17:00:09,715][23882] Updated weights for policy 0, policy_version 15270 (0.0006) [2023-03-06 17:00:10,490][23882] Updated weights for policy 0, policy_version 15280 (0.0007) [2023-03-06 17:00:11,277][23882] Updated weights for policy 0, policy_version 15290 (0.0006) [2023-03-06 17:00:11,748][23556] Fps is (10 sec: 13004.6, 60 sec: 13056.0, 300 sec: 13093.3). Total num frames: 15662080. Throughput: 0: 13046.3. Samples: 15661012. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:00:11,748][23556] Avg episode reward: [(0, '78.995')] [2023-03-06 17:00:12,069][23882] Updated weights for policy 0, policy_version 15300 (0.0007) [2023-03-06 17:00:12,850][23882] Updated weights for policy 0, policy_version 15310 (0.0006) [2023-03-06 17:00:13,634][23882] Updated weights for policy 0, policy_version 15320 (0.0006) [2023-03-06 17:00:14,418][23882] Updated weights for policy 0, policy_version 15330 (0.0007) [2023-03-06 17:00:15,202][23882] Updated weights for policy 0, policy_version 15340 (0.0006) [2023-03-06 17:00:15,969][23882] Updated weights for policy 0, policy_version 15350 (0.0007) [2023-03-06 17:00:16,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13093.3). Total num frames: 15727616. Throughput: 0: 13046.1. Samples: 15700122. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:00:16,748][23556] Avg episode reward: [(0, '105.303')] [2023-03-06 17:00:16,757][23882] Updated weights for policy 0, policy_version 15360 (0.0006) [2023-03-06 17:00:17,546][23882] Updated weights for policy 0, policy_version 15370 (0.0006) [2023-03-06 17:00:18,345][23882] Updated weights for policy 0, policy_version 15380 (0.0007) [2023-03-06 17:00:19,130][23882] Updated weights for policy 0, policy_version 15390 (0.0006) [2023-03-06 17:00:19,907][23882] Updated weights for policy 0, policy_version 15400 (0.0006) [2023-03-06 17:00:20,689][23882] Updated weights for policy 0, policy_version 15410 (0.0006) [2023-03-06 17:00:21,468][23882] Updated weights for policy 0, policy_version 15420 (0.0007) [2023-03-06 17:00:21,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13093.3). Total num frames: 15793152. Throughput: 0: 13047.9. Samples: 15778486. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:00:21,748][23556] Avg episode reward: [(0, '103.054')] [2023-03-06 17:00:22,262][23882] Updated weights for policy 0, policy_version 15430 (0.0006) [2023-03-06 17:00:23,047][23882] Updated weights for policy 0, policy_version 15440 (0.0007) [2023-03-06 17:00:23,817][23882] Updated weights for policy 0, policy_version 15450 (0.0006) [2023-03-06 17:00:24,623][23882] Updated weights for policy 0, policy_version 15460 (0.0006) [2023-03-06 17:00:25,411][23882] Updated weights for policy 0, policy_version 15470 (0.0006) [2023-03-06 17:00:26,202][23882] Updated weights for policy 0, policy_version 15480 (0.0007) [2023-03-06 17:00:26,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13089.8). Total num frames: 15857664. Throughput: 0: 13042.4. Samples: 15856553. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:00:26,748][23556] Avg episode reward: [(0, '73.841')] [2023-03-06 17:00:26,991][23882] Updated weights for policy 0, policy_version 15490 (0.0008) [2023-03-06 17:00:27,776][23882] Updated weights for policy 0, policy_version 15500 (0.0007) [2023-03-06 17:00:28,557][23882] Updated weights for policy 0, policy_version 15510 (0.0007) [2023-03-06 17:00:29,356][23882] Updated weights for policy 0, policy_version 15520 (0.0006) [2023-03-06 17:00:30,128][23882] Updated weights for policy 0, policy_version 15530 (0.0007) [2023-03-06 17:00:30,914][23882] Updated weights for policy 0, policy_version 15540 (0.0006) [2023-03-06 17:00:31,697][23882] Updated weights for policy 0, policy_version 15550 (0.0007) [2023-03-06 17:00:31,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13089.8). Total num frames: 15923200. Throughput: 0: 13036.7. Samples: 15895653. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:00:31,748][23556] Avg episode reward: [(0, '92.303')] [2023-03-06 17:00:32,508][23882] Updated weights for policy 0, policy_version 15560 (0.0007) [2023-03-06 17:00:33,293][23882] Updated weights for policy 0, policy_version 15570 (0.0006) [2023-03-06 17:00:34,068][23882] Updated weights for policy 0, policy_version 15580 (0.0006) [2023-03-06 17:00:34,865][23882] Updated weights for policy 0, policy_version 15590 (0.0007) [2023-03-06 17:00:35,633][23882] Updated weights for policy 0, policy_version 15600 (0.0006) [2023-03-06 17:00:36,425][23882] Updated weights for policy 0, policy_version 15610 (0.0007) [2023-03-06 17:00:36,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13089.8). Total num frames: 15988736. Throughput: 0: 13043.7. Samples: 15973894. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:00:36,748][23556] Avg episode reward: [(0, '100.995')] [2023-03-06 17:00:37,184][23882] Updated weights for policy 0, policy_version 15620 (0.0006) [2023-03-06 17:00:37,965][23882] Updated weights for policy 0, policy_version 15630 (0.0006) [2023-03-06 17:00:38,741][23882] Updated weights for policy 0, policy_version 15640 (0.0006) [2023-03-06 17:00:39,518][23882] Updated weights for policy 0, policy_version 15650 (0.0006) [2023-03-06 17:00:40,300][23882] Updated weights for policy 0, policy_version 15660 (0.0007) [2023-03-06 17:00:41,082][23882] Updated weights for policy 0, policy_version 15670 (0.0006) [2023-03-06 17:00:41,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13089.8). Total num frames: 16054272. Throughput: 0: 13060.5. Samples: 16052895. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:00:41,748][23556] Avg episode reward: [(0, '97.784')] [2023-03-06 17:00:41,858][23882] Updated weights for policy 0, policy_version 15680 (0.0006) [2023-03-06 17:00:42,641][23882] Updated weights for policy 0, policy_version 15690 (0.0007) [2023-03-06 17:00:43,425][23882] Updated weights for policy 0, policy_version 15700 (0.0006) [2023-03-06 17:00:44,206][23882] Updated weights for policy 0, policy_version 15710 (0.0006) [2023-03-06 17:00:44,997][23882] Updated weights for policy 0, policy_version 15720 (0.0007) [2023-03-06 17:00:45,769][23882] Updated weights for policy 0, policy_version 15730 (0.0007) [2023-03-06 17:00:46,559][23882] Updated weights for policy 0, policy_version 15740 (0.0006) [2023-03-06 17:00:46,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13089.8). Total num frames: 16119808. Throughput: 0: 13062.6. Samples: 16092111. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:00:46,748][23556] Avg episode reward: [(0, '103.243')] [2023-03-06 17:00:47,361][23882] Updated weights for policy 0, policy_version 15750 (0.0007) [2023-03-06 17:00:48,130][23882] Updated weights for policy 0, policy_version 15760 (0.0006) [2023-03-06 17:00:48,905][23882] Updated weights for policy 0, policy_version 15770 (0.0006) [2023-03-06 17:00:49,691][23882] Updated weights for policy 0, policy_version 15780 (0.0005) [2023-03-06 17:00:50,475][23882] Updated weights for policy 0, policy_version 15790 (0.0007) [2023-03-06 17:00:51,266][23882] Updated weights for policy 0, policy_version 15800 (0.0006) [2023-03-06 17:00:51,748][23556] Fps is (10 sec: 13107.0, 60 sec: 13056.0, 300 sec: 13089.8). Total num frames: 16185344. Throughput: 0: 13062.0. Samples: 16170322. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:00:51,748][23556] Avg episode reward: [(0, '106.912')] [2023-03-06 17:00:52,062][23882] Updated weights for policy 0, policy_version 15810 (0.0006) [2023-03-06 17:00:52,848][23882] Updated weights for policy 0, policy_version 15820 (0.0007) [2023-03-06 17:00:53,625][23882] Updated weights for policy 0, policy_version 15830 (0.0006) [2023-03-06 17:00:54,413][23882] Updated weights for policy 0, policy_version 15840 (0.0006) [2023-03-06 17:00:55,189][23882] Updated weights for policy 0, policy_version 15850 (0.0006) [2023-03-06 17:00:55,975][23882] Updated weights for policy 0, policy_version 15860 (0.0007) [2023-03-06 17:00:56,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13086.4). Total num frames: 16249856. Throughput: 0: 13054.4. Samples: 16248458. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:00:56,748][23556] Avg episode reward: [(0, '81.415')] [2023-03-06 17:00:56,773][23882] Updated weights for policy 0, policy_version 15870 (0.0007) [2023-03-06 17:00:57,556][23882] Updated weights for policy 0, policy_version 15880 (0.0006) [2023-03-06 17:00:58,333][23882] Updated weights for policy 0, policy_version 15890 (0.0007) [2023-03-06 17:00:59,126][23882] Updated weights for policy 0, policy_version 15900 (0.0006) [2023-03-06 17:00:59,915][23882] Updated weights for policy 0, policy_version 15910 (0.0006) [2023-03-06 17:01:00,697][23882] Updated weights for policy 0, policy_version 15920 (0.0006) [2023-03-06 17:01:01,459][23882] Updated weights for policy 0, policy_version 15930 (0.0005) [2023-03-06 17:01:01,748][23556] Fps is (10 sec: 13005.0, 60 sec: 13056.0, 300 sec: 13086.4). Total num frames: 16315392. Throughput: 0: 13055.4. Samples: 16287616. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:01:01,748][23556] Avg episode reward: [(0, '73.208')] [2023-03-06 17:01:02,241][23882] Updated weights for policy 0, policy_version 15940 (0.0007) [2023-03-06 17:01:03,029][23882] Updated weights for policy 0, policy_version 15950 (0.0008) [2023-03-06 17:01:03,806][23882] Updated weights for policy 0, policy_version 15960 (0.0005) [2023-03-06 17:01:04,586][23882] Updated weights for policy 0, policy_version 15970 (0.0006) [2023-03-06 17:01:05,384][23882] Updated weights for policy 0, policy_version 15980 (0.0006) [2023-03-06 17:01:06,158][23882] Updated weights for policy 0, policy_version 15990 (0.0007) [2023-03-06 17:01:06,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13082.9). Total num frames: 16380928. Throughput: 0: 13065.7. Samples: 16366444. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:01:06,748][23556] Avg episode reward: [(0, '91.561')] [2023-03-06 17:01:06,933][23882] Updated weights for policy 0, policy_version 16000 (0.0006) [2023-03-06 17:01:07,713][23882] Updated weights for policy 0, policy_version 16010 (0.0006) [2023-03-06 17:01:08,485][23882] Updated weights for policy 0, policy_version 16020 (0.0007) [2023-03-06 17:01:09,289][23882] Updated weights for policy 0, policy_version 16030 (0.0006) [2023-03-06 17:01:10,071][23882] Updated weights for policy 0, policy_version 16040 (0.0007) [2023-03-06 17:01:10,855][23882] Updated weights for policy 0, policy_version 16050 (0.0006) [2023-03-06 17:01:11,641][23882] Updated weights for policy 0, policy_version 16060 (0.0006) [2023-03-06 17:01:11,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13073.1, 300 sec: 13086.4). Total num frames: 16446464. Throughput: 0: 13070.7. Samples: 16444734. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:01:11,749][23556] Avg episode reward: [(0, '76.003')] [2023-03-06 17:01:12,424][23882] Updated weights for policy 0, policy_version 16070 (0.0007) [2023-03-06 17:01:13,229][23882] Updated weights for policy 0, policy_version 16080 (0.0005) [2023-03-06 17:01:14,004][23882] Updated weights for policy 0, policy_version 16090 (0.0007) [2023-03-06 17:01:14,770][23882] Updated weights for policy 0, policy_version 16100 (0.0007) [2023-03-06 17:01:15,565][23882] Updated weights for policy 0, policy_version 16110 (0.0007) [2023-03-06 17:01:16,351][23882] Updated weights for policy 0, policy_version 16120 (0.0006) [2023-03-06 17:01:16,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13073.1, 300 sec: 13082.9). Total num frames: 16512000. Throughput: 0: 13072.3. Samples: 16483907. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:01:16,748][23556] Avg episode reward: [(0, '70.093')] [2023-03-06 17:01:17,117][23882] Updated weights for policy 0, policy_version 16130 (0.0006) [2023-03-06 17:01:17,913][23882] Updated weights for policy 0, policy_version 16140 (0.0006) [2023-03-06 17:01:18,698][23882] Updated weights for policy 0, policy_version 16150 (0.0006) [2023-03-06 17:01:19,469][23882] Updated weights for policy 0, policy_version 16160 (0.0006) [2023-03-06 17:01:20,258][23882] Updated weights for policy 0, policy_version 16170 (0.0007) [2023-03-06 17:01:21,050][23882] Updated weights for policy 0, policy_version 16180 (0.0008) [2023-03-06 17:01:21,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.0, 300 sec: 13082.9). Total num frames: 16577536. Throughput: 0: 13080.8. Samples: 16562533. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 17:01:21,749][23556] Avg episode reward: [(0, '89.074')] [2023-03-06 17:01:21,818][23882] Updated weights for policy 0, policy_version 16190 (0.0006) [2023-03-06 17:01:22,600][23882] Updated weights for policy 0, policy_version 16200 (0.0006) [2023-03-06 17:01:23,396][23882] Updated weights for policy 0, policy_version 16210 (0.0006) [2023-03-06 17:01:24,182][23882] Updated weights for policy 0, policy_version 16220 (0.0008) [2023-03-06 17:01:24,943][23882] Updated weights for policy 0, policy_version 16230 (0.0007) [2023-03-06 17:01:25,731][23882] Updated weights for policy 0, policy_version 16240 (0.0006) [2023-03-06 17:01:26,522][23882] Updated weights for policy 0, policy_version 16250 (0.0006) [2023-03-06 17:01:26,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13073.1, 300 sec: 13079.4). Total num frames: 16642048. Throughput: 0: 13068.3. Samples: 16640970. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 17:01:26,748][23556] Avg episode reward: [(0, '89.330')] [2023-03-06 17:01:27,290][23882] Updated weights for policy 0, policy_version 16260 (0.0007) [2023-03-06 17:01:28,066][23882] Updated weights for policy 0, policy_version 16270 (0.0006) [2023-03-06 17:01:28,850][23882] Updated weights for policy 0, policy_version 16280 (0.0007) [2023-03-06 17:01:29,647][23882] Updated weights for policy 0, policy_version 16290 (0.0006) [2023-03-06 17:01:30,429][23882] Updated weights for policy 0, policy_version 16300 (0.0006) [2023-03-06 17:01:31,214][23882] Updated weights for policy 0, policy_version 16310 (0.0007) [2023-03-06 17:01:31,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13073.1, 300 sec: 13079.4). Total num frames: 16707584. Throughput: 0: 13068.6. Samples: 16680197. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 17:01:31,748][23556] Avg episode reward: [(0, '117.883')] [2023-03-06 17:01:31,994][23882] Updated weights for policy 0, policy_version 16320 (0.0006) [2023-03-06 17:01:32,778][23882] Updated weights for policy 0, policy_version 16330 (0.0007) [2023-03-06 17:01:33,561][23882] Updated weights for policy 0, policy_version 16340 (0.0006) [2023-03-06 17:01:34,362][23882] Updated weights for policy 0, policy_version 16350 (0.0006) [2023-03-06 17:01:35,147][23882] Updated weights for policy 0, policy_version 16360 (0.0006) [2023-03-06 17:01:35,943][23882] Updated weights for policy 0, policy_version 16370 (0.0006) [2023-03-06 17:01:36,721][23882] Updated weights for policy 0, policy_version 16380 (0.0006) [2023-03-06 17:01:36,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13073.1, 300 sec: 13079.4). Total num frames: 16773120. Throughput: 0: 13068.3. Samples: 16758393. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:01:36,748][23556] Avg episode reward: [(0, '120.357')] [2023-03-06 17:01:37,502][23882] Updated weights for policy 0, policy_version 16390 (0.0006) [2023-03-06 17:01:38,289][23882] Updated weights for policy 0, policy_version 16400 (0.0006) [2023-03-06 17:01:39,078][23882] Updated weights for policy 0, policy_version 16410 (0.0007) [2023-03-06 17:01:39,885][23882] Updated weights for policy 0, policy_version 16420 (0.0006) [2023-03-06 17:01:40,654][23882] Updated weights for policy 0, policy_version 16430 (0.0006) [2023-03-06 17:01:41,444][23882] Updated weights for policy 0, policy_version 16440 (0.0006) [2023-03-06 17:01:41,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13076.0). Total num frames: 16837632. Throughput: 0: 13070.3. Samples: 16836624. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:01:41,748][23556] Avg episode reward: [(0, '120.586')] [2023-03-06 17:01:42,209][23882] Updated weights for policy 0, policy_version 16450 (0.0006) [2023-03-06 17:01:43,013][23882] Updated weights for policy 0, policy_version 16460 (0.0006) [2023-03-06 17:01:43,801][23882] Updated weights for policy 0, policy_version 16470 (0.0006) [2023-03-06 17:01:44,573][23882] Updated weights for policy 0, policy_version 16480 (0.0006) [2023-03-06 17:01:45,342][23882] Updated weights for policy 0, policy_version 16490 (0.0006) [2023-03-06 17:01:46,136][23882] Updated weights for policy 0, policy_version 16500 (0.0007) [2023-03-06 17:01:46,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13076.0). Total num frames: 16903168. Throughput: 0: 13070.0. Samples: 16875768. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:01:46,748][23556] Avg episode reward: [(0, '132.374')] [2023-03-06 17:01:46,923][23882] Updated weights for policy 0, policy_version 16510 (0.0006) [2023-03-06 17:01:47,712][23882] Updated weights for policy 0, policy_version 16520 (0.0007) [2023-03-06 17:01:48,496][23882] Updated weights for policy 0, policy_version 16530 (0.0006) [2023-03-06 17:01:49,290][23882] Updated weights for policy 0, policy_version 16540 (0.0007) [2023-03-06 17:01:50,078][23882] Updated weights for policy 0, policy_version 16550 (0.0006) [2023-03-06 17:01:50,860][23882] Updated weights for policy 0, policy_version 16560 (0.0006) [2023-03-06 17:01:51,640][23882] Updated weights for policy 0, policy_version 16570 (0.0007) [2023-03-06 17:01:51,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13076.0). Total num frames: 16968704. Throughput: 0: 13051.0. Samples: 16953741. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:01:51,748][23556] Avg episode reward: [(0, '129.058')] [2023-03-06 17:01:52,425][23882] Updated weights for policy 0, policy_version 16580 (0.0007) [2023-03-06 17:01:53,213][23882] Updated weights for policy 0, policy_version 16590 (0.0006) [2023-03-06 17:01:54,009][23882] Updated weights for policy 0, policy_version 16600 (0.0006) [2023-03-06 17:01:54,801][23882] Updated weights for policy 0, policy_version 16610 (0.0007) [2023-03-06 17:01:55,606][23882] Updated weights for policy 0, policy_version 16620 (0.0006) [2023-03-06 17:01:56,386][23882] Updated weights for policy 0, policy_version 16630 (0.0006) [2023-03-06 17:01:56,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13072.5). Total num frames: 17033216. Throughput: 0: 13049.6. Samples: 17031964. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:01:56,748][23556] Avg episode reward: [(0, '124.754')] [2023-03-06 17:01:56,759][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000016635_17034240.pth... [2023-03-06 17:01:56,788][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000013571_13896704.pth [2023-03-06 17:01:57,165][23882] Updated weights for policy 0, policy_version 16640 (0.0006) [2023-03-06 17:01:57,938][23882] Updated weights for policy 0, policy_version 16650 (0.0006) [2023-03-06 17:01:58,742][23882] Updated weights for policy 0, policy_version 16660 (0.0006) [2023-03-06 17:01:59,506][23882] Updated weights for policy 0, policy_version 16670 (0.0006) [2023-03-06 17:02:00,290][23882] Updated weights for policy 0, policy_version 16680 (0.0006) [2023-03-06 17:02:01,060][23882] Updated weights for policy 0, policy_version 16690 (0.0006) [2023-03-06 17:02:01,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13072.5). Total num frames: 17098752. Throughput: 0: 13050.4. Samples: 17071176. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:02:01,748][23556] Avg episode reward: [(0, '112.115')] [2023-03-06 17:02:01,835][23882] Updated weights for policy 0, policy_version 16700 (0.0006) [2023-03-06 17:02:02,640][23882] Updated weights for policy 0, policy_version 16710 (0.0007) [2023-03-06 17:02:03,407][23882] Updated weights for policy 0, policy_version 16720 (0.0007) [2023-03-06 17:02:04,190][23882] Updated weights for policy 0, policy_version 16730 (0.0007) [2023-03-06 17:02:04,974][23882] Updated weights for policy 0, policy_version 16740 (0.0007) [2023-03-06 17:02:05,743][23882] Updated weights for policy 0, policy_version 16750 (0.0007) [2023-03-06 17:02:06,530][23882] Updated weights for policy 0, policy_version 16760 (0.0006) [2023-03-06 17:02:06,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13072.5). Total num frames: 17164288. Throughput: 0: 13050.1. Samples: 17149784. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:02:06,748][23556] Avg episode reward: [(0, '94.048')] [2023-03-06 17:02:07,315][23882] Updated weights for policy 0, policy_version 16770 (0.0006) [2023-03-06 17:02:08,085][23882] Updated weights for policy 0, policy_version 16780 (0.0006) [2023-03-06 17:02:08,862][23882] Updated weights for policy 0, policy_version 16790 (0.0006) [2023-03-06 17:02:09,645][23882] Updated weights for policy 0, policy_version 16800 (0.0006) [2023-03-06 17:02:10,418][23882] Updated weights for policy 0, policy_version 16810 (0.0007) [2023-03-06 17:02:11,199][23882] Updated weights for policy 0, policy_version 16820 (0.0005) [2023-03-06 17:02:11,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13072.5). Total num frames: 17229824. Throughput: 0: 13060.9. Samples: 17228710. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:02:11,748][23556] Avg episode reward: [(0, '96.229')] [2023-03-06 17:02:11,978][23882] Updated weights for policy 0, policy_version 16830 (0.0006) [2023-03-06 17:02:12,761][23882] Updated weights for policy 0, policy_version 16840 (0.0005) [2023-03-06 17:02:13,552][23882] Updated weights for policy 0, policy_version 16850 (0.0007) [2023-03-06 17:02:14,327][23882] Updated weights for policy 0, policy_version 16860 (0.0006) [2023-03-06 17:02:15,121][23882] Updated weights for policy 0, policy_version 16870 (0.0006) [2023-03-06 17:02:15,910][23882] Updated weights for policy 0, policy_version 16880 (0.0006) [2023-03-06 17:02:16,679][23882] Updated weights for policy 0, policy_version 16890 (0.0007) [2023-03-06 17:02:16,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13072.5). Total num frames: 17295360. Throughput: 0: 13062.5. Samples: 17268011. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:02:16,748][23556] Avg episode reward: [(0, '61.235')] [2023-03-06 17:02:17,488][23882] Updated weights for policy 0, policy_version 16900 (0.0007) [2023-03-06 17:02:18,280][23882] Updated weights for policy 0, policy_version 16910 (0.0006) [2023-03-06 17:02:19,056][23882] Updated weights for policy 0, policy_version 16920 (0.0006) [2023-03-06 17:02:19,836][23882] Updated weights for policy 0, policy_version 16930 (0.0007) [2023-03-06 17:02:20,604][23882] Updated weights for policy 0, policy_version 16940 (0.0006) [2023-03-06 17:02:21,401][23882] Updated weights for policy 0, policy_version 16950 (0.0006) [2023-03-06 17:02:21,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13076.0). Total num frames: 17360896. Throughput: 0: 13064.9. Samples: 17346314. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:02:21,748][23556] Avg episode reward: [(0, '88.803')] [2023-03-06 17:02:22,185][23882] Updated weights for policy 0, policy_version 16960 (0.0006) [2023-03-06 17:02:22,955][23882] Updated weights for policy 0, policy_version 16970 (0.0007) [2023-03-06 17:02:23,743][23882] Updated weights for policy 0, policy_version 16980 (0.0006) [2023-03-06 17:02:24,531][23882] Updated weights for policy 0, policy_version 16990 (0.0005) [2023-03-06 17:02:25,314][23882] Updated weights for policy 0, policy_version 17000 (0.0007) [2023-03-06 17:02:26,102][23882] Updated weights for policy 0, policy_version 17010 (0.0006) [2023-03-06 17:02:26,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13076.0). Total num frames: 17426432. Throughput: 0: 13064.8. Samples: 17424538. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:02:26,748][23556] Avg episode reward: [(0, '113.820')] [2023-03-06 17:02:26,898][23882] Updated weights for policy 0, policy_version 17020 (0.0007) [2023-03-06 17:02:27,686][23882] Updated weights for policy 0, policy_version 17030 (0.0006) [2023-03-06 17:02:28,466][23882] Updated weights for policy 0, policy_version 17040 (0.0007) [2023-03-06 17:02:29,254][23882] Updated weights for policy 0, policy_version 17050 (0.0006) [2023-03-06 17:02:30,053][23882] Updated weights for policy 0, policy_version 17060 (0.0007) [2023-03-06 17:02:30,831][23882] Updated weights for policy 0, policy_version 17070 (0.0006) [2023-03-06 17:02:31,610][23882] Updated weights for policy 0, policy_version 17080 (0.0006) [2023-03-06 17:02:31,748][23556] Fps is (10 sec: 13004.6, 60 sec: 13056.0, 300 sec: 13072.5). Total num frames: 17490944. Throughput: 0: 13062.0. Samples: 17463560. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:02:31,748][23556] Avg episode reward: [(0, '111.182')] [2023-03-06 17:02:32,396][23882] Updated weights for policy 0, policy_version 17090 (0.0006) [2023-03-06 17:02:33,177][23882] Updated weights for policy 0, policy_version 17100 (0.0006) [2023-03-06 17:02:33,964][23882] Updated weights for policy 0, policy_version 17110 (0.0007) [2023-03-06 17:02:34,753][23882] Updated weights for policy 0, policy_version 17120 (0.0006) [2023-03-06 17:02:35,522][23882] Updated weights for policy 0, policy_version 17130 (0.0007) [2023-03-06 17:02:36,311][23882] Updated weights for policy 0, policy_version 17140 (0.0006) [2023-03-06 17:02:36,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13072.5). Total num frames: 17556480. Throughput: 0: 13068.8. Samples: 17541835. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:02:36,748][23556] Avg episode reward: [(0, '113.218')] [2023-03-06 17:02:37,091][23882] Updated weights for policy 0, policy_version 17150 (0.0006) [2023-03-06 17:02:37,888][23882] Updated weights for policy 0, policy_version 17160 (0.0006) [2023-03-06 17:02:38,651][23882] Updated weights for policy 0, policy_version 17170 (0.0006) [2023-03-06 17:02:39,437][23882] Updated weights for policy 0, policy_version 17180 (0.0006) [2023-03-06 17:02:40,224][23882] Updated weights for policy 0, policy_version 17190 (0.0007) [2023-03-06 17:02:41,002][23882] Updated weights for policy 0, policy_version 17200 (0.0006) [2023-03-06 17:02:41,748][23556] Fps is (10 sec: 13107.4, 60 sec: 13073.1, 300 sec: 13072.5). Total num frames: 17622016. Throughput: 0: 13078.6. Samples: 17620499. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:02:41,748][23556] Avg episode reward: [(0, '125.029')] [2023-03-06 17:02:41,789][23882] Updated weights for policy 0, policy_version 17210 (0.0006) [2023-03-06 17:02:42,563][23882] Updated weights for policy 0, policy_version 17220 (0.0007) [2023-03-06 17:02:43,341][23882] Updated weights for policy 0, policy_version 17230 (0.0006) [2023-03-06 17:02:44,117][23882] Updated weights for policy 0, policy_version 17240 (0.0006) [2023-03-06 17:02:44,921][23882] Updated weights for policy 0, policy_version 17250 (0.0006) [2023-03-06 17:02:45,694][23882] Updated weights for policy 0, policy_version 17260 (0.0006) [2023-03-06 17:02:46,456][23882] Updated weights for policy 0, policy_version 17270 (0.0005) [2023-03-06 17:02:46,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13072.5). Total num frames: 17687552. Throughput: 0: 13083.7. Samples: 17659940. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:02:46,748][23556] Avg episode reward: [(0, '79.492')] [2023-03-06 17:02:47,232][23882] Updated weights for policy 0, policy_version 17280 (0.0006) [2023-03-06 17:02:48,017][23882] Updated weights for policy 0, policy_version 17290 (0.0006) [2023-03-06 17:02:48,798][23882] Updated weights for policy 0, policy_version 17300 (0.0006) [2023-03-06 17:02:49,602][23882] Updated weights for policy 0, policy_version 17310 (0.0006) [2023-03-06 17:02:50,380][23882] Updated weights for policy 0, policy_version 17320 (0.0006) [2023-03-06 17:02:51,161][23882] Updated weights for policy 0, policy_version 17330 (0.0007) [2023-03-06 17:02:51,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13073.0, 300 sec: 13072.5). Total num frames: 17753088. Throughput: 0: 13082.5. Samples: 17738496. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:02:51,748][23556] Avg episode reward: [(0, '90.376')] [2023-03-06 17:02:51,961][23882] Updated weights for policy 0, policy_version 17340 (0.0006) [2023-03-06 17:02:52,736][23882] Updated weights for policy 0, policy_version 17350 (0.0007) [2023-03-06 17:02:53,511][23882] Updated weights for policy 0, policy_version 17360 (0.0006) [2023-03-06 17:02:54,300][23882] Updated weights for policy 0, policy_version 17370 (0.0007) [2023-03-06 17:02:55,086][23882] Updated weights for policy 0, policy_version 17380 (0.0006) [2023-03-06 17:02:55,868][23882] Updated weights for policy 0, policy_version 17390 (0.0007) [2023-03-06 17:02:56,664][23882] Updated weights for policy 0, policy_version 17400 (0.0006) [2023-03-06 17:02:56,748][23556] Fps is (10 sec: 13107.0, 60 sec: 13090.1, 300 sec: 13072.5). Total num frames: 17818624. Throughput: 0: 13070.3. Samples: 17816875. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:02:56,748][23556] Avg episode reward: [(0, '98.375')] [2023-03-06 17:02:57,450][23882] Updated weights for policy 0, policy_version 17410 (0.0007) [2023-03-06 17:02:58,228][23882] Updated weights for policy 0, policy_version 17420 (0.0006) [2023-03-06 17:02:59,006][23882] Updated weights for policy 0, policy_version 17430 (0.0007) [2023-03-06 17:02:59,789][23882] Updated weights for policy 0, policy_version 17440 (0.0006) [2023-03-06 17:03:00,577][23882] Updated weights for policy 0, policy_version 17450 (0.0005) [2023-03-06 17:03:01,378][23882] Updated weights for policy 0, policy_version 17460 (0.0006) [2023-03-06 17:03:01,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13073.1, 300 sec: 13069.0). Total num frames: 17883136. Throughput: 0: 13063.9. Samples: 17855888. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:03:01,748][23556] Avg episode reward: [(0, '115.105')] [2023-03-06 17:03:02,163][23882] Updated weights for policy 0, policy_version 17470 (0.0007) [2023-03-06 17:03:02,930][23882] Updated weights for policy 0, policy_version 17480 (0.0006) [2023-03-06 17:03:03,728][23882] Updated weights for policy 0, policy_version 17490 (0.0006) [2023-03-06 17:03:04,509][23882] Updated weights for policy 0, policy_version 17500 (0.0006) [2023-03-06 17:03:05,289][23882] Updated weights for policy 0, policy_version 17510 (0.0007) [2023-03-06 17:03:06,065][23882] Updated weights for policy 0, policy_version 17520 (0.0005) [2023-03-06 17:03:06,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13073.0, 300 sec: 13069.0). Total num frames: 17948672. Throughput: 0: 13064.4. Samples: 17934215. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:03:06,748][23556] Avg episode reward: [(0, '92.310')] [2023-03-06 17:03:06,845][23882] Updated weights for policy 0, policy_version 17530 (0.0006) [2023-03-06 17:03:07,635][23882] Updated weights for policy 0, policy_version 17540 (0.0007) [2023-03-06 17:03:08,400][23882] Updated weights for policy 0, policy_version 17550 (0.0006) [2023-03-06 17:03:09,179][23882] Updated weights for policy 0, policy_version 17560 (0.0007) [2023-03-06 17:03:09,966][23882] Updated weights for policy 0, policy_version 17570 (0.0007) [2023-03-06 17:03:10,757][23882] Updated weights for policy 0, policy_version 17580 (0.0007) [2023-03-06 17:03:11,544][23882] Updated weights for policy 0, policy_version 17590 (0.0007) [2023-03-06 17:03:11,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13073.0, 300 sec: 13069.0). Total num frames: 18014208. Throughput: 0: 13075.1. Samples: 18012918. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:03:11,749][23556] Avg episode reward: [(0, '65.254')] [2023-03-06 17:03:12,327][23882] Updated weights for policy 0, policy_version 17600 (0.0007) [2023-03-06 17:03:13,114][23882] Updated weights for policy 0, policy_version 17610 (0.0007) [2023-03-06 17:03:13,896][23882] Updated weights for policy 0, policy_version 17620 (0.0006) [2023-03-06 17:03:14,672][23882] Updated weights for policy 0, policy_version 17630 (0.0006) [2023-03-06 17:03:15,465][23882] Updated weights for policy 0, policy_version 17640 (0.0007) [2023-03-06 17:03:16,236][23882] Updated weights for policy 0, policy_version 17650 (0.0007) [2023-03-06 17:03:16,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13073.0, 300 sec: 13069.0). Total num frames: 18079744. Throughput: 0: 13075.1. Samples: 18051940. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:03:16,749][23556] Avg episode reward: [(0, '81.997')] [2023-03-06 17:03:17,011][23882] Updated weights for policy 0, policy_version 17660 (0.0006) [2023-03-06 17:03:17,803][23882] Updated weights for policy 0, policy_version 17670 (0.0007) [2023-03-06 17:03:18,577][23882] Updated weights for policy 0, policy_version 17680 (0.0007) [2023-03-06 17:03:19,360][23882] Updated weights for policy 0, policy_version 17690 (0.0007) [2023-03-06 17:03:20,138][23882] Updated weights for policy 0, policy_version 17700 (0.0006) [2023-03-06 17:03:20,927][23882] Updated weights for policy 0, policy_version 17710 (0.0006) [2023-03-06 17:03:21,713][23882] Updated weights for policy 0, policy_version 17720 (0.0007) [2023-03-06 17:03:21,748][23556] Fps is (10 sec: 13107.4, 60 sec: 13073.1, 300 sec: 13069.0). Total num frames: 18145280. Throughput: 0: 13080.4. Samples: 18130452. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:03:21,748][23556] Avg episode reward: [(0, '83.609')] [2023-03-06 17:03:22,494][23882] Updated weights for policy 0, policy_version 17730 (0.0006) [2023-03-06 17:03:23,318][23882] Updated weights for policy 0, policy_version 17740 (0.0006) [2023-03-06 17:03:24,082][23882] Updated weights for policy 0, policy_version 17750 (0.0006) [2023-03-06 17:03:24,856][23882] Updated weights for policy 0, policy_version 17760 (0.0007) [2023-03-06 17:03:25,648][23882] Updated weights for policy 0, policy_version 17770 (0.0006) [2023-03-06 17:03:26,439][23882] Updated weights for policy 0, policy_version 17780 (0.0006) [2023-03-06 17:03:26,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13065.5). Total num frames: 18209792. Throughput: 0: 13069.8. Samples: 18208642. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:03:26,748][23556] Avg episode reward: [(0, '75.924')] [2023-03-06 17:03:27,225][23882] Updated weights for policy 0, policy_version 17790 (0.0005) [2023-03-06 17:03:28,006][23882] Updated weights for policy 0, policy_version 17800 (0.0007) [2023-03-06 17:03:28,802][23882] Updated weights for policy 0, policy_version 17810 (0.0007) [2023-03-06 17:03:29,578][23882] Updated weights for policy 0, policy_version 17820 (0.0006) [2023-03-06 17:03:30,358][23882] Updated weights for policy 0, policy_version 17830 (0.0006) [2023-03-06 17:03:31,125][23882] Updated weights for policy 0, policy_version 17840 (0.0006) [2023-03-06 17:03:31,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13073.1, 300 sec: 13065.5). Total num frames: 18275328. Throughput: 0: 13065.9. Samples: 18247908. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:03:31,748][23556] Avg episode reward: [(0, '80.154')] [2023-03-06 17:03:31,905][23882] Updated weights for policy 0, policy_version 17850 (0.0006) [2023-03-06 17:03:32,685][23882] Updated weights for policy 0, policy_version 17860 (0.0007) [2023-03-06 17:03:33,468][23882] Updated weights for policy 0, policy_version 17870 (0.0007) [2023-03-06 17:03:34,244][23882] Updated weights for policy 0, policy_version 17880 (0.0006) [2023-03-06 17:03:35,014][23882] Updated weights for policy 0, policy_version 17890 (0.0006) [2023-03-06 17:03:35,790][23882] Updated weights for policy 0, policy_version 17900 (0.0006) [2023-03-06 17:03:36,569][23882] Updated weights for policy 0, policy_version 17910 (0.0006) [2023-03-06 17:03:36,748][23556] Fps is (10 sec: 13209.7, 60 sec: 13090.1, 300 sec: 13069.0). Total num frames: 18341888. Throughput: 0: 13074.4. Samples: 18326845. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:03:36,748][23556] Avg episode reward: [(0, '110.422')] [2023-03-06 17:03:37,344][23882] Updated weights for policy 0, policy_version 17920 (0.0007) [2023-03-06 17:03:38,125][23882] Updated weights for policy 0, policy_version 17930 (0.0006) [2023-03-06 17:03:38,926][23882] Updated weights for policy 0, policy_version 17940 (0.0006) [2023-03-06 17:03:39,733][23882] Updated weights for policy 0, policy_version 17950 (0.0006) [2023-03-06 17:03:40,510][23882] Updated weights for policy 0, policy_version 17960 (0.0006) [2023-03-06 17:03:41,305][23882] Updated weights for policy 0, policy_version 17970 (0.0007) [2023-03-06 17:03:41,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13073.1, 300 sec: 13065.5). Total num frames: 18406400. Throughput: 0: 13071.8. Samples: 18405104. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:03:41,748][23556] Avg episode reward: [(0, '96.188')] [2023-03-06 17:03:42,099][23882] Updated weights for policy 0, policy_version 17980 (0.0007) [2023-03-06 17:03:42,878][23882] Updated weights for policy 0, policy_version 17990 (0.0006) [2023-03-06 17:03:43,658][23882] Updated weights for policy 0, policy_version 18000 (0.0006) [2023-03-06 17:03:44,437][23882] Updated weights for policy 0, policy_version 18010 (0.0006) [2023-03-06 17:03:45,216][23882] Updated weights for policy 0, policy_version 18020 (0.0006) [2023-03-06 17:03:45,989][23882] Updated weights for policy 0, policy_version 18030 (0.0007) [2023-03-06 17:03:46,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13073.0, 300 sec: 13065.5). Total num frames: 18471936. Throughput: 0: 13075.3. Samples: 18444275. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:03:46,748][23556] Avg episode reward: [(0, '113.089')] [2023-03-06 17:03:46,790][23882] Updated weights for policy 0, policy_version 18040 (0.0006) [2023-03-06 17:03:47,561][23882] Updated weights for policy 0, policy_version 18050 (0.0006) [2023-03-06 17:03:48,337][23882] Updated weights for policy 0, policy_version 18060 (0.0007) [2023-03-06 17:03:49,130][23882] Updated weights for policy 0, policy_version 18070 (0.0005) [2023-03-06 17:03:49,906][23882] Updated weights for policy 0, policy_version 18080 (0.0007) [2023-03-06 17:03:50,693][23882] Updated weights for policy 0, policy_version 18090 (0.0007) [2023-03-06 17:03:51,470][23882] Updated weights for policy 0, policy_version 18100 (0.0006) [2023-03-06 17:03:51,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13065.5). Total num frames: 18537472. Throughput: 0: 13079.4. Samples: 18522786. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:03:51,748][23556] Avg episode reward: [(0, '142.698')] [2023-03-06 17:03:51,749][23831] Saving new best policy, reward=142.698! [2023-03-06 17:03:52,261][23882] Updated weights for policy 0, policy_version 18110 (0.0006) [2023-03-06 17:03:53,047][23882] Updated weights for policy 0, policy_version 18120 (0.0006) [2023-03-06 17:03:53,828][23882] Updated weights for policy 0, policy_version 18130 (0.0008) [2023-03-06 17:03:54,595][23882] Updated weights for policy 0, policy_version 18140 (0.0006) [2023-03-06 17:03:55,396][23882] Updated weights for policy 0, policy_version 18150 (0.0007) [2023-03-06 17:03:56,165][23882] Updated weights for policy 0, policy_version 18160 (0.0006) [2023-03-06 17:03:56,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13065.5). Total num frames: 18603008. Throughput: 0: 13076.3. Samples: 18601351. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:03:56,748][23556] Avg episode reward: [(0, '141.370')] [2023-03-06 17:03:56,753][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000018167_18603008.pth... [2023-03-06 17:03:56,784][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000015105_15467520.pth [2023-03-06 17:03:56,950][23882] Updated weights for policy 0, policy_version 18170 (0.0007) [2023-03-06 17:03:57,749][23882] Updated weights for policy 0, policy_version 18180 (0.0006) [2023-03-06 17:03:58,511][23882] Updated weights for policy 0, policy_version 18190 (0.0006) [2023-03-06 17:03:59,272][23882] Updated weights for policy 0, policy_version 18200 (0.0007) [2023-03-06 17:04:00,074][23882] Updated weights for policy 0, policy_version 18210 (0.0006) [2023-03-06 17:04:00,840][23882] Updated weights for policy 0, policy_version 18220 (0.0007) [2023-03-06 17:04:01,628][23882] Updated weights for policy 0, policy_version 18230 (0.0007) [2023-03-06 17:04:01,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13090.1, 300 sec: 13065.6). Total num frames: 18668544. Throughput: 0: 13086.9. Samples: 18640849. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:04:01,748][23556] Avg episode reward: [(0, '154.479')] [2023-03-06 17:04:01,749][23831] Saving new best policy, reward=154.479! [2023-03-06 17:04:02,421][23882] Updated weights for policy 0, policy_version 18240 (0.0006) [2023-03-06 17:04:03,209][23882] Updated weights for policy 0, policy_version 18250 (0.0006) [2023-03-06 17:04:03,991][23882] Updated weights for policy 0, policy_version 18260 (0.0006) [2023-03-06 17:04:04,767][23882] Updated weights for policy 0, policy_version 18270 (0.0006) [2023-03-06 17:04:05,560][23882] Updated weights for policy 0, policy_version 18280 (0.0006) [2023-03-06 17:04:06,340][23882] Updated weights for policy 0, policy_version 18290 (0.0007) [2023-03-06 17:04:06,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13090.1, 300 sec: 13069.0). Total num frames: 18734080. Throughput: 0: 13085.8. Samples: 18719314. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:04:06,748][23556] Avg episode reward: [(0, '153.066')] [2023-03-06 17:04:07,126][23882] Updated weights for policy 0, policy_version 18300 (0.0007) [2023-03-06 17:04:07,900][23882] Updated weights for policy 0, policy_version 18310 (0.0006) [2023-03-06 17:04:08,681][23882] Updated weights for policy 0, policy_version 18320 (0.0007) [2023-03-06 17:04:09,473][23882] Updated weights for policy 0, policy_version 18330 (0.0006) [2023-03-06 17:04:10,247][23882] Updated weights for policy 0, policy_version 18340 (0.0006) [2023-03-06 17:04:11,025][23882] Updated weights for policy 0, policy_version 18350 (0.0006) [2023-03-06 17:04:11,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13090.2, 300 sec: 13065.5). Total num frames: 18799616. Throughput: 0: 13089.1. Samples: 18797652. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:04:11,748][23556] Avg episode reward: [(0, '170.091')] [2023-03-06 17:04:11,749][23831] Saving new best policy, reward=170.091! [2023-03-06 17:04:11,821][23882] Updated weights for policy 0, policy_version 18360 (0.0007) [2023-03-06 17:04:12,623][23882] Updated weights for policy 0, policy_version 18370 (0.0007) [2023-03-06 17:04:13,387][23882] Updated weights for policy 0, policy_version 18380 (0.0006) [2023-03-06 17:04:14,169][23882] Updated weights for policy 0, policy_version 18390 (0.0006) [2023-03-06 17:04:14,946][23882] Updated weights for policy 0, policy_version 18400 (0.0006) [2023-03-06 17:04:15,731][23882] Updated weights for policy 0, policy_version 18410 (0.0006) [2023-03-06 17:04:16,528][23882] Updated weights for policy 0, policy_version 18420 (0.0006) [2023-03-06 17:04:16,748][23556] Fps is (10 sec: 13107.4, 60 sec: 13090.2, 300 sec: 13069.0). Total num frames: 18865152. Throughput: 0: 13089.6. Samples: 18836940. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:04:16,748][23556] Avg episode reward: [(0, '168.138')] [2023-03-06 17:04:17,297][23882] Updated weights for policy 0, policy_version 18430 (0.0007) [2023-03-06 17:04:18,080][23882] Updated weights for policy 0, policy_version 18440 (0.0006) [2023-03-06 17:04:18,858][23882] Updated weights for policy 0, policy_version 18450 (0.0007) [2023-03-06 17:04:19,658][23882] Updated weights for policy 0, policy_version 18460 (0.0006) [2023-03-06 17:04:20,422][23882] Updated weights for policy 0, policy_version 18470 (0.0006) [2023-03-06 17:04:21,224][23882] Updated weights for policy 0, policy_version 18480 (0.0006) [2023-03-06 17:04:21,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13073.1, 300 sec: 13065.5). Total num frames: 18929664. Throughput: 0: 13078.4. Samples: 18915374. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:04:21,748][23556] Avg episode reward: [(0, '160.714')] [2023-03-06 17:04:22,001][23882] Updated weights for policy 0, policy_version 18490 (0.0006) [2023-03-06 17:04:22,778][23882] Updated weights for policy 0, policy_version 18500 (0.0007) [2023-03-06 17:04:23,568][23882] Updated weights for policy 0, policy_version 18510 (0.0007) [2023-03-06 17:04:24,339][23882] Updated weights for policy 0, policy_version 18520 (0.0006) [2023-03-06 17:04:25,127][23882] Updated weights for policy 0, policy_version 18530 (0.0006) [2023-03-06 17:04:25,917][23882] Updated weights for policy 0, policy_version 18540 (0.0006) [2023-03-06 17:04:26,691][23882] Updated weights for policy 0, policy_version 18550 (0.0006) [2023-03-06 17:04:26,748][23556] Fps is (10 sec: 13004.6, 60 sec: 13090.1, 300 sec: 13065.5). Total num frames: 18995200. Throughput: 0: 13087.2. Samples: 18994028. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:04:26,748][23556] Avg episode reward: [(0, '150.224')] [2023-03-06 17:04:27,473][23882] Updated weights for policy 0, policy_version 18560 (0.0007) [2023-03-06 17:04:28,249][23882] Updated weights for policy 0, policy_version 18570 (0.0006) [2023-03-06 17:04:29,030][23882] Updated weights for policy 0, policy_version 18580 (0.0007) [2023-03-06 17:04:29,813][23882] Updated weights for policy 0, policy_version 18590 (0.0007) [2023-03-06 17:04:30,604][23882] Updated weights for policy 0, policy_version 18600 (0.0006) [2023-03-06 17:04:31,386][23882] Updated weights for policy 0, policy_version 18610 (0.0006) [2023-03-06 17:04:31,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13090.1, 300 sec: 13069.0). Total num frames: 19060736. Throughput: 0: 13088.4. Samples: 19033253. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:04:31,748][23556] Avg episode reward: [(0, '152.430')] [2023-03-06 17:04:32,177][23882] Updated weights for policy 0, policy_version 18620 (0.0006) [2023-03-06 17:04:32,954][23882] Updated weights for policy 0, policy_version 18630 (0.0006) [2023-03-06 17:04:33,725][23882] Updated weights for policy 0, policy_version 18640 (0.0007) [2023-03-06 17:04:34,528][23882] Updated weights for policy 0, policy_version 18650 (0.0006) [2023-03-06 17:04:35,301][23882] Updated weights for policy 0, policy_version 18660 (0.0007) [2023-03-06 17:04:36,089][23882] Updated weights for policy 0, policy_version 18670 (0.0006) [2023-03-06 17:04:36,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13069.0). Total num frames: 19126272. Throughput: 0: 13084.1. Samples: 19111569. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:04:36,748][23556] Avg episode reward: [(0, '148.344')] [2023-03-06 17:04:36,868][23882] Updated weights for policy 0, policy_version 18680 (0.0006) [2023-03-06 17:04:37,645][23882] Updated weights for policy 0, policy_version 18690 (0.0006) [2023-03-06 17:04:38,425][23882] Updated weights for policy 0, policy_version 18700 (0.0006) [2023-03-06 17:04:39,198][23882] Updated weights for policy 0, policy_version 18710 (0.0006) [2023-03-06 17:04:39,978][23882] Updated weights for policy 0, policy_version 18720 (0.0006) [2023-03-06 17:04:40,758][23882] Updated weights for policy 0, policy_version 18730 (0.0007) [2023-03-06 17:04:41,538][23882] Updated weights for policy 0, policy_version 18740 (0.0006) [2023-03-06 17:04:41,748][23556] Fps is (10 sec: 13107.0, 60 sec: 13090.1, 300 sec: 13069.0). Total num frames: 19191808. Throughput: 0: 13093.8. Samples: 19190572. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:04:41,748][23556] Avg episode reward: [(0, '159.028')] [2023-03-06 17:04:42,322][23882] Updated weights for policy 0, policy_version 18750 (0.0006) [2023-03-06 17:04:43,108][23882] Updated weights for policy 0, policy_version 18760 (0.0006) [2023-03-06 17:04:43,902][23882] Updated weights for policy 0, policy_version 18770 (0.0006) [2023-03-06 17:04:44,684][23882] Updated weights for policy 0, policy_version 18780 (0.0006) [2023-03-06 17:04:45,445][23882] Updated weights for policy 0, policy_version 18790 (0.0006) [2023-03-06 17:04:46,222][23882] Updated weights for policy 0, policy_version 18800 (0.0006) [2023-03-06 17:04:46,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13090.1, 300 sec: 13069.0). Total num frames: 19257344. Throughput: 0: 13079.0. Samples: 19229405. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:04:46,748][23556] Avg episode reward: [(0, '144.535')] [2023-03-06 17:04:47,022][23882] Updated weights for policy 0, policy_version 18810 (0.0006) [2023-03-06 17:04:47,790][23882] Updated weights for policy 0, policy_version 18820 (0.0006) [2023-03-06 17:04:48,580][23882] Updated weights for policy 0, policy_version 18830 (0.0007) [2023-03-06 17:04:49,377][23882] Updated weights for policy 0, policy_version 18840 (0.0006) [2023-03-06 17:04:50,160][23882] Updated weights for policy 0, policy_version 18850 (0.0006) [2023-03-06 17:04:50,948][23882] Updated weights for policy 0, policy_version 18860 (0.0007) [2023-03-06 17:04:51,705][23882] Updated weights for policy 0, policy_version 18870 (0.0007) [2023-03-06 17:04:51,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13090.1, 300 sec: 13072.5). Total num frames: 19322880. Throughput: 0: 13080.0. Samples: 19307914. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:04:51,748][23556] Avg episode reward: [(0, '149.896')] [2023-03-06 17:04:52,498][23882] Updated weights for policy 0, policy_version 18880 (0.0006) [2023-03-06 17:04:53,281][23882] Updated weights for policy 0, policy_version 18890 (0.0007) [2023-03-06 17:04:54,049][23882] Updated weights for policy 0, policy_version 18900 (0.0005) [2023-03-06 17:04:54,836][23882] Updated weights for policy 0, policy_version 18910 (0.0007) [2023-03-06 17:04:55,636][23882] Updated weights for policy 0, policy_version 18920 (0.0006) [2023-03-06 17:04:56,417][23882] Updated weights for policy 0, policy_version 18930 (0.0007) [2023-03-06 17:04:56,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13090.1, 300 sec: 13072.5). Total num frames: 19388416. Throughput: 0: 13084.4. Samples: 19386454. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:04:56,749][23556] Avg episode reward: [(0, '162.003')] [2023-03-06 17:04:57,203][23882] Updated weights for policy 0, policy_version 18940 (0.0007) [2023-03-06 17:04:57,982][23882] Updated weights for policy 0, policy_version 18950 (0.0006) [2023-03-06 17:04:58,782][23882] Updated weights for policy 0, policy_version 18960 (0.0008) [2023-03-06 17:04:59,553][23882] Updated weights for policy 0, policy_version 18970 (0.0006) [2023-03-06 17:05:00,333][23882] Updated weights for policy 0, policy_version 18980 (0.0008) [2023-03-06 17:05:01,110][23882] Updated weights for policy 0, policy_version 18990 (0.0007) [2023-03-06 17:05:01,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13090.1, 300 sec: 13072.5). Total num frames: 19453952. Throughput: 0: 13085.9. Samples: 19425809. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:05:01,748][23556] Avg episode reward: [(0, '127.068')] [2023-03-06 17:05:01,897][23882] Updated weights for policy 0, policy_version 19000 (0.0007) [2023-03-06 17:05:02,671][23882] Updated weights for policy 0, policy_version 19010 (0.0006) [2023-03-06 17:05:03,465][23882] Updated weights for policy 0, policy_version 19020 (0.0006) [2023-03-06 17:05:04,250][23882] Updated weights for policy 0, policy_version 19030 (0.0006) [2023-03-06 17:05:05,037][23882] Updated weights for policy 0, policy_version 19040 (0.0007) [2023-03-06 17:05:05,826][23882] Updated weights for policy 0, policy_version 19050 (0.0007) [2023-03-06 17:05:06,604][23882] Updated weights for policy 0, policy_version 19060 (0.0007) [2023-03-06 17:05:06,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13073.0, 300 sec: 13072.5). Total num frames: 19518464. Throughput: 0: 13084.7. Samples: 19504186. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:05:06,749][23556] Avg episode reward: [(0, '147.338')] [2023-03-06 17:05:07,378][23882] Updated weights for policy 0, policy_version 19070 (0.0006) [2023-03-06 17:05:08,170][23882] Updated weights for policy 0, policy_version 19080 (0.0007) [2023-03-06 17:05:08,938][23882] Updated weights for policy 0, policy_version 19090 (0.0007) [2023-03-06 17:05:09,734][23882] Updated weights for policy 0, policy_version 19100 (0.0006) [2023-03-06 17:05:10,511][23882] Updated weights for policy 0, policy_version 19110 (0.0007) [2023-03-06 17:05:11,297][23882] Updated weights for policy 0, policy_version 19120 (0.0006) [2023-03-06 17:05:11,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13073.1, 300 sec: 13072.5). Total num frames: 19584000. Throughput: 0: 13082.8. Samples: 19582752. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:05:11,748][23556] Avg episode reward: [(0, '126.736')] [2023-03-06 17:05:12,083][23882] Updated weights for policy 0, policy_version 19130 (0.0008) [2023-03-06 17:05:12,867][23882] Updated weights for policy 0, policy_version 19140 (0.0006) [2023-03-06 17:05:13,643][23882] Updated weights for policy 0, policy_version 19150 (0.0007) [2023-03-06 17:05:14,427][23882] Updated weights for policy 0, policy_version 19160 (0.0006) [2023-03-06 17:05:15,213][23882] Updated weights for policy 0, policy_version 19170 (0.0007) [2023-03-06 17:05:15,999][23882] Updated weights for policy 0, policy_version 19180 (0.0006) [2023-03-06 17:05:16,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13073.0, 300 sec: 13072.5). Total num frames: 19649536. Throughput: 0: 13083.6. Samples: 19622016. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:05:16,748][23556] Avg episode reward: [(0, '134.612')] [2023-03-06 17:05:16,771][23882] Updated weights for policy 0, policy_version 19190 (0.0005) [2023-03-06 17:05:17,544][23882] Updated weights for policy 0, policy_version 19200 (0.0007) [2023-03-06 17:05:18,342][23882] Updated weights for policy 0, policy_version 19210 (0.0006) [2023-03-06 17:05:19,117][23882] Updated weights for policy 0, policy_version 19220 (0.0007) [2023-03-06 17:05:19,905][23882] Updated weights for policy 0, policy_version 19230 (0.0007) [2023-03-06 17:05:20,685][23882] Updated weights for policy 0, policy_version 19240 (0.0006) [2023-03-06 17:05:21,486][23882] Updated weights for policy 0, policy_version 19250 (0.0007) [2023-03-06 17:05:21,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13090.1, 300 sec: 13076.0). Total num frames: 19715072. Throughput: 0: 13088.9. Samples: 19700571. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:05:21,748][23556] Avg episode reward: [(0, '139.430')] [2023-03-06 17:05:22,264][23882] Updated weights for policy 0, policy_version 19260 (0.0007) [2023-03-06 17:05:23,019][23882] Updated weights for policy 0, policy_version 19270 (0.0006) [2023-03-06 17:05:23,802][23882] Updated weights for policy 0, policy_version 19280 (0.0006) [2023-03-06 17:05:24,590][23882] Updated weights for policy 0, policy_version 19290 (0.0007) [2023-03-06 17:05:25,378][23882] Updated weights for policy 0, policy_version 19300 (0.0006) [2023-03-06 17:05:26,154][23882] Updated weights for policy 0, policy_version 19310 (0.0007) [2023-03-06 17:05:26,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13090.1, 300 sec: 13076.0). Total num frames: 19780608. Throughput: 0: 13079.3. Samples: 19779141. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:05:26,748][23556] Avg episode reward: [(0, '121.583')] [2023-03-06 17:05:26,935][23882] Updated weights for policy 0, policy_version 19320 (0.0006) [2023-03-06 17:05:27,714][23882] Updated weights for policy 0, policy_version 19330 (0.0007) [2023-03-06 17:05:28,489][23882] Updated weights for policy 0, policy_version 19340 (0.0005) [2023-03-06 17:05:29,279][23882] Updated weights for policy 0, policy_version 19350 (0.0006) [2023-03-06 17:05:30,061][23882] Updated weights for policy 0, policy_version 19360 (0.0006) [2023-03-06 17:05:30,860][23882] Updated weights for policy 0, policy_version 19370 (0.0007) [2023-03-06 17:05:31,629][23882] Updated weights for policy 0, policy_version 19380 (0.0006) [2023-03-06 17:05:31,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13090.1, 300 sec: 13076.0). Total num frames: 19846144. Throughput: 0: 13089.9. Samples: 19818452. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:05:31,749][23556] Avg episode reward: [(0, '144.664')] [2023-03-06 17:05:32,405][23882] Updated weights for policy 0, policy_version 19390 (0.0007) [2023-03-06 17:05:33,184][23882] Updated weights for policy 0, policy_version 19400 (0.0006) [2023-03-06 17:05:33,962][23882] Updated weights for policy 0, policy_version 19410 (0.0006) [2023-03-06 17:05:34,756][23882] Updated weights for policy 0, policy_version 19420 (0.0006) [2023-03-06 17:05:35,531][23882] Updated weights for policy 0, policy_version 19430 (0.0006) [2023-03-06 17:05:36,330][23882] Updated weights for policy 0, policy_version 19440 (0.0006) [2023-03-06 17:05:36,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13090.1, 300 sec: 13076.0). Total num frames: 19911680. Throughput: 0: 13090.3. Samples: 19896979. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:05:36,748][23556] Avg episode reward: [(0, '146.534')] [2023-03-06 17:05:37,105][23882] Updated weights for policy 0, policy_version 19450 (0.0007) [2023-03-06 17:05:37,878][23882] Updated weights for policy 0, policy_version 19460 (0.0005) [2023-03-06 17:05:38,677][23882] Updated weights for policy 0, policy_version 19470 (0.0006) [2023-03-06 17:05:39,455][23882] Updated weights for policy 0, policy_version 19480 (0.0007) [2023-03-06 17:05:40,244][23882] Updated weights for policy 0, policy_version 19490 (0.0007) [2023-03-06 17:05:41,034][23882] Updated weights for policy 0, policy_version 19500 (0.0007) [2023-03-06 17:05:41,748][23556] Fps is (10 sec: 13107.4, 60 sec: 13090.2, 300 sec: 13076.0). Total num frames: 19977216. Throughput: 0: 13087.0. Samples: 19975366. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:05:41,748][23556] Avg episode reward: [(0, '127.891')] [2023-03-06 17:05:41,814][23882] Updated weights for policy 0, policy_version 19510 (0.0006) [2023-03-06 17:05:42,606][23882] Updated weights for policy 0, policy_version 19520 (0.0006) [2023-03-06 17:05:43,382][23882] Updated weights for policy 0, policy_version 19530 (0.0006) [2023-03-06 17:05:44,160][23882] Updated weights for policy 0, policy_version 19540 (0.0007) [2023-03-06 17:05:44,936][23882] Updated weights for policy 0, policy_version 19550 (0.0006) [2023-03-06 17:05:45,722][23882] Updated weights for policy 0, policy_version 19560 (0.0007) [2023-03-06 17:05:46,493][23882] Updated weights for policy 0, policy_version 19570 (0.0006) [2023-03-06 17:05:46,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13090.1, 300 sec: 13076.0). Total num frames: 20042752. Throughput: 0: 13087.9. Samples: 20014766. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:05:46,748][23556] Avg episode reward: [(0, '142.490')] [2023-03-06 17:05:47,262][23882] Updated weights for policy 0, policy_version 19580 (0.0006) [2023-03-06 17:05:48,042][23882] Updated weights for policy 0, policy_version 19590 (0.0007) [2023-03-06 17:05:48,833][23882] Updated weights for policy 0, policy_version 19600 (0.0006) [2023-03-06 17:05:49,608][23882] Updated weights for policy 0, policy_version 19610 (0.0006) [2023-03-06 17:05:50,393][23882] Updated weights for policy 0, policy_version 19620 (0.0006) [2023-03-06 17:05:51,181][23882] Updated weights for policy 0, policy_version 19630 (0.0007) [2023-03-06 17:05:51,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13073.1, 300 sec: 13076.0). Total num frames: 20107264. Throughput: 0: 13094.0. Samples: 20093413. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:05:51,748][23556] Avg episode reward: [(0, '139.844')] [2023-03-06 17:05:51,968][23882] Updated weights for policy 0, policy_version 19640 (0.0006) [2023-03-06 17:05:52,748][23882] Updated weights for policy 0, policy_version 19650 (0.0007) [2023-03-06 17:05:53,534][23882] Updated weights for policy 0, policy_version 19660 (0.0006) [2023-03-06 17:05:54,312][23882] Updated weights for policy 0, policy_version 19670 (0.0006) [2023-03-06 17:05:55,095][23882] Updated weights for policy 0, policy_version 19680 (0.0006) [2023-03-06 17:05:55,878][23882] Updated weights for policy 0, policy_version 19690 (0.0005) [2023-03-06 17:05:56,669][23882] Updated weights for policy 0, policy_version 19700 (0.0008) [2023-03-06 17:05:56,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13073.1, 300 sec: 13076.0). Total num frames: 20172800. Throughput: 0: 13088.1. Samples: 20171719. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:05:56,748][23556] Avg episode reward: [(0, '161.744')] [2023-03-06 17:05:56,754][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000019701_20173824.pth... [2023-03-06 17:05:56,785][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000016635_17034240.pth [2023-03-06 17:05:57,428][23882] Updated weights for policy 0, policy_version 19710 (0.0006) [2023-03-06 17:05:58,227][23882] Updated weights for policy 0, policy_version 19720 (0.0006) [2023-03-06 17:05:59,025][23882] Updated weights for policy 0, policy_version 19730 (0.0006) [2023-03-06 17:05:59,805][23882] Updated weights for policy 0, policy_version 19740 (0.0006) [2023-03-06 17:06:00,582][23882] Updated weights for policy 0, policy_version 19750 (0.0007) [2023-03-06 17:06:01,366][23882] Updated weights for policy 0, policy_version 19760 (0.0006) [2023-03-06 17:06:01,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13076.0). Total num frames: 20238336. Throughput: 0: 13085.6. Samples: 20210868. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:06:01,748][23556] Avg episode reward: [(0, '170.019')] [2023-03-06 17:06:02,141][23882] Updated weights for policy 0, policy_version 19770 (0.0005) [2023-03-06 17:06:02,922][23882] Updated weights for policy 0, policy_version 19780 (0.0007) [2023-03-06 17:06:03,709][23882] Updated weights for policy 0, policy_version 19790 (0.0006) [2023-03-06 17:06:04,487][23882] Updated weights for policy 0, policy_version 19800 (0.0007) [2023-03-06 17:06:05,285][23882] Updated weights for policy 0, policy_version 19810 (0.0006) [2023-03-06 17:06:06,059][23882] Updated weights for policy 0, policy_version 19820 (0.0007) [2023-03-06 17:06:06,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13090.2, 300 sec: 13076.0). Total num frames: 20303872. Throughput: 0: 13084.3. Samples: 20289361. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:06:06,748][23556] Avg episode reward: [(0, '150.801')] [2023-03-06 17:06:06,856][23882] Updated weights for policy 0, policy_version 19830 (0.0007) [2023-03-06 17:06:07,629][23882] Updated weights for policy 0, policy_version 19840 (0.0007) [2023-03-06 17:06:08,434][23882] Updated weights for policy 0, policy_version 19850 (0.0007) [2023-03-06 17:06:09,213][23882] Updated weights for policy 0, policy_version 19860 (0.0006) [2023-03-06 17:06:10,001][23882] Updated weights for policy 0, policy_version 19870 (0.0007) [2023-03-06 17:06:10,761][23882] Updated weights for policy 0, policy_version 19880 (0.0007) [2023-03-06 17:06:11,547][23882] Updated weights for policy 0, policy_version 19890 (0.0007) [2023-03-06 17:06:11,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13090.1, 300 sec: 13076.0). Total num frames: 20369408. Throughput: 0: 13078.0. Samples: 20367651. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:06:11,748][23556] Avg episode reward: [(0, '150.456')] [2023-03-06 17:06:12,348][23882] Updated weights for policy 0, policy_version 19900 (0.0005) [2023-03-06 17:06:13,134][23882] Updated weights for policy 0, policy_version 19910 (0.0006) [2023-03-06 17:06:13,918][23882] Updated weights for policy 0, policy_version 19920 (0.0006) [2023-03-06 17:06:14,681][23882] Updated weights for policy 0, policy_version 19930 (0.0006) [2023-03-06 17:06:15,472][23882] Updated weights for policy 0, policy_version 19940 (0.0006) [2023-03-06 17:06:16,259][23882] Updated weights for policy 0, policy_version 19950 (0.0007) [2023-03-06 17:06:16,748][23556] Fps is (10 sec: 13107.0, 60 sec: 13090.1, 300 sec: 13076.0). Total num frames: 20434944. Throughput: 0: 13077.4. Samples: 20406933. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:06:16,748][23556] Avg episode reward: [(0, '148.168')] [2023-03-06 17:06:17,049][23882] Updated weights for policy 0, policy_version 19960 (0.0007) [2023-03-06 17:06:17,829][23882] Updated weights for policy 0, policy_version 19970 (0.0007) [2023-03-06 17:06:18,597][23882] Updated weights for policy 0, policy_version 19980 (0.0006) [2023-03-06 17:06:19,394][23882] Updated weights for policy 0, policy_version 19990 (0.0006) [2023-03-06 17:06:20,172][23882] Updated weights for policy 0, policy_version 20000 (0.0006) [2023-03-06 17:06:20,967][23882] Updated weights for policy 0, policy_version 20010 (0.0007) [2023-03-06 17:06:21,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13073.1, 300 sec: 13076.0). Total num frames: 20499456. Throughput: 0: 13076.0. Samples: 20485399. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:06:21,748][23556] Avg episode reward: [(0, '132.499')] [2023-03-06 17:06:21,750][23882] Updated weights for policy 0, policy_version 20020 (0.0006) [2023-03-06 17:06:22,537][23882] Updated weights for policy 0, policy_version 20030 (0.0008) [2023-03-06 17:06:23,318][23882] Updated weights for policy 0, policy_version 20040 (0.0006) [2023-03-06 17:06:24,085][23882] Updated weights for policy 0, policy_version 20050 (0.0006) [2023-03-06 17:06:24,886][23882] Updated weights for policy 0, policy_version 20060 (0.0006) [2023-03-06 17:06:25,649][23882] Updated weights for policy 0, policy_version 20070 (0.0007) [2023-03-06 17:06:26,426][23882] Updated weights for policy 0, policy_version 20080 (0.0006) [2023-03-06 17:06:26,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13090.1, 300 sec: 13079.4). Total num frames: 20566016. Throughput: 0: 13081.5. Samples: 20564032. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:06:26,748][23556] Avg episode reward: [(0, '141.393')] [2023-03-06 17:06:27,201][23882] Updated weights for policy 0, policy_version 20090 (0.0007) [2023-03-06 17:06:27,996][23882] Updated weights for policy 0, policy_version 20100 (0.0007) [2023-03-06 17:06:28,782][23882] Updated weights for policy 0, policy_version 20110 (0.0006) [2023-03-06 17:06:29,554][23882] Updated weights for policy 0, policy_version 20120 (0.0006) [2023-03-06 17:06:30,362][23882] Updated weights for policy 0, policy_version 20130 (0.0006) [2023-03-06 17:06:31,135][23882] Updated weights for policy 0, policy_version 20140 (0.0007) [2023-03-06 17:06:31,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13076.0). Total num frames: 20630528. Throughput: 0: 13074.9. Samples: 20603138. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:06:31,748][23556] Avg episode reward: [(0, '157.462')] [2023-03-06 17:06:31,919][23882] Updated weights for policy 0, policy_version 20150 (0.0006) [2023-03-06 17:06:32,707][23882] Updated weights for policy 0, policy_version 20160 (0.0007) [2023-03-06 17:06:33,500][23882] Updated weights for policy 0, policy_version 20170 (0.0007) [2023-03-06 17:06:34,268][23882] Updated weights for policy 0, policy_version 20180 (0.0006) [2023-03-06 17:06:35,074][23882] Updated weights for policy 0, policy_version 20190 (0.0006) [2023-03-06 17:06:35,845][23882] Updated weights for policy 0, policy_version 20200 (0.0006) [2023-03-06 17:06:36,613][23882] Updated weights for policy 0, policy_version 20210 (0.0007) [2023-03-06 17:06:36,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13073.1, 300 sec: 13079.4). Total num frames: 20696064. Throughput: 0: 13066.8. Samples: 20681420. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:06:36,748][23556] Avg episode reward: [(0, '143.989')] [2023-03-06 17:06:37,388][23882] Updated weights for policy 0, policy_version 20220 (0.0006) [2023-03-06 17:06:38,162][23882] Updated weights for policy 0, policy_version 20230 (0.0007) [2023-03-06 17:06:38,959][23882] Updated weights for policy 0, policy_version 20240 (0.0006) [2023-03-06 17:06:39,729][23882] Updated weights for policy 0, policy_version 20250 (0.0006) [2023-03-06 17:06:40,512][23882] Updated weights for policy 0, policy_version 20260 (0.0005) [2023-03-06 17:06:41,298][23882] Updated weights for policy 0, policy_version 20270 (0.0006) [2023-03-06 17:06:41,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13073.0, 300 sec: 13079.4). Total num frames: 20761600. Throughput: 0: 13081.4. Samples: 20760382. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:06:41,748][23556] Avg episode reward: [(0, '141.433')] [2023-03-06 17:06:42,076][23882] Updated weights for policy 0, policy_version 20280 (0.0008) [2023-03-06 17:06:42,865][23882] Updated weights for policy 0, policy_version 20290 (0.0006) [2023-03-06 17:06:43,638][23882] Updated weights for policy 0, policy_version 20300 (0.0006) [2023-03-06 17:06:44,448][23882] Updated weights for policy 0, policy_version 20310 (0.0006) [2023-03-06 17:06:45,246][23882] Updated weights for policy 0, policy_version 20320 (0.0006) [2023-03-06 17:06:46,022][23882] Updated weights for policy 0, policy_version 20330 (0.0006) [2023-03-06 17:06:46,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13073.1, 300 sec: 13079.4). Total num frames: 20827136. Throughput: 0: 13079.3. Samples: 20799439. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:06:46,748][23556] Avg episode reward: [(0, '145.103')] [2023-03-06 17:06:46,807][23882] Updated weights for policy 0, policy_version 20340 (0.0006) [2023-03-06 17:06:47,594][23882] Updated weights for policy 0, policy_version 20350 (0.0006) [2023-03-06 17:06:48,368][23882] Updated weights for policy 0, policy_version 20360 (0.0007) [2023-03-06 17:06:49,156][23882] Updated weights for policy 0, policy_version 20370 (0.0006) [2023-03-06 17:06:49,923][23882] Updated weights for policy 0, policy_version 20380 (0.0006) [2023-03-06 17:06:50,706][23882] Updated weights for policy 0, policy_version 20390 (0.0006) [2023-03-06 17:06:51,504][23882] Updated weights for policy 0, policy_version 20400 (0.0007) [2023-03-06 17:06:51,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13073.1, 300 sec: 13079.4). Total num frames: 20891648. Throughput: 0: 13073.3. Samples: 20877661. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:06:51,748][23556] Avg episode reward: [(0, '152.061')] [2023-03-06 17:06:52,279][23882] Updated weights for policy 0, policy_version 20410 (0.0005) [2023-03-06 17:06:53,074][23882] Updated weights for policy 0, policy_version 20420 (0.0007) [2023-03-06 17:06:53,850][23882] Updated weights for policy 0, policy_version 20430 (0.0007) [2023-03-06 17:06:54,612][23882] Updated weights for policy 0, policy_version 20440 (0.0006) [2023-03-06 17:06:55,418][23882] Updated weights for policy 0, policy_version 20450 (0.0006) [2023-03-06 17:06:56,177][23882] Updated weights for policy 0, policy_version 20460 (0.0008) [2023-03-06 17:06:56,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13090.1, 300 sec: 13082.9). Total num frames: 20958208. Throughput: 0: 13084.1. Samples: 20956433. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:06:56,748][23556] Avg episode reward: [(0, '152.060')] [2023-03-06 17:06:56,951][23882] Updated weights for policy 0, policy_version 20470 (0.0006) [2023-03-06 17:06:57,745][23882] Updated weights for policy 0, policy_version 20480 (0.0007) [2023-03-06 17:06:58,503][23882] Updated weights for policy 0, policy_version 20490 (0.0006) [2023-03-06 17:06:59,294][23882] Updated weights for policy 0, policy_version 20500 (0.0007) [2023-03-06 17:07:00,079][23882] Updated weights for policy 0, policy_version 20510 (0.0005) [2023-03-06 17:07:00,854][23882] Updated weights for policy 0, policy_version 20520 (0.0006) [2023-03-06 17:07:01,622][23882] Updated weights for policy 0, policy_version 20530 (0.0007) [2023-03-06 17:07:01,748][23556] Fps is (10 sec: 13209.6, 60 sec: 13090.1, 300 sec: 13082.9). Total num frames: 21023744. Throughput: 0: 13088.9. Samples: 20995933. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:07:01,748][23556] Avg episode reward: [(0, '151.914')] [2023-03-06 17:07:02,406][23882] Updated weights for policy 0, policy_version 20540 (0.0006) [2023-03-06 17:07:03,187][23882] Updated weights for policy 0, policy_version 20550 (0.0006) [2023-03-06 17:07:03,978][23882] Updated weights for policy 0, policy_version 20560 (0.0007) [2023-03-06 17:07:04,745][23882] Updated weights for policy 0, policy_version 20570 (0.0006) [2023-03-06 17:07:05,538][23882] Updated weights for policy 0, policy_version 20580 (0.0007) [2023-03-06 17:07:06,320][23882] Updated weights for policy 0, policy_version 20590 (0.0006) [2023-03-06 17:07:06,748][23556] Fps is (10 sec: 13107.0, 60 sec: 13090.1, 300 sec: 13082.9). Total num frames: 21089280. Throughput: 0: 13092.0. Samples: 21074541. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:07:06,749][23556] Avg episode reward: [(0, '134.815')] [2023-03-06 17:07:07,095][23882] Updated weights for policy 0, policy_version 20600 (0.0006) [2023-03-06 17:07:07,871][23882] Updated weights for policy 0, policy_version 20610 (0.0007) [2023-03-06 17:07:08,642][23882] Updated weights for policy 0, policy_version 20620 (0.0006) [2023-03-06 17:07:09,428][23882] Updated weights for policy 0, policy_version 20630 (0.0006) [2023-03-06 17:07:10,191][23882] Updated weights for policy 0, policy_version 20640 (0.0007) [2023-03-06 17:07:10,985][23882] Updated weights for policy 0, policy_version 20650 (0.0006) [2023-03-06 17:07:11,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13090.1, 300 sec: 13082.9). Total num frames: 21154816. Throughput: 0: 13102.4. Samples: 21153643. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:07:11,748][23556] Avg episode reward: [(0, '142.599')] [2023-03-06 17:07:11,753][23882] Updated weights for policy 0, policy_version 20660 (0.0007) [2023-03-06 17:07:12,533][23882] Updated weights for policy 0, policy_version 20670 (0.0006) [2023-03-06 17:07:13,321][23882] Updated weights for policy 0, policy_version 20680 (0.0006) [2023-03-06 17:07:14,091][23882] Updated weights for policy 0, policy_version 20690 (0.0006) [2023-03-06 17:07:14,877][23882] Updated weights for policy 0, policy_version 20700 (0.0005) [2023-03-06 17:07:15,636][23882] Updated weights for policy 0, policy_version 20710 (0.0007) [2023-03-06 17:07:16,426][23882] Updated weights for policy 0, policy_version 20720 (0.0006) [2023-03-06 17:07:16,748][23556] Fps is (10 sec: 13209.7, 60 sec: 13107.2, 300 sec: 13086.4). Total num frames: 21221376. Throughput: 0: 13107.9. Samples: 21192994. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:07:16,748][23556] Avg episode reward: [(0, '138.596')] [2023-03-06 17:07:17,200][23882] Updated weights for policy 0, policy_version 20730 (0.0006) [2023-03-06 17:07:17,975][23882] Updated weights for policy 0, policy_version 20740 (0.0006) [2023-03-06 17:07:18,745][23882] Updated weights for policy 0, policy_version 20750 (0.0007) [2023-03-06 17:07:19,541][23882] Updated weights for policy 0, policy_version 20760 (0.0006) [2023-03-06 17:07:20,325][23882] Updated weights for policy 0, policy_version 20770 (0.0006) [2023-03-06 17:07:21,108][23882] Updated weights for policy 0, policy_version 20780 (0.0007) [2023-03-06 17:07:21,748][23556] Fps is (10 sec: 13209.6, 60 sec: 13124.3, 300 sec: 13086.4). Total num frames: 21286912. Throughput: 0: 13123.4. Samples: 21271974. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:07:21,748][23556] Avg episode reward: [(0, '143.253')] [2023-03-06 17:07:21,894][23882] Updated weights for policy 0, policy_version 20790 (0.0006) [2023-03-06 17:07:22,673][23882] Updated weights for policy 0, policy_version 20800 (0.0006) [2023-03-06 17:07:23,443][23882] Updated weights for policy 0, policy_version 20810 (0.0006) [2023-03-06 17:07:24,225][23882] Updated weights for policy 0, policy_version 20820 (0.0006) [2023-03-06 17:07:24,994][23882] Updated weights for policy 0, policy_version 20830 (0.0006) [2023-03-06 17:07:25,769][23882] Updated weights for policy 0, policy_version 20840 (0.0007) [2023-03-06 17:07:26,569][23882] Updated weights for policy 0, policy_version 20850 (0.0006) [2023-03-06 17:07:26,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13089.9). Total num frames: 21352448. Throughput: 0: 13121.8. Samples: 21350863. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:07:26,748][23556] Avg episode reward: [(0, '120.643')] [2023-03-06 17:07:27,325][23882] Updated weights for policy 0, policy_version 20860 (0.0005) [2023-03-06 17:07:28,115][23882] Updated weights for policy 0, policy_version 20870 (0.0006) [2023-03-06 17:07:28,885][23882] Updated weights for policy 0, policy_version 20880 (0.0006) [2023-03-06 17:07:29,669][23882] Updated weights for policy 0, policy_version 20890 (0.0006) [2023-03-06 17:07:30,458][23882] Updated weights for policy 0, policy_version 20900 (0.0007) [2023-03-06 17:07:31,244][23882] Updated weights for policy 0, policy_version 20910 (0.0007) [2023-03-06 17:07:31,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13089.8). Total num frames: 21417984. Throughput: 0: 13130.8. Samples: 21390327. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:07:31,748][23556] Avg episode reward: [(0, '103.310')] [2023-03-06 17:07:32,037][23882] Updated weights for policy 0, policy_version 20920 (0.0005) [2023-03-06 17:07:32,821][23882] Updated weights for policy 0, policy_version 20930 (0.0006) [2023-03-06 17:07:33,602][23882] Updated weights for policy 0, policy_version 20940 (0.0007) [2023-03-06 17:07:34,392][23882] Updated weights for policy 0, policy_version 20950 (0.0007) [2023-03-06 17:07:35,168][23882] Updated weights for policy 0, policy_version 20960 (0.0006) [2023-03-06 17:07:35,953][23882] Updated weights for policy 0, policy_version 20970 (0.0005) [2023-03-06 17:07:36,742][23882] Updated weights for policy 0, policy_version 20980 (0.0007) [2023-03-06 17:07:36,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13124.3, 300 sec: 13089.8). Total num frames: 21483520. Throughput: 0: 13132.9. Samples: 21468640. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:07:36,748][23556] Avg episode reward: [(0, '93.193')] [2023-03-06 17:07:37,502][23882] Updated weights for policy 0, policy_version 20990 (0.0006) [2023-03-06 17:07:38,289][23882] Updated weights for policy 0, policy_version 21000 (0.0007) [2023-03-06 17:07:39,060][23882] Updated weights for policy 0, policy_version 21010 (0.0006) [2023-03-06 17:07:39,853][23882] Updated weights for policy 0, policy_version 21020 (0.0006) [2023-03-06 17:07:40,626][23882] Updated weights for policy 0, policy_version 21030 (0.0007) [2023-03-06 17:07:41,432][23882] Updated weights for policy 0, policy_version 21040 (0.0007) [2023-03-06 17:07:41,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13089.8). Total num frames: 21549056. Throughput: 0: 13127.4. Samples: 21547168. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:07:41,748][23556] Avg episode reward: [(0, '63.957')] [2023-03-06 17:07:42,196][23882] Updated weights for policy 0, policy_version 21050 (0.0005) [2023-03-06 17:07:42,980][23882] Updated weights for policy 0, policy_version 21060 (0.0007) [2023-03-06 17:07:43,759][23882] Updated weights for policy 0, policy_version 21070 (0.0007) [2023-03-06 17:07:44,549][23882] Updated weights for policy 0, policy_version 21080 (0.0008) [2023-03-06 17:07:45,310][23882] Updated weights for policy 0, policy_version 21090 (0.0007) [2023-03-06 17:07:46,095][23882] Updated weights for policy 0, policy_version 21100 (0.0006) [2023-03-06 17:07:46,748][23556] Fps is (10 sec: 13107.0, 60 sec: 13124.2, 300 sec: 13089.8). Total num frames: 21614592. Throughput: 0: 13126.4. Samples: 21586623. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:07:46,749][23556] Avg episode reward: [(0, '49.245')] [2023-03-06 17:07:46,876][23882] Updated weights for policy 0, policy_version 21110 (0.0006) [2023-03-06 17:07:47,653][23882] Updated weights for policy 0, policy_version 21120 (0.0006) [2023-03-06 17:07:48,449][23882] Updated weights for policy 0, policy_version 21130 (0.0006) [2023-03-06 17:07:49,228][23882] Updated weights for policy 0, policy_version 21140 (0.0006) [2023-03-06 17:07:50,011][23882] Updated weights for policy 0, policy_version 21150 (0.0006) [2023-03-06 17:07:50,791][23882] Updated weights for policy 0, policy_version 21160 (0.0006) [2023-03-06 17:07:51,572][23882] Updated weights for policy 0, policy_version 21170 (0.0006) [2023-03-06 17:07:51,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13089.8). Total num frames: 21680128. Throughput: 0: 13127.7. Samples: 21665287. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:07:51,748][23556] Avg episode reward: [(0, '60.149')] [2023-03-06 17:07:52,346][23882] Updated weights for policy 0, policy_version 21180 (0.0006) [2023-03-06 17:07:53,128][23882] Updated weights for policy 0, policy_version 21190 (0.0006) [2023-03-06 17:07:53,913][23882] Updated weights for policy 0, policy_version 21200 (0.0007) [2023-03-06 17:07:54,695][23882] Updated weights for policy 0, policy_version 21210 (0.0006) [2023-03-06 17:07:55,472][23882] Updated weights for policy 0, policy_version 21220 (0.0005) [2023-03-06 17:07:56,250][23882] Updated weights for policy 0, policy_version 21230 (0.0006) [2023-03-06 17:07:56,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13124.2, 300 sec: 13093.3). Total num frames: 21745664. Throughput: 0: 13117.7. Samples: 21743943. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:07:56,749][23556] Avg episode reward: [(0, '66.936')] [2023-03-06 17:07:56,753][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000021236_21745664.pth... [2023-03-06 17:07:56,785][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000018167_18603008.pth [2023-03-06 17:07:57,030][23882] Updated weights for policy 0, policy_version 21240 (0.0006) [2023-03-06 17:07:57,800][23882] Updated weights for policy 0, policy_version 21250 (0.0006) [2023-03-06 17:07:58,568][23882] Updated weights for policy 0, policy_version 21260 (0.0007) [2023-03-06 17:07:59,350][23882] Updated weights for policy 0, policy_version 21270 (0.0006) [2023-03-06 17:08:00,138][23882] Updated weights for policy 0, policy_version 21280 (0.0006) [2023-03-06 17:08:00,928][23882] Updated weights for policy 0, policy_version 21290 (0.0006) [2023-03-06 17:08:01,709][23882] Updated weights for policy 0, policy_version 21300 (0.0005) [2023-03-06 17:08:01,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13093.3). Total num frames: 21811200. Throughput: 0: 13123.6. Samples: 21783555. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-06 17:08:01,748][23556] Avg episode reward: [(0, '40.122')] [2023-03-06 17:08:02,483][23882] Updated weights for policy 0, policy_version 21310 (0.0006) [2023-03-06 17:08:03,233][23882] Updated weights for policy 0, policy_version 21320 (0.0006) [2023-03-06 17:08:04,025][23882] Updated weights for policy 0, policy_version 21330 (0.0006) [2023-03-06 17:08:04,792][23882] Updated weights for policy 0, policy_version 21340 (0.0006) [2023-03-06 17:08:05,577][23882] Updated weights for policy 0, policy_version 21350 (0.0006) [2023-03-06 17:08:06,350][23882] Updated weights for policy 0, policy_version 21360 (0.0006) [2023-03-06 17:08:06,748][23556] Fps is (10 sec: 13107.5, 60 sec: 13124.3, 300 sec: 13093.3). Total num frames: 21876736. Throughput: 0: 13122.6. Samples: 21862489. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-06 17:08:06,748][23556] Avg episode reward: [(0, '35.130')] [2023-03-06 17:08:07,142][23882] Updated weights for policy 0, policy_version 21370 (0.0006) [2023-03-06 17:08:07,929][23882] Updated weights for policy 0, policy_version 21380 (0.0006) [2023-03-06 17:08:08,696][23882] Updated weights for policy 0, policy_version 21390 (0.0007) [2023-03-06 17:08:09,482][23882] Updated weights for policy 0, policy_version 21400 (0.0006) [2023-03-06 17:08:10,269][23882] Updated weights for policy 0, policy_version 21410 (0.0006) [2023-03-06 17:08:11,040][23882] Updated weights for policy 0, policy_version 21420 (0.0006) [2023-03-06 17:08:11,748][23556] Fps is (10 sec: 13209.5, 60 sec: 13141.3, 300 sec: 13096.8). Total num frames: 21943296. Throughput: 0: 13126.9. Samples: 21941574. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-06 17:08:11,748][23556] Avg episode reward: [(0, '39.644')] [2023-03-06 17:08:11,824][23882] Updated weights for policy 0, policy_version 21430 (0.0006) [2023-03-06 17:08:12,594][23882] Updated weights for policy 0, policy_version 21440 (0.0006) [2023-03-06 17:08:13,366][23882] Updated weights for policy 0, policy_version 21450 (0.0006) [2023-03-06 17:08:14,155][23882] Updated weights for policy 0, policy_version 21460 (0.0006) [2023-03-06 17:08:14,942][23882] Updated weights for policy 0, policy_version 21470 (0.0006) [2023-03-06 17:08:15,710][23882] Updated weights for policy 0, policy_version 21480 (0.0006) [2023-03-06 17:08:16,502][23882] Updated weights for policy 0, policy_version 21490 (0.0005) [2023-03-06 17:08:16,748][23556] Fps is (10 sec: 13209.5, 60 sec: 13124.3, 300 sec: 13096.8). Total num frames: 22008832. Throughput: 0: 13121.2. Samples: 21980781. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:08:16,748][23556] Avg episode reward: [(0, '57.247')] [2023-03-06 17:08:17,295][23882] Updated weights for policy 0, policy_version 21500 (0.0005) [2023-03-06 17:08:18,078][23882] Updated weights for policy 0, policy_version 21510 (0.0006) [2023-03-06 17:08:18,870][23882] Updated weights for policy 0, policy_version 21520 (0.0006) [2023-03-06 17:08:19,648][23882] Updated weights for policy 0, policy_version 21530 (0.0006) [2023-03-06 17:08:20,425][23882] Updated weights for policy 0, policy_version 21540 (0.0006) [2023-03-06 17:08:21,192][23882] Updated weights for policy 0, policy_version 21550 (0.0006) [2023-03-06 17:08:21,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13107.2, 300 sec: 13096.8). Total num frames: 22073344. Throughput: 0: 13123.0. Samples: 22059175. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:08:21,748][23556] Avg episode reward: [(0, '40.977')] [2023-03-06 17:08:21,987][23882] Updated weights for policy 0, policy_version 21560 (0.0007) [2023-03-06 17:08:22,769][23882] Updated weights for policy 0, policy_version 21570 (0.0007) [2023-03-06 17:08:23,560][23882] Updated weights for policy 0, policy_version 21580 (0.0007) [2023-03-06 17:08:24,333][23882] Updated weights for policy 0, policy_version 21590 (0.0006) [2023-03-06 17:08:25,126][23882] Updated weights for policy 0, policy_version 21600 (0.0006) [2023-03-06 17:08:25,906][23882] Updated weights for policy 0, policy_version 21610 (0.0007) [2023-03-06 17:08:26,706][23882] Updated weights for policy 0, policy_version 21620 (0.0007) [2023-03-06 17:08:26,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13107.2, 300 sec: 13096.8). Total num frames: 22138880. Throughput: 0: 13115.4. Samples: 22137361. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:08:26,748][23556] Avg episode reward: [(0, '40.957')] [2023-03-06 17:08:27,482][23882] Updated weights for policy 0, policy_version 21630 (0.0005) [2023-03-06 17:08:28,270][23882] Updated weights for policy 0, policy_version 21640 (0.0006) [2023-03-06 17:08:29,045][23882] Updated weights for policy 0, policy_version 21650 (0.0006) [2023-03-06 17:08:29,832][23882] Updated weights for policy 0, policy_version 21660 (0.0006) [2023-03-06 17:08:30,622][23882] Updated weights for policy 0, policy_version 21670 (0.0006) [2023-03-06 17:08:31,403][23882] Updated weights for policy 0, policy_version 21680 (0.0006) [2023-03-06 17:08:31,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13093.3). Total num frames: 22204416. Throughput: 0: 13115.9. Samples: 22176834. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:08:31,748][23556] Avg episode reward: [(0, '33.997')] [2023-03-06 17:08:32,180][23882] Updated weights for policy 0, policy_version 21690 (0.0006) [2023-03-06 17:08:32,955][23882] Updated weights for policy 0, policy_version 21700 (0.0006) [2023-03-06 17:08:33,742][23882] Updated weights for policy 0, policy_version 21710 (0.0007) [2023-03-06 17:08:34,557][23882] Updated weights for policy 0, policy_version 21720 (0.0006) [2023-03-06 17:08:35,329][23882] Updated weights for policy 0, policy_version 21730 (0.0006) [2023-03-06 17:08:36,102][23882] Updated weights for policy 0, policy_version 21740 (0.0006) [2023-03-06 17:08:36,748][23556] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13096.8). Total num frames: 22269952. Throughput: 0: 13101.5. Samples: 22254857. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:08:36,748][23556] Avg episode reward: [(0, '26.525')] [2023-03-06 17:08:36,882][23882] Updated weights for policy 0, policy_version 21750 (0.0006) [2023-03-06 17:08:37,677][23882] Updated weights for policy 0, policy_version 21760 (0.0005) [2023-03-06 17:08:38,437][23882] Updated weights for policy 0, policy_version 21770 (0.0006) [2023-03-06 17:08:39,227][23882] Updated weights for policy 0, policy_version 21780 (0.0006) [2023-03-06 17:08:40,014][23882] Updated weights for policy 0, policy_version 21790 (0.0006) [2023-03-06 17:08:40,795][23882] Updated weights for policy 0, policy_version 21800 (0.0007) [2023-03-06 17:08:41,557][23882] Updated weights for policy 0, policy_version 21810 (0.0006) [2023-03-06 17:08:41,748][23556] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13096.8). Total num frames: 22335488. Throughput: 0: 13109.0. Samples: 22333845. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:08:41,749][23556] Avg episode reward: [(0, '25.430')] [2023-03-06 17:08:42,342][23882] Updated weights for policy 0, policy_version 21820 (0.0006) [2023-03-06 17:08:43,129][23882] Updated weights for policy 0, policy_version 21830 (0.0006) [2023-03-06 17:08:43,893][23882] Updated weights for policy 0, policy_version 21840 (0.0006) [2023-03-06 17:08:44,681][23882] Updated weights for policy 0, policy_version 21850 (0.0006) [2023-03-06 17:08:45,446][23882] Updated weights for policy 0, policy_version 21860 (0.0006) [2023-03-06 17:08:46,223][23882] Updated weights for policy 0, policy_version 21870 (0.0006) [2023-03-06 17:08:46,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13096.8). Total num frames: 22401024. Throughput: 0: 13105.8. Samples: 22373317. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:08:46,748][23556] Avg episode reward: [(0, '28.428')] [2023-03-06 17:08:47,006][23882] Updated weights for policy 0, policy_version 21880 (0.0006) [2023-03-06 17:08:47,799][23882] Updated weights for policy 0, policy_version 21890 (0.0006) [2023-03-06 17:08:48,590][23882] Updated weights for policy 0, policy_version 21900 (0.0007) [2023-03-06 17:08:49,374][23882] Updated weights for policy 0, policy_version 21910 (0.0006) [2023-03-06 17:08:50,182][23882] Updated weights for policy 0, policy_version 21920 (0.0005) [2023-03-06 17:08:50,949][23882] Updated weights for policy 0, policy_version 21930 (0.0006) [2023-03-06 17:08:51,721][23882] Updated weights for policy 0, policy_version 21940 (0.0007) [2023-03-06 17:08:51,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13096.8). Total num frames: 22466560. Throughput: 0: 13088.1. Samples: 22451457. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:08:51,748][23556] Avg episode reward: [(0, '26.301')] [2023-03-06 17:08:52,509][23882] Updated weights for policy 0, policy_version 21950 (0.0008) [2023-03-06 17:08:53,285][23882] Updated weights for policy 0, policy_version 21960 (0.0007) [2023-03-06 17:08:54,074][23882] Updated weights for policy 0, policy_version 21970 (0.0007) [2023-03-06 17:08:54,862][23882] Updated weights for policy 0, policy_version 21980 (0.0005) [2023-03-06 17:08:55,635][23882] Updated weights for policy 0, policy_version 21990 (0.0006) [2023-03-06 17:08:56,430][23882] Updated weights for policy 0, policy_version 22000 (0.0007) [2023-03-06 17:08:56,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13107.3, 300 sec: 13096.8). Total num frames: 22532096. Throughput: 0: 13076.6. Samples: 22530017. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:08:56,748][23556] Avg episode reward: [(0, '39.505')] [2023-03-06 17:08:57,209][23882] Updated weights for policy 0, policy_version 22010 (0.0006) [2023-03-06 17:08:57,998][23882] Updated weights for policy 0, policy_version 22020 (0.0007) [2023-03-06 17:08:58,775][23882] Updated weights for policy 0, policy_version 22030 (0.0007) [2023-03-06 17:08:59,556][23882] Updated weights for policy 0, policy_version 22040 (0.0006) [2023-03-06 17:09:00,341][23882] Updated weights for policy 0, policy_version 22050 (0.0006) [2023-03-06 17:09:01,122][23882] Updated weights for policy 0, policy_version 22060 (0.0005) [2023-03-06 17:09:01,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13090.1, 300 sec: 13093.3). Total num frames: 22596608. Throughput: 0: 13078.6. Samples: 22569317. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:09:01,748][23556] Avg episode reward: [(0, '40.432')] [2023-03-06 17:09:01,916][23882] Updated weights for policy 0, policy_version 22070 (0.0007) [2023-03-06 17:09:02,698][23882] Updated weights for policy 0, policy_version 22080 (0.0007) [2023-03-06 17:09:03,501][23882] Updated weights for policy 0, policy_version 22090 (0.0007) [2023-03-06 17:09:04,278][23882] Updated weights for policy 0, policy_version 22100 (0.0006) [2023-03-06 17:09:05,061][23882] Updated weights for policy 0, policy_version 22110 (0.0006) [2023-03-06 17:09:05,846][23882] Updated weights for policy 0, policy_version 22120 (0.0008) [2023-03-06 17:09:06,617][23882] Updated weights for policy 0, policy_version 22130 (0.0006) [2023-03-06 17:09:06,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13090.1, 300 sec: 13093.3). Total num frames: 22662144. Throughput: 0: 13076.2. Samples: 22647604. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:09:06,748][23556] Avg episode reward: [(0, '55.263')] [2023-03-06 17:09:07,405][23882] Updated weights for policy 0, policy_version 22140 (0.0006) [2023-03-06 17:09:08,196][23882] Updated weights for policy 0, policy_version 22150 (0.0007) [2023-03-06 17:09:08,982][23882] Updated weights for policy 0, policy_version 22160 (0.0006) [2023-03-06 17:09:09,772][23882] Updated weights for policy 0, policy_version 22170 (0.0006) [2023-03-06 17:09:10,530][23882] Updated weights for policy 0, policy_version 22180 (0.0006) [2023-03-06 17:09:11,322][23882] Updated weights for policy 0, policy_version 22190 (0.0006) [2023-03-06 17:09:11,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13093.3). Total num frames: 22727680. Throughput: 0: 13083.2. Samples: 22726103. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:09:11,748][23556] Avg episode reward: [(0, '52.151')] [2023-03-06 17:09:12,107][23882] Updated weights for policy 0, policy_version 22200 (0.0007) [2023-03-06 17:09:12,880][23882] Updated weights for policy 0, policy_version 22210 (0.0006) [2023-03-06 17:09:13,656][23882] Updated weights for policy 0, policy_version 22220 (0.0007) [2023-03-06 17:09:14,429][23882] Updated weights for policy 0, policy_version 22230 (0.0006) [2023-03-06 17:09:15,243][23882] Updated weights for policy 0, policy_version 22240 (0.0007) [2023-03-06 17:09:16,016][23882] Updated weights for policy 0, policy_version 22250 (0.0006) [2023-03-06 17:09:16,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13096.8). Total num frames: 22793216. Throughput: 0: 13080.1. Samples: 22765439. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 17:09:16,748][23556] Avg episode reward: [(0, '52.810')] [2023-03-06 17:09:16,797][23882] Updated weights for policy 0, policy_version 22260 (0.0007) [2023-03-06 17:09:17,573][23882] Updated weights for policy 0, policy_version 22270 (0.0007) [2023-03-06 17:09:18,336][23882] Updated weights for policy 0, policy_version 22280 (0.0006) [2023-03-06 17:09:19,123][23882] Updated weights for policy 0, policy_version 22290 (0.0007) [2023-03-06 17:09:19,905][23882] Updated weights for policy 0, policy_version 22300 (0.0006) [2023-03-06 17:09:20,673][23882] Updated weights for policy 0, policy_version 22310 (0.0006) [2023-03-06 17:09:21,459][23882] Updated weights for policy 0, policy_version 22320 (0.0006) [2023-03-06 17:09:21,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13090.1, 300 sec: 13096.8). Total num frames: 22858752. Throughput: 0: 13094.8. Samples: 22844122. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 17:09:21,749][23556] Avg episode reward: [(0, '59.230')] [2023-03-06 17:09:22,247][23882] Updated weights for policy 0, policy_version 22330 (0.0006) [2023-03-06 17:09:23,011][23882] Updated weights for policy 0, policy_version 22340 (0.0008) [2023-03-06 17:09:23,798][23882] Updated weights for policy 0, policy_version 22350 (0.0007) [2023-03-06 17:09:24,577][23882] Updated weights for policy 0, policy_version 22360 (0.0007) [2023-03-06 17:09:25,363][23882] Updated weights for policy 0, policy_version 22370 (0.0007) [2023-03-06 17:09:26,141][23882] Updated weights for policy 0, policy_version 22380 (0.0006) [2023-03-06 17:09:26,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13090.1, 300 sec: 13096.8). Total num frames: 22924288. Throughput: 0: 13094.3. Samples: 22923088. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 17:09:26,748][23556] Avg episode reward: [(0, '65.960')] [2023-03-06 17:09:26,921][23882] Updated weights for policy 0, policy_version 22390 (0.0006) [2023-03-06 17:09:27,698][23882] Updated weights for policy 0, policy_version 22400 (0.0006) [2023-03-06 17:09:28,488][23882] Updated weights for policy 0, policy_version 22410 (0.0007) [2023-03-06 17:09:29,254][23882] Updated weights for policy 0, policy_version 22420 (0.0006) [2023-03-06 17:09:30,028][23882] Updated weights for policy 0, policy_version 22430 (0.0006) [2023-03-06 17:09:30,807][23882] Updated weights for policy 0, policy_version 22440 (0.0006) [2023-03-06 17:09:31,580][23882] Updated weights for policy 0, policy_version 22450 (0.0006) [2023-03-06 17:09:31,748][23556] Fps is (10 sec: 13209.6, 60 sec: 13107.2, 300 sec: 13100.3). Total num frames: 22990848. Throughput: 0: 13090.9. Samples: 22962409. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:09:31,748][23556] Avg episode reward: [(0, '67.152')] [2023-03-06 17:09:32,375][23882] Updated weights for policy 0, policy_version 22460 (0.0007) [2023-03-06 17:09:33,154][23882] Updated weights for policy 0, policy_version 22470 (0.0006) [2023-03-06 17:09:33,945][23882] Updated weights for policy 0, policy_version 22480 (0.0007) [2023-03-06 17:09:34,725][23882] Updated weights for policy 0, policy_version 22490 (0.0006) [2023-03-06 17:09:35,501][23882] Updated weights for policy 0, policy_version 22500 (0.0007) [2023-03-06 17:09:36,279][23882] Updated weights for policy 0, policy_version 22510 (0.0007) [2023-03-06 17:09:36,748][23556] Fps is (10 sec: 13209.8, 60 sec: 13107.2, 300 sec: 13100.3). Total num frames: 23056384. Throughput: 0: 13104.4. Samples: 23041153. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:09:36,748][23556] Avg episode reward: [(0, '62.669')] [2023-03-06 17:09:37,070][23882] Updated weights for policy 0, policy_version 22520 (0.0006) [2023-03-06 17:09:37,855][23882] Updated weights for policy 0, policy_version 22530 (0.0007) [2023-03-06 17:09:38,633][23882] Updated weights for policy 0, policy_version 22540 (0.0007) [2023-03-06 17:09:39,409][23882] Updated weights for policy 0, policy_version 22550 (0.0006) [2023-03-06 17:09:40,209][23882] Updated weights for policy 0, policy_version 22560 (0.0006) [2023-03-06 17:09:40,984][23882] Updated weights for policy 0, policy_version 22570 (0.0006) [2023-03-06 17:09:41,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13090.2, 300 sec: 13096.8). Total num frames: 23120896. Throughput: 0: 13104.6. Samples: 23119725. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:09:41,748][23556] Avg episode reward: [(0, '68.360')] [2023-03-06 17:09:41,763][23882] Updated weights for policy 0, policy_version 22580 (0.0007) [2023-03-06 17:09:42,533][23882] Updated weights for policy 0, policy_version 22590 (0.0007) [2023-03-06 17:09:43,318][23882] Updated weights for policy 0, policy_version 22600 (0.0008) [2023-03-06 17:09:44,094][23882] Updated weights for policy 0, policy_version 22610 (0.0006) [2023-03-06 17:09:44,875][23882] Updated weights for policy 0, policy_version 22620 (0.0007) [2023-03-06 17:09:45,665][23882] Updated weights for policy 0, policy_version 22630 (0.0006) [2023-03-06 17:09:46,458][23882] Updated weights for policy 0, policy_version 22640 (0.0007) [2023-03-06 17:09:46,748][23556] Fps is (10 sec: 13004.5, 60 sec: 13090.1, 300 sec: 13096.8). Total num frames: 23186432. Throughput: 0: 13106.3. Samples: 23159100. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:09:46,749][23556] Avg episode reward: [(0, '75.726')] [2023-03-06 17:09:47,234][23882] Updated weights for policy 0, policy_version 22650 (0.0006) [2023-03-06 17:09:48,014][23882] Updated weights for policy 0, policy_version 22660 (0.0006) [2023-03-06 17:09:48,785][23882] Updated weights for policy 0, policy_version 22670 (0.0007) [2023-03-06 17:09:49,570][23882] Updated weights for policy 0, policy_version 22680 (0.0007) [2023-03-06 17:09:50,343][23882] Updated weights for policy 0, policy_version 22690 (0.0006) [2023-03-06 17:09:51,137][23882] Updated weights for policy 0, policy_version 22700 (0.0006) [2023-03-06 17:09:51,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13090.1, 300 sec: 13096.8). Total num frames: 23251968. Throughput: 0: 13117.4. Samples: 23237889. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:09:51,748][23556] Avg episode reward: [(0, '86.529')] [2023-03-06 17:09:51,925][23882] Updated weights for policy 0, policy_version 22710 (0.0006) [2023-03-06 17:09:52,699][23882] Updated weights for policy 0, policy_version 22720 (0.0006) [2023-03-06 17:09:53,477][23882] Updated weights for policy 0, policy_version 22730 (0.0006) [2023-03-06 17:09:54,260][23882] Updated weights for policy 0, policy_version 22740 (0.0007) [2023-03-06 17:09:55,033][23882] Updated weights for policy 0, policy_version 22750 (0.0007) [2023-03-06 17:09:55,809][23882] Updated weights for policy 0, policy_version 22760 (0.0006) [2023-03-06 17:09:56,595][23882] Updated weights for policy 0, policy_version 22770 (0.0006) [2023-03-06 17:09:56,748][23556] Fps is (10 sec: 13107.5, 60 sec: 13090.1, 300 sec: 13096.8). Total num frames: 23317504. Throughput: 0: 13117.7. Samples: 23316400. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:09:56,748][23556] Avg episode reward: [(0, '86.710')] [2023-03-06 17:09:56,752][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000022772_23318528.pth... [2023-03-06 17:09:56,782][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000019701_20173824.pth [2023-03-06 17:09:57,379][23882] Updated weights for policy 0, policy_version 22780 (0.0006) [2023-03-06 17:09:58,149][23882] Updated weights for policy 0, policy_version 22790 (0.0006) [2023-03-06 17:09:58,942][23882] Updated weights for policy 0, policy_version 22800 (0.0006) [2023-03-06 17:09:59,729][23882] Updated weights for policy 0, policy_version 22810 (0.0006) [2023-03-06 17:10:00,498][23882] Updated weights for policy 0, policy_version 22820 (0.0006) [2023-03-06 17:10:01,290][23882] Updated weights for policy 0, policy_version 22830 (0.0007) [2023-03-06 17:10:01,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13100.3). Total num frames: 23383040. Throughput: 0: 13112.8. Samples: 23355514. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:10:01,748][23556] Avg episode reward: [(0, '98.898')] [2023-03-06 17:10:02,066][23882] Updated weights for policy 0, policy_version 22840 (0.0006) [2023-03-06 17:10:02,829][23882] Updated weights for policy 0, policy_version 22850 (0.0006) [2023-03-06 17:10:03,594][23882] Updated weights for policy 0, policy_version 22860 (0.0006) [2023-03-06 17:10:04,379][23882] Updated weights for policy 0, policy_version 22870 (0.0006) [2023-03-06 17:10:05,165][23882] Updated weights for policy 0, policy_version 22880 (0.0007) [2023-03-06 17:10:05,959][23882] Updated weights for policy 0, policy_version 22890 (0.0006) [2023-03-06 17:10:06,742][23882] Updated weights for policy 0, policy_version 22900 (0.0007) [2023-03-06 17:10:06,748][23556] Fps is (10 sec: 13209.5, 60 sec: 13124.2, 300 sec: 13103.7). Total num frames: 23449600. Throughput: 0: 13120.8. Samples: 23434557. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:10:06,748][23556] Avg episode reward: [(0, '103.163')] [2023-03-06 17:10:07,537][23882] Updated weights for policy 0, policy_version 22910 (0.0007) [2023-03-06 17:10:08,305][23882] Updated weights for policy 0, policy_version 22920 (0.0006) [2023-03-06 17:10:09,106][23882] Updated weights for policy 0, policy_version 22930 (0.0006) [2023-03-06 17:10:09,893][23882] Updated weights for policy 0, policy_version 22940 (0.0006) [2023-03-06 17:10:10,679][23882] Updated weights for policy 0, policy_version 22950 (0.0006) [2023-03-06 17:10:11,455][23882] Updated weights for policy 0, policy_version 22960 (0.0006) [2023-03-06 17:10:11,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13100.3). Total num frames: 23514112. Throughput: 0: 13101.7. Samples: 23512663. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:10:11,748][23556] Avg episode reward: [(0, '88.515')] [2023-03-06 17:10:12,240][23882] Updated weights for policy 0, policy_version 22970 (0.0007) [2023-03-06 17:10:13,038][23882] Updated weights for policy 0, policy_version 22980 (0.0006) [2023-03-06 17:10:13,813][23882] Updated weights for policy 0, policy_version 22990 (0.0006) [2023-03-06 17:10:14,587][23882] Updated weights for policy 0, policy_version 23000 (0.0006) [2023-03-06 17:10:15,372][23882] Updated weights for policy 0, policy_version 23010 (0.0007) [2023-03-06 17:10:16,162][23882] Updated weights for policy 0, policy_version 23020 (0.0007) [2023-03-06 17:10:16,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13107.2, 300 sec: 13100.3). Total num frames: 23579648. Throughput: 0: 13100.9. Samples: 23551949. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:10:16,748][23556] Avg episode reward: [(0, '88.102')] [2023-03-06 17:10:16,937][23882] Updated weights for policy 0, policy_version 23030 (0.0007) [2023-03-06 17:10:17,727][23882] Updated weights for policy 0, policy_version 23040 (0.0007) [2023-03-06 17:10:18,510][23882] Updated weights for policy 0, policy_version 23050 (0.0006) [2023-03-06 17:10:19,273][23882] Updated weights for policy 0, policy_version 23060 (0.0006) [2023-03-06 17:10:20,041][23882] Updated weights for policy 0, policy_version 23070 (0.0006) [2023-03-06 17:10:20,835][23882] Updated weights for policy 0, policy_version 23080 (0.0007) [2023-03-06 17:10:21,620][23882] Updated weights for policy 0, policy_version 23090 (0.0006) [2023-03-06 17:10:21,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13100.3). Total num frames: 23645184. Throughput: 0: 13103.2. Samples: 23630797. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:10:21,748][23556] Avg episode reward: [(0, '97.549')] [2023-03-06 17:10:22,399][23882] Updated weights for policy 0, policy_version 23100 (0.0006) [2023-03-06 17:10:23,194][23882] Updated weights for policy 0, policy_version 23110 (0.0007) [2023-03-06 17:10:23,985][23882] Updated weights for policy 0, policy_version 23120 (0.0006) [2023-03-06 17:10:24,774][23882] Updated weights for policy 0, policy_version 23130 (0.0006) [2023-03-06 17:10:25,561][23882] Updated weights for policy 0, policy_version 23140 (0.0007) [2023-03-06 17:10:26,360][23882] Updated weights for policy 0, policy_version 23150 (0.0007) [2023-03-06 17:10:26,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13090.1, 300 sec: 13096.8). Total num frames: 23709696. Throughput: 0: 13087.0. Samples: 23708643. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:10:26,748][23556] Avg episode reward: [(0, '95.108')] [2023-03-06 17:10:27,134][23882] Updated weights for policy 0, policy_version 23160 (0.0006) [2023-03-06 17:10:27,922][23882] Updated weights for policy 0, policy_version 23170 (0.0007) [2023-03-06 17:10:28,704][23882] Updated weights for policy 0, policy_version 23180 (0.0006) [2023-03-06 17:10:29,478][23882] Updated weights for policy 0, policy_version 23190 (0.0006) [2023-03-06 17:10:30,274][23882] Updated weights for policy 0, policy_version 23200 (0.0007) [2023-03-06 17:10:31,063][23882] Updated weights for policy 0, policy_version 23210 (0.0008) [2023-03-06 17:10:31,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13073.1, 300 sec: 13096.8). Total num frames: 23775232. Throughput: 0: 13084.4. Samples: 23747897. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:10:31,748][23556] Avg episode reward: [(0, '86.535')] [2023-03-06 17:10:31,830][23882] Updated weights for policy 0, policy_version 23220 (0.0006) [2023-03-06 17:10:32,620][23882] Updated weights for policy 0, policy_version 23230 (0.0007) [2023-03-06 17:10:33,403][23882] Updated weights for policy 0, policy_version 23240 (0.0006) [2023-03-06 17:10:34,187][23882] Updated weights for policy 0, policy_version 23250 (0.0007) [2023-03-06 17:10:34,972][23882] Updated weights for policy 0, policy_version 23260 (0.0006) [2023-03-06 17:10:35,756][23882] Updated weights for policy 0, policy_version 23270 (0.0007) [2023-03-06 17:10:36,538][23882] Updated weights for policy 0, policy_version 23280 (0.0006) [2023-03-06 17:10:36,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.0, 300 sec: 13096.8). Total num frames: 23840768. Throughput: 0: 13075.6. Samples: 23826291. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:10:36,748][23556] Avg episode reward: [(0, '94.794')] [2023-03-06 17:10:37,330][23882] Updated weights for policy 0, policy_version 23290 (0.0007) [2023-03-06 17:10:38,085][23882] Updated weights for policy 0, policy_version 23300 (0.0006) [2023-03-06 17:10:38,873][23882] Updated weights for policy 0, policy_version 23310 (0.0006) [2023-03-06 17:10:39,650][23882] Updated weights for policy 0, policy_version 23320 (0.0006) [2023-03-06 17:10:40,448][23882] Updated weights for policy 0, policy_version 23330 (0.0006) [2023-03-06 17:10:41,230][23882] Updated weights for policy 0, policy_version 23340 (0.0007) [2023-03-06 17:10:41,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13090.1, 300 sec: 13096.8). Total num frames: 23906304. Throughput: 0: 13077.8. Samples: 23904899. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:10:41,748][23556] Avg episode reward: [(0, '93.447')] [2023-03-06 17:10:42,026][23882] Updated weights for policy 0, policy_version 23350 (0.0007) [2023-03-06 17:10:42,805][23882] Updated weights for policy 0, policy_version 23360 (0.0005) [2023-03-06 17:10:43,581][23882] Updated weights for policy 0, policy_version 23370 (0.0006) [2023-03-06 17:10:44,362][23882] Updated weights for policy 0, policy_version 23380 (0.0006) [2023-03-06 17:10:45,144][23882] Updated weights for policy 0, policy_version 23390 (0.0008) [2023-03-06 17:10:45,924][23882] Updated weights for policy 0, policy_version 23400 (0.0006) [2023-03-06 17:10:46,711][23882] Updated weights for policy 0, policy_version 23410 (0.0007) [2023-03-06 17:10:46,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13090.2, 300 sec: 13100.3). Total num frames: 23971840. Throughput: 0: 13082.0. Samples: 23944201. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:10:46,748][23556] Avg episode reward: [(0, '87.556')] [2023-03-06 17:10:47,494][23882] Updated weights for policy 0, policy_version 23420 (0.0006) [2023-03-06 17:10:48,270][23882] Updated weights for policy 0, policy_version 23430 (0.0006) [2023-03-06 17:10:49,077][23882] Updated weights for policy 0, policy_version 23440 (0.0007) [2023-03-06 17:10:49,839][23882] Updated weights for policy 0, policy_version 23450 (0.0007) [2023-03-06 17:10:50,636][23882] Updated weights for policy 0, policy_version 23460 (0.0006) [2023-03-06 17:10:51,406][23882] Updated weights for policy 0, policy_version 23470 (0.0006) [2023-03-06 17:10:51,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13090.1, 300 sec: 13100.3). Total num frames: 24037376. Throughput: 0: 13064.2. Samples: 24022446. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:10:51,748][23556] Avg episode reward: [(0, '97.202')] [2023-03-06 17:10:52,191][23882] Updated weights for policy 0, policy_version 23480 (0.0006) [2023-03-06 17:10:52,974][23882] Updated weights for policy 0, policy_version 23490 (0.0007) [2023-03-06 17:10:53,769][23882] Updated weights for policy 0, policy_version 23500 (0.0006) [2023-03-06 17:10:54,554][23882] Updated weights for policy 0, policy_version 23510 (0.0006) [2023-03-06 17:10:55,333][23882] Updated weights for policy 0, policy_version 23520 (0.0006) [2023-03-06 17:10:56,121][23882] Updated weights for policy 0, policy_version 23530 (0.0006) [2023-03-06 17:10:56,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13090.1, 300 sec: 13100.3). Total num frames: 24102912. Throughput: 0: 13071.0. Samples: 24100858. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:10:56,748][23556] Avg episode reward: [(0, '79.059')] [2023-03-06 17:10:56,902][23882] Updated weights for policy 0, policy_version 23540 (0.0006) [2023-03-06 17:10:57,706][23882] Updated weights for policy 0, policy_version 23550 (0.0005) [2023-03-06 17:10:58,488][23882] Updated weights for policy 0, policy_version 23560 (0.0007) [2023-03-06 17:10:59,275][23882] Updated weights for policy 0, policy_version 23570 (0.0007) [2023-03-06 17:11:00,052][23882] Updated weights for policy 0, policy_version 23580 (0.0006) [2023-03-06 17:11:00,810][23882] Updated weights for policy 0, policy_version 23590 (0.0006) [2023-03-06 17:11:01,608][23882] Updated weights for policy 0, policy_version 23600 (0.0007) [2023-03-06 17:11:01,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13073.1, 300 sec: 13096.8). Total num frames: 24167424. Throughput: 0: 13065.1. Samples: 24139877. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:11:01,748][23556] Avg episode reward: [(0, '80.714')] [2023-03-06 17:11:02,392][23882] Updated weights for policy 0, policy_version 23610 (0.0008) [2023-03-06 17:11:03,166][23882] Updated weights for policy 0, policy_version 23620 (0.0006) [2023-03-06 17:11:03,947][23882] Updated weights for policy 0, policy_version 23630 (0.0007) [2023-03-06 17:11:04,742][23882] Updated weights for policy 0, policy_version 23640 (0.0006) [2023-03-06 17:11:05,562][23882] Updated weights for policy 0, policy_version 23650 (0.0007) [2023-03-06 17:11:06,332][23882] Updated weights for policy 0, policy_version 23660 (0.0007) [2023-03-06 17:11:06,748][23556] Fps is (10 sec: 13004.6, 60 sec: 13056.0, 300 sec: 13096.8). Total num frames: 24232960. Throughput: 0: 13053.0. Samples: 24218181. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:11:06,748][23556] Avg episode reward: [(0, '64.892')] [2023-03-06 17:11:07,118][23882] Updated weights for policy 0, policy_version 23670 (0.0007) [2023-03-06 17:11:07,921][23882] Updated weights for policy 0, policy_version 23680 (0.0006) [2023-03-06 17:11:08,704][23882] Updated weights for policy 0, policy_version 23690 (0.0007) [2023-03-06 17:11:09,484][23882] Updated weights for policy 0, policy_version 23700 (0.0006) [2023-03-06 17:11:10,278][23882] Updated weights for policy 0, policy_version 23710 (0.0007) [2023-03-06 17:11:11,056][23882] Updated weights for policy 0, policy_version 23720 (0.0007) [2023-03-06 17:11:11,748][23556] Fps is (10 sec: 13004.6, 60 sec: 13056.0, 300 sec: 13093.3). Total num frames: 24297472. Throughput: 0: 13059.0. Samples: 24296296. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:11:11,748][23556] Avg episode reward: [(0, '71.769')] [2023-03-06 17:11:11,843][23882] Updated weights for policy 0, policy_version 23730 (0.0007) [2023-03-06 17:11:12,615][23882] Updated weights for policy 0, policy_version 23740 (0.0006) [2023-03-06 17:11:13,405][23882] Updated weights for policy 0, policy_version 23750 (0.0007) [2023-03-06 17:11:14,175][23882] Updated weights for policy 0, policy_version 23760 (0.0007) [2023-03-06 17:11:14,977][23882] Updated weights for policy 0, policy_version 23770 (0.0006) [2023-03-06 17:11:15,765][23882] Updated weights for policy 0, policy_version 23780 (0.0006) [2023-03-06 17:11:16,529][23882] Updated weights for policy 0, policy_version 23790 (0.0006) [2023-03-06 17:11:16,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13096.8). Total num frames: 24363008. Throughput: 0: 13058.8. Samples: 24335541. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:11:16,748][23556] Avg episode reward: [(0, '74.830')] [2023-03-06 17:11:17,317][23882] Updated weights for policy 0, policy_version 23800 (0.0006) [2023-03-06 17:11:18,107][23882] Updated weights for policy 0, policy_version 23810 (0.0007) [2023-03-06 17:11:18,878][23882] Updated weights for policy 0, policy_version 23820 (0.0006) [2023-03-06 17:11:19,661][23882] Updated weights for policy 0, policy_version 23830 (0.0007) [2023-03-06 17:11:20,465][23882] Updated weights for policy 0, policy_version 23840 (0.0006) [2023-03-06 17:11:21,230][23882] Updated weights for policy 0, policy_version 23850 (0.0006) [2023-03-06 17:11:21,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13093.3). Total num frames: 24428544. Throughput: 0: 13060.3. Samples: 24414004. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:11:21,748][23556] Avg episode reward: [(0, '79.141')] [2023-03-06 17:11:22,010][23882] Updated weights for policy 0, policy_version 23860 (0.0006) [2023-03-06 17:11:22,794][23882] Updated weights for policy 0, policy_version 23870 (0.0006) [2023-03-06 17:11:23,577][23882] Updated weights for policy 0, policy_version 23880 (0.0006) [2023-03-06 17:11:24,390][23882] Updated weights for policy 0, policy_version 23890 (0.0006) [2023-03-06 17:11:25,170][23882] Updated weights for policy 0, policy_version 23900 (0.0006) [2023-03-06 17:11:25,966][23882] Updated weights for policy 0, policy_version 23910 (0.0007) [2023-03-06 17:11:26,742][23882] Updated weights for policy 0, policy_version 23920 (0.0006) [2023-03-06 17:11:26,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13096.8). Total num frames: 24494080. Throughput: 0: 13048.4. Samples: 24492079. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:11:26,748][23556] Avg episode reward: [(0, '85.189')] [2023-03-06 17:11:27,530][23882] Updated weights for policy 0, policy_version 23930 (0.0007) [2023-03-06 17:11:28,319][23882] Updated weights for policy 0, policy_version 23940 (0.0006) [2023-03-06 17:11:29,089][23882] Updated weights for policy 0, policy_version 23950 (0.0006) [2023-03-06 17:11:29,871][23882] Updated weights for policy 0, policy_version 23960 (0.0006) [2023-03-06 17:11:30,646][23882] Updated weights for policy 0, policy_version 23970 (0.0006) [2023-03-06 17:11:31,425][23882] Updated weights for policy 0, policy_version 23980 (0.0006) [2023-03-06 17:11:31,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13093.3). Total num frames: 24558592. Throughput: 0: 13045.0. Samples: 24531228. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:11:31,748][23556] Avg episode reward: [(0, '83.653')] [2023-03-06 17:11:32,217][23882] Updated weights for policy 0, policy_version 23990 (0.0007) [2023-03-06 17:11:32,982][23882] Updated weights for policy 0, policy_version 24000 (0.0006) [2023-03-06 17:11:33,761][23882] Updated weights for policy 0, policy_version 24010 (0.0006) [2023-03-06 17:11:34,566][23882] Updated weights for policy 0, policy_version 24020 (0.0006) [2023-03-06 17:11:35,358][23882] Updated weights for policy 0, policy_version 24030 (0.0006) [2023-03-06 17:11:36,126][23882] Updated weights for policy 0, policy_version 24040 (0.0007) [2023-03-06 17:11:36,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13093.3). Total num frames: 24624128. Throughput: 0: 13052.5. Samples: 24609807. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:11:36,748][23556] Avg episode reward: [(0, '116.952')] [2023-03-06 17:11:36,906][23882] Updated weights for policy 0, policy_version 24050 (0.0006) [2023-03-06 17:11:37,694][23882] Updated weights for policy 0, policy_version 24060 (0.0006) [2023-03-06 17:11:38,474][23882] Updated weights for policy 0, policy_version 24070 (0.0006) [2023-03-06 17:11:39,259][23882] Updated weights for policy 0, policy_version 24080 (0.0006) [2023-03-06 17:11:40,049][23882] Updated weights for policy 0, policy_version 24090 (0.0006) [2023-03-06 17:11:40,816][23882] Updated weights for policy 0, policy_version 24100 (0.0006) [2023-03-06 17:11:41,607][23882] Updated weights for policy 0, policy_version 24110 (0.0005) [2023-03-06 17:11:41,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13093.3). Total num frames: 24689664. Throughput: 0: 13055.7. Samples: 24688367. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:11:41,748][23556] Avg episode reward: [(0, '168.757')] [2023-03-06 17:11:42,397][23882] Updated weights for policy 0, policy_version 24120 (0.0007) [2023-03-06 17:11:43,180][23882] Updated weights for policy 0, policy_version 24130 (0.0007) [2023-03-06 17:11:43,977][23882] Updated weights for policy 0, policy_version 24140 (0.0006) [2023-03-06 17:11:44,750][23882] Updated weights for policy 0, policy_version 24150 (0.0006) [2023-03-06 17:11:45,531][23882] Updated weights for policy 0, policy_version 24160 (0.0007) [2023-03-06 17:11:46,312][23882] Updated weights for policy 0, policy_version 24170 (0.0006) [2023-03-06 17:11:46,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13096.8). Total num frames: 24755200. Throughput: 0: 13055.7. Samples: 24727384. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:11:46,748][23556] Avg episode reward: [(0, '131.791')] [2023-03-06 17:11:47,088][23882] Updated weights for policy 0, policy_version 24180 (0.0005) [2023-03-06 17:11:47,865][23882] Updated weights for policy 0, policy_version 24190 (0.0007) [2023-03-06 17:11:48,656][23882] Updated weights for policy 0, policy_version 24200 (0.0006) [2023-03-06 17:11:49,441][23882] Updated weights for policy 0, policy_version 24210 (0.0006) [2023-03-06 17:11:50,224][23882] Updated weights for policy 0, policy_version 24220 (0.0007) [2023-03-06 17:11:51,014][23882] Updated weights for policy 0, policy_version 24230 (0.0007) [2023-03-06 17:11:51,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13093.3). Total num frames: 24820736. Throughput: 0: 13064.7. Samples: 24806091. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:11:51,748][23556] Avg episode reward: [(0, '110.934')] [2023-03-06 17:11:51,802][23882] Updated weights for policy 0, policy_version 24240 (0.0007) [2023-03-06 17:11:52,577][23882] Updated weights for policy 0, policy_version 24250 (0.0006) [2023-03-06 17:11:53,353][23882] Updated weights for policy 0, policy_version 24260 (0.0007) [2023-03-06 17:11:54,130][23882] Updated weights for policy 0, policy_version 24270 (0.0006) [2023-03-06 17:11:54,913][23882] Updated weights for policy 0, policy_version 24280 (0.0005) [2023-03-06 17:11:55,698][23882] Updated weights for policy 0, policy_version 24290 (0.0005) [2023-03-06 17:11:56,482][23882] Updated weights for policy 0, policy_version 24300 (0.0006) [2023-03-06 17:11:56,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13093.3). Total num frames: 24886272. Throughput: 0: 13075.1. Samples: 24884676. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:11:56,754][23556] Avg episode reward: [(0, '134.755')] [2023-03-06 17:11:56,758][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000024303_24886272.pth... [2023-03-06 17:11:56,790][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000021236_21745664.pth [2023-03-06 17:11:57,265][23882] Updated weights for policy 0, policy_version 24310 (0.0006) [2023-03-06 17:11:58,050][23882] Updated weights for policy 0, policy_version 24320 (0.0006) [2023-03-06 17:11:58,825][23882] Updated weights for policy 0, policy_version 24330 (0.0006) [2023-03-06 17:11:59,620][23882] Updated weights for policy 0, policy_version 24340 (0.0006) [2023-03-06 17:12:00,398][23882] Updated weights for policy 0, policy_version 24350 (0.0006) [2023-03-06 17:12:01,210][23882] Updated weights for policy 0, policy_version 24360 (0.0006) [2023-03-06 17:12:01,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13073.0, 300 sec: 13093.3). Total num frames: 24951808. Throughput: 0: 13074.8. Samples: 24923909. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:12:01,759][23556] Avg episode reward: [(0, '136.879')] [2023-03-06 17:12:01,985][23882] Updated weights for policy 0, policy_version 24370 (0.0007) [2023-03-06 17:12:02,758][23882] Updated weights for policy 0, policy_version 24380 (0.0006) [2023-03-06 17:12:03,541][23882] Updated weights for policy 0, policy_version 24390 (0.0007) [2023-03-06 17:12:04,341][23882] Updated weights for policy 0, policy_version 24400 (0.0007) [2023-03-06 17:12:05,121][23882] Updated weights for policy 0, policy_version 24410 (0.0006) [2023-03-06 17:12:05,888][23882] Updated weights for policy 0, policy_version 24420 (0.0006) [2023-03-06 17:12:06,685][23882] Updated weights for policy 0, policy_version 24430 (0.0006) [2023-03-06 17:12:06,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13089.8). Total num frames: 25016320. Throughput: 0: 13068.0. Samples: 25002064. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:12:06,759][23556] Avg episode reward: [(0, '145.477')] [2023-03-06 17:12:07,470][23882] Updated weights for policy 0, policy_version 24440 (0.0006) [2023-03-06 17:12:08,253][23882] Updated weights for policy 0, policy_version 24450 (0.0006) [2023-03-06 17:12:09,028][23882] Updated weights for policy 0, policy_version 24460 (0.0006) [2023-03-06 17:12:09,793][23882] Updated weights for policy 0, policy_version 24470 (0.0006) [2023-03-06 17:12:10,595][23882] Updated weights for policy 0, policy_version 24480 (0.0006) [2023-03-06 17:12:11,393][23882] Updated weights for policy 0, policy_version 24490 (0.0006) [2023-03-06 17:12:11,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13073.1, 300 sec: 13086.4). Total num frames: 25081856. Throughput: 0: 13074.9. Samples: 25080452. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:12:11,755][23556] Avg episode reward: [(0, '143.540')] [2023-03-06 17:12:12,180][23882] Updated weights for policy 0, policy_version 24500 (0.0006) [2023-03-06 17:12:12,965][23882] Updated weights for policy 0, policy_version 24510 (0.0006) [2023-03-06 17:12:13,772][23882] Updated weights for policy 0, policy_version 24520 (0.0007) [2023-03-06 17:12:14,555][23882] Updated weights for policy 0, policy_version 24530 (0.0006) [2023-03-06 17:12:15,332][23882] Updated weights for policy 0, policy_version 24540 (0.0007) [2023-03-06 17:12:16,106][23882] Updated weights for policy 0, policy_version 24550 (0.0006) [2023-03-06 17:12:16,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13086.4). Total num frames: 25147392. Throughput: 0: 13067.9. Samples: 25119282. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:12:16,759][23556] Avg episode reward: [(0, '130.925')] [2023-03-06 17:12:16,898][23882] Updated weights for policy 0, policy_version 24560 (0.0006) [2023-03-06 17:12:17,690][23882] Updated weights for policy 0, policy_version 24570 (0.0007) [2023-03-06 17:12:18,477][23882] Updated weights for policy 0, policy_version 24580 (0.0006) [2023-03-06 17:12:19,278][23882] Updated weights for policy 0, policy_version 24590 (0.0006) [2023-03-06 17:12:20,047][23882] Updated weights for policy 0, policy_version 24600 (0.0006) [2023-03-06 17:12:20,833][23882] Updated weights for policy 0, policy_version 24610 (0.0006) [2023-03-06 17:12:21,598][23882] Updated weights for policy 0, policy_version 24620 (0.0006) [2023-03-06 17:12:21,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13082.9). Total num frames: 25211904. Throughput: 0: 13063.6. Samples: 25197672. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:12:21,748][23556] Avg episode reward: [(0, '130.626')] [2023-03-06 17:12:22,381][23882] Updated weights for policy 0, policy_version 24630 (0.0006) [2023-03-06 17:12:23,170][23882] Updated weights for policy 0, policy_version 24640 (0.0006) [2023-03-06 17:12:23,941][23882] Updated weights for policy 0, policy_version 24650 (0.0007) [2023-03-06 17:12:24,732][23882] Updated weights for policy 0, policy_version 24660 (0.0006) [2023-03-06 17:12:25,525][23882] Updated weights for policy 0, policy_version 24670 (0.0007) [2023-03-06 17:12:26,291][23882] Updated weights for policy 0, policy_version 24680 (0.0006) [2023-03-06 17:12:26,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13082.9). Total num frames: 25277440. Throughput: 0: 13063.5. Samples: 25276225. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:12:26,748][23556] Avg episode reward: [(0, '100.089')] [2023-03-06 17:12:27,072][23882] Updated weights for policy 0, policy_version 24690 (0.0007) [2023-03-06 17:12:27,858][23882] Updated weights for policy 0, policy_version 24700 (0.0005) [2023-03-06 17:12:28,657][23882] Updated weights for policy 0, policy_version 24710 (0.0006) [2023-03-06 17:12:29,444][23882] Updated weights for policy 0, policy_version 24720 (0.0006) [2023-03-06 17:12:30,229][23882] Updated weights for policy 0, policy_version 24730 (0.0007) [2023-03-06 17:12:31,031][23882] Updated weights for policy 0, policy_version 24740 (0.0006) [2023-03-06 17:12:31,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13073.0, 300 sec: 13082.9). Total num frames: 25342976. Throughput: 0: 13061.7. Samples: 25315163. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:12:31,749][23556] Avg episode reward: [(0, '111.879')] [2023-03-06 17:12:31,818][23882] Updated weights for policy 0, policy_version 24750 (0.0006) [2023-03-06 17:12:32,589][23882] Updated weights for policy 0, policy_version 24760 (0.0006) [2023-03-06 17:12:33,395][23882] Updated weights for policy 0, policy_version 24770 (0.0006) [2023-03-06 17:12:34,182][23882] Updated weights for policy 0, policy_version 24780 (0.0006) [2023-03-06 17:12:34,968][23882] Updated weights for policy 0, policy_version 24790 (0.0006) [2023-03-06 17:12:35,722][23882] Updated weights for policy 0, policy_version 24800 (0.0006) [2023-03-06 17:12:36,541][23882] Updated weights for policy 0, policy_version 24810 (0.0006) [2023-03-06 17:12:36,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13079.4). Total num frames: 25407488. Throughput: 0: 13048.7. Samples: 25393282. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:12:36,748][23556] Avg episode reward: [(0, '105.307')] [2023-03-06 17:12:37,312][23882] Updated weights for policy 0, policy_version 24820 (0.0006) [2023-03-06 17:12:38,094][23882] Updated weights for policy 0, policy_version 24830 (0.0006) [2023-03-06 17:12:38,868][23882] Updated weights for policy 0, policy_version 24840 (0.0006) [2023-03-06 17:12:39,656][23882] Updated weights for policy 0, policy_version 24850 (0.0006) [2023-03-06 17:12:40,436][23882] Updated weights for policy 0, policy_version 24860 (0.0006) [2023-03-06 17:12:41,219][23882] Updated weights for policy 0, policy_version 24870 (0.0006) [2023-03-06 17:12:41,748][23556] Fps is (10 sec: 13005.0, 60 sec: 13056.0, 300 sec: 13079.4). Total num frames: 25473024. Throughput: 0: 13047.7. Samples: 25471823. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:12:41,748][23556] Avg episode reward: [(0, '126.718')] [2023-03-06 17:12:41,993][23882] Updated weights for policy 0, policy_version 24880 (0.0007) [2023-03-06 17:12:42,765][23882] Updated weights for policy 0, policy_version 24890 (0.0006) [2023-03-06 17:12:43,552][23882] Updated weights for policy 0, policy_version 24900 (0.0007) [2023-03-06 17:12:44,346][23882] Updated weights for policy 0, policy_version 24910 (0.0007) [2023-03-06 17:12:45,139][23882] Updated weights for policy 0, policy_version 24920 (0.0006) [2023-03-06 17:12:45,948][23882] Updated weights for policy 0, policy_version 24930 (0.0006) [2023-03-06 17:12:46,725][23882] Updated weights for policy 0, policy_version 24940 (0.0006) [2023-03-06 17:12:46,748][23556] Fps is (10 sec: 13107.0, 60 sec: 13056.0, 300 sec: 13079.4). Total num frames: 25538560. Throughput: 0: 13044.0. Samples: 25510888. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:12:46,748][23556] Avg episode reward: [(0, '126.239')] [2023-03-06 17:12:47,498][23882] Updated weights for policy 0, policy_version 24950 (0.0007) [2023-03-06 17:12:48,287][23882] Updated weights for policy 0, policy_version 24960 (0.0006) [2023-03-06 17:12:49,049][23882] Updated weights for policy 0, policy_version 24970 (0.0006) [2023-03-06 17:12:49,839][23882] Updated weights for policy 0, policy_version 24980 (0.0006) [2023-03-06 17:12:50,623][23882] Updated weights for policy 0, policy_version 24990 (0.0006) [2023-03-06 17:12:51,404][23882] Updated weights for policy 0, policy_version 25000 (0.0006) [2023-03-06 17:12:51,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13079.4). Total num frames: 25604096. Throughput: 0: 13049.6. Samples: 25589295. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:12:51,749][23556] Avg episode reward: [(0, '105.663')] [2023-03-06 17:12:52,174][23882] Updated weights for policy 0, policy_version 25010 (0.0007) [2023-03-06 17:12:52,961][23882] Updated weights for policy 0, policy_version 25020 (0.0006) [2023-03-06 17:12:53,754][23882] Updated weights for policy 0, policy_version 25030 (0.0007) [2023-03-06 17:12:54,522][23882] Updated weights for policy 0, policy_version 25040 (0.0007) [2023-03-06 17:12:55,305][23882] Updated weights for policy 0, policy_version 25050 (0.0006) [2023-03-06 17:12:56,096][23882] Updated weights for policy 0, policy_version 25060 (0.0006) [2023-03-06 17:12:56,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13079.4). Total num frames: 25669632. Throughput: 0: 13052.3. Samples: 25667808. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:12:56,748][23556] Avg episode reward: [(0, '120.672')] [2023-03-06 17:12:56,874][23882] Updated weights for policy 0, policy_version 25070 (0.0006) [2023-03-06 17:12:57,671][23882] Updated weights for policy 0, policy_version 25080 (0.0006) [2023-03-06 17:12:58,445][23882] Updated weights for policy 0, policy_version 25090 (0.0007) [2023-03-06 17:12:59,250][23882] Updated weights for policy 0, policy_version 25100 (0.0006) [2023-03-06 17:13:00,039][23882] Updated weights for policy 0, policy_version 25110 (0.0006) [2023-03-06 17:13:00,806][23882] Updated weights for policy 0, policy_version 25120 (0.0006) [2023-03-06 17:13:01,598][23882] Updated weights for policy 0, policy_version 25130 (0.0006) [2023-03-06 17:13:01,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13039.0, 300 sec: 13076.0). Total num frames: 25734144. Throughput: 0: 13055.6. Samples: 25706784. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:13:01,748][23556] Avg episode reward: [(0, '101.881')] [2023-03-06 17:13:02,380][23882] Updated weights for policy 0, policy_version 25140 (0.0006) [2023-03-06 17:13:03,158][23882] Updated weights for policy 0, policy_version 25150 (0.0006) [2023-03-06 17:13:03,955][23882] Updated weights for policy 0, policy_version 25160 (0.0006) [2023-03-06 17:13:04,749][23882] Updated weights for policy 0, policy_version 25170 (0.0007) [2023-03-06 17:13:05,532][23882] Updated weights for policy 0, policy_version 25180 (0.0006) [2023-03-06 17:13:06,298][23882] Updated weights for policy 0, policy_version 25190 (0.0006) [2023-03-06 17:13:06,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13072.5). Total num frames: 25799680. Throughput: 0: 13059.1. Samples: 25785331. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:13:06,748][23556] Avg episode reward: [(0, '116.483')] [2023-03-06 17:13:07,102][23882] Updated weights for policy 0, policy_version 25200 (0.0006) [2023-03-06 17:13:07,877][23882] Updated weights for policy 0, policy_version 25210 (0.0006) [2023-03-06 17:13:08,666][23882] Updated weights for policy 0, policy_version 25220 (0.0007) [2023-03-06 17:13:09,446][23882] Updated weights for policy 0, policy_version 25230 (0.0006) [2023-03-06 17:13:10,248][23882] Updated weights for policy 0, policy_version 25240 (0.0006) [2023-03-06 17:13:11,031][23882] Updated weights for policy 0, policy_version 25250 (0.0006) [2023-03-06 17:13:11,748][23556] Fps is (10 sec: 13107.0, 60 sec: 13056.0, 300 sec: 13072.5). Total num frames: 25865216. Throughput: 0: 13048.4. Samples: 25863405. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:13:11,749][23556] Avg episode reward: [(0, '118.008')] [2023-03-06 17:13:11,803][23882] Updated weights for policy 0, policy_version 25260 (0.0006) [2023-03-06 17:13:12,575][23882] Updated weights for policy 0, policy_version 25270 (0.0006) [2023-03-06 17:13:13,372][23882] Updated weights for policy 0, policy_version 25280 (0.0006) [2023-03-06 17:13:14,154][23882] Updated weights for policy 0, policy_version 25290 (0.0006) [2023-03-06 17:13:14,952][23882] Updated weights for policy 0, policy_version 25300 (0.0006) [2023-03-06 17:13:15,738][23882] Updated weights for policy 0, policy_version 25310 (0.0007) [2023-03-06 17:13:16,550][23882] Updated weights for policy 0, policy_version 25320 (0.0006) [2023-03-06 17:13:16,748][23556] Fps is (10 sec: 13005.0, 60 sec: 13038.9, 300 sec: 13072.5). Total num frames: 25929728. Throughput: 0: 13052.8. Samples: 25902536. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:13:16,759][23556] Avg episode reward: [(0, '154.663')] [2023-03-06 17:13:17,309][23882] Updated weights for policy 0, policy_version 25330 (0.0006) [2023-03-06 17:13:18,102][23882] Updated weights for policy 0, policy_version 25340 (0.0007) [2023-03-06 17:13:18,862][23882] Updated weights for policy 0, policy_version 25350 (0.0008) [2023-03-06 17:13:19,665][23882] Updated weights for policy 0, policy_version 25360 (0.0006) [2023-03-06 17:13:20,449][23882] Updated weights for policy 0, policy_version 25370 (0.0006) [2023-03-06 17:13:21,244][23882] Updated weights for policy 0, policy_version 25380 (0.0008) [2023-03-06 17:13:21,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13072.5). Total num frames: 25995264. Throughput: 0: 13057.5. Samples: 25980871. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:13:21,760][23556] Avg episode reward: [(0, '153.520')] [2023-03-06 17:13:22,016][23882] Updated weights for policy 0, policy_version 25390 (0.0006) [2023-03-06 17:13:22,806][23882] Updated weights for policy 0, policy_version 25400 (0.0006) [2023-03-06 17:13:23,579][23882] Updated weights for policy 0, policy_version 25410 (0.0006) [2023-03-06 17:13:24,366][23882] Updated weights for policy 0, policy_version 25420 (0.0006) [2023-03-06 17:13:25,141][23882] Updated weights for policy 0, policy_version 25430 (0.0006) [2023-03-06 17:13:25,928][23882] Updated weights for policy 0, policy_version 25440 (0.0006) [2023-03-06 17:13:26,721][23882] Updated weights for policy 0, policy_version 25450 (0.0007) [2023-03-06 17:13:26,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13072.5). Total num frames: 26060800. Throughput: 0: 13054.4. Samples: 26059272. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:13:26,758][23556] Avg episode reward: [(0, '137.986')] [2023-03-06 17:13:27,498][23882] Updated weights for policy 0, policy_version 25460 (0.0006) [2023-03-06 17:13:28,280][23882] Updated weights for policy 0, policy_version 25470 (0.0006) [2023-03-06 17:13:29,051][23882] Updated weights for policy 0, policy_version 25480 (0.0006) [2023-03-06 17:13:29,826][23882] Updated weights for policy 0, policy_version 25490 (0.0006) [2023-03-06 17:13:30,602][23882] Updated weights for policy 0, policy_version 25500 (0.0007) [2023-03-06 17:13:31,390][23882] Updated weights for policy 0, policy_version 25510 (0.0006) [2023-03-06 17:13:31,748][23556] Fps is (10 sec: 13107.4, 60 sec: 13056.0, 300 sec: 13072.5). Total num frames: 26126336. Throughput: 0: 13061.5. Samples: 26098653. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:13:31,759][23556] Avg episode reward: [(0, '108.971')] [2023-03-06 17:13:32,159][23882] Updated weights for policy 0, policy_version 25520 (0.0007) [2023-03-06 17:13:32,945][23882] Updated weights for policy 0, policy_version 25530 (0.0007) [2023-03-06 17:13:33,711][23882] Updated weights for policy 0, policy_version 25540 (0.0006) [2023-03-06 17:13:34,506][23882] Updated weights for policy 0, policy_version 25550 (0.0006) [2023-03-06 17:13:35,273][23882] Updated weights for policy 0, policy_version 25560 (0.0006) [2023-03-06 17:13:36,057][23882] Updated weights for policy 0, policy_version 25570 (0.0006) [2023-03-06 17:13:36,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13072.5). Total num frames: 26191872. Throughput: 0: 13070.7. Samples: 26177477. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:13:36,759][23556] Avg episode reward: [(0, '55.964')] [2023-03-06 17:13:36,848][23882] Updated weights for policy 0, policy_version 25580 (0.0006) [2023-03-06 17:13:37,640][23882] Updated weights for policy 0, policy_version 25590 (0.0006) [2023-03-06 17:13:38,418][23882] Updated weights for policy 0, policy_version 25600 (0.0006) [2023-03-06 17:13:39,205][23882] Updated weights for policy 0, policy_version 25610 (0.0008) [2023-03-06 17:13:39,995][23882] Updated weights for policy 0, policy_version 25620 (0.0006) [2023-03-06 17:13:40,780][23882] Updated weights for policy 0, policy_version 25630 (0.0006) [2023-03-06 17:13:41,554][23882] Updated weights for policy 0, policy_version 25640 (0.0006) [2023-03-06 17:13:41,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13073.0, 300 sec: 13072.5). Total num frames: 26257408. Throughput: 0: 13065.4. Samples: 26255748. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:13:41,754][23556] Avg episode reward: [(0, '68.263')] [2023-03-06 17:13:42,337][23882] Updated weights for policy 0, policy_version 25650 (0.0007) [2023-03-06 17:13:43,138][23882] Updated weights for policy 0, policy_version 25660 (0.0006) [2023-03-06 17:13:43,909][23882] Updated weights for policy 0, policy_version 25670 (0.0007) [2023-03-06 17:13:44,689][23882] Updated weights for policy 0, policy_version 25680 (0.0007) [2023-03-06 17:13:45,485][23882] Updated weights for policy 0, policy_version 25690 (0.0006) [2023-03-06 17:13:46,267][23882] Updated weights for policy 0, policy_version 25700 (0.0006) [2023-03-06 17:13:46,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13072.5). Total num frames: 26322944. Throughput: 0: 13069.7. Samples: 26294922. Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) [2023-03-06 17:13:46,759][23556] Avg episode reward: [(0, '96.639')] [2023-03-06 17:13:47,037][23882] Updated weights for policy 0, policy_version 25710 (0.0006) [2023-03-06 17:13:47,828][23882] Updated weights for policy 0, policy_version 25720 (0.0007) [2023-03-06 17:13:48,620][23882] Updated weights for policy 0, policy_version 25730 (0.0006) [2023-03-06 17:13:49,389][23882] Updated weights for policy 0, policy_version 25740 (0.0006) [2023-03-06 17:13:50,182][23882] Updated weights for policy 0, policy_version 25750 (0.0006) [2023-03-06 17:13:50,955][23882] Updated weights for policy 0, policy_version 25760 (0.0006) [2023-03-06 17:13:51,748][23556] Fps is (10 sec: 13005.0, 60 sec: 13056.0, 300 sec: 13069.0). Total num frames: 26387456. Throughput: 0: 13066.6. Samples: 26373326. Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) [2023-03-06 17:13:51,748][23556] Avg episode reward: [(0, '97.099')] [2023-03-06 17:13:51,763][23882] Updated weights for policy 0, policy_version 25770 (0.0006) [2023-03-06 17:13:52,534][23882] Updated weights for policy 0, policy_version 25780 (0.0006) [2023-03-06 17:13:53,300][23882] Updated weights for policy 0, policy_version 25790 (0.0007) [2023-03-06 17:13:54,089][23882] Updated weights for policy 0, policy_version 25800 (0.0005) [2023-03-06 17:13:54,881][23882] Updated weights for policy 0, policy_version 25810 (0.0006) [2023-03-06 17:13:55,656][23882] Updated weights for policy 0, policy_version 25820 (0.0007) [2023-03-06 17:13:56,449][23882] Updated weights for policy 0, policy_version 25830 (0.0006) [2023-03-06 17:13:56,748][23556] Fps is (10 sec: 13004.6, 60 sec: 13056.0, 300 sec: 13072.5). Total num frames: 26452992. Throughput: 0: 13074.5. Samples: 26451760. Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) [2023-03-06 17:13:56,748][23556] Avg episode reward: [(0, '104.591')] [2023-03-06 17:13:56,762][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000025834_26454016.pth... [2023-03-06 17:13:56,792][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000022772_23318528.pth [2023-03-06 17:13:57,218][23882] Updated weights for policy 0, policy_version 25840 (0.0006) [2023-03-06 17:13:58,009][23882] Updated weights for policy 0, policy_version 25850 (0.0005) [2023-03-06 17:13:58,782][23882] Updated weights for policy 0, policy_version 25860 (0.0007) [2023-03-06 17:13:59,553][23882] Updated weights for policy 0, policy_version 25870 (0.0007) [2023-03-06 17:14:00,358][23882] Updated weights for policy 0, policy_version 25880 (0.0006) [2023-03-06 17:14:01,134][23882] Updated weights for policy 0, policy_version 25890 (0.0007) [2023-03-06 17:14:01,748][23556] Fps is (10 sec: 13107.0, 60 sec: 13073.0, 300 sec: 13072.5). Total num frames: 26518528. Throughput: 0: 13081.7. Samples: 26491212. Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) [2023-03-06 17:14:01,748][23556] Avg episode reward: [(0, '99.573')] [2023-03-06 17:14:01,915][23882] Updated weights for policy 0, policy_version 25900 (0.0007) [2023-03-06 17:14:02,704][23882] Updated weights for policy 0, policy_version 25910 (0.0006) [2023-03-06 17:14:03,488][23882] Updated weights for policy 0, policy_version 25920 (0.0006) [2023-03-06 17:14:04,263][23882] Updated weights for policy 0, policy_version 25930 (0.0007) [2023-03-06 17:14:05,048][23882] Updated weights for policy 0, policy_version 25940 (0.0006) [2023-03-06 17:14:05,845][23882] Updated weights for policy 0, policy_version 25950 (0.0007) [2023-03-06 17:14:06,628][23882] Updated weights for policy 0, policy_version 25960 (0.0007) [2023-03-06 17:14:06,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13072.5). Total num frames: 26584064. Throughput: 0: 13085.1. Samples: 26569700. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:14:06,748][23556] Avg episode reward: [(0, '118.115')] [2023-03-06 17:14:07,402][23882] Updated weights for policy 0, policy_version 25970 (0.0007) [2023-03-06 17:14:08,170][23882] Updated weights for policy 0, policy_version 25980 (0.0006) [2023-03-06 17:14:08,949][23882] Updated weights for policy 0, policy_version 25990 (0.0006) [2023-03-06 17:14:09,740][23882] Updated weights for policy 0, policy_version 26000 (0.0006) [2023-03-06 17:14:10,522][23882] Updated weights for policy 0, policy_version 26010 (0.0007) [2023-03-06 17:14:11,305][23882] Updated weights for policy 0, policy_version 26020 (0.0006) [2023-03-06 17:14:11,748][23556] Fps is (10 sec: 13106.2, 60 sec: 13072.9, 300 sec: 13072.5). Total num frames: 26649600. Throughput: 0: 13085.1. Samples: 26648112. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:14:11,749][23556] Avg episode reward: [(0, '116.626')] [2023-03-06 17:14:12,096][23882] Updated weights for policy 0, policy_version 26030 (0.0006) [2023-03-06 17:14:12,876][23882] Updated weights for policy 0, policy_version 26040 (0.0007) [2023-03-06 17:14:13,654][23882] Updated weights for policy 0, policy_version 26050 (0.0006) [2023-03-06 17:14:14,453][23882] Updated weights for policy 0, policy_version 26060 (0.0007) [2023-03-06 17:14:15,240][23882] Updated weights for policy 0, policy_version 26070 (0.0006) [2023-03-06 17:14:16,033][23882] Updated weights for policy 0, policy_version 26080 (0.0006) [2023-03-06 17:14:16,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13090.1, 300 sec: 13072.5). Total num frames: 26715136. Throughput: 0: 13082.6. Samples: 26687370. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:14:16,748][23556] Avg episode reward: [(0, '86.346')] [2023-03-06 17:14:16,814][23882] Updated weights for policy 0, policy_version 26090 (0.0006) [2023-03-06 17:14:17,606][23882] Updated weights for policy 0, policy_version 26100 (0.0006) [2023-03-06 17:14:18,382][23882] Updated weights for policy 0, policy_version 26110 (0.0006) [2023-03-06 17:14:19,149][23882] Updated weights for policy 0, policy_version 26120 (0.0006) [2023-03-06 17:14:19,944][23882] Updated weights for policy 0, policy_version 26130 (0.0006) [2023-03-06 17:14:20,710][23882] Updated weights for policy 0, policy_version 26140 (0.0006) [2023-03-06 17:14:21,502][23882] Updated weights for policy 0, policy_version 26150 (0.0006) [2023-03-06 17:14:21,748][23556] Fps is (10 sec: 13108.2, 60 sec: 13090.2, 300 sec: 13072.5). Total num frames: 26780672. Throughput: 0: 13071.1. Samples: 26765676. Policy #0 lag: (min: 0.0, avg: 1.3, max: 2.0) [2023-03-06 17:14:21,748][23556] Avg episode reward: [(0, '122.709')] [2023-03-06 17:14:22,292][23882] Updated weights for policy 0, policy_version 26160 (0.0006) [2023-03-06 17:14:23,077][23882] Updated weights for policy 0, policy_version 26170 (0.0006) [2023-03-06 17:14:23,854][23882] Updated weights for policy 0, policy_version 26180 (0.0008) [2023-03-06 17:14:24,653][23882] Updated weights for policy 0, policy_version 26190 (0.0006) [2023-03-06 17:14:25,449][23882] Updated weights for policy 0, policy_version 26200 (0.0006) [2023-03-06 17:14:26,221][23882] Updated weights for policy 0, policy_version 26210 (0.0006) [2023-03-06 17:14:26,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13073.1, 300 sec: 13065.5). Total num frames: 26845184. Throughput: 0: 13066.4. Samples: 26843734. Policy #0 lag: (min: 0.0, avg: 1.3, max: 2.0) [2023-03-06 17:14:26,748][23556] Avg episode reward: [(0, '100.322')] [2023-03-06 17:14:26,997][23882] Updated weights for policy 0, policy_version 26220 (0.0006) [2023-03-06 17:14:27,801][23882] Updated weights for policy 0, policy_version 26230 (0.0006) [2023-03-06 17:14:28,583][23882] Updated weights for policy 0, policy_version 26240 (0.0006) [2023-03-06 17:14:29,379][23882] Updated weights for policy 0, policy_version 26250 (0.0006) [2023-03-06 17:14:30,160][23882] Updated weights for policy 0, policy_version 26260 (0.0006) [2023-03-06 17:14:30,942][23882] Updated weights for policy 0, policy_version 26270 (0.0007) [2023-03-06 17:14:31,733][23882] Updated weights for policy 0, policy_version 26280 (0.0006) [2023-03-06 17:14:31,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13073.1, 300 sec: 13065.5). Total num frames: 26910720. Throughput: 0: 13065.6. Samples: 26882873. Policy #0 lag: (min: 0.0, avg: 1.3, max: 2.0) [2023-03-06 17:14:31,748][23556] Avg episode reward: [(0, '106.251')] [2023-03-06 17:14:32,510][23882] Updated weights for policy 0, policy_version 26290 (0.0007) [2023-03-06 17:14:33,297][23882] Updated weights for policy 0, policy_version 26300 (0.0007) [2023-03-06 17:14:34,070][23882] Updated weights for policy 0, policy_version 26310 (0.0006) [2023-03-06 17:14:34,854][23882] Updated weights for policy 0, policy_version 26320 (0.0006) [2023-03-06 17:14:35,656][23882] Updated weights for policy 0, policy_version 26330 (0.0007) [2023-03-06 17:14:36,426][23882] Updated weights for policy 0, policy_version 26340 (0.0006) [2023-03-06 17:14:36,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13069.0). Total num frames: 26976256. Throughput: 0: 13063.5. Samples: 26961184. Policy #0 lag: (min: 0.0, avg: 1.3, max: 2.0) [2023-03-06 17:14:36,748][23556] Avg episode reward: [(0, '95.262')] [2023-03-06 17:14:37,215][23882] Updated weights for policy 0, policy_version 26350 (0.0005) [2023-03-06 17:14:38,006][23882] Updated weights for policy 0, policy_version 26360 (0.0007) [2023-03-06 17:14:38,782][23882] Updated weights for policy 0, policy_version 26370 (0.0006) [2023-03-06 17:14:39,572][23882] Updated weights for policy 0, policy_version 26380 (0.0006) [2023-03-06 17:14:40,353][23882] Updated weights for policy 0, policy_version 26390 (0.0006) [2023-03-06 17:14:41,125][23882] Updated weights for policy 0, policy_version 26400 (0.0006) [2023-03-06 17:14:41,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13065.6). Total num frames: 27040768. Throughput: 0: 13063.8. Samples: 27039630. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:14:41,748][23556] Avg episode reward: [(0, '92.646')] [2023-03-06 17:14:41,922][23882] Updated weights for policy 0, policy_version 26410 (0.0006) [2023-03-06 17:14:42,690][23882] Updated weights for policy 0, policy_version 26420 (0.0006) [2023-03-06 17:14:43,457][23882] Updated weights for policy 0, policy_version 26430 (0.0006) [2023-03-06 17:14:44,226][23882] Updated weights for policy 0, policy_version 26440 (0.0008) [2023-03-06 17:14:45,033][23882] Updated weights for policy 0, policy_version 26450 (0.0006) [2023-03-06 17:14:45,819][23882] Updated weights for policy 0, policy_version 26460 (0.0007) [2023-03-06 17:14:46,594][23882] Updated weights for policy 0, policy_version 26470 (0.0006) [2023-03-06 17:14:46,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13065.5). Total num frames: 27106304. Throughput: 0: 13063.2. Samples: 27079057. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:14:46,748][23556] Avg episode reward: [(0, '110.942')] [2023-03-06 17:14:47,376][23882] Updated weights for policy 0, policy_version 26480 (0.0006) [2023-03-06 17:14:48,154][23882] Updated weights for policy 0, policy_version 26490 (0.0006) [2023-03-06 17:14:48,935][23882] Updated weights for policy 0, policy_version 26500 (0.0006) [2023-03-06 17:14:49,708][23882] Updated weights for policy 0, policy_version 26510 (0.0006) [2023-03-06 17:14:50,494][23882] Updated weights for policy 0, policy_version 26520 (0.0006) [2023-03-06 17:14:51,290][23882] Updated weights for policy 0, policy_version 26530 (0.0006) [2023-03-06 17:14:51,748][23556] Fps is (10 sec: 13209.6, 60 sec: 13090.1, 300 sec: 13069.0). Total num frames: 27172864. Throughput: 0: 13067.5. Samples: 27157738. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:14:51,759][23556] Avg episode reward: [(0, '120.099')] [2023-03-06 17:14:52,068][23882] Updated weights for policy 0, policy_version 26540 (0.0007) [2023-03-06 17:14:52,824][23882] Updated weights for policy 0, policy_version 26550 (0.0006) [2023-03-06 17:14:53,614][23882] Updated weights for policy 0, policy_version 26560 (0.0007) [2023-03-06 17:14:54,403][23882] Updated weights for policy 0, policy_version 26570 (0.0006) [2023-03-06 17:14:55,197][23882] Updated weights for policy 0, policy_version 26580 (0.0006) [2023-03-06 17:14:55,978][23882] Updated weights for policy 0, policy_version 26590 (0.0007) [2023-03-06 17:14:56,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13065.6). Total num frames: 27237376. Throughput: 0: 13066.7. Samples: 27236103. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:14:56,754][23556] Avg episode reward: [(0, '102.486')] [2023-03-06 17:14:56,769][23882] Updated weights for policy 0, policy_version 26600 (0.0007) [2023-03-06 17:14:57,531][23882] Updated weights for policy 0, policy_version 26610 (0.0007) [2023-03-06 17:14:58,299][23882] Updated weights for policy 0, policy_version 26620 (0.0006) [2023-03-06 17:14:59,077][23882] Updated weights for policy 0, policy_version 26630 (0.0006) [2023-03-06 17:14:59,857][23882] Updated weights for policy 0, policy_version 26640 (0.0006) [2023-03-06 17:15:00,649][23882] Updated weights for policy 0, policy_version 26650 (0.0007) [2023-03-06 17:15:01,432][23882] Updated weights for policy 0, policy_version 26660 (0.0007) [2023-03-06 17:15:01,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13090.2, 300 sec: 13065.5). Total num frames: 27303936. Throughput: 0: 13073.9. Samples: 27275694. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:15:01,759][23556] Avg episode reward: [(0, '96.021')] [2023-03-06 17:15:02,214][23882] Updated weights for policy 0, policy_version 26670 (0.0006) [2023-03-06 17:15:02,993][23882] Updated weights for policy 0, policy_version 26680 (0.0006) [2023-03-06 17:15:03,780][23882] Updated weights for policy 0, policy_version 26690 (0.0006) [2023-03-06 17:15:04,557][23882] Updated weights for policy 0, policy_version 26700 (0.0007) [2023-03-06 17:15:05,325][23882] Updated weights for policy 0, policy_version 26710 (0.0007) [2023-03-06 17:15:06,121][23882] Updated weights for policy 0, policy_version 26720 (0.0006) [2023-03-06 17:15:06,748][23556] Fps is (10 sec: 13209.5, 60 sec: 13090.1, 300 sec: 13069.0). Total num frames: 27369472. Throughput: 0: 13080.5. Samples: 27354298. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:15:06,759][23556] Avg episode reward: [(0, '126.396')] [2023-03-06 17:15:06,900][23882] Updated weights for policy 0, policy_version 26730 (0.0007) [2023-03-06 17:15:07,671][23882] Updated weights for policy 0, policy_version 26740 (0.0006) [2023-03-06 17:15:08,459][23882] Updated weights for policy 0, policy_version 26750 (0.0006) [2023-03-06 17:15:09,255][23882] Updated weights for policy 0, policy_version 26760 (0.0006) [2023-03-06 17:15:10,037][23882] Updated weights for policy 0, policy_version 26770 (0.0006) [2023-03-06 17:15:10,812][23882] Updated weights for policy 0, policy_version 26780 (0.0007) [2023-03-06 17:15:11,602][23882] Updated weights for policy 0, policy_version 26790 (0.0006) [2023-03-06 17:15:11,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13073.2, 300 sec: 13065.5). Total num frames: 27433984. Throughput: 0: 13088.5. Samples: 27432716. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:15:11,754][23556] Avg episode reward: [(0, '105.374')] [2023-03-06 17:15:12,390][23882] Updated weights for policy 0, policy_version 26800 (0.0006) [2023-03-06 17:15:13,174][23882] Updated weights for policy 0, policy_version 26810 (0.0006) [2023-03-06 17:15:13,962][23882] Updated weights for policy 0, policy_version 26820 (0.0006) [2023-03-06 17:15:14,774][23882] Updated weights for policy 0, policy_version 26830 (0.0007) [2023-03-06 17:15:15,555][23882] Updated weights for policy 0, policy_version 26840 (0.0007) [2023-03-06 17:15:16,347][23882] Updated weights for policy 0, policy_version 26850 (0.0006) [2023-03-06 17:15:16,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13073.1, 300 sec: 13065.5). Total num frames: 27499520. Throughput: 0: 13085.0. Samples: 27471697. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:15:16,759][23556] Avg episode reward: [(0, '105.063')] [2023-03-06 17:15:17,138][23882] Updated weights for policy 0, policy_version 26860 (0.0006) [2023-03-06 17:15:17,897][23882] Updated weights for policy 0, policy_version 26870 (0.0006) [2023-03-06 17:15:18,689][23882] Updated weights for policy 0, policy_version 26880 (0.0006) [2023-03-06 17:15:19,478][23882] Updated weights for policy 0, policy_version 26890 (0.0007) [2023-03-06 17:15:20,259][23882] Updated weights for policy 0, policy_version 26900 (0.0006) [2023-03-06 17:15:21,058][23882] Updated weights for policy 0, policy_version 26910 (0.0006) [2023-03-06 17:15:21,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13065.5). Total num frames: 27564032. Throughput: 0: 13079.4. Samples: 27549757. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:15:21,748][23556] Avg episode reward: [(0, '117.035')] [2023-03-06 17:15:21,841][23882] Updated weights for policy 0, policy_version 26920 (0.0006) [2023-03-06 17:15:22,622][23882] Updated weights for policy 0, policy_version 26930 (0.0007) [2023-03-06 17:15:23,405][23882] Updated weights for policy 0, policy_version 26940 (0.0006) [2023-03-06 17:15:24,184][23882] Updated weights for policy 0, policy_version 26950 (0.0006) [2023-03-06 17:15:24,972][23882] Updated weights for policy 0, policy_version 26960 (0.0006) [2023-03-06 17:15:25,778][23882] Updated weights for policy 0, policy_version 26970 (0.0006) [2023-03-06 17:15:26,553][23882] Updated weights for policy 0, policy_version 26980 (0.0006) [2023-03-06 17:15:26,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13073.1, 300 sec: 13065.5). Total num frames: 27629568. Throughput: 0: 13076.0. Samples: 27628049. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:15:26,748][23556] Avg episode reward: [(0, '119.379')] [2023-03-06 17:15:27,349][23882] Updated weights for policy 0, policy_version 26990 (0.0006) [2023-03-06 17:15:28,121][23882] Updated weights for policy 0, policy_version 27000 (0.0007) [2023-03-06 17:15:28,886][23882] Updated weights for policy 0, policy_version 27010 (0.0006) [2023-03-06 17:15:29,662][23882] Updated weights for policy 0, policy_version 27020 (0.0007) [2023-03-06 17:15:30,422][23882] Updated weights for policy 0, policy_version 27030 (0.0006) [2023-03-06 17:15:31,206][23882] Updated weights for policy 0, policy_version 27040 (0.0005) [2023-03-06 17:15:31,748][23556] Fps is (10 sec: 13107.0, 60 sec: 13073.0, 300 sec: 13065.5). Total num frames: 27695104. Throughput: 0: 13074.1. Samples: 27667393. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:15:31,749][23556] Avg episode reward: [(0, '124.328')] [2023-03-06 17:15:31,987][23882] Updated weights for policy 0, policy_version 27050 (0.0006) [2023-03-06 17:15:32,767][23882] Updated weights for policy 0, policy_version 27060 (0.0007) [2023-03-06 17:15:33,547][23882] Updated weights for policy 0, policy_version 27070 (0.0007) [2023-03-06 17:15:34,324][23882] Updated weights for policy 0, policy_version 27080 (0.0005) [2023-03-06 17:15:35,106][23882] Updated weights for policy 0, policy_version 27090 (0.0006) [2023-03-06 17:15:35,899][23882] Updated weights for policy 0, policy_version 27100 (0.0006) [2023-03-06 17:15:36,673][23882] Updated weights for policy 0, policy_version 27110 (0.0006) [2023-03-06 17:15:36,748][23556] Fps is (10 sec: 13209.5, 60 sec: 13090.1, 300 sec: 13069.0). Total num frames: 27761664. Throughput: 0: 13083.0. Samples: 27746474. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:15:36,748][23556] Avg episode reward: [(0, '130.938')] [2023-03-06 17:15:37,437][23882] Updated weights for policy 0, policy_version 27120 (0.0006) [2023-03-06 17:15:38,233][23882] Updated weights for policy 0, policy_version 27130 (0.0007) [2023-03-06 17:15:39,008][23882] Updated weights for policy 0, policy_version 27140 (0.0008) [2023-03-06 17:15:39,792][23882] Updated weights for policy 0, policy_version 27150 (0.0006) [2023-03-06 17:15:40,573][23882] Updated weights for policy 0, policy_version 27160 (0.0006) [2023-03-06 17:15:41,370][23882] Updated weights for policy 0, policy_version 27170 (0.0006) [2023-03-06 17:15:41,748][23556] Fps is (10 sec: 13209.9, 60 sec: 13107.2, 300 sec: 13069.0). Total num frames: 27827200. Throughput: 0: 13089.3. Samples: 27825119. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:15:41,748][23556] Avg episode reward: [(0, '135.369')] [2023-03-06 17:15:42,145][23882] Updated weights for policy 0, policy_version 27180 (0.0007) [2023-03-06 17:15:42,896][23882] Updated weights for policy 0, policy_version 27190 (0.0006) [2023-03-06 17:15:43,698][23882] Updated weights for policy 0, policy_version 27200 (0.0006) [2023-03-06 17:15:44,478][23882] Updated weights for policy 0, policy_version 27210 (0.0006) [2023-03-06 17:15:45,266][23882] Updated weights for policy 0, policy_version 27220 (0.0007) [2023-03-06 17:15:46,039][23882] Updated weights for policy 0, policy_version 27230 (0.0006) [2023-03-06 17:15:46,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13069.0). Total num frames: 27892736. Throughput: 0: 13085.0. Samples: 27864520. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:15:46,748][23556] Avg episode reward: [(0, '134.461')] [2023-03-06 17:15:46,813][23882] Updated weights for policy 0, policy_version 27240 (0.0006) [2023-03-06 17:15:47,592][23882] Updated weights for policy 0, policy_version 27250 (0.0007) [2023-03-06 17:15:48,398][23882] Updated weights for policy 0, policy_version 27260 (0.0007) [2023-03-06 17:15:49,189][23882] Updated weights for policy 0, policy_version 27270 (0.0005) [2023-03-06 17:15:49,944][23882] Updated weights for policy 0, policy_version 27280 (0.0007) [2023-03-06 17:15:50,747][23882] Updated weights for policy 0, policy_version 27290 (0.0006) [2023-03-06 17:15:51,516][23882] Updated weights for policy 0, policy_version 27300 (0.0006) [2023-03-06 17:15:51,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13073.1, 300 sec: 13065.5). Total num frames: 27957248. Throughput: 0: 13083.9. Samples: 27943075. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:15:51,748][23556] Avg episode reward: [(0, '129.174')] [2023-03-06 17:15:52,304][23882] Updated weights for policy 0, policy_version 27310 (0.0007) [2023-03-06 17:15:53,093][23882] Updated weights for policy 0, policy_version 27320 (0.0007) [2023-03-06 17:15:53,878][23882] Updated weights for policy 0, policy_version 27330 (0.0007) [2023-03-06 17:15:54,654][23882] Updated weights for policy 0, policy_version 27340 (0.0006) [2023-03-06 17:15:55,442][23882] Updated weights for policy 0, policy_version 27350 (0.0007) [2023-03-06 17:15:56,209][23882] Updated weights for policy 0, policy_version 27360 (0.0006) [2023-03-06 17:15:56,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13090.1, 300 sec: 13069.0). Total num frames: 28022784. Throughput: 0: 13085.4. Samples: 28021560. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:15:56,748][23556] Avg episode reward: [(0, '127.753')] [2023-03-06 17:15:56,753][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000027366_28022784.pth... [2023-03-06 17:15:56,787][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000024303_24886272.pth [2023-03-06 17:15:56,981][23882] Updated weights for policy 0, policy_version 27370 (0.0007) [2023-03-06 17:15:57,769][23882] Updated weights for policy 0, policy_version 27380 (0.0006) [2023-03-06 17:15:58,550][23882] Updated weights for policy 0, policy_version 27390 (0.0007) [2023-03-06 17:15:59,340][23882] Updated weights for policy 0, policy_version 27400 (0.0006) [2023-03-06 17:16:00,127][23882] Updated weights for policy 0, policy_version 27410 (0.0007) [2023-03-06 17:16:00,915][23882] Updated weights for policy 0, policy_version 27420 (0.0006) [2023-03-06 17:16:01,695][23882] Updated weights for policy 0, policy_version 27430 (0.0007) [2023-03-06 17:16:01,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13069.0). Total num frames: 28088320. Throughput: 0: 13093.4. Samples: 28060899. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:16:01,748][23556] Avg episode reward: [(0, '147.215')] [2023-03-06 17:16:02,475][23882] Updated weights for policy 0, policy_version 27440 (0.0007) [2023-03-06 17:16:03,253][23882] Updated weights for policy 0, policy_version 27450 (0.0006) [2023-03-06 17:16:04,054][23882] Updated weights for policy 0, policy_version 27460 (0.0007) [2023-03-06 17:16:04,833][23882] Updated weights for policy 0, policy_version 27470 (0.0007) [2023-03-06 17:16:05,625][23882] Updated weights for policy 0, policy_version 27480 (0.0006) [2023-03-06 17:16:06,422][23882] Updated weights for policy 0, policy_version 27490 (0.0007) [2023-03-06 17:16:06,748][23556] Fps is (10 sec: 13107.0, 60 sec: 13073.1, 300 sec: 13072.5). Total num frames: 28153856. Throughput: 0: 13093.8. Samples: 28138979. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:16:06,749][23556] Avg episode reward: [(0, '170.715')] [2023-03-06 17:16:06,752][23831] Saving new best policy, reward=170.715! [2023-03-06 17:16:07,231][23882] Updated weights for policy 0, policy_version 27500 (0.0007) [2023-03-06 17:16:07,988][23882] Updated weights for policy 0, policy_version 27510 (0.0006) [2023-03-06 17:16:08,789][23882] Updated weights for policy 0, policy_version 27520 (0.0007) [2023-03-06 17:16:09,561][23882] Updated weights for policy 0, policy_version 27530 (0.0006) [2023-03-06 17:16:10,330][23882] Updated weights for policy 0, policy_version 27540 (0.0007) [2023-03-06 17:16:11,118][23882] Updated weights for policy 0, policy_version 27550 (0.0008) [2023-03-06 17:16:11,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13090.1, 300 sec: 13072.5). Total num frames: 28219392. Throughput: 0: 13092.1. Samples: 28217196. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:16:11,748][23556] Avg episode reward: [(0, '132.553')] [2023-03-06 17:16:11,913][23882] Updated weights for policy 0, policy_version 27560 (0.0006) [2023-03-06 17:16:12,686][23882] Updated weights for policy 0, policy_version 27570 (0.0006) [2023-03-06 17:16:13,478][23882] Updated weights for policy 0, policy_version 27580 (0.0006) [2023-03-06 17:16:14,269][23882] Updated weights for policy 0, policy_version 27590 (0.0007) [2023-03-06 17:16:15,037][23882] Updated weights for policy 0, policy_version 27600 (0.0007) [2023-03-06 17:16:15,829][23882] Updated weights for policy 0, policy_version 27610 (0.0006) [2023-03-06 17:16:16,608][23882] Updated weights for policy 0, policy_version 27620 (0.0006) [2023-03-06 17:16:16,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13073.1, 300 sec: 13069.0). Total num frames: 28283904. Throughput: 0: 13091.1. Samples: 28256490. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:16:16,749][23556] Avg episode reward: [(0, '166.785')] [2023-03-06 17:16:17,392][23882] Updated weights for policy 0, policy_version 27630 (0.0006) [2023-03-06 17:16:18,181][23882] Updated weights for policy 0, policy_version 27640 (0.0006) [2023-03-06 17:16:18,951][23882] Updated weights for policy 0, policy_version 27650 (0.0006) [2023-03-06 17:16:19,746][23882] Updated weights for policy 0, policy_version 27660 (0.0006) [2023-03-06 17:16:20,531][23882] Updated weights for policy 0, policy_version 27670 (0.0006) [2023-03-06 17:16:21,304][23882] Updated weights for policy 0, policy_version 27680 (0.0007) [2023-03-06 17:16:21,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13090.1, 300 sec: 13069.0). Total num frames: 28349440. Throughput: 0: 13077.8. Samples: 28334975. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:16:21,748][23556] Avg episode reward: [(0, '161.784')] [2023-03-06 17:16:22,099][23882] Updated weights for policy 0, policy_version 27690 (0.0007) [2023-03-06 17:16:22,887][23882] Updated weights for policy 0, policy_version 27700 (0.0006) [2023-03-06 17:16:23,674][23882] Updated weights for policy 0, policy_version 27710 (0.0006) [2023-03-06 17:16:24,457][23882] Updated weights for policy 0, policy_version 27720 (0.0007) [2023-03-06 17:16:25,236][23882] Updated weights for policy 0, policy_version 27730 (0.0006) [2023-03-06 17:16:26,017][23882] Updated weights for policy 0, policy_version 27740 (0.0006) [2023-03-06 17:16:26,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13090.1, 300 sec: 13072.5). Total num frames: 28414976. Throughput: 0: 13066.3. Samples: 28413105. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:16:26,748][23556] Avg episode reward: [(0, '189.395')] [2023-03-06 17:16:26,751][23831] Saving new best policy, reward=189.395! [2023-03-06 17:16:26,817][23882] Updated weights for policy 0, policy_version 27750 (0.0007) [2023-03-06 17:16:27,612][23882] Updated weights for policy 0, policy_version 27760 (0.0006) [2023-03-06 17:16:28,387][23882] Updated weights for policy 0, policy_version 27770 (0.0006) [2023-03-06 17:16:29,165][23882] Updated weights for policy 0, policy_version 27780 (0.0006) [2023-03-06 17:16:29,956][23882] Updated weights for policy 0, policy_version 27790 (0.0008) [2023-03-06 17:16:30,730][23882] Updated weights for policy 0, policy_version 27800 (0.0007) [2023-03-06 17:16:31,496][23882] Updated weights for policy 0, policy_version 27810 (0.0006) [2023-03-06 17:16:31,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13073.1, 300 sec: 13069.0). Total num frames: 28479488. Throughput: 0: 13060.3. Samples: 28452234. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:16:31,748][23556] Avg episode reward: [(0, '189.172')] [2023-03-06 17:16:32,294][23882] Updated weights for policy 0, policy_version 27820 (0.0006) [2023-03-06 17:16:33,071][23882] Updated weights for policy 0, policy_version 27830 (0.0007) [2023-03-06 17:16:33,842][23882] Updated weights for policy 0, policy_version 27840 (0.0006) [2023-03-06 17:16:34,634][23882] Updated weights for policy 0, policy_version 27850 (0.0006) [2023-03-06 17:16:35,411][23882] Updated weights for policy 0, policy_version 27860 (0.0007) [2023-03-06 17:16:36,181][23882] Updated weights for policy 0, policy_version 27870 (0.0006) [2023-03-06 17:16:36,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13069.0). Total num frames: 28545024. Throughput: 0: 13062.3. Samples: 28530876. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:16:36,748][23556] Avg episode reward: [(0, '170.513')] [2023-03-06 17:16:36,992][23882] Updated weights for policy 0, policy_version 27880 (0.0008) [2023-03-06 17:16:37,782][23882] Updated weights for policy 0, policy_version 27890 (0.0005) [2023-03-06 17:16:38,560][23882] Updated weights for policy 0, policy_version 27900 (0.0006) [2023-03-06 17:16:39,348][23882] Updated weights for policy 0, policy_version 27910 (0.0006) [2023-03-06 17:16:40,146][23882] Updated weights for policy 0, policy_version 27920 (0.0006) [2023-03-06 17:16:40,925][23882] Updated weights for policy 0, policy_version 27930 (0.0006) [2023-03-06 17:16:41,710][23882] Updated weights for policy 0, policy_version 27940 (0.0007) [2023-03-06 17:16:41,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13069.0). Total num frames: 28610560. Throughput: 0: 13053.4. Samples: 28608963. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:16:41,748][23556] Avg episode reward: [(0, '215.470')] [2023-03-06 17:16:41,749][23831] Saving new best policy, reward=215.470! [2023-03-06 17:16:42,486][23882] Updated weights for policy 0, policy_version 27950 (0.0006) [2023-03-06 17:16:43,254][23882] Updated weights for policy 0, policy_version 27960 (0.0007) [2023-03-06 17:16:44,044][23882] Updated weights for policy 0, policy_version 27970 (0.0006) [2023-03-06 17:16:44,837][23882] Updated weights for policy 0, policy_version 27980 (0.0007) [2023-03-06 17:16:45,609][23882] Updated weights for policy 0, policy_version 27990 (0.0006) [2023-03-06 17:16:46,409][23882] Updated weights for policy 0, policy_version 28000 (0.0006) [2023-03-06 17:16:46,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13069.0). Total num frames: 28676096. Throughput: 0: 13055.1. Samples: 28648380. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:16:46,748][23556] Avg episode reward: [(0, '217.600')] [2023-03-06 17:16:46,752][23831] Saving new best policy, reward=217.600! [2023-03-06 17:16:47,202][23882] Updated weights for policy 0, policy_version 28010 (0.0007) [2023-03-06 17:16:47,970][23882] Updated weights for policy 0, policy_version 28020 (0.0007) [2023-03-06 17:16:48,766][23882] Updated weights for policy 0, policy_version 28030 (0.0006) [2023-03-06 17:16:49,546][23882] Updated weights for policy 0, policy_version 28040 (0.0006) [2023-03-06 17:16:50,318][23882] Updated weights for policy 0, policy_version 28050 (0.0006) [2023-03-06 17:16:51,112][23882] Updated weights for policy 0, policy_version 28060 (0.0006) [2023-03-06 17:16:51,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13069.0). Total num frames: 28741632. Throughput: 0: 13058.0. Samples: 28726587. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:16:51,759][23556] Avg episode reward: [(0, '221.254')] [2023-03-06 17:16:51,760][23831] Saving new best policy, reward=221.254! [2023-03-06 17:16:51,906][23882] Updated weights for policy 0, policy_version 28070 (0.0006) [2023-03-06 17:16:52,685][23882] Updated weights for policy 0, policy_version 28080 (0.0007) [2023-03-06 17:16:53,472][23882] Updated weights for policy 0, policy_version 28090 (0.0006) [2023-03-06 17:16:54,248][23882] Updated weights for policy 0, policy_version 28100 (0.0006) [2023-03-06 17:16:55,015][23882] Updated weights for policy 0, policy_version 28110 (0.0007) [2023-03-06 17:16:55,803][23882] Updated weights for policy 0, policy_version 28120 (0.0006) [2023-03-06 17:16:56,602][23882] Updated weights for policy 0, policy_version 28130 (0.0007) [2023-03-06 17:16:56,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13065.6). Total num frames: 28806144. Throughput: 0: 13065.8. Samples: 28805154. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:16:56,754][23556] Avg episode reward: [(0, '227.931')] [2023-03-06 17:16:56,758][23831] Saving new best policy, reward=227.931! [2023-03-06 17:16:57,393][23882] Updated weights for policy 0, policy_version 28140 (0.0007) [2023-03-06 17:16:58,154][23882] Updated weights for policy 0, policy_version 28150 (0.0007) [2023-03-06 17:16:58,942][23882] Updated weights for policy 0, policy_version 28160 (0.0006) [2023-03-06 17:16:59,731][23882] Updated weights for policy 0, policy_version 28170 (0.0006) [2023-03-06 17:17:00,511][23882] Updated weights for policy 0, policy_version 28180 (0.0005) [2023-03-06 17:17:01,289][23882] Updated weights for policy 0, policy_version 28190 (0.0006) [2023-03-06 17:17:01,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13069.0). Total num frames: 28871680. Throughput: 0: 13064.0. Samples: 28844371. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:17:01,748][23556] Avg episode reward: [(0, '198.295')] [2023-03-06 17:17:02,063][23882] Updated weights for policy 0, policy_version 28200 (0.0006) [2023-03-06 17:17:02,876][23882] Updated weights for policy 0, policy_version 28210 (0.0006) [2023-03-06 17:17:03,667][23882] Updated weights for policy 0, policy_version 28220 (0.0006) [2023-03-06 17:17:04,438][23882] Updated weights for policy 0, policy_version 28230 (0.0007) [2023-03-06 17:17:05,247][23882] Updated weights for policy 0, policy_version 28240 (0.0006) [2023-03-06 17:17:06,052][23882] Updated weights for policy 0, policy_version 28250 (0.0006) [2023-03-06 17:17:06,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13069.0). Total num frames: 28937216. Throughput: 0: 13048.8. Samples: 28922170. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:17:06,759][23556] Avg episode reward: [(0, '217.887')] [2023-03-06 17:17:06,828][23882] Updated weights for policy 0, policy_version 28260 (0.0006) [2023-03-06 17:17:07,613][23882] Updated weights for policy 0, policy_version 28270 (0.0006) [2023-03-06 17:17:08,395][23882] Updated weights for policy 0, policy_version 28280 (0.0005) [2023-03-06 17:17:09,185][23882] Updated weights for policy 0, policy_version 28290 (0.0006) [2023-03-06 17:17:09,967][23882] Updated weights for policy 0, policy_version 28300 (0.0006) [2023-03-06 17:17:10,739][23882] Updated weights for policy 0, policy_version 28310 (0.0006) [2023-03-06 17:17:11,555][23882] Updated weights for policy 0, policy_version 28320 (0.0006) [2023-03-06 17:17:11,748][23556] Fps is (10 sec: 13004.6, 60 sec: 13038.9, 300 sec: 13065.5). Total num frames: 29001728. Throughput: 0: 13049.2. Samples: 29000318. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:17:11,752][23556] Avg episode reward: [(0, '205.671')] [2023-03-06 17:17:12,322][23882] Updated weights for policy 0, policy_version 28330 (0.0006) [2023-03-06 17:17:13,102][23882] Updated weights for policy 0, policy_version 28340 (0.0006) [2023-03-06 17:17:13,887][23882] Updated weights for policy 0, policy_version 28350 (0.0006) [2023-03-06 17:17:14,677][23882] Updated weights for policy 0, policy_version 28360 (0.0006) [2023-03-06 17:17:15,442][23882] Updated weights for policy 0, policy_version 28370 (0.0006) [2023-03-06 17:17:16,241][23882] Updated weights for policy 0, policy_version 28380 (0.0006) [2023-03-06 17:17:16,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13069.0). Total num frames: 29067264. Throughput: 0: 13050.2. Samples: 29039492. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:17:16,759][23556] Avg episode reward: [(0, '257.712')] [2023-03-06 17:17:16,764][23831] Saving new best policy, reward=257.712! [2023-03-06 17:17:17,016][23882] Updated weights for policy 0, policy_version 28390 (0.0006) [2023-03-06 17:17:17,798][23882] Updated weights for policy 0, policy_version 28400 (0.0006) [2023-03-06 17:17:18,612][23882] Updated weights for policy 0, policy_version 28410 (0.0007) [2023-03-06 17:17:19,397][23882] Updated weights for policy 0, policy_version 28420 (0.0007) [2023-03-06 17:17:20,182][23882] Updated weights for policy 0, policy_version 28430 (0.0006) [2023-03-06 17:17:20,983][23882] Updated weights for policy 0, policy_version 28440 (0.0005) [2023-03-06 17:17:21,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13065.5). Total num frames: 29131776. Throughput: 0: 13040.6. Samples: 29117703. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:17:21,759][23556] Avg episode reward: [(0, '286.746')] [2023-03-06 17:17:21,769][23831] Saving new best policy, reward=286.746! [2023-03-06 17:17:21,770][23882] Updated weights for policy 0, policy_version 28450 (0.0006) [2023-03-06 17:17:22,549][23882] Updated weights for policy 0, policy_version 28460 (0.0006) [2023-03-06 17:17:23,331][23882] Updated weights for policy 0, policy_version 28470 (0.0006) [2023-03-06 17:17:24,118][23882] Updated weights for policy 0, policy_version 28480 (0.0007) [2023-03-06 17:17:24,898][23882] Updated weights for policy 0, policy_version 28490 (0.0006) [2023-03-06 17:17:25,694][23882] Updated weights for policy 0, policy_version 28500 (0.0006) [2023-03-06 17:17:26,494][23882] Updated weights for policy 0, policy_version 28510 (0.0007) [2023-03-06 17:17:26,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13065.5). Total num frames: 29197312. Throughput: 0: 13036.5. Samples: 29195608. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:17:26,754][23556] Avg episode reward: [(0, '315.277')] [2023-03-06 17:17:26,760][23831] Saving new best policy, reward=315.277! [2023-03-06 17:17:27,273][23882] Updated weights for policy 0, policy_version 28520 (0.0007) [2023-03-06 17:17:28,065][23882] Updated weights for policy 0, policy_version 28530 (0.0006) [2023-03-06 17:17:28,848][23882] Updated weights for policy 0, policy_version 28540 (0.0006) [2023-03-06 17:17:29,644][23882] Updated weights for policy 0, policy_version 28550 (0.0007) [2023-03-06 17:17:30,425][23882] Updated weights for policy 0, policy_version 28560 (0.0007) [2023-03-06 17:17:31,202][23882] Updated weights for policy 0, policy_version 28570 (0.0006) [2023-03-06 17:17:31,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13069.0). Total num frames: 29262848. Throughput: 0: 13028.8. Samples: 29234678. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:17:31,748][23556] Avg episode reward: [(0, '272.195')] [2023-03-06 17:17:31,969][23882] Updated weights for policy 0, policy_version 28580 (0.0005) [2023-03-06 17:17:32,750][23882] Updated weights for policy 0, policy_version 28590 (0.0007) [2023-03-06 17:17:33,527][23882] Updated weights for policy 0, policy_version 28600 (0.0006) [2023-03-06 17:17:34,305][23882] Updated weights for policy 0, policy_version 28610 (0.0006) [2023-03-06 17:17:35,075][23882] Updated weights for policy 0, policy_version 28620 (0.0006) [2023-03-06 17:17:35,873][23882] Updated weights for policy 0, policy_version 28630 (0.0006) [2023-03-06 17:17:36,661][23882] Updated weights for policy 0, policy_version 28640 (0.0006) [2023-03-06 17:17:36,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13069.0). Total num frames: 29328384. Throughput: 0: 13043.4. Samples: 29313542. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:17:36,749][23556] Avg episode reward: [(0, '209.063')] [2023-03-06 17:17:37,443][23882] Updated weights for policy 0, policy_version 28650 (0.0006) [2023-03-06 17:17:38,253][23882] Updated weights for policy 0, policy_version 28660 (0.0007) [2023-03-06 17:17:39,016][23882] Updated weights for policy 0, policy_version 28670 (0.0006) [2023-03-06 17:17:39,783][23882] Updated weights for policy 0, policy_version 28680 (0.0006) [2023-03-06 17:17:40,577][23882] Updated weights for policy 0, policy_version 28690 (0.0006) [2023-03-06 17:17:41,358][23882] Updated weights for policy 0, policy_version 28700 (0.0007) [2023-03-06 17:17:41,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13069.0). Total num frames: 29393920. Throughput: 0: 13037.2. Samples: 29391827. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:17:41,754][23556] Avg episode reward: [(0, '249.055')] [2023-03-06 17:17:42,142][23882] Updated weights for policy 0, policy_version 28710 (0.0006) [2023-03-06 17:17:42,941][23882] Updated weights for policy 0, policy_version 28720 (0.0006) [2023-03-06 17:17:43,709][23882] Updated weights for policy 0, policy_version 28730 (0.0007) [2023-03-06 17:17:44,481][23882] Updated weights for policy 0, policy_version 28740 (0.0006) [2023-03-06 17:17:45,255][23882] Updated weights for policy 0, policy_version 28750 (0.0006) [2023-03-06 17:17:46,051][23882] Updated weights for policy 0, policy_version 28760 (0.0006) [2023-03-06 17:17:46,748][23556] Fps is (10 sec: 13005.0, 60 sec: 13038.9, 300 sec: 13065.5). Total num frames: 29458432. Throughput: 0: 13038.5. Samples: 29431103. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:17:46,759][23556] Avg episode reward: [(0, '267.678')] [2023-03-06 17:17:46,830][23882] Updated weights for policy 0, policy_version 28770 (0.0007) [2023-03-06 17:17:47,626][23882] Updated weights for policy 0, policy_version 28780 (0.0007) [2023-03-06 17:17:48,404][23882] Updated weights for policy 0, policy_version 28790 (0.0006) [2023-03-06 17:17:49,177][23882] Updated weights for policy 0, policy_version 28800 (0.0006) [2023-03-06 17:17:49,970][23882] Updated weights for policy 0, policy_version 28810 (0.0006) [2023-03-06 17:17:50,760][23882] Updated weights for policy 0, policy_version 28820 (0.0007) [2023-03-06 17:17:51,532][23882] Updated weights for policy 0, policy_version 28830 (0.0007) [2023-03-06 17:17:51,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13065.6). Total num frames: 29523968. Throughput: 0: 13051.0. Samples: 29509467. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:17:51,759][23556] Avg episode reward: [(0, '305.803')] [2023-03-06 17:17:52,302][23882] Updated weights for policy 0, policy_version 28840 (0.0006) [2023-03-06 17:17:53,104][23882] Updated weights for policy 0, policy_version 28850 (0.0006) [2023-03-06 17:17:53,879][23882] Updated weights for policy 0, policy_version 28860 (0.0006) [2023-03-06 17:17:54,650][23882] Updated weights for policy 0, policy_version 28870 (0.0006) [2023-03-06 17:17:55,120][23831] KL-divergence is very high: 632.8373 [2023-03-06 17:17:55,457][23882] Updated weights for policy 0, policy_version 28880 (0.0006) [2023-03-06 17:17:56,233][23882] Updated weights for policy 0, policy_version 28890 (0.0006) [2023-03-06 17:17:56,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13069.0). Total num frames: 29589504. Throughput: 0: 13060.2. Samples: 29588028. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:17:56,754][23556] Avg episode reward: [(0, '303.963')] [2023-03-06 17:17:56,758][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000028897_29590528.pth... [2023-03-06 17:17:56,786][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000025834_26454016.pth [2023-03-06 17:17:57,018][23882] Updated weights for policy 0, policy_version 28900 (0.0006) [2023-03-06 17:17:57,805][23882] Updated weights for policy 0, policy_version 28910 (0.0006) [2023-03-06 17:17:58,592][23882] Updated weights for policy 0, policy_version 28920 (0.0006) [2023-03-06 17:17:59,357][23882] Updated weights for policy 0, policy_version 28930 (0.0006) [2023-03-06 17:18:00,143][23882] Updated weights for policy 0, policy_version 28940 (0.0006) [2023-03-06 17:18:00,923][23882] Updated weights for policy 0, policy_version 28950 (0.0006) [2023-03-06 17:18:01,388][23831] KL-divergence is very high: 5368.4102 [2023-03-06 17:18:01,704][23882] Updated weights for policy 0, policy_version 28960 (0.0007) [2023-03-06 17:18:01,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13069.0). Total num frames: 29655040. Throughput: 0: 13061.7. Samples: 29627271. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 17:18:01,759][23556] Avg episode reward: [(0, '308.196')] [2023-03-06 17:18:02,479][23882] Updated weights for policy 0, policy_version 28970 (0.0006) [2023-03-06 17:18:03,269][23882] Updated weights for policy 0, policy_version 28980 (0.0006) [2023-03-06 17:18:03,575][23831] KL-divergence is very high: 107.7005 [2023-03-06 17:18:04,040][23831] KL-divergence is very high: 192.1697 [2023-03-06 17:18:04,048][23882] Updated weights for policy 0, policy_version 28990 (0.0006) [2023-03-06 17:18:04,828][23882] Updated weights for policy 0, policy_version 29000 (0.0006) [2023-03-06 17:18:05,630][23882] Updated weights for policy 0, policy_version 29010 (0.0006) [2023-03-06 17:18:06,423][23882] Updated weights for policy 0, policy_version 29020 (0.0006) [2023-03-06 17:18:06,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13069.0). Total num frames: 29720576. Throughput: 0: 13067.2. Samples: 29705726. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 17:18:06,759][23556] Avg episode reward: [(0, '324.680')] [2023-03-06 17:18:06,763][23831] Saving new best policy, reward=324.680! [2023-03-06 17:18:07,211][23882] Updated weights for policy 0, policy_version 29030 (0.0006) [2023-03-06 17:18:07,982][23882] Updated weights for policy 0, policy_version 29040 (0.0006) [2023-03-06 17:18:08,050][23831] KL-divergence is very high: 336.8425 [2023-03-06 17:18:08,771][23882] Updated weights for policy 0, policy_version 29050 (0.0006) [2023-03-06 17:18:09,537][23882] Updated weights for policy 0, policy_version 29060 (0.0006) [2023-03-06 17:18:10,338][23882] Updated weights for policy 0, policy_version 29070 (0.0005) [2023-03-06 17:18:11,110][23882] Updated weights for policy 0, policy_version 29080 (0.0007) [2023-03-06 17:18:11,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13072.5). Total num frames: 29786112. Throughput: 0: 13080.0. Samples: 29784208. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 17:18:11,758][23556] Avg episode reward: [(0, '291.966')] [2023-03-06 17:18:11,889][23882] Updated weights for policy 0, policy_version 29090 (0.0006) [2023-03-06 17:18:12,690][23882] Updated weights for policy 0, policy_version 29100 (0.0006) [2023-03-06 17:18:13,137][23831] KL-divergence is very high: 2298.3489 [2023-03-06 17:18:13,458][23882] Updated weights for policy 0, policy_version 29110 (0.0006) [2023-03-06 17:18:14,232][23882] Updated weights for policy 0, policy_version 29120 (0.0006) [2023-03-06 17:18:15,013][23882] Updated weights for policy 0, policy_version 29130 (0.0006) [2023-03-06 17:18:15,791][23882] Updated weights for policy 0, policy_version 29140 (0.0006) [2023-03-06 17:18:16,572][23882] Updated weights for policy 0, policy_version 29150 (0.0007) [2023-03-06 17:18:16,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13072.5). Total num frames: 29851648. Throughput: 0: 13087.7. Samples: 29823626. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:18:16,748][23556] Avg episode reward: [(0, '218.763')] [2023-03-06 17:18:17,364][23882] Updated weights for policy 0, policy_version 29160 (0.0006) [2023-03-06 17:18:18,155][23882] Updated weights for policy 0, policy_version 29170 (0.0007) [2023-03-06 17:18:18,931][23882] Updated weights for policy 0, policy_version 29180 (0.0007) [2023-03-06 17:18:19,739][23882] Updated weights for policy 0, policy_version 29190 (0.0007) [2023-03-06 17:18:20,521][23882] Updated weights for policy 0, policy_version 29200 (0.0006) [2023-03-06 17:18:21,306][23882] Updated weights for policy 0, policy_version 29210 (0.0006) [2023-03-06 17:18:21,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13073.1, 300 sec: 13069.0). Total num frames: 29916160. Throughput: 0: 13071.5. Samples: 29901759. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:18:21,748][23556] Avg episode reward: [(0, '211.004')] [2023-03-06 17:18:22,109][23882] Updated weights for policy 0, policy_version 29220 (0.0006) [2023-03-06 17:18:22,885][23882] Updated weights for policy 0, policy_version 29230 (0.0007) [2023-03-06 17:18:23,660][23882] Updated weights for policy 0, policy_version 29240 (0.0007) [2023-03-06 17:18:24,446][23882] Updated weights for policy 0, policy_version 29250 (0.0006) [2023-03-06 17:18:25,225][23882] Updated weights for policy 0, policy_version 29260 (0.0006) [2023-03-06 17:18:26,008][23882] Updated weights for policy 0, policy_version 29270 (0.0008) [2023-03-06 17:18:26,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13073.1, 300 sec: 13069.0). Total num frames: 29981696. Throughput: 0: 13069.0. Samples: 29979932. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:18:26,749][23556] Avg episode reward: [(0, '180.015')] [2023-03-06 17:18:26,789][23882] Updated weights for policy 0, policy_version 29280 (0.0006) [2023-03-06 17:18:27,581][23882] Updated weights for policy 0, policy_version 29290 (0.0006) [2023-03-06 17:18:28,359][23882] Updated weights for policy 0, policy_version 29300 (0.0006) [2023-03-06 17:18:29,140][23882] Updated weights for policy 0, policy_version 29310 (0.0006) [2023-03-06 17:18:29,922][23882] Updated weights for policy 0, policy_version 29320 (0.0006) [2023-03-06 17:18:30,713][23882] Updated weights for policy 0, policy_version 29330 (0.0006) [2023-03-06 17:18:31,501][23882] Updated weights for policy 0, policy_version 29340 (0.0007) [2023-03-06 17:18:31,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13069.0). Total num frames: 30047232. Throughput: 0: 13070.9. Samples: 30019293. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:18:31,749][23556] Avg episode reward: [(0, '196.667')] [2023-03-06 17:18:32,279][23882] Updated weights for policy 0, policy_version 29350 (0.0006) [2023-03-06 17:18:33,075][23882] Updated weights for policy 0, policy_version 29360 (0.0006) [2023-03-06 17:18:33,848][23882] Updated weights for policy 0, policy_version 29370 (0.0006) [2023-03-06 17:18:34,639][23882] Updated weights for policy 0, policy_version 29380 (0.0006) [2023-03-06 17:18:35,416][23882] Updated weights for policy 0, policy_version 29390 (0.0006) [2023-03-06 17:18:36,188][23882] Updated weights for policy 0, policy_version 29400 (0.0006) [2023-03-06 17:18:36,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13069.0). Total num frames: 30112768. Throughput: 0: 13069.8. Samples: 30097612. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:18:36,749][23556] Avg episode reward: [(0, '228.187')] [2023-03-06 17:18:36,968][23882] Updated weights for policy 0, policy_version 29410 (0.0006) [2023-03-06 17:18:37,753][23882] Updated weights for policy 0, policy_version 29420 (0.0006) [2023-03-06 17:18:38,544][23882] Updated weights for policy 0, policy_version 29430 (0.0006) [2023-03-06 17:18:39,349][23882] Updated weights for policy 0, policy_version 29440 (0.0006) [2023-03-06 17:18:40,136][23882] Updated weights for policy 0, policy_version 29450 (0.0006) [2023-03-06 17:18:40,897][23882] Updated weights for policy 0, policy_version 29460 (0.0006) [2023-03-06 17:18:41,697][23882] Updated weights for policy 0, policy_version 29470 (0.0007) [2023-03-06 17:18:41,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13065.5). Total num frames: 30177280. Throughput: 0: 13066.4. Samples: 30176018. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:18:41,748][23556] Avg episode reward: [(0, '178.530')] [2023-03-06 17:18:42,471][23882] Updated weights for policy 0, policy_version 29480 (0.0006) [2023-03-06 17:18:43,255][23882] Updated weights for policy 0, policy_version 29490 (0.0006) [2023-03-06 17:18:44,049][23882] Updated weights for policy 0, policy_version 29500 (0.0006) [2023-03-06 17:18:44,841][23882] Updated weights for policy 0, policy_version 29510 (0.0006) [2023-03-06 17:18:45,629][23882] Updated weights for policy 0, policy_version 29520 (0.0007) [2023-03-06 17:18:46,413][23882] Updated weights for policy 0, policy_version 29530 (0.0006) [2023-03-06 17:18:46,748][23556] Fps is (10 sec: 13005.0, 60 sec: 13073.1, 300 sec: 13069.0). Total num frames: 30242816. Throughput: 0: 13056.9. Samples: 30214829. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:18:46,748][23556] Avg episode reward: [(0, '176.053')] [2023-03-06 17:18:47,187][23882] Updated weights for policy 0, policy_version 29540 (0.0007) [2023-03-06 17:18:47,973][23882] Updated weights for policy 0, policy_version 29550 (0.0006) [2023-03-06 17:18:48,755][23882] Updated weights for policy 0, policy_version 29560 (0.0008) [2023-03-06 17:18:49,553][23882] Updated weights for policy 0, policy_version 29570 (0.0006) [2023-03-06 17:18:50,334][23882] Updated weights for policy 0, policy_version 29580 (0.0007) [2023-03-06 17:18:51,113][23882] Updated weights for policy 0, policy_version 29590 (0.0006) [2023-03-06 17:18:51,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13069.0). Total num frames: 30308352. Throughput: 0: 13057.6. Samples: 30293316. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:18:51,748][23556] Avg episode reward: [(0, '194.590')] [2023-03-06 17:18:51,905][23882] Updated weights for policy 0, policy_version 29600 (0.0007) [2023-03-06 17:18:52,685][23882] Updated weights for policy 0, policy_version 29610 (0.0006) [2023-03-06 17:18:53,444][23882] Updated weights for policy 0, policy_version 29620 (0.0006) [2023-03-06 17:18:54,255][23882] Updated weights for policy 0, policy_version 29630 (0.0006) [2023-03-06 17:18:55,022][23882] Updated weights for policy 0, policy_version 29640 (0.0006) [2023-03-06 17:18:55,802][23882] Updated weights for policy 0, policy_version 29650 (0.0006) [2023-03-06 17:18:56,614][23882] Updated weights for policy 0, policy_version 29660 (0.0006) [2023-03-06 17:18:56,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13065.5). Total num frames: 30372864. Throughput: 0: 13055.5. Samples: 30371706. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:18:56,748][23556] Avg episode reward: [(0, '202.524')] [2023-03-06 17:18:57,391][23882] Updated weights for policy 0, policy_version 29670 (0.0007) [2023-03-06 17:18:58,174][23882] Updated weights for policy 0, policy_version 29680 (0.0006) [2023-03-06 17:18:58,962][23882] Updated weights for policy 0, policy_version 29690 (0.0006) [2023-03-06 17:18:59,741][23882] Updated weights for policy 0, policy_version 29700 (0.0006) [2023-03-06 17:19:00,517][23882] Updated weights for policy 0, policy_version 29710 (0.0007) [2023-03-06 17:19:01,313][23882] Updated weights for policy 0, policy_version 29720 (0.0007) [2023-03-06 17:19:01,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13065.6). Total num frames: 30438400. Throughput: 0: 13050.0. Samples: 30410876. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:19:01,748][23556] Avg episode reward: [(0, '162.197')] [2023-03-06 17:19:02,093][23882] Updated weights for policy 0, policy_version 29730 (0.0007) [2023-03-06 17:19:02,884][23882] Updated weights for policy 0, policy_version 29740 (0.0006) [2023-03-06 17:19:03,661][23882] Updated weights for policy 0, policy_version 29750 (0.0006) [2023-03-06 17:19:04,443][23882] Updated weights for policy 0, policy_version 29760 (0.0006) [2023-03-06 17:19:05,206][23882] Updated weights for policy 0, policy_version 29770 (0.0007) [2023-03-06 17:19:06,005][23882] Updated weights for policy 0, policy_version 29780 (0.0008) [2023-03-06 17:19:06,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13065.6). Total num frames: 30503936. Throughput: 0: 13057.4. Samples: 30489344. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:19:06,748][23556] Avg episode reward: [(0, '194.514')] [2023-03-06 17:19:06,801][23882] Updated weights for policy 0, policy_version 29790 (0.0006) [2023-03-06 17:19:07,585][23882] Updated weights for policy 0, policy_version 29800 (0.0007) [2023-03-06 17:19:08,360][23882] Updated weights for policy 0, policy_version 29810 (0.0007) [2023-03-06 17:19:09,143][23882] Updated weights for policy 0, policy_version 29820 (0.0006) [2023-03-06 17:19:09,935][23882] Updated weights for policy 0, policy_version 29830 (0.0009) [2023-03-06 17:19:10,717][23882] Updated weights for policy 0, policy_version 29840 (0.0006) [2023-03-06 17:19:11,489][23882] Updated weights for policy 0, policy_version 29850 (0.0008) [2023-03-06 17:19:11,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13065.5). Total num frames: 30569472. Throughput: 0: 13058.3. Samples: 30567554. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:19:11,748][23556] Avg episode reward: [(0, '177.879')] [2023-03-06 17:19:12,266][23882] Updated weights for policy 0, policy_version 29860 (0.0006) [2023-03-06 17:19:13,059][23882] Updated weights for policy 0, policy_version 29870 (0.0006) [2023-03-06 17:19:13,851][23882] Updated weights for policy 0, policy_version 29880 (0.0007) [2023-03-06 17:19:14,635][23882] Updated weights for policy 0, policy_version 29890 (0.0007) [2023-03-06 17:19:15,420][23882] Updated weights for policy 0, policy_version 29900 (0.0005) [2023-03-06 17:19:16,215][23882] Updated weights for policy 0, policy_version 29910 (0.0006) [2023-03-06 17:19:16,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13062.1). Total num frames: 30633984. Throughput: 0: 13050.5. Samples: 30606568. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:19:16,748][23556] Avg episode reward: [(0, '171.500')] [2023-03-06 17:19:16,998][23882] Updated weights for policy 0, policy_version 29920 (0.0007) [2023-03-06 17:19:17,783][23882] Updated weights for policy 0, policy_version 29930 (0.0006) [2023-03-06 17:19:18,558][23882] Updated weights for policy 0, policy_version 29940 (0.0006) [2023-03-06 17:19:19,349][23882] Updated weights for policy 0, policy_version 29950 (0.0007) [2023-03-06 17:19:20,135][23882] Updated weights for policy 0, policy_version 29960 (0.0007) [2023-03-06 17:19:20,898][23882] Updated weights for policy 0, policy_version 29970 (0.0005) [2023-03-06 17:19:21,678][23882] Updated weights for policy 0, policy_version 29980 (0.0006) [2023-03-06 17:19:21,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13065.5). Total num frames: 30699520. Throughput: 0: 13057.8. Samples: 30685212. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:19:21,748][23556] Avg episode reward: [(0, '132.739')] [2023-03-06 17:19:22,450][23882] Updated weights for policy 0, policy_version 29990 (0.0007) [2023-03-06 17:19:23,244][23882] Updated weights for policy 0, policy_version 30000 (0.0006) [2023-03-06 17:19:24,043][23882] Updated weights for policy 0, policy_version 30010 (0.0007) [2023-03-06 17:19:24,839][23882] Updated weights for policy 0, policy_version 30020 (0.0006) [2023-03-06 17:19:25,626][23882] Updated weights for policy 0, policy_version 30030 (0.0006) [2023-03-06 17:19:26,407][23882] Updated weights for policy 0, policy_version 30040 (0.0007) [2023-03-06 17:19:26,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13065.5). Total num frames: 30765056. Throughput: 0: 13052.2. Samples: 30763368. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:19:26,748][23556] Avg episode reward: [(0, '190.329')] [2023-03-06 17:19:27,183][23882] Updated weights for policy 0, policy_version 30050 (0.0006) [2023-03-06 17:19:27,956][23882] Updated weights for policy 0, policy_version 30060 (0.0007) [2023-03-06 17:19:28,751][23882] Updated weights for policy 0, policy_version 30070 (0.0006) [2023-03-06 17:19:29,535][23882] Updated weights for policy 0, policy_version 30080 (0.0006) [2023-03-06 17:19:30,322][23882] Updated weights for policy 0, policy_version 30090 (0.0006) [2023-03-06 17:19:31,118][23882] Updated weights for policy 0, policy_version 30100 (0.0007) [2023-03-06 17:19:31,748][23556] Fps is (10 sec: 13107.4, 60 sec: 13056.0, 300 sec: 13065.5). Total num frames: 30830592. Throughput: 0: 13063.6. Samples: 30802693. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:19:31,748][23556] Avg episode reward: [(0, '284.383')] [2023-03-06 17:19:31,898][23882] Updated weights for policy 0, policy_version 30110 (0.0006) [2023-03-06 17:19:32,701][23882] Updated weights for policy 0, policy_version 30120 (0.0006) [2023-03-06 17:19:33,482][23882] Updated weights for policy 0, policy_version 30130 (0.0006) [2023-03-06 17:19:34,269][23882] Updated weights for policy 0, policy_version 30140 (0.0006) [2023-03-06 17:19:35,035][23882] Updated weights for policy 0, policy_version 30150 (0.0006) [2023-03-06 17:19:35,834][23882] Updated weights for policy 0, policy_version 30160 (0.0006) [2023-03-06 17:19:36,614][23882] Updated weights for policy 0, policy_version 30170 (0.0006) [2023-03-06 17:19:36,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13065.5). Total num frames: 30895104. Throughput: 0: 13054.6. Samples: 30880776. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:19:36,748][23556] Avg episode reward: [(0, '263.129')] [2023-03-06 17:19:37,407][23882] Updated weights for policy 0, policy_version 30180 (0.0007) [2023-03-06 17:19:38,189][23882] Updated weights for policy 0, policy_version 30190 (0.0006) [2023-03-06 17:19:38,969][23882] Updated weights for policy 0, policy_version 30200 (0.0006) [2023-03-06 17:19:39,764][23882] Updated weights for policy 0, policy_version 30210 (0.0007) [2023-03-06 17:19:40,543][23882] Updated weights for policy 0, policy_version 30220 (0.0006) [2023-03-06 17:19:41,325][23882] Updated weights for policy 0, policy_version 30230 (0.0006) [2023-03-06 17:19:41,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13065.5). Total num frames: 30960640. Throughput: 0: 13050.7. Samples: 30958987. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:19:41,748][23556] Avg episode reward: [(0, '239.885')] [2023-03-06 17:19:42,107][23882] Updated weights for policy 0, policy_version 30240 (0.0006) [2023-03-06 17:19:42,885][23882] Updated weights for policy 0, policy_version 30250 (0.0006) [2023-03-06 17:19:43,671][23882] Updated weights for policy 0, policy_version 30260 (0.0006) [2023-03-06 17:19:44,453][23882] Updated weights for policy 0, policy_version 30270 (0.0006) [2023-03-06 17:19:45,241][23882] Updated weights for policy 0, policy_version 30280 (0.0006) [2023-03-06 17:19:46,031][23882] Updated weights for policy 0, policy_version 30290 (0.0007) [2023-03-06 17:19:46,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13062.1). Total num frames: 31026176. Throughput: 0: 13052.2. Samples: 30998225. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:19:46,748][23556] Avg episode reward: [(0, '234.760')] [2023-03-06 17:19:46,816][23882] Updated weights for policy 0, policy_version 30300 (0.0007) [2023-03-06 17:19:47,605][23882] Updated weights for policy 0, policy_version 30310 (0.0007) [2023-03-06 17:19:48,393][23882] Updated weights for policy 0, policy_version 30320 (0.0006) [2023-03-06 17:19:49,169][23882] Updated weights for policy 0, policy_version 30330 (0.0006) [2023-03-06 17:19:49,956][23882] Updated weights for policy 0, policy_version 30340 (0.0006) [2023-03-06 17:19:50,762][23882] Updated weights for policy 0, policy_version 30350 (0.0006) [2023-03-06 17:19:51,546][23882] Updated weights for policy 0, policy_version 30360 (0.0006) [2023-03-06 17:19:51,748][23556] Fps is (10 sec: 13005.0, 60 sec: 13038.9, 300 sec: 13062.1). Total num frames: 31090688. Throughput: 0: 13041.8. Samples: 31076222. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:19:51,748][23556] Avg episode reward: [(0, '262.729')] [2023-03-06 17:19:52,317][23882] Updated weights for policy 0, policy_version 30370 (0.0006) [2023-03-06 17:19:53,118][23882] Updated weights for policy 0, policy_version 30380 (0.0006) [2023-03-06 17:19:53,901][23882] Updated weights for policy 0, policy_version 30390 (0.0006) [2023-03-06 17:19:54,713][23882] Updated weights for policy 0, policy_version 30400 (0.0007) [2023-03-06 17:19:55,499][23882] Updated weights for policy 0, policy_version 30410 (0.0006) [2023-03-06 17:19:56,274][23882] Updated weights for policy 0, policy_version 30420 (0.0007) [2023-03-06 17:19:56,748][23556] Fps is (10 sec: 12902.3, 60 sec: 13038.9, 300 sec: 13055.1). Total num frames: 31155200. Throughput: 0: 13036.8. Samples: 31154212. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:19:56,748][23556] Avg episode reward: [(0, '326.172')] [2023-03-06 17:19:56,753][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000030426_31156224.pth... [2023-03-06 17:19:56,786][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000027366_28022784.pth [2023-03-06 17:19:56,789][23831] Saving new best policy, reward=326.172! [2023-03-06 17:19:57,061][23882] Updated weights for policy 0, policy_version 30430 (0.0007) [2023-03-06 17:19:57,877][23882] Updated weights for policy 0, policy_version 30440 (0.0005) [2023-03-06 17:19:58,650][23882] Updated weights for policy 0, policy_version 30450 (0.0005) [2023-03-06 17:19:59,424][23882] Updated weights for policy 0, policy_version 30460 (0.0006) [2023-03-06 17:20:00,237][23882] Updated weights for policy 0, policy_version 30470 (0.0006) [2023-03-06 17:20:01,015][23882] Updated weights for policy 0, policy_version 30480 (0.0006) [2023-03-06 17:20:01,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13055.1). Total num frames: 31220736. Throughput: 0: 13035.6. Samples: 31193167. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:20:01,748][23556] Avg episode reward: [(0, '303.973')] [2023-03-06 17:20:01,790][23882] Updated weights for policy 0, policy_version 30490 (0.0006) [2023-03-06 17:20:02,596][23882] Updated weights for policy 0, policy_version 30500 (0.0007) [2023-03-06 17:20:03,370][23882] Updated weights for policy 0, policy_version 30510 (0.0006) [2023-03-06 17:20:04,163][23882] Updated weights for policy 0, policy_version 30520 (0.0007) [2023-03-06 17:20:04,955][23882] Updated weights for policy 0, policy_version 30530 (0.0006) [2023-03-06 17:20:05,734][23882] Updated weights for policy 0, policy_version 30540 (0.0007) [2023-03-06 17:20:06,530][23882] Updated weights for policy 0, policy_version 30550 (0.0006) [2023-03-06 17:20:06,748][23556] Fps is (10 sec: 13005.0, 60 sec: 13021.9, 300 sec: 13055.1). Total num frames: 31285248. Throughput: 0: 13018.7. Samples: 31271050. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:20:06,748][23556] Avg episode reward: [(0, '315.344')] [2023-03-06 17:20:07,301][23882] Updated weights for policy 0, policy_version 30560 (0.0007) [2023-03-06 17:20:08,091][23882] Updated weights for policy 0, policy_version 30570 (0.0006) [2023-03-06 17:20:08,869][23882] Updated weights for policy 0, policy_version 30580 (0.0006) [2023-03-06 17:20:09,659][23882] Updated weights for policy 0, policy_version 30590 (0.0006) [2023-03-06 17:20:10,361][23831] KL-divergence is very high: 1146575.2500 [2023-03-06 17:20:10,419][23882] Updated weights for policy 0, policy_version 30600 (0.0006) [2023-03-06 17:20:11,181][23882] Updated weights for policy 0, policy_version 30610 (0.0007) [2023-03-06 17:20:11,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13039.0, 300 sec: 13058.6). Total num frames: 31351808. Throughput: 0: 13032.2. Samples: 31349817. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:20:11,748][23556] Avg episode reward: [(0, '263.755')] [2023-03-06 17:20:11,970][23882] Updated weights for policy 0, policy_version 30620 (0.0007) [2023-03-06 17:20:12,759][23882] Updated weights for policy 0, policy_version 30630 (0.0007) [2023-03-06 17:20:13,458][23831] KL-divergence is very high: 5509.4810 [2023-03-06 17:20:13,525][23882] Updated weights for policy 0, policy_version 30640 (0.0006) [2023-03-06 17:20:13,664][23831] KL-divergence is very high: 151.3433 [2023-03-06 17:20:14,299][23882] Updated weights for policy 0, policy_version 30650 (0.0006) [2023-03-06 17:20:15,090][23882] Updated weights for policy 0, policy_version 30660 (0.0006) [2023-03-06 17:20:15,569][23831] KL-divergence is very high: 1276.9058 [2023-03-06 17:20:15,892][23882] Updated weights for policy 0, policy_version 30670 (0.0007) [2023-03-06 17:20:16,136][23831] KL-divergence is very high: 10669.6338 [2023-03-06 17:20:16,666][23882] Updated weights for policy 0, policy_version 30680 (0.0006) [2023-03-06 17:20:16,748][23556] Fps is (10 sec: 13107.0, 60 sec: 13038.9, 300 sec: 13058.6). Total num frames: 31416320. Throughput: 0: 13032.3. Samples: 31389150. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:20:16,748][23556] Avg episode reward: [(0, '400.939')] [2023-03-06 17:20:16,768][23831] Saving new best policy, reward=400.939! [2023-03-06 17:20:17,443][23882] Updated weights for policy 0, policy_version 30690 (0.0007) [2023-03-06 17:20:18,251][23882] Updated weights for policy 0, policy_version 30700 (0.0006) [2023-03-06 17:20:19,026][23882] Updated weights for policy 0, policy_version 30710 (0.0006) [2023-03-06 17:20:19,806][23882] Updated weights for policy 0, policy_version 30720 (0.0006) [2023-03-06 17:20:20,613][23882] Updated weights for policy 0, policy_version 30730 (0.0006) [2023-03-06 17:20:20,992][23831] KL-divergence is very high: 140.4705 [2023-03-06 17:20:21,386][23882] Updated weights for policy 0, policy_version 30740 (0.0007) [2023-03-06 17:20:21,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13058.6). Total num frames: 31481856. Throughput: 0: 13033.3. Samples: 31467276. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:20:21,748][23556] Avg episode reward: [(0, '596.785')] [2023-03-06 17:20:21,749][23831] Saving new best policy, reward=596.785! [2023-03-06 17:20:22,165][23882] Updated weights for policy 0, policy_version 30750 (0.0007) [2023-03-06 17:20:22,970][23882] Updated weights for policy 0, policy_version 30760 (0.0007) [2023-03-06 17:20:23,735][23882] Updated weights for policy 0, policy_version 30770 (0.0006) [2023-03-06 17:20:24,536][23882] Updated weights for policy 0, policy_version 30780 (0.0006) [2023-03-06 17:20:25,318][23882] Updated weights for policy 0, policy_version 30790 (0.0007) [2023-03-06 17:20:26,091][23882] Updated weights for policy 0, policy_version 30800 (0.0007) [2023-03-06 17:20:26,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13038.9, 300 sec: 13058.6). Total num frames: 31547392. Throughput: 0: 13034.5. Samples: 31545539. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:20:26,748][23556] Avg episode reward: [(0, '656.939')] [2023-03-06 17:20:26,753][23831] Saving new best policy, reward=656.939! [2023-03-06 17:20:26,893][23882] Updated weights for policy 0, policy_version 30810 (0.0007) [2023-03-06 17:20:27,682][23882] Updated weights for policy 0, policy_version 30820 (0.0007) [2023-03-06 17:20:28,456][23882] Updated weights for policy 0, policy_version 30830 (0.0007) [2023-03-06 17:20:29,249][23882] Updated weights for policy 0, policy_version 30840 (0.0006) [2023-03-06 17:20:30,034][23882] Updated weights for policy 0, policy_version 30850 (0.0006) [2023-03-06 17:20:30,804][23882] Updated weights for policy 0, policy_version 30860 (0.0006) [2023-03-06 17:20:31,594][23882] Updated weights for policy 0, policy_version 30870 (0.0007) [2023-03-06 17:20:31,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13055.1). Total num frames: 31612928. Throughput: 0: 13033.6. Samples: 31584740. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:20:31,748][23556] Avg episode reward: [(0, '803.849')] [2023-03-06 17:20:31,749][23831] Saving new best policy, reward=803.849! [2023-03-06 17:20:32,370][23882] Updated weights for policy 0, policy_version 30880 (0.0006) [2023-03-06 17:20:33,155][23882] Updated weights for policy 0, policy_version 30890 (0.0006) [2023-03-06 17:20:33,938][23882] Updated weights for policy 0, policy_version 30900 (0.0006) [2023-03-06 17:20:34,724][23882] Updated weights for policy 0, policy_version 30910 (0.0007) [2023-03-06 17:20:35,517][23882] Updated weights for policy 0, policy_version 30920 (0.0006) [2023-03-06 17:20:36,290][23882] Updated weights for policy 0, policy_version 30930 (0.0006) [2023-03-06 17:20:36,748][23556] Fps is (10 sec: 13107.0, 60 sec: 13056.0, 300 sec: 13055.1). Total num frames: 31678464. Throughput: 0: 13041.9. Samples: 31663110. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-06 17:20:36,749][23556] Avg episode reward: [(0, '894.836')] [2023-03-06 17:20:36,753][23831] Saving new best policy, reward=894.836! [2023-03-06 17:20:37,086][23882] Updated weights for policy 0, policy_version 30940 (0.0006) [2023-03-06 17:20:37,838][23882] Updated weights for policy 0, policy_version 30950 (0.0006) [2023-03-06 17:20:38,627][23882] Updated weights for policy 0, policy_version 30960 (0.0006) [2023-03-06 17:20:39,425][23882] Updated weights for policy 0, policy_version 30970 (0.0006) [2023-03-06 17:20:40,185][23882] Updated weights for policy 0, policy_version 30980 (0.0007) [2023-03-06 17:20:40,968][23882] Updated weights for policy 0, policy_version 30990 (0.0007) [2023-03-06 17:20:41,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 31742976. Throughput: 0: 13056.4. Samples: 31741749. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-06 17:20:41,748][23556] Avg episode reward: [(0, '1049.887')] [2023-03-06 17:20:41,757][23831] Saving new best policy, reward=1049.887! [2023-03-06 17:20:41,758][23882] Updated weights for policy 0, policy_version 31000 (0.0006) [2023-03-06 17:20:42,548][23882] Updated weights for policy 0, policy_version 31010 (0.0006) [2023-03-06 17:20:43,080][23831] KL-divergence is very high: 87402.7891 [2023-03-06 17:20:43,323][23882] Updated weights for policy 0, policy_version 31020 (0.0006) [2023-03-06 17:20:44,120][23882] Updated weights for policy 0, policy_version 31030 (0.0006) [2023-03-06 17:20:44,906][23882] Updated weights for policy 0, policy_version 31040 (0.0006) [2023-03-06 17:20:45,693][23882] Updated weights for policy 0, policy_version 31050 (0.0006) [2023-03-06 17:20:46,493][23882] Updated weights for policy 0, policy_version 31060 (0.0006) [2023-03-06 17:20:46,748][23556] Fps is (10 sec: 13005.1, 60 sec: 13038.9, 300 sec: 13055.1). Total num frames: 31808512. Throughput: 0: 13061.5. Samples: 31780935. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-06 17:20:46,748][23556] Avg episode reward: [(0, '964.556')] [2023-03-06 17:20:47,268][23882] Updated weights for policy 0, policy_version 31070 (0.0006) [2023-03-06 17:20:48,050][23882] Updated weights for policy 0, policy_version 31080 (0.0007) [2023-03-06 17:20:48,843][23882] Updated weights for policy 0, policy_version 31090 (0.0006) [2023-03-06 17:20:49,640][23882] Updated weights for policy 0, policy_version 31100 (0.0007) [2023-03-06 17:20:50,405][23882] Updated weights for policy 0, policy_version 31110 (0.0006) [2023-03-06 17:20:51,210][23882] Updated weights for policy 0, policy_version 31120 (0.0006) [2023-03-06 17:20:51,510][23831] KL-divergence is very high: 101.1208 [2023-03-06 17:20:51,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 31873024. Throughput: 0: 13063.6. Samples: 31858915. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-06 17:20:51,748][23556] Avg episode reward: [(0, '1081.748')] [2023-03-06 17:20:51,755][23831] Saving new best policy, reward=1081.748! [2023-03-06 17:20:51,981][23882] Updated weights for policy 0, policy_version 31130 (0.0006) [2023-03-06 17:20:52,762][23882] Updated weights for policy 0, policy_version 31140 (0.0006) [2023-03-06 17:20:53,560][23882] Updated weights for policy 0, policy_version 31150 (0.0007) [2023-03-06 17:20:54,346][23882] Updated weights for policy 0, policy_version 31160 (0.0006) [2023-03-06 17:20:55,120][23882] Updated weights for policy 0, policy_version 31170 (0.0006) [2023-03-06 17:20:55,904][23882] Updated weights for policy 0, policy_version 31180 (0.0007) [2023-03-06 17:20:56,692][23882] Updated weights for policy 0, policy_version 31190 (0.0006) [2023-03-06 17:20:56,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 31938560. Throughput: 0: 13053.1. Samples: 31937208. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:20:56,748][23556] Avg episode reward: [(0, '956.288')] [2023-03-06 17:20:57,471][23882] Updated weights for policy 0, policy_version 31200 (0.0006) [2023-03-06 17:20:58,261][23882] Updated weights for policy 0, policy_version 31210 (0.0007) [2023-03-06 17:20:59,031][23882] Updated weights for policy 0, policy_version 31220 (0.0006) [2023-03-06 17:20:59,821][23882] Updated weights for policy 0, policy_version 31230 (0.0005) [2023-03-06 17:21:00,606][23882] Updated weights for policy 0, policy_version 31240 (0.0007) [2023-03-06 17:21:01,392][23882] Updated weights for policy 0, policy_version 31250 (0.0007) [2023-03-06 17:21:01,540][23831] KL-divergence is very high: 127.1441 [2023-03-06 17:21:01,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 32004096. Throughput: 0: 13054.9. Samples: 31976620. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:21:01,748][23556] Avg episode reward: [(0, '1156.661')] [2023-03-06 17:21:01,749][23831] Saving new best policy, reward=1156.661! [2023-03-06 17:21:02,171][23882] Updated weights for policy 0, policy_version 31260 (0.0008) [2023-03-06 17:21:02,933][23882] Updated weights for policy 0, policy_version 31270 (0.0006) [2023-03-06 17:21:03,731][23882] Updated weights for policy 0, policy_version 31280 (0.0006) [2023-03-06 17:21:04,507][23882] Updated weights for policy 0, policy_version 31290 (0.0007) [2023-03-06 17:21:05,272][23882] Updated weights for policy 0, policy_version 31300 (0.0006) [2023-03-06 17:21:06,070][23882] Updated weights for policy 0, policy_version 31310 (0.0006) [2023-03-06 17:21:06,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13051.7). Total num frames: 32069632. Throughput: 0: 13060.7. Samples: 32055007. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:21:06,748][23556] Avg episode reward: [(0, '1178.442')] [2023-03-06 17:21:06,752][23831] Saving new best policy, reward=1178.442! [2023-03-06 17:21:06,872][23882] Updated weights for policy 0, policy_version 31320 (0.0006) [2023-03-06 17:21:07,662][23882] Updated weights for policy 0, policy_version 31330 (0.0007) [2023-03-06 17:21:08,462][23882] Updated weights for policy 0, policy_version 31340 (0.0007) [2023-03-06 17:21:09,239][23882] Updated weights for policy 0, policy_version 31350 (0.0006) [2023-03-06 17:21:10,025][23882] Updated weights for policy 0, policy_version 31360 (0.0007) [2023-03-06 17:21:10,815][23882] Updated weights for policy 0, policy_version 31370 (0.0007) [2023-03-06 17:21:11,605][23882] Updated weights for policy 0, policy_version 31380 (0.0007) [2023-03-06 17:21:11,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 32134144. Throughput: 0: 13055.1. Samples: 32133017. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:21:11,748][23556] Avg episode reward: [(0, '1083.741')] [2023-03-06 17:21:12,394][23882] Updated weights for policy 0, policy_version 31390 (0.0006) [2023-03-06 17:21:13,153][23882] Updated weights for policy 0, policy_version 31400 (0.0006) [2023-03-06 17:21:13,953][23882] Updated weights for policy 0, policy_version 31410 (0.0006) [2023-03-06 17:21:14,737][23882] Updated weights for policy 0, policy_version 31420 (0.0006) [2023-03-06 17:21:15,542][23882] Updated weights for policy 0, policy_version 31430 (0.0006) [2023-03-06 17:21:16,321][23882] Updated weights for policy 0, policy_version 31440 (0.0006) [2023-03-06 17:21:16,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 32199680. Throughput: 0: 13053.0. Samples: 32172125. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:21:16,748][23556] Avg episode reward: [(0, '1126.824')] [2023-03-06 17:21:17,099][23882] Updated weights for policy 0, policy_version 31450 (0.0006) [2023-03-06 17:21:17,894][23882] Updated weights for policy 0, policy_version 31460 (0.0006) [2023-03-06 17:21:18,687][23882] Updated weights for policy 0, policy_version 31470 (0.0007) [2023-03-06 17:21:19,461][23882] Updated weights for policy 0, policy_version 31480 (0.0006) [2023-03-06 17:21:20,250][23882] Updated weights for policy 0, policy_version 31490 (0.0006) [2023-03-06 17:21:21,056][23882] Updated weights for policy 0, policy_version 31500 (0.0007) [2023-03-06 17:21:21,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 32265216. Throughput: 0: 13043.7. Samples: 32250074. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:21:21,748][23556] Avg episode reward: [(0, '1011.708')] [2023-03-06 17:21:21,833][23882] Updated weights for policy 0, policy_version 31510 (0.0006) [2023-03-06 17:21:22,619][23882] Updated weights for policy 0, policy_version 31520 (0.0007) [2023-03-06 17:21:23,397][23882] Updated weights for policy 0, policy_version 31530 (0.0005) [2023-03-06 17:21:24,199][23882] Updated weights for policy 0, policy_version 31540 (0.0006) [2023-03-06 17:21:24,965][23882] Updated weights for policy 0, policy_version 31550 (0.0006) [2023-03-06 17:21:25,759][23882] Updated weights for policy 0, policy_version 31560 (0.0007) [2023-03-06 17:21:26,544][23882] Updated weights for policy 0, policy_version 31570 (0.0007) [2023-03-06 17:21:26,748][23556] Fps is (10 sec: 13004.6, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 32329728. Throughput: 0: 13038.2. Samples: 32328472. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:21:26,748][23556] Avg episode reward: [(0, '1068.523')] [2023-03-06 17:21:27,315][23882] Updated weights for policy 0, policy_version 31580 (0.0006) [2023-03-06 17:21:28,086][23882] Updated weights for policy 0, policy_version 31590 (0.0006) [2023-03-06 17:21:28,872][23882] Updated weights for policy 0, policy_version 31600 (0.0006) [2023-03-06 17:21:29,256][23831] KL-divergence is very high: 405.8866 [2023-03-06 17:21:29,650][23882] Updated weights for policy 0, policy_version 31610 (0.0007) [2023-03-06 17:21:30,446][23882] Updated weights for policy 0, policy_version 31620 (0.0006) [2023-03-06 17:21:31,243][23882] Updated weights for policy 0, policy_version 31630 (0.0006) [2023-03-06 17:21:31,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 32395264. Throughput: 0: 13042.4. Samples: 32367846. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:21:31,748][23556] Avg episode reward: [(0, '975.807')] [2023-03-06 17:21:32,021][23882] Updated weights for policy 0, policy_version 31640 (0.0007) [2023-03-06 17:21:32,222][23831] KL-divergence is very high: 100.8071 [2023-03-06 17:21:32,783][23882] Updated weights for policy 0, policy_version 31650 (0.0006) [2023-03-06 17:21:33,593][23882] Updated weights for policy 0, policy_version 31660 (0.0006) [2023-03-06 17:21:34,368][23882] Updated weights for policy 0, policy_version 31670 (0.0006) [2023-03-06 17:21:35,153][23882] Updated weights for policy 0, policy_version 31680 (0.0006) [2023-03-06 17:21:35,928][23882] Updated weights for policy 0, policy_version 31690 (0.0006) [2023-03-06 17:21:36,729][23882] Updated weights for policy 0, policy_version 31700 (0.0007) [2023-03-06 17:21:36,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13039.0, 300 sec: 13051.7). Total num frames: 32460800. Throughput: 0: 13048.5. Samples: 32446097. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-06 17:21:36,748][23556] Avg episode reward: [(0, '955.565')] [2023-03-06 17:21:37,504][23882] Updated weights for policy 0, policy_version 31710 (0.0006) [2023-03-06 17:21:38,289][23882] Updated weights for policy 0, policy_version 31720 (0.0006) [2023-03-06 17:21:39,046][23882] Updated weights for policy 0, policy_version 31730 (0.0006) [2023-03-06 17:21:39,843][23882] Updated weights for policy 0, policy_version 31740 (0.0007) [2023-03-06 17:21:40,625][23882] Updated weights for policy 0, policy_version 31750 (0.0006) [2023-03-06 17:21:41,397][23882] Updated weights for policy 0, policy_version 31760 (0.0006) [2023-03-06 17:21:41,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 32526336. Throughput: 0: 13057.7. Samples: 32524805. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-06 17:21:41,754][23556] Avg episode reward: [(0, '861.941')] [2023-03-06 17:21:42,195][23882] Updated weights for policy 0, policy_version 31770 (0.0006) [2023-03-06 17:21:42,342][23831] KL-divergence is very high: 113.5399 [2023-03-06 17:21:42,972][23882] Updated weights for policy 0, policy_version 31780 (0.0006) [2023-03-06 17:21:43,116][23831] KL-divergence is very high: 139.4924 [2023-03-06 17:21:43,352][23831] KL-divergence is very high: 641.6625 [2023-03-06 17:21:43,742][23882] Updated weights for policy 0, policy_version 31790 (0.0006) [2023-03-06 17:21:44,353][23831] KL-divergence is very high: 434.0047 [2023-03-06 17:21:44,531][23882] Updated weights for policy 0, policy_version 31800 (0.0007) [2023-03-06 17:21:45,330][23882] Updated weights for policy 0, policy_version 31810 (0.0006) [2023-03-06 17:21:45,484][23831] KL-divergence is very high: 1747.6196 [2023-03-06 17:21:46,112][23882] Updated weights for policy 0, policy_version 31820 (0.0007) [2023-03-06 17:21:46,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 32591872. Throughput: 0: 13050.6. Samples: 32563896. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-06 17:21:46,748][23556] Avg episode reward: [(0, '631.268')] [2023-03-06 17:21:46,884][23882] Updated weights for policy 0, policy_version 31830 (0.0007) [2023-03-06 17:21:47,662][23882] Updated weights for policy 0, policy_version 31840 (0.0006) [2023-03-06 17:21:47,816][23831] KL-divergence is very high: 133857.2344 [2023-03-06 17:21:48,442][23882] Updated weights for policy 0, policy_version 31850 (0.0006) [2023-03-06 17:21:48,919][23831] KL-divergence is very high: 189.8705 [2023-03-06 17:21:49,235][23882] Updated weights for policy 0, policy_version 31860 (0.0006) [2023-03-06 17:21:50,014][23882] Updated weights for policy 0, policy_version 31870 (0.0006) [2023-03-06 17:21:50,803][23882] Updated weights for policy 0, policy_version 31880 (0.0007) [2023-03-06 17:21:51,591][23882] Updated weights for policy 0, policy_version 31890 (0.0006) [2023-03-06 17:21:51,670][23831] KL-divergence is very high: 13309677.0000 [2023-03-06 17:21:51,731][23831] KL-divergence is very high: 15768.0850 [2023-03-06 17:21:51,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13055.1). Total num frames: 32657408. Throughput: 0: 13052.0. Samples: 32642348. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-06 17:21:51,759][23556] Avg episode reward: [(0, '570.879')] [2023-03-06 17:21:52,295][23831] KL-divergence is very high: 210.1382 [2023-03-06 17:21:52,352][23882] Updated weights for policy 0, policy_version 31900 (0.0006) [2023-03-06 17:21:52,926][23831] KL-divergence is very high: 498970.5000 [2023-03-06 17:21:53,135][23882] Updated weights for policy 0, policy_version 31910 (0.0006) [2023-03-06 17:21:53,918][23882] Updated weights for policy 0, policy_version 31920 (0.0006) [2023-03-06 17:21:54,711][23882] Updated weights for policy 0, policy_version 31930 (0.0007) [2023-03-06 17:21:55,331][23831] KL-divergence is very high: 51648.2070 [2023-03-06 17:21:55,501][23882] Updated weights for policy 0, policy_version 31940 (0.0006) [2023-03-06 17:21:56,278][23882] Updated weights for policy 0, policy_version 31950 (0.0007) [2023-03-06 17:21:56,748][23556] Fps is (10 sec: 13107.0, 60 sec: 13073.1, 300 sec: 13055.1). Total num frames: 32722944. Throughput: 0: 13066.9. Samples: 32721026. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 17:21:56,754][23556] Avg episode reward: [(0, '448.343')] [2023-03-06 17:21:56,758][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000031956_32722944.pth... [2023-03-06 17:21:56,794][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000028897_29590528.pth [2023-03-06 17:21:57,045][23882] Updated weights for policy 0, policy_version 31960 (0.0006) [2023-03-06 17:21:57,823][23882] Updated weights for policy 0, policy_version 31970 (0.0006) [2023-03-06 17:21:58,606][23882] Updated weights for policy 0, policy_version 31980 (0.0007) [2023-03-06 17:21:59,406][23882] Updated weights for policy 0, policy_version 31990 (0.0005) [2023-03-06 17:22:00,174][23882] Updated weights for policy 0, policy_version 32000 (0.0006) [2023-03-06 17:22:00,953][23882] Updated weights for policy 0, policy_version 32010 (0.0005) [2023-03-06 17:22:01,744][23882] Updated weights for policy 0, policy_version 32020 (0.0007) [2023-03-06 17:22:01,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13055.1). Total num frames: 32788480. Throughput: 0: 13067.6. Samples: 32760164. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 17:22:01,748][23556] Avg episode reward: [(0, '469.722')] [2023-03-06 17:22:02,521][23831] KL-divergence is very high: 264.4225 [2023-03-06 17:22:02,530][23882] Updated weights for policy 0, policy_version 32030 (0.0006) [2023-03-06 17:22:03,311][23882] Updated weights for policy 0, policy_version 32040 (0.0005) [2023-03-06 17:22:03,541][23831] KL-divergence is very high: 6628.3740 [2023-03-06 17:22:04,015][23831] KL-divergence is very high: 174.0939 [2023-03-06 17:22:04,096][23882] Updated weights for policy 0, policy_version 32050 (0.0007) [2023-03-06 17:22:04,656][23831] KL-divergence is very high: 133.5934 [2023-03-06 17:22:04,881][23882] Updated weights for policy 0, policy_version 32060 (0.0006) [2023-03-06 17:22:05,672][23882] Updated weights for policy 0, policy_version 32070 (0.0006) [2023-03-06 17:22:06,208][23831] KL-divergence is very high: 61799.2930 [2023-03-06 17:22:06,446][23882] Updated weights for policy 0, policy_version 32080 (0.0007) [2023-03-06 17:22:06,748][23556] Fps is (10 sec: 13107.4, 60 sec: 13073.1, 300 sec: 13058.6). Total num frames: 32854016. Throughput: 0: 13079.5. Samples: 32838648. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 17:22:06,759][23556] Avg episode reward: [(0, '497.027')] [2023-03-06 17:22:07,214][23882] Updated weights for policy 0, policy_version 32090 (0.0006) [2023-03-06 17:22:07,363][23831] KL-divergence is very high: 623.5568 [2023-03-06 17:22:07,467][23831] KL-divergence is very high: 20815014.0000 [2023-03-06 17:22:08,023][23882] Updated weights for policy 0, policy_version 32100 (0.0007) [2023-03-06 17:22:08,793][23882] Updated weights for policy 0, policy_version 32110 (0.0008) [2023-03-06 17:22:09,583][23882] Updated weights for policy 0, policy_version 32120 (0.0007) [2023-03-06 17:22:10,371][23882] Updated weights for policy 0, policy_version 32130 (0.0006) [2023-03-06 17:22:11,162][23882] Updated weights for policy 0, policy_version 32140 (0.0006) [2023-03-06 17:22:11,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13073.1, 300 sec: 13055.1). Total num frames: 32918528. Throughput: 0: 13081.1. Samples: 32917120. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 17:22:11,754][23556] Avg episode reward: [(0, '229.151')] [2023-03-06 17:22:11,932][23882] Updated weights for policy 0, policy_version 32150 (0.0007) [2023-03-06 17:22:12,625][23831] KL-divergence is very high: 1206156.6250 [2023-03-06 17:22:12,715][23882] Updated weights for policy 0, policy_version 32160 (0.0006) [2023-03-06 17:22:13,485][23882] Updated weights for policy 0, policy_version 32170 (0.0006) [2023-03-06 17:22:14,282][23882] Updated weights for policy 0, policy_version 32180 (0.0007) [2023-03-06 17:22:15,072][23882] Updated weights for policy 0, policy_version 32190 (0.0006) [2023-03-06 17:22:15,280][23831] KL-divergence is very high: 76908.4219 [2023-03-06 17:22:15,845][23882] Updated weights for policy 0, policy_version 32200 (0.0006) [2023-03-06 17:22:16,616][23882] Updated weights for policy 0, policy_version 32210 (0.0007) [2023-03-06 17:22:16,748][23556] Fps is (10 sec: 13004.5, 60 sec: 13073.0, 300 sec: 13058.6). Total num frames: 32984064. Throughput: 0: 13079.8. Samples: 32956439. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:22:16,759][23556] Avg episode reward: [(0, '473.977')] [2023-03-06 17:22:17,414][23882] Updated weights for policy 0, policy_version 32220 (0.0007) [2023-03-06 17:22:18,179][23882] Updated weights for policy 0, policy_version 32230 (0.0005) [2023-03-06 17:22:18,870][23831] KL-divergence is very high: 257.9231 [2023-03-06 17:22:18,967][23882] Updated weights for policy 0, policy_version 32240 (0.0007) [2023-03-06 17:22:19,737][23882] Updated weights for policy 0, policy_version 32250 (0.0006) [2023-03-06 17:22:20,518][23882] Updated weights for policy 0, policy_version 32260 (0.0006) [2023-03-06 17:22:20,820][23831] KL-divergence is very high: 1256.3613 [2023-03-06 17:22:21,305][23882] Updated weights for policy 0, policy_version 32270 (0.0007) [2023-03-06 17:22:21,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13073.0, 300 sec: 13058.6). Total num frames: 33049600. Throughput: 0: 13092.4. Samples: 33035255. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:22:21,759][23556] Avg episode reward: [(0, '541.231')] [2023-03-06 17:22:22,100][23882] Updated weights for policy 0, policy_version 32280 (0.0007) [2023-03-06 17:22:22,889][23882] Updated weights for policy 0, policy_version 32290 (0.0007) [2023-03-06 17:22:23,674][23882] Updated weights for policy 0, policy_version 32300 (0.0006) [2023-03-06 17:22:24,451][23882] Updated weights for policy 0, policy_version 32310 (0.0006) [2023-03-06 17:22:25,241][23882] Updated weights for policy 0, policy_version 32320 (0.0007) [2023-03-06 17:22:26,022][23882] Updated weights for policy 0, policy_version 32330 (0.0006) [2023-03-06 17:22:26,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13090.2, 300 sec: 13058.6). Total num frames: 33115136. Throughput: 0: 13078.7. Samples: 33113349. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:22:26,754][23556] Avg episode reward: [(0, '712.917')] [2023-03-06 17:22:26,803][23882] Updated weights for policy 0, policy_version 32340 (0.0007) [2023-03-06 17:22:27,581][23882] Updated weights for policy 0, policy_version 32350 (0.0006) [2023-03-06 17:22:28,336][23882] Updated weights for policy 0, policy_version 32360 (0.0007) [2023-03-06 17:22:29,111][23882] Updated weights for policy 0, policy_version 32370 (0.0006) [2023-03-06 17:22:29,911][23882] Updated weights for policy 0, policy_version 32380 (0.0006) [2023-03-06 17:22:30,682][23882] Updated weights for policy 0, policy_version 32390 (0.0007) [2023-03-06 17:22:31,485][23882] Updated weights for policy 0, policy_version 32400 (0.0006) [2023-03-06 17:22:31,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13090.1, 300 sec: 13058.6). Total num frames: 33180672. Throughput: 0: 13089.7. Samples: 33152935. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:22:31,759][23556] Avg episode reward: [(0, '408.059')] [2023-03-06 17:22:32,261][23882] Updated weights for policy 0, policy_version 32410 (0.0007) [2023-03-06 17:22:32,812][23831] KL-divergence is very high: 118.1988 [2023-03-06 17:22:33,057][23882] Updated weights for policy 0, policy_version 32420 (0.0006) [2023-03-06 17:22:33,832][23882] Updated weights for policy 0, policy_version 32430 (0.0006) [2023-03-06 17:22:34,618][23882] Updated weights for policy 0, policy_version 32440 (0.0005) [2023-03-06 17:22:35,405][23882] Updated weights for policy 0, policy_version 32450 (0.0006) [2023-03-06 17:22:36,096][23831] KL-divergence is very high: 532.1747 [2023-03-06 17:22:36,189][23882] Updated weights for policy 0, policy_version 32460 (0.0007) [2023-03-06 17:22:36,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13090.1, 300 sec: 13058.6). Total num frames: 33246208. Throughput: 0: 13088.2. Samples: 33231319. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 17:22:36,748][23556] Avg episode reward: [(0, '394.266')] [2023-03-06 17:22:36,977][23882] Updated weights for policy 0, policy_version 32470 (0.0006) [2023-03-06 17:22:37,045][23831] KL-divergence is very high: 1927.7456 [2023-03-06 17:22:37,744][23882] Updated weights for policy 0, policy_version 32480 (0.0006) [2023-03-06 17:22:38,119][23831] KL-divergence is very high: 432.7832 [2023-03-06 17:22:38,531][23882] Updated weights for policy 0, policy_version 32490 (0.0006) [2023-03-06 17:22:39,334][23882] Updated weights for policy 0, policy_version 32500 (0.0006) [2023-03-06 17:22:40,124][23882] Updated weights for policy 0, policy_version 32510 (0.0007) [2023-03-06 17:22:40,905][23882] Updated weights for policy 0, policy_version 32520 (0.0007) [2023-03-06 17:22:41,707][23882] Updated weights for policy 0, policy_version 32530 (0.0006) [2023-03-06 17:22:41,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13073.0, 300 sec: 13058.6). Total num frames: 33310720. Throughput: 0: 13077.7. Samples: 33309524. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 17:22:41,749][23556] Avg episode reward: [(0, '281.423')] [2023-03-06 17:22:42,477][23882] Updated weights for policy 0, policy_version 32540 (0.0007) [2023-03-06 17:22:43,251][23882] Updated weights for policy 0, policy_version 32550 (0.0006) [2023-03-06 17:22:44,061][23882] Updated weights for policy 0, policy_version 32560 (0.0005) [2023-03-06 17:22:44,843][23882] Updated weights for policy 0, policy_version 32570 (0.0006) [2023-03-06 17:22:45,649][23882] Updated weights for policy 0, policy_version 32580 (0.0006) [2023-03-06 17:22:46,418][23882] Updated weights for policy 0, policy_version 32590 (0.0007) [2023-03-06 17:22:46,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13073.1, 300 sec: 13058.6). Total num frames: 33376256. Throughput: 0: 13070.2. Samples: 33348323. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 17:22:46,748][23556] Avg episode reward: [(0, '505.688')] [2023-03-06 17:22:47,182][23882] Updated weights for policy 0, policy_version 32600 (0.0006) [2023-03-06 17:22:47,972][23882] Updated weights for policy 0, policy_version 32610 (0.0006) [2023-03-06 17:22:48,783][23882] Updated weights for policy 0, policy_version 32620 (0.0006) [2023-03-06 17:22:49,564][23882] Updated weights for policy 0, policy_version 32630 (0.0007) [2023-03-06 17:22:50,349][23882] Updated weights for policy 0, policy_version 32640 (0.0006) [2023-03-06 17:22:51,129][23882] Updated weights for policy 0, policy_version 32650 (0.0007) [2023-03-06 17:22:51,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13055.1). Total num frames: 33440768. Throughput: 0: 13064.7. Samples: 33426562. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 17:22:51,748][23556] Avg episode reward: [(0, '481.838')] [2023-03-06 17:22:51,912][23882] Updated weights for policy 0, policy_version 32660 (0.0006) [2023-03-06 17:22:52,692][23882] Updated weights for policy 0, policy_version 32670 (0.0008) [2023-03-06 17:22:53,458][23882] Updated weights for policy 0, policy_version 32680 (0.0006) [2023-03-06 17:22:54,242][23882] Updated weights for policy 0, policy_version 32690 (0.0006) [2023-03-06 17:22:55,015][23882] Updated weights for policy 0, policy_version 32700 (0.0006) [2023-03-06 17:22:55,782][23882] Updated weights for policy 0, policy_version 32710 (0.0006) [2023-03-06 17:22:56,583][23882] Updated weights for policy 0, policy_version 32720 (0.0006) [2023-03-06 17:22:56,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13073.1, 300 sec: 13058.6). Total num frames: 33507328. Throughput: 0: 13072.0. Samples: 33505362. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:22:56,748][23556] Avg episode reward: [(0, '495.496')] [2023-03-06 17:22:57,364][23882] Updated weights for policy 0, policy_version 32730 (0.0006) [2023-03-06 17:22:58,142][23882] Updated weights for policy 0, policy_version 32740 (0.0006) [2023-03-06 17:22:58,932][23882] Updated weights for policy 0, policy_version 32750 (0.0006) [2023-03-06 17:22:59,700][23882] Updated weights for policy 0, policy_version 32760 (0.0006) [2023-03-06 17:23:00,485][23882] Updated weights for policy 0, policy_version 32770 (0.0006) [2023-03-06 17:23:01,258][23882] Updated weights for policy 0, policy_version 32780 (0.0006) [2023-03-06 17:23:01,748][23556] Fps is (10 sec: 13209.4, 60 sec: 13073.0, 300 sec: 13058.6). Total num frames: 33572864. Throughput: 0: 13070.8. Samples: 33544623. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:23:01,749][23556] Avg episode reward: [(0, '457.224')] [2023-03-06 17:23:02,051][23882] Updated weights for policy 0, policy_version 32790 (0.0006) [2023-03-06 17:23:02,829][23882] Updated weights for policy 0, policy_version 32800 (0.0006) [2023-03-06 17:23:03,601][23882] Updated weights for policy 0, policy_version 32810 (0.0007) [2023-03-06 17:23:04,384][23882] Updated weights for policy 0, policy_version 32820 (0.0007) [2023-03-06 17:23:05,180][23882] Updated weights for policy 0, policy_version 32830 (0.0006) [2023-03-06 17:23:05,953][23882] Updated weights for policy 0, policy_version 32840 (0.0006) [2023-03-06 17:23:06,745][23882] Updated weights for policy 0, policy_version 32850 (0.0007) [2023-03-06 17:23:06,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.0, 300 sec: 13058.6). Total num frames: 33638400. Throughput: 0: 13070.6. Samples: 33623433. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:23:06,748][23556] Avg episode reward: [(0, '662.478')] [2023-03-06 17:23:07,195][23831] KL-divergence is very high: 139.7002 [2023-03-06 17:23:07,526][23882] Updated weights for policy 0, policy_version 32860 (0.0006) [2023-03-06 17:23:08,297][23882] Updated weights for policy 0, policy_version 32870 (0.0006) [2023-03-06 17:23:09,089][23882] Updated weights for policy 0, policy_version 32880 (0.0006) [2023-03-06 17:23:09,866][23882] Updated weights for policy 0, policy_version 32890 (0.0007) [2023-03-06 17:23:10,662][23882] Updated weights for policy 0, policy_version 32900 (0.0006) [2023-03-06 17:23:11,435][23882] Updated weights for policy 0, policy_version 32910 (0.0006) [2023-03-06 17:23:11,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13090.1, 300 sec: 13058.6). Total num frames: 33703936. Throughput: 0: 13077.9. Samples: 33701855. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:23:11,748][23556] Avg episode reward: [(0, '614.538')] [2023-03-06 17:23:12,222][23882] Updated weights for policy 0, policy_version 32920 (0.0006) [2023-03-06 17:23:13,006][23882] Updated weights for policy 0, policy_version 32930 (0.0006) [2023-03-06 17:23:13,804][23882] Updated weights for policy 0, policy_version 32940 (0.0007) [2023-03-06 17:23:14,581][23882] Updated weights for policy 0, policy_version 32950 (0.0006) [2023-03-06 17:23:15,359][23882] Updated weights for policy 0, policy_version 32960 (0.0007) [2023-03-06 17:23:16,142][23882] Updated weights for policy 0, policy_version 32970 (0.0006) [2023-03-06 17:23:16,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13073.1, 300 sec: 13058.6). Total num frames: 33768448. Throughput: 0: 13066.5. Samples: 33740928. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:23:16,748][23556] Avg episode reward: [(0, '433.373')] [2023-03-06 17:23:16,924][23882] Updated weights for policy 0, policy_version 32980 (0.0006) [2023-03-06 17:23:17,705][23882] Updated weights for policy 0, policy_version 32990 (0.0006) [2023-03-06 17:23:18,498][23882] Updated weights for policy 0, policy_version 33000 (0.0006) [2023-03-06 17:23:19,287][23882] Updated weights for policy 0, policy_version 33010 (0.0006) [2023-03-06 17:23:20,059][23882] Updated weights for policy 0, policy_version 33020 (0.0006) [2023-03-06 17:23:20,430][23831] KL-divergence is very high: 278.8089 [2023-03-06 17:23:20,855][23882] Updated weights for policy 0, policy_version 33030 (0.0007) [2023-03-06 17:23:21,629][23882] Updated weights for policy 0, policy_version 33040 (0.0007) [2023-03-06 17:23:21,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13073.1, 300 sec: 13058.6). Total num frames: 33833984. Throughput: 0: 13069.6. Samples: 33819451. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:23:21,748][23556] Avg episode reward: [(0, '548.370')] [2023-03-06 17:23:22,396][23882] Updated weights for policy 0, policy_version 33050 (0.0006) [2023-03-06 17:23:23,174][23882] Updated weights for policy 0, policy_version 33060 (0.0007) [2023-03-06 17:23:23,977][23882] Updated weights for policy 0, policy_version 33070 (0.0007) [2023-03-06 17:23:24,759][23882] Updated weights for policy 0, policy_version 33080 (0.0006) [2023-03-06 17:23:25,543][23882] Updated weights for policy 0, policy_version 33090 (0.0006) [2023-03-06 17:23:26,326][23882] Updated weights for policy 0, policy_version 33100 (0.0006) [2023-03-06 17:23:26,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13073.1, 300 sec: 13058.6). Total num frames: 33899520. Throughput: 0: 13074.4. Samples: 33897873. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:23:26,749][23556] Avg episode reward: [(0, '623.242')] [2023-03-06 17:23:27,096][23882] Updated weights for policy 0, policy_version 33110 (0.0006) [2023-03-06 17:23:27,893][23882] Updated weights for policy 0, policy_version 33120 (0.0007) [2023-03-06 17:23:28,670][23882] Updated weights for policy 0, policy_version 33130 (0.0006) [2023-03-06 17:23:29,444][23882] Updated weights for policy 0, policy_version 33140 (0.0006) [2023-03-06 17:23:30,232][23882] Updated weights for policy 0, policy_version 33150 (0.0006) [2023-03-06 17:23:31,014][23882] Updated weights for policy 0, policy_version 33160 (0.0006) [2023-03-06 17:23:31,748][23556] Fps is (10 sec: 13107.4, 60 sec: 13073.1, 300 sec: 13058.6). Total num frames: 33965056. Throughput: 0: 13085.2. Samples: 33937155. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:23:31,748][23556] Avg episode reward: [(0, '600.228')] [2023-03-06 17:23:31,791][23882] Updated weights for policy 0, policy_version 33170 (0.0006) [2023-03-06 17:23:32,586][23882] Updated weights for policy 0, policy_version 33180 (0.0006) [2023-03-06 17:23:33,387][23882] Updated weights for policy 0, policy_version 33190 (0.0006) [2023-03-06 17:23:34,172][23882] Updated weights for policy 0, policy_version 33200 (0.0007) [2023-03-06 17:23:34,969][23882] Updated weights for policy 0, policy_version 33210 (0.0007) [2023-03-06 17:23:35,750][23882] Updated weights for policy 0, policy_version 33220 (0.0006) [2023-03-06 17:23:36,510][23882] Updated weights for policy 0, policy_version 33230 (0.0006) [2023-03-06 17:23:36,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 34029568. Throughput: 0: 13079.7. Samples: 34015149. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:23:36,748][23556] Avg episode reward: [(0, '751.047')] [2023-03-06 17:23:37,313][23882] Updated weights for policy 0, policy_version 33240 (0.0006) [2023-03-06 17:23:38,093][23882] Updated weights for policy 0, policy_version 33250 (0.0006) [2023-03-06 17:23:38,871][23882] Updated weights for policy 0, policy_version 33260 (0.0006) [2023-03-06 17:23:39,651][23882] Updated weights for policy 0, policy_version 33270 (0.0006) [2023-03-06 17:23:40,434][23882] Updated weights for policy 0, policy_version 33280 (0.0006) [2023-03-06 17:23:41,225][23882] Updated weights for policy 0, policy_version 33290 (0.0006) [2023-03-06 17:23:41,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13073.1, 300 sec: 13058.6). Total num frames: 34095104. Throughput: 0: 13072.3. Samples: 34093615. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-06 17:23:41,749][23556] Avg episode reward: [(0, '858.491')] [2023-03-06 17:23:42,021][23882] Updated weights for policy 0, policy_version 33300 (0.0006) [2023-03-06 17:23:42,788][23882] Updated weights for policy 0, policy_version 33310 (0.0006) [2023-03-06 17:23:43,570][23882] Updated weights for policy 0, policy_version 33320 (0.0007) [2023-03-06 17:23:44,363][23882] Updated weights for policy 0, policy_version 33330 (0.0006) [2023-03-06 17:23:45,147][23882] Updated weights for policy 0, policy_version 33340 (0.0007) [2023-03-06 17:23:45,912][23882] Updated weights for policy 0, policy_version 33350 (0.0006) [2023-03-06 17:23:46,688][23882] Updated weights for policy 0, policy_version 33360 (0.0008) [2023-03-06 17:23:46,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13058.6). Total num frames: 34160640. Throughput: 0: 13069.8. Samples: 34132763. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-06 17:23:46,748][23556] Avg episode reward: [(0, '698.834')] [2023-03-06 17:23:47,466][23882] Updated weights for policy 0, policy_version 33370 (0.0005) [2023-03-06 17:23:48,247][23882] Updated weights for policy 0, policy_version 33380 (0.0008) [2023-03-06 17:23:49,007][23882] Updated weights for policy 0, policy_version 33390 (0.0006) [2023-03-06 17:23:49,794][23882] Updated weights for policy 0, policy_version 33400 (0.0006) [2023-03-06 17:23:50,600][23882] Updated weights for policy 0, policy_version 33410 (0.0007) [2023-03-06 17:23:51,375][23882] Updated weights for policy 0, policy_version 33420 (0.0006) [2023-03-06 17:23:51,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13090.1, 300 sec: 13062.1). Total num frames: 34226176. Throughput: 0: 13069.8. Samples: 34211572. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-06 17:23:51,748][23556] Avg episode reward: [(0, '506.737')] [2023-03-06 17:23:52,169][23882] Updated weights for policy 0, policy_version 33430 (0.0007) [2023-03-06 17:23:52,949][23882] Updated weights for policy 0, policy_version 33440 (0.0007) [2023-03-06 17:23:53,744][23882] Updated weights for policy 0, policy_version 33450 (0.0006) [2023-03-06 17:23:54,515][23882] Updated weights for policy 0, policy_version 33460 (0.0007) [2023-03-06 17:23:55,297][23882] Updated weights for policy 0, policy_version 33470 (0.0006) [2023-03-06 17:23:56,085][23882] Updated weights for policy 0, policy_version 33480 (0.0007) [2023-03-06 17:23:56,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13073.0, 300 sec: 13062.1). Total num frames: 34291712. Throughput: 0: 13074.4. Samples: 34290205. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-06 17:23:56,749][23556] Avg episode reward: [(0, '598.890')] [2023-03-06 17:23:56,754][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000033488_34291712.pth... [2023-03-06 17:23:56,788][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000030426_31156224.pth [2023-03-06 17:23:56,868][23882] Updated weights for policy 0, policy_version 33490 (0.0006) [2023-03-06 17:23:57,641][23882] Updated weights for policy 0, policy_version 33500 (0.0006) [2023-03-06 17:23:58,422][23882] Updated weights for policy 0, policy_version 33510 (0.0006) [2023-03-06 17:23:59,206][23882] Updated weights for policy 0, policy_version 33520 (0.0007) [2023-03-06 17:23:59,995][23882] Updated weights for policy 0, policy_version 33530 (0.0006) [2023-03-06 17:24:00,765][23882] Updated weights for policy 0, policy_version 33540 (0.0006) [2023-03-06 17:24:01,549][23882] Updated weights for policy 0, policy_version 33550 (0.0006) [2023-03-06 17:24:01,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13073.1, 300 sec: 13062.1). Total num frames: 34357248. Throughput: 0: 13077.3. Samples: 34329409. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:24:01,748][23556] Avg episode reward: [(0, '726.446')] [2023-03-06 17:24:02,322][23882] Updated weights for policy 0, policy_version 33560 (0.0007) [2023-03-06 17:24:03,125][23882] Updated weights for policy 0, policy_version 33570 (0.0007) [2023-03-06 17:24:03,903][23882] Updated weights for policy 0, policy_version 33580 (0.0006) [2023-03-06 17:24:04,674][23882] Updated weights for policy 0, policy_version 33590 (0.0007) [2023-03-06 17:24:05,455][23882] Updated weights for policy 0, policy_version 33600 (0.0007) [2023-03-06 17:24:06,238][23882] Updated weights for policy 0, policy_version 33610 (0.0007) [2023-03-06 17:24:06,748][23556] Fps is (10 sec: 13107.4, 60 sec: 13073.1, 300 sec: 13062.1). Total num frames: 34422784. Throughput: 0: 13082.0. Samples: 34408138. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:24:06,748][23556] Avg episode reward: [(0, '639.816')] [2023-03-06 17:24:07,022][23882] Updated weights for policy 0, policy_version 33620 (0.0006) [2023-03-06 17:24:07,807][23882] Updated weights for policy 0, policy_version 33630 (0.0007) [2023-03-06 17:24:08,594][23882] Updated weights for policy 0, policy_version 33640 (0.0006) [2023-03-06 17:24:09,372][23882] Updated weights for policy 0, policy_version 33650 (0.0006) [2023-03-06 17:24:10,162][23882] Updated weights for policy 0, policy_version 33660 (0.0006) [2023-03-06 17:24:10,946][23882] Updated weights for policy 0, policy_version 33670 (0.0006) [2023-03-06 17:24:11,725][23882] Updated weights for policy 0, policy_version 33680 (0.0007) [2023-03-06 17:24:11,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13065.5). Total num frames: 34488320. Throughput: 0: 13080.1. Samples: 34486477. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:24:11,748][23556] Avg episode reward: [(0, '462.381')] [2023-03-06 17:24:12,496][23882] Updated weights for policy 0, policy_version 33690 (0.0007) [2023-03-06 17:24:13,291][23882] Updated weights for policy 0, policy_version 33700 (0.0006) [2023-03-06 17:24:14,062][23882] Updated weights for policy 0, policy_version 33710 (0.0007) [2023-03-06 17:24:14,834][23882] Updated weights for policy 0, policy_version 33720 (0.0006) [2023-03-06 17:24:15,634][23882] Updated weights for policy 0, policy_version 33730 (0.0006) [2023-03-06 17:24:16,432][23882] Updated weights for policy 0, policy_version 33740 (0.0007) [2023-03-06 17:24:16,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13090.2, 300 sec: 13065.6). Total num frames: 34553856. Throughput: 0: 13085.1. Samples: 34525984. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:24:16,748][23556] Avg episode reward: [(0, '699.816')] [2023-03-06 17:24:17,218][23882] Updated weights for policy 0, policy_version 33750 (0.0006) [2023-03-06 17:24:18,002][23882] Updated weights for policy 0, policy_version 33760 (0.0006) [2023-03-06 17:24:18,781][23882] Updated weights for policy 0, policy_version 33770 (0.0006) [2023-03-06 17:24:19,555][23882] Updated weights for policy 0, policy_version 33780 (0.0006) [2023-03-06 17:24:20,334][23882] Updated weights for policy 0, policy_version 33790 (0.0007) [2023-03-06 17:24:21,109][23882] Updated weights for policy 0, policy_version 33800 (0.0006) [2023-03-06 17:24:21,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13090.1, 300 sec: 13065.5). Total num frames: 34619392. Throughput: 0: 13095.3. Samples: 34604437. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:24:21,748][23556] Avg episode reward: [(0, '507.516')] [2023-03-06 17:24:21,884][23882] Updated weights for policy 0, policy_version 33810 (0.0006) [2023-03-06 17:24:22,666][23882] Updated weights for policy 0, policy_version 33820 (0.0006) [2023-03-06 17:24:23,454][23882] Updated weights for policy 0, policy_version 33830 (0.0006) [2023-03-06 17:24:24,236][23882] Updated weights for policy 0, policy_version 33840 (0.0008) [2023-03-06 17:24:25,010][23882] Updated weights for policy 0, policy_version 33850 (0.0007) [2023-03-06 17:24:25,786][23882] Updated weights for policy 0, policy_version 33860 (0.0006) [2023-03-06 17:24:26,575][23882] Updated weights for policy 0, policy_version 33870 (0.0007) [2023-03-06 17:24:26,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13090.1, 300 sec: 13065.5). Total num frames: 34684928. Throughput: 0: 13099.0. Samples: 34683069. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:24:26,748][23556] Avg episode reward: [(0, '451.986')] [2023-03-06 17:24:27,374][23882] Updated weights for policy 0, policy_version 33880 (0.0007) [2023-03-06 17:24:28,165][23882] Updated weights for policy 0, policy_version 33890 (0.0007) [2023-03-06 17:24:28,951][23882] Updated weights for policy 0, policy_version 33900 (0.0006) [2023-03-06 17:24:29,743][23882] Updated weights for policy 0, policy_version 33910 (0.0006) [2023-03-06 17:24:30,515][23882] Updated weights for policy 0, policy_version 33920 (0.0007) [2023-03-06 17:24:31,303][23882] Updated weights for policy 0, policy_version 33930 (0.0006) [2023-03-06 17:24:31,748][23556] Fps is (10 sec: 13107.4, 60 sec: 13090.1, 300 sec: 13069.0). Total num frames: 34750464. Throughput: 0: 13094.7. Samples: 34722024. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:24:31,748][23556] Avg episode reward: [(0, '469.592')] [2023-03-06 17:24:32,057][23882] Updated weights for policy 0, policy_version 33940 (0.0006) [2023-03-06 17:24:32,840][23882] Updated weights for policy 0, policy_version 33950 (0.0007) [2023-03-06 17:24:33,636][23882] Updated weights for policy 0, policy_version 33960 (0.0006) [2023-03-06 17:24:34,406][23882] Updated weights for policy 0, policy_version 33970 (0.0006) [2023-03-06 17:24:35,184][23882] Updated weights for policy 0, policy_version 33980 (0.0006) [2023-03-06 17:24:35,949][23882] Updated weights for policy 0, policy_version 33990 (0.0006) [2023-03-06 17:24:36,730][23882] Updated weights for policy 0, policy_version 34000 (0.0006) [2023-03-06 17:24:36,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13069.0). Total num frames: 34816000. Throughput: 0: 13094.2. Samples: 34800812. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:24:36,748][23556] Avg episode reward: [(0, '388.693')] [2023-03-06 17:24:37,524][23882] Updated weights for policy 0, policy_version 34010 (0.0007) [2023-03-06 17:24:38,221][23831] KL-divergence is very high: 799.9331 [2023-03-06 17:24:38,309][23882] Updated weights for policy 0, policy_version 34020 (0.0007) [2023-03-06 17:24:39,091][23882] Updated weights for policy 0, policy_version 34030 (0.0007) [2023-03-06 17:24:39,866][23882] Updated weights for policy 0, policy_version 34040 (0.0006) [2023-03-06 17:24:40,656][23882] Updated weights for policy 0, policy_version 34050 (0.0007) [2023-03-06 17:24:41,430][23882] Updated weights for policy 0, policy_version 34060 (0.0006) [2023-03-06 17:24:41,748][23556] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13069.0). Total num frames: 34881536. Throughput: 0: 13097.4. Samples: 34879588. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:24:41,748][23556] Avg episode reward: [(0, '255.452')] [2023-03-06 17:24:42,212][23882] Updated weights for policy 0, policy_version 34070 (0.0007) [2023-03-06 17:24:42,994][23882] Updated weights for policy 0, policy_version 34080 (0.0006) [2023-03-06 17:24:43,772][23882] Updated weights for policy 0, policy_version 34090 (0.0006) [2023-03-06 17:24:44,560][23882] Updated weights for policy 0, policy_version 34100 (0.0006) [2023-03-06 17:24:45,336][23882] Updated weights for policy 0, policy_version 34110 (0.0006) [2023-03-06 17:24:46,114][23882] Updated weights for policy 0, policy_version 34120 (0.0007) [2023-03-06 17:24:46,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13090.1, 300 sec: 13069.0). Total num frames: 34946048. Throughput: 0: 13097.7. Samples: 34918807. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:24:46,748][23556] Avg episode reward: [(0, '496.436')] [2023-03-06 17:24:46,915][23882] Updated weights for policy 0, policy_version 34130 (0.0007) [2023-03-06 17:24:47,703][23882] Updated weights for policy 0, policy_version 34140 (0.0006) [2023-03-06 17:24:48,473][23882] Updated weights for policy 0, policy_version 34150 (0.0006) [2023-03-06 17:24:49,249][23882] Updated weights for policy 0, policy_version 34160 (0.0006) [2023-03-06 17:24:50,029][23882] Updated weights for policy 0, policy_version 34170 (0.0006) [2023-03-06 17:24:50,810][23882] Updated weights for policy 0, policy_version 34180 (0.0006) [2023-03-06 17:24:51,591][23882] Updated weights for policy 0, policy_version 34190 (0.0006) [2023-03-06 17:24:51,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13090.1, 300 sec: 13072.5). Total num frames: 35011584. Throughput: 0: 13097.2. Samples: 34997512. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:24:51,748][23556] Avg episode reward: [(0, '591.515')] [2023-03-06 17:24:52,377][23882] Updated weights for policy 0, policy_version 34200 (0.0007) [2023-03-06 17:24:53,153][23882] Updated weights for policy 0, policy_version 34210 (0.0006) [2023-03-06 17:24:53,944][23882] Updated weights for policy 0, policy_version 34220 (0.0008) [2023-03-06 17:24:54,737][23882] Updated weights for policy 0, policy_version 34230 (0.0007) [2023-03-06 17:24:55,513][23882] Updated weights for policy 0, policy_version 34240 (0.0006) [2023-03-06 17:24:56,295][23882] Updated weights for policy 0, policy_version 34250 (0.0006) [2023-03-06 17:24:56,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13090.1, 300 sec: 13072.5). Total num frames: 35077120. Throughput: 0: 13099.5. Samples: 35075956. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:24:56,748][23556] Avg episode reward: [(0, '726.701')] [2023-03-06 17:24:57,074][23882] Updated weights for policy 0, policy_version 34260 (0.0005) [2023-03-06 17:24:57,858][23882] Updated weights for policy 0, policy_version 34270 (0.0005) [2023-03-06 17:24:58,659][23882] Updated weights for policy 0, policy_version 34280 (0.0006) [2023-03-06 17:24:59,428][23882] Updated weights for policy 0, policy_version 34290 (0.0007) [2023-03-06 17:25:00,194][23882] Updated weights for policy 0, policy_version 34300 (0.0006) [2023-03-06 17:25:00,982][23882] Updated weights for policy 0, policy_version 34310 (0.0006) [2023-03-06 17:25:01,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13090.1, 300 sec: 13076.0). Total num frames: 35142656. Throughput: 0: 13090.6. Samples: 35115063. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:25:01,748][23556] Avg episode reward: [(0, '597.665')] [2023-03-06 17:25:01,759][23882] Updated weights for policy 0, policy_version 34320 (0.0006) [2023-03-06 17:25:02,565][23882] Updated weights for policy 0, policy_version 34330 (0.0006) [2023-03-06 17:25:03,343][23882] Updated weights for policy 0, policy_version 34340 (0.0006) [2023-03-06 17:25:04,122][23882] Updated weights for policy 0, policy_version 34350 (0.0007) [2023-03-06 17:25:04,921][23882] Updated weights for policy 0, policy_version 34360 (0.0007) [2023-03-06 17:25:05,701][23882] Updated weights for policy 0, policy_version 34370 (0.0006) [2023-03-06 17:25:06,487][23882] Updated weights for policy 0, policy_version 34380 (0.0006) [2023-03-06 17:25:06,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13090.1, 300 sec: 13072.5). Total num frames: 35208192. Throughput: 0: 13089.4. Samples: 35193459. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:25:06,748][23556] Avg episode reward: [(0, '512.140')] [2023-03-06 17:25:07,272][23882] Updated weights for policy 0, policy_version 34390 (0.0006) [2023-03-06 17:25:08,050][23882] Updated weights for policy 0, policy_version 34400 (0.0007) [2023-03-06 17:25:08,836][23882] Updated weights for policy 0, policy_version 34410 (0.0007) [2023-03-06 17:25:09,619][23882] Updated weights for policy 0, policy_version 34420 (0.0007) [2023-03-06 17:25:10,399][23882] Updated weights for policy 0, policy_version 34430 (0.0007) [2023-03-06 17:25:11,200][23882] Updated weights for policy 0, policy_version 34440 (0.0007) [2023-03-06 17:25:11,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13090.1, 300 sec: 13076.0). Total num frames: 35273728. Throughput: 0: 13082.5. Samples: 35271781. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:25:11,748][23556] Avg episode reward: [(0, '232.913')] [2023-03-06 17:25:11,967][23882] Updated weights for policy 0, policy_version 34450 (0.0006) [2023-03-06 17:25:12,739][23882] Updated weights for policy 0, policy_version 34460 (0.0007) [2023-03-06 17:25:13,526][23882] Updated weights for policy 0, policy_version 34470 (0.0007) [2023-03-06 17:25:14,299][23882] Updated weights for policy 0, policy_version 34480 (0.0007) [2023-03-06 17:25:15,091][23882] Updated weights for policy 0, policy_version 34490 (0.0007) [2023-03-06 17:25:15,885][23882] Updated weights for policy 0, policy_version 34500 (0.0006) [2023-03-06 17:25:16,668][23882] Updated weights for policy 0, policy_version 34510 (0.0006) [2023-03-06 17:25:16,748][23556] Fps is (10 sec: 13107.0, 60 sec: 13090.1, 300 sec: 13076.0). Total num frames: 35339264. Throughput: 0: 13096.2. Samples: 35311355. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:25:16,749][23556] Avg episode reward: [(0, '278.502')] [2023-03-06 17:25:17,458][23882] Updated weights for policy 0, policy_version 34520 (0.0005) [2023-03-06 17:25:18,242][23882] Updated weights for policy 0, policy_version 34530 (0.0006) [2023-03-06 17:25:19,018][23882] Updated weights for policy 0, policy_version 34540 (0.0007) [2023-03-06 17:25:19,800][23882] Updated weights for policy 0, policy_version 34550 (0.0006) [2023-03-06 17:25:20,592][23882] Updated weights for policy 0, policy_version 34560 (0.0006) [2023-03-06 17:25:21,351][23882] Updated weights for policy 0, policy_version 34570 (0.0007) [2023-03-06 17:25:21,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13073.1, 300 sec: 13072.5). Total num frames: 35403776. Throughput: 0: 13077.0. Samples: 35389280. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:25:21,748][23556] Avg episode reward: [(0, '459.955')] [2023-03-06 17:25:22,153][23882] Updated weights for policy 0, policy_version 34580 (0.0007) [2023-03-06 17:25:22,934][23882] Updated weights for policy 0, policy_version 34590 (0.0006) [2023-03-06 17:25:23,714][23882] Updated weights for policy 0, policy_version 34600 (0.0006) [2023-03-06 17:25:24,496][23882] Updated weights for policy 0, policy_version 34610 (0.0007) [2023-03-06 17:25:25,278][23882] Updated weights for policy 0, policy_version 34620 (0.0006) [2023-03-06 17:25:26,050][23882] Updated weights for policy 0, policy_version 34630 (0.0006) [2023-03-06 17:25:26,748][23556] Fps is (10 sec: 13005.0, 60 sec: 13073.1, 300 sec: 13072.5). Total num frames: 35469312. Throughput: 0: 13080.3. Samples: 35468203. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:25:26,748][23556] Avg episode reward: [(0, '300.063')] [2023-03-06 17:25:26,831][23882] Updated weights for policy 0, policy_version 34640 (0.0006) [2023-03-06 17:25:27,641][23882] Updated weights for policy 0, policy_version 34650 (0.0005) [2023-03-06 17:25:28,419][23882] Updated weights for policy 0, policy_version 34660 (0.0006) [2023-03-06 17:25:29,203][23882] Updated weights for policy 0, policy_version 34670 (0.0006) [2023-03-06 17:25:29,978][23882] Updated weights for policy 0, policy_version 34680 (0.0006) [2023-03-06 17:25:30,765][23882] Updated weights for policy 0, policy_version 34690 (0.0006) [2023-03-06 17:25:31,546][23882] Updated weights for policy 0, policy_version 34700 (0.0007) [2023-03-06 17:25:31,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13073.0, 300 sec: 13072.5). Total num frames: 35534848. Throughput: 0: 13076.5. Samples: 35507247. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:25:31,748][23556] Avg episode reward: [(0, '213.367')] [2023-03-06 17:25:32,313][23882] Updated weights for policy 0, policy_version 34710 (0.0006) [2023-03-06 17:25:33,094][23882] Updated weights for policy 0, policy_version 34720 (0.0007) [2023-03-06 17:25:33,874][23882] Updated weights for policy 0, policy_version 34730 (0.0007) [2023-03-06 17:25:34,654][23882] Updated weights for policy 0, policy_version 34740 (0.0006) [2023-03-06 17:25:35,463][23882] Updated weights for policy 0, policy_version 34750 (0.0007) [2023-03-06 17:25:36,057][23831] KL-divergence is very high: 1765.9248 [2023-03-06 17:25:36,221][23882] Updated weights for policy 0, policy_version 34760 (0.0006) [2023-03-06 17:25:36,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13076.0). Total num frames: 35600384. Throughput: 0: 13071.5. Samples: 35585729. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:25:36,748][23556] Avg episode reward: [(0, '404.321')] [2023-03-06 17:25:36,998][23882] Updated weights for policy 0, policy_version 34770 (0.0006) [2023-03-06 17:25:37,793][23882] Updated weights for policy 0, policy_version 34780 (0.0007) [2023-03-06 17:25:38,575][23882] Updated weights for policy 0, policy_version 34790 (0.0006) [2023-03-06 17:25:39,364][23882] Updated weights for policy 0, policy_version 34800 (0.0008) [2023-03-06 17:25:40,140][23882] Updated weights for policy 0, policy_version 34810 (0.0006) [2023-03-06 17:25:40,915][23882] Updated weights for policy 0, policy_version 34820 (0.0006) [2023-03-06 17:25:41,712][23882] Updated weights for policy 0, policy_version 34830 (0.0007) [2023-03-06 17:25:41,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13076.0). Total num frames: 35665920. Throughput: 0: 13075.5. Samples: 35664355. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:25:41,748][23556] Avg episode reward: [(0, '300.991')] [2023-03-06 17:25:42,498][23882] Updated weights for policy 0, policy_version 34840 (0.0006) [2023-03-06 17:25:43,266][23882] Updated weights for policy 0, policy_version 34850 (0.0007) [2023-03-06 17:25:44,053][23882] Updated weights for policy 0, policy_version 34860 (0.0006) [2023-03-06 17:25:44,840][23882] Updated weights for policy 0, policy_version 34870 (0.0007) [2023-03-06 17:25:45,001][23831] KL-divergence is very high: 1167.3922 [2023-03-06 17:25:45,632][23882] Updated weights for policy 0, policy_version 34880 (0.0006) [2023-03-06 17:25:46,419][23882] Updated weights for policy 0, policy_version 34890 (0.0006) [2023-03-06 17:25:46,748][23556] Fps is (10 sec: 13106.9, 60 sec: 13090.1, 300 sec: 13079.4). Total num frames: 35731456. Throughput: 0: 13080.3. Samples: 35703678. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:25:46,749][23556] Avg episode reward: [(0, '278.562')] [2023-03-06 17:25:47,210][23882] Updated weights for policy 0, policy_version 34900 (0.0006) [2023-03-06 17:25:47,981][23882] Updated weights for policy 0, policy_version 34910 (0.0007) [2023-03-06 17:25:48,763][23882] Updated weights for policy 0, policy_version 34920 (0.0006) [2023-03-06 17:25:49,546][23882] Updated weights for policy 0, policy_version 34930 (0.0006) [2023-03-06 17:25:50,333][23882] Updated weights for policy 0, policy_version 34940 (0.0007) [2023-03-06 17:25:51,117][23882] Updated weights for policy 0, policy_version 34950 (0.0006) [2023-03-06 17:25:51,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13090.1, 300 sec: 13079.4). Total num frames: 35796992. Throughput: 0: 13076.7. Samples: 35781914. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:25:51,749][23556] Avg episode reward: [(0, '247.507')] [2023-03-06 17:25:51,886][23882] Updated weights for policy 0, policy_version 34960 (0.0006) [2023-03-06 17:25:52,674][23882] Updated weights for policy 0, policy_version 34970 (0.0008) [2023-03-06 17:25:53,461][23882] Updated weights for policy 0, policy_version 34980 (0.0006) [2023-03-06 17:25:54,225][23882] Updated weights for policy 0, policy_version 34990 (0.0007) [2023-03-06 17:25:55,013][23882] Updated weights for policy 0, policy_version 35000 (0.0006) [2023-03-06 17:25:55,815][23882] Updated weights for policy 0, policy_version 35010 (0.0006) [2023-03-06 17:25:56,604][23882] Updated weights for policy 0, policy_version 35020 (0.0007) [2023-03-06 17:25:56,748][23556] Fps is (10 sec: 13107.6, 60 sec: 13090.2, 300 sec: 13079.4). Total num frames: 35862528. Throughput: 0: 13080.5. Samples: 35860403. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:25:56,748][23556] Avg episode reward: [(0, '322.795')] [2023-03-06 17:25:56,751][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000035022_35862528.pth... [2023-03-06 17:25:56,780][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000031956_32722944.pth [2023-03-06 17:25:57,370][23882] Updated weights for policy 0, policy_version 35030 (0.0006) [2023-03-06 17:25:58,163][23882] Updated weights for policy 0, policy_version 35040 (0.0006) [2023-03-06 17:25:58,953][23882] Updated weights for policy 0, policy_version 35050 (0.0007) [2023-03-06 17:25:59,743][23882] Updated weights for policy 0, policy_version 35060 (0.0007) [2023-03-06 17:26:00,538][23882] Updated weights for policy 0, policy_version 35070 (0.0006) [2023-03-06 17:26:01,312][23882] Updated weights for policy 0, policy_version 35080 (0.0007) [2023-03-06 17:26:01,748][23556] Fps is (10 sec: 13005.0, 60 sec: 13073.1, 300 sec: 13076.0). Total num frames: 35927040. Throughput: 0: 13067.7. Samples: 35899401. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:26:01,748][23556] Avg episode reward: [(0, '362.853')] [2023-03-06 17:26:02,086][23882] Updated weights for policy 0, policy_version 35090 (0.0006) [2023-03-06 17:26:02,870][23882] Updated weights for policy 0, policy_version 35100 (0.0006) [2023-03-06 17:26:03,646][23882] Updated weights for policy 0, policy_version 35110 (0.0006) [2023-03-06 17:26:04,435][23882] Updated weights for policy 0, policy_version 35120 (0.0006) [2023-03-06 17:26:05,201][23882] Updated weights for policy 0, policy_version 35130 (0.0008) [2023-03-06 17:26:05,976][23882] Updated weights for policy 0, policy_version 35140 (0.0006) [2023-03-06 17:26:06,748][23556] Fps is (10 sec: 13004.6, 60 sec: 13073.1, 300 sec: 13079.4). Total num frames: 35992576. Throughput: 0: 13083.0. Samples: 35978014. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:26:06,748][23556] Avg episode reward: [(0, '329.111')] [2023-03-06 17:26:06,775][23882] Updated weights for policy 0, policy_version 35150 (0.0007) [2023-03-06 17:26:07,555][23882] Updated weights for policy 0, policy_version 35160 (0.0006) [2023-03-06 17:26:08,347][23882] Updated weights for policy 0, policy_version 35170 (0.0006) [2023-03-06 17:26:09,138][23882] Updated weights for policy 0, policy_version 35180 (0.0005) [2023-03-06 17:26:09,935][23882] Updated weights for policy 0, policy_version 35190 (0.0006) [2023-03-06 17:26:10,713][23882] Updated weights for policy 0, policy_version 35200 (0.0006) [2023-03-06 17:26:11,498][23882] Updated weights for policy 0, policy_version 35210 (0.0006) [2023-03-06 17:26:11,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13073.1, 300 sec: 13079.4). Total num frames: 36058112. Throughput: 0: 13070.0. Samples: 36056354. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:26:11,748][23556] Avg episode reward: [(0, '332.160')] [2023-03-06 17:26:12,276][23882] Updated weights for policy 0, policy_version 35220 (0.0005) [2023-03-06 17:26:13,069][23882] Updated weights for policy 0, policy_version 35230 (0.0006) [2023-03-06 17:26:13,837][23882] Updated weights for policy 0, policy_version 35240 (0.0006) [2023-03-06 17:26:14,635][23882] Updated weights for policy 0, policy_version 35250 (0.0006) [2023-03-06 17:26:15,432][23882] Updated weights for policy 0, policy_version 35260 (0.0006) [2023-03-06 17:26:16,206][23882] Updated weights for policy 0, policy_version 35270 (0.0006) [2023-03-06 17:26:16,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13076.0). Total num frames: 36122624. Throughput: 0: 13072.2. Samples: 36095496. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:26:16,748][23556] Avg episode reward: [(0, '199.268')] [2023-03-06 17:26:17,008][23882] Updated weights for policy 0, policy_version 35280 (0.0006) [2023-03-06 17:26:17,795][23882] Updated weights for policy 0, policy_version 35290 (0.0006) [2023-03-06 17:26:18,578][23882] Updated weights for policy 0, policy_version 35300 (0.0006) [2023-03-06 17:26:19,389][23882] Updated weights for policy 0, policy_version 35310 (0.0007) [2023-03-06 17:26:20,150][23882] Updated weights for policy 0, policy_version 35320 (0.0006) [2023-03-06 17:26:20,922][23882] Updated weights for policy 0, policy_version 35330 (0.0006) [2023-03-06 17:26:21,717][23882] Updated weights for policy 0, policy_version 35340 (0.0006) [2023-03-06 17:26:21,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13073.1, 300 sec: 13079.4). Total num frames: 36188160. Throughput: 0: 13064.9. Samples: 36173651. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:26:21,748][23556] Avg episode reward: [(0, '252.765')] [2023-03-06 17:26:22,499][23882] Updated weights for policy 0, policy_version 35350 (0.0006) [2023-03-06 17:26:23,288][23882] Updated weights for policy 0, policy_version 35360 (0.0006) [2023-03-06 17:26:24,061][23882] Updated weights for policy 0, policy_version 35370 (0.0006) [2023-03-06 17:26:24,841][23882] Updated weights for policy 0, policy_version 35380 (0.0005) [2023-03-06 17:26:25,633][23882] Updated weights for policy 0, policy_version 35390 (0.0006) [2023-03-06 17:26:26,410][23882] Updated weights for policy 0, policy_version 35400 (0.0007) [2023-03-06 17:26:26,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13079.4). Total num frames: 36253696. Throughput: 0: 13056.8. Samples: 36251911. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:26:26,748][23556] Avg episode reward: [(0, '229.617')] [2023-03-06 17:26:27,189][23882] Updated weights for policy 0, policy_version 35410 (0.0006) [2023-03-06 17:26:27,995][23882] Updated weights for policy 0, policy_version 35420 (0.0006) [2023-03-06 17:26:28,779][23882] Updated weights for policy 0, policy_version 35430 (0.0006) [2023-03-06 17:26:29,582][23882] Updated weights for policy 0, policy_version 35440 (0.0006) [2023-03-06 17:26:30,360][23882] Updated weights for policy 0, policy_version 35450 (0.0005) [2023-03-06 17:26:31,141][23882] Updated weights for policy 0, policy_version 35460 (0.0006) [2023-03-06 17:26:31,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13076.0). Total num frames: 36318208. Throughput: 0: 13046.1. Samples: 36290750. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:26:31,748][23556] Avg episode reward: [(0, '319.945')] [2023-03-06 17:26:31,925][23882] Updated weights for policy 0, policy_version 35470 (0.0008) [2023-03-06 17:26:32,011][23831] KL-divergence is very high: 545.2661 [2023-03-06 17:26:32,713][23882] Updated weights for policy 0, policy_version 35480 (0.0006) [2023-03-06 17:26:33,475][23882] Updated weights for policy 0, policy_version 35490 (0.0006) [2023-03-06 17:26:34,268][23882] Updated weights for policy 0, policy_version 35500 (0.0006) [2023-03-06 17:26:35,051][23882] Updated weights for policy 0, policy_version 35510 (0.0006) [2023-03-06 17:26:35,836][23882] Updated weights for policy 0, policy_version 35520 (0.0006) [2023-03-06 17:26:36,615][23882] Updated weights for policy 0, policy_version 35530 (0.0006) [2023-03-06 17:26:36,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13076.0). Total num frames: 36383744. Throughput: 0: 13050.4. Samples: 36369182. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:26:36,748][23556] Avg episode reward: [(0, '278.560')] [2023-03-06 17:26:37,397][23882] Updated weights for policy 0, policy_version 35540 (0.0007) [2023-03-06 17:26:38,188][23882] Updated weights for policy 0, policy_version 35550 (0.0006) [2023-03-06 17:26:38,960][23882] Updated weights for policy 0, policy_version 35560 (0.0007) [2023-03-06 17:26:39,734][23882] Updated weights for policy 0, policy_version 35570 (0.0007) [2023-03-06 17:26:40,516][23882] Updated weights for policy 0, policy_version 35580 (0.0006) [2023-03-06 17:26:41,291][23882] Updated weights for policy 0, policy_version 35590 (0.0006) [2023-03-06 17:26:41,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13076.0). Total num frames: 36449280. Throughput: 0: 13057.7. Samples: 36448003. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:26:41,748][23556] Avg episode reward: [(0, '302.583')] [2023-03-06 17:26:42,062][23882] Updated weights for policy 0, policy_version 35600 (0.0008) [2023-03-06 17:26:42,852][23882] Updated weights for policy 0, policy_version 35610 (0.0006) [2023-03-06 17:26:43,626][23882] Updated weights for policy 0, policy_version 35620 (0.0006) [2023-03-06 17:26:44,410][23882] Updated weights for policy 0, policy_version 35630 (0.0007) [2023-03-06 17:26:45,196][23882] Updated weights for policy 0, policy_version 35640 (0.0006) [2023-03-06 17:26:45,969][23882] Updated weights for policy 0, policy_version 35650 (0.0006) [2023-03-06 17:26:46,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13076.0). Total num frames: 36514816. Throughput: 0: 13066.0. Samples: 36487369. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:26:46,748][23556] Avg episode reward: [(0, '163.986')] [2023-03-06 17:26:46,769][23882] Updated weights for policy 0, policy_version 35660 (0.0006) [2023-03-06 17:26:47,553][23882] Updated weights for policy 0, policy_version 35670 (0.0007) [2023-03-06 17:26:48,329][23882] Updated weights for policy 0, policy_version 35680 (0.0006) [2023-03-06 17:26:49,128][23882] Updated weights for policy 0, policy_version 35690 (0.0006) [2023-03-06 17:26:49,903][23882] Updated weights for policy 0, policy_version 35700 (0.0007) [2023-03-06 17:26:50,682][23882] Updated weights for policy 0, policy_version 35710 (0.0006) [2023-03-06 17:26:51,469][23882] Updated weights for policy 0, policy_version 35720 (0.0006) [2023-03-06 17:26:51,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13076.0). Total num frames: 36580352. Throughput: 0: 13061.5. Samples: 36565782. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:26:51,748][23556] Avg episode reward: [(0, '156.455')] [2023-03-06 17:26:52,258][23882] Updated weights for policy 0, policy_version 35730 (0.0006) [2023-03-06 17:26:53,044][23882] Updated weights for policy 0, policy_version 35740 (0.0007) [2023-03-06 17:26:53,837][23882] Updated weights for policy 0, policy_version 35750 (0.0007) [2023-03-06 17:26:54,631][23882] Updated weights for policy 0, policy_version 35760 (0.0006) [2023-03-06 17:26:55,419][23882] Updated weights for policy 0, policy_version 35770 (0.0006) [2023-03-06 17:26:56,195][23882] Updated weights for policy 0, policy_version 35780 (0.0007) [2023-03-06 17:26:56,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13072.5). Total num frames: 36644864. Throughput: 0: 13053.6. Samples: 36643764. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:26:56,748][23556] Avg episode reward: [(0, '227.144')] [2023-03-06 17:26:56,973][23882] Updated weights for policy 0, policy_version 35790 (0.0006) [2023-03-06 17:26:57,749][23882] Updated weights for policy 0, policy_version 35800 (0.0006) [2023-03-06 17:26:58,519][23882] Updated weights for policy 0, policy_version 35810 (0.0006) [2023-03-06 17:26:59,235][23831] KL-divergence is very high: 8027.7959 [2023-03-06 17:26:59,312][23882] Updated weights for policy 0, policy_version 35820 (0.0006) [2023-03-06 17:27:00,088][23882] Updated weights for policy 0, policy_version 35830 (0.0006) [2023-03-06 17:27:00,805][23831] KL-divergence is very high: 207.6448 [2023-03-06 17:27:00,880][23882] Updated weights for policy 0, policy_version 35840 (0.0006) [2023-03-06 17:27:01,664][23882] Updated weights for policy 0, policy_version 35850 (0.0006) [2023-03-06 17:27:01,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13072.5). Total num frames: 36710400. Throughput: 0: 13058.8. Samples: 36683141. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:27:01,748][23556] Avg episode reward: [(0, '130.524')] [2023-03-06 17:27:02,439][23882] Updated weights for policy 0, policy_version 35860 (0.0006) [2023-03-06 17:27:03,213][23882] Updated weights for policy 0, policy_version 35870 (0.0006) [2023-03-06 17:27:04,002][23882] Updated weights for policy 0, policy_version 35880 (0.0006) [2023-03-06 17:27:04,785][23882] Updated weights for policy 0, policy_version 35890 (0.0006) [2023-03-06 17:27:05,570][23882] Updated weights for policy 0, policy_version 35900 (0.0007) [2023-03-06 17:27:06,368][23882] Updated weights for policy 0, policy_version 35910 (0.0006) [2023-03-06 17:27:06,522][23831] KL-divergence is very high: 135.8261 [2023-03-06 17:27:06,748][23556] Fps is (10 sec: 13209.5, 60 sec: 13073.1, 300 sec: 13079.4). Total num frames: 36776960. Throughput: 0: 13072.7. Samples: 36761924. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:27:06,748][23556] Avg episode reward: [(0, '223.949')] [2023-03-06 17:27:07,153][23882] Updated weights for policy 0, policy_version 35920 (0.0007) [2023-03-06 17:27:07,530][23831] KL-divergence is very high: 111.9954 [2023-03-06 17:27:07,952][23882] Updated weights for policy 0, policy_version 35930 (0.0007) [2023-03-06 17:27:08,722][23882] Updated weights for policy 0, policy_version 35940 (0.0005) [2023-03-06 17:27:09,506][23882] Updated weights for policy 0, policy_version 35950 (0.0006) [2023-03-06 17:27:10,319][23882] Updated weights for policy 0, policy_version 35960 (0.0006) [2023-03-06 17:27:11,105][23882] Updated weights for policy 0, policy_version 35970 (0.0006) [2023-03-06 17:27:11,748][23556] Fps is (10 sec: 13107.0, 60 sec: 13056.0, 300 sec: 13076.0). Total num frames: 36841472. Throughput: 0: 13060.7. Samples: 36839644. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:27:11,749][23556] Avg episode reward: [(0, '231.326')] [2023-03-06 17:27:11,886][23882] Updated weights for policy 0, policy_version 35980 (0.0006) [2023-03-06 17:27:12,663][23882] Updated weights for policy 0, policy_version 35990 (0.0006) [2023-03-06 17:27:13,446][23882] Updated weights for policy 0, policy_version 36000 (0.0007) [2023-03-06 17:27:13,522][23831] KL-divergence is very high: 2023.7537 [2023-03-06 17:27:14,223][23882] Updated weights for policy 0, policy_version 36010 (0.0006) [2023-03-06 17:27:15,002][23882] Updated weights for policy 0, policy_version 36020 (0.0006) [2023-03-06 17:27:15,317][23831] KL-divergence is very high: 257.9850 [2023-03-06 17:27:15,779][23882] Updated weights for policy 0, policy_version 36030 (0.0006) [2023-03-06 17:27:16,557][23882] Updated weights for policy 0, policy_version 36040 (0.0007) [2023-03-06 17:27:16,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13073.1, 300 sec: 13076.0). Total num frames: 36907008. Throughput: 0: 13072.6. Samples: 36879017. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:27:16,749][23556] Avg episode reward: [(0, '161.768')] [2023-03-06 17:27:17,355][23882] Updated weights for policy 0, policy_version 36050 (0.0006) [2023-03-06 17:27:18,137][23882] Updated weights for policy 0, policy_version 36060 (0.0006) [2023-03-06 17:27:18,811][23831] KL-divergence is very high: 168.1504 [2023-03-06 17:27:18,901][23882] Updated weights for policy 0, policy_version 36070 (0.0006) [2023-03-06 17:27:19,514][23831] KL-divergence is very high: 487.4182 [2023-03-06 17:27:19,682][23882] Updated weights for policy 0, policy_version 36080 (0.0007) [2023-03-06 17:27:20,476][23831] KL-divergence is very high: 193546.0312 [2023-03-06 17:27:20,483][23882] Updated weights for policy 0, policy_version 36090 (0.0006) [2023-03-06 17:27:21,271][23882] Updated weights for policy 0, policy_version 36100 (0.0006) [2023-03-06 17:27:21,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.0, 300 sec: 13076.0). Total num frames: 36972544. Throughput: 0: 13078.3. Samples: 36957708. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:27:21,748][23556] Avg episode reward: [(0, '204.398')] [2023-03-06 17:27:22,034][23882] Updated weights for policy 0, policy_version 36110 (0.0006) [2023-03-06 17:27:22,183][23831] KL-divergence is very high: 9995.8467 [2023-03-06 17:27:22,657][23831] KL-divergence is very high: 750.3798 [2023-03-06 17:27:22,824][23882] Updated weights for policy 0, policy_version 36120 (0.0007) [2023-03-06 17:27:23,610][23882] Updated weights for policy 0, policy_version 36130 (0.0007) [2023-03-06 17:27:24,382][23882] Updated weights for policy 0, policy_version 36140 (0.0006) [2023-03-06 17:27:25,163][23882] Updated weights for policy 0, policy_version 36150 (0.0006) [2023-03-06 17:27:25,920][23882] Updated weights for policy 0, policy_version 36160 (0.0006) [2023-03-06 17:27:26,708][23882] Updated weights for policy 0, policy_version 36170 (0.0006) [2023-03-06 17:27:26,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13076.0). Total num frames: 37038080. Throughput: 0: 13074.0. Samples: 37036332. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:27:26,748][23556] Avg episode reward: [(0, '126.255')] [2023-03-06 17:27:27,515][23882] Updated weights for policy 0, policy_version 36180 (0.0007) [2023-03-06 17:27:28,285][23882] Updated weights for policy 0, policy_version 36190 (0.0006) [2023-03-06 17:27:29,080][23882] Updated weights for policy 0, policy_version 36200 (0.0006) [2023-03-06 17:27:29,869][23882] Updated weights for policy 0, policy_version 36210 (0.0006) [2023-03-06 17:27:30,647][23882] Updated weights for policy 0, policy_version 36220 (0.0006) [2023-03-06 17:27:31,449][23882] Updated weights for policy 0, policy_version 36230 (0.0005) [2023-03-06 17:27:31,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13073.1, 300 sec: 13072.5). Total num frames: 37102592. Throughput: 0: 13068.5. Samples: 37075453. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:27:31,748][23556] Avg episode reward: [(0, '175.411')] [2023-03-06 17:27:32,220][23882] Updated weights for policy 0, policy_version 36240 (0.0006) [2023-03-06 17:27:32,990][23882] Updated weights for policy 0, policy_version 36250 (0.0006) [2023-03-06 17:27:33,297][23831] KL-divergence is very high: 266.8248 [2023-03-06 17:27:33,789][23882] Updated weights for policy 0, policy_version 36260 (0.0007) [2023-03-06 17:27:34,552][23882] Updated weights for policy 0, policy_version 36270 (0.0006) [2023-03-06 17:27:35,317][23882] Updated weights for policy 0, policy_version 36280 (0.0007) [2023-03-06 17:27:36,110][23882] Updated weights for policy 0, policy_version 36290 (0.0006) [2023-03-06 17:27:36,187][23831] KL-divergence is very high: 91255.7344 [2023-03-06 17:27:36,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13090.1, 300 sec: 13079.4). Total num frames: 37169152. Throughput: 0: 13076.1. Samples: 37154204. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:27:36,748][23556] Avg episode reward: [(0, '170.764')] [2023-03-06 17:27:36,898][23882] Updated weights for policy 0, policy_version 36300 (0.0006) [2023-03-06 17:27:37,676][23882] Updated weights for policy 0, policy_version 36310 (0.0007) [2023-03-06 17:27:38,448][23882] Updated weights for policy 0, policy_version 36320 (0.0006) [2023-03-06 17:27:39,215][23882] Updated weights for policy 0, policy_version 36330 (0.0006) [2023-03-06 17:27:39,993][23882] Updated weights for policy 0, policy_version 36340 (0.0005) [2023-03-06 17:27:40,771][23882] Updated weights for policy 0, policy_version 36350 (0.0006) [2023-03-06 17:27:41,549][23882] Updated weights for policy 0, policy_version 36360 (0.0007) [2023-03-06 17:27:41,699][23831] KL-divergence is very high: 667.1198 [2023-03-06 17:27:41,748][23556] Fps is (10 sec: 13209.6, 60 sec: 13090.1, 300 sec: 13079.4). Total num frames: 37234688. Throughput: 0: 13095.8. Samples: 37233074. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:27:41,754][23556] Avg episode reward: [(0, '130.929')] [2023-03-06 17:27:41,853][23831] KL-divergence is very high: 591.2597 [2023-03-06 17:27:42,335][23882] Updated weights for policy 0, policy_version 36370 (0.0008) [2023-03-06 17:27:43,114][23882] Updated weights for policy 0, policy_version 36380 (0.0007) [2023-03-06 17:27:43,895][23882] Updated weights for policy 0, policy_version 36390 (0.0005) [2023-03-06 17:27:44,686][23882] Updated weights for policy 0, policy_version 36400 (0.0006) [2023-03-06 17:27:45,472][23882] Updated weights for policy 0, policy_version 36410 (0.0006) [2023-03-06 17:27:46,242][23882] Updated weights for policy 0, policy_version 36420 (0.0007) [2023-03-06 17:27:46,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13090.1, 300 sec: 13082.9). Total num frames: 37300224. Throughput: 0: 13094.2. Samples: 37272378. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 17:27:46,759][23556] Avg episode reward: [(0, '106.820')] [2023-03-06 17:27:47,029][23882] Updated weights for policy 0, policy_version 36430 (0.0007) [2023-03-06 17:27:47,800][23882] Updated weights for policy 0, policy_version 36440 (0.0007) [2023-03-06 17:27:48,596][23882] Updated weights for policy 0, policy_version 36450 (0.0009) [2023-03-06 17:27:49,380][23882] Updated weights for policy 0, policy_version 36460 (0.0007) [2023-03-06 17:27:50,180][23882] Updated weights for policy 0, policy_version 36470 (0.0006) [2023-03-06 17:27:50,950][23882] Updated weights for policy 0, policy_version 36480 (0.0006) [2023-03-06 17:27:51,737][23882] Updated weights for policy 0, policy_version 36490 (0.0006) [2023-03-06 17:27:51,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13090.1, 300 sec: 13079.4). Total num frames: 37365760. Throughput: 0: 13085.0. Samples: 37350752. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 17:27:51,760][23556] Avg episode reward: [(0, '73.408')] [2023-03-06 17:27:52,522][23882] Updated weights for policy 0, policy_version 36500 (0.0007) [2023-03-06 17:27:53,294][23882] Updated weights for policy 0, policy_version 36510 (0.0006) [2023-03-06 17:27:54,081][23882] Updated weights for policy 0, policy_version 36520 (0.0006) [2023-03-06 17:27:54,871][23882] Updated weights for policy 0, policy_version 36530 (0.0007) [2023-03-06 17:27:55,665][23882] Updated weights for policy 0, policy_version 36540 (0.0006) [2023-03-06 17:27:56,437][23882] Updated weights for policy 0, policy_version 36550 (0.0007) [2023-03-06 17:27:56,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13090.1, 300 sec: 13076.0). Total num frames: 37430272. Throughput: 0: 13100.1. Samples: 37429146. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 17:27:56,754][23556] Avg episode reward: [(0, '93.969')] [2023-03-06 17:27:56,758][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000036554_37431296.pth... [2023-03-06 17:27:56,789][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000033488_34291712.pth [2023-03-06 17:27:57,219][23882] Updated weights for policy 0, policy_version 36560 (0.0006) [2023-03-06 17:27:57,999][23882] Updated weights for policy 0, policy_version 36570 (0.0005) [2023-03-06 17:27:58,776][23882] Updated weights for policy 0, policy_version 36580 (0.0006) [2023-03-06 17:27:59,561][23882] Updated weights for policy 0, policy_version 36590 (0.0006) [2023-03-06 17:28:00,333][23882] Updated weights for policy 0, policy_version 36600 (0.0006) [2023-03-06 17:28:01,117][23882] Updated weights for policy 0, policy_version 36610 (0.0006) [2023-03-06 17:28:01,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13090.1, 300 sec: 13076.0). Total num frames: 37495808. Throughput: 0: 13098.6. Samples: 37468455. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 17:28:01,759][23556] Avg episode reward: [(0, '112.312')] [2023-03-06 17:28:01,906][23882] Updated weights for policy 0, policy_version 36620 (0.0006) [2023-03-06 17:28:02,694][23882] Updated weights for policy 0, policy_version 36630 (0.0007) [2023-03-06 17:28:03,485][23882] Updated weights for policy 0, policy_version 36640 (0.0006) [2023-03-06 17:28:04,255][23882] Updated weights for policy 0, policy_version 36650 (0.0007) [2023-03-06 17:28:05,041][23882] Updated weights for policy 0, policy_version 36660 (0.0007) [2023-03-06 17:28:05,811][23882] Updated weights for policy 0, policy_version 36670 (0.0006) [2023-03-06 17:28:06,594][23882] Updated weights for policy 0, policy_version 36680 (0.0007) [2023-03-06 17:28:06,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13076.0). Total num frames: 37561344. Throughput: 0: 13097.1. Samples: 37547077. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 17:28:06,759][23556] Avg episode reward: [(0, '88.763')] [2023-03-06 17:28:07,394][23882] Updated weights for policy 0, policy_version 36690 (0.0006) [2023-03-06 17:28:08,177][23882] Updated weights for policy 0, policy_version 36700 (0.0006) [2023-03-06 17:28:08,951][23882] Updated weights for policy 0, policy_version 36710 (0.0007) [2023-03-06 17:28:09,734][23882] Updated weights for policy 0, policy_version 36720 (0.0006) [2023-03-06 17:28:10,529][23882] Updated weights for policy 0, policy_version 36730 (0.0006) [2023-03-06 17:28:11,310][23882] Updated weights for policy 0, policy_version 36740 (0.0006) [2023-03-06 17:28:11,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13090.1, 300 sec: 13079.4). Total num frames: 37626880. Throughput: 0: 13090.3. Samples: 37625399. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:28:11,755][23556] Avg episode reward: [(0, '96.073')] [2023-03-06 17:28:12,094][23882] Updated weights for policy 0, policy_version 36750 (0.0006) [2023-03-06 17:28:12,863][23882] Updated weights for policy 0, policy_version 36760 (0.0006) [2023-03-06 17:28:13,641][23882] Updated weights for policy 0, policy_version 36770 (0.0006) [2023-03-06 17:28:14,419][23882] Updated weights for policy 0, policy_version 36780 (0.0006) [2023-03-06 17:28:15,209][23882] Updated weights for policy 0, policy_version 36790 (0.0006) [2023-03-06 17:28:15,979][23882] Updated weights for policy 0, policy_version 36800 (0.0006) [2023-03-06 17:28:16,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13090.1, 300 sec: 13079.4). Total num frames: 37692416. Throughput: 0: 13099.8. Samples: 37664944. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:28:16,759][23556] Avg episode reward: [(0, '90.382')] [2023-03-06 17:28:16,777][23882] Updated weights for policy 0, policy_version 36810 (0.0008) [2023-03-06 17:28:17,544][23882] Updated weights for policy 0, policy_version 36820 (0.0007) [2023-03-06 17:28:18,330][23882] Updated weights for policy 0, policy_version 36830 (0.0007) [2023-03-06 17:28:19,132][23882] Updated weights for policy 0, policy_version 36840 (0.0006) [2023-03-06 17:28:19,907][23882] Updated weights for policy 0, policy_version 36850 (0.0006) [2023-03-06 17:28:20,710][23882] Updated weights for policy 0, policy_version 36860 (0.0006) [2023-03-06 17:28:21,490][23882] Updated weights for policy 0, policy_version 36870 (0.0007) [2023-03-06 17:28:21,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13090.1, 300 sec: 13079.4). Total num frames: 37757952. Throughput: 0: 13089.1. Samples: 37743213. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:28:21,759][23556] Avg episode reward: [(0, '83.549')] [2023-03-06 17:28:22,274][23882] Updated weights for policy 0, policy_version 36880 (0.0006) [2023-03-06 17:28:23,056][23882] Updated weights for policy 0, policy_version 36890 (0.0006) [2023-03-06 17:28:23,840][23882] Updated weights for policy 0, policy_version 36900 (0.0006) [2023-03-06 17:28:24,614][23882] Updated weights for policy 0, policy_version 36910 (0.0007) [2023-03-06 17:28:25,395][23882] Updated weights for policy 0, policy_version 36920 (0.0006) [2023-03-06 17:28:26,189][23882] Updated weights for policy 0, policy_version 36930 (0.0006) [2023-03-06 17:28:26,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13090.1, 300 sec: 13079.4). Total num frames: 37823488. Throughput: 0: 13079.9. Samples: 37821670. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:28:26,754][23556] Avg episode reward: [(0, '83.156')] [2023-03-06 17:28:26,974][23882] Updated weights for policy 0, policy_version 36940 (0.0007) [2023-03-06 17:28:27,752][23882] Updated weights for policy 0, policy_version 36950 (0.0006) [2023-03-06 17:28:28,562][23882] Updated weights for policy 0, policy_version 36960 (0.0008) [2023-03-06 17:28:29,349][23882] Updated weights for policy 0, policy_version 36970 (0.0006) [2023-03-06 17:28:30,117][23882] Updated weights for policy 0, policy_version 36980 (0.0006) [2023-03-06 17:28:30,902][23882] Updated weights for policy 0, policy_version 36990 (0.0006) [2023-03-06 17:28:31,691][23882] Updated weights for policy 0, policy_version 37000 (0.0006) [2023-03-06 17:28:31,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13090.1, 300 sec: 13079.4). Total num frames: 37888000. Throughput: 0: 13070.0. Samples: 37860530. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:28:31,759][23556] Avg episode reward: [(0, '68.347')] [2023-03-06 17:28:32,482][23882] Updated weights for policy 0, policy_version 37010 (0.0007) [2023-03-06 17:28:33,261][23882] Updated weights for policy 0, policy_version 37020 (0.0006) [2023-03-06 17:28:34,033][23882] Updated weights for policy 0, policy_version 37030 (0.0008) [2023-03-06 17:28:34,810][23882] Updated weights for policy 0, policy_version 37040 (0.0007) [2023-03-06 17:28:35,602][23882] Updated weights for policy 0, policy_version 37050 (0.0006) [2023-03-06 17:28:36,386][23882] Updated weights for policy 0, policy_version 37060 (0.0006) [2023-03-06 17:28:36,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13073.0, 300 sec: 13079.4). Total num frames: 37953536. Throughput: 0: 13076.1. Samples: 37939178. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:28:36,759][23556] Avg episode reward: [(0, '55.253')] [2023-03-06 17:28:37,171][23882] Updated weights for policy 0, policy_version 37070 (0.0006) [2023-03-06 17:28:37,953][23882] Updated weights for policy 0, policy_version 37080 (0.0006) [2023-03-06 17:28:38,732][23882] Updated weights for policy 0, policy_version 37090 (0.0007) [2023-03-06 17:28:39,517][23882] Updated weights for policy 0, policy_version 37100 (0.0006) [2023-03-06 17:28:40,312][23882] Updated weights for policy 0, policy_version 37110 (0.0006) [2023-03-06 17:28:41,085][23882] Updated weights for policy 0, policy_version 37120 (0.0006) [2023-03-06 17:28:41,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13079.4). Total num frames: 38019072. Throughput: 0: 13074.8. Samples: 38017513. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:28:41,758][23556] Avg episode reward: [(0, '69.915')] [2023-03-06 17:28:41,855][23882] Updated weights for policy 0, policy_version 37130 (0.0005) [2023-03-06 17:28:42,113][23831] KL-divergence is very high: 208.7153 [2023-03-06 17:28:42,654][23882] Updated weights for policy 0, policy_version 37140 (0.0007) [2023-03-06 17:28:43,449][23882] Updated weights for policy 0, policy_version 37150 (0.0007) [2023-03-06 17:28:44,207][23882] Updated weights for policy 0, policy_version 37160 (0.0005) [2023-03-06 17:28:44,997][23882] Updated weights for policy 0, policy_version 37170 (0.0006) [2023-03-06 17:28:45,766][23882] Updated weights for policy 0, policy_version 37180 (0.0006) [2023-03-06 17:28:46,553][23882] Updated weights for policy 0, policy_version 37190 (0.0006) [2023-03-06 17:28:46,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13073.0, 300 sec: 13079.4). Total num frames: 38084608. Throughput: 0: 13075.1. Samples: 38056834. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:28:46,759][23556] Avg episode reward: [(0, '83.174')] [2023-03-06 17:28:47,354][23882] Updated weights for policy 0, policy_version 37200 (0.0006) [2023-03-06 17:28:48,150][23882] Updated weights for policy 0, policy_version 37210 (0.0006) [2023-03-06 17:28:48,918][23882] Updated weights for policy 0, policy_version 37220 (0.0006) [2023-03-06 17:28:49,693][23882] Updated weights for policy 0, policy_version 37230 (0.0006) [2023-03-06 17:28:50,490][23882] Updated weights for policy 0, policy_version 37240 (0.0006) [2023-03-06 17:28:51,270][23882] Updated weights for policy 0, policy_version 37250 (0.0006) [2023-03-06 17:28:51,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13079.4). Total num frames: 38150144. Throughput: 0: 13063.2. Samples: 38134918. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:28:51,759][23556] Avg episode reward: [(0, '89.609')] [2023-03-06 17:28:52,065][23882] Updated weights for policy 0, policy_version 37260 (0.0006) [2023-03-06 17:28:52,862][23882] Updated weights for policy 0, policy_version 37270 (0.0006) [2023-03-06 17:28:53,658][23882] Updated weights for policy 0, policy_version 37280 (0.0006) [2023-03-06 17:28:54,430][23882] Updated weights for policy 0, policy_version 37290 (0.0006) [2023-03-06 17:28:55,215][23882] Updated weights for policy 0, policy_version 37300 (0.0006) [2023-03-06 17:28:55,989][23882] Updated weights for policy 0, policy_version 37310 (0.0007) [2023-03-06 17:28:56,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13073.1, 300 sec: 13076.0). Total num frames: 38214656. Throughput: 0: 13066.2. Samples: 38213378. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:28:56,757][23556] Avg episode reward: [(0, '116.885')] [2023-03-06 17:28:56,770][23882] Updated weights for policy 0, policy_version 37320 (0.0006) [2023-03-06 17:28:57,566][23882] Updated weights for policy 0, policy_version 37330 (0.0006) [2023-03-06 17:28:58,345][23882] Updated weights for policy 0, policy_version 37340 (0.0007) [2023-03-06 17:28:59,143][23882] Updated weights for policy 0, policy_version 37350 (0.0007) [2023-03-06 17:28:59,915][23882] Updated weights for policy 0, policy_version 37360 (0.0005) [2023-03-06 17:29:00,719][23882] Updated weights for policy 0, policy_version 37370 (0.0007) [2023-03-06 17:29:01,506][23882] Updated weights for policy 0, policy_version 37380 (0.0007) [2023-03-06 17:29:01,748][23556] Fps is (10 sec: 12902.4, 60 sec: 13056.0, 300 sec: 13072.5). Total num frames: 38279168. Throughput: 0: 13052.5. Samples: 38252305. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:29:01,759][23556] Avg episode reward: [(0, '144.015')] [2023-03-06 17:29:02,283][23882] Updated weights for policy 0, policy_version 37390 (0.0006) [2023-03-06 17:29:03,074][23882] Updated weights for policy 0, policy_version 37400 (0.0006) [2023-03-06 17:29:03,845][23882] Updated weights for policy 0, policy_version 37410 (0.0006) [2023-03-06 17:29:04,616][23882] Updated weights for policy 0, policy_version 37420 (0.0007) [2023-03-06 17:29:05,393][23882] Updated weights for policy 0, policy_version 37430 (0.0006) [2023-03-06 17:29:06,013][23831] KL-divergence is very high: 137.3360 [2023-03-06 17:29:06,175][23882] Updated weights for policy 0, policy_version 37440 (0.0006) [2023-03-06 17:29:06,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.0, 300 sec: 13076.0). Total num frames: 38345728. Throughput: 0: 13056.9. Samples: 38330771. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:29:06,759][23556] Avg episode reward: [(0, '119.917')] [2023-03-06 17:29:06,944][23882] Updated weights for policy 0, policy_version 37450 (0.0007) [2023-03-06 17:29:07,736][23882] Updated weights for policy 0, policy_version 37460 (0.0006) [2023-03-06 17:29:08,526][23882] Updated weights for policy 0, policy_version 37470 (0.0006) [2023-03-06 17:29:09,293][23882] Updated weights for policy 0, policy_version 37480 (0.0006) [2023-03-06 17:29:10,076][23882] Updated weights for policy 0, policy_version 37490 (0.0006) [2023-03-06 17:29:10,865][23882] Updated weights for policy 0, policy_version 37500 (0.0006) [2023-03-06 17:29:11,640][23882] Updated weights for policy 0, policy_version 37510 (0.0006) [2023-03-06 17:29:11,748][23556] Fps is (10 sec: 13209.5, 60 sec: 13073.1, 300 sec: 13076.0). Total num frames: 38411264. Throughput: 0: 13064.3. Samples: 38409564. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:29:11,748][23556] Avg episode reward: [(0, '133.617')] [2023-03-06 17:29:12,413][23882] Updated weights for policy 0, policy_version 37520 (0.0007) [2023-03-06 17:29:13,208][23882] Updated weights for policy 0, policy_version 37530 (0.0006) [2023-03-06 17:29:13,974][23882] Updated weights for policy 0, policy_version 37540 (0.0006) [2023-03-06 17:29:14,771][23882] Updated weights for policy 0, policy_version 37550 (0.0007) [2023-03-06 17:29:15,549][23882] Updated weights for policy 0, policy_version 37560 (0.0006) [2023-03-06 17:29:16,321][23882] Updated weights for policy 0, policy_version 37570 (0.0006) [2023-03-06 17:29:16,748][23556] Fps is (10 sec: 13107.4, 60 sec: 13073.1, 300 sec: 13076.0). Total num frames: 38476800. Throughput: 0: 13074.7. Samples: 38448891. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:29:16,748][23556] Avg episode reward: [(0, '42.265')] [2023-03-06 17:29:17,097][23882] Updated weights for policy 0, policy_version 37580 (0.0006) [2023-03-06 17:29:17,727][23831] KL-divergence is very high: 153293.6406 [2023-03-06 17:29:17,893][23882] Updated weights for policy 0, policy_version 37590 (0.0006) [2023-03-06 17:29:18,673][23882] Updated weights for policy 0, policy_version 37600 (0.0007) [2023-03-06 17:29:19,471][23882] Updated weights for policy 0, policy_version 37610 (0.0007) [2023-03-06 17:29:20,247][23882] Updated weights for policy 0, policy_version 37620 (0.0007) [2023-03-06 17:29:21,041][23882] Updated weights for policy 0, policy_version 37630 (0.0006) [2023-03-06 17:29:21,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13076.0). Total num frames: 38542336. Throughput: 0: 13069.4. Samples: 38527298. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:29:21,748][23556] Avg episode reward: [(0, '52.847')] [2023-03-06 17:29:21,821][23882] Updated weights for policy 0, policy_version 37640 (0.0007) [2023-03-06 17:29:22,600][23882] Updated weights for policy 0, policy_version 37650 (0.0006) [2023-03-06 17:29:23,388][23882] Updated weights for policy 0, policy_version 37660 (0.0006) [2023-03-06 17:29:24,173][23882] Updated weights for policy 0, policy_version 37670 (0.0007) [2023-03-06 17:29:24,974][23882] Updated weights for policy 0, policy_version 37680 (0.0006) [2023-03-06 17:29:25,753][23882] Updated weights for policy 0, policy_version 37690 (0.0007) [2023-03-06 17:29:26,539][23882] Updated weights for policy 0, policy_version 37700 (0.0006) [2023-03-06 17:29:26,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13072.5). Total num frames: 38606848. Throughput: 0: 13065.5. Samples: 38605460. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:29:26,748][23556] Avg episode reward: [(0, '55.799')] [2023-03-06 17:29:27,345][23882] Updated weights for policy 0, policy_version 37710 (0.0006) [2023-03-06 17:29:28,110][23882] Updated weights for policy 0, policy_version 37720 (0.0006) [2023-03-06 17:29:28,895][23882] Updated weights for policy 0, policy_version 37730 (0.0007) [2023-03-06 17:29:29,682][23882] Updated weights for policy 0, policy_version 37740 (0.0006) [2023-03-06 17:29:30,452][23882] Updated weights for policy 0, policy_version 37750 (0.0007) [2023-03-06 17:29:31,245][23882] Updated weights for policy 0, policy_version 37760 (0.0006) [2023-03-06 17:29:31,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13073.1, 300 sec: 13072.5). Total num frames: 38672384. Throughput: 0: 13062.5. Samples: 38644647. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:29:31,748][23556] Avg episode reward: [(0, '85.351')] [2023-03-06 17:29:32,021][23882] Updated weights for policy 0, policy_version 37770 (0.0007) [2023-03-06 17:29:32,795][23882] Updated weights for policy 0, policy_version 37780 (0.0006) [2023-03-06 17:29:33,588][23882] Updated weights for policy 0, policy_version 37790 (0.0006) [2023-03-06 17:29:34,368][23882] Updated weights for policy 0, policy_version 37800 (0.0007) [2023-03-06 17:29:35,169][23882] Updated weights for policy 0, policy_version 37810 (0.0007) [2023-03-06 17:29:35,948][23882] Updated weights for policy 0, policy_version 37820 (0.0006) [2023-03-06 17:29:36,742][23882] Updated weights for policy 0, policy_version 37830 (0.0006) [2023-03-06 17:29:36,748][23556] Fps is (10 sec: 13107.0, 60 sec: 13073.1, 300 sec: 13072.5). Total num frames: 38737920. Throughput: 0: 13068.4. Samples: 38723000. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:29:36,748][23556] Avg episode reward: [(0, '97.083')] [2023-03-06 17:29:37,534][23882] Updated weights for policy 0, policy_version 37840 (0.0006) [2023-03-06 17:29:38,324][23882] Updated weights for policy 0, policy_version 37850 (0.0005) [2023-03-06 17:29:39,107][23882] Updated weights for policy 0, policy_version 37860 (0.0006) [2023-03-06 17:29:39,892][23882] Updated weights for policy 0, policy_version 37870 (0.0008) [2023-03-06 17:29:40,688][23882] Updated weights for policy 0, policy_version 37880 (0.0007) [2023-03-06 17:29:41,470][23882] Updated weights for policy 0, policy_version 37890 (0.0007) [2023-03-06 17:29:41,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13072.5). Total num frames: 38802432. Throughput: 0: 13056.5. Samples: 38800920. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:29:41,749][23556] Avg episode reward: [(0, '119.972')] [2023-03-06 17:29:42,246][23882] Updated weights for policy 0, policy_version 37900 (0.0006) [2023-03-06 17:29:43,049][23882] Updated weights for policy 0, policy_version 37910 (0.0006) [2023-03-06 17:29:43,836][23882] Updated weights for policy 0, policy_version 37920 (0.0006) [2023-03-06 17:29:44,619][23882] Updated weights for policy 0, policy_version 37930 (0.0007) [2023-03-06 17:29:45,419][23882] Updated weights for policy 0, policy_version 37940 (0.0007) [2023-03-06 17:29:46,197][23882] Updated weights for policy 0, policy_version 37950 (0.0006) [2023-03-06 17:29:46,748][23556] Fps is (10 sec: 12902.5, 60 sec: 13039.0, 300 sec: 13069.0). Total num frames: 38866944. Throughput: 0: 13057.3. Samples: 38839883. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:29:46,748][23556] Avg episode reward: [(0, '106.761')] [2023-03-06 17:29:46,966][23882] Updated weights for policy 0, policy_version 37960 (0.0006) [2023-03-06 17:29:47,757][23882] Updated weights for policy 0, policy_version 37970 (0.0006) [2023-03-06 17:29:48,530][23882] Updated weights for policy 0, policy_version 37980 (0.0007) [2023-03-06 17:29:49,317][23882] Updated weights for policy 0, policy_version 37990 (0.0006) [2023-03-06 17:29:50,098][23882] Updated weights for policy 0, policy_version 38000 (0.0006) [2023-03-06 17:29:50,880][23882] Updated weights for policy 0, policy_version 38010 (0.0006) [2023-03-06 17:29:51,665][23882] Updated weights for policy 0, policy_version 38020 (0.0007) [2023-03-06 17:29:51,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13069.0). Total num frames: 38932480. Throughput: 0: 13056.6. Samples: 38918316. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:29:51,748][23556] Avg episode reward: [(0, '81.079')] [2023-03-06 17:29:52,457][23882] Updated weights for policy 0, policy_version 38030 (0.0006) [2023-03-06 17:29:53,230][23882] Updated weights for policy 0, policy_version 38040 (0.0006) [2023-03-06 17:29:54,050][23882] Updated weights for policy 0, policy_version 38050 (0.0007) [2023-03-06 17:29:54,831][23882] Updated weights for policy 0, policy_version 38060 (0.0006) [2023-03-06 17:29:55,614][23882] Updated weights for policy 0, policy_version 38070 (0.0006) [2023-03-06 17:29:56,394][23882] Updated weights for policy 0, policy_version 38080 (0.0007) [2023-03-06 17:29:56,748][23556] Fps is (10 sec: 13107.0, 60 sec: 13056.0, 300 sec: 13069.0). Total num frames: 38998016. Throughput: 0: 13041.6. Samples: 38996438. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:29:56,749][23556] Avg episode reward: [(0, '94.452')] [2023-03-06 17:29:56,753][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000038084_38998016.pth... [2023-03-06 17:29:56,785][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000035022_35862528.pth [2023-03-06 17:29:57,176][23882] Updated weights for policy 0, policy_version 38090 (0.0007) [2023-03-06 17:29:57,954][23882] Updated weights for policy 0, policy_version 38100 (0.0006) [2023-03-06 17:29:58,752][23882] Updated weights for policy 0, policy_version 38110 (0.0007) [2023-03-06 17:29:59,524][23882] Updated weights for policy 0, policy_version 38120 (0.0006) [2023-03-06 17:30:00,312][23882] Updated weights for policy 0, policy_version 38130 (0.0007) [2023-03-06 17:30:01,093][23882] Updated weights for policy 0, policy_version 38140 (0.0007) [2023-03-06 17:30:01,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13073.0, 300 sec: 13069.0). Total num frames: 39063552. Throughput: 0: 13044.3. Samples: 39035888. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:30:01,749][23556] Avg episode reward: [(0, '75.228')] [2023-03-06 17:30:01,882][23882] Updated weights for policy 0, policy_version 38150 (0.0007) [2023-03-06 17:30:02,656][23882] Updated weights for policy 0, policy_version 38160 (0.0007) [2023-03-06 17:30:03,425][23882] Updated weights for policy 0, policy_version 38170 (0.0006) [2023-03-06 17:30:04,191][23882] Updated weights for policy 0, policy_version 38180 (0.0006) [2023-03-06 17:30:04,997][23882] Updated weights for policy 0, policy_version 38190 (0.0006) [2023-03-06 17:30:05,781][23882] Updated weights for policy 0, policy_version 38200 (0.0007) [2023-03-06 17:30:06,570][23882] Updated weights for policy 0, policy_version 38210 (0.0006) [2023-03-06 17:30:06,748][23556] Fps is (10 sec: 13107.4, 60 sec: 13056.0, 300 sec: 13069.0). Total num frames: 39129088. Throughput: 0: 13040.5. Samples: 39114123. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:30:06,748][23556] Avg episode reward: [(0, '79.679')] [2023-03-06 17:30:07,358][23882] Updated weights for policy 0, policy_version 38220 (0.0006) [2023-03-06 17:30:08,162][23882] Updated weights for policy 0, policy_version 38230 (0.0006) [2023-03-06 17:30:08,944][23882] Updated weights for policy 0, policy_version 38240 (0.0006) [2023-03-06 17:30:09,731][23882] Updated weights for policy 0, policy_version 38250 (0.0007) [2023-03-06 17:30:10,505][23882] Updated weights for policy 0, policy_version 38260 (0.0007) [2023-03-06 17:30:11,296][23882] Updated weights for policy 0, policy_version 38270 (0.0006) [2023-03-06 17:30:11,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13065.5). Total num frames: 39193600. Throughput: 0: 13038.2. Samples: 39192181. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:30:11,749][23556] Avg episode reward: [(0, '105.157')] [2023-03-06 17:30:12,085][23882] Updated weights for policy 0, policy_version 38280 (0.0007) [2023-03-06 17:30:12,873][23882] Updated weights for policy 0, policy_version 38290 (0.0006) [2023-03-06 17:30:13,630][23882] Updated weights for policy 0, policy_version 38300 (0.0006) [2023-03-06 17:30:14,425][23882] Updated weights for policy 0, policy_version 38310 (0.0006) [2023-03-06 17:30:15,216][23882] Updated weights for policy 0, policy_version 38320 (0.0006) [2023-03-06 17:30:16,001][23882] Updated weights for policy 0, policy_version 38330 (0.0006) [2023-03-06 17:30:16,748][23556] Fps is (10 sec: 13004.6, 60 sec: 13038.9, 300 sec: 13069.0). Total num frames: 39259136. Throughput: 0: 13041.2. Samples: 39231500. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:30:16,749][23556] Avg episode reward: [(0, '93.826')] [2023-03-06 17:30:16,776][23882] Updated weights for policy 0, policy_version 38340 (0.0006) [2023-03-06 17:30:17,565][23882] Updated weights for policy 0, policy_version 38350 (0.0007) [2023-03-06 17:30:18,330][23882] Updated weights for policy 0, policy_version 38360 (0.0006) [2023-03-06 17:30:19,136][23882] Updated weights for policy 0, policy_version 38370 (0.0006) [2023-03-06 17:30:19,906][23882] Updated weights for policy 0, policy_version 38380 (0.0006) [2023-03-06 17:30:20,694][23882] Updated weights for policy 0, policy_version 38390 (0.0006) [2023-03-06 17:30:21,474][23882] Updated weights for policy 0, policy_version 38400 (0.0006) [2023-03-06 17:30:21,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13038.9, 300 sec: 13069.0). Total num frames: 39324672. Throughput: 0: 13044.1. Samples: 39309985. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:30:21,748][23556] Avg episode reward: [(0, '132.565')] [2023-03-06 17:30:22,259][23882] Updated weights for policy 0, policy_version 38410 (0.0006) [2023-03-06 17:30:23,046][23882] Updated weights for policy 0, policy_version 38420 (0.0007) [2023-03-06 17:30:23,810][23882] Updated weights for policy 0, policy_version 38430 (0.0006) [2023-03-06 17:30:24,604][23882] Updated weights for policy 0, policy_version 38440 (0.0006) [2023-03-06 17:30:25,361][23882] Updated weights for policy 0, policy_version 38450 (0.0006) [2023-03-06 17:30:26,152][23882] Updated weights for policy 0, policy_version 38460 (0.0007) [2023-03-06 17:30:26,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13069.0). Total num frames: 39390208. Throughput: 0: 13059.8. Samples: 39388610. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:30:26,748][23556] Avg episode reward: [(0, '128.131')] [2023-03-06 17:30:26,942][23882] Updated weights for policy 0, policy_version 38470 (0.0007) [2023-03-06 17:30:27,721][23882] Updated weights for policy 0, policy_version 38480 (0.0006) [2023-03-06 17:30:28,505][23882] Updated weights for policy 0, policy_version 38490 (0.0006) [2023-03-06 17:30:29,303][23882] Updated weights for policy 0, policy_version 38500 (0.0007) [2023-03-06 17:30:30,083][23882] Updated weights for policy 0, policy_version 38510 (0.0006) [2023-03-06 17:30:30,867][23882] Updated weights for policy 0, policy_version 38520 (0.0006) [2023-03-06 17:30:31,651][23882] Updated weights for policy 0, policy_version 38530 (0.0006) [2023-03-06 17:30:31,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13069.0). Total num frames: 39455744. Throughput: 0: 13063.2. Samples: 39427728. Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) [2023-03-06 17:30:31,748][23556] Avg episode reward: [(0, '148.482')] [2023-03-06 17:30:32,425][23882] Updated weights for policy 0, policy_version 38540 (0.0006) [2023-03-06 17:30:33,192][23882] Updated weights for policy 0, policy_version 38550 (0.0007) [2023-03-06 17:30:33,977][23882] Updated weights for policy 0, policy_version 38560 (0.0007) [2023-03-06 17:30:34,776][23882] Updated weights for policy 0, policy_version 38570 (0.0006) [2023-03-06 17:30:35,557][23882] Updated weights for policy 0, policy_version 38580 (0.0006) [2023-03-06 17:30:36,330][23882] Updated weights for policy 0, policy_version 38590 (0.0006) [2023-03-06 17:30:36,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13069.0). Total num frames: 39521280. Throughput: 0: 13067.6. Samples: 39506358. Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) [2023-03-06 17:30:36,748][23556] Avg episode reward: [(0, '181.808')] [2023-03-06 17:30:37,119][23882] Updated weights for policy 0, policy_version 38600 (0.0007) [2023-03-06 17:30:37,906][23882] Updated weights for policy 0, policy_version 38610 (0.0007) [2023-03-06 17:30:38,686][23882] Updated weights for policy 0, policy_version 38620 (0.0006) [2023-03-06 17:30:39,481][23882] Updated weights for policy 0, policy_version 38630 (0.0006) [2023-03-06 17:30:40,280][23882] Updated weights for policy 0, policy_version 38640 (0.0006) [2023-03-06 17:30:41,073][23882] Updated weights for policy 0, policy_version 38650 (0.0006) [2023-03-06 17:30:41,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13065.6). Total num frames: 39585792. Throughput: 0: 13063.7. Samples: 39584302. Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) [2023-03-06 17:30:41,748][23556] Avg episode reward: [(0, '138.622')] [2023-03-06 17:30:41,846][23882] Updated weights for policy 0, policy_version 38660 (0.0006) [2023-03-06 17:30:42,623][23882] Updated weights for policy 0, policy_version 38670 (0.0007) [2023-03-06 17:30:43,423][23882] Updated weights for policy 0, policy_version 38680 (0.0007) [2023-03-06 17:30:44,199][23882] Updated weights for policy 0, policy_version 38690 (0.0006) [2023-03-06 17:30:44,984][23882] Updated weights for policy 0, policy_version 38700 (0.0007) [2023-03-06 17:30:45,770][23882] Updated weights for policy 0, policy_version 38710 (0.0007) [2023-03-06 17:30:46,559][23882] Updated weights for policy 0, policy_version 38720 (0.0007) [2023-03-06 17:30:46,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13073.0, 300 sec: 13065.5). Total num frames: 39651328. Throughput: 0: 13061.8. Samples: 39623670. Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) [2023-03-06 17:30:46,748][23556] Avg episode reward: [(0, '112.495')] [2023-03-06 17:30:47,358][23882] Updated weights for policy 0, policy_version 38730 (0.0006) [2023-03-06 17:30:48,118][23882] Updated weights for policy 0, policy_version 38740 (0.0006) [2023-03-06 17:30:48,907][23882] Updated weights for policy 0, policy_version 38750 (0.0007) [2023-03-06 17:30:49,722][23882] Updated weights for policy 0, policy_version 38760 (0.0006) [2023-03-06 17:30:50,500][23882] Updated weights for policy 0, policy_version 38770 (0.0007) [2023-03-06 17:30:51,306][23882] Updated weights for policy 0, policy_version 38780 (0.0007) [2023-03-06 17:30:51,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13062.1). Total num frames: 39715840. Throughput: 0: 13055.8. Samples: 39701634. Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) [2023-03-06 17:30:51,748][23556] Avg episode reward: [(0, '107.323')] [2023-03-06 17:30:52,079][23882] Updated weights for policy 0, policy_version 38790 (0.0007) [2023-03-06 17:30:52,863][23882] Updated weights for policy 0, policy_version 38800 (0.0007) [2023-03-06 17:30:53,668][23882] Updated weights for policy 0, policy_version 38810 (0.0006) [2023-03-06 17:30:54,448][23882] Updated weights for policy 0, policy_version 38820 (0.0006) [2023-03-06 17:30:55,225][23882] Updated weights for policy 0, policy_version 38830 (0.0006) [2023-03-06 17:30:56,007][23882] Updated weights for policy 0, policy_version 38840 (0.0005) [2023-03-06 17:30:56,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13065.5). Total num frames: 39781376. Throughput: 0: 13057.1. Samples: 39779752. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:30:56,748][23556] Avg episode reward: [(0, '125.242')] [2023-03-06 17:30:56,792][23882] Updated weights for policy 0, policy_version 38850 (0.0006) [2023-03-06 17:30:57,562][23882] Updated weights for policy 0, policy_version 38860 (0.0005) [2023-03-06 17:30:58,347][23882] Updated weights for policy 0, policy_version 38870 (0.0006) [2023-03-06 17:30:59,129][23882] Updated weights for policy 0, policy_version 38880 (0.0006) [2023-03-06 17:30:59,910][23882] Updated weights for policy 0, policy_version 38890 (0.0006) [2023-03-06 17:31:00,698][23882] Updated weights for policy 0, policy_version 38900 (0.0006) [2023-03-06 17:31:01,458][23882] Updated weights for policy 0, policy_version 38910 (0.0006) [2023-03-06 17:31:01,748][23556] Fps is (10 sec: 13107.0, 60 sec: 13056.0, 300 sec: 13065.5). Total num frames: 39846912. Throughput: 0: 13060.7. Samples: 39819230. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:31:01,748][23556] Avg episode reward: [(0, '180.788')] [2023-03-06 17:31:02,249][23882] Updated weights for policy 0, policy_version 38920 (0.0006) [2023-03-06 17:31:03,034][23882] Updated weights for policy 0, policy_version 38930 (0.0006) [2023-03-06 17:31:03,822][23882] Updated weights for policy 0, policy_version 38940 (0.0006) [2023-03-06 17:31:04,607][23882] Updated weights for policy 0, policy_version 38950 (0.0006) [2023-03-06 17:31:05,390][23882] Updated weights for policy 0, policy_version 38960 (0.0007) [2023-03-06 17:31:06,157][23882] Updated weights for policy 0, policy_version 38970 (0.0006) [2023-03-06 17:31:06,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13065.6). Total num frames: 39912448. Throughput: 0: 13059.2. Samples: 39897647. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:31:06,748][23556] Avg episode reward: [(0, '141.792')] [2023-03-06 17:31:06,949][23882] Updated weights for policy 0, policy_version 38980 (0.0006) [2023-03-06 17:31:07,729][23882] Updated weights for policy 0, policy_version 38990 (0.0007) [2023-03-06 17:31:08,519][23882] Updated weights for policy 0, policy_version 39000 (0.0007) [2023-03-06 17:31:09,312][23882] Updated weights for policy 0, policy_version 39010 (0.0007) [2023-03-06 17:31:10,085][23882] Updated weights for policy 0, policy_version 39020 (0.0006) [2023-03-06 17:31:10,870][23882] Updated weights for policy 0, policy_version 39030 (0.0006) [2023-03-06 17:31:11,663][23882] Updated weights for policy 0, policy_version 39040 (0.0007) [2023-03-06 17:31:11,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13069.0). Total num frames: 39977984. Throughput: 0: 13053.5. Samples: 39976016. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:31:11,748][23556] Avg episode reward: [(0, '113.863')] [2023-03-06 17:31:12,451][23882] Updated weights for policy 0, policy_version 39050 (0.0007) [2023-03-06 17:31:13,242][23882] Updated weights for policy 0, policy_version 39060 (0.0006) [2023-03-06 17:31:14,028][23882] Updated weights for policy 0, policy_version 39070 (0.0006) [2023-03-06 17:31:14,821][23882] Updated weights for policy 0, policy_version 39080 (0.0006) [2023-03-06 17:31:15,605][23882] Updated weights for policy 0, policy_version 39090 (0.0007) [2023-03-06 17:31:16,412][23882] Updated weights for policy 0, policy_version 39100 (0.0006) [2023-03-06 17:31:16,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13065.5). Total num frames: 40042496. Throughput: 0: 13048.7. Samples: 40014920. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:31:16,748][23556] Avg episode reward: [(0, '154.889')] [2023-03-06 17:31:17,191][23882] Updated weights for policy 0, policy_version 39110 (0.0006) [2023-03-06 17:31:17,999][23882] Updated weights for policy 0, policy_version 39120 (0.0006) [2023-03-06 17:31:18,795][23882] Updated weights for policy 0, policy_version 39130 (0.0006) [2023-03-06 17:31:19,578][23882] Updated weights for policy 0, policy_version 39140 (0.0006) [2023-03-06 17:31:20,376][23882] Updated weights for policy 0, policy_version 39150 (0.0006) [2023-03-06 17:31:21,170][23882] Updated weights for policy 0, policy_version 39160 (0.0006) [2023-03-06 17:31:21,748][23556] Fps is (10 sec: 12902.5, 60 sec: 13038.9, 300 sec: 13062.1). Total num frames: 40107008. Throughput: 0: 13023.9. Samples: 40092432. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:31:21,748][23556] Avg episode reward: [(0, '181.807')] [2023-03-06 17:31:21,961][23882] Updated weights for policy 0, policy_version 39170 (0.0007) [2023-03-06 17:31:22,739][23882] Updated weights for policy 0, policy_version 39180 (0.0006) [2023-03-06 17:31:23,537][23882] Updated weights for policy 0, policy_version 39190 (0.0006) [2023-03-06 17:31:24,324][23882] Updated weights for policy 0, policy_version 39200 (0.0009) [2023-03-06 17:31:25,086][23882] Updated weights for policy 0, policy_version 39210 (0.0006) [2023-03-06 17:31:25,877][23882] Updated weights for policy 0, policy_version 39220 (0.0007) [2023-03-06 17:31:26,675][23882] Updated weights for policy 0, policy_version 39230 (0.0006) [2023-03-06 17:31:26,748][23556] Fps is (10 sec: 12902.4, 60 sec: 13021.9, 300 sec: 13062.1). Total num frames: 40171520. Throughput: 0: 13026.0. Samples: 40170471. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:31:26,748][23556] Avg episode reward: [(0, '227.188')] [2023-03-06 17:31:27,457][23882] Updated weights for policy 0, policy_version 39240 (0.0007) [2023-03-06 17:31:28,240][23882] Updated weights for policy 0, policy_version 39250 (0.0007) [2023-03-06 17:31:29,046][23882] Updated weights for policy 0, policy_version 39260 (0.0007) [2023-03-06 17:31:29,832][23882] Updated weights for policy 0, policy_version 39270 (0.0007) [2023-03-06 17:31:30,620][23882] Updated weights for policy 0, policy_version 39280 (0.0007) [2023-03-06 17:31:31,395][23882] Updated weights for policy 0, policy_version 39290 (0.0007) [2023-03-06 17:31:31,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13062.1). Total num frames: 40237056. Throughput: 0: 13016.9. Samples: 40209428. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:31:31,748][23556] Avg episode reward: [(0, '194.021')] [2023-03-06 17:31:32,177][23882] Updated weights for policy 0, policy_version 39300 (0.0007) [2023-03-06 17:31:32,981][23882] Updated weights for policy 0, policy_version 39310 (0.0006) [2023-03-06 17:31:33,775][23882] Updated weights for policy 0, policy_version 39320 (0.0006) [2023-03-06 17:31:34,549][23882] Updated weights for policy 0, policy_version 39330 (0.0006) [2023-03-06 17:31:35,321][23882] Updated weights for policy 0, policy_version 39340 (0.0006) [2023-03-06 17:31:36,121][23882] Updated weights for policy 0, policy_version 39350 (0.0006) [2023-03-06 17:31:36,748][23556] Fps is (10 sec: 13107.0, 60 sec: 13021.9, 300 sec: 13062.1). Total num frames: 40302592. Throughput: 0: 13022.0. Samples: 40287628. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:31:36,748][23556] Avg episode reward: [(0, '197.863')] [2023-03-06 17:31:36,890][23882] Updated weights for policy 0, policy_version 39360 (0.0007) [2023-03-06 17:31:37,700][23882] Updated weights for policy 0, policy_version 39370 (0.0006) [2023-03-06 17:31:38,486][23882] Updated weights for policy 0, policy_version 39380 (0.0007) [2023-03-06 17:31:39,265][23882] Updated weights for policy 0, policy_version 39390 (0.0006) [2023-03-06 17:31:40,030][23882] Updated weights for policy 0, policy_version 39400 (0.0007) [2023-03-06 17:31:40,830][23882] Updated weights for policy 0, policy_version 39410 (0.0006) [2023-03-06 17:31:41,606][23882] Updated weights for policy 0, policy_version 39420 (0.0006) [2023-03-06 17:31:41,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13058.6). Total num frames: 40367104. Throughput: 0: 13026.7. Samples: 40365953. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:31:41,748][23556] Avg episode reward: [(0, '214.303')] [2023-03-06 17:31:42,398][23882] Updated weights for policy 0, policy_version 39430 (0.0006) [2023-03-06 17:31:43,173][23882] Updated weights for policy 0, policy_version 39440 (0.0006) [2023-03-06 17:31:43,950][23882] Updated weights for policy 0, policy_version 39450 (0.0006) [2023-03-06 17:31:44,727][23882] Updated weights for policy 0, policy_version 39460 (0.0005) [2023-03-06 17:31:45,525][23882] Updated weights for policy 0, policy_version 39470 (0.0008) [2023-03-06 17:31:46,304][23882] Updated weights for policy 0, policy_version 39480 (0.0007) [2023-03-06 17:31:46,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13058.6). Total num frames: 40432640. Throughput: 0: 13020.2. Samples: 40405139. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:31:46,748][23556] Avg episode reward: [(0, '209.037')] [2023-03-06 17:31:47,095][23882] Updated weights for policy 0, policy_version 39490 (0.0006) [2023-03-06 17:31:47,893][23882] Updated weights for policy 0, policy_version 39500 (0.0006) [2023-03-06 17:31:48,677][23882] Updated weights for policy 0, policy_version 39510 (0.0006) [2023-03-06 17:31:49,471][23882] Updated weights for policy 0, policy_version 39520 (0.0007) [2023-03-06 17:31:50,249][23882] Updated weights for policy 0, policy_version 39530 (0.0006) [2023-03-06 17:31:51,038][23882] Updated weights for policy 0, policy_version 39540 (0.0006) [2023-03-06 17:31:51,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13062.1). Total num frames: 40498176. Throughput: 0: 13014.1. Samples: 40483284. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:31:51,748][23556] Avg episode reward: [(0, '179.542')] [2023-03-06 17:31:51,817][23882] Updated weights for policy 0, policy_version 39550 (0.0006) [2023-03-06 17:31:52,593][23882] Updated weights for policy 0, policy_version 39560 (0.0007) [2023-03-06 17:31:53,385][23882] Updated weights for policy 0, policy_version 39570 (0.0005) [2023-03-06 17:31:54,158][23882] Updated weights for policy 0, policy_version 39580 (0.0007) [2023-03-06 17:31:54,950][23882] Updated weights for policy 0, policy_version 39590 (0.0006) [2023-03-06 17:31:55,718][23882] Updated weights for policy 0, policy_version 39600 (0.0006) [2023-03-06 17:31:56,489][23882] Updated weights for policy 0, policy_version 39610 (0.0006) [2023-03-06 17:31:56,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13062.1). Total num frames: 40563712. Throughput: 0: 13019.1. Samples: 40561877. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:31:56,748][23556] Avg episode reward: [(0, '188.672')] [2023-03-06 17:31:56,753][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000039613_40563712.pth... [2023-03-06 17:31:56,782][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000036554_37431296.pth [2023-03-06 17:31:57,274][23882] Updated weights for policy 0, policy_version 39620 (0.0006) [2023-03-06 17:31:58,063][23882] Updated weights for policy 0, policy_version 39630 (0.0006) [2023-03-06 17:31:58,844][23882] Updated weights for policy 0, policy_version 39640 (0.0006) [2023-03-06 17:31:59,630][23882] Updated weights for policy 0, policy_version 39650 (0.0007) [2023-03-06 17:32:00,413][23882] Updated weights for policy 0, policy_version 39660 (0.0007) [2023-03-06 17:32:01,185][23882] Updated weights for policy 0, policy_version 39670 (0.0006) [2023-03-06 17:32:01,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13039.0, 300 sec: 13058.6). Total num frames: 40629248. Throughput: 0: 13027.1. Samples: 40601137. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:32:01,748][23556] Avg episode reward: [(0, '119.156')] [2023-03-06 17:32:01,970][23882] Updated weights for policy 0, policy_version 39680 (0.0006) [2023-03-06 17:32:02,747][23882] Updated weights for policy 0, policy_version 39690 (0.0006) [2023-03-06 17:32:03,526][23882] Updated weights for policy 0, policy_version 39700 (0.0006) [2023-03-06 17:32:04,328][23882] Updated weights for policy 0, policy_version 39710 (0.0007) [2023-03-06 17:32:05,101][23882] Updated weights for policy 0, policy_version 39720 (0.0006) [2023-03-06 17:32:05,875][23882] Updated weights for policy 0, policy_version 39730 (0.0006) [2023-03-06 17:32:06,678][23882] Updated weights for policy 0, policy_version 39740 (0.0006) [2023-03-06 17:32:06,748][23556] Fps is (10 sec: 13005.0, 60 sec: 13021.9, 300 sec: 13058.6). Total num frames: 40693760. Throughput: 0: 13048.8. Samples: 40679630. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:32:06,748][23556] Avg episode reward: [(0, '186.088')] [2023-03-06 17:32:07,456][23882] Updated weights for policy 0, policy_version 39750 (0.0005) [2023-03-06 17:32:08,235][23882] Updated weights for policy 0, policy_version 39760 (0.0006) [2023-03-06 17:32:09,025][23882] Updated weights for policy 0, policy_version 39770 (0.0006) [2023-03-06 17:32:09,797][23882] Updated weights for policy 0, policy_version 39780 (0.0007) [2023-03-06 17:32:10,589][23882] Updated weights for policy 0, policy_version 39790 (0.0006) [2023-03-06 17:32:11,384][23882] Updated weights for policy 0, policy_version 39800 (0.0006) [2023-03-06 17:32:11,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13058.6). Total num frames: 40759296. Throughput: 0: 13056.4. Samples: 40758010. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:32:11,748][23556] Avg episode reward: [(0, '245.567')] [2023-03-06 17:32:12,166][23882] Updated weights for policy 0, policy_version 39810 (0.0008) [2023-03-06 17:32:12,948][23882] Updated weights for policy 0, policy_version 39820 (0.0007) [2023-03-06 17:32:13,731][23882] Updated weights for policy 0, policy_version 39830 (0.0006) [2023-03-06 17:32:14,513][23882] Updated weights for policy 0, policy_version 39840 (0.0006) [2023-03-06 17:32:15,307][23882] Updated weights for policy 0, policy_version 39850 (0.0006) [2023-03-06 17:32:16,090][23882] Updated weights for policy 0, policy_version 39860 (0.0006) [2023-03-06 17:32:16,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13058.6). Total num frames: 40824832. Throughput: 0: 13059.6. Samples: 40797110. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:32:16,748][23556] Avg episode reward: [(0, '229.604')] [2023-03-06 17:32:16,869][23882] Updated weights for policy 0, policy_version 39870 (0.0006) [2023-03-06 17:32:17,673][23882] Updated weights for policy 0, policy_version 39880 (0.0006) [2023-03-06 17:32:18,450][23882] Updated weights for policy 0, policy_version 39890 (0.0007) [2023-03-06 17:32:19,267][23882] Updated weights for policy 0, policy_version 39900 (0.0006) [2023-03-06 17:32:20,046][23882] Updated weights for policy 0, policy_version 39910 (0.0006) [2023-03-06 17:32:20,817][23882] Updated weights for policy 0, policy_version 39920 (0.0007) [2023-03-06 17:32:21,614][23882] Updated weights for policy 0, policy_version 39930 (0.0006) [2023-03-06 17:32:21,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13055.1). Total num frames: 40889344. Throughput: 0: 13055.2. Samples: 40875110. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:32:21,748][23556] Avg episode reward: [(0, '219.342')] [2023-03-06 17:32:22,393][23882] Updated weights for policy 0, policy_version 39940 (0.0006) [2023-03-06 17:32:23,181][23882] Updated weights for policy 0, policy_version 39950 (0.0006) [2023-03-06 17:32:23,973][23882] Updated weights for policy 0, policy_version 39960 (0.0006) [2023-03-06 17:32:24,758][23882] Updated weights for policy 0, policy_version 39970 (0.0006) [2023-03-06 17:32:25,540][23882] Updated weights for policy 0, policy_version 39980 (0.0006) [2023-03-06 17:32:26,339][23882] Updated weights for policy 0, policy_version 39990 (0.0006) [2023-03-06 17:32:26,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 40954880. Throughput: 0: 13046.1. Samples: 40953031. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:32:26,748][23556] Avg episode reward: [(0, '337.772')] [2023-03-06 17:32:27,133][23882] Updated weights for policy 0, policy_version 40000 (0.0007) [2023-03-06 17:32:27,918][23882] Updated weights for policy 0, policy_version 40010 (0.0006) [2023-03-06 17:32:28,692][23882] Updated weights for policy 0, policy_version 40020 (0.0006) [2023-03-06 17:32:29,485][23882] Updated weights for policy 0, policy_version 40030 (0.0006) [2023-03-06 17:32:30,274][23882] Updated weights for policy 0, policy_version 40040 (0.0007) [2023-03-06 17:32:31,054][23882] Updated weights for policy 0, policy_version 40050 (0.0005) [2023-03-06 17:32:31,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13055.1). Total num frames: 41020416. Throughput: 0: 13043.7. Samples: 40992106. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:32:31,748][23556] Avg episode reward: [(0, '278.937')] [2023-03-06 17:32:31,833][23882] Updated weights for policy 0, policy_version 40060 (0.0006) [2023-03-06 17:32:32,618][23882] Updated weights for policy 0, policy_version 40070 (0.0006) [2023-03-06 17:32:33,401][23882] Updated weights for policy 0, policy_version 40080 (0.0008) [2023-03-06 17:32:34,180][23882] Updated weights for policy 0, policy_version 40090 (0.0007) [2023-03-06 17:32:34,972][23882] Updated weights for policy 0, policy_version 40100 (0.0007) [2023-03-06 17:32:35,765][23882] Updated weights for policy 0, policy_version 40110 (0.0006) [2023-03-06 17:32:36,550][23882] Updated weights for policy 0, policy_version 40120 (0.0006) [2023-03-06 17:32:36,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13039.0, 300 sec: 13051.7). Total num frames: 41084928. Throughput: 0: 13050.6. Samples: 41070559. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:32:36,748][23556] Avg episode reward: [(0, '196.025')] [2023-03-06 17:32:37,357][23882] Updated weights for policy 0, policy_version 40130 (0.0007) [2023-03-06 17:32:38,130][23882] Updated weights for policy 0, policy_version 40140 (0.0007) [2023-03-06 17:32:38,917][23882] Updated weights for policy 0, policy_version 40150 (0.0006) [2023-03-06 17:32:39,706][23882] Updated weights for policy 0, policy_version 40160 (0.0007) [2023-03-06 17:32:40,476][23882] Updated weights for policy 0, policy_version 40170 (0.0006) [2023-03-06 17:32:41,283][23882] Updated weights for policy 0, policy_version 40180 (0.0006) [2023-03-06 17:32:41,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 41150464. Throughput: 0: 13033.6. Samples: 41148386. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:32:41,748][23556] Avg episode reward: [(0, '111.837')] [2023-03-06 17:32:42,049][23882] Updated weights for policy 0, policy_version 40190 (0.0006) [2023-03-06 17:32:42,846][23882] Updated weights for policy 0, policy_version 40200 (0.0007) [2023-03-06 17:32:43,623][23882] Updated weights for policy 0, policy_version 40210 (0.0006) [2023-03-06 17:32:44,412][23882] Updated weights for policy 0, policy_version 40220 (0.0007) [2023-03-06 17:32:45,195][23882] Updated weights for policy 0, policy_version 40230 (0.0007) [2023-03-06 17:32:45,978][23882] Updated weights for policy 0, policy_version 40240 (0.0006) [2023-03-06 17:32:46,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13048.2). Total num frames: 41214976. Throughput: 0: 13032.5. Samples: 41187599. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:32:46,748][23556] Avg episode reward: [(0, '179.360')] [2023-03-06 17:32:46,756][23882] Updated weights for policy 0, policy_version 40250 (0.0006) [2023-03-06 17:32:47,530][23882] Updated weights for policy 0, policy_version 40260 (0.0007) [2023-03-06 17:32:48,320][23882] Updated weights for policy 0, policy_version 40270 (0.0006) [2023-03-06 17:32:49,103][23882] Updated weights for policy 0, policy_version 40280 (0.0006) [2023-03-06 17:32:49,901][23882] Updated weights for policy 0, policy_version 40290 (0.0006) [2023-03-06 17:32:50,671][23882] Updated weights for policy 0, policy_version 40300 (0.0006) [2023-03-06 17:32:51,453][23882] Updated weights for policy 0, policy_version 40310 (0.0006) [2023-03-06 17:32:51,748][23556] Fps is (10 sec: 13004.6, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 41280512. Throughput: 0: 13031.8. Samples: 41266064. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:32:51,748][23556] Avg episode reward: [(0, '164.603')] [2023-03-06 17:32:52,242][23882] Updated weights for policy 0, policy_version 40320 (0.0006) [2023-03-06 17:32:53,053][23882] Updated weights for policy 0, policy_version 40330 (0.0007) [2023-03-06 17:32:53,846][23882] Updated weights for policy 0, policy_version 40340 (0.0006) [2023-03-06 17:32:54,617][23882] Updated weights for policy 0, policy_version 40350 (0.0007) [2023-03-06 17:32:55,406][23882] Updated weights for policy 0, policy_version 40360 (0.0006) [2023-03-06 17:32:56,191][23882] Updated weights for policy 0, policy_version 40370 (0.0006) [2023-03-06 17:32:56,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 41346048. Throughput: 0: 13025.9. Samples: 41344175. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:32:56,748][23556] Avg episode reward: [(0, '129.879')] [2023-03-06 17:32:56,975][23882] Updated weights for policy 0, policy_version 40380 (0.0006) [2023-03-06 17:32:57,769][23882] Updated weights for policy 0, policy_version 40390 (0.0006) [2023-03-06 17:32:58,563][23882] Updated weights for policy 0, policy_version 40400 (0.0006) [2023-03-06 17:32:59,343][23882] Updated weights for policy 0, policy_version 40410 (0.0005) [2023-03-06 17:33:00,118][23882] Updated weights for policy 0, policy_version 40420 (0.0007) [2023-03-06 17:33:00,909][23882] Updated weights for policy 0, policy_version 40430 (0.0008) [2023-03-06 17:33:01,704][23882] Updated weights for policy 0, policy_version 40440 (0.0006) [2023-03-06 17:33:01,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13048.2). Total num frames: 41410560. Throughput: 0: 13022.3. Samples: 41383111. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:33:01,748][23556] Avg episode reward: [(0, '205.957')] [2023-03-06 17:33:02,480][23882] Updated weights for policy 0, policy_version 40450 (0.0006) [2023-03-06 17:33:03,258][23882] Updated weights for policy 0, policy_version 40460 (0.0006) [2023-03-06 17:33:04,038][23882] Updated weights for policy 0, policy_version 40470 (0.0007) [2023-03-06 17:33:04,817][23882] Updated weights for policy 0, policy_version 40480 (0.0006) [2023-03-06 17:33:05,617][23882] Updated weights for policy 0, policy_version 40490 (0.0006) [2023-03-06 17:33:06,403][23882] Updated weights for policy 0, policy_version 40500 (0.0005) [2023-03-06 17:33:06,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13048.2). Total num frames: 41476096. Throughput: 0: 13027.8. Samples: 41461362. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 17:33:06,748][23556] Avg episode reward: [(0, '218.930')] [2023-03-06 17:33:07,204][23882] Updated weights for policy 0, policy_version 40510 (0.0007) [2023-03-06 17:33:07,993][23882] Updated weights for policy 0, policy_version 40520 (0.0006) [2023-03-06 17:33:08,779][23882] Updated weights for policy 0, policy_version 40530 (0.0007) [2023-03-06 17:33:09,576][23882] Updated weights for policy 0, policy_version 40540 (0.0005) [2023-03-06 17:33:10,370][23882] Updated weights for policy 0, policy_version 40550 (0.0007) [2023-03-06 17:33:11,148][23882] Updated weights for policy 0, policy_version 40560 (0.0007) [2023-03-06 17:33:11,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13044.7). Total num frames: 41540608. Throughput: 0: 13022.7. Samples: 41539051. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 17:33:11,748][23556] Avg episode reward: [(0, '272.767')] [2023-03-06 17:33:11,930][23882] Updated weights for policy 0, policy_version 40570 (0.0006) [2023-03-06 17:33:12,741][23882] Updated weights for policy 0, policy_version 40580 (0.0006) [2023-03-06 17:33:13,524][23882] Updated weights for policy 0, policy_version 40590 (0.0007) [2023-03-06 17:33:14,288][23882] Updated weights for policy 0, policy_version 40600 (0.0006) [2023-03-06 17:33:15,068][23882] Updated weights for policy 0, policy_version 40610 (0.0007) [2023-03-06 17:33:15,853][23882] Updated weights for policy 0, policy_version 40620 (0.0006) [2023-03-06 17:33:16,640][23882] Updated weights for policy 0, policy_version 40630 (0.0006) [2023-03-06 17:33:16,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13044.7). Total num frames: 41606144. Throughput: 0: 13023.1. Samples: 41578147. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 17:33:16,748][23556] Avg episode reward: [(0, '278.988')] [2023-03-06 17:33:17,421][23882] Updated weights for policy 0, policy_version 40640 (0.0007) [2023-03-06 17:33:18,230][23882] Updated weights for policy 0, policy_version 40650 (0.0006) [2023-03-06 17:33:18,989][23882] Updated weights for policy 0, policy_version 40660 (0.0007) [2023-03-06 17:33:19,761][23882] Updated weights for policy 0, policy_version 40670 (0.0006) [2023-03-06 17:33:20,573][23882] Updated weights for policy 0, policy_version 40680 (0.0006) [2023-03-06 17:33:21,358][23882] Updated weights for policy 0, policy_version 40690 (0.0007) [2023-03-06 17:33:21,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13041.2). Total num frames: 41670656. Throughput: 0: 13022.1. Samples: 41656554. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 17:33:21,748][23556] Avg episode reward: [(0, '255.021')] [2023-03-06 17:33:22,130][23882] Updated weights for policy 0, policy_version 40700 (0.0005) [2023-03-06 17:33:22,929][23882] Updated weights for policy 0, policy_version 40710 (0.0007) [2023-03-06 17:33:23,714][23882] Updated weights for policy 0, policy_version 40720 (0.0006) [2023-03-06 17:33:24,515][23882] Updated weights for policy 0, policy_version 40730 (0.0007) [2023-03-06 17:33:25,304][23882] Updated weights for policy 0, policy_version 40740 (0.0007) [2023-03-06 17:33:26,089][23882] Updated weights for policy 0, policy_version 40750 (0.0006) [2023-03-06 17:33:26,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13044.7). Total num frames: 41736192. Throughput: 0: 13026.6. Samples: 41734586. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 17:33:26,748][23556] Avg episode reward: [(0, '281.547')] [2023-03-06 17:33:26,871][23882] Updated weights for policy 0, policy_version 40760 (0.0006) [2023-03-06 17:33:27,668][23882] Updated weights for policy 0, policy_version 40770 (0.0006) [2023-03-06 17:33:28,438][23882] Updated weights for policy 0, policy_version 40780 (0.0006) [2023-03-06 17:33:29,238][23882] Updated weights for policy 0, policy_version 40790 (0.0006) [2023-03-06 17:33:30,024][23882] Updated weights for policy 0, policy_version 40800 (0.0006) [2023-03-06 17:33:30,787][23882] Updated weights for policy 0, policy_version 40810 (0.0006) [2023-03-06 17:33:31,582][23882] Updated weights for policy 0, policy_version 40820 (0.0007) [2023-03-06 17:33:31,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13021.9, 300 sec: 13044.7). Total num frames: 41801728. Throughput: 0: 13024.2. Samples: 41773688. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:33:31,748][23556] Avg episode reward: [(0, '229.943')] [2023-03-06 17:33:32,358][23882] Updated weights for policy 0, policy_version 40830 (0.0006) [2023-03-06 17:33:33,145][23882] Updated weights for policy 0, policy_version 40840 (0.0006) [2023-03-06 17:33:33,962][23882] Updated weights for policy 0, policy_version 40850 (0.0006) [2023-03-06 17:33:34,727][23882] Updated weights for policy 0, policy_version 40860 (0.0007) [2023-03-06 17:33:35,526][23882] Updated weights for policy 0, policy_version 40870 (0.0006) [2023-03-06 17:33:36,319][23882] Updated weights for policy 0, policy_version 40880 (0.0006) [2023-03-06 17:33:36,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13021.8, 300 sec: 13041.2). Total num frames: 41866240. Throughput: 0: 13012.7. Samples: 41851634. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:33:36,748][23556] Avg episode reward: [(0, '421.549')] [2023-03-06 17:33:37,096][23882] Updated weights for policy 0, policy_version 40890 (0.0006) [2023-03-06 17:33:37,872][23882] Updated weights for policy 0, policy_version 40900 (0.0006) [2023-03-06 17:33:38,669][23882] Updated weights for policy 0, policy_version 40910 (0.0006) [2023-03-06 17:33:39,443][23882] Updated weights for policy 0, policy_version 40920 (0.0006) [2023-03-06 17:33:40,250][23882] Updated weights for policy 0, policy_version 40930 (0.0006) [2023-03-06 17:33:41,041][23882] Updated weights for policy 0, policy_version 40940 (0.0006) [2023-03-06 17:33:41,748][23556] Fps is (10 sec: 12902.3, 60 sec: 13004.8, 300 sec: 13037.8). Total num frames: 41930752. Throughput: 0: 13013.9. Samples: 41929800. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:33:41,748][23556] Avg episode reward: [(0, '335.713')] [2023-03-06 17:33:41,818][23882] Updated weights for policy 0, policy_version 40950 (0.0007) [2023-03-06 17:33:42,600][23882] Updated weights for policy 0, policy_version 40960 (0.0006) [2023-03-06 17:33:43,385][23882] Updated weights for policy 0, policy_version 40970 (0.0006) [2023-03-06 17:33:44,188][23882] Updated weights for policy 0, policy_version 40980 (0.0007) [2023-03-06 17:33:44,970][23882] Updated weights for policy 0, policy_version 40990 (0.0006) [2023-03-06 17:33:45,754][23882] Updated weights for policy 0, policy_version 41000 (0.0008) [2023-03-06 17:33:46,545][23882] Updated weights for policy 0, policy_version 41010 (0.0006) [2023-03-06 17:33:46,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13037.8). Total num frames: 41996288. Throughput: 0: 13012.3. Samples: 41968667. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:33:46,748][23556] Avg episode reward: [(0, '394.293')] [2023-03-06 17:33:47,311][23882] Updated weights for policy 0, policy_version 41020 (0.0006) [2023-03-06 17:33:48,095][23882] Updated weights for policy 0, policy_version 41030 (0.0006) [2023-03-06 17:33:48,886][23882] Updated weights for policy 0, policy_version 41040 (0.0006) [2023-03-06 17:33:49,679][23882] Updated weights for policy 0, policy_version 41050 (0.0007) [2023-03-06 17:33:50,456][23882] Updated weights for policy 0, policy_version 41060 (0.0006) [2023-03-06 17:33:51,246][23882] Updated weights for policy 0, policy_version 41070 (0.0007) [2023-03-06 17:33:51,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13021.9, 300 sec: 13041.2). Total num frames: 42061824. Throughput: 0: 13015.8. Samples: 42047073. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:33:51,748][23556] Avg episode reward: [(0, '334.748')] [2023-03-06 17:33:52,022][23882] Updated weights for policy 0, policy_version 41080 (0.0006) [2023-03-06 17:33:52,816][23882] Updated weights for policy 0, policy_version 41090 (0.0006) [2023-03-06 17:33:53,622][23882] Updated weights for policy 0, policy_version 41100 (0.0006) [2023-03-06 17:33:54,402][23882] Updated weights for policy 0, policy_version 41110 (0.0006) [2023-03-06 17:33:55,189][23882] Updated weights for policy 0, policy_version 41120 (0.0007) [2023-03-06 17:33:55,982][23882] Updated weights for policy 0, policy_version 41130 (0.0006) [2023-03-06 17:33:56,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13004.8, 300 sec: 13041.2). Total num frames: 42126336. Throughput: 0: 13023.8. Samples: 42125124. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:33:56,754][23556] Avg episode reward: [(0, '362.984')] [2023-03-06 17:33:56,769][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000041140_42127360.pth... [2023-03-06 17:33:56,769][23882] Updated weights for policy 0, policy_version 41140 (0.0007) [2023-03-06 17:33:56,798][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000038084_38998016.pth [2023-03-06 17:33:57,540][23882] Updated weights for policy 0, policy_version 41150 (0.0006) [2023-03-06 17:33:58,340][23882] Updated weights for policy 0, policy_version 41160 (0.0006) [2023-03-06 17:33:59,119][23882] Updated weights for policy 0, policy_version 41170 (0.0006) [2023-03-06 17:33:59,907][23882] Updated weights for policy 0, policy_version 41180 (0.0006) [2023-03-06 17:34:00,689][23882] Updated weights for policy 0, policy_version 41190 (0.0006) [2023-03-06 17:34:01,476][23882] Updated weights for policy 0, policy_version 41200 (0.0007) [2023-03-06 17:34:01,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13037.8). Total num frames: 42191872. Throughput: 0: 13024.7. Samples: 42164257. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:34:01,748][23556] Avg episode reward: [(0, '335.725')] [2023-03-06 17:34:02,241][23882] Updated weights for policy 0, policy_version 41210 (0.0005) [2023-03-06 17:34:03,041][23882] Updated weights for policy 0, policy_version 41220 (0.0006) [2023-03-06 17:34:03,806][23882] Updated weights for policy 0, policy_version 41230 (0.0006) [2023-03-06 17:34:04,611][23882] Updated weights for policy 0, policy_version 41240 (0.0006) [2023-03-06 17:34:05,394][23882] Updated weights for policy 0, policy_version 41250 (0.0006) [2023-03-06 17:34:06,185][23882] Updated weights for policy 0, policy_version 41260 (0.0006) [2023-03-06 17:34:06,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13021.9, 300 sec: 13037.8). Total num frames: 42257408. Throughput: 0: 13023.3. Samples: 42242604. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:34:06,748][23556] Avg episode reward: [(0, '336.895')] [2023-03-06 17:34:06,983][23882] Updated weights for policy 0, policy_version 41270 (0.0007) [2023-03-06 17:34:07,741][23882] Updated weights for policy 0, policy_version 41280 (0.0006) [2023-03-06 17:34:08,520][23882] Updated weights for policy 0, policy_version 41290 (0.0008) [2023-03-06 17:34:09,321][23882] Updated weights for policy 0, policy_version 41300 (0.0005) [2023-03-06 17:34:10,101][23882] Updated weights for policy 0, policy_version 41310 (0.0006) [2023-03-06 17:34:10,892][23882] Updated weights for policy 0, policy_version 41320 (0.0007) [2023-03-06 17:34:11,669][23882] Updated weights for policy 0, policy_version 41330 (0.0006) [2023-03-06 17:34:11,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 42322944. Throughput: 0: 13030.7. Samples: 42320967. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:34:11,748][23556] Avg episode reward: [(0, '322.793')] [2023-03-06 17:34:12,445][23882] Updated weights for policy 0, policy_version 41340 (0.0007) [2023-03-06 17:34:13,238][23882] Updated weights for policy 0, policy_version 41350 (0.0006) [2023-03-06 17:34:14,017][23882] Updated weights for policy 0, policy_version 41360 (0.0007) [2023-03-06 17:34:14,801][23882] Updated weights for policy 0, policy_version 41370 (0.0007) [2023-03-06 17:34:15,597][23882] Updated weights for policy 0, policy_version 41380 (0.0007) [2023-03-06 17:34:16,372][23882] Updated weights for policy 0, policy_version 41390 (0.0006) [2023-03-06 17:34:16,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 42388480. Throughput: 0: 13031.5. Samples: 42360103. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:34:16,748][23556] Avg episode reward: [(0, '312.245')] [2023-03-06 17:34:17,156][23882] Updated weights for policy 0, policy_version 41400 (0.0006) [2023-03-06 17:34:17,941][23882] Updated weights for policy 0, policy_version 41410 (0.0007) [2023-03-06 17:34:18,735][23882] Updated weights for policy 0, policy_version 41420 (0.0007) [2023-03-06 17:34:19,526][23882] Updated weights for policy 0, policy_version 41430 (0.0006) [2023-03-06 17:34:20,298][23882] Updated weights for policy 0, policy_version 41440 (0.0006) [2023-03-06 17:34:21,093][23882] Updated weights for policy 0, policy_version 41450 (0.0007) [2023-03-06 17:34:21,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 42452992. Throughput: 0: 13037.9. Samples: 42438340. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:34:21,748][23556] Avg episode reward: [(0, '350.418')] [2023-03-06 17:34:21,878][23882] Updated weights for policy 0, policy_version 41460 (0.0007) [2023-03-06 17:34:22,669][23882] Updated weights for policy 0, policy_version 41470 (0.0007) [2023-03-06 17:34:23,455][23882] Updated weights for policy 0, policy_version 41480 (0.0006) [2023-03-06 17:34:24,235][23882] Updated weights for policy 0, policy_version 41490 (0.0006) [2023-03-06 17:34:25,016][23882] Updated weights for policy 0, policy_version 41500 (0.0007) [2023-03-06 17:34:25,803][23882] Updated weights for policy 0, policy_version 41510 (0.0006) [2023-03-06 17:34:26,579][23882] Updated weights for policy 0, policy_version 41520 (0.0006) [2023-03-06 17:34:26,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 42518528. Throughput: 0: 13037.6. Samples: 42516490. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:34:26,748][23556] Avg episode reward: [(0, '375.182')] [2023-03-06 17:34:27,373][23882] Updated weights for policy 0, policy_version 41530 (0.0007) [2023-03-06 17:34:28,147][23882] Updated weights for policy 0, policy_version 41540 (0.0006) [2023-03-06 17:34:28,952][23882] Updated weights for policy 0, policy_version 41550 (0.0006) [2023-03-06 17:34:29,750][23882] Updated weights for policy 0, policy_version 41560 (0.0007) [2023-03-06 17:34:30,515][23882] Updated weights for policy 0, policy_version 41570 (0.0006) [2023-03-06 17:34:31,314][23882] Updated weights for policy 0, policy_version 41580 (0.0007) [2023-03-06 17:34:31,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 42583040. Throughput: 0: 13038.2. Samples: 42555387. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:34:31,748][23556] Avg episode reward: [(0, '390.577')] [2023-03-06 17:34:32,094][23882] Updated weights for policy 0, policy_version 41590 (0.0007) [2023-03-06 17:34:32,884][23882] Updated weights for policy 0, policy_version 41600 (0.0007) [2023-03-06 17:34:33,687][23882] Updated weights for policy 0, policy_version 41610 (0.0006) [2023-03-06 17:34:34,456][23882] Updated weights for policy 0, policy_version 41620 (0.0007) [2023-03-06 17:34:35,230][23882] Updated weights for policy 0, policy_version 41630 (0.0007) [2023-03-06 17:34:36,003][23882] Updated weights for policy 0, policy_version 41640 (0.0006) [2023-03-06 17:34:36,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13039.0, 300 sec: 13037.8). Total num frames: 42648576. Throughput: 0: 13039.7. Samples: 42633861. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:34:36,748][23556] Avg episode reward: [(0, '368.548')] [2023-03-06 17:34:36,788][23882] Updated weights for policy 0, policy_version 41650 (0.0006) [2023-03-06 17:34:37,588][23882] Updated weights for policy 0, policy_version 41660 (0.0006) [2023-03-06 17:34:38,393][23882] Updated weights for policy 0, policy_version 41670 (0.0006) [2023-03-06 17:34:39,174][23882] Updated weights for policy 0, policy_version 41680 (0.0007) [2023-03-06 17:34:39,969][23882] Updated weights for policy 0, policy_version 41690 (0.0006) [2023-03-06 17:34:40,766][23882] Updated weights for policy 0, policy_version 41700 (0.0007) [2023-03-06 17:34:41,542][23882] Updated weights for policy 0, policy_version 41710 (0.0006) [2023-03-06 17:34:41,748][23556] Fps is (10 sec: 13005.0, 60 sec: 13039.0, 300 sec: 13037.8). Total num frames: 42713088. Throughput: 0: 13037.8. Samples: 42711824. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:34:41,748][23556] Avg episode reward: [(0, '332.634')] [2023-03-06 17:34:42,308][23882] Updated weights for policy 0, policy_version 41720 (0.0006) [2023-03-06 17:34:43,103][23882] Updated weights for policy 0, policy_version 41730 (0.0006) [2023-03-06 17:34:43,877][23882] Updated weights for policy 0, policy_version 41740 (0.0006) [2023-03-06 17:34:44,646][23882] Updated weights for policy 0, policy_version 41750 (0.0006) [2023-03-06 17:34:45,444][23882] Updated weights for policy 0, policy_version 41760 (0.0007) [2023-03-06 17:34:46,250][23882] Updated weights for policy 0, policy_version 41770 (0.0006) [2023-03-06 17:34:46,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 42778624. Throughput: 0: 13041.9. Samples: 42751141. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:34:46,748][23556] Avg episode reward: [(0, '389.429')] [2023-03-06 17:34:47,018][23882] Updated weights for policy 0, policy_version 41780 (0.0006) [2023-03-06 17:34:47,822][23882] Updated weights for policy 0, policy_version 41790 (0.0006) [2023-03-06 17:34:48,609][23882] Updated weights for policy 0, policy_version 41800 (0.0007) [2023-03-06 17:34:49,399][23882] Updated weights for policy 0, policy_version 41810 (0.0006) [2023-03-06 17:34:50,174][23882] Updated weights for policy 0, policy_version 41820 (0.0006) [2023-03-06 17:34:50,953][23882] Updated weights for policy 0, policy_version 41830 (0.0007) [2023-03-06 17:34:51,745][23882] Updated weights for policy 0, policy_version 41840 (0.0005) [2023-03-06 17:34:51,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 42844160. Throughput: 0: 13032.6. Samples: 42829073. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:34:51,748][23556] Avg episode reward: [(0, '385.767')] [2023-03-06 17:34:52,527][23882] Updated weights for policy 0, policy_version 41850 (0.0006) [2023-03-06 17:34:53,311][23882] Updated weights for policy 0, policy_version 41860 (0.0006) [2023-03-06 17:34:54,088][23882] Updated weights for policy 0, policy_version 41870 (0.0006) [2023-03-06 17:34:54,868][23882] Updated weights for policy 0, policy_version 41880 (0.0006) [2023-03-06 17:34:55,659][23882] Updated weights for policy 0, policy_version 41890 (0.0006) [2023-03-06 17:34:56,425][23882] Updated weights for policy 0, policy_version 41900 (0.0006) [2023-03-06 17:34:56,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 42909696. Throughput: 0: 13036.0. Samples: 42907585. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:34:56,748][23556] Avg episode reward: [(0, '440.336')] [2023-03-06 17:34:57,229][23882] Updated weights for policy 0, policy_version 41910 (0.0006) [2023-03-06 17:34:58,016][23882] Updated weights for policy 0, policy_version 41920 (0.0006) [2023-03-06 17:34:58,802][23882] Updated weights for policy 0, policy_version 41930 (0.0006) [2023-03-06 17:34:59,570][23882] Updated weights for policy 0, policy_version 41940 (0.0005) [2023-03-06 17:35:00,355][23882] Updated weights for policy 0, policy_version 41950 (0.0007) [2023-03-06 17:35:01,138][23882] Updated weights for policy 0, policy_version 41960 (0.0006) [2023-03-06 17:35:01,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 42974208. Throughput: 0: 13037.3. Samples: 42946784. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:35:01,749][23556] Avg episode reward: [(0, '474.913')] [2023-03-06 17:35:01,922][23882] Updated weights for policy 0, policy_version 41970 (0.0006) [2023-03-06 17:35:02,692][23882] Updated weights for policy 0, policy_version 41980 (0.0006) [2023-03-06 17:35:03,482][23882] Updated weights for policy 0, policy_version 41990 (0.0006) [2023-03-06 17:35:04,270][23882] Updated weights for policy 0, policy_version 42000 (0.0006) [2023-03-06 17:35:05,059][23882] Updated weights for policy 0, policy_version 42010 (0.0006) [2023-03-06 17:35:05,847][23882] Updated weights for policy 0, policy_version 42020 (0.0006) [2023-03-06 17:35:06,629][23882] Updated weights for policy 0, policy_version 42030 (0.0006) [2023-03-06 17:35:06,748][23556] Fps is (10 sec: 13004.5, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 43039744. Throughput: 0: 13037.3. Samples: 43025019. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:35:06,749][23556] Avg episode reward: [(0, '424.146')] [2023-03-06 17:35:07,428][23882] Updated weights for policy 0, policy_version 42040 (0.0005) [2023-03-06 17:35:08,214][23882] Updated weights for policy 0, policy_version 42050 (0.0007) [2023-03-06 17:35:08,986][23882] Updated weights for policy 0, policy_version 42060 (0.0006) [2023-03-06 17:35:09,771][23882] Updated weights for policy 0, policy_version 42070 (0.0007) [2023-03-06 17:35:10,542][23882] Updated weights for policy 0, policy_version 42080 (0.0006) [2023-03-06 17:35:11,341][23882] Updated weights for policy 0, policy_version 42090 (0.0007) [2023-03-06 17:35:11,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 43105280. Throughput: 0: 13044.5. Samples: 43103495. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:35:11,749][23556] Avg episode reward: [(0, '451.815')] [2023-03-06 17:35:12,123][23882] Updated weights for policy 0, policy_version 42100 (0.0006) [2023-03-06 17:35:12,906][23882] Updated weights for policy 0, policy_version 42110 (0.0006) [2023-03-06 17:35:13,712][23882] Updated weights for policy 0, policy_version 42120 (0.0006) [2023-03-06 17:35:14,481][23882] Updated weights for policy 0, policy_version 42130 (0.0006) [2023-03-06 17:35:15,253][23882] Updated weights for policy 0, policy_version 42140 (0.0006) [2023-03-06 17:35:16,041][23882] Updated weights for policy 0, policy_version 42150 (0.0006) [2023-03-06 17:35:16,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 43170816. Throughput: 0: 13048.8. Samples: 43142581. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:35:16,759][23556] Avg episode reward: [(0, '454.444')] [2023-03-06 17:35:16,811][23882] Updated weights for policy 0, policy_version 42160 (0.0006) [2023-03-06 17:35:17,593][23882] Updated weights for policy 0, policy_version 42170 (0.0006) [2023-03-06 17:35:18,399][23882] Updated weights for policy 0, policy_version 42180 (0.0006) [2023-03-06 17:35:19,185][23882] Updated weights for policy 0, policy_version 42190 (0.0006) [2023-03-06 17:35:19,977][23882] Updated weights for policy 0, policy_version 42200 (0.0006) [2023-03-06 17:35:20,782][23882] Updated weights for policy 0, policy_version 42210 (0.0006) [2023-03-06 17:35:21,574][23882] Updated weights for policy 0, policy_version 42220 (0.0006) [2023-03-06 17:35:21,748][23556] Fps is (10 sec: 13005.1, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 43235328. Throughput: 0: 13041.0. Samples: 43220708. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:35:21,759][23556] Avg episode reward: [(0, '561.239')] [2023-03-06 17:35:22,359][23882] Updated weights for policy 0, policy_version 42230 (0.0006) [2023-03-06 17:35:23,149][23882] Updated weights for policy 0, policy_version 42240 (0.0006) [2023-03-06 17:35:23,931][23882] Updated weights for policy 0, policy_version 42250 (0.0006) [2023-03-06 17:35:24,722][23882] Updated weights for policy 0, policy_version 42260 (0.0006) [2023-03-06 17:35:25,507][23882] Updated weights for policy 0, policy_version 42270 (0.0007) [2023-03-06 17:35:26,291][23882] Updated weights for policy 0, policy_version 42280 (0.0005) [2023-03-06 17:35:26,748][23556] Fps is (10 sec: 12902.3, 60 sec: 13021.8, 300 sec: 13030.8). Total num frames: 43299840. Throughput: 0: 13041.7. Samples: 43298701. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:35:26,758][23556] Avg episode reward: [(0, '482.244')] [2023-03-06 17:35:27,071][23882] Updated weights for policy 0, policy_version 42290 (0.0006) [2023-03-06 17:35:27,860][23882] Updated weights for policy 0, policy_version 42300 (0.0006) [2023-03-06 17:35:28,631][23882] Updated weights for policy 0, policy_version 42310 (0.0006) [2023-03-06 17:35:29,427][23882] Updated weights for policy 0, policy_version 42320 (0.0007) [2023-03-06 17:35:30,203][23882] Updated weights for policy 0, policy_version 42330 (0.0007) [2023-03-06 17:35:30,987][23882] Updated weights for policy 0, policy_version 42340 (0.0006) [2023-03-06 17:35:31,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13030.8). Total num frames: 43365376. Throughput: 0: 13037.9. Samples: 43337845. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:35:31,759][23556] Avg episode reward: [(0, '501.299')] [2023-03-06 17:35:31,779][23882] Updated weights for policy 0, policy_version 42350 (0.0007) [2023-03-06 17:35:32,564][23882] Updated weights for policy 0, policy_version 42360 (0.0006) [2023-03-06 17:35:33,342][23882] Updated weights for policy 0, policy_version 42370 (0.0006) [2023-03-06 17:35:34,141][23882] Updated weights for policy 0, policy_version 42380 (0.0007) [2023-03-06 17:35:34,914][23882] Updated weights for policy 0, policy_version 42390 (0.0006) [2023-03-06 17:35:35,702][23882] Updated weights for policy 0, policy_version 42400 (0.0006) [2023-03-06 17:35:36,477][23882] Updated weights for policy 0, policy_version 42410 (0.0006) [2023-03-06 17:35:36,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13021.8, 300 sec: 13030.8). Total num frames: 43429888. Throughput: 0: 13044.7. Samples: 43416083. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:35:36,759][23556] Avg episode reward: [(0, '491.610')] [2023-03-06 17:35:37,278][23882] Updated weights for policy 0, policy_version 42420 (0.0006) [2023-03-06 17:35:38,072][23882] Updated weights for policy 0, policy_version 42430 (0.0006) [2023-03-06 17:35:38,852][23882] Updated weights for policy 0, policy_version 42440 (0.0006) [2023-03-06 17:35:39,641][23882] Updated weights for policy 0, policy_version 42450 (0.0006) [2023-03-06 17:35:40,433][23882] Updated weights for policy 0, policy_version 42460 (0.0006) [2023-03-06 17:35:41,216][23882] Updated weights for policy 0, policy_version 42470 (0.0006) [2023-03-06 17:35:41,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13030.8). Total num frames: 43495424. Throughput: 0: 13035.0. Samples: 43494163. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:35:41,748][23556] Avg episode reward: [(0, '511.351')] [2023-03-06 17:35:41,998][23882] Updated weights for policy 0, policy_version 42480 (0.0006) [2023-03-06 17:35:42,785][23882] Updated weights for policy 0, policy_version 42490 (0.0007) [2023-03-06 17:35:43,582][23882] Updated weights for policy 0, policy_version 42500 (0.0006) [2023-03-06 17:35:44,372][23882] Updated weights for policy 0, policy_version 42510 (0.0007) [2023-03-06 17:35:45,153][23882] Updated weights for policy 0, policy_version 42520 (0.0007) [2023-03-06 17:35:45,943][23882] Updated weights for policy 0, policy_version 42530 (0.0006) [2023-03-06 17:35:46,737][23882] Updated weights for policy 0, policy_version 42540 (0.0005) [2023-03-06 17:35:46,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 43560960. Throughput: 0: 13025.2. Samples: 43532918. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:35:46,748][23556] Avg episode reward: [(0, '539.515')] [2023-03-06 17:35:47,506][23882] Updated weights for policy 0, policy_version 42550 (0.0007) [2023-03-06 17:35:48,299][23882] Updated weights for policy 0, policy_version 42560 (0.0006) [2023-03-06 17:35:49,107][23882] Updated weights for policy 0, policy_version 42570 (0.0006) [2023-03-06 17:35:49,889][23882] Updated weights for policy 0, policy_version 42580 (0.0005) [2023-03-06 17:35:50,667][23882] Updated weights for policy 0, policy_version 42590 (0.0006) [2023-03-06 17:35:51,466][23882] Updated weights for policy 0, policy_version 42600 (0.0007) [2023-03-06 17:35:51,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13030.8). Total num frames: 43625472. Throughput: 0: 13022.6. Samples: 43611036. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:35:51,748][23556] Avg episode reward: [(0, '590.608')] [2023-03-06 17:35:52,249][23882] Updated weights for policy 0, policy_version 42610 (0.0007) [2023-03-06 17:35:53,052][23882] Updated weights for policy 0, policy_version 42620 (0.0006) [2023-03-06 17:35:53,830][23882] Updated weights for policy 0, policy_version 42630 (0.0006) [2023-03-06 17:35:54,644][23882] Updated weights for policy 0, policy_version 42640 (0.0007) [2023-03-06 17:35:55,405][23882] Updated weights for policy 0, policy_version 42650 (0.0006) [2023-03-06 17:35:56,197][23882] Updated weights for policy 0, policy_version 42660 (0.0007) [2023-03-06 17:35:56,748][23556] Fps is (10 sec: 12902.4, 60 sec: 13004.8, 300 sec: 13027.4). Total num frames: 43689984. Throughput: 0: 13008.7. Samples: 43688886. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:35:56,748][23556] Avg episode reward: [(0, '675.845')] [2023-03-06 17:35:56,764][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000042667_43691008.pth... [2023-03-06 17:35:56,795][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000039613_40563712.pth [2023-03-06 17:35:57,006][23882] Updated weights for policy 0, policy_version 42670 (0.0006) [2023-03-06 17:35:57,787][23882] Updated weights for policy 0, policy_version 42680 (0.0006) [2023-03-06 17:35:58,570][23882] Updated weights for policy 0, policy_version 42690 (0.0007) [2023-03-06 17:35:59,354][23882] Updated weights for policy 0, policy_version 42700 (0.0006) [2023-03-06 17:36:00,139][23882] Updated weights for policy 0, policy_version 42710 (0.0006) [2023-03-06 17:36:00,942][23882] Updated weights for policy 0, policy_version 42720 (0.0006) [2023-03-06 17:36:01,716][23882] Updated weights for policy 0, policy_version 42730 (0.0006) [2023-03-06 17:36:01,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13027.4). Total num frames: 43755520. Throughput: 0: 13007.3. Samples: 43727911. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:36:01,749][23556] Avg episode reward: [(0, '563.738')] [2023-03-06 17:36:02,502][23882] Updated weights for policy 0, policy_version 42740 (0.0006) [2023-03-06 17:36:03,288][23882] Updated weights for policy 0, policy_version 42750 (0.0006) [2023-03-06 17:36:04,070][23882] Updated weights for policy 0, policy_version 42760 (0.0006) [2023-03-06 17:36:04,878][23882] Updated weights for policy 0, policy_version 42770 (0.0007) [2023-03-06 17:36:05,655][23882] Updated weights for policy 0, policy_version 42780 (0.0006) [2023-03-06 17:36:06,444][23882] Updated weights for policy 0, policy_version 42790 (0.0007) [2023-03-06 17:36:06,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13004.8, 300 sec: 13023.9). Total num frames: 43820032. Throughput: 0: 12997.6. Samples: 43805600. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-06 17:36:06,748][23556] Avg episode reward: [(0, '628.252')] [2023-03-06 17:36:07,221][23882] Updated weights for policy 0, policy_version 42800 (0.0006) [2023-03-06 17:36:08,019][23882] Updated weights for policy 0, policy_version 42810 (0.0006) [2023-03-06 17:36:08,820][23882] Updated weights for policy 0, policy_version 42820 (0.0007) [2023-03-06 17:36:09,595][23882] Updated weights for policy 0, policy_version 42830 (0.0006) [2023-03-06 17:36:10,384][23882] Updated weights for policy 0, policy_version 42840 (0.0007) [2023-03-06 17:36:11,164][23882] Updated weights for policy 0, policy_version 42850 (0.0007) [2023-03-06 17:36:11,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13004.8, 300 sec: 13027.4). Total num frames: 43885568. Throughput: 0: 13003.7. Samples: 43883868. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-06 17:36:11,748][23556] Avg episode reward: [(0, '684.487')] [2023-03-06 17:36:11,938][23882] Updated weights for policy 0, policy_version 42860 (0.0006) [2023-03-06 17:36:12,726][23882] Updated weights for policy 0, policy_version 42870 (0.0007) [2023-03-06 17:36:13,532][23882] Updated weights for policy 0, policy_version 42880 (0.0006) [2023-03-06 17:36:14,308][23882] Updated weights for policy 0, policy_version 42890 (0.0007) [2023-03-06 17:36:15,111][23882] Updated weights for policy 0, policy_version 42900 (0.0007) [2023-03-06 17:36:15,879][23882] Updated weights for policy 0, policy_version 42910 (0.0006) [2023-03-06 17:36:16,678][23882] Updated weights for policy 0, policy_version 42920 (0.0006) [2023-03-06 17:36:16,748][23556] Fps is (10 sec: 13004.8, 60 sec: 12987.7, 300 sec: 13027.4). Total num frames: 43950080. Throughput: 0: 13000.0. Samples: 43922847. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-06 17:36:16,749][23556] Avg episode reward: [(0, '600.661')] [2023-03-06 17:36:17,481][23882] Updated weights for policy 0, policy_version 42930 (0.0006) [2023-03-06 17:36:18,264][23882] Updated weights for policy 0, policy_version 42940 (0.0007) [2023-03-06 17:36:19,046][23882] Updated weights for policy 0, policy_version 42950 (0.0006) [2023-03-06 17:36:19,842][23882] Updated weights for policy 0, policy_version 42960 (0.0006) [2023-03-06 17:36:20,630][23882] Updated weights for policy 0, policy_version 42970 (0.0007) [2023-03-06 17:36:21,416][23882] Updated weights for policy 0, policy_version 42980 (0.0006) [2023-03-06 17:36:21,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13030.8). Total num frames: 44015616. Throughput: 0: 12993.4. Samples: 44000785. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-06 17:36:21,749][23556] Avg episode reward: [(0, '669.181')] [2023-03-06 17:36:22,197][23882] Updated weights for policy 0, policy_version 42990 (0.0006) [2023-03-06 17:36:22,975][23882] Updated weights for policy 0, policy_version 43000 (0.0006) [2023-03-06 17:36:23,760][23882] Updated weights for policy 0, policy_version 43010 (0.0007) [2023-03-06 17:36:24,548][23882] Updated weights for policy 0, policy_version 43020 (0.0006) [2023-03-06 17:36:25,326][23882] Updated weights for policy 0, policy_version 43030 (0.0007) [2023-03-06 17:36:26,110][23882] Updated weights for policy 0, policy_version 43040 (0.0006) [2023-03-06 17:36:26,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13021.9, 300 sec: 13030.8). Total num frames: 44081152. Throughput: 0: 12998.5. Samples: 44079096. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-06 17:36:26,748][23556] Avg episode reward: [(0, '574.875')] [2023-03-06 17:36:26,907][23882] Updated weights for policy 0, policy_version 43050 (0.0006) [2023-03-06 17:36:27,688][23882] Updated weights for policy 0, policy_version 43060 (0.0006) [2023-03-06 17:36:28,480][23882] Updated weights for policy 0, policy_version 43070 (0.0006) [2023-03-06 17:36:29,279][23882] Updated weights for policy 0, policy_version 43080 (0.0006) [2023-03-06 17:36:30,076][23882] Updated weights for policy 0, policy_version 43090 (0.0006) [2023-03-06 17:36:30,885][23882] Updated weights for policy 0, policy_version 43100 (0.0006) [2023-03-06 17:36:31,658][23882] Updated weights for policy 0, policy_version 43110 (0.0006) [2023-03-06 17:36:31,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13004.8, 300 sec: 13027.4). Total num frames: 44145664. Throughput: 0: 13001.9. Samples: 44118005. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-06 17:36:31,748][23556] Avg episode reward: [(0, '522.601')] [2023-03-06 17:36:32,447][23882] Updated weights for policy 0, policy_version 43120 (0.0006) [2023-03-06 17:36:33,229][23882] Updated weights for policy 0, policy_version 43130 (0.0006) [2023-03-06 17:36:34,018][23882] Updated weights for policy 0, policy_version 43140 (0.0006) [2023-03-06 17:36:34,785][23882] Updated weights for policy 0, policy_version 43150 (0.0007) [2023-03-06 17:36:35,602][23882] Updated weights for policy 0, policy_version 43160 (0.0007) [2023-03-06 17:36:36,383][23882] Updated weights for policy 0, policy_version 43170 (0.0006) [2023-03-06 17:36:36,748][23556] Fps is (10 sec: 12902.4, 60 sec: 13004.8, 300 sec: 13027.4). Total num frames: 44210176. Throughput: 0: 12994.7. Samples: 44195796. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:36:36,748][23556] Avg episode reward: [(0, '485.895')] [2023-03-06 17:36:37,170][23882] Updated weights for policy 0, policy_version 43180 (0.0006) [2023-03-06 17:36:37,944][23882] Updated weights for policy 0, policy_version 43190 (0.0006) [2023-03-06 17:36:38,728][23882] Updated weights for policy 0, policy_version 43200 (0.0006) [2023-03-06 17:36:39,520][23882] Updated weights for policy 0, policy_version 43210 (0.0006) [2023-03-06 17:36:40,306][23882] Updated weights for policy 0, policy_version 43220 (0.0006) [2023-03-06 17:36:41,078][23882] Updated weights for policy 0, policy_version 43230 (0.0006) [2023-03-06 17:36:41,748][23556] Fps is (10 sec: 13004.6, 60 sec: 13004.8, 300 sec: 13027.4). Total num frames: 44275712. Throughput: 0: 13005.6. Samples: 44274139. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:36:41,748][23556] Avg episode reward: [(0, '633.459')] [2023-03-06 17:36:41,865][23882] Updated weights for policy 0, policy_version 43240 (0.0006) [2023-03-06 17:36:42,666][23882] Updated weights for policy 0, policy_version 43250 (0.0006) [2023-03-06 17:36:43,440][23882] Updated weights for policy 0, policy_version 43260 (0.0007) [2023-03-06 17:36:44,238][23882] Updated weights for policy 0, policy_version 43270 (0.0006) [2023-03-06 17:36:45,021][23882] Updated weights for policy 0, policy_version 43280 (0.0006) [2023-03-06 17:36:45,798][23882] Updated weights for policy 0, policy_version 43290 (0.0007) [2023-03-06 17:36:46,586][23882] Updated weights for policy 0, policy_version 43300 (0.0006) [2023-03-06 17:36:46,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13004.8, 300 sec: 13027.4). Total num frames: 44341248. Throughput: 0: 13005.1. Samples: 44313140. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:36:46,748][23556] Avg episode reward: [(0, '523.257')] [2023-03-06 17:36:47,365][23882] Updated weights for policy 0, policy_version 43310 (0.0006) [2023-03-06 17:36:48,163][23882] Updated weights for policy 0, policy_version 43320 (0.0006) [2023-03-06 17:36:48,937][23882] Updated weights for policy 0, policy_version 43330 (0.0006) [2023-03-06 17:36:49,721][23882] Updated weights for policy 0, policy_version 43340 (0.0007) [2023-03-06 17:36:50,514][23882] Updated weights for policy 0, policy_version 43350 (0.0006) [2023-03-06 17:36:51,324][23882] Updated weights for policy 0, policy_version 43360 (0.0007) [2023-03-06 17:36:51,748][23556] Fps is (10 sec: 13005.1, 60 sec: 13004.8, 300 sec: 13023.9). Total num frames: 44405760. Throughput: 0: 13017.0. Samples: 44391364. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:36:51,748][23556] Avg episode reward: [(0, '447.223')] [2023-03-06 17:36:52,103][23882] Updated weights for policy 0, policy_version 43370 (0.0007) [2023-03-06 17:36:52,884][23882] Updated weights for policy 0, policy_version 43380 (0.0007) [2023-03-06 17:36:53,680][23882] Updated weights for policy 0, policy_version 43390 (0.0006) [2023-03-06 17:36:54,449][23882] Updated weights for policy 0, policy_version 43400 (0.0006) [2023-03-06 17:36:55,232][23882] Updated weights for policy 0, policy_version 43410 (0.0007) [2023-03-06 17:36:56,028][23882] Updated weights for policy 0, policy_version 43420 (0.0006) [2023-03-06 17:36:56,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13023.9). Total num frames: 44471296. Throughput: 0: 13013.3. Samples: 44469467. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:36:56,754][23556] Avg episode reward: [(0, '595.434')] [2023-03-06 17:36:56,817][23882] Updated weights for policy 0, policy_version 43430 (0.0006) [2023-03-06 17:36:57,614][23882] Updated weights for policy 0, policy_version 43440 (0.0006) [2023-03-06 17:36:58,406][23882] Updated weights for policy 0, policy_version 43450 (0.0006) [2023-03-06 17:36:59,201][23882] Updated weights for policy 0, policy_version 43460 (0.0006) [2023-03-06 17:36:59,985][23882] Updated weights for policy 0, policy_version 43470 (0.0006) [2023-03-06 17:37:00,765][23882] Updated weights for policy 0, policy_version 43480 (0.0007) [2023-03-06 17:37:01,558][23882] Updated weights for policy 0, policy_version 43490 (0.0007) [2023-03-06 17:37:01,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13023.9). Total num frames: 44535808. Throughput: 0: 13009.3. Samples: 44508266. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:37:01,759][23556] Avg episode reward: [(0, '410.768')] [2023-03-06 17:37:02,330][23882] Updated weights for policy 0, policy_version 43500 (0.0007) [2023-03-06 17:37:03,122][23882] Updated weights for policy 0, policy_version 43510 (0.0006) [2023-03-06 17:37:03,921][23882] Updated weights for policy 0, policy_version 43520 (0.0006) [2023-03-06 17:37:04,700][23882] Updated weights for policy 0, policy_version 43530 (0.0006) [2023-03-06 17:37:05,492][23882] Updated weights for policy 0, policy_version 43540 (0.0006) [2023-03-06 17:37:06,280][23882] Updated weights for policy 0, policy_version 43550 (0.0006) [2023-03-06 17:37:06,748][23556] Fps is (10 sec: 12902.3, 60 sec: 13004.8, 300 sec: 13020.4). Total num frames: 44600320. Throughput: 0: 13009.5. Samples: 44586213. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:37:06,759][23556] Avg episode reward: [(0, '439.717')] [2023-03-06 17:37:07,075][23882] Updated weights for policy 0, policy_version 43560 (0.0005) [2023-03-06 17:37:07,865][23882] Updated weights for policy 0, policy_version 43570 (0.0006) [2023-03-06 17:37:08,653][23882] Updated weights for policy 0, policy_version 43580 (0.0006) [2023-03-06 17:37:09,434][23882] Updated weights for policy 0, policy_version 43590 (0.0007) [2023-03-06 17:37:10,207][23882] Updated weights for policy 0, policy_version 43600 (0.0006) [2023-03-06 17:37:10,976][23882] Updated weights for policy 0, policy_version 43610 (0.0007) [2023-03-06 17:37:11,748][23556] Fps is (10 sec: 13004.6, 60 sec: 13004.8, 300 sec: 13020.4). Total num frames: 44665856. Throughput: 0: 13006.6. Samples: 44664393. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:37:11,754][23556] Avg episode reward: [(0, '442.129')] [2023-03-06 17:37:11,768][23882] Updated weights for policy 0, policy_version 43620 (0.0006) [2023-03-06 17:37:12,567][23882] Updated weights for policy 0, policy_version 43630 (0.0006) [2023-03-06 17:37:13,338][23882] Updated weights for policy 0, policy_version 43640 (0.0006) [2023-03-06 17:37:14,129][23882] Updated weights for policy 0, policy_version 43650 (0.0007) [2023-03-06 17:37:14,909][23882] Updated weights for policy 0, policy_version 43660 (0.0006) [2023-03-06 17:37:15,712][23882] Updated weights for policy 0, policy_version 43670 (0.0005) [2023-03-06 17:37:16,510][23882] Updated weights for policy 0, policy_version 43680 (0.0007) [2023-03-06 17:37:16,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13004.8, 300 sec: 13020.4). Total num frames: 44730368. Throughput: 0: 13011.7. Samples: 44703532. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:37:16,759][23556] Avg episode reward: [(0, '494.194')] [2023-03-06 17:37:17,293][23882] Updated weights for policy 0, policy_version 43690 (0.0006) [2023-03-06 17:37:18,073][23882] Updated weights for policy 0, policy_version 43700 (0.0006) [2023-03-06 17:37:18,874][23882] Updated weights for policy 0, policy_version 43710 (0.0006) [2023-03-06 17:37:19,649][23882] Updated weights for policy 0, policy_version 43720 (0.0006) [2023-03-06 17:37:20,441][23882] Updated weights for policy 0, policy_version 43730 (0.0007) [2023-03-06 17:37:21,216][23882] Updated weights for policy 0, policy_version 43740 (0.0006) [2023-03-06 17:37:21,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13020.4). Total num frames: 44795904. Throughput: 0: 13015.2. Samples: 44781482. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:37:21,748][23556] Avg episode reward: [(0, '751.168')] [2023-03-06 17:37:21,992][23882] Updated weights for policy 0, policy_version 43750 (0.0006) [2023-03-06 17:37:22,801][23882] Updated weights for policy 0, policy_version 43760 (0.0006) [2023-03-06 17:37:23,584][23882] Updated weights for policy 0, policy_version 43770 (0.0007) [2023-03-06 17:37:24,364][23882] Updated weights for policy 0, policy_version 43780 (0.0008) [2023-03-06 17:37:25,161][23882] Updated weights for policy 0, policy_version 43790 (0.0006) [2023-03-06 17:37:25,938][23882] Updated weights for policy 0, policy_version 43800 (0.0006) [2023-03-06 17:37:26,719][23882] Updated weights for policy 0, policy_version 43810 (0.0007) [2023-03-06 17:37:26,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13004.8, 300 sec: 13020.4). Total num frames: 44861440. Throughput: 0: 13012.1. Samples: 44859683. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:37:26,748][23556] Avg episode reward: [(0, '441.279')] [2023-03-06 17:37:27,509][23882] Updated weights for policy 0, policy_version 43820 (0.0006) [2023-03-06 17:37:28,296][23882] Updated weights for policy 0, policy_version 43830 (0.0006) [2023-03-06 17:37:29,085][23882] Updated weights for policy 0, policy_version 43840 (0.0006) [2023-03-06 17:37:29,860][23882] Updated weights for policy 0, policy_version 43850 (0.0006) [2023-03-06 17:37:30,656][23882] Updated weights for policy 0, policy_version 43860 (0.0006) [2023-03-06 17:37:31,419][23882] Updated weights for policy 0, policy_version 43870 (0.0006) [2023-03-06 17:37:31,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13021.8, 300 sec: 13023.9). Total num frames: 44926976. Throughput: 0: 13013.9. Samples: 44898767. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:37:31,748][23556] Avg episode reward: [(0, '401.014')] [2023-03-06 17:37:32,223][23882] Updated weights for policy 0, policy_version 43880 (0.0007) [2023-03-06 17:37:32,994][23882] Updated weights for policy 0, policy_version 43890 (0.0006) [2023-03-06 17:37:33,777][23882] Updated weights for policy 0, policy_version 43900 (0.0006) [2023-03-06 17:37:34,562][23882] Updated weights for policy 0, policy_version 43910 (0.0005) [2023-03-06 17:37:35,188][23831] KL-divergence is very high: 242.3564 [2023-03-06 17:37:35,350][23882] Updated weights for policy 0, policy_version 43920 (0.0006) [2023-03-06 17:37:36,130][23882] Updated weights for policy 0, policy_version 43930 (0.0006) [2023-03-06 17:37:36,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 44991488. Throughput: 0: 13020.2. Samples: 44977276. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:37:36,748][23556] Avg episode reward: [(0, '357.964')] [2023-03-06 17:37:36,913][23882] Updated weights for policy 0, policy_version 43940 (0.0005) [2023-03-06 17:37:37,690][23882] Updated weights for policy 0, policy_version 43950 (0.0006) [2023-03-06 17:37:38,477][23882] Updated weights for policy 0, policy_version 43960 (0.0007) [2023-03-06 17:37:39,250][23882] Updated weights for policy 0, policy_version 43970 (0.0007) [2023-03-06 17:37:40,031][23882] Updated weights for policy 0, policy_version 43980 (0.0006) [2023-03-06 17:37:40,818][23882] Updated weights for policy 0, policy_version 43990 (0.0006) [2023-03-06 17:37:41,619][23882] Updated weights for policy 0, policy_version 44000 (0.0007) [2023-03-06 17:37:41,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13023.9). Total num frames: 45057024. Throughput: 0: 13026.2. Samples: 45055649. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:37:41,748][23556] Avg episode reward: [(0, '506.235')] [2023-03-06 17:37:42,390][23882] Updated weights for policy 0, policy_version 44010 (0.0007) [2023-03-06 17:37:43,178][23882] Updated weights for policy 0, policy_version 44020 (0.0006) [2023-03-06 17:37:43,958][23882] Updated weights for policy 0, policy_version 44030 (0.0006) [2023-03-06 17:37:44,746][23882] Updated weights for policy 0, policy_version 44040 (0.0006) [2023-03-06 17:37:45,542][23882] Updated weights for policy 0, policy_version 44050 (0.0005) [2023-03-06 17:37:46,322][23882] Updated weights for policy 0, policy_version 44060 (0.0006) [2023-03-06 17:37:46,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13021.9, 300 sec: 13023.9). Total num frames: 45122560. Throughput: 0: 13032.8. Samples: 45094741. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:37:46,748][23556] Avg episode reward: [(0, '400.544')] [2023-03-06 17:37:47,105][23882] Updated weights for policy 0, policy_version 44070 (0.0006) [2023-03-06 17:37:47,909][23882] Updated weights for policy 0, policy_version 44080 (0.0006) [2023-03-06 17:37:48,707][23882] Updated weights for policy 0, policy_version 44090 (0.0007) [2023-03-06 17:37:49,502][23882] Updated weights for policy 0, policy_version 44100 (0.0007) [2023-03-06 17:37:50,285][23882] Updated weights for policy 0, policy_version 44110 (0.0006) [2023-03-06 17:37:50,911][23831] KL-divergence is very high: 1037.9591 [2023-03-06 17:37:51,073][23882] Updated weights for policy 0, policy_version 44120 (0.0006) [2023-03-06 17:37:51,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13021.8, 300 sec: 13020.4). Total num frames: 45187072. Throughput: 0: 13030.7. Samples: 45172596. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:37:51,749][23556] Avg episode reward: [(0, '383.088')] [2023-03-06 17:37:51,845][23882] Updated weights for policy 0, policy_version 44130 (0.0006) [2023-03-06 17:37:52,622][23882] Updated weights for policy 0, policy_version 44140 (0.0006) [2023-03-06 17:37:53,402][23882] Updated weights for policy 0, policy_version 44150 (0.0006) [2023-03-06 17:37:54,190][23882] Updated weights for policy 0, policy_version 44160 (0.0006) [2023-03-06 17:37:55,007][23882] Updated weights for policy 0, policy_version 44170 (0.0006) [2023-03-06 17:37:55,782][23882] Updated weights for policy 0, policy_version 44180 (0.0006) [2023-03-06 17:37:56,572][23882] Updated weights for policy 0, policy_version 44190 (0.0007) [2023-03-06 17:37:56,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13021.8, 300 sec: 13023.9). Total num frames: 45252608. Throughput: 0: 13029.5. Samples: 45250722. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:37:56,748][23556] Avg episode reward: [(0, '397.179')] [2023-03-06 17:37:56,752][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000044192_45252608.pth... [2023-03-06 17:37:56,783][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000041140_42127360.pth [2023-03-06 17:37:57,359][23882] Updated weights for policy 0, policy_version 44200 (0.0006) [2023-03-06 17:37:58,137][23882] Updated weights for policy 0, policy_version 44210 (0.0006) [2023-03-06 17:37:58,933][23882] Updated weights for policy 0, policy_version 44220 (0.0006) [2023-03-06 17:37:59,717][23882] Updated weights for policy 0, policy_version 44230 (0.0007) [2023-03-06 17:38:00,489][23882] Updated weights for policy 0, policy_version 44240 (0.0006) [2023-03-06 17:38:01,267][23882] Updated weights for policy 0, policy_version 44250 (0.0006) [2023-03-06 17:38:01,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13023.9). Total num frames: 45318144. Throughput: 0: 13029.1. Samples: 45289842. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:38:01,748][23556] Avg episode reward: [(0, '260.654')] [2023-03-06 17:38:02,049][23882] Updated weights for policy 0, policy_version 44260 (0.0006) [2023-03-06 17:38:02,821][23882] Updated weights for policy 0, policy_version 44270 (0.0007) [2023-03-06 17:38:03,608][23882] Updated weights for policy 0, policy_version 44280 (0.0007) [2023-03-06 17:38:04,418][23882] Updated weights for policy 0, policy_version 44290 (0.0007) [2023-03-06 17:38:05,190][23882] Updated weights for policy 0, policy_version 44300 (0.0006) [2023-03-06 17:38:05,969][23882] Updated weights for policy 0, policy_version 44310 (0.0006) [2023-03-06 17:38:06,746][23882] Updated weights for policy 0, policy_version 44320 (0.0007) [2023-03-06 17:38:06,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13027.4). Total num frames: 45383680. Throughput: 0: 13044.5. Samples: 45368483. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:38:06,748][23556] Avg episode reward: [(0, '304.957')] [2023-03-06 17:38:07,546][23882] Updated weights for policy 0, policy_version 44330 (0.0007) [2023-03-06 17:38:08,325][23882] Updated weights for policy 0, policy_version 44340 (0.0007) [2023-03-06 17:38:09,100][23882] Updated weights for policy 0, policy_version 44350 (0.0007) [2023-03-06 17:38:09,891][23882] Updated weights for policy 0, policy_version 44360 (0.0007) [2023-03-06 17:38:10,678][23882] Updated weights for policy 0, policy_version 44370 (0.0007) [2023-03-06 17:38:11,460][23882] Updated weights for policy 0, policy_version 44380 (0.0006) [2023-03-06 17:38:11,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13023.9). Total num frames: 45448192. Throughput: 0: 13046.6. Samples: 45446779. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:38:11,748][23556] Avg episode reward: [(0, '238.682')] [2023-03-06 17:38:12,251][23882] Updated weights for policy 0, policy_version 44390 (0.0006) [2023-03-06 17:38:13,035][23882] Updated weights for policy 0, policy_version 44400 (0.0007) [2023-03-06 17:38:13,802][23882] Updated weights for policy 0, policy_version 44410 (0.0006) [2023-03-06 17:38:14,615][23882] Updated weights for policy 0, policy_version 44420 (0.0006) [2023-03-06 17:38:15,415][23882] Updated weights for policy 0, policy_version 44430 (0.0006) [2023-03-06 17:38:16,193][23882] Updated weights for policy 0, policy_version 44440 (0.0007) [2023-03-06 17:38:16,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13027.4). Total num frames: 45513728. Throughput: 0: 13046.7. Samples: 45485869. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:38:16,748][23556] Avg episode reward: [(0, '305.348')] [2023-03-06 17:38:16,987][23882] Updated weights for policy 0, policy_version 44450 (0.0007) [2023-03-06 17:38:17,771][23882] Updated weights for policy 0, policy_version 44460 (0.0006) [2023-03-06 17:38:18,546][23882] Updated weights for policy 0, policy_version 44470 (0.0005) [2023-03-06 17:38:19,340][23882] Updated weights for policy 0, policy_version 44480 (0.0006) [2023-03-06 17:38:20,102][23882] Updated weights for policy 0, policy_version 44490 (0.0007) [2023-03-06 17:38:20,893][23882] Updated weights for policy 0, policy_version 44500 (0.0006) [2023-03-06 17:38:21,675][23882] Updated weights for policy 0, policy_version 44510 (0.0006) [2023-03-06 17:38:21,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13039.0, 300 sec: 13023.9). Total num frames: 45578240. Throughput: 0: 13039.2. Samples: 45564041. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:38:21,748][23556] Avg episode reward: [(0, '286.170')] [2023-03-06 17:38:22,455][23882] Updated weights for policy 0, policy_version 44520 (0.0006) [2023-03-06 17:38:22,924][23831] KL-divergence is very high: 589.6928 [2023-03-06 17:38:23,247][23882] Updated weights for policy 0, policy_version 44530 (0.0006) [2023-03-06 17:38:24,055][23882] Updated weights for policy 0, policy_version 44540 (0.0006) [2023-03-06 17:38:24,833][23882] Updated weights for policy 0, policy_version 44550 (0.0006) [2023-03-06 17:38:25,621][23882] Updated weights for policy 0, policy_version 44560 (0.0006) [2023-03-06 17:38:26,394][23882] Updated weights for policy 0, policy_version 44570 (0.0006) [2023-03-06 17:38:26,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13039.0, 300 sec: 13023.9). Total num frames: 45643776. Throughput: 0: 13032.9. Samples: 45642127. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:38:26,748][23556] Avg episode reward: [(0, '351.756')] [2023-03-06 17:38:27,177][23882] Updated weights for policy 0, policy_version 44580 (0.0006) [2023-03-06 17:38:27,956][23882] Updated weights for policy 0, policy_version 44590 (0.0008) [2023-03-06 17:38:28,741][23882] Updated weights for policy 0, policy_version 44600 (0.0007) [2023-03-06 17:38:29,531][23882] Updated weights for policy 0, policy_version 44610 (0.0007) [2023-03-06 17:38:30,318][23882] Updated weights for policy 0, policy_version 44620 (0.0007) [2023-03-06 17:38:31,114][23882] Updated weights for policy 0, policy_version 44630 (0.0006) [2023-03-06 17:38:31,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13027.4). Total num frames: 45709312. Throughput: 0: 13037.1. Samples: 45681410. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:38:31,748][23556] Avg episode reward: [(0, '322.921')] [2023-03-06 17:38:31,891][23882] Updated weights for policy 0, policy_version 44640 (0.0006) [2023-03-06 17:38:32,667][23882] Updated weights for policy 0, policy_version 44650 (0.0007) [2023-03-06 17:38:33,432][23882] Updated weights for policy 0, policy_version 44660 (0.0007) [2023-03-06 17:38:34,220][23882] Updated weights for policy 0, policy_version 44670 (0.0006) [2023-03-06 17:38:34,987][23882] Updated weights for policy 0, policy_version 44680 (0.0006) [2023-03-06 17:38:35,749][23882] Updated weights for policy 0, policy_version 44690 (0.0006) [2023-03-06 17:38:36,555][23882] Updated weights for policy 0, policy_version 44700 (0.0006) [2023-03-06 17:38:36,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13030.8). Total num frames: 45774848. Throughput: 0: 13058.7. Samples: 45760234. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:38:36,748][23556] Avg episode reward: [(0, '418.979')] [2023-03-06 17:38:37,010][23831] KL-divergence is very high: 854.2196 [2023-03-06 17:38:37,329][23882] Updated weights for policy 0, policy_version 44710 (0.0007) [2023-03-06 17:38:38,109][23882] Updated weights for policy 0, policy_version 44720 (0.0006) [2023-03-06 17:38:38,813][23831] KL-divergence is very high: 1389.1198 [2023-03-06 17:38:38,899][23882] Updated weights for policy 0, policy_version 44730 (0.0006) [2023-03-06 17:38:39,684][23882] Updated weights for policy 0, policy_version 44740 (0.0006) [2023-03-06 17:38:40,453][23882] Updated weights for policy 0, policy_version 44750 (0.0007) [2023-03-06 17:38:41,252][23882] Updated weights for policy 0, policy_version 44760 (0.0008) [2023-03-06 17:38:41,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13030.8). Total num frames: 45840384. Throughput: 0: 13068.5. Samples: 45838801. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:38:41,748][23556] Avg episode reward: [(0, '413.888')] [2023-03-06 17:38:42,033][23882] Updated weights for policy 0, policy_version 44770 (0.0006) [2023-03-06 17:38:42,804][23882] Updated weights for policy 0, policy_version 44780 (0.0006) [2023-03-06 17:38:43,599][23882] Updated weights for policy 0, policy_version 44790 (0.0006) [2023-03-06 17:38:44,379][23882] Updated weights for policy 0, policy_version 44800 (0.0007) [2023-03-06 17:38:45,165][23882] Updated weights for policy 0, policy_version 44810 (0.0006) [2023-03-06 17:38:45,938][23882] Updated weights for policy 0, policy_version 44820 (0.0006) [2023-03-06 17:38:46,738][23882] Updated weights for policy 0, policy_version 44830 (0.0007) [2023-03-06 17:38:46,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13030.8). Total num frames: 45905920. Throughput: 0: 13067.7. Samples: 45877885. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:38:46,749][23556] Avg episode reward: [(0, '261.410')] [2023-03-06 17:38:47,510][23882] Updated weights for policy 0, policy_version 44840 (0.0006) [2023-03-06 17:38:48,311][23882] Updated weights for policy 0, policy_version 44850 (0.0006) [2023-03-06 17:38:49,077][23882] Updated weights for policy 0, policy_version 44860 (0.0005) [2023-03-06 17:38:49,857][23882] Updated weights for policy 0, policy_version 44870 (0.0005) [2023-03-06 17:38:50,639][23882] Updated weights for policy 0, policy_version 44880 (0.0007) [2023-03-06 17:38:51,425][23882] Updated weights for policy 0, policy_version 44890 (0.0005) [2023-03-06 17:38:51,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13034.3). Total num frames: 45971456. Throughput: 0: 13066.0. Samples: 45956453. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:38:51,759][23556] Avg episode reward: [(0, '353.156')] [2023-03-06 17:38:52,205][23882] Updated weights for policy 0, policy_version 44900 (0.0006) [2023-03-06 17:38:52,990][23882] Updated weights for policy 0, policy_version 44910 (0.0008) [2023-03-06 17:38:53,790][23882] Updated weights for policy 0, policy_version 44920 (0.0006) [2023-03-06 17:38:54,546][23882] Updated weights for policy 0, policy_version 44930 (0.0006) [2023-03-06 17:38:55,345][23882] Updated weights for policy 0, policy_version 44940 (0.0006) [2023-03-06 17:38:56,143][23882] Updated weights for policy 0, policy_version 44950 (0.0007) [2023-03-06 17:38:56,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13030.8). Total num frames: 46035968. Throughput: 0: 13066.1. Samples: 46034754. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:38:56,757][23556] Avg episode reward: [(0, '192.685')] [2023-03-06 17:38:56,917][23882] Updated weights for policy 0, policy_version 44960 (0.0006) [2023-03-06 17:38:57,719][23882] Updated weights for policy 0, policy_version 44970 (0.0006) [2023-03-06 17:38:58,487][23882] Updated weights for policy 0, policy_version 44980 (0.0007) [2023-03-06 17:38:59,287][23882] Updated weights for policy 0, policy_version 44990 (0.0006) [2023-03-06 17:39:00,070][23882] Updated weights for policy 0, policy_version 45000 (0.0006) [2023-03-06 17:39:00,849][23882] Updated weights for policy 0, policy_version 45010 (0.0006) [2023-03-06 17:39:01,628][23882] Updated weights for policy 0, policy_version 45020 (0.0008) [2023-03-06 17:39:01,748][23556] Fps is (10 sec: 13004.5, 60 sec: 13056.0, 300 sec: 13030.8). Total num frames: 46101504. Throughput: 0: 13066.4. Samples: 46073858. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:39:01,748][23556] Avg episode reward: [(0, '311.811')] [2023-03-06 17:39:02,420][23882] Updated weights for policy 0, policy_version 45030 (0.0008) [2023-03-06 17:39:03,221][23882] Updated weights for policy 0, policy_version 45040 (0.0007) [2023-03-06 17:39:03,742][23831] KL-divergence is very high: 436.9353 [2023-03-06 17:39:03,999][23882] Updated weights for policy 0, policy_version 45050 (0.0006) [2023-03-06 17:39:04,778][23882] Updated weights for policy 0, policy_version 45060 (0.0006) [2023-03-06 17:39:05,569][23882] Updated weights for policy 0, policy_version 45070 (0.0007) [2023-03-06 17:39:06,334][23882] Updated weights for policy 0, policy_version 45080 (0.0006) [2023-03-06 17:39:06,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13030.8). Total num frames: 46167040. Throughput: 0: 13064.6. Samples: 46151948. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:39:06,748][23556] Avg episode reward: [(0, '326.455')] [2023-03-06 17:39:07,115][23882] Updated weights for policy 0, policy_version 45090 (0.0006) [2023-03-06 17:39:07,888][23882] Updated weights for policy 0, policy_version 45100 (0.0007) [2023-03-06 17:39:08,673][23882] Updated weights for policy 0, policy_version 45110 (0.0007) [2023-03-06 17:39:09,444][23882] Updated weights for policy 0, policy_version 45120 (0.0006) [2023-03-06 17:39:10,230][23882] Updated weights for policy 0, policy_version 45130 (0.0006) [2023-03-06 17:39:11,012][23882] Updated weights for policy 0, policy_version 45140 (0.0006) [2023-03-06 17:39:11,748][23556] Fps is (10 sec: 13107.4, 60 sec: 13073.1, 300 sec: 13030.8). Total num frames: 46232576. Throughput: 0: 13080.9. Samples: 46230769. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:39:11,748][23556] Avg episode reward: [(0, '341.394')] [2023-03-06 17:39:11,796][23882] Updated weights for policy 0, policy_version 45150 (0.0006) [2023-03-06 17:39:12,574][23882] Updated weights for policy 0, policy_version 45160 (0.0007) [2023-03-06 17:39:13,372][23882] Updated weights for policy 0, policy_version 45170 (0.0006) [2023-03-06 17:39:14,140][23882] Updated weights for policy 0, policy_version 45180 (0.0006) [2023-03-06 17:39:14,934][23882] Updated weights for policy 0, policy_version 45190 (0.0007) [2023-03-06 17:39:15,713][23882] Updated weights for policy 0, policy_version 45200 (0.0006) [2023-03-06 17:39:16,489][23882] Updated weights for policy 0, policy_version 45210 (0.0006) [2023-03-06 17:39:16,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13034.3). Total num frames: 46298112. Throughput: 0: 13081.0. Samples: 46270053. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:39:16,748][23556] Avg episode reward: [(0, '363.910')] [2023-03-06 17:39:17,288][23882] Updated weights for policy 0, policy_version 45220 (0.0006) [2023-03-06 17:39:18,073][23882] Updated weights for policy 0, policy_version 45230 (0.0007) [2023-03-06 17:39:18,849][23882] Updated weights for policy 0, policy_version 45240 (0.0006) [2023-03-06 17:39:19,657][23882] Updated weights for policy 0, policy_version 45250 (0.0006) [2023-03-06 17:39:20,433][23882] Updated weights for policy 0, policy_version 45260 (0.0006) [2023-03-06 17:39:21,226][23882] Updated weights for policy 0, policy_version 45270 (0.0006) [2023-03-06 17:39:21,748][23556] Fps is (10 sec: 13004.6, 60 sec: 13073.0, 300 sec: 13030.8). Total num frames: 46362624. Throughput: 0: 13064.8. Samples: 46348151. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:39:21,748][23556] Avg episode reward: [(0, '350.633')] [2023-03-06 17:39:22,026][23882] Updated weights for policy 0, policy_version 45280 (0.0006) [2023-03-06 17:39:22,828][23882] Updated weights for policy 0, policy_version 45290 (0.0007) [2023-03-06 17:39:23,627][23882] Updated weights for policy 0, policy_version 45300 (0.0007) [2023-03-06 17:39:24,388][23882] Updated weights for policy 0, policy_version 45310 (0.0006) [2023-03-06 17:39:25,194][23882] Updated weights for policy 0, policy_version 45320 (0.0006) [2023-03-06 17:39:25,978][23882] Updated weights for policy 0, policy_version 45330 (0.0006) [2023-03-06 17:39:26,748][23556] Fps is (10 sec: 12902.3, 60 sec: 13056.0, 300 sec: 13030.8). Total num frames: 46427136. Throughput: 0: 13043.2. Samples: 46425744. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:39:26,748][23556] Avg episode reward: [(0, '408.850')] [2023-03-06 17:39:26,766][23882] Updated weights for policy 0, policy_version 45340 (0.0007) [2023-03-06 17:39:27,548][23882] Updated weights for policy 0, policy_version 45350 (0.0008) [2023-03-06 17:39:28,353][23882] Updated weights for policy 0, policy_version 45360 (0.0007) [2023-03-06 17:39:29,140][23882] Updated weights for policy 0, policy_version 45370 (0.0006) [2023-03-06 17:39:29,933][23882] Updated weights for policy 0, policy_version 45380 (0.0006) [2023-03-06 17:39:30,713][23882] Updated weights for policy 0, policy_version 45390 (0.0006) [2023-03-06 17:39:31,496][23882] Updated weights for policy 0, policy_version 45400 (0.0007) [2023-03-06 17:39:31,748][23556] Fps is (10 sec: 13005.1, 60 sec: 13056.0, 300 sec: 13030.8). Total num frames: 46492672. Throughput: 0: 13038.4. Samples: 46464610. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:39:31,748][23556] Avg episode reward: [(0, '406.608')] [2023-03-06 17:39:32,289][23882] Updated weights for policy 0, policy_version 45410 (0.0007) [2023-03-06 17:39:33,092][23882] Updated weights for policy 0, policy_version 45420 (0.0006) [2023-03-06 17:39:33,877][23882] Updated weights for policy 0, policy_version 45430 (0.0006) [2023-03-06 17:39:34,649][23882] Updated weights for policy 0, policy_version 45440 (0.0007) [2023-03-06 17:39:35,428][23882] Updated weights for policy 0, policy_version 45450 (0.0006) [2023-03-06 17:39:36,222][23882] Updated weights for policy 0, policy_version 45460 (0.0006) [2023-03-06 17:39:36,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13030.8). Total num frames: 46557184. Throughput: 0: 13033.4. Samples: 46542956. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:39:36,748][23556] Avg episode reward: [(0, '408.474')] [2023-03-06 17:39:36,998][23882] Updated weights for policy 0, policy_version 45470 (0.0006) [2023-03-06 17:39:37,787][23882] Updated weights for policy 0, policy_version 45480 (0.0006) [2023-03-06 17:39:38,556][23882] Updated weights for policy 0, policy_version 45490 (0.0006) [2023-03-06 17:39:39,366][23882] Updated weights for policy 0, policy_version 45500 (0.0007) [2023-03-06 17:39:40,138][23882] Updated weights for policy 0, policy_version 45510 (0.0006) [2023-03-06 17:39:40,916][23882] Updated weights for policy 0, policy_version 45520 (0.0006) [2023-03-06 17:39:41,705][23882] Updated weights for policy 0, policy_version 45530 (0.0006) [2023-03-06 17:39:41,748][23556] Fps is (10 sec: 13004.6, 60 sec: 13038.9, 300 sec: 13030.8). Total num frames: 46622720. Throughput: 0: 13035.1. Samples: 46621335. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:39:41,748][23556] Avg episode reward: [(0, '425.781')] [2023-03-06 17:39:42,502][23882] Updated weights for policy 0, policy_version 45540 (0.0007) [2023-03-06 17:39:43,271][23882] Updated weights for policy 0, policy_version 45550 (0.0006) [2023-03-06 17:39:44,059][23882] Updated weights for policy 0, policy_version 45560 (0.0006) [2023-03-06 17:39:44,856][23882] Updated weights for policy 0, policy_version 45570 (0.0006) [2023-03-06 17:39:45,635][23882] Updated weights for policy 0, policy_version 45580 (0.0005) [2023-03-06 17:39:46,418][23882] Updated weights for policy 0, policy_version 45590 (0.0006) [2023-03-06 17:39:46,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13030.8). Total num frames: 46688256. Throughput: 0: 13032.0. Samples: 46660297. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:39:46,748][23556] Avg episode reward: [(0, '423.466')] [2023-03-06 17:39:47,211][23882] Updated weights for policy 0, policy_version 45600 (0.0007) [2023-03-06 17:39:48,005][23882] Updated weights for policy 0, policy_version 45610 (0.0006) [2023-03-06 17:39:48,789][23882] Updated weights for policy 0, policy_version 45620 (0.0006) [2023-03-06 17:39:49,560][23882] Updated weights for policy 0, policy_version 45630 (0.0006) [2023-03-06 17:39:50,346][23882] Updated weights for policy 0, policy_version 45640 (0.0006) [2023-03-06 17:39:51,135][23882] Updated weights for policy 0, policy_version 45650 (0.0006) [2023-03-06 17:39:51,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13021.8, 300 sec: 13027.4). Total num frames: 46752768. Throughput: 0: 13036.3. Samples: 46738583. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:39:51,748][23556] Avg episode reward: [(0, '351.917')] [2023-03-06 17:39:51,952][23882] Updated weights for policy 0, policy_version 45660 (0.0006) [2023-03-06 17:39:52,737][23882] Updated weights for policy 0, policy_version 45670 (0.0006) [2023-03-06 17:39:53,526][23882] Updated weights for policy 0, policy_version 45680 (0.0006) [2023-03-06 17:39:54,282][23882] Updated weights for policy 0, policy_version 45690 (0.0007) [2023-03-06 17:39:55,077][23882] Updated weights for policy 0, policy_version 45700 (0.0006) [2023-03-06 17:39:55,865][23882] Updated weights for policy 0, policy_version 45710 (0.0006) [2023-03-06 17:39:56,646][23882] Updated weights for policy 0, policy_version 45720 (0.0006) [2023-03-06 17:39:56,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13030.8). Total num frames: 46818304. Throughput: 0: 13015.8. Samples: 46816482. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:39:56,748][23556] Avg episode reward: [(0, '392.799')] [2023-03-06 17:39:56,753][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000045721_46818304.pth... [2023-03-06 17:39:56,783][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000042667_43691008.pth [2023-03-06 17:39:57,203][23831] KL-divergence is very high: 6240.1362 [2023-03-06 17:39:57,449][23882] Updated weights for policy 0, policy_version 45730 (0.0006) [2023-03-06 17:39:58,219][23882] Updated weights for policy 0, policy_version 45740 (0.0006) [2023-03-06 17:39:59,020][23882] Updated weights for policy 0, policy_version 45750 (0.0007) [2023-03-06 17:39:59,796][23882] Updated weights for policy 0, policy_version 45760 (0.0008) [2023-03-06 17:40:00,578][23882] Updated weights for policy 0, policy_version 45770 (0.0006) [2023-03-06 17:40:01,354][23882] Updated weights for policy 0, policy_version 45780 (0.0006) [2023-03-06 17:40:01,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13027.4). Total num frames: 46882816. Throughput: 0: 13011.4. Samples: 46855568. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:40:01,748][23556] Avg episode reward: [(0, '339.000')] [2023-03-06 17:40:02,145][23882] Updated weights for policy 0, policy_version 45790 (0.0007) [2023-03-06 17:40:02,924][23882] Updated weights for policy 0, policy_version 45800 (0.0006) [2023-03-06 17:40:03,718][23882] Updated weights for policy 0, policy_version 45810 (0.0006) [2023-03-06 17:40:04,501][23882] Updated weights for policy 0, policy_version 45820 (0.0007) [2023-03-06 17:40:05,286][23882] Updated weights for policy 0, policy_version 45830 (0.0006) [2023-03-06 17:40:06,063][23882] Updated weights for policy 0, policy_version 45840 (0.0006) [2023-03-06 17:40:06,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13027.4). Total num frames: 46948352. Throughput: 0: 13016.6. Samples: 46933898. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:40:06,748][23556] Avg episode reward: [(0, '451.964')] [2023-03-06 17:40:06,838][23882] Updated weights for policy 0, policy_version 45850 (0.0006) [2023-03-06 17:40:07,613][23882] Updated weights for policy 0, policy_version 45860 (0.0006) [2023-03-06 17:40:08,413][23882] Updated weights for policy 0, policy_version 45870 (0.0007) [2023-03-06 17:40:09,212][23882] Updated weights for policy 0, policy_version 45880 (0.0007) [2023-03-06 17:40:10,005][23882] Updated weights for policy 0, policy_version 45890 (0.0006) [2023-03-06 17:40:10,794][23882] Updated weights for policy 0, policy_version 45900 (0.0006) [2023-03-06 17:40:11,572][23882] Updated weights for policy 0, policy_version 45910 (0.0005) [2023-03-06 17:40:11,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13021.9, 300 sec: 13027.4). Total num frames: 47013888. Throughput: 0: 13028.6. Samples: 47012032. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:40:11,748][23556] Avg episode reward: [(0, '430.579')] [2023-03-06 17:40:12,381][23882] Updated weights for policy 0, policy_version 45920 (0.0006) [2023-03-06 17:40:13,158][23882] Updated weights for policy 0, policy_version 45930 (0.0006) [2023-03-06 17:40:13,930][23882] Updated weights for policy 0, policy_version 45940 (0.0007) [2023-03-06 17:40:14,731][23882] Updated weights for policy 0, policy_version 45950 (0.0007) [2023-03-06 17:40:15,517][23882] Updated weights for policy 0, policy_version 45960 (0.0006) [2023-03-06 17:40:16,292][23882] Updated weights for policy 0, policy_version 45970 (0.0006) [2023-03-06 17:40:16,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13004.8, 300 sec: 13027.4). Total num frames: 47078400. Throughput: 0: 13031.5. Samples: 47051027. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:40:16,748][23556] Avg episode reward: [(0, '445.520')] [2023-03-06 17:40:17,088][23882] Updated weights for policy 0, policy_version 45980 (0.0006) [2023-03-06 17:40:17,876][23882] Updated weights for policy 0, policy_version 45990 (0.0006) [2023-03-06 17:40:18,670][23882] Updated weights for policy 0, policy_version 46000 (0.0006) [2023-03-06 17:40:19,442][23882] Updated weights for policy 0, policy_version 46010 (0.0006) [2023-03-06 17:40:20,224][23882] Updated weights for policy 0, policy_version 46020 (0.0006) [2023-03-06 17:40:21,018][23882] Updated weights for policy 0, policy_version 46030 (0.0006) [2023-03-06 17:40:21,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13030.8). Total num frames: 47143936. Throughput: 0: 13026.3. Samples: 47129141. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:40:21,748][23556] Avg episode reward: [(0, '365.139')] [2023-03-06 17:40:21,800][23882] Updated weights for policy 0, policy_version 46040 (0.0006) [2023-03-06 17:40:22,595][23882] Updated weights for policy 0, policy_version 46050 (0.0007) [2023-03-06 17:40:23,379][23882] Updated weights for policy 0, policy_version 46060 (0.0006) [2023-03-06 17:40:24,153][23882] Updated weights for policy 0, policy_version 46070 (0.0006) [2023-03-06 17:40:24,942][23882] Updated weights for policy 0, policy_version 46080 (0.0007) [2023-03-06 17:40:25,724][23882] Updated weights for policy 0, policy_version 46090 (0.0007) [2023-03-06 17:40:26,507][23882] Updated weights for policy 0, policy_version 46100 (0.0007) [2023-03-06 17:40:26,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13030.8). Total num frames: 47209472. Throughput: 0: 13024.4. Samples: 47207432. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:40:26,748][23556] Avg episode reward: [(0, '459.544')] [2023-03-06 17:40:27,302][23882] Updated weights for policy 0, policy_version 46110 (0.0006) [2023-03-06 17:40:28,073][23882] Updated weights for policy 0, policy_version 46120 (0.0007) [2023-03-06 17:40:28,880][23882] Updated weights for policy 0, policy_version 46130 (0.0007) [2023-03-06 17:40:29,649][23882] Updated weights for policy 0, policy_version 46140 (0.0006) [2023-03-06 17:40:30,442][23882] Updated weights for policy 0, policy_version 46150 (0.0006) [2023-03-06 17:40:31,198][23882] Updated weights for policy 0, policy_version 46160 (0.0006) [2023-03-06 17:40:31,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.8, 300 sec: 13030.8). Total num frames: 47273984. Throughput: 0: 13028.4. Samples: 47246574. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:40:31,748][23556] Avg episode reward: [(0, '519.073')] [2023-03-06 17:40:31,985][23882] Updated weights for policy 0, policy_version 46170 (0.0006) [2023-03-06 17:40:32,785][23882] Updated weights for policy 0, policy_version 46180 (0.0006) [2023-03-06 17:40:33,564][23882] Updated weights for policy 0, policy_version 46190 (0.0006) [2023-03-06 17:40:34,345][23882] Updated weights for policy 0, policy_version 46200 (0.0006) [2023-03-06 17:40:35,139][23882] Updated weights for policy 0, policy_version 46210 (0.0007) [2023-03-06 17:40:35,837][23831] KL-divergence is very high: 137.9029 [2023-03-06 17:40:35,912][23882] Updated weights for policy 0, policy_version 46220 (0.0007) [2023-03-06 17:40:36,689][23882] Updated weights for policy 0, policy_version 46230 (0.0007) [2023-03-06 17:40:36,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13030.8). Total num frames: 47339520. Throughput: 0: 13032.3. Samples: 47325039. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:40:36,748][23556] Avg episode reward: [(0, '472.534')] [2023-03-06 17:40:37,491][23882] Updated weights for policy 0, policy_version 46240 (0.0006) [2023-03-06 17:40:38,274][23882] Updated weights for policy 0, policy_version 46250 (0.0007) [2023-03-06 17:40:39,070][23882] Updated weights for policy 0, policy_version 46260 (0.0006) [2023-03-06 17:40:39,822][23882] Updated weights for policy 0, policy_version 46270 (0.0006) [2023-03-06 17:40:40,583][23882] Updated weights for policy 0, policy_version 46280 (0.0007) [2023-03-06 17:40:41,377][23882] Updated weights for policy 0, policy_version 46290 (0.0006) [2023-03-06 17:40:41,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13030.8). Total num frames: 47405056. Throughput: 0: 13048.3. Samples: 47403652. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:40:41,748][23556] Avg episode reward: [(0, '320.084')] [2023-03-06 17:40:42,181][23882] Updated weights for policy 0, policy_version 46300 (0.0006) [2023-03-06 17:40:42,958][23882] Updated weights for policy 0, policy_version 46310 (0.0007) [2023-03-06 17:40:43,745][23882] Updated weights for policy 0, policy_version 46320 (0.0006) [2023-03-06 17:40:44,524][23882] Updated weights for policy 0, policy_version 46330 (0.0006) [2023-03-06 17:40:45,330][23882] Updated weights for policy 0, policy_version 46340 (0.0007) [2023-03-06 17:40:46,111][23882] Updated weights for policy 0, policy_version 46350 (0.0007) [2023-03-06 17:40:46,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 47470592. Throughput: 0: 13048.2. Samples: 47442739. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:40:46,748][23556] Avg episode reward: [(0, '254.864')] [2023-03-06 17:40:46,884][23882] Updated weights for policy 0, policy_version 46360 (0.0006) [2023-03-06 17:40:47,673][23882] Updated weights for policy 0, policy_version 46370 (0.0007) [2023-03-06 17:40:48,445][23882] Updated weights for policy 0, policy_version 46380 (0.0006) [2023-03-06 17:40:49,262][23882] Updated weights for policy 0, policy_version 46390 (0.0007) [2023-03-06 17:40:50,065][23882] Updated weights for policy 0, policy_version 46400 (0.0007) [2023-03-06 17:40:50,847][23882] Updated weights for policy 0, policy_version 46410 (0.0006) [2023-03-06 17:40:51,642][23882] Updated weights for policy 0, policy_version 46420 (0.0006) [2023-03-06 17:40:51,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 47535104. Throughput: 0: 13036.9. Samples: 47520559. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:40:51,748][23556] Avg episode reward: [(0, '340.938')] [2023-03-06 17:40:52,107][23831] KL-divergence is very high: 118.8423 [2023-03-06 17:40:52,426][23882] Updated weights for policy 0, policy_version 46430 (0.0006) [2023-03-06 17:40:53,201][23882] Updated weights for policy 0, policy_version 46440 (0.0006) [2023-03-06 17:40:54,000][23882] Updated weights for policy 0, policy_version 46450 (0.0007) [2023-03-06 17:40:54,783][23882] Updated weights for policy 0, policy_version 46460 (0.0007) [2023-03-06 17:40:55,545][23882] Updated weights for policy 0, policy_version 46470 (0.0006) [2023-03-06 17:40:55,857][23831] KL-divergence is very high: 235.2035 [2023-03-06 17:40:56,331][23882] Updated weights for policy 0, policy_version 46480 (0.0006) [2023-03-06 17:40:56,748][23556] Fps is (10 sec: 13005.0, 60 sec: 13039.0, 300 sec: 13034.3). Total num frames: 47600640. Throughput: 0: 13038.4. Samples: 47598758. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:40:56,748][23556] Avg episode reward: [(0, '244.843')] [2023-03-06 17:40:57,117][23882] Updated weights for policy 0, policy_version 46490 (0.0007) [2023-03-06 17:40:57,895][23882] Updated weights for policy 0, policy_version 46500 (0.0006) [2023-03-06 17:40:58,685][23882] Updated weights for policy 0, policy_version 46510 (0.0006) [2023-03-06 17:40:59,480][23882] Updated weights for policy 0, policy_version 46520 (0.0006) [2023-03-06 17:41:00,260][23882] Updated weights for policy 0, policy_version 46530 (0.0006) [2023-03-06 17:41:01,042][23882] Updated weights for policy 0, policy_version 46540 (0.0006) [2023-03-06 17:41:01,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 47665152. Throughput: 0: 13045.8. Samples: 47638088. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:41:01,748][23556] Avg episode reward: [(0, '245.623')] [2023-03-06 17:41:01,825][23882] Updated weights for policy 0, policy_version 46550 (0.0006) [2023-03-06 17:41:02,616][23882] Updated weights for policy 0, policy_version 46560 (0.0006) [2023-03-06 17:41:03,403][23882] Updated weights for policy 0, policy_version 46570 (0.0007) [2023-03-06 17:41:04,182][23882] Updated weights for policy 0, policy_version 46580 (0.0006) [2023-03-06 17:41:04,957][23882] Updated weights for policy 0, policy_version 46590 (0.0006) [2023-03-06 17:41:05,725][23882] Updated weights for policy 0, policy_version 46600 (0.0005) [2023-03-06 17:41:06,498][23882] Updated weights for policy 0, policy_version 46610 (0.0006) [2023-03-06 17:41:06,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 47731712. Throughput: 0: 13054.0. Samples: 47716571. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:41:06,748][23556] Avg episode reward: [(0, '202.213')] [2023-03-06 17:41:07,273][23882] Updated weights for policy 0, policy_version 46620 (0.0006) [2023-03-06 17:41:08,070][23882] Updated weights for policy 0, policy_version 46630 (0.0005) [2023-03-06 17:41:08,849][23882] Updated weights for policy 0, policy_version 46640 (0.0007) [2023-03-06 17:41:09,634][23882] Updated weights for policy 0, policy_version 46650 (0.0006) [2023-03-06 17:41:10,413][23882] Updated weights for policy 0, policy_version 46660 (0.0007) [2023-03-06 17:41:11,182][23882] Updated weights for policy 0, policy_version 46670 (0.0006) [2023-03-06 17:41:11,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 47796224. Throughput: 0: 13063.9. Samples: 47795305. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:41:11,748][23556] Avg episode reward: [(0, '329.946')] [2023-03-06 17:41:11,974][23882] Updated weights for policy 0, policy_version 46680 (0.0007) [2023-03-06 17:41:12,772][23882] Updated weights for policy 0, policy_version 46690 (0.0008) [2023-03-06 17:41:13,555][23882] Updated weights for policy 0, policy_version 46700 (0.0007) [2023-03-06 17:41:14,346][23882] Updated weights for policy 0, policy_version 46710 (0.0006) [2023-03-06 17:41:15,137][23882] Updated weights for policy 0, policy_version 46720 (0.0007) [2023-03-06 17:41:15,924][23882] Updated weights for policy 0, policy_version 46730 (0.0007) [2023-03-06 17:41:16,712][23882] Updated weights for policy 0, policy_version 46740 (0.0006) [2023-03-06 17:41:16,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 47861760. Throughput: 0: 13056.0. Samples: 47834094. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 17:41:16,748][23556] Avg episode reward: [(0, '269.541')] [2023-03-06 17:41:17,495][23882] Updated weights for policy 0, policy_version 46750 (0.0006) [2023-03-06 17:41:18,267][23882] Updated weights for policy 0, policy_version 46760 (0.0006) [2023-03-06 17:41:19,064][23882] Updated weights for policy 0, policy_version 46770 (0.0006) [2023-03-06 17:41:19,828][23882] Updated weights for policy 0, policy_version 46780 (0.0009) [2023-03-06 17:41:20,617][23882] Updated weights for policy 0, policy_version 46790 (0.0006) [2023-03-06 17:41:21,411][23882] Updated weights for policy 0, policy_version 46800 (0.0006) [2023-03-06 17:41:21,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 47927296. Throughput: 0: 13052.7. Samples: 47912411. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 17:41:21,748][23556] Avg episode reward: [(0, '330.052')] [2023-03-06 17:41:22,189][23882] Updated weights for policy 0, policy_version 46810 (0.0006) [2023-03-06 17:41:22,969][23882] Updated weights for policy 0, policy_version 46820 (0.0005) [2023-03-06 17:41:23,778][23882] Updated weights for policy 0, policy_version 46830 (0.0006) [2023-03-06 17:41:24,554][23882] Updated weights for policy 0, policy_version 46840 (0.0006) [2023-03-06 17:41:25,334][23882] Updated weights for policy 0, policy_version 46850 (0.0005) [2023-03-06 17:41:26,123][23882] Updated weights for policy 0, policy_version 46860 (0.0006) [2023-03-06 17:41:26,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 47991808. Throughput: 0: 13044.4. Samples: 47990649. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 17:41:26,748][23556] Avg episode reward: [(0, '387.543')] [2023-03-06 17:41:26,927][23882] Updated weights for policy 0, policy_version 46870 (0.0006) [2023-03-06 17:41:27,703][23882] Updated weights for policy 0, policy_version 46880 (0.0007) [2023-03-06 17:41:28,489][23882] Updated weights for policy 0, policy_version 46890 (0.0005) [2023-03-06 17:41:29,290][23882] Updated weights for policy 0, policy_version 46900 (0.0006) [2023-03-06 17:41:30,067][23882] Updated weights for policy 0, policy_version 46910 (0.0007) [2023-03-06 17:41:30,858][23882] Updated weights for policy 0, policy_version 46920 (0.0006) [2023-03-06 17:41:31,397][23831] KL-divergence is very high: 208.8683 [2023-03-06 17:41:31,657][23882] Updated weights for policy 0, policy_version 46930 (0.0006) [2023-03-06 17:41:31,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13041.2). Total num frames: 48057344. Throughput: 0: 13044.3. Samples: 48029729. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 17:41:31,748][23556] Avg episode reward: [(0, '320.070')] [2023-03-06 17:41:32,433][23882] Updated weights for policy 0, policy_version 46940 (0.0006) [2023-03-06 17:41:33,221][23882] Updated weights for policy 0, policy_version 46950 (0.0007) [2023-03-06 17:41:34,028][23882] Updated weights for policy 0, policy_version 46960 (0.0008) [2023-03-06 17:41:34,801][23882] Updated weights for policy 0, policy_version 46970 (0.0006) [2023-03-06 17:41:35,582][23882] Updated weights for policy 0, policy_version 46980 (0.0007) [2023-03-06 17:41:36,377][23882] Updated weights for policy 0, policy_version 46990 (0.0006) [2023-03-06 17:41:36,748][23556] Fps is (10 sec: 13107.0, 60 sec: 13056.0, 300 sec: 13041.2). Total num frames: 48122880. Throughput: 0: 13046.1. Samples: 48107635. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 17:41:36,748][23556] Avg episode reward: [(0, '340.875')] [2023-03-06 17:41:37,151][23882] Updated weights for policy 0, policy_version 47000 (0.0006) [2023-03-06 17:41:37,917][23882] Updated weights for policy 0, policy_version 47010 (0.0006) [2023-03-06 17:41:38,719][23882] Updated weights for policy 0, policy_version 47020 (0.0006) [2023-03-06 17:41:39,508][23882] Updated weights for policy 0, policy_version 47030 (0.0007) [2023-03-06 17:41:40,282][23882] Updated weights for policy 0, policy_version 47040 (0.0006) [2023-03-06 17:41:41,069][23882] Updated weights for policy 0, policy_version 47050 (0.0006) [2023-03-06 17:41:41,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 48187392. Throughput: 0: 13049.2. Samples: 48185974. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 17:41:41,748][23556] Avg episode reward: [(0, '379.762')] [2023-03-06 17:41:41,860][23882] Updated weights for policy 0, policy_version 47060 (0.0005) [2023-03-06 17:41:42,630][23882] Updated weights for policy 0, policy_version 47070 (0.0007) [2023-03-06 17:41:43,405][23882] Updated weights for policy 0, policy_version 47080 (0.0006) [2023-03-06 17:41:44,212][23882] Updated weights for policy 0, policy_version 47090 (0.0007) [2023-03-06 17:41:44,983][23882] Updated weights for policy 0, policy_version 47100 (0.0006) [2023-03-06 17:41:45,773][23882] Updated weights for policy 0, policy_version 47110 (0.0006) [2023-03-06 17:41:46,563][23882] Updated weights for policy 0, policy_version 47120 (0.0007) [2023-03-06 17:41:46,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13039.0, 300 sec: 13041.2). Total num frames: 48252928. Throughput: 0: 13047.4. Samples: 48225221. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:41:46,748][23556] Avg episode reward: [(0, '397.559')] [2023-03-06 17:41:47,334][23882] Updated weights for policy 0, policy_version 47130 (0.0007) [2023-03-06 17:41:48,117][23882] Updated weights for policy 0, policy_version 47140 (0.0007) [2023-03-06 17:41:48,908][23882] Updated weights for policy 0, policy_version 47150 (0.0007) [2023-03-06 17:41:49,693][23882] Updated weights for policy 0, policy_version 47160 (0.0006) [2023-03-06 17:41:50,479][23882] Updated weights for policy 0, policy_version 47170 (0.0006) [2023-03-06 17:41:51,284][23882] Updated weights for policy 0, policy_version 47180 (0.0005) [2023-03-06 17:41:51,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 48317440. Throughput: 0: 13043.4. Samples: 48303526. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:41:51,748][23556] Avg episode reward: [(0, '467.210')] [2023-03-06 17:41:52,072][23882] Updated weights for policy 0, policy_version 47190 (0.0007) [2023-03-06 17:41:52,853][23882] Updated weights for policy 0, policy_version 47200 (0.0006) [2023-03-06 17:41:53,643][23882] Updated weights for policy 0, policy_version 47210 (0.0006) [2023-03-06 17:41:54,428][23882] Updated weights for policy 0, policy_version 47220 (0.0006) [2023-03-06 17:41:55,217][23882] Updated weights for policy 0, policy_version 47230 (0.0006) [2023-03-06 17:41:56,030][23882] Updated weights for policy 0, policy_version 47240 (0.0007) [2023-03-06 17:41:56,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13041.2). Total num frames: 48382976. Throughput: 0: 13019.1. Samples: 48381164. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:41:56,748][23556] Avg episode reward: [(0, '459.739')] [2023-03-06 17:41:56,752][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000047249_48382976.pth... [2023-03-06 17:41:56,784][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000044192_45252608.pth [2023-03-06 17:41:56,808][23882] Updated weights for policy 0, policy_version 47250 (0.0006) [2023-03-06 17:41:57,573][23882] Updated weights for policy 0, policy_version 47260 (0.0006) [2023-03-06 17:41:58,372][23882] Updated weights for policy 0, policy_version 47270 (0.0007) [2023-03-06 17:41:59,147][23882] Updated weights for policy 0, policy_version 47280 (0.0006) [2023-03-06 17:41:59,946][23882] Updated weights for policy 0, policy_version 47290 (0.0006) [2023-03-06 17:42:00,737][23882] Updated weights for policy 0, policy_version 47300 (0.0006) [2023-03-06 17:42:01,523][23882] Updated weights for policy 0, policy_version 47310 (0.0007) [2023-03-06 17:42:01,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13041.2). Total num frames: 48447488. Throughput: 0: 13025.4. Samples: 48420238. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:42:01,748][23556] Avg episode reward: [(0, '328.763')] [2023-03-06 17:42:02,317][23882] Updated weights for policy 0, policy_version 47320 (0.0006) [2023-03-06 17:42:03,129][23882] Updated weights for policy 0, policy_version 47330 (0.0006) [2023-03-06 17:42:03,905][23882] Updated weights for policy 0, policy_version 47340 (0.0007) [2023-03-06 17:42:04,672][23882] Updated weights for policy 0, policy_version 47350 (0.0007) [2023-03-06 17:42:05,461][23882] Updated weights for policy 0, policy_version 47360 (0.0007) [2023-03-06 17:42:06,247][23882] Updated weights for policy 0, policy_version 47370 (0.0006) [2023-03-06 17:42:06,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13041.3). Total num frames: 48513024. Throughput: 0: 13021.9. Samples: 48498395. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:42:06,748][23556] Avg episode reward: [(0, '291.584')] [2023-03-06 17:42:07,013][23882] Updated weights for policy 0, policy_version 47380 (0.0007) [2023-03-06 17:42:07,825][23882] Updated weights for policy 0, policy_version 47390 (0.0007) [2023-03-06 17:42:08,594][23882] Updated weights for policy 0, policy_version 47400 (0.0006) [2023-03-06 17:42:09,377][23882] Updated weights for policy 0, policy_version 47410 (0.0006) [2023-03-06 17:42:10,179][23882] Updated weights for policy 0, policy_version 47420 (0.0006) [2023-03-06 17:42:10,958][23882] Updated weights for policy 0, policy_version 47430 (0.0007) [2023-03-06 17:42:11,733][23882] Updated weights for policy 0, policy_version 47440 (0.0007) [2023-03-06 17:42:11,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13044.7). Total num frames: 48578560. Throughput: 0: 13024.3. Samples: 48576745. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:42:11,749][23556] Avg episode reward: [(0, '301.050')] [2023-03-06 17:42:12,514][23882] Updated weights for policy 0, policy_version 47450 (0.0007) [2023-03-06 17:42:13,301][23882] Updated weights for policy 0, policy_version 47460 (0.0006) [2023-03-06 17:42:14,082][23882] Updated weights for policy 0, policy_version 47470 (0.0006) [2023-03-06 17:42:14,862][23882] Updated weights for policy 0, policy_version 47480 (0.0006) [2023-03-06 17:42:15,654][23882] Updated weights for policy 0, policy_version 47490 (0.0006) [2023-03-06 17:42:16,418][23882] Updated weights for policy 0, policy_version 47500 (0.0006) [2023-03-06 17:42:16,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13039.0, 300 sec: 13044.7). Total num frames: 48644096. Throughput: 0: 13026.7. Samples: 48615931. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:42:16,748][23556] Avg episode reward: [(0, '556.249')] [2023-03-06 17:42:17,209][23882] Updated weights for policy 0, policy_version 47510 (0.0006) [2023-03-06 17:42:17,990][23882] Updated weights for policy 0, policy_version 47520 (0.0007) [2023-03-06 17:42:18,766][23882] Updated weights for policy 0, policy_version 47530 (0.0006) [2023-03-06 17:42:19,547][23882] Updated weights for policy 0, policy_version 47540 (0.0006) [2023-03-06 17:42:20,340][23882] Updated weights for policy 0, policy_version 47550 (0.0007) [2023-03-06 17:42:21,133][23882] Updated weights for policy 0, policy_version 47560 (0.0006) [2023-03-06 17:42:21,748][23556] Fps is (10 sec: 13005.0, 60 sec: 13021.9, 300 sec: 13041.3). Total num frames: 48708608. Throughput: 0: 13034.1. Samples: 48694168. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:42:21,748][23556] Avg episode reward: [(0, '641.177')] [2023-03-06 17:42:21,905][23882] Updated weights for policy 0, policy_version 47570 (0.0008) [2023-03-06 17:42:22,706][23882] Updated weights for policy 0, policy_version 47580 (0.0006) [2023-03-06 17:42:23,164][23831] KL-divergence is very high: 690.3075 [2023-03-06 17:42:23,478][23882] Updated weights for policy 0, policy_version 47590 (0.0006) [2023-03-06 17:42:24,283][23882] Updated weights for policy 0, policy_version 47600 (0.0006) [2023-03-06 17:42:25,066][23882] Updated weights for policy 0, policy_version 47610 (0.0006) [2023-03-06 17:42:25,843][23882] Updated weights for policy 0, policy_version 47620 (0.0007) [2023-03-06 17:42:26,624][23882] Updated weights for policy 0, policy_version 47630 (0.0007) [2023-03-06 17:42:26,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13041.2). Total num frames: 48774144. Throughput: 0: 13036.7. Samples: 48772628. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:42:26,748][23556] Avg episode reward: [(0, '624.331')] [2023-03-06 17:42:27,405][23882] Updated weights for policy 0, policy_version 47640 (0.0007) [2023-03-06 17:42:28,179][23882] Updated weights for policy 0, policy_version 47650 (0.0006) [2023-03-06 17:42:28,950][23882] Updated weights for policy 0, policy_version 47660 (0.0006) [2023-03-06 17:42:29,750][23882] Updated weights for policy 0, policy_version 47670 (0.0007) [2023-03-06 17:42:30,523][23882] Updated weights for policy 0, policy_version 47680 (0.0006) [2023-03-06 17:42:31,310][23882] Updated weights for policy 0, policy_version 47690 (0.0006) [2023-03-06 17:42:31,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13044.7). Total num frames: 48839680. Throughput: 0: 13042.1. Samples: 48812117. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:42:31,748][23556] Avg episode reward: [(0, '576.812')] [2023-03-06 17:42:32,078][23882] Updated weights for policy 0, policy_version 47700 (0.0006) [2023-03-06 17:42:32,854][23882] Updated weights for policy 0, policy_version 47710 (0.0007) [2023-03-06 17:42:33,639][23882] Updated weights for policy 0, policy_version 47720 (0.0006) [2023-03-06 17:42:34,426][23882] Updated weights for policy 0, policy_version 47730 (0.0006) [2023-03-06 17:42:35,201][23882] Updated weights for policy 0, policy_version 47740 (0.0006) [2023-03-06 17:42:35,981][23882] Updated weights for policy 0, policy_version 47750 (0.0006) [2023-03-06 17:42:36,748][23556] Fps is (10 sec: 13107.4, 60 sec: 13039.0, 300 sec: 13044.7). Total num frames: 48905216. Throughput: 0: 13048.9. Samples: 48890725. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:42:36,748][23556] Avg episode reward: [(0, '535.697')] [2023-03-06 17:42:36,760][23882] Updated weights for policy 0, policy_version 47760 (0.0006) [2023-03-06 17:42:37,538][23882] Updated weights for policy 0, policy_version 47770 (0.0006) [2023-03-06 17:42:38,324][23882] Updated weights for policy 0, policy_version 47780 (0.0006) [2023-03-06 17:42:39,123][23882] Updated weights for policy 0, policy_version 47790 (0.0007) [2023-03-06 17:42:39,943][23882] Updated weights for policy 0, policy_version 47800 (0.0007) [2023-03-06 17:42:40,714][23882] Updated weights for policy 0, policy_version 47810 (0.0006) [2023-03-06 17:42:41,510][23882] Updated weights for policy 0, policy_version 47820 (0.0006) [2023-03-06 17:42:41,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13044.7). Total num frames: 48970752. Throughput: 0: 13056.7. Samples: 48968714. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:42:41,748][23556] Avg episode reward: [(0, '692.597')] [2023-03-06 17:42:42,285][23882] Updated weights for policy 0, policy_version 47830 (0.0006) [2023-03-06 17:42:43,082][23882] Updated weights for policy 0, policy_version 47840 (0.0006) [2023-03-06 17:42:43,875][23882] Updated weights for policy 0, policy_version 47850 (0.0006) [2023-03-06 17:42:44,673][23882] Updated weights for policy 0, policy_version 47860 (0.0007) [2023-03-06 17:42:45,451][23882] Updated weights for policy 0, policy_version 47870 (0.0006) [2023-03-06 17:42:46,235][23882] Updated weights for policy 0, policy_version 47880 (0.0006) [2023-03-06 17:42:46,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13044.7). Total num frames: 49035264. Throughput: 0: 13055.4. Samples: 49007730. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:42:46,748][23556] Avg episode reward: [(0, '843.382')] [2023-03-06 17:42:47,024][23882] Updated weights for policy 0, policy_version 47890 (0.0006) [2023-03-06 17:42:47,816][23882] Updated weights for policy 0, policy_version 47900 (0.0007) [2023-03-06 17:42:48,610][23882] Updated weights for policy 0, policy_version 47910 (0.0007) [2023-03-06 17:42:49,386][23882] Updated weights for policy 0, policy_version 47920 (0.0006) [2023-03-06 17:42:50,177][23882] Updated weights for policy 0, policy_version 47930 (0.0006) [2023-03-06 17:42:50,946][23882] Updated weights for policy 0, policy_version 47940 (0.0006) [2023-03-06 17:42:51,728][23882] Updated weights for policy 0, policy_version 47950 (0.0005) [2023-03-06 17:42:51,748][23556] Fps is (10 sec: 13004.6, 60 sec: 13056.0, 300 sec: 13044.7). Total num frames: 49100800. Throughput: 0: 13052.9. Samples: 49085777. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:42:51,748][23556] Avg episode reward: [(0, '708.143')] [2023-03-06 17:42:52,519][23882] Updated weights for policy 0, policy_version 47960 (0.0006) [2023-03-06 17:42:53,291][23882] Updated weights for policy 0, policy_version 47970 (0.0006) [2023-03-06 17:42:54,065][23882] Updated weights for policy 0, policy_version 47980 (0.0006) [2023-03-06 17:42:54,853][23882] Updated weights for policy 0, policy_version 47990 (0.0007) [2023-03-06 17:42:55,629][23882] Updated weights for policy 0, policy_version 48000 (0.0007) [2023-03-06 17:42:56,424][23882] Updated weights for policy 0, policy_version 48010 (0.0006) [2023-03-06 17:42:56,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13044.7). Total num frames: 49166336. Throughput: 0: 13059.0. Samples: 49164400. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:42:56,748][23556] Avg episode reward: [(0, '722.213')] [2023-03-06 17:42:57,205][23882] Updated weights for policy 0, policy_version 48020 (0.0008) [2023-03-06 17:42:57,988][23882] Updated weights for policy 0, policy_version 48030 (0.0006) [2023-03-06 17:42:58,760][23882] Updated weights for policy 0, policy_version 48040 (0.0007) [2023-03-06 17:42:59,552][23882] Updated weights for policy 0, policy_version 48050 (0.0006) [2023-03-06 17:43:00,341][23882] Updated weights for policy 0, policy_version 48060 (0.0006) [2023-03-06 17:43:01,117][23882] Updated weights for policy 0, policy_version 48070 (0.0006) [2023-03-06 17:43:01,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13044.7). Total num frames: 49231872. Throughput: 0: 13059.6. Samples: 49203616. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:43:01,749][23556] Avg episode reward: [(0, '633.786')] [2023-03-06 17:43:01,913][23882] Updated weights for policy 0, policy_version 48080 (0.0006) [2023-03-06 17:43:02,680][23882] Updated weights for policy 0, policy_version 48090 (0.0006) [2023-03-06 17:43:03,449][23882] Updated weights for policy 0, policy_version 48100 (0.0006) [2023-03-06 17:43:04,257][23882] Updated weights for policy 0, policy_version 48110 (0.0006) [2023-03-06 17:43:05,037][23882] Updated weights for policy 0, policy_version 48120 (0.0006) [2023-03-06 17:43:05,823][23882] Updated weights for policy 0, policy_version 48130 (0.0007) [2023-03-06 17:43:06,609][23882] Updated weights for policy 0, policy_version 48140 (0.0007) [2023-03-06 17:43:06,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13044.7). Total num frames: 49296384. Throughput: 0: 13063.4. Samples: 49282022. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:43:06,748][23556] Avg episode reward: [(0, '633.731')] [2023-03-06 17:43:07,379][23882] Updated weights for policy 0, policy_version 48150 (0.0006) [2023-03-06 17:43:08,161][23882] Updated weights for policy 0, policy_version 48160 (0.0006) [2023-03-06 17:43:08,949][23882] Updated weights for policy 0, policy_version 48170 (0.0006) [2023-03-06 17:43:09,725][23882] Updated weights for policy 0, policy_version 48180 (0.0006) [2023-03-06 17:43:10,521][23882] Updated weights for policy 0, policy_version 48190 (0.0006) [2023-03-06 17:43:11,306][23882] Updated weights for policy 0, policy_version 48200 (0.0007) [2023-03-06 17:43:11,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13044.7). Total num frames: 49361920. Throughput: 0: 13064.8. Samples: 49360546. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:43:11,748][23556] Avg episode reward: [(0, '774.579')] [2023-03-06 17:43:12,079][23882] Updated weights for policy 0, policy_version 48210 (0.0008) [2023-03-06 17:43:12,869][23882] Updated weights for policy 0, policy_version 48220 (0.0007) [2023-03-06 17:43:13,662][23882] Updated weights for policy 0, policy_version 48230 (0.0007) [2023-03-06 17:43:14,438][23882] Updated weights for policy 0, policy_version 48240 (0.0006) [2023-03-06 17:43:15,227][23882] Updated weights for policy 0, policy_version 48250 (0.0006) [2023-03-06 17:43:16,009][23882] Updated weights for policy 0, policy_version 48260 (0.0006) [2023-03-06 17:43:16,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 49427456. Throughput: 0: 13054.3. Samples: 49399560. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:43:16,748][23556] Avg episode reward: [(0, '663.316')] [2023-03-06 17:43:16,786][23882] Updated weights for policy 0, policy_version 48270 (0.0006) [2023-03-06 17:43:17,566][23882] Updated weights for policy 0, policy_version 48280 (0.0007) [2023-03-06 17:43:18,337][23882] Updated weights for policy 0, policy_version 48290 (0.0007) [2023-03-06 17:43:19,146][23882] Updated weights for policy 0, policy_version 48300 (0.0006) [2023-03-06 17:43:19,926][23882] Updated weights for policy 0, policy_version 48310 (0.0006) [2023-03-06 17:43:20,700][23882] Updated weights for policy 0, policy_version 48320 (0.0007) [2023-03-06 17:43:21,482][23882] Updated weights for policy 0, policy_version 48330 (0.0007) [2023-03-06 17:43:21,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13048.2). Total num frames: 49492992. Throughput: 0: 13055.4. Samples: 49478217. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:43:21,759][23556] Avg episode reward: [(0, '799.227')] [2023-03-06 17:43:22,278][23882] Updated weights for policy 0, policy_version 48340 (0.0007) [2023-03-06 17:43:23,056][23882] Updated weights for policy 0, policy_version 48350 (0.0006) [2023-03-06 17:43:23,840][23882] Updated weights for policy 0, policy_version 48360 (0.0007) [2023-03-06 17:43:24,619][23882] Updated weights for policy 0, policy_version 48370 (0.0006) [2023-03-06 17:43:25,388][23882] Updated weights for policy 0, policy_version 48380 (0.0006) [2023-03-06 17:43:26,181][23882] Updated weights for policy 0, policy_version 48390 (0.0007) [2023-03-06 17:43:26,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13048.2). Total num frames: 49558528. Throughput: 0: 13065.6. Samples: 49556666. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:43:26,758][23556] Avg episode reward: [(0, '510.914')] [2023-03-06 17:43:26,973][23882] Updated weights for policy 0, policy_version 48400 (0.0006) [2023-03-06 17:43:27,762][23882] Updated weights for policy 0, policy_version 48410 (0.0007) [2023-03-06 17:43:28,559][23882] Updated weights for policy 0, policy_version 48420 (0.0006) [2023-03-06 17:43:29,337][23882] Updated weights for policy 0, policy_version 48430 (0.0006) [2023-03-06 17:43:30,137][23882] Updated weights for policy 0, policy_version 48440 (0.0006) [2023-03-06 17:43:30,917][23882] Updated weights for policy 0, policy_version 48450 (0.0006) [2023-03-06 17:43:31,705][23882] Updated weights for policy 0, policy_version 48460 (0.0006) [2023-03-06 17:43:31,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13044.7). Total num frames: 49623040. Throughput: 0: 13066.4. Samples: 49595716. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:43:31,759][23556] Avg episode reward: [(0, '692.326')] [2023-03-06 17:43:32,495][23882] Updated weights for policy 0, policy_version 48470 (0.0007) [2023-03-06 17:43:33,262][23882] Updated weights for policy 0, policy_version 48480 (0.0006) [2023-03-06 17:43:34,056][23882] Updated weights for policy 0, policy_version 48490 (0.0006) [2023-03-06 17:43:34,865][23882] Updated weights for policy 0, policy_version 48500 (0.0007) [2023-03-06 17:43:35,618][23882] Updated weights for policy 0, policy_version 48510 (0.0007) [2023-03-06 17:43:36,414][23882] Updated weights for policy 0, policy_version 48520 (0.0007) [2023-03-06 17:43:36,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13044.7). Total num frames: 49688576. Throughput: 0: 13069.0. Samples: 49673882. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:43:36,759][23556] Avg episode reward: [(0, '611.428')] [2023-03-06 17:43:37,190][23882] Updated weights for policy 0, policy_version 48530 (0.0006) [2023-03-06 17:43:37,983][23882] Updated weights for policy 0, policy_version 48540 (0.0006) [2023-03-06 17:43:38,764][23882] Updated weights for policy 0, policy_version 48550 (0.0007) [2023-03-06 17:43:39,546][23882] Updated weights for policy 0, policy_version 48560 (0.0006) [2023-03-06 17:43:40,339][23882] Updated weights for policy 0, policy_version 48570 (0.0006) [2023-03-06 17:43:41,121][23882] Updated weights for policy 0, policy_version 48580 (0.0006) [2023-03-06 17:43:41,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13044.7). Total num frames: 49754112. Throughput: 0: 13061.3. Samples: 49752161. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:43:41,754][23556] Avg episode reward: [(0, '698.654')] [2023-03-06 17:43:41,890][23882] Updated weights for policy 0, policy_version 48590 (0.0007) [2023-03-06 17:43:42,675][23882] Updated weights for policy 0, policy_version 48600 (0.0006) [2023-03-06 17:43:43,463][23882] Updated weights for policy 0, policy_version 48610 (0.0006) [2023-03-06 17:43:44,240][23882] Updated weights for policy 0, policy_version 48620 (0.0006) [2023-03-06 17:43:45,034][23882] Updated weights for policy 0, policy_version 48630 (0.0006) [2023-03-06 17:43:45,821][23882] Updated weights for policy 0, policy_version 48640 (0.0006) [2023-03-06 17:43:46,594][23882] Updated weights for policy 0, policy_version 48650 (0.0006) [2023-03-06 17:43:46,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13041.2). Total num frames: 49818624. Throughput: 0: 13060.3. Samples: 49791327. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:43:46,759][23556] Avg episode reward: [(0, '873.845')] [2023-03-06 17:43:47,376][23882] Updated weights for policy 0, policy_version 48660 (0.0006) [2023-03-06 17:43:48,166][23882] Updated weights for policy 0, policy_version 48670 (0.0007) [2023-03-06 17:43:48,960][23882] Updated weights for policy 0, policy_version 48680 (0.0006) [2023-03-06 17:43:49,741][23882] Updated weights for policy 0, policy_version 48690 (0.0007) [2023-03-06 17:43:50,545][23882] Updated weights for policy 0, policy_version 48700 (0.0007) [2023-03-06 17:43:51,321][23882] Updated weights for policy 0, policy_version 48710 (0.0006) [2023-03-06 17:43:51,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13044.7). Total num frames: 49884160. Throughput: 0: 13050.7. Samples: 49869304. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:43:51,759][23556] Avg episode reward: [(0, '864.879')] [2023-03-06 17:43:52,110][23882] Updated weights for policy 0, policy_version 48720 (0.0008) [2023-03-06 17:43:52,908][23882] Updated weights for policy 0, policy_version 48730 (0.0007) [2023-03-06 17:43:53,689][23882] Updated weights for policy 0, policy_version 48740 (0.0006) [2023-03-06 17:43:54,481][23882] Updated weights for policy 0, policy_version 48750 (0.0006) [2023-03-06 17:43:55,277][23882] Updated weights for policy 0, policy_version 48760 (0.0006) [2023-03-06 17:43:56,079][23882] Updated weights for policy 0, policy_version 48770 (0.0006) [2023-03-06 17:43:56,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13041.2). Total num frames: 49948672. Throughput: 0: 13035.2. Samples: 49947130. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:43:56,758][23556] Avg episode reward: [(0, '649.171')] [2023-03-06 17:43:56,775][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000048779_49949696.pth... [2023-03-06 17:43:56,806][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000045721_46818304.pth [2023-03-06 17:43:56,870][23882] Updated weights for policy 0, policy_version 48780 (0.0006) [2023-03-06 17:43:57,649][23882] Updated weights for policy 0, policy_version 48790 (0.0006) [2023-03-06 17:43:58,434][23882] Updated weights for policy 0, policy_version 48800 (0.0006) [2023-03-06 17:43:59,220][23882] Updated weights for policy 0, policy_version 48810 (0.0006) [2023-03-06 17:44:00,006][23882] Updated weights for policy 0, policy_version 48820 (0.0006) [2023-03-06 17:44:00,812][23882] Updated weights for policy 0, policy_version 48830 (0.0007) [2023-03-06 17:44:01,581][23882] Updated weights for policy 0, policy_version 48840 (0.0006) [2023-03-06 17:44:01,748][23556] Fps is (10 sec: 13005.0, 60 sec: 13039.0, 300 sec: 13041.3). Total num frames: 50014208. Throughput: 0: 13039.8. Samples: 49986352. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:44:01,748][23556] Avg episode reward: [(0, '776.786')] [2023-03-06 17:44:02,354][23882] Updated weights for policy 0, policy_version 48850 (0.0006) [2023-03-06 17:44:03,154][23882] Updated weights for policy 0, policy_version 48860 (0.0007) [2023-03-06 17:44:03,926][23882] Updated weights for policy 0, policy_version 48870 (0.0006) [2023-03-06 17:44:04,723][23882] Updated weights for policy 0, policy_version 48880 (0.0007) [2023-03-06 17:44:05,502][23882] Updated weights for policy 0, policy_version 48890 (0.0006) [2023-03-06 17:44:06,289][23882] Updated weights for policy 0, policy_version 48900 (0.0006) [2023-03-06 17:44:06,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 50078720. Throughput: 0: 13028.6. Samples: 50064506. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:44:06,748][23556] Avg episode reward: [(0, '724.770')] [2023-03-06 17:44:07,057][23882] Updated weights for policy 0, policy_version 48910 (0.0006) [2023-03-06 17:44:07,850][23882] Updated weights for policy 0, policy_version 48920 (0.0007) [2023-03-06 17:44:08,654][23882] Updated weights for policy 0, policy_version 48930 (0.0007) [2023-03-06 17:44:09,452][23882] Updated weights for policy 0, policy_version 48940 (0.0007) [2023-03-06 17:44:10,232][23882] Updated weights for policy 0, policy_version 48950 (0.0007) [2023-03-06 17:44:11,012][23882] Updated weights for policy 0, policy_version 48960 (0.0006) [2023-03-06 17:44:11,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 50144256. Throughput: 0: 13021.3. Samples: 50142623. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:44:11,748][23556] Avg episode reward: [(0, '863.377')] [2023-03-06 17:44:11,786][23882] Updated weights for policy 0, policy_version 48970 (0.0007) [2023-03-06 17:44:12,583][23882] Updated weights for policy 0, policy_version 48980 (0.0006) [2023-03-06 17:44:13,351][23882] Updated weights for policy 0, policy_version 48990 (0.0005) [2023-03-06 17:44:14,137][23882] Updated weights for policy 0, policy_version 49000 (0.0006) [2023-03-06 17:44:14,937][23882] Updated weights for policy 0, policy_version 49010 (0.0006) [2023-03-06 17:44:15,706][23882] Updated weights for policy 0, policy_version 49020 (0.0006) [2023-03-06 17:44:16,491][23882] Updated weights for policy 0, policy_version 49030 (0.0007) [2023-03-06 17:44:16,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13041.2). Total num frames: 50209792. Throughput: 0: 13026.3. Samples: 50181899. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:44:16,748][23556] Avg episode reward: [(0, '725.239')] [2023-03-06 17:44:17,282][23882] Updated weights for policy 0, policy_version 49040 (0.0006) [2023-03-06 17:44:18,055][23882] Updated weights for policy 0, policy_version 49050 (0.0007) [2023-03-06 17:44:18,830][23882] Updated weights for policy 0, policy_version 49060 (0.0006) [2023-03-06 17:44:19,621][23882] Updated weights for policy 0, policy_version 49070 (0.0007) [2023-03-06 17:44:20,417][23882] Updated weights for policy 0, policy_version 49080 (0.0006) [2023-03-06 17:44:21,201][23882] Updated weights for policy 0, policy_version 49090 (0.0006) [2023-03-06 17:44:21,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13044.7). Total num frames: 50275328. Throughput: 0: 13031.7. Samples: 50260310. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:44:21,748][23556] Avg episode reward: [(0, '654.567')] [2023-03-06 17:44:21,976][23882] Updated weights for policy 0, policy_version 49100 (0.0006) [2023-03-06 17:44:22,777][23882] Updated weights for policy 0, policy_version 49110 (0.0006) [2023-03-06 17:44:23,561][23882] Updated weights for policy 0, policy_version 49120 (0.0007) [2023-03-06 17:44:24,329][23882] Updated weights for policy 0, policy_version 49130 (0.0006) [2023-03-06 17:44:25,113][23882] Updated weights for policy 0, policy_version 49140 (0.0005) [2023-03-06 17:44:25,893][23882] Updated weights for policy 0, policy_version 49150 (0.0006) [2023-03-06 17:44:26,669][23882] Updated weights for policy 0, policy_version 49160 (0.0007) [2023-03-06 17:44:26,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13021.8, 300 sec: 13041.2). Total num frames: 50339840. Throughput: 0: 13036.5. Samples: 50338804. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:44:26,748][23556] Avg episode reward: [(0, '707.816')] [2023-03-06 17:44:27,462][23882] Updated weights for policy 0, policy_version 49170 (0.0007) [2023-03-06 17:44:28,240][23882] Updated weights for policy 0, policy_version 49180 (0.0006) [2023-03-06 17:44:29,029][23882] Updated weights for policy 0, policy_version 49190 (0.0006) [2023-03-06 17:44:29,817][23882] Updated weights for policy 0, policy_version 49200 (0.0006) [2023-03-06 17:44:30,606][23882] Updated weights for policy 0, policy_version 49210 (0.0006) [2023-03-06 17:44:31,394][23882] Updated weights for policy 0, policy_version 49220 (0.0006) [2023-03-06 17:44:31,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13044.7). Total num frames: 50405376. Throughput: 0: 13030.8. Samples: 50377714. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:44:31,748][23556] Avg episode reward: [(0, '666.785')] [2023-03-06 17:44:32,183][23882] Updated weights for policy 0, policy_version 49230 (0.0006) [2023-03-06 17:44:32,988][23882] Updated weights for policy 0, policy_version 49240 (0.0007) [2023-03-06 17:44:33,768][23882] Updated weights for policy 0, policy_version 49250 (0.0006) [2023-03-06 17:44:34,544][23882] Updated weights for policy 0, policy_version 49260 (0.0006) [2023-03-06 17:44:35,337][23882] Updated weights for policy 0, policy_version 49270 (0.0006) [2023-03-06 17:44:36,106][23882] Updated weights for policy 0, policy_version 49280 (0.0005) [2023-03-06 17:44:36,748][23556] Fps is (10 sec: 13107.4, 60 sec: 13038.9, 300 sec: 13044.7). Total num frames: 50470912. Throughput: 0: 13033.9. Samples: 50455829. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:44:36,748][23556] Avg episode reward: [(0, '368.148')] [2023-03-06 17:44:36,890][23882] Updated weights for policy 0, policy_version 49290 (0.0007) [2023-03-06 17:44:37,690][23882] Updated weights for policy 0, policy_version 49300 (0.0006) [2023-03-06 17:44:38,472][23882] Updated weights for policy 0, policy_version 49310 (0.0006) [2023-03-06 17:44:39,250][23882] Updated weights for policy 0, policy_version 49320 (0.0006) [2023-03-06 17:44:40,067][23882] Updated weights for policy 0, policy_version 49330 (0.0006) [2023-03-06 17:44:40,847][23882] Updated weights for policy 0, policy_version 49340 (0.0006) [2023-03-06 17:44:41,626][23882] Updated weights for policy 0, policy_version 49350 (0.0005) [2023-03-06 17:44:41,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13041.2). Total num frames: 50535424. Throughput: 0: 13043.8. Samples: 50534102. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:44:41,748][23556] Avg episode reward: [(0, '327.278')] [2023-03-06 17:44:42,417][23882] Updated weights for policy 0, policy_version 49360 (0.0006) [2023-03-06 17:44:43,222][23882] Updated weights for policy 0, policy_version 49370 (0.0006) [2023-03-06 17:44:44,005][23882] Updated weights for policy 0, policy_version 49380 (0.0006) [2023-03-06 17:44:44,781][23882] Updated weights for policy 0, policy_version 49390 (0.0006) [2023-03-06 17:44:45,569][23882] Updated weights for policy 0, policy_version 49400 (0.0008) [2023-03-06 17:44:46,341][23882] Updated weights for policy 0, policy_version 49410 (0.0006) [2023-03-06 17:44:46,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13044.7). Total num frames: 50600960. Throughput: 0: 13034.4. Samples: 50572901. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:44:46,748][23556] Avg episode reward: [(0, '222.951')] [2023-03-06 17:44:47,116][23882] Updated weights for policy 0, policy_version 49420 (0.0006) [2023-03-06 17:44:47,905][23882] Updated weights for policy 0, policy_version 49430 (0.0006) [2023-03-06 17:44:48,682][23882] Updated weights for policy 0, policy_version 49440 (0.0007) [2023-03-06 17:44:49,477][23882] Updated weights for policy 0, policy_version 49450 (0.0006) [2023-03-06 17:44:50,253][23882] Updated weights for policy 0, policy_version 49460 (0.0006) [2023-03-06 17:44:51,050][23882] Updated weights for policy 0, policy_version 49470 (0.0006) [2023-03-06 17:44:51,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13041.3). Total num frames: 50665472. Throughput: 0: 13043.0. Samples: 50651442. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:44:51,748][23556] Avg episode reward: [(0, '234.159')] [2023-03-06 17:44:51,846][23882] Updated weights for policy 0, policy_version 49480 (0.0006) [2023-03-06 17:44:52,618][23882] Updated weights for policy 0, policy_version 49490 (0.0007) [2023-03-06 17:44:53,404][23882] Updated weights for policy 0, policy_version 49500 (0.0006) [2023-03-06 17:44:54,196][23882] Updated weights for policy 0, policy_version 49510 (0.0006) [2023-03-06 17:44:54,978][23882] Updated weights for policy 0, policy_version 49520 (0.0007) [2023-03-06 17:44:55,751][23882] Updated weights for policy 0, policy_version 49530 (0.0006) [2023-03-06 17:44:56,518][23882] Updated weights for policy 0, policy_version 49540 (0.0007) [2023-03-06 17:44:56,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13044.7). Total num frames: 50731008. Throughput: 0: 13046.1. Samples: 50729697. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:44:56,748][23556] Avg episode reward: [(0, '172.408')] [2023-03-06 17:44:57,314][23882] Updated weights for policy 0, policy_version 49550 (0.0006) [2023-03-06 17:44:58,125][23882] Updated weights for policy 0, policy_version 49560 (0.0006) [2023-03-06 17:44:58,894][23882] Updated weights for policy 0, policy_version 49570 (0.0006) [2023-03-06 17:44:59,690][23882] Updated weights for policy 0, policy_version 49580 (0.0006) [2023-03-06 17:45:00,478][23882] Updated weights for policy 0, policy_version 49590 (0.0006) [2023-03-06 17:45:01,274][23882] Updated weights for policy 0, policy_version 49600 (0.0006) [2023-03-06 17:45:01,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13044.7). Total num frames: 50796544. Throughput: 0: 13038.8. Samples: 50768646. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:45:01,748][23556] Avg episode reward: [(0, '303.927')] [2023-03-06 17:45:02,057][23882] Updated weights for policy 0, policy_version 49610 (0.0006) [2023-03-06 17:45:02,825][23882] Updated weights for policy 0, policy_version 49620 (0.0006) [2023-03-06 17:45:03,605][23882] Updated weights for policy 0, policy_version 49630 (0.0007) [2023-03-06 17:45:04,384][23882] Updated weights for policy 0, policy_version 49640 (0.0007) [2023-03-06 17:45:05,165][23882] Updated weights for policy 0, policy_version 49650 (0.0006) [2023-03-06 17:45:05,939][23882] Updated weights for policy 0, policy_version 49660 (0.0007) [2023-03-06 17:45:06,706][23882] Updated weights for policy 0, policy_version 49670 (0.0007) [2023-03-06 17:45:06,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13044.7). Total num frames: 50862080. Throughput: 0: 13044.6. Samples: 50847317. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:45:06,748][23556] Avg episode reward: [(0, '332.842')] [2023-03-06 17:45:07,485][23882] Updated weights for policy 0, policy_version 49680 (0.0007) [2023-03-06 17:45:08,281][23882] Updated weights for policy 0, policy_version 49690 (0.0007) [2023-03-06 17:45:09,053][23882] Updated weights for policy 0, policy_version 49700 (0.0007) [2023-03-06 17:45:09,838][23882] Updated weights for policy 0, policy_version 49710 (0.0006) [2023-03-06 17:45:10,624][23882] Updated weights for policy 0, policy_version 49720 (0.0006) [2023-03-06 17:45:11,418][23882] Updated weights for policy 0, policy_version 49730 (0.0007) [2023-03-06 17:45:11,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 50927616. Throughput: 0: 13042.5. Samples: 50925715. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:45:11,748][23556] Avg episode reward: [(0, '270.628')] [2023-03-06 17:45:12,209][23882] Updated weights for policy 0, policy_version 49740 (0.0006) [2023-03-06 17:45:12,984][23882] Updated weights for policy 0, policy_version 49750 (0.0006) [2023-03-06 17:45:13,770][23882] Updated weights for policy 0, policy_version 49760 (0.0006) [2023-03-06 17:45:14,561][23882] Updated weights for policy 0, policy_version 49770 (0.0007) [2023-03-06 17:45:15,353][23882] Updated weights for policy 0, policy_version 49780 (0.0007) [2023-03-06 17:45:16,125][23882] Updated weights for policy 0, policy_version 49790 (0.0006) [2023-03-06 17:45:16,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13044.7). Total num frames: 50992128. Throughput: 0: 13048.0. Samples: 50964877. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:45:16,748][23556] Avg episode reward: [(0, '142.778')] [2023-03-06 17:45:16,913][23882] Updated weights for policy 0, policy_version 49800 (0.0006) [2023-03-06 17:45:17,694][23882] Updated weights for policy 0, policy_version 49810 (0.0007) [2023-03-06 17:45:18,476][23882] Updated weights for policy 0, policy_version 49820 (0.0007) [2023-03-06 17:45:19,268][23882] Updated weights for policy 0, policy_version 49830 (0.0006) [2023-03-06 17:45:20,043][23882] Updated weights for policy 0, policy_version 49840 (0.0007) [2023-03-06 17:45:20,841][23882] Updated weights for policy 0, policy_version 49850 (0.0006) [2023-03-06 17:45:21,622][23882] Updated weights for policy 0, policy_version 49860 (0.0006) [2023-03-06 17:45:21,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13044.7). Total num frames: 51057664. Throughput: 0: 13051.8. Samples: 51043160. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:45:21,748][23556] Avg episode reward: [(0, '161.381')] [2023-03-06 17:45:22,405][23882] Updated weights for policy 0, policy_version 49870 (0.0006) [2023-03-06 17:45:23,200][23882] Updated weights for policy 0, policy_version 49880 (0.0006) [2023-03-06 17:45:23,992][23882] Updated weights for policy 0, policy_version 49890 (0.0007) [2023-03-06 17:45:24,773][23882] Updated weights for policy 0, policy_version 49900 (0.0007) [2023-03-06 17:45:25,572][23882] Updated weights for policy 0, policy_version 49910 (0.0006) [2023-03-06 17:45:26,347][23882] Updated weights for policy 0, policy_version 49920 (0.0007) [2023-03-06 17:45:26,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 51123200. Throughput: 0: 13051.4. Samples: 51121415. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:45:26,748][23556] Avg episode reward: [(0, '71.889')] [2023-03-06 17:45:27,118][23882] Updated weights for policy 0, policy_version 49930 (0.0006) [2023-03-06 17:45:27,919][23882] Updated weights for policy 0, policy_version 49940 (0.0006) [2023-03-06 17:45:28,685][23882] Updated weights for policy 0, policy_version 49950 (0.0007) [2023-03-06 17:45:29,493][23882] Updated weights for policy 0, policy_version 49960 (0.0007) [2023-03-06 17:45:30,274][23882] Updated weights for policy 0, policy_version 49970 (0.0006) [2023-03-06 17:45:31,062][23882] Updated weights for policy 0, policy_version 49980 (0.0006) [2023-03-06 17:45:31,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13044.7). Total num frames: 51187712. Throughput: 0: 13055.8. Samples: 51160412. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:45:31,748][23556] Avg episode reward: [(0, '158.163')] [2023-03-06 17:45:31,827][23882] Updated weights for policy 0, policy_version 49990 (0.0006) [2023-03-06 17:45:32,624][23882] Updated weights for policy 0, policy_version 50000 (0.0006) [2023-03-06 17:45:33,395][23882] Updated weights for policy 0, policy_version 50010 (0.0006) [2023-03-06 17:45:34,186][23882] Updated weights for policy 0, policy_version 50020 (0.0007) [2023-03-06 17:45:34,987][23882] Updated weights for policy 0, policy_version 50030 (0.0006) [2023-03-06 17:45:35,768][23882] Updated weights for policy 0, policy_version 50040 (0.0006) [2023-03-06 17:45:36,548][23882] Updated weights for policy 0, policy_version 50050 (0.0006) [2023-03-06 17:45:36,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13044.7). Total num frames: 51253248. Throughput: 0: 13053.8. Samples: 51238862. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:45:36,748][23556] Avg episode reward: [(0, '168.401')] [2023-03-06 17:45:37,321][23882] Updated weights for policy 0, policy_version 50060 (0.0006) [2023-03-06 17:45:38,107][23882] Updated weights for policy 0, policy_version 50070 (0.0007) [2023-03-06 17:45:38,893][23882] Updated weights for policy 0, policy_version 50080 (0.0006) [2023-03-06 17:45:39,664][23882] Updated weights for policy 0, policy_version 50090 (0.0005) [2023-03-06 17:45:40,439][23882] Updated weights for policy 0, policy_version 50100 (0.0006) [2023-03-06 17:45:41,215][23882] Updated weights for policy 0, policy_version 50110 (0.0006) [2023-03-06 17:45:41,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13044.7). Total num frames: 51318784. Throughput: 0: 13063.9. Samples: 51317573. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:45:41,748][23556] Avg episode reward: [(0, '192.569')] [2023-03-06 17:45:42,001][23882] Updated weights for policy 0, policy_version 50120 (0.0006) [2023-03-06 17:45:42,770][23882] Updated weights for policy 0, policy_version 50130 (0.0006) [2023-03-06 17:45:43,546][23882] Updated weights for policy 0, policy_version 50140 (0.0006) [2023-03-06 17:45:44,338][23882] Updated weights for policy 0, policy_version 50150 (0.0007) [2023-03-06 17:45:45,129][23882] Updated weights for policy 0, policy_version 50160 (0.0007) [2023-03-06 17:45:45,886][23882] Updated weights for policy 0, policy_version 50170 (0.0006) [2023-03-06 17:45:46,674][23882] Updated weights for policy 0, policy_version 50180 (0.0006) [2023-03-06 17:45:46,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 51384320. Throughput: 0: 13072.1. Samples: 51356889. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:45:46,748][23556] Avg episode reward: [(0, '395.874')] [2023-03-06 17:45:47,476][23882] Updated weights for policy 0, policy_version 50190 (0.0007) [2023-03-06 17:45:48,254][23882] Updated weights for policy 0, policy_version 50200 (0.0006) [2023-03-06 17:45:49,073][23882] Updated weights for policy 0, policy_version 50210 (0.0006) [2023-03-06 17:45:49,857][23882] Updated weights for policy 0, policy_version 50220 (0.0006) [2023-03-06 17:45:50,635][23882] Updated weights for policy 0, policy_version 50230 (0.0007) [2023-03-06 17:45:51,425][23882] Updated weights for policy 0, policy_version 50240 (0.0006) [2023-03-06 17:45:51,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13048.2). Total num frames: 51449856. Throughput: 0: 13059.7. Samples: 51435004. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:45:51,749][23556] Avg episode reward: [(0, '399.884')] [2023-03-06 17:45:52,222][23882] Updated weights for policy 0, policy_version 50250 (0.0006) [2023-03-06 17:45:52,999][23882] Updated weights for policy 0, policy_version 50260 (0.0006) [2023-03-06 17:45:53,782][23882] Updated weights for policy 0, policy_version 50270 (0.0006) [2023-03-06 17:45:54,566][23882] Updated weights for policy 0, policy_version 50280 (0.0006) [2023-03-06 17:45:55,349][23882] Updated weights for policy 0, policy_version 50290 (0.0006) [2023-03-06 17:45:56,125][23882] Updated weights for policy 0, policy_version 50300 (0.0006) [2023-03-06 17:45:56,748][23556] Fps is (10 sec: 13107.4, 60 sec: 13073.1, 300 sec: 13051.7). Total num frames: 51515392. Throughput: 0: 13057.7. Samples: 51513309. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:45:56,748][23556] Avg episode reward: [(0, '337.654')] [2023-03-06 17:45:56,752][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000050308_51515392.pth... [2023-03-06 17:45:56,782][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000047249_48382976.pth [2023-03-06 17:45:56,897][23882] Updated weights for policy 0, policy_version 50310 (0.0006) [2023-03-06 17:45:57,712][23882] Updated weights for policy 0, policy_version 50320 (0.0007) [2023-03-06 17:45:58,480][23882] Updated weights for policy 0, policy_version 50330 (0.0006) [2023-03-06 17:45:59,271][23882] Updated weights for policy 0, policy_version 50340 (0.0006) [2023-03-06 17:46:00,049][23882] Updated weights for policy 0, policy_version 50350 (0.0007) [2023-03-06 17:46:00,834][23882] Updated weights for policy 0, policy_version 50360 (0.0006) [2023-03-06 17:46:01,608][23882] Updated weights for policy 0, policy_version 50370 (0.0006) [2023-03-06 17:46:01,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13044.7). Total num frames: 51579904. Throughput: 0: 13055.9. Samples: 51552392. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:46:01,748][23556] Avg episode reward: [(0, '400.551')] [2023-03-06 17:46:02,408][23882] Updated weights for policy 0, policy_version 50380 (0.0006) [2023-03-06 17:46:03,195][23882] Updated weights for policy 0, policy_version 50390 (0.0008) [2023-03-06 17:46:03,961][23882] Updated weights for policy 0, policy_version 50400 (0.0006) [2023-03-06 17:46:04,765][23882] Updated weights for policy 0, policy_version 50410 (0.0006) [2023-03-06 17:46:05,530][23882] Updated weights for policy 0, policy_version 50420 (0.0006) [2023-03-06 17:46:06,316][23882] Updated weights for policy 0, policy_version 50430 (0.0007) [2023-03-06 17:46:06,748][23556] Fps is (10 sec: 13004.6, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 51645440. Throughput: 0: 13059.1. Samples: 51630823. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:46:06,748][23556] Avg episode reward: [(0, '409.722')] [2023-03-06 17:46:07,089][23882] Updated weights for policy 0, policy_version 50440 (0.0006) [2023-03-06 17:46:07,870][23882] Updated weights for policy 0, policy_version 50450 (0.0007) [2023-03-06 17:46:08,667][23882] Updated weights for policy 0, policy_version 50460 (0.0006) [2023-03-06 17:46:09,438][23882] Updated weights for policy 0, policy_version 50470 (0.0006) [2023-03-06 17:46:10,217][23882] Updated weights for policy 0, policy_version 50480 (0.0006) [2023-03-06 17:46:10,997][23882] Updated weights for policy 0, policy_version 50490 (0.0006) [2023-03-06 17:46:11,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 51710976. Throughput: 0: 13070.8. Samples: 51709600. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:46:11,748][23556] Avg episode reward: [(0, '667.220')] [2023-03-06 17:46:11,771][23882] Updated weights for policy 0, policy_version 50500 (0.0006) [2023-03-06 17:46:12,551][23882] Updated weights for policy 0, policy_version 50510 (0.0006) [2023-03-06 17:46:13,335][23882] Updated weights for policy 0, policy_version 50520 (0.0006) [2023-03-06 17:46:14,121][23882] Updated weights for policy 0, policy_version 50530 (0.0007) [2023-03-06 17:46:14,895][23882] Updated weights for policy 0, policy_version 50540 (0.0006) [2023-03-06 17:46:15,671][23882] Updated weights for policy 0, policy_version 50550 (0.0006) [2023-03-06 17:46:16,457][23882] Updated weights for policy 0, policy_version 50560 (0.0006) [2023-03-06 17:46:16,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13048.2). Total num frames: 51776512. Throughput: 0: 13075.3. Samples: 51748801. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:46:16,748][23556] Avg episode reward: [(0, '580.150')] [2023-03-06 17:46:17,230][23882] Updated weights for policy 0, policy_version 50570 (0.0007) [2023-03-06 17:46:18,021][23882] Updated weights for policy 0, policy_version 50580 (0.0005) [2023-03-06 17:46:18,799][23882] Updated weights for policy 0, policy_version 50590 (0.0006) [2023-03-06 17:46:19,577][23882] Updated weights for policy 0, policy_version 50600 (0.0007) [2023-03-06 17:46:20,369][23882] Updated weights for policy 0, policy_version 50610 (0.0007) [2023-03-06 17:46:21,171][23882] Updated weights for policy 0, policy_version 50620 (0.0006) [2023-03-06 17:46:21,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13051.7). Total num frames: 51842048. Throughput: 0: 13080.9. Samples: 51827500. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:46:21,748][23556] Avg episode reward: [(0, '689.292')] [2023-03-06 17:46:21,957][23882] Updated weights for policy 0, policy_version 50630 (0.0006) [2023-03-06 17:46:22,711][23882] Updated weights for policy 0, policy_version 50640 (0.0006) [2023-03-06 17:46:23,533][23882] Updated weights for policy 0, policy_version 50650 (0.0006) [2023-03-06 17:46:24,314][23882] Updated weights for policy 0, policy_version 50660 (0.0007) [2023-03-06 17:46:25,098][23882] Updated weights for policy 0, policy_version 50670 (0.0006) [2023-03-06 17:46:25,885][23882] Updated weights for policy 0, policy_version 50680 (0.0006) [2023-03-06 17:46:26,658][23882] Updated weights for policy 0, policy_version 50690 (0.0006) [2023-03-06 17:46:26,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13051.7). Total num frames: 51907584. Throughput: 0: 13067.4. Samples: 51905605. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:46:26,748][23556] Avg episode reward: [(0, '745.037')] [2023-03-06 17:46:27,448][23882] Updated weights for policy 0, policy_version 50700 (0.0006) [2023-03-06 17:46:28,250][23882] Updated weights for policy 0, policy_version 50710 (0.0006) [2023-03-06 17:46:29,006][23882] Updated weights for policy 0, policy_version 50720 (0.0006) [2023-03-06 17:46:29,791][23882] Updated weights for policy 0, policy_version 50730 (0.0006) [2023-03-06 17:46:30,578][23882] Updated weights for policy 0, policy_version 50740 (0.0007) [2023-03-06 17:46:31,359][23882] Updated weights for policy 0, policy_version 50750 (0.0006) [2023-03-06 17:46:31,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13073.1, 300 sec: 13048.2). Total num frames: 51972096. Throughput: 0: 13064.2. Samples: 51944775. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:46:31,748][23556] Avg episode reward: [(0, '768.353')] [2023-03-06 17:46:32,161][23882] Updated weights for policy 0, policy_version 50760 (0.0006) [2023-03-06 17:46:32,958][23882] Updated weights for policy 0, policy_version 50770 (0.0006) [2023-03-06 17:46:33,726][23882] Updated weights for policy 0, policy_version 50780 (0.0006) [2023-03-06 17:46:34,520][23882] Updated weights for policy 0, policy_version 50790 (0.0006) [2023-03-06 17:46:35,304][23882] Updated weights for policy 0, policy_version 50800 (0.0007) [2023-03-06 17:46:36,104][23882] Updated weights for policy 0, policy_version 50810 (0.0006) [2023-03-06 17:46:36,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13073.1, 300 sec: 13051.7). Total num frames: 52037632. Throughput: 0: 13062.3. Samples: 52022807. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:46:36,748][23556] Avg episode reward: [(0, '803.330')] [2023-03-06 17:46:36,903][23882] Updated weights for policy 0, policy_version 50820 (0.0007) [2023-03-06 17:46:37,662][23882] Updated weights for policy 0, policy_version 50830 (0.0006) [2023-03-06 17:46:38,447][23882] Updated weights for policy 0, policy_version 50840 (0.0007) [2023-03-06 17:46:39,240][23882] Updated weights for policy 0, policy_version 50850 (0.0007) [2023-03-06 17:46:40,021][23882] Updated weights for policy 0, policy_version 50860 (0.0006) [2023-03-06 17:46:40,834][23882] Updated weights for policy 0, policy_version 50870 (0.0005) [2023-03-06 17:46:41,624][23882] Updated weights for policy 0, policy_version 50880 (0.0006) [2023-03-06 17:46:41,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 52102144. Throughput: 0: 13053.5. Samples: 52100719. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:46:41,748][23556] Avg episode reward: [(0, '745.527')] [2023-03-06 17:46:42,398][23882] Updated weights for policy 0, policy_version 50890 (0.0006) [2023-03-06 17:46:43,187][23882] Updated weights for policy 0, policy_version 50900 (0.0007) [2023-03-06 17:46:43,976][23882] Updated weights for policy 0, policy_version 50910 (0.0006) [2023-03-06 17:46:44,755][23882] Updated weights for policy 0, policy_version 50920 (0.0006) [2023-03-06 17:46:45,547][23882] Updated weights for policy 0, policy_version 50930 (0.0007) [2023-03-06 17:46:46,339][23882] Updated weights for policy 0, policy_version 50940 (0.0006) [2023-03-06 17:46:46,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 52167680. Throughput: 0: 13054.5. Samples: 52139845. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:46:46,748][23556] Avg episode reward: [(0, '659.227')] [2023-03-06 17:46:47,127][23882] Updated weights for policy 0, policy_version 50950 (0.0007) [2023-03-06 17:46:47,910][23882] Updated weights for policy 0, policy_version 50960 (0.0007) [2023-03-06 17:46:48,703][23882] Updated weights for policy 0, policy_version 50970 (0.0007) [2023-03-06 17:46:49,493][23882] Updated weights for policy 0, policy_version 50980 (0.0007) [2023-03-06 17:46:50,276][23882] Updated weights for policy 0, policy_version 50990 (0.0007) [2023-03-06 17:46:51,085][23882] Updated weights for policy 0, policy_version 51000 (0.0006) [2023-03-06 17:46:51,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13048.2). Total num frames: 52232192. Throughput: 0: 13039.9. Samples: 52217617. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:46:51,748][23556] Avg episode reward: [(0, '610.429')] [2023-03-06 17:46:51,870][23882] Updated weights for policy 0, policy_version 51010 (0.0006) [2023-03-06 17:46:52,652][23882] Updated weights for policy 0, policy_version 51020 (0.0006) [2023-03-06 17:46:53,414][23882] Updated weights for policy 0, policy_version 51030 (0.0006) [2023-03-06 17:46:54,214][23882] Updated weights for policy 0, policy_version 51040 (0.0007) [2023-03-06 17:46:54,997][23882] Updated weights for policy 0, policy_version 51050 (0.0006) [2023-03-06 17:46:55,780][23882] Updated weights for policy 0, policy_version 51060 (0.0007) [2023-03-06 17:46:56,559][23882] Updated weights for policy 0, policy_version 51070 (0.0007) [2023-03-06 17:46:56,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 52297728. Throughput: 0: 13032.6. Samples: 52296064. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:46:56,748][23556] Avg episode reward: [(0, '785.129')] [2023-03-06 17:46:57,355][23882] Updated weights for policy 0, policy_version 51080 (0.0007) [2023-03-06 17:46:58,138][23882] Updated weights for policy 0, policy_version 51090 (0.0006) [2023-03-06 17:46:58,916][23882] Updated weights for policy 0, policy_version 51100 (0.0006) [2023-03-06 17:46:59,728][23882] Updated weights for policy 0, policy_version 51110 (0.0006) [2023-03-06 17:47:00,514][23882] Updated weights for policy 0, policy_version 51120 (0.0006) [2023-03-06 17:47:01,285][23882] Updated weights for policy 0, policy_version 51130 (0.0006) [2023-03-06 17:47:01,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13039.0, 300 sec: 13048.2). Total num frames: 52362240. Throughput: 0: 13024.2. Samples: 52334887. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:47:01,748][23556] Avg episode reward: [(0, '857.682')] [2023-03-06 17:47:02,083][23882] Updated weights for policy 0, policy_version 51140 (0.0006) [2023-03-06 17:47:02,874][23882] Updated weights for policy 0, policy_version 51150 (0.0006) [2023-03-06 17:47:03,646][23882] Updated weights for policy 0, policy_version 51160 (0.0006) [2023-03-06 17:47:04,439][23882] Updated weights for policy 0, policy_version 51170 (0.0007) [2023-03-06 17:47:05,228][23882] Updated weights for policy 0, policy_version 51180 (0.0006) [2023-03-06 17:47:06,008][23882] Updated weights for policy 0, policy_version 51190 (0.0006) [2023-03-06 17:47:06,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13039.0, 300 sec: 13048.2). Total num frames: 52427776. Throughput: 0: 13014.4. Samples: 52413149. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:47:06,748][23556] Avg episode reward: [(0, '816.076')] [2023-03-06 17:47:06,795][23882] Updated weights for policy 0, policy_version 51200 (0.0006) [2023-03-06 17:47:07,580][23882] Updated weights for policy 0, policy_version 51210 (0.0008) [2023-03-06 17:47:08,363][23882] Updated weights for policy 0, policy_version 51220 (0.0006) [2023-03-06 17:47:09,146][23882] Updated weights for policy 0, policy_version 51230 (0.0007) [2023-03-06 17:47:09,933][23882] Updated weights for policy 0, policy_version 51240 (0.0007) [2023-03-06 17:47:10,712][23882] Updated weights for policy 0, policy_version 51250 (0.0007) [2023-03-06 17:47:11,479][23882] Updated weights for policy 0, policy_version 51260 (0.0006) [2023-03-06 17:47:11,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13048.2). Total num frames: 52493312. Throughput: 0: 13024.9. Samples: 52491727. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:47:11,748][23556] Avg episode reward: [(0, '833.266')] [2023-03-06 17:47:12,265][23882] Updated weights for policy 0, policy_version 51270 (0.0006) [2023-03-06 17:47:13,041][23882] Updated weights for policy 0, policy_version 51280 (0.0006) [2023-03-06 17:47:13,828][23882] Updated weights for policy 0, policy_version 51290 (0.0006) [2023-03-06 17:47:14,624][23882] Updated weights for policy 0, policy_version 51300 (0.0006) [2023-03-06 17:47:15,414][23882] Updated weights for policy 0, policy_version 51310 (0.0006) [2023-03-06 17:47:16,195][23882] Updated weights for policy 0, policy_version 51320 (0.0007) [2023-03-06 17:47:16,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13039.0, 300 sec: 13051.7). Total num frames: 52558848. Throughput: 0: 13023.9. Samples: 52530849. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:47:16,748][23556] Avg episode reward: [(0, '939.032')] [2023-03-06 17:47:16,982][23882] Updated weights for policy 0, policy_version 51330 (0.0006) [2023-03-06 17:47:17,754][23882] Updated weights for policy 0, policy_version 51340 (0.0005) [2023-03-06 17:47:18,557][23882] Updated weights for policy 0, policy_version 51350 (0.0007) [2023-03-06 17:47:19,326][23882] Updated weights for policy 0, policy_version 51360 (0.0006) [2023-03-06 17:47:20,105][23882] Updated weights for policy 0, policy_version 51370 (0.0006) [2023-03-06 17:47:20,892][23882] Updated weights for policy 0, policy_version 51380 (0.0006) [2023-03-06 17:47:21,678][23882] Updated weights for policy 0, policy_version 51390 (0.0006) [2023-03-06 17:47:21,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13021.8, 300 sec: 13048.2). Total num frames: 52623360. Throughput: 0: 13032.1. Samples: 52609254. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:47:21,749][23556] Avg episode reward: [(0, '858.014')] [2023-03-06 17:47:22,490][23882] Updated weights for policy 0, policy_version 51400 (0.0006) [2023-03-06 17:47:23,258][23882] Updated weights for policy 0, policy_version 51410 (0.0006) [2023-03-06 17:47:24,039][23882] Updated weights for policy 0, policy_version 51420 (0.0007) [2023-03-06 17:47:24,262][23831] KL-divergence is very high: 103668.8125 [2023-03-06 17:47:24,838][23882] Updated weights for policy 0, policy_version 51430 (0.0006) [2023-03-06 17:47:25,609][23882] Updated weights for policy 0, policy_version 51440 (0.0006) [2023-03-06 17:47:26,390][23882] Updated weights for policy 0, policy_version 51450 (0.0006) [2023-03-06 17:47:26,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13048.2). Total num frames: 52688896. Throughput: 0: 13040.2. Samples: 52687527. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:47:26,754][23556] Avg episode reward: [(0, '482.102')] [2023-03-06 17:47:27,175][23882] Updated weights for policy 0, policy_version 51460 (0.0007) [2023-03-06 17:47:27,954][23882] Updated weights for policy 0, policy_version 51470 (0.0007) [2023-03-06 17:47:28,732][23882] Updated weights for policy 0, policy_version 51480 (0.0006) [2023-03-06 17:47:29,521][23882] Updated weights for policy 0, policy_version 51490 (0.0006) [2023-03-06 17:47:30,313][23882] Updated weights for policy 0, policy_version 51500 (0.0006) [2023-03-06 17:47:31,093][23882] Updated weights for policy 0, policy_version 51510 (0.0007) [2023-03-06 17:47:31,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13038.9, 300 sec: 13048.2). Total num frames: 52754432. Throughput: 0: 13038.4. Samples: 52726574. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:47:31,748][23556] Avg episode reward: [(0, '470.561')] [2023-03-06 17:47:31,899][23882] Updated weights for policy 0, policy_version 51520 (0.0006) [2023-03-06 17:47:32,681][23882] Updated weights for policy 0, policy_version 51530 (0.0006) [2023-03-06 17:47:33,465][23882] Updated weights for policy 0, policy_version 51540 (0.0006) [2023-03-06 17:47:34,243][23882] Updated weights for policy 0, policy_version 51550 (0.0005) [2023-03-06 17:47:35,017][23882] Updated weights for policy 0, policy_version 51560 (0.0006) [2023-03-06 17:47:35,811][23882] Updated weights for policy 0, policy_version 51570 (0.0006) [2023-03-06 17:47:36,596][23882] Updated weights for policy 0, policy_version 51580 (0.0006) [2023-03-06 17:47:36,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13044.7). Total num frames: 52818944. Throughput: 0: 13047.4. Samples: 52804752. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:47:36,748][23556] Avg episode reward: [(0, '321.276')] [2023-03-06 17:47:37,373][23882] Updated weights for policy 0, policy_version 51590 (0.0006) [2023-03-06 17:47:38,142][23882] Updated weights for policy 0, policy_version 51600 (0.0006) [2023-03-06 17:47:38,917][23882] Updated weights for policy 0, policy_version 51610 (0.0006) [2023-03-06 17:47:39,691][23882] Updated weights for policy 0, policy_version 51620 (0.0007) [2023-03-06 17:47:40,473][23882] Updated weights for policy 0, policy_version 51630 (0.0006) [2023-03-06 17:47:41,251][23882] Updated weights for policy 0, policy_version 51640 (0.0006) [2023-03-06 17:47:41,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 52885504. Throughput: 0: 13062.5. Samples: 52883876. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:47:41,748][23556] Avg episode reward: [(0, '248.508')] [2023-03-06 17:47:42,021][23882] Updated weights for policy 0, policy_version 51650 (0.0006) [2023-03-06 17:47:42,804][23882] Updated weights for policy 0, policy_version 51660 (0.0007) [2023-03-06 17:47:43,593][23882] Updated weights for policy 0, policy_version 51670 (0.0006) [2023-03-06 17:47:44,384][23882] Updated weights for policy 0, policy_version 51680 (0.0007) [2023-03-06 17:47:45,159][23882] Updated weights for policy 0, policy_version 51690 (0.0006) [2023-03-06 17:47:45,960][23882] Updated weights for policy 0, policy_version 51700 (0.0006) [2023-03-06 17:47:46,730][23882] Updated weights for policy 0, policy_version 51710 (0.0006) [2023-03-06 17:47:46,748][23556] Fps is (10 sec: 13209.5, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 52951040. Throughput: 0: 13072.0. Samples: 52923127. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:47:46,748][23556] Avg episode reward: [(0, '271.846')] [2023-03-06 17:47:47,519][23882] Updated weights for policy 0, policy_version 51720 (0.0007) [2023-03-06 17:47:48,301][23882] Updated weights for policy 0, policy_version 51730 (0.0007) [2023-03-06 17:47:49,094][23882] Updated weights for policy 0, policy_version 51740 (0.0007) [2023-03-06 17:47:49,867][23882] Updated weights for policy 0, policy_version 51750 (0.0007) [2023-03-06 17:47:50,641][23882] Updated weights for policy 0, policy_version 51760 (0.0006) [2023-03-06 17:47:51,434][23882] Updated weights for policy 0, policy_version 51770 (0.0007) [2023-03-06 17:47:51,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 53015552. Throughput: 0: 13073.1. Samples: 53001437. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:47:51,748][23556] Avg episode reward: [(0, '313.547')] [2023-03-06 17:47:52,224][23882] Updated weights for policy 0, policy_version 51780 (0.0006) [2023-03-06 17:47:53,008][23882] Updated weights for policy 0, policy_version 51790 (0.0006) [2023-03-06 17:47:53,791][23882] Updated weights for policy 0, policy_version 51800 (0.0006) [2023-03-06 17:47:54,573][23882] Updated weights for policy 0, policy_version 51810 (0.0007) [2023-03-06 17:47:55,364][23882] Updated weights for policy 0, policy_version 51820 (0.0007) [2023-03-06 17:47:56,160][23882] Updated weights for policy 0, policy_version 51830 (0.0006) [2023-03-06 17:47:56,748][23556] Fps is (10 sec: 13005.0, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 53081088. Throughput: 0: 13065.8. Samples: 53079689. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:47:56,748][23556] Avg episode reward: [(0, '281.621')] [2023-03-06 17:47:56,752][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000051837_53081088.pth... [2023-03-06 17:47:56,782][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000048779_49949696.pth [2023-03-06 17:47:56,945][23882] Updated weights for policy 0, policy_version 51840 (0.0007) [2023-03-06 17:47:57,727][23882] Updated weights for policy 0, policy_version 51850 (0.0006) [2023-03-06 17:47:58,495][23882] Updated weights for policy 0, policy_version 51860 (0.0006) [2023-03-06 17:47:59,269][23882] Updated weights for policy 0, policy_version 51870 (0.0006) [2023-03-06 17:48:00,045][23882] Updated weights for policy 0, policy_version 51880 (0.0006) [2023-03-06 17:48:00,825][23882] Updated weights for policy 0, policy_version 51890 (0.0006) [2023-03-06 17:48:01,608][23882] Updated weights for policy 0, policy_version 51900 (0.0006) [2023-03-06 17:48:01,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13051.7). Total num frames: 53146624. Throughput: 0: 13073.1. Samples: 53119139. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:48:01,759][23556] Avg episode reward: [(0, '301.079')] [2023-03-06 17:48:02,376][23882] Updated weights for policy 0, policy_version 51910 (0.0007) [2023-03-06 17:48:03,169][23882] Updated weights for policy 0, policy_version 51920 (0.0006) [2023-03-06 17:48:03,954][23882] Updated weights for policy 0, policy_version 51930 (0.0006) [2023-03-06 17:48:04,706][23882] Updated weights for policy 0, policy_version 51940 (0.0008) [2023-03-06 17:48:05,496][23882] Updated weights for policy 0, policy_version 51950 (0.0006) [2023-03-06 17:48:06,254][23882] Updated weights for policy 0, policy_version 51960 (0.0006) [2023-03-06 17:48:06,748][23556] Fps is (10 sec: 13209.4, 60 sec: 13090.1, 300 sec: 13055.1). Total num frames: 53213184. Throughput: 0: 13087.6. Samples: 53198196. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:48:06,759][23556] Avg episode reward: [(0, '325.604')] [2023-03-06 17:48:07,034][23882] Updated weights for policy 0, policy_version 51970 (0.0006) [2023-03-06 17:48:07,829][23882] Updated weights for policy 0, policy_version 51980 (0.0007) [2023-03-06 17:48:08,601][23882] Updated weights for policy 0, policy_version 51990 (0.0006) [2023-03-06 17:48:09,384][23882] Updated weights for policy 0, policy_version 52000 (0.0007) [2023-03-06 17:48:10,177][23882] Updated weights for policy 0, policy_version 52010 (0.0006) [2023-03-06 17:48:10,962][23882] Updated weights for policy 0, policy_version 52020 (0.0007) [2023-03-06 17:48:11,739][23882] Updated weights for policy 0, policy_version 52030 (0.0007) [2023-03-06 17:48:11,748][23556] Fps is (10 sec: 13209.6, 60 sec: 13090.1, 300 sec: 13055.1). Total num frames: 53278720. Throughput: 0: 13093.8. Samples: 53276748. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:48:11,758][23556] Avg episode reward: [(0, '221.714')] [2023-03-06 17:48:12,509][23882] Updated weights for policy 0, policy_version 52040 (0.0006) [2023-03-06 17:48:13,298][23882] Updated weights for policy 0, policy_version 52050 (0.0006) [2023-03-06 17:48:14,066][23882] Updated weights for policy 0, policy_version 52060 (0.0006) [2023-03-06 17:48:14,843][23882] Updated weights for policy 0, policy_version 52070 (0.0006) [2023-03-06 17:48:15,625][23882] Updated weights for policy 0, policy_version 52080 (0.0006) [2023-03-06 17:48:16,398][23882] Updated weights for policy 0, policy_version 52090 (0.0006) [2023-03-06 17:48:16,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13090.1, 300 sec: 13055.1). Total num frames: 53344256. Throughput: 0: 13102.9. Samples: 53316204. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:48:16,759][23556] Avg episode reward: [(0, '365.551')] [2023-03-06 17:48:17,193][23882] Updated weights for policy 0, policy_version 52100 (0.0007) [2023-03-06 17:48:17,973][23882] Updated weights for policy 0, policy_version 52110 (0.0006) [2023-03-06 17:48:18,762][23882] Updated weights for policy 0, policy_version 52120 (0.0006) [2023-03-06 17:48:19,550][23882] Updated weights for policy 0, policy_version 52130 (0.0006) [2023-03-06 17:48:20,337][23882] Updated weights for policy 0, policy_version 52140 (0.0006) [2023-03-06 17:48:21,132][23882] Updated weights for policy 0, policy_version 52150 (0.0007) [2023-03-06 17:48:21,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13090.2, 300 sec: 13051.7). Total num frames: 53408768. Throughput: 0: 13109.8. Samples: 53394694. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:48:21,759][23556] Avg episode reward: [(0, '380.803')] [2023-03-06 17:48:21,907][23882] Updated weights for policy 0, policy_version 52160 (0.0006) [2023-03-06 17:48:22,683][23882] Updated weights for policy 0, policy_version 52170 (0.0006) [2023-03-06 17:48:23,474][23882] Updated weights for policy 0, policy_version 52180 (0.0006) [2023-03-06 17:48:24,254][23882] Updated weights for policy 0, policy_version 52190 (0.0007) [2023-03-06 17:48:25,037][23882] Updated weights for policy 0, policy_version 52200 (0.0007) [2023-03-06 17:48:25,830][23882] Updated weights for policy 0, policy_version 52210 (0.0006) [2023-03-06 17:48:26,620][23882] Updated weights for policy 0, policy_version 52220 (0.0008) [2023-03-06 17:48:26,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13090.1, 300 sec: 13055.1). Total num frames: 53474304. Throughput: 0: 13090.7. Samples: 53472959. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:48:26,754][23556] Avg episode reward: [(0, '428.067')] [2023-03-06 17:48:27,406][23882] Updated weights for policy 0, policy_version 52230 (0.0006) [2023-03-06 17:48:28,197][23882] Updated weights for policy 0, policy_version 52240 (0.0006) [2023-03-06 17:48:28,982][23882] Updated weights for policy 0, policy_version 52250 (0.0007) [2023-03-06 17:48:29,774][23882] Updated weights for policy 0, policy_version 52260 (0.0007) [2023-03-06 17:48:30,554][23882] Updated weights for policy 0, policy_version 52270 (0.0006) [2023-03-06 17:48:31,342][23882] Updated weights for policy 0, policy_version 52280 (0.0006) [2023-03-06 17:48:31,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13090.1, 300 sec: 13055.1). Total num frames: 53539840. Throughput: 0: 13082.1. Samples: 53511821. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:48:31,759][23556] Avg episode reward: [(0, '419.817')] [2023-03-06 17:48:32,122][23882] Updated weights for policy 0, policy_version 52290 (0.0007) [2023-03-06 17:48:32,919][23882] Updated weights for policy 0, policy_version 52300 (0.0006) [2023-03-06 17:48:33,687][23882] Updated weights for policy 0, policy_version 52310 (0.0006) [2023-03-06 17:48:34,475][23882] Updated weights for policy 0, policy_version 52320 (0.0006) [2023-03-06 17:48:35,281][23882] Updated weights for policy 0, policy_version 52330 (0.0006) [2023-03-06 17:48:36,051][23882] Updated weights for policy 0, policy_version 52340 (0.0006) [2023-03-06 17:48:36,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13090.1, 300 sec: 13051.7). Total num frames: 53604352. Throughput: 0: 13078.9. Samples: 53589985. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:48:36,759][23556] Avg episode reward: [(0, '336.766')] [2023-03-06 17:48:36,839][23882] Updated weights for policy 0, policy_version 52350 (0.0007) [2023-03-06 17:48:37,642][23882] Updated weights for policy 0, policy_version 52360 (0.0007) [2023-03-06 17:48:38,422][23882] Updated weights for policy 0, policy_version 52370 (0.0007) [2023-03-06 17:48:39,195][23882] Updated weights for policy 0, policy_version 52380 (0.0006) [2023-03-06 17:48:39,982][23882] Updated weights for policy 0, policy_version 52390 (0.0007) [2023-03-06 17:48:40,776][23882] Updated weights for policy 0, policy_version 52400 (0.0007) [2023-03-06 17:48:41,546][23882] Updated weights for policy 0, policy_version 52410 (0.0007) [2023-03-06 17:48:41,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13073.1, 300 sec: 13055.1). Total num frames: 53669888. Throughput: 0: 13078.4. Samples: 53668218. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:48:41,754][23556] Avg episode reward: [(0, '536.295')] [2023-03-06 17:48:42,333][23882] Updated weights for policy 0, policy_version 52420 (0.0006) [2023-03-06 17:48:43,152][23882] Updated weights for policy 0, policy_version 52430 (0.0007) [2023-03-06 17:48:43,926][23882] Updated weights for policy 0, policy_version 52440 (0.0006) [2023-03-06 17:48:44,725][23882] Updated weights for policy 0, policy_version 52450 (0.0008) [2023-03-06 17:48:45,484][23882] Updated weights for policy 0, policy_version 52460 (0.0006) [2023-03-06 17:48:46,268][23882] Updated weights for policy 0, policy_version 52470 (0.0007) [2023-03-06 17:48:46,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13055.1). Total num frames: 53735424. Throughput: 0: 13075.3. Samples: 53707526. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:48:46,759][23556] Avg episode reward: [(0, '375.414')] [2023-03-06 17:48:47,041][23882] Updated weights for policy 0, policy_version 52480 (0.0006) [2023-03-06 17:48:47,825][23882] Updated weights for policy 0, policy_version 52490 (0.0007) [2023-03-06 17:48:48,613][23882] Updated weights for policy 0, policy_version 52500 (0.0007) [2023-03-06 17:48:49,411][23882] Updated weights for policy 0, policy_version 52510 (0.0007) [2023-03-06 17:48:50,201][23882] Updated weights for policy 0, policy_version 52520 (0.0007) [2023-03-06 17:48:50,991][23882] Updated weights for policy 0, policy_version 52530 (0.0007) [2023-03-06 17:48:51,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13073.1, 300 sec: 13055.1). Total num frames: 53799936. Throughput: 0: 13056.7. Samples: 53785746. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:48:51,759][23556] Avg episode reward: [(0, '462.341')] [2023-03-06 17:48:51,770][23882] Updated weights for policy 0, policy_version 52540 (0.0006) [2023-03-06 17:48:52,542][23882] Updated weights for policy 0, policy_version 52550 (0.0006) [2023-03-06 17:48:53,335][23882] Updated weights for policy 0, policy_version 52560 (0.0007) [2023-03-06 17:48:54,102][23882] Updated weights for policy 0, policy_version 52570 (0.0006) [2023-03-06 17:48:54,888][23882] Updated weights for policy 0, policy_version 52580 (0.0006) [2023-03-06 17:48:55,657][23882] Updated weights for policy 0, policy_version 52590 (0.0006) [2023-03-06 17:48:56,456][23882] Updated weights for policy 0, policy_version 52600 (0.0006) [2023-03-06 17:48:56,748][23556] Fps is (10 sec: 13004.6, 60 sec: 13073.0, 300 sec: 13055.1). Total num frames: 53865472. Throughput: 0: 13054.7. Samples: 53864213. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:48:56,754][23556] Avg episode reward: [(0, '522.185')] [2023-03-06 17:48:57,243][23882] Updated weights for policy 0, policy_version 52610 (0.0007) [2023-03-06 17:48:58,022][23882] Updated weights for policy 0, policy_version 52620 (0.0006) [2023-03-06 17:48:58,805][23882] Updated weights for policy 0, policy_version 52630 (0.0006) [2023-03-06 17:48:59,586][23882] Updated weights for policy 0, policy_version 52640 (0.0006) [2023-03-06 17:49:00,365][23882] Updated weights for policy 0, policy_version 52650 (0.0006) [2023-03-06 17:49:01,160][23882] Updated weights for policy 0, policy_version 52660 (0.0007) [2023-03-06 17:49:01,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13058.6). Total num frames: 53931008. Throughput: 0: 13047.8. Samples: 53903356. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:49:01,759][23556] Avg episode reward: [(0, '466.074')] [2023-03-06 17:49:01,943][23882] Updated weights for policy 0, policy_version 52670 (0.0007) [2023-03-06 17:49:02,725][23882] Updated weights for policy 0, policy_version 52680 (0.0006) [2023-03-06 17:49:03,506][23882] Updated weights for policy 0, policy_version 52690 (0.0007) [2023-03-06 17:49:04,295][23882] Updated weights for policy 0, policy_version 52700 (0.0006) [2023-03-06 17:49:05,085][23882] Updated weights for policy 0, policy_version 52710 (0.0006) [2023-03-06 17:49:05,862][23882] Updated weights for policy 0, policy_version 52720 (0.0005) [2023-03-06 17:49:06,638][23882] Updated weights for policy 0, policy_version 52730 (0.0006) [2023-03-06 17:49:06,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 53996544. Throughput: 0: 13047.7. Samples: 53981843. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:49:06,759][23556] Avg episode reward: [(0, '392.697')] [2023-03-06 17:49:07,406][23882] Updated weights for policy 0, policy_version 52740 (0.0006) [2023-03-06 17:49:08,185][23882] Updated weights for policy 0, policy_version 52750 (0.0006) [2023-03-06 17:49:08,971][23882] Updated weights for policy 0, policy_version 52760 (0.0006) [2023-03-06 17:49:09,754][23882] Updated weights for policy 0, policy_version 52770 (0.0006) [2023-03-06 17:49:10,530][23882] Updated weights for policy 0, policy_version 52780 (0.0006) [2023-03-06 17:49:11,312][23882] Updated weights for policy 0, policy_version 52790 (0.0006) [2023-03-06 17:49:11,748][23556] Fps is (10 sec: 13107.4, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 54062080. Throughput: 0: 13056.2. Samples: 54060487. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:49:11,748][23556] Avg episode reward: [(0, '454.870')] [2023-03-06 17:49:12,095][23882] Updated weights for policy 0, policy_version 52800 (0.0007) [2023-03-06 17:49:12,866][23882] Updated weights for policy 0, policy_version 52810 (0.0006) [2023-03-06 17:49:13,652][23882] Updated weights for policy 0, policy_version 52820 (0.0005) [2023-03-06 17:49:14,445][23882] Updated weights for policy 0, policy_version 52830 (0.0006) [2023-03-06 17:49:15,215][23882] Updated weights for policy 0, policy_version 52840 (0.0006) [2023-03-06 17:49:16,023][23882] Updated weights for policy 0, policy_version 52850 (0.0006) [2023-03-06 17:49:16,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 54127616. Throughput: 0: 13069.0. Samples: 54099924. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:49:16,748][23556] Avg episode reward: [(0, '357.663')] [2023-03-06 17:49:16,788][23882] Updated weights for policy 0, policy_version 52860 (0.0007) [2023-03-06 17:49:17,578][23882] Updated weights for policy 0, policy_version 52870 (0.0008) [2023-03-06 17:49:18,353][23882] Updated weights for policy 0, policy_version 52880 (0.0006) [2023-03-06 17:49:19,137][23882] Updated weights for policy 0, policy_version 52890 (0.0007) [2023-03-06 17:49:19,912][23882] Updated weights for policy 0, policy_version 52900 (0.0007) [2023-03-06 17:49:20,705][23882] Updated weights for policy 0, policy_version 52910 (0.0006) [2023-03-06 17:49:21,483][23882] Updated weights for policy 0, policy_version 52920 (0.0006) [2023-03-06 17:49:21,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13073.1, 300 sec: 13062.1). Total num frames: 54193152. Throughput: 0: 13076.0. Samples: 54178403. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:49:21,748][23556] Avg episode reward: [(0, '405.677')] [2023-03-06 17:49:22,279][23882] Updated weights for policy 0, policy_version 52930 (0.0005) [2023-03-06 17:49:23,046][23882] Updated weights for policy 0, policy_version 52940 (0.0006) [2023-03-06 17:49:23,834][23882] Updated weights for policy 0, policy_version 52950 (0.0006) [2023-03-06 17:49:24,629][23882] Updated weights for policy 0, policy_version 52960 (0.0007) [2023-03-06 17:49:25,410][23882] Updated weights for policy 0, policy_version 52970 (0.0006) [2023-03-06 17:49:26,202][23882] Updated weights for policy 0, policy_version 52980 (0.0007) [2023-03-06 17:49:26,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 54257664. Throughput: 0: 13070.9. Samples: 54256409. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:49:26,748][23556] Avg episode reward: [(0, '399.258')] [2023-03-06 17:49:26,995][23882] Updated weights for policy 0, policy_version 52990 (0.0007) [2023-03-06 17:49:27,768][23882] Updated weights for policy 0, policy_version 53000 (0.0006) [2023-03-06 17:49:28,579][23882] Updated weights for policy 0, policy_version 53010 (0.0006) [2023-03-06 17:49:29,366][23882] Updated weights for policy 0, policy_version 53020 (0.0007) [2023-03-06 17:49:30,133][23882] Updated weights for policy 0, policy_version 53030 (0.0007) [2023-03-06 17:49:30,910][23882] Updated weights for policy 0, policy_version 53040 (0.0007) [2023-03-06 17:49:31,685][23882] Updated weights for policy 0, policy_version 53050 (0.0006) [2023-03-06 17:49:31,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 54323200. Throughput: 0: 13067.1. Samples: 54295545. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:49:31,748][23556] Avg episode reward: [(0, '428.710')] [2023-03-06 17:49:32,475][23882] Updated weights for policy 0, policy_version 53060 (0.0006) [2023-03-06 17:49:33,247][23882] Updated weights for policy 0, policy_version 53070 (0.0006) [2023-03-06 17:49:34,030][23882] Updated weights for policy 0, policy_version 53080 (0.0006) [2023-03-06 17:49:34,809][23882] Updated weights for policy 0, policy_version 53090 (0.0007) [2023-03-06 17:49:35,591][23882] Updated weights for policy 0, policy_version 53100 (0.0006) [2023-03-06 17:49:36,379][23882] Updated weights for policy 0, policy_version 53110 (0.0005) [2023-03-06 17:49:36,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.0, 300 sec: 13062.1). Total num frames: 54388736. Throughput: 0: 13079.2. Samples: 54374309. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:49:36,748][23556] Avg episode reward: [(0, '318.923')] [2023-03-06 17:49:37,162][23882] Updated weights for policy 0, policy_version 53120 (0.0006) [2023-03-06 17:49:37,960][23882] Updated weights for policy 0, policy_version 53130 (0.0007) [2023-03-06 17:49:38,742][23882] Updated weights for policy 0, policy_version 53140 (0.0006) [2023-03-06 17:49:39,523][23882] Updated weights for policy 0, policy_version 53150 (0.0006) [2023-03-06 17:49:40,294][23882] Updated weights for policy 0, policy_version 53160 (0.0006) [2023-03-06 17:49:41,108][23882] Updated weights for policy 0, policy_version 53170 (0.0006) [2023-03-06 17:49:41,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13062.1). Total num frames: 54454272. Throughput: 0: 13076.6. Samples: 54452658. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:49:41,748][23556] Avg episode reward: [(0, '481.159')] [2023-03-06 17:49:41,882][23882] Updated weights for policy 0, policy_version 53180 (0.0006) [2023-03-06 17:49:42,655][23882] Updated weights for policy 0, policy_version 53190 (0.0006) [2023-03-06 17:49:43,445][23882] Updated weights for policy 0, policy_version 53200 (0.0006) [2023-03-06 17:49:44,230][23882] Updated weights for policy 0, policy_version 53210 (0.0006) [2023-03-06 17:49:45,008][23882] Updated weights for policy 0, policy_version 53220 (0.0007) [2023-03-06 17:49:45,805][23882] Updated weights for policy 0, policy_version 53230 (0.0006) [2023-03-06 17:49:46,585][23882] Updated weights for policy 0, policy_version 53240 (0.0006) [2023-03-06 17:49:46,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13065.5). Total num frames: 54519808. Throughput: 0: 13080.2. Samples: 54491963. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:49:46,748][23556] Avg episode reward: [(0, '468.802')] [2023-03-06 17:49:47,367][23882] Updated weights for policy 0, policy_version 53250 (0.0006) [2023-03-06 17:49:48,145][23882] Updated weights for policy 0, policy_version 53260 (0.0006) [2023-03-06 17:49:48,931][23882] Updated weights for policy 0, policy_version 53270 (0.0006) [2023-03-06 17:49:49,722][23882] Updated weights for policy 0, policy_version 53280 (0.0006) [2023-03-06 17:49:50,496][23882] Updated weights for policy 0, policy_version 53290 (0.0006) [2023-03-06 17:49:51,278][23882] Updated weights for policy 0, policy_version 53300 (0.0007) [2023-03-06 17:49:51,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13073.1, 300 sec: 13062.1). Total num frames: 54584320. Throughput: 0: 13070.9. Samples: 54570033. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:49:51,748][23556] Avg episode reward: [(0, '394.922')] [2023-03-06 17:49:52,066][23882] Updated weights for policy 0, policy_version 53310 (0.0006) [2023-03-06 17:49:52,852][23882] Updated weights for policy 0, policy_version 53320 (0.0007) [2023-03-06 17:49:53,630][23882] Updated weights for policy 0, policy_version 53330 (0.0006) [2023-03-06 17:49:54,399][23882] Updated weights for policy 0, policy_version 53340 (0.0008) [2023-03-06 17:49:55,198][23882] Updated weights for policy 0, policy_version 53350 (0.0007) [2023-03-06 17:49:55,993][23882] Updated weights for policy 0, policy_version 53360 (0.0006) [2023-03-06 17:49:56,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13073.1, 300 sec: 13062.1). Total num frames: 54649856. Throughput: 0: 13071.6. Samples: 54648708. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:49:56,748][23556] Avg episode reward: [(0, '309.784')] [2023-03-06 17:49:56,759][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000053370_54650880.pth... [2023-03-06 17:49:56,759][23882] Updated weights for policy 0, policy_version 53370 (0.0007) [2023-03-06 17:49:56,788][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000050308_51515392.pth [2023-03-06 17:49:57,537][23882] Updated weights for policy 0, policy_version 53380 (0.0006) [2023-03-06 17:49:58,319][23882] Updated weights for policy 0, policy_version 53390 (0.0007) [2023-03-06 17:49:59,095][23882] Updated weights for policy 0, policy_version 53400 (0.0007) [2023-03-06 17:49:59,893][23882] Updated weights for policy 0, policy_version 53410 (0.0006) [2023-03-06 17:50:00,663][23882] Updated weights for policy 0, policy_version 53420 (0.0009) [2023-03-06 17:50:01,453][23882] Updated weights for policy 0, policy_version 53430 (0.0006) [2023-03-06 17:50:01,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13073.1, 300 sec: 13062.1). Total num frames: 54715392. Throughput: 0: 13068.4. Samples: 54688003. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:50:01,748][23556] Avg episode reward: [(0, '267.836')] [2023-03-06 17:50:02,239][23882] Updated weights for policy 0, policy_version 53440 (0.0006) [2023-03-06 17:50:03,020][23882] Updated weights for policy 0, policy_version 53450 (0.0006) [2023-03-06 17:50:03,812][23882] Updated weights for policy 0, policy_version 53460 (0.0006) [2023-03-06 17:50:04,590][23882] Updated weights for policy 0, policy_version 53470 (0.0006) [2023-03-06 17:50:05,375][23882] Updated weights for policy 0, policy_version 53480 (0.0006) [2023-03-06 17:50:06,154][23882] Updated weights for policy 0, policy_version 53490 (0.0007) [2023-03-06 17:50:06,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13062.1). Total num frames: 54780928. Throughput: 0: 13065.0. Samples: 54766327. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:50:06,748][23556] Avg episode reward: [(0, '387.603')] [2023-03-06 17:50:06,963][23882] Updated weights for policy 0, policy_version 53500 (0.0007) [2023-03-06 17:50:07,747][23882] Updated weights for policy 0, policy_version 53510 (0.0007) [2023-03-06 17:50:08,526][23882] Updated weights for policy 0, policy_version 53520 (0.0005) [2023-03-06 17:50:09,293][23882] Updated weights for policy 0, policy_version 53530 (0.0007) [2023-03-06 17:50:10,089][23882] Updated weights for policy 0, policy_version 53540 (0.0006) [2023-03-06 17:50:10,871][23882] Updated weights for policy 0, policy_version 53550 (0.0007) [2023-03-06 17:50:11,644][23882] Updated weights for policy 0, policy_version 53560 (0.0006) [2023-03-06 17:50:11,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13065.6). Total num frames: 54846464. Throughput: 0: 13069.9. Samples: 54844552. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:50:11,748][23556] Avg episode reward: [(0, '416.008')] [2023-03-06 17:50:12,435][23882] Updated weights for policy 0, policy_version 53570 (0.0006) [2023-03-06 17:50:13,242][23882] Updated weights for policy 0, policy_version 53580 (0.0006) [2023-03-06 17:50:14,018][23882] Updated weights for policy 0, policy_version 53590 (0.0007) [2023-03-06 17:50:14,793][23882] Updated weights for policy 0, policy_version 53600 (0.0006) [2023-03-06 17:50:15,591][23882] Updated weights for policy 0, policy_version 53610 (0.0006) [2023-03-06 17:50:16,356][23882] Updated weights for policy 0, policy_version 53620 (0.0007) [2023-03-06 17:50:16,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13065.5). Total num frames: 54912000. Throughput: 0: 13069.5. Samples: 54883672. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:50:16,748][23556] Avg episode reward: [(0, '510.244')] [2023-03-06 17:50:17,171][23882] Updated weights for policy 0, policy_version 53630 (0.0007) [2023-03-06 17:50:17,947][23882] Updated weights for policy 0, policy_version 53640 (0.0006) [2023-03-06 17:50:18,714][23882] Updated weights for policy 0, policy_version 53650 (0.0006) [2023-03-06 17:50:19,505][23882] Updated weights for policy 0, policy_version 53660 (0.0007) [2023-03-06 17:50:20,269][23882] Updated weights for policy 0, policy_version 53670 (0.0006) [2023-03-06 17:50:21,042][23882] Updated weights for policy 0, policy_version 53680 (0.0006) [2023-03-06 17:50:21,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13065.5). Total num frames: 54977536. Throughput: 0: 13066.3. Samples: 54962292. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:50:21,748][23556] Avg episode reward: [(0, '443.330')] [2023-03-06 17:50:21,833][23882] Updated weights for policy 0, policy_version 53690 (0.0006) [2023-03-06 17:50:22,628][23882] Updated weights for policy 0, policy_version 53700 (0.0006) [2023-03-06 17:50:23,389][23882] Updated weights for policy 0, policy_version 53710 (0.0006) [2023-03-06 17:50:24,181][23882] Updated weights for policy 0, policy_version 53720 (0.0007) [2023-03-06 17:50:24,968][23882] Updated weights for policy 0, policy_version 53730 (0.0006) [2023-03-06 17:50:25,740][23882] Updated weights for policy 0, policy_version 53740 (0.0006) [2023-03-06 17:50:26,530][23882] Updated weights for policy 0, policy_version 53750 (0.0007) [2023-03-06 17:50:26,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13090.2, 300 sec: 13069.0). Total num frames: 55043072. Throughput: 0: 13070.5. Samples: 55040830. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:50:26,748][23556] Avg episode reward: [(0, '420.003')] [2023-03-06 17:50:27,303][23882] Updated weights for policy 0, policy_version 53760 (0.0006) [2023-03-06 17:50:28,077][23882] Updated weights for policy 0, policy_version 53770 (0.0006) [2023-03-06 17:50:28,866][23882] Updated weights for policy 0, policy_version 53780 (0.0006) [2023-03-06 17:50:29,654][23882] Updated weights for policy 0, policy_version 53790 (0.0007) [2023-03-06 17:50:30,439][23882] Updated weights for policy 0, policy_version 53800 (0.0006) [2023-03-06 17:50:31,214][23882] Updated weights for policy 0, policy_version 53810 (0.0007) [2023-03-06 17:50:31,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13073.1, 300 sec: 13065.5). Total num frames: 55107584. Throughput: 0: 13070.9. Samples: 55080153. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:50:31,759][23556] Avg episode reward: [(0, '507.365')] [2023-03-06 17:50:32,016][23882] Updated weights for policy 0, policy_version 53820 (0.0005) [2023-03-06 17:50:32,800][23882] Updated weights for policy 0, policy_version 53830 (0.0006) [2023-03-06 17:50:33,573][23882] Updated weights for policy 0, policy_version 53840 (0.0006) [2023-03-06 17:50:34,356][23882] Updated weights for policy 0, policy_version 53850 (0.0006) [2023-03-06 17:50:35,164][23882] Updated weights for policy 0, policy_version 53860 (0.0007) [2023-03-06 17:50:35,937][23882] Updated weights for policy 0, policy_version 53870 (0.0006) [2023-03-06 17:50:36,714][23882] Updated weights for policy 0, policy_version 53880 (0.0006) [2023-03-06 17:50:36,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13073.1, 300 sec: 13065.5). Total num frames: 55173120. Throughput: 0: 13072.1. Samples: 55158277. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:50:36,759][23556] Avg episode reward: [(0, '527.251')] [2023-03-06 17:50:37,511][23882] Updated weights for policy 0, policy_version 53890 (0.0007) [2023-03-06 17:50:38,279][23882] Updated weights for policy 0, policy_version 53900 (0.0005) [2023-03-06 17:50:39,044][23882] Updated weights for policy 0, policy_version 53910 (0.0005) [2023-03-06 17:50:39,837][23882] Updated weights for policy 0, policy_version 53920 (0.0006) [2023-03-06 17:50:40,638][23882] Updated weights for policy 0, policy_version 53930 (0.0007) [2023-03-06 17:50:41,401][23882] Updated weights for policy 0, policy_version 53940 (0.0007) [2023-03-06 17:50:41,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13065.5). Total num frames: 55238656. Throughput: 0: 13071.5. Samples: 55236926. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:50:41,754][23556] Avg episode reward: [(0, '580.893')] [2023-03-06 17:50:42,182][23882] Updated weights for policy 0, policy_version 53950 (0.0006) [2023-03-06 17:50:42,973][23882] Updated weights for policy 0, policy_version 53960 (0.0007) [2023-03-06 17:50:43,750][23882] Updated weights for policy 0, policy_version 53970 (0.0007) [2023-03-06 17:50:44,552][23882] Updated weights for policy 0, policy_version 53980 (0.0006) [2023-03-06 17:50:45,321][23882] Updated weights for policy 0, policy_version 53990 (0.0006) [2023-03-06 17:50:46,118][23882] Updated weights for policy 0, policy_version 54000 (0.0005) [2023-03-06 17:50:46,748][23556] Fps is (10 sec: 13107.0, 60 sec: 13073.0, 300 sec: 13065.5). Total num frames: 55304192. Throughput: 0: 13067.6. Samples: 55276047. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:50:46,759][23556] Avg episode reward: [(0, '642.670')] [2023-03-06 17:50:46,874][23882] Updated weights for policy 0, policy_version 54010 (0.0007) [2023-03-06 17:50:47,669][23882] Updated weights for policy 0, policy_version 54020 (0.0007) [2023-03-06 17:50:48,461][23882] Updated weights for policy 0, policy_version 54030 (0.0006) [2023-03-06 17:50:49,231][23882] Updated weights for policy 0, policy_version 54040 (0.0006) [2023-03-06 17:50:50,015][23882] Updated weights for policy 0, policy_version 54050 (0.0007) [2023-03-06 17:50:50,799][23882] Updated weights for policy 0, policy_version 54060 (0.0006) [2023-03-06 17:50:51,595][23882] Updated weights for policy 0, policy_version 54070 (0.0006) [2023-03-06 17:50:51,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13073.1, 300 sec: 13062.1). Total num frames: 55368704. Throughput: 0: 13075.6. Samples: 55354730. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:50:51,748][23556] Avg episode reward: [(0, '688.104')] [2023-03-06 17:50:52,374][23882] Updated weights for policy 0, policy_version 54080 (0.0006) [2023-03-06 17:50:53,157][23882] Updated weights for policy 0, policy_version 54090 (0.0007) [2023-03-06 17:50:53,950][23882] Updated weights for policy 0, policy_version 54100 (0.0005) [2023-03-06 17:50:54,720][23882] Updated weights for policy 0, policy_version 54110 (0.0006) [2023-03-06 17:50:55,520][23882] Updated weights for policy 0, policy_version 54120 (0.0007) [2023-03-06 17:50:56,300][23882] Updated weights for policy 0, policy_version 54130 (0.0007) [2023-03-06 17:50:56,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13073.1, 300 sec: 13065.5). Total num frames: 55434240. Throughput: 0: 13075.3. Samples: 55432941. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:50:56,748][23556] Avg episode reward: [(0, '665.438')] [2023-03-06 17:50:57,082][23882] Updated weights for policy 0, policy_version 54140 (0.0006) [2023-03-06 17:50:57,858][23882] Updated weights for policy 0, policy_version 54150 (0.0006) [2023-03-06 17:50:58,652][23882] Updated weights for policy 0, policy_version 54160 (0.0006) [2023-03-06 17:50:59,438][23882] Updated weights for policy 0, policy_version 54170 (0.0006) [2023-03-06 17:51:00,214][23882] Updated weights for policy 0, policy_version 54180 (0.0006) [2023-03-06 17:51:01,014][23882] Updated weights for policy 0, policy_version 54190 (0.0006) [2023-03-06 17:51:01,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13065.5). Total num frames: 55499776. Throughput: 0: 13075.1. Samples: 55472052. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:51:01,748][23556] Avg episode reward: [(0, '670.015')] [2023-03-06 17:51:01,810][23882] Updated weights for policy 0, policy_version 54200 (0.0006) [2023-03-06 17:51:02,602][23882] Updated weights for policy 0, policy_version 54210 (0.0006) [2023-03-06 17:51:03,373][23882] Updated weights for policy 0, policy_version 54220 (0.0006) [2023-03-06 17:51:04,158][23882] Updated weights for policy 0, policy_version 54230 (0.0006) [2023-03-06 17:51:04,933][23882] Updated weights for policy 0, policy_version 54240 (0.0007) [2023-03-06 17:51:05,726][23882] Updated weights for policy 0, policy_version 54250 (0.0006) [2023-03-06 17:51:06,499][23882] Updated weights for policy 0, policy_version 54260 (0.0006) [2023-03-06 17:51:06,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13062.1). Total num frames: 55564288. Throughput: 0: 13064.8. Samples: 55550210. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:51:06,748][23556] Avg episode reward: [(0, '701.150')] [2023-03-06 17:51:07,287][23882] Updated weights for policy 0, policy_version 54270 (0.0006) [2023-03-06 17:51:08,055][23882] Updated weights for policy 0, policy_version 54280 (0.0008) [2023-03-06 17:51:08,849][23882] Updated weights for policy 0, policy_version 54290 (0.0007) [2023-03-06 17:51:09,614][23882] Updated weights for policy 0, policy_version 54300 (0.0007) [2023-03-06 17:51:10,397][23882] Updated weights for policy 0, policy_version 54310 (0.0006) [2023-03-06 17:51:11,195][23882] Updated weights for policy 0, policy_version 54320 (0.0007) [2023-03-06 17:51:11,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13062.1). Total num frames: 55629824. Throughput: 0: 13063.5. Samples: 55628689. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:51:11,748][23556] Avg episode reward: [(0, '713.682')] [2023-03-06 17:51:12,004][23882] Updated weights for policy 0, policy_version 54330 (0.0007) [2023-03-06 17:51:12,776][23882] Updated weights for policy 0, policy_version 54340 (0.0006) [2023-03-06 17:51:13,552][23882] Updated weights for policy 0, policy_version 54350 (0.0006) [2023-03-06 17:51:14,333][23882] Updated weights for policy 0, policy_version 54360 (0.0006) [2023-03-06 17:51:15,121][23882] Updated weights for policy 0, policy_version 54370 (0.0007) [2023-03-06 17:51:15,914][23882] Updated weights for policy 0, policy_version 54380 (0.0007) [2023-03-06 17:51:16,701][23882] Updated weights for policy 0, policy_version 54390 (0.0007) [2023-03-06 17:51:16,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13062.1). Total num frames: 55695360. Throughput: 0: 13064.5. Samples: 55668057. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-06 17:51:16,748][23556] Avg episode reward: [(0, '809.554')] [2023-03-06 17:51:17,474][23882] Updated weights for policy 0, policy_version 54400 (0.0006) [2023-03-06 17:51:18,257][23882] Updated weights for policy 0, policy_version 54410 (0.0006) [2023-03-06 17:51:19,046][23882] Updated weights for policy 0, policy_version 54420 (0.0006) [2023-03-06 17:51:19,819][23882] Updated weights for policy 0, policy_version 54430 (0.0006) [2023-03-06 17:51:20,623][23882] Updated weights for policy 0, policy_version 54440 (0.0006) [2023-03-06 17:51:21,405][23882] Updated weights for policy 0, policy_version 54450 (0.0006) [2023-03-06 17:51:21,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13062.1). Total num frames: 55760896. Throughput: 0: 13062.0. Samples: 55746068. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-06 17:51:21,748][23556] Avg episode reward: [(0, '766.978')] [2023-03-06 17:51:22,181][23882] Updated weights for policy 0, policy_version 54460 (0.0005) [2023-03-06 17:51:22,984][23882] Updated weights for policy 0, policy_version 54470 (0.0006) [2023-03-06 17:51:23,770][23882] Updated weights for policy 0, policy_version 54480 (0.0006) [2023-03-06 17:51:24,552][23882] Updated weights for policy 0, policy_version 54490 (0.0006) [2023-03-06 17:51:25,331][23882] Updated weights for policy 0, policy_version 54500 (0.0007) [2023-03-06 17:51:26,091][23882] Updated weights for policy 0, policy_version 54510 (0.0007) [2023-03-06 17:51:26,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13065.5). Total num frames: 55826432. Throughput: 0: 13058.6. Samples: 55824563. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-06 17:51:26,748][23556] Avg episode reward: [(0, '792.568')] [2023-03-06 17:51:26,862][23882] Updated weights for policy 0, policy_version 54520 (0.0007) [2023-03-06 17:51:27,677][23882] Updated weights for policy 0, policy_version 54530 (0.0007) [2023-03-06 17:51:28,455][23882] Updated weights for policy 0, policy_version 54540 (0.0006) [2023-03-06 17:51:29,254][23882] Updated weights for policy 0, policy_version 54550 (0.0006) [2023-03-06 17:51:30,047][23882] Updated weights for policy 0, policy_version 54560 (0.0006) [2023-03-06 17:51:30,837][23882] Updated weights for policy 0, policy_version 54570 (0.0007) [2023-03-06 17:51:31,616][23882] Updated weights for policy 0, policy_version 54580 (0.0006) [2023-03-06 17:51:31,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13062.1). Total num frames: 55890944. Throughput: 0: 13055.9. Samples: 55863562. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-06 17:51:31,748][23556] Avg episode reward: [(0, '828.111')] [2023-03-06 17:51:32,403][23882] Updated weights for policy 0, policy_version 54590 (0.0007) [2023-03-06 17:51:33,190][23882] Updated weights for policy 0, policy_version 54600 (0.0007) [2023-03-06 17:51:33,976][23882] Updated weights for policy 0, policy_version 54610 (0.0006) [2023-03-06 17:51:34,749][23882] Updated weights for policy 0, policy_version 54620 (0.0006) [2023-03-06 17:51:35,533][23882] Updated weights for policy 0, policy_version 54630 (0.0006) [2023-03-06 17:51:36,325][23882] Updated weights for policy 0, policy_version 54640 (0.0006) [2023-03-06 17:51:36,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13065.5). Total num frames: 55956480. Throughput: 0: 13047.3. Samples: 55941859. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-06 17:51:36,748][23556] Avg episode reward: [(0, '841.483')] [2023-03-06 17:51:37,105][23882] Updated weights for policy 0, policy_version 54650 (0.0006) [2023-03-06 17:51:37,897][23882] Updated weights for policy 0, policy_version 54660 (0.0007) [2023-03-06 17:51:38,682][23882] Updated weights for policy 0, policy_version 54670 (0.0007) [2023-03-06 17:51:39,478][23882] Updated weights for policy 0, policy_version 54680 (0.0007) [2023-03-06 17:51:40,249][23882] Updated weights for policy 0, policy_version 54690 (0.0008) [2023-03-06 17:51:41,043][23882] Updated weights for policy 0, policy_version 54700 (0.0006) [2023-03-06 17:51:41,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13062.1). Total num frames: 56020992. Throughput: 0: 13046.1. Samples: 56020017. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-06 17:51:41,748][23556] Avg episode reward: [(0, '827.391')] [2023-03-06 17:51:41,825][23882] Updated weights for policy 0, policy_version 54710 (0.0006) [2023-03-06 17:51:42,604][23882] Updated weights for policy 0, policy_version 54720 (0.0006) [2023-03-06 17:51:43,385][23882] Updated weights for policy 0, policy_version 54730 (0.0006) [2023-03-06 17:51:44,162][23882] Updated weights for policy 0, policy_version 54740 (0.0007) [2023-03-06 17:51:44,935][23882] Updated weights for policy 0, policy_version 54750 (0.0007) [2023-03-06 17:51:45,725][23882] Updated weights for policy 0, policy_version 54760 (0.0007) [2023-03-06 17:51:46,519][23882] Updated weights for policy 0, policy_version 54770 (0.0006) [2023-03-06 17:51:46,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13039.0, 300 sec: 13065.5). Total num frames: 56086528. Throughput: 0: 13049.9. Samples: 56059297. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:51:46,748][23556] Avg episode reward: [(0, '601.443')] [2023-03-06 17:51:47,309][23882] Updated weights for policy 0, policy_version 54780 (0.0006) [2023-03-06 17:51:48,102][23882] Updated weights for policy 0, policy_version 54790 (0.0006) [2023-03-06 17:51:48,877][23882] Updated weights for policy 0, policy_version 54800 (0.0006) [2023-03-06 17:51:49,660][23882] Updated weights for policy 0, policy_version 54810 (0.0006) [2023-03-06 17:51:50,461][23882] Updated weights for policy 0, policy_version 54820 (0.0007) [2023-03-06 17:51:51,229][23882] Updated weights for policy 0, policy_version 54830 (0.0006) [2023-03-06 17:51:51,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13065.6). Total num frames: 56152064. Throughput: 0: 13051.6. Samples: 56137528. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:51:51,748][23556] Avg episode reward: [(0, '613.249')] [2023-03-06 17:51:52,016][23882] Updated weights for policy 0, policy_version 54840 (0.0006) [2023-03-06 17:51:52,804][23882] Updated weights for policy 0, policy_version 54850 (0.0006) [2023-03-06 17:51:53,585][23882] Updated weights for policy 0, policy_version 54860 (0.0006) [2023-03-06 17:51:54,354][23882] Updated weights for policy 0, policy_version 54870 (0.0006) [2023-03-06 17:51:55,140][23882] Updated weights for policy 0, policy_version 54880 (0.0007) [2023-03-06 17:51:55,916][23882] Updated weights for policy 0, policy_version 54890 (0.0006) [2023-03-06 17:51:56,699][23882] Updated weights for policy 0, policy_version 54900 (0.0006) [2023-03-06 17:51:56,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13069.0). Total num frames: 56217600. Throughput: 0: 13051.1. Samples: 56215990. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:51:56,748][23556] Avg episode reward: [(0, '894.017')] [2023-03-06 17:51:56,753][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000054900_56217600.pth... [2023-03-06 17:51:56,782][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000051837_53081088.pth [2023-03-06 17:51:57,486][23882] Updated weights for policy 0, policy_version 54910 (0.0006) [2023-03-06 17:51:58,286][23882] Updated weights for policy 0, policy_version 54920 (0.0006) [2023-03-06 17:51:59,059][23882] Updated weights for policy 0, policy_version 54930 (0.0007) [2023-03-06 17:51:59,861][23882] Updated weights for policy 0, policy_version 54940 (0.0007) [2023-03-06 17:52:00,652][23882] Updated weights for policy 0, policy_version 54950 (0.0007) [2023-03-06 17:52:01,430][23882] Updated weights for policy 0, policy_version 54960 (0.0006) [2023-03-06 17:52:01,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13039.0, 300 sec: 13065.6). Total num frames: 56282112. Throughput: 0: 13047.8. Samples: 56255207. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:52:01,748][23556] Avg episode reward: [(0, '909.453')] [2023-03-06 17:52:02,228][23882] Updated weights for policy 0, policy_version 54970 (0.0006) [2023-03-06 17:52:03,024][23882] Updated weights for policy 0, policy_version 54980 (0.0006) [2023-03-06 17:52:03,794][23882] Updated weights for policy 0, policy_version 54990 (0.0007) [2023-03-06 17:52:04,597][23882] Updated weights for policy 0, policy_version 55000 (0.0007) [2023-03-06 17:52:05,384][23882] Updated weights for policy 0, policy_version 55010 (0.0006) [2023-03-06 17:52:06,162][23882] Updated weights for policy 0, policy_version 55020 (0.0007) [2023-03-06 17:52:06,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13065.5). Total num frames: 56347648. Throughput: 0: 13044.5. Samples: 56333072. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:52:06,759][23556] Avg episode reward: [(0, '1026.828')] [2023-03-06 17:52:06,958][23882] Updated weights for policy 0, policy_version 55030 (0.0006) [2023-03-06 17:52:07,729][23882] Updated weights for policy 0, policy_version 55040 (0.0006) [2023-03-06 17:52:08,523][23882] Updated weights for policy 0, policy_version 55050 (0.0007) [2023-03-06 17:52:09,297][23882] Updated weights for policy 0, policy_version 55060 (0.0006) [2023-03-06 17:52:10,093][23882] Updated weights for policy 0, policy_version 55070 (0.0007) [2023-03-06 17:52:10,870][23882] Updated weights for policy 0, policy_version 55080 (0.0006) [2023-03-06 17:52:11,654][23882] Updated weights for policy 0, policy_version 55090 (0.0005) [2023-03-06 17:52:11,748][23556] Fps is (10 sec: 13107.0, 60 sec: 13056.0, 300 sec: 13065.5). Total num frames: 56413184. Throughput: 0: 13041.5. Samples: 56411431. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:52:11,748][23556] Avg episode reward: [(0, '1054.641')] [2023-03-06 17:52:12,429][23882] Updated weights for policy 0, policy_version 55100 (0.0006) [2023-03-06 17:52:13,209][23882] Updated weights for policy 0, policy_version 55110 (0.0006) [2023-03-06 17:52:13,998][23882] Updated weights for policy 0, policy_version 55120 (0.0006) [2023-03-06 17:52:14,797][23882] Updated weights for policy 0, policy_version 55130 (0.0007) [2023-03-06 17:52:15,561][23882] Updated weights for policy 0, policy_version 55140 (0.0007) [2023-03-06 17:52:16,360][23882] Updated weights for policy 0, policy_version 55150 (0.0006) [2023-03-06 17:52:16,748][23556] Fps is (10 sec: 13107.4, 60 sec: 13056.0, 300 sec: 13069.0). Total num frames: 56478720. Throughput: 0: 13045.0. Samples: 56450587. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:52:16,758][23556] Avg episode reward: [(0, '1061.075')] [2023-03-06 17:52:17,142][23882] Updated weights for policy 0, policy_version 55160 (0.0006) [2023-03-06 17:52:17,922][23882] Updated weights for policy 0, policy_version 55170 (0.0006) [2023-03-06 17:52:18,717][23882] Updated weights for policy 0, policy_version 55180 (0.0005) [2023-03-06 17:52:19,476][23882] Updated weights for policy 0, policy_version 55190 (0.0006) [2023-03-06 17:52:20,264][23882] Updated weights for policy 0, policy_version 55200 (0.0007) [2023-03-06 17:52:21,055][23882] Updated weights for policy 0, policy_version 55210 (0.0007) [2023-03-06 17:52:21,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13065.5). Total num frames: 56543232. Throughput: 0: 13047.4. Samples: 56528992. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:52:21,759][23556] Avg episode reward: [(0, '1062.091')] [2023-03-06 17:52:21,847][23882] Updated weights for policy 0, policy_version 55220 (0.0006) [2023-03-06 17:52:22,625][23882] Updated weights for policy 0, policy_version 55230 (0.0007) [2023-03-06 17:52:23,420][23882] Updated weights for policy 0, policy_version 55240 (0.0007) [2023-03-06 17:52:24,205][23882] Updated weights for policy 0, policy_version 55250 (0.0006) [2023-03-06 17:52:24,989][23882] Updated weights for policy 0, policy_version 55260 (0.0006) [2023-03-06 17:52:25,772][23882] Updated weights for policy 0, policy_version 55270 (0.0008) [2023-03-06 17:52:26,561][23882] Updated weights for policy 0, policy_version 55280 (0.0007) [2023-03-06 17:52:26,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13065.5). Total num frames: 56608768. Throughput: 0: 13046.2. Samples: 56607098. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:52:26,758][23556] Avg episode reward: [(0, '1114.009')] [2023-03-06 17:52:27,344][23882] Updated weights for policy 0, policy_version 55290 (0.0006) [2023-03-06 17:52:28,133][23882] Updated weights for policy 0, policy_version 55300 (0.0006) [2023-03-06 17:52:28,909][23882] Updated weights for policy 0, policy_version 55310 (0.0007) [2023-03-06 17:52:29,699][23882] Updated weights for policy 0, policy_version 55320 (0.0006) [2023-03-06 17:52:30,487][23882] Updated weights for policy 0, policy_version 55330 (0.0006) [2023-03-06 17:52:31,274][23882] Updated weights for policy 0, policy_version 55340 (0.0006) [2023-03-06 17:52:31,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13069.0). Total num frames: 56674304. Throughput: 0: 13043.6. Samples: 56646257. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:52:31,759][23556] Avg episode reward: [(0, '1197.674')] [2023-03-06 17:52:31,759][23831] Saving new best policy, reward=1197.674! [2023-03-06 17:52:32,041][23882] Updated weights for policy 0, policy_version 55350 (0.0007) [2023-03-06 17:52:32,818][23882] Updated weights for policy 0, policy_version 55360 (0.0007) [2023-03-06 17:52:33,606][23882] Updated weights for policy 0, policy_version 55370 (0.0007) [2023-03-06 17:52:34,381][23882] Updated weights for policy 0, policy_version 55380 (0.0006) [2023-03-06 17:52:35,168][23882] Updated weights for policy 0, policy_version 55390 (0.0006) [2023-03-06 17:52:35,963][23882] Updated weights for policy 0, policy_version 55400 (0.0006) [2023-03-06 17:52:36,743][23882] Updated weights for policy 0, policy_version 55410 (0.0006) [2023-03-06 17:52:36,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13065.5). Total num frames: 56739840. Throughput: 0: 13051.3. Samples: 56724836. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:52:36,748][23556] Avg episode reward: [(0, '1191.113')] [2023-03-06 17:52:37,521][23882] Updated weights for policy 0, policy_version 55420 (0.0006) [2023-03-06 17:52:38,300][23882] Updated weights for policy 0, policy_version 55430 (0.0006) [2023-03-06 17:52:39,076][23882] Updated weights for policy 0, policy_version 55440 (0.0006) [2023-03-06 17:52:39,857][23882] Updated weights for policy 0, policy_version 55450 (0.0006) [2023-03-06 17:52:40,631][23882] Updated weights for policy 0, policy_version 55460 (0.0006) [2023-03-06 17:52:41,419][23882] Updated weights for policy 0, policy_version 55470 (0.0006) [2023-03-06 17:52:41,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13065.6). Total num frames: 56805376. Throughput: 0: 13056.2. Samples: 56803516. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:52:41,748][23556] Avg episode reward: [(0, '1319.085')] [2023-03-06 17:52:41,749][23831] Saving new best policy, reward=1319.085! [2023-03-06 17:52:42,209][23882] Updated weights for policy 0, policy_version 55480 (0.0006) [2023-03-06 17:52:42,997][23882] Updated weights for policy 0, policy_version 55490 (0.0006) [2023-03-06 17:52:43,784][23882] Updated weights for policy 0, policy_version 55500 (0.0006) [2023-03-06 17:52:44,571][23882] Updated weights for policy 0, policy_version 55510 (0.0006) [2023-03-06 17:52:45,347][23882] Updated weights for policy 0, policy_version 55520 (0.0006) [2023-03-06 17:52:46,149][23882] Updated weights for policy 0, policy_version 55530 (0.0006) [2023-03-06 17:52:46,748][23556] Fps is (10 sec: 13004.6, 60 sec: 13056.0, 300 sec: 13065.5). Total num frames: 56869888. Throughput: 0: 13051.2. Samples: 56842515. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:52:46,748][23556] Avg episode reward: [(0, '1326.431')] [2023-03-06 17:52:46,752][23831] Saving new best policy, reward=1326.431! [2023-03-06 17:52:46,937][23882] Updated weights for policy 0, policy_version 55540 (0.0007) [2023-03-06 17:52:47,719][23882] Updated weights for policy 0, policy_version 55550 (0.0007) [2023-03-06 17:52:48,513][23882] Updated weights for policy 0, policy_version 55560 (0.0006) [2023-03-06 17:52:49,300][23882] Updated weights for policy 0, policy_version 55570 (0.0006) [2023-03-06 17:52:50,078][23882] Updated weights for policy 0, policy_version 55580 (0.0007) [2023-03-06 17:52:50,864][23882] Updated weights for policy 0, policy_version 55590 (0.0006) [2023-03-06 17:52:51,648][23882] Updated weights for policy 0, policy_version 55600 (0.0006) [2023-03-06 17:52:51,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13065.5). Total num frames: 56935424. Throughput: 0: 13059.4. Samples: 56920745. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:52:51,748][23556] Avg episode reward: [(0, '1287.257')] [2023-03-06 17:52:52,416][23882] Updated weights for policy 0, policy_version 55610 (0.0007) [2023-03-06 17:52:53,211][23882] Updated weights for policy 0, policy_version 55620 (0.0006) [2023-03-06 17:52:54,000][23882] Updated weights for policy 0, policy_version 55630 (0.0006) [2023-03-06 17:52:54,753][23882] Updated weights for policy 0, policy_version 55640 (0.0007) [2023-03-06 17:52:55,565][23882] Updated weights for policy 0, policy_version 55650 (0.0006) [2023-03-06 17:52:56,354][23882] Updated weights for policy 0, policy_version 55660 (0.0007) [2023-03-06 17:52:56,748][23556] Fps is (10 sec: 13005.0, 60 sec: 13038.9, 300 sec: 13062.1). Total num frames: 56999936. Throughput: 0: 13057.8. Samples: 56999033. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:52:56,748][23556] Avg episode reward: [(0, '1284.599')] [2023-03-06 17:52:57,134][23882] Updated weights for policy 0, policy_version 55670 (0.0007) [2023-03-06 17:52:57,942][23882] Updated weights for policy 0, policy_version 55680 (0.0006) [2023-03-06 17:52:58,713][23882] Updated weights for policy 0, policy_version 55690 (0.0006) [2023-03-06 17:52:59,509][23882] Updated weights for policy 0, policy_version 55700 (0.0007) [2023-03-06 17:53:00,282][23882] Updated weights for policy 0, policy_version 55710 (0.0006) [2023-03-06 17:53:01,074][23882] Updated weights for policy 0, policy_version 55720 (0.0007) [2023-03-06 17:53:01,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 57065472. Throughput: 0: 13051.7. Samples: 57037915. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:53:01,748][23556] Avg episode reward: [(0, '1268.718')] [2023-03-06 17:53:01,846][23882] Updated weights for policy 0, policy_version 55730 (0.0007) [2023-03-06 17:53:02,641][23882] Updated weights for policy 0, policy_version 55740 (0.0006) [2023-03-06 17:53:03,420][23882] Updated weights for policy 0, policy_version 55750 (0.0006) [2023-03-06 17:53:04,186][23882] Updated weights for policy 0, policy_version 55760 (0.0007) [2023-03-06 17:53:04,986][23882] Updated weights for policy 0, policy_version 55770 (0.0007) [2023-03-06 17:53:05,777][23882] Updated weights for policy 0, policy_version 55780 (0.0006) [2023-03-06 17:53:06,580][23882] Updated weights for policy 0, policy_version 55790 (0.0006) [2023-03-06 17:53:06,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 57131008. Throughput: 0: 13047.7. Samples: 57116140. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:53:06,748][23556] Avg episode reward: [(0, '1273.893')] [2023-03-06 17:53:07,351][23882] Updated weights for policy 0, policy_version 55800 (0.0006) [2023-03-06 17:53:08,146][23882] Updated weights for policy 0, policy_version 55810 (0.0006) [2023-03-06 17:53:08,932][23882] Updated weights for policy 0, policy_version 55820 (0.0007) [2023-03-06 17:53:09,723][23882] Updated weights for policy 0, policy_version 55830 (0.0008) [2023-03-06 17:53:10,508][23882] Updated weights for policy 0, policy_version 55840 (0.0007) [2023-03-06 17:53:11,289][23882] Updated weights for policy 0, policy_version 55850 (0.0006) [2023-03-06 17:53:11,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13055.1). Total num frames: 57195520. Throughput: 0: 13050.8. Samples: 57194382. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:53:11,748][23556] Avg episode reward: [(0, '1230.175')] [2023-03-06 17:53:12,074][23882] Updated weights for policy 0, policy_version 55860 (0.0006) [2023-03-06 17:53:12,845][23882] Updated weights for policy 0, policy_version 55870 (0.0007) [2023-03-06 17:53:13,627][23882] Updated weights for policy 0, policy_version 55880 (0.0007) [2023-03-06 17:53:14,416][23882] Updated weights for policy 0, policy_version 55890 (0.0007) [2023-03-06 17:53:15,217][23882] Updated weights for policy 0, policy_version 55900 (0.0006) [2023-03-06 17:53:15,992][23882] Updated weights for policy 0, policy_version 55910 (0.0006) [2023-03-06 17:53:16,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13058.6). Total num frames: 57261056. Throughput: 0: 13053.4. Samples: 57233660. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:53:16,758][23556] Avg episode reward: [(0, '1392.601')] [2023-03-06 17:53:16,773][23831] Saving new best policy, reward=1392.601! [2023-03-06 17:53:16,775][23882] Updated weights for policy 0, policy_version 55920 (0.0006) [2023-03-06 17:53:17,560][23882] Updated weights for policy 0, policy_version 55930 (0.0006) [2023-03-06 17:53:18,349][23882] Updated weights for policy 0, policy_version 55940 (0.0006) [2023-03-06 17:53:19,145][23882] Updated weights for policy 0, policy_version 55950 (0.0007) [2023-03-06 17:53:19,939][23882] Updated weights for policy 0, policy_version 55960 (0.0006) [2023-03-06 17:53:20,725][23882] Updated weights for policy 0, policy_version 55970 (0.0006) [2023-03-06 17:53:21,500][23882] Updated weights for policy 0, policy_version 55980 (0.0006) [2023-03-06 17:53:21,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 57326592. Throughput: 0: 13037.5. Samples: 57311524. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:53:21,759][23556] Avg episode reward: [(0, '1352.451')] [2023-03-06 17:53:22,289][23882] Updated weights for policy 0, policy_version 55990 (0.0006) [2023-03-06 17:53:23,074][23882] Updated weights for policy 0, policy_version 56000 (0.0006) [2023-03-06 17:53:23,854][23882] Updated weights for policy 0, policy_version 56010 (0.0006) [2023-03-06 17:53:24,644][23882] Updated weights for policy 0, policy_version 56020 (0.0007) [2023-03-06 17:53:25,426][23882] Updated weights for policy 0, policy_version 56030 (0.0006) [2023-03-06 17:53:26,220][23882] Updated weights for policy 0, policy_version 56040 (0.0006) [2023-03-06 17:53:26,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13055.1). Total num frames: 57391104. Throughput: 0: 13026.9. Samples: 57389725. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:53:26,758][23556] Avg episode reward: [(0, '1135.689')] [2023-03-06 17:53:27,010][23882] Updated weights for policy 0, policy_version 56050 (0.0007) [2023-03-06 17:53:27,797][23882] Updated weights for policy 0, policy_version 56060 (0.0006) [2023-03-06 17:53:28,583][23882] Updated weights for policy 0, policy_version 56070 (0.0006) [2023-03-06 17:53:29,355][23882] Updated weights for policy 0, policy_version 56080 (0.0006) [2023-03-06 17:53:30,145][23882] Updated weights for policy 0, policy_version 56090 (0.0007) [2023-03-06 17:53:30,924][23882] Updated weights for policy 0, policy_version 56100 (0.0006) [2023-03-06 17:53:31,704][23882] Updated weights for policy 0, policy_version 56110 (0.0006) [2023-03-06 17:53:31,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13058.6). Total num frames: 57456640. Throughput: 0: 13030.9. Samples: 57428905. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:53:31,759][23556] Avg episode reward: [(0, '1267.987')] [2023-03-06 17:53:32,490][23882] Updated weights for policy 0, policy_version 56120 (0.0006) [2023-03-06 17:53:33,276][23882] Updated weights for policy 0, policy_version 56130 (0.0006) [2023-03-06 17:53:34,073][23882] Updated weights for policy 0, policy_version 56140 (0.0006) [2023-03-06 17:53:34,861][23882] Updated weights for policy 0, policy_version 56150 (0.0006) [2023-03-06 17:53:35,635][23882] Updated weights for policy 0, policy_version 56160 (0.0006) [2023-03-06 17:53:36,410][23882] Updated weights for policy 0, policy_version 56170 (0.0006) [2023-03-06 17:53:36,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13058.6). Total num frames: 57522176. Throughput: 0: 13031.7. Samples: 57507170. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:53:36,759][23556] Avg episode reward: [(0, '1359.230')] [2023-03-06 17:53:37,202][23882] Updated weights for policy 0, policy_version 56180 (0.0007) [2023-03-06 17:53:37,990][23882] Updated weights for policy 0, policy_version 56190 (0.0006) [2023-03-06 17:53:38,773][23882] Updated weights for policy 0, policy_version 56200 (0.0006) [2023-03-06 17:53:39,565][23882] Updated weights for policy 0, policy_version 56210 (0.0007) [2023-03-06 17:53:40,350][23882] Updated weights for policy 0, policy_version 56220 (0.0008) [2023-03-06 17:53:41,128][23882] Updated weights for policy 0, policy_version 56230 (0.0006) [2023-03-06 17:53:41,748][23556] Fps is (10 sec: 13004.5, 60 sec: 13021.8, 300 sec: 13055.1). Total num frames: 57586688. Throughput: 0: 13035.3. Samples: 57585622. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:53:41,754][23556] Avg episode reward: [(0, '1435.936')] [2023-03-06 17:53:41,755][23831] Saving new best policy, reward=1435.936! [2023-03-06 17:53:41,914][23882] Updated weights for policy 0, policy_version 56240 (0.0006) [2023-03-06 17:53:42,698][23882] Updated weights for policy 0, policy_version 56250 (0.0007) [2023-03-06 17:53:43,486][23882] Updated weights for policy 0, policy_version 56260 (0.0006) [2023-03-06 17:53:44,267][23882] Updated weights for policy 0, policy_version 56270 (0.0007) [2023-03-06 17:53:45,053][23882] Updated weights for policy 0, policy_version 56280 (0.0007) [2023-03-06 17:53:45,844][23882] Updated weights for policy 0, policy_version 56290 (0.0007) [2023-03-06 17:53:46,629][23882] Updated weights for policy 0, policy_version 56300 (0.0006) [2023-03-06 17:53:46,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13039.0, 300 sec: 13058.6). Total num frames: 57652224. Throughput: 0: 13040.3. Samples: 57624730. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:53:46,759][23556] Avg episode reward: [(0, '1025.813')] [2023-03-06 17:53:47,429][23882] Updated weights for policy 0, policy_version 56310 (0.0006) [2023-03-06 17:53:48,214][23882] Updated weights for policy 0, policy_version 56320 (0.0006) [2023-03-06 17:53:49,012][23882] Updated weights for policy 0, policy_version 56330 (0.0007) [2023-03-06 17:53:49,800][23882] Updated weights for policy 0, policy_version 56340 (0.0006) [2023-03-06 17:53:50,585][23882] Updated weights for policy 0, policy_version 56350 (0.0006) [2023-03-06 17:53:51,365][23882] Updated weights for policy 0, policy_version 56360 (0.0007) [2023-03-06 17:53:51,748][23556] Fps is (10 sec: 13107.4, 60 sec: 13038.9, 300 sec: 13058.6). Total num frames: 57717760. Throughput: 0: 13031.6. Samples: 57702563. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:53:51,759][23556] Avg episode reward: [(0, '1189.411')] [2023-03-06 17:53:52,153][23882] Updated weights for policy 0, policy_version 56370 (0.0007) [2023-03-06 17:53:52,930][23882] Updated weights for policy 0, policy_version 56380 (0.0006) [2023-03-06 17:53:53,711][23882] Updated weights for policy 0, policy_version 56390 (0.0007) [2023-03-06 17:53:54,498][23882] Updated weights for policy 0, policy_version 56400 (0.0007) [2023-03-06 17:53:55,264][23882] Updated weights for policy 0, policy_version 56410 (0.0006) [2023-03-06 17:53:56,048][23882] Updated weights for policy 0, policy_version 56420 (0.0006) [2023-03-06 17:53:56,748][23556] Fps is (10 sec: 13004.6, 60 sec: 13038.9, 300 sec: 13055.1). Total num frames: 57782272. Throughput: 0: 13034.8. Samples: 57780948. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:53:56,758][23556] Avg episode reward: [(0, '1520.374')] [2023-03-06 17:53:56,771][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000056429_57783296.pth... [2023-03-06 17:53:56,800][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000053370_54650880.pth [2023-03-06 17:53:56,803][23831] Saving new best policy, reward=1520.374! [2023-03-06 17:53:56,855][23882] Updated weights for policy 0, policy_version 56430 (0.0006) [2023-03-06 17:53:57,621][23882] Updated weights for policy 0, policy_version 56440 (0.0006) [2023-03-06 17:53:58,415][23882] Updated weights for policy 0, policy_version 56450 (0.0007) [2023-03-06 17:53:59,214][23882] Updated weights for policy 0, policy_version 56460 (0.0007) [2023-03-06 17:54:00,008][23882] Updated weights for policy 0, policy_version 56470 (0.0007) [2023-03-06 17:54:00,782][23882] Updated weights for policy 0, policy_version 56480 (0.0006) [2023-03-06 17:54:01,580][23882] Updated weights for policy 0, policy_version 56490 (0.0007) [2023-03-06 17:54:01,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13055.1). Total num frames: 57847808. Throughput: 0: 13029.7. Samples: 57819996. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:54:01,759][23556] Avg episode reward: [(0, '1363.491')] [2023-03-06 17:54:02,367][23882] Updated weights for policy 0, policy_version 56500 (0.0006) [2023-03-06 17:54:03,128][23882] Updated weights for policy 0, policy_version 56510 (0.0006) [2023-03-06 17:54:03,910][23882] Updated weights for policy 0, policy_version 56520 (0.0006) [2023-03-06 17:54:04,701][23882] Updated weights for policy 0, policy_version 56530 (0.0006) [2023-03-06 17:54:05,489][23882] Updated weights for policy 0, policy_version 56540 (0.0006) [2023-03-06 17:54:06,275][23882] Updated weights for policy 0, policy_version 56550 (0.0007) [2023-03-06 17:54:06,748][23556] Fps is (10 sec: 13005.0, 60 sec: 13021.9, 300 sec: 13051.7). Total num frames: 57912320. Throughput: 0: 13038.7. Samples: 57898265. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:54:06,759][23556] Avg episode reward: [(0, '1346.366')] [2023-03-06 17:54:07,072][23882] Updated weights for policy 0, policy_version 56560 (0.0006) [2023-03-06 17:54:07,840][23882] Updated weights for policy 0, policy_version 56570 (0.0006) [2023-03-06 17:54:08,645][23882] Updated weights for policy 0, policy_version 56580 (0.0006) [2023-03-06 17:54:09,425][23882] Updated weights for policy 0, policy_version 56590 (0.0006) [2023-03-06 17:54:10,210][23882] Updated weights for policy 0, policy_version 56600 (0.0007) [2023-03-06 17:54:10,996][23882] Updated weights for policy 0, policy_version 56610 (0.0006) [2023-03-06 17:54:11,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 57977856. Throughput: 0: 13036.4. Samples: 57976362. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:54:11,754][23556] Avg episode reward: [(0, '1417.188')] [2023-03-06 17:54:11,787][23882] Updated weights for policy 0, policy_version 56620 (0.0007) [2023-03-06 17:54:12,574][23882] Updated weights for policy 0, policy_version 56630 (0.0007) [2023-03-06 17:54:13,362][23882] Updated weights for policy 0, policy_version 56640 (0.0006) [2023-03-06 17:54:14,145][23882] Updated weights for policy 0, policy_version 56650 (0.0006) [2023-03-06 17:54:14,913][23882] Updated weights for policy 0, policy_version 56660 (0.0005) [2023-03-06 17:54:15,694][23882] Updated weights for policy 0, policy_version 56670 (0.0007) [2023-03-06 17:54:16,501][23882] Updated weights for policy 0, policy_version 56680 (0.0007) [2023-03-06 17:54:16,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13048.2). Total num frames: 58042368. Throughput: 0: 13035.1. Samples: 58015483. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:54:16,759][23556] Avg episode reward: [(0, '1517.704')] [2023-03-06 17:54:17,305][23882] Updated weights for policy 0, policy_version 56690 (0.0006) [2023-03-06 17:54:18,095][23882] Updated weights for policy 0, policy_version 56700 (0.0007) [2023-03-06 17:54:18,889][23882] Updated weights for policy 0, policy_version 56710 (0.0007) [2023-03-06 17:54:19,666][23882] Updated weights for policy 0, policy_version 56720 (0.0006) [2023-03-06 17:54:20,456][23882] Updated weights for policy 0, policy_version 56730 (0.0006) [2023-03-06 17:54:21,248][23882] Updated weights for policy 0, policy_version 56740 (0.0006) [2023-03-06 17:54:21,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13051.7). Total num frames: 58107904. Throughput: 0: 13022.0. Samples: 58093161. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:54:21,748][23556] Avg episode reward: [(0, '1718.982')] [2023-03-06 17:54:21,749][23831] Saving new best policy, reward=1718.982! [2023-03-06 17:54:22,027][23882] Updated weights for policy 0, policy_version 56750 (0.0007) [2023-03-06 17:54:22,824][23882] Updated weights for policy 0, policy_version 56760 (0.0007) [2023-03-06 17:54:23,619][23882] Updated weights for policy 0, policy_version 56770 (0.0006) [2023-03-06 17:54:24,399][23882] Updated weights for policy 0, policy_version 56780 (0.0006) [2023-03-06 17:54:25,189][23882] Updated weights for policy 0, policy_version 56790 (0.0006) [2023-03-06 17:54:25,973][23882] Updated weights for policy 0, policy_version 56800 (0.0007) [2023-03-06 17:54:26,732][23882] Updated weights for policy 0, policy_version 56810 (0.0006) [2023-03-06 17:54:26,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 58173440. Throughput: 0: 13016.7. Samples: 58171373. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:54:26,748][23556] Avg episode reward: [(0, '1534.545')] [2023-03-06 17:54:27,522][23882] Updated weights for policy 0, policy_version 56820 (0.0007) [2023-03-06 17:54:28,307][23882] Updated weights for policy 0, policy_version 56830 (0.0007) [2023-03-06 17:54:29,109][23882] Updated weights for policy 0, policy_version 56840 (0.0007) [2023-03-06 17:54:29,887][23882] Updated weights for policy 0, policy_version 56850 (0.0006) [2023-03-06 17:54:30,671][23882] Updated weights for policy 0, policy_version 56860 (0.0006) [2023-03-06 17:54:31,456][23882] Updated weights for policy 0, policy_version 56870 (0.0006) [2023-03-06 17:54:31,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13048.2). Total num frames: 58237952. Throughput: 0: 13014.2. Samples: 58210368. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:54:31,748][23556] Avg episode reward: [(0, '1640.643')] [2023-03-06 17:54:32,232][23882] Updated weights for policy 0, policy_version 56880 (0.0006) [2023-03-06 17:54:33,007][23882] Updated weights for policy 0, policy_version 56890 (0.0006) [2023-03-06 17:54:33,808][23882] Updated weights for policy 0, policy_version 56900 (0.0007) [2023-03-06 17:54:34,575][23882] Updated weights for policy 0, policy_version 56910 (0.0005) [2023-03-06 17:54:35,370][23882] Updated weights for policy 0, policy_version 56920 (0.0006) [2023-03-06 17:54:36,158][23882] Updated weights for policy 0, policy_version 56930 (0.0006) [2023-03-06 17:54:36,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13021.8, 300 sec: 13048.2). Total num frames: 58303488. Throughput: 0: 13028.5. Samples: 58288847. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:54:36,748][23556] Avg episode reward: [(0, '1544.855')] [2023-03-06 17:54:36,957][23882] Updated weights for policy 0, policy_version 56940 (0.0006) [2023-03-06 17:54:37,766][23882] Updated weights for policy 0, policy_version 56950 (0.0007) [2023-03-06 17:54:38,536][23882] Updated weights for policy 0, policy_version 56960 (0.0006) [2023-03-06 17:54:39,326][23882] Updated weights for policy 0, policy_version 56970 (0.0006) [2023-03-06 17:54:40,096][23882] Updated weights for policy 0, policy_version 56980 (0.0006) [2023-03-06 17:54:40,878][23882] Updated weights for policy 0, policy_version 56990 (0.0006) [2023-03-06 17:54:41,644][23882] Updated weights for policy 0, policy_version 57000 (0.0006) [2023-03-06 17:54:41,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13039.0, 300 sec: 13048.2). Total num frames: 58369024. Throughput: 0: 13029.3. Samples: 58367264. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:54:41,748][23556] Avg episode reward: [(0, '1574.904')] [2023-03-06 17:54:42,441][23882] Updated weights for policy 0, policy_version 57010 (0.0006) [2023-03-06 17:54:43,214][23882] Updated weights for policy 0, policy_version 57020 (0.0008) [2023-03-06 17:54:43,997][23882] Updated weights for policy 0, policy_version 57030 (0.0006) [2023-03-06 17:54:44,792][23882] Updated weights for policy 0, policy_version 57040 (0.0006) [2023-03-06 17:54:45,565][23882] Updated weights for policy 0, policy_version 57050 (0.0007) [2023-03-06 17:54:46,349][23882] Updated weights for policy 0, policy_version 57060 (0.0007) [2023-03-06 17:54:46,748][23556] Fps is (10 sec: 13005.0, 60 sec: 13021.9, 300 sec: 13048.2). Total num frames: 58433536. Throughput: 0: 13029.6. Samples: 58406328. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:54:46,748][23556] Avg episode reward: [(0, '1517.852')] [2023-03-06 17:54:47,137][23882] Updated weights for policy 0, policy_version 57070 (0.0007) [2023-03-06 17:54:47,929][23882] Updated weights for policy 0, policy_version 57080 (0.0006) [2023-03-06 17:54:48,726][23882] Updated weights for policy 0, policy_version 57090 (0.0006) [2023-03-06 17:54:49,521][23882] Updated weights for policy 0, policy_version 57100 (0.0006) [2023-03-06 17:54:50,284][23882] Updated weights for policy 0, policy_version 57110 (0.0007) [2023-03-06 17:54:51,095][23882] Updated weights for policy 0, policy_version 57120 (0.0008) [2023-03-06 17:54:51,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13048.2). Total num frames: 58499072. Throughput: 0: 13028.9. Samples: 58484564. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:54:51,748][23556] Avg episode reward: [(0, '1503.317')] [2023-03-06 17:54:51,867][23882] Updated weights for policy 0, policy_version 57130 (0.0007) [2023-03-06 17:54:52,649][23882] Updated weights for policy 0, policy_version 57140 (0.0007) [2023-03-06 17:54:53,427][23882] Updated weights for policy 0, policy_version 57150 (0.0008) [2023-03-06 17:54:54,186][23882] Updated weights for policy 0, policy_version 57160 (0.0007) [2023-03-06 17:54:54,985][23882] Updated weights for policy 0, policy_version 57170 (0.0006) [2023-03-06 17:54:55,760][23882] Updated weights for policy 0, policy_version 57180 (0.0006) [2023-03-06 17:54:56,551][23882] Updated weights for policy 0, policy_version 57190 (0.0006) [2023-03-06 17:54:56,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13048.2). Total num frames: 58564608. Throughput: 0: 13035.8. Samples: 58562976. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:54:56,748][23556] Avg episode reward: [(0, '1651.018')] [2023-03-06 17:54:57,330][23882] Updated weights for policy 0, policy_version 57200 (0.0006) [2023-03-06 17:54:58,136][23882] Updated weights for policy 0, policy_version 57210 (0.0006) [2023-03-06 17:54:58,886][23882] Updated weights for policy 0, policy_version 57220 (0.0006) [2023-03-06 17:54:59,663][23882] Updated weights for policy 0, policy_version 57230 (0.0006) [2023-03-06 17:55:00,445][23882] Updated weights for policy 0, policy_version 57240 (0.0006) [2023-03-06 17:55:01,219][23882] Updated weights for policy 0, policy_version 57250 (0.0006) [2023-03-06 17:55:01,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13048.2). Total num frames: 58630144. Throughput: 0: 13043.9. Samples: 58602459. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:55:01,748][23556] Avg episode reward: [(0, '1559.241')] [2023-03-06 17:55:02,022][23882] Updated weights for policy 0, policy_version 57260 (0.0006) [2023-03-06 17:55:02,814][23882] Updated weights for policy 0, policy_version 57270 (0.0007) [2023-03-06 17:55:03,590][23882] Updated weights for policy 0, policy_version 57280 (0.0007) [2023-03-06 17:55:04,387][23882] Updated weights for policy 0, policy_version 57290 (0.0007) [2023-03-06 17:55:05,167][23882] Updated weights for policy 0, policy_version 57300 (0.0006) [2023-03-06 17:55:05,946][23882] Updated weights for policy 0, policy_version 57310 (0.0006) [2023-03-06 17:55:06,748][23556] Fps is (10 sec: 13107.4, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 58695680. Throughput: 0: 13054.3. Samples: 58680603. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:55:06,748][23556] Avg episode reward: [(0, '1467.997')] [2023-03-06 17:55:06,749][23882] Updated weights for policy 0, policy_version 57320 (0.0007) [2023-03-06 17:55:07,516][23882] Updated weights for policy 0, policy_version 57330 (0.0006) [2023-03-06 17:55:08,294][23882] Updated weights for policy 0, policy_version 57340 (0.0008) [2023-03-06 17:55:09,090][23882] Updated weights for policy 0, policy_version 57350 (0.0006) [2023-03-06 17:55:09,869][23882] Updated weights for policy 0, policy_version 57360 (0.0006) [2023-03-06 17:55:10,681][23882] Updated weights for policy 0, policy_version 57370 (0.0006) [2023-03-06 17:55:11,466][23882] Updated weights for policy 0, policy_version 57380 (0.0007) [2023-03-06 17:55:11,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13044.7). Total num frames: 58760192. Throughput: 0: 13056.9. Samples: 58758935. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:55:11,748][23556] Avg episode reward: [(0, '1614.080')] [2023-03-06 17:55:12,244][23882] Updated weights for policy 0, policy_version 57390 (0.0007) [2023-03-06 17:55:13,029][23882] Updated weights for policy 0, policy_version 57400 (0.0006) [2023-03-06 17:55:13,807][23882] Updated weights for policy 0, policy_version 57410 (0.0006) [2023-03-06 17:55:14,585][23882] Updated weights for policy 0, policy_version 57420 (0.0006) [2023-03-06 17:55:15,371][23882] Updated weights for policy 0, policy_version 57430 (0.0006) [2023-03-06 17:55:16,153][23882] Updated weights for policy 0, policy_version 57440 (0.0006) [2023-03-06 17:55:16,748][23556] Fps is (10 sec: 13004.6, 60 sec: 13056.0, 300 sec: 13044.7). Total num frames: 58825728. Throughput: 0: 13064.6. Samples: 58798276. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:55:16,748][23556] Avg episode reward: [(0, '1452.521')] [2023-03-06 17:55:16,934][23882] Updated weights for policy 0, policy_version 57450 (0.0007) [2023-03-06 17:55:17,718][23882] Updated weights for policy 0, policy_version 57460 (0.0007) [2023-03-06 17:55:18,494][23882] Updated weights for policy 0, policy_version 57470 (0.0006) [2023-03-06 17:55:19,289][23882] Updated weights for policy 0, policy_version 57480 (0.0006) [2023-03-06 17:55:20,058][23882] Updated weights for policy 0, policy_version 57490 (0.0007) [2023-03-06 17:55:20,839][23882] Updated weights for policy 0, policy_version 57500 (0.0006) [2023-03-06 17:55:21,637][23882] Updated weights for policy 0, policy_version 57510 (0.0007) [2023-03-06 17:55:21,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13044.7). Total num frames: 58891264. Throughput: 0: 13060.8. Samples: 58876579. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:55:21,748][23556] Avg episode reward: [(0, '1622.928')] [2023-03-06 17:55:22,427][23882] Updated weights for policy 0, policy_version 57520 (0.0006) [2023-03-06 17:55:23,222][23882] Updated weights for policy 0, policy_version 57530 (0.0007) [2023-03-06 17:55:24,010][23882] Updated weights for policy 0, policy_version 57540 (0.0006) [2023-03-06 17:55:24,805][23882] Updated weights for policy 0, policy_version 57550 (0.0006) [2023-03-06 17:55:25,597][23882] Updated weights for policy 0, policy_version 57560 (0.0006) [2023-03-06 17:55:26,386][23882] Updated weights for policy 0, policy_version 57570 (0.0007) [2023-03-06 17:55:26,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13044.7). Total num frames: 58955776. Throughput: 0: 13043.4. Samples: 58954218. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:55:26,748][23556] Avg episode reward: [(0, '1600.865')] [2023-03-06 17:55:27,183][23882] Updated weights for policy 0, policy_version 57580 (0.0005) [2023-03-06 17:55:27,965][23882] Updated weights for policy 0, policy_version 57590 (0.0007) [2023-03-06 17:55:28,769][23882] Updated weights for policy 0, policy_version 57600 (0.0006) [2023-03-06 17:55:29,555][23882] Updated weights for policy 0, policy_version 57610 (0.0006) [2023-03-06 17:55:30,330][23882] Updated weights for policy 0, policy_version 57620 (0.0006) [2023-03-06 17:55:31,126][23882] Updated weights for policy 0, policy_version 57630 (0.0007) [2023-03-06 17:55:31,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13044.7). Total num frames: 59021312. Throughput: 0: 13041.4. Samples: 58993190. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:55:31,748][23556] Avg episode reward: [(0, '1532.583')] [2023-03-06 17:55:31,896][23882] Updated weights for policy 0, policy_version 57640 (0.0006) [2023-03-06 17:55:32,693][23882] Updated weights for policy 0, policy_version 57650 (0.0006) [2023-03-06 17:55:33,492][23882] Updated weights for policy 0, policy_version 57660 (0.0006) [2023-03-06 17:55:34,278][23882] Updated weights for policy 0, policy_version 57670 (0.0006) [2023-03-06 17:55:35,046][23882] Updated weights for policy 0, policy_version 57680 (0.0006) [2023-03-06 17:55:35,825][23882] Updated weights for policy 0, policy_version 57690 (0.0006) [2023-03-06 17:55:36,624][23882] Updated weights for policy 0, policy_version 57700 (0.0006) [2023-03-06 17:55:36,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13039.0, 300 sec: 13041.2). Total num frames: 59085824. Throughput: 0: 13041.2. Samples: 59071417. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:55:36,748][23556] Avg episode reward: [(0, '1549.016')] [2023-03-06 17:55:37,396][23882] Updated weights for policy 0, policy_version 57710 (0.0006) [2023-03-06 17:55:38,173][23882] Updated weights for policy 0, policy_version 57720 (0.0006) [2023-03-06 17:55:38,961][23882] Updated weights for policy 0, policy_version 57730 (0.0006) [2023-03-06 17:55:39,746][23882] Updated weights for policy 0, policy_version 57740 (0.0006) [2023-03-06 17:55:40,533][23882] Updated weights for policy 0, policy_version 57750 (0.0006) [2023-03-06 17:55:41,334][23882] Updated weights for policy 0, policy_version 57760 (0.0006) [2023-03-06 17:55:41,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13041.3). Total num frames: 59151360. Throughput: 0: 13034.5. Samples: 59149527. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:55:41,748][23556] Avg episode reward: [(0, '1552.979')] [2023-03-06 17:55:42,114][23882] Updated weights for policy 0, policy_version 57770 (0.0006) [2023-03-06 17:55:42,910][23882] Updated weights for policy 0, policy_version 57780 (0.0006) [2023-03-06 17:55:43,703][23882] Updated weights for policy 0, policy_version 57790 (0.0005) [2023-03-06 17:55:44,495][23882] Updated weights for policy 0, policy_version 57800 (0.0006) [2023-03-06 17:55:45,278][23882] Updated weights for policy 0, policy_version 57810 (0.0007) [2023-03-06 17:55:46,069][23882] Updated weights for policy 0, policy_version 57820 (0.0006) [2023-03-06 17:55:46,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13041.2). Total num frames: 59215872. Throughput: 0: 13021.0. Samples: 59188406. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:55:46,748][23556] Avg episode reward: [(0, '1573.142')] [2023-03-06 17:55:46,860][23882] Updated weights for policy 0, policy_version 57830 (0.0006) [2023-03-06 17:55:47,629][23882] Updated weights for policy 0, policy_version 57840 (0.0007) [2023-03-06 17:55:48,433][23882] Updated weights for policy 0, policy_version 57850 (0.0007) [2023-03-06 17:55:49,206][23882] Updated weights for policy 0, policy_version 57860 (0.0006) [2023-03-06 17:55:49,980][23882] Updated weights for policy 0, policy_version 57870 (0.0007) [2023-03-06 17:55:50,779][23882] Updated weights for policy 0, policy_version 57880 (0.0006) [2023-03-06 17:55:51,562][23882] Updated weights for policy 0, policy_version 57890 (0.0005) [2023-03-06 17:55:51,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13041.2). Total num frames: 59281408. Throughput: 0: 13023.4. Samples: 59266660. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:55:51,748][23556] Avg episode reward: [(0, '1604.313')] [2023-03-06 17:55:52,350][23882] Updated weights for policy 0, policy_version 57900 (0.0006) [2023-03-06 17:55:53,118][23882] Updated weights for policy 0, policy_version 57910 (0.0006) [2023-03-06 17:55:53,904][23882] Updated weights for policy 0, policy_version 57920 (0.0006) [2023-03-06 17:55:54,698][23882] Updated weights for policy 0, policy_version 57930 (0.0006) [2023-03-06 17:55:55,497][23882] Updated weights for policy 0, policy_version 57940 (0.0006) [2023-03-06 17:55:56,267][23882] Updated weights for policy 0, policy_version 57950 (0.0006) [2023-03-06 17:55:56,748][23556] Fps is (10 sec: 13107.4, 60 sec: 13039.0, 300 sec: 13041.2). Total num frames: 59346944. Throughput: 0: 13025.7. Samples: 59345093. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:55:56,748][23556] Avg episode reward: [(0, '1630.484')] [2023-03-06 17:55:56,752][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000057956_59346944.pth... [2023-03-06 17:55:56,782][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000054900_56217600.pth [2023-03-06 17:55:57,043][23882] Updated weights for policy 0, policy_version 57960 (0.0007) [2023-03-06 17:55:57,829][23882] Updated weights for policy 0, policy_version 57970 (0.0007) [2023-03-06 17:55:58,631][23882] Updated weights for policy 0, policy_version 57980 (0.0006) [2023-03-06 17:55:59,392][23882] Updated weights for policy 0, policy_version 57990 (0.0006) [2023-03-06 17:56:00,179][23882] Updated weights for policy 0, policy_version 58000 (0.0006) [2023-03-06 17:56:00,977][23882] Updated weights for policy 0, policy_version 58010 (0.0006) [2023-03-06 17:56:01,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13041.3). Total num frames: 59411456. Throughput: 0: 13023.2. Samples: 59384317. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:56:01,748][23556] Avg episode reward: [(0, '1694.790')] [2023-03-06 17:56:01,755][23882] Updated weights for policy 0, policy_version 58020 (0.0007) [2023-03-06 17:56:02,529][23882] Updated weights for policy 0, policy_version 58030 (0.0007) [2023-03-06 17:56:03,315][23882] Updated weights for policy 0, policy_version 58040 (0.0006) [2023-03-06 17:56:04,084][23882] Updated weights for policy 0, policy_version 58050 (0.0006) [2023-03-06 17:56:04,867][23882] Updated weights for policy 0, policy_version 58060 (0.0007) [2023-03-06 17:56:05,651][23882] Updated weights for policy 0, policy_version 58070 (0.0007) [2023-03-06 17:56:06,449][23882] Updated weights for policy 0, policy_version 58080 (0.0007) [2023-03-06 17:56:06,748][23556] Fps is (10 sec: 13004.5, 60 sec: 13021.8, 300 sec: 13041.2). Total num frames: 59476992. Throughput: 0: 13026.0. Samples: 59462753. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:56:06,749][23556] Avg episode reward: [(0, '1660.792')] [2023-03-06 17:56:07,227][23882] Updated weights for policy 0, policy_version 58090 (0.0006) [2023-03-06 17:56:08,022][23882] Updated weights for policy 0, policy_version 58100 (0.0006) [2023-03-06 17:56:08,808][23882] Updated weights for policy 0, policy_version 58110 (0.0007) [2023-03-06 17:56:09,584][23882] Updated weights for policy 0, policy_version 58120 (0.0006) [2023-03-06 17:56:10,364][23882] Updated weights for policy 0, policy_version 58130 (0.0006) [2023-03-06 17:56:11,141][23882] Updated weights for policy 0, policy_version 58140 (0.0007) [2023-03-06 17:56:11,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13041.2). Total num frames: 59542528. Throughput: 0: 13045.5. Samples: 59541264. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:56:11,748][23556] Avg episode reward: [(0, '1574.973')] [2023-03-06 17:56:11,917][23882] Updated weights for policy 0, policy_version 58150 (0.0007) [2023-03-06 17:56:12,710][23882] Updated weights for policy 0, policy_version 58160 (0.0006) [2023-03-06 17:56:13,509][23882] Updated weights for policy 0, policy_version 58170 (0.0006) [2023-03-06 17:56:14,290][23882] Updated weights for policy 0, policy_version 58180 (0.0006) [2023-03-06 17:56:15,072][23882] Updated weights for policy 0, policy_version 58190 (0.0006) [2023-03-06 17:56:15,859][23882] Updated weights for policy 0, policy_version 58200 (0.0006) [2023-03-06 17:56:16,656][23882] Updated weights for policy 0, policy_version 58210 (0.0006) [2023-03-06 17:56:16,748][23556] Fps is (10 sec: 13107.4, 60 sec: 13038.9, 300 sec: 13041.2). Total num frames: 59608064. Throughput: 0: 13045.1. Samples: 59580221. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:56:16,748][23556] Avg episode reward: [(0, '1569.752')] [2023-03-06 17:56:17,437][23882] Updated weights for policy 0, policy_version 58220 (0.0007) [2023-03-06 17:56:18,246][23882] Updated weights for policy 0, policy_version 58230 (0.0006) [2023-03-06 17:56:19,013][23882] Updated weights for policy 0, policy_version 58240 (0.0007) [2023-03-06 17:56:19,817][23882] Updated weights for policy 0, policy_version 58250 (0.0007) [2023-03-06 17:56:20,599][23882] Updated weights for policy 0, policy_version 58260 (0.0006) [2023-03-06 17:56:21,382][23882] Updated weights for policy 0, policy_version 58270 (0.0007) [2023-03-06 17:56:21,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13037.8). Total num frames: 59672576. Throughput: 0: 13039.6. Samples: 59658198. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:56:21,748][23556] Avg episode reward: [(0, '1620.589')] [2023-03-06 17:56:22,169][23882] Updated weights for policy 0, policy_version 58280 (0.0006) [2023-03-06 17:56:22,949][23882] Updated weights for policy 0, policy_version 58290 (0.0006) [2023-03-06 17:56:23,731][23882] Updated weights for policy 0, policy_version 58300 (0.0006) [2023-03-06 17:56:24,526][23882] Updated weights for policy 0, policy_version 58310 (0.0007) [2023-03-06 17:56:25,302][23882] Updated weights for policy 0, policy_version 58320 (0.0007) [2023-03-06 17:56:26,085][23882] Updated weights for policy 0, policy_version 58330 (0.0007) [2023-03-06 17:56:26,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13041.2). Total num frames: 59738112. Throughput: 0: 13044.7. Samples: 59736539. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:56:26,748][23556] Avg episode reward: [(0, '1735.976')] [2023-03-06 17:56:26,753][23831] Saving new best policy, reward=1735.976! [2023-03-06 17:56:26,862][23882] Updated weights for policy 0, policy_version 58340 (0.0007) [2023-03-06 17:56:27,654][23882] Updated weights for policy 0, policy_version 58350 (0.0007) [2023-03-06 17:56:28,429][23882] Updated weights for policy 0, policy_version 58360 (0.0006) [2023-03-06 17:56:29,211][23882] Updated weights for policy 0, policy_version 58370 (0.0006) [2023-03-06 17:56:29,990][23882] Updated weights for policy 0, policy_version 58380 (0.0006) [2023-03-06 17:56:30,761][23882] Updated weights for policy 0, policy_version 58390 (0.0007) [2023-03-06 17:56:31,538][23882] Updated weights for policy 0, policy_version 58400 (0.0006) [2023-03-06 17:56:31,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13041.3). Total num frames: 59803648. Throughput: 0: 13053.8. Samples: 59775826. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:56:31,748][23556] Avg episode reward: [(0, '1523.427')] [2023-03-06 17:56:32,334][23882] Updated weights for policy 0, policy_version 58410 (0.0006) [2023-03-06 17:56:33,143][23882] Updated weights for policy 0, policy_version 58420 (0.0008) [2023-03-06 17:56:33,914][23882] Updated weights for policy 0, policy_version 58430 (0.0007) [2023-03-06 17:56:34,700][23882] Updated weights for policy 0, policy_version 58440 (0.0007) [2023-03-06 17:56:35,487][23882] Updated weights for policy 0, policy_version 58450 (0.0006) [2023-03-06 17:56:36,283][23882] Updated weights for policy 0, policy_version 58460 (0.0006) [2023-03-06 17:56:36,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13041.2). Total num frames: 59868160. Throughput: 0: 13053.1. Samples: 59854047. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:56:36,748][23556] Avg episode reward: [(0, '1554.269')] [2023-03-06 17:56:37,093][23882] Updated weights for policy 0, policy_version 58470 (0.0006) [2023-03-06 17:56:37,876][23882] Updated weights for policy 0, policy_version 58480 (0.0006) [2023-03-06 17:56:38,639][23882] Updated weights for policy 0, policy_version 58490 (0.0007) [2023-03-06 17:56:39,435][23882] Updated weights for policy 0, policy_version 58500 (0.0005) [2023-03-06 17:56:40,233][23882] Updated weights for policy 0, policy_version 58510 (0.0006) [2023-03-06 17:56:41,014][23882] Updated weights for policy 0, policy_version 58520 (0.0006) [2023-03-06 17:56:41,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13041.2). Total num frames: 59933696. Throughput: 0: 13041.5. Samples: 59931959. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:56:41,748][23556] Avg episode reward: [(0, '1531.149')] [2023-03-06 17:56:41,814][23882] Updated weights for policy 0, policy_version 58530 (0.0006) [2023-03-06 17:56:42,601][23882] Updated weights for policy 0, policy_version 58540 (0.0006) [2023-03-06 17:56:43,389][23882] Updated weights for policy 0, policy_version 58550 (0.0007) [2023-03-06 17:56:44,205][23882] Updated weights for policy 0, policy_version 58560 (0.0007) [2023-03-06 17:56:44,983][23882] Updated weights for policy 0, policy_version 58570 (0.0006) [2023-03-06 17:56:45,778][23882] Updated weights for policy 0, policy_version 58580 (0.0006) [2023-03-06 17:56:46,547][23882] Updated weights for policy 0, policy_version 58590 (0.0006) [2023-03-06 17:56:46,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13039.0, 300 sec: 13037.8). Total num frames: 59998208. Throughput: 0: 13029.3. Samples: 59970633. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:56:46,748][23556] Avg episode reward: [(0, '1642.910')] [2023-03-06 17:56:47,335][23882] Updated weights for policy 0, policy_version 58600 (0.0006) [2023-03-06 17:56:48,113][23882] Updated weights for policy 0, policy_version 58610 (0.0007) [2023-03-06 17:56:48,909][23882] Updated weights for policy 0, policy_version 58620 (0.0007) [2023-03-06 17:56:49,698][23882] Updated weights for policy 0, policy_version 58630 (0.0007) [2023-03-06 17:56:50,489][23882] Updated weights for policy 0, policy_version 58640 (0.0006) [2023-03-06 17:56:51,261][23882] Updated weights for policy 0, policy_version 58650 (0.0006) [2023-03-06 17:56:51,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 60063744. Throughput: 0: 13024.2. Samples: 60048840. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:56:51,748][23556] Avg episode reward: [(0, '1587.818')] [2023-03-06 17:56:52,039][23882] Updated weights for policy 0, policy_version 58660 (0.0006) [2023-03-06 17:56:52,819][23882] Updated weights for policy 0, policy_version 58670 (0.0006) [2023-03-06 17:56:53,617][23882] Updated weights for policy 0, policy_version 58680 (0.0006) [2023-03-06 17:56:54,396][23882] Updated weights for policy 0, policy_version 58690 (0.0006) [2023-03-06 17:56:55,186][23882] Updated weights for policy 0, policy_version 58700 (0.0007) [2023-03-06 17:56:55,979][23882] Updated weights for policy 0, policy_version 58710 (0.0006) [2023-03-06 17:56:56,748][23556] Fps is (10 sec: 13004.6, 60 sec: 13021.8, 300 sec: 13037.8). Total num frames: 60128256. Throughput: 0: 13019.8. Samples: 60127154. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:56:56,748][23556] Avg episode reward: [(0, '1623.537')] [2023-03-06 17:56:56,764][23882] Updated weights for policy 0, policy_version 58720 (0.0006) [2023-03-06 17:56:57,538][23882] Updated weights for policy 0, policy_version 58730 (0.0006) [2023-03-06 17:56:58,326][23882] Updated weights for policy 0, policy_version 58740 (0.0006) [2023-03-06 17:56:59,109][23882] Updated weights for policy 0, policy_version 58750 (0.0006) [2023-03-06 17:56:59,915][23882] Updated weights for policy 0, policy_version 58760 (0.0006) [2023-03-06 17:57:00,703][23882] Updated weights for policy 0, policy_version 58770 (0.0006) [2023-03-06 17:57:01,490][23882] Updated weights for policy 0, policy_version 58780 (0.0006) [2023-03-06 17:57:01,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 60193792. Throughput: 0: 13022.1. Samples: 60166217. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:57:01,748][23556] Avg episode reward: [(0, '1682.380')] [2023-03-06 17:57:02,273][23882] Updated weights for policy 0, policy_version 58790 (0.0006) [2023-03-06 17:57:03,070][23882] Updated weights for policy 0, policy_version 58800 (0.0008) [2023-03-06 17:57:03,852][23882] Updated weights for policy 0, policy_version 58810 (0.0006) [2023-03-06 17:57:04,645][23882] Updated weights for policy 0, policy_version 58820 (0.0006) [2023-03-06 17:57:05,443][23882] Updated weights for policy 0, policy_version 58830 (0.0006) [2023-03-06 17:57:06,226][23882] Updated weights for policy 0, policy_version 58840 (0.0006) [2023-03-06 17:57:06,748][23556] Fps is (10 sec: 13005.0, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 60258304. Throughput: 0: 13016.7. Samples: 60243947. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:57:06,748][23556] Avg episode reward: [(0, '1616.293')] [2023-03-06 17:57:07,011][23882] Updated weights for policy 0, policy_version 58850 (0.0006) [2023-03-06 17:57:07,806][23882] Updated weights for policy 0, policy_version 58860 (0.0006) [2023-03-06 17:57:08,585][23882] Updated weights for policy 0, policy_version 58870 (0.0007) [2023-03-06 17:57:09,380][23882] Updated weights for policy 0, policy_version 58880 (0.0006) [2023-03-06 17:57:10,151][23882] Updated weights for policy 0, policy_version 58890 (0.0007) [2023-03-06 17:57:10,937][23882] Updated weights for policy 0, policy_version 58900 (0.0007) [2023-03-06 17:57:11,718][23882] Updated weights for policy 0, policy_version 58910 (0.0006) [2023-03-06 17:57:11,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 60323840. Throughput: 0: 13010.0. Samples: 60321990. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:57:11,748][23556] Avg episode reward: [(0, '1712.727')] [2023-03-06 17:57:12,501][23882] Updated weights for policy 0, policy_version 58920 (0.0006) [2023-03-06 17:57:13,281][23882] Updated weights for policy 0, policy_version 58930 (0.0006) [2023-03-06 17:57:14,067][23882] Updated weights for policy 0, policy_version 58940 (0.0006) [2023-03-06 17:57:14,836][23882] Updated weights for policy 0, policy_version 58950 (0.0007) [2023-03-06 17:57:15,642][23882] Updated weights for policy 0, policy_version 58960 (0.0006) [2023-03-06 17:57:16,417][23882] Updated weights for policy 0, policy_version 58970 (0.0006) [2023-03-06 17:57:16,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13021.9, 300 sec: 13037.8). Total num frames: 60389376. Throughput: 0: 13011.0. Samples: 60361322. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:57:16,748][23556] Avg episode reward: [(0, '1609.175')] [2023-03-06 17:57:17,207][23882] Updated weights for policy 0, policy_version 58980 (0.0006) [2023-03-06 17:57:17,992][23882] Updated weights for policy 0, policy_version 58990 (0.0007) [2023-03-06 17:57:18,772][23882] Updated weights for policy 0, policy_version 59000 (0.0007) [2023-03-06 17:57:19,550][23882] Updated weights for policy 0, policy_version 59010 (0.0007) [2023-03-06 17:57:20,338][23882] Updated weights for policy 0, policy_version 59020 (0.0006) [2023-03-06 17:57:21,135][23882] Updated weights for policy 0, policy_version 59030 (0.0006) [2023-03-06 17:57:21,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 60453888. Throughput: 0: 13014.3. Samples: 60439691. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:57:21,748][23556] Avg episode reward: [(0, '1399.235')] [2023-03-06 17:57:21,926][23882] Updated weights for policy 0, policy_version 59040 (0.0006) [2023-03-06 17:57:22,692][23882] Updated weights for policy 0, policy_version 59050 (0.0007) [2023-03-06 17:57:23,467][23882] Updated weights for policy 0, policy_version 59060 (0.0006) [2023-03-06 17:57:24,254][23882] Updated weights for policy 0, policy_version 59070 (0.0007) [2023-03-06 17:57:25,058][23882] Updated weights for policy 0, policy_version 59080 (0.0006) [2023-03-06 17:57:25,849][23882] Updated weights for policy 0, policy_version 59090 (0.0006) [2023-03-06 17:57:26,653][23882] Updated weights for policy 0, policy_version 59100 (0.0006) [2023-03-06 17:57:26,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 60519424. Throughput: 0: 13019.0. Samples: 60517813. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:57:26,748][23556] Avg episode reward: [(0, '1295.664')] [2023-03-06 17:57:27,436][23882] Updated weights for policy 0, policy_version 59110 (0.0006) [2023-03-06 17:57:28,223][23882] Updated weights for policy 0, policy_version 59120 (0.0006) [2023-03-06 17:57:29,012][23882] Updated weights for policy 0, policy_version 59130 (0.0007) [2023-03-06 17:57:29,805][23882] Updated weights for policy 0, policy_version 59140 (0.0006) [2023-03-06 17:57:30,585][23882] Updated weights for policy 0, policy_version 59150 (0.0006) [2023-03-06 17:57:31,361][23882] Updated weights for policy 0, policy_version 59160 (0.0006) [2023-03-06 17:57:31,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 60584960. Throughput: 0: 13023.1. Samples: 60556673. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:57:31,748][23556] Avg episode reward: [(0, '1448.448')] [2023-03-06 17:57:32,161][23882] Updated weights for policy 0, policy_version 59170 (0.0007) [2023-03-06 17:57:32,956][23882] Updated weights for policy 0, policy_version 59180 (0.0006) [2023-03-06 17:57:33,724][23882] Updated weights for policy 0, policy_version 59190 (0.0006) [2023-03-06 17:57:34,504][23882] Updated weights for policy 0, policy_version 59200 (0.0006) [2023-03-06 17:57:35,302][23882] Updated weights for policy 0, policy_version 59210 (0.0006) [2023-03-06 17:57:36,049][23882] Updated weights for policy 0, policy_version 59220 (0.0006) [2023-03-06 17:57:36,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13030.8). Total num frames: 60649472. Throughput: 0: 13025.5. Samples: 60634988. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:57:36,748][23556] Avg episode reward: [(0, '1355.158')] [2023-03-06 17:57:36,837][23882] Updated weights for policy 0, policy_version 59230 (0.0006) [2023-03-06 17:57:37,634][23882] Updated weights for policy 0, policy_version 59240 (0.0006) [2023-03-06 17:57:38,431][23882] Updated weights for policy 0, policy_version 59250 (0.0006) [2023-03-06 17:57:39,217][23882] Updated weights for policy 0, policy_version 59260 (0.0006) [2023-03-06 17:57:39,995][23882] Updated weights for policy 0, policy_version 59270 (0.0007) [2023-03-06 17:57:40,779][23882] Updated weights for policy 0, policy_version 59280 (0.0006) [2023-03-06 17:57:41,581][23882] Updated weights for policy 0, policy_version 59290 (0.0007) [2023-03-06 17:57:41,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 60715008. Throughput: 0: 13021.0. Samples: 60713099. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:57:41,748][23556] Avg episode reward: [(0, '1294.473')] [2023-03-06 17:57:42,370][23882] Updated weights for policy 0, policy_version 59300 (0.0007) [2023-03-06 17:57:43,163][23882] Updated weights for policy 0, policy_version 59310 (0.0006) [2023-03-06 17:57:43,950][23882] Updated weights for policy 0, policy_version 59320 (0.0006) [2023-03-06 17:57:44,726][23882] Updated weights for policy 0, policy_version 59330 (0.0007) [2023-03-06 17:57:45,516][23882] Updated weights for policy 0, policy_version 59340 (0.0006) [2023-03-06 17:57:46,290][23882] Updated weights for policy 0, policy_version 59350 (0.0006) [2023-03-06 17:57:46,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13030.8). Total num frames: 60779520. Throughput: 0: 13020.8. Samples: 60752152. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:57:46,748][23556] Avg episode reward: [(0, '1539.399')] [2023-03-06 17:57:47,069][23882] Updated weights for policy 0, policy_version 59360 (0.0007) [2023-03-06 17:57:47,868][23882] Updated weights for policy 0, policy_version 59370 (0.0006) [2023-03-06 17:57:48,625][23882] Updated weights for policy 0, policy_version 59380 (0.0006) [2023-03-06 17:57:49,425][23882] Updated weights for policy 0, policy_version 59390 (0.0007) [2023-03-06 17:57:50,190][23882] Updated weights for policy 0, policy_version 59400 (0.0007) [2023-03-06 17:57:50,984][23882] Updated weights for policy 0, policy_version 59410 (0.0006) [2023-03-06 17:57:51,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 60845056. Throughput: 0: 13043.1. Samples: 60830890. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:57:51,748][23556] Avg episode reward: [(0, '1476.078')] [2023-03-06 17:57:51,764][23882] Updated weights for policy 0, policy_version 59420 (0.0006) [2023-03-06 17:57:52,542][23882] Updated weights for policy 0, policy_version 59430 (0.0006) [2023-03-06 17:57:53,355][23882] Updated weights for policy 0, policy_version 59440 (0.0007) [2023-03-06 17:57:54,146][23882] Updated weights for policy 0, policy_version 59450 (0.0006) [2023-03-06 17:57:54,926][23882] Updated weights for policy 0, policy_version 59460 (0.0006) [2023-03-06 17:57:55,714][23882] Updated weights for policy 0, policy_version 59470 (0.0006) [2023-03-06 17:57:56,501][23882] Updated weights for policy 0, policy_version 59480 (0.0007) [2023-03-06 17:57:56,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13039.0, 300 sec: 13034.3). Total num frames: 60910592. Throughput: 0: 13039.7. Samples: 60908778. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:57:56,748][23556] Avg episode reward: [(0, '1412.255')] [2023-03-06 17:57:56,752][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000059483_60910592.pth... [2023-03-06 17:57:56,782][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000056429_57783296.pth [2023-03-06 17:57:57,275][23882] Updated weights for policy 0, policy_version 59490 (0.0006) [2023-03-06 17:57:58,068][23882] Updated weights for policy 0, policy_version 59500 (0.0007) [2023-03-06 17:57:58,865][23882] Updated weights for policy 0, policy_version 59510 (0.0006) [2023-03-06 17:57:59,642][23882] Updated weights for policy 0, policy_version 59520 (0.0006) [2023-03-06 17:58:00,437][23882] Updated weights for policy 0, policy_version 59530 (0.0006) [2023-03-06 17:58:01,233][23882] Updated weights for policy 0, policy_version 59540 (0.0006) [2023-03-06 17:58:01,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13030.8). Total num frames: 60975104. Throughput: 0: 13029.7. Samples: 60947660. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:58:01,748][23556] Avg episode reward: [(0, '1518.798')] [2023-03-06 17:58:02,017][23882] Updated weights for policy 0, policy_version 59550 (0.0007) [2023-03-06 17:58:02,798][23882] Updated weights for policy 0, policy_version 59560 (0.0007) [2023-03-06 17:58:03,591][23882] Updated weights for policy 0, policy_version 59570 (0.0007) [2023-03-06 17:58:04,353][23882] Updated weights for policy 0, policy_version 59580 (0.0007) [2023-03-06 17:58:05,138][23882] Updated weights for policy 0, policy_version 59590 (0.0006) [2023-03-06 17:58:05,940][23882] Updated weights for policy 0, policy_version 59600 (0.0007) [2023-03-06 17:58:06,733][23882] Updated weights for policy 0, policy_version 59610 (0.0006) [2023-03-06 17:58:06,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 61040640. Throughput: 0: 13024.4. Samples: 61025789. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:58:06,748][23556] Avg episode reward: [(0, '1612.514')] [2023-03-06 17:58:07,511][23882] Updated weights for policy 0, policy_version 59620 (0.0006) [2023-03-06 17:58:08,311][23882] Updated weights for policy 0, policy_version 59630 (0.0006) [2023-03-06 17:58:09,082][23882] Updated weights for policy 0, policy_version 59640 (0.0006) [2023-03-06 17:58:09,866][23882] Updated weights for policy 0, policy_version 59650 (0.0007) [2023-03-06 17:58:10,651][23882] Updated weights for policy 0, policy_version 59660 (0.0007) [2023-03-06 17:58:11,425][23882] Updated weights for policy 0, policy_version 59670 (0.0005) [2023-03-06 17:58:11,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 61106176. Throughput: 0: 13027.0. Samples: 61104030. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 17:58:11,748][23556] Avg episode reward: [(0, '1601.351')] [2023-03-06 17:58:12,202][23882] Updated weights for policy 0, policy_version 59680 (0.0006) [2023-03-06 17:58:12,989][23882] Updated weights for policy 0, policy_version 59690 (0.0006) [2023-03-06 17:58:13,769][23882] Updated weights for policy 0, policy_version 59700 (0.0006) [2023-03-06 17:58:14,565][23882] Updated weights for policy 0, policy_version 59710 (0.0006) [2023-03-06 17:58:15,351][23882] Updated weights for policy 0, policy_version 59720 (0.0007) [2023-03-06 17:58:16,142][23882] Updated weights for policy 0, policy_version 59730 (0.0007) [2023-03-06 17:58:16,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13030.8). Total num frames: 61170688. Throughput: 0: 13040.9. Samples: 61143513. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 17:58:16,748][23556] Avg episode reward: [(0, '1625.179')] [2023-03-06 17:58:16,917][23882] Updated weights for policy 0, policy_version 59740 (0.0006) [2023-03-06 17:58:17,677][23882] Updated weights for policy 0, policy_version 59750 (0.0006) [2023-03-06 17:58:18,471][23882] Updated weights for policy 0, policy_version 59760 (0.0006) [2023-03-06 17:58:19,254][23882] Updated weights for policy 0, policy_version 59770 (0.0007) [2023-03-06 17:58:20,034][23882] Updated weights for policy 0, policy_version 59780 (0.0007) [2023-03-06 17:58:20,824][23882] Updated weights for policy 0, policy_version 59790 (0.0006) [2023-03-06 17:58:21,609][23882] Updated weights for policy 0, policy_version 59800 (0.0006) [2023-03-06 17:58:21,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 61236224. Throughput: 0: 13040.7. Samples: 61221821. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 17:58:21,748][23556] Avg episode reward: [(0, '1618.988')] [2023-03-06 17:58:22,387][23882] Updated weights for policy 0, policy_version 59810 (0.0006) [2023-03-06 17:58:23,175][23882] Updated weights for policy 0, policy_version 59820 (0.0006) [2023-03-06 17:58:23,968][23882] Updated weights for policy 0, policy_version 59830 (0.0006) [2023-03-06 17:58:24,760][23882] Updated weights for policy 0, policy_version 59840 (0.0007) [2023-03-06 17:58:25,539][23882] Updated weights for policy 0, policy_version 59850 (0.0007) [2023-03-06 17:58:26,333][23882] Updated weights for policy 0, policy_version 59860 (0.0006) [2023-03-06 17:58:26,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 61301760. Throughput: 0: 13045.3. Samples: 61300137. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 17:58:26,748][23556] Avg episode reward: [(0, '1663.372')] [2023-03-06 17:58:27,100][23882] Updated weights for policy 0, policy_version 59870 (0.0006) [2023-03-06 17:58:27,872][23882] Updated weights for policy 0, policy_version 59880 (0.0007) [2023-03-06 17:58:28,665][23882] Updated weights for policy 0, policy_version 59890 (0.0007) [2023-03-06 17:58:29,452][23882] Updated weights for policy 0, policy_version 59900 (0.0007) [2023-03-06 17:58:30,239][23882] Updated weights for policy 0, policy_version 59910 (0.0007) [2023-03-06 17:58:31,038][23882] Updated weights for policy 0, policy_version 59920 (0.0006) [2023-03-06 17:58:31,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 61367296. Throughput: 0: 13050.7. Samples: 61339434. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 17:58:31,748][23556] Avg episode reward: [(0, '1717.210')] [2023-03-06 17:58:31,811][23882] Updated weights for policy 0, policy_version 59930 (0.0007) [2023-03-06 17:58:32,611][23882] Updated weights for policy 0, policy_version 59940 (0.0006) [2023-03-06 17:58:33,393][23882] Updated weights for policy 0, policy_version 59950 (0.0005) [2023-03-06 17:58:34,158][23882] Updated weights for policy 0, policy_version 59960 (0.0005) [2023-03-06 17:58:34,941][23882] Updated weights for policy 0, policy_version 59970 (0.0006) [2023-03-06 17:58:35,739][23882] Updated weights for policy 0, policy_version 59980 (0.0007) [2023-03-06 17:58:36,507][23882] Updated weights for policy 0, policy_version 59990 (0.0006) [2023-03-06 17:58:36,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 61432832. Throughput: 0: 13039.3. Samples: 61417659. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 17:58:36,748][23556] Avg episode reward: [(0, '1635.365')] [2023-03-06 17:58:37,299][23882] Updated weights for policy 0, policy_version 60000 (0.0007) [2023-03-06 17:58:38,099][23882] Updated weights for policy 0, policy_version 60010 (0.0005) [2023-03-06 17:58:38,883][23882] Updated weights for policy 0, policy_version 60020 (0.0006) [2023-03-06 17:58:39,665][23882] Updated weights for policy 0, policy_version 60030 (0.0006) [2023-03-06 17:58:40,439][23882] Updated weights for policy 0, policy_version 60040 (0.0006) [2023-03-06 17:58:41,213][23882] Updated weights for policy 0, policy_version 60050 (0.0006) [2023-03-06 17:58:41,748][23556] Fps is (10 sec: 13004.5, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 61497344. Throughput: 0: 13051.9. Samples: 61496118. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 17:58:41,748][23556] Avg episode reward: [(0, '1683.946')] [2023-03-06 17:58:42,005][23882] Updated weights for policy 0, policy_version 60060 (0.0006) [2023-03-06 17:58:42,775][23882] Updated weights for policy 0, policy_version 60070 (0.0006) [2023-03-06 17:58:43,565][23882] Updated weights for policy 0, policy_version 60080 (0.0006) [2023-03-06 17:58:44,335][23882] Updated weights for policy 0, policy_version 60090 (0.0006) [2023-03-06 17:58:45,127][23882] Updated weights for policy 0, policy_version 60100 (0.0006) [2023-03-06 17:58:45,917][23882] Updated weights for policy 0, policy_version 60110 (0.0007) [2023-03-06 17:58:46,687][23882] Updated weights for policy 0, policy_version 60120 (0.0006) [2023-03-06 17:58:46,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13037.8). Total num frames: 61563904. Throughput: 0: 13059.6. Samples: 61535342. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 17:58:46,748][23556] Avg episode reward: [(0, '1391.928')] [2023-03-06 17:58:47,148][23831] KL-divergence is very high: 115.5084 [2023-03-06 17:58:47,469][23882] Updated weights for policy 0, policy_version 60130 (0.0006) [2023-03-06 17:58:48,244][23882] Updated weights for policy 0, policy_version 60140 (0.0007) [2023-03-06 17:58:49,040][23882] Updated weights for policy 0, policy_version 60150 (0.0006) [2023-03-06 17:58:49,830][23882] Updated weights for policy 0, policy_version 60160 (0.0006) [2023-03-06 17:58:50,594][23882] Updated weights for policy 0, policy_version 60170 (0.0006) [2023-03-06 17:58:51,368][23882] Updated weights for policy 0, policy_version 60180 (0.0006) [2023-03-06 17:58:51,748][23556] Fps is (10 sec: 13107.5, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 61628416. Throughput: 0: 13071.8. Samples: 61614017. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:58:51,759][23556] Avg episode reward: [(0, '1242.422')] [2023-03-06 17:58:52,150][23882] Updated weights for policy 0, policy_version 60190 (0.0006) [2023-03-06 17:58:52,923][23882] Updated weights for policy 0, policy_version 60200 (0.0006) [2023-03-06 17:58:53,708][23882] Updated weights for policy 0, policy_version 60210 (0.0007) [2023-03-06 17:58:54,509][23882] Updated weights for policy 0, policy_version 60220 (0.0007) [2023-03-06 17:58:55,295][23882] Updated weights for policy 0, policy_version 60230 (0.0006) [2023-03-06 17:58:56,062][23882] Updated weights for policy 0, policy_version 60240 (0.0007) [2023-03-06 17:58:56,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 61693952. Throughput: 0: 13079.0. Samples: 61692586. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:58:56,758][23556] Avg episode reward: [(0, '1178.509')] [2023-03-06 17:58:56,870][23882] Updated weights for policy 0, policy_version 60250 (0.0006) [2023-03-06 17:58:57,653][23882] Updated weights for policy 0, policy_version 60260 (0.0006) [2023-03-06 17:58:58,418][23882] Updated weights for policy 0, policy_version 60270 (0.0006) [2023-03-06 17:58:59,211][23882] Updated weights for policy 0, policy_version 60280 (0.0006) [2023-03-06 17:58:59,997][23882] Updated weights for policy 0, policy_version 60290 (0.0007) [2023-03-06 17:59:00,777][23882] Updated weights for policy 0, policy_version 60300 (0.0007) [2023-03-06 17:59:01,561][23882] Updated weights for policy 0, policy_version 60310 (0.0007) [2023-03-06 17:59:01,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13041.2). Total num frames: 61759488. Throughput: 0: 13070.5. Samples: 61731684. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:59:01,759][23556] Avg episode reward: [(0, '1443.323')] [2023-03-06 17:59:02,350][23882] Updated weights for policy 0, policy_version 60320 (0.0007) [2023-03-06 17:59:03,136][23882] Updated weights for policy 0, policy_version 60330 (0.0007) [2023-03-06 17:59:03,935][23882] Updated weights for policy 0, policy_version 60340 (0.0006) [2023-03-06 17:59:04,711][23882] Updated weights for policy 0, policy_version 60350 (0.0006) [2023-03-06 17:59:05,482][23882] Updated weights for policy 0, policy_version 60360 (0.0007) [2023-03-06 17:59:05,786][23831] KL-divergence is very high: 3197.7954 [2023-03-06 17:59:06,258][23882] Updated weights for policy 0, policy_version 60370 (0.0006) [2023-03-06 17:59:06,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13041.2). Total num frames: 61825024. Throughput: 0: 13073.0. Samples: 61810107. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:59:06,759][23556] Avg episode reward: [(0, '1321.474')] [2023-03-06 17:59:07,056][23882] Updated weights for policy 0, policy_version 60380 (0.0006) [2023-03-06 17:59:07,829][23882] Updated weights for policy 0, policy_version 60390 (0.0007) [2023-03-06 17:59:08,611][23882] Updated weights for policy 0, policy_version 60400 (0.0005) [2023-03-06 17:59:09,393][23882] Updated weights for policy 0, policy_version 60410 (0.0006) [2023-03-06 17:59:10,173][23882] Updated weights for policy 0, policy_version 60420 (0.0007) [2023-03-06 17:59:10,943][23882] Updated weights for policy 0, policy_version 60430 (0.0006) [2023-03-06 17:59:11,732][23882] Updated weights for policy 0, policy_version 60440 (0.0007) [2023-03-06 17:59:11,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13044.7). Total num frames: 61890560. Throughput: 0: 13078.1. Samples: 61888649. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:59:11,759][23556] Avg episode reward: [(0, '1346.046')] [2023-03-06 17:59:12,524][23882] Updated weights for policy 0, policy_version 60450 (0.0006) [2023-03-06 17:59:12,597][23831] KL-divergence is very high: 2660.6775 [2023-03-06 17:59:13,288][23882] Updated weights for policy 0, policy_version 60460 (0.0006) [2023-03-06 17:59:14,080][23882] Updated weights for policy 0, policy_version 60470 (0.0007) [2023-03-06 17:59:14,866][23882] Updated weights for policy 0, policy_version 60480 (0.0007) [2023-03-06 17:59:15,643][23882] Updated weights for policy 0, policy_version 60490 (0.0006) [2023-03-06 17:59:16,429][23882] Updated weights for policy 0, policy_version 60500 (0.0007) [2023-03-06 17:59:16,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13073.1, 300 sec: 13041.3). Total num frames: 61955072. Throughput: 0: 13075.6. Samples: 61927836. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:59:16,748][23556] Avg episode reward: [(0, '1385.162')] [2023-03-06 17:59:17,229][23882] Updated weights for policy 0, policy_version 60510 (0.0007) [2023-03-06 17:59:18,011][23882] Updated weights for policy 0, policy_version 60520 (0.0007) [2023-03-06 17:59:18,787][23882] Updated weights for policy 0, policy_version 60530 (0.0007) [2023-03-06 17:59:19,585][23882] Updated weights for policy 0, policy_version 60540 (0.0007) [2023-03-06 17:59:20,368][23882] Updated weights for policy 0, policy_version 60550 (0.0006) [2023-03-06 17:59:21,149][23882] Updated weights for policy 0, policy_version 60560 (0.0007) [2023-03-06 17:59:21,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13073.0, 300 sec: 13041.2). Total num frames: 62020608. Throughput: 0: 13076.6. Samples: 62006104. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:59:21,759][23556] Avg episode reward: [(0, '1395.394')] [2023-03-06 17:59:21,939][23882] Updated weights for policy 0, policy_version 60570 (0.0007) [2023-03-06 17:59:22,724][23882] Updated weights for policy 0, policy_version 60580 (0.0007) [2023-03-06 17:59:23,520][23882] Updated weights for policy 0, policy_version 60590 (0.0006) [2023-03-06 17:59:24,316][23882] Updated weights for policy 0, policy_version 60600 (0.0007) [2023-03-06 17:59:25,106][23882] Updated weights for policy 0, policy_version 60610 (0.0007) [2023-03-06 17:59:25,863][23882] Updated weights for policy 0, policy_version 60620 (0.0006) [2023-03-06 17:59:26,019][23831] KL-divergence is very high: 106.6743 [2023-03-06 17:59:26,656][23882] Updated weights for policy 0, policy_version 60630 (0.0006) [2023-03-06 17:59:26,748][23556] Fps is (10 sec: 13107.0, 60 sec: 13073.0, 300 sec: 13044.7). Total num frames: 62086144. Throughput: 0: 13072.3. Samples: 62084371. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 17:59:26,758][23556] Avg episode reward: [(0, '1344.873')] [2023-03-06 17:59:27,436][23882] Updated weights for policy 0, policy_version 60640 (0.0006) [2023-03-06 17:59:27,588][23831] KL-divergence is very high: 31324.4336 [2023-03-06 17:59:28,220][23882] Updated weights for policy 0, policy_version 60650 (0.0006) [2023-03-06 17:59:29,021][23882] Updated weights for policy 0, policy_version 60660 (0.0007) [2023-03-06 17:59:29,790][23882] Updated weights for policy 0, policy_version 60670 (0.0006) [2023-03-06 17:59:30,571][23882] Updated weights for policy 0, policy_version 60680 (0.0006) [2023-03-06 17:59:31,357][23882] Updated weights for policy 0, policy_version 60690 (0.0007) [2023-03-06 17:59:31,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13073.0, 300 sec: 13044.7). Total num frames: 62151680. Throughput: 0: 13070.8. Samples: 62123531. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:59:31,759][23556] Avg episode reward: [(0, '1121.320')] [2023-03-06 17:59:32,133][23882] Updated weights for policy 0, policy_version 60700 (0.0006) [2023-03-06 17:59:32,915][23882] Updated weights for policy 0, policy_version 60710 (0.0007) [2023-03-06 17:59:33,696][23882] Updated weights for policy 0, policy_version 60720 (0.0006) [2023-03-06 17:59:34,478][23882] Updated weights for policy 0, policy_version 60730 (0.0007) [2023-03-06 17:59:35,246][23882] Updated weights for policy 0, policy_version 60740 (0.0006) [2023-03-06 17:59:36,028][23882] Updated weights for policy 0, policy_version 60750 (0.0006) [2023-03-06 17:59:36,748][23556] Fps is (10 sec: 13005.0, 60 sec: 13056.0, 300 sec: 13041.3). Total num frames: 62216192. Throughput: 0: 13070.3. Samples: 62202180. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:59:36,759][23556] Avg episode reward: [(0, '1240.951')] [2023-03-06 17:59:36,826][23882] Updated weights for policy 0, policy_version 60760 (0.0006) [2023-03-06 17:59:37,573][23882] Updated weights for policy 0, policy_version 60770 (0.0006) [2023-03-06 17:59:38,377][23882] Updated weights for policy 0, policy_version 60780 (0.0006) [2023-03-06 17:59:39,149][23882] Updated weights for policy 0, policy_version 60790 (0.0006) [2023-03-06 17:59:39,865][23831] KL-divergence is very high: 171.9745 [2023-03-06 17:59:39,943][23882] Updated weights for policy 0, policy_version 60800 (0.0006) [2023-03-06 17:59:40,721][23882] Updated weights for policy 0, policy_version 60810 (0.0006) [2023-03-06 17:59:41,502][23882] Updated weights for policy 0, policy_version 60820 (0.0006) [2023-03-06 17:59:41,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13090.2, 300 sec: 13048.2). Total num frames: 62282752. Throughput: 0: 13068.7. Samples: 62280679. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:59:41,748][23556] Avg episode reward: [(0, '1470.110')] [2023-03-06 17:59:42,300][23882] Updated weights for policy 0, policy_version 60830 (0.0007) [2023-03-06 17:59:43,067][23882] Updated weights for policy 0, policy_version 60840 (0.0006) [2023-03-06 17:59:43,875][23882] Updated weights for policy 0, policy_version 60850 (0.0007) [2023-03-06 17:59:44,660][23882] Updated weights for policy 0, policy_version 60860 (0.0006) [2023-03-06 17:59:45,431][23882] Updated weights for policy 0, policy_version 60870 (0.0006) [2023-03-06 17:59:46,196][23882] Updated weights for policy 0, policy_version 60880 (0.0006) [2023-03-06 17:59:46,748][23556] Fps is (10 sec: 13107.0, 60 sec: 13056.0, 300 sec: 13044.7). Total num frames: 62347264. Throughput: 0: 13069.0. Samples: 62319790. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:59:46,749][23556] Avg episode reward: [(0, '1525.684')] [2023-03-06 17:59:46,991][23882] Updated weights for policy 0, policy_version 60890 (0.0006) [2023-03-06 17:59:47,786][23882] Updated weights for policy 0, policy_version 60900 (0.0006) [2023-03-06 17:59:48,560][23882] Updated weights for policy 0, policy_version 60910 (0.0006) [2023-03-06 17:59:49,332][23882] Updated weights for policy 0, policy_version 60920 (0.0006) [2023-03-06 17:59:50,117][23882] Updated weights for policy 0, policy_version 60930 (0.0008) [2023-03-06 17:59:50,925][23882] Updated weights for policy 0, policy_version 60940 (0.0007) [2023-03-06 17:59:50,973][23831] KL-divergence is very high: 150.7044 [2023-03-06 17:59:51,694][23882] Updated weights for policy 0, policy_version 60950 (0.0006) [2023-03-06 17:59:51,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13090.1, 300 sec: 13048.2). Total num frames: 62413824. Throughput: 0: 13073.8. Samples: 62398428. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:59:51,748][23556] Avg episode reward: [(0, '1559.713')] [2023-03-06 17:59:52,488][23882] Updated weights for policy 0, policy_version 60960 (0.0007) [2023-03-06 17:59:53,265][23882] Updated weights for policy 0, policy_version 60970 (0.0006) [2023-03-06 17:59:54,043][23882] Updated weights for policy 0, policy_version 60980 (0.0006) [2023-03-06 17:59:54,828][23882] Updated weights for policy 0, policy_version 60990 (0.0007) [2023-03-06 17:59:54,877][23831] KL-divergence is very high: 352.5025 [2023-03-06 17:59:55,598][23882] Updated weights for policy 0, policy_version 61000 (0.0006) [2023-03-06 17:59:56,390][23882] Updated weights for policy 0, policy_version 61010 (0.0006) [2023-03-06 17:59:56,462][23831] KL-divergence is very high: 346.2448 [2023-03-06 17:59:56,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13044.7). Total num frames: 62478336. Throughput: 0: 13071.3. Samples: 62476860. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 17:59:56,748][23556] Avg episode reward: [(0, '1636.577')] [2023-03-06 17:59:56,753][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000061014_62478336.pth... [2023-03-06 17:59:56,784][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000057956_59346944.pth [2023-03-06 17:59:57,166][23882] Updated weights for policy 0, policy_version 61020 (0.0006) [2023-03-06 17:59:57,950][23882] Updated weights for policy 0, policy_version 61030 (0.0007) [2023-03-06 17:59:58,725][23882] Updated weights for policy 0, policy_version 61040 (0.0006) [2023-03-06 17:59:59,517][23882] Updated weights for policy 0, policy_version 61050 (0.0006) [2023-03-06 18:00:00,300][23882] Updated weights for policy 0, policy_version 61060 (0.0006) [2023-03-06 18:00:01,072][23882] Updated weights for policy 0, policy_version 61070 (0.0008) [2023-03-06 18:00:01,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13073.0, 300 sec: 13044.7). Total num frames: 62543872. Throughput: 0: 13077.2. Samples: 62516311. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:00:01,748][23556] Avg episode reward: [(0, '1511.910')] [2023-03-06 18:00:01,858][23882] Updated weights for policy 0, policy_version 61080 (0.0006) [2023-03-06 18:00:02,670][23882] Updated weights for policy 0, policy_version 61090 (0.0006) [2023-03-06 18:00:03,451][23882] Updated weights for policy 0, policy_version 61100 (0.0006) [2023-03-06 18:00:04,225][23882] Updated weights for policy 0, policy_version 61110 (0.0006) [2023-03-06 18:00:05,017][23831] KL-divergence is very high: 2066.1992 [2023-03-06 18:00:05,024][23882] Updated weights for policy 0, policy_version 61120 (0.0007) [2023-03-06 18:00:05,805][23882] Updated weights for policy 0, policy_version 61130 (0.0006) [2023-03-06 18:00:06,579][23882] Updated weights for policy 0, policy_version 61140 (0.0006) [2023-03-06 18:00:06,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13048.2). Total num frames: 62609408. Throughput: 0: 13072.1. Samples: 62594347. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:00:06,748][23556] Avg episode reward: [(0, '1370.949')] [2023-03-06 18:00:07,368][23882] Updated weights for policy 0, policy_version 61150 (0.0007) [2023-03-06 18:00:08,150][23882] Updated weights for policy 0, policy_version 61160 (0.0006) [2023-03-06 18:00:08,918][23882] Updated weights for policy 0, policy_version 61170 (0.0006) [2023-03-06 18:00:09,727][23882] Updated weights for policy 0, policy_version 61180 (0.0006) [2023-03-06 18:00:10,517][23882] Updated weights for policy 0, policy_version 61190 (0.0007) [2023-03-06 18:00:11,308][23882] Updated weights for policy 0, policy_version 61200 (0.0006) [2023-03-06 18:00:11,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13044.7). Total num frames: 62673920. Throughput: 0: 13071.0. Samples: 62672564. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:00:11,749][23556] Avg episode reward: [(0, '1280.790')] [2023-03-06 18:00:12,090][23882] Updated weights for policy 0, policy_version 61210 (0.0006) [2023-03-06 18:00:12,900][23882] Updated weights for policy 0, policy_version 61220 (0.0007) [2023-03-06 18:00:13,658][23882] Updated weights for policy 0, policy_version 61230 (0.0007) [2023-03-06 18:00:14,442][23882] Updated weights for policy 0, policy_version 61240 (0.0006) [2023-03-06 18:00:15,228][23882] Updated weights for policy 0, policy_version 61250 (0.0007) [2023-03-06 18:00:16,011][23882] Updated weights for policy 0, policy_version 61260 (0.0006) [2023-03-06 18:00:16,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13073.1, 300 sec: 13044.7). Total num frames: 62739456. Throughput: 0: 13073.1. Samples: 62711821. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:00:16,748][23556] Avg episode reward: [(0, '1347.316')] [2023-03-06 18:00:16,793][23882] Updated weights for policy 0, policy_version 61270 (0.0006) [2023-03-06 18:00:17,568][23882] Updated weights for policy 0, policy_version 61280 (0.0006) [2023-03-06 18:00:18,357][23882] Updated weights for policy 0, policy_version 61290 (0.0006) [2023-03-06 18:00:19,139][23882] Updated weights for policy 0, policy_version 61300 (0.0007) [2023-03-06 18:00:19,930][23882] Updated weights for policy 0, policy_version 61310 (0.0006) [2023-03-06 18:00:20,716][23882] Updated weights for policy 0, policy_version 61320 (0.0007) [2023-03-06 18:00:21,493][23882] Updated weights for policy 0, policy_version 61330 (0.0006) [2023-03-06 18:00:21,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13048.2). Total num frames: 62804992. Throughput: 0: 13063.1. Samples: 62790021. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:00:21,748][23556] Avg episode reward: [(0, '1444.049')] [2023-03-06 18:00:22,282][23882] Updated weights for policy 0, policy_version 61340 (0.0006) [2023-03-06 18:00:23,060][23882] Updated weights for policy 0, policy_version 61350 (0.0006) [2023-03-06 18:00:23,829][23882] Updated weights for policy 0, policy_version 61360 (0.0007) [2023-03-06 18:00:24,639][23882] Updated weights for policy 0, policy_version 61370 (0.0006) [2023-03-06 18:00:25,437][23882] Updated weights for policy 0, policy_version 61380 (0.0006) [2023-03-06 18:00:26,199][23882] Updated weights for policy 0, policy_version 61390 (0.0006) [2023-03-06 18:00:26,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13048.2). Total num frames: 62870528. Throughput: 0: 13062.5. Samples: 62868489. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:00:26,748][23556] Avg episode reward: [(0, '1409.243')] [2023-03-06 18:00:26,990][23882] Updated weights for policy 0, policy_version 61400 (0.0006) [2023-03-06 18:00:27,785][23882] Updated weights for policy 0, policy_version 61410 (0.0006) [2023-03-06 18:00:28,555][23882] Updated weights for policy 0, policy_version 61420 (0.0007) [2023-03-06 18:00:29,344][23882] Updated weights for policy 0, policy_version 61430 (0.0007) [2023-03-06 18:00:30,115][23882] Updated weights for policy 0, policy_version 61440 (0.0006) [2023-03-06 18:00:30,912][23882] Updated weights for policy 0, policy_version 61450 (0.0007) [2023-03-06 18:00:31,700][23882] Updated weights for policy 0, policy_version 61460 (0.0007) [2023-03-06 18:00:31,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 62935040. Throughput: 0: 13060.5. Samples: 62907512. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:00:31,748][23556] Avg episode reward: [(0, '1447.315')] [2023-03-06 18:00:32,490][23882] Updated weights for policy 0, policy_version 61470 (0.0005) [2023-03-06 18:00:33,277][23882] Updated weights for policy 0, policy_version 61480 (0.0005) [2023-03-06 18:00:34,056][23882] Updated weights for policy 0, policy_version 61490 (0.0007) [2023-03-06 18:00:34,841][23882] Updated weights for policy 0, policy_version 61500 (0.0006) [2023-03-06 18:00:35,622][23882] Updated weights for policy 0, policy_version 61510 (0.0006) [2023-03-06 18:00:36,390][23882] Updated weights for policy 0, policy_version 61520 (0.0005) [2023-03-06 18:00:36,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13073.1, 300 sec: 13048.2). Total num frames: 63000576. Throughput: 0: 13054.6. Samples: 62985884. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:00:36,748][23556] Avg episode reward: [(0, '1164.459')] [2023-03-06 18:00:37,184][23882] Updated weights for policy 0, policy_version 61530 (0.0007) [2023-03-06 18:00:37,968][23882] Updated weights for policy 0, policy_version 61540 (0.0006) [2023-03-06 18:00:38,746][23882] Updated weights for policy 0, policy_version 61550 (0.0006) [2023-03-06 18:00:39,517][23882] Updated weights for policy 0, policy_version 61560 (0.0006) [2023-03-06 18:00:40,290][23882] Updated weights for policy 0, policy_version 61570 (0.0006) [2023-03-06 18:00:41,089][23882] Updated weights for policy 0, policy_version 61580 (0.0008) [2023-03-06 18:00:41,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 63066112. Throughput: 0: 13061.0. Samples: 63064605. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:00:41,748][23556] Avg episode reward: [(0, '1305.427')] [2023-03-06 18:00:41,869][23882] Updated weights for policy 0, policy_version 61590 (0.0006) [2023-03-06 18:00:42,653][23882] Updated weights for policy 0, policy_version 61600 (0.0006) [2023-03-06 18:00:43,421][23882] Updated weights for policy 0, policy_version 61610 (0.0007) [2023-03-06 18:00:44,211][23882] Updated weights for policy 0, policy_version 61620 (0.0007) [2023-03-06 18:00:44,966][23882] Updated weights for policy 0, policy_version 61630 (0.0007) [2023-03-06 18:00:45,761][23882] Updated weights for policy 0, policy_version 61640 (0.0006) [2023-03-06 18:00:46,537][23882] Updated weights for policy 0, policy_version 61650 (0.0006) [2023-03-06 18:00:46,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13051.7). Total num frames: 63131648. Throughput: 0: 13058.4. Samples: 63103940. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:00:46,748][23556] Avg episode reward: [(0, '1305.555')] [2023-03-06 18:00:47,322][23882] Updated weights for policy 0, policy_version 61660 (0.0006) [2023-03-06 18:00:48,106][23882] Updated weights for policy 0, policy_version 61670 (0.0007) [2023-03-06 18:00:48,893][23882] Updated weights for policy 0, policy_version 61680 (0.0006) [2023-03-06 18:00:49,670][23882] Updated weights for policy 0, policy_version 61690 (0.0006) [2023-03-06 18:00:50,468][23882] Updated weights for policy 0, policy_version 61700 (0.0006) [2023-03-06 18:00:51,241][23882] Updated weights for policy 0, policy_version 61710 (0.0007) [2023-03-06 18:00:51,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 63197184. Throughput: 0: 13068.9. Samples: 63182444. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:00:51,748][23556] Avg episode reward: [(0, '1148.182')] [2023-03-06 18:00:52,030][23882] Updated weights for policy 0, policy_version 61720 (0.0007) [2023-03-06 18:00:52,808][23882] Updated weights for policy 0, policy_version 61730 (0.0007) [2023-03-06 18:00:53,607][23882] Updated weights for policy 0, policy_version 61740 (0.0005) [2023-03-06 18:00:54,392][23882] Updated weights for policy 0, policy_version 61750 (0.0006) [2023-03-06 18:00:55,192][23882] Updated weights for policy 0, policy_version 61760 (0.0007) [2023-03-06 18:00:55,962][23882] Updated weights for policy 0, policy_version 61770 (0.0007) [2023-03-06 18:00:56,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 63261696. Throughput: 0: 13064.6. Samples: 63260470. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:00:56,748][23556] Avg episode reward: [(0, '1274.633')] [2023-03-06 18:00:56,770][23882] Updated weights for policy 0, policy_version 61780 (0.0007) [2023-03-06 18:00:57,540][23882] Updated weights for policy 0, policy_version 61790 (0.0008) [2023-03-06 18:00:58,326][23882] Updated weights for policy 0, policy_version 61800 (0.0007) [2023-03-06 18:00:59,115][23882] Updated weights for policy 0, policy_version 61810 (0.0006) [2023-03-06 18:00:59,890][23882] Updated weights for policy 0, policy_version 61820 (0.0006) [2023-03-06 18:01:00,685][23882] Updated weights for policy 0, policy_version 61830 (0.0006) [2023-03-06 18:01:01,464][23882] Updated weights for policy 0, policy_version 61840 (0.0006) [2023-03-06 18:01:01,748][23556] Fps is (10 sec: 13004.6, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 63327232. Throughput: 0: 13062.0. Samples: 63299611. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:01:01,748][23556] Avg episode reward: [(0, '1257.645')] [2023-03-06 18:01:02,246][23882] Updated weights for policy 0, policy_version 61850 (0.0006) [2023-03-06 18:01:03,034][23882] Updated weights for policy 0, policy_version 61860 (0.0006) [2023-03-06 18:01:03,822][23882] Updated weights for policy 0, policy_version 61870 (0.0006) [2023-03-06 18:01:04,599][23882] Updated weights for policy 0, policy_version 61880 (0.0006) [2023-03-06 18:01:05,390][23882] Updated weights for policy 0, policy_version 61890 (0.0006) [2023-03-06 18:01:06,178][23882] Updated weights for policy 0, policy_version 61900 (0.0006) [2023-03-06 18:01:06,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 63392768. Throughput: 0: 13064.8. Samples: 63377939. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:01:06,748][23556] Avg episode reward: [(0, '1332.455')] [2023-03-06 18:01:06,965][23882] Updated weights for policy 0, policy_version 61910 (0.0007) [2023-03-06 18:01:07,734][23882] Updated weights for policy 0, policy_version 61920 (0.0006) [2023-03-06 18:01:08,530][23882] Updated weights for policy 0, policy_version 61930 (0.0006) [2023-03-06 18:01:09,306][23882] Updated weights for policy 0, policy_version 61940 (0.0006) [2023-03-06 18:01:10,093][23882] Updated weights for policy 0, policy_version 61950 (0.0006) [2023-03-06 18:01:10,872][23882] Updated weights for policy 0, policy_version 61960 (0.0007) [2023-03-06 18:01:11,674][23882] Updated weights for policy 0, policy_version 61970 (0.0007) [2023-03-06 18:01:11,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 63457280. Throughput: 0: 13061.0. Samples: 63456233. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:01:11,748][23556] Avg episode reward: [(0, '1402.935')] [2023-03-06 18:01:12,449][23882] Updated weights for policy 0, policy_version 61980 (0.0006) [2023-03-06 18:01:13,226][23882] Updated weights for policy 0, policy_version 61990 (0.0006) [2023-03-06 18:01:14,025][23882] Updated weights for policy 0, policy_version 62000 (0.0007) [2023-03-06 18:01:14,807][23882] Updated weights for policy 0, policy_version 62010 (0.0007) [2023-03-06 18:01:15,572][23882] Updated weights for policy 0, policy_version 62020 (0.0006) [2023-03-06 18:01:16,390][23882] Updated weights for policy 0, policy_version 62030 (0.0006) [2023-03-06 18:01:16,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 63522816. Throughput: 0: 13063.6. Samples: 63495376. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:01:16,748][23556] Avg episode reward: [(0, '1292.130')] [2023-03-06 18:01:17,162][23882] Updated weights for policy 0, policy_version 62040 (0.0007) [2023-03-06 18:01:17,953][23882] Updated weights for policy 0, policy_version 62050 (0.0006) [2023-03-06 18:01:18,734][23882] Updated weights for policy 0, policy_version 62060 (0.0006) [2023-03-06 18:01:19,509][23882] Updated weights for policy 0, policy_version 62070 (0.0006) [2023-03-06 18:01:20,303][23882] Updated weights for policy 0, policy_version 62080 (0.0006) [2023-03-06 18:01:21,073][23882] Updated weights for policy 0, policy_version 62090 (0.0007) [2023-03-06 18:01:21,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 63588352. Throughput: 0: 13064.9. Samples: 63573801. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:01:21,748][23556] Avg episode reward: [(0, '1548.118')] [2023-03-06 18:01:21,848][23882] Updated weights for policy 0, policy_version 62100 (0.0007) [2023-03-06 18:01:22,650][23882] Updated weights for policy 0, policy_version 62110 (0.0007) [2023-03-06 18:01:23,415][23882] Updated weights for policy 0, policy_version 62120 (0.0006) [2023-03-06 18:01:24,191][23882] Updated weights for policy 0, policy_version 62130 (0.0007) [2023-03-06 18:01:24,964][23882] Updated weights for policy 0, policy_version 62140 (0.0007) [2023-03-06 18:01:25,754][23882] Updated weights for policy 0, policy_version 62150 (0.0006) [2023-03-06 18:01:26,534][23882] Updated weights for policy 0, policy_version 62160 (0.0007) [2023-03-06 18:01:26,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 63653888. Throughput: 0: 13064.2. Samples: 63652494. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:01:26,748][23556] Avg episode reward: [(0, '1407.097')] [2023-03-06 18:01:27,334][23882] Updated weights for policy 0, policy_version 62170 (0.0007) [2023-03-06 18:01:28,128][23882] Updated weights for policy 0, policy_version 62180 (0.0007) [2023-03-06 18:01:28,908][23882] Updated weights for policy 0, policy_version 62190 (0.0006) [2023-03-06 18:01:29,698][23882] Updated weights for policy 0, policy_version 62200 (0.0006) [2023-03-06 18:01:30,488][23882] Updated weights for policy 0, policy_version 62210 (0.0007) [2023-03-06 18:01:31,267][23882] Updated weights for policy 0, policy_version 62220 (0.0007) [2023-03-06 18:01:31,748][23556] Fps is (10 sec: 13107.0, 60 sec: 13073.0, 300 sec: 13055.1). Total num frames: 63719424. Throughput: 0: 13055.0. Samples: 63691417. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:01:31,748][23556] Avg episode reward: [(0, '1163.363')] [2023-03-06 18:01:32,050][23882] Updated weights for policy 0, policy_version 62230 (0.0006) [2023-03-06 18:01:32,853][23882] Updated weights for policy 0, policy_version 62240 (0.0006) [2023-03-06 18:01:33,622][23882] Updated weights for policy 0, policy_version 62250 (0.0006) [2023-03-06 18:01:34,417][23882] Updated weights for policy 0, policy_version 62260 (0.0006) [2023-03-06 18:01:35,213][23882] Updated weights for policy 0, policy_version 62270 (0.0006) [2023-03-06 18:01:35,980][23882] Updated weights for policy 0, policy_version 62280 (0.0007) [2023-03-06 18:01:36,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 63783936. Throughput: 0: 13046.1. Samples: 63769520. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:01:36,748][23556] Avg episode reward: [(0, '1276.821')] [2023-03-06 18:01:36,766][23882] Updated weights for policy 0, policy_version 62290 (0.0006) [2023-03-06 18:01:37,528][23882] Updated weights for policy 0, policy_version 62300 (0.0007) [2023-03-06 18:01:38,321][23882] Updated weights for policy 0, policy_version 62310 (0.0006) [2023-03-06 18:01:39,101][23882] Updated weights for policy 0, policy_version 62320 (0.0006) [2023-03-06 18:01:39,887][23882] Updated weights for policy 0, policy_version 62330 (0.0006) [2023-03-06 18:01:40,660][23882] Updated weights for policy 0, policy_version 62340 (0.0006) [2023-03-06 18:01:41,438][23882] Updated weights for policy 0, policy_version 62350 (0.0006) [2023-03-06 18:01:41,748][23556] Fps is (10 sec: 13107.4, 60 sec: 13073.1, 300 sec: 13058.6). Total num frames: 63850496. Throughput: 0: 13065.7. Samples: 63848423. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:01:41,748][23556] Avg episode reward: [(0, '1454.865')] [2023-03-06 18:01:42,221][23882] Updated weights for policy 0, policy_version 62360 (0.0005) [2023-03-06 18:01:43,016][23882] Updated weights for policy 0, policy_version 62370 (0.0007) [2023-03-06 18:01:43,782][23882] Updated weights for policy 0, policy_version 62380 (0.0005) [2023-03-06 18:01:44,567][23882] Updated weights for policy 0, policy_version 62390 (0.0007) [2023-03-06 18:01:45,369][23882] Updated weights for policy 0, policy_version 62400 (0.0006) [2023-03-06 18:01:46,157][23882] Updated weights for policy 0, policy_version 62410 (0.0007) [2023-03-06 18:01:46,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13055.1). Total num frames: 63915008. Throughput: 0: 13064.5. Samples: 63887513. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:01:46,748][23556] Avg episode reward: [(0, '1395.230')] [2023-03-06 18:01:46,941][23882] Updated weights for policy 0, policy_version 62420 (0.0006) [2023-03-06 18:01:47,727][23882] Updated weights for policy 0, policy_version 62430 (0.0007) [2023-03-06 18:01:48,516][23882] Updated weights for policy 0, policy_version 62440 (0.0006) [2023-03-06 18:01:49,298][23882] Updated weights for policy 0, policy_version 62450 (0.0007) [2023-03-06 18:01:50,081][23882] Updated weights for policy 0, policy_version 62460 (0.0007) [2023-03-06 18:01:50,864][23882] Updated weights for policy 0, policy_version 62470 (0.0006) [2023-03-06 18:01:51,653][23882] Updated weights for policy 0, policy_version 62480 (0.0006) [2023-03-06 18:01:51,748][23556] Fps is (10 sec: 12902.3, 60 sec: 13038.9, 300 sec: 13055.1). Total num frames: 63979520. Throughput: 0: 13058.4. Samples: 63965565. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:01:51,748][23556] Avg episode reward: [(0, '1597.505')] [2023-03-06 18:01:52,450][23882] Updated weights for policy 0, policy_version 62490 (0.0006) [2023-03-06 18:01:53,233][23882] Updated weights for policy 0, policy_version 62500 (0.0006) [2023-03-06 18:01:54,022][23882] Updated weights for policy 0, policy_version 62510 (0.0007) [2023-03-06 18:01:54,812][23882] Updated weights for policy 0, policy_version 62520 (0.0006) [2023-03-06 18:01:55,593][23882] Updated weights for policy 0, policy_version 62530 (0.0006) [2023-03-06 18:01:56,371][23882] Updated weights for policy 0, policy_version 62540 (0.0006) [2023-03-06 18:01:56,748][23556] Fps is (10 sec: 13005.0, 60 sec: 13056.0, 300 sec: 13055.1). Total num frames: 64045056. Throughput: 0: 13057.2. Samples: 64043808. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:01:56,748][23556] Avg episode reward: [(0, '1455.124')] [2023-03-06 18:01:56,764][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000062545_64046080.pth... [2023-03-06 18:01:56,793][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000059483_60910592.pth [2023-03-06 18:01:57,158][23882] Updated weights for policy 0, policy_version 62550 (0.0006) [2023-03-06 18:01:57,969][23882] Updated weights for policy 0, policy_version 62560 (0.0006) [2023-03-06 18:01:58,747][23882] Updated weights for policy 0, policy_version 62570 (0.0006) [2023-03-06 18:01:59,536][23882] Updated weights for policy 0, policy_version 62580 (0.0006) [2023-03-06 18:02:00,335][23882] Updated weights for policy 0, policy_version 62590 (0.0007) [2023-03-06 18:02:01,105][23882] Updated weights for policy 0, policy_version 62600 (0.0007) [2023-03-06 18:02:01,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 64110592. Throughput: 0: 13050.2. Samples: 64082632. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:02:01,748][23556] Avg episode reward: [(0, '1505.066')] [2023-03-06 18:02:01,885][23882] Updated weights for policy 0, policy_version 62610 (0.0006) [2023-03-06 18:02:02,682][23882] Updated weights for policy 0, policy_version 62620 (0.0007) [2023-03-06 18:02:03,453][23882] Updated weights for policy 0, policy_version 62630 (0.0007) [2023-03-06 18:02:04,245][23882] Updated weights for policy 0, policy_version 62640 (0.0006) [2023-03-06 18:02:05,020][23882] Updated weights for policy 0, policy_version 62650 (0.0006) [2023-03-06 18:02:05,797][23882] Updated weights for policy 0, policy_version 62660 (0.0007) [2023-03-06 18:02:06,572][23882] Updated weights for policy 0, policy_version 62670 (0.0006) [2023-03-06 18:02:06,748][23556] Fps is (10 sec: 13107.0, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 64176128. Throughput: 0: 13050.2. Samples: 64161061. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:02:06,748][23556] Avg episode reward: [(0, '1426.612')] [2023-03-06 18:02:07,361][23882] Updated weights for policy 0, policy_version 62680 (0.0006) [2023-03-06 18:02:08,146][23882] Updated weights for policy 0, policy_version 62690 (0.0006) [2023-03-06 18:02:08,925][23882] Updated weights for policy 0, policy_version 62700 (0.0006) [2023-03-06 18:02:09,709][23882] Updated weights for policy 0, policy_version 62710 (0.0006) [2023-03-06 18:02:10,512][23882] Updated weights for policy 0, policy_version 62720 (0.0006) [2023-03-06 18:02:11,294][23882] Updated weights for policy 0, policy_version 62730 (0.0006) [2023-03-06 18:02:11,748][23556] Fps is (10 sec: 13004.6, 60 sec: 13056.0, 300 sec: 13055.1). Total num frames: 64240640. Throughput: 0: 13042.3. Samples: 64239399. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:02:11,749][23556] Avg episode reward: [(0, '1518.172')] [2023-03-06 18:02:12,076][23882] Updated weights for policy 0, policy_version 62740 (0.0006) [2023-03-06 18:02:12,846][23882] Updated weights for policy 0, policy_version 62750 (0.0006) [2023-03-06 18:02:13,634][23882] Updated weights for policy 0, policy_version 62760 (0.0007) [2023-03-06 18:02:14,429][23882] Updated weights for policy 0, policy_version 62770 (0.0006) [2023-03-06 18:02:15,193][23882] Updated weights for policy 0, policy_version 62780 (0.0006) [2023-03-06 18:02:15,997][23882] Updated weights for policy 0, policy_version 62790 (0.0006) [2023-03-06 18:02:16,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 64306176. Throughput: 0: 13049.4. Samples: 64278640. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:02:16,749][23556] Avg episode reward: [(0, '1519.934')] [2023-03-06 18:02:16,766][23882] Updated weights for policy 0, policy_version 62800 (0.0007) [2023-03-06 18:02:17,542][23882] Updated weights for policy 0, policy_version 62810 (0.0006) [2023-03-06 18:02:18,349][23882] Updated weights for policy 0, policy_version 62820 (0.0007) [2023-03-06 18:02:19,114][23882] Updated weights for policy 0, policy_version 62830 (0.0006) [2023-03-06 18:02:19,906][23882] Updated weights for policy 0, policy_version 62840 (0.0007) [2023-03-06 18:02:20,700][23882] Updated weights for policy 0, policy_version 62850 (0.0007) [2023-03-06 18:02:21,489][23882] Updated weights for policy 0, policy_version 62860 (0.0006) [2023-03-06 18:02:21,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 64371712. Throughput: 0: 13052.8. Samples: 64356895. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:02:21,748][23556] Avg episode reward: [(0, '1462.795')] [2023-03-06 18:02:22,273][23882] Updated weights for policy 0, policy_version 62870 (0.0006) [2023-03-06 18:02:23,031][23882] Updated weights for policy 0, policy_version 62880 (0.0007) [2023-03-06 18:02:23,847][23882] Updated weights for policy 0, policy_version 62890 (0.0006) [2023-03-06 18:02:24,630][23882] Updated weights for policy 0, policy_version 62900 (0.0006) [2023-03-06 18:02:25,405][23882] Updated weights for policy 0, policy_version 62910 (0.0006) [2023-03-06 18:02:26,186][23882] Updated weights for policy 0, policy_version 62920 (0.0007) [2023-03-06 18:02:26,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 64437248. Throughput: 0: 13042.0. Samples: 64435315. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:02:26,748][23556] Avg episode reward: [(0, '1519.640')] [2023-03-06 18:02:26,973][23882] Updated weights for policy 0, policy_version 62930 (0.0007) [2023-03-06 18:02:27,751][23882] Updated weights for policy 0, policy_version 62940 (0.0006) [2023-03-06 18:02:28,555][23882] Updated weights for policy 0, policy_version 62950 (0.0006) [2023-03-06 18:02:29,344][23882] Updated weights for policy 0, policy_version 62960 (0.0007) [2023-03-06 18:02:30,124][23882] Updated weights for policy 0, policy_version 62970 (0.0006) [2023-03-06 18:02:30,922][23882] Updated weights for policy 0, policy_version 62980 (0.0007) [2023-03-06 18:02:31,708][23882] Updated weights for policy 0, policy_version 62990 (0.0007) [2023-03-06 18:02:31,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13039.0, 300 sec: 13058.6). Total num frames: 64501760. Throughput: 0: 13042.5. Samples: 64474424. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:02:31,748][23556] Avg episode reward: [(0, '1616.962')] [2023-03-06 18:02:32,493][23882] Updated weights for policy 0, policy_version 63000 (0.0006) [2023-03-06 18:02:33,258][23882] Updated weights for policy 0, policy_version 63010 (0.0006) [2023-03-06 18:02:34,065][23882] Updated weights for policy 0, policy_version 63020 (0.0006) [2023-03-06 18:02:34,848][23882] Updated weights for policy 0, policy_version 63030 (0.0006) [2023-03-06 18:02:35,611][23882] Updated weights for policy 0, policy_version 63040 (0.0006) [2023-03-06 18:02:36,424][23882] Updated weights for policy 0, policy_version 63050 (0.0007) [2023-03-06 18:02:36,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 64567296. Throughput: 0: 13038.6. Samples: 64552302. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:02:36,748][23556] Avg episode reward: [(0, '1629.845')] [2023-03-06 18:02:37,201][23882] Updated weights for policy 0, policy_version 63060 (0.0007) [2023-03-06 18:02:37,986][23882] Updated weights for policy 0, policy_version 63070 (0.0007) [2023-03-06 18:02:38,762][23882] Updated weights for policy 0, policy_version 63080 (0.0006) [2023-03-06 18:02:39,529][23882] Updated weights for policy 0, policy_version 63090 (0.0007) [2023-03-06 18:02:40,329][23882] Updated weights for policy 0, policy_version 63100 (0.0006) [2023-03-06 18:02:41,114][23882] Updated weights for policy 0, policy_version 63110 (0.0007) [2023-03-06 18:02:41,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13062.1). Total num frames: 64632832. Throughput: 0: 13045.2. Samples: 64630842. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:02:41,748][23556] Avg episode reward: [(0, '1589.326')] [2023-03-06 18:02:41,894][23882] Updated weights for policy 0, policy_version 63120 (0.0006) [2023-03-06 18:02:42,707][23882] Updated weights for policy 0, policy_version 63130 (0.0006) [2023-03-06 18:02:43,475][23882] Updated weights for policy 0, policy_version 63140 (0.0006) [2023-03-06 18:02:44,247][23882] Updated weights for policy 0, policy_version 63150 (0.0007) [2023-03-06 18:02:45,034][23882] Updated weights for policy 0, policy_version 63160 (0.0006) [2023-03-06 18:02:45,815][23882] Updated weights for policy 0, policy_version 63170 (0.0006) [2023-03-06 18:02:46,609][23882] Updated weights for policy 0, policy_version 63180 (0.0007) [2023-03-06 18:02:46,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13058.6). Total num frames: 64697344. Throughput: 0: 13052.8. Samples: 64670010. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:02:46,748][23556] Avg episode reward: [(0, '1635.738')] [2023-03-06 18:02:47,384][23882] Updated weights for policy 0, policy_version 63190 (0.0007) [2023-03-06 18:02:48,175][23882] Updated weights for policy 0, policy_version 63200 (0.0007) [2023-03-06 18:02:48,954][23882] Updated weights for policy 0, policy_version 63210 (0.0006) [2023-03-06 18:02:49,720][23882] Updated weights for policy 0, policy_version 63220 (0.0006) [2023-03-06 18:02:50,508][23882] Updated weights for policy 0, policy_version 63230 (0.0006) [2023-03-06 18:02:51,291][23882] Updated weights for policy 0, policy_version 63240 (0.0007) [2023-03-06 18:02:51,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13062.1). Total num frames: 64763904. Throughput: 0: 13059.5. Samples: 64748736. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:02:51,748][23556] Avg episode reward: [(0, '1661.631')] [2023-03-06 18:02:52,073][23882] Updated weights for policy 0, policy_version 63250 (0.0006) [2023-03-06 18:02:52,837][23882] Updated weights for policy 0, policy_version 63260 (0.0007) [2023-03-06 18:02:53,635][23882] Updated weights for policy 0, policy_version 63270 (0.0007) [2023-03-06 18:02:54,426][23882] Updated weights for policy 0, policy_version 63280 (0.0007) [2023-03-06 18:02:55,199][23882] Updated weights for policy 0, policy_version 63290 (0.0007) [2023-03-06 18:02:55,981][23882] Updated weights for policy 0, policy_version 63300 (0.0006) [2023-03-06 18:02:56,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13062.1). Total num frames: 64828416. Throughput: 0: 13062.6. Samples: 64827217. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:02:56,748][23556] Avg episode reward: [(0, '1889.553')] [2023-03-06 18:02:56,753][23831] Saving new best policy, reward=1889.553! [2023-03-06 18:02:56,754][23882] Updated weights for policy 0, policy_version 63310 (0.0006) [2023-03-06 18:02:57,539][23882] Updated weights for policy 0, policy_version 63320 (0.0006) [2023-03-06 18:02:58,337][23882] Updated weights for policy 0, policy_version 63330 (0.0006) [2023-03-06 18:02:59,124][23882] Updated weights for policy 0, policy_version 63340 (0.0006) [2023-03-06 18:02:59,904][23882] Updated weights for policy 0, policy_version 63350 (0.0007) [2023-03-06 18:03:00,697][23882] Updated weights for policy 0, policy_version 63360 (0.0006) [2023-03-06 18:03:01,485][23882] Updated weights for policy 0, policy_version 63370 (0.0006) [2023-03-06 18:03:01,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13062.1). Total num frames: 64893952. Throughput: 0: 13057.7. Samples: 64866233. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:03:01,748][23556] Avg episode reward: [(0, '1721.913')] [2023-03-06 18:03:02,252][23882] Updated weights for policy 0, policy_version 63380 (0.0007) [2023-03-06 18:03:03,046][23882] Updated weights for policy 0, policy_version 63390 (0.0006) [2023-03-06 18:03:03,829][23882] Updated weights for policy 0, policy_version 63400 (0.0006) [2023-03-06 18:03:04,617][23882] Updated weights for policy 0, policy_version 63410 (0.0006) [2023-03-06 18:03:05,395][23882] Updated weights for policy 0, policy_version 63420 (0.0006) [2023-03-06 18:03:06,174][23882] Updated weights for policy 0, policy_version 63430 (0.0006) [2023-03-06 18:03:06,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13062.1). Total num frames: 64959488. Throughput: 0: 13062.5. Samples: 64944709. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:03:06,748][23556] Avg episode reward: [(0, '1707.728')] [2023-03-06 18:03:06,952][23882] Updated weights for policy 0, policy_version 63440 (0.0007) [2023-03-06 18:03:07,733][23882] Updated weights for policy 0, policy_version 63450 (0.0006) [2023-03-06 18:03:08,512][23882] Updated weights for policy 0, policy_version 63460 (0.0006) [2023-03-06 18:03:09,284][23882] Updated weights for policy 0, policy_version 63470 (0.0006) [2023-03-06 18:03:10,065][23882] Updated weights for policy 0, policy_version 63480 (0.0006) [2023-03-06 18:03:10,866][23882] Updated weights for policy 0, policy_version 63490 (0.0006) [2023-03-06 18:03:11,631][23882] Updated weights for policy 0, policy_version 63500 (0.0006) [2023-03-06 18:03:11,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13065.6). Total num frames: 65025024. Throughput: 0: 13069.2. Samples: 65023425. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:03:11,748][23556] Avg episode reward: [(0, '1385.018')] [2023-03-06 18:03:12,402][23882] Updated weights for policy 0, policy_version 63510 (0.0006) [2023-03-06 18:03:13,204][23882] Updated weights for policy 0, policy_version 63520 (0.0007) [2023-03-06 18:03:13,984][23882] Updated weights for policy 0, policy_version 63530 (0.0006) [2023-03-06 18:03:14,779][23882] Updated weights for policy 0, policy_version 63540 (0.0006) [2023-03-06 18:03:15,558][23882] Updated weights for policy 0, policy_version 63550 (0.0006) [2023-03-06 18:03:16,346][23882] Updated weights for policy 0, policy_version 63560 (0.0006) [2023-03-06 18:03:16,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13065.5). Total num frames: 65090560. Throughput: 0: 13067.8. Samples: 65062475. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:03:16,748][23556] Avg episode reward: [(0, '1446.274')] [2023-03-06 18:03:17,122][23882] Updated weights for policy 0, policy_version 63570 (0.0007) [2023-03-06 18:03:17,912][23882] Updated weights for policy 0, policy_version 63580 (0.0006) [2023-03-06 18:03:18,707][23882] Updated weights for policy 0, policy_version 63590 (0.0006) [2023-03-06 18:03:19,497][23882] Updated weights for policy 0, policy_version 63600 (0.0007) [2023-03-06 18:03:20,256][23882] Updated weights for policy 0, policy_version 63610 (0.0006) [2023-03-06 18:03:20,491][23831] KL-divergence is very high: 766.5893 [2023-03-06 18:03:21,025][23882] Updated weights for policy 0, policy_version 63620 (0.0006) [2023-03-06 18:03:21,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13062.1). Total num frames: 65155072. Throughput: 0: 13080.8. Samples: 65140938. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:03:21,748][23556] Avg episode reward: [(0, '1657.432')] [2023-03-06 18:03:21,829][23882] Updated weights for policy 0, policy_version 63630 (0.0009) [2023-03-06 18:03:22,621][23882] Updated weights for policy 0, policy_version 63640 (0.0005) [2023-03-06 18:03:23,382][23882] Updated weights for policy 0, policy_version 63650 (0.0007) [2023-03-06 18:03:24,162][23882] Updated weights for policy 0, policy_version 63660 (0.0007) [2023-03-06 18:03:24,949][23882] Updated weights for policy 0, policy_version 63670 (0.0005) [2023-03-06 18:03:25,733][23882] Updated weights for policy 0, policy_version 63680 (0.0006) [2023-03-06 18:03:26,502][23882] Updated weights for policy 0, policy_version 63690 (0.0006) [2023-03-06 18:03:26,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13062.1). Total num frames: 65220608. Throughput: 0: 13083.3. Samples: 65219589. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:03:26,748][23556] Avg episode reward: [(0, '468.475')] [2023-03-06 18:03:27,275][23882] Updated weights for policy 0, policy_version 63700 (0.0006) [2023-03-06 18:03:28,066][23882] Updated weights for policy 0, policy_version 63710 (0.0006) [2023-03-06 18:03:28,849][23882] Updated weights for policy 0, policy_version 63720 (0.0006) [2023-03-06 18:03:29,628][23882] Updated weights for policy 0, policy_version 63730 (0.0005) [2023-03-06 18:03:30,421][23882] Updated weights for policy 0, policy_version 63740 (0.0006) [2023-03-06 18:03:31,209][23882] Updated weights for policy 0, policy_version 63750 (0.0006) [2023-03-06 18:03:31,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13062.1). Total num frames: 65286144. Throughput: 0: 13085.0. Samples: 65258834. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:03:31,748][23556] Avg episode reward: [(0, '345.175')] [2023-03-06 18:03:31,982][23882] Updated weights for policy 0, policy_version 63760 (0.0007) [2023-03-06 18:03:32,782][23882] Updated weights for policy 0, policy_version 63770 (0.0006) [2023-03-06 18:03:33,564][23882] Updated weights for policy 0, policy_version 63780 (0.0006) [2023-03-06 18:03:34,381][23882] Updated weights for policy 0, policy_version 63790 (0.0006) [2023-03-06 18:03:35,173][23882] Updated weights for policy 0, policy_version 63800 (0.0006) [2023-03-06 18:03:35,936][23882] Updated weights for policy 0, policy_version 63810 (0.0007) [2023-03-06 18:03:36,731][23882] Updated weights for policy 0, policy_version 63820 (0.0007) [2023-03-06 18:03:36,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13073.1, 300 sec: 13065.6). Total num frames: 65351680. Throughput: 0: 13066.0. Samples: 65336709. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:03:36,748][23556] Avg episode reward: [(0, '1516.545')] [2023-03-06 18:03:37,529][23882] Updated weights for policy 0, policy_version 63830 (0.0007) [2023-03-06 18:03:38,289][23882] Updated weights for policy 0, policy_version 63840 (0.0006) [2023-03-06 18:03:39,064][23882] Updated weights for policy 0, policy_version 63850 (0.0006) [2023-03-06 18:03:39,854][23882] Updated weights for policy 0, policy_version 63860 (0.0006) [2023-03-06 18:03:40,617][23882] Updated weights for policy 0, policy_version 63870 (0.0006) [2023-03-06 18:03:41,398][23882] Updated weights for policy 0, policy_version 63880 (0.0006) [2023-03-06 18:03:41,748][23556] Fps is (10 sec: 13107.0, 60 sec: 13073.0, 300 sec: 13062.1). Total num frames: 65417216. Throughput: 0: 13075.6. Samples: 65415621. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:03:41,748][23556] Avg episode reward: [(0, '1537.515')] [2023-03-06 18:03:42,190][23882] Updated weights for policy 0, policy_version 63890 (0.0007) [2023-03-06 18:03:42,963][23882] Updated weights for policy 0, policy_version 63900 (0.0006) [2023-03-06 18:03:43,741][23882] Updated weights for policy 0, policy_version 63910 (0.0006) [2023-03-06 18:03:44,539][23882] Updated weights for policy 0, policy_version 63920 (0.0007) [2023-03-06 18:03:45,342][23882] Updated weights for policy 0, policy_version 63930 (0.0007) [2023-03-06 18:03:46,113][23882] Updated weights for policy 0, policy_version 63940 (0.0006) [2023-03-06 18:03:46,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13073.1, 300 sec: 13062.1). Total num frames: 65481728. Throughput: 0: 13077.7. Samples: 65454730. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:03:46,748][23556] Avg episode reward: [(0, '1613.476')] [2023-03-06 18:03:46,900][23882] Updated weights for policy 0, policy_version 63950 (0.0006) [2023-03-06 18:03:47,689][23882] Updated weights for policy 0, policy_version 63960 (0.0007) [2023-03-06 18:03:48,470][23831] KL-divergence is very high: 254.6693 [2023-03-06 18:03:48,477][23882] Updated weights for policy 0, policy_version 63970 (0.0008) [2023-03-06 18:03:49,254][23882] Updated weights for policy 0, policy_version 63980 (0.0006) [2023-03-06 18:03:50,045][23882] Updated weights for policy 0, policy_version 63990 (0.0007) [2023-03-06 18:03:50,834][23882] Updated weights for policy 0, policy_version 64000 (0.0007) [2023-03-06 18:03:51,613][23882] Updated weights for policy 0, policy_version 64010 (0.0006) [2023-03-06 18:03:51,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13062.1). Total num frames: 65547264. Throughput: 0: 13070.8. Samples: 65532895. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:03:51,748][23556] Avg episode reward: [(0, '1583.551')] [2023-03-06 18:03:52,401][23882] Updated weights for policy 0, policy_version 64020 (0.0006) [2023-03-06 18:03:53,161][23882] Updated weights for policy 0, policy_version 64030 (0.0006) [2023-03-06 18:03:53,955][23882] Updated weights for policy 0, policy_version 64040 (0.0007) [2023-03-06 18:03:54,736][23882] Updated weights for policy 0, policy_version 64050 (0.0006) [2023-03-06 18:03:55,533][23882] Updated weights for policy 0, policy_version 64060 (0.0006) [2023-03-06 18:03:56,329][23882] Updated weights for policy 0, policy_version 64070 (0.0007) [2023-03-06 18:03:56,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13073.1, 300 sec: 13062.1). Total num frames: 65612800. Throughput: 0: 13057.5. Samples: 65611012. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:03:56,748][23556] Avg episode reward: [(0, '1427.386')] [2023-03-06 18:03:56,753][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000064075_65612800.pth... [2023-03-06 18:03:56,784][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000061014_62478336.pth [2023-03-06 18:03:57,107][23882] Updated weights for policy 0, policy_version 64080 (0.0006) [2023-03-06 18:03:57,896][23882] Updated weights for policy 0, policy_version 64090 (0.0006) [2023-03-06 18:03:58,687][23882] Updated weights for policy 0, policy_version 64100 (0.0007) [2023-03-06 18:03:59,465][23882] Updated weights for policy 0, policy_version 64110 (0.0006) [2023-03-06 18:04:00,273][23882] Updated weights for policy 0, policy_version 64120 (0.0006) [2023-03-06 18:04:01,041][23882] Updated weights for policy 0, policy_version 64130 (0.0007) [2023-03-06 18:04:01,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13062.1). Total num frames: 65678336. Throughput: 0: 13059.1. Samples: 65650133. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 18:04:01,748][23556] Avg episode reward: [(0, '1281.492')] [2023-03-06 18:04:01,815][23882] Updated weights for policy 0, policy_version 64140 (0.0007) [2023-03-06 18:04:02,593][23882] Updated weights for policy 0, policy_version 64150 (0.0006) [2023-03-06 18:04:03,373][23882] Updated weights for policy 0, policy_version 64160 (0.0006) [2023-03-06 18:04:04,156][23882] Updated weights for policy 0, policy_version 64170 (0.0007) [2023-03-06 18:04:04,969][23882] Updated weights for policy 0, policy_version 64180 (0.0007) [2023-03-06 18:04:05,733][23882] Updated weights for policy 0, policy_version 64190 (0.0006) [2023-03-06 18:04:06,534][23882] Updated weights for policy 0, policy_version 64200 (0.0006) [2023-03-06 18:04:06,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 65742848. Throughput: 0: 13056.9. Samples: 65728500. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 18:04:06,748][23556] Avg episode reward: [(0, '1482.598')] [2023-03-06 18:04:07,303][23882] Updated weights for policy 0, policy_version 64210 (0.0006) [2023-03-06 18:04:08,093][23882] Updated weights for policy 0, policy_version 64220 (0.0007) [2023-03-06 18:04:08,882][23882] Updated weights for policy 0, policy_version 64230 (0.0006) [2023-03-06 18:04:09,662][23882] Updated weights for policy 0, policy_version 64240 (0.0006) [2023-03-06 18:04:10,460][23882] Updated weights for policy 0, policy_version 64250 (0.0007) [2023-03-06 18:04:11,243][23882] Updated weights for policy 0, policy_version 64260 (0.0007) [2023-03-06 18:04:11,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13062.1). Total num frames: 65808384. Throughput: 0: 13046.4. Samples: 65806680. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 18:04:11,748][23556] Avg episode reward: [(0, '1447.787')] [2023-03-06 18:04:12,030][23882] Updated weights for policy 0, policy_version 64270 (0.0006) [2023-03-06 18:04:12,792][23882] Updated weights for policy 0, policy_version 64280 (0.0006) [2023-03-06 18:04:13,589][23882] Updated weights for policy 0, policy_version 64290 (0.0007) [2023-03-06 18:04:14,345][23882] Updated weights for policy 0, policy_version 64300 (0.0006) [2023-03-06 18:04:15,131][23882] Updated weights for policy 0, policy_version 64310 (0.0007) [2023-03-06 18:04:15,921][23882] Updated weights for policy 0, policy_version 64320 (0.0007) [2023-03-06 18:04:16,134][23831] KL-divergence is very high: 2532.0144 [2023-03-06 18:04:16,704][23882] Updated weights for policy 0, policy_version 64330 (0.0007) [2023-03-06 18:04:16,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13062.1). Total num frames: 65873920. Throughput: 0: 13055.8. Samples: 65846348. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 18:04:16,749][23556] Avg episode reward: [(0, '1245.933')] [2023-03-06 18:04:16,848][23831] KL-divergence is very high: 344.0534 [2023-03-06 18:04:17,469][23882] Updated weights for policy 0, policy_version 64340 (0.0006) [2023-03-06 18:04:18,262][23882] Updated weights for policy 0, policy_version 64350 (0.0006) [2023-03-06 18:04:19,039][23882] Updated weights for policy 0, policy_version 64360 (0.0006) [2023-03-06 18:04:19,277][23831] KL-divergence is very high: 1402.0992 [2023-03-06 18:04:19,825][23882] Updated weights for policy 0, policy_version 64370 (0.0006) [2023-03-06 18:04:20,382][23831] KL-divergence is very high: 313.5172 [2023-03-06 18:04:20,622][23882] Updated weights for policy 0, policy_version 64380 (0.0006) [2023-03-06 18:04:21,403][23882] Updated weights for policy 0, policy_version 64390 (0.0006) [2023-03-06 18:04:21,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13062.1). Total num frames: 65939456. Throughput: 0: 13066.5. Samples: 65924703. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 18:04:21,748][23556] Avg episode reward: [(0, '1038.528')] [2023-03-06 18:04:22,184][23882] Updated weights for policy 0, policy_version 64400 (0.0006) [2023-03-06 18:04:22,982][23882] Updated weights for policy 0, policy_version 64410 (0.0006) [2023-03-06 18:04:23,774][23882] Updated weights for policy 0, policy_version 64420 (0.0006) [2023-03-06 18:04:24,548][23882] Updated weights for policy 0, policy_version 64430 (0.0007) [2023-03-06 18:04:25,340][23882] Updated weights for policy 0, policy_version 64440 (0.0006) [2023-03-06 18:04:26,124][23882] Updated weights for policy 0, policy_version 64450 (0.0007) [2023-03-06 18:04:26,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 66003968. Throughput: 0: 13049.0. Samples: 66002825. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 18:04:26,748][23556] Avg episode reward: [(0, '1046.486')] [2023-03-06 18:04:26,905][23882] Updated weights for policy 0, policy_version 64460 (0.0005) [2023-03-06 18:04:27,689][23882] Updated weights for policy 0, policy_version 64470 (0.0006) [2023-03-06 18:04:28,308][23831] KL-divergence is very high: 608.5107 [2023-03-06 18:04:28,475][23882] Updated weights for policy 0, policy_version 64480 (0.0007) [2023-03-06 18:04:29,281][23882] Updated weights for policy 0, policy_version 64490 (0.0006) [2023-03-06 18:04:30,055][23882] Updated weights for policy 0, policy_version 64500 (0.0006) [2023-03-06 18:04:30,833][23882] Updated weights for policy 0, policy_version 64510 (0.0006) [2023-03-06 18:04:31,632][23882] Updated weights for policy 0, policy_version 64520 (0.0006) [2023-03-06 18:04:31,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13062.1). Total num frames: 66069504. Throughput: 0: 13049.8. Samples: 66041971. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 18:04:31,748][23556] Avg episode reward: [(0, '1188.559')] [2023-03-06 18:04:32,432][23882] Updated weights for policy 0, policy_version 64530 (0.0006) [2023-03-06 18:04:33,206][23882] Updated weights for policy 0, policy_version 64540 (0.0006) [2023-03-06 18:04:33,986][23882] Updated weights for policy 0, policy_version 64550 (0.0006) [2023-03-06 18:04:34,777][23882] Updated weights for policy 0, policy_version 64560 (0.0007) [2023-03-06 18:04:35,551][23882] Updated weights for policy 0, policy_version 64570 (0.0006) [2023-03-06 18:04:36,330][23882] Updated weights for policy 0, policy_version 64580 (0.0006) [2023-03-06 18:04:36,412][23831] KL-divergence is very high: 108.6388 [2023-03-06 18:04:36,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 66135040. Throughput: 0: 13049.0. Samples: 66120101. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:04:36,748][23556] Avg episode reward: [(0, '1314.165')] [2023-03-06 18:04:37,110][23882] Updated weights for policy 0, policy_version 64590 (0.0007) [2023-03-06 18:04:37,893][23882] Updated weights for policy 0, policy_version 64600 (0.0007) [2023-03-06 18:04:38,701][23882] Updated weights for policy 0, policy_version 64610 (0.0007) [2023-03-06 18:04:39,474][23882] Updated weights for policy 0, policy_version 64620 (0.0006) [2023-03-06 18:04:40,289][23882] Updated weights for policy 0, policy_version 64630 (0.0006) [2023-03-06 18:04:41,069][23882] Updated weights for policy 0, policy_version 64640 (0.0006) [2023-03-06 18:04:41,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13039.0, 300 sec: 13058.6). Total num frames: 66199552. Throughput: 0: 13046.8. Samples: 66198117. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:04:41,748][23556] Avg episode reward: [(0, '1232.699')] [2023-03-06 18:04:41,846][23882] Updated weights for policy 0, policy_version 64650 (0.0007) [2023-03-06 18:04:42,623][23882] Updated weights for policy 0, policy_version 64660 (0.0006) [2023-03-06 18:04:43,415][23882] Updated weights for policy 0, policy_version 64670 (0.0007) [2023-03-06 18:04:44,198][23882] Updated weights for policy 0, policy_version 64680 (0.0006) [2023-03-06 18:04:44,981][23882] Updated weights for policy 0, policy_version 64690 (0.0006) [2023-03-06 18:04:45,780][23882] Updated weights for policy 0, policy_version 64700 (0.0007) [2023-03-06 18:04:46,550][23882] Updated weights for policy 0, policy_version 64710 (0.0007) [2023-03-06 18:04:46,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13055.1). Total num frames: 66265088. Throughput: 0: 13050.0. Samples: 66237385. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:04:46,759][23556] Avg episode reward: [(0, '1208.285')] [2023-03-06 18:04:47,353][23882] Updated weights for policy 0, policy_version 64720 (0.0007) [2023-03-06 18:04:48,124][23882] Updated weights for policy 0, policy_version 64730 (0.0006) [2023-03-06 18:04:48,905][23882] Updated weights for policy 0, policy_version 64740 (0.0007) [2023-03-06 18:04:49,699][23882] Updated weights for policy 0, policy_version 64750 (0.0006) [2023-03-06 18:04:50,479][23882] Updated weights for policy 0, policy_version 64760 (0.0007) [2023-03-06 18:04:51,282][23882] Updated weights for policy 0, policy_version 64770 (0.0006) [2023-03-06 18:04:51,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13055.1). Total num frames: 66329600. Throughput: 0: 13042.1. Samples: 66315394. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:04:51,759][23556] Avg episode reward: [(0, '1359.337')] [2023-03-06 18:04:52,069][23882] Updated weights for policy 0, policy_version 64780 (0.0007) [2023-03-06 18:04:52,838][23882] Updated weights for policy 0, policy_version 64790 (0.0006) [2023-03-06 18:04:53,633][23882] Updated weights for policy 0, policy_version 64800 (0.0007) [2023-03-06 18:04:54,421][23882] Updated weights for policy 0, policy_version 64810 (0.0007) [2023-03-06 18:04:55,192][23882] Updated weights for policy 0, policy_version 64820 (0.0006) [2023-03-06 18:04:55,984][23882] Updated weights for policy 0, policy_version 64830 (0.0006) [2023-03-06 18:04:56,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13055.1). Total num frames: 66395136. Throughput: 0: 13048.5. Samples: 66393863. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:04:56,758][23556] Avg episode reward: [(0, '1423.633')] [2023-03-06 18:04:56,758][23882] Updated weights for policy 0, policy_version 64840 (0.0006) [2023-03-06 18:04:57,555][23882] Updated weights for policy 0, policy_version 64850 (0.0006) [2023-03-06 18:04:58,344][23882] Updated weights for policy 0, policy_version 64860 (0.0005) [2023-03-06 18:04:59,105][23882] Updated weights for policy 0, policy_version 64870 (0.0007) [2023-03-06 18:04:59,886][23882] Updated weights for policy 0, policy_version 64880 (0.0006) [2023-03-06 18:05:00,686][23882] Updated weights for policy 0, policy_version 64890 (0.0006) [2023-03-06 18:05:01,472][23882] Updated weights for policy 0, policy_version 64900 (0.0006) [2023-03-06 18:05:01,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13038.9, 300 sec: 13055.1). Total num frames: 66460672. Throughput: 0: 13038.3. Samples: 66433071. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:05:01,748][23556] Avg episode reward: [(0, '1745.496')] [2023-03-06 18:05:02,282][23882] Updated weights for policy 0, policy_version 64910 (0.0006) [2023-03-06 18:05:03,037][23831] KL-divergence is very high: 4476.7158 [2023-03-06 18:05:03,044][23882] Updated weights for policy 0, policy_version 64920 (0.0006) [2023-03-06 18:05:03,821][23882] Updated weights for policy 0, policy_version 64930 (0.0006) [2023-03-06 18:05:04,598][23882] Updated weights for policy 0, policy_version 64940 (0.0007) [2023-03-06 18:05:05,378][23882] Updated weights for policy 0, policy_version 64950 (0.0006) [2023-03-06 18:05:06,158][23882] Updated weights for policy 0, policy_version 64960 (0.0006) [2023-03-06 18:05:06,748][23556] Fps is (10 sec: 13107.4, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 66526208. Throughput: 0: 13038.8. Samples: 66511449. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:05:06,748][23556] Avg episode reward: [(0, '1390.749')] [2023-03-06 18:05:06,951][23882] Updated weights for policy 0, policy_version 64970 (0.0006) [2023-03-06 18:05:07,733][23882] Updated weights for policy 0, policy_version 64980 (0.0007) [2023-03-06 18:05:08,521][23882] Updated weights for policy 0, policy_version 64990 (0.0007) [2023-03-06 18:05:09,291][23882] Updated weights for policy 0, policy_version 65000 (0.0006) [2023-03-06 18:05:10,081][23882] Updated weights for policy 0, policy_version 65010 (0.0006) [2023-03-06 18:05:10,862][23882] Updated weights for policy 0, policy_version 65020 (0.0005) [2023-03-06 18:05:11,641][23882] Updated weights for policy 0, policy_version 65030 (0.0007) [2023-03-06 18:05:11,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 66591744. Throughput: 0: 13047.8. Samples: 66589977. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:05:11,748][23556] Avg episode reward: [(0, '1319.552')] [2023-03-06 18:05:12,430][23882] Updated weights for policy 0, policy_version 65040 (0.0006) [2023-03-06 18:05:13,203][23882] Updated weights for policy 0, policy_version 65050 (0.0006) [2023-03-06 18:05:13,978][23882] Updated weights for policy 0, policy_version 65060 (0.0006) [2023-03-06 18:05:14,765][23882] Updated weights for policy 0, policy_version 65070 (0.0006) [2023-03-06 18:05:15,555][23882] Updated weights for policy 0, policy_version 65080 (0.0007) [2023-03-06 18:05:16,319][23882] Updated weights for policy 0, policy_version 65090 (0.0007) [2023-03-06 18:05:16,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 66657280. Throughput: 0: 13056.4. Samples: 66629506. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 18:05:16,748][23556] Avg episode reward: [(0, '1244.938')] [2023-03-06 18:05:17,109][23882] Updated weights for policy 0, policy_version 65100 (0.0006) [2023-03-06 18:05:17,906][23882] Updated weights for policy 0, policy_version 65110 (0.0006) [2023-03-06 18:05:18,685][23882] Updated weights for policy 0, policy_version 65120 (0.0007) [2023-03-06 18:05:19,465][23882] Updated weights for policy 0, policy_version 65130 (0.0006) [2023-03-06 18:05:20,237][23882] Updated weights for policy 0, policy_version 65140 (0.0006) [2023-03-06 18:05:21,027][23882] Updated weights for policy 0, policy_version 65150 (0.0006) [2023-03-06 18:05:21,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 66722816. Throughput: 0: 13064.3. Samples: 66707992. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 18:05:21,748][23556] Avg episode reward: [(0, '1463.150')] [2023-03-06 18:05:21,797][23882] Updated weights for policy 0, policy_version 65160 (0.0006) [2023-03-06 18:05:22,584][23882] Updated weights for policy 0, policy_version 65170 (0.0006) [2023-03-06 18:05:23,378][23882] Updated weights for policy 0, policy_version 65180 (0.0006) [2023-03-06 18:05:24,172][23882] Updated weights for policy 0, policy_version 65190 (0.0006) [2023-03-06 18:05:24,953][23882] Updated weights for policy 0, policy_version 65200 (0.0007) [2023-03-06 18:05:25,734][23882] Updated weights for policy 0, policy_version 65210 (0.0006) [2023-03-06 18:05:26,511][23882] Updated weights for policy 0, policy_version 65220 (0.0006) [2023-03-06 18:05:26,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 66787328. Throughput: 0: 13070.1. Samples: 66786272. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 18:05:26,748][23556] Avg episode reward: [(0, '1401.631')] [2023-03-06 18:05:27,304][23882] Updated weights for policy 0, policy_version 65230 (0.0007) [2023-03-06 18:05:28,092][23882] Updated weights for policy 0, policy_version 65240 (0.0006) [2023-03-06 18:05:28,873][23882] Updated weights for policy 0, policy_version 65250 (0.0006) [2023-03-06 18:05:29,661][23882] Updated weights for policy 0, policy_version 65260 (0.0006) [2023-03-06 18:05:30,438][23882] Updated weights for policy 0, policy_version 65270 (0.0006) [2023-03-06 18:05:31,210][23882] Updated weights for policy 0, policy_version 65280 (0.0007) [2023-03-06 18:05:31,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 66852864. Throughput: 0: 13064.8. Samples: 66825300. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 18:05:31,748][23556] Avg episode reward: [(0, '1570.976')] [2023-03-06 18:05:32,008][23882] Updated weights for policy 0, policy_version 65290 (0.0006) [2023-03-06 18:05:32,784][23882] Updated weights for policy 0, policy_version 65300 (0.0007) [2023-03-06 18:05:33,570][23882] Updated weights for policy 0, policy_version 65310 (0.0006) [2023-03-06 18:05:34,374][23882] Updated weights for policy 0, policy_version 65320 (0.0006) [2023-03-06 18:05:35,144][23882] Updated weights for policy 0, policy_version 65330 (0.0006) [2023-03-06 18:05:35,918][23882] Updated weights for policy 0, policy_version 65340 (0.0006) [2023-03-06 18:05:36,722][23882] Updated weights for policy 0, policy_version 65350 (0.0006) [2023-03-06 18:05:36,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 66918400. Throughput: 0: 13073.8. Samples: 66903715. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 18:05:36,748][23556] Avg episode reward: [(0, '1698.721')] [2023-03-06 18:05:37,512][23882] Updated weights for policy 0, policy_version 65360 (0.0006) [2023-03-06 18:05:38,289][23882] Updated weights for policy 0, policy_version 65370 (0.0006) [2023-03-06 18:05:39,077][23882] Updated weights for policy 0, policy_version 65380 (0.0006) [2023-03-06 18:05:39,862][23882] Updated weights for policy 0, policy_version 65390 (0.0006) [2023-03-06 18:05:40,173][23831] KL-divergence is very high: 2014.1179 [2023-03-06 18:05:40,386][23831] KL-divergence is very high: 105.9030 [2023-03-06 18:05:40,645][23882] Updated weights for policy 0, policy_version 65400 (0.0007) [2023-03-06 18:05:41,432][23882] Updated weights for policy 0, policy_version 65410 (0.0007) [2023-03-06 18:05:41,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13055.1). Total num frames: 66982912. Throughput: 0: 13065.0. Samples: 66981787. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 18:05:41,748][23556] Avg episode reward: [(0, '1547.678')] [2023-03-06 18:05:42,234][23882] Updated weights for policy 0, policy_version 65420 (0.0006) [2023-03-06 18:05:42,992][23882] Updated weights for policy 0, policy_version 65430 (0.0007) [2023-03-06 18:05:43,780][23882] Updated weights for policy 0, policy_version 65440 (0.0007) [2023-03-06 18:05:44,574][23882] Updated weights for policy 0, policy_version 65450 (0.0006) [2023-03-06 18:05:45,373][23882] Updated weights for policy 0, policy_version 65460 (0.0007) [2023-03-06 18:05:46,147][23882] Updated weights for policy 0, policy_version 65470 (0.0006) [2023-03-06 18:05:46,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13055.1). Total num frames: 67048448. Throughput: 0: 13064.8. Samples: 67020990. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 18:05:46,748][23556] Avg episode reward: [(0, '1480.690')] [2023-03-06 18:05:46,936][23882] Updated weights for policy 0, policy_version 65480 (0.0006) [2023-03-06 18:05:47,704][23882] Updated weights for policy 0, policy_version 65490 (0.0006) [2023-03-06 18:05:48,490][23882] Updated weights for policy 0, policy_version 65500 (0.0007) [2023-03-06 18:05:49,277][23882] Updated weights for policy 0, policy_version 65510 (0.0006) [2023-03-06 18:05:50,065][23882] Updated weights for policy 0, policy_version 65520 (0.0006) [2023-03-06 18:05:50,849][23882] Updated weights for policy 0, policy_version 65530 (0.0006) [2023-03-06 18:05:51,632][23882] Updated weights for policy 0, policy_version 65540 (0.0007) [2023-03-06 18:05:51,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13058.6). Total num frames: 67113984. Throughput: 0: 13066.2. Samples: 67099429. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 18:05:51,748][23556] Avg episode reward: [(0, '1343.811')] [2023-03-06 18:05:52,405][23882] Updated weights for policy 0, policy_version 65550 (0.0006) [2023-03-06 18:05:53,180][23882] Updated weights for policy 0, policy_version 65560 (0.0007) [2023-03-06 18:05:53,964][23882] Updated weights for policy 0, policy_version 65570 (0.0007) [2023-03-06 18:05:54,755][23882] Updated weights for policy 0, policy_version 65580 (0.0007) [2023-03-06 18:05:55,535][23882] Updated weights for policy 0, policy_version 65590 (0.0006) [2023-03-06 18:05:56,325][23882] Updated weights for policy 0, policy_version 65600 (0.0007) [2023-03-06 18:05:56,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13058.6). Total num frames: 67179520. Throughput: 0: 13065.8. Samples: 67177938. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:05:56,748][23556] Avg episode reward: [(0, '1634.001')] [2023-03-06 18:05:56,752][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000065605_67179520.pth... [2023-03-06 18:05:56,784][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000062545_64046080.pth [2023-03-06 18:05:57,116][23882] Updated weights for policy 0, policy_version 65610 (0.0006) [2023-03-06 18:05:57,331][23831] KL-divergence is very high: 274.7841 [2023-03-06 18:05:57,874][23831] KL-divergence is very high: 498.5749 [2023-03-06 18:05:57,880][23882] Updated weights for policy 0, policy_version 65620 (0.0007) [2023-03-06 18:05:58,679][23882] Updated weights for policy 0, policy_version 65630 (0.0007) [2023-03-06 18:05:59,379][23831] KL-divergence is very high: 422.5384 [2023-03-06 18:05:59,461][23831] KL-divergence is very high: 876.5392 [2023-03-06 18:05:59,468][23882] Updated weights for policy 0, policy_version 65640 (0.0006) [2023-03-06 18:05:59,542][23831] KL-divergence is very high: 355.4499 [2023-03-06 18:05:59,937][23831] KL-divergence is very high: 241.2653 [2023-03-06 18:06:00,261][23882] Updated weights for policy 0, policy_version 65650 (0.0007) [2023-03-06 18:06:01,031][23882] Updated weights for policy 0, policy_version 65660 (0.0006) [2023-03-06 18:06:01,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13058.6). Total num frames: 67245056. Throughput: 0: 13053.5. Samples: 67216914. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:06:01,748][23556] Avg episode reward: [(0, '1563.246')] [2023-03-06 18:06:01,838][23882] Updated weights for policy 0, policy_version 65670 (0.0006) [2023-03-06 18:06:02,596][23882] Updated weights for policy 0, policy_version 65680 (0.0006) [2023-03-06 18:06:03,381][23882] Updated weights for policy 0, policy_version 65690 (0.0006) [2023-03-06 18:06:04,158][23882] Updated weights for policy 0, policy_version 65700 (0.0006) [2023-03-06 18:06:04,920][23882] Updated weights for policy 0, policy_version 65710 (0.0006) [2023-03-06 18:06:05,703][23882] Updated weights for policy 0, policy_version 65720 (0.0006) [2023-03-06 18:06:06,511][23882] Updated weights for policy 0, policy_version 65730 (0.0007) [2023-03-06 18:06:06,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 67309568. Throughput: 0: 13056.2. Samples: 67295523. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:06:06,748][23556] Avg episode reward: [(0, '1232.622')] [2023-03-06 18:06:07,294][23882] Updated weights for policy 0, policy_version 65740 (0.0006) [2023-03-06 18:06:08,084][23882] Updated weights for policy 0, policy_version 65750 (0.0006) [2023-03-06 18:06:08,887][23882] Updated weights for policy 0, policy_version 65760 (0.0006) [2023-03-06 18:06:09,666][23882] Updated weights for policy 0, policy_version 65770 (0.0006) [2023-03-06 18:06:10,426][23882] Updated weights for policy 0, policy_version 65780 (0.0006) [2023-03-06 18:06:11,211][23882] Updated weights for policy 0, policy_version 65790 (0.0006) [2023-03-06 18:06:11,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 67375104. Throughput: 0: 13059.0. Samples: 67373928. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:06:11,748][23556] Avg episode reward: [(0, '933.194')] [2023-03-06 18:06:11,992][23882] Updated weights for policy 0, policy_version 65800 (0.0007) [2023-03-06 18:06:12,765][23882] Updated weights for policy 0, policy_version 65810 (0.0006) [2023-03-06 18:06:13,550][23882] Updated weights for policy 0, policy_version 65820 (0.0006) [2023-03-06 18:06:14,354][23882] Updated weights for policy 0, policy_version 65830 (0.0006) [2023-03-06 18:06:15,131][23882] Updated weights for policy 0, policy_version 65840 (0.0007) [2023-03-06 18:06:15,929][23882] Updated weights for policy 0, policy_version 65850 (0.0006) [2023-03-06 18:06:16,701][23882] Updated weights for policy 0, policy_version 65860 (0.0007) [2023-03-06 18:06:16,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 67440640. Throughput: 0: 13062.8. Samples: 67413126. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:06:16,748][23556] Avg episode reward: [(0, '1130.266')] [2023-03-06 18:06:17,487][23882] Updated weights for policy 0, policy_version 65870 (0.0006) [2023-03-06 18:06:18,275][23882] Updated weights for policy 0, policy_version 65880 (0.0007) [2023-03-06 18:06:19,066][23882] Updated weights for policy 0, policy_version 65890 (0.0006) [2023-03-06 18:06:19,832][23882] Updated weights for policy 0, policy_version 65900 (0.0006) [2023-03-06 18:06:20,624][23882] Updated weights for policy 0, policy_version 65910 (0.0008) [2023-03-06 18:06:21,422][23882] Updated weights for policy 0, policy_version 65920 (0.0007) [2023-03-06 18:06:21,748][23556] Fps is (10 sec: 13107.0, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 67506176. Throughput: 0: 13059.6. Samples: 67491398. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:06:21,749][23556] Avg episode reward: [(0, '1436.669')] [2023-03-06 18:06:22,198][23882] Updated weights for policy 0, policy_version 65930 (0.0006) [2023-03-06 18:06:22,974][23882] Updated weights for policy 0, policy_version 65940 (0.0005) [2023-03-06 18:06:23,762][23882] Updated weights for policy 0, policy_version 65950 (0.0006) [2023-03-06 18:06:24,536][23882] Updated weights for policy 0, policy_version 65960 (0.0006) [2023-03-06 18:06:25,317][23882] Updated weights for policy 0, policy_version 65970 (0.0006) [2023-03-06 18:06:26,096][23882] Updated weights for policy 0, policy_version 65980 (0.0006) [2023-03-06 18:06:26,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13058.6). Total num frames: 67571712. Throughput: 0: 13070.3. Samples: 67569950. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:06:26,748][23556] Avg episode reward: [(0, '1456.774')] [2023-03-06 18:06:26,876][23882] Updated weights for policy 0, policy_version 65990 (0.0006) [2023-03-06 18:06:27,658][23882] Updated weights for policy 0, policy_version 66000 (0.0006) [2023-03-06 18:06:28,435][23882] Updated weights for policy 0, policy_version 66010 (0.0007) [2023-03-06 18:06:29,238][23882] Updated weights for policy 0, policy_version 66020 (0.0006) [2023-03-06 18:06:30,014][23882] Updated weights for policy 0, policy_version 66030 (0.0007) [2023-03-06 18:06:30,796][23882] Updated weights for policy 0, policy_version 66040 (0.0007) [2023-03-06 18:06:31,585][23882] Updated weights for policy 0, policy_version 66050 (0.0007) [2023-03-06 18:06:31,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 67636224. Throughput: 0: 13068.8. Samples: 67609085. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:06:31,748][23556] Avg episode reward: [(0, '1603.905')] [2023-03-06 18:06:32,362][23882] Updated weights for policy 0, policy_version 66060 (0.0006) [2023-03-06 18:06:33,144][23882] Updated weights for policy 0, policy_version 66070 (0.0006) [2023-03-06 18:06:33,936][23882] Updated weights for policy 0, policy_version 66080 (0.0006) [2023-03-06 18:06:34,715][23882] Updated weights for policy 0, policy_version 66090 (0.0006) [2023-03-06 18:06:35,502][23882] Updated weights for policy 0, policy_version 66100 (0.0007) [2023-03-06 18:06:36,302][23882] Updated weights for policy 0, policy_version 66110 (0.0006) [2023-03-06 18:06:36,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13055.1). Total num frames: 67701760. Throughput: 0: 13067.7. Samples: 67687475. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:06:36,748][23556] Avg episode reward: [(0, '1507.937')] [2023-03-06 18:06:37,081][23882] Updated weights for policy 0, policy_version 66120 (0.0006) [2023-03-06 18:06:37,855][23882] Updated weights for policy 0, policy_version 66130 (0.0007) [2023-03-06 18:06:38,494][23831] KL-divergence is very high: 4753.2783 [2023-03-06 18:06:38,662][23882] Updated weights for policy 0, policy_version 66140 (0.0007) [2023-03-06 18:06:39,449][23882] Updated weights for policy 0, policy_version 66150 (0.0006) [2023-03-06 18:06:40,221][23882] Updated weights for policy 0, policy_version 66160 (0.0005) [2023-03-06 18:06:41,028][23882] Updated weights for policy 0, policy_version 66170 (0.0006) [2023-03-06 18:06:41,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13073.0, 300 sec: 13058.6). Total num frames: 67767296. Throughput: 0: 13059.6. Samples: 67765619. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:06:41,748][23556] Avg episode reward: [(0, '1759.770')] [2023-03-06 18:06:41,798][23882] Updated weights for policy 0, policy_version 66180 (0.0007) [2023-03-06 18:06:42,566][23882] Updated weights for policy 0, policy_version 66190 (0.0007) [2023-03-06 18:06:43,369][23882] Updated weights for policy 0, policy_version 66200 (0.0007) [2023-03-06 18:06:44,143][23882] Updated weights for policy 0, policy_version 66210 (0.0008) [2023-03-06 18:06:44,918][23882] Updated weights for policy 0, policy_version 66220 (0.0006) [2023-03-06 18:06:45,711][23882] Updated weights for policy 0, policy_version 66230 (0.0006) [2023-03-06 18:06:46,498][23882] Updated weights for policy 0, policy_version 66240 (0.0007) [2023-03-06 18:06:46,748][23556] Fps is (10 sec: 13004.6, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 67831808. Throughput: 0: 13065.6. Samples: 67804869. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:06:46,748][23556] Avg episode reward: [(0, '1794.614')] [2023-03-06 18:06:47,282][23882] Updated weights for policy 0, policy_version 66250 (0.0006) [2023-03-06 18:06:48,069][23882] Updated weights for policy 0, policy_version 66260 (0.0006) [2023-03-06 18:06:48,869][23882] Updated weights for policy 0, policy_version 66270 (0.0007) [2023-03-06 18:06:49,653][23882] Updated weights for policy 0, policy_version 66280 (0.0006) [2023-03-06 18:06:50,426][23882] Updated weights for policy 0, policy_version 66290 (0.0006) [2023-03-06 18:06:51,226][23882] Updated weights for policy 0, policy_version 66300 (0.0006) [2023-03-06 18:06:51,303][23831] KL-divergence is very high: 168.4738 [2023-03-06 18:06:51,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 67897344. Throughput: 0: 13054.4. Samples: 67882974. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:06:51,748][23556] Avg episode reward: [(0, '1794.016')] [2023-03-06 18:06:52,009][23882] Updated weights for policy 0, policy_version 66310 (0.0006) [2023-03-06 18:06:52,791][23882] Updated weights for policy 0, policy_version 66320 (0.0006) [2023-03-06 18:06:53,574][23882] Updated weights for policy 0, policy_version 66330 (0.0007) [2023-03-06 18:06:54,354][23882] Updated weights for policy 0, policy_version 66340 (0.0006) [2023-03-06 18:06:55,142][23882] Updated weights for policy 0, policy_version 66350 (0.0007) [2023-03-06 18:06:55,935][23882] Updated weights for policy 0, policy_version 66360 (0.0006) [2023-03-06 18:06:56,721][23882] Updated weights for policy 0, policy_version 66370 (0.0006) [2023-03-06 18:06:56,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 67962880. Throughput: 0: 13048.9. Samples: 67961128. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:06:56,748][23556] Avg episode reward: [(0, '1701.445')] [2023-03-06 18:06:57,498][23882] Updated weights for policy 0, policy_version 66380 (0.0006) [2023-03-06 18:06:58,278][23882] Updated weights for policy 0, policy_version 66390 (0.0006) [2023-03-06 18:06:59,065][23882] Updated weights for policy 0, policy_version 66400 (0.0006) [2023-03-06 18:06:59,844][23882] Updated weights for policy 0, policy_version 66410 (0.0006) [2023-03-06 18:07:00,622][23882] Updated weights for policy 0, policy_version 66420 (0.0006) [2023-03-06 18:07:01,397][23882] Updated weights for policy 0, policy_version 66430 (0.0007) [2023-03-06 18:07:01,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 68028416. Throughput: 0: 13053.0. Samples: 68000509. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:07:01,748][23556] Avg episode reward: [(0, '1709.770')] [2023-03-06 18:07:02,189][23882] Updated weights for policy 0, policy_version 66440 (0.0007) [2023-03-06 18:07:02,335][23831] KL-divergence is very high: 544.1923 [2023-03-06 18:07:02,974][23882] Updated weights for policy 0, policy_version 66450 (0.0007) [2023-03-06 18:07:03,741][23882] Updated weights for policy 0, policy_version 66460 (0.0006) [2023-03-06 18:07:04,530][23882] Updated weights for policy 0, policy_version 66470 (0.0007) [2023-03-06 18:07:05,306][23882] Updated weights for policy 0, policy_version 66480 (0.0007) [2023-03-06 18:07:06,091][23882] Updated weights for policy 0, policy_version 66490 (0.0007) [2023-03-06 18:07:06,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13062.1). Total num frames: 68093952. Throughput: 0: 13058.7. Samples: 68079037. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:07:06,748][23556] Avg episode reward: [(0, '1717.321')] [2023-03-06 18:07:06,887][23882] Updated weights for policy 0, policy_version 66500 (0.0007) [2023-03-06 18:07:07,667][23882] Updated weights for policy 0, policy_version 66510 (0.0007) [2023-03-06 18:07:08,441][23882] Updated weights for policy 0, policy_version 66520 (0.0007) [2023-03-06 18:07:09,221][23882] Updated weights for policy 0, policy_version 66530 (0.0006) [2023-03-06 18:07:10,008][23882] Updated weights for policy 0, policy_version 66540 (0.0006) [2023-03-06 18:07:10,791][23882] Updated weights for policy 0, policy_version 66550 (0.0006) [2023-03-06 18:07:11,569][23882] Updated weights for policy 0, policy_version 66560 (0.0006) [2023-03-06 18:07:11,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 68158464. Throughput: 0: 13055.3. Samples: 68157440. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:07:11,748][23556] Avg episode reward: [(0, '1537.816')] [2023-03-06 18:07:12,378][23882] Updated weights for policy 0, policy_version 66570 (0.0007) [2023-03-06 18:07:13,166][23882] Updated weights for policy 0, policy_version 66580 (0.0006) [2023-03-06 18:07:13,942][23882] Updated weights for policy 0, policy_version 66590 (0.0006) [2023-03-06 18:07:14,718][23882] Updated weights for policy 0, policy_version 66600 (0.0006) [2023-03-06 18:07:15,490][23882] Updated weights for policy 0, policy_version 66610 (0.0006) [2023-03-06 18:07:16,269][23882] Updated weights for policy 0, policy_version 66620 (0.0007) [2023-03-06 18:07:16,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13062.1). Total num frames: 68225024. Throughput: 0: 13057.8. Samples: 68196687. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:07:16,748][23556] Avg episode reward: [(0, '1318.861')] [2023-03-06 18:07:17,054][23882] Updated weights for policy 0, policy_version 66630 (0.0006) [2023-03-06 18:07:17,841][23882] Updated weights for policy 0, policy_version 66640 (0.0006) [2023-03-06 18:07:18,620][23882] Updated weights for policy 0, policy_version 66650 (0.0006) [2023-03-06 18:07:19,038][23831] KL-divergence is very high: 426.3123 [2023-03-06 18:07:19,420][23882] Updated weights for policy 0, policy_version 66660 (0.0006) [2023-03-06 18:07:20,214][23882] Updated weights for policy 0, policy_version 66670 (0.0007) [2023-03-06 18:07:21,006][23882] Updated weights for policy 0, policy_version 66680 (0.0006) [2023-03-06 18:07:21,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 68289536. Throughput: 0: 13051.3. Samples: 68274782. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:07:21,748][23556] Avg episode reward: [(0, '1549.817')] [2023-03-06 18:07:21,800][23882] Updated weights for policy 0, policy_version 66690 (0.0007) [2023-03-06 18:07:22,417][23831] KL-divergence is very high: 370.7087 [2023-03-06 18:07:22,585][23882] Updated weights for policy 0, policy_version 66700 (0.0006) [2023-03-06 18:07:23,362][23882] Updated weights for policy 0, policy_version 66710 (0.0006) [2023-03-06 18:07:24,158][23882] Updated weights for policy 0, policy_version 66720 (0.0007) [2023-03-06 18:07:24,939][23882] Updated weights for policy 0, policy_version 66730 (0.0006) [2023-03-06 18:07:25,734][23882] Updated weights for policy 0, policy_version 66740 (0.0006) [2023-03-06 18:07:26,032][23831] KL-divergence is very high: 438.3655 [2023-03-06 18:07:26,506][23882] Updated weights for policy 0, policy_version 66750 (0.0006) [2023-03-06 18:07:26,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13062.1). Total num frames: 68355072. Throughput: 0: 13055.3. Samples: 68353106. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:07:26,748][23556] Avg episode reward: [(0, '1306.980')] [2023-03-06 18:07:27,306][23882] Updated weights for policy 0, policy_version 66760 (0.0006) [2023-03-06 18:07:28,085][23882] Updated weights for policy 0, policy_version 66770 (0.0006) [2023-03-06 18:07:28,868][23882] Updated weights for policy 0, policy_version 66780 (0.0006) [2023-03-06 18:07:29,666][23882] Updated weights for policy 0, policy_version 66790 (0.0006) [2023-03-06 18:07:30,436][23882] Updated weights for policy 0, policy_version 66800 (0.0006) [2023-03-06 18:07:31,210][23882] Updated weights for policy 0, policy_version 66810 (0.0006) [2023-03-06 18:07:31,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 68419584. Throughput: 0: 13048.7. Samples: 68392057. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:07:31,748][23556] Avg episode reward: [(0, '1679.948')] [2023-03-06 18:07:31,994][23882] Updated weights for policy 0, policy_version 66820 (0.0006) [2023-03-06 18:07:32,789][23882] Updated weights for policy 0, policy_version 66830 (0.0006) [2023-03-06 18:07:33,561][23882] Updated weights for policy 0, policy_version 66840 (0.0006) [2023-03-06 18:07:34,347][23882] Updated weights for policy 0, policy_version 66850 (0.0007) [2023-03-06 18:07:35,145][23882] Updated weights for policy 0, policy_version 66860 (0.0006) [2023-03-06 18:07:35,928][23882] Updated weights for policy 0, policy_version 66870 (0.0006) [2023-03-06 18:07:36,693][23882] Updated weights for policy 0, policy_version 66880 (0.0006) [2023-03-06 18:07:36,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 68485120. Throughput: 0: 13057.6. Samples: 68470563. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:07:36,748][23556] Avg episode reward: [(0, '1643.234')] [2023-03-06 18:07:37,484][23882] Updated weights for policy 0, policy_version 66890 (0.0007) [2023-03-06 18:07:38,269][23882] Updated weights for policy 0, policy_version 66900 (0.0006) [2023-03-06 18:07:39,046][23882] Updated weights for policy 0, policy_version 66910 (0.0006) [2023-03-06 18:07:39,841][23882] Updated weights for policy 0, policy_version 66920 (0.0007) [2023-03-06 18:07:40,610][23882] Updated weights for policy 0, policy_version 66930 (0.0007) [2023-03-06 18:07:41,398][23882] Updated weights for policy 0, policy_version 66940 (0.0007) [2023-03-06 18:07:41,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13062.1). Total num frames: 68550656. Throughput: 0: 13061.8. Samples: 68548906. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:07:41,748][23556] Avg episode reward: [(0, '1601.385')] [2023-03-06 18:07:42,205][23882] Updated weights for policy 0, policy_version 66950 (0.0006) [2023-03-06 18:07:42,984][23882] Updated weights for policy 0, policy_version 66960 (0.0006) [2023-03-06 18:07:43,764][23882] Updated weights for policy 0, policy_version 66970 (0.0007) [2023-03-06 18:07:44,538][23882] Updated weights for policy 0, policy_version 66980 (0.0006) [2023-03-06 18:07:45,317][23882] Updated weights for policy 0, policy_version 66990 (0.0006) [2023-03-06 18:07:46,096][23882] Updated weights for policy 0, policy_version 67000 (0.0007) [2023-03-06 18:07:46,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13073.1, 300 sec: 13058.6). Total num frames: 68616192. Throughput: 0: 13057.1. Samples: 68588080. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:07:46,748][23556] Avg episode reward: [(0, '1710.041')] [2023-03-06 18:07:46,868][23882] Updated weights for policy 0, policy_version 67010 (0.0006) [2023-03-06 18:07:47,668][23882] Updated weights for policy 0, policy_version 67020 (0.0005) [2023-03-06 18:07:48,447][23882] Updated weights for policy 0, policy_version 67030 (0.0006) [2023-03-06 18:07:49,234][23882] Updated weights for policy 0, policy_version 67040 (0.0005) [2023-03-06 18:07:50,025][23882] Updated weights for policy 0, policy_version 67050 (0.0006) [2023-03-06 18:07:50,831][23882] Updated weights for policy 0, policy_version 67060 (0.0007) [2023-03-06 18:07:51,600][23882] Updated weights for policy 0, policy_version 67070 (0.0006) [2023-03-06 18:07:51,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13062.1). Total num frames: 68681728. Throughput: 0: 13051.0. Samples: 68666328. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:07:51,748][23556] Avg episode reward: [(0, '1719.745')] [2023-03-06 18:07:52,390][23882] Updated weights for policy 0, policy_version 67080 (0.0006) [2023-03-06 18:07:53,154][23882] Updated weights for policy 0, policy_version 67090 (0.0007) [2023-03-06 18:07:53,956][23882] Updated weights for policy 0, policy_version 67100 (0.0006) [2023-03-06 18:07:54,736][23882] Updated weights for policy 0, policy_version 67110 (0.0006) [2023-03-06 18:07:55,535][23882] Updated weights for policy 0, policy_version 67120 (0.0006) [2023-03-06 18:07:56,309][23882] Updated weights for policy 0, policy_version 67130 (0.0006) [2023-03-06 18:07:56,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 68746240. Throughput: 0: 13050.7. Samples: 68744722. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:07:56,748][23556] Avg episode reward: [(0, '1909.803')] [2023-03-06 18:07:56,753][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000067135_68746240.pth... [2023-03-06 18:07:56,785][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000064075_65612800.pth [2023-03-06 18:07:56,788][23831] Saving new best policy, reward=1909.803! [2023-03-06 18:07:57,111][23882] Updated weights for policy 0, policy_version 67140 (0.0007) [2023-03-06 18:07:57,897][23882] Updated weights for policy 0, policy_version 67150 (0.0006) [2023-03-06 18:07:58,678][23882] Updated weights for policy 0, policy_version 67160 (0.0007) [2023-03-06 18:07:59,476][23882] Updated weights for policy 0, policy_version 67170 (0.0007) [2023-03-06 18:08:00,261][23882] Updated weights for policy 0, policy_version 67180 (0.0007) [2023-03-06 18:08:01,053][23882] Updated weights for policy 0, policy_version 67190 (0.0006) [2023-03-06 18:08:01,748][23556] Fps is (10 sec: 12902.3, 60 sec: 13038.9, 300 sec: 13055.1). Total num frames: 68810752. Throughput: 0: 13041.1. Samples: 68783536. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:08:01,748][23556] Avg episode reward: [(0, '1758.643')] [2023-03-06 18:08:01,853][23882] Updated weights for policy 0, policy_version 67200 (0.0006) [2023-03-06 18:08:02,644][23882] Updated weights for policy 0, policy_version 67210 (0.0006) [2023-03-06 18:08:03,433][23882] Updated weights for policy 0, policy_version 67220 (0.0007) [2023-03-06 18:08:04,221][23882] Updated weights for policy 0, policy_version 67230 (0.0006) [2023-03-06 18:08:05,004][23882] Updated weights for policy 0, policy_version 67240 (0.0006) [2023-03-06 18:08:05,774][23882] Updated weights for policy 0, policy_version 67250 (0.0006) [2023-03-06 18:08:06,601][23882] Updated weights for policy 0, policy_version 67260 (0.0006) [2023-03-06 18:08:06,748][23556] Fps is (10 sec: 12902.6, 60 sec: 13021.9, 300 sec: 13051.7). Total num frames: 68875264. Throughput: 0: 13034.6. Samples: 68861340. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:08:06,748][23556] Avg episode reward: [(0, '1765.079')] [2023-03-06 18:08:07,368][23882] Updated weights for policy 0, policy_version 67270 (0.0007) [2023-03-06 18:08:08,148][23882] Updated weights for policy 0, policy_version 67280 (0.0007) [2023-03-06 18:08:08,922][23882] Updated weights for policy 0, policy_version 67290 (0.0006) [2023-03-06 18:08:09,718][23882] Updated weights for policy 0, policy_version 67300 (0.0006) [2023-03-06 18:08:10,501][23882] Updated weights for policy 0, policy_version 67310 (0.0006) [2023-03-06 18:08:11,310][23882] Updated weights for policy 0, policy_version 67320 (0.0005) [2023-03-06 18:08:11,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 68940800. Throughput: 0: 13028.9. Samples: 68939407. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:08:11,748][23556] Avg episode reward: [(0, '1910.224')] [2023-03-06 18:08:11,749][23831] Saving new best policy, reward=1910.224! [2023-03-06 18:08:12,090][23882] Updated weights for policy 0, policy_version 67330 (0.0006) [2023-03-06 18:08:12,887][23882] Updated weights for policy 0, policy_version 67340 (0.0007) [2023-03-06 18:08:13,670][23882] Updated weights for policy 0, policy_version 67350 (0.0007) [2023-03-06 18:08:14,460][23882] Updated weights for policy 0, policy_version 67360 (0.0007) [2023-03-06 18:08:15,230][23882] Updated weights for policy 0, policy_version 67370 (0.0006) [2023-03-06 18:08:16,018][23882] Updated weights for policy 0, policy_version 67380 (0.0006) [2023-03-06 18:08:16,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13021.8, 300 sec: 13055.1). Total num frames: 69006336. Throughput: 0: 13030.8. Samples: 68978441. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:08:16,748][23556] Avg episode reward: [(0, '1757.239')] [2023-03-06 18:08:16,788][23882] Updated weights for policy 0, policy_version 67390 (0.0008) [2023-03-06 18:08:17,567][23882] Updated weights for policy 0, policy_version 67400 (0.0006) [2023-03-06 18:08:18,350][23882] Updated weights for policy 0, policy_version 67410 (0.0007) [2023-03-06 18:08:19,149][23882] Updated weights for policy 0, policy_version 67420 (0.0006) [2023-03-06 18:08:19,917][23882] Updated weights for policy 0, policy_version 67430 (0.0007) [2023-03-06 18:08:20,706][23882] Updated weights for policy 0, policy_version 67440 (0.0005) [2023-03-06 18:08:21,482][23882] Updated weights for policy 0, policy_version 67450 (0.0006) [2023-03-06 18:08:21,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13038.9, 300 sec: 13055.1). Total num frames: 69071872. Throughput: 0: 13033.8. Samples: 69057084. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:08:21,748][23556] Avg episode reward: [(0, '1621.697')] [2023-03-06 18:08:22,272][23882] Updated weights for policy 0, policy_version 67460 (0.0007) [2023-03-06 18:08:23,064][23882] Updated weights for policy 0, policy_version 67470 (0.0006) [2023-03-06 18:08:23,846][23882] Updated weights for policy 0, policy_version 67480 (0.0006) [2023-03-06 18:08:24,633][23882] Updated weights for policy 0, policy_version 67490 (0.0006) [2023-03-06 18:08:25,397][23882] Updated weights for policy 0, policy_version 67500 (0.0006) [2023-03-06 18:08:26,196][23882] Updated weights for policy 0, policy_version 67510 (0.0005) [2023-03-06 18:08:26,748][23556] Fps is (10 sec: 13004.6, 60 sec: 13021.8, 300 sec: 13051.7). Total num frames: 69136384. Throughput: 0: 13030.1. Samples: 69135263. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:08:26,749][23556] Avg episode reward: [(0, '1787.691')] [2023-03-06 18:08:26,987][23882] Updated weights for policy 0, policy_version 67520 (0.0006) [2023-03-06 18:08:27,766][23882] Updated weights for policy 0, policy_version 67530 (0.0006) [2023-03-06 18:08:28,560][23882] Updated weights for policy 0, policy_version 67540 (0.0007) [2023-03-06 18:08:29,329][23882] Updated weights for policy 0, policy_version 67550 (0.0006) [2023-03-06 18:08:30,092][23882] Updated weights for policy 0, policy_version 67560 (0.0006) [2023-03-06 18:08:30,669][23831] KL-divergence is very high: 119.0032 [2023-03-06 18:08:30,905][23882] Updated weights for policy 0, policy_version 67570 (0.0007) [2023-03-06 18:08:31,689][23882] Updated weights for policy 0, policy_version 67580 (0.0006) [2023-03-06 18:08:31,748][23556] Fps is (10 sec: 13004.6, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 69201920. Throughput: 0: 13032.7. Samples: 69174554. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:08:31,749][23556] Avg episode reward: [(0, '1751.172')] [2023-03-06 18:08:32,470][23882] Updated weights for policy 0, policy_version 67590 (0.0006) [2023-03-06 18:08:33,263][23882] Updated weights for policy 0, policy_version 67600 (0.0006) [2023-03-06 18:08:34,042][23882] Updated weights for policy 0, policy_version 67610 (0.0006) [2023-03-06 18:08:34,812][23882] Updated weights for policy 0, policy_version 67620 (0.0006) [2023-03-06 18:08:35,605][23882] Updated weights for policy 0, policy_version 67630 (0.0007) [2023-03-06 18:08:36,404][23882] Updated weights for policy 0, policy_version 67640 (0.0006) [2023-03-06 18:08:36,748][23556] Fps is (10 sec: 13107.6, 60 sec: 13039.0, 300 sec: 13051.7). Total num frames: 69267456. Throughput: 0: 13034.3. Samples: 69252871. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:08:36,748][23556] Avg episode reward: [(0, '1731.143')] [2023-03-06 18:08:37,201][23882] Updated weights for policy 0, policy_version 67650 (0.0007) [2023-03-06 18:08:37,581][23831] KL-divergence is very high: 447.7305 [2023-03-06 18:08:37,979][23882] Updated weights for policy 0, policy_version 67660 (0.0006) [2023-03-06 18:08:38,784][23882] Updated weights for policy 0, policy_version 67670 (0.0006) [2023-03-06 18:08:39,564][23882] Updated weights for policy 0, policy_version 67680 (0.0008) [2023-03-06 18:08:40,345][23882] Updated weights for policy 0, policy_version 67690 (0.0006) [2023-03-06 18:08:41,120][23882] Updated weights for policy 0, policy_version 67700 (0.0006) [2023-03-06 18:08:41,748][23556] Fps is (10 sec: 13005.0, 60 sec: 13021.9, 300 sec: 13051.7). Total num frames: 69331968. Throughput: 0: 13024.4. Samples: 69330817. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:08:41,748][23556] Avg episode reward: [(0, '1743.317')] [2023-03-06 18:08:41,915][23882] Updated weights for policy 0, policy_version 67710 (0.0006) [2023-03-06 18:08:42,690][23882] Updated weights for policy 0, policy_version 67720 (0.0007) [2023-03-06 18:08:43,489][23882] Updated weights for policy 0, policy_version 67730 (0.0006) [2023-03-06 18:08:44,285][23882] Updated weights for policy 0, policy_version 67740 (0.0006) [2023-03-06 18:08:45,064][23882] Updated weights for policy 0, policy_version 67750 (0.0006) [2023-03-06 18:08:45,851][23882] Updated weights for policy 0, policy_version 67760 (0.0007) [2023-03-06 18:08:46,633][23882] Updated weights for policy 0, policy_version 67770 (0.0007) [2023-03-06 18:08:46,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13051.7). Total num frames: 69397504. Throughput: 0: 13026.7. Samples: 69369735. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:08:46,748][23556] Avg episode reward: [(0, '1829.987')] [2023-03-06 18:08:47,421][23882] Updated weights for policy 0, policy_version 67780 (0.0007) [2023-03-06 18:08:48,232][23882] Updated weights for policy 0, policy_version 67790 (0.0007) [2023-03-06 18:08:48,998][23831] KL-divergence is very high: 505.7611 [2023-03-06 18:08:49,006][23882] Updated weights for policy 0, policy_version 67800 (0.0007) [2023-03-06 18:08:49,629][23831] KL-divergence is very high: 46741340.0000 [2023-03-06 18:08:49,781][23882] Updated weights for policy 0, policy_version 67810 (0.0006) [2023-03-06 18:08:50,595][23882] Updated weights for policy 0, policy_version 67820 (0.0007) [2023-03-06 18:08:51,367][23882] Updated weights for policy 0, policy_version 67830 (0.0007) [2023-03-06 18:08:51,677][23831] KL-divergence is very high: 757.3736 [2023-03-06 18:08:51,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13048.2). Total num frames: 69462016. Throughput: 0: 13032.7. Samples: 69447811. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:08:51,748][23556] Avg episode reward: [(0, '1712.486')] [2023-03-06 18:08:52,153][23882] Updated weights for policy 0, policy_version 67840 (0.0006) [2023-03-06 18:08:52,926][23882] Updated weights for policy 0, policy_version 67850 (0.0007) [2023-03-06 18:08:53,702][23882] Updated weights for policy 0, policy_version 67860 (0.0007) [2023-03-06 18:08:54,477][23882] Updated weights for policy 0, policy_version 67870 (0.0006) [2023-03-06 18:08:54,638][23831] KL-divergence is very high: 169.6730 [2023-03-06 18:08:55,269][23882] Updated weights for policy 0, policy_version 67880 (0.0006) [2023-03-06 18:08:55,422][23831] KL-divergence is very high: 27635.0000 [2023-03-06 18:08:56,046][23882] Updated weights for policy 0, policy_version 67890 (0.0006) [2023-03-06 18:08:56,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13039.0, 300 sec: 13051.7). Total num frames: 69528576. Throughput: 0: 13044.5. Samples: 69526409. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:08:56,748][23556] Avg episode reward: [(0, '1831.388')] [2023-03-06 18:08:56,829][23882] Updated weights for policy 0, policy_version 67900 (0.0006) [2023-03-06 18:08:57,597][23882] Updated weights for policy 0, policy_version 67910 (0.0006) [2023-03-06 18:08:58,387][23882] Updated weights for policy 0, policy_version 67920 (0.0006) [2023-03-06 18:08:59,193][23882] Updated weights for policy 0, policy_version 67930 (0.0006) [2023-03-06 18:08:59,973][23882] Updated weights for policy 0, policy_version 67940 (0.0007) [2023-03-06 18:09:00,764][23882] Updated weights for policy 0, policy_version 67950 (0.0007) [2023-03-06 18:09:01,555][23882] Updated weights for policy 0, policy_version 67960 (0.0006) [2023-03-06 18:09:01,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 69593088. Throughput: 0: 13046.5. Samples: 69565534. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:09:01,748][23556] Avg episode reward: [(0, '1760.276')] [2023-03-06 18:09:02,314][23882] Updated weights for policy 0, policy_version 67970 (0.0007) [2023-03-06 18:09:03,098][23882] Updated weights for policy 0, policy_version 67980 (0.0006) [2023-03-06 18:09:03,873][23882] Updated weights for policy 0, policy_version 67990 (0.0007) [2023-03-06 18:09:04,654][23882] Updated weights for policy 0, policy_version 68000 (0.0006) [2023-03-06 18:09:05,458][23882] Updated weights for policy 0, policy_version 68010 (0.0007) [2023-03-06 18:09:06,242][23882] Updated weights for policy 0, policy_version 68020 (0.0006) [2023-03-06 18:09:06,748][23556] Fps is (10 sec: 13004.6, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 69658624. Throughput: 0: 13041.7. Samples: 69643960. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:09:06,748][23556] Avg episode reward: [(0, '1798.916')] [2023-03-06 18:09:07,022][23882] Updated weights for policy 0, policy_version 68030 (0.0006) [2023-03-06 18:09:07,814][23882] Updated weights for policy 0, policy_version 68040 (0.0006) [2023-03-06 18:09:08,583][23882] Updated weights for policy 0, policy_version 68050 (0.0005) [2023-03-06 18:09:09,384][23882] Updated weights for policy 0, policy_version 68060 (0.0006) [2023-03-06 18:09:10,172][23882] Updated weights for policy 0, policy_version 68070 (0.0006) [2023-03-06 18:09:10,939][23882] Updated weights for policy 0, policy_version 68080 (0.0006) [2023-03-06 18:09:11,737][23882] Updated weights for policy 0, policy_version 68090 (0.0006) [2023-03-06 18:09:11,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 69724160. Throughput: 0: 13046.4. Samples: 69722350. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:09:11,748][23556] Avg episode reward: [(0, '1953.684')] [2023-03-06 18:09:11,749][23831] Saving new best policy, reward=1953.684! [2023-03-06 18:09:12,511][23882] Updated weights for policy 0, policy_version 68100 (0.0007) [2023-03-06 18:09:13,287][23882] Updated weights for policy 0, policy_version 68110 (0.0006) [2023-03-06 18:09:14,075][23882] Updated weights for policy 0, policy_version 68120 (0.0007) [2023-03-06 18:09:14,856][23882] Updated weights for policy 0, policy_version 68130 (0.0006) [2023-03-06 18:09:15,651][23882] Updated weights for policy 0, policy_version 68140 (0.0007) [2023-03-06 18:09:16,429][23882] Updated weights for policy 0, policy_version 68150 (0.0006) [2023-03-06 18:09:16,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 69789696. Throughput: 0: 13044.7. Samples: 69761563. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:09:16,748][23556] Avg episode reward: [(0, '1861.502')] [2023-03-06 18:09:17,207][23882] Updated weights for policy 0, policy_version 68160 (0.0007) [2023-03-06 18:09:17,991][23882] Updated weights for policy 0, policy_version 68170 (0.0006) [2023-03-06 18:09:18,766][23882] Updated weights for policy 0, policy_version 68180 (0.0006) [2023-03-06 18:09:19,583][23882] Updated weights for policy 0, policy_version 68190 (0.0007) [2023-03-06 18:09:20,365][23882] Updated weights for policy 0, policy_version 68200 (0.0007) [2023-03-06 18:09:21,146][23882] Updated weights for policy 0, policy_version 68210 (0.0006) [2023-03-06 18:09:21,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 69854208. Throughput: 0: 13042.9. Samples: 69839802. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:09:21,748][23556] Avg episode reward: [(0, '1995.423')] [2023-03-06 18:09:21,749][23831] Saving new best policy, reward=1995.423! [2023-03-06 18:09:21,848][23831] KL-divergence is very high: 834.5372 [2023-03-06 18:09:21,930][23831] KL-divergence is very high: 1318.1990 [2023-03-06 18:09:21,937][23882] Updated weights for policy 0, policy_version 68220 (0.0006) [2023-03-06 18:09:22,718][23882] Updated weights for policy 0, policy_version 68230 (0.0007) [2023-03-06 18:09:23,506][23882] Updated weights for policy 0, policy_version 68240 (0.0006) [2023-03-06 18:09:24,299][23882] Updated weights for policy 0, policy_version 68250 (0.0007) [2023-03-06 18:09:25,099][23882] Updated weights for policy 0, policy_version 68260 (0.0007) [2023-03-06 18:09:25,893][23882] Updated weights for policy 0, policy_version 68270 (0.0007) [2023-03-06 18:09:26,678][23882] Updated weights for policy 0, policy_version 68280 (0.0006) [2023-03-06 18:09:26,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 69919744. Throughput: 0: 13041.4. Samples: 69917682. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:09:26,748][23556] Avg episode reward: [(0, '1843.596')] [2023-03-06 18:09:27,463][23882] Updated weights for policy 0, policy_version 68290 (0.0006) [2023-03-06 18:09:27,998][23831] KL-divergence is very high: 13194.0439 [2023-03-06 18:09:28,253][23882] Updated weights for policy 0, policy_version 68300 (0.0007) [2023-03-06 18:09:29,022][23882] Updated weights for policy 0, policy_version 68310 (0.0006) [2023-03-06 18:09:29,798][23882] Updated weights for policy 0, policy_version 68320 (0.0006) [2023-03-06 18:09:30,593][23882] Updated weights for policy 0, policy_version 68330 (0.0007) [2023-03-06 18:09:31,373][23882] Updated weights for policy 0, policy_version 68340 (0.0007) [2023-03-06 18:09:31,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13039.0, 300 sec: 13048.2). Total num frames: 69984256. Throughput: 0: 13047.3. Samples: 69956864. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:09:31,748][23556] Avg episode reward: [(0, '1809.955')] [2023-03-06 18:09:32,145][23882] Updated weights for policy 0, policy_version 68350 (0.0007) [2023-03-06 18:09:32,942][23882] Updated weights for policy 0, policy_version 68360 (0.0006) [2023-03-06 18:09:33,710][23882] Updated weights for policy 0, policy_version 68370 (0.0006) [2023-03-06 18:09:34,517][23882] Updated weights for policy 0, policy_version 68380 (0.0007) [2023-03-06 18:09:35,300][23882] Updated weights for policy 0, policy_version 68390 (0.0007) [2023-03-06 18:09:36,081][23882] Updated weights for policy 0, policy_version 68400 (0.0005) [2023-03-06 18:09:36,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 70049792. Throughput: 0: 13047.1. Samples: 70034929. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:09:36,748][23556] Avg episode reward: [(0, '1753.872')] [2023-03-06 18:09:36,878][23882] Updated weights for policy 0, policy_version 68410 (0.0006) [2023-03-06 18:09:37,656][23882] Updated weights for policy 0, policy_version 68420 (0.0006) [2023-03-06 18:09:38,441][23882] Updated weights for policy 0, policy_version 68430 (0.0006) [2023-03-06 18:09:39,243][23882] Updated weights for policy 0, policy_version 68440 (0.0006) [2023-03-06 18:09:40,015][23882] Updated weights for policy 0, policy_version 68450 (0.0007) [2023-03-06 18:09:40,813][23882] Updated weights for policy 0, policy_version 68460 (0.0008) [2023-03-06 18:09:41,609][23882] Updated weights for policy 0, policy_version 68470 (0.0007) [2023-03-06 18:09:41,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13048.2). Total num frames: 70114304. Throughput: 0: 13037.1. Samples: 70113078. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:09:41,748][23556] Avg episode reward: [(0, '1783.789')] [2023-03-06 18:09:42,373][23882] Updated weights for policy 0, policy_version 68480 (0.0006) [2023-03-06 18:09:43,189][23882] Updated weights for policy 0, policy_version 68490 (0.0006) [2023-03-06 18:09:43,946][23882] Updated weights for policy 0, policy_version 68500 (0.0006) [2023-03-06 18:09:44,733][23882] Updated weights for policy 0, policy_version 68510 (0.0006) [2023-03-06 18:09:45,525][23882] Updated weights for policy 0, policy_version 68520 (0.0006) [2023-03-06 18:09:46,305][23831] KL-divergence is very high: 8483.8682 [2023-03-06 18:09:46,313][23882] Updated weights for policy 0, policy_version 68530 (0.0006) [2023-03-06 18:09:46,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 70179840. Throughput: 0: 13041.6. Samples: 70152405. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:09:46,748][23556] Avg episode reward: [(0, '1938.064')] [2023-03-06 18:09:47,102][23882] Updated weights for policy 0, policy_version 68540 (0.0007) [2023-03-06 18:09:47,894][23882] Updated weights for policy 0, policy_version 68550 (0.0006) [2023-03-06 18:09:48,682][23882] Updated weights for policy 0, policy_version 68560 (0.0006) [2023-03-06 18:09:49,459][23882] Updated weights for policy 0, policy_version 68570 (0.0006) [2023-03-06 18:09:50,234][23882] Updated weights for policy 0, policy_version 68580 (0.0006) [2023-03-06 18:09:51,022][23882] Updated weights for policy 0, policy_version 68590 (0.0007) [2023-03-06 18:09:51,748][23556] Fps is (10 sec: 13107.0, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 70245376. Throughput: 0: 13035.1. Samples: 70230539. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:09:51,749][23556] Avg episode reward: [(0, '1795.108')] [2023-03-06 18:09:51,799][23882] Updated weights for policy 0, policy_version 68600 (0.0006) [2023-03-06 18:09:52,578][23882] Updated weights for policy 0, policy_version 68610 (0.0006) [2023-03-06 18:09:53,349][23882] Updated weights for policy 0, policy_version 68620 (0.0006) [2023-03-06 18:09:54,121][23882] Updated weights for policy 0, policy_version 68630 (0.0007) [2023-03-06 18:09:54,909][23882] Updated weights for policy 0, policy_version 68640 (0.0007) [2023-03-06 18:09:55,704][23882] Updated weights for policy 0, policy_version 68650 (0.0006) [2023-03-06 18:09:56,500][23882] Updated weights for policy 0, policy_version 68660 (0.0006) [2023-03-06 18:09:56,748][23556] Fps is (10 sec: 13107.4, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 70310912. Throughput: 0: 13036.2. Samples: 70308976. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:09:56,748][23556] Avg episode reward: [(0, '1833.346')] [2023-03-06 18:09:56,752][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000068663_70310912.pth... [2023-03-06 18:09:56,781][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000065605_67179520.pth [2023-03-06 18:09:57,289][23882] Updated weights for policy 0, policy_version 68670 (0.0006) [2023-03-06 18:09:58,086][23882] Updated weights for policy 0, policy_version 68680 (0.0006) [2023-03-06 18:09:58,867][23882] Updated weights for policy 0, policy_version 68690 (0.0006) [2023-03-06 18:09:59,630][23882] Updated weights for policy 0, policy_version 68700 (0.0007) [2023-03-06 18:10:00,430][23882] Updated weights for policy 0, policy_version 68710 (0.0006) [2023-03-06 18:10:01,199][23882] Updated weights for policy 0, policy_version 68720 (0.0007) [2023-03-06 18:10:01,748][23556] Fps is (10 sec: 13005.0, 60 sec: 13038.9, 300 sec: 13048.2). Total num frames: 70375424. Throughput: 0: 13034.9. Samples: 70348134. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:10:01,748][23556] Avg episode reward: [(0, '1702.783')] [2023-03-06 18:10:01,980][23882] Updated weights for policy 0, policy_version 68730 (0.0007) [2023-03-06 18:10:02,779][23882] Updated weights for policy 0, policy_version 68740 (0.0006) [2023-03-06 18:10:03,566][23882] Updated weights for policy 0, policy_version 68750 (0.0006) [2023-03-06 18:10:04,350][23882] Updated weights for policy 0, policy_version 68760 (0.0007) [2023-03-06 18:10:05,158][23882] Updated weights for policy 0, policy_version 68770 (0.0006) [2023-03-06 18:10:05,946][23882] Updated weights for policy 0, policy_version 68780 (0.0007) [2023-03-06 18:10:06,732][23882] Updated weights for policy 0, policy_version 68790 (0.0006) [2023-03-06 18:10:06,748][23556] Fps is (10 sec: 13004.6, 60 sec: 13038.9, 300 sec: 13048.2). Total num frames: 70440960. Throughput: 0: 13029.8. Samples: 70426144. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:10:06,748][23556] Avg episode reward: [(0, '1734.975')] [2023-03-06 18:10:07,527][23882] Updated weights for policy 0, policy_version 68800 (0.0007) [2023-03-06 18:10:08,326][23882] Updated weights for policy 0, policy_version 68810 (0.0006) [2023-03-06 18:10:09,117][23882] Updated weights for policy 0, policy_version 68820 (0.0007) [2023-03-06 18:10:09,904][23882] Updated weights for policy 0, policy_version 68830 (0.0007) [2023-03-06 18:10:10,692][23882] Updated weights for policy 0, policy_version 68840 (0.0006) [2023-03-06 18:10:11,473][23882] Updated weights for policy 0, policy_version 68850 (0.0006) [2023-03-06 18:10:11,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13044.7). Total num frames: 70505472. Throughput: 0: 13029.9. Samples: 70504029. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:10:11,748][23556] Avg episode reward: [(0, '1871.500')] [2023-03-06 18:10:12,251][23882] Updated weights for policy 0, policy_version 68860 (0.0006) [2023-03-06 18:10:13,044][23882] Updated weights for policy 0, policy_version 68870 (0.0006) [2023-03-06 18:10:13,827][23882] Updated weights for policy 0, policy_version 68880 (0.0007) [2023-03-06 18:10:14,618][23882] Updated weights for policy 0, policy_version 68890 (0.0007) [2023-03-06 18:10:15,394][23882] Updated weights for policy 0, policy_version 68900 (0.0006) [2023-03-06 18:10:16,177][23882] Updated weights for policy 0, policy_version 68910 (0.0006) [2023-03-06 18:10:16,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13044.7). Total num frames: 70571008. Throughput: 0: 13027.8. Samples: 70543118. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:10:16,748][23556] Avg episode reward: [(0, '1964.914')] [2023-03-06 18:10:16,975][23882] Updated weights for policy 0, policy_version 68920 (0.0006) [2023-03-06 18:10:17,775][23882] Updated weights for policy 0, policy_version 68930 (0.0006) [2023-03-06 18:10:18,555][23882] Updated weights for policy 0, policy_version 68940 (0.0006) [2023-03-06 18:10:19,342][23882] Updated weights for policy 0, policy_version 68950 (0.0006) [2023-03-06 18:10:20,126][23882] Updated weights for policy 0, policy_version 68960 (0.0006) [2023-03-06 18:10:20,919][23882] Updated weights for policy 0, policy_version 68970 (0.0006) [2023-03-06 18:10:21,690][23882] Updated weights for policy 0, policy_version 68980 (0.0007) [2023-03-06 18:10:21,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13044.7). Total num frames: 70635520. Throughput: 0: 13029.4. Samples: 70621252. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:10:21,749][23556] Avg episode reward: [(0, '1938.907')] [2023-03-06 18:10:22,489][23882] Updated weights for policy 0, policy_version 68990 (0.0006) [2023-03-06 18:10:23,279][23882] Updated weights for policy 0, policy_version 69000 (0.0007) [2023-03-06 18:10:24,061][23882] Updated weights for policy 0, policy_version 69010 (0.0006) [2023-03-06 18:10:24,845][23882] Updated weights for policy 0, policy_version 69020 (0.0006) [2023-03-06 18:10:25,616][23882] Updated weights for policy 0, policy_version 69030 (0.0007) [2023-03-06 18:10:26,416][23882] Updated weights for policy 0, policy_version 69040 (0.0007) [2023-03-06 18:10:26,748][23556] Fps is (10 sec: 13005.0, 60 sec: 13021.9, 300 sec: 13044.7). Total num frames: 70701056. Throughput: 0: 13025.0. Samples: 70699203. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:10:26,748][23556] Avg episode reward: [(0, '1959.165')] [2023-03-06 18:10:27,198][23882] Updated weights for policy 0, policy_version 69050 (0.0006) [2023-03-06 18:10:27,980][23882] Updated weights for policy 0, policy_version 69060 (0.0007) [2023-03-06 18:10:28,768][23882] Updated weights for policy 0, policy_version 69070 (0.0006) [2023-03-06 18:10:29,554][23882] Updated weights for policy 0, policy_version 69080 (0.0006) [2023-03-06 18:10:30,345][23882] Updated weights for policy 0, policy_version 69090 (0.0007) [2023-03-06 18:10:31,117][23882] Updated weights for policy 0, policy_version 69100 (0.0006) [2023-03-06 18:10:31,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13044.7). Total num frames: 70766592. Throughput: 0: 13023.5. Samples: 70738461. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:10:31,748][23556] Avg episode reward: [(0, '1877.289')] [2023-03-06 18:10:31,908][23882] Updated weights for policy 0, policy_version 69110 (0.0006) [2023-03-06 18:10:32,683][23882] Updated weights for policy 0, policy_version 69120 (0.0007) [2023-03-06 18:10:33,453][23882] Updated weights for policy 0, policy_version 69130 (0.0006) [2023-03-06 18:10:34,264][23882] Updated weights for policy 0, policy_version 69140 (0.0006) [2023-03-06 18:10:35,060][23882] Updated weights for policy 0, policy_version 69150 (0.0006) [2023-03-06 18:10:35,841][23882] Updated weights for policy 0, policy_version 69160 (0.0006) [2023-03-06 18:10:36,632][23882] Updated weights for policy 0, policy_version 69170 (0.0006) [2023-03-06 18:10:36,748][23556] Fps is (10 sec: 13004.5, 60 sec: 13021.8, 300 sec: 13044.7). Total num frames: 70831104. Throughput: 0: 13027.1. Samples: 70816760. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:10:36,749][23556] Avg episode reward: [(0, '1913.615')] [2023-03-06 18:10:37,427][23882] Updated weights for policy 0, policy_version 69180 (0.0007) [2023-03-06 18:10:37,639][23831] KL-divergence is very high: 348.6468 [2023-03-06 18:10:38,198][23882] Updated weights for policy 0, policy_version 69190 (0.0007) [2023-03-06 18:10:39,002][23882] Updated weights for policy 0, policy_version 69200 (0.0006) [2023-03-06 18:10:39,793][23882] Updated weights for policy 0, policy_version 69210 (0.0006) [2023-03-06 18:10:40,573][23882] Updated weights for policy 0, policy_version 69220 (0.0008) [2023-03-06 18:10:41,361][23882] Updated weights for policy 0, policy_version 69230 (0.0007) [2023-03-06 18:10:41,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13044.7). Total num frames: 70896640. Throughput: 0: 13013.2. Samples: 70894569. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:10:41,748][23556] Avg episode reward: [(0, '2053.306')] [2023-03-06 18:10:41,749][23831] Saving new best policy, reward=2053.306! [2023-03-06 18:10:42,138][23882] Updated weights for policy 0, policy_version 69240 (0.0007) [2023-03-06 18:10:42,910][23882] Updated weights for policy 0, policy_version 69250 (0.0007) [2023-03-06 18:10:43,716][23882] Updated weights for policy 0, policy_version 69260 (0.0006) [2023-03-06 18:10:44,496][23882] Updated weights for policy 0, policy_version 69270 (0.0006) [2023-03-06 18:10:44,979][23831] KL-divergence is very high: 114.1040 [2023-03-06 18:10:45,305][23882] Updated weights for policy 0, policy_version 69280 (0.0006) [2023-03-06 18:10:46,094][23882] Updated weights for policy 0, policy_version 69290 (0.0006) [2023-03-06 18:10:46,748][23556] Fps is (10 sec: 13005.0, 60 sec: 13021.9, 300 sec: 13041.2). Total num frames: 70961152. Throughput: 0: 13013.3. Samples: 70933733. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:10:46,748][23556] Avg episode reward: [(0, '1468.680')] [2023-03-06 18:10:46,887][23882] Updated weights for policy 0, policy_version 69300 (0.0006) [2023-03-06 18:10:47,658][23882] Updated weights for policy 0, policy_version 69310 (0.0008) [2023-03-06 18:10:48,462][23882] Updated weights for policy 0, policy_version 69320 (0.0007) [2023-03-06 18:10:49,237][23882] Updated weights for policy 0, policy_version 69330 (0.0006) [2023-03-06 18:10:50,029][23882] Updated weights for policy 0, policy_version 69340 (0.0006) [2023-03-06 18:10:50,810][23882] Updated weights for policy 0, policy_version 69350 (0.0006) [2023-03-06 18:10:51,589][23882] Updated weights for policy 0, policy_version 69360 (0.0005) [2023-03-06 18:10:51,748][23556] Fps is (10 sec: 12902.5, 60 sec: 13004.8, 300 sec: 13037.8). Total num frames: 71025664. Throughput: 0: 13009.0. Samples: 71011548. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:10:51,748][23556] Avg episode reward: [(0, '1751.271')] [2023-03-06 18:10:52,370][23882] Updated weights for policy 0, policy_version 69370 (0.0006) [2023-03-06 18:10:53,155][23882] Updated weights for policy 0, policy_version 69380 (0.0006) [2023-03-06 18:10:53,922][23882] Updated weights for policy 0, policy_version 69390 (0.0006) [2023-03-06 18:10:54,698][23882] Updated weights for policy 0, policy_version 69400 (0.0006) [2023-03-06 18:10:55,485][23882] Updated weights for policy 0, policy_version 69410 (0.0006) [2023-03-06 18:10:56,267][23882] Updated weights for policy 0, policy_version 69420 (0.0006) [2023-03-06 18:10:56,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13021.9, 300 sec: 13041.2). Total num frames: 71092224. Throughput: 0: 13026.5. Samples: 71090221. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:10:56,748][23556] Avg episode reward: [(0, '1985.570')] [2023-03-06 18:10:57,044][23882] Updated weights for policy 0, policy_version 69430 (0.0006) [2023-03-06 18:10:57,833][23882] Updated weights for policy 0, policy_version 69440 (0.0007) [2023-03-06 18:10:58,632][23882] Updated weights for policy 0, policy_version 69450 (0.0006) [2023-03-06 18:10:59,421][23882] Updated weights for policy 0, policy_version 69460 (0.0006) [2023-03-06 18:11:00,207][23882] Updated weights for policy 0, policy_version 69470 (0.0006) [2023-03-06 18:11:00,997][23882] Updated weights for policy 0, policy_version 69480 (0.0006) [2023-03-06 18:11:01,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13021.9, 300 sec: 13041.2). Total num frames: 71156736. Throughput: 0: 13024.9. Samples: 71129237. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:11:01,748][23556] Avg episode reward: [(0, '1883.655')] [2023-03-06 18:11:01,770][23882] Updated weights for policy 0, policy_version 69490 (0.0007) [2023-03-06 18:11:02,018][23831] KL-divergence is very high: 151.7804 [2023-03-06 18:11:02,571][23882] Updated weights for policy 0, policy_version 69500 (0.0006) [2023-03-06 18:11:03,350][23882] Updated weights for policy 0, policy_version 69510 (0.0006) [2023-03-06 18:11:04,149][23882] Updated weights for policy 0, policy_version 69520 (0.0006) [2023-03-06 18:11:04,917][23882] Updated weights for policy 0, policy_version 69530 (0.0006) [2023-03-06 18:11:05,708][23882] Updated weights for policy 0, policy_version 69540 (0.0006) [2023-03-06 18:11:06,499][23882] Updated weights for policy 0, policy_version 69550 (0.0007) [2023-03-06 18:11:06,748][23556] Fps is (10 sec: 13004.6, 60 sec: 13021.9, 300 sec: 13041.2). Total num frames: 71222272. Throughput: 0: 13023.3. Samples: 71207303. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:11:06,749][23556] Avg episode reward: [(0, '1844.217')] [2023-03-06 18:11:07,287][23882] Updated weights for policy 0, policy_version 69560 (0.0006) [2023-03-06 18:11:08,072][23882] Updated weights for policy 0, policy_version 69570 (0.0007) [2023-03-06 18:11:08,865][23882] Updated weights for policy 0, policy_version 69580 (0.0006) [2023-03-06 18:11:09,650][23882] Updated weights for policy 0, policy_version 69590 (0.0005) [2023-03-06 18:11:10,433][23882] Updated weights for policy 0, policy_version 69600 (0.0007) [2023-03-06 18:11:11,212][23882] Updated weights for policy 0, policy_version 69610 (0.0006) [2023-03-06 18:11:11,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13037.8). Total num frames: 71286784. Throughput: 0: 13031.8. Samples: 71285634. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:11:11,748][23556] Avg episode reward: [(0, '1989.924')] [2023-03-06 18:11:11,985][23882] Updated weights for policy 0, policy_version 69620 (0.0007) [2023-03-06 18:11:12,777][23882] Updated weights for policy 0, policy_version 69630 (0.0006) [2023-03-06 18:11:13,573][23882] Updated weights for policy 0, policy_version 69640 (0.0007) [2023-03-06 18:11:14,346][23882] Updated weights for policy 0, policy_version 69650 (0.0006) [2023-03-06 18:11:15,151][23882] Updated weights for policy 0, policy_version 69660 (0.0006) [2023-03-06 18:11:15,941][23882] Updated weights for policy 0, policy_version 69670 (0.0008) [2023-03-06 18:11:16,723][23882] Updated weights for policy 0, policy_version 69680 (0.0005) [2023-03-06 18:11:16,748][23556] Fps is (10 sec: 13005.1, 60 sec: 13021.9, 300 sec: 13037.8). Total num frames: 71352320. Throughput: 0: 13026.2. Samples: 71324637. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:11:16,748][23556] Avg episode reward: [(0, '1949.689')] [2023-03-06 18:11:17,519][23882] Updated weights for policy 0, policy_version 69690 (0.0007) [2023-03-06 18:11:18,319][23882] Updated weights for policy 0, policy_version 69700 (0.0006) [2023-03-06 18:11:19,092][23882] Updated weights for policy 0, policy_version 69710 (0.0006) [2023-03-06 18:11:19,889][23882] Updated weights for policy 0, policy_version 69720 (0.0007) [2023-03-06 18:11:20,681][23882] Updated weights for policy 0, policy_version 69730 (0.0006) [2023-03-06 18:11:21,457][23882] Updated weights for policy 0, policy_version 69740 (0.0006) [2023-03-06 18:11:21,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 71416832. Throughput: 0: 13011.6. Samples: 71402278. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:11:21,748][23556] Avg episode reward: [(0, '1862.597')] [2023-03-06 18:11:22,240][23882] Updated weights for policy 0, policy_version 69750 (0.0005) [2023-03-06 18:11:23,034][23882] Updated weights for policy 0, policy_version 69760 (0.0007) [2023-03-06 18:11:23,797][23882] Updated weights for policy 0, policy_version 69770 (0.0006) [2023-03-06 18:11:24,604][23882] Updated weights for policy 0, policy_version 69780 (0.0007) [2023-03-06 18:11:25,389][23882] Updated weights for policy 0, policy_version 69790 (0.0006) [2023-03-06 18:11:26,165][23882] Updated weights for policy 0, policy_version 69800 (0.0006) [2023-03-06 18:11:26,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13037.8). Total num frames: 71482368. Throughput: 0: 13026.8. Samples: 71480776. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:11:26,748][23556] Avg episode reward: [(0, '1922.900')] [2023-03-06 18:11:26,936][23882] Updated weights for policy 0, policy_version 69810 (0.0006) [2023-03-06 18:11:27,707][23882] Updated weights for policy 0, policy_version 69820 (0.0007) [2023-03-06 18:11:28,511][23882] Updated weights for policy 0, policy_version 69830 (0.0006) [2023-03-06 18:11:29,284][23882] Updated weights for policy 0, policy_version 69840 (0.0007) [2023-03-06 18:11:30,070][23882] Updated weights for policy 0, policy_version 69850 (0.0006) [2023-03-06 18:11:30,862][23882] Updated weights for policy 0, policy_version 69860 (0.0006) [2023-03-06 18:11:31,645][23882] Updated weights for policy 0, policy_version 69870 (0.0007) [2023-03-06 18:11:31,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13021.9, 300 sec: 13037.8). Total num frames: 71547904. Throughput: 0: 13028.9. Samples: 71520034. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:11:31,748][23556] Avg episode reward: [(0, '1860.080')] [2023-03-06 18:11:32,426][23882] Updated weights for policy 0, policy_version 69880 (0.0006) [2023-03-06 18:11:33,217][23882] Updated weights for policy 0, policy_version 69890 (0.0006) [2023-03-06 18:11:34,017][23882] Updated weights for policy 0, policy_version 69900 (0.0006) [2023-03-06 18:11:34,807][23882] Updated weights for policy 0, policy_version 69910 (0.0006) [2023-03-06 18:11:35,588][23882] Updated weights for policy 0, policy_version 69920 (0.0006) [2023-03-06 18:11:36,378][23882] Updated weights for policy 0, policy_version 69930 (0.0007) [2023-03-06 18:11:36,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 71612416. Throughput: 0: 13035.9. Samples: 71598164. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:11:36,748][23556] Avg episode reward: [(0, '1868.706')] [2023-03-06 18:11:37,166][23882] Updated weights for policy 0, policy_version 69940 (0.0006) [2023-03-06 18:11:37,959][23882] Updated weights for policy 0, policy_version 69950 (0.0007) [2023-03-06 18:11:38,753][23882] Updated weights for policy 0, policy_version 69960 (0.0006) [2023-03-06 18:11:39,524][23882] Updated weights for policy 0, policy_version 69970 (0.0007) [2023-03-06 18:11:40,318][23882] Updated weights for policy 0, policy_version 69980 (0.0006) [2023-03-06 18:11:41,092][23882] Updated weights for policy 0, policy_version 69990 (0.0006) [2023-03-06 18:11:41,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13037.8). Total num frames: 71677952. Throughput: 0: 13021.0. Samples: 71676166. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:11:41,748][23556] Avg episode reward: [(0, '1913.740')] [2023-03-06 18:11:41,873][23882] Updated weights for policy 0, policy_version 70000 (0.0006) [2023-03-06 18:11:42,652][23882] Updated weights for policy 0, policy_version 70010 (0.0006) [2023-03-06 18:11:43,456][23882] Updated weights for policy 0, policy_version 70020 (0.0006) [2023-03-06 18:11:44,242][23882] Updated weights for policy 0, policy_version 70030 (0.0005) [2023-03-06 18:11:45,031][23882] Updated weights for policy 0, policy_version 70040 (0.0007) [2023-03-06 18:11:45,818][23882] Updated weights for policy 0, policy_version 70050 (0.0006) [2023-03-06 18:11:46,606][23882] Updated weights for policy 0, policy_version 70060 (0.0006) [2023-03-06 18:11:46,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 71742464. Throughput: 0: 13020.7. Samples: 71715168. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:11:46,748][23556] Avg episode reward: [(0, '2026.855')] [2023-03-06 18:11:47,385][23882] Updated weights for policy 0, policy_version 70070 (0.0006) [2023-03-06 18:11:48,187][23882] Updated weights for policy 0, policy_version 70080 (0.0005) [2023-03-06 18:11:48,966][23882] Updated weights for policy 0, policy_version 70090 (0.0006) [2023-03-06 18:11:49,746][23882] Updated weights for policy 0, policy_version 70100 (0.0006) [2023-03-06 18:11:50,531][23882] Updated weights for policy 0, policy_version 70110 (0.0006) [2023-03-06 18:11:51,315][23882] Updated weights for policy 0, policy_version 70120 (0.0006) [2023-03-06 18:11:51,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 71808000. Throughput: 0: 13022.6. Samples: 71793316. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:11:51,748][23556] Avg episode reward: [(0, '1874.670')] [2023-03-06 18:11:52,076][23882] Updated weights for policy 0, policy_version 70130 (0.0006) [2023-03-06 18:11:52,873][23882] Updated weights for policy 0, policy_version 70140 (0.0006) [2023-03-06 18:11:53,645][23882] Updated weights for policy 0, policy_version 70150 (0.0006) [2023-03-06 18:11:54,434][23882] Updated weights for policy 0, policy_version 70160 (0.0006) [2023-03-06 18:11:55,223][23882] Updated weights for policy 0, policy_version 70170 (0.0007) [2023-03-06 18:11:56,013][23882] Updated weights for policy 0, policy_version 70180 (0.0007) [2023-03-06 18:11:56,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 71873536. Throughput: 0: 13026.1. Samples: 71871809. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:11:56,748][23556] Avg episode reward: [(0, '2004.448')] [2023-03-06 18:11:56,752][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000070189_71873536.pth... [2023-03-06 18:11:56,783][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000067135_68746240.pth [2023-03-06 18:11:56,806][23882] Updated weights for policy 0, policy_version 70190 (0.0006) [2023-03-06 18:11:57,581][23882] Updated weights for policy 0, policy_version 70200 (0.0006) [2023-03-06 18:11:58,367][23882] Updated weights for policy 0, policy_version 70210 (0.0006) [2023-03-06 18:11:59,152][23882] Updated weights for policy 0, policy_version 70220 (0.0006) [2023-03-06 18:11:59,941][23882] Updated weights for policy 0, policy_version 70230 (0.0008) [2023-03-06 18:12:00,716][23882] Updated weights for policy 0, policy_version 70240 (0.0006) [2023-03-06 18:12:01,497][23882] Updated weights for policy 0, policy_version 70250 (0.0006) [2023-03-06 18:12:01,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 71939072. Throughput: 0: 13030.9. Samples: 71911026. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:12:01,749][23556] Avg episode reward: [(0, '1971.552')] [2023-03-06 18:12:02,283][23882] Updated weights for policy 0, policy_version 70260 (0.0006) [2023-03-06 18:12:03,066][23882] Updated weights for policy 0, policy_version 70270 (0.0005) [2023-03-06 18:12:03,844][23882] Updated weights for policy 0, policy_version 70280 (0.0006) [2023-03-06 18:12:04,625][23882] Updated weights for policy 0, policy_version 70290 (0.0007) [2023-03-06 18:12:05,430][23882] Updated weights for policy 0, policy_version 70300 (0.0006) [2023-03-06 18:12:06,197][23882] Updated weights for policy 0, policy_version 70310 (0.0006) [2023-03-06 18:12:06,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 72003584. Throughput: 0: 13044.0. Samples: 71989255. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:12:06,748][23556] Avg episode reward: [(0, '2062.455')] [2023-03-06 18:12:06,753][23831] Saving new best policy, reward=2062.455! [2023-03-06 18:12:07,002][23882] Updated weights for policy 0, policy_version 70320 (0.0006) [2023-03-06 18:12:07,784][23882] Updated weights for policy 0, policy_version 70330 (0.0006) [2023-03-06 18:12:08,577][23882] Updated weights for policy 0, policy_version 70340 (0.0007) [2023-03-06 18:12:09,370][23882] Updated weights for policy 0, policy_version 70350 (0.0007) [2023-03-06 18:12:10,160][23882] Updated weights for policy 0, policy_version 70360 (0.0007) [2023-03-06 18:12:10,941][23882] Updated weights for policy 0, policy_version 70370 (0.0006) [2023-03-06 18:12:11,735][23882] Updated weights for policy 0, policy_version 70380 (0.0006) [2023-03-06 18:12:11,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13030.8). Total num frames: 72069120. Throughput: 0: 13034.7. Samples: 72067336. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:12:11,748][23556] Avg episode reward: [(0, '2011.648')] [2023-03-06 18:12:12,517][23882] Updated weights for policy 0, policy_version 70390 (0.0007) [2023-03-06 18:12:13,289][23882] Updated weights for policy 0, policy_version 70400 (0.0006) [2023-03-06 18:12:14,079][23882] Updated weights for policy 0, policy_version 70410 (0.0006) [2023-03-06 18:12:14,874][23882] Updated weights for policy 0, policy_version 70420 (0.0008) [2023-03-06 18:12:15,664][23882] Updated weights for policy 0, policy_version 70430 (0.0007) [2023-03-06 18:12:16,448][23882] Updated weights for policy 0, policy_version 70440 (0.0006) [2023-03-06 18:12:16,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13030.8). Total num frames: 72133632. Throughput: 0: 13030.5. Samples: 72106407. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:12:16,748][23556] Avg episode reward: [(0, '1769.036')] [2023-03-06 18:12:17,218][23882] Updated weights for policy 0, policy_version 70450 (0.0006) [2023-03-06 18:12:18,021][23882] Updated weights for policy 0, policy_version 70460 (0.0006) [2023-03-06 18:12:18,809][23882] Updated weights for policy 0, policy_version 70470 (0.0006) [2023-03-06 18:12:19,584][23882] Updated weights for policy 0, policy_version 70480 (0.0006) [2023-03-06 18:12:20,396][23882] Updated weights for policy 0, policy_version 70490 (0.0006) [2023-03-06 18:12:21,192][23882] Updated weights for policy 0, policy_version 70500 (0.0007) [2023-03-06 18:12:21,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13039.0, 300 sec: 13030.8). Total num frames: 72199168. Throughput: 0: 13025.0. Samples: 72184291. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:12:21,748][23556] Avg episode reward: [(0, '1952.696')] [2023-03-06 18:12:21,965][23882] Updated weights for policy 0, policy_version 70510 (0.0006) [2023-03-06 18:12:22,763][23882] Updated weights for policy 0, policy_version 70520 (0.0006) [2023-03-06 18:12:23,565][23882] Updated weights for policy 0, policy_version 70530 (0.0007) [2023-03-06 18:12:24,341][23882] Updated weights for policy 0, policy_version 70540 (0.0006) [2023-03-06 18:12:25,129][23882] Updated weights for policy 0, policy_version 70550 (0.0006) [2023-03-06 18:12:25,917][23882] Updated weights for policy 0, policy_version 70560 (0.0006) [2023-03-06 18:12:26,707][23882] Updated weights for policy 0, policy_version 70570 (0.0007) [2023-03-06 18:12:26,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13030.8). Total num frames: 72263680. Throughput: 0: 13026.2. Samples: 72262344. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:12:26,748][23556] Avg episode reward: [(0, '1921.844')] [2023-03-06 18:12:27,499][23882] Updated weights for policy 0, policy_version 70580 (0.0006) [2023-03-06 18:12:28,284][23882] Updated weights for policy 0, policy_version 70590 (0.0006) [2023-03-06 18:12:29,066][23882] Updated weights for policy 0, policy_version 70600 (0.0007) [2023-03-06 18:12:29,847][23882] Updated weights for policy 0, policy_version 70610 (0.0006) [2023-03-06 18:12:30,634][23882] Updated weights for policy 0, policy_version 70620 (0.0006) [2023-03-06 18:12:31,422][23882] Updated weights for policy 0, policy_version 70630 (0.0006) [2023-03-06 18:12:31,748][23556] Fps is (10 sec: 13004.2, 60 sec: 13021.8, 300 sec: 13030.8). Total num frames: 72329216. Throughput: 0: 13019.8. Samples: 72301066. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:12:31,749][23556] Avg episode reward: [(0, '1828.593')] [2023-03-06 18:12:32,216][23882] Updated weights for policy 0, policy_version 70640 (0.0006) [2023-03-06 18:12:32,994][23882] Updated weights for policy 0, policy_version 70650 (0.0007) [2023-03-06 18:12:33,789][23882] Updated weights for policy 0, policy_version 70660 (0.0006) [2023-03-06 18:12:34,563][23882] Updated weights for policy 0, policy_version 70670 (0.0008) [2023-03-06 18:12:35,353][23882] Updated weights for policy 0, policy_version 70680 (0.0006) [2023-03-06 18:12:36,143][23882] Updated weights for policy 0, policy_version 70690 (0.0007) [2023-03-06 18:12:36,749][23556] Fps is (10 sec: 13003.6, 60 sec: 13021.7, 300 sec: 13027.3). Total num frames: 72393728. Throughput: 0: 13025.4. Samples: 72379471. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:12:36,749][23556] Avg episode reward: [(0, '1766.017')] [2023-03-06 18:12:36,925][23882] Updated weights for policy 0, policy_version 70700 (0.0006) [2023-03-06 18:12:37,721][23882] Updated weights for policy 0, policy_version 70710 (0.0006) [2023-03-06 18:12:38,504][23882] Updated weights for policy 0, policy_version 70720 (0.0006) [2023-03-06 18:12:39,277][23882] Updated weights for policy 0, policy_version 70730 (0.0006) [2023-03-06 18:12:40,059][23882] Updated weights for policy 0, policy_version 70740 (0.0006) [2023-03-06 18:12:40,846][23882] Updated weights for policy 0, policy_version 70750 (0.0008) [2023-03-06 18:12:41,622][23882] Updated weights for policy 0, policy_version 70760 (0.0006) [2023-03-06 18:12:41,748][23556] Fps is (10 sec: 13005.4, 60 sec: 13021.9, 300 sec: 13027.4). Total num frames: 72459264. Throughput: 0: 13021.5. Samples: 72457775. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:12:41,748][23556] Avg episode reward: [(0, '1864.442')] [2023-03-06 18:12:42,409][23882] Updated weights for policy 0, policy_version 70770 (0.0006) [2023-03-06 18:12:43,195][23882] Updated weights for policy 0, policy_version 70780 (0.0006) [2023-03-06 18:12:43,985][23882] Updated weights for policy 0, policy_version 70790 (0.0006) [2023-03-06 18:12:44,773][23882] Updated weights for policy 0, policy_version 70800 (0.0007) [2023-03-06 18:12:45,537][23882] Updated weights for policy 0, policy_version 70810 (0.0007) [2023-03-06 18:12:46,336][23882] Updated weights for policy 0, policy_version 70820 (0.0006) [2023-03-06 18:12:46,748][23556] Fps is (10 sec: 13108.3, 60 sec: 13038.9, 300 sec: 13027.4). Total num frames: 72524800. Throughput: 0: 13016.2. Samples: 72496754. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:12:46,748][23556] Avg episode reward: [(0, '1792.288')] [2023-03-06 18:12:47,131][23882] Updated weights for policy 0, policy_version 70830 (0.0006) [2023-03-06 18:12:47,919][23882] Updated weights for policy 0, policy_version 70840 (0.0007) [2023-03-06 18:12:48,705][23882] Updated weights for policy 0, policy_version 70850 (0.0007) [2023-03-06 18:12:49,491][23882] Updated weights for policy 0, policy_version 70860 (0.0006) [2023-03-06 18:12:50,269][23882] Updated weights for policy 0, policy_version 70870 (0.0006) [2023-03-06 18:12:51,049][23882] Updated weights for policy 0, policy_version 70880 (0.0006) [2023-03-06 18:12:51,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13027.4). Total num frames: 72589312. Throughput: 0: 13017.1. Samples: 72575023. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:12:51,748][23556] Avg episode reward: [(0, '1726.765')] [2023-03-06 18:12:51,844][23882] Updated weights for policy 0, policy_version 70890 (0.0007) [2023-03-06 18:12:52,626][23882] Updated weights for policy 0, policy_version 70900 (0.0006) [2023-03-06 18:12:53,426][23882] Updated weights for policy 0, policy_version 70910 (0.0007) [2023-03-06 18:12:54,193][23882] Updated weights for policy 0, policy_version 70920 (0.0006) [2023-03-06 18:12:54,979][23882] Updated weights for policy 0, policy_version 70930 (0.0007) [2023-03-06 18:12:55,753][23882] Updated weights for policy 0, policy_version 70940 (0.0007) [2023-03-06 18:12:56,548][23882] Updated weights for policy 0, policy_version 70950 (0.0007) [2023-03-06 18:12:56,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13030.8). Total num frames: 72654848. Throughput: 0: 13023.1. Samples: 72653378. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:12:56,748][23556] Avg episode reward: [(0, '1693.878')] [2023-03-06 18:12:57,349][23882] Updated weights for policy 0, policy_version 70960 (0.0006) [2023-03-06 18:12:58,136][23882] Updated weights for policy 0, policy_version 70970 (0.0007) [2023-03-06 18:12:58,929][23882] Updated weights for policy 0, policy_version 70980 (0.0007) [2023-03-06 18:12:59,707][23882] Updated weights for policy 0, policy_version 70990 (0.0007) [2023-03-06 18:13:00,490][23882] Updated weights for policy 0, policy_version 71000 (0.0006) [2023-03-06 18:13:01,267][23882] Updated weights for policy 0, policy_version 71010 (0.0006) [2023-03-06 18:13:01,748][23556] Fps is (10 sec: 13106.9, 60 sec: 13021.8, 300 sec: 13034.3). Total num frames: 72720384. Throughput: 0: 13016.0. Samples: 72692131. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:13:01,748][23556] Avg episode reward: [(0, '1781.285')] [2023-03-06 18:13:02,049][23882] Updated weights for policy 0, policy_version 71020 (0.0006) [2023-03-06 18:13:02,849][23882] Updated weights for policy 0, policy_version 71030 (0.0006) [2023-03-06 18:13:03,634][23882] Updated weights for policy 0, policy_version 71040 (0.0006) [2023-03-06 18:13:04,419][23882] Updated weights for policy 0, policy_version 71050 (0.0006) [2023-03-06 18:13:05,214][23882] Updated weights for policy 0, policy_version 71060 (0.0006) [2023-03-06 18:13:05,988][23882] Updated weights for policy 0, policy_version 71070 (0.0006) [2023-03-06 18:13:06,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13030.8). Total num frames: 72784896. Throughput: 0: 13027.0. Samples: 72770506. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:13:06,748][23556] Avg episode reward: [(0, '1816.113')] [2023-03-06 18:13:06,773][23882] Updated weights for policy 0, policy_version 71080 (0.0007) [2023-03-06 18:13:07,558][23882] Updated weights for policy 0, policy_version 71090 (0.0006) [2023-03-06 18:13:08,351][23882] Updated weights for policy 0, policy_version 71100 (0.0007) [2023-03-06 18:13:09,132][23882] Updated weights for policy 0, policy_version 71110 (0.0006) [2023-03-06 18:13:09,917][23882] Updated weights for policy 0, policy_version 71120 (0.0006) [2023-03-06 18:13:10,696][23882] Updated weights for policy 0, policy_version 71130 (0.0007) [2023-03-06 18:13:11,460][23882] Updated weights for policy 0, policy_version 71140 (0.0006) [2023-03-06 18:13:11,748][23556] Fps is (10 sec: 13005.1, 60 sec: 13021.9, 300 sec: 13030.8). Total num frames: 72850432. Throughput: 0: 13028.3. Samples: 72848616. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:13:11,748][23556] Avg episode reward: [(0, '1834.205')] [2023-03-06 18:13:12,273][23882] Updated weights for policy 0, policy_version 71150 (0.0007) [2023-03-06 18:13:13,048][23882] Updated weights for policy 0, policy_version 71160 (0.0006) [2023-03-06 18:13:13,823][23882] Updated weights for policy 0, policy_version 71170 (0.0006) [2023-03-06 18:13:14,621][23882] Updated weights for policy 0, policy_version 71180 (0.0006) [2023-03-06 18:13:15,415][23882] Updated weights for policy 0, policy_version 71190 (0.0007) [2023-03-06 18:13:16,198][23882] Updated weights for policy 0, policy_version 71200 (0.0006) [2023-03-06 18:13:16,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13027.4). Total num frames: 72914944. Throughput: 0: 13037.5. Samples: 72887746. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:13:16,748][23556] Avg episode reward: [(0, '1713.327')] [2023-03-06 18:13:17,004][23882] Updated weights for policy 0, policy_version 71210 (0.0006) [2023-03-06 18:13:17,778][23882] Updated weights for policy 0, policy_version 71220 (0.0006) [2023-03-06 18:13:18,589][23882] Updated weights for policy 0, policy_version 71230 (0.0007) [2023-03-06 18:13:19,376][23882] Updated weights for policy 0, policy_version 71240 (0.0006) [2023-03-06 18:13:20,162][23882] Updated weights for policy 0, policy_version 71250 (0.0006) [2023-03-06 18:13:20,921][23882] Updated weights for policy 0, policy_version 71260 (0.0007) [2023-03-06 18:13:21,730][23882] Updated weights for policy 0, policy_version 71270 (0.0007) [2023-03-06 18:13:21,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13030.8). Total num frames: 72980480. Throughput: 0: 13030.0. Samples: 72965809. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:13:21,748][23556] Avg episode reward: [(0, '1916.955')] [2023-03-06 18:13:22,510][23882] Updated weights for policy 0, policy_version 71280 (0.0006) [2023-03-06 18:13:23,292][23882] Updated weights for policy 0, policy_version 71290 (0.0007) [2023-03-06 18:13:24,061][23882] Updated weights for policy 0, policy_version 71300 (0.0006) [2023-03-06 18:13:24,862][23882] Updated weights for policy 0, policy_version 71310 (0.0007) [2023-03-06 18:13:25,638][23882] Updated weights for policy 0, policy_version 71320 (0.0006) [2023-03-06 18:13:26,421][23882] Updated weights for policy 0, policy_version 71330 (0.0007) [2023-03-06 18:13:26,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13030.8). Total num frames: 73046016. Throughput: 0: 13030.7. Samples: 73044157. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:13:26,748][23556] Avg episode reward: [(0, '1894.416')] [2023-03-06 18:13:27,206][23882] Updated weights for policy 0, policy_version 71340 (0.0006) [2023-03-06 18:13:27,972][23882] Updated weights for policy 0, policy_version 71350 (0.0006) [2023-03-06 18:13:28,752][23882] Updated weights for policy 0, policy_version 71360 (0.0007) [2023-03-06 18:13:29,552][23882] Updated weights for policy 0, policy_version 71370 (0.0006) [2023-03-06 18:13:30,330][23882] Updated weights for policy 0, policy_version 71380 (0.0006) [2023-03-06 18:13:31,117][23882] Updated weights for policy 0, policy_version 71390 (0.0006) [2023-03-06 18:13:31,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13039.0, 300 sec: 13030.8). Total num frames: 73111552. Throughput: 0: 13036.9. Samples: 73083415. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:13:31,748][23556] Avg episode reward: [(0, '2082.436')] [2023-03-06 18:13:31,749][23831] Saving new best policy, reward=2082.436! [2023-03-06 18:13:31,906][23882] Updated weights for policy 0, policy_version 71400 (0.0006) [2023-03-06 18:13:32,676][23882] Updated weights for policy 0, policy_version 71410 (0.0007) [2023-03-06 18:13:33,478][23882] Updated weights for policy 0, policy_version 71420 (0.0006) [2023-03-06 18:13:34,269][23882] Updated weights for policy 0, policy_version 71430 (0.0007) [2023-03-06 18:13:35,055][23882] Updated weights for policy 0, policy_version 71440 (0.0006) [2023-03-06 18:13:35,826][23882] Updated weights for policy 0, policy_version 71450 (0.0006) [2023-03-06 18:13:36,612][23882] Updated weights for policy 0, policy_version 71460 (0.0006) [2023-03-06 18:13:36,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13039.1, 300 sec: 13030.8). Total num frames: 73176064. Throughput: 0: 13035.8. Samples: 73161633. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:13:36,748][23556] Avg episode reward: [(0, '2022.475')] [2023-03-06 18:13:37,398][23882] Updated weights for policy 0, policy_version 71470 (0.0006) [2023-03-06 18:13:38,180][23882] Updated weights for policy 0, policy_version 71480 (0.0006) [2023-03-06 18:13:38,961][23882] Updated weights for policy 0, policy_version 71490 (0.0008) [2023-03-06 18:13:39,755][23882] Updated weights for policy 0, policy_version 71500 (0.0006) [2023-03-06 18:13:40,530][23882] Updated weights for policy 0, policy_version 71510 (0.0006) [2023-03-06 18:13:41,322][23882] Updated weights for policy 0, policy_version 71520 (0.0006) [2023-03-06 18:13:41,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13030.8). Total num frames: 73241600. Throughput: 0: 13034.1. Samples: 73239913. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:13:41,748][23556] Avg episode reward: [(0, '2078.253')] [2023-03-06 18:13:42,120][23882] Updated weights for policy 0, policy_version 71530 (0.0007) [2023-03-06 18:13:42,915][23882] Updated weights for policy 0, policy_version 71540 (0.0007) [2023-03-06 18:13:43,699][23882] Updated weights for policy 0, policy_version 71550 (0.0006) [2023-03-06 18:13:44,480][23882] Updated weights for policy 0, policy_version 71560 (0.0006) [2023-03-06 18:13:45,283][23882] Updated weights for policy 0, policy_version 71570 (0.0006) [2023-03-06 18:13:46,062][23882] Updated weights for policy 0, policy_version 71580 (0.0006) [2023-03-06 18:13:46,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13030.8). Total num frames: 73306112. Throughput: 0: 13034.4. Samples: 73278675. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:13:46,748][23556] Avg episode reward: [(0, '2064.932')] [2023-03-06 18:13:46,861][23882] Updated weights for policy 0, policy_version 71590 (0.0006) [2023-03-06 18:13:47,636][23882] Updated weights for policy 0, policy_version 71600 (0.0007) [2023-03-06 18:13:48,416][23882] Updated weights for policy 0, policy_version 71610 (0.0007) [2023-03-06 18:13:49,206][23882] Updated weights for policy 0, policy_version 71620 (0.0006) [2023-03-06 18:13:50,003][23882] Updated weights for policy 0, policy_version 71630 (0.0006) [2023-03-06 18:13:50,787][23882] Updated weights for policy 0, policy_version 71640 (0.0007) [2023-03-06 18:13:51,579][23882] Updated weights for policy 0, policy_version 71650 (0.0006) [2023-03-06 18:13:51,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13027.4). Total num frames: 73371648. Throughput: 0: 13027.3. Samples: 73356732. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:13:51,748][23556] Avg episode reward: [(0, '1976.045')] [2023-03-06 18:13:52,354][23882] Updated weights for policy 0, policy_version 71660 (0.0007) [2023-03-06 18:13:53,142][23882] Updated weights for policy 0, policy_version 71670 (0.0006) [2023-03-06 18:13:53,934][23882] Updated weights for policy 0, policy_version 71680 (0.0007) [2023-03-06 18:13:54,718][23882] Updated weights for policy 0, policy_version 71690 (0.0006) [2023-03-06 18:13:55,506][23882] Updated weights for policy 0, policy_version 71700 (0.0006) [2023-03-06 18:13:56,284][23882] Updated weights for policy 0, policy_version 71710 (0.0006) [2023-03-06 18:13:56,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13027.4). Total num frames: 73436160. Throughput: 0: 13028.8. Samples: 73434914. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:13:56,748][23556] Avg episode reward: [(0, '2013.322')] [2023-03-06 18:13:56,760][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000071716_73437184.pth... [2023-03-06 18:13:56,789][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000068663_70310912.pth [2023-03-06 18:13:57,077][23882] Updated weights for policy 0, policy_version 71720 (0.0006) [2023-03-06 18:13:57,868][23882] Updated weights for policy 0, policy_version 71730 (0.0007) [2023-03-06 18:13:58,654][23882] Updated weights for policy 0, policy_version 71740 (0.0006) [2023-03-06 18:13:59,458][23882] Updated weights for policy 0, policy_version 71750 (0.0007) [2023-03-06 18:14:00,244][23882] Updated weights for policy 0, policy_version 71760 (0.0006) [2023-03-06 18:14:01,024][23882] Updated weights for policy 0, policy_version 71770 (0.0006) [2023-03-06 18:14:01,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13027.4). Total num frames: 73501696. Throughput: 0: 13021.1. Samples: 73473694. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:14:01,748][23556] Avg episode reward: [(0, '1991.134')] [2023-03-06 18:14:01,821][23882] Updated weights for policy 0, policy_version 71780 (0.0007) [2023-03-06 18:14:02,613][23882] Updated weights for policy 0, policy_version 71790 (0.0008) [2023-03-06 18:14:03,413][23882] Updated weights for policy 0, policy_version 71800 (0.0006) [2023-03-06 18:14:04,182][23882] Updated weights for policy 0, policy_version 71810 (0.0007) [2023-03-06 18:14:04,949][23882] Updated weights for policy 0, policy_version 71820 (0.0006) [2023-03-06 18:14:05,743][23882] Updated weights for policy 0, policy_version 71830 (0.0006) [2023-03-06 18:14:06,513][23882] Updated weights for policy 0, policy_version 71840 (0.0007) [2023-03-06 18:14:06,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13027.4). Total num frames: 73567232. Throughput: 0: 13027.0. Samples: 73552024. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:14:06,748][23556] Avg episode reward: [(0, '1976.664')] [2023-03-06 18:14:07,295][23882] Updated weights for policy 0, policy_version 71850 (0.0007) [2023-03-06 18:14:08,086][23882] Updated weights for policy 0, policy_version 71860 (0.0006) [2023-03-06 18:14:08,857][23882] Updated weights for policy 0, policy_version 71870 (0.0008) [2023-03-06 18:14:09,664][23882] Updated weights for policy 0, policy_version 71880 (0.0006) [2023-03-06 18:14:10,438][23882] Updated weights for policy 0, policy_version 71890 (0.0006) [2023-03-06 18:14:11,234][23882] Updated weights for policy 0, policy_version 71900 (0.0006) [2023-03-06 18:14:11,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13023.9). Total num frames: 73631744. Throughput: 0: 13023.5. Samples: 73630213. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:14:11,748][23556] Avg episode reward: [(0, '2036.789')] [2023-03-06 18:14:12,020][23882] Updated weights for policy 0, policy_version 71910 (0.0007) [2023-03-06 18:14:12,794][23882] Updated weights for policy 0, policy_version 71920 (0.0005) [2023-03-06 18:14:13,576][23882] Updated weights for policy 0, policy_version 71930 (0.0006) [2023-03-06 18:14:14,352][23882] Updated weights for policy 0, policy_version 71940 (0.0006) [2023-03-06 18:14:15,130][23882] Updated weights for policy 0, policy_version 71950 (0.0006) [2023-03-06 18:14:15,907][23882] Updated weights for policy 0, policy_version 71960 (0.0006) [2023-03-06 18:14:16,714][23882] Updated weights for policy 0, policy_version 71970 (0.0006) [2023-03-06 18:14:16,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13027.4). Total num frames: 73697280. Throughput: 0: 13027.6. Samples: 73669660. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:14:16,748][23556] Avg episode reward: [(0, '1884.930')] [2023-03-06 18:14:17,491][23882] Updated weights for policy 0, policy_version 71980 (0.0006) [2023-03-06 18:14:18,301][23882] Updated weights for policy 0, policy_version 71990 (0.0007) [2023-03-06 18:14:19,077][23882] Updated weights for policy 0, policy_version 72000 (0.0006) [2023-03-06 18:14:19,854][23882] Updated weights for policy 0, policy_version 72010 (0.0006) [2023-03-06 18:14:20,645][23882] Updated weights for policy 0, policy_version 72020 (0.0006) [2023-03-06 18:14:21,433][23882] Updated weights for policy 0, policy_version 72030 (0.0006) [2023-03-06 18:14:21,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13027.4). Total num frames: 73762816. Throughput: 0: 13025.5. Samples: 73747782. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:14:21,748][23556] Avg episode reward: [(0, '1577.518')] [2023-03-06 18:14:22,222][23882] Updated weights for policy 0, policy_version 72040 (0.0006) [2023-03-06 18:14:23,014][23882] Updated weights for policy 0, policy_version 72050 (0.0006) [2023-03-06 18:14:23,791][23882] Updated weights for policy 0, policy_version 72060 (0.0006) [2023-03-06 18:14:24,554][23882] Updated weights for policy 0, policy_version 72070 (0.0006) [2023-03-06 18:14:25,350][23882] Updated weights for policy 0, policy_version 72080 (0.0007) [2023-03-06 18:14:26,142][23882] Updated weights for policy 0, policy_version 72090 (0.0007) [2023-03-06 18:14:26,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13027.4). Total num frames: 73827328. Throughput: 0: 13025.5. Samples: 73826061. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:14:26,748][23556] Avg episode reward: [(0, '1473.635')] [2023-03-06 18:14:26,924][23882] Updated weights for policy 0, policy_version 72100 (0.0007) [2023-03-06 18:14:27,713][23882] Updated weights for policy 0, policy_version 72110 (0.0006) [2023-03-06 18:14:28,507][23882] Updated weights for policy 0, policy_version 72120 (0.0007) [2023-03-06 18:14:29,275][23882] Updated weights for policy 0, policy_version 72130 (0.0007) [2023-03-06 18:14:30,074][23882] Updated weights for policy 0, policy_version 72140 (0.0007) [2023-03-06 18:14:30,865][23882] Updated weights for policy 0, policy_version 72150 (0.0006) [2023-03-06 18:14:31,633][23882] Updated weights for policy 0, policy_version 72160 (0.0007) [2023-03-06 18:14:31,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13027.4). Total num frames: 73892864. Throughput: 0: 13034.6. Samples: 73865230. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:14:31,748][23556] Avg episode reward: [(0, '1691.621')] [2023-03-06 18:14:32,417][23882] Updated weights for policy 0, policy_version 72170 (0.0007) [2023-03-06 18:14:33,214][23882] Updated weights for policy 0, policy_version 72180 (0.0007) [2023-03-06 18:14:33,988][23882] Updated weights for policy 0, policy_version 72190 (0.0007) [2023-03-06 18:14:34,773][23882] Updated weights for policy 0, policy_version 72200 (0.0007) [2023-03-06 18:14:35,558][23882] Updated weights for policy 0, policy_version 72210 (0.0006) [2023-03-06 18:14:36,333][23882] Updated weights for policy 0, policy_version 72220 (0.0007) [2023-03-06 18:14:36,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13030.8). Total num frames: 73958400. Throughput: 0: 13034.5. Samples: 73943287. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:14:36,748][23556] Avg episode reward: [(0, '1864.002')] [2023-03-06 18:14:37,116][23882] Updated weights for policy 0, policy_version 72230 (0.0006) [2023-03-06 18:14:37,907][23882] Updated weights for policy 0, policy_version 72240 (0.0007) [2023-03-06 18:14:38,673][23882] Updated weights for policy 0, policy_version 72250 (0.0006) [2023-03-06 18:14:39,477][23882] Updated weights for policy 0, policy_version 72260 (0.0006) [2023-03-06 18:14:40,263][23882] Updated weights for policy 0, policy_version 72270 (0.0007) [2023-03-06 18:14:41,064][23882] Updated weights for policy 0, policy_version 72280 (0.0006) [2023-03-06 18:14:41,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13027.4). Total num frames: 74022912. Throughput: 0: 13040.8. Samples: 74021748. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:14:41,748][23556] Avg episode reward: [(0, '1621.823')] [2023-03-06 18:14:41,834][23882] Updated weights for policy 0, policy_version 72290 (0.0007) [2023-03-06 18:14:42,607][23882] Updated weights for policy 0, policy_version 72300 (0.0006) [2023-03-06 18:14:43,414][23882] Updated weights for policy 0, policy_version 72310 (0.0006) [2023-03-06 18:14:44,183][23882] Updated weights for policy 0, policy_version 72320 (0.0007) [2023-03-06 18:14:44,959][23882] Updated weights for policy 0, policy_version 72330 (0.0006) [2023-03-06 18:14:45,738][23882] Updated weights for policy 0, policy_version 72340 (0.0006) [2023-03-06 18:14:46,513][23882] Updated weights for policy 0, policy_version 72350 (0.0006) [2023-03-06 18:14:46,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13027.4). Total num frames: 74088448. Throughput: 0: 13055.0. Samples: 74061169. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:14:46,748][23556] Avg episode reward: [(0, '1863.476')] [2023-03-06 18:14:47,308][23882] Updated weights for policy 0, policy_version 72360 (0.0007) [2023-03-06 18:14:48,098][23882] Updated weights for policy 0, policy_version 72370 (0.0007) [2023-03-06 18:14:48,869][23882] Updated weights for policy 0, policy_version 72380 (0.0007) [2023-03-06 18:14:49,661][23882] Updated weights for policy 0, policy_version 72390 (0.0007) [2023-03-06 18:14:50,445][23882] Updated weights for policy 0, policy_version 72400 (0.0006) [2023-03-06 18:14:51,222][23882] Updated weights for policy 0, policy_version 72410 (0.0006) [2023-03-06 18:14:51,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13027.4). Total num frames: 74153984. Throughput: 0: 13053.8. Samples: 74139446. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:14:51,748][23556] Avg episode reward: [(0, '1805.522')] [2023-03-06 18:14:52,014][23882] Updated weights for policy 0, policy_version 72420 (0.0007) [2023-03-06 18:14:52,829][23882] Updated weights for policy 0, policy_version 72430 (0.0006) [2023-03-06 18:14:53,599][23882] Updated weights for policy 0, policy_version 72440 (0.0006) [2023-03-06 18:14:54,367][23882] Updated weights for policy 0, policy_version 72450 (0.0006) [2023-03-06 18:14:55,156][23882] Updated weights for policy 0, policy_version 72460 (0.0007) [2023-03-06 18:14:55,937][23882] Updated weights for policy 0, policy_version 72470 (0.0006) [2023-03-06 18:14:56,722][23882] Updated weights for policy 0, policy_version 72480 (0.0006) [2023-03-06 18:14:56,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13030.8). Total num frames: 74219520. Throughput: 0: 13054.7. Samples: 74217673. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:14:56,748][23556] Avg episode reward: [(0, '1855.910')] [2023-03-06 18:14:57,523][23882] Updated weights for policy 0, policy_version 72490 (0.0007) [2023-03-06 18:14:58,300][23882] Updated weights for policy 0, policy_version 72500 (0.0006) [2023-03-06 18:14:59,082][23882] Updated weights for policy 0, policy_version 72510 (0.0006) [2023-03-06 18:14:59,876][23882] Updated weights for policy 0, policy_version 72520 (0.0006) [2023-03-06 18:15:00,677][23882] Updated weights for policy 0, policy_version 72530 (0.0006) [2023-03-06 18:15:01,456][23882] Updated weights for policy 0, policy_version 72540 (0.0007) [2023-03-06 18:15:01,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13027.4). Total num frames: 74284032. Throughput: 0: 13047.8. Samples: 74256812. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:15:01,748][23556] Avg episode reward: [(0, '1931.951')] [2023-03-06 18:15:02,221][23882] Updated weights for policy 0, policy_version 72550 (0.0006) [2023-03-06 18:15:03,024][23882] Updated weights for policy 0, policy_version 72560 (0.0006) [2023-03-06 18:15:03,785][23882] Updated weights for policy 0, policy_version 72570 (0.0006) [2023-03-06 18:15:04,566][23882] Updated weights for policy 0, policy_version 72580 (0.0006) [2023-03-06 18:15:05,355][23882] Updated weights for policy 0, policy_version 72590 (0.0006) [2023-03-06 18:15:06,126][23882] Updated weights for policy 0, policy_version 72600 (0.0007) [2023-03-06 18:15:06,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13030.8). Total num frames: 74349568. Throughput: 0: 13053.8. Samples: 74335203. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:15:06,748][23556] Avg episode reward: [(0, '1832.943')] [2023-03-06 18:15:06,923][23882] Updated weights for policy 0, policy_version 72610 (0.0005) [2023-03-06 18:15:07,706][23882] Updated weights for policy 0, policy_version 72620 (0.0007) [2023-03-06 18:15:08,482][23882] Updated weights for policy 0, policy_version 72630 (0.0006) [2023-03-06 18:15:09,277][23882] Updated weights for policy 0, policy_version 72640 (0.0007) [2023-03-06 18:15:10,067][23882] Updated weights for policy 0, policy_version 72650 (0.0007) [2023-03-06 18:15:10,853][23882] Updated weights for policy 0, policy_version 72660 (0.0006) [2023-03-06 18:15:11,658][23882] Updated weights for policy 0, policy_version 72670 (0.0006) [2023-03-06 18:15:11,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13030.8). Total num frames: 74415104. Throughput: 0: 13049.5. Samples: 74413288. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:15:11,748][23556] Avg episode reward: [(0, '1950.353')] [2023-03-06 18:15:12,434][23882] Updated weights for policy 0, policy_version 72680 (0.0006) [2023-03-06 18:15:13,235][23882] Updated weights for policy 0, policy_version 72690 (0.0007) [2023-03-06 18:15:14,018][23882] Updated weights for policy 0, policy_version 72700 (0.0006) [2023-03-06 18:15:14,806][23882] Updated weights for policy 0, policy_version 72710 (0.0006) [2023-03-06 18:15:15,610][23882] Updated weights for policy 0, policy_version 72720 (0.0006) [2023-03-06 18:15:16,415][23882] Updated weights for policy 0, policy_version 72730 (0.0007) [2023-03-06 18:15:16,459][23831] KL-divergence is very high: 299.4135 [2023-03-06 18:15:16,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13039.0, 300 sec: 13030.8). Total num frames: 74479616. Throughput: 0: 13044.2. Samples: 74452220. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:15:16,748][23556] Avg episode reward: [(0, '1978.025')] [2023-03-06 18:15:17,191][23882] Updated weights for policy 0, policy_version 72740 (0.0006) [2023-03-06 18:15:17,966][23882] Updated weights for policy 0, policy_version 72750 (0.0006) [2023-03-06 18:15:18,758][23882] Updated weights for policy 0, policy_version 72760 (0.0006) [2023-03-06 18:15:19,549][23882] Updated weights for policy 0, policy_version 72770 (0.0007) [2023-03-06 18:15:20,326][23882] Updated weights for policy 0, policy_version 72780 (0.0006) [2023-03-06 18:15:21,099][23882] Updated weights for policy 0, policy_version 72790 (0.0007) [2023-03-06 18:15:21,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13030.8). Total num frames: 74545152. Throughput: 0: 13044.3. Samples: 74530282. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:15:21,748][23556] Avg episode reward: [(0, '1894.086')] [2023-03-06 18:15:21,888][23882] Updated weights for policy 0, policy_version 72800 (0.0006) [2023-03-06 18:15:22,660][23882] Updated weights for policy 0, policy_version 72810 (0.0006) [2023-03-06 18:15:23,450][23882] Updated weights for policy 0, policy_version 72820 (0.0006) [2023-03-06 18:15:24,220][23882] Updated weights for policy 0, policy_version 72830 (0.0006) [2023-03-06 18:15:24,994][23882] Updated weights for policy 0, policy_version 72840 (0.0006) [2023-03-06 18:15:25,768][23882] Updated weights for policy 0, policy_version 72850 (0.0006) [2023-03-06 18:15:26,583][23882] Updated weights for policy 0, policy_version 72860 (0.0007) [2023-03-06 18:15:26,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13030.8). Total num frames: 74610688. Throughput: 0: 13048.0. Samples: 74608909. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:15:26,748][23556] Avg episode reward: [(0, '1945.325')] [2023-03-06 18:15:27,342][23882] Updated weights for policy 0, policy_version 72870 (0.0005) [2023-03-06 18:15:28,135][23882] Updated weights for policy 0, policy_version 72880 (0.0006) [2023-03-06 18:15:28,938][23882] Updated weights for policy 0, policy_version 72890 (0.0006) [2023-03-06 18:15:29,697][23882] Updated weights for policy 0, policy_version 72900 (0.0006) [2023-03-06 18:15:30,470][23882] Updated weights for policy 0, policy_version 72910 (0.0006) [2023-03-06 18:15:31,259][23882] Updated weights for policy 0, policy_version 72920 (0.0007) [2023-03-06 18:15:31,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13034.3). Total num frames: 74676224. Throughput: 0: 13044.5. Samples: 74648171. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:15:31,748][23556] Avg episode reward: [(0, '1880.688')] [2023-03-06 18:15:32,055][23882] Updated weights for policy 0, policy_version 72930 (0.0007) [2023-03-06 18:15:32,828][23882] Updated weights for policy 0, policy_version 72940 (0.0007) [2023-03-06 18:15:33,638][23882] Updated weights for policy 0, policy_version 72950 (0.0006) [2023-03-06 18:15:34,420][23882] Updated weights for policy 0, policy_version 72960 (0.0006) [2023-03-06 18:15:35,208][23882] Updated weights for policy 0, policy_version 72970 (0.0007) [2023-03-06 18:15:35,983][23882] Updated weights for policy 0, policy_version 72980 (0.0006) [2023-03-06 18:15:36,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13030.8). Total num frames: 74740736. Throughput: 0: 13045.4. Samples: 74726492. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:15:36,748][23556] Avg episode reward: [(0, '1996.844')] [2023-03-06 18:15:36,762][23882] Updated weights for policy 0, policy_version 72990 (0.0006) [2023-03-06 18:15:37,546][23882] Updated weights for policy 0, policy_version 73000 (0.0006) [2023-03-06 18:15:37,691][23831] KL-divergence is very high: 116103.1562 [2023-03-06 18:15:37,907][23831] KL-divergence is very high: 38980140.0000 [2023-03-06 18:15:38,326][23882] Updated weights for policy 0, policy_version 73010 (0.0006) [2023-03-06 18:15:39,115][23882] Updated weights for policy 0, policy_version 73020 (0.0007) [2023-03-06 18:15:39,892][23882] Updated weights for policy 0, policy_version 73030 (0.0006) [2023-03-06 18:15:40,662][23882] Updated weights for policy 0, policy_version 73040 (0.0006) [2023-03-06 18:15:41,456][23882] Updated weights for policy 0, policy_version 73050 (0.0007) [2023-03-06 18:15:41,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13034.3). Total num frames: 74806272. Throughput: 0: 13052.3. Samples: 74805025. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:15:41,758][23556] Avg episode reward: [(0, '1914.592')] [2023-03-06 18:15:42,231][23882] Updated weights for policy 0, policy_version 73060 (0.0006) [2023-03-06 18:15:43,022][23882] Updated weights for policy 0, policy_version 73070 (0.0007) [2023-03-06 18:15:43,806][23882] Updated weights for policy 0, policy_version 73080 (0.0006) [2023-03-06 18:15:44,577][23882] Updated weights for policy 0, policy_version 73090 (0.0006) [2023-03-06 18:15:45,359][23882] Updated weights for policy 0, policy_version 73100 (0.0006) [2023-03-06 18:15:46,152][23882] Updated weights for policy 0, policy_version 73110 (0.0007) [2023-03-06 18:15:46,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 74871808. Throughput: 0: 13054.9. Samples: 74844285. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:15:46,748][23556] Avg episode reward: [(0, '1965.895')] [2023-03-06 18:15:46,934][23882] Updated weights for policy 0, policy_version 73120 (0.0007) [2023-03-06 18:15:47,715][23882] Updated weights for policy 0, policy_version 73130 (0.0007) [2023-03-06 18:15:48,506][23882] Updated weights for policy 0, policy_version 73140 (0.0006) [2023-03-06 18:15:49,282][23882] Updated weights for policy 0, policy_version 73150 (0.0006) [2023-03-06 18:15:50,070][23882] Updated weights for policy 0, policy_version 73160 (0.0005) [2023-03-06 18:15:50,861][23882] Updated weights for policy 0, policy_version 73170 (0.0007) [2023-03-06 18:15:51,653][23882] Updated weights for policy 0, policy_version 73180 (0.0007) [2023-03-06 18:15:51,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13034.3). Total num frames: 74937344. Throughput: 0: 13054.5. Samples: 74922655. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:15:51,748][23556] Avg episode reward: [(0, '1735.550')] [2023-03-06 18:15:52,408][23882] Updated weights for policy 0, policy_version 73190 (0.0006) [2023-03-06 18:15:53,202][23882] Updated weights for policy 0, policy_version 73200 (0.0006) [2023-03-06 18:15:53,987][23882] Updated weights for policy 0, policy_version 73210 (0.0006) [2023-03-06 18:15:54,778][23882] Updated weights for policy 0, policy_version 73220 (0.0007) [2023-03-06 18:15:55,566][23882] Updated weights for policy 0, policy_version 73230 (0.0006) [2023-03-06 18:15:56,350][23882] Updated weights for policy 0, policy_version 73240 (0.0007) [2023-03-06 18:15:56,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 75002880. Throughput: 0: 13060.7. Samples: 75001023. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:15:56,748][23556] Avg episode reward: [(0, '1755.559')] [2023-03-06 18:15:56,753][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000073245_75002880.pth... [2023-03-06 18:15:56,782][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000070189_71873536.pth [2023-03-06 18:15:57,122][23882] Updated weights for policy 0, policy_version 73250 (0.0006) [2023-03-06 18:15:57,909][23882] Updated weights for policy 0, policy_version 73260 (0.0007) [2023-03-06 18:15:58,689][23882] Updated weights for policy 0, policy_version 73270 (0.0007) [2023-03-06 18:15:59,471][23882] Updated weights for policy 0, policy_version 73280 (0.0006) [2023-03-06 18:16:00,261][23882] Updated weights for policy 0, policy_version 73290 (0.0006) [2023-03-06 18:16:01,045][23882] Updated weights for policy 0, policy_version 73300 (0.0006) [2023-03-06 18:16:01,748][23556] Fps is (10 sec: 13107.0, 60 sec: 13073.0, 300 sec: 13037.8). Total num frames: 75068416. Throughput: 0: 13069.4. Samples: 75040346. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:16:01,749][23556] Avg episode reward: [(0, '1940.199')] [2023-03-06 18:16:01,829][23882] Updated weights for policy 0, policy_version 73310 (0.0007) [2023-03-06 18:16:02,613][23882] Updated weights for policy 0, policy_version 73320 (0.0006) [2023-03-06 18:16:03,407][23882] Updated weights for policy 0, policy_version 73330 (0.0006) [2023-03-06 18:16:04,185][23882] Updated weights for policy 0, policy_version 73340 (0.0006) [2023-03-06 18:16:04,968][23882] Updated weights for policy 0, policy_version 73350 (0.0006) [2023-03-06 18:16:05,745][23882] Updated weights for policy 0, policy_version 73360 (0.0007) [2023-03-06 18:16:06,515][23882] Updated weights for policy 0, policy_version 73370 (0.0006) [2023-03-06 18:16:06,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 75132928. Throughput: 0: 13074.7. Samples: 75118645. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:16:06,748][23556] Avg episode reward: [(0, '2019.247')] [2023-03-06 18:16:07,315][23882] Updated weights for policy 0, policy_version 73380 (0.0006) [2023-03-06 18:16:08,100][23882] Updated weights for policy 0, policy_version 73390 (0.0006) [2023-03-06 18:16:08,878][23882] Updated weights for policy 0, policy_version 73400 (0.0006) [2023-03-06 18:16:09,662][23882] Updated weights for policy 0, policy_version 73410 (0.0006) [2023-03-06 18:16:10,450][23882] Updated weights for policy 0, policy_version 73420 (0.0006) [2023-03-06 18:16:11,241][23882] Updated weights for policy 0, policy_version 73430 (0.0007) [2023-03-06 18:16:11,631][23831] KL-divergence is very high: 175.1221 [2023-03-06 18:16:11,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 75198464. Throughput: 0: 13065.0. Samples: 75196836. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:16:11,748][23556] Avg episode reward: [(0, '2016.166')] [2023-03-06 18:16:12,018][23882] Updated weights for policy 0, policy_version 73440 (0.0006) [2023-03-06 18:16:12,796][23882] Updated weights for policy 0, policy_version 73450 (0.0006) [2023-03-06 18:16:13,576][23882] Updated weights for policy 0, policy_version 73460 (0.0006) [2023-03-06 18:16:14,377][23882] Updated weights for policy 0, policy_version 73470 (0.0006) [2023-03-06 18:16:15,150][23882] Updated weights for policy 0, policy_version 73480 (0.0005) [2023-03-06 18:16:15,933][23882] Updated weights for policy 0, policy_version 73490 (0.0007) [2023-03-06 18:16:16,552][23831] KL-divergence is very high: 50282.0039 [2023-03-06 18:16:16,648][23831] KL-divergence is very high: 435.7275 [2023-03-06 18:16:16,721][23831] KL-divergence is very high: 233.5200 [2023-03-06 18:16:16,729][23882] Updated weights for policy 0, policy_version 73500 (0.0006) [2023-03-06 18:16:16,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13073.0, 300 sec: 13041.2). Total num frames: 75264000. Throughput: 0: 13067.3. Samples: 75236202. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:16:16,748][23556] Avg episode reward: [(0, '1897.377')] [2023-03-06 18:16:17,517][23882] Updated weights for policy 0, policy_version 73510 (0.0006) [2023-03-06 18:16:18,304][23882] Updated weights for policy 0, policy_version 73520 (0.0008) [2023-03-06 18:16:19,076][23882] Updated weights for policy 0, policy_version 73530 (0.0006) [2023-03-06 18:16:19,308][23831] KL-divergence is very high: 618.9828 [2023-03-06 18:16:19,874][23882] Updated weights for policy 0, policy_version 73540 (0.0006) [2023-03-06 18:16:20,653][23882] Updated weights for policy 0, policy_version 73550 (0.0008) [2023-03-06 18:16:21,443][23882] Updated weights for policy 0, policy_version 73560 (0.0006) [2023-03-06 18:16:21,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 75328512. Throughput: 0: 13064.5. Samples: 75314395. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:16:21,748][23556] Avg episode reward: [(0, '1832.504')] [2023-03-06 18:16:22,249][23882] Updated weights for policy 0, policy_version 73570 (0.0006) [2023-03-06 18:16:23,040][23882] Updated weights for policy 0, policy_version 73580 (0.0007) [2023-03-06 18:16:23,814][23882] Updated weights for policy 0, policy_version 73590 (0.0006) [2023-03-06 18:16:24,597][23882] Updated weights for policy 0, policy_version 73600 (0.0007) [2023-03-06 18:16:25,398][23882] Updated weights for policy 0, policy_version 73610 (0.0006) [2023-03-06 18:16:26,183][23882] Updated weights for policy 0, policy_version 73620 (0.0007) [2023-03-06 18:16:26,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 75394048. Throughput: 0: 13047.9. Samples: 75392183. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:16:26,748][23556] Avg episode reward: [(0, '1748.171')] [2023-03-06 18:16:26,965][23882] Updated weights for policy 0, policy_version 73630 (0.0007) [2023-03-06 18:16:27,776][23882] Updated weights for policy 0, policy_version 73640 (0.0006) [2023-03-06 18:16:28,533][23882] Updated weights for policy 0, policy_version 73650 (0.0006) [2023-03-06 18:16:29,313][23882] Updated weights for policy 0, policy_version 73660 (0.0006) [2023-03-06 18:16:30,105][23882] Updated weights for policy 0, policy_version 73670 (0.0006) [2023-03-06 18:16:30,885][23882] Updated weights for policy 0, policy_version 73680 (0.0007) [2023-03-06 18:16:31,679][23882] Updated weights for policy 0, policy_version 73690 (0.0006) [2023-03-06 18:16:31,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 75458560. Throughput: 0: 13043.0. Samples: 75431217. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:16:31,748][23556] Avg episode reward: [(0, '1808.163')] [2023-03-06 18:16:32,458][23882] Updated weights for policy 0, policy_version 73700 (0.0006) [2023-03-06 18:16:33,260][23882] Updated weights for policy 0, policy_version 73710 (0.0006) [2023-03-06 18:16:34,049][23882] Updated weights for policy 0, policy_version 73720 (0.0007) [2023-03-06 18:16:34,841][23882] Updated weights for policy 0, policy_version 73730 (0.0006) [2023-03-06 18:16:35,640][23882] Updated weights for policy 0, policy_version 73740 (0.0007) [2023-03-06 18:16:36,426][23882] Updated weights for policy 0, policy_version 73750 (0.0007) [2023-03-06 18:16:36,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 75524096. Throughput: 0: 13034.8. Samples: 75509221. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:16:36,748][23556] Avg episode reward: [(0, '1619.244')] [2023-03-06 18:16:37,213][23882] Updated weights for policy 0, policy_version 73760 (0.0005) [2023-03-06 18:16:38,001][23882] Updated weights for policy 0, policy_version 73770 (0.0006) [2023-03-06 18:16:38,781][23882] Updated weights for policy 0, policy_version 73780 (0.0006) [2023-03-06 18:16:39,584][23882] Updated weights for policy 0, policy_version 73790 (0.0006) [2023-03-06 18:16:40,374][23882] Updated weights for policy 0, policy_version 73800 (0.0007) [2023-03-06 18:16:41,158][23882] Updated weights for policy 0, policy_version 73810 (0.0007) [2023-03-06 18:16:41,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 75588608. Throughput: 0: 13025.3. Samples: 75587160. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:16:41,748][23556] Avg episode reward: [(0, '1882.721')] [2023-03-06 18:16:41,949][23882] Updated weights for policy 0, policy_version 73820 (0.0007) [2023-03-06 18:16:42,727][23882] Updated weights for policy 0, policy_version 73830 (0.0006) [2023-03-06 18:16:43,511][23882] Updated weights for policy 0, policy_version 73840 (0.0006) [2023-03-06 18:16:44,289][23882] Updated weights for policy 0, policy_version 73850 (0.0006) [2023-03-06 18:16:45,073][23882] Updated weights for policy 0, policy_version 73860 (0.0006) [2023-03-06 18:16:45,868][23882] Updated weights for policy 0, policy_version 73870 (0.0007) [2023-03-06 18:16:46,648][23882] Updated weights for policy 0, policy_version 73880 (0.0006) [2023-03-06 18:16:46,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13039.0, 300 sec: 13037.8). Total num frames: 75654144. Throughput: 0: 13019.1. Samples: 75626204. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:16:46,748][23556] Avg episode reward: [(0, '2012.273')] [2023-03-06 18:16:47,429][23882] Updated weights for policy 0, policy_version 73890 (0.0006) [2023-03-06 18:16:48,213][23882] Updated weights for policy 0, policy_version 73900 (0.0006) [2023-03-06 18:16:48,995][23882] Updated weights for policy 0, policy_version 73910 (0.0006) [2023-03-06 18:16:49,798][23882] Updated weights for policy 0, policy_version 73920 (0.0006) [2023-03-06 18:16:50,579][23882] Updated weights for policy 0, policy_version 73930 (0.0006) [2023-03-06 18:16:51,360][23882] Updated weights for policy 0, policy_version 73940 (0.0006) [2023-03-06 18:16:51,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13021.8, 300 sec: 13034.3). Total num frames: 75718656. Throughput: 0: 13017.2. Samples: 75704419. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:16:51,748][23556] Avg episode reward: [(0, '2098.236')] [2023-03-06 18:16:51,749][23831] Saving new best policy, reward=2098.236! [2023-03-06 18:16:52,173][23882] Updated weights for policy 0, policy_version 73950 (0.0007) [2023-03-06 18:16:52,968][23882] Updated weights for policy 0, policy_version 73960 (0.0006) [2023-03-06 18:16:53,742][23882] Updated weights for policy 0, policy_version 73970 (0.0006) [2023-03-06 18:16:54,517][23882] Updated weights for policy 0, policy_version 73980 (0.0007) [2023-03-06 18:16:54,596][23831] KL-divergence is very high: 7260.5703 [2023-03-06 18:16:55,315][23882] Updated weights for policy 0, policy_version 73990 (0.0008) [2023-03-06 18:16:56,085][23882] Updated weights for policy 0, policy_version 74000 (0.0005) [2023-03-06 18:16:56,748][23556] Fps is (10 sec: 13004.6, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 75784192. Throughput: 0: 13015.9. Samples: 75782554. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:16:56,748][23556] Avg episode reward: [(0, '2039.879')] [2023-03-06 18:16:56,886][23882] Updated weights for policy 0, policy_version 74010 (0.0006) [2023-03-06 18:16:57,670][23882] Updated weights for policy 0, policy_version 74020 (0.0007) [2023-03-06 18:16:58,452][23882] Updated weights for policy 0, policy_version 74030 (0.0006) [2023-03-06 18:16:59,248][23882] Updated weights for policy 0, policy_version 74040 (0.0006) [2023-03-06 18:17:00,047][23882] Updated weights for policy 0, policy_version 74050 (0.0006) [2023-03-06 18:17:00,816][23882] Updated weights for policy 0, policy_version 74060 (0.0007) [2023-03-06 18:17:01,598][23882] Updated weights for policy 0, policy_version 74070 (0.0006) [2023-03-06 18:17:01,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13004.8, 300 sec: 13034.3). Total num frames: 75848704. Throughput: 0: 13003.4. Samples: 75821356. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:17:01,748][23556] Avg episode reward: [(0, '2051.078')] [2023-03-06 18:17:02,393][23882] Updated weights for policy 0, policy_version 74080 (0.0006) [2023-03-06 18:17:03,177][23882] Updated weights for policy 0, policy_version 74090 (0.0006) [2023-03-06 18:17:03,961][23882] Updated weights for policy 0, policy_version 74100 (0.0007) [2023-03-06 18:17:04,727][23882] Updated weights for policy 0, policy_version 74110 (0.0006) [2023-03-06 18:17:05,518][23882] Updated weights for policy 0, policy_version 74120 (0.0006) [2023-03-06 18:17:06,322][23882] Updated weights for policy 0, policy_version 74130 (0.0006) [2023-03-06 18:17:06,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 75914240. Throughput: 0: 13007.6. Samples: 75899737. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:17:06,748][23556] Avg episode reward: [(0, '1992.901')] [2023-03-06 18:17:07,093][23882] Updated weights for policy 0, policy_version 74140 (0.0005) [2023-03-06 18:17:07,886][23882] Updated weights for policy 0, policy_version 74150 (0.0007) [2023-03-06 18:17:08,685][23882] Updated weights for policy 0, policy_version 74160 (0.0006) [2023-03-06 18:17:09,458][23882] Updated weights for policy 0, policy_version 74170 (0.0006) [2023-03-06 18:17:10,251][23882] Updated weights for policy 0, policy_version 74180 (0.0006) [2023-03-06 18:17:11,048][23882] Updated weights for policy 0, policy_version 74190 (0.0006) [2023-03-06 18:17:11,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13021.9, 300 sec: 13037.8). Total num frames: 75979776. Throughput: 0: 13012.8. Samples: 75977757. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:17:11,748][23556] Avg episode reward: [(0, '2008.299')] [2023-03-06 18:17:11,830][23882] Updated weights for policy 0, policy_version 74200 (0.0007) [2023-03-06 18:17:12,611][23882] Updated weights for policy 0, policy_version 74210 (0.0006) [2023-03-06 18:17:13,405][23882] Updated weights for policy 0, policy_version 74220 (0.0007) [2023-03-06 18:17:14,189][23882] Updated weights for policy 0, policy_version 74230 (0.0008) [2023-03-06 18:17:14,984][23882] Updated weights for policy 0, policy_version 74240 (0.0006) [2023-03-06 18:17:15,754][23882] Updated weights for policy 0, policy_version 74250 (0.0007) [2023-03-06 18:17:16,532][23882] Updated weights for policy 0, policy_version 74260 (0.0007) [2023-03-06 18:17:16,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13034.3). Total num frames: 76044288. Throughput: 0: 13013.6. Samples: 76016829. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:17:16,748][23556] Avg episode reward: [(0, '1988.691')] [2023-03-06 18:17:17,340][23882] Updated weights for policy 0, policy_version 74270 (0.0006) [2023-03-06 18:17:18,121][23882] Updated weights for policy 0, policy_version 74280 (0.0006) [2023-03-06 18:17:18,902][23882] Updated weights for policy 0, policy_version 74290 (0.0007) [2023-03-06 18:17:19,677][23882] Updated weights for policy 0, policy_version 74300 (0.0006) [2023-03-06 18:17:20,468][23882] Updated weights for policy 0, policy_version 74310 (0.0006) [2023-03-06 18:17:21,244][23882] Updated weights for policy 0, policy_version 74320 (0.0006) [2023-03-06 18:17:21,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13021.8, 300 sec: 13037.8). Total num frames: 76109824. Throughput: 0: 13020.6. Samples: 76095148. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:17:21,749][23556] Avg episode reward: [(0, '2102.021')] [2023-03-06 18:17:21,749][23831] Saving new best policy, reward=2102.021! [2023-03-06 18:17:22,056][23882] Updated weights for policy 0, policy_version 74330 (0.0007) [2023-03-06 18:17:22,833][23882] Updated weights for policy 0, policy_version 74340 (0.0006) [2023-03-06 18:17:23,626][23882] Updated weights for policy 0, policy_version 74350 (0.0007) [2023-03-06 18:17:24,397][23882] Updated weights for policy 0, policy_version 74360 (0.0006) [2023-03-06 18:17:25,188][23882] Updated weights for policy 0, policy_version 74370 (0.0007) [2023-03-06 18:17:25,986][23882] Updated weights for policy 0, policy_version 74380 (0.0006) [2023-03-06 18:17:26,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13034.3). Total num frames: 76174336. Throughput: 0: 13013.9. Samples: 76172785. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:17:26,748][23556] Avg episode reward: [(0, '1955.724')] [2023-03-06 18:17:26,770][23882] Updated weights for policy 0, policy_version 74390 (0.0006) [2023-03-06 18:17:27,574][23882] Updated weights for policy 0, policy_version 74400 (0.0006) [2023-03-06 18:17:28,376][23882] Updated weights for policy 0, policy_version 74410 (0.0006) [2023-03-06 18:17:29,153][23882] Updated weights for policy 0, policy_version 74420 (0.0006) [2023-03-06 18:17:29,942][23882] Updated weights for policy 0, policy_version 74430 (0.0006) [2023-03-06 18:17:30,741][23882] Updated weights for policy 0, policy_version 74440 (0.0006) [2023-03-06 18:17:31,518][23882] Updated weights for policy 0, policy_version 74450 (0.0006) [2023-03-06 18:17:31,748][23556] Fps is (10 sec: 12902.6, 60 sec: 13004.8, 300 sec: 13034.3). Total num frames: 76238848. Throughput: 0: 13012.4. Samples: 76211760. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:17:31,748][23556] Avg episode reward: [(0, '2007.091')] [2023-03-06 18:17:32,289][23882] Updated weights for policy 0, policy_version 74460 (0.0006) [2023-03-06 18:17:33,081][23882] Updated weights for policy 0, policy_version 74470 (0.0007) [2023-03-06 18:17:33,862][23882] Updated weights for policy 0, policy_version 74480 (0.0006) [2023-03-06 18:17:34,641][23882] Updated weights for policy 0, policy_version 74490 (0.0006) [2023-03-06 18:17:35,434][23882] Updated weights for policy 0, policy_version 74500 (0.0007) [2023-03-06 18:17:36,206][23882] Updated weights for policy 0, policy_version 74510 (0.0005) [2023-03-06 18:17:36,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13004.8, 300 sec: 13034.3). Total num frames: 76304384. Throughput: 0: 13011.9. Samples: 76289951. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:17:36,748][23556] Avg episode reward: [(0, '2001.837')] [2023-03-06 18:17:36,981][23882] Updated weights for policy 0, policy_version 74520 (0.0007) [2023-03-06 18:17:37,771][23882] Updated weights for policy 0, policy_version 74530 (0.0007) [2023-03-06 18:17:38,569][23882] Updated weights for policy 0, policy_version 74540 (0.0006) [2023-03-06 18:17:39,333][23882] Updated weights for policy 0, policy_version 74550 (0.0006) [2023-03-06 18:17:40,134][23882] Updated weights for policy 0, policy_version 74560 (0.0007) [2023-03-06 18:17:40,903][23882] Updated weights for policy 0, policy_version 74570 (0.0006) [2023-03-06 18:17:41,690][23882] Updated weights for policy 0, policy_version 74580 (0.0006) [2023-03-06 18:17:41,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 76369920. Throughput: 0: 13021.9. Samples: 76368538. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:17:41,748][23556] Avg episode reward: [(0, '2037.659')] [2023-03-06 18:17:42,479][23882] Updated weights for policy 0, policy_version 74590 (0.0006) [2023-03-06 18:17:43,266][23882] Updated weights for policy 0, policy_version 74600 (0.0006) [2023-03-06 18:17:44,055][23882] Updated weights for policy 0, policy_version 74610 (0.0006) [2023-03-06 18:17:44,850][23882] Updated weights for policy 0, policy_version 74620 (0.0007) [2023-03-06 18:17:45,630][23882] Updated weights for policy 0, policy_version 74630 (0.0006) [2023-03-06 18:17:46,409][23882] Updated weights for policy 0, policy_version 74640 (0.0006) [2023-03-06 18:17:46,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13021.9, 300 sec: 13037.8). Total num frames: 76435456. Throughput: 0: 13031.7. Samples: 76407784. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:17:46,748][23556] Avg episode reward: [(0, '2097.360')] [2023-03-06 18:17:47,210][23882] Updated weights for policy 0, policy_version 74650 (0.0007) [2023-03-06 18:17:47,981][23882] Updated weights for policy 0, policy_version 74660 (0.0006) [2023-03-06 18:17:48,786][23882] Updated weights for policy 0, policy_version 74670 (0.0006) [2023-03-06 18:17:49,557][23882] Updated weights for policy 0, policy_version 74680 (0.0007) [2023-03-06 18:17:50,347][23882] Updated weights for policy 0, policy_version 74690 (0.0006) [2023-03-06 18:17:51,127][23882] Updated weights for policy 0, policy_version 74700 (0.0007) [2023-03-06 18:17:51,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 76499968. Throughput: 0: 13021.7. Samples: 76485712. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:17:51,748][23556] Avg episode reward: [(0, '2034.823')] [2023-03-06 18:17:51,930][23882] Updated weights for policy 0, policy_version 74710 (0.0007) [2023-03-06 18:17:52,705][23882] Updated weights for policy 0, policy_version 74720 (0.0006) [2023-03-06 18:17:53,494][23882] Updated weights for policy 0, policy_version 74730 (0.0006) [2023-03-06 18:17:54,270][23882] Updated weights for policy 0, policy_version 74740 (0.0006) [2023-03-06 18:17:55,065][23882] Updated weights for policy 0, policy_version 74750 (0.0006) [2023-03-06 18:17:55,846][23882] Updated weights for policy 0, policy_version 74760 (0.0007) [2023-03-06 18:17:56,628][23882] Updated weights for policy 0, policy_version 74770 (0.0006) [2023-03-06 18:17:56,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 76565504. Throughput: 0: 13026.0. Samples: 76563929. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:17:56,748][23556] Avg episode reward: [(0, '1902.056')] [2023-03-06 18:17:56,752][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000074771_76565504.pth... [2023-03-06 18:17:56,784][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000071716_73437184.pth [2023-03-06 18:17:57,431][23882] Updated weights for policy 0, policy_version 74780 (0.0006) [2023-03-06 18:17:58,224][23882] Updated weights for policy 0, policy_version 74790 (0.0007) [2023-03-06 18:17:59,010][23882] Updated weights for policy 0, policy_version 74800 (0.0005) [2023-03-06 18:17:59,796][23882] Updated weights for policy 0, policy_version 74810 (0.0007) [2023-03-06 18:18:00,584][23882] Updated weights for policy 0, policy_version 74820 (0.0006) [2023-03-06 18:18:01,373][23882] Updated weights for policy 0, policy_version 74830 (0.0008) [2023-03-06 18:18:01,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 76630016. Throughput: 0: 13018.1. Samples: 76602645. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:18:01,748][23556] Avg episode reward: [(0, '1842.889')] [2023-03-06 18:18:02,170][23882] Updated weights for policy 0, policy_version 74840 (0.0006) [2023-03-06 18:18:02,962][23882] Updated weights for policy 0, policy_version 74850 (0.0006) [2023-03-06 18:18:03,737][23882] Updated weights for policy 0, policy_version 74860 (0.0006) [2023-03-06 18:18:04,525][23882] Updated weights for policy 0, policy_version 74870 (0.0006) [2023-03-06 18:18:05,306][23882] Updated weights for policy 0, policy_version 74880 (0.0007) [2023-03-06 18:18:06,097][23882] Updated weights for policy 0, policy_version 74890 (0.0006) [2023-03-06 18:18:06,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 76695552. Throughput: 0: 13016.2. Samples: 76680875. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:18:06,748][23556] Avg episode reward: [(0, '2069.269')] [2023-03-06 18:18:06,878][23882] Updated weights for policy 0, policy_version 74900 (0.0006) [2023-03-06 18:18:07,674][23882] Updated weights for policy 0, policy_version 74910 (0.0006) [2023-03-06 18:18:08,474][23882] Updated weights for policy 0, policy_version 74920 (0.0006) [2023-03-06 18:18:09,247][23882] Updated weights for policy 0, policy_version 74930 (0.0006) [2023-03-06 18:18:10,046][23882] Updated weights for policy 0, policy_version 74940 (0.0007) [2023-03-06 18:18:10,822][23882] Updated weights for policy 0, policy_version 74950 (0.0007) [2023-03-06 18:18:11,606][23882] Updated weights for policy 0, policy_version 74960 (0.0006) [2023-03-06 18:18:11,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13034.3). Total num frames: 76760064. Throughput: 0: 13023.6. Samples: 76758847. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:18:11,754][23556] Avg episode reward: [(0, '1827.123')] [2023-03-06 18:18:12,403][23882] Updated weights for policy 0, policy_version 74970 (0.0006) [2023-03-06 18:18:13,180][23882] Updated weights for policy 0, policy_version 74980 (0.0006) [2023-03-06 18:18:13,953][23882] Updated weights for policy 0, policy_version 74990 (0.0007) [2023-03-06 18:18:14,747][23882] Updated weights for policy 0, policy_version 75000 (0.0005) [2023-03-06 18:18:15,540][23882] Updated weights for policy 0, policy_version 75010 (0.0006) [2023-03-06 18:18:16,320][23882] Updated weights for policy 0, policy_version 75020 (0.0006) [2023-03-06 18:18:16,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 76825600. Throughput: 0: 13029.9. Samples: 76798104. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:18:16,759][23556] Avg episode reward: [(0, '1936.602')] [2023-03-06 18:18:17,095][23882] Updated weights for policy 0, policy_version 75030 (0.0006) [2023-03-06 18:18:17,879][23882] Updated weights for policy 0, policy_version 75040 (0.0007) [2023-03-06 18:18:18,644][23882] Updated weights for policy 0, policy_version 75050 (0.0006) [2023-03-06 18:18:19,442][23882] Updated weights for policy 0, policy_version 75060 (0.0006) [2023-03-06 18:18:20,226][23882] Updated weights for policy 0, policy_version 75070 (0.0006) [2023-03-06 18:18:21,021][23882] Updated weights for policy 0, policy_version 75080 (0.0008) [2023-03-06 18:18:21,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 76891136. Throughput: 0: 13026.5. Samples: 76876142. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:18:21,759][23556] Avg episode reward: [(0, '1722.275')] [2023-03-06 18:18:21,799][23882] Updated weights for policy 0, policy_version 75090 (0.0006) [2023-03-06 18:18:22,592][23882] Updated weights for policy 0, policy_version 75100 (0.0007) [2023-03-06 18:18:23,392][23882] Updated weights for policy 0, policy_version 75110 (0.0006) [2023-03-06 18:18:24,162][23882] Updated weights for policy 0, policy_version 75120 (0.0006) [2023-03-06 18:18:24,969][23882] Updated weights for policy 0, policy_version 75130 (0.0006) [2023-03-06 18:18:25,732][23882] Updated weights for policy 0, policy_version 75140 (0.0006) [2023-03-06 18:18:26,505][23882] Updated weights for policy 0, policy_version 75150 (0.0006) [2023-03-06 18:18:26,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13030.8). Total num frames: 76955648. Throughput: 0: 13023.9. Samples: 76954613. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:18:26,748][23556] Avg episode reward: [(0, '1391.894')] [2023-03-06 18:18:27,301][23882] Updated weights for policy 0, policy_version 75160 (0.0006) [2023-03-06 18:18:28,093][23882] Updated weights for policy 0, policy_version 75170 (0.0006) [2023-03-06 18:18:28,873][23882] Updated weights for policy 0, policy_version 75180 (0.0006) [2023-03-06 18:18:29,654][23882] Updated weights for policy 0, policy_version 75190 (0.0006) [2023-03-06 18:18:30,463][23882] Updated weights for policy 0, policy_version 75200 (0.0006) [2023-03-06 18:18:31,242][23882] Updated weights for policy 0, policy_version 75210 (0.0006) [2023-03-06 18:18:31,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 77021184. Throughput: 0: 13022.3. Samples: 76993788. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:18:31,748][23556] Avg episode reward: [(0, '1712.032')] [2023-03-06 18:18:32,037][23882] Updated weights for policy 0, policy_version 75220 (0.0005) [2023-03-06 18:18:32,814][23882] Updated weights for policy 0, policy_version 75230 (0.0006) [2023-03-06 18:18:33,594][23882] Updated weights for policy 0, policy_version 75240 (0.0006) [2023-03-06 18:18:34,388][23882] Updated weights for policy 0, policy_version 75250 (0.0007) [2023-03-06 18:18:35,169][23882] Updated weights for policy 0, policy_version 75260 (0.0006) [2023-03-06 18:18:35,956][23882] Updated weights for policy 0, policy_version 75270 (0.0007) [2023-03-06 18:18:36,742][23882] Updated weights for policy 0, policy_version 75280 (0.0006) [2023-03-06 18:18:36,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 77086720. Throughput: 0: 13021.8. Samples: 77071694. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:18:36,748][23556] Avg episode reward: [(0, '2020.852')] [2023-03-06 18:18:37,535][23882] Updated weights for policy 0, policy_version 75290 (0.0006) [2023-03-06 18:18:38,312][23882] Updated weights for policy 0, policy_version 75300 (0.0007) [2023-03-06 18:18:39,096][23882] Updated weights for policy 0, policy_version 75310 (0.0007) [2023-03-06 18:18:39,881][23882] Updated weights for policy 0, policy_version 75320 (0.0006) [2023-03-06 18:18:40,661][23882] Updated weights for policy 0, policy_version 75330 (0.0007) [2023-03-06 18:18:41,450][23882] Updated weights for policy 0, policy_version 75340 (0.0006) [2023-03-06 18:18:41,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 77151232. Throughput: 0: 13026.3. Samples: 77150109. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:18:41,748][23556] Avg episode reward: [(0, '2105.390')] [2023-03-06 18:18:41,749][23831] Saving new best policy, reward=2105.390! [2023-03-06 18:18:42,238][23882] Updated weights for policy 0, policy_version 75350 (0.0006) [2023-03-06 18:18:43,019][23882] Updated weights for policy 0, policy_version 75360 (0.0006) [2023-03-06 18:18:43,805][23882] Updated weights for policy 0, policy_version 75370 (0.0006) [2023-03-06 18:18:44,596][23882] Updated weights for policy 0, policy_version 75380 (0.0006) [2023-03-06 18:18:45,370][23882] Updated weights for policy 0, policy_version 75390 (0.0006) [2023-03-06 18:18:46,151][23882] Updated weights for policy 0, policy_version 75400 (0.0006) [2023-03-06 18:18:46,748][23556] Fps is (10 sec: 13004.6, 60 sec: 13021.8, 300 sec: 13034.3). Total num frames: 77216768. Throughput: 0: 13031.2. Samples: 77189049. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:18:46,748][23556] Avg episode reward: [(0, '1971.264')] [2023-03-06 18:18:46,941][23882] Updated weights for policy 0, policy_version 75410 (0.0006) [2023-03-06 18:18:47,711][23882] Updated weights for policy 0, policy_version 75420 (0.0006) [2023-03-06 18:18:48,489][23882] Updated weights for policy 0, policy_version 75430 (0.0006) [2023-03-06 18:18:49,298][23882] Updated weights for policy 0, policy_version 75440 (0.0006) [2023-03-06 18:18:50,068][23882] Updated weights for policy 0, policy_version 75450 (0.0007) [2023-03-06 18:18:50,861][23882] Updated weights for policy 0, policy_version 75460 (0.0006) [2023-03-06 18:18:51,658][23882] Updated weights for policy 0, policy_version 75470 (0.0006) [2023-03-06 18:18:51,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 77282304. Throughput: 0: 13037.9. Samples: 77267579. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:18:51,748][23556] Avg episode reward: [(0, '1997.637')] [2023-03-06 18:18:52,442][23882] Updated weights for policy 0, policy_version 75480 (0.0006) [2023-03-06 18:18:53,253][23882] Updated weights for policy 0, policy_version 75490 (0.0007) [2023-03-06 18:18:54,030][23882] Updated weights for policy 0, policy_version 75500 (0.0007) [2023-03-06 18:18:54,801][23882] Updated weights for policy 0, policy_version 75510 (0.0007) [2023-03-06 18:18:55,598][23882] Updated weights for policy 0, policy_version 75520 (0.0007) [2023-03-06 18:18:56,386][23882] Updated weights for policy 0, policy_version 75530 (0.0006) [2023-03-06 18:18:56,748][23556] Fps is (10 sec: 13005.0, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 77346816. Throughput: 0: 13034.8. Samples: 77345411. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:18:56,748][23556] Avg episode reward: [(0, '1976.308')] [2023-03-06 18:18:57,161][23882] Updated weights for policy 0, policy_version 75540 (0.0007) [2023-03-06 18:18:57,937][23882] Updated weights for policy 0, policy_version 75550 (0.0006) [2023-03-06 18:18:58,708][23882] Updated weights for policy 0, policy_version 75560 (0.0007) [2023-03-06 18:18:59,525][23882] Updated weights for policy 0, policy_version 75570 (0.0006) [2023-03-06 18:19:00,293][23882] Updated weights for policy 0, policy_version 75580 (0.0007) [2023-03-06 18:19:01,086][23882] Updated weights for policy 0, policy_version 75590 (0.0006) [2023-03-06 18:19:01,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 77412352. Throughput: 0: 13034.8. Samples: 77384670. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:19:01,748][23556] Avg episode reward: [(0, '1999.436')] [2023-03-06 18:19:01,879][23882] Updated weights for policy 0, policy_version 75600 (0.0006) [2023-03-06 18:19:02,669][23882] Updated weights for policy 0, policy_version 75610 (0.0006) [2023-03-06 18:19:03,442][23882] Updated weights for policy 0, policy_version 75620 (0.0006) [2023-03-06 18:19:04,217][23882] Updated weights for policy 0, policy_version 75630 (0.0006) [2023-03-06 18:19:04,997][23882] Updated weights for policy 0, policy_version 75640 (0.0006) [2023-03-06 18:19:05,806][23882] Updated weights for policy 0, policy_version 75650 (0.0006) [2023-03-06 18:19:06,572][23882] Updated weights for policy 0, policy_version 75660 (0.0006) [2023-03-06 18:19:06,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 77477888. Throughput: 0: 13041.6. Samples: 77463016. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:19:06,748][23556] Avg episode reward: [(0, '1995.098')] [2023-03-06 18:19:07,360][23882] Updated weights for policy 0, policy_version 75670 (0.0006) [2023-03-06 18:19:08,144][23882] Updated weights for policy 0, policy_version 75680 (0.0005) [2023-03-06 18:19:08,942][23882] Updated weights for policy 0, policy_version 75690 (0.0007) [2023-03-06 18:19:09,721][23882] Updated weights for policy 0, policy_version 75700 (0.0006) [2023-03-06 18:19:10,498][23882] Updated weights for policy 0, policy_version 75710 (0.0007) [2023-03-06 18:19:11,285][23882] Updated weights for policy 0, policy_version 75720 (0.0006) [2023-03-06 18:19:11,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13039.0, 300 sec: 13034.3). Total num frames: 77542400. Throughput: 0: 13035.5. Samples: 77541210. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:19:11,748][23556] Avg episode reward: [(0, '2155.292')] [2023-03-06 18:19:11,750][23831] Saving new best policy, reward=2155.292! [2023-03-06 18:19:12,057][23882] Updated weights for policy 0, policy_version 75730 (0.0008) [2023-03-06 18:19:12,863][23882] Updated weights for policy 0, policy_version 75740 (0.0006) [2023-03-06 18:19:13,644][23882] Updated weights for policy 0, policy_version 75750 (0.0006) [2023-03-06 18:19:14,421][23882] Updated weights for policy 0, policy_version 75760 (0.0007) [2023-03-06 18:19:15,221][23882] Updated weights for policy 0, policy_version 75770 (0.0006) [2023-03-06 18:19:16,006][23882] Updated weights for policy 0, policy_version 75780 (0.0006) [2023-03-06 18:19:16,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 77607936. Throughput: 0: 13036.1. Samples: 77580412. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:19:16,748][23556] Avg episode reward: [(0, '2019.776')] [2023-03-06 18:19:16,782][23882] Updated weights for policy 0, policy_version 75790 (0.0006) [2023-03-06 18:19:17,577][23882] Updated weights for policy 0, policy_version 75800 (0.0007) [2023-03-06 18:19:18,368][23882] Updated weights for policy 0, policy_version 75810 (0.0007) [2023-03-06 18:19:19,138][23882] Updated weights for policy 0, policy_version 75820 (0.0006) [2023-03-06 18:19:19,914][23882] Updated weights for policy 0, policy_version 75830 (0.0006) [2023-03-06 18:19:20,688][23882] Updated weights for policy 0, policy_version 75840 (0.0006) [2023-03-06 18:19:21,474][23882] Updated weights for policy 0, policy_version 75850 (0.0006) [2023-03-06 18:19:21,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 77673472. Throughput: 0: 13045.1. Samples: 77658725. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:19:21,748][23556] Avg episode reward: [(0, '1956.390')] [2023-03-06 18:19:22,263][23882] Updated weights for policy 0, policy_version 75860 (0.0006) [2023-03-06 18:19:23,066][23882] Updated weights for policy 0, policy_version 75870 (0.0006) [2023-03-06 18:19:23,839][23882] Updated weights for policy 0, policy_version 75880 (0.0006) [2023-03-06 18:19:24,627][23882] Updated weights for policy 0, policy_version 75890 (0.0006) [2023-03-06 18:19:25,433][23882] Updated weights for policy 0, policy_version 75900 (0.0007) [2023-03-06 18:19:26,213][23882] Updated weights for policy 0, policy_version 75910 (0.0006) [2023-03-06 18:19:26,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 77737984. Throughput: 0: 13037.1. Samples: 77736777. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:19:26,748][23556] Avg episode reward: [(0, '1998.781')] [2023-03-06 18:19:26,990][23882] Updated weights for policy 0, policy_version 75920 (0.0007) [2023-03-06 18:19:27,764][23882] Updated weights for policy 0, policy_version 75930 (0.0006) [2023-03-06 18:19:28,550][23882] Updated weights for policy 0, policy_version 75940 (0.0007) [2023-03-06 18:19:29,317][23882] Updated weights for policy 0, policy_version 75950 (0.0005) [2023-03-06 18:19:30,108][23882] Updated weights for policy 0, policy_version 75960 (0.0007) [2023-03-06 18:19:30,879][23882] Updated weights for policy 0, policy_version 75970 (0.0007) [2023-03-06 18:19:31,699][23882] Updated weights for policy 0, policy_version 75980 (0.0007) [2023-03-06 18:19:31,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 77803520. Throughput: 0: 13044.4. Samples: 77776045. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:19:31,748][23556] Avg episode reward: [(0, '1913.295')] [2023-03-06 18:19:32,501][23882] Updated weights for policy 0, policy_version 75990 (0.0007) [2023-03-06 18:19:33,287][23882] Updated weights for policy 0, policy_version 76000 (0.0006) [2023-03-06 18:19:34,075][23882] Updated weights for policy 0, policy_version 76010 (0.0006) [2023-03-06 18:19:34,853][23882] Updated weights for policy 0, policy_version 76020 (0.0006) [2023-03-06 18:19:35,653][23882] Updated weights for policy 0, policy_version 76030 (0.0006) [2023-03-06 18:19:36,426][23882] Updated weights for policy 0, policy_version 76040 (0.0007) [2023-03-06 18:19:36,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 77869056. Throughput: 0: 13031.6. Samples: 77854002. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:19:36,748][23556] Avg episode reward: [(0, '1957.105')] [2023-03-06 18:19:37,215][23882] Updated weights for policy 0, policy_version 76050 (0.0006) [2023-03-06 18:19:38,008][23882] Updated weights for policy 0, policy_version 76060 (0.0006) [2023-03-06 18:19:38,806][23882] Updated weights for policy 0, policy_version 76070 (0.0007) [2023-03-06 18:19:39,583][23882] Updated weights for policy 0, policy_version 76080 (0.0007) [2023-03-06 18:19:40,361][23882] Updated weights for policy 0, policy_version 76090 (0.0006) [2023-03-06 18:19:41,149][23882] Updated weights for policy 0, policy_version 76100 (0.0007) [2023-03-06 18:19:41,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 77933568. Throughput: 0: 13040.8. Samples: 77932248. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:19:41,748][23556] Avg episode reward: [(0, '1901.775')] [2023-03-06 18:19:41,917][23882] Updated weights for policy 0, policy_version 76110 (0.0007) [2023-03-06 18:19:42,715][23882] Updated weights for policy 0, policy_version 76120 (0.0006) [2023-03-06 18:19:43,499][23882] Updated weights for policy 0, policy_version 76130 (0.0006) [2023-03-06 18:19:44,285][23882] Updated weights for policy 0, policy_version 76140 (0.0006) [2023-03-06 18:19:45,069][23882] Updated weights for policy 0, policy_version 76150 (0.0006) [2023-03-06 18:19:45,846][23882] Updated weights for policy 0, policy_version 76160 (0.0007) [2023-03-06 18:19:46,633][23882] Updated weights for policy 0, policy_version 76170 (0.0008) [2023-03-06 18:19:46,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 77999104. Throughput: 0: 13038.6. Samples: 77971409. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:19:46,748][23556] Avg episode reward: [(0, '2074.785')] [2023-03-06 18:19:47,413][23882] Updated weights for policy 0, policy_version 76180 (0.0006) [2023-03-06 18:19:48,189][23882] Updated weights for policy 0, policy_version 76190 (0.0006) [2023-03-06 18:19:48,973][23882] Updated weights for policy 0, policy_version 76200 (0.0007) [2023-03-06 18:19:49,758][23882] Updated weights for policy 0, policy_version 76210 (0.0006) [2023-03-06 18:19:50,542][23882] Updated weights for policy 0, policy_version 76220 (0.0005) [2023-03-06 18:19:51,324][23882] Updated weights for policy 0, policy_version 76230 (0.0006) [2023-03-06 18:19:51,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 78064640. Throughput: 0: 13039.8. Samples: 78049809. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:19:51,748][23556] Avg episode reward: [(0, '1978.822')] [2023-03-06 18:19:52,114][23882] Updated weights for policy 0, policy_version 76240 (0.0006) [2023-03-06 18:19:52,887][23882] Updated weights for policy 0, policy_version 76250 (0.0006) [2023-03-06 18:19:53,664][23882] Updated weights for policy 0, policy_version 76260 (0.0007) [2023-03-06 18:19:54,437][23882] Updated weights for policy 0, policy_version 76270 (0.0005) [2023-03-06 18:19:55,221][23882] Updated weights for policy 0, policy_version 76280 (0.0008) [2023-03-06 18:19:56,000][23882] Updated weights for policy 0, policy_version 76290 (0.0007) [2023-03-06 18:19:56,748][23556] Fps is (10 sec: 13107.0, 60 sec: 13055.9, 300 sec: 13037.8). Total num frames: 78130176. Throughput: 0: 13049.7. Samples: 78128449. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:19:56,749][23556] Avg episode reward: [(0, '2002.903')] [2023-03-06 18:19:56,754][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000076299_78130176.pth... [2023-03-06 18:19:56,783][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000073245_75002880.pth [2023-03-06 18:19:56,807][23882] Updated weights for policy 0, policy_version 76300 (0.0006) [2023-03-06 18:19:57,569][23882] Updated weights for policy 0, policy_version 76310 (0.0006) [2023-03-06 18:19:58,361][23882] Updated weights for policy 0, policy_version 76320 (0.0008) [2023-03-06 18:19:59,144][23882] Updated weights for policy 0, policy_version 76330 (0.0006) [2023-03-06 18:19:59,822][23831] KL-divergence is very high: 3736.0381 [2023-03-06 18:19:59,918][23882] Updated weights for policy 0, policy_version 76340 (0.0006) [2023-03-06 18:20:00,301][23831] KL-divergence is very high: 321.6665 [2023-03-06 18:20:00,714][23882] Updated weights for policy 0, policy_version 76350 (0.0007) [2023-03-06 18:20:01,494][23882] Updated weights for policy 0, policy_version 76360 (0.0006) [2023-03-06 18:20:01,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 78195712. Throughput: 0: 13055.5. Samples: 78167907. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:20:01,759][23556] Avg episode reward: [(0, '1962.799')] [2023-03-06 18:20:02,279][23882] Updated weights for policy 0, policy_version 76370 (0.0007) [2023-03-06 18:20:03,062][23882] Updated weights for policy 0, policy_version 76380 (0.0006) [2023-03-06 18:20:03,837][23882] Updated weights for policy 0, policy_version 76390 (0.0007) [2023-03-06 18:20:04,630][23882] Updated weights for policy 0, policy_version 76400 (0.0006) [2023-03-06 18:20:05,410][23882] Updated weights for policy 0, policy_version 76410 (0.0007) [2023-03-06 18:20:06,200][23882] Updated weights for policy 0, policy_version 76420 (0.0006) [2023-03-06 18:20:06,748][23556] Fps is (10 sec: 13107.5, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 78261248. Throughput: 0: 13052.2. Samples: 78246073. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:20:06,759][23556] Avg episode reward: [(0, '2086.373')] [2023-03-06 18:20:06,989][23882] Updated weights for policy 0, policy_version 76430 (0.0007) [2023-03-06 18:20:07,781][23882] Updated weights for policy 0, policy_version 76440 (0.0007) [2023-03-06 18:20:08,560][23882] Updated weights for policy 0, policy_version 76450 (0.0005) [2023-03-06 18:20:09,323][23882] Updated weights for policy 0, policy_version 76460 (0.0006) [2023-03-06 18:20:10,116][23882] Updated weights for policy 0, policy_version 76470 (0.0007) [2023-03-06 18:20:10,895][23882] Updated weights for policy 0, policy_version 76480 (0.0007) [2023-03-06 18:20:11,678][23882] Updated weights for policy 0, policy_version 76490 (0.0006) [2023-03-06 18:20:11,748][23556] Fps is (10 sec: 13004.6, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 78325760. Throughput: 0: 13064.3. Samples: 78324673. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:20:11,754][23556] Avg episode reward: [(0, '2031.718')] [2023-03-06 18:20:12,473][23882] Updated weights for policy 0, policy_version 76500 (0.0007) [2023-03-06 18:20:12,527][23831] KL-divergence is very high: 194.6199 [2023-03-06 18:20:13,240][23882] Updated weights for policy 0, policy_version 76510 (0.0006) [2023-03-06 18:20:14,037][23882] Updated weights for policy 0, policy_version 76520 (0.0007) [2023-03-06 18:20:14,829][23882] Updated weights for policy 0, policy_version 76530 (0.0006) [2023-03-06 18:20:15,614][23882] Updated weights for policy 0, policy_version 76540 (0.0007) [2023-03-06 18:20:16,396][23882] Updated weights for policy 0, policy_version 76550 (0.0007) [2023-03-06 18:20:16,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 78391296. Throughput: 0: 13061.4. Samples: 78363809. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:20:16,759][23556] Avg episode reward: [(0, '1737.403')] [2023-03-06 18:20:17,187][23882] Updated weights for policy 0, policy_version 76560 (0.0006) [2023-03-06 18:20:17,969][23882] Updated weights for policy 0, policy_version 76570 (0.0008) [2023-03-06 18:20:18,749][23882] Updated weights for policy 0, policy_version 76580 (0.0006) [2023-03-06 18:20:19,544][23882] Updated weights for policy 0, policy_version 76590 (0.0006) [2023-03-06 18:20:20,325][23882] Updated weights for policy 0, policy_version 76600 (0.0007) [2023-03-06 18:20:21,107][23882] Updated weights for policy 0, policy_version 76610 (0.0006) [2023-03-06 18:20:21,748][23556] Fps is (10 sec: 13107.5, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 78456832. Throughput: 0: 13062.5. Samples: 78441815. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:20:21,759][23556] Avg episode reward: [(0, '1931.717')] [2023-03-06 18:20:21,892][23882] Updated weights for policy 0, policy_version 76620 (0.0007) [2023-03-06 18:20:22,681][23882] Updated weights for policy 0, policy_version 76630 (0.0007) [2023-03-06 18:20:23,470][23882] Updated weights for policy 0, policy_version 76640 (0.0007) [2023-03-06 18:20:24,246][23882] Updated weights for policy 0, policy_version 76650 (0.0006) [2023-03-06 18:20:25,044][23882] Updated weights for policy 0, policy_version 76660 (0.0006) [2023-03-06 18:20:25,831][23882] Updated weights for policy 0, policy_version 76670 (0.0006) [2023-03-06 18:20:26,602][23882] Updated weights for policy 0, policy_version 76680 (0.0006) [2023-03-06 18:20:26,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13034.3). Total num frames: 78521344. Throughput: 0: 13064.0. Samples: 78520127. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:20:26,756][23556] Avg episode reward: [(0, '1767.280')] [2023-03-06 18:20:27,395][23882] Updated weights for policy 0, policy_version 76690 (0.0006) [2023-03-06 18:20:28,079][23831] KL-divergence is very high: 116.4594 [2023-03-06 18:20:28,181][23882] Updated weights for policy 0, policy_version 76700 (0.0007) [2023-03-06 18:20:28,959][23882] Updated weights for policy 0, policy_version 76710 (0.0006) [2023-03-06 18:20:29,749][23882] Updated weights for policy 0, policy_version 76720 (0.0007) [2023-03-06 18:20:30,517][23882] Updated weights for policy 0, policy_version 76730 (0.0006) [2023-03-06 18:20:31,306][23882] Updated weights for policy 0, policy_version 76740 (0.0006) [2023-03-06 18:20:31,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 78586880. Throughput: 0: 13065.5. Samples: 78559355. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:20:31,759][23556] Avg episode reward: [(0, '1433.238')] [2023-03-06 18:20:32,095][23882] Updated weights for policy 0, policy_version 76750 (0.0007) [2023-03-06 18:20:32,865][23882] Updated weights for policy 0, policy_version 76760 (0.0007) [2023-03-06 18:20:33,674][23882] Updated weights for policy 0, policy_version 76770 (0.0006) [2023-03-06 18:20:34,466][23882] Updated weights for policy 0, policy_version 76780 (0.0006) [2023-03-06 18:20:35,253][23882] Updated weights for policy 0, policy_version 76790 (0.0006) [2023-03-06 18:20:36,030][23882] Updated weights for policy 0, policy_version 76800 (0.0006) [2023-03-06 18:20:36,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 78652416. Throughput: 0: 13061.3. Samples: 78637568. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:20:36,759][23556] Avg episode reward: [(0, '1391.681')] [2023-03-06 18:20:36,821][23882] Updated weights for policy 0, policy_version 76810 (0.0007) [2023-03-06 18:20:37,600][23882] Updated weights for policy 0, policy_version 76820 (0.0006) [2023-03-06 18:20:38,383][23882] Updated weights for policy 0, policy_version 76830 (0.0006) [2023-03-06 18:20:39,159][23882] Updated weights for policy 0, policy_version 76840 (0.0006) [2023-03-06 18:20:39,936][23882] Updated weights for policy 0, policy_version 76850 (0.0006) [2023-03-06 18:20:40,325][23831] KL-divergence is very high: 130.8752 [2023-03-06 18:20:40,704][23882] Updated weights for policy 0, policy_version 76860 (0.0007) [2023-03-06 18:20:41,478][23882] Updated weights for policy 0, policy_version 76870 (0.0006) [2023-03-06 18:20:41,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13037.8). Total num frames: 78717952. Throughput: 0: 13058.5. Samples: 78716080. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:20:41,754][23556] Avg episode reward: [(0, '1404.371')] [2023-03-06 18:20:42,285][23882] Updated weights for policy 0, policy_version 76880 (0.0006) [2023-03-06 18:20:43,058][23882] Updated weights for policy 0, policy_version 76890 (0.0006) [2023-03-06 18:20:43,845][23882] Updated weights for policy 0, policy_version 76900 (0.0007) [2023-03-06 18:20:44,628][23882] Updated weights for policy 0, policy_version 76910 (0.0007) [2023-03-06 18:20:45,403][23882] Updated weights for policy 0, policy_version 76920 (0.0006) [2023-03-06 18:20:46,180][23882] Updated weights for policy 0, policy_version 76930 (0.0006) [2023-03-06 18:20:46,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13034.3). Total num frames: 78782464. Throughput: 0: 13052.7. Samples: 78755280. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:20:46,759][23556] Avg episode reward: [(0, '1574.235')] [2023-03-06 18:20:46,970][23882] Updated weights for policy 0, policy_version 76940 (0.0007) [2023-03-06 18:20:47,762][23882] Updated weights for policy 0, policy_version 76950 (0.0006) [2023-03-06 18:20:48,570][23882] Updated weights for policy 0, policy_version 76960 (0.0006) [2023-03-06 18:20:49,345][23882] Updated weights for policy 0, policy_version 76970 (0.0006) [2023-03-06 18:20:50,146][23882] Updated weights for policy 0, policy_version 76980 (0.0006) [2023-03-06 18:20:50,930][23882] Updated weights for policy 0, policy_version 76990 (0.0006) [2023-03-06 18:20:51,705][23882] Updated weights for policy 0, policy_version 77000 (0.0007) [2023-03-06 18:20:51,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13034.3). Total num frames: 78848000. Throughput: 0: 13053.2. Samples: 78833468. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:20:51,748][23556] Avg episode reward: [(0, '1891.357')] [2023-03-06 18:20:51,932][23831] KL-divergence is very high: 325.2028 [2023-03-06 18:20:52,478][23882] Updated weights for policy 0, policy_version 77010 (0.0006) [2023-03-06 18:20:53,239][23882] Updated weights for policy 0, policy_version 77020 (0.0007) [2023-03-06 18:20:54,041][23882] Updated weights for policy 0, policy_version 77030 (0.0006) [2023-03-06 18:20:54,801][23882] Updated weights for policy 0, policy_version 77040 (0.0007) [2023-03-06 18:20:55,594][23882] Updated weights for policy 0, policy_version 77050 (0.0006) [2023-03-06 18:20:56,374][23882] Updated weights for policy 0, policy_version 77060 (0.0006) [2023-03-06 18:20:56,748][23556] Fps is (10 sec: 13107.0, 60 sec: 13056.0, 300 sec: 13034.3). Total num frames: 78913536. Throughput: 0: 13056.6. Samples: 78912220. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:20:56,748][23556] Avg episode reward: [(0, '1816.913')] [2023-03-06 18:20:57,174][23882] Updated weights for policy 0, policy_version 77070 (0.0006) [2023-03-06 18:20:57,965][23882] Updated weights for policy 0, policy_version 77080 (0.0007) [2023-03-06 18:20:58,751][23882] Updated weights for policy 0, policy_version 77090 (0.0007) [2023-03-06 18:20:59,531][23882] Updated weights for policy 0, policy_version 77100 (0.0006) [2023-03-06 18:21:00,328][23882] Updated weights for policy 0, policy_version 77110 (0.0006) [2023-03-06 18:21:01,115][23882] Updated weights for policy 0, policy_version 77120 (0.0006) [2023-03-06 18:21:01,748][23556] Fps is (10 sec: 13004.6, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 78978048. Throughput: 0: 13051.9. Samples: 78951145. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:21:01,749][23556] Avg episode reward: [(0, '1231.632')] [2023-03-06 18:21:01,908][23882] Updated weights for policy 0, policy_version 77130 (0.0007) [2023-03-06 18:21:02,681][23882] Updated weights for policy 0, policy_version 77140 (0.0006) [2023-03-06 18:21:03,470][23882] Updated weights for policy 0, policy_version 77150 (0.0007) [2023-03-06 18:21:04,243][23882] Updated weights for policy 0, policy_version 77160 (0.0006) [2023-03-06 18:21:05,026][23882] Updated weights for policy 0, policy_version 77170 (0.0007) [2023-03-06 18:21:05,814][23882] Updated weights for policy 0, policy_version 77180 (0.0006) [2023-03-06 18:21:06,608][23882] Updated weights for policy 0, policy_version 77190 (0.0006) [2023-03-06 18:21:06,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 79043584. Throughput: 0: 13058.4. Samples: 79029445. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:21:06,748][23556] Avg episode reward: [(0, '1411.854')] [2023-03-06 18:21:07,384][23882] Updated weights for policy 0, policy_version 77200 (0.0006) [2023-03-06 18:21:08,170][23882] Updated weights for policy 0, policy_version 77210 (0.0007) [2023-03-06 18:21:08,968][23882] Updated weights for policy 0, policy_version 77220 (0.0006) [2023-03-06 18:21:09,753][23882] Updated weights for policy 0, policy_version 77230 (0.0007) [2023-03-06 18:21:10,545][23882] Updated weights for policy 0, policy_version 77240 (0.0006) [2023-03-06 18:21:11,347][23882] Updated weights for policy 0, policy_version 77250 (0.0006) [2023-03-06 18:21:11,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13034.3). Total num frames: 79109120. Throughput: 0: 13046.0. Samples: 79107196. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 18:21:11,748][23556] Avg episode reward: [(0, '1072.915')] [2023-03-06 18:21:12,125][23882] Updated weights for policy 0, policy_version 77260 (0.0006) [2023-03-06 18:21:12,917][23882] Updated weights for policy 0, policy_version 77270 (0.0006) [2023-03-06 18:21:13,705][23882] Updated weights for policy 0, policy_version 77280 (0.0006) [2023-03-06 18:21:14,496][23882] Updated weights for policy 0, policy_version 77290 (0.0007) [2023-03-06 18:21:15,279][23882] Updated weights for policy 0, policy_version 77300 (0.0006) [2023-03-06 18:21:16,065][23882] Updated weights for policy 0, policy_version 77310 (0.0006) [2023-03-06 18:21:16,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13039.0, 300 sec: 13034.3). Total num frames: 79173632. Throughput: 0: 13041.5. Samples: 79146220. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 18:21:16,748][23556] Avg episode reward: [(0, '1155.575')] [2023-03-06 18:21:16,854][23882] Updated weights for policy 0, policy_version 77320 (0.0007) [2023-03-06 18:21:17,639][23882] Updated weights for policy 0, policy_version 77330 (0.0005) [2023-03-06 18:21:18,406][23882] Updated weights for policy 0, policy_version 77340 (0.0006) [2023-03-06 18:21:19,195][23882] Updated weights for policy 0, policy_version 77350 (0.0007) [2023-03-06 18:21:19,983][23882] Updated weights for policy 0, policy_version 77360 (0.0006) [2023-03-06 18:21:20,785][23882] Updated weights for policy 0, policy_version 77370 (0.0007) [2023-03-06 18:21:21,574][23882] Updated weights for policy 0, policy_version 77380 (0.0006) [2023-03-06 18:21:21,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 79239168. Throughput: 0: 13039.5. Samples: 79224347. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 18:21:21,748][23556] Avg episode reward: [(0, '1562.172')] [2023-03-06 18:21:22,345][23882] Updated weights for policy 0, policy_version 77390 (0.0007) [2023-03-06 18:21:23,141][23882] Updated weights for policy 0, policy_version 77400 (0.0006) [2023-03-06 18:21:23,930][23882] Updated weights for policy 0, policy_version 77410 (0.0006) [2023-03-06 18:21:24,702][23882] Updated weights for policy 0, policy_version 77420 (0.0008) [2023-03-06 18:21:25,481][23882] Updated weights for policy 0, policy_version 77430 (0.0006) [2023-03-06 18:21:26,282][23882] Updated weights for policy 0, policy_version 77440 (0.0006) [2023-03-06 18:21:26,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 79303680. Throughput: 0: 13036.0. Samples: 79302701. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 18:21:26,748][23556] Avg episode reward: [(0, '1810.491')] [2023-03-06 18:21:27,073][23882] Updated weights for policy 0, policy_version 77450 (0.0006) [2023-03-06 18:21:27,856][23882] Updated weights for policy 0, policy_version 77460 (0.0006) [2023-03-06 18:21:28,640][23882] Updated weights for policy 0, policy_version 77470 (0.0007) [2023-03-06 18:21:29,425][23882] Updated weights for policy 0, policy_version 77480 (0.0006) [2023-03-06 18:21:30,202][23882] Updated weights for policy 0, policy_version 77490 (0.0006) [2023-03-06 18:21:30,978][23882] Updated weights for policy 0, policy_version 77500 (0.0006) [2023-03-06 18:21:31,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 79369216. Throughput: 0: 13031.4. Samples: 79341695. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 18:21:31,748][23556] Avg episode reward: [(0, '1810.731')] [2023-03-06 18:21:31,756][23882] Updated weights for policy 0, policy_version 77510 (0.0006) [2023-03-06 18:21:32,526][23882] Updated weights for policy 0, policy_version 77520 (0.0006) [2023-03-06 18:21:33,311][23882] Updated weights for policy 0, policy_version 77530 (0.0007) [2023-03-06 18:21:34,099][23882] Updated weights for policy 0, policy_version 77540 (0.0007) [2023-03-06 18:21:34,884][23882] Updated weights for policy 0, policy_version 77550 (0.0006) [2023-03-06 18:21:35,668][23882] Updated weights for policy 0, policy_version 77560 (0.0006) [2023-03-06 18:21:36,444][23882] Updated weights for policy 0, policy_version 77570 (0.0006) [2023-03-06 18:21:36,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 79434752. Throughput: 0: 13044.4. Samples: 79420465. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 18:21:36,748][23556] Avg episode reward: [(0, '1873.357')] [2023-03-06 18:21:37,227][23882] Updated weights for policy 0, policy_version 77580 (0.0007) [2023-03-06 18:21:38,016][23882] Updated weights for policy 0, policy_version 77590 (0.0007) [2023-03-06 18:21:38,790][23882] Updated weights for policy 0, policy_version 77600 (0.0007) [2023-03-06 18:21:39,571][23882] Updated weights for policy 0, policy_version 77610 (0.0006) [2023-03-06 18:21:40,362][23882] Updated weights for policy 0, policy_version 77620 (0.0006) [2023-03-06 18:21:41,153][23882] Updated weights for policy 0, policy_version 77630 (0.0006) [2023-03-06 18:21:41,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 79500288. Throughput: 0: 13030.3. Samples: 79498583. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 18:21:41,748][23556] Avg episode reward: [(0, '1796.788')] [2023-03-06 18:21:41,954][23882] Updated weights for policy 0, policy_version 77640 (0.0006) [2023-03-06 18:21:42,741][23882] Updated weights for policy 0, policy_version 77650 (0.0007) [2023-03-06 18:21:43,518][23882] Updated weights for policy 0, policy_version 77660 (0.0006) [2023-03-06 18:21:44,309][23882] Updated weights for policy 0, policy_version 77670 (0.0006) [2023-03-06 18:21:45,101][23882] Updated weights for policy 0, policy_version 77680 (0.0006) [2023-03-06 18:21:45,887][23882] Updated weights for policy 0, policy_version 77690 (0.0007) [2023-03-06 18:21:46,682][23882] Updated weights for policy 0, policy_version 77700 (0.0007) [2023-03-06 18:21:46,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 79564800. Throughput: 0: 13033.8. Samples: 79537662. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 18:21:46,748][23556] Avg episode reward: [(0, '1925.974')] [2023-03-06 18:21:47,470][23882] Updated weights for policy 0, policy_version 77710 (0.0006) [2023-03-06 18:21:48,252][23882] Updated weights for policy 0, policy_version 77720 (0.0008) [2023-03-06 18:21:49,039][23882] Updated weights for policy 0, policy_version 77730 (0.0006) [2023-03-06 18:21:49,820][23882] Updated weights for policy 0, policy_version 77740 (0.0006) [2023-03-06 18:21:50,589][23882] Updated weights for policy 0, policy_version 77750 (0.0006) [2023-03-06 18:21:51,377][23882] Updated weights for policy 0, policy_version 77760 (0.0005) [2023-03-06 18:21:51,748][23556] Fps is (10 sec: 13004.6, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 79630336. Throughput: 0: 13034.3. Samples: 79615989. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 18:21:51,748][23556] Avg episode reward: [(0, '1904.894')] [2023-03-06 18:21:52,161][23882] Updated weights for policy 0, policy_version 77770 (0.0006) [2023-03-06 18:21:52,939][23882] Updated weights for policy 0, policy_version 77780 (0.0007) [2023-03-06 18:21:53,742][23882] Updated weights for policy 0, policy_version 77790 (0.0006) [2023-03-06 18:21:54,521][23882] Updated weights for policy 0, policy_version 77800 (0.0007) [2023-03-06 18:21:55,318][23882] Updated weights for policy 0, policy_version 77810 (0.0006) [2023-03-06 18:21:56,105][23882] Updated weights for policy 0, policy_version 77820 (0.0006) [2023-03-06 18:21:56,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13039.0, 300 sec: 13041.2). Total num frames: 79695872. Throughput: 0: 13040.1. Samples: 79693998. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 18:21:56,748][23556] Avg episode reward: [(0, '2004.205')] [2023-03-06 18:21:56,752][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000077828_79695872.pth... [2023-03-06 18:21:56,781][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000074771_76565504.pth [2023-03-06 18:21:56,908][23882] Updated weights for policy 0, policy_version 77830 (0.0006) [2023-03-06 18:21:57,682][23882] Updated weights for policy 0, policy_version 77840 (0.0007) [2023-03-06 18:21:58,451][23882] Updated weights for policy 0, policy_version 77850 (0.0007) [2023-03-06 18:21:59,253][23882] Updated weights for policy 0, policy_version 77860 (0.0007) [2023-03-06 18:22:00,026][23882] Updated weights for policy 0, policy_version 77870 (0.0006) [2023-03-06 18:22:00,813][23882] Updated weights for policy 0, policy_version 77880 (0.0006) [2023-03-06 18:22:01,595][23882] Updated weights for policy 0, policy_version 77890 (0.0006) [2023-03-06 18:22:01,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13039.0, 300 sec: 13037.8). Total num frames: 79760384. Throughput: 0: 13042.5. Samples: 79733133. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:22:01,748][23556] Avg episode reward: [(0, '1721.171')] [2023-03-06 18:22:02,380][23882] Updated weights for policy 0, policy_version 77900 (0.0006) [2023-03-06 18:22:03,142][23882] Updated weights for policy 0, policy_version 77910 (0.0006) [2023-03-06 18:22:03,929][23882] Updated weights for policy 0, policy_version 77920 (0.0006) [2023-03-06 18:22:04,709][23882] Updated weights for policy 0, policy_version 77930 (0.0006) [2023-03-06 18:22:05,521][23882] Updated weights for policy 0, policy_version 77940 (0.0007) [2023-03-06 18:22:06,291][23882] Updated weights for policy 0, policy_version 77950 (0.0006) [2023-03-06 18:22:06,748][23556] Fps is (10 sec: 13004.5, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 79825920. Throughput: 0: 13049.2. Samples: 79811566. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:22:06,749][23556] Avg episode reward: [(0, '2041.762')] [2023-03-06 18:22:07,076][23882] Updated weights for policy 0, policy_version 77960 (0.0007) [2023-03-06 18:22:07,846][23882] Updated weights for policy 0, policy_version 77970 (0.0007) [2023-03-06 18:22:08,632][23882] Updated weights for policy 0, policy_version 77980 (0.0006) [2023-03-06 18:22:09,414][23882] Updated weights for policy 0, policy_version 77990 (0.0007) [2023-03-06 18:22:10,198][23882] Updated weights for policy 0, policy_version 78000 (0.0006) [2023-03-06 18:22:10,991][23882] Updated weights for policy 0, policy_version 78010 (0.0006) [2023-03-06 18:22:11,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13041.2). Total num frames: 79891456. Throughput: 0: 13055.1. Samples: 79890180. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:22:11,748][23556] Avg episode reward: [(0, '2111.167')] [2023-03-06 18:22:11,758][23882] Updated weights for policy 0, policy_version 78020 (0.0006) [2023-03-06 18:22:12,534][23882] Updated weights for policy 0, policy_version 78030 (0.0007) [2023-03-06 18:22:13,322][23882] Updated weights for policy 0, policy_version 78040 (0.0006) [2023-03-06 18:22:14,103][23882] Updated weights for policy 0, policy_version 78050 (0.0006) [2023-03-06 18:22:14,402][23831] KL-divergence is very high: 483.3935 [2023-03-06 18:22:14,881][23882] Updated weights for policy 0, policy_version 78060 (0.0007) [2023-03-06 18:22:15,657][23882] Updated weights for policy 0, policy_version 78070 (0.0006) [2023-03-06 18:22:16,447][23882] Updated weights for policy 0, policy_version 78080 (0.0006) [2023-03-06 18:22:16,748][23556] Fps is (10 sec: 13209.9, 60 sec: 13073.0, 300 sec: 13044.7). Total num frames: 79958016. Throughput: 0: 13064.9. Samples: 79929616. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:22:16,748][23556] Avg episode reward: [(0, '1993.154')] [2023-03-06 18:22:17,221][23882] Updated weights for policy 0, policy_version 78090 (0.0006) [2023-03-06 18:22:18,006][23882] Updated weights for policy 0, policy_version 78100 (0.0007) [2023-03-06 18:22:18,791][23882] Updated weights for policy 0, policy_version 78110 (0.0006) [2023-03-06 18:22:19,573][23882] Updated weights for policy 0, policy_version 78120 (0.0007) [2023-03-06 18:22:20,367][23882] Updated weights for policy 0, policy_version 78130 (0.0006) [2023-03-06 18:22:21,143][23882] Updated weights for policy 0, policy_version 78140 (0.0007) [2023-03-06 18:22:21,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13044.7). Total num frames: 80022528. Throughput: 0: 13056.3. Samples: 80007996. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:22:21,748][23556] Avg episode reward: [(0, '1742.987')] [2023-03-06 18:22:21,943][23882] Updated weights for policy 0, policy_version 78150 (0.0006) [2023-03-06 18:22:22,712][23882] Updated weights for policy 0, policy_version 78160 (0.0007) [2023-03-06 18:22:23,489][23882] Updated weights for policy 0, policy_version 78170 (0.0006) [2023-03-06 18:22:24,271][23882] Updated weights for policy 0, policy_version 78180 (0.0007) [2023-03-06 18:22:25,052][23882] Updated weights for policy 0, policy_version 78190 (0.0006) [2023-03-06 18:22:25,840][23882] Updated weights for policy 0, policy_version 78200 (0.0007) [2023-03-06 18:22:26,634][23882] Updated weights for policy 0, policy_version 78210 (0.0007) [2023-03-06 18:22:26,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13073.1, 300 sec: 13048.2). Total num frames: 80088064. Throughput: 0: 13063.0. Samples: 80086420. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:22:26,749][23556] Avg episode reward: [(0, '1802.710')] [2023-03-06 18:22:27,418][23882] Updated weights for policy 0, policy_version 78220 (0.0006) [2023-03-06 18:22:28,189][23882] Updated weights for policy 0, policy_version 78230 (0.0007) [2023-03-06 18:22:28,961][23882] Updated weights for policy 0, policy_version 78240 (0.0007) [2023-03-06 18:22:29,730][23882] Updated weights for policy 0, policy_version 78250 (0.0006) [2023-03-06 18:22:30,507][23882] Updated weights for policy 0, policy_version 78260 (0.0006) [2023-03-06 18:22:31,293][23882] Updated weights for policy 0, policy_version 78270 (0.0006) [2023-03-06 18:22:31,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13048.2). Total num frames: 80153600. Throughput: 0: 13074.2. Samples: 80125999. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:22:31,748][23556] Avg episode reward: [(0, '1503.075')] [2023-03-06 18:22:32,074][23882] Updated weights for policy 0, policy_version 78280 (0.0006) [2023-03-06 18:22:32,843][23882] Updated weights for policy 0, policy_version 78290 (0.0007) [2023-03-06 18:22:33,637][23882] Updated weights for policy 0, policy_version 78300 (0.0006) [2023-03-06 18:22:34,435][23882] Updated weights for policy 0, policy_version 78310 (0.0007) [2023-03-06 18:22:35,222][23882] Updated weights for policy 0, policy_version 78320 (0.0006) [2023-03-06 18:22:35,984][23882] Updated weights for policy 0, policy_version 78330 (0.0006) [2023-03-06 18:22:36,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13048.2). Total num frames: 80219136. Throughput: 0: 13081.4. Samples: 80204650. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:22:36,748][23556] Avg episode reward: [(0, '1082.832')] [2023-03-06 18:22:36,750][23882] Updated weights for policy 0, policy_version 78340 (0.0006) [2023-03-06 18:22:37,566][23882] Updated weights for policy 0, policy_version 78350 (0.0006) [2023-03-06 18:22:38,355][23882] Updated weights for policy 0, policy_version 78360 (0.0006) [2023-03-06 18:22:39,153][23882] Updated weights for policy 0, policy_version 78370 (0.0006) [2023-03-06 18:22:39,918][23882] Updated weights for policy 0, policy_version 78380 (0.0006) [2023-03-06 18:22:40,726][23882] Updated weights for policy 0, policy_version 78390 (0.0007) [2023-03-06 18:22:41,500][23882] Updated weights for policy 0, policy_version 78400 (0.0006) [2023-03-06 18:22:41,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13073.0, 300 sec: 13048.2). Total num frames: 80284672. Throughput: 0: 13085.6. Samples: 80282852. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:22:41,748][23556] Avg episode reward: [(0, '1177.466')] [2023-03-06 18:22:42,290][23882] Updated weights for policy 0, policy_version 78410 (0.0007) [2023-03-06 18:22:43,095][23882] Updated weights for policy 0, policy_version 78420 (0.0006) [2023-03-06 18:22:43,881][23882] Updated weights for policy 0, policy_version 78430 (0.0006) [2023-03-06 18:22:44,675][23882] Updated weights for policy 0, policy_version 78440 (0.0006) [2023-03-06 18:22:45,462][23882] Updated weights for policy 0, policy_version 78450 (0.0006) [2023-03-06 18:22:46,249][23882] Updated weights for policy 0, policy_version 78460 (0.0007) [2023-03-06 18:22:46,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13073.0, 300 sec: 13048.2). Total num frames: 80349184. Throughput: 0: 13077.4. Samples: 80321617. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:22:46,748][23556] Avg episode reward: [(0, '1431.377')] [2023-03-06 18:22:47,014][23882] Updated weights for policy 0, policy_version 78470 (0.0006) [2023-03-06 18:22:47,805][23882] Updated weights for policy 0, policy_version 78480 (0.0006) [2023-03-06 18:22:48,600][23882] Updated weights for policy 0, policy_version 78490 (0.0006) [2023-03-06 18:22:49,377][23882] Updated weights for policy 0, policy_version 78500 (0.0006) [2023-03-06 18:22:50,174][23882] Updated weights for policy 0, policy_version 78510 (0.0006) [2023-03-06 18:22:50,964][23882] Updated weights for policy 0, policy_version 78520 (0.0006) [2023-03-06 18:22:51,742][23882] Updated weights for policy 0, policy_version 78530 (0.0005) [2023-03-06 18:22:51,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13073.1, 300 sec: 13048.2). Total num frames: 80414720. Throughput: 0: 13065.8. Samples: 80399524. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:22:51,748][23556] Avg episode reward: [(0, '1625.327')] [2023-03-06 18:22:52,534][23882] Updated weights for policy 0, policy_version 78540 (0.0006) [2023-03-06 18:22:53,323][23882] Updated weights for policy 0, policy_version 78550 (0.0006) [2023-03-06 18:22:54,106][23882] Updated weights for policy 0, policy_version 78560 (0.0006) [2023-03-06 18:22:54,890][23882] Updated weights for policy 0, policy_version 78570 (0.0006) [2023-03-06 18:22:55,681][23882] Updated weights for policy 0, policy_version 78580 (0.0006) [2023-03-06 18:22:56,447][23882] Updated weights for policy 0, policy_version 78590 (0.0006) [2023-03-06 18:22:56,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 80479232. Throughput: 0: 13062.6. Samples: 80477999. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:22:56,748][23556] Avg episode reward: [(0, '1761.968')] [2023-03-06 18:22:57,250][23882] Updated weights for policy 0, policy_version 78600 (0.0007) [2023-03-06 18:22:58,031][23882] Updated weights for policy 0, policy_version 78610 (0.0007) [2023-03-06 18:22:58,823][23882] Updated weights for policy 0, policy_version 78620 (0.0006) [2023-03-06 18:22:59,613][23882] Updated weights for policy 0, policy_version 78630 (0.0006) [2023-03-06 18:23:00,406][23882] Updated weights for policy 0, policy_version 78640 (0.0007) [2023-03-06 18:23:01,189][23882] Updated weights for policy 0, policy_version 78650 (0.0007) [2023-03-06 18:23:01,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13073.1, 300 sec: 13048.2). Total num frames: 80544768. Throughput: 0: 13050.2. Samples: 80516875. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:23:01,748][23556] Avg episode reward: [(0, '1820.405')] [2023-03-06 18:23:01,981][23882] Updated weights for policy 0, policy_version 78660 (0.0006) [2023-03-06 18:23:02,760][23882] Updated weights for policy 0, policy_version 78670 (0.0006) [2023-03-06 18:23:03,547][23882] Updated weights for policy 0, policy_version 78680 (0.0006) [2023-03-06 18:23:04,351][23882] Updated weights for policy 0, policy_version 78690 (0.0006) [2023-03-06 18:23:05,134][23882] Updated weights for policy 0, policy_version 78700 (0.0006) [2023-03-06 18:23:05,907][23882] Updated weights for policy 0, policy_version 78710 (0.0006) [2023-03-06 18:23:06,703][23882] Updated weights for policy 0, policy_version 78720 (0.0005) [2023-03-06 18:23:06,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 80609280. Throughput: 0: 13039.6. Samples: 80594778. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:23:06,748][23556] Avg episode reward: [(0, '1716.807')] [2023-03-06 18:23:07,490][23882] Updated weights for policy 0, policy_version 78730 (0.0006) [2023-03-06 18:23:08,283][23882] Updated weights for policy 0, policy_version 78740 (0.0006) [2023-03-06 18:23:09,059][23882] Updated weights for policy 0, policy_version 78750 (0.0006) [2023-03-06 18:23:09,850][23882] Updated weights for policy 0, policy_version 78760 (0.0006) [2023-03-06 18:23:10,653][23882] Updated weights for policy 0, policy_version 78770 (0.0006) [2023-03-06 18:23:11,438][23882] Updated weights for policy 0, policy_version 78780 (0.0006) [2023-03-06 18:23:11,748][23556] Fps is (10 sec: 12902.5, 60 sec: 13039.0, 300 sec: 13044.7). Total num frames: 80673792. Throughput: 0: 13030.0. Samples: 80672768. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:23:11,748][23556] Avg episode reward: [(0, '1655.829')] [2023-03-06 18:23:12,230][23882] Updated weights for policy 0, policy_version 78790 (0.0006) [2023-03-06 18:23:13,030][23882] Updated weights for policy 0, policy_version 78800 (0.0006) [2023-03-06 18:23:13,814][23882] Updated weights for policy 0, policy_version 78810 (0.0007) [2023-03-06 18:23:14,602][23882] Updated weights for policy 0, policy_version 78820 (0.0006) [2023-03-06 18:23:15,383][23882] Updated weights for policy 0, policy_version 78830 (0.0008) [2023-03-06 18:23:16,193][23882] Updated weights for policy 0, policy_version 78840 (0.0006) [2023-03-06 18:23:16,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13044.7). Total num frames: 80739328. Throughput: 0: 13012.3. Samples: 80711552. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:23:16,748][23556] Avg episode reward: [(0, '1621.632')] [2023-03-06 18:23:16,969][23882] Updated weights for policy 0, policy_version 78850 (0.0005) [2023-03-06 18:23:17,777][23882] Updated weights for policy 0, policy_version 78860 (0.0007) [2023-03-06 18:23:18,558][23882] Updated weights for policy 0, policy_version 78870 (0.0006) [2023-03-06 18:23:19,341][23882] Updated weights for policy 0, policy_version 78880 (0.0006) [2023-03-06 18:23:20,134][23882] Updated weights for policy 0, policy_version 78890 (0.0006) [2023-03-06 18:23:20,934][23882] Updated weights for policy 0, policy_version 78900 (0.0006) [2023-03-06 18:23:21,737][23882] Updated weights for policy 0, policy_version 78910 (0.0006) [2023-03-06 18:23:21,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13044.7). Total num frames: 80803840. Throughput: 0: 12989.5. Samples: 80789176. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:23:21,748][23556] Avg episode reward: [(0, '1714.923')] [2023-03-06 18:23:22,521][23882] Updated weights for policy 0, policy_version 78920 (0.0007) [2023-03-06 18:23:23,311][23882] Updated weights for policy 0, policy_version 78930 (0.0006) [2023-03-06 18:23:24,127][23882] Updated weights for policy 0, policy_version 78940 (0.0007) [2023-03-06 18:23:24,898][23882] Updated weights for policy 0, policy_version 78950 (0.0006) [2023-03-06 18:23:25,677][23882] Updated weights for policy 0, policy_version 78960 (0.0006) [2023-03-06 18:23:26,452][23882] Updated weights for policy 0, policy_version 78970 (0.0007) [2023-03-06 18:23:26,748][23556] Fps is (10 sec: 12902.5, 60 sec: 13004.8, 300 sec: 13041.2). Total num frames: 80868352. Throughput: 0: 12980.4. Samples: 80866968. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:23:26,748][23556] Avg episode reward: [(0, '1862.055')] [2023-03-06 18:23:27,254][23882] Updated weights for policy 0, policy_version 78980 (0.0006) [2023-03-06 18:23:28,038][23882] Updated weights for policy 0, policy_version 78990 (0.0006) [2023-03-06 18:23:28,756][23831] KL-divergence is very high: 1691.6747 [2023-03-06 18:23:28,817][23882] Updated weights for policy 0, policy_version 79000 (0.0006) [2023-03-06 18:23:29,633][23882] Updated weights for policy 0, policy_version 79010 (0.0006) [2023-03-06 18:23:30,094][23831] KL-divergence is very high: 380.4673 [2023-03-06 18:23:30,165][23831] KL-divergence is very high: 1570.7816 [2023-03-06 18:23:30,422][23882] Updated weights for policy 0, policy_version 79020 (0.0006) [2023-03-06 18:23:31,179][23882] Updated weights for policy 0, policy_version 79030 (0.0006) [2023-03-06 18:23:31,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13004.8, 300 sec: 13041.2). Total num frames: 80933888. Throughput: 0: 12984.9. Samples: 80905936. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:23:31,748][23556] Avg episode reward: [(0, '1769.144')] [2023-03-06 18:23:31,972][23882] Updated weights for policy 0, policy_version 79040 (0.0006) [2023-03-06 18:23:32,745][23882] Updated weights for policy 0, policy_version 79050 (0.0006) [2023-03-06 18:23:33,525][23882] Updated weights for policy 0, policy_version 79060 (0.0007) [2023-03-06 18:23:34,325][23882] Updated weights for policy 0, policy_version 79070 (0.0005) [2023-03-06 18:23:35,088][23882] Updated weights for policy 0, policy_version 79080 (0.0008) [2023-03-06 18:23:35,878][23882] Updated weights for policy 0, policy_version 79090 (0.0006) [2023-03-06 18:23:36,674][23882] Updated weights for policy 0, policy_version 79100 (0.0006) [2023-03-06 18:23:36,748][23556] Fps is (10 sec: 13004.7, 60 sec: 12987.7, 300 sec: 13041.2). Total num frames: 80998400. Throughput: 0: 12994.6. Samples: 80984284. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:23:36,748][23556] Avg episode reward: [(0, '1559.609')] [2023-03-06 18:23:37,473][23882] Updated weights for policy 0, policy_version 79110 (0.0007) [2023-03-06 18:23:38,251][23882] Updated weights for policy 0, policy_version 79120 (0.0007) [2023-03-06 18:23:39,041][23882] Updated weights for policy 0, policy_version 79130 (0.0006) [2023-03-06 18:23:39,832][23882] Updated weights for policy 0, policy_version 79140 (0.0007) [2023-03-06 18:23:40,614][23882] Updated weights for policy 0, policy_version 79150 (0.0006) [2023-03-06 18:23:41,394][23882] Updated weights for policy 0, policy_version 79160 (0.0007) [2023-03-06 18:23:41,748][23556] Fps is (10 sec: 13004.8, 60 sec: 12987.7, 300 sec: 13041.3). Total num frames: 81063936. Throughput: 0: 12984.4. Samples: 81062294. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:23:41,748][23556] Avg episode reward: [(0, '1680.980')] [2023-03-06 18:23:42,187][23882] Updated weights for policy 0, policy_version 79170 (0.0007) [2023-03-06 18:23:42,972][23882] Updated weights for policy 0, policy_version 79180 (0.0006) [2023-03-06 18:23:43,757][23882] Updated weights for policy 0, policy_version 79190 (0.0006) [2023-03-06 18:23:44,556][23882] Updated weights for policy 0, policy_version 79200 (0.0007) [2023-03-06 18:23:45,331][23882] Updated weights for policy 0, policy_version 79210 (0.0006) [2023-03-06 18:23:46,113][23882] Updated weights for policy 0, policy_version 79220 (0.0006) [2023-03-06 18:23:46,748][23556] Fps is (10 sec: 13005.0, 60 sec: 12987.7, 300 sec: 13037.8). Total num frames: 81128448. Throughput: 0: 12984.1. Samples: 81101159. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:23:46,748][23556] Avg episode reward: [(0, '1816.998')] [2023-03-06 18:23:46,913][23882] Updated weights for policy 0, policy_version 79230 (0.0006) [2023-03-06 18:23:47,724][23882] Updated weights for policy 0, policy_version 79240 (0.0006) [2023-03-06 18:23:48,501][23882] Updated weights for policy 0, policy_version 79250 (0.0007) [2023-03-06 18:23:49,294][23882] Updated weights for policy 0, policy_version 79260 (0.0007) [2023-03-06 18:23:50,061][23882] Updated weights for policy 0, policy_version 79270 (0.0006) [2023-03-06 18:23:50,862][23882] Updated weights for policy 0, policy_version 79280 (0.0006) [2023-03-06 18:23:51,657][23882] Updated weights for policy 0, policy_version 79290 (0.0006) [2023-03-06 18:23:51,748][23556] Fps is (10 sec: 13004.9, 60 sec: 12987.7, 300 sec: 13041.2). Total num frames: 81193984. Throughput: 0: 12986.3. Samples: 81179157. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:23:51,748][23556] Avg episode reward: [(0, '1772.754')] [2023-03-06 18:23:52,454][23882] Updated weights for policy 0, policy_version 79300 (0.0006) [2023-03-06 18:23:53,241][23882] Updated weights for policy 0, policy_version 79310 (0.0006) [2023-03-06 18:23:54,044][23882] Updated weights for policy 0, policy_version 79320 (0.0007) [2023-03-06 18:23:54,820][23882] Updated weights for policy 0, policy_version 79330 (0.0007) [2023-03-06 18:23:55,596][23882] Updated weights for policy 0, policy_version 79340 (0.0006) [2023-03-06 18:23:56,394][23882] Updated weights for policy 0, policy_version 79350 (0.0006) [2023-03-06 18:23:56,748][23556] Fps is (10 sec: 13004.6, 60 sec: 12987.7, 300 sec: 13037.8). Total num frames: 81258496. Throughput: 0: 12984.6. Samples: 81257076. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:23:56,754][23556] Avg episode reward: [(0, '1966.197')] [2023-03-06 18:23:56,766][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000079355_81259520.pth... [2023-03-06 18:23:56,796][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000076299_78130176.pth [2023-03-06 18:23:57,160][23882] Updated weights for policy 0, policy_version 79360 (0.0007) [2023-03-06 18:23:57,938][23882] Updated weights for policy 0, policy_version 79370 (0.0007) [2023-03-06 18:23:58,726][23882] Updated weights for policy 0, policy_version 79380 (0.0007) [2023-03-06 18:23:59,528][23882] Updated weights for policy 0, policy_version 79390 (0.0006) [2023-03-06 18:24:00,317][23882] Updated weights for policy 0, policy_version 79400 (0.0007) [2023-03-06 18:24:01,094][23882] Updated weights for policy 0, policy_version 79410 (0.0007) [2023-03-06 18:24:01,748][23556] Fps is (10 sec: 13004.7, 60 sec: 12987.7, 300 sec: 13037.8). Total num frames: 81324032. Throughput: 0: 12994.8. Samples: 81296317. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:24:01,759][23556] Avg episode reward: [(0, '1909.975')] [2023-03-06 18:24:01,880][23882] Updated weights for policy 0, policy_version 79420 (0.0007) [2023-03-06 18:24:02,674][23882] Updated weights for policy 0, policy_version 79430 (0.0005) [2023-03-06 18:24:03,449][23882] Updated weights for policy 0, policy_version 79440 (0.0007) [2023-03-06 18:24:04,234][23882] Updated weights for policy 0, policy_version 79450 (0.0006) [2023-03-06 18:24:05,018][23882] Updated weights for policy 0, policy_version 79460 (0.0006) [2023-03-06 18:24:05,790][23882] Updated weights for policy 0, policy_version 79470 (0.0006) [2023-03-06 18:24:06,565][23882] Updated weights for policy 0, policy_version 79480 (0.0006) [2023-03-06 18:24:06,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13004.8, 300 sec: 13041.2). Total num frames: 81389568. Throughput: 0: 13010.6. Samples: 81374652. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:24:06,759][23556] Avg episode reward: [(0, '1909.275')] [2023-03-06 18:24:07,352][23882] Updated weights for policy 0, policy_version 79490 (0.0006) [2023-03-06 18:24:08,153][23882] Updated weights for policy 0, policy_version 79500 (0.0007) [2023-03-06 18:24:08,946][23882] Updated weights for policy 0, policy_version 79510 (0.0006) [2023-03-06 18:24:09,720][23882] Updated weights for policy 0, policy_version 79520 (0.0008) [2023-03-06 18:24:10,501][23882] Updated weights for policy 0, policy_version 79530 (0.0006) [2023-03-06 18:24:11,290][23882] Updated weights for policy 0, policy_version 79540 (0.0007) [2023-03-06 18:24:11,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13004.8, 300 sec: 13037.8). Total num frames: 81454080. Throughput: 0: 13021.7. Samples: 81452944. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:24:11,758][23556] Avg episode reward: [(0, '1789.164')] [2023-03-06 18:24:12,065][23882] Updated weights for policy 0, policy_version 79550 (0.0007) [2023-03-06 18:24:12,851][23882] Updated weights for policy 0, policy_version 79560 (0.0007) [2023-03-06 18:24:13,630][23882] Updated weights for policy 0, policy_version 79570 (0.0007) [2023-03-06 18:24:14,084][23831] KL-divergence is very high: 8999.1279 [2023-03-06 18:24:14,404][23882] Updated weights for policy 0, policy_version 79580 (0.0006) [2023-03-06 18:24:15,190][23882] Updated weights for policy 0, policy_version 79590 (0.0007) [2023-03-06 18:24:15,982][23882] Updated weights for policy 0, policy_version 79600 (0.0006) [2023-03-06 18:24:16,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13037.8). Total num frames: 81519616. Throughput: 0: 13029.4. Samples: 81492257. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:24:16,759][23556] Avg episode reward: [(0, '1823.463')] [2023-03-06 18:24:16,785][23882] Updated weights for policy 0, policy_version 79610 (0.0007) [2023-03-06 18:24:17,561][23882] Updated weights for policy 0, policy_version 79620 (0.0007) [2023-03-06 18:24:18,338][23882] Updated weights for policy 0, policy_version 79630 (0.0006) [2023-03-06 18:24:19,132][23882] Updated weights for policy 0, policy_version 79640 (0.0006) [2023-03-06 18:24:19,903][23882] Updated weights for policy 0, policy_version 79650 (0.0006) [2023-03-06 18:24:20,689][23882] Updated weights for policy 0, policy_version 79660 (0.0006) [2023-03-06 18:24:21,464][23882] Updated weights for policy 0, policy_version 79670 (0.0006) [2023-03-06 18:24:21,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13021.9, 300 sec: 13041.2). Total num frames: 81585152. Throughput: 0: 13029.1. Samples: 81570592. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:24:21,759][23556] Avg episode reward: [(0, '1963.141')] [2023-03-06 18:24:22,249][23882] Updated weights for policy 0, policy_version 79680 (0.0006) [2023-03-06 18:24:23,025][23882] Updated weights for policy 0, policy_version 79690 (0.0006) [2023-03-06 18:24:23,826][23882] Updated weights for policy 0, policy_version 79700 (0.0006) [2023-03-06 18:24:24,603][23882] Updated weights for policy 0, policy_version 79710 (0.0007) [2023-03-06 18:24:25,398][23882] Updated weights for policy 0, policy_version 79720 (0.0007) [2023-03-06 18:24:26,183][23882] Updated weights for policy 0, policy_version 79730 (0.0006) [2023-03-06 18:24:26,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13041.2). Total num frames: 81650688. Throughput: 0: 13033.8. Samples: 81648818. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:24:26,758][23556] Avg episode reward: [(0, '1864.220')] [2023-03-06 18:24:26,968][23882] Updated weights for policy 0, policy_version 79740 (0.0007) [2023-03-06 18:24:27,754][23882] Updated weights for policy 0, policy_version 79750 (0.0006) [2023-03-06 18:24:28,546][23882] Updated weights for policy 0, policy_version 79760 (0.0007) [2023-03-06 18:24:29,346][23882] Updated weights for policy 0, policy_version 79770 (0.0006) [2023-03-06 18:24:30,123][23882] Updated weights for policy 0, policy_version 79780 (0.0006) [2023-03-06 18:24:30,898][23882] Updated weights for policy 0, policy_version 79790 (0.0006) [2023-03-06 18:24:31,689][23882] Updated weights for policy 0, policy_version 79800 (0.0006) [2023-03-06 18:24:31,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13037.8). Total num frames: 81715200. Throughput: 0: 13039.6. Samples: 81687940. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:24:31,759][23556] Avg episode reward: [(0, '1883.391')] [2023-03-06 18:24:32,465][23882] Updated weights for policy 0, policy_version 79810 (0.0006) [2023-03-06 18:24:33,233][23882] Updated weights for policy 0, policy_version 79820 (0.0006) [2023-03-06 18:24:34,043][23882] Updated weights for policy 0, policy_version 79830 (0.0007) [2023-03-06 18:24:34,833][23882] Updated weights for policy 0, policy_version 79840 (0.0007) [2023-03-06 18:24:35,626][23882] Updated weights for policy 0, policy_version 79850 (0.0006) [2023-03-06 18:24:36,410][23882] Updated weights for policy 0, policy_version 79860 (0.0007) [2023-03-06 18:24:36,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13041.2). Total num frames: 81780736. Throughput: 0: 13041.6. Samples: 81766029. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:24:36,759][23556] Avg episode reward: [(0, '1747.117')] [2023-03-06 18:24:37,190][23882] Updated weights for policy 0, policy_version 79870 (0.0006) [2023-03-06 18:24:37,973][23882] Updated weights for policy 0, policy_version 79880 (0.0007) [2023-03-06 18:24:38,745][23882] Updated weights for policy 0, policy_version 79890 (0.0006) [2023-03-06 18:24:39,534][23882] Updated weights for policy 0, policy_version 79900 (0.0006) [2023-03-06 18:24:40,326][23882] Updated weights for policy 0, policy_version 79910 (0.0006) [2023-03-06 18:24:41,110][23882] Updated weights for policy 0, policy_version 79920 (0.0006) [2023-03-06 18:24:41,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13041.3). Total num frames: 81846272. Throughput: 0: 13051.4. Samples: 81844386. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:24:41,755][23556] Avg episode reward: [(0, '1513.742')] [2023-03-06 18:24:41,897][23882] Updated weights for policy 0, policy_version 79930 (0.0006) [2023-03-06 18:24:42,676][23882] Updated weights for policy 0, policy_version 79940 (0.0007) [2023-03-06 18:24:43,460][23882] Updated weights for policy 0, policy_version 79950 (0.0007) [2023-03-06 18:24:44,224][23882] Updated weights for policy 0, policy_version 79960 (0.0006) [2023-03-06 18:24:45,011][23882] Updated weights for policy 0, policy_version 79970 (0.0006) [2023-03-06 18:24:45,811][23882] Updated weights for policy 0, policy_version 79980 (0.0006) [2023-03-06 18:24:46,591][23882] Updated weights for policy 0, policy_version 79990 (0.0006) [2023-03-06 18:24:46,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 81910784. Throughput: 0: 13053.9. Samples: 81883740. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:24:46,759][23556] Avg episode reward: [(0, '1612.000')] [2023-03-06 18:24:47,376][23882] Updated weights for policy 0, policy_version 80000 (0.0006) [2023-03-06 18:24:48,174][23882] Updated weights for policy 0, policy_version 80010 (0.0005) [2023-03-06 18:24:48,953][23882] Updated weights for policy 0, policy_version 80020 (0.0006) [2023-03-06 18:24:49,736][23882] Updated weights for policy 0, policy_version 80030 (0.0007) [2023-03-06 18:24:50,521][23882] Updated weights for policy 0, policy_version 80040 (0.0006) [2023-03-06 18:24:51,316][23882] Updated weights for policy 0, policy_version 80050 (0.0007) [2023-03-06 18:24:51,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 81976320. Throughput: 0: 13047.0. Samples: 81961770. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:24:51,748][23556] Avg episode reward: [(0, '1724.041')] [2023-03-06 18:24:52,090][23882] Updated weights for policy 0, policy_version 80060 (0.0006) [2023-03-06 18:24:52,883][23882] Updated weights for policy 0, policy_version 80070 (0.0006) [2023-03-06 18:24:53,674][23882] Updated weights for policy 0, policy_version 80080 (0.0008) [2023-03-06 18:24:54,471][23882] Updated weights for policy 0, policy_version 80090 (0.0007) [2023-03-06 18:24:55,255][23882] Updated weights for policy 0, policy_version 80100 (0.0006) [2023-03-06 18:24:56,045][23882] Updated weights for policy 0, policy_version 80110 (0.0008) [2023-03-06 18:24:56,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 82040832. Throughput: 0: 13042.6. Samples: 82039862. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:24:56,748][23556] Avg episode reward: [(0, '1853.598')] [2023-03-06 18:24:56,837][23882] Updated weights for policy 0, policy_version 80120 (0.0006) [2023-03-06 18:24:57,621][23882] Updated weights for policy 0, policy_version 80130 (0.0006) [2023-03-06 18:24:58,403][23882] Updated weights for policy 0, policy_version 80140 (0.0007) [2023-03-06 18:24:59,183][23882] Updated weights for policy 0, policy_version 80150 (0.0007) [2023-03-06 18:24:59,972][23882] Updated weights for policy 0, policy_version 80160 (0.0006) [2023-03-06 18:25:00,765][23882] Updated weights for policy 0, policy_version 80170 (0.0007) [2023-03-06 18:25:01,543][23882] Updated weights for policy 0, policy_version 80180 (0.0006) [2023-03-06 18:25:01,748][23556] Fps is (10 sec: 13005.0, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 82106368. Throughput: 0: 13037.6. Samples: 82078947. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:25:01,748][23556] Avg episode reward: [(0, '1918.418')] [2023-03-06 18:25:02,336][23882] Updated weights for policy 0, policy_version 80190 (0.0007) [2023-03-06 18:25:03,121][23882] Updated weights for policy 0, policy_version 80200 (0.0006) [2023-03-06 18:25:03,914][23882] Updated weights for policy 0, policy_version 80210 (0.0006) [2023-03-06 18:25:04,712][23882] Updated weights for policy 0, policy_version 80220 (0.0008) [2023-03-06 18:25:05,503][23882] Updated weights for policy 0, policy_version 80230 (0.0006) [2023-03-06 18:25:06,277][23882] Updated weights for policy 0, policy_version 80240 (0.0006) [2023-03-06 18:25:06,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 82171904. Throughput: 0: 13026.2. Samples: 82156771. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:25:06,748][23556] Avg episode reward: [(0, '1918.158')] [2023-03-06 18:25:07,083][23882] Updated weights for policy 0, policy_version 80250 (0.0007) [2023-03-06 18:25:07,851][23882] Updated weights for policy 0, policy_version 80260 (0.0006) [2023-03-06 18:25:08,640][23882] Updated weights for policy 0, policy_version 80270 (0.0007) [2023-03-06 18:25:09,426][23882] Updated weights for policy 0, policy_version 80280 (0.0007) [2023-03-06 18:25:10,230][23882] Updated weights for policy 0, policy_version 80290 (0.0005) [2023-03-06 18:25:10,996][23882] Updated weights for policy 0, policy_version 80300 (0.0006) [2023-03-06 18:25:11,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 82236416. Throughput: 0: 13028.8. Samples: 82235111. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:25:11,748][23556] Avg episode reward: [(0, '1999.791')] [2023-03-06 18:25:11,772][23882] Updated weights for policy 0, policy_version 80310 (0.0007) [2023-03-06 18:25:12,580][23882] Updated weights for policy 0, policy_version 80320 (0.0006) [2023-03-06 18:25:13,350][23882] Updated weights for policy 0, policy_version 80330 (0.0006) [2023-03-06 18:25:14,115][23882] Updated weights for policy 0, policy_version 80340 (0.0006) [2023-03-06 18:25:14,932][23882] Updated weights for policy 0, policy_version 80350 (0.0006) [2023-03-06 18:25:15,708][23882] Updated weights for policy 0, policy_version 80360 (0.0006) [2023-03-06 18:25:16,501][23882] Updated weights for policy 0, policy_version 80370 (0.0006) [2023-03-06 18:25:16,748][23556] Fps is (10 sec: 13005.0, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 82301952. Throughput: 0: 13027.3. Samples: 82274166. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:25:16,748][23556] Avg episode reward: [(0, '1893.835')] [2023-03-06 18:25:17,281][23882] Updated weights for policy 0, policy_version 80380 (0.0006) [2023-03-06 18:25:18,081][23882] Updated weights for policy 0, policy_version 80390 (0.0007) [2023-03-06 18:25:18,855][23882] Updated weights for policy 0, policy_version 80400 (0.0007) [2023-03-06 18:25:19,661][23882] Updated weights for policy 0, policy_version 80410 (0.0006) [2023-03-06 18:25:20,431][23882] Updated weights for policy 0, policy_version 80420 (0.0006) [2023-03-06 18:25:21,231][23882] Updated weights for policy 0, policy_version 80430 (0.0006) [2023-03-06 18:25:21,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13021.8, 300 sec: 13034.3). Total num frames: 82366464. Throughput: 0: 13024.6. Samples: 82352135. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:25:21,749][23556] Avg episode reward: [(0, '2077.148')] [2023-03-06 18:25:22,008][23882] Updated weights for policy 0, policy_version 80440 (0.0006) [2023-03-06 18:25:22,795][23882] Updated weights for policy 0, policy_version 80450 (0.0006) [2023-03-06 18:25:23,585][23882] Updated weights for policy 0, policy_version 80460 (0.0007) [2023-03-06 18:25:24,371][23882] Updated weights for policy 0, policy_version 80470 (0.0007) [2023-03-06 18:25:25,151][23882] Updated weights for policy 0, policy_version 80480 (0.0007) [2023-03-06 18:25:25,946][23882] Updated weights for policy 0, policy_version 80490 (0.0006) [2023-03-06 18:25:26,735][23882] Updated weights for policy 0, policy_version 80500 (0.0006) [2023-03-06 18:25:26,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 82432000. Throughput: 0: 13016.1. Samples: 82430108. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:25:26,748][23556] Avg episode reward: [(0, '2015.958')] [2023-03-06 18:25:27,535][23882] Updated weights for policy 0, policy_version 80510 (0.0006) [2023-03-06 18:25:28,312][23882] Updated weights for policy 0, policy_version 80520 (0.0006) [2023-03-06 18:25:29,097][23882] Updated weights for policy 0, policy_version 80530 (0.0007) [2023-03-06 18:25:29,876][23882] Updated weights for policy 0, policy_version 80540 (0.0006) [2023-03-06 18:25:30,646][23882] Updated weights for policy 0, policy_version 80550 (0.0007) [2023-03-06 18:25:31,436][23882] Updated weights for policy 0, policy_version 80560 (0.0006) [2023-03-06 18:25:31,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 82497536. Throughput: 0: 13011.1. Samples: 82469239. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:25:31,748][23556] Avg episode reward: [(0, '1969.600')] [2023-03-06 18:25:32,230][23882] Updated weights for policy 0, policy_version 80570 (0.0009) [2023-03-06 18:25:33,004][23882] Updated weights for policy 0, policy_version 80580 (0.0006) [2023-03-06 18:25:33,781][23882] Updated weights for policy 0, policy_version 80590 (0.0006) [2023-03-06 18:25:34,574][23882] Updated weights for policy 0, policy_version 80600 (0.0006) [2023-03-06 18:25:35,337][23882] Updated weights for policy 0, policy_version 80610 (0.0006) [2023-03-06 18:25:36,135][23882] Updated weights for policy 0, policy_version 80620 (0.0007) [2023-03-06 18:25:36,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13030.8). Total num frames: 82562048. Throughput: 0: 13024.3. Samples: 82547861. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:25:36,748][23556] Avg episode reward: [(0, '1937.886')] [2023-03-06 18:25:36,918][23882] Updated weights for policy 0, policy_version 80630 (0.0006) [2023-03-06 18:25:37,693][23882] Updated weights for policy 0, policy_version 80640 (0.0006) [2023-03-06 18:25:38,505][23882] Updated weights for policy 0, policy_version 80650 (0.0007) [2023-03-06 18:25:39,262][23882] Updated weights for policy 0, policy_version 80660 (0.0005) [2023-03-06 18:25:40,054][23882] Updated weights for policy 0, policy_version 80670 (0.0007) [2023-03-06 18:25:40,826][23882] Updated weights for policy 0, policy_version 80680 (0.0007) [2023-03-06 18:25:41,614][23882] Updated weights for policy 0, policy_version 80690 (0.0006) [2023-03-06 18:25:41,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 82627584. Throughput: 0: 13031.9. Samples: 82626295. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:25:41,748][23556] Avg episode reward: [(0, '1969.402')] [2023-03-06 18:25:42,415][23882] Updated weights for policy 0, policy_version 80700 (0.0006) [2023-03-06 18:25:43,192][23882] Updated weights for policy 0, policy_version 80710 (0.0006) [2023-03-06 18:25:43,984][23882] Updated weights for policy 0, policy_version 80720 (0.0006) [2023-03-06 18:25:44,755][23882] Updated weights for policy 0, policy_version 80730 (0.0006) [2023-03-06 18:25:45,563][23882] Updated weights for policy 0, policy_version 80740 (0.0007) [2023-03-06 18:25:46,351][23882] Updated weights for policy 0, policy_version 80750 (0.0006) [2023-03-06 18:25:46,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13030.8). Total num frames: 82692096. Throughput: 0: 13028.8. Samples: 82665244. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:25:46,748][23556] Avg episode reward: [(0, '1842.533')] [2023-03-06 18:25:47,129][23882] Updated weights for policy 0, policy_version 80760 (0.0006) [2023-03-06 18:25:47,934][23882] Updated weights for policy 0, policy_version 80770 (0.0007) [2023-03-06 18:25:48,708][23882] Updated weights for policy 0, policy_version 80780 (0.0006) [2023-03-06 18:25:49,490][23882] Updated weights for policy 0, policy_version 80790 (0.0006) [2023-03-06 18:25:50,283][23882] Updated weights for policy 0, policy_version 80800 (0.0006) [2023-03-06 18:25:51,053][23882] Updated weights for policy 0, policy_version 80810 (0.0006) [2023-03-06 18:25:51,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13030.8). Total num frames: 82757632. Throughput: 0: 13033.6. Samples: 82743284. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:25:51,748][23556] Avg episode reward: [(0, '1722.837')] [2023-03-06 18:25:51,834][23882] Updated weights for policy 0, policy_version 80820 (0.0006) [2023-03-06 18:25:52,629][23882] Updated weights for policy 0, policy_version 80830 (0.0006) [2023-03-06 18:25:53,407][23882] Updated weights for policy 0, policy_version 80840 (0.0006) [2023-03-06 18:25:54,186][23882] Updated weights for policy 0, policy_version 80850 (0.0007) [2023-03-06 18:25:54,974][23882] Updated weights for policy 0, policy_version 80860 (0.0006) [2023-03-06 18:25:55,743][23882] Updated weights for policy 0, policy_version 80870 (0.0006) [2023-03-06 18:25:56,531][23882] Updated weights for policy 0, policy_version 80880 (0.0006) [2023-03-06 18:25:56,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 82823168. Throughput: 0: 13042.1. Samples: 82822008. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:25:56,748][23556] Avg episode reward: [(0, '1906.365')] [2023-03-06 18:25:56,756][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000080883_82824192.pth... [2023-03-06 18:25:56,784][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000077828_79695872.pth [2023-03-06 18:25:57,309][23882] Updated weights for policy 0, policy_version 80890 (0.0007) [2023-03-06 18:25:58,114][23882] Updated weights for policy 0, policy_version 80900 (0.0006) [2023-03-06 18:25:58,884][23882] Updated weights for policy 0, policy_version 80910 (0.0006) [2023-03-06 18:25:59,686][23882] Updated weights for policy 0, policy_version 80920 (0.0006) [2023-03-06 18:26:00,466][23882] Updated weights for policy 0, policy_version 80930 (0.0006) [2023-03-06 18:26:01,257][23882] Updated weights for policy 0, policy_version 80940 (0.0006) [2023-03-06 18:26:01,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 82888704. Throughput: 0: 13040.1. Samples: 82860972. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:26:01,748][23556] Avg episode reward: [(0, '1876.459')] [2023-03-06 18:26:02,057][23882] Updated weights for policy 0, policy_version 80950 (0.0006) [2023-03-06 18:26:02,838][23882] Updated weights for policy 0, policy_version 80960 (0.0006) [2023-03-06 18:26:03,627][23882] Updated weights for policy 0, policy_version 80970 (0.0006) [2023-03-06 18:26:04,415][23882] Updated weights for policy 0, policy_version 80980 (0.0007) [2023-03-06 18:26:05,202][23882] Updated weights for policy 0, policy_version 80990 (0.0006) [2023-03-06 18:26:05,973][23882] Updated weights for policy 0, policy_version 81000 (0.0006) [2023-03-06 18:26:06,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13030.8). Total num frames: 82953216. Throughput: 0: 13036.3. Samples: 82938769. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:26:06,748][23556] Avg episode reward: [(0, '1898.770')] [2023-03-06 18:26:06,754][23882] Updated weights for policy 0, policy_version 81010 (0.0007) [2023-03-06 18:26:07,534][23882] Updated weights for policy 0, policy_version 81020 (0.0006) [2023-03-06 18:26:08,314][23882] Updated weights for policy 0, policy_version 81030 (0.0006) [2023-03-06 18:26:09,100][23882] Updated weights for policy 0, policy_version 81040 (0.0006) [2023-03-06 18:26:09,891][23882] Updated weights for policy 0, policy_version 81050 (0.0006) [2023-03-06 18:26:10,665][23882] Updated weights for policy 0, policy_version 81060 (0.0006) [2023-03-06 18:26:11,445][23882] Updated weights for policy 0, policy_version 81070 (0.0007) [2023-03-06 18:26:11,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 83018752. Throughput: 0: 13048.6. Samples: 83017296. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:26:11,748][23556] Avg episode reward: [(0, '1967.448')] [2023-03-06 18:26:12,254][23882] Updated weights for policy 0, policy_version 81080 (0.0007) [2023-03-06 18:26:13,044][23882] Updated weights for policy 0, policy_version 81090 (0.0007) [2023-03-06 18:26:13,814][23882] Updated weights for policy 0, policy_version 81100 (0.0006) [2023-03-06 18:26:14,615][23882] Updated weights for policy 0, policy_version 81110 (0.0006) [2023-03-06 18:26:15,394][23882] Updated weights for policy 0, policy_version 81120 (0.0006) [2023-03-06 18:26:16,190][23882] Updated weights for policy 0, policy_version 81130 (0.0006) [2023-03-06 18:26:16,748][23556] Fps is (10 sec: 13107.0, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 83084288. Throughput: 0: 13044.4. Samples: 83056240. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:26:16,748][23556] Avg episode reward: [(0, '2008.774')] [2023-03-06 18:26:16,977][23882] Updated weights for policy 0, policy_version 81140 (0.0006) [2023-03-06 18:26:17,746][23882] Updated weights for policy 0, policy_version 81150 (0.0006) [2023-03-06 18:26:18,530][23882] Updated weights for policy 0, policy_version 81160 (0.0006) [2023-03-06 18:26:19,313][23882] Updated weights for policy 0, policy_version 81170 (0.0007) [2023-03-06 18:26:20,109][23882] Updated weights for policy 0, policy_version 81180 (0.0007) [2023-03-06 18:26:20,895][23882] Updated weights for policy 0, policy_version 81190 (0.0007) [2023-03-06 18:26:21,669][23882] Updated weights for policy 0, policy_version 81200 (0.0007) [2023-03-06 18:26:21,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 83149824. Throughput: 0: 13038.4. Samples: 83134589. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:26:21,748][23556] Avg episode reward: [(0, '1989.373')] [2023-03-06 18:26:22,453][23882] Updated weights for policy 0, policy_version 81210 (0.0007) [2023-03-06 18:26:23,238][23882] Updated weights for policy 0, policy_version 81220 (0.0007) [2023-03-06 18:26:24,040][23882] Updated weights for policy 0, policy_version 81230 (0.0006) [2023-03-06 18:26:24,824][23882] Updated weights for policy 0, policy_version 81240 (0.0006) [2023-03-06 18:26:25,602][23882] Updated weights for policy 0, policy_version 81250 (0.0006) [2023-03-06 18:26:26,386][23882] Updated weights for policy 0, policy_version 81260 (0.0006) [2023-03-06 18:26:26,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 83214336. Throughput: 0: 13037.7. Samples: 83212994. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:26:26,748][23556] Avg episode reward: [(0, '2020.178')] [2023-03-06 18:26:27,176][23882] Updated weights for policy 0, policy_version 81270 (0.0006) [2023-03-06 18:26:27,956][23882] Updated weights for policy 0, policy_version 81280 (0.0006) [2023-03-06 18:26:28,757][23882] Updated weights for policy 0, policy_version 81290 (0.0007) [2023-03-06 18:26:29,530][23882] Updated weights for policy 0, policy_version 81300 (0.0007) [2023-03-06 18:26:30,321][23882] Updated weights for policy 0, policy_version 81310 (0.0006) [2023-03-06 18:26:31,111][23882] Updated weights for policy 0, policy_version 81320 (0.0007) [2023-03-06 18:26:31,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 83279872. Throughput: 0: 13038.3. Samples: 83251967. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:26:31,748][23556] Avg episode reward: [(0, '2057.243')] [2023-03-06 18:26:31,878][23882] Updated weights for policy 0, policy_version 81330 (0.0007) [2023-03-06 18:26:32,662][23882] Updated weights for policy 0, policy_version 81340 (0.0006) [2023-03-06 18:26:33,442][23882] Updated weights for policy 0, policy_version 81350 (0.0006) [2023-03-06 18:26:34,225][23882] Updated weights for policy 0, policy_version 81360 (0.0007) [2023-03-06 18:26:34,999][23882] Updated weights for policy 0, policy_version 81370 (0.0006) [2023-03-06 18:26:35,789][23882] Updated weights for policy 0, policy_version 81380 (0.0006) [2023-03-06 18:26:36,567][23882] Updated weights for policy 0, policy_version 81390 (0.0006) [2023-03-06 18:26:36,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13034.3). Total num frames: 83345408. Throughput: 0: 13050.8. Samples: 83330570. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:26:36,748][23556] Avg episode reward: [(0, '1946.598')] [2023-03-06 18:26:37,349][23882] Updated weights for policy 0, policy_version 81400 (0.0006) [2023-03-06 18:26:38,137][23882] Updated weights for policy 0, policy_version 81410 (0.0007) [2023-03-06 18:26:38,911][23882] Updated weights for policy 0, policy_version 81420 (0.0006) [2023-03-06 18:26:39,705][23882] Updated weights for policy 0, policy_version 81430 (0.0006) [2023-03-06 18:26:40,469][23882] Updated weights for policy 0, policy_version 81440 (0.0007) [2023-03-06 18:26:41,258][23882] Updated weights for policy 0, policy_version 81450 (0.0006) [2023-03-06 18:26:41,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 83410944. Throughput: 0: 13046.4. Samples: 83409097. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:26:41,748][23556] Avg episode reward: [(0, '2004.913')] [2023-03-06 18:26:42,062][23882] Updated weights for policy 0, policy_version 81460 (0.0006) [2023-03-06 18:26:42,837][23882] Updated weights for policy 0, policy_version 81470 (0.0006) [2023-03-06 18:26:43,612][23882] Updated weights for policy 0, policy_version 81480 (0.0006) [2023-03-06 18:26:44,396][23882] Updated weights for policy 0, policy_version 81490 (0.0006) [2023-03-06 18:26:45,182][23882] Updated weights for policy 0, policy_version 81500 (0.0006) [2023-03-06 18:26:45,957][23882] Updated weights for policy 0, policy_version 81510 (0.0006) [2023-03-06 18:26:46,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13034.3). Total num frames: 83475456. Throughput: 0: 13048.2. Samples: 83448139. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:26:46,748][23556] Avg episode reward: [(0, '1972.299')] [2023-03-06 18:26:46,760][23882] Updated weights for policy 0, policy_version 81520 (0.0007) [2023-03-06 18:26:47,546][23882] Updated weights for policy 0, policy_version 81530 (0.0006) [2023-03-06 18:26:48,325][23882] Updated weights for policy 0, policy_version 81540 (0.0006) [2023-03-06 18:26:49,126][23882] Updated weights for policy 0, policy_version 81550 (0.0006) [2023-03-06 18:26:49,906][23882] Updated weights for policy 0, policy_version 81560 (0.0005) [2023-03-06 18:26:50,705][23882] Updated weights for policy 0, policy_version 81570 (0.0006) [2023-03-06 18:26:51,485][23882] Updated weights for policy 0, policy_version 81580 (0.0006) [2023-03-06 18:26:51,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13034.3). Total num frames: 83540992. Throughput: 0: 13052.6. Samples: 83526137. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:26:51,748][23556] Avg episode reward: [(0, '2068.917')] [2023-03-06 18:26:52,260][23882] Updated weights for policy 0, policy_version 81590 (0.0006) [2023-03-06 18:26:53,058][23882] Updated weights for policy 0, policy_version 81600 (0.0006) [2023-03-06 18:26:53,831][23882] Updated weights for policy 0, policy_version 81610 (0.0006) [2023-03-06 18:26:54,618][23882] Updated weights for policy 0, policy_version 81620 (0.0006) [2023-03-06 18:26:55,409][23882] Updated weights for policy 0, policy_version 81630 (0.0006) [2023-03-06 18:26:56,188][23882] Updated weights for policy 0, policy_version 81640 (0.0007) [2023-03-06 18:26:56,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 83605504. Throughput: 0: 13049.0. Samples: 83604502. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:26:56,748][23556] Avg episode reward: [(0, '2050.766')] [2023-03-06 18:26:56,989][23882] Updated weights for policy 0, policy_version 81650 (0.0006) [2023-03-06 18:26:57,769][23882] Updated weights for policy 0, policy_version 81660 (0.0007) [2023-03-06 18:26:58,559][23882] Updated weights for policy 0, policy_version 81670 (0.0006) [2023-03-06 18:26:59,362][23882] Updated weights for policy 0, policy_version 81680 (0.0006) [2023-03-06 18:27:00,126][23882] Updated weights for policy 0, policy_version 81690 (0.0008) [2023-03-06 18:27:00,904][23882] Updated weights for policy 0, policy_version 81700 (0.0006) [2023-03-06 18:27:01,674][23882] Updated weights for policy 0, policy_version 81710 (0.0007) [2023-03-06 18:27:01,748][23556] Fps is (10 sec: 13004.6, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 83671040. Throughput: 0: 13048.3. Samples: 83643414. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:27:01,748][23556] Avg episode reward: [(0, '1914.408')] [2023-03-06 18:27:02,470][23882] Updated weights for policy 0, policy_version 81720 (0.0006) [2023-03-06 18:27:03,258][23882] Updated weights for policy 0, policy_version 81730 (0.0006) [2023-03-06 18:27:04,022][23882] Updated weights for policy 0, policy_version 81740 (0.0006) [2023-03-06 18:27:04,799][23882] Updated weights for policy 0, policy_version 81750 (0.0007) [2023-03-06 18:27:05,606][23882] Updated weights for policy 0, policy_version 81760 (0.0006) [2023-03-06 18:27:06,393][23882] Updated weights for policy 0, policy_version 81770 (0.0008) [2023-03-06 18:27:06,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13034.3). Total num frames: 83736576. Throughput: 0: 13054.8. Samples: 83722054. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:27:06,748][23556] Avg episode reward: [(0, '2122.251')] [2023-03-06 18:27:07,155][23882] Updated weights for policy 0, policy_version 81780 (0.0006) [2023-03-06 18:27:07,938][23882] Updated weights for policy 0, policy_version 81790 (0.0006) [2023-03-06 18:27:08,697][23882] Updated weights for policy 0, policy_version 81800 (0.0006) [2023-03-06 18:27:09,475][23882] Updated weights for policy 0, policy_version 81810 (0.0006) [2023-03-06 18:27:10,266][23882] Updated weights for policy 0, policy_version 81820 (0.0006) [2023-03-06 18:27:11,037][23882] Updated weights for policy 0, policy_version 81830 (0.0006) [2023-03-06 18:27:11,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13030.8). Total num frames: 83802112. Throughput: 0: 13066.3. Samples: 83800976. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:27:11,748][23556] Avg episode reward: [(0, '1907.426')] [2023-03-06 18:27:11,821][23882] Updated weights for policy 0, policy_version 81840 (0.0006) [2023-03-06 18:27:12,625][23882] Updated weights for policy 0, policy_version 81850 (0.0006) [2023-03-06 18:27:13,413][23882] Updated weights for policy 0, policy_version 81860 (0.0007) [2023-03-06 18:27:14,214][23882] Updated weights for policy 0, policy_version 81870 (0.0007) [2023-03-06 18:27:14,990][23882] Updated weights for policy 0, policy_version 81880 (0.0006) [2023-03-06 18:27:15,758][23882] Updated weights for policy 0, policy_version 81890 (0.0006) [2023-03-06 18:27:16,535][23882] Updated weights for policy 0, policy_version 81900 (0.0007) [2023-03-06 18:27:16,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13034.3). Total num frames: 83867648. Throughput: 0: 13064.4. Samples: 83839863. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:27:16,748][23556] Avg episode reward: [(0, '2016.428')] [2023-03-06 18:27:17,338][23882] Updated weights for policy 0, policy_version 81910 (0.0007) [2023-03-06 18:27:18,107][23882] Updated weights for policy 0, policy_version 81920 (0.0007) [2023-03-06 18:27:18,896][23882] Updated weights for policy 0, policy_version 81930 (0.0007) [2023-03-06 18:27:19,698][23882] Updated weights for policy 0, policy_version 81940 (0.0006) [2023-03-06 18:27:20,473][23882] Updated weights for policy 0, policy_version 81950 (0.0006) [2023-03-06 18:27:21,245][23882] Updated weights for policy 0, policy_version 81960 (0.0006) [2023-03-06 18:27:21,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13034.3). Total num frames: 83933184. Throughput: 0: 13061.0. Samples: 83918316. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:27:21,748][23556] Avg episode reward: [(0, '1948.730')] [2023-03-06 18:27:22,026][23882] Updated weights for policy 0, policy_version 81970 (0.0006) [2023-03-06 18:27:22,836][23882] Updated weights for policy 0, policy_version 81980 (0.0007) [2023-03-06 18:27:23,611][23882] Updated weights for policy 0, policy_version 81990 (0.0006) [2023-03-06 18:27:24,407][23882] Updated weights for policy 0, policy_version 82000 (0.0007) [2023-03-06 18:27:25,180][23882] Updated weights for policy 0, policy_version 82010 (0.0006) [2023-03-06 18:27:25,974][23882] Updated weights for policy 0, policy_version 82020 (0.0006) [2023-03-06 18:27:26,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13030.8). Total num frames: 83997696. Throughput: 0: 13054.5. Samples: 83996549. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:27:26,748][23556] Avg episode reward: [(0, '2013.596')] [2023-03-06 18:27:26,763][23882] Updated weights for policy 0, policy_version 82030 (0.0006) [2023-03-06 18:27:27,544][23882] Updated weights for policy 0, policy_version 82040 (0.0006) [2023-03-06 18:27:28,320][23882] Updated weights for policy 0, policy_version 82050 (0.0007) [2023-03-06 18:27:29,094][23882] Updated weights for policy 0, policy_version 82060 (0.0006) [2023-03-06 18:27:29,886][23882] Updated weights for policy 0, policy_version 82070 (0.0006) [2023-03-06 18:27:30,657][23882] Updated weights for policy 0, policy_version 82080 (0.0006) [2023-03-06 18:27:31,442][23882] Updated weights for policy 0, policy_version 82090 (0.0007) [2023-03-06 18:27:31,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13030.8). Total num frames: 84063232. Throughput: 0: 13059.4. Samples: 84035813. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:27:31,748][23556] Avg episode reward: [(0, '2051.749')] [2023-03-06 18:27:32,244][23882] Updated weights for policy 0, policy_version 82100 (0.0006) [2023-03-06 18:27:33,041][23882] Updated weights for policy 0, policy_version 82110 (0.0007) [2023-03-06 18:27:33,829][23882] Updated weights for policy 0, policy_version 82120 (0.0006) [2023-03-06 18:27:34,610][23882] Updated weights for policy 0, policy_version 82130 (0.0006) [2023-03-06 18:27:35,408][23882] Updated weights for policy 0, policy_version 82140 (0.0005) [2023-03-06 18:27:36,177][23882] Updated weights for policy 0, policy_version 82150 (0.0007) [2023-03-06 18:27:36,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13030.8). Total num frames: 84128768. Throughput: 0: 13060.5. Samples: 84113860. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:27:36,748][23556] Avg episode reward: [(0, '2010.825')] [2023-03-06 18:27:36,937][23882] Updated weights for policy 0, policy_version 82160 (0.0007) [2023-03-06 18:27:37,748][23882] Updated weights for policy 0, policy_version 82170 (0.0007) [2023-03-06 18:27:38,511][23882] Updated weights for policy 0, policy_version 82180 (0.0006) [2023-03-06 18:27:39,305][23882] Updated weights for policy 0, policy_version 82190 (0.0006) [2023-03-06 18:27:40,084][23882] Updated weights for policy 0, policy_version 82200 (0.0007) [2023-03-06 18:27:40,872][23882] Updated weights for policy 0, policy_version 82210 (0.0006) [2023-03-06 18:27:41,670][23882] Updated weights for policy 0, policy_version 82220 (0.0007) [2023-03-06 18:27:41,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13034.3). Total num frames: 84194304. Throughput: 0: 13063.3. Samples: 84192350. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:27:41,748][23556] Avg episode reward: [(0, '1903.854')] [2023-03-06 18:27:42,430][23882] Updated weights for policy 0, policy_version 82230 (0.0007) [2023-03-06 18:27:43,221][23882] Updated weights for policy 0, policy_version 82240 (0.0006) [2023-03-06 18:27:44,020][23882] Updated weights for policy 0, policy_version 82250 (0.0006) [2023-03-06 18:27:44,804][23882] Updated weights for policy 0, policy_version 82260 (0.0006) [2023-03-06 18:27:45,567][23882] Updated weights for policy 0, policy_version 82270 (0.0007) [2023-03-06 18:27:46,340][23882] Updated weights for policy 0, policy_version 82280 (0.0007) [2023-03-06 18:27:46,748][23556] Fps is (10 sec: 13107.0, 60 sec: 13073.0, 300 sec: 13034.3). Total num frames: 84259840. Throughput: 0: 13067.5. Samples: 84231451. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:27:46,749][23556] Avg episode reward: [(0, '2025.295')] [2023-03-06 18:27:47,140][23882] Updated weights for policy 0, policy_version 82290 (0.0006) [2023-03-06 18:27:47,919][23882] Updated weights for policy 0, policy_version 82300 (0.0006) [2023-03-06 18:27:48,699][23882] Updated weights for policy 0, policy_version 82310 (0.0006) [2023-03-06 18:27:49,480][23882] Updated weights for policy 0, policy_version 82320 (0.0006) [2023-03-06 18:27:50,281][23882] Updated weights for policy 0, policy_version 82330 (0.0006) [2023-03-06 18:27:51,061][23882] Updated weights for policy 0, policy_version 82340 (0.0006) [2023-03-06 18:27:51,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13034.3). Total num frames: 84324352. Throughput: 0: 13067.2. Samples: 84310077. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:27:51,748][23556] Avg episode reward: [(0, '2010.050')] [2023-03-06 18:27:51,837][23882] Updated weights for policy 0, policy_version 82350 (0.0007) [2023-03-06 18:27:52,627][23882] Updated weights for policy 0, policy_version 82360 (0.0006) [2023-03-06 18:27:53,399][23882] Updated weights for policy 0, policy_version 82370 (0.0006) [2023-03-06 18:27:54,196][23882] Updated weights for policy 0, policy_version 82380 (0.0007) [2023-03-06 18:27:54,979][23882] Updated weights for policy 0, policy_version 82390 (0.0006) [2023-03-06 18:27:55,768][23882] Updated weights for policy 0, policy_version 82400 (0.0007) [2023-03-06 18:27:56,552][23882] Updated weights for policy 0, policy_version 82410 (0.0006) [2023-03-06 18:27:56,748][23556] Fps is (10 sec: 13005.1, 60 sec: 13073.1, 300 sec: 13034.3). Total num frames: 84389888. Throughput: 0: 13053.3. Samples: 84388373. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:27:56,748][23556] Avg episode reward: [(0, '2083.743')] [2023-03-06 18:27:56,752][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000082412_84389888.pth... [2023-03-06 18:27:56,784][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000079355_81259520.pth [2023-03-06 18:27:57,335][23882] Updated weights for policy 0, policy_version 82420 (0.0007) [2023-03-06 18:27:58,110][23882] Updated weights for policy 0, policy_version 82430 (0.0006) [2023-03-06 18:27:58,895][23882] Updated weights for policy 0, policy_version 82440 (0.0007) [2023-03-06 18:27:59,671][23882] Updated weights for policy 0, policy_version 82450 (0.0006) [2023-03-06 18:28:00,448][23882] Updated weights for policy 0, policy_version 82460 (0.0007) [2023-03-06 18:28:01,224][23882] Updated weights for policy 0, policy_version 82470 (0.0006) [2023-03-06 18:28:01,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13073.1, 300 sec: 13037.8). Total num frames: 84455424. Throughput: 0: 13064.0. Samples: 84427742. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:28:01,748][23556] Avg episode reward: [(0, '1991.457')] [2023-03-06 18:28:02,008][23882] Updated weights for policy 0, policy_version 82480 (0.0006) [2023-03-06 18:28:02,803][23882] Updated weights for policy 0, policy_version 82490 (0.0006) [2023-03-06 18:28:03,584][23882] Updated weights for policy 0, policy_version 82500 (0.0006) [2023-03-06 18:28:04,364][23882] Updated weights for policy 0, policy_version 82510 (0.0006) [2023-03-06 18:28:05,154][23882] Updated weights for policy 0, policy_version 82520 (0.0007) [2023-03-06 18:28:05,932][23882] Updated weights for policy 0, policy_version 82530 (0.0007) [2023-03-06 18:28:06,736][23882] Updated weights for policy 0, policy_version 82540 (0.0006) [2023-03-06 18:28:06,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13041.2). Total num frames: 84520960. Throughput: 0: 13064.0. Samples: 84506196. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:28:06,748][23556] Avg episode reward: [(0, '1945.840')] [2023-03-06 18:28:07,529][23882] Updated weights for policy 0, policy_version 82550 (0.0005) [2023-03-06 18:28:08,316][23882] Updated weights for policy 0, policy_version 82560 (0.0008) [2023-03-06 18:28:09,103][23882] Updated weights for policy 0, policy_version 82570 (0.0006) [2023-03-06 18:28:09,873][23882] Updated weights for policy 0, policy_version 82580 (0.0007) [2023-03-06 18:28:10,662][23882] Updated weights for policy 0, policy_version 82590 (0.0006) [2023-03-06 18:28:11,458][23882] Updated weights for policy 0, policy_version 82600 (0.0007) [2023-03-06 18:28:11,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 84585472. Throughput: 0: 13059.2. Samples: 84584214. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:28:11,748][23556] Avg episode reward: [(0, '2010.874')] [2023-03-06 18:28:12,231][23882] Updated weights for policy 0, policy_version 82610 (0.0007) [2023-03-06 18:28:13,006][23882] Updated weights for policy 0, policy_version 82620 (0.0007) [2023-03-06 18:28:13,797][23882] Updated weights for policy 0, policy_version 82630 (0.0006) [2023-03-06 18:28:14,594][23882] Updated weights for policy 0, policy_version 82640 (0.0006) [2023-03-06 18:28:15,379][23882] Updated weights for policy 0, policy_version 82650 (0.0006) [2023-03-06 18:28:16,177][23882] Updated weights for policy 0, policy_version 82660 (0.0007) [2023-03-06 18:28:16,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13041.2). Total num frames: 84651008. Throughput: 0: 13056.1. Samples: 84623334. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:28:16,748][23556] Avg episode reward: [(0, '2150.981')] [2023-03-06 18:28:16,962][23882] Updated weights for policy 0, policy_version 82670 (0.0007) [2023-03-06 18:28:17,731][23882] Updated weights for policy 0, policy_version 82680 (0.0006) [2023-03-06 18:28:18,521][23882] Updated weights for policy 0, policy_version 82690 (0.0006) [2023-03-06 18:28:19,320][23882] Updated weights for policy 0, policy_version 82700 (0.0006) [2023-03-06 18:28:20,106][23882] Updated weights for policy 0, policy_version 82710 (0.0006) [2023-03-06 18:28:20,869][23882] Updated weights for policy 0, policy_version 82720 (0.0006) [2023-03-06 18:28:21,661][23882] Updated weights for policy 0, policy_version 82730 (0.0007) [2023-03-06 18:28:21,748][23556] Fps is (10 sec: 13004.6, 60 sec: 13038.9, 300 sec: 13041.2). Total num frames: 84715520. Throughput: 0: 13058.5. Samples: 84701492. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:28:21,748][23556] Avg episode reward: [(0, '1058.573')] [2023-03-06 18:28:22,460][23882] Updated weights for policy 0, policy_version 82740 (0.0006) [2023-03-06 18:28:23,213][23882] Updated weights for policy 0, policy_version 82750 (0.0006) [2023-03-06 18:28:24,007][23882] Updated weights for policy 0, policy_version 82760 (0.0007) [2023-03-06 18:28:24,810][23882] Updated weights for policy 0, policy_version 82770 (0.0006) [2023-03-06 18:28:25,587][23882] Updated weights for policy 0, policy_version 82780 (0.0006) [2023-03-06 18:28:26,385][23882] Updated weights for policy 0, policy_version 82790 (0.0007) [2023-03-06 18:28:26,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13041.2). Total num frames: 84781056. Throughput: 0: 13050.2. Samples: 84779609. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:28:26,748][23556] Avg episode reward: [(0, '248.756')] [2023-03-06 18:28:27,181][23882] Updated weights for policy 0, policy_version 82800 (0.0006) [2023-03-06 18:28:27,951][23882] Updated weights for policy 0, policy_version 82810 (0.0007) [2023-03-06 18:28:28,737][23882] Updated weights for policy 0, policy_version 82820 (0.0006) [2023-03-06 18:28:29,527][23882] Updated weights for policy 0, policy_version 82830 (0.0006) [2023-03-06 18:28:30,297][23882] Updated weights for policy 0, policy_version 82840 (0.0007) [2023-03-06 18:28:31,085][23882] Updated weights for policy 0, policy_version 82850 (0.0006) [2023-03-06 18:28:31,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13044.7). Total num frames: 84846592. Throughput: 0: 13052.5. Samples: 84818812. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:28:31,748][23556] Avg episode reward: [(0, '222.036')] [2023-03-06 18:28:31,884][23882] Updated weights for policy 0, policy_version 82860 (0.0006) [2023-03-06 18:28:32,660][23882] Updated weights for policy 0, policy_version 82870 (0.0006) [2023-03-06 18:28:33,442][23882] Updated weights for policy 0, policy_version 82880 (0.0008) [2023-03-06 18:28:34,229][23882] Updated weights for policy 0, policy_version 82890 (0.0007) [2023-03-06 18:28:35,014][23882] Updated weights for policy 0, policy_version 82900 (0.0006) [2023-03-06 18:28:35,790][23882] Updated weights for policy 0, policy_version 82910 (0.0006) [2023-03-06 18:28:36,590][23882] Updated weights for policy 0, policy_version 82920 (0.0006) [2023-03-06 18:28:36,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13041.2). Total num frames: 84911104. Throughput: 0: 13046.8. Samples: 84897185. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:28:36,748][23556] Avg episode reward: [(0, '909.281')] [2023-03-06 18:28:37,370][23882] Updated weights for policy 0, policy_version 82930 (0.0006) [2023-03-06 18:28:38,154][23882] Updated weights for policy 0, policy_version 82940 (0.0006) [2023-03-06 18:28:38,944][23882] Updated weights for policy 0, policy_version 82950 (0.0006) [2023-03-06 18:28:39,742][23882] Updated weights for policy 0, policy_version 82960 (0.0006) [2023-03-06 18:28:40,520][23882] Updated weights for policy 0, policy_version 82970 (0.0006) [2023-03-06 18:28:41,302][23882] Updated weights for policy 0, policy_version 82980 (0.0006) [2023-03-06 18:28:41,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13044.7). Total num frames: 84976640. Throughput: 0: 13040.1. Samples: 84975177. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:28:41,748][23556] Avg episode reward: [(0, '1495.906')] [2023-03-06 18:28:42,099][23882] Updated weights for policy 0, policy_version 82990 (0.0006) [2023-03-06 18:28:42,886][23882] Updated weights for policy 0, policy_version 83000 (0.0006) [2023-03-06 18:28:43,676][23882] Updated weights for policy 0, policy_version 83010 (0.0005) [2023-03-06 18:28:44,458][23882] Updated weights for policy 0, policy_version 83020 (0.0006) [2023-03-06 18:28:45,233][23882] Updated weights for policy 0, policy_version 83030 (0.0006) [2023-03-06 18:28:46,031][23882] Updated weights for policy 0, policy_version 83040 (0.0006) [2023-03-06 18:28:46,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13039.0, 300 sec: 13044.7). Total num frames: 85042176. Throughput: 0: 13032.6. Samples: 85014210. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:28:46,748][23556] Avg episode reward: [(0, '1757.431')] [2023-03-06 18:28:46,820][23882] Updated weights for policy 0, policy_version 83050 (0.0006) [2023-03-06 18:28:47,611][23882] Updated weights for policy 0, policy_version 83060 (0.0006) [2023-03-06 18:28:48,385][23882] Updated weights for policy 0, policy_version 83070 (0.0006) [2023-03-06 18:28:49,190][23882] Updated weights for policy 0, policy_version 83080 (0.0006) [2023-03-06 18:28:49,995][23882] Updated weights for policy 0, policy_version 83090 (0.0006) [2023-03-06 18:28:50,770][23882] Updated weights for policy 0, policy_version 83100 (0.0006) [2023-03-06 18:28:51,546][23882] Updated weights for policy 0, policy_version 83110 (0.0006) [2023-03-06 18:28:51,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13044.7). Total num frames: 85106688. Throughput: 0: 13021.1. Samples: 85092144. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:28:51,748][23556] Avg episode reward: [(0, '1749.539')] [2023-03-06 18:28:52,341][23882] Updated weights for policy 0, policy_version 83120 (0.0006) [2023-03-06 18:28:53,120][23882] Updated weights for policy 0, policy_version 83130 (0.0006) [2023-03-06 18:28:53,917][23882] Updated weights for policy 0, policy_version 83140 (0.0007) [2023-03-06 18:28:54,710][23882] Updated weights for policy 0, policy_version 83150 (0.0005) [2023-03-06 18:28:55,493][23882] Updated weights for policy 0, policy_version 83160 (0.0006) [2023-03-06 18:28:56,267][23882] Updated weights for policy 0, policy_version 83170 (0.0006) [2023-03-06 18:28:56,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13044.7). Total num frames: 85172224. Throughput: 0: 13025.0. Samples: 85170340. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:28:56,748][23556] Avg episode reward: [(0, '1819.837')] [2023-03-06 18:28:57,045][23882] Updated weights for policy 0, policy_version 83180 (0.0006) [2023-03-06 18:28:57,837][23882] Updated weights for policy 0, policy_version 83190 (0.0006) [2023-03-06 18:28:58,612][23882] Updated weights for policy 0, policy_version 83200 (0.0006) [2023-03-06 18:28:59,396][23882] Updated weights for policy 0, policy_version 83210 (0.0006) [2023-03-06 18:29:00,173][23882] Updated weights for policy 0, policy_version 83220 (0.0006) [2023-03-06 18:29:00,956][23882] Updated weights for policy 0, policy_version 83230 (0.0006) [2023-03-06 18:29:01,735][23882] Updated weights for policy 0, policy_version 83240 (0.0007) [2023-03-06 18:29:01,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13044.7). Total num frames: 85237760. Throughput: 0: 13025.5. Samples: 85209484. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:29:01,748][23556] Avg episode reward: [(0, '1843.851')] [2023-03-06 18:29:02,518][23882] Updated weights for policy 0, policy_version 83250 (0.0006) [2023-03-06 18:29:03,293][23882] Updated weights for policy 0, policy_version 83260 (0.0007) [2023-03-06 18:29:04,081][23882] Updated weights for policy 0, policy_version 83270 (0.0006) [2023-03-06 18:29:04,859][23882] Updated weights for policy 0, policy_version 83280 (0.0006) [2023-03-06 18:29:05,661][23882] Updated weights for policy 0, policy_version 83290 (0.0007) [2023-03-06 18:29:06,452][23882] Updated weights for policy 0, policy_version 83300 (0.0006) [2023-03-06 18:29:06,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13044.7). Total num frames: 85302272. Throughput: 0: 13036.5. Samples: 85288134. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:29:06,748][23556] Avg episode reward: [(0, '1909.549')] [2023-03-06 18:29:07,214][23882] Updated weights for policy 0, policy_version 83310 (0.0006) [2023-03-06 18:29:08,014][23882] Updated weights for policy 0, policy_version 83320 (0.0007) [2023-03-06 18:29:08,781][23882] Updated weights for policy 0, policy_version 83330 (0.0007) [2023-03-06 18:29:09,558][23882] Updated weights for policy 0, policy_version 83340 (0.0006) [2023-03-06 18:29:10,344][23882] Updated weights for policy 0, policy_version 83350 (0.0007) [2023-03-06 18:29:11,114][23882] Updated weights for policy 0, policy_version 83360 (0.0007) [2023-03-06 18:29:11,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 85368832. Throughput: 0: 13049.9. Samples: 85366856. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:29:11,748][23556] Avg episode reward: [(0, '1717.142')] [2023-03-06 18:29:11,902][23882] Updated weights for policy 0, policy_version 83370 (0.0006) [2023-03-06 18:29:12,680][23882] Updated weights for policy 0, policy_version 83380 (0.0006) [2023-03-06 18:29:13,475][23882] Updated weights for policy 0, policy_version 83390 (0.0007) [2023-03-06 18:29:14,254][23882] Updated weights for policy 0, policy_version 83400 (0.0006) [2023-03-06 18:29:15,049][23882] Updated weights for policy 0, policy_version 83410 (0.0006) [2023-03-06 18:29:15,819][23882] Updated weights for policy 0, policy_version 83420 (0.0007) [2023-03-06 18:29:16,616][23882] Updated weights for policy 0, policy_version 83430 (0.0006) [2023-03-06 18:29:16,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13044.7). Total num frames: 85433344. Throughput: 0: 13044.7. Samples: 85405825. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:29:16,748][23556] Avg episode reward: [(0, '1735.394')] [2023-03-06 18:29:17,383][23882] Updated weights for policy 0, policy_version 83440 (0.0006) [2023-03-06 18:29:18,150][23882] Updated weights for policy 0, policy_version 83450 (0.0006) [2023-03-06 18:29:18,950][23882] Updated weights for policy 0, policy_version 83460 (0.0005) [2023-03-06 18:29:19,722][23882] Updated weights for policy 0, policy_version 83470 (0.0006) [2023-03-06 18:29:20,523][23882] Updated weights for policy 0, policy_version 83480 (0.0006) [2023-03-06 18:29:21,300][23882] Updated weights for policy 0, policy_version 83490 (0.0006) [2023-03-06 18:29:21,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13044.7). Total num frames: 85498880. Throughput: 0: 13051.1. Samples: 85484484. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 18:29:21,748][23556] Avg episode reward: [(0, '1745.590')] [2023-03-06 18:29:22,090][23882] Updated weights for policy 0, policy_version 83500 (0.0007) [2023-03-06 18:29:22,880][23882] Updated weights for policy 0, policy_version 83510 (0.0006) [2023-03-06 18:29:23,676][23882] Updated weights for policy 0, policy_version 83520 (0.0007) [2023-03-06 18:29:24,052][23831] KL-divergence is very high: 8371.8867 [2023-03-06 18:29:24,146][23831] KL-divergence is very high: 2945.0999 [2023-03-06 18:29:24,472][23882] Updated weights for policy 0, policy_version 83530 (0.0006) [2023-03-06 18:29:25,267][23882] Updated weights for policy 0, policy_version 83540 (0.0007) [2023-03-06 18:29:26,061][23882] Updated weights for policy 0, policy_version 83550 (0.0006) [2023-03-06 18:29:26,504][23831] KL-divergence is very high: 4523702.5000 [2023-03-06 18:29:26,573][23831] KL-divergence is very high: 767.4705 [2023-03-06 18:29:26,653][23831] KL-divergence is very high: 144.7617 [2023-03-06 18:29:26,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 85564416. Throughput: 0: 13048.8. Samples: 85562371. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 18:29:26,748][23556] Avg episode reward: [(0, '1463.615')] [2023-03-06 18:29:26,819][23882] Updated weights for policy 0, policy_version 83560 (0.0006) [2023-03-06 18:29:27,626][23882] Updated weights for policy 0, policy_version 83570 (0.0007) [2023-03-06 18:29:28,400][23882] Updated weights for policy 0, policy_version 83580 (0.0006) [2023-03-06 18:29:29,198][23882] Updated weights for policy 0, policy_version 83590 (0.0006) [2023-03-06 18:29:29,981][23882] Updated weights for policy 0, policy_version 83600 (0.0007) [2023-03-06 18:29:30,772][23882] Updated weights for policy 0, policy_version 83610 (0.0007) [2023-03-06 18:29:31,557][23882] Updated weights for policy 0, policy_version 83620 (0.0006) [2023-03-06 18:29:31,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13044.7). Total num frames: 85628928. Throughput: 0: 13049.4. Samples: 85601433. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 18:29:31,748][23556] Avg episode reward: [(0, '1262.099')] [2023-03-06 18:29:32,338][23882] Updated weights for policy 0, policy_version 83630 (0.0006) [2023-03-06 18:29:33,125][23882] Updated weights for policy 0, policy_version 83640 (0.0006) [2023-03-06 18:29:33,901][23882] Updated weights for policy 0, policy_version 83650 (0.0006) [2023-03-06 18:29:34,687][23882] Updated weights for policy 0, policy_version 83660 (0.0007) [2023-03-06 18:29:35,470][23882] Updated weights for policy 0, policy_version 83670 (0.0007) [2023-03-06 18:29:36,251][23882] Updated weights for policy 0, policy_version 83680 (0.0006) [2023-03-06 18:29:36,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13044.7). Total num frames: 85694464. Throughput: 0: 13053.1. Samples: 85679533. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 18:29:36,748][23556] Avg episode reward: [(0, '1718.697')] [2023-03-06 18:29:37,030][23882] Updated weights for policy 0, policy_version 83690 (0.0006) [2023-03-06 18:29:37,813][23882] Updated weights for policy 0, policy_version 83700 (0.0006) [2023-03-06 18:29:38,586][23882] Updated weights for policy 0, policy_version 83710 (0.0007) [2023-03-06 18:29:39,370][23882] Updated weights for policy 0, policy_version 83720 (0.0007) [2023-03-06 18:29:40,155][23882] Updated weights for policy 0, policy_version 83730 (0.0008) [2023-03-06 18:29:40,948][23882] Updated weights for policy 0, policy_version 83740 (0.0007) [2023-03-06 18:29:41,731][23882] Updated weights for policy 0, policy_version 83750 (0.0006) [2023-03-06 18:29:41,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 85760000. Throughput: 0: 13064.0. Samples: 85758223. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 18:29:41,748][23556] Avg episode reward: [(0, '1712.500')] [2023-03-06 18:29:42,513][23882] Updated weights for policy 0, policy_version 83760 (0.0006) [2023-03-06 18:29:43,301][23882] Updated weights for policy 0, policy_version 83770 (0.0007) [2023-03-06 18:29:44,083][23882] Updated weights for policy 0, policy_version 83780 (0.0006) [2023-03-06 18:29:44,853][23882] Updated weights for policy 0, policy_version 83790 (0.0006) [2023-03-06 18:29:45,650][23882] Updated weights for policy 0, policy_version 83800 (0.0006) [2023-03-06 18:29:46,411][23882] Updated weights for policy 0, policy_version 83810 (0.0006) [2023-03-06 18:29:46,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 85825536. Throughput: 0: 13064.9. Samples: 85797405. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 18:29:46,748][23556] Avg episode reward: [(0, '1592.187')] [2023-03-06 18:29:47,199][23882] Updated weights for policy 0, policy_version 83820 (0.0007) [2023-03-06 18:29:47,992][23882] Updated weights for policy 0, policy_version 83830 (0.0005) [2023-03-06 18:29:48,763][23882] Updated weights for policy 0, policy_version 83840 (0.0006) [2023-03-06 18:29:49,563][23882] Updated weights for policy 0, policy_version 83850 (0.0007) [2023-03-06 18:29:50,338][23882] Updated weights for policy 0, policy_version 83860 (0.0006) [2023-03-06 18:29:51,121][23882] Updated weights for policy 0, policy_version 83870 (0.0007) [2023-03-06 18:29:51,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 85890048. Throughput: 0: 13063.1. Samples: 85875972. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 18:29:51,748][23556] Avg episode reward: [(0, '1750.231')] [2023-03-06 18:29:51,922][23882] Updated weights for policy 0, policy_version 83880 (0.0006) [2023-03-06 18:29:52,692][23882] Updated weights for policy 0, policy_version 83890 (0.0006) [2023-03-06 18:29:53,487][23882] Updated weights for policy 0, policy_version 83900 (0.0006) [2023-03-06 18:29:54,265][23882] Updated weights for policy 0, policy_version 83910 (0.0007) [2023-03-06 18:29:55,046][23882] Updated weights for policy 0, policy_version 83920 (0.0006) [2023-03-06 18:29:55,847][23882] Updated weights for policy 0, policy_version 83930 (0.0006) [2023-03-06 18:29:56,626][23882] Updated weights for policy 0, policy_version 83940 (0.0006) [2023-03-06 18:29:56,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 85955584. Throughput: 0: 13048.5. Samples: 85954042. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 18:29:56,748][23556] Avg episode reward: [(0, '1693.919')] [2023-03-06 18:29:56,753][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000083941_85955584.pth... [2023-03-06 18:29:56,782][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000080883_82824192.pth [2023-03-06 18:29:57,418][23882] Updated weights for policy 0, policy_version 83950 (0.0006) [2023-03-06 18:29:58,196][23882] Updated weights for policy 0, policy_version 83960 (0.0007) [2023-03-06 18:29:58,971][23882] Updated weights for policy 0, policy_version 83970 (0.0006) [2023-03-06 18:29:59,778][23882] Updated weights for policy 0, policy_version 83980 (0.0006) [2023-03-06 18:30:00,557][23882] Updated weights for policy 0, policy_version 83990 (0.0006) [2023-03-06 18:30:01,344][23882] Updated weights for policy 0, policy_version 84000 (0.0006) [2023-03-06 18:30:01,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 86021120. Throughput: 0: 13053.6. Samples: 85993234. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 18:30:01,748][23556] Avg episode reward: [(0, '1903.890')] [2023-03-06 18:30:02,129][23882] Updated weights for policy 0, policy_version 84010 (0.0006) [2023-03-06 18:30:02,913][23882] Updated weights for policy 0, policy_version 84020 (0.0006) [2023-03-06 18:30:03,690][23882] Updated weights for policy 0, policy_version 84030 (0.0006) [2023-03-06 18:30:04,482][23882] Updated weights for policy 0, policy_version 84040 (0.0006) [2023-03-06 18:30:05,262][23882] Updated weights for policy 0, policy_version 84050 (0.0006) [2023-03-06 18:30:06,062][23882] Updated weights for policy 0, policy_version 84060 (0.0006) [2023-03-06 18:30:06,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 86085632. Throughput: 0: 13039.6. Samples: 86071264. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 18:30:06,748][23556] Avg episode reward: [(0, '1890.169')] [2023-03-06 18:30:06,844][23882] Updated weights for policy 0, policy_version 84070 (0.0007) [2023-03-06 18:30:07,624][23882] Updated weights for policy 0, policy_version 84080 (0.0006) [2023-03-06 18:30:08,425][23882] Updated weights for policy 0, policy_version 84090 (0.0006) [2023-03-06 18:30:09,209][23882] Updated weights for policy 0, policy_version 84100 (0.0006) [2023-03-06 18:30:10,002][23882] Updated weights for policy 0, policy_version 84110 (0.0006) [2023-03-06 18:30:10,804][23882] Updated weights for policy 0, policy_version 84120 (0.0007) [2023-03-06 18:30:11,582][23882] Updated weights for policy 0, policy_version 84130 (0.0006) [2023-03-06 18:30:11,748][23556] Fps is (10 sec: 13004.6, 60 sec: 13038.9, 300 sec: 13048.2). Total num frames: 86151168. Throughput: 0: 13042.2. Samples: 86149273. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:30:11,749][23556] Avg episode reward: [(0, '1853.197')] [2023-03-06 18:30:12,378][23882] Updated weights for policy 0, policy_version 84140 (0.0006) [2023-03-06 18:30:13,158][23882] Updated weights for policy 0, policy_version 84150 (0.0007) [2023-03-06 18:30:13,977][23882] Updated weights for policy 0, policy_version 84160 (0.0007) [2023-03-06 18:30:14,729][23882] Updated weights for policy 0, policy_version 84170 (0.0007) [2023-03-06 18:30:15,508][23882] Updated weights for policy 0, policy_version 84180 (0.0006) [2023-03-06 18:30:16,307][23882] Updated weights for policy 0, policy_version 84190 (0.0006) [2023-03-06 18:30:16,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13048.2). Total num frames: 86215680. Throughput: 0: 13040.7. Samples: 86188264. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:30:16,748][23556] Avg episode reward: [(0, '1974.893')] [2023-03-06 18:30:17,095][23882] Updated weights for policy 0, policy_version 84200 (0.0006) [2023-03-06 18:30:17,875][23882] Updated weights for policy 0, policy_version 84210 (0.0006) [2023-03-06 18:30:18,656][23882] Updated weights for policy 0, policy_version 84220 (0.0007) [2023-03-06 18:30:19,449][23882] Updated weights for policy 0, policy_version 84230 (0.0007) [2023-03-06 18:30:20,249][23882] Updated weights for policy 0, policy_version 84240 (0.0006) [2023-03-06 18:30:21,030][23882] Updated weights for policy 0, policy_version 84250 (0.0006) [2023-03-06 18:30:21,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13048.2). Total num frames: 86281216. Throughput: 0: 13039.9. Samples: 86266332. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:30:21,748][23556] Avg episode reward: [(0, '1920.324')] [2023-03-06 18:30:21,815][23882] Updated weights for policy 0, policy_version 84260 (0.0007) [2023-03-06 18:30:22,591][23882] Updated weights for policy 0, policy_version 84270 (0.0006) [2023-03-06 18:30:23,388][23882] Updated weights for policy 0, policy_version 84280 (0.0006) [2023-03-06 18:30:24,167][23882] Updated weights for policy 0, policy_version 84290 (0.0007) [2023-03-06 18:30:24,942][23882] Updated weights for policy 0, policy_version 84300 (0.0007) [2023-03-06 18:30:25,744][23882] Updated weights for policy 0, policy_version 84310 (0.0006) [2023-03-06 18:30:26,531][23882] Updated weights for policy 0, policy_version 84320 (0.0007) [2023-03-06 18:30:26,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13048.2). Total num frames: 86346752. Throughput: 0: 13030.8. Samples: 86344608. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:30:26,748][23556] Avg episode reward: [(0, '2032.258')] [2023-03-06 18:30:27,303][23882] Updated weights for policy 0, policy_version 84330 (0.0006) [2023-03-06 18:30:28,076][23882] Updated weights for policy 0, policy_version 84340 (0.0007) [2023-03-06 18:30:28,862][23882] Updated weights for policy 0, policy_version 84350 (0.0006) [2023-03-06 18:30:29,638][23882] Updated weights for policy 0, policy_version 84360 (0.0006) [2023-03-06 18:30:30,425][23882] Updated weights for policy 0, policy_version 84370 (0.0007) [2023-03-06 18:30:31,203][23882] Updated weights for policy 0, policy_version 84380 (0.0007) [2023-03-06 18:30:31,748][23556] Fps is (10 sec: 13107.4, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 86412288. Throughput: 0: 13035.6. Samples: 86384007. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:30:31,748][23556] Avg episode reward: [(0, '2035.352')] [2023-03-06 18:30:31,997][23882] Updated weights for policy 0, policy_version 84390 (0.0006) [2023-03-06 18:30:32,762][23882] Updated weights for policy 0, policy_version 84400 (0.0006) [2023-03-06 18:30:33,557][23882] Updated weights for policy 0, policy_version 84410 (0.0007) [2023-03-06 18:30:34,340][23882] Updated weights for policy 0, policy_version 84420 (0.0006) [2023-03-06 18:30:35,113][23882] Updated weights for policy 0, policy_version 84430 (0.0006) [2023-03-06 18:30:35,899][23882] Updated weights for policy 0, policy_version 84440 (0.0008) [2023-03-06 18:30:36,682][23882] Updated weights for policy 0, policy_version 84450 (0.0005) [2023-03-06 18:30:36,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13048.2). Total num frames: 86476800. Throughput: 0: 13030.5. Samples: 86462343. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:30:36,748][23556] Avg episode reward: [(0, '2084.958')] [2023-03-06 18:30:37,468][23882] Updated weights for policy 0, policy_version 84460 (0.0007) [2023-03-06 18:30:38,276][23882] Updated weights for policy 0, policy_version 84470 (0.0007) [2023-03-06 18:30:39,067][23882] Updated weights for policy 0, policy_version 84480 (0.0006) [2023-03-06 18:30:39,848][23882] Updated weights for policy 0, policy_version 84490 (0.0006) [2023-03-06 18:30:40,633][23882] Updated weights for policy 0, policy_version 84500 (0.0006) [2023-03-06 18:30:41,421][23882] Updated weights for policy 0, policy_version 84510 (0.0006) [2023-03-06 18:30:41,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 86542336. Throughput: 0: 13032.2. Samples: 86540489. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:30:41,749][23556] Avg episode reward: [(0, '1986.287')] [2023-03-06 18:30:42,206][23882] Updated weights for policy 0, policy_version 84520 (0.0006) [2023-03-06 18:30:42,978][23882] Updated weights for policy 0, policy_version 84530 (0.0007) [2023-03-06 18:30:43,770][23882] Updated weights for policy 0, policy_version 84540 (0.0006) [2023-03-06 18:30:44,565][23882] Updated weights for policy 0, policy_version 84550 (0.0008) [2023-03-06 18:30:45,357][23882] Updated weights for policy 0, policy_version 84560 (0.0007) [2023-03-06 18:30:46,145][23882] Updated weights for policy 0, policy_version 84570 (0.0006) [2023-03-06 18:30:46,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13048.2). Total num frames: 86606848. Throughput: 0: 13027.5. Samples: 86579471. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:30:46,748][23556] Avg episode reward: [(0, '1981.899')] [2023-03-06 18:30:46,938][23882] Updated weights for policy 0, policy_version 84580 (0.0006) [2023-03-06 18:30:47,734][23882] Updated weights for policy 0, policy_version 84590 (0.0006) [2023-03-06 18:30:48,515][23882] Updated weights for policy 0, policy_version 84600 (0.0007) [2023-03-06 18:30:49,294][23882] Updated weights for policy 0, policy_version 84610 (0.0006) [2023-03-06 18:30:50,076][23882] Updated weights for policy 0, policy_version 84620 (0.0008) [2023-03-06 18:30:50,850][23882] Updated weights for policy 0, policy_version 84630 (0.0006) [2023-03-06 18:30:51,643][23882] Updated weights for policy 0, policy_version 84640 (0.0008) [2023-03-06 18:30:51,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13048.2). Total num frames: 86672384. Throughput: 0: 13033.5. Samples: 86657773. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:30:51,748][23556] Avg episode reward: [(0, '2039.798')] [2023-03-06 18:30:52,440][23882] Updated weights for policy 0, policy_version 84650 (0.0006) [2023-03-06 18:30:53,236][23882] Updated weights for policy 0, policy_version 84660 (0.0006) [2023-03-06 18:30:54,025][23882] Updated weights for policy 0, policy_version 84670 (0.0007) [2023-03-06 18:30:54,801][23882] Updated weights for policy 0, policy_version 84680 (0.0006) [2023-03-06 18:30:55,600][23882] Updated weights for policy 0, policy_version 84690 (0.0006) [2023-03-06 18:30:56,382][23882] Updated weights for policy 0, policy_version 84700 (0.0006) [2023-03-06 18:30:56,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13044.7). Total num frames: 86736896. Throughput: 0: 13028.2. Samples: 86735541. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:30:56,748][23556] Avg episode reward: [(0, '2122.809')] [2023-03-06 18:30:57,166][23882] Updated weights for policy 0, policy_version 84710 (0.0006) [2023-03-06 18:30:57,944][23882] Updated weights for policy 0, policy_version 84720 (0.0006) [2023-03-06 18:30:58,728][23882] Updated weights for policy 0, policy_version 84730 (0.0006) [2023-03-06 18:30:59,509][23882] Updated weights for policy 0, policy_version 84740 (0.0008) [2023-03-06 18:31:00,278][23882] Updated weights for policy 0, policy_version 84750 (0.0006) [2023-03-06 18:31:01,058][23882] Updated weights for policy 0, policy_version 84760 (0.0008) [2023-03-06 18:31:01,748][23556] Fps is (10 sec: 13004.6, 60 sec: 13021.8, 300 sec: 13048.2). Total num frames: 86802432. Throughput: 0: 13036.2. Samples: 86774896. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:31:01,749][23556] Avg episode reward: [(0, '1992.710')] [2023-03-06 18:31:01,856][23882] Updated weights for policy 0, policy_version 84770 (0.0007) [2023-03-06 18:31:02,640][23882] Updated weights for policy 0, policy_version 84780 (0.0006) [2023-03-06 18:31:03,434][23882] Updated weights for policy 0, policy_version 84790 (0.0006) [2023-03-06 18:31:04,214][23882] Updated weights for policy 0, policy_version 84800 (0.0006) [2023-03-06 18:31:05,015][23882] Updated weights for policy 0, policy_version 84810 (0.0006) [2023-03-06 18:31:05,800][23882] Updated weights for policy 0, policy_version 84820 (0.0006) [2023-03-06 18:31:06,575][23882] Updated weights for policy 0, policy_version 84830 (0.0006) [2023-03-06 18:31:06,748][23556] Fps is (10 sec: 13107.0, 60 sec: 13038.9, 300 sec: 13048.2). Total num frames: 86867968. Throughput: 0: 13040.8. Samples: 86853168. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:31:06,748][23556] Avg episode reward: [(0, '1996.210')] [2023-03-06 18:31:07,345][23882] Updated weights for policy 0, policy_version 84840 (0.0007) [2023-03-06 18:31:08,142][23882] Updated weights for policy 0, policy_version 84850 (0.0007) [2023-03-06 18:31:08,931][23882] Updated weights for policy 0, policy_version 84860 (0.0006) [2023-03-06 18:31:09,713][23882] Updated weights for policy 0, policy_version 84870 (0.0006) [2023-03-06 18:31:10,520][23882] Updated weights for policy 0, policy_version 84880 (0.0006) [2023-03-06 18:31:11,295][23882] Updated weights for policy 0, policy_version 84890 (0.0006) [2023-03-06 18:31:11,748][23556] Fps is (10 sec: 13005.0, 60 sec: 13021.9, 300 sec: 13044.7). Total num frames: 86932480. Throughput: 0: 13036.2. Samples: 86931238. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:31:11,748][23556] Avg episode reward: [(0, '2039.624')] [2023-03-06 18:31:12,079][23882] Updated weights for policy 0, policy_version 84900 (0.0006) [2023-03-06 18:31:12,865][23882] Updated weights for policy 0, policy_version 84910 (0.0006) [2023-03-06 18:31:13,656][23882] Updated weights for policy 0, policy_version 84920 (0.0007) [2023-03-06 18:31:14,439][23882] Updated weights for policy 0, policy_version 84930 (0.0006) [2023-03-06 18:31:15,221][23882] Updated weights for policy 0, policy_version 84940 (0.0006) [2023-03-06 18:31:16,022][23882] Updated weights for policy 0, policy_version 84950 (0.0006) [2023-03-06 18:31:16,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13044.7). Total num frames: 86998016. Throughput: 0: 13029.9. Samples: 86970354. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:31:16,748][23556] Avg episode reward: [(0, '2112.205')] [2023-03-06 18:31:16,813][23882] Updated weights for policy 0, policy_version 84960 (0.0006) [2023-03-06 18:31:17,608][23882] Updated weights for policy 0, policy_version 84970 (0.0006) [2023-03-06 18:31:18,394][23882] Updated weights for policy 0, policy_version 84980 (0.0007) [2023-03-06 18:31:19,174][23882] Updated weights for policy 0, policy_version 84990 (0.0006) [2023-03-06 18:31:19,983][23882] Updated weights for policy 0, policy_version 85000 (0.0006) [2023-03-06 18:31:20,761][23882] Updated weights for policy 0, policy_version 85010 (0.0006) [2023-03-06 18:31:21,542][23882] Updated weights for policy 0, policy_version 85020 (0.0006) [2023-03-06 18:31:21,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13044.7). Total num frames: 87062528. Throughput: 0: 13017.4. Samples: 87048127. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:31:21,748][23556] Avg episode reward: [(0, '2026.809')] [2023-03-06 18:31:22,330][23882] Updated weights for policy 0, policy_version 85030 (0.0005) [2023-03-06 18:31:23,120][23882] Updated weights for policy 0, policy_version 85040 (0.0006) [2023-03-06 18:31:23,897][23882] Updated weights for policy 0, policy_version 85050 (0.0007) [2023-03-06 18:31:24,682][23882] Updated weights for policy 0, policy_version 85060 (0.0006) [2023-03-06 18:31:25,475][23882] Updated weights for policy 0, policy_version 85070 (0.0006) [2023-03-06 18:31:26,250][23882] Updated weights for policy 0, policy_version 85080 (0.0006) [2023-03-06 18:31:26,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13044.7). Total num frames: 87128064. Throughput: 0: 13016.7. Samples: 87126241. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:31:26,748][23556] Avg episode reward: [(0, '1946.061')] [2023-03-06 18:31:27,052][23882] Updated weights for policy 0, policy_version 85090 (0.0008) [2023-03-06 18:31:27,843][23882] Updated weights for policy 0, policy_version 85100 (0.0007) [2023-03-06 18:31:28,625][23882] Updated weights for policy 0, policy_version 85110 (0.0006) [2023-03-06 18:31:29,403][23882] Updated weights for policy 0, policy_version 85120 (0.0006) [2023-03-06 18:31:30,206][23882] Updated weights for policy 0, policy_version 85130 (0.0005) [2023-03-06 18:31:30,990][23882] Updated weights for policy 0, policy_version 85140 (0.0006) [2023-03-06 18:31:31,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13004.8, 300 sec: 13041.2). Total num frames: 87192576. Throughput: 0: 13014.7. Samples: 87165130. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:31:31,748][23556] Avg episode reward: [(0, '2097.931')] [2023-03-06 18:31:31,797][23882] Updated weights for policy 0, policy_version 85150 (0.0007) [2023-03-06 18:31:32,581][23882] Updated weights for policy 0, policy_version 85160 (0.0006) [2023-03-06 18:31:33,387][23882] Updated weights for policy 0, policy_version 85170 (0.0006) [2023-03-06 18:31:34,161][23882] Updated weights for policy 0, policy_version 85180 (0.0006) [2023-03-06 18:31:34,952][23882] Updated weights for policy 0, policy_version 85190 (0.0006) [2023-03-06 18:31:35,747][23882] Updated weights for policy 0, policy_version 85200 (0.0007) [2023-03-06 18:31:36,526][23882] Updated weights for policy 0, policy_version 85210 (0.0006) [2023-03-06 18:31:36,748][23556] Fps is (10 sec: 12902.3, 60 sec: 13004.8, 300 sec: 13037.8). Total num frames: 87257088. Throughput: 0: 12999.1. Samples: 87242734. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:31:36,748][23556] Avg episode reward: [(0, '2237.934')] [2023-03-06 18:31:36,752][23831] Saving new best policy, reward=2237.934! [2023-03-06 18:31:37,326][23882] Updated weights for policy 0, policy_version 85220 (0.0006) [2023-03-06 18:31:38,109][23882] Updated weights for policy 0, policy_version 85230 (0.0006) [2023-03-06 18:31:38,874][23882] Updated weights for policy 0, policy_version 85240 (0.0006) [2023-03-06 18:31:39,667][23882] Updated weights for policy 0, policy_version 85250 (0.0007) [2023-03-06 18:31:40,448][23882] Updated weights for policy 0, policy_version 85260 (0.0006) [2023-03-06 18:31:41,236][23882] Updated weights for policy 0, policy_version 85270 (0.0007) [2023-03-06 18:31:41,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13004.8, 300 sec: 13041.2). Total num frames: 87322624. Throughput: 0: 13014.8. Samples: 87321208. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:31:41,748][23556] Avg episode reward: [(0, '2129.707')] [2023-03-06 18:31:42,015][23882] Updated weights for policy 0, policy_version 85280 (0.0006) [2023-03-06 18:31:42,791][23882] Updated weights for policy 0, policy_version 85290 (0.0006) [2023-03-06 18:31:43,566][23882] Updated weights for policy 0, policy_version 85300 (0.0007) [2023-03-06 18:31:44,361][23882] Updated weights for policy 0, policy_version 85310 (0.0006) [2023-03-06 18:31:45,167][23882] Updated weights for policy 0, policy_version 85320 (0.0007) [2023-03-06 18:31:45,951][23882] Updated weights for policy 0, policy_version 85330 (0.0006) [2023-03-06 18:31:46,730][23882] Updated weights for policy 0, policy_version 85340 (0.0007) [2023-03-06 18:31:46,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13021.9, 300 sec: 13041.2). Total num frames: 87388160. Throughput: 0: 13007.7. Samples: 87360242. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:31:46,748][23556] Avg episode reward: [(0, '2089.368')] [2023-03-06 18:31:47,527][23882] Updated weights for policy 0, policy_version 85350 (0.0007) [2023-03-06 18:31:48,314][23882] Updated weights for policy 0, policy_version 85360 (0.0006) [2023-03-06 18:31:49,105][23882] Updated weights for policy 0, policy_version 85370 (0.0007) [2023-03-06 18:31:49,886][23882] Updated weights for policy 0, policy_version 85380 (0.0006) [2023-03-06 18:31:50,662][23882] Updated weights for policy 0, policy_version 85390 (0.0005) [2023-03-06 18:31:51,449][23882] Updated weights for policy 0, policy_version 85400 (0.0006) [2023-03-06 18:31:51,747][23556] Fps is (10 sec: 13005.0, 60 sec: 13004.8, 300 sec: 13041.3). Total num frames: 87452672. Throughput: 0: 13004.1. Samples: 87438350. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:31:51,748][23556] Avg episode reward: [(0, '2171.014')] [2023-03-06 18:31:52,237][23882] Updated weights for policy 0, policy_version 85410 (0.0007) [2023-03-06 18:31:53,044][23882] Updated weights for policy 0, policy_version 85420 (0.0006) [2023-03-06 18:31:53,833][23882] Updated weights for policy 0, policy_version 85430 (0.0006) [2023-03-06 18:31:54,607][23882] Updated weights for policy 0, policy_version 85440 (0.0007) [2023-03-06 18:31:55,385][23882] Updated weights for policy 0, policy_version 85450 (0.0007) [2023-03-06 18:31:56,168][23882] Updated weights for policy 0, policy_version 85460 (0.0007) [2023-03-06 18:31:56,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13041.3). Total num frames: 87518208. Throughput: 0: 13003.3. Samples: 87516388. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:31:56,748][23556] Avg episode reward: [(0, '2079.438')] [2023-03-06 18:31:56,753][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000085467_87518208.pth... [2023-03-06 18:31:56,784][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000082412_84389888.pth [2023-03-06 18:31:56,952][23882] Updated weights for policy 0, policy_version 85470 (0.0008) [2023-03-06 18:31:57,743][23882] Updated weights for policy 0, policy_version 85480 (0.0006) [2023-03-06 18:31:58,537][23882] Updated weights for policy 0, policy_version 85490 (0.0007) [2023-03-06 18:31:59,342][23882] Updated weights for policy 0, policy_version 85500 (0.0006) [2023-03-06 18:32:00,113][23882] Updated weights for policy 0, policy_version 85510 (0.0006) [2023-03-06 18:32:00,887][23882] Updated weights for policy 0, policy_version 85520 (0.0006) [2023-03-06 18:32:01,683][23882] Updated weights for policy 0, policy_version 85530 (0.0007) [2023-03-06 18:32:01,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13004.8, 300 sec: 13037.8). Total num frames: 87582720. Throughput: 0: 12998.6. Samples: 87555289. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:32:01,748][23556] Avg episode reward: [(0, '2117.221')] [2023-03-06 18:32:02,469][23882] Updated weights for policy 0, policy_version 85540 (0.0006) [2023-03-06 18:32:03,261][23882] Updated weights for policy 0, policy_version 85550 (0.0006) [2023-03-06 18:32:04,037][23882] Updated weights for policy 0, policy_version 85560 (0.0006) [2023-03-06 18:32:04,821][23882] Updated weights for policy 0, policy_version 85570 (0.0006) [2023-03-06 18:32:05,602][23882] Updated weights for policy 0, policy_version 85580 (0.0006) [2023-03-06 18:32:06,383][23882] Updated weights for policy 0, policy_version 85590 (0.0006) [2023-03-06 18:32:06,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13004.8, 300 sec: 13037.8). Total num frames: 87648256. Throughput: 0: 13015.5. Samples: 87633826. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:32:06,748][23556] Avg episode reward: [(0, '2088.107')] [2023-03-06 18:32:07,185][23882] Updated weights for policy 0, policy_version 85600 (0.0007) [2023-03-06 18:32:07,979][23882] Updated weights for policy 0, policy_version 85610 (0.0006) [2023-03-06 18:32:08,757][23882] Updated weights for policy 0, policy_version 85620 (0.0006) [2023-03-06 18:32:09,548][23882] Updated weights for policy 0, policy_version 85630 (0.0007) [2023-03-06 18:32:10,340][23882] Updated weights for policy 0, policy_version 85640 (0.0006) [2023-03-06 18:32:11,117][23882] Updated weights for policy 0, policy_version 85650 (0.0006) [2023-03-06 18:32:11,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13021.9, 300 sec: 13037.8). Total num frames: 87713792. Throughput: 0: 13009.5. Samples: 87711669. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:32:11,748][23556] Avg episode reward: [(0, '2101.569')] [2023-03-06 18:32:11,905][23882] Updated weights for policy 0, policy_version 85660 (0.0006) [2023-03-06 18:32:12,693][23882] Updated weights for policy 0, policy_version 85670 (0.0006) [2023-03-06 18:32:13,485][23882] Updated weights for policy 0, policy_version 85680 (0.0006) [2023-03-06 18:32:14,274][23882] Updated weights for policy 0, policy_version 85690 (0.0006) [2023-03-06 18:32:15,063][23882] Updated weights for policy 0, policy_version 85700 (0.0008) [2023-03-06 18:32:15,851][23882] Updated weights for policy 0, policy_version 85710 (0.0007) [2023-03-06 18:32:16,638][23882] Updated weights for policy 0, policy_version 85720 (0.0006) [2023-03-06 18:32:16,748][23556] Fps is (10 sec: 13005.0, 60 sec: 13004.8, 300 sec: 13034.3). Total num frames: 87778304. Throughput: 0: 13009.8. Samples: 87750571. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:32:16,748][23556] Avg episode reward: [(0, '2330.525')] [2023-03-06 18:32:16,753][23831] Saving new best policy, reward=2330.525! [2023-03-06 18:32:17,418][23882] Updated weights for policy 0, policy_version 85730 (0.0006) [2023-03-06 18:32:18,209][23882] Updated weights for policy 0, policy_version 85740 (0.0006) [2023-03-06 18:32:19,000][23882] Updated weights for policy 0, policy_version 85750 (0.0007) [2023-03-06 18:32:19,786][23882] Updated weights for policy 0, policy_version 85760 (0.0007) [2023-03-06 18:32:20,597][23882] Updated weights for policy 0, policy_version 85770 (0.0007) [2023-03-06 18:32:21,380][23882] Updated weights for policy 0, policy_version 85780 (0.0007) [2023-03-06 18:32:21,748][23556] Fps is (10 sec: 12902.3, 60 sec: 13004.8, 300 sec: 13034.3). Total num frames: 87842816. Throughput: 0: 13017.4. Samples: 87828519. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:32:21,748][23556] Avg episode reward: [(0, '2078.316')] [2023-03-06 18:32:22,158][23882] Updated weights for policy 0, policy_version 85790 (0.0007) [2023-03-06 18:32:22,942][23882] Updated weights for policy 0, policy_version 85800 (0.0006) [2023-03-06 18:32:23,732][23882] Updated weights for policy 0, policy_version 85810 (0.0007) [2023-03-06 18:32:24,499][23882] Updated weights for policy 0, policy_version 85820 (0.0006) [2023-03-06 18:32:25,278][23882] Updated weights for policy 0, policy_version 85830 (0.0007) [2023-03-06 18:32:26,086][23882] Updated weights for policy 0, policy_version 85840 (0.0007) [2023-03-06 18:32:26,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13034.3). Total num frames: 87908352. Throughput: 0: 13012.5. Samples: 87906771. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:32:26,748][23556] Avg episode reward: [(0, '2056.795')] [2023-03-06 18:32:26,862][23882] Updated weights for policy 0, policy_version 85850 (0.0007) [2023-03-06 18:32:27,629][23882] Updated weights for policy 0, policy_version 85860 (0.0007) [2023-03-06 18:32:28,427][23882] Updated weights for policy 0, policy_version 85870 (0.0007) [2023-03-06 18:32:29,201][23882] Updated weights for policy 0, policy_version 85880 (0.0007) [2023-03-06 18:32:29,993][23882] Updated weights for policy 0, policy_version 85890 (0.0006) [2023-03-06 18:32:30,777][23882] Updated weights for policy 0, policy_version 85900 (0.0006) [2023-03-06 18:32:31,551][23882] Updated weights for policy 0, policy_version 85910 (0.0006) [2023-03-06 18:32:31,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 87973888. Throughput: 0: 13017.9. Samples: 87946048. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:32:31,748][23556] Avg episode reward: [(0, '2135.552')] [2023-03-06 18:32:32,336][23882] Updated weights for policy 0, policy_version 85920 (0.0006) [2023-03-06 18:32:33,109][23882] Updated weights for policy 0, policy_version 85930 (0.0007) [2023-03-06 18:32:33,928][23882] Updated weights for policy 0, policy_version 85940 (0.0007) [2023-03-06 18:32:34,717][23882] Updated weights for policy 0, policy_version 85950 (0.0007) [2023-03-06 18:32:35,503][23882] Updated weights for policy 0, policy_version 85960 (0.0006) [2023-03-06 18:32:36,272][23882] Updated weights for policy 0, policy_version 85970 (0.0006) [2023-03-06 18:32:36,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 88039424. Throughput: 0: 13017.3. Samples: 88024128. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:32:36,748][23556] Avg episode reward: [(0, '2095.571')] [2023-03-06 18:32:37,065][23882] Updated weights for policy 0, policy_version 85980 (0.0006) [2023-03-06 18:32:37,850][23882] Updated weights for policy 0, policy_version 85990 (0.0006) [2023-03-06 18:32:38,645][23882] Updated weights for policy 0, policy_version 86000 (0.0006) [2023-03-06 18:32:39,425][23882] Updated weights for policy 0, policy_version 86010 (0.0007) [2023-03-06 18:32:40,241][23882] Updated weights for policy 0, policy_version 86020 (0.0006) [2023-03-06 18:32:41,036][23882] Updated weights for policy 0, policy_version 86030 (0.0006) [2023-03-06 18:32:41,748][23556] Fps is (10 sec: 13004.6, 60 sec: 13021.9, 300 sec: 13030.8). Total num frames: 88103936. Throughput: 0: 13015.0. Samples: 88102064. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:32:41,749][23556] Avg episode reward: [(0, '2236.821')] [2023-03-06 18:32:41,814][23882] Updated weights for policy 0, policy_version 86040 (0.0007) [2023-03-06 18:32:42,613][23882] Updated weights for policy 0, policy_version 86050 (0.0006) [2023-03-06 18:32:43,397][23882] Updated weights for policy 0, policy_version 86060 (0.0006) [2023-03-06 18:32:44,174][23882] Updated weights for policy 0, policy_version 86070 (0.0006) [2023-03-06 18:32:44,960][23882] Updated weights for policy 0, policy_version 86080 (0.0007) [2023-03-06 18:32:45,758][23882] Updated weights for policy 0, policy_version 86090 (0.0006) [2023-03-06 18:32:46,558][23882] Updated weights for policy 0, policy_version 86100 (0.0006) [2023-03-06 18:32:46,748][23556] Fps is (10 sec: 12902.2, 60 sec: 13004.8, 300 sec: 13030.8). Total num frames: 88168448. Throughput: 0: 13014.8. Samples: 88140958. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:32:46,749][23556] Avg episode reward: [(0, '2124.579')] [2023-03-06 18:32:47,341][23882] Updated weights for policy 0, policy_version 86110 (0.0007) [2023-03-06 18:32:48,137][23882] Updated weights for policy 0, policy_version 86120 (0.0006) [2023-03-06 18:32:48,930][23882] Updated weights for policy 0, policy_version 86130 (0.0006) [2023-03-06 18:32:49,723][23882] Updated weights for policy 0, policy_version 86140 (0.0007) [2023-03-06 18:32:50,501][23882] Updated weights for policy 0, policy_version 86150 (0.0006) [2023-03-06 18:32:51,293][23882] Updated weights for policy 0, policy_version 86160 (0.0006) [2023-03-06 18:32:51,748][23556] Fps is (10 sec: 12902.4, 60 sec: 13004.8, 300 sec: 13027.4). Total num frames: 88232960. Throughput: 0: 12995.8. Samples: 88218636. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:32:51,748][23556] Avg episode reward: [(0, '2027.901')] [2023-03-06 18:32:52,111][23882] Updated weights for policy 0, policy_version 86170 (0.0006) [2023-03-06 18:32:52,901][23882] Updated weights for policy 0, policy_version 86180 (0.0006) [2023-03-06 18:32:53,691][23882] Updated weights for policy 0, policy_version 86190 (0.0006) [2023-03-06 18:32:54,372][23831] KL-divergence is very high: 134.8748 [2023-03-06 18:32:54,472][23882] Updated weights for policy 0, policy_version 86200 (0.0006) [2023-03-06 18:32:55,245][23882] Updated weights for policy 0, policy_version 86210 (0.0006) [2023-03-06 18:32:56,047][23882] Updated weights for policy 0, policy_version 86220 (0.0007) [2023-03-06 18:32:56,748][23556] Fps is (10 sec: 13005.0, 60 sec: 13004.8, 300 sec: 13027.4). Total num frames: 88298496. Throughput: 0: 13001.9. Samples: 88296755. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:32:56,748][23556] Avg episode reward: [(0, '1973.712')] [2023-03-06 18:32:56,799][23882] Updated weights for policy 0, policy_version 86230 (0.0006) [2023-03-06 18:32:57,567][23882] Updated weights for policy 0, policy_version 86240 (0.0006) [2023-03-06 18:32:58,371][23882] Updated weights for policy 0, policy_version 86250 (0.0006) [2023-03-06 18:32:59,141][23882] Updated weights for policy 0, policy_version 86260 (0.0006) [2023-03-06 18:32:59,926][23882] Updated weights for policy 0, policy_version 86270 (0.0006) [2023-03-06 18:33:00,719][23882] Updated weights for policy 0, policy_version 86280 (0.0006) [2023-03-06 18:33:01,496][23882] Updated weights for policy 0, policy_version 86290 (0.0006) [2023-03-06 18:33:01,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13021.9, 300 sec: 13027.4). Total num frames: 88364032. Throughput: 0: 13010.5. Samples: 88336046. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:33:01,748][23556] Avg episode reward: [(0, '1849.294')] [2023-03-06 18:33:02,286][23882] Updated weights for policy 0, policy_version 86300 (0.0006) [2023-03-06 18:33:03,064][23882] Updated weights for policy 0, policy_version 86310 (0.0007) [2023-03-06 18:33:03,863][23882] Updated weights for policy 0, policy_version 86320 (0.0006) [2023-03-06 18:33:04,640][23882] Updated weights for policy 0, policy_version 86330 (0.0007) [2023-03-06 18:33:05,439][23882] Updated weights for policy 0, policy_version 86340 (0.0006) [2023-03-06 18:33:06,229][23882] Updated weights for policy 0, policy_version 86350 (0.0006) [2023-03-06 18:33:06,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13027.4). Total num frames: 88428544. Throughput: 0: 13016.0. Samples: 88414240. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:33:06,748][23556] Avg episode reward: [(0, '2118.423')] [2023-03-06 18:33:06,991][23882] Updated weights for policy 0, policy_version 86360 (0.0006) [2023-03-06 18:33:07,779][23882] Updated weights for policy 0, policy_version 86370 (0.0006) [2023-03-06 18:33:08,555][23882] Updated weights for policy 0, policy_version 86380 (0.0006) [2023-03-06 18:33:09,332][23882] Updated weights for policy 0, policy_version 86390 (0.0006) [2023-03-06 18:33:10,116][23882] Updated weights for policy 0, policy_version 86400 (0.0006) [2023-03-06 18:33:10,906][23882] Updated weights for policy 0, policy_version 86410 (0.0006) [2023-03-06 18:33:11,695][23882] Updated weights for policy 0, policy_version 86420 (0.0006) [2023-03-06 18:33:11,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13004.8, 300 sec: 13027.4). Total num frames: 88494080. Throughput: 0: 13023.1. Samples: 88492810. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:33:11,748][23556] Avg episode reward: [(0, '2108.694')] [2023-03-06 18:33:12,501][23882] Updated weights for policy 0, policy_version 86430 (0.0007) [2023-03-06 18:33:13,273][23882] Updated weights for policy 0, policy_version 86440 (0.0006) [2023-03-06 18:33:14,076][23882] Updated weights for policy 0, policy_version 86450 (0.0006) [2023-03-06 18:33:14,846][23882] Updated weights for policy 0, policy_version 86460 (0.0006) [2023-03-06 18:33:15,649][23882] Updated weights for policy 0, policy_version 86470 (0.0007) [2023-03-06 18:33:16,427][23882] Updated weights for policy 0, policy_version 86480 (0.0006) [2023-03-06 18:33:16,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13021.9, 300 sec: 13030.8). Total num frames: 88559616. Throughput: 0: 13015.2. Samples: 88531731. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:33:16,748][23556] Avg episode reward: [(0, '2185.361')] [2023-03-06 18:33:17,209][23882] Updated weights for policy 0, policy_version 86490 (0.0007) [2023-03-06 18:33:18,003][23882] Updated weights for policy 0, policy_version 86500 (0.0006) [2023-03-06 18:33:18,794][23882] Updated weights for policy 0, policy_version 86510 (0.0007) [2023-03-06 18:33:19,572][23882] Updated weights for policy 0, policy_version 86520 (0.0006) [2023-03-06 18:33:20,363][23882] Updated weights for policy 0, policy_version 86530 (0.0007) [2023-03-06 18:33:21,138][23882] Updated weights for policy 0, policy_version 86540 (0.0006) [2023-03-06 18:33:21,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13027.4). Total num frames: 88624128. Throughput: 0: 13014.3. Samples: 88609774. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:33:21,748][23556] Avg episode reward: [(0, '2042.281')] [2023-03-06 18:33:21,920][23882] Updated weights for policy 0, policy_version 86550 (0.0006) [2023-03-06 18:33:22,705][23882] Updated weights for policy 0, policy_version 86560 (0.0006) [2023-03-06 18:33:23,477][23882] Updated weights for policy 0, policy_version 86570 (0.0006) [2023-03-06 18:33:24,275][23882] Updated weights for policy 0, policy_version 86580 (0.0007) [2023-03-06 18:33:25,060][23882] Updated weights for policy 0, policy_version 86590 (0.0007) [2023-03-06 18:33:25,842][23882] Updated weights for policy 0, policy_version 86600 (0.0007) [2023-03-06 18:33:26,616][23882] Updated weights for policy 0, policy_version 86610 (0.0006) [2023-03-06 18:33:26,748][23556] Fps is (10 sec: 13004.6, 60 sec: 13021.8, 300 sec: 13027.4). Total num frames: 88689664. Throughput: 0: 13024.7. Samples: 88688178. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:33:26,749][23556] Avg episode reward: [(0, '2153.243')] [2023-03-06 18:33:27,398][23882] Updated weights for policy 0, policy_version 86620 (0.0006) [2023-03-06 18:33:28,175][23882] Updated weights for policy 0, policy_version 86630 (0.0007) [2023-03-06 18:33:28,994][23882] Updated weights for policy 0, policy_version 86640 (0.0006) [2023-03-06 18:33:29,771][23882] Updated weights for policy 0, policy_version 86650 (0.0007) [2023-03-06 18:33:30,548][23882] Updated weights for policy 0, policy_version 86660 (0.0007) [2023-03-06 18:33:31,355][23882] Updated weights for policy 0, policy_version 86670 (0.0006) [2023-03-06 18:33:31,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13021.9, 300 sec: 13030.8). Total num frames: 88755200. Throughput: 0: 13028.8. Samples: 88727253. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:33:31,748][23556] Avg episode reward: [(0, '2154.837')] [2023-03-06 18:33:32,148][23882] Updated weights for policy 0, policy_version 86680 (0.0007) [2023-03-06 18:33:32,926][23882] Updated weights for policy 0, policy_version 86690 (0.0007) [2023-03-06 18:33:33,706][23882] Updated weights for policy 0, policy_version 86700 (0.0006) [2023-03-06 18:33:34,499][23882] Updated weights for policy 0, policy_version 86710 (0.0006) [2023-03-06 18:33:35,262][23882] Updated weights for policy 0, policy_version 86720 (0.0007) [2023-03-06 18:33:36,055][23882] Updated weights for policy 0, policy_version 86730 (0.0006) [2023-03-06 18:33:36,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13004.8, 300 sec: 13027.4). Total num frames: 88819712. Throughput: 0: 13041.8. Samples: 88805518. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:33:36,748][23556] Avg episode reward: [(0, '2123.756')] [2023-03-06 18:33:36,835][23882] Updated weights for policy 0, policy_version 86740 (0.0006) [2023-03-06 18:33:37,621][23882] Updated weights for policy 0, policy_version 86750 (0.0006) [2023-03-06 18:33:38,408][23882] Updated weights for policy 0, policy_version 86760 (0.0006) [2023-03-06 18:33:39,183][23882] Updated weights for policy 0, policy_version 86770 (0.0006) [2023-03-06 18:33:39,964][23882] Updated weights for policy 0, policy_version 86780 (0.0007) [2023-03-06 18:33:40,745][23882] Updated weights for policy 0, policy_version 86790 (0.0006) [2023-03-06 18:33:41,524][23882] Updated weights for policy 0, policy_version 86800 (0.0007) [2023-03-06 18:33:41,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13027.4). Total num frames: 88885248. Throughput: 0: 13050.1. Samples: 88884011. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:33:41,748][23556] Avg episode reward: [(0, '2220.550')] [2023-03-06 18:33:42,309][23882] Updated weights for policy 0, policy_version 86810 (0.0006) [2023-03-06 18:33:43,094][23882] Updated weights for policy 0, policy_version 86820 (0.0007) [2023-03-06 18:33:43,889][23882] Updated weights for policy 0, policy_version 86830 (0.0006) [2023-03-06 18:33:44,668][23882] Updated weights for policy 0, policy_version 86840 (0.0006) [2023-03-06 18:33:45,450][23882] Updated weights for policy 0, policy_version 86850 (0.0006) [2023-03-06 18:33:46,250][23882] Updated weights for policy 0, policy_version 86860 (0.0006) [2023-03-06 18:33:46,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13039.0, 300 sec: 13030.8). Total num frames: 88950784. Throughput: 0: 13050.7. Samples: 88923326. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:33:46,748][23556] Avg episode reward: [(0, '2150.470')] [2023-03-06 18:33:47,027][23882] Updated weights for policy 0, policy_version 86870 (0.0006) [2023-03-06 18:33:47,795][23882] Updated weights for policy 0, policy_version 86880 (0.0006) [2023-03-06 18:33:48,582][23882] Updated weights for policy 0, policy_version 86890 (0.0007) [2023-03-06 18:33:49,371][23882] Updated weights for policy 0, policy_version 86900 (0.0007) [2023-03-06 18:33:50,144][23882] Updated weights for policy 0, policy_version 86910 (0.0006) [2023-03-06 18:33:50,935][23882] Updated weights for policy 0, policy_version 86920 (0.0007) [2023-03-06 18:33:51,720][23882] Updated weights for policy 0, policy_version 86930 (0.0007) [2023-03-06 18:33:51,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13030.8). Total num frames: 89016320. Throughput: 0: 13048.1. Samples: 89001403. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:33:51,748][23556] Avg episode reward: [(0, '2114.769')] [2023-03-06 18:33:52,512][23882] Updated weights for policy 0, policy_version 86940 (0.0007) [2023-03-06 18:33:53,289][23882] Updated weights for policy 0, policy_version 86950 (0.0006) [2023-03-06 18:33:54,080][23882] Updated weights for policy 0, policy_version 86960 (0.0006) [2023-03-06 18:33:54,873][23882] Updated weights for policy 0, policy_version 86970 (0.0007) [2023-03-06 18:33:55,641][23882] Updated weights for policy 0, policy_version 86980 (0.0006) [2023-03-06 18:33:56,423][23882] Updated weights for policy 0, policy_version 86990 (0.0005) [2023-03-06 18:33:56,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13030.8). Total num frames: 89081856. Throughput: 0: 13047.0. Samples: 89079925. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:33:56,748][23556] Avg episode reward: [(0, '1935.397')] [2023-03-06 18:33:56,753][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000086994_89081856.pth... [2023-03-06 18:33:56,783][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000083941_85955584.pth [2023-03-06 18:33:57,206][23882] Updated weights for policy 0, policy_version 87000 (0.0006) [2023-03-06 18:33:57,979][23882] Updated weights for policy 0, policy_version 87010 (0.0006) [2023-03-06 18:33:58,765][23882] Updated weights for policy 0, policy_version 87020 (0.0006) [2023-03-06 18:33:59,545][23882] Updated weights for policy 0, policy_version 87030 (0.0005) [2023-03-06 18:34:00,334][23882] Updated weights for policy 0, policy_version 87040 (0.0007) [2023-03-06 18:34:01,125][23882] Updated weights for policy 0, policy_version 87050 (0.0007) [2023-03-06 18:34:01,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13030.8). Total num frames: 89146368. Throughput: 0: 13057.9. Samples: 89119336. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:34:01,748][23556] Avg episode reward: [(0, '2078.001')] [2023-03-06 18:34:01,906][23882] Updated weights for policy 0, policy_version 87060 (0.0007) [2023-03-06 18:34:02,688][23882] Updated weights for policy 0, policy_version 87070 (0.0006) [2023-03-06 18:34:03,456][23882] Updated weights for policy 0, policy_version 87080 (0.0007) [2023-03-06 18:34:04,248][23882] Updated weights for policy 0, policy_version 87090 (0.0006) [2023-03-06 18:34:05,042][23882] Updated weights for policy 0, policy_version 87100 (0.0006) [2023-03-06 18:34:05,828][23882] Updated weights for policy 0, policy_version 87110 (0.0007) [2023-03-06 18:34:06,624][23882] Updated weights for policy 0, policy_version 87120 (0.0006) [2023-03-06 18:34:06,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13027.4). Total num frames: 89211904. Throughput: 0: 13062.9. Samples: 89197605. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:34:06,748][23556] Avg episode reward: [(0, '2215.055')] [2023-03-06 18:34:07,408][23882] Updated weights for policy 0, policy_version 87130 (0.0007) [2023-03-06 18:34:08,201][23882] Updated weights for policy 0, policy_version 87140 (0.0007) [2023-03-06 18:34:08,980][23882] Updated weights for policy 0, policy_version 87150 (0.0007) [2023-03-06 18:34:09,767][23882] Updated weights for policy 0, policy_version 87160 (0.0006) [2023-03-06 18:34:10,551][23882] Updated weights for policy 0, policy_version 87170 (0.0006) [2023-03-06 18:34:11,338][23882] Updated weights for policy 0, policy_version 87180 (0.0007) [2023-03-06 18:34:11,748][23556] Fps is (10 sec: 13107.0, 60 sec: 13056.0, 300 sec: 13030.8). Total num frames: 89277440. Throughput: 0: 13055.9. Samples: 89275695. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:34:11,748][23556] Avg episode reward: [(0, '2052.217')] [2023-03-06 18:34:12,116][23882] Updated weights for policy 0, policy_version 87190 (0.0006) [2023-03-06 18:34:12,885][23882] Updated weights for policy 0, policy_version 87200 (0.0006) [2023-03-06 18:34:13,694][23882] Updated weights for policy 0, policy_version 87210 (0.0006) [2023-03-06 18:34:14,473][23882] Updated weights for policy 0, policy_version 87220 (0.0006) [2023-03-06 18:34:15,269][23882] Updated weights for policy 0, policy_version 87230 (0.0006) [2023-03-06 18:34:16,041][23882] Updated weights for policy 0, policy_version 87240 (0.0006) [2023-03-06 18:34:16,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13027.4). Total num frames: 89341952. Throughput: 0: 13054.7. Samples: 89314714. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:34:16,748][23556] Avg episode reward: [(0, '2090.555')] [2023-03-06 18:34:16,839][23882] Updated weights for policy 0, policy_version 87250 (0.0006) [2023-03-06 18:34:17,626][23882] Updated weights for policy 0, policy_version 87260 (0.0007) [2023-03-06 18:34:18,401][23882] Updated weights for policy 0, policy_version 87270 (0.0006) [2023-03-06 18:34:19,184][23882] Updated weights for policy 0, policy_version 87280 (0.0006) [2023-03-06 18:34:19,973][23882] Updated weights for policy 0, policy_version 87290 (0.0007) [2023-03-06 18:34:20,740][23882] Updated weights for policy 0, policy_version 87300 (0.0006) [2023-03-06 18:34:21,521][23882] Updated weights for policy 0, policy_version 87310 (0.0006) [2023-03-06 18:34:21,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13027.4). Total num frames: 89407488. Throughput: 0: 13057.7. Samples: 89393115. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:34:21,748][23556] Avg episode reward: [(0, '2037.377')] [2023-03-06 18:34:22,315][23882] Updated weights for policy 0, policy_version 87320 (0.0006) [2023-03-06 18:34:23,099][23882] Updated weights for policy 0, policy_version 87330 (0.0007) [2023-03-06 18:34:23,881][23882] Updated weights for policy 0, policy_version 87340 (0.0006) [2023-03-06 18:34:24,675][23882] Updated weights for policy 0, policy_version 87350 (0.0006) [2023-03-06 18:34:25,457][23882] Updated weights for policy 0, policy_version 87360 (0.0007) [2023-03-06 18:34:26,245][23882] Updated weights for policy 0, policy_version 87370 (0.0006) [2023-03-06 18:34:26,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13030.8). Total num frames: 89473024. Throughput: 0: 13048.4. Samples: 89471187. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:34:26,748][23556] Avg episode reward: [(0, '2055.274')] [2023-03-06 18:34:27,061][23882] Updated weights for policy 0, policy_version 87380 (0.0006) [2023-03-06 18:34:27,839][23882] Updated weights for policy 0, policy_version 87390 (0.0007) [2023-03-06 18:34:28,629][23882] Updated weights for policy 0, policy_version 87400 (0.0007) [2023-03-06 18:34:29,431][23882] Updated weights for policy 0, policy_version 87410 (0.0007) [2023-03-06 18:34:30,216][23882] Updated weights for policy 0, policy_version 87420 (0.0006) [2023-03-06 18:34:31,006][23882] Updated weights for policy 0, policy_version 87430 (0.0005) [2023-03-06 18:34:31,748][23556] Fps is (10 sec: 13004.6, 60 sec: 13038.9, 300 sec: 13027.4). Total num frames: 89537536. Throughput: 0: 13035.1. Samples: 89509906. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:34:31,748][23556] Avg episode reward: [(0, '2082.977')] [2023-03-06 18:34:31,782][23882] Updated weights for policy 0, policy_version 87440 (0.0006) [2023-03-06 18:34:32,582][23882] Updated weights for policy 0, policy_version 87450 (0.0006) [2023-03-06 18:34:33,381][23882] Updated weights for policy 0, policy_version 87460 (0.0006) [2023-03-06 18:34:34,151][23882] Updated weights for policy 0, policy_version 87470 (0.0007) [2023-03-06 18:34:34,929][23882] Updated weights for policy 0, policy_version 87480 (0.0006) [2023-03-06 18:34:35,697][23882] Updated weights for policy 0, policy_version 87490 (0.0005) [2023-03-06 18:34:36,481][23882] Updated weights for policy 0, policy_version 87500 (0.0006) [2023-03-06 18:34:36,748][23556] Fps is (10 sec: 13004.6, 60 sec: 13056.0, 300 sec: 13027.4). Total num frames: 89603072. Throughput: 0: 13038.3. Samples: 89588130. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:34:36,748][23556] Avg episode reward: [(0, '2117.741')] [2023-03-06 18:34:37,290][23882] Updated weights for policy 0, policy_version 87510 (0.0006) [2023-03-06 18:34:38,069][23882] Updated weights for policy 0, policy_version 87520 (0.0006) [2023-03-06 18:34:38,846][23882] Updated weights for policy 0, policy_version 87530 (0.0007) [2023-03-06 18:34:39,622][23882] Updated weights for policy 0, policy_version 87540 (0.0007) [2023-03-06 18:34:40,433][23882] Updated weights for policy 0, policy_version 87550 (0.0007) [2023-03-06 18:34:41,209][23882] Updated weights for policy 0, policy_version 87560 (0.0006) [2023-03-06 18:34:41,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13023.9). Total num frames: 89667584. Throughput: 0: 13029.4. Samples: 89666250. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:34:41,748][23556] Avg episode reward: [(0, '1959.169')] [2023-03-06 18:34:41,989][23882] Updated weights for policy 0, policy_version 87570 (0.0006) [2023-03-06 18:34:42,783][23882] Updated weights for policy 0, policy_version 87580 (0.0007) [2023-03-06 18:34:43,549][23882] Updated weights for policy 0, policy_version 87590 (0.0007) [2023-03-06 18:34:44,339][23882] Updated weights for policy 0, policy_version 87600 (0.0006) [2023-03-06 18:34:45,125][23882] Updated weights for policy 0, policy_version 87610 (0.0006) [2023-03-06 18:34:45,921][23882] Updated weights for policy 0, policy_version 87620 (0.0006) [2023-03-06 18:34:46,700][23882] Updated weights for policy 0, policy_version 87630 (0.0006) [2023-03-06 18:34:46,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13027.4). Total num frames: 89733120. Throughput: 0: 13029.3. Samples: 89705654. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:34:46,748][23556] Avg episode reward: [(0, '2107.952')] [2023-03-06 18:34:47,498][23882] Updated weights for policy 0, policy_version 87640 (0.0006) [2023-03-06 18:34:48,292][23882] Updated weights for policy 0, policy_version 87650 (0.0007) [2023-03-06 18:34:49,070][23882] Updated weights for policy 0, policy_version 87660 (0.0006) [2023-03-06 18:34:49,880][23882] Updated weights for policy 0, policy_version 87670 (0.0007) [2023-03-06 18:34:50,649][23882] Updated weights for policy 0, policy_version 87680 (0.0007) [2023-03-06 18:34:51,431][23882] Updated weights for policy 0, policy_version 87690 (0.0006) [2023-03-06 18:34:51,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13027.4). Total num frames: 89798656. Throughput: 0: 13022.2. Samples: 89783604. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:34:51,748][23556] Avg episode reward: [(0, '1991.052')] [2023-03-06 18:34:52,208][23882] Updated weights for policy 0, policy_version 87700 (0.0006) [2023-03-06 18:34:52,997][23882] Updated weights for policy 0, policy_version 87710 (0.0006) [2023-03-06 18:34:53,790][23882] Updated weights for policy 0, policy_version 87720 (0.0006) [2023-03-06 18:34:54,578][23882] Updated weights for policy 0, policy_version 87730 (0.0006) [2023-03-06 18:34:55,366][23882] Updated weights for policy 0, policy_version 87740 (0.0006) [2023-03-06 18:34:56,152][23882] Updated weights for policy 0, policy_version 87750 (0.0006) [2023-03-06 18:34:56,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13023.9). Total num frames: 89863168. Throughput: 0: 13026.6. Samples: 89861892. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:34:56,748][23556] Avg episode reward: [(0, '2073.441')] [2023-03-06 18:34:56,934][23882] Updated weights for policy 0, policy_version 87760 (0.0006) [2023-03-06 18:34:57,708][23882] Updated weights for policy 0, policy_version 87770 (0.0009) [2023-03-06 18:34:58,487][23882] Updated weights for policy 0, policy_version 87780 (0.0006) [2023-03-06 18:34:59,259][23882] Updated weights for policy 0, policy_version 87790 (0.0007) [2023-03-06 18:35:00,026][23882] Updated weights for policy 0, policy_version 87800 (0.0006) [2023-03-06 18:35:00,817][23882] Updated weights for policy 0, policy_version 87810 (0.0007) [2023-03-06 18:35:01,612][23882] Updated weights for policy 0, policy_version 87820 (0.0006) [2023-03-06 18:35:01,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13027.4). Total num frames: 89928704. Throughput: 0: 13032.8. Samples: 89901189. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:35:01,748][23556] Avg episode reward: [(0, '2160.160')] [2023-03-06 18:35:02,401][23882] Updated weights for policy 0, policy_version 87830 (0.0006) [2023-03-06 18:35:03,183][23882] Updated weights for policy 0, policy_version 87840 (0.0007) [2023-03-06 18:35:03,958][23882] Updated weights for policy 0, policy_version 87850 (0.0007) [2023-03-06 18:35:04,766][23882] Updated weights for policy 0, policy_version 87860 (0.0006) [2023-03-06 18:35:05,565][23882] Updated weights for policy 0, policy_version 87870 (0.0007) [2023-03-06 18:35:06,325][23882] Updated weights for policy 0, policy_version 87880 (0.0006) [2023-03-06 18:35:06,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13027.4). Total num frames: 89994240. Throughput: 0: 13027.5. Samples: 89979352. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:35:06,748][23556] Avg episode reward: [(0, '2121.654')] [2023-03-06 18:35:07,116][23882] Updated weights for policy 0, policy_version 87890 (0.0006) [2023-03-06 18:35:07,898][23882] Updated weights for policy 0, policy_version 87900 (0.0006) [2023-03-06 18:35:08,670][23882] Updated weights for policy 0, policy_version 87910 (0.0006) [2023-03-06 18:35:09,456][23882] Updated weights for policy 0, policy_version 87920 (0.0007) [2023-03-06 18:35:10,265][23882] Updated weights for policy 0, policy_version 87930 (0.0007) [2023-03-06 18:35:11,056][23882] Updated weights for policy 0, policy_version 87940 (0.0006) [2023-03-06 18:35:11,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13027.4). Total num frames: 90058752. Throughput: 0: 13027.7. Samples: 90057433. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:35:11,748][23556] Avg episode reward: [(0, '2128.195')] [2023-03-06 18:35:11,844][23882] Updated weights for policy 0, policy_version 87950 (0.0006) [2023-03-06 18:35:12,655][23882] Updated weights for policy 0, policy_version 87960 (0.0006) [2023-03-06 18:35:13,430][23882] Updated weights for policy 0, policy_version 87970 (0.0006) [2023-03-06 18:35:14,200][23882] Updated weights for policy 0, policy_version 87980 (0.0006) [2023-03-06 18:35:14,984][23882] Updated weights for policy 0, policy_version 87990 (0.0006) [2023-03-06 18:35:15,765][23882] Updated weights for policy 0, policy_version 88000 (0.0006) [2023-03-06 18:35:16,558][23882] Updated weights for policy 0, policy_version 88010 (0.0006) [2023-03-06 18:35:16,748][23556] Fps is (10 sec: 13004.6, 60 sec: 13038.9, 300 sec: 13027.4). Total num frames: 90124288. Throughput: 0: 13039.6. Samples: 90096690. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:35:16,749][23556] Avg episode reward: [(0, '2018.656')] [2023-03-06 18:35:17,342][23882] Updated weights for policy 0, policy_version 88020 (0.0006) [2023-03-06 18:35:18,113][23882] Updated weights for policy 0, policy_version 88030 (0.0006) [2023-03-06 18:35:18,895][23882] Updated weights for policy 0, policy_version 88040 (0.0006) [2023-03-06 18:35:19,683][23882] Updated weights for policy 0, policy_version 88050 (0.0006) [2023-03-06 18:35:20,463][23882] Updated weights for policy 0, policy_version 88060 (0.0006) [2023-03-06 18:35:21,259][23882] Updated weights for policy 0, policy_version 88070 (0.0006) [2023-03-06 18:35:21,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13027.4). Total num frames: 90189824. Throughput: 0: 13043.5. Samples: 90175085. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:35:21,748][23556] Avg episode reward: [(0, '1910.990')] [2023-03-06 18:35:22,054][23882] Updated weights for policy 0, policy_version 88080 (0.0006) [2023-03-06 18:35:22,839][23882] Updated weights for policy 0, policy_version 88090 (0.0007) [2023-03-06 18:35:23,626][23882] Updated weights for policy 0, policy_version 88100 (0.0007) [2023-03-06 18:35:24,410][23882] Updated weights for policy 0, policy_version 88110 (0.0008) [2023-03-06 18:35:25,202][23882] Updated weights for policy 0, policy_version 88120 (0.0006) [2023-03-06 18:35:25,982][23882] Updated weights for policy 0, policy_version 88130 (0.0006) [2023-03-06 18:35:26,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13021.8, 300 sec: 13023.9). Total num frames: 90254336. Throughput: 0: 13040.4. Samples: 90253070. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:35:26,748][23556] Avg episode reward: [(0, '2030.331')] [2023-03-06 18:35:26,765][23882] Updated weights for policy 0, policy_version 88140 (0.0006) [2023-03-06 18:35:27,545][23882] Updated weights for policy 0, policy_version 88150 (0.0006) [2023-03-06 18:35:28,332][23882] Updated weights for policy 0, policy_version 88160 (0.0006) [2023-03-06 18:35:29,101][23882] Updated weights for policy 0, policy_version 88170 (0.0006) [2023-03-06 18:35:29,897][23882] Updated weights for policy 0, policy_version 88180 (0.0007) [2023-03-06 18:35:30,675][23882] Updated weights for policy 0, policy_version 88190 (0.0007) [2023-03-06 18:35:31,460][23882] Updated weights for policy 0, policy_version 88200 (0.0007) [2023-03-06 18:35:31,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13039.0, 300 sec: 13027.4). Total num frames: 90319872. Throughput: 0: 13036.1. Samples: 90292277. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:35:31,748][23556] Avg episode reward: [(0, '2039.400')] [2023-03-06 18:35:32,253][23882] Updated weights for policy 0, policy_version 88210 (0.0006) [2023-03-06 18:35:33,055][23882] Updated weights for policy 0, policy_version 88220 (0.0007) [2023-03-06 18:35:33,849][23882] Updated weights for policy 0, policy_version 88230 (0.0006) [2023-03-06 18:35:34,631][23882] Updated weights for policy 0, policy_version 88240 (0.0007) [2023-03-06 18:35:35,413][23882] Updated weights for policy 0, policy_version 88250 (0.0007) [2023-03-06 18:35:36,200][23882] Updated weights for policy 0, policy_version 88260 (0.0006) [2023-03-06 18:35:36,748][23556] Fps is (10 sec: 13005.0, 60 sec: 13021.9, 300 sec: 13023.9). Total num frames: 90384384. Throughput: 0: 13038.1. Samples: 90370320. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:35:36,748][23556] Avg episode reward: [(0, '2223.346')] [2023-03-06 18:35:37,000][23882] Updated weights for policy 0, policy_version 88270 (0.0007) [2023-03-06 18:35:37,771][23882] Updated weights for policy 0, policy_version 88280 (0.0006) [2023-03-06 18:35:38,557][23882] Updated weights for policy 0, policy_version 88290 (0.0007) [2023-03-06 18:35:39,376][23882] Updated weights for policy 0, policy_version 88300 (0.0007) [2023-03-06 18:35:40,153][23882] Updated weights for policy 0, policy_version 88310 (0.0006) [2023-03-06 18:35:40,935][23882] Updated weights for policy 0, policy_version 88320 (0.0006) [2023-03-06 18:35:41,729][23882] Updated weights for policy 0, policy_version 88330 (0.0006) [2023-03-06 18:35:41,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13027.4). Total num frames: 90449920. Throughput: 0: 13027.9. Samples: 90448148. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:35:41,748][23556] Avg episode reward: [(0, '2079.462')] [2023-03-06 18:35:42,511][23882] Updated weights for policy 0, policy_version 88340 (0.0007) [2023-03-06 18:35:43,293][23882] Updated weights for policy 0, policy_version 88350 (0.0007) [2023-03-06 18:35:44,073][23882] Updated weights for policy 0, policy_version 88360 (0.0007) [2023-03-06 18:35:44,854][23882] Updated weights for policy 0, policy_version 88370 (0.0005) [2023-03-06 18:35:45,634][23882] Updated weights for policy 0, policy_version 88380 (0.0006) [2023-03-06 18:35:46,414][23882] Updated weights for policy 0, policy_version 88390 (0.0007) [2023-03-06 18:35:46,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13027.4). Total num frames: 90515456. Throughput: 0: 13028.4. Samples: 90487467. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:35:46,759][23556] Avg episode reward: [(0, '2186.996')] [2023-03-06 18:35:47,202][23882] Updated weights for policy 0, policy_version 88400 (0.0006) [2023-03-06 18:35:47,982][23882] Updated weights for policy 0, policy_version 88410 (0.0006) [2023-03-06 18:35:48,787][23882] Updated weights for policy 0, policy_version 88420 (0.0006) [2023-03-06 18:35:49,563][23882] Updated weights for policy 0, policy_version 88430 (0.0006) [2023-03-06 18:35:50,339][23882] Updated weights for policy 0, policy_version 88440 (0.0007) [2023-03-06 18:35:51,141][23882] Updated weights for policy 0, policy_version 88450 (0.0006) [2023-03-06 18:35:51,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13021.8, 300 sec: 13027.4). Total num frames: 90579968. Throughput: 0: 13031.2. Samples: 90565757. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:35:51,749][23556] Avg episode reward: [(0, '2225.331')] [2023-03-06 18:35:51,918][23882] Updated weights for policy 0, policy_version 88460 (0.0005) [2023-03-06 18:35:52,694][23882] Updated weights for policy 0, policy_version 88470 (0.0007) [2023-03-06 18:35:53,488][23882] Updated weights for policy 0, policy_version 88480 (0.0007) [2023-03-06 18:35:54,278][23882] Updated weights for policy 0, policy_version 88490 (0.0006) [2023-03-06 18:35:55,042][23882] Updated weights for policy 0, policy_version 88500 (0.0007) [2023-03-06 18:35:55,840][23882] Updated weights for policy 0, policy_version 88510 (0.0008) [2023-03-06 18:35:56,638][23882] Updated weights for policy 0, policy_version 88520 (0.0006) [2023-03-06 18:35:56,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13027.4). Total num frames: 90645504. Throughput: 0: 13028.3. Samples: 90643709. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:35:56,748][23556] Avg episode reward: [(0, '2171.325')] [2023-03-06 18:35:56,753][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000088521_90645504.pth... [2023-03-06 18:35:56,782][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000085467_87518208.pth [2023-03-06 18:35:57,430][23882] Updated weights for policy 0, policy_version 88530 (0.0007) [2023-03-06 18:35:58,217][23882] Updated weights for policy 0, policy_version 88540 (0.0007) [2023-03-06 18:35:58,985][23882] Updated weights for policy 0, policy_version 88550 (0.0007) [2023-03-06 18:35:59,765][23882] Updated weights for policy 0, policy_version 88560 (0.0006) [2023-03-06 18:36:00,560][23882] Updated weights for policy 0, policy_version 88570 (0.0006) [2023-03-06 18:36:01,350][23882] Updated weights for policy 0, policy_version 88580 (0.0006) [2023-03-06 18:36:01,748][23556] Fps is (10 sec: 13107.4, 60 sec: 13038.9, 300 sec: 13027.4). Total num frames: 90711040. Throughput: 0: 13031.6. Samples: 90683108. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:36:01,748][23556] Avg episode reward: [(0, '1976.366')] [2023-03-06 18:36:02,124][23882] Updated weights for policy 0, policy_version 88590 (0.0006) [2023-03-06 18:36:02,923][23882] Updated weights for policy 0, policy_version 88600 (0.0007) [2023-03-06 18:36:03,709][23882] Updated weights for policy 0, policy_version 88610 (0.0007) [2023-03-06 18:36:04,486][23882] Updated weights for policy 0, policy_version 88620 (0.0006) [2023-03-06 18:36:05,269][23882] Updated weights for policy 0, policy_version 88630 (0.0006) [2023-03-06 18:36:06,066][23882] Updated weights for policy 0, policy_version 88640 (0.0007) [2023-03-06 18:36:06,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13027.4). Total num frames: 90775552. Throughput: 0: 13026.7. Samples: 90761283. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:36:06,748][23556] Avg episode reward: [(0, '2057.220')] [2023-03-06 18:36:06,840][23882] Updated weights for policy 0, policy_version 88650 (0.0006) [2023-03-06 18:36:07,622][23882] Updated weights for policy 0, policy_version 88660 (0.0007) [2023-03-06 18:36:08,416][23882] Updated weights for policy 0, policy_version 88670 (0.0006) [2023-03-06 18:36:09,220][23882] Updated weights for policy 0, policy_version 88680 (0.0007) [2023-03-06 18:36:09,978][23882] Updated weights for policy 0, policy_version 88690 (0.0006) [2023-03-06 18:36:10,766][23882] Updated weights for policy 0, policy_version 88700 (0.0006) [2023-03-06 18:36:11,555][23882] Updated weights for policy 0, policy_version 88710 (0.0006) [2023-03-06 18:36:11,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13027.4). Total num frames: 90841088. Throughput: 0: 13033.4. Samples: 90839572. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:36:11,748][23556] Avg episode reward: [(0, '2067.600')] [2023-03-06 18:36:12,330][23882] Updated weights for policy 0, policy_version 88720 (0.0006) [2023-03-06 18:36:13,131][23882] Updated weights for policy 0, policy_version 88730 (0.0006) [2023-03-06 18:36:13,913][23882] Updated weights for policy 0, policy_version 88740 (0.0007) [2023-03-06 18:36:14,686][23882] Updated weights for policy 0, policy_version 88750 (0.0006) [2023-03-06 18:36:15,491][23882] Updated weights for policy 0, policy_version 88760 (0.0006) [2023-03-06 18:36:16,265][23882] Updated weights for policy 0, policy_version 88770 (0.0007) [2023-03-06 18:36:16,748][23556] Fps is (10 sec: 13107.0, 60 sec: 13038.9, 300 sec: 13030.8). Total num frames: 90906624. Throughput: 0: 13029.3. Samples: 90878597. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:36:16,748][23556] Avg episode reward: [(0, '2231.927')] [2023-03-06 18:36:17,041][23882] Updated weights for policy 0, policy_version 88780 (0.0006) [2023-03-06 18:36:17,831][23882] Updated weights for policy 0, policy_version 88790 (0.0006) [2023-03-06 18:36:18,610][23882] Updated weights for policy 0, policy_version 88800 (0.0006) [2023-03-06 18:36:19,385][23882] Updated weights for policy 0, policy_version 88810 (0.0006) [2023-03-06 18:36:20,164][23882] Updated weights for policy 0, policy_version 88820 (0.0007) [2023-03-06 18:36:20,949][23882] Updated weights for policy 0, policy_version 88830 (0.0007) [2023-03-06 18:36:21,737][23882] Updated weights for policy 0, policy_version 88840 (0.0006) [2023-03-06 18:36:21,748][23556] Fps is (10 sec: 13107.0, 60 sec: 13038.9, 300 sec: 13030.8). Total num frames: 90972160. Throughput: 0: 13041.2. Samples: 90957176. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:36:21,749][23556] Avg episode reward: [(0, '2093.344')] [2023-03-06 18:36:22,531][23882] Updated weights for policy 0, policy_version 88850 (0.0005) [2023-03-06 18:36:23,310][23882] Updated weights for policy 0, policy_version 88860 (0.0006) [2023-03-06 18:36:24,082][23882] Updated weights for policy 0, policy_version 88870 (0.0006) [2023-03-06 18:36:24,862][23882] Updated weights for policy 0, policy_version 88880 (0.0006) [2023-03-06 18:36:25,646][23882] Updated weights for policy 0, policy_version 88890 (0.0007) [2023-03-06 18:36:26,435][23882] Updated weights for policy 0, policy_version 88900 (0.0008) [2023-03-06 18:36:26,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13034.3). Total num frames: 91037696. Throughput: 0: 13055.2. Samples: 91035635. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:36:26,748][23556] Avg episode reward: [(0, '2068.615')] [2023-03-06 18:36:27,211][23882] Updated weights for policy 0, policy_version 88910 (0.0006) [2023-03-06 18:36:28,001][23882] Updated weights for policy 0, policy_version 88920 (0.0006) [2023-03-06 18:36:28,788][23882] Updated weights for policy 0, policy_version 88930 (0.0006) [2023-03-06 18:36:29,579][23882] Updated weights for policy 0, policy_version 88940 (0.0006) [2023-03-06 18:36:30,368][23882] Updated weights for policy 0, policy_version 88950 (0.0006) [2023-03-06 18:36:31,151][23882] Updated weights for policy 0, policy_version 88960 (0.0006) [2023-03-06 18:36:31,748][23556] Fps is (10 sec: 13005.0, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 91102208. Throughput: 0: 13052.1. Samples: 91074812. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:36:31,748][23556] Avg episode reward: [(0, '1840.435')] [2023-03-06 18:36:31,930][23882] Updated weights for policy 0, policy_version 88970 (0.0007) [2023-03-06 18:36:32,732][23882] Updated weights for policy 0, policy_version 88980 (0.0006) [2023-03-06 18:36:33,517][23882] Updated weights for policy 0, policy_version 88990 (0.0006) [2023-03-06 18:36:34,302][23882] Updated weights for policy 0, policy_version 89000 (0.0007) [2023-03-06 18:36:35,085][23882] Updated weights for policy 0, policy_version 89010 (0.0007) [2023-03-06 18:36:35,884][23882] Updated weights for policy 0, policy_version 89020 (0.0007) [2023-03-06 18:36:36,662][23882] Updated weights for policy 0, policy_version 89030 (0.0006) [2023-03-06 18:36:36,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13034.3). Total num frames: 91167744. Throughput: 0: 13042.2. Samples: 91152655. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:36:36,748][23556] Avg episode reward: [(0, '1733.016')] [2023-03-06 18:36:37,445][23882] Updated weights for policy 0, policy_version 89040 (0.0006) [2023-03-06 18:36:38,238][23882] Updated weights for policy 0, policy_version 89050 (0.0007) [2023-03-06 18:36:39,042][23882] Updated weights for policy 0, policy_version 89060 (0.0006) [2023-03-06 18:36:39,831][23882] Updated weights for policy 0, policy_version 89070 (0.0007) [2023-03-06 18:36:40,605][23882] Updated weights for policy 0, policy_version 89080 (0.0006) [2023-03-06 18:36:41,384][23882] Updated weights for policy 0, policy_version 89090 (0.0007) [2023-03-06 18:36:41,747][23556] Fps is (10 sec: 13005.0, 60 sec: 13039.0, 300 sec: 13030.8). Total num frames: 91232256. Throughput: 0: 13050.1. Samples: 91230963. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:36:41,748][23556] Avg episode reward: [(0, '1981.195')] [2023-03-06 18:36:42,166][23882] Updated weights for policy 0, policy_version 89100 (0.0006) [2023-03-06 18:36:42,946][23882] Updated weights for policy 0, policy_version 89110 (0.0006) [2023-03-06 18:36:43,722][23882] Updated weights for policy 0, policy_version 89120 (0.0006) [2023-03-06 18:36:44,504][23882] Updated weights for policy 0, policy_version 89130 (0.0006) [2023-03-06 18:36:45,285][23882] Updated weights for policy 0, policy_version 89140 (0.0006) [2023-03-06 18:36:46,067][23882] Updated weights for policy 0, policy_version 89150 (0.0007) [2023-03-06 18:36:46,748][23556] Fps is (10 sec: 13005.0, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 91297792. Throughput: 0: 13048.3. Samples: 91270283. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:36:46,748][23556] Avg episode reward: [(0, '1938.249')] [2023-03-06 18:36:46,837][23882] Updated weights for policy 0, policy_version 89160 (0.0006) [2023-03-06 18:36:47,628][23882] Updated weights for policy 0, policy_version 89170 (0.0007) [2023-03-06 18:36:48,420][23882] Updated weights for policy 0, policy_version 89180 (0.0005) [2023-03-06 18:36:49,207][23882] Updated weights for policy 0, policy_version 89190 (0.0007) [2023-03-06 18:36:49,981][23882] Updated weights for policy 0, policy_version 89200 (0.0006) [2023-03-06 18:36:50,778][23882] Updated weights for policy 0, policy_version 89210 (0.0006) [2023-03-06 18:36:51,553][23882] Updated weights for policy 0, policy_version 89220 (0.0006) [2023-03-06 18:36:51,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13034.3). Total num frames: 91363328. Throughput: 0: 13052.0. Samples: 91348621. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:36:51,748][23556] Avg episode reward: [(0, '2016.392')] [2023-03-06 18:36:52,345][23882] Updated weights for policy 0, policy_version 89230 (0.0007) [2023-03-06 18:36:53,114][23882] Updated weights for policy 0, policy_version 89240 (0.0006) [2023-03-06 18:36:53,900][23882] Updated weights for policy 0, policy_version 89250 (0.0006) [2023-03-06 18:36:54,672][23882] Updated weights for policy 0, policy_version 89260 (0.0006) [2023-03-06 18:36:55,463][23882] Updated weights for policy 0, policy_version 89270 (0.0005) [2023-03-06 18:36:56,246][23882] Updated weights for policy 0, policy_version 89280 (0.0007) [2023-03-06 18:36:56,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 91428864. Throughput: 0: 13058.4. Samples: 91427199. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:36:56,748][23556] Avg episode reward: [(0, '2003.944')] [2023-03-06 18:36:57,052][23882] Updated weights for policy 0, policy_version 89290 (0.0006) [2023-03-06 18:36:57,833][23882] Updated weights for policy 0, policy_version 89300 (0.0006) [2023-03-06 18:36:58,617][23882] Updated weights for policy 0, policy_version 89310 (0.0005) [2023-03-06 18:36:59,397][23882] Updated weights for policy 0, policy_version 89320 (0.0005) [2023-03-06 18:37:00,182][23882] Updated weights for policy 0, policy_version 89330 (0.0007) [2023-03-06 18:37:00,974][23882] Updated weights for policy 0, policy_version 89340 (0.0007) [2023-03-06 18:37:01,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 91493376. Throughput: 0: 13055.7. Samples: 91466103. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:37:01,748][23556] Avg episode reward: [(0, '1771.790')] [2023-03-06 18:37:01,762][23882] Updated weights for policy 0, policy_version 89350 (0.0006) [2023-03-06 18:37:02,546][23882] Updated weights for policy 0, policy_version 89360 (0.0007) [2023-03-06 18:37:03,333][23882] Updated weights for policy 0, policy_version 89370 (0.0006) [2023-03-06 18:37:04,118][23882] Updated weights for policy 0, policy_version 89380 (0.0005) [2023-03-06 18:37:04,901][23882] Updated weights for policy 0, policy_version 89390 (0.0007) [2023-03-06 18:37:05,693][23882] Updated weights for policy 0, policy_version 89400 (0.0007) [2023-03-06 18:37:06,482][23882] Updated weights for policy 0, policy_version 89410 (0.0006) [2023-03-06 18:37:06,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13034.3). Total num frames: 91558912. Throughput: 0: 13043.9. Samples: 91544148. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:37:06,748][23556] Avg episode reward: [(0, '2077.968')] [2023-03-06 18:37:07,269][23882] Updated weights for policy 0, policy_version 89420 (0.0006) [2023-03-06 18:37:08,055][23882] Updated weights for policy 0, policy_version 89430 (0.0007) [2023-03-06 18:37:08,858][23882] Updated weights for policy 0, policy_version 89440 (0.0006) [2023-03-06 18:37:09,655][23882] Updated weights for policy 0, policy_version 89450 (0.0006) [2023-03-06 18:37:10,449][23882] Updated weights for policy 0, policy_version 89460 (0.0007) [2023-03-06 18:37:11,274][23882] Updated weights for policy 0, policy_version 89470 (0.0007) [2023-03-06 18:37:11,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 91623424. Throughput: 0: 13021.0. Samples: 91621578. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:37:11,748][23556] Avg episode reward: [(0, '2020.995')] [2023-03-06 18:37:12,047][23882] Updated weights for policy 0, policy_version 89480 (0.0006) [2023-03-06 18:37:12,840][23882] Updated weights for policy 0, policy_version 89490 (0.0007) [2023-03-06 18:37:13,625][23882] Updated weights for policy 0, policy_version 89500 (0.0007) [2023-03-06 18:37:14,422][23882] Updated weights for policy 0, policy_version 89510 (0.0006) [2023-03-06 18:37:15,193][23882] Updated weights for policy 0, policy_version 89520 (0.0006) [2023-03-06 18:37:15,962][23882] Updated weights for policy 0, policy_version 89530 (0.0006) [2023-03-06 18:37:16,747][23882] Updated weights for policy 0, policy_version 89540 (0.0006) [2023-03-06 18:37:16,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13039.0, 300 sec: 13037.8). Total num frames: 91688960. Throughput: 0: 13019.4. Samples: 91660685. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:37:16,748][23556] Avg episode reward: [(0, '2020.672')] [2023-03-06 18:37:17,544][23882] Updated weights for policy 0, policy_version 89550 (0.0007) [2023-03-06 18:37:18,322][23882] Updated weights for policy 0, policy_version 89560 (0.0006) [2023-03-06 18:37:19,121][23882] Updated weights for policy 0, policy_version 89570 (0.0006) [2023-03-06 18:37:19,899][23882] Updated weights for policy 0, policy_version 89580 (0.0007) [2023-03-06 18:37:20,678][23882] Updated weights for policy 0, policy_version 89590 (0.0006) [2023-03-06 18:37:21,475][23882] Updated weights for policy 0, policy_version 89600 (0.0006) [2023-03-06 18:37:21,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 91753472. Throughput: 0: 13029.7. Samples: 91738987. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:37:21,748][23556] Avg episode reward: [(0, '2092.314')] [2023-03-06 18:37:22,246][23882] Updated weights for policy 0, policy_version 89610 (0.0006) [2023-03-06 18:37:23,026][23882] Updated weights for policy 0, policy_version 89620 (0.0007) [2023-03-06 18:37:23,820][23882] Updated weights for policy 0, policy_version 89630 (0.0007) [2023-03-06 18:37:24,600][23882] Updated weights for policy 0, policy_version 89640 (0.0006) [2023-03-06 18:37:25,375][23882] Updated weights for policy 0, policy_version 89650 (0.0006) [2023-03-06 18:37:26,182][23882] Updated weights for policy 0, policy_version 89660 (0.0005) [2023-03-06 18:37:26,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 91819008. Throughput: 0: 13028.2. Samples: 91817231. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:37:26,748][23556] Avg episode reward: [(0, '2114.821')] [2023-03-06 18:37:26,979][23882] Updated weights for policy 0, policy_version 89670 (0.0006) [2023-03-06 18:37:27,769][23882] Updated weights for policy 0, policy_version 89680 (0.0006) [2023-03-06 18:37:28,554][23882] Updated weights for policy 0, policy_version 89690 (0.0006) [2023-03-06 18:37:29,361][23882] Updated weights for policy 0, policy_version 89700 (0.0007) [2023-03-06 18:37:30,147][23882] Updated weights for policy 0, policy_version 89710 (0.0007) [2023-03-06 18:37:30,922][23882] Updated weights for policy 0, policy_version 89720 (0.0006) [2023-03-06 18:37:31,718][23882] Updated weights for policy 0, policy_version 89730 (0.0006) [2023-03-06 18:37:31,748][23556] Fps is (10 sec: 13004.6, 60 sec: 13021.9, 300 sec: 13030.8). Total num frames: 91883520. Throughput: 0: 13015.8. Samples: 91855996. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:37:31,748][23556] Avg episode reward: [(0, '2193.709')] [2023-03-06 18:37:32,496][23882] Updated weights for policy 0, policy_version 89740 (0.0008) [2023-03-06 18:37:33,279][23882] Updated weights for policy 0, policy_version 89750 (0.0006) [2023-03-06 18:37:34,065][23882] Updated weights for policy 0, policy_version 89760 (0.0006) [2023-03-06 18:37:34,859][23882] Updated weights for policy 0, policy_version 89770 (0.0006) [2023-03-06 18:37:35,637][23882] Updated weights for policy 0, policy_version 89780 (0.0005) [2023-03-06 18:37:36,417][23882] Updated weights for policy 0, policy_version 89790 (0.0006) [2023-03-06 18:37:36,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 91949056. Throughput: 0: 13009.1. Samples: 91934030. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:37:36,759][23556] Avg episode reward: [(0, '1852.713')] [2023-03-06 18:37:37,199][23882] Updated weights for policy 0, policy_version 89800 (0.0006) [2023-03-06 18:37:37,987][23882] Updated weights for policy 0, policy_version 89810 (0.0006) [2023-03-06 18:37:38,775][23882] Updated weights for policy 0, policy_version 89820 (0.0007) [2023-03-06 18:37:39,543][23882] Updated weights for policy 0, policy_version 89830 (0.0007) [2023-03-06 18:37:40,342][23882] Updated weights for policy 0, policy_version 89840 (0.0006) [2023-03-06 18:37:41,136][23882] Updated weights for policy 0, policy_version 89850 (0.0007) [2023-03-06 18:37:41,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13021.8, 300 sec: 13034.3). Total num frames: 92013568. Throughput: 0: 12997.7. Samples: 92012095. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:37:41,754][23556] Avg episode reward: [(0, '2076.374')] [2023-03-06 18:37:41,922][23882] Updated weights for policy 0, policy_version 89860 (0.0006) [2023-03-06 18:37:42,719][23882] Updated weights for policy 0, policy_version 89870 (0.0006) [2023-03-06 18:37:43,489][23882] Updated weights for policy 0, policy_version 89880 (0.0007) [2023-03-06 18:37:44,302][23882] Updated weights for policy 0, policy_version 89890 (0.0006) [2023-03-06 18:37:45,099][23882] Updated weights for policy 0, policy_version 89900 (0.0007) [2023-03-06 18:37:45,883][23882] Updated weights for policy 0, policy_version 89910 (0.0007) [2023-03-06 18:37:46,657][23882] Updated weights for policy 0, policy_version 89920 (0.0006) [2023-03-06 18:37:46,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13037.8). Total num frames: 92079104. Throughput: 0: 13002.2. Samples: 92051201. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:37:46,759][23556] Avg episode reward: [(0, '2130.170')] [2023-03-06 18:37:47,445][23882] Updated weights for policy 0, policy_version 89930 (0.0007) [2023-03-06 18:37:48,232][23882] Updated weights for policy 0, policy_version 89940 (0.0006) [2023-03-06 18:37:49,032][23882] Updated weights for policy 0, policy_version 89950 (0.0007) [2023-03-06 18:37:49,821][23882] Updated weights for policy 0, policy_version 89960 (0.0007) [2023-03-06 18:37:50,619][23882] Updated weights for policy 0, policy_version 89970 (0.0006) [2023-03-06 18:37:51,413][23882] Updated weights for policy 0, policy_version 89980 (0.0007) [2023-03-06 18:37:51,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13034.3). Total num frames: 92143616. Throughput: 0: 12995.0. Samples: 92128925. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:37:51,748][23556] Avg episode reward: [(0, '1976.765')] [2023-03-06 18:37:52,192][23882] Updated weights for policy 0, policy_version 89990 (0.0005) [2023-03-06 18:37:52,989][23882] Updated weights for policy 0, policy_version 90000 (0.0006) [2023-03-06 18:37:53,778][23882] Updated weights for policy 0, policy_version 90010 (0.0006) [2023-03-06 18:37:54,574][23882] Updated weights for policy 0, policy_version 90020 (0.0006) [2023-03-06 18:37:55,357][23882] Updated weights for policy 0, policy_version 90030 (0.0006) [2023-03-06 18:37:56,148][23882] Updated weights for policy 0, policy_version 90040 (0.0006) [2023-03-06 18:37:56,748][23556] Fps is (10 sec: 12902.3, 60 sec: 12987.7, 300 sec: 13030.8). Total num frames: 92208128. Throughput: 0: 13003.4. Samples: 92206731. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:37:56,748][23556] Avg episode reward: [(0, '2133.799')] [2023-03-06 18:37:56,753][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000090047_92208128.pth... [2023-03-06 18:37:56,786][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000086994_89081856.pth [2023-03-06 18:37:56,944][23882] Updated weights for policy 0, policy_version 90050 (0.0006) [2023-03-06 18:37:57,733][23882] Updated weights for policy 0, policy_version 90060 (0.0006) [2023-03-06 18:37:58,510][23882] Updated weights for policy 0, policy_version 90070 (0.0006) [2023-03-06 18:37:59,289][23882] Updated weights for policy 0, policy_version 90080 (0.0006) [2023-03-06 18:38:00,081][23882] Updated weights for policy 0, policy_version 90090 (0.0006) [2023-03-06 18:38:00,865][23882] Updated weights for policy 0, policy_version 90100 (0.0006) [2023-03-06 18:38:01,663][23882] Updated weights for policy 0, policy_version 90110 (0.0007) [2023-03-06 18:38:01,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13034.3). Total num frames: 92273664. Throughput: 0: 13001.8. Samples: 92245766. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:38:01,748][23556] Avg episode reward: [(0, '2087.430')] [2023-03-06 18:38:02,443][23882] Updated weights for policy 0, policy_version 90120 (0.0006) [2023-03-06 18:38:03,230][23882] Updated weights for policy 0, policy_version 90130 (0.0006) [2023-03-06 18:38:04,036][23882] Updated weights for policy 0, policy_version 90140 (0.0007) [2023-03-06 18:38:04,817][23882] Updated weights for policy 0, policy_version 90150 (0.0008) [2023-03-06 18:38:05,595][23882] Updated weights for policy 0, policy_version 90160 (0.0007) [2023-03-06 18:38:06,365][23882] Updated weights for policy 0, policy_version 90170 (0.0006) [2023-03-06 18:38:06,748][23556] Fps is (10 sec: 13004.9, 60 sec: 12987.7, 300 sec: 13030.8). Total num frames: 92338176. Throughput: 0: 12996.0. Samples: 92323809. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:38:06,748][23556] Avg episode reward: [(0, '2109.920')] [2023-03-06 18:38:07,167][23882] Updated weights for policy 0, policy_version 90180 (0.0006) [2023-03-06 18:38:07,965][23882] Updated weights for policy 0, policy_version 90190 (0.0007) [2023-03-06 18:38:08,752][23882] Updated weights for policy 0, policy_version 90200 (0.0006) [2023-03-06 18:38:09,545][23882] Updated weights for policy 0, policy_version 90210 (0.0006) [2023-03-06 18:38:10,322][23882] Updated weights for policy 0, policy_version 90220 (0.0005) [2023-03-06 18:38:11,122][23882] Updated weights for policy 0, policy_version 90230 (0.0006) [2023-03-06 18:38:11,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13030.8). Total num frames: 92403712. Throughput: 0: 12988.3. Samples: 92401707. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:38:11,748][23556] Avg episode reward: [(0, '2024.942')] [2023-03-06 18:38:11,905][23882] Updated weights for policy 0, policy_version 90240 (0.0006) [2023-03-06 18:38:12,695][23882] Updated weights for policy 0, policy_version 90250 (0.0007) [2023-03-06 18:38:13,484][23882] Updated weights for policy 0, policy_version 90260 (0.0007) [2023-03-06 18:38:14,282][23882] Updated weights for policy 0, policy_version 90270 (0.0007) [2023-03-06 18:38:15,071][23882] Updated weights for policy 0, policy_version 90280 (0.0006) [2023-03-06 18:38:15,822][23882] Updated weights for policy 0, policy_version 90290 (0.0006) [2023-03-06 18:38:16,613][23882] Updated weights for policy 0, policy_version 90300 (0.0006) [2023-03-06 18:38:16,748][23556] Fps is (10 sec: 13004.9, 60 sec: 12987.7, 300 sec: 13030.8). Total num frames: 92468224. Throughput: 0: 12990.4. Samples: 92440564. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:38:16,748][23556] Avg episode reward: [(0, '1832.601')] [2023-03-06 18:38:17,395][23882] Updated weights for policy 0, policy_version 90310 (0.0006) [2023-03-06 18:38:18,171][23882] Updated weights for policy 0, policy_version 90320 (0.0006) [2023-03-06 18:38:18,954][23882] Updated weights for policy 0, policy_version 90330 (0.0006) [2023-03-06 18:38:19,757][23882] Updated weights for policy 0, policy_version 90340 (0.0007) [2023-03-06 18:38:20,537][23882] Updated weights for policy 0, policy_version 90350 (0.0006) [2023-03-06 18:38:21,307][23882] Updated weights for policy 0, policy_version 90360 (0.0006) [2023-03-06 18:38:21,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13004.8, 300 sec: 13030.8). Total num frames: 92533760. Throughput: 0: 13001.7. Samples: 92519109. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:38:21,759][23556] Avg episode reward: [(0, '1993.807')] [2023-03-06 18:38:22,113][23882] Updated weights for policy 0, policy_version 90370 (0.0006) [2023-03-06 18:38:22,890][23882] Updated weights for policy 0, policy_version 90380 (0.0006) [2023-03-06 18:38:23,669][23882] Updated weights for policy 0, policy_version 90390 (0.0007) [2023-03-06 18:38:24,448][23882] Updated weights for policy 0, policy_version 90400 (0.0007) [2023-03-06 18:38:25,224][23882] Updated weights for policy 0, policy_version 90410 (0.0007) [2023-03-06 18:38:26,010][23882] Updated weights for policy 0, policy_version 90420 (0.0007) [2023-03-06 18:38:26,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13004.8, 300 sec: 13030.8). Total num frames: 92599296. Throughput: 0: 13006.5. Samples: 92597388. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:38:26,754][23556] Avg episode reward: [(0, '2057.654')] [2023-03-06 18:38:26,802][23882] Updated weights for policy 0, policy_version 90430 (0.0006) [2023-03-06 18:38:27,601][23882] Updated weights for policy 0, policy_version 90440 (0.0007) [2023-03-06 18:38:28,388][23882] Updated weights for policy 0, policy_version 90450 (0.0006) [2023-03-06 18:38:29,182][23882] Updated weights for policy 0, policy_version 90460 (0.0006) [2023-03-06 18:38:29,973][23882] Updated weights for policy 0, policy_version 90470 (0.0006) [2023-03-06 18:38:30,743][23882] Updated weights for policy 0, policy_version 90480 (0.0006) [2023-03-06 18:38:31,529][23882] Updated weights for policy 0, policy_version 90490 (0.0006) [2023-03-06 18:38:31,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13030.8). Total num frames: 92663808. Throughput: 0: 13005.2. Samples: 92636434. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:38:31,759][23556] Avg episode reward: [(0, '2060.125')] [2023-03-06 18:38:32,324][23882] Updated weights for policy 0, policy_version 90500 (0.0006) [2023-03-06 18:38:33,106][23882] Updated weights for policy 0, policy_version 90510 (0.0005) [2023-03-06 18:38:33,900][23882] Updated weights for policy 0, policy_version 90520 (0.0008) [2023-03-06 18:38:34,700][23882] Updated weights for policy 0, policy_version 90530 (0.0006) [2023-03-06 18:38:35,496][23882] Updated weights for policy 0, policy_version 90540 (0.0008) [2023-03-06 18:38:36,280][23882] Updated weights for policy 0, policy_version 90550 (0.0006) [2023-03-06 18:38:36,748][23556] Fps is (10 sec: 12902.4, 60 sec: 12987.7, 300 sec: 13027.4). Total num frames: 92728320. Throughput: 0: 13007.8. Samples: 92714275. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:38:36,759][23556] Avg episode reward: [(0, '2093.374')] [2023-03-06 18:38:37,078][23882] Updated weights for policy 0, policy_version 90560 (0.0007) [2023-03-06 18:38:37,875][23882] Updated weights for policy 0, policy_version 90570 (0.0006) [2023-03-06 18:38:38,673][23882] Updated weights for policy 0, policy_version 90580 (0.0006) [2023-03-06 18:38:39,450][23882] Updated weights for policy 0, policy_version 90590 (0.0006) [2023-03-06 18:38:40,237][23882] Updated weights for policy 0, policy_version 90600 (0.0006) [2023-03-06 18:38:41,034][23882] Updated weights for policy 0, policy_version 90610 (0.0006) [2023-03-06 18:38:41,748][23556] Fps is (10 sec: 12902.3, 60 sec: 12987.7, 300 sec: 13023.9). Total num frames: 92792832. Throughput: 0: 13005.7. Samples: 92791988. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:38:41,754][23556] Avg episode reward: [(0, '2103.933')] [2023-03-06 18:38:41,820][23882] Updated weights for policy 0, policy_version 90620 (0.0006) [2023-03-06 18:38:42,600][23882] Updated weights for policy 0, policy_version 90630 (0.0006) [2023-03-06 18:38:43,410][23882] Updated weights for policy 0, policy_version 90640 (0.0007) [2023-03-06 18:38:44,206][23882] Updated weights for policy 0, policy_version 90650 (0.0006) [2023-03-06 18:38:44,998][23882] Updated weights for policy 0, policy_version 90660 (0.0007) [2023-03-06 18:38:45,770][23882] Updated weights for policy 0, policy_version 90670 (0.0007) [2023-03-06 18:38:46,585][23882] Updated weights for policy 0, policy_version 90680 (0.0007) [2023-03-06 18:38:46,748][23556] Fps is (10 sec: 13004.7, 60 sec: 12987.7, 300 sec: 13023.9). Total num frames: 92858368. Throughput: 0: 12993.3. Samples: 92830468. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:38:46,759][23556] Avg episode reward: [(0, '1954.363')] [2023-03-06 18:38:47,374][23882] Updated weights for policy 0, policy_version 90690 (0.0007) [2023-03-06 18:38:48,149][23882] Updated weights for policy 0, policy_version 90700 (0.0006) [2023-03-06 18:38:48,927][23882] Updated weights for policy 0, policy_version 90710 (0.0007) [2023-03-06 18:38:49,730][23882] Updated weights for policy 0, policy_version 90720 (0.0006) [2023-03-06 18:38:50,502][23882] Updated weights for policy 0, policy_version 90730 (0.0007) [2023-03-06 18:38:51,263][23882] Updated weights for policy 0, policy_version 90740 (0.0006) [2023-03-06 18:38:51,748][23556] Fps is (10 sec: 13005.0, 60 sec: 12987.7, 300 sec: 13020.4). Total num frames: 92922880. Throughput: 0: 12997.5. Samples: 92908694. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:38:51,759][23556] Avg episode reward: [(0, '2053.492')] [2023-03-06 18:38:52,068][23882] Updated weights for policy 0, policy_version 90750 (0.0006) [2023-03-06 18:38:52,842][23882] Updated weights for policy 0, policy_version 90760 (0.0006) [2023-03-06 18:38:53,624][23882] Updated weights for policy 0, policy_version 90770 (0.0006) [2023-03-06 18:38:54,410][23882] Updated weights for policy 0, policy_version 90780 (0.0006) [2023-03-06 18:38:55,203][23882] Updated weights for policy 0, policy_version 90790 (0.0006) [2023-03-06 18:38:55,982][23882] Updated weights for policy 0, policy_version 90800 (0.0007) [2023-03-06 18:38:56,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13004.8, 300 sec: 13023.9). Total num frames: 92988416. Throughput: 0: 13005.2. Samples: 92986941. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:38:56,758][23556] Avg episode reward: [(0, '1950.137')] [2023-03-06 18:38:56,788][23882] Updated weights for policy 0, policy_version 90810 (0.0006) [2023-03-06 18:38:57,565][23882] Updated weights for policy 0, policy_version 90820 (0.0006) [2023-03-06 18:38:58,345][23882] Updated weights for policy 0, policy_version 90830 (0.0006) [2023-03-06 18:38:59,135][23882] Updated weights for policy 0, policy_version 90840 (0.0006) [2023-03-06 18:38:59,928][23882] Updated weights for policy 0, policy_version 90850 (0.0006) [2023-03-06 18:39:00,695][23882] Updated weights for policy 0, policy_version 90860 (0.0006) [2023-03-06 18:39:01,506][23882] Updated weights for policy 0, policy_version 90870 (0.0007) [2023-03-06 18:39:01,748][23556] Fps is (10 sec: 13107.0, 60 sec: 13004.8, 300 sec: 13023.9). Total num frames: 93053952. Throughput: 0: 13010.2. Samples: 93026025. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:39:01,759][23556] Avg episode reward: [(0, '2213.808')] [2023-03-06 18:39:02,294][23882] Updated weights for policy 0, policy_version 90880 (0.0007) [2023-03-06 18:39:03,079][23882] Updated weights for policy 0, policy_version 90890 (0.0006) [2023-03-06 18:39:03,853][23882] Updated weights for policy 0, policy_version 90900 (0.0007) [2023-03-06 18:39:04,621][23882] Updated weights for policy 0, policy_version 90910 (0.0007) [2023-03-06 18:39:05,405][23882] Updated weights for policy 0, policy_version 90920 (0.0006) [2023-03-06 18:39:06,204][23882] Updated weights for policy 0, policy_version 90930 (0.0005) [2023-03-06 18:39:06,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13021.9, 300 sec: 13023.9). Total num frames: 93119488. Throughput: 0: 13007.8. Samples: 93104458. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:39:06,748][23556] Avg episode reward: [(0, '2129.072')] [2023-03-06 18:39:06,987][23882] Updated weights for policy 0, policy_version 90940 (0.0006) [2023-03-06 18:39:07,792][23882] Updated weights for policy 0, policy_version 90950 (0.0007) [2023-03-06 18:39:08,574][23882] Updated weights for policy 0, policy_version 90960 (0.0007) [2023-03-06 18:39:09,353][23882] Updated weights for policy 0, policy_version 90970 (0.0006) [2023-03-06 18:39:10,134][23882] Updated weights for policy 0, policy_version 90980 (0.0005) [2023-03-06 18:39:10,923][23882] Updated weights for policy 0, policy_version 90990 (0.0006) [2023-03-06 18:39:11,714][23882] Updated weights for policy 0, policy_version 91000 (0.0007) [2023-03-06 18:39:11,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13004.8, 300 sec: 13023.9). Total num frames: 93184000. Throughput: 0: 12999.6. Samples: 93182371. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:39:11,748][23556] Avg episode reward: [(0, '2064.510')] [2023-03-06 18:39:12,496][23882] Updated weights for policy 0, policy_version 91010 (0.0007) [2023-03-06 18:39:13,303][23882] Updated weights for policy 0, policy_version 91020 (0.0006) [2023-03-06 18:39:14,076][23882] Updated weights for policy 0, policy_version 91030 (0.0006) [2023-03-06 18:39:14,858][23882] Updated weights for policy 0, policy_version 91040 (0.0006) [2023-03-06 18:39:15,633][23882] Updated weights for policy 0, policy_version 91050 (0.0006) [2023-03-06 18:39:16,429][23882] Updated weights for policy 0, policy_version 91060 (0.0008) [2023-03-06 18:39:16,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13023.9). Total num frames: 93249536. Throughput: 0: 12999.4. Samples: 93221408. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:39:16,748][23556] Avg episode reward: [(0, '2011.141')] [2023-03-06 18:39:17,213][23882] Updated weights for policy 0, policy_version 91070 (0.0006) [2023-03-06 18:39:17,980][23882] Updated weights for policy 0, policy_version 91080 (0.0006) [2023-03-06 18:39:18,796][23882] Updated weights for policy 0, policy_version 91090 (0.0006) [2023-03-06 18:39:19,570][23882] Updated weights for policy 0, policy_version 91100 (0.0006) [2023-03-06 18:39:20,354][23882] Updated weights for policy 0, policy_version 91110 (0.0006) [2023-03-06 18:39:21,142][23882] Updated weights for policy 0, policy_version 91120 (0.0007) [2023-03-06 18:39:21,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13020.4). Total num frames: 93314048. Throughput: 0: 13009.7. Samples: 93299711. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:39:21,748][23556] Avg episode reward: [(0, '2041.236')] [2023-03-06 18:39:21,935][23882] Updated weights for policy 0, policy_version 91130 (0.0006) [2023-03-06 18:39:22,724][23882] Updated weights for policy 0, policy_version 91140 (0.0006) [2023-03-06 18:39:23,493][23882] Updated weights for policy 0, policy_version 91150 (0.0006) [2023-03-06 18:39:24,274][23882] Updated weights for policy 0, policy_version 91160 (0.0006) [2023-03-06 18:39:25,052][23882] Updated weights for policy 0, policy_version 91170 (0.0006) [2023-03-06 18:39:25,847][23882] Updated weights for policy 0, policy_version 91180 (0.0005) [2023-03-06 18:39:26,629][23882] Updated weights for policy 0, policy_version 91190 (0.0007) [2023-03-06 18:39:26,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13023.9). Total num frames: 93379584. Throughput: 0: 13023.2. Samples: 93378031. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:39:26,748][23556] Avg episode reward: [(0, '2089.019')] [2023-03-06 18:39:27,418][23882] Updated weights for policy 0, policy_version 91200 (0.0006) [2023-03-06 18:39:28,214][23882] Updated weights for policy 0, policy_version 91210 (0.0006) [2023-03-06 18:39:28,989][23882] Updated weights for policy 0, policy_version 91220 (0.0006) [2023-03-06 18:39:29,771][23882] Updated weights for policy 0, policy_version 91230 (0.0006) [2023-03-06 18:39:30,562][23882] Updated weights for policy 0, policy_version 91240 (0.0006) [2023-03-06 18:39:31,358][23882] Updated weights for policy 0, policy_version 91250 (0.0006) [2023-03-06 18:39:31,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13020.4). Total num frames: 93444096. Throughput: 0: 13035.3. Samples: 93417056. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:39:31,748][23556] Avg episode reward: [(0, '2001.203')] [2023-03-06 18:39:32,158][23882] Updated weights for policy 0, policy_version 91260 (0.0007) [2023-03-06 18:39:32,934][23882] Updated weights for policy 0, policy_version 91270 (0.0007) [2023-03-06 18:39:33,733][23882] Updated weights for policy 0, policy_version 91280 (0.0006) [2023-03-06 18:39:34,524][23882] Updated weights for policy 0, policy_version 91290 (0.0006) [2023-03-06 18:39:35,296][23882] Updated weights for policy 0, policy_version 91300 (0.0006) [2023-03-06 18:39:36,072][23882] Updated weights for policy 0, policy_version 91310 (0.0007) [2023-03-06 18:39:36,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13023.9). Total num frames: 93509632. Throughput: 0: 13030.5. Samples: 93495068. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:39:36,748][23556] Avg episode reward: [(0, '2038.955')] [2023-03-06 18:39:36,867][23882] Updated weights for policy 0, policy_version 91320 (0.0006) [2023-03-06 18:39:37,657][23882] Updated weights for policy 0, policy_version 91330 (0.0006) [2023-03-06 18:39:38,441][23882] Updated weights for policy 0, policy_version 91340 (0.0006) [2023-03-06 18:39:39,229][23882] Updated weights for policy 0, policy_version 91350 (0.0006) [2023-03-06 18:39:40,017][23882] Updated weights for policy 0, policy_version 91360 (0.0006) [2023-03-06 18:39:40,794][23882] Updated weights for policy 0, policy_version 91370 (0.0006) [2023-03-06 18:39:41,576][23882] Updated weights for policy 0, policy_version 91380 (0.0007) [2023-03-06 18:39:41,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13023.9). Total num frames: 93575168. Throughput: 0: 13031.1. Samples: 93573341. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:39:41,748][23556] Avg episode reward: [(0, '2134.838')] [2023-03-06 18:39:42,349][23882] Updated weights for policy 0, policy_version 91390 (0.0006) [2023-03-06 18:39:43,155][23882] Updated weights for policy 0, policy_version 91400 (0.0006) [2023-03-06 18:39:43,945][23882] Updated weights for policy 0, policy_version 91410 (0.0006) [2023-03-06 18:39:44,729][23882] Updated weights for policy 0, policy_version 91420 (0.0008) [2023-03-06 18:39:45,495][23882] Updated weights for policy 0, policy_version 91430 (0.0005) [2023-03-06 18:39:46,289][23882] Updated weights for policy 0, policy_version 91440 (0.0006) [2023-03-06 18:39:46,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 93639680. Throughput: 0: 13028.9. Samples: 93612323. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:39:46,748][23556] Avg episode reward: [(0, '2200.534')] [2023-03-06 18:39:47,077][23882] Updated weights for policy 0, policy_version 91450 (0.0006) [2023-03-06 18:39:47,855][23882] Updated weights for policy 0, policy_version 91460 (0.0006) [2023-03-06 18:39:48,641][23882] Updated weights for policy 0, policy_version 91470 (0.0006) [2023-03-06 18:39:49,430][23882] Updated weights for policy 0, policy_version 91480 (0.0007) [2023-03-06 18:39:50,216][23882] Updated weights for policy 0, policy_version 91490 (0.0008) [2023-03-06 18:39:50,993][23882] Updated weights for policy 0, policy_version 91500 (0.0007) [2023-03-06 18:39:51,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13023.9). Total num frames: 93705216. Throughput: 0: 13028.2. Samples: 93690727. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:39:51,748][23556] Avg episode reward: [(0, '2159.401')] [2023-03-06 18:39:51,774][23882] Updated weights for policy 0, policy_version 91510 (0.0006) [2023-03-06 18:39:52,591][23882] Updated weights for policy 0, policy_version 91520 (0.0006) [2023-03-06 18:39:53,377][23882] Updated weights for policy 0, policy_version 91530 (0.0006) [2023-03-06 18:39:54,160][23882] Updated weights for policy 0, policy_version 91540 (0.0006) [2023-03-06 18:39:54,938][23882] Updated weights for policy 0, policy_version 91550 (0.0007) [2023-03-06 18:39:55,727][23882] Updated weights for policy 0, policy_version 91560 (0.0006) [2023-03-06 18:39:56,518][23882] Updated weights for policy 0, policy_version 91570 (0.0006) [2023-03-06 18:39:56,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 93769728. Throughput: 0: 13029.2. Samples: 93768687. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:39:56,748][23556] Avg episode reward: [(0, '2034.551')] [2023-03-06 18:39:56,755][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000091573_93770752.pth... [2023-03-06 18:39:56,785][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000088521_90645504.pth [2023-03-06 18:39:57,318][23882] Updated weights for policy 0, policy_version 91580 (0.0006) [2023-03-06 18:39:58,083][23882] Updated weights for policy 0, policy_version 91590 (0.0006) [2023-03-06 18:39:58,891][23882] Updated weights for policy 0, policy_version 91600 (0.0007) [2023-03-06 18:39:59,675][23882] Updated weights for policy 0, policy_version 91610 (0.0007) [2023-03-06 18:40:00,453][23882] Updated weights for policy 0, policy_version 91620 (0.0006) [2023-03-06 18:40:01,247][23882] Updated weights for policy 0, policy_version 91630 (0.0006) [2023-03-06 18:40:01,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 93835264. Throughput: 0: 13028.2. Samples: 93807680. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:40:01,748][23556] Avg episode reward: [(0, '1909.753')] [2023-03-06 18:40:02,037][23882] Updated weights for policy 0, policy_version 91640 (0.0006) [2023-03-06 18:40:02,822][23882] Updated weights for policy 0, policy_version 91650 (0.0006) [2023-03-06 18:40:03,609][23882] Updated weights for policy 0, policy_version 91660 (0.0006) [2023-03-06 18:40:04,386][23882] Updated weights for policy 0, policy_version 91670 (0.0006) [2023-03-06 18:40:05,176][23882] Updated weights for policy 0, policy_version 91680 (0.0006) [2023-03-06 18:40:05,977][23882] Updated weights for policy 0, policy_version 91690 (0.0006) [2023-03-06 18:40:06,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13004.8, 300 sec: 13020.4). Total num frames: 93899776. Throughput: 0: 13024.2. Samples: 93885798. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:40:06,748][23556] Avg episode reward: [(0, '2221.560')] [2023-03-06 18:40:06,750][23882] Updated weights for policy 0, policy_version 91700 (0.0006) [2023-03-06 18:40:07,549][23882] Updated weights for policy 0, policy_version 91710 (0.0006) [2023-03-06 18:40:08,329][23882] Updated weights for policy 0, policy_version 91720 (0.0006) [2023-03-06 18:40:09,102][23882] Updated weights for policy 0, policy_version 91730 (0.0007) [2023-03-06 18:40:09,912][23882] Updated weights for policy 0, policy_version 91740 (0.0006) [2023-03-06 18:40:10,684][23882] Updated weights for policy 0, policy_version 91750 (0.0006) [2023-03-06 18:40:11,465][23882] Updated weights for policy 0, policy_version 91760 (0.0006) [2023-03-06 18:40:11,748][23556] Fps is (10 sec: 13005.0, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 93965312. Throughput: 0: 13016.5. Samples: 93963775. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:40:11,748][23556] Avg episode reward: [(0, '1975.694')] [2023-03-06 18:40:12,261][23882] Updated weights for policy 0, policy_version 91770 (0.0006) [2023-03-06 18:40:13,065][23882] Updated weights for policy 0, policy_version 91780 (0.0006) [2023-03-06 18:40:13,845][23882] Updated weights for policy 0, policy_version 91790 (0.0006) [2023-03-06 18:40:14,634][23882] Updated weights for policy 0, policy_version 91800 (0.0006) [2023-03-06 18:40:15,439][23882] Updated weights for policy 0, policy_version 91810 (0.0006) [2023-03-06 18:40:16,222][23882] Updated weights for policy 0, policy_version 91820 (0.0007) [2023-03-06 18:40:16,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13017.0). Total num frames: 94029824. Throughput: 0: 13012.3. Samples: 94002607. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:40:16,748][23556] Avg episode reward: [(0, '2032.097')] [2023-03-06 18:40:16,999][23882] Updated weights for policy 0, policy_version 91830 (0.0006) [2023-03-06 18:40:17,784][23882] Updated weights for policy 0, policy_version 91840 (0.0006) [2023-03-06 18:40:18,565][23882] Updated weights for policy 0, policy_version 91850 (0.0006) [2023-03-06 18:40:19,345][23882] Updated weights for policy 0, policy_version 91860 (0.0006) [2023-03-06 18:40:20,129][23882] Updated weights for policy 0, policy_version 91870 (0.0007) [2023-03-06 18:40:20,923][23882] Updated weights for policy 0, policy_version 91880 (0.0006) [2023-03-06 18:40:21,700][23882] Updated weights for policy 0, policy_version 91890 (0.0006) [2023-03-06 18:40:21,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 94095360. Throughput: 0: 13017.4. Samples: 94080853. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:40:21,748][23556] Avg episode reward: [(0, '1908.319')] [2023-03-06 18:40:22,479][23882] Updated weights for policy 0, policy_version 91900 (0.0007) [2023-03-06 18:40:23,268][23882] Updated weights for policy 0, policy_version 91910 (0.0006) [2023-03-06 18:40:24,056][23882] Updated weights for policy 0, policy_version 91920 (0.0006) [2023-03-06 18:40:24,835][23882] Updated weights for policy 0, policy_version 91930 (0.0007) [2023-03-06 18:40:25,622][23882] Updated weights for policy 0, policy_version 91940 (0.0005) [2023-03-06 18:40:26,415][23882] Updated weights for policy 0, policy_version 91950 (0.0006) [2023-03-06 18:40:26,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 94160896. Throughput: 0: 13017.1. Samples: 94159112. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:40:26,748][23556] Avg episode reward: [(0, '2036.036')] [2023-03-06 18:40:27,214][23882] Updated weights for policy 0, policy_version 91960 (0.0006) [2023-03-06 18:40:28,009][23882] Updated weights for policy 0, policy_version 91970 (0.0007) [2023-03-06 18:40:28,795][23882] Updated weights for policy 0, policy_version 91980 (0.0007) [2023-03-06 18:40:29,581][23882] Updated weights for policy 0, policy_version 91990 (0.0006) [2023-03-06 18:40:30,352][23882] Updated weights for policy 0, policy_version 92000 (0.0005) [2023-03-06 18:40:31,154][23882] Updated weights for policy 0, policy_version 92010 (0.0006) [2023-03-06 18:40:31,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13021.8, 300 sec: 13020.4). Total num frames: 94225408. Throughput: 0: 13014.3. Samples: 94197966. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:40:31,748][23556] Avg episode reward: [(0, '1990.489')] [2023-03-06 18:40:31,952][23882] Updated weights for policy 0, policy_version 92020 (0.0006) [2023-03-06 18:40:32,749][23882] Updated weights for policy 0, policy_version 92030 (0.0006) [2023-03-06 18:40:33,526][23882] Updated weights for policy 0, policy_version 92040 (0.0006) [2023-03-06 18:40:34,315][23882] Updated weights for policy 0, policy_version 92050 (0.0006) [2023-03-06 18:40:35,088][23882] Updated weights for policy 0, policy_version 92060 (0.0007) [2023-03-06 18:40:35,871][23882] Updated weights for policy 0, policy_version 92070 (0.0006) [2023-03-06 18:40:36,672][23882] Updated weights for policy 0, policy_version 92080 (0.0007) [2023-03-06 18:40:36,748][23556] Fps is (10 sec: 12902.4, 60 sec: 13004.8, 300 sec: 13016.9). Total num frames: 94289920. Throughput: 0: 13004.3. Samples: 94275919. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:40:36,748][23556] Avg episode reward: [(0, '2067.138')] [2023-03-06 18:40:37,451][23882] Updated weights for policy 0, policy_version 92090 (0.0007) [2023-03-06 18:40:38,244][23882] Updated weights for policy 0, policy_version 92100 (0.0006) [2023-03-06 18:40:39,050][23882] Updated weights for policy 0, policy_version 92110 (0.0007) [2023-03-06 18:40:39,864][23882] Updated weights for policy 0, policy_version 92120 (0.0007) [2023-03-06 18:40:40,623][23882] Updated weights for policy 0, policy_version 92130 (0.0006) [2023-03-06 18:40:41,433][23882] Updated weights for policy 0, policy_version 92140 (0.0006) [2023-03-06 18:40:41,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13004.8, 300 sec: 13017.0). Total num frames: 94355456. Throughput: 0: 12996.4. Samples: 94353525. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:40:41,748][23556] Avg episode reward: [(0, '2047.381')] [2023-03-06 18:40:42,208][23882] Updated weights for policy 0, policy_version 92150 (0.0007) [2023-03-06 18:40:42,994][23882] Updated weights for policy 0, policy_version 92160 (0.0007) [2023-03-06 18:40:43,793][23882] Updated weights for policy 0, policy_version 92170 (0.0007) [2023-03-06 18:40:44,568][23882] Updated weights for policy 0, policy_version 92180 (0.0007) [2023-03-06 18:40:45,346][23882] Updated weights for policy 0, policy_version 92190 (0.0006) [2023-03-06 18:40:46,141][23882] Updated weights for policy 0, policy_version 92200 (0.0006) [2023-03-06 18:40:46,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13004.8, 300 sec: 13017.0). Total num frames: 94419968. Throughput: 0: 12996.5. Samples: 94392523. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:40:46,748][23556] Avg episode reward: [(0, '1992.535')] [2023-03-06 18:40:46,934][23882] Updated weights for policy 0, policy_version 92210 (0.0007) [2023-03-06 18:40:47,710][23882] Updated weights for policy 0, policy_version 92220 (0.0006) [2023-03-06 18:40:48,490][23882] Updated weights for policy 0, policy_version 92230 (0.0006) [2023-03-06 18:40:49,263][23882] Updated weights for policy 0, policy_version 92240 (0.0006) [2023-03-06 18:40:50,088][23882] Updated weights for policy 0, policy_version 92250 (0.0006) [2023-03-06 18:40:50,884][23882] Updated weights for policy 0, policy_version 92260 (0.0007) [2023-03-06 18:40:51,670][23882] Updated weights for policy 0, policy_version 92270 (0.0007) [2023-03-06 18:40:51,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13016.9). Total num frames: 94485504. Throughput: 0: 12993.5. Samples: 94470506. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:40:51,748][23556] Avg episode reward: [(0, '2046.554')] [2023-03-06 18:40:52,449][23882] Updated weights for policy 0, policy_version 92280 (0.0006) [2023-03-06 18:40:53,246][23882] Updated weights for policy 0, policy_version 92290 (0.0006) [2023-03-06 18:40:54,038][23882] Updated weights for policy 0, policy_version 92300 (0.0006) [2023-03-06 18:40:54,821][23882] Updated weights for policy 0, policy_version 92310 (0.0006) [2023-03-06 18:40:55,607][23882] Updated weights for policy 0, policy_version 92320 (0.0006) [2023-03-06 18:40:56,394][23882] Updated weights for policy 0, policy_version 92330 (0.0006) [2023-03-06 18:40:56,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13013.5). Total num frames: 94550016. Throughput: 0: 12993.9. Samples: 94548503. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:40:56,748][23556] Avg episode reward: [(0, '2049.463')] [2023-03-06 18:40:57,187][23882] Updated weights for policy 0, policy_version 92340 (0.0006) [2023-03-06 18:40:57,974][23882] Updated weights for policy 0, policy_version 92350 (0.0006) [2023-03-06 18:40:58,757][23882] Updated weights for policy 0, policy_version 92360 (0.0006) [2023-03-06 18:40:59,535][23882] Updated weights for policy 0, policy_version 92370 (0.0007) [2023-03-06 18:41:00,325][23882] Updated weights for policy 0, policy_version 92380 (0.0007) [2023-03-06 18:41:01,117][23882] Updated weights for policy 0, policy_version 92390 (0.0006) [2023-03-06 18:41:01,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13004.8, 300 sec: 13016.9). Total num frames: 94615552. Throughput: 0: 13001.0. Samples: 94587652. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:41:01,748][23556] Avg episode reward: [(0, '2037.235')] [2023-03-06 18:41:01,888][23882] Updated weights for policy 0, policy_version 92400 (0.0007) [2023-03-06 18:41:02,668][23882] Updated weights for policy 0, policy_version 92410 (0.0006) [2023-03-06 18:41:03,448][23882] Updated weights for policy 0, policy_version 92420 (0.0006) [2023-03-06 18:41:04,246][23882] Updated weights for policy 0, policy_version 92430 (0.0007) [2023-03-06 18:41:05,029][23882] Updated weights for policy 0, policy_version 92440 (0.0007) [2023-03-06 18:41:05,815][23882] Updated weights for policy 0, policy_version 92450 (0.0006) [2023-03-06 18:41:06,616][23882] Updated weights for policy 0, policy_version 92460 (0.0005) [2023-03-06 18:41:06,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13004.8, 300 sec: 13013.5). Total num frames: 94680064. Throughput: 0: 12998.8. Samples: 94665797. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:41:06,748][23556] Avg episode reward: [(0, '1939.832')] [2023-03-06 18:41:07,407][23882] Updated weights for policy 0, policy_version 92470 (0.0006) [2023-03-06 18:41:08,179][23882] Updated weights for policy 0, policy_version 92480 (0.0006) [2023-03-06 18:41:08,969][23882] Updated weights for policy 0, policy_version 92490 (0.0007) [2023-03-06 18:41:09,766][23882] Updated weights for policy 0, policy_version 92500 (0.0007) [2023-03-06 18:41:10,560][23882] Updated weights for policy 0, policy_version 92510 (0.0006) [2023-03-06 18:41:11,374][23882] Updated weights for policy 0, policy_version 92520 (0.0007) [2023-03-06 18:41:11,748][23556] Fps is (10 sec: 12902.5, 60 sec: 12987.7, 300 sec: 13010.0). Total num frames: 94744576. Throughput: 0: 12985.2. Samples: 94743446. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:41:11,748][23556] Avg episode reward: [(0, '1971.375')] [2023-03-06 18:41:12,157][23882] Updated weights for policy 0, policy_version 92530 (0.0006) [2023-03-06 18:41:12,928][23882] Updated weights for policy 0, policy_version 92540 (0.0006) [2023-03-06 18:41:13,709][23882] Updated weights for policy 0, policy_version 92550 (0.0006) [2023-03-06 18:41:14,505][23882] Updated weights for policy 0, policy_version 92560 (0.0006) [2023-03-06 18:41:15,288][23882] Updated weights for policy 0, policy_version 92570 (0.0006) [2023-03-06 18:41:16,074][23882] Updated weights for policy 0, policy_version 92580 (0.0006) [2023-03-06 18:41:16,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13010.0). Total num frames: 94810112. Throughput: 0: 12990.7. Samples: 94782546. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:41:16,748][23556] Avg episode reward: [(0, '1949.657')] [2023-03-06 18:41:16,870][23882] Updated weights for policy 0, policy_version 92590 (0.0006) [2023-03-06 18:41:17,658][23882] Updated weights for policy 0, policy_version 92600 (0.0006) [2023-03-06 18:41:18,452][23882] Updated weights for policy 0, policy_version 92610 (0.0007) [2023-03-06 18:41:19,233][23882] Updated weights for policy 0, policy_version 92620 (0.0008) [2023-03-06 18:41:20,007][23882] Updated weights for policy 0, policy_version 92630 (0.0007) [2023-03-06 18:41:20,803][23882] Updated weights for policy 0, policy_version 92640 (0.0006) [2023-03-06 18:41:21,602][23882] Updated weights for policy 0, policy_version 92650 (0.0006) [2023-03-06 18:41:21,748][23556] Fps is (10 sec: 13004.8, 60 sec: 12987.7, 300 sec: 13006.5). Total num frames: 94874624. Throughput: 0: 12990.7. Samples: 94860499. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:41:21,748][23556] Avg episode reward: [(0, '1848.894')] [2023-03-06 18:41:22,388][23882] Updated weights for policy 0, policy_version 92660 (0.0007) [2023-03-06 18:41:23,162][23882] Updated weights for policy 0, policy_version 92670 (0.0006) [2023-03-06 18:41:23,964][23882] Updated weights for policy 0, policy_version 92680 (0.0007) [2023-03-06 18:41:24,749][23882] Updated weights for policy 0, policy_version 92690 (0.0006) [2023-03-06 18:41:25,508][23882] Updated weights for policy 0, policy_version 92700 (0.0006) [2023-03-06 18:41:26,308][23882] Updated weights for policy 0, policy_version 92710 (0.0006) [2023-03-06 18:41:26,748][23556] Fps is (10 sec: 13004.8, 60 sec: 12987.7, 300 sec: 13010.0). Total num frames: 94940160. Throughput: 0: 13005.5. Samples: 94938772. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:41:26,748][23556] Avg episode reward: [(0, '1980.479')] [2023-03-06 18:41:27,080][23882] Updated weights for policy 0, policy_version 92720 (0.0006) [2023-03-06 18:41:27,882][23882] Updated weights for policy 0, policy_version 92730 (0.0006) [2023-03-06 18:41:28,652][23882] Updated weights for policy 0, policy_version 92740 (0.0006) [2023-03-06 18:41:29,437][23882] Updated weights for policy 0, policy_version 92750 (0.0006) [2023-03-06 18:41:30,206][23882] Updated weights for policy 0, policy_version 92760 (0.0007) [2023-03-06 18:41:30,995][23882] Updated weights for policy 0, policy_version 92770 (0.0006) [2023-03-06 18:41:31,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13004.8, 300 sec: 13010.0). Total num frames: 95005696. Throughput: 0: 13011.9. Samples: 94978058. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:41:31,748][23556] Avg episode reward: [(0, '1999.512')] [2023-03-06 18:41:31,764][23882] Updated weights for policy 0, policy_version 92780 (0.0005) [2023-03-06 18:41:32,542][23882] Updated weights for policy 0, policy_version 92790 (0.0006) [2023-03-06 18:41:33,321][23882] Updated weights for policy 0, policy_version 92800 (0.0007) [2023-03-06 18:41:34,121][23882] Updated weights for policy 0, policy_version 92810 (0.0006) [2023-03-06 18:41:34,912][23882] Updated weights for policy 0, policy_version 92820 (0.0007) [2023-03-06 18:41:35,688][23882] Updated weights for policy 0, policy_version 92830 (0.0006) [2023-03-06 18:41:36,471][23882] Updated weights for policy 0, policy_version 92840 (0.0006) [2023-03-06 18:41:36,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13021.9, 300 sec: 13013.5). Total num frames: 95071232. Throughput: 0: 13022.5. Samples: 95056520. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:41:36,748][23556] Avg episode reward: [(0, '1937.076')] [2023-03-06 18:41:37,253][23882] Updated weights for policy 0, policy_version 92850 (0.0006) [2023-03-06 18:41:38,052][23882] Updated weights for policy 0, policy_version 92860 (0.0006) [2023-03-06 18:41:38,846][23882] Updated weights for policy 0, policy_version 92870 (0.0006) [2023-03-06 18:41:39,628][23882] Updated weights for policy 0, policy_version 92880 (0.0008) [2023-03-06 18:41:40,385][23882] Updated weights for policy 0, policy_version 92890 (0.0006) [2023-03-06 18:41:41,198][23882] Updated weights for policy 0, policy_version 92900 (0.0006) [2023-03-06 18:41:41,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13021.9, 300 sec: 13013.5). Total num frames: 95136768. Throughput: 0: 13029.6. Samples: 95134835. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:41:41,748][23556] Avg episode reward: [(0, '2040.527')] [2023-03-06 18:41:41,975][23882] Updated weights for policy 0, policy_version 92910 (0.0006) [2023-03-06 18:41:42,772][23882] Updated weights for policy 0, policy_version 92920 (0.0006) [2023-03-06 18:41:43,451][23831] KL-divergence is very high: 112.0047 [2023-03-06 18:41:43,569][23882] Updated weights for policy 0, policy_version 92930 (0.0008) [2023-03-06 18:41:44,345][23882] Updated weights for policy 0, policy_version 92940 (0.0006) [2023-03-06 18:41:45,109][23882] Updated weights for policy 0, policy_version 92950 (0.0006) [2023-03-06 18:41:45,902][23882] Updated weights for policy 0, policy_version 92960 (0.0006) [2023-03-06 18:41:46,665][23882] Updated weights for policy 0, policy_version 92970 (0.0006) [2023-03-06 18:41:46,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13013.5). Total num frames: 95202304. Throughput: 0: 13030.8. Samples: 95174037. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:41:46,748][23556] Avg episode reward: [(0, '1990.761')] [2023-03-06 18:41:47,449][23882] Updated weights for policy 0, policy_version 92980 (0.0007) [2023-03-06 18:41:48,235][23882] Updated weights for policy 0, policy_version 92990 (0.0006) [2023-03-06 18:41:49,029][23882] Updated weights for policy 0, policy_version 93000 (0.0006) [2023-03-06 18:41:49,828][23882] Updated weights for policy 0, policy_version 93010 (0.0006) [2023-03-06 18:41:50,596][23882] Updated weights for policy 0, policy_version 93020 (0.0006) [2023-03-06 18:41:51,364][23882] Updated weights for policy 0, policy_version 93030 (0.0006) [2023-03-06 18:41:51,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13010.0). Total num frames: 95266816. Throughput: 0: 13037.2. Samples: 95252472. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:41:51,748][23556] Avg episode reward: [(0, '1803.978')] [2023-03-06 18:41:51,924][23831] KL-divergence is very high: 155.4914 [2023-03-06 18:41:52,162][23882] Updated weights for policy 0, policy_version 93040 (0.0006) [2023-03-06 18:41:52,957][23882] Updated weights for policy 0, policy_version 93050 (0.0007) [2023-03-06 18:41:53,744][23882] Updated weights for policy 0, policy_version 93060 (0.0007) [2023-03-06 18:41:54,517][23882] Updated weights for policy 0, policy_version 93070 (0.0006) [2023-03-06 18:41:55,296][23882] Updated weights for policy 0, policy_version 93080 (0.0007) [2023-03-06 18:41:55,359][23831] KL-divergence is very high: 118735.8828 [2023-03-06 18:41:55,442][23831] KL-divergence is very high: 125.0499 [2023-03-06 18:41:56,090][23882] Updated weights for policy 0, policy_version 93090 (0.0006) [2023-03-06 18:41:56,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13013.5). Total num frames: 95332352. Throughput: 0: 13050.6. Samples: 95330723. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:41:56,748][23556] Avg episode reward: [(0, '2012.131')] [2023-03-06 18:41:56,753][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000093098_95332352.pth... [2023-03-06 18:41:56,783][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000090047_92208128.pth [2023-03-06 18:41:56,868][23882] Updated weights for policy 0, policy_version 93100 (0.0006) [2023-03-06 18:41:57,413][23831] KL-divergence is very high: 173.3604 [2023-03-06 18:41:57,656][23882] Updated weights for policy 0, policy_version 93110 (0.0007) [2023-03-06 18:41:57,962][23831] KL-divergence is very high: 218.4378 [2023-03-06 18:41:58,032][23831] KL-divergence is very high: 11592.1406 [2023-03-06 18:41:58,463][23882] Updated weights for policy 0, policy_version 93120 (0.0006) [2023-03-06 18:41:59,245][23882] Updated weights for policy 0, policy_version 93130 (0.0007) [2023-03-06 18:42:00,035][23882] Updated weights for policy 0, policy_version 93140 (0.0006) [2023-03-06 18:42:00,245][23831] KL-divergence is very high: 123.3956 [2023-03-06 18:42:00,808][23882] Updated weights for policy 0, policy_version 93150 (0.0006) [2023-03-06 18:42:01,595][23882] Updated weights for policy 0, policy_version 93160 (0.0006) [2023-03-06 18:42:01,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13038.9, 300 sec: 13013.5). Total num frames: 95397888. Throughput: 0: 13045.2. Samples: 95369581. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:42:01,748][23556] Avg episode reward: [(0, '1812.448')] [2023-03-06 18:42:02,370][23882] Updated weights for policy 0, policy_version 93170 (0.0006) [2023-03-06 18:42:03,153][23882] Updated weights for policy 0, policy_version 93180 (0.0007) [2023-03-06 18:42:03,949][23882] Updated weights for policy 0, policy_version 93190 (0.0006) [2023-03-06 18:42:04,740][23882] Updated weights for policy 0, policy_version 93200 (0.0007) [2023-03-06 18:42:05,534][23882] Updated weights for policy 0, policy_version 93210 (0.0006) [2023-03-06 18:42:06,308][23882] Updated weights for policy 0, policy_version 93220 (0.0006) [2023-03-06 18:42:06,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13013.5). Total num frames: 95462400. Throughput: 0: 13051.5. Samples: 95447818. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:42:06,748][23556] Avg episode reward: [(0, '1832.993')] [2023-03-06 18:42:07,107][23882] Updated weights for policy 0, policy_version 93230 (0.0007) [2023-03-06 18:42:07,173][23831] KL-divergence is very high: 127.8580 [2023-03-06 18:42:07,894][23882] Updated weights for policy 0, policy_version 93240 (0.0006) [2023-03-06 18:42:08,665][23882] Updated weights for policy 0, policy_version 93250 (0.0006) [2023-03-06 18:42:09,453][23882] Updated weights for policy 0, policy_version 93260 (0.0006) [2023-03-06 18:42:09,521][23831] KL-divergence is very high: 112.6466 [2023-03-06 18:42:10,239][23882] Updated weights for policy 0, policy_version 93270 (0.0006) [2023-03-06 18:42:11,009][23882] Updated weights for policy 0, policy_version 93280 (0.0006) [2023-03-06 18:42:11,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13013.5). Total num frames: 95527936. Throughput: 0: 13054.4. Samples: 95526220. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:42:11,748][23556] Avg episode reward: [(0, '1839.227')] [2023-03-06 18:42:11,813][23882] Updated weights for policy 0, policy_version 93290 (0.0006) [2023-03-06 18:42:12,591][23882] Updated weights for policy 0, policy_version 93300 (0.0006) [2023-03-06 18:42:13,386][23882] Updated weights for policy 0, policy_version 93310 (0.0005) [2023-03-06 18:42:14,163][23882] Updated weights for policy 0, policy_version 93320 (0.0006) [2023-03-06 18:42:14,933][23882] Updated weights for policy 0, policy_version 93330 (0.0006) [2023-03-06 18:42:15,718][23831] KL-divergence is very high: 115.2682 [2023-03-06 18:42:15,726][23882] Updated weights for policy 0, policy_version 93340 (0.0006) [2023-03-06 18:42:16,514][23882] Updated weights for policy 0, policy_version 93350 (0.0007) [2023-03-06 18:42:16,748][23556] Fps is (10 sec: 13004.6, 60 sec: 13038.9, 300 sec: 13013.5). Total num frames: 95592448. Throughput: 0: 13052.6. Samples: 95565425. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:42:16,749][23556] Avg episode reward: [(0, '1722.645')] [2023-03-06 18:42:16,824][23831] KL-divergence is very high: 26904.1660 [2023-03-06 18:42:17,303][23831] KL-divergence is very high: 153.9871 [2023-03-06 18:42:17,309][23882] Updated weights for policy 0, policy_version 93360 (0.0007) [2023-03-06 18:42:18,097][23882] Updated weights for policy 0, policy_version 93370 (0.0006) [2023-03-06 18:42:18,560][23831] KL-divergence is very high: 3177.2224 [2023-03-06 18:42:18,721][23831] KL-divergence is very high: 313.1182 [2023-03-06 18:42:18,889][23882] Updated weights for policy 0, policy_version 93380 (0.0006) [2023-03-06 18:42:19,283][23831] KL-divergence is very high: 197.6413 [2023-03-06 18:42:19,605][23831] KL-divergence is very high: 13567.7402 [2023-03-06 18:42:19,681][23882] Updated weights for policy 0, policy_version 93390 (0.0006) [2023-03-06 18:42:20,456][23882] Updated weights for policy 0, policy_version 93400 (0.0006) [2023-03-06 18:42:21,244][23882] Updated weights for policy 0, policy_version 93410 (0.0007) [2023-03-06 18:42:21,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13013.5). Total num frames: 95657984. Throughput: 0: 13038.4. Samples: 95643249. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:42:21,748][23556] Avg episode reward: [(0, '1557.457')] [2023-03-06 18:42:22,040][23882] Updated weights for policy 0, policy_version 93420 (0.0007) [2023-03-06 18:42:22,826][23882] Updated weights for policy 0, policy_version 93430 (0.0006) [2023-03-06 18:42:23,611][23831] KL-divergence is very high: 110.4598 [2023-03-06 18:42:23,618][23882] Updated weights for policy 0, policy_version 93440 (0.0006) [2023-03-06 18:42:24,412][23882] Updated weights for policy 0, policy_version 93450 (0.0008) [2023-03-06 18:42:25,189][23882] Updated weights for policy 0, policy_version 93460 (0.0007) [2023-03-06 18:42:25,958][23882] Updated weights for policy 0, policy_version 93470 (0.0006) [2023-03-06 18:42:26,737][23882] Updated weights for policy 0, policy_version 93480 (0.0007) [2023-03-06 18:42:26,748][23556] Fps is (10 sec: 13107.4, 60 sec: 13056.0, 300 sec: 13017.0). Total num frames: 95723520. Throughput: 0: 13039.8. Samples: 95721624. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:42:26,748][23556] Avg episode reward: [(0, '1349.225')] [2023-03-06 18:42:27,524][23882] Updated weights for policy 0, policy_version 93490 (0.0007) [2023-03-06 18:42:28,288][23882] Updated weights for policy 0, policy_version 93500 (0.0006) [2023-03-06 18:42:29,081][23882] Updated weights for policy 0, policy_version 93510 (0.0006) [2023-03-06 18:42:29,858][23882] Updated weights for policy 0, policy_version 93520 (0.0006) [2023-03-06 18:42:30,650][23882] Updated weights for policy 0, policy_version 93530 (0.0007) [2023-03-06 18:42:31,425][23882] Updated weights for policy 0, policy_version 93540 (0.0006) [2023-03-06 18:42:31,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13016.9). Total num frames: 95789056. Throughput: 0: 13040.2. Samples: 95760844. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:42:31,748][23556] Avg episode reward: [(0, '1138.591')] [2023-03-06 18:42:32,209][23882] Updated weights for policy 0, policy_version 93550 (0.0006) [2023-03-06 18:42:32,985][23882] Updated weights for policy 0, policy_version 93560 (0.0006) [2023-03-06 18:42:33,770][23882] Updated weights for policy 0, policy_version 93570 (0.0006) [2023-03-06 18:42:34,569][23882] Updated weights for policy 0, policy_version 93580 (0.0006) [2023-03-06 18:42:35,355][23882] Updated weights for policy 0, policy_version 93590 (0.0006) [2023-03-06 18:42:36,147][23882] Updated weights for policy 0, policy_version 93600 (0.0006) [2023-03-06 18:42:36,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13039.0, 300 sec: 13016.9). Total num frames: 95853568. Throughput: 0: 13041.0. Samples: 95839317. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:42:36,748][23556] Avg episode reward: [(0, '1662.915')] [2023-03-06 18:42:36,931][23882] Updated weights for policy 0, policy_version 93610 (0.0006) [2023-03-06 18:42:37,719][23882] Updated weights for policy 0, policy_version 93620 (0.0007) [2023-03-06 18:42:38,516][23882] Updated weights for policy 0, policy_version 93630 (0.0006) [2023-03-06 18:42:39,309][23882] Updated weights for policy 0, policy_version 93640 (0.0006) [2023-03-06 18:42:40,097][23882] Updated weights for policy 0, policy_version 93650 (0.0006) [2023-03-06 18:42:40,888][23882] Updated weights for policy 0, policy_version 93660 (0.0006) [2023-03-06 18:42:40,964][23831] KL-divergence is very high: 166.9972 [2023-03-06 18:42:41,678][23882] Updated weights for policy 0, policy_version 93670 (0.0006) [2023-03-06 18:42:41,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13039.0, 300 sec: 13017.0). Total num frames: 95919104. Throughput: 0: 13028.6. Samples: 95917010. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:42:41,748][23556] Avg episode reward: [(0, '1822.053')] [2023-03-06 18:42:42,444][23882] Updated weights for policy 0, policy_version 93680 (0.0006) [2023-03-06 18:42:43,244][23882] Updated weights for policy 0, policy_version 93690 (0.0006) [2023-03-06 18:42:44,021][23882] Updated weights for policy 0, policy_version 93700 (0.0006) [2023-03-06 18:42:44,817][23882] Updated weights for policy 0, policy_version 93710 (0.0007) [2023-03-06 18:42:45,596][23882] Updated weights for policy 0, policy_version 93720 (0.0006) [2023-03-06 18:42:46,397][23882] Updated weights for policy 0, policy_version 93730 (0.0006) [2023-03-06 18:42:46,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13016.9). Total num frames: 95983616. Throughput: 0: 13033.9. Samples: 95956106. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:42:46,748][23556] Avg episode reward: [(0, '1753.447')] [2023-03-06 18:42:47,186][23882] Updated weights for policy 0, policy_version 93740 (0.0005) [2023-03-06 18:42:47,957][23882] Updated weights for policy 0, policy_version 93750 (0.0006) [2023-03-06 18:42:48,745][23882] Updated weights for policy 0, policy_version 93760 (0.0006) [2023-03-06 18:42:49,521][23882] Updated weights for policy 0, policy_version 93770 (0.0006) [2023-03-06 18:42:50,304][23882] Updated weights for policy 0, policy_version 93780 (0.0006) [2023-03-06 18:42:51,104][23882] Updated weights for policy 0, policy_version 93790 (0.0006) [2023-03-06 18:42:51,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13020.4). Total num frames: 96049152. Throughput: 0: 13035.0. Samples: 96034391. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:42:51,748][23556] Avg episode reward: [(0, '1646.729')] [2023-03-06 18:42:51,902][23882] Updated weights for policy 0, policy_version 93800 (0.0006) [2023-03-06 18:42:52,667][23882] Updated weights for policy 0, policy_version 93810 (0.0006) [2023-03-06 18:42:53,457][23882] Updated weights for policy 0, policy_version 93820 (0.0007) [2023-03-06 18:42:54,262][23882] Updated weights for policy 0, policy_version 93830 (0.0006) [2023-03-06 18:42:55,030][23882] Updated weights for policy 0, policy_version 93840 (0.0006) [2023-03-06 18:42:55,807][23882] Updated weights for policy 0, policy_version 93850 (0.0006) [2023-03-06 18:42:56,603][23882] Updated weights for policy 0, policy_version 93860 (0.0006) [2023-03-06 18:42:56,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13016.9). Total num frames: 96113664. Throughput: 0: 13029.7. Samples: 96112558. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:42:56,748][23556] Avg episode reward: [(0, '1617.585')] [2023-03-06 18:42:57,392][23882] Updated weights for policy 0, policy_version 93870 (0.0006) [2023-03-06 18:42:58,178][23882] Updated weights for policy 0, policy_version 93880 (0.0005) [2023-03-06 18:42:58,979][23882] Updated weights for policy 0, policy_version 93890 (0.0007) [2023-03-06 18:42:59,778][23882] Updated weights for policy 0, policy_version 93900 (0.0006) [2023-03-06 18:43:00,546][23882] Updated weights for policy 0, policy_version 93910 (0.0007) [2023-03-06 18:43:01,341][23882] Updated weights for policy 0, policy_version 93920 (0.0005) [2023-03-06 18:43:01,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 96179200. Throughput: 0: 13018.6. Samples: 96151259. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:43:01,759][23556] Avg episode reward: [(0, '1892.270')] [2023-03-06 18:43:02,132][23882] Updated weights for policy 0, policy_version 93930 (0.0007) [2023-03-06 18:43:02,929][23882] Updated weights for policy 0, policy_version 93940 (0.0006) [2023-03-06 18:43:03,727][23882] Updated weights for policy 0, policy_version 93950 (0.0007) [2023-03-06 18:43:04,498][23882] Updated weights for policy 0, policy_version 93960 (0.0006) [2023-03-06 18:43:05,301][23882] Updated weights for policy 0, policy_version 93970 (0.0006) [2023-03-06 18:43:06,097][23882] Updated weights for policy 0, policy_version 93980 (0.0006) [2023-03-06 18:43:06,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13016.9). Total num frames: 96243712. Throughput: 0: 13020.2. Samples: 96229158. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:43:06,759][23556] Avg episode reward: [(0, '1872.454')] [2023-03-06 18:43:06,871][23882] Updated weights for policy 0, policy_version 93990 (0.0006) [2023-03-06 18:43:07,669][23882] Updated weights for policy 0, policy_version 94000 (0.0006) [2023-03-06 18:43:08,442][23882] Updated weights for policy 0, policy_version 94010 (0.0006) [2023-03-06 18:43:09,229][23882] Updated weights for policy 0, policy_version 94020 (0.0006) [2023-03-06 18:43:10,015][23882] Updated weights for policy 0, policy_version 94030 (0.0006) [2023-03-06 18:43:10,809][23882] Updated weights for policy 0, policy_version 94040 (0.0006) [2023-03-06 18:43:11,585][23882] Updated weights for policy 0, policy_version 94050 (0.0007) [2023-03-06 18:43:11,748][23556] Fps is (10 sec: 12902.5, 60 sec: 13004.8, 300 sec: 13017.0). Total num frames: 96308224. Throughput: 0: 13006.9. Samples: 96306934. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:43:11,754][23556] Avg episode reward: [(0, '2025.861')] [2023-03-06 18:43:12,365][23882] Updated weights for policy 0, policy_version 94060 (0.0006) [2023-03-06 18:43:13,163][23882] Updated weights for policy 0, policy_version 94070 (0.0007) [2023-03-06 18:43:13,933][23882] Updated weights for policy 0, policy_version 94080 (0.0006) [2023-03-06 18:43:14,723][23882] Updated weights for policy 0, policy_version 94090 (0.0006) [2023-03-06 18:43:15,501][23882] Updated weights for policy 0, policy_version 94100 (0.0007) [2023-03-06 18:43:16,304][23882] Updated weights for policy 0, policy_version 94110 (0.0007) [2023-03-06 18:43:16,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13016.9). Total num frames: 96373760. Throughput: 0: 13012.6. Samples: 96346410. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:43:16,748][23556] Avg episode reward: [(0, '1896.690')] [2023-03-06 18:43:17,092][23882] Updated weights for policy 0, policy_version 94120 (0.0006) [2023-03-06 18:43:17,888][23882] Updated weights for policy 0, policy_version 94130 (0.0006) [2023-03-06 18:43:18,682][23882] Updated weights for policy 0, policy_version 94140 (0.0007) [2023-03-06 18:43:19,439][23882] Updated weights for policy 0, policy_version 94150 (0.0005) [2023-03-06 18:43:20,222][23882] Updated weights for policy 0, policy_version 94160 (0.0006) [2023-03-06 18:43:21,006][23882] Updated weights for policy 0, policy_version 94170 (0.0007) [2023-03-06 18:43:21,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13004.8, 300 sec: 13013.5). Total num frames: 96438272. Throughput: 0: 13004.3. Samples: 96424510. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:43:21,748][23556] Avg episode reward: [(0, '1982.071')] [2023-03-06 18:43:21,817][23882] Updated weights for policy 0, policy_version 94180 (0.0007) [2023-03-06 18:43:22,614][23882] Updated weights for policy 0, policy_version 94190 (0.0006) [2023-03-06 18:43:23,412][23882] Updated weights for policy 0, policy_version 94200 (0.0006) [2023-03-06 18:43:24,222][23882] Updated weights for policy 0, policy_version 94210 (0.0007) [2023-03-06 18:43:25,011][23882] Updated weights for policy 0, policy_version 94220 (0.0006) [2023-03-06 18:43:25,791][23882] Updated weights for policy 0, policy_version 94230 (0.0006) [2023-03-06 18:43:26,594][23882] Updated weights for policy 0, policy_version 94240 (0.0007) [2023-03-06 18:43:26,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13016.9). Total num frames: 96503808. Throughput: 0: 12996.6. Samples: 96501857. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:43:26,748][23556] Avg episode reward: [(0, '1937.917')] [2023-03-06 18:43:27,373][23882] Updated weights for policy 0, policy_version 94250 (0.0007) [2023-03-06 18:43:28,162][23882] Updated weights for policy 0, policy_version 94260 (0.0008) [2023-03-06 18:43:28,945][23882] Updated weights for policy 0, policy_version 94270 (0.0006) [2023-03-06 18:43:29,738][23882] Updated weights for policy 0, policy_version 94280 (0.0005) [2023-03-06 18:43:30,522][23882] Updated weights for policy 0, policy_version 94290 (0.0006) [2023-03-06 18:43:31,295][23882] Updated weights for policy 0, policy_version 94300 (0.0006) [2023-03-06 18:43:31,748][23556] Fps is (10 sec: 13004.9, 60 sec: 12987.7, 300 sec: 13017.0). Total num frames: 96568320. Throughput: 0: 12994.5. Samples: 96540859. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:43:31,748][23556] Avg episode reward: [(0, '1817.719')] [2023-03-06 18:43:32,098][23882] Updated weights for policy 0, policy_version 94310 (0.0006) [2023-03-06 18:43:32,873][23882] Updated weights for policy 0, policy_version 94320 (0.0007) [2023-03-06 18:43:33,668][23882] Updated weights for policy 0, policy_version 94330 (0.0008) [2023-03-06 18:43:34,466][23882] Updated weights for policy 0, policy_version 94340 (0.0007) [2023-03-06 18:43:35,249][23882] Updated weights for policy 0, policy_version 94350 (0.0006) [2023-03-06 18:43:36,025][23882] Updated weights for policy 0, policy_version 94360 (0.0006) [2023-03-06 18:43:36,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13004.8, 300 sec: 13020.4). Total num frames: 96633856. Throughput: 0: 12991.4. Samples: 96619006. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:43:36,748][23556] Avg episode reward: [(0, '2063.940')] [2023-03-06 18:43:36,813][23882] Updated weights for policy 0, policy_version 94370 (0.0006) [2023-03-06 18:43:37,583][23882] Updated weights for policy 0, policy_version 94380 (0.0006) [2023-03-06 18:43:38,381][23882] Updated weights for policy 0, policy_version 94390 (0.0006) [2023-03-06 18:43:39,168][23882] Updated weights for policy 0, policy_version 94400 (0.0006) [2023-03-06 18:43:39,947][23882] Updated weights for policy 0, policy_version 94410 (0.0006) [2023-03-06 18:43:40,738][23882] Updated weights for policy 0, policy_version 94420 (0.0006) [2023-03-06 18:43:41,524][23882] Updated weights for policy 0, policy_version 94430 (0.0006) [2023-03-06 18:43:41,748][23556] Fps is (10 sec: 13107.0, 60 sec: 13004.8, 300 sec: 13020.4). Total num frames: 96699392. Throughput: 0: 12994.9. Samples: 96697327. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:43:41,748][23556] Avg episode reward: [(0, '1939.767')] [2023-03-06 18:43:42,288][23882] Updated weights for policy 0, policy_version 94440 (0.0006) [2023-03-06 18:43:43,087][23882] Updated weights for policy 0, policy_version 94450 (0.0007) [2023-03-06 18:43:43,877][23882] Updated weights for policy 0, policy_version 94460 (0.0007) [2023-03-06 18:43:44,671][23882] Updated weights for policy 0, policy_version 94470 (0.0006) [2023-03-06 18:43:45,466][23882] Updated weights for policy 0, policy_version 94480 (0.0008) [2023-03-06 18:43:46,233][23882] Updated weights for policy 0, policy_version 94490 (0.0006) [2023-03-06 18:43:46,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13004.8, 300 sec: 13020.4). Total num frames: 96763904. Throughput: 0: 13003.7. Samples: 96736426. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:43:46,748][23556] Avg episode reward: [(0, '1936.597')] [2023-03-06 18:43:47,017][23882] Updated weights for policy 0, policy_version 94500 (0.0007) [2023-03-06 18:43:47,809][23882] Updated weights for policy 0, policy_version 94510 (0.0007) [2023-03-06 18:43:48,576][23882] Updated weights for policy 0, policy_version 94520 (0.0006) [2023-03-06 18:43:49,373][23882] Updated weights for policy 0, policy_version 94530 (0.0006) [2023-03-06 18:43:50,150][23882] Updated weights for policy 0, policy_version 94540 (0.0006) [2023-03-06 18:43:50,926][23882] Updated weights for policy 0, policy_version 94550 (0.0006) [2023-03-06 18:43:51,715][23882] Updated weights for policy 0, policy_version 94560 (0.0006) [2023-03-06 18:43:51,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13004.8, 300 sec: 13020.4). Total num frames: 96829440. Throughput: 0: 13013.1. Samples: 96814749. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:43:51,748][23556] Avg episode reward: [(0, '2106.259')] [2023-03-06 18:43:52,495][23882] Updated weights for policy 0, policy_version 94570 (0.0006) [2023-03-06 18:43:53,280][23882] Updated weights for policy 0, policy_version 94580 (0.0007) [2023-03-06 18:43:54,058][23882] Updated weights for policy 0, policy_version 94590 (0.0006) [2023-03-06 18:43:54,838][23882] Updated weights for policy 0, policy_version 94600 (0.0006) [2023-03-06 18:43:55,642][23882] Updated weights for policy 0, policy_version 94610 (0.0006) [2023-03-06 18:43:56,411][23882] Updated weights for policy 0, policy_version 94620 (0.0006) [2023-03-06 18:43:56,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 96894976. Throughput: 0: 13026.7. Samples: 96893139. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:43:56,748][23556] Avg episode reward: [(0, '2013.543')] [2023-03-06 18:43:56,753][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000094624_96894976.pth... [2023-03-06 18:43:56,784][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000091573_93770752.pth [2023-03-06 18:43:57,207][23882] Updated weights for policy 0, policy_version 94630 (0.0007) [2023-03-06 18:43:57,991][23882] Updated weights for policy 0, policy_version 94640 (0.0006) [2023-03-06 18:43:58,791][23882] Updated weights for policy 0, policy_version 94650 (0.0006) [2023-03-06 18:43:59,574][23882] Updated weights for policy 0, policy_version 94660 (0.0006) [2023-03-06 18:44:00,368][23882] Updated weights for policy 0, policy_version 94670 (0.0007) [2023-03-06 18:44:01,155][23882] Updated weights for policy 0, policy_version 94680 (0.0006) [2023-03-06 18:44:01,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13004.8, 300 sec: 13016.9). Total num frames: 96959488. Throughput: 0: 13015.7. Samples: 96932116. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:44:01,748][23556] Avg episode reward: [(0, '2077.746')] [2023-03-06 18:44:01,945][23882] Updated weights for policy 0, policy_version 94690 (0.0006) [2023-03-06 18:44:02,728][23882] Updated weights for policy 0, policy_version 94700 (0.0006) [2023-03-06 18:44:03,526][23882] Updated weights for policy 0, policy_version 94710 (0.0007) [2023-03-06 18:44:04,317][23882] Updated weights for policy 0, policy_version 94720 (0.0006) [2023-03-06 18:44:05,086][23882] Updated weights for policy 0, policy_version 94730 (0.0007) [2023-03-06 18:44:05,873][23882] Updated weights for policy 0, policy_version 94740 (0.0006) [2023-03-06 18:44:06,642][23882] Updated weights for policy 0, policy_version 94750 (0.0006) [2023-03-06 18:44:06,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 97025024. Throughput: 0: 13012.4. Samples: 97010069. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:44:06,748][23556] Avg episode reward: [(0, '2134.191')] [2023-03-06 18:44:07,421][23882] Updated weights for policy 0, policy_version 94760 (0.0006) [2023-03-06 18:44:08,203][23882] Updated weights for policy 0, policy_version 94770 (0.0006) [2023-03-06 18:44:09,007][23882] Updated weights for policy 0, policy_version 94780 (0.0007) [2023-03-06 18:44:09,792][23882] Updated weights for policy 0, policy_version 94790 (0.0006) [2023-03-06 18:44:10,600][23882] Updated weights for policy 0, policy_version 94800 (0.0006) [2023-03-06 18:44:11,391][23882] Updated weights for policy 0, policy_version 94810 (0.0006) [2023-03-06 18:44:11,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13016.9). Total num frames: 97089536. Throughput: 0: 13030.9. Samples: 97088248. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:44:11,748][23556] Avg episode reward: [(0, '2054.520')] [2023-03-06 18:44:12,179][23882] Updated weights for policy 0, policy_version 94820 (0.0006) [2023-03-06 18:44:12,940][23882] Updated weights for policy 0, policy_version 94830 (0.0007) [2023-03-06 18:44:13,740][23882] Updated weights for policy 0, policy_version 94840 (0.0006) [2023-03-06 18:44:14,525][23882] Updated weights for policy 0, policy_version 94850 (0.0006) [2023-03-06 18:44:15,313][23882] Updated weights for policy 0, policy_version 94860 (0.0007) [2023-03-06 18:44:16,095][23882] Updated weights for policy 0, policy_version 94870 (0.0007) [2023-03-06 18:44:16,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 97155072. Throughput: 0: 13034.5. Samples: 97127412. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:44:16,748][23556] Avg episode reward: [(0, '1854.706')] [2023-03-06 18:44:16,892][23882] Updated weights for policy 0, policy_version 94880 (0.0006) [2023-03-06 18:44:17,688][23882] Updated weights for policy 0, policy_version 94890 (0.0006) [2023-03-06 18:44:18,465][23882] Updated weights for policy 0, policy_version 94900 (0.0006) [2023-03-06 18:44:19,248][23882] Updated weights for policy 0, policy_version 94910 (0.0006) [2023-03-06 18:44:20,060][23882] Updated weights for policy 0, policy_version 94920 (0.0007) [2023-03-06 18:44:20,829][23882] Updated weights for policy 0, policy_version 94930 (0.0006) [2023-03-06 18:44:21,635][23882] Updated weights for policy 0, policy_version 94940 (0.0007) [2023-03-06 18:44:21,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13016.9). Total num frames: 97219584. Throughput: 0: 13025.5. Samples: 97205153. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:44:21,748][23556] Avg episode reward: [(0, '2057.592')] [2023-03-06 18:44:22,417][23882] Updated weights for policy 0, policy_version 94950 (0.0008) [2023-03-06 18:44:23,206][23882] Updated weights for policy 0, policy_version 94960 (0.0005) [2023-03-06 18:44:24,001][23882] Updated weights for policy 0, policy_version 94970 (0.0006) [2023-03-06 18:44:24,784][23882] Updated weights for policy 0, policy_version 94980 (0.0007) [2023-03-06 18:44:25,570][23882] Updated weights for policy 0, policy_version 94990 (0.0006) [2023-03-06 18:44:26,357][23882] Updated weights for policy 0, policy_version 95000 (0.0006) [2023-03-06 18:44:26,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 97285120. Throughput: 0: 13018.6. Samples: 97283166. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:44:26,748][23556] Avg episode reward: [(0, '1968.474')] [2023-03-06 18:44:27,123][23882] Updated weights for policy 0, policy_version 95010 (0.0007) [2023-03-06 18:44:27,924][23882] Updated weights for policy 0, policy_version 95020 (0.0006) [2023-03-06 18:44:28,712][23882] Updated weights for policy 0, policy_version 95030 (0.0006) [2023-03-06 18:44:29,510][23882] Updated weights for policy 0, policy_version 95040 (0.0007) [2023-03-06 18:44:30,290][23882] Updated weights for policy 0, policy_version 95050 (0.0006) [2023-03-06 18:44:31,081][23882] Updated weights for policy 0, policy_version 95060 (0.0006) [2023-03-06 18:44:31,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13021.8, 300 sec: 13016.9). Total num frames: 97349632. Throughput: 0: 13016.3. Samples: 97322158. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 18:44:31,748][23556] Avg episode reward: [(0, '1995.101')] [2023-03-06 18:44:31,858][23882] Updated weights for policy 0, policy_version 95070 (0.0007) [2023-03-06 18:44:32,637][23882] Updated weights for policy 0, policy_version 95080 (0.0006) [2023-03-06 18:44:33,407][23882] Updated weights for policy 0, policy_version 95090 (0.0007) [2023-03-06 18:44:34,211][23882] Updated weights for policy 0, policy_version 95100 (0.0006) [2023-03-06 18:44:34,984][23882] Updated weights for policy 0, policy_version 95110 (0.0006) [2023-03-06 18:44:35,765][23882] Updated weights for policy 0, policy_version 95120 (0.0007) [2023-03-06 18:44:36,558][23882] Updated weights for policy 0, policy_version 95130 (0.0006) [2023-03-06 18:44:36,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13016.9). Total num frames: 97415168. Throughput: 0: 13016.1. Samples: 97400475. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:44:36,748][23556] Avg episode reward: [(0, '2020.558')] [2023-03-06 18:44:37,350][23882] Updated weights for policy 0, policy_version 95140 (0.0007) [2023-03-06 18:44:38,135][23882] Updated weights for policy 0, policy_version 95150 (0.0007) [2023-03-06 18:44:38,921][23882] Updated weights for policy 0, policy_version 95160 (0.0007) [2023-03-06 18:44:39,711][23882] Updated weights for policy 0, policy_version 95170 (0.0006) [2023-03-06 18:44:40,501][23882] Updated weights for policy 0, policy_version 95180 (0.0006) [2023-03-06 18:44:41,284][23882] Updated weights for policy 0, policy_version 95190 (0.0007) [2023-03-06 18:44:41,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13004.8, 300 sec: 13016.9). Total num frames: 97479680. Throughput: 0: 13008.3. Samples: 97478511. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:44:41,748][23556] Avg episode reward: [(0, '1956.878')] [2023-03-06 18:44:42,073][23882] Updated weights for policy 0, policy_version 95200 (0.0006) [2023-03-06 18:44:42,854][23882] Updated weights for policy 0, policy_version 95210 (0.0006) [2023-03-06 18:44:43,650][23882] Updated weights for policy 0, policy_version 95220 (0.0007) [2023-03-06 18:44:44,434][23882] Updated weights for policy 0, policy_version 95230 (0.0006) [2023-03-06 18:44:45,219][23882] Updated weights for policy 0, policy_version 95240 (0.0006) [2023-03-06 18:44:46,004][23882] Updated weights for policy 0, policy_version 95250 (0.0006) [2023-03-06 18:44:46,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13017.0). Total num frames: 97545216. Throughput: 0: 13012.2. Samples: 97517664. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:44:46,748][23556] Avg episode reward: [(0, '1928.170')] [2023-03-06 18:44:46,776][23882] Updated weights for policy 0, policy_version 95260 (0.0007) [2023-03-06 18:44:47,549][23882] Updated weights for policy 0, policy_version 95270 (0.0006) [2023-03-06 18:44:48,334][23882] Updated weights for policy 0, policy_version 95280 (0.0007) [2023-03-06 18:44:49,147][23882] Updated weights for policy 0, policy_version 95290 (0.0006) [2023-03-06 18:44:49,938][23882] Updated weights for policy 0, policy_version 95300 (0.0006) [2023-03-06 18:44:50,713][23882] Updated weights for policy 0, policy_version 95310 (0.0007) [2023-03-06 18:44:51,499][23882] Updated weights for policy 0, policy_version 95320 (0.0007) [2023-03-06 18:44:51,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13021.8, 300 sec: 13020.4). Total num frames: 97610752. Throughput: 0: 13017.8. Samples: 97595871. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:44:51,749][23556] Avg episode reward: [(0, '1936.733')] [2023-03-06 18:44:52,294][23882] Updated weights for policy 0, policy_version 95330 (0.0005) [2023-03-06 18:44:53,089][23882] Updated weights for policy 0, policy_version 95340 (0.0006) [2023-03-06 18:44:53,877][23882] Updated weights for policy 0, policy_version 95350 (0.0006) [2023-03-06 18:44:54,650][23882] Updated weights for policy 0, policy_version 95360 (0.0006) [2023-03-06 18:44:55,452][23882] Updated weights for policy 0, policy_version 95370 (0.0006) [2023-03-06 18:44:56,225][23882] Updated weights for policy 0, policy_version 95380 (0.0007) [2023-03-06 18:44:56,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13017.0). Total num frames: 97675264. Throughput: 0: 13009.3. Samples: 97673667. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:44:56,748][23556] Avg episode reward: [(0, '1868.808')] [2023-03-06 18:44:57,020][23882] Updated weights for policy 0, policy_version 95390 (0.0006) [2023-03-06 18:44:57,821][23882] Updated weights for policy 0, policy_version 95400 (0.0008) [2023-03-06 18:44:58,604][23882] Updated weights for policy 0, policy_version 95410 (0.0006) [2023-03-06 18:44:59,376][23882] Updated weights for policy 0, policy_version 95420 (0.0007) [2023-03-06 18:45:00,174][23882] Updated weights for policy 0, policy_version 95430 (0.0006) [2023-03-06 18:45:00,945][23882] Updated weights for policy 0, policy_version 95440 (0.0007) [2023-03-06 18:45:01,733][23882] Updated weights for policy 0, policy_version 95450 (0.0006) [2023-03-06 18:45:01,748][23556] Fps is (10 sec: 13005.0, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 97740800. Throughput: 0: 13005.2. Samples: 97712645. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:45:01,748][23556] Avg episode reward: [(0, '2110.996')] [2023-03-06 18:45:02,535][23882] Updated weights for policy 0, policy_version 95460 (0.0006) [2023-03-06 18:45:03,305][23882] Updated weights for policy 0, policy_version 95470 (0.0008) [2023-03-06 18:45:04,117][23882] Updated weights for policy 0, policy_version 95480 (0.0006) [2023-03-06 18:45:04,886][23882] Updated weights for policy 0, policy_version 95490 (0.0007) [2023-03-06 18:45:05,688][23882] Updated weights for policy 0, policy_version 95500 (0.0006) [2023-03-06 18:45:06,462][23882] Updated weights for policy 0, policy_version 95510 (0.0007) [2023-03-06 18:45:06,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13004.8, 300 sec: 13016.9). Total num frames: 97805312. Throughput: 0: 13015.1. Samples: 97790831. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:45:06,748][23556] Avg episode reward: [(0, '2199.314')] [2023-03-06 18:45:07,244][23882] Updated weights for policy 0, policy_version 95520 (0.0007) [2023-03-06 18:45:08,018][23882] Updated weights for policy 0, policy_version 95530 (0.0006) [2023-03-06 18:45:08,811][23882] Updated weights for policy 0, policy_version 95540 (0.0007) [2023-03-06 18:45:09,589][23882] Updated weights for policy 0, policy_version 95550 (0.0006) [2023-03-06 18:45:10,386][23882] Updated weights for policy 0, policy_version 95560 (0.0008) [2023-03-06 18:45:11,177][23882] Updated weights for policy 0, policy_version 95570 (0.0006) [2023-03-06 18:45:11,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 97870848. Throughput: 0: 13016.0. Samples: 97868883. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:45:11,748][23556] Avg episode reward: [(0, '2169.291')] [2023-03-06 18:45:11,953][23882] Updated weights for policy 0, policy_version 95580 (0.0006) [2023-03-06 18:45:12,757][23882] Updated weights for policy 0, policy_version 95590 (0.0008) [2023-03-06 18:45:13,532][23882] Updated weights for policy 0, policy_version 95600 (0.0006) [2023-03-06 18:45:14,322][23882] Updated weights for policy 0, policy_version 95610 (0.0005) [2023-03-06 18:45:15,118][23882] Updated weights for policy 0, policy_version 95620 (0.0006) [2023-03-06 18:45:15,893][23882] Updated weights for policy 0, policy_version 95630 (0.0006) [2023-03-06 18:45:16,681][23882] Updated weights for policy 0, policy_version 95640 (0.0007) [2023-03-06 18:45:16,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13016.9). Total num frames: 97935360. Throughput: 0: 13018.1. Samples: 97907974. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:45:16,748][23556] Avg episode reward: [(0, '2094.393')] [2023-03-06 18:45:17,477][23882] Updated weights for policy 0, policy_version 95650 (0.0006) [2023-03-06 18:45:18,273][23882] Updated weights for policy 0, policy_version 95660 (0.0006) [2023-03-06 18:45:19,056][23882] Updated weights for policy 0, policy_version 95670 (0.0007) [2023-03-06 18:45:19,839][23882] Updated weights for policy 0, policy_version 95680 (0.0006) [2023-03-06 18:45:20,631][23882] Updated weights for policy 0, policy_version 95690 (0.0006) [2023-03-06 18:45:21,410][23882] Updated weights for policy 0, policy_version 95700 (0.0005) [2023-03-06 18:45:21,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13017.0). Total num frames: 98000896. Throughput: 0: 13013.8. Samples: 97986097. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:45:21,748][23556] Avg episode reward: [(0, '2180.419')] [2023-03-06 18:45:22,187][23882] Updated weights for policy 0, policy_version 95710 (0.0006) [2023-03-06 18:45:22,986][23882] Updated weights for policy 0, policy_version 95720 (0.0007) [2023-03-06 18:45:23,777][23882] Updated weights for policy 0, policy_version 95730 (0.0006) [2023-03-06 18:45:24,558][23882] Updated weights for policy 0, policy_version 95740 (0.0006) [2023-03-06 18:45:25,355][23882] Updated weights for policy 0, policy_version 95750 (0.0006) [2023-03-06 18:45:26,125][23882] Updated weights for policy 0, policy_version 95760 (0.0006) [2023-03-06 18:45:26,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13016.9). Total num frames: 98065408. Throughput: 0: 13014.1. Samples: 98064145. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:45:26,748][23556] Avg episode reward: [(0, '2017.218')] [2023-03-06 18:45:26,921][23882] Updated weights for policy 0, policy_version 95770 (0.0006) [2023-03-06 18:45:27,710][23882] Updated weights for policy 0, policy_version 95780 (0.0007) [2023-03-06 18:45:28,519][23882] Updated weights for policy 0, policy_version 95790 (0.0007) [2023-03-06 18:45:29,295][23882] Updated weights for policy 0, policy_version 95800 (0.0006) [2023-03-06 18:45:30,084][23882] Updated weights for policy 0, policy_version 95810 (0.0007) [2023-03-06 18:45:30,871][23882] Updated weights for policy 0, policy_version 95820 (0.0007) [2023-03-06 18:45:31,648][23882] Updated weights for policy 0, policy_version 95830 (0.0006) [2023-03-06 18:45:31,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 98130944. Throughput: 0: 13009.4. Samples: 98103086. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 18:45:31,749][23556] Avg episode reward: [(0, '2014.540')] [2023-03-06 18:45:32,431][23882] Updated weights for policy 0, policy_version 95840 (0.0006) [2023-03-06 18:45:33,207][23882] Updated weights for policy 0, policy_version 95850 (0.0006) [2023-03-06 18:45:34,003][23882] Updated weights for policy 0, policy_version 95860 (0.0007) [2023-03-06 18:45:34,807][23882] Updated weights for policy 0, policy_version 95870 (0.0006) [2023-03-06 18:45:35,577][23882] Updated weights for policy 0, policy_version 95880 (0.0006) [2023-03-06 18:45:36,364][23882] Updated weights for policy 0, policy_version 95890 (0.0007) [2023-03-06 18:45:36,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13004.8, 300 sec: 13016.9). Total num frames: 98195456. Throughput: 0: 13009.4. Samples: 98181294. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:45:36,748][23556] Avg episode reward: [(0, '2042.369')] [2023-03-06 18:45:37,147][23882] Updated weights for policy 0, policy_version 95900 (0.0007) [2023-03-06 18:45:37,943][23882] Updated weights for policy 0, policy_version 95910 (0.0005) [2023-03-06 18:45:38,745][23882] Updated weights for policy 0, policy_version 95920 (0.0007) [2023-03-06 18:45:39,516][23882] Updated weights for policy 0, policy_version 95930 (0.0006) [2023-03-06 18:45:40,310][23882] Updated weights for policy 0, policy_version 95940 (0.0006) [2023-03-06 18:45:41,114][23882] Updated weights for policy 0, policy_version 95950 (0.0006) [2023-03-06 18:45:41,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 98260992. Throughput: 0: 13011.5. Samples: 98259185. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:45:41,748][23556] Avg episode reward: [(0, '2107.131')] [2023-03-06 18:45:41,873][23882] Updated weights for policy 0, policy_version 95960 (0.0006) [2023-03-06 18:45:42,680][23882] Updated weights for policy 0, policy_version 95970 (0.0006) [2023-03-06 18:45:43,465][23882] Updated weights for policy 0, policy_version 95980 (0.0007) [2023-03-06 18:45:44,259][23882] Updated weights for policy 0, policy_version 95990 (0.0006) [2023-03-06 18:45:45,030][23882] Updated weights for policy 0, policy_version 96000 (0.0007) [2023-03-06 18:45:45,820][23882] Updated weights for policy 0, policy_version 96010 (0.0006) [2023-03-06 18:45:46,632][23882] Updated weights for policy 0, policy_version 96020 (0.0007) [2023-03-06 18:45:46,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13016.9). Total num frames: 98325504. Throughput: 0: 13011.6. Samples: 98298167. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:45:46,748][23556] Avg episode reward: [(0, '2163.868')] [2023-03-06 18:45:47,399][23882] Updated weights for policy 0, policy_version 96030 (0.0006) [2023-03-06 18:45:48,202][23882] Updated weights for policy 0, policy_version 96040 (0.0007) [2023-03-06 18:45:48,972][23882] Updated weights for policy 0, policy_version 96050 (0.0007) [2023-03-06 18:45:49,757][23882] Updated weights for policy 0, policy_version 96060 (0.0006) [2023-03-06 18:45:50,546][23882] Updated weights for policy 0, policy_version 96070 (0.0007) [2023-03-06 18:45:51,317][23882] Updated weights for policy 0, policy_version 96080 (0.0005) [2023-03-06 18:45:51,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13004.8, 300 sec: 13020.4). Total num frames: 98391040. Throughput: 0: 13011.2. Samples: 98376332. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:45:51,748][23556] Avg episode reward: [(0, '1912.801')] [2023-03-06 18:45:52,110][23882] Updated weights for policy 0, policy_version 96090 (0.0006) [2023-03-06 18:45:52,886][23882] Updated weights for policy 0, policy_version 96100 (0.0006) [2023-03-06 18:45:53,680][23882] Updated weights for policy 0, policy_version 96110 (0.0006) [2023-03-06 18:45:54,447][23882] Updated weights for policy 0, policy_version 96120 (0.0007) [2023-03-06 18:45:55,254][23882] Updated weights for policy 0, policy_version 96130 (0.0006) [2023-03-06 18:45:56,038][23882] Updated weights for policy 0, policy_version 96140 (0.0006) [2023-03-06 18:45:56,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 98456576. Throughput: 0: 13014.7. Samples: 98454546. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:45:56,748][23556] Avg episode reward: [(0, '2115.968')] [2023-03-06 18:45:56,754][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000096149_98456576.pth... [2023-03-06 18:45:56,784][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000093098_95332352.pth [2023-03-06 18:45:56,825][23882] Updated weights for policy 0, policy_version 96150 (0.0006) [2023-03-06 18:45:57,611][23882] Updated weights for policy 0, policy_version 96160 (0.0006) [2023-03-06 18:45:58,410][23882] Updated weights for policy 0, policy_version 96170 (0.0007) [2023-03-06 18:45:59,198][23882] Updated weights for policy 0, policy_version 96180 (0.0006) [2023-03-06 18:45:59,982][23882] Updated weights for policy 0, policy_version 96190 (0.0006) [2023-03-06 18:46:00,782][23882] Updated weights for policy 0, policy_version 96200 (0.0006) [2023-03-06 18:46:01,558][23882] Updated weights for policy 0, policy_version 96210 (0.0007) [2023-03-06 18:46:01,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13004.8, 300 sec: 13020.4). Total num frames: 98521088. Throughput: 0: 13009.5. Samples: 98493400. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:46:01,748][23556] Avg episode reward: [(0, '2169.889')] [2023-03-06 18:46:02,357][23882] Updated weights for policy 0, policy_version 96220 (0.0006) [2023-03-06 18:46:03,125][23882] Updated weights for policy 0, policy_version 96230 (0.0007) [2023-03-06 18:46:03,924][23882] Updated weights for policy 0, policy_version 96240 (0.0007) [2023-03-06 18:46:04,708][23882] Updated weights for policy 0, policy_version 96250 (0.0006) [2023-03-06 18:46:05,494][23882] Updated weights for policy 0, policy_version 96260 (0.0006) [2023-03-06 18:46:06,256][23882] Updated weights for policy 0, policy_version 96270 (0.0007) [2023-03-06 18:46:06,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13023.9). Total num frames: 98586624. Throughput: 0: 13010.0. Samples: 98571548. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:46:06,748][23556] Avg episode reward: [(0, '2254.915')] [2023-03-06 18:46:07,052][23882] Updated weights for policy 0, policy_version 96280 (0.0006) [2023-03-06 18:46:07,836][23882] Updated weights for policy 0, policy_version 96290 (0.0007) [2023-03-06 18:46:08,626][23882] Updated weights for policy 0, policy_version 96300 (0.0007) [2023-03-06 18:46:09,418][23882] Updated weights for policy 0, policy_version 96310 (0.0007) [2023-03-06 18:46:10,199][23882] Updated weights for policy 0, policy_version 96320 (0.0006) [2023-03-06 18:46:10,993][23882] Updated weights for policy 0, policy_version 96330 (0.0006) [2023-03-06 18:46:11,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13020.4). Total num frames: 98651136. Throughput: 0: 13007.2. Samples: 98649467. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:46:11,748][23556] Avg episode reward: [(0, '1978.502')] [2023-03-06 18:46:11,799][23882] Updated weights for policy 0, policy_version 96340 (0.0007) [2023-03-06 18:46:12,564][23882] Updated weights for policy 0, policy_version 96350 (0.0006) [2023-03-06 18:46:13,359][23882] Updated weights for policy 0, policy_version 96360 (0.0006) [2023-03-06 18:46:14,138][23882] Updated weights for policy 0, policy_version 96370 (0.0006) [2023-03-06 18:46:14,930][23882] Updated weights for policy 0, policy_version 96380 (0.0007) [2023-03-06 18:46:15,718][23882] Updated weights for policy 0, policy_version 96390 (0.0006) [2023-03-06 18:46:16,521][23882] Updated weights for policy 0, policy_version 96400 (0.0007) [2023-03-06 18:46:16,748][23556] Fps is (10 sec: 12902.4, 60 sec: 13004.8, 300 sec: 13020.4). Total num frames: 98715648. Throughput: 0: 13008.2. Samples: 98688456. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:46:16,748][23556] Avg episode reward: [(0, '2145.797')] [2023-03-06 18:46:17,285][23882] Updated weights for policy 0, policy_version 96410 (0.0006) [2023-03-06 18:46:18,069][23882] Updated weights for policy 0, policy_version 96420 (0.0006) [2023-03-06 18:46:18,847][23882] Updated weights for policy 0, policy_version 96430 (0.0006) [2023-03-06 18:46:19,641][23882] Updated weights for policy 0, policy_version 96440 (0.0006) [2023-03-06 18:46:20,433][23882] Updated weights for policy 0, policy_version 96450 (0.0006) [2023-03-06 18:46:21,220][23882] Updated weights for policy 0, policy_version 96460 (0.0006) [2023-03-06 18:46:21,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13020.4). Total num frames: 98781184. Throughput: 0: 13011.3. Samples: 98766801. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:46:21,748][23556] Avg episode reward: [(0, '2139.274')] [2023-03-06 18:46:22,021][23882] Updated weights for policy 0, policy_version 96470 (0.0006) [2023-03-06 18:46:22,816][23882] Updated weights for policy 0, policy_version 96480 (0.0006) [2023-03-06 18:46:23,608][23882] Updated weights for policy 0, policy_version 96490 (0.0007) [2023-03-06 18:46:24,395][23882] Updated weights for policy 0, policy_version 96500 (0.0006) [2023-03-06 18:46:25,198][23882] Updated weights for policy 0, policy_version 96510 (0.0006) [2023-03-06 18:46:25,989][23882] Updated weights for policy 0, policy_version 96520 (0.0006) [2023-03-06 18:46:26,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13016.9). Total num frames: 98845696. Throughput: 0: 13003.8. Samples: 98844358. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:46:26,748][23556] Avg episode reward: [(0, '2116.344')] [2023-03-06 18:46:26,782][23882] Updated weights for policy 0, policy_version 96530 (0.0006) [2023-03-06 18:46:27,567][23882] Updated weights for policy 0, policy_version 96540 (0.0006) [2023-03-06 18:46:28,346][23882] Updated weights for policy 0, policy_version 96550 (0.0006) [2023-03-06 18:46:29,126][23882] Updated weights for policy 0, policy_version 96560 (0.0007) [2023-03-06 18:46:29,910][23882] Updated weights for policy 0, policy_version 96570 (0.0006) [2023-03-06 18:46:30,694][23882] Updated weights for policy 0, policy_version 96580 (0.0006) [2023-03-06 18:46:31,502][23882] Updated weights for policy 0, policy_version 96590 (0.0005) [2023-03-06 18:46:31,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13017.0). Total num frames: 98911232. Throughput: 0: 13009.2. Samples: 98883580. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:46:31,748][23556] Avg episode reward: [(0, '2160.409')] [2023-03-06 18:46:32,282][23882] Updated weights for policy 0, policy_version 96600 (0.0006) [2023-03-06 18:46:33,063][23882] Updated weights for policy 0, policy_version 96610 (0.0006) [2023-03-06 18:46:33,849][23882] Updated weights for policy 0, policy_version 96620 (0.0006) [2023-03-06 18:46:34,635][23882] Updated weights for policy 0, policy_version 96630 (0.0006) [2023-03-06 18:46:35,420][23882] Updated weights for policy 0, policy_version 96640 (0.0006) [2023-03-06 18:46:36,213][23882] Updated weights for policy 0, policy_version 96650 (0.0006) [2023-03-06 18:46:36,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13013.5). Total num frames: 98975744. Throughput: 0: 13005.2. Samples: 98961566. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:46:36,748][23556] Avg episode reward: [(0, '2006.870')] [2023-03-06 18:46:36,983][23882] Updated weights for policy 0, policy_version 96660 (0.0006) [2023-03-06 18:46:37,758][23882] Updated weights for policy 0, policy_version 96670 (0.0006) [2023-03-06 18:46:38,546][23882] Updated weights for policy 0, policy_version 96680 (0.0006) [2023-03-06 18:46:39,324][23882] Updated weights for policy 0, policy_version 96690 (0.0006) [2023-03-06 18:46:40,109][23882] Updated weights for policy 0, policy_version 96700 (0.0007) [2023-03-06 18:46:40,902][23882] Updated weights for policy 0, policy_version 96710 (0.0006) [2023-03-06 18:46:41,686][23882] Updated weights for policy 0, policy_version 96720 (0.0006) [2023-03-06 18:46:41,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13013.5). Total num frames: 99041280. Throughput: 0: 13009.6. Samples: 99039980. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:46:41,748][23556] Avg episode reward: [(0, '2056.770')] [2023-03-06 18:46:42,502][23882] Updated weights for policy 0, policy_version 96730 (0.0006) [2023-03-06 18:46:43,284][23882] Updated weights for policy 0, policy_version 96740 (0.0007) [2023-03-06 18:46:44,063][23882] Updated weights for policy 0, policy_version 96750 (0.0006) [2023-03-06 18:46:44,846][23882] Updated weights for policy 0, policy_version 96760 (0.0006) [2023-03-06 18:46:45,638][23882] Updated weights for policy 0, policy_version 96770 (0.0006) [2023-03-06 18:46:46,413][23882] Updated weights for policy 0, policy_version 96780 (0.0006) [2023-03-06 18:46:46,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13021.9, 300 sec: 13016.9). Total num frames: 99106816. Throughput: 0: 13014.7. Samples: 99079061. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:46:46,748][23556] Avg episode reward: [(0, '2190.926')] [2023-03-06 18:46:47,217][23882] Updated weights for policy 0, policy_version 96790 (0.0007) [2023-03-06 18:46:48,001][23882] Updated weights for policy 0, policy_version 96800 (0.0008) [2023-03-06 18:46:48,779][23882] Updated weights for policy 0, policy_version 96810 (0.0007) [2023-03-06 18:46:49,571][23882] Updated weights for policy 0, policy_version 96820 (0.0007) [2023-03-06 18:46:50,350][23882] Updated weights for policy 0, policy_version 96830 (0.0006) [2023-03-06 18:46:51,134][23882] Updated weights for policy 0, policy_version 96840 (0.0007) [2023-03-06 18:46:51,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13004.8, 300 sec: 13013.5). Total num frames: 99171328. Throughput: 0: 13009.9. Samples: 99156995. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:46:51,748][23556] Avg episode reward: [(0, '2093.819')] [2023-03-06 18:46:51,934][23882] Updated weights for policy 0, policy_version 96850 (0.0006) [2023-03-06 18:46:52,720][23882] Updated weights for policy 0, policy_version 96860 (0.0006) [2023-03-06 18:46:53,510][23882] Updated weights for policy 0, policy_version 96870 (0.0007) [2023-03-06 18:46:54,289][23882] Updated weights for policy 0, policy_version 96880 (0.0006) [2023-03-06 18:46:55,066][23882] Updated weights for policy 0, policy_version 96890 (0.0007) [2023-03-06 18:46:55,869][23882] Updated weights for policy 0, policy_version 96900 (0.0006) [2023-03-06 18:46:56,649][23882] Updated weights for policy 0, policy_version 96910 (0.0007) [2023-03-06 18:46:56,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13004.8, 300 sec: 13013.5). Total num frames: 99236864. Throughput: 0: 13013.2. Samples: 99235060. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:46:56,748][23556] Avg episode reward: [(0, '2111.799')] [2023-03-06 18:46:57,435][23882] Updated weights for policy 0, policy_version 96920 (0.0007) [2023-03-06 18:46:58,241][23882] Updated weights for policy 0, policy_version 96930 (0.0007) [2023-03-06 18:46:58,998][23882] Updated weights for policy 0, policy_version 96940 (0.0006) [2023-03-06 18:46:59,805][23882] Updated weights for policy 0, policy_version 96950 (0.0006) [2023-03-06 18:47:00,593][23882] Updated weights for policy 0, policy_version 96960 (0.0006) [2023-03-06 18:47:01,369][23882] Updated weights for policy 0, policy_version 96970 (0.0006) [2023-03-06 18:47:01,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13004.8, 300 sec: 13013.5). Total num frames: 99301376. Throughput: 0: 13011.4. Samples: 99273967. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:47:01,759][23556] Avg episode reward: [(0, '2078.342')] [2023-03-06 18:47:02,168][23882] Updated weights for policy 0, policy_version 96980 (0.0006) [2023-03-06 18:47:02,965][23882] Updated weights for policy 0, policy_version 96990 (0.0005) [2023-03-06 18:47:03,742][23882] Updated weights for policy 0, policy_version 97000 (0.0006) [2023-03-06 18:47:04,526][23882] Updated weights for policy 0, policy_version 97010 (0.0006) [2023-03-06 18:47:05,320][23882] Updated weights for policy 0, policy_version 97020 (0.0006) [2023-03-06 18:47:06,091][23882] Updated weights for policy 0, policy_version 97030 (0.0006) [2023-03-06 18:47:06,748][23556] Fps is (10 sec: 13004.6, 60 sec: 13004.8, 300 sec: 13013.5). Total num frames: 99366912. Throughput: 0: 13002.6. Samples: 99351921. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:47:06,759][23556] Avg episode reward: [(0, '2050.681')] [2023-03-06 18:47:06,878][23882] Updated weights for policy 0, policy_version 97040 (0.0006) [2023-03-06 18:47:07,666][23882] Updated weights for policy 0, policy_version 97050 (0.0006) [2023-03-06 18:47:08,453][23882] Updated weights for policy 0, policy_version 97060 (0.0006) [2023-03-06 18:47:09,241][23882] Updated weights for policy 0, policy_version 97070 (0.0006) [2023-03-06 18:47:10,006][23882] Updated weights for policy 0, policy_version 97080 (0.0006) [2023-03-06 18:47:10,789][23882] Updated weights for policy 0, policy_version 97090 (0.0007) [2023-03-06 18:47:11,573][23882] Updated weights for policy 0, policy_version 97100 (0.0007) [2023-03-06 18:47:11,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13021.9, 300 sec: 13017.0). Total num frames: 99432448. Throughput: 0: 13027.1. Samples: 99430578. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:47:11,748][23556] Avg episode reward: [(0, '2035.363')] [2023-03-06 18:47:12,343][23882] Updated weights for policy 0, policy_version 97110 (0.0006) [2023-03-06 18:47:13,137][23882] Updated weights for policy 0, policy_version 97120 (0.0007) [2023-03-06 18:47:13,924][23882] Updated weights for policy 0, policy_version 97130 (0.0007) [2023-03-06 18:47:14,706][23882] Updated weights for policy 0, policy_version 97140 (0.0006) [2023-03-06 18:47:15,496][23882] Updated weights for policy 0, policy_version 97150 (0.0006) [2023-03-06 18:47:16,281][23882] Updated weights for policy 0, policy_version 97160 (0.0006) [2023-03-06 18:47:16,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13038.9, 300 sec: 13016.9). Total num frames: 99497984. Throughput: 0: 13026.4. Samples: 99469767. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:47:16,748][23556] Avg episode reward: [(0, '1979.953')] [2023-03-06 18:47:17,055][23882] Updated weights for policy 0, policy_version 97170 (0.0006) [2023-03-06 18:47:17,855][23882] Updated weights for policy 0, policy_version 97180 (0.0007) [2023-03-06 18:47:18,651][23882] Updated weights for policy 0, policy_version 97190 (0.0006) [2023-03-06 18:47:19,424][23882] Updated weights for policy 0, policy_version 97200 (0.0006) [2023-03-06 18:47:20,219][23882] Updated weights for policy 0, policy_version 97210 (0.0006) [2023-03-06 18:47:20,990][23882] Updated weights for policy 0, policy_version 97220 (0.0006) [2023-03-06 18:47:21,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13013.5). Total num frames: 99562496. Throughput: 0: 13033.7. Samples: 99548081. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:47:21,748][23556] Avg episode reward: [(0, '2084.855')] [2023-03-06 18:47:21,781][23882] Updated weights for policy 0, policy_version 97230 (0.0006) [2023-03-06 18:47:22,589][23882] Updated weights for policy 0, policy_version 97240 (0.0007) [2023-03-06 18:47:23,355][23882] Updated weights for policy 0, policy_version 97250 (0.0006) [2023-03-06 18:47:24,141][23882] Updated weights for policy 0, policy_version 97260 (0.0007) [2023-03-06 18:47:24,930][23882] Updated weights for policy 0, policy_version 97270 (0.0006) [2023-03-06 18:47:25,710][23882] Updated weights for policy 0, policy_version 97280 (0.0006) [2023-03-06 18:47:26,481][23882] Updated weights for policy 0, policy_version 97290 (0.0006) [2023-03-06 18:47:26,748][23556] Fps is (10 sec: 13004.9, 60 sec: 13039.0, 300 sec: 13013.5). Total num frames: 99628032. Throughput: 0: 13029.9. Samples: 99626324. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:47:26,748][23556] Avg episode reward: [(0, '2127.672')] [2023-03-06 18:47:27,268][23882] Updated weights for policy 0, policy_version 97300 (0.0006) [2023-03-06 18:47:28,038][23882] Updated weights for policy 0, policy_version 97310 (0.0006) [2023-03-06 18:47:28,819][23882] Updated weights for policy 0, policy_version 97320 (0.0006) [2023-03-06 18:47:29,604][23882] Updated weights for policy 0, policy_version 97330 (0.0007) [2023-03-06 18:47:30,390][23882] Updated weights for policy 0, policy_version 97340 (0.0006) [2023-03-06 18:47:31,183][23882] Updated weights for policy 0, policy_version 97350 (0.0006) [2023-03-06 18:47:31,748][23556] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13016.9). Total num frames: 99693568. Throughput: 0: 13036.8. Samples: 99665716. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:47:31,748][23556] Avg episode reward: [(0, '2086.148')] [2023-03-06 18:47:31,957][23882] Updated weights for policy 0, policy_version 97360 (0.0006) [2023-03-06 18:47:32,742][23882] Updated weights for policy 0, policy_version 97370 (0.0005) [2023-03-06 18:47:33,515][23882] Updated weights for policy 0, policy_version 97380 (0.0006) [2023-03-06 18:47:34,299][23882] Updated weights for policy 0, policy_version 97390 (0.0006) [2023-03-06 18:47:35,098][23882] Updated weights for policy 0, policy_version 97400 (0.0006) [2023-03-06 18:47:35,880][23882] Updated weights for policy 0, policy_version 97410 (0.0006) [2023-03-06 18:47:36,666][23882] Updated weights for policy 0, policy_version 97420 (0.0005) [2023-03-06 18:47:36,748][23556] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13016.9). Total num frames: 99759104. Throughput: 0: 13051.5. Samples: 99744313. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:47:36,748][23556] Avg episode reward: [(0, '2148.794')] [2023-03-06 18:47:37,440][23882] Updated weights for policy 0, policy_version 97430 (0.0006) [2023-03-06 18:47:38,206][23882] Updated weights for policy 0, policy_version 97440 (0.0006) [2023-03-06 18:47:38,995][23882] Updated weights for policy 0, policy_version 97450 (0.0006) [2023-03-06 18:47:39,790][23882] Updated weights for policy 0, policy_version 97460 (0.0006) [2023-03-06 18:47:40,573][23882] Updated weights for policy 0, policy_version 97470 (0.0006) [2023-03-06 18:47:41,338][23882] Updated weights for policy 0, policy_version 97480 (0.0006) [2023-03-06 18:47:41,748][23556] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13020.4). Total num frames: 99824640. Throughput: 0: 13059.0. Samples: 99822717. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:47:41,748][23556] Avg episode reward: [(0, '2113.270')] [2023-03-06 18:47:42,136][23882] Updated weights for policy 0, policy_version 97490 (0.0005) [2023-03-06 18:47:42,919][23882] Updated weights for policy 0, policy_version 97500 (0.0006) [2023-03-06 18:47:43,697][23882] Updated weights for policy 0, policy_version 97510 (0.0006) [2023-03-06 18:47:44,480][23882] Updated weights for policy 0, policy_version 97520 (0.0006) [2023-03-06 18:47:45,277][23882] Updated weights for policy 0, policy_version 97530 (0.0006) [2023-03-06 18:47:46,057][23882] Updated weights for policy 0, policy_version 97540 (0.0007) [2023-03-06 18:47:46,748][23556] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13016.9). Total num frames: 99889152. Throughput: 0: 13064.1. Samples: 99861851. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:47:46,748][23556] Avg episode reward: [(0, '2093.628')] [2023-03-06 18:47:46,845][23882] Updated weights for policy 0, policy_version 97550 (0.0006) [2023-03-06 18:47:47,636][23882] Updated weights for policy 0, policy_version 97560 (0.0006) [2023-03-06 18:47:48,436][23882] Updated weights for policy 0, policy_version 97570 (0.0008) [2023-03-06 18:47:49,209][23882] Updated weights for policy 0, policy_version 97580 (0.0006) [2023-03-06 18:47:49,996][23882] Updated weights for policy 0, policy_version 97590 (0.0007) [2023-03-06 18:47:50,765][23882] Updated weights for policy 0, policy_version 97600 (0.0005) [2023-03-06 18:47:51,564][23882] Updated weights for policy 0, policy_version 97610 (0.0007) [2023-03-06 18:47:51,748][23556] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13020.4). Total num frames: 99954688. Throughput: 0: 13068.5. Samples: 99940002. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 18:47:51,748][23556] Avg episode reward: [(0, '1891.986')] [2023-03-06 18:47:52,366][23882] Updated weights for policy 0, policy_version 97620 (0.0006) [2023-03-06 18:47:53,145][23882] Updated weights for policy 0, policy_version 97630 (0.0007) [2023-03-06 18:47:53,934][23882] Updated weights for policy 0, policy_version 97640 (0.0007) [2023-03-06 18:47:54,730][23882] Updated weights for policy 0, policy_version 97650 (0.0007) [2023-03-06 18:47:55,364][24263] Stopping RolloutWorker_w28... [2023-03-06 18:47:55,364][24080] Stopping RolloutWorker_w20... [2023-03-06 18:47:55,364][24150] Stopping RolloutWorker_w21... [2023-03-06 18:47:55,364][24146] Stopping RolloutWorker_w17... [2023-03-06 18:47:55,365][24263] Loop rollout_proc28_evt_loop terminating... [2023-03-06 18:47:55,364][23831] Stopping Batcher_0... [2023-03-06 18:47:55,365][24146] Loop rollout_proc17_evt_loop terminating... [2023-03-06 18:47:55,365][24150] Loop rollout_proc21_evt_loop terminating... [2023-03-06 18:47:55,365][24080] Loop rollout_proc20_evt_loop terminating... [2023-03-06 18:47:55,365][23919] Stopping RolloutWorker_w5... [2023-03-06 18:47:55,365][24112] Stopping RolloutWorker_w12... [2023-03-06 18:47:55,365][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000097658_100001792.pth... [2023-03-06 18:47:55,365][24147] Stopping RolloutWorker_w11... [2023-03-06 18:47:55,365][24188] Stopping RolloutWorker_w25... [2023-03-06 18:47:55,365][24112] Loop rollout_proc12_evt_loop terminating... [2023-03-06 18:47:55,365][23919] Loop rollout_proc5_evt_loop terminating... [2023-03-06 18:47:55,364][23556] Component RolloutWorker_w21 stopped! [2023-03-06 18:47:55,365][24151] Stopping RolloutWorker_w8... [2023-03-06 18:47:55,365][24198] Stopping RolloutWorker_w26... [2023-03-06 18:47:55,365][24114] Stopping RolloutWorker_w10... [2023-03-06 18:47:55,365][24149] Stopping RolloutWorker_w16... [2023-03-06 18:47:55,365][24197] Stopping RolloutWorker_w24... [2023-03-06 18:47:55,365][23886] Stopping RolloutWorker_w3... [2023-03-06 18:47:55,365][24264] Stopping RolloutWorker_w30... [2023-03-06 18:47:55,365][24045] Stopping RolloutWorker_w19... [2023-03-06 18:47:55,365][24147] Loop rollout_proc11_evt_loop terminating... [2023-03-06 18:47:55,365][24188] Loop rollout_proc25_evt_loop terminating... [2023-03-06 18:47:55,365][24078] Stopping RolloutWorker_w18... [2023-03-06 18:47:55,365][24231] Stopping RolloutWorker_w27... [2023-03-06 18:47:55,365][24153] Stopping RolloutWorker_w9... [2023-03-06 18:47:55,365][24151] Loop rollout_proc8_evt_loop terminating... [2023-03-06 18:47:55,365][24044] Stopping RolloutWorker_w6... [2023-03-06 18:47:55,365][24114] Loop rollout_proc10_evt_loop terminating... [2023-03-06 18:47:55,365][24198] Loop rollout_proc26_evt_loop terminating... [2023-03-06 18:47:55,365][24149] Loop rollout_proc16_evt_loop terminating... [2023-03-06 18:47:55,365][24297] Stopping RolloutWorker_w31... [2023-03-06 18:47:55,365][24197] Loop rollout_proc24_evt_loop terminating... [2023-03-06 18:47:55,365][23886] Loop rollout_proc3_evt_loop terminating... [2023-03-06 18:47:55,365][24113] Stopping RolloutWorker_w7... [2023-03-06 18:47:55,365][24148] Stopping RolloutWorker_w15... [2023-03-06 18:47:55,365][24264] Loop rollout_proc30_evt_loop terminating... [2023-03-06 18:47:55,365][23885] Stopping RolloutWorker_w2... [2023-03-06 18:47:55,365][24078] Loop rollout_proc18_evt_loop terminating... [2023-03-06 18:47:55,365][24045] Loop rollout_proc19_evt_loop terminating... [2023-03-06 18:47:55,365][23556] Component RolloutWorker_w20 stopped! [2023-03-06 18:47:55,365][24152] Stopping RolloutWorker_w22... [2023-03-06 18:47:55,365][24044] Loop rollout_proc6_evt_loop terminating... [2023-03-06 18:47:55,365][24231] Loop rollout_proc27_evt_loop terminating... [2023-03-06 18:47:55,365][24153] Loop rollout_proc9_evt_loop terminating... [2023-03-06 18:47:55,365][23884] Stopping RolloutWorker_w1... [2023-03-06 18:47:55,366][23885] Loop rollout_proc2_evt_loop terminating... [2023-03-06 18:47:55,365][23883] Stopping RolloutWorker_w0... [2023-03-06 18:47:55,366][24297] Loop rollout_proc31_evt_loop terminating... [2023-03-06 18:47:55,366][24113] Loop rollout_proc7_evt_loop terminating... [2023-03-06 18:47:55,366][24148] Loop rollout_proc15_evt_loop terminating... [2023-03-06 18:47:55,366][24152] Loop rollout_proc22_evt_loop terminating... [2023-03-06 18:47:55,366][23556] Component RolloutWorker_w28 stopped! [2023-03-06 18:47:55,366][23884] Loop rollout_proc1_evt_loop terminating... [2023-03-06 18:47:55,366][23883] Loop rollout_proc0_evt_loop terminating... [2023-03-06 18:47:55,366][23556] Component Batcher_0 stopped! [2023-03-06 18:47:55,365][24186] Stopping RolloutWorker_w23... [2023-03-06 18:47:55,366][23556] Component RolloutWorker_w17 stopped! [2023-03-06 18:47:55,367][23556] Component RolloutWorker_w5 stopped! [2023-03-06 18:47:55,367][23556] Component RolloutWorker_w12 stopped! [2023-03-06 18:47:55,367][23556] Component RolloutWorker_w11 stopped! [2023-03-06 18:47:55,367][23556] Component RolloutWorker_w25 stopped! [2023-03-06 18:47:55,368][23556] Component RolloutWorker_w18 stopped! [2023-03-06 18:47:55,368][23556] Component RolloutWorker_w8 stopped! [2023-03-06 18:47:55,368][23556] Component RolloutWorker_w16 stopped! [2023-03-06 18:47:55,368][23556] Component RolloutWorker_w26 stopped! [2023-03-06 18:47:55,368][23556] Component RolloutWorker_w10 stopped! [2023-03-06 18:47:55,369][23556] Component RolloutWorker_w24 stopped! [2023-03-06 18:47:55,369][23556] Component RolloutWorker_w3 stopped! [2023-03-06 18:47:55,369][23556] Component RolloutWorker_w30 stopped! [2023-03-06 18:47:55,370][23556] Component RolloutWorker_w19 stopped! [2023-03-06 18:47:55,370][23556] Component RolloutWorker_w23 stopped! [2023-03-06 18:47:55,370][23556] Component RolloutWorker_w27 stopped! [2023-03-06 18:47:55,370][24186] Loop rollout_proc23_evt_loop terminating... [2023-03-06 18:47:55,371][23556] Component RolloutWorker_w9 stopped! [2023-03-06 18:47:55,373][23556] Component RolloutWorker_w6 stopped! [2023-03-06 18:47:55,376][23556] Component RolloutWorker_w31 stopped! [2023-03-06 18:47:55,365][23831] Loop batcher_evt_loop terminating... [2023-03-06 18:47:55,376][23556] Component RolloutWorker_w7 stopped! [2023-03-06 18:47:55,376][23556] Component RolloutWorker_w15 stopped! [2023-03-06 18:47:55,377][23556] Component RolloutWorker_w2 stopped! [2023-03-06 18:47:55,382][24296] Stopping RolloutWorker_w29... [2023-03-06 18:47:55,377][23556] Component RolloutWorker_w22 stopped! [2023-03-06 18:47:55,384][23556] Component RolloutWorker_w0 stopped! [2023-03-06 18:47:55,385][23556] Component RolloutWorker_w1 stopped! [2023-03-06 18:47:55,385][23556] Component RolloutWorker_w4 stopped! [2023-03-06 18:47:55,385][23556] Component RolloutWorker_w29 stopped! [2023-03-06 18:47:55,383][24296] Loop rollout_proc29_evt_loop terminating... [2023-03-06 18:47:55,392][24079] Stopping RolloutWorker_w14... [2023-03-06 18:47:55,394][24079] Loop rollout_proc14_evt_loop terminating... [2023-03-06 18:47:55,393][23556] Component RolloutWorker_w14 stopped! [2023-03-06 18:47:55,371][23887] Stopping RolloutWorker_w4... [2023-03-06 18:47:55,397][23887] Loop rollout_proc4_evt_loop terminating... [2023-03-06 18:47:55,400][24046] Stopping RolloutWorker_w13... [2023-03-06 18:47:55,401][24046] Loop rollout_proc13_evt_loop terminating... [2023-03-06 18:47:55,400][23556] Component RolloutWorker_w13 stopped! [2023-03-06 18:47:55,436][23882] Weights refcount: 2 0 [2023-03-06 18:47:55,438][23882] Stopping InferenceWorker_p0-w0... [2023-03-06 18:47:55,439][23882] Loop inference_proc0-0_evt_loop terminating... [2023-03-06 18:47:55,439][23556] Component InferenceWorker_p0-w0 stopped! [2023-03-06 18:47:55,482][23831] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000094624_96894976.pth [2023-03-06 18:47:55,492][23831] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/basketball-v2/checkpoint_p0/checkpoint_000097658_100001792.pth... [2023-03-06 18:47:55,577][23831] Stopping LearnerWorker_p0... [2023-03-06 18:47:55,578][23831] Loop learner_proc0_evt_loop terminating... [2023-03-06 18:47:55,578][23556] Component LearnerWorker_p0 stopped! [2023-03-06 18:47:55,579][23556] Waiting for process learner_proc0 to stop... [2023-03-06 18:47:56,748][23556] Waiting for process inference_proc0-0 to join... [2023-03-06 18:47:56,749][23556] Waiting for process rollout_proc0 to join... [2023-03-06 18:47:56,749][23556] Waiting for process rollout_proc1 to join... [2023-03-06 18:47:56,749][23556] Waiting for process rollout_proc2 to join... [2023-03-06 18:47:56,749][23556] Waiting for process rollout_proc3 to join... [2023-03-06 18:47:56,749][23556] Waiting for process rollout_proc4 to join... [2023-03-06 18:47:56,750][23556] Waiting for process rollout_proc5 to join... [2023-03-06 18:47:56,750][23556] Waiting for process rollout_proc6 to join... [2023-03-06 18:47:56,750][23556] Waiting for process rollout_proc7 to join... [2023-03-06 18:47:56,750][23556] Waiting for process rollout_proc8 to join... [2023-03-06 18:47:56,750][23556] Waiting for process rollout_proc9 to join... [2023-03-06 18:47:56,751][23556] Waiting for process rollout_proc10 to join... [2023-03-06 18:47:56,751][23556] Waiting for process rollout_proc11 to join... [2023-03-06 18:47:56,751][23556] Waiting for process rollout_proc12 to join... [2023-03-06 18:47:56,751][23556] Waiting for process rollout_proc13 to join... [2023-03-06 18:47:56,752][23556] Waiting for process rollout_proc14 to join... [2023-03-06 18:47:56,752][23556] Waiting for process rollout_proc15 to join... [2023-03-06 18:47:56,752][23556] Waiting for process rollout_proc16 to join... [2023-03-06 18:47:56,752][23556] Waiting for process rollout_proc17 to join... [2023-03-06 18:47:56,752][23556] Waiting for process rollout_proc18 to join... [2023-03-06 18:47:56,753][23556] Waiting for process rollout_proc19 to join... [2023-03-06 18:47:56,753][23556] Waiting for process rollout_proc20 to join... [2023-03-06 18:47:56,753][23556] Waiting for process rollout_proc21 to join... [2023-03-06 18:47:56,753][23556] Waiting for process rollout_proc22 to join... [2023-03-06 18:47:56,754][23556] Waiting for process rollout_proc23 to join... [2023-03-06 18:47:56,754][23556] Waiting for process rollout_proc24 to join... [2023-03-06 18:47:56,754][23556] Waiting for process rollout_proc25 to join... [2023-03-06 18:47:56,754][23556] Waiting for process rollout_proc26 to join... [2023-03-06 18:47:56,754][23556] Waiting for process rollout_proc27 to join... [2023-03-06 18:47:56,755][23556] Waiting for process rollout_proc28 to join... [2023-03-06 18:47:56,755][23556] Waiting for process rollout_proc29 to join... [2023-03-06 18:47:56,755][23556] Waiting for process rollout_proc30 to join... [2023-03-06 18:47:56,755][23556] Waiting for process rollout_proc31 to join... [2023-03-06 18:47:56,756][23556] Batcher 0 profile tree view: batching: 828.3875, releasing_batches: 1.6072 [2023-03-06 18:47:56,756][23556] InferenceWorker_p0-w0 profile tree view: wait_policy: 0.0001 wait_policy_total: 234.8623 update_model: 138.2336 weight_update: 0.0007 one_step: 0.0118 handle_policy_step: 6905.2948 deserialize: 210.6793, stack: 36.3558, obs_to_device_normalize: 1236.6739, forward: 3063.4929, send_messages: 1370.1198 prepare_outputs: 711.3184 to_cpu: 362.0262 [2023-03-06 18:47:56,756][23556] Learner 0 profile tree view: misc: 0.5633, prepare_batch: 418.5011 train: 907.8386 epoch_init: 0.3852, minibatch_init: 0.3909, losses_postprocess: 30.3853, kl_divergence: 35.4510, after_optimizer: 96.4979 calculate_losses: 299.8243 losses_init: 0.2185, forward_head: 16.7106, bptt_initial: 109.0998, tail: 60.0577, advantages_returns: 7.4659, losses: 28.1666 bptt: 69.0376 bptt_forward_core: 66.5862 update: 422.1611 clip: 55.3751 [2023-03-06 18:47:56,756][23556] RolloutWorker_w0 profile tree view: wait_for_trajectories: 3.9965, enqueue_policy_requests: 183.8799, env_step: 2957.5125, overhead: 176.1455, complete_rollouts: 9.9113 save_policy_outputs: 237.4536 split_output_tensors: 115.9134 [2023-03-06 18:47:56,756][23556] RolloutWorker_w31 profile tree view: wait_for_trajectories: 4.1187, enqueue_policy_requests: 192.8931, env_step: 2988.1331, overhead: 178.7672, complete_rollouts: 9.9940 save_policy_outputs: 234.6419 split_output_tensors: 114.3029 [2023-03-06 18:47:56,757][23556] Loop Runner_EvtLoop terminating... [2023-03-06 18:47:56,757][23556] Runner profile tree view: main_loop: 7676.9544 [2023-03-06 18:47:56,757][23556] Collected {0: 100001792}, FPS: 13026.2