[2023-03-06 20:58:26,235][62145] Saving configuration to /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/config.json... [2023-03-06 20:58:26,249][62145] Rollout worker 0 uses device cpu [2023-03-06 20:58:26,249][62145] Rollout worker 1 uses device cpu [2023-03-06 20:58:26,250][62145] Rollout worker 2 uses device cpu [2023-03-06 20:58:26,250][62145] Rollout worker 3 uses device cpu [2023-03-06 20:58:26,250][62145] Rollout worker 4 uses device cpu [2023-03-06 20:58:26,250][62145] Rollout worker 5 uses device cpu [2023-03-06 20:58:26,250][62145] Rollout worker 6 uses device cpu [2023-03-06 20:58:26,250][62145] Rollout worker 7 uses device cpu [2023-03-06 20:58:26,251][62145] Rollout worker 8 uses device cpu [2023-03-06 20:58:26,251][62145] Rollout worker 9 uses device cpu [2023-03-06 20:58:26,251][62145] Rollout worker 10 uses device cpu [2023-03-06 20:58:26,251][62145] Rollout worker 11 uses device cpu [2023-03-06 20:58:26,251][62145] Rollout worker 12 uses device cpu [2023-03-06 20:58:26,251][62145] Rollout worker 13 uses device cpu [2023-03-06 20:58:26,252][62145] Rollout worker 14 uses device cpu [2023-03-06 20:58:26,252][62145] Rollout worker 15 uses device cpu [2023-03-06 20:58:26,252][62145] Rollout worker 16 uses device cpu [2023-03-06 20:58:26,252][62145] Rollout worker 17 uses device cpu [2023-03-06 20:58:26,252][62145] Rollout worker 18 uses device cpu [2023-03-06 20:58:26,252][62145] Rollout worker 19 uses device cpu [2023-03-06 20:58:26,252][62145] Rollout worker 20 uses device cpu [2023-03-06 20:58:26,253][62145] Rollout worker 21 uses device cpu [2023-03-06 20:58:26,253][62145] Rollout worker 22 uses device cpu [2023-03-06 20:58:26,253][62145] Rollout worker 23 uses device cpu [2023-03-06 20:58:26,253][62145] Rollout worker 24 uses device cpu [2023-03-06 20:58:26,253][62145] Rollout worker 25 uses device cpu [2023-03-06 20:58:26,253][62145] Rollout worker 26 uses device cpu [2023-03-06 20:58:26,254][62145] Rollout worker 27 uses device cpu [2023-03-06 20:58:26,254][62145] Rollout worker 28 uses device cpu [2023-03-06 20:58:26,254][62145] Rollout worker 29 uses device cpu [2023-03-06 20:58:26,254][62145] Rollout worker 30 uses device cpu [2023-03-06 20:58:26,254][62145] Rollout worker 31 uses device cpu [2023-03-06 20:58:26,267][62145] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-03-06 20:58:26,268][62145] InferenceWorker_p0-w0: min num requests: 10 [2023-03-06 20:58:26,356][62145] Starting all processes... [2023-03-06 20:58:26,357][62145] Starting process learner_proc0 [2023-03-06 20:58:26,406][62145] Starting all processes... [2023-03-06 20:58:26,461][62145] Starting process inference_proc0-0 [2023-03-06 20:58:26,461][62145] Starting process rollout_proc0 [2023-03-06 20:58:26,461][62145] Starting process rollout_proc1 [2023-03-06 20:58:26,462][62145] Starting process rollout_proc2 [2023-03-06 20:58:26,462][62145] Starting process rollout_proc3 [2023-03-06 20:58:26,462][62145] Starting process rollout_proc4 [2023-03-06 20:58:26,462][62145] Starting process rollout_proc5 [2023-03-06 20:58:26,462][62145] Starting process rollout_proc6 [2023-03-06 20:58:26,464][62145] Starting process rollout_proc7 [2023-03-06 20:58:26,464][62145] Starting process rollout_proc8 [2023-03-06 20:58:26,464][62145] Starting process rollout_proc9 [2023-03-06 20:58:26,464][62145] Starting process rollout_proc10 [2023-03-06 20:58:26,471][62145] Starting process rollout_proc11 [2023-03-06 20:58:26,473][62145] Starting process rollout_proc12 [2023-03-06 20:58:26,473][62145] Starting process rollout_proc13 [2023-03-06 20:58:26,474][62145] Starting process rollout_proc14 [2023-03-06 20:58:26,474][62145] Starting process rollout_proc15 [2023-03-06 20:58:26,482][62145] Starting process rollout_proc16 [2023-03-06 20:58:26,491][62145] Starting process rollout_proc17 [2023-03-06 20:58:26,491][62145] Starting process rollout_proc18 [2023-03-06 20:58:26,581][62145] Starting process rollout_proc19 [2023-03-06 20:58:26,594][62145] Starting process rollout_proc20 [2023-03-06 20:58:26,608][62145] Starting process rollout_proc21 [2023-03-06 20:58:26,631][62145] Starting process rollout_proc22 [2023-03-06 20:58:26,637][62145] Starting process rollout_proc23 [2023-03-06 20:58:26,644][62145] Starting process rollout_proc24 [2023-03-06 20:58:26,645][62145] Starting process rollout_proc25 [2023-03-06 20:58:26,654][62145] Starting process rollout_proc26 [2023-03-06 20:58:26,654][62145] Starting process rollout_proc27 [2023-03-06 20:58:26,654][62145] Starting process rollout_proc28 [2023-03-06 20:58:26,655][62145] Starting process rollout_proc29 [2023-03-06 20:58:26,655][62145] Starting process rollout_proc30 [2023-03-06 20:58:26,655][62145] Starting process rollout_proc31 [2023-03-06 20:58:28,370][62424] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-03-06 20:58:28,371][62424] Set environment var CUDA_VISIBLE_DEVICES to '0' (GPU indices [0]) for learning process 0 [2023-03-06 20:58:28,381][62424] Num visible devices: 1 [2023-03-06 20:58:28,408][62424] WARNING! It is generally recommended to enable Fixed KL loss (https://arxiv.org/pdf/1707.06347.pdf) for continuous action tasks to avoid potential numerical issues. I.e. set --kl_loss_coeff=0.1 [2023-03-06 20:58:28,408][62424] Starting seed is not provided [2023-03-06 20:58:28,408][62424] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-03-06 20:58:28,409][62424] Initializing actor-critic model on device cuda:0 [2023-03-06 20:58:28,409][62424] RunningMeanStd input shape: (39,) [2023-03-06 20:58:28,409][62424] RunningMeanStd input shape: (1,) [2023-03-06 20:58:28,442][62608] Worker 15 uses CPU cores [15] [2023-03-06 20:58:28,511][62424] Created Actor Critic model with architecture: [2023-03-06 20:58:28,512][62424] ActorCriticSharedWeights( (obs_normalizer): ObservationNormalizer( (running_mean_std): RunningMeanStdDictInPlace( (running_mean_std): ModuleDict( (obs): RunningMeanStdInPlace() ) ) ) (returns_normalizer): RecursiveScriptModule(original_name=RunningMeanStdInPlace) (encoder): MultiInputEncoder( (encoders): ModuleDict( (obs): MlpEncoder( (mlp_head): RecursiveScriptModule( original_name=Sequential (0): RecursiveScriptModule(original_name=Linear) (1): RecursiveScriptModule(original_name=ELU) (2): RecursiveScriptModule(original_name=Linear) (3): RecursiveScriptModule(original_name=ELU) ) ) ) ) (core): ModelCoreRNN( (core): GRU(512, 512) ) (decoder): MlpDecoder( (mlp): Identity() ) (critic_linear): Linear(in_features=512, out_features=1, bias=True) (action_parameterization): ActionParameterizationDefault( (distribution_linear): Linear(in_features=512, out_features=8, bias=True) ) ) [2023-03-06 20:58:28,548][62475] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-03-06 20:58:28,548][62475] Set environment var CUDA_VISIBLE_DEVICES to '0' (GPU indices [0]) for inference process 0 [2023-03-06 20:58:28,565][62475] Num visible devices: 1 [2023-03-06 20:58:28,695][62614] Worker 6 uses CPU cores [6] [2023-03-06 20:58:28,735][62940] Worker 27 uses CPU cores [27] [2023-03-06 20:58:28,807][62935] Worker 22 uses CPU cores [22] [2023-03-06 20:58:28,919][62901] Worker 20 uses CPU cores [20] [2023-03-06 20:58:29,079][62647] Worker 10 uses CPU cores [10] [2023-03-06 20:58:29,202][62840] Worker 7 uses CPU cores [7] [2023-03-06 20:58:29,259][62615] Worker 18 uses CPU cores [18] [2023-03-06 20:58:29,478][62942] Worker 29 uses CPU cores [29] [2023-03-06 20:58:29,481][62603] Worker 5 uses CPU cores [5] [2023-03-06 20:58:29,748][62939] Worker 26 uses CPU cores [26] [2023-03-06 20:58:29,751][62476] Worker 0 uses CPU cores [0] [2023-03-06 20:58:29,855][62775] Worker 19 uses CPU cores [19] [2023-03-06 20:58:30,029][62424] Using optimizer [2023-03-06 20:58:30,030][62424] No checkpoints found [2023-03-06 20:58:30,030][62424] Did not load from checkpoint, starting from scratch! [2023-03-06 20:58:30,030][62424] Initialized policy 0 weights for model version 0 [2023-03-06 20:58:30,032][62424] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-03-06 20:58:30,035][62424] LearnerWorker_p0 finished initialization! [2023-03-06 20:58:30,062][62612] Worker 13 uses CPU cores [13] [2023-03-06 20:58:30,066][62742] Worker 8 uses CPU cores [8] [2023-03-06 20:58:30,110][62475] RunningMeanStd input shape: (39,) [2023-03-06 20:58:30,110][62475] RunningMeanStd input shape: (1,) [2023-03-06 20:58:30,229][62609] Worker 3 uses CPU cores [3] [2023-03-06 20:58:30,295][62937] Worker 24 uses CPU cores [24] [2023-03-06 20:58:30,560][62611] Worker 9 uses CPU cores [9] [2023-03-06 20:58:30,643][62610] Worker 14 uses CPU cores [14] [2023-03-06 20:58:30,647][62604] Worker 12 uses CPU cores [12] [2023-03-06 20:58:30,814][62477] Worker 1 uses CPU cores [1] [2023-03-06 20:58:30,907][62145] Inference worker 0-0 is ready! [2023-03-06 20:58:30,908][62145] All inference workers are ready! Signal rollout workers to start! [2023-03-06 20:58:31,027][62902] Worker 21 uses CPU cores [21] [2023-03-06 20:58:31,253][62478] Worker 2 uses CPU cores [2] [2023-03-06 20:58:31,354][62605] Worker 4 uses CPU cores [4] [2023-03-06 20:58:31,455][62982] Worker 31 uses CPU cores [31] [2023-03-06 20:58:31,455][62938] Worker 25 uses CPU cores [25] [2023-03-06 20:58:31,588][62941] Worker 28 uses CPU cores [28] [2023-03-06 20:58:31,655][62607] Worker 16 uses CPU cores [16] [2023-03-06 20:58:31,869][62613] Worker 17 uses CPU cores [17] [2023-03-06 20:58:31,987][62606] Worker 11 uses CPU cores [11] [2023-03-06 20:58:32,352][62936] Worker 23 uses CPU cores [23] [2023-03-06 20:58:32,390][62145] Fps is (10 sec: nan, 60 sec: nan, 300 sec: nan). Total num frames: 0. Throughput: 0: nan. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0) [2023-03-06 20:58:32,481][62974] Worker 30 uses CPU cores [30] [2023-03-06 20:58:33,848][62840] Decorrelating experience for 0 frames... [2023-03-06 20:58:33,871][62940] Decorrelating experience for 0 frames... [2023-03-06 20:58:33,993][62603] Decorrelating experience for 0 frames... [2023-03-06 20:58:33,995][62901] Decorrelating experience for 0 frames... [2023-03-06 20:58:34,055][62476] Decorrelating experience for 0 frames... [2023-03-06 20:58:34,068][62604] Decorrelating experience for 0 frames... [2023-03-06 20:58:34,106][62477] Decorrelating experience for 0 frames... [2023-03-06 20:58:34,126][62937] Decorrelating experience for 0 frames... [2023-03-06 20:58:34,164][62610] Decorrelating experience for 0 frames... [2023-03-06 20:58:34,168][62611] Decorrelating experience for 0 frames... [2023-03-06 20:58:34,171][62742] Decorrelating experience for 0 frames... [2023-03-06 20:58:34,202][62902] Decorrelating experience for 0 frames... [2023-03-06 20:58:34,239][62647] Decorrelating experience for 0 frames... [2023-03-06 20:58:34,247][62939] Decorrelating experience for 0 frames... [2023-03-06 20:58:34,292][62614] Decorrelating experience for 0 frames... [2023-03-06 20:58:34,314][62775] Decorrelating experience for 0 frames... [2023-03-06 20:58:34,317][62615] Decorrelating experience for 0 frames... [2023-03-06 20:58:34,331][62608] Decorrelating experience for 0 frames... [2023-03-06 20:58:34,349][62609] Decorrelating experience for 0 frames... [2023-03-06 20:58:34,394][62935] Decorrelating experience for 0 frames... [2023-03-06 20:58:34,581][62612] Decorrelating experience for 0 frames... [2023-03-06 20:58:34,587][62942] Decorrelating experience for 0 frames... [2023-03-06 20:58:34,628][62478] Decorrelating experience for 0 frames... [2023-03-06 20:58:34,850][62982] Decorrelating experience for 0 frames... [2023-03-06 20:58:34,923][62607] Decorrelating experience for 0 frames... [2023-03-06 20:58:34,929][62605] Decorrelating experience for 0 frames... [2023-03-06 20:58:34,982][62938] Decorrelating experience for 0 frames... [2023-03-06 20:58:35,012][62941] Decorrelating experience for 0 frames... [2023-03-06 20:58:35,458][62606] Decorrelating experience for 0 frames... [2023-03-06 20:58:35,480][62613] Decorrelating experience for 0 frames... [2023-03-06 20:58:35,902][62936] Decorrelating experience for 0 frames... [2023-03-06 20:58:35,988][62974] Decorrelating experience for 0 frames... [2023-03-06 20:58:36,932][62940] Decorrelating experience for 32 frames... [2023-03-06 20:58:36,981][62840] Decorrelating experience for 32 frames... [2023-03-06 20:58:37,058][62603] Decorrelating experience for 32 frames... [2023-03-06 20:58:37,115][62901] Decorrelating experience for 32 frames... [2023-03-06 20:58:37,160][62477] Decorrelating experience for 32 frames... [2023-03-06 20:58:37,180][62476] Decorrelating experience for 32 frames... [2023-03-06 20:58:37,188][62604] Decorrelating experience for 32 frames... [2023-03-06 20:58:37,204][62902] Decorrelating experience for 32 frames... [2023-03-06 20:58:37,212][62937] Decorrelating experience for 32 frames... [2023-03-06 20:58:37,234][62610] Decorrelating experience for 32 frames... [2023-03-06 20:58:37,250][62742] Decorrelating experience for 32 frames... [2023-03-06 20:58:37,342][62647] Decorrelating experience for 32 frames... [2023-03-06 20:58:37,346][62939] Decorrelating experience for 32 frames... [2023-03-06 20:58:37,390][62145] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.4. Samples: 2. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0) [2023-03-06 20:58:37,450][62611] Decorrelating experience for 32 frames... [2023-03-06 20:58:37,466][62615] Decorrelating experience for 32 frames... [2023-03-06 20:58:37,474][62608] Decorrelating experience for 32 frames... [2023-03-06 20:58:37,535][62614] Decorrelating experience for 32 frames... [2023-03-06 20:58:37,570][62775] Decorrelating experience for 32 frames... [2023-03-06 20:58:37,597][62609] Decorrelating experience for 32 frames... [2023-03-06 20:58:37,632][62935] Decorrelating experience for 32 frames... [2023-03-06 20:58:37,744][62478] Decorrelating experience for 32 frames... [2023-03-06 20:58:37,758][62424] Signal inference workers to stop experience collection... [2023-03-06 20:58:37,762][62475] InferenceWorker_p0-w0: stopping experience collection [2023-03-06 20:58:37,781][62605] Decorrelating experience for 32 frames... [2023-03-06 20:58:37,807][62607] Decorrelating experience for 32 frames... [2023-03-06 20:58:37,821][62941] Decorrelating experience for 32 frames... [2023-03-06 20:58:37,885][62942] Decorrelating experience for 32 frames... [2023-03-06 20:58:37,887][62612] Decorrelating experience for 32 frames... [2023-03-06 20:58:37,895][62982] Decorrelating experience for 32 frames... [2023-03-06 20:58:37,958][62938] Decorrelating experience for 32 frames... [2023-03-06 20:58:38,087][62424] Signal inference workers to resume experience collection... [2023-03-06 20:58:38,088][62475] InferenceWorker_p0-w0: resuming experience collection [2023-03-06 20:58:38,124][62613] Decorrelating experience for 32 frames... [2023-03-06 20:58:38,134][62606] Decorrelating experience for 32 frames... [2023-03-06 20:58:38,400][62936] Decorrelating experience for 32 frames... [2023-03-06 20:58:38,481][62974] Decorrelating experience for 32 frames... [2023-03-06 20:58:39,269][62475] Updated weights for policy 0, policy_version 10 (0.0218) [2023-03-06 20:58:40,075][62475] Updated weights for policy 0, policy_version 20 (0.0006) [2023-03-06 20:58:40,880][62475] Updated weights for policy 0, policy_version 30 (0.0006) [2023-03-06 20:58:41,648][62475] Updated weights for policy 0, policy_version 40 (0.0006) [2023-03-06 20:58:42,390][62145] Fps is (10 sec: 5017.6, 60 sec: 5017.6, 300 sec: 5017.6). Total num frames: 50176. Throughput: 0: 1983.3. Samples: 19833. Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) [2023-03-06 20:58:42,390][62145] Avg episode reward: [(0, '235.818')] [2023-03-06 20:58:42,416][62475] Updated weights for policy 0, policy_version 50 (0.0006) [2023-03-06 20:58:43,233][62475] Updated weights for policy 0, policy_version 60 (0.0006) [2023-03-06 20:58:44,017][62475] Updated weights for policy 0, policy_version 70 (0.0006) [2023-03-06 20:58:44,778][62475] Updated weights for policy 0, policy_version 80 (0.0007) [2023-03-06 20:58:45,571][62475] Updated weights for policy 0, policy_version 90 (0.0006) [2023-03-06 20:58:46,263][62145] Heartbeat connected on Batcher_0 [2023-03-06 20:58:46,265][62145] Heartbeat connected on LearnerWorker_p0 [2023-03-06 20:58:46,271][62145] Heartbeat connected on RolloutWorker_w0 [2023-03-06 20:58:46,272][62145] Heartbeat connected on InferenceWorker_p0-w0 [2023-03-06 20:58:46,273][62145] Heartbeat connected on RolloutWorker_w1 [2023-03-06 20:58:46,274][62145] Heartbeat connected on RolloutWorker_w2 [2023-03-06 20:58:46,278][62145] Heartbeat connected on RolloutWorker_w4 [2023-03-06 20:58:46,278][62145] Heartbeat connected on RolloutWorker_w3 [2023-03-06 20:58:46,279][62145] Heartbeat connected on RolloutWorker_w5 [2023-03-06 20:58:46,281][62145] Heartbeat connected on RolloutWorker_w6 [2023-03-06 20:58:46,283][62145] Heartbeat connected on RolloutWorker_w7 [2023-03-06 20:58:46,286][62145] Heartbeat connected on RolloutWorker_w8 [2023-03-06 20:58:46,287][62145] Heartbeat connected on RolloutWorker_w9 [2023-03-06 20:58:46,289][62145] Heartbeat connected on RolloutWorker_w10 [2023-03-06 20:58:46,318][62145] Heartbeat connected on RolloutWorker_w11 [2023-03-06 20:58:46,319][62145] Heartbeat connected on RolloutWorker_w12 [2023-03-06 20:58:46,321][62145] Heartbeat connected on RolloutWorker_w13 [2023-03-06 20:58:46,323][62145] Heartbeat connected on RolloutWorker_w14 [2023-03-06 20:58:46,325][62145] Heartbeat connected on RolloutWorker_w15 [2023-03-06 20:58:46,326][62145] Heartbeat connected on RolloutWorker_w16 [2023-03-06 20:58:46,328][62145] Heartbeat connected on RolloutWorker_w17 [2023-03-06 20:58:46,333][62145] Heartbeat connected on RolloutWorker_w19 [2023-03-06 20:58:46,333][62145] Heartbeat connected on RolloutWorker_w18 [2023-03-06 20:58:46,335][62145] Heartbeat connected on RolloutWorker_w20 [2023-03-06 20:58:46,338][62145] Heartbeat connected on RolloutWorker_w22 [2023-03-06 20:58:46,338][62145] Heartbeat connected on RolloutWorker_w21 [2023-03-06 20:58:46,340][62145] Heartbeat connected on RolloutWorker_w23 [2023-03-06 20:58:46,341][62145] Heartbeat connected on RolloutWorker_w24 [2023-03-06 20:58:46,343][62145] Heartbeat connected on RolloutWorker_w25 [2023-03-06 20:58:46,345][62145] Heartbeat connected on RolloutWorker_w26 [2023-03-06 20:58:46,347][62145] Heartbeat connected on RolloutWorker_w27 [2023-03-06 20:58:46,349][62145] Heartbeat connected on RolloutWorker_w28 [2023-03-06 20:58:46,352][62145] Heartbeat connected on RolloutWorker_w29 [2023-03-06 20:58:46,354][62145] Heartbeat connected on RolloutWorker_w30 [2023-03-06 20:58:46,354][62145] Heartbeat connected on RolloutWorker_w31 [2023-03-06 20:58:46,369][62475] Updated weights for policy 0, policy_version 100 (0.0007) [2023-03-06 20:58:47,149][62475] Updated weights for policy 0, policy_version 110 (0.0006) [2023-03-06 20:58:47,390][62145] Fps is (10 sec: 11571.3, 60 sec: 7714.1, 300 sec: 7714.1). Total num frames: 115712. Throughput: 0: 6511.9. Samples: 97678. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 20:58:47,390][62145] Avg episode reward: [(0, '292.499')] [2023-03-06 20:58:47,391][62424] Saving new best policy, reward=292.499! [2023-03-06 20:58:47,941][62475] Updated weights for policy 0, policy_version 120 (0.0006) [2023-03-06 20:58:48,753][62475] Updated weights for policy 0, policy_version 130 (0.0007) [2023-03-06 20:58:49,547][62475] Updated weights for policy 0, policy_version 140 (0.0005) [2023-03-06 20:58:50,333][62475] Updated weights for policy 0, policy_version 150 (0.0006) [2023-03-06 20:58:51,171][62475] Updated weights for policy 0, policy_version 160 (0.0007) [2023-03-06 20:58:51,954][62475] Updated weights for policy 0, policy_version 170 (0.0007) [2023-03-06 20:58:52,389][62145] Fps is (10 sec: 12902.5, 60 sec: 8960.0, 300 sec: 8960.0). Total num frames: 179200. Throughput: 0: 8735.0. Samples: 174700. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 20:58:52,390][62145] Avg episode reward: [(0, '269.370')] [2023-03-06 20:58:52,742][62475] Updated weights for policy 0, policy_version 180 (0.0006) [2023-03-06 20:58:53,566][62475] Updated weights for policy 0, policy_version 190 (0.0007) [2023-03-06 20:58:54,356][62475] Updated weights for policy 0, policy_version 200 (0.0007) [2023-03-06 20:58:55,141][62475] Updated weights for policy 0, policy_version 210 (0.0006) [2023-03-06 20:58:55,964][62475] Updated weights for policy 0, policy_version 220 (0.0006) [2023-03-06 20:58:56,745][62475] Updated weights for policy 0, policy_version 230 (0.0006) [2023-03-06 20:58:57,390][62145] Fps is (10 sec: 12799.9, 60 sec: 9748.5, 300 sec: 9748.5). Total num frames: 243712. Throughput: 0: 8530.6. Samples: 213266. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 20:58:57,390][62145] Avg episode reward: [(0, '301.820')] [2023-03-06 20:58:57,391][62424] Saving new best policy, reward=301.820! [2023-03-06 20:58:57,534][62475] Updated weights for policy 0, policy_version 240 (0.0006) [2023-03-06 20:58:58,349][62475] Updated weights for policy 0, policy_version 250 (0.0007) [2023-03-06 20:58:59,126][62475] Updated weights for policy 0, policy_version 260 (0.0006) [2023-03-06 20:58:59,927][62475] Updated weights for policy 0, policy_version 270 (0.0007) [2023-03-06 20:59:00,739][62475] Updated weights for policy 0, policy_version 280 (0.0007) [2023-03-06 20:59:01,533][62475] Updated weights for policy 0, policy_version 290 (0.0006) [2023-03-06 20:59:02,321][62475] Updated weights for policy 0, policy_version 300 (0.0006) [2023-03-06 20:59:02,390][62145] Fps is (10 sec: 12799.9, 60 sec: 10240.0, 300 sec: 10240.0). Total num frames: 307200. Throughput: 0: 9671.3. Samples: 290138. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 20:59:02,390][62145] Avg episode reward: [(0, '271.630')] [2023-03-06 20:59:03,135][62475] Updated weights for policy 0, policy_version 310 (0.0006) [2023-03-06 20:59:03,943][62475] Updated weights for policy 0, policy_version 320 (0.0006) [2023-03-06 20:59:04,722][62475] Updated weights for policy 0, policy_version 330 (0.0006) [2023-03-06 20:59:05,523][62475] Updated weights for policy 0, policy_version 340 (0.0006) [2023-03-06 20:59:06,326][62475] Updated weights for policy 0, policy_version 350 (0.0006) [2023-03-06 20:59:07,122][62475] Updated weights for policy 0, policy_version 360 (0.0006) [2023-03-06 20:59:07,389][62145] Fps is (10 sec: 12800.2, 60 sec: 10620.4, 300 sec: 10620.4). Total num frames: 371712. Throughput: 0: 10494.2. Samples: 367296. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 20:59:07,390][62145] Avg episode reward: [(0, '294.204')] [2023-03-06 20:59:07,917][62475] Updated weights for policy 0, policy_version 370 (0.0007) [2023-03-06 20:59:08,722][62475] Updated weights for policy 0, policy_version 380 (0.0007) [2023-03-06 20:59:09,515][62475] Updated weights for policy 0, policy_version 390 (0.0007) [2023-03-06 20:59:10,286][62475] Updated weights for policy 0, policy_version 400 (0.0006) [2023-03-06 20:59:11,103][62475] Updated weights for policy 0, policy_version 410 (0.0007) [2023-03-06 20:59:11,892][62475] Updated weights for policy 0, policy_version 420 (0.0006) [2023-03-06 20:59:12,389][62145] Fps is (10 sec: 12902.4, 60 sec: 10905.6, 300 sec: 10905.6). Total num frames: 436224. Throughput: 0: 10149.2. Samples: 405966. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 20:59:12,390][62145] Avg episode reward: [(0, '359.080')] [2023-03-06 20:59:12,393][62424] Saving new best policy, reward=359.080! [2023-03-06 20:59:12,667][62475] Updated weights for policy 0, policy_version 430 (0.0006) [2023-03-06 20:59:13,497][62475] Updated weights for policy 0, policy_version 440 (0.0006) [2023-03-06 20:59:14,279][62475] Updated weights for policy 0, policy_version 450 (0.0007) [2023-03-06 20:59:15,055][62475] Updated weights for policy 0, policy_version 460 (0.0007) [2023-03-06 20:59:15,865][62475] Updated weights for policy 0, policy_version 470 (0.0006) [2023-03-06 20:59:16,644][62475] Updated weights for policy 0, policy_version 480 (0.0007) [2023-03-06 20:59:17,390][62145] Fps is (10 sec: 12902.3, 60 sec: 11127.5, 300 sec: 11127.5). Total num frames: 500736. Throughput: 0: 10736.1. Samples: 483122. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 20:59:17,390][62145] Avg episode reward: [(0, '356.679')] [2023-03-06 20:59:17,422][62475] Updated weights for policy 0, policy_version 490 (0.0006) [2023-03-06 20:59:18,260][62475] Updated weights for policy 0, policy_version 500 (0.0006) [2023-03-06 20:59:19,042][62475] Updated weights for policy 0, policy_version 510 (0.0006) [2023-03-06 20:59:19,844][62475] Updated weights for policy 0, policy_version 520 (0.0006) [2023-03-06 20:59:20,640][62475] Updated weights for policy 0, policy_version 530 (0.0007) [2023-03-06 20:59:21,448][62475] Updated weights for policy 0, policy_version 540 (0.0008) [2023-03-06 20:59:22,224][62475] Updated weights for policy 0, policy_version 550 (0.0007) [2023-03-06 20:59:22,389][62145] Fps is (10 sec: 12902.4, 60 sec: 11305.0, 300 sec: 11305.0). Total num frames: 565248. Throughput: 0: 12456.0. Samples: 560520. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 20:59:22,390][62145] Avg episode reward: [(0, '372.819')] [2023-03-06 20:59:22,393][62424] Saving new best policy, reward=372.819! [2023-03-06 20:59:23,021][62475] Updated weights for policy 0, policy_version 560 (0.0006) [2023-03-06 20:59:23,797][62475] Updated weights for policy 0, policy_version 570 (0.0006) [2023-03-06 20:59:24,575][62475] Updated weights for policy 0, policy_version 580 (0.0006) [2023-03-06 20:59:25,399][62475] Updated weights for policy 0, policy_version 590 (0.0006) [2023-03-06 20:59:26,185][62475] Updated weights for policy 0, policy_version 600 (0.0007) [2023-03-06 20:59:26,981][62475] Updated weights for policy 0, policy_version 610 (0.0007) [2023-03-06 20:59:27,390][62145] Fps is (10 sec: 12902.4, 60 sec: 11450.2, 300 sec: 11450.2). Total num frames: 629760. Throughput: 0: 12878.7. Samples: 599376. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 20:59:27,390][62145] Avg episode reward: [(0, '415.074')] [2023-03-06 20:59:27,391][62424] Saving new best policy, reward=415.074! [2023-03-06 20:59:27,778][62475] Updated weights for policy 0, policy_version 620 (0.0007) [2023-03-06 20:59:28,575][62475] Updated weights for policy 0, policy_version 630 (0.0005) [2023-03-06 20:59:29,357][62475] Updated weights for policy 0, policy_version 640 (0.0006) [2023-03-06 20:59:30,133][62475] Updated weights for policy 0, policy_version 650 (0.0006) [2023-03-06 20:59:30,925][62475] Updated weights for policy 0, policy_version 660 (0.0006) [2023-03-06 20:59:31,726][62475] Updated weights for policy 0, policy_version 670 (0.0006) [2023-03-06 20:59:32,390][62145] Fps is (10 sec: 12902.4, 60 sec: 11571.2, 300 sec: 11571.2). Total num frames: 694272. Throughput: 0: 12866.1. Samples: 676654. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 20:59:32,390][62145] Avg episode reward: [(0, '403.455')] [2023-03-06 20:59:32,516][62475] Updated weights for policy 0, policy_version 680 (0.0006) [2023-03-06 20:59:33,332][62475] Updated weights for policy 0, policy_version 690 (0.0006) [2023-03-06 20:59:34,122][62475] Updated weights for policy 0, policy_version 700 (0.0006) [2023-03-06 20:59:34,914][62475] Updated weights for policy 0, policy_version 710 (0.0006) [2023-03-06 20:59:35,737][62475] Updated weights for policy 0, policy_version 720 (0.0006) [2023-03-06 20:59:36,521][62475] Updated weights for policy 0, policy_version 730 (0.0007) [2023-03-06 20:59:37,317][62475] Updated weights for policy 0, policy_version 740 (0.0006) [2023-03-06 20:59:37,390][62145] Fps is (10 sec: 12902.5, 60 sec: 12646.4, 300 sec: 11673.6). Total num frames: 758784. Throughput: 0: 12868.0. Samples: 753761. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 20:59:37,390][62145] Avg episode reward: [(0, '420.230')] [2023-03-06 20:59:37,390][62424] Saving new best policy, reward=420.230! [2023-03-06 20:59:38,119][62475] Updated weights for policy 0, policy_version 750 (0.0006) [2023-03-06 20:59:38,914][62475] Updated weights for policy 0, policy_version 760 (0.0006) [2023-03-06 20:59:39,708][62475] Updated weights for policy 0, policy_version 770 (0.0007) [2023-03-06 20:59:40,525][62475] Updated weights for policy 0, policy_version 780 (0.0007) [2023-03-06 20:59:41,314][62475] Updated weights for policy 0, policy_version 790 (0.0006) [2023-03-06 20:59:42,106][62475] Updated weights for policy 0, policy_version 800 (0.0006) [2023-03-06 20:59:42,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 11746.8). Total num frames: 822272. Throughput: 0: 12870.4. Samples: 792431. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 20:59:42,390][62145] Avg episode reward: [(0, '403.176')] [2023-03-06 20:59:42,917][62475] Updated weights for policy 0, policy_version 810 (0.0006) [2023-03-06 20:59:43,707][62475] Updated weights for policy 0, policy_version 820 (0.0006) [2023-03-06 20:59:44,508][62475] Updated weights for policy 0, policy_version 830 (0.0007) [2023-03-06 20:59:45,330][62475] Updated weights for policy 0, policy_version 840 (0.0006) [2023-03-06 20:59:46,141][62475] Updated weights for policy 0, policy_version 850 (0.0006) [2023-03-06 20:59:46,928][62475] Updated weights for policy 0, policy_version 860 (0.0006) [2023-03-06 20:59:47,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12834.1, 300 sec: 11810.1). Total num frames: 885760. Throughput: 0: 12859.0. Samples: 868795. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 20:59:47,390][62145] Avg episode reward: [(0, '425.799')] [2023-03-06 20:59:47,404][62424] Saving new best policy, reward=425.799! [2023-03-06 20:59:47,726][62475] Updated weights for policy 0, policy_version 870 (0.0006) [2023-03-06 20:59:48,525][62475] Updated weights for policy 0, policy_version 880 (0.0006) [2023-03-06 20:59:49,315][62475] Updated weights for policy 0, policy_version 890 (0.0007) [2023-03-06 20:59:50,105][62475] Updated weights for policy 0, policy_version 900 (0.0007) [2023-03-06 20:59:50,910][62475] Updated weights for policy 0, policy_version 910 (0.0007) [2023-03-06 20:59:51,693][62475] Updated weights for policy 0, policy_version 920 (0.0006) [2023-03-06 20:59:52,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 11878.4). Total num frames: 950272. Throughput: 0: 12861.1. Samples: 946048. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 20:59:52,390][62145] Avg episode reward: [(0, '433.428')] [2023-03-06 20:59:52,402][62424] Saving new best policy, reward=433.428! [2023-03-06 20:59:52,489][62475] Updated weights for policy 0, policy_version 930 (0.0006) [2023-03-06 20:59:53,304][62475] Updated weights for policy 0, policy_version 940 (0.0007) [2023-03-06 20:59:54,105][62475] Updated weights for policy 0, policy_version 950 (0.0007) [2023-03-06 20:59:54,889][62475] Updated weights for policy 0, policy_version 960 (0.0006) [2023-03-06 20:59:55,697][62475] Updated weights for policy 0, policy_version 970 (0.0006) [2023-03-06 20:59:56,489][62475] Updated weights for policy 0, policy_version 980 (0.0007) [2023-03-06 20:59:57,274][62475] Updated weights for policy 0, policy_version 990 (0.0006) [2023-03-06 20:59:57,390][62145] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 11938.6). Total num frames: 1014784. Throughput: 0: 12857.1. Samples: 984537. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 20:59:57,390][62145] Avg episode reward: [(0, '459.865')] [2023-03-06 20:59:57,391][62424] Saving new best policy, reward=459.865! [2023-03-06 20:59:58,096][62475] Updated weights for policy 0, policy_version 1000 (0.0006) [2023-03-06 20:59:58,879][62475] Updated weights for policy 0, policy_version 1010 (0.0006) [2023-03-06 20:59:59,655][62475] Updated weights for policy 0, policy_version 1020 (0.0006) [2023-03-06 21:00:00,472][62475] Updated weights for policy 0, policy_version 1030 (0.0006) [2023-03-06 21:00:01,258][62475] Updated weights for policy 0, policy_version 1040 (0.0006) [2023-03-06 21:00:02,043][62475] Updated weights for policy 0, policy_version 1050 (0.0005) [2023-03-06 21:00:02,389][62145] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 11992.2). Total num frames: 1079296. Throughput: 0: 12856.6. Samples: 1061670. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:00:02,390][62145] Avg episode reward: [(0, '455.382')] [2023-03-06 21:00:02,864][62475] Updated weights for policy 0, policy_version 1060 (0.0006) [2023-03-06 21:00:03,650][62475] Updated weights for policy 0, policy_version 1070 (0.0006) [2023-03-06 21:00:04,441][62475] Updated weights for policy 0, policy_version 1080 (0.0006) [2023-03-06 21:00:05,247][62475] Updated weights for policy 0, policy_version 1090 (0.0006) [2023-03-06 21:00:06,038][62475] Updated weights for policy 0, policy_version 1100 (0.0006) [2023-03-06 21:00:06,850][62475] Updated weights for policy 0, policy_version 1110 (0.0006) [2023-03-06 21:00:07,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12851.2, 300 sec: 12029.3). Total num frames: 1142784. Throughput: 0: 12849.0. Samples: 1138724. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 21:00:07,390][62145] Avg episode reward: [(0, '470.029')] [2023-03-06 21:00:07,390][62424] Saving new best policy, reward=470.029! [2023-03-06 21:00:07,651][62475] Updated weights for policy 0, policy_version 1120 (0.0006) [2023-03-06 21:00:08,421][62475] Updated weights for policy 0, policy_version 1130 (0.0006) [2023-03-06 21:00:09,235][62475] Updated weights for policy 0, policy_version 1140 (0.0007) [2023-03-06 21:00:10,015][62475] Updated weights for policy 0, policy_version 1150 (0.0007) [2023-03-06 21:00:10,819][62475] Updated weights for policy 0, policy_version 1160 (0.0006) [2023-03-06 21:00:11,598][62475] Updated weights for policy 0, policy_version 1170 (0.0007) [2023-03-06 21:00:12,389][62475] Updated weights for policy 0, policy_version 1180 (0.0006) [2023-03-06 21:00:12,390][62145] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12083.2). Total num frames: 1208320. Throughput: 0: 12848.0. Samples: 1177537. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:00:12,390][62145] Avg episode reward: [(0, '501.321')] [2023-03-06 21:00:12,393][62424] Saving new best policy, reward=501.321! [2023-03-06 21:00:13,206][62475] Updated weights for policy 0, policy_version 1190 (0.0006) [2023-03-06 21:00:13,985][62475] Updated weights for policy 0, policy_version 1200 (0.0006) [2023-03-06 21:00:14,766][62475] Updated weights for policy 0, policy_version 1210 (0.0007) [2023-03-06 21:00:15,606][62475] Updated weights for policy 0, policy_version 1220 (0.0007) [2023-03-06 21:00:16,403][62475] Updated weights for policy 0, policy_version 1230 (0.0007) [2023-03-06 21:00:17,195][62475] Updated weights for policy 0, policy_version 1240 (0.0007) [2023-03-06 21:00:17,390][62145] Fps is (10 sec: 12902.3, 60 sec: 12851.2, 300 sec: 12112.5). Total num frames: 1271808. Throughput: 0: 12839.7. Samples: 1254443. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:00:17,390][62145] Avg episode reward: [(0, '520.870')] [2023-03-06 21:00:17,391][62424] Saving new best policy, reward=520.870! [2023-03-06 21:00:18,006][62475] Updated weights for policy 0, policy_version 1250 (0.0006) [2023-03-06 21:00:18,801][62475] Updated weights for policy 0, policy_version 1260 (0.0007) [2023-03-06 21:00:19,589][62475] Updated weights for policy 0, policy_version 1270 (0.0006) [2023-03-06 21:00:20,395][62475] Updated weights for policy 0, policy_version 1280 (0.0006) [2023-03-06 21:00:21,183][62475] Updated weights for policy 0, policy_version 1290 (0.0006) [2023-03-06 21:00:21,968][62475] Updated weights for policy 0, policy_version 1300 (0.0005) [2023-03-06 21:00:22,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12148.4). Total num frames: 1336320. Throughput: 0: 12842.7. Samples: 1331685. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:00:22,390][62145] Avg episode reward: [(0, '528.874')] [2023-03-06 21:00:22,394][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000001305_1336320.pth... [2023-03-06 21:00:22,427][62424] Saving new best policy, reward=528.874! [2023-03-06 21:00:22,789][62475] Updated weights for policy 0, policy_version 1310 (0.0006) [2023-03-06 21:00:23,574][62475] Updated weights for policy 0, policy_version 1320 (0.0007) [2023-03-06 21:00:24,358][62475] Updated weights for policy 0, policy_version 1330 (0.0007) [2023-03-06 21:00:25,161][62475] Updated weights for policy 0, policy_version 1340 (0.0007) [2023-03-06 21:00:25,946][62475] Updated weights for policy 0, policy_version 1350 (0.0006) [2023-03-06 21:00:26,755][62475] Updated weights for policy 0, policy_version 1360 (0.0007) [2023-03-06 21:00:27,390][62145] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12181.2). Total num frames: 1400832. Throughput: 0: 12839.5. Samples: 1370207. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:00:27,390][62145] Avg episode reward: [(0, '522.521')] [2023-03-06 21:00:27,546][62475] Updated weights for policy 0, policy_version 1370 (0.0006) [2023-03-06 21:00:28,333][62475] Updated weights for policy 0, policy_version 1380 (0.0006) [2023-03-06 21:00:29,126][62475] Updated weights for policy 0, policy_version 1390 (0.0006) [2023-03-06 21:00:29,930][62475] Updated weights for policy 0, policy_version 1400 (0.0007) [2023-03-06 21:00:30,726][62475] Updated weights for policy 0, policy_version 1410 (0.0006) [2023-03-06 21:00:31,517][62475] Updated weights for policy 0, policy_version 1420 (0.0006) [2023-03-06 21:00:32,335][62475] Updated weights for policy 0, policy_version 1430 (0.0006) [2023-03-06 21:00:32,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12834.1, 300 sec: 12202.7). Total num frames: 1464320. Throughput: 0: 12860.1. Samples: 1447499. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:00:32,390][62145] Avg episode reward: [(0, '516.498')] [2023-03-06 21:00:33,133][62475] Updated weights for policy 0, policy_version 1440 (0.0006) [2023-03-06 21:00:33,906][62475] Updated weights for policy 0, policy_version 1450 (0.0006) [2023-03-06 21:00:34,722][62475] Updated weights for policy 0, policy_version 1460 (0.0006) [2023-03-06 21:00:35,508][62475] Updated weights for policy 0, policy_version 1470 (0.0007) [2023-03-06 21:00:36,309][62475] Updated weights for policy 0, policy_version 1480 (0.0006) [2023-03-06 21:00:37,122][62475] Updated weights for policy 0, policy_version 1490 (0.0006) [2023-03-06 21:00:37,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12230.7). Total num frames: 1528832. Throughput: 0: 12853.9. Samples: 1524475. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:00:37,390][62145] Avg episode reward: [(0, '521.847')] [2023-03-06 21:00:37,933][62475] Updated weights for policy 0, policy_version 1500 (0.0008) [2023-03-06 21:00:38,709][62475] Updated weights for policy 0, policy_version 1510 (0.0006) [2023-03-06 21:00:39,503][62475] Updated weights for policy 0, policy_version 1520 (0.0006) [2023-03-06 21:00:40,280][62475] Updated weights for policy 0, policy_version 1530 (0.0007) [2023-03-06 21:00:41,079][62475] Updated weights for policy 0, policy_version 1540 (0.0007) [2023-03-06 21:00:41,889][62475] Updated weights for policy 0, policy_version 1550 (0.0006) [2023-03-06 21:00:42,390][62145] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12256.5). Total num frames: 1593344. Throughput: 0: 12856.3. Samples: 1563069. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:00:42,390][62145] Avg episode reward: [(0, '533.990')] [2023-03-06 21:00:42,394][62424] Saving new best policy, reward=533.990! [2023-03-06 21:00:42,678][62475] Updated weights for policy 0, policy_version 1560 (0.0005) [2023-03-06 21:00:43,486][62475] Updated weights for policy 0, policy_version 1570 (0.0006) [2023-03-06 21:00:44,276][62475] Updated weights for policy 0, policy_version 1580 (0.0006) [2023-03-06 21:00:45,060][62475] Updated weights for policy 0, policy_version 1590 (0.0006) [2023-03-06 21:00:45,875][62475] Updated weights for policy 0, policy_version 1600 (0.0006) [2023-03-06 21:00:46,654][62475] Updated weights for policy 0, policy_version 1610 (0.0005) [2023-03-06 21:00:47,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12272.8). Total num frames: 1656832. Throughput: 0: 12856.7. Samples: 1640223. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:00:47,401][62145] Avg episode reward: [(0, '526.765')] [2023-03-06 21:00:47,474][62475] Updated weights for policy 0, policy_version 1620 (0.0006) [2023-03-06 21:00:48,279][62475] Updated weights for policy 0, policy_version 1630 (0.0006) [2023-03-06 21:00:49,091][62475] Updated weights for policy 0, policy_version 1640 (0.0006) [2023-03-06 21:00:49,867][62475] Updated weights for policy 0, policy_version 1650 (0.0006) [2023-03-06 21:00:50,686][62475] Updated weights for policy 0, policy_version 1660 (0.0007) [2023-03-06 21:00:51,473][62475] Updated weights for policy 0, policy_version 1670 (0.0007) [2023-03-06 21:00:52,283][62475] Updated weights for policy 0, policy_version 1680 (0.0006) [2023-03-06 21:00:52,390][62145] Fps is (10 sec: 12800.1, 60 sec: 12851.2, 300 sec: 12295.3). Total num frames: 1721344. Throughput: 0: 12853.4. Samples: 1717129. Policy #0 lag: (min: 0.0, avg: 1.2, max: 4.0) [2023-03-06 21:00:52,390][62145] Avg episode reward: [(0, '527.651')] [2023-03-06 21:00:53,087][62475] Updated weights for policy 0, policy_version 1690 (0.0005) [2023-03-06 21:00:53,875][62475] Updated weights for policy 0, policy_version 1700 (0.0006) [2023-03-06 21:00:54,654][62475] Updated weights for policy 0, policy_version 1710 (0.0006) [2023-03-06 21:00:55,445][62475] Updated weights for policy 0, policy_version 1720 (0.0006) [2023-03-06 21:00:56,249][62475] Updated weights for policy 0, policy_version 1730 (0.0007) [2023-03-06 21:00:57,063][62475] Updated weights for policy 0, policy_version 1740 (0.0006) [2023-03-06 21:00:57,390][62145] Fps is (10 sec: 12902.3, 60 sec: 12851.2, 300 sec: 12316.2). Total num frames: 1785856. Throughput: 0: 12845.5. Samples: 1755586. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:00:57,390][62145] Avg episode reward: [(0, '516.422')] [2023-03-06 21:00:57,868][62475] Updated weights for policy 0, policy_version 1750 (0.0007) [2023-03-06 21:00:58,664][62475] Updated weights for policy 0, policy_version 1760 (0.0006) [2023-03-06 21:00:59,464][62475] Updated weights for policy 0, policy_version 1770 (0.0006) [2023-03-06 21:01:00,255][62475] Updated weights for policy 0, policy_version 1780 (0.0006) [2023-03-06 21:01:01,041][62475] Updated weights for policy 0, policy_version 1790 (0.0006) [2023-03-06 21:01:01,843][62475] Updated weights for policy 0, policy_version 1800 (0.0006) [2023-03-06 21:01:02,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12329.0). Total num frames: 1849344. Throughput: 0: 12849.2. Samples: 1832659. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:01:02,390][62145] Avg episode reward: [(0, '536.707')] [2023-03-06 21:01:02,395][62424] Saving new best policy, reward=536.707! [2023-03-06 21:01:02,649][62475] Updated weights for policy 0, policy_version 1810 (0.0006) [2023-03-06 21:01:03,447][62475] Updated weights for policy 0, policy_version 1820 (0.0007) [2023-03-06 21:01:04,221][62475] Updated weights for policy 0, policy_version 1830 (0.0006) [2023-03-06 21:01:05,034][62475] Updated weights for policy 0, policy_version 1840 (0.0007) [2023-03-06 21:01:05,832][62475] Updated weights for policy 0, policy_version 1850 (0.0007) [2023-03-06 21:01:06,630][62475] Updated weights for policy 0, policy_version 1860 (0.0006) [2023-03-06 21:01:07,389][62145] Fps is (10 sec: 12800.2, 60 sec: 12851.2, 300 sec: 12347.5). Total num frames: 1913856. Throughput: 0: 12835.9. Samples: 1909299. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:01:07,390][62145] Avg episode reward: [(0, '516.925')] [2023-03-06 21:01:07,446][62475] Updated weights for policy 0, policy_version 1870 (0.0006) [2023-03-06 21:01:08,231][62475] Updated weights for policy 0, policy_version 1880 (0.0007) [2023-03-06 21:01:09,025][62475] Updated weights for policy 0, policy_version 1890 (0.0006) [2023-03-06 21:01:09,821][62475] Updated weights for policy 0, policy_version 1900 (0.0005) [2023-03-06 21:01:10,611][62475] Updated weights for policy 0, policy_version 1910 (0.0006) [2023-03-06 21:01:11,401][62475] Updated weights for policy 0, policy_version 1920 (0.0006) [2023-03-06 21:01:12,200][62475] Updated weights for policy 0, policy_version 1930 (0.0006) [2023-03-06 21:01:12,390][62145] Fps is (10 sec: 12902.4, 60 sec: 12834.1, 300 sec: 12364.8). Total num frames: 1978368. Throughput: 0: 12843.1. Samples: 1948148. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:01:12,390][62145] Avg episode reward: [(0, '532.657')] [2023-03-06 21:01:13,015][62475] Updated weights for policy 0, policy_version 1940 (0.0007) [2023-03-06 21:01:13,792][62475] Updated weights for policy 0, policy_version 1950 (0.0006) [2023-03-06 21:01:14,598][62475] Updated weights for policy 0, policy_version 1960 (0.0006) [2023-03-06 21:01:15,401][62475] Updated weights for policy 0, policy_version 1970 (0.0007) [2023-03-06 21:01:16,216][62475] Updated weights for policy 0, policy_version 1980 (0.0006) [2023-03-06 21:01:17,012][62475] Updated weights for policy 0, policy_version 1990 (0.0006) [2023-03-06 21:01:17,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12834.1, 300 sec: 12374.9). Total num frames: 2041856. Throughput: 0: 12832.3. Samples: 2024951. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:01:17,390][62145] Avg episode reward: [(0, '524.322')] [2023-03-06 21:01:17,807][62475] Updated weights for policy 0, policy_version 2000 (0.0006) [2023-03-06 21:01:18,612][62475] Updated weights for policy 0, policy_version 2010 (0.0008) [2023-03-06 21:01:19,405][62475] Updated weights for policy 0, policy_version 2020 (0.0006) [2023-03-06 21:01:20,207][62475] Updated weights for policy 0, policy_version 2030 (0.0006) [2023-03-06 21:01:21,013][62475] Updated weights for policy 0, policy_version 2040 (0.0006) [2023-03-06 21:01:21,806][62475] Updated weights for policy 0, policy_version 2050 (0.0006) [2023-03-06 21:01:22,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12390.4). Total num frames: 2106368. Throughput: 0: 12830.1. Samples: 2101829. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:01:22,390][62145] Avg episode reward: [(0, '531.458')] [2023-03-06 21:01:22,599][62475] Updated weights for policy 0, policy_version 2060 (0.0007) [2023-03-06 21:01:23,399][62475] Updated weights for policy 0, policy_version 2070 (0.0006) [2023-03-06 21:01:24,213][62475] Updated weights for policy 0, policy_version 2080 (0.0006) [2023-03-06 21:01:25,010][62475] Updated weights for policy 0, policy_version 2090 (0.0007) [2023-03-06 21:01:25,789][62475] Updated weights for policy 0, policy_version 2100 (0.0007) [2023-03-06 21:01:26,579][62475] Updated weights for policy 0, policy_version 2110 (0.0006) [2023-03-06 21:01:27,373][62475] Updated weights for policy 0, policy_version 2120 (0.0006) [2023-03-06 21:01:27,389][62145] Fps is (10 sec: 12902.4, 60 sec: 12834.1, 300 sec: 12405.0). Total num frames: 2170880. Throughput: 0: 12827.5. Samples: 2140306. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:01:27,390][62145] Avg episode reward: [(0, '540.961')] [2023-03-06 21:01:27,390][62424] Saving new best policy, reward=540.961! [2023-03-06 21:01:28,203][62475] Updated weights for policy 0, policy_version 2130 (0.0006) [2023-03-06 21:01:28,988][62475] Updated weights for policy 0, policy_version 2140 (0.0007) [2023-03-06 21:01:29,794][62475] Updated weights for policy 0, policy_version 2150 (0.0006) [2023-03-06 21:01:30,598][62475] Updated weights for policy 0, policy_version 2160 (0.0006) [2023-03-06 21:01:31,381][62475] Updated weights for policy 0, policy_version 2170 (0.0006) [2023-03-06 21:01:32,192][62475] Updated weights for policy 0, policy_version 2180 (0.0006) [2023-03-06 21:01:32,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12834.2, 300 sec: 12413.2). Total num frames: 2234368. Throughput: 0: 12821.6. Samples: 2217194. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:01:32,390][62145] Avg episode reward: [(0, '531.790')] [2023-03-06 21:01:32,985][62475] Updated weights for policy 0, policy_version 2190 (0.0006) [2023-03-06 21:01:33,757][62475] Updated weights for policy 0, policy_version 2200 (0.0006) [2023-03-06 21:01:34,564][62475] Updated weights for policy 0, policy_version 2210 (0.0007) [2023-03-06 21:01:35,344][62475] Updated weights for policy 0, policy_version 2220 (0.0006) [2023-03-06 21:01:36,147][62475] Updated weights for policy 0, policy_version 2230 (0.0006) [2023-03-06 21:01:36,948][62475] Updated weights for policy 0, policy_version 2240 (0.0006) [2023-03-06 21:01:37,390][62145] Fps is (10 sec: 12799.8, 60 sec: 12834.1, 300 sec: 12426.4). Total num frames: 2298880. Throughput: 0: 12831.4. Samples: 2294541. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:01:37,390][62145] Avg episode reward: [(0, '535.697')] [2023-03-06 21:01:37,741][62475] Updated weights for policy 0, policy_version 2250 (0.0007) [2023-03-06 21:01:38,541][62475] Updated weights for policy 0, policy_version 2260 (0.0006) [2023-03-06 21:01:39,330][62475] Updated weights for policy 0, policy_version 2270 (0.0005) [2023-03-06 21:01:40,137][62475] Updated weights for policy 0, policy_version 2280 (0.0007) [2023-03-06 21:01:40,933][62475] Updated weights for policy 0, policy_version 2290 (0.0007) [2023-03-06 21:01:41,728][62475] Updated weights for policy 0, policy_version 2300 (0.0006) [2023-03-06 21:01:42,390][62145] Fps is (10 sec: 12902.3, 60 sec: 12834.1, 300 sec: 12438.9). Total num frames: 2363392. Throughput: 0: 12833.3. Samples: 2333085. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:01:42,390][62145] Avg episode reward: [(0, '552.186')] [2023-03-06 21:01:42,394][62424] Saving new best policy, reward=552.186! [2023-03-06 21:01:42,516][62475] Updated weights for policy 0, policy_version 2310 (0.0006) [2023-03-06 21:01:43,335][62475] Updated weights for policy 0, policy_version 2320 (0.0006) [2023-03-06 21:01:44,133][62475] Updated weights for policy 0, policy_version 2330 (0.0007) [2023-03-06 21:01:44,918][62475] Updated weights for policy 0, policy_version 2340 (0.0006) [2023-03-06 21:01:45,725][62475] Updated weights for policy 0, policy_version 2350 (0.0006) [2023-03-06 21:01:46,529][62475] Updated weights for policy 0, policy_version 2360 (0.0006) [2023-03-06 21:01:47,314][62475] Updated weights for policy 0, policy_version 2370 (0.0006) [2023-03-06 21:01:47,390][62145] Fps is (10 sec: 12800.1, 60 sec: 12834.1, 300 sec: 12445.5). Total num frames: 2426880. Throughput: 0: 12832.0. Samples: 2410099. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:01:47,390][62145] Avg episode reward: [(0, '551.415')] [2023-03-06 21:01:48,115][62475] Updated weights for policy 0, policy_version 2380 (0.0007) [2023-03-06 21:01:48,918][62475] Updated weights for policy 0, policy_version 2390 (0.0006) [2023-03-06 21:01:49,708][62475] Updated weights for policy 0, policy_version 2400 (0.0007) [2023-03-06 21:01:50,496][62475] Updated weights for policy 0, policy_version 2410 (0.0007) [2023-03-06 21:01:51,274][62475] Updated weights for policy 0, policy_version 2420 (0.0006) [2023-03-06 21:01:52,079][62475] Updated weights for policy 0, policy_version 2430 (0.0006) [2023-03-06 21:01:52,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12457.0). Total num frames: 2491392. Throughput: 0: 12847.2. Samples: 2487423. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:01:52,390][62145] Avg episode reward: [(0, '559.934')] [2023-03-06 21:01:52,404][62424] Saving new best policy, reward=559.934! [2023-03-06 21:01:52,895][62475] Updated weights for policy 0, policy_version 2440 (0.0007) [2023-03-06 21:01:53,688][62475] Updated weights for policy 0, policy_version 2450 (0.0005) [2023-03-06 21:01:54,494][62475] Updated weights for policy 0, policy_version 2460 (0.0006) [2023-03-06 21:01:55,289][62475] Updated weights for policy 0, policy_version 2470 (0.0006) [2023-03-06 21:01:56,077][62475] Updated weights for policy 0, policy_version 2480 (0.0007) [2023-03-06 21:01:56,882][62475] Updated weights for policy 0, policy_version 2490 (0.0007) [2023-03-06 21:01:57,390][62145] Fps is (10 sec: 12902.3, 60 sec: 12834.1, 300 sec: 12467.8). Total num frames: 2555904. Throughput: 0: 12835.4. Samples: 2525741. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:01:57,390][62145] Avg episode reward: [(0, '540.358')] [2023-03-06 21:01:57,669][62475] Updated weights for policy 0, policy_version 2500 (0.0007) [2023-03-06 21:01:58,468][62475] Updated weights for policy 0, policy_version 2510 (0.0007) [2023-03-06 21:01:59,277][62475] Updated weights for policy 0, policy_version 2520 (0.0007) [2023-03-06 21:02:00,068][62475] Updated weights for policy 0, policy_version 2530 (0.0005) [2023-03-06 21:02:00,864][62475] Updated weights for policy 0, policy_version 2540 (0.0008) [2023-03-06 21:02:01,666][62475] Updated weights for policy 0, policy_version 2550 (0.0006) [2023-03-06 21:02:02,389][62145] Fps is (10 sec: 12902.6, 60 sec: 12851.2, 300 sec: 12478.2). Total num frames: 2620416. Throughput: 0: 12843.5. Samples: 2602910. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:02:02,390][62145] Avg episode reward: [(0, '547.591')] [2023-03-06 21:02:02,471][62475] Updated weights for policy 0, policy_version 2560 (0.0006) [2023-03-06 21:02:03,250][62475] Updated weights for policy 0, policy_version 2570 (0.0006) [2023-03-06 21:02:04,067][62475] Updated weights for policy 0, policy_version 2580 (0.0006) [2023-03-06 21:02:04,883][62475] Updated weights for policy 0, policy_version 2590 (0.0007) [2023-03-06 21:02:05,673][62475] Updated weights for policy 0, policy_version 2600 (0.0006) [2023-03-06 21:02:06,469][62475] Updated weights for policy 0, policy_version 2610 (0.0006) [2023-03-06 21:02:07,264][62475] Updated weights for policy 0, policy_version 2620 (0.0007) [2023-03-06 21:02:07,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12834.1, 300 sec: 12483.3). Total num frames: 2683904. Throughput: 0: 12840.5. Samples: 2679652. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:02:07,390][62145] Avg episode reward: [(0, '551.096')] [2023-03-06 21:02:08,047][62475] Updated weights for policy 0, policy_version 2630 (0.0006) [2023-03-06 21:02:08,862][62475] Updated weights for policy 0, policy_version 2640 (0.0007) [2023-03-06 21:02:09,654][62475] Updated weights for policy 0, policy_version 2650 (0.0006) [2023-03-06 21:02:10,442][62475] Updated weights for policy 0, policy_version 2660 (0.0006) [2023-03-06 21:02:11,249][62475] Updated weights for policy 0, policy_version 2670 (0.0006) [2023-03-06 21:02:12,045][62475] Updated weights for policy 0, policy_version 2680 (0.0006) [2023-03-06 21:02:12,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12492.8). Total num frames: 2748416. Throughput: 0: 12840.4. Samples: 2718123. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:02:12,390][62145] Avg episode reward: [(0, '565.032')] [2023-03-06 21:02:12,394][62424] Saving new best policy, reward=565.032! [2023-03-06 21:02:12,834][62475] Updated weights for policy 0, policy_version 2690 (0.0006) [2023-03-06 21:02:13,633][62475] Updated weights for policy 0, policy_version 2700 (0.0006) [2023-03-06 21:02:14,414][62475] Updated weights for policy 0, policy_version 2710 (0.0006) [2023-03-06 21:02:15,212][62475] Updated weights for policy 0, policy_version 2720 (0.0006) [2023-03-06 21:02:16,002][62475] Updated weights for policy 0, policy_version 2730 (0.0006) [2023-03-06 21:02:16,799][62475] Updated weights for policy 0, policy_version 2740 (0.0006) [2023-03-06 21:02:17,390][62145] Fps is (10 sec: 12902.3, 60 sec: 12851.2, 300 sec: 12501.9). Total num frames: 2812928. Throughput: 0: 12854.1. Samples: 2795631. Policy #0 lag: (min: 0.0, avg: 1.2, max: 4.0) [2023-03-06 21:02:17,390][62145] Avg episode reward: [(0, '570.065')] [2023-03-06 21:02:17,391][62424] Saving new best policy, reward=570.065! [2023-03-06 21:02:17,587][62475] Updated weights for policy 0, policy_version 2750 (0.0007) [2023-03-06 21:02:18,388][62475] Updated weights for policy 0, policy_version 2760 (0.0006) [2023-03-06 21:02:19,198][62475] Updated weights for policy 0, policy_version 2770 (0.0006) [2023-03-06 21:02:19,993][62475] Updated weights for policy 0, policy_version 2780 (0.0006) [2023-03-06 21:02:20,784][62475] Updated weights for policy 0, policy_version 2790 (0.0005) [2023-03-06 21:02:21,576][62475] Updated weights for policy 0, policy_version 2800 (0.0006) [2023-03-06 21:02:22,368][62475] Updated weights for policy 0, policy_version 2810 (0.0006) [2023-03-06 21:02:22,390][62145] Fps is (10 sec: 12902.2, 60 sec: 12851.2, 300 sec: 12510.6). Total num frames: 2877440. Throughput: 0: 12848.7. Samples: 2872732. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:02:22,390][62145] Avg episode reward: [(0, '573.992')] [2023-03-06 21:02:22,395][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000002810_2877440.pth... [2023-03-06 21:02:22,425][62424] Saving new best policy, reward=573.992! [2023-03-06 21:02:23,179][62475] Updated weights for policy 0, policy_version 2820 (0.0007) [2023-03-06 21:02:23,972][62475] Updated weights for policy 0, policy_version 2830 (0.0006) [2023-03-06 21:02:24,757][62475] Updated weights for policy 0, policy_version 2840 (0.0007) [2023-03-06 21:02:25,545][62475] Updated weights for policy 0, policy_version 2850 (0.0006) [2023-03-06 21:02:26,356][62475] Updated weights for policy 0, policy_version 2860 (0.0007) [2023-03-06 21:02:27,161][62475] Updated weights for policy 0, policy_version 2870 (0.0007) [2023-03-06 21:02:27,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12834.1, 300 sec: 12514.6). Total num frames: 2940928. Throughput: 0: 12849.5. Samples: 2911309. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:02:27,390][62145] Avg episode reward: [(0, '592.941')] [2023-03-06 21:02:27,392][62424] Saving new best policy, reward=592.941! [2023-03-06 21:02:27,964][62475] Updated weights for policy 0, policy_version 2880 (0.0006) [2023-03-06 21:02:28,771][62475] Updated weights for policy 0, policy_version 2890 (0.0006) [2023-03-06 21:02:29,558][62475] Updated weights for policy 0, policy_version 2900 (0.0006) [2023-03-06 21:02:30,354][62475] Updated weights for policy 0, policy_version 2910 (0.0006) [2023-03-06 21:02:31,140][62475] Updated weights for policy 0, policy_version 2920 (0.0006) [2023-03-06 21:02:31,961][62475] Updated weights for policy 0, policy_version 2930 (0.0007) [2023-03-06 21:02:32,390][62145] Fps is (10 sec: 12800.1, 60 sec: 12851.2, 300 sec: 12522.7). Total num frames: 3005440. Throughput: 0: 12848.0. Samples: 2988259. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 21:02:32,401][62145] Avg episode reward: [(0, '575.478')] [2023-03-06 21:02:32,744][62475] Updated weights for policy 0, policy_version 2940 (0.0006) [2023-03-06 21:02:33,562][62475] Updated weights for policy 0, policy_version 2950 (0.0006) [2023-03-06 21:02:34,356][62475] Updated weights for policy 0, policy_version 2960 (0.0008) [2023-03-06 21:02:35,157][62475] Updated weights for policy 0, policy_version 2970 (0.0006) [2023-03-06 21:02:35,952][62475] Updated weights for policy 0, policy_version 2980 (0.0005) [2023-03-06 21:02:36,755][62475] Updated weights for policy 0, policy_version 2990 (0.0006) [2023-03-06 21:02:37,389][62145] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12530.4). Total num frames: 3069952. Throughput: 0: 12836.3. Samples: 3065053. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:02:37,400][62145] Avg episode reward: [(0, '593.942')] [2023-03-06 21:02:37,401][62424] Saving new best policy, reward=593.942! [2023-03-06 21:02:37,547][62475] Updated weights for policy 0, policy_version 3000 (0.0007) [2023-03-06 21:02:38,348][62475] Updated weights for policy 0, policy_version 3010 (0.0006) [2023-03-06 21:02:39,143][62475] Updated weights for policy 0, policy_version 3020 (0.0006) [2023-03-06 21:02:39,943][62475] Updated weights for policy 0, policy_version 3030 (0.0006) [2023-03-06 21:02:40,735][62475] Updated weights for policy 0, policy_version 3040 (0.0006) [2023-03-06 21:02:41,549][62475] Updated weights for policy 0, policy_version 3050 (0.0007) [2023-03-06 21:02:42,332][62475] Updated weights for policy 0, policy_version 3060 (0.0008) [2023-03-06 21:02:42,390][62145] Fps is (10 sec: 12800.1, 60 sec: 12834.1, 300 sec: 12533.8). Total num frames: 3133440. Throughput: 0: 12843.3. Samples: 3103688. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:02:42,401][62145] Avg episode reward: [(0, '577.955')] [2023-03-06 21:02:43,127][62475] Updated weights for policy 0, policy_version 3070 (0.0006) [2023-03-06 21:02:43,926][62475] Updated weights for policy 0, policy_version 3080 (0.0007) [2023-03-06 21:02:44,737][62475] Updated weights for policy 0, policy_version 3090 (0.0006) [2023-03-06 21:02:45,529][62475] Updated weights for policy 0, policy_version 3100 (0.0006) [2023-03-06 21:02:46,323][62475] Updated weights for policy 0, policy_version 3110 (0.0007) [2023-03-06 21:02:47,112][62475] Updated weights for policy 0, policy_version 3120 (0.0007) [2023-03-06 21:02:47,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12541.0). Total num frames: 3197952. Throughput: 0: 12839.0. Samples: 3180664. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:02:47,400][62145] Avg episode reward: [(0, '585.583')] [2023-03-06 21:02:47,912][62475] Updated weights for policy 0, policy_version 3130 (0.0006) [2023-03-06 21:02:48,710][62475] Updated weights for policy 0, policy_version 3140 (0.0006) [2023-03-06 21:02:49,478][62475] Updated weights for policy 0, policy_version 3150 (0.0006) [2023-03-06 21:02:50,292][62475] Updated weights for policy 0, policy_version 3160 (0.0006) [2023-03-06 21:02:51,096][62475] Updated weights for policy 0, policy_version 3170 (0.0007) [2023-03-06 21:02:51,884][62475] Updated weights for policy 0, policy_version 3180 (0.0007) [2023-03-06 21:02:52,390][62145] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12547.9). Total num frames: 3262464. Throughput: 0: 12851.2. Samples: 3257958. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:02:52,390][62145] Avg episode reward: [(0, '568.399')] [2023-03-06 21:02:52,684][62475] Updated weights for policy 0, policy_version 3190 (0.0006) [2023-03-06 21:02:53,473][62475] Updated weights for policy 0, policy_version 3200 (0.0007) [2023-03-06 21:02:54,260][62475] Updated weights for policy 0, policy_version 3210 (0.0006) [2023-03-06 21:02:55,063][62475] Updated weights for policy 0, policy_version 3220 (0.0006) [2023-03-06 21:02:55,846][62475] Updated weights for policy 0, policy_version 3230 (0.0006) [2023-03-06 21:02:56,651][62475] Updated weights for policy 0, policy_version 3240 (0.0006) [2023-03-06 21:02:57,390][62145] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12554.6). Total num frames: 3326976. Throughput: 0: 12854.3. Samples: 3296567. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:02:57,390][62145] Avg episode reward: [(0, '575.860')] [2023-03-06 21:02:57,427][62475] Updated weights for policy 0, policy_version 3250 (0.0006) [2023-03-06 21:02:58,229][62475] Updated weights for policy 0, policy_version 3260 (0.0006) [2023-03-06 21:02:59,018][62475] Updated weights for policy 0, policy_version 3270 (0.0007) [2023-03-06 21:02:59,808][62475] Updated weights for policy 0, policy_version 3280 (0.0005) [2023-03-06 21:03:00,614][62475] Updated weights for policy 0, policy_version 3290 (0.0007) [2023-03-06 21:03:01,421][62475] Updated weights for policy 0, policy_version 3300 (0.0006) [2023-03-06 21:03:02,198][62475] Updated weights for policy 0, policy_version 3310 (0.0006) [2023-03-06 21:03:02,390][62145] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12561.1). Total num frames: 3391488. Throughput: 0: 12851.1. Samples: 3373929. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:03:02,390][62145] Avg episode reward: [(0, '573.386')] [2023-03-06 21:03:03,021][62475] Updated weights for policy 0, policy_version 3320 (0.0006) [2023-03-06 21:03:03,830][62475] Updated weights for policy 0, policy_version 3330 (0.0006) [2023-03-06 21:03:04,609][62475] Updated weights for policy 0, policy_version 3340 (0.0007) [2023-03-06 21:03:05,424][62475] Updated weights for policy 0, policy_version 3350 (0.0006) [2023-03-06 21:03:06,224][62475] Updated weights for policy 0, policy_version 3360 (0.0007) [2023-03-06 21:03:07,017][62475] Updated weights for policy 0, policy_version 3370 (0.0006) [2023-03-06 21:03:07,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12563.6). Total num frames: 3454976. Throughput: 0: 12846.8. Samples: 3450838. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:03:07,390][62145] Avg episode reward: [(0, '608.620')] [2023-03-06 21:03:07,391][62424] Saving new best policy, reward=608.620! [2023-03-06 21:03:07,814][62475] Updated weights for policy 0, policy_version 3380 (0.0007) [2023-03-06 21:03:08,638][62475] Updated weights for policy 0, policy_version 3390 (0.0006) [2023-03-06 21:03:09,407][62475] Updated weights for policy 0, policy_version 3400 (0.0007) [2023-03-06 21:03:10,200][62475] Updated weights for policy 0, policy_version 3410 (0.0007) [2023-03-06 21:03:11,004][62475] Updated weights for policy 0, policy_version 3420 (0.0007) [2023-03-06 21:03:11,797][62475] Updated weights for policy 0, policy_version 3430 (0.0006) [2023-03-06 21:03:12,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12851.2, 300 sec: 12569.6). Total num frames: 3519488. Throughput: 0: 12846.2. Samples: 3489386. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:03:12,390][62145] Avg episode reward: [(0, '602.895')] [2023-03-06 21:03:12,588][62475] Updated weights for policy 0, policy_version 3440 (0.0006) [2023-03-06 21:03:13,402][62475] Updated weights for policy 0, policy_version 3450 (0.0006) [2023-03-06 21:03:14,182][62475] Updated weights for policy 0, policy_version 3460 (0.0006) [2023-03-06 21:03:14,970][62475] Updated weights for policy 0, policy_version 3470 (0.0006) [2023-03-06 21:03:15,777][62475] Updated weights for policy 0, policy_version 3480 (0.0007) [2023-03-06 21:03:16,566][62475] Updated weights for policy 0, policy_version 3490 (0.0006) [2023-03-06 21:03:17,372][62475] Updated weights for policy 0, policy_version 3500 (0.0007) [2023-03-06 21:03:17,389][62145] Fps is (10 sec: 12902.5, 60 sec: 12851.2, 300 sec: 12575.4). Total num frames: 3584000. Throughput: 0: 12851.1. Samples: 3566555. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:03:17,390][62145] Avg episode reward: [(0, '596.456')] [2023-03-06 21:03:18,174][62475] Updated weights for policy 0, policy_version 3510 (0.0006) [2023-03-06 21:03:18,958][62475] Updated weights for policy 0, policy_version 3520 (0.0006) [2023-03-06 21:03:19,749][62475] Updated weights for policy 0, policy_version 3530 (0.0007) [2023-03-06 21:03:20,551][62475] Updated weights for policy 0, policy_version 3540 (0.0006) [2023-03-06 21:03:21,352][62475] Updated weights for policy 0, policy_version 3550 (0.0007) [2023-03-06 21:03:22,204][62475] Updated weights for policy 0, policy_version 3560 (0.0006) [2023-03-06 21:03:22,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12834.2, 300 sec: 12577.5). Total num frames: 3647488. Throughput: 0: 12857.1. Samples: 3643622. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:03:22,390][62145] Avg episode reward: [(0, '618.183')] [2023-03-06 21:03:22,394][62424] Saving new best policy, reward=618.183! [2023-03-06 21:03:23,004][62475] Updated weights for policy 0, policy_version 3570 (0.0007) [2023-03-06 21:03:23,789][62475] Updated weights for policy 0, policy_version 3580 (0.0005) [2023-03-06 21:03:24,598][62475] Updated weights for policy 0, policy_version 3590 (0.0006) [2023-03-06 21:03:25,404][62475] Updated weights for policy 0, policy_version 3600 (0.0006) [2023-03-06 21:03:26,179][62475] Updated weights for policy 0, policy_version 3610 (0.0005) [2023-03-06 21:03:26,978][62475] Updated weights for policy 0, policy_version 3620 (0.0006) [2023-03-06 21:03:27,389][62145] Fps is (10 sec: 12799.9, 60 sec: 12851.2, 300 sec: 12583.1). Total num frames: 3712000. Throughput: 0: 12841.1. Samples: 3681537. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:03:27,390][62145] Avg episode reward: [(0, '604.352')] [2023-03-06 21:03:27,773][62475] Updated weights for policy 0, policy_version 3630 (0.0006) [2023-03-06 21:03:28,581][62475] Updated weights for policy 0, policy_version 3640 (0.0006) [2023-03-06 21:03:29,355][62475] Updated weights for policy 0, policy_version 3650 (0.0006) [2023-03-06 21:03:30,158][62475] Updated weights for policy 0, policy_version 3660 (0.0006) [2023-03-06 21:03:30,983][62475] Updated weights for policy 0, policy_version 3670 (0.0006) [2023-03-06 21:03:31,781][62475] Updated weights for policy 0, policy_version 3680 (0.0006) [2023-03-06 21:03:32,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12798.3). Total num frames: 3775488. Throughput: 0: 12841.8. Samples: 3758546. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:03:32,390][62145] Avg episode reward: [(0, '609.329')] [2023-03-06 21:03:32,566][62475] Updated weights for policy 0, policy_version 3690 (0.0006) [2023-03-06 21:03:33,383][62475] Updated weights for policy 0, policy_version 3700 (0.0006) [2023-03-06 21:03:34,169][62475] Updated weights for policy 0, policy_version 3710 (0.0006) [2023-03-06 21:03:34,968][62475] Updated weights for policy 0, policy_version 3720 (0.0006) [2023-03-06 21:03:35,796][62475] Updated weights for policy 0, policy_version 3730 (0.0007) [2023-03-06 21:03:36,577][62475] Updated weights for policy 0, policy_version 3740 (0.0006) [2023-03-06 21:03:37,359][62475] Updated weights for policy 0, policy_version 3750 (0.0006) [2023-03-06 21:03:37,390][62145] Fps is (10 sec: 12799.8, 60 sec: 12834.1, 300 sec: 12846.9). Total num frames: 3840000. Throughput: 0: 12828.6. Samples: 3835245. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:03:37,390][62145] Avg episode reward: [(0, '618.009')] [2023-03-06 21:03:38,166][62475] Updated weights for policy 0, policy_version 3760 (0.0006) [2023-03-06 21:03:38,965][62475] Updated weights for policy 0, policy_version 3770 (0.0006) [2023-03-06 21:03:39,766][62475] Updated weights for policy 0, policy_version 3780 (0.0006) [2023-03-06 21:03:40,554][62475] Updated weights for policy 0, policy_version 3790 (0.0006) [2023-03-06 21:03:41,354][62475] Updated weights for policy 0, policy_version 3800 (0.0007) [2023-03-06 21:03:42,152][62475] Updated weights for policy 0, policy_version 3810 (0.0006) [2023-03-06 21:03:42,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12839.9). Total num frames: 3903488. Throughput: 0: 12828.0. Samples: 3873825. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:03:42,390][62145] Avg episode reward: [(0, '613.756')] [2023-03-06 21:03:42,964][62475] Updated weights for policy 0, policy_version 3820 (0.0007) [2023-03-06 21:03:43,747][62475] Updated weights for policy 0, policy_version 3830 (0.0006) [2023-03-06 21:03:44,555][62475] Updated weights for policy 0, policy_version 3840 (0.0006) [2023-03-06 21:03:45,340][62475] Updated weights for policy 0, policy_version 3850 (0.0006) [2023-03-06 21:03:46,157][62475] Updated weights for policy 0, policy_version 3860 (0.0006) [2023-03-06 21:03:46,960][62475] Updated weights for policy 0, policy_version 3870 (0.0007) [2023-03-06 21:03:47,389][62145] Fps is (10 sec: 12800.2, 60 sec: 12834.1, 300 sec: 12843.4). Total num frames: 3968000. Throughput: 0: 12818.6. Samples: 3950764. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:03:47,390][62145] Avg episode reward: [(0, '579.917')] [2023-03-06 21:03:47,759][62475] Updated weights for policy 0, policy_version 3880 (0.0007) [2023-03-06 21:03:48,569][62475] Updated weights for policy 0, policy_version 3890 (0.0006) [2023-03-06 21:03:49,350][62475] Updated weights for policy 0, policy_version 3900 (0.0006) [2023-03-06 21:03:50,149][62475] Updated weights for policy 0, policy_version 3910 (0.0006) [2023-03-06 21:03:50,942][62475] Updated weights for policy 0, policy_version 3920 (0.0006) [2023-03-06 21:03:51,730][62475] Updated weights for policy 0, policy_version 3930 (0.0007) [2023-03-06 21:03:52,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12839.9). Total num frames: 4031488. Throughput: 0: 12822.8. Samples: 4027862. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:03:52,390][62145] Avg episode reward: [(0, '580.287')] [2023-03-06 21:03:52,551][62475] Updated weights for policy 0, policy_version 3940 (0.0006) [2023-03-06 21:03:53,350][62475] Updated weights for policy 0, policy_version 3950 (0.0007) [2023-03-06 21:03:54,142][62475] Updated weights for policy 0, policy_version 3960 (0.0007) [2023-03-06 21:03:54,958][62475] Updated weights for policy 0, policy_version 3970 (0.0006) [2023-03-06 21:03:55,741][62475] Updated weights for policy 0, policy_version 3980 (0.0007) [2023-03-06 21:03:56,537][62475] Updated weights for policy 0, policy_version 3990 (0.0007) [2023-03-06 21:03:57,345][62475] Updated weights for policy 0, policy_version 4000 (0.0007) [2023-03-06 21:03:57,390][62145] Fps is (10 sec: 12799.8, 60 sec: 12817.0, 300 sec: 12843.4). Total num frames: 4096000. Throughput: 0: 12813.9. Samples: 4066012. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:03:57,390][62145] Avg episode reward: [(0, '583.874')] [2023-03-06 21:03:58,152][62475] Updated weights for policy 0, policy_version 4010 (0.0006) [2023-03-06 21:03:58,955][62475] Updated weights for policy 0, policy_version 4020 (0.0006) [2023-03-06 21:03:59,749][62475] Updated weights for policy 0, policy_version 4030 (0.0007) [2023-03-06 21:04:00,547][62475] Updated weights for policy 0, policy_version 4040 (0.0007) [2023-03-06 21:04:01,361][62475] Updated weights for policy 0, policy_version 4050 (0.0006) [2023-03-06 21:04:02,142][62475] Updated weights for policy 0, policy_version 4060 (0.0006) [2023-03-06 21:04:02,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12839.9). Total num frames: 4159488. Throughput: 0: 12805.7. Samples: 4142814. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:04:02,390][62145] Avg episode reward: [(0, '604.983')] [2023-03-06 21:04:02,950][62475] Updated weights for policy 0, policy_version 4070 (0.0006) [2023-03-06 21:04:03,746][62475] Updated weights for policy 0, policy_version 4080 (0.0006) [2023-03-06 21:04:04,534][62475] Updated weights for policy 0, policy_version 4090 (0.0006) [2023-03-06 21:04:05,345][62475] Updated weights for policy 0, policy_version 4100 (0.0007) [2023-03-06 21:04:06,121][62475] Updated weights for policy 0, policy_version 4110 (0.0007) [2023-03-06 21:04:06,919][62475] Updated weights for policy 0, policy_version 4120 (0.0007) [2023-03-06 21:04:07,389][62145] Fps is (10 sec: 12902.6, 60 sec: 12834.1, 300 sec: 12843.4). Total num frames: 4225024. Throughput: 0: 12811.5. Samples: 4220138. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:04:07,390][62145] Avg episode reward: [(0, '588.648')] [2023-03-06 21:04:07,702][62475] Updated weights for policy 0, policy_version 4130 (0.0006) [2023-03-06 21:04:08,533][62475] Updated weights for policy 0, policy_version 4140 (0.0006) [2023-03-06 21:04:09,316][62475] Updated weights for policy 0, policy_version 4150 (0.0006) [2023-03-06 21:04:10,121][62475] Updated weights for policy 0, policy_version 4160 (0.0007) [2023-03-06 21:04:10,925][62475] Updated weights for policy 0, policy_version 4170 (0.0006) [2023-03-06 21:04:11,708][62475] Updated weights for policy 0, policy_version 4180 (0.0006) [2023-03-06 21:04:12,390][62145] Fps is (10 sec: 12902.3, 60 sec: 12817.1, 300 sec: 12839.9). Total num frames: 4288512. Throughput: 0: 12818.2. Samples: 4258359. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:04:12,390][62145] Avg episode reward: [(0, '595.917')] [2023-03-06 21:04:12,516][62475] Updated weights for policy 0, policy_version 4190 (0.0007) [2023-03-06 21:04:13,310][62475] Updated weights for policy 0, policy_version 4200 (0.0006) [2023-03-06 21:04:14,090][62475] Updated weights for policy 0, policy_version 4210 (0.0006) [2023-03-06 21:04:14,900][62475] Updated weights for policy 0, policy_version 4220 (0.0006) [2023-03-06 21:04:15,710][62475] Updated weights for policy 0, policy_version 4230 (0.0006) [2023-03-06 21:04:16,511][62475] Updated weights for policy 0, policy_version 4240 (0.0006) [2023-03-06 21:04:17,313][62475] Updated weights for policy 0, policy_version 4250 (0.0007) [2023-03-06 21:04:17,390][62145] Fps is (10 sec: 12799.8, 60 sec: 12817.0, 300 sec: 12839.9). Total num frames: 4353024. Throughput: 0: 12816.8. Samples: 4335302. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:04:17,390][62145] Avg episode reward: [(0, '587.981')] [2023-03-06 21:04:18,113][62475] Updated weights for policy 0, policy_version 4260 (0.0006) [2023-03-06 21:04:18,906][62475] Updated weights for policy 0, policy_version 4270 (0.0006) [2023-03-06 21:04:19,710][62475] Updated weights for policy 0, policy_version 4280 (0.0006) [2023-03-06 21:04:20,524][62475] Updated weights for policy 0, policy_version 4290 (0.0006) [2023-03-06 21:04:21,305][62475] Updated weights for policy 0, policy_version 4300 (0.0006) [2023-03-06 21:04:22,092][62475] Updated weights for policy 0, policy_version 4310 (0.0007) [2023-03-06 21:04:22,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12817.0, 300 sec: 12836.4). Total num frames: 4416512. Throughput: 0: 12823.7. Samples: 4412312. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:04:22,390][62145] Avg episode reward: [(0, '593.292')] [2023-03-06 21:04:22,395][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000004313_4416512.pth... [2023-03-06 21:04:22,425][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000001305_1336320.pth [2023-03-06 21:04:22,910][62475] Updated weights for policy 0, policy_version 4320 (0.0007) [2023-03-06 21:04:23,693][62475] Updated weights for policy 0, policy_version 4330 (0.0006) [2023-03-06 21:04:24,496][62475] Updated weights for policy 0, policy_version 4340 (0.0006) [2023-03-06 21:04:25,332][62475] Updated weights for policy 0, policy_version 4350 (0.0007) [2023-03-06 21:04:26,111][62475] Updated weights for policy 0, policy_version 4360 (0.0006) [2023-03-06 21:04:26,902][62475] Updated weights for policy 0, policy_version 4370 (0.0006) [2023-03-06 21:04:27,390][62145] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12836.4). Total num frames: 4481024. Throughput: 0: 12815.1. Samples: 4450504. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:04:27,390][62145] Avg episode reward: [(0, '595.848')] [2023-03-06 21:04:27,703][62475] Updated weights for policy 0, policy_version 4380 (0.0006) [2023-03-06 21:04:28,519][62475] Updated weights for policy 0, policy_version 4390 (0.0006) [2023-03-06 21:04:29,296][62475] Updated weights for policy 0, policy_version 4400 (0.0007) [2023-03-06 21:04:30,102][62475] Updated weights for policy 0, policy_version 4410 (0.0006) [2023-03-06 21:04:30,896][62475] Updated weights for policy 0, policy_version 4420 (0.0006) [2023-03-06 21:04:31,704][62475] Updated weights for policy 0, policy_version 4430 (0.0007) [2023-03-06 21:04:32,390][62145] Fps is (10 sec: 12800.1, 60 sec: 12817.0, 300 sec: 12833.0). Total num frames: 4544512. Throughput: 0: 12815.1. Samples: 4527447. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:04:32,390][62145] Avg episode reward: [(0, '599.393')] [2023-03-06 21:04:32,490][62475] Updated weights for policy 0, policy_version 4440 (0.0006) [2023-03-06 21:04:33,308][62475] Updated weights for policy 0, policy_version 4450 (0.0006) [2023-03-06 21:04:34,093][62475] Updated weights for policy 0, policy_version 4460 (0.0006) [2023-03-06 21:04:34,888][62475] Updated weights for policy 0, policy_version 4470 (0.0007) [2023-03-06 21:04:35,703][62475] Updated weights for policy 0, policy_version 4480 (0.0006) [2023-03-06 21:04:36,480][62475] Updated weights for policy 0, policy_version 4490 (0.0006) [2023-03-06 21:04:37,296][62475] Updated weights for policy 0, policy_version 4500 (0.0007) [2023-03-06 21:04:37,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12836.4). Total num frames: 4609024. Throughput: 0: 12817.3. Samples: 4604639. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:04:37,390][62145] Avg episode reward: [(0, '610.562')] [2023-03-06 21:04:38,094][62475] Updated weights for policy 0, policy_version 4510 (0.0008) [2023-03-06 21:04:38,897][62475] Updated weights for policy 0, policy_version 4520 (0.0006) [2023-03-06 21:04:39,693][62475] Updated weights for policy 0, policy_version 4530 (0.0006) [2023-03-06 21:04:40,506][62475] Updated weights for policy 0, policy_version 4540 (0.0007) [2023-03-06 21:04:41,302][62475] Updated weights for policy 0, policy_version 4550 (0.0006) [2023-03-06 21:04:42,107][62475] Updated weights for policy 0, policy_version 4560 (0.0007) [2023-03-06 21:04:42,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12836.4). Total num frames: 4672512. Throughput: 0: 12815.0. Samples: 4642684. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:04:42,390][62145] Avg episode reward: [(0, '611.330')] [2023-03-06 21:04:42,914][62475] Updated weights for policy 0, policy_version 4570 (0.0006) [2023-03-06 21:04:43,705][62475] Updated weights for policy 0, policy_version 4580 (0.0008) [2023-03-06 21:04:44,519][62475] Updated weights for policy 0, policy_version 4590 (0.0006) [2023-03-06 21:04:45,310][62475] Updated weights for policy 0, policy_version 4600 (0.0006) [2023-03-06 21:04:46,115][62475] Updated weights for policy 0, policy_version 4610 (0.0007) [2023-03-06 21:04:46,912][62475] Updated weights for policy 0, policy_version 4620 (0.0006) [2023-03-06 21:04:47,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12800.0, 300 sec: 12833.0). Total num frames: 4736000. Throughput: 0: 12808.5. Samples: 4719199. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:04:47,390][62145] Avg episode reward: [(0, '584.762')] [2023-03-06 21:04:47,727][62475] Updated weights for policy 0, policy_version 4630 (0.0006) [2023-03-06 21:04:48,513][62475] Updated weights for policy 0, policy_version 4640 (0.0006) [2023-03-06 21:04:49,301][62475] Updated weights for policy 0, policy_version 4650 (0.0006) [2023-03-06 21:04:50,119][62475] Updated weights for policy 0, policy_version 4660 (0.0006) [2023-03-06 21:04:50,912][62475] Updated weights for policy 0, policy_version 4670 (0.0006) [2023-03-06 21:04:51,721][62475] Updated weights for policy 0, policy_version 4680 (0.0006) [2023-03-06 21:04:52,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12817.0, 300 sec: 12833.0). Total num frames: 4800512. Throughput: 0: 12796.4. Samples: 4795980. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:04:52,390][62145] Avg episode reward: [(0, '610.938')] [2023-03-06 21:04:52,526][62475] Updated weights for policy 0, policy_version 4690 (0.0007) [2023-03-06 21:04:53,341][62475] Updated weights for policy 0, policy_version 4700 (0.0006) [2023-03-06 21:04:54,126][62475] Updated weights for policy 0, policy_version 4710 (0.0007) [2023-03-06 21:04:54,937][62475] Updated weights for policy 0, policy_version 4720 (0.0007) [2023-03-06 21:04:55,723][62475] Updated weights for policy 0, policy_version 4730 (0.0006) [2023-03-06 21:04:56,529][62475] Updated weights for policy 0, policy_version 4740 (0.0006) [2023-03-06 21:04:57,347][62475] Updated weights for policy 0, policy_version 4750 (0.0007) [2023-03-06 21:04:57,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12829.5). Total num frames: 4864000. Throughput: 0: 12796.3. Samples: 4834191. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:04:57,400][62145] Avg episode reward: [(0, '593.514')] [2023-03-06 21:04:58,157][62475] Updated weights for policy 0, policy_version 4760 (0.0007) [2023-03-06 21:04:58,961][62475] Updated weights for policy 0, policy_version 4770 (0.0006) [2023-03-06 21:04:59,760][62475] Updated weights for policy 0, policy_version 4780 (0.0006) [2023-03-06 21:05:00,570][62475] Updated weights for policy 0, policy_version 4790 (0.0006) [2023-03-06 21:05:01,354][62475] Updated weights for policy 0, policy_version 4800 (0.0006) [2023-03-06 21:05:02,173][62475] Updated weights for policy 0, policy_version 4810 (0.0006) [2023-03-06 21:05:02,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12800.0, 300 sec: 12829.5). Total num frames: 4927488. Throughput: 0: 12786.3. Samples: 4910685. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 21:05:02,390][62145] Avg episode reward: [(0, '613.995')] [2023-03-06 21:05:02,973][62475] Updated weights for policy 0, policy_version 4820 (0.0007) [2023-03-06 21:05:03,785][62475] Updated weights for policy 0, policy_version 4830 (0.0006) [2023-03-06 21:05:04,562][62475] Updated weights for policy 0, policy_version 4840 (0.0006) [2023-03-06 21:05:05,386][62475] Updated weights for policy 0, policy_version 4850 (0.0006) [2023-03-06 21:05:06,187][62475] Updated weights for policy 0, policy_version 4860 (0.0006) [2023-03-06 21:05:06,997][62475] Updated weights for policy 0, policy_version 4870 (0.0006) [2023-03-06 21:05:07,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12765.8, 300 sec: 12822.6). Total num frames: 4990976. Throughput: 0: 12771.9. Samples: 4987044. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 21:05:07,390][62145] Avg episode reward: [(0, '625.070')] [2023-03-06 21:05:07,402][62424] Saving new best policy, reward=625.070! [2023-03-06 21:05:07,794][62475] Updated weights for policy 0, policy_version 4880 (0.0006) [2023-03-06 21:05:08,597][62475] Updated weights for policy 0, policy_version 4890 (0.0006) [2023-03-06 21:05:09,399][62475] Updated weights for policy 0, policy_version 4900 (0.0007) [2023-03-06 21:05:10,198][62475] Updated weights for policy 0, policy_version 4910 (0.0006) [2023-03-06 21:05:10,979][62475] Updated weights for policy 0, policy_version 4920 (0.0007) [2023-03-06 21:05:11,785][62475] Updated weights for policy 0, policy_version 4930 (0.0009) [2023-03-06 21:05:12,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12782.9, 300 sec: 12826.0). Total num frames: 5055488. Throughput: 0: 12778.5. Samples: 5025538. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:05:12,390][62145] Avg episode reward: [(0, '612.128')] [2023-03-06 21:05:12,562][62475] Updated weights for policy 0, policy_version 4940 (0.0006) [2023-03-06 21:05:13,365][62475] Updated weights for policy 0, policy_version 4950 (0.0006) [2023-03-06 21:05:14,168][62475] Updated weights for policy 0, policy_version 4960 (0.0007) [2023-03-06 21:05:14,974][62475] Updated weights for policy 0, policy_version 4970 (0.0006) [2023-03-06 21:05:15,795][62475] Updated weights for policy 0, policy_version 4980 (0.0007) [2023-03-06 21:05:16,572][62475] Updated weights for policy 0, policy_version 4990 (0.0007) [2023-03-06 21:05:17,373][62475] Updated weights for policy 0, policy_version 5000 (0.0006) [2023-03-06 21:05:17,390][62145] Fps is (10 sec: 12902.4, 60 sec: 12782.9, 300 sec: 12826.0). Total num frames: 5120000. Throughput: 0: 12780.7. Samples: 5102580. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:05:17,390][62145] Avg episode reward: [(0, '587.600')] [2023-03-06 21:05:18,181][62475] Updated weights for policy 0, policy_version 5010 (0.0007) [2023-03-06 21:05:18,976][62475] Updated weights for policy 0, policy_version 5020 (0.0006) [2023-03-06 21:05:19,782][62475] Updated weights for policy 0, policy_version 5030 (0.0006) [2023-03-06 21:05:20,581][62475] Updated weights for policy 0, policy_version 5040 (0.0007) [2023-03-06 21:05:21,390][62475] Updated weights for policy 0, policy_version 5050 (0.0006) [2023-03-06 21:05:22,202][62475] Updated weights for policy 0, policy_version 5060 (0.0006) [2023-03-06 21:05:22,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12783.0, 300 sec: 12822.6). Total num frames: 5183488. Throughput: 0: 12765.5. Samples: 5179087. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:05:22,401][62145] Avg episode reward: [(0, '565.074')] [2023-03-06 21:05:23,006][62475] Updated weights for policy 0, policy_version 5070 (0.0006) [2023-03-06 21:05:23,817][62475] Updated weights for policy 0, policy_version 5080 (0.0006) [2023-03-06 21:05:24,614][62475] Updated weights for policy 0, policy_version 5090 (0.0006) [2023-03-06 21:05:25,414][62475] Updated weights for policy 0, policy_version 5100 (0.0006) [2023-03-06 21:05:26,215][62475] Updated weights for policy 0, policy_version 5110 (0.0006) [2023-03-06 21:05:27,015][62475] Updated weights for policy 0, policy_version 5120 (0.0006) [2023-03-06 21:05:27,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12765.9, 300 sec: 12822.6). Total num frames: 5246976. Throughput: 0: 12767.9. Samples: 5217238. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:05:27,400][62145] Avg episode reward: [(0, '643.636')] [2023-03-06 21:05:27,401][62424] Saving new best policy, reward=643.636! [2023-03-06 21:05:27,837][62475] Updated weights for policy 0, policy_version 5130 (0.0007) [2023-03-06 21:05:28,611][62475] Updated weights for policy 0, policy_version 5140 (0.0006) [2023-03-06 21:05:29,442][62475] Updated weights for policy 0, policy_version 5150 (0.0006) [2023-03-06 21:05:30,251][62475] Updated weights for policy 0, policy_version 5160 (0.0006) [2023-03-06 21:05:31,057][62475] Updated weights for policy 0, policy_version 5170 (0.0006) [2023-03-06 21:05:31,882][62475] Updated weights for policy 0, policy_version 5180 (0.0007) [2023-03-06 21:05:32,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12765.9, 300 sec: 12819.1). Total num frames: 5310464. Throughput: 0: 12761.7. Samples: 5293474. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:05:32,401][62145] Avg episode reward: [(0, '569.126')] [2023-03-06 21:05:32,682][62475] Updated weights for policy 0, policy_version 5190 (0.0006) [2023-03-06 21:05:33,480][62475] Updated weights for policy 0, policy_version 5200 (0.0007) [2023-03-06 21:05:34,287][62475] Updated weights for policy 0, policy_version 5210 (0.0006) [2023-03-06 21:05:35,092][62475] Updated weights for policy 0, policy_version 5220 (0.0007) [2023-03-06 21:05:35,873][62475] Updated weights for policy 0, policy_version 5230 (0.0006) [2023-03-06 21:05:36,683][62475] Updated weights for policy 0, policy_version 5240 (0.0006) [2023-03-06 21:05:37,390][62145] Fps is (10 sec: 12697.4, 60 sec: 12748.8, 300 sec: 12815.6). Total num frames: 5373952. Throughput: 0: 12754.6. Samples: 5369937. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:05:37,401][62145] Avg episode reward: [(0, '537.003')] [2023-03-06 21:05:37,496][62475] Updated weights for policy 0, policy_version 5250 (0.0007) [2023-03-06 21:05:38,296][62475] Updated weights for policy 0, policy_version 5260 (0.0007) [2023-03-06 21:05:39,089][62475] Updated weights for policy 0, policy_version 5270 (0.0006) [2023-03-06 21:05:39,897][62475] Updated weights for policy 0, policy_version 5280 (0.0006) [2023-03-06 21:05:40,709][62475] Updated weights for policy 0, policy_version 5290 (0.0006) [2023-03-06 21:05:41,510][62475] Updated weights for policy 0, policy_version 5300 (0.0006) [2023-03-06 21:05:42,322][62475] Updated weights for policy 0, policy_version 5310 (0.0006) [2023-03-06 21:05:42,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12748.8, 300 sec: 12815.6). Total num frames: 5437440. Throughput: 0: 12750.6. Samples: 5407970. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:05:42,401][62145] Avg episode reward: [(0, '555.496')] [2023-03-06 21:05:43,130][62475] Updated weights for policy 0, policy_version 5320 (0.0007) [2023-03-06 21:05:43,943][62475] Updated weights for policy 0, policy_version 5330 (0.0006) [2023-03-06 21:05:44,749][62475] Updated weights for policy 0, policy_version 5340 (0.0007) [2023-03-06 21:05:45,550][62475] Updated weights for policy 0, policy_version 5350 (0.0006) [2023-03-06 21:05:46,346][62475] Updated weights for policy 0, policy_version 5360 (0.0006) [2023-03-06 21:05:47,156][62475] Updated weights for policy 0, policy_version 5370 (0.0007) [2023-03-06 21:05:47,389][62145] Fps is (10 sec: 12697.8, 60 sec: 12748.8, 300 sec: 12812.2). Total num frames: 5500928. Throughput: 0: 12746.4. Samples: 5484275. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:05:47,400][62145] Avg episode reward: [(0, '596.853')] [2023-03-06 21:05:47,958][62475] Updated weights for policy 0, policy_version 5380 (0.0008) [2023-03-06 21:05:48,772][62475] Updated weights for policy 0, policy_version 5390 (0.0007) [2023-03-06 21:05:49,557][62475] Updated weights for policy 0, policy_version 5400 (0.0006) [2023-03-06 21:05:50,371][62475] Updated weights for policy 0, policy_version 5410 (0.0006) [2023-03-06 21:05:51,189][62475] Updated weights for policy 0, policy_version 5420 (0.0007) [2023-03-06 21:05:51,974][62475] Updated weights for policy 0, policy_version 5430 (0.0007) [2023-03-06 21:05:52,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12812.1). Total num frames: 5565440. Throughput: 0: 12748.9. Samples: 5560744. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:05:52,390][62145] Avg episode reward: [(0, '572.053')] [2023-03-06 21:05:52,778][62475] Updated weights for policy 0, policy_version 5440 (0.0007) [2023-03-06 21:05:53,561][62475] Updated weights for policy 0, policy_version 5450 (0.0006) [2023-03-06 21:05:54,387][62475] Updated weights for policy 0, policy_version 5460 (0.0006) [2023-03-06 21:05:55,186][62475] Updated weights for policy 0, policy_version 5470 (0.0006) [2023-03-06 21:05:55,984][62475] Updated weights for policy 0, policy_version 5480 (0.0006) [2023-03-06 21:05:56,799][62475] Updated weights for policy 0, policy_version 5490 (0.0007) [2023-03-06 21:05:57,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12748.8, 300 sec: 12812.1). Total num frames: 5628928. Throughput: 0: 12744.0. Samples: 5599017. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:05:57,390][62145] Avg episode reward: [(0, '552.679')] [2023-03-06 21:05:57,594][62475] Updated weights for policy 0, policy_version 5500 (0.0007) [2023-03-06 21:05:58,408][62475] Updated weights for policy 0, policy_version 5510 (0.0007) [2023-03-06 21:05:59,181][62475] Updated weights for policy 0, policy_version 5520 (0.0006) [2023-03-06 21:06:00,011][62475] Updated weights for policy 0, policy_version 5530 (0.0006) [2023-03-06 21:06:00,815][62475] Updated weights for policy 0, policy_version 5540 (0.0006) [2023-03-06 21:06:01,616][62475] Updated weights for policy 0, policy_version 5550 (0.0007) [2023-03-06 21:06:02,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12748.8, 300 sec: 12808.7). Total num frames: 5692416. Throughput: 0: 12732.6. Samples: 5675545. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:06:02,390][62145] Avg episode reward: [(0, '555.556')] [2023-03-06 21:06:02,432][62475] Updated weights for policy 0, policy_version 5560 (0.0006) [2023-03-06 21:06:03,204][62475] Updated weights for policy 0, policy_version 5570 (0.0007) [2023-03-06 21:06:04,025][62475] Updated weights for policy 0, policy_version 5580 (0.0006) [2023-03-06 21:06:04,828][62475] Updated weights for policy 0, policy_version 5590 (0.0006) [2023-03-06 21:06:05,607][62475] Updated weights for policy 0, policy_version 5600 (0.0006) [2023-03-06 21:06:06,397][62475] Updated weights for policy 0, policy_version 5610 (0.0006) [2023-03-06 21:06:07,210][62475] Updated weights for policy 0, policy_version 5620 (0.0006) [2023-03-06 21:06:07,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12765.9, 300 sec: 12808.7). Total num frames: 5756928. Throughput: 0: 12742.3. Samples: 5752489. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:06:07,390][62145] Avg episode reward: [(0, '597.507')] [2023-03-06 21:06:08,016][62475] Updated weights for policy 0, policy_version 5630 (0.0006) [2023-03-06 21:06:08,819][62475] Updated weights for policy 0, policy_version 5640 (0.0005) [2023-03-06 21:06:09,623][62475] Updated weights for policy 0, policy_version 5650 (0.0006) [2023-03-06 21:06:10,434][62475] Updated weights for policy 0, policy_version 5660 (0.0006) [2023-03-06 21:06:11,214][62475] Updated weights for policy 0, policy_version 5670 (0.0008) [2023-03-06 21:06:12,021][62475] Updated weights for policy 0, policy_version 5680 (0.0006) [2023-03-06 21:06:12,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12808.7). Total num frames: 5820416. Throughput: 0: 12740.3. Samples: 5790553. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 21:06:12,390][62145] Avg episode reward: [(0, '571.089')] [2023-03-06 21:06:12,820][62475] Updated weights for policy 0, policy_version 5690 (0.0005) [2023-03-06 21:06:13,611][62475] Updated weights for policy 0, policy_version 5700 (0.0006) [2023-03-06 21:06:14,426][62475] Updated weights for policy 0, policy_version 5710 (0.0006) [2023-03-06 21:06:15,229][62475] Updated weights for policy 0, policy_version 5720 (0.0006) [2023-03-06 21:06:16,025][62475] Updated weights for policy 0, policy_version 5730 (0.0006) [2023-03-06 21:06:16,828][62475] Updated weights for policy 0, policy_version 5740 (0.0006) [2023-03-06 21:06:17,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12805.2). Total num frames: 5883904. Throughput: 0: 12751.3. Samples: 5867283. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:06:17,390][62145] Avg episode reward: [(0, '600.288')] [2023-03-06 21:06:17,655][62475] Updated weights for policy 0, policy_version 5750 (0.0006) [2023-03-06 21:06:18,443][62475] Updated weights for policy 0, policy_version 5760 (0.0006) [2023-03-06 21:06:19,244][62475] Updated weights for policy 0, policy_version 5770 (0.0006) [2023-03-06 21:06:20,050][62475] Updated weights for policy 0, policy_version 5780 (0.0006) [2023-03-06 21:06:20,865][62475] Updated weights for policy 0, policy_version 5790 (0.0006) [2023-03-06 21:06:21,647][62475] Updated weights for policy 0, policy_version 5800 (0.0006) [2023-03-06 21:06:22,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12748.8, 300 sec: 12805.2). Total num frames: 5948416. Throughput: 0: 12751.1. Samples: 5943736. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:06:22,390][62145] Avg episode reward: [(0, '525.276')] [2023-03-06 21:06:22,393][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000005809_5948416.pth... [2023-03-06 21:06:22,423][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000002810_2877440.pth [2023-03-06 21:06:22,460][62475] Updated weights for policy 0, policy_version 5810 (0.0006) [2023-03-06 21:06:23,273][62475] Updated weights for policy 0, policy_version 5820 (0.0006) [2023-03-06 21:06:24,051][62475] Updated weights for policy 0, policy_version 5830 (0.0006) [2023-03-06 21:06:24,866][62475] Updated weights for policy 0, policy_version 5840 (0.0006) [2023-03-06 21:06:25,669][62475] Updated weights for policy 0, policy_version 5850 (0.0006) [2023-03-06 21:06:26,464][62475] Updated weights for policy 0, policy_version 5860 (0.0007) [2023-03-06 21:06:27,257][62475] Updated weights for policy 0, policy_version 5870 (0.0006) [2023-03-06 21:06:27,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12748.8, 300 sec: 12805.2). Total num frames: 6011904. Throughput: 0: 12756.4. Samples: 5982009. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-06 21:06:27,390][62145] Avg episode reward: [(0, '595.782')] [2023-03-06 21:06:28,063][62475] Updated weights for policy 0, policy_version 5880 (0.0006) [2023-03-06 21:06:28,858][62475] Updated weights for policy 0, policy_version 5890 (0.0006) [2023-03-06 21:06:29,644][62475] Updated weights for policy 0, policy_version 5900 (0.0006) [2023-03-06 21:06:30,456][62475] Updated weights for policy 0, policy_version 5910 (0.0006) [2023-03-06 21:06:31,250][62475] Updated weights for policy 0, policy_version 5920 (0.0006) [2023-03-06 21:06:32,031][62475] Updated weights for policy 0, policy_version 5930 (0.0006) [2023-03-06 21:06:32,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12765.9, 300 sec: 12805.2). Total num frames: 6076416. Throughput: 0: 12768.9. Samples: 6058878. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:06:32,390][62145] Avg episode reward: [(0, '665.746')] [2023-03-06 21:06:32,393][62424] Saving new best policy, reward=665.746! [2023-03-06 21:06:32,849][62475] Updated weights for policy 0, policy_version 5940 (0.0006) [2023-03-06 21:06:33,661][62475] Updated weights for policy 0, policy_version 5950 (0.0006) [2023-03-06 21:06:34,461][62475] Updated weights for policy 0, policy_version 5960 (0.0006) [2023-03-06 21:06:35,279][62475] Updated weights for policy 0, policy_version 5970 (0.0006) [2023-03-06 21:06:36,077][62475] Updated weights for policy 0, policy_version 5980 (0.0006) [2023-03-06 21:06:36,885][62475] Updated weights for policy 0, policy_version 5990 (0.0006) [2023-03-06 21:06:37,390][62145] Fps is (10 sec: 12799.8, 60 sec: 12765.9, 300 sec: 12801.7). Total num frames: 6139904. Throughput: 0: 12769.9. Samples: 6135391. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:06:37,390][62145] Avg episode reward: [(0, '635.594')] [2023-03-06 21:06:37,693][62475] Updated weights for policy 0, policy_version 6000 (0.0006) [2023-03-06 21:06:38,476][62475] Updated weights for policy 0, policy_version 6010 (0.0006) [2023-03-06 21:06:39,302][62475] Updated weights for policy 0, policy_version 6020 (0.0006) [2023-03-06 21:06:40,118][62475] Updated weights for policy 0, policy_version 6030 (0.0006) [2023-03-06 21:06:40,917][62475] Updated weights for policy 0, policy_version 6040 (0.0006) [2023-03-06 21:06:41,726][62475] Updated weights for policy 0, policy_version 6050 (0.0007) [2023-03-06 21:06:42,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12765.9, 300 sec: 12801.7). Total num frames: 6203392. Throughput: 0: 12766.5. Samples: 6173511. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:06:42,390][62145] Avg episode reward: [(0, '669.265')] [2023-03-06 21:06:42,393][62424] Saving new best policy, reward=669.265! [2023-03-06 21:06:42,545][62475] Updated weights for policy 0, policy_version 6060 (0.0006) [2023-03-06 21:06:43,347][62475] Updated weights for policy 0, policy_version 6070 (0.0006) [2023-03-06 21:06:44,149][62475] Updated weights for policy 0, policy_version 6080 (0.0006) [2023-03-06 21:06:44,958][62475] Updated weights for policy 0, policy_version 6090 (0.0006) [2023-03-06 21:06:45,745][62475] Updated weights for policy 0, policy_version 6100 (0.0006) [2023-03-06 21:06:46,558][62475] Updated weights for policy 0, policy_version 6110 (0.0006) [2023-03-06 21:06:47,357][62475] Updated weights for policy 0, policy_version 6120 (0.0006) [2023-03-06 21:06:47,389][62145] Fps is (10 sec: 12697.8, 60 sec: 12765.9, 300 sec: 12798.3). Total num frames: 6266880. Throughput: 0: 12757.8. Samples: 6249643. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:06:47,390][62145] Avg episode reward: [(0, '702.232')] [2023-03-06 21:06:47,390][62424] Saving new best policy, reward=702.232! [2023-03-06 21:06:48,164][62475] Updated weights for policy 0, policy_version 6130 (0.0007) [2023-03-06 21:06:48,995][62475] Updated weights for policy 0, policy_version 6140 (0.0006) [2023-03-06 21:06:49,789][62475] Updated weights for policy 0, policy_version 6150 (0.0006) [2023-03-06 21:06:50,592][62475] Updated weights for policy 0, policy_version 6160 (0.0006) [2023-03-06 21:06:51,381][62475] Updated weights for policy 0, policy_version 6170 (0.0006) [2023-03-06 21:06:52,180][62475] Updated weights for policy 0, policy_version 6180 (0.0006) [2023-03-06 21:06:52,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12748.8, 300 sec: 12794.8). Total num frames: 6330368. Throughput: 0: 12746.1. Samples: 6326062. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:06:52,390][62145] Avg episode reward: [(0, '708.702')] [2023-03-06 21:06:52,394][62424] Saving new best policy, reward=708.702! [2023-03-06 21:06:53,015][62475] Updated weights for policy 0, policy_version 6190 (0.0006) [2023-03-06 21:06:53,828][62475] Updated weights for policy 0, policy_version 6200 (0.0007) [2023-03-06 21:06:54,634][62475] Updated weights for policy 0, policy_version 6210 (0.0007) [2023-03-06 21:06:55,447][62475] Updated weights for policy 0, policy_version 6220 (0.0006) [2023-03-06 21:06:56,255][62475] Updated weights for policy 0, policy_version 6230 (0.0006) [2023-03-06 21:06:57,055][62475] Updated weights for policy 0, policy_version 6240 (0.0006) [2023-03-06 21:06:57,390][62145] Fps is (10 sec: 12697.4, 60 sec: 12748.8, 300 sec: 12791.3). Total num frames: 6393856. Throughput: 0: 12740.0. Samples: 6363851. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:06:57,390][62145] Avg episode reward: [(0, '709.255')] [2023-03-06 21:06:57,390][62424] Saving new best policy, reward=709.255! [2023-03-06 21:06:57,872][62475] Updated weights for policy 0, policy_version 6250 (0.0006) [2023-03-06 21:06:58,675][62475] Updated weights for policy 0, policy_version 6260 (0.0006) [2023-03-06 21:06:59,478][62475] Updated weights for policy 0, policy_version 6270 (0.0006) [2023-03-06 21:07:00,301][62475] Updated weights for policy 0, policy_version 6280 (0.0006) [2023-03-06 21:07:01,103][62475] Updated weights for policy 0, policy_version 6290 (0.0006) [2023-03-06 21:07:01,901][62475] Updated weights for policy 0, policy_version 6300 (0.0006) [2023-03-06 21:07:02,389][62145] Fps is (10 sec: 12697.8, 60 sec: 12748.8, 300 sec: 12791.3). Total num frames: 6457344. Throughput: 0: 12723.3. Samples: 6439829. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:07:02,390][62145] Avg episode reward: [(0, '635.745')] [2023-03-06 21:07:02,716][62475] Updated weights for policy 0, policy_version 6310 (0.0006) [2023-03-06 21:07:03,525][62475] Updated weights for policy 0, policy_version 6320 (0.0007) [2023-03-06 21:07:04,333][62475] Updated weights for policy 0, policy_version 6330 (0.0006) [2023-03-06 21:07:05,133][62475] Updated weights for policy 0, policy_version 6340 (0.0006) [2023-03-06 21:07:05,962][62475] Updated weights for policy 0, policy_version 6350 (0.0006) [2023-03-06 21:07:06,763][62475] Updated weights for policy 0, policy_version 6360 (0.0006) [2023-03-06 21:07:07,390][62145] Fps is (10 sec: 12595.2, 60 sec: 12714.7, 300 sec: 12784.4). Total num frames: 6519808. Throughput: 0: 12711.1. Samples: 6515737. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:07:07,390][62145] Avg episode reward: [(0, '685.877')] [2023-03-06 21:07:07,585][62475] Updated weights for policy 0, policy_version 6370 (0.0006) [2023-03-06 21:07:08,370][62475] Updated weights for policy 0, policy_version 6380 (0.0007) [2023-03-06 21:07:09,188][62475] Updated weights for policy 0, policy_version 6390 (0.0007) [2023-03-06 21:07:09,979][62475] Updated weights for policy 0, policy_version 6400 (0.0006) [2023-03-06 21:07:10,798][62475] Updated weights for policy 0, policy_version 6410 (0.0007) [2023-03-06 21:07:11,598][62475] Updated weights for policy 0, policy_version 6420 (0.0006) [2023-03-06 21:07:12,390][62145] Fps is (10 sec: 12595.0, 60 sec: 12714.7, 300 sec: 12780.9). Total num frames: 6583296. Throughput: 0: 12709.1. Samples: 6553919. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:07:12,390][62145] Avg episode reward: [(0, '683.024')] [2023-03-06 21:07:12,420][62475] Updated weights for policy 0, policy_version 6430 (0.0006) [2023-03-06 21:07:13,222][62475] Updated weights for policy 0, policy_version 6440 (0.0006) [2023-03-06 21:07:14,018][62475] Updated weights for policy 0, policy_version 6450 (0.0007) [2023-03-06 21:07:14,842][62475] Updated weights for policy 0, policy_version 6460 (0.0006) [2023-03-06 21:07:15,645][62475] Updated weights for policy 0, policy_version 6470 (0.0006) [2023-03-06 21:07:16,461][62475] Updated weights for policy 0, policy_version 6480 (0.0006) [2023-03-06 21:07:17,238][62475] Updated weights for policy 0, policy_version 6490 (0.0006) [2023-03-06 21:07:17,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12777.4). Total num frames: 6646784. Throughput: 0: 12693.5. Samples: 6630084. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:07:17,390][62145] Avg episode reward: [(0, '643.359')] [2023-03-06 21:07:18,058][62475] Updated weights for policy 0, policy_version 6500 (0.0006) [2023-03-06 21:07:18,854][62475] Updated weights for policy 0, policy_version 6510 (0.0006) [2023-03-06 21:07:19,653][62475] Updated weights for policy 0, policy_version 6520 (0.0007) [2023-03-06 21:07:20,466][62475] Updated weights for policy 0, policy_version 6530 (0.0006) [2023-03-06 21:07:21,265][62475] Updated weights for policy 0, policy_version 6540 (0.0007) [2023-03-06 21:07:22,078][62475] Updated weights for policy 0, policy_version 6550 (0.0006) [2023-03-06 21:07:22,390][62145] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12777.4). Total num frames: 6710272. Throughput: 0: 12690.3. Samples: 6706454. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:07:22,390][62145] Avg episode reward: [(0, '554.179')] [2023-03-06 21:07:22,893][62475] Updated weights for policy 0, policy_version 6560 (0.0006) [2023-03-06 21:07:23,707][62475] Updated weights for policy 0, policy_version 6570 (0.0006) [2023-03-06 21:07:24,508][62475] Updated weights for policy 0, policy_version 6580 (0.0006) [2023-03-06 21:07:25,330][62475] Updated weights for policy 0, policy_version 6590 (0.0007) [2023-03-06 21:07:26,142][62475] Updated weights for policy 0, policy_version 6600 (0.0007) [2023-03-06 21:07:26,936][62475] Updated weights for policy 0, policy_version 6610 (0.0006) [2023-03-06 21:07:27,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12774.0). Total num frames: 6773760. Throughput: 0: 12681.9. Samples: 6744199. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:07:27,390][62145] Avg episode reward: [(0, '573.754')] [2023-03-06 21:07:27,762][62475] Updated weights for policy 0, policy_version 6620 (0.0006) [2023-03-06 21:07:28,579][62475] Updated weights for policy 0, policy_version 6630 (0.0006) [2023-03-06 21:07:29,385][62475] Updated weights for policy 0, policy_version 6640 (0.0006) [2023-03-06 21:07:30,185][62475] Updated weights for policy 0, policy_version 6650 (0.0006) [2023-03-06 21:07:31,008][62475] Updated weights for policy 0, policy_version 6660 (0.0006) [2023-03-06 21:07:31,819][62475] Updated weights for policy 0, policy_version 6670 (0.0006) [2023-03-06 21:07:32,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12680.5, 300 sec: 12770.5). Total num frames: 6837248. Throughput: 0: 12671.3. Samples: 6819855. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:07:32,390][62145] Avg episode reward: [(0, '581.681')] [2023-03-06 21:07:32,622][62475] Updated weights for policy 0, policy_version 6680 (0.0006) [2023-03-06 21:07:33,440][62475] Updated weights for policy 0, policy_version 6690 (0.0007) [2023-03-06 21:07:34,253][62475] Updated weights for policy 0, policy_version 6700 (0.0006) [2023-03-06 21:07:35,076][62475] Updated weights for policy 0, policy_version 6710 (0.0006) [2023-03-06 21:07:35,881][62475] Updated weights for policy 0, policy_version 6720 (0.0006) [2023-03-06 21:07:36,689][62475] Updated weights for policy 0, policy_version 6730 (0.0007) [2023-03-06 21:07:37,390][62145] Fps is (10 sec: 12595.1, 60 sec: 12663.5, 300 sec: 12767.0). Total num frames: 6899712. Throughput: 0: 12657.8. Samples: 6895665. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:07:37,390][62145] Avg episode reward: [(0, '477.547')] [2023-03-06 21:07:37,493][62475] Updated weights for policy 0, policy_version 6740 (0.0006) [2023-03-06 21:07:38,300][62475] Updated weights for policy 0, policy_version 6750 (0.0006) [2023-03-06 21:07:39,114][62475] Updated weights for policy 0, policy_version 6760 (0.0007) [2023-03-06 21:07:39,930][62475] Updated weights for policy 0, policy_version 6770 (0.0006) [2023-03-06 21:07:40,745][62475] Updated weights for policy 0, policy_version 6780 (0.0006) [2023-03-06 21:07:41,555][62475] Updated weights for policy 0, policy_version 6790 (0.0006) [2023-03-06 21:07:42,360][62475] Updated weights for policy 0, policy_version 6800 (0.0006) [2023-03-06 21:07:42,390][62145] Fps is (10 sec: 12595.2, 60 sec: 12663.5, 300 sec: 12763.6). Total num frames: 6963200. Throughput: 0: 12663.1. Samples: 6933690. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:07:42,390][62145] Avg episode reward: [(0, '544.650')] [2023-03-06 21:07:43,178][62475] Updated weights for policy 0, policy_version 6810 (0.0006) [2023-03-06 21:07:43,993][62475] Updated weights for policy 0, policy_version 6820 (0.0007) [2023-03-06 21:07:44,813][62475] Updated weights for policy 0, policy_version 6830 (0.0006) [2023-03-06 21:07:45,620][62475] Updated weights for policy 0, policy_version 6840 (0.0006) [2023-03-06 21:07:46,415][62475] Updated weights for policy 0, policy_version 6850 (0.0006) [2023-03-06 21:07:47,244][62475] Updated weights for policy 0, policy_version 6860 (0.0006) [2023-03-06 21:07:47,389][62145] Fps is (10 sec: 12595.3, 60 sec: 12646.4, 300 sec: 12756.6). Total num frames: 7025664. Throughput: 0: 12652.3. Samples: 7009185. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:07:47,390][62145] Avg episode reward: [(0, '510.248')] [2023-03-06 21:07:48,059][62475] Updated weights for policy 0, policy_version 6870 (0.0006) [2023-03-06 21:07:48,866][62475] Updated weights for policy 0, policy_version 6880 (0.0006) [2023-03-06 21:07:49,694][62475] Updated weights for policy 0, policy_version 6890 (0.0006) [2023-03-06 21:07:50,490][62475] Updated weights for policy 0, policy_version 6900 (0.0006) [2023-03-06 21:07:51,283][62475] Updated weights for policy 0, policy_version 6910 (0.0006) [2023-03-06 21:07:52,134][62475] Updated weights for policy 0, policy_version 6920 (0.0006) [2023-03-06 21:07:52,389][62145] Fps is (10 sec: 12595.2, 60 sec: 12646.4, 300 sec: 12753.1). Total num frames: 7089152. Throughput: 0: 12642.8. Samples: 7084662. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:07:52,390][62145] Avg episode reward: [(0, '506.362')] [2023-03-06 21:07:52,931][62475] Updated weights for policy 0, policy_version 6930 (0.0006) [2023-03-06 21:07:53,748][62475] Updated weights for policy 0, policy_version 6940 (0.0006) [2023-03-06 21:07:54,554][62475] Updated weights for policy 0, policy_version 6950 (0.0006) [2023-03-06 21:07:55,374][62475] Updated weights for policy 0, policy_version 6960 (0.0006) [2023-03-06 21:07:56,187][62475] Updated weights for policy 0, policy_version 6970 (0.0006) [2023-03-06 21:07:56,989][62475] Updated weights for policy 0, policy_version 6980 (0.0006) [2023-03-06 21:07:57,390][62145] Fps is (10 sec: 12595.1, 60 sec: 12629.3, 300 sec: 12746.2). Total num frames: 7151616. Throughput: 0: 12633.9. Samples: 7122446. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:07:57,390][62145] Avg episode reward: [(0, '506.838')] [2023-03-06 21:07:57,799][62475] Updated weights for policy 0, policy_version 6990 (0.0006) [2023-03-06 21:07:58,612][62475] Updated weights for policy 0, policy_version 7000 (0.0007) [2023-03-06 21:07:59,405][62475] Updated weights for policy 0, policy_version 7010 (0.0006) [2023-03-06 21:08:00,224][62475] Updated weights for policy 0, policy_version 7020 (0.0006) [2023-03-06 21:08:01,026][62475] Updated weights for policy 0, policy_version 7030 (0.0006) [2023-03-06 21:08:01,851][62475] Updated weights for policy 0, policy_version 7040 (0.0006) [2023-03-06 21:08:02,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12646.4, 300 sec: 12749.7). Total num frames: 7216128. Throughput: 0: 12635.7. Samples: 7198692. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:08:02,390][62145] Avg episode reward: [(0, '535.056')] [2023-03-06 21:08:02,643][62475] Updated weights for policy 0, policy_version 7050 (0.0006) [2023-03-06 21:08:03,447][62475] Updated weights for policy 0, policy_version 7060 (0.0007) [2023-03-06 21:08:04,259][62475] Updated weights for policy 0, policy_version 7070 (0.0006) [2023-03-06 21:08:05,064][62475] Updated weights for policy 0, policy_version 7080 (0.0006) [2023-03-06 21:08:05,892][62475] Updated weights for policy 0, policy_version 7090 (0.0006) [2023-03-06 21:08:06,718][62475] Updated weights for policy 0, policy_version 7100 (0.0007) [2023-03-06 21:08:07,390][62145] Fps is (10 sec: 12697.7, 60 sec: 12646.4, 300 sec: 12742.7). Total num frames: 7278592. Throughput: 0: 12617.3. Samples: 7274232. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:08:07,390][62145] Avg episode reward: [(0, '529.376')] [2023-03-06 21:08:07,515][62475] Updated weights for policy 0, policy_version 7110 (0.0006) [2023-03-06 21:08:08,324][62475] Updated weights for policy 0, policy_version 7120 (0.0006) [2023-03-06 21:08:09,129][62475] Updated weights for policy 0, policy_version 7130 (0.0006) [2023-03-06 21:08:09,931][62475] Updated weights for policy 0, policy_version 7140 (0.0006) [2023-03-06 21:08:10,746][62475] Updated weights for policy 0, policy_version 7150 (0.0007) [2023-03-06 21:08:11,557][62475] Updated weights for policy 0, policy_version 7160 (0.0006) [2023-03-06 21:08:12,368][62475] Updated weights for policy 0, policy_version 7170 (0.0006) [2023-03-06 21:08:12,390][62145] Fps is (10 sec: 12595.2, 60 sec: 12646.4, 300 sec: 12739.2). Total num frames: 7342080. Throughput: 0: 12629.5. Samples: 7312528. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:08:12,390][62145] Avg episode reward: [(0, '438.119')] [2023-03-06 21:08:13,170][62475] Updated weights for policy 0, policy_version 7180 (0.0007) [2023-03-06 21:08:13,978][62475] Updated weights for policy 0, policy_version 7190 (0.0006) [2023-03-06 21:08:14,771][62475] Updated weights for policy 0, policy_version 7200 (0.0006) [2023-03-06 21:08:15,569][62475] Updated weights for policy 0, policy_version 7210 (0.0006) [2023-03-06 21:08:16,393][62475] Updated weights for policy 0, policy_version 7220 (0.0006) [2023-03-06 21:08:17,211][62475] Updated weights for policy 0, policy_version 7230 (0.0008) [2023-03-06 21:08:17,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12646.4, 300 sec: 12739.3). Total num frames: 7405568. Throughput: 0: 12634.8. Samples: 7388422. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:08:17,390][62145] Avg episode reward: [(0, '550.046')] [2023-03-06 21:08:17,993][62475] Updated weights for policy 0, policy_version 7240 (0.0006) [2023-03-06 21:08:18,824][62475] Updated weights for policy 0, policy_version 7250 (0.0007) [2023-03-06 21:08:19,605][62475] Updated weights for policy 0, policy_version 7260 (0.0006) [2023-03-06 21:08:20,418][62475] Updated weights for policy 0, policy_version 7270 (0.0006) [2023-03-06 21:08:21,272][62475] Updated weights for policy 0, policy_version 7280 (0.0006) [2023-03-06 21:08:22,068][62475] Updated weights for policy 0, policy_version 7290 (0.0006) [2023-03-06 21:08:22,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12646.4, 300 sec: 12735.8). Total num frames: 7469056. Throughput: 0: 12638.1. Samples: 7464380. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:08:22,390][62145] Avg episode reward: [(0, '511.531')] [2023-03-06 21:08:22,394][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000007294_7469056.pth... [2023-03-06 21:08:22,424][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000004313_4416512.pth [2023-03-06 21:08:22,874][62475] Updated weights for policy 0, policy_version 7300 (0.0006) [2023-03-06 21:08:23,691][62475] Updated weights for policy 0, policy_version 7310 (0.0006) [2023-03-06 21:08:24,494][62475] Updated weights for policy 0, policy_version 7320 (0.0006) [2023-03-06 21:08:24,575][62424] KL-divergence is very high: 2510.8916 [2023-03-06 21:08:24,650][62424] KL-divergence is very high: 104.9240 [2023-03-06 21:08:24,736][62424] KL-divergence is very high: 126.3561 [2023-03-06 21:08:24,799][62424] KL-divergence is very high: 242.8945 [2023-03-06 21:08:25,231][62424] KL-divergence is very high: 740.4147 [2023-03-06 21:08:25,305][62475] Updated weights for policy 0, policy_version 7330 (0.0005) [2023-03-06 21:08:25,395][62424] KL-divergence is very high: 472.5934 [2023-03-06 21:08:25,716][62424] KL-divergence is very high: 268.0630 [2023-03-06 21:08:26,126][62475] Updated weights for policy 0, policy_version 7340 (0.0006) [2023-03-06 21:08:26,927][62475] Updated weights for policy 0, policy_version 7350 (0.0006) [2023-03-06 21:08:27,389][62145] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12732.3). Total num frames: 7531520. Throughput: 0: 12633.6. Samples: 7502201. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:08:27,390][62145] Avg episode reward: [(0, '495.815')] [2023-03-06 21:08:27,719][62475] Updated weights for policy 0, policy_version 7360 (0.0006) [2023-03-06 21:08:28,551][62475] Updated weights for policy 0, policy_version 7370 (0.0007) [2023-03-06 21:08:29,350][62475] Updated weights for policy 0, policy_version 7380 (0.0006) [2023-03-06 21:08:30,160][62475] Updated weights for policy 0, policy_version 7390 (0.0007) [2023-03-06 21:08:30,981][62475] Updated weights for policy 0, policy_version 7400 (0.0006) [2023-03-06 21:08:31,806][62475] Updated weights for policy 0, policy_version 7410 (0.0006) [2023-03-06 21:08:32,389][62145] Fps is (10 sec: 12595.2, 60 sec: 12629.3, 300 sec: 12728.8). Total num frames: 7595008. Throughput: 0: 12639.8. Samples: 7577978. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:08:32,390][62145] Avg episode reward: [(0, '634.912')] [2023-03-06 21:08:32,605][62475] Updated weights for policy 0, policy_version 7420 (0.0006) [2023-03-06 21:08:33,395][62475] Updated weights for policy 0, policy_version 7430 (0.0006) [2023-03-06 21:08:34,217][62475] Updated weights for policy 0, policy_version 7440 (0.0006) [2023-03-06 21:08:35,023][62475] Updated weights for policy 0, policy_version 7450 (0.0006) [2023-03-06 21:08:35,830][62475] Updated weights for policy 0, policy_version 7460 (0.0006) [2023-03-06 21:08:36,644][62475] Updated weights for policy 0, policy_version 7470 (0.0006) [2023-03-06 21:08:37,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12646.4, 300 sec: 12728.8). Total num frames: 7658496. Throughput: 0: 12648.1. Samples: 7653827. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:08:37,390][62145] Avg episode reward: [(0, '647.325')] [2023-03-06 21:08:37,445][62475] Updated weights for policy 0, policy_version 7480 (0.0006) [2023-03-06 21:08:38,270][62475] Updated weights for policy 0, policy_version 7490 (0.0006) [2023-03-06 21:08:39,094][62475] Updated weights for policy 0, policy_version 7500 (0.0006) [2023-03-06 21:08:39,907][62475] Updated weights for policy 0, policy_version 7510 (0.0006) [2023-03-06 21:08:40,722][62475] Updated weights for policy 0, policy_version 7520 (0.0006) [2023-03-06 21:08:41,550][62475] Updated weights for policy 0, policy_version 7530 (0.0006) [2023-03-06 21:08:42,354][62475] Updated weights for policy 0, policy_version 7540 (0.0006) [2023-03-06 21:08:42,390][62145] Fps is (10 sec: 12595.1, 60 sec: 12629.3, 300 sec: 12721.9). Total num frames: 7720960. Throughput: 0: 12645.8. Samples: 7691505. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:08:42,390][62145] Avg episode reward: [(0, '625.021')] [2023-03-06 21:08:43,174][62475] Updated weights for policy 0, policy_version 7550 (0.0006) [2023-03-06 21:08:43,972][62475] Updated weights for policy 0, policy_version 7560 (0.0006) [2023-03-06 21:08:44,789][62475] Updated weights for policy 0, policy_version 7570 (0.0006) [2023-03-06 21:08:45,595][62475] Updated weights for policy 0, policy_version 7580 (0.0006) [2023-03-06 21:08:45,651][62424] KL-divergence is very high: 103.7645 [2023-03-06 21:08:46,409][62475] Updated weights for policy 0, policy_version 7590 (0.0007) [2023-03-06 21:08:47,217][62475] Updated weights for policy 0, policy_version 7600 (0.0006) [2023-03-06 21:08:47,389][62145] Fps is (10 sec: 12595.3, 60 sec: 12646.4, 300 sec: 12721.9). Total num frames: 7784448. Throughput: 0: 12633.9. Samples: 7767219. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:08:47,390][62145] Avg episode reward: [(0, '536.137')] [2023-03-06 21:08:48,010][62475] Updated weights for policy 0, policy_version 7610 (0.0006) [2023-03-06 21:08:48,857][62475] Updated weights for policy 0, policy_version 7620 (0.0006) [2023-03-06 21:08:49,670][62475] Updated weights for policy 0, policy_version 7630 (0.0007) [2023-03-06 21:08:50,467][62475] Updated weights for policy 0, policy_version 7640 (0.0006) [2023-03-06 21:08:51,251][62475] Updated weights for policy 0, policy_version 7650 (0.0006) [2023-03-06 21:08:52,068][62475] Updated weights for policy 0, policy_version 7660 (0.0006) [2023-03-06 21:08:52,389][62145] Fps is (10 sec: 12595.3, 60 sec: 12629.3, 300 sec: 12715.0). Total num frames: 7846912. Throughput: 0: 12641.1. Samples: 7843082. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:08:52,390][62145] Avg episode reward: [(0, '567.210')] [2023-03-06 21:08:52,869][62475] Updated weights for policy 0, policy_version 7670 (0.0007) [2023-03-06 21:08:53,680][62475] Updated weights for policy 0, policy_version 7680 (0.0006) [2023-03-06 21:08:54,480][62475] Updated weights for policy 0, policy_version 7690 (0.0007) [2023-03-06 21:08:55,278][62475] Updated weights for policy 0, policy_version 7700 (0.0006) [2023-03-06 21:08:56,101][62475] Updated weights for policy 0, policy_version 7710 (0.0006) [2023-03-06 21:08:56,422][62424] KL-divergence is very high: 119.9178 [2023-03-06 21:08:56,905][62475] Updated weights for policy 0, policy_version 7720 (0.0006) [2023-03-06 21:08:57,390][62145] Fps is (10 sec: 12595.2, 60 sec: 12646.4, 300 sec: 12715.0). Total num frames: 7910400. Throughput: 0: 12640.9. Samples: 7881369. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:08:57,390][62145] Avg episode reward: [(0, '560.409')] [2023-03-06 21:08:57,716][62475] Updated weights for policy 0, policy_version 7730 (0.0006) [2023-03-06 21:08:58,528][62475] Updated weights for policy 0, policy_version 7740 (0.0007) [2023-03-06 21:08:58,599][62424] KL-divergence is very high: 2775.1460 [2023-03-06 21:08:58,765][62424] KL-divergence is very high: 431.7324 [2023-03-06 21:08:59,085][62424] KL-divergence is very high: 1115.3018 [2023-03-06 21:08:59,259][62424] KL-divergence is very high: 289.6699 [2023-03-06 21:08:59,361][62475] Updated weights for policy 0, policy_version 7750 (0.0005) [2023-03-06 21:08:59,430][62424] KL-divergence is very high: 1020.2804 [2023-03-06 21:08:59,517][62424] KL-divergence is very high: 101.8800 [2023-03-06 21:08:59,591][62424] KL-divergence is very high: 127.5083 [2023-03-06 21:08:59,674][62424] KL-divergence is very high: 313.3930 [2023-03-06 21:08:59,747][62424] KL-divergence is very high: 703.4086 [2023-03-06 21:08:59,837][62424] KL-divergence is very high: 457.6951 [2023-03-06 21:08:59,910][62424] KL-divergence is very high: 356.0632 [2023-03-06 21:09:00,071][62424] KL-divergence is very high: 1471.4293 [2023-03-06 21:09:00,159][62424] KL-divergence is very high: 1140.4941 [2023-03-06 21:09:00,167][62475] Updated weights for policy 0, policy_version 7760 (0.0006) [2023-03-06 21:09:00,988][62475] Updated weights for policy 0, policy_version 7770 (0.0007) [2023-03-06 21:09:01,136][62424] KL-divergence is very high: 227.0090 [2023-03-06 21:09:01,371][62424] KL-divergence is very high: 224.1548 [2023-03-06 21:09:01,799][62475] Updated weights for policy 0, policy_version 7780 (0.0006) [2023-03-06 21:09:02,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12629.3, 300 sec: 12708.0). Total num frames: 7973888. Throughput: 0: 12632.9. Samples: 7956904. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:09:02,390][62145] Avg episode reward: [(0, '616.424')] [2023-03-06 21:09:02,597][62475] Updated weights for policy 0, policy_version 7790 (0.0006) [2023-03-06 21:09:03,407][62475] Updated weights for policy 0, policy_version 7800 (0.0006) [2023-03-06 21:09:04,217][62475] Updated weights for policy 0, policy_version 7810 (0.0006) [2023-03-06 21:09:05,018][62475] Updated weights for policy 0, policy_version 7820 (0.0006) [2023-03-06 21:09:05,831][62475] Updated weights for policy 0, policy_version 7830 (0.0007) [2023-03-06 21:09:06,644][62475] Updated weights for policy 0, policy_version 7840 (0.0006) [2023-03-06 21:09:07,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12646.4, 300 sec: 12708.0). Total num frames: 8037376. Throughput: 0: 12632.3. Samples: 8032834. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:09:07,390][62145] Avg episode reward: [(0, '528.971')] [2023-03-06 21:09:07,451][62475] Updated weights for policy 0, policy_version 7850 (0.0006) [2023-03-06 21:09:08,258][62475] Updated weights for policy 0, policy_version 7860 (0.0006) [2023-03-06 21:09:09,081][62475] Updated weights for policy 0, policy_version 7870 (0.0006) [2023-03-06 21:09:09,893][62475] Updated weights for policy 0, policy_version 7880 (0.0006) [2023-03-06 21:09:10,693][62475] Updated weights for policy 0, policy_version 7890 (0.0006) [2023-03-06 21:09:11,489][62475] Updated weights for policy 0, policy_version 7900 (0.0006) [2023-03-06 21:09:12,296][62475] Updated weights for policy 0, policy_version 7910 (0.0006) [2023-03-06 21:09:12,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12646.4, 300 sec: 12704.5). Total num frames: 8100864. Throughput: 0: 12633.3. Samples: 8070701. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:09:12,390][62145] Avg episode reward: [(0, '523.416')] [2023-03-06 21:09:13,097][62475] Updated weights for policy 0, policy_version 7920 (0.0007) [2023-03-06 21:09:13,912][62475] Updated weights for policy 0, policy_version 7930 (0.0007) [2023-03-06 21:09:14,730][62475] Updated weights for policy 0, policy_version 7940 (0.0006) [2023-03-06 21:09:15,531][62475] Updated weights for policy 0, policy_version 7950 (0.0006) [2023-03-06 21:09:16,335][62475] Updated weights for policy 0, policy_version 7960 (0.0006) [2023-03-06 21:09:17,124][62475] Updated weights for policy 0, policy_version 7970 (0.0007) [2023-03-06 21:09:17,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12646.4, 300 sec: 12704.5). Total num frames: 8164352. Throughput: 0: 12646.8. Samples: 8147083. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:09:17,390][62145] Avg episode reward: [(0, '448.055')] [2023-03-06 21:09:17,940][62475] Updated weights for policy 0, policy_version 7980 (0.0006) [2023-03-06 21:09:18,749][62475] Updated weights for policy 0, policy_version 7990 (0.0006) [2023-03-06 21:09:19,563][62475] Updated weights for policy 0, policy_version 8000 (0.0006) [2023-03-06 21:09:20,349][62475] Updated weights for policy 0, policy_version 8010 (0.0006) [2023-03-06 21:09:21,142][62475] Updated weights for policy 0, policy_version 8020 (0.0006) [2023-03-06 21:09:21,974][62475] Updated weights for policy 0, policy_version 8030 (0.0006) [2023-03-06 21:09:22,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12646.4, 300 sec: 12701.1). Total num frames: 8227840. Throughput: 0: 12654.2. Samples: 8223267. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:09:22,390][62145] Avg episode reward: [(0, '549.781')] [2023-03-06 21:09:22,766][62475] Updated weights for policy 0, policy_version 8040 (0.0007) [2023-03-06 21:09:23,558][62475] Updated weights for policy 0, policy_version 8050 (0.0006) [2023-03-06 21:09:24,377][62475] Updated weights for policy 0, policy_version 8060 (0.0007) [2023-03-06 21:09:25,210][62475] Updated weights for policy 0, policy_version 8070 (0.0006) [2023-03-06 21:09:26,005][62475] Updated weights for policy 0, policy_version 8080 (0.0006) [2023-03-06 21:09:26,801][62475] Updated weights for policy 0, policy_version 8090 (0.0006) [2023-03-06 21:09:27,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12663.5, 300 sec: 12701.1). Total num frames: 8291328. Throughput: 0: 12660.2. Samples: 8261212. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:09:27,390][62145] Avg episode reward: [(0, '583.920')] [2023-03-06 21:09:27,629][62475] Updated weights for policy 0, policy_version 8100 (0.0006) [2023-03-06 21:09:28,433][62475] Updated weights for policy 0, policy_version 8110 (0.0006) [2023-03-06 21:09:29,267][62475] Updated weights for policy 0, policy_version 8120 (0.0006) [2023-03-06 21:09:30,042][62475] Updated weights for policy 0, policy_version 8130 (0.0006) [2023-03-06 21:09:30,862][62475] Updated weights for policy 0, policy_version 8140 (0.0006) [2023-03-06 21:09:31,668][62475] Updated weights for policy 0, policy_version 8150 (0.0006) [2023-03-06 21:09:32,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12663.5, 300 sec: 12697.6). Total num frames: 8354816. Throughput: 0: 12670.6. Samples: 8337398. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:09:32,400][62145] Avg episode reward: [(0, '534.030')] [2023-03-06 21:09:32,473][62475] Updated weights for policy 0, policy_version 8160 (0.0006) [2023-03-06 21:09:33,269][62475] Updated weights for policy 0, policy_version 8170 (0.0006) [2023-03-06 21:09:34,058][62475] Updated weights for policy 0, policy_version 8180 (0.0006) [2023-03-06 21:09:34,865][62475] Updated weights for policy 0, policy_version 8190 (0.0006) [2023-03-06 21:09:35,670][62475] Updated weights for policy 0, policy_version 8200 (0.0006) [2023-03-06 21:09:36,474][62475] Updated weights for policy 0, policy_version 8210 (0.0006) [2023-03-06 21:09:37,285][62475] Updated weights for policy 0, policy_version 8220 (0.0006) [2023-03-06 21:09:37,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12663.5, 300 sec: 12697.6). Total num frames: 8418304. Throughput: 0: 12681.2. Samples: 8413735. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:09:37,390][62145] Avg episode reward: [(0, '582.463')] [2023-03-06 21:09:38,085][62475] Updated weights for policy 0, policy_version 8230 (0.0006) [2023-03-06 21:09:38,916][62475] Updated weights for policy 0, policy_version 8240 (0.0006) [2023-03-06 21:09:39,722][62475] Updated weights for policy 0, policy_version 8250 (0.0006) [2023-03-06 21:09:40,526][62475] Updated weights for policy 0, policy_version 8260 (0.0007) [2023-03-06 21:09:41,352][62475] Updated weights for policy 0, policy_version 8270 (0.0007) [2023-03-06 21:09:42,175][62475] Updated weights for policy 0, policy_version 8280 (0.0006) [2023-03-06 21:09:42,390][62145] Fps is (10 sec: 12595.2, 60 sec: 12663.5, 300 sec: 12694.1). Total num frames: 8480768. Throughput: 0: 12672.8. Samples: 8451643. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 21:09:42,390][62145] Avg episode reward: [(0, '537.134')] [2023-03-06 21:09:42,974][62475] Updated weights for policy 0, policy_version 8290 (0.0007) [2023-03-06 21:09:43,792][62475] Updated weights for policy 0, policy_version 8300 (0.0006) [2023-03-06 21:09:44,598][62475] Updated weights for policy 0, policy_version 8310 (0.0007) [2023-03-06 21:09:45,409][62475] Updated weights for policy 0, policy_version 8320 (0.0007) [2023-03-06 21:09:46,201][62475] Updated weights for policy 0, policy_version 8330 (0.0006) [2023-03-06 21:09:47,038][62475] Updated weights for policy 0, policy_version 8340 (0.0006) [2023-03-06 21:09:47,390][62145] Fps is (10 sec: 12595.1, 60 sec: 12663.4, 300 sec: 12690.7). Total num frames: 8544256. Throughput: 0: 12677.0. Samples: 8527372. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:09:47,390][62145] Avg episode reward: [(0, '518.968')] [2023-03-06 21:09:47,822][62475] Updated weights for policy 0, policy_version 8350 (0.0006) [2023-03-06 21:09:47,898][62424] KL-divergence is very high: 4509.6226 [2023-03-06 21:09:48,629][62475] Updated weights for policy 0, policy_version 8360 (0.0006) [2023-03-06 21:09:49,445][62475] Updated weights for policy 0, policy_version 8370 (0.0006) [2023-03-06 21:09:50,236][62475] Updated weights for policy 0, policy_version 8380 (0.0006) [2023-03-06 21:09:51,052][62475] Updated weights for policy 0, policy_version 8390 (0.0007) [2023-03-06 21:09:51,767][62424] KL-divergence is very high: 372.1480 [2023-03-06 21:09:51,849][62475] Updated weights for policy 0, policy_version 8400 (0.0006) [2023-03-06 21:09:52,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12680.5, 300 sec: 12690.7). Total num frames: 8607744. Throughput: 0: 12683.9. Samples: 8603610. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:09:52,390][62145] Avg episode reward: [(0, '540.512')] [2023-03-06 21:09:52,656][62475] Updated weights for policy 0, policy_version 8410 (0.0006) [2023-03-06 21:09:53,482][62475] Updated weights for policy 0, policy_version 8420 (0.0006) [2023-03-06 21:09:54,285][62475] Updated weights for policy 0, policy_version 8430 (0.0006) [2023-03-06 21:09:55,088][62475] Updated weights for policy 0, policy_version 8440 (0.0006) [2023-03-06 21:09:55,888][62475] Updated weights for policy 0, policy_version 8450 (0.0006) [2023-03-06 21:09:56,713][62475] Updated weights for policy 0, policy_version 8460 (0.0006) [2023-03-06 21:09:57,389][62145] Fps is (10 sec: 12697.8, 60 sec: 12680.5, 300 sec: 12690.7). Total num frames: 8671232. Throughput: 0: 12688.6. Samples: 8641687. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:09:57,390][62145] Avg episode reward: [(0, '536.781')] [2023-03-06 21:09:57,519][62475] Updated weights for policy 0, policy_version 8470 (0.0006) [2023-03-06 21:09:58,310][62475] Updated weights for policy 0, policy_version 8480 (0.0006) [2023-03-06 21:09:59,138][62475] Updated weights for policy 0, policy_version 8490 (0.0006) [2023-03-06 21:09:59,931][62475] Updated weights for policy 0, policy_version 8500 (0.0006) [2023-03-06 21:10:00,737][62475] Updated weights for policy 0, policy_version 8510 (0.0007) [2023-03-06 21:10:01,224][62424] KL-divergence is very high: 593.4801 [2023-03-06 21:10:01,384][62424] KL-divergence is very high: 10142.6777 [2023-03-06 21:10:01,542][62424] KL-divergence is very high: 727.3256 [2023-03-06 21:10:01,551][62475] Updated weights for policy 0, policy_version 8520 (0.0006) [2023-03-06 21:10:01,716][62424] KL-divergence is very high: 1638.9570 [2023-03-06 21:10:02,358][62475] Updated weights for policy 0, policy_version 8530 (0.0006) [2023-03-06 21:10:02,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12680.5, 300 sec: 12690.7). Total num frames: 8734720. Throughput: 0: 12679.2. Samples: 8717646. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:10:02,390][62145] Avg episode reward: [(0, '610.252')] [2023-03-06 21:10:03,145][62475] Updated weights for policy 0, policy_version 8540 (0.0007) [2023-03-06 21:10:03,964][62475] Updated weights for policy 0, policy_version 8550 (0.0006) [2023-03-06 21:10:04,770][62475] Updated weights for policy 0, policy_version 8560 (0.0006) [2023-03-06 21:10:05,241][62424] KL-divergence is very high: 533.2645 [2023-03-06 21:10:05,561][62424] KL-divergence is very high: 3145.9805 [2023-03-06 21:10:05,570][62475] Updated weights for policy 0, policy_version 8570 (0.0007) [2023-03-06 21:10:05,807][62424] KL-divergence is very high: 20177.3711 [2023-03-06 21:10:06,128][62424] KL-divergence is very high: 1891.1987 [2023-03-06 21:10:06,382][62475] Updated weights for policy 0, policy_version 8580 (0.0006) [2023-03-06 21:10:06,776][62424] KL-divergence is very high: 165.3805 [2023-03-06 21:10:07,209][62475] Updated weights for policy 0, policy_version 8590 (0.0007) [2023-03-06 21:10:07,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12680.5, 300 sec: 12687.2). Total num frames: 8798208. Throughput: 0: 12679.1. Samples: 8793827. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:10:07,390][62145] Avg episode reward: [(0, '464.285')] [2023-03-06 21:10:08,002][62475] Updated weights for policy 0, policy_version 8600 (0.0007) [2023-03-06 21:10:08,789][62475] Updated weights for policy 0, policy_version 8610 (0.0006) [2023-03-06 21:10:09,440][62424] KL-divergence is very high: 11451.8311 [2023-03-06 21:10:09,617][62475] Updated weights for policy 0, policy_version 8620 (0.0006) [2023-03-06 21:10:10,405][62475] Updated weights for policy 0, policy_version 8630 (0.0007) [2023-03-06 21:10:11,196][62475] Updated weights for policy 0, policy_version 8640 (0.0006) [2023-03-06 21:10:12,025][62475] Updated weights for policy 0, policy_version 8650 (0.0007) [2023-03-06 21:10:12,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12683.7). Total num frames: 8861696. Throughput: 0: 12685.4. Samples: 8832052. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:10:12,390][62145] Avg episode reward: [(0, '608.252')] [2023-03-06 21:10:12,730][62424] KL-divergence is very high: 908.5594 [2023-03-06 21:10:12,832][62475] Updated weights for policy 0, policy_version 8660 (0.0006) [2023-03-06 21:10:13,618][62475] Updated weights for policy 0, policy_version 8670 (0.0006) [2023-03-06 21:10:13,948][62424] KL-divergence is very high: 165.9560 [2023-03-06 21:10:14,344][62424] KL-divergence is very high: 1345.0330 [2023-03-06 21:10:14,437][62475] Updated weights for policy 0, policy_version 8680 (0.0006) [2023-03-06 21:10:15,230][62475] Updated weights for policy 0, policy_version 8690 (0.0006) [2023-03-06 21:10:16,035][62475] Updated weights for policy 0, policy_version 8700 (0.0006) [2023-03-06 21:10:16,847][62424] KL-divergence is very high: 1291.7141 [2023-03-06 21:10:16,854][62475] Updated weights for policy 0, policy_version 8710 (0.0006) [2023-03-06 21:10:17,089][62424] KL-divergence is very high: 2898.9919 [2023-03-06 21:10:17,159][62424] KL-divergence is very high: 1340.9984 [2023-03-06 21:10:17,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12680.5, 300 sec: 12683.7). Total num frames: 8925184. Throughput: 0: 12690.8. Samples: 8908484. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:10:17,390][62145] Avg episode reward: [(0, '523.850')] [2023-03-06 21:10:17,639][62475] Updated weights for policy 0, policy_version 8720 (0.0006) [2023-03-06 21:10:18,369][62424] KL-divergence is very high: 3329.2449 [2023-03-06 21:10:18,444][62475] Updated weights for policy 0, policy_version 8730 (0.0007) [2023-03-06 21:10:18,531][62424] KL-divergence is very high: 1578.1646 [2023-03-06 21:10:18,693][62424] KL-divergence is very high: 12127.9004 [2023-03-06 21:10:18,849][62424] KL-divergence is very high: 2496.6934 [2023-03-06 21:10:19,011][62424] KL-divergence is very high: 2597.0325 [2023-03-06 21:10:19,167][62424] KL-divergence is very high: 115.5197 [2023-03-06 21:10:19,252][62475] Updated weights for policy 0, policy_version 8740 (0.0006) [2023-03-06 21:10:19,328][62424] KL-divergence is very high: 388.0942 [2023-03-06 21:10:19,402][62424] KL-divergence is very high: 675.6747 [2023-03-06 21:10:19,493][62424] KL-divergence is very high: 21015.3418 [2023-03-06 21:10:19,572][62424] KL-divergence is very high: 5037.8975 [2023-03-06 21:10:19,804][62424] KL-divergence is very high: 1415763.2500 [2023-03-06 21:10:19,965][62424] KL-divergence is very high: 624.0584 [2023-03-06 21:10:20,066][62475] Updated weights for policy 0, policy_version 8750 (0.0007) [2023-03-06 21:10:20,139][62424] KL-divergence is very high: 558.0314 [2023-03-06 21:10:20,219][62424] KL-divergence is very high: 20394.5547 [2023-03-06 21:10:20,308][62424] KL-divergence is very high: 468.0921 [2023-03-06 21:10:20,391][62424] KL-divergence is very high: 12749.1416 [2023-03-06 21:10:20,467][62424] KL-divergence is very high: 1615.4612 [2023-03-06 21:10:20,621][62424] KL-divergence is very high: 377.9609 [2023-03-06 21:10:20,710][62424] KL-divergence is very high: 3760.0278 [2023-03-06 21:10:20,876][62475] Updated weights for policy 0, policy_version 8760 (0.0006) [2023-03-06 21:10:21,025][62424] KL-divergence is very high: 756.4051 [2023-03-06 21:10:21,352][62424] KL-divergence is very high: 5488.7446 [2023-03-06 21:10:21,448][62424] KL-divergence is very high: 235.9205 [2023-03-06 21:10:21,697][62475] Updated weights for policy 0, policy_version 8770 (0.0006) [2023-03-06 21:10:21,849][62424] KL-divergence is very high: 514315.4375 [2023-03-06 21:10:21,924][62424] KL-divergence is very high: 1882.8025 [2023-03-06 21:10:22,014][62424] KL-divergence is very high: 596.0648 [2023-03-06 21:10:22,092][62424] KL-divergence is very high: 28837.2461 [2023-03-06 21:10:22,246][62424] KL-divergence is very high: 3463.3203 [2023-03-06 21:10:22,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12680.5, 300 sec: 12683.7). Total num frames: 8988672. Throughput: 0: 12685.5. Samples: 8984583. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:10:22,390][62145] Avg episode reward: [(0, '529.387')] [2023-03-06 21:10:22,394][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000008778_8988672.pth... [2023-03-06 21:10:22,426][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000005809_5948416.pth [2023-03-06 21:10:22,439][62424] KL-divergence is very high: 17466.0605 [2023-03-06 21:10:22,517][62475] Updated weights for policy 0, policy_version 8780 (0.0006) [2023-03-06 21:10:22,822][62424] KL-divergence is very high: 1700.2839 [2023-03-06 21:10:23,305][62475] Updated weights for policy 0, policy_version 8790 (0.0007) [2023-03-06 21:10:23,453][62424] KL-divergence is very high: 1190.7819 [2023-03-06 21:10:23,718][62424] KL-divergence is very high: 312.4545 [2023-03-06 21:10:23,872][62424] KL-divergence is very high: 459.7438 [2023-03-06 21:10:24,114][62475] Updated weights for policy 0, policy_version 8800 (0.0006) [2023-03-06 21:10:24,183][62424] KL-divergence is very high: 20539.7441 [2023-03-06 21:10:24,584][62424] KL-divergence is very high: 33859.6094 [2023-03-06 21:10:24,668][62424] KL-divergence is very high: 135.6347 [2023-03-06 21:10:24,743][62424] KL-divergence is very high: 2960.2410 [2023-03-06 21:10:24,820][62424] KL-divergence is very high: 3780.4814 [2023-03-06 21:10:24,911][62475] Updated weights for policy 0, policy_version 8810 (0.0007) [2023-03-06 21:10:25,147][62424] KL-divergence is very high: 484.5221 [2023-03-06 21:10:25,559][62424] KL-divergence is very high: 311.9050 [2023-03-06 21:10:25,723][62475] Updated weights for policy 0, policy_version 8820 (0.0006) [2023-03-06 21:10:25,879][62424] KL-divergence is very high: 817.9205 [2023-03-06 21:10:26,286][62424] KL-divergence is very high: 286.2054 [2023-03-06 21:10:26,441][62424] KL-divergence is very high: 288.4759 [2023-03-06 21:10:26,541][62475] Updated weights for policy 0, policy_version 8830 (0.0006) [2023-03-06 21:10:27,242][62424] KL-divergence is very high: 158.8659 [2023-03-06 21:10:27,345][62475] Updated weights for policy 0, policy_version 8840 (0.0007) [2023-03-06 21:10:27,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12680.5, 300 sec: 12683.7). Total num frames: 9052160. Throughput: 0: 12689.9. Samples: 9022689. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:10:27,390][62145] Avg episode reward: [(0, '471.699')] [2023-03-06 21:10:27,493][62424] KL-divergence is very high: 217.0082 [2023-03-06 21:10:27,588][62424] KL-divergence is very high: 8775.4199 [2023-03-06 21:10:27,748][62424] KL-divergence is very high: 7685.3652 [2023-03-06 21:10:27,913][62424] KL-divergence is very high: 674.7808 [2023-03-06 21:10:28,070][62424] KL-divergence is very high: 349.2484 [2023-03-06 21:10:28,141][62475] Updated weights for policy 0, policy_version 8850 (0.0006) [2023-03-06 21:10:28,383][62424] KL-divergence is very high: 14821.6211 [2023-03-06 21:10:28,880][62424] KL-divergence is very high: 3182.0625 [2023-03-06 21:10:28,965][62475] Updated weights for policy 0, policy_version 8860 (0.0007) [2023-03-06 21:10:29,275][62424] KL-divergence is very high: 183.2251 [2023-03-06 21:10:29,790][62475] Updated weights for policy 0, policy_version 8870 (0.0007) [2023-03-06 21:10:30,012][62424] KL-divergence is very high: 156.0134 [2023-03-06 21:10:30,569][62475] Updated weights for policy 0, policy_version 8880 (0.0006) [2023-03-06 21:10:31,384][62475] Updated weights for policy 0, policy_version 8890 (0.0006) [2023-03-06 21:10:31,785][62424] KL-divergence is very high: 1756.4355 [2023-03-06 21:10:32,183][62424] KL-divergence is very high: 256.4346 [2023-03-06 21:10:32,191][62475] Updated weights for policy 0, policy_version 8900 (0.0006) [2023-03-06 21:10:32,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12683.7). Total num frames: 9115648. Throughput: 0: 12694.9. Samples: 9098642. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:10:32,390][62145] Avg episode reward: [(0, '415.183')] [2023-03-06 21:10:32,432][62424] KL-divergence is very high: 1246.5153 [2023-03-06 21:10:32,671][62424] KL-divergence is very high: 146.0420 [2023-03-06 21:10:32,765][62424] KL-divergence is very high: 177.7888 [2023-03-06 21:10:32,990][62475] Updated weights for policy 0, policy_version 8910 (0.0006) [2023-03-06 21:10:33,480][62424] KL-divergence is very high: 343.5712 [2023-03-06 21:10:33,814][62475] Updated weights for policy 0, policy_version 8920 (0.0007) [2023-03-06 21:10:33,979][62424] KL-divergence is very high: 1350.3621 [2023-03-06 21:10:34,457][62424] KL-divergence is very high: 111.6683 [2023-03-06 21:10:34,622][62475] Updated weights for policy 0, policy_version 8930 (0.0006) [2023-03-06 21:10:34,922][62424] KL-divergence is very high: 422.9598 [2023-03-06 21:10:35,420][62424] KL-divergence is very high: 956.2237 [2023-03-06 21:10:35,428][62475] Updated weights for policy 0, policy_version 8940 (0.0007) [2023-03-06 21:10:35,977][62424] KL-divergence is very high: 4895.0337 [2023-03-06 21:10:36,228][62475] Updated weights for policy 0, policy_version 8950 (0.0007) [2023-03-06 21:10:36,323][62424] KL-divergence is very high: 302.9731 [2023-03-06 21:10:37,038][62475] Updated weights for policy 0, policy_version 8960 (0.0006) [2023-03-06 21:10:37,122][62424] KL-divergence is very high: 119.8896 [2023-03-06 21:10:37,186][62424] KL-divergence is very high: 1465.6433 [2023-03-06 21:10:37,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12683.7). Total num frames: 9179136. Throughput: 0: 12690.6. Samples: 9174686. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:10:37,390][62145] Avg episode reward: [(0, '450.129')] [2023-03-06 21:10:37,592][62424] KL-divergence is very high: 146.8521 [2023-03-06 21:10:37,668][62424] KL-divergence is very high: 2373.6963 [2023-03-06 21:10:37,751][62424] KL-divergence is very high: 124.8785 [2023-03-06 21:10:37,836][62424] KL-divergence is very high: 19256.1504 [2023-03-06 21:10:37,843][62475] Updated weights for policy 0, policy_version 8970 (0.0006) [2023-03-06 21:10:38,076][62424] KL-divergence is very high: 1900.1702 [2023-03-06 21:10:38,234][62424] KL-divergence is very high: 143.0846 [2023-03-06 21:10:38,646][62475] Updated weights for policy 0, policy_version 8980 (0.0006) [2023-03-06 21:10:39,459][62475] Updated weights for policy 0, policy_version 8990 (0.0007) [2023-03-06 21:10:40,278][62475] Updated weights for policy 0, policy_version 9000 (0.0006) [2023-03-06 21:10:41,075][62475] Updated weights for policy 0, policy_version 9010 (0.0007) [2023-03-06 21:10:41,916][62475] Updated weights for policy 0, policy_version 9020 (0.0005) [2023-03-06 21:10:42,390][62145] Fps is (10 sec: 12595.3, 60 sec: 12680.5, 300 sec: 12680.2). Total num frames: 9241600. Throughput: 0: 12691.2. Samples: 9212791. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:10:42,390][62145] Avg episode reward: [(0, '408.346')] [2023-03-06 21:10:42,703][62475] Updated weights for policy 0, policy_version 9030 (0.0006) [2023-03-06 21:10:43,499][62475] Updated weights for policy 0, policy_version 9040 (0.0006) [2023-03-06 21:10:44,011][62424] KL-divergence is very high: 19757.3828 [2023-03-06 21:10:44,332][62475] Updated weights for policy 0, policy_version 9050 (0.0007) [2023-03-06 21:10:45,147][62475] Updated weights for policy 0, policy_version 9060 (0.0006) [2023-03-06 21:10:45,934][62475] Updated weights for policy 0, policy_version 9070 (0.0006) [2023-03-06 21:10:46,757][62475] Updated weights for policy 0, policy_version 9080 (0.0006) [2023-03-06 21:10:47,389][62145] Fps is (10 sec: 12595.3, 60 sec: 12680.6, 300 sec: 12676.8). Total num frames: 9305088. Throughput: 0: 12689.6. Samples: 9288678. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:10:47,390][62145] Avg episode reward: [(0, '496.598')] [2023-03-06 21:10:47,554][62475] Updated weights for policy 0, policy_version 9090 (0.0006) [2023-03-06 21:10:48,362][62475] Updated weights for policy 0, policy_version 9100 (0.0007) [2023-03-06 21:10:49,074][62424] KL-divergence is very high: 467.5533 [2023-03-06 21:10:49,163][62475] Updated weights for policy 0, policy_version 9110 (0.0006) [2023-03-06 21:10:49,963][62475] Updated weights for policy 0, policy_version 9120 (0.0006) [2023-03-06 21:10:50,769][62475] Updated weights for policy 0, policy_version 9130 (0.0006) [2023-03-06 21:10:51,011][62424] KL-divergence is very high: 154.2043 [2023-03-06 21:10:51,250][62424] KL-divergence is very high: 8810.2324 [2023-03-06 21:10:51,585][62475] Updated weights for policy 0, policy_version 9140 (0.0006) [2023-03-06 21:10:52,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12676.8). Total num frames: 9368576. Throughput: 0: 12686.2. Samples: 9364704. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:10:52,390][62145] Avg episode reward: [(0, '439.722')] [2023-03-06 21:10:52,404][62475] Updated weights for policy 0, policy_version 9150 (0.0006) [2023-03-06 21:10:53,208][62475] Updated weights for policy 0, policy_version 9160 (0.0006) [2023-03-06 21:10:53,990][62475] Updated weights for policy 0, policy_version 9170 (0.0006) [2023-03-06 21:10:54,811][62475] Updated weights for policy 0, policy_version 9180 (0.0006) [2023-03-06 21:10:55,598][62475] Updated weights for policy 0, policy_version 9190 (0.0006) [2023-03-06 21:10:56,413][62475] Updated weights for policy 0, policy_version 9200 (0.0006) [2023-03-06 21:10:57,217][62475] Updated weights for policy 0, policy_version 9210 (0.0007) [2023-03-06 21:10:57,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12697.6, 300 sec: 12680.2). Total num frames: 9433088. Throughput: 0: 12683.7. Samples: 9402818. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:10:57,390][62145] Avg episode reward: [(0, '468.850')] [2023-03-06 21:10:58,020][62475] Updated weights for policy 0, policy_version 9220 (0.0006) [2023-03-06 21:10:58,842][62475] Updated weights for policy 0, policy_version 9230 (0.0006) [2023-03-06 21:10:59,648][62475] Updated weights for policy 0, policy_version 9240 (0.0006) [2023-03-06 21:11:00,446][62475] Updated weights for policy 0, policy_version 9250 (0.0006) [2023-03-06 21:11:01,242][62475] Updated weights for policy 0, policy_version 9260 (0.0006) [2023-03-06 21:11:02,063][62475] Updated weights for policy 0, policy_version 9270 (0.0006) [2023-03-06 21:11:02,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12697.6, 300 sec: 12676.8). Total num frames: 9496576. Throughput: 0: 12685.0. Samples: 9479309. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:11:02,390][62145] Avg episode reward: [(0, '459.961')] [2023-03-06 21:11:02,846][62475] Updated weights for policy 0, policy_version 9280 (0.0006) [2023-03-06 21:11:03,663][62475] Updated weights for policy 0, policy_version 9290 (0.0006) [2023-03-06 21:11:04,465][62475] Updated weights for policy 0, policy_version 9300 (0.0007) [2023-03-06 21:11:05,266][62475] Updated weights for policy 0, policy_version 9310 (0.0006) [2023-03-06 21:11:06,075][62475] Updated weights for policy 0, policy_version 9320 (0.0006) [2023-03-06 21:11:06,883][62475] Updated weights for policy 0, policy_version 9330 (0.0007) [2023-03-06 21:11:07,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12676.8). Total num frames: 9560064. Throughput: 0: 12688.5. Samples: 9555566. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:11:07,390][62145] Avg episode reward: [(0, '478.714')] [2023-03-06 21:11:07,701][62475] Updated weights for policy 0, policy_version 9340 (0.0006) [2023-03-06 21:11:08,494][62475] Updated weights for policy 0, policy_version 9350 (0.0006) [2023-03-06 21:11:09,300][62475] Updated weights for policy 0, policy_version 9360 (0.0006) [2023-03-06 21:11:10,126][62475] Updated weights for policy 0, policy_version 9370 (0.0007) [2023-03-06 21:11:10,909][62475] Updated weights for policy 0, policy_version 9380 (0.0007) [2023-03-06 21:11:11,725][62475] Updated weights for policy 0, policy_version 9390 (0.0006) [2023-03-06 21:11:12,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12676.8). Total num frames: 9623552. Throughput: 0: 12682.1. Samples: 9593384. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:11:12,390][62145] Avg episode reward: [(0, '416.319')] [2023-03-06 21:11:12,539][62475] Updated weights for policy 0, policy_version 9400 (0.0006) [2023-03-06 21:11:12,794][62424] KL-divergence is very high: 2916.9236 [2023-03-06 21:11:13,350][62475] Updated weights for policy 0, policy_version 9410 (0.0006) [2023-03-06 21:11:14,161][62475] Updated weights for policy 0, policy_version 9420 (0.0006) [2023-03-06 21:11:14,962][62475] Updated weights for policy 0, policy_version 9430 (0.0006) [2023-03-06 21:11:15,780][62475] Updated weights for policy 0, policy_version 9440 (0.0006) [2023-03-06 21:11:16,503][62424] KL-divergence is very high: 223.6458 [2023-03-06 21:11:16,603][62475] Updated weights for policy 0, policy_version 9450 (0.0006) [2023-03-06 21:11:17,389][62145] Fps is (10 sec: 12595.2, 60 sec: 12680.5, 300 sec: 12669.8). Total num frames: 9686016. Throughput: 0: 12684.5. Samples: 9669442. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:11:17,390][62145] Avg episode reward: [(0, '355.303')] [2023-03-06 21:11:17,406][62475] Updated weights for policy 0, policy_version 9460 (0.0006) [2023-03-06 21:11:18,209][62475] Updated weights for policy 0, policy_version 9470 (0.0006) [2023-03-06 21:11:19,008][62475] Updated weights for policy 0, policy_version 9480 (0.0006) [2023-03-06 21:11:19,846][62475] Updated weights for policy 0, policy_version 9490 (0.0006) [2023-03-06 21:11:20,643][62475] Updated weights for policy 0, policy_version 9500 (0.0006) [2023-03-06 21:11:21,441][62475] Updated weights for policy 0, policy_version 9510 (0.0006) [2023-03-06 21:11:22,255][62475] Updated weights for policy 0, policy_version 9520 (0.0006) [2023-03-06 21:11:22,390][62145] Fps is (10 sec: 12595.1, 60 sec: 12680.5, 300 sec: 12669.8). Total num frames: 9749504. Throughput: 0: 12681.4. Samples: 9745348. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:11:22,390][62145] Avg episode reward: [(0, '462.150')] [2023-03-06 21:11:23,060][62475] Updated weights for policy 0, policy_version 9530 (0.0006) [2023-03-06 21:11:23,835][62475] Updated weights for policy 0, policy_version 9540 (0.0006) [2023-03-06 21:11:24,646][62475] Updated weights for policy 0, policy_version 9550 (0.0006) [2023-03-06 21:11:25,458][62475] Updated weights for policy 0, policy_version 9560 (0.0007) [2023-03-06 21:11:25,922][62424] KL-divergence is very high: 14854.9238 [2023-03-06 21:11:26,243][62475] Updated weights for policy 0, policy_version 9570 (0.0006) [2023-03-06 21:11:27,073][62475] Updated weights for policy 0, policy_version 9580 (0.0006) [2023-03-06 21:11:27,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12697.6, 300 sec: 12669.8). Total num frames: 9814016. Throughput: 0: 12683.8. Samples: 9783559. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:11:27,390][62145] Avg episode reward: [(0, '378.853')] [2023-03-06 21:11:27,863][62475] Updated weights for policy 0, policy_version 9590 (0.0006) [2023-03-06 21:11:28,693][62475] Updated weights for policy 0, policy_version 9600 (0.0007) [2023-03-06 21:11:29,485][62475] Updated weights for policy 0, policy_version 9610 (0.0006) [2023-03-06 21:11:30,294][62475] Updated weights for policy 0, policy_version 9620 (0.0006) [2023-03-06 21:11:31,096][62475] Updated weights for policy 0, policy_version 9630 (0.0006) [2023-03-06 21:11:31,912][62475] Updated weights for policy 0, policy_version 9640 (0.0006) [2023-03-06 21:11:32,390][62145] Fps is (10 sec: 12800.1, 60 sec: 12697.6, 300 sec: 12669.8). Total num frames: 9877504. Throughput: 0: 12696.3. Samples: 9860011. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:11:32,390][62145] Avg episode reward: [(0, '419.924')] [2023-03-06 21:11:32,726][62475] Updated weights for policy 0, policy_version 9650 (0.0007) [2023-03-06 21:11:33,524][62424] KL-divergence is very high: 3898.5874 [2023-03-06 21:11:33,532][62475] Updated weights for policy 0, policy_version 9660 (0.0006) [2023-03-06 21:11:34,312][62475] Updated weights for policy 0, policy_version 9670 (0.0007) [2023-03-06 21:11:35,126][62475] Updated weights for policy 0, policy_version 9680 (0.0006) [2023-03-06 21:11:35,929][62475] Updated weights for policy 0, policy_version 9690 (0.0006) [2023-03-06 21:11:36,741][62475] Updated weights for policy 0, policy_version 9700 (0.0006) [2023-03-06 21:11:37,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12669.8). Total num frames: 9940992. Throughput: 0: 12703.1. Samples: 9936344. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:11:37,390][62145] Avg episode reward: [(0, '422.071')] [2023-03-06 21:11:37,557][62475] Updated weights for policy 0, policy_version 9710 (0.0006) [2023-03-06 21:11:38,350][62475] Updated weights for policy 0, policy_version 9720 (0.0006) [2023-03-06 21:11:39,145][62475] Updated weights for policy 0, policy_version 9730 (0.0006) [2023-03-06 21:11:39,956][62475] Updated weights for policy 0, policy_version 9740 (0.0006) [2023-03-06 21:11:40,753][62475] Updated weights for policy 0, policy_version 9750 (0.0006) [2023-03-06 21:11:41,565][62475] Updated weights for policy 0, policy_version 9760 (0.0006) [2023-03-06 21:11:42,365][62475] Updated weights for policy 0, policy_version 9770 (0.0005) [2023-03-06 21:11:42,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12669.8). Total num frames: 10004480. Throughput: 0: 12705.8. Samples: 9974579. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:11:42,390][62145] Avg episode reward: [(0, '402.596')] [2023-03-06 21:11:43,170][62475] Updated weights for policy 0, policy_version 9780 (0.0006) [2023-03-06 21:11:43,957][62475] Updated weights for policy 0, policy_version 9790 (0.0006) [2023-03-06 21:11:44,266][62424] KL-divergence is very high: 5562.7275 [2023-03-06 21:11:44,441][62424] KL-divergence is very high: 5884.0942 [2023-03-06 21:11:44,621][62424] KL-divergence is very high: 6190.7622 [2023-03-06 21:11:44,793][62475] Updated weights for policy 0, policy_version 9800 (0.0006) [2023-03-06 21:11:44,869][62424] KL-divergence is very high: 11786.7637 [2023-03-06 21:11:45,571][62475] Updated weights for policy 0, policy_version 9810 (0.0006) [2023-03-06 21:11:46,384][62475] Updated weights for policy 0, policy_version 9820 (0.0006) [2023-03-06 21:11:47,193][62475] Updated weights for policy 0, policy_version 9830 (0.0007) [2023-03-06 21:11:47,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12669.8). Total num frames: 10067968. Throughput: 0: 12704.0. Samples: 10050989. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:11:47,390][62145] Avg episode reward: [(0, '482.444')] [2023-03-06 21:11:48,008][62475] Updated weights for policy 0, policy_version 9840 (0.0007) [2023-03-06 21:11:48,808][62475] Updated weights for policy 0, policy_version 9850 (0.0006) [2023-03-06 21:11:49,623][62475] Updated weights for policy 0, policy_version 9860 (0.0007) [2023-03-06 21:11:50,430][62475] Updated weights for policy 0, policy_version 9870 (0.0006) [2023-03-06 21:11:51,223][62475] Updated weights for policy 0, policy_version 9880 (0.0007) [2023-03-06 21:11:52,027][62475] Updated weights for policy 0, policy_version 9890 (0.0006) [2023-03-06 21:11:52,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12669.8). Total num frames: 10131456. Throughput: 0: 12703.6. Samples: 10127227. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:11:52,390][62145] Avg episode reward: [(0, '362.397')] [2023-03-06 21:11:52,852][62475] Updated weights for policy 0, policy_version 9900 (0.0007) [2023-03-06 21:11:53,643][62475] Updated weights for policy 0, policy_version 9910 (0.0006) [2023-03-06 21:11:54,371][62424] KL-divergence is very high: 221618.4688 [2023-03-06 21:11:54,456][62475] Updated weights for policy 0, policy_version 9920 (0.0006) [2023-03-06 21:11:55,233][62475] Updated weights for policy 0, policy_version 9930 (0.0006) [2023-03-06 21:11:56,039][62475] Updated weights for policy 0, policy_version 9940 (0.0006) [2023-03-06 21:11:56,870][62475] Updated weights for policy 0, policy_version 9950 (0.0006) [2023-03-06 21:11:57,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12669.8). Total num frames: 10194944. Throughput: 0: 12710.9. Samples: 10165374. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:11:57,390][62145] Avg episode reward: [(0, '307.520')] [2023-03-06 21:11:57,661][62475] Updated weights for policy 0, policy_version 9960 (0.0006) [2023-03-06 21:11:58,464][62475] Updated weights for policy 0, policy_version 9970 (0.0006) [2023-03-06 21:11:59,242][62475] Updated weights for policy 0, policy_version 9980 (0.0007) [2023-03-06 21:12:00,042][62475] Updated weights for policy 0, policy_version 9990 (0.0006) [2023-03-06 21:12:00,864][62475] Updated weights for policy 0, policy_version 10000 (0.0007) [2023-03-06 21:12:01,679][62475] Updated weights for policy 0, policy_version 10010 (0.0006) [2023-03-06 21:12:02,390][62145] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12673.3). Total num frames: 10258432. Throughput: 0: 12719.5. Samples: 10241818. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:12:02,390][62145] Avg episode reward: [(0, '371.849')] [2023-03-06 21:12:02,497][62475] Updated weights for policy 0, policy_version 10020 (0.0006) [2023-03-06 21:12:03,293][62475] Updated weights for policy 0, policy_version 10030 (0.0007) [2023-03-06 21:12:04,093][62475] Updated weights for policy 0, policy_version 10040 (0.0006) [2023-03-06 21:12:04,899][62475] Updated weights for policy 0, policy_version 10050 (0.0006) [2023-03-06 21:12:05,707][62475] Updated weights for policy 0, policy_version 10060 (0.0006) [2023-03-06 21:12:06,512][62475] Updated weights for policy 0, policy_version 10070 (0.0007) [2023-03-06 21:12:07,217][62424] KL-divergence is very high: 2435.9790 [2023-03-06 21:12:07,321][62475] Updated weights for policy 0, policy_version 10080 (0.0007) [2023-03-06 21:12:07,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12714.7, 300 sec: 12676.8). Total num frames: 10322944. Throughput: 0: 12727.0. Samples: 10318062. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:12:07,390][62145] Avg episode reward: [(0, '397.461')] [2023-03-06 21:12:08,115][62475] Updated weights for policy 0, policy_version 10090 (0.0006) [2023-03-06 21:12:08,904][62475] Updated weights for policy 0, policy_version 10100 (0.0007) [2023-03-06 21:12:09,719][62475] Updated weights for policy 0, policy_version 10110 (0.0007) [2023-03-06 21:12:10,541][62475] Updated weights for policy 0, policy_version 10120 (0.0006) [2023-03-06 21:12:11,336][62475] Updated weights for policy 0, policy_version 10130 (0.0007) [2023-03-06 21:12:12,141][62475] Updated weights for policy 0, policy_version 10140 (0.0007) [2023-03-06 21:12:12,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12714.7, 300 sec: 12676.8). Total num frames: 10386432. Throughput: 0: 12726.5. Samples: 10356254. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:12:12,390][62145] Avg episode reward: [(0, '402.700')] [2023-03-06 21:12:12,965][62475] Updated weights for policy 0, policy_version 10150 (0.0006) [2023-03-06 21:12:13,759][62475] Updated weights for policy 0, policy_version 10160 (0.0007) [2023-03-06 21:12:14,561][62475] Updated weights for policy 0, policy_version 10170 (0.0007) [2023-03-06 21:12:15,364][62475] Updated weights for policy 0, policy_version 10180 (0.0006) [2023-03-06 21:12:16,180][62475] Updated weights for policy 0, policy_version 10190 (0.0006) [2023-03-06 21:12:16,976][62475] Updated weights for policy 0, policy_version 10200 (0.0006) [2023-03-06 21:12:17,389][62145] Fps is (10 sec: 12595.2, 60 sec: 12714.7, 300 sec: 12673.3). Total num frames: 10448896. Throughput: 0: 12722.7. Samples: 10432534. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:12:17,390][62145] Avg episode reward: [(0, '432.401')] [2023-03-06 21:12:17,784][62475] Updated weights for policy 0, policy_version 10210 (0.0006) [2023-03-06 21:12:18,604][62475] Updated weights for policy 0, policy_version 10220 (0.0006) [2023-03-06 21:12:19,391][62475] Updated weights for policy 0, policy_version 10230 (0.0006) [2023-03-06 21:12:20,204][62475] Updated weights for policy 0, policy_version 10240 (0.0006) [2023-03-06 21:12:21,020][62475] Updated weights for policy 0, policy_version 10250 (0.0007) [2023-03-06 21:12:21,817][62475] Updated weights for policy 0, policy_version 10260 (0.0006) [2023-03-06 21:12:22,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12676.8). Total num frames: 10513408. Throughput: 0: 12720.5. Samples: 10508765. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:12:22,390][62145] Avg episode reward: [(0, '521.388')] [2023-03-06 21:12:22,395][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000010267_10513408.pth... [2023-03-06 21:12:22,427][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000007294_7469056.pth [2023-03-06 21:12:22,621][62475] Updated weights for policy 0, policy_version 10270 (0.0007) [2023-03-06 21:12:23,425][62475] Updated weights for policy 0, policy_version 10280 (0.0006) [2023-03-06 21:12:24,242][62475] Updated weights for policy 0, policy_version 10290 (0.0006) [2023-03-06 21:12:25,041][62475] Updated weights for policy 0, policy_version 10300 (0.0006) [2023-03-06 21:12:25,829][62475] Updated weights for policy 0, policy_version 10310 (0.0006) [2023-03-06 21:12:26,638][62475] Updated weights for policy 0, policy_version 10320 (0.0007) [2023-03-06 21:12:27,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12714.7, 300 sec: 12676.8). Total num frames: 10576896. Throughput: 0: 12718.4. Samples: 10546909. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:12:27,390][62145] Avg episode reward: [(0, '493.082')] [2023-03-06 21:12:27,452][62475] Updated weights for policy 0, policy_version 10330 (0.0006) [2023-03-06 21:12:28,265][62475] Updated weights for policy 0, policy_version 10340 (0.0007) [2023-03-06 21:12:29,062][62475] Updated weights for policy 0, policy_version 10350 (0.0007) [2023-03-06 21:12:29,873][62475] Updated weights for policy 0, policy_version 10360 (0.0007) [2023-03-06 21:12:30,682][62475] Updated weights for policy 0, policy_version 10370 (0.0008) [2023-03-06 21:12:31,484][62475] Updated weights for policy 0, policy_version 10380 (0.0007) [2023-03-06 21:12:32,291][62475] Updated weights for policy 0, policy_version 10390 (0.0005) [2023-03-06 21:12:32,389][62145] Fps is (10 sec: 12697.8, 60 sec: 12714.7, 300 sec: 12680.3). Total num frames: 10640384. Throughput: 0: 12712.8. Samples: 10623062. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:12:32,390][62145] Avg episode reward: [(0, '550.745')] [2023-03-06 21:12:33,096][62475] Updated weights for policy 0, policy_version 10400 (0.0006) [2023-03-06 21:12:33,872][62475] Updated weights for policy 0, policy_version 10410 (0.0007) [2023-03-06 21:12:34,674][62475] Updated weights for policy 0, policy_version 10420 (0.0006) [2023-03-06 21:12:35,481][62475] Updated weights for policy 0, policy_version 10430 (0.0006) [2023-03-06 21:12:36,285][62475] Updated weights for policy 0, policy_version 10440 (0.0006) [2023-03-06 21:12:37,107][62475] Updated weights for policy 0, policy_version 10450 (0.0006) [2023-03-06 21:12:37,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12680.2). Total num frames: 10703872. Throughput: 0: 12720.8. Samples: 10699664. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:12:37,390][62145] Avg episode reward: [(0, '502.480')] [2023-03-06 21:12:37,914][62475] Updated weights for policy 0, policy_version 10460 (0.0006) [2023-03-06 21:12:38,712][62475] Updated weights for policy 0, policy_version 10470 (0.0006) [2023-03-06 21:12:39,502][62475] Updated weights for policy 0, policy_version 10480 (0.0007) [2023-03-06 21:12:40,307][62475] Updated weights for policy 0, policy_version 10490 (0.0007) [2023-03-06 21:12:41,115][62475] Updated weights for policy 0, policy_version 10500 (0.0006) [2023-03-06 21:12:41,912][62475] Updated weights for policy 0, policy_version 10510 (0.0006) [2023-03-06 21:12:42,390][62145] Fps is (10 sec: 12697.4, 60 sec: 12714.6, 300 sec: 12683.7). Total num frames: 10767360. Throughput: 0: 12723.6. Samples: 10737937. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:12:42,390][62145] Avg episode reward: [(0, '451.472')] [2023-03-06 21:12:42,711][62475] Updated weights for policy 0, policy_version 10520 (0.0007) [2023-03-06 21:12:43,514][62475] Updated weights for policy 0, policy_version 10530 (0.0005) [2023-03-06 21:12:44,325][62475] Updated weights for policy 0, policy_version 10540 (0.0006) [2023-03-06 21:12:45,117][62475] Updated weights for policy 0, policy_version 10550 (0.0007) [2023-03-06 21:12:45,924][62475] Updated weights for policy 0, policy_version 10560 (0.0006) [2023-03-06 21:12:46,747][62475] Updated weights for policy 0, policy_version 10570 (0.0006) [2023-03-06 21:12:47,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12683.7). Total num frames: 10830848. Throughput: 0: 12724.0. Samples: 10814397. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:12:47,390][62145] Avg episode reward: [(0, '483.573')] [2023-03-06 21:12:47,568][62475] Updated weights for policy 0, policy_version 10580 (0.0006) [2023-03-06 21:12:48,366][62475] Updated weights for policy 0, policy_version 10590 (0.0007) [2023-03-06 21:12:49,177][62475] Updated weights for policy 0, policy_version 10600 (0.0006) [2023-03-06 21:12:50,001][62475] Updated weights for policy 0, policy_version 10610 (0.0006) [2023-03-06 21:12:50,798][62475] Updated weights for policy 0, policy_version 10620 (0.0006) [2023-03-06 21:12:51,601][62475] Updated weights for policy 0, policy_version 10630 (0.0006) [2023-03-06 21:12:52,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12687.2). Total num frames: 10894336. Throughput: 0: 12716.6. Samples: 10890311. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 21:12:52,390][62145] Avg episode reward: [(0, '423.970')] [2023-03-06 21:12:52,410][62475] Updated weights for policy 0, policy_version 10640 (0.0006) [2023-03-06 21:12:53,222][62475] Updated weights for policy 0, policy_version 10650 (0.0006) [2023-03-06 21:12:54,046][62475] Updated weights for policy 0, policy_version 10660 (0.0006) [2023-03-06 21:12:54,849][62475] Updated weights for policy 0, policy_version 10670 (0.0006) [2023-03-06 21:12:55,644][62475] Updated weights for policy 0, policy_version 10680 (0.0007) [2023-03-06 21:12:56,451][62475] Updated weights for policy 0, policy_version 10690 (0.0006) [2023-03-06 21:12:57,253][62475] Updated weights for policy 0, policy_version 10700 (0.0006) [2023-03-06 21:12:57,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12683.7). Total num frames: 10957824. Throughput: 0: 12711.1. Samples: 10928256. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:12:57,390][62145] Avg episode reward: [(0, '396.897')] [2023-03-06 21:12:58,052][62475] Updated weights for policy 0, policy_version 10710 (0.0006) [2023-03-06 21:12:58,880][62475] Updated weights for policy 0, policy_version 10720 (0.0006) [2023-03-06 21:12:59,686][62475] Updated weights for policy 0, policy_version 10730 (0.0006) [2023-03-06 21:13:00,480][62475] Updated weights for policy 0, policy_version 10740 (0.0006) [2023-03-06 21:13:01,298][62475] Updated weights for policy 0, policy_version 10750 (0.0006) [2023-03-06 21:13:02,111][62475] Updated weights for policy 0, policy_version 10760 (0.0007) [2023-03-06 21:13:02,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12687.2). Total num frames: 11021312. Throughput: 0: 12706.5. Samples: 11004328. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:13:02,390][62145] Avg episode reward: [(0, '443.731')] [2023-03-06 21:13:02,921][62475] Updated weights for policy 0, policy_version 10770 (0.0006) [2023-03-06 21:13:03,728][62475] Updated weights for policy 0, policy_version 10780 (0.0006) [2023-03-06 21:13:04,537][62475] Updated weights for policy 0, policy_version 10790 (0.0006) [2023-03-06 21:13:05,321][62475] Updated weights for policy 0, policy_version 10800 (0.0006) [2023-03-06 21:13:06,129][62475] Updated weights for policy 0, policy_version 10810 (0.0006) [2023-03-06 21:13:06,917][62475] Updated weights for policy 0, policy_version 10820 (0.0006) [2023-03-06 21:13:07,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12687.2). Total num frames: 11084800. Throughput: 0: 12708.9. Samples: 11080662. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:13:07,390][62145] Avg episode reward: [(0, '351.075')] [2023-03-06 21:13:07,734][62475] Updated weights for policy 0, policy_version 10830 (0.0006) [2023-03-06 21:13:08,533][62475] Updated weights for policy 0, policy_version 10840 (0.0005) [2023-03-06 21:13:09,337][62475] Updated weights for policy 0, policy_version 10850 (0.0006) [2023-03-06 21:13:10,131][62475] Updated weights for policy 0, policy_version 10860 (0.0006) [2023-03-06 21:13:10,956][62475] Updated weights for policy 0, policy_version 10870 (0.0006) [2023-03-06 21:13:11,757][62475] Updated weights for policy 0, policy_version 10880 (0.0006) [2023-03-06 21:13:12,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12714.7, 300 sec: 12690.7). Total num frames: 11149312. Throughput: 0: 12711.8. Samples: 11118938. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:13:12,390][62145] Avg episode reward: [(0, '423.382')] [2023-03-06 21:13:12,532][62475] Updated weights for policy 0, policy_version 10890 (0.0006) [2023-03-06 21:13:13,357][62475] Updated weights for policy 0, policy_version 10900 (0.0007) [2023-03-06 21:13:14,159][62475] Updated weights for policy 0, policy_version 10910 (0.0006) [2023-03-06 21:13:14,960][62475] Updated weights for policy 0, policy_version 10920 (0.0006) [2023-03-06 21:13:15,780][62475] Updated weights for policy 0, policy_version 10930 (0.0006) [2023-03-06 21:13:16,588][62475] Updated weights for policy 0, policy_version 10940 (0.0007) [2023-03-06 21:13:17,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12687.2). Total num frames: 11211776. Throughput: 0: 12716.2. Samples: 11195290. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:13:17,390][62145] Avg episode reward: [(0, '420.377')] [2023-03-06 21:13:17,399][62475] Updated weights for policy 0, policy_version 10950 (0.0006) [2023-03-06 21:13:18,208][62475] Updated weights for policy 0, policy_version 10960 (0.0006) [2023-03-06 21:13:19,008][62475] Updated weights for policy 0, policy_version 10970 (0.0007) [2023-03-06 21:13:19,816][62475] Updated weights for policy 0, policy_version 10980 (0.0007) [2023-03-06 21:13:20,617][62475] Updated weights for policy 0, policy_version 10990 (0.0006) [2023-03-06 21:13:21,415][62475] Updated weights for policy 0, policy_version 11000 (0.0006) [2023-03-06 21:13:22,229][62475] Updated weights for policy 0, policy_version 11010 (0.0007) [2023-03-06 21:13:22,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12694.1). Total num frames: 11276288. Throughput: 0: 12706.8. Samples: 11271469. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:13:22,390][62145] Avg episode reward: [(0, '414.766')] [2023-03-06 21:13:23,022][62475] Updated weights for policy 0, policy_version 11020 (0.0006) [2023-03-06 21:13:23,830][62475] Updated weights for policy 0, policy_version 11030 (0.0007) [2023-03-06 21:13:24,628][62475] Updated weights for policy 0, policy_version 11040 (0.0007) [2023-03-06 21:13:25,440][62475] Updated weights for policy 0, policy_version 11050 (0.0006) [2023-03-06 21:13:26,234][62475] Updated weights for policy 0, policy_version 11060 (0.0007) [2023-03-06 21:13:27,036][62475] Updated weights for policy 0, policy_version 11070 (0.0006) [2023-03-06 21:13:27,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12714.7, 300 sec: 12694.1). Total num frames: 11339776. Throughput: 0: 12707.8. Samples: 11309786. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:13:27,390][62145] Avg episode reward: [(0, '444.082')] [2023-03-06 21:13:27,863][62475] Updated weights for policy 0, policy_version 11080 (0.0006) [2023-03-06 21:13:28,666][62475] Updated weights for policy 0, policy_version 11090 (0.0006) [2023-03-06 21:13:29,462][62475] Updated weights for policy 0, policy_version 11100 (0.0006) [2023-03-06 21:13:30,259][62475] Updated weights for policy 0, policy_version 11110 (0.0006) [2023-03-06 21:13:31,078][62475] Updated weights for policy 0, policy_version 11120 (0.0006) [2023-03-06 21:13:31,881][62475] Updated weights for policy 0, policy_version 11130 (0.0006) [2023-03-06 21:13:32,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.6, 300 sec: 12694.1). Total num frames: 11403264. Throughput: 0: 12704.8. Samples: 11386114. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:13:32,390][62145] Avg episode reward: [(0, '439.383')] [2023-03-06 21:13:32,674][62475] Updated weights for policy 0, policy_version 11140 (0.0005) [2023-03-06 21:13:33,467][62475] Updated weights for policy 0, policy_version 11150 (0.0006) [2023-03-06 21:13:34,270][62475] Updated weights for policy 0, policy_version 11160 (0.0006) [2023-03-06 21:13:35,060][62475] Updated weights for policy 0, policy_version 11170 (0.0006) [2023-03-06 21:13:35,858][62475] Updated weights for policy 0, policy_version 11180 (0.0006) [2023-03-06 21:13:36,678][62475] Updated weights for policy 0, policy_version 11190 (0.0007) [2023-03-06 21:13:37,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.6, 300 sec: 12697.6). Total num frames: 11466752. Throughput: 0: 12722.5. Samples: 11462825. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:13:37,390][62145] Avg episode reward: [(0, '436.953')] [2023-03-06 21:13:37,492][62475] Updated weights for policy 0, policy_version 11200 (0.0006) [2023-03-06 21:13:38,285][62475] Updated weights for policy 0, policy_version 11210 (0.0007) [2023-03-06 21:13:39,097][62475] Updated weights for policy 0, policy_version 11220 (0.0007) [2023-03-06 21:13:39,913][62475] Updated weights for policy 0, policy_version 11230 (0.0006) [2023-03-06 21:13:40,710][62475] Updated weights for policy 0, policy_version 11240 (0.0007) [2023-03-06 21:13:41,506][62475] Updated weights for policy 0, policy_version 11250 (0.0006) [2023-03-06 21:13:42,316][62475] Updated weights for policy 0, policy_version 11260 (0.0006) [2023-03-06 21:13:42,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12731.8, 300 sec: 12701.1). Total num frames: 11531264. Throughput: 0: 12722.8. Samples: 11500783. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:13:42,390][62145] Avg episode reward: [(0, '388.550')] [2023-03-06 21:13:43,115][62475] Updated weights for policy 0, policy_version 11270 (0.0006) [2023-03-06 21:13:43,899][62475] Updated weights for policy 0, policy_version 11280 (0.0006) [2023-03-06 21:13:44,730][62475] Updated weights for policy 0, policy_version 11290 (0.0007) [2023-03-06 21:13:45,515][62475] Updated weights for policy 0, policy_version 11300 (0.0007) [2023-03-06 21:13:46,311][62475] Updated weights for policy 0, policy_version 11310 (0.0007) [2023-03-06 21:13:47,129][62475] Updated weights for policy 0, policy_version 11320 (0.0006) [2023-03-06 21:13:47,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12701.1). Total num frames: 11593728. Throughput: 0: 12736.6. Samples: 11577474. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:13:47,390][62145] Avg episode reward: [(0, '459.049')] [2023-03-06 21:13:47,950][62475] Updated weights for policy 0, policy_version 11330 (0.0007) [2023-03-06 21:13:48,757][62475] Updated weights for policy 0, policy_version 11340 (0.0006) [2023-03-06 21:13:49,552][62475] Updated weights for policy 0, policy_version 11350 (0.0006) [2023-03-06 21:13:50,355][62475] Updated weights for policy 0, policy_version 11360 (0.0007) [2023-03-06 21:13:51,149][62475] Updated weights for policy 0, policy_version 11370 (0.0006) [2023-03-06 21:13:51,953][62475] Updated weights for policy 0, policy_version 11380 (0.0006) [2023-03-06 21:13:52,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12704.5). Total num frames: 11658240. Throughput: 0: 12734.9. Samples: 11653732. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:13:52,390][62145] Avg episode reward: [(0, '349.235')] [2023-03-06 21:13:52,746][62475] Updated weights for policy 0, policy_version 11390 (0.0006) [2023-03-06 21:13:53,574][62475] Updated weights for policy 0, policy_version 11400 (0.0006) [2023-03-06 21:13:54,367][62475] Updated weights for policy 0, policy_version 11410 (0.0006) [2023-03-06 21:13:55,157][62475] Updated weights for policy 0, policy_version 11420 (0.0007) [2023-03-06 21:13:55,991][62475] Updated weights for policy 0, policy_version 11430 (0.0006) [2023-03-06 21:13:56,782][62475] Updated weights for policy 0, policy_version 11440 (0.0006) [2023-03-06 21:13:57,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12731.7, 300 sec: 12704.5). Total num frames: 11721728. Throughput: 0: 12736.1. Samples: 11692064. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:13:57,401][62145] Avg episode reward: [(0, '388.231')] [2023-03-06 21:13:57,571][62475] Updated weights for policy 0, policy_version 11450 (0.0006) [2023-03-06 21:13:58,373][62475] Updated weights for policy 0, policy_version 11460 (0.0007) [2023-03-06 21:13:59,182][62475] Updated weights for policy 0, policy_version 11470 (0.0006) [2023-03-06 21:13:59,983][62475] Updated weights for policy 0, policy_version 11480 (0.0006) [2023-03-06 21:14:00,781][62475] Updated weights for policy 0, policy_version 11490 (0.0006) [2023-03-06 21:14:01,588][62475] Updated weights for policy 0, policy_version 11500 (0.0006) [2023-03-06 21:14:02,390][62145] Fps is (10 sec: 12697.4, 60 sec: 12731.7, 300 sec: 12704.5). Total num frames: 11785216. Throughput: 0: 12740.3. Samples: 11768607. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:14:02,397][62475] Updated weights for policy 0, policy_version 11510 (0.0007) [2023-03-06 21:14:02,401][62145] Avg episode reward: [(0, '388.237')] [2023-03-06 21:14:03,205][62475] Updated weights for policy 0, policy_version 11520 (0.0007) [2023-03-06 21:14:03,999][62475] Updated weights for policy 0, policy_version 11530 (0.0006) [2023-03-06 21:14:04,823][62475] Updated weights for policy 0, policy_version 11540 (0.0006) [2023-03-06 21:14:05,631][62475] Updated weights for policy 0, policy_version 11550 (0.0006) [2023-03-06 21:14:06,435][62475] Updated weights for policy 0, policy_version 11560 (0.0006) [2023-03-06 21:14:07,258][62475] Updated weights for policy 0, policy_version 11570 (0.0006) [2023-03-06 21:14:07,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12731.7, 300 sec: 12704.5). Total num frames: 11848704. Throughput: 0: 12737.6. Samples: 11844662. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:14:07,400][62145] Avg episode reward: [(0, '356.630')] [2023-03-06 21:14:08,045][62475] Updated weights for policy 0, policy_version 11580 (0.0006) [2023-03-06 21:14:08,846][62475] Updated weights for policy 0, policy_version 11590 (0.0006) [2023-03-06 21:14:09,661][62475] Updated weights for policy 0, policy_version 11600 (0.0008) [2023-03-06 21:14:10,447][62475] Updated weights for policy 0, policy_version 11610 (0.0007) [2023-03-06 21:14:11,273][62475] Updated weights for policy 0, policy_version 11620 (0.0006) [2023-03-06 21:14:12,076][62475] Updated weights for policy 0, policy_version 11630 (0.0006) [2023-03-06 21:14:12,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12704.5). Total num frames: 11912192. Throughput: 0: 12737.3. Samples: 11882965. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:14:12,401][62145] Avg episode reward: [(0, '408.089')] [2023-03-06 21:14:12,872][62475] Updated weights for policy 0, policy_version 11640 (0.0006) [2023-03-06 21:14:13,690][62475] Updated weights for policy 0, policy_version 11650 (0.0005) [2023-03-06 21:14:14,470][62475] Updated weights for policy 0, policy_version 11660 (0.0006) [2023-03-06 21:14:15,260][62475] Updated weights for policy 0, policy_version 11670 (0.0006) [2023-03-06 21:14:16,069][62475] Updated weights for policy 0, policy_version 11680 (0.0006) [2023-03-06 21:14:16,869][62475] Updated weights for policy 0, policy_version 11690 (0.0006) [2023-03-06 21:14:17,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12708.0). Total num frames: 11976704. Throughput: 0: 12742.0. Samples: 11959502. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:14:17,400][62145] Avg episode reward: [(0, '380.768')] [2023-03-06 21:14:17,677][62475] Updated weights for policy 0, policy_version 11700 (0.0006) [2023-03-06 21:14:18,501][62475] Updated weights for policy 0, policy_version 11710 (0.0006) [2023-03-06 21:14:19,293][62475] Updated weights for policy 0, policy_version 11720 (0.0006) [2023-03-06 21:14:20,091][62475] Updated weights for policy 0, policy_version 11730 (0.0006) [2023-03-06 21:14:20,896][62475] Updated weights for policy 0, policy_version 11740 (0.0007) [2023-03-06 21:14:21,706][62475] Updated weights for policy 0, policy_version 11750 (0.0006) [2023-03-06 21:14:22,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12731.7, 300 sec: 12708.0). Total num frames: 12040192. Throughput: 0: 12735.0. Samples: 12035899. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:14:22,401][62145] Avg episode reward: [(0, '333.208')] [2023-03-06 21:14:22,415][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000011759_12041216.pth... [2023-03-06 21:14:22,444][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000008778_8988672.pth [2023-03-06 21:14:22,501][62475] Updated weights for policy 0, policy_version 11760 (0.0006) [2023-03-06 21:14:23,302][62475] Updated weights for policy 0, policy_version 11770 (0.0006) [2023-03-06 21:14:24,111][62475] Updated weights for policy 0, policy_version 11780 (0.0006) [2023-03-06 21:14:24,920][62475] Updated weights for policy 0, policy_version 11790 (0.0006) [2023-03-06 21:14:25,721][62475] Updated weights for policy 0, policy_version 11800 (0.0006) [2023-03-06 21:14:26,534][62475] Updated weights for policy 0, policy_version 11810 (0.0006) [2023-03-06 21:14:27,317][62475] Updated weights for policy 0, policy_version 11820 (0.0006) [2023-03-06 21:14:27,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12708.0). Total num frames: 12103680. Throughput: 0: 12741.5. Samples: 12074149. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:14:27,401][62145] Avg episode reward: [(0, '317.334')] [2023-03-06 21:14:28,134][62475] Updated weights for policy 0, policy_version 11830 (0.0005) [2023-03-06 21:14:28,927][62475] Updated weights for policy 0, policy_version 11840 (0.0006) [2023-03-06 21:14:29,743][62475] Updated weights for policy 0, policy_version 11850 (0.0006) [2023-03-06 21:14:30,546][62475] Updated weights for policy 0, policy_version 11860 (0.0006) [2023-03-06 21:14:31,339][62475] Updated weights for policy 0, policy_version 11870 (0.0006) [2023-03-06 21:14:32,145][62475] Updated weights for policy 0, policy_version 11880 (0.0006) [2023-03-06 21:14:32,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12708.0). Total num frames: 12167168. Throughput: 0: 12736.3. Samples: 12150609. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:14:32,401][62145] Avg episode reward: [(0, '363.672')] [2023-03-06 21:14:32,930][62475] Updated weights for policy 0, policy_version 11890 (0.0006) [2023-03-06 21:14:33,739][62475] Updated weights for policy 0, policy_version 11900 (0.0007) [2023-03-06 21:14:34,523][62475] Updated weights for policy 0, policy_version 11910 (0.0006) [2023-03-06 21:14:35,311][62475] Updated weights for policy 0, policy_version 11920 (0.0006) [2023-03-06 21:14:36,126][62475] Updated weights for policy 0, policy_version 11930 (0.0006) [2023-03-06 21:14:36,941][62475] Updated weights for policy 0, policy_version 11940 (0.0007) [2023-03-06 21:14:37,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12748.8, 300 sec: 12715.0). Total num frames: 12231680. Throughput: 0: 12749.4. Samples: 12227456. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:14:37,401][62145] Avg episode reward: [(0, '389.723')] [2023-03-06 21:14:37,730][62475] Updated weights for policy 0, policy_version 11950 (0.0005) [2023-03-06 21:14:38,525][62475] Updated weights for policy 0, policy_version 11960 (0.0007) [2023-03-06 21:14:39,351][62475] Updated weights for policy 0, policy_version 11970 (0.0006) [2023-03-06 21:14:40,177][62475] Updated weights for policy 0, policy_version 11980 (0.0006) [2023-03-06 21:14:40,956][62475] Updated weights for policy 0, policy_version 11990 (0.0005) [2023-03-06 21:14:41,754][62475] Updated weights for policy 0, policy_version 12000 (0.0006) [2023-03-06 21:14:42,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12731.7, 300 sec: 12715.0). Total num frames: 12295168. Throughput: 0: 12745.9. Samples: 12265631. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:14:42,390][62145] Avg episode reward: [(0, '343.147')] [2023-03-06 21:14:42,561][62475] Updated weights for policy 0, policy_version 12010 (0.0006) [2023-03-06 21:14:43,379][62475] Updated weights for policy 0, policy_version 12020 (0.0006) [2023-03-06 21:14:44,184][62475] Updated weights for policy 0, policy_version 12030 (0.0006) [2023-03-06 21:14:44,979][62475] Updated weights for policy 0, policy_version 12040 (0.0007) [2023-03-06 21:14:45,782][62475] Updated weights for policy 0, policy_version 12050 (0.0006) [2023-03-06 21:14:46,586][62475] Updated weights for policy 0, policy_version 12060 (0.0006) [2023-03-06 21:14:47,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12748.8, 300 sec: 12715.0). Total num frames: 12358656. Throughput: 0: 12747.5. Samples: 12342243. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:14:47,390][62145] Avg episode reward: [(0, '378.298')] [2023-03-06 21:14:47,398][62475] Updated weights for policy 0, policy_version 12070 (0.0006) [2023-03-06 21:14:48,184][62475] Updated weights for policy 0, policy_version 12080 (0.0006) [2023-03-06 21:14:48,993][62475] Updated weights for policy 0, policy_version 12090 (0.0006) [2023-03-06 21:14:49,796][62475] Updated weights for policy 0, policy_version 12100 (0.0006) [2023-03-06 21:14:50,590][62475] Updated weights for policy 0, policy_version 12110 (0.0007) [2023-03-06 21:14:51,406][62475] Updated weights for policy 0, policy_version 12120 (0.0006) [2023-03-06 21:14:52,233][62475] Updated weights for policy 0, policy_version 12130 (0.0006) [2023-03-06 21:14:52,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12715.0). Total num frames: 12422144. Throughput: 0: 12755.0. Samples: 12418638. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:14:52,390][62145] Avg episode reward: [(0, '409.085')] [2023-03-06 21:14:53,026][62475] Updated weights for policy 0, policy_version 12140 (0.0007) [2023-03-06 21:14:53,827][62475] Updated weights for policy 0, policy_version 12150 (0.0006) [2023-03-06 21:14:54,619][62475] Updated weights for policy 0, policy_version 12160 (0.0007) [2023-03-06 21:14:55,417][62475] Updated weights for policy 0, policy_version 12170 (0.0006) [2023-03-06 21:14:56,210][62475] Updated weights for policy 0, policy_version 12180 (0.0007) [2023-03-06 21:14:57,028][62475] Updated weights for policy 0, policy_version 12190 (0.0006) [2023-03-06 21:14:57,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12748.8, 300 sec: 12718.4). Total num frames: 12486656. Throughput: 0: 12753.0. Samples: 12456851. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:14:57,390][62145] Avg episode reward: [(0, '347.804')] [2023-03-06 21:14:57,822][62475] Updated weights for policy 0, policy_version 12200 (0.0006) [2023-03-06 21:14:58,626][62475] Updated weights for policy 0, policy_version 12210 (0.0006) [2023-03-06 21:14:59,446][62475] Updated weights for policy 0, policy_version 12220 (0.0006) [2023-03-06 21:15:00,249][62475] Updated weights for policy 0, policy_version 12230 (0.0006) [2023-03-06 21:15:01,066][62475] Updated weights for policy 0, policy_version 12240 (0.0006) [2023-03-06 21:15:01,865][62475] Updated weights for policy 0, policy_version 12250 (0.0007) [2023-03-06 21:15:02,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12748.8, 300 sec: 12718.4). Total num frames: 12550144. Throughput: 0: 12746.9. Samples: 12533115. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:15:02,390][62145] Avg episode reward: [(0, '359.621')] [2023-03-06 21:15:02,670][62475] Updated weights for policy 0, policy_version 12260 (0.0006) [2023-03-06 21:15:03,484][62475] Updated weights for policy 0, policy_version 12270 (0.0006) [2023-03-06 21:15:04,315][62475] Updated weights for policy 0, policy_version 12280 (0.0007) [2023-03-06 21:15:05,123][62475] Updated weights for policy 0, policy_version 12290 (0.0006) [2023-03-06 21:15:05,917][62475] Updated weights for policy 0, policy_version 12300 (0.0006) [2023-03-06 21:15:06,720][62475] Updated weights for policy 0, policy_version 12310 (0.0006) [2023-03-06 21:15:07,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12748.8, 300 sec: 12718.4). Total num frames: 12613632. Throughput: 0: 12741.2. Samples: 12609253. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:15:07,390][62145] Avg episode reward: [(0, '429.625')] [2023-03-06 21:15:07,521][62475] Updated weights for policy 0, policy_version 12320 (0.0006) [2023-03-06 21:15:08,334][62475] Updated weights for policy 0, policy_version 12330 (0.0007) [2023-03-06 21:15:09,144][62475] Updated weights for policy 0, policy_version 12340 (0.0007) [2023-03-06 21:15:09,952][62475] Updated weights for policy 0, policy_version 12350 (0.0006) [2023-03-06 21:15:10,759][62475] Updated weights for policy 0, policy_version 12360 (0.0006) [2023-03-06 21:15:11,553][62475] Updated weights for policy 0, policy_version 12370 (0.0006) [2023-03-06 21:15:12,353][62475] Updated weights for policy 0, policy_version 12380 (0.0006) [2023-03-06 21:15:12,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12748.8, 300 sec: 12718.4). Total num frames: 12677120. Throughput: 0: 12733.6. Samples: 12647162. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:15:12,390][62145] Avg episode reward: [(0, '350.325')] [2023-03-06 21:15:13,158][62475] Updated weights for policy 0, policy_version 12390 (0.0006) [2023-03-06 21:15:13,969][62475] Updated weights for policy 0, policy_version 12400 (0.0007) [2023-03-06 21:15:14,785][62475] Updated weights for policy 0, policy_version 12410 (0.0006) [2023-03-06 21:15:15,572][62475] Updated weights for policy 0, policy_version 12420 (0.0006) [2023-03-06 21:15:16,379][62475] Updated weights for policy 0, policy_version 12430 (0.0007) [2023-03-06 21:15:17,182][62475] Updated weights for policy 0, policy_version 12440 (0.0007) [2023-03-06 21:15:17,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12718.4). Total num frames: 12740608. Throughput: 0: 12734.3. Samples: 12723650. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:15:17,390][62145] Avg episode reward: [(0, '413.124')] [2023-03-06 21:15:17,982][62475] Updated weights for policy 0, policy_version 12450 (0.0005) [2023-03-06 21:15:18,794][62475] Updated weights for policy 0, policy_version 12460 (0.0006) [2023-03-06 21:15:19,594][62475] Updated weights for policy 0, policy_version 12470 (0.0007) [2023-03-06 21:15:20,414][62475] Updated weights for policy 0, policy_version 12480 (0.0006) [2023-03-06 21:15:21,220][62475] Updated weights for policy 0, policy_version 12490 (0.0006) [2023-03-06 21:15:22,023][62475] Updated weights for policy 0, policy_version 12500 (0.0006) [2023-03-06 21:15:22,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12718.4). Total num frames: 12804096. Throughput: 0: 12720.5. Samples: 12799876. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:15:22,390][62145] Avg episode reward: [(0, '370.550')] [2023-03-06 21:15:22,829][62475] Updated weights for policy 0, policy_version 12510 (0.0006) [2023-03-06 21:15:23,626][62475] Updated weights for policy 0, policy_version 12520 (0.0006) [2023-03-06 21:15:24,417][62475] Updated weights for policy 0, policy_version 12530 (0.0007) [2023-03-06 21:15:25,221][62475] Updated weights for policy 0, policy_version 12540 (0.0007) [2023-03-06 21:15:26,030][62475] Updated weights for policy 0, policy_version 12550 (0.0007) [2023-03-06 21:15:26,083][62424] KL-divergence is very high: 2430.5396 [2023-03-06 21:15:26,841][62475] Updated weights for policy 0, policy_version 12560 (0.0006) [2023-03-06 21:15:27,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12718.4). Total num frames: 12867584. Throughput: 0: 12724.4. Samples: 12838231. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:15:27,390][62145] Avg episode reward: [(0, '334.226')] [2023-03-06 21:15:27,637][62475] Updated weights for policy 0, policy_version 12570 (0.0007) [2023-03-06 21:15:28,454][62475] Updated weights for policy 0, policy_version 12580 (0.0007) [2023-03-06 21:15:29,253][62475] Updated weights for policy 0, policy_version 12590 (0.0006) [2023-03-06 21:15:30,059][62475] Updated weights for policy 0, policy_version 12600 (0.0007) [2023-03-06 21:15:30,871][62475] Updated weights for policy 0, policy_version 12610 (0.0006) [2023-03-06 21:15:31,670][62475] Updated weights for policy 0, policy_version 12620 (0.0006) [2023-03-06 21:15:32,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12718.4). Total num frames: 12931072. Throughput: 0: 12717.3. Samples: 12914521. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:15:32,390][62145] Avg episode reward: [(0, '383.784')] [2023-03-06 21:15:32,485][62475] Updated weights for policy 0, policy_version 12630 (0.0006) [2023-03-06 21:15:33,287][62475] Updated weights for policy 0, policy_version 12640 (0.0006) [2023-03-06 21:15:34,091][62475] Updated weights for policy 0, policy_version 12650 (0.0006) [2023-03-06 21:15:34,917][62475] Updated weights for policy 0, policy_version 12660 (0.0007) [2023-03-06 21:15:35,721][62475] Updated weights for policy 0, policy_version 12670 (0.0006) [2023-03-06 21:15:36,550][62475] Updated weights for policy 0, policy_version 12680 (0.0006) [2023-03-06 21:15:37,357][62475] Updated weights for policy 0, policy_version 12690 (0.0006) [2023-03-06 21:15:37,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12721.9). Total num frames: 12994560. Throughput: 0: 12701.4. Samples: 12990201. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:15:37,390][62145] Avg episode reward: [(0, '393.900')] [2023-03-06 21:15:38,147][62475] Updated weights for policy 0, policy_version 12700 (0.0006) [2023-03-06 21:15:38,971][62475] Updated weights for policy 0, policy_version 12710 (0.0007) [2023-03-06 21:15:39,763][62475] Updated weights for policy 0, policy_version 12720 (0.0006) [2023-03-06 21:15:40,567][62475] Updated weights for policy 0, policy_version 12730 (0.0006) [2023-03-06 21:15:41,382][62475] Updated weights for policy 0, policy_version 12740 (0.0006) [2023-03-06 21:15:42,193][62475] Updated weights for policy 0, policy_version 12750 (0.0006) [2023-03-06 21:15:42,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12721.9). Total num frames: 13058048. Throughput: 0: 12699.9. Samples: 13028345. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:15:42,390][62145] Avg episode reward: [(0, '383.152')] [2023-03-06 21:15:43,010][62475] Updated weights for policy 0, policy_version 12760 (0.0006) [2023-03-06 21:15:43,822][62475] Updated weights for policy 0, policy_version 12770 (0.0006) [2023-03-06 21:15:44,623][62475] Updated weights for policy 0, policy_version 12780 (0.0007) [2023-03-06 21:15:45,410][62475] Updated weights for policy 0, policy_version 12790 (0.0007) [2023-03-06 21:15:46,218][62475] Updated weights for policy 0, policy_version 12800 (0.0007) [2023-03-06 21:15:47,027][62475] Updated weights for policy 0, policy_version 12810 (0.0006) [2023-03-06 21:15:47,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12721.9). Total num frames: 13121536. Throughput: 0: 12699.5. Samples: 13104592. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:15:47,390][62145] Avg episode reward: [(0, '323.134')] [2023-03-06 21:15:47,822][62475] Updated weights for policy 0, policy_version 12820 (0.0006) [2023-03-06 21:15:48,621][62475] Updated weights for policy 0, policy_version 12830 (0.0007) [2023-03-06 21:15:49,435][62475] Updated weights for policy 0, policy_version 12840 (0.0006) [2023-03-06 21:15:50,234][62475] Updated weights for policy 0, policy_version 12850 (0.0006) [2023-03-06 21:15:51,034][62475] Updated weights for policy 0, policy_version 12860 (0.0006) [2023-03-06 21:15:51,848][62475] Updated weights for policy 0, policy_version 12870 (0.0006) [2023-03-06 21:15:52,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12718.4). Total num frames: 13185024. Throughput: 0: 12705.2. Samples: 13180989. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:15:52,390][62145] Avg episode reward: [(0, '342.812')] [2023-03-06 21:15:52,630][62475] Updated weights for policy 0, policy_version 12880 (0.0006) [2023-03-06 21:15:53,455][62475] Updated weights for policy 0, policy_version 12890 (0.0006) [2023-03-06 21:15:54,283][62475] Updated weights for policy 0, policy_version 12900 (0.0008) [2023-03-06 21:15:55,057][62475] Updated weights for policy 0, policy_version 12910 (0.0007) [2023-03-06 21:15:55,853][62475] Updated weights for policy 0, policy_version 12920 (0.0006) [2023-03-06 21:15:56,674][62475] Updated weights for policy 0, policy_version 12930 (0.0006) [2023-03-06 21:15:57,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12714.7, 300 sec: 12721.9). Total num frames: 13249536. Throughput: 0: 12711.3. Samples: 13219172. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:15:57,390][62145] Avg episode reward: [(0, '375.136')] [2023-03-06 21:15:57,467][62475] Updated weights for policy 0, policy_version 12940 (0.0006) [2023-03-06 21:15:58,261][62475] Updated weights for policy 0, policy_version 12950 (0.0006) [2023-03-06 21:15:59,071][62475] Updated weights for policy 0, policy_version 12960 (0.0006) [2023-03-06 21:15:59,877][62475] Updated weights for policy 0, policy_version 12970 (0.0006) [2023-03-06 21:16:00,681][62475] Updated weights for policy 0, policy_version 12980 (0.0007) [2023-03-06 21:16:01,491][62475] Updated weights for policy 0, policy_version 12990 (0.0006) [2023-03-06 21:16:02,298][62475] Updated weights for policy 0, policy_version 13000 (0.0006) [2023-03-06 21:16:02,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12714.7, 300 sec: 12721.9). Total num frames: 13313024. Throughput: 0: 12712.4. Samples: 13295710. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:16:02,390][62145] Avg episode reward: [(0, '362.848')] [2023-03-06 21:16:03,103][62475] Updated weights for policy 0, policy_version 13010 (0.0006) [2023-03-06 21:16:03,911][62475] Updated weights for policy 0, policy_version 13020 (0.0007) [2023-03-06 21:16:04,714][62475] Updated weights for policy 0, policy_version 13030 (0.0006) [2023-03-06 21:16:05,522][62475] Updated weights for policy 0, policy_version 13040 (0.0006) [2023-03-06 21:16:06,319][62475] Updated weights for policy 0, policy_version 13050 (0.0007) [2023-03-06 21:16:07,133][62475] Updated weights for policy 0, policy_version 13060 (0.0007) [2023-03-06 21:16:07,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12721.9). Total num frames: 13376512. Throughput: 0: 12712.8. Samples: 13371953. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:16:07,390][62145] Avg episode reward: [(0, '404.063')] [2023-03-06 21:16:07,937][62475] Updated weights for policy 0, policy_version 13070 (0.0006) [2023-03-06 21:16:08,738][62475] Updated weights for policy 0, policy_version 13080 (0.0006) [2023-03-06 21:16:09,550][62475] Updated weights for policy 0, policy_version 13090 (0.0007) [2023-03-06 21:16:10,362][62475] Updated weights for policy 0, policy_version 13100 (0.0006) [2023-03-06 21:16:11,167][62475] Updated weights for policy 0, policy_version 13110 (0.0007) [2023-03-06 21:16:11,980][62475] Updated weights for policy 0, policy_version 13120 (0.0007) [2023-03-06 21:16:12,390][62145] Fps is (10 sec: 12595.2, 60 sec: 12697.6, 300 sec: 12721.9). Total num frames: 13438976. Throughput: 0: 12704.9. Samples: 13409950. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:16:12,390][62145] Avg episode reward: [(0, '386.203')] [2023-03-06 21:16:12,795][62475] Updated weights for policy 0, policy_version 13130 (0.0007) [2023-03-06 21:16:13,598][62475] Updated weights for policy 0, policy_version 13140 (0.0006) [2023-03-06 21:16:14,417][62475] Updated weights for policy 0, policy_version 13150 (0.0007) [2023-03-06 21:16:15,229][62475] Updated weights for policy 0, policy_version 13160 (0.0006) [2023-03-06 21:16:16,016][62475] Updated weights for policy 0, policy_version 13170 (0.0006) [2023-03-06 21:16:16,830][62475] Updated weights for policy 0, policy_version 13180 (0.0006) [2023-03-06 21:16:17,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.6, 300 sec: 12725.4). Total num frames: 13503488. Throughput: 0: 12695.8. Samples: 13485835. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-06 21:16:17,390][62145] Avg episode reward: [(0, '307.551')] [2023-03-06 21:16:17,638][62475] Updated weights for policy 0, policy_version 13190 (0.0006) [2023-03-06 21:16:18,438][62475] Updated weights for policy 0, policy_version 13200 (0.0006) [2023-03-06 21:16:19,255][62475] Updated weights for policy 0, policy_version 13210 (0.0006) [2023-03-06 21:16:20,059][62475] Updated weights for policy 0, policy_version 13220 (0.0006) [2023-03-06 21:16:20,840][62475] Updated weights for policy 0, policy_version 13230 (0.0006) [2023-03-06 21:16:21,665][62475] Updated weights for policy 0, policy_version 13240 (0.0006) [2023-03-06 21:16:22,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12714.7, 300 sec: 12721.9). Total num frames: 13566976. Throughput: 0: 12711.5. Samples: 13562219. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-06 21:16:22,390][62145] Avg episode reward: [(0, '317.574')] [2023-03-06 21:16:22,394][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000013249_13566976.pth... [2023-03-06 21:16:22,424][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000010267_10513408.pth [2023-03-06 21:16:22,479][62475] Updated weights for policy 0, policy_version 13250 (0.0007) [2023-03-06 21:16:23,274][62475] Updated weights for policy 0, policy_version 13260 (0.0007) [2023-03-06 21:16:24,086][62475] Updated weights for policy 0, policy_version 13270 (0.0006) [2023-03-06 21:16:24,889][62475] Updated weights for policy 0, policy_version 13280 (0.0007) [2023-03-06 21:16:25,689][62475] Updated weights for policy 0, policy_version 13290 (0.0007) [2023-03-06 21:16:26,489][62475] Updated weights for policy 0, policy_version 13300 (0.0006) [2023-03-06 21:16:27,282][62475] Updated weights for policy 0, policy_version 13310 (0.0006) [2023-03-06 21:16:27,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12721.9). Total num frames: 13630464. Throughput: 0: 12710.8. Samples: 13600330. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:16:27,390][62145] Avg episode reward: [(0, '325.581')] [2023-03-06 21:16:28,080][62475] Updated weights for policy 0, policy_version 13320 (0.0006) [2023-03-06 21:16:28,883][62475] Updated weights for policy 0, policy_version 13330 (0.0006) [2023-03-06 21:16:29,681][62475] Updated weights for policy 0, policy_version 13340 (0.0006) [2023-03-06 21:16:30,481][62475] Updated weights for policy 0, policy_version 13350 (0.0006) [2023-03-06 21:16:31,287][62475] Updated weights for policy 0, policy_version 13360 (0.0006) [2023-03-06 21:16:32,089][62475] Updated weights for policy 0, policy_version 13370 (0.0006) [2023-03-06 21:16:32,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12721.9). Total num frames: 13693952. Throughput: 0: 12723.1. Samples: 13677131. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:16:32,390][62145] Avg episode reward: [(0, '363.293')] [2023-03-06 21:16:32,886][62475] Updated weights for policy 0, policy_version 13380 (0.0006) [2023-03-06 21:16:33,695][62475] Updated weights for policy 0, policy_version 13390 (0.0006) [2023-03-06 21:16:34,498][62475] Updated weights for policy 0, policy_version 13400 (0.0006) [2023-03-06 21:16:35,321][62475] Updated weights for policy 0, policy_version 13410 (0.0006) [2023-03-06 21:16:36,109][62475] Updated weights for policy 0, policy_version 13420 (0.0006) [2023-03-06 21:16:36,922][62475] Updated weights for policy 0, policy_version 13430 (0.0006) [2023-03-06 21:16:37,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12721.9). Total num frames: 13757440. Throughput: 0: 12723.7. Samples: 13753555. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:16:37,390][62145] Avg episode reward: [(0, '377.337')] [2023-03-06 21:16:37,710][62475] Updated weights for policy 0, policy_version 13440 (0.0006) [2023-03-06 21:16:38,529][62475] Updated weights for policy 0, policy_version 13450 (0.0006) [2023-03-06 21:16:39,327][62475] Updated weights for policy 0, policy_version 13460 (0.0005) [2023-03-06 21:16:40,134][62475] Updated weights for policy 0, policy_version 13470 (0.0006) [2023-03-06 21:16:40,938][62475] Updated weights for policy 0, policy_version 13480 (0.0007) [2023-03-06 21:16:41,742][62475] Updated weights for policy 0, policy_version 13490 (0.0006) [2023-03-06 21:16:42,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 13821952. Throughput: 0: 12721.8. Samples: 13791652. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:16:42,390][62145] Avg episode reward: [(0, '359.393')] [2023-03-06 21:16:42,539][62475] Updated weights for policy 0, policy_version 13500 (0.0008) [2023-03-06 21:16:43,330][62475] Updated weights for policy 0, policy_version 13510 (0.0006) [2023-03-06 21:16:44,157][62475] Updated weights for policy 0, policy_version 13520 (0.0007) [2023-03-06 21:16:44,932][62475] Updated weights for policy 0, policy_version 13530 (0.0006) [2023-03-06 21:16:45,736][62475] Updated weights for policy 0, policy_version 13540 (0.0006) [2023-03-06 21:16:46,561][62475] Updated weights for policy 0, policy_version 13550 (0.0007) [2023-03-06 21:16:47,358][62475] Updated weights for policy 0, policy_version 13560 (0.0006) [2023-03-06 21:16:47,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 13885440. Throughput: 0: 12722.3. Samples: 13868214. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:16:47,390][62145] Avg episode reward: [(0, '325.386')] [2023-03-06 21:16:48,149][62475] Updated weights for policy 0, policy_version 13570 (0.0007) [2023-03-06 21:16:48,943][62475] Updated weights for policy 0, policy_version 13580 (0.0006) [2023-03-06 21:16:49,753][62475] Updated weights for policy 0, policy_version 13590 (0.0006) [2023-03-06 21:16:50,548][62475] Updated weights for policy 0, policy_version 13600 (0.0006) [2023-03-06 21:16:51,342][62475] Updated weights for policy 0, policy_version 13610 (0.0006) [2023-03-06 21:16:52,157][62475] Updated weights for policy 0, policy_version 13620 (0.0006) [2023-03-06 21:16:52,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12728.8). Total num frames: 13949952. Throughput: 0: 12735.5. Samples: 13945051. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:16:52,390][62145] Avg episode reward: [(0, '387.034')] [2023-03-06 21:16:52,959][62475] Updated weights for policy 0, policy_version 13630 (0.0006) [2023-03-06 21:16:53,762][62475] Updated weights for policy 0, policy_version 13640 (0.0006) [2023-03-06 21:16:54,571][62475] Updated weights for policy 0, policy_version 13650 (0.0007) [2023-03-06 21:16:55,371][62475] Updated weights for policy 0, policy_version 13660 (0.0006) [2023-03-06 21:16:56,175][62475] Updated weights for policy 0, policy_version 13670 (0.0006) [2023-03-06 21:16:56,986][62475] Updated weights for policy 0, policy_version 13680 (0.0006) [2023-03-06 21:16:57,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 14013440. Throughput: 0: 12741.1. Samples: 13983301. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:16:57,390][62145] Avg episode reward: [(0, '377.703')] [2023-03-06 21:16:57,793][62475] Updated weights for policy 0, policy_version 13690 (0.0006) [2023-03-06 21:16:58,602][62475] Updated weights for policy 0, policy_version 13700 (0.0006) [2023-03-06 21:16:59,403][62475] Updated weights for policy 0, policy_version 13710 (0.0007) [2023-03-06 21:17:00,214][62475] Updated weights for policy 0, policy_version 13720 (0.0007) [2023-03-06 21:17:01,024][62475] Updated weights for policy 0, policy_version 13730 (0.0006) [2023-03-06 21:17:01,826][62475] Updated weights for policy 0, policy_version 13740 (0.0006) [2023-03-06 21:17:02,390][62145] Fps is (10 sec: 12595.1, 60 sec: 12714.6, 300 sec: 12721.9). Total num frames: 14075904. Throughput: 0: 12747.7. Samples: 14059482. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:17:02,390][62145] Avg episode reward: [(0, '396.315')] [2023-03-06 21:17:02,646][62475] Updated weights for policy 0, policy_version 13750 (0.0006) [2023-03-06 21:17:03,446][62475] Updated weights for policy 0, policy_version 13760 (0.0007) [2023-03-06 21:17:04,258][62475] Updated weights for policy 0, policy_version 13770 (0.0006) [2023-03-06 21:17:05,056][62475] Updated weights for policy 0, policy_version 13780 (0.0006) [2023-03-06 21:17:05,854][62475] Updated weights for policy 0, policy_version 13790 (0.0006) [2023-03-06 21:17:06,662][62475] Updated weights for policy 0, policy_version 13800 (0.0006) [2023-03-06 21:17:07,390][62145] Fps is (10 sec: 12595.2, 60 sec: 12714.6, 300 sec: 12721.9). Total num frames: 14139392. Throughput: 0: 12741.2. Samples: 14135574. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:17:07,390][62145] Avg episode reward: [(0, '359.704')] [2023-03-06 21:17:07,466][62475] Updated weights for policy 0, policy_version 13810 (0.0007) [2023-03-06 21:17:08,279][62475] Updated weights for policy 0, policy_version 13820 (0.0006) [2023-03-06 21:17:09,077][62475] Updated weights for policy 0, policy_version 13830 (0.0006) [2023-03-06 21:17:09,891][62475] Updated weights for policy 0, policy_version 13840 (0.0006) [2023-03-06 21:17:10,691][62475] Updated weights for policy 0, policy_version 13850 (0.0006) [2023-03-06 21:17:11,483][62475] Updated weights for policy 0, policy_version 13860 (0.0007) [2023-03-06 21:17:12,314][62475] Updated weights for policy 0, policy_version 13870 (0.0006) [2023-03-06 21:17:12,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12748.8, 300 sec: 12728.8). Total num frames: 14203904. Throughput: 0: 12741.0. Samples: 14173676. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:17:12,390][62145] Avg episode reward: [(0, '338.700')] [2023-03-06 21:17:13,113][62475] Updated weights for policy 0, policy_version 13880 (0.0006) [2023-03-06 21:17:13,909][62475] Updated weights for policy 0, policy_version 13890 (0.0006) [2023-03-06 21:17:13,989][62424] KL-divergence is very high: 58236.7109 [2023-03-06 21:17:14,715][62475] Updated weights for policy 0, policy_version 13900 (0.0006) [2023-03-06 21:17:15,515][62475] Updated weights for policy 0, policy_version 13910 (0.0006) [2023-03-06 21:17:16,328][62475] Updated weights for policy 0, policy_version 13920 (0.0007) [2023-03-06 21:17:17,136][62475] Updated weights for policy 0, policy_version 13930 (0.0007) [2023-03-06 21:17:17,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12731.8, 300 sec: 12725.4). Total num frames: 14267392. Throughput: 0: 12733.1. Samples: 14250118. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:17:17,400][62145] Avg episode reward: [(0, '281.691')] [2023-03-06 21:17:17,941][62475] Updated weights for policy 0, policy_version 13940 (0.0007) [2023-03-06 21:17:18,757][62475] Updated weights for policy 0, policy_version 13950 (0.0007) [2023-03-06 21:17:19,558][62475] Updated weights for policy 0, policy_version 13960 (0.0006) [2023-03-06 21:17:20,347][62475] Updated weights for policy 0, policy_version 13970 (0.0006) [2023-03-06 21:17:21,171][62475] Updated weights for policy 0, policy_version 13980 (0.0007) [2023-03-06 21:17:21,957][62475] Updated weights for policy 0, policy_version 13990 (0.0006) [2023-03-06 21:17:22,390][62145] Fps is (10 sec: 12697.4, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 14330880. Throughput: 0: 12730.2. Samples: 14326414. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:17:22,401][62145] Avg episode reward: [(0, '288.886')] [2023-03-06 21:17:22,769][62475] Updated weights for policy 0, policy_version 14000 (0.0006) [2023-03-06 21:17:23,564][62475] Updated weights for policy 0, policy_version 14010 (0.0006) [2023-03-06 21:17:24,363][62475] Updated weights for policy 0, policy_version 14020 (0.0006) [2023-03-06 21:17:25,170][62475] Updated weights for policy 0, policy_version 14030 (0.0006) [2023-03-06 21:17:25,976][62475] Updated weights for policy 0, policy_version 14040 (0.0006) [2023-03-06 21:17:26,767][62475] Updated weights for policy 0, policy_version 14050 (0.0006) [2023-03-06 21:17:27,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 14394368. Throughput: 0: 12735.1. Samples: 14364732. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:17:27,390][62145] Avg episode reward: [(0, '317.871')] [2023-03-06 21:17:27,572][62475] Updated weights for policy 0, policy_version 14060 (0.0006) [2023-03-06 21:17:28,381][62475] Updated weights for policy 0, policy_version 14070 (0.0006) [2023-03-06 21:17:29,184][62475] Updated weights for policy 0, policy_version 14080 (0.0006) [2023-03-06 21:17:30,005][62475] Updated weights for policy 0, policy_version 14090 (0.0006) [2023-03-06 21:17:30,790][62475] Updated weights for policy 0, policy_version 14100 (0.0006) [2023-03-06 21:17:31,594][62475] Updated weights for policy 0, policy_version 14110 (0.0006) [2023-03-06 21:17:32,390][62145] Fps is (10 sec: 12697.7, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 14457856. Throughput: 0: 12733.4. Samples: 14441216. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:17:32,390][62145] Avg episode reward: [(0, '341.545')] [2023-03-06 21:17:32,421][62475] Updated weights for policy 0, policy_version 14120 (0.0006) [2023-03-06 21:17:33,205][62475] Updated weights for policy 0, policy_version 14130 (0.0006) [2023-03-06 21:17:33,996][62475] Updated weights for policy 0, policy_version 14140 (0.0006) [2023-03-06 21:17:34,821][62475] Updated weights for policy 0, policy_version 14150 (0.0006) [2023-03-06 21:17:35,621][62475] Updated weights for policy 0, policy_version 14160 (0.0006) [2023-03-06 21:17:36,435][62475] Updated weights for policy 0, policy_version 14170 (0.0006) [2023-03-06 21:17:37,235][62475] Updated weights for policy 0, policy_version 14180 (0.0006) [2023-03-06 21:17:37,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 14521344. Throughput: 0: 12719.5. Samples: 14517428. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:17:37,390][62145] Avg episode reward: [(0, '292.026')] [2023-03-06 21:17:38,055][62475] Updated weights for policy 0, policy_version 14190 (0.0006) [2023-03-06 21:17:38,852][62475] Updated weights for policy 0, policy_version 14200 (0.0006) [2023-03-06 21:17:39,655][62475] Updated weights for policy 0, policy_version 14210 (0.0006) [2023-03-06 21:17:40,455][62475] Updated weights for policy 0, policy_version 14220 (0.0006) [2023-03-06 21:17:41,253][62475] Updated weights for policy 0, policy_version 14230 (0.0006) [2023-03-06 21:17:42,064][62475] Updated weights for policy 0, policy_version 14240 (0.0006) [2023-03-06 21:17:42,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 14585856. Throughput: 0: 12717.3. Samples: 14555581. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:17:42,390][62145] Avg episode reward: [(0, '282.874')] [2023-03-06 21:17:42,854][62475] Updated weights for policy 0, policy_version 14250 (0.0006) [2023-03-06 21:17:43,678][62475] Updated weights for policy 0, policy_version 14260 (0.0006) [2023-03-06 21:17:44,476][62475] Updated weights for policy 0, policy_version 14270 (0.0008) [2023-03-06 21:17:45,278][62475] Updated weights for policy 0, policy_version 14280 (0.0007) [2023-03-06 21:17:46,082][62475] Updated weights for policy 0, policy_version 14290 (0.0006) [2023-03-06 21:17:46,902][62475] Updated weights for policy 0, policy_version 14300 (0.0006) [2023-03-06 21:17:47,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 14649344. Throughput: 0: 12721.3. Samples: 14631941. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:17:47,390][62145] Avg episode reward: [(0, '316.518')] [2023-03-06 21:17:47,711][62475] Updated weights for policy 0, policy_version 14310 (0.0006) [2023-03-06 21:17:48,509][62475] Updated weights for policy 0, policy_version 14320 (0.0006) [2023-03-06 21:17:49,317][62475] Updated weights for policy 0, policy_version 14330 (0.0006) [2023-03-06 21:17:50,122][62475] Updated weights for policy 0, policy_version 14340 (0.0006) [2023-03-06 21:17:50,917][62475] Updated weights for policy 0, policy_version 14350 (0.0007) [2023-03-06 21:17:51,725][62475] Updated weights for policy 0, policy_version 14360 (0.0006) [2023-03-06 21:17:52,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12728.8). Total num frames: 14712832. Throughput: 0: 12723.5. Samples: 14708128. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:17:52,390][62145] Avg episode reward: [(0, '330.271')] [2023-03-06 21:17:52,565][62475] Updated weights for policy 0, policy_version 14370 (0.0007) [2023-03-06 21:17:53,359][62475] Updated weights for policy 0, policy_version 14380 (0.0007) [2023-03-06 21:17:54,165][62475] Updated weights for policy 0, policy_version 14390 (0.0007) [2023-03-06 21:17:54,997][62475] Updated weights for policy 0, policy_version 14400 (0.0007) [2023-03-06 21:17:55,798][62475] Updated weights for policy 0, policy_version 14410 (0.0006) [2023-03-06 21:17:56,614][62475] Updated weights for policy 0, policy_version 14420 (0.0007) [2023-03-06 21:17:57,389][62145] Fps is (10 sec: 12595.3, 60 sec: 12697.6, 300 sec: 12725.4). Total num frames: 14775296. Throughput: 0: 12711.8. Samples: 14745709. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:17:57,390][62145] Avg episode reward: [(0, '394.023')] [2023-03-06 21:17:57,413][62475] Updated weights for policy 0, policy_version 14430 (0.0006) [2023-03-06 21:17:58,205][62475] Updated weights for policy 0, policy_version 14440 (0.0006) [2023-03-06 21:17:59,025][62475] Updated weights for policy 0, policy_version 14450 (0.0006) [2023-03-06 21:17:59,830][62475] Updated weights for policy 0, policy_version 14460 (0.0006) [2023-03-06 21:18:00,630][62475] Updated weights for policy 0, policy_version 14470 (0.0007) [2023-03-06 21:18:01,447][62475] Updated weights for policy 0, policy_version 14480 (0.0007) [2023-03-06 21:18:02,251][62475] Updated weights for policy 0, policy_version 14490 (0.0006) [2023-03-06 21:18:02,389][62145] Fps is (10 sec: 12595.1, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 14838784. Throughput: 0: 12707.2. Samples: 14821944. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:18:02,390][62145] Avg episode reward: [(0, '316.138')] [2023-03-06 21:18:03,046][62475] Updated weights for policy 0, policy_version 14500 (0.0006) [2023-03-06 21:18:03,848][62475] Updated weights for policy 0, policy_version 14510 (0.0006) [2023-03-06 21:18:04,639][62475] Updated weights for policy 0, policy_version 14520 (0.0006) [2023-03-06 21:18:05,447][62475] Updated weights for policy 0, policy_version 14530 (0.0006) [2023-03-06 21:18:06,258][62475] Updated weights for policy 0, policy_version 14540 (0.0006) [2023-03-06 21:18:07,047][62475] Updated weights for policy 0, policy_version 14550 (0.0006) [2023-03-06 21:18:07,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 14903296. Throughput: 0: 12718.3. Samples: 14898734. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:18:07,390][62145] Avg episode reward: [(0, '347.174')] [2023-03-06 21:18:07,842][62475] Updated weights for policy 0, policy_version 14560 (0.0006) [2023-03-06 21:18:08,642][62475] Updated weights for policy 0, policy_version 14570 (0.0006) [2023-03-06 21:18:09,433][62475] Updated weights for policy 0, policy_version 14580 (0.0007) [2023-03-06 21:18:10,233][62475] Updated weights for policy 0, policy_version 14590 (0.0006) [2023-03-06 21:18:11,021][62475] Updated weights for policy 0, policy_version 14600 (0.0006) [2023-03-06 21:18:11,837][62475] Updated weights for policy 0, policy_version 14610 (0.0006) [2023-03-06 21:18:12,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12714.7, 300 sec: 12728.8). Total num frames: 14966784. Throughput: 0: 12723.1. Samples: 14937271. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:18:12,390][62145] Avg episode reward: [(0, '329.667')] [2023-03-06 21:18:12,654][62475] Updated weights for policy 0, policy_version 14620 (0.0006) [2023-03-06 21:18:13,458][62475] Updated weights for policy 0, policy_version 14630 (0.0006) [2023-03-06 21:18:14,248][62475] Updated weights for policy 0, policy_version 14640 (0.0006) [2023-03-06 21:18:15,065][62475] Updated weights for policy 0, policy_version 14650 (0.0006) [2023-03-06 21:18:15,861][62475] Updated weights for policy 0, policy_version 14660 (0.0006) [2023-03-06 21:18:16,661][62475] Updated weights for policy 0, policy_version 14670 (0.0006) [2023-03-06 21:18:17,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 15030272. Throughput: 0: 12721.3. Samples: 15013675. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:18:17,390][62145] Avg episode reward: [(0, '386.102')] [2023-03-06 21:18:17,465][62475] Updated weights for policy 0, policy_version 14680 (0.0006) [2023-03-06 21:18:18,259][62475] Updated weights for policy 0, policy_version 14690 (0.0006) [2023-03-06 21:18:19,060][62475] Updated weights for policy 0, policy_version 14700 (0.0007) [2023-03-06 21:18:19,865][62475] Updated weights for policy 0, policy_version 14710 (0.0006) [2023-03-06 21:18:20,676][62475] Updated weights for policy 0, policy_version 14720 (0.0008) [2023-03-06 21:18:21,482][62475] Updated weights for policy 0, policy_version 14730 (0.0006) [2023-03-06 21:18:22,292][62475] Updated weights for policy 0, policy_version 14740 (0.0006) [2023-03-06 21:18:22,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 15094784. Throughput: 0: 12728.4. Samples: 15090207. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:18:22,390][62145] Avg episode reward: [(0, '345.202')] [2023-03-06 21:18:22,394][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000014741_15094784.pth... [2023-03-06 21:18:22,429][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000011759_12041216.pth [2023-03-06 21:18:23,112][62475] Updated weights for policy 0, policy_version 14750 (0.0006) [2023-03-06 21:18:23,895][62475] Updated weights for policy 0, policy_version 14760 (0.0007) [2023-03-06 21:18:24,725][62475] Updated weights for policy 0, policy_version 14770 (0.0006) [2023-03-06 21:18:25,514][62475] Updated weights for policy 0, policy_version 14780 (0.0006) [2023-03-06 21:18:26,332][62475] Updated weights for policy 0, policy_version 14790 (0.0006) [2023-03-06 21:18:27,157][62475] Updated weights for policy 0, policy_version 14800 (0.0006) [2023-03-06 21:18:27,390][62145] Fps is (10 sec: 12799.8, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 15158272. Throughput: 0: 12724.6. Samples: 15128187. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:18:27,390][62145] Avg episode reward: [(0, '361.872')] [2023-03-06 21:18:27,947][62475] Updated weights for policy 0, policy_version 14810 (0.0006) [2023-03-06 21:18:28,755][62475] Updated weights for policy 0, policy_version 14820 (0.0006) [2023-03-06 21:18:29,556][62475] Updated weights for policy 0, policy_version 14830 (0.0006) [2023-03-06 21:18:30,349][62475] Updated weights for policy 0, policy_version 14840 (0.0006) [2023-03-06 21:18:31,157][62475] Updated weights for policy 0, policy_version 14850 (0.0007) [2023-03-06 21:18:31,970][62475] Updated weights for policy 0, policy_version 14860 (0.0007) [2023-03-06 21:18:32,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 15221760. Throughput: 0: 12726.8. Samples: 15204648. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:18:32,390][62145] Avg episode reward: [(0, '481.080')] [2023-03-06 21:18:32,774][62475] Updated weights for policy 0, policy_version 14870 (0.0006) [2023-03-06 21:18:33,553][62475] Updated weights for policy 0, policy_version 14880 (0.0007) [2023-03-06 21:18:34,371][62475] Updated weights for policy 0, policy_version 14890 (0.0007) [2023-03-06 21:18:34,932][62424] KL-divergence is very high: 249.5297 [2023-03-06 21:18:35,192][62475] Updated weights for policy 0, policy_version 14900 (0.0006) [2023-03-06 21:18:35,980][62475] Updated weights for policy 0, policy_version 14910 (0.0006) [2023-03-06 21:18:36,777][62475] Updated weights for policy 0, policy_version 14920 (0.0006) [2023-03-06 21:18:37,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 15285248. Throughput: 0: 12729.9. Samples: 15280973. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:18:37,390][62145] Avg episode reward: [(0, '431.805')] [2023-03-06 21:18:37,595][62475] Updated weights for policy 0, policy_version 14930 (0.0006) [2023-03-06 21:18:38,393][62475] Updated weights for policy 0, policy_version 14940 (0.0006) [2023-03-06 21:18:39,204][62475] Updated weights for policy 0, policy_version 14950 (0.0006) [2023-03-06 21:18:40,006][62475] Updated weights for policy 0, policy_version 14960 (0.0006) [2023-03-06 21:18:40,814][62475] Updated weights for policy 0, policy_version 14970 (0.0006) [2023-03-06 21:18:41,605][62475] Updated weights for policy 0, policy_version 14980 (0.0006) [2023-03-06 21:18:42,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12728.8). Total num frames: 15348736. Throughput: 0: 12744.1. Samples: 15319192. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:18:42,390][62145] Avg episode reward: [(0, '403.139')] [2023-03-06 21:18:42,398][62475] Updated weights for policy 0, policy_version 14990 (0.0007) [2023-03-06 21:18:43,214][62475] Updated weights for policy 0, policy_version 15000 (0.0007) [2023-03-06 21:18:44,024][62475] Updated weights for policy 0, policy_version 15010 (0.0007) [2023-03-06 21:18:44,840][62475] Updated weights for policy 0, policy_version 15020 (0.0006) [2023-03-06 21:18:45,641][62475] Updated weights for policy 0, policy_version 15030 (0.0005) [2023-03-06 21:18:46,458][62475] Updated weights for policy 0, policy_version 15040 (0.0006) [2023-03-06 21:18:47,249][62475] Updated weights for policy 0, policy_version 15050 (0.0007) [2023-03-06 21:18:47,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 15412224. Throughput: 0: 12742.4. Samples: 15395352. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:18:47,390][62145] Avg episode reward: [(0, '506.500')] [2023-03-06 21:18:48,055][62475] Updated weights for policy 0, policy_version 15060 (0.0007) [2023-03-06 21:18:48,865][62475] Updated weights for policy 0, policy_version 15070 (0.0006) [2023-03-06 21:18:49,663][62475] Updated weights for policy 0, policy_version 15080 (0.0006) [2023-03-06 21:18:50,465][62475] Updated weights for policy 0, policy_version 15090 (0.0006) [2023-03-06 21:18:51,267][62475] Updated weights for policy 0, policy_version 15100 (0.0006) [2023-03-06 21:18:52,079][62475] Updated weights for policy 0, policy_version 15110 (0.0006) [2023-03-06 21:18:52,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.6, 300 sec: 12725.4). Total num frames: 15475712. Throughput: 0: 12733.3. Samples: 15471733. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:18:52,390][62145] Avg episode reward: [(0, '472.635')] [2023-03-06 21:18:52,869][62475] Updated weights for policy 0, policy_version 15120 (0.0006) [2023-03-06 21:18:53,672][62475] Updated weights for policy 0, policy_version 15130 (0.0006) [2023-03-06 21:18:54,470][62475] Updated weights for policy 0, policy_version 15140 (0.0006) [2023-03-06 21:18:55,263][62475] Updated weights for policy 0, policy_version 15150 (0.0006) [2023-03-06 21:18:56,045][62475] Updated weights for policy 0, policy_version 15160 (0.0008) [2023-03-06 21:18:56,870][62475] Updated weights for policy 0, policy_version 15170 (0.0006) [2023-03-06 21:18:57,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12728.8). Total num frames: 15540224. Throughput: 0: 12733.5. Samples: 15510281. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:18:57,390][62145] Avg episode reward: [(0, '446.167')] [2023-03-06 21:18:57,674][62475] Updated weights for policy 0, policy_version 15180 (0.0007) [2023-03-06 21:18:58,491][62475] Updated weights for policy 0, policy_version 15190 (0.0006) [2023-03-06 21:18:59,294][62475] Updated weights for policy 0, policy_version 15200 (0.0006) [2023-03-06 21:19:00,098][62475] Updated weights for policy 0, policy_version 15210 (0.0006) [2023-03-06 21:19:00,897][62475] Updated weights for policy 0, policy_version 15220 (0.0006) [2023-03-06 21:19:01,690][62475] Updated weights for policy 0, policy_version 15230 (0.0006) [2023-03-06 21:19:02,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12748.8, 300 sec: 12728.8). Total num frames: 15603712. Throughput: 0: 12735.2. Samples: 15586759. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:19:02,390][62145] Avg episode reward: [(0, '476.008')] [2023-03-06 21:19:02,503][62475] Updated weights for policy 0, policy_version 15240 (0.0006) [2023-03-06 21:19:03,327][62475] Updated weights for policy 0, policy_version 15250 (0.0006) [2023-03-06 21:19:04,125][62475] Updated weights for policy 0, policy_version 15260 (0.0006) [2023-03-06 21:19:04,926][62475] Updated weights for policy 0, policy_version 15270 (0.0007) [2023-03-06 21:19:05,735][62475] Updated weights for policy 0, policy_version 15280 (0.0007) [2023-03-06 21:19:06,521][62475] Updated weights for policy 0, policy_version 15290 (0.0007) [2023-03-06 21:19:07,326][62475] Updated weights for policy 0, policy_version 15300 (0.0007) [2023-03-06 21:19:07,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 15667200. Throughput: 0: 12731.0. Samples: 15663103. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:19:07,390][62145] Avg episode reward: [(0, '486.924')] [2023-03-06 21:19:08,125][62475] Updated weights for policy 0, policy_version 15310 (0.0005) [2023-03-06 21:19:08,923][62475] Updated weights for policy 0, policy_version 15320 (0.0007) [2023-03-06 21:19:09,741][62475] Updated weights for policy 0, policy_version 15330 (0.0006) [2023-03-06 21:19:10,551][62475] Updated weights for policy 0, policy_version 15340 (0.0006) [2023-03-06 21:19:11,354][62475] Updated weights for policy 0, policy_version 15350 (0.0006) [2023-03-06 21:19:12,166][62475] Updated weights for policy 0, policy_version 15360 (0.0007) [2023-03-06 21:19:12,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 15730688. Throughput: 0: 12733.1. Samples: 15701177. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:19:12,390][62145] Avg episode reward: [(0, '485.084')] [2023-03-06 21:19:12,984][62475] Updated weights for policy 0, policy_version 15370 (0.0006) [2023-03-06 21:19:13,802][62475] Updated weights for policy 0, policy_version 15380 (0.0006) [2023-03-06 21:19:14,604][62475] Updated weights for policy 0, policy_version 15390 (0.0006) [2023-03-06 21:19:15,406][62475] Updated weights for policy 0, policy_version 15400 (0.0006) [2023-03-06 21:19:16,204][62475] Updated weights for policy 0, policy_version 15410 (0.0006) [2023-03-06 21:19:17,009][62475] Updated weights for policy 0, policy_version 15420 (0.0006) [2023-03-06 21:19:17,389][62145] Fps is (10 sec: 12800.2, 60 sec: 12748.8, 300 sec: 12728.8). Total num frames: 15795200. Throughput: 0: 12728.1. Samples: 15777410. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:19:17,390][62145] Avg episode reward: [(0, '505.000')] [2023-03-06 21:19:17,803][62475] Updated weights for policy 0, policy_version 15430 (0.0006) [2023-03-06 21:19:18,604][62475] Updated weights for policy 0, policy_version 15440 (0.0006) [2023-03-06 21:19:19,408][62475] Updated weights for policy 0, policy_version 15450 (0.0007) [2023-03-06 21:19:20,217][62475] Updated weights for policy 0, policy_version 15460 (0.0006) [2023-03-06 21:19:21,002][62475] Updated weights for policy 0, policy_version 15470 (0.0007) [2023-03-06 21:19:21,812][62475] Updated weights for policy 0, policy_version 15480 (0.0006) [2023-03-06 21:19:22,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 15858688. Throughput: 0: 12735.9. Samples: 15854091. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:19:22,390][62145] Avg episode reward: [(0, '449.289')] [2023-03-06 21:19:22,606][62475] Updated weights for policy 0, policy_version 15490 (0.0007) [2023-03-06 21:19:23,441][62475] Updated weights for policy 0, policy_version 15500 (0.0006) [2023-03-06 21:19:24,232][62475] Updated weights for policy 0, policy_version 15510 (0.0006) [2023-03-06 21:19:25,025][62475] Updated weights for policy 0, policy_version 15520 (0.0006) [2023-03-06 21:19:25,835][62475] Updated weights for policy 0, policy_version 15530 (0.0007) [2023-03-06 21:19:26,613][62475] Updated weights for policy 0, policy_version 15540 (0.0006) [2023-03-06 21:19:27,389][62145] Fps is (10 sec: 12697.5, 60 sec: 12731.8, 300 sec: 12728.8). Total num frames: 15922176. Throughput: 0: 12735.8. Samples: 15892303. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:19:27,390][62145] Avg episode reward: [(0, '499.291')] [2023-03-06 21:19:27,419][62475] Updated weights for policy 0, policy_version 15550 (0.0006) [2023-03-06 21:19:28,235][62475] Updated weights for policy 0, policy_version 15560 (0.0006) [2023-03-06 21:19:29,038][62475] Updated weights for policy 0, policy_version 15570 (0.0006) [2023-03-06 21:19:29,849][62475] Updated weights for policy 0, policy_version 15580 (0.0006) [2023-03-06 21:19:30,641][62475] Updated weights for policy 0, policy_version 15590 (0.0006) [2023-03-06 21:19:31,436][62475] Updated weights for policy 0, policy_version 15600 (0.0007) [2023-03-06 21:19:32,240][62475] Updated weights for policy 0, policy_version 15610 (0.0006) [2023-03-06 21:19:32,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 15985664. Throughput: 0: 12744.3. Samples: 15968844. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 21:19:32,390][62145] Avg episode reward: [(0, '418.209')] [2023-03-06 21:19:33,038][62475] Updated weights for policy 0, policy_version 15620 (0.0006) [2023-03-06 21:19:33,857][62475] Updated weights for policy 0, policy_version 15630 (0.0006) [2023-03-06 21:19:34,665][62475] Updated weights for policy 0, policy_version 15640 (0.0006) [2023-03-06 21:19:35,473][62475] Updated weights for policy 0, policy_version 15650 (0.0006) [2023-03-06 21:19:36,275][62475] Updated weights for policy 0, policy_version 15660 (0.0006) [2023-03-06 21:19:37,080][62475] Updated weights for policy 0, policy_version 15670 (0.0006) [2023-03-06 21:19:37,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12728.8). Total num frames: 16050176. Throughput: 0: 12746.7. Samples: 16045335. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 21:19:37,390][62145] Avg episode reward: [(0, '498.517')] [2023-03-06 21:19:37,872][62475] Updated weights for policy 0, policy_version 15680 (0.0006) [2023-03-06 21:19:38,677][62475] Updated weights for policy 0, policy_version 15690 (0.0006) [2023-03-06 21:19:39,465][62475] Updated weights for policy 0, policy_version 15700 (0.0006) [2023-03-06 21:19:40,261][62475] Updated weights for policy 0, policy_version 15710 (0.0006) [2023-03-06 21:19:41,074][62475] Updated weights for policy 0, policy_version 15720 (0.0007) [2023-03-06 21:19:41,860][62475] Updated weights for policy 0, policy_version 15730 (0.0006) [2023-03-06 21:19:42,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12748.8, 300 sec: 12728.8). Total num frames: 16113664. Throughput: 0: 12745.4. Samples: 16083825. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:19:42,390][62145] Avg episode reward: [(0, '435.998')] [2023-03-06 21:19:42,675][62475] Updated weights for policy 0, policy_version 15740 (0.0007) [2023-03-06 21:19:43,488][62475] Updated weights for policy 0, policy_version 15750 (0.0007) [2023-03-06 21:19:44,286][62475] Updated weights for policy 0, policy_version 15760 (0.0006) [2023-03-06 21:19:45,084][62475] Updated weights for policy 0, policy_version 15770 (0.0006) [2023-03-06 21:19:45,890][62475] Updated weights for policy 0, policy_version 15780 (0.0006) [2023-03-06 21:19:46,695][62475] Updated weights for policy 0, policy_version 15790 (0.0006) [2023-03-06 21:19:47,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12748.8, 300 sec: 12728.8). Total num frames: 16177152. Throughput: 0: 12743.3. Samples: 16160208. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:19:47,390][62145] Avg episode reward: [(0, '404.518')] [2023-03-06 21:19:47,516][62475] Updated weights for policy 0, policy_version 15800 (0.0006) [2023-03-06 21:19:48,309][62475] Updated weights for policy 0, policy_version 15810 (0.0007) [2023-03-06 21:19:49,128][62475] Updated weights for policy 0, policy_version 15820 (0.0007) [2023-03-06 21:19:49,933][62475] Updated weights for policy 0, policy_version 15830 (0.0006) [2023-03-06 21:19:50,736][62475] Updated weights for policy 0, policy_version 15840 (0.0006) [2023-03-06 21:19:51,529][62475] Updated weights for policy 0, policy_version 15850 (0.0006) [2023-03-06 21:19:52,352][62475] Updated weights for policy 0, policy_version 15860 (0.0006) [2023-03-06 21:19:52,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12748.8, 300 sec: 12725.4). Total num frames: 16240640. Throughput: 0: 12742.4. Samples: 16236510. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:19:52,390][62145] Avg episode reward: [(0, '440.131')] [2023-03-06 21:19:53,155][62475] Updated weights for policy 0, policy_version 15870 (0.0006) [2023-03-06 21:19:53,957][62475] Updated weights for policy 0, policy_version 15880 (0.0006) [2023-03-06 21:19:54,752][62475] Updated weights for policy 0, policy_version 15890 (0.0007) [2023-03-06 21:19:55,566][62475] Updated weights for policy 0, policy_version 15900 (0.0006) [2023-03-06 21:19:56,368][62475] Updated weights for policy 0, policy_version 15910 (0.0006) [2023-03-06 21:19:57,183][62475] Updated weights for policy 0, policy_version 15920 (0.0006) [2023-03-06 21:19:57,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.8, 300 sec: 12725.4). Total num frames: 16304128. Throughput: 0: 12741.2. Samples: 16274530. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:19:57,390][62145] Avg episode reward: [(0, '484.974')] [2023-03-06 21:19:57,981][62475] Updated weights for policy 0, policy_version 15930 (0.0006) [2023-03-06 21:19:58,805][62475] Updated weights for policy 0, policy_version 15940 (0.0007) [2023-03-06 21:19:59,593][62475] Updated weights for policy 0, policy_version 15950 (0.0006) [2023-03-06 21:20:00,439][62475] Updated weights for policy 0, policy_version 15960 (0.0006) [2023-03-06 21:20:01,241][62475] Updated weights for policy 0, policy_version 15970 (0.0007) [2023-03-06 21:20:02,031][62475] Updated weights for policy 0, policy_version 15980 (0.0007) [2023-03-06 21:20:02,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 16367616. Throughput: 0: 12736.1. Samples: 16350535. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:20:02,390][62145] Avg episode reward: [(0, '421.614')] [2023-03-06 21:20:02,835][62475] Updated weights for policy 0, policy_version 15990 (0.0006) [2023-03-06 21:20:03,647][62475] Updated weights for policy 0, policy_version 16000 (0.0006) [2023-03-06 21:20:04,449][62475] Updated weights for policy 0, policy_version 16010 (0.0006) [2023-03-06 21:20:05,250][62475] Updated weights for policy 0, policy_version 16020 (0.0007) [2023-03-06 21:20:06,058][62475] Updated weights for policy 0, policy_version 16030 (0.0006) [2023-03-06 21:20:06,862][62475] Updated weights for policy 0, policy_version 16040 (0.0005) [2023-03-06 21:20:07,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 16431104. Throughput: 0: 12728.9. Samples: 16426893. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-06 21:20:07,390][62145] Avg episode reward: [(0, '435.208')] [2023-03-06 21:20:07,657][62475] Updated weights for policy 0, policy_version 16050 (0.0006) [2023-03-06 21:20:08,455][62475] Updated weights for policy 0, policy_version 16060 (0.0006) [2023-03-06 21:20:09,269][62475] Updated weights for policy 0, policy_version 16070 (0.0006) [2023-03-06 21:20:10,061][62475] Updated weights for policy 0, policy_version 16080 (0.0006) [2023-03-06 21:20:10,855][62475] Updated weights for policy 0, policy_version 16090 (0.0006) [2023-03-06 21:20:11,668][62475] Updated weights for policy 0, policy_version 16100 (0.0007) [2023-03-06 21:20:12,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 16494592. Throughput: 0: 12731.9. Samples: 16465238. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-06 21:20:12,390][62145] Avg episode reward: [(0, '357.765')] [2023-03-06 21:20:12,484][62475] Updated weights for policy 0, policy_version 16110 (0.0007) [2023-03-06 21:20:13,274][62475] Updated weights for policy 0, policy_version 16120 (0.0006) [2023-03-06 21:20:14,095][62475] Updated weights for policy 0, policy_version 16130 (0.0006) [2023-03-06 21:20:14,902][62475] Updated weights for policy 0, policy_version 16140 (0.0007) [2023-03-06 21:20:15,705][62475] Updated weights for policy 0, policy_version 16150 (0.0007) [2023-03-06 21:20:16,501][62475] Updated weights for policy 0, policy_version 16160 (0.0006) [2023-03-06 21:20:17,311][62475] Updated weights for policy 0, policy_version 16170 (0.0006) [2023-03-06 21:20:17,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.6, 300 sec: 12725.4). Total num frames: 16558080. Throughput: 0: 12725.4. Samples: 16541488. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:20:17,390][62145] Avg episode reward: [(0, '345.641')] [2023-03-06 21:20:18,088][62475] Updated weights for policy 0, policy_version 16180 (0.0007) [2023-03-06 21:20:18,899][62475] Updated weights for policy 0, policy_version 16190 (0.0006) [2023-03-06 21:20:19,708][62475] Updated weights for policy 0, policy_version 16200 (0.0007) [2023-03-06 21:20:20,505][62475] Updated weights for policy 0, policy_version 16210 (0.0005) [2023-03-06 21:20:21,325][62475] Updated weights for policy 0, policy_version 16220 (0.0006) [2023-03-06 21:20:22,136][62475] Updated weights for policy 0, policy_version 16230 (0.0007) [2023-03-06 21:20:22,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 16621568. Throughput: 0: 12723.1. Samples: 16617876. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:20:22,390][62145] Avg episode reward: [(0, '385.679')] [2023-03-06 21:20:22,393][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000016233_16622592.pth... [2023-03-06 21:20:22,424][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000013249_13566976.pth [2023-03-06 21:20:22,928][62475] Updated weights for policy 0, policy_version 16240 (0.0007) [2023-03-06 21:20:23,760][62475] Updated weights for policy 0, policy_version 16250 (0.0005) [2023-03-06 21:20:24,571][62475] Updated weights for policy 0, policy_version 16260 (0.0006) [2023-03-06 21:20:25,366][62475] Updated weights for policy 0, policy_version 16270 (0.0006) [2023-03-06 21:20:26,179][62475] Updated weights for policy 0, policy_version 16280 (0.0007) [2023-03-06 21:20:26,971][62475] Updated weights for policy 0, policy_version 16290 (0.0006) [2023-03-06 21:20:27,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 16686080. Throughput: 0: 12712.7. Samples: 16655896. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:20:27,390][62145] Avg episode reward: [(0, '518.011')] [2023-03-06 21:20:27,796][62475] Updated weights for policy 0, policy_version 16300 (0.0006) [2023-03-06 21:20:28,591][62475] Updated weights for policy 0, policy_version 16310 (0.0006) [2023-03-06 21:20:29,414][62475] Updated weights for policy 0, policy_version 16320 (0.0006) [2023-03-06 21:20:30,191][62475] Updated weights for policy 0, policy_version 16330 (0.0006) [2023-03-06 21:20:31,002][62475] Updated weights for policy 0, policy_version 16340 (0.0006) [2023-03-06 21:20:31,821][62475] Updated weights for policy 0, policy_version 16350 (0.0006) [2023-03-06 21:20:32,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 16749568. Throughput: 0: 12711.5. Samples: 16732226. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:20:32,390][62145] Avg episode reward: [(0, '493.332')] [2023-03-06 21:20:32,610][62475] Updated weights for policy 0, policy_version 16360 (0.0006) [2023-03-06 21:20:33,423][62475] Updated weights for policy 0, policy_version 16370 (0.0006) [2023-03-06 21:20:34,233][62475] Updated weights for policy 0, policy_version 16380 (0.0006) [2023-03-06 21:20:35,034][62475] Updated weights for policy 0, policy_version 16390 (0.0006) [2023-03-06 21:20:35,854][62475] Updated weights for policy 0, policy_version 16400 (0.0006) [2023-03-06 21:20:36,644][62475] Updated weights for policy 0, policy_version 16410 (0.0006) [2023-03-06 21:20:37,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12728.8). Total num frames: 16813056. Throughput: 0: 12712.7. Samples: 16808581. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:20:37,390][62145] Avg episode reward: [(0, '478.204')] [2023-03-06 21:20:37,450][62475] Updated weights for policy 0, policy_version 16420 (0.0006) [2023-03-06 21:20:38,252][62475] Updated weights for policy 0, policy_version 16430 (0.0006) [2023-03-06 21:20:39,045][62475] Updated weights for policy 0, policy_version 16440 (0.0006) [2023-03-06 21:20:39,516][62424] KL-divergence is very high: 223.8039 [2023-03-06 21:20:39,866][62475] Updated weights for policy 0, policy_version 16450 (0.0007) [2023-03-06 21:20:40,682][62475] Updated weights for policy 0, policy_version 16460 (0.0006) [2023-03-06 21:20:41,472][62475] Updated weights for policy 0, policy_version 16470 (0.0006) [2023-03-06 21:20:42,291][62475] Updated weights for policy 0, policy_version 16480 (0.0006) [2023-03-06 21:20:42,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12728.8). Total num frames: 16876544. Throughput: 0: 12713.9. Samples: 16846655. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:20:42,390][62145] Avg episode reward: [(0, '492.971')] [2023-03-06 21:20:43,086][62475] Updated weights for policy 0, policy_version 16490 (0.0006) [2023-03-06 21:20:43,889][62475] Updated weights for policy 0, policy_version 16500 (0.0007) [2023-03-06 21:20:44,674][62475] Updated weights for policy 0, policy_version 16510 (0.0007) [2023-03-06 21:20:45,469][62475] Updated weights for policy 0, policy_version 16520 (0.0006) [2023-03-06 21:20:46,274][62475] Updated weights for policy 0, policy_version 16530 (0.0006) [2023-03-06 21:20:47,077][62475] Updated weights for policy 0, policy_version 16540 (0.0006) [2023-03-06 21:20:47,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12728.8). Total num frames: 16940032. Throughput: 0: 12730.9. Samples: 16923427. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:20:47,390][62145] Avg episode reward: [(0, '509.344')] [2023-03-06 21:20:47,883][62475] Updated weights for policy 0, policy_version 16550 (0.0007) [2023-03-06 21:20:48,700][62475] Updated weights for policy 0, policy_version 16560 (0.0006) [2023-03-06 21:20:49,513][62475] Updated weights for policy 0, policy_version 16570 (0.0006) [2023-03-06 21:20:50,317][62475] Updated weights for policy 0, policy_version 16580 (0.0007) [2023-03-06 21:20:51,124][62475] Updated weights for policy 0, policy_version 16590 (0.0006) [2023-03-06 21:20:51,945][62475] Updated weights for policy 0, policy_version 16600 (0.0006) [2023-03-06 21:20:52,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 17003520. Throughput: 0: 12722.5. Samples: 16999406. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:20:52,390][62145] Avg episode reward: [(0, '442.723')] [2023-03-06 21:20:52,731][62475] Updated weights for policy 0, policy_version 16610 (0.0007) [2023-03-06 21:20:53,540][62475] Updated weights for policy 0, policy_version 16620 (0.0007) [2023-03-06 21:20:54,330][62475] Updated weights for policy 0, policy_version 16630 (0.0006) [2023-03-06 21:20:55,128][62475] Updated weights for policy 0, policy_version 16640 (0.0007) [2023-03-06 21:20:55,698][62424] KL-divergence is very high: 237.7088 [2023-03-06 21:20:55,926][62475] Updated weights for policy 0, policy_version 16650 (0.0008) [2023-03-06 21:20:56,729][62475] Updated weights for policy 0, policy_version 16660 (0.0007) [2023-03-06 21:20:57,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 17068032. Throughput: 0: 12722.5. Samples: 17037749. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:20:57,390][62145] Avg episode reward: [(0, '474.985')] [2023-03-06 21:20:57,530][62475] Updated weights for policy 0, policy_version 16670 (0.0007) [2023-03-06 21:20:58,339][62475] Updated weights for policy 0, policy_version 16680 (0.0006) [2023-03-06 21:20:58,434][62424] KL-divergence is very high: 130.6006 [2023-03-06 21:20:59,136][62475] Updated weights for policy 0, policy_version 16690 (0.0008) [2023-03-06 21:20:59,938][62475] Updated weights for policy 0, policy_version 16700 (0.0006) [2023-03-06 21:21:00,734][62475] Updated weights for policy 0, policy_version 16710 (0.0006) [2023-03-06 21:21:01,557][62475] Updated weights for policy 0, policy_version 16720 (0.0006) [2023-03-06 21:21:02,373][62475] Updated weights for policy 0, policy_version 16730 (0.0006) [2023-03-06 21:21:02,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 17131520. Throughput: 0: 12733.8. Samples: 17114509. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:21:02,390][62145] Avg episode reward: [(0, '498.494')] [2023-03-06 21:21:03,170][62475] Updated weights for policy 0, policy_version 16740 (0.0006) [2023-03-06 21:21:03,978][62475] Updated weights for policy 0, policy_version 16750 (0.0006) [2023-03-06 21:21:04,777][62475] Updated weights for policy 0, policy_version 16760 (0.0007) [2023-03-06 21:21:05,583][62475] Updated weights for policy 0, policy_version 16770 (0.0007) [2023-03-06 21:21:06,385][62475] Updated weights for policy 0, policy_version 16780 (0.0006) [2023-03-06 21:21:07,183][62475] Updated weights for policy 0, policy_version 16790 (0.0006) [2023-03-06 21:21:07,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12732.3). Total num frames: 17195008. Throughput: 0: 12728.7. Samples: 17190666. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:21:07,390][62145] Avg episode reward: [(0, '482.731')] [2023-03-06 21:21:07,985][62475] Updated weights for policy 0, policy_version 16800 (0.0006) [2023-03-06 21:21:08,814][62475] Updated weights for policy 0, policy_version 16810 (0.0007) [2023-03-06 21:21:09,602][62475] Updated weights for policy 0, policy_version 16820 (0.0006) [2023-03-06 21:21:10,406][62475] Updated weights for policy 0, policy_version 16830 (0.0006) [2023-03-06 21:21:11,206][62475] Updated weights for policy 0, policy_version 16840 (0.0006) [2023-03-06 21:21:12,022][62475] Updated weights for policy 0, policy_version 16850 (0.0006) [2023-03-06 21:21:12,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 17258496. Throughput: 0: 12734.7. Samples: 17228958. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:21:12,390][62145] Avg episode reward: [(0, '503.752')] [2023-03-06 21:21:12,822][62475] Updated weights for policy 0, policy_version 16860 (0.0006) [2023-03-06 21:21:13,637][62475] Updated weights for policy 0, policy_version 16870 (0.0008) [2023-03-06 21:21:14,451][62475] Updated weights for policy 0, policy_version 16880 (0.0006) [2023-03-06 21:21:15,231][62475] Updated weights for policy 0, policy_version 16890 (0.0006) [2023-03-06 21:21:16,042][62475] Updated weights for policy 0, policy_version 16900 (0.0006) [2023-03-06 21:21:16,846][62475] Updated weights for policy 0, policy_version 16910 (0.0006) [2023-03-06 21:21:17,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 17321984. Throughput: 0: 12732.4. Samples: 17305183. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:21:17,390][62145] Avg episode reward: [(0, '537.254')] [2023-03-06 21:21:17,645][62475] Updated weights for policy 0, policy_version 16920 (0.0006) [2023-03-06 21:21:18,454][62475] Updated weights for policy 0, policy_version 16930 (0.0006) [2023-03-06 21:21:19,261][62475] Updated weights for policy 0, policy_version 16940 (0.0006) [2023-03-06 21:21:20,040][62475] Updated weights for policy 0, policy_version 16950 (0.0006) [2023-03-06 21:21:20,853][62475] Updated weights for policy 0, policy_version 16960 (0.0007) [2023-03-06 21:21:21,661][62475] Updated weights for policy 0, policy_version 16970 (0.0005) [2023-03-06 21:21:22,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12732.3). Total num frames: 17386496. Throughput: 0: 12734.7. Samples: 17381644. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:21:22,390][62145] Avg episode reward: [(0, '563.309')] [2023-03-06 21:21:22,466][62475] Updated weights for policy 0, policy_version 16980 (0.0006) [2023-03-06 21:21:23,274][62475] Updated weights for policy 0, policy_version 16990 (0.0006) [2023-03-06 21:21:24,072][62475] Updated weights for policy 0, policy_version 17000 (0.0006) [2023-03-06 21:21:24,857][62475] Updated weights for policy 0, policy_version 17010 (0.0007) [2023-03-06 21:21:25,672][62475] Updated weights for policy 0, policy_version 17020 (0.0005) [2023-03-06 21:21:26,480][62475] Updated weights for policy 0, policy_version 17030 (0.0006) [2023-03-06 21:21:27,279][62475] Updated weights for policy 0, policy_version 17040 (0.0006) [2023-03-06 21:21:27,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12732.3). Total num frames: 17449984. Throughput: 0: 12747.7. Samples: 17420301. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:21:27,390][62145] Avg episode reward: [(0, '429.515')] [2023-03-06 21:21:28,081][62475] Updated weights for policy 0, policy_version 17050 (0.0007) [2023-03-06 21:21:28,890][62475] Updated weights for policy 0, policy_version 17060 (0.0006) [2023-03-06 21:21:29,708][62475] Updated weights for policy 0, policy_version 17070 (0.0007) [2023-03-06 21:21:30,489][62475] Updated weights for policy 0, policy_version 17080 (0.0006) [2023-03-06 21:21:31,298][62475] Updated weights for policy 0, policy_version 17090 (0.0005) [2023-03-06 21:21:32,141][62475] Updated weights for policy 0, policy_version 17100 (0.0006) [2023-03-06 21:21:32,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12732.3). Total num frames: 17513472. Throughput: 0: 12732.6. Samples: 17496395. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:21:32,390][62145] Avg episode reward: [(0, '455.371')] [2023-03-06 21:21:32,934][62475] Updated weights for policy 0, policy_version 17110 (0.0006) [2023-03-06 21:21:33,745][62475] Updated weights for policy 0, policy_version 17120 (0.0006) [2023-03-06 21:21:34,546][62475] Updated weights for policy 0, policy_version 17130 (0.0006) [2023-03-06 21:21:35,352][62475] Updated weights for policy 0, policy_version 17140 (0.0006) [2023-03-06 21:21:36,161][62475] Updated weights for policy 0, policy_version 17150 (0.0006) [2023-03-06 21:21:36,963][62475] Updated weights for policy 0, policy_version 17160 (0.0006) [2023-03-06 21:21:37,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 17576960. Throughput: 0: 12733.5. Samples: 17572413. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:21:37,390][62145] Avg episode reward: [(0, '490.436')] [2023-03-06 21:21:37,770][62475] Updated weights for policy 0, policy_version 17170 (0.0006) [2023-03-06 21:21:38,569][62475] Updated weights for policy 0, policy_version 17180 (0.0006) [2023-03-06 21:21:39,382][62475] Updated weights for policy 0, policy_version 17190 (0.0006) [2023-03-06 21:21:40,196][62475] Updated weights for policy 0, policy_version 17200 (0.0006) [2023-03-06 21:21:40,977][62475] Updated weights for policy 0, policy_version 17210 (0.0006) [2023-03-06 21:21:41,803][62475] Updated weights for policy 0, policy_version 17220 (0.0006) [2023-03-06 21:21:42,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 17640448. Throughput: 0: 12727.1. Samples: 17610469. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:21:42,390][62145] Avg episode reward: [(0, '541.449')] [2023-03-06 21:21:42,598][62475] Updated weights for policy 0, policy_version 17230 (0.0006) [2023-03-06 21:21:43,415][62475] Updated weights for policy 0, policy_version 17240 (0.0006) [2023-03-06 21:21:44,217][62475] Updated weights for policy 0, policy_version 17250 (0.0006) [2023-03-06 21:21:45,004][62475] Updated weights for policy 0, policy_version 17260 (0.0007) [2023-03-06 21:21:45,815][62475] Updated weights for policy 0, policy_version 17270 (0.0006) [2023-03-06 21:21:46,611][62475] Updated weights for policy 0, policy_version 17280 (0.0007) [2023-03-06 21:21:47,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 17703936. Throughput: 0: 12723.3. Samples: 17687059. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:21:47,390][62145] Avg episode reward: [(0, '597.552')] [2023-03-06 21:21:47,396][62475] Updated weights for policy 0, policy_version 17290 (0.0006) [2023-03-06 21:21:48,199][62475] Updated weights for policy 0, policy_version 17300 (0.0006) [2023-03-06 21:21:49,013][62475] Updated weights for policy 0, policy_version 17310 (0.0007) [2023-03-06 21:21:49,817][62475] Updated weights for policy 0, policy_version 17320 (0.0005) [2023-03-06 21:21:50,605][62475] Updated weights for policy 0, policy_version 17330 (0.0006) [2023-03-06 21:21:51,413][62475] Updated weights for policy 0, policy_version 17340 (0.0007) [2023-03-06 21:21:52,210][62475] Updated weights for policy 0, policy_version 17350 (0.0006) [2023-03-06 21:21:52,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12728.8). Total num frames: 17768448. Throughput: 0: 12734.2. Samples: 17763704. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:21:52,390][62145] Avg episode reward: [(0, '551.824')] [2023-03-06 21:21:53,020][62475] Updated weights for policy 0, policy_version 17360 (0.0006) [2023-03-06 21:21:53,824][62475] Updated weights for policy 0, policy_version 17370 (0.0006) [2023-03-06 21:21:54,615][62475] Updated weights for policy 0, policy_version 17380 (0.0006) [2023-03-06 21:21:55,429][62475] Updated weights for policy 0, policy_version 17390 (0.0006) [2023-03-06 21:21:56,229][62475] Updated weights for policy 0, policy_version 17400 (0.0006) [2023-03-06 21:21:57,034][62475] Updated weights for policy 0, policy_version 17410 (0.0006) [2023-03-06 21:21:57,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12731.7, 300 sec: 12732.3). Total num frames: 17831936. Throughput: 0: 12736.5. Samples: 17802102. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:21:57,390][62145] Avg episode reward: [(0, '422.046')] [2023-03-06 21:21:57,841][62475] Updated weights for policy 0, policy_version 17420 (0.0006) [2023-03-06 21:21:58,675][62475] Updated weights for policy 0, policy_version 17430 (0.0006) [2023-03-06 21:21:59,467][62475] Updated weights for policy 0, policy_version 17440 (0.0007) [2023-03-06 21:21:59,535][62424] KL-divergence is very high: 327.6901 [2023-03-06 21:22:00,277][62475] Updated weights for policy 0, policy_version 17450 (0.0006) [2023-03-06 21:22:01,093][62475] Updated weights for policy 0, policy_version 17460 (0.0006) [2023-03-06 21:22:01,883][62475] Updated weights for policy 0, policy_version 17470 (0.0006) [2023-03-06 21:22:02,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12732.3). Total num frames: 17895424. Throughput: 0: 12734.1. Samples: 17878216. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:22:02,390][62145] Avg episode reward: [(0, '477.029')] [2023-03-06 21:22:02,666][62475] Updated weights for policy 0, policy_version 17480 (0.0005) [2023-03-06 21:22:03,487][62475] Updated weights for policy 0, policy_version 17490 (0.0006) [2023-03-06 21:22:04,285][62475] Updated weights for policy 0, policy_version 17500 (0.0006) [2023-03-06 21:22:05,080][62475] Updated weights for policy 0, policy_version 17510 (0.0006) [2023-03-06 21:22:05,902][62475] Updated weights for policy 0, policy_version 17520 (0.0006) [2023-03-06 21:22:06,703][62475] Updated weights for policy 0, policy_version 17530 (0.0006) [2023-03-06 21:22:07,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 17958912. Throughput: 0: 12731.8. Samples: 17954577. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:22:07,401][62145] Avg episode reward: [(0, '522.670')] [2023-03-06 21:22:07,514][62475] Updated weights for policy 0, policy_version 17540 (0.0006) [2023-03-06 21:22:08,310][62475] Updated weights for policy 0, policy_version 17550 (0.0006) [2023-03-06 21:22:09,120][62475] Updated weights for policy 0, policy_version 17560 (0.0006) [2023-03-06 21:22:09,910][62475] Updated weights for policy 0, policy_version 17570 (0.0006) [2023-03-06 21:22:10,714][62475] Updated weights for policy 0, policy_version 17580 (0.0006) [2023-03-06 21:22:11,495][62475] Updated weights for policy 0, policy_version 17590 (0.0006) [2023-03-06 21:22:12,285][62475] Updated weights for policy 0, policy_version 17600 (0.0006) [2023-03-06 21:22:12,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12748.8, 300 sec: 12732.3). Total num frames: 18023424. Throughput: 0: 12726.7. Samples: 17993002. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:22:12,400][62145] Avg episode reward: [(0, '405.246')] [2023-03-06 21:22:13,088][62475] Updated weights for policy 0, policy_version 17610 (0.0006) [2023-03-06 21:22:13,890][62475] Updated weights for policy 0, policy_version 17620 (0.0006) [2023-03-06 21:22:14,698][62475] Updated weights for policy 0, policy_version 17630 (0.0007) [2023-03-06 21:22:15,502][62475] Updated weights for policy 0, policy_version 17640 (0.0006) [2023-03-06 21:22:15,732][62424] KL-divergence is very high: 128.6881 [2023-03-06 21:22:16,302][62475] Updated weights for policy 0, policy_version 17650 (0.0006) [2023-03-06 21:22:17,133][62475] Updated weights for policy 0, policy_version 17660 (0.0007) [2023-03-06 21:22:17,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12732.3). Total num frames: 18086912. Throughput: 0: 12743.3. Samples: 18069844. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-06 21:22:17,401][62145] Avg episode reward: [(0, '541.729')] [2023-03-06 21:22:17,922][62475] Updated weights for policy 0, policy_version 17670 (0.0006) [2023-03-06 21:22:18,745][62475] Updated weights for policy 0, policy_version 17680 (0.0005) [2023-03-06 21:22:18,781][62424] KL-divergence is very high: 502.3051 [2023-03-06 21:22:19,533][62475] Updated weights for policy 0, policy_version 17690 (0.0006) [2023-03-06 21:22:20,331][62475] Updated weights for policy 0, policy_version 17700 (0.0005) [2023-03-06 21:22:21,152][62475] Updated weights for policy 0, policy_version 17710 (0.0006) [2023-03-06 21:22:21,950][62475] Updated weights for policy 0, policy_version 17720 (0.0006) [2023-03-06 21:22:22,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12732.3). Total num frames: 18150400. Throughput: 0: 12745.7. Samples: 18145969. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-06 21:22:22,401][62145] Avg episode reward: [(0, '427.490')] [2023-03-06 21:22:22,405][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000017725_18150400.pth... [2023-03-06 21:22:22,436][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000014741_15094784.pth [2023-03-06 21:22:22,763][62475] Updated weights for policy 0, policy_version 17730 (0.0006) [2023-03-06 21:22:23,576][62475] Updated weights for policy 0, policy_version 17740 (0.0006) [2023-03-06 21:22:24,382][62475] Updated weights for policy 0, policy_version 17750 (0.0006) [2023-03-06 21:22:25,214][62475] Updated weights for policy 0, policy_version 17760 (0.0006) [2023-03-06 21:22:26,010][62475] Updated weights for policy 0, policy_version 17770 (0.0006) [2023-03-06 21:22:26,819][62475] Updated weights for policy 0, policy_version 17780 (0.0006) [2023-03-06 21:22:27,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12732.3). Total num frames: 18213888. Throughput: 0: 12742.8. Samples: 18183896. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-06 21:22:27,390][62145] Avg episode reward: [(0, '504.325')] [2023-03-06 21:22:27,646][62475] Updated weights for policy 0, policy_version 17790 (0.0007) [2023-03-06 21:22:28,460][62475] Updated weights for policy 0, policy_version 17800 (0.0007) [2023-03-06 21:22:29,254][62475] Updated weights for policy 0, policy_version 17810 (0.0006) [2023-03-06 21:22:30,047][62475] Updated weights for policy 0, policy_version 17820 (0.0007) [2023-03-06 21:22:30,842][62475] Updated weights for policy 0, policy_version 17830 (0.0006) [2023-03-06 21:22:31,646][62475] Updated weights for policy 0, policy_version 17840 (0.0007) [2023-03-06 21:22:32,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12732.3). Total num frames: 18277376. Throughput: 0: 12733.9. Samples: 18260086. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:22:32,390][62145] Avg episode reward: [(0, '473.735')] [2023-03-06 21:22:32,440][62475] Updated weights for policy 0, policy_version 17850 (0.0006) [2023-03-06 21:22:33,242][62475] Updated weights for policy 0, policy_version 17860 (0.0006) [2023-03-06 21:22:34,044][62475] Updated weights for policy 0, policy_version 17870 (0.0006) [2023-03-06 21:22:34,852][62475] Updated weights for policy 0, policy_version 17880 (0.0007) [2023-03-06 21:22:35,643][62475] Updated weights for policy 0, policy_version 17890 (0.0006) [2023-03-06 21:22:36,465][62475] Updated weights for policy 0, policy_version 17900 (0.0006) [2023-03-06 21:22:37,278][62475] Updated weights for policy 0, policy_version 17910 (0.0006) [2023-03-06 21:22:37,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 18340864. Throughput: 0: 12727.9. Samples: 18336460. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:22:37,390][62145] Avg episode reward: [(0, '474.299')] [2023-03-06 21:22:38,088][62475] Updated weights for policy 0, policy_version 17920 (0.0006) [2023-03-06 21:22:38,886][62475] Updated weights for policy 0, policy_version 17930 (0.0006) [2023-03-06 21:22:39,696][62475] Updated weights for policy 0, policy_version 17940 (0.0007) [2023-03-06 21:22:40,149][62424] KL-divergence is very high: 113.3741 [2023-03-06 21:22:40,489][62475] Updated weights for policy 0, policy_version 17950 (0.0006) [2023-03-06 21:22:41,292][62475] Updated weights for policy 0, policy_version 17960 (0.0008) [2023-03-06 21:22:42,115][62475] Updated weights for policy 0, policy_version 17970 (0.0006) [2023-03-06 21:22:42,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 18404352. Throughput: 0: 12724.1. Samples: 18374686. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 21:22:42,390][62145] Avg episode reward: [(0, '429.920')] [2023-03-06 21:22:42,909][62475] Updated weights for policy 0, policy_version 17980 (0.0006) [2023-03-06 21:22:43,720][62475] Updated weights for policy 0, policy_version 17990 (0.0006) [2023-03-06 21:22:44,537][62475] Updated weights for policy 0, policy_version 18000 (0.0006) [2023-03-06 21:22:45,345][62475] Updated weights for policy 0, policy_version 18010 (0.0007) [2023-03-06 21:22:46,125][62475] Updated weights for policy 0, policy_version 18020 (0.0005) [2023-03-06 21:22:46,956][62475] Updated weights for policy 0, policy_version 18030 (0.0008) [2023-03-06 21:22:47,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 18467840. Throughput: 0: 12723.8. Samples: 18450787. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 21:22:47,390][62145] Avg episode reward: [(0, '491.662')] [2023-03-06 21:22:47,742][62475] Updated weights for policy 0, policy_version 18040 (0.0005) [2023-03-06 21:22:48,530][62475] Updated weights for policy 0, policy_version 18050 (0.0006) [2023-03-06 21:22:49,351][62475] Updated weights for policy 0, policy_version 18060 (0.0006) [2023-03-06 21:22:50,161][62475] Updated weights for policy 0, policy_version 18070 (0.0007) [2023-03-06 21:22:50,951][62475] Updated weights for policy 0, policy_version 18080 (0.0006) [2023-03-06 21:22:51,762][62475] Updated weights for policy 0, policy_version 18090 (0.0006) [2023-03-06 21:22:52,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12732.3). Total num frames: 18531328. Throughput: 0: 12725.8. Samples: 18527238. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 21:22:52,390][62145] Avg episode reward: [(0, '426.707')] [2023-03-06 21:22:52,580][62475] Updated weights for policy 0, policy_version 18100 (0.0005) [2023-03-06 21:22:53,385][62475] Updated weights for policy 0, policy_version 18110 (0.0006) [2023-03-06 21:22:54,172][62475] Updated weights for policy 0, policy_version 18120 (0.0006) [2023-03-06 21:22:54,989][62475] Updated weights for policy 0, policy_version 18130 (0.0006) [2023-03-06 21:22:55,795][62475] Updated weights for policy 0, policy_version 18140 (0.0005) [2023-03-06 21:22:56,588][62475] Updated weights for policy 0, policy_version 18150 (0.0006) [2023-03-06 21:22:57,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12732.3). Total num frames: 18594816. Throughput: 0: 12718.1. Samples: 18565318. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:22:57,390][62145] Avg episode reward: [(0, '525.156')] [2023-03-06 21:22:57,414][62475] Updated weights for policy 0, policy_version 18160 (0.0006) [2023-03-06 21:22:58,199][62475] Updated weights for policy 0, policy_version 18170 (0.0006) [2023-03-06 21:22:59,010][62475] Updated weights for policy 0, policy_version 18180 (0.0007) [2023-03-06 21:22:59,819][62475] Updated weights for policy 0, policy_version 18190 (0.0006) [2023-03-06 21:23:00,637][62475] Updated weights for policy 0, policy_version 18200 (0.0006) [2023-03-06 21:23:01,437][62475] Updated weights for policy 0, policy_version 18210 (0.0006) [2023-03-06 21:23:02,240][62475] Updated weights for policy 0, policy_version 18220 (0.0006) [2023-03-06 21:23:02,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12728.8). Total num frames: 18658304. Throughput: 0: 12707.2. Samples: 18641670. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:23:02,390][62145] Avg episode reward: [(0, '496.102')] [2023-03-06 21:23:03,048][62475] Updated weights for policy 0, policy_version 18230 (0.0006) [2023-03-06 21:23:03,857][62475] Updated weights for policy 0, policy_version 18240 (0.0006) [2023-03-06 21:23:04,677][62475] Updated weights for policy 0, policy_version 18250 (0.0006) [2023-03-06 21:23:05,465][62475] Updated weights for policy 0, policy_version 18260 (0.0007) [2023-03-06 21:23:06,286][62475] Updated weights for policy 0, policy_version 18270 (0.0006) [2023-03-06 21:23:07,092][62475] Updated weights for policy 0, policy_version 18280 (0.0007) [2023-03-06 21:23:07,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12728.8). Total num frames: 18721792. Throughput: 0: 12705.7. Samples: 18717726. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:23:07,390][62145] Avg episode reward: [(0, '544.844')] [2023-03-06 21:23:07,886][62475] Updated weights for policy 0, policy_version 18290 (0.0006) [2023-03-06 21:23:08,694][62475] Updated weights for policy 0, policy_version 18300 (0.0007) [2023-03-06 21:23:09,512][62475] Updated weights for policy 0, policy_version 18310 (0.0006) [2023-03-06 21:23:10,321][62475] Updated weights for policy 0, policy_version 18320 (0.0006) [2023-03-06 21:23:11,130][62475] Updated weights for policy 0, policy_version 18330 (0.0006) [2023-03-06 21:23:11,936][62475] Updated weights for policy 0, policy_version 18340 (0.0007) [2023-03-06 21:23:12,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12728.8). Total num frames: 18785280. Throughput: 0: 12705.9. Samples: 18755661. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:23:12,390][62145] Avg episode reward: [(0, '690.085')] [2023-03-06 21:23:12,729][62475] Updated weights for policy 0, policy_version 18350 (0.0006) [2023-03-06 21:23:13,518][62475] Updated weights for policy 0, policy_version 18360 (0.0006) [2023-03-06 21:23:14,337][62475] Updated weights for policy 0, policy_version 18370 (0.0006) [2023-03-06 21:23:15,134][62475] Updated weights for policy 0, policy_version 18380 (0.0007) [2023-03-06 21:23:15,932][62475] Updated weights for policy 0, policy_version 18390 (0.0006) [2023-03-06 21:23:16,743][62475] Updated weights for policy 0, policy_version 18400 (0.0006) [2023-03-06 21:23:17,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12714.7, 300 sec: 12728.8). Total num frames: 18849792. Throughput: 0: 12715.5. Samples: 18832281. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:23:17,390][62145] Avg episode reward: [(0, '526.827')] [2023-03-06 21:23:17,541][62475] Updated weights for policy 0, policy_version 18410 (0.0006) [2023-03-06 21:23:18,324][62475] Updated weights for policy 0, policy_version 18420 (0.0006) [2023-03-06 21:23:19,156][62475] Updated weights for policy 0, policy_version 18430 (0.0006) [2023-03-06 21:23:19,969][62475] Updated weights for policy 0, policy_version 18440 (0.0006) [2023-03-06 21:23:20,766][62475] Updated weights for policy 0, policy_version 18450 (0.0007) [2023-03-06 21:23:21,566][62475] Updated weights for policy 0, policy_version 18460 (0.0006) [2023-03-06 21:23:22,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12714.7, 300 sec: 12728.8). Total num frames: 18913280. Throughput: 0: 12714.2. Samples: 18908601. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:23:22,390][62475] Updated weights for policy 0, policy_version 18470 (0.0006) [2023-03-06 21:23:22,390][62145] Avg episode reward: [(0, '525.320')] [2023-03-06 21:23:23,184][62475] Updated weights for policy 0, policy_version 18480 (0.0006) [2023-03-06 21:23:23,982][62475] Updated weights for policy 0, policy_version 18490 (0.0007) [2023-03-06 21:23:24,802][62475] Updated weights for policy 0, policy_version 18500 (0.0006) [2023-03-06 21:23:25,604][62475] Updated weights for policy 0, policy_version 18510 (0.0006) [2023-03-06 21:23:26,420][62475] Updated weights for policy 0, policy_version 18520 (0.0006) [2023-03-06 21:23:27,237][62475] Updated weights for policy 0, policy_version 18530 (0.0006) [2023-03-06 21:23:27,390][62145] Fps is (10 sec: 12595.1, 60 sec: 12697.6, 300 sec: 12725.4). Total num frames: 18975744. Throughput: 0: 12710.5. Samples: 18946656. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:23:27,390][62145] Avg episode reward: [(0, '622.824')] [2023-03-06 21:23:28,025][62475] Updated weights for policy 0, policy_version 18540 (0.0006) [2023-03-06 21:23:28,823][62475] Updated weights for policy 0, policy_version 18550 (0.0006) [2023-03-06 21:23:29,653][62475] Updated weights for policy 0, policy_version 18560 (0.0006) [2023-03-06 21:23:30,438][62475] Updated weights for policy 0, policy_version 18570 (0.0006) [2023-03-06 21:23:31,247][62475] Updated weights for policy 0, policy_version 18580 (0.0006) [2023-03-06 21:23:32,060][62475] Updated weights for policy 0, policy_version 18590 (0.0007) [2023-03-06 21:23:32,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12728.8). Total num frames: 19040256. Throughput: 0: 12711.4. Samples: 19022797. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 21:23:32,390][62145] Avg episode reward: [(0, '633.034')] [2023-03-06 21:23:32,862][62475] Updated weights for policy 0, policy_version 18600 (0.0006) [2023-03-06 21:23:33,681][62475] Updated weights for policy 0, policy_version 18610 (0.0006) [2023-03-06 21:23:34,490][62475] Updated weights for policy 0, policy_version 18620 (0.0007) [2023-03-06 21:23:35,280][62475] Updated weights for policy 0, policy_version 18630 (0.0006) [2023-03-06 21:23:36,093][62475] Updated weights for policy 0, policy_version 18640 (0.0006) [2023-03-06 21:23:36,887][62475] Updated weights for policy 0, policy_version 18650 (0.0007) [2023-03-06 21:23:37,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12714.6, 300 sec: 12728.8). Total num frames: 19103744. Throughput: 0: 12708.8. Samples: 19099136. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 21:23:37,390][62145] Avg episode reward: [(0, '562.405')] [2023-03-06 21:23:37,689][62475] Updated weights for policy 0, policy_version 18660 (0.0006) [2023-03-06 21:23:38,507][62475] Updated weights for policy 0, policy_version 18670 (0.0006) [2023-03-06 21:23:39,294][62475] Updated weights for policy 0, policy_version 18680 (0.0006) [2023-03-06 21:23:40,112][62475] Updated weights for policy 0, policy_version 18690 (0.0007) [2023-03-06 21:23:40,891][62475] Updated weights for policy 0, policy_version 18700 (0.0007) [2023-03-06 21:23:41,685][62475] Updated weights for policy 0, policy_version 18710 (0.0007) [2023-03-06 21:23:42,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12728.8). Total num frames: 19167232. Throughput: 0: 12710.5. Samples: 19137291. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 21:23:42,390][62145] Avg episode reward: [(0, '638.976')] [2023-03-06 21:23:42,490][62475] Updated weights for policy 0, policy_version 18720 (0.0006) [2023-03-06 21:23:43,279][62475] Updated weights for policy 0, policy_version 18730 (0.0006) [2023-03-06 21:23:44,097][62475] Updated weights for policy 0, policy_version 18740 (0.0007) [2023-03-06 21:23:44,890][62475] Updated weights for policy 0, policy_version 18750 (0.0006) [2023-03-06 21:23:45,689][62475] Updated weights for policy 0, policy_version 18760 (0.0005) [2023-03-06 21:23:46,478][62475] Updated weights for policy 0, policy_version 18770 (0.0007) [2023-03-06 21:23:47,310][62475] Updated weights for policy 0, policy_version 18780 (0.0006) [2023-03-06 21:23:47,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12728.8). Total num frames: 19230720. Throughput: 0: 12723.4. Samples: 19214221. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:23:47,390][62145] Avg episode reward: [(0, '697.353')] [2023-03-06 21:23:48,126][62475] Updated weights for policy 0, policy_version 18790 (0.0006) [2023-03-06 21:23:48,935][62475] Updated weights for policy 0, policy_version 18800 (0.0006) [2023-03-06 21:23:49,729][62475] Updated weights for policy 0, policy_version 18810 (0.0006) [2023-03-06 21:23:50,549][62475] Updated weights for policy 0, policy_version 18820 (0.0007) [2023-03-06 21:23:51,343][62475] Updated weights for policy 0, policy_version 18830 (0.0007) [2023-03-06 21:23:52,156][62475] Updated weights for policy 0, policy_version 18840 (0.0006) [2023-03-06 21:23:52,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.6, 300 sec: 12725.4). Total num frames: 19294208. Throughput: 0: 12726.1. Samples: 19290399. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:23:52,390][62145] Avg episode reward: [(0, '699.067')] [2023-03-06 21:23:52,943][62475] Updated weights for policy 0, policy_version 18850 (0.0005) [2023-03-06 21:23:53,761][62475] Updated weights for policy 0, policy_version 18860 (0.0006) [2023-03-06 21:23:54,586][62475] Updated weights for policy 0, policy_version 18870 (0.0006) [2023-03-06 21:23:55,374][62475] Updated weights for policy 0, policy_version 18880 (0.0007) [2023-03-06 21:23:56,178][62475] Updated weights for policy 0, policy_version 18890 (0.0006) [2023-03-06 21:23:56,984][62475] Updated weights for policy 0, policy_version 18900 (0.0006) [2023-03-06 21:23:57,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 19357696. Throughput: 0: 12726.2. Samples: 19328342. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:23:57,390][62145] Avg episode reward: [(0, '620.594')] [2023-03-06 21:23:57,797][62475] Updated weights for policy 0, policy_version 18910 (0.0007) [2023-03-06 21:23:58,587][62475] Updated weights for policy 0, policy_version 18920 (0.0006) [2023-03-06 21:23:59,394][62475] Updated weights for policy 0, policy_version 18930 (0.0006) [2023-03-06 21:24:00,197][62475] Updated weights for policy 0, policy_version 18940 (0.0006) [2023-03-06 21:24:00,997][62475] Updated weights for policy 0, policy_version 18950 (0.0007) [2023-03-06 21:24:01,804][62475] Updated weights for policy 0, policy_version 18960 (0.0006) [2023-03-06 21:24:02,390][62145] Fps is (10 sec: 12800.1, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 19422208. Throughput: 0: 12725.3. Samples: 19404920. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:24:02,390][62145] Avg episode reward: [(0, '633.936')] [2023-03-06 21:24:02,603][62475] Updated weights for policy 0, policy_version 18970 (0.0006) [2023-03-06 21:24:03,402][62475] Updated weights for policy 0, policy_version 18980 (0.0007) [2023-03-06 21:24:04,223][62475] Updated weights for policy 0, policy_version 18990 (0.0006) [2023-03-06 21:24:05,031][62475] Updated weights for policy 0, policy_version 19000 (0.0008) [2023-03-06 21:24:05,187][62424] KL-divergence is very high: 1013.4500 [2023-03-06 21:24:05,838][62475] Updated weights for policy 0, policy_version 19010 (0.0006) [2023-03-06 21:24:06,641][62475] Updated weights for policy 0, policy_version 19020 (0.0006) [2023-03-06 21:24:07,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 19485696. Throughput: 0: 12722.1. Samples: 19481095. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:24:07,390][62145] Avg episode reward: [(0, '519.167')] [2023-03-06 21:24:07,458][62475] Updated weights for policy 0, policy_version 19030 (0.0006) [2023-03-06 21:24:08,267][62475] Updated weights for policy 0, policy_version 19040 (0.0007) [2023-03-06 21:24:09,057][62475] Updated weights for policy 0, policy_version 19050 (0.0007) [2023-03-06 21:24:09,870][62475] Updated weights for policy 0, policy_version 19060 (0.0006) [2023-03-06 21:24:10,657][62475] Updated weights for policy 0, policy_version 19070 (0.0007) [2023-03-06 21:24:10,817][62424] KL-divergence is very high: 504.7644 [2023-03-06 21:24:11,462][62475] Updated weights for policy 0, policy_version 19080 (0.0006) [2023-03-06 21:24:11,547][62424] KL-divergence is very high: 10361.0986 [2023-03-06 21:24:12,262][62475] Updated weights for policy 0, policy_version 19090 (0.0006) [2023-03-06 21:24:12,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 19549184. Throughput: 0: 12723.7. Samples: 19519221. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:24:12,390][62145] Avg episode reward: [(0, '620.749')] [2023-03-06 21:24:12,674][62424] KL-divergence is very high: 4200.0171 [2023-03-06 21:24:13,058][62475] Updated weights for policy 0, policy_version 19100 (0.0006) [2023-03-06 21:24:13,865][62475] Updated weights for policy 0, policy_version 19110 (0.0006) [2023-03-06 21:24:14,673][62475] Updated weights for policy 0, policy_version 19120 (0.0006) [2023-03-06 21:24:15,152][62424] KL-divergence is very high: 186.1980 [2023-03-06 21:24:15,486][62475] Updated weights for policy 0, policy_version 19130 (0.0006) [2023-03-06 21:24:16,292][62475] Updated weights for policy 0, policy_version 19140 (0.0006) [2023-03-06 21:24:16,693][62424] KL-divergence is very high: 295288.9688 [2023-03-06 21:24:16,853][62424] KL-divergence is very high: 120757.2188 [2023-03-06 21:24:17,016][62424] KL-divergence is very high: 423684.5625 [2023-03-06 21:24:17,099][62475] Updated weights for policy 0, policy_version 19150 (0.0006) [2023-03-06 21:24:17,182][62424] KL-divergence is very high: 1049.2576 [2023-03-06 21:24:17,338][62424] KL-divergence is very high: 160.1388 [2023-03-06 21:24:17,390][62145] Fps is (10 sec: 12697.4, 60 sec: 12714.6, 300 sec: 12725.4). Total num frames: 19612672. Throughput: 0: 12729.5. Samples: 19595628. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:24:17,390][62145] Avg episode reward: [(0, '577.100')] [2023-03-06 21:24:17,661][62424] KL-divergence is very high: 3253.5742 [2023-03-06 21:24:17,816][62424] KL-divergence is very high: 846.5096 [2023-03-06 21:24:17,909][62424] KL-divergence is very high: 6532.4102 [2023-03-06 21:24:17,917][62475] Updated weights for policy 0, policy_version 19160 (0.0006) [2023-03-06 21:24:18,149][62424] KL-divergence is very high: 3673.1555 [2023-03-06 21:24:18,721][62475] Updated weights for policy 0, policy_version 19170 (0.0007) [2023-03-06 21:24:19,520][62475] Updated weights for policy 0, policy_version 19180 (0.0007) [2023-03-06 21:24:20,254][62424] KL-divergence is very high: 4110628.0000 [2023-03-06 21:24:20,338][62475] Updated weights for policy 0, policy_version 19190 (0.0007) [2023-03-06 21:24:21,146][62475] Updated weights for policy 0, policy_version 19200 (0.0006) [2023-03-06 21:24:21,948][62475] Updated weights for policy 0, policy_version 19210 (0.0006) [2023-03-06 21:24:22,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 19676160. Throughput: 0: 12728.1. Samples: 19671900. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:24:22,390][62145] Avg episode reward: [(0, '507.513')] [2023-03-06 21:24:22,394][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000019215_19676160.pth... [2023-03-06 21:24:22,425][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000016233_16622592.pth [2023-03-06 21:24:22,757][62475] Updated weights for policy 0, policy_version 19220 (0.0007) [2023-03-06 21:24:23,549][62475] Updated weights for policy 0, policy_version 19230 (0.0006) [2023-03-06 21:24:24,364][62475] Updated weights for policy 0, policy_version 19240 (0.0006) [2023-03-06 21:24:25,165][62475] Updated weights for policy 0, policy_version 19250 (0.0007) [2023-03-06 21:24:25,989][62475] Updated weights for policy 0, policy_version 19260 (0.0006) [2023-03-06 21:24:26,800][62475] Updated weights for policy 0, policy_version 19270 (0.0006) [2023-03-06 21:24:27,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 19739648. Throughput: 0: 12726.9. Samples: 19710000. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 21:24:27,390][62145] Avg episode reward: [(0, '667.644')] [2023-03-06 21:24:27,619][62475] Updated weights for policy 0, policy_version 19280 (0.0006) [2023-03-06 21:24:28,250][62424] KL-divergence is very high: 28427.4121 [2023-03-06 21:24:28,409][62475] Updated weights for policy 0, policy_version 19290 (0.0006) [2023-03-06 21:24:29,204][62475] Updated weights for policy 0, policy_version 19300 (0.0006) [2023-03-06 21:24:30,009][62475] Updated weights for policy 0, policy_version 19310 (0.0006) [2023-03-06 21:24:30,099][62424] KL-divergence is very high: 2698268.5000 [2023-03-06 21:24:30,842][62475] Updated weights for policy 0, policy_version 19320 (0.0006) [2023-03-06 21:24:31,627][62475] Updated weights for policy 0, policy_version 19330 (0.0006) [2023-03-06 21:24:32,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.6, 300 sec: 12721.9). Total num frames: 19803136. Throughput: 0: 12704.9. Samples: 19785942. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 21:24:32,390][62145] Avg episode reward: [(0, '642.507')] [2023-03-06 21:24:32,432][62475] Updated weights for policy 0, policy_version 19340 (0.0006) [2023-03-06 21:24:33,255][62475] Updated weights for policy 0, policy_version 19350 (0.0006) [2023-03-06 21:24:34,074][62475] Updated weights for policy 0, policy_version 19360 (0.0006) [2023-03-06 21:24:34,858][62424] KL-divergence is very high: 381770.2812 [2023-03-06 21:24:34,866][62475] Updated weights for policy 0, policy_version 19370 (0.0006) [2023-03-06 21:24:35,027][62424] KL-divergence is very high: 161087.4219 [2023-03-06 21:24:35,191][62424] KL-divergence is very high: 1322.5791 [2023-03-06 21:24:35,675][62475] Updated weights for policy 0, policy_version 19380 (0.0006) [2023-03-06 21:24:36,470][62475] Updated weights for policy 0, policy_version 19390 (0.0006) [2023-03-06 21:24:37,289][62475] Updated weights for policy 0, policy_version 19400 (0.0006) [2023-03-06 21:24:37,390][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12721.9). Total num frames: 19866624. Throughput: 0: 12701.9. Samples: 19861982. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:24:37,390][62145] Avg episode reward: [(0, '550.275')] [2023-03-06 21:24:37,438][62424] KL-divergence is very high: 30296.8086 [2023-03-06 21:24:38,088][62475] Updated weights for policy 0, policy_version 19410 (0.0006) [2023-03-06 21:24:38,886][62475] Updated weights for policy 0, policy_version 19420 (0.0006) [2023-03-06 21:24:39,702][62475] Updated weights for policy 0, policy_version 19430 (0.0005) [2023-03-06 21:24:40,501][62475] Updated weights for policy 0, policy_version 19440 (0.0006) [2023-03-06 21:24:41,298][62475] Updated weights for policy 0, policy_version 19450 (0.0006) [2023-03-06 21:24:42,121][62475] Updated weights for policy 0, policy_version 19460 (0.0007) [2023-03-06 21:24:42,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12721.9). Total num frames: 19930112. Throughput: 0: 12709.2. Samples: 19900255. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:24:42,390][62145] Avg episode reward: [(0, '567.952')] [2023-03-06 21:24:42,517][62424] KL-divergence is very high: 1776.5208 [2023-03-06 21:24:42,955][62475] Updated weights for policy 0, policy_version 19470 (0.0006) [2023-03-06 21:24:43,677][62424] KL-divergence is very high: 1342.4519 [2023-03-06 21:24:43,758][62475] Updated weights for policy 0, policy_version 19480 (0.0006) [2023-03-06 21:24:44,567][62475] Updated weights for policy 0, policy_version 19490 (0.0006) [2023-03-06 21:24:45,383][62475] Updated weights for policy 0, policy_version 19500 (0.0006) [2023-03-06 21:24:46,185][62475] Updated weights for policy 0, policy_version 19510 (0.0006) [2023-03-06 21:24:46,990][62475] Updated weights for policy 0, policy_version 19520 (0.0006) [2023-03-06 21:24:47,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12721.9). Total num frames: 19993600. Throughput: 0: 12690.1. Samples: 19975974. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:24:47,390][62145] Avg episode reward: [(0, '490.440')] [2023-03-06 21:24:47,806][62475] Updated weights for policy 0, policy_version 19530 (0.0006) [2023-03-06 21:24:48,602][62475] Updated weights for policy 0, policy_version 19540 (0.0006) [2023-03-06 21:24:49,389][62475] Updated weights for policy 0, policy_version 19550 (0.0006) [2023-03-06 21:24:49,949][62424] KL-divergence is very high: 1714.0994 [2023-03-06 21:24:50,108][62424] KL-divergence is very high: 20733.5312 [2023-03-06 21:24:50,198][62475] Updated weights for policy 0, policy_version 19560 (0.0006) [2023-03-06 21:24:50,269][62424] KL-divergence is very high: 11389.8398 [2023-03-06 21:24:50,995][62475] Updated weights for policy 0, policy_version 19570 (0.0006) [2023-03-06 21:24:51,470][62424] KL-divergence is very high: 100306.2969 [2023-03-06 21:24:51,623][62424] KL-divergence is very high: 3548.1499 [2023-03-06 21:24:51,792][62475] Updated weights for policy 0, policy_version 19580 (0.0006) [2023-03-06 21:24:52,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12721.9). Total num frames: 20057088. Throughput: 0: 12702.0. Samples: 20052687. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:24:52,390][62145] Avg episode reward: [(0, '594.861')] [2023-03-06 21:24:52,588][62475] Updated weights for policy 0, policy_version 19590 (0.0007) [2023-03-06 21:24:53,413][62475] Updated weights for policy 0, policy_version 19600 (0.0006) [2023-03-06 21:24:54,210][62475] Updated weights for policy 0, policy_version 19610 (0.0007) [2023-03-06 21:24:55,007][62475] Updated weights for policy 0, policy_version 19620 (0.0006) [2023-03-06 21:24:55,837][62475] Updated weights for policy 0, policy_version 19630 (0.0006) [2023-03-06 21:24:56,635][62475] Updated weights for policy 0, policy_version 19640 (0.0007) [2023-03-06 21:24:57,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12721.9). Total num frames: 20120576. Throughput: 0: 12700.2. Samples: 20090728. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:24:57,401][62145] Avg episode reward: [(0, '636.606')] [2023-03-06 21:24:57,449][62475] Updated weights for policy 0, policy_version 19650 (0.0006) [2023-03-06 21:24:58,258][62475] Updated weights for policy 0, policy_version 19660 (0.0006) [2023-03-06 21:24:59,066][62475] Updated weights for policy 0, policy_version 19670 (0.0006) [2023-03-06 21:24:59,870][62475] Updated weights for policy 0, policy_version 19680 (0.0007) [2023-03-06 21:25:00,676][62475] Updated weights for policy 0, policy_version 19690 (0.0007) [2023-03-06 21:25:01,497][62475] Updated weights for policy 0, policy_version 19700 (0.0006) [2023-03-06 21:25:02,289][62475] Updated weights for policy 0, policy_version 19710 (0.0007) [2023-03-06 21:25:02,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12721.9). Total num frames: 20184064. Throughput: 0: 12691.7. Samples: 20166754. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:25:02,401][62145] Avg episode reward: [(0, '461.845')] [2023-03-06 21:25:02,523][62424] KL-divergence is very high: 3900.4170 [2023-03-06 21:25:03,080][62475] Updated weights for policy 0, policy_version 19720 (0.0006) [2023-03-06 21:25:03,906][62475] Updated weights for policy 0, policy_version 19730 (0.0006) [2023-03-06 21:25:04,716][62475] Updated weights for policy 0, policy_version 19740 (0.0006) [2023-03-06 21:25:05,521][62475] Updated weights for policy 0, policy_version 19750 (0.0006) [2023-03-06 21:25:06,329][62475] Updated weights for policy 0, policy_version 19760 (0.0006) [2023-03-06 21:25:06,640][62424] KL-divergence is very high: 920220.3750 [2023-03-06 21:25:07,121][62475] Updated weights for policy 0, policy_version 19770 (0.0006) [2023-03-06 21:25:07,363][62424] KL-divergence is very high: 10758.7871 [2023-03-06 21:25:07,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12721.9). Total num frames: 20247552. Throughput: 0: 12694.0. Samples: 20243131. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:25:07,401][62145] Avg episode reward: [(0, '599.081')] [2023-03-06 21:25:07,523][62424] KL-divergence is very high: 3956.9028 [2023-03-06 21:25:07,923][62475] Updated weights for policy 0, policy_version 19780 (0.0006) [2023-03-06 21:25:08,734][62475] Updated weights for policy 0, policy_version 19790 (0.0007) [2023-03-06 21:25:09,532][62475] Updated weights for policy 0, policy_version 19800 (0.0006) [2023-03-06 21:25:10,334][62475] Updated weights for policy 0, policy_version 19810 (0.0006) [2023-03-06 21:25:11,137][62475] Updated weights for policy 0, policy_version 19820 (0.0007) [2023-03-06 21:25:11,951][62475] Updated weights for policy 0, policy_version 19830 (0.0006) [2023-03-06 21:25:12,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12721.9). Total num frames: 20311040. Throughput: 0: 12692.5. Samples: 20281162. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:25:12,400][62145] Avg episode reward: [(0, '660.623')] [2023-03-06 21:25:12,511][62424] KL-divergence is very high: 1957.5258 [2023-03-06 21:25:12,754][62475] Updated weights for policy 0, policy_version 19840 (0.0007) [2023-03-06 21:25:13,327][62424] KL-divergence is very high: 251908.0938 [2023-03-06 21:25:13,558][62475] Updated weights for policy 0, policy_version 19850 (0.0006) [2023-03-06 21:25:14,374][62475] Updated weights for policy 0, policy_version 19860 (0.0006) [2023-03-06 21:25:15,164][62475] Updated weights for policy 0, policy_version 19870 (0.0006) [2023-03-06 21:25:15,971][62475] Updated weights for policy 0, policy_version 19880 (0.0006) [2023-03-06 21:25:16,773][62475] Updated weights for policy 0, policy_version 19890 (0.0006) [2023-03-06 21:25:16,843][62424] KL-divergence is very high: 127.6547 [2023-03-06 21:25:16,994][62424] KL-divergence is very high: 2721.2554 [2023-03-06 21:25:17,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12721.9). Total num frames: 20374528. Throughput: 0: 12703.9. Samples: 20357616. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:25:17,401][62145] Avg episode reward: [(0, '705.499')] [2023-03-06 21:25:17,573][62475] Updated weights for policy 0, policy_version 19900 (0.0006) [2023-03-06 21:25:18,376][62475] Updated weights for policy 0, policy_version 19910 (0.0006) [2023-03-06 21:25:19,193][62475] Updated weights for policy 0, policy_version 19920 (0.0006) [2023-03-06 21:25:20,003][62475] Updated weights for policy 0, policy_version 19930 (0.0007) [2023-03-06 21:25:20,802][62475] Updated weights for policy 0, policy_version 19940 (0.0006) [2023-03-06 21:25:21,587][62475] Updated weights for policy 0, policy_version 19950 (0.0007) [2023-03-06 21:25:22,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12718.4). Total num frames: 20438016. Throughput: 0: 12713.6. Samples: 20434093. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:25:22,401][62145] Avg episode reward: [(0, '636.924')] [2023-03-06 21:25:22,429][62475] Updated weights for policy 0, policy_version 19960 (0.0007) [2023-03-06 21:25:22,738][62424] KL-divergence is very high: 166.1560 [2023-03-06 21:25:23,215][62475] Updated weights for policy 0, policy_version 19970 (0.0005) [2023-03-06 21:25:24,029][62475] Updated weights for policy 0, policy_version 19980 (0.0006) [2023-03-06 21:25:24,847][62475] Updated weights for policy 0, policy_version 19990 (0.0006) [2023-03-06 21:25:25,644][62475] Updated weights for policy 0, policy_version 20000 (0.0006) [2023-03-06 21:25:26,462][62475] Updated weights for policy 0, policy_version 20010 (0.0007) [2023-03-06 21:25:26,843][62424] KL-divergence is very high: 131.4161 [2023-03-06 21:25:27,277][62475] Updated weights for policy 0, policy_version 20020 (0.0006) [2023-03-06 21:25:27,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12718.4). Total num frames: 20501504. Throughput: 0: 12703.9. Samples: 20471930. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:25:27,401][62145] Avg episode reward: [(0, '657.056')] [2023-03-06 21:25:27,795][62424] KL-divergence is very high: 33421.0430 [2023-03-06 21:25:28,070][62475] Updated weights for policy 0, policy_version 20030 (0.0006) [2023-03-06 21:25:28,873][62475] Updated weights for policy 0, policy_version 20040 (0.0006) [2023-03-06 21:25:29,678][62475] Updated weights for policy 0, policy_version 20050 (0.0007) [2023-03-06 21:25:30,235][62424] KL-divergence is very high: 6617.8442 [2023-03-06 21:25:30,494][62475] Updated weights for policy 0, policy_version 20060 (0.0006) [2023-03-06 21:25:31,299][62475] Updated weights for policy 0, policy_version 20070 (0.0006) [2023-03-06 21:25:32,107][62424] KL-divergence is very high: 2049.4529 [2023-03-06 21:25:32,114][62475] Updated weights for policy 0, policy_version 20080 (0.0007) [2023-03-06 21:25:32,390][62145] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12718.4). Total num frames: 20564992. Throughput: 0: 12711.6. Samples: 20547998. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:25:32,400][62145] Avg episode reward: [(0, '663.881')] [2023-03-06 21:25:32,914][62475] Updated weights for policy 0, policy_version 20090 (0.0006) [2023-03-06 21:25:33,717][62475] Updated weights for policy 0, policy_version 20100 (0.0006) [2023-03-06 21:25:34,529][62475] Updated weights for policy 0, policy_version 20110 (0.0006) [2023-03-06 21:25:35,327][62475] Updated weights for policy 0, policy_version 20120 (0.0006) [2023-03-06 21:25:35,656][62424] KL-divergence is very high: 593.6144 [2023-03-06 21:25:35,817][62424] KL-divergence is very high: 4800.8062 [2023-03-06 21:25:36,157][62475] Updated weights for policy 0, policy_version 20130 (0.0006) [2023-03-06 21:25:36,953][62475] Updated weights for policy 0, policy_version 20140 (0.0007) [2023-03-06 21:25:37,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12718.4). Total num frames: 20628480. Throughput: 0: 12700.3. Samples: 20624200. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:25:37,401][62145] Avg episode reward: [(0, '818.800')] [2023-03-06 21:25:37,402][62424] Saving new best policy, reward=818.800! [2023-03-06 21:25:37,769][62475] Updated weights for policy 0, policy_version 20150 (0.0007) [2023-03-06 21:25:38,146][62424] KL-divergence is very high: 1388.5995 [2023-03-06 21:25:38,579][62475] Updated weights for policy 0, policy_version 20160 (0.0006) [2023-03-06 21:25:39,380][62475] Updated weights for policy 0, policy_version 20170 (0.0006) [2023-03-06 21:25:39,683][62424] KL-divergence is very high: 59439.3633 [2023-03-06 21:25:40,179][62475] Updated weights for policy 0, policy_version 20180 (0.0006) [2023-03-06 21:25:40,981][62475] Updated weights for policy 0, policy_version 20190 (0.0006) [2023-03-06 21:25:41,792][62475] Updated weights for policy 0, policy_version 20200 (0.0006) [2023-03-06 21:25:42,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12718.4). Total num frames: 20691968. Throughput: 0: 12697.5. Samples: 20662117. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:25:42,401][62145] Avg episode reward: [(0, '771.245')] [2023-03-06 21:25:42,597][62475] Updated weights for policy 0, policy_version 20210 (0.0006) [2023-03-06 21:25:43,425][62475] Updated weights for policy 0, policy_version 20220 (0.0006) [2023-03-06 21:25:44,223][62475] Updated weights for policy 0, policy_version 20230 (0.0006) [2023-03-06 21:25:45,026][62475] Updated weights for policy 0, policy_version 20240 (0.0006) [2023-03-06 21:25:45,849][62475] Updated weights for policy 0, policy_version 20250 (0.0006) [2023-03-06 21:25:46,637][62475] Updated weights for policy 0, policy_version 20260 (0.0006) [2023-03-06 21:25:47,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12718.4). Total num frames: 20755456. Throughput: 0: 12695.8. Samples: 20738064. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:25:47,401][62145] Avg episode reward: [(0, '643.180')] [2023-03-06 21:25:47,459][62475] Updated weights for policy 0, policy_version 20270 (0.0006) [2023-03-06 21:25:48,243][62475] Updated weights for policy 0, policy_version 20280 (0.0007) [2023-03-06 21:25:48,504][62424] KL-divergence is very high: 1679.1611 [2023-03-06 21:25:49,066][62475] Updated weights for policy 0, policy_version 20290 (0.0006) [2023-03-06 21:25:49,882][62475] Updated weights for policy 0, policy_version 20300 (0.0006) [2023-03-06 21:25:50,678][62475] Updated weights for policy 0, policy_version 20310 (0.0007) [2023-03-06 21:25:51,413][62424] KL-divergence is very high: 11360.0391 [2023-03-06 21:25:51,496][62475] Updated weights for policy 0, policy_version 20320 (0.0006) [2023-03-06 21:25:52,293][62475] Updated weights for policy 0, policy_version 20330 (0.0006) [2023-03-06 21:25:52,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12715.0). Total num frames: 20818944. Throughput: 0: 12689.3. Samples: 20814148. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:25:52,390][62145] Avg episode reward: [(0, '601.947')] [2023-03-06 21:25:53,076][62475] Updated weights for policy 0, policy_version 20340 (0.0006) [2023-03-06 21:25:53,925][62475] Updated weights for policy 0, policy_version 20350 (0.0006) [2023-03-06 21:25:54,719][62475] Updated weights for policy 0, policy_version 20360 (0.0006) [2023-03-06 21:25:55,359][62424] KL-divergence is very high: 25300.6680 [2023-03-06 21:25:55,530][62475] Updated weights for policy 0, policy_version 20370 (0.0006) [2023-03-06 21:25:56,348][62475] Updated weights for policy 0, policy_version 20380 (0.0006) [2023-03-06 21:25:57,141][62475] Updated weights for policy 0, policy_version 20390 (0.0006) [2023-03-06 21:25:57,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12715.0). Total num frames: 20882432. Throughput: 0: 12692.1. Samples: 20852308. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:25:57,390][62145] Avg episode reward: [(0, '668.818')] [2023-03-06 21:25:57,776][62424] KL-divergence is very high: 161.0611 [2023-03-06 21:25:57,943][62475] Updated weights for policy 0, policy_version 20400 (0.0006) [2023-03-06 21:25:58,758][62475] Updated weights for policy 0, policy_version 20410 (0.0007) [2023-03-06 21:25:59,550][62475] Updated weights for policy 0, policy_version 20420 (0.0006) [2023-03-06 21:26:00,341][62475] Updated weights for policy 0, policy_version 20430 (0.0007) [2023-03-06 21:26:01,185][62475] Updated weights for policy 0, policy_version 20440 (0.0006) [2023-03-06 21:26:01,566][62424] KL-divergence is very high: 45314.3320 [2023-03-06 21:26:01,980][62475] Updated weights for policy 0, policy_version 20450 (0.0006) [2023-03-06 21:26:02,390][62145] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12715.0). Total num frames: 20945920. Throughput: 0: 12688.2. Samples: 20928585. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 21:26:02,390][62145] Avg episode reward: [(0, '648.378')] [2023-03-06 21:26:02,782][62475] Updated weights for policy 0, policy_version 20460 (0.0006) [2023-03-06 21:26:03,584][62475] Updated weights for policy 0, policy_version 20470 (0.0006) [2023-03-06 21:26:04,379][62475] Updated weights for policy 0, policy_version 20480 (0.0006) [2023-03-06 21:26:05,166][62475] Updated weights for policy 0, policy_version 20490 (0.0007) [2023-03-06 21:26:05,998][62475] Updated weights for policy 0, policy_version 20500 (0.0006) [2023-03-06 21:26:06,819][62475] Updated weights for policy 0, policy_version 20510 (0.0008) [2023-03-06 21:26:07,390][62145] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12715.0). Total num frames: 21009408. Throughput: 0: 12683.2. Samples: 21004838. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 21:26:07,390][62145] Avg episode reward: [(0, '827.631')] [2023-03-06 21:26:07,390][62424] Saving new best policy, reward=827.631! [2023-03-06 21:26:07,606][62475] Updated weights for policy 0, policy_version 20520 (0.0006) [2023-03-06 21:26:08,421][62475] Updated weights for policy 0, policy_version 20530 (0.0007) [2023-03-06 21:26:09,229][62475] Updated weights for policy 0, policy_version 20540 (0.0006) [2023-03-06 21:26:10,044][62475] Updated weights for policy 0, policy_version 20550 (0.0006) [2023-03-06 21:26:10,854][62475] Updated weights for policy 0, policy_version 20560 (0.0007) [2023-03-06 21:26:11,656][62475] Updated weights for policy 0, policy_version 20570 (0.0008) [2023-03-06 21:26:12,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12715.0). Total num frames: 21072896. Throughput: 0: 12687.9. Samples: 21042888. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:26:12,390][62145] Avg episode reward: [(0, '763.473')] [2023-03-06 21:26:12,455][62475] Updated weights for policy 0, policy_version 20580 (0.0007) [2023-03-06 21:26:13,262][62475] Updated weights for policy 0, policy_version 20590 (0.0006) [2023-03-06 21:26:14,074][62475] Updated weights for policy 0, policy_version 20600 (0.0007) [2023-03-06 21:26:14,883][62475] Updated weights for policy 0, policy_version 20610 (0.0007) [2023-03-06 21:26:15,699][62475] Updated weights for policy 0, policy_version 20620 (0.0006) [2023-03-06 21:26:16,483][62475] Updated weights for policy 0, policy_version 20630 (0.0006) [2023-03-06 21:26:17,293][62475] Updated weights for policy 0, policy_version 20640 (0.0006) [2023-03-06 21:26:17,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12711.5). Total num frames: 21136384. Throughput: 0: 12687.0. Samples: 21118915. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:26:17,390][62145] Avg episode reward: [(0, '820.912')] [2023-03-06 21:26:18,088][62475] Updated weights for policy 0, policy_version 20650 (0.0007) [2023-03-06 21:26:18,897][62475] Updated weights for policy 0, policy_version 20660 (0.0006) [2023-03-06 21:26:19,698][62475] Updated weights for policy 0, policy_version 20670 (0.0006) [2023-03-06 21:26:20,499][62475] Updated weights for policy 0, policy_version 20680 (0.0007) [2023-03-06 21:26:21,301][62475] Updated weights for policy 0, policy_version 20690 (0.0007) [2023-03-06 21:26:22,109][62475] Updated weights for policy 0, policy_version 20700 (0.0006) [2023-03-06 21:26:22,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12711.5). Total num frames: 21199872. Throughput: 0: 12695.7. Samples: 21195510. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:26:22,390][62145] Avg episode reward: [(0, '820.972')] [2023-03-06 21:26:22,393][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000020703_21199872.pth... [2023-03-06 21:26:22,426][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000017725_18150400.pth [2023-03-06 21:26:22,922][62475] Updated weights for policy 0, policy_version 20710 (0.0006) [2023-03-06 21:26:23,732][62475] Updated weights for policy 0, policy_version 20720 (0.0006) [2023-03-06 21:26:24,539][62475] Updated weights for policy 0, policy_version 20730 (0.0006) [2023-03-06 21:26:25,341][62475] Updated weights for policy 0, policy_version 20740 (0.0006) [2023-03-06 21:26:26,145][62475] Updated weights for policy 0, policy_version 20750 (0.0006) [2023-03-06 21:26:26,932][62475] Updated weights for policy 0, policy_version 20760 (0.0006) [2023-03-06 21:26:27,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12711.5). Total num frames: 21263360. Throughput: 0: 12698.5. Samples: 21233547. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:26:27,390][62145] Avg episode reward: [(0, '912.932')] [2023-03-06 21:26:27,390][62424] Saving new best policy, reward=912.932! [2023-03-06 21:26:27,753][62475] Updated weights for policy 0, policy_version 20770 (0.0006) [2023-03-06 21:26:28,538][62475] Updated weights for policy 0, policy_version 20780 (0.0006) [2023-03-06 21:26:29,342][62475] Updated weights for policy 0, policy_version 20790 (0.0006) [2023-03-06 21:26:30,147][62475] Updated weights for policy 0, policy_version 20800 (0.0006) [2023-03-06 21:26:30,945][62475] Updated weights for policy 0, policy_version 20810 (0.0006) [2023-03-06 21:26:31,749][62475] Updated weights for policy 0, policy_version 20820 (0.0006) [2023-03-06 21:26:32,390][62145] Fps is (10 sec: 12800.1, 60 sec: 12714.7, 300 sec: 12715.0). Total num frames: 21327872. Throughput: 0: 12714.4. Samples: 21310214. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:26:32,390][62145] Avg episode reward: [(0, '667.539')] [2023-03-06 21:26:32,553][62475] Updated weights for policy 0, policy_version 20830 (0.0006) [2023-03-06 21:26:33,346][62475] Updated weights for policy 0, policy_version 20840 (0.0006) [2023-03-06 21:26:34,150][62475] Updated weights for policy 0, policy_version 20850 (0.0006) [2023-03-06 21:26:34,965][62475] Updated weights for policy 0, policy_version 20860 (0.0006) [2023-03-06 21:26:35,748][62475] Updated weights for policy 0, policy_version 20870 (0.0007) [2023-03-06 21:26:36,556][62475] Updated weights for policy 0, policy_version 20880 (0.0006) [2023-03-06 21:26:37,376][62475] Updated weights for policy 0, policy_version 20890 (0.0006) [2023-03-06 21:26:37,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12714.7, 300 sec: 12715.0). Total num frames: 21391360. Throughput: 0: 12723.0. Samples: 21386682. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:26:37,390][62145] Avg episode reward: [(0, '879.334')] [2023-03-06 21:26:38,168][62475] Updated weights for policy 0, policy_version 20900 (0.0007) [2023-03-06 21:26:38,967][62475] Updated weights for policy 0, policy_version 20910 (0.0006) [2023-03-06 21:26:39,782][62475] Updated weights for policy 0, policy_version 20920 (0.0006) [2023-03-06 21:26:40,599][62475] Updated weights for policy 0, policy_version 20930 (0.0006) [2023-03-06 21:26:41,408][62475] Updated weights for policy 0, policy_version 20940 (0.0006) [2023-03-06 21:26:42,215][62475] Updated weights for policy 0, policy_version 20950 (0.0006) [2023-03-06 21:26:42,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12715.0). Total num frames: 21454848. Throughput: 0: 12725.0. Samples: 21424933. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:26:42,390][62145] Avg episode reward: [(0, '834.134')] [2023-03-06 21:26:43,016][62475] Updated weights for policy 0, policy_version 20960 (0.0006) [2023-03-06 21:26:43,861][62475] Updated weights for policy 0, policy_version 20970 (0.0007) [2023-03-06 21:26:44,642][62475] Updated weights for policy 0, policy_version 20980 (0.0006) [2023-03-06 21:26:45,447][62475] Updated weights for policy 0, policy_version 20990 (0.0006) [2023-03-06 21:26:46,244][62475] Updated weights for policy 0, policy_version 21000 (0.0006) [2023-03-06 21:26:47,064][62475] Updated weights for policy 0, policy_version 21010 (0.0007) [2023-03-06 21:26:47,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12711.5). Total num frames: 21518336. Throughput: 0: 12722.2. Samples: 21501082. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:26:47,390][62145] Avg episode reward: [(0, '836.470')] [2023-03-06 21:26:47,874][62475] Updated weights for policy 0, policy_version 21020 (0.0006) [2023-03-06 21:26:48,690][62475] Updated weights for policy 0, policy_version 21030 (0.0006) [2023-03-06 21:26:49,500][62475] Updated weights for policy 0, policy_version 21040 (0.0006) [2023-03-06 21:26:50,306][62475] Updated weights for policy 0, policy_version 21050 (0.0006) [2023-03-06 21:26:51,118][62475] Updated weights for policy 0, policy_version 21060 (0.0006) [2023-03-06 21:26:51,921][62475] Updated weights for policy 0, policy_version 21070 (0.0006) [2023-03-06 21:26:52,390][62145] Fps is (10 sec: 12595.3, 60 sec: 12697.6, 300 sec: 12708.0). Total num frames: 21580800. Throughput: 0: 12711.0. Samples: 21576835. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:26:52,390][62145] Avg episode reward: [(0, '686.998')] [2023-03-06 21:26:52,717][62475] Updated weights for policy 0, policy_version 21080 (0.0006) [2023-03-06 21:26:53,526][62475] Updated weights for policy 0, policy_version 21090 (0.0006) [2023-03-06 21:26:54,351][62475] Updated weights for policy 0, policy_version 21100 (0.0007) [2023-03-06 21:26:55,150][62475] Updated weights for policy 0, policy_version 21110 (0.0006) [2023-03-06 21:26:55,962][62475] Updated weights for policy 0, policy_version 21120 (0.0006) [2023-03-06 21:26:56,763][62475] Updated weights for policy 0, policy_version 21130 (0.0006) [2023-03-06 21:26:57,390][62145] Fps is (10 sec: 12595.1, 60 sec: 12697.6, 300 sec: 12708.0). Total num frames: 21644288. Throughput: 0: 12713.5. Samples: 21614992. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:26:57,390][62145] Avg episode reward: [(0, '585.575')] [2023-03-06 21:26:57,572][62475] Updated weights for policy 0, policy_version 21140 (0.0005) [2023-03-06 21:26:58,372][62475] Updated weights for policy 0, policy_version 21150 (0.0007) [2023-03-06 21:26:59,180][62475] Updated weights for policy 0, policy_version 21160 (0.0005) [2023-03-06 21:26:59,994][62475] Updated weights for policy 0, policy_version 21170 (0.0007) [2023-03-06 21:27:00,825][62475] Updated weights for policy 0, policy_version 21180 (0.0006) [2023-03-06 21:27:01,626][62475] Updated weights for policy 0, policy_version 21190 (0.0006) [2023-03-06 21:27:02,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12708.0). Total num frames: 21707776. Throughput: 0: 12708.2. Samples: 21690781. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:27:02,390][62145] Avg episode reward: [(0, '1065.116')] [2023-03-06 21:27:02,394][62424] Saving new best policy, reward=1065.116! [2023-03-06 21:27:02,450][62475] Updated weights for policy 0, policy_version 21200 (0.0007) [2023-03-06 21:27:03,229][62475] Updated weights for policy 0, policy_version 21210 (0.0005) [2023-03-06 21:27:04,059][62475] Updated weights for policy 0, policy_version 21220 (0.0007) [2023-03-06 21:27:04,857][62475] Updated weights for policy 0, policy_version 21230 (0.0006) [2023-03-06 21:27:05,662][62475] Updated weights for policy 0, policy_version 21240 (0.0006) [2023-03-06 21:27:06,470][62475] Updated weights for policy 0, policy_version 21250 (0.0006) [2023-03-06 21:27:07,270][62475] Updated weights for policy 0, policy_version 21260 (0.0007) [2023-03-06 21:27:07,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12704.5). Total num frames: 21771264. Throughput: 0: 12699.2. Samples: 21766974. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:27:07,390][62145] Avg episode reward: [(0, '833.478')] [2023-03-06 21:27:08,078][62475] Updated weights for policy 0, policy_version 21270 (0.0006) [2023-03-06 21:27:08,881][62475] Updated weights for policy 0, policy_version 21280 (0.0006) [2023-03-06 21:27:09,688][62475] Updated weights for policy 0, policy_version 21290 (0.0007) [2023-03-06 21:27:10,488][62475] Updated weights for policy 0, policy_version 21300 (0.0006) [2023-03-06 21:27:11,307][62475] Updated weights for policy 0, policy_version 21310 (0.0007) [2023-03-06 21:27:12,107][62475] Updated weights for policy 0, policy_version 21320 (0.0007) [2023-03-06 21:27:12,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12704.5). Total num frames: 21834752. Throughput: 0: 12700.2. Samples: 21805058. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:27:12,390][62145] Avg episode reward: [(0, '780.309')] [2023-03-06 21:27:12,934][62475] Updated weights for policy 0, policy_version 21330 (0.0006) [2023-03-06 21:27:13,746][62475] Updated weights for policy 0, policy_version 21340 (0.0006) [2023-03-06 21:27:14,567][62475] Updated weights for policy 0, policy_version 21350 (0.0006) [2023-03-06 21:27:15,364][62475] Updated weights for policy 0, policy_version 21360 (0.0006) [2023-03-06 21:27:16,170][62475] Updated weights for policy 0, policy_version 21370 (0.0006) [2023-03-06 21:27:16,975][62475] Updated weights for policy 0, policy_version 21380 (0.0006) [2023-03-06 21:27:17,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12704.5). Total num frames: 21898240. Throughput: 0: 12681.8. Samples: 21880895. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:27:17,390][62145] Avg episode reward: [(0, '709.993')] [2023-03-06 21:27:17,775][62475] Updated weights for policy 0, policy_version 21390 (0.0006) [2023-03-06 21:27:18,599][62475] Updated weights for policy 0, policy_version 21400 (0.0006) [2023-03-06 21:27:19,408][62475] Updated weights for policy 0, policy_version 21410 (0.0006) [2023-03-06 21:27:20,207][62475] Updated weights for policy 0, policy_version 21420 (0.0006) [2023-03-06 21:27:21,004][62475] Updated weights for policy 0, policy_version 21430 (0.0007) [2023-03-06 21:27:21,827][62475] Updated weights for policy 0, policy_version 21440 (0.0006) [2023-03-06 21:27:22,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12704.5). Total num frames: 21961728. Throughput: 0: 12671.2. Samples: 21956887. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:27:22,390][62145] Avg episode reward: [(0, '645.478')] [2023-03-06 21:27:22,646][62475] Updated weights for policy 0, policy_version 21450 (0.0006) [2023-03-06 21:27:23,442][62475] Updated weights for policy 0, policy_version 21460 (0.0006) [2023-03-06 21:27:24,238][62475] Updated weights for policy 0, policy_version 21470 (0.0006) [2023-03-06 21:27:25,046][62475] Updated weights for policy 0, policy_version 21480 (0.0006) [2023-03-06 21:27:25,853][62475] Updated weights for policy 0, policy_version 21490 (0.0006) [2023-03-06 21:27:26,659][62475] Updated weights for policy 0, policy_version 21500 (0.0006) [2023-03-06 21:27:27,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12704.5). Total num frames: 22025216. Throughput: 0: 12670.1. Samples: 21995088. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:27:27,390][62145] Avg episode reward: [(0, '805.596')] [2023-03-06 21:27:27,489][62475] Updated weights for policy 0, policy_version 21510 (0.0005) [2023-03-06 21:27:28,297][62475] Updated weights for policy 0, policy_version 21520 (0.0006) [2023-03-06 21:27:29,100][62475] Updated weights for policy 0, policy_version 21530 (0.0006) [2023-03-06 21:27:29,899][62475] Updated weights for policy 0, policy_version 21540 (0.0006) [2023-03-06 21:27:30,696][62475] Updated weights for policy 0, policy_version 21550 (0.0005) [2023-03-06 21:27:31,480][62475] Updated weights for policy 0, policy_version 21560 (0.0007) [2023-03-06 21:27:32,291][62475] Updated weights for policy 0, policy_version 21570 (0.0006) [2023-03-06 21:27:32,389][62145] Fps is (10 sec: 12697.5, 60 sec: 12680.5, 300 sec: 12704.5). Total num frames: 22088704. Throughput: 0: 12672.9. Samples: 22071365. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:27:32,390][62145] Avg episode reward: [(0, '802.837')] [2023-03-06 21:27:33,073][62475] Updated weights for policy 0, policy_version 21580 (0.0006) [2023-03-06 21:27:33,863][62475] Updated weights for policy 0, policy_version 21590 (0.0006) [2023-03-06 21:27:34,681][62475] Updated weights for policy 0, policy_version 21600 (0.0006) [2023-03-06 21:27:35,481][62475] Updated weights for policy 0, policy_version 21610 (0.0006) [2023-03-06 21:27:36,274][62475] Updated weights for policy 0, policy_version 21620 (0.0006) [2023-03-06 21:27:37,112][62475] Updated weights for policy 0, policy_version 21630 (0.0007) [2023-03-06 21:27:37,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12704.5). Total num frames: 22152192. Throughput: 0: 12693.0. Samples: 22148019. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:27:37,390][62145] Avg episode reward: [(0, '776.348')] [2023-03-06 21:27:37,904][62475] Updated weights for policy 0, policy_version 21640 (0.0006) [2023-03-06 21:27:38,702][62475] Updated weights for policy 0, policy_version 21650 (0.0006) [2023-03-06 21:27:39,501][62475] Updated weights for policy 0, policy_version 21660 (0.0007) [2023-03-06 21:27:40,292][62475] Updated weights for policy 0, policy_version 21670 (0.0006) [2023-03-06 21:27:41,095][62475] Updated weights for policy 0, policy_version 21680 (0.0005) [2023-03-06 21:27:41,905][62475] Updated weights for policy 0, policy_version 21690 (0.0006) [2023-03-06 21:27:42,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12680.6, 300 sec: 12704.5). Total num frames: 22215680. Throughput: 0: 12694.8. Samples: 22186257. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:27:42,390][62145] Avg episode reward: [(0, '892.230')] [2023-03-06 21:27:42,715][62475] Updated weights for policy 0, policy_version 21700 (0.0007) [2023-03-06 21:27:43,517][62475] Updated weights for policy 0, policy_version 21710 (0.0006) [2023-03-06 21:27:44,322][62475] Updated weights for policy 0, policy_version 21720 (0.0006) [2023-03-06 21:27:45,134][62475] Updated weights for policy 0, policy_version 21730 (0.0006) [2023-03-06 21:27:45,940][62475] Updated weights for policy 0, policy_version 21740 (0.0006) [2023-03-06 21:27:46,757][62475] Updated weights for policy 0, policy_version 21750 (0.0006) [2023-03-06 21:27:47,390][62145] Fps is (10 sec: 12697.7, 60 sec: 12680.5, 300 sec: 12704.5). Total num frames: 22279168. Throughput: 0: 12707.8. Samples: 22262634. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:27:47,390][62145] Avg episode reward: [(0, '818.200')] [2023-03-06 21:27:47,567][62475] Updated weights for policy 0, policy_version 21760 (0.0006) [2023-03-06 21:27:48,350][62475] Updated weights for policy 0, policy_version 21770 (0.0006) [2023-03-06 21:27:49,175][62475] Updated weights for policy 0, policy_version 21780 (0.0006) [2023-03-06 21:27:50,001][62475] Updated weights for policy 0, policy_version 21790 (0.0006) [2023-03-06 21:27:50,793][62475] Updated weights for policy 0, policy_version 21800 (0.0006) [2023-03-06 21:27:51,600][62475] Updated weights for policy 0, policy_version 21810 (0.0006) [2023-03-06 21:27:52,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12704.5). Total num frames: 22342656. Throughput: 0: 12702.7. Samples: 22338595. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:27:52,390][62145] Avg episode reward: [(0, '688.235')] [2023-03-06 21:27:52,418][62475] Updated weights for policy 0, policy_version 21820 (0.0006) [2023-03-06 21:27:53,221][62475] Updated weights for policy 0, policy_version 21830 (0.0006) [2023-03-06 21:27:54,023][62475] Updated weights for policy 0, policy_version 21840 (0.0006) [2023-03-06 21:27:54,826][62475] Updated weights for policy 0, policy_version 21850 (0.0006) [2023-03-06 21:27:55,614][62475] Updated weights for policy 0, policy_version 21860 (0.0006) [2023-03-06 21:27:56,423][62475] Updated weights for policy 0, policy_version 21870 (0.0006) [2023-03-06 21:27:57,228][62475] Updated weights for policy 0, policy_version 21880 (0.0006) [2023-03-06 21:27:57,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12704.5). Total num frames: 22406144. Throughput: 0: 12706.2. Samples: 22376836. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:27:57,390][62145] Avg episode reward: [(0, '717.803')] [2023-03-06 21:27:58,020][62475] Updated weights for policy 0, policy_version 21890 (0.0006) [2023-03-06 21:27:58,830][62475] Updated weights for policy 0, policy_version 21900 (0.0006) [2023-03-06 21:27:59,636][62475] Updated weights for policy 0, policy_version 21910 (0.0006) [2023-03-06 21:28:00,441][62475] Updated weights for policy 0, policy_version 21920 (0.0006) [2023-03-06 21:28:01,247][62475] Updated weights for policy 0, policy_version 21930 (0.0006) [2023-03-06 21:28:02,068][62475] Updated weights for policy 0, policy_version 21940 (0.0005) [2023-03-06 21:28:02,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12714.7, 300 sec: 12708.0). Total num frames: 22470656. Throughput: 0: 12720.5. Samples: 22453316. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:28:02,390][62145] Avg episode reward: [(0, '780.081')] [2023-03-06 21:28:02,851][62475] Updated weights for policy 0, policy_version 21950 (0.0007) [2023-03-06 21:28:03,665][62475] Updated weights for policy 0, policy_version 21960 (0.0006) [2023-03-06 21:28:04,474][62475] Updated weights for policy 0, policy_version 21970 (0.0006) [2023-03-06 21:28:05,270][62475] Updated weights for policy 0, policy_version 21980 (0.0005) [2023-03-06 21:28:06,086][62475] Updated weights for policy 0, policy_version 21990 (0.0006) [2023-03-06 21:28:06,891][62475] Updated weights for policy 0, policy_version 22000 (0.0006) [2023-03-06 21:28:07,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12714.7, 300 sec: 12708.0). Total num frames: 22534144. Throughput: 0: 12725.4. Samples: 22529529. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:28:07,390][62145] Avg episode reward: [(0, '1038.411')] [2023-03-06 21:28:07,699][62475] Updated weights for policy 0, policy_version 22010 (0.0006) [2023-03-06 21:28:08,502][62475] Updated weights for policy 0, policy_version 22020 (0.0006) [2023-03-06 21:28:09,300][62475] Updated weights for policy 0, policy_version 22030 (0.0007) [2023-03-06 21:28:10,107][62475] Updated weights for policy 0, policy_version 22040 (0.0006) [2023-03-06 21:28:10,930][62475] Updated weights for policy 0, policy_version 22050 (0.0006) [2023-03-06 21:28:11,726][62475] Updated weights for policy 0, policy_version 22060 (0.0006) [2023-03-06 21:28:12,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12704.5). Total num frames: 22597632. Throughput: 0: 12724.2. Samples: 22567677. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:28:12,390][62145] Avg episode reward: [(0, '860.914')] [2023-03-06 21:28:12,535][62475] Updated weights for policy 0, policy_version 22070 (0.0006) [2023-03-06 21:28:13,343][62475] Updated weights for policy 0, policy_version 22080 (0.0006) [2023-03-06 21:28:14,145][62475] Updated weights for policy 0, policy_version 22090 (0.0006) [2023-03-06 21:28:14,966][62475] Updated weights for policy 0, policy_version 22100 (0.0006) [2023-03-06 21:28:15,781][62475] Updated weights for policy 0, policy_version 22110 (0.0007) [2023-03-06 21:28:16,565][62475] Updated weights for policy 0, policy_version 22120 (0.0006) [2023-03-06 21:28:17,390][62145] Fps is (10 sec: 12595.1, 60 sec: 12697.6, 300 sec: 12701.1). Total num frames: 22660096. Throughput: 0: 12723.5. Samples: 22643921. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:28:17,390][62145] Avg episode reward: [(0, '969.254')] [2023-03-06 21:28:17,393][62475] Updated weights for policy 0, policy_version 22130 (0.0006) [2023-03-06 21:28:18,184][62475] Updated weights for policy 0, policy_version 22140 (0.0006) [2023-03-06 21:28:18,990][62475] Updated weights for policy 0, policy_version 22150 (0.0006) [2023-03-06 21:28:19,790][62475] Updated weights for policy 0, policy_version 22160 (0.0006) [2023-03-06 21:28:20,601][62475] Updated weights for policy 0, policy_version 22170 (0.0006) [2023-03-06 21:28:21,417][62475] Updated weights for policy 0, policy_version 22180 (0.0006) [2023-03-06 21:28:22,226][62475] Updated weights for policy 0, policy_version 22190 (0.0006) [2023-03-06 21:28:22,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.6, 300 sec: 12708.0). Total num frames: 22724608. Throughput: 0: 12706.0. Samples: 22719790. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:28:22,390][62145] Avg episode reward: [(0, '1088.038')] [2023-03-06 21:28:22,394][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000022192_22724608.pth... [2023-03-06 21:28:22,424][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000019215_19676160.pth [2023-03-06 21:28:22,427][62424] Saving new best policy, reward=1088.038! [2023-03-06 21:28:23,025][62475] Updated weights for policy 0, policy_version 22200 (0.0006) [2023-03-06 21:28:23,826][62475] Updated weights for policy 0, policy_version 22210 (0.0007) [2023-03-06 21:28:24,609][62475] Updated weights for policy 0, policy_version 22220 (0.0006) [2023-03-06 21:28:25,426][62475] Updated weights for policy 0, policy_version 22230 (0.0006) [2023-03-06 21:28:26,233][62475] Updated weights for policy 0, policy_version 22240 (0.0006) [2023-03-06 21:28:27,034][62475] Updated weights for policy 0, policy_version 22250 (0.0006) [2023-03-06 21:28:27,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12714.7, 300 sec: 12704.5). Total num frames: 22788096. Throughput: 0: 12710.0. Samples: 22758206. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:28:27,390][62145] Avg episode reward: [(0, '1090.932')] [2023-03-06 21:28:27,390][62424] Saving new best policy, reward=1090.932! [2023-03-06 21:28:27,842][62475] Updated weights for policy 0, policy_version 22260 (0.0007) [2023-03-06 21:28:28,648][62475] Updated weights for policy 0, policy_version 22270 (0.0006) [2023-03-06 21:28:29,456][62475] Updated weights for policy 0, policy_version 22280 (0.0006) [2023-03-06 21:28:30,261][62475] Updated weights for policy 0, policy_version 22290 (0.0006) [2023-03-06 21:28:31,067][62475] Updated weights for policy 0, policy_version 22300 (0.0006) [2023-03-06 21:28:31,883][62475] Updated weights for policy 0, policy_version 22310 (0.0007) [2023-03-06 21:28:32,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12704.5). Total num frames: 22851584. Throughput: 0: 12706.7. Samples: 22834437. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:28:32,390][62145] Avg episode reward: [(0, '1098.454')] [2023-03-06 21:28:32,394][62424] Saving new best policy, reward=1098.454! [2023-03-06 21:28:32,707][62475] Updated weights for policy 0, policy_version 22320 (0.0006) [2023-03-06 21:28:33,508][62475] Updated weights for policy 0, policy_version 22330 (0.0006) [2023-03-06 21:28:34,311][62475] Updated weights for policy 0, policy_version 22340 (0.0006) [2023-03-06 21:28:35,119][62475] Updated weights for policy 0, policy_version 22350 (0.0007) [2023-03-06 21:28:35,943][62475] Updated weights for policy 0, policy_version 22360 (0.0007) [2023-03-06 21:28:36,741][62475] Updated weights for policy 0, policy_version 22370 (0.0006) [2023-03-06 21:28:37,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12704.5). Total num frames: 22915072. Throughput: 0: 12702.3. Samples: 22910196. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:28:37,390][62145] Avg episode reward: [(0, '875.931')] [2023-03-06 21:28:37,553][62475] Updated weights for policy 0, policy_version 22380 (0.0006) [2023-03-06 21:28:38,373][62475] Updated weights for policy 0, policy_version 22390 (0.0007) [2023-03-06 21:28:39,204][62475] Updated weights for policy 0, policy_version 22400 (0.0007) [2023-03-06 21:28:40,001][62475] Updated weights for policy 0, policy_version 22410 (0.0006) [2023-03-06 21:28:40,815][62475] Updated weights for policy 0, policy_version 22420 (0.0006) [2023-03-06 21:28:41,594][62475] Updated weights for policy 0, policy_version 22430 (0.0006) [2023-03-06 21:28:42,389][62145] Fps is (10 sec: 12595.2, 60 sec: 12697.6, 300 sec: 12701.1). Total num frames: 22977536. Throughput: 0: 12692.7. Samples: 22948007. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-06 21:28:42,390][62145] Avg episode reward: [(0, '992.857')] [2023-03-06 21:28:42,410][62475] Updated weights for policy 0, policy_version 22440 (0.0006) [2023-03-06 21:28:43,205][62475] Updated weights for policy 0, policy_version 22450 (0.0006) [2023-03-06 21:28:43,995][62475] Updated weights for policy 0, policy_version 22460 (0.0006) [2023-03-06 21:28:44,795][62475] Updated weights for policy 0, policy_version 22470 (0.0005) [2023-03-06 21:28:45,618][62475] Updated weights for policy 0, policy_version 22480 (0.0006) [2023-03-06 21:28:46,409][62475] Updated weights for policy 0, policy_version 22490 (0.0005) [2023-03-06 21:28:47,226][62475] Updated weights for policy 0, policy_version 22500 (0.0006) [2023-03-06 21:28:47,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12704.5). Total num frames: 23042048. Throughput: 0: 12694.1. Samples: 23024548. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-06 21:28:47,390][62145] Avg episode reward: [(0, '820.256')] [2023-03-06 21:28:48,018][62475] Updated weights for policy 0, policy_version 22510 (0.0006) [2023-03-06 21:28:48,829][62475] Updated weights for policy 0, policy_version 22520 (0.0007) [2023-03-06 21:28:49,627][62475] Updated weights for policy 0, policy_version 22530 (0.0006) [2023-03-06 21:28:50,446][62475] Updated weights for policy 0, policy_version 22540 (0.0006) [2023-03-06 21:28:51,257][62475] Updated weights for policy 0, policy_version 22550 (0.0005) [2023-03-06 21:28:52,077][62475] Updated weights for policy 0, policy_version 22560 (0.0007) [2023-03-06 21:28:52,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12714.7, 300 sec: 12704.5). Total num frames: 23105536. Throughput: 0: 12695.5. Samples: 23100829. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-06 21:28:52,390][62145] Avg episode reward: [(0, '1192.442')] [2023-03-06 21:28:52,394][62424] Saving new best policy, reward=1192.442! [2023-03-06 21:28:52,861][62475] Updated weights for policy 0, policy_version 22570 (0.0007) [2023-03-06 21:28:53,684][62475] Updated weights for policy 0, policy_version 22580 (0.0007) [2023-03-06 21:28:54,491][62475] Updated weights for policy 0, policy_version 22590 (0.0006) [2023-03-06 21:28:55,287][62475] Updated weights for policy 0, policy_version 22600 (0.0006) [2023-03-06 21:28:56,093][62475] Updated weights for policy 0, policy_version 22610 (0.0006) [2023-03-06 21:28:56,913][62475] Updated weights for policy 0, policy_version 22620 (0.0006) [2023-03-06 21:28:57,390][62145] Fps is (10 sec: 12595.1, 60 sec: 12697.6, 300 sec: 12697.6). Total num frames: 23168000. Throughput: 0: 12694.3. Samples: 23138919. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:28:57,390][62145] Avg episode reward: [(0, '901.402')] [2023-03-06 21:28:57,720][62475] Updated weights for policy 0, policy_version 22630 (0.0007) [2023-03-06 21:28:58,509][62475] Updated weights for policy 0, policy_version 22640 (0.0007) [2023-03-06 21:28:59,306][62475] Updated weights for policy 0, policy_version 22650 (0.0006) [2023-03-06 21:29:00,121][62475] Updated weights for policy 0, policy_version 22660 (0.0006) [2023-03-06 21:29:00,946][62475] Updated weights for policy 0, policy_version 22670 (0.0007) [2023-03-06 21:29:01,732][62475] Updated weights for policy 0, policy_version 22680 (0.0007) [2023-03-06 21:29:02,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12701.1). Total num frames: 23232512. Throughput: 0: 12693.6. Samples: 23215133. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:29:02,390][62145] Avg episode reward: [(0, '974.741')] [2023-03-06 21:29:02,544][62475] Updated weights for policy 0, policy_version 22690 (0.0006) [2023-03-06 21:29:03,321][62475] Updated weights for policy 0, policy_version 22700 (0.0007) [2023-03-06 21:29:04,130][62475] Updated weights for policy 0, policy_version 22710 (0.0005) [2023-03-06 21:29:04,927][62475] Updated weights for policy 0, policy_version 22720 (0.0006) [2023-03-06 21:29:05,723][62475] Updated weights for policy 0, policy_version 22730 (0.0006) [2023-03-06 21:29:06,541][62475] Updated weights for policy 0, policy_version 22740 (0.0006) [2023-03-06 21:29:07,338][62475] Updated weights for policy 0, policy_version 22750 (0.0006) [2023-03-06 21:29:07,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12697.6, 300 sec: 12701.1). Total num frames: 23296000. Throughput: 0: 12711.9. Samples: 23291823. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:29:07,390][62145] Avg episode reward: [(0, '871.791')] [2023-03-06 21:29:08,149][62475] Updated weights for policy 0, policy_version 22760 (0.0006) [2023-03-06 21:29:08,946][62475] Updated weights for policy 0, policy_version 22770 (0.0006) [2023-03-06 21:29:09,759][62475] Updated weights for policy 0, policy_version 22780 (0.0006) [2023-03-06 21:29:10,537][62475] Updated weights for policy 0, policy_version 22790 (0.0006) [2023-03-06 21:29:11,358][62475] Updated weights for policy 0, policy_version 22800 (0.0006) [2023-03-06 21:29:12,173][62475] Updated weights for policy 0, policy_version 22810 (0.0006) [2023-03-06 21:29:12,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12701.1). Total num frames: 23359488. Throughput: 0: 12707.5. Samples: 23330044. Policy #0 lag: (min: 0.0, avg: 1.3, max: 4.0) [2023-03-06 21:29:12,390][62145] Avg episode reward: [(0, '805.486')] [2023-03-06 21:29:12,974][62475] Updated weights for policy 0, policy_version 22820 (0.0006) [2023-03-06 21:29:13,785][62475] Updated weights for policy 0, policy_version 22830 (0.0006) [2023-03-06 21:29:14,594][62475] Updated weights for policy 0, policy_version 22840 (0.0006) [2023-03-06 21:29:15,404][62475] Updated weights for policy 0, policy_version 22850 (0.0006) [2023-03-06 21:29:16,215][62475] Updated weights for policy 0, policy_version 22860 (0.0005) [2023-03-06 21:29:17,011][62475] Updated weights for policy 0, policy_version 22870 (0.0006) [2023-03-06 21:29:17,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12701.1). Total num frames: 23422976. Throughput: 0: 12707.7. Samples: 23406284. Policy #0 lag: (min: 0.0, avg: 1.3, max: 4.0) [2023-03-06 21:29:17,390][62145] Avg episode reward: [(0, '975.909')] [2023-03-06 21:29:17,810][62475] Updated weights for policy 0, policy_version 22880 (0.0006) [2023-03-06 21:29:18,628][62475] Updated weights for policy 0, policy_version 22890 (0.0006) [2023-03-06 21:29:19,414][62475] Updated weights for policy 0, policy_version 22900 (0.0006) [2023-03-06 21:29:20,241][62475] Updated weights for policy 0, policy_version 22910 (0.0006) [2023-03-06 21:29:21,049][62475] Updated weights for policy 0, policy_version 22920 (0.0006) [2023-03-06 21:29:21,849][62475] Updated weights for policy 0, policy_version 22930 (0.0006) [2023-03-06 21:29:22,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12701.1). Total num frames: 23486464. Throughput: 0: 12714.0. Samples: 23482326. Policy #0 lag: (min: 0.0, avg: 1.3, max: 4.0) [2023-03-06 21:29:22,390][62145] Avg episode reward: [(0, '680.474')] [2023-03-06 21:29:22,672][62475] Updated weights for policy 0, policy_version 22940 (0.0006) [2023-03-06 21:29:23,468][62475] Updated weights for policy 0, policy_version 22950 (0.0006) [2023-03-06 21:29:24,282][62475] Updated weights for policy 0, policy_version 22960 (0.0006) [2023-03-06 21:29:25,102][62475] Updated weights for policy 0, policy_version 22970 (0.0006) [2023-03-06 21:29:25,914][62475] Updated weights for policy 0, policy_version 22980 (0.0006) [2023-03-06 21:29:26,711][62475] Updated weights for policy 0, policy_version 22990 (0.0006) [2023-03-06 21:29:27,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12701.1). Total num frames: 23549952. Throughput: 0: 12715.0. Samples: 23520181. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:29:27,390][62145] Avg episode reward: [(0, '832.616')] [2023-03-06 21:29:27,523][62475] Updated weights for policy 0, policy_version 23000 (0.0007) [2023-03-06 21:29:28,317][62475] Updated weights for policy 0, policy_version 23010 (0.0006) [2023-03-06 21:29:29,124][62475] Updated weights for policy 0, policy_version 23020 (0.0007) [2023-03-06 21:29:29,912][62475] Updated weights for policy 0, policy_version 23030 (0.0007) [2023-03-06 21:29:30,734][62475] Updated weights for policy 0, policy_version 23040 (0.0006) [2023-03-06 21:29:31,533][62475] Updated weights for policy 0, policy_version 23050 (0.0006) [2023-03-06 21:29:32,335][62475] Updated weights for policy 0, policy_version 23060 (0.0008) [2023-03-06 21:29:32,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12701.1). Total num frames: 23613440. Throughput: 0: 12712.6. Samples: 23596617. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:29:32,390][62145] Avg episode reward: [(0, '805.974')] [2023-03-06 21:29:33,149][62475] Updated weights for policy 0, policy_version 23070 (0.0006) [2023-03-06 21:29:33,928][62475] Updated weights for policy 0, policy_version 23080 (0.0008) [2023-03-06 21:29:34,730][62475] Updated weights for policy 0, policy_version 23090 (0.0006) [2023-03-06 21:29:35,538][62475] Updated weights for policy 0, policy_version 23100 (0.0006) [2023-03-06 21:29:36,335][62475] Updated weights for policy 0, policy_version 23110 (0.0006) [2023-03-06 21:29:37,139][62475] Updated weights for policy 0, policy_version 23120 (0.0006) [2023-03-06 21:29:37,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12714.7, 300 sec: 12704.5). Total num frames: 23677952. Throughput: 0: 12719.6. Samples: 23673212. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:29:37,390][62145] Avg episode reward: [(0, '995.583')] [2023-03-06 21:29:37,953][62475] Updated weights for policy 0, policy_version 23130 (0.0006) [2023-03-06 21:29:38,737][62475] Updated weights for policy 0, policy_version 23140 (0.0006) [2023-03-06 21:29:39,534][62475] Updated weights for policy 0, policy_version 23150 (0.0006) [2023-03-06 21:29:40,339][62475] Updated weights for policy 0, policy_version 23160 (0.0006) [2023-03-06 21:29:41,174][62475] Updated weights for policy 0, policy_version 23170 (0.0007) [2023-03-06 21:29:41,957][62475] Updated weights for policy 0, policy_version 23180 (0.0006) [2023-03-06 21:29:42,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12704.5). Total num frames: 23741440. Throughput: 0: 12726.1. Samples: 23711593. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:29:42,390][62145] Avg episode reward: [(0, '1116.884')] [2023-03-06 21:29:42,773][62475] Updated weights for policy 0, policy_version 23190 (0.0007) [2023-03-06 21:29:43,594][62475] Updated weights for policy 0, policy_version 23200 (0.0008) [2023-03-06 21:29:44,369][62475] Updated weights for policy 0, policy_version 23210 (0.0006) [2023-03-06 21:29:45,193][62475] Updated weights for policy 0, policy_version 23220 (0.0006) [2023-03-06 21:29:45,998][62475] Updated weights for policy 0, policy_version 23230 (0.0006) [2023-03-06 21:29:46,798][62475] Updated weights for policy 0, policy_version 23240 (0.0006) [2023-03-06 21:29:47,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12704.5). Total num frames: 23804928. Throughput: 0: 12724.1. Samples: 23787717. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:29:47,390][62145] Avg episode reward: [(0, '1171.410')] [2023-03-06 21:29:47,590][62475] Updated weights for policy 0, policy_version 23250 (0.0006) [2023-03-06 21:29:48,406][62475] Updated weights for policy 0, policy_version 23260 (0.0006) [2023-03-06 21:29:49,219][62475] Updated weights for policy 0, policy_version 23270 (0.0006) [2023-03-06 21:29:50,022][62475] Updated weights for policy 0, policy_version 23280 (0.0005) [2023-03-06 21:29:50,837][62475] Updated weights for policy 0, policy_version 23290 (0.0006) [2023-03-06 21:29:51,637][62475] Updated weights for policy 0, policy_version 23300 (0.0006) [2023-03-06 21:29:52,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12704.5). Total num frames: 23868416. Throughput: 0: 12714.2. Samples: 23863964. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:29:52,390][62145] Avg episode reward: [(0, '1057.056')] [2023-03-06 21:29:52,457][62475] Updated weights for policy 0, policy_version 23310 (0.0006) [2023-03-06 21:29:53,254][62475] Updated weights for policy 0, policy_version 23320 (0.0006) [2023-03-06 21:29:54,042][62475] Updated weights for policy 0, policy_version 23330 (0.0006) [2023-03-06 21:29:54,846][62475] Updated weights for policy 0, policy_version 23340 (0.0007) [2023-03-06 21:29:55,657][62475] Updated weights for policy 0, policy_version 23350 (0.0006) [2023-03-06 21:29:56,463][62475] Updated weights for policy 0, policy_version 23360 (0.0006) [2023-03-06 21:29:57,262][62475] Updated weights for policy 0, policy_version 23370 (0.0006) [2023-03-06 21:29:57,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12704.5). Total num frames: 23931904. Throughput: 0: 12714.9. Samples: 23902214. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:29:57,390][62145] Avg episode reward: [(0, '1021.588')] [2023-03-06 21:29:58,080][62475] Updated weights for policy 0, policy_version 23380 (0.0006) [2023-03-06 21:29:58,873][62475] Updated weights for policy 0, policy_version 23390 (0.0006) [2023-03-06 21:29:59,663][62475] Updated weights for policy 0, policy_version 23400 (0.0007) [2023-03-06 21:30:00,501][62475] Updated weights for policy 0, policy_version 23410 (0.0006) [2023-03-06 21:30:01,297][62475] Updated weights for policy 0, policy_version 23420 (0.0006) [2023-03-06 21:30:02,114][62475] Updated weights for policy 0, policy_version 23430 (0.0006) [2023-03-06 21:30:02,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12704.5). Total num frames: 23995392. Throughput: 0: 12713.8. Samples: 23978407. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:30:02,390][62145] Avg episode reward: [(0, '1167.104')] [2023-03-06 21:30:02,902][62475] Updated weights for policy 0, policy_version 23440 (0.0007) [2023-03-06 21:30:03,697][62475] Updated weights for policy 0, policy_version 23450 (0.0006) [2023-03-06 21:30:04,511][62475] Updated weights for policy 0, policy_version 23460 (0.0006) [2023-03-06 21:30:05,329][62475] Updated weights for policy 0, policy_version 23470 (0.0007) [2023-03-06 21:30:06,122][62475] Updated weights for policy 0, policy_version 23480 (0.0006) [2023-03-06 21:30:06,915][62475] Updated weights for policy 0, policy_version 23490 (0.0007) [2023-03-06 21:30:07,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.6, 300 sec: 12704.5). Total num frames: 24058880. Throughput: 0: 12724.8. Samples: 24054945. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:30:07,390][62145] Avg episode reward: [(0, '1101.231')] [2023-03-06 21:30:07,731][62475] Updated weights for policy 0, policy_version 23500 (0.0006) [2023-03-06 21:30:08,528][62475] Updated weights for policy 0, policy_version 23510 (0.0007) [2023-03-06 21:30:09,331][62475] Updated weights for policy 0, policy_version 23520 (0.0006) [2023-03-06 21:30:10,133][62475] Updated weights for policy 0, policy_version 23530 (0.0006) [2023-03-06 21:30:10,944][62475] Updated weights for policy 0, policy_version 23540 (0.0006) [2023-03-06 21:30:11,765][62475] Updated weights for policy 0, policy_version 23550 (0.0007) [2023-03-06 21:30:12,390][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12704.5). Total num frames: 24122368. Throughput: 0: 12729.9. Samples: 24093025. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:30:12,390][62145] Avg episode reward: [(0, '966.612')] [2023-03-06 21:30:12,549][62475] Updated weights for policy 0, policy_version 23560 (0.0006) [2023-03-06 21:30:12,959][62424] KL-divergence is very high: 20020.4121 [2023-03-06 21:30:13,367][62475] Updated weights for policy 0, policy_version 23570 (0.0007) [2023-03-06 21:30:14,186][62475] Updated weights for policy 0, policy_version 23580 (0.0007) [2023-03-06 21:30:14,967][62475] Updated weights for policy 0, policy_version 23590 (0.0006) [2023-03-06 21:30:15,797][62475] Updated weights for policy 0, policy_version 23600 (0.0006) [2023-03-06 21:30:16,585][62475] Updated weights for policy 0, policy_version 23610 (0.0006) [2023-03-06 21:30:17,371][62475] Updated weights for policy 0, policy_version 23620 (0.0006) [2023-03-06 21:30:17,389][62145] Fps is (10 sec: 12800.2, 60 sec: 12731.7, 300 sec: 12708.0). Total num frames: 24186880. Throughput: 0: 12727.8. Samples: 24169367. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:30:17,390][62145] Avg episode reward: [(0, '900.861')] [2023-03-06 21:30:18,179][62475] Updated weights for policy 0, policy_version 23630 (0.0006) [2023-03-06 21:30:18,986][62475] Updated weights for policy 0, policy_version 23640 (0.0006) [2023-03-06 21:30:19,795][62475] Updated weights for policy 0, policy_version 23650 (0.0006) [2023-03-06 21:30:20,602][62475] Updated weights for policy 0, policy_version 23660 (0.0007) [2023-03-06 21:30:21,414][62475] Updated weights for policy 0, policy_version 23670 (0.0006) [2023-03-06 21:30:22,214][62475] Updated weights for policy 0, policy_version 23680 (0.0007) [2023-03-06 21:30:22,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12708.0). Total num frames: 24250368. Throughput: 0: 12721.9. Samples: 24245697. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:30:22,401][62145] Avg episode reward: [(0, '1047.629')] [2023-03-06 21:30:22,404][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000023682_24250368.pth... [2023-03-06 21:30:22,440][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000020703_21199872.pth [2023-03-06 21:30:23,018][62475] Updated weights for policy 0, policy_version 23690 (0.0006) [2023-03-06 21:30:23,845][62475] Updated weights for policy 0, policy_version 23700 (0.0007) [2023-03-06 21:30:24,654][62475] Updated weights for policy 0, policy_version 23710 (0.0006) [2023-03-06 21:30:25,450][62475] Updated weights for policy 0, policy_version 23720 (0.0006) [2023-03-06 21:30:26,257][62475] Updated weights for policy 0, policy_version 23730 (0.0006) [2023-03-06 21:30:27,051][62475] Updated weights for policy 0, policy_version 23740 (0.0007) [2023-03-06 21:30:27,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12708.0). Total num frames: 24313856. Throughput: 0: 12712.4. Samples: 24283651. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:30:27,401][62145] Avg episode reward: [(0, '933.235')] [2023-03-06 21:30:27,871][62475] Updated weights for policy 0, policy_version 23750 (0.0005) [2023-03-06 21:30:28,668][62475] Updated weights for policy 0, policy_version 23760 (0.0006) [2023-03-06 21:30:29,473][62475] Updated weights for policy 0, policy_version 23770 (0.0006) [2023-03-06 21:30:30,282][62475] Updated weights for policy 0, policy_version 23780 (0.0006) [2023-03-06 21:30:31,089][62475] Updated weights for policy 0, policy_version 23790 (0.0007) [2023-03-06 21:30:31,886][62475] Updated weights for policy 0, policy_version 23800 (0.0007) [2023-03-06 21:30:32,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12731.8, 300 sec: 12708.0). Total num frames: 24377344. Throughput: 0: 12713.7. Samples: 24359835. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:30:32,400][62145] Avg episode reward: [(0, '963.977')] [2023-03-06 21:30:32,709][62475] Updated weights for policy 0, policy_version 23810 (0.0006) [2023-03-06 21:30:33,508][62475] Updated weights for policy 0, policy_version 23820 (0.0006) [2023-03-06 21:30:34,307][62475] Updated weights for policy 0, policy_version 23830 (0.0006) [2023-03-06 21:30:35,130][62475] Updated weights for policy 0, policy_version 23840 (0.0007) [2023-03-06 21:30:35,926][62475] Updated weights for policy 0, policy_version 23850 (0.0006) [2023-03-06 21:30:36,733][62475] Updated weights for policy 0, policy_version 23860 (0.0006) [2023-03-06 21:30:37,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12708.0). Total num frames: 24440832. Throughput: 0: 12717.5. Samples: 24436249. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:30:37,390][62145] Avg episode reward: [(0, '802.774')] [2023-03-06 21:30:37,531][62475] Updated weights for policy 0, policy_version 23870 (0.0006) [2023-03-06 21:30:38,330][62475] Updated weights for policy 0, policy_version 23880 (0.0006) [2023-03-06 21:30:39,121][62475] Updated weights for policy 0, policy_version 23890 (0.0006) [2023-03-06 21:30:39,922][62475] Updated weights for policy 0, policy_version 23900 (0.0006) [2023-03-06 21:30:40,734][62475] Updated weights for policy 0, policy_version 23910 (0.0006) [2023-03-06 21:30:41,516][62475] Updated weights for policy 0, policy_version 23920 (0.0006) [2023-03-06 21:30:42,342][62475] Updated weights for policy 0, policy_version 23930 (0.0006) [2023-03-06 21:30:42,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12708.0). Total num frames: 24504320. Throughput: 0: 12722.4. Samples: 24474722. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:30:42,390][62145] Avg episode reward: [(0, '894.133')] [2023-03-06 21:30:43,134][62475] Updated weights for policy 0, policy_version 23940 (0.0007) [2023-03-06 21:30:43,920][62475] Updated weights for policy 0, policy_version 23950 (0.0006) [2023-03-06 21:30:44,733][62475] Updated weights for policy 0, policy_version 23960 (0.0006) [2023-03-06 21:30:45,539][62475] Updated weights for policy 0, policy_version 23970 (0.0006) [2023-03-06 21:30:46,354][62475] Updated weights for policy 0, policy_version 23980 (0.0006) [2023-03-06 21:30:47,147][62475] Updated weights for policy 0, policy_version 23990 (0.0006) [2023-03-06 21:30:47,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12731.7, 300 sec: 12711.5). Total num frames: 24568832. Throughput: 0: 12731.0. Samples: 24551299. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:30:47,390][62145] Avg episode reward: [(0, '698.612')] [2023-03-06 21:30:47,934][62475] Updated weights for policy 0, policy_version 24000 (0.0006) [2023-03-06 21:30:48,749][62475] Updated weights for policy 0, policy_version 24010 (0.0006) [2023-03-06 21:30:49,551][62475] Updated weights for policy 0, policy_version 24020 (0.0006) [2023-03-06 21:30:50,346][62475] Updated weights for policy 0, policy_version 24030 (0.0006) [2023-03-06 21:30:51,184][62475] Updated weights for policy 0, policy_version 24040 (0.0007) [2023-03-06 21:30:51,981][62475] Updated weights for policy 0, policy_version 24050 (0.0006) [2023-03-06 21:30:52,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12708.0). Total num frames: 24631296. Throughput: 0: 12724.9. Samples: 24627563. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:30:52,390][62145] Avg episode reward: [(0, '704.429')] [2023-03-06 21:30:52,802][62475] Updated weights for policy 0, policy_version 24060 (0.0006) [2023-03-06 21:30:53,617][62475] Updated weights for policy 0, policy_version 24070 (0.0006) [2023-03-06 21:30:54,421][62475] Updated weights for policy 0, policy_version 24080 (0.0007) [2023-03-06 21:30:55,225][62475] Updated weights for policy 0, policy_version 24090 (0.0007) [2023-03-06 21:30:55,297][62424] KL-divergence is very high: 318.6990 [2023-03-06 21:30:56,028][62475] Updated weights for policy 0, policy_version 24100 (0.0006) [2023-03-06 21:30:56,835][62475] Updated weights for policy 0, policy_version 24110 (0.0006) [2023-03-06 21:30:57,389][62145] Fps is (10 sec: 12595.2, 60 sec: 12714.7, 300 sec: 12708.0). Total num frames: 24694784. Throughput: 0: 12721.6. Samples: 24665497. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:30:57,390][62145] Avg episode reward: [(0, '573.743')] [2023-03-06 21:30:57,625][62475] Updated weights for policy 0, policy_version 24120 (0.0007) [2023-03-06 21:30:58,445][62475] Updated weights for policy 0, policy_version 24130 (0.0006) [2023-03-06 21:30:59,254][62475] Updated weights for policy 0, policy_version 24140 (0.0006) [2023-03-06 21:31:00,062][62475] Updated weights for policy 0, policy_version 24150 (0.0007) [2023-03-06 21:31:00,857][62475] Updated weights for policy 0, policy_version 24160 (0.0006) [2023-03-06 21:31:01,657][62475] Updated weights for policy 0, policy_version 24170 (0.0006) [2023-03-06 21:31:02,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12731.8, 300 sec: 12711.5). Total num frames: 24759296. Throughput: 0: 12720.7. Samples: 24741797. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:31:02,390][62145] Avg episode reward: [(0, '592.169')] [2023-03-06 21:31:02,467][62475] Updated weights for policy 0, policy_version 24180 (0.0006) [2023-03-06 21:31:03,260][62475] Updated weights for policy 0, policy_version 24190 (0.0007) [2023-03-06 21:31:04,059][62475] Updated weights for policy 0, policy_version 24200 (0.0006) [2023-03-06 21:31:04,869][62475] Updated weights for policy 0, policy_version 24210 (0.0007) [2023-03-06 21:31:05,670][62475] Updated weights for policy 0, policy_version 24220 (0.0006) [2023-03-06 21:31:06,475][62475] Updated weights for policy 0, policy_version 24230 (0.0006) [2023-03-06 21:31:07,272][62475] Updated weights for policy 0, policy_version 24240 (0.0006) [2023-03-06 21:31:07,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12731.8, 300 sec: 12711.5). Total num frames: 24822784. Throughput: 0: 12726.3. Samples: 24818379. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:31:07,390][62145] Avg episode reward: [(0, '504.421')] [2023-03-06 21:31:08,068][62475] Updated weights for policy 0, policy_version 24250 (0.0007) [2023-03-06 21:31:08,889][62475] Updated weights for policy 0, policy_version 24260 (0.0006) [2023-03-06 21:31:09,681][62475] Updated weights for policy 0, policy_version 24270 (0.0006) [2023-03-06 21:31:10,489][62475] Updated weights for policy 0, policy_version 24280 (0.0006) [2023-03-06 21:31:11,306][62475] Updated weights for policy 0, policy_version 24290 (0.0006) [2023-03-06 21:31:12,106][62475] Updated weights for policy 0, policy_version 24300 (0.0006) [2023-03-06 21:31:12,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12711.5). Total num frames: 24886272. Throughput: 0: 12729.8. Samples: 24856490. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:31:12,390][62145] Avg episode reward: [(0, '647.748')] [2023-03-06 21:31:12,912][62475] Updated weights for policy 0, policy_version 24310 (0.0006) [2023-03-06 21:31:13,718][62475] Updated weights for policy 0, policy_version 24320 (0.0006) [2023-03-06 21:31:14,524][62475] Updated weights for policy 0, policy_version 24330 (0.0006) [2023-03-06 21:31:15,329][62475] Updated weights for policy 0, policy_version 24340 (0.0006) [2023-03-06 21:31:16,137][62475] Updated weights for policy 0, policy_version 24350 (0.0006) [2023-03-06 21:31:16,929][62475] Updated weights for policy 0, policy_version 24360 (0.0007) [2023-03-06 21:31:17,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.6, 300 sec: 12711.5). Total num frames: 24949760. Throughput: 0: 12732.3. Samples: 24932791. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:31:17,390][62145] Avg episode reward: [(0, '553.482')] [2023-03-06 21:31:17,725][62475] Updated weights for policy 0, policy_version 24370 (0.0006) [2023-03-06 21:31:18,534][62475] Updated weights for policy 0, policy_version 24380 (0.0006) [2023-03-06 21:31:19,350][62475] Updated weights for policy 0, policy_version 24390 (0.0006) [2023-03-06 21:31:20,174][62475] Updated weights for policy 0, policy_version 24400 (0.0007) [2023-03-06 21:31:20,947][62475] Updated weights for policy 0, policy_version 24410 (0.0006) [2023-03-06 21:31:21,746][62475] Updated weights for policy 0, policy_version 24420 (0.0006) [2023-03-06 21:31:22,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12711.5). Total num frames: 25013248. Throughput: 0: 12737.4. Samples: 25009432. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:31:22,390][62145] Avg episode reward: [(0, '621.115')] [2023-03-06 21:31:22,546][62475] Updated weights for policy 0, policy_version 24430 (0.0006) [2023-03-06 21:31:23,359][62475] Updated weights for policy 0, policy_version 24440 (0.0006) [2023-03-06 21:31:24,167][62475] Updated weights for policy 0, policy_version 24450 (0.0006) [2023-03-06 21:31:24,955][62475] Updated weights for policy 0, policy_version 24460 (0.0005) [2023-03-06 21:31:25,765][62475] Updated weights for policy 0, policy_version 24470 (0.0006) [2023-03-06 21:31:26,574][62475] Updated weights for policy 0, policy_version 24480 (0.0006) [2023-03-06 21:31:27,383][62475] Updated weights for policy 0, policy_version 24490 (0.0006) [2023-03-06 21:31:27,389][62145] Fps is (10 sec: 12800.2, 60 sec: 12731.7, 300 sec: 12711.5). Total num frames: 25077760. Throughput: 0: 12734.6. Samples: 25047779. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:31:27,390][62145] Avg episode reward: [(0, '673.211')] [2023-03-06 21:31:28,178][62475] Updated weights for policy 0, policy_version 24500 (0.0006) [2023-03-06 21:31:28,985][62475] Updated weights for policy 0, policy_version 24510 (0.0006) [2023-03-06 21:31:29,778][62475] Updated weights for policy 0, policy_version 24520 (0.0006) [2023-03-06 21:31:30,580][62475] Updated weights for policy 0, policy_version 24530 (0.0007) [2023-03-06 21:31:31,398][62475] Updated weights for policy 0, policy_version 24540 (0.0006) [2023-03-06 21:31:32,221][62475] Updated weights for policy 0, policy_version 24550 (0.0006) [2023-03-06 21:31:32,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12731.7, 300 sec: 12711.5). Total num frames: 25141248. Throughput: 0: 12728.5. Samples: 25124080. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:31:32,390][62145] Avg episode reward: [(0, '686.696')] [2023-03-06 21:31:33,040][62475] Updated weights for policy 0, policy_version 24560 (0.0006) [2023-03-06 21:31:33,835][62475] Updated weights for policy 0, policy_version 24570 (0.0006) [2023-03-06 21:31:34,633][62475] Updated weights for policy 0, policy_version 24580 (0.0006) [2023-03-06 21:31:35,434][62475] Updated weights for policy 0, policy_version 24590 (0.0006) [2023-03-06 21:31:36,241][62475] Updated weights for policy 0, policy_version 24600 (0.0006) [2023-03-06 21:31:37,025][62475] Updated weights for policy 0, policy_version 24610 (0.0006) [2023-03-06 21:31:37,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12711.5). Total num frames: 25204736. Throughput: 0: 12727.2. Samples: 25200287. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:31:37,390][62145] Avg episode reward: [(0, '722.566')] [2023-03-06 21:31:37,864][62475] Updated weights for policy 0, policy_version 24620 (0.0007) [2023-03-06 21:31:38,656][62475] Updated weights for policy 0, policy_version 24630 (0.0006) [2023-03-06 21:31:39,467][62475] Updated weights for policy 0, policy_version 24640 (0.0006) [2023-03-06 21:31:40,254][62475] Updated weights for policy 0, policy_version 24650 (0.0006) [2023-03-06 21:31:41,064][62475] Updated weights for policy 0, policy_version 24660 (0.0007) [2023-03-06 21:31:41,877][62475] Updated weights for policy 0, policy_version 24670 (0.0006) [2023-03-06 21:31:42,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12711.5). Total num frames: 25268224. Throughput: 0: 12730.0. Samples: 25238347. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:31:42,390][62145] Avg episode reward: [(0, '679.229')] [2023-03-06 21:31:42,667][62475] Updated weights for policy 0, policy_version 24680 (0.0007) [2023-03-06 21:31:43,474][62475] Updated weights for policy 0, policy_version 24690 (0.0006) [2023-03-06 21:31:44,279][62475] Updated weights for policy 0, policy_version 24700 (0.0006) [2023-03-06 21:31:45,066][62475] Updated weights for policy 0, policy_version 24710 (0.0006) [2023-03-06 21:31:45,882][62475] Updated weights for policy 0, policy_version 24720 (0.0007) [2023-03-06 21:31:46,681][62475] Updated weights for policy 0, policy_version 24730 (0.0006) [2023-03-06 21:31:47,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12715.0). Total num frames: 25331712. Throughput: 0: 12738.7. Samples: 25315037. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:31:47,390][62145] Avg episode reward: [(0, '776.586')] [2023-03-06 21:31:47,471][62475] Updated weights for policy 0, policy_version 24740 (0.0006) [2023-03-06 21:31:48,271][62475] Updated weights for policy 0, policy_version 24750 (0.0006) [2023-03-06 21:31:49,098][62475] Updated weights for policy 0, policy_version 24760 (0.0006) [2023-03-06 21:31:49,918][62475] Updated weights for policy 0, policy_version 24770 (0.0007) [2023-03-06 21:31:50,717][62475] Updated weights for policy 0, policy_version 24780 (0.0007) [2023-03-06 21:31:51,542][62475] Updated weights for policy 0, policy_version 24790 (0.0006) [2023-03-06 21:31:52,338][62475] Updated weights for policy 0, policy_version 24800 (0.0006) [2023-03-06 21:31:52,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12715.0). Total num frames: 25395200. Throughput: 0: 12728.3. Samples: 25391154. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:31:52,390][62145] Avg episode reward: [(0, '976.698')] [2023-03-06 21:31:53,137][62475] Updated weights for policy 0, policy_version 24810 (0.0006) [2023-03-06 21:31:53,935][62475] Updated weights for policy 0, policy_version 24820 (0.0006) [2023-03-06 21:31:54,777][62475] Updated weights for policy 0, policy_version 24830 (0.0006) [2023-03-06 21:31:55,559][62475] Updated weights for policy 0, policy_version 24840 (0.0006) [2023-03-06 21:31:56,363][62475] Updated weights for policy 0, policy_version 24850 (0.0007) [2023-03-06 21:31:57,175][62475] Updated weights for policy 0, policy_version 24860 (0.0006) [2023-03-06 21:31:57,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12715.0). Total num frames: 25458688. Throughput: 0: 12723.6. Samples: 25429052. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:31:57,390][62145] Avg episode reward: [(0, '779.244')] [2023-03-06 21:31:57,963][62475] Updated weights for policy 0, policy_version 24870 (0.0006) [2023-03-06 21:31:58,792][62475] Updated weights for policy 0, policy_version 24880 (0.0006) [2023-03-06 21:31:59,597][62475] Updated weights for policy 0, policy_version 24890 (0.0006) [2023-03-06 21:32:00,375][62475] Updated weights for policy 0, policy_version 24900 (0.0006) [2023-03-06 21:32:01,198][62475] Updated weights for policy 0, policy_version 24910 (0.0006) [2023-03-06 21:32:02,007][62475] Updated weights for policy 0, policy_version 24920 (0.0007) [2023-03-06 21:32:02,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12715.0). Total num frames: 25522176. Throughput: 0: 12729.4. Samples: 25505612. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:32:02,390][62145] Avg episode reward: [(0, '845.627')] [2023-03-06 21:32:02,825][62475] Updated weights for policy 0, policy_version 24930 (0.0006) [2023-03-06 21:32:03,625][62475] Updated weights for policy 0, policy_version 24940 (0.0006) [2023-03-06 21:32:04,410][62475] Updated weights for policy 0, policy_version 24950 (0.0006) [2023-03-06 21:32:05,230][62475] Updated weights for policy 0, policy_version 24960 (0.0006) [2023-03-06 21:32:06,030][62475] Updated weights for policy 0, policy_version 24970 (0.0006) [2023-03-06 21:32:06,824][62475] Updated weights for policy 0, policy_version 24980 (0.0005) [2023-03-06 21:32:07,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12715.0). Total num frames: 25585664. Throughput: 0: 12717.5. Samples: 25581721. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:32:07,390][62145] Avg episode reward: [(0, '689.566')] [2023-03-06 21:32:07,658][62475] Updated weights for policy 0, policy_version 24990 (0.0006) [2023-03-06 21:32:08,434][62475] Updated weights for policy 0, policy_version 25000 (0.0007) [2023-03-06 21:32:09,239][62475] Updated weights for policy 0, policy_version 25010 (0.0006) [2023-03-06 21:32:10,049][62475] Updated weights for policy 0, policy_version 25020 (0.0006) [2023-03-06 21:32:10,851][62475] Updated weights for policy 0, policy_version 25030 (0.0006) [2023-03-06 21:32:11,634][62475] Updated weights for policy 0, policy_version 25040 (0.0007) [2023-03-06 21:32:12,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12718.4). Total num frames: 25650176. Throughput: 0: 12714.0. Samples: 25619909. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:32:12,390][62145] Avg episode reward: [(0, '702.410')] [2023-03-06 21:32:12,449][62475] Updated weights for policy 0, policy_version 25050 (0.0006) [2023-03-06 21:32:13,250][62475] Updated weights for policy 0, policy_version 25060 (0.0006) [2023-03-06 21:32:14,042][62475] Updated weights for policy 0, policy_version 25070 (0.0006) [2023-03-06 21:32:14,875][62475] Updated weights for policy 0, policy_version 25080 (0.0006) [2023-03-06 21:32:15,677][62475] Updated weights for policy 0, policy_version 25090 (0.0006) [2023-03-06 21:32:16,470][62475] Updated weights for policy 0, policy_version 25100 (0.0006) [2023-03-06 21:32:17,283][62475] Updated weights for policy 0, policy_version 25110 (0.0006) [2023-03-06 21:32:17,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12718.4). Total num frames: 25713664. Throughput: 0: 12720.2. Samples: 25696491. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:32:17,390][62145] Avg episode reward: [(0, '862.709')] [2023-03-06 21:32:18,067][62475] Updated weights for policy 0, policy_version 25120 (0.0006) [2023-03-06 21:32:18,893][62475] Updated weights for policy 0, policy_version 25130 (0.0007) [2023-03-06 21:32:19,691][62475] Updated weights for policy 0, policy_version 25140 (0.0007) [2023-03-06 21:32:20,500][62475] Updated weights for policy 0, policy_version 25150 (0.0006) [2023-03-06 21:32:21,306][62475] Updated weights for policy 0, policy_version 25160 (0.0006) [2023-03-06 21:32:22,093][62475] Updated weights for policy 0, policy_version 25170 (0.0006) [2023-03-06 21:32:22,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12718.4). Total num frames: 25777152. Throughput: 0: 12724.5. Samples: 25772892. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:32:22,390][62145] Avg episode reward: [(0, '732.679')] [2023-03-06 21:32:22,393][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000025173_25777152.pth... [2023-03-06 21:32:22,425][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000022192_22724608.pth [2023-03-06 21:32:22,907][62475] Updated weights for policy 0, policy_version 25180 (0.0007) [2023-03-06 21:32:23,725][62475] Updated weights for policy 0, policy_version 25190 (0.0006) [2023-03-06 21:32:24,525][62475] Updated weights for policy 0, policy_version 25200 (0.0006) [2023-03-06 21:32:25,338][62475] Updated weights for policy 0, policy_version 25210 (0.0006) [2023-03-06 21:32:26,145][62475] Updated weights for policy 0, policy_version 25220 (0.0006) [2023-03-06 21:32:26,962][62475] Updated weights for policy 0, policy_version 25230 (0.0007) [2023-03-06 21:32:27,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12718.4). Total num frames: 25840640. Throughput: 0: 12723.3. Samples: 25810895. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:32:27,390][62145] Avg episode reward: [(0, '840.435')] [2023-03-06 21:32:27,766][62475] Updated weights for policy 0, policy_version 25240 (0.0006) [2023-03-06 21:32:28,575][62475] Updated weights for policy 0, policy_version 25250 (0.0006) [2023-03-06 21:32:29,377][62475] Updated weights for policy 0, policy_version 25260 (0.0006) [2023-03-06 21:32:30,166][62475] Updated weights for policy 0, policy_version 25270 (0.0006) [2023-03-06 21:32:30,969][62475] Updated weights for policy 0, policy_version 25280 (0.0005) [2023-03-06 21:32:31,770][62475] Updated weights for policy 0, policy_version 25290 (0.0006) [2023-03-06 21:32:32,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.6, 300 sec: 12718.4). Total num frames: 25904128. Throughput: 0: 12717.2. Samples: 25887312. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:32:32,390][62145] Avg episode reward: [(0, '798.878')] [2023-03-06 21:32:32,575][62475] Updated weights for policy 0, policy_version 25300 (0.0006) [2023-03-06 21:32:33,389][62475] Updated weights for policy 0, policy_version 25310 (0.0006) [2023-03-06 21:32:34,171][62475] Updated weights for policy 0, policy_version 25320 (0.0006) [2023-03-06 21:32:34,988][62475] Updated weights for policy 0, policy_version 25330 (0.0007) [2023-03-06 21:32:35,791][62475] Updated weights for policy 0, policy_version 25340 (0.0006) [2023-03-06 21:32:36,586][62475] Updated weights for policy 0, policy_version 25350 (0.0006) [2023-03-06 21:32:37,388][62475] Updated weights for policy 0, policy_version 25360 (0.0006) [2023-03-06 21:32:37,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12721.9). Total num frames: 25968640. Throughput: 0: 12726.9. Samples: 25963864. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:32:37,390][62145] Avg episode reward: [(0, '763.099')] [2023-03-06 21:32:38,208][62475] Updated weights for policy 0, policy_version 25370 (0.0006) [2023-03-06 21:32:39,008][62475] Updated weights for policy 0, policy_version 25380 (0.0006) [2023-03-06 21:32:39,809][62475] Updated weights for policy 0, policy_version 25390 (0.0007) [2023-03-06 21:32:40,619][62475] Updated weights for policy 0, policy_version 25400 (0.0006) [2023-03-06 21:32:41,413][62475] Updated weights for policy 0, policy_version 25410 (0.0006) [2023-03-06 21:32:42,211][62475] Updated weights for policy 0, policy_version 25420 (0.0006) [2023-03-06 21:32:42,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12731.7, 300 sec: 12721.9). Total num frames: 26032128. Throughput: 0: 12731.4. Samples: 26001966. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:32:42,390][62145] Avg episode reward: [(0, '796.566')] [2023-03-06 21:32:43,015][62475] Updated weights for policy 0, policy_version 25430 (0.0006) [2023-03-06 21:32:43,806][62475] Updated weights for policy 0, policy_version 25440 (0.0006) [2023-03-06 21:32:44,610][62475] Updated weights for policy 0, policy_version 25450 (0.0007) [2023-03-06 21:32:45,399][62475] Updated weights for policy 0, policy_version 25460 (0.0006) [2023-03-06 21:32:46,197][62475] Updated weights for policy 0, policy_version 25470 (0.0006) [2023-03-06 21:32:46,990][62475] Updated weights for policy 0, policy_version 25480 (0.0006) [2023-03-06 21:32:47,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12721.9). Total num frames: 26095616. Throughput: 0: 12738.9. Samples: 26078863. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:32:47,390][62145] Avg episode reward: [(0, '818.452')] [2023-03-06 21:32:47,809][62475] Updated weights for policy 0, policy_version 25490 (0.0006) [2023-03-06 21:32:48,605][62475] Updated weights for policy 0, policy_version 25500 (0.0006) [2023-03-06 21:32:49,415][62475] Updated weights for policy 0, policy_version 25510 (0.0006) [2023-03-06 21:32:50,225][62475] Updated weights for policy 0, policy_version 25520 (0.0006) [2023-03-06 21:32:51,040][62475] Updated weights for policy 0, policy_version 25530 (0.0006) [2023-03-06 21:32:51,830][62475] Updated weights for policy 0, policy_version 25540 (0.0006) [2023-03-06 21:32:52,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12725.4). Total num frames: 26160128. Throughput: 0: 12744.8. Samples: 26155235. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:32:52,390][62145] Avg episode reward: [(0, '799.578')] [2023-03-06 21:32:52,643][62475] Updated weights for policy 0, policy_version 25550 (0.0006) [2023-03-06 21:32:53,432][62475] Updated weights for policy 0, policy_version 25560 (0.0006) [2023-03-06 21:32:54,230][62475] Updated weights for policy 0, policy_version 25570 (0.0008) [2023-03-06 21:32:55,037][62475] Updated weights for policy 0, policy_version 25580 (0.0006) [2023-03-06 21:32:55,844][62475] Updated weights for policy 0, policy_version 25590 (0.0006) [2023-03-06 21:32:56,645][62475] Updated weights for policy 0, policy_version 25600 (0.0006) [2023-03-06 21:32:57,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12748.8, 300 sec: 12721.9). Total num frames: 26223616. Throughput: 0: 12748.7. Samples: 26193600. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:32:57,390][62145] Avg episode reward: [(0, '676.087')] [2023-03-06 21:32:57,455][62475] Updated weights for policy 0, policy_version 25610 (0.0006) [2023-03-06 21:32:58,277][62475] Updated weights for policy 0, policy_version 25620 (0.0007) [2023-03-06 21:32:59,068][62475] Updated weights for policy 0, policy_version 25630 (0.0006) [2023-03-06 21:32:59,876][62475] Updated weights for policy 0, policy_version 25640 (0.0006) [2023-03-06 21:33:00,685][62475] Updated weights for policy 0, policy_version 25650 (0.0006) [2023-03-06 21:33:01,498][62475] Updated weights for policy 0, policy_version 25660 (0.0007) [2023-03-06 21:33:02,294][62475] Updated weights for policy 0, policy_version 25670 (0.0006) [2023-03-06 21:33:02,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12748.8, 300 sec: 12721.9). Total num frames: 26287104. Throughput: 0: 12736.5. Samples: 26269632. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:33:02,390][62145] Avg episode reward: [(0, '719.390')] [2023-03-06 21:33:03,093][62475] Updated weights for policy 0, policy_version 25680 (0.0006) [2023-03-06 21:33:03,915][62475] Updated weights for policy 0, policy_version 25690 (0.0006) [2023-03-06 21:33:04,716][62475] Updated weights for policy 0, policy_version 25700 (0.0006) [2023-03-06 21:33:05,538][62475] Updated weights for policy 0, policy_version 25710 (0.0007) [2023-03-06 21:33:06,322][62475] Updated weights for policy 0, policy_version 25720 (0.0007) [2023-03-06 21:33:07,110][62475] Updated weights for policy 0, policy_version 25730 (0.0007) [2023-03-06 21:33:07,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12748.8, 300 sec: 12721.9). Total num frames: 26350592. Throughput: 0: 12736.7. Samples: 26346043. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:33:07,390][62145] Avg episode reward: [(0, '953.733')] [2023-03-06 21:33:07,930][62475] Updated weights for policy 0, policy_version 25740 (0.0006) [2023-03-06 21:33:08,741][62475] Updated weights for policy 0, policy_version 25750 (0.0006) [2023-03-06 21:33:09,538][62475] Updated weights for policy 0, policy_version 25760 (0.0007) [2023-03-06 21:33:10,363][62475] Updated weights for policy 0, policy_version 25770 (0.0006) [2023-03-06 21:33:11,166][62475] Updated weights for policy 0, policy_version 25780 (0.0006) [2023-03-06 21:33:11,966][62475] Updated weights for policy 0, policy_version 25790 (0.0006) [2023-03-06 21:33:12,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 26414080. Throughput: 0: 12735.7. Samples: 26384003. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:33:12,390][62145] Avg episode reward: [(0, '706.422')] [2023-03-06 21:33:12,770][62475] Updated weights for policy 0, policy_version 25800 (0.0006) [2023-03-06 21:33:13,585][62475] Updated weights for policy 0, policy_version 25810 (0.0006) [2023-03-06 21:33:14,386][62475] Updated weights for policy 0, policy_version 25820 (0.0007) [2023-03-06 21:33:15,198][62475] Updated weights for policy 0, policy_version 25830 (0.0006) [2023-03-06 21:33:15,994][62475] Updated weights for policy 0, policy_version 25840 (0.0006) [2023-03-06 21:33:16,797][62475] Updated weights for policy 0, policy_version 25850 (0.0006) [2023-03-06 21:33:17,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12721.9). Total num frames: 26477568. Throughput: 0: 12733.9. Samples: 26460338. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:33:17,390][62145] Avg episode reward: [(0, '731.633')] [2023-03-06 21:33:17,614][62475] Updated weights for policy 0, policy_version 25860 (0.0006) [2023-03-06 21:33:18,411][62475] Updated weights for policy 0, policy_version 25870 (0.0006) [2023-03-06 21:33:19,237][62475] Updated weights for policy 0, policy_version 25880 (0.0006) [2023-03-06 21:33:20,047][62475] Updated weights for policy 0, policy_version 25890 (0.0006) [2023-03-06 21:33:20,842][62475] Updated weights for policy 0, policy_version 25900 (0.0006) [2023-03-06 21:33:21,639][62475] Updated weights for policy 0, policy_version 25910 (0.0006) [2023-03-06 21:33:22,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.8, 300 sec: 12721.9). Total num frames: 26541056. Throughput: 0: 12723.8. Samples: 26536435. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:33:22,390][62145] Avg episode reward: [(0, '910.541')] [2023-03-06 21:33:22,437][62475] Updated weights for policy 0, policy_version 25920 (0.0006) [2023-03-06 21:33:23,252][62475] Updated weights for policy 0, policy_version 25930 (0.0006) [2023-03-06 21:33:24,063][62475] Updated weights for policy 0, policy_version 25940 (0.0006) [2023-03-06 21:33:24,877][62475] Updated weights for policy 0, policy_version 25950 (0.0006) [2023-03-06 21:33:25,662][62475] Updated weights for policy 0, policy_version 25960 (0.0006) [2023-03-06 21:33:26,497][62475] Updated weights for policy 0, policy_version 25970 (0.0007) [2023-03-06 21:33:27,317][62475] Updated weights for policy 0, policy_version 25980 (0.0006) [2023-03-06 21:33:27,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12721.9). Total num frames: 26604544. Throughput: 0: 12724.3. Samples: 26574558. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:33:27,390][62145] Avg episode reward: [(0, '878.558')] [2023-03-06 21:33:28,116][62475] Updated weights for policy 0, policy_version 25990 (0.0006) [2023-03-06 21:33:28,919][62475] Updated weights for policy 0, policy_version 26000 (0.0006) [2023-03-06 21:33:29,719][62475] Updated weights for policy 0, policy_version 26010 (0.0006) [2023-03-06 21:33:30,510][62475] Updated weights for policy 0, policy_version 26020 (0.0007) [2023-03-06 21:33:31,334][62475] Updated weights for policy 0, policy_version 26030 (0.0006) [2023-03-06 21:33:32,137][62475] Updated weights for policy 0, policy_version 26040 (0.0006) [2023-03-06 21:33:32,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12721.9). Total num frames: 26668032. Throughput: 0: 12710.0. Samples: 26650816. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:33:32,390][62145] Avg episode reward: [(0, '838.206')] [2023-03-06 21:33:32,954][62475] Updated weights for policy 0, policy_version 26050 (0.0006) [2023-03-06 21:33:33,751][62475] Updated weights for policy 0, policy_version 26060 (0.0007) [2023-03-06 21:33:34,565][62475] Updated weights for policy 0, policy_version 26070 (0.0006) [2023-03-06 21:33:35,383][62475] Updated weights for policy 0, policy_version 26080 (0.0006) [2023-03-06 21:33:36,170][62475] Updated weights for policy 0, policy_version 26090 (0.0006) [2023-03-06 21:33:36,975][62475] Updated weights for policy 0, policy_version 26100 (0.0006) [2023-03-06 21:33:37,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 26731520. Throughput: 0: 12700.9. Samples: 26726777. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:33:37,390][62145] Avg episode reward: [(0, '811.380')] [2023-03-06 21:33:37,789][62475] Updated weights for policy 0, policy_version 26110 (0.0006) [2023-03-06 21:33:38,595][62475] Updated weights for policy 0, policy_version 26120 (0.0006) [2023-03-06 21:33:39,401][62475] Updated weights for policy 0, policy_version 26130 (0.0006) [2023-03-06 21:33:40,185][62475] Updated weights for policy 0, policy_version 26140 (0.0006) [2023-03-06 21:33:40,426][62424] KL-divergence is very high: 136.9964 [2023-03-06 21:33:40,987][62475] Updated weights for policy 0, policy_version 26150 (0.0006) [2023-03-06 21:33:41,774][62475] Updated weights for policy 0, policy_version 26160 (0.0006) [2023-03-06 21:33:42,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.6, 300 sec: 12721.9). Total num frames: 26795008. Throughput: 0: 12701.3. Samples: 26765158. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:33:42,390][62145] Avg episode reward: [(0, '637.777')] [2023-03-06 21:33:42,580][62475] Updated weights for policy 0, policy_version 26170 (0.0006) [2023-03-06 21:33:43,371][62475] Updated weights for policy 0, policy_version 26180 (0.0006) [2023-03-06 21:33:43,768][62424] KL-divergence is very high: 4174.2236 [2023-03-06 21:33:44,175][62475] Updated weights for policy 0, policy_version 26190 (0.0007) [2023-03-06 21:33:44,986][62475] Updated weights for policy 0, policy_version 26200 (0.0006) [2023-03-06 21:33:45,795][62475] Updated weights for policy 0, policy_version 26210 (0.0006) [2023-03-06 21:33:46,598][62475] Updated weights for policy 0, policy_version 26220 (0.0007) [2023-03-06 21:33:47,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.6, 300 sec: 12721.9). Total num frames: 26858496. Throughput: 0: 12712.9. Samples: 26841716. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:33:47,390][62145] Avg episode reward: [(0, '779.185')] [2023-03-06 21:33:47,416][62475] Updated weights for policy 0, policy_version 26230 (0.0006) [2023-03-06 21:33:48,238][62475] Updated weights for policy 0, policy_version 26240 (0.0006) [2023-03-06 21:33:49,027][62475] Updated weights for policy 0, policy_version 26250 (0.0006) [2023-03-06 21:33:49,829][62475] Updated weights for policy 0, policy_version 26260 (0.0006) [2023-03-06 21:33:50,619][62475] Updated weights for policy 0, policy_version 26270 (0.0006) [2023-03-06 21:33:51,430][62475] Updated weights for policy 0, policy_version 26280 (0.0006) [2023-03-06 21:33:52,221][62475] Updated weights for policy 0, policy_version 26290 (0.0006) [2023-03-06 21:33:52,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12714.7, 300 sec: 12728.8). Total num frames: 26923008. Throughput: 0: 12716.5. Samples: 26918286. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:33:52,390][62145] Avg episode reward: [(0, '738.412')] [2023-03-06 21:33:53,029][62475] Updated weights for policy 0, policy_version 26300 (0.0006) [2023-03-06 21:33:53,847][62475] Updated weights for policy 0, policy_version 26310 (0.0006) [2023-03-06 21:33:54,657][62475] Updated weights for policy 0, policy_version 26320 (0.0006) [2023-03-06 21:33:55,457][62475] Updated weights for policy 0, policy_version 26330 (0.0007) [2023-03-06 21:33:56,264][62475] Updated weights for policy 0, policy_version 26340 (0.0006) [2023-03-06 21:33:57,061][62475] Updated weights for policy 0, policy_version 26350 (0.0006) [2023-03-06 21:33:57,390][62145] Fps is (10 sec: 12800.1, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 26986496. Throughput: 0: 12714.8. Samples: 26956168. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:33:57,390][62145] Avg episode reward: [(0, '770.865')] [2023-03-06 21:33:57,876][62475] Updated weights for policy 0, policy_version 26360 (0.0007) [2023-03-06 21:33:58,682][62475] Updated weights for policy 0, policy_version 26370 (0.0006) [2023-03-06 21:33:59,490][62475] Updated weights for policy 0, policy_version 26380 (0.0006) [2023-03-06 21:34:00,289][62475] Updated weights for policy 0, policy_version 26390 (0.0006) [2023-03-06 21:34:01,098][62475] Updated weights for policy 0, policy_version 26400 (0.0006) [2023-03-06 21:34:01,892][62475] Updated weights for policy 0, policy_version 26410 (0.0007) [2023-03-06 21:34:02,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 27049984. Throughput: 0: 12713.5. Samples: 27032446. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:34:02,390][62145] Avg episode reward: [(0, '638.853')] [2023-03-06 21:34:02,695][62475] Updated weights for policy 0, policy_version 26420 (0.0007) [2023-03-06 21:34:03,506][62475] Updated weights for policy 0, policy_version 26430 (0.0006) [2023-03-06 21:34:04,327][62475] Updated weights for policy 0, policy_version 26440 (0.0006) [2023-03-06 21:34:05,123][62475] Updated weights for policy 0, policy_version 26450 (0.0006) [2023-03-06 21:34:05,923][62475] Updated weights for policy 0, policy_version 26460 (0.0006) [2023-03-06 21:34:06,724][62475] Updated weights for policy 0, policy_version 26470 (0.0006) [2023-03-06 21:34:07,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 27113472. Throughput: 0: 12721.7. Samples: 27108911. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:34:07,390][62145] Avg episode reward: [(0, '538.111')] [2023-03-06 21:34:07,526][62475] Updated weights for policy 0, policy_version 26480 (0.0007) [2023-03-06 21:34:08,322][62475] Updated weights for policy 0, policy_version 26490 (0.0006) [2023-03-06 21:34:09,124][62475] Updated weights for policy 0, policy_version 26500 (0.0006) [2023-03-06 21:34:09,938][62475] Updated weights for policy 0, policy_version 26510 (0.0006) [2023-03-06 21:34:10,734][62475] Updated weights for policy 0, policy_version 26520 (0.0006) [2023-03-06 21:34:11,561][62475] Updated weights for policy 0, policy_version 26530 (0.0007) [2023-03-06 21:34:12,352][62475] Updated weights for policy 0, policy_version 26540 (0.0006) [2023-03-06 21:34:12,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 27176960. Throughput: 0: 12726.5. Samples: 27147249. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:34:12,390][62145] Avg episode reward: [(0, '692.717')] [2023-03-06 21:34:13,154][62475] Updated weights for policy 0, policy_version 26550 (0.0007) [2023-03-06 21:34:13,974][62475] Updated weights for policy 0, policy_version 26560 (0.0006) [2023-03-06 21:34:14,765][62475] Updated weights for policy 0, policy_version 26570 (0.0006) [2023-03-06 21:34:15,567][62475] Updated weights for policy 0, policy_version 26580 (0.0008) [2023-03-06 21:34:16,370][62475] Updated weights for policy 0, policy_version 26590 (0.0006) [2023-03-06 21:34:17,180][62475] Updated weights for policy 0, policy_version 26600 (0.0007) [2023-03-06 21:34:17,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 27240448. Throughput: 0: 12729.8. Samples: 27223658. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:34:17,390][62145] Avg episode reward: [(0, '623.713')] [2023-03-06 21:34:17,976][62475] Updated weights for policy 0, policy_version 26610 (0.0006) [2023-03-06 21:34:18,776][62475] Updated weights for policy 0, policy_version 26620 (0.0006) [2023-03-06 21:34:19,578][62475] Updated weights for policy 0, policy_version 26630 (0.0006) [2023-03-06 21:34:20,390][62475] Updated weights for policy 0, policy_version 26640 (0.0006) [2023-03-06 21:34:21,193][62475] Updated weights for policy 0, policy_version 26650 (0.0006) [2023-03-06 21:34:22,014][62475] Updated weights for policy 0, policy_version 26660 (0.0006) [2023-03-06 21:34:22,390][62145] Fps is (10 sec: 12697.4, 60 sec: 12714.6, 300 sec: 12725.4). Total num frames: 27303936. Throughput: 0: 12736.5. Samples: 27299921. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:34:22,390][62145] Avg episode reward: [(0, '709.079')] [2023-03-06 21:34:22,394][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000026665_27304960.pth... [2023-03-06 21:34:22,426][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000023682_24250368.pth [2023-03-06 21:34:22,805][62475] Updated weights for policy 0, policy_version 26670 (0.0006) [2023-03-06 21:34:23,618][62475] Updated weights for policy 0, policy_version 26680 (0.0006) [2023-03-06 21:34:24,422][62475] Updated weights for policy 0, policy_version 26690 (0.0006) [2023-03-06 21:34:25,217][62475] Updated weights for policy 0, policy_version 26700 (0.0006) [2023-03-06 21:34:26,026][62475] Updated weights for policy 0, policy_version 26710 (0.0006) [2023-03-06 21:34:26,831][62475] Updated weights for policy 0, policy_version 26720 (0.0006) [2023-03-06 21:34:27,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 27367424. Throughput: 0: 12730.4. Samples: 27338024. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:34:27,390][62145] Avg episode reward: [(0, '644.333')] [2023-03-06 21:34:27,631][62475] Updated weights for policy 0, policy_version 26730 (0.0007) [2023-03-06 21:34:28,433][62475] Updated weights for policy 0, policy_version 26740 (0.0007) [2023-03-06 21:34:29,244][62475] Updated weights for policy 0, policy_version 26750 (0.0007) [2023-03-06 21:34:30,043][62475] Updated weights for policy 0, policy_version 26760 (0.0006) [2023-03-06 21:34:30,857][62475] Updated weights for policy 0, policy_version 26770 (0.0006) [2023-03-06 21:34:31,657][62475] Updated weights for policy 0, policy_version 26780 (0.0006) [2023-03-06 21:34:32,390][62145] Fps is (10 sec: 12800.1, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 27431936. Throughput: 0: 12727.3. Samples: 27414443. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:34:32,390][62145] Avg episode reward: [(0, '615.695')] [2023-03-06 21:34:32,438][62475] Updated weights for policy 0, policy_version 26790 (0.0007) [2023-03-06 21:34:33,238][62475] Updated weights for policy 0, policy_version 26800 (0.0006) [2023-03-06 21:34:34,058][62475] Updated weights for policy 0, policy_version 26810 (0.0006) [2023-03-06 21:34:34,858][62475] Updated weights for policy 0, policy_version 26820 (0.0006) [2023-03-06 21:34:35,657][62475] Updated weights for policy 0, policy_version 26830 (0.0006) [2023-03-06 21:34:36,461][62475] Updated weights for policy 0, policy_version 26840 (0.0006) [2023-03-06 21:34:37,257][62475] Updated weights for policy 0, policy_version 26850 (0.0007) [2023-03-06 21:34:37,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 27495424. Throughput: 0: 12731.7. Samples: 27491214. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:34:37,390][62145] Avg episode reward: [(0, '597.985')] [2023-03-06 21:34:38,066][62475] Updated weights for policy 0, policy_version 26860 (0.0007) [2023-03-06 21:34:38,887][62475] Updated weights for policy 0, policy_version 26870 (0.0007) [2023-03-06 21:34:39,681][62475] Updated weights for policy 0, policy_version 26880 (0.0006) [2023-03-06 21:34:40,488][62475] Updated weights for policy 0, policy_version 26890 (0.0007) [2023-03-06 21:34:41,299][62475] Updated weights for policy 0, policy_version 26900 (0.0006) [2023-03-06 21:34:42,105][62475] Updated weights for policy 0, policy_version 26910 (0.0006) [2023-03-06 21:34:42,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 27558912. Throughput: 0: 12735.9. Samples: 27529282. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:34:42,390][62145] Avg episode reward: [(0, '477.067')] [2023-03-06 21:34:42,929][62475] Updated weights for policy 0, policy_version 26920 (0.0007) [2023-03-06 21:34:43,712][62475] Updated weights for policy 0, policy_version 26930 (0.0006) [2023-03-06 21:34:44,526][62475] Updated weights for policy 0, policy_version 26940 (0.0006) [2023-03-06 21:34:45,332][62475] Updated weights for policy 0, policy_version 26950 (0.0007) [2023-03-06 21:34:46,141][62475] Updated weights for policy 0, policy_version 26960 (0.0006) [2023-03-06 21:34:46,939][62475] Updated weights for policy 0, policy_version 26970 (0.0006) [2023-03-06 21:34:47,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 27622400. Throughput: 0: 12735.4. Samples: 27605541. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:34:47,390][62145] Avg episode reward: [(0, '678.412')] [2023-03-06 21:34:47,737][62475] Updated weights for policy 0, policy_version 26980 (0.0006) [2023-03-06 21:34:48,556][62475] Updated weights for policy 0, policy_version 26990 (0.0006) [2023-03-06 21:34:49,358][62475] Updated weights for policy 0, policy_version 27000 (0.0006) [2023-03-06 21:34:50,162][62475] Updated weights for policy 0, policy_version 27010 (0.0006) [2023-03-06 21:34:50,974][62475] Updated weights for policy 0, policy_version 27020 (0.0006) [2023-03-06 21:34:51,773][62475] Updated weights for policy 0, policy_version 27030 (0.0006) [2023-03-06 21:34:52,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 27685888. Throughput: 0: 12731.4. Samples: 27681826. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:34:52,390][62145] Avg episode reward: [(0, '625.096')] [2023-03-06 21:34:52,566][62475] Updated weights for policy 0, policy_version 27040 (0.0006) [2023-03-06 21:34:53,388][62475] Updated weights for policy 0, policy_version 27050 (0.0006) [2023-03-06 21:34:54,205][62475] Updated weights for policy 0, policy_version 27060 (0.0006) [2023-03-06 21:34:55,012][62475] Updated weights for policy 0, policy_version 27070 (0.0006) [2023-03-06 21:34:55,820][62475] Updated weights for policy 0, policy_version 27080 (0.0006) [2023-03-06 21:34:56,625][62475] Updated weights for policy 0, policy_version 27090 (0.0006) [2023-03-06 21:34:57,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 27749376. Throughput: 0: 12720.2. Samples: 27719661. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:34:57,390][62145] Avg episode reward: [(0, '790.408')] [2023-03-06 21:34:57,450][62475] Updated weights for policy 0, policy_version 27100 (0.0006) [2023-03-06 21:34:58,257][62475] Updated weights for policy 0, policy_version 27110 (0.0006) [2023-03-06 21:34:59,038][62475] Updated weights for policy 0, policy_version 27120 (0.0006) [2023-03-06 21:34:59,858][62475] Updated weights for policy 0, policy_version 27130 (0.0006) [2023-03-06 21:35:00,646][62475] Updated weights for policy 0, policy_version 27140 (0.0006) [2023-03-06 21:35:01,458][62475] Updated weights for policy 0, policy_version 27150 (0.0006) [2023-03-06 21:35:02,266][62475] Updated weights for policy 0, policy_version 27160 (0.0006) [2023-03-06 21:35:02,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 27812864. Throughput: 0: 12718.3. Samples: 27795979. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:35:02,390][62145] Avg episode reward: [(0, '618.395')] [2023-03-06 21:35:03,070][62475] Updated weights for policy 0, policy_version 27170 (0.0006) [2023-03-06 21:35:03,876][62475] Updated weights for policy 0, policy_version 27180 (0.0006) [2023-03-06 21:35:04,688][62475] Updated weights for policy 0, policy_version 27190 (0.0006) [2023-03-06 21:35:05,475][62475] Updated weights for policy 0, policy_version 27200 (0.0007) [2023-03-06 21:35:06,277][62475] Updated weights for policy 0, policy_version 27210 (0.0007) [2023-03-06 21:35:07,083][62475] Updated weights for policy 0, policy_version 27220 (0.0007) [2023-03-06 21:35:07,390][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 27876352. Throughput: 0: 12719.9. Samples: 27872316. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:35:07,390][62145] Avg episode reward: [(0, '687.121')] [2023-03-06 21:35:07,881][62475] Updated weights for policy 0, policy_version 27230 (0.0006) [2023-03-06 21:35:08,688][62475] Updated weights for policy 0, policy_version 27240 (0.0006) [2023-03-06 21:35:09,491][62475] Updated weights for policy 0, policy_version 27250 (0.0006) [2023-03-06 21:35:10,262][62475] Updated weights for policy 0, policy_version 27260 (0.0006) [2023-03-06 21:35:11,099][62475] Updated weights for policy 0, policy_version 27270 (0.0006) [2023-03-06 21:35:11,907][62475] Updated weights for policy 0, policy_version 27280 (0.0006) [2023-03-06 21:35:12,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 27940864. Throughput: 0: 12726.3. Samples: 27910708. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:35:12,390][62145] Avg episode reward: [(0, '674.723')] [2023-03-06 21:35:12,710][62475] Updated weights for policy 0, policy_version 27290 (0.0006) [2023-03-06 21:35:13,513][62475] Updated weights for policy 0, policy_version 27300 (0.0007) [2023-03-06 21:35:14,317][62475] Updated weights for policy 0, policy_version 27310 (0.0005) [2023-03-06 21:35:15,115][62475] Updated weights for policy 0, policy_version 27320 (0.0006) [2023-03-06 21:35:15,901][62475] Updated weights for policy 0, policy_version 27330 (0.0006) [2023-03-06 21:35:16,722][62475] Updated weights for policy 0, policy_version 27340 (0.0007) [2023-03-06 21:35:17,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 28004352. Throughput: 0: 12727.1. Samples: 27987162. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:35:17,390][62145] Avg episode reward: [(0, '790.571')] [2023-03-06 21:35:17,514][62475] Updated weights for policy 0, policy_version 27350 (0.0006) [2023-03-06 21:35:18,326][62475] Updated weights for policy 0, policy_version 27360 (0.0006) [2023-03-06 21:35:19,133][62475] Updated weights for policy 0, policy_version 27370 (0.0006) [2023-03-06 21:35:19,937][62475] Updated weights for policy 0, policy_version 27380 (0.0006) [2023-03-06 21:35:20,752][62475] Updated weights for policy 0, policy_version 27390 (0.0006) [2023-03-06 21:35:21,546][62475] Updated weights for policy 0, policy_version 27400 (0.0007) [2023-03-06 21:35:22,359][62475] Updated weights for policy 0, policy_version 27410 (0.0006) [2023-03-06 21:35:22,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.8, 300 sec: 12725.4). Total num frames: 28067840. Throughput: 0: 12717.4. Samples: 28063494. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:35:22,390][62145] Avg episode reward: [(0, '684.060')] [2023-03-06 21:35:23,157][62475] Updated weights for policy 0, policy_version 27420 (0.0006) [2023-03-06 21:35:23,973][62475] Updated weights for policy 0, policy_version 27430 (0.0007) [2023-03-06 21:35:24,761][62475] Updated weights for policy 0, policy_version 27440 (0.0006) [2023-03-06 21:35:25,574][62475] Updated weights for policy 0, policy_version 27450 (0.0006) [2023-03-06 21:35:26,386][62475] Updated weights for policy 0, policy_version 27460 (0.0006) [2023-03-06 21:35:27,205][62475] Updated weights for policy 0, policy_version 27470 (0.0006) [2023-03-06 21:35:27,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 28131328. Throughput: 0: 12719.2. Samples: 28101644. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:35:27,390][62145] Avg episode reward: [(0, '644.075')] [2023-03-06 21:35:28,014][62475] Updated weights for policy 0, policy_version 27480 (0.0006) [2023-03-06 21:35:28,830][62475] Updated weights for policy 0, policy_version 27490 (0.0006) [2023-03-06 21:35:29,652][62475] Updated weights for policy 0, policy_version 27500 (0.0006) [2023-03-06 21:35:30,443][62475] Updated weights for policy 0, policy_version 27510 (0.0006) [2023-03-06 21:35:31,253][62475] Updated weights for policy 0, policy_version 27520 (0.0006) [2023-03-06 21:35:32,059][62475] Updated weights for policy 0, policy_version 27530 (0.0006) [2023-03-06 21:35:32,390][62145] Fps is (10 sec: 12595.1, 60 sec: 12697.6, 300 sec: 12721.9). Total num frames: 28193792. Throughput: 0: 12710.3. Samples: 28177505. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:35:32,390][62145] Avg episode reward: [(0, '512.133')] [2023-03-06 21:35:32,885][62475] Updated weights for policy 0, policy_version 27540 (0.0006) [2023-03-06 21:35:33,683][62475] Updated weights for policy 0, policy_version 27550 (0.0006) [2023-03-06 21:35:34,490][62475] Updated weights for policy 0, policy_version 27560 (0.0005) [2023-03-06 21:35:35,296][62475] Updated weights for policy 0, policy_version 27570 (0.0007) [2023-03-06 21:35:36,101][62475] Updated weights for policy 0, policy_version 27580 (0.0006) [2023-03-06 21:35:36,905][62475] Updated weights for policy 0, policy_version 27590 (0.0006) [2023-03-06 21:35:37,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 28258304. Throughput: 0: 12706.9. Samples: 28253637. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:35:37,390][62145] Avg episode reward: [(0, '716.090')] [2023-03-06 21:35:37,698][62475] Updated weights for policy 0, policy_version 27600 (0.0006) [2023-03-06 21:35:38,497][62475] Updated weights for policy 0, policy_version 27610 (0.0006) [2023-03-06 21:35:39,310][62475] Updated weights for policy 0, policy_version 27620 (0.0006) [2023-03-06 21:35:40,119][62475] Updated weights for policy 0, policy_version 27630 (0.0007) [2023-03-06 21:35:40,921][62475] Updated weights for policy 0, policy_version 27640 (0.0006) [2023-03-06 21:35:41,733][62475] Updated weights for policy 0, policy_version 27650 (0.0007) [2023-03-06 21:35:42,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12714.7, 300 sec: 12721.9). Total num frames: 28321792. Throughput: 0: 12714.2. Samples: 28291801. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:35:42,390][62145] Avg episode reward: [(0, '969.495')] [2023-03-06 21:35:42,524][62475] Updated weights for policy 0, policy_version 27660 (0.0007) [2023-03-06 21:35:43,338][62475] Updated weights for policy 0, policy_version 27670 (0.0007) [2023-03-06 21:35:44,136][62475] Updated weights for policy 0, policy_version 27680 (0.0006) [2023-03-06 21:35:44,943][62475] Updated weights for policy 0, policy_version 27690 (0.0006) [2023-03-06 21:35:45,742][62475] Updated weights for policy 0, policy_version 27700 (0.0006) [2023-03-06 21:35:46,557][62475] Updated weights for policy 0, policy_version 27710 (0.0006) [2023-03-06 21:35:47,366][62475] Updated weights for policy 0, policy_version 27720 (0.0006) [2023-03-06 21:35:47,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 28385280. Throughput: 0: 12714.7. Samples: 28368142. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:35:47,390][62145] Avg episode reward: [(0, '899.817')] [2023-03-06 21:35:48,154][62475] Updated weights for policy 0, policy_version 27730 (0.0006) [2023-03-06 21:35:48,969][62475] Updated weights for policy 0, policy_version 27740 (0.0006) [2023-03-06 21:35:49,762][62475] Updated weights for policy 0, policy_version 27750 (0.0006) [2023-03-06 21:35:50,567][62475] Updated weights for policy 0, policy_version 27760 (0.0006) [2023-03-06 21:35:51,377][62475] Updated weights for policy 0, policy_version 27770 (0.0006) [2023-03-06 21:35:52,173][62475] Updated weights for policy 0, policy_version 27780 (0.0006) [2023-03-06 21:35:52,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 28448768. Throughput: 0: 12719.5. Samples: 28444695. Policy #0 lag: (min: 0.0, avg: 0.9, max: 2.0) [2023-03-06 21:35:52,390][62145] Avg episode reward: [(0, '759.683')] [2023-03-06 21:35:52,954][62475] Updated weights for policy 0, policy_version 27790 (0.0006) [2023-03-06 21:35:53,771][62475] Updated weights for policy 0, policy_version 27800 (0.0006) [2023-03-06 21:35:54,581][62475] Updated weights for policy 0, policy_version 27810 (0.0006) [2023-03-06 21:35:55,385][62475] Updated weights for policy 0, policy_version 27820 (0.0006) [2023-03-06 21:35:56,201][62475] Updated weights for policy 0, policy_version 27830 (0.0007) [2023-03-06 21:35:56,996][62475] Updated weights for policy 0, policy_version 27840 (0.0006) [2023-03-06 21:35:57,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12721.9). Total num frames: 28512256. Throughput: 0: 12716.3. Samples: 28482943. Policy #0 lag: (min: 0.0, avg: 0.9, max: 2.0) [2023-03-06 21:35:57,390][62145] Avg episode reward: [(0, '730.357')] [2023-03-06 21:35:57,802][62475] Updated weights for policy 0, policy_version 27850 (0.0006) [2023-03-06 21:35:58,606][62475] Updated weights for policy 0, policy_version 27860 (0.0006) [2023-03-06 21:35:59,431][62475] Updated weights for policy 0, policy_version 27870 (0.0006) [2023-03-06 21:36:00,226][62475] Updated weights for policy 0, policy_version 27880 (0.0006) [2023-03-06 21:36:01,041][62475] Updated weights for policy 0, policy_version 27890 (0.0006) [2023-03-06 21:36:01,838][62475] Updated weights for policy 0, policy_version 27900 (0.0006) [2023-03-06 21:36:02,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12721.9). Total num frames: 28575744. Throughput: 0: 12709.6. Samples: 28559095. Policy #0 lag: (min: 0.0, avg: 0.9, max: 2.0) [2023-03-06 21:36:02,390][62145] Avg episode reward: [(0, '714.564')] [2023-03-06 21:36:02,660][62475] Updated weights for policy 0, policy_version 27910 (0.0006) [2023-03-06 21:36:03,441][62475] Updated weights for policy 0, policy_version 27920 (0.0006) [2023-03-06 21:36:04,270][62475] Updated weights for policy 0, policy_version 27930 (0.0005) [2023-03-06 21:36:05,062][62475] Updated weights for policy 0, policy_version 27940 (0.0006) [2023-03-06 21:36:05,866][62475] Updated weights for policy 0, policy_version 27950 (0.0006) [2023-03-06 21:36:06,600][62424] KL-divergence is very high: 5177.4341 [2023-03-06 21:36:06,700][62475] Updated weights for policy 0, policy_version 27960 (0.0006) [2023-03-06 21:36:07,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12721.9). Total num frames: 28639232. Throughput: 0: 12701.3. Samples: 28635056. Policy #0 lag: (min: 0.0, avg: 0.9, max: 2.0) [2023-03-06 21:36:07,390][62145] Avg episode reward: [(0, '684.259')] [2023-03-06 21:36:07,499][62475] Updated weights for policy 0, policy_version 27970 (0.0007) [2023-03-06 21:36:08,311][62475] Updated weights for policy 0, policy_version 27980 (0.0006) [2023-03-06 21:36:09,132][62475] Updated weights for policy 0, policy_version 27990 (0.0006) [2023-03-06 21:36:09,940][62475] Updated weights for policy 0, policy_version 28000 (0.0007) [2023-03-06 21:36:10,741][62475] Updated weights for policy 0, policy_version 28010 (0.0006) [2023-03-06 21:36:11,557][62475] Updated weights for policy 0, policy_version 28020 (0.0006) [2023-03-06 21:36:12,340][62475] Updated weights for policy 0, policy_version 28030 (0.0006) [2023-03-06 21:36:12,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12721.9). Total num frames: 28702720. Throughput: 0: 12699.3. Samples: 28673112. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 21:36:12,390][62145] Avg episode reward: [(0, '799.526')] [2023-03-06 21:36:13,137][62475] Updated weights for policy 0, policy_version 28040 (0.0006) [2023-03-06 21:36:13,956][62475] Updated weights for policy 0, policy_version 28050 (0.0006) [2023-03-06 21:36:14,771][62475] Updated weights for policy 0, policy_version 28060 (0.0006) [2023-03-06 21:36:15,569][62475] Updated weights for policy 0, policy_version 28070 (0.0006) [2023-03-06 21:36:16,368][62475] Updated weights for policy 0, policy_version 28080 (0.0006) [2023-03-06 21:36:17,186][62475] Updated weights for policy 0, policy_version 28090 (0.0007) [2023-03-06 21:36:17,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12721.9). Total num frames: 28766208. Throughput: 0: 12706.6. Samples: 28749301. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 21:36:17,390][62145] Avg episode reward: [(0, '834.026')] [2023-03-06 21:36:17,995][62475] Updated weights for policy 0, policy_version 28100 (0.0006) [2023-03-06 21:36:18,787][62475] Updated weights for policy 0, policy_version 28110 (0.0007) [2023-03-06 21:36:19,592][62475] Updated weights for policy 0, policy_version 28120 (0.0007) [2023-03-06 21:36:20,370][62475] Updated weights for policy 0, policy_version 28130 (0.0006) [2023-03-06 21:36:21,201][62475] Updated weights for policy 0, policy_version 28140 (0.0006) [2023-03-06 21:36:21,985][62475] Updated weights for policy 0, policy_version 28150 (0.0006) [2023-03-06 21:36:22,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12714.6, 300 sec: 12721.9). Total num frames: 28830720. Throughput: 0: 12716.7. Samples: 28825890. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 21:36:22,390][62145] Avg episode reward: [(0, '870.409')] [2023-03-06 21:36:22,394][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000028155_28830720.pth... [2023-03-06 21:36:22,425][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000025173_25777152.pth [2023-03-06 21:36:22,777][62475] Updated weights for policy 0, policy_version 28160 (0.0006) [2023-03-06 21:36:23,605][62475] Updated weights for policy 0, policy_version 28170 (0.0006) [2023-03-06 21:36:24,428][62475] Updated weights for policy 0, policy_version 28180 (0.0006) [2023-03-06 21:36:25,217][62475] Updated weights for policy 0, policy_version 28190 (0.0006) [2023-03-06 21:36:26,018][62475] Updated weights for policy 0, policy_version 28200 (0.0006) [2023-03-06 21:36:26,831][62475] Updated weights for policy 0, policy_version 28210 (0.0007) [2023-03-06 21:36:27,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12718.4). Total num frames: 28893184. Throughput: 0: 12713.0. Samples: 28863887. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:36:27,390][62145] Avg episode reward: [(0, '777.630')] [2023-03-06 21:36:27,625][62475] Updated weights for policy 0, policy_version 28220 (0.0008) [2023-03-06 21:36:28,426][62475] Updated weights for policy 0, policy_version 28230 (0.0007) [2023-03-06 21:36:29,257][62475] Updated weights for policy 0, policy_version 28240 (0.0006) [2023-03-06 21:36:30,042][62475] Updated weights for policy 0, policy_version 28250 (0.0006) [2023-03-06 21:36:30,844][62475] Updated weights for policy 0, policy_version 28260 (0.0005) [2023-03-06 21:36:31,174][62424] KL-divergence is very high: 26958.9707 [2023-03-06 21:36:31,670][62475] Updated weights for policy 0, policy_version 28270 (0.0006) [2023-03-06 21:36:32,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12721.9). Total num frames: 28957696. Throughput: 0: 12713.8. Samples: 28940262. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:36:32,390][62145] Avg episode reward: [(0, '572.203')] [2023-03-06 21:36:32,476][62475] Updated weights for policy 0, policy_version 28280 (0.0006) [2023-03-06 21:36:33,263][62475] Updated weights for policy 0, policy_version 28290 (0.0006) [2023-03-06 21:36:34,077][62475] Updated weights for policy 0, policy_version 28300 (0.0005) [2023-03-06 21:36:34,889][62475] Updated weights for policy 0, policy_version 28310 (0.0006) [2023-03-06 21:36:35,698][62475] Updated weights for policy 0, policy_version 28320 (0.0006) [2023-03-06 21:36:36,509][62475] Updated weights for policy 0, policy_version 28330 (0.0007) [2023-03-06 21:36:37,309][62475] Updated weights for policy 0, policy_version 28340 (0.0006) [2023-03-06 21:36:37,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12714.7, 300 sec: 12721.9). Total num frames: 29021184. Throughput: 0: 12700.3. Samples: 29016205. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:36:37,390][62145] Avg episode reward: [(0, '589.525')] [2023-03-06 21:36:38,120][62475] Updated weights for policy 0, policy_version 28350 (0.0007) [2023-03-06 21:36:38,925][62475] Updated weights for policy 0, policy_version 28360 (0.0006) [2023-03-06 21:36:39,732][62475] Updated weights for policy 0, policy_version 28370 (0.0007) [2023-03-06 21:36:40,545][62475] Updated weights for policy 0, policy_version 28380 (0.0009) [2023-03-06 21:36:41,349][62475] Updated weights for policy 0, policy_version 28390 (0.0006) [2023-03-06 21:36:42,146][62475] Updated weights for policy 0, policy_version 28400 (0.0006) [2023-03-06 21:36:42,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12721.9). Total num frames: 29084672. Throughput: 0: 12698.7. Samples: 29054387. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:36:42,390][62145] Avg episode reward: [(0, '524.974')] [2023-03-06 21:36:42,952][62475] Updated weights for policy 0, policy_version 28410 (0.0006) [2023-03-06 21:36:43,743][62475] Updated weights for policy 0, policy_version 28420 (0.0007) [2023-03-06 21:36:44,556][62475] Updated weights for policy 0, policy_version 28430 (0.0006) [2023-03-06 21:36:45,337][62475] Updated weights for policy 0, policy_version 28440 (0.0007) [2023-03-06 21:36:46,146][62475] Updated weights for policy 0, policy_version 28450 (0.0006) [2023-03-06 21:36:46,959][62475] Updated weights for policy 0, policy_version 28460 (0.0006) [2023-03-06 21:36:47,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12721.9). Total num frames: 29148160. Throughput: 0: 12709.7. Samples: 29131032. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:36:47,390][62145] Avg episode reward: [(0, '628.199')] [2023-03-06 21:36:47,765][62475] Updated weights for policy 0, policy_version 28470 (0.0007) [2023-03-06 21:36:48,586][62475] Updated weights for policy 0, policy_version 28480 (0.0006) [2023-03-06 21:36:49,369][62475] Updated weights for policy 0, policy_version 28490 (0.0006) [2023-03-06 21:36:50,176][62475] Updated weights for policy 0, policy_version 28500 (0.0007) [2023-03-06 21:36:50,974][62475] Updated weights for policy 0, policy_version 28510 (0.0006) [2023-03-06 21:36:51,802][62475] Updated weights for policy 0, policy_version 28520 (0.0006) [2023-03-06 21:36:52,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12721.9). Total num frames: 29211648. Throughput: 0: 12714.6. Samples: 29207211. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:36:52,390][62145] Avg episode reward: [(0, '796.255')] [2023-03-06 21:36:52,592][62475] Updated weights for policy 0, policy_version 28530 (0.0006) [2023-03-06 21:36:53,400][62475] Updated weights for policy 0, policy_version 28540 (0.0007) [2023-03-06 21:36:54,193][62475] Updated weights for policy 0, policy_version 28550 (0.0006) [2023-03-06 21:36:55,017][62475] Updated weights for policy 0, policy_version 28560 (0.0006) [2023-03-06 21:36:55,822][62475] Updated weights for policy 0, policy_version 28570 (0.0006) [2023-03-06 21:36:56,631][62475] Updated weights for policy 0, policy_version 28580 (0.0006) [2023-03-06 21:36:57,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12721.9). Total num frames: 29275136. Throughput: 0: 12712.9. Samples: 29245191. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:36:57,390][62145] Avg episode reward: [(0, '862.608')] [2023-03-06 21:36:57,441][62475] Updated weights for policy 0, policy_version 28590 (0.0006) [2023-03-06 21:36:58,237][62475] Updated weights for policy 0, policy_version 28600 (0.0006) [2023-03-06 21:36:59,028][62475] Updated weights for policy 0, policy_version 28610 (0.0006) [2023-03-06 21:36:59,852][62475] Updated weights for policy 0, policy_version 28620 (0.0006) [2023-03-06 21:37:00,659][62475] Updated weights for policy 0, policy_version 28630 (0.0006) [2023-03-06 21:37:01,457][62475] Updated weights for policy 0, policy_version 28640 (0.0007) [2023-03-06 21:37:02,282][62475] Updated weights for policy 0, policy_version 28650 (0.0006) [2023-03-06 21:37:02,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12721.9). Total num frames: 29338624. Throughput: 0: 12718.7. Samples: 29321646. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:37:02,391][62145] Avg episode reward: [(0, '648.845')] [2023-03-06 21:37:03,072][62475] Updated weights for policy 0, policy_version 28660 (0.0006) [2023-03-06 21:37:03,893][62475] Updated weights for policy 0, policy_version 28670 (0.0006) [2023-03-06 21:37:04,701][62475] Updated weights for policy 0, policy_version 28680 (0.0006) [2023-03-06 21:37:05,492][62475] Updated weights for policy 0, policy_version 28690 (0.0006) [2023-03-06 21:37:06,300][62475] Updated weights for policy 0, policy_version 28700 (0.0006) [2023-03-06 21:37:07,114][62475] Updated weights for policy 0, policy_version 28710 (0.0006) [2023-03-06 21:37:07,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12718.4). Total num frames: 29402112. Throughput: 0: 12708.7. Samples: 29397779. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:37:07,390][62145] Avg episode reward: [(0, '793.439')] [2023-03-06 21:37:07,919][62475] Updated weights for policy 0, policy_version 28720 (0.0006) [2023-03-06 21:37:08,709][62475] Updated weights for policy 0, policy_version 28730 (0.0005) [2023-03-06 21:37:08,869][62424] KL-divergence is very high: 383.5457 [2023-03-06 21:37:09,524][62475] Updated weights for policy 0, policy_version 28740 (0.0007) [2023-03-06 21:37:10,343][62475] Updated weights for policy 0, policy_version 28750 (0.0006) [2023-03-06 21:37:11,142][62475] Updated weights for policy 0, policy_version 28760 (0.0006) [2023-03-06 21:37:11,946][62475] Updated weights for policy 0, policy_version 28770 (0.0006) [2023-03-06 21:37:12,389][62145] Fps is (10 sec: 12697.8, 60 sec: 12714.7, 300 sec: 12718.4). Total num frames: 29465600. Throughput: 0: 12710.2. Samples: 29435844. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:37:12,390][62145] Avg episode reward: [(0, '966.142')] [2023-03-06 21:37:12,758][62475] Updated weights for policy 0, policy_version 28780 (0.0007) [2023-03-06 21:37:13,551][62475] Updated weights for policy 0, policy_version 28790 (0.0006) [2023-03-06 21:37:14,351][62475] Updated weights for policy 0, policy_version 28800 (0.0006) [2023-03-06 21:37:15,153][62475] Updated weights for policy 0, policy_version 28810 (0.0006) [2023-03-06 21:37:15,968][62475] Updated weights for policy 0, policy_version 28820 (0.0008) [2023-03-06 21:37:16,770][62475] Updated weights for policy 0, policy_version 28830 (0.0006) [2023-03-06 21:37:17,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12718.4). Total num frames: 29529088. Throughput: 0: 12714.4. Samples: 29512410. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:37:17,390][62145] Avg episode reward: [(0, '831.088')] [2023-03-06 21:37:17,559][62475] Updated weights for policy 0, policy_version 28840 (0.0006) [2023-03-06 21:37:18,373][62475] Updated weights for policy 0, policy_version 28850 (0.0006) [2023-03-06 21:37:19,178][62475] Updated weights for policy 0, policy_version 28860 (0.0007) [2023-03-06 21:37:19,954][62475] Updated weights for policy 0, policy_version 28870 (0.0006) [2023-03-06 21:37:20,744][62475] Updated weights for policy 0, policy_version 28880 (0.0006) [2023-03-06 21:37:21,584][62475] Updated weights for policy 0, policy_version 28890 (0.0006) [2023-03-06 21:37:22,383][62475] Updated weights for policy 0, policy_version 28900 (0.0006) [2023-03-06 21:37:22,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12714.7, 300 sec: 12721.9). Total num frames: 29593600. Throughput: 0: 12725.6. Samples: 29588860. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:37:22,390][62145] Avg episode reward: [(0, '897.051')] [2023-03-06 21:37:23,158][62475] Updated weights for policy 0, policy_version 28910 (0.0006) [2023-03-06 21:37:23,981][62475] Updated weights for policy 0, policy_version 28920 (0.0006) [2023-03-06 21:37:24,761][62475] Updated weights for policy 0, policy_version 28930 (0.0007) [2023-03-06 21:37:25,554][62475] Updated weights for policy 0, policy_version 28940 (0.0007) [2023-03-06 21:37:26,355][62475] Updated weights for policy 0, policy_version 28950 (0.0007) [2023-03-06 21:37:27,173][62475] Updated weights for policy 0, policy_version 28960 (0.0007) [2023-03-06 21:37:27,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12731.7, 300 sec: 12721.9). Total num frames: 29657088. Throughput: 0: 12737.7. Samples: 29627581. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:37:27,390][62145] Avg episode reward: [(0, '820.055')] [2023-03-06 21:37:27,965][62475] Updated weights for policy 0, policy_version 28970 (0.0006) [2023-03-06 21:37:28,790][62475] Updated weights for policy 0, policy_version 28980 (0.0006) [2023-03-06 21:37:29,601][62475] Updated weights for policy 0, policy_version 28990 (0.0007) [2023-03-06 21:37:30,392][62475] Updated weights for policy 0, policy_version 29000 (0.0007) [2023-03-06 21:37:31,197][62475] Updated weights for policy 0, policy_version 29010 (0.0006) [2023-03-06 21:37:32,010][62475] Updated weights for policy 0, policy_version 29020 (0.0006) [2023-03-06 21:37:32,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12718.4). Total num frames: 29720576. Throughput: 0: 12726.1. Samples: 29703706. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:37:32,390][62145] Avg episode reward: [(0, '652.287')] [2023-03-06 21:37:32,822][62475] Updated weights for policy 0, policy_version 29030 (0.0006) [2023-03-06 21:37:33,653][62475] Updated weights for policy 0, policy_version 29040 (0.0006) [2023-03-06 21:37:34,457][62475] Updated weights for policy 0, policy_version 29050 (0.0006) [2023-03-06 21:37:35,274][62475] Updated weights for policy 0, policy_version 29060 (0.0006) [2023-03-06 21:37:36,093][62475] Updated weights for policy 0, policy_version 29070 (0.0006) [2023-03-06 21:37:36,893][62475] Updated weights for policy 0, policy_version 29080 (0.0006) [2023-03-06 21:37:37,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12718.4). Total num frames: 29784064. Throughput: 0: 12714.7. Samples: 29779373. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:37:37,390][62145] Avg episode reward: [(0, '749.063')] [2023-03-06 21:37:37,699][62475] Updated weights for policy 0, policy_version 29090 (0.0007) [2023-03-06 21:37:38,492][62475] Updated weights for policy 0, policy_version 29100 (0.0006) [2023-03-06 21:37:39,308][62475] Updated weights for policy 0, policy_version 29110 (0.0007) [2023-03-06 21:37:40,130][62475] Updated weights for policy 0, policy_version 29120 (0.0007) [2023-03-06 21:37:40,917][62475] Updated weights for policy 0, policy_version 29130 (0.0006) [2023-03-06 21:37:41,743][62475] Updated weights for policy 0, policy_version 29140 (0.0006) [2023-03-06 21:37:42,389][62145] Fps is (10 sec: 12697.8, 60 sec: 12714.7, 300 sec: 12718.4). Total num frames: 29847552. Throughput: 0: 12717.5. Samples: 29817477. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:37:42,390][62145] Avg episode reward: [(0, '845.240')] [2023-03-06 21:37:42,554][62475] Updated weights for policy 0, policy_version 29150 (0.0006) [2023-03-06 21:37:43,377][62475] Updated weights for policy 0, policy_version 29160 (0.0006) [2023-03-06 21:37:44,187][62475] Updated weights for policy 0, policy_version 29170 (0.0006) [2023-03-06 21:37:45,000][62475] Updated weights for policy 0, policy_version 29180 (0.0006) [2023-03-06 21:37:45,817][62475] Updated weights for policy 0, policy_version 29190 (0.0006) [2023-03-06 21:37:46,614][62475] Updated weights for policy 0, policy_version 29200 (0.0005) [2023-03-06 21:37:47,389][62145] Fps is (10 sec: 12595.2, 60 sec: 12697.6, 300 sec: 12711.5). Total num frames: 29910016. Throughput: 0: 12701.3. Samples: 29893203. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:37:47,390][62145] Avg episode reward: [(0, '939.277')] [2023-03-06 21:37:47,426][62475] Updated weights for policy 0, policy_version 29210 (0.0006) [2023-03-06 21:37:48,229][62475] Updated weights for policy 0, policy_version 29220 (0.0006) [2023-03-06 21:37:49,033][62475] Updated weights for policy 0, policy_version 29230 (0.0006) [2023-03-06 21:37:49,812][62475] Updated weights for policy 0, policy_version 29240 (0.0006) [2023-03-06 21:37:50,613][62475] Updated weights for policy 0, policy_version 29250 (0.0006) [2023-03-06 21:37:51,408][62475] Updated weights for policy 0, policy_version 29260 (0.0007) [2023-03-06 21:37:52,227][62475] Updated weights for policy 0, policy_version 29270 (0.0006) [2023-03-06 21:37:52,390][62145] Fps is (10 sec: 12595.0, 60 sec: 12697.6, 300 sec: 12711.5). Total num frames: 29973504. Throughput: 0: 12708.9. Samples: 29969681. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:37:52,390][62145] Avg episode reward: [(0, '777.808')] [2023-03-06 21:37:53,042][62475] Updated weights for policy 0, policy_version 29280 (0.0008) [2023-03-06 21:37:53,854][62475] Updated weights for policy 0, policy_version 29290 (0.0007) [2023-03-06 21:37:54,651][62475] Updated weights for policy 0, policy_version 29300 (0.0007) [2023-03-06 21:37:55,449][62475] Updated weights for policy 0, policy_version 29310 (0.0006) [2023-03-06 21:37:56,273][62475] Updated weights for policy 0, policy_version 29320 (0.0006) [2023-03-06 21:37:57,085][62475] Updated weights for policy 0, policy_version 29330 (0.0007) [2023-03-06 21:37:57,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12711.5). Total num frames: 30036992. Throughput: 0: 12708.9. Samples: 30007745. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:37:57,390][62145] Avg episode reward: [(0, '863.780')] [2023-03-06 21:37:57,866][62475] Updated weights for policy 0, policy_version 29340 (0.0006) [2023-03-06 21:37:58,695][62475] Updated weights for policy 0, policy_version 29350 (0.0006) [2023-03-06 21:37:59,496][62475] Updated weights for policy 0, policy_version 29360 (0.0006) [2023-03-06 21:38:00,288][62475] Updated weights for policy 0, policy_version 29370 (0.0006) [2023-03-06 21:38:01,078][62475] Updated weights for policy 0, policy_version 29380 (0.0006) [2023-03-06 21:38:01,906][62475] Updated weights for policy 0, policy_version 29390 (0.0006) [2023-03-06 21:38:02,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12714.7, 300 sec: 12715.0). Total num frames: 30101504. Throughput: 0: 12707.3. Samples: 30084240. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:38:02,390][62145] Avg episode reward: [(0, '730.609')] [2023-03-06 21:38:02,714][62475] Updated weights for policy 0, policy_version 29400 (0.0006) [2023-03-06 21:38:03,508][62475] Updated weights for policy 0, policy_version 29410 (0.0006) [2023-03-06 21:38:04,311][62475] Updated weights for policy 0, policy_version 29420 (0.0006) [2023-03-06 21:38:05,112][62475] Updated weights for policy 0, policy_version 29430 (0.0006) [2023-03-06 21:38:05,910][62475] Updated weights for policy 0, policy_version 29440 (0.0006) [2023-03-06 21:38:06,719][62475] Updated weights for policy 0, policy_version 29450 (0.0006) [2023-03-06 21:38:07,390][62145] Fps is (10 sec: 12800.1, 60 sec: 12714.7, 300 sec: 12715.0). Total num frames: 30164992. Throughput: 0: 12700.8. Samples: 30160393. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:38:07,390][62145] Avg episode reward: [(0, '798.437')] [2023-03-06 21:38:07,550][62475] Updated weights for policy 0, policy_version 29460 (0.0006) [2023-03-06 21:38:08,353][62475] Updated weights for policy 0, policy_version 29470 (0.0007) [2023-03-06 21:38:09,146][62475] Updated weights for policy 0, policy_version 29480 (0.0007) [2023-03-06 21:38:09,939][62475] Updated weights for policy 0, policy_version 29490 (0.0006) [2023-03-06 21:38:10,735][62475] Updated weights for policy 0, policy_version 29500 (0.0007) [2023-03-06 21:38:11,538][62475] Updated weights for policy 0, policy_version 29510 (0.0006) [2023-03-06 21:38:12,359][62475] Updated weights for policy 0, policy_version 29520 (0.0006) [2023-03-06 21:38:12,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12715.0). Total num frames: 30228480. Throughput: 0: 12693.3. Samples: 30198781. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:38:12,390][62145] Avg episode reward: [(0, '876.865')] [2023-03-06 21:38:13,133][62475] Updated weights for policy 0, policy_version 29530 (0.0007) [2023-03-06 21:38:13,955][62475] Updated weights for policy 0, policy_version 29540 (0.0006) [2023-03-06 21:38:14,757][62475] Updated weights for policy 0, policy_version 29550 (0.0006) [2023-03-06 21:38:15,540][62475] Updated weights for policy 0, policy_version 29560 (0.0006) [2023-03-06 21:38:16,368][62475] Updated weights for policy 0, policy_version 29570 (0.0007) [2023-03-06 21:38:17,182][62475] Updated weights for policy 0, policy_version 29580 (0.0006) [2023-03-06 21:38:17,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12715.0). Total num frames: 30291968. Throughput: 0: 12696.6. Samples: 30275050. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:38:17,390][62145] Avg episode reward: [(0, '934.272')] [2023-03-06 21:38:17,982][62475] Updated weights for policy 0, policy_version 29590 (0.0006) [2023-03-06 21:38:18,800][62475] Updated weights for policy 0, policy_version 29600 (0.0006) [2023-03-06 21:38:19,594][62475] Updated weights for policy 0, policy_version 29610 (0.0006) [2023-03-06 21:38:20,393][62475] Updated weights for policy 0, policy_version 29620 (0.0007) [2023-03-06 21:38:21,208][62475] Updated weights for policy 0, policy_version 29630 (0.0007) [2023-03-06 21:38:22,023][62475] Updated weights for policy 0, policy_version 29640 (0.0006) [2023-03-06 21:38:22,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12715.0). Total num frames: 30355456. Throughput: 0: 12705.6. Samples: 30351127. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:38:22,390][62145] Avg episode reward: [(0, '919.630')] [2023-03-06 21:38:22,393][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000029644_30355456.pth... [2023-03-06 21:38:22,425][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000026665_27304960.pth [2023-03-06 21:38:22,838][62475] Updated weights for policy 0, policy_version 29650 (0.0006) [2023-03-06 21:38:23,637][62475] Updated weights for policy 0, policy_version 29660 (0.0006) [2023-03-06 21:38:24,435][62475] Updated weights for policy 0, policy_version 29670 (0.0006) [2023-03-06 21:38:25,217][62475] Updated weights for policy 0, policy_version 29680 (0.0006) [2023-03-06 21:38:26,032][62475] Updated weights for policy 0, policy_version 29690 (0.0006) [2023-03-06 21:38:26,838][62475] Updated weights for policy 0, policy_version 29700 (0.0007) [2023-03-06 21:38:27,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12715.0). Total num frames: 30418944. Throughput: 0: 12711.8. Samples: 30389508. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:38:27,390][62145] Avg episode reward: [(0, '1030.726')] [2023-03-06 21:38:27,648][62475] Updated weights for policy 0, policy_version 29710 (0.0007) [2023-03-06 21:38:28,445][62475] Updated weights for policy 0, policy_version 29720 (0.0006) [2023-03-06 21:38:29,267][62475] Updated weights for policy 0, policy_version 29730 (0.0006) [2023-03-06 21:38:30,070][62475] Updated weights for policy 0, policy_version 29740 (0.0006) [2023-03-06 21:38:30,869][62475] Updated weights for policy 0, policy_version 29750 (0.0006) [2023-03-06 21:38:31,665][62475] Updated weights for policy 0, policy_version 29760 (0.0006) [2023-03-06 21:38:32,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12715.0). Total num frames: 30482432. Throughput: 0: 12722.6. Samples: 30465722. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:38:32,390][62145] Avg episode reward: [(0, '880.097')] [2023-03-06 21:38:32,471][62475] Updated weights for policy 0, policy_version 29770 (0.0006) [2023-03-06 21:38:33,301][62475] Updated weights for policy 0, policy_version 29780 (0.0006) [2023-03-06 21:38:34,105][62475] Updated weights for policy 0, policy_version 29790 (0.0006) [2023-03-06 21:38:34,904][62475] Updated weights for policy 0, policy_version 29800 (0.0007) [2023-03-06 21:38:35,731][62475] Updated weights for policy 0, policy_version 29810 (0.0007) [2023-03-06 21:38:36,522][62475] Updated weights for policy 0, policy_version 29820 (0.0007) [2023-03-06 21:38:37,350][62475] Updated weights for policy 0, policy_version 29830 (0.0007) [2023-03-06 21:38:37,390][62145] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12715.0). Total num frames: 30545920. Throughput: 0: 12713.0. Samples: 30541764. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:38:37,390][62145] Avg episode reward: [(0, '928.120')] [2023-03-06 21:38:38,158][62475] Updated weights for policy 0, policy_version 29840 (0.0007) [2023-03-06 21:38:38,943][62475] Updated weights for policy 0, policy_version 29850 (0.0006) [2023-03-06 21:38:39,791][62475] Updated weights for policy 0, policy_version 29860 (0.0006) [2023-03-06 21:38:40,576][62475] Updated weights for policy 0, policy_version 29870 (0.0006) [2023-03-06 21:38:41,375][62475] Updated weights for policy 0, policy_version 29880 (0.0006) [2023-03-06 21:38:42,203][62475] Updated weights for policy 0, policy_version 29890 (0.0006) [2023-03-06 21:38:42,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12715.0). Total num frames: 30609408. Throughput: 0: 12709.5. Samples: 30579673. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:38:42,390][62145] Avg episode reward: [(0, '1099.344')] [2023-03-06 21:38:43,000][62475] Updated weights for policy 0, policy_version 29900 (0.0006) [2023-03-06 21:38:43,808][62475] Updated weights for policy 0, policy_version 29910 (0.0007) [2023-03-06 21:38:44,613][62475] Updated weights for policy 0, policy_version 29920 (0.0007) [2023-03-06 21:38:45,412][62475] Updated weights for policy 0, policy_version 29930 (0.0007) [2023-03-06 21:38:46,233][62475] Updated weights for policy 0, policy_version 29940 (0.0006) [2023-03-06 21:38:47,042][62475] Updated weights for policy 0, policy_version 29950 (0.0006) [2023-03-06 21:38:47,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12711.5). Total num frames: 30672896. Throughput: 0: 12706.2. Samples: 30656016. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:38:47,390][62145] Avg episode reward: [(0, '949.435')] [2023-03-06 21:38:47,825][62475] Updated weights for policy 0, policy_version 29960 (0.0007) [2023-03-06 21:38:48,631][62475] Updated weights for policy 0, policy_version 29970 (0.0007) [2023-03-06 21:38:49,430][62475] Updated weights for policy 0, policy_version 29980 (0.0006) [2023-03-06 21:38:50,242][62475] Updated weights for policy 0, policy_version 29990 (0.0006) [2023-03-06 21:38:51,051][62475] Updated weights for policy 0, policy_version 30000 (0.0006) [2023-03-06 21:38:51,584][62424] KL-divergence is very high: 288.6699 [2023-03-06 21:38:51,859][62475] Updated weights for policy 0, policy_version 30010 (0.0006) [2023-03-06 21:38:52,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.6, 300 sec: 12711.5). Total num frames: 30736384. Throughput: 0: 12709.0. Samples: 30732300. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:38:52,390][62145] Avg episode reward: [(0, '749.918')] [2023-03-06 21:38:52,680][62475] Updated weights for policy 0, policy_version 30020 (0.0006) [2023-03-06 21:38:53,466][62475] Updated weights for policy 0, policy_version 30030 (0.0006) [2023-03-06 21:38:54,282][62475] Updated weights for policy 0, policy_version 30040 (0.0006) [2023-03-06 21:38:55,067][62475] Updated weights for policy 0, policy_version 30050 (0.0007) [2023-03-06 21:38:55,882][62475] Updated weights for policy 0, policy_version 30060 (0.0007) [2023-03-06 21:38:56,683][62475] Updated weights for policy 0, policy_version 30070 (0.0006) [2023-03-06 21:38:57,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12711.5). Total num frames: 30799872. Throughput: 0: 12700.3. Samples: 30770295. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:38:57,390][62145] Avg episode reward: [(0, '705.371')] [2023-03-06 21:38:57,490][62475] Updated weights for policy 0, policy_version 30080 (0.0006) [2023-03-06 21:38:58,292][62475] Updated weights for policy 0, policy_version 30090 (0.0006) [2023-03-06 21:38:59,113][62475] Updated weights for policy 0, policy_version 30100 (0.0006) [2023-03-06 21:38:59,910][62475] Updated weights for policy 0, policy_version 30110 (0.0007) [2023-03-06 21:39:00,719][62475] Updated weights for policy 0, policy_version 30120 (0.0007) [2023-03-06 21:39:01,518][62475] Updated weights for policy 0, policy_version 30130 (0.0006) [2023-03-06 21:39:02,314][62475] Updated weights for policy 0, policy_version 30140 (0.0006) [2023-03-06 21:39:02,390][62145] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12711.5). Total num frames: 30863360. Throughput: 0: 12705.1. Samples: 30846779. Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) [2023-03-06 21:39:02,390][62145] Avg episode reward: [(0, '856.962')] [2023-03-06 21:39:03,131][62475] Updated weights for policy 0, policy_version 30150 (0.0006) [2023-03-06 21:39:03,926][62475] Updated weights for policy 0, policy_version 30160 (0.0006) [2023-03-06 21:39:04,730][62475] Updated weights for policy 0, policy_version 30170 (0.0006) [2023-03-06 21:39:05,533][62475] Updated weights for policy 0, policy_version 30180 (0.0006) [2023-03-06 21:39:06,155][62424] KL-divergence is very high: 2765.0476 [2023-03-06 21:39:06,338][62475] Updated weights for policy 0, policy_version 30190 (0.0006) [2023-03-06 21:39:07,158][62475] Updated weights for policy 0, policy_version 30200 (0.0007) [2023-03-06 21:39:07,389][62145] Fps is (10 sec: 12800.2, 60 sec: 12714.7, 300 sec: 12715.0). Total num frames: 30927872. Throughput: 0: 12709.9. Samples: 30923071. Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) [2023-03-06 21:39:07,390][62145] Avg episode reward: [(0, '697.513')] [2023-03-06 21:39:07,953][62475] Updated weights for policy 0, policy_version 30210 (0.0006) [2023-03-06 21:39:08,762][62475] Updated weights for policy 0, policy_version 30220 (0.0006) [2023-03-06 21:39:09,567][62475] Updated weights for policy 0, policy_version 30230 (0.0008) [2023-03-06 21:39:10,362][62475] Updated weights for policy 0, policy_version 30240 (0.0007) [2023-03-06 21:39:11,193][62475] Updated weights for policy 0, policy_version 30250 (0.0006) [2023-03-06 21:39:11,997][62475] Updated weights for policy 0, policy_version 30260 (0.0007) [2023-03-06 21:39:12,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12711.5). Total num frames: 30990336. Throughput: 0: 12702.1. Samples: 30961100. Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) [2023-03-06 21:39:12,390][62145] Avg episode reward: [(0, '834.163')] [2023-03-06 21:39:12,810][62475] Updated weights for policy 0, policy_version 30270 (0.0007) [2023-03-06 21:39:13,615][62475] Updated weights for policy 0, policy_version 30280 (0.0006) [2023-03-06 21:39:14,425][62475] Updated weights for policy 0, policy_version 30290 (0.0006) [2023-03-06 21:39:15,230][62475] Updated weights for policy 0, policy_version 30300 (0.0006) [2023-03-06 21:39:15,299][62424] KL-divergence is very high: 5035.8350 [2023-03-06 21:39:16,030][62475] Updated weights for policy 0, policy_version 30310 (0.0006) [2023-03-06 21:39:16,834][62475] Updated weights for policy 0, policy_version 30320 (0.0006) [2023-03-06 21:39:17,389][62145] Fps is (10 sec: 12595.2, 60 sec: 12697.6, 300 sec: 12711.5). Total num frames: 31053824. Throughput: 0: 12701.2. Samples: 31037276. Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) [2023-03-06 21:39:17,390][62145] Avg episode reward: [(0, '921.291')] [2023-03-06 21:39:17,643][62475] Updated weights for policy 0, policy_version 30330 (0.0006) [2023-03-06 21:39:18,446][62475] Updated weights for policy 0, policy_version 30340 (0.0006) [2023-03-06 21:39:19,241][62475] Updated weights for policy 0, policy_version 30350 (0.0007) [2023-03-06 21:39:20,050][62475] Updated weights for policy 0, policy_version 30360 (0.0006) [2023-03-06 21:39:20,848][62475] Updated weights for policy 0, policy_version 30370 (0.0006) [2023-03-06 21:39:21,636][62475] Updated weights for policy 0, policy_version 30380 (0.0006) [2023-03-06 21:39:22,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12714.7, 300 sec: 12715.0). Total num frames: 31118336. Throughput: 0: 12713.7. Samples: 31113881. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:39:22,390][62145] Avg episode reward: [(0, '796.391')] [2023-03-06 21:39:22,430][62475] Updated weights for policy 0, policy_version 30390 (0.0006) [2023-03-06 21:39:23,238][62475] Updated weights for policy 0, policy_version 30400 (0.0006) [2023-03-06 21:39:24,030][62475] Updated weights for policy 0, policy_version 30410 (0.0007) [2023-03-06 21:39:24,841][62475] Updated weights for policy 0, policy_version 30420 (0.0006) [2023-03-06 21:39:25,645][62475] Updated weights for policy 0, policy_version 30430 (0.0006) [2023-03-06 21:39:26,431][62475] Updated weights for policy 0, policy_version 30440 (0.0006) [2023-03-06 21:39:27,259][62475] Updated weights for policy 0, policy_version 30450 (0.0006) [2023-03-06 21:39:27,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12714.7, 300 sec: 12711.5). Total num frames: 31181824. Throughput: 0: 12723.4. Samples: 31152224. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:39:27,390][62145] Avg episode reward: [(0, '770.520')] [2023-03-06 21:39:28,058][62475] Updated weights for policy 0, policy_version 30460 (0.0006) [2023-03-06 21:39:28,864][62475] Updated weights for policy 0, policy_version 30470 (0.0006) [2023-03-06 21:39:29,658][62475] Updated weights for policy 0, policy_version 30480 (0.0006) [2023-03-06 21:39:30,471][62475] Updated weights for policy 0, policy_version 30490 (0.0006) [2023-03-06 21:39:31,261][62475] Updated weights for policy 0, policy_version 30500 (0.0006) [2023-03-06 21:39:32,075][62475] Updated weights for policy 0, policy_version 30510 (0.0006) [2023-03-06 21:39:32,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12731.7, 300 sec: 12715.0). Total num frames: 31246336. Throughput: 0: 12725.9. Samples: 31228680. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:39:32,390][62145] Avg episode reward: [(0, '997.017')] [2023-03-06 21:39:32,887][62475] Updated weights for policy 0, policy_version 30520 (0.0006) [2023-03-06 21:39:33,666][62475] Updated weights for policy 0, policy_version 30530 (0.0005) [2023-03-06 21:39:34,307][62424] KL-divergence is very high: 192.7584 [2023-03-06 21:39:34,477][62475] Updated weights for policy 0, policy_version 30540 (0.0006) [2023-03-06 21:39:35,290][62475] Updated weights for policy 0, policy_version 30550 (0.0007) [2023-03-06 21:39:36,101][62475] Updated weights for policy 0, policy_version 30560 (0.0006) [2023-03-06 21:39:36,882][62475] Updated weights for policy 0, policy_version 30570 (0.0006) [2023-03-06 21:39:37,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12731.7, 300 sec: 12715.0). Total num frames: 31309824. Throughput: 0: 12731.4. Samples: 31305211. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:39:37,390][62145] Avg episode reward: [(0, '892.964')] [2023-03-06 21:39:37,694][62475] Updated weights for policy 0, policy_version 30580 (0.0006) [2023-03-06 21:39:38,489][62475] Updated weights for policy 0, policy_version 30590 (0.0007) [2023-03-06 21:39:39,299][62475] Updated weights for policy 0, policy_version 30600 (0.0006) [2023-03-06 21:39:40,121][62475] Updated weights for policy 0, policy_version 30610 (0.0006) [2023-03-06 21:39:40,908][62475] Updated weights for policy 0, policy_version 30620 (0.0006) [2023-03-06 21:39:41,720][62475] Updated weights for policy 0, policy_version 30630 (0.0006) [2023-03-06 21:39:42,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12715.0). Total num frames: 31373312. Throughput: 0: 12734.4. Samples: 31343343. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:39:42,390][62145] Avg episode reward: [(0, '968.832')] [2023-03-06 21:39:42,503][62475] Updated weights for policy 0, policy_version 30640 (0.0006) [2023-03-06 21:39:43,305][62475] Updated weights for policy 0, policy_version 30650 (0.0006) [2023-03-06 21:39:44,113][62475] Updated weights for policy 0, policy_version 30660 (0.0006) [2023-03-06 21:39:44,918][62475] Updated weights for policy 0, policy_version 30670 (0.0007) [2023-03-06 21:39:45,727][62475] Updated weights for policy 0, policy_version 30680 (0.0006) [2023-03-06 21:39:46,520][62475] Updated weights for policy 0, policy_version 30690 (0.0006) [2023-03-06 21:39:47,321][62475] Updated weights for policy 0, policy_version 30700 (0.0006) [2023-03-06 21:39:47,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12731.7, 300 sec: 12715.0). Total num frames: 31436800. Throughput: 0: 12734.7. Samples: 31419841. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:39:47,390][62145] Avg episode reward: [(0, '959.466')] [2023-03-06 21:39:48,134][62475] Updated weights for policy 0, policy_version 30710 (0.0007) [2023-03-06 21:39:48,932][62475] Updated weights for policy 0, policy_version 30720 (0.0006) [2023-03-06 21:39:49,735][62475] Updated weights for policy 0, policy_version 30730 (0.0006) [2023-03-06 21:39:50,532][62475] Updated weights for policy 0, policy_version 30740 (0.0006) [2023-03-06 21:39:51,329][62475] Updated weights for policy 0, policy_version 30750 (0.0006) [2023-03-06 21:39:52,135][62475] Updated weights for policy 0, policy_version 30760 (0.0006) [2023-03-06 21:39:52,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12718.4). Total num frames: 31501312. Throughput: 0: 12747.1. Samples: 31496690. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:39:52,390][62145] Avg episode reward: [(0, '897.427')] [2023-03-06 21:39:52,918][62475] Updated weights for policy 0, policy_version 30770 (0.0007) [2023-03-06 21:39:53,732][62475] Updated weights for policy 0, policy_version 30780 (0.0006) [2023-03-06 21:39:54,532][62475] Updated weights for policy 0, policy_version 30790 (0.0006) [2023-03-06 21:39:55,326][62475] Updated weights for policy 0, policy_version 30800 (0.0006) [2023-03-06 21:39:56,157][62475] Updated weights for policy 0, policy_version 30810 (0.0006) [2023-03-06 21:39:56,938][62475] Updated weights for policy 0, policy_version 30820 (0.0006) [2023-03-06 21:39:57,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12748.8, 300 sec: 12718.4). Total num frames: 31564800. Throughput: 0: 12754.7. Samples: 31535063. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:39:57,390][62145] Avg episode reward: [(0, '724.219')] [2023-03-06 21:39:57,729][62475] Updated weights for policy 0, policy_version 30830 (0.0007) [2023-03-06 21:39:58,555][62475] Updated weights for policy 0, policy_version 30840 (0.0007) [2023-03-06 21:39:59,357][62475] Updated weights for policy 0, policy_version 30850 (0.0006) [2023-03-06 21:40:00,150][62475] Updated weights for policy 0, policy_version 30860 (0.0006) [2023-03-06 21:40:00,970][62475] Updated weights for policy 0, policy_version 30870 (0.0006) [2023-03-06 21:40:01,783][62475] Updated weights for policy 0, policy_version 30880 (0.0007) [2023-03-06 21:40:02,390][62145] Fps is (10 sec: 12697.4, 60 sec: 12748.8, 300 sec: 12718.4). Total num frames: 31628288. Throughput: 0: 12755.5. Samples: 31611277. Policy #0 lag: (min: 0.0, avg: 1.2, max: 4.0) [2023-03-06 21:40:02,390][62145] Avg episode reward: [(0, '672.624')] [2023-03-06 21:40:02,580][62475] Updated weights for policy 0, policy_version 30890 (0.0006) [2023-03-06 21:40:03,387][62475] Updated weights for policy 0, policy_version 30900 (0.0006) [2023-03-06 21:40:04,183][62475] Updated weights for policy 0, policy_version 30910 (0.0006) [2023-03-06 21:40:04,996][62475] Updated weights for policy 0, policy_version 30920 (0.0007) [2023-03-06 21:40:05,783][62475] Updated weights for policy 0, policy_version 30930 (0.0006) [2023-03-06 21:40:06,602][62475] Updated weights for policy 0, policy_version 30940 (0.0006) [2023-03-06 21:40:07,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12731.7, 300 sec: 12715.0). Total num frames: 31691776. Throughput: 0: 12749.4. Samples: 31687601. Policy #0 lag: (min: 0.0, avg: 1.2, max: 4.0) [2023-03-06 21:40:07,390][62145] Avg episode reward: [(0, '414.582')] [2023-03-06 21:40:07,413][62475] Updated weights for policy 0, policy_version 30950 (0.0006) [2023-03-06 21:40:08,212][62475] Updated weights for policy 0, policy_version 30960 (0.0006) [2023-03-06 21:40:09,048][62475] Updated weights for policy 0, policy_version 30970 (0.0006) [2023-03-06 21:40:09,514][62424] KL-divergence is very high: 765.9384 [2023-03-06 21:40:09,848][62475] Updated weights for policy 0, policy_version 30980 (0.0007) [2023-03-06 21:40:10,632][62475] Updated weights for policy 0, policy_version 30990 (0.0006) [2023-03-06 21:40:11,431][62475] Updated weights for policy 0, policy_version 31000 (0.0007) [2023-03-06 21:40:12,225][62475] Updated weights for policy 0, policy_version 31010 (0.0007) [2023-03-06 21:40:12,390][62145] Fps is (10 sec: 12697.7, 60 sec: 12748.8, 300 sec: 12715.0). Total num frames: 31755264. Throughput: 0: 12744.2. Samples: 31725715. Policy #0 lag: (min: 0.0, avg: 1.2, max: 4.0) [2023-03-06 21:40:12,390][62145] Avg episode reward: [(0, '387.484')] [2023-03-06 21:40:13,041][62475] Updated weights for policy 0, policy_version 31020 (0.0006) [2023-03-06 21:40:13,834][62475] Updated weights for policy 0, policy_version 31030 (0.0006) [2023-03-06 21:40:14,637][62475] Updated weights for policy 0, policy_version 31040 (0.0006) [2023-03-06 21:40:15,437][62475] Updated weights for policy 0, policy_version 31050 (0.0006) [2023-03-06 21:40:16,232][62475] Updated weights for policy 0, policy_version 31060 (0.0007) [2023-03-06 21:40:17,033][62475] Updated weights for policy 0, policy_version 31070 (0.0006) [2023-03-06 21:40:17,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12765.8, 300 sec: 12718.4). Total num frames: 31819776. Throughput: 0: 12750.4. Samples: 31802447. Policy #0 lag: (min: 0.0, avg: 1.2, max: 4.0) [2023-03-06 21:40:17,390][62145] Avg episode reward: [(0, '780.954')] [2023-03-06 21:40:17,863][62475] Updated weights for policy 0, policy_version 31080 (0.0006) [2023-03-06 21:40:18,652][62475] Updated weights for policy 0, policy_version 31090 (0.0007) [2023-03-06 21:40:19,451][62475] Updated weights for policy 0, policy_version 31100 (0.0006) [2023-03-06 21:40:20,264][62475] Updated weights for policy 0, policy_version 31110 (0.0006) [2023-03-06 21:40:21,083][62475] Updated weights for policy 0, policy_version 31120 (0.0006) [2023-03-06 21:40:21,874][62475] Updated weights for policy 0, policy_version 31130 (0.0006) [2023-03-06 21:40:22,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12718.4). Total num frames: 31883264. Throughput: 0: 12744.6. Samples: 31878717. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:40:22,390][62145] Avg episode reward: [(0, '686.432')] [2023-03-06 21:40:22,394][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000031136_31883264.pth... [2023-03-06 21:40:22,427][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000028155_28830720.pth [2023-03-06 21:40:22,682][62475] Updated weights for policy 0, policy_version 31140 (0.0007) [2023-03-06 21:40:23,497][62475] Updated weights for policy 0, policy_version 31150 (0.0006) [2023-03-06 21:40:24,279][62475] Updated weights for policy 0, policy_version 31160 (0.0006) [2023-03-06 21:40:25,080][62475] Updated weights for policy 0, policy_version 31170 (0.0006) [2023-03-06 21:40:25,875][62475] Updated weights for policy 0, policy_version 31180 (0.0006) [2023-03-06 21:40:26,687][62475] Updated weights for policy 0, policy_version 31190 (0.0006) [2023-03-06 21:40:27,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12748.8, 300 sec: 12721.9). Total num frames: 31946752. Throughput: 0: 12751.4. Samples: 31917154. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:40:27,390][62145] Avg episode reward: [(0, '754.093')] [2023-03-06 21:40:27,494][62475] Updated weights for policy 0, policy_version 31200 (0.0006) [2023-03-06 21:40:28,294][62475] Updated weights for policy 0, policy_version 31210 (0.0007) [2023-03-06 21:40:29,111][62475] Updated weights for policy 0, policy_version 31220 (0.0006) [2023-03-06 21:40:29,937][62475] Updated weights for policy 0, policy_version 31230 (0.0006) [2023-03-06 21:40:30,743][62475] Updated weights for policy 0, policy_version 31240 (0.0006) [2023-03-06 21:40:31,537][62475] Updated weights for policy 0, policy_version 31250 (0.0006) [2023-03-06 21:40:32,346][62475] Updated weights for policy 0, policy_version 31260 (0.0006) [2023-03-06 21:40:32,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12718.4). Total num frames: 32010240. Throughput: 0: 12742.8. Samples: 31993269. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:40:32,401][62145] Avg episode reward: [(0, '845.693')] [2023-03-06 21:40:33,158][62475] Updated weights for policy 0, policy_version 31270 (0.0007) [2023-03-06 21:40:33,965][62475] Updated weights for policy 0, policy_version 31280 (0.0006) [2023-03-06 21:40:34,752][62475] Updated weights for policy 0, policy_version 31290 (0.0007) [2023-03-06 21:40:35,550][62475] Updated weights for policy 0, policy_version 31300 (0.0006) [2023-03-06 21:40:36,363][62475] Updated weights for policy 0, policy_version 31310 (0.0007) [2023-03-06 21:40:37,167][62475] Updated weights for policy 0, policy_version 31320 (0.0007) [2023-03-06 21:40:37,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12718.4). Total num frames: 32073728. Throughput: 0: 12734.4. Samples: 32069741. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:40:37,401][62145] Avg episode reward: [(0, '590.565')] [2023-03-06 21:40:37,973][62475] Updated weights for policy 0, policy_version 31330 (0.0006) [2023-03-06 21:40:38,783][62475] Updated weights for policy 0, policy_version 31340 (0.0006) [2023-03-06 21:40:39,575][62475] Updated weights for policy 0, policy_version 31350 (0.0006) [2023-03-06 21:40:40,350][62475] Updated weights for policy 0, policy_version 31360 (0.0006) [2023-03-06 21:40:41,188][62475] Updated weights for policy 0, policy_version 31370 (0.0006) [2023-03-06 21:40:41,970][62475] Updated weights for policy 0, policy_version 31380 (0.0006) [2023-03-06 21:40:42,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12718.4). Total num frames: 32137216. Throughput: 0: 12732.1. Samples: 32108010. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:40:42,401][62145] Avg episode reward: [(0, '571.752')] [2023-03-06 21:40:42,772][62475] Updated weights for policy 0, policy_version 31390 (0.0006) [2023-03-06 21:40:43,596][62475] Updated weights for policy 0, policy_version 31400 (0.0006) [2023-03-06 21:40:44,383][62475] Updated weights for policy 0, policy_version 31410 (0.0006) [2023-03-06 21:40:45,186][62475] Updated weights for policy 0, policy_version 31420 (0.0006) [2023-03-06 21:40:45,977][62475] Updated weights for policy 0, policy_version 31430 (0.0006) [2023-03-06 21:40:46,787][62475] Updated weights for policy 0, policy_version 31440 (0.0006) [2023-03-06 21:40:47,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12748.8, 300 sec: 12721.9). Total num frames: 32201728. Throughput: 0: 12741.1. Samples: 32184624. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:40:47,401][62145] Avg episode reward: [(0, '603.553')] [2023-03-06 21:40:47,578][62475] Updated weights for policy 0, policy_version 31450 (0.0006) [2023-03-06 21:40:48,394][62475] Updated weights for policy 0, policy_version 31460 (0.0006) [2023-03-06 21:40:49,191][62475] Updated weights for policy 0, policy_version 31470 (0.0007) [2023-03-06 21:40:49,999][62475] Updated weights for policy 0, policy_version 31480 (0.0006) [2023-03-06 21:40:50,806][62475] Updated weights for policy 0, policy_version 31490 (0.0006) [2023-03-06 21:40:51,602][62475] Updated weights for policy 0, policy_version 31500 (0.0006) [2023-03-06 21:40:52,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12731.7, 300 sec: 12721.9). Total num frames: 32265216. Throughput: 0: 12747.8. Samples: 32261253. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:40:52,390][62145] Avg episode reward: [(0, '540.536')] [2023-03-06 21:40:52,400][62475] Updated weights for policy 0, policy_version 31510 (0.0007) [2023-03-06 21:40:53,204][62475] Updated weights for policy 0, policy_version 31520 (0.0006) [2023-03-06 21:40:53,994][62475] Updated weights for policy 0, policy_version 31530 (0.0007) [2023-03-06 21:40:54,819][62475] Updated weights for policy 0, policy_version 31540 (0.0006) [2023-03-06 21:40:55,622][62475] Updated weights for policy 0, policy_version 31550 (0.0007) [2023-03-06 21:40:56,444][62475] Updated weights for policy 0, policy_version 31560 (0.0006) [2023-03-06 21:40:57,243][62475] Updated weights for policy 0, policy_version 31570 (0.0006) [2023-03-06 21:40:57,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12721.9). Total num frames: 32328704. Throughput: 0: 12750.4. Samples: 32299483. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:40:57,390][62145] Avg episode reward: [(0, '517.553')] [2023-03-06 21:40:58,054][62475] Updated weights for policy 0, policy_version 31580 (0.0007) [2023-03-06 21:40:58,857][62475] Updated weights for policy 0, policy_version 31590 (0.0006) [2023-03-06 21:40:59,667][62475] Updated weights for policy 0, policy_version 31600 (0.0006) [2023-03-06 21:41:00,469][62475] Updated weights for policy 0, policy_version 31610 (0.0006) [2023-03-06 21:41:01,259][62475] Updated weights for policy 0, policy_version 31620 (0.0006) [2023-03-06 21:41:02,084][62475] Updated weights for policy 0, policy_version 31630 (0.0006) [2023-03-06 21:41:02,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12725.4). Total num frames: 32393216. Throughput: 0: 12736.2. Samples: 32375575. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:41:02,390][62145] Avg episode reward: [(0, '487.207')] [2023-03-06 21:41:02,890][62475] Updated weights for policy 0, policy_version 31640 (0.0007) [2023-03-06 21:41:03,701][62475] Updated weights for policy 0, policy_version 31650 (0.0007) [2023-03-06 21:41:04,508][62475] Updated weights for policy 0, policy_version 31660 (0.0006) [2023-03-06 21:41:05,346][62475] Updated weights for policy 0, policy_version 31670 (0.0006) [2023-03-06 21:41:06,125][62475] Updated weights for policy 0, policy_version 31680 (0.0007) [2023-03-06 21:41:06,949][62475] Updated weights for policy 0, policy_version 31690 (0.0006) [2023-03-06 21:41:07,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12721.9). Total num frames: 32455680. Throughput: 0: 12729.4. Samples: 32451539. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:41:07,390][62145] Avg episode reward: [(0, '477.913')] [2023-03-06 21:41:07,750][62475] Updated weights for policy 0, policy_version 31700 (0.0006) [2023-03-06 21:41:08,550][62475] Updated weights for policy 0, policy_version 31710 (0.0007) [2023-03-06 21:41:09,366][62475] Updated weights for policy 0, policy_version 31720 (0.0006) [2023-03-06 21:41:10,155][62475] Updated weights for policy 0, policy_version 31730 (0.0006) [2023-03-06 21:41:10,963][62475] Updated weights for policy 0, policy_version 31740 (0.0007) [2023-03-06 21:41:11,772][62475] Updated weights for policy 0, policy_version 31750 (0.0006) [2023-03-06 21:41:12,389][62145] Fps is (10 sec: 12595.2, 60 sec: 12731.7, 300 sec: 12721.9). Total num frames: 32519168. Throughput: 0: 12725.4. Samples: 32489799. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:41:12,390][62145] Avg episode reward: [(0, '403.611')] [2023-03-06 21:41:12,564][62475] Updated weights for policy 0, policy_version 31760 (0.0006) [2023-03-06 21:41:13,355][62475] Updated weights for policy 0, policy_version 31770 (0.0006) [2023-03-06 21:41:14,154][62475] Updated weights for policy 0, policy_version 31780 (0.0006) [2023-03-06 21:41:14,957][62475] Updated weights for policy 0, policy_version 31790 (0.0007) [2023-03-06 21:41:15,773][62475] Updated weights for policy 0, policy_version 31800 (0.0006) [2023-03-06 21:41:16,567][62475] Updated weights for policy 0, policy_version 31810 (0.0007) [2023-03-06 21:41:17,377][62475] Updated weights for policy 0, policy_version 31820 (0.0007) [2023-03-06 21:41:17,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12721.9). Total num frames: 32583680. Throughput: 0: 12731.8. Samples: 32566198. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:41:17,390][62145] Avg episode reward: [(0, '422.794')] [2023-03-06 21:41:18,192][62475] Updated weights for policy 0, policy_version 31830 (0.0006) [2023-03-06 21:41:18,989][62475] Updated weights for policy 0, policy_version 31840 (0.0007) [2023-03-06 21:41:19,802][62475] Updated weights for policy 0, policy_version 31850 (0.0006) [2023-03-06 21:41:20,587][62475] Updated weights for policy 0, policy_version 31860 (0.0006) [2023-03-06 21:41:21,402][62475] Updated weights for policy 0, policy_version 31870 (0.0006) [2023-03-06 21:41:22,211][62475] Updated weights for policy 0, policy_version 31880 (0.0006) [2023-03-06 21:41:22,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 32647168. Throughput: 0: 12733.4. Samples: 32642744. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:41:22,390][62145] Avg episode reward: [(0, '355.209')] [2023-03-06 21:41:23,012][62475] Updated weights for policy 0, policy_version 31890 (0.0005) [2023-03-06 21:41:23,811][62475] Updated weights for policy 0, policy_version 31900 (0.0006) [2023-03-06 21:41:24,610][62475] Updated weights for policy 0, policy_version 31910 (0.0006) [2023-03-06 21:41:25,424][62475] Updated weights for policy 0, policy_version 31920 (0.0007) [2023-03-06 21:41:26,218][62475] Updated weights for policy 0, policy_version 31930 (0.0006) [2023-03-06 21:41:27,021][62475] Updated weights for policy 0, policy_version 31940 (0.0007) [2023-03-06 21:41:27,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12721.9). Total num frames: 32710656. Throughput: 0: 12731.7. Samples: 32680937. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:41:27,390][62145] Avg episode reward: [(0, '384.895')] [2023-03-06 21:41:27,816][62475] Updated weights for policy 0, policy_version 31950 (0.0007) [2023-03-06 21:41:28,624][62475] Updated weights for policy 0, policy_version 31960 (0.0006) [2023-03-06 21:41:29,449][62475] Updated weights for policy 0, policy_version 31970 (0.0006) [2023-03-06 21:41:30,259][62475] Updated weights for policy 0, policy_version 31980 (0.0007) [2023-03-06 21:41:31,057][62475] Updated weights for policy 0, policy_version 31990 (0.0006) [2023-03-06 21:41:31,873][62475] Updated weights for policy 0, policy_version 32000 (0.0006) [2023-03-06 21:41:32,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12721.9). Total num frames: 32774144. Throughput: 0: 12722.3. Samples: 32757127. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:41:32,390][62145] Avg episode reward: [(0, '371.892')] [2023-03-06 21:41:32,691][62475] Updated weights for policy 0, policy_version 32010 (0.0006) [2023-03-06 21:41:33,502][62475] Updated weights for policy 0, policy_version 32020 (0.0007) [2023-03-06 21:41:34,312][62475] Updated weights for policy 0, policy_version 32030 (0.0006) [2023-03-06 21:41:35,129][62475] Updated weights for policy 0, policy_version 32040 (0.0006) [2023-03-06 21:41:35,934][62475] Updated weights for policy 0, policy_version 32050 (0.0006) [2023-03-06 21:41:36,733][62475] Updated weights for policy 0, policy_version 32060 (0.0006) [2023-03-06 21:41:37,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12721.9). Total num frames: 32837632. Throughput: 0: 12703.8. Samples: 32832923. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:41:37,390][62145] Avg episode reward: [(0, '410.179')] [2023-03-06 21:41:37,540][62475] Updated weights for policy 0, policy_version 32070 (0.0007) [2023-03-06 21:41:38,350][62475] Updated weights for policy 0, policy_version 32080 (0.0007) [2023-03-06 21:41:39,157][62475] Updated weights for policy 0, policy_version 32090 (0.0006) [2023-03-06 21:41:39,961][62475] Updated weights for policy 0, policy_version 32100 (0.0006) [2023-03-06 21:41:40,760][62475] Updated weights for policy 0, policy_version 32110 (0.0006) [2023-03-06 21:41:41,577][62475] Updated weights for policy 0, policy_version 32120 (0.0007) [2023-03-06 21:41:42,375][62475] Updated weights for policy 0, policy_version 32130 (0.0007) [2023-03-06 21:41:42,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12721.9). Total num frames: 32901120. Throughput: 0: 12700.9. Samples: 32871025. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:41:42,390][62145] Avg episode reward: [(0, '354.758')] [2023-03-06 21:41:43,205][62475] Updated weights for policy 0, policy_version 32140 (0.0006) [2023-03-06 21:41:44,005][62475] Updated weights for policy 0, policy_version 32150 (0.0006) [2023-03-06 21:41:44,792][62475] Updated weights for policy 0, policy_version 32160 (0.0006) [2023-03-06 21:41:45,615][62475] Updated weights for policy 0, policy_version 32170 (0.0006) [2023-03-06 21:41:46,424][62475] Updated weights for policy 0, policy_version 32180 (0.0008) [2023-03-06 21:41:47,222][62475] Updated weights for policy 0, policy_version 32190 (0.0006) [2023-03-06 21:41:47,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12721.9). Total num frames: 32964608. Throughput: 0: 12699.8. Samples: 32947065. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:41:47,390][62145] Avg episode reward: [(0, '395.944')] [2023-03-06 21:41:48,034][62475] Updated weights for policy 0, policy_version 32200 (0.0007) [2023-03-06 21:41:48,833][62475] Updated weights for policy 0, policy_version 32210 (0.0007) [2023-03-06 21:41:49,655][62475] Updated weights for policy 0, policy_version 32220 (0.0007) [2023-03-06 21:41:50,441][62475] Updated weights for policy 0, policy_version 32230 (0.0006) [2023-03-06 21:41:51,250][62475] Updated weights for policy 0, policy_version 32240 (0.0007) [2023-03-06 21:41:52,065][62475] Updated weights for policy 0, policy_version 32250 (0.0006) [2023-03-06 21:41:52,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12721.9). Total num frames: 33028096. Throughput: 0: 12707.6. Samples: 33023383. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:41:52,390][62145] Avg episode reward: [(0, '478.876')] [2023-03-06 21:41:52,855][62475] Updated weights for policy 0, policy_version 32260 (0.0006) [2023-03-06 21:41:53,665][62475] Updated weights for policy 0, policy_version 32270 (0.0007) [2023-03-06 21:41:54,465][62475] Updated weights for policy 0, policy_version 32280 (0.0006) [2023-03-06 21:41:55,269][62475] Updated weights for policy 0, policy_version 32290 (0.0006) [2023-03-06 21:41:56,086][62475] Updated weights for policy 0, policy_version 32300 (0.0006) [2023-03-06 21:41:56,886][62475] Updated weights for policy 0, policy_version 32310 (0.0006) [2023-03-06 21:41:57,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12721.9). Total num frames: 33091584. Throughput: 0: 12708.7. Samples: 33061693. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:41:57,390][62145] Avg episode reward: [(0, '501.533')] [2023-03-06 21:41:57,683][62475] Updated weights for policy 0, policy_version 32320 (0.0006) [2023-03-06 21:41:58,506][62475] Updated weights for policy 0, policy_version 32330 (0.0007) [2023-03-06 21:41:59,303][62475] Updated weights for policy 0, policy_version 32340 (0.0007) [2023-03-06 21:42:00,104][62475] Updated weights for policy 0, policy_version 32350 (0.0006) [2023-03-06 21:42:00,925][62475] Updated weights for policy 0, policy_version 32360 (0.0006) [2023-03-06 21:42:01,706][62475] Updated weights for policy 0, policy_version 32370 (0.0006) [2023-03-06 21:42:02,390][62145] Fps is (10 sec: 12697.4, 60 sec: 12697.6, 300 sec: 12721.9). Total num frames: 33155072. Throughput: 0: 12705.0. Samples: 33137925. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:42:02,390][62145] Avg episode reward: [(0, '612.481')] [2023-03-06 21:42:02,503][62475] Updated weights for policy 0, policy_version 32380 (0.0006) [2023-03-06 21:42:03,324][62475] Updated weights for policy 0, policy_version 32390 (0.0007) [2023-03-06 21:42:04,109][62475] Updated weights for policy 0, policy_version 32400 (0.0006) [2023-03-06 21:42:04,918][62475] Updated weights for policy 0, policy_version 32410 (0.0006) [2023-03-06 21:42:05,719][62475] Updated weights for policy 0, policy_version 32420 (0.0006) [2023-03-06 21:42:06,522][62475] Updated weights for policy 0, policy_version 32430 (0.0006) [2023-03-06 21:42:07,330][62475] Updated weights for policy 0, policy_version 32440 (0.0006) [2023-03-06 21:42:07,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.6, 300 sec: 12721.9). Total num frames: 33218560. Throughput: 0: 12707.2. Samples: 33214570. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:42:07,390][62145] Avg episode reward: [(0, '432.430')] [2023-03-06 21:42:08,130][62475] Updated weights for policy 0, policy_version 32450 (0.0006) [2023-03-06 21:42:08,960][62475] Updated weights for policy 0, policy_version 32460 (0.0006) [2023-03-06 21:42:09,750][62475] Updated weights for policy 0, policy_version 32470 (0.0006) [2023-03-06 21:42:10,569][62475] Updated weights for policy 0, policy_version 32480 (0.0007) [2023-03-06 21:42:11,376][62475] Updated weights for policy 0, policy_version 32490 (0.0006) [2023-03-06 21:42:12,174][62475] Updated weights for policy 0, policy_version 32500 (0.0006) [2023-03-06 21:42:12,390][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12721.9). Total num frames: 33282048. Throughput: 0: 12702.2. Samples: 33252538. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:42:12,390][62145] Avg episode reward: [(0, '395.315')] [2023-03-06 21:42:12,979][62475] Updated weights for policy 0, policy_version 32510 (0.0007) [2023-03-06 21:42:13,794][62475] Updated weights for policy 0, policy_version 32520 (0.0006) [2023-03-06 21:42:14,577][62475] Updated weights for policy 0, policy_version 32530 (0.0006) [2023-03-06 21:42:15,374][62475] Updated weights for policy 0, policy_version 32540 (0.0006) [2023-03-06 21:42:16,199][62475] Updated weights for policy 0, policy_version 32550 (0.0007) [2023-03-06 21:42:16,993][62475] Updated weights for policy 0, policy_version 32560 (0.0006) [2023-03-06 21:42:17,390][62145] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12718.4). Total num frames: 33345536. Throughput: 0: 12705.5. Samples: 33328876. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:42:17,390][62145] Avg episode reward: [(0, '510.825')] [2023-03-06 21:42:17,786][62475] Updated weights for policy 0, policy_version 32570 (0.0006) [2023-03-06 21:42:18,618][62475] Updated weights for policy 0, policy_version 32580 (0.0007) [2023-03-06 21:42:19,422][62475] Updated weights for policy 0, policy_version 32590 (0.0007) [2023-03-06 21:42:20,224][62475] Updated weights for policy 0, policy_version 32600 (0.0008) [2023-03-06 21:42:21,033][62475] Updated weights for policy 0, policy_version 32610 (0.0006) [2023-03-06 21:42:21,829][62475] Updated weights for policy 0, policy_version 32620 (0.0006) [2023-03-06 21:42:22,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12718.4). Total num frames: 33409024. Throughput: 0: 12715.3. Samples: 33405112. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:42:22,390][62145] Avg episode reward: [(0, '313.486')] [2023-03-06 21:42:22,408][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000032627_33410048.pth... [2023-03-06 21:42:22,438][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000029644_30355456.pth [2023-03-06 21:42:22,628][62475] Updated weights for policy 0, policy_version 32630 (0.0006) [2023-03-06 21:42:23,448][62475] Updated weights for policy 0, policy_version 32640 (0.0006) [2023-03-06 21:42:24,239][62475] Updated weights for policy 0, policy_version 32650 (0.0006) [2023-03-06 21:42:25,032][62475] Updated weights for policy 0, policy_version 32660 (0.0008) [2023-03-06 21:42:25,856][62475] Updated weights for policy 0, policy_version 32670 (0.0006) [2023-03-06 21:42:26,667][62475] Updated weights for policy 0, policy_version 32680 (0.0006) [2023-03-06 21:42:27,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12718.4). Total num frames: 33472512. Throughput: 0: 12718.0. Samples: 33443335. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:42:27,390][62145] Avg episode reward: [(0, '333.415')] [2023-03-06 21:42:27,476][62475] Updated weights for policy 0, policy_version 32690 (0.0006) [2023-03-06 21:42:28,286][62475] Updated weights for policy 0, policy_version 32700 (0.0006) [2023-03-06 21:42:29,086][62475] Updated weights for policy 0, policy_version 32710 (0.0006) [2023-03-06 21:42:29,907][62475] Updated weights for policy 0, policy_version 32720 (0.0007) [2023-03-06 21:42:30,705][62475] Updated weights for policy 0, policy_version 32730 (0.0006) [2023-03-06 21:42:31,515][62475] Updated weights for policy 0, policy_version 32740 (0.0006) [2023-03-06 21:42:32,325][62475] Updated weights for policy 0, policy_version 32750 (0.0006) [2023-03-06 21:42:32,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12718.4). Total num frames: 33536000. Throughput: 0: 12716.1. Samples: 33519289. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:42:32,390][62145] Avg episode reward: [(0, '501.379')] [2023-03-06 21:42:33,119][62475] Updated weights for policy 0, policy_version 32760 (0.0006) [2023-03-06 21:42:33,955][62475] Updated weights for policy 0, policy_version 32770 (0.0007) [2023-03-06 21:42:34,754][62475] Updated weights for policy 0, policy_version 32780 (0.0006) [2023-03-06 21:42:35,544][62475] Updated weights for policy 0, policy_version 32790 (0.0006) [2023-03-06 21:42:36,369][62475] Updated weights for policy 0, policy_version 32800 (0.0006) [2023-03-06 21:42:37,179][62475] Updated weights for policy 0, policy_version 32810 (0.0006) [2023-03-06 21:42:37,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12718.4). Total num frames: 33599488. Throughput: 0: 12710.4. Samples: 33595350. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:42:37,390][62145] Avg episode reward: [(0, '553.737')] [2023-03-06 21:42:37,980][62475] Updated weights for policy 0, policy_version 32820 (0.0007) [2023-03-06 21:42:38,786][62475] Updated weights for policy 0, policy_version 32830 (0.0006) [2023-03-06 21:42:39,589][62475] Updated weights for policy 0, policy_version 32840 (0.0007) [2023-03-06 21:42:40,402][62475] Updated weights for policy 0, policy_version 32850 (0.0006) [2023-03-06 21:42:41,222][62475] Updated weights for policy 0, policy_version 32860 (0.0006) [2023-03-06 21:42:42,030][62475] Updated weights for policy 0, policy_version 32870 (0.0006) [2023-03-06 21:42:42,390][62145] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12721.9). Total num frames: 33662976. Throughput: 0: 12704.3. Samples: 33633385. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:42:42,390][62145] Avg episode reward: [(0, '540.587')] [2023-03-06 21:42:42,824][62475] Updated weights for policy 0, policy_version 32880 (0.0007) [2023-03-06 21:42:43,636][62475] Updated weights for policy 0, policy_version 32890 (0.0006) [2023-03-06 21:42:44,441][62475] Updated weights for policy 0, policy_version 32900 (0.0006) [2023-03-06 21:42:45,239][62475] Updated weights for policy 0, policy_version 32910 (0.0007) [2023-03-06 21:42:46,070][62475] Updated weights for policy 0, policy_version 32920 (0.0006) [2023-03-06 21:42:46,872][62475] Updated weights for policy 0, policy_version 32930 (0.0006) [2023-03-06 21:42:47,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12721.9). Total num frames: 33726464. Throughput: 0: 12699.1. Samples: 33709384. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:42:47,390][62145] Avg episode reward: [(0, '651.604')] [2023-03-06 21:42:47,660][62475] Updated weights for policy 0, policy_version 32940 (0.0006) [2023-03-06 21:42:48,466][62475] Updated weights for policy 0, policy_version 32950 (0.0006) [2023-03-06 21:42:49,265][62475] Updated weights for policy 0, policy_version 32960 (0.0006) [2023-03-06 21:42:50,089][62475] Updated weights for policy 0, policy_version 32970 (0.0007) [2023-03-06 21:42:50,883][62475] Updated weights for policy 0, policy_version 32980 (0.0006) [2023-03-06 21:42:51,690][62475] Updated weights for policy 0, policy_version 32990 (0.0007) [2023-03-06 21:42:52,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12721.9). Total num frames: 33789952. Throughput: 0: 12692.4. Samples: 33785728. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:42:52,390][62145] Avg episode reward: [(0, '612.750')] [2023-03-06 21:42:52,506][62475] Updated weights for policy 0, policy_version 33000 (0.0006) [2023-03-06 21:42:53,306][62475] Updated weights for policy 0, policy_version 33010 (0.0006) [2023-03-06 21:42:54,120][62475] Updated weights for policy 0, policy_version 33020 (0.0006) [2023-03-06 21:42:54,921][62475] Updated weights for policy 0, policy_version 33030 (0.0006) [2023-03-06 21:42:55,732][62475] Updated weights for policy 0, policy_version 33040 (0.0006) [2023-03-06 21:42:56,542][62475] Updated weights for policy 0, policy_version 33050 (0.0006) [2023-03-06 21:42:57,353][62475] Updated weights for policy 0, policy_version 33060 (0.0006) [2023-03-06 21:42:57,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12718.4). Total num frames: 33853440. Throughput: 0: 12697.4. Samples: 33823922. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:42:57,390][62145] Avg episode reward: [(0, '413.502')] [2023-03-06 21:42:58,154][62475] Updated weights for policy 0, policy_version 33070 (0.0007) [2023-03-06 21:42:58,977][62475] Updated weights for policy 0, policy_version 33080 (0.0006) [2023-03-06 21:42:59,773][62475] Updated weights for policy 0, policy_version 33090 (0.0006) [2023-03-06 21:43:00,573][62475] Updated weights for policy 0, policy_version 33100 (0.0007) [2023-03-06 21:43:01,374][62475] Updated weights for policy 0, policy_version 33110 (0.0006) [2023-03-06 21:43:02,201][62475] Updated weights for policy 0, policy_version 33120 (0.0008) [2023-03-06 21:43:02,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12718.4). Total num frames: 33916928. Throughput: 0: 12691.5. Samples: 33899994. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:43:02,390][62145] Avg episode reward: [(0, '480.949')] [2023-03-06 21:43:02,998][62475] Updated weights for policy 0, policy_version 33130 (0.0006) [2023-03-06 21:43:03,810][62475] Updated weights for policy 0, policy_version 33140 (0.0007) [2023-03-06 21:43:04,626][62475] Updated weights for policy 0, policy_version 33150 (0.0006) [2023-03-06 21:43:05,434][62475] Updated weights for policy 0, policy_version 33160 (0.0005) [2023-03-06 21:43:06,242][62475] Updated weights for policy 0, policy_version 33170 (0.0006) [2023-03-06 21:43:07,057][62475] Updated weights for policy 0, policy_version 33180 (0.0006) [2023-03-06 21:43:07,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12718.4). Total num frames: 33980416. Throughput: 0: 12685.1. Samples: 33975940. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:43:07,390][62145] Avg episode reward: [(0, '418.349')] [2023-03-06 21:43:07,866][62475] Updated weights for policy 0, policy_version 33190 (0.0006) [2023-03-06 21:43:08,689][62475] Updated weights for policy 0, policy_version 33200 (0.0007) [2023-03-06 21:43:09,465][62475] Updated weights for policy 0, policy_version 33210 (0.0006) [2023-03-06 21:43:10,286][62475] Updated weights for policy 0, policy_version 33220 (0.0007) [2023-03-06 21:43:11,057][62475] Updated weights for policy 0, policy_version 33230 (0.0006) [2023-03-06 21:43:11,853][62475] Updated weights for policy 0, policy_version 33240 (0.0006) [2023-03-06 21:43:12,390][62145] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12718.4). Total num frames: 34043904. Throughput: 0: 12683.7. Samples: 34014103. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:43:12,390][62145] Avg episode reward: [(0, '375.154')] [2023-03-06 21:43:12,646][62475] Updated weights for policy 0, policy_version 33250 (0.0006) [2023-03-06 21:43:13,451][62475] Updated weights for policy 0, policy_version 33260 (0.0006) [2023-03-06 21:43:14,260][62475] Updated weights for policy 0, policy_version 33270 (0.0006) [2023-03-06 21:43:15,079][62475] Updated weights for policy 0, policy_version 33280 (0.0006) [2023-03-06 21:43:15,864][62475] Updated weights for policy 0, policy_version 33290 (0.0006) [2023-03-06 21:43:16,686][62475] Updated weights for policy 0, policy_version 33300 (0.0006) [2023-03-06 21:43:17,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12718.4). Total num frames: 34107392. Throughput: 0: 12697.2. Samples: 34090661. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:43:17,390][62145] Avg episode reward: [(0, '397.409')] [2023-03-06 21:43:17,478][62475] Updated weights for policy 0, policy_version 33310 (0.0006) [2023-03-06 21:43:18,277][62475] Updated weights for policy 0, policy_version 33320 (0.0008) [2023-03-06 21:43:19,079][62475] Updated weights for policy 0, policy_version 33330 (0.0006) [2023-03-06 21:43:19,881][62475] Updated weights for policy 0, policy_version 33340 (0.0006) [2023-03-06 21:43:20,687][62475] Updated weights for policy 0, policy_version 33350 (0.0006) [2023-03-06 21:43:21,504][62475] Updated weights for policy 0, policy_version 33360 (0.0006) [2023-03-06 21:43:22,318][62475] Updated weights for policy 0, policy_version 33370 (0.0006) [2023-03-06 21:43:22,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12714.7, 300 sec: 12721.9). Total num frames: 34171904. Throughput: 0: 12705.8. Samples: 34167112. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:43:22,390][62145] Avg episode reward: [(0, '508.178')] [2023-03-06 21:43:23,133][62475] Updated weights for policy 0, policy_version 33380 (0.0006) [2023-03-06 21:43:23,916][62475] Updated weights for policy 0, policy_version 33390 (0.0006) [2023-03-06 21:43:24,716][62475] Updated weights for policy 0, policy_version 33400 (0.0006) [2023-03-06 21:43:25,506][62475] Updated weights for policy 0, policy_version 33410 (0.0006) [2023-03-06 21:43:26,316][62475] Updated weights for policy 0, policy_version 33420 (0.0006) [2023-03-06 21:43:27,104][62475] Updated weights for policy 0, policy_version 33430 (0.0006) [2023-03-06 21:43:27,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12714.7, 300 sec: 12721.9). Total num frames: 34235392. Throughput: 0: 12711.7. Samples: 34205411. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:43:27,390][62145] Avg episode reward: [(0, '643.787')] [2023-03-06 21:43:27,941][62475] Updated weights for policy 0, policy_version 33440 (0.0006) [2023-03-06 21:43:28,750][62475] Updated weights for policy 0, policy_version 33450 (0.0006) [2023-03-06 21:43:29,535][62475] Updated weights for policy 0, policy_version 33460 (0.0006) [2023-03-06 21:43:30,345][62475] Updated weights for policy 0, policy_version 33470 (0.0006) [2023-03-06 21:43:31,144][62475] Updated weights for policy 0, policy_version 33480 (0.0007) [2023-03-06 21:43:31,941][62475] Updated weights for policy 0, policy_version 33490 (0.0006) [2023-03-06 21:43:32,389][62145] Fps is (10 sec: 12697.8, 60 sec: 12714.7, 300 sec: 12721.9). Total num frames: 34298880. Throughput: 0: 12720.2. Samples: 34281792. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:43:32,390][62145] Avg episode reward: [(0, '502.356')] [2023-03-06 21:43:32,745][62475] Updated weights for policy 0, policy_version 33500 (0.0006) [2023-03-06 21:43:33,547][62475] Updated weights for policy 0, policy_version 33510 (0.0006) [2023-03-06 21:43:34,348][62475] Updated weights for policy 0, policy_version 33520 (0.0006) [2023-03-06 21:43:35,166][62475] Updated weights for policy 0, policy_version 33530 (0.0006) [2023-03-06 21:43:35,973][62475] Updated weights for policy 0, policy_version 33540 (0.0007) [2023-03-06 21:43:36,788][62475] Updated weights for policy 0, policy_version 33550 (0.0006) [2023-03-06 21:43:37,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12721.9). Total num frames: 34362368. Throughput: 0: 12720.3. Samples: 34358140. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:43:37,390][62145] Avg episode reward: [(0, '479.515')] [2023-03-06 21:43:37,603][62475] Updated weights for policy 0, policy_version 33560 (0.0006) [2023-03-06 21:43:38,417][62475] Updated weights for policy 0, policy_version 33570 (0.0006) [2023-03-06 21:43:39,231][62475] Updated weights for policy 0, policy_version 33580 (0.0007) [2023-03-06 21:43:40,017][62475] Updated weights for policy 0, policy_version 33590 (0.0006) [2023-03-06 21:43:40,829][62475] Updated weights for policy 0, policy_version 33600 (0.0006) [2023-03-06 21:43:41,613][62475] Updated weights for policy 0, policy_version 33610 (0.0006) [2023-03-06 21:43:42,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12721.9). Total num frames: 34425856. Throughput: 0: 12713.3. Samples: 34396019. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:43:42,390][62145] Avg episode reward: [(0, '582.544')] [2023-03-06 21:43:42,427][62475] Updated weights for policy 0, policy_version 33620 (0.0006) [2023-03-06 21:43:43,228][62475] Updated weights for policy 0, policy_version 33630 (0.0006) [2023-03-06 21:43:44,012][62475] Updated weights for policy 0, policy_version 33640 (0.0006) [2023-03-06 21:43:44,844][62475] Updated weights for policy 0, policy_version 33650 (0.0007) [2023-03-06 21:43:45,644][62475] Updated weights for policy 0, policy_version 33660 (0.0006) [2023-03-06 21:43:46,454][62475] Updated weights for policy 0, policy_version 33670 (0.0006) [2023-03-06 21:43:47,248][62475] Updated weights for policy 0, policy_version 33680 (0.0007) [2023-03-06 21:43:47,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12721.9). Total num frames: 34489344. Throughput: 0: 12723.8. Samples: 34472564. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:43:47,390][62145] Avg episode reward: [(0, '522.658')] [2023-03-06 21:43:48,062][62475] Updated weights for policy 0, policy_version 33690 (0.0007) [2023-03-06 21:43:48,866][62475] Updated weights for policy 0, policy_version 33700 (0.0007) [2023-03-06 21:43:49,660][62475] Updated weights for policy 0, policy_version 33710 (0.0006) [2023-03-06 21:43:50,461][62475] Updated weights for policy 0, policy_version 33720 (0.0007) [2023-03-06 21:43:51,264][62475] Updated weights for policy 0, policy_version 33730 (0.0007) [2023-03-06 21:43:52,064][62475] Updated weights for policy 0, policy_version 33740 (0.0007) [2023-03-06 21:43:52,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12721.9). Total num frames: 34552832. Throughput: 0: 12733.9. Samples: 34548968. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:43:52,400][62145] Avg episode reward: [(0, '551.150')] [2023-03-06 21:43:52,887][62475] Updated weights for policy 0, policy_version 33750 (0.0006) [2023-03-06 21:43:53,683][62475] Updated weights for policy 0, policy_version 33760 (0.0007) [2023-03-06 21:43:54,517][62475] Updated weights for policy 0, policy_version 33770 (0.0006) [2023-03-06 21:43:55,303][62475] Updated weights for policy 0, policy_version 33780 (0.0006) [2023-03-06 21:43:56,107][62475] Updated weights for policy 0, policy_version 33790 (0.0006) [2023-03-06 21:43:56,927][62475] Updated weights for policy 0, policy_version 33800 (0.0006) [2023-03-06 21:43:57,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12721.9). Total num frames: 34616320. Throughput: 0: 12729.0. Samples: 34586909. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:43:57,401][62145] Avg episode reward: [(0, '536.301')] [2023-03-06 21:43:57,718][62475] Updated weights for policy 0, policy_version 33810 (0.0006) [2023-03-06 21:43:58,520][62475] Updated weights for policy 0, policy_version 33820 (0.0007) [2023-03-06 21:43:59,321][62475] Updated weights for policy 0, policy_version 33830 (0.0006) [2023-03-06 21:44:00,134][62475] Updated weights for policy 0, policy_version 33840 (0.0007) [2023-03-06 21:44:00,945][62475] Updated weights for policy 0, policy_version 33850 (0.0006) [2023-03-06 21:44:01,733][62475] Updated weights for policy 0, policy_version 33860 (0.0006) [2023-03-06 21:44:02,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.8, 300 sec: 12721.9). Total num frames: 34680832. Throughput: 0: 12722.2. Samples: 34663160. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:44:02,401][62145] Avg episode reward: [(0, '472.332')] [2023-03-06 21:44:02,538][62475] Updated weights for policy 0, policy_version 33870 (0.0006) [2023-03-06 21:44:03,355][62475] Updated weights for policy 0, policy_version 33880 (0.0006) [2023-03-06 21:44:04,153][62475] Updated weights for policy 0, policy_version 33890 (0.0006) [2023-03-06 21:44:04,969][62475] Updated weights for policy 0, policy_version 33900 (0.0006) [2023-03-06 21:44:05,767][62475] Updated weights for policy 0, policy_version 33910 (0.0007) [2023-03-06 21:44:06,550][62475] Updated weights for policy 0, policy_version 33920 (0.0006) [2023-03-06 21:44:07,365][62475] Updated weights for policy 0, policy_version 33930 (0.0005) [2023-03-06 21:44:07,390][62145] Fps is (10 sec: 12800.1, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 34744320. Throughput: 0: 12725.8. Samples: 34739770. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:44:07,390][62145] Avg episode reward: [(0, '549.282')] [2023-03-06 21:44:08,158][62475] Updated weights for policy 0, policy_version 33940 (0.0006) [2023-03-06 21:44:08,973][62475] Updated weights for policy 0, policy_version 33950 (0.0006) [2023-03-06 21:44:09,787][62475] Updated weights for policy 0, policy_version 33960 (0.0006) [2023-03-06 21:44:10,585][62475] Updated weights for policy 0, policy_version 33970 (0.0007) [2023-03-06 21:44:11,410][62475] Updated weights for policy 0, policy_version 33980 (0.0006) [2023-03-06 21:44:12,219][62475] Updated weights for policy 0, policy_version 33990 (0.0005) [2023-03-06 21:44:12,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 34807808. Throughput: 0: 12721.7. Samples: 34777889. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 21:44:12,400][62145] Avg episode reward: [(0, '689.730')] [2023-03-06 21:44:13,022][62475] Updated weights for policy 0, policy_version 34000 (0.0006) [2023-03-06 21:44:13,828][62475] Updated weights for policy 0, policy_version 34010 (0.0006) [2023-03-06 21:44:14,633][62475] Updated weights for policy 0, policy_version 34020 (0.0006) [2023-03-06 21:44:15,437][62475] Updated weights for policy 0, policy_version 34030 (0.0006) [2023-03-06 21:44:16,254][62475] Updated weights for policy 0, policy_version 34040 (0.0006) [2023-03-06 21:44:17,049][62475] Updated weights for policy 0, policy_version 34050 (0.0006) [2023-03-06 21:44:17,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12731.7, 300 sec: 12721.9). Total num frames: 34871296. Throughput: 0: 12715.2. Samples: 34853977. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 21:44:17,390][62145] Avg episode reward: [(0, '620.268')] [2023-03-06 21:44:17,852][62475] Updated weights for policy 0, policy_version 34060 (0.0006) [2023-03-06 21:44:18,673][62475] Updated weights for policy 0, policy_version 34070 (0.0006) [2023-03-06 21:44:19,480][62475] Updated weights for policy 0, policy_version 34080 (0.0006) [2023-03-06 21:44:20,281][62475] Updated weights for policy 0, policy_version 34090 (0.0006) [2023-03-06 21:44:21,076][62475] Updated weights for policy 0, policy_version 34100 (0.0006) [2023-03-06 21:44:21,883][62475] Updated weights for policy 0, policy_version 34110 (0.0006) [2023-03-06 21:44:22,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12721.9). Total num frames: 34934784. Throughput: 0: 12715.9. Samples: 34930356. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 21:44:22,390][62145] Avg episode reward: [(0, '489.616')] [2023-03-06 21:44:22,394][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000034116_34934784.pth... [2023-03-06 21:44:22,424][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000031136_31883264.pth [2023-03-06 21:44:22,687][62475] Updated weights for policy 0, policy_version 34120 (0.0006) [2023-03-06 21:44:23,507][62475] Updated weights for policy 0, policy_version 34130 (0.0007) [2023-03-06 21:44:24,300][62475] Updated weights for policy 0, policy_version 34140 (0.0006) [2023-03-06 21:44:25,109][62475] Updated weights for policy 0, policy_version 34150 (0.0006) [2023-03-06 21:44:25,911][62475] Updated weights for policy 0, policy_version 34160 (0.0006) [2023-03-06 21:44:26,712][62475] Updated weights for policy 0, policy_version 34170 (0.0006) [2023-03-06 21:44:27,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12718.4). Total num frames: 34998272. Throughput: 0: 12719.4. Samples: 34968390. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 21:44:27,390][62145] Avg episode reward: [(0, '555.791')] [2023-03-06 21:44:27,509][62475] Updated weights for policy 0, policy_version 34180 (0.0006) [2023-03-06 21:44:28,328][62475] Updated weights for policy 0, policy_version 34190 (0.0007) [2023-03-06 21:44:29,126][62475] Updated weights for policy 0, policy_version 34200 (0.0006) [2023-03-06 21:44:29,942][62475] Updated weights for policy 0, policy_version 34210 (0.0006) [2023-03-06 21:44:30,728][62475] Updated weights for policy 0, policy_version 34220 (0.0006) [2023-03-06 21:44:31,557][62475] Updated weights for policy 0, policy_version 34230 (0.0006) [2023-03-06 21:44:32,389][62475] Updated weights for policy 0, policy_version 34240 (0.0007) [2023-03-06 21:44:32,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12718.4). Total num frames: 35061760. Throughput: 0: 12713.9. Samples: 35044689. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:44:32,390][62145] Avg episode reward: [(0, '507.030')] [2023-03-06 21:44:33,194][62475] Updated weights for policy 0, policy_version 34250 (0.0007) [2023-03-06 21:44:34,013][62475] Updated weights for policy 0, policy_version 34260 (0.0006) [2023-03-06 21:44:34,831][62475] Updated weights for policy 0, policy_version 34270 (0.0006) [2023-03-06 21:44:35,625][62475] Updated weights for policy 0, policy_version 34280 (0.0006) [2023-03-06 21:44:36,434][62475] Updated weights for policy 0, policy_version 34290 (0.0006) [2023-03-06 21:44:37,262][62475] Updated weights for policy 0, policy_version 34300 (0.0006) [2023-03-06 21:44:37,390][62145] Fps is (10 sec: 12595.1, 60 sec: 12697.6, 300 sec: 12715.0). Total num frames: 35124224. Throughput: 0: 12698.7. Samples: 35120411. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:44:37,390][62145] Avg episode reward: [(0, '650.107')] [2023-03-06 21:44:38,044][62475] Updated weights for policy 0, policy_version 34310 (0.0006) [2023-03-06 21:44:38,867][62475] Updated weights for policy 0, policy_version 34320 (0.0007) [2023-03-06 21:44:39,655][62475] Updated weights for policy 0, policy_version 34330 (0.0006) [2023-03-06 21:44:40,451][62475] Updated weights for policy 0, policy_version 34340 (0.0006) [2023-03-06 21:44:41,261][62475] Updated weights for policy 0, policy_version 34350 (0.0006) [2023-03-06 21:44:42,077][62475] Updated weights for policy 0, policy_version 34360 (0.0006) [2023-03-06 21:44:42,390][62145] Fps is (10 sec: 12595.1, 60 sec: 12697.6, 300 sec: 12715.0). Total num frames: 35187712. Throughput: 0: 12699.4. Samples: 35158383. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:44:42,390][62145] Avg episode reward: [(0, '662.939')] [2023-03-06 21:44:42,885][62475] Updated weights for policy 0, policy_version 34370 (0.0007) [2023-03-06 21:44:43,691][62475] Updated weights for policy 0, policy_version 34380 (0.0006) [2023-03-06 21:44:44,496][62475] Updated weights for policy 0, policy_version 34390 (0.0007) [2023-03-06 21:44:45,282][62475] Updated weights for policy 0, policy_version 34400 (0.0006) [2023-03-06 21:44:46,099][62475] Updated weights for policy 0, policy_version 34410 (0.0006) [2023-03-06 21:44:46,903][62475] Updated weights for policy 0, policy_version 34420 (0.0006) [2023-03-06 21:44:47,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12711.5). Total num frames: 35251200. Throughput: 0: 12701.5. Samples: 35234731. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:44:47,390][62145] Avg episode reward: [(0, '468.800')] [2023-03-06 21:44:47,722][62475] Updated weights for policy 0, policy_version 34430 (0.0006) [2023-03-06 21:44:48,531][62475] Updated weights for policy 0, policy_version 34440 (0.0007) [2023-03-06 21:44:49,343][62475] Updated weights for policy 0, policy_version 34450 (0.0007) [2023-03-06 21:44:50,160][62475] Updated weights for policy 0, policy_version 34460 (0.0006) [2023-03-06 21:44:50,964][62475] Updated weights for policy 0, policy_version 34470 (0.0007) [2023-03-06 21:44:51,774][62475] Updated weights for policy 0, policy_version 34480 (0.0006) [2023-03-06 21:44:52,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12711.5). Total num frames: 35314688. Throughput: 0: 12687.7. Samples: 35310714. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:44:52,390][62145] Avg episode reward: [(0, '678.500')] [2023-03-06 21:44:52,577][62475] Updated weights for policy 0, policy_version 34490 (0.0007) [2023-03-06 21:44:53,379][62475] Updated weights for policy 0, policy_version 34500 (0.0007) [2023-03-06 21:44:54,179][62475] Updated weights for policy 0, policy_version 34510 (0.0007) [2023-03-06 21:44:54,989][62475] Updated weights for policy 0, policy_version 34520 (0.0007) [2023-03-06 21:44:55,532][62424] KL-divergence is very high: 239.5113 [2023-03-06 21:44:55,786][62475] Updated weights for policy 0, policy_version 34530 (0.0006) [2023-03-06 21:44:55,849][62424] KL-divergence is very high: 106.4870 [2023-03-06 21:44:56,582][62475] Updated weights for policy 0, policy_version 34540 (0.0006) [2023-03-06 21:44:57,389][62145] Fps is (10 sec: 12697.8, 60 sec: 12697.6, 300 sec: 12711.5). Total num frames: 35378176. Throughput: 0: 12685.7. Samples: 35348744. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:44:57,390][62145] Avg episode reward: [(0, '585.060')] [2023-03-06 21:44:57,405][62475] Updated weights for policy 0, policy_version 34550 (0.0006) [2023-03-06 21:44:58,204][62475] Updated weights for policy 0, policy_version 34560 (0.0007) [2023-03-06 21:44:58,998][62475] Updated weights for policy 0, policy_version 34570 (0.0006) [2023-03-06 21:44:59,798][62475] Updated weights for policy 0, policy_version 34580 (0.0006) [2023-03-06 21:45:00,597][62475] Updated weights for policy 0, policy_version 34590 (0.0006) [2023-03-06 21:45:01,400][62475] Updated weights for policy 0, policy_version 34600 (0.0006) [2023-03-06 21:45:02,202][62475] Updated weights for policy 0, policy_version 34610 (0.0006) [2023-03-06 21:45:02,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12697.6, 300 sec: 12715.0). Total num frames: 35442688. Throughput: 0: 12699.0. Samples: 35425431. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:45:02,390][62145] Avg episode reward: [(0, '614.457')] [2023-03-06 21:45:03,017][62475] Updated weights for policy 0, policy_version 34620 (0.0007) [2023-03-06 21:45:03,824][62475] Updated weights for policy 0, policy_version 34630 (0.0007) [2023-03-06 21:45:04,640][62475] Updated weights for policy 0, policy_version 34640 (0.0006) [2023-03-06 21:45:05,435][62475] Updated weights for policy 0, policy_version 34650 (0.0008) [2023-03-06 21:45:06,249][62475] Updated weights for policy 0, policy_version 34660 (0.0006) [2023-03-06 21:45:07,060][62475] Updated weights for policy 0, policy_version 34670 (0.0007) [2023-03-06 21:45:07,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12697.6, 300 sec: 12715.0). Total num frames: 35506176. Throughput: 0: 12690.3. Samples: 35501421. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:45:07,390][62145] Avg episode reward: [(0, '777.167')] [2023-03-06 21:45:07,875][62475] Updated weights for policy 0, policy_version 34680 (0.0006) [2023-03-06 21:45:08,679][62475] Updated weights for policy 0, policy_version 34690 (0.0007) [2023-03-06 21:45:09,470][62475] Updated weights for policy 0, policy_version 34700 (0.0006) [2023-03-06 21:45:10,292][62475] Updated weights for policy 0, policy_version 34710 (0.0006) [2023-03-06 21:45:11,104][62475] Updated weights for policy 0, policy_version 34720 (0.0006) [2023-03-06 21:45:11,900][62475] Updated weights for policy 0, policy_version 34730 (0.0006) [2023-03-06 21:45:12,389][62145] Fps is (10 sec: 12595.3, 60 sec: 12680.5, 300 sec: 12708.0). Total num frames: 35568640. Throughput: 0: 12691.8. Samples: 35539521. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:45:12,390][62145] Avg episode reward: [(0, '702.545')] [2023-03-06 21:45:12,717][62475] Updated weights for policy 0, policy_version 34740 (0.0007) [2023-03-06 21:45:13,529][62475] Updated weights for policy 0, policy_version 34750 (0.0006) [2023-03-06 21:45:14,332][62475] Updated weights for policy 0, policy_version 34760 (0.0008) [2023-03-06 21:45:15,151][62475] Updated weights for policy 0, policy_version 34770 (0.0006) [2023-03-06 21:45:15,961][62475] Updated weights for policy 0, policy_version 34780 (0.0006) [2023-03-06 21:45:16,762][62475] Updated weights for policy 0, policy_version 34790 (0.0006) [2023-03-06 21:45:17,389][62145] Fps is (10 sec: 12595.2, 60 sec: 12680.5, 300 sec: 12708.0). Total num frames: 35632128. Throughput: 0: 12682.4. Samples: 35615398. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:45:17,390][62145] Avg episode reward: [(0, '735.991')] [2023-03-06 21:45:17,559][62475] Updated weights for policy 0, policy_version 34800 (0.0006) [2023-03-06 21:45:18,342][62475] Updated weights for policy 0, policy_version 34810 (0.0006) [2023-03-06 21:45:19,158][62475] Updated weights for policy 0, policy_version 34820 (0.0006) [2023-03-06 21:45:19,961][62475] Updated weights for policy 0, policy_version 34830 (0.0006) [2023-03-06 21:45:20,777][62475] Updated weights for policy 0, policy_version 34840 (0.0006) [2023-03-06 21:45:21,586][62475] Updated weights for policy 0, policy_version 34850 (0.0006) [2023-03-06 21:45:22,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12680.5, 300 sec: 12708.0). Total num frames: 35695616. Throughput: 0: 12698.7. Samples: 35691854. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:45:22,390][62145] Avg episode reward: [(0, '844.023')] [2023-03-06 21:45:22,393][62475] Updated weights for policy 0, policy_version 34860 (0.0007) [2023-03-06 21:45:23,186][62475] Updated weights for policy 0, policy_version 34870 (0.0006) [2023-03-06 21:45:24,010][62475] Updated weights for policy 0, policy_version 34880 (0.0006) [2023-03-06 21:45:24,790][62475] Updated weights for policy 0, policy_version 34890 (0.0006) [2023-03-06 21:45:25,606][62475] Updated weights for policy 0, policy_version 34900 (0.0006) [2023-03-06 21:45:26,433][62475] Updated weights for policy 0, policy_version 34910 (0.0007) [2023-03-06 21:45:27,241][62475] Updated weights for policy 0, policy_version 34920 (0.0006) [2023-03-06 21:45:27,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12708.0). Total num frames: 35759104. Throughput: 0: 12702.3. Samples: 35729987. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:45:27,390][62145] Avg episode reward: [(0, '653.733')] [2023-03-06 21:45:28,035][62475] Updated weights for policy 0, policy_version 34930 (0.0005) [2023-03-06 21:45:28,833][62475] Updated weights for policy 0, policy_version 34940 (0.0006) [2023-03-06 21:45:29,627][62475] Updated weights for policy 0, policy_version 34950 (0.0006) [2023-03-06 21:45:30,439][62475] Updated weights for policy 0, policy_version 34960 (0.0008) [2023-03-06 21:45:30,818][62424] KL-divergence is very high: 498.2453 [2023-03-06 21:45:31,237][62475] Updated weights for policy 0, policy_version 34970 (0.0007) [2023-03-06 21:45:32,050][62475] Updated weights for policy 0, policy_version 34980 (0.0006) [2023-03-06 21:45:32,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12697.6, 300 sec: 12711.5). Total num frames: 35823616. Throughput: 0: 12704.4. Samples: 35806428. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:45:32,390][62145] Avg episode reward: [(0, '732.455')] [2023-03-06 21:45:32,854][62475] Updated weights for policy 0, policy_version 34990 (0.0006) [2023-03-06 21:45:33,659][62475] Updated weights for policy 0, policy_version 35000 (0.0006) [2023-03-06 21:45:34,444][62475] Updated weights for policy 0, policy_version 35010 (0.0007) [2023-03-06 21:45:35,264][62475] Updated weights for policy 0, policy_version 35020 (0.0007) [2023-03-06 21:45:36,097][62475] Updated weights for policy 0, policy_version 35030 (0.0006) [2023-03-06 21:45:36,897][62475] Updated weights for policy 0, policy_version 35040 (0.0006) [2023-03-06 21:45:37,390][62145] Fps is (10 sec: 12799.8, 60 sec: 12714.7, 300 sec: 12711.5). Total num frames: 35887104. Throughput: 0: 12708.2. Samples: 35882584. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:45:37,390][62145] Avg episode reward: [(0, '583.997')] [2023-03-06 21:45:37,705][62475] Updated weights for policy 0, policy_version 35050 (0.0006) [2023-03-06 21:45:38,530][62475] Updated weights for policy 0, policy_version 35060 (0.0007) [2023-03-06 21:45:39,326][62475] Updated weights for policy 0, policy_version 35070 (0.0006) [2023-03-06 21:45:40,126][62475] Updated weights for policy 0, policy_version 35080 (0.0007) [2023-03-06 21:45:40,933][62475] Updated weights for policy 0, policy_version 35090 (0.0008) [2023-03-06 21:45:41,730][62475] Updated weights for policy 0, policy_version 35100 (0.0007) [2023-03-06 21:45:42,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12708.0). Total num frames: 35950592. Throughput: 0: 12708.0. Samples: 35920603. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:45:42,390][62145] Avg episode reward: [(0, '947.714')] [2023-03-06 21:45:42,525][62475] Updated weights for policy 0, policy_version 35110 (0.0006) [2023-03-06 21:45:43,322][62475] Updated weights for policy 0, policy_version 35120 (0.0006) [2023-03-06 21:45:44,128][62475] Updated weights for policy 0, policy_version 35130 (0.0006) [2023-03-06 21:45:44,944][62475] Updated weights for policy 0, policy_version 35140 (0.0006) [2023-03-06 21:45:45,738][62475] Updated weights for policy 0, policy_version 35150 (0.0006) [2023-03-06 21:45:46,543][62475] Updated weights for policy 0, policy_version 35160 (0.0006) [2023-03-06 21:45:47,353][62475] Updated weights for policy 0, policy_version 35170 (0.0006) [2023-03-06 21:45:47,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12708.0). Total num frames: 36014080. Throughput: 0: 12700.6. Samples: 35996956. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:45:47,390][62145] Avg episode reward: [(0, '854.978')] [2023-03-06 21:45:48,152][62475] Updated weights for policy 0, policy_version 35180 (0.0006) [2023-03-06 21:45:48,947][62475] Updated weights for policy 0, policy_version 35190 (0.0006) [2023-03-06 21:45:49,759][62475] Updated weights for policy 0, policy_version 35200 (0.0006) [2023-03-06 21:45:50,561][62475] Updated weights for policy 0, policy_version 35210 (0.0006) [2023-03-06 21:45:51,356][62475] Updated weights for policy 0, policy_version 35220 (0.0007) [2023-03-06 21:45:52,165][62475] Updated weights for policy 0, policy_version 35230 (0.0006) [2023-03-06 21:45:52,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.6, 300 sec: 12708.0). Total num frames: 36077568. Throughput: 0: 12716.3. Samples: 36073656. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:45:52,390][62145] Avg episode reward: [(0, '827.868')] [2023-03-06 21:45:52,984][62475] Updated weights for policy 0, policy_version 35240 (0.0006) [2023-03-06 21:45:53,773][62475] Updated weights for policy 0, policy_version 35250 (0.0005) [2023-03-06 21:45:54,573][62475] Updated weights for policy 0, policy_version 35260 (0.0006) [2023-03-06 21:45:55,378][62475] Updated weights for policy 0, policy_version 35270 (0.0006) [2023-03-06 21:45:56,186][62475] Updated weights for policy 0, policy_version 35280 (0.0006) [2023-03-06 21:45:56,978][62475] Updated weights for policy 0, policy_version 35290 (0.0006) [2023-03-06 21:45:57,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12708.0). Total num frames: 36142080. Throughput: 0: 12716.8. Samples: 36111777. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:45:57,390][62145] Avg episode reward: [(0, '832.527')] [2023-03-06 21:45:57,783][62475] Updated weights for policy 0, policy_version 35300 (0.0006) [2023-03-06 21:45:58,582][62475] Updated weights for policy 0, policy_version 35310 (0.0006) [2023-03-06 21:45:59,397][62475] Updated weights for policy 0, policy_version 35320 (0.0006) [2023-03-06 21:46:00,194][62475] Updated weights for policy 0, policy_version 35330 (0.0006) [2023-03-06 21:46:00,996][62475] Updated weights for policy 0, policy_version 35340 (0.0006) [2023-03-06 21:46:01,804][62475] Updated weights for policy 0, policy_version 35350 (0.0006) [2023-03-06 21:46:02,389][62145] Fps is (10 sec: 12800.2, 60 sec: 12714.7, 300 sec: 12711.5). Total num frames: 36205568. Throughput: 0: 12733.6. Samples: 36188410. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:46:02,390][62145] Avg episode reward: [(0, '772.783')] [2023-03-06 21:46:02,593][62475] Updated weights for policy 0, policy_version 35360 (0.0006) [2023-03-06 21:46:03,426][62475] Updated weights for policy 0, policy_version 35370 (0.0006) [2023-03-06 21:46:04,218][62475] Updated weights for policy 0, policy_version 35380 (0.0006) [2023-03-06 21:46:05,016][62475] Updated weights for policy 0, policy_version 35390 (0.0006) [2023-03-06 21:46:05,827][62475] Updated weights for policy 0, policy_version 35400 (0.0006) [2023-03-06 21:46:06,602][62475] Updated weights for policy 0, policy_version 35410 (0.0006) [2023-03-06 21:46:07,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12711.5). Total num frames: 36269056. Throughput: 0: 12735.0. Samples: 36264927. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:46:07,390][62145] Avg episode reward: [(0, '704.917')] [2023-03-06 21:46:07,401][62475] Updated weights for policy 0, policy_version 35420 (0.0006) [2023-03-06 21:46:08,221][62475] Updated weights for policy 0, policy_version 35430 (0.0006) [2023-03-06 21:46:09,019][62475] Updated weights for policy 0, policy_version 35440 (0.0006) [2023-03-06 21:46:09,839][62475] Updated weights for policy 0, policy_version 35450 (0.0007) [2023-03-06 21:46:10,646][62475] Updated weights for policy 0, policy_version 35460 (0.0006) [2023-03-06 21:46:11,448][62475] Updated weights for policy 0, policy_version 35470 (0.0007) [2023-03-06 21:46:12,266][62475] Updated weights for policy 0, policy_version 35480 (0.0006) [2023-03-06 21:46:12,390][62145] Fps is (10 sec: 12697.4, 60 sec: 12731.7, 300 sec: 12708.0). Total num frames: 36332544. Throughput: 0: 12731.9. Samples: 36302923. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:46:12,390][62145] Avg episode reward: [(0, '786.773')] [2023-03-06 21:46:13,061][62475] Updated weights for policy 0, policy_version 35490 (0.0007) [2023-03-06 21:46:13,876][62475] Updated weights for policy 0, policy_version 35500 (0.0006) [2023-03-06 21:46:14,675][62475] Updated weights for policy 0, policy_version 35510 (0.0007) [2023-03-06 21:46:15,473][62475] Updated weights for policy 0, policy_version 35520 (0.0007) [2023-03-06 21:46:16,281][62475] Updated weights for policy 0, policy_version 35530 (0.0006) [2023-03-06 21:46:17,086][62475] Updated weights for policy 0, policy_version 35540 (0.0006) [2023-03-06 21:46:17,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12708.0). Total num frames: 36396032. Throughput: 0: 12725.7. Samples: 36379086. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:46:17,390][62145] Avg episode reward: [(0, '716.593')] [2023-03-06 21:46:17,893][62475] Updated weights for policy 0, policy_version 35550 (0.0006) [2023-03-06 21:46:18,724][62475] Updated weights for policy 0, policy_version 35560 (0.0007) [2023-03-06 21:46:19,532][62475] Updated weights for policy 0, policy_version 35570 (0.0005) [2023-03-06 21:46:20,329][62475] Updated weights for policy 0, policy_version 35580 (0.0006) [2023-03-06 21:46:21,142][62475] Updated weights for policy 0, policy_version 35590 (0.0007) [2023-03-06 21:46:21,936][62475] Updated weights for policy 0, policy_version 35600 (0.0007) [2023-03-06 21:46:22,390][62145] Fps is (10 sec: 12697.7, 60 sec: 12731.7, 300 sec: 12708.0). Total num frames: 36459520. Throughput: 0: 12729.4. Samples: 36455408. Policy #0 lag: (min: 0.0, avg: 1.2, max: 4.0) [2023-03-06 21:46:22,390][62145] Avg episode reward: [(0, '838.738')] [2023-03-06 21:46:22,394][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000035605_36459520.pth... [2023-03-06 21:46:22,426][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000032627_33410048.pth [2023-03-06 21:46:22,748][62475] Updated weights for policy 0, policy_version 35610 (0.0006) [2023-03-06 21:46:23,550][62475] Updated weights for policy 0, policy_version 35620 (0.0006) [2023-03-06 21:46:24,361][62475] Updated weights for policy 0, policy_version 35630 (0.0007) [2023-03-06 21:46:25,156][62475] Updated weights for policy 0, policy_version 35640 (0.0006) [2023-03-06 21:46:25,951][62475] Updated weights for policy 0, policy_version 35650 (0.0007) [2023-03-06 21:46:26,758][62475] Updated weights for policy 0, policy_version 35660 (0.0006) [2023-03-06 21:46:27,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12708.0). Total num frames: 36523008. Throughput: 0: 12729.7. Samples: 36493440. Policy #0 lag: (min: 0.0, avg: 1.2, max: 4.0) [2023-03-06 21:46:27,390][62145] Avg episode reward: [(0, '917.697')] [2023-03-06 21:46:27,553][62475] Updated weights for policy 0, policy_version 35670 (0.0006) [2023-03-06 21:46:28,361][62475] Updated weights for policy 0, policy_version 35680 (0.0006) [2023-03-06 21:46:29,150][62475] Updated weights for policy 0, policy_version 35690 (0.0006) [2023-03-06 21:46:29,956][62475] Updated weights for policy 0, policy_version 35700 (0.0006) [2023-03-06 21:46:30,767][62475] Updated weights for policy 0, policy_version 35710 (0.0007) [2023-03-06 21:46:31,563][62475] Updated weights for policy 0, policy_version 35720 (0.0006) [2023-03-06 21:46:32,369][62475] Updated weights for policy 0, policy_version 35730 (0.0007) [2023-03-06 21:46:32,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12731.7, 300 sec: 12711.5). Total num frames: 36587520. Throughput: 0: 12738.5. Samples: 36570189. Policy #0 lag: (min: 0.0, avg: 1.2, max: 4.0) [2023-03-06 21:46:32,390][62145] Avg episode reward: [(0, '777.451')] [2023-03-06 21:46:33,183][62475] Updated weights for policy 0, policy_version 35740 (0.0006) [2023-03-06 21:46:33,993][62475] Updated weights for policy 0, policy_version 35750 (0.0006) [2023-03-06 21:46:34,794][62475] Updated weights for policy 0, policy_version 35760 (0.0006) [2023-03-06 21:46:35,580][62475] Updated weights for policy 0, policy_version 35770 (0.0006) [2023-03-06 21:46:36,400][62475] Updated weights for policy 0, policy_version 35780 (0.0007) [2023-03-06 21:46:37,193][62475] Updated weights for policy 0, policy_version 35790 (0.0007) [2023-03-06 21:46:37,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.8, 300 sec: 12711.5). Total num frames: 36651008. Throughput: 0: 12731.5. Samples: 36646571. Policy #0 lag: (min: 0.0, avg: 1.2, max: 4.0) [2023-03-06 21:46:37,390][62145] Avg episode reward: [(0, '778.570')] [2023-03-06 21:46:37,997][62475] Updated weights for policy 0, policy_version 35800 (0.0006) [2023-03-06 21:46:38,817][62475] Updated weights for policy 0, policy_version 35810 (0.0006) [2023-03-06 21:46:39,619][62475] Updated weights for policy 0, policy_version 35820 (0.0007) [2023-03-06 21:46:40,434][62475] Updated weights for policy 0, policy_version 35830 (0.0006) [2023-03-06 21:46:41,240][62475] Updated weights for policy 0, policy_version 35840 (0.0007) [2023-03-06 21:46:42,054][62475] Updated weights for policy 0, policy_version 35850 (0.0006) [2023-03-06 21:46:42,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12711.5). Total num frames: 36714496. Throughput: 0: 12726.8. Samples: 36684483. Policy #0 lag: (min: 0.0, avg: 1.2, max: 4.0) [2023-03-06 21:46:42,390][62145] Avg episode reward: [(0, '786.209')] [2023-03-06 21:46:42,847][62475] Updated weights for policy 0, policy_version 35860 (0.0006) [2023-03-06 21:46:43,686][62475] Updated weights for policy 0, policy_version 35870 (0.0006) [2023-03-06 21:46:44,478][62475] Updated weights for policy 0, policy_version 35880 (0.0006) [2023-03-06 21:46:45,280][62475] Updated weights for policy 0, policy_version 35890 (0.0007) [2023-03-06 21:46:46,123][62475] Updated weights for policy 0, policy_version 35900 (0.0006) [2023-03-06 21:46:46,929][62475] Updated weights for policy 0, policy_version 35910 (0.0006) [2023-03-06 21:46:47,389][62145] Fps is (10 sec: 12595.2, 60 sec: 12714.7, 300 sec: 12708.0). Total num frames: 36776960. Throughput: 0: 12709.1. Samples: 36760321. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:46:47,390][62145] Avg episode reward: [(0, '865.737')] [2023-03-06 21:46:47,734][62475] Updated weights for policy 0, policy_version 35920 (0.0006) [2023-03-06 21:46:48,561][62475] Updated weights for policy 0, policy_version 35930 (0.0007) [2023-03-06 21:46:49,372][62475] Updated weights for policy 0, policy_version 35940 (0.0006) [2023-03-06 21:46:50,162][62475] Updated weights for policy 0, policy_version 35950 (0.0006) [2023-03-06 21:46:50,986][62475] Updated weights for policy 0, policy_version 35960 (0.0006) [2023-03-06 21:46:51,293][62424] KL-divergence is very high: 351.5344 [2023-03-06 21:46:51,785][62475] Updated weights for policy 0, policy_version 35970 (0.0006) [2023-03-06 21:46:52,389][62145] Fps is (10 sec: 12595.3, 60 sec: 12714.7, 300 sec: 12708.0). Total num frames: 36840448. Throughput: 0: 12696.2. Samples: 36836254. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:46:52,390][62145] Avg episode reward: [(0, '808.000')] [2023-03-06 21:46:52,582][62475] Updated weights for policy 0, policy_version 35980 (0.0006) [2023-03-06 21:46:53,379][62475] Updated weights for policy 0, policy_version 35990 (0.0007) [2023-03-06 21:46:54,207][62475] Updated weights for policy 0, policy_version 36000 (0.0006) [2023-03-06 21:46:55,003][62475] Updated weights for policy 0, policy_version 36010 (0.0006) [2023-03-06 21:46:55,807][62475] Updated weights for policy 0, policy_version 36020 (0.0006) [2023-03-06 21:46:56,603][62475] Updated weights for policy 0, policy_version 36030 (0.0006) [2023-03-06 21:46:57,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12708.0). Total num frames: 36903936. Throughput: 0: 12697.9. Samples: 36874327. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:46:57,390][62145] Avg episode reward: [(0, '823.225')] [2023-03-06 21:46:57,426][62475] Updated weights for policy 0, policy_version 36040 (0.0007) [2023-03-06 21:46:58,225][62475] Updated weights for policy 0, policy_version 36050 (0.0006) [2023-03-06 21:46:59,030][62475] Updated weights for policy 0, policy_version 36060 (0.0006) [2023-03-06 21:46:59,844][62475] Updated weights for policy 0, policy_version 36070 (0.0006) [2023-03-06 21:47:00,642][62475] Updated weights for policy 0, policy_version 36080 (0.0006) [2023-03-06 21:47:01,462][62475] Updated weights for policy 0, policy_version 36090 (0.0006) [2023-03-06 21:47:02,273][62475] Updated weights for policy 0, policy_version 36100 (0.0007) [2023-03-06 21:47:02,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12708.0). Total num frames: 36967424. Throughput: 0: 12697.3. Samples: 36950465. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:47:02,390][62145] Avg episode reward: [(0, '613.888')] [2023-03-06 21:47:03,076][62475] Updated weights for policy 0, policy_version 36110 (0.0006) [2023-03-06 21:47:03,898][62475] Updated weights for policy 0, policy_version 36120 (0.0006) [2023-03-06 21:47:04,692][62475] Updated weights for policy 0, policy_version 36130 (0.0006) [2023-03-06 21:47:05,498][62475] Updated weights for policy 0, policy_version 36140 (0.0006) [2023-03-06 21:47:06,307][62475] Updated weights for policy 0, policy_version 36150 (0.0006) [2023-03-06 21:47:07,109][62475] Updated weights for policy 0, policy_version 36160 (0.0007) [2023-03-06 21:47:07,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12708.0). Total num frames: 37030912. Throughput: 0: 12692.3. Samples: 37026560. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:47:07,390][62145] Avg episode reward: [(0, '507.602')] [2023-03-06 21:47:07,927][62475] Updated weights for policy 0, policy_version 36170 (0.0006) [2023-03-06 21:47:08,719][62475] Updated weights for policy 0, policy_version 36180 (0.0006) [2023-03-06 21:47:09,516][62475] Updated weights for policy 0, policy_version 36190 (0.0007) [2023-03-06 21:47:10,345][62475] Updated weights for policy 0, policy_version 36200 (0.0007) [2023-03-06 21:47:11,170][62475] Updated weights for policy 0, policy_version 36210 (0.0006) [2023-03-06 21:47:11,970][62475] Updated weights for policy 0, policy_version 36220 (0.0006) [2023-03-06 21:47:12,390][62145] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12708.0). Total num frames: 37094400. Throughput: 0: 12694.3. Samples: 37064682. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:47:12,390][62145] Avg episode reward: [(0, '602.523')] [2023-03-06 21:47:12,779][62475] Updated weights for policy 0, policy_version 36230 (0.0006) [2023-03-06 21:47:13,564][62475] Updated weights for policy 0, policy_version 36240 (0.0007) [2023-03-06 21:47:14,390][62475] Updated weights for policy 0, policy_version 36250 (0.0006) [2023-03-06 21:47:15,182][62475] Updated weights for policy 0, policy_version 36260 (0.0007) [2023-03-06 21:47:16,000][62475] Updated weights for policy 0, policy_version 36270 (0.0006) [2023-03-06 21:47:16,809][62475] Updated weights for policy 0, policy_version 36280 (0.0007) [2023-03-06 21:47:17,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12708.0). Total num frames: 37157888. Throughput: 0: 12677.6. Samples: 37140679. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:47:17,390][62145] Avg episode reward: [(0, '696.356')] [2023-03-06 21:47:17,611][62475] Updated weights for policy 0, policy_version 36290 (0.0006) [2023-03-06 21:47:18,407][62475] Updated weights for policy 0, policy_version 36300 (0.0006) [2023-03-06 21:47:19,214][62475] Updated weights for policy 0, policy_version 36310 (0.0006) [2023-03-06 21:47:20,019][62475] Updated weights for policy 0, policy_version 36320 (0.0006) [2023-03-06 21:47:20,813][62475] Updated weights for policy 0, policy_version 36330 (0.0006) [2023-03-06 21:47:21,617][62475] Updated weights for policy 0, policy_version 36340 (0.0006) [2023-03-06 21:47:22,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12708.0). Total num frames: 37221376. Throughput: 0: 12680.5. Samples: 37217196. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:47:22,390][62145] Avg episode reward: [(0, '653.667')] [2023-03-06 21:47:22,423][62475] Updated weights for policy 0, policy_version 36350 (0.0006) [2023-03-06 21:47:23,230][62424] KL-divergence is very high: 10884.4746 [2023-03-06 21:47:23,237][62475] Updated weights for policy 0, policy_version 36360 (0.0006) [2023-03-06 21:47:24,034][62475] Updated weights for policy 0, policy_version 36370 (0.0006) [2023-03-06 21:47:24,834][62475] Updated weights for policy 0, policy_version 36380 (0.0006) [2023-03-06 21:47:25,674][62475] Updated weights for policy 0, policy_version 36390 (0.0006) [2023-03-06 21:47:26,474][62475] Updated weights for policy 0, policy_version 36400 (0.0007) [2023-03-06 21:47:27,288][62475] Updated weights for policy 0, policy_version 36410 (0.0005) [2023-03-06 21:47:27,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12708.0). Total num frames: 37284864. Throughput: 0: 12685.2. Samples: 37255317. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:47:27,390][62145] Avg episode reward: [(0, '490.823')] [2023-03-06 21:47:28,091][62475] Updated weights for policy 0, policy_version 36420 (0.0007) [2023-03-06 21:47:28,894][62475] Updated weights for policy 0, policy_version 36430 (0.0006) [2023-03-06 21:47:29,690][62475] Updated weights for policy 0, policy_version 36440 (0.0006) [2023-03-06 21:47:30,487][62475] Updated weights for policy 0, policy_version 36450 (0.0006) [2023-03-06 21:47:31,295][62475] Updated weights for policy 0, policy_version 36460 (0.0006) [2023-03-06 21:47:32,090][62475] Updated weights for policy 0, policy_version 36470 (0.0006) [2023-03-06 21:47:32,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12708.0). Total num frames: 37348352. Throughput: 0: 12691.3. Samples: 37331428. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:47:32,390][62145] Avg episode reward: [(0, '696.689')] [2023-03-06 21:47:32,897][62475] Updated weights for policy 0, policy_version 36480 (0.0006) [2023-03-06 21:47:33,708][62475] Updated weights for policy 0, policy_version 36490 (0.0007) [2023-03-06 21:47:34,513][62475] Updated weights for policy 0, policy_version 36500 (0.0006) [2023-03-06 21:47:35,320][62475] Updated weights for policy 0, policy_version 36510 (0.0006) [2023-03-06 21:47:36,122][62475] Updated weights for policy 0, policy_version 36520 (0.0007) [2023-03-06 21:47:36,927][62475] Updated weights for policy 0, policy_version 36530 (0.0007) [2023-03-06 21:47:37,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12680.5, 300 sec: 12708.0). Total num frames: 37411840. Throughput: 0: 12699.9. Samples: 37407751. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:47:37,390][62145] Avg episode reward: [(0, '810.706')] [2023-03-06 21:47:37,747][62475] Updated weights for policy 0, policy_version 36540 (0.0005) [2023-03-06 21:47:38,553][62475] Updated weights for policy 0, policy_version 36550 (0.0007) [2023-03-06 21:47:39,370][62475] Updated weights for policy 0, policy_version 36560 (0.0007) [2023-03-06 21:47:40,160][62475] Updated weights for policy 0, policy_version 36570 (0.0008) [2023-03-06 21:47:40,956][62475] Updated weights for policy 0, policy_version 36580 (0.0006) [2023-03-06 21:47:41,789][62475] Updated weights for policy 0, policy_version 36590 (0.0006) [2023-03-06 21:47:42,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12708.0). Total num frames: 37475328. Throughput: 0: 12697.9. Samples: 37445734. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:47:42,390][62145] Avg episode reward: [(0, '667.465')] [2023-03-06 21:47:42,581][62475] Updated weights for policy 0, policy_version 36600 (0.0006) [2023-03-06 21:47:43,375][62475] Updated weights for policy 0, policy_version 36610 (0.0006) [2023-03-06 21:47:44,193][62475] Updated weights for policy 0, policy_version 36620 (0.0007) [2023-03-06 21:47:44,998][62475] Updated weights for policy 0, policy_version 36630 (0.0006) [2023-03-06 21:47:45,814][62475] Updated weights for policy 0, policy_version 36640 (0.0006) [2023-03-06 21:47:46,629][62475] Updated weights for policy 0, policy_version 36650 (0.0006) [2023-03-06 21:47:47,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12708.0). Total num frames: 37538816. Throughput: 0: 12698.3. Samples: 37521890. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:47:47,390][62145] Avg episode reward: [(0, '854.202')] [2023-03-06 21:47:47,438][62475] Updated weights for policy 0, policy_version 36660 (0.0007) [2023-03-06 21:47:48,241][62475] Updated weights for policy 0, policy_version 36670 (0.0007) [2023-03-06 21:47:49,040][62475] Updated weights for policy 0, policy_version 36680 (0.0005) [2023-03-06 21:47:49,858][62475] Updated weights for policy 0, policy_version 36690 (0.0006) [2023-03-06 21:47:50,655][62475] Updated weights for policy 0, policy_version 36700 (0.0006) [2023-03-06 21:47:51,454][62475] Updated weights for policy 0, policy_version 36710 (0.0006) [2023-03-06 21:47:52,279][62475] Updated weights for policy 0, policy_version 36720 (0.0007) [2023-03-06 21:47:52,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12708.0). Total num frames: 37602304. Throughput: 0: 12699.9. Samples: 37598056. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:47:52,390][62145] Avg episode reward: [(0, '744.629')] [2023-03-06 21:47:53,093][62475] Updated weights for policy 0, policy_version 36730 (0.0006) [2023-03-06 21:47:53,902][62475] Updated weights for policy 0, policy_version 36740 (0.0006) [2023-03-06 21:47:54,677][62475] Updated weights for policy 0, policy_version 36750 (0.0006) [2023-03-06 21:47:55,499][62475] Updated weights for policy 0, policy_version 36760 (0.0006) [2023-03-06 21:47:56,293][62475] Updated weights for policy 0, policy_version 36770 (0.0007) [2023-03-06 21:47:57,089][62475] Updated weights for policy 0, policy_version 36780 (0.0006) [2023-03-06 21:47:57,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12708.0). Total num frames: 37665792. Throughput: 0: 12700.1. Samples: 37636185. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:47:57,390][62145] Avg episode reward: [(0, '893.091')] [2023-03-06 21:47:57,902][62475] Updated weights for policy 0, policy_version 36790 (0.0006) [2023-03-06 21:47:58,700][62475] Updated weights for policy 0, policy_version 36800 (0.0006) [2023-03-06 21:47:59,495][62475] Updated weights for policy 0, policy_version 36810 (0.0006) [2023-03-06 21:48:00,312][62475] Updated weights for policy 0, policy_version 36820 (0.0007) [2023-03-06 21:48:01,094][62475] Updated weights for policy 0, policy_version 36830 (0.0006) [2023-03-06 21:48:01,928][62475] Updated weights for policy 0, policy_version 36840 (0.0006) [2023-03-06 21:48:02,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12708.0). Total num frames: 37729280. Throughput: 0: 12710.2. Samples: 37712640. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:48:02,390][62145] Avg episode reward: [(0, '895.782')] [2023-03-06 21:48:02,721][62475] Updated weights for policy 0, policy_version 36850 (0.0006) [2023-03-06 21:48:03,537][62475] Updated weights for policy 0, policy_version 36860 (0.0006) [2023-03-06 21:48:04,337][62475] Updated weights for policy 0, policy_version 36870 (0.0006) [2023-03-06 21:48:05,138][62475] Updated weights for policy 0, policy_version 36880 (0.0006) [2023-03-06 21:48:05,954][62475] Updated weights for policy 0, policy_version 36890 (0.0006) [2023-03-06 21:48:06,745][62475] Updated weights for policy 0, policy_version 36900 (0.0006) [2023-03-06 21:48:07,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12714.7, 300 sec: 12711.5). Total num frames: 37793792. Throughput: 0: 12704.7. Samples: 37788906. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:48:07,390][62145] Avg episode reward: [(0, '1053.738')] [2023-03-06 21:48:07,548][62475] Updated weights for policy 0, policy_version 36910 (0.0006) [2023-03-06 21:48:08,353][62475] Updated weights for policy 0, policy_version 36920 (0.0006) [2023-03-06 21:48:09,147][62475] Updated weights for policy 0, policy_version 36930 (0.0006) [2023-03-06 21:48:09,944][62475] Updated weights for policy 0, policy_version 36940 (0.0006) [2023-03-06 21:48:10,763][62475] Updated weights for policy 0, policy_version 36950 (0.0006) [2023-03-06 21:48:11,576][62475] Updated weights for policy 0, policy_version 36960 (0.0006) [2023-03-06 21:48:12,383][62475] Updated weights for policy 0, policy_version 36970 (0.0006) [2023-03-06 21:48:12,390][62145] Fps is (10 sec: 12800.1, 60 sec: 12714.7, 300 sec: 12711.5). Total num frames: 37857280. Throughput: 0: 12711.7. Samples: 37827342. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:48:12,390][62145] Avg episode reward: [(0, '939.666')] [2023-03-06 21:48:13,188][62475] Updated weights for policy 0, policy_version 36980 (0.0006) [2023-03-06 21:48:14,005][62475] Updated weights for policy 0, policy_version 36990 (0.0006) [2023-03-06 21:48:14,796][62475] Updated weights for policy 0, policy_version 37000 (0.0006) [2023-03-06 21:48:15,618][62475] Updated weights for policy 0, policy_version 37010 (0.0006) [2023-03-06 21:48:16,423][62475] Updated weights for policy 0, policy_version 37020 (0.0007) [2023-03-06 21:48:17,231][62475] Updated weights for policy 0, policy_version 37030 (0.0006) [2023-03-06 21:48:17,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12708.0). Total num frames: 37920768. Throughput: 0: 12709.5. Samples: 37903356. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:48:17,390][62145] Avg episode reward: [(0, '1114.688')] [2023-03-06 21:48:18,042][62475] Updated weights for policy 0, policy_version 37040 (0.0007) [2023-03-06 21:48:18,835][62475] Updated weights for policy 0, policy_version 37050 (0.0006) [2023-03-06 21:48:19,650][62475] Updated weights for policy 0, policy_version 37060 (0.0006) [2023-03-06 21:48:20,449][62475] Updated weights for policy 0, policy_version 37070 (0.0006) [2023-03-06 21:48:21,264][62475] Updated weights for policy 0, policy_version 37080 (0.0007) [2023-03-06 21:48:22,072][62475] Updated weights for policy 0, policy_version 37090 (0.0006) [2023-03-06 21:48:22,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12708.0). Total num frames: 37984256. Throughput: 0: 12704.9. Samples: 37979474. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:48:22,390][62145] Avg episode reward: [(0, '992.426')] [2023-03-06 21:48:22,394][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000037094_37984256.pth... [2023-03-06 21:48:22,425][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000034116_34934784.pth [2023-03-06 21:48:22,847][62475] Updated weights for policy 0, policy_version 37100 (0.0007) [2023-03-06 21:48:23,677][62475] Updated weights for policy 0, policy_version 37110 (0.0006) [2023-03-06 21:48:24,466][62475] Updated weights for policy 0, policy_version 37120 (0.0006) [2023-03-06 21:48:25,279][62475] Updated weights for policy 0, policy_version 37130 (0.0007) [2023-03-06 21:48:26,097][62475] Updated weights for policy 0, policy_version 37140 (0.0006) [2023-03-06 21:48:26,897][62475] Updated weights for policy 0, policy_version 37150 (0.0006) [2023-03-06 21:48:27,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12708.0). Total num frames: 38047744. Throughput: 0: 12709.8. Samples: 38017673. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:48:27,390][62145] Avg episode reward: [(0, '1012.804')] [2023-03-06 21:48:27,701][62475] Updated weights for policy 0, policy_version 37160 (0.0005) [2023-03-06 21:48:28,502][62475] Updated weights for policy 0, policy_version 37170 (0.0006) [2023-03-06 21:48:29,307][62475] Updated weights for policy 0, policy_version 37180 (0.0006) [2023-03-06 21:48:30,113][62475] Updated weights for policy 0, policy_version 37190 (0.0006) [2023-03-06 21:48:30,925][62475] Updated weights for policy 0, policy_version 37200 (0.0006) [2023-03-06 21:48:31,709][62475] Updated weights for policy 0, policy_version 37210 (0.0007) [2023-03-06 21:48:32,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12708.0). Total num frames: 38111232. Throughput: 0: 12713.0. Samples: 38093975. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:48:32,390][62145] Avg episode reward: [(0, '812.763')] [2023-03-06 21:48:32,532][62475] Updated weights for policy 0, policy_version 37220 (0.0006) [2023-03-06 21:48:33,328][62475] Updated weights for policy 0, policy_version 37230 (0.0008) [2023-03-06 21:48:34,150][62475] Updated weights for policy 0, policy_version 37240 (0.0007) [2023-03-06 21:48:34,953][62475] Updated weights for policy 0, policy_version 37250 (0.0006) [2023-03-06 21:48:35,754][62475] Updated weights for policy 0, policy_version 37260 (0.0007) [2023-03-06 21:48:36,547][62475] Updated weights for policy 0, policy_version 37270 (0.0006) [2023-03-06 21:48:37,373][62475] Updated weights for policy 0, policy_version 37280 (0.0006) [2023-03-06 21:48:37,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12708.0). Total num frames: 38174720. Throughput: 0: 12716.1. Samples: 38170280. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:48:37,390][62145] Avg episode reward: [(0, '1112.726')] [2023-03-06 21:48:38,182][62475] Updated weights for policy 0, policy_version 37290 (0.0006) [2023-03-06 21:48:38,976][62475] Updated weights for policy 0, policy_version 37300 (0.0006) [2023-03-06 21:48:39,774][62475] Updated weights for policy 0, policy_version 37310 (0.0006) [2023-03-06 21:48:40,573][62475] Updated weights for policy 0, policy_version 37320 (0.0006) [2023-03-06 21:48:41,378][62475] Updated weights for policy 0, policy_version 37330 (0.0006) [2023-03-06 21:48:42,183][62475] Updated weights for policy 0, policy_version 37340 (0.0006) [2023-03-06 21:48:42,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12708.0). Total num frames: 38238208. Throughput: 0: 12714.4. Samples: 38208333. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:48:42,390][62145] Avg episode reward: [(0, '946.580')] [2023-03-06 21:48:43,011][62475] Updated weights for policy 0, policy_version 37350 (0.0006) [2023-03-06 21:48:43,795][62475] Updated weights for policy 0, policy_version 37360 (0.0006) [2023-03-06 21:48:44,608][62475] Updated weights for policy 0, policy_version 37370 (0.0005) [2023-03-06 21:48:45,404][62475] Updated weights for policy 0, policy_version 37380 (0.0007) [2023-03-06 21:48:46,217][62475] Updated weights for policy 0, policy_version 37390 (0.0007) [2023-03-06 21:48:47,046][62475] Updated weights for policy 0, policy_version 37400 (0.0006) [2023-03-06 21:48:47,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12708.0). Total num frames: 38301696. Throughput: 0: 12710.3. Samples: 38284602. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:48:47,400][62145] Avg episode reward: [(0, '966.513')] [2023-03-06 21:48:47,862][62475] Updated weights for policy 0, policy_version 37410 (0.0007) [2023-03-06 21:48:48,662][62475] Updated weights for policy 0, policy_version 37420 (0.0006) [2023-03-06 21:48:49,462][62475] Updated weights for policy 0, policy_version 37430 (0.0006) [2023-03-06 21:48:50,278][62475] Updated weights for policy 0, policy_version 37440 (0.0006) [2023-03-06 21:48:51,080][62475] Updated weights for policy 0, policy_version 37450 (0.0006) [2023-03-06 21:48:51,894][62475] Updated weights for policy 0, policy_version 37460 (0.0007) [2023-03-06 21:48:52,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12708.0). Total num frames: 38365184. Throughput: 0: 12708.2. Samples: 38360775. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:48:52,401][62145] Avg episode reward: [(0, '900.358')] [2023-03-06 21:48:52,689][62475] Updated weights for policy 0, policy_version 37470 (0.0006) [2023-03-06 21:48:53,486][62475] Updated weights for policy 0, policy_version 37480 (0.0006) [2023-03-06 21:48:54,270][62475] Updated weights for policy 0, policy_version 37490 (0.0007) [2023-03-06 21:48:55,076][62475] Updated weights for policy 0, policy_version 37500 (0.0006) [2023-03-06 21:48:55,901][62475] Updated weights for policy 0, policy_version 37510 (0.0006) [2023-03-06 21:48:56,693][62475] Updated weights for policy 0, policy_version 37520 (0.0006) [2023-03-06 21:48:57,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12704.5). Total num frames: 38428672. Throughput: 0: 12709.0. Samples: 38399246. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:48:57,390][62145] Avg episode reward: [(0, '835.893')] [2023-03-06 21:48:57,499][62475] Updated weights for policy 0, policy_version 37530 (0.0006) [2023-03-06 21:48:58,310][62475] Updated weights for policy 0, policy_version 37540 (0.0007) [2023-03-06 21:48:59,114][62475] Updated weights for policy 0, policy_version 37550 (0.0006) [2023-03-06 21:48:59,937][62475] Updated weights for policy 0, policy_version 37560 (0.0006) [2023-03-06 21:49:00,730][62475] Updated weights for policy 0, policy_version 37570 (0.0006) [2023-03-06 21:49:01,533][62475] Updated weights for policy 0, policy_version 37580 (0.0006) [2023-03-06 21:49:02,342][62475] Updated weights for policy 0, policy_version 37590 (0.0006) [2023-03-06 21:49:02,389][62145] Fps is (10 sec: 12697.8, 60 sec: 12714.7, 300 sec: 12704.5). Total num frames: 38492160. Throughput: 0: 12706.0. Samples: 38475126. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:49:02,390][62145] Avg episode reward: [(0, '1087.628')] [2023-03-06 21:49:03,130][62475] Updated weights for policy 0, policy_version 37600 (0.0006) [2023-03-06 21:49:03,925][62475] Updated weights for policy 0, policy_version 37610 (0.0006) [2023-03-06 21:49:04,753][62475] Updated weights for policy 0, policy_version 37620 (0.0006) [2023-03-06 21:49:05,565][62475] Updated weights for policy 0, policy_version 37630 (0.0007) [2023-03-06 21:49:06,369][62475] Updated weights for policy 0, policy_version 37640 (0.0006) [2023-03-06 21:49:07,164][62475] Updated weights for policy 0, policy_version 37650 (0.0006) [2023-03-06 21:49:07,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12704.5). Total num frames: 38555648. Throughput: 0: 12717.4. Samples: 38551756. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:49:07,390][62145] Avg episode reward: [(0, '994.545')] [2023-03-06 21:49:07,947][62475] Updated weights for policy 0, policy_version 37660 (0.0007) [2023-03-06 21:49:08,778][62475] Updated weights for policy 0, policy_version 37670 (0.0006) [2023-03-06 21:49:09,558][62475] Updated weights for policy 0, policy_version 37680 (0.0006) [2023-03-06 21:49:10,388][62475] Updated weights for policy 0, policy_version 37690 (0.0007) [2023-03-06 21:49:11,210][62475] Updated weights for policy 0, policy_version 37700 (0.0006) [2023-03-06 21:49:11,986][62475] Updated weights for policy 0, policy_version 37710 (0.0006) [2023-03-06 21:49:12,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12714.7, 300 sec: 12708.0). Total num frames: 38620160. Throughput: 0: 12714.2. Samples: 38589813. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:49:12,390][62145] Avg episode reward: [(0, '942.130')] [2023-03-06 21:49:12,797][62475] Updated weights for policy 0, policy_version 37720 (0.0006) [2023-03-06 21:49:13,619][62475] Updated weights for policy 0, policy_version 37730 (0.0006) [2023-03-06 21:49:14,401][62475] Updated weights for policy 0, policy_version 37740 (0.0006) [2023-03-06 21:49:15,209][62475] Updated weights for policy 0, policy_version 37750 (0.0007) [2023-03-06 21:49:16,012][62475] Updated weights for policy 0, policy_version 37760 (0.0006) [2023-03-06 21:49:16,809][62475] Updated weights for policy 0, policy_version 37770 (0.0007) [2023-03-06 21:49:17,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12714.7, 300 sec: 12708.0). Total num frames: 38683648. Throughput: 0: 12715.6. Samples: 38666175. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:49:17,390][62145] Avg episode reward: [(0, '1216.972')] [2023-03-06 21:49:17,390][62424] Saving new best policy, reward=1216.972! [2023-03-06 21:49:17,633][62475] Updated weights for policy 0, policy_version 37780 (0.0006) [2023-03-06 21:49:18,421][62475] Updated weights for policy 0, policy_version 37790 (0.0006) [2023-03-06 21:49:19,237][62475] Updated weights for policy 0, policy_version 37800 (0.0006) [2023-03-06 21:49:20,041][62475] Updated weights for policy 0, policy_version 37810 (0.0006) [2023-03-06 21:49:20,863][62475] Updated weights for policy 0, policy_version 37820 (0.0006) [2023-03-06 21:49:21,670][62475] Updated weights for policy 0, policy_version 37830 (0.0006) [2023-03-06 21:49:22,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12708.0). Total num frames: 38747136. Throughput: 0: 12711.9. Samples: 38742314. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:49:22,390][62145] Avg episode reward: [(0, '1018.069')] [2023-03-06 21:49:22,491][62475] Updated weights for policy 0, policy_version 37840 (0.0008) [2023-03-06 21:49:23,296][62475] Updated weights for policy 0, policy_version 37850 (0.0006) [2023-03-06 21:49:24,126][62475] Updated weights for policy 0, policy_version 37860 (0.0006) [2023-03-06 21:49:24,929][62475] Updated weights for policy 0, policy_version 37870 (0.0006) [2023-03-06 21:49:25,759][62475] Updated weights for policy 0, policy_version 37880 (0.0006) [2023-03-06 21:49:26,546][62475] Updated weights for policy 0, policy_version 37890 (0.0006) [2023-03-06 21:49:27,364][62475] Updated weights for policy 0, policy_version 37900 (0.0007) [2023-03-06 21:49:27,390][62145] Fps is (10 sec: 12595.1, 60 sec: 12697.6, 300 sec: 12704.5). Total num frames: 38809600. Throughput: 0: 12703.5. Samples: 38779990. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:49:27,390][62145] Avg episode reward: [(0, '1068.700')] [2023-03-06 21:49:28,161][62475] Updated weights for policy 0, policy_version 37910 (0.0006) [2023-03-06 21:49:28,974][62475] Updated weights for policy 0, policy_version 37920 (0.0006) [2023-03-06 21:49:29,780][62475] Updated weights for policy 0, policy_version 37930 (0.0006) [2023-03-06 21:49:30,576][62475] Updated weights for policy 0, policy_version 37940 (0.0007) [2023-03-06 21:49:31,397][62475] Updated weights for policy 0, policy_version 37950 (0.0007) [2023-03-06 21:49:32,193][62475] Updated weights for policy 0, policy_version 37960 (0.0006) [2023-03-06 21:49:32,390][62145] Fps is (10 sec: 12595.0, 60 sec: 12697.6, 300 sec: 12708.0). Total num frames: 38873088. Throughput: 0: 12695.9. Samples: 38855918. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:49:32,390][62145] Avg episode reward: [(0, '891.959')] [2023-03-06 21:49:32,986][62475] Updated weights for policy 0, policy_version 37970 (0.0006) [2023-03-06 21:49:33,805][62475] Updated weights for policy 0, policy_version 37980 (0.0006) [2023-03-06 21:49:34,588][62475] Updated weights for policy 0, policy_version 37990 (0.0007) [2023-03-06 21:49:35,407][62475] Updated weights for policy 0, policy_version 38000 (0.0006) [2023-03-06 21:49:36,221][62475] Updated weights for policy 0, policy_version 38010 (0.0006) [2023-03-06 21:49:37,023][62475] Updated weights for policy 0, policy_version 38020 (0.0006) [2023-03-06 21:49:37,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12708.0). Total num frames: 38936576. Throughput: 0: 12704.1. Samples: 38932459. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:49:37,390][62145] Avg episode reward: [(0, '857.314')] [2023-03-06 21:49:37,830][62475] Updated weights for policy 0, policy_version 38030 (0.0006) [2023-03-06 21:49:38,648][62475] Updated weights for policy 0, policy_version 38040 (0.0009) [2023-03-06 21:49:39,451][62475] Updated weights for policy 0, policy_version 38050 (0.0006) [2023-03-06 21:49:40,270][62475] Updated weights for policy 0, policy_version 38060 (0.0006) [2023-03-06 21:49:41,070][62475] Updated weights for policy 0, policy_version 38070 (0.0006) [2023-03-06 21:49:41,885][62475] Updated weights for policy 0, policy_version 38080 (0.0005) [2023-03-06 21:49:42,389][62145] Fps is (10 sec: 12697.9, 60 sec: 12697.6, 300 sec: 12708.0). Total num frames: 39000064. Throughput: 0: 12690.1. Samples: 38970298. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:49:42,390][62145] Avg episode reward: [(0, '998.918')] [2023-03-06 21:49:42,679][62475] Updated weights for policy 0, policy_version 38090 (0.0007) [2023-03-06 21:49:43,477][62475] Updated weights for policy 0, policy_version 38100 (0.0007) [2023-03-06 21:49:44,266][62475] Updated weights for policy 0, policy_version 38110 (0.0005) [2023-03-06 21:49:45,081][62475] Updated weights for policy 0, policy_version 38120 (0.0006) [2023-03-06 21:49:45,890][62475] Updated weights for policy 0, policy_version 38130 (0.0006) [2023-03-06 21:49:46,680][62475] Updated weights for policy 0, policy_version 38140 (0.0006) [2023-03-06 21:49:47,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12708.0). Total num frames: 39063552. Throughput: 0: 12699.8. Samples: 39046620. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:49:47,390][62145] Avg episode reward: [(0, '906.504')] [2023-03-06 21:49:47,491][62475] Updated weights for policy 0, policy_version 38150 (0.0006) [2023-03-06 21:49:48,301][62475] Updated weights for policy 0, policy_version 38160 (0.0006) [2023-03-06 21:49:49,096][62475] Updated weights for policy 0, policy_version 38170 (0.0006) [2023-03-06 21:49:49,904][62475] Updated weights for policy 0, policy_version 38180 (0.0007) [2023-03-06 21:49:50,725][62475] Updated weights for policy 0, policy_version 38190 (0.0006) [2023-03-06 21:49:51,535][62475] Updated weights for policy 0, policy_version 38200 (0.0006) [2023-03-06 21:49:52,331][62475] Updated weights for policy 0, policy_version 38210 (0.0006) [2023-03-06 21:49:52,390][62145] Fps is (10 sec: 12697.3, 60 sec: 12697.6, 300 sec: 12708.0). Total num frames: 39127040. Throughput: 0: 12689.0. Samples: 39122763. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:49:52,390][62145] Avg episode reward: [(0, '741.515')] [2023-03-06 21:49:53,156][62475] Updated weights for policy 0, policy_version 38220 (0.0006) [2023-03-06 21:49:53,978][62475] Updated weights for policy 0, policy_version 38230 (0.0006) [2023-03-06 21:49:54,759][62475] Updated weights for policy 0, policy_version 38240 (0.0007) [2023-03-06 21:49:55,580][62475] Updated weights for policy 0, policy_version 38250 (0.0006) [2023-03-06 21:49:56,394][62475] Updated weights for policy 0, policy_version 38260 (0.0006) [2023-03-06 21:49:57,181][62475] Updated weights for policy 0, policy_version 38270 (0.0006) [2023-03-06 21:49:57,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12704.5). Total num frames: 39190528. Throughput: 0: 12690.7. Samples: 39160895. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:49:57,390][62145] Avg episode reward: [(0, '905.918')] [2023-03-06 21:49:57,982][62475] Updated weights for policy 0, policy_version 38280 (0.0006) [2023-03-06 21:49:58,774][62475] Updated weights for policy 0, policy_version 38290 (0.0005) [2023-03-06 21:49:59,599][62475] Updated weights for policy 0, policy_version 38300 (0.0007) [2023-03-06 21:50:00,408][62475] Updated weights for policy 0, policy_version 38310 (0.0006) [2023-03-06 21:50:01,218][62475] Updated weights for policy 0, policy_version 38320 (0.0007) [2023-03-06 21:50:02,031][62475] Updated weights for policy 0, policy_version 38330 (0.0007) [2023-03-06 21:50:02,390][62145] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12704.5). Total num frames: 39254016. Throughput: 0: 12686.9. Samples: 39237086. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:50:02,390][62145] Avg episode reward: [(0, '831.046')] [2023-03-06 21:50:02,828][62475] Updated weights for policy 0, policy_version 38340 (0.0007) [2023-03-06 21:50:03,641][62475] Updated weights for policy 0, policy_version 38350 (0.0006) [2023-03-06 21:50:04,456][62475] Updated weights for policy 0, policy_version 38360 (0.0006) [2023-03-06 21:50:05,244][62475] Updated weights for policy 0, policy_version 38370 (0.0006) [2023-03-06 21:50:06,058][62475] Updated weights for policy 0, policy_version 38380 (0.0007) [2023-03-06 21:50:06,880][62475] Updated weights for policy 0, policy_version 38390 (0.0007) [2023-03-06 21:50:07,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12708.0). Total num frames: 39317504. Throughput: 0: 12684.6. Samples: 39313121. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:50:07,390][62145] Avg episode reward: [(0, '926.104')] [2023-03-06 21:50:07,673][62475] Updated weights for policy 0, policy_version 38400 (0.0006) [2023-03-06 21:50:08,494][62475] Updated weights for policy 0, policy_version 38410 (0.0007) [2023-03-06 21:50:09,292][62475] Updated weights for policy 0, policy_version 38420 (0.0006) [2023-03-06 21:50:10,102][62475] Updated weights for policy 0, policy_version 38430 (0.0006) [2023-03-06 21:50:10,914][62475] Updated weights for policy 0, policy_version 38440 (0.0007) [2023-03-06 21:50:11,716][62475] Updated weights for policy 0, policy_version 38450 (0.0006) [2023-03-06 21:50:12,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12708.0). Total num frames: 39380992. Throughput: 0: 12693.7. Samples: 39351206. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:50:12,390][62145] Avg episode reward: [(0, '1077.497')] [2023-03-06 21:50:12,532][62475] Updated weights for policy 0, policy_version 38460 (0.0006) [2023-03-06 21:50:13,341][62475] Updated weights for policy 0, policy_version 38470 (0.0006) [2023-03-06 21:50:14,126][62475] Updated weights for policy 0, policy_version 38480 (0.0006) [2023-03-06 21:50:14,949][62475] Updated weights for policy 0, policy_version 38490 (0.0006) [2023-03-06 21:50:15,739][62475] Updated weights for policy 0, policy_version 38500 (0.0006) [2023-03-06 21:50:16,569][62475] Updated weights for policy 0, policy_version 38510 (0.0006) [2023-03-06 21:50:17,366][62475] Updated weights for policy 0, policy_version 38520 (0.0007) [2023-03-06 21:50:17,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12708.0). Total num frames: 39444480. Throughput: 0: 12698.5. Samples: 39427351. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:50:17,390][62145] Avg episode reward: [(0, '927.318')] [2023-03-06 21:50:18,153][62475] Updated weights for policy 0, policy_version 38530 (0.0006) [2023-03-06 21:50:18,971][62475] Updated weights for policy 0, policy_version 38540 (0.0006) [2023-03-06 21:50:19,774][62475] Updated weights for policy 0, policy_version 38550 (0.0006) [2023-03-06 21:50:20,577][62475] Updated weights for policy 0, policy_version 38560 (0.0006) [2023-03-06 21:50:21,394][62475] Updated weights for policy 0, policy_version 38570 (0.0006) [2023-03-06 21:50:22,198][62475] Updated weights for policy 0, policy_version 38580 (0.0006) [2023-03-06 21:50:22,390][62145] Fps is (10 sec: 12697.7, 60 sec: 12680.5, 300 sec: 12708.0). Total num frames: 39507968. Throughput: 0: 12691.2. Samples: 39503563. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:50:22,390][62145] Avg episode reward: [(0, '998.039')] [2023-03-06 21:50:22,394][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000038582_39507968.pth... [2023-03-06 21:50:22,429][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000035605_36459520.pth [2023-03-06 21:50:23,006][62475] Updated weights for policy 0, policy_version 38590 (0.0007) [2023-03-06 21:50:23,809][62475] Updated weights for policy 0, policy_version 38600 (0.0006) [2023-03-06 21:50:24,603][62475] Updated weights for policy 0, policy_version 38610 (0.0007) [2023-03-06 21:50:25,411][62475] Updated weights for policy 0, policy_version 38620 (0.0006) [2023-03-06 21:50:26,211][62475] Updated weights for policy 0, policy_version 38630 (0.0005) [2023-03-06 21:50:27,008][62475] Updated weights for policy 0, policy_version 38640 (0.0006) [2023-03-06 21:50:27,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12704.5). Total num frames: 39571456. Throughput: 0: 12700.3. Samples: 39541814. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:50:27,390][62145] Avg episode reward: [(0, '758.663')] [2023-03-06 21:50:27,813][62475] Updated weights for policy 0, policy_version 38650 (0.0006) [2023-03-06 21:50:28,628][62475] Updated weights for policy 0, policy_version 38660 (0.0006) [2023-03-06 21:50:29,430][62475] Updated weights for policy 0, policy_version 38670 (0.0006) [2023-03-06 21:50:30,246][62475] Updated weights for policy 0, policy_version 38680 (0.0006) [2023-03-06 21:50:31,035][62475] Updated weights for policy 0, policy_version 38690 (0.0006) [2023-03-06 21:50:31,861][62475] Updated weights for policy 0, policy_version 38700 (0.0006) [2023-03-06 21:50:32,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12704.5). Total num frames: 39634944. Throughput: 0: 12698.1. Samples: 39618033. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:50:32,390][62145] Avg episode reward: [(0, '854.186')] [2023-03-06 21:50:32,658][62475] Updated weights for policy 0, policy_version 38710 (0.0006) [2023-03-06 21:50:33,476][62475] Updated weights for policy 0, policy_version 38720 (0.0006) [2023-03-06 21:50:34,297][62475] Updated weights for policy 0, policy_version 38730 (0.0006) [2023-03-06 21:50:35,080][62475] Updated weights for policy 0, policy_version 38740 (0.0006) [2023-03-06 21:50:35,886][62475] Updated weights for policy 0, policy_version 38750 (0.0006) [2023-03-06 21:50:36,699][62475] Updated weights for policy 0, policy_version 38760 (0.0006) [2023-03-06 21:50:37,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12704.5). Total num frames: 39698432. Throughput: 0: 12698.8. Samples: 39694207. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:50:37,390][62145] Avg episode reward: [(0, '964.488')] [2023-03-06 21:50:37,508][62475] Updated weights for policy 0, policy_version 38770 (0.0006) [2023-03-06 21:50:38,313][62475] Updated weights for policy 0, policy_version 38780 (0.0006) [2023-03-06 21:50:39,112][62475] Updated weights for policy 0, policy_version 38790 (0.0006) [2023-03-06 21:50:39,935][62475] Updated weights for policy 0, policy_version 38800 (0.0006) [2023-03-06 21:50:40,725][62475] Updated weights for policy 0, policy_version 38810 (0.0006) [2023-03-06 21:50:41,539][62475] Updated weights for policy 0, policy_version 38820 (0.0006) [2023-03-06 21:50:42,350][62475] Updated weights for policy 0, policy_version 38830 (0.0007) [2023-03-06 21:50:42,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12704.5). Total num frames: 39761920. Throughput: 0: 12699.1. Samples: 39732353. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:50:42,390][62145] Avg episode reward: [(0, '1001.099')] [2023-03-06 21:50:43,149][62475] Updated weights for policy 0, policy_version 38840 (0.0006) [2023-03-06 21:50:43,967][62475] Updated weights for policy 0, policy_version 38850 (0.0007) [2023-03-06 21:50:44,769][62475] Updated weights for policy 0, policy_version 38860 (0.0006) [2023-03-06 21:50:45,571][62475] Updated weights for policy 0, policy_version 38870 (0.0006) [2023-03-06 21:50:46,375][62475] Updated weights for policy 0, policy_version 38880 (0.0006) [2023-03-06 21:50:47,169][62475] Updated weights for policy 0, policy_version 38890 (0.0006) [2023-03-06 21:50:47,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12704.5). Total num frames: 39825408. Throughput: 0: 12701.0. Samples: 39808632. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:50:47,390][62145] Avg episode reward: [(0, '740.941')] [2023-03-06 21:50:47,981][62475] Updated weights for policy 0, policy_version 38900 (0.0006) [2023-03-06 21:50:48,810][62475] Updated weights for policy 0, policy_version 38910 (0.0006) [2023-03-06 21:50:49,616][62475] Updated weights for policy 0, policy_version 38920 (0.0006) [2023-03-06 21:50:50,411][62475] Updated weights for policy 0, policy_version 38930 (0.0006) [2023-03-06 21:50:51,224][62475] Updated weights for policy 0, policy_version 38940 (0.0006) [2023-03-06 21:50:52,025][62475] Updated weights for policy 0, policy_version 38950 (0.0007) [2023-03-06 21:50:52,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12701.1). Total num frames: 39888896. Throughput: 0: 12700.5. Samples: 39884643. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:50:52,390][62145] Avg episode reward: [(0, '1156.852')] [2023-03-06 21:50:52,840][62475] Updated weights for policy 0, policy_version 38960 (0.0007) [2023-03-06 21:50:53,649][62475] Updated weights for policy 0, policy_version 38970 (0.0006) [2023-03-06 21:50:54,450][62475] Updated weights for policy 0, policy_version 38980 (0.0007) [2023-03-06 21:50:55,239][62475] Updated weights for policy 0, policy_version 38990 (0.0006) [2023-03-06 21:50:56,062][62475] Updated weights for policy 0, policy_version 39000 (0.0006) [2023-03-06 21:50:56,870][62475] Updated weights for policy 0, policy_version 39010 (0.0006) [2023-03-06 21:50:57,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12701.1). Total num frames: 39952384. Throughput: 0: 12701.4. Samples: 39922769. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:50:57,390][62145] Avg episode reward: [(0, '1120.979')] [2023-03-06 21:50:57,673][62475] Updated weights for policy 0, policy_version 39020 (0.0006) [2023-03-06 21:50:58,497][62475] Updated weights for policy 0, policy_version 39030 (0.0006) [2023-03-06 21:50:59,318][62475] Updated weights for policy 0, policy_version 39040 (0.0006) [2023-03-06 21:51:00,115][62475] Updated weights for policy 0, policy_version 39050 (0.0006) [2023-03-06 21:51:00,901][62475] Updated weights for policy 0, policy_version 39060 (0.0006) [2023-03-06 21:51:01,717][62475] Updated weights for policy 0, policy_version 39070 (0.0006) [2023-03-06 21:51:02,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12701.1). Total num frames: 40015872. Throughput: 0: 12699.3. Samples: 39998818. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:51:02,390][62145] Avg episode reward: [(0, '890.018')] [2023-03-06 21:51:02,518][62475] Updated weights for policy 0, policy_version 39080 (0.0006) [2023-03-06 21:51:03,336][62475] Updated weights for policy 0, policy_version 39090 (0.0007) [2023-03-06 21:51:04,119][62475] Updated weights for policy 0, policy_version 39100 (0.0006) [2023-03-06 21:51:04,943][62475] Updated weights for policy 0, policy_version 39110 (0.0006) [2023-03-06 21:51:05,750][62475] Updated weights for policy 0, policy_version 39120 (0.0006) [2023-03-06 21:51:06,559][62475] Updated weights for policy 0, policy_version 39130 (0.0006) [2023-03-06 21:51:07,349][62475] Updated weights for policy 0, policy_version 39140 (0.0007) [2023-03-06 21:51:07,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12701.1). Total num frames: 40079360. Throughput: 0: 12699.2. Samples: 40075025. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:51:07,390][62145] Avg episode reward: [(0, '1011.207')] [2023-03-06 21:51:08,166][62475] Updated weights for policy 0, policy_version 39150 (0.0006) [2023-03-06 21:51:08,980][62475] Updated weights for policy 0, policy_version 39160 (0.0008) [2023-03-06 21:51:09,762][62475] Updated weights for policy 0, policy_version 39170 (0.0006) [2023-03-06 21:51:10,563][62475] Updated weights for policy 0, policy_version 39180 (0.0006) [2023-03-06 21:51:11,382][62475] Updated weights for policy 0, policy_version 39190 (0.0006) [2023-03-06 21:51:12,178][62475] Updated weights for policy 0, policy_version 39200 (0.0006) [2023-03-06 21:51:12,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12701.1). Total num frames: 40142848. Throughput: 0: 12695.5. Samples: 40113109. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:51:12,390][62145] Avg episode reward: [(0, '684.560')] [2023-03-06 21:51:12,988][62475] Updated weights for policy 0, policy_version 39210 (0.0007) [2023-03-06 21:51:13,814][62475] Updated weights for policy 0, policy_version 39220 (0.0007) [2023-03-06 21:51:14,629][62475] Updated weights for policy 0, policy_version 39230 (0.0008) [2023-03-06 21:51:15,429][62475] Updated weights for policy 0, policy_version 39240 (0.0006) [2023-03-06 21:51:16,228][62475] Updated weights for policy 0, policy_version 39250 (0.0006) [2023-03-06 21:51:17,042][62475] Updated weights for policy 0, policy_version 39260 (0.0006) [2023-03-06 21:51:17,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12701.1). Total num frames: 40206336. Throughput: 0: 12688.9. Samples: 40189032. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:51:17,390][62145] Avg episode reward: [(0, '1135.739')] [2023-03-06 21:51:17,854][62475] Updated weights for policy 0, policy_version 39270 (0.0006) [2023-03-06 21:51:18,679][62475] Updated weights for policy 0, policy_version 39280 (0.0006) [2023-03-06 21:51:19,469][62475] Updated weights for policy 0, policy_version 39290 (0.0006) [2023-03-06 21:51:20,293][62475] Updated weights for policy 0, policy_version 39300 (0.0006) [2023-03-06 21:51:21,089][62475] Updated weights for policy 0, policy_version 39310 (0.0006) [2023-03-06 21:51:21,900][62475] Updated weights for policy 0, policy_version 39320 (0.0006) [2023-03-06 21:51:22,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12701.1). Total num frames: 40269824. Throughput: 0: 12685.5. Samples: 40265054. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:51:22,390][62145] Avg episode reward: [(0, '803.409')] [2023-03-06 21:51:22,700][62475] Updated weights for policy 0, policy_version 39330 (0.0006) [2023-03-06 21:51:23,511][62475] Updated weights for policy 0, policy_version 39340 (0.0007) [2023-03-06 21:51:24,316][62475] Updated weights for policy 0, policy_version 39350 (0.0006) [2023-03-06 21:51:25,113][62475] Updated weights for policy 0, policy_version 39360 (0.0007) [2023-03-06 21:51:25,915][62475] Updated weights for policy 0, policy_version 39370 (0.0006) [2023-03-06 21:51:26,714][62475] Updated weights for policy 0, policy_version 39380 (0.0006) [2023-03-06 21:51:27,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12697.6). Total num frames: 40333312. Throughput: 0: 12687.5. Samples: 40303290. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:51:27,390][62145] Avg episode reward: [(0, '771.533')] [2023-03-06 21:51:27,527][62475] Updated weights for policy 0, policy_version 39390 (0.0006) [2023-03-06 21:51:28,340][62475] Updated weights for policy 0, policy_version 39400 (0.0007) [2023-03-06 21:51:29,136][62475] Updated weights for policy 0, policy_version 39410 (0.0006) [2023-03-06 21:51:29,942][62475] Updated weights for policy 0, policy_version 39420 (0.0006) [2023-03-06 21:51:30,724][62475] Updated weights for policy 0, policy_version 39430 (0.0006) [2023-03-06 21:51:31,524][62475] Updated weights for policy 0, policy_version 39440 (0.0006) [2023-03-06 21:51:32,346][62475] Updated weights for policy 0, policy_version 39450 (0.0006) [2023-03-06 21:51:32,390][62145] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12697.6). Total num frames: 40396800. Throughput: 0: 12693.7. Samples: 40379847. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:51:32,390][62145] Avg episode reward: [(0, '858.817')] [2023-03-06 21:51:33,145][62475] Updated weights for policy 0, policy_version 39460 (0.0006) [2023-03-06 21:51:33,953][62475] Updated weights for policy 0, policy_version 39470 (0.0006) [2023-03-06 21:51:34,766][62475] Updated weights for policy 0, policy_version 39480 (0.0007) [2023-03-06 21:51:35,569][62475] Updated weights for policy 0, policy_version 39490 (0.0006) [2023-03-06 21:51:36,386][62475] Updated weights for policy 0, policy_version 39500 (0.0006) [2023-03-06 21:51:37,178][62475] Updated weights for policy 0, policy_version 39510 (0.0006) [2023-03-06 21:51:37,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12697.6). Total num frames: 40460288. Throughput: 0: 12699.1. Samples: 40456101. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:51:37,390][62145] Avg episode reward: [(0, '939.682')] [2023-03-06 21:51:37,994][62475] Updated weights for policy 0, policy_version 39520 (0.0006) [2023-03-06 21:51:38,795][62475] Updated weights for policy 0, policy_version 39530 (0.0006) [2023-03-06 21:51:39,596][62475] Updated weights for policy 0, policy_version 39540 (0.0006) [2023-03-06 21:51:40,426][62475] Updated weights for policy 0, policy_version 39550 (0.0006) [2023-03-06 21:51:41,234][62475] Updated weights for policy 0, policy_version 39560 (0.0007) [2023-03-06 21:51:42,038][62475] Updated weights for policy 0, policy_version 39570 (0.0007) [2023-03-06 21:51:42,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12701.1). Total num frames: 40523776. Throughput: 0: 12694.4. Samples: 40494016. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:51:42,390][62145] Avg episode reward: [(0, '750.528')] [2023-03-06 21:51:42,838][62475] Updated weights for policy 0, policy_version 39580 (0.0006) [2023-03-06 21:51:43,642][62475] Updated weights for policy 0, policy_version 39590 (0.0006) [2023-03-06 21:51:44,464][62475] Updated weights for policy 0, policy_version 39600 (0.0006) [2023-03-06 21:51:45,274][62475] Updated weights for policy 0, policy_version 39610 (0.0006) [2023-03-06 21:51:46,079][62475] Updated weights for policy 0, policy_version 39620 (0.0006) [2023-03-06 21:51:46,878][62475] Updated weights for policy 0, policy_version 39630 (0.0006) [2023-03-06 21:51:47,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12701.1). Total num frames: 40587264. Throughput: 0: 12696.2. Samples: 40570147. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:51:47,390][62145] Avg episode reward: [(0, '850.074')] [2023-03-06 21:51:47,703][62475] Updated weights for policy 0, policy_version 39640 (0.0006) [2023-03-06 21:51:48,512][62475] Updated weights for policy 0, policy_version 39650 (0.0007) [2023-03-06 21:51:49,303][62475] Updated weights for policy 0, policy_version 39660 (0.0006) [2023-03-06 21:51:50,105][62475] Updated weights for policy 0, policy_version 39670 (0.0006) [2023-03-06 21:51:50,922][62475] Updated weights for policy 0, policy_version 39680 (0.0007) [2023-03-06 21:51:51,726][62475] Updated weights for policy 0, policy_version 39690 (0.0006) [2023-03-06 21:51:52,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12701.1). Total num frames: 40650752. Throughput: 0: 12689.5. Samples: 40646056. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:51:52,390][62145] Avg episode reward: [(0, '853.049')] [2023-03-06 21:51:52,530][62475] Updated weights for policy 0, policy_version 39700 (0.0006) [2023-03-06 21:51:53,326][62475] Updated weights for policy 0, policy_version 39710 (0.0007) [2023-03-06 21:51:54,138][62475] Updated weights for policy 0, policy_version 39720 (0.0007) [2023-03-06 21:51:54,957][62475] Updated weights for policy 0, policy_version 39730 (0.0006) [2023-03-06 21:51:55,775][62475] Updated weights for policy 0, policy_version 39740 (0.0006) [2023-03-06 21:51:56,565][62475] Updated weights for policy 0, policy_version 39750 (0.0006) [2023-03-06 21:51:57,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12701.1). Total num frames: 40714240. Throughput: 0: 12693.8. Samples: 40684330. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:51:57,390][62475] Updated weights for policy 0, policy_version 39760 (0.0006) [2023-03-06 21:51:57,390][62145] Avg episode reward: [(0, '615.795')] [2023-03-06 21:51:58,182][62475] Updated weights for policy 0, policy_version 39770 (0.0006) [2023-03-06 21:51:58,986][62475] Updated weights for policy 0, policy_version 39780 (0.0006) [2023-03-06 21:51:59,790][62475] Updated weights for policy 0, policy_version 39790 (0.0007) [2023-03-06 21:52:00,597][62475] Updated weights for policy 0, policy_version 39800 (0.0007) [2023-03-06 21:52:01,410][62475] Updated weights for policy 0, policy_version 39810 (0.0007) [2023-03-06 21:52:02,198][62475] Updated weights for policy 0, policy_version 39820 (0.0006) [2023-03-06 21:52:02,389][62145] Fps is (10 sec: 12697.8, 60 sec: 12697.6, 300 sec: 12701.1). Total num frames: 40777728. Throughput: 0: 12698.6. Samples: 40760470. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:52:02,390][62145] Avg episode reward: [(0, '822.776')] [2023-03-06 21:52:02,997][62475] Updated weights for policy 0, policy_version 39830 (0.0006) [2023-03-06 21:52:03,827][62475] Updated weights for policy 0, policy_version 39840 (0.0006) [2023-03-06 21:52:04,614][62475] Updated weights for policy 0, policy_version 39850 (0.0007) [2023-03-06 21:52:05,442][62475] Updated weights for policy 0, policy_version 39860 (0.0006) [2023-03-06 21:52:06,250][62475] Updated weights for policy 0, policy_version 39870 (0.0006) [2023-03-06 21:52:07,030][62475] Updated weights for policy 0, policy_version 39880 (0.0006) [2023-03-06 21:52:07,390][62145] Fps is (10 sec: 12697.4, 60 sec: 12697.6, 300 sec: 12701.1). Total num frames: 40841216. Throughput: 0: 12702.6. Samples: 40836672. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:52:07,390][62145] Avg episode reward: [(0, '799.162')] [2023-03-06 21:52:07,858][62475] Updated weights for policy 0, policy_version 39890 (0.0006) [2023-03-06 21:52:08,653][62475] Updated weights for policy 0, policy_version 39900 (0.0007) [2023-03-06 21:52:09,442][62475] Updated weights for policy 0, policy_version 39910 (0.0005) [2023-03-06 21:52:10,247][62475] Updated weights for policy 0, policy_version 39920 (0.0006) [2023-03-06 21:52:11,060][62475] Updated weights for policy 0, policy_version 39930 (0.0005) [2023-03-06 21:52:11,858][62475] Updated weights for policy 0, policy_version 39940 (0.0007) [2023-03-06 21:52:12,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12701.1). Total num frames: 40904704. Throughput: 0: 12704.7. Samples: 40875003. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:52:12,390][62145] Avg episode reward: [(0, '545.969')] [2023-03-06 21:52:12,669][62475] Updated weights for policy 0, policy_version 39950 (0.0006) [2023-03-06 21:52:13,482][62475] Updated weights for policy 0, policy_version 39960 (0.0006) [2023-03-06 21:52:14,280][62475] Updated weights for policy 0, policy_version 39970 (0.0006) [2023-03-06 21:52:15,085][62475] Updated weights for policy 0, policy_version 39980 (0.0007) [2023-03-06 21:52:15,900][62475] Updated weights for policy 0, policy_version 39990 (0.0006) [2023-03-06 21:52:16,717][62475] Updated weights for policy 0, policy_version 40000 (0.0006) [2023-03-06 21:52:17,389][62145] Fps is (10 sec: 12697.8, 60 sec: 12697.6, 300 sec: 12701.1). Total num frames: 40968192. Throughput: 0: 12696.0. Samples: 40951168. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:52:17,390][62145] Avg episode reward: [(0, '666.229')] [2023-03-06 21:52:17,521][62475] Updated weights for policy 0, policy_version 40010 (0.0007) [2023-03-06 21:52:18,338][62475] Updated weights for policy 0, policy_version 40020 (0.0006) [2023-03-06 21:52:19,130][62475] Updated weights for policy 0, policy_version 40030 (0.0006) [2023-03-06 21:52:19,929][62475] Updated weights for policy 0, policy_version 40040 (0.0006) [2023-03-06 21:52:20,747][62475] Updated weights for policy 0, policy_version 40050 (0.0007) [2023-03-06 21:52:21,537][62475] Updated weights for policy 0, policy_version 40060 (0.0007) [2023-03-06 21:52:22,339][62475] Updated weights for policy 0, policy_version 40070 (0.0006) [2023-03-06 21:52:22,390][62145] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12701.1). Total num frames: 41031680. Throughput: 0: 12699.2. Samples: 41027566. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:52:22,390][62145] Avg episode reward: [(0, '856.523')] [2023-03-06 21:52:22,393][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000040070_41031680.pth... [2023-03-06 21:52:22,424][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000037094_37984256.pth [2023-03-06 21:52:23,150][62475] Updated weights for policy 0, policy_version 40080 (0.0006) [2023-03-06 21:52:23,947][62475] Updated weights for policy 0, policy_version 40090 (0.0007) [2023-03-06 21:52:24,750][62475] Updated weights for policy 0, policy_version 40100 (0.0006) [2023-03-06 21:52:25,573][62475] Updated weights for policy 0, policy_version 40110 (0.0006) [2023-03-06 21:52:26,369][62475] Updated weights for policy 0, policy_version 40120 (0.0006) [2023-03-06 21:52:27,174][62475] Updated weights for policy 0, policy_version 40130 (0.0006) [2023-03-06 21:52:27,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12701.1). Total num frames: 41095168. Throughput: 0: 12703.2. Samples: 41065662. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:52:27,390][62145] Avg episode reward: [(0, '659.822')] [2023-03-06 21:52:27,985][62475] Updated weights for policy 0, policy_version 40140 (0.0006) [2023-03-06 21:52:28,790][62475] Updated weights for policy 0, policy_version 40150 (0.0006) [2023-03-06 21:52:29,578][62475] Updated weights for policy 0, policy_version 40160 (0.0006) [2023-03-06 21:52:30,402][62475] Updated weights for policy 0, policy_version 40170 (0.0006) [2023-03-06 21:52:31,204][62475] Updated weights for policy 0, policy_version 40180 (0.0006) [2023-03-06 21:52:31,991][62475] Updated weights for policy 0, policy_version 40190 (0.0006) [2023-03-06 21:52:32,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12701.1). Total num frames: 41158656. Throughput: 0: 12704.4. Samples: 41141843. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:52:32,390][62145] Avg episode reward: [(0, '717.564')] [2023-03-06 21:52:32,818][62475] Updated weights for policy 0, policy_version 40200 (0.0006) [2023-03-06 21:52:33,644][62475] Updated weights for policy 0, policy_version 40210 (0.0007) [2023-03-06 21:52:34,418][62475] Updated weights for policy 0, policy_version 40220 (0.0006) [2023-03-06 21:52:35,233][62475] Updated weights for policy 0, policy_version 40230 (0.0007) [2023-03-06 21:52:36,055][62475] Updated weights for policy 0, policy_version 40240 (0.0006) [2023-03-06 21:52:36,861][62475] Updated weights for policy 0, policy_version 40250 (0.0006) [2023-03-06 21:52:37,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12701.1). Total num frames: 41222144. Throughput: 0: 12710.5. Samples: 41218026. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:52:37,390][62145] Avg episode reward: [(0, '824.687')] [2023-03-06 21:52:37,660][62475] Updated weights for policy 0, policy_version 40260 (0.0006) [2023-03-06 21:52:38,475][62475] Updated weights for policy 0, policy_version 40270 (0.0007) [2023-03-06 21:52:39,270][62475] Updated weights for policy 0, policy_version 40280 (0.0006) [2023-03-06 21:52:40,079][62475] Updated weights for policy 0, policy_version 40290 (0.0008) [2023-03-06 21:52:40,892][62475] Updated weights for policy 0, policy_version 40300 (0.0006) [2023-03-06 21:52:41,692][62475] Updated weights for policy 0, policy_version 40310 (0.0006) [2023-03-06 21:52:42,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12714.7, 300 sec: 12704.5). Total num frames: 41286656. Throughput: 0: 12707.4. Samples: 41256163. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:52:42,390][62145] Avg episode reward: [(0, '717.665')] [2023-03-06 21:52:42,484][62475] Updated weights for policy 0, policy_version 40320 (0.0008) [2023-03-06 21:52:43,292][62475] Updated weights for policy 0, policy_version 40330 (0.0006) [2023-03-06 21:52:44,101][62475] Updated weights for policy 0, policy_version 40340 (0.0006) [2023-03-06 21:52:44,893][62475] Updated weights for policy 0, policy_version 40350 (0.0007) [2023-03-06 21:52:45,685][62475] Updated weights for policy 0, policy_version 40360 (0.0006) [2023-03-06 21:52:46,514][62475] Updated weights for policy 0, policy_version 40370 (0.0006) [2023-03-06 21:52:47,306][62475] Updated weights for policy 0, policy_version 40380 (0.0007) [2023-03-06 21:52:47,389][62145] Fps is (10 sec: 12800.2, 60 sec: 12714.7, 300 sec: 12704.5). Total num frames: 41350144. Throughput: 0: 12713.8. Samples: 41332588. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:52:47,390][62145] Avg episode reward: [(0, '737.186')] [2023-03-06 21:52:48,134][62475] Updated weights for policy 0, policy_version 40390 (0.0006) [2023-03-06 21:52:48,952][62475] Updated weights for policy 0, policy_version 40400 (0.0006) [2023-03-06 21:52:49,733][62475] Updated weights for policy 0, policy_version 40410 (0.0007) [2023-03-06 21:52:50,541][62475] Updated weights for policy 0, policy_version 40420 (0.0006) [2023-03-06 21:52:51,337][62475] Updated weights for policy 0, policy_version 40430 (0.0006) [2023-03-06 21:52:52,122][62475] Updated weights for policy 0, policy_version 40440 (0.0006) [2023-03-06 21:52:52,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12704.5). Total num frames: 41413632. Throughput: 0: 12721.6. Samples: 41409141. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:52:52,390][62145] Avg episode reward: [(0, '734.601')] [2023-03-06 21:52:52,945][62475] Updated weights for policy 0, policy_version 40450 (0.0006) [2023-03-06 21:52:53,735][62475] Updated weights for policy 0, policy_version 40460 (0.0006) [2023-03-06 21:52:54,550][62475] Updated weights for policy 0, policy_version 40470 (0.0006) [2023-03-06 21:52:55,344][62475] Updated weights for policy 0, policy_version 40480 (0.0006) [2023-03-06 21:52:56,165][62475] Updated weights for policy 0, policy_version 40490 (0.0007) [2023-03-06 21:52:56,965][62475] Updated weights for policy 0, policy_version 40500 (0.0006) [2023-03-06 21:52:57,389][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12704.5). Total num frames: 41477120. Throughput: 0: 12720.3. Samples: 41447415. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:52:57,390][62145] Avg episode reward: [(0, '883.360')] [2023-03-06 21:52:57,762][62475] Updated weights for policy 0, policy_version 40510 (0.0006) [2023-03-06 21:52:58,582][62475] Updated weights for policy 0, policy_version 40520 (0.0007) [2023-03-06 21:52:59,381][62475] Updated weights for policy 0, policy_version 40530 (0.0007) [2023-03-06 21:53:00,192][62475] Updated weights for policy 0, policy_version 40540 (0.0006) [2023-03-06 21:53:00,999][62475] Updated weights for policy 0, policy_version 40550 (0.0006) [2023-03-06 21:53:01,809][62475] Updated weights for policy 0, policy_version 40560 (0.0006) [2023-03-06 21:53:02,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12701.1). Total num frames: 41540608. Throughput: 0: 12716.7. Samples: 41523421. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:53:02,390][62145] Avg episode reward: [(0, '812.051')] [2023-03-06 21:53:02,600][62475] Updated weights for policy 0, policy_version 40570 (0.0006) [2023-03-06 21:53:03,424][62475] Updated weights for policy 0, policy_version 40580 (0.0007) [2023-03-06 21:53:04,248][62475] Updated weights for policy 0, policy_version 40590 (0.0007) [2023-03-06 21:53:05,057][62475] Updated weights for policy 0, policy_version 40600 (0.0006) [2023-03-06 21:53:05,854][62475] Updated weights for policy 0, policy_version 40610 (0.0006) [2023-03-06 21:53:06,677][62475] Updated weights for policy 0, policy_version 40620 (0.0007) [2023-03-06 21:53:07,390][62145] Fps is (10 sec: 12595.1, 60 sec: 12697.6, 300 sec: 12697.6). Total num frames: 41603072. Throughput: 0: 12702.4. Samples: 41599176. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:53:07,390][62145] Avg episode reward: [(0, '982.467')] [2023-03-06 21:53:07,457][62475] Updated weights for policy 0, policy_version 40630 (0.0006) [2023-03-06 21:53:08,296][62475] Updated weights for policy 0, policy_version 40640 (0.0006) [2023-03-06 21:53:09,084][62475] Updated weights for policy 0, policy_version 40650 (0.0006) [2023-03-06 21:53:09,886][62475] Updated weights for policy 0, policy_version 40660 (0.0006) [2023-03-06 21:53:10,688][62475] Updated weights for policy 0, policy_version 40670 (0.0006) [2023-03-06 21:53:11,518][62475] Updated weights for policy 0, policy_version 40680 (0.0006) [2023-03-06 21:53:12,309][62475] Updated weights for policy 0, policy_version 40690 (0.0007) [2023-03-06 21:53:12,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12701.1). Total num frames: 41667584. Throughput: 0: 12705.5. Samples: 41637410. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:53:12,390][62145] Avg episode reward: [(0, '1047.430')] [2023-03-06 21:53:13,140][62475] Updated weights for policy 0, policy_version 40700 (0.0007) [2023-03-06 21:53:13,942][62475] Updated weights for policy 0, policy_version 40710 (0.0006) [2023-03-06 21:53:14,726][62475] Updated weights for policy 0, policy_version 40720 (0.0006) [2023-03-06 21:53:15,529][62475] Updated weights for policy 0, policy_version 40730 (0.0006) [2023-03-06 21:53:16,344][62475] Updated weights for policy 0, policy_version 40740 (0.0006) [2023-03-06 21:53:17,140][62475] Updated weights for policy 0, policy_version 40750 (0.0008) [2023-03-06 21:53:17,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12697.6). Total num frames: 41730048. Throughput: 0: 12702.5. Samples: 41713456. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:53:17,390][62145] Avg episode reward: [(0, '798.501')] [2023-03-06 21:53:17,944][62475] Updated weights for policy 0, policy_version 40760 (0.0006) [2023-03-06 21:53:18,773][62475] Updated weights for policy 0, policy_version 40770 (0.0006) [2023-03-06 21:53:19,575][62475] Updated weights for policy 0, policy_version 40780 (0.0006) [2023-03-06 21:53:20,383][62475] Updated weights for policy 0, policy_version 40790 (0.0007) [2023-03-06 21:53:21,207][62475] Updated weights for policy 0, policy_version 40800 (0.0007) [2023-03-06 21:53:22,015][62475] Updated weights for policy 0, policy_version 40810 (0.0006) [2023-03-06 21:53:22,389][62145] Fps is (10 sec: 12595.2, 60 sec: 12697.6, 300 sec: 12697.6). Total num frames: 41793536. Throughput: 0: 12696.6. Samples: 41789371. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:53:22,390][62145] Avg episode reward: [(0, '938.040')] [2023-03-06 21:53:22,798][62475] Updated weights for policy 0, policy_version 40820 (0.0006) [2023-03-06 21:53:23,604][62475] Updated weights for policy 0, policy_version 40830 (0.0006) [2023-03-06 21:53:24,422][62475] Updated weights for policy 0, policy_version 40840 (0.0006) [2023-03-06 21:53:25,230][62475] Updated weights for policy 0, policy_version 40850 (0.0006) [2023-03-06 21:53:26,049][62475] Updated weights for policy 0, policy_version 40860 (0.0007) [2023-03-06 21:53:26,866][62475] Updated weights for policy 0, policy_version 40870 (0.0007) [2023-03-06 21:53:27,390][62145] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12697.6). Total num frames: 41857024. Throughput: 0: 12700.2. Samples: 41827673. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:53:27,390][62145] Avg episode reward: [(0, '772.809')] [2023-03-06 21:53:27,668][62475] Updated weights for policy 0, policy_version 40880 (0.0006) [2023-03-06 21:53:28,465][62475] Updated weights for policy 0, policy_version 40890 (0.0006) [2023-03-06 21:53:29,268][62475] Updated weights for policy 0, policy_version 40900 (0.0008) [2023-03-06 21:53:30,068][62475] Updated weights for policy 0, policy_version 40910 (0.0006) [2023-03-06 21:53:30,880][62475] Updated weights for policy 0, policy_version 40920 (0.0006) [2023-03-06 21:53:31,684][62475] Updated weights for policy 0, policy_version 40930 (0.0006) [2023-03-06 21:53:32,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12697.6). Total num frames: 41920512. Throughput: 0: 12693.2. Samples: 41903782. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:53:32,390][62145] Avg episode reward: [(0, '915.027')] [2023-03-06 21:53:32,482][62475] Updated weights for policy 0, policy_version 40940 (0.0006) [2023-03-06 21:53:33,280][62475] Updated weights for policy 0, policy_version 40950 (0.0006) [2023-03-06 21:53:34,090][62475] Updated weights for policy 0, policy_version 40960 (0.0006) [2023-03-06 21:53:34,899][62475] Updated weights for policy 0, policy_version 40970 (0.0006) [2023-03-06 21:53:35,686][62475] Updated weights for policy 0, policy_version 40980 (0.0006) [2023-03-06 21:53:36,517][62475] Updated weights for policy 0, policy_version 40990 (0.0007) [2023-03-06 21:53:37,319][62475] Updated weights for policy 0, policy_version 41000 (0.0006) [2023-03-06 21:53:37,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12714.7, 300 sec: 12701.1). Total num frames: 41985024. Throughput: 0: 12689.6. Samples: 41980172. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:53:37,390][62145] Avg episode reward: [(0, '1068.974')] [2023-03-06 21:53:38,118][62475] Updated weights for policy 0, policy_version 41010 (0.0006) [2023-03-06 21:53:38,926][62475] Updated weights for policy 0, policy_version 41020 (0.0006) [2023-03-06 21:53:39,746][62475] Updated weights for policy 0, policy_version 41030 (0.0007) [2023-03-06 21:53:40,550][62475] Updated weights for policy 0, policy_version 41040 (0.0007) [2023-03-06 21:53:41,349][62475] Updated weights for policy 0, policy_version 41050 (0.0007) [2023-03-06 21:53:42,154][62475] Updated weights for policy 0, policy_version 41060 (0.0007) [2023-03-06 21:53:42,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12697.6). Total num frames: 42047488. Throughput: 0: 12684.9. Samples: 42018235. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:53:42,390][62145] Avg episode reward: [(0, '1022.874')] [2023-03-06 21:53:42,965][62475] Updated weights for policy 0, policy_version 41070 (0.0006) [2023-03-06 21:53:43,776][62475] Updated weights for policy 0, policy_version 41080 (0.0006) [2023-03-06 21:53:44,581][62475] Updated weights for policy 0, policy_version 41090 (0.0007) [2023-03-06 21:53:45,378][62475] Updated weights for policy 0, policy_version 41100 (0.0006) [2023-03-06 21:53:46,184][62475] Updated weights for policy 0, policy_version 41110 (0.0006) [2023-03-06 21:53:46,979][62475] Updated weights for policy 0, policy_version 41120 (0.0006) [2023-03-06 21:53:47,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12701.1). Total num frames: 42112000. Throughput: 0: 12691.0. Samples: 42094517. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:53:47,390][62145] Avg episode reward: [(0, '1187.625')] [2023-03-06 21:53:47,773][62475] Updated weights for policy 0, policy_version 41130 (0.0006) [2023-03-06 21:53:48,604][62475] Updated weights for policy 0, policy_version 41140 (0.0006) [2023-03-06 21:53:49,413][62475] Updated weights for policy 0, policy_version 41150 (0.0006) [2023-03-06 21:53:50,215][62475] Updated weights for policy 0, policy_version 41160 (0.0006) [2023-03-06 21:53:51,013][62475] Updated weights for policy 0, policy_version 41170 (0.0006) [2023-03-06 21:53:51,829][62475] Updated weights for policy 0, policy_version 41180 (0.0006) [2023-03-06 21:53:52,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12697.6, 300 sec: 12701.1). Total num frames: 42175488. Throughput: 0: 12699.0. Samples: 42170633. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:53:52,390][62145] Avg episode reward: [(0, '1160.172')] [2023-03-06 21:53:52,621][62475] Updated weights for policy 0, policy_version 41190 (0.0006) [2023-03-06 21:53:53,431][62475] Updated weights for policy 0, policy_version 41200 (0.0006) [2023-03-06 21:53:54,254][62475] Updated weights for policy 0, policy_version 41210 (0.0006) [2023-03-06 21:53:55,049][62475] Updated weights for policy 0, policy_version 41220 (0.0007) [2023-03-06 21:53:55,860][62475] Updated weights for policy 0, policy_version 41230 (0.0006) [2023-03-06 21:53:56,675][62475] Updated weights for policy 0, policy_version 41240 (0.0008) [2023-03-06 21:53:57,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12701.1). Total num frames: 42238976. Throughput: 0: 12696.2. Samples: 42208740. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:53:57,390][62145] Avg episode reward: [(0, '1036.668')] [2023-03-06 21:53:57,475][62475] Updated weights for policy 0, policy_version 41250 (0.0007) [2023-03-06 21:53:58,275][62475] Updated weights for policy 0, policy_version 41260 (0.0006) [2023-03-06 21:53:59,095][62475] Updated weights for policy 0, policy_version 41270 (0.0006) [2023-03-06 21:53:59,886][62475] Updated weights for policy 0, policy_version 41280 (0.0007) [2023-03-06 21:54:00,696][62475] Updated weights for policy 0, policy_version 41290 (0.0006) [2023-03-06 21:54:01,513][62475] Updated weights for policy 0, policy_version 41300 (0.0006) [2023-03-06 21:54:02,309][62475] Updated weights for policy 0, policy_version 41310 (0.0006) [2023-03-06 21:54:02,389][62145] Fps is (10 sec: 12595.3, 60 sec: 12680.5, 300 sec: 12697.6). Total num frames: 42301440. Throughput: 0: 12698.9. Samples: 42284907. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:54:02,390][62145] Avg episode reward: [(0, '871.305')] [2023-03-06 21:54:03,121][62475] Updated weights for policy 0, policy_version 41320 (0.0006) [2023-03-06 21:54:03,937][62475] Updated weights for policy 0, policy_version 41330 (0.0006) [2023-03-06 21:54:04,736][62475] Updated weights for policy 0, policy_version 41340 (0.0006) [2023-03-06 21:54:05,539][62475] Updated weights for policy 0, policy_version 41350 (0.0007) [2023-03-06 21:54:06,369][62475] Updated weights for policy 0, policy_version 41360 (0.0006) [2023-03-06 21:54:07,167][62475] Updated weights for policy 0, policy_version 41370 (0.0006) [2023-03-06 21:54:07,390][62145] Fps is (10 sec: 12595.0, 60 sec: 12697.6, 300 sec: 12694.1). Total num frames: 42364928. Throughput: 0: 12700.1. Samples: 42360878. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:54:07,390][62145] Avg episode reward: [(0, '810.619')] [2023-03-06 21:54:07,985][62475] Updated weights for policy 0, policy_version 41380 (0.0007) [2023-03-06 21:54:08,797][62475] Updated weights for policy 0, policy_version 41390 (0.0006) [2023-03-06 21:54:09,601][62475] Updated weights for policy 0, policy_version 41400 (0.0006) [2023-03-06 21:54:10,395][62475] Updated weights for policy 0, policy_version 41410 (0.0006) [2023-03-06 21:54:11,194][62475] Updated weights for policy 0, policy_version 41420 (0.0007) [2023-03-06 21:54:11,994][62475] Updated weights for policy 0, policy_version 41430 (0.0006) [2023-03-06 21:54:12,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12680.5, 300 sec: 12694.1). Total num frames: 42428416. Throughput: 0: 12694.7. Samples: 42398935. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:54:12,390][62145] Avg episode reward: [(0, '835.537')] [2023-03-06 21:54:12,790][62475] Updated weights for policy 0, policy_version 41440 (0.0006) [2023-03-06 21:54:13,598][62475] Updated weights for policy 0, policy_version 41450 (0.0006) [2023-03-06 21:54:14,395][62475] Updated weights for policy 0, policy_version 41460 (0.0006) [2023-03-06 21:54:15,193][62475] Updated weights for policy 0, policy_version 41470 (0.0006) [2023-03-06 21:54:16,002][62475] Updated weights for policy 0, policy_version 41480 (0.0006) [2023-03-06 21:54:16,824][62475] Updated weights for policy 0, policy_version 41490 (0.0006) [2023-03-06 21:54:17,389][62145] Fps is (10 sec: 12800.2, 60 sec: 12714.7, 300 sec: 12697.6). Total num frames: 42492928. Throughput: 0: 12705.0. Samples: 42475503. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:54:17,390][62145] Avg episode reward: [(0, '687.295')] [2023-03-06 21:54:17,619][62475] Updated weights for policy 0, policy_version 41500 (0.0006) [2023-03-06 21:54:18,423][62475] Updated weights for policy 0, policy_version 41510 (0.0006) [2023-03-06 21:54:19,230][62475] Updated weights for policy 0, policy_version 41520 (0.0006) [2023-03-06 21:54:20,032][62475] Updated weights for policy 0, policy_version 41530 (0.0007) [2023-03-06 21:54:20,846][62475] Updated weights for policy 0, policy_version 41540 (0.0006) [2023-03-06 21:54:21,649][62475] Updated weights for policy 0, policy_version 41550 (0.0006) [2023-03-06 21:54:22,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12714.6, 300 sec: 12701.1). Total num frames: 42556416. Throughput: 0: 12702.5. Samples: 42551784. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:54:22,390][62145] Avg episode reward: [(0, '703.095')] [2023-03-06 21:54:22,394][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000041559_42556416.pth... [2023-03-06 21:54:22,425][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000038582_39507968.pth [2023-03-06 21:54:22,453][62475] Updated weights for policy 0, policy_version 41560 (0.0007) [2023-03-06 21:54:23,242][62475] Updated weights for policy 0, policy_version 41570 (0.0006) [2023-03-06 21:54:24,043][62475] Updated weights for policy 0, policy_version 41580 (0.0005) [2023-03-06 21:54:24,822][62475] Updated weights for policy 0, policy_version 41590 (0.0006) [2023-03-06 21:54:25,632][62475] Updated weights for policy 0, policy_version 41600 (0.0006) [2023-03-06 21:54:26,440][62475] Updated weights for policy 0, policy_version 41610 (0.0006) [2023-03-06 21:54:27,271][62475] Updated weights for policy 0, policy_version 41620 (0.0006) [2023-03-06 21:54:27,389][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12701.1). Total num frames: 42619904. Throughput: 0: 12716.0. Samples: 42590456. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:54:27,390][62145] Avg episode reward: [(0, '697.815')] [2023-03-06 21:54:28,053][62475] Updated weights for policy 0, policy_version 41630 (0.0006) [2023-03-06 21:54:28,865][62475] Updated weights for policy 0, policy_version 41640 (0.0008) [2023-03-06 21:54:29,672][62475] Updated weights for policy 0, policy_version 41650 (0.0006) [2023-03-06 21:54:30,477][62475] Updated weights for policy 0, policy_version 41660 (0.0006) [2023-03-06 21:54:31,262][62475] Updated weights for policy 0, policy_version 41670 (0.0007) [2023-03-06 21:54:32,078][62475] Updated weights for policy 0, policy_version 41680 (0.0007) [2023-03-06 21:54:32,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12701.1). Total num frames: 42683392. Throughput: 0: 12718.2. Samples: 42666835. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:54:32,390][62145] Avg episode reward: [(0, '719.085')] [2023-03-06 21:54:32,889][62475] Updated weights for policy 0, policy_version 41690 (0.0007) [2023-03-06 21:54:33,667][62475] Updated weights for policy 0, policy_version 41700 (0.0006) [2023-03-06 21:54:34,469][62475] Updated weights for policy 0, policy_version 41710 (0.0007) [2023-03-06 21:54:35,303][62475] Updated weights for policy 0, policy_version 41720 (0.0006) [2023-03-06 21:54:36,080][62475] Updated weights for policy 0, policy_version 41730 (0.0007) [2023-03-06 21:54:36,893][62475] Updated weights for policy 0, policy_version 41740 (0.0007) [2023-03-06 21:54:37,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12714.7, 300 sec: 12704.5). Total num frames: 42747904. Throughput: 0: 12724.8. Samples: 42743250. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:54:37,390][62145] Avg episode reward: [(0, '567.739')] [2023-03-06 21:54:37,694][62475] Updated weights for policy 0, policy_version 41750 (0.0006) [2023-03-06 21:54:38,494][62475] Updated weights for policy 0, policy_version 41760 (0.0006) [2023-03-06 21:54:39,301][62475] Updated weights for policy 0, policy_version 41770 (0.0006) [2023-03-06 21:54:40,094][62475] Updated weights for policy 0, policy_version 41780 (0.0006) [2023-03-06 21:54:40,903][62475] Updated weights for policy 0, policy_version 41790 (0.0006) [2023-03-06 21:54:41,710][62475] Updated weights for policy 0, policy_version 41800 (0.0006) [2023-03-06 21:54:42,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12731.7, 300 sec: 12704.5). Total num frames: 42811392. Throughput: 0: 12730.5. Samples: 42781612. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:54:42,390][62145] Avg episode reward: [(0, '563.087')] [2023-03-06 21:54:42,513][62475] Updated weights for policy 0, policy_version 41810 (0.0007) [2023-03-06 21:54:43,342][62475] Updated weights for policy 0, policy_version 41820 (0.0006) [2023-03-06 21:54:44,133][62475] Updated weights for policy 0, policy_version 41830 (0.0006) [2023-03-06 21:54:44,966][62475] Updated weights for policy 0, policy_version 41840 (0.0006) [2023-03-06 21:54:45,761][62475] Updated weights for policy 0, policy_version 41850 (0.0006) [2023-03-06 21:54:46,564][62475] Updated weights for policy 0, policy_version 41860 (0.0006) [2023-03-06 21:54:47,382][62475] Updated weights for policy 0, policy_version 41870 (0.0006) [2023-03-06 21:54:47,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12704.5). Total num frames: 42874880. Throughput: 0: 12728.8. Samples: 42857704. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:54:47,390][62145] Avg episode reward: [(0, '730.842')] [2023-03-06 21:54:48,174][62475] Updated weights for policy 0, policy_version 41880 (0.0006) [2023-03-06 21:54:48,972][62475] Updated weights for policy 0, policy_version 41890 (0.0006) [2023-03-06 21:54:49,787][62475] Updated weights for policy 0, policy_version 41900 (0.0006) [2023-03-06 21:54:50,571][62475] Updated weights for policy 0, policy_version 41910 (0.0007) [2023-03-06 21:54:51,387][62475] Updated weights for policy 0, policy_version 41920 (0.0006) [2023-03-06 21:54:52,201][62475] Updated weights for policy 0, policy_version 41930 (0.0006) [2023-03-06 21:54:52,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12704.5). Total num frames: 42938368. Throughput: 0: 12736.3. Samples: 42934010. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:54:52,390][62145] Avg episode reward: [(0, '669.825')] [2023-03-06 21:54:52,991][62475] Updated weights for policy 0, policy_version 41940 (0.0006) [2023-03-06 21:54:53,795][62475] Updated weights for policy 0, policy_version 41950 (0.0006) [2023-03-06 21:54:54,615][62475] Updated weights for policy 0, policy_version 41960 (0.0006) [2023-03-06 21:54:55,433][62475] Updated weights for policy 0, policy_version 41970 (0.0006) [2023-03-06 21:54:56,215][62475] Updated weights for policy 0, policy_version 41980 (0.0006) [2023-03-06 21:54:57,017][62475] Updated weights for policy 0, policy_version 41990 (0.0006) [2023-03-06 21:54:57,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.6, 300 sec: 12704.5). Total num frames: 43001856. Throughput: 0: 12739.7. Samples: 42972223. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:54:57,390][62145] Avg episode reward: [(0, '825.592')] [2023-03-06 21:54:57,827][62475] Updated weights for policy 0, policy_version 42000 (0.0006) [2023-03-06 21:54:58,619][62475] Updated weights for policy 0, policy_version 42010 (0.0006) [2023-03-06 21:54:59,441][62475] Updated weights for policy 0, policy_version 42020 (0.0006) [2023-03-06 21:55:00,256][62475] Updated weights for policy 0, policy_version 42030 (0.0006) [2023-03-06 21:55:01,057][62475] Updated weights for policy 0, policy_version 42040 (0.0006) [2023-03-06 21:55:01,879][62475] Updated weights for policy 0, policy_version 42050 (0.0007) [2023-03-06 21:55:02,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12704.5). Total num frames: 43065344. Throughput: 0: 12728.8. Samples: 43048300. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:55:02,390][62145] Avg episode reward: [(0, '918.975')] [2023-03-06 21:55:02,677][62475] Updated weights for policy 0, policy_version 42060 (0.0007) [2023-03-06 21:55:03,467][62475] Updated weights for policy 0, policy_version 42070 (0.0006) [2023-03-06 21:55:04,285][62475] Updated weights for policy 0, policy_version 42080 (0.0006) [2023-03-06 21:55:05,078][62475] Updated weights for policy 0, policy_version 42090 (0.0007) [2023-03-06 21:55:05,899][62475] Updated weights for policy 0, policy_version 42100 (0.0007) [2023-03-06 21:55:06,694][62475] Updated weights for policy 0, policy_version 42110 (0.0006) [2023-03-06 21:55:07,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12731.8, 300 sec: 12704.5). Total num frames: 43128832. Throughput: 0: 12730.8. Samples: 43124670. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:55:07,390][62145] Avg episode reward: [(0, '590.219')] [2023-03-06 21:55:07,498][62475] Updated weights for policy 0, policy_version 42120 (0.0007) [2023-03-06 21:55:08,307][62475] Updated weights for policy 0, policy_version 42130 (0.0006) [2023-03-06 21:55:09,109][62475] Updated weights for policy 0, policy_version 42140 (0.0006) [2023-03-06 21:55:09,919][62475] Updated weights for policy 0, policy_version 42150 (0.0006) [2023-03-06 21:55:10,735][62475] Updated weights for policy 0, policy_version 42160 (0.0006) [2023-03-06 21:55:11,533][62475] Updated weights for policy 0, policy_version 42170 (0.0007) [2023-03-06 21:55:12,344][62475] Updated weights for policy 0, policy_version 42180 (0.0007) [2023-03-06 21:55:12,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12731.7, 300 sec: 12704.5). Total num frames: 43192320. Throughput: 0: 12716.7. Samples: 43162709. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:55:12,390][62145] Avg episode reward: [(0, '598.695')] [2023-03-06 21:55:13,150][62475] Updated weights for policy 0, policy_version 42190 (0.0006) [2023-03-06 21:55:13,962][62475] Updated weights for policy 0, policy_version 42200 (0.0007) [2023-03-06 21:55:14,750][62475] Updated weights for policy 0, policy_version 42210 (0.0006) [2023-03-06 21:55:15,580][62475] Updated weights for policy 0, policy_version 42220 (0.0006) [2023-03-06 21:55:16,377][62475] Updated weights for policy 0, policy_version 42230 (0.0007) [2023-03-06 21:55:17,165][62475] Updated weights for policy 0, policy_version 42240 (0.0006) [2023-03-06 21:55:17,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.6, 300 sec: 12704.5). Total num frames: 43255808. Throughput: 0: 12711.4. Samples: 43238849. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:55:17,390][62145] Avg episode reward: [(0, '759.158')] [2023-03-06 21:55:18,002][62475] Updated weights for policy 0, policy_version 42250 (0.0006) [2023-03-06 21:55:18,801][62475] Updated weights for policy 0, policy_version 42260 (0.0006) [2023-03-06 21:55:19,621][62475] Updated weights for policy 0, policy_version 42270 (0.0006) [2023-03-06 21:55:20,416][62475] Updated weights for policy 0, policy_version 42280 (0.0006) [2023-03-06 21:55:21,222][62475] Updated weights for policy 0, policy_version 42290 (0.0006) [2023-03-06 21:55:22,043][62475] Updated weights for policy 0, policy_version 42300 (0.0006) [2023-03-06 21:55:22,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12704.5). Total num frames: 43319296. Throughput: 0: 12705.6. Samples: 43315002. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:55:22,390][62145] Avg episode reward: [(0, '618.206')] [2023-03-06 21:55:22,842][62475] Updated weights for policy 0, policy_version 42310 (0.0006) [2023-03-06 21:55:23,652][62475] Updated weights for policy 0, policy_version 42320 (0.0006) [2023-03-06 21:55:24,455][62475] Updated weights for policy 0, policy_version 42330 (0.0006) [2023-03-06 21:55:25,254][62475] Updated weights for policy 0, policy_version 42340 (0.0006) [2023-03-06 21:55:26,077][62475] Updated weights for policy 0, policy_version 42350 (0.0006) [2023-03-06 21:55:26,874][62475] Updated weights for policy 0, policy_version 42360 (0.0006) [2023-03-06 21:55:27,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12704.5). Total num frames: 43382784. Throughput: 0: 12701.9. Samples: 43353196. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:55:27,390][62145] Avg episode reward: [(0, '728.180')] [2023-03-06 21:55:27,689][62475] Updated weights for policy 0, policy_version 42370 (0.0006) [2023-03-06 21:55:28,481][62475] Updated weights for policy 0, policy_version 42380 (0.0006) [2023-03-06 21:55:29,286][62475] Updated weights for policy 0, policy_version 42390 (0.0007) [2023-03-06 21:55:30,106][62475] Updated weights for policy 0, policy_version 42400 (0.0006) [2023-03-06 21:55:30,922][62475] Updated weights for policy 0, policy_version 42410 (0.0006) [2023-03-06 21:55:31,726][62475] Updated weights for policy 0, policy_version 42420 (0.0007) [2023-03-06 21:55:32,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12704.5). Total num frames: 43446272. Throughput: 0: 12698.2. Samples: 43429125. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:55:32,390][62145] Avg episode reward: [(0, '586.313')] [2023-03-06 21:55:32,523][62475] Updated weights for policy 0, policy_version 42430 (0.0006) [2023-03-06 21:55:33,016][62424] KL-divergence is very high: 300.4550 [2023-03-06 21:55:33,351][62475] Updated weights for policy 0, policy_version 42440 (0.0006) [2023-03-06 21:55:34,138][62475] Updated weights for policy 0, policy_version 42450 (0.0006) [2023-03-06 21:55:34,954][62475] Updated weights for policy 0, policy_version 42460 (0.0007) [2023-03-06 21:55:35,772][62475] Updated weights for policy 0, policy_version 42470 (0.0007) [2023-03-06 21:55:36,561][62475] Updated weights for policy 0, policy_version 42480 (0.0007) [2023-03-06 21:55:37,382][62475] Updated weights for policy 0, policy_version 42490 (0.0007) [2023-03-06 21:55:37,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12704.5). Total num frames: 43509760. Throughput: 0: 12693.5. Samples: 43505218. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:55:37,390][62145] Avg episode reward: [(0, '604.293')] [2023-03-06 21:55:38,177][62475] Updated weights for policy 0, policy_version 42500 (0.0006) [2023-03-06 21:55:38,982][62475] Updated weights for policy 0, policy_version 42510 (0.0006) [2023-03-06 21:55:39,767][62475] Updated weights for policy 0, policy_version 42520 (0.0006) [2023-03-06 21:55:40,583][62475] Updated weights for policy 0, policy_version 42530 (0.0006) [2023-03-06 21:55:41,385][62475] Updated weights for policy 0, policy_version 42540 (0.0005) [2023-03-06 21:55:42,187][62475] Updated weights for policy 0, policy_version 42550 (0.0006) [2023-03-06 21:55:42,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12704.5). Total num frames: 43573248. Throughput: 0: 12694.0. Samples: 43543451. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:55:42,390][62145] Avg episode reward: [(0, '649.730')] [2023-03-06 21:55:43,004][62475] Updated weights for policy 0, policy_version 42560 (0.0006) [2023-03-06 21:55:43,798][62475] Updated weights for policy 0, policy_version 42570 (0.0006) [2023-03-06 21:55:44,595][62475] Updated weights for policy 0, policy_version 42580 (0.0007) [2023-03-06 21:55:45,412][62475] Updated weights for policy 0, policy_version 42590 (0.0006) [2023-03-06 21:55:46,206][62475] Updated weights for policy 0, policy_version 42600 (0.0007) [2023-03-06 21:55:47,014][62475] Updated weights for policy 0, policy_version 42610 (0.0006) [2023-03-06 21:55:47,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12704.5). Total num frames: 43636736. Throughput: 0: 12701.1. Samples: 43619849. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:55:47,390][62145] Avg episode reward: [(0, '619.758')] [2023-03-06 21:55:47,830][62475] Updated weights for policy 0, policy_version 42620 (0.0007) [2023-03-06 21:55:48,632][62475] Updated weights for policy 0, policy_version 42630 (0.0006) [2023-03-06 21:55:49,428][62475] Updated weights for policy 0, policy_version 42640 (0.0007) [2023-03-06 21:55:50,222][62475] Updated weights for policy 0, policy_version 42650 (0.0007) [2023-03-06 21:55:51,042][62475] Updated weights for policy 0, policy_version 42660 (0.0007) [2023-03-06 21:55:51,843][62475] Updated weights for policy 0, policy_version 42670 (0.0006) [2023-03-06 21:55:52,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12704.5). Total num frames: 43700224. Throughput: 0: 12700.5. Samples: 43696191. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:55:52,390][62145] Avg episode reward: [(0, '794.528')] [2023-03-06 21:55:52,642][62475] Updated weights for policy 0, policy_version 42680 (0.0008) [2023-03-06 21:55:53,478][62475] Updated weights for policy 0, policy_version 42690 (0.0006) [2023-03-06 21:55:54,272][62475] Updated weights for policy 0, policy_version 42700 (0.0006) [2023-03-06 21:55:55,059][62475] Updated weights for policy 0, policy_version 42710 (0.0006) [2023-03-06 21:55:55,899][62475] Updated weights for policy 0, policy_version 42720 (0.0006) [2023-03-06 21:55:56,701][62475] Updated weights for policy 0, policy_version 42730 (0.0007) [2023-03-06 21:55:57,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12704.5). Total num frames: 43763712. Throughput: 0: 12701.4. Samples: 43734274. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:55:57,390][62145] Avg episode reward: [(0, '719.066')] [2023-03-06 21:55:57,499][62475] Updated weights for policy 0, policy_version 42740 (0.0006) [2023-03-06 21:55:58,314][62475] Updated weights for policy 0, policy_version 42750 (0.0006) [2023-03-06 21:55:59,138][62475] Updated weights for policy 0, policy_version 42760 (0.0006) [2023-03-06 21:55:59,911][62475] Updated weights for policy 0, policy_version 42770 (0.0006) [2023-03-06 21:56:00,651][62424] KL-divergence is very high: 264.4541 [2023-03-06 21:56:00,734][62475] Updated weights for policy 0, policy_version 42780 (0.0006) [2023-03-06 21:56:01,535][62475] Updated weights for policy 0, policy_version 42790 (0.0006) [2023-03-06 21:56:02,335][62475] Updated weights for policy 0, policy_version 42800 (0.0006) [2023-03-06 21:56:02,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12704.5). Total num frames: 43827200. Throughput: 0: 12697.2. Samples: 43810222. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:56:02,390][62145] Avg episode reward: [(0, '732.226')] [2023-03-06 21:56:03,147][62475] Updated weights for policy 0, policy_version 42810 (0.0007) [2023-03-06 21:56:03,953][62475] Updated weights for policy 0, policy_version 42820 (0.0007) [2023-03-06 21:56:04,765][62475] Updated weights for policy 0, policy_version 42830 (0.0006) [2023-03-06 21:56:05,570][62475] Updated weights for policy 0, policy_version 42840 (0.0006) [2023-03-06 21:56:06,382][62475] Updated weights for policy 0, policy_version 42850 (0.0006) [2023-03-06 21:56:07,181][62475] Updated weights for policy 0, policy_version 42860 (0.0006) [2023-03-06 21:56:07,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12704.5). Total num frames: 43890688. Throughput: 0: 12700.7. Samples: 43886534. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:56:07,390][62145] Avg episode reward: [(0, '722.975')] [2023-03-06 21:56:07,984][62475] Updated weights for policy 0, policy_version 42870 (0.0006) [2023-03-06 21:56:08,795][62475] Updated weights for policy 0, policy_version 42880 (0.0007) [2023-03-06 21:56:09,605][62475] Updated weights for policy 0, policy_version 42890 (0.0006) [2023-03-06 21:56:10,405][62475] Updated weights for policy 0, policy_version 42900 (0.0006) [2023-03-06 21:56:11,205][62475] Updated weights for policy 0, policy_version 42910 (0.0006) [2023-03-06 21:56:12,012][62475] Updated weights for policy 0, policy_version 42920 (0.0006) [2023-03-06 21:56:12,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12704.5). Total num frames: 43954176. Throughput: 0: 12700.4. Samples: 43924714. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:56:12,390][62145] Avg episode reward: [(0, '711.190')] [2023-03-06 21:56:12,819][62475] Updated weights for policy 0, policy_version 42930 (0.0006) [2023-03-06 21:56:13,630][62475] Updated weights for policy 0, policy_version 42940 (0.0006) [2023-03-06 21:56:14,426][62475] Updated weights for policy 0, policy_version 42950 (0.0006) [2023-03-06 21:56:15,233][62475] Updated weights for policy 0, policy_version 42960 (0.0006) [2023-03-06 21:56:16,041][62475] Updated weights for policy 0, policy_version 42970 (0.0006) [2023-03-06 21:56:16,832][62475] Updated weights for policy 0, policy_version 42980 (0.0005) [2023-03-06 21:56:17,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12704.5). Total num frames: 44017664. Throughput: 0: 12705.7. Samples: 44000879. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 21:56:17,390][62145] Avg episode reward: [(0, '675.647')] [2023-03-06 21:56:17,667][62475] Updated weights for policy 0, policy_version 42990 (0.0006) [2023-03-06 21:56:18,477][62475] Updated weights for policy 0, policy_version 43000 (0.0007) [2023-03-06 21:56:19,280][62475] Updated weights for policy 0, policy_version 43010 (0.0007) [2023-03-06 21:56:20,101][62475] Updated weights for policy 0, policy_version 43020 (0.0007) [2023-03-06 21:56:20,903][62475] Updated weights for policy 0, policy_version 43030 (0.0006) [2023-03-06 21:56:21,690][62475] Updated weights for policy 0, policy_version 43040 (0.0006) [2023-03-06 21:56:22,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12704.5). Total num frames: 44081152. Throughput: 0: 12702.8. Samples: 44076847. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:56:22,390][62145] Avg episode reward: [(0, '636.724')] [2023-03-06 21:56:22,394][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000043048_44081152.pth... [2023-03-06 21:56:22,426][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000040070_41031680.pth [2023-03-06 21:56:22,503][62475] Updated weights for policy 0, policy_version 43050 (0.0006) [2023-03-06 21:56:23,324][62475] Updated weights for policy 0, policy_version 43060 (0.0006) [2023-03-06 21:56:24,122][62475] Updated weights for policy 0, policy_version 43070 (0.0006) [2023-03-06 21:56:24,943][62475] Updated weights for policy 0, policy_version 43080 (0.0005) [2023-03-06 21:56:25,751][62475] Updated weights for policy 0, policy_version 43090 (0.0006) [2023-03-06 21:56:26,537][62475] Updated weights for policy 0, policy_version 43100 (0.0006) [2023-03-06 21:56:27,335][62475] Updated weights for policy 0, policy_version 43110 (0.0006) [2023-03-06 21:56:27,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12704.5). Total num frames: 44144640. Throughput: 0: 12700.3. Samples: 44114963. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:56:27,390][62145] Avg episode reward: [(0, '758.230')] [2023-03-06 21:56:28,145][62475] Updated weights for policy 0, policy_version 43120 (0.0006) [2023-03-06 21:56:28,939][62475] Updated weights for policy 0, policy_version 43130 (0.0006) [2023-03-06 21:56:29,745][62475] Updated weights for policy 0, policy_version 43140 (0.0007) [2023-03-06 21:56:30,550][62475] Updated weights for policy 0, policy_version 43150 (0.0007) [2023-03-06 21:56:31,349][62475] Updated weights for policy 0, policy_version 43160 (0.0006) [2023-03-06 21:56:32,158][62475] Updated weights for policy 0, policy_version 43170 (0.0007) [2023-03-06 21:56:32,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12704.5). Total num frames: 44208128. Throughput: 0: 12702.9. Samples: 44191478. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:56:32,390][62145] Avg episode reward: [(0, '808.652')] [2023-03-06 21:56:32,972][62475] Updated weights for policy 0, policy_version 43180 (0.0006) [2023-03-06 21:56:33,759][62475] Updated weights for policy 0, policy_version 43190 (0.0006) [2023-03-06 21:56:34,564][62475] Updated weights for policy 0, policy_version 43200 (0.0006) [2023-03-06 21:56:35,370][62475] Updated weights for policy 0, policy_version 43210 (0.0006) [2023-03-06 21:56:36,162][62475] Updated weights for policy 0, policy_version 43220 (0.0006) [2023-03-06 21:56:36,976][62475] Updated weights for policy 0, policy_version 43230 (0.0007) [2023-03-06 21:56:37,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12714.7, 300 sec: 12708.0). Total num frames: 44272640. Throughput: 0: 12705.7. Samples: 44267945. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:56:37,390][62145] Avg episode reward: [(0, '679.532')] [2023-03-06 21:56:37,776][62475] Updated weights for policy 0, policy_version 43240 (0.0006) [2023-03-06 21:56:38,605][62475] Updated weights for policy 0, policy_version 43250 (0.0006) [2023-03-06 21:56:39,395][62475] Updated weights for policy 0, policy_version 43260 (0.0006) [2023-03-06 21:56:40,197][62475] Updated weights for policy 0, policy_version 43270 (0.0006) [2023-03-06 21:56:41,014][62475] Updated weights for policy 0, policy_version 43280 (0.0007) [2023-03-06 21:56:41,802][62475] Updated weights for policy 0, policy_version 43290 (0.0007) [2023-03-06 21:56:42,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12714.7, 300 sec: 12708.0). Total num frames: 44336128. Throughput: 0: 12707.8. Samples: 44306122. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:56:42,390][62145] Avg episode reward: [(0, '578.687')] [2023-03-06 21:56:42,627][62475] Updated weights for policy 0, policy_version 43300 (0.0006) [2023-03-06 21:56:43,426][62475] Updated weights for policy 0, policy_version 43310 (0.0006) [2023-03-06 21:56:44,234][62475] Updated weights for policy 0, policy_version 43320 (0.0005) [2023-03-06 21:56:45,044][62475] Updated weights for policy 0, policy_version 43330 (0.0006) [2023-03-06 21:56:45,823][62475] Updated weights for policy 0, policy_version 43340 (0.0006) [2023-03-06 21:56:46,628][62475] Updated weights for policy 0, policy_version 43350 (0.0006) [2023-03-06 21:56:47,390][62145] Fps is (10 sec: 12697.4, 60 sec: 12714.7, 300 sec: 12708.0). Total num frames: 44399616. Throughput: 0: 12718.3. Samples: 44382546. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:56:47,390][62145] Avg episode reward: [(0, '816.725')] [2023-03-06 21:56:47,418][62475] Updated weights for policy 0, policy_version 43360 (0.0006) [2023-03-06 21:56:48,223][62475] Updated weights for policy 0, policy_version 43370 (0.0006) [2023-03-06 21:56:49,021][62475] Updated weights for policy 0, policy_version 43380 (0.0007) [2023-03-06 21:56:49,834][62475] Updated weights for policy 0, policy_version 43390 (0.0006) [2023-03-06 21:56:50,635][62475] Updated weights for policy 0, policy_version 43400 (0.0006) [2023-03-06 21:56:51,451][62475] Updated weights for policy 0, policy_version 43410 (0.0006) [2023-03-06 21:56:52,254][62475] Updated weights for policy 0, policy_version 43420 (0.0006) [2023-03-06 21:56:52,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.6, 300 sec: 12708.0). Total num frames: 44463104. Throughput: 0: 12724.2. Samples: 44459125. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:56:52,390][62145] Avg episode reward: [(0, '647.194')] [2023-03-06 21:56:53,051][62475] Updated weights for policy 0, policy_version 43430 (0.0006) [2023-03-06 21:56:53,856][62475] Updated weights for policy 0, policy_version 43440 (0.0007) [2023-03-06 21:56:54,668][62475] Updated weights for policy 0, policy_version 43450 (0.0007) [2023-03-06 21:56:55,466][62475] Updated weights for policy 0, policy_version 43460 (0.0007) [2023-03-06 21:56:56,265][62475] Updated weights for policy 0, policy_version 43470 (0.0006) [2023-03-06 21:56:57,088][62475] Updated weights for policy 0, policy_version 43480 (0.0006) [2023-03-06 21:56:57,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12708.0). Total num frames: 44526592. Throughput: 0: 12722.0. Samples: 44497206. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:56:57,390][62145] Avg episode reward: [(0, '640.741')] [2023-03-06 21:56:57,888][62475] Updated weights for policy 0, policy_version 43490 (0.0007) [2023-03-06 21:56:58,671][62475] Updated weights for policy 0, policy_version 43500 (0.0005) [2023-03-06 21:56:59,477][62475] Updated weights for policy 0, policy_version 43510 (0.0006) [2023-03-06 21:57:00,282][62475] Updated weights for policy 0, policy_version 43520 (0.0006) [2023-03-06 21:57:01,087][62475] Updated weights for policy 0, policy_version 43530 (0.0006) [2023-03-06 21:57:01,906][62475] Updated weights for policy 0, policy_version 43540 (0.0006) [2023-03-06 21:57:02,389][62145] Fps is (10 sec: 12800.2, 60 sec: 12731.7, 300 sec: 12711.5). Total num frames: 44591104. Throughput: 0: 12730.8. Samples: 44573765. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:57:02,390][62145] Avg episode reward: [(0, '802.143')] [2023-03-06 21:57:02,698][62475] Updated weights for policy 0, policy_version 43550 (0.0007) [2023-03-06 21:57:03,506][62475] Updated weights for policy 0, policy_version 43560 (0.0006) [2023-03-06 21:57:04,317][62475] Updated weights for policy 0, policy_version 43570 (0.0007) [2023-03-06 21:57:05,126][62475] Updated weights for policy 0, policy_version 43580 (0.0006) [2023-03-06 21:57:05,929][62475] Updated weights for policy 0, policy_version 43590 (0.0006) [2023-03-06 21:57:06,730][62475] Updated weights for policy 0, policy_version 43600 (0.0006) [2023-03-06 21:57:07,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12731.7, 300 sec: 12711.5). Total num frames: 44654592. Throughput: 0: 12738.2. Samples: 44650065. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:57:07,390][62145] Avg episode reward: [(0, '761.765')] [2023-03-06 21:57:07,529][62475] Updated weights for policy 0, policy_version 43610 (0.0007) [2023-03-06 21:57:08,313][62475] Updated weights for policy 0, policy_version 43620 (0.0006) [2023-03-06 21:57:09,106][62475] Updated weights for policy 0, policy_version 43630 (0.0007) [2023-03-06 21:57:09,915][62475] Updated weights for policy 0, policy_version 43640 (0.0006) [2023-03-06 21:57:10,728][62475] Updated weights for policy 0, policy_version 43650 (0.0005) [2023-03-06 21:57:11,514][62475] Updated weights for policy 0, policy_version 43660 (0.0006) [2023-03-06 21:57:12,336][62475] Updated weights for policy 0, policy_version 43670 (0.0006) [2023-03-06 21:57:12,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12711.5). Total num frames: 44718080. Throughput: 0: 12747.9. Samples: 44688616. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:57:12,390][62145] Avg episode reward: [(0, '693.472')] [2023-03-06 21:57:13,133][62475] Updated weights for policy 0, policy_version 43680 (0.0007) [2023-03-06 21:57:13,934][62475] Updated weights for policy 0, policy_version 43690 (0.0006) [2023-03-06 21:57:14,749][62475] Updated weights for policy 0, policy_version 43700 (0.0006) [2023-03-06 21:57:15,547][62475] Updated weights for policy 0, policy_version 43710 (0.0006) [2023-03-06 21:57:16,338][62475] Updated weights for policy 0, policy_version 43720 (0.0006) [2023-03-06 21:57:17,149][62475] Updated weights for policy 0, policy_version 43730 (0.0007) [2023-03-06 21:57:17,389][62145] Fps is (10 sec: 12800.2, 60 sec: 12748.8, 300 sec: 12715.0). Total num frames: 44782592. Throughput: 0: 12746.4. Samples: 44765066. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:57:17,390][62145] Avg episode reward: [(0, '762.466')] [2023-03-06 21:57:17,954][62475] Updated weights for policy 0, policy_version 43740 (0.0006) [2023-03-06 21:57:18,774][62475] Updated weights for policy 0, policy_version 43750 (0.0006) [2023-03-06 21:57:19,553][62475] Updated weights for policy 0, policy_version 43760 (0.0006) [2023-03-06 21:57:20,369][62475] Updated weights for policy 0, policy_version 43770 (0.0005) [2023-03-06 21:57:21,158][62475] Updated weights for policy 0, policy_version 43780 (0.0006) [2023-03-06 21:57:21,972][62475] Updated weights for policy 0, policy_version 43790 (0.0008) [2023-03-06 21:57:22,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12748.8, 300 sec: 12715.0). Total num frames: 44846080. Throughput: 0: 12743.7. Samples: 44841412. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:57:22,390][62145] Avg episode reward: [(0, '990.765')] [2023-03-06 21:57:22,782][62475] Updated weights for policy 0, policy_version 43800 (0.0006) [2023-03-06 21:57:23,571][62475] Updated weights for policy 0, policy_version 43810 (0.0006) [2023-03-06 21:57:24,394][62475] Updated weights for policy 0, policy_version 43820 (0.0006) [2023-03-06 21:57:25,192][62475] Updated weights for policy 0, policy_version 43830 (0.0008) [2023-03-06 21:57:26,002][62475] Updated weights for policy 0, policy_version 43840 (0.0007) [2023-03-06 21:57:26,810][62475] Updated weights for policy 0, policy_version 43850 (0.0006) [2023-03-06 21:57:27,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12748.8, 300 sec: 12715.0). Total num frames: 44909568. Throughput: 0: 12744.5. Samples: 44879623. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:57:27,390][62145] Avg episode reward: [(0, '556.285')] [2023-03-06 21:57:27,627][62475] Updated weights for policy 0, policy_version 43860 (0.0007) [2023-03-06 21:57:28,442][62475] Updated weights for policy 0, policy_version 43870 (0.0006) [2023-03-06 21:57:29,242][62475] Updated weights for policy 0, policy_version 43880 (0.0006) [2023-03-06 21:57:30,043][62475] Updated weights for policy 0, policy_version 43890 (0.0005) [2023-03-06 21:57:30,825][62475] Updated weights for policy 0, policy_version 43900 (0.0006) [2023-03-06 21:57:31,642][62475] Updated weights for policy 0, policy_version 43910 (0.0006) [2023-03-06 21:57:32,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12748.8, 300 sec: 12715.0). Total num frames: 44973056. Throughput: 0: 12739.1. Samples: 44955806. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:57:32,390][62145] Avg episode reward: [(0, '809.738')] [2023-03-06 21:57:32,446][62475] Updated weights for policy 0, policy_version 43920 (0.0006) [2023-03-06 21:57:33,261][62475] Updated weights for policy 0, policy_version 43930 (0.0006) [2023-03-06 21:57:34,063][62475] Updated weights for policy 0, policy_version 43940 (0.0006) [2023-03-06 21:57:34,867][62475] Updated weights for policy 0, policy_version 43950 (0.0006) [2023-03-06 21:57:35,657][62475] Updated weights for policy 0, policy_version 43960 (0.0006) [2023-03-06 21:57:36,451][62475] Updated weights for policy 0, policy_version 43970 (0.0006) [2023-03-06 21:57:37,266][62475] Updated weights for policy 0, policy_version 43980 (0.0006) [2023-03-06 21:57:37,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12711.5). Total num frames: 45036544. Throughput: 0: 12735.9. Samples: 45032240. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:57:37,390][62145] Avg episode reward: [(0, '824.943')] [2023-03-06 21:57:38,079][62475] Updated weights for policy 0, policy_version 43990 (0.0007) [2023-03-06 21:57:38,870][62475] Updated weights for policy 0, policy_version 44000 (0.0005) [2023-03-06 21:57:39,655][62475] Updated weights for policy 0, policy_version 44010 (0.0006) [2023-03-06 21:57:40,457][62475] Updated weights for policy 0, policy_version 44020 (0.0006) [2023-03-06 21:57:41,266][62475] Updated weights for policy 0, policy_version 44030 (0.0006) [2023-03-06 21:57:42,070][62475] Updated weights for policy 0, policy_version 44040 (0.0006) [2023-03-06 21:57:42,389][62145] Fps is (10 sec: 12800.2, 60 sec: 12748.8, 300 sec: 12715.0). Total num frames: 45101056. Throughput: 0: 12746.7. Samples: 45070808. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:57:42,390][62145] Avg episode reward: [(0, '740.961')] [2023-03-06 21:57:42,868][62475] Updated weights for policy 0, policy_version 44050 (0.0007) [2023-03-06 21:57:43,679][62475] Updated weights for policy 0, policy_version 44060 (0.0005) [2023-03-06 21:57:44,478][62475] Updated weights for policy 0, policy_version 44070 (0.0006) [2023-03-06 21:57:45,283][62475] Updated weights for policy 0, policy_version 44080 (0.0006) [2023-03-06 21:57:46,084][62475] Updated weights for policy 0, policy_version 44090 (0.0006) [2023-03-06 21:57:46,879][62475] Updated weights for policy 0, policy_version 44100 (0.0006) [2023-03-06 21:57:47,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12748.8, 300 sec: 12715.0). Total num frames: 45164544. Throughput: 0: 12745.1. Samples: 45147293. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:57:47,390][62145] Avg episode reward: [(0, '647.402')] [2023-03-06 21:57:47,686][62475] Updated weights for policy 0, policy_version 44110 (0.0005) [2023-03-06 21:57:48,473][62475] Updated weights for policy 0, policy_version 44120 (0.0005) [2023-03-06 21:57:49,278][62475] Updated weights for policy 0, policy_version 44130 (0.0007) [2023-03-06 21:57:50,091][62475] Updated weights for policy 0, policy_version 44140 (0.0006) [2023-03-06 21:57:50,895][62475] Updated weights for policy 0, policy_version 44150 (0.0006) [2023-03-06 21:57:51,693][62475] Updated weights for policy 0, policy_version 44160 (0.0006) [2023-03-06 21:57:52,390][62145] Fps is (10 sec: 12697.4, 60 sec: 12748.8, 300 sec: 12715.0). Total num frames: 45228032. Throughput: 0: 12750.1. Samples: 45223822. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:57:52,390][62145] Avg episode reward: [(0, '856.500')] [2023-03-06 21:57:52,505][62475] Updated weights for policy 0, policy_version 44170 (0.0006) [2023-03-06 21:57:53,302][62475] Updated weights for policy 0, policy_version 44180 (0.0007) [2023-03-06 21:57:54,112][62475] Updated weights for policy 0, policy_version 44190 (0.0006) [2023-03-06 21:57:54,927][62475] Updated weights for policy 0, policy_version 44200 (0.0006) [2023-03-06 21:57:55,556][62424] KL-divergence is very high: 1032.2468 [2023-03-06 21:57:55,732][62475] Updated weights for policy 0, policy_version 44210 (0.0007) [2023-03-06 21:57:56,539][62475] Updated weights for policy 0, policy_version 44220 (0.0006) [2023-03-06 21:57:57,340][62475] Updated weights for policy 0, policy_version 44230 (0.0006) [2023-03-06 21:57:57,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12748.8, 300 sec: 12715.0). Total num frames: 45291520. Throughput: 0: 12740.3. Samples: 45261928. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:57:57,390][62145] Avg episode reward: [(0, '598.331')] [2023-03-06 21:57:58,142][62475] Updated weights for policy 0, policy_version 44240 (0.0006) [2023-03-06 21:57:58,937][62475] Updated weights for policy 0, policy_version 44250 (0.0006) [2023-03-06 21:57:59,750][62475] Updated weights for policy 0, policy_version 44260 (0.0006) [2023-03-06 21:58:00,567][62475] Updated weights for policy 0, policy_version 44270 (0.0006) [2023-03-06 21:58:01,379][62475] Updated weights for policy 0, policy_version 44280 (0.0006) [2023-03-06 21:58:02,195][62475] Updated weights for policy 0, policy_version 44290 (0.0006) [2023-03-06 21:58:02,389][62145] Fps is (10 sec: 12697.8, 60 sec: 12731.7, 300 sec: 12718.4). Total num frames: 45355008. Throughput: 0: 12730.6. Samples: 45337945. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:58:02,390][62145] Avg episode reward: [(0, '605.402')] [2023-03-06 21:58:03,008][62475] Updated weights for policy 0, policy_version 44300 (0.0007) [2023-03-06 21:58:03,809][62475] Updated weights for policy 0, policy_version 44310 (0.0008) [2023-03-06 21:58:04,622][62475] Updated weights for policy 0, policy_version 44320 (0.0006) [2023-03-06 21:58:05,433][62475] Updated weights for policy 0, policy_version 44330 (0.0007) [2023-03-06 21:58:06,216][62475] Updated weights for policy 0, policy_version 44340 (0.0006) [2023-03-06 21:58:07,057][62475] Updated weights for policy 0, policy_version 44350 (0.0006) [2023-03-06 21:58:07,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12731.8, 300 sec: 12715.0). Total num frames: 45418496. Throughput: 0: 12722.9. Samples: 45413943. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:58:07,390][62145] Avg episode reward: [(0, '731.912')] [2023-03-06 21:58:07,866][62475] Updated weights for policy 0, policy_version 44360 (0.0006) [2023-03-06 21:58:08,662][62475] Updated weights for policy 0, policy_version 44370 (0.0006) [2023-03-06 21:58:09,483][62475] Updated weights for policy 0, policy_version 44380 (0.0006) [2023-03-06 21:58:10,277][62475] Updated weights for policy 0, policy_version 44390 (0.0006) [2023-03-06 21:58:11,072][62475] Updated weights for policy 0, policy_version 44400 (0.0007) [2023-03-06 21:58:11,875][62475] Updated weights for policy 0, policy_version 44410 (0.0007) [2023-03-06 21:58:12,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12718.4). Total num frames: 45481984. Throughput: 0: 12716.6. Samples: 45451868. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:58:12,390][62145] Avg episode reward: [(0, '665.978')] [2023-03-06 21:58:12,705][62475] Updated weights for policy 0, policy_version 44420 (0.0006) [2023-03-06 21:58:13,500][62475] Updated weights for policy 0, policy_version 44430 (0.0006) [2023-03-06 21:58:14,298][62475] Updated weights for policy 0, policy_version 44440 (0.0007) [2023-03-06 21:58:15,095][62475] Updated weights for policy 0, policy_version 44450 (0.0006) [2023-03-06 21:58:15,897][62475] Updated weights for policy 0, policy_version 44460 (0.0006) [2023-03-06 21:58:16,705][62475] Updated weights for policy 0, policy_version 44470 (0.0007) [2023-03-06 21:58:17,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12718.4). Total num frames: 45545472. Throughput: 0: 12725.6. Samples: 45528458. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:58:17,390][62145] Avg episode reward: [(0, '489.917')] [2023-03-06 21:58:17,508][62475] Updated weights for policy 0, policy_version 44480 (0.0006) [2023-03-06 21:58:18,314][62475] Updated weights for policy 0, policy_version 44490 (0.0007) [2023-03-06 21:58:19,123][62475] Updated weights for policy 0, policy_version 44500 (0.0006) [2023-03-06 21:58:19,908][62475] Updated weights for policy 0, policy_version 44510 (0.0006) [2023-03-06 21:58:20,722][62475] Updated weights for policy 0, policy_version 44520 (0.0007) [2023-03-06 21:58:21,526][62475] Updated weights for policy 0, policy_version 44530 (0.0007) [2023-03-06 21:58:22,335][62475] Updated weights for policy 0, policy_version 44540 (0.0006) [2023-03-06 21:58:22,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12718.4). Total num frames: 45608960. Throughput: 0: 12722.2. Samples: 45604738. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:58:22,390][62145] Avg episode reward: [(0, '495.199')] [2023-03-06 21:58:22,405][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000044541_45609984.pth... [2023-03-06 21:58:22,435][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000041559_42556416.pth [2023-03-06 21:58:23,123][62475] Updated weights for policy 0, policy_version 44550 (0.0006) [2023-03-06 21:58:23,934][62475] Updated weights for policy 0, policy_version 44560 (0.0007) [2023-03-06 21:58:24,731][62475] Updated weights for policy 0, policy_version 44570 (0.0006) [2023-03-06 21:58:25,527][62475] Updated weights for policy 0, policy_version 44580 (0.0006) [2023-03-06 21:58:26,321][62475] Updated weights for policy 0, policy_version 44590 (0.0006) [2023-03-06 21:58:27,132][62475] Updated weights for policy 0, policy_version 44600 (0.0006) [2023-03-06 21:58:27,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12721.9). Total num frames: 45673472. Throughput: 0: 12722.6. Samples: 45643324. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:58:27,390][62145] Avg episode reward: [(0, '654.835')] [2023-03-06 21:58:27,934][62475] Updated weights for policy 0, policy_version 44610 (0.0006) [2023-03-06 21:58:28,729][62475] Updated weights for policy 0, policy_version 44620 (0.0006) [2023-03-06 21:58:29,537][62475] Updated weights for policy 0, policy_version 44630 (0.0006) [2023-03-06 21:58:30,354][62475] Updated weights for policy 0, policy_version 44640 (0.0007) [2023-03-06 21:58:31,163][62475] Updated weights for policy 0, policy_version 44650 (0.0006) [2023-03-06 21:58:31,962][62475] Updated weights for policy 0, policy_version 44660 (0.0008) [2023-03-06 21:58:32,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12718.4). Total num frames: 45736960. Throughput: 0: 12722.3. Samples: 45719799. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:58:32,390][62145] Avg episode reward: [(0, '516.670')] [2023-03-06 21:58:32,747][62475] Updated weights for policy 0, policy_version 44670 (0.0007) [2023-03-06 21:58:33,540][62475] Updated weights for policy 0, policy_version 44680 (0.0006) [2023-03-06 21:58:34,343][62475] Updated weights for policy 0, policy_version 44690 (0.0006) [2023-03-06 21:58:35,137][62475] Updated weights for policy 0, policy_version 44700 (0.0006) [2023-03-06 21:58:35,953][62475] Updated weights for policy 0, policy_version 44710 (0.0006) [2023-03-06 21:58:36,755][62475] Updated weights for policy 0, policy_version 44720 (0.0007) [2023-03-06 21:58:37,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12721.9). Total num frames: 45800448. Throughput: 0: 12724.4. Samples: 45796419. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:58:37,401][62145] Avg episode reward: [(0, '473.594')] [2023-03-06 21:58:37,548][62475] Updated weights for policy 0, policy_version 44730 (0.0006) [2023-03-06 21:58:38,372][62475] Updated weights for policy 0, policy_version 44740 (0.0007) [2023-03-06 21:58:39,176][62475] Updated weights for policy 0, policy_version 44750 (0.0007) [2023-03-06 21:58:39,970][62475] Updated weights for policy 0, policy_version 44760 (0.0007) [2023-03-06 21:58:40,787][62475] Updated weights for policy 0, policy_version 44770 (0.0006) [2023-03-06 21:58:41,582][62475] Updated weights for policy 0, policy_version 44780 (0.0006) [2023-03-06 21:58:42,373][62475] Updated weights for policy 0, policy_version 44790 (0.0006) [2023-03-06 21:58:42,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12721.9). Total num frames: 45864960. Throughput: 0: 12726.6. Samples: 45834625. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:58:42,401][62145] Avg episode reward: [(0, '609.849')] [2023-03-06 21:58:43,193][62475] Updated weights for policy 0, policy_version 44800 (0.0007) [2023-03-06 21:58:43,979][62475] Updated weights for policy 0, policy_version 44810 (0.0006) [2023-03-06 21:58:44,789][62475] Updated weights for policy 0, policy_version 44820 (0.0007) [2023-03-06 21:58:45,605][62475] Updated weights for policy 0, policy_version 44830 (0.0006) [2023-03-06 21:58:46,401][62475] Updated weights for policy 0, policy_version 44840 (0.0007) [2023-03-06 21:58:47,192][62475] Updated weights for policy 0, policy_version 44850 (0.0006) [2023-03-06 21:58:47,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12731.7, 300 sec: 12721.9). Total num frames: 45928448. Throughput: 0: 12739.4. Samples: 45911221. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:58:47,390][62145] Avg episode reward: [(0, '554.160')] [2023-03-06 21:58:47,992][62475] Updated weights for policy 0, policy_version 44860 (0.0007) [2023-03-06 21:58:48,798][62475] Updated weights for policy 0, policy_version 44870 (0.0006) [2023-03-06 21:58:49,584][62475] Updated weights for policy 0, policy_version 44880 (0.0008) [2023-03-06 21:58:50,393][62475] Updated weights for policy 0, policy_version 44890 (0.0006) [2023-03-06 21:58:51,180][62475] Updated weights for policy 0, policy_version 44900 (0.0006) [2023-03-06 21:58:51,992][62475] Updated weights for policy 0, policy_version 44910 (0.0007) [2023-03-06 21:58:52,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12725.4). Total num frames: 45992960. Throughput: 0: 12759.6. Samples: 45988125. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:58:52,390][62145] Avg episode reward: [(0, '532.979')] [2023-03-06 21:58:52,814][62475] Updated weights for policy 0, policy_version 44920 (0.0006) [2023-03-06 21:58:53,610][62475] Updated weights for policy 0, policy_version 44930 (0.0006) [2023-03-06 21:58:54,414][62475] Updated weights for policy 0, policy_version 44940 (0.0007) [2023-03-06 21:58:55,206][62475] Updated weights for policy 0, policy_version 44950 (0.0006) [2023-03-06 21:58:56,009][62475] Updated weights for policy 0, policy_version 44960 (0.0007) [2023-03-06 21:58:56,809][62475] Updated weights for policy 0, policy_version 44970 (0.0005) [2023-03-06 21:58:57,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12728.8). Total num frames: 46056448. Throughput: 0: 12764.9. Samples: 46026291. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:58:57,390][62145] Avg episode reward: [(0, '501.544')] [2023-03-06 21:58:57,610][62475] Updated weights for policy 0, policy_version 44980 (0.0007) [2023-03-06 21:58:58,415][62475] Updated weights for policy 0, policy_version 44990 (0.0006) [2023-03-06 21:58:59,214][62475] Updated weights for policy 0, policy_version 45000 (0.0006) [2023-03-06 21:59:00,011][62475] Updated weights for policy 0, policy_version 45010 (0.0007) [2023-03-06 21:59:00,783][62475] Updated weights for policy 0, policy_version 45020 (0.0006) [2023-03-06 21:59:01,578][62475] Updated weights for policy 0, policy_version 45030 (0.0006) [2023-03-06 21:59:02,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12748.8, 300 sec: 12728.8). Total num frames: 46119936. Throughput: 0: 12773.3. Samples: 46103255. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 21:59:02,390][62145] Avg episode reward: [(0, '546.632')] [2023-03-06 21:59:02,400][62475] Updated weights for policy 0, policy_version 45040 (0.0007) [2023-03-06 21:59:03,186][62475] Updated weights for policy 0, policy_version 45050 (0.0006) [2023-03-06 21:59:03,986][62475] Updated weights for policy 0, policy_version 45060 (0.0007) [2023-03-06 21:59:04,770][62475] Updated weights for policy 0, policy_version 45070 (0.0006) [2023-03-06 21:59:05,572][62475] Updated weights for policy 0, policy_version 45080 (0.0006) [2023-03-06 21:59:06,361][62475] Updated weights for policy 0, policy_version 45090 (0.0006) [2023-03-06 21:59:07,170][62475] Updated weights for policy 0, policy_version 45100 (0.0006) [2023-03-06 21:59:07,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12765.9, 300 sec: 12732.3). Total num frames: 46184448. Throughput: 0: 12791.9. Samples: 46180374. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:59:07,390][62145] Avg episode reward: [(0, '548.523')] [2023-03-06 21:59:07,973][62475] Updated weights for policy 0, policy_version 45110 (0.0006) [2023-03-06 21:59:08,757][62475] Updated weights for policy 0, policy_version 45120 (0.0006) [2023-03-06 21:59:09,561][62475] Updated weights for policy 0, policy_version 45130 (0.0006) [2023-03-06 21:59:10,369][62475] Updated weights for policy 0, policy_version 45140 (0.0006) [2023-03-06 21:59:11,169][62475] Updated weights for policy 0, policy_version 45150 (0.0007) [2023-03-06 21:59:11,975][62475] Updated weights for policy 0, policy_version 45160 (0.0006) [2023-03-06 21:59:12,390][62145] Fps is (10 sec: 12902.5, 60 sec: 12782.9, 300 sec: 12732.3). Total num frames: 46248960. Throughput: 0: 12791.0. Samples: 46218918. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:59:12,390][62145] Avg episode reward: [(0, '494.605')] [2023-03-06 21:59:12,779][62475] Updated weights for policy 0, policy_version 45170 (0.0006) [2023-03-06 21:59:13,586][62475] Updated weights for policy 0, policy_version 45180 (0.0006) [2023-03-06 21:59:14,393][62475] Updated weights for policy 0, policy_version 45190 (0.0006) [2023-03-06 21:59:15,195][62475] Updated weights for policy 0, policy_version 45200 (0.0007) [2023-03-06 21:59:16,020][62475] Updated weights for policy 0, policy_version 45210 (0.0007) [2023-03-06 21:59:16,801][62475] Updated weights for policy 0, policy_version 45220 (0.0007) [2023-03-06 21:59:17,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12782.9, 300 sec: 12732.3). Total num frames: 46312448. Throughput: 0: 12782.8. Samples: 46295026. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:59:17,390][62145] Avg episode reward: [(0, '629.474')] [2023-03-06 21:59:17,617][62475] Updated weights for policy 0, policy_version 45230 (0.0006) [2023-03-06 21:59:18,399][62475] Updated weights for policy 0, policy_version 45240 (0.0006) [2023-03-06 21:59:19,209][62475] Updated weights for policy 0, policy_version 45250 (0.0006) [2023-03-06 21:59:20,026][62475] Updated weights for policy 0, policy_version 45260 (0.0006) [2023-03-06 21:59:20,825][62475] Updated weights for policy 0, policy_version 45270 (0.0007) [2023-03-06 21:59:21,623][62475] Updated weights for policy 0, policy_version 45280 (0.0006) [2023-03-06 21:59:22,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12783.0, 300 sec: 12732.3). Total num frames: 46375936. Throughput: 0: 12784.0. Samples: 46371697. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:59:22,390][62145] Avg episode reward: [(0, '683.920')] [2023-03-06 21:59:22,419][62475] Updated weights for policy 0, policy_version 45290 (0.0006) [2023-03-06 21:59:23,239][62475] Updated weights for policy 0, policy_version 45300 (0.0006) [2023-03-06 21:59:24,025][62475] Updated weights for policy 0, policy_version 45310 (0.0006) [2023-03-06 21:59:24,839][62475] Updated weights for policy 0, policy_version 45320 (0.0007) [2023-03-06 21:59:25,638][62475] Updated weights for policy 0, policy_version 45330 (0.0007) [2023-03-06 21:59:26,436][62475] Updated weights for policy 0, policy_version 45340 (0.0006) [2023-03-06 21:59:27,240][62475] Updated weights for policy 0, policy_version 45350 (0.0007) [2023-03-06 21:59:27,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12765.9, 300 sec: 12732.3). Total num frames: 46439424. Throughput: 0: 12784.5. Samples: 46409929. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:59:27,390][62145] Avg episode reward: [(0, '660.466')] [2023-03-06 21:59:28,037][62475] Updated weights for policy 0, policy_version 45360 (0.0006) [2023-03-06 21:59:28,838][62475] Updated weights for policy 0, policy_version 45370 (0.0006) [2023-03-06 21:59:29,644][62475] Updated weights for policy 0, policy_version 45380 (0.0007) [2023-03-06 21:59:30,453][62475] Updated weights for policy 0, policy_version 45390 (0.0006) [2023-03-06 21:59:31,269][62475] Updated weights for policy 0, policy_version 45400 (0.0007) [2023-03-06 21:59:32,069][62475] Updated weights for policy 0, policy_version 45410 (0.0006) [2023-03-06 21:59:32,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12783.0, 300 sec: 12732.3). Total num frames: 46503936. Throughput: 0: 12781.1. Samples: 46486368. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:59:32,390][62145] Avg episode reward: [(0, '622.462')] [2023-03-06 21:59:32,878][62475] Updated weights for policy 0, policy_version 45420 (0.0006) [2023-03-06 21:59:33,692][62475] Updated weights for policy 0, policy_version 45430 (0.0007) [2023-03-06 21:59:34,517][62475] Updated weights for policy 0, policy_version 45440 (0.0006) [2023-03-06 21:59:35,298][62475] Updated weights for policy 0, policy_version 45450 (0.0007) [2023-03-06 21:59:36,106][62475] Updated weights for policy 0, policy_version 45460 (0.0008) [2023-03-06 21:59:36,930][62475] Updated weights for policy 0, policy_version 45470 (0.0006) [2023-03-06 21:59:37,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12765.9, 300 sec: 12728.8). Total num frames: 46566400. Throughput: 0: 12759.3. Samples: 46562293. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:59:37,390][62145] Avg episode reward: [(0, '686.514')] [2023-03-06 21:59:37,733][62475] Updated weights for policy 0, policy_version 45480 (0.0006) [2023-03-06 21:59:38,549][62475] Updated weights for policy 0, policy_version 45490 (0.0007) [2023-03-06 21:59:39,341][62475] Updated weights for policy 0, policy_version 45500 (0.0006) [2023-03-06 21:59:40,142][62475] Updated weights for policy 0, policy_version 45510 (0.0006) [2023-03-06 21:59:40,962][62475] Updated weights for policy 0, policy_version 45520 (0.0006) [2023-03-06 21:59:41,770][62475] Updated weights for policy 0, policy_version 45530 (0.0006) [2023-03-06 21:59:42,389][62145] Fps is (10 sec: 12595.1, 60 sec: 12748.8, 300 sec: 12728.8). Total num frames: 46629888. Throughput: 0: 12760.6. Samples: 46600518. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:59:42,390][62145] Avg episode reward: [(0, '541.892')] [2023-03-06 21:59:42,573][62475] Updated weights for policy 0, policy_version 45540 (0.0007) [2023-03-06 21:59:43,384][62475] Updated weights for policy 0, policy_version 45550 (0.0006) [2023-03-06 21:59:44,198][62475] Updated weights for policy 0, policy_version 45560 (0.0007) [2023-03-06 21:59:44,995][62475] Updated weights for policy 0, policy_version 45570 (0.0006) [2023-03-06 21:59:45,801][62475] Updated weights for policy 0, policy_version 45580 (0.0006) [2023-03-06 21:59:46,605][62475] Updated weights for policy 0, policy_version 45590 (0.0006) [2023-03-06 21:59:47,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12748.8, 300 sec: 12728.8). Total num frames: 46693376. Throughput: 0: 12744.5. Samples: 46676756. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:59:47,390][62145] Avg episode reward: [(0, '582.273')] [2023-03-06 21:59:47,406][62475] Updated weights for policy 0, policy_version 45600 (0.0006) [2023-03-06 21:59:48,189][62475] Updated weights for policy 0, policy_version 45610 (0.0006) [2023-03-06 21:59:49,006][62475] Updated weights for policy 0, policy_version 45620 (0.0006) [2023-03-06 21:59:49,811][62475] Updated weights for policy 0, policy_version 45630 (0.0006) [2023-03-06 21:59:50,637][62475] Updated weights for policy 0, policy_version 45640 (0.0007) [2023-03-06 21:59:51,433][62475] Updated weights for policy 0, policy_version 45650 (0.0006) [2023-03-06 21:59:52,235][62475] Updated weights for policy 0, policy_version 45660 (0.0007) [2023-03-06 21:59:52,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 46756864. Throughput: 0: 12724.3. Samples: 46752966. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:59:52,390][62145] Avg episode reward: [(0, '538.525')] [2023-03-06 21:59:53,058][62475] Updated weights for policy 0, policy_version 45670 (0.0006) [2023-03-06 21:59:53,862][62475] Updated weights for policy 0, policy_version 45680 (0.0006) [2023-03-06 21:59:54,658][62475] Updated weights for policy 0, policy_version 45690 (0.0006) [2023-03-06 21:59:55,481][62475] Updated weights for policy 0, policy_version 45700 (0.0006) [2023-03-06 21:59:56,285][62475] Updated weights for policy 0, policy_version 45710 (0.0006) [2023-03-06 21:59:57,085][62475] Updated weights for policy 0, policy_version 45720 (0.0006) [2023-03-06 21:59:57,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.8, 300 sec: 12728.8). Total num frames: 46820352. Throughput: 0: 12713.2. Samples: 46791010. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 21:59:57,390][62145] Avg episode reward: [(0, '527.967')] [2023-03-06 21:59:57,884][62475] Updated weights for policy 0, policy_version 45730 (0.0006) [2023-03-06 21:59:58,681][62475] Updated weights for policy 0, policy_version 45740 (0.0006) [2023-03-06 21:59:59,477][62475] Updated weights for policy 0, policy_version 45750 (0.0007) [2023-03-06 22:00:00,258][62475] Updated weights for policy 0, policy_version 45760 (0.0006) [2023-03-06 22:00:01,075][62475] Updated weights for policy 0, policy_version 45770 (0.0006) [2023-03-06 22:00:01,894][62475] Updated weights for policy 0, policy_version 45780 (0.0006) [2023-03-06 22:00:02,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12748.8, 300 sec: 12732.3). Total num frames: 46884864. Throughput: 0: 12723.0. Samples: 46867560. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 22:00:02,390][62145] Avg episode reward: [(0, '492.228')] [2023-03-06 22:00:02,683][62475] Updated weights for policy 0, policy_version 45790 (0.0006) [2023-03-06 22:00:03,522][62475] Updated weights for policy 0, policy_version 45800 (0.0006) [2023-03-06 22:00:04,302][62475] Updated weights for policy 0, policy_version 45810 (0.0006) [2023-03-06 22:00:05,113][62475] Updated weights for policy 0, policy_version 45820 (0.0008) [2023-03-06 22:00:05,932][62475] Updated weights for policy 0, policy_version 45830 (0.0006) [2023-03-06 22:00:06,729][62475] Updated weights for policy 0, policy_version 45840 (0.0006) [2023-03-06 22:00:07,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12732.3). Total num frames: 46948352. Throughput: 0: 12708.9. Samples: 46943596. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 22:00:07,390][62145] Avg episode reward: [(0, '743.122')] [2023-03-06 22:00:07,536][62475] Updated weights for policy 0, policy_version 45850 (0.0006) [2023-03-06 22:00:08,342][62475] Updated weights for policy 0, policy_version 45860 (0.0006) [2023-03-06 22:00:09,146][62475] Updated weights for policy 0, policy_version 45870 (0.0006) [2023-03-06 22:00:09,960][62475] Updated weights for policy 0, policy_version 45880 (0.0006) [2023-03-06 22:00:10,770][62475] Updated weights for policy 0, policy_version 45890 (0.0006) [2023-03-06 22:00:11,579][62475] Updated weights for policy 0, policy_version 45900 (0.0006) [2023-03-06 22:00:12,383][62475] Updated weights for policy 0, policy_version 45910 (0.0006) [2023-03-06 22:00:12,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12732.3). Total num frames: 47011840. Throughput: 0: 12704.8. Samples: 46981645. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 22:00:12,390][62145] Avg episode reward: [(0, '735.398')] [2023-03-06 22:00:13,182][62475] Updated weights for policy 0, policy_version 45920 (0.0006) [2023-03-06 22:00:13,995][62475] Updated weights for policy 0, policy_version 45930 (0.0007) [2023-03-06 22:00:14,799][62475] Updated weights for policy 0, policy_version 45940 (0.0006) [2023-03-06 22:00:15,578][62475] Updated weights for policy 0, policy_version 45950 (0.0006) [2023-03-06 22:00:16,417][62475] Updated weights for policy 0, policy_version 45960 (0.0007) [2023-03-06 22:00:17,213][62475] Updated weights for policy 0, policy_version 45970 (0.0007) [2023-03-06 22:00:17,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12732.3). Total num frames: 47075328. Throughput: 0: 12704.7. Samples: 47058080. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 22:00:17,390][62145] Avg episode reward: [(0, '654.217')] [2023-03-06 22:00:18,019][62475] Updated weights for policy 0, policy_version 45980 (0.0006) [2023-03-06 22:00:18,823][62475] Updated weights for policy 0, policy_version 45990 (0.0006) [2023-03-06 22:00:19,626][62475] Updated weights for policy 0, policy_version 46000 (0.0006) [2023-03-06 22:00:20,445][62475] Updated weights for policy 0, policy_version 46010 (0.0006) [2023-03-06 22:00:21,236][62475] Updated weights for policy 0, policy_version 46020 (0.0007) [2023-03-06 22:00:22,034][62475] Updated weights for policy 0, policy_version 46030 (0.0006) [2023-03-06 22:00:22,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.6, 300 sec: 12732.3). Total num frames: 47138816. Throughput: 0: 12714.1. Samples: 47134429. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 22:00:22,390][62145] Avg episode reward: [(0, '516.174')] [2023-03-06 22:00:22,394][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000046034_47138816.pth... [2023-03-06 22:00:22,423][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000043048_44081152.pth [2023-03-06 22:00:22,837][62475] Updated weights for policy 0, policy_version 46040 (0.0007) [2023-03-06 22:00:23,639][62475] Updated weights for policy 0, policy_version 46050 (0.0007) [2023-03-06 22:00:24,458][62475] Updated weights for policy 0, policy_version 46060 (0.0006) [2023-03-06 22:00:25,254][62475] Updated weights for policy 0, policy_version 46070 (0.0006) [2023-03-06 22:00:26,069][62475] Updated weights for policy 0, policy_version 46080 (0.0007) [2023-03-06 22:00:26,863][62475] Updated weights for policy 0, policy_version 46090 (0.0005) [2023-03-06 22:00:27,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.6, 300 sec: 12732.3). Total num frames: 47202304. Throughput: 0: 12710.3. Samples: 47172483. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 22:00:27,390][62145] Avg episode reward: [(0, '615.559')] [2023-03-06 22:00:27,673][62475] Updated weights for policy 0, policy_version 46100 (0.0006) [2023-03-06 22:00:28,461][62475] Updated weights for policy 0, policy_version 46110 (0.0006) [2023-03-06 22:00:29,296][62475] Updated weights for policy 0, policy_version 46120 (0.0006) [2023-03-06 22:00:30,082][62475] Updated weights for policy 0, policy_version 46130 (0.0006) [2023-03-06 22:00:30,861][62475] Updated weights for policy 0, policy_version 46140 (0.0006) [2023-03-06 22:00:31,678][62475] Updated weights for policy 0, policy_version 46150 (0.0006) [2023-03-06 22:00:32,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12732.3). Total num frames: 47265792. Throughput: 0: 12722.8. Samples: 47249283. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:00:32,390][62145] Avg episode reward: [(0, '516.122')] [2023-03-06 22:00:32,466][62475] Updated weights for policy 0, policy_version 46160 (0.0006) [2023-03-06 22:00:33,290][62475] Updated weights for policy 0, policy_version 46170 (0.0006) [2023-03-06 22:00:34,093][62475] Updated weights for policy 0, policy_version 46180 (0.0006) [2023-03-06 22:00:34,913][62475] Updated weights for policy 0, policy_version 46190 (0.0006) [2023-03-06 22:00:35,713][62475] Updated weights for policy 0, policy_version 46200 (0.0008) [2023-03-06 22:00:36,536][62475] Updated weights for policy 0, policy_version 46210 (0.0006) [2023-03-06 22:00:37,334][62475] Updated weights for policy 0, policy_version 46220 (0.0006) [2023-03-06 22:00:37,390][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12732.3). Total num frames: 47329280. Throughput: 0: 12716.6. Samples: 47325213. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:00:37,390][62145] Avg episode reward: [(0, '748.586')] [2023-03-06 22:00:38,143][62475] Updated weights for policy 0, policy_version 46230 (0.0006) [2023-03-06 22:00:38,943][62475] Updated weights for policy 0, policy_version 46240 (0.0006) [2023-03-06 22:00:39,750][62475] Updated weights for policy 0, policy_version 46250 (0.0006) [2023-03-06 22:00:40,540][62475] Updated weights for policy 0, policy_version 46260 (0.0007) [2023-03-06 22:00:41,335][62475] Updated weights for policy 0, policy_version 46270 (0.0006) [2023-03-06 22:00:42,145][62475] Updated weights for policy 0, policy_version 46280 (0.0006) [2023-03-06 22:00:42,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12735.8). Total num frames: 47393792. Throughput: 0: 12720.2. Samples: 47363418. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:00:42,390][62145] Avg episode reward: [(0, '633.009')] [2023-03-06 22:00:42,925][62475] Updated weights for policy 0, policy_version 46290 (0.0006) [2023-03-06 22:00:43,734][62475] Updated weights for policy 0, policy_version 46300 (0.0006) [2023-03-06 22:00:44,528][62475] Updated weights for policy 0, policy_version 46310 (0.0007) [2023-03-06 22:00:45,341][62475] Updated weights for policy 0, policy_version 46320 (0.0006) [2023-03-06 22:00:46,158][62475] Updated weights for policy 0, policy_version 46330 (0.0006) [2023-03-06 22:00:46,966][62475] Updated weights for policy 0, policy_version 46340 (0.0006) [2023-03-06 22:00:47,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12735.8). Total num frames: 47457280. Throughput: 0: 12720.4. Samples: 47439975. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:00:47,390][62145] Avg episode reward: [(0, '773.955')] [2023-03-06 22:00:47,761][62475] Updated weights for policy 0, policy_version 46350 (0.0008) [2023-03-06 22:00:48,573][62475] Updated weights for policy 0, policy_version 46360 (0.0006) [2023-03-06 22:00:49,376][62475] Updated weights for policy 0, policy_version 46370 (0.0007) [2023-03-06 22:00:50,194][62475] Updated weights for policy 0, policy_version 46380 (0.0006) [2023-03-06 22:00:50,997][62475] Updated weights for policy 0, policy_version 46390 (0.0006) [2023-03-06 22:00:51,797][62475] Updated weights for policy 0, policy_version 46400 (0.0005) [2023-03-06 22:00:52,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12735.8). Total num frames: 47520768. Throughput: 0: 12724.6. Samples: 47516205. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:00:52,390][62145] Avg episode reward: [(0, '783.429')] [2023-03-06 22:00:52,614][62475] Updated weights for policy 0, policy_version 46410 (0.0005) [2023-03-06 22:00:53,430][62475] Updated weights for policy 0, policy_version 46420 (0.0006) [2023-03-06 22:00:54,230][62475] Updated weights for policy 0, policy_version 46430 (0.0007) [2023-03-06 22:00:55,037][62475] Updated weights for policy 0, policy_version 46440 (0.0006) [2023-03-06 22:00:55,855][62475] Updated weights for policy 0, policy_version 46450 (0.0008) [2023-03-06 22:00:56,670][62475] Updated weights for policy 0, policy_version 46460 (0.0006) [2023-03-06 22:00:57,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12735.8). Total num frames: 47584256. Throughput: 0: 12725.5. Samples: 47554295. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:00:57,390][62145] Avg episode reward: [(0, '679.790')] [2023-03-06 22:00:57,472][62475] Updated weights for policy 0, policy_version 46470 (0.0006) [2023-03-06 22:00:58,293][62475] Updated weights for policy 0, policy_version 46480 (0.0007) [2023-03-06 22:00:59,085][62475] Updated weights for policy 0, policy_version 46490 (0.0006) [2023-03-06 22:00:59,887][62475] Updated weights for policy 0, policy_version 46500 (0.0006) [2023-03-06 22:01:00,687][62475] Updated weights for policy 0, policy_version 46510 (0.0006) [2023-03-06 22:01:01,491][62475] Updated weights for policy 0, policy_version 46520 (0.0006) [2023-03-06 22:01:02,290][62475] Updated weights for policy 0, policy_version 46530 (0.0006) [2023-03-06 22:01:02,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12735.8). Total num frames: 47647744. Throughput: 0: 12718.1. Samples: 47630396. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:01:02,390][62145] Avg episode reward: [(0, '576.582')] [2023-03-06 22:01:03,094][62475] Updated weights for policy 0, policy_version 46540 (0.0006) [2023-03-06 22:01:03,906][62475] Updated weights for policy 0, policy_version 46550 (0.0006) [2023-03-06 22:01:04,711][62475] Updated weights for policy 0, policy_version 46560 (0.0006) [2023-03-06 22:01:05,517][62475] Updated weights for policy 0, policy_version 46570 (0.0006) [2023-03-06 22:01:06,329][62475] Updated weights for policy 0, policy_version 46580 (0.0006) [2023-03-06 22:01:07,134][62475] Updated weights for policy 0, policy_version 46590 (0.0007) [2023-03-06 22:01:07,390][62145] Fps is (10 sec: 12595.3, 60 sec: 12697.6, 300 sec: 12732.3). Total num frames: 47710208. Throughput: 0: 12713.1. Samples: 47706519. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:01:07,390][62145] Avg episode reward: [(0, '566.092')] [2023-03-06 22:01:07,937][62475] Updated weights for policy 0, policy_version 46600 (0.0007) [2023-03-06 22:01:08,743][62475] Updated weights for policy 0, policy_version 46610 (0.0006) [2023-03-06 22:01:09,551][62475] Updated weights for policy 0, policy_version 46620 (0.0006) [2023-03-06 22:01:10,352][62475] Updated weights for policy 0, policy_version 46630 (0.0006) [2023-03-06 22:01:11,162][62475] Updated weights for policy 0, policy_version 46640 (0.0006) [2023-03-06 22:01:11,973][62475] Updated weights for policy 0, policy_version 46650 (0.0006) [2023-03-06 22:01:12,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12735.8). Total num frames: 47774720. Throughput: 0: 12714.8. Samples: 47744648. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:01:12,390][62145] Avg episode reward: [(0, '538.859')] [2023-03-06 22:01:12,776][62475] Updated weights for policy 0, policy_version 46660 (0.0007) [2023-03-06 22:01:13,580][62475] Updated weights for policy 0, policy_version 46670 (0.0006) [2023-03-06 22:01:14,364][62475] Updated weights for policy 0, policy_version 46680 (0.0006) [2023-03-06 22:01:15,178][62475] Updated weights for policy 0, policy_version 46690 (0.0006) [2023-03-06 22:01:15,967][62475] Updated weights for policy 0, policy_version 46700 (0.0006) [2023-03-06 22:01:16,758][62475] Updated weights for policy 0, policy_version 46710 (0.0007) [2023-03-06 22:01:17,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12714.7, 300 sec: 12735.8). Total num frames: 47838208. Throughput: 0: 12711.0. Samples: 47821277. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:01:17,390][62145] Avg episode reward: [(0, '842.370')] [2023-03-06 22:01:17,567][62475] Updated weights for policy 0, policy_version 46720 (0.0006) [2023-03-06 22:01:18,361][62475] Updated weights for policy 0, policy_version 46730 (0.0006) [2023-03-06 22:01:19,181][62475] Updated weights for policy 0, policy_version 46740 (0.0006) [2023-03-06 22:01:19,995][62475] Updated weights for policy 0, policy_version 46750 (0.0006) [2023-03-06 22:01:20,806][62475] Updated weights for policy 0, policy_version 46760 (0.0007) [2023-03-06 22:01:21,615][62475] Updated weights for policy 0, policy_version 46770 (0.0006) [2023-03-06 22:01:22,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12735.8). Total num frames: 47901696. Throughput: 0: 12718.1. Samples: 47897529. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:01:22,390][62145] Avg episode reward: [(0, '766.735')] [2023-03-06 22:01:22,415][62475] Updated weights for policy 0, policy_version 46780 (0.0006) [2023-03-06 22:01:23,225][62475] Updated weights for policy 0, policy_version 46790 (0.0007) [2023-03-06 22:01:24,013][62475] Updated weights for policy 0, policy_version 46800 (0.0006) [2023-03-06 22:01:24,833][62475] Updated weights for policy 0, policy_version 46810 (0.0007) [2023-03-06 22:01:25,640][62475] Updated weights for policy 0, policy_version 46820 (0.0006) [2023-03-06 22:01:26,427][62475] Updated weights for policy 0, policy_version 46830 (0.0006) [2023-03-06 22:01:27,246][62475] Updated weights for policy 0, policy_version 46840 (0.0006) [2023-03-06 22:01:27,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12735.8). Total num frames: 47965184. Throughput: 0: 12716.9. Samples: 47935679. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:01:27,390][62145] Avg episode reward: [(0, '775.159')] [2023-03-06 22:01:28,031][62475] Updated weights for policy 0, policy_version 46850 (0.0006) [2023-03-06 22:01:28,836][62475] Updated weights for policy 0, policy_version 46860 (0.0006) [2023-03-06 22:01:29,646][62475] Updated weights for policy 0, policy_version 46870 (0.0007) [2023-03-06 22:01:30,455][62475] Updated weights for policy 0, policy_version 46880 (0.0006) [2023-03-06 22:01:31,244][62475] Updated weights for policy 0, policy_version 46890 (0.0007) [2023-03-06 22:01:32,060][62475] Updated weights for policy 0, policy_version 46900 (0.0006) [2023-03-06 22:01:32,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12735.8). Total num frames: 48029696. Throughput: 0: 12716.1. Samples: 48012200. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:01:32,390][62145] Avg episode reward: [(0, '783.453')] [2023-03-06 22:01:32,861][62475] Updated weights for policy 0, policy_version 46910 (0.0007) [2023-03-06 22:01:33,666][62475] Updated weights for policy 0, policy_version 46920 (0.0006) [2023-03-06 22:01:34,463][62475] Updated weights for policy 0, policy_version 46930 (0.0008) [2023-03-06 22:01:35,272][62475] Updated weights for policy 0, policy_version 46940 (0.0007) [2023-03-06 22:01:36,085][62475] Updated weights for policy 0, policy_version 46950 (0.0006) [2023-03-06 22:01:36,885][62475] Updated weights for policy 0, policy_version 46960 (0.0006) [2023-03-06 22:01:37,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12735.8). Total num frames: 48093184. Throughput: 0: 12721.3. Samples: 48088666. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:01:37,390][62145] Avg episode reward: [(0, '866.370')] [2023-03-06 22:01:37,673][62475] Updated weights for policy 0, policy_version 46970 (0.0006) [2023-03-06 22:01:38,473][62475] Updated weights for policy 0, policy_version 46980 (0.0007) [2023-03-06 22:01:39,262][62475] Updated weights for policy 0, policy_version 46990 (0.0006) [2023-03-06 22:01:40,062][62475] Updated weights for policy 0, policy_version 47000 (0.0006) [2023-03-06 22:01:40,873][62475] Updated weights for policy 0, policy_version 47010 (0.0007) [2023-03-06 22:01:41,679][62475] Updated weights for policy 0, policy_version 47020 (0.0007) [2023-03-06 22:01:42,390][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12735.8). Total num frames: 48156672. Throughput: 0: 12731.5. Samples: 48127211. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:01:42,390][62145] Avg episode reward: [(0, '538.574')] [2023-03-06 22:01:42,493][62475] Updated weights for policy 0, policy_version 47030 (0.0006) [2023-03-06 22:01:43,287][62475] Updated weights for policy 0, policy_version 47040 (0.0006) [2023-03-06 22:01:44,086][62475] Updated weights for policy 0, policy_version 47050 (0.0007) [2023-03-06 22:01:44,889][62475] Updated weights for policy 0, policy_version 47060 (0.0006) [2023-03-06 22:01:45,711][62475] Updated weights for policy 0, policy_version 47070 (0.0006) [2023-03-06 22:01:46,513][62475] Updated weights for policy 0, policy_version 47080 (0.0007) [2023-03-06 22:01:47,303][62475] Updated weights for policy 0, policy_version 47090 (0.0007) [2023-03-06 22:01:47,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12731.7, 300 sec: 12739.3). Total num frames: 48221184. Throughput: 0: 12739.8. Samples: 48203685. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:01:47,390][62145] Avg episode reward: [(0, '715.690')] [2023-03-06 22:01:48,109][62475] Updated weights for policy 0, policy_version 47100 (0.0007) [2023-03-06 22:01:48,910][62475] Updated weights for policy 0, policy_version 47110 (0.0007) [2023-03-06 22:01:49,734][62475] Updated weights for policy 0, policy_version 47120 (0.0006) [2023-03-06 22:01:50,545][62475] Updated weights for policy 0, policy_version 47130 (0.0009) [2023-03-06 22:01:51,353][62475] Updated weights for policy 0, policy_version 47140 (0.0007) [2023-03-06 22:01:52,150][62475] Updated weights for policy 0, policy_version 47150 (0.0006) [2023-03-06 22:01:52,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12739.3). Total num frames: 48284672. Throughput: 0: 12742.1. Samples: 48279912. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:01:52,390][62145] Avg episode reward: [(0, '732.781')] [2023-03-06 22:01:52,941][62475] Updated weights for policy 0, policy_version 47160 (0.0007) [2023-03-06 22:01:53,749][62475] Updated weights for policy 0, policy_version 47170 (0.0007) [2023-03-06 22:01:54,544][62475] Updated weights for policy 0, policy_version 47180 (0.0006) [2023-03-06 22:01:55,347][62475] Updated weights for policy 0, policy_version 47190 (0.0007) [2023-03-06 22:01:56,141][62475] Updated weights for policy 0, policy_version 47200 (0.0006) [2023-03-06 22:01:56,940][62475] Updated weights for policy 0, policy_version 47210 (0.0006) [2023-03-06 22:01:57,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.8, 300 sec: 12735.8). Total num frames: 48348160. Throughput: 0: 12750.9. Samples: 48318437. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:01:57,390][62145] Avg episode reward: [(0, '617.073')] [2023-03-06 22:01:57,734][62475] Updated weights for policy 0, policy_version 47220 (0.0006) [2023-03-06 22:01:58,545][62475] Updated weights for policy 0, policy_version 47230 (0.0006) [2023-03-06 22:01:59,335][62475] Updated weights for policy 0, policy_version 47240 (0.0007) [2023-03-06 22:02:00,151][62475] Updated weights for policy 0, policy_version 47250 (0.0007) [2023-03-06 22:02:00,945][62475] Updated weights for policy 0, policy_version 47260 (0.0006) [2023-03-06 22:02:01,742][62475] Updated weights for policy 0, policy_version 47270 (0.0005) [2023-03-06 22:02:02,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12735.8). Total num frames: 48411648. Throughput: 0: 12750.1. Samples: 48395033. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:02:02,390][62145] Avg episode reward: [(0, '740.012')] [2023-03-06 22:02:02,557][62475] Updated weights for policy 0, policy_version 47280 (0.0006) [2023-03-06 22:02:03,376][62475] Updated weights for policy 0, policy_version 47290 (0.0006) [2023-03-06 22:02:04,165][62475] Updated weights for policy 0, policy_version 47300 (0.0006) [2023-03-06 22:02:04,996][62475] Updated weights for policy 0, policy_version 47310 (0.0006) [2023-03-06 22:02:05,789][62475] Updated weights for policy 0, policy_version 47320 (0.0006) [2023-03-06 22:02:06,606][62475] Updated weights for policy 0, policy_version 47330 (0.0007) [2023-03-06 22:02:07,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12748.8, 300 sec: 12735.8). Total num frames: 48475136. Throughput: 0: 12743.6. Samples: 48470990. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:02:07,390][62145] Avg episode reward: [(0, '570.985')] [2023-03-06 22:02:07,407][62475] Updated weights for policy 0, policy_version 47340 (0.0006) [2023-03-06 22:02:08,225][62475] Updated weights for policy 0, policy_version 47350 (0.0006) [2023-03-06 22:02:09,030][62475] Updated weights for policy 0, policy_version 47360 (0.0006) [2023-03-06 22:02:09,841][62475] Updated weights for policy 0, policy_version 47370 (0.0006) [2023-03-06 22:02:10,641][62475] Updated weights for policy 0, policy_version 47380 (0.0007) [2023-03-06 22:02:11,441][62475] Updated weights for policy 0, policy_version 47390 (0.0006) [2023-03-06 22:02:12,250][62475] Updated weights for policy 0, policy_version 47400 (0.0006) [2023-03-06 22:02:12,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12732.3). Total num frames: 48538624. Throughput: 0: 12742.7. Samples: 48509100. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:02:12,390][62145] Avg episode reward: [(0, '498.407')] [2023-03-06 22:02:13,053][62475] Updated weights for policy 0, policy_version 47410 (0.0006) [2023-03-06 22:02:13,854][62475] Updated weights for policy 0, policy_version 47420 (0.0006) [2023-03-06 22:02:14,659][62475] Updated weights for policy 0, policy_version 47430 (0.0006) [2023-03-06 22:02:15,471][62475] Updated weights for policy 0, policy_version 47440 (0.0007) [2023-03-06 22:02:16,277][62475] Updated weights for policy 0, policy_version 47450 (0.0006) [2023-03-06 22:02:17,084][62475] Updated weights for policy 0, policy_version 47460 (0.0006) [2023-03-06 22:02:17,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12732.3). Total num frames: 48602112. Throughput: 0: 12738.6. Samples: 48585435. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:02:17,390][62145] Avg episode reward: [(0, '580.024')] [2023-03-06 22:02:17,874][62475] Updated weights for policy 0, policy_version 47470 (0.0007) [2023-03-06 22:02:18,678][62475] Updated weights for policy 0, policy_version 47480 (0.0007) [2023-03-06 22:02:19,503][62475] Updated weights for policy 0, policy_version 47490 (0.0006) [2023-03-06 22:02:20,298][62475] Updated weights for policy 0, policy_version 47500 (0.0007) [2023-03-06 22:02:21,114][62475] Updated weights for policy 0, policy_version 47510 (0.0007) [2023-03-06 22:02:21,919][62475] Updated weights for policy 0, policy_version 47520 (0.0006) [2023-03-06 22:02:22,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12748.8, 300 sec: 12735.8). Total num frames: 48666624. Throughput: 0: 12736.4. Samples: 48661805. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:02:22,390][62145] Avg episode reward: [(0, '611.383')] [2023-03-06 22:02:22,394][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000047526_48666624.pth... [2023-03-06 22:02:22,424][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000044541_45609984.pth [2023-03-06 22:02:22,706][62475] Updated weights for policy 0, policy_version 47530 (0.0006) [2023-03-06 22:02:23,518][62475] Updated weights for policy 0, policy_version 47540 (0.0006) [2023-03-06 22:02:24,318][62475] Updated weights for policy 0, policy_version 47550 (0.0007) [2023-03-06 22:02:25,111][62475] Updated weights for policy 0, policy_version 47560 (0.0006) [2023-03-06 22:02:25,924][62475] Updated weights for policy 0, policy_version 47570 (0.0006) [2023-03-06 22:02:26,726][62475] Updated weights for policy 0, policy_version 47580 (0.0006) [2023-03-06 22:02:27,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12735.8). Total num frames: 48730112. Throughput: 0: 12728.3. Samples: 48699986. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:02:27,390][62145] Avg episode reward: [(0, '668.614')] [2023-03-06 22:02:27,534][62475] Updated weights for policy 0, policy_version 47590 (0.0006) [2023-03-06 22:02:28,345][62475] Updated weights for policy 0, policy_version 47600 (0.0006) [2023-03-06 22:02:29,129][62475] Updated weights for policy 0, policy_version 47610 (0.0006) [2023-03-06 22:02:29,946][62475] Updated weights for policy 0, policy_version 47620 (0.0007) [2023-03-06 22:02:30,742][62475] Updated weights for policy 0, policy_version 47630 (0.0006) [2023-03-06 22:02:31,558][62475] Updated weights for policy 0, policy_version 47640 (0.0006) [2023-03-06 22:02:32,358][62475] Updated weights for policy 0, policy_version 47650 (0.0006) [2023-03-06 22:02:32,390][62145] Fps is (10 sec: 12697.4, 60 sec: 12731.7, 300 sec: 12735.8). Total num frames: 48793600. Throughput: 0: 12727.1. Samples: 48776407. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:02:32,390][62145] Avg episode reward: [(0, '617.766')] [2023-03-06 22:02:33,170][62475] Updated weights for policy 0, policy_version 47660 (0.0006) [2023-03-06 22:02:33,981][62475] Updated weights for policy 0, policy_version 47670 (0.0006) [2023-03-06 22:02:34,775][62475] Updated weights for policy 0, policy_version 47680 (0.0006) [2023-03-06 22:02:35,613][62475] Updated weights for policy 0, policy_version 47690 (0.0006) [2023-03-06 22:02:36,420][62475] Updated weights for policy 0, policy_version 47700 (0.0006) [2023-03-06 22:02:37,223][62475] Updated weights for policy 0, policy_version 47710 (0.0007) [2023-03-06 22:02:37,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12731.7, 300 sec: 12732.3). Total num frames: 48857088. Throughput: 0: 12724.7. Samples: 48852525. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:02:37,390][62145] Avg episode reward: [(0, '504.720')] [2023-03-06 22:02:38,015][62475] Updated weights for policy 0, policy_version 47720 (0.0006) [2023-03-06 22:02:38,826][62475] Updated weights for policy 0, policy_version 47730 (0.0006) [2023-03-06 22:02:39,604][62475] Updated weights for policy 0, policy_version 47740 (0.0006) [2023-03-06 22:02:40,407][62475] Updated weights for policy 0, policy_version 47750 (0.0006) [2023-03-06 22:02:41,231][62475] Updated weights for policy 0, policy_version 47760 (0.0006) [2023-03-06 22:02:42,029][62475] Updated weights for policy 0, policy_version 47770 (0.0006) [2023-03-06 22:02:42,390][62145] Fps is (10 sec: 12697.7, 60 sec: 12731.7, 300 sec: 12732.3). Total num frames: 48920576. Throughput: 0: 12722.3. Samples: 48890940. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:02:42,390][62145] Avg episode reward: [(0, '539.575')] [2023-03-06 22:02:42,846][62475] Updated weights for policy 0, policy_version 47780 (0.0006) [2023-03-06 22:02:43,645][62475] Updated weights for policy 0, policy_version 47790 (0.0006) [2023-03-06 22:02:44,433][62475] Updated weights for policy 0, policy_version 47800 (0.0006) [2023-03-06 22:02:45,248][62475] Updated weights for policy 0, policy_version 47810 (0.0006) [2023-03-06 22:02:46,036][62475] Updated weights for policy 0, policy_version 47820 (0.0006) [2023-03-06 22:02:46,839][62475] Updated weights for policy 0, policy_version 47830 (0.0006) [2023-03-06 22:02:47,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.6, 300 sec: 12732.3). Total num frames: 48984064. Throughput: 0: 12716.3. Samples: 48967266. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:02:47,390][62145] Avg episode reward: [(0, '512.058')] [2023-03-06 22:02:47,654][62475] Updated weights for policy 0, policy_version 47840 (0.0006) [2023-03-06 22:02:48,465][62475] Updated weights for policy 0, policy_version 47850 (0.0006) [2023-03-06 22:02:49,276][62475] Updated weights for policy 0, policy_version 47860 (0.0006) [2023-03-06 22:02:50,081][62475] Updated weights for policy 0, policy_version 47870 (0.0006) [2023-03-06 22:02:50,870][62475] Updated weights for policy 0, policy_version 47880 (0.0005) [2023-03-06 22:02:51,699][62475] Updated weights for policy 0, policy_version 47890 (0.0006) [2023-03-06 22:02:52,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12732.3). Total num frames: 49047552. Throughput: 0: 12724.2. Samples: 49043580. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:02:52,390][62145] Avg episode reward: [(0, '867.220')] [2023-03-06 22:02:52,478][62475] Updated weights for policy 0, policy_version 47900 (0.0006) [2023-03-06 22:02:53,286][62475] Updated weights for policy 0, policy_version 47910 (0.0006) [2023-03-06 22:02:54,075][62475] Updated weights for policy 0, policy_version 47920 (0.0006) [2023-03-06 22:02:54,881][62475] Updated weights for policy 0, policy_version 47930 (0.0006) [2023-03-06 22:02:55,679][62475] Updated weights for policy 0, policy_version 47940 (0.0006) [2023-03-06 22:02:56,474][62475] Updated weights for policy 0, policy_version 47950 (0.0006) [2023-03-06 22:02:57,278][62475] Updated weights for policy 0, policy_version 47960 (0.0007) [2023-03-06 22:02:57,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12735.8). Total num frames: 49112064. Throughput: 0: 12733.8. Samples: 49082120. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:02:57,390][62145] Avg episode reward: [(0, '602.003')] [2023-03-06 22:02:58,095][62475] Updated weights for policy 0, policy_version 47970 (0.0006) [2023-03-06 22:02:58,898][62475] Updated weights for policy 0, policy_version 47980 (0.0006) [2023-03-06 22:02:59,691][62475] Updated weights for policy 0, policy_version 47990 (0.0006) [2023-03-06 22:03:00,477][62475] Updated weights for policy 0, policy_version 48000 (0.0006) [2023-03-06 22:03:01,284][62475] Updated weights for policy 0, policy_version 48010 (0.0006) [2023-03-06 22:03:02,087][62475] Updated weights for policy 0, policy_version 48020 (0.0006) [2023-03-06 22:03:02,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12731.7, 300 sec: 12735.8). Total num frames: 49175552. Throughput: 0: 12739.7. Samples: 49158720. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:03:02,390][62145] Avg episode reward: [(0, '431.219')] [2023-03-06 22:03:02,878][62475] Updated weights for policy 0, policy_version 48030 (0.0006) [2023-03-06 22:03:03,682][62475] Updated weights for policy 0, policy_version 48040 (0.0006) [2023-03-06 22:03:04,452][62475] Updated weights for policy 0, policy_version 48050 (0.0007) [2023-03-06 22:03:05,271][62475] Updated weights for policy 0, policy_version 48060 (0.0006) [2023-03-06 22:03:06,085][62475] Updated weights for policy 0, policy_version 48070 (0.0007) [2023-03-06 22:03:06,884][62475] Updated weights for policy 0, policy_version 48080 (0.0006) [2023-03-06 22:03:07,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12748.8, 300 sec: 12739.3). Total num frames: 49240064. Throughput: 0: 12750.1. Samples: 49235558. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:03:07,390][62145] Avg episode reward: [(0, '573.125')] [2023-03-06 22:03:07,687][62475] Updated weights for policy 0, policy_version 48090 (0.0006) [2023-03-06 22:03:08,314][62424] KL-divergence is very high: 775200.8750 [2023-03-06 22:03:08,480][62475] Updated weights for policy 0, policy_version 48100 (0.0006) [2023-03-06 22:03:09,292][62475] Updated weights for policy 0, policy_version 48110 (0.0007) [2023-03-06 22:03:10,091][62475] Updated weights for policy 0, policy_version 48120 (0.0006) [2023-03-06 22:03:10,892][62475] Updated weights for policy 0, policy_version 48130 (0.0006) [2023-03-06 22:03:11,678][62475] Updated weights for policy 0, policy_version 48140 (0.0005) [2023-03-06 22:03:12,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12739.3). Total num frames: 49303552. Throughput: 0: 12753.4. Samples: 49273890. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:03:12,390][62145] Avg episode reward: [(0, '454.775')] [2023-03-06 22:03:12,461][62475] Updated weights for policy 0, policy_version 48150 (0.0005) [2023-03-06 22:03:13,310][62475] Updated weights for policy 0, policy_version 48160 (0.0007) [2023-03-06 22:03:14,098][62475] Updated weights for policy 0, policy_version 48170 (0.0007) [2023-03-06 22:03:14,872][62475] Updated weights for policy 0, policy_version 48180 (0.0007) [2023-03-06 22:03:15,711][62475] Updated weights for policy 0, policy_version 48190 (0.0006) [2023-03-06 22:03:16,505][62475] Updated weights for policy 0, policy_version 48200 (0.0006) [2023-03-06 22:03:17,301][62475] Updated weights for policy 0, policy_version 48210 (0.0006) [2023-03-06 22:03:17,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12765.9, 300 sec: 12742.7). Total num frames: 49368064. Throughput: 0: 12757.5. Samples: 49350495. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:03:17,390][62145] Avg episode reward: [(0, '538.113')] [2023-03-06 22:03:18,090][62475] Updated weights for policy 0, policy_version 48220 (0.0006) [2023-03-06 22:03:18,906][62475] Updated weights for policy 0, policy_version 48230 (0.0007) [2023-03-06 22:03:19,700][62475] Updated weights for policy 0, policy_version 48240 (0.0006) [2023-03-06 22:03:20,499][62475] Updated weights for policy 0, policy_version 48250 (0.0006) [2023-03-06 22:03:21,300][62475] Updated weights for policy 0, policy_version 48260 (0.0007) [2023-03-06 22:03:22,098][62475] Updated weights for policy 0, policy_version 48270 (0.0006) [2023-03-06 22:03:22,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12748.8, 300 sec: 12739.3). Total num frames: 49431552. Throughput: 0: 12775.8. Samples: 49427437. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:03:22,390][62145] Avg episode reward: [(0, '470.927')] [2023-03-06 22:03:22,882][62475] Updated weights for policy 0, policy_version 48280 (0.0006) [2023-03-06 22:03:23,699][62475] Updated weights for policy 0, policy_version 48290 (0.0007) [2023-03-06 22:03:24,497][62475] Updated weights for policy 0, policy_version 48300 (0.0006) [2023-03-06 22:03:25,295][62475] Updated weights for policy 0, policy_version 48310 (0.0007) [2023-03-06 22:03:26,106][62475] Updated weights for policy 0, policy_version 48320 (0.0006) [2023-03-06 22:03:26,902][62475] Updated weights for policy 0, policy_version 48330 (0.0006) [2023-03-06 22:03:27,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12765.9, 300 sec: 12742.7). Total num frames: 49496064. Throughput: 0: 12774.7. Samples: 49465800. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:03:27,390][62145] Avg episode reward: [(0, '354.466')] [2023-03-06 22:03:27,706][62475] Updated weights for policy 0, policy_version 48340 (0.0007) [2023-03-06 22:03:28,495][62475] Updated weights for policy 0, policy_version 48350 (0.0006) [2023-03-06 22:03:29,289][62475] Updated weights for policy 0, policy_version 48360 (0.0006) [2023-03-06 22:03:30,118][62475] Updated weights for policy 0, policy_version 48370 (0.0006) [2023-03-06 22:03:30,913][62475] Updated weights for policy 0, policy_version 48380 (0.0006) [2023-03-06 22:03:31,696][62475] Updated weights for policy 0, policy_version 48390 (0.0007) [2023-03-06 22:03:32,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12765.9, 300 sec: 12742.7). Total num frames: 49559552. Throughput: 0: 12781.6. Samples: 49542438. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:03:32,390][62145] Avg episode reward: [(0, '450.657')] [2023-03-06 22:03:32,505][62475] Updated weights for policy 0, policy_version 48400 (0.0006) [2023-03-06 22:03:33,290][62475] Updated weights for policy 0, policy_version 48410 (0.0006) [2023-03-06 22:03:34,084][62475] Updated weights for policy 0, policy_version 48420 (0.0006) [2023-03-06 22:03:34,903][62475] Updated weights for policy 0, policy_version 48430 (0.0006) [2023-03-06 22:03:35,697][62475] Updated weights for policy 0, policy_version 48440 (0.0006) [2023-03-06 22:03:36,491][62475] Updated weights for policy 0, policy_version 48450 (0.0006) [2023-03-06 22:03:37,289][62475] Updated weights for policy 0, policy_version 48460 (0.0007) [2023-03-06 22:03:37,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12782.9, 300 sec: 12742.7). Total num frames: 49624064. Throughput: 0: 12796.4. Samples: 49619417. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:03:37,390][62145] Avg episode reward: [(0, '335.610')] [2023-03-06 22:03:38,095][62475] Updated weights for policy 0, policy_version 48470 (0.0006) [2023-03-06 22:03:38,888][62475] Updated weights for policy 0, policy_version 48480 (0.0006) [2023-03-06 22:03:39,666][62475] Updated weights for policy 0, policy_version 48490 (0.0006) [2023-03-06 22:03:40,498][62475] Updated weights for policy 0, policy_version 48500 (0.0006) [2023-03-06 22:03:41,308][62475] Updated weights for policy 0, policy_version 48510 (0.0006) [2023-03-06 22:03:42,103][62475] Updated weights for policy 0, policy_version 48520 (0.0006) [2023-03-06 22:03:42,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12782.9, 300 sec: 12742.7). Total num frames: 49687552. Throughput: 0: 12798.4. Samples: 49658047. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:03:42,399][62145] Avg episode reward: [(0, '291.650')] [2023-03-06 22:03:42,903][62475] Updated weights for policy 0, policy_version 48530 (0.0006) [2023-03-06 22:03:43,703][62475] Updated weights for policy 0, policy_version 48540 (0.0006) [2023-03-06 22:03:44,495][62475] Updated weights for policy 0, policy_version 48550 (0.0006) [2023-03-06 22:03:45,314][62475] Updated weights for policy 0, policy_version 48560 (0.0006) [2023-03-06 22:03:46,114][62475] Updated weights for policy 0, policy_version 48570 (0.0006) [2023-03-06 22:03:46,915][62475] Updated weights for policy 0, policy_version 48580 (0.0006) [2023-03-06 22:03:47,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12782.9, 300 sec: 12739.3). Total num frames: 49751040. Throughput: 0: 12791.4. Samples: 49734331. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:03:47,401][62145] Avg episode reward: [(0, '365.549')] [2023-03-06 22:03:47,727][62475] Updated weights for policy 0, policy_version 48590 (0.0006) [2023-03-06 22:03:48,530][62475] Updated weights for policy 0, policy_version 48600 (0.0007) [2023-03-06 22:03:49,321][62475] Updated weights for policy 0, policy_version 48610 (0.0005) [2023-03-06 22:03:50,132][62475] Updated weights for policy 0, policy_version 48620 (0.0006) [2023-03-06 22:03:50,931][62475] Updated weights for policy 0, policy_version 48630 (0.0006) [2023-03-06 22:03:51,728][62475] Updated weights for policy 0, policy_version 48640 (0.0006) [2023-03-06 22:03:52,389][62145] Fps is (10 sec: 12800.2, 60 sec: 12800.0, 300 sec: 12742.7). Total num frames: 49815552. Throughput: 0: 12783.5. Samples: 49810817. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:03:52,400][62145] Avg episode reward: [(0, '326.137')] [2023-03-06 22:03:52,550][62475] Updated weights for policy 0, policy_version 48650 (0.0006) [2023-03-06 22:03:53,361][62475] Updated weights for policy 0, policy_version 48660 (0.0007) [2023-03-06 22:03:54,157][62475] Updated weights for policy 0, policy_version 48670 (0.0007) [2023-03-06 22:03:54,963][62475] Updated weights for policy 0, policy_version 48680 (0.0006) [2023-03-06 22:03:55,770][62475] Updated weights for policy 0, policy_version 48690 (0.0006) [2023-03-06 22:03:56,554][62475] Updated weights for policy 0, policy_version 48700 (0.0006) [2023-03-06 22:03:57,366][62475] Updated weights for policy 0, policy_version 48710 (0.0006) [2023-03-06 22:03:57,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12782.9, 300 sec: 12742.7). Total num frames: 49879040. Throughput: 0: 12781.0. Samples: 49849035. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:03:57,390][62145] Avg episode reward: [(0, '342.826')] [2023-03-06 22:03:58,178][62475] Updated weights for policy 0, policy_version 48720 (0.0006) [2023-03-06 22:03:58,981][62475] Updated weights for policy 0, policy_version 48730 (0.0007) [2023-03-06 22:03:59,770][62475] Updated weights for policy 0, policy_version 48740 (0.0005) [2023-03-06 22:04:00,601][62475] Updated weights for policy 0, policy_version 48750 (0.0007) [2023-03-06 22:04:01,392][62475] Updated weights for policy 0, policy_version 48760 (0.0006) [2023-03-06 22:04:02,201][62475] Updated weights for policy 0, policy_version 48770 (0.0007) [2023-03-06 22:04:02,389][62145] Fps is (10 sec: 12697.5, 60 sec: 12782.9, 300 sec: 12739.3). Total num frames: 49942528. Throughput: 0: 12776.1. Samples: 49925419. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:04:02,390][62145] Avg episode reward: [(0, '326.084')] [2023-03-06 22:04:02,998][62475] Updated weights for policy 0, policy_version 48780 (0.0007) [2023-03-06 22:04:03,814][62475] Updated weights for policy 0, policy_version 48790 (0.0006) [2023-03-06 22:04:04,614][62475] Updated weights for policy 0, policy_version 48800 (0.0006) [2023-03-06 22:04:05,407][62475] Updated weights for policy 0, policy_version 48810 (0.0006) [2023-03-06 22:04:06,209][62475] Updated weights for policy 0, policy_version 48820 (0.0007) [2023-03-06 22:04:07,017][62475] Updated weights for policy 0, policy_version 48830 (0.0008) [2023-03-06 22:04:07,389][62145] Fps is (10 sec: 12697.9, 60 sec: 12765.9, 300 sec: 12735.8). Total num frames: 50006016. Throughput: 0: 12767.6. Samples: 50001976. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:04:07,390][62145] Avg episode reward: [(0, '280.256')] [2023-03-06 22:04:07,819][62475] Updated weights for policy 0, policy_version 48840 (0.0007) [2023-03-06 22:04:08,609][62475] Updated weights for policy 0, policy_version 48850 (0.0006) [2023-03-06 22:04:09,434][62475] Updated weights for policy 0, policy_version 48860 (0.0006) [2023-03-06 22:04:10,234][62475] Updated weights for policy 0, policy_version 48870 (0.0006) [2023-03-06 22:04:11,057][62475] Updated weights for policy 0, policy_version 48880 (0.0006) [2023-03-06 22:04:11,847][62475] Updated weights for policy 0, policy_version 48890 (0.0006) [2023-03-06 22:04:12,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12765.9, 300 sec: 12735.8). Total num frames: 50069504. Throughput: 0: 12762.1. Samples: 50040095. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:04:12,390][62145] Avg episode reward: [(0, '290.919')] [2023-03-06 22:04:12,644][62475] Updated weights for policy 0, policy_version 48900 (0.0006) [2023-03-06 22:04:13,443][62475] Updated weights for policy 0, policy_version 48910 (0.0006) [2023-03-06 22:04:14,266][62475] Updated weights for policy 0, policy_version 48920 (0.0005) [2023-03-06 22:04:15,051][62475] Updated weights for policy 0, policy_version 48930 (0.0006) [2023-03-06 22:04:15,857][62475] Updated weights for policy 0, policy_version 48940 (0.0006) [2023-03-06 22:04:16,656][62475] Updated weights for policy 0, policy_version 48950 (0.0007) [2023-03-06 22:04:17,390][62145] Fps is (10 sec: 12799.8, 60 sec: 12765.9, 300 sec: 12739.2). Total num frames: 50134016. Throughput: 0: 12759.5. Samples: 50116617. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:04:17,390][62145] Avg episode reward: [(0, '340.439')] [2023-03-06 22:04:17,468][62475] Updated weights for policy 0, policy_version 48960 (0.0007) [2023-03-06 22:04:18,275][62475] Updated weights for policy 0, policy_version 48970 (0.0006) [2023-03-06 22:04:19,067][62475] Updated weights for policy 0, policy_version 48980 (0.0006) [2023-03-06 22:04:19,866][62475] Updated weights for policy 0, policy_version 48990 (0.0006) [2023-03-06 22:04:20,682][62475] Updated weights for policy 0, policy_version 49000 (0.0006) [2023-03-06 22:04:21,473][62475] Updated weights for policy 0, policy_version 49010 (0.0006) [2023-03-06 22:04:22,306][62475] Updated weights for policy 0, policy_version 49020 (0.0007) [2023-03-06 22:04:22,390][62145] Fps is (10 sec: 12800.1, 60 sec: 12765.9, 300 sec: 12739.3). Total num frames: 50197504. Throughput: 0: 12744.2. Samples: 50192908. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:04:22,390][62145] Avg episode reward: [(0, '306.516')] [2023-03-06 22:04:22,394][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000049021_50197504.pth... [2023-03-06 22:04:22,425][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000046034_47138816.pth [2023-03-06 22:04:23,109][62475] Updated weights for policy 0, policy_version 49030 (0.0006) [2023-03-06 22:04:23,910][62475] Updated weights for policy 0, policy_version 49040 (0.0006) [2023-03-06 22:04:24,710][62475] Updated weights for policy 0, policy_version 49050 (0.0006) [2023-03-06 22:04:25,505][62475] Updated weights for policy 0, policy_version 49060 (0.0006) [2023-03-06 22:04:26,318][62475] Updated weights for policy 0, policy_version 49070 (0.0007) [2023-03-06 22:04:27,116][62475] Updated weights for policy 0, policy_version 49080 (0.0006) [2023-03-06 22:04:27,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12748.8, 300 sec: 12735.8). Total num frames: 50260992. Throughput: 0: 12731.8. Samples: 50230979. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:04:27,390][62145] Avg episode reward: [(0, '391.052')] [2023-03-06 22:04:27,918][62475] Updated weights for policy 0, policy_version 49090 (0.0006) [2023-03-06 22:04:28,727][62475] Updated weights for policy 0, policy_version 49100 (0.0007) [2023-03-06 22:04:29,525][62475] Updated weights for policy 0, policy_version 49110 (0.0007) [2023-03-06 22:04:30,318][62475] Updated weights for policy 0, policy_version 49120 (0.0006) [2023-03-06 22:04:31,136][62475] Updated weights for policy 0, policy_version 49130 (0.0006) [2023-03-06 22:04:31,943][62475] Updated weights for policy 0, policy_version 49140 (0.0007) [2023-03-06 22:04:32,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12748.8, 300 sec: 12739.3). Total num frames: 50324480. Throughput: 0: 12737.6. Samples: 50307521. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:04:32,390][62145] Avg episode reward: [(0, '316.178')] [2023-03-06 22:04:32,738][62475] Updated weights for policy 0, policy_version 49150 (0.0007) [2023-03-06 22:04:33,554][62475] Updated weights for policy 0, policy_version 49160 (0.0006) [2023-03-06 22:04:34,349][62475] Updated weights for policy 0, policy_version 49170 (0.0007) [2023-03-06 22:04:35,180][62475] Updated weights for policy 0, policy_version 49180 (0.0006) [2023-03-06 22:04:35,980][62475] Updated weights for policy 0, policy_version 49190 (0.0007) [2023-03-06 22:04:36,782][62475] Updated weights for policy 0, policy_version 49200 (0.0006) [2023-03-06 22:04:37,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12739.3). Total num frames: 50387968. Throughput: 0: 12733.6. Samples: 50383831. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:04:37,390][62145] Avg episode reward: [(0, '388.738')] [2023-03-06 22:04:37,573][62475] Updated weights for policy 0, policy_version 49210 (0.0007) [2023-03-06 22:04:38,381][62475] Updated weights for policy 0, policy_version 49220 (0.0006) [2023-03-06 22:04:39,180][62475] Updated weights for policy 0, policy_version 49230 (0.0006) [2023-03-06 22:04:40,006][62475] Updated weights for policy 0, policy_version 49240 (0.0007) [2023-03-06 22:04:40,784][62475] Updated weights for policy 0, policy_version 49250 (0.0006) [2023-03-06 22:04:41,568][62475] Updated weights for policy 0, policy_version 49260 (0.0007) [2023-03-06 22:04:42,381][62475] Updated weights for policy 0, policy_version 49270 (0.0006) [2023-03-06 22:04:42,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12742.7). Total num frames: 50452480. Throughput: 0: 12732.4. Samples: 50421991. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:04:42,390][62145] Avg episode reward: [(0, '295.847')] [2023-03-06 22:04:43,176][62475] Updated weights for policy 0, policy_version 49280 (0.0007) [2023-03-06 22:04:43,981][62475] Updated weights for policy 0, policy_version 49290 (0.0007) [2023-03-06 22:04:44,790][62475] Updated weights for policy 0, policy_version 49300 (0.0007) [2023-03-06 22:04:45,588][62475] Updated weights for policy 0, policy_version 49310 (0.0006) [2023-03-06 22:04:46,372][62475] Updated weights for policy 0, policy_version 49320 (0.0007) [2023-03-06 22:04:47,170][62475] Updated weights for policy 0, policy_version 49330 (0.0006) [2023-03-06 22:04:47,390][62145] Fps is (10 sec: 12800.1, 60 sec: 12748.8, 300 sec: 12742.7). Total num frames: 50515968. Throughput: 0: 12745.6. Samples: 50498972. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:04:47,390][62145] Avg episode reward: [(0, '304.192')] [2023-03-06 22:04:47,975][62475] Updated weights for policy 0, policy_version 49340 (0.0006) [2023-03-06 22:04:48,792][62475] Updated weights for policy 0, policy_version 49350 (0.0006) [2023-03-06 22:04:49,577][62475] Updated weights for policy 0, policy_version 49360 (0.0007) [2023-03-06 22:04:50,386][62475] Updated weights for policy 0, policy_version 49370 (0.0006) [2023-03-06 22:04:51,204][62475] Updated weights for policy 0, policy_version 49380 (0.0006) [2023-03-06 22:04:51,998][62475] Updated weights for policy 0, policy_version 49390 (0.0006) [2023-03-06 22:04:52,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12746.2). Total num frames: 50580480. Throughput: 0: 12746.8. Samples: 50575585. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:04:52,390][62145] Avg episode reward: [(0, '312.094')] [2023-03-06 22:04:52,803][62475] Updated weights for policy 0, policy_version 49400 (0.0006) [2023-03-06 22:04:53,595][62475] Updated weights for policy 0, policy_version 49410 (0.0007) [2023-03-06 22:04:54,382][62475] Updated weights for policy 0, policy_version 49420 (0.0006) [2023-03-06 22:04:55,197][62475] Updated weights for policy 0, policy_version 49430 (0.0007) [2023-03-06 22:04:55,989][62475] Updated weights for policy 0, policy_version 49440 (0.0006) [2023-03-06 22:04:56,785][62475] Updated weights for policy 0, policy_version 49450 (0.0006) [2023-03-06 22:04:57,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12742.7). Total num frames: 50643968. Throughput: 0: 12754.4. Samples: 50614044. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:04:57,390][62145] Avg episode reward: [(0, '278.954')] [2023-03-06 22:04:57,602][62475] Updated weights for policy 0, policy_version 49460 (0.0006) [2023-03-06 22:04:58,415][62475] Updated weights for policy 0, policy_version 49470 (0.0006) [2023-03-06 22:04:59,210][62475] Updated weights for policy 0, policy_version 49480 (0.0007) [2023-03-06 22:05:00,028][62475] Updated weights for policy 0, policy_version 49490 (0.0006) [2023-03-06 22:05:00,821][62475] Updated weights for policy 0, policy_version 49500 (0.0006) [2023-03-06 22:05:01,616][62475] Updated weights for policy 0, policy_version 49510 (0.0006) [2023-03-06 22:05:02,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12748.8, 300 sec: 12742.7). Total num frames: 50707456. Throughput: 0: 12754.5. Samples: 50690567. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:05:02,390][62145] Avg episode reward: [(0, '267.325')] [2023-03-06 22:05:02,414][62475] Updated weights for policy 0, policy_version 49520 (0.0007) [2023-03-06 22:05:03,237][62475] Updated weights for policy 0, policy_version 49530 (0.0006) [2023-03-06 22:05:04,026][62475] Updated weights for policy 0, policy_version 49540 (0.0005) [2023-03-06 22:05:04,839][62475] Updated weights for policy 0, policy_version 49550 (0.0006) [2023-03-06 22:05:05,641][62475] Updated weights for policy 0, policy_version 49560 (0.0006) [2023-03-06 22:05:06,441][62475] Updated weights for policy 0, policy_version 49570 (0.0006) [2023-03-06 22:05:07,224][62475] Updated weights for policy 0, policy_version 49580 (0.0006) [2023-03-06 22:05:07,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12765.9, 300 sec: 12746.2). Total num frames: 50771968. Throughput: 0: 12760.1. Samples: 50767114. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:05:07,390][62145] Avg episode reward: [(0, '272.406')] [2023-03-06 22:05:08,026][62475] Updated weights for policy 0, policy_version 49590 (0.0007) [2023-03-06 22:05:08,834][62475] Updated weights for policy 0, policy_version 49600 (0.0006) [2023-03-06 22:05:09,626][62475] Updated weights for policy 0, policy_version 49610 (0.0006) [2023-03-06 22:05:10,430][62475] Updated weights for policy 0, policy_version 49620 (0.0006) [2023-03-06 22:05:11,234][62475] Updated weights for policy 0, policy_version 49630 (0.0007) [2023-03-06 22:05:12,034][62475] Updated weights for policy 0, policy_version 49640 (0.0006) [2023-03-06 22:05:12,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12765.9, 300 sec: 12746.2). Total num frames: 50835456. Throughput: 0: 12766.4. Samples: 50805465. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:05:12,390][62145] Avg episode reward: [(0, '317.741')] [2023-03-06 22:05:12,830][62475] Updated weights for policy 0, policy_version 49650 (0.0007) [2023-03-06 22:05:13,632][62475] Updated weights for policy 0, policy_version 49660 (0.0007) [2023-03-06 22:05:14,445][62475] Updated weights for policy 0, policy_version 49670 (0.0007) [2023-03-06 22:05:15,248][62475] Updated weights for policy 0, policy_version 49680 (0.0006) [2023-03-06 22:05:16,052][62475] Updated weights for policy 0, policy_version 49690 (0.0007) [2023-03-06 22:05:16,854][62475] Updated weights for policy 0, policy_version 49700 (0.0006) [2023-03-06 22:05:17,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12748.8, 300 sec: 12746.2). Total num frames: 50898944. Throughput: 0: 12766.3. Samples: 50882004. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:05:17,390][62145] Avg episode reward: [(0, '276.256')] [2023-03-06 22:05:17,670][62475] Updated weights for policy 0, policy_version 49710 (0.0007) [2023-03-06 22:05:18,474][62475] Updated weights for policy 0, policy_version 49720 (0.0007) [2023-03-06 22:05:19,279][62475] Updated weights for policy 0, policy_version 49730 (0.0006) [2023-03-06 22:05:20,069][62475] Updated weights for policy 0, policy_version 49740 (0.0006) [2023-03-06 22:05:20,885][62475] Updated weights for policy 0, policy_version 49750 (0.0006) [2023-03-06 22:05:21,666][62475] Updated weights for policy 0, policy_version 49760 (0.0006) [2023-03-06 22:05:22,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12748.8, 300 sec: 12746.2). Total num frames: 50962432. Throughput: 0: 12769.7. Samples: 50958469. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:05:22,390][62145] Avg episode reward: [(0, '259.350')] [2023-03-06 22:05:22,478][62475] Updated weights for policy 0, policy_version 49770 (0.0006) [2023-03-06 22:05:23,281][62475] Updated weights for policy 0, policy_version 49780 (0.0006) [2023-03-06 22:05:24,110][62475] Updated weights for policy 0, policy_version 49790 (0.0006) [2023-03-06 22:05:24,899][62475] Updated weights for policy 0, policy_version 49800 (0.0007) [2023-03-06 22:05:25,713][62475] Updated weights for policy 0, policy_version 49810 (0.0006) [2023-03-06 22:05:26,529][62475] Updated weights for policy 0, policy_version 49820 (0.0006) [2023-03-06 22:05:27,323][62475] Updated weights for policy 0, policy_version 49830 (0.0007) [2023-03-06 22:05:27,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12748.8, 300 sec: 12746.2). Total num frames: 51025920. Throughput: 0: 12767.8. Samples: 50996541. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:05:27,390][62145] Avg episode reward: [(0, '304.377')] [2023-03-06 22:05:28,137][62475] Updated weights for policy 0, policy_version 49840 (0.0006) [2023-03-06 22:05:28,940][62475] Updated weights for policy 0, policy_version 49850 (0.0005) [2023-03-06 22:05:29,741][62475] Updated weights for policy 0, policy_version 49860 (0.0006) [2023-03-06 22:05:30,530][62475] Updated weights for policy 0, policy_version 49870 (0.0007) [2023-03-06 22:05:31,342][62475] Updated weights for policy 0, policy_version 49880 (0.0006) [2023-03-06 22:05:32,146][62475] Updated weights for policy 0, policy_version 49890 (0.0006) [2023-03-06 22:05:32,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12765.9, 300 sec: 12749.7). Total num frames: 51090432. Throughput: 0: 12756.6. Samples: 51073019. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:05:32,390][62145] Avg episode reward: [(0, '285.777')] [2023-03-06 22:05:32,945][62475] Updated weights for policy 0, policy_version 49900 (0.0007) [2023-03-06 22:05:33,755][62475] Updated weights for policy 0, policy_version 49910 (0.0006) [2023-03-06 22:05:34,548][62475] Updated weights for policy 0, policy_version 49920 (0.0007) [2023-03-06 22:05:35,339][62475] Updated weights for policy 0, policy_version 49930 (0.0006) [2023-03-06 22:05:36,150][62475] Updated weights for policy 0, policy_version 49940 (0.0006) [2023-03-06 22:05:36,940][62475] Updated weights for policy 0, policy_version 49950 (0.0006) [2023-03-06 22:05:37,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12765.9, 300 sec: 12746.2). Total num frames: 51153920. Throughput: 0: 12757.3. Samples: 51149661. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:05:37,390][62145] Avg episode reward: [(0, '294.335')] [2023-03-06 22:05:37,734][62475] Updated weights for policy 0, policy_version 49960 (0.0006) [2023-03-06 22:05:38,569][62475] Updated weights for policy 0, policy_version 49970 (0.0006) [2023-03-06 22:05:39,363][62475] Updated weights for policy 0, policy_version 49980 (0.0006) [2023-03-06 22:05:40,172][62475] Updated weights for policy 0, policy_version 49990 (0.0006) [2023-03-06 22:05:40,978][62475] Updated weights for policy 0, policy_version 50000 (0.0006) [2023-03-06 22:05:41,787][62475] Updated weights for policy 0, policy_version 50010 (0.0006) [2023-03-06 22:05:42,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12748.8, 300 sec: 12746.2). Total num frames: 51217408. Throughput: 0: 12746.4. Samples: 51187632. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:05:42,390][62145] Avg episode reward: [(0, '276.629')] [2023-03-06 22:05:42,590][62475] Updated weights for policy 0, policy_version 50020 (0.0006) [2023-03-06 22:05:43,394][62475] Updated weights for policy 0, policy_version 50030 (0.0006) [2023-03-06 22:05:44,202][62475] Updated weights for policy 0, policy_version 50040 (0.0007) [2023-03-06 22:05:44,997][62475] Updated weights for policy 0, policy_version 50050 (0.0006) [2023-03-06 22:05:45,815][62475] Updated weights for policy 0, policy_version 50060 (0.0007) [2023-03-06 22:05:46,605][62475] Updated weights for policy 0, policy_version 50070 (0.0006) [2023-03-06 22:05:47,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12748.8, 300 sec: 12746.2). Total num frames: 51280896. Throughput: 0: 12746.6. Samples: 51264166. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:05:47,390][62145] Avg episode reward: [(0, '262.415')] [2023-03-06 22:05:47,393][62475] Updated weights for policy 0, policy_version 50080 (0.0007) [2023-03-06 22:05:48,210][62475] Updated weights for policy 0, policy_version 50090 (0.0006) [2023-03-06 22:05:48,989][62475] Updated weights for policy 0, policy_version 50100 (0.0006) [2023-03-06 22:05:49,797][62475] Updated weights for policy 0, policy_version 50110 (0.0007) [2023-03-06 22:05:50,603][62475] Updated weights for policy 0, policy_version 50120 (0.0007) [2023-03-06 22:05:51,393][62475] Updated weights for policy 0, policy_version 50130 (0.0006) [2023-03-06 22:05:52,208][62475] Updated weights for policy 0, policy_version 50140 (0.0006) [2023-03-06 22:05:52,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12748.8, 300 sec: 12749.7). Total num frames: 51345408. Throughput: 0: 12750.9. Samples: 51340904. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:05:52,390][62145] Avg episode reward: [(0, '269.777')] [2023-03-06 22:05:53,019][62475] Updated weights for policy 0, policy_version 50150 (0.0006) [2023-03-06 22:05:53,830][62475] Updated weights for policy 0, policy_version 50160 (0.0006) [2023-03-06 22:05:54,619][62475] Updated weights for policy 0, policy_version 50170 (0.0007) [2023-03-06 22:05:55,418][62475] Updated weights for policy 0, policy_version 50180 (0.0006) [2023-03-06 22:05:56,232][62475] Updated weights for policy 0, policy_version 50190 (0.0006) [2023-03-06 22:05:57,057][62475] Updated weights for policy 0, policy_version 50200 (0.0006) [2023-03-06 22:05:57,390][62145] Fps is (10 sec: 12800.1, 60 sec: 12748.8, 300 sec: 12749.7). Total num frames: 51408896. Throughput: 0: 12748.4. Samples: 51379144. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:05:57,390][62145] Avg episode reward: [(0, '294.700')] [2023-03-06 22:05:57,858][62475] Updated weights for policy 0, policy_version 50210 (0.0007) [2023-03-06 22:05:58,646][62475] Updated weights for policy 0, policy_version 50220 (0.0007) [2023-03-06 22:05:59,469][62475] Updated weights for policy 0, policy_version 50230 (0.0006) [2023-03-06 22:06:00,264][62475] Updated weights for policy 0, policy_version 50240 (0.0006) [2023-03-06 22:06:01,049][62475] Updated weights for policy 0, policy_version 50250 (0.0007) [2023-03-06 22:06:01,869][62475] Updated weights for policy 0, policy_version 50260 (0.0007) [2023-03-06 22:06:02,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12748.8, 300 sec: 12753.1). Total num frames: 51472384. Throughput: 0: 12739.2. Samples: 51455268. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:06:02,390][62145] Avg episode reward: [(0, '258.210')] [2023-03-06 22:06:02,658][62475] Updated weights for policy 0, policy_version 50270 (0.0006) [2023-03-06 22:06:03,451][62475] Updated weights for policy 0, policy_version 50280 (0.0006) [2023-03-06 22:06:04,259][62475] Updated weights for policy 0, policy_version 50290 (0.0006) [2023-03-06 22:06:05,068][62475] Updated weights for policy 0, policy_version 50300 (0.0007) [2023-03-06 22:06:05,884][62475] Updated weights for policy 0, policy_version 50310 (0.0006) [2023-03-06 22:06:06,683][62475] Updated weights for policy 0, policy_version 50320 (0.0006) [2023-03-06 22:06:07,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12731.7, 300 sec: 12749.7). Total num frames: 51535872. Throughput: 0: 12740.7. Samples: 51531799. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:06:07,390][62145] Avg episode reward: [(0, '285.270')] [2023-03-06 22:06:07,482][62475] Updated weights for policy 0, policy_version 50330 (0.0006) [2023-03-06 22:06:08,286][62475] Updated weights for policy 0, policy_version 50340 (0.0006) [2023-03-06 22:06:09,091][62475] Updated weights for policy 0, policy_version 50350 (0.0007) [2023-03-06 22:06:09,896][62475] Updated weights for policy 0, policy_version 50360 (0.0007) [2023-03-06 22:06:10,708][62475] Updated weights for policy 0, policy_version 50370 (0.0006) [2023-03-06 22:06:11,498][62475] Updated weights for policy 0, policy_version 50380 (0.0007) [2023-03-06 22:06:12,291][62475] Updated weights for policy 0, policy_version 50390 (0.0006) [2023-03-06 22:06:12,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12748.8, 300 sec: 12753.1). Total num frames: 51600384. Throughput: 0: 12744.5. Samples: 51570044. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:06:12,390][62145] Avg episode reward: [(0, '249.699')] [2023-03-06 22:06:13,109][62475] Updated weights for policy 0, policy_version 50400 (0.0007) [2023-03-06 22:06:13,906][62475] Updated weights for policy 0, policy_version 50410 (0.0006) [2023-03-06 22:06:14,690][62475] Updated weights for policy 0, policy_version 50420 (0.0006) [2023-03-06 22:06:15,504][62475] Updated weights for policy 0, policy_version 50430 (0.0005) [2023-03-06 22:06:16,326][62475] Updated weights for policy 0, policy_version 50440 (0.0006) [2023-03-06 22:06:17,114][62475] Updated weights for policy 0, policy_version 50450 (0.0007) [2023-03-06 22:06:17,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12753.1). Total num frames: 51663872. Throughput: 0: 12747.4. Samples: 51646653. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:06:17,390][62145] Avg episode reward: [(0, '259.135')] [2023-03-06 22:06:17,917][62475] Updated weights for policy 0, policy_version 50460 (0.0006) [2023-03-06 22:06:18,720][62475] Updated weights for policy 0, policy_version 50470 (0.0007) [2023-03-06 22:06:19,521][62475] Updated weights for policy 0, policy_version 50480 (0.0007) [2023-03-06 22:06:20,318][62475] Updated weights for policy 0, policy_version 50490 (0.0006) [2023-03-06 22:06:21,132][62475] Updated weights for policy 0, policy_version 50500 (0.0006) [2023-03-06 22:06:21,922][62475] Updated weights for policy 0, policy_version 50510 (0.0006) [2023-03-06 22:06:22,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12748.8, 300 sec: 12753.1). Total num frames: 51727360. Throughput: 0: 12751.1. Samples: 51723463. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:06:22,390][62145] Avg episode reward: [(0, '271.031')] [2023-03-06 22:06:22,394][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000050516_51728384.pth... [2023-03-06 22:06:22,425][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000047526_48666624.pth [2023-03-06 22:06:22,714][62475] Updated weights for policy 0, policy_version 50520 (0.0005) [2023-03-06 22:06:23,520][62475] Updated weights for policy 0, policy_version 50530 (0.0006) [2023-03-06 22:06:24,305][62475] Updated weights for policy 0, policy_version 50540 (0.0006) [2023-03-06 22:06:25,118][62475] Updated weights for policy 0, policy_version 50550 (0.0006) [2023-03-06 22:06:25,950][62475] Updated weights for policy 0, policy_version 50560 (0.0006) [2023-03-06 22:06:26,751][62475] Updated weights for policy 0, policy_version 50570 (0.0006) [2023-03-06 22:06:27,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12748.8, 300 sec: 12749.7). Total num frames: 51790848. Throughput: 0: 12757.1. Samples: 51761701. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:06:27,390][62145] Avg episode reward: [(0, '265.701')] [2023-03-06 22:06:27,562][62475] Updated weights for policy 0, policy_version 50580 (0.0006) [2023-03-06 22:06:28,338][62475] Updated weights for policy 0, policy_version 50590 (0.0006) [2023-03-06 22:06:29,153][62475] Updated weights for policy 0, policy_version 50600 (0.0006) [2023-03-06 22:06:29,949][62475] Updated weights for policy 0, policy_version 50610 (0.0007) [2023-03-06 22:06:30,758][62475] Updated weights for policy 0, policy_version 50620 (0.0006) [2023-03-06 22:06:31,560][62475] Updated weights for policy 0, policy_version 50630 (0.0006) [2023-03-06 22:06:32,358][62475] Updated weights for policy 0, policy_version 50640 (0.0006) [2023-03-06 22:06:32,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12753.1). Total num frames: 51855360. Throughput: 0: 12756.6. Samples: 51838214. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:06:32,390][62145] Avg episode reward: [(0, '310.186')] [2023-03-06 22:06:33,153][62475] Updated weights for policy 0, policy_version 50650 (0.0006) [2023-03-06 22:06:33,961][62475] Updated weights for policy 0, policy_version 50660 (0.0006) [2023-03-06 22:06:34,754][62475] Updated weights for policy 0, policy_version 50670 (0.0006) [2023-03-06 22:06:35,558][62475] Updated weights for policy 0, policy_version 50680 (0.0007) [2023-03-06 22:06:36,365][62475] Updated weights for policy 0, policy_version 50690 (0.0006) [2023-03-06 22:06:37,147][62475] Updated weights for policy 0, policy_version 50700 (0.0006) [2023-03-06 22:06:37,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12748.8, 300 sec: 12753.1). Total num frames: 51918848. Throughput: 0: 12753.8. Samples: 51914827. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:06:37,390][62145] Avg episode reward: [(0, '274.157')] [2023-03-06 22:06:37,976][62475] Updated weights for policy 0, policy_version 50710 (0.0008) [2023-03-06 22:06:38,781][62475] Updated weights for policy 0, policy_version 50720 (0.0007) [2023-03-06 22:06:39,581][62475] Updated weights for policy 0, policy_version 50730 (0.0007) [2023-03-06 22:06:40,385][62475] Updated weights for policy 0, policy_version 50740 (0.0006) [2023-03-06 22:06:41,197][62475] Updated weights for policy 0, policy_version 50750 (0.0006) [2023-03-06 22:06:42,003][62475] Updated weights for policy 0, policy_version 50760 (0.0006) [2023-03-06 22:06:42,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12765.9, 300 sec: 12753.1). Total num frames: 51983360. Throughput: 0: 12753.1. Samples: 51953032. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:06:42,390][62145] Avg episode reward: [(0, '341.006')] [2023-03-06 22:06:42,797][62475] Updated weights for policy 0, policy_version 50770 (0.0006) [2023-03-06 22:06:43,599][62475] Updated weights for policy 0, policy_version 50780 (0.0006) [2023-03-06 22:06:44,398][62475] Updated weights for policy 0, policy_version 50790 (0.0007) [2023-03-06 22:06:45,201][62475] Updated weights for policy 0, policy_version 50800 (0.0006) [2023-03-06 22:06:45,994][62475] Updated weights for policy 0, policy_version 50810 (0.0007) [2023-03-06 22:06:46,777][62475] Updated weights for policy 0, policy_version 50820 (0.0007) [2023-03-06 22:06:47,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12765.9, 300 sec: 12753.1). Total num frames: 52046848. Throughput: 0: 12763.6. Samples: 52029630. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:06:47,390][62145] Avg episode reward: [(0, '247.190')] [2023-03-06 22:06:47,592][62475] Updated weights for policy 0, policy_version 50830 (0.0006) [2023-03-06 22:06:48,407][62475] Updated weights for policy 0, policy_version 50840 (0.0006) [2023-03-06 22:06:49,194][62475] Updated weights for policy 0, policy_version 50850 (0.0006) [2023-03-06 22:06:50,005][62475] Updated weights for policy 0, policy_version 50860 (0.0007) [2023-03-06 22:06:50,794][62475] Updated weights for policy 0, policy_version 50870 (0.0006) [2023-03-06 22:06:51,582][62475] Updated weights for policy 0, policy_version 50880 (0.0006) [2023-03-06 22:06:52,390][62145] Fps is (10 sec: 12697.4, 60 sec: 12748.8, 300 sec: 12753.1). Total num frames: 52110336. Throughput: 0: 12766.5. Samples: 52106294. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:06:52,390][62145] Avg episode reward: [(0, '316.055')] [2023-03-06 22:06:52,412][62475] Updated weights for policy 0, policy_version 50890 (0.0006) [2023-03-06 22:06:53,190][62475] Updated weights for policy 0, policy_version 50900 (0.0006) [2023-03-06 22:06:54,006][62475] Updated weights for policy 0, policy_version 50910 (0.0007) [2023-03-06 22:06:54,826][62475] Updated weights for policy 0, policy_version 50920 (0.0006) [2023-03-06 22:06:55,629][62475] Updated weights for policy 0, policy_version 50930 (0.0006) [2023-03-06 22:06:56,439][62475] Updated weights for policy 0, policy_version 50940 (0.0007) [2023-03-06 22:06:57,245][62475] Updated weights for policy 0, policy_version 50950 (0.0005) [2023-03-06 22:06:57,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12748.8, 300 sec: 12753.1). Total num frames: 52173824. Throughput: 0: 12765.5. Samples: 52144491. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:06:57,390][62145] Avg episode reward: [(0, '434.618')] [2023-03-06 22:06:58,044][62475] Updated weights for policy 0, policy_version 50960 (0.0006) [2023-03-06 22:06:58,835][62475] Updated weights for policy 0, policy_version 50970 (0.0006) [2023-03-06 22:06:59,650][62475] Updated weights for policy 0, policy_version 50980 (0.0007) [2023-03-06 22:07:00,441][62475] Updated weights for policy 0, policy_version 50990 (0.0006) [2023-03-06 22:07:01,244][62475] Updated weights for policy 0, policy_version 51000 (0.0006) [2023-03-06 22:07:02,050][62475] Updated weights for policy 0, policy_version 51010 (0.0005) [2023-03-06 22:07:02,390][62145] Fps is (10 sec: 12800.1, 60 sec: 12765.9, 300 sec: 12756.6). Total num frames: 52238336. Throughput: 0: 12764.1. Samples: 52221036. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:07:02,390][62145] Avg episode reward: [(0, '427.465')] [2023-03-06 22:07:02,863][62475] Updated weights for policy 0, policy_version 51020 (0.0006) [2023-03-06 22:07:03,664][62475] Updated weights for policy 0, policy_version 51030 (0.0006) [2023-03-06 22:07:04,474][62475] Updated weights for policy 0, policy_version 51040 (0.0006) [2023-03-06 22:07:05,279][62475] Updated weights for policy 0, policy_version 51050 (0.0006) [2023-03-06 22:07:06,118][62475] Updated weights for policy 0, policy_version 51060 (0.0007) [2023-03-06 22:07:06,905][62475] Updated weights for policy 0, policy_version 51070 (0.0006) [2023-03-06 22:07:07,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12765.8, 300 sec: 12756.6). Total num frames: 52301824. Throughput: 0: 12742.5. Samples: 52296878. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:07:07,390][62145] Avg episode reward: [(0, '563.604')] [2023-03-06 22:07:07,707][62475] Updated weights for policy 0, policy_version 51080 (0.0006) [2023-03-06 22:07:08,515][62475] Updated weights for policy 0, policy_version 51090 (0.0006) [2023-03-06 22:07:09,301][62475] Updated weights for policy 0, policy_version 51100 (0.0006) [2023-03-06 22:07:10,105][62475] Updated weights for policy 0, policy_version 51110 (0.0006) [2023-03-06 22:07:10,921][62475] Updated weights for policy 0, policy_version 51120 (0.0006) [2023-03-06 22:07:11,735][62475] Updated weights for policy 0, policy_version 51130 (0.0006) [2023-03-06 22:07:12,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12748.8, 300 sec: 12756.6). Total num frames: 52365312. Throughput: 0: 12743.8. Samples: 52335171. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:07:12,390][62145] Avg episode reward: [(0, '400.424')] [2023-03-06 22:07:12,548][62475] Updated weights for policy 0, policy_version 51140 (0.0006) [2023-03-06 22:07:13,335][62475] Updated weights for policy 0, policy_version 51150 (0.0006) [2023-03-06 22:07:14,132][62475] Updated weights for policy 0, policy_version 51160 (0.0006) [2023-03-06 22:07:14,937][62475] Updated weights for policy 0, policy_version 51170 (0.0006) [2023-03-06 22:07:15,737][62475] Updated weights for policy 0, policy_version 51180 (0.0006) [2023-03-06 22:07:16,550][62475] Updated weights for policy 0, policy_version 51190 (0.0006) [2023-03-06 22:07:17,360][62475] Updated weights for policy 0, policy_version 51200 (0.0007) [2023-03-06 22:07:17,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12748.8, 300 sec: 12753.1). Total num frames: 52428800. Throughput: 0: 12741.4. Samples: 52411578. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:07:17,390][62145] Avg episode reward: [(0, '404.237')] [2023-03-06 22:07:18,181][62475] Updated weights for policy 0, policy_version 51210 (0.0007) [2023-03-06 22:07:18,991][62475] Updated weights for policy 0, policy_version 51220 (0.0007) [2023-03-06 22:07:19,796][62475] Updated weights for policy 0, policy_version 51230 (0.0006) [2023-03-06 22:07:20,605][62475] Updated weights for policy 0, policy_version 51240 (0.0007) [2023-03-06 22:07:21,416][62475] Updated weights for policy 0, policy_version 51250 (0.0006) [2023-03-06 22:07:22,221][62475] Updated weights for policy 0, policy_version 51260 (0.0007) [2023-03-06 22:07:22,389][62145] Fps is (10 sec: 12697.5, 60 sec: 12748.8, 300 sec: 12753.1). Total num frames: 52492288. Throughput: 0: 12729.7. Samples: 52487664. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:07:22,390][62145] Avg episode reward: [(0, '362.878')] [2023-03-06 22:07:23,006][62475] Updated weights for policy 0, policy_version 51270 (0.0006) [2023-03-06 22:07:23,806][62475] Updated weights for policy 0, policy_version 51280 (0.0006) [2023-03-06 22:07:24,613][62475] Updated weights for policy 0, policy_version 51290 (0.0006) [2023-03-06 22:07:25,399][62475] Updated weights for policy 0, policy_version 51300 (0.0006) [2023-03-06 22:07:26,215][62475] Updated weights for policy 0, policy_version 51310 (0.0006) [2023-03-06 22:07:27,008][62475] Updated weights for policy 0, policy_version 51320 (0.0007) [2023-03-06 22:07:27,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12748.8, 300 sec: 12753.1). Total num frames: 52555776. Throughput: 0: 12737.1. Samples: 52526204. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:07:27,390][62145] Avg episode reward: [(0, '393.578')] [2023-03-06 22:07:27,819][62475] Updated weights for policy 0, policy_version 51330 (0.0006) [2023-03-06 22:07:28,614][62475] Updated weights for policy 0, policy_version 51340 (0.0006) [2023-03-06 22:07:29,405][62475] Updated weights for policy 0, policy_version 51350 (0.0007) [2023-03-06 22:07:30,223][62475] Updated weights for policy 0, policy_version 51360 (0.0005) [2023-03-06 22:07:31,012][62475] Updated weights for policy 0, policy_version 51370 (0.0005) [2023-03-06 22:07:31,802][62475] Updated weights for policy 0, policy_version 51380 (0.0007) [2023-03-06 22:07:32,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12748.8, 300 sec: 12756.6). Total num frames: 52620288. Throughput: 0: 12740.7. Samples: 52602960. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:07:32,390][62145] Avg episode reward: [(0, '330.624')] [2023-03-06 22:07:32,617][62475] Updated weights for policy 0, policy_version 51390 (0.0007) [2023-03-06 22:07:33,418][62475] Updated weights for policy 0, policy_version 51400 (0.0006) [2023-03-06 22:07:34,201][62475] Updated weights for policy 0, policy_version 51410 (0.0006) [2023-03-06 22:07:35,014][62475] Updated weights for policy 0, policy_version 51420 (0.0006) [2023-03-06 22:07:35,827][62475] Updated weights for policy 0, policy_version 51430 (0.0006) [2023-03-06 22:07:36,630][62475] Updated weights for policy 0, policy_version 51440 (0.0006) [2023-03-06 22:07:37,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12748.8, 300 sec: 12756.6). Total num frames: 52683776. Throughput: 0: 12732.2. Samples: 52679241. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:07:37,390][62145] Avg episode reward: [(0, '345.971')] [2023-03-06 22:07:37,464][62475] Updated weights for policy 0, policy_version 51450 (0.0006) [2023-03-06 22:07:38,269][62475] Updated weights for policy 0, policy_version 51460 (0.0006) [2023-03-06 22:07:39,070][62475] Updated weights for policy 0, policy_version 51470 (0.0006) [2023-03-06 22:07:39,873][62475] Updated weights for policy 0, policy_version 51480 (0.0007) [2023-03-06 22:07:40,688][62475] Updated weights for policy 0, policy_version 51490 (0.0007) [2023-03-06 22:07:41,483][62475] Updated weights for policy 0, policy_version 51500 (0.0006) [2023-03-06 22:07:42,292][62475] Updated weights for policy 0, policy_version 51510 (0.0006) [2023-03-06 22:07:42,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12756.6). Total num frames: 52747264. Throughput: 0: 12730.7. Samples: 52717373. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:07:42,390][62145] Avg episode reward: [(0, '376.305')] [2023-03-06 22:07:43,077][62475] Updated weights for policy 0, policy_version 51520 (0.0006) [2023-03-06 22:07:43,873][62475] Updated weights for policy 0, policy_version 51530 (0.0007) [2023-03-06 22:07:44,677][62475] Updated weights for policy 0, policy_version 51540 (0.0006) [2023-03-06 22:07:45,490][62475] Updated weights for policy 0, policy_version 51550 (0.0006) [2023-03-06 22:07:46,309][62475] Updated weights for policy 0, policy_version 51560 (0.0006) [2023-03-06 22:07:47,107][62475] Updated weights for policy 0, policy_version 51570 (0.0007) [2023-03-06 22:07:47,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12756.6). Total num frames: 52810752. Throughput: 0: 12727.4. Samples: 52793767. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:07:47,390][62145] Avg episode reward: [(0, '411.564')] [2023-03-06 22:07:47,897][62475] Updated weights for policy 0, policy_version 51580 (0.0006) [2023-03-06 22:07:48,706][62475] Updated weights for policy 0, policy_version 51590 (0.0007) [2023-03-06 22:07:49,517][62475] Updated weights for policy 0, policy_version 51600 (0.0007) [2023-03-06 22:07:50,344][62475] Updated weights for policy 0, policy_version 51610 (0.0006) [2023-03-06 22:07:51,167][62475] Updated weights for policy 0, policy_version 51620 (0.0006) [2023-03-06 22:07:51,962][62475] Updated weights for policy 0, policy_version 51630 (0.0006) [2023-03-06 22:07:52,390][62145] Fps is (10 sec: 12697.7, 60 sec: 12731.7, 300 sec: 12753.1). Total num frames: 52874240. Throughput: 0: 12732.7. Samples: 52869849. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:07:52,390][62145] Avg episode reward: [(0, '350.690')] [2023-03-06 22:07:52,760][62475] Updated weights for policy 0, policy_version 51640 (0.0006) [2023-03-06 22:07:53,550][62475] Updated weights for policy 0, policy_version 51650 (0.0007) [2023-03-06 22:07:54,354][62475] Updated weights for policy 0, policy_version 51660 (0.0007) [2023-03-06 22:07:55,167][62475] Updated weights for policy 0, policy_version 51670 (0.0007) [2023-03-06 22:07:55,963][62475] Updated weights for policy 0, policy_version 51680 (0.0006) [2023-03-06 22:07:56,755][62475] Updated weights for policy 0, policy_version 51690 (0.0006) [2023-03-06 22:07:57,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12753.1). Total num frames: 52937728. Throughput: 0: 12730.7. Samples: 52908055. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:07:57,390][62145] Avg episode reward: [(0, '435.769')] [2023-03-06 22:07:57,590][62475] Updated weights for policy 0, policy_version 51700 (0.0007) [2023-03-06 22:07:58,380][62475] Updated weights for policy 0, policy_version 51710 (0.0007) [2023-03-06 22:07:59,203][62475] Updated weights for policy 0, policy_version 51720 (0.0005) [2023-03-06 22:07:59,999][62475] Updated weights for policy 0, policy_version 51730 (0.0006) [2023-03-06 22:08:00,801][62475] Updated weights for policy 0, policy_version 51740 (0.0007) [2023-03-06 22:08:01,609][62475] Updated weights for policy 0, policy_version 51750 (0.0006) [2023-03-06 22:08:02,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12749.7). Total num frames: 53001216. Throughput: 0: 12728.7. Samples: 52984369. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:08:02,390][62145] Avg episode reward: [(0, '483.379')] [2023-03-06 22:08:02,417][62475] Updated weights for policy 0, policy_version 51760 (0.0006) [2023-03-06 22:08:03,202][62475] Updated weights for policy 0, policy_version 51770 (0.0006) [2023-03-06 22:08:04,020][62475] Updated weights for policy 0, policy_version 51780 (0.0005) [2023-03-06 22:08:04,816][62475] Updated weights for policy 0, policy_version 51790 (0.0008) [2023-03-06 22:08:05,615][62475] Updated weights for policy 0, policy_version 51800 (0.0006) [2023-03-06 22:08:06,421][62475] Updated weights for policy 0, policy_version 51810 (0.0007) [2023-03-06 22:08:07,226][62475] Updated weights for policy 0, policy_version 51820 (0.0006) [2023-03-06 22:08:07,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12749.7). Total num frames: 53064704. Throughput: 0: 12741.0. Samples: 53061010. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:08:07,390][62145] Avg episode reward: [(0, '317.589')] [2023-03-06 22:08:08,014][62475] Updated weights for policy 0, policy_version 51830 (0.0006) [2023-03-06 22:08:08,838][62475] Updated weights for policy 0, policy_version 51840 (0.0006) [2023-03-06 22:08:09,639][62475] Updated weights for policy 0, policy_version 51850 (0.0006) [2023-03-06 22:08:10,438][62475] Updated weights for policy 0, policy_version 51860 (0.0006) [2023-03-06 22:08:11,261][62475] Updated weights for policy 0, policy_version 51870 (0.0006) [2023-03-06 22:08:12,062][62475] Updated weights for policy 0, policy_version 51880 (0.0006) [2023-03-06 22:08:12,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.6, 300 sec: 12746.2). Total num frames: 53128192. Throughput: 0: 12732.3. Samples: 53099157. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:08:12,390][62145] Avg episode reward: [(0, '403.037')] [2023-03-06 22:08:12,873][62475] Updated weights for policy 0, policy_version 51890 (0.0006) [2023-03-06 22:08:13,678][62475] Updated weights for policy 0, policy_version 51900 (0.0007) [2023-03-06 22:08:14,454][62475] Updated weights for policy 0, policy_version 51910 (0.0006) [2023-03-06 22:08:15,297][62475] Updated weights for policy 0, policy_version 51920 (0.0006) [2023-03-06 22:08:15,771][62424] KL-divergence is very high: 102.5540 [2023-03-06 22:08:16,085][62475] Updated weights for policy 0, policy_version 51930 (0.0006) [2023-03-06 22:08:16,856][62475] Updated weights for policy 0, policy_version 51940 (0.0006) [2023-03-06 22:08:17,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12731.7, 300 sec: 12749.7). Total num frames: 53192704. Throughput: 0: 12721.0. Samples: 53175404. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:08:17,390][62145] Avg episode reward: [(0, '455.226')] [2023-03-06 22:08:17,687][62475] Updated weights for policy 0, policy_version 51950 (0.0007) [2023-03-06 22:08:18,479][62475] Updated weights for policy 0, policy_version 51960 (0.0007) [2023-03-06 22:08:19,285][62475] Updated weights for policy 0, policy_version 51970 (0.0007) [2023-03-06 22:08:20,124][62475] Updated weights for policy 0, policy_version 51980 (0.0006) [2023-03-06 22:08:20,912][62475] Updated weights for policy 0, policy_version 51990 (0.0006) [2023-03-06 22:08:21,697][62475] Updated weights for policy 0, policy_version 52000 (0.0006) [2023-03-06 22:08:22,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12746.2). Total num frames: 53256192. Throughput: 0: 12729.8. Samples: 53252083. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:08:22,390][62145] Avg episode reward: [(0, '411.261')] [2023-03-06 22:08:22,408][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000052009_53257216.pth... [2023-03-06 22:08:22,439][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000049021_50197504.pth [2023-03-06 22:08:22,519][62475] Updated weights for policy 0, policy_version 52010 (0.0006) [2023-03-06 22:08:23,302][62475] Updated weights for policy 0, policy_version 52020 (0.0006) [2023-03-06 22:08:24,091][62475] Updated weights for policy 0, policy_version 52030 (0.0006) [2023-03-06 22:08:24,887][62475] Updated weights for policy 0, policy_version 52040 (0.0007) [2023-03-06 22:08:25,690][62475] Updated weights for policy 0, policy_version 52050 (0.0006) [2023-03-06 22:08:26,524][62475] Updated weights for policy 0, policy_version 52060 (0.0006) [2023-03-06 22:08:27,318][62475] Updated weights for policy 0, policy_version 52070 (0.0006) [2023-03-06 22:08:27,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12746.2). Total num frames: 53319680. Throughput: 0: 12734.6. Samples: 53290429. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:08:27,390][62145] Avg episode reward: [(0, '376.974')] [2023-03-06 22:08:28,137][62475] Updated weights for policy 0, policy_version 52080 (0.0006) [2023-03-06 22:08:28,938][62475] Updated weights for policy 0, policy_version 52090 (0.0006) [2023-03-06 22:08:29,740][62475] Updated weights for policy 0, policy_version 52100 (0.0006) [2023-03-06 22:08:30,546][62475] Updated weights for policy 0, policy_version 52110 (0.0006) [2023-03-06 22:08:31,359][62475] Updated weights for policy 0, policy_version 52120 (0.0007) [2023-03-06 22:08:32,170][62475] Updated weights for policy 0, policy_version 52130 (0.0007) [2023-03-06 22:08:32,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12742.7). Total num frames: 53383168. Throughput: 0: 12727.7. Samples: 53366512. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:08:32,390][62145] Avg episode reward: [(0, '407.511')] [2023-03-06 22:08:32,963][62475] Updated weights for policy 0, policy_version 52140 (0.0006) [2023-03-06 22:08:33,771][62475] Updated weights for policy 0, policy_version 52150 (0.0007) [2023-03-06 22:08:34,572][62475] Updated weights for policy 0, policy_version 52160 (0.0006) [2023-03-06 22:08:35,365][62475] Updated weights for policy 0, policy_version 52170 (0.0006) [2023-03-06 22:08:36,178][62475] Updated weights for policy 0, policy_version 52180 (0.0006) [2023-03-06 22:08:36,967][62475] Updated weights for policy 0, policy_version 52190 (0.0006) [2023-03-06 22:08:37,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12746.2). Total num frames: 53447680. Throughput: 0: 12740.5. Samples: 53443170. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:08:37,400][62145] Avg episode reward: [(0, '419.174')] [2023-03-06 22:08:37,772][62475] Updated weights for policy 0, policy_version 52200 (0.0006) [2023-03-06 22:08:38,575][62475] Updated weights for policy 0, policy_version 52210 (0.0006) [2023-03-06 22:08:39,374][62475] Updated weights for policy 0, policy_version 52220 (0.0006) [2023-03-06 22:08:40,157][62475] Updated weights for policy 0, policy_version 52230 (0.0006) [2023-03-06 22:08:40,959][62475] Updated weights for policy 0, policy_version 52240 (0.0006) [2023-03-06 22:08:41,770][62475] Updated weights for policy 0, policy_version 52250 (0.0006) [2023-03-06 22:08:42,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.8, 300 sec: 12746.2). Total num frames: 53511168. Throughput: 0: 12745.0. Samples: 53481581. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:08:42,400][62145] Avg episode reward: [(0, '337.035')] [2023-03-06 22:08:42,574][62475] Updated weights for policy 0, policy_version 52260 (0.0006) [2023-03-06 22:08:43,357][62475] Updated weights for policy 0, policy_version 52270 (0.0006) [2023-03-06 22:08:44,178][62475] Updated weights for policy 0, policy_version 52280 (0.0006) [2023-03-06 22:08:44,970][62475] Updated weights for policy 0, policy_version 52290 (0.0006) [2023-03-06 22:08:45,774][62475] Updated weights for policy 0, policy_version 52300 (0.0006) [2023-03-06 22:08:46,595][62475] Updated weights for policy 0, policy_version 52310 (0.0006) [2023-03-06 22:08:47,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12742.7). Total num frames: 53574656. Throughput: 0: 12752.0. Samples: 53558208. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:08:47,401][62145] Avg episode reward: [(0, '289.539')] [2023-03-06 22:08:47,414][62475] Updated weights for policy 0, policy_version 52320 (0.0006) [2023-03-06 22:08:48,203][62475] Updated weights for policy 0, policy_version 52330 (0.0007) [2023-03-06 22:08:49,018][62475] Updated weights for policy 0, policy_version 52340 (0.0006) [2023-03-06 22:08:49,802][62475] Updated weights for policy 0, policy_version 52350 (0.0006) [2023-03-06 22:08:50,626][62475] Updated weights for policy 0, policy_version 52360 (0.0006) [2023-03-06 22:08:51,417][62475] Updated weights for policy 0, policy_version 52370 (0.0006) [2023-03-06 22:08:52,233][62475] Updated weights for policy 0, policy_version 52380 (0.0007) [2023-03-06 22:08:52,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12748.8, 300 sec: 12746.2). Total num frames: 53639168. Throughput: 0: 12740.4. Samples: 53634329. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:08:52,401][62145] Avg episode reward: [(0, '340.073')] [2023-03-06 22:08:53,036][62475] Updated weights for policy 0, policy_version 52390 (0.0008) [2023-03-06 22:08:53,845][62475] Updated weights for policy 0, policy_version 52400 (0.0006) [2023-03-06 22:08:54,645][62475] Updated weights for policy 0, policy_version 52410 (0.0006) [2023-03-06 22:08:54,809][62424] KL-divergence is very high: 135.7700 [2023-03-06 22:08:55,453][62475] Updated weights for policy 0, policy_version 52420 (0.0007) [2023-03-06 22:08:56,271][62475] Updated weights for policy 0, policy_version 52430 (0.0006) [2023-03-06 22:08:57,080][62475] Updated weights for policy 0, policy_version 52440 (0.0007) [2023-03-06 22:08:57,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12742.7). Total num frames: 53701632. Throughput: 0: 12738.8. Samples: 53672402. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:08:57,390][62145] Avg episode reward: [(0, '324.382')] [2023-03-06 22:08:57,874][62475] Updated weights for policy 0, policy_version 52450 (0.0006) [2023-03-06 22:08:58,677][62475] Updated weights for policy 0, policy_version 52460 (0.0006) [2023-03-06 22:08:59,473][62475] Updated weights for policy 0, policy_version 52470 (0.0006) [2023-03-06 22:09:00,257][62475] Updated weights for policy 0, policy_version 52480 (0.0006) [2023-03-06 22:09:01,073][62475] Updated weights for policy 0, policy_version 52490 (0.0006) [2023-03-06 22:09:01,883][62475] Updated weights for policy 0, policy_version 52500 (0.0007) [2023-03-06 22:09:02,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12748.8, 300 sec: 12746.2). Total num frames: 53766144. Throughput: 0: 12747.2. Samples: 53749030. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:09:02,390][62145] Avg episode reward: [(0, '311.516')] [2023-03-06 22:09:02,694][62475] Updated weights for policy 0, policy_version 52510 (0.0006) [2023-03-06 22:09:03,487][62475] Updated weights for policy 0, policy_version 52520 (0.0006) [2023-03-06 22:09:04,293][62475] Updated weights for policy 0, policy_version 52530 (0.0007) [2023-03-06 22:09:05,100][62475] Updated weights for policy 0, policy_version 52540 (0.0007) [2023-03-06 22:09:05,902][62475] Updated weights for policy 0, policy_version 52550 (0.0006) [2023-03-06 22:09:06,715][62475] Updated weights for policy 0, policy_version 52560 (0.0006) [2023-03-06 22:09:07,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12746.2). Total num frames: 53829632. Throughput: 0: 12734.6. Samples: 53825142. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:09:07,390][62145] Avg episode reward: [(0, '299.926')] [2023-03-06 22:09:07,517][62475] Updated weights for policy 0, policy_version 52570 (0.0007) [2023-03-06 22:09:08,309][62475] Updated weights for policy 0, policy_version 52580 (0.0007) [2023-03-06 22:09:09,134][62475] Updated weights for policy 0, policy_version 52590 (0.0006) [2023-03-06 22:09:09,930][62475] Updated weights for policy 0, policy_version 52600 (0.0006) [2023-03-06 22:09:10,749][62475] Updated weights for policy 0, policy_version 52610 (0.0006) [2023-03-06 22:09:11,547][62475] Updated weights for policy 0, policy_version 52620 (0.0006) [2023-03-06 22:09:12,351][62475] Updated weights for policy 0, policy_version 52630 (0.0007) [2023-03-06 22:09:12,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12748.8, 300 sec: 12742.7). Total num frames: 53893120. Throughput: 0: 12735.0. Samples: 53863506. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:09:12,390][62145] Avg episode reward: [(0, '331.502')] [2023-03-06 22:09:13,160][62475] Updated weights for policy 0, policy_version 52640 (0.0006) [2023-03-06 22:09:13,957][62475] Updated weights for policy 0, policy_version 52650 (0.0006) [2023-03-06 22:09:14,204][62424] KL-divergence is very high: 140.2845 [2023-03-06 22:09:14,762][62475] Updated weights for policy 0, policy_version 52660 (0.0006) [2023-03-06 22:09:15,562][62475] Updated weights for policy 0, policy_version 52670 (0.0006) [2023-03-06 22:09:16,383][62475] Updated weights for policy 0, policy_version 52680 (0.0006) [2023-03-06 22:09:17,176][62475] Updated weights for policy 0, policy_version 52690 (0.0006) [2023-03-06 22:09:17,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12742.7). Total num frames: 53956608. Throughput: 0: 12735.5. Samples: 53939612. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:09:17,401][62145] Avg episode reward: [(0, '323.672')] [2023-03-06 22:09:17,972][62475] Updated weights for policy 0, policy_version 52700 (0.0006) [2023-03-06 22:09:18,784][62475] Updated weights for policy 0, policy_version 52710 (0.0006) [2023-03-06 22:09:19,597][62475] Updated weights for policy 0, policy_version 52720 (0.0007) [2023-03-06 22:09:20,384][62475] Updated weights for policy 0, policy_version 52730 (0.0007) [2023-03-06 22:09:21,219][62475] Updated weights for policy 0, policy_version 52740 (0.0006) [2023-03-06 22:09:22,004][62475] Updated weights for policy 0, policy_version 52750 (0.0006) [2023-03-06 22:09:22,390][62145] Fps is (10 sec: 12697.7, 60 sec: 12731.7, 300 sec: 12742.7). Total num frames: 54020096. Throughput: 0: 12727.8. Samples: 54015922. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:09:22,400][62145] Avg episode reward: [(0, '415.312')] [2023-03-06 22:09:22,817][62475] Updated weights for policy 0, policy_version 52760 (0.0006) [2023-03-06 22:09:23,639][62475] Updated weights for policy 0, policy_version 52770 (0.0006) [2023-03-06 22:09:24,433][62475] Updated weights for policy 0, policy_version 52780 (0.0005) [2023-03-06 22:09:25,242][62475] Updated weights for policy 0, policy_version 52790 (0.0006) [2023-03-06 22:09:26,036][62475] Updated weights for policy 0, policy_version 52800 (0.0006) [2023-03-06 22:09:26,827][62475] Updated weights for policy 0, policy_version 52810 (0.0008) [2023-03-06 22:09:27,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12748.8, 300 sec: 12746.2). Total num frames: 54084608. Throughput: 0: 12725.7. Samples: 54054240. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:09:27,401][62145] Avg episode reward: [(0, '317.411')] [2023-03-06 22:09:27,630][62475] Updated weights for policy 0, policy_version 52820 (0.0006) [2023-03-06 22:09:28,414][62475] Updated weights for policy 0, policy_version 52830 (0.0006) [2023-03-06 22:09:29,211][62475] Updated weights for policy 0, policy_version 52840 (0.0006) [2023-03-06 22:09:30,012][62475] Updated weights for policy 0, policy_version 52850 (0.0005) [2023-03-06 22:09:30,819][62475] Updated weights for policy 0, policy_version 52860 (0.0006) [2023-03-06 22:09:31,609][62475] Updated weights for policy 0, policy_version 52870 (0.0006) [2023-03-06 22:09:32,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12746.2). Total num frames: 54148096. Throughput: 0: 12728.6. Samples: 54130994. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:09:32,401][62145] Avg episode reward: [(0, '349.781')] [2023-03-06 22:09:32,436][62475] Updated weights for policy 0, policy_version 52880 (0.0006) [2023-03-06 22:09:33,258][62475] Updated weights for policy 0, policy_version 52890 (0.0006) [2023-03-06 22:09:34,042][62475] Updated weights for policy 0, policy_version 52900 (0.0006) [2023-03-06 22:09:34,843][62475] Updated weights for policy 0, policy_version 52910 (0.0006) [2023-03-06 22:09:35,646][62475] Updated weights for policy 0, policy_version 52920 (0.0007) [2023-03-06 22:09:36,453][62475] Updated weights for policy 0, policy_version 52930 (0.0006) [2023-03-06 22:09:37,245][62475] Updated weights for policy 0, policy_version 52940 (0.0006) [2023-03-06 22:09:37,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12742.7). Total num frames: 54211584. Throughput: 0: 12734.0. Samples: 54207363. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:09:37,401][62145] Avg episode reward: [(0, '356.099')] [2023-03-06 22:09:38,059][62475] Updated weights for policy 0, policy_version 52950 (0.0006) [2023-03-06 22:09:38,856][62475] Updated weights for policy 0, policy_version 52960 (0.0007) [2023-03-06 22:09:39,645][62475] Updated weights for policy 0, policy_version 52970 (0.0006) [2023-03-06 22:09:40,445][62475] Updated weights for policy 0, policy_version 52980 (0.0006) [2023-03-06 22:09:41,255][62475] Updated weights for policy 0, policy_version 52990 (0.0008) [2023-03-06 22:09:42,039][62475] Updated weights for policy 0, policy_version 53000 (0.0006) [2023-03-06 22:09:42,384][62424] KL-divergence is very high: 165.0707 [2023-03-06 22:09:42,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12742.7). Total num frames: 54275072. Throughput: 0: 12740.8. Samples: 54245738. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:09:42,401][62145] Avg episode reward: [(0, '367.441')] [2023-03-06 22:09:42,865][62475] Updated weights for policy 0, policy_version 53010 (0.0006) [2023-03-06 22:09:43,673][62475] Updated weights for policy 0, policy_version 53020 (0.0007) [2023-03-06 22:09:44,471][62475] Updated weights for policy 0, policy_version 53030 (0.0006) [2023-03-06 22:09:45,290][62475] Updated weights for policy 0, policy_version 53040 (0.0006) [2023-03-06 22:09:46,083][62475] Updated weights for policy 0, policy_version 53050 (0.0007) [2023-03-06 22:09:46,883][62475] Updated weights for policy 0, policy_version 53060 (0.0006) [2023-03-06 22:09:47,390][62145] Fps is (10 sec: 12800.2, 60 sec: 12748.8, 300 sec: 12742.7). Total num frames: 54339584. Throughput: 0: 12739.1. Samples: 54322287. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:09:47,401][62145] Avg episode reward: [(0, '293.578')] [2023-03-06 22:09:47,687][62475] Updated weights for policy 0, policy_version 53070 (0.0006) [2023-03-06 22:09:48,482][62475] Updated weights for policy 0, policy_version 53080 (0.0006) [2023-03-06 22:09:49,300][62475] Updated weights for policy 0, policy_version 53090 (0.0006) [2023-03-06 22:09:50,089][62475] Updated weights for policy 0, policy_version 53100 (0.0006) [2023-03-06 22:09:50,882][62475] Updated weights for policy 0, policy_version 53110 (0.0007) [2023-03-06 22:09:51,694][62475] Updated weights for policy 0, policy_version 53120 (0.0006) [2023-03-06 22:09:52,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12742.7). Total num frames: 54403072. Throughput: 0: 12752.2. Samples: 54398991. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:09:52,401][62145] Avg episode reward: [(0, '369.341')] [2023-03-06 22:09:52,481][62475] Updated weights for policy 0, policy_version 53130 (0.0007) [2023-03-06 22:09:53,308][62475] Updated weights for policy 0, policy_version 53140 (0.0006) [2023-03-06 22:09:53,853][62424] KL-divergence is very high: 3389.9922 [2023-03-06 22:09:54,102][62475] Updated weights for policy 0, policy_version 53150 (0.0006) [2023-03-06 22:09:54,907][62475] Updated weights for policy 0, policy_version 53160 (0.0007) [2023-03-06 22:09:55,701][62475] Updated weights for policy 0, policy_version 53170 (0.0006) [2023-03-06 22:09:56,514][62475] Updated weights for policy 0, policy_version 53180 (0.0007) [2023-03-06 22:09:57,320][62475] Updated weights for policy 0, policy_version 53190 (0.0007) [2023-03-06 22:09:57,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12748.8, 300 sec: 12742.7). Total num frames: 54466560. Throughput: 0: 12745.8. Samples: 54437067. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:09:57,400][62145] Avg episode reward: [(0, '327.116')] [2023-03-06 22:09:58,119][62475] Updated weights for policy 0, policy_version 53200 (0.0006) [2023-03-06 22:09:58,938][62475] Updated weights for policy 0, policy_version 53210 (0.0006) [2023-03-06 22:09:59,724][62475] Updated weights for policy 0, policy_version 53220 (0.0006) [2023-03-06 22:10:00,531][62475] Updated weights for policy 0, policy_version 53230 (0.0006) [2023-03-06 22:10:01,347][62475] Updated weights for policy 0, policy_version 53240 (0.0006) [2023-03-06 22:10:02,146][62475] Updated weights for policy 0, policy_version 53250 (0.0006) [2023-03-06 22:10:02,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12739.3). Total num frames: 54530048. Throughput: 0: 12754.4. Samples: 54513561. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:10:02,390][62145] Avg episode reward: [(0, '343.001')] [2023-03-06 22:10:02,957][62475] Updated weights for policy 0, policy_version 53260 (0.0006) [2023-03-06 22:10:03,781][62475] Updated weights for policy 0, policy_version 53270 (0.0006) [2023-03-06 22:10:04,575][62475] Updated weights for policy 0, policy_version 53280 (0.0006) [2023-03-06 22:10:05,378][62475] Updated weights for policy 0, policy_version 53290 (0.0006) [2023-03-06 22:10:06,194][62475] Updated weights for policy 0, policy_version 53300 (0.0006) [2023-03-06 22:10:06,976][62475] Updated weights for policy 0, policy_version 53310 (0.0006) [2023-03-06 22:10:07,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12739.3). Total num frames: 54593536. Throughput: 0: 12752.1. Samples: 54589768. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:10:07,401][62145] Avg episode reward: [(0, '318.808')] [2023-03-06 22:10:07,784][62475] Updated weights for policy 0, policy_version 53320 (0.0006) [2023-03-06 22:10:08,609][62475] Updated weights for policy 0, policy_version 53330 (0.0006) [2023-03-06 22:10:09,407][62475] Updated weights for policy 0, policy_version 53340 (0.0007) [2023-03-06 22:10:10,218][62475] Updated weights for policy 0, policy_version 53350 (0.0007) [2023-03-06 22:10:11,024][62475] Updated weights for policy 0, policy_version 53360 (0.0007) [2023-03-06 22:10:11,841][62475] Updated weights for policy 0, policy_version 53370 (0.0006) [2023-03-06 22:10:12,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12748.8, 300 sec: 12742.7). Total num frames: 54658048. Throughput: 0: 12745.9. Samples: 54627806. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:10:12,390][62145] Avg episode reward: [(0, '396.298')] [2023-03-06 22:10:12,635][62475] Updated weights for policy 0, policy_version 53380 (0.0006) [2023-03-06 22:10:13,426][62475] Updated weights for policy 0, policy_version 53390 (0.0006) [2023-03-06 22:10:14,249][62475] Updated weights for policy 0, policy_version 53400 (0.0007) [2023-03-06 22:10:15,063][62475] Updated weights for policy 0, policy_version 53410 (0.0007) [2023-03-06 22:10:15,854][62475] Updated weights for policy 0, policy_version 53420 (0.0006) [2023-03-06 22:10:16,656][62475] Updated weights for policy 0, policy_version 53430 (0.0007) [2023-03-06 22:10:17,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12742.7). Total num frames: 54721536. Throughput: 0: 12737.2. Samples: 54704169. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:10:17,390][62145] Avg episode reward: [(0, '368.098')] [2023-03-06 22:10:17,465][62475] Updated weights for policy 0, policy_version 53440 (0.0006) [2023-03-06 22:10:18,258][62475] Updated weights for policy 0, policy_version 53450 (0.0006) [2023-03-06 22:10:19,050][62475] Updated weights for policy 0, policy_version 53460 (0.0006) [2023-03-06 22:10:19,871][62475] Updated weights for policy 0, policy_version 53470 (0.0007) [2023-03-06 22:10:20,665][62475] Updated weights for policy 0, policy_version 53480 (0.0007) [2023-03-06 22:10:21,452][62475] Updated weights for policy 0, policy_version 53490 (0.0006) [2023-03-06 22:10:22,246][62475] Updated weights for policy 0, policy_version 53500 (0.0006) [2023-03-06 22:10:22,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12748.8, 300 sec: 12742.7). Total num frames: 54785024. Throughput: 0: 12744.2. Samples: 54780852. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:10:22,390][62145] Avg episode reward: [(0, '313.856')] [2023-03-06 22:10:22,407][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000053502_54786048.pth... [2023-03-06 22:10:22,438][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000050516_51728384.pth [2023-03-06 22:10:23,053][62475] Updated weights for policy 0, policy_version 53510 (0.0006) [2023-03-06 22:10:23,864][62475] Updated weights for policy 0, policy_version 53520 (0.0007) [2023-03-06 22:10:24,665][62475] Updated weights for policy 0, policy_version 53530 (0.0007) [2023-03-06 22:10:25,468][62475] Updated weights for policy 0, policy_version 53540 (0.0006) [2023-03-06 22:10:26,285][62475] Updated weights for policy 0, policy_version 53550 (0.0006) [2023-03-06 22:10:27,078][62475] Updated weights for policy 0, policy_version 53560 (0.0006) [2023-03-06 22:10:27,384][62424] KL-divergence is very high: 182.2102 [2023-03-06 22:10:27,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12731.8, 300 sec: 12739.3). Total num frames: 54848512. Throughput: 0: 12742.8. Samples: 54819161. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:10:27,390][62145] Avg episode reward: [(0, '393.323')] [2023-03-06 22:10:27,860][62475] Updated weights for policy 0, policy_version 53570 (0.0007) [2023-03-06 22:10:28,666][62475] Updated weights for policy 0, policy_version 53580 (0.0006) [2023-03-06 22:10:29,475][62475] Updated weights for policy 0, policy_version 53590 (0.0006) [2023-03-06 22:10:29,876][62424] KL-divergence is very high: 372.9458 [2023-03-06 22:10:30,282][62475] Updated weights for policy 0, policy_version 53600 (0.0006) [2023-03-06 22:10:31,098][62475] Updated weights for policy 0, policy_version 53610 (0.0007) [2023-03-06 22:10:31,901][62475] Updated weights for policy 0, policy_version 53620 (0.0006) [2023-03-06 22:10:31,964][62424] KL-divergence is very high: 1909.3588 [2023-03-06 22:10:32,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12742.7). Total num frames: 54913024. Throughput: 0: 12742.5. Samples: 54895698. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:10:32,390][62145] Avg episode reward: [(0, '328.378')] [2023-03-06 22:10:32,534][62424] KL-divergence is very high: 1655.6523 [2023-03-06 22:10:32,691][62475] Updated weights for policy 0, policy_version 53630 (0.0006) [2023-03-06 22:10:33,496][62475] Updated weights for policy 0, policy_version 53640 (0.0007) [2023-03-06 22:10:34,304][62475] Updated weights for policy 0, policy_version 53650 (0.0006) [2023-03-06 22:10:35,106][62475] Updated weights for policy 0, policy_version 53660 (0.0006) [2023-03-06 22:10:35,897][62475] Updated weights for policy 0, policy_version 53670 (0.0006) [2023-03-06 22:10:36,697][62475] Updated weights for policy 0, policy_version 53680 (0.0006) [2023-03-06 22:10:37,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12742.7). Total num frames: 54976512. Throughput: 0: 12739.2. Samples: 54972252. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:10:37,390][62145] Avg episode reward: [(0, '274.948')] [2023-03-06 22:10:37,497][62475] Updated weights for policy 0, policy_version 53690 (0.0007) [2023-03-06 22:10:38,298][62475] Updated weights for policy 0, policy_version 53700 (0.0007) [2023-03-06 22:10:39,086][62475] Updated weights for policy 0, policy_version 53710 (0.0006) [2023-03-06 22:10:39,891][62475] Updated weights for policy 0, policy_version 53720 (0.0006) [2023-03-06 22:10:40,687][62475] Updated weights for policy 0, policy_version 53730 (0.0006) [2023-03-06 22:10:41,497][62475] Updated weights for policy 0, policy_version 53740 (0.0007) [2023-03-06 22:10:42,296][62475] Updated weights for policy 0, policy_version 53750 (0.0006) [2023-03-06 22:10:42,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12748.8, 300 sec: 12742.7). Total num frames: 55040000. Throughput: 0: 12748.9. Samples: 55010771. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:10:42,390][62145] Avg episode reward: [(0, '364.800')] [2023-03-06 22:10:43,094][62475] Updated weights for policy 0, policy_version 53760 (0.0006) [2023-03-06 22:10:43,913][62475] Updated weights for policy 0, policy_version 53770 (0.0007) [2023-03-06 22:10:44,722][62475] Updated weights for policy 0, policy_version 53780 (0.0005) [2023-03-06 22:10:45,519][62475] Updated weights for policy 0, policy_version 53790 (0.0006) [2023-03-06 22:10:46,330][62475] Updated weights for policy 0, policy_version 53800 (0.0006) [2023-03-06 22:10:47,145][62475] Updated weights for policy 0, policy_version 53810 (0.0006) [2023-03-06 22:10:47,390][62145] Fps is (10 sec: 12799.8, 60 sec: 12748.8, 300 sec: 12742.7). Total num frames: 55104512. Throughput: 0: 12747.6. Samples: 55087203. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:10:47,390][62145] Avg episode reward: [(0, '293.670')] [2023-03-06 22:10:47,948][62475] Updated weights for policy 0, policy_version 53820 (0.0006) [2023-03-06 22:10:48,741][62475] Updated weights for policy 0, policy_version 53830 (0.0007) [2023-03-06 22:10:49,539][62475] Updated weights for policy 0, policy_version 53840 (0.0006) [2023-03-06 22:10:50,337][62475] Updated weights for policy 0, policy_version 53850 (0.0005) [2023-03-06 22:10:51,156][62475] Updated weights for policy 0, policy_version 53860 (0.0006) [2023-03-06 22:10:51,956][62475] Updated weights for policy 0, policy_version 53870 (0.0006) [2023-03-06 22:10:52,389][62145] Fps is (10 sec: 12800.2, 60 sec: 12748.8, 300 sec: 12742.7). Total num frames: 55168000. Throughput: 0: 12753.0. Samples: 55163652. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:10:52,390][62145] Avg episode reward: [(0, '316.532')] [2023-03-06 22:10:52,754][62475] Updated weights for policy 0, policy_version 53880 (0.0006) [2023-03-06 22:10:53,572][62475] Updated weights for policy 0, policy_version 53890 (0.0006) [2023-03-06 22:10:54,361][62475] Updated weights for policy 0, policy_version 53900 (0.0006) [2023-03-06 22:10:55,170][62475] Updated weights for policy 0, policy_version 53910 (0.0006) [2023-03-06 22:10:55,972][62475] Updated weights for policy 0, policy_version 53920 (0.0005) [2023-03-06 22:10:56,780][62475] Updated weights for policy 0, policy_version 53930 (0.0006) [2023-03-06 22:10:57,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12748.8, 300 sec: 12742.7). Total num frames: 55231488. Throughput: 0: 12758.0. Samples: 55201919. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:10:57,390][62145] Avg episode reward: [(0, '320.892')] [2023-03-06 22:10:57,585][62475] Updated weights for policy 0, policy_version 53940 (0.0006) [2023-03-06 22:10:58,382][62475] Updated weights for policy 0, policy_version 53950 (0.0008) [2023-03-06 22:10:59,184][62475] Updated weights for policy 0, policy_version 53960 (0.0007) [2023-03-06 22:10:59,984][62475] Updated weights for policy 0, policy_version 53970 (0.0007) [2023-03-06 22:11:00,770][62475] Updated weights for policy 0, policy_version 53980 (0.0006) [2023-03-06 22:11:01,577][62475] Updated weights for policy 0, policy_version 53990 (0.0006) [2023-03-06 22:11:02,375][62475] Updated weights for policy 0, policy_version 54000 (0.0007) [2023-03-06 22:11:02,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12765.9, 300 sec: 12746.2). Total num frames: 55296000. Throughput: 0: 12764.2. Samples: 55278556. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:11:02,390][62145] Avg episode reward: [(0, '339.318')] [2023-03-06 22:11:03,166][62475] Updated weights for policy 0, policy_version 54010 (0.0007) [2023-03-06 22:11:03,989][62475] Updated weights for policy 0, policy_version 54020 (0.0006) [2023-03-06 22:11:04,789][62475] Updated weights for policy 0, policy_version 54030 (0.0006) [2023-03-06 22:11:05,595][62475] Updated weights for policy 0, policy_version 54040 (0.0006) [2023-03-06 22:11:06,402][62475] Updated weights for policy 0, policy_version 54050 (0.0006) [2023-03-06 22:11:07,218][62475] Updated weights for policy 0, policy_version 54060 (0.0007) [2023-03-06 22:11:07,389][62145] Fps is (10 sec: 12800.2, 60 sec: 12765.9, 300 sec: 12742.7). Total num frames: 55359488. Throughput: 0: 12755.1. Samples: 55354829. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:11:07,390][62145] Avg episode reward: [(0, '313.894')] [2023-03-06 22:11:08,004][62475] Updated weights for policy 0, policy_version 54070 (0.0005) [2023-03-06 22:11:08,817][62475] Updated weights for policy 0, policy_version 54080 (0.0006) [2023-03-06 22:11:09,633][62475] Updated weights for policy 0, policy_version 54090 (0.0006) [2023-03-06 22:11:10,430][62475] Updated weights for policy 0, policy_version 54100 (0.0006) [2023-03-06 22:11:11,233][62475] Updated weights for policy 0, policy_version 54110 (0.0006) [2023-03-06 22:11:12,049][62475] Updated weights for policy 0, policy_version 54120 (0.0006) [2023-03-06 22:11:12,390][62145] Fps is (10 sec: 12697.4, 60 sec: 12748.8, 300 sec: 12742.7). Total num frames: 55422976. Throughput: 0: 12751.9. Samples: 55392998. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:11:12,390][62145] Avg episode reward: [(0, '315.119')] [2023-03-06 22:11:12,844][62475] Updated weights for policy 0, policy_version 54130 (0.0006) [2023-03-06 22:11:13,632][62475] Updated weights for policy 0, policy_version 54140 (0.0006) [2023-03-06 22:11:14,445][62475] Updated weights for policy 0, policy_version 54150 (0.0006) [2023-03-06 22:11:15,256][62475] Updated weights for policy 0, policy_version 54160 (0.0006) [2023-03-06 22:11:16,051][62475] Updated weights for policy 0, policy_version 54170 (0.0007) [2023-03-06 22:11:16,862][62475] Updated weights for policy 0, policy_version 54180 (0.0006) [2023-03-06 22:11:17,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12748.8, 300 sec: 12742.7). Total num frames: 55486464. Throughput: 0: 12754.2. Samples: 55469636. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:11:17,390][62145] Avg episode reward: [(0, '360.831')] [2023-03-06 22:11:17,666][62475] Updated weights for policy 0, policy_version 54190 (0.0006) [2023-03-06 22:11:18,468][62475] Updated weights for policy 0, policy_version 54200 (0.0006) [2023-03-06 22:11:19,266][62475] Updated weights for policy 0, policy_version 54210 (0.0006) [2023-03-06 22:11:20,088][62475] Updated weights for policy 0, policy_version 54220 (0.0007) [2023-03-06 22:11:20,883][62475] Updated weights for policy 0, policy_version 54230 (0.0006) [2023-03-06 22:11:21,681][62475] Updated weights for policy 0, policy_version 54240 (0.0007) [2023-03-06 22:11:22,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12748.8, 300 sec: 12742.7). Total num frames: 55549952. Throughput: 0: 12749.2. Samples: 55545965. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:11:22,390][62145] Avg episode reward: [(0, '338.086')] [2023-03-06 22:11:22,479][62475] Updated weights for policy 0, policy_version 54250 (0.0005) [2023-03-06 22:11:23,274][62475] Updated weights for policy 0, policy_version 54260 (0.0006) [2023-03-06 22:11:24,069][62475] Updated weights for policy 0, policy_version 54270 (0.0005) [2023-03-06 22:11:24,883][62475] Updated weights for policy 0, policy_version 54280 (0.0006) [2023-03-06 22:11:25,674][62475] Updated weights for policy 0, policy_version 54290 (0.0007) [2023-03-06 22:11:26,493][62475] Updated weights for policy 0, policy_version 54300 (0.0007) [2023-03-06 22:11:27,313][62475] Updated weights for policy 0, policy_version 54310 (0.0005) [2023-03-06 22:11:27,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12765.9, 300 sec: 12742.7). Total num frames: 55614464. Throughput: 0: 12745.8. Samples: 55584331. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:11:27,390][62145] Avg episode reward: [(0, '302.934')] [2023-03-06 22:11:28,106][62475] Updated weights for policy 0, policy_version 54320 (0.0006) [2023-03-06 22:11:28,913][62475] Updated weights for policy 0, policy_version 54330 (0.0006) [2023-03-06 22:11:29,714][62475] Updated weights for policy 0, policy_version 54340 (0.0007) [2023-03-06 22:11:30,509][62475] Updated weights for policy 0, policy_version 54350 (0.0007) [2023-03-06 22:11:31,305][62475] Updated weights for policy 0, policy_version 54360 (0.0006) [2023-03-06 22:11:32,115][62475] Updated weights for policy 0, policy_version 54370 (0.0006) [2023-03-06 22:11:32,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12742.7). Total num frames: 55677952. Throughput: 0: 12748.0. Samples: 55660861. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:11:32,390][62145] Avg episode reward: [(0, '296.249')] [2023-03-06 22:11:32,911][62475] Updated weights for policy 0, policy_version 54380 (0.0007) [2023-03-06 22:11:33,702][62475] Updated weights for policy 0, policy_version 54390 (0.0006) [2023-03-06 22:11:34,525][62475] Updated weights for policy 0, policy_version 54400 (0.0006) [2023-03-06 22:11:35,309][62475] Updated weights for policy 0, policy_version 54410 (0.0006) [2023-03-06 22:11:36,110][62475] Updated weights for policy 0, policy_version 54420 (0.0007) [2023-03-06 22:11:36,916][62475] Updated weights for policy 0, policy_version 54430 (0.0007) [2023-03-06 22:11:37,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12748.8, 300 sec: 12739.3). Total num frames: 55741440. Throughput: 0: 12755.8. Samples: 55737664. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:11:37,390][62145] Avg episode reward: [(0, '344.785')] [2023-03-06 22:11:37,718][62475] Updated weights for policy 0, policy_version 54440 (0.0007) [2023-03-06 22:11:38,511][62475] Updated weights for policy 0, policy_version 54450 (0.0006) [2023-03-06 22:11:39,304][62475] Updated weights for policy 0, policy_version 54460 (0.0007) [2023-03-06 22:11:40,116][62475] Updated weights for policy 0, policy_version 54470 (0.0006) [2023-03-06 22:11:40,922][62475] Updated weights for policy 0, policy_version 54480 (0.0007) [2023-03-06 22:11:41,721][62475] Updated weights for policy 0, policy_version 54490 (0.0007) [2023-03-06 22:11:42,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12765.9, 300 sec: 12742.7). Total num frames: 55805952. Throughput: 0: 12756.8. Samples: 55775974. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:11:42,390][62145] Avg episode reward: [(0, '294.911')] [2023-03-06 22:11:42,531][62475] Updated weights for policy 0, policy_version 54500 (0.0006) [2023-03-06 22:11:43,341][62475] Updated weights for policy 0, policy_version 54510 (0.0006) [2023-03-06 22:11:44,133][62475] Updated weights for policy 0, policy_version 54520 (0.0006) [2023-03-06 22:11:44,933][62475] Updated weights for policy 0, policy_version 54530 (0.0006) [2023-03-06 22:11:45,713][62475] Updated weights for policy 0, policy_version 54540 (0.0006) [2023-03-06 22:11:46,504][62475] Updated weights for policy 0, policy_version 54550 (0.0006) [2023-03-06 22:11:47,302][62475] Updated weights for policy 0, policy_version 54560 (0.0006) [2023-03-06 22:11:47,390][62145] Fps is (10 sec: 12902.2, 60 sec: 12765.9, 300 sec: 12746.2). Total num frames: 55870464. Throughput: 0: 12758.7. Samples: 55852698. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:11:47,390][62145] Avg episode reward: [(0, '426.306')] [2023-03-06 22:11:48,115][62475] Updated weights for policy 0, policy_version 54570 (0.0006) [2023-03-06 22:11:48,914][62475] Updated weights for policy 0, policy_version 54580 (0.0006) [2023-03-06 22:11:49,717][62475] Updated weights for policy 0, policy_version 54590 (0.0006) [2023-03-06 22:11:50,524][62475] Updated weights for policy 0, policy_version 54600 (0.0006) [2023-03-06 22:11:51,322][62475] Updated weights for policy 0, policy_version 54610 (0.0006) [2023-03-06 22:11:52,124][62475] Updated weights for policy 0, policy_version 54620 (0.0006) [2023-03-06 22:11:52,390][62145] Fps is (10 sec: 12800.1, 60 sec: 12765.9, 300 sec: 12746.2). Total num frames: 55933952. Throughput: 0: 12771.5. Samples: 55929548. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:11:52,390][62145] Avg episode reward: [(0, '396.442')] [2023-03-06 22:11:52,934][62475] Updated weights for policy 0, policy_version 54630 (0.0006) [2023-03-06 22:11:53,727][62475] Updated weights for policy 0, policy_version 54640 (0.0006) [2023-03-06 22:11:54,537][62475] Updated weights for policy 0, policy_version 54650 (0.0006) [2023-03-06 22:11:55,356][62475] Updated weights for policy 0, policy_version 54660 (0.0007) [2023-03-06 22:11:56,148][62475] Updated weights for policy 0, policy_version 54670 (0.0006) [2023-03-06 22:11:56,954][62475] Updated weights for policy 0, policy_version 54680 (0.0006) [2023-03-06 22:11:57,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12765.9, 300 sec: 12742.7). Total num frames: 55997440. Throughput: 0: 12768.8. Samples: 55967593. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:11:57,390][62145] Avg episode reward: [(0, '349.216')] [2023-03-06 22:11:57,772][62475] Updated weights for policy 0, policy_version 54690 (0.0006) [2023-03-06 22:11:58,566][62475] Updated weights for policy 0, policy_version 54700 (0.0006) [2023-03-06 22:11:59,393][62475] Updated weights for policy 0, policy_version 54710 (0.0006) [2023-03-06 22:12:00,185][62475] Updated weights for policy 0, policy_version 54720 (0.0006) [2023-03-06 22:12:00,990][62475] Updated weights for policy 0, policy_version 54730 (0.0006) [2023-03-06 22:12:01,220][62424] KL-divergence is very high: 295449.4688 [2023-03-06 22:12:01,800][62475] Updated weights for policy 0, policy_version 54740 (0.0006) [2023-03-06 22:12:02,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12748.8, 300 sec: 12742.7). Total num frames: 56060928. Throughput: 0: 12759.6. Samples: 56043819. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:12:02,390][62145] Avg episode reward: [(0, '380.754')] [2023-03-06 22:12:02,597][62475] Updated weights for policy 0, policy_version 54750 (0.0006) [2023-03-06 22:12:03,398][62475] Updated weights for policy 0, policy_version 54760 (0.0006) [2023-03-06 22:12:04,208][62475] Updated weights for policy 0, policy_version 54770 (0.0006) [2023-03-06 22:12:05,006][62475] Updated weights for policy 0, policy_version 54780 (0.0006) [2023-03-06 22:12:05,817][62475] Updated weights for policy 0, policy_version 54790 (0.0007) [2023-03-06 22:12:06,613][62475] Updated weights for policy 0, policy_version 54800 (0.0006) [2023-03-06 22:12:07,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12748.8, 300 sec: 12742.7). Total num frames: 56124416. Throughput: 0: 12762.3. Samples: 56120267. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:12:07,390][62145] Avg episode reward: [(0, '507.943')] [2023-03-06 22:12:07,401][62475] Updated weights for policy 0, policy_version 54810 (0.0006) [2023-03-06 22:12:08,214][62475] Updated weights for policy 0, policy_version 54820 (0.0006) [2023-03-06 22:12:09,020][62475] Updated weights for policy 0, policy_version 54830 (0.0006) [2023-03-06 22:12:09,823][62475] Updated weights for policy 0, policy_version 54840 (0.0006) [2023-03-06 22:12:10,642][62475] Updated weights for policy 0, policy_version 54850 (0.0006) [2023-03-06 22:12:11,438][62475] Updated weights for policy 0, policy_version 54860 (0.0006) [2023-03-06 22:12:12,241][62475] Updated weights for policy 0, policy_version 54870 (0.0006) [2023-03-06 22:12:12,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12748.8, 300 sec: 12742.7). Total num frames: 56187904. Throughput: 0: 12758.4. Samples: 56158459. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:12:12,390][62145] Avg episode reward: [(0, '529.330')] [2023-03-06 22:12:13,044][62475] Updated weights for policy 0, policy_version 54880 (0.0006) [2023-03-06 22:12:13,824][62475] Updated weights for policy 0, policy_version 54890 (0.0007) [2023-03-06 22:12:14,665][62475] Updated weights for policy 0, policy_version 54900 (0.0006) [2023-03-06 22:12:15,460][62475] Updated weights for policy 0, policy_version 54910 (0.0006) [2023-03-06 22:12:16,276][62475] Updated weights for policy 0, policy_version 54920 (0.0006) [2023-03-06 22:12:17,070][62475] Updated weights for policy 0, policy_version 54930 (0.0006) [2023-03-06 22:12:17,389][62145] Fps is (10 sec: 12800.2, 60 sec: 12765.9, 300 sec: 12746.2). Total num frames: 56252416. Throughput: 0: 12761.0. Samples: 56235104. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:12:17,390][62145] Avg episode reward: [(0, '399.289')] [2023-03-06 22:12:17,863][62475] Updated weights for policy 0, policy_version 54940 (0.0006) [2023-03-06 22:12:18,653][62475] Updated weights for policy 0, policy_version 54950 (0.0006) [2023-03-06 22:12:19,445][62475] Updated weights for policy 0, policy_version 54960 (0.0007) [2023-03-06 22:12:20,263][62475] Updated weights for policy 0, policy_version 54970 (0.0006) [2023-03-06 22:12:21,070][62475] Updated weights for policy 0, policy_version 54980 (0.0006) [2023-03-06 22:12:21,858][62475] Updated weights for policy 0, policy_version 54990 (0.0006) [2023-03-06 22:12:22,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12765.9, 300 sec: 12746.2). Total num frames: 56315904. Throughput: 0: 12755.8. Samples: 56311677. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:12:22,390][62145] Avg episode reward: [(0, '543.504')] [2023-03-06 22:12:22,395][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000054996_56315904.pth... [2023-03-06 22:12:22,427][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000052009_53257216.pth [2023-03-06 22:12:22,679][62475] Updated weights for policy 0, policy_version 55000 (0.0006) [2023-03-06 22:12:23,474][62475] Updated weights for policy 0, policy_version 55010 (0.0006) [2023-03-06 22:12:24,270][62475] Updated weights for policy 0, policy_version 55020 (0.0007) [2023-03-06 22:12:25,062][62475] Updated weights for policy 0, policy_version 55030 (0.0005) [2023-03-06 22:12:25,866][62475] Updated weights for policy 0, policy_version 55040 (0.0006) [2023-03-06 22:12:26,685][62475] Updated weights for policy 0, policy_version 55050 (0.0006) [2023-03-06 22:12:27,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12748.8, 300 sec: 12742.7). Total num frames: 56379392. Throughput: 0: 12753.9. Samples: 56349897. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:12:27,390][62145] Avg episode reward: [(0, '552.723')] [2023-03-06 22:12:27,498][62475] Updated weights for policy 0, policy_version 55060 (0.0006) [2023-03-06 22:12:28,302][62475] Updated weights for policy 0, policy_version 55070 (0.0006) [2023-03-06 22:12:29,096][62475] Updated weights for policy 0, policy_version 55080 (0.0006) [2023-03-06 22:12:29,896][62475] Updated weights for policy 0, policy_version 55090 (0.0007) [2023-03-06 22:12:30,685][62475] Updated weights for policy 0, policy_version 55100 (0.0006) [2023-03-06 22:12:31,473][62475] Updated weights for policy 0, policy_version 55110 (0.0006) [2023-03-06 22:12:32,309][62475] Updated weights for policy 0, policy_version 55120 (0.0006) [2023-03-06 22:12:32,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12765.9, 300 sec: 12746.2). Total num frames: 56443904. Throughput: 0: 12753.4. Samples: 56426600. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:12:32,390][62145] Avg episode reward: [(0, '582.426')] [2023-03-06 22:12:33,101][62475] Updated weights for policy 0, policy_version 55130 (0.0006) [2023-03-06 22:12:33,898][62475] Updated weights for policy 0, policy_version 55140 (0.0006) [2023-03-06 22:12:34,719][62475] Updated weights for policy 0, policy_version 55150 (0.0006) [2023-03-06 22:12:35,507][62475] Updated weights for policy 0, policy_version 55160 (0.0006) [2023-03-06 22:12:36,327][62475] Updated weights for policy 0, policy_version 55170 (0.0006) [2023-03-06 22:12:37,121][62475] Updated weights for policy 0, policy_version 55180 (0.0006) [2023-03-06 22:12:37,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12765.9, 300 sec: 12746.2). Total num frames: 56507392. Throughput: 0: 12745.1. Samples: 56503076. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:12:37,390][62145] Avg episode reward: [(0, '483.196')] [2023-03-06 22:12:37,923][62475] Updated weights for policy 0, policy_version 55190 (0.0006) [2023-03-06 22:12:38,722][62475] Updated weights for policy 0, policy_version 55200 (0.0006) [2023-03-06 22:12:39,509][62475] Updated weights for policy 0, policy_version 55210 (0.0006) [2023-03-06 22:12:40,316][62475] Updated weights for policy 0, policy_version 55220 (0.0006) [2023-03-06 22:12:41,119][62475] Updated weights for policy 0, policy_version 55230 (0.0006) [2023-03-06 22:12:41,921][62475] Updated weights for policy 0, policy_version 55240 (0.0006) [2023-03-06 22:12:42,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12748.8, 300 sec: 12746.2). Total num frames: 56570880. Throughput: 0: 12749.6. Samples: 56541324. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:12:42,390][62145] Avg episode reward: [(0, '576.596')] [2023-03-06 22:12:42,733][62475] Updated weights for policy 0, policy_version 55250 (0.0006) [2023-03-06 22:12:43,544][62475] Updated weights for policy 0, policy_version 55260 (0.0007) [2023-03-06 22:12:44,360][62475] Updated weights for policy 0, policy_version 55270 (0.0007) [2023-03-06 22:12:45,150][62475] Updated weights for policy 0, policy_version 55280 (0.0006) [2023-03-06 22:12:45,957][62475] Updated weights for policy 0, policy_version 55290 (0.0006) [2023-03-06 22:12:46,769][62475] Updated weights for policy 0, policy_version 55300 (0.0006) [2023-03-06 22:12:47,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.8, 300 sec: 12746.2). Total num frames: 56634368. Throughput: 0: 12752.4. Samples: 56617677. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:12:47,390][62145] Avg episode reward: [(0, '516.492')] [2023-03-06 22:12:47,584][62475] Updated weights for policy 0, policy_version 55310 (0.0006) [2023-03-06 22:12:48,374][62475] Updated weights for policy 0, policy_version 55320 (0.0006) [2023-03-06 22:12:49,179][62475] Updated weights for policy 0, policy_version 55330 (0.0006) [2023-03-06 22:12:49,976][62475] Updated weights for policy 0, policy_version 55340 (0.0006) [2023-03-06 22:12:50,805][62475] Updated weights for policy 0, policy_version 55350 (0.0006) [2023-03-06 22:12:51,611][62475] Updated weights for policy 0, policy_version 55360 (0.0006) [2023-03-06 22:12:52,072][62424] KL-divergence is very high: 2898.8196 [2023-03-06 22:12:52,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12746.2). Total num frames: 56697856. Throughput: 0: 12746.7. Samples: 56693870. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:12:52,390][62145] Avg episode reward: [(0, '672.488')] [2023-03-06 22:12:52,425][62475] Updated weights for policy 0, policy_version 55370 (0.0007) [2023-03-06 22:12:53,225][62475] Updated weights for policy 0, policy_version 55380 (0.0006) [2023-03-06 22:12:54,011][62475] Updated weights for policy 0, policy_version 55390 (0.0007) [2023-03-06 22:12:54,812][62475] Updated weights for policy 0, policy_version 55400 (0.0006) [2023-03-06 22:12:55,642][62475] Updated weights for policy 0, policy_version 55410 (0.0006) [2023-03-06 22:12:56,433][62475] Updated weights for policy 0, policy_version 55420 (0.0006) [2023-03-06 22:12:57,235][62475] Updated weights for policy 0, policy_version 55430 (0.0006) [2023-03-06 22:12:57,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12746.2). Total num frames: 56761344. Throughput: 0: 12744.7. Samples: 56731968. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:12:57,390][62145] Avg episode reward: [(0, '684.688')] [2023-03-06 22:12:58,052][62475] Updated weights for policy 0, policy_version 55440 (0.0006) [2023-03-06 22:12:58,860][62475] Updated weights for policy 0, policy_version 55450 (0.0006) [2023-03-06 22:12:59,664][62475] Updated weights for policy 0, policy_version 55460 (0.0007) [2023-03-06 22:13:00,449][62475] Updated weights for policy 0, policy_version 55470 (0.0006) [2023-03-06 22:13:01,242][62475] Updated weights for policy 0, policy_version 55480 (0.0006) [2023-03-06 22:13:01,484][62424] KL-divergence is very high: 272.4428 [2023-03-06 22:13:01,640][62424] KL-divergence is very high: 192.4172 [2023-03-06 22:13:01,803][62424] KL-divergence is very high: 535.7459 [2023-03-06 22:13:01,879][62424] KL-divergence is very high: 174.9518 [2023-03-06 22:13:02,036][62424] KL-divergence is very high: 915.9327 [2023-03-06 22:13:02,045][62475] Updated weights for policy 0, policy_version 55490 (0.0006) [2023-03-06 22:13:02,297][62424] KL-divergence is very high: 1261.2125 [2023-03-06 22:13:02,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12749.7). Total num frames: 56825856. Throughput: 0: 12739.9. Samples: 56808403. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:13:02,390][62145] Avg episode reward: [(0, '585.509')] [2023-03-06 22:13:02,845][62475] Updated weights for policy 0, policy_version 55500 (0.0006) [2023-03-06 22:13:03,656][62475] Updated weights for policy 0, policy_version 55510 (0.0007) [2023-03-06 22:13:04,461][62475] Updated weights for policy 0, policy_version 55520 (0.0006) [2023-03-06 22:13:05,277][62475] Updated weights for policy 0, policy_version 55530 (0.0006) [2023-03-06 22:13:06,079][62475] Updated weights for policy 0, policy_version 55540 (0.0006) [2023-03-06 22:13:06,893][62475] Updated weights for policy 0, policy_version 55550 (0.0007) [2023-03-06 22:13:07,389][62145] Fps is (10 sec: 12800.2, 60 sec: 12748.8, 300 sec: 12749.7). Total num frames: 56889344. Throughput: 0: 12736.3. Samples: 56884810. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:13:07,390][62145] Avg episode reward: [(0, '606.650')] [2023-03-06 22:13:07,682][62475] Updated weights for policy 0, policy_version 55560 (0.0006) [2023-03-06 22:13:08,482][62475] Updated weights for policy 0, policy_version 55570 (0.0007) [2023-03-06 22:13:09,277][62475] Updated weights for policy 0, policy_version 55580 (0.0006) [2023-03-06 22:13:10,085][62475] Updated weights for policy 0, policy_version 55590 (0.0007) [2023-03-06 22:13:10,892][62475] Updated weights for policy 0, policy_version 55600 (0.0006) [2023-03-06 22:13:11,724][62475] Updated weights for policy 0, policy_version 55610 (0.0006) [2023-03-06 22:13:12,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12748.8, 300 sec: 12746.2). Total num frames: 56952832. Throughput: 0: 12741.2. Samples: 56923249. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:13:12,390][62145] Avg episode reward: [(0, '608.993')] [2023-03-06 22:13:12,508][62475] Updated weights for policy 0, policy_version 55620 (0.0006) [2023-03-06 22:13:13,313][62475] Updated weights for policy 0, policy_version 55630 (0.0007) [2023-03-06 22:13:14,116][62475] Updated weights for policy 0, policy_version 55640 (0.0006) [2023-03-06 22:13:14,913][62475] Updated weights for policy 0, policy_version 55650 (0.0007) [2023-03-06 22:13:15,730][62475] Updated weights for policy 0, policy_version 55660 (0.0006) [2023-03-06 22:13:16,542][62475] Updated weights for policy 0, policy_version 55670 (0.0007) [2023-03-06 22:13:17,342][62475] Updated weights for policy 0, policy_version 55680 (0.0006) [2023-03-06 22:13:17,389][62145] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12746.2). Total num frames: 57016320. Throughput: 0: 12728.2. Samples: 56999368. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:13:17,390][62145] Avg episode reward: [(0, '721.418')] [2023-03-06 22:13:18,170][62475] Updated weights for policy 0, policy_version 55690 (0.0006) [2023-03-06 22:13:18,975][62475] Updated weights for policy 0, policy_version 55700 (0.0006) [2023-03-06 22:13:19,762][62475] Updated weights for policy 0, policy_version 55710 (0.0007) [2023-03-06 22:13:20,584][62475] Updated weights for policy 0, policy_version 55720 (0.0006) [2023-03-06 22:13:21,394][62475] Updated weights for policy 0, policy_version 55730 (0.0007) [2023-03-06 22:13:22,190][62475] Updated weights for policy 0, policy_version 55740 (0.0006) [2023-03-06 22:13:22,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12746.2). Total num frames: 57079808. Throughput: 0: 12720.1. Samples: 57075481. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:13:22,390][62145] Avg episode reward: [(0, '626.030')] [2023-03-06 22:13:22,996][62475] Updated weights for policy 0, policy_version 55750 (0.0006) [2023-03-06 22:13:23,793][62475] Updated weights for policy 0, policy_version 55760 (0.0006) [2023-03-06 22:13:24,610][62475] Updated weights for policy 0, policy_version 55770 (0.0006) [2023-03-06 22:13:25,415][62424] KL-divergence is very high: 259.9848 [2023-03-06 22:13:25,423][62475] Updated weights for policy 0, policy_version 55780 (0.0006) [2023-03-06 22:13:26,216][62475] Updated weights for policy 0, policy_version 55790 (0.0006) [2023-03-06 22:13:27,031][62475] Updated weights for policy 0, policy_version 55800 (0.0007) [2023-03-06 22:13:27,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12746.2). Total num frames: 57143296. Throughput: 0: 12714.7. Samples: 57113484. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:13:27,390][62145] Avg episode reward: [(0, '508.715')] [2023-03-06 22:13:27,840][62475] Updated weights for policy 0, policy_version 55810 (0.0007) [2023-03-06 22:13:28,631][62475] Updated weights for policy 0, policy_version 55820 (0.0006) [2023-03-06 22:13:29,442][62475] Updated weights for policy 0, policy_version 55830 (0.0006) [2023-03-06 22:13:30,267][62475] Updated weights for policy 0, policy_version 55840 (0.0006) [2023-03-06 22:13:31,073][62475] Updated weights for policy 0, policy_version 55850 (0.0006) [2023-03-06 22:13:31,868][62475] Updated weights for policy 0, policy_version 55860 (0.0006) [2023-03-06 22:13:32,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.6, 300 sec: 12742.7). Total num frames: 57206784. Throughput: 0: 12713.3. Samples: 57189776. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:13:32,390][62145] Avg episode reward: [(0, '614.963')] [2023-03-06 22:13:32,678][62475] Updated weights for policy 0, policy_version 55870 (0.0007) [2023-03-06 22:13:33,478][62475] Updated weights for policy 0, policy_version 55880 (0.0006) [2023-03-06 22:13:34,301][62475] Updated weights for policy 0, policy_version 55890 (0.0006) [2023-03-06 22:13:35,094][62475] Updated weights for policy 0, policy_version 55900 (0.0006) [2023-03-06 22:13:35,897][62475] Updated weights for policy 0, policy_version 55910 (0.0006) [2023-03-06 22:13:36,700][62475] Updated weights for policy 0, policy_version 55920 (0.0006) [2023-03-06 22:13:37,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12742.7). Total num frames: 57270272. Throughput: 0: 12719.0. Samples: 57266223. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:13:37,390][62145] Avg episode reward: [(0, '655.396')] [2023-03-06 22:13:37,493][62475] Updated weights for policy 0, policy_version 55930 (0.0006) [2023-03-06 22:13:38,299][62475] Updated weights for policy 0, policy_version 55940 (0.0008) [2023-03-06 22:13:39,109][62475] Updated weights for policy 0, policy_version 55950 (0.0006) [2023-03-06 22:13:39,926][62475] Updated weights for policy 0, policy_version 55960 (0.0006) [2023-03-06 22:13:40,719][62475] Updated weights for policy 0, policy_version 55970 (0.0006) [2023-03-06 22:13:41,510][62475] Updated weights for policy 0, policy_version 55980 (0.0006) [2023-03-06 22:13:42,313][62475] Updated weights for policy 0, policy_version 55990 (0.0006) [2023-03-06 22:13:42,390][62145] Fps is (10 sec: 12800.1, 60 sec: 12731.7, 300 sec: 12746.2). Total num frames: 57334784. Throughput: 0: 12717.5. Samples: 57304255. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:13:42,390][62145] Avg episode reward: [(0, '610.505')] [2023-03-06 22:13:43,120][62475] Updated weights for policy 0, policy_version 56000 (0.0006) [2023-03-06 22:13:43,923][62475] Updated weights for policy 0, policy_version 56010 (0.0006) [2023-03-06 22:13:44,722][62475] Updated weights for policy 0, policy_version 56020 (0.0006) [2023-03-06 22:13:45,534][62475] Updated weights for policy 0, policy_version 56030 (0.0006) [2023-03-06 22:13:46,333][62475] Updated weights for policy 0, policy_version 56040 (0.0006) [2023-03-06 22:13:47,149][62475] Updated weights for policy 0, policy_version 56050 (0.0006) [2023-03-06 22:13:47,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12731.7, 300 sec: 12742.7). Total num frames: 57398272. Throughput: 0: 12723.4. Samples: 57380955. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:13:47,401][62145] Avg episode reward: [(0, '855.976')] [2023-03-06 22:13:47,969][62475] Updated weights for policy 0, policy_version 56060 (0.0006) [2023-03-06 22:13:48,748][62475] Updated weights for policy 0, policy_version 56070 (0.0006) [2023-03-06 22:13:49,546][62475] Updated weights for policy 0, policy_version 56080 (0.0007) [2023-03-06 22:13:50,356][62475] Updated weights for policy 0, policy_version 56090 (0.0006) [2023-03-06 22:13:51,155][62475] Updated weights for policy 0, policy_version 56100 (0.0006) [2023-03-06 22:13:51,957][62475] Updated weights for policy 0, policy_version 56110 (0.0006) [2023-03-06 22:13:52,031][62424] KL-divergence is very high: 4537.7861 [2023-03-06 22:13:52,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12746.2). Total num frames: 57461760. Throughput: 0: 12722.0. Samples: 57457302. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:13:52,401][62145] Avg episode reward: [(0, '606.602')] [2023-03-06 22:13:52,766][62475] Updated weights for policy 0, policy_version 56120 (0.0007) [2023-03-06 22:13:53,566][62475] Updated weights for policy 0, policy_version 56130 (0.0006) [2023-03-06 22:13:54,375][62475] Updated weights for policy 0, policy_version 56140 (0.0006) [2023-03-06 22:13:55,175][62475] Updated weights for policy 0, policy_version 56150 (0.0007) [2023-03-06 22:13:55,995][62475] Updated weights for policy 0, policy_version 56160 (0.0006) [2023-03-06 22:13:56,790][62475] Updated weights for policy 0, policy_version 56170 (0.0007) [2023-03-06 22:13:57,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12742.7). Total num frames: 57525248. Throughput: 0: 12714.1. Samples: 57495384. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:13:57,390][62145] Avg episode reward: [(0, '658.682')] [2023-03-06 22:13:57,605][62475] Updated weights for policy 0, policy_version 56180 (0.0007) [2023-03-06 22:13:58,401][62475] Updated weights for policy 0, policy_version 56190 (0.0006) [2023-03-06 22:13:59,204][62475] Updated weights for policy 0, policy_version 56200 (0.0006) [2023-03-06 22:14:00,013][62475] Updated weights for policy 0, policy_version 56210 (0.0006) [2023-03-06 22:14:00,817][62475] Updated weights for policy 0, policy_version 56220 (0.0006) [2023-03-06 22:14:01,618][62475] Updated weights for policy 0, policy_version 56230 (0.0006) [2023-03-06 22:14:02,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12742.7). Total num frames: 57588736. Throughput: 0: 12721.0. Samples: 57571813. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:14:02,390][62145] Avg episode reward: [(0, '639.178')] [2023-03-06 22:14:02,425][62475] Updated weights for policy 0, policy_version 56240 (0.0007) [2023-03-06 22:14:03,233][62475] Updated weights for policy 0, policy_version 56250 (0.0006) [2023-03-06 22:14:04,028][62475] Updated weights for policy 0, policy_version 56260 (0.0007) [2023-03-06 22:14:04,837][62475] Updated weights for policy 0, policy_version 56270 (0.0006) [2023-03-06 22:14:05,649][62475] Updated weights for policy 0, policy_version 56280 (0.0006) [2023-03-06 22:14:06,443][62475] Updated weights for policy 0, policy_version 56290 (0.0006) [2023-03-06 22:14:07,246][62475] Updated weights for policy 0, policy_version 56300 (0.0007) [2023-03-06 22:14:07,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12742.7). Total num frames: 57652224. Throughput: 0: 12724.8. Samples: 57648097. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:14:07,401][62145] Avg episode reward: [(0, '700.113')] [2023-03-06 22:14:08,047][62475] Updated weights for policy 0, policy_version 56310 (0.0007) [2023-03-06 22:14:08,863][62475] Updated weights for policy 0, policy_version 56320 (0.0006) [2023-03-06 22:14:09,664][62475] Updated weights for policy 0, policy_version 56330 (0.0006) [2023-03-06 22:14:10,470][62475] Updated weights for policy 0, policy_version 56340 (0.0007) [2023-03-06 22:14:11,291][62475] Updated weights for policy 0, policy_version 56350 (0.0007) [2023-03-06 22:14:12,081][62475] Updated weights for policy 0, policy_version 56360 (0.0007) [2023-03-06 22:14:12,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12742.7). Total num frames: 57715712. Throughput: 0: 12727.6. Samples: 57686225. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:14:12,390][62145] Avg episode reward: [(0, '720.844')] [2023-03-06 22:14:12,873][62475] Updated weights for policy 0, policy_version 56370 (0.0006) [2023-03-06 22:14:13,687][62475] Updated weights for policy 0, policy_version 56380 (0.0006) [2023-03-06 22:14:14,466][62475] Updated weights for policy 0, policy_version 56390 (0.0006) [2023-03-06 22:14:15,281][62475] Updated weights for policy 0, policy_version 56400 (0.0006) [2023-03-06 22:14:16,070][62475] Updated weights for policy 0, policy_version 56410 (0.0006) [2023-03-06 22:14:16,863][62475] Updated weights for policy 0, policy_version 56420 (0.0006) [2023-03-06 22:14:17,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12731.7, 300 sec: 12746.2). Total num frames: 57780224. Throughput: 0: 12735.1. Samples: 57762857. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:14:17,390][62145] Avg episode reward: [(0, '740.251')] [2023-03-06 22:14:17,692][62475] Updated weights for policy 0, policy_version 56430 (0.0007) [2023-03-06 22:14:18,488][62475] Updated weights for policy 0, policy_version 56440 (0.0006) [2023-03-06 22:14:19,294][62475] Updated weights for policy 0, policy_version 56450 (0.0006) [2023-03-06 22:14:20,109][62475] Updated weights for policy 0, policy_version 56460 (0.0006) [2023-03-06 22:14:20,907][62475] Updated weights for policy 0, policy_version 56470 (0.0005) [2023-03-06 22:14:21,717][62475] Updated weights for policy 0, policy_version 56480 (0.0007) [2023-03-06 22:14:22,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12731.7, 300 sec: 12742.7). Total num frames: 57843712. Throughput: 0: 12739.6. Samples: 57839506. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:14:22,390][62145] Avg episode reward: [(0, '924.004')] [2023-03-06 22:14:22,394][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000056488_57843712.pth... [2023-03-06 22:14:22,425][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000053502_54786048.pth [2023-03-06 22:14:22,505][62475] Updated weights for policy 0, policy_version 56490 (0.0006) [2023-03-06 22:14:22,904][62424] KL-divergence is very high: 2208.8940 [2023-03-06 22:14:23,321][62475] Updated weights for policy 0, policy_version 56500 (0.0006) [2023-03-06 22:14:24,126][62475] Updated weights for policy 0, policy_version 56510 (0.0006) [2023-03-06 22:14:24,918][62475] Updated weights for policy 0, policy_version 56520 (0.0007) [2023-03-06 22:14:25,734][62475] Updated weights for policy 0, policy_version 56530 (0.0007) [2023-03-06 22:14:26,532][62475] Updated weights for policy 0, policy_version 56540 (0.0007) [2023-03-06 22:14:27,347][62475] Updated weights for policy 0, policy_version 56550 (0.0006) [2023-03-06 22:14:27,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12742.7). Total num frames: 57907200. Throughput: 0: 12741.9. Samples: 57877641. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:14:27,390][62145] Avg episode reward: [(0, '808.781')] [2023-03-06 22:14:28,153][62475] Updated weights for policy 0, policy_version 56560 (0.0006) [2023-03-06 22:14:28,969][62475] Updated weights for policy 0, policy_version 56570 (0.0007) [2023-03-06 22:14:29,771][62475] Updated weights for policy 0, policy_version 56580 (0.0006) [2023-03-06 22:14:30,576][62475] Updated weights for policy 0, policy_version 56590 (0.0006) [2023-03-06 22:14:31,393][62475] Updated weights for policy 0, policy_version 56600 (0.0006) [2023-03-06 22:14:32,183][62475] Updated weights for policy 0, policy_version 56610 (0.0006) [2023-03-06 22:14:32,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12742.7). Total num frames: 57970688. Throughput: 0: 12725.1. Samples: 57953585. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:14:32,390][62145] Avg episode reward: [(0, '874.355')] [2023-03-06 22:14:33,022][62475] Updated weights for policy 0, policy_version 56620 (0.0005) [2023-03-06 22:14:33,806][62475] Updated weights for policy 0, policy_version 56630 (0.0006) [2023-03-06 22:14:34,606][62475] Updated weights for policy 0, policy_version 56640 (0.0006) [2023-03-06 22:14:35,435][62475] Updated weights for policy 0, policy_version 56650 (0.0006) [2023-03-06 22:14:36,229][62475] Updated weights for policy 0, policy_version 56660 (0.0006) [2023-03-06 22:14:37,030][62475] Updated weights for policy 0, policy_version 56670 (0.0006) [2023-03-06 22:14:37,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12731.7, 300 sec: 12742.7). Total num frames: 58034176. Throughput: 0: 12721.6. Samples: 58029775. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:14:37,390][62145] Avg episode reward: [(0, '885.337')] [2023-03-06 22:14:37,846][62475] Updated weights for policy 0, policy_version 56680 (0.0006) [2023-03-06 22:14:38,629][62475] Updated weights for policy 0, policy_version 56690 (0.0006) [2023-03-06 22:14:39,427][62475] Updated weights for policy 0, policy_version 56700 (0.0006) [2023-03-06 22:14:40,235][62475] Updated weights for policy 0, policy_version 56710 (0.0006) [2023-03-06 22:14:41,039][62475] Updated weights for policy 0, policy_version 56720 (0.0006) [2023-03-06 22:14:41,861][62475] Updated weights for policy 0, policy_version 56730 (0.0006) [2023-03-06 22:14:42,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12739.3). Total num frames: 58097664. Throughput: 0: 12727.9. Samples: 58068140. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:14:42,390][62145] Avg episode reward: [(0, '800.917')] [2023-03-06 22:14:42,659][62475] Updated weights for policy 0, policy_version 56740 (0.0007) [2023-03-06 22:14:43,467][62475] Updated weights for policy 0, policy_version 56750 (0.0006) [2023-03-06 22:14:44,276][62475] Updated weights for policy 0, policy_version 56760 (0.0008) [2023-03-06 22:14:45,091][62475] Updated weights for policy 0, policy_version 56770 (0.0006) [2023-03-06 22:14:45,883][62475] Updated weights for policy 0, policy_version 56780 (0.0007) [2023-03-06 22:14:46,679][62475] Updated weights for policy 0, policy_version 56790 (0.0006) [2023-03-06 22:14:47,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12739.3). Total num frames: 58161152. Throughput: 0: 12725.1. Samples: 58144444. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:14:47,390][62145] Avg episode reward: [(0, '794.320')] [2023-03-06 22:14:47,480][62475] Updated weights for policy 0, policy_version 56800 (0.0007) [2023-03-06 22:14:48,290][62475] Updated weights for policy 0, policy_version 56810 (0.0007) [2023-03-06 22:14:49,077][62475] Updated weights for policy 0, policy_version 56820 (0.0006) [2023-03-06 22:14:49,888][62475] Updated weights for policy 0, policy_version 56830 (0.0006) [2023-03-06 22:14:50,686][62475] Updated weights for policy 0, policy_version 56840 (0.0006) [2023-03-06 22:14:51,492][62475] Updated weights for policy 0, policy_version 56850 (0.0006) [2023-03-06 22:14:52,301][62475] Updated weights for policy 0, policy_version 56860 (0.0006) [2023-03-06 22:14:52,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12731.7, 300 sec: 12742.7). Total num frames: 58225664. Throughput: 0: 12729.1. Samples: 58220905. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:14:52,390][62145] Avg episode reward: [(0, '904.441')] [2023-03-06 22:14:53,129][62475] Updated weights for policy 0, policy_version 56870 (0.0006) [2023-03-06 22:14:53,918][62475] Updated weights for policy 0, policy_version 56880 (0.0007) [2023-03-06 22:14:54,724][62475] Updated weights for policy 0, policy_version 56890 (0.0006) [2023-03-06 22:14:55,519][62475] Updated weights for policy 0, policy_version 56900 (0.0006) [2023-03-06 22:14:56,336][62475] Updated weights for policy 0, policy_version 56910 (0.0006) [2023-03-06 22:14:57,118][62475] Updated weights for policy 0, policy_version 56920 (0.0006) [2023-03-06 22:14:57,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12731.7, 300 sec: 12742.7). Total num frames: 58289152. Throughput: 0: 12729.0. Samples: 58259028. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:14:57,390][62145] Avg episode reward: [(0, '718.040')] [2023-03-06 22:14:57,938][62475] Updated weights for policy 0, policy_version 56930 (0.0005) [2023-03-06 22:14:58,740][62475] Updated weights for policy 0, policy_version 56940 (0.0007) [2023-03-06 22:14:59,549][62475] Updated weights for policy 0, policy_version 56950 (0.0006) [2023-03-06 22:15:00,362][62475] Updated weights for policy 0, policy_version 56960 (0.0006) [2023-03-06 22:15:01,185][62475] Updated weights for policy 0, policy_version 56970 (0.0006) [2023-03-06 22:15:01,970][62475] Updated weights for policy 0, policy_version 56980 (0.0006) [2023-03-06 22:15:02,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12742.7). Total num frames: 58352640. Throughput: 0: 12720.5. Samples: 58335277. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:15:02,390][62145] Avg episode reward: [(0, '707.836')] [2023-03-06 22:15:02,795][62475] Updated weights for policy 0, policy_version 56990 (0.0006) [2023-03-06 22:15:03,569][62475] Updated weights for policy 0, policy_version 57000 (0.0006) [2023-03-06 22:15:04,363][62475] Updated weights for policy 0, policy_version 57010 (0.0006) [2023-03-06 22:15:05,188][62475] Updated weights for policy 0, policy_version 57020 (0.0006) [2023-03-06 22:15:05,976][62475] Updated weights for policy 0, policy_version 57030 (0.0007) [2023-03-06 22:15:06,786][62475] Updated weights for policy 0, policy_version 57040 (0.0006) [2023-03-06 22:15:07,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12739.3). Total num frames: 58416128. Throughput: 0: 12715.4. Samples: 58411700. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:15:07,390][62145] Avg episode reward: [(0, '906.705')] [2023-03-06 22:15:07,593][62475] Updated weights for policy 0, policy_version 57050 (0.0006) [2023-03-06 22:15:08,408][62475] Updated weights for policy 0, policy_version 57060 (0.0007) [2023-03-06 22:15:09,205][62475] Updated weights for policy 0, policy_version 57070 (0.0007) [2023-03-06 22:15:09,992][62475] Updated weights for policy 0, policy_version 57080 (0.0006) [2023-03-06 22:15:10,809][62475] Updated weights for policy 0, policy_version 57090 (0.0007) [2023-03-06 22:15:11,617][62475] Updated weights for policy 0, policy_version 57100 (0.0008) [2023-03-06 22:15:12,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12739.3). Total num frames: 58479616. Throughput: 0: 12718.0. Samples: 58449953. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:15:12,390][62145] Avg episode reward: [(0, '775.638')] [2023-03-06 22:15:12,432][62475] Updated weights for policy 0, policy_version 57110 (0.0005) [2023-03-06 22:15:13,269][62475] Updated weights for policy 0, policy_version 57120 (0.0006) [2023-03-06 22:15:14,055][62475] Updated weights for policy 0, policy_version 57130 (0.0006) [2023-03-06 22:15:14,873][62475] Updated weights for policy 0, policy_version 57140 (0.0006) [2023-03-06 22:15:15,667][62475] Updated weights for policy 0, policy_version 57150 (0.0006) [2023-03-06 22:15:16,464][62475] Updated weights for policy 0, policy_version 57160 (0.0006) [2023-03-06 22:15:17,270][62475] Updated weights for policy 0, policy_version 57170 (0.0006) [2023-03-06 22:15:17,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12739.3). Total num frames: 58543104. Throughput: 0: 12717.9. Samples: 58525892. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:15:17,390][62145] Avg episode reward: [(0, '986.684')] [2023-03-06 22:15:18,071][62475] Updated weights for policy 0, policy_version 57180 (0.0007) [2023-03-06 22:15:18,867][62475] Updated weights for policy 0, policy_version 57190 (0.0006) [2023-03-06 22:15:19,689][62475] Updated weights for policy 0, policy_version 57200 (0.0006) [2023-03-06 22:15:20,485][62475] Updated weights for policy 0, policy_version 57210 (0.0006) [2023-03-06 22:15:21,311][62475] Updated weights for policy 0, policy_version 57220 (0.0006) [2023-03-06 22:15:22,113][62475] Updated weights for policy 0, policy_version 57230 (0.0006) [2023-03-06 22:15:22,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.6, 300 sec: 12739.2). Total num frames: 58606592. Throughput: 0: 12723.4. Samples: 58602331. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:15:22,390][62145] Avg episode reward: [(0, '781.649')] [2023-03-06 22:15:22,937][62475] Updated weights for policy 0, policy_version 57240 (0.0006) [2023-03-06 22:15:23,725][62475] Updated weights for policy 0, policy_version 57250 (0.0007) [2023-03-06 22:15:24,504][62475] Updated weights for policy 0, policy_version 57260 (0.0006) [2023-03-06 22:15:25,340][62475] Updated weights for policy 0, policy_version 57270 (0.0006) [2023-03-06 22:15:26,140][62475] Updated weights for policy 0, policy_version 57280 (0.0007) [2023-03-06 22:15:26,948][62475] Updated weights for policy 0, policy_version 57290 (0.0006) [2023-03-06 22:15:27,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12735.8). Total num frames: 58670080. Throughput: 0: 12717.7. Samples: 58640438. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:15:27,390][62145] Avg episode reward: [(0, '896.957')] [2023-03-06 22:15:27,762][62475] Updated weights for policy 0, policy_version 57300 (0.0006) [2023-03-06 22:15:28,574][62475] Updated weights for policy 0, policy_version 57310 (0.0006) [2023-03-06 22:15:29,377][62475] Updated weights for policy 0, policy_version 57320 (0.0006) [2023-03-06 22:15:30,181][62475] Updated weights for policy 0, policy_version 57330 (0.0006) [2023-03-06 22:15:30,989][62475] Updated weights for policy 0, policy_version 57340 (0.0006) [2023-03-06 22:15:31,785][62475] Updated weights for policy 0, policy_version 57350 (0.0006) [2023-03-06 22:15:32,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12735.8). Total num frames: 58733568. Throughput: 0: 12711.1. Samples: 58716444. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:15:32,390][62145] Avg episode reward: [(0, '887.020')] [2023-03-06 22:15:32,609][62475] Updated weights for policy 0, policy_version 57360 (0.0007) [2023-03-06 22:15:33,406][62475] Updated weights for policy 0, policy_version 57370 (0.0007) [2023-03-06 22:15:34,211][62475] Updated weights for policy 0, policy_version 57380 (0.0006) [2023-03-06 22:15:35,016][62475] Updated weights for policy 0, policy_version 57390 (0.0006) [2023-03-06 22:15:35,846][62475] Updated weights for policy 0, policy_version 57400 (0.0006) [2023-03-06 22:15:36,639][62475] Updated weights for policy 0, policy_version 57410 (0.0006) [2023-03-06 22:15:37,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12735.8). Total num frames: 58797056. Throughput: 0: 12704.6. Samples: 58792611. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:15:37,390][62145] Avg episode reward: [(0, '787.020')] [2023-03-06 22:15:37,429][62475] Updated weights for policy 0, policy_version 57420 (0.0006) [2023-03-06 22:15:38,251][62475] Updated weights for policy 0, policy_version 57430 (0.0006) [2023-03-06 22:15:39,063][62475] Updated weights for policy 0, policy_version 57440 (0.0006) [2023-03-06 22:15:39,865][62475] Updated weights for policy 0, policy_version 57450 (0.0006) [2023-03-06 22:15:40,684][62475] Updated weights for policy 0, policy_version 57460 (0.0006) [2023-03-06 22:15:41,499][62475] Updated weights for policy 0, policy_version 57470 (0.0007) [2023-03-06 22:15:42,300][62475] Updated weights for policy 0, policy_version 57480 (0.0006) [2023-03-06 22:15:42,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12732.3). Total num frames: 58860544. Throughput: 0: 12701.6. Samples: 58830597. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:15:42,390][62145] Avg episode reward: [(0, '995.510')] [2023-03-06 22:15:43,095][62475] Updated weights for policy 0, policy_version 57490 (0.0006) [2023-03-06 22:15:43,909][62475] Updated weights for policy 0, policy_version 57500 (0.0006) [2023-03-06 22:15:44,705][62475] Updated weights for policy 0, policy_version 57510 (0.0006) [2023-03-06 22:15:45,511][62475] Updated weights for policy 0, policy_version 57520 (0.0006) [2023-03-06 22:15:46,302][62475] Updated weights for policy 0, policy_version 57530 (0.0006) [2023-03-06 22:15:47,115][62475] Updated weights for policy 0, policy_version 57540 (0.0007) [2023-03-06 22:15:47,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12732.3). Total num frames: 58924032. Throughput: 0: 12700.3. Samples: 58906792. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:15:47,390][62145] Avg episode reward: [(0, '812.034')] [2023-03-06 22:15:47,915][62475] Updated weights for policy 0, policy_version 57550 (0.0006) [2023-03-06 22:15:48,746][62475] Updated weights for policy 0, policy_version 57560 (0.0007) [2023-03-06 22:15:49,550][62475] Updated weights for policy 0, policy_version 57570 (0.0006) [2023-03-06 22:15:50,351][62475] Updated weights for policy 0, policy_version 57580 (0.0006) [2023-03-06 22:15:51,149][62475] Updated weights for policy 0, policy_version 57590 (0.0007) [2023-03-06 22:15:51,955][62475] Updated weights for policy 0, policy_version 57600 (0.0006) [2023-03-06 22:15:52,389][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12732.3). Total num frames: 58987520. Throughput: 0: 12699.2. Samples: 58983163. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:15:52,390][62145] Avg episode reward: [(0, '863.950')] [2023-03-06 22:15:52,750][62475] Updated weights for policy 0, policy_version 57610 (0.0006) [2023-03-06 22:15:53,551][62475] Updated weights for policy 0, policy_version 57620 (0.0006) [2023-03-06 22:15:54,244][62424] KL-divergence is very high: 345010.9375 [2023-03-06 22:15:54,354][62475] Updated weights for policy 0, policy_version 57630 (0.0007) [2023-03-06 22:15:55,147][62475] Updated weights for policy 0, policy_version 57640 (0.0007) [2023-03-06 22:15:55,978][62475] Updated weights for policy 0, policy_version 57650 (0.0006) [2023-03-06 22:15:56,771][62475] Updated weights for policy 0, policy_version 57660 (0.0006) [2023-03-06 22:15:57,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12728.8). Total num frames: 59051008. Throughput: 0: 12701.6. Samples: 59021522. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:15:57,390][62145] Avg episode reward: [(0, '908.992')] [2023-03-06 22:15:57,592][62475] Updated weights for policy 0, policy_version 57670 (0.0007) [2023-03-06 22:15:58,412][62475] Updated weights for policy 0, policy_version 57680 (0.0007) [2023-03-06 22:15:59,216][62475] Updated weights for policy 0, policy_version 57690 (0.0006) [2023-03-06 22:16:00,012][62475] Updated weights for policy 0, policy_version 57700 (0.0006) [2023-03-06 22:16:00,826][62475] Updated weights for policy 0, policy_version 57710 (0.0006) [2023-03-06 22:16:01,626][62475] Updated weights for policy 0, policy_version 57720 (0.0007) [2023-03-06 22:16:02,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12728.8). Total num frames: 59114496. Throughput: 0: 12700.3. Samples: 59097407. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:16:02,390][62145] Avg episode reward: [(0, '884.232')] [2023-03-06 22:16:02,432][62475] Updated weights for policy 0, policy_version 57730 (0.0007) [2023-03-06 22:16:03,240][62475] Updated weights for policy 0, policy_version 57740 (0.0006) [2023-03-06 22:16:04,046][62475] Updated weights for policy 0, policy_version 57750 (0.0006) [2023-03-06 22:16:04,887][62475] Updated weights for policy 0, policy_version 57760 (0.0006) [2023-03-06 22:16:05,706][62475] Updated weights for policy 0, policy_version 57770 (0.0006) [2023-03-06 22:16:06,493][62475] Updated weights for policy 0, policy_version 57780 (0.0006) [2023-03-06 22:16:07,298][62475] Updated weights for policy 0, policy_version 57790 (0.0006) [2023-03-06 22:16:07,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12728.8). Total num frames: 59177984. Throughput: 0: 12689.3. Samples: 59173349. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:16:07,390][62145] Avg episode reward: [(0, '1054.649')] [2023-03-06 22:16:08,099][62424] KL-divergence is very high: 174.5410 [2023-03-06 22:16:08,107][62475] Updated weights for policy 0, policy_version 57800 (0.0006) [2023-03-06 22:16:08,914][62475] Updated weights for policy 0, policy_version 57810 (0.0006) [2023-03-06 22:16:09,722][62475] Updated weights for policy 0, policy_version 57820 (0.0006) [2023-03-06 22:16:10,546][62475] Updated weights for policy 0, policy_version 57830 (0.0006) [2023-03-06 22:16:11,354][62475] Updated weights for policy 0, policy_version 57840 (0.0006) [2023-03-06 22:16:12,182][62475] Updated weights for policy 0, policy_version 57850 (0.0006) [2023-03-06 22:16:12,389][62145] Fps is (10 sec: 12595.2, 60 sec: 12680.6, 300 sec: 12725.4). Total num frames: 59240448. Throughput: 0: 12683.8. Samples: 59211208. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:16:12,390][62145] Avg episode reward: [(0, '1171.956')] [2023-03-06 22:16:13,011][62475] Updated weights for policy 0, policy_version 57860 (0.0006) [2023-03-06 22:16:13,829][62475] Updated weights for policy 0, policy_version 57870 (0.0006) [2023-03-06 22:16:14,607][62475] Updated weights for policy 0, policy_version 57880 (0.0007) [2023-03-06 22:16:15,416][62475] Updated weights for policy 0, policy_version 57890 (0.0005) [2023-03-06 22:16:16,217][62475] Updated weights for policy 0, policy_version 57900 (0.0007) [2023-03-06 22:16:17,028][62475] Updated weights for policy 0, policy_version 57910 (0.0007) [2023-03-06 22:16:17,389][62145] Fps is (10 sec: 12595.1, 60 sec: 12680.5, 300 sec: 12725.4). Total num frames: 59303936. Throughput: 0: 12678.0. Samples: 59286953. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:16:17,390][62145] Avg episode reward: [(0, '975.623')] [2023-03-06 22:16:17,838][62475] Updated weights for policy 0, policy_version 57920 (0.0006) [2023-03-06 22:16:18,649][62475] Updated weights for policy 0, policy_version 57930 (0.0007) [2023-03-06 22:16:19,457][62475] Updated weights for policy 0, policy_version 57940 (0.0007) [2023-03-06 22:16:20,262][62475] Updated weights for policy 0, policy_version 57950 (0.0006) [2023-03-06 22:16:21,074][62475] Updated weights for policy 0, policy_version 57960 (0.0006) [2023-03-06 22:16:21,889][62475] Updated weights for policy 0, policy_version 57970 (0.0007) [2023-03-06 22:16:22,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12680.6, 300 sec: 12721.9). Total num frames: 59367424. Throughput: 0: 12674.2. Samples: 59362951. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:16:22,390][62145] Avg episode reward: [(0, '1020.249')] [2023-03-06 22:16:22,394][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000057976_59367424.pth... [2023-03-06 22:16:22,429][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000054996_56315904.pth [2023-03-06 22:16:22,707][62475] Updated weights for policy 0, policy_version 57980 (0.0006) [2023-03-06 22:16:23,509][62475] Updated weights for policy 0, policy_version 57990 (0.0006) [2023-03-06 22:16:24,319][62475] Updated weights for policy 0, policy_version 58000 (0.0006) [2023-03-06 22:16:25,125][62475] Updated weights for policy 0, policy_version 58010 (0.0006) [2023-03-06 22:16:25,935][62475] Updated weights for policy 0, policy_version 58020 (0.0006) [2023-03-06 22:16:26,728][62475] Updated weights for policy 0, policy_version 58030 (0.0007) [2023-03-06 22:16:27,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12680.5, 300 sec: 12721.9). Total num frames: 59430912. Throughput: 0: 12673.8. Samples: 59400920. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:16:27,390][62145] Avg episode reward: [(0, '1043.382')] [2023-03-06 22:16:27,528][62475] Updated weights for policy 0, policy_version 58040 (0.0006) [2023-03-06 22:16:28,355][62475] Updated weights for policy 0, policy_version 58050 (0.0006) [2023-03-06 22:16:29,156][62475] Updated weights for policy 0, policy_version 58060 (0.0006) [2023-03-06 22:16:29,963][62475] Updated weights for policy 0, policy_version 58070 (0.0006) [2023-03-06 22:16:30,765][62475] Updated weights for policy 0, policy_version 58080 (0.0006) [2023-03-06 22:16:31,594][62475] Updated weights for policy 0, policy_version 58090 (0.0006) [2023-03-06 22:16:32,379][62475] Updated weights for policy 0, policy_version 58100 (0.0007) [2023-03-06 22:16:32,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12680.5, 300 sec: 12721.9). Total num frames: 59494400. Throughput: 0: 12670.3. Samples: 59476957. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:16:32,390][62145] Avg episode reward: [(0, '1215.581')] [2023-03-06 22:16:33,191][62475] Updated weights for policy 0, policy_version 58110 (0.0006) [2023-03-06 22:16:33,988][62475] Updated weights for policy 0, policy_version 58120 (0.0007) [2023-03-06 22:16:34,796][62475] Updated weights for policy 0, policy_version 58130 (0.0007) [2023-03-06 22:16:35,598][62475] Updated weights for policy 0, policy_version 58140 (0.0006) [2023-03-06 22:16:36,434][62475] Updated weights for policy 0, policy_version 58150 (0.0006) [2023-03-06 22:16:37,250][62475] Updated weights for policy 0, policy_version 58160 (0.0006) [2023-03-06 22:16:37,389][62145] Fps is (10 sec: 12595.3, 60 sec: 12663.5, 300 sec: 12715.0). Total num frames: 59556864. Throughput: 0: 12661.5. Samples: 59552929. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:16:37,390][62145] Avg episode reward: [(0, '1018.367')] [2023-03-06 22:16:38,056][62475] Updated weights for policy 0, policy_version 58170 (0.0006) [2023-03-06 22:16:38,863][62475] Updated weights for policy 0, policy_version 58180 (0.0006) [2023-03-06 22:16:39,671][62475] Updated weights for policy 0, policy_version 58190 (0.0006) [2023-03-06 22:16:40,478][62475] Updated weights for policy 0, policy_version 58200 (0.0006) [2023-03-06 22:16:41,282][62475] Updated weights for policy 0, policy_version 58210 (0.0007) [2023-03-06 22:16:42,086][62475] Updated weights for policy 0, policy_version 58220 (0.0006) [2023-03-06 22:16:42,389][62145] Fps is (10 sec: 12697.8, 60 sec: 12680.5, 300 sec: 12715.0). Total num frames: 59621376. Throughput: 0: 12653.9. Samples: 59590949. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:16:42,390][62145] Avg episode reward: [(0, '1003.887')] [2023-03-06 22:16:42,904][62475] Updated weights for policy 0, policy_version 58230 (0.0006) [2023-03-06 22:16:43,700][62475] Updated weights for policy 0, policy_version 58240 (0.0006) [2023-03-06 22:16:44,504][62475] Updated weights for policy 0, policy_version 58250 (0.0006) [2023-03-06 22:16:45,317][62475] Updated weights for policy 0, policy_version 58260 (0.0007) [2023-03-06 22:16:46,127][62475] Updated weights for policy 0, policy_version 58270 (0.0006) [2023-03-06 22:16:46,941][62475] Updated weights for policy 0, policy_version 58280 (0.0005) [2023-03-06 22:16:47,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12663.5, 300 sec: 12711.5). Total num frames: 59683840. Throughput: 0: 12657.6. Samples: 59666998. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:16:47,390][62145] Avg episode reward: [(0, '1138.762')] [2023-03-06 22:16:47,756][62475] Updated weights for policy 0, policy_version 58290 (0.0007) [2023-03-06 22:16:48,569][62475] Updated weights for policy 0, policy_version 58300 (0.0006) [2023-03-06 22:16:49,371][62475] Updated weights for policy 0, policy_version 58310 (0.0006) [2023-03-06 22:16:50,163][62475] Updated weights for policy 0, policy_version 58320 (0.0006) [2023-03-06 22:16:50,976][62475] Updated weights for policy 0, policy_version 58330 (0.0006) [2023-03-06 22:16:51,793][62475] Updated weights for policy 0, policy_version 58340 (0.0006) [2023-03-06 22:16:52,389][62145] Fps is (10 sec: 12595.2, 60 sec: 12663.5, 300 sec: 12711.5). Total num frames: 59747328. Throughput: 0: 12660.3. Samples: 59743062. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:16:52,390][62145] Avg episode reward: [(0, '1271.516')] [2023-03-06 22:16:52,394][62424] Saving new best policy, reward=1271.516! [2023-03-06 22:16:52,599][62475] Updated weights for policy 0, policy_version 58350 (0.0006) [2023-03-06 22:16:53,391][62475] Updated weights for policy 0, policy_version 58360 (0.0006) [2023-03-06 22:16:54,194][62475] Updated weights for policy 0, policy_version 58370 (0.0006) [2023-03-06 22:16:55,007][62475] Updated weights for policy 0, policy_version 58380 (0.0006) [2023-03-06 22:16:55,830][62475] Updated weights for policy 0, policy_version 58390 (0.0006) [2023-03-06 22:16:56,626][62475] Updated weights for policy 0, policy_version 58400 (0.0007) [2023-03-06 22:16:57,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12663.5, 300 sec: 12711.5). Total num frames: 59810816. Throughput: 0: 12667.2. Samples: 59781231. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:16:57,390][62145] Avg episode reward: [(0, '1131.689')] [2023-03-06 22:16:57,423][62475] Updated weights for policy 0, policy_version 58410 (0.0006) [2023-03-06 22:16:58,234][62475] Updated weights for policy 0, policy_version 58420 (0.0006) [2023-03-06 22:16:59,045][62475] Updated weights for policy 0, policy_version 58430 (0.0006) [2023-03-06 22:16:59,856][62475] Updated weights for policy 0, policy_version 58440 (0.0007) [2023-03-06 22:17:00,657][62475] Updated weights for policy 0, policy_version 58450 (0.0007) [2023-03-06 22:17:01,476][62475] Updated weights for policy 0, policy_version 58460 (0.0006) [2023-03-06 22:17:02,291][62475] Updated weights for policy 0, policy_version 58470 (0.0007) [2023-03-06 22:17:02,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12663.5, 300 sec: 12711.5). Total num frames: 59874304. Throughput: 0: 12672.5. Samples: 59857216. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:17:02,390][62145] Avg episode reward: [(0, '1385.630')] [2023-03-06 22:17:02,393][62424] Saving new best policy, reward=1385.630! [2023-03-06 22:17:03,100][62475] Updated weights for policy 0, policy_version 58480 (0.0006) [2023-03-06 22:17:03,899][62475] Updated weights for policy 0, policy_version 58490 (0.0006) [2023-03-06 22:17:04,714][62475] Updated weights for policy 0, policy_version 58500 (0.0006) [2023-03-06 22:17:05,534][62475] Updated weights for policy 0, policy_version 58510 (0.0006) [2023-03-06 22:17:06,346][62475] Updated weights for policy 0, policy_version 58520 (0.0006) [2023-03-06 22:17:07,161][62475] Updated weights for policy 0, policy_version 58530 (0.0006) [2023-03-06 22:17:07,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12663.5, 300 sec: 12711.5). Total num frames: 59937792. Throughput: 0: 12667.0. Samples: 59932966. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:17:07,390][62145] Avg episode reward: [(0, '1077.554')] [2023-03-06 22:17:07,944][62475] Updated weights for policy 0, policy_version 58540 (0.0006) [2023-03-06 22:17:08,751][62475] Updated weights for policy 0, policy_version 58550 (0.0006) [2023-03-06 22:17:09,560][62475] Updated weights for policy 0, policy_version 58560 (0.0007) [2023-03-06 22:17:10,347][62475] Updated weights for policy 0, policy_version 58570 (0.0006) [2023-03-06 22:17:11,179][62475] Updated weights for policy 0, policy_version 58580 (0.0007) [2023-03-06 22:17:11,973][62475] Updated weights for policy 0, policy_version 58590 (0.0007) [2023-03-06 22:17:12,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12708.0). Total num frames: 60001280. Throughput: 0: 12670.6. Samples: 59971097. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:17:12,390][62145] Avg episode reward: [(0, '984.556')] [2023-03-06 22:17:12,782][62475] Updated weights for policy 0, policy_version 58600 (0.0006) [2023-03-06 22:17:13,590][62475] Updated weights for policy 0, policy_version 58610 (0.0006) [2023-03-06 22:17:14,382][62475] Updated weights for policy 0, policy_version 58620 (0.0006) [2023-03-06 22:17:15,202][62475] Updated weights for policy 0, policy_version 58630 (0.0006) [2023-03-06 22:17:16,000][62475] Updated weights for policy 0, policy_version 58640 (0.0006) [2023-03-06 22:17:16,818][62475] Updated weights for policy 0, policy_version 58650 (0.0008) [2023-03-06 22:17:17,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12708.0). Total num frames: 60064768. Throughput: 0: 12680.5. Samples: 60047577. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:17:17,390][62145] Avg episode reward: [(0, '1024.668')] [2023-03-06 22:17:17,610][62475] Updated weights for policy 0, policy_version 58660 (0.0005) [2023-03-06 22:17:18,413][62475] Updated weights for policy 0, policy_version 58670 (0.0006) [2023-03-06 22:17:19,225][62475] Updated weights for policy 0, policy_version 58680 (0.0007) [2023-03-06 22:17:20,030][62475] Updated weights for policy 0, policy_version 58690 (0.0007) [2023-03-06 22:17:20,817][62475] Updated weights for policy 0, policy_version 58700 (0.0006) [2023-03-06 22:17:21,646][62475] Updated weights for policy 0, policy_version 58710 (0.0006) [2023-03-06 22:17:22,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12708.0). Total num frames: 60128256. Throughput: 0: 12686.0. Samples: 60123801. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:17:22,401][62145] Avg episode reward: [(0, '1035.240')] [2023-03-06 22:17:22,448][62475] Updated weights for policy 0, policy_version 58720 (0.0006) [2023-03-06 22:17:23,249][62475] Updated weights for policy 0, policy_version 58730 (0.0006) [2023-03-06 22:17:23,738][62424] KL-divergence is very high: 171.5451 [2023-03-06 22:17:24,069][62475] Updated weights for policy 0, policy_version 58740 (0.0006) [2023-03-06 22:17:24,875][62475] Updated weights for policy 0, policy_version 58750 (0.0006) [2023-03-06 22:17:25,685][62475] Updated weights for policy 0, policy_version 58760 (0.0007) [2023-03-06 22:17:26,503][62475] Updated weights for policy 0, policy_version 58770 (0.0006) [2023-03-06 22:17:27,312][62475] Updated weights for policy 0, policy_version 58780 (0.0007) [2023-03-06 22:17:27,389][62145] Fps is (10 sec: 12595.2, 60 sec: 12663.5, 300 sec: 12701.1). Total num frames: 60190720. Throughput: 0: 12680.7. Samples: 60161582. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:17:27,400][62145] Avg episode reward: [(0, '990.708')] [2023-03-06 22:17:28,094][62475] Updated weights for policy 0, policy_version 58790 (0.0007) [2023-03-06 22:17:28,918][62475] Updated weights for policy 0, policy_version 58800 (0.0006) [2023-03-06 22:17:29,731][62475] Updated weights for policy 0, policy_version 58810 (0.0006) [2023-03-06 22:17:30,533][62475] Updated weights for policy 0, policy_version 58820 (0.0006) [2023-03-06 22:17:31,361][62475] Updated weights for policy 0, policy_version 58830 (0.0007) [2023-03-06 22:17:32,165][62475] Updated weights for policy 0, policy_version 58840 (0.0006) [2023-03-06 22:17:32,390][62145] Fps is (10 sec: 12595.2, 60 sec: 12663.5, 300 sec: 12701.1). Total num frames: 60254208. Throughput: 0: 12682.2. Samples: 60237695. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:17:32,401][62145] Avg episode reward: [(0, '956.109')] [2023-03-06 22:17:32,973][62475] Updated weights for policy 0, policy_version 58850 (0.0006) [2023-03-06 22:17:33,794][62475] Updated weights for policy 0, policy_version 58860 (0.0006) [2023-03-06 22:17:34,613][62475] Updated weights for policy 0, policy_version 58870 (0.0006) [2023-03-06 22:17:35,423][62475] Updated weights for policy 0, policy_version 58880 (0.0005) [2023-03-06 22:17:36,228][62475] Updated weights for policy 0, policy_version 58890 (0.0006) [2023-03-06 22:17:37,029][62475] Updated weights for policy 0, policy_version 58900 (0.0006) [2023-03-06 22:17:37,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12680.5, 300 sec: 12701.1). Total num frames: 60317696. Throughput: 0: 12674.3. Samples: 60313407. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:17:37,390][62145] Avg episode reward: [(0, '1073.416')] [2023-03-06 22:17:37,841][62475] Updated weights for policy 0, policy_version 58910 (0.0006) [2023-03-06 22:17:38,650][62475] Updated weights for policy 0, policy_version 58920 (0.0006) [2023-03-06 22:17:39,483][62475] Updated weights for policy 0, policy_version 58930 (0.0006) [2023-03-06 22:17:40,281][62475] Updated weights for policy 0, policy_version 58940 (0.0006) [2023-03-06 22:17:41,083][62475] Updated weights for policy 0, policy_version 58950 (0.0007) [2023-03-06 22:17:41,893][62475] Updated weights for policy 0, policy_version 58960 (0.0006) [2023-03-06 22:17:42,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12663.5, 300 sec: 12701.1). Total num frames: 60381184. Throughput: 0: 12665.1. Samples: 60351162. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:17:42,390][62145] Avg episode reward: [(0, '871.513')] [2023-03-06 22:17:42,706][62475] Updated weights for policy 0, policy_version 58970 (0.0006) [2023-03-06 22:17:43,493][62475] Updated weights for policy 0, policy_version 58980 (0.0006) [2023-03-06 22:17:44,293][62475] Updated weights for policy 0, policy_version 58990 (0.0005) [2023-03-06 22:17:45,114][62475] Updated weights for policy 0, policy_version 59000 (0.0006) [2023-03-06 22:17:45,926][62475] Updated weights for policy 0, policy_version 59010 (0.0006) [2023-03-06 22:17:46,738][62475] Updated weights for policy 0, policy_version 59020 (0.0006) [2023-03-06 22:17:47,390][62145] Fps is (10 sec: 12697.7, 60 sec: 12680.5, 300 sec: 12701.1). Total num frames: 60444672. Throughput: 0: 12671.0. Samples: 60427413. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:17:47,390][62145] Avg episode reward: [(0, '978.109')] [2023-03-06 22:17:47,543][62475] Updated weights for policy 0, policy_version 59030 (0.0007) [2023-03-06 22:17:48,341][62475] Updated weights for policy 0, policy_version 59040 (0.0006) [2023-03-06 22:17:49,146][62475] Updated weights for policy 0, policy_version 59050 (0.0006) [2023-03-06 22:17:49,959][62475] Updated weights for policy 0, policy_version 59060 (0.0007) [2023-03-06 22:17:50,783][62475] Updated weights for policy 0, policy_version 59070 (0.0006) [2023-03-06 22:17:51,598][62475] Updated weights for policy 0, policy_version 59080 (0.0006) [2023-03-06 22:17:52,390][62145] Fps is (10 sec: 12595.1, 60 sec: 12663.5, 300 sec: 12697.6). Total num frames: 60507136. Throughput: 0: 12673.3. Samples: 60503267. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:17:52,390][62145] Avg episode reward: [(0, '1093.960')] [2023-03-06 22:17:52,399][62475] Updated weights for policy 0, policy_version 59090 (0.0005) [2023-03-06 22:17:53,205][62475] Updated weights for policy 0, policy_version 59100 (0.0006) [2023-03-06 22:17:54,017][62475] Updated weights for policy 0, policy_version 59110 (0.0006) [2023-03-06 22:17:54,822][62475] Updated weights for policy 0, policy_version 59120 (0.0006) [2023-03-06 22:17:55,633][62475] Updated weights for policy 0, policy_version 59130 (0.0006) [2023-03-06 22:17:56,426][62475] Updated weights for policy 0, policy_version 59140 (0.0006) [2023-03-06 22:17:57,241][62475] Updated weights for policy 0, policy_version 59150 (0.0006) [2023-03-06 22:17:57,389][62145] Fps is (10 sec: 12595.3, 60 sec: 12663.5, 300 sec: 12694.1). Total num frames: 60570624. Throughput: 0: 12669.1. Samples: 60541207. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:17:57,390][62145] Avg episode reward: [(0, '1051.590')] [2023-03-06 22:17:58,050][62475] Updated weights for policy 0, policy_version 59160 (0.0006) [2023-03-06 22:17:58,853][62475] Updated weights for policy 0, policy_version 59170 (0.0007) [2023-03-06 22:17:59,676][62475] Updated weights for policy 0, policy_version 59180 (0.0006) [2023-03-06 22:18:00,477][62475] Updated weights for policy 0, policy_version 59190 (0.0006) [2023-03-06 22:18:01,254][62475] Updated weights for policy 0, policy_version 59200 (0.0006) [2023-03-06 22:18:02,057][62475] Updated weights for policy 0, policy_version 59210 (0.0006) [2023-03-06 22:18:02,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12680.5, 300 sec: 12697.6). Total num frames: 60635136. Throughput: 0: 12669.5. Samples: 60617704. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:18:02,390][62145] Avg episode reward: [(0, '1150.057')] [2023-03-06 22:18:02,866][62475] Updated weights for policy 0, policy_version 59220 (0.0006) [2023-03-06 22:18:03,662][62475] Updated weights for policy 0, policy_version 59230 (0.0007) [2023-03-06 22:18:04,479][62475] Updated weights for policy 0, policy_version 59240 (0.0006) [2023-03-06 22:18:05,298][62475] Updated weights for policy 0, policy_version 59250 (0.0006) [2023-03-06 22:18:06,099][62475] Updated weights for policy 0, policy_version 59260 (0.0006) [2023-03-06 22:18:06,896][62475] Updated weights for policy 0, policy_version 59270 (0.0006) [2023-03-06 22:18:07,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12680.5, 300 sec: 12697.6). Total num frames: 60698624. Throughput: 0: 12667.1. Samples: 60693821. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:18:07,390][62145] Avg episode reward: [(0, '1186.812')] [2023-03-06 22:18:07,706][62475] Updated weights for policy 0, policy_version 59280 (0.0006) [2023-03-06 22:18:08,521][62475] Updated weights for policy 0, policy_version 59290 (0.0006) [2023-03-06 22:18:09,325][62475] Updated weights for policy 0, policy_version 59300 (0.0006) [2023-03-06 22:18:10,139][62475] Updated weights for policy 0, policy_version 59310 (0.0006) [2023-03-06 22:18:10,937][62475] Updated weights for policy 0, policy_version 59320 (0.0006) [2023-03-06 22:18:11,759][62475] Updated weights for policy 0, policy_version 59330 (0.0006) [2023-03-06 22:18:12,390][62145] Fps is (10 sec: 12595.1, 60 sec: 12663.4, 300 sec: 12694.1). Total num frames: 60761088. Throughput: 0: 12671.9. Samples: 60731821. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:18:12,390][62145] Avg episode reward: [(0, '884.509')] [2023-03-06 22:18:12,566][62475] Updated weights for policy 0, policy_version 59340 (0.0008) [2023-03-06 22:18:13,367][62475] Updated weights for policy 0, policy_version 59350 (0.0006) [2023-03-06 22:18:14,168][62475] Updated weights for policy 0, policy_version 59360 (0.0006) [2023-03-06 22:18:14,985][62475] Updated weights for policy 0, policy_version 59370 (0.0006) [2023-03-06 22:18:15,789][62475] Updated weights for policy 0, policy_version 59380 (0.0006) [2023-03-06 22:18:16,605][62475] Updated weights for policy 0, policy_version 59390 (0.0006) [2023-03-06 22:18:17,390][62145] Fps is (10 sec: 12595.0, 60 sec: 12663.4, 300 sec: 12694.1). Total num frames: 60824576. Throughput: 0: 12670.3. Samples: 60807859. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:18:17,390][62145] Avg episode reward: [(0, '1107.589')] [2023-03-06 22:18:17,416][62475] Updated weights for policy 0, policy_version 59400 (0.0007) [2023-03-06 22:18:18,217][62475] Updated weights for policy 0, policy_version 59410 (0.0006) [2023-03-06 22:18:19,044][62475] Updated weights for policy 0, policy_version 59420 (0.0007) [2023-03-06 22:18:19,834][62475] Updated weights for policy 0, policy_version 59430 (0.0007) [2023-03-06 22:18:20,630][62475] Updated weights for policy 0, policy_version 59440 (0.0006) [2023-03-06 22:18:21,441][62475] Updated weights for policy 0, policy_version 59450 (0.0006) [2023-03-06 22:18:22,258][62475] Updated weights for policy 0, policy_version 59460 (0.0006) [2023-03-06 22:18:22,390][62145] Fps is (10 sec: 12697.7, 60 sec: 12663.5, 300 sec: 12694.1). Total num frames: 60888064. Throughput: 0: 12679.2. Samples: 60883970. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:18:22,390][62145] Avg episode reward: [(0, '759.390')] [2023-03-06 22:18:22,393][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000059461_60888064.pth... [2023-03-06 22:18:22,424][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000056488_57843712.pth [2023-03-06 22:18:23,059][62475] Updated weights for policy 0, policy_version 59470 (0.0006) [2023-03-06 22:18:23,857][62475] Updated weights for policy 0, policy_version 59480 (0.0006) [2023-03-06 22:18:24,677][62475] Updated weights for policy 0, policy_version 59490 (0.0006) [2023-03-06 22:18:25,493][62475] Updated weights for policy 0, policy_version 59500 (0.0006) [2023-03-06 22:18:26,291][62475] Updated weights for policy 0, policy_version 59510 (0.0007) [2023-03-06 22:18:27,114][62475] Updated weights for policy 0, policy_version 59520 (0.0007) [2023-03-06 22:18:27,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12694.1). Total num frames: 60951552. Throughput: 0: 12684.7. Samples: 60921972. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:18:27,390][62145] Avg episode reward: [(0, '828.297')] [2023-03-06 22:18:27,915][62475] Updated weights for policy 0, policy_version 59530 (0.0006) [2023-03-06 22:18:28,705][62475] Updated weights for policy 0, policy_version 59540 (0.0006) [2023-03-06 22:18:29,541][62475] Updated weights for policy 0, policy_version 59550 (0.0007) [2023-03-06 22:18:30,354][62475] Updated weights for policy 0, policy_version 59560 (0.0006) [2023-03-06 22:18:31,158][62475] Updated weights for policy 0, policy_version 59570 (0.0006) [2023-03-06 22:18:31,962][62475] Updated weights for policy 0, policy_version 59580 (0.0006) [2023-03-06 22:18:32,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12694.1). Total num frames: 61015040. Throughput: 0: 12676.6. Samples: 60997858. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:18:32,390][62145] Avg episode reward: [(0, '990.566')] [2023-03-06 22:18:32,782][62475] Updated weights for policy 0, policy_version 59590 (0.0006) [2023-03-06 22:18:33,581][62475] Updated weights for policy 0, policy_version 59600 (0.0006) [2023-03-06 22:18:34,408][62475] Updated weights for policy 0, policy_version 59610 (0.0008) [2023-03-06 22:18:35,213][62475] Updated weights for policy 0, policy_version 59620 (0.0006) [2023-03-06 22:18:36,018][62475] Updated weights for policy 0, policy_version 59630 (0.0006) [2023-03-06 22:18:36,804][62475] Updated weights for policy 0, policy_version 59640 (0.0006) [2023-03-06 22:18:37,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12680.6, 300 sec: 12690.7). Total num frames: 61078528. Throughput: 0: 12679.7. Samples: 61073852. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:18:37,390][62145] Avg episode reward: [(0, '868.740')] [2023-03-06 22:18:37,628][62475] Updated weights for policy 0, policy_version 59650 (0.0007) [2023-03-06 22:18:38,431][62475] Updated weights for policy 0, policy_version 59660 (0.0006) [2023-03-06 22:18:39,232][62475] Updated weights for policy 0, policy_version 59670 (0.0006) [2023-03-06 22:18:40,028][62475] Updated weights for policy 0, policy_version 59680 (0.0007) [2023-03-06 22:18:40,831][62475] Updated weights for policy 0, policy_version 59690 (0.0006) [2023-03-06 22:18:41,626][62475] Updated weights for policy 0, policy_version 59700 (0.0006) [2023-03-06 22:18:42,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12690.7). Total num frames: 61142016. Throughput: 0: 12686.5. Samples: 61112100. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:18:42,390][62145] Avg episode reward: [(0, '820.430')] [2023-03-06 22:18:42,432][62475] Updated weights for policy 0, policy_version 59710 (0.0006) [2023-03-06 22:18:43,238][62475] Updated weights for policy 0, policy_version 59720 (0.0008) [2023-03-06 22:18:44,032][62475] Updated weights for policy 0, policy_version 59730 (0.0006) [2023-03-06 22:18:44,834][62475] Updated weights for policy 0, policy_version 59740 (0.0007) [2023-03-06 22:18:45,642][62475] Updated weights for policy 0, policy_version 59750 (0.0006) [2023-03-06 22:18:46,464][62475] Updated weights for policy 0, policy_version 59760 (0.0006) [2023-03-06 22:18:47,254][62475] Updated weights for policy 0, policy_version 59770 (0.0006) [2023-03-06 22:18:47,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12690.7). Total num frames: 61205504. Throughput: 0: 12688.6. Samples: 61188690. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:18:47,390][62145] Avg episode reward: [(0, '893.406')] [2023-03-06 22:18:48,043][62475] Updated weights for policy 0, policy_version 59780 (0.0005) [2023-03-06 22:18:48,859][62475] Updated weights for policy 0, policy_version 59790 (0.0007) [2023-03-06 22:18:49,661][62475] Updated weights for policy 0, policy_version 59800 (0.0007) [2023-03-06 22:18:50,495][62475] Updated weights for policy 0, policy_version 59810 (0.0007) [2023-03-06 22:18:51,293][62475] Updated weights for policy 0, policy_version 59820 (0.0006) [2023-03-06 22:18:52,096][62475] Updated weights for policy 0, policy_version 59830 (0.0007) [2023-03-06 22:18:52,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12690.7). Total num frames: 61268992. Throughput: 0: 12693.8. Samples: 61265041. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:18:52,390][62145] Avg episode reward: [(0, '954.489')] [2023-03-06 22:18:52,902][62475] Updated weights for policy 0, policy_version 59840 (0.0006) [2023-03-06 22:18:53,702][62475] Updated weights for policy 0, policy_version 59850 (0.0006) [2023-03-06 22:18:54,488][62475] Updated weights for policy 0, policy_version 59860 (0.0007) [2023-03-06 22:18:55,306][62475] Updated weights for policy 0, policy_version 59870 (0.0006) [2023-03-06 22:18:56,100][62475] Updated weights for policy 0, policy_version 59880 (0.0006) [2023-03-06 22:18:56,911][62475] Updated weights for policy 0, policy_version 59890 (0.0007) [2023-03-06 22:18:57,390][62145] Fps is (10 sec: 12697.4, 60 sec: 12697.6, 300 sec: 12690.7). Total num frames: 61332480. Throughput: 0: 12695.8. Samples: 61303129. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:18:57,401][62145] Avg episode reward: [(0, '881.165')] [2023-03-06 22:18:57,714][62475] Updated weights for policy 0, policy_version 59900 (0.0008) [2023-03-06 22:18:58,518][62475] Updated weights for policy 0, policy_version 59910 (0.0007) [2023-03-06 22:18:59,315][62475] Updated weights for policy 0, policy_version 59920 (0.0006) [2023-03-06 22:19:00,140][62475] Updated weights for policy 0, policy_version 59930 (0.0007) [2023-03-06 22:19:00,939][62475] Updated weights for policy 0, policy_version 59940 (0.0006) [2023-03-06 22:19:01,748][62475] Updated weights for policy 0, policy_version 59950 (0.0006) [2023-03-06 22:19:02,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12680.5, 300 sec: 12690.7). Total num frames: 61395968. Throughput: 0: 12703.5. Samples: 61379515. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:19:02,401][62145] Avg episode reward: [(0, '756.942')] [2023-03-06 22:19:02,577][62475] Updated weights for policy 0, policy_version 59960 (0.0006) [2023-03-06 22:19:03,386][62475] Updated weights for policy 0, policy_version 59970 (0.0006) [2023-03-06 22:19:04,198][62475] Updated weights for policy 0, policy_version 59980 (0.0007) [2023-03-06 22:19:05,025][62475] Updated weights for policy 0, policy_version 59990 (0.0007) [2023-03-06 22:19:05,810][62475] Updated weights for policy 0, policy_version 60000 (0.0006) [2023-03-06 22:19:06,605][62475] Updated weights for policy 0, policy_version 60010 (0.0006) [2023-03-06 22:19:07,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12680.5, 300 sec: 12690.7). Total num frames: 61459456. Throughput: 0: 12696.5. Samples: 61455313. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:19:07,401][62145] Avg episode reward: [(0, '1001.258')] [2023-03-06 22:19:07,429][62475] Updated weights for policy 0, policy_version 60020 (0.0006) [2023-03-06 22:19:08,215][62475] Updated weights for policy 0, policy_version 60030 (0.0006) [2023-03-06 22:19:09,029][62475] Updated weights for policy 0, policy_version 60040 (0.0006) [2023-03-06 22:19:09,844][62475] Updated weights for policy 0, policy_version 60050 (0.0006) [2023-03-06 22:19:10,651][62475] Updated weights for policy 0, policy_version 60060 (0.0006) [2023-03-06 22:19:11,454][62475] Updated weights for policy 0, policy_version 60070 (0.0006) [2023-03-06 22:19:12,261][62475] Updated weights for policy 0, policy_version 60080 (0.0007) [2023-03-06 22:19:12,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12687.2). Total num frames: 61522944. Throughput: 0: 12696.3. Samples: 61493306. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:19:12,400][62145] Avg episode reward: [(0, '923.814')] [2023-03-06 22:19:13,074][62475] Updated weights for policy 0, policy_version 60090 (0.0006) [2023-03-06 22:19:13,895][62475] Updated weights for policy 0, policy_version 60100 (0.0006) [2023-03-06 22:19:14,710][62475] Updated weights for policy 0, policy_version 60110 (0.0006) [2023-03-06 22:19:15,529][62475] Updated weights for policy 0, policy_version 60120 (0.0006) [2023-03-06 22:19:16,330][62475] Updated weights for policy 0, policy_version 60130 (0.0007) [2023-03-06 22:19:17,130][62475] Updated weights for policy 0, policy_version 60140 (0.0007) [2023-03-06 22:19:17,389][62145] Fps is (10 sec: 12595.2, 60 sec: 12680.5, 300 sec: 12683.7). Total num frames: 61585408. Throughput: 0: 12696.4. Samples: 61569194. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:19:17,400][62145] Avg episode reward: [(0, '1088.011')] [2023-03-06 22:19:17,957][62475] Updated weights for policy 0, policy_version 60150 (0.0007) [2023-03-06 22:19:18,767][62475] Updated weights for policy 0, policy_version 60160 (0.0006) [2023-03-06 22:19:19,581][62475] Updated weights for policy 0, policy_version 60170 (0.0006) [2023-03-06 22:19:20,402][62475] Updated weights for policy 0, policy_version 60180 (0.0007) [2023-03-06 22:19:21,208][62475] Updated weights for policy 0, policy_version 60190 (0.0007) [2023-03-06 22:19:22,000][62475] Updated weights for policy 0, policy_version 60200 (0.0006) [2023-03-06 22:19:22,389][62145] Fps is (10 sec: 12595.2, 60 sec: 12680.5, 300 sec: 12683.7). Total num frames: 61648896. Throughput: 0: 12688.8. Samples: 61644846. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:19:22,400][62145] Avg episode reward: [(0, '1099.065')] [2023-03-06 22:19:22,819][62475] Updated weights for policy 0, policy_version 60210 (0.0007) [2023-03-06 22:19:23,621][62475] Updated weights for policy 0, policy_version 60220 (0.0006) [2023-03-06 22:19:24,438][62475] Updated weights for policy 0, policy_version 60230 (0.0007) [2023-03-06 22:19:25,246][62475] Updated weights for policy 0, policy_version 60240 (0.0006) [2023-03-06 22:19:26,064][62475] Updated weights for policy 0, policy_version 60250 (0.0006) [2023-03-06 22:19:26,865][62475] Updated weights for policy 0, policy_version 60260 (0.0006) [2023-03-06 22:19:27,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12683.7). Total num frames: 61712384. Throughput: 0: 12681.6. Samples: 61682773. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:19:27,390][62145] Avg episode reward: [(0, '1183.860')] [2023-03-06 22:19:27,682][62475] Updated weights for policy 0, policy_version 60270 (0.0006) [2023-03-06 22:19:28,489][62475] Updated weights for policy 0, policy_version 60280 (0.0006) [2023-03-06 22:19:29,297][62475] Updated weights for policy 0, policy_version 60290 (0.0006) [2023-03-06 22:19:30,108][62475] Updated weights for policy 0, policy_version 60300 (0.0006) [2023-03-06 22:19:30,894][62475] Updated weights for policy 0, policy_version 60310 (0.0006) [2023-03-06 22:19:31,689][62475] Updated weights for policy 0, policy_version 60320 (0.0006) [2023-03-06 22:19:32,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12683.7). Total num frames: 61775872. Throughput: 0: 12670.5. Samples: 61758865. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:19:32,390][62145] Avg episode reward: [(0, '1036.466')] [2023-03-06 22:19:32,515][62475] Updated weights for policy 0, policy_version 60330 (0.0006) [2023-03-06 22:19:33,325][62475] Updated weights for policy 0, policy_version 60340 (0.0006) [2023-03-06 22:19:34,126][62475] Updated weights for policy 0, policy_version 60350 (0.0006) [2023-03-06 22:19:34,944][62475] Updated weights for policy 0, policy_version 60360 (0.0006) [2023-03-06 22:19:35,740][62475] Updated weights for policy 0, policy_version 60370 (0.0006) [2023-03-06 22:19:36,567][62475] Updated weights for policy 0, policy_version 60380 (0.0006) [2023-03-06 22:19:37,347][62475] Updated weights for policy 0, policy_version 60390 (0.0006) [2023-03-06 22:19:37,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12683.7). Total num frames: 61839360. Throughput: 0: 12665.2. Samples: 61834975. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:19:37,390][62145] Avg episode reward: [(0, '929.131')] [2023-03-06 22:19:38,178][62475] Updated weights for policy 0, policy_version 60400 (0.0006) [2023-03-06 22:19:38,974][62475] Updated weights for policy 0, policy_version 60410 (0.0007) [2023-03-06 22:19:39,799][62475] Updated weights for policy 0, policy_version 60420 (0.0006) [2023-03-06 22:19:40,603][62475] Updated weights for policy 0, policy_version 60430 (0.0006) [2023-03-06 22:19:41,392][62475] Updated weights for policy 0, policy_version 60440 (0.0006) [2023-03-06 22:19:42,185][62475] Updated weights for policy 0, policy_version 60450 (0.0006) [2023-03-06 22:19:42,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12683.7). Total num frames: 61902848. Throughput: 0: 12661.7. Samples: 61872904. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:19:42,390][62145] Avg episode reward: [(0, '1089.245')] [2023-03-06 22:19:42,996][62475] Updated weights for policy 0, policy_version 60460 (0.0006) [2023-03-06 22:19:43,798][62475] Updated weights for policy 0, policy_version 60470 (0.0006) [2023-03-06 22:19:44,601][62475] Updated weights for policy 0, policy_version 60480 (0.0006) [2023-03-06 22:19:45,427][62475] Updated weights for policy 0, policy_version 60490 (0.0007) [2023-03-06 22:19:46,235][62475] Updated weights for policy 0, policy_version 60500 (0.0007) [2023-03-06 22:19:47,034][62475] Updated weights for policy 0, policy_version 60510 (0.0006) [2023-03-06 22:19:47,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12680.2). Total num frames: 61966336. Throughput: 0: 12661.8. Samples: 61949295. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:19:47,390][62145] Avg episode reward: [(0, '1268.427')] [2023-03-06 22:19:47,842][62475] Updated weights for policy 0, policy_version 60520 (0.0006) [2023-03-06 22:19:48,670][62475] Updated weights for policy 0, policy_version 60530 (0.0006) [2023-03-06 22:19:49,458][62475] Updated weights for policy 0, policy_version 60540 (0.0006) [2023-03-06 22:19:50,272][62475] Updated weights for policy 0, policy_version 60550 (0.0006) [2023-03-06 22:19:51,078][62475] Updated weights for policy 0, policy_version 60560 (0.0006) [2023-03-06 22:19:51,873][62475] Updated weights for policy 0, policy_version 60570 (0.0007) [2023-03-06 22:19:52,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12680.5, 300 sec: 12680.2). Total num frames: 62029824. Throughput: 0: 12668.5. Samples: 62025395. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:19:52,390][62145] Avg episode reward: [(0, '1094.845')] [2023-03-06 22:19:52,695][62475] Updated weights for policy 0, policy_version 60580 (0.0006) [2023-03-06 22:19:53,493][62475] Updated weights for policy 0, policy_version 60590 (0.0006) [2023-03-06 22:19:54,321][62475] Updated weights for policy 0, policy_version 60600 (0.0006) [2023-03-06 22:19:55,126][62475] Updated weights for policy 0, policy_version 60610 (0.0006) [2023-03-06 22:19:55,950][62475] Updated weights for policy 0, policy_version 60620 (0.0007) [2023-03-06 22:19:56,743][62475] Updated weights for policy 0, policy_version 60630 (0.0006) [2023-03-06 22:19:57,390][62145] Fps is (10 sec: 12595.1, 60 sec: 12663.5, 300 sec: 12676.8). Total num frames: 62092288. Throughput: 0: 12662.3. Samples: 62063109. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:19:57,390][62145] Avg episode reward: [(0, '1144.295')] [2023-03-06 22:19:57,554][62475] Updated weights for policy 0, policy_version 60640 (0.0006) [2023-03-06 22:19:58,374][62475] Updated weights for policy 0, policy_version 60650 (0.0006) [2023-03-06 22:19:59,182][62475] Updated weights for policy 0, policy_version 60660 (0.0006) [2023-03-06 22:19:59,997][62475] Updated weights for policy 0, policy_version 60670 (0.0006) [2023-03-06 22:20:00,816][62475] Updated weights for policy 0, policy_version 60680 (0.0007) [2023-03-06 22:20:01,616][62475] Updated weights for policy 0, policy_version 60690 (0.0007) [2023-03-06 22:20:02,390][62145] Fps is (10 sec: 12595.0, 60 sec: 12663.5, 300 sec: 12676.8). Total num frames: 62155776. Throughput: 0: 12659.8. Samples: 62138885. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:20:02,390][62145] Avg episode reward: [(0, '1309.234')] [2023-03-06 22:20:02,420][62475] Updated weights for policy 0, policy_version 60700 (0.0006) [2023-03-06 22:20:03,246][62475] Updated weights for policy 0, policy_version 60710 (0.0006) [2023-03-06 22:20:04,034][62475] Updated weights for policy 0, policy_version 60720 (0.0006) [2023-03-06 22:20:04,822][62475] Updated weights for policy 0, policy_version 60730 (0.0006) [2023-03-06 22:20:05,653][62475] Updated weights for policy 0, policy_version 60740 (0.0006) [2023-03-06 22:20:06,463][62475] Updated weights for policy 0, policy_version 60750 (0.0006) [2023-03-06 22:20:07,273][62475] Updated weights for policy 0, policy_version 60760 (0.0006) [2023-03-06 22:20:07,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12663.5, 300 sec: 12676.8). Total num frames: 62219264. Throughput: 0: 12669.5. Samples: 62214972. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:20:07,390][62145] Avg episode reward: [(0, '1290.975')] [2023-03-06 22:20:08,083][62475] Updated weights for policy 0, policy_version 60770 (0.0006) [2023-03-06 22:20:08,890][62475] Updated weights for policy 0, policy_version 60780 (0.0006) [2023-03-06 22:20:09,693][62475] Updated weights for policy 0, policy_version 60790 (0.0006) [2023-03-06 22:20:10,485][62475] Updated weights for policy 0, policy_version 60800 (0.0006) [2023-03-06 22:20:11,308][62475] Updated weights for policy 0, policy_version 60810 (0.0006) [2023-03-06 22:20:12,113][62475] Updated weights for policy 0, policy_version 60820 (0.0006) [2023-03-06 22:20:12,390][62145] Fps is (10 sec: 12697.7, 60 sec: 12663.5, 300 sec: 12676.8). Total num frames: 62282752. Throughput: 0: 12672.6. Samples: 62253040. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:20:12,390][62145] Avg episode reward: [(0, '1251.518')] [2023-03-06 22:20:12,911][62475] Updated weights for policy 0, policy_version 60830 (0.0008) [2023-03-06 22:20:13,732][62475] Updated weights for policy 0, policy_version 60840 (0.0006) [2023-03-06 22:20:14,570][62475] Updated weights for policy 0, policy_version 60850 (0.0007) [2023-03-06 22:20:15,382][62475] Updated weights for policy 0, policy_version 60860 (0.0006) [2023-03-06 22:20:16,183][62475] Updated weights for policy 0, policy_version 60870 (0.0006) [2023-03-06 22:20:16,995][62475] Updated weights for policy 0, policy_version 60880 (0.0006) [2023-03-06 22:20:17,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12680.5, 300 sec: 12676.8). Total num frames: 62346240. Throughput: 0: 12663.4. Samples: 62328717. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:20:17,390][62145] Avg episode reward: [(0, '974.064')] [2023-03-06 22:20:17,805][62475] Updated weights for policy 0, policy_version 60890 (0.0006) [2023-03-06 22:20:18,608][62475] Updated weights for policy 0, policy_version 60900 (0.0005) [2023-03-06 22:20:19,402][62475] Updated weights for policy 0, policy_version 60910 (0.0006) [2023-03-06 22:20:20,213][62475] Updated weights for policy 0, policy_version 60920 (0.0007) [2023-03-06 22:20:21,014][62475] Updated weights for policy 0, policy_version 60930 (0.0006) [2023-03-06 22:20:21,825][62475] Updated weights for policy 0, policy_version 60940 (0.0007) [2023-03-06 22:20:22,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12676.8). Total num frames: 62409728. Throughput: 0: 12666.0. Samples: 62404946. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:20:22,390][62145] Avg episode reward: [(0, '1193.810')] [2023-03-06 22:20:22,393][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000060947_62409728.pth... [2023-03-06 22:20:22,426][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000057976_59367424.pth [2023-03-06 22:20:22,632][62475] Updated weights for policy 0, policy_version 60950 (0.0006) [2023-03-06 22:20:23,444][62475] Updated weights for policy 0, policy_version 60960 (0.0006) [2023-03-06 22:20:24,233][62475] Updated weights for policy 0, policy_version 60970 (0.0006) [2023-03-06 22:20:25,056][62475] Updated weights for policy 0, policy_version 60980 (0.0007) [2023-03-06 22:20:25,874][62475] Updated weights for policy 0, policy_version 60990 (0.0006) [2023-03-06 22:20:26,699][62475] Updated weights for policy 0, policy_version 61000 (0.0006) [2023-03-06 22:20:26,846][62424] KL-divergence is very high: 558.1483 [2023-03-06 22:20:27,389][62145] Fps is (10 sec: 12595.3, 60 sec: 12663.5, 300 sec: 12673.3). Total num frames: 62472192. Throughput: 0: 12663.7. Samples: 62442768. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:20:27,390][62145] Avg episode reward: [(0, '1182.414')] [2023-03-06 22:20:27,497][62475] Updated weights for policy 0, policy_version 61010 (0.0007) [2023-03-06 22:20:28,306][62475] Updated weights for policy 0, policy_version 61020 (0.0006) [2023-03-06 22:20:29,134][62475] Updated weights for policy 0, policy_version 61030 (0.0006) [2023-03-06 22:20:29,931][62475] Updated weights for policy 0, policy_version 61040 (0.0008) [2023-03-06 22:20:30,737][62475] Updated weights for policy 0, policy_version 61050 (0.0006) [2023-03-06 22:20:31,538][62475] Updated weights for policy 0, policy_version 61060 (0.0007) [2023-03-06 22:20:32,342][62475] Updated weights for policy 0, policy_version 61070 (0.0006) [2023-03-06 22:20:32,390][62145] Fps is (10 sec: 12595.2, 60 sec: 12663.5, 300 sec: 12673.3). Total num frames: 62535680. Throughput: 0: 12654.4. Samples: 62518743. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:20:32,390][62145] Avg episode reward: [(0, '1184.159')] [2023-03-06 22:20:33,156][62475] Updated weights for policy 0, policy_version 61080 (0.0006) [2023-03-06 22:20:33,962][62475] Updated weights for policy 0, policy_version 61090 (0.0006) [2023-03-06 22:20:34,750][62475] Updated weights for policy 0, policy_version 61100 (0.0006) [2023-03-06 22:20:35,583][62475] Updated weights for policy 0, policy_version 61110 (0.0006) [2023-03-06 22:20:36,405][62475] Updated weights for policy 0, policy_version 61120 (0.0007) [2023-03-06 22:20:37,209][62475] Updated weights for policy 0, policy_version 61130 (0.0006) [2023-03-06 22:20:37,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12663.5, 300 sec: 12673.3). Total num frames: 62599168. Throughput: 0: 12649.7. Samples: 62594635. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:20:37,390][62145] Avg episode reward: [(0, '1373.110')] [2023-03-06 22:20:38,023][62475] Updated weights for policy 0, policy_version 61140 (0.0007) [2023-03-06 22:20:38,811][62475] Updated weights for policy 0, policy_version 61150 (0.0007) [2023-03-06 22:20:39,600][62475] Updated weights for policy 0, policy_version 61160 (0.0007) [2023-03-06 22:20:40,432][62475] Updated weights for policy 0, policy_version 61170 (0.0006) [2023-03-06 22:20:41,222][62475] Updated weights for policy 0, policy_version 61180 (0.0006) [2023-03-06 22:20:42,034][62475] Updated weights for policy 0, policy_version 61190 (0.0006) [2023-03-06 22:20:42,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12663.5, 300 sec: 12673.3). Total num frames: 62662656. Throughput: 0: 12663.7. Samples: 62632974. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:20:42,390][62145] Avg episode reward: [(0, '1206.584')] [2023-03-06 22:20:42,851][62475] Updated weights for policy 0, policy_version 61200 (0.0006) [2023-03-06 22:20:43,650][62475] Updated weights for policy 0, policy_version 61210 (0.0006) [2023-03-06 22:20:44,471][62475] Updated weights for policy 0, policy_version 61220 (0.0006) [2023-03-06 22:20:45,270][62475] Updated weights for policy 0, policy_version 61230 (0.0006) [2023-03-06 22:20:46,078][62475] Updated weights for policy 0, policy_version 61240 (0.0005) [2023-03-06 22:20:46,898][62475] Updated weights for policy 0, policy_version 61250 (0.0006) [2023-03-06 22:20:47,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12663.5, 300 sec: 12673.3). Total num frames: 62726144. Throughput: 0: 12666.4. Samples: 62708871. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:20:47,390][62145] Avg episode reward: [(0, '1293.724')] [2023-03-06 22:20:47,695][62475] Updated weights for policy 0, policy_version 61260 (0.0006) [2023-03-06 22:20:48,510][62475] Updated weights for policy 0, policy_version 61270 (0.0006) [2023-03-06 22:20:49,311][62475] Updated weights for policy 0, policy_version 61280 (0.0005) [2023-03-06 22:20:50,125][62475] Updated weights for policy 0, policy_version 61290 (0.0006) [2023-03-06 22:20:50,930][62475] Updated weights for policy 0, policy_version 61300 (0.0007) [2023-03-06 22:20:51,735][62475] Updated weights for policy 0, policy_version 61310 (0.0006) [2023-03-06 22:20:52,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12663.5, 300 sec: 12673.3). Total num frames: 62789632. Throughput: 0: 12666.4. Samples: 62784958. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:20:52,390][62145] Avg episode reward: [(0, '1299.984')] [2023-03-06 22:20:52,541][62475] Updated weights for policy 0, policy_version 61320 (0.0007) [2023-03-06 22:20:53,331][62475] Updated weights for policy 0, policy_version 61330 (0.0006) [2023-03-06 22:20:54,150][62475] Updated weights for policy 0, policy_version 61340 (0.0006) [2023-03-06 22:20:54,953][62475] Updated weights for policy 0, policy_version 61350 (0.0006) [2023-03-06 22:20:55,771][62475] Updated weights for policy 0, policy_version 61360 (0.0006) [2023-03-06 22:20:56,585][62475] Updated weights for policy 0, policy_version 61370 (0.0006) [2023-03-06 22:20:57,374][62475] Updated weights for policy 0, policy_version 61380 (0.0006) [2023-03-06 22:20:57,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12680.6, 300 sec: 12673.3). Total num frames: 62853120. Throughput: 0: 12666.7. Samples: 62823040. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:20:57,390][62145] Avg episode reward: [(0, '1364.457')] [2023-03-06 22:20:58,201][62475] Updated weights for policy 0, policy_version 61390 (0.0006) [2023-03-06 22:20:59,000][62475] Updated weights for policy 0, policy_version 61400 (0.0006) [2023-03-06 22:20:59,805][62475] Updated weights for policy 0, policy_version 61410 (0.0006) [2023-03-06 22:21:00,638][62475] Updated weights for policy 0, policy_version 61420 (0.0006) [2023-03-06 22:21:01,445][62475] Updated weights for policy 0, policy_version 61430 (0.0008) [2023-03-06 22:21:02,236][62475] Updated weights for policy 0, policy_version 61440 (0.0006) [2023-03-06 22:21:02,390][62145] Fps is (10 sec: 12595.0, 60 sec: 12663.5, 300 sec: 12669.8). Total num frames: 62915584. Throughput: 0: 12670.4. Samples: 62898886. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:21:02,390][62145] Avg episode reward: [(0, '1241.958')] [2023-03-06 22:21:03,060][62475] Updated weights for policy 0, policy_version 61450 (0.0006) [2023-03-06 22:21:03,859][62475] Updated weights for policy 0, policy_version 61460 (0.0006) [2023-03-06 22:21:04,665][62475] Updated weights for policy 0, policy_version 61470 (0.0006) [2023-03-06 22:21:05,499][62475] Updated weights for policy 0, policy_version 61480 (0.0007) [2023-03-06 22:21:06,287][62475] Updated weights for policy 0, policy_version 61490 (0.0006) [2023-03-06 22:21:07,118][62475] Updated weights for policy 0, policy_version 61500 (0.0007) [2023-03-06 22:21:07,390][62145] Fps is (10 sec: 12595.1, 60 sec: 12663.5, 300 sec: 12673.3). Total num frames: 62979072. Throughput: 0: 12663.7. Samples: 62974813. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:21:07,390][62145] Avg episode reward: [(0, '1328.698')] [2023-03-06 22:21:07,916][62475] Updated weights for policy 0, policy_version 61510 (0.0007) [2023-03-06 22:21:08,713][62475] Updated weights for policy 0, policy_version 61520 (0.0006) [2023-03-06 22:21:09,537][62475] Updated weights for policy 0, policy_version 61530 (0.0006) [2023-03-06 22:21:10,338][62475] Updated weights for policy 0, policy_version 61540 (0.0006) [2023-03-06 22:21:11,130][62475] Updated weights for policy 0, policy_version 61550 (0.0006) [2023-03-06 22:21:11,959][62475] Updated weights for policy 0, policy_version 61560 (0.0006) [2023-03-06 22:21:12,390][62145] Fps is (10 sec: 12697.7, 60 sec: 12663.5, 300 sec: 12673.3). Total num frames: 63042560. Throughput: 0: 12669.6. Samples: 63012902. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:21:12,390][62145] Avg episode reward: [(0, '1166.441')] [2023-03-06 22:21:12,771][62475] Updated weights for policy 0, policy_version 61570 (0.0006) [2023-03-06 22:21:13,578][62475] Updated weights for policy 0, policy_version 61580 (0.0006) [2023-03-06 22:21:14,367][62475] Updated weights for policy 0, policy_version 61590 (0.0007) [2023-03-06 22:21:15,157][62475] Updated weights for policy 0, policy_version 61600 (0.0006) [2023-03-06 22:21:15,981][62475] Updated weights for policy 0, policy_version 61610 (0.0006) [2023-03-06 22:21:16,765][62475] Updated weights for policy 0, policy_version 61620 (0.0006) [2023-03-06 22:21:17,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12663.5, 300 sec: 12673.3). Total num frames: 63106048. Throughput: 0: 12673.7. Samples: 63089058. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:21:17,390][62145] Avg episode reward: [(0, '1033.899')] [2023-03-06 22:21:17,575][62475] Updated weights for policy 0, policy_version 61630 (0.0006) [2023-03-06 22:21:18,404][62475] Updated weights for policy 0, policy_version 61640 (0.0006) [2023-03-06 22:21:19,201][62475] Updated weights for policy 0, policy_version 61650 (0.0006) [2023-03-06 22:21:19,990][62475] Updated weights for policy 0, policy_version 61660 (0.0006) [2023-03-06 22:21:20,809][62475] Updated weights for policy 0, policy_version 61670 (0.0008) [2023-03-06 22:21:21,604][62475] Updated weights for policy 0, policy_version 61680 (0.0006) [2023-03-06 22:21:22,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12663.5, 300 sec: 12673.3). Total num frames: 63169536. Throughput: 0: 12686.8. Samples: 63165539. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:21:22,390][62145] Avg episode reward: [(0, '1249.415')] [2023-03-06 22:21:22,395][62475] Updated weights for policy 0, policy_version 61690 (0.0006) [2023-03-06 22:21:23,213][62475] Updated weights for policy 0, policy_version 61700 (0.0006) [2023-03-06 22:21:24,018][62475] Updated weights for policy 0, policy_version 61710 (0.0006) [2023-03-06 22:21:24,818][62475] Updated weights for policy 0, policy_version 61720 (0.0006) [2023-03-06 22:21:25,617][62475] Updated weights for policy 0, policy_version 61730 (0.0007) [2023-03-06 22:21:26,424][62475] Updated weights for policy 0, policy_version 61740 (0.0006) [2023-03-06 22:21:27,234][62475] Updated weights for policy 0, policy_version 61750 (0.0006) [2023-03-06 22:21:27,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12697.6, 300 sec: 12676.8). Total num frames: 63234048. Throughput: 0: 12683.3. Samples: 63203722. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:21:27,390][62145] Avg episode reward: [(0, '868.824')] [2023-03-06 22:21:28,022][62475] Updated weights for policy 0, policy_version 61760 (0.0006) [2023-03-06 22:21:28,837][62475] Updated weights for policy 0, policy_version 61770 (0.0006) [2023-03-06 22:21:29,642][62475] Updated weights for policy 0, policy_version 61780 (0.0006) [2023-03-06 22:21:30,431][62475] Updated weights for policy 0, policy_version 61790 (0.0006) [2023-03-06 22:21:31,255][62475] Updated weights for policy 0, policy_version 61800 (0.0006) [2023-03-06 22:21:32,044][62475] Updated weights for policy 0, policy_version 61810 (0.0006) [2023-03-06 22:21:32,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12697.6, 300 sec: 12680.2). Total num frames: 63297536. Throughput: 0: 12695.0. Samples: 63280144. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:21:32,390][62145] Avg episode reward: [(0, '1247.639')] [2023-03-06 22:21:32,862][62475] Updated weights for policy 0, policy_version 61820 (0.0006) [2023-03-06 22:21:33,652][62475] Updated weights for policy 0, policy_version 61830 (0.0008) [2023-03-06 22:21:34,461][62475] Updated weights for policy 0, policy_version 61840 (0.0006) [2023-03-06 22:21:35,267][62475] Updated weights for policy 0, policy_version 61850 (0.0006) [2023-03-06 22:21:36,076][62475] Updated weights for policy 0, policy_version 61860 (0.0006) [2023-03-06 22:21:36,873][62475] Updated weights for policy 0, policy_version 61870 (0.0006) [2023-03-06 22:21:37,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12676.8). Total num frames: 63361024. Throughput: 0: 12699.4. Samples: 63356432. Policy #0 lag: (min: 0.0, avg: 1.1, max: 4.0) [2023-03-06 22:21:37,390][62145] Avg episode reward: [(0, '940.320')] [2023-03-06 22:21:37,678][62475] Updated weights for policy 0, policy_version 61880 (0.0005) [2023-03-06 22:21:38,492][62475] Updated weights for policy 0, policy_version 61890 (0.0006) [2023-03-06 22:21:39,307][62475] Updated weights for policy 0, policy_version 61900 (0.0006) [2023-03-06 22:21:40,101][62475] Updated weights for policy 0, policy_version 61910 (0.0006) [2023-03-06 22:21:40,904][62475] Updated weights for policy 0, policy_version 61920 (0.0006) [2023-03-06 22:21:41,724][62475] Updated weights for policy 0, policy_version 61930 (0.0006) [2023-03-06 22:21:42,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12680.2). Total num frames: 63424512. Throughput: 0: 12702.4. Samples: 63394650. Policy #0 lag: (min: 0.0, avg: 1.1, max: 4.0) [2023-03-06 22:21:42,390][62145] Avg episode reward: [(0, '995.946')] [2023-03-06 22:21:42,521][62475] Updated weights for policy 0, policy_version 61940 (0.0007) [2023-03-06 22:21:43,315][62475] Updated weights for policy 0, policy_version 61950 (0.0006) [2023-03-06 22:21:44,109][62475] Updated weights for policy 0, policy_version 61960 (0.0006) [2023-03-06 22:21:44,901][62475] Updated weights for policy 0, policy_version 61970 (0.0007) [2023-03-06 22:21:45,730][62475] Updated weights for policy 0, policy_version 61980 (0.0007) [2023-03-06 22:21:46,537][62475] Updated weights for policy 0, policy_version 61990 (0.0006) [2023-03-06 22:21:47,347][62475] Updated weights for policy 0, policy_version 62000 (0.0006) [2023-03-06 22:21:47,390][62145] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12680.2). Total num frames: 63488000. Throughput: 0: 12719.8. Samples: 63471275. Policy #0 lag: (min: 0.0, avg: 1.1, max: 4.0) [2023-03-06 22:21:47,390][62145] Avg episode reward: [(0, '794.395')] [2023-03-06 22:21:48,158][62475] Updated weights for policy 0, policy_version 62010 (0.0006) [2023-03-06 22:21:48,963][62475] Updated weights for policy 0, policy_version 62020 (0.0007) [2023-03-06 22:21:49,784][62475] Updated weights for policy 0, policy_version 62030 (0.0007) [2023-03-06 22:21:50,602][62475] Updated weights for policy 0, policy_version 62040 (0.0006) [2023-03-06 22:21:51,396][62475] Updated weights for policy 0, policy_version 62050 (0.0007) [2023-03-06 22:21:52,219][62475] Updated weights for policy 0, policy_version 62060 (0.0006) [2023-03-06 22:21:52,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12680.2). Total num frames: 63551488. Throughput: 0: 12714.2. Samples: 63546950. Policy #0 lag: (min: 0.0, avg: 1.1, max: 4.0) [2023-03-06 22:21:52,390][62145] Avg episode reward: [(0, '1042.124')] [2023-03-06 22:21:53,014][62475] Updated weights for policy 0, policy_version 62070 (0.0006) [2023-03-06 22:21:53,830][62475] Updated weights for policy 0, policy_version 62080 (0.0006) [2023-03-06 22:21:54,634][62475] Updated weights for policy 0, policy_version 62090 (0.0006) [2023-03-06 22:21:55,438][62475] Updated weights for policy 0, policy_version 62100 (0.0006) [2023-03-06 22:21:56,245][62475] Updated weights for policy 0, policy_version 62110 (0.0006) [2023-03-06 22:21:57,041][62475] Updated weights for policy 0, policy_version 62120 (0.0006) [2023-03-06 22:21:57,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12680.2). Total num frames: 63614976. Throughput: 0: 12712.2. Samples: 63584949. Policy #0 lag: (min: 0.0, avg: 1.1, max: 4.0) [2023-03-06 22:21:57,390][62145] Avg episode reward: [(0, '1046.773')] [2023-03-06 22:21:57,867][62475] Updated weights for policy 0, policy_version 62130 (0.0006) [2023-03-06 22:21:58,674][62475] Updated weights for policy 0, policy_version 62140 (0.0007) [2023-03-06 22:21:59,481][62475] Updated weights for policy 0, policy_version 62150 (0.0006) [2023-03-06 22:22:00,291][62475] Updated weights for policy 0, policy_version 62160 (0.0006) [2023-03-06 22:22:01,091][62475] Updated weights for policy 0, policy_version 62170 (0.0006) [2023-03-06 22:22:01,892][62475] Updated weights for policy 0, policy_version 62180 (0.0006) [2023-03-06 22:22:02,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12680.2). Total num frames: 63678464. Throughput: 0: 12708.2. Samples: 63660927. Policy #0 lag: (min: 0.0, avg: 1.1, max: 4.0) [2023-03-06 22:22:02,390][62145] Avg episode reward: [(0, '1056.293')] [2023-03-06 22:22:02,722][62475] Updated weights for policy 0, policy_version 62190 (0.0006) [2023-03-06 22:22:03,518][62475] Updated weights for policy 0, policy_version 62200 (0.0006) [2023-03-06 22:22:04,320][62475] Updated weights for policy 0, policy_version 62210 (0.0007) [2023-03-06 22:22:05,135][62475] Updated weights for policy 0, policy_version 62220 (0.0006) [2023-03-06 22:22:05,923][62475] Updated weights for policy 0, policy_version 62230 (0.0006) [2023-03-06 22:22:06,724][62475] Updated weights for policy 0, policy_version 62240 (0.0006) [2023-03-06 22:22:07,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12680.2). Total num frames: 63741952. Throughput: 0: 12708.1. Samples: 63737402. Policy #0 lag: (min: 0.0, avg: 1.1, max: 4.0) [2023-03-06 22:22:07,400][62145] Avg episode reward: [(0, '959.626')] [2023-03-06 22:22:07,538][62475] Updated weights for policy 0, policy_version 62250 (0.0006) [2023-03-06 22:22:08,329][62475] Updated weights for policy 0, policy_version 62260 (0.0006) [2023-03-06 22:22:09,143][62475] Updated weights for policy 0, policy_version 62270 (0.0007) [2023-03-06 22:22:09,956][62475] Updated weights for policy 0, policy_version 62280 (0.0006) [2023-03-06 22:22:10,781][62475] Updated weights for policy 0, policy_version 62290 (0.0006) [2023-03-06 22:22:11,584][62475] Updated weights for policy 0, policy_version 62300 (0.0006) [2023-03-06 22:22:12,389][62145] Fps is (10 sec: 12595.3, 60 sec: 12697.6, 300 sec: 12676.8). Total num frames: 63804416. Throughput: 0: 12705.1. Samples: 63775451. Policy #0 lag: (min: 0.0, avg: 1.1, max: 4.0) [2023-03-06 22:22:12,401][62145] Avg episode reward: [(0, '1014.326')] [2023-03-06 22:22:12,408][62475] Updated weights for policy 0, policy_version 62310 (0.0007) [2023-03-06 22:22:13,219][62475] Updated weights for policy 0, policy_version 62320 (0.0006) [2023-03-06 22:22:14,019][62475] Updated weights for policy 0, policy_version 62330 (0.0006) [2023-03-06 22:22:14,837][62475] Updated weights for policy 0, policy_version 62340 (0.0006) [2023-03-06 22:22:15,631][62475] Updated weights for policy 0, policy_version 62350 (0.0006) [2023-03-06 22:22:16,442][62475] Updated weights for policy 0, policy_version 62360 (0.0006) [2023-03-06 22:22:17,241][62475] Updated weights for policy 0, policy_version 62370 (0.0007) [2023-03-06 22:22:17,390][62145] Fps is (10 sec: 12595.2, 60 sec: 12697.6, 300 sec: 12676.8). Total num frames: 63867904. Throughput: 0: 12687.7. Samples: 63851092. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:22:17,401][62145] Avg episode reward: [(0, '1079.191')] [2023-03-06 22:22:18,046][62475] Updated weights for policy 0, policy_version 62380 (0.0006) [2023-03-06 22:22:18,849][62475] Updated weights for policy 0, policy_version 62390 (0.0006) [2023-03-06 22:22:19,655][62475] Updated weights for policy 0, policy_version 62400 (0.0007) [2023-03-06 22:22:20,472][62475] Updated weights for policy 0, policy_version 62410 (0.0006) [2023-03-06 22:22:21,286][62475] Updated weights for policy 0, policy_version 62420 (0.0006) [2023-03-06 22:22:22,101][62475] Updated weights for policy 0, policy_version 62430 (0.0006) [2023-03-06 22:22:22,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12680.2). Total num frames: 63931392. Throughput: 0: 12684.8. Samples: 63927248. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:22:22,401][62145] Avg episode reward: [(0, '934.208')] [2023-03-06 22:22:22,405][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000062433_63931392.pth... [2023-03-06 22:22:22,436][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000059461_60888064.pth [2023-03-06 22:22:22,896][62475] Updated weights for policy 0, policy_version 62440 (0.0006) [2023-03-06 22:22:23,693][62475] Updated weights for policy 0, policy_version 62450 (0.0006) [2023-03-06 22:22:24,500][62475] Updated weights for policy 0, policy_version 62460 (0.0006) [2023-03-06 22:22:25,297][62475] Updated weights for policy 0, policy_version 62470 (0.0006) [2023-03-06 22:22:26,116][62475] Updated weights for policy 0, policy_version 62480 (0.0007) [2023-03-06 22:22:26,929][62475] Updated weights for policy 0, policy_version 62490 (0.0006) [2023-03-06 22:22:27,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12680.5, 300 sec: 12680.2). Total num frames: 63994880. Throughput: 0: 12684.3. Samples: 63965447. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:22:27,401][62145] Avg episode reward: [(0, '904.814')] [2023-03-06 22:22:27,765][62475] Updated weights for policy 0, policy_version 62500 (0.0007) [2023-03-06 22:22:28,589][62475] Updated weights for policy 0, policy_version 62510 (0.0006) [2023-03-06 22:22:29,374][62475] Updated weights for policy 0, policy_version 62520 (0.0006) [2023-03-06 22:22:30,187][62475] Updated weights for policy 0, policy_version 62530 (0.0006) [2023-03-06 22:22:30,991][62475] Updated weights for policy 0, policy_version 62540 (0.0006) [2023-03-06 22:22:31,794][62475] Updated weights for policy 0, policy_version 62550 (0.0007) [2023-03-06 22:22:32,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12680.2). Total num frames: 64058368. Throughput: 0: 12663.9. Samples: 64041149. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:22:32,398][62145] Avg episode reward: [(0, '1070.170')] [2023-03-06 22:22:32,608][62475] Updated weights for policy 0, policy_version 62560 (0.0006) [2023-03-06 22:22:33,439][62475] Updated weights for policy 0, policy_version 62570 (0.0007) [2023-03-06 22:22:34,219][62475] Updated weights for policy 0, policy_version 62580 (0.0006) [2023-03-06 22:22:35,026][62475] Updated weights for policy 0, policy_version 62590 (0.0007) [2023-03-06 22:22:35,840][62475] Updated weights for policy 0, policy_version 62600 (0.0006) [2023-03-06 22:22:36,628][62475] Updated weights for policy 0, policy_version 62610 (0.0007) [2023-03-06 22:22:37,390][62145] Fps is (10 sec: 12697.7, 60 sec: 12680.5, 300 sec: 12680.2). Total num frames: 64121856. Throughput: 0: 12674.7. Samples: 64117312. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:22:37,401][62145] Avg episode reward: [(0, '1020.100')] [2023-03-06 22:22:37,442][62475] Updated weights for policy 0, policy_version 62620 (0.0006) [2023-03-06 22:22:38,241][62475] Updated weights for policy 0, policy_version 62630 (0.0006) [2023-03-06 22:22:39,067][62475] Updated weights for policy 0, policy_version 62640 (0.0006) [2023-03-06 22:22:39,874][62475] Updated weights for policy 0, policy_version 62650 (0.0007) [2023-03-06 22:22:40,679][62475] Updated weights for policy 0, policy_version 62660 (0.0007) [2023-03-06 22:22:41,487][62475] Updated weights for policy 0, policy_version 62670 (0.0006) [2023-03-06 22:22:42,286][62475] Updated weights for policy 0, policy_version 62680 (0.0007) [2023-03-06 22:22:42,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12680.2). Total num frames: 64185344. Throughput: 0: 12677.1. Samples: 64155420. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:22:42,401][62145] Avg episode reward: [(0, '1039.648')] [2023-03-06 22:22:43,081][62475] Updated weights for policy 0, policy_version 62690 (0.0006) [2023-03-06 22:22:43,913][62475] Updated weights for policy 0, policy_version 62700 (0.0007) [2023-03-06 22:22:44,718][62475] Updated weights for policy 0, policy_version 62710 (0.0007) [2023-03-06 22:22:45,530][62475] Updated weights for policy 0, policy_version 62720 (0.0006) [2023-03-06 22:22:46,349][62475] Updated weights for policy 0, policy_version 62730 (0.0006) [2023-03-06 22:22:47,142][62475] Updated weights for policy 0, policy_version 62740 (0.0006) [2023-03-06 22:22:47,390][62145] Fps is (10 sec: 12595.2, 60 sec: 12663.5, 300 sec: 12680.2). Total num frames: 64247808. Throughput: 0: 12676.6. Samples: 64231374. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:22:47,401][62145] Avg episode reward: [(0, '846.958')] [2023-03-06 22:22:47,951][62475] Updated weights for policy 0, policy_version 62750 (0.0006) [2023-03-06 22:22:48,781][62475] Updated weights for policy 0, policy_version 62760 (0.0006) [2023-03-06 22:22:49,577][62475] Updated weights for policy 0, policy_version 62770 (0.0007) [2023-03-06 22:22:50,378][62475] Updated weights for policy 0, policy_version 62780 (0.0006) [2023-03-06 22:22:51,173][62475] Updated weights for policy 0, policy_version 62790 (0.0007) [2023-03-06 22:22:52,001][62475] Updated weights for policy 0, policy_version 62800 (0.0006) [2023-03-06 22:22:52,389][62145] Fps is (10 sec: 12595.2, 60 sec: 12663.5, 300 sec: 12680.2). Total num frames: 64311296. Throughput: 0: 12667.4. Samples: 64307434. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:22:52,401][62145] Avg episode reward: [(0, '993.513')] [2023-03-06 22:22:52,787][62475] Updated weights for policy 0, policy_version 62810 (0.0006) [2023-03-06 22:22:53,624][62475] Updated weights for policy 0, policy_version 62820 (0.0006) [2023-03-06 22:22:54,438][62475] Updated weights for policy 0, policy_version 62830 (0.0006) [2023-03-06 22:22:55,228][62475] Updated weights for policy 0, policy_version 62840 (0.0007) [2023-03-06 22:22:56,045][62475] Updated weights for policy 0, policy_version 62850 (0.0007) [2023-03-06 22:22:56,850][62475] Updated weights for policy 0, policy_version 62860 (0.0006) [2023-03-06 22:22:57,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12663.5, 300 sec: 12676.8). Total num frames: 64374784. Throughput: 0: 12664.2. Samples: 64345339. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:22:57,390][62145] Avg episode reward: [(0, '972.126')] [2023-03-06 22:22:57,643][62475] Updated weights for policy 0, policy_version 62870 (0.0006) [2023-03-06 22:22:58,454][62475] Updated weights for policy 0, policy_version 62880 (0.0006) [2023-03-06 22:22:59,259][62475] Updated weights for policy 0, policy_version 62890 (0.0006) [2023-03-06 22:23:00,061][62475] Updated weights for policy 0, policy_version 62900 (0.0006) [2023-03-06 22:23:00,872][62475] Updated weights for policy 0, policy_version 62910 (0.0006) [2023-03-06 22:23:01,667][62475] Updated weights for policy 0, policy_version 62920 (0.0006) [2023-03-06 22:23:02,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12663.5, 300 sec: 12676.8). Total num frames: 64438272. Throughput: 0: 12679.9. Samples: 64421690. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:23:02,390][62145] Avg episode reward: [(0, '852.409')] [2023-03-06 22:23:02,502][62475] Updated weights for policy 0, policy_version 62930 (0.0006) [2023-03-06 22:23:03,284][62475] Updated weights for policy 0, policy_version 62940 (0.0006) [2023-03-06 22:23:04,086][62475] Updated weights for policy 0, policy_version 62950 (0.0006) [2023-03-06 22:23:04,906][62475] Updated weights for policy 0, policy_version 62960 (0.0005) [2023-03-06 22:23:05,707][62475] Updated weights for policy 0, policy_version 62970 (0.0007) [2023-03-06 22:23:06,503][62475] Updated weights for policy 0, policy_version 62980 (0.0007) [2023-03-06 22:23:07,316][62475] Updated weights for policy 0, policy_version 62990 (0.0006) [2023-03-06 22:23:07,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12663.5, 300 sec: 12680.3). Total num frames: 64501760. Throughput: 0: 12679.1. Samples: 64497806. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:23:07,390][62145] Avg episode reward: [(0, '908.940')] [2023-03-06 22:23:07,869][62424] KL-divergence is very high: 174.4727 [2023-03-06 22:23:08,144][62475] Updated weights for policy 0, policy_version 63000 (0.0007) [2023-03-06 22:23:08,954][62475] Updated weights for policy 0, policy_version 63010 (0.0007) [2023-03-06 22:23:09,767][62475] Updated weights for policy 0, policy_version 63020 (0.0006) [2023-03-06 22:23:10,589][62475] Updated weights for policy 0, policy_version 63030 (0.0007) [2023-03-06 22:23:11,369][62475] Updated weights for policy 0, policy_version 63040 (0.0006) [2023-03-06 22:23:12,183][62475] Updated weights for policy 0, policy_version 63050 (0.0007) [2023-03-06 22:23:12,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12680.5, 300 sec: 12680.2). Total num frames: 64565248. Throughput: 0: 12670.2. Samples: 64535604. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:23:12,390][62145] Avg episode reward: [(0, '970.025')] [2023-03-06 22:23:12,972][62475] Updated weights for policy 0, policy_version 63060 (0.0006) [2023-03-06 22:23:13,778][62475] Updated weights for policy 0, policy_version 63070 (0.0006) [2023-03-06 22:23:14,568][62475] Updated weights for policy 0, policy_version 63080 (0.0006) [2023-03-06 22:23:15,395][62475] Updated weights for policy 0, policy_version 63090 (0.0007) [2023-03-06 22:23:16,199][62475] Updated weights for policy 0, policy_version 63100 (0.0006) [2023-03-06 22:23:17,006][62475] Updated weights for policy 0, policy_version 63110 (0.0006) [2023-03-06 22:23:17,098][62424] KL-divergence is very high: 122.5793 [2023-03-06 22:23:17,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12680.2). Total num frames: 64628736. Throughput: 0: 12684.4. Samples: 64611946. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:23:17,390][62145] Avg episode reward: [(0, '784.572')] [2023-03-06 22:23:17,821][62475] Updated weights for policy 0, policy_version 63120 (0.0007) [2023-03-06 22:23:18,618][62475] Updated weights for policy 0, policy_version 63130 (0.0006) [2023-03-06 22:23:19,426][62475] Updated weights for policy 0, policy_version 63140 (0.0007) [2023-03-06 22:23:20,247][62475] Updated weights for policy 0, policy_version 63150 (0.0006) [2023-03-06 22:23:21,058][62475] Updated weights for policy 0, policy_version 63160 (0.0006) [2023-03-06 22:23:21,851][62475] Updated weights for policy 0, policy_version 63170 (0.0007) [2023-03-06 22:23:22,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12680.2). Total num frames: 64692224. Throughput: 0: 12682.8. Samples: 64688040. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:23:22,390][62145] Avg episode reward: [(0, '1074.635')] [2023-03-06 22:23:22,651][62475] Updated weights for policy 0, policy_version 63180 (0.0007) [2023-03-06 22:23:23,457][62475] Updated weights for policy 0, policy_version 63190 (0.0006) [2023-03-06 22:23:24,258][62475] Updated weights for policy 0, policy_version 63200 (0.0007) [2023-03-06 22:23:25,085][62475] Updated weights for policy 0, policy_version 63210 (0.0006) [2023-03-06 22:23:25,884][62475] Updated weights for policy 0, policy_version 63220 (0.0006) [2023-03-06 22:23:26,694][62475] Updated weights for policy 0, policy_version 63230 (0.0007) [2023-03-06 22:23:27,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12680.5, 300 sec: 12680.2). Total num frames: 64755712. Throughput: 0: 12683.5. Samples: 64726176. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:23:27,390][62145] Avg episode reward: [(0, '804.641')] [2023-03-06 22:23:27,483][62475] Updated weights for policy 0, policy_version 63240 (0.0006) [2023-03-06 22:23:28,327][62475] Updated weights for policy 0, policy_version 63250 (0.0006) [2023-03-06 22:23:29,114][62475] Updated weights for policy 0, policy_version 63260 (0.0006) [2023-03-06 22:23:29,917][62475] Updated weights for policy 0, policy_version 63270 (0.0006) [2023-03-06 22:23:30,743][62475] Updated weights for policy 0, policy_version 63280 (0.0006) [2023-03-06 22:23:31,543][62475] Updated weights for policy 0, policy_version 63290 (0.0006) [2023-03-06 22:23:32,354][62475] Updated weights for policy 0, policy_version 63300 (0.0007) [2023-03-06 22:23:32,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12680.5, 300 sec: 12680.2). Total num frames: 64819200. Throughput: 0: 12681.8. Samples: 64802054. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:23:32,390][62145] Avg episode reward: [(0, '939.726')] [2023-03-06 22:23:33,165][62475] Updated weights for policy 0, policy_version 63310 (0.0006) [2023-03-06 22:23:33,985][62475] Updated weights for policy 0, policy_version 63320 (0.0006) [2023-03-06 22:23:34,784][62475] Updated weights for policy 0, policy_version 63330 (0.0007) [2023-03-06 22:23:35,597][62475] Updated weights for policy 0, policy_version 63340 (0.0006) [2023-03-06 22:23:36,406][62475] Updated weights for policy 0, policy_version 63350 (0.0006) [2023-03-06 22:23:37,216][62475] Updated weights for policy 0, policy_version 63360 (0.0007) [2023-03-06 22:23:37,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12680.6, 300 sec: 12680.2). Total num frames: 64882688. Throughput: 0: 12685.2. Samples: 64878266. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:23:37,390][62145] Avg episode reward: [(0, '784.732')] [2023-03-06 22:23:38,018][62475] Updated weights for policy 0, policy_version 63370 (0.0006) [2023-03-06 22:23:38,817][62475] Updated weights for policy 0, policy_version 63380 (0.0006) [2023-03-06 22:23:39,629][62475] Updated weights for policy 0, policy_version 63390 (0.0006) [2023-03-06 22:23:40,431][62475] Updated weights for policy 0, policy_version 63400 (0.0006) [2023-03-06 22:23:41,239][62475] Updated weights for policy 0, policy_version 63410 (0.0006) [2023-03-06 22:23:42,041][62475] Updated weights for policy 0, policy_version 63420 (0.0006) [2023-03-06 22:23:42,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12680.5, 300 sec: 12680.2). Total num frames: 64946176. Throughput: 0: 12685.1. Samples: 64916170. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:23:42,390][62145] Avg episode reward: [(0, '1009.038')] [2023-03-06 22:23:42,863][62475] Updated weights for policy 0, policy_version 63430 (0.0006) [2023-03-06 22:23:43,665][62475] Updated weights for policy 0, policy_version 63440 (0.0007) [2023-03-06 22:23:44,484][62475] Updated weights for policy 0, policy_version 63450 (0.0006) [2023-03-06 22:23:45,287][62475] Updated weights for policy 0, policy_version 63460 (0.0006) [2023-03-06 22:23:46,100][62475] Updated weights for policy 0, policy_version 63470 (0.0006) [2023-03-06 22:23:46,892][62475] Updated weights for policy 0, policy_version 63480 (0.0006) [2023-03-06 22:23:47,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12680.2). Total num frames: 65009664. Throughput: 0: 12679.9. Samples: 64992284. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:23:47,390][62145] Avg episode reward: [(0, '856.119')] [2023-03-06 22:23:47,712][62475] Updated weights for policy 0, policy_version 63490 (0.0007) [2023-03-06 22:23:48,539][62475] Updated weights for policy 0, policy_version 63500 (0.0006) [2023-03-06 22:23:49,335][62475] Updated weights for policy 0, policy_version 63510 (0.0006) [2023-03-06 22:23:50,169][62475] Updated weights for policy 0, policy_version 63520 (0.0006) [2023-03-06 22:23:50,980][62475] Updated weights for policy 0, policy_version 63530 (0.0006) [2023-03-06 22:23:51,802][62475] Updated weights for policy 0, policy_version 63540 (0.0006) [2023-03-06 22:23:52,390][62145] Fps is (10 sec: 12595.1, 60 sec: 12680.5, 300 sec: 12676.8). Total num frames: 65072128. Throughput: 0: 12665.8. Samples: 65067767. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:23:52,390][62145] Avg episode reward: [(0, '939.734')] [2023-03-06 22:23:52,598][62475] Updated weights for policy 0, policy_version 63550 (0.0006) [2023-03-06 22:23:53,430][62475] Updated weights for policy 0, policy_version 63560 (0.0007) [2023-03-06 22:23:54,205][62475] Updated weights for policy 0, policy_version 63570 (0.0006) [2023-03-06 22:23:55,012][62475] Updated weights for policy 0, policy_version 63580 (0.0007) [2023-03-06 22:23:55,826][62475] Updated weights for policy 0, policy_version 63590 (0.0006) [2023-03-06 22:23:56,613][62475] Updated weights for policy 0, policy_version 63600 (0.0007) [2023-03-06 22:23:56,764][62424] KL-divergence is very high: 270.9950 [2023-03-06 22:23:57,389][62145] Fps is (10 sec: 12595.2, 60 sec: 12680.5, 300 sec: 12676.8). Total num frames: 65135616. Throughput: 0: 12671.5. Samples: 65105821. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:23:57,390][62145] Avg episode reward: [(0, '890.658')] [2023-03-06 22:23:57,430][62475] Updated weights for policy 0, policy_version 63610 (0.0006) [2023-03-06 22:23:58,246][62475] Updated weights for policy 0, policy_version 63620 (0.0006) [2023-03-06 22:23:59,044][62475] Updated weights for policy 0, policy_version 63630 (0.0007) [2023-03-06 22:23:59,849][62475] Updated weights for policy 0, policy_version 63640 (0.0006) [2023-03-06 22:24:00,669][62475] Updated weights for policy 0, policy_version 63650 (0.0007) [2023-03-06 22:24:01,474][62475] Updated weights for policy 0, policy_version 63660 (0.0006) [2023-03-06 22:24:02,287][62475] Updated weights for policy 0, policy_version 63670 (0.0008) [2023-03-06 22:24:02,390][62145] Fps is (10 sec: 12697.7, 60 sec: 12680.5, 300 sec: 12676.8). Total num frames: 65199104. Throughput: 0: 12667.9. Samples: 65182001. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:24:02,390][62145] Avg episode reward: [(0, '943.467')] [2023-03-06 22:24:03,081][62475] Updated weights for policy 0, policy_version 63680 (0.0006) [2023-03-06 22:24:03,906][62475] Updated weights for policy 0, policy_version 63690 (0.0007) [2023-03-06 22:24:04,705][62475] Updated weights for policy 0, policy_version 63700 (0.0006) [2023-03-06 22:24:05,521][62475] Updated weights for policy 0, policy_version 63710 (0.0006) [2023-03-06 22:24:06,353][62475] Updated weights for policy 0, policy_version 63720 (0.0006) [2023-03-06 22:24:07,151][62475] Updated weights for policy 0, policy_version 63730 (0.0006) [2023-03-06 22:24:07,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12676.8). Total num frames: 65262592. Throughput: 0: 12661.0. Samples: 65257783. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:24:07,390][62145] Avg episode reward: [(0, '822.528')] [2023-03-06 22:24:07,942][62475] Updated weights for policy 0, policy_version 63740 (0.0007) [2023-03-06 22:24:08,759][62475] Updated weights for policy 0, policy_version 63750 (0.0006) [2023-03-06 22:24:09,569][62475] Updated weights for policy 0, policy_version 63760 (0.0006) [2023-03-06 22:24:10,355][62475] Updated weights for policy 0, policy_version 63770 (0.0006) [2023-03-06 22:24:11,169][62475] Updated weights for policy 0, policy_version 63780 (0.0006) [2023-03-06 22:24:11,979][62475] Updated weights for policy 0, policy_version 63790 (0.0006) [2023-03-06 22:24:12,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12680.2). Total num frames: 65326080. Throughput: 0: 12661.9. Samples: 65295962. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:24:12,390][62145] Avg episode reward: [(0, '859.018')] [2023-03-06 22:24:12,788][62475] Updated weights for policy 0, policy_version 63800 (0.0007) [2023-03-06 22:24:13,621][62475] Updated weights for policy 0, policy_version 63810 (0.0006) [2023-03-06 22:24:14,422][62475] Updated weights for policy 0, policy_version 63820 (0.0007) [2023-03-06 22:24:15,218][62475] Updated weights for policy 0, policy_version 63830 (0.0006) [2023-03-06 22:24:16,040][62475] Updated weights for policy 0, policy_version 63840 (0.0006) [2023-03-06 22:24:16,834][62475] Updated weights for policy 0, policy_version 63850 (0.0006) [2023-03-06 22:24:17,389][62145] Fps is (10 sec: 12595.2, 60 sec: 12663.5, 300 sec: 12676.8). Total num frames: 65388544. Throughput: 0: 12664.4. Samples: 65371952. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:24:17,390][62145] Avg episode reward: [(0, '724.961')] [2023-03-06 22:24:17,638][62475] Updated weights for policy 0, policy_version 63860 (0.0007) [2023-03-06 22:24:18,441][62475] Updated weights for policy 0, policy_version 63870 (0.0007) [2023-03-06 22:24:19,230][62475] Updated weights for policy 0, policy_version 63880 (0.0006) [2023-03-06 22:24:20,037][62475] Updated weights for policy 0, policy_version 63890 (0.0006) [2023-03-06 22:24:20,850][62475] Updated weights for policy 0, policy_version 63900 (0.0005) [2023-03-06 22:24:21,678][62475] Updated weights for policy 0, policy_version 63910 (0.0006) [2023-03-06 22:24:22,390][62145] Fps is (10 sec: 12595.1, 60 sec: 12663.5, 300 sec: 12676.8). Total num frames: 65452032. Throughput: 0: 12662.1. Samples: 65448063. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:24:22,390][62145] Avg episode reward: [(0, '794.515')] [2023-03-06 22:24:22,403][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000063919_65453056.pth... [2023-03-06 22:24:22,433][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000060947_62409728.pth [2023-03-06 22:24:22,486][62475] Updated weights for policy 0, policy_version 63920 (0.0006) [2023-03-06 22:24:23,306][62475] Updated weights for policy 0, policy_version 63930 (0.0007) [2023-03-06 22:24:24,116][62475] Updated weights for policy 0, policy_version 63940 (0.0006) [2023-03-06 22:24:24,914][62475] Updated weights for policy 0, policy_version 63950 (0.0006) [2023-03-06 22:24:25,724][62475] Updated weights for policy 0, policy_version 63960 (0.0006) [2023-03-06 22:24:26,531][62475] Updated weights for policy 0, policy_version 63970 (0.0006) [2023-03-06 22:24:27,340][62475] Updated weights for policy 0, policy_version 63980 (0.0006) [2023-03-06 22:24:27,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12663.5, 300 sec: 12676.8). Total num frames: 65515520. Throughput: 0: 12663.5. Samples: 65486028. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:24:27,390][62145] Avg episode reward: [(0, '695.813')] [2023-03-06 22:24:28,138][62475] Updated weights for policy 0, policy_version 63990 (0.0007) [2023-03-06 22:24:28,972][62475] Updated weights for policy 0, policy_version 64000 (0.0008) [2023-03-06 22:24:29,755][62475] Updated weights for policy 0, policy_version 64010 (0.0006) [2023-03-06 22:24:30,566][62475] Updated weights for policy 0, policy_version 64020 (0.0006) [2023-03-06 22:24:31,379][62475] Updated weights for policy 0, policy_version 64030 (0.0007) [2023-03-06 22:24:32,177][62475] Updated weights for policy 0, policy_version 64040 (0.0006) [2023-03-06 22:24:32,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12663.5, 300 sec: 12676.8). Total num frames: 65579008. Throughput: 0: 12660.8. Samples: 65562018. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:24:32,400][62145] Avg episode reward: [(0, '917.774')] [2023-03-06 22:24:32,968][62475] Updated weights for policy 0, policy_version 64050 (0.0006) [2023-03-06 22:24:33,786][62475] Updated weights for policy 0, policy_version 64060 (0.0006) [2023-03-06 22:24:34,589][62475] Updated weights for policy 0, policy_version 64070 (0.0006) [2023-03-06 22:24:35,407][62475] Updated weights for policy 0, policy_version 64080 (0.0007) [2023-03-06 22:24:36,187][62475] Updated weights for policy 0, policy_version 64090 (0.0006) [2023-03-06 22:24:37,002][62475] Updated weights for policy 0, policy_version 64100 (0.0006) [2023-03-06 22:24:37,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12663.4, 300 sec: 12676.8). Total num frames: 65642496. Throughput: 0: 12681.4. Samples: 65638428. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:24:37,401][62145] Avg episode reward: [(0, '767.985')] [2023-03-06 22:24:37,797][62475] Updated weights for policy 0, policy_version 64110 (0.0007) [2023-03-06 22:24:38,625][62475] Updated weights for policy 0, policy_version 64120 (0.0006) [2023-03-06 22:24:39,426][62475] Updated weights for policy 0, policy_version 64130 (0.0006) [2023-03-06 22:24:40,237][62475] Updated weights for policy 0, policy_version 64140 (0.0006) [2023-03-06 22:24:41,032][62475] Updated weights for policy 0, policy_version 64150 (0.0006) [2023-03-06 22:24:41,855][62475] Updated weights for policy 0, policy_version 64160 (0.0007) [2023-03-06 22:24:42,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12663.5, 300 sec: 12676.8). Total num frames: 65705984. Throughput: 0: 12680.5. Samples: 65676444. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:24:42,400][62145] Avg episode reward: [(0, '935.136')] [2023-03-06 22:24:42,662][62475] Updated weights for policy 0, policy_version 64170 (0.0006) [2023-03-06 22:24:43,482][62475] Updated weights for policy 0, policy_version 64180 (0.0006) [2023-03-06 22:24:44,265][62475] Updated weights for policy 0, policy_version 64190 (0.0006) [2023-03-06 22:24:45,060][62475] Updated weights for policy 0, policy_version 64200 (0.0007) [2023-03-06 22:24:45,868][62475] Updated weights for policy 0, policy_version 64210 (0.0006) [2023-03-06 22:24:46,701][62475] Updated weights for policy 0, policy_version 64220 (0.0007) [2023-03-06 22:24:47,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12663.5, 300 sec: 12676.8). Total num frames: 65769472. Throughput: 0: 12685.7. Samples: 65752857. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:24:47,390][62145] Avg episode reward: [(0, '1032.605')] [2023-03-06 22:24:47,493][62475] Updated weights for policy 0, policy_version 64230 (0.0007) [2023-03-06 22:24:48,288][62475] Updated weights for policy 0, policy_version 64240 (0.0006) [2023-03-06 22:24:49,113][62475] Updated weights for policy 0, policy_version 64250 (0.0006) [2023-03-06 22:24:49,904][62475] Updated weights for policy 0, policy_version 64260 (0.0006) [2023-03-06 22:24:50,722][62475] Updated weights for policy 0, policy_version 64270 (0.0006) [2023-03-06 22:24:51,529][62475] Updated weights for policy 0, policy_version 64280 (0.0006) [2023-03-06 22:24:52,338][62475] Updated weights for policy 0, policy_version 64290 (0.0006) [2023-03-06 22:24:52,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12680.5, 300 sec: 12680.2). Total num frames: 65832960. Throughput: 0: 12691.2. Samples: 65828886. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:24:52,390][62145] Avg episode reward: [(0, '904.445')] [2023-03-06 22:24:53,134][62475] Updated weights for policy 0, policy_version 64300 (0.0007) [2023-03-06 22:24:53,933][62475] Updated weights for policy 0, policy_version 64310 (0.0006) [2023-03-06 22:24:54,754][62475] Updated weights for policy 0, policy_version 64320 (0.0006) [2023-03-06 22:24:55,566][62475] Updated weights for policy 0, policy_version 64330 (0.0006) [2023-03-06 22:24:56,355][62475] Updated weights for policy 0, policy_version 64340 (0.0007) [2023-03-06 22:24:57,166][62475] Updated weights for policy 0, policy_version 64350 (0.0007) [2023-03-06 22:24:57,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12680.5, 300 sec: 12680.2). Total num frames: 65896448. Throughput: 0: 12690.4. Samples: 65867028. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:24:57,390][62145] Avg episode reward: [(0, '962.199')] [2023-03-06 22:24:57,963][62475] Updated weights for policy 0, policy_version 64360 (0.0006) [2023-03-06 22:24:58,757][62475] Updated weights for policy 0, policy_version 64370 (0.0006) [2023-03-06 22:24:59,577][62475] Updated weights for policy 0, policy_version 64380 (0.0006) [2023-03-06 22:25:00,384][62475] Updated weights for policy 0, policy_version 64390 (0.0006) [2023-03-06 22:25:01,179][62475] Updated weights for policy 0, policy_version 64400 (0.0007) [2023-03-06 22:25:01,991][62475] Updated weights for policy 0, policy_version 64410 (0.0006) [2023-03-06 22:25:02,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12680.2). Total num frames: 65959936. Throughput: 0: 12697.5. Samples: 65943339. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:25:02,390][62145] Avg episode reward: [(0, '680.383')] [2023-03-06 22:25:02,790][62475] Updated weights for policy 0, policy_version 64420 (0.0006) [2023-03-06 22:25:03,598][62475] Updated weights for policy 0, policy_version 64430 (0.0006) [2023-03-06 22:25:04,404][62475] Updated weights for policy 0, policy_version 64440 (0.0007) [2023-03-06 22:25:05,215][62475] Updated weights for policy 0, policy_version 64450 (0.0006) [2023-03-06 22:25:06,025][62475] Updated weights for policy 0, policy_version 64460 (0.0006) [2023-03-06 22:25:06,832][62475] Updated weights for policy 0, policy_version 64470 (0.0006) [2023-03-06 22:25:07,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12680.2). Total num frames: 66023424. Throughput: 0: 12696.7. Samples: 66019413. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:25:07,390][62145] Avg episode reward: [(0, '653.138')] [2023-03-06 22:25:07,651][62475] Updated weights for policy 0, policy_version 64480 (0.0005) [2023-03-06 22:25:08,434][62475] Updated weights for policy 0, policy_version 64490 (0.0007) [2023-03-06 22:25:09,260][62475] Updated weights for policy 0, policy_version 64500 (0.0006) [2023-03-06 22:25:10,068][62475] Updated weights for policy 0, policy_version 64510 (0.0005) [2023-03-06 22:25:10,882][62475] Updated weights for policy 0, policy_version 64520 (0.0006) [2023-03-06 22:25:11,676][62475] Updated weights for policy 0, policy_version 64530 (0.0007) [2023-03-06 22:25:12,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12680.2). Total num frames: 66086912. Throughput: 0: 12700.3. Samples: 66057543. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:25:12,390][62145] Avg episode reward: [(0, '890.816')] [2023-03-06 22:25:12,497][62475] Updated weights for policy 0, policy_version 64540 (0.0006) [2023-03-06 22:25:13,310][62475] Updated weights for policy 0, policy_version 64550 (0.0006) [2023-03-06 22:25:14,126][62475] Updated weights for policy 0, policy_version 64560 (0.0006) [2023-03-06 22:25:14,923][62475] Updated weights for policy 0, policy_version 64570 (0.0006) [2023-03-06 22:25:15,740][62475] Updated weights for policy 0, policy_version 64580 (0.0007) [2023-03-06 22:25:16,553][62475] Updated weights for policy 0, policy_version 64590 (0.0006) [2023-03-06 22:25:17,351][62475] Updated weights for policy 0, policy_version 64600 (0.0006) [2023-03-06 22:25:17,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12680.2). Total num frames: 66150400. Throughput: 0: 12697.6. Samples: 66133412. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:25:17,390][62145] Avg episode reward: [(0, '1007.110')] [2023-03-06 22:25:18,163][62475] Updated weights for policy 0, policy_version 64610 (0.0006) [2023-03-06 22:25:18,974][62475] Updated weights for policy 0, policy_version 64620 (0.0006) [2023-03-06 22:25:19,751][62475] Updated weights for policy 0, policy_version 64630 (0.0007) [2023-03-06 22:25:20,561][62475] Updated weights for policy 0, policy_version 64640 (0.0006) [2023-03-06 22:25:21,367][62475] Updated weights for policy 0, policy_version 64650 (0.0006) [2023-03-06 22:25:22,168][62475] Updated weights for policy 0, policy_version 64660 (0.0006) [2023-03-06 22:25:22,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12683.7). Total num frames: 66213888. Throughput: 0: 12697.0. Samples: 66209791. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:25:22,390][62145] Avg episode reward: [(0, '953.982')] [2023-03-06 22:25:22,969][62475] Updated weights for policy 0, policy_version 64670 (0.0007) [2023-03-06 22:25:23,778][62475] Updated weights for policy 0, policy_version 64680 (0.0006) [2023-03-06 22:25:24,597][62475] Updated weights for policy 0, policy_version 64690 (0.0006) [2023-03-06 22:25:25,385][62475] Updated weights for policy 0, policy_version 64700 (0.0006) [2023-03-06 22:25:26,212][62475] Updated weights for policy 0, policy_version 64710 (0.0007) [2023-03-06 22:25:27,016][62475] Updated weights for policy 0, policy_version 64720 (0.0006) [2023-03-06 22:25:27,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12683.7). Total num frames: 66277376. Throughput: 0: 12699.7. Samples: 66247930. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:25:27,390][62145] Avg episode reward: [(0, '678.012')] [2023-03-06 22:25:27,799][62475] Updated weights for policy 0, policy_version 64730 (0.0006) [2023-03-06 22:25:28,625][62475] Updated weights for policy 0, policy_version 64740 (0.0006) [2023-03-06 22:25:29,430][62475] Updated weights for policy 0, policy_version 64750 (0.0006) [2023-03-06 22:25:30,242][62475] Updated weights for policy 0, policy_version 64760 (0.0006) [2023-03-06 22:25:31,056][62475] Updated weights for policy 0, policy_version 64770 (0.0006) [2023-03-06 22:25:31,856][62475] Updated weights for policy 0, policy_version 64780 (0.0006) [2023-03-06 22:25:32,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12683.7). Total num frames: 66340864. Throughput: 0: 12689.5. Samples: 66323886. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:25:32,390][62145] Avg episode reward: [(0, '762.483')] [2023-03-06 22:25:32,666][62475] Updated weights for policy 0, policy_version 64790 (0.0006) [2023-03-06 22:25:33,486][62475] Updated weights for policy 0, policy_version 64800 (0.0006) [2023-03-06 22:25:34,286][62475] Updated weights for policy 0, policy_version 64810 (0.0006) [2023-03-06 22:25:35,111][62475] Updated weights for policy 0, policy_version 64820 (0.0006) [2023-03-06 22:25:35,921][62475] Updated weights for policy 0, policy_version 64830 (0.0006) [2023-03-06 22:25:36,709][62475] Updated weights for policy 0, policy_version 64840 (0.0006) [2023-03-06 22:25:37,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12683.7). Total num frames: 66404352. Throughput: 0: 12690.8. Samples: 66399972. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:25:37,390][62145] Avg episode reward: [(0, '752.403')] [2023-03-06 22:25:37,518][62475] Updated weights for policy 0, policy_version 64850 (0.0007) [2023-03-06 22:25:38,314][62475] Updated weights for policy 0, policy_version 64860 (0.0006) [2023-03-06 22:25:39,091][62475] Updated weights for policy 0, policy_version 64870 (0.0006) [2023-03-06 22:25:39,927][62475] Updated weights for policy 0, policy_version 64880 (0.0006) [2023-03-06 22:25:40,714][62475] Updated weights for policy 0, policy_version 64890 (0.0006) [2023-03-06 22:25:41,509][62475] Updated weights for policy 0, policy_version 64900 (0.0006) [2023-03-06 22:25:42,306][62475] Updated weights for policy 0, policy_version 64910 (0.0006) [2023-03-06 22:25:42,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12683.7). Total num frames: 66467840. Throughput: 0: 12695.3. Samples: 66438314. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:25:42,390][62145] Avg episode reward: [(0, '698.777')] [2023-03-06 22:25:43,109][62475] Updated weights for policy 0, policy_version 64920 (0.0006) [2023-03-06 22:25:43,891][62475] Updated weights for policy 0, policy_version 64930 (0.0006) [2023-03-06 22:25:44,719][62475] Updated weights for policy 0, policy_version 64940 (0.0007) [2023-03-06 22:25:45,525][62475] Updated weights for policy 0, policy_version 64950 (0.0007) [2023-03-06 22:25:46,341][62475] Updated weights for policy 0, policy_version 64960 (0.0007) [2023-03-06 22:25:47,147][62475] Updated weights for policy 0, policy_version 64970 (0.0006) [2023-03-06 22:25:47,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12714.7, 300 sec: 12687.2). Total num frames: 66532352. Throughput: 0: 12703.8. Samples: 66515008. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:25:47,390][62145] Avg episode reward: [(0, '813.113')] [2023-03-06 22:25:47,954][62475] Updated weights for policy 0, policy_version 64980 (0.0005) [2023-03-06 22:25:48,760][62475] Updated weights for policy 0, policy_version 64990 (0.0007) [2023-03-06 22:25:49,565][62475] Updated weights for policy 0, policy_version 65000 (0.0006) [2023-03-06 22:25:50,365][62475] Updated weights for policy 0, policy_version 65010 (0.0006) [2023-03-06 22:25:51,160][62475] Updated weights for policy 0, policy_version 65020 (0.0006) [2023-03-06 22:25:51,964][62475] Updated weights for policy 0, policy_version 65030 (0.0006) [2023-03-06 22:25:52,390][62145] Fps is (10 sec: 12799.8, 60 sec: 12714.7, 300 sec: 12687.2). Total num frames: 66595840. Throughput: 0: 12707.4. Samples: 66591247. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:25:52,390][62145] Avg episode reward: [(0, '853.251')] [2023-03-06 22:25:52,764][62475] Updated weights for policy 0, policy_version 65040 (0.0006) [2023-03-06 22:25:53,551][62475] Updated weights for policy 0, policy_version 65050 (0.0006) [2023-03-06 22:25:54,368][62475] Updated weights for policy 0, policy_version 65060 (0.0006) [2023-03-06 22:25:55,167][62475] Updated weights for policy 0, policy_version 65070 (0.0006) [2023-03-06 22:25:55,962][62475] Updated weights for policy 0, policy_version 65080 (0.0006) [2023-03-06 22:25:56,761][62475] Updated weights for policy 0, policy_version 65090 (0.0006) [2023-03-06 22:25:57,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12690.7). Total num frames: 66659328. Throughput: 0: 12713.4. Samples: 66629643. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:25:57,390][62145] Avg episode reward: [(0, '785.477')] [2023-03-06 22:25:57,557][62475] Updated weights for policy 0, policy_version 65100 (0.0006) [2023-03-06 22:25:58,377][62475] Updated weights for policy 0, policy_version 65110 (0.0006) [2023-03-06 22:25:59,175][62475] Updated weights for policy 0, policy_version 65120 (0.0006) [2023-03-06 22:25:59,980][62475] Updated weights for policy 0, policy_version 65130 (0.0006) [2023-03-06 22:26:00,769][62475] Updated weights for policy 0, policy_version 65140 (0.0007) [2023-03-06 22:26:01,608][62475] Updated weights for policy 0, policy_version 65150 (0.0006) [2023-03-06 22:26:02,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12690.7). Total num frames: 66722816. Throughput: 0: 12731.4. Samples: 66706323. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:26:02,390][62145] Avg episode reward: [(0, '670.054')] [2023-03-06 22:26:02,409][62475] Updated weights for policy 0, policy_version 65160 (0.0007) [2023-03-06 22:26:03,205][62475] Updated weights for policy 0, policy_version 65170 (0.0006) [2023-03-06 22:26:04,018][62475] Updated weights for policy 0, policy_version 65180 (0.0006) [2023-03-06 22:26:04,817][62475] Updated weights for policy 0, policy_version 65190 (0.0006) [2023-03-06 22:26:05,600][62475] Updated weights for policy 0, policy_version 65200 (0.0006) [2023-03-06 22:26:06,417][62475] Updated weights for policy 0, policy_version 65210 (0.0006) [2023-03-06 22:26:07,223][62475] Updated weights for policy 0, policy_version 65220 (0.0007) [2023-03-06 22:26:07,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12731.7, 300 sec: 12694.1). Total num frames: 66787328. Throughput: 0: 12728.1. Samples: 66782556. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:26:07,390][62145] Avg episode reward: [(0, '689.455')] [2023-03-06 22:26:08,028][62475] Updated weights for policy 0, policy_version 65230 (0.0007) [2023-03-06 22:26:08,839][62475] Updated weights for policy 0, policy_version 65240 (0.0006) [2023-03-06 22:26:09,650][62475] Updated weights for policy 0, policy_version 65250 (0.0007) [2023-03-06 22:26:10,469][62475] Updated weights for policy 0, policy_version 65260 (0.0007) [2023-03-06 22:26:11,278][62475] Updated weights for policy 0, policy_version 65270 (0.0006) [2023-03-06 22:26:12,081][62475] Updated weights for policy 0, policy_version 65280 (0.0006) [2023-03-06 22:26:12,390][62145] Fps is (10 sec: 12697.4, 60 sec: 12714.6, 300 sec: 12690.7). Total num frames: 66849792. Throughput: 0: 12723.3. Samples: 66820482. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:26:12,390][62145] Avg episode reward: [(0, '769.734')] [2023-03-06 22:26:12,887][62475] Updated weights for policy 0, policy_version 65290 (0.0006) [2023-03-06 22:26:13,710][62475] Updated weights for policy 0, policy_version 65300 (0.0006) [2023-03-06 22:26:14,511][62475] Updated weights for policy 0, policy_version 65310 (0.0007) [2023-03-06 22:26:15,320][62475] Updated weights for policy 0, policy_version 65320 (0.0006) [2023-03-06 22:26:16,109][62475] Updated weights for policy 0, policy_version 65330 (0.0007) [2023-03-06 22:26:16,918][62475] Updated weights for policy 0, policy_version 65340 (0.0006) [2023-03-06 22:26:17,390][62145] Fps is (10 sec: 12595.2, 60 sec: 12714.7, 300 sec: 12690.7). Total num frames: 66913280. Throughput: 0: 12727.5. Samples: 66896625. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:26:17,390][62145] Avg episode reward: [(0, '675.787')] [2023-03-06 22:26:17,718][62475] Updated weights for policy 0, policy_version 65350 (0.0006) [2023-03-06 22:26:18,521][62475] Updated weights for policy 0, policy_version 65360 (0.0006) [2023-03-06 22:26:19,324][62475] Updated weights for policy 0, policy_version 65370 (0.0006) [2023-03-06 22:26:20,133][62475] Updated weights for policy 0, policy_version 65380 (0.0006) [2023-03-06 22:26:20,917][62475] Updated weights for policy 0, policy_version 65390 (0.0006) [2023-03-06 22:26:21,739][62475] Updated weights for policy 0, policy_version 65400 (0.0006) [2023-03-06 22:26:22,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12731.7, 300 sec: 12690.7). Total num frames: 66977792. Throughput: 0: 12736.9. Samples: 66973132. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:26:22,390][62145] Avg episode reward: [(0, '717.385')] [2023-03-06 22:26:22,393][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000065408_66977792.pth... [2023-03-06 22:26:22,422][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000062433_63931392.pth [2023-03-06 22:26:22,534][62475] Updated weights for policy 0, policy_version 65410 (0.0007) [2023-03-06 22:26:23,361][62475] Updated weights for policy 0, policy_version 65420 (0.0006) [2023-03-06 22:26:24,158][62475] Updated weights for policy 0, policy_version 65430 (0.0006) [2023-03-06 22:26:24,952][62475] Updated weights for policy 0, policy_version 65440 (0.0007) [2023-03-06 22:26:25,772][62475] Updated weights for policy 0, policy_version 65450 (0.0006) [2023-03-06 22:26:26,581][62475] Updated weights for policy 0, policy_version 65460 (0.0006) [2023-03-06 22:26:27,373][62475] Updated weights for policy 0, policy_version 65470 (0.0006) [2023-03-06 22:26:27,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12690.7). Total num frames: 67041280. Throughput: 0: 12728.0. Samples: 67011073. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:26:27,390][62145] Avg episode reward: [(0, '659.413')] [2023-03-06 22:26:28,192][62475] Updated weights for policy 0, policy_version 65480 (0.0006) [2023-03-06 22:26:29,001][62475] Updated weights for policy 0, policy_version 65490 (0.0006) [2023-03-06 22:26:29,819][62475] Updated weights for policy 0, policy_version 65500 (0.0006) [2023-03-06 22:26:30,618][62475] Updated weights for policy 0, policy_version 65510 (0.0007) [2023-03-06 22:26:31,435][62475] Updated weights for policy 0, policy_version 65520 (0.0006) [2023-03-06 22:26:32,247][62475] Updated weights for policy 0, policy_version 65530 (0.0007) [2023-03-06 22:26:32,390][62145] Fps is (10 sec: 12595.1, 60 sec: 12714.7, 300 sec: 12687.2). Total num frames: 67103744. Throughput: 0: 12717.7. Samples: 67087307. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:26:32,390][62145] Avg episode reward: [(0, '752.415')] [2023-03-06 22:26:33,038][62475] Updated weights for policy 0, policy_version 65540 (0.0006) [2023-03-06 22:26:33,834][62475] Updated weights for policy 0, policy_version 65550 (0.0006) [2023-03-06 22:26:34,668][62475] Updated weights for policy 0, policy_version 65560 (0.0006) [2023-03-06 22:26:35,463][62475] Updated weights for policy 0, policy_version 65570 (0.0006) [2023-03-06 22:26:36,248][62475] Updated weights for policy 0, policy_version 65580 (0.0007) [2023-03-06 22:26:37,069][62475] Updated weights for policy 0, policy_version 65590 (0.0007) [2023-03-06 22:26:37,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12690.7). Total num frames: 67168256. Throughput: 0: 12715.0. Samples: 67163422. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:26:37,390][62145] Avg episode reward: [(0, '860.544')] [2023-03-06 22:26:37,865][62475] Updated weights for policy 0, policy_version 65600 (0.0006) [2023-03-06 22:26:38,670][62475] Updated weights for policy 0, policy_version 65610 (0.0006) [2023-03-06 22:26:39,477][62475] Updated weights for policy 0, policy_version 65620 (0.0006) [2023-03-06 22:26:40,300][62475] Updated weights for policy 0, policy_version 65630 (0.0006) [2023-03-06 22:26:41,096][62475] Updated weights for policy 0, policy_version 65640 (0.0007) [2023-03-06 22:26:41,900][62475] Updated weights for policy 0, policy_version 65650 (0.0006) [2023-03-06 22:26:42,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12690.7). Total num frames: 67231744. Throughput: 0: 12705.8. Samples: 67201405. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:26:42,390][62145] Avg episode reward: [(0, '777.470')] [2023-03-06 22:26:42,698][62475] Updated weights for policy 0, policy_version 65660 (0.0006) [2023-03-06 22:26:43,501][62475] Updated weights for policy 0, policy_version 65670 (0.0006) [2023-03-06 22:26:44,302][62475] Updated weights for policy 0, policy_version 65680 (0.0007) [2023-03-06 22:26:45,121][62475] Updated weights for policy 0, policy_version 65690 (0.0007) [2023-03-06 22:26:45,914][62475] Updated weights for policy 0, policy_version 65700 (0.0006) [2023-03-06 22:26:46,746][62475] Updated weights for policy 0, policy_version 65710 (0.0006) [2023-03-06 22:26:47,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12690.7). Total num frames: 67295232. Throughput: 0: 12700.8. Samples: 67277861. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:26:47,390][62145] Avg episode reward: [(0, '860.038')] [2023-03-06 22:26:47,542][62475] Updated weights for policy 0, policy_version 65720 (0.0006) [2023-03-06 22:26:48,355][62475] Updated weights for policy 0, policy_version 65730 (0.0006) [2023-03-06 22:26:49,154][62475] Updated weights for policy 0, policy_version 65740 (0.0007) [2023-03-06 22:26:49,975][62475] Updated weights for policy 0, policy_version 65750 (0.0006) [2023-03-06 22:26:50,774][62475] Updated weights for policy 0, policy_version 65760 (0.0006) [2023-03-06 22:26:51,582][62475] Updated weights for policy 0, policy_version 65770 (0.0007) [2023-03-06 22:26:52,385][62475] Updated weights for policy 0, policy_version 65780 (0.0006) [2023-03-06 22:26:52,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12690.7). Total num frames: 67358720. Throughput: 0: 12694.3. Samples: 67353802. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:26:52,390][62145] Avg episode reward: [(0, '715.910')] [2023-03-06 22:26:53,206][62475] Updated weights for policy 0, policy_version 65790 (0.0006) [2023-03-06 22:26:54,017][62475] Updated weights for policy 0, policy_version 65800 (0.0006) [2023-03-06 22:26:54,826][62475] Updated weights for policy 0, policy_version 65810 (0.0006) [2023-03-06 22:26:55,619][62475] Updated weights for policy 0, policy_version 65820 (0.0006) [2023-03-06 22:26:56,417][62475] Updated weights for policy 0, policy_version 65830 (0.0006) [2023-03-06 22:26:57,228][62475] Updated weights for policy 0, policy_version 65840 (0.0007) [2023-03-06 22:26:57,390][62145] Fps is (10 sec: 12595.2, 60 sec: 12697.6, 300 sec: 12687.2). Total num frames: 67421184. Throughput: 0: 12696.8. Samples: 67391835. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:26:57,390][62145] Avg episode reward: [(0, '783.428')] [2023-03-06 22:26:58,038][62475] Updated weights for policy 0, policy_version 65850 (0.0006) [2023-03-06 22:26:58,834][62475] Updated weights for policy 0, policy_version 65860 (0.0006) [2023-03-06 22:26:59,662][62475] Updated weights for policy 0, policy_version 65870 (0.0006) [2023-03-06 22:27:00,460][62475] Updated weights for policy 0, policy_version 65880 (0.0007) [2023-03-06 22:27:01,265][62475] Updated weights for policy 0, policy_version 65890 (0.0006) [2023-03-06 22:27:02,069][62475] Updated weights for policy 0, policy_version 65900 (0.0006) [2023-03-06 22:27:02,390][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12690.7). Total num frames: 67485696. Throughput: 0: 12699.7. Samples: 67468113. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:27:02,390][62145] Avg episode reward: [(0, '683.701')] [2023-03-06 22:27:02,865][62475] Updated weights for policy 0, policy_version 65910 (0.0006) [2023-03-06 22:27:03,676][62475] Updated weights for policy 0, policy_version 65920 (0.0007) [2023-03-06 22:27:04,470][62475] Updated weights for policy 0, policy_version 65930 (0.0006) [2023-03-06 22:27:05,288][62475] Updated weights for policy 0, policy_version 65940 (0.0007) [2023-03-06 22:27:06,108][62475] Updated weights for policy 0, policy_version 65950 (0.0007) [2023-03-06 22:27:06,915][62475] Updated weights for policy 0, policy_version 65960 (0.0006) [2023-03-06 22:27:07,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12697.6, 300 sec: 12694.1). Total num frames: 67549184. Throughput: 0: 12692.8. Samples: 67544308. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:27:07,390][62145] Avg episode reward: [(0, '746.545')] [2023-03-06 22:27:07,713][62475] Updated weights for policy 0, policy_version 65970 (0.0006) [2023-03-06 22:27:08,515][62475] Updated weights for policy 0, policy_version 65980 (0.0006) [2023-03-06 22:27:09,326][62475] Updated weights for policy 0, policy_version 65990 (0.0006) [2023-03-06 22:27:10,145][62475] Updated weights for policy 0, policy_version 66000 (0.0006) [2023-03-06 22:27:10,956][62475] Updated weights for policy 0, policy_version 66010 (0.0006) [2023-03-06 22:27:11,740][62475] Updated weights for policy 0, policy_version 66020 (0.0006) [2023-03-06 22:27:12,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12694.1). Total num frames: 67612672. Throughput: 0: 12698.6. Samples: 67582511. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:27:12,390][62145] Avg episode reward: [(0, '751.134')] [2023-03-06 22:27:12,547][62475] Updated weights for policy 0, policy_version 66030 (0.0006) [2023-03-06 22:27:13,382][62475] Updated weights for policy 0, policy_version 66040 (0.0006) [2023-03-06 22:27:14,198][62475] Updated weights for policy 0, policy_version 66050 (0.0006) [2023-03-06 22:27:15,010][62475] Updated weights for policy 0, policy_version 66060 (0.0006) [2023-03-06 22:27:15,808][62475] Updated weights for policy 0, policy_version 66070 (0.0006) [2023-03-06 22:27:16,624][62475] Updated weights for policy 0, policy_version 66080 (0.0008) [2023-03-06 22:27:17,390][62145] Fps is (10 sec: 12595.1, 60 sec: 12697.6, 300 sec: 12690.7). Total num frames: 67675136. Throughput: 0: 12686.1. Samples: 67658182. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:27:17,390][62145] Avg episode reward: [(0, '848.278')] [2023-03-06 22:27:17,445][62475] Updated weights for policy 0, policy_version 66090 (0.0006) [2023-03-06 22:27:18,244][62475] Updated weights for policy 0, policy_version 66100 (0.0006) [2023-03-06 22:27:19,057][62475] Updated weights for policy 0, policy_version 66110 (0.0006) [2023-03-06 22:27:19,849][62475] Updated weights for policy 0, policy_version 66120 (0.0006) [2023-03-06 22:27:20,664][62475] Updated weights for policy 0, policy_version 66130 (0.0006) [2023-03-06 22:27:21,461][62475] Updated weights for policy 0, policy_version 66140 (0.0006) [2023-03-06 22:27:22,274][62475] Updated weights for policy 0, policy_version 66150 (0.0007) [2023-03-06 22:27:22,390][62145] Fps is (10 sec: 12595.2, 60 sec: 12680.5, 300 sec: 12690.7). Total num frames: 67738624. Throughput: 0: 12681.9. Samples: 67734108. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:27:22,390][62145] Avg episode reward: [(0, '659.051')] [2023-03-06 22:27:23,107][62475] Updated weights for policy 0, policy_version 66160 (0.0007) [2023-03-06 22:27:23,918][62475] Updated weights for policy 0, policy_version 66170 (0.0006) [2023-03-06 22:27:24,710][62475] Updated weights for policy 0, policy_version 66180 (0.0006) [2023-03-06 22:27:25,519][62475] Updated weights for policy 0, policy_version 66190 (0.0006) [2023-03-06 22:27:26,304][62475] Updated weights for policy 0, policy_version 66200 (0.0006) [2023-03-06 22:27:27,107][62475] Updated weights for policy 0, policy_version 66210 (0.0006) [2023-03-06 22:27:27,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12680.5, 300 sec: 12690.7). Total num frames: 67802112. Throughput: 0: 12680.7. Samples: 67772034. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:27:27,390][62145] Avg episode reward: [(0, '615.635')] [2023-03-06 22:27:27,925][62475] Updated weights for policy 0, policy_version 66220 (0.0006) [2023-03-06 22:27:28,721][62475] Updated weights for policy 0, policy_version 66230 (0.0006) [2023-03-06 22:27:29,536][62475] Updated weights for policy 0, policy_version 66240 (0.0006) [2023-03-06 22:27:30,316][62475] Updated weights for policy 0, policy_version 66250 (0.0006) [2023-03-06 22:27:31,129][62475] Updated weights for policy 0, policy_version 66260 (0.0006) [2023-03-06 22:27:31,943][62475] Updated weights for policy 0, policy_version 66270 (0.0006) [2023-03-06 22:27:32,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12690.7). Total num frames: 67865600. Throughput: 0: 12685.9. Samples: 67848726. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:27:32,390][62145] Avg episode reward: [(0, '722.185')] [2023-03-06 22:27:32,735][62475] Updated weights for policy 0, policy_version 66280 (0.0006) [2023-03-06 22:27:33,550][62475] Updated weights for policy 0, policy_version 66290 (0.0006) [2023-03-06 22:27:34,357][62475] Updated weights for policy 0, policy_version 66300 (0.0006) [2023-03-06 22:27:35,158][62475] Updated weights for policy 0, policy_version 66310 (0.0006) [2023-03-06 22:27:35,945][62475] Updated weights for policy 0, policy_version 66320 (0.0006) [2023-03-06 22:27:36,770][62475] Updated weights for policy 0, policy_version 66330 (0.0006) [2023-03-06 22:27:37,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12680.5, 300 sec: 12690.7). Total num frames: 67929088. Throughput: 0: 12694.0. Samples: 67925030. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:27:37,390][62145] Avg episode reward: [(0, '662.393')] [2023-03-06 22:27:37,586][62475] Updated weights for policy 0, policy_version 66340 (0.0006) [2023-03-06 22:27:38,379][62475] Updated weights for policy 0, policy_version 66350 (0.0006) [2023-03-06 22:27:39,181][62475] Updated weights for policy 0, policy_version 66360 (0.0005) [2023-03-06 22:27:39,993][62475] Updated weights for policy 0, policy_version 66370 (0.0006) [2023-03-06 22:27:40,782][62475] Updated weights for policy 0, policy_version 66380 (0.0007) [2023-03-06 22:27:41,598][62475] Updated weights for policy 0, policy_version 66390 (0.0007) [2023-03-06 22:27:42,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12680.5, 300 sec: 12694.1). Total num frames: 67992576. Throughput: 0: 12697.4. Samples: 67963219. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:27:42,401][62145] Avg episode reward: [(0, '766.924')] [2023-03-06 22:27:42,411][62475] Updated weights for policy 0, policy_version 66400 (0.0006) [2023-03-06 22:27:43,221][62475] Updated weights for policy 0, policy_version 66410 (0.0006) [2023-03-06 22:27:44,040][62475] Updated weights for policy 0, policy_version 66420 (0.0006) [2023-03-06 22:27:44,844][62475] Updated weights for policy 0, policy_version 66430 (0.0006) [2023-03-06 22:27:45,669][62475] Updated weights for policy 0, policy_version 66440 (0.0006) [2023-03-06 22:27:46,470][62475] Updated weights for policy 0, policy_version 66450 (0.0006) [2023-03-06 22:27:47,255][62475] Updated weights for policy 0, policy_version 66460 (0.0006) [2023-03-06 22:27:47,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12694.1). Total num frames: 68056064. Throughput: 0: 12689.3. Samples: 68039134. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:27:47,401][62145] Avg episode reward: [(0, '817.869')] [2023-03-06 22:27:48,070][62475] Updated weights for policy 0, policy_version 66470 (0.0006) [2023-03-06 22:27:48,893][62475] Updated weights for policy 0, policy_version 66480 (0.0006) [2023-03-06 22:27:49,692][62475] Updated weights for policy 0, policy_version 66490 (0.0006) [2023-03-06 22:27:50,507][62475] Updated weights for policy 0, policy_version 66500 (0.0006) [2023-03-06 22:27:51,309][62475] Updated weights for policy 0, policy_version 66510 (0.0006) [2023-03-06 22:27:52,113][62475] Updated weights for policy 0, policy_version 66520 (0.0006) [2023-03-06 22:27:52,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12694.1). Total num frames: 68119552. Throughput: 0: 12686.4. Samples: 68115195. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:27:52,401][62145] Avg episode reward: [(0, '953.563')] [2023-03-06 22:27:52,925][62475] Updated weights for policy 0, policy_version 66530 (0.0007) [2023-03-06 22:27:53,726][62475] Updated weights for policy 0, policy_version 66540 (0.0007) [2023-03-06 22:27:54,524][62475] Updated weights for policy 0, policy_version 66550 (0.0007) [2023-03-06 22:27:55,330][62475] Updated weights for policy 0, policy_version 66560 (0.0006) [2023-03-06 22:27:56,138][62475] Updated weights for policy 0, policy_version 66570 (0.0006) [2023-03-06 22:27:56,930][62475] Updated weights for policy 0, policy_version 66580 (0.0006) [2023-03-06 22:27:57,390][62145] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12694.1). Total num frames: 68183040. Throughput: 0: 12685.6. Samples: 68153364. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:27:57,401][62145] Avg episode reward: [(0, '848.461')] [2023-03-06 22:27:57,726][62475] Updated weights for policy 0, policy_version 66590 (0.0006) [2023-03-06 22:27:58,535][62475] Updated weights for policy 0, policy_version 66600 (0.0006) [2023-03-06 22:27:59,346][62475] Updated weights for policy 0, policy_version 66610 (0.0007) [2023-03-06 22:28:00,141][62475] Updated weights for policy 0, policy_version 66620 (0.0007) [2023-03-06 22:28:00,952][62475] Updated weights for policy 0, policy_version 66630 (0.0007) [2023-03-06 22:28:01,768][62475] Updated weights for policy 0, policy_version 66640 (0.0007) [2023-03-06 22:28:02,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12694.1). Total num frames: 68246528. Throughput: 0: 12704.5. Samples: 68229883. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:28:02,400][62145] Avg episode reward: [(0, '998.259')] [2023-03-06 22:28:02,569][62475] Updated weights for policy 0, policy_version 66650 (0.0006) [2023-03-06 22:28:03,369][62475] Updated weights for policy 0, policy_version 66660 (0.0007) [2023-03-06 22:28:04,183][62475] Updated weights for policy 0, policy_version 66670 (0.0006) [2023-03-06 22:28:04,976][62475] Updated weights for policy 0, policy_version 66680 (0.0006) [2023-03-06 22:28:05,775][62475] Updated weights for policy 0, policy_version 66690 (0.0007) [2023-03-06 22:28:06,580][62475] Updated weights for policy 0, policy_version 66700 (0.0006) [2023-03-06 22:28:07,367][62475] Updated weights for policy 0, policy_version 66710 (0.0006) [2023-03-06 22:28:07,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12697.6, 300 sec: 12697.6). Total num frames: 68311040. Throughput: 0: 12714.5. Samples: 68306262. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:28:07,401][62145] Avg episode reward: [(0, '1054.506')] [2023-03-06 22:28:08,168][62475] Updated weights for policy 0, policy_version 66720 (0.0006) [2023-03-06 22:28:08,973][62475] Updated weights for policy 0, policy_version 66730 (0.0007) [2023-03-06 22:28:09,775][62475] Updated weights for policy 0, policy_version 66740 (0.0006) [2023-03-06 22:28:10,579][62475] Updated weights for policy 0, policy_version 66750 (0.0006) [2023-03-06 22:28:11,388][62475] Updated weights for policy 0, policy_version 66760 (0.0006) [2023-03-06 22:28:12,182][62475] Updated weights for policy 0, policy_version 66770 (0.0007) [2023-03-06 22:28:12,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12697.6, 300 sec: 12697.6). Total num frames: 68374528. Throughput: 0: 12724.5. Samples: 68344635. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:28:12,400][62145] Avg episode reward: [(0, '955.464')] [2023-03-06 22:28:13,000][62475] Updated weights for policy 0, policy_version 66780 (0.0006) [2023-03-06 22:28:13,797][62475] Updated weights for policy 0, policy_version 66790 (0.0006) [2023-03-06 22:28:14,578][62475] Updated weights for policy 0, policy_version 66800 (0.0006) [2023-03-06 22:28:15,402][62475] Updated weights for policy 0, policy_version 66810 (0.0007) [2023-03-06 22:28:16,211][62475] Updated weights for policy 0, policy_version 66820 (0.0006) [2023-03-06 22:28:17,013][62475] Updated weights for policy 0, policy_version 66830 (0.0006) [2023-03-06 22:28:17,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12697.6). Total num frames: 68438016. Throughput: 0: 12721.0. Samples: 68421170. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:28:17,390][62145] Avg episode reward: [(0, '984.012')] [2023-03-06 22:28:17,836][62475] Updated weights for policy 0, policy_version 66840 (0.0006) [2023-03-06 22:28:18,641][62475] Updated weights for policy 0, policy_version 66850 (0.0007) [2023-03-06 22:28:19,438][62475] Updated weights for policy 0, policy_version 66860 (0.0007) [2023-03-06 22:28:20,278][62475] Updated weights for policy 0, policy_version 66870 (0.0006) [2023-03-06 22:28:21,055][62475] Updated weights for policy 0, policy_version 66880 (0.0005) [2023-03-06 22:28:21,874][62475] Updated weights for policy 0, policy_version 66890 (0.0006) [2023-03-06 22:28:22,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12697.6). Total num frames: 68501504. Throughput: 0: 12714.0. Samples: 68497159. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:28:22,390][62145] Avg episode reward: [(0, '829.958')] [2023-03-06 22:28:22,394][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000066896_68501504.pth... [2023-03-06 22:28:22,425][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000063919_65453056.pth [2023-03-06 22:28:22,677][62475] Updated weights for policy 0, policy_version 66900 (0.0007) [2023-03-06 22:28:23,475][62475] Updated weights for policy 0, policy_version 66910 (0.0006) [2023-03-06 22:28:24,293][62475] Updated weights for policy 0, policy_version 66920 (0.0006) [2023-03-06 22:28:25,099][62475] Updated weights for policy 0, policy_version 66930 (0.0007) [2023-03-06 22:28:25,898][62475] Updated weights for policy 0, policy_version 66940 (0.0006) [2023-03-06 22:28:26,692][62475] Updated weights for policy 0, policy_version 66950 (0.0005) [2023-03-06 22:28:27,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12697.6). Total num frames: 68564992. Throughput: 0: 12715.2. Samples: 68535402. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:28:27,390][62145] Avg episode reward: [(0, '866.565')] [2023-03-06 22:28:27,529][62475] Updated weights for policy 0, policy_version 66960 (0.0007) [2023-03-06 22:28:28,346][62475] Updated weights for policy 0, policy_version 66970 (0.0007) [2023-03-06 22:28:29,141][62475] Updated weights for policy 0, policy_version 66980 (0.0006) [2023-03-06 22:28:29,957][62475] Updated weights for policy 0, policy_version 66990 (0.0006) [2023-03-06 22:28:30,777][62475] Updated weights for policy 0, policy_version 67000 (0.0006) [2023-03-06 22:28:30,846][62424] KL-divergence is very high: 547431.6250 [2023-03-06 22:28:31,560][62475] Updated weights for policy 0, policy_version 67010 (0.0007) [2023-03-06 22:28:32,357][62475] Updated weights for policy 0, policy_version 67020 (0.0007) [2023-03-06 22:28:32,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12697.6). Total num frames: 68628480. Throughput: 0: 12710.8. Samples: 68611121. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:28:32,390][62145] Avg episode reward: [(0, '818.326')] [2023-03-06 22:28:33,161][62475] Updated weights for policy 0, policy_version 67030 (0.0006) [2023-03-06 22:28:33,981][62475] Updated weights for policy 0, policy_version 67040 (0.0006) [2023-03-06 22:28:34,779][62475] Updated weights for policy 0, policy_version 67050 (0.0007) [2023-03-06 22:28:35,585][62475] Updated weights for policy 0, policy_version 67060 (0.0006) [2023-03-06 22:28:36,376][62475] Updated weights for policy 0, policy_version 67070 (0.0006) [2023-03-06 22:28:37,185][62475] Updated weights for policy 0, policy_version 67080 (0.0006) [2023-03-06 22:28:37,390][62145] Fps is (10 sec: 12697.4, 60 sec: 12714.7, 300 sec: 12697.6). Total num frames: 68691968. Throughput: 0: 12722.6. Samples: 68687713. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:28:37,390][62145] Avg episode reward: [(0, '901.239')] [2023-03-06 22:28:37,986][62475] Updated weights for policy 0, policy_version 67090 (0.0006) [2023-03-06 22:28:38,805][62475] Updated weights for policy 0, policy_version 67100 (0.0007) [2023-03-06 22:28:39,589][62475] Updated weights for policy 0, policy_version 67110 (0.0006) [2023-03-06 22:28:40,394][62475] Updated weights for policy 0, policy_version 67120 (0.0006) [2023-03-06 22:28:41,202][62475] Updated weights for policy 0, policy_version 67130 (0.0006) [2023-03-06 22:28:42,000][62475] Updated weights for policy 0, policy_version 67140 (0.0006) [2023-03-06 22:28:42,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12697.6). Total num frames: 68755456. Throughput: 0: 12724.8. Samples: 68725982. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:28:42,390][62145] Avg episode reward: [(0, '852.064')] [2023-03-06 22:28:42,798][62475] Updated weights for policy 0, policy_version 67150 (0.0006) [2023-03-06 22:28:43,614][62475] Updated weights for policy 0, policy_version 67160 (0.0006) [2023-03-06 22:28:44,424][62475] Updated weights for policy 0, policy_version 67170 (0.0006) [2023-03-06 22:28:45,204][62475] Updated weights for policy 0, policy_version 67180 (0.0006) [2023-03-06 22:28:46,032][62475] Updated weights for policy 0, policy_version 67190 (0.0007) [2023-03-06 22:28:46,816][62475] Updated weights for policy 0, policy_version 67200 (0.0006) [2023-03-06 22:28:47,390][62145] Fps is (10 sec: 12800.1, 60 sec: 12731.8, 300 sec: 12704.5). Total num frames: 68819968. Throughput: 0: 12724.5. Samples: 68802487. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:28:47,390][62145] Avg episode reward: [(0, '753.563')] [2023-03-06 22:28:47,597][62475] Updated weights for policy 0, policy_version 67210 (0.0006) [2023-03-06 22:28:48,423][62475] Updated weights for policy 0, policy_version 67220 (0.0006) [2023-03-06 22:28:49,206][62475] Updated weights for policy 0, policy_version 67230 (0.0006) [2023-03-06 22:28:50,006][62475] Updated weights for policy 0, policy_version 67240 (0.0007) [2023-03-06 22:28:50,816][62475] Updated weights for policy 0, policy_version 67250 (0.0006) [2023-03-06 22:28:51,647][62475] Updated weights for policy 0, policy_version 67260 (0.0007) [2023-03-06 22:28:52,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12704.5). Total num frames: 68883456. Throughput: 0: 12722.4. Samples: 68878771. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:28:52,390][62145] Avg episode reward: [(0, '673.041')] [2023-03-06 22:28:52,449][62475] Updated weights for policy 0, policy_version 67270 (0.0006) [2023-03-06 22:28:53,265][62475] Updated weights for policy 0, policy_version 67280 (0.0006) [2023-03-06 22:28:54,072][62475] Updated weights for policy 0, policy_version 67290 (0.0006) [2023-03-06 22:28:54,864][62475] Updated weights for policy 0, policy_version 67300 (0.0006) [2023-03-06 22:28:55,677][62475] Updated weights for policy 0, policy_version 67310 (0.0006) [2023-03-06 22:28:56,494][62475] Updated weights for policy 0, policy_version 67320 (0.0006) [2023-03-06 22:28:57,296][62475] Updated weights for policy 0, policy_version 67330 (0.0007) [2023-03-06 22:28:57,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12704.5). Total num frames: 68946944. Throughput: 0: 12715.4. Samples: 68916828. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:28:57,390][62145] Avg episode reward: [(0, '795.537')] [2023-03-06 22:28:58,107][62475] Updated weights for policy 0, policy_version 67340 (0.0006) [2023-03-06 22:28:58,892][62475] Updated weights for policy 0, policy_version 67350 (0.0006) [2023-03-06 22:28:59,692][62475] Updated weights for policy 0, policy_version 67360 (0.0006) [2023-03-06 22:29:00,507][62475] Updated weights for policy 0, policy_version 67370 (0.0006) [2023-03-06 22:29:01,319][62475] Updated weights for policy 0, policy_version 67380 (0.0006) [2023-03-06 22:29:02,137][62475] Updated weights for policy 0, policy_version 67390 (0.0006) [2023-03-06 22:29:02,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12704.5). Total num frames: 69010432. Throughput: 0: 12712.3. Samples: 68993222. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:29:02,390][62145] Avg episode reward: [(0, '811.106')] [2023-03-06 22:29:02,942][62475] Updated weights for policy 0, policy_version 67400 (0.0006) [2023-03-06 22:29:03,758][62475] Updated weights for policy 0, policy_version 67410 (0.0006) [2023-03-06 22:29:04,563][62475] Updated weights for policy 0, policy_version 67420 (0.0006) [2023-03-06 22:29:05,349][62475] Updated weights for policy 0, policy_version 67430 (0.0006) [2023-03-06 22:29:06,158][62475] Updated weights for policy 0, policy_version 67440 (0.0006) [2023-03-06 22:29:06,963][62475] Updated weights for policy 0, policy_version 67450 (0.0006) [2023-03-06 22:29:07,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12704.5). Total num frames: 69073920. Throughput: 0: 12713.7. Samples: 69069274. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:29:07,390][62145] Avg episode reward: [(0, '788.291')] [2023-03-06 22:29:07,781][62475] Updated weights for policy 0, policy_version 67460 (0.0006) [2023-03-06 22:29:08,573][62475] Updated weights for policy 0, policy_version 67470 (0.0006) [2023-03-06 22:29:09,376][62475] Updated weights for policy 0, policy_version 67480 (0.0006) [2023-03-06 22:29:10,149][62475] Updated weights for policy 0, policy_version 67490 (0.0007) [2023-03-06 22:29:10,941][62475] Updated weights for policy 0, policy_version 67500 (0.0008) [2023-03-06 22:29:11,748][62475] Updated weights for policy 0, policy_version 67510 (0.0006) [2023-03-06 22:29:12,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12708.0). Total num frames: 69137408. Throughput: 0: 12722.4. Samples: 69107912. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:29:12,390][62145] Avg episode reward: [(0, '761.487')] [2023-03-06 22:29:12,554][62475] Updated weights for policy 0, policy_version 67520 (0.0006) [2023-03-06 22:29:13,356][62475] Updated weights for policy 0, policy_version 67530 (0.0006) [2023-03-06 22:29:14,159][62475] Updated weights for policy 0, policy_version 67540 (0.0006) [2023-03-06 22:29:14,962][62475] Updated weights for policy 0, policy_version 67550 (0.0007) [2023-03-06 22:29:15,806][62475] Updated weights for policy 0, policy_version 67560 (0.0006) [2023-03-06 22:29:16,619][62475] Updated weights for policy 0, policy_version 67570 (0.0007) [2023-03-06 22:29:17,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12708.0). Total num frames: 69200896. Throughput: 0: 12734.8. Samples: 69184185. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:29:17,390][62145] Avg episode reward: [(0, '812.729')] [2023-03-06 22:29:17,414][62475] Updated weights for policy 0, policy_version 67580 (0.0006) [2023-03-06 22:29:18,232][62475] Updated weights for policy 0, policy_version 67590 (0.0006) [2023-03-06 22:29:19,034][62475] Updated weights for policy 0, policy_version 67600 (0.0006) [2023-03-06 22:29:19,848][62475] Updated weights for policy 0, policy_version 67610 (0.0007) [2023-03-06 22:29:20,641][62475] Updated weights for policy 0, policy_version 67620 (0.0006) [2023-03-06 22:29:21,462][62475] Updated weights for policy 0, policy_version 67630 (0.0006) [2023-03-06 22:29:22,262][62475] Updated weights for policy 0, policy_version 67640 (0.0006) [2023-03-06 22:29:22,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12708.0). Total num frames: 69264384. Throughput: 0: 12722.8. Samples: 69260237. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:29:22,390][62145] Avg episode reward: [(0, '667.649')] [2023-03-06 22:29:23,039][62475] Updated weights for policy 0, policy_version 67650 (0.0006) [2023-03-06 22:29:23,846][62475] Updated weights for policy 0, policy_version 67660 (0.0006) [2023-03-06 22:29:24,662][62475] Updated weights for policy 0, policy_version 67670 (0.0007) [2023-03-06 22:29:25,461][62475] Updated weights for policy 0, policy_version 67680 (0.0006) [2023-03-06 22:29:26,274][62475] Updated weights for policy 0, policy_version 67690 (0.0006) [2023-03-06 22:29:27,095][62475] Updated weights for policy 0, policy_version 67700 (0.0007) [2023-03-06 22:29:27,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12708.0). Total num frames: 69327872. Throughput: 0: 12725.6. Samples: 69298633. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:29:27,390][62145] Avg episode reward: [(0, '756.301')] [2023-03-06 22:29:27,895][62475] Updated weights for policy 0, policy_version 67710 (0.0007) [2023-03-06 22:29:28,698][62475] Updated weights for policy 0, policy_version 67720 (0.0006) [2023-03-06 22:29:29,515][62475] Updated weights for policy 0, policy_version 67730 (0.0006) [2023-03-06 22:29:30,317][62475] Updated weights for policy 0, policy_version 67740 (0.0006) [2023-03-06 22:29:31,112][62475] Updated weights for policy 0, policy_version 67750 (0.0007) [2023-03-06 22:29:31,942][62475] Updated weights for policy 0, policy_version 67760 (0.0006) [2023-03-06 22:29:32,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12708.0). Total num frames: 69391360. Throughput: 0: 12712.9. Samples: 69374568. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:29:32,390][62145] Avg episode reward: [(0, '888.142')] [2023-03-06 22:29:32,753][62475] Updated weights for policy 0, policy_version 67770 (0.0006) [2023-03-06 22:29:33,559][62475] Updated weights for policy 0, policy_version 67780 (0.0006) [2023-03-06 22:29:34,365][62475] Updated weights for policy 0, policy_version 67790 (0.0006) [2023-03-06 22:29:35,179][62475] Updated weights for policy 0, policy_version 67800 (0.0006) [2023-03-06 22:29:35,984][62475] Updated weights for policy 0, policy_version 67810 (0.0006) [2023-03-06 22:29:36,769][62475] Updated weights for policy 0, policy_version 67820 (0.0006) [2023-03-06 22:29:37,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12708.0). Total num frames: 69454848. Throughput: 0: 12705.6. Samples: 69450522. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:29:37,390][62145] Avg episode reward: [(0, '797.381')] [2023-03-06 22:29:37,587][62475] Updated weights for policy 0, policy_version 67830 (0.0006) [2023-03-06 22:29:38,384][62475] Updated weights for policy 0, policy_version 67840 (0.0006) [2023-03-06 22:29:39,201][62475] Updated weights for policy 0, policy_version 67850 (0.0006) [2023-03-06 22:29:40,017][62475] Updated weights for policy 0, policy_version 67860 (0.0006) [2023-03-06 22:29:40,804][62475] Updated weights for policy 0, policy_version 67870 (0.0006) [2023-03-06 22:29:41,637][62475] Updated weights for policy 0, policy_version 67880 (0.0006) [2023-03-06 22:29:42,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12708.0). Total num frames: 69518336. Throughput: 0: 12705.2. Samples: 69488561. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:29:42,390][62145] Avg episode reward: [(0, '843.262')] [2023-03-06 22:29:42,434][62475] Updated weights for policy 0, policy_version 67890 (0.0006) [2023-03-06 22:29:43,234][62475] Updated weights for policy 0, policy_version 67900 (0.0006) [2023-03-06 22:29:44,036][62475] Updated weights for policy 0, policy_version 67910 (0.0006) [2023-03-06 22:29:44,862][62475] Updated weights for policy 0, policy_version 67920 (0.0006) [2023-03-06 22:29:45,659][62475] Updated weights for policy 0, policy_version 67930 (0.0006) [2023-03-06 22:29:46,451][62475] Updated weights for policy 0, policy_version 67940 (0.0006) [2023-03-06 22:29:47,265][62475] Updated weights for policy 0, policy_version 67950 (0.0006) [2023-03-06 22:29:47,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12708.0). Total num frames: 69581824. Throughput: 0: 12702.6. Samples: 69564839. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:29:47,390][62145] Avg episode reward: [(0, '806.300')] [2023-03-06 22:29:48,068][62475] Updated weights for policy 0, policy_version 67960 (0.0006) [2023-03-06 22:29:48,846][62475] Updated weights for policy 0, policy_version 67970 (0.0006) [2023-03-06 22:29:49,659][62475] Updated weights for policy 0, policy_version 67980 (0.0006) [2023-03-06 22:29:50,478][62475] Updated weights for policy 0, policy_version 67990 (0.0007) [2023-03-06 22:29:51,293][62475] Updated weights for policy 0, policy_version 68000 (0.0006) [2023-03-06 22:29:52,108][62475] Updated weights for policy 0, policy_version 68010 (0.0007) [2023-03-06 22:29:52,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12708.0). Total num frames: 69645312. Throughput: 0: 12708.4. Samples: 69641154. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:29:52,390][62145] Avg episode reward: [(0, '784.423')] [2023-03-06 22:29:52,891][62475] Updated weights for policy 0, policy_version 68020 (0.0007) [2023-03-06 22:29:53,709][62475] Updated weights for policy 0, policy_version 68030 (0.0006) [2023-03-06 22:29:54,504][62475] Updated weights for policy 0, policy_version 68040 (0.0006) [2023-03-06 22:29:55,327][62475] Updated weights for policy 0, policy_version 68050 (0.0006) [2023-03-06 22:29:56,101][62475] Updated weights for policy 0, policy_version 68060 (0.0006) [2023-03-06 22:29:56,920][62475] Updated weights for policy 0, policy_version 68070 (0.0007) [2023-03-06 22:29:57,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12714.7, 300 sec: 12711.5). Total num frames: 69709824. Throughput: 0: 12700.5. Samples: 69679432. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:29:57,390][62145] Avg episode reward: [(0, '654.630')] [2023-03-06 22:29:57,713][62475] Updated weights for policy 0, policy_version 68080 (0.0006) [2023-03-06 22:29:58,506][62475] Updated weights for policy 0, policy_version 68090 (0.0006) [2023-03-06 22:29:59,305][62475] Updated weights for policy 0, policy_version 68100 (0.0007) [2023-03-06 22:30:00,109][62475] Updated weights for policy 0, policy_version 68110 (0.0006) [2023-03-06 22:30:00,906][62475] Updated weights for policy 0, policy_version 68120 (0.0006) [2023-03-06 22:30:01,715][62475] Updated weights for policy 0, policy_version 68130 (0.0006) [2023-03-06 22:30:02,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12714.7, 300 sec: 12711.5). Total num frames: 69773312. Throughput: 0: 12707.4. Samples: 69756017. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:30:02,390][62145] Avg episode reward: [(0, '778.036')] [2023-03-06 22:30:02,533][62475] Updated weights for policy 0, policy_version 68140 (0.0006) [2023-03-06 22:30:03,327][62475] Updated weights for policy 0, policy_version 68150 (0.0006) [2023-03-06 22:30:04,129][62475] Updated weights for policy 0, policy_version 68160 (0.0006) [2023-03-06 22:30:04,942][62475] Updated weights for policy 0, policy_version 68170 (0.0006) [2023-03-06 22:30:05,733][62475] Updated weights for policy 0, policy_version 68180 (0.0006) [2023-03-06 22:30:06,507][62475] Updated weights for policy 0, policy_version 68190 (0.0006) [2023-03-06 22:30:07,323][62475] Updated weights for policy 0, policy_version 68200 (0.0007) [2023-03-06 22:30:07,389][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12711.5). Total num frames: 69836800. Throughput: 0: 12727.2. Samples: 69832959. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:30:07,390][62145] Avg episode reward: [(0, '840.211')] [2023-03-06 22:30:08,117][62475] Updated weights for policy 0, policy_version 68210 (0.0006) [2023-03-06 22:30:08,921][62475] Updated weights for policy 0, policy_version 68220 (0.0006) [2023-03-06 22:30:09,735][62475] Updated weights for policy 0, policy_version 68230 (0.0006) [2023-03-06 22:30:10,530][62475] Updated weights for policy 0, policy_version 68240 (0.0006) [2023-03-06 22:30:11,343][62475] Updated weights for policy 0, policy_version 68250 (0.0006) [2023-03-06 22:30:12,148][62475] Updated weights for policy 0, policy_version 68260 (0.0006) [2023-03-06 22:30:12,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12715.0). Total num frames: 69901312. Throughput: 0: 12717.3. Samples: 69870912. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:30:12,390][62145] Avg episode reward: [(0, '899.377')] [2023-03-06 22:30:12,942][62475] Updated weights for policy 0, policy_version 68270 (0.0006) [2023-03-06 22:30:13,752][62475] Updated weights for policy 0, policy_version 68280 (0.0007) [2023-03-06 22:30:14,564][62475] Updated weights for policy 0, policy_version 68290 (0.0006) [2023-03-06 22:30:15,369][62475] Updated weights for policy 0, policy_version 68300 (0.0007) [2023-03-06 22:30:16,178][62475] Updated weights for policy 0, policy_version 68310 (0.0005) [2023-03-06 22:30:16,986][62475] Updated weights for policy 0, policy_version 68320 (0.0006) [2023-03-06 22:30:17,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12711.5). Total num frames: 69963776. Throughput: 0: 12725.6. Samples: 69947219. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:30:17,390][62145] Avg episode reward: [(0, '738.229')] [2023-03-06 22:30:17,789][62475] Updated weights for policy 0, policy_version 68330 (0.0006) [2023-03-06 22:30:18,604][62475] Updated weights for policy 0, policy_version 68340 (0.0007) [2023-03-06 22:30:19,399][62475] Updated weights for policy 0, policy_version 68350 (0.0006) [2023-03-06 22:30:20,234][62475] Updated weights for policy 0, policy_version 68360 (0.0006) [2023-03-06 22:30:21,045][62475] Updated weights for policy 0, policy_version 68370 (0.0007) [2023-03-06 22:30:21,858][62475] Updated weights for policy 0, policy_version 68380 (0.0006) [2023-03-06 22:30:22,390][62145] Fps is (10 sec: 12595.1, 60 sec: 12714.7, 300 sec: 12711.5). Total num frames: 70027264. Throughput: 0: 12724.1. Samples: 70023108. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:30:22,390][62145] Avg episode reward: [(0, '855.685')] [2023-03-06 22:30:22,394][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000068386_70027264.pth... [2023-03-06 22:30:22,426][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000065408_66977792.pth [2023-03-06 22:30:22,663][62475] Updated weights for policy 0, policy_version 68390 (0.0007) [2023-03-06 22:30:23,468][62475] Updated weights for policy 0, policy_version 68400 (0.0006) [2023-03-06 22:30:24,271][62475] Updated weights for policy 0, policy_version 68410 (0.0006) [2023-03-06 22:30:25,076][62475] Updated weights for policy 0, policy_version 68420 (0.0006) [2023-03-06 22:30:25,872][62475] Updated weights for policy 0, policy_version 68430 (0.0006) [2023-03-06 22:30:26,665][62475] Updated weights for policy 0, policy_version 68440 (0.0006) [2023-03-06 22:30:27,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12731.7, 300 sec: 12715.0). Total num frames: 70091776. Throughput: 0: 12730.7. Samples: 70061444. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:30:27,390][62145] Avg episode reward: [(0, '763.022')] [2023-03-06 22:30:27,475][62475] Updated weights for policy 0, policy_version 68450 (0.0006) [2023-03-06 22:30:28,271][62475] Updated weights for policy 0, policy_version 68460 (0.0006) [2023-03-06 22:30:29,075][62475] Updated weights for policy 0, policy_version 68470 (0.0007) [2023-03-06 22:30:29,878][62475] Updated weights for policy 0, policy_version 68480 (0.0006) [2023-03-06 22:30:30,689][62475] Updated weights for policy 0, policy_version 68490 (0.0006) [2023-03-06 22:30:31,499][62475] Updated weights for policy 0, policy_version 68500 (0.0006) [2023-03-06 22:30:32,300][62475] Updated weights for policy 0, policy_version 68510 (0.0007) [2023-03-06 22:30:32,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12711.5). Total num frames: 70154240. Throughput: 0: 12734.5. Samples: 70137894. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:30:32,390][62145] Avg episode reward: [(0, '715.437')] [2023-03-06 22:30:33,105][62475] Updated weights for policy 0, policy_version 68520 (0.0006) [2023-03-06 22:30:33,910][62475] Updated weights for policy 0, policy_version 68530 (0.0007) [2023-03-06 22:30:34,716][62475] Updated weights for policy 0, policy_version 68540 (0.0006) [2023-03-06 22:30:35,512][62475] Updated weights for policy 0, policy_version 68550 (0.0006) [2023-03-06 22:30:36,334][62475] Updated weights for policy 0, policy_version 68560 (0.0006) [2023-03-06 22:30:37,149][62475] Updated weights for policy 0, policy_version 68570 (0.0006) [2023-03-06 22:30:37,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12731.8, 300 sec: 12715.0). Total num frames: 70218752. Throughput: 0: 12729.1. Samples: 70213962. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:30:37,390][62145] Avg episode reward: [(0, '960.182')] [2023-03-06 22:30:37,938][62475] Updated weights for policy 0, policy_version 68580 (0.0006) [2023-03-06 22:30:38,747][62475] Updated weights for policy 0, policy_version 68590 (0.0006) [2023-03-06 22:30:39,556][62475] Updated weights for policy 0, policy_version 68600 (0.0006) [2023-03-06 22:30:40,350][62475] Updated weights for policy 0, policy_version 68610 (0.0006) [2023-03-06 22:30:41,161][62475] Updated weights for policy 0, policy_version 68620 (0.0007) [2023-03-06 22:30:41,963][62475] Updated weights for policy 0, policy_version 68630 (0.0006) [2023-03-06 22:30:42,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12731.7, 300 sec: 12711.5). Total num frames: 70282240. Throughput: 0: 12729.6. Samples: 70252266. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:30:42,390][62145] Avg episode reward: [(0, '848.121')] [2023-03-06 22:30:42,780][62475] Updated weights for policy 0, policy_version 68640 (0.0006) [2023-03-06 22:30:43,568][62475] Updated weights for policy 0, policy_version 68650 (0.0007) [2023-03-06 22:30:44,370][62475] Updated weights for policy 0, policy_version 68660 (0.0006) [2023-03-06 22:30:45,193][62475] Updated weights for policy 0, policy_version 68670 (0.0006) [2023-03-06 22:30:45,989][62475] Updated weights for policy 0, policy_version 68680 (0.0006) [2023-03-06 22:30:46,790][62475] Updated weights for policy 0, policy_version 68690 (0.0006) [2023-03-06 22:30:47,389][62145] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12711.5). Total num frames: 70345728. Throughput: 0: 12725.4. Samples: 70328659. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:30:47,390][62145] Avg episode reward: [(0, '766.881')] [2023-03-06 22:30:47,601][62475] Updated weights for policy 0, policy_version 68700 (0.0006) [2023-03-06 22:30:48,413][62475] Updated weights for policy 0, policy_version 68710 (0.0005) [2023-03-06 22:30:49,202][62475] Updated weights for policy 0, policy_version 68720 (0.0006) [2023-03-06 22:30:50,026][62475] Updated weights for policy 0, policy_version 68730 (0.0006) [2023-03-06 22:30:50,839][62475] Updated weights for policy 0, policy_version 68740 (0.0006) [2023-03-06 22:30:51,642][62475] Updated weights for policy 0, policy_version 68750 (0.0006) [2023-03-06 22:30:52,389][62145] Fps is (10 sec: 12697.8, 60 sec: 12731.7, 300 sec: 12711.5). Total num frames: 70409216. Throughput: 0: 12702.6. Samples: 70404578. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:30:52,390][62145] Avg episode reward: [(0, '866.269')] [2023-03-06 22:30:52,449][62475] Updated weights for policy 0, policy_version 68760 (0.0006) [2023-03-06 22:30:53,259][62475] Updated weights for policy 0, policy_version 68770 (0.0007) [2023-03-06 22:30:54,067][62475] Updated weights for policy 0, policy_version 68780 (0.0007) [2023-03-06 22:30:54,866][62475] Updated weights for policy 0, policy_version 68790 (0.0006) [2023-03-06 22:30:55,649][62475] Updated weights for policy 0, policy_version 68800 (0.0006) [2023-03-06 22:30:56,464][62475] Updated weights for policy 0, policy_version 68810 (0.0006) [2023-03-06 22:30:57,260][62475] Updated weights for policy 0, policy_version 68820 (0.0006) [2023-03-06 22:30:57,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.6, 300 sec: 12711.5). Total num frames: 70472704. Throughput: 0: 12710.0. Samples: 70442862. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 22:30:57,390][62145] Avg episode reward: [(0, '788.954')] [2023-03-06 22:30:58,074][62475] Updated weights for policy 0, policy_version 68830 (0.0007) [2023-03-06 22:30:58,879][62475] Updated weights for policy 0, policy_version 68840 (0.0006) [2023-03-06 22:30:59,681][62475] Updated weights for policy 0, policy_version 68850 (0.0007) [2023-03-06 22:31:00,465][62475] Updated weights for policy 0, policy_version 68860 (0.0006) [2023-03-06 22:31:01,287][62475] Updated weights for policy 0, policy_version 68870 (0.0006) [2023-03-06 22:31:02,084][62475] Updated weights for policy 0, policy_version 68880 (0.0006) [2023-03-06 22:31:02,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12708.0). Total num frames: 70536192. Throughput: 0: 12714.7. Samples: 70519379. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 22:31:02,390][62145] Avg episode reward: [(0, '642.227')] [2023-03-06 22:31:02,886][62475] Updated weights for policy 0, policy_version 68890 (0.0006) [2023-03-06 22:31:03,694][62475] Updated weights for policy 0, policy_version 68900 (0.0006) [2023-03-06 22:31:04,519][62475] Updated weights for policy 0, policy_version 68910 (0.0006) [2023-03-06 22:31:05,327][62475] Updated weights for policy 0, policy_version 68920 (0.0006) [2023-03-06 22:31:06,137][62475] Updated weights for policy 0, policy_version 68930 (0.0006) [2023-03-06 22:31:06,921][62475] Updated weights for policy 0, policy_version 68940 (0.0006) [2023-03-06 22:31:07,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12711.5). Total num frames: 70599680. Throughput: 0: 12723.4. Samples: 70595658. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 22:31:07,390][62145] Avg episode reward: [(0, '635.209')] [2023-03-06 22:31:07,719][62475] Updated weights for policy 0, policy_version 68950 (0.0007) [2023-03-06 22:31:08,525][62475] Updated weights for policy 0, policy_version 68960 (0.0007) [2023-03-06 22:31:09,335][62475] Updated weights for policy 0, policy_version 68970 (0.0006) [2023-03-06 22:31:10,158][62475] Updated weights for policy 0, policy_version 68980 (0.0006) [2023-03-06 22:31:10,945][62475] Updated weights for policy 0, policy_version 68990 (0.0006) [2023-03-06 22:31:11,754][62475] Updated weights for policy 0, policy_version 69000 (0.0006) [2023-03-06 22:31:12,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12714.7, 300 sec: 12715.0). Total num frames: 70664192. Throughput: 0: 12718.3. Samples: 70633769. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 22:31:12,390][62145] Avg episode reward: [(0, '596.826')] [2023-03-06 22:31:12,563][62475] Updated weights for policy 0, policy_version 69010 (0.0006) [2023-03-06 22:31:13,363][62475] Updated weights for policy 0, policy_version 69020 (0.0005) [2023-03-06 22:31:14,155][62475] Updated weights for policy 0, policy_version 69030 (0.0007) [2023-03-06 22:31:14,973][62475] Updated weights for policy 0, policy_version 69040 (0.0006) [2023-03-06 22:31:15,780][62475] Updated weights for policy 0, policy_version 69050 (0.0006) [2023-03-06 22:31:16,574][62475] Updated weights for policy 0, policy_version 69060 (0.0006) [2023-03-06 22:31:17,372][62475] Updated weights for policy 0, policy_version 69070 (0.0006) [2023-03-06 22:31:17,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12731.7, 300 sec: 12711.5). Total num frames: 70727680. Throughput: 0: 12716.6. Samples: 70710143. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 22:31:17,390][62145] Avg episode reward: [(0, '795.146')] [2023-03-06 22:31:18,166][62475] Updated weights for policy 0, policy_version 69080 (0.0007) [2023-03-06 22:31:18,967][62475] Updated weights for policy 0, policy_version 69090 (0.0007) [2023-03-06 22:31:19,762][62475] Updated weights for policy 0, policy_version 69100 (0.0007) [2023-03-06 22:31:20,574][62475] Updated weights for policy 0, policy_version 69110 (0.0006) [2023-03-06 22:31:21,386][62475] Updated weights for policy 0, policy_version 69120 (0.0006) [2023-03-06 22:31:22,186][62475] Updated weights for policy 0, policy_version 69130 (0.0006) [2023-03-06 22:31:22,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12711.5). Total num frames: 70791168. Throughput: 0: 12729.9. Samples: 70786809. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 22:31:22,390][62145] Avg episode reward: [(0, '696.453')] [2023-03-06 22:31:22,982][62475] Updated weights for policy 0, policy_version 69140 (0.0006) [2023-03-06 22:31:23,786][62475] Updated weights for policy 0, policy_version 69150 (0.0006) [2023-03-06 22:31:24,605][62475] Updated weights for policy 0, policy_version 69160 (0.0006) [2023-03-06 22:31:25,381][62475] Updated weights for policy 0, policy_version 69170 (0.0006) [2023-03-06 22:31:26,203][62475] Updated weights for policy 0, policy_version 69180 (0.0006) [2023-03-06 22:31:27,005][62475] Updated weights for policy 0, policy_version 69190 (0.0006) [2023-03-06 22:31:27,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12731.7, 300 sec: 12718.4). Total num frames: 70855680. Throughput: 0: 12731.9. Samples: 70825200. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 22:31:27,390][62145] Avg episode reward: [(0, '683.900')] [2023-03-06 22:31:27,801][62475] Updated weights for policy 0, policy_version 69200 (0.0006) [2023-03-06 22:31:28,613][62475] Updated weights for policy 0, policy_version 69210 (0.0006) [2023-03-06 22:31:29,423][62475] Updated weights for policy 0, policy_version 69220 (0.0007) [2023-03-06 22:31:30,237][62475] Updated weights for policy 0, policy_version 69230 (0.0007) [2023-03-06 22:31:31,027][62475] Updated weights for policy 0, policy_version 69240 (0.0007) [2023-03-06 22:31:31,810][62475] Updated weights for policy 0, policy_version 69250 (0.0006) [2023-03-06 22:31:32,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12715.0). Total num frames: 70919168. Throughput: 0: 12730.1. Samples: 70901512. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 22:31:32,390][62145] Avg episode reward: [(0, '657.260')] [2023-03-06 22:31:32,613][62475] Updated weights for policy 0, policy_version 69260 (0.0006) [2023-03-06 22:31:33,435][62475] Updated weights for policy 0, policy_version 69270 (0.0005) [2023-03-06 22:31:34,209][62475] Updated weights for policy 0, policy_version 69280 (0.0006) [2023-03-06 22:31:35,034][62475] Updated weights for policy 0, policy_version 69290 (0.0006) [2023-03-06 22:31:35,854][62475] Updated weights for policy 0, policy_version 69300 (0.0006) [2023-03-06 22:31:36,643][62475] Updated weights for policy 0, policy_version 69310 (0.0006) [2023-03-06 22:31:37,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12715.0). Total num frames: 70982656. Throughput: 0: 12745.6. Samples: 70978129. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 22:31:37,390][62145] Avg episode reward: [(0, '708.948')] [2023-03-06 22:31:37,449][62475] Updated weights for policy 0, policy_version 69320 (0.0006) [2023-03-06 22:31:38,259][62475] Updated weights for policy 0, policy_version 69330 (0.0007) [2023-03-06 22:31:39,064][62475] Updated weights for policy 0, policy_version 69340 (0.0006) [2023-03-06 22:31:39,856][62475] Updated weights for policy 0, policy_version 69350 (0.0006) [2023-03-06 22:31:40,667][62475] Updated weights for policy 0, policy_version 69360 (0.0006) [2023-03-06 22:31:41,458][62475] Updated weights for policy 0, policy_version 69370 (0.0006) [2023-03-06 22:31:42,270][62475] Updated weights for policy 0, policy_version 69380 (0.0006) [2023-03-06 22:31:42,390][62145] Fps is (10 sec: 12697.4, 60 sec: 12731.7, 300 sec: 12715.0). Total num frames: 71046144. Throughput: 0: 12741.7. Samples: 71016241. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:31:42,390][62145] Avg episode reward: [(0, '631.101')] [2023-03-06 22:31:43,073][62475] Updated weights for policy 0, policy_version 69390 (0.0006) [2023-03-06 22:31:43,899][62475] Updated weights for policy 0, policy_version 69400 (0.0006) [2023-03-06 22:31:44,693][62475] Updated weights for policy 0, policy_version 69410 (0.0007) [2023-03-06 22:31:45,503][62475] Updated weights for policy 0, policy_version 69420 (0.0006) [2023-03-06 22:31:46,310][62475] Updated weights for policy 0, policy_version 69430 (0.0006) [2023-03-06 22:31:47,097][62475] Updated weights for policy 0, policy_version 69440 (0.0006) [2023-03-06 22:31:47,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12715.0). Total num frames: 71109632. Throughput: 0: 12735.0. Samples: 71092452. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:31:47,390][62145] Avg episode reward: [(0, '491.777')] [2023-03-06 22:31:47,904][62475] Updated weights for policy 0, policy_version 69450 (0.0006) [2023-03-06 22:31:48,716][62475] Updated weights for policy 0, policy_version 69460 (0.0006) [2023-03-06 22:31:49,520][62475] Updated weights for policy 0, policy_version 69470 (0.0007) [2023-03-06 22:31:50,321][62475] Updated weights for policy 0, policy_version 69480 (0.0007) [2023-03-06 22:31:51,140][62475] Updated weights for policy 0, policy_version 69490 (0.0006) [2023-03-06 22:31:51,926][62475] Updated weights for policy 0, policy_version 69500 (0.0006) [2023-03-06 22:31:52,389][62145] Fps is (10 sec: 12697.8, 60 sec: 12731.7, 300 sec: 12718.4). Total num frames: 71173120. Throughput: 0: 12744.7. Samples: 71169169. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:31:52,390][62145] Avg episode reward: [(0, '558.560')] [2023-03-06 22:31:52,711][62475] Updated weights for policy 0, policy_version 69510 (0.0006) [2023-03-06 22:31:53,529][62475] Updated weights for policy 0, policy_version 69520 (0.0007) [2023-03-06 22:31:54,314][62475] Updated weights for policy 0, policy_version 69530 (0.0007) [2023-03-06 22:31:55,126][62475] Updated weights for policy 0, policy_version 69540 (0.0006) [2023-03-06 22:31:55,943][62475] Updated weights for policy 0, policy_version 69550 (0.0006) [2023-03-06 22:31:56,768][62475] Updated weights for policy 0, policy_version 69560 (0.0007) [2023-03-06 22:31:57,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12715.0). Total num frames: 71236608. Throughput: 0: 12750.7. Samples: 71207552. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:31:57,390][62145] Avg episode reward: [(0, '530.592')] [2023-03-06 22:31:57,566][62475] Updated weights for policy 0, policy_version 69570 (0.0007) [2023-03-06 22:31:58,357][62475] Updated weights for policy 0, policy_version 69580 (0.0006) [2023-03-06 22:31:59,161][62475] Updated weights for policy 0, policy_version 69590 (0.0006) [2023-03-06 22:31:59,951][62475] Updated weights for policy 0, policy_version 69600 (0.0006) [2023-03-06 22:32:00,763][62475] Updated weights for policy 0, policy_version 69610 (0.0007) [2023-03-06 22:32:01,554][62475] Updated weights for policy 0, policy_version 69620 (0.0006) [2023-03-06 22:32:02,356][62475] Updated weights for policy 0, policy_version 69630 (0.0006) [2023-03-06 22:32:02,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12748.8, 300 sec: 12718.4). Total num frames: 71301120. Throughput: 0: 12748.3. Samples: 71283817. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:32:02,390][62145] Avg episode reward: [(0, '587.019')] [2023-03-06 22:32:03,164][62475] Updated weights for policy 0, policy_version 69640 (0.0006) [2023-03-06 22:32:03,971][62475] Updated weights for policy 0, policy_version 69650 (0.0006) [2023-03-06 22:32:04,762][62475] Updated weights for policy 0, policy_version 69660 (0.0007) [2023-03-06 22:32:05,569][62475] Updated weights for policy 0, policy_version 69670 (0.0006) [2023-03-06 22:32:06,374][62475] Updated weights for policy 0, policy_version 69680 (0.0006) [2023-03-06 22:32:07,153][62475] Updated weights for policy 0, policy_version 69690 (0.0007) [2023-03-06 22:32:07,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12718.4). Total num frames: 71364608. Throughput: 0: 12750.2. Samples: 71360566. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:32:07,390][62145] Avg episode reward: [(0, '865.171')] [2023-03-06 22:32:07,959][62475] Updated weights for policy 0, policy_version 69700 (0.0006) [2023-03-06 22:32:08,780][62475] Updated weights for policy 0, policy_version 69710 (0.0008) [2023-03-06 22:32:09,561][62475] Updated weights for policy 0, policy_version 69720 (0.0007) [2023-03-06 22:32:10,372][62475] Updated weights for policy 0, policy_version 69730 (0.0006) [2023-03-06 22:32:11,189][62475] Updated weights for policy 0, policy_version 69740 (0.0006) [2023-03-06 22:32:11,990][62475] Updated weights for policy 0, policy_version 69750 (0.0006) [2023-03-06 22:32:12,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12748.8, 300 sec: 12725.4). Total num frames: 71429120. Throughput: 0: 12751.2. Samples: 71399004. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:32:12,390][62145] Avg episode reward: [(0, '701.815')] [2023-03-06 22:32:12,781][62475] Updated weights for policy 0, policy_version 69760 (0.0006) [2023-03-06 22:32:13,585][62475] Updated weights for policy 0, policy_version 69770 (0.0006) [2023-03-06 22:32:14,379][62475] Updated weights for policy 0, policy_version 69780 (0.0006) [2023-03-06 22:32:15,175][62475] Updated weights for policy 0, policy_version 69790 (0.0006) [2023-03-06 22:32:15,998][62475] Updated weights for policy 0, policy_version 69800 (0.0007) [2023-03-06 22:32:16,797][62475] Updated weights for policy 0, policy_version 69810 (0.0006) [2023-03-06 22:32:17,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12725.4). Total num frames: 71492608. Throughput: 0: 12753.0. Samples: 71475395. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:32:17,390][62145] Avg episode reward: [(0, '839.023')] [2023-03-06 22:32:17,595][62475] Updated weights for policy 0, policy_version 69820 (0.0007) [2023-03-06 22:32:18,376][62475] Updated weights for policy 0, policy_version 69830 (0.0007) [2023-03-06 22:32:19,178][62475] Updated weights for policy 0, policy_version 69840 (0.0006) [2023-03-06 22:32:19,989][62475] Updated weights for policy 0, policy_version 69850 (0.0006) [2023-03-06 22:32:20,791][62475] Updated weights for policy 0, policy_version 69860 (0.0007) [2023-03-06 22:32:21,601][62475] Updated weights for policy 0, policy_version 69870 (0.0007) [2023-03-06 22:32:22,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12765.9, 300 sec: 12728.8). Total num frames: 71557120. Throughput: 0: 12760.3. Samples: 71552342. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:32:22,390][62475] Updated weights for policy 0, policy_version 69880 (0.0007) [2023-03-06 22:32:22,390][62145] Avg episode reward: [(0, '683.013')] [2023-03-06 22:32:22,393][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000069880_71557120.pth... [2023-03-06 22:32:22,424][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000066896_68501504.pth [2023-03-06 22:32:23,196][62475] Updated weights for policy 0, policy_version 69890 (0.0007) [2023-03-06 22:32:23,993][62475] Updated weights for policy 0, policy_version 69900 (0.0006) [2023-03-06 22:32:24,825][62475] Updated weights for policy 0, policy_version 69910 (0.0006) [2023-03-06 22:32:25,619][62475] Updated weights for policy 0, policy_version 69920 (0.0006) [2023-03-06 22:32:26,426][62475] Updated weights for policy 0, policy_version 69930 (0.0007) [2023-03-06 22:32:27,230][62475] Updated weights for policy 0, policy_version 69940 (0.0006) [2023-03-06 22:32:27,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 71619584. Throughput: 0: 12757.9. Samples: 71590346. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:32:27,390][62145] Avg episode reward: [(0, '767.944')] [2023-03-06 22:32:28,036][62475] Updated weights for policy 0, policy_version 69950 (0.0006) [2023-03-06 22:32:28,817][62475] Updated weights for policy 0, policy_version 69960 (0.0006) [2023-03-06 22:32:29,622][62475] Updated weights for policy 0, policy_version 69970 (0.0006) [2023-03-06 22:32:30,439][62475] Updated weights for policy 0, policy_version 69980 (0.0006) [2023-03-06 22:32:31,240][62475] Updated weights for policy 0, policy_version 69990 (0.0006) [2023-03-06 22:32:32,061][62475] Updated weights for policy 0, policy_version 70000 (0.0006) [2023-03-06 22:32:32,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12748.8, 300 sec: 12728.8). Total num frames: 71684096. Throughput: 0: 12760.3. Samples: 71666664. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:32:32,390][62145] Avg episode reward: [(0, '615.252')] [2023-03-06 22:32:32,877][62475] Updated weights for policy 0, policy_version 70010 (0.0007) [2023-03-06 22:32:33,677][62475] Updated weights for policy 0, policy_version 70020 (0.0006) [2023-03-06 22:32:34,474][62475] Updated weights for policy 0, policy_version 70030 (0.0006) [2023-03-06 22:32:35,260][62475] Updated weights for policy 0, policy_version 70040 (0.0006) [2023-03-06 22:32:36,071][62475] Updated weights for policy 0, policy_version 70050 (0.0006) [2023-03-06 22:32:36,890][62475] Updated weights for policy 0, policy_version 70060 (0.0006) [2023-03-06 22:32:37,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12728.8). Total num frames: 71747584. Throughput: 0: 12751.8. Samples: 71743001. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:32:37,390][62145] Avg episode reward: [(0, '756.209')] [2023-03-06 22:32:37,686][62475] Updated weights for policy 0, policy_version 70070 (0.0006) [2023-03-06 22:32:38,485][62475] Updated weights for policy 0, policy_version 70080 (0.0006) [2023-03-06 22:32:39,287][62475] Updated weights for policy 0, policy_version 70090 (0.0006) [2023-03-06 22:32:40,114][62475] Updated weights for policy 0, policy_version 70100 (0.0006) [2023-03-06 22:32:40,920][62475] Updated weights for policy 0, policy_version 70110 (0.0007) [2023-03-06 22:32:41,730][62475] Updated weights for policy 0, policy_version 70120 (0.0007) [2023-03-06 22:32:42,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12748.8, 300 sec: 12728.8). Total num frames: 71811072. Throughput: 0: 12745.5. Samples: 71781102. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:32:42,390][62145] Avg episode reward: [(0, '713.847')] [2023-03-06 22:32:42,535][62475] Updated weights for policy 0, policy_version 70130 (0.0006) [2023-03-06 22:32:43,355][62475] Updated weights for policy 0, policy_version 70140 (0.0006) [2023-03-06 22:32:44,168][62475] Updated weights for policy 0, policy_version 70150 (0.0006) [2023-03-06 22:32:44,967][62475] Updated weights for policy 0, policy_version 70160 (0.0006) [2023-03-06 22:32:45,769][62475] Updated weights for policy 0, policy_version 70170 (0.0006) [2023-03-06 22:32:46,564][62475] Updated weights for policy 0, policy_version 70180 (0.0006) [2023-03-06 22:32:47,366][62475] Updated weights for policy 0, policy_version 70190 (0.0006) [2023-03-06 22:32:47,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12748.8, 300 sec: 12728.8). Total num frames: 71874560. Throughput: 0: 12743.4. Samples: 71857269. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:32:47,390][62145] Avg episode reward: [(0, '787.057')] [2023-03-06 22:32:48,173][62475] Updated weights for policy 0, policy_version 70200 (0.0007) [2023-03-06 22:32:48,975][62475] Updated weights for policy 0, policy_version 70210 (0.0006) [2023-03-06 22:32:49,763][62475] Updated weights for policy 0, policy_version 70220 (0.0006) [2023-03-06 22:32:50,566][62475] Updated weights for policy 0, policy_version 70230 (0.0005) [2023-03-06 22:32:51,367][62475] Updated weights for policy 0, policy_version 70240 (0.0006) [2023-03-06 22:32:52,184][62475] Updated weights for policy 0, policy_version 70250 (0.0006) [2023-03-06 22:32:52,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12748.8, 300 sec: 12728.8). Total num frames: 71938048. Throughput: 0: 12741.9. Samples: 71933952. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:32:52,390][62145] Avg episode reward: [(0, '955.405')] [2023-03-06 22:32:52,993][62475] Updated weights for policy 0, policy_version 70260 (0.0006) [2023-03-06 22:32:53,787][62475] Updated weights for policy 0, policy_version 70270 (0.0006) [2023-03-06 22:32:54,594][62475] Updated weights for policy 0, policy_version 70280 (0.0006) [2023-03-06 22:32:55,390][62475] Updated weights for policy 0, policy_version 70290 (0.0006) [2023-03-06 22:32:55,877][62424] KL-divergence is very high: 165.7899 [2023-03-06 22:32:56,204][62475] Updated weights for policy 0, policy_version 70300 (0.0006) [2023-03-06 22:32:56,997][62475] Updated weights for policy 0, policy_version 70310 (0.0006) [2023-03-06 22:32:57,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12748.8, 300 sec: 12728.8). Total num frames: 72001536. Throughput: 0: 12732.3. Samples: 71971959. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:32:57,390][62145] Avg episode reward: [(0, '831.277')] [2023-03-06 22:32:57,824][62475] Updated weights for policy 0, policy_version 70320 (0.0010) [2023-03-06 22:32:58,621][62475] Updated weights for policy 0, policy_version 70330 (0.0006) [2023-03-06 22:32:59,431][62475] Updated weights for policy 0, policy_version 70340 (0.0007) [2023-03-06 22:33:00,225][62475] Updated weights for policy 0, policy_version 70350 (0.0006) [2023-03-06 22:33:01,051][62475] Updated weights for policy 0, policy_version 70360 (0.0006) [2023-03-06 22:33:01,852][62475] Updated weights for policy 0, policy_version 70370 (0.0007) [2023-03-06 22:33:02,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.8, 300 sec: 12725.4). Total num frames: 72065024. Throughput: 0: 12728.6. Samples: 72048181. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:33:02,390][62145] Avg episode reward: [(0, '720.497')] [2023-03-06 22:33:02,650][62475] Updated weights for policy 0, policy_version 70380 (0.0006) [2023-03-06 22:33:03,464][62475] Updated weights for policy 0, policy_version 70390 (0.0006) [2023-03-06 22:33:04,276][62475] Updated weights for policy 0, policy_version 70400 (0.0007) [2023-03-06 22:33:05,083][62475] Updated weights for policy 0, policy_version 70410 (0.0006) [2023-03-06 22:33:05,877][62475] Updated weights for policy 0, policy_version 70420 (0.0006) [2023-03-06 22:33:06,678][62475] Updated weights for policy 0, policy_version 70430 (0.0007) [2023-03-06 22:33:07,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12748.8, 300 sec: 12728.8). Total num frames: 72129536. Throughput: 0: 12715.3. Samples: 72124530. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:33:07,390][62145] Avg episode reward: [(0, '649.723')] [2023-03-06 22:33:07,489][62475] Updated weights for policy 0, policy_version 70440 (0.0007) [2023-03-06 22:33:08,283][62475] Updated weights for policy 0, policy_version 70450 (0.0006) [2023-03-06 22:33:09,101][62475] Updated weights for policy 0, policy_version 70460 (0.0006) [2023-03-06 22:33:09,897][62475] Updated weights for policy 0, policy_version 70470 (0.0006) [2023-03-06 22:33:10,724][62475] Updated weights for policy 0, policy_version 70480 (0.0007) [2023-03-06 22:33:11,518][62475] Updated weights for policy 0, policy_version 70490 (0.0006) [2023-03-06 22:33:12,314][62475] Updated weights for policy 0, policy_version 70500 (0.0006) [2023-03-06 22:33:12,389][62145] Fps is (10 sec: 12799.9, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 72193024. Throughput: 0: 12718.8. Samples: 72162691. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:33:12,390][62145] Avg episode reward: [(0, '681.623')] [2023-03-06 22:33:13,147][62475] Updated weights for policy 0, policy_version 70510 (0.0007) [2023-03-06 22:33:13,943][62475] Updated weights for policy 0, policy_version 70520 (0.0006) [2023-03-06 22:33:14,736][62475] Updated weights for policy 0, policy_version 70530 (0.0006) [2023-03-06 22:33:15,561][62475] Updated weights for policy 0, policy_version 70540 (0.0005) [2023-03-06 22:33:16,358][62475] Updated weights for policy 0, policy_version 70550 (0.0006) [2023-03-06 22:33:17,148][62475] Updated weights for policy 0, policy_version 70560 (0.0007) [2023-03-06 22:33:17,389][62145] Fps is (10 sec: 12595.2, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 72255488. Throughput: 0: 12714.5. Samples: 72238818. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:33:17,390][62145] Avg episode reward: [(0, '650.415')] [2023-03-06 22:33:17,953][62475] Updated weights for policy 0, policy_version 70570 (0.0006) [2023-03-06 22:33:18,767][62475] Updated weights for policy 0, policy_version 70580 (0.0006) [2023-03-06 22:33:19,556][62475] Updated weights for policy 0, policy_version 70590 (0.0006) [2023-03-06 22:33:20,358][62475] Updated weights for policy 0, policy_version 70600 (0.0007) [2023-03-06 22:33:21,162][62475] Updated weights for policy 0, policy_version 70610 (0.0006) [2023-03-06 22:33:22,000][62475] Updated weights for policy 0, policy_version 70620 (0.0006) [2023-03-06 22:33:22,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12728.8). Total num frames: 72320000. Throughput: 0: 12714.1. Samples: 72315134. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:33:22,400][62145] Avg episode reward: [(0, '693.873')] [2023-03-06 22:33:22,801][62475] Updated weights for policy 0, policy_version 70630 (0.0006) [2023-03-06 22:33:23,615][62475] Updated weights for policy 0, policy_version 70640 (0.0006) [2023-03-06 22:33:24,418][62475] Updated weights for policy 0, policy_version 70650 (0.0007) [2023-03-06 22:33:25,242][62475] Updated weights for policy 0, policy_version 70660 (0.0006) [2023-03-06 22:33:26,021][62475] Updated weights for policy 0, policy_version 70670 (0.0007) [2023-03-06 22:33:26,824][62475] Updated weights for policy 0, policy_version 70680 (0.0006) [2023-03-06 22:33:27,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 72383488. Throughput: 0: 12712.5. Samples: 72353162. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:33:27,390][62145] Avg episode reward: [(0, '697.074')] [2023-03-06 22:33:27,620][62475] Updated weights for policy 0, policy_version 70690 (0.0007) [2023-03-06 22:33:28,428][62475] Updated weights for policy 0, policy_version 70700 (0.0006) [2023-03-06 22:33:29,257][62475] Updated weights for policy 0, policy_version 70710 (0.0006) [2023-03-06 22:33:30,065][62475] Updated weights for policy 0, policy_version 70720 (0.0007) [2023-03-06 22:33:30,868][62475] Updated weights for policy 0, policy_version 70730 (0.0007) [2023-03-06 22:33:31,667][62475] Updated weights for policy 0, policy_version 70740 (0.0006) [2023-03-06 22:33:32,390][62145] Fps is (10 sec: 12697.4, 60 sec: 12714.6, 300 sec: 12728.8). Total num frames: 72446976. Throughput: 0: 12716.0. Samples: 72429492. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:33:32,390][62145] Avg episode reward: [(0, '761.800')] [2023-03-06 22:33:32,441][62475] Updated weights for policy 0, policy_version 70750 (0.0006) [2023-03-06 22:33:33,269][62475] Updated weights for policy 0, policy_version 70760 (0.0007) [2023-03-06 22:33:34,078][62475] Updated weights for policy 0, policy_version 70770 (0.0006) [2023-03-06 22:33:34,877][62475] Updated weights for policy 0, policy_version 70780 (0.0006) [2023-03-06 22:33:35,695][62475] Updated weights for policy 0, policy_version 70790 (0.0006) [2023-03-06 22:33:36,496][62475] Updated weights for policy 0, policy_version 70800 (0.0006) [2023-03-06 22:33:37,297][62475] Updated weights for policy 0, policy_version 70810 (0.0006) [2023-03-06 22:33:37,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12728.8). Total num frames: 72510464. Throughput: 0: 12707.6. Samples: 72505793. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:33:37,390][62145] Avg episode reward: [(0, '723.822')] [2023-03-06 22:33:38,090][62475] Updated weights for policy 0, policy_version 70820 (0.0006) [2023-03-06 22:33:38,889][62475] Updated weights for policy 0, policy_version 70830 (0.0006) [2023-03-06 22:33:39,701][62475] Updated weights for policy 0, policy_version 70840 (0.0006) [2023-03-06 22:33:40,517][62475] Updated weights for policy 0, policy_version 70850 (0.0006) [2023-03-06 22:33:41,327][62475] Updated weights for policy 0, policy_version 70860 (0.0006) [2023-03-06 22:33:42,133][62475] Updated weights for policy 0, policy_version 70870 (0.0007) [2023-03-06 22:33:42,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 72573952. Throughput: 0: 12714.3. Samples: 72544102. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:33:42,390][62145] Avg episode reward: [(0, '774.340')] [2023-03-06 22:33:42,922][62475] Updated weights for policy 0, policy_version 70880 (0.0007) [2023-03-06 22:33:43,753][62475] Updated weights for policy 0, policy_version 70890 (0.0006) [2023-03-06 22:33:44,547][62475] Updated weights for policy 0, policy_version 70900 (0.0007) [2023-03-06 22:33:45,358][62475] Updated weights for policy 0, policy_version 70910 (0.0006) [2023-03-06 22:33:46,161][62475] Updated weights for policy 0, policy_version 70920 (0.0006) [2023-03-06 22:33:46,961][62475] Updated weights for policy 0, policy_version 70930 (0.0006) [2023-03-06 22:33:47,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 72637440. Throughput: 0: 12711.9. Samples: 72620218. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:33:47,390][62145] Avg episode reward: [(0, '564.052')] [2023-03-06 22:33:47,766][62475] Updated weights for policy 0, policy_version 70940 (0.0006) [2023-03-06 22:33:48,560][62475] Updated weights for policy 0, policy_version 70950 (0.0007) [2023-03-06 22:33:49,362][62475] Updated weights for policy 0, policy_version 70960 (0.0007) [2023-03-06 22:33:50,158][62475] Updated weights for policy 0, policy_version 70970 (0.0006) [2023-03-06 22:33:50,968][62475] Updated weights for policy 0, policy_version 70980 (0.0007) [2023-03-06 22:33:51,754][62475] Updated weights for policy 0, policy_version 70990 (0.0006) [2023-03-06 22:33:52,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 72700928. Throughput: 0: 12717.4. Samples: 72696815. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:33:52,390][62145] Avg episode reward: [(0, '737.373')] [2023-03-06 22:33:52,566][62475] Updated weights for policy 0, policy_version 71000 (0.0007) [2023-03-06 22:33:53,361][62475] Updated weights for policy 0, policy_version 71010 (0.0006) [2023-03-06 22:33:54,191][62475] Updated weights for policy 0, policy_version 71020 (0.0006) [2023-03-06 22:33:54,986][62475] Updated weights for policy 0, policy_version 71030 (0.0007) [2023-03-06 22:33:55,797][62475] Updated weights for policy 0, policy_version 71040 (0.0006) [2023-03-06 22:33:56,594][62475] Updated weights for policy 0, policy_version 71050 (0.0006) [2023-03-06 22:33:57,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 72764416. Throughput: 0: 12717.3. Samples: 72734968. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:33:57,390][62145] Avg episode reward: [(0, '681.522')] [2023-03-06 22:33:57,397][62475] Updated weights for policy 0, policy_version 71060 (0.0006) [2023-03-06 22:33:58,196][62475] Updated weights for policy 0, policy_version 71070 (0.0007) [2023-03-06 22:33:59,026][62475] Updated weights for policy 0, policy_version 71080 (0.0006) [2023-03-06 22:33:59,823][62475] Updated weights for policy 0, policy_version 71090 (0.0006) [2023-03-06 22:34:00,619][62475] Updated weights for policy 0, policy_version 71100 (0.0006) [2023-03-06 22:34:01,442][62475] Updated weights for policy 0, policy_version 71110 (0.0007) [2023-03-06 22:34:02,219][62475] Updated weights for policy 0, policy_version 71120 (0.0006) [2023-03-06 22:34:02,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 72828928. Throughput: 0: 12721.7. Samples: 72811293. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:34:02,390][62145] Avg episode reward: [(0, '778.884')] [2023-03-06 22:34:03,029][62475] Updated weights for policy 0, policy_version 71130 (0.0005) [2023-03-06 22:34:03,825][62475] Updated weights for policy 0, policy_version 71140 (0.0006) [2023-03-06 22:34:04,653][62475] Updated weights for policy 0, policy_version 71150 (0.0006) [2023-03-06 22:34:05,447][62475] Updated weights for policy 0, policy_version 71160 (0.0006) [2023-03-06 22:34:06,253][62475] Updated weights for policy 0, policy_version 71170 (0.0006) [2023-03-06 22:34:07,047][62475] Updated weights for policy 0, policy_version 71180 (0.0006) [2023-03-06 22:34:07,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12714.7, 300 sec: 12728.8). Total num frames: 72892416. Throughput: 0: 12721.0. Samples: 72887579. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:34:07,390][62145] Avg episode reward: [(0, '767.154')] [2023-03-06 22:34:07,875][62475] Updated weights for policy 0, policy_version 71190 (0.0007) [2023-03-06 22:34:08,686][62475] Updated weights for policy 0, policy_version 71200 (0.0006) [2023-03-06 22:34:09,481][62475] Updated weights for policy 0, policy_version 71210 (0.0006) [2023-03-06 22:34:10,311][62475] Updated weights for policy 0, policy_version 71220 (0.0007) [2023-03-06 22:34:11,114][62475] Updated weights for policy 0, policy_version 71230 (0.0006) [2023-03-06 22:34:11,916][62475] Updated weights for policy 0, policy_version 71240 (0.0006) [2023-03-06 22:34:12,389][62145] Fps is (10 sec: 12595.2, 60 sec: 12697.6, 300 sec: 12725.4). Total num frames: 72954880. Throughput: 0: 12719.8. Samples: 72925553. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:34:12,390][62145] Avg episode reward: [(0, '769.930')] [2023-03-06 22:34:12,724][62475] Updated weights for policy 0, policy_version 71250 (0.0007) [2023-03-06 22:34:13,526][62475] Updated weights for policy 0, policy_version 71260 (0.0006) [2023-03-06 22:34:14,314][62475] Updated weights for policy 0, policy_version 71270 (0.0005) [2023-03-06 22:34:15,140][62475] Updated weights for policy 0, policy_version 71280 (0.0007) [2023-03-06 22:34:15,928][62475] Updated weights for policy 0, policy_version 71290 (0.0007) [2023-03-06 22:34:16,744][62475] Updated weights for policy 0, policy_version 71300 (0.0006) [2023-03-06 22:34:17,390][62145] Fps is (10 sec: 12595.1, 60 sec: 12714.6, 300 sec: 12725.4). Total num frames: 73018368. Throughput: 0: 12719.0. Samples: 73001849. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:34:17,390][62145] Avg episode reward: [(0, '692.678')] [2023-03-06 22:34:17,554][62475] Updated weights for policy 0, policy_version 71310 (0.0006) [2023-03-06 22:34:18,371][62475] Updated weights for policy 0, policy_version 71320 (0.0006) [2023-03-06 22:34:19,134][62475] Updated weights for policy 0, policy_version 71330 (0.0006) [2023-03-06 22:34:19,960][62475] Updated weights for policy 0, policy_version 71340 (0.0007) [2023-03-06 22:34:20,763][62475] Updated weights for policy 0, policy_version 71350 (0.0006) [2023-03-06 22:34:21,550][62475] Updated weights for policy 0, policy_version 71360 (0.0006) [2023-03-06 22:34:22,342][62475] Updated weights for policy 0, policy_version 71370 (0.0007) [2023-03-06 22:34:22,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12714.6, 300 sec: 12728.8). Total num frames: 73082880. Throughput: 0: 12728.7. Samples: 73078585. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:34:22,390][62145] Avg episode reward: [(0, '852.854')] [2023-03-06 22:34:22,394][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000071370_73082880.pth... [2023-03-06 22:34:22,426][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000068386_70027264.pth [2023-03-06 22:34:23,147][62475] Updated weights for policy 0, policy_version 71380 (0.0006) [2023-03-06 22:34:23,930][62475] Updated weights for policy 0, policy_version 71390 (0.0006) [2023-03-06 22:34:24,742][62475] Updated weights for policy 0, policy_version 71400 (0.0008) [2023-03-06 22:34:25,573][62475] Updated weights for policy 0, policy_version 71410 (0.0007) [2023-03-06 22:34:26,366][62475] Updated weights for policy 0, policy_version 71420 (0.0006) [2023-03-06 22:34:27,181][62475] Updated weights for policy 0, policy_version 71430 (0.0006) [2023-03-06 22:34:27,389][62145] Fps is (10 sec: 12800.2, 60 sec: 12714.7, 300 sec: 12728.8). Total num frames: 73146368. Throughput: 0: 12729.3. Samples: 73116921. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:34:27,390][62145] Avg episode reward: [(0, '852.102')] [2023-03-06 22:34:27,983][62475] Updated weights for policy 0, policy_version 71440 (0.0006) [2023-03-06 22:34:28,783][62475] Updated weights for policy 0, policy_version 71450 (0.0007) [2023-03-06 22:34:29,580][62475] Updated weights for policy 0, policy_version 71460 (0.0006) [2023-03-06 22:34:30,398][62475] Updated weights for policy 0, policy_version 71470 (0.0006) [2023-03-06 22:34:31,206][62475] Updated weights for policy 0, policy_version 71480 (0.0006) [2023-03-06 22:34:32,036][62475] Updated weights for policy 0, policy_version 71490 (0.0006) [2023-03-06 22:34:32,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12728.8). Total num frames: 73209856. Throughput: 0: 12730.1. Samples: 73193072. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:34:32,390][62145] Avg episode reward: [(0, '857.906')] [2023-03-06 22:34:32,814][62475] Updated weights for policy 0, policy_version 71500 (0.0006) [2023-03-06 22:34:33,610][62475] Updated weights for policy 0, policy_version 71510 (0.0007) [2023-03-06 22:34:34,423][62475] Updated weights for policy 0, policy_version 71520 (0.0006) [2023-03-06 22:34:35,200][62475] Updated weights for policy 0, policy_version 71530 (0.0006) [2023-03-06 22:34:36,023][62475] Updated weights for policy 0, policy_version 71540 (0.0007) [2023-03-06 22:34:36,814][62475] Updated weights for policy 0, policy_version 71550 (0.0007) [2023-03-06 22:34:37,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12728.8). Total num frames: 73273344. Throughput: 0: 12728.8. Samples: 73269612. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:34:37,390][62145] Avg episode reward: [(0, '1011.936')] [2023-03-06 22:34:37,631][62475] Updated weights for policy 0, policy_version 71560 (0.0006) [2023-03-06 22:34:38,417][62475] Updated weights for policy 0, policy_version 71570 (0.0007) [2023-03-06 22:34:39,212][62475] Updated weights for policy 0, policy_version 71580 (0.0007) [2023-03-06 22:34:40,026][62475] Updated weights for policy 0, policy_version 71590 (0.0006) [2023-03-06 22:34:40,832][62475] Updated weights for policy 0, policy_version 71600 (0.0006) [2023-03-06 22:34:41,640][62475] Updated weights for policy 0, policy_version 71610 (0.0006) [2023-03-06 22:34:42,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12732.3). Total num frames: 73337856. Throughput: 0: 12734.4. Samples: 73308017. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:34:42,390][62145] Avg episode reward: [(0, '691.125')] [2023-03-06 22:34:42,470][62475] Updated weights for policy 0, policy_version 71620 (0.0007) [2023-03-06 22:34:43,260][62475] Updated weights for policy 0, policy_version 71630 (0.0007) [2023-03-06 22:34:44,058][62475] Updated weights for policy 0, policy_version 71640 (0.0006) [2023-03-06 22:34:44,870][62475] Updated weights for policy 0, policy_version 71650 (0.0006) [2023-03-06 22:34:45,664][62475] Updated weights for policy 0, policy_version 71660 (0.0007) [2023-03-06 22:34:46,457][62475] Updated weights for policy 0, policy_version 71670 (0.0006) [2023-03-06 22:34:47,275][62475] Updated weights for policy 0, policy_version 71680 (0.0006) [2023-03-06 22:34:47,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12731.7, 300 sec: 12732.3). Total num frames: 73401344. Throughput: 0: 12731.2. Samples: 73384196. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:34:47,390][62145] Avg episode reward: [(0, '632.865')] [2023-03-06 22:34:48,061][62475] Updated weights for policy 0, policy_version 71690 (0.0006) [2023-03-06 22:34:48,861][62475] Updated weights for policy 0, policy_version 71700 (0.0006) [2023-03-06 22:34:49,642][62475] Updated weights for policy 0, policy_version 71710 (0.0006) [2023-03-06 22:34:50,474][62475] Updated weights for policy 0, policy_version 71720 (0.0006) [2023-03-06 22:34:51,280][62475] Updated weights for policy 0, policy_version 71730 (0.0006) [2023-03-06 22:34:52,069][62475] Updated weights for policy 0, policy_version 71740 (0.0006) [2023-03-06 22:34:52,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 73464832. Throughput: 0: 12739.7. Samples: 73460865. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:34:52,390][62145] Avg episode reward: [(0, '740.108')] [2023-03-06 22:34:52,889][62475] Updated weights for policy 0, policy_version 71750 (0.0006) [2023-03-06 22:34:53,684][62475] Updated weights for policy 0, policy_version 71760 (0.0007) [2023-03-06 22:34:54,495][62475] Updated weights for policy 0, policy_version 71770 (0.0006) [2023-03-06 22:34:55,306][62475] Updated weights for policy 0, policy_version 71780 (0.0005) [2023-03-06 22:34:56,106][62475] Updated weights for policy 0, policy_version 71790 (0.0007) [2023-03-06 22:34:56,913][62475] Updated weights for policy 0, policy_version 71800 (0.0006) [2023-03-06 22:34:57,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 73528320. Throughput: 0: 12742.3. Samples: 73498958. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:34:57,390][62145] Avg episode reward: [(0, '801.397')] [2023-03-06 22:34:57,733][62475] Updated weights for policy 0, policy_version 71810 (0.0005) [2023-03-06 22:34:58,517][62475] Updated weights for policy 0, policy_version 71820 (0.0006) [2023-03-06 22:34:59,345][62475] Updated weights for policy 0, policy_version 71830 (0.0007) [2023-03-06 22:35:00,135][62475] Updated weights for policy 0, policy_version 71840 (0.0006) [2023-03-06 22:35:00,945][62475] Updated weights for policy 0, policy_version 71850 (0.0007) [2023-03-06 22:35:01,754][62475] Updated weights for policy 0, policy_version 71860 (0.0008) [2023-03-06 22:35:02,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12728.8). Total num frames: 73591808. Throughput: 0: 12740.5. Samples: 73575168. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:35:02,390][62145] Avg episode reward: [(0, '646.848')] [2023-03-06 22:35:02,570][62475] Updated weights for policy 0, policy_version 71870 (0.0006) [2023-03-06 22:35:03,371][62475] Updated weights for policy 0, policy_version 71880 (0.0006) [2023-03-06 22:35:04,173][62475] Updated weights for policy 0, policy_version 71890 (0.0007) [2023-03-06 22:35:04,973][62475] Updated weights for policy 0, policy_version 71900 (0.0006) [2023-03-06 22:35:05,769][62475] Updated weights for policy 0, policy_version 71910 (0.0007) [2023-03-06 22:35:06,549][62475] Updated weights for policy 0, policy_version 71920 (0.0005) [2023-03-06 22:35:07,382][62475] Updated weights for policy 0, policy_version 71930 (0.0006) [2023-03-06 22:35:07,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 73656320. Throughput: 0: 12735.5. Samples: 73651680. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:35:07,390][62145] Avg episode reward: [(0, '715.763')] [2023-03-06 22:35:08,188][62475] Updated weights for policy 0, policy_version 71940 (0.0007) [2023-03-06 22:35:08,975][62475] Updated weights for policy 0, policy_version 71950 (0.0006) [2023-03-06 22:35:09,786][62475] Updated weights for policy 0, policy_version 71960 (0.0006) [2023-03-06 22:35:10,589][62475] Updated weights for policy 0, policy_version 71970 (0.0007) [2023-03-06 22:35:11,389][62475] Updated weights for policy 0, policy_version 71980 (0.0006) [2023-03-06 22:35:12,200][62475] Updated weights for policy 0, policy_version 71990 (0.0006) [2023-03-06 22:35:12,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12748.8, 300 sec: 12732.3). Total num frames: 73719808. Throughput: 0: 12734.5. Samples: 73689975. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:35:12,390][62145] Avg episode reward: [(0, '523.320')] [2023-03-06 22:35:13,005][62475] Updated weights for policy 0, policy_version 72000 (0.0006) [2023-03-06 22:35:13,795][62475] Updated weights for policy 0, policy_version 72010 (0.0006) [2023-03-06 22:35:14,600][62475] Updated weights for policy 0, policy_version 72020 (0.0006) [2023-03-06 22:35:15,403][62475] Updated weights for policy 0, policy_version 72030 (0.0006) [2023-03-06 22:35:16,207][62475] Updated weights for policy 0, policy_version 72040 (0.0006) [2023-03-06 22:35:16,993][62475] Updated weights for policy 0, policy_version 72050 (0.0007) [2023-03-06 22:35:17,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12748.8, 300 sec: 12732.3). Total num frames: 73783296. Throughput: 0: 12742.4. Samples: 73766481. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:35:17,390][62145] Avg episode reward: [(0, '578.924')] [2023-03-06 22:35:17,790][62475] Updated weights for policy 0, policy_version 72060 (0.0006) [2023-03-06 22:35:18,605][62475] Updated weights for policy 0, policy_version 72070 (0.0006) [2023-03-06 22:35:19,402][62475] Updated weights for policy 0, policy_version 72080 (0.0007) [2023-03-06 22:35:20,222][62475] Updated weights for policy 0, policy_version 72090 (0.0006) [2023-03-06 22:35:21,021][62475] Updated weights for policy 0, policy_version 72100 (0.0006) [2023-03-06 22:35:21,814][62475] Updated weights for policy 0, policy_version 72110 (0.0006) [2023-03-06 22:35:22,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 73846784. Throughput: 0: 12742.0. Samples: 73843004. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:35:22,390][62145] Avg episode reward: [(0, '696.562')] [2023-03-06 22:35:22,630][62475] Updated weights for policy 0, policy_version 72120 (0.0006) [2023-03-06 22:35:23,435][62475] Updated weights for policy 0, policy_version 72130 (0.0006) [2023-03-06 22:35:24,238][62475] Updated weights for policy 0, policy_version 72140 (0.0006) [2023-03-06 22:35:25,048][62475] Updated weights for policy 0, policy_version 72150 (0.0006) [2023-03-06 22:35:25,833][62475] Updated weights for policy 0, policy_version 72160 (0.0007) [2023-03-06 22:35:26,632][62475] Updated weights for policy 0, policy_version 72170 (0.0007) [2023-03-06 22:35:27,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12735.8). Total num frames: 73911296. Throughput: 0: 12735.9. Samples: 73881132. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:35:27,390][62145] Avg episode reward: [(0, '662.570')] [2023-03-06 22:35:27,443][62475] Updated weights for policy 0, policy_version 72180 (0.0006) [2023-03-06 22:35:28,251][62475] Updated weights for policy 0, policy_version 72190 (0.0007) [2023-03-06 22:35:29,073][62475] Updated weights for policy 0, policy_version 72200 (0.0007) [2023-03-06 22:35:29,875][62475] Updated weights for policy 0, policy_version 72210 (0.0006) [2023-03-06 22:35:30,675][62475] Updated weights for policy 0, policy_version 72220 (0.0006) [2023-03-06 22:35:31,471][62475] Updated weights for policy 0, policy_version 72230 (0.0006) [2023-03-06 22:35:32,275][62475] Updated weights for policy 0, policy_version 72240 (0.0006) [2023-03-06 22:35:32,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12732.3). Total num frames: 73974784. Throughput: 0: 12742.4. Samples: 73957603. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:35:32,390][62145] Avg episode reward: [(0, '942.575')] [2023-03-06 22:35:33,096][62475] Updated weights for policy 0, policy_version 72250 (0.0005) [2023-03-06 22:35:33,882][62475] Updated weights for policy 0, policy_version 72260 (0.0006) [2023-03-06 22:35:34,677][62475] Updated weights for policy 0, policy_version 72270 (0.0006) [2023-03-06 22:35:35,499][62475] Updated weights for policy 0, policy_version 72280 (0.0006) [2023-03-06 22:35:36,281][62475] Updated weights for policy 0, policy_version 72290 (0.0007) [2023-03-06 22:35:37,073][62475] Updated weights for policy 0, policy_version 72300 (0.0006) [2023-03-06 22:35:37,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12748.8, 300 sec: 12732.3). Total num frames: 74038272. Throughput: 0: 12742.0. Samples: 74034258. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:35:37,390][62145] Avg episode reward: [(0, '719.099')] [2023-03-06 22:35:37,895][62475] Updated weights for policy 0, policy_version 72310 (0.0006) [2023-03-06 22:35:38,686][62475] Updated weights for policy 0, policy_version 72320 (0.0007) [2023-03-06 22:35:39,495][62475] Updated weights for policy 0, policy_version 72330 (0.0006) [2023-03-06 22:35:40,298][62475] Updated weights for policy 0, policy_version 72340 (0.0006) [2023-03-06 22:35:41,090][62475] Updated weights for policy 0, policy_version 72350 (0.0006) [2023-03-06 22:35:41,902][62475] Updated weights for policy 0, policy_version 72360 (0.0006) [2023-03-06 22:35:42,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12735.8). Total num frames: 74102784. Throughput: 0: 12745.2. Samples: 74072492. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:35:42,390][62145] Avg episode reward: [(0, '663.529')] [2023-03-06 22:35:42,712][62475] Updated weights for policy 0, policy_version 72370 (0.0006) [2023-03-06 22:35:43,500][62475] Updated weights for policy 0, policy_version 72380 (0.0007) [2023-03-06 22:35:44,320][62475] Updated weights for policy 0, policy_version 72390 (0.0007) [2023-03-06 22:35:45,111][62475] Updated weights for policy 0, policy_version 72400 (0.0006) [2023-03-06 22:35:45,900][62475] Updated weights for policy 0, policy_version 72410 (0.0008) [2023-03-06 22:35:46,711][62475] Updated weights for policy 0, policy_version 72420 (0.0006) [2023-03-06 22:35:47,389][62145] Fps is (10 sec: 12800.2, 60 sec: 12748.8, 300 sec: 12735.8). Total num frames: 74166272. Throughput: 0: 12751.8. Samples: 74148996. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:35:47,390][62145] Avg episode reward: [(0, '715.378')] [2023-03-06 22:35:47,512][62475] Updated weights for policy 0, policy_version 72430 (0.0006) [2023-03-06 22:35:48,342][62475] Updated weights for policy 0, policy_version 72440 (0.0006) [2023-03-06 22:35:49,144][62475] Updated weights for policy 0, policy_version 72450 (0.0006) [2023-03-06 22:35:49,937][62475] Updated weights for policy 0, policy_version 72460 (0.0006) [2023-03-06 22:35:50,750][62475] Updated weights for policy 0, policy_version 72470 (0.0006) [2023-03-06 22:35:51,548][62475] Updated weights for policy 0, policy_version 72480 (0.0006) [2023-03-06 22:35:52,345][62475] Updated weights for policy 0, policy_version 72490 (0.0005) [2023-03-06 22:35:52,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12748.8, 300 sec: 12735.8). Total num frames: 74229760. Throughput: 0: 12753.2. Samples: 74225573. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:35:52,390][62145] Avg episode reward: [(0, '578.557')] [2023-03-06 22:35:53,149][62475] Updated weights for policy 0, policy_version 72500 (0.0006) [2023-03-06 22:35:53,970][62475] Updated weights for policy 0, policy_version 72510 (0.0006) [2023-03-06 22:35:54,797][62475] Updated weights for policy 0, policy_version 72520 (0.0006) [2023-03-06 22:35:55,594][62475] Updated weights for policy 0, policy_version 72530 (0.0005) [2023-03-06 22:35:56,380][62475] Updated weights for policy 0, policy_version 72540 (0.0006) [2023-03-06 22:35:57,207][62475] Updated weights for policy 0, policy_version 72550 (0.0006) [2023-03-06 22:35:57,390][62145] Fps is (10 sec: 12697.4, 60 sec: 12748.8, 300 sec: 12735.8). Total num frames: 74293248. Throughput: 0: 12743.2. Samples: 74263421. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:35:57,390][62145] Avg episode reward: [(0, '751.008')] [2023-03-06 22:35:58,014][62475] Updated weights for policy 0, policy_version 72560 (0.0006) [2023-03-06 22:35:58,806][62475] Updated weights for policy 0, policy_version 72570 (0.0007) [2023-03-06 22:35:59,627][62475] Updated weights for policy 0, policy_version 72580 (0.0006) [2023-03-06 22:36:00,441][62475] Updated weights for policy 0, policy_version 72590 (0.0005) [2023-03-06 22:36:01,244][62475] Updated weights for policy 0, policy_version 72600 (0.0006) [2023-03-06 22:36:02,059][62475] Updated weights for policy 0, policy_version 72610 (0.0006) [2023-03-06 22:36:02,389][62145] Fps is (10 sec: 12595.2, 60 sec: 12731.7, 300 sec: 12732.3). Total num frames: 74355712. Throughput: 0: 12728.0. Samples: 74339239. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:36:02,390][62145] Avg episode reward: [(0, '827.050')] [2023-03-06 22:36:02,879][62475] Updated weights for policy 0, policy_version 72620 (0.0007) [2023-03-06 22:36:03,669][62475] Updated weights for policy 0, policy_version 72630 (0.0006) [2023-03-06 22:36:04,462][62475] Updated weights for policy 0, policy_version 72640 (0.0006) [2023-03-06 22:36:05,253][62475] Updated weights for policy 0, policy_version 72650 (0.0006) [2023-03-06 22:36:06,046][62475] Updated weights for policy 0, policy_version 72660 (0.0007) [2023-03-06 22:36:06,862][62475] Updated weights for policy 0, policy_version 72670 (0.0007) [2023-03-06 22:36:07,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12732.3). Total num frames: 74420224. Throughput: 0: 12730.1. Samples: 74415858. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:36:07,390][62145] Avg episode reward: [(0, '621.334')] [2023-03-06 22:36:07,653][62475] Updated weights for policy 0, policy_version 72680 (0.0007) [2023-03-06 22:36:08,478][62475] Updated weights for policy 0, policy_version 72690 (0.0006) [2023-03-06 22:36:09,292][62475] Updated weights for policy 0, policy_version 72700 (0.0007) [2023-03-06 22:36:10,089][62475] Updated weights for policy 0, policy_version 72710 (0.0006) [2023-03-06 22:36:10,887][62475] Updated weights for policy 0, policy_version 72720 (0.0006) [2023-03-06 22:36:11,693][62475] Updated weights for policy 0, policy_version 72730 (0.0007) [2023-03-06 22:36:12,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12731.7, 300 sec: 12732.3). Total num frames: 74483712. Throughput: 0: 12735.5. Samples: 74454229. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:36:12,390][62145] Avg episode reward: [(0, '810.542')] [2023-03-06 22:36:12,506][62475] Updated weights for policy 0, policy_version 72740 (0.0006) [2023-03-06 22:36:13,310][62475] Updated weights for policy 0, policy_version 72750 (0.0007) [2023-03-06 22:36:14,113][62475] Updated weights for policy 0, policy_version 72760 (0.0006) [2023-03-06 22:36:14,908][62475] Updated weights for policy 0, policy_version 72770 (0.0007) [2023-03-06 22:36:15,698][62475] Updated weights for policy 0, policy_version 72780 (0.0006) [2023-03-06 22:36:16,488][62475] Updated weights for policy 0, policy_version 72790 (0.0007) [2023-03-06 22:36:17,317][62475] Updated weights for policy 0, policy_version 72800 (0.0006) [2023-03-06 22:36:17,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12748.8, 300 sec: 12735.8). Total num frames: 74548224. Throughput: 0: 12737.9. Samples: 74530809. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:36:17,390][62145] Avg episode reward: [(0, '817.816')] [2023-03-06 22:36:18,122][62475] Updated weights for policy 0, policy_version 72810 (0.0006) [2023-03-06 22:36:18,927][62475] Updated weights for policy 0, policy_version 72820 (0.0007) [2023-03-06 22:36:19,745][62475] Updated weights for policy 0, policy_version 72830 (0.0006) [2023-03-06 22:36:20,553][62475] Updated weights for policy 0, policy_version 72840 (0.0006) [2023-03-06 22:36:21,334][62475] Updated weights for policy 0, policy_version 72850 (0.0006) [2023-03-06 22:36:22,161][62475] Updated weights for policy 0, policy_version 72860 (0.0007) [2023-03-06 22:36:22,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 74610688. Throughput: 0: 12725.0. Samples: 74606883. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:36:22,390][62145] Avg episode reward: [(0, '745.508')] [2023-03-06 22:36:22,407][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000072863_74611712.pth... [2023-03-06 22:36:22,438][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000069880_71557120.pth [2023-03-06 22:36:22,987][62475] Updated weights for policy 0, policy_version 72870 (0.0006) [2023-03-06 22:36:23,790][62475] Updated weights for policy 0, policy_version 72880 (0.0006) [2023-03-06 22:36:24,605][62475] Updated weights for policy 0, policy_version 72890 (0.0006) [2023-03-06 22:36:25,407][62475] Updated weights for policy 0, policy_version 72900 (0.0007) [2023-03-06 22:36:26,217][62475] Updated weights for policy 0, policy_version 72910 (0.0006) [2023-03-06 22:36:26,996][62475] Updated weights for policy 0, policy_version 72920 (0.0007) [2023-03-06 22:36:27,390][62145] Fps is (10 sec: 12595.1, 60 sec: 12714.7, 300 sec: 12728.8). Total num frames: 74674176. Throughput: 0: 12712.7. Samples: 74644562. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:36:27,390][62145] Avg episode reward: [(0, '918.090')] [2023-03-06 22:36:27,806][62475] Updated weights for policy 0, policy_version 72930 (0.0006) [2023-03-06 22:36:28,626][62475] Updated weights for policy 0, policy_version 72940 (0.0006) [2023-03-06 22:36:29,420][62475] Updated weights for policy 0, policy_version 72950 (0.0006) [2023-03-06 22:36:30,232][62475] Updated weights for policy 0, policy_version 72960 (0.0006) [2023-03-06 22:36:31,014][62475] Updated weights for policy 0, policy_version 72970 (0.0006) [2023-03-06 22:36:31,840][62475] Updated weights for policy 0, policy_version 72980 (0.0006) [2023-03-06 22:36:32,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12728.8). Total num frames: 74737664. Throughput: 0: 12712.9. Samples: 74721076. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:36:32,390][62145] Avg episode reward: [(0, '720.351')] [2023-03-06 22:36:32,645][62475] Updated weights for policy 0, policy_version 72990 (0.0006) [2023-03-06 22:36:33,438][62475] Updated weights for policy 0, policy_version 73000 (0.0006) [2023-03-06 22:36:34,226][62475] Updated weights for policy 0, policy_version 73010 (0.0005) [2023-03-06 22:36:35,031][62475] Updated weights for policy 0, policy_version 73020 (0.0007) [2023-03-06 22:36:35,830][62475] Updated weights for policy 0, policy_version 73030 (0.0007) [2023-03-06 22:36:36,645][62475] Updated weights for policy 0, policy_version 73040 (0.0006) [2023-03-06 22:36:37,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12731.7, 300 sec: 12732.3). Total num frames: 74802176. Throughput: 0: 12713.0. Samples: 74797659. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:36:37,390][62145] Avg episode reward: [(0, '860.593')] [2023-03-06 22:36:37,437][62475] Updated weights for policy 0, policy_version 73050 (0.0006) [2023-03-06 22:36:38,237][62475] Updated weights for policy 0, policy_version 73060 (0.0006) [2023-03-06 22:36:39,049][62475] Updated weights for policy 0, policy_version 73070 (0.0006) [2023-03-06 22:36:39,858][62475] Updated weights for policy 0, policy_version 73080 (0.0006) [2023-03-06 22:36:40,657][62475] Updated weights for policy 0, policy_version 73090 (0.0006) [2023-03-06 22:36:41,455][62475] Updated weights for policy 0, policy_version 73100 (0.0006) [2023-03-06 22:36:42,259][62475] Updated weights for policy 0, policy_version 73110 (0.0007) [2023-03-06 22:36:42,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12714.6, 300 sec: 12732.3). Total num frames: 74865664. Throughput: 0: 12722.9. Samples: 74835951. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:36:42,390][62145] Avg episode reward: [(0, '835.741')] [2023-03-06 22:36:43,059][62475] Updated weights for policy 0, policy_version 73120 (0.0007) [2023-03-06 22:36:43,876][62475] Updated weights for policy 0, policy_version 73130 (0.0007) [2023-03-06 22:36:44,693][62475] Updated weights for policy 0, policy_version 73140 (0.0006) [2023-03-06 22:36:45,482][62475] Updated weights for policy 0, policy_version 73150 (0.0006) [2023-03-06 22:36:46,322][62475] Updated weights for policy 0, policy_version 73160 (0.0007) [2023-03-06 22:36:47,111][62475] Updated weights for policy 0, policy_version 73170 (0.0006) [2023-03-06 22:36:47,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.6, 300 sec: 12732.3). Total num frames: 74929152. Throughput: 0: 12728.9. Samples: 74912040. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:36:47,390][62145] Avg episode reward: [(0, '978.882')] [2023-03-06 22:36:47,910][62475] Updated weights for policy 0, policy_version 73180 (0.0007) [2023-03-06 22:36:48,724][62475] Updated weights for policy 0, policy_version 73190 (0.0007) [2023-03-06 22:36:49,124][62424] KL-divergence is very high: 6715.3267 [2023-03-06 22:36:49,526][62475] Updated weights for policy 0, policy_version 73200 (0.0006) [2023-03-06 22:36:50,339][62475] Updated weights for policy 0, policy_version 73210 (0.0006) [2023-03-06 22:36:51,133][62475] Updated weights for policy 0, policy_version 73220 (0.0006) [2023-03-06 22:36:51,930][62475] Updated weights for policy 0, policy_version 73230 (0.0006) [2023-03-06 22:36:52,389][62145] Fps is (10 sec: 12697.8, 60 sec: 12714.7, 300 sec: 12732.3). Total num frames: 74992640. Throughput: 0: 12722.6. Samples: 74988376. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:36:52,390][62145] Avg episode reward: [(0, '809.028')] [2023-03-06 22:36:52,738][62475] Updated weights for policy 0, policy_version 73240 (0.0007) [2023-03-06 22:36:53,554][62475] Updated weights for policy 0, policy_version 73250 (0.0006) [2023-03-06 22:36:54,369][62475] Updated weights for policy 0, policy_version 73260 (0.0007) [2023-03-06 22:36:55,178][62475] Updated weights for policy 0, policy_version 73270 (0.0006) [2023-03-06 22:36:55,977][62475] Updated weights for policy 0, policy_version 73280 (0.0007) [2023-03-06 22:36:56,785][62475] Updated weights for policy 0, policy_version 73290 (0.0006) [2023-03-06 22:36:57,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12728.8). Total num frames: 75056128. Throughput: 0: 12713.2. Samples: 75026323. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:36:57,390][62145] Avg episode reward: [(0, '821.860')] [2023-03-06 22:36:57,577][62475] Updated weights for policy 0, policy_version 73300 (0.0006) [2023-03-06 22:36:58,394][62475] Updated weights for policy 0, policy_version 73310 (0.0006) [2023-03-06 22:36:59,215][62475] Updated weights for policy 0, policy_version 73320 (0.0007) [2023-03-06 22:37:00,015][62475] Updated weights for policy 0, policy_version 73330 (0.0006) [2023-03-06 22:37:00,835][62475] Updated weights for policy 0, policy_version 73340 (0.0006) [2023-03-06 22:37:01,659][62475] Updated weights for policy 0, policy_version 73350 (0.0006) [2023-03-06 22:37:02,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 75119616. Throughput: 0: 12701.5. Samples: 75102378. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:37:02,390][62145] Avg episode reward: [(0, '877.353')] [2023-03-06 22:37:02,454][62475] Updated weights for policy 0, policy_version 73360 (0.0006) [2023-03-06 22:37:03,260][62475] Updated weights for policy 0, policy_version 73370 (0.0006) [2023-03-06 22:37:04,055][62475] Updated weights for policy 0, policy_version 73380 (0.0006) [2023-03-06 22:37:04,872][62475] Updated weights for policy 0, policy_version 73390 (0.0006) [2023-03-06 22:37:05,658][62475] Updated weights for policy 0, policy_version 73400 (0.0006) [2023-03-06 22:37:06,474][62475] Updated weights for policy 0, policy_version 73410 (0.0006) [2023-03-06 22:37:07,299][62475] Updated weights for policy 0, policy_version 73420 (0.0007) [2023-03-06 22:37:07,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 75183104. Throughput: 0: 12704.8. Samples: 75178600. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:37:07,390][62145] Avg episode reward: [(0, '943.402')] [2023-03-06 22:37:08,091][62475] Updated weights for policy 0, policy_version 73430 (0.0006) [2023-03-06 22:37:08,879][62475] Updated weights for policy 0, policy_version 73440 (0.0006) [2023-03-06 22:37:09,696][62475] Updated weights for policy 0, policy_version 73450 (0.0007) [2023-03-06 22:37:10,505][62475] Updated weights for policy 0, policy_version 73460 (0.0008) [2023-03-06 22:37:11,306][62475] Updated weights for policy 0, policy_version 73470 (0.0006) [2023-03-06 22:37:12,124][62475] Updated weights for policy 0, policy_version 73480 (0.0007) [2023-03-06 22:37:12,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 75246592. Throughput: 0: 12717.5. Samples: 75216848. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:37:12,390][62145] Avg episode reward: [(0, '890.190')] [2023-03-06 22:37:12,924][62475] Updated weights for policy 0, policy_version 73490 (0.0006) [2023-03-06 22:37:13,747][62475] Updated weights for policy 0, policy_version 73500 (0.0007) [2023-03-06 22:37:14,555][62475] Updated weights for policy 0, policy_version 73510 (0.0006) [2023-03-06 22:37:15,365][62475] Updated weights for policy 0, policy_version 73520 (0.0006) [2023-03-06 22:37:16,173][62475] Updated weights for policy 0, policy_version 73530 (0.0007) [2023-03-06 22:37:16,965][62475] Updated weights for policy 0, policy_version 73540 (0.0006) [2023-03-06 22:37:17,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12721.9). Total num frames: 75310080. Throughput: 0: 12706.2. Samples: 75292857. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:37:17,390][62145] Avg episode reward: [(0, '877.147')] [2023-03-06 22:37:17,766][62475] Updated weights for policy 0, policy_version 73550 (0.0006) [2023-03-06 22:37:18,590][62475] Updated weights for policy 0, policy_version 73560 (0.0006) [2023-03-06 22:37:19,378][62475] Updated weights for policy 0, policy_version 73570 (0.0005) [2023-03-06 22:37:20,177][62475] Updated weights for policy 0, policy_version 73580 (0.0006) [2023-03-06 22:37:20,997][62475] Updated weights for policy 0, policy_version 73590 (0.0006) [2023-03-06 22:37:21,789][62475] Updated weights for policy 0, policy_version 73600 (0.0007) [2023-03-06 22:37:22,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 75373568. Throughput: 0: 12700.7. Samples: 75369190. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:37:22,390][62145] Avg episode reward: [(0, '704.147')] [2023-03-06 22:37:22,586][62475] Updated weights for policy 0, policy_version 73610 (0.0007) [2023-03-06 22:37:23,404][62475] Updated weights for policy 0, policy_version 73620 (0.0006) [2023-03-06 22:37:24,196][62475] Updated weights for policy 0, policy_version 73630 (0.0006) [2023-03-06 22:37:24,990][62475] Updated weights for policy 0, policy_version 73640 (0.0007) [2023-03-06 22:37:25,790][62475] Updated weights for policy 0, policy_version 73650 (0.0006) [2023-03-06 22:37:26,605][62475] Updated weights for policy 0, policy_version 73660 (0.0006) [2023-03-06 22:37:27,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12721.9). Total num frames: 75437056. Throughput: 0: 12703.0. Samples: 75407584. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:37:27,390][62145] Avg episode reward: [(0, '883.414')] [2023-03-06 22:37:27,393][62475] Updated weights for policy 0, policy_version 73670 (0.0006) [2023-03-06 22:37:28,197][62475] Updated weights for policy 0, policy_version 73680 (0.0006) [2023-03-06 22:37:29,001][62475] Updated weights for policy 0, policy_version 73690 (0.0006) [2023-03-06 22:37:29,808][62475] Updated weights for policy 0, policy_version 73700 (0.0006) [2023-03-06 22:37:30,599][62475] Updated weights for policy 0, policy_version 73710 (0.0006) [2023-03-06 22:37:31,407][62475] Updated weights for policy 0, policy_version 73720 (0.0006) [2023-03-06 22:37:32,227][62475] Updated weights for policy 0, policy_version 73730 (0.0006) [2023-03-06 22:37:32,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 75501568. Throughput: 0: 12716.6. Samples: 75484287. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:37:32,390][62145] Avg episode reward: [(0, '708.018')] [2023-03-06 22:37:33,027][62475] Updated weights for policy 0, policy_version 73740 (0.0007) [2023-03-06 22:37:33,829][62475] Updated weights for policy 0, policy_version 73750 (0.0005) [2023-03-06 22:37:34,637][62475] Updated weights for policy 0, policy_version 73760 (0.0006) [2023-03-06 22:37:34,781][62424] KL-divergence is very high: 7561.1592 [2023-03-06 22:37:35,433][62475] Updated weights for policy 0, policy_version 73770 (0.0007) [2023-03-06 22:37:36,249][62475] Updated weights for policy 0, policy_version 73780 (0.0006) [2023-03-06 22:37:37,052][62475] Updated weights for policy 0, policy_version 73790 (0.0006) [2023-03-06 22:37:37,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 75565056. Throughput: 0: 12713.6. Samples: 75560488. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:37:37,390][62145] Avg episode reward: [(0, '862.040')] [2023-03-06 22:37:37,841][62475] Updated weights for policy 0, policy_version 73800 (0.0006) [2023-03-06 22:37:38,633][62475] Updated weights for policy 0, policy_version 73810 (0.0006) [2023-03-06 22:37:39,443][62475] Updated weights for policy 0, policy_version 73820 (0.0006) [2023-03-06 22:37:40,249][62475] Updated weights for policy 0, policy_version 73830 (0.0006) [2023-03-06 22:37:41,063][62475] Updated weights for policy 0, policy_version 73840 (0.0006) [2023-03-06 22:37:41,858][62475] Updated weights for policy 0, policy_version 73850 (0.0006) [2023-03-06 22:37:42,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 75628544. Throughput: 0: 12721.2. Samples: 75598777. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:37:42,390][62145] Avg episode reward: [(0, '718.705')] [2023-03-06 22:37:42,669][62475] Updated weights for policy 0, policy_version 73860 (0.0006) [2023-03-06 22:37:43,470][62475] Updated weights for policy 0, policy_version 73870 (0.0006) [2023-03-06 22:37:44,293][62475] Updated weights for policy 0, policy_version 73880 (0.0007) [2023-03-06 22:37:45,097][62475] Updated weights for policy 0, policy_version 73890 (0.0007) [2023-03-06 22:37:45,888][62475] Updated weights for policy 0, policy_version 73900 (0.0007) [2023-03-06 22:37:46,709][62475] Updated weights for policy 0, policy_version 73910 (0.0006) [2023-03-06 22:37:47,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 75692032. Throughput: 0: 12727.1. Samples: 75675097. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:37:47,390][62145] Avg episode reward: [(0, '890.079')] [2023-03-06 22:37:47,488][62475] Updated weights for policy 0, policy_version 73920 (0.0006) [2023-03-06 22:37:48,303][62475] Updated weights for policy 0, policy_version 73930 (0.0006) [2023-03-06 22:37:48,875][62424] KL-divergence is very high: 114.9604 [2023-03-06 22:37:49,106][62475] Updated weights for policy 0, policy_version 73940 (0.0006) [2023-03-06 22:37:49,919][62475] Updated weights for policy 0, policy_version 73950 (0.0006) [2023-03-06 22:37:50,726][62475] Updated weights for policy 0, policy_version 73960 (0.0006) [2023-03-06 22:37:51,518][62475] Updated weights for policy 0, policy_version 73970 (0.0006) [2023-03-06 22:37:52,333][62475] Updated weights for policy 0, policy_version 73980 (0.0006) [2023-03-06 22:37:52,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.6, 300 sec: 12725.4). Total num frames: 75755520. Throughput: 0: 12729.0. Samples: 75751405. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:37:52,390][62145] Avg episode reward: [(0, '774.074')] [2023-03-06 22:37:53,161][62475] Updated weights for policy 0, policy_version 73990 (0.0007) [2023-03-06 22:37:53,948][62475] Updated weights for policy 0, policy_version 74000 (0.0007) [2023-03-06 22:37:54,784][62475] Updated weights for policy 0, policy_version 74010 (0.0006) [2023-03-06 22:37:55,600][62475] Updated weights for policy 0, policy_version 74020 (0.0006) [2023-03-06 22:37:56,386][62475] Updated weights for policy 0, policy_version 74030 (0.0005) [2023-03-06 22:37:57,193][62475] Updated weights for policy 0, policy_version 74040 (0.0006) [2023-03-06 22:37:57,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 75819008. Throughput: 0: 12718.3. Samples: 75789171. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:37:57,390][62145] Avg episode reward: [(0, '725.695')] [2023-03-06 22:37:57,990][62475] Updated weights for policy 0, policy_version 74050 (0.0006) [2023-03-06 22:37:58,812][62475] Updated weights for policy 0, policy_version 74060 (0.0006) [2023-03-06 22:37:59,605][62475] Updated weights for policy 0, policy_version 74070 (0.0006) [2023-03-06 22:38:00,413][62475] Updated weights for policy 0, policy_version 74080 (0.0006) [2023-03-06 22:38:01,236][62475] Updated weights for policy 0, policy_version 74090 (0.0007) [2023-03-06 22:38:02,038][62475] Updated weights for policy 0, policy_version 74100 (0.0006) [2023-03-06 22:38:02,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12721.9). Total num frames: 75882496. Throughput: 0: 12724.6. Samples: 75865466. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:38:02,390][62145] Avg episode reward: [(0, '624.799')] [2023-03-06 22:38:02,854][62475] Updated weights for policy 0, policy_version 74110 (0.0006) [2023-03-06 22:38:03,655][62475] Updated weights for policy 0, policy_version 74120 (0.0006) [2023-03-06 22:38:04,452][62475] Updated weights for policy 0, policy_version 74130 (0.0006) [2023-03-06 22:38:05,260][62475] Updated weights for policy 0, policy_version 74140 (0.0007) [2023-03-06 22:38:06,065][62475] Updated weights for policy 0, policy_version 74150 (0.0006) [2023-03-06 22:38:06,859][62475] Updated weights for policy 0, policy_version 74160 (0.0006) [2023-03-06 22:38:07,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12721.9). Total num frames: 75945984. Throughput: 0: 12719.5. Samples: 75941566. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:38:07,390][62145] Avg episode reward: [(0, '742.594')] [2023-03-06 22:38:07,672][62475] Updated weights for policy 0, policy_version 74170 (0.0006) [2023-03-06 22:38:08,469][62475] Updated weights for policy 0, policy_version 74180 (0.0007) [2023-03-06 22:38:09,275][62475] Updated weights for policy 0, policy_version 74190 (0.0007) [2023-03-06 22:38:10,066][62475] Updated weights for policy 0, policy_version 74200 (0.0006) [2023-03-06 22:38:10,869][62475] Updated weights for policy 0, policy_version 74210 (0.0006) [2023-03-06 22:38:11,691][62475] Updated weights for policy 0, policy_version 74220 (0.0006) [2023-03-06 22:38:12,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 76009472. Throughput: 0: 12719.0. Samples: 75979938. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:38:12,390][62145] Avg episode reward: [(0, '864.536')] [2023-03-06 22:38:12,470][62475] Updated weights for policy 0, policy_version 74230 (0.0006) [2023-03-06 22:38:13,280][62475] Updated weights for policy 0, policy_version 74240 (0.0006) [2023-03-06 22:38:14,096][62475] Updated weights for policy 0, policy_version 74250 (0.0007) [2023-03-06 22:38:14,899][62475] Updated weights for policy 0, policy_version 74260 (0.0006) [2023-03-06 22:38:15,697][62475] Updated weights for policy 0, policy_version 74270 (0.0007) [2023-03-06 22:38:16,512][62475] Updated weights for policy 0, policy_version 74280 (0.0007) [2023-03-06 22:38:17,332][62475] Updated weights for policy 0, policy_version 74290 (0.0006) [2023-03-06 22:38:17,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12721.9). Total num frames: 76072960. Throughput: 0: 12712.7. Samples: 76056360. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:38:17,390][62145] Avg episode reward: [(0, '864.507')] [2023-03-06 22:38:18,129][62475] Updated weights for policy 0, policy_version 74300 (0.0011) [2023-03-06 22:38:18,913][62475] Updated weights for policy 0, policy_version 74310 (0.0006) [2023-03-06 22:38:19,734][62475] Updated weights for policy 0, policy_version 74320 (0.0006) [2023-03-06 22:38:20,528][62475] Updated weights for policy 0, policy_version 74330 (0.0006) [2023-03-06 22:38:21,345][62475] Updated weights for policy 0, policy_version 74340 (0.0006) [2023-03-06 22:38:22,132][62475] Updated weights for policy 0, policy_version 74350 (0.0006) [2023-03-06 22:38:22,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 76137472. Throughput: 0: 12717.0. Samples: 76132753. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:38:22,390][62145] Avg episode reward: [(0, '717.367')] [2023-03-06 22:38:22,393][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000074353_76137472.pth... [2023-03-06 22:38:22,426][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000071370_73082880.pth [2023-03-06 22:38:22,939][62475] Updated weights for policy 0, policy_version 74360 (0.0006) [2023-03-06 22:38:23,746][62475] Updated weights for policy 0, policy_version 74370 (0.0006) [2023-03-06 22:38:24,565][62475] Updated weights for policy 0, policy_version 74380 (0.0006) [2023-03-06 22:38:25,369][62475] Updated weights for policy 0, policy_version 74390 (0.0006) [2023-03-06 22:38:26,158][62475] Updated weights for policy 0, policy_version 74400 (0.0006) [2023-03-06 22:38:26,966][62475] Updated weights for policy 0, policy_version 74410 (0.0006) [2023-03-06 22:38:27,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 76200960. Throughput: 0: 12707.3. Samples: 76170606. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:38:27,390][62145] Avg episode reward: [(0, '713.652')] [2023-03-06 22:38:27,761][62475] Updated weights for policy 0, policy_version 74420 (0.0006) [2023-03-06 22:38:28,558][62475] Updated weights for policy 0, policy_version 74430 (0.0006) [2023-03-06 22:38:29,364][62475] Updated weights for policy 0, policy_version 74440 (0.0007) [2023-03-06 22:38:30,169][62475] Updated weights for policy 0, policy_version 74450 (0.0006) [2023-03-06 22:38:30,966][62475] Updated weights for policy 0, policy_version 74460 (0.0006) [2023-03-06 22:38:31,773][62475] Updated weights for policy 0, policy_version 74470 (0.0006) [2023-03-06 22:38:32,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 76264448. Throughput: 0: 12724.1. Samples: 76247683. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:38:32,390][62145] Avg episode reward: [(0, '552.141')] [2023-03-06 22:38:32,580][62475] Updated weights for policy 0, policy_version 74480 (0.0006) [2023-03-06 22:38:33,365][62475] Updated weights for policy 0, policy_version 74490 (0.0007) [2023-03-06 22:38:34,199][62475] Updated weights for policy 0, policy_version 74500 (0.0006) [2023-03-06 22:38:35,011][62475] Updated weights for policy 0, policy_version 74510 (0.0007) [2023-03-06 22:38:35,810][62475] Updated weights for policy 0, policy_version 74520 (0.0006) [2023-03-06 22:38:36,623][62475] Updated weights for policy 0, policy_version 74530 (0.0007) [2023-03-06 22:38:37,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.6, 300 sec: 12725.4). Total num frames: 76327936. Throughput: 0: 12715.0. Samples: 76323579. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:38:37,390][62145] Avg episode reward: [(0, '746.168')] [2023-03-06 22:38:37,434][62475] Updated weights for policy 0, policy_version 74540 (0.0006) [2023-03-06 22:38:38,225][62475] Updated weights for policy 0, policy_version 74550 (0.0006) [2023-03-06 22:38:39,030][62475] Updated weights for policy 0, policy_version 74560 (0.0006) [2023-03-06 22:38:39,848][62475] Updated weights for policy 0, policy_version 74570 (0.0006) [2023-03-06 22:38:40,641][62475] Updated weights for policy 0, policy_version 74580 (0.0007) [2023-03-06 22:38:41,437][62475] Updated weights for policy 0, policy_version 74590 (0.0006) [2023-03-06 22:38:42,247][62475] Updated weights for policy 0, policy_version 74600 (0.0006) [2023-03-06 22:38:42,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 76391424. Throughput: 0: 12725.3. Samples: 76361811. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:38:42,390][62145] Avg episode reward: [(0, '658.282')] [2023-03-06 22:38:43,056][62475] Updated weights for policy 0, policy_version 74610 (0.0007) [2023-03-06 22:38:43,866][62475] Updated weights for policy 0, policy_version 74620 (0.0006) [2023-03-06 22:38:44,651][62475] Updated weights for policy 0, policy_version 74630 (0.0006) [2023-03-06 22:38:45,476][62475] Updated weights for policy 0, policy_version 74640 (0.0007) [2023-03-06 22:38:46,266][62475] Updated weights for policy 0, policy_version 74650 (0.0007) [2023-03-06 22:38:47,064][62475] Updated weights for policy 0, policy_version 74660 (0.0006) [2023-03-06 22:38:47,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 76455936. Throughput: 0: 12729.6. Samples: 76438299. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:38:47,390][62145] Avg episode reward: [(0, '467.342')] [2023-03-06 22:38:47,862][62475] Updated weights for policy 0, policy_version 74670 (0.0007) [2023-03-06 22:38:48,683][62475] Updated weights for policy 0, policy_version 74680 (0.0007) [2023-03-06 22:38:49,494][62475] Updated weights for policy 0, policy_version 74690 (0.0006) [2023-03-06 22:38:50,291][62475] Updated weights for policy 0, policy_version 74700 (0.0006) [2023-03-06 22:38:51,081][62475] Updated weights for policy 0, policy_version 74710 (0.0007) [2023-03-06 22:38:51,900][62475] Updated weights for policy 0, policy_version 74720 (0.0007) [2023-03-06 22:38:52,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 76518400. Throughput: 0: 12732.5. Samples: 76514528. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:38:52,390][62145] Avg episode reward: [(0, '572.712')] [2023-03-06 22:38:52,717][62475] Updated weights for policy 0, policy_version 74730 (0.0007) [2023-03-06 22:38:53,511][62475] Updated weights for policy 0, policy_version 74740 (0.0006) [2023-03-06 22:38:54,328][62475] Updated weights for policy 0, policy_version 74750 (0.0006) [2023-03-06 22:38:55,118][62475] Updated weights for policy 0, policy_version 74760 (0.0006) [2023-03-06 22:38:55,926][62475] Updated weights for policy 0, policy_version 74770 (0.0005) [2023-03-06 22:38:56,746][62475] Updated weights for policy 0, policy_version 74780 (0.0006) [2023-03-06 22:38:57,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 76582912. Throughput: 0: 12731.6. Samples: 76552857. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:38:57,390][62145] Avg episode reward: [(0, '561.344')] [2023-03-06 22:38:57,546][62475] Updated weights for policy 0, policy_version 74790 (0.0007) [2023-03-06 22:38:58,351][62475] Updated weights for policy 0, policy_version 74800 (0.0006) [2023-03-06 22:38:59,165][62475] Updated weights for policy 0, policy_version 74810 (0.0007) [2023-03-06 22:38:59,978][62475] Updated weights for policy 0, policy_version 74820 (0.0006) [2023-03-06 22:39:00,774][62475] Updated weights for policy 0, policy_version 74830 (0.0006) [2023-03-06 22:39:01,570][62475] Updated weights for policy 0, policy_version 74840 (0.0007) [2023-03-06 22:39:02,379][62475] Updated weights for policy 0, policy_version 74850 (0.0006) [2023-03-06 22:39:02,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 76646400. Throughput: 0: 12720.5. Samples: 76628782. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:39:02,390][62145] Avg episode reward: [(0, '690.306')] [2023-03-06 22:39:03,172][62475] Updated weights for policy 0, policy_version 74860 (0.0006) [2023-03-06 22:39:03,995][62475] Updated weights for policy 0, policy_version 74870 (0.0007) [2023-03-06 22:39:04,806][62475] Updated weights for policy 0, policy_version 74880 (0.0006) [2023-03-06 22:39:05,601][62475] Updated weights for policy 0, policy_version 74890 (0.0006) [2023-03-06 22:39:06,394][62475] Updated weights for policy 0, policy_version 74900 (0.0006) [2023-03-06 22:39:07,218][62475] Updated weights for policy 0, policy_version 74910 (0.0007) [2023-03-06 22:39:07,389][62145] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 76709888. Throughput: 0: 12723.2. Samples: 76705297. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:39:07,390][62145] Avg episode reward: [(0, '699.043')] [2023-03-06 22:39:08,009][62475] Updated weights for policy 0, policy_version 74920 (0.0006) [2023-03-06 22:39:08,798][62475] Updated weights for policy 0, policy_version 74930 (0.0006) [2023-03-06 22:39:09,627][62475] Updated weights for policy 0, policy_version 74940 (0.0006) [2023-03-06 22:39:10,432][62475] Updated weights for policy 0, policy_version 74950 (0.0006) [2023-03-06 22:39:11,225][62475] Updated weights for policy 0, policy_version 74960 (0.0007) [2023-03-06 22:39:12,045][62475] Updated weights for policy 0, policy_version 74970 (0.0006) [2023-03-06 22:39:12,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 76773376. Throughput: 0: 12729.7. Samples: 76743441. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:39:12,390][62145] Avg episode reward: [(0, '539.116')] [2023-03-06 22:39:12,837][62475] Updated weights for policy 0, policy_version 74980 (0.0006) [2023-03-06 22:39:13,640][62475] Updated weights for policy 0, policy_version 74990 (0.0007) [2023-03-06 22:39:14,433][62475] Updated weights for policy 0, policy_version 75000 (0.0007) [2023-03-06 22:39:15,258][62475] Updated weights for policy 0, policy_version 75010 (0.0006) [2023-03-06 22:39:16,066][62475] Updated weights for policy 0, policy_version 75020 (0.0006) [2023-03-06 22:39:16,855][62475] Updated weights for policy 0, policy_version 75030 (0.0006) [2023-03-06 22:39:17,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 76836864. Throughput: 0: 12716.1. Samples: 76819908. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:39:17,390][62145] Avg episode reward: [(0, '531.201')] [2023-03-06 22:39:17,676][62475] Updated weights for policy 0, policy_version 75040 (0.0007) [2023-03-06 22:39:18,470][62475] Updated weights for policy 0, policy_version 75050 (0.0006) [2023-03-06 22:39:19,256][62475] Updated weights for policy 0, policy_version 75060 (0.0006) [2023-03-06 22:39:20,067][62475] Updated weights for policy 0, policy_version 75070 (0.0006) [2023-03-06 22:39:20,860][62475] Updated weights for policy 0, policy_version 75080 (0.0007) [2023-03-06 22:39:21,674][62475] Updated weights for policy 0, policy_version 75090 (0.0006) [2023-03-06 22:39:22,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.6, 300 sec: 12725.4). Total num frames: 76900352. Throughput: 0: 12729.5. Samples: 76896409. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:39:22,390][62145] Avg episode reward: [(0, '659.984')] [2023-03-06 22:39:22,474][62475] Updated weights for policy 0, policy_version 75100 (0.0005) [2023-03-06 22:39:23,287][62475] Updated weights for policy 0, policy_version 75110 (0.0007) [2023-03-06 22:39:24,095][62475] Updated weights for policy 0, policy_version 75120 (0.0006) [2023-03-06 22:39:24,905][62475] Updated weights for policy 0, policy_version 75130 (0.0006) [2023-03-06 22:39:25,711][62475] Updated weights for policy 0, policy_version 75140 (0.0007) [2023-03-06 22:39:26,517][62475] Updated weights for policy 0, policy_version 75150 (0.0006) [2023-03-06 22:39:27,327][62475] Updated weights for policy 0, policy_version 75160 (0.0006) [2023-03-06 22:39:27,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.6, 300 sec: 12725.4). Total num frames: 76963840. Throughput: 0: 12724.0. Samples: 76934391. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:39:27,390][62145] Avg episode reward: [(0, '696.773')] [2023-03-06 22:39:28,138][62475] Updated weights for policy 0, policy_version 75170 (0.0006) [2023-03-06 22:39:28,946][62475] Updated weights for policy 0, policy_version 75180 (0.0006) [2023-03-06 22:39:29,754][62475] Updated weights for policy 0, policy_version 75190 (0.0006) [2023-03-06 22:39:30,553][62475] Updated weights for policy 0, policy_version 75200 (0.0006) [2023-03-06 22:39:31,359][62475] Updated weights for policy 0, policy_version 75210 (0.0006) [2023-03-06 22:39:32,154][62475] Updated weights for policy 0, policy_version 75220 (0.0006) [2023-03-06 22:39:32,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 77028352. Throughput: 0: 12719.0. Samples: 77010655. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:39:32,390][62145] Avg episode reward: [(0, '677.360')] [2023-03-06 22:39:32,942][62475] Updated weights for policy 0, policy_version 75230 (0.0006) [2023-03-06 22:39:33,745][62475] Updated weights for policy 0, policy_version 75240 (0.0007) [2023-03-06 22:39:34,533][62475] Updated weights for policy 0, policy_version 75250 (0.0007) [2023-03-06 22:39:35,334][62475] Updated weights for policy 0, policy_version 75260 (0.0006) [2023-03-06 22:39:36,140][62475] Updated weights for policy 0, policy_version 75270 (0.0006) [2023-03-06 22:39:36,965][62475] Updated weights for policy 0, policy_version 75280 (0.0006) [2023-03-06 22:39:37,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 77091840. Throughput: 0: 12731.4. Samples: 77087443. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:39:37,390][62145] Avg episode reward: [(0, '658.066')] [2023-03-06 22:39:37,759][62475] Updated weights for policy 0, policy_version 75290 (0.0007) [2023-03-06 22:39:38,575][62475] Updated weights for policy 0, policy_version 75300 (0.0007) [2023-03-06 22:39:39,368][62475] Updated weights for policy 0, policy_version 75310 (0.0007) [2023-03-06 22:39:40,192][62475] Updated weights for policy 0, policy_version 75320 (0.0006) [2023-03-06 22:39:40,993][62475] Updated weights for policy 0, policy_version 75330 (0.0006) [2023-03-06 22:39:41,796][62475] Updated weights for policy 0, policy_version 75340 (0.0006) [2023-03-06 22:39:42,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 77155328. Throughput: 0: 12724.1. Samples: 77125445. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:39:42,390][62145] Avg episode reward: [(0, '729.915')] [2023-03-06 22:39:42,598][62475] Updated weights for policy 0, policy_version 75350 (0.0006) [2023-03-06 22:39:43,387][62475] Updated weights for policy 0, policy_version 75360 (0.0007) [2023-03-06 22:39:44,181][62475] Updated weights for policy 0, policy_version 75370 (0.0006) [2023-03-06 22:39:44,987][62475] Updated weights for policy 0, policy_version 75380 (0.0006) [2023-03-06 22:39:45,793][62475] Updated weights for policy 0, policy_version 75390 (0.0006) [2023-03-06 22:39:46,588][62475] Updated weights for policy 0, policy_version 75400 (0.0007) [2023-03-06 22:39:47,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.6, 300 sec: 12725.4). Total num frames: 77218816. Throughput: 0: 12740.4. Samples: 77202100. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:39:47,390][62145] Avg episode reward: [(0, '556.224')] [2023-03-06 22:39:47,425][62475] Updated weights for policy 0, policy_version 75410 (0.0006) [2023-03-06 22:39:48,212][62475] Updated weights for policy 0, policy_version 75420 (0.0006) [2023-03-06 22:39:49,000][62475] Updated weights for policy 0, policy_version 75430 (0.0007) [2023-03-06 22:39:49,831][62475] Updated weights for policy 0, policy_version 75440 (0.0006) [2023-03-06 22:39:50,604][62475] Updated weights for policy 0, policy_version 75450 (0.0006) [2023-03-06 22:39:51,416][62475] Updated weights for policy 0, policy_version 75460 (0.0006) [2023-03-06 22:39:52,222][62475] Updated weights for policy 0, policy_version 75470 (0.0007) [2023-03-06 22:39:52,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12728.8). Total num frames: 77283328. Throughput: 0: 12740.3. Samples: 77278612. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:39:52,390][62145] Avg episode reward: [(0, '598.080')] [2023-03-06 22:39:53,030][62475] Updated weights for policy 0, policy_version 75480 (0.0006) [2023-03-06 22:39:53,828][62475] Updated weights for policy 0, policy_version 75490 (0.0007) [2023-03-06 22:39:54,654][62475] Updated weights for policy 0, policy_version 75500 (0.0006) [2023-03-06 22:39:55,450][62475] Updated weights for policy 0, policy_version 75510 (0.0006) [2023-03-06 22:39:56,245][62475] Updated weights for policy 0, policy_version 75520 (0.0006) [2023-03-06 22:39:57,053][62475] Updated weights for policy 0, policy_version 75530 (0.0006) [2023-03-06 22:39:57,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 77346816. Throughput: 0: 12738.7. Samples: 77316682. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:39:57,390][62145] Avg episode reward: [(0, '531.341')] [2023-03-06 22:39:57,843][62475] Updated weights for policy 0, policy_version 75540 (0.0005) [2023-03-06 22:39:58,642][62475] Updated weights for policy 0, policy_version 75550 (0.0006) [2023-03-06 22:39:59,449][62475] Updated weights for policy 0, policy_version 75560 (0.0006) [2023-03-06 22:40:00,247][62475] Updated weights for policy 0, policy_version 75570 (0.0006) [2023-03-06 22:40:01,045][62475] Updated weights for policy 0, policy_version 75580 (0.0006) [2023-03-06 22:40:01,854][62475] Updated weights for policy 0, policy_version 75590 (0.0006) [2023-03-06 22:40:02,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 77410304. Throughput: 0: 12744.9. Samples: 77393429. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:40:02,390][62145] Avg episode reward: [(0, '699.184')] [2023-03-06 22:40:02,655][62475] Updated weights for policy 0, policy_version 75600 (0.0006) [2023-03-06 22:40:03,453][62475] Updated weights for policy 0, policy_version 75610 (0.0006) [2023-03-06 22:40:04,262][62475] Updated weights for policy 0, policy_version 75620 (0.0007) [2023-03-06 22:40:05,061][62475] Updated weights for policy 0, policy_version 75630 (0.0006) [2023-03-06 22:40:05,845][62475] Updated weights for policy 0, policy_version 75640 (0.0006) [2023-03-06 22:40:06,652][62475] Updated weights for policy 0, policy_version 75650 (0.0006) [2023-03-06 22:40:07,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12748.8, 300 sec: 12728.8). Total num frames: 77474816. Throughput: 0: 12749.4. Samples: 77470131. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:40:07,390][62145] Avg episode reward: [(0, '751.997')] [2023-03-06 22:40:07,453][62475] Updated weights for policy 0, policy_version 75660 (0.0006) [2023-03-06 22:40:08,254][62475] Updated weights for policy 0, policy_version 75670 (0.0006) [2023-03-06 22:40:09,044][62475] Updated weights for policy 0, policy_version 75680 (0.0006) [2023-03-06 22:40:09,867][62475] Updated weights for policy 0, policy_version 75690 (0.0006) [2023-03-06 22:40:10,665][62475] Updated weights for policy 0, policy_version 75700 (0.0006) [2023-03-06 22:40:11,466][62475] Updated weights for policy 0, policy_version 75710 (0.0006) [2023-03-06 22:40:12,274][62475] Updated weights for policy 0, policy_version 75720 (0.0006) [2023-03-06 22:40:12,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12748.8, 300 sec: 12728.8). Total num frames: 77538304. Throughput: 0: 12757.0. Samples: 77508458. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:40:12,390][62145] Avg episode reward: [(0, '810.197')] [2023-03-06 22:40:13,068][62475] Updated weights for policy 0, policy_version 75730 (0.0006) [2023-03-06 22:40:13,866][62475] Updated weights for policy 0, policy_version 75740 (0.0006) [2023-03-06 22:40:14,679][62475] Updated weights for policy 0, policy_version 75750 (0.0006) [2023-03-06 22:40:15,481][62475] Updated weights for policy 0, policy_version 75760 (0.0006) [2023-03-06 22:40:16,273][62475] Updated weights for policy 0, policy_version 75770 (0.0006) [2023-03-06 22:40:17,069][62475] Updated weights for policy 0, policy_version 75780 (0.0006) [2023-03-06 22:40:17,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12748.8, 300 sec: 12728.8). Total num frames: 77601792. Throughput: 0: 12768.8. Samples: 77585252. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:40:17,390][62145] Avg episode reward: [(0, '709.882')] [2023-03-06 22:40:17,859][62475] Updated weights for policy 0, policy_version 75790 (0.0006) [2023-03-06 22:40:18,685][62475] Updated weights for policy 0, policy_version 75800 (0.0007) [2023-03-06 22:40:19,494][62475] Updated weights for policy 0, policy_version 75810 (0.0006) [2023-03-06 22:40:20,312][62475] Updated weights for policy 0, policy_version 75820 (0.0006) [2023-03-06 22:40:21,094][62475] Updated weights for policy 0, policy_version 75830 (0.0006) [2023-03-06 22:40:21,914][62475] Updated weights for policy 0, policy_version 75840 (0.0006) [2023-03-06 22:40:22,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12748.8, 300 sec: 12725.4). Total num frames: 77665280. Throughput: 0: 12754.9. Samples: 77661416. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:40:22,390][62145] Avg episode reward: [(0, '835.541')] [2023-03-06 22:40:22,393][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000075846_77666304.pth... [2023-03-06 22:40:22,424][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000072863_74611712.pth [2023-03-06 22:40:22,708][62475] Updated weights for policy 0, policy_version 75850 (0.0006) [2023-03-06 22:40:23,503][62475] Updated weights for policy 0, policy_version 75860 (0.0006) [2023-03-06 22:40:24,304][62475] Updated weights for policy 0, policy_version 75870 (0.0006) [2023-03-06 22:40:25,111][62475] Updated weights for policy 0, policy_version 75880 (0.0006) [2023-03-06 22:40:25,902][62475] Updated weights for policy 0, policy_version 75890 (0.0006) [2023-03-06 22:40:26,701][62475] Updated weights for policy 0, policy_version 75900 (0.0007) [2023-03-06 22:40:27,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12765.9, 300 sec: 12728.8). Total num frames: 77729792. Throughput: 0: 12764.0. Samples: 77699826. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:40:27,390][62145] Avg episode reward: [(0, '903.671')] [2023-03-06 22:40:27,512][62475] Updated weights for policy 0, policy_version 75910 (0.0006) [2023-03-06 22:40:28,309][62475] Updated weights for policy 0, policy_version 75920 (0.0006) [2023-03-06 22:40:29,109][62475] Updated weights for policy 0, policy_version 75930 (0.0006) [2023-03-06 22:40:29,937][62475] Updated weights for policy 0, policy_version 75940 (0.0006) [2023-03-06 22:40:30,750][62475] Updated weights for policy 0, policy_version 75950 (0.0006) [2023-03-06 22:40:31,530][62475] Updated weights for policy 0, policy_version 75960 (0.0007) [2023-03-06 22:40:32,337][62475] Updated weights for policy 0, policy_version 75970 (0.0007) [2023-03-06 22:40:32,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12748.8, 300 sec: 12728.8). Total num frames: 77793280. Throughput: 0: 12760.5. Samples: 77776322. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:40:32,390][62145] Avg episode reward: [(0, '888.663')] [2023-03-06 22:40:33,150][62475] Updated weights for policy 0, policy_version 75980 (0.0005) [2023-03-06 22:40:33,962][62475] Updated weights for policy 0, policy_version 75990 (0.0008) [2023-03-06 22:40:34,762][62475] Updated weights for policy 0, policy_version 76000 (0.0005) [2023-03-06 22:40:35,555][62475] Updated weights for policy 0, policy_version 76010 (0.0007) [2023-03-06 22:40:36,358][62475] Updated weights for policy 0, policy_version 76020 (0.0006) [2023-03-06 22:40:37,162][62475] Updated weights for policy 0, policy_version 76030 (0.0006) [2023-03-06 22:40:37,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12748.8, 300 sec: 12725.4). Total num frames: 77856768. Throughput: 0: 12758.9. Samples: 77852762. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:40:37,390][62145] Avg episode reward: [(0, '944.121')] [2023-03-06 22:40:37,981][62475] Updated weights for policy 0, policy_version 76040 (0.0006) [2023-03-06 22:40:38,767][62475] Updated weights for policy 0, policy_version 76050 (0.0006) [2023-03-06 22:40:39,585][62475] Updated weights for policy 0, policy_version 76060 (0.0006) [2023-03-06 22:40:40,390][62475] Updated weights for policy 0, policy_version 76070 (0.0006) [2023-03-06 22:40:41,198][62475] Updated weights for policy 0, policy_version 76080 (0.0006) [2023-03-06 22:40:42,026][62475] Updated weights for policy 0, policy_version 76090 (0.0006) [2023-03-06 22:40:42,389][62145] Fps is (10 sec: 12697.8, 60 sec: 12748.8, 300 sec: 12725.4). Total num frames: 77920256. Throughput: 0: 12755.8. Samples: 77890693. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:40:42,390][62145] Avg episode reward: [(0, '797.466')] [2023-03-06 22:40:42,842][62475] Updated weights for policy 0, policy_version 76100 (0.0007) [2023-03-06 22:40:43,642][62475] Updated weights for policy 0, policy_version 76110 (0.0006) [2023-03-06 22:40:44,448][62475] Updated weights for policy 0, policy_version 76120 (0.0007) [2023-03-06 22:40:45,278][62475] Updated weights for policy 0, policy_version 76130 (0.0006) [2023-03-06 22:40:46,065][62475] Updated weights for policy 0, policy_version 76140 (0.0007) [2023-03-06 22:40:46,878][62475] Updated weights for policy 0, policy_version 76150 (0.0006) [2023-03-06 22:40:47,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12748.8, 300 sec: 12725.4). Total num frames: 77983744. Throughput: 0: 12736.3. Samples: 77966561. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:40:47,390][62145] Avg episode reward: [(0, '976.848')] [2023-03-06 22:40:47,675][62475] Updated weights for policy 0, policy_version 76160 (0.0006) [2023-03-06 22:40:48,484][62475] Updated weights for policy 0, policy_version 76170 (0.0006) [2023-03-06 22:40:49,316][62475] Updated weights for policy 0, policy_version 76180 (0.0006) [2023-03-06 22:40:50,102][62475] Updated weights for policy 0, policy_version 76190 (0.0007) [2023-03-06 22:40:50,915][62475] Updated weights for policy 0, policy_version 76200 (0.0006) [2023-03-06 22:40:51,698][62475] Updated weights for policy 0, policy_version 76210 (0.0006) [2023-03-06 22:40:52,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 78047232. Throughput: 0: 12727.8. Samples: 78042880. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:40:52,390][62145] Avg episode reward: [(0, '863.908')] [2023-03-06 22:40:52,522][62475] Updated weights for policy 0, policy_version 76220 (0.0006) [2023-03-06 22:40:53,298][62475] Updated weights for policy 0, policy_version 76230 (0.0007) [2023-03-06 22:40:54,116][62475] Updated weights for policy 0, policy_version 76240 (0.0006) [2023-03-06 22:40:54,942][62475] Updated weights for policy 0, policy_version 76250 (0.0006) [2023-03-06 22:40:55,722][62475] Updated weights for policy 0, policy_version 76260 (0.0006) [2023-03-06 22:40:56,535][62475] Updated weights for policy 0, policy_version 76270 (0.0006) [2023-03-06 22:40:57,344][62475] Updated weights for policy 0, policy_version 76280 (0.0006) [2023-03-06 22:40:57,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 78110720. Throughput: 0: 12723.6. Samples: 78081017. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:40:57,390][62145] Avg episode reward: [(0, '685.165')] [2023-03-06 22:40:58,146][62475] Updated weights for policy 0, policy_version 76290 (0.0007) [2023-03-06 22:40:58,960][62475] Updated weights for policy 0, policy_version 76300 (0.0007) [2023-03-06 22:40:59,785][62475] Updated weights for policy 0, policy_version 76310 (0.0006) [2023-03-06 22:41:00,583][62475] Updated weights for policy 0, policy_version 76320 (0.0006) [2023-03-06 22:41:01,382][62475] Updated weights for policy 0, policy_version 76330 (0.0006) [2023-03-06 22:41:02,187][62475] Updated weights for policy 0, policy_version 76340 (0.0006) [2023-03-06 22:41:02,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 78174208. Throughput: 0: 12710.1. Samples: 78157207. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:41:02,390][62145] Avg episode reward: [(0, '774.434')] [2023-03-06 22:41:02,975][62475] Updated weights for policy 0, policy_version 76350 (0.0005) [2023-03-06 22:41:03,775][62475] Updated weights for policy 0, policy_version 76360 (0.0006) [2023-03-06 22:41:04,585][62475] Updated weights for policy 0, policy_version 76370 (0.0006) [2023-03-06 22:41:05,386][62475] Updated weights for policy 0, policy_version 76380 (0.0006) [2023-03-06 22:41:06,186][62475] Updated weights for policy 0, policy_version 76390 (0.0006) [2023-03-06 22:41:06,996][62475] Updated weights for policy 0, policy_version 76400 (0.0006) [2023-03-06 22:41:07,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 78237696. Throughput: 0: 12717.7. Samples: 78233712. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:41:07,390][62145] Avg episode reward: [(0, '684.239')] [2023-03-06 22:41:07,798][62475] Updated weights for policy 0, policy_version 76410 (0.0007) [2023-03-06 22:41:08,610][62475] Updated weights for policy 0, policy_version 76420 (0.0006) [2023-03-06 22:41:09,426][62475] Updated weights for policy 0, policy_version 76430 (0.0006) [2023-03-06 22:41:10,229][62475] Updated weights for policy 0, policy_version 76440 (0.0006) [2023-03-06 22:41:11,040][62475] Updated weights for policy 0, policy_version 76450 (0.0007) [2023-03-06 22:41:11,852][62475] Updated weights for policy 0, policy_version 76460 (0.0006) [2023-03-06 22:41:12,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12721.9). Total num frames: 78301184. Throughput: 0: 12710.1. Samples: 78271781. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:41:12,390][62145] Avg episode reward: [(0, '710.250')] [2023-03-06 22:41:12,665][62475] Updated weights for policy 0, policy_version 76470 (0.0006) [2023-03-06 22:41:13,445][62475] Updated weights for policy 0, policy_version 76480 (0.0006) [2023-03-06 22:41:14,246][62475] Updated weights for policy 0, policy_version 76490 (0.0006) [2023-03-06 22:41:15,041][62475] Updated weights for policy 0, policy_version 76500 (0.0006) [2023-03-06 22:41:15,842][62475] Updated weights for policy 0, policy_version 76510 (0.0006) [2023-03-06 22:41:16,660][62475] Updated weights for policy 0, policy_version 76520 (0.0006) [2023-03-06 22:41:17,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 78365696. Throughput: 0: 12711.9. Samples: 78348356. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:41:17,390][62145] Avg episode reward: [(0, '675.536')] [2023-03-06 22:41:17,477][62475] Updated weights for policy 0, policy_version 76530 (0.0006) [2023-03-06 22:41:18,263][62475] Updated weights for policy 0, policy_version 76540 (0.0006) [2023-03-06 22:41:19,110][62475] Updated weights for policy 0, policy_version 76550 (0.0006) [2023-03-06 22:41:19,918][62475] Updated weights for policy 0, policy_version 76560 (0.0006) [2023-03-06 22:41:20,737][62475] Updated weights for policy 0, policy_version 76570 (0.0007) [2023-03-06 22:41:21,550][62475] Updated weights for policy 0, policy_version 76580 (0.0007) [2023-03-06 22:41:22,341][62475] Updated weights for policy 0, policy_version 76590 (0.0006) [2023-03-06 22:41:22,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 78428160. Throughput: 0: 12691.9. Samples: 78423896. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:41:22,390][62145] Avg episode reward: [(0, '817.482')] [2023-03-06 22:41:23,144][62475] Updated weights for policy 0, policy_version 76600 (0.0006) [2023-03-06 22:41:23,961][62475] Updated weights for policy 0, policy_version 76610 (0.0007) [2023-03-06 22:41:24,771][62475] Updated weights for policy 0, policy_version 76620 (0.0006) [2023-03-06 22:41:25,582][62475] Updated weights for policy 0, policy_version 76630 (0.0006) [2023-03-06 22:41:26,377][62475] Updated weights for policy 0, policy_version 76640 (0.0007) [2023-03-06 22:41:27,188][62475] Updated weights for policy 0, policy_version 76650 (0.0006) [2023-03-06 22:41:27,389][62145] Fps is (10 sec: 12595.3, 60 sec: 12697.6, 300 sec: 12725.4). Total num frames: 78491648. Throughput: 0: 12691.6. Samples: 78461813. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:41:27,390][62145] Avg episode reward: [(0, '827.561')] [2023-03-06 22:41:27,983][62475] Updated weights for policy 0, policy_version 76660 (0.0006) [2023-03-06 22:41:28,794][62475] Updated weights for policy 0, policy_version 76670 (0.0006) [2023-03-06 22:41:29,604][62475] Updated weights for policy 0, policy_version 76680 (0.0006) [2023-03-06 22:41:30,389][62475] Updated weights for policy 0, policy_version 76690 (0.0006) [2023-03-06 22:41:31,219][62475] Updated weights for policy 0, policy_version 76700 (0.0006) [2023-03-06 22:41:32,014][62475] Updated weights for policy 0, policy_version 76710 (0.0006) [2023-03-06 22:41:32,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12721.9). Total num frames: 78555136. Throughput: 0: 12707.3. Samples: 78538389. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:41:32,390][62145] Avg episode reward: [(0, '803.222')] [2023-03-06 22:41:32,817][62475] Updated weights for policy 0, policy_version 76720 (0.0006) [2023-03-06 22:41:33,625][62475] Updated weights for policy 0, policy_version 76730 (0.0006) [2023-03-06 22:41:34,462][62475] Updated weights for policy 0, policy_version 76740 (0.0006) [2023-03-06 22:41:35,255][62475] Updated weights for policy 0, policy_version 76750 (0.0006) [2023-03-06 22:41:36,045][62475] Updated weights for policy 0, policy_version 76760 (0.0006) [2023-03-06 22:41:36,862][62475] Updated weights for policy 0, policy_version 76770 (0.0005) [2023-03-06 22:41:37,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12721.9). Total num frames: 78618624. Throughput: 0: 12699.6. Samples: 78614361. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:41:37,390][62145] Avg episode reward: [(0, '762.541')] [2023-03-06 22:41:37,681][62475] Updated weights for policy 0, policy_version 76780 (0.0006) [2023-03-06 22:41:38,462][62475] Updated weights for policy 0, policy_version 76790 (0.0006) [2023-03-06 22:41:39,291][62475] Updated weights for policy 0, policy_version 76800 (0.0006) [2023-03-06 22:41:40,097][62475] Updated weights for policy 0, policy_version 76810 (0.0006) [2023-03-06 22:41:40,876][62475] Updated weights for policy 0, policy_version 76820 (0.0005) [2023-03-06 22:41:41,680][62475] Updated weights for policy 0, policy_version 76830 (0.0007) [2023-03-06 22:41:42,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12721.9). Total num frames: 78682112. Throughput: 0: 12699.0. Samples: 78652470. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:41:42,390][62145] Avg episode reward: [(0, '906.096')] [2023-03-06 22:41:42,496][62475] Updated weights for policy 0, policy_version 76840 (0.0007) [2023-03-06 22:41:43,302][62475] Updated weights for policy 0, policy_version 76850 (0.0006) [2023-03-06 22:41:44,085][62475] Updated weights for policy 0, policy_version 76860 (0.0006) [2023-03-06 22:41:44,881][62475] Updated weights for policy 0, policy_version 76870 (0.0006) [2023-03-06 22:41:45,698][62475] Updated weights for policy 0, policy_version 76880 (0.0006) [2023-03-06 22:41:46,484][62475] Updated weights for policy 0, policy_version 76890 (0.0006) [2023-03-06 22:41:47,320][62475] Updated weights for policy 0, policy_version 76900 (0.0006) [2023-03-06 22:41:47,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 78746624. Throughput: 0: 12712.0. Samples: 78729248. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:41:47,390][62145] Avg episode reward: [(0, '586.842')] [2023-03-06 22:41:48,105][62475] Updated weights for policy 0, policy_version 76910 (0.0006) [2023-03-06 22:41:48,899][62475] Updated weights for policy 0, policy_version 76920 (0.0007) [2023-03-06 22:41:49,714][62475] Updated weights for policy 0, policy_version 76930 (0.0006) [2023-03-06 22:41:50,518][62475] Updated weights for policy 0, policy_version 76940 (0.0006) [2023-03-06 22:41:51,333][62475] Updated weights for policy 0, policy_version 76950 (0.0007) [2023-03-06 22:41:52,136][62475] Updated weights for policy 0, policy_version 76960 (0.0006) [2023-03-06 22:41:52,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 78810112. Throughput: 0: 12703.9. Samples: 78805387. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:41:52,390][62145] Avg episode reward: [(0, '897.081')] [2023-03-06 22:41:52,953][62475] Updated weights for policy 0, policy_version 76970 (0.0006) [2023-03-06 22:41:53,763][62475] Updated weights for policy 0, policy_version 76980 (0.0006) [2023-03-06 22:41:54,588][62475] Updated weights for policy 0, policy_version 76990 (0.0007) [2023-03-06 22:41:55,378][62475] Updated weights for policy 0, policy_version 77000 (0.0006) [2023-03-06 22:41:56,192][62475] Updated weights for policy 0, policy_version 77010 (0.0006) [2023-03-06 22:41:57,004][62475] Updated weights for policy 0, policy_version 77020 (0.0007) [2023-03-06 22:41:57,390][62145] Fps is (10 sec: 12697.4, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 78873600. Throughput: 0: 12699.7. Samples: 78843268. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:41:57,390][62145] Avg episode reward: [(0, '668.187')] [2023-03-06 22:41:57,780][62475] Updated weights for policy 0, policy_version 77030 (0.0006) [2023-03-06 22:41:58,602][62475] Updated weights for policy 0, policy_version 77040 (0.0006) [2023-03-06 22:41:59,392][62475] Updated weights for policy 0, policy_version 77050 (0.0006) [2023-03-06 22:42:00,203][62475] Updated weights for policy 0, policy_version 77060 (0.0006) [2023-03-06 22:42:01,002][62475] Updated weights for policy 0, policy_version 77070 (0.0006) [2023-03-06 22:42:01,800][62475] Updated weights for policy 0, policy_version 77080 (0.0007) [2023-03-06 22:42:02,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 78937088. Throughput: 0: 12699.3. Samples: 78919826. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:42:02,390][62145] Avg episode reward: [(0, '658.261')] [2023-03-06 22:42:02,625][62475] Updated weights for policy 0, policy_version 77090 (0.0007) [2023-03-06 22:42:03,424][62475] Updated weights for policy 0, policy_version 77100 (0.0006) [2023-03-06 22:42:04,221][62475] Updated weights for policy 0, policy_version 77110 (0.0006) [2023-03-06 22:42:05,014][62475] Updated weights for policy 0, policy_version 77120 (0.0006) [2023-03-06 22:42:05,846][62475] Updated weights for policy 0, policy_version 77130 (0.0006) [2023-03-06 22:42:06,634][62475] Updated weights for policy 0, policy_version 77140 (0.0006) [2023-03-06 22:42:07,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 79000576. Throughput: 0: 12715.1. Samples: 78996077. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:42:07,390][62145] Avg episode reward: [(0, '843.224')] [2023-03-06 22:42:07,437][62475] Updated weights for policy 0, policy_version 77150 (0.0006) [2023-03-06 22:42:08,246][62475] Updated weights for policy 0, policy_version 77160 (0.0006) [2023-03-06 22:42:09,062][62475] Updated weights for policy 0, policy_version 77170 (0.0006) [2023-03-06 22:42:09,881][62475] Updated weights for policy 0, policy_version 77180 (0.0006) [2023-03-06 22:42:10,667][62475] Updated weights for policy 0, policy_version 77190 (0.0007) [2023-03-06 22:42:11,477][62475] Updated weights for policy 0, policy_version 77200 (0.0006) [2023-03-06 22:42:12,288][62475] Updated weights for policy 0, policy_version 77210 (0.0007) [2023-03-06 22:42:12,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 79064064. Throughput: 0: 12718.2. Samples: 79034134. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:42:12,390][62145] Avg episode reward: [(0, '731.036')] [2023-03-06 22:42:13,077][62475] Updated weights for policy 0, policy_version 77220 (0.0006) [2023-03-06 22:42:13,877][62475] Updated weights for policy 0, policy_version 77230 (0.0006) [2023-03-06 22:42:14,697][62475] Updated weights for policy 0, policy_version 77240 (0.0006) [2023-03-06 22:42:15,488][62475] Updated weights for policy 0, policy_version 77250 (0.0006) [2023-03-06 22:42:16,293][62475] Updated weights for policy 0, policy_version 77260 (0.0008) [2023-03-06 22:42:17,088][62475] Updated weights for policy 0, policy_version 77270 (0.0007) [2023-03-06 22:42:17,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12725.4). Total num frames: 79127552. Throughput: 0: 12716.4. Samples: 79110624. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:42:17,390][62145] Avg episode reward: [(0, '657.541')] [2023-03-06 22:42:17,890][62475] Updated weights for policy 0, policy_version 77280 (0.0006) [2023-03-06 22:42:18,704][62475] Updated weights for policy 0, policy_version 77290 (0.0007) [2023-03-06 22:42:19,511][62475] Updated weights for policy 0, policy_version 77300 (0.0006) [2023-03-06 22:42:20,333][62475] Updated weights for policy 0, policy_version 77310 (0.0006) [2023-03-06 22:42:21,126][62475] Updated weights for policy 0, policy_version 77320 (0.0006) [2023-03-06 22:42:21,915][62475] Updated weights for policy 0, policy_version 77330 (0.0006) [2023-03-06 22:42:22,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 79191040. Throughput: 0: 12728.0. Samples: 79187120. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:42:22,390][62145] Avg episode reward: [(0, '568.077')] [2023-03-06 22:42:22,405][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000077336_79192064.pth... [2023-03-06 22:42:22,436][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000074353_76137472.pth [2023-03-06 22:42:22,739][62475] Updated weights for policy 0, policy_version 77340 (0.0006) [2023-03-06 22:42:23,545][62475] Updated weights for policy 0, policy_version 77350 (0.0007) [2023-03-06 22:42:24,331][62475] Updated weights for policy 0, policy_version 77360 (0.0006) [2023-03-06 22:42:25,131][62475] Updated weights for policy 0, policy_version 77370 (0.0006) [2023-03-06 22:42:25,916][62475] Updated weights for policy 0, policy_version 77380 (0.0007) [2023-03-06 22:42:26,725][62475] Updated weights for policy 0, policy_version 77390 (0.0006) [2023-03-06 22:42:27,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 79255552. Throughput: 0: 12727.8. Samples: 79225220. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:42:27,401][62145] Avg episode reward: [(0, '745.736')] [2023-03-06 22:42:27,554][62475] Updated weights for policy 0, policy_version 77400 (0.0006) [2023-03-06 22:42:28,337][62475] Updated weights for policy 0, policy_version 77410 (0.0006) [2023-03-06 22:42:29,172][62475] Updated weights for policy 0, policy_version 77420 (0.0006) [2023-03-06 22:42:29,962][62475] Updated weights for policy 0, policy_version 77430 (0.0006) [2023-03-06 22:42:30,780][62475] Updated weights for policy 0, policy_version 77440 (0.0006) [2023-03-06 22:42:31,578][62475] Updated weights for policy 0, policy_version 77450 (0.0006) [2023-03-06 22:42:32,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12721.9). Total num frames: 79318016. Throughput: 0: 12718.8. Samples: 79301597. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:42:32,397][62475] Updated weights for policy 0, policy_version 77460 (0.0007) [2023-03-06 22:42:32,401][62145] Avg episode reward: [(0, '933.325')] [2023-03-06 22:42:33,200][62475] Updated weights for policy 0, policy_version 77470 (0.0007) [2023-03-06 22:42:33,999][62475] Updated weights for policy 0, policy_version 77480 (0.0006) [2023-03-06 22:42:34,806][62475] Updated weights for policy 0, policy_version 77490 (0.0006) [2023-03-06 22:42:35,622][62475] Updated weights for policy 0, policy_version 77500 (0.0006) [2023-03-06 22:42:36,434][62475] Updated weights for policy 0, policy_version 77510 (0.0006) [2023-03-06 22:42:37,224][62475] Updated weights for policy 0, policy_version 77520 (0.0006) [2023-03-06 22:42:37,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 79382528. Throughput: 0: 12718.1. Samples: 79377701. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:42:37,401][62145] Avg episode reward: [(0, '696.254')] [2023-03-06 22:42:38,027][62475] Updated weights for policy 0, policy_version 77530 (0.0006) [2023-03-06 22:42:38,823][62475] Updated weights for policy 0, policy_version 77540 (0.0006) [2023-03-06 22:42:39,634][62475] Updated weights for policy 0, policy_version 77550 (0.0006) [2023-03-06 22:42:40,437][62475] Updated weights for policy 0, policy_version 77560 (0.0006) [2023-03-06 22:42:41,258][62475] Updated weights for policy 0, policy_version 77570 (0.0006) [2023-03-06 22:42:42,063][62475] Updated weights for policy 0, policy_version 77580 (0.0006) [2023-03-06 22:42:42,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 79446016. Throughput: 0: 12725.4. Samples: 79415914. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:42:42,401][62145] Avg episode reward: [(0, '590.474')] [2023-03-06 22:42:42,859][62475] Updated weights for policy 0, policy_version 77590 (0.0007) [2023-03-06 22:42:43,670][62475] Updated weights for policy 0, policy_version 77600 (0.0006) [2023-03-06 22:42:44,484][62475] Updated weights for policy 0, policy_version 77610 (0.0007) [2023-03-06 22:42:45,289][62475] Updated weights for policy 0, policy_version 77620 (0.0006) [2023-03-06 22:42:46,085][62475] Updated weights for policy 0, policy_version 77630 (0.0006) [2023-03-06 22:42:46,907][62475] Updated weights for policy 0, policy_version 77640 (0.0006) [2023-03-06 22:42:47,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.6, 300 sec: 12725.4). Total num frames: 79509504. Throughput: 0: 12718.3. Samples: 79492148. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:42:47,401][62145] Avg episode reward: [(0, '654.128')] [2023-03-06 22:42:47,691][62475] Updated weights for policy 0, policy_version 77650 (0.0006) [2023-03-06 22:42:48,486][62475] Updated weights for policy 0, policy_version 77660 (0.0006) [2023-03-06 22:42:49,304][62475] Updated weights for policy 0, policy_version 77670 (0.0006) [2023-03-06 22:42:50,104][62475] Updated weights for policy 0, policy_version 77680 (0.0006) [2023-03-06 22:42:50,910][62475] Updated weights for policy 0, policy_version 77690 (0.0006) [2023-03-06 22:42:51,705][62475] Updated weights for policy 0, policy_version 77700 (0.0006) [2023-03-06 22:42:52,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 79572992. Throughput: 0: 12723.8. Samples: 79568648. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:42:52,401][62145] Avg episode reward: [(0, '481.911')] [2023-03-06 22:42:52,511][62475] Updated weights for policy 0, policy_version 77710 (0.0006) [2023-03-06 22:42:53,322][62475] Updated weights for policy 0, policy_version 77720 (0.0007) [2023-03-06 22:42:54,121][62475] Updated weights for policy 0, policy_version 77730 (0.0006) [2023-03-06 22:42:54,937][62475] Updated weights for policy 0, policy_version 77740 (0.0006) [2023-03-06 22:42:55,737][62475] Updated weights for policy 0, policy_version 77750 (0.0006) [2023-03-06 22:42:56,536][62475] Updated weights for policy 0, policy_version 77760 (0.0006) [2023-03-06 22:42:57,351][62475] Updated weights for policy 0, policy_version 77770 (0.0006) [2023-03-06 22:42:57,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 79636480. Throughput: 0: 12726.0. Samples: 79606804. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:42:57,390][62145] Avg episode reward: [(0, '700.502')] [2023-03-06 22:42:58,162][62475] Updated weights for policy 0, policy_version 77780 (0.0006) [2023-03-06 22:42:58,956][62475] Updated weights for policy 0, policy_version 77790 (0.0006) [2023-03-06 22:42:59,774][62475] Updated weights for policy 0, policy_version 77800 (0.0006) [2023-03-06 22:43:00,574][62475] Updated weights for policy 0, policy_version 77810 (0.0006) [2023-03-06 22:43:01,371][62475] Updated weights for policy 0, policy_version 77820 (0.0006) [2023-03-06 22:43:02,182][62475] Updated weights for policy 0, policy_version 77830 (0.0006) [2023-03-06 22:43:02,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 79699968. Throughput: 0: 12720.9. Samples: 79683063. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:43:02,390][62145] Avg episode reward: [(0, '661.341')] [2023-03-06 22:43:02,993][62475] Updated weights for policy 0, policy_version 77840 (0.0006) [2023-03-06 22:43:03,766][62475] Updated weights for policy 0, policy_version 77850 (0.0006) [2023-03-06 22:43:04,585][62475] Updated weights for policy 0, policy_version 77860 (0.0006) [2023-03-06 22:43:05,406][62475] Updated weights for policy 0, policy_version 77870 (0.0006) [2023-03-06 22:43:06,198][62475] Updated weights for policy 0, policy_version 77880 (0.0006) [2023-03-06 22:43:06,989][62475] Updated weights for policy 0, policy_version 77890 (0.0006) [2023-03-06 22:43:07,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 79764480. Throughput: 0: 12721.3. Samples: 79759577. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:43:07,390][62145] Avg episode reward: [(0, '664.173')] [2023-03-06 22:43:07,803][62475] Updated weights for policy 0, policy_version 77900 (0.0006) [2023-03-06 22:43:08,604][62475] Updated weights for policy 0, policy_version 77910 (0.0006) [2023-03-06 22:43:09,434][62475] Updated weights for policy 0, policy_version 77920 (0.0006) [2023-03-06 22:43:10,217][62475] Updated weights for policy 0, policy_version 77930 (0.0006) [2023-03-06 22:43:11,029][62475] Updated weights for policy 0, policy_version 77940 (0.0006) [2023-03-06 22:43:11,834][62475] Updated weights for policy 0, policy_version 77950 (0.0006) [2023-03-06 22:43:12,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 79827968. Throughput: 0: 12721.1. Samples: 79797669. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:43:12,390][62145] Avg episode reward: [(0, '665.073')] [2023-03-06 22:43:12,617][62475] Updated weights for policy 0, policy_version 77960 (0.0006) [2023-03-06 22:43:13,409][62475] Updated weights for policy 0, policy_version 77970 (0.0007) [2023-03-06 22:43:14,230][62475] Updated weights for policy 0, policy_version 77980 (0.0006) [2023-03-06 22:43:15,029][62475] Updated weights for policy 0, policy_version 77990 (0.0006) [2023-03-06 22:43:15,833][62475] Updated weights for policy 0, policy_version 78000 (0.0006) [2023-03-06 22:43:16,637][62475] Updated weights for policy 0, policy_version 78010 (0.0005) [2023-03-06 22:43:17,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 79891456. Throughput: 0: 12725.3. Samples: 79874237. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:43:17,390][62145] Avg episode reward: [(0, '548.861')] [2023-03-06 22:43:17,443][62475] Updated weights for policy 0, policy_version 78020 (0.0006) [2023-03-06 22:43:18,232][62475] Updated weights for policy 0, policy_version 78030 (0.0006) [2023-03-06 22:43:19,048][62475] Updated weights for policy 0, policy_version 78040 (0.0007) [2023-03-06 22:43:19,867][62475] Updated weights for policy 0, policy_version 78050 (0.0006) [2023-03-06 22:43:20,671][62475] Updated weights for policy 0, policy_version 78060 (0.0006) [2023-03-06 22:43:21,481][62475] Updated weights for policy 0, policy_version 78070 (0.0007) [2023-03-06 22:43:22,286][62475] Updated weights for policy 0, policy_version 78080 (0.0007) [2023-03-06 22:43:22,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 79954944. Throughput: 0: 12727.1. Samples: 79950421. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:43:22,390][62145] Avg episode reward: [(0, '686.324')] [2023-03-06 22:43:23,092][62475] Updated weights for policy 0, policy_version 78090 (0.0006) [2023-03-06 22:43:23,885][62475] Updated weights for policy 0, policy_version 78100 (0.0007) [2023-03-06 22:43:24,706][62475] Updated weights for policy 0, policy_version 78110 (0.0006) [2023-03-06 22:43:25,507][62475] Updated weights for policy 0, policy_version 78120 (0.0006) [2023-03-06 22:43:26,306][62475] Updated weights for policy 0, policy_version 78130 (0.0006) [2023-03-06 22:43:27,125][62475] Updated weights for policy 0, policy_version 78140 (0.0006) [2023-03-06 22:43:27,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 80018432. Throughput: 0: 12727.1. Samples: 79988631. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:43:27,390][62145] Avg episode reward: [(0, '743.983')] [2023-03-06 22:43:27,929][62475] Updated weights for policy 0, policy_version 78150 (0.0006) [2023-03-06 22:43:28,730][62475] Updated weights for policy 0, policy_version 78160 (0.0006) [2023-03-06 22:43:29,548][62475] Updated weights for policy 0, policy_version 78170 (0.0006) [2023-03-06 22:43:30,340][62475] Updated weights for policy 0, policy_version 78180 (0.0006) [2023-03-06 22:43:31,158][62475] Updated weights for policy 0, policy_version 78190 (0.0006) [2023-03-06 22:43:31,960][62475] Updated weights for policy 0, policy_version 78200 (0.0007) [2023-03-06 22:43:32,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 80081920. Throughput: 0: 12726.9. Samples: 80064858. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:43:32,390][62145] Avg episode reward: [(0, '799.999')] [2023-03-06 22:43:32,753][62475] Updated weights for policy 0, policy_version 78210 (0.0006) [2023-03-06 22:43:33,567][62475] Updated weights for policy 0, policy_version 78220 (0.0007) [2023-03-06 22:43:34,360][62475] Updated weights for policy 0, policy_version 78230 (0.0005) [2023-03-06 22:43:35,171][62475] Updated weights for policy 0, policy_version 78240 (0.0006) [2023-03-06 22:43:35,979][62475] Updated weights for policy 0, policy_version 78250 (0.0006) [2023-03-06 22:43:36,782][62475] Updated weights for policy 0, policy_version 78260 (0.0006) [2023-03-06 22:43:37,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 80145408. Throughput: 0: 12726.0. Samples: 80141319. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:43:37,390][62145] Avg episode reward: [(0, '741.786')] [2023-03-06 22:43:37,567][62475] Updated weights for policy 0, policy_version 78270 (0.0006) [2023-03-06 22:43:38,382][62475] Updated weights for policy 0, policy_version 78280 (0.0006) [2023-03-06 22:43:39,190][62475] Updated weights for policy 0, policy_version 78290 (0.0006) [2023-03-06 22:43:39,973][62475] Updated weights for policy 0, policy_version 78300 (0.0006) [2023-03-06 22:43:40,775][62475] Updated weights for policy 0, policy_version 78310 (0.0006) [2023-03-06 22:43:41,587][62475] Updated weights for policy 0, policy_version 78320 (0.0006) [2023-03-06 22:43:42,371][62475] Updated weights for policy 0, policy_version 78330 (0.0006) [2023-03-06 22:43:42,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 80209920. Throughput: 0: 12726.5. Samples: 80179499. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:43:42,390][62145] Avg episode reward: [(0, '587.740')] [2023-03-06 22:43:43,189][62475] Updated weights for policy 0, policy_version 78340 (0.0006) [2023-03-06 22:43:43,988][62475] Updated weights for policy 0, policy_version 78350 (0.0006) [2023-03-06 22:43:44,809][62475] Updated weights for policy 0, policy_version 78360 (0.0007) [2023-03-06 22:43:45,602][62475] Updated weights for policy 0, policy_version 78370 (0.0006) [2023-03-06 22:43:46,434][62475] Updated weights for policy 0, policy_version 78380 (0.0006) [2023-03-06 22:43:47,233][62475] Updated weights for policy 0, policy_version 78390 (0.0006) [2023-03-06 22:43:47,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 80272384. Throughput: 0: 12730.9. Samples: 80255954. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:43:47,390][62145] Avg episode reward: [(0, '754.109')] [2023-03-06 22:43:48,038][62475] Updated weights for policy 0, policy_version 78400 (0.0007) [2023-03-06 22:43:48,853][62475] Updated weights for policy 0, policy_version 78410 (0.0006) [2023-03-06 22:43:49,660][62475] Updated weights for policy 0, policy_version 78420 (0.0007) [2023-03-06 22:43:50,458][62475] Updated weights for policy 0, policy_version 78430 (0.0006) [2023-03-06 22:43:51,267][62475] Updated weights for policy 0, policy_version 78440 (0.0006) [2023-03-06 22:43:52,070][62475] Updated weights for policy 0, policy_version 78450 (0.0006) [2023-03-06 22:43:52,390][62145] Fps is (10 sec: 12595.3, 60 sec: 12714.7, 300 sec: 12721.9). Total num frames: 80335872. Throughput: 0: 12722.3. Samples: 80332080. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:43:52,390][62145] Avg episode reward: [(0, '635.952')] [2023-03-06 22:43:52,867][62475] Updated weights for policy 0, policy_version 78460 (0.0006) [2023-03-06 22:43:53,674][62475] Updated weights for policy 0, policy_version 78470 (0.0006) [2023-03-06 22:43:54,484][62475] Updated weights for policy 0, policy_version 78480 (0.0007) [2023-03-06 22:43:55,283][62475] Updated weights for policy 0, policy_version 78490 (0.0006) [2023-03-06 22:43:56,078][62475] Updated weights for policy 0, policy_version 78500 (0.0006) [2023-03-06 22:43:56,906][62475] Updated weights for policy 0, policy_version 78510 (0.0006) [2023-03-06 22:43:57,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 80400384. Throughput: 0: 12725.6. Samples: 80370322. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:43:57,390][62145] Avg episode reward: [(0, '885.404')] [2023-03-06 22:43:57,718][62475] Updated weights for policy 0, policy_version 78520 (0.0006) [2023-03-06 22:43:58,511][62475] Updated weights for policy 0, policy_version 78530 (0.0008) [2023-03-06 22:43:59,320][62475] Updated weights for policy 0, policy_version 78540 (0.0007) [2023-03-06 22:44:00,110][62475] Updated weights for policy 0, policy_version 78550 (0.0006) [2023-03-06 22:44:00,915][62475] Updated weights for policy 0, policy_version 78560 (0.0006) [2023-03-06 22:44:01,733][62475] Updated weights for policy 0, policy_version 78570 (0.0006) [2023-03-06 22:44:02,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 80463872. Throughput: 0: 12720.5. Samples: 80446659. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:44:02,390][62145] Avg episode reward: [(0, '762.092')] [2023-03-06 22:44:02,542][62475] Updated weights for policy 0, policy_version 78580 (0.0005) [2023-03-06 22:44:03,334][62475] Updated weights for policy 0, policy_version 78590 (0.0007) [2023-03-06 22:44:04,143][62475] Updated weights for policy 0, policy_version 78600 (0.0006) [2023-03-06 22:44:04,940][62475] Updated weights for policy 0, policy_version 78610 (0.0006) [2023-03-06 22:44:05,744][62475] Updated weights for policy 0, policy_version 78620 (0.0006) [2023-03-06 22:44:06,536][62475] Updated weights for policy 0, policy_version 78630 (0.0006) [2023-03-06 22:44:07,358][62475] Updated weights for policy 0, policy_version 78640 (0.0006) [2023-03-06 22:44:07,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 80527360. Throughput: 0: 12723.2. Samples: 80522965. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:44:07,401][62145] Avg episode reward: [(0, '769.658')] [2023-03-06 22:44:08,152][62475] Updated weights for policy 0, policy_version 78650 (0.0006) [2023-03-06 22:44:08,965][62475] Updated weights for policy 0, policy_version 78660 (0.0006) [2023-03-06 22:44:09,759][62475] Updated weights for policy 0, policy_version 78670 (0.0006) [2023-03-06 22:44:10,555][62475] Updated weights for policy 0, policy_version 78680 (0.0006) [2023-03-06 22:44:11,343][62475] Updated weights for policy 0, policy_version 78690 (0.0006) [2023-03-06 22:44:12,152][62475] Updated weights for policy 0, policy_version 78700 (0.0006) [2023-03-06 22:44:12,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 80591872. Throughput: 0: 12727.2. Samples: 80561354. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:44:12,401][62145] Avg episode reward: [(0, '779.346')] [2023-03-06 22:44:12,949][62475] Updated weights for policy 0, policy_version 78710 (0.0007) [2023-03-06 22:44:13,757][62475] Updated weights for policy 0, policy_version 78720 (0.0006) [2023-03-06 22:44:14,577][62475] Updated weights for policy 0, policy_version 78730 (0.0007) [2023-03-06 22:44:15,377][62475] Updated weights for policy 0, policy_version 78740 (0.0007) [2023-03-06 22:44:16,190][62475] Updated weights for policy 0, policy_version 78750 (0.0007) [2023-03-06 22:44:16,997][62475] Updated weights for policy 0, policy_version 78760 (0.0007) [2023-03-06 22:44:17,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12731.8, 300 sec: 12728.8). Total num frames: 80655360. Throughput: 0: 12733.3. Samples: 80637857. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:44:17,401][62145] Avg episode reward: [(0, '849.931')] [2023-03-06 22:44:17,810][62475] Updated weights for policy 0, policy_version 78770 (0.0006) [2023-03-06 22:44:18,606][62475] Updated weights for policy 0, policy_version 78780 (0.0006) [2023-03-06 22:44:19,425][62475] Updated weights for policy 0, policy_version 78790 (0.0007) [2023-03-06 22:44:20,226][62475] Updated weights for policy 0, policy_version 78800 (0.0006) [2023-03-06 22:44:21,025][62475] Updated weights for policy 0, policy_version 78810 (0.0007) [2023-03-06 22:44:21,829][62475] Updated weights for policy 0, policy_version 78820 (0.0006) [2023-03-06 22:44:22,389][62145] Fps is (10 sec: 12595.2, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 80717824. Throughput: 0: 12725.4. Samples: 80713964. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:44:22,400][62145] Avg episode reward: [(0, '902.409')] [2023-03-06 22:44:22,404][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000078827_80718848.pth... [2023-03-06 22:44:22,436][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000075846_77666304.pth [2023-03-06 22:44:22,646][62475] Updated weights for policy 0, policy_version 78830 (0.0006) [2023-03-06 22:44:23,448][62475] Updated weights for policy 0, policy_version 78840 (0.0006) [2023-03-06 22:44:24,253][62475] Updated weights for policy 0, policy_version 78850 (0.0006) [2023-03-06 22:44:25,054][62475] Updated weights for policy 0, policy_version 78860 (0.0006) [2023-03-06 22:44:25,862][62475] Updated weights for policy 0, policy_version 78870 (0.0006) [2023-03-06 22:44:26,677][62475] Updated weights for policy 0, policy_version 78880 (0.0006) [2023-03-06 22:44:27,389][62145] Fps is (10 sec: 12595.1, 60 sec: 12714.7, 300 sec: 12721.9). Total num frames: 80781312. Throughput: 0: 12721.1. Samples: 80751946. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:44:27,400][62145] Avg episode reward: [(0, '954.604')] [2023-03-06 22:44:27,498][62475] Updated weights for policy 0, policy_version 78890 (0.0006) [2023-03-06 22:44:28,312][62475] Updated weights for policy 0, policy_version 78900 (0.0007) [2023-03-06 22:44:29,112][62475] Updated weights for policy 0, policy_version 78910 (0.0006) [2023-03-06 22:44:29,923][62475] Updated weights for policy 0, policy_version 78920 (0.0006) [2023-03-06 22:44:30,740][62475] Updated weights for policy 0, policy_version 78930 (0.0006) [2023-03-06 22:44:31,526][62475] Updated weights for policy 0, policy_version 78940 (0.0005) [2023-03-06 22:44:32,334][62475] Updated weights for policy 0, policy_version 78950 (0.0006) [2023-03-06 22:44:32,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12721.9). Total num frames: 80844800. Throughput: 0: 12708.7. Samples: 80827846. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:44:32,401][62145] Avg episode reward: [(0, '700.359')] [2023-03-06 22:44:33,141][62475] Updated weights for policy 0, policy_version 78960 (0.0006) [2023-03-06 22:44:33,952][62475] Updated weights for policy 0, policy_version 78970 (0.0006) [2023-03-06 22:44:34,747][62475] Updated weights for policy 0, policy_version 78980 (0.0006) [2023-03-06 22:44:35,551][62475] Updated weights for policy 0, policy_version 78990 (0.0006) [2023-03-06 22:44:36,364][62475] Updated weights for policy 0, policy_version 79000 (0.0006) [2023-03-06 22:44:37,178][62475] Updated weights for policy 0, policy_version 79010 (0.0006) [2023-03-06 22:44:37,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12721.9). Total num frames: 80908288. Throughput: 0: 12713.2. Samples: 80904174. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:44:37,390][62145] Avg episode reward: [(0, '572.640')] [2023-03-06 22:44:37,986][62475] Updated weights for policy 0, policy_version 79020 (0.0007) [2023-03-06 22:44:38,798][62475] Updated weights for policy 0, policy_version 79030 (0.0006) [2023-03-06 22:44:39,614][62475] Updated weights for policy 0, policy_version 79040 (0.0006) [2023-03-06 22:44:40,395][62475] Updated weights for policy 0, policy_version 79050 (0.0006) [2023-03-06 22:44:41,217][62475] Updated weights for policy 0, policy_version 79060 (0.0006) [2023-03-06 22:44:42,008][62475] Updated weights for policy 0, policy_version 79070 (0.0005) [2023-03-06 22:44:42,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12721.9). Total num frames: 80971776. Throughput: 0: 12707.5. Samples: 80942160. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:44:42,390][62145] Avg episode reward: [(0, '647.298')] [2023-03-06 22:44:42,784][62475] Updated weights for policy 0, policy_version 79080 (0.0006) [2023-03-06 22:44:43,598][62475] Updated weights for policy 0, policy_version 79090 (0.0007) [2023-03-06 22:44:44,403][62475] Updated weights for policy 0, policy_version 79100 (0.0006) [2023-03-06 22:44:45,204][62475] Updated weights for policy 0, policy_version 79110 (0.0007) [2023-03-06 22:44:46,001][62475] Updated weights for policy 0, policy_version 79120 (0.0006) [2023-03-06 22:44:46,796][62475] Updated weights for policy 0, policy_version 79130 (0.0006) [2023-03-06 22:44:47,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12731.8, 300 sec: 12721.9). Total num frames: 81036288. Throughput: 0: 12717.2. Samples: 81018932. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:44:47,390][62145] Avg episode reward: [(0, '648.672')] [2023-03-06 22:44:47,599][62475] Updated weights for policy 0, policy_version 79140 (0.0006) [2023-03-06 22:44:48,409][62475] Updated weights for policy 0, policy_version 79150 (0.0006) [2023-03-06 22:44:49,218][62475] Updated weights for policy 0, policy_version 79160 (0.0007) [2023-03-06 22:44:50,017][62475] Updated weights for policy 0, policy_version 79170 (0.0006) [2023-03-06 22:44:50,812][62475] Updated weights for policy 0, policy_version 79180 (0.0007) [2023-03-06 22:44:51,624][62475] Updated weights for policy 0, policy_version 79190 (0.0006) [2023-03-06 22:44:52,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12731.7, 300 sec: 12721.9). Total num frames: 81099776. Throughput: 0: 12724.2. Samples: 81095554. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:44:52,390][62145] Avg episode reward: [(0, '631.653')] [2023-03-06 22:44:52,407][62475] Updated weights for policy 0, policy_version 79200 (0.0006) [2023-03-06 22:44:53,252][62475] Updated weights for policy 0, policy_version 79210 (0.0007) [2023-03-06 22:44:54,038][62475] Updated weights for policy 0, policy_version 79220 (0.0006) [2023-03-06 22:44:54,822][62475] Updated weights for policy 0, policy_version 79230 (0.0006) [2023-03-06 22:44:55,629][62475] Updated weights for policy 0, policy_version 79240 (0.0007) [2023-03-06 22:44:56,455][62475] Updated weights for policy 0, policy_version 79250 (0.0006) [2023-03-06 22:44:57,253][62475] Updated weights for policy 0, policy_version 79260 (0.0006) [2023-03-06 22:44:57,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12721.9). Total num frames: 81163264. Throughput: 0: 12721.4. Samples: 81133817. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:44:57,390][62145] Avg episode reward: [(0, '721.166')] [2023-03-06 22:44:58,069][62475] Updated weights for policy 0, policy_version 79270 (0.0006) [2023-03-06 22:44:58,867][62475] Updated weights for policy 0, policy_version 79280 (0.0007) [2023-03-06 22:44:59,653][62475] Updated weights for policy 0, policy_version 79290 (0.0006) [2023-03-06 22:45:00,466][62475] Updated weights for policy 0, policy_version 79300 (0.0007) [2023-03-06 22:45:01,271][62475] Updated weights for policy 0, policy_version 79310 (0.0006) [2023-03-06 22:45:02,080][62475] Updated weights for policy 0, policy_version 79320 (0.0006) [2023-03-06 22:45:02,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12718.4). Total num frames: 81226752. Throughput: 0: 12720.6. Samples: 81210285. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:45:02,390][62145] Avg episode reward: [(0, '879.453')] [2023-03-06 22:45:02,889][62475] Updated weights for policy 0, policy_version 79330 (0.0006) [2023-03-06 22:45:03,678][62475] Updated weights for policy 0, policy_version 79340 (0.0006) [2023-03-06 22:45:04,502][62475] Updated weights for policy 0, policy_version 79350 (0.0006) [2023-03-06 22:45:05,319][62475] Updated weights for policy 0, policy_version 79360 (0.0006) [2023-03-06 22:45:06,129][62475] Updated weights for policy 0, policy_version 79370 (0.0006) [2023-03-06 22:45:06,928][62475] Updated weights for policy 0, policy_version 79380 (0.0006) [2023-03-06 22:45:07,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12718.4). Total num frames: 81290240. Throughput: 0: 12711.1. Samples: 81285962. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:45:07,390][62145] Avg episode reward: [(0, '880.222')] [2023-03-06 22:45:07,764][62475] Updated weights for policy 0, policy_version 79390 (0.0007) [2023-03-06 22:45:08,568][62475] Updated weights for policy 0, policy_version 79400 (0.0006) [2023-03-06 22:45:09,369][62475] Updated weights for policy 0, policy_version 79410 (0.0006) [2023-03-06 22:45:10,184][62475] Updated weights for policy 0, policy_version 79420 (0.0006) [2023-03-06 22:45:10,989][62475] Updated weights for policy 0, policy_version 79430 (0.0006) [2023-03-06 22:45:11,783][62475] Updated weights for policy 0, policy_version 79440 (0.0007) [2023-03-06 22:45:12,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12718.4). Total num frames: 81353728. Throughput: 0: 12711.0. Samples: 81323942. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:45:12,390][62145] Avg episode reward: [(0, '719.199')] [2023-03-06 22:45:12,578][62475] Updated weights for policy 0, policy_version 79450 (0.0006) [2023-03-06 22:45:13,385][62475] Updated weights for policy 0, policy_version 79460 (0.0006) [2023-03-06 22:45:14,194][62475] Updated weights for policy 0, policy_version 79470 (0.0006) [2023-03-06 22:45:15,001][62475] Updated weights for policy 0, policy_version 79480 (0.0006) [2023-03-06 22:45:15,791][62475] Updated weights for policy 0, policy_version 79490 (0.0006) [2023-03-06 22:45:16,610][62475] Updated weights for policy 0, policy_version 79500 (0.0006) [2023-03-06 22:45:17,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12718.4). Total num frames: 81417216. Throughput: 0: 12725.3. Samples: 81400483. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:45:17,390][62145] Avg episode reward: [(0, '674.365')] [2023-03-06 22:45:17,413][62475] Updated weights for policy 0, policy_version 79510 (0.0006) [2023-03-06 22:45:18,213][62475] Updated weights for policy 0, policy_version 79520 (0.0006) [2023-03-06 22:45:19,027][62475] Updated weights for policy 0, policy_version 79530 (0.0006) [2023-03-06 22:45:19,832][62475] Updated weights for policy 0, policy_version 79540 (0.0007) [2023-03-06 22:45:20,661][62475] Updated weights for policy 0, policy_version 79550 (0.0006) [2023-03-06 22:45:21,457][62475] Updated weights for policy 0, policy_version 79560 (0.0007) [2023-03-06 22:45:22,282][62475] Updated weights for policy 0, policy_version 79570 (0.0006) [2023-03-06 22:45:22,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12715.0). Total num frames: 81480704. Throughput: 0: 12713.3. Samples: 81476274. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:45:22,390][62145] Avg episode reward: [(0, '908.282')] [2023-03-06 22:45:23,088][62475] Updated weights for policy 0, policy_version 79580 (0.0006) [2023-03-06 22:45:23,897][62475] Updated weights for policy 0, policy_version 79590 (0.0007) [2023-03-06 22:45:24,694][62475] Updated weights for policy 0, policy_version 79600 (0.0006) [2023-03-06 22:45:25,500][62475] Updated weights for policy 0, policy_version 79610 (0.0006) [2023-03-06 22:45:26,274][62475] Updated weights for policy 0, policy_version 79620 (0.0007) [2023-03-06 22:45:27,109][62475] Updated weights for policy 0, policy_version 79630 (0.0006) [2023-03-06 22:45:27,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12715.0). Total num frames: 81544192. Throughput: 0: 12718.6. Samples: 81514500. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:45:27,390][62145] Avg episode reward: [(0, '726.108')] [2023-03-06 22:45:27,892][62475] Updated weights for policy 0, policy_version 79640 (0.0006) [2023-03-06 22:45:28,686][62475] Updated weights for policy 0, policy_version 79650 (0.0006) [2023-03-06 22:45:29,510][62475] Updated weights for policy 0, policy_version 79660 (0.0007) [2023-03-06 22:45:30,308][62475] Updated weights for policy 0, policy_version 79670 (0.0006) [2023-03-06 22:45:31,102][62475] Updated weights for policy 0, policy_version 79680 (0.0005) [2023-03-06 22:45:31,926][62475] Updated weights for policy 0, policy_version 79690 (0.0006) [2023-03-06 22:45:32,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12715.0). Total num frames: 81607680. Throughput: 0: 12714.9. Samples: 81591101. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:45:32,390][62145] Avg episode reward: [(0, '818.328')] [2023-03-06 22:45:32,718][62475] Updated weights for policy 0, policy_version 79700 (0.0006) [2023-03-06 22:45:33,521][62475] Updated weights for policy 0, policy_version 79710 (0.0006) [2023-03-06 22:45:34,345][62475] Updated weights for policy 0, policy_version 79720 (0.0007) [2023-03-06 22:45:35,140][62475] Updated weights for policy 0, policy_version 79730 (0.0007) [2023-03-06 22:45:35,945][62475] Updated weights for policy 0, policy_version 79740 (0.0007) [2023-03-06 22:45:36,749][62475] Updated weights for policy 0, policy_version 79750 (0.0006) [2023-03-06 22:45:37,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12718.4). Total num frames: 81672192. Throughput: 0: 12704.7. Samples: 81667266. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:45:37,390][62145] Avg episode reward: [(0, '772.118')] [2023-03-06 22:45:37,550][62475] Updated weights for policy 0, policy_version 79760 (0.0006) [2023-03-06 22:45:38,349][62475] Updated weights for policy 0, policy_version 79770 (0.0008) [2023-03-06 22:45:39,141][62475] Updated weights for policy 0, policy_version 79780 (0.0005) [2023-03-06 22:45:39,942][62475] Updated weights for policy 0, policy_version 79790 (0.0007) [2023-03-06 22:45:40,748][62475] Updated weights for policy 0, policy_version 79800 (0.0007) [2023-03-06 22:45:41,557][62475] Updated weights for policy 0, policy_version 79810 (0.0006) [2023-03-06 22:45:42,371][62475] Updated weights for policy 0, policy_version 79820 (0.0007) [2023-03-06 22:45:42,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12731.7, 300 sec: 12718.4). Total num frames: 81735680. Throughput: 0: 12709.4. Samples: 81705739. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:45:42,390][62145] Avg episode reward: [(0, '832.624')] [2023-03-06 22:45:43,149][62475] Updated weights for policy 0, policy_version 79830 (0.0006) [2023-03-06 22:45:43,954][62475] Updated weights for policy 0, policy_version 79840 (0.0006) [2023-03-06 22:45:44,781][62475] Updated weights for policy 0, policy_version 79850 (0.0006) [2023-03-06 22:45:45,575][62475] Updated weights for policy 0, policy_version 79860 (0.0007) [2023-03-06 22:45:46,368][62475] Updated weights for policy 0, policy_version 79870 (0.0006) [2023-03-06 22:45:47,193][62475] Updated weights for policy 0, policy_version 79880 (0.0006) [2023-03-06 22:45:47,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12718.4). Total num frames: 81799168. Throughput: 0: 12708.2. Samples: 81782153. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:45:47,390][62145] Avg episode reward: [(0, '916.768')] [2023-03-06 22:45:47,976][62475] Updated weights for policy 0, policy_version 79890 (0.0007) [2023-03-06 22:45:48,791][62475] Updated weights for policy 0, policy_version 79900 (0.0007) [2023-03-06 22:45:49,583][62475] Updated weights for policy 0, policy_version 79910 (0.0006) [2023-03-06 22:45:50,391][62475] Updated weights for policy 0, policy_version 79920 (0.0006) [2023-03-06 22:45:51,201][62475] Updated weights for policy 0, policy_version 79930 (0.0006) [2023-03-06 22:45:52,004][62475] Updated weights for policy 0, policy_version 79940 (0.0006) [2023-03-06 22:45:52,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12718.4). Total num frames: 81862656. Throughput: 0: 12727.6. Samples: 81858704. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:45:52,390][62145] Avg episode reward: [(0, '686.217')] [2023-03-06 22:45:52,806][62475] Updated weights for policy 0, policy_version 79950 (0.0006) [2023-03-06 22:45:53,609][62475] Updated weights for policy 0, policy_version 79960 (0.0006) [2023-03-06 22:45:54,446][62475] Updated weights for policy 0, policy_version 79970 (0.0006) [2023-03-06 22:45:55,247][62475] Updated weights for policy 0, policy_version 79980 (0.0006) [2023-03-06 22:45:56,047][62475] Updated weights for policy 0, policy_version 79990 (0.0006) [2023-03-06 22:45:56,865][62475] Updated weights for policy 0, policy_version 80000 (0.0006) [2023-03-06 22:45:57,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12718.4). Total num frames: 81926144. Throughput: 0: 12725.7. Samples: 81896597. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:45:57,390][62145] Avg episode reward: [(0, '904.826')] [2023-03-06 22:45:57,655][62475] Updated weights for policy 0, policy_version 80010 (0.0006) [2023-03-06 22:45:58,459][62475] Updated weights for policy 0, policy_version 80020 (0.0006) [2023-03-06 22:45:59,250][62475] Updated weights for policy 0, policy_version 80030 (0.0007) [2023-03-06 22:46:00,065][62475] Updated weights for policy 0, policy_version 80040 (0.0006) [2023-03-06 22:46:00,866][62475] Updated weights for policy 0, policy_version 80050 (0.0005) [2023-03-06 22:46:01,672][62475] Updated weights for policy 0, policy_version 80060 (0.0006) [2023-03-06 22:46:02,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12718.4). Total num frames: 81989632. Throughput: 0: 12720.3. Samples: 81972897. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:46:02,390][62145] Avg episode reward: [(0, '918.499')] [2023-03-06 22:46:02,498][62475] Updated weights for policy 0, policy_version 80070 (0.0006) [2023-03-06 22:46:03,304][62475] Updated weights for policy 0, policy_version 80080 (0.0006) [2023-03-06 22:46:04,091][62475] Updated weights for policy 0, policy_version 80090 (0.0006) [2023-03-06 22:46:04,910][62475] Updated weights for policy 0, policy_version 80100 (0.0006) [2023-03-06 22:46:05,712][62475] Updated weights for policy 0, policy_version 80110 (0.0006) [2023-03-06 22:46:06,510][62475] Updated weights for policy 0, policy_version 80120 (0.0006) [2023-03-06 22:46:07,330][62475] Updated weights for policy 0, policy_version 80130 (0.0006) [2023-03-06 22:46:07,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.6, 300 sec: 12718.4). Total num frames: 82053120. Throughput: 0: 12730.4. Samples: 82049145. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:46:07,390][62145] Avg episode reward: [(0, '1034.793')] [2023-03-06 22:46:08,128][62475] Updated weights for policy 0, policy_version 80140 (0.0007) [2023-03-06 22:46:08,926][62475] Updated weights for policy 0, policy_version 80150 (0.0006) [2023-03-06 22:46:09,730][62475] Updated weights for policy 0, policy_version 80160 (0.0006) [2023-03-06 22:46:10,527][62475] Updated weights for policy 0, policy_version 80170 (0.0006) [2023-03-06 22:46:11,339][62475] Updated weights for policy 0, policy_version 80180 (0.0007) [2023-03-06 22:46:12,167][62475] Updated weights for policy 0, policy_version 80190 (0.0006) [2023-03-06 22:46:12,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12718.4). Total num frames: 82117632. Throughput: 0: 12732.6. Samples: 82087468. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:46:12,390][62145] Avg episode reward: [(0, '958.743')] [2023-03-06 22:46:12,961][62475] Updated weights for policy 0, policy_version 80200 (0.0006) [2023-03-06 22:46:13,760][62475] Updated weights for policy 0, policy_version 80210 (0.0007) [2023-03-06 22:46:14,580][62475] Updated weights for policy 0, policy_version 80220 (0.0007) [2023-03-06 22:46:15,400][62475] Updated weights for policy 0, policy_version 80230 (0.0006) [2023-03-06 22:46:16,198][62475] Updated weights for policy 0, policy_version 80240 (0.0007) [2023-03-06 22:46:17,007][62475] Updated weights for policy 0, policy_version 80250 (0.0006) [2023-03-06 22:46:17,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12718.4). Total num frames: 82180096. Throughput: 0: 12719.7. Samples: 82163489. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:46:17,390][62145] Avg episode reward: [(0, '762.753')] [2023-03-06 22:46:17,817][62475] Updated weights for policy 0, policy_version 80260 (0.0006) [2023-03-06 22:46:18,614][62475] Updated weights for policy 0, policy_version 80270 (0.0006) [2023-03-06 22:46:19,446][62475] Updated weights for policy 0, policy_version 80280 (0.0006) [2023-03-06 22:46:20,241][62475] Updated weights for policy 0, policy_version 80290 (0.0006) [2023-03-06 22:46:21,024][62475] Updated weights for policy 0, policy_version 80300 (0.0006) [2023-03-06 22:46:21,833][62475] Updated weights for policy 0, policy_version 80310 (0.0006) [2023-03-06 22:46:22,390][62145] Fps is (10 sec: 12595.0, 60 sec: 12714.6, 300 sec: 12718.4). Total num frames: 82243584. Throughput: 0: 12718.7. Samples: 82239610. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:46:22,390][62145] Avg episode reward: [(0, '751.144')] [2023-03-06 22:46:22,400][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000080317_82244608.pth... [2023-03-06 22:46:22,432][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000077336_79192064.pth [2023-03-06 22:46:22,650][62475] Updated weights for policy 0, policy_version 80320 (0.0006) [2023-03-06 22:46:23,440][62475] Updated weights for policy 0, policy_version 80330 (0.0005) [2023-03-06 22:46:24,249][62475] Updated weights for policy 0, policy_version 80340 (0.0006) [2023-03-06 22:46:25,072][62475] Updated weights for policy 0, policy_version 80350 (0.0007) [2023-03-06 22:46:25,858][62475] Updated weights for policy 0, policy_version 80360 (0.0006) [2023-03-06 22:46:26,654][62475] Updated weights for policy 0, policy_version 80370 (0.0006) [2023-03-06 22:46:27,390][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12718.4). Total num frames: 82307072. Throughput: 0: 12710.6. Samples: 82277716. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:46:27,390][62145] Avg episode reward: [(0, '835.825')] [2023-03-06 22:46:27,457][62475] Updated weights for policy 0, policy_version 80380 (0.0006) [2023-03-06 22:46:28,268][62475] Updated weights for policy 0, policy_version 80390 (0.0006) [2023-03-06 22:46:29,089][62475] Updated weights for policy 0, policy_version 80400 (0.0006) [2023-03-06 22:46:29,911][62475] Updated weights for policy 0, policy_version 80410 (0.0006) [2023-03-06 22:46:30,692][62475] Updated weights for policy 0, policy_version 80420 (0.0006) [2023-03-06 22:46:31,482][62475] Updated weights for policy 0, policy_version 80430 (0.0006) [2023-03-06 22:46:32,297][62475] Updated weights for policy 0, policy_version 80440 (0.0006) [2023-03-06 22:46:32,390][62145] Fps is (10 sec: 12800.1, 60 sec: 12731.7, 300 sec: 12721.9). Total num frames: 82371584. Throughput: 0: 12710.3. Samples: 82354119. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:46:32,390][62145] Avg episode reward: [(0, '711.499')] [2023-03-06 22:46:33,115][62475] Updated weights for policy 0, policy_version 80450 (0.0006) [2023-03-06 22:46:33,903][62475] Updated weights for policy 0, policy_version 80460 (0.0006) [2023-03-06 22:46:34,726][62475] Updated weights for policy 0, policy_version 80470 (0.0006) [2023-03-06 22:46:35,520][62475] Updated weights for policy 0, policy_version 80480 (0.0006) [2023-03-06 22:46:36,311][62475] Updated weights for policy 0, policy_version 80490 (0.0006) [2023-03-06 22:46:37,114][62475] Updated weights for policy 0, policy_version 80500 (0.0007) [2023-03-06 22:46:37,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12714.7, 300 sec: 12721.9). Total num frames: 82435072. Throughput: 0: 12710.7. Samples: 82430682. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:46:37,390][62145] Avg episode reward: [(0, '869.186')] [2023-03-06 22:46:37,923][62475] Updated weights for policy 0, policy_version 80510 (0.0006) [2023-03-06 22:46:38,717][62475] Updated weights for policy 0, policy_version 80520 (0.0006) [2023-03-06 22:46:39,528][62475] Updated weights for policy 0, policy_version 80530 (0.0006) [2023-03-06 22:46:40,329][62475] Updated weights for policy 0, policy_version 80540 (0.0006) [2023-03-06 22:46:41,134][62475] Updated weights for policy 0, policy_version 80550 (0.0006) [2023-03-06 22:46:41,944][62475] Updated weights for policy 0, policy_version 80560 (0.0007) [2023-03-06 22:46:42,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12718.4). Total num frames: 82498560. Throughput: 0: 12717.5. Samples: 82468885. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:46:42,390][62145] Avg episode reward: [(0, '815.411')] [2023-03-06 22:46:42,749][62475] Updated weights for policy 0, policy_version 80570 (0.0007) [2023-03-06 22:46:43,534][62475] Updated weights for policy 0, policy_version 80580 (0.0006) [2023-03-06 22:46:44,357][62475] Updated weights for policy 0, policy_version 80590 (0.0007) [2023-03-06 22:46:45,155][62475] Updated weights for policy 0, policy_version 80600 (0.0006) [2023-03-06 22:46:45,944][62475] Updated weights for policy 0, policy_version 80610 (0.0006) [2023-03-06 22:46:46,757][62475] Updated weights for policy 0, policy_version 80620 (0.0006) [2023-03-06 22:46:47,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12721.9). Total num frames: 82563072. Throughput: 0: 12720.8. Samples: 82545333. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:46:47,390][62145] Avg episode reward: [(0, '850.417')] [2023-03-06 22:46:47,559][62475] Updated weights for policy 0, policy_version 80630 (0.0007) [2023-03-06 22:46:48,368][62475] Updated weights for policy 0, policy_version 80640 (0.0006) [2023-03-06 22:46:49,175][62475] Updated weights for policy 0, policy_version 80650 (0.0006) [2023-03-06 22:46:49,973][62475] Updated weights for policy 0, policy_version 80660 (0.0007) [2023-03-06 22:46:50,787][62475] Updated weights for policy 0, policy_version 80670 (0.0006) [2023-03-06 22:46:51,584][62475] Updated weights for policy 0, policy_version 80680 (0.0006) [2023-03-06 22:46:52,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12718.4). Total num frames: 82625536. Throughput: 0: 12718.0. Samples: 82621454. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:46:52,390][62145] Avg episode reward: [(0, '840.735')] [2023-03-06 22:46:52,415][62475] Updated weights for policy 0, policy_version 80690 (0.0006) [2023-03-06 22:46:53,233][62475] Updated weights for policy 0, policy_version 80700 (0.0006) [2023-03-06 22:46:54,009][62475] Updated weights for policy 0, policy_version 80710 (0.0006) [2023-03-06 22:46:54,818][62475] Updated weights for policy 0, policy_version 80720 (0.0006) [2023-03-06 22:46:55,630][62475] Updated weights for policy 0, policy_version 80730 (0.0007) [2023-03-06 22:46:56,441][62475] Updated weights for policy 0, policy_version 80740 (0.0007) [2023-03-06 22:46:57,244][62475] Updated weights for policy 0, policy_version 80750 (0.0006) [2023-03-06 22:46:57,389][62145] Fps is (10 sec: 12595.2, 60 sec: 12714.7, 300 sec: 12718.4). Total num frames: 82689024. Throughput: 0: 12714.4. Samples: 82659615. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:46:57,390][62145] Avg episode reward: [(0, '596.602')] [2023-03-06 22:46:58,041][62475] Updated weights for policy 0, policy_version 80760 (0.0006) [2023-03-06 22:46:58,849][62475] Updated weights for policy 0, policy_version 80770 (0.0006) [2023-03-06 22:46:59,661][62475] Updated weights for policy 0, policy_version 80780 (0.0006) [2023-03-06 22:47:00,465][62475] Updated weights for policy 0, policy_version 80790 (0.0006) [2023-03-06 22:47:01,258][62475] Updated weights for policy 0, policy_version 80800 (0.0006) [2023-03-06 22:47:02,062][62475] Updated weights for policy 0, policy_version 80810 (0.0006) [2023-03-06 22:47:02,389][62145] Fps is (10 sec: 12800.2, 60 sec: 12731.7, 300 sec: 12721.9). Total num frames: 82753536. Throughput: 0: 12719.0. Samples: 82735841. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:47:02,390][62145] Avg episode reward: [(0, '755.936')] [2023-03-06 22:47:02,882][62475] Updated weights for policy 0, policy_version 80820 (0.0007) [2023-03-06 22:47:03,690][62475] Updated weights for policy 0, policy_version 80830 (0.0007) [2023-03-06 22:47:04,493][62475] Updated weights for policy 0, policy_version 80840 (0.0006) [2023-03-06 22:47:05,296][62475] Updated weights for policy 0, policy_version 80850 (0.0006) [2023-03-06 22:47:06,111][62475] Updated weights for policy 0, policy_version 80860 (0.0006) [2023-03-06 22:47:06,926][62475] Updated weights for policy 0, policy_version 80870 (0.0007) [2023-03-06 22:47:07,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12718.4). Total num frames: 82816000. Throughput: 0: 12718.5. Samples: 82811943. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:47:07,390][62145] Avg episode reward: [(0, '765.969')] [2023-03-06 22:47:07,726][62475] Updated weights for policy 0, policy_version 80880 (0.0006) [2023-03-06 22:47:08,541][62475] Updated weights for policy 0, policy_version 80890 (0.0006) [2023-03-06 22:47:09,353][62475] Updated weights for policy 0, policy_version 80900 (0.0008) [2023-03-06 22:47:10,175][62475] Updated weights for policy 0, policy_version 80910 (0.0005) [2023-03-06 22:47:10,979][62475] Updated weights for policy 0, policy_version 80920 (0.0006) [2023-03-06 22:47:11,761][62475] Updated weights for policy 0, policy_version 80930 (0.0007) [2023-03-06 22:47:12,389][62145] Fps is (10 sec: 12595.2, 60 sec: 12697.6, 300 sec: 12718.4). Total num frames: 82879488. Throughput: 0: 12719.8. Samples: 82850107. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:47:12,390][62145] Avg episode reward: [(0, '834.619')] [2023-03-06 22:47:12,592][62475] Updated weights for policy 0, policy_version 80940 (0.0006) [2023-03-06 22:47:13,381][62475] Updated weights for policy 0, policy_version 80950 (0.0006) [2023-03-06 22:47:14,169][62475] Updated weights for policy 0, policy_version 80960 (0.0006) [2023-03-06 22:47:14,988][62475] Updated weights for policy 0, policy_version 80970 (0.0007) [2023-03-06 22:47:15,789][62475] Updated weights for policy 0, policy_version 80980 (0.0006) [2023-03-06 22:47:16,586][62475] Updated weights for policy 0, policy_version 80990 (0.0007) [2023-03-06 22:47:17,388][62475] Updated weights for policy 0, policy_version 81000 (0.0007) [2023-03-06 22:47:17,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.8, 300 sec: 12721.9). Total num frames: 82944000. Throughput: 0: 12716.5. Samples: 82926359. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:47:17,390][62145] Avg episode reward: [(0, '760.229')] [2023-03-06 22:47:18,187][62475] Updated weights for policy 0, policy_version 81010 (0.0006) [2023-03-06 22:47:18,986][62475] Updated weights for policy 0, policy_version 81020 (0.0006) [2023-03-06 22:47:19,789][62475] Updated weights for policy 0, policy_version 81030 (0.0006) [2023-03-06 22:47:20,584][62475] Updated weights for policy 0, policy_version 81040 (0.0006) [2023-03-06 22:47:21,368][62475] Updated weights for policy 0, policy_version 81050 (0.0006) [2023-03-06 22:47:22,166][62475] Updated weights for policy 0, policy_version 81060 (0.0006) [2023-03-06 22:47:22,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12731.7, 300 sec: 12718.4). Total num frames: 83007488. Throughput: 0: 12724.3. Samples: 83003277. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:47:22,390][62145] Avg episode reward: [(0, '821.561')] [2023-03-06 22:47:22,993][62475] Updated weights for policy 0, policy_version 81070 (0.0006) [2023-03-06 22:47:23,800][62475] Updated weights for policy 0, policy_version 81080 (0.0006) [2023-03-06 22:47:24,610][62475] Updated weights for policy 0, policy_version 81090 (0.0007) [2023-03-06 22:47:25,411][62475] Updated weights for policy 0, policy_version 81100 (0.0006) [2023-03-06 22:47:26,225][62475] Updated weights for policy 0, policy_version 81110 (0.0007) [2023-03-06 22:47:27,026][62475] Updated weights for policy 0, policy_version 81120 (0.0006) [2023-03-06 22:47:27,389][62145] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12721.9). Total num frames: 83070976. Throughput: 0: 12720.6. Samples: 83041310. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:47:27,390][62145] Avg episode reward: [(0, '762.403')] [2023-03-06 22:47:27,819][62475] Updated weights for policy 0, policy_version 81130 (0.0006) [2023-03-06 22:47:28,626][62475] Updated weights for policy 0, policy_version 81140 (0.0007) [2023-03-06 22:47:29,418][62475] Updated weights for policy 0, policy_version 81150 (0.0006) [2023-03-06 22:47:30,235][62475] Updated weights for policy 0, policy_version 81160 (0.0006) [2023-03-06 22:47:31,026][62475] Updated weights for policy 0, policy_version 81170 (0.0006) [2023-03-06 22:47:31,835][62475] Updated weights for policy 0, policy_version 81180 (0.0007) [2023-03-06 22:47:32,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12718.4). Total num frames: 83134464. Throughput: 0: 12722.3. Samples: 83117838. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:47:32,401][62145] Avg episode reward: [(0, '717.836')] [2023-03-06 22:47:32,655][62475] Updated weights for policy 0, policy_version 81190 (0.0007) [2023-03-06 22:47:33,453][62475] Updated weights for policy 0, policy_version 81200 (0.0006) [2023-03-06 22:47:34,260][62475] Updated weights for policy 0, policy_version 81210 (0.0006) [2023-03-06 22:47:35,047][62475] Updated weights for policy 0, policy_version 81220 (0.0006) [2023-03-06 22:47:35,866][62475] Updated weights for policy 0, policy_version 81230 (0.0006) [2023-03-06 22:47:36,670][62475] Updated weights for policy 0, policy_version 81240 (0.0006) [2023-03-06 22:47:37,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12721.9). Total num frames: 83198976. Throughput: 0: 12727.9. Samples: 83194208. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:47:37,400][62145] Avg episode reward: [(0, '763.647')] [2023-03-06 22:47:37,472][62475] Updated weights for policy 0, policy_version 81250 (0.0006) [2023-03-06 22:47:38,294][62475] Updated weights for policy 0, policy_version 81260 (0.0006) [2023-03-06 22:47:39,097][62475] Updated weights for policy 0, policy_version 81270 (0.0007) [2023-03-06 22:47:39,904][62475] Updated weights for policy 0, policy_version 81280 (0.0005) [2023-03-06 22:47:40,705][62475] Updated weights for policy 0, policy_version 81290 (0.0006) [2023-03-06 22:47:41,496][62475] Updated weights for policy 0, policy_version 81300 (0.0006) [2023-03-06 22:47:42,313][62475] Updated weights for policy 0, policy_version 81310 (0.0007) [2023-03-06 22:47:42,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12718.4). Total num frames: 83261440. Throughput: 0: 12722.5. Samples: 83232130. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:47:42,401][62145] Avg episode reward: [(0, '665.774')] [2023-03-06 22:47:43,116][62475] Updated weights for policy 0, policy_version 81320 (0.0007) [2023-03-06 22:47:43,921][62475] Updated weights for policy 0, policy_version 81330 (0.0006) [2023-03-06 22:47:44,737][62475] Updated weights for policy 0, policy_version 81340 (0.0006) [2023-03-06 22:47:45,555][62475] Updated weights for policy 0, policy_version 81350 (0.0007) [2023-03-06 22:47:46,349][62475] Updated weights for policy 0, policy_version 81360 (0.0006) [2023-03-06 22:47:47,157][62475] Updated weights for policy 0, policy_version 81370 (0.0007) [2023-03-06 22:47:47,390][62145] Fps is (10 sec: 12595.2, 60 sec: 12697.6, 300 sec: 12718.4). Total num frames: 83324928. Throughput: 0: 12719.1. Samples: 83308203. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:47:47,401][62145] Avg episode reward: [(0, '729.352')] [2023-03-06 22:47:47,970][62475] Updated weights for policy 0, policy_version 81380 (0.0006) [2023-03-06 22:47:48,765][62475] Updated weights for policy 0, policy_version 81390 (0.0006) [2023-03-06 22:47:49,557][62475] Updated weights for policy 0, policy_version 81400 (0.0006) [2023-03-06 22:47:50,366][62475] Updated weights for policy 0, policy_version 81410 (0.0006) [2023-03-06 22:47:51,179][62475] Updated weights for policy 0, policy_version 81420 (0.0006) [2023-03-06 22:47:51,977][62475] Updated weights for policy 0, policy_version 81430 (0.0006) [2023-03-06 22:47:52,389][62145] Fps is (10 sec: 12697.8, 60 sec: 12714.7, 300 sec: 12718.4). Total num frames: 83388416. Throughput: 0: 12727.1. Samples: 83384663. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:47:52,400][62145] Avg episode reward: [(0, '679.048')] [2023-03-06 22:47:52,786][62475] Updated weights for policy 0, policy_version 81440 (0.0006) [2023-03-06 22:47:53,604][62475] Updated weights for policy 0, policy_version 81450 (0.0006) [2023-03-06 22:47:54,387][62475] Updated weights for policy 0, policy_version 81460 (0.0006) [2023-03-06 22:47:55,183][62475] Updated weights for policy 0, policy_version 81470 (0.0006) [2023-03-06 22:47:55,992][62475] Updated weights for policy 0, policy_version 81480 (0.0006) [2023-03-06 22:47:56,794][62475] Updated weights for policy 0, policy_version 81490 (0.0006) [2023-03-06 22:47:57,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12721.9). Total num frames: 83452928. Throughput: 0: 12726.4. Samples: 83422795. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:47:57,401][62145] Avg episode reward: [(0, '652.693')] [2023-03-06 22:47:57,625][62475] Updated weights for policy 0, policy_version 81500 (0.0006) [2023-03-06 22:47:58,414][62475] Updated weights for policy 0, policy_version 81510 (0.0006) [2023-03-06 22:47:59,246][62475] Updated weights for policy 0, policy_version 81520 (0.0006) [2023-03-06 22:48:00,057][62475] Updated weights for policy 0, policy_version 81530 (0.0006) [2023-03-06 22:48:00,840][62475] Updated weights for policy 0, policy_version 81540 (0.0006) [2023-03-06 22:48:01,665][62475] Updated weights for policy 0, policy_version 81550 (0.0006) [2023-03-06 22:48:02,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12714.6, 300 sec: 12718.4). Total num frames: 83516416. Throughput: 0: 12727.0. Samples: 83499075. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:48:02,401][62145] Avg episode reward: [(0, '771.527')] [2023-03-06 22:48:02,465][62475] Updated weights for policy 0, policy_version 81560 (0.0006) [2023-03-06 22:48:03,289][62475] Updated weights for policy 0, policy_version 81570 (0.0007) [2023-03-06 22:48:04,105][62475] Updated weights for policy 0, policy_version 81580 (0.0007) [2023-03-06 22:48:04,900][62475] Updated weights for policy 0, policy_version 81590 (0.0006) [2023-03-06 22:48:05,720][62475] Updated weights for policy 0, policy_version 81600 (0.0007) [2023-03-06 22:48:06,512][62475] Updated weights for policy 0, policy_version 81610 (0.0007) [2023-03-06 22:48:07,301][62475] Updated weights for policy 0, policy_version 81620 (0.0006) [2023-03-06 22:48:07,389][62145] Fps is (10 sec: 12595.3, 60 sec: 12714.7, 300 sec: 12715.0). Total num frames: 83578880. Throughput: 0: 12707.3. Samples: 83575103. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:48:07,400][62145] Avg episode reward: [(0, '748.825')] [2023-03-06 22:48:08,091][62475] Updated weights for policy 0, policy_version 81630 (0.0007) [2023-03-06 22:48:08,899][62475] Updated weights for policy 0, policy_version 81640 (0.0007) [2023-03-06 22:48:09,700][62475] Updated weights for policy 0, policy_version 81650 (0.0008) [2023-03-06 22:48:10,498][62475] Updated weights for policy 0, policy_version 81660 (0.0007) [2023-03-06 22:48:11,313][62475] Updated weights for policy 0, policy_version 81670 (0.0007) [2023-03-06 22:48:12,106][62475] Updated weights for policy 0, policy_version 81680 (0.0007) [2023-03-06 22:48:12,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12731.7, 300 sec: 12718.4). Total num frames: 83643392. Throughput: 0: 12714.1. Samples: 83613446. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:48:12,400][62145] Avg episode reward: [(0, '738.320')] [2023-03-06 22:48:12,912][62475] Updated weights for policy 0, policy_version 81690 (0.0007) [2023-03-06 22:48:13,719][62475] Updated weights for policy 0, policy_version 81700 (0.0006) [2023-03-06 22:48:14,509][62475] Updated weights for policy 0, policy_version 81710 (0.0006) [2023-03-06 22:48:15,318][62475] Updated weights for policy 0, policy_version 81720 (0.0006) [2023-03-06 22:48:16,118][62475] Updated weights for policy 0, policy_version 81730 (0.0006) [2023-03-06 22:48:16,927][62475] Updated weights for policy 0, policy_version 81740 (0.0006) [2023-03-06 22:48:17,389][62145] Fps is (10 sec: 12799.9, 60 sec: 12714.7, 300 sec: 12718.4). Total num frames: 83706880. Throughput: 0: 12716.2. Samples: 83690064. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:48:17,390][62145] Avg episode reward: [(0, '815.037')] [2023-03-06 22:48:17,744][62475] Updated weights for policy 0, policy_version 81750 (0.0006) [2023-03-06 22:48:18,546][62475] Updated weights for policy 0, policy_version 81760 (0.0007) [2023-03-06 22:48:19,346][62475] Updated weights for policy 0, policy_version 81770 (0.0006) [2023-03-06 22:48:20,147][62475] Updated weights for policy 0, policy_version 81780 (0.0006) [2023-03-06 22:48:20,968][62475] Updated weights for policy 0, policy_version 81790 (0.0007) [2023-03-06 22:48:21,772][62475] Updated weights for policy 0, policy_version 81800 (0.0006) [2023-03-06 22:48:22,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12718.4). Total num frames: 83770368. Throughput: 0: 12708.9. Samples: 83766110. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:48:22,401][62145] Avg episode reward: [(0, '871.665')] [2023-03-06 22:48:22,405][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000081807_83770368.pth... [2023-03-06 22:48:22,437][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000078827_80718848.pth [2023-03-06 22:48:22,591][62475] Updated weights for policy 0, policy_version 81810 (0.0006) [2023-03-06 22:48:23,389][62475] Updated weights for policy 0, policy_version 81820 (0.0006) [2023-03-06 22:48:24,206][62475] Updated weights for policy 0, policy_version 81830 (0.0006) [2023-03-06 22:48:25,035][62475] Updated weights for policy 0, policy_version 81840 (0.0006) [2023-03-06 22:48:25,826][62475] Updated weights for policy 0, policy_version 81850 (0.0006) [2023-03-06 22:48:26,634][62475] Updated weights for policy 0, policy_version 81860 (0.0007) [2023-03-06 22:48:27,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12718.4). Total num frames: 83833856. Throughput: 0: 12708.4. Samples: 83804009. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:48:27,390][62145] Avg episode reward: [(0, '861.678')] [2023-03-06 22:48:27,430][62475] Updated weights for policy 0, policy_version 81870 (0.0006) [2023-03-06 22:48:28,276][62475] Updated weights for policy 0, policy_version 81880 (0.0006) [2023-03-06 22:48:29,067][62475] Updated weights for policy 0, policy_version 81890 (0.0006) [2023-03-06 22:48:29,858][62475] Updated weights for policy 0, policy_version 81900 (0.0006) [2023-03-06 22:48:30,670][62475] Updated weights for policy 0, policy_version 81910 (0.0006) [2023-03-06 22:48:31,470][62475] Updated weights for policy 0, policy_version 81920 (0.0005) [2023-03-06 22:48:32,261][62475] Updated weights for policy 0, policy_version 81930 (0.0006) [2023-03-06 22:48:32,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12718.4). Total num frames: 83897344. Throughput: 0: 12710.0. Samples: 83880153. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:48:32,390][62145] Avg episode reward: [(0, '881.699')] [2023-03-06 22:48:33,071][62475] Updated weights for policy 0, policy_version 81940 (0.0006) [2023-03-06 22:48:33,882][62475] Updated weights for policy 0, policy_version 81950 (0.0007) [2023-03-06 22:48:34,697][62475] Updated weights for policy 0, policy_version 81960 (0.0007) [2023-03-06 22:48:35,501][62475] Updated weights for policy 0, policy_version 81970 (0.0007) [2023-03-06 22:48:36,322][62475] Updated weights for policy 0, policy_version 81980 (0.0006) [2023-03-06 22:48:37,119][62475] Updated weights for policy 0, policy_version 81990 (0.0007) [2023-03-06 22:48:37,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12715.0). Total num frames: 83960832. Throughput: 0: 12706.2. Samples: 83956443. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:48:37,390][62145] Avg episode reward: [(0, '1072.687')] [2023-03-06 22:48:37,922][62475] Updated weights for policy 0, policy_version 82000 (0.0006) [2023-03-06 22:48:38,729][62475] Updated weights for policy 0, policy_version 82010 (0.0006) [2023-03-06 22:48:39,534][62475] Updated weights for policy 0, policy_version 82020 (0.0006) [2023-03-06 22:48:40,333][62475] Updated weights for policy 0, policy_version 82030 (0.0006) [2023-03-06 22:48:41,146][62475] Updated weights for policy 0, policy_version 82040 (0.0006) [2023-03-06 22:48:41,932][62475] Updated weights for policy 0, policy_version 82050 (0.0007) [2023-03-06 22:48:42,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12718.4). Total num frames: 84024320. Throughput: 0: 12702.8. Samples: 83994420. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:48:42,390][62145] Avg episode reward: [(0, '886.839')] [2023-03-06 22:48:42,720][62475] Updated weights for policy 0, policy_version 82060 (0.0006) [2023-03-06 22:48:43,532][62475] Updated weights for policy 0, policy_version 82070 (0.0006) [2023-03-06 22:48:44,350][62475] Updated weights for policy 0, policy_version 82080 (0.0006) [2023-03-06 22:48:45,145][62475] Updated weights for policy 0, policy_version 82090 (0.0006) [2023-03-06 22:48:45,950][62475] Updated weights for policy 0, policy_version 82100 (0.0005) [2023-03-06 22:48:46,765][62475] Updated weights for policy 0, policy_version 82110 (0.0006) [2023-03-06 22:48:47,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12718.4). Total num frames: 84087808. Throughput: 0: 12712.1. Samples: 84071117. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:48:47,390][62145] Avg episode reward: [(0, '854.508')] [2023-03-06 22:48:47,553][62475] Updated weights for policy 0, policy_version 82120 (0.0006) [2023-03-06 22:48:48,382][62475] Updated weights for policy 0, policy_version 82130 (0.0007) [2023-03-06 22:48:49,186][62475] Updated weights for policy 0, policy_version 82140 (0.0006) [2023-03-06 22:48:49,987][62475] Updated weights for policy 0, policy_version 82150 (0.0006) [2023-03-06 22:48:50,802][62475] Updated weights for policy 0, policy_version 82160 (0.0006) [2023-03-06 22:48:51,621][62475] Updated weights for policy 0, policy_version 82170 (0.0006) [2023-03-06 22:48:52,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12715.0). Total num frames: 84151296. Throughput: 0: 12711.5. Samples: 84147121. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:48:52,390][62145] Avg episode reward: [(0, '919.622')] [2023-03-06 22:48:52,429][62475] Updated weights for policy 0, policy_version 82180 (0.0006) [2023-03-06 22:48:53,255][62475] Updated weights for policy 0, policy_version 82190 (0.0007) [2023-03-06 22:48:54,059][62475] Updated weights for policy 0, policy_version 82200 (0.0006) [2023-03-06 22:48:54,867][62475] Updated weights for policy 0, policy_version 82210 (0.0007) [2023-03-06 22:48:55,670][62475] Updated weights for policy 0, policy_version 82220 (0.0006) [2023-03-06 22:48:56,484][62475] Updated weights for policy 0, policy_version 82230 (0.0006) [2023-03-06 22:48:57,297][62475] Updated weights for policy 0, policy_version 82240 (0.0007) [2023-03-06 22:48:57,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12715.0). Total num frames: 84214784. Throughput: 0: 12699.6. Samples: 84184927. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:48:57,390][62145] Avg episode reward: [(0, '775.363')] [2023-03-06 22:48:58,106][62475] Updated weights for policy 0, policy_version 82250 (0.0006) [2023-03-06 22:48:58,901][62475] Updated weights for policy 0, policy_version 82260 (0.0007) [2023-03-06 22:48:59,698][62475] Updated weights for policy 0, policy_version 82270 (0.0006) [2023-03-06 22:49:00,526][62475] Updated weights for policy 0, policy_version 82280 (0.0006) [2023-03-06 22:49:01,319][62475] Updated weights for policy 0, policy_version 82290 (0.0006) [2023-03-06 22:49:02,124][62475] Updated weights for policy 0, policy_version 82300 (0.0007) [2023-03-06 22:49:02,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12715.0). Total num frames: 84278272. Throughput: 0: 12688.9. Samples: 84261063. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:49:02,390][62145] Avg episode reward: [(0, '576.171')] [2023-03-06 22:49:02,953][62475] Updated weights for policy 0, policy_version 82310 (0.0007) [2023-03-06 22:49:03,739][62475] Updated weights for policy 0, policy_version 82320 (0.0006) [2023-03-06 22:49:04,547][62475] Updated weights for policy 0, policy_version 82330 (0.0006) [2023-03-06 22:49:05,363][62475] Updated weights for policy 0, policy_version 82340 (0.0007) [2023-03-06 22:49:06,163][62475] Updated weights for policy 0, policy_version 82350 (0.0006) [2023-03-06 22:49:06,954][62475] Updated weights for policy 0, policy_version 82360 (0.0006) [2023-03-06 22:49:07,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.6, 300 sec: 12711.5). Total num frames: 84341760. Throughput: 0: 12693.1. Samples: 84337299. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:49:07,390][62145] Avg episode reward: [(0, '836.820')] [2023-03-06 22:49:07,771][62475] Updated weights for policy 0, policy_version 82370 (0.0007) [2023-03-06 22:49:08,590][62475] Updated weights for policy 0, policy_version 82380 (0.0006) [2023-03-06 22:49:09,382][62475] Updated weights for policy 0, policy_version 82390 (0.0007) [2023-03-06 22:49:10,181][62475] Updated weights for policy 0, policy_version 82400 (0.0007) [2023-03-06 22:49:10,987][62475] Updated weights for policy 0, policy_version 82410 (0.0006) [2023-03-06 22:49:11,804][62475] Updated weights for policy 0, policy_version 82420 (0.0007) [2023-03-06 22:49:12,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12711.5). Total num frames: 84405248. Throughput: 0: 12699.4. Samples: 84375483. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:49:12,390][62145] Avg episode reward: [(0, '771.853')] [2023-03-06 22:49:12,611][62475] Updated weights for policy 0, policy_version 82430 (0.0006) [2023-03-06 22:49:13,419][62475] Updated weights for policy 0, policy_version 82440 (0.0006) [2023-03-06 22:49:14,222][62475] Updated weights for policy 0, policy_version 82450 (0.0006) [2023-03-06 22:49:15,010][62475] Updated weights for policy 0, policy_version 82460 (0.0007) [2023-03-06 22:49:15,833][62475] Updated weights for policy 0, policy_version 82470 (0.0006) [2023-03-06 22:49:16,633][62475] Updated weights for policy 0, policy_version 82480 (0.0006) [2023-03-06 22:49:17,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12715.0). Total num frames: 84468736. Throughput: 0: 12698.7. Samples: 84451596. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:49:17,390][62145] Avg episode reward: [(0, '747.730')] [2023-03-06 22:49:17,430][62475] Updated weights for policy 0, policy_version 82490 (0.0005) [2023-03-06 22:49:18,255][62475] Updated weights for policy 0, policy_version 82500 (0.0005) [2023-03-06 22:49:19,068][62475] Updated weights for policy 0, policy_version 82510 (0.0006) [2023-03-06 22:49:19,869][62475] Updated weights for policy 0, policy_version 82520 (0.0006) [2023-03-06 22:49:20,685][62475] Updated weights for policy 0, policy_version 82530 (0.0006) [2023-03-06 22:49:21,473][62475] Updated weights for policy 0, policy_version 82540 (0.0006) [2023-03-06 22:49:22,278][62475] Updated weights for policy 0, policy_version 82550 (0.0006) [2023-03-06 22:49:22,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12715.0). Total num frames: 84532224. Throughput: 0: 12694.7. Samples: 84527704. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:49:22,390][62145] Avg episode reward: [(0, '541.913')] [2023-03-06 22:49:23,099][62475] Updated weights for policy 0, policy_version 82560 (0.0006) [2023-03-06 22:49:23,906][62475] Updated weights for policy 0, policy_version 82570 (0.0006) [2023-03-06 22:49:24,697][62475] Updated weights for policy 0, policy_version 82580 (0.0006) [2023-03-06 22:49:25,504][62475] Updated weights for policy 0, policy_version 82590 (0.0005) [2023-03-06 22:49:26,302][62475] Updated weights for policy 0, policy_version 82600 (0.0006) [2023-03-06 22:49:27,090][62475] Updated weights for policy 0, policy_version 82610 (0.0006) [2023-03-06 22:49:27,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12715.0). Total num frames: 84595712. Throughput: 0: 12700.2. Samples: 84565930. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:49:27,390][62145] Avg episode reward: [(0, '708.258')] [2023-03-06 22:49:27,907][62475] Updated weights for policy 0, policy_version 82620 (0.0006) [2023-03-06 22:49:28,725][62475] Updated weights for policy 0, policy_version 82630 (0.0006) [2023-03-06 22:49:29,522][62475] Updated weights for policy 0, policy_version 82640 (0.0007) [2023-03-06 22:49:30,333][62475] Updated weights for policy 0, policy_version 82650 (0.0006) [2023-03-06 22:49:31,155][62475] Updated weights for policy 0, policy_version 82660 (0.0006) [2023-03-06 22:49:31,960][62475] Updated weights for policy 0, policy_version 82670 (0.0006) [2023-03-06 22:49:32,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12715.0). Total num frames: 84659200. Throughput: 0: 12685.6. Samples: 84641971. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:49:32,390][62145] Avg episode reward: [(0, '751.852')] [2023-03-06 22:49:32,757][62475] Updated weights for policy 0, policy_version 82680 (0.0006) [2023-03-06 22:49:33,577][62475] Updated weights for policy 0, policy_version 82690 (0.0006) [2023-03-06 22:49:34,394][62475] Updated weights for policy 0, policy_version 82700 (0.0006) [2023-03-06 22:49:35,181][62475] Updated weights for policy 0, policy_version 82710 (0.0006) [2023-03-06 22:49:36,001][62475] Updated weights for policy 0, policy_version 82720 (0.0007) [2023-03-06 22:49:36,814][62475] Updated weights for policy 0, policy_version 82730 (0.0006) [2023-03-06 22:49:37,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12715.0). Total num frames: 84722688. Throughput: 0: 12687.3. Samples: 84718049. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:49:37,390][62145] Avg episode reward: [(0, '593.170')] [2023-03-06 22:49:37,605][62475] Updated weights for policy 0, policy_version 82740 (0.0006) [2023-03-06 22:49:38,419][62475] Updated weights for policy 0, policy_version 82750 (0.0006) [2023-03-06 22:49:39,220][62475] Updated weights for policy 0, policy_version 82760 (0.0006) [2023-03-06 22:49:40,031][62475] Updated weights for policy 0, policy_version 82770 (0.0006) [2023-03-06 22:49:40,853][62475] Updated weights for policy 0, policy_version 82780 (0.0006) [2023-03-06 22:49:41,665][62475] Updated weights for policy 0, policy_version 82790 (0.0007) [2023-03-06 22:49:42,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12711.5). Total num frames: 84786176. Throughput: 0: 12696.0. Samples: 84756248. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:49:42,390][62145] Avg episode reward: [(0, '641.803')] [2023-03-06 22:49:42,456][62475] Updated weights for policy 0, policy_version 82800 (0.0006) [2023-03-06 22:49:43,250][62475] Updated weights for policy 0, policy_version 82810 (0.0006) [2023-03-06 22:49:44,049][62475] Updated weights for policy 0, policy_version 82820 (0.0007) [2023-03-06 22:49:44,851][62475] Updated weights for policy 0, policy_version 82830 (0.0006) [2023-03-06 22:49:45,647][62475] Updated weights for policy 0, policy_version 82840 (0.0006) [2023-03-06 22:49:46,468][62475] Updated weights for policy 0, policy_version 82850 (0.0006) [2023-03-06 22:49:47,262][62475] Updated weights for policy 0, policy_version 82860 (0.0006) [2023-03-06 22:49:47,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12711.5). Total num frames: 84849664. Throughput: 0: 12697.3. Samples: 84832439. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:49:47,390][62145] Avg episode reward: [(0, '804.200')] [2023-03-06 22:49:48,069][62475] Updated weights for policy 0, policy_version 82870 (0.0006) [2023-03-06 22:49:48,885][62475] Updated weights for policy 0, policy_version 82880 (0.0006) [2023-03-06 22:49:49,674][62475] Updated weights for policy 0, policy_version 82890 (0.0007) [2023-03-06 22:49:50,477][62475] Updated weights for policy 0, policy_version 82900 (0.0006) [2023-03-06 22:49:51,286][62475] Updated weights for policy 0, policy_version 82910 (0.0006) [2023-03-06 22:49:52,061][62475] Updated weights for policy 0, policy_version 82920 (0.0006) [2023-03-06 22:49:52,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12714.7, 300 sec: 12715.0). Total num frames: 84914176. Throughput: 0: 12708.3. Samples: 84909171. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:49:52,390][62145] Avg episode reward: [(0, '625.657')] [2023-03-06 22:49:52,870][62475] Updated weights for policy 0, policy_version 82930 (0.0006) [2023-03-06 22:49:53,702][62475] Updated weights for policy 0, policy_version 82940 (0.0007) [2023-03-06 22:49:54,507][62475] Updated weights for policy 0, policy_version 82950 (0.0007) [2023-03-06 22:49:55,318][62475] Updated weights for policy 0, policy_version 82960 (0.0006) [2023-03-06 22:49:56,134][62475] Updated weights for policy 0, policy_version 82970 (0.0006) [2023-03-06 22:49:56,945][62475] Updated weights for policy 0, policy_version 82980 (0.0006) [2023-03-06 22:49:57,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12711.5). Total num frames: 84976640. Throughput: 0: 12704.0. Samples: 84947162. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:49:57,390][62145] Avg episode reward: [(0, '692.403')] [2023-03-06 22:49:57,742][62475] Updated weights for policy 0, policy_version 82990 (0.0007) [2023-03-06 22:49:58,550][62475] Updated weights for policy 0, policy_version 83000 (0.0006) [2023-03-06 22:49:59,357][62475] Updated weights for policy 0, policy_version 83010 (0.0006) [2023-03-06 22:50:00,152][62475] Updated weights for policy 0, policy_version 83020 (0.0007) [2023-03-06 22:50:00,953][62475] Updated weights for policy 0, policy_version 83030 (0.0006) [2023-03-06 22:50:01,753][62475] Updated weights for policy 0, policy_version 83040 (0.0006) [2023-03-06 22:50:02,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12715.0). Total num frames: 85041152. Throughput: 0: 12708.7. Samples: 85023487. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:50:02,390][62145] Avg episode reward: [(0, '530.729')] [2023-03-06 22:50:02,546][62475] Updated weights for policy 0, policy_version 83050 (0.0006) [2023-03-06 22:50:03,350][62475] Updated weights for policy 0, policy_version 83060 (0.0006) [2023-03-06 22:50:04,141][62475] Updated weights for policy 0, policy_version 83070 (0.0006) [2023-03-06 22:50:04,944][62475] Updated weights for policy 0, policy_version 83080 (0.0006) [2023-03-06 22:50:05,751][62475] Updated weights for policy 0, policy_version 83090 (0.0006) [2023-03-06 22:50:06,565][62475] Updated weights for policy 0, policy_version 83100 (0.0006) [2023-03-06 22:50:07,376][62475] Updated weights for policy 0, policy_version 83110 (0.0006) [2023-03-06 22:50:07,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12714.7, 300 sec: 12715.0). Total num frames: 85104640. Throughput: 0: 12719.1. Samples: 85100066. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:50:07,390][62145] Avg episode reward: [(0, '693.132')] [2023-03-06 22:50:08,173][62475] Updated weights for policy 0, policy_version 83120 (0.0006) [2023-03-06 22:50:08,968][62475] Updated weights for policy 0, policy_version 83130 (0.0006) [2023-03-06 22:50:09,785][62475] Updated weights for policy 0, policy_version 83140 (0.0006) [2023-03-06 22:50:10,595][62475] Updated weights for policy 0, policy_version 83150 (0.0006) [2023-03-06 22:50:11,412][62475] Updated weights for policy 0, policy_version 83160 (0.0006) [2023-03-06 22:50:12,209][62475] Updated weights for policy 0, policy_version 83170 (0.0006) [2023-03-06 22:50:12,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12715.0). Total num frames: 85168128. Throughput: 0: 12716.5. Samples: 85138174. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:50:12,390][62145] Avg episode reward: [(0, '757.784')] [2023-03-06 22:50:13,017][62475] Updated weights for policy 0, policy_version 83180 (0.0007) [2023-03-06 22:50:13,834][62475] Updated weights for policy 0, policy_version 83190 (0.0006) [2023-03-06 22:50:14,627][62475] Updated weights for policy 0, policy_version 83200 (0.0005) [2023-03-06 22:50:15,438][62475] Updated weights for policy 0, policy_version 83210 (0.0006) [2023-03-06 22:50:16,246][62475] Updated weights for policy 0, policy_version 83220 (0.0006) [2023-03-06 22:50:17,053][62475] Updated weights for policy 0, policy_version 83230 (0.0006) [2023-03-06 22:50:17,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12715.0). Total num frames: 85231616. Throughput: 0: 12716.8. Samples: 85214227. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:50:17,390][62145] Avg episode reward: [(0, '670.941')] [2023-03-06 22:50:17,839][62475] Updated weights for policy 0, policy_version 83240 (0.0006) [2023-03-06 22:50:18,654][62475] Updated weights for policy 0, policy_version 83250 (0.0007) [2023-03-06 22:50:19,455][62475] Updated weights for policy 0, policy_version 83260 (0.0006) [2023-03-06 22:50:20,270][62475] Updated weights for policy 0, policy_version 83270 (0.0006) [2023-03-06 22:50:21,051][62475] Updated weights for policy 0, policy_version 83280 (0.0006) [2023-03-06 22:50:21,871][62475] Updated weights for policy 0, policy_version 83290 (0.0006) [2023-03-06 22:50:22,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12715.0). Total num frames: 85295104. Throughput: 0: 12726.2. Samples: 85290728. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:50:22,390][62145] Avg episode reward: [(0, '616.728')] [2023-03-06 22:50:22,394][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000083296_85295104.pth... [2023-03-06 22:50:22,424][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000080317_82244608.pth [2023-03-06 22:50:22,673][62475] Updated weights for policy 0, policy_version 83300 (0.0006) [2023-03-06 22:50:23,482][62475] Updated weights for policy 0, policy_version 83310 (0.0006) [2023-03-06 22:50:24,286][62475] Updated weights for policy 0, policy_version 83320 (0.0007) [2023-03-06 22:50:25,082][62475] Updated weights for policy 0, policy_version 83330 (0.0006) [2023-03-06 22:50:25,892][62475] Updated weights for policy 0, policy_version 83340 (0.0006) [2023-03-06 22:50:26,689][62475] Updated weights for policy 0, policy_version 83350 (0.0006) [2023-03-06 22:50:27,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12715.0). Total num frames: 85358592. Throughput: 0: 12725.6. Samples: 85328902. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:50:27,390][62145] Avg episode reward: [(0, '547.747')] [2023-03-06 22:50:27,486][62475] Updated weights for policy 0, policy_version 83360 (0.0007) [2023-03-06 22:50:28,280][62475] Updated weights for policy 0, policy_version 83370 (0.0007) [2023-03-06 22:50:29,083][62475] Updated weights for policy 0, policy_version 83380 (0.0006) [2023-03-06 22:50:29,894][62475] Updated weights for policy 0, policy_version 83390 (0.0006) [2023-03-06 22:50:30,679][62475] Updated weights for policy 0, policy_version 83400 (0.0006) [2023-03-06 22:50:31,478][62475] Updated weights for policy 0, policy_version 83410 (0.0006) [2023-03-06 22:50:32,280][62475] Updated weights for policy 0, policy_version 83420 (0.0007) [2023-03-06 22:50:32,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12715.0). Total num frames: 85423104. Throughput: 0: 12740.9. Samples: 85405781. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:50:32,390][62145] Avg episode reward: [(0, '568.849')] [2023-03-06 22:50:33,102][62475] Updated weights for policy 0, policy_version 83430 (0.0006) [2023-03-06 22:50:33,889][62475] Updated weights for policy 0, policy_version 83440 (0.0006) [2023-03-06 22:50:34,697][62475] Updated weights for policy 0, policy_version 83450 (0.0006) [2023-03-06 22:50:35,489][62475] Updated weights for policy 0, policy_version 83460 (0.0007) [2023-03-06 22:50:36,295][62475] Updated weights for policy 0, policy_version 83470 (0.0006) [2023-03-06 22:50:37,098][62475] Updated weights for policy 0, policy_version 83480 (0.0006) [2023-03-06 22:50:37,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12731.7, 300 sec: 12715.0). Total num frames: 85486592. Throughput: 0: 12734.4. Samples: 85482216. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:50:37,390][62145] Avg episode reward: [(0, '569.970')] [2023-03-06 22:50:37,918][62475] Updated weights for policy 0, policy_version 83490 (0.0005) [2023-03-06 22:50:38,722][62475] Updated weights for policy 0, policy_version 83500 (0.0007) [2023-03-06 22:50:39,524][62475] Updated weights for policy 0, policy_version 83510 (0.0006) [2023-03-06 22:50:40,343][62475] Updated weights for policy 0, policy_version 83520 (0.0006) [2023-03-06 22:50:41,133][62475] Updated weights for policy 0, policy_version 83530 (0.0007) [2023-03-06 22:50:41,923][62475] Updated weights for policy 0, policy_version 83540 (0.0006) [2023-03-06 22:50:42,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12715.0). Total num frames: 85550080. Throughput: 0: 12736.3. Samples: 85520295. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:50:42,390][62145] Avg episode reward: [(0, '480.836')] [2023-03-06 22:50:42,749][62475] Updated weights for policy 0, policy_version 83550 (0.0007) [2023-03-06 22:50:43,541][62475] Updated weights for policy 0, policy_version 83560 (0.0007) [2023-03-06 22:50:44,345][62475] Updated weights for policy 0, policy_version 83570 (0.0006) [2023-03-06 22:50:45,152][62475] Updated weights for policy 0, policy_version 83580 (0.0006) [2023-03-06 22:50:45,954][62475] Updated weights for policy 0, policy_version 83590 (0.0006) [2023-03-06 22:50:46,766][62475] Updated weights for policy 0, policy_version 83600 (0.0006) [2023-03-06 22:50:47,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12715.0). Total num frames: 85613568. Throughput: 0: 12741.7. Samples: 85596866. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:50:47,390][62145] Avg episode reward: [(0, '491.030')] [2023-03-06 22:50:47,589][62475] Updated weights for policy 0, policy_version 83610 (0.0006) [2023-03-06 22:50:48,397][62475] Updated weights for policy 0, policy_version 83620 (0.0006) [2023-03-06 22:50:49,178][62475] Updated weights for policy 0, policy_version 83630 (0.0006) [2023-03-06 22:50:49,999][62475] Updated weights for policy 0, policy_version 83640 (0.0006) [2023-03-06 22:50:50,806][62475] Updated weights for policy 0, policy_version 83650 (0.0007) [2023-03-06 22:50:51,622][62475] Updated weights for policy 0, policy_version 83660 (0.0007) [2023-03-06 22:50:52,389][62145] Fps is (10 sec: 12697.8, 60 sec: 12714.7, 300 sec: 12715.0). Total num frames: 85677056. Throughput: 0: 12731.5. Samples: 85672984. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:50:52,390][62145] Avg episode reward: [(0, '546.796')] [2023-03-06 22:50:52,431][62475] Updated weights for policy 0, policy_version 83670 (0.0007) [2023-03-06 22:50:53,219][62475] Updated weights for policy 0, policy_version 83680 (0.0006) [2023-03-06 22:50:54,025][62475] Updated weights for policy 0, policy_version 83690 (0.0006) [2023-03-06 22:50:54,830][62475] Updated weights for policy 0, policy_version 83700 (0.0006) [2023-03-06 22:50:55,637][62475] Updated weights for policy 0, policy_version 83710 (0.0007) [2023-03-06 22:50:56,449][62475] Updated weights for policy 0, policy_version 83720 (0.0006) [2023-03-06 22:50:57,241][62475] Updated weights for policy 0, policy_version 83730 (0.0006) [2023-03-06 22:50:57,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12715.0). Total num frames: 85740544. Throughput: 0: 12731.6. Samples: 85711097. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:50:57,390][62145] Avg episode reward: [(0, '542.467')] [2023-03-06 22:50:58,054][62475] Updated weights for policy 0, policy_version 83740 (0.0006) [2023-03-06 22:50:58,859][62475] Updated weights for policy 0, policy_version 83750 (0.0006) [2023-03-06 22:50:59,650][62475] Updated weights for policy 0, policy_version 83760 (0.0006) [2023-03-06 22:51:00,453][62475] Updated weights for policy 0, policy_version 83770 (0.0006) [2023-03-06 22:51:01,252][62475] Updated weights for policy 0, policy_version 83780 (0.0007) [2023-03-06 22:51:02,058][62475] Updated weights for policy 0, policy_version 83790 (0.0005) [2023-03-06 22:51:02,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12731.7, 300 sec: 12718.4). Total num frames: 85805056. Throughput: 0: 12742.3. Samples: 85787633. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:51:02,390][62145] Avg episode reward: [(0, '509.769')] [2023-03-06 22:51:02,843][62475] Updated weights for policy 0, policy_version 83800 (0.0006) [2023-03-06 22:51:03,664][62475] Updated weights for policy 0, policy_version 83810 (0.0006) [2023-03-06 22:51:04,465][62475] Updated weights for policy 0, policy_version 83820 (0.0006) [2023-03-06 22:51:05,274][62475] Updated weights for policy 0, policy_version 83830 (0.0006) [2023-03-06 22:51:06,076][62475] Updated weights for policy 0, policy_version 83840 (0.0006) [2023-03-06 22:51:06,872][62475] Updated weights for policy 0, policy_version 83850 (0.0007) [2023-03-06 22:51:07,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12715.0). Total num frames: 85868544. Throughput: 0: 12742.2. Samples: 85864128. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:51:07,390][62145] Avg episode reward: [(0, '596.730')] [2023-03-06 22:51:07,673][62475] Updated weights for policy 0, policy_version 83860 (0.0006) [2023-03-06 22:51:08,486][62475] Updated weights for policy 0, policy_version 83870 (0.0006) [2023-03-06 22:51:09,286][62475] Updated weights for policy 0, policy_version 83880 (0.0006) [2023-03-06 22:51:10,081][62475] Updated weights for policy 0, policy_version 83890 (0.0006) [2023-03-06 22:51:10,878][62475] Updated weights for policy 0, policy_version 83900 (0.0006) [2023-03-06 22:51:11,698][62475] Updated weights for policy 0, policy_version 83910 (0.0007) [2023-03-06 22:51:12,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12718.4). Total num frames: 85932032. Throughput: 0: 12745.3. Samples: 85902442. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:51:12,390][62145] Avg episode reward: [(0, '723.315')] [2023-03-06 22:51:12,494][62475] Updated weights for policy 0, policy_version 83920 (0.0006) [2023-03-06 22:51:13,282][62475] Updated weights for policy 0, policy_version 83930 (0.0007) [2023-03-06 22:51:14,099][62475] Updated weights for policy 0, policy_version 83940 (0.0006) [2023-03-06 22:51:14,884][62475] Updated weights for policy 0, policy_version 83950 (0.0007) [2023-03-06 22:51:15,713][62475] Updated weights for policy 0, policy_version 83960 (0.0007) [2023-03-06 22:51:16,496][62475] Updated weights for policy 0, policy_version 83970 (0.0007) [2023-03-06 22:51:17,317][62475] Updated weights for policy 0, policy_version 83980 (0.0006) [2023-03-06 22:51:17,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12721.9). Total num frames: 85996544. Throughput: 0: 12738.0. Samples: 85978990. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:51:17,390][62145] Avg episode reward: [(0, '485.955')] [2023-03-06 22:51:18,109][62475] Updated weights for policy 0, policy_version 83990 (0.0006) [2023-03-06 22:51:18,918][62475] Updated weights for policy 0, policy_version 84000 (0.0006) [2023-03-06 22:51:19,720][62475] Updated weights for policy 0, policy_version 84010 (0.0006) [2023-03-06 22:51:20,497][62475] Updated weights for policy 0, policy_version 84020 (0.0006) [2023-03-06 22:51:21,297][62475] Updated weights for policy 0, policy_version 84030 (0.0006) [2023-03-06 22:51:22,119][62475] Updated weights for policy 0, policy_version 84040 (0.0007) [2023-03-06 22:51:22,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12748.8, 300 sec: 12721.9). Total num frames: 86060032. Throughput: 0: 12743.9. Samples: 86055690. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:51:22,390][62145] Avg episode reward: [(0, '670.154')] [2023-03-06 22:51:22,920][62475] Updated weights for policy 0, policy_version 84050 (0.0005) [2023-03-06 22:51:23,735][62475] Updated weights for policy 0, policy_version 84060 (0.0006) [2023-03-06 22:51:24,545][62475] Updated weights for policy 0, policy_version 84070 (0.0006) [2023-03-06 22:51:25,333][62475] Updated weights for policy 0, policy_version 84080 (0.0006) [2023-03-06 22:51:26,155][62475] Updated weights for policy 0, policy_version 84090 (0.0006) [2023-03-06 22:51:26,974][62475] Updated weights for policy 0, policy_version 84100 (0.0006) [2023-03-06 22:51:27,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12748.8, 300 sec: 12718.4). Total num frames: 86123520. Throughput: 0: 12738.5. Samples: 86093526. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:51:27,390][62145] Avg episode reward: [(0, '619.839')] [2023-03-06 22:51:27,783][62475] Updated weights for policy 0, policy_version 84110 (0.0006) [2023-03-06 22:51:28,584][62475] Updated weights for policy 0, policy_version 84120 (0.0007) [2023-03-06 22:51:29,382][62475] Updated weights for policy 0, policy_version 84130 (0.0007) [2023-03-06 22:51:30,186][62475] Updated weights for policy 0, policy_version 84140 (0.0006) [2023-03-06 22:51:30,983][62475] Updated weights for policy 0, policy_version 84150 (0.0006) [2023-03-06 22:51:31,777][62475] Updated weights for policy 0, policy_version 84160 (0.0006) [2023-03-06 22:51:32,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12718.4). Total num frames: 86187008. Throughput: 0: 12731.1. Samples: 86169768. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:51:32,390][62145] Avg episode reward: [(0, '516.510')] [2023-03-06 22:51:32,597][62475] Updated weights for policy 0, policy_version 84170 (0.0006) [2023-03-06 22:51:33,429][62475] Updated weights for policy 0, policy_version 84180 (0.0006) [2023-03-06 22:51:34,202][62475] Updated weights for policy 0, policy_version 84190 (0.0006) [2023-03-06 22:51:35,027][62475] Updated weights for policy 0, policy_version 84200 (0.0006) [2023-03-06 22:51:35,835][62475] Updated weights for policy 0, policy_version 84210 (0.0006) [2023-03-06 22:51:36,640][62475] Updated weights for policy 0, policy_version 84220 (0.0006) [2023-03-06 22:51:37,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12718.4). Total num frames: 86250496. Throughput: 0: 12733.8. Samples: 86246004. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:51:37,390][62145] Avg episode reward: [(0, '480.908')] [2023-03-06 22:51:37,445][62475] Updated weights for policy 0, policy_version 84230 (0.0006) [2023-03-06 22:51:38,253][62475] Updated weights for policy 0, policy_version 84240 (0.0007) [2023-03-06 22:51:39,073][62475] Updated weights for policy 0, policy_version 84250 (0.0007) [2023-03-06 22:51:39,862][62475] Updated weights for policy 0, policy_version 84260 (0.0006) [2023-03-06 22:51:40,667][62475] Updated weights for policy 0, policy_version 84270 (0.0006) [2023-03-06 22:51:41,473][62475] Updated weights for policy 0, policy_version 84280 (0.0006) [2023-03-06 22:51:42,276][62475] Updated weights for policy 0, policy_version 84290 (0.0007) [2023-03-06 22:51:42,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12715.0). Total num frames: 86313984. Throughput: 0: 12731.2. Samples: 86283999. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:51:42,390][62145] Avg episode reward: [(0, '628.702')] [2023-03-06 22:51:43,077][62475] Updated weights for policy 0, policy_version 84300 (0.0006) [2023-03-06 22:51:43,896][62475] Updated weights for policy 0, policy_version 84310 (0.0006) [2023-03-06 22:51:44,695][62475] Updated weights for policy 0, policy_version 84320 (0.0007) [2023-03-06 22:51:45,500][62475] Updated weights for policy 0, policy_version 84330 (0.0006) [2023-03-06 22:51:46,288][62475] Updated weights for policy 0, policy_version 84340 (0.0006) [2023-03-06 22:51:47,090][62475] Updated weights for policy 0, policy_version 84350 (0.0007) [2023-03-06 22:51:47,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12718.4). Total num frames: 86377472. Throughput: 0: 12732.4. Samples: 86360591. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:51:47,390][62145] Avg episode reward: [(0, '632.065')] [2023-03-06 22:51:47,904][62475] Updated weights for policy 0, policy_version 84360 (0.0006) [2023-03-06 22:51:48,681][62475] Updated weights for policy 0, policy_version 84370 (0.0006) [2023-03-06 22:51:49,486][62475] Updated weights for policy 0, policy_version 84380 (0.0006) [2023-03-06 22:51:50,285][62475] Updated weights for policy 0, policy_version 84390 (0.0006) [2023-03-06 22:51:51,107][62475] Updated weights for policy 0, policy_version 84400 (0.0007) [2023-03-06 22:51:51,900][62475] Updated weights for policy 0, policy_version 84410 (0.0006) [2023-03-06 22:51:52,389][62145] Fps is (10 sec: 12800.2, 60 sec: 12748.8, 300 sec: 12721.9). Total num frames: 86441984. Throughput: 0: 12735.3. Samples: 86437214. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:51:52,390][62145] Avg episode reward: [(0, '684.739')] [2023-03-06 22:51:52,710][62475] Updated weights for policy 0, policy_version 84420 (0.0006) [2023-03-06 22:51:53,528][62475] Updated weights for policy 0, policy_version 84430 (0.0006) [2023-03-06 22:51:54,314][62475] Updated weights for policy 0, policy_version 84440 (0.0006) [2023-03-06 22:51:55,116][62475] Updated weights for policy 0, policy_version 84450 (0.0006) [2023-03-06 22:51:55,917][62475] Updated weights for policy 0, policy_version 84460 (0.0006) [2023-03-06 22:51:56,722][62475] Updated weights for policy 0, policy_version 84470 (0.0007) [2023-03-06 22:51:57,389][62145] Fps is (10 sec: 12800.2, 60 sec: 12748.8, 300 sec: 12718.4). Total num frames: 86505472. Throughput: 0: 12733.0. Samples: 86475427. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:51:57,390][62145] Avg episode reward: [(0, '925.479')] [2023-03-06 22:51:57,525][62475] Updated weights for policy 0, policy_version 84480 (0.0006) [2023-03-06 22:51:58,344][62475] Updated weights for policy 0, policy_version 84490 (0.0006) [2023-03-06 22:51:59,129][62475] Updated weights for policy 0, policy_version 84500 (0.0006) [2023-03-06 22:51:59,953][62475] Updated weights for policy 0, policy_version 84510 (0.0006) [2023-03-06 22:52:00,766][62475] Updated weights for policy 0, policy_version 84520 (0.0006) [2023-03-06 22:52:01,561][62475] Updated weights for policy 0, policy_version 84530 (0.0006) [2023-03-06 22:52:02,374][62475] Updated weights for policy 0, policy_version 84540 (0.0006) [2023-03-06 22:52:02,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12721.9). Total num frames: 86568960. Throughput: 0: 12724.3. Samples: 86551584. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:52:02,390][62145] Avg episode reward: [(0, '865.179')] [2023-03-06 22:52:03,166][62475] Updated weights for policy 0, policy_version 84550 (0.0006) [2023-03-06 22:52:03,969][62475] Updated weights for policy 0, policy_version 84560 (0.0006) [2023-03-06 22:52:04,779][62475] Updated weights for policy 0, policy_version 84570 (0.0006) [2023-03-06 22:52:05,579][62475] Updated weights for policy 0, policy_version 84580 (0.0006) [2023-03-06 22:52:06,370][62475] Updated weights for policy 0, policy_version 84590 (0.0006) [2023-03-06 22:52:07,181][62475] Updated weights for policy 0, policy_version 84600 (0.0006) [2023-03-06 22:52:07,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.8, 300 sec: 12721.9). Total num frames: 86632448. Throughput: 0: 12722.5. Samples: 86628204. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:52:07,390][62145] Avg episode reward: [(0, '573.086')] [2023-03-06 22:52:07,985][62475] Updated weights for policy 0, policy_version 84610 (0.0006) [2023-03-06 22:52:08,796][62475] Updated weights for policy 0, policy_version 84620 (0.0006) [2023-03-06 22:52:09,612][62475] Updated weights for policy 0, policy_version 84630 (0.0007) [2023-03-06 22:52:10,427][62475] Updated weights for policy 0, policy_version 84640 (0.0006) [2023-03-06 22:52:11,231][62475] Updated weights for policy 0, policy_version 84650 (0.0006) [2023-03-06 22:52:12,019][62475] Updated weights for policy 0, policy_version 84660 (0.0006) [2023-03-06 22:52:12,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12718.4). Total num frames: 86695936. Throughput: 0: 12725.9. Samples: 86666192. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:52:12,390][62145] Avg episode reward: [(0, '728.265')] [2023-03-06 22:52:12,831][62475] Updated weights for policy 0, policy_version 84670 (0.0006) [2023-03-06 22:52:13,618][62475] Updated weights for policy 0, policy_version 84680 (0.0007) [2023-03-06 22:52:14,451][62475] Updated weights for policy 0, policy_version 84690 (0.0006) [2023-03-06 22:52:15,252][62475] Updated weights for policy 0, policy_version 84700 (0.0006) [2023-03-06 22:52:16,051][62475] Updated weights for policy 0, policy_version 84710 (0.0006) [2023-03-06 22:52:16,857][62475] Updated weights for policy 0, policy_version 84720 (0.0007) [2023-03-06 22:52:17,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12718.4). Total num frames: 86759424. Throughput: 0: 12726.2. Samples: 86742444. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:52:17,390][62145] Avg episode reward: [(0, '631.848')] [2023-03-06 22:52:17,676][62475] Updated weights for policy 0, policy_version 84730 (0.0006) [2023-03-06 22:52:18,486][62475] Updated weights for policy 0, policy_version 84740 (0.0006) [2023-03-06 22:52:19,282][62475] Updated weights for policy 0, policy_version 84750 (0.0006) [2023-03-06 22:52:20,077][62475] Updated weights for policy 0, policy_version 84760 (0.0006) [2023-03-06 22:52:20,888][62475] Updated weights for policy 0, policy_version 84770 (0.0007) [2023-03-06 22:52:21,674][62475] Updated weights for policy 0, policy_version 84780 (0.0006) [2023-03-06 22:52:22,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.6, 300 sec: 12718.4). Total num frames: 86822912. Throughput: 0: 12731.2. Samples: 86818906. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:52:22,390][62145] Avg episode reward: [(0, '990.268')] [2023-03-06 22:52:22,394][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000084788_86822912.pth... [2023-03-06 22:52:22,425][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000081807_83770368.pth [2023-03-06 22:52:22,496][62475] Updated weights for policy 0, policy_version 84790 (0.0007) [2023-03-06 22:52:23,288][62475] Updated weights for policy 0, policy_version 84800 (0.0006) [2023-03-06 22:52:24,117][62475] Updated weights for policy 0, policy_version 84810 (0.0007) [2023-03-06 22:52:24,905][62475] Updated weights for policy 0, policy_version 84820 (0.0006) [2023-03-06 22:52:25,709][62475] Updated weights for policy 0, policy_version 84830 (0.0007) [2023-03-06 22:52:26,523][62475] Updated weights for policy 0, policy_version 84840 (0.0006) [2023-03-06 22:52:27,311][62475] Updated weights for policy 0, policy_version 84850 (0.0006) [2023-03-06 22:52:27,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12718.4). Total num frames: 86886400. Throughput: 0: 12731.3. Samples: 86856906. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:52:27,390][62145] Avg episode reward: [(0, '770.686')] [2023-03-06 22:52:28,130][62475] Updated weights for policy 0, policy_version 84860 (0.0007) [2023-03-06 22:52:28,931][62475] Updated weights for policy 0, policy_version 84870 (0.0006) [2023-03-06 22:52:29,731][62475] Updated weights for policy 0, policy_version 84880 (0.0006) [2023-03-06 22:52:30,528][62475] Updated weights for policy 0, policy_version 84890 (0.0007) [2023-03-06 22:52:31,333][62475] Updated weights for policy 0, policy_version 84900 (0.0006) [2023-03-06 22:52:32,133][62475] Updated weights for policy 0, policy_version 84910 (0.0006) [2023-03-06 22:52:32,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12718.4). Total num frames: 86950912. Throughput: 0: 12728.4. Samples: 86933366. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:52:32,390][62145] Avg episode reward: [(0, '838.550')] [2023-03-06 22:52:32,942][62475] Updated weights for policy 0, policy_version 84920 (0.0006) [2023-03-06 22:52:33,754][62475] Updated weights for policy 0, policy_version 84930 (0.0006) [2023-03-06 22:52:34,565][62475] Updated weights for policy 0, policy_version 84940 (0.0006) [2023-03-06 22:52:35,376][62475] Updated weights for policy 0, policy_version 84950 (0.0007) [2023-03-06 22:52:36,174][62475] Updated weights for policy 0, policy_version 84960 (0.0006) [2023-03-06 22:52:36,992][62475] Updated weights for policy 0, policy_version 84970 (0.0008) [2023-03-06 22:52:37,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12718.4). Total num frames: 87013376. Throughput: 0: 12718.2. Samples: 87009534. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:52:37,390][62145] Avg episode reward: [(0, '820.653')] [2023-03-06 22:52:37,796][62475] Updated weights for policy 0, policy_version 84980 (0.0007) [2023-03-06 22:52:38,606][62475] Updated weights for policy 0, policy_version 84990 (0.0006) [2023-03-06 22:52:39,406][62475] Updated weights for policy 0, policy_version 85000 (0.0006) [2023-03-06 22:52:40,228][62475] Updated weights for policy 0, policy_version 85010 (0.0006) [2023-03-06 22:52:41,017][62475] Updated weights for policy 0, policy_version 85020 (0.0006) [2023-03-06 22:52:41,830][62475] Updated weights for policy 0, policy_version 85030 (0.0006) [2023-03-06 22:52:42,390][62145] Fps is (10 sec: 12595.2, 60 sec: 12714.7, 300 sec: 12718.4). Total num frames: 87076864. Throughput: 0: 12715.0. Samples: 87047604. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:52:42,390][62145] Avg episode reward: [(0, '981.449')] [2023-03-06 22:52:42,635][62475] Updated weights for policy 0, policy_version 85040 (0.0006) [2023-03-06 22:52:43,435][62475] Updated weights for policy 0, policy_version 85050 (0.0006) [2023-03-06 22:52:44,215][62475] Updated weights for policy 0, policy_version 85060 (0.0006) [2023-03-06 22:52:45,070][62475] Updated weights for policy 0, policy_version 85070 (0.0006) [2023-03-06 22:52:45,882][62475] Updated weights for policy 0, policy_version 85080 (0.0006) [2023-03-06 22:52:46,668][62475] Updated weights for policy 0, policy_version 85090 (0.0005) [2023-03-06 22:52:47,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12731.7, 300 sec: 12721.9). Total num frames: 87141376. Throughput: 0: 12714.3. Samples: 87123726. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:52:47,390][62145] Avg episode reward: [(0, '666.443')] [2023-03-06 22:52:47,485][62475] Updated weights for policy 0, policy_version 85100 (0.0006) [2023-03-06 22:52:48,288][62475] Updated weights for policy 0, policy_version 85110 (0.0006) [2023-03-06 22:52:49,092][62475] Updated weights for policy 0, policy_version 85120 (0.0006) [2023-03-06 22:52:49,897][62475] Updated weights for policy 0, policy_version 85130 (0.0006) [2023-03-06 22:52:50,703][62475] Updated weights for policy 0, policy_version 85140 (0.0006) [2023-03-06 22:52:51,516][62475] Updated weights for policy 0, policy_version 85150 (0.0006) [2023-03-06 22:52:52,313][62475] Updated weights for policy 0, policy_version 85160 (0.0007) [2023-03-06 22:52:52,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12715.0). Total num frames: 87203840. Throughput: 0: 12704.8. Samples: 87199921. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:52:52,390][62145] Avg episode reward: [(0, '785.754')] [2023-03-06 22:52:53,117][62475] Updated weights for policy 0, policy_version 85170 (0.0006) [2023-03-06 22:52:53,913][62475] Updated weights for policy 0, policy_version 85180 (0.0006) [2023-03-06 22:52:54,710][62475] Updated weights for policy 0, policy_version 85190 (0.0006) [2023-03-06 22:52:55,518][62475] Updated weights for policy 0, policy_version 85200 (0.0006) [2023-03-06 22:52:56,325][62475] Updated weights for policy 0, policy_version 85210 (0.0006) [2023-03-06 22:52:57,149][62475] Updated weights for policy 0, policy_version 85220 (0.0007) [2023-03-06 22:52:57,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12718.4). Total num frames: 87268352. Throughput: 0: 12714.1. Samples: 87238327. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:52:57,390][62145] Avg episode reward: [(0, '583.710')] [2023-03-06 22:52:57,932][62475] Updated weights for policy 0, policy_version 85230 (0.0006) [2023-03-06 22:52:58,749][62475] Updated weights for policy 0, policy_version 85240 (0.0006) [2023-03-06 22:52:59,537][62475] Updated weights for policy 0, policy_version 85250 (0.0006) [2023-03-06 22:53:00,337][62475] Updated weights for policy 0, policy_version 85260 (0.0007) [2023-03-06 22:53:01,157][62475] Updated weights for policy 0, policy_version 85270 (0.0006) [2023-03-06 22:53:01,934][62475] Updated weights for policy 0, policy_version 85280 (0.0006) [2023-03-06 22:53:02,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12714.7, 300 sec: 12721.9). Total num frames: 87331840. Throughput: 0: 12715.2. Samples: 87314627. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:53:02,390][62145] Avg episode reward: [(0, '888.227')] [2023-03-06 22:53:02,748][62475] Updated weights for policy 0, policy_version 85290 (0.0007) [2023-03-06 22:53:03,557][62475] Updated weights for policy 0, policy_version 85300 (0.0007) [2023-03-06 22:53:04,358][62475] Updated weights for policy 0, policy_version 85310 (0.0006) [2023-03-06 22:53:05,164][62475] Updated weights for policy 0, policy_version 85320 (0.0006) [2023-03-06 22:53:05,974][62475] Updated weights for policy 0, policy_version 85330 (0.0006) [2023-03-06 22:53:06,772][62475] Updated weights for policy 0, policy_version 85340 (0.0007) [2023-03-06 22:53:07,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12718.4). Total num frames: 87395328. Throughput: 0: 12718.0. Samples: 87391214. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:53:07,390][62145] Avg episode reward: [(0, '626.210')] [2023-03-06 22:53:07,566][62475] Updated weights for policy 0, policy_version 85350 (0.0006) [2023-03-06 22:53:08,378][62475] Updated weights for policy 0, policy_version 85360 (0.0006) [2023-03-06 22:53:09,175][62475] Updated weights for policy 0, policy_version 85370 (0.0006) [2023-03-06 22:53:09,969][62475] Updated weights for policy 0, policy_version 85380 (0.0006) [2023-03-06 22:53:10,774][62475] Updated weights for policy 0, policy_version 85390 (0.0006) [2023-03-06 22:53:11,600][62475] Updated weights for policy 0, policy_version 85400 (0.0006) [2023-03-06 22:53:12,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12718.4). Total num frames: 87458816. Throughput: 0: 12722.9. Samples: 87429437. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:53:12,390][62145] Avg episode reward: [(0, '637.969')] [2023-03-06 22:53:12,400][62475] Updated weights for policy 0, policy_version 85410 (0.0007) [2023-03-06 22:53:13,190][62475] Updated weights for policy 0, policy_version 85420 (0.0006) [2023-03-06 22:53:13,991][62475] Updated weights for policy 0, policy_version 85430 (0.0006) [2023-03-06 22:53:14,793][62475] Updated weights for policy 0, policy_version 85440 (0.0006) [2023-03-06 22:53:15,609][62475] Updated weights for policy 0, policy_version 85450 (0.0006) [2023-03-06 22:53:16,409][62475] Updated weights for policy 0, policy_version 85460 (0.0006) [2023-03-06 22:53:17,203][62475] Updated weights for policy 0, policy_version 85470 (0.0006) [2023-03-06 22:53:17,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12721.9). Total num frames: 87523328. Throughput: 0: 12724.5. Samples: 87505969. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:53:17,400][62145] Avg episode reward: [(0, '671.049')] [2023-03-06 22:53:18,016][62475] Updated weights for policy 0, policy_version 85480 (0.0007) [2023-03-06 22:53:18,817][62475] Updated weights for policy 0, policy_version 85490 (0.0006) [2023-03-06 22:53:19,625][62475] Updated weights for policy 0, policy_version 85500 (0.0006) [2023-03-06 22:53:20,425][62475] Updated weights for policy 0, policy_version 85510 (0.0006) [2023-03-06 22:53:21,228][62475] Updated weights for policy 0, policy_version 85520 (0.0007) [2023-03-06 22:53:22,039][62475] Updated weights for policy 0, policy_version 85530 (0.0006) [2023-03-06 22:53:22,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12731.7, 300 sec: 12721.9). Total num frames: 87586816. Throughput: 0: 12731.5. Samples: 87582451. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:53:22,401][62145] Avg episode reward: [(0, '631.292')] [2023-03-06 22:53:22,822][62475] Updated weights for policy 0, policy_version 85540 (0.0007) [2023-03-06 22:53:23,626][62475] Updated weights for policy 0, policy_version 85550 (0.0006) [2023-03-06 22:53:24,432][62475] Updated weights for policy 0, policy_version 85560 (0.0006) [2023-03-06 22:53:25,254][62475] Updated weights for policy 0, policy_version 85570 (0.0006) [2023-03-06 22:53:26,054][62475] Updated weights for policy 0, policy_version 85580 (0.0006) [2023-03-06 22:53:26,849][62475] Updated weights for policy 0, policy_version 85590 (0.0006) [2023-03-06 22:53:27,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12721.9). Total num frames: 87650304. Throughput: 0: 12737.5. Samples: 87620792. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:53:27,401][62145] Avg episode reward: [(0, '731.531')] [2023-03-06 22:53:27,637][62475] Updated weights for policy 0, policy_version 85600 (0.0007) [2023-03-06 22:53:28,454][62475] Updated weights for policy 0, policy_version 85610 (0.0007) [2023-03-06 22:53:29,241][62475] Updated weights for policy 0, policy_version 85620 (0.0006) [2023-03-06 22:53:30,043][62475] Updated weights for policy 0, policy_version 85630 (0.0006) [2023-03-06 22:53:30,840][62475] Updated weights for policy 0, policy_version 85640 (0.0006) [2023-03-06 22:53:31,641][62475] Updated weights for policy 0, policy_version 85650 (0.0006) [2023-03-06 22:53:32,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 87714816. Throughput: 0: 12750.9. Samples: 87697514. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:53:32,401][62145] Avg episode reward: [(0, '646.127')] [2023-03-06 22:53:32,441][62475] Updated weights for policy 0, policy_version 85660 (0.0006) [2023-03-06 22:53:33,256][62475] Updated weights for policy 0, policy_version 85670 (0.0006) [2023-03-06 22:53:34,062][62475] Updated weights for policy 0, policy_version 85680 (0.0006) [2023-03-06 22:53:34,873][62475] Updated weights for policy 0, policy_version 85690 (0.0008) [2023-03-06 22:53:35,654][62475] Updated weights for policy 0, policy_version 85700 (0.0006) [2023-03-06 22:53:36,484][62475] Updated weights for policy 0, policy_version 85710 (0.0006) [2023-03-06 22:53:37,278][62475] Updated weights for policy 0, policy_version 85720 (0.0006) [2023-03-06 22:53:37,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12725.4). Total num frames: 87778304. Throughput: 0: 12754.0. Samples: 87773850. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:53:37,400][62145] Avg episode reward: [(0, '535.555')] [2023-03-06 22:53:38,085][62475] Updated weights for policy 0, policy_version 85730 (0.0006) [2023-03-06 22:53:38,900][62475] Updated weights for policy 0, policy_version 85740 (0.0007) [2023-03-06 22:53:39,709][62475] Updated weights for policy 0, policy_version 85750 (0.0006) [2023-03-06 22:53:40,515][62475] Updated weights for policy 0, policy_version 85760 (0.0006) [2023-03-06 22:53:41,319][62475] Updated weights for policy 0, policy_version 85770 (0.0006) [2023-03-06 22:53:42,115][62475] Updated weights for policy 0, policy_version 85780 (0.0006) [2023-03-06 22:53:42,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12748.8, 300 sec: 12725.4). Total num frames: 87841792. Throughput: 0: 12747.8. Samples: 87811980. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:53:42,401][62145] Avg episode reward: [(0, '671.970')] [2023-03-06 22:53:42,926][62475] Updated weights for policy 0, policy_version 85790 (0.0007) [2023-03-06 22:53:43,710][62475] Updated weights for policy 0, policy_version 85800 (0.0007) [2023-03-06 22:53:44,519][62475] Updated weights for policy 0, policy_version 85810 (0.0006) [2023-03-06 22:53:45,328][62475] Updated weights for policy 0, policy_version 85820 (0.0006) [2023-03-06 22:53:46,142][62475] Updated weights for policy 0, policy_version 85830 (0.0006) [2023-03-06 22:53:46,937][62475] Updated weights for policy 0, policy_version 85840 (0.0006) [2023-03-06 22:53:47,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.8, 300 sec: 12725.4). Total num frames: 87905280. Throughput: 0: 12749.5. Samples: 87888356. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:53:47,400][62145] Avg episode reward: [(0, '653.425')] [2023-03-06 22:53:47,750][62475] Updated weights for policy 0, policy_version 85850 (0.0006) [2023-03-06 22:53:48,546][62475] Updated weights for policy 0, policy_version 85860 (0.0005) [2023-03-06 22:53:49,338][62475] Updated weights for policy 0, policy_version 85870 (0.0007) [2023-03-06 22:53:50,133][62475] Updated weights for policy 0, policy_version 85880 (0.0006) [2023-03-06 22:53:50,942][62475] Updated weights for policy 0, policy_version 85890 (0.0007) [2023-03-06 22:53:51,751][62475] Updated weights for policy 0, policy_version 85900 (0.0006) [2023-03-06 22:53:52,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12748.8, 300 sec: 12725.4). Total num frames: 87968768. Throughput: 0: 12746.7. Samples: 87964815. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:53:52,390][62145] Avg episode reward: [(0, '608.634')] [2023-03-06 22:53:52,570][62475] Updated weights for policy 0, policy_version 85910 (0.0006) [2023-03-06 22:53:53,374][62475] Updated weights for policy 0, policy_version 85920 (0.0006) [2023-03-06 22:53:54,175][62475] Updated weights for policy 0, policy_version 85930 (0.0006) [2023-03-06 22:53:54,993][62475] Updated weights for policy 0, policy_version 85940 (0.0007) [2023-03-06 22:53:55,822][62475] Updated weights for policy 0, policy_version 85950 (0.0006) [2023-03-06 22:53:56,608][62475] Updated weights for policy 0, policy_version 85960 (0.0006) [2023-03-06 22:53:57,389][62145] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 88032256. Throughput: 0: 12740.0. Samples: 88002736. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:53:57,390][62145] Avg episode reward: [(0, '571.492')] [2023-03-06 22:53:57,429][62475] Updated weights for policy 0, policy_version 85970 (0.0006) [2023-03-06 22:53:58,242][62475] Updated weights for policy 0, policy_version 85980 (0.0006) [2023-03-06 22:53:59,065][62475] Updated weights for policy 0, policy_version 85990 (0.0006) [2023-03-06 22:53:59,860][62475] Updated weights for policy 0, policy_version 86000 (0.0006) [2023-03-06 22:54:00,663][62475] Updated weights for policy 0, policy_version 86010 (0.0006) [2023-03-06 22:54:01,452][62475] Updated weights for policy 0, policy_version 86020 (0.0007) [2023-03-06 22:54:02,266][62475] Updated weights for policy 0, policy_version 86030 (0.0006) [2023-03-06 22:54:02,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 88095744. Throughput: 0: 12728.9. Samples: 88078767. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:54:02,390][62145] Avg episode reward: [(0, '706.772')] [2023-03-06 22:54:03,053][62475] Updated weights for policy 0, policy_version 86040 (0.0006) [2023-03-06 22:54:03,842][62475] Updated weights for policy 0, policy_version 86050 (0.0007) [2023-03-06 22:54:04,678][62475] Updated weights for policy 0, policy_version 86060 (0.0006) [2023-03-06 22:54:05,474][62475] Updated weights for policy 0, policy_version 86070 (0.0006) [2023-03-06 22:54:06,281][62475] Updated weights for policy 0, policy_version 86080 (0.0007) [2023-03-06 22:54:07,082][62475] Updated weights for policy 0, policy_version 86090 (0.0006) [2023-03-06 22:54:07,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 88159232. Throughput: 0: 12728.0. Samples: 88155214. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:54:07,390][62145] Avg episode reward: [(0, '571.274')] [2023-03-06 22:54:07,884][62475] Updated weights for policy 0, policy_version 86100 (0.0007) [2023-03-06 22:54:08,712][62475] Updated weights for policy 0, policy_version 86110 (0.0009) [2023-03-06 22:54:09,523][62475] Updated weights for policy 0, policy_version 86120 (0.0006) [2023-03-06 22:54:10,330][62475] Updated weights for policy 0, policy_version 86130 (0.0007) [2023-03-06 22:54:11,127][62475] Updated weights for policy 0, policy_version 86140 (0.0006) [2023-03-06 22:54:11,917][62475] Updated weights for policy 0, policy_version 86150 (0.0006) [2023-03-06 22:54:12,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 88222720. Throughput: 0: 12718.2. Samples: 88193113. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:54:12,390][62145] Avg episode reward: [(0, '729.125')] [2023-03-06 22:54:12,750][62475] Updated weights for policy 0, policy_version 86160 (0.0006) [2023-03-06 22:54:13,542][62475] Updated weights for policy 0, policy_version 86170 (0.0006) [2023-03-06 22:54:14,341][62475] Updated weights for policy 0, policy_version 86180 (0.0006) [2023-03-06 22:54:15,154][62475] Updated weights for policy 0, policy_version 86190 (0.0006) [2023-03-06 22:54:15,955][62475] Updated weights for policy 0, policy_version 86200 (0.0006) [2023-03-06 22:54:16,765][62475] Updated weights for policy 0, policy_version 86210 (0.0006) [2023-03-06 22:54:17,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 88286208. Throughput: 0: 12709.9. Samples: 88269459. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:54:17,390][62145] Avg episode reward: [(0, '588.069')] [2023-03-06 22:54:17,574][62475] Updated weights for policy 0, policy_version 86220 (0.0006) [2023-03-06 22:54:18,375][62475] Updated weights for policy 0, policy_version 86230 (0.0006) [2023-03-06 22:54:19,174][62475] Updated weights for policy 0, policy_version 86240 (0.0006) [2023-03-06 22:54:19,969][62475] Updated weights for policy 0, policy_version 86250 (0.0006) [2023-03-06 22:54:20,780][62475] Updated weights for policy 0, policy_version 86260 (0.0006) [2023-03-06 22:54:21,576][62475] Updated weights for policy 0, policy_version 86270 (0.0006) [2023-03-06 22:54:22,372][62475] Updated weights for policy 0, policy_version 86280 (0.0006) [2023-03-06 22:54:22,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 88350720. Throughput: 0: 12716.8. Samples: 88346105. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:54:22,390][62145] Avg episode reward: [(0, '606.270')] [2023-03-06 22:54:22,394][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000086280_88350720.pth... [2023-03-06 22:54:22,424][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000083296_85295104.pth [2023-03-06 22:54:23,182][62475] Updated weights for policy 0, policy_version 86290 (0.0006) [2023-03-06 22:54:23,974][62475] Updated weights for policy 0, policy_version 86300 (0.0006) [2023-03-06 22:54:24,766][62475] Updated weights for policy 0, policy_version 86310 (0.0006) [2023-03-06 22:54:25,582][62475] Updated weights for policy 0, policy_version 86320 (0.0006) [2023-03-06 22:54:26,377][62475] Updated weights for policy 0, policy_version 86330 (0.0006) [2023-03-06 22:54:27,193][62475] Updated weights for policy 0, policy_version 86340 (0.0006) [2023-03-06 22:54:27,390][62145] Fps is (10 sec: 12800.1, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 88414208. Throughput: 0: 12722.8. Samples: 88384507. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:54:27,390][62145] Avg episode reward: [(0, '676.137')] [2023-03-06 22:54:28,011][62475] Updated weights for policy 0, policy_version 86350 (0.0006) [2023-03-06 22:54:28,800][62475] Updated weights for policy 0, policy_version 86360 (0.0006) [2023-03-06 22:54:29,623][62475] Updated weights for policy 0, policy_version 86370 (0.0006) [2023-03-06 22:54:30,422][62475] Updated weights for policy 0, policy_version 86380 (0.0006) [2023-03-06 22:54:31,241][62475] Updated weights for policy 0, policy_version 86390 (0.0007) [2023-03-06 22:54:32,061][62475] Updated weights for policy 0, policy_version 86400 (0.0006) [2023-03-06 22:54:32,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12728.8). Total num frames: 88477696. Throughput: 0: 12710.3. Samples: 88460322. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:54:32,390][62145] Avg episode reward: [(0, '635.172')] [2023-03-06 22:54:32,866][62475] Updated weights for policy 0, policy_version 86410 (0.0006) [2023-03-06 22:54:33,672][62475] Updated weights for policy 0, policy_version 86420 (0.0006) [2023-03-06 22:54:34,470][62475] Updated weights for policy 0, policy_version 86430 (0.0006) [2023-03-06 22:54:35,280][62475] Updated weights for policy 0, policy_version 86440 (0.0006) [2023-03-06 22:54:36,091][62475] Updated weights for policy 0, policy_version 86450 (0.0006) [2023-03-06 22:54:36,898][62475] Updated weights for policy 0, policy_version 86460 (0.0007) [2023-03-06 22:54:37,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.6, 300 sec: 12728.8). Total num frames: 88541184. Throughput: 0: 12706.2. Samples: 88536593. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:54:37,390][62145] Avg episode reward: [(0, '754.099')] [2023-03-06 22:54:37,691][62475] Updated weights for policy 0, policy_version 86470 (0.0006) [2023-03-06 22:54:38,487][62475] Updated weights for policy 0, policy_version 86480 (0.0005) [2023-03-06 22:54:39,297][62475] Updated weights for policy 0, policy_version 86490 (0.0005) [2023-03-06 22:54:40,094][62475] Updated weights for policy 0, policy_version 86500 (0.0006) [2023-03-06 22:54:40,900][62475] Updated weights for policy 0, policy_version 86510 (0.0007) [2023-03-06 22:54:41,708][62475] Updated weights for policy 0, policy_version 86520 (0.0006) [2023-03-06 22:54:42,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12728.8). Total num frames: 88604672. Throughput: 0: 12713.8. Samples: 88574859. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:54:42,390][62145] Avg episode reward: [(0, '635.623')] [2023-03-06 22:54:42,498][62475] Updated weights for policy 0, policy_version 86530 (0.0006) [2023-03-06 22:54:43,314][62475] Updated weights for policy 0, policy_version 86540 (0.0006) [2023-03-06 22:54:44,110][62475] Updated weights for policy 0, policy_version 86550 (0.0006) [2023-03-06 22:54:44,918][62475] Updated weights for policy 0, policy_version 86560 (0.0006) [2023-03-06 22:54:45,722][62475] Updated weights for policy 0, policy_version 86570 (0.0007) [2023-03-06 22:54:46,516][62475] Updated weights for policy 0, policy_version 86580 (0.0007) [2023-03-06 22:54:47,338][62475] Updated weights for policy 0, policy_version 86590 (0.0007) [2023-03-06 22:54:47,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 88668160. Throughput: 0: 12723.3. Samples: 88651315. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:54:47,390][62145] Avg episode reward: [(0, '820.711')] [2023-03-06 22:54:48,128][62475] Updated weights for policy 0, policy_version 86600 (0.0006) [2023-03-06 22:54:48,943][62475] Updated weights for policy 0, policy_version 86610 (0.0006) [2023-03-06 22:54:49,743][62475] Updated weights for policy 0, policy_version 86620 (0.0007) [2023-03-06 22:54:50,531][62475] Updated weights for policy 0, policy_version 86630 (0.0006) [2023-03-06 22:54:51,350][62475] Updated weights for policy 0, policy_version 86640 (0.0006) [2023-03-06 22:54:52,155][62475] Updated weights for policy 0, policy_version 86650 (0.0007) [2023-03-06 22:54:52,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12728.8). Total num frames: 88731648. Throughput: 0: 12725.7. Samples: 88727869. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:54:52,390][62145] Avg episode reward: [(0, '749.526')] [2023-03-06 22:54:52,954][62475] Updated weights for policy 0, policy_version 86660 (0.0006) [2023-03-06 22:54:53,755][62475] Updated weights for policy 0, policy_version 86670 (0.0006) [2023-03-06 22:54:54,559][62475] Updated weights for policy 0, policy_version 86680 (0.0005) [2023-03-06 22:54:55,367][62475] Updated weights for policy 0, policy_version 86690 (0.0007) [2023-03-06 22:54:56,170][62475] Updated weights for policy 0, policy_version 86700 (0.0006) [2023-03-06 22:54:56,970][62475] Updated weights for policy 0, policy_version 86710 (0.0006) [2023-03-06 22:54:57,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 88796160. Throughput: 0: 12730.3. Samples: 88765978. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:54:57,390][62145] Avg episode reward: [(0, '750.790')] [2023-03-06 22:54:57,776][62475] Updated weights for policy 0, policy_version 86720 (0.0006) [2023-03-06 22:54:58,580][62475] Updated weights for policy 0, policy_version 86730 (0.0006) [2023-03-06 22:54:59,370][62475] Updated weights for policy 0, policy_version 86740 (0.0006) [2023-03-06 22:55:00,185][62475] Updated weights for policy 0, policy_version 86750 (0.0006) [2023-03-06 22:55:00,984][62475] Updated weights for policy 0, policy_version 86760 (0.0006) [2023-03-06 22:55:01,776][62475] Updated weights for policy 0, policy_version 86770 (0.0006) [2023-03-06 22:55:02,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 88859648. Throughput: 0: 12739.0. Samples: 88842714. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:55:02,390][62145] Avg episode reward: [(0, '574.031')] [2023-03-06 22:55:02,577][62475] Updated weights for policy 0, policy_version 86780 (0.0007) [2023-03-06 22:55:03,382][62475] Updated weights for policy 0, policy_version 86790 (0.0005) [2023-03-06 22:55:04,183][62475] Updated weights for policy 0, policy_version 86800 (0.0006) [2023-03-06 22:55:04,994][62475] Updated weights for policy 0, policy_version 86810 (0.0006) [2023-03-06 22:55:05,824][62475] Updated weights for policy 0, policy_version 86820 (0.0006) [2023-03-06 22:55:06,645][62475] Updated weights for policy 0, policy_version 86830 (0.0006) [2023-03-06 22:55:07,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 88923136. Throughput: 0: 12724.0. Samples: 88918684. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:55:07,390][62145] Avg episode reward: [(0, '694.184')] [2023-03-06 22:55:07,437][62475] Updated weights for policy 0, policy_version 86840 (0.0006) [2023-03-06 22:55:08,238][62475] Updated weights for policy 0, policy_version 86850 (0.0006) [2023-03-06 22:55:09,058][62475] Updated weights for policy 0, policy_version 86860 (0.0006) [2023-03-06 22:55:09,845][62475] Updated weights for policy 0, policy_version 86870 (0.0006) [2023-03-06 22:55:10,655][62475] Updated weights for policy 0, policy_version 86880 (0.0006) [2023-03-06 22:55:11,469][62475] Updated weights for policy 0, policy_version 86890 (0.0006) [2023-03-06 22:55:12,264][62475] Updated weights for policy 0, policy_version 86900 (0.0006) [2023-03-06 22:55:12,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 88986624. Throughput: 0: 12720.6. Samples: 88956934. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:55:12,390][62145] Avg episode reward: [(0, '677.696')] [2023-03-06 22:55:13,050][62475] Updated weights for policy 0, policy_version 86910 (0.0006) [2023-03-06 22:55:13,845][62475] Updated weights for policy 0, policy_version 86920 (0.0006) [2023-03-06 22:55:14,648][62475] Updated weights for policy 0, policy_version 86930 (0.0006) [2023-03-06 22:55:15,464][62475] Updated weights for policy 0, policy_version 86940 (0.0006) [2023-03-06 22:55:16,229][62475] Updated weights for policy 0, policy_version 86950 (0.0007) [2023-03-06 22:55:17,049][62475] Updated weights for policy 0, policy_version 86960 (0.0006) [2023-03-06 22:55:17,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12748.8, 300 sec: 12732.3). Total num frames: 89051136. Throughput: 0: 12742.8. Samples: 89033745. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 22:55:17,390][62145] Avg episode reward: [(0, '627.518')] [2023-03-06 22:55:17,831][62475] Updated weights for policy 0, policy_version 86970 (0.0007) [2023-03-06 22:55:18,644][62475] Updated weights for policy 0, policy_version 86980 (0.0006) [2023-03-06 22:55:19,449][62475] Updated weights for policy 0, policy_version 86990 (0.0007) [2023-03-06 22:55:20,231][62475] Updated weights for policy 0, policy_version 87000 (0.0006) [2023-03-06 22:55:21,060][62475] Updated weights for policy 0, policy_version 87010 (0.0006) [2023-03-06 22:55:21,852][62475] Updated weights for policy 0, policy_version 87020 (0.0006) [2023-03-06 22:55:22,390][62145] Fps is (10 sec: 12799.8, 60 sec: 12731.7, 300 sec: 12732.3). Total num frames: 89114624. Throughput: 0: 12755.2. Samples: 89110578. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:55:22,390][62145] Avg episode reward: [(0, '610.374')] [2023-03-06 22:55:22,649][62475] Updated weights for policy 0, policy_version 87030 (0.0006) [2023-03-06 22:55:23,448][62475] Updated weights for policy 0, policy_version 87040 (0.0006) [2023-03-06 22:55:24,253][62475] Updated weights for policy 0, policy_version 87050 (0.0006) [2023-03-06 22:55:25,050][62475] Updated weights for policy 0, policy_version 87060 (0.0006) [2023-03-06 22:55:25,849][62475] Updated weights for policy 0, policy_version 87070 (0.0006) [2023-03-06 22:55:26,656][62475] Updated weights for policy 0, policy_version 87080 (0.0006) [2023-03-06 22:55:27,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12748.8, 300 sec: 12732.3). Total num frames: 89179136. Throughput: 0: 12761.2. Samples: 89149111. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:55:27,390][62145] Avg episode reward: [(0, '510.477')] [2023-03-06 22:55:27,474][62475] Updated weights for policy 0, policy_version 87090 (0.0006) [2023-03-06 22:55:28,267][62475] Updated weights for policy 0, policy_version 87100 (0.0007) [2023-03-06 22:55:29,062][62475] Updated weights for policy 0, policy_version 87110 (0.0007) [2023-03-06 22:55:29,854][62475] Updated weights for policy 0, policy_version 87120 (0.0006) [2023-03-06 22:55:30,650][62475] Updated weights for policy 0, policy_version 87130 (0.0006) [2023-03-06 22:55:31,458][62475] Updated weights for policy 0, policy_version 87140 (0.0006) [2023-03-06 22:55:32,248][62475] Updated weights for policy 0, policy_version 87150 (0.0006) [2023-03-06 22:55:32,390][62145] Fps is (10 sec: 12800.1, 60 sec: 12748.8, 300 sec: 12732.3). Total num frames: 89242624. Throughput: 0: 12763.4. Samples: 89225668. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:55:32,390][62145] Avg episode reward: [(0, '583.886')] [2023-03-06 22:55:33,056][62475] Updated weights for policy 0, policy_version 87160 (0.0007) [2023-03-06 22:55:33,876][62475] Updated weights for policy 0, policy_version 87170 (0.0006) [2023-03-06 22:55:34,666][62475] Updated weights for policy 0, policy_version 87180 (0.0006) [2023-03-06 22:55:35,474][62475] Updated weights for policy 0, policy_version 87190 (0.0006) [2023-03-06 22:55:36,272][62475] Updated weights for policy 0, policy_version 87200 (0.0006) [2023-03-06 22:55:37,089][62475] Updated weights for policy 0, policy_version 87210 (0.0006) [2023-03-06 22:55:37,390][62145] Fps is (10 sec: 12799.8, 60 sec: 12765.9, 300 sec: 12735.8). Total num frames: 89307136. Throughput: 0: 12762.5. Samples: 89302183. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:55:37,401][62145] Avg episode reward: [(0, '627.287')] [2023-03-06 22:55:37,878][62475] Updated weights for policy 0, policy_version 87220 (0.0006) [2023-03-06 22:55:38,676][62475] Updated weights for policy 0, policy_version 87230 (0.0009) [2023-03-06 22:55:39,462][62475] Updated weights for policy 0, policy_version 87240 (0.0007) [2023-03-06 22:55:40,293][62475] Updated weights for policy 0, policy_version 87250 (0.0006) [2023-03-06 22:55:41,077][62475] Updated weights for policy 0, policy_version 87260 (0.0007) [2023-03-06 22:55:41,889][62475] Updated weights for policy 0, policy_version 87270 (0.0006) [2023-03-06 22:55:42,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12765.9, 300 sec: 12735.8). Total num frames: 89370624. Throughput: 0: 12768.6. Samples: 89340567. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:55:42,401][62145] Avg episode reward: [(0, '466.768')] [2023-03-06 22:55:42,682][62475] Updated weights for policy 0, policy_version 87280 (0.0006) [2023-03-06 22:55:43,481][62475] Updated weights for policy 0, policy_version 87290 (0.0006) [2023-03-06 22:55:44,298][62475] Updated weights for policy 0, policy_version 87300 (0.0006) [2023-03-06 22:55:45,085][62475] Updated weights for policy 0, policy_version 87310 (0.0007) [2023-03-06 22:55:45,894][62475] Updated weights for policy 0, policy_version 87320 (0.0006) [2023-03-06 22:55:46,713][62475] Updated weights for policy 0, policy_version 87330 (0.0006) [2023-03-06 22:55:47,390][62145] Fps is (10 sec: 12697.7, 60 sec: 12765.9, 300 sec: 12735.8). Total num frames: 89434112. Throughput: 0: 12766.7. Samples: 89417216. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:55:47,390][62145] Avg episode reward: [(0, '515.164')] [2023-03-06 22:55:47,502][62475] Updated weights for policy 0, policy_version 87340 (0.0006) [2023-03-06 22:55:48,322][62475] Updated weights for policy 0, policy_version 87350 (0.0006) [2023-03-06 22:55:49,117][62475] Updated weights for policy 0, policy_version 87360 (0.0006) [2023-03-06 22:55:49,921][62475] Updated weights for policy 0, policy_version 87370 (0.0007) [2023-03-06 22:55:50,710][62475] Updated weights for policy 0, policy_version 87380 (0.0007) [2023-03-06 22:55:51,507][62475] Updated weights for policy 0, policy_version 87390 (0.0007) [2023-03-06 22:55:52,309][62475] Updated weights for policy 0, policy_version 87400 (0.0006) [2023-03-06 22:55:52,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12765.9, 300 sec: 12735.8). Total num frames: 89497600. Throughput: 0: 12777.5. Samples: 89493671. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:55:52,390][62145] Avg episode reward: [(0, '538.266')] [2023-03-06 22:55:53,114][62475] Updated weights for policy 0, policy_version 87410 (0.0007) [2023-03-06 22:55:53,917][62475] Updated weights for policy 0, policy_version 87420 (0.0006) [2023-03-06 22:55:54,726][62475] Updated weights for policy 0, policy_version 87430 (0.0006) [2023-03-06 22:55:55,539][62475] Updated weights for policy 0, policy_version 87440 (0.0006) [2023-03-06 22:55:56,329][62475] Updated weights for policy 0, policy_version 87450 (0.0006) [2023-03-06 22:55:57,130][62475] Updated weights for policy 0, policy_version 87460 (0.0006) [2023-03-06 22:55:57,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12765.9, 300 sec: 12735.8). Total num frames: 89562112. Throughput: 0: 12777.7. Samples: 89531930. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:55:57,390][62145] Avg episode reward: [(0, '684.000')] [2023-03-06 22:55:57,951][62475] Updated weights for policy 0, policy_version 87470 (0.0007) [2023-03-06 22:55:58,768][62475] Updated weights for policy 0, policy_version 87480 (0.0006) [2023-03-06 22:55:59,570][62475] Updated weights for policy 0, policy_version 87490 (0.0006) [2023-03-06 22:56:00,376][62475] Updated weights for policy 0, policy_version 87500 (0.0006) [2023-03-06 22:56:01,188][62475] Updated weights for policy 0, policy_version 87510 (0.0006) [2023-03-06 22:56:02,000][62475] Updated weights for policy 0, policy_version 87520 (0.0006) [2023-03-06 22:56:02,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12748.8, 300 sec: 12732.3). Total num frames: 89624576. Throughput: 0: 12762.2. Samples: 89608047. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:56:02,390][62145] Avg episode reward: [(0, '477.187')] [2023-03-06 22:56:02,808][62475] Updated weights for policy 0, policy_version 87530 (0.0006) [2023-03-06 22:56:03,598][62475] Updated weights for policy 0, policy_version 87540 (0.0007) [2023-03-06 22:56:04,406][62475] Updated weights for policy 0, policy_version 87550 (0.0006) [2023-03-06 22:56:05,199][62475] Updated weights for policy 0, policy_version 87560 (0.0006) [2023-03-06 22:56:05,994][62475] Updated weights for policy 0, policy_version 87570 (0.0007) [2023-03-06 22:56:06,803][62475] Updated weights for policy 0, policy_version 87580 (0.0006) [2023-03-06 22:56:07,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12765.9, 300 sec: 12735.8). Total num frames: 89689088. Throughput: 0: 12752.8. Samples: 89684452. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:56:07,390][62145] Avg episode reward: [(0, '418.157')] [2023-03-06 22:56:07,618][62475] Updated weights for policy 0, policy_version 87590 (0.0006) [2023-03-06 22:56:08,425][62475] Updated weights for policy 0, policy_version 87600 (0.0007) [2023-03-06 22:56:09,250][62475] Updated weights for policy 0, policy_version 87610 (0.0006) [2023-03-06 22:56:10,040][62475] Updated weights for policy 0, policy_version 87620 (0.0007) [2023-03-06 22:56:10,831][62475] Updated weights for policy 0, policy_version 87630 (0.0006) [2023-03-06 22:56:11,652][62475] Updated weights for policy 0, policy_version 87640 (0.0006) [2023-03-06 22:56:12,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12765.8, 300 sec: 12732.3). Total num frames: 89752576. Throughput: 0: 12742.9. Samples: 89722544. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:56:12,390][62145] Avg episode reward: [(0, '581.325')] [2023-03-06 22:56:12,451][62475] Updated weights for policy 0, policy_version 87650 (0.0007) [2023-03-06 22:56:13,244][62475] Updated weights for policy 0, policy_version 87660 (0.0006) [2023-03-06 22:56:14,047][62475] Updated weights for policy 0, policy_version 87670 (0.0006) [2023-03-06 22:56:14,874][62475] Updated weights for policy 0, policy_version 87680 (0.0006) [2023-03-06 22:56:15,667][62475] Updated weights for policy 0, policy_version 87690 (0.0006) [2023-03-06 22:56:16,472][62475] Updated weights for policy 0, policy_version 87700 (0.0005) [2023-03-06 22:56:17,274][62475] Updated weights for policy 0, policy_version 87710 (0.0006) [2023-03-06 22:56:17,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12748.8, 300 sec: 12732.3). Total num frames: 89816064. Throughput: 0: 12737.0. Samples: 89798833. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:56:17,390][62145] Avg episode reward: [(0, '451.182')] [2023-03-06 22:56:18,083][62475] Updated weights for policy 0, policy_version 87720 (0.0006) [2023-03-06 22:56:18,890][62475] Updated weights for policy 0, policy_version 87730 (0.0006) [2023-03-06 22:56:19,706][62475] Updated weights for policy 0, policy_version 87740 (0.0006) [2023-03-06 22:56:20,491][62475] Updated weights for policy 0, policy_version 87750 (0.0006) [2023-03-06 22:56:21,293][62475] Updated weights for policy 0, policy_version 87760 (0.0006) [2023-03-06 22:56:22,093][62475] Updated weights for policy 0, policy_version 87770 (0.0006) [2023-03-06 22:56:22,389][62145] Fps is (10 sec: 12697.8, 60 sec: 12748.8, 300 sec: 12732.3). Total num frames: 89879552. Throughput: 0: 12740.3. Samples: 89875495. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:56:22,390][62145] Avg episode reward: [(0, '604.335')] [2023-03-06 22:56:22,393][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000087773_89879552.pth... [2023-03-06 22:56:22,424][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000084788_86822912.pth [2023-03-06 22:56:22,902][62475] Updated weights for policy 0, policy_version 87780 (0.0007) [2023-03-06 22:56:23,711][62475] Updated weights for policy 0, policy_version 87790 (0.0007) [2023-03-06 22:56:24,502][62475] Updated weights for policy 0, policy_version 87800 (0.0007) [2023-03-06 22:56:25,302][62475] Updated weights for policy 0, policy_version 87810 (0.0006) [2023-03-06 22:56:26,099][62475] Updated weights for policy 0, policy_version 87820 (0.0007) [2023-03-06 22:56:26,926][62475] Updated weights for policy 0, policy_version 87830 (0.0006) [2023-03-06 22:56:27,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12731.7, 300 sec: 12732.3). Total num frames: 89943040. Throughput: 0: 12733.0. Samples: 89913551. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:56:27,390][62145] Avg episode reward: [(0, '611.781')] [2023-03-06 22:56:27,717][62475] Updated weights for policy 0, policy_version 87840 (0.0008) [2023-03-06 22:56:28,530][62475] Updated weights for policy 0, policy_version 87850 (0.0006) [2023-03-06 22:56:29,318][62475] Updated weights for policy 0, policy_version 87860 (0.0006) [2023-03-06 22:56:30,108][62475] Updated weights for policy 0, policy_version 87870 (0.0006) [2023-03-06 22:56:30,928][62475] Updated weights for policy 0, policy_version 87880 (0.0006) [2023-03-06 22:56:31,738][62475] Updated weights for policy 0, policy_version 87890 (0.0006) [2023-03-06 22:56:32,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12748.8, 300 sec: 12735.8). Total num frames: 90007552. Throughput: 0: 12735.0. Samples: 89990292. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:56:32,390][62145] Avg episode reward: [(0, '540.404')] [2023-03-06 22:56:32,533][62475] Updated weights for policy 0, policy_version 87900 (0.0006) [2023-03-06 22:56:33,324][62475] Updated weights for policy 0, policy_version 87910 (0.0006) [2023-03-06 22:56:34,118][62475] Updated weights for policy 0, policy_version 87920 (0.0006) [2023-03-06 22:56:34,934][62475] Updated weights for policy 0, policy_version 87930 (0.0006) [2023-03-06 22:56:35,719][62475] Updated weights for policy 0, policy_version 87940 (0.0006) [2023-03-06 22:56:36,538][62475] Updated weights for policy 0, policy_version 87950 (0.0007) [2023-03-06 22:56:37,337][62475] Updated weights for policy 0, policy_version 87960 (0.0007) [2023-03-06 22:56:37,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.8, 300 sec: 12735.8). Total num frames: 90071040. Throughput: 0: 12738.6. Samples: 90066906. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:56:37,390][62145] Avg episode reward: [(0, '454.471')] [2023-03-06 22:56:38,162][62475] Updated weights for policy 0, policy_version 87970 (0.0007) [2023-03-06 22:56:38,962][62475] Updated weights for policy 0, policy_version 87980 (0.0006) [2023-03-06 22:56:39,757][62475] Updated weights for policy 0, policy_version 87990 (0.0007) [2023-03-06 22:56:40,579][62475] Updated weights for policy 0, policy_version 88000 (0.0006) [2023-03-06 22:56:41,371][62475] Updated weights for policy 0, policy_version 88010 (0.0006) [2023-03-06 22:56:42,169][62475] Updated weights for policy 0, policy_version 88020 (0.0007) [2023-03-06 22:56:42,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12731.7, 300 sec: 12735.8). Total num frames: 90134528. Throughput: 0: 12732.5. Samples: 90104891. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:56:42,390][62145] Avg episode reward: [(0, '580.779')] [2023-03-06 22:56:42,984][62475] Updated weights for policy 0, policy_version 88030 (0.0007) [2023-03-06 22:56:43,790][62475] Updated weights for policy 0, policy_version 88040 (0.0007) [2023-03-06 22:56:44,586][62475] Updated weights for policy 0, policy_version 88050 (0.0006) [2023-03-06 22:56:45,390][62475] Updated weights for policy 0, policy_version 88060 (0.0006) [2023-03-06 22:56:46,191][62475] Updated weights for policy 0, policy_version 88070 (0.0006) [2023-03-06 22:56:46,993][62475] Updated weights for policy 0, policy_version 88080 (0.0007) [2023-03-06 22:56:47,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12732.3). Total num frames: 90198016. Throughput: 0: 12742.0. Samples: 90181437. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:56:47,390][62145] Avg episode reward: [(0, '553.505')] [2023-03-06 22:56:47,787][62475] Updated weights for policy 0, policy_version 88090 (0.0006) [2023-03-06 22:56:48,589][62475] Updated weights for policy 0, policy_version 88100 (0.0006) [2023-03-06 22:56:49,399][62475] Updated weights for policy 0, policy_version 88110 (0.0006) [2023-03-06 22:56:50,190][62475] Updated weights for policy 0, policy_version 88120 (0.0005) [2023-03-06 22:56:51,008][62475] Updated weights for policy 0, policy_version 88130 (0.0007) [2023-03-06 22:56:51,801][62475] Updated weights for policy 0, policy_version 88140 (0.0007) [2023-03-06 22:56:52,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12735.8). Total num frames: 90262528. Throughput: 0: 12744.1. Samples: 90257937. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:56:52,390][62145] Avg episode reward: [(0, '604.116')] [2023-03-06 22:56:52,608][62475] Updated weights for policy 0, policy_version 88150 (0.0006) [2023-03-06 22:56:53,403][62475] Updated weights for policy 0, policy_version 88160 (0.0006) [2023-03-06 22:56:54,201][62475] Updated weights for policy 0, policy_version 88170 (0.0006) [2023-03-06 22:56:55,016][62475] Updated weights for policy 0, policy_version 88180 (0.0006) [2023-03-06 22:56:55,794][62475] Updated weights for policy 0, policy_version 88190 (0.0006) [2023-03-06 22:56:56,604][62475] Updated weights for policy 0, policy_version 88200 (0.0006) [2023-03-06 22:56:57,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12735.8). Total num frames: 90326016. Throughput: 0: 12752.3. Samples: 90296395. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:56:57,390][62145] Avg episode reward: [(0, '606.524')] [2023-03-06 22:56:57,429][62475] Updated weights for policy 0, policy_version 88210 (0.0005) [2023-03-06 22:56:58,217][62475] Updated weights for policy 0, policy_version 88220 (0.0006) [2023-03-06 22:56:59,034][62475] Updated weights for policy 0, policy_version 88230 (0.0006) [2023-03-06 22:56:59,827][62475] Updated weights for policy 0, policy_version 88240 (0.0007) [2023-03-06 22:57:00,628][62475] Updated weights for policy 0, policy_version 88250 (0.0006) [2023-03-06 22:57:01,456][62475] Updated weights for policy 0, policy_version 88260 (0.0007) [2023-03-06 22:57:02,257][62475] Updated weights for policy 0, policy_version 88270 (0.0006) [2023-03-06 22:57:02,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12748.8, 300 sec: 12735.8). Total num frames: 90389504. Throughput: 0: 12753.7. Samples: 90372749. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:57:02,390][62145] Avg episode reward: [(0, '592.543')] [2023-03-06 22:57:03,063][62475] Updated weights for policy 0, policy_version 88280 (0.0007) [2023-03-06 22:57:03,870][62475] Updated weights for policy 0, policy_version 88290 (0.0006) [2023-03-06 22:57:04,670][62475] Updated weights for policy 0, policy_version 88300 (0.0006) [2023-03-06 22:57:05,470][62475] Updated weights for policy 0, policy_version 88310 (0.0006) [2023-03-06 22:57:06,277][62475] Updated weights for policy 0, policy_version 88320 (0.0006) [2023-03-06 22:57:07,092][62475] Updated weights for policy 0, policy_version 88330 (0.0007) [2023-03-06 22:57:07,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12735.8). Total num frames: 90452992. Throughput: 0: 12741.9. Samples: 90448879. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:57:07,390][62145] Avg episode reward: [(0, '432.719')] [2023-03-06 22:57:07,905][62475] Updated weights for policy 0, policy_version 88340 (0.0007) [2023-03-06 22:57:08,698][62475] Updated weights for policy 0, policy_version 88350 (0.0006) [2023-03-06 22:57:09,493][62475] Updated weights for policy 0, policy_version 88360 (0.0005) [2023-03-06 22:57:10,311][62475] Updated weights for policy 0, policy_version 88370 (0.0006) [2023-03-06 22:57:11,118][62475] Updated weights for policy 0, policy_version 88380 (0.0006) [2023-03-06 22:57:11,913][62475] Updated weights for policy 0, policy_version 88390 (0.0006) [2023-03-06 22:57:12,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12748.8, 300 sec: 12739.3). Total num frames: 90517504. Throughput: 0: 12747.2. Samples: 90487173. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:57:12,390][62145] Avg episode reward: [(0, '515.697')] [2023-03-06 22:57:12,699][62475] Updated weights for policy 0, policy_version 88400 (0.0006) [2023-03-06 22:57:13,516][62475] Updated weights for policy 0, policy_version 88410 (0.0006) [2023-03-06 22:57:14,309][62475] Updated weights for policy 0, policy_version 88420 (0.0006) [2023-03-06 22:57:15,109][62475] Updated weights for policy 0, policy_version 88430 (0.0006) [2023-03-06 22:57:15,919][62475] Updated weights for policy 0, policy_version 88440 (0.0006) [2023-03-06 22:57:16,728][62475] Updated weights for policy 0, policy_version 88450 (0.0007) [2023-03-06 22:57:17,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12735.8). Total num frames: 90579968. Throughput: 0: 12742.5. Samples: 90563703. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:57:17,390][62145] Avg episode reward: [(0, '645.025')] [2023-03-06 22:57:17,566][62475] Updated weights for policy 0, policy_version 88460 (0.0007) [2023-03-06 22:57:18,345][62475] Updated weights for policy 0, policy_version 88470 (0.0006) [2023-03-06 22:57:19,162][62475] Updated weights for policy 0, policy_version 88480 (0.0006) [2023-03-06 22:57:19,963][62475] Updated weights for policy 0, policy_version 88490 (0.0006) [2023-03-06 22:57:20,767][62475] Updated weights for policy 0, policy_version 88500 (0.0006) [2023-03-06 22:57:21,569][62475] Updated weights for policy 0, policy_version 88510 (0.0006) [2023-03-06 22:57:22,361][62475] Updated weights for policy 0, policy_version 88520 (0.0007) [2023-03-06 22:57:22,389][62145] Fps is (10 sec: 12697.5, 60 sec: 12748.8, 300 sec: 12739.3). Total num frames: 90644480. Throughput: 0: 12733.2. Samples: 90639902. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:57:22,390][62145] Avg episode reward: [(0, '619.227')] [2023-03-06 22:57:23,156][62475] Updated weights for policy 0, policy_version 88530 (0.0007) [2023-03-06 22:57:23,957][62475] Updated weights for policy 0, policy_version 88540 (0.0006) [2023-03-06 22:57:24,762][62475] Updated weights for policy 0, policy_version 88550 (0.0006) [2023-03-06 22:57:25,574][62475] Updated weights for policy 0, policy_version 88560 (0.0007) [2023-03-06 22:57:26,372][62475] Updated weights for policy 0, policy_version 88570 (0.0006) [2023-03-06 22:57:27,180][62475] Updated weights for policy 0, policy_version 88580 (0.0006) [2023-03-06 22:57:27,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12735.8). Total num frames: 90707968. Throughput: 0: 12743.9. Samples: 90678368. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:57:27,390][62145] Avg episode reward: [(0, '647.813')] [2023-03-06 22:57:27,987][62475] Updated weights for policy 0, policy_version 88590 (0.0007) [2023-03-06 22:57:28,780][62475] Updated weights for policy 0, policy_version 88600 (0.0006) [2023-03-06 22:57:29,604][62475] Updated weights for policy 0, policy_version 88610 (0.0006) [2023-03-06 22:57:30,407][62475] Updated weights for policy 0, policy_version 88620 (0.0007) [2023-03-06 22:57:31,205][62475] Updated weights for policy 0, policy_version 88630 (0.0006) [2023-03-06 22:57:32,023][62475] Updated weights for policy 0, policy_version 88640 (0.0006) [2023-03-06 22:57:32,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12731.8, 300 sec: 12739.3). Total num frames: 90771456. Throughput: 0: 12738.4. Samples: 90754662. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:57:32,401][62145] Avg episode reward: [(0, '595.107')] [2023-03-06 22:57:32,814][62475] Updated weights for policy 0, policy_version 88650 (0.0005) [2023-03-06 22:57:33,605][62475] Updated weights for policy 0, policy_version 88660 (0.0006) [2023-03-06 22:57:34,440][62475] Updated weights for policy 0, policy_version 88670 (0.0006) [2023-03-06 22:57:35,223][62475] Updated weights for policy 0, policy_version 88680 (0.0007) [2023-03-06 22:57:36,034][62475] Updated weights for policy 0, policy_version 88690 (0.0006) [2023-03-06 22:57:36,844][62475] Updated weights for policy 0, policy_version 88700 (0.0007) [2023-03-06 22:57:37,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12739.3). Total num frames: 90834944. Throughput: 0: 12732.3. Samples: 90830891. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:57:37,390][62145] Avg episode reward: [(0, '516.808')] [2023-03-06 22:57:37,659][62475] Updated weights for policy 0, policy_version 88710 (0.0006) [2023-03-06 22:57:38,463][62475] Updated weights for policy 0, policy_version 88720 (0.0006) [2023-03-06 22:57:39,276][62475] Updated weights for policy 0, policy_version 88730 (0.0006) [2023-03-06 22:57:40,062][62475] Updated weights for policy 0, policy_version 88740 (0.0006) [2023-03-06 22:57:40,855][62475] Updated weights for policy 0, policy_version 88750 (0.0006) [2023-03-06 22:57:41,677][62475] Updated weights for policy 0, policy_version 88760 (0.0006) [2023-03-06 22:57:42,389][62145] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12735.8). Total num frames: 90898432. Throughput: 0: 12725.1. Samples: 90869022. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:57:42,390][62145] Avg episode reward: [(0, '684.379')] [2023-03-06 22:57:42,487][62475] Updated weights for policy 0, policy_version 88770 (0.0006) [2023-03-06 22:57:43,286][62475] Updated weights for policy 0, policy_version 88780 (0.0007) [2023-03-06 22:57:44,082][62475] Updated weights for policy 0, policy_version 88790 (0.0006) [2023-03-06 22:57:44,889][62475] Updated weights for policy 0, policy_version 88800 (0.0007) [2023-03-06 22:57:45,678][62475] Updated weights for policy 0, policy_version 88810 (0.0006) [2023-03-06 22:57:46,508][62475] Updated weights for policy 0, policy_version 88820 (0.0006) [2023-03-06 22:57:47,308][62475] Updated weights for policy 0, policy_version 88830 (0.0006) [2023-03-06 22:57:47,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12731.8, 300 sec: 12739.3). Total num frames: 90961920. Throughput: 0: 12724.9. Samples: 90945370. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:57:47,390][62145] Avg episode reward: [(0, '555.116')] [2023-03-06 22:57:48,109][62475] Updated weights for policy 0, policy_version 88840 (0.0006) [2023-03-06 22:57:48,917][62475] Updated weights for policy 0, policy_version 88850 (0.0007) [2023-03-06 22:57:49,725][62475] Updated weights for policy 0, policy_version 88860 (0.0006) [2023-03-06 22:57:50,518][62475] Updated weights for policy 0, policy_version 88870 (0.0006) [2023-03-06 22:57:51,324][62475] Updated weights for policy 0, policy_version 88880 (0.0007) [2023-03-06 22:57:52,129][62475] Updated weights for policy 0, policy_version 88890 (0.0006) [2023-03-06 22:57:52,390][62145] Fps is (10 sec: 12799.8, 60 sec: 12731.7, 300 sec: 12739.2). Total num frames: 91026432. Throughput: 0: 12731.9. Samples: 91021814. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:57:52,390][62145] Avg episode reward: [(0, '503.671')] [2023-03-06 22:57:52,941][62475] Updated weights for policy 0, policy_version 88900 (0.0006) [2023-03-06 22:57:53,745][62475] Updated weights for policy 0, policy_version 88910 (0.0007) [2023-03-06 22:57:54,550][62475] Updated weights for policy 0, policy_version 88920 (0.0007) [2023-03-06 22:57:55,334][62475] Updated weights for policy 0, policy_version 88930 (0.0006) [2023-03-06 22:57:56,130][62475] Updated weights for policy 0, policy_version 88940 (0.0006) [2023-03-06 22:57:56,965][62475] Updated weights for policy 0, policy_version 88950 (0.0006) [2023-03-06 22:57:57,390][62145] Fps is (10 sec: 12799.8, 60 sec: 12731.7, 300 sec: 12739.2). Total num frames: 91089920. Throughput: 0: 12732.5. Samples: 91060136. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:57:57,390][62145] Avg episode reward: [(0, '687.499')] [2023-03-06 22:57:57,770][62475] Updated weights for policy 0, policy_version 88960 (0.0006) [2023-03-06 22:57:58,552][62475] Updated weights for policy 0, policy_version 88970 (0.0006) [2023-03-06 22:57:59,380][62475] Updated weights for policy 0, policy_version 88980 (0.0006) [2023-03-06 22:58:00,180][62475] Updated weights for policy 0, policy_version 88990 (0.0007) [2023-03-06 22:58:00,985][62475] Updated weights for policy 0, policy_version 89000 (0.0006) [2023-03-06 22:58:01,785][62475] Updated weights for policy 0, policy_version 89010 (0.0006) [2023-03-06 22:58:02,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12731.7, 300 sec: 12739.3). Total num frames: 91153408. Throughput: 0: 12728.8. Samples: 91136497. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:58:02,390][62145] Avg episode reward: [(0, '536.190')] [2023-03-06 22:58:02,585][62475] Updated weights for policy 0, policy_version 89020 (0.0006) [2023-03-06 22:58:03,377][62475] Updated weights for policy 0, policy_version 89030 (0.0007) [2023-03-06 22:58:04,181][62475] Updated weights for policy 0, policy_version 89040 (0.0006) [2023-03-06 22:58:04,993][62475] Updated weights for policy 0, policy_version 89050 (0.0006) [2023-03-06 22:58:05,796][62475] Updated weights for policy 0, policy_version 89060 (0.0006) [2023-03-06 22:58:06,598][62475] Updated weights for policy 0, policy_version 89070 (0.0006) [2023-03-06 22:58:07,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12739.3). Total num frames: 91216896. Throughput: 0: 12730.4. Samples: 91212769. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:58:07,390][62145] Avg episode reward: [(0, '669.510')] [2023-03-06 22:58:07,417][62475] Updated weights for policy 0, policy_version 89080 (0.0006) [2023-03-06 22:58:08,239][62475] Updated weights for policy 0, policy_version 89090 (0.0008) [2023-03-06 22:58:09,019][62475] Updated weights for policy 0, policy_version 89100 (0.0007) [2023-03-06 22:58:09,845][62475] Updated weights for policy 0, policy_version 89110 (0.0006) [2023-03-06 22:58:10,650][62475] Updated weights for policy 0, policy_version 89120 (0.0007) [2023-03-06 22:58:11,449][62475] Updated weights for policy 0, policy_version 89130 (0.0007) [2023-03-06 22:58:12,258][62475] Updated weights for policy 0, policy_version 89140 (0.0006) [2023-03-06 22:58:12,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12735.8). Total num frames: 91280384. Throughput: 0: 12721.4. Samples: 91250829. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:58:12,390][62145] Avg episode reward: [(0, '539.018')] [2023-03-06 22:58:13,073][62475] Updated weights for policy 0, policy_version 89150 (0.0006) [2023-03-06 22:58:13,861][62475] Updated weights for policy 0, policy_version 89160 (0.0006) [2023-03-06 22:58:14,674][62475] Updated weights for policy 0, policy_version 89170 (0.0007) [2023-03-06 22:58:15,466][62475] Updated weights for policy 0, policy_version 89180 (0.0007) [2023-03-06 22:58:16,273][62475] Updated weights for policy 0, policy_version 89190 (0.0006) [2023-03-06 22:58:17,086][62475] Updated weights for policy 0, policy_version 89200 (0.0006) [2023-03-06 22:58:17,390][62145] Fps is (10 sec: 12697.7, 60 sec: 12731.7, 300 sec: 12735.8). Total num frames: 91343872. Throughput: 0: 12720.6. Samples: 91327090. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:58:17,390][62145] Avg episode reward: [(0, '441.080')] [2023-03-06 22:58:17,881][62475] Updated weights for policy 0, policy_version 89210 (0.0006) [2023-03-06 22:58:18,677][62475] Updated weights for policy 0, policy_version 89220 (0.0006) [2023-03-06 22:58:19,486][62475] Updated weights for policy 0, policy_version 89230 (0.0006) [2023-03-06 22:58:20,266][62475] Updated weights for policy 0, policy_version 89240 (0.0006) [2023-03-06 22:58:21,076][62475] Updated weights for policy 0, policy_version 89250 (0.0006) [2023-03-06 22:58:21,886][62475] Updated weights for policy 0, policy_version 89260 (0.0006) [2023-03-06 22:58:22,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12739.3). Total num frames: 91408384. Throughput: 0: 12733.2. Samples: 91403883. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:58:22,390][62145] Avg episode reward: [(0, '467.617')] [2023-03-06 22:58:22,394][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000089266_91408384.pth... [2023-03-06 22:58:22,426][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000086280_88350720.pth [2023-03-06 22:58:22,686][62475] Updated weights for policy 0, policy_version 89270 (0.0007) [2023-03-06 22:58:23,499][62475] Updated weights for policy 0, policy_version 89280 (0.0006) [2023-03-06 22:58:24,283][62475] Updated weights for policy 0, policy_version 89290 (0.0006) [2023-03-06 22:58:25,095][62475] Updated weights for policy 0, policy_version 89300 (0.0005) [2023-03-06 22:58:25,880][62475] Updated weights for policy 0, policy_version 89310 (0.0006) [2023-03-06 22:58:26,681][62475] Updated weights for policy 0, policy_version 89320 (0.0006) [2023-03-06 22:58:27,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12731.7, 300 sec: 12735.8). Total num frames: 91471872. Throughput: 0: 12737.6. Samples: 91442212. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:58:27,390][62145] Avg episode reward: [(0, '465.863')] [2023-03-06 22:58:27,496][62475] Updated weights for policy 0, policy_version 89330 (0.0007) [2023-03-06 22:58:28,293][62475] Updated weights for policy 0, policy_version 89340 (0.0007) [2023-03-06 22:58:29,087][62475] Updated weights for policy 0, policy_version 89350 (0.0006) [2023-03-06 22:58:29,890][62475] Updated weights for policy 0, policy_version 89360 (0.0006) [2023-03-06 22:58:30,707][62475] Updated weights for policy 0, policy_version 89370 (0.0006) [2023-03-06 22:58:31,505][62475] Updated weights for policy 0, policy_version 89380 (0.0007) [2023-03-06 22:58:32,304][62475] Updated weights for policy 0, policy_version 89390 (0.0006) [2023-03-06 22:58:32,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12739.3). Total num frames: 91536384. Throughput: 0: 12742.9. Samples: 91518801. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:58:32,390][62145] Avg episode reward: [(0, '374.055')] [2023-03-06 22:58:33,109][62475] Updated weights for policy 0, policy_version 89400 (0.0006) [2023-03-06 22:58:33,917][62475] Updated weights for policy 0, policy_version 89410 (0.0006) [2023-03-06 22:58:34,722][62475] Updated weights for policy 0, policy_version 89420 (0.0006) [2023-03-06 22:58:35,521][62475] Updated weights for policy 0, policy_version 89430 (0.0006) [2023-03-06 22:58:36,341][62475] Updated weights for policy 0, policy_version 89440 (0.0007) [2023-03-06 22:58:37,146][62475] Updated weights for policy 0, policy_version 89450 (0.0006) [2023-03-06 22:58:37,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12748.8, 300 sec: 12739.3). Total num frames: 91599872. Throughput: 0: 12738.2. Samples: 91595031. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:58:37,390][62145] Avg episode reward: [(0, '407.165')] [2023-03-06 22:58:37,953][62475] Updated weights for policy 0, policy_version 89460 (0.0006) [2023-03-06 22:58:38,748][62475] Updated weights for policy 0, policy_version 89470 (0.0006) [2023-03-06 22:58:39,574][62475] Updated weights for policy 0, policy_version 89480 (0.0006) [2023-03-06 22:58:40,356][62475] Updated weights for policy 0, policy_version 89490 (0.0007) [2023-03-06 22:58:41,160][62475] Updated weights for policy 0, policy_version 89500 (0.0007) [2023-03-06 22:58:41,977][62475] Updated weights for policy 0, policy_version 89510 (0.0006) [2023-03-06 22:58:42,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12748.8, 300 sec: 12739.3). Total num frames: 91663360. Throughput: 0: 12734.6. Samples: 91633191. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:58:42,390][62145] Avg episode reward: [(0, '516.812')] [2023-03-06 22:58:42,795][62475] Updated weights for policy 0, policy_version 89520 (0.0006) [2023-03-06 22:58:43,596][62475] Updated weights for policy 0, policy_version 89530 (0.0006) [2023-03-06 22:58:44,396][62475] Updated weights for policy 0, policy_version 89540 (0.0006) [2023-03-06 22:58:45,197][62475] Updated weights for policy 0, policy_version 89550 (0.0006) [2023-03-06 22:58:46,003][62475] Updated weights for policy 0, policy_version 89560 (0.0006) [2023-03-06 22:58:46,799][62475] Updated weights for policy 0, policy_version 89570 (0.0007) [2023-03-06 22:58:47,390][62145] Fps is (10 sec: 12697.7, 60 sec: 12748.8, 300 sec: 12739.3). Total num frames: 91726848. Throughput: 0: 12733.4. Samples: 91709498. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:58:47,390][62145] Avg episode reward: [(0, '481.873')] [2023-03-06 22:58:47,610][62475] Updated weights for policy 0, policy_version 89580 (0.0006) [2023-03-06 22:58:48,403][62475] Updated weights for policy 0, policy_version 89590 (0.0006) [2023-03-06 22:58:49,195][62475] Updated weights for policy 0, policy_version 89600 (0.0006) [2023-03-06 22:58:49,997][62475] Updated weights for policy 0, policy_version 89610 (0.0007) [2023-03-06 22:58:50,798][62475] Updated weights for policy 0, policy_version 89620 (0.0006) [2023-03-06 22:58:51,591][62475] Updated weights for policy 0, policy_version 89630 (0.0007) [2023-03-06 22:58:52,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.8, 300 sec: 12739.3). Total num frames: 91790336. Throughput: 0: 12747.9. Samples: 91786424. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:58:52,390][62145] Avg episode reward: [(0, '545.982')] [2023-03-06 22:58:52,399][62475] Updated weights for policy 0, policy_version 89640 (0.0006) [2023-03-06 22:58:53,206][62475] Updated weights for policy 0, policy_version 89650 (0.0006) [2023-03-06 22:58:54,002][62475] Updated weights for policy 0, policy_version 89660 (0.0006) [2023-03-06 22:58:54,805][62475] Updated weights for policy 0, policy_version 89670 (0.0006) [2023-03-06 22:58:55,603][62475] Updated weights for policy 0, policy_version 89680 (0.0006) [2023-03-06 22:58:56,404][62475] Updated weights for policy 0, policy_version 89690 (0.0006) [2023-03-06 22:58:57,205][62475] Updated weights for policy 0, policy_version 89700 (0.0006) [2023-03-06 22:58:57,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12742.7). Total num frames: 91854848. Throughput: 0: 12754.0. Samples: 91824759. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:58:57,390][62145] Avg episode reward: [(0, '745.911')] [2023-03-06 22:58:58,019][62475] Updated weights for policy 0, policy_version 89710 (0.0006) [2023-03-06 22:58:58,812][62475] Updated weights for policy 0, policy_version 89720 (0.0006) [2023-03-06 22:58:59,607][62475] Updated weights for policy 0, policy_version 89730 (0.0006) [2023-03-06 22:59:00,439][62475] Updated weights for policy 0, policy_version 89740 (0.0006) [2023-03-06 22:59:01,240][62475] Updated weights for policy 0, policy_version 89750 (0.0006) [2023-03-06 22:59:02,062][62475] Updated weights for policy 0, policy_version 89760 (0.0007) [2023-03-06 22:59:02,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12742.7). Total num frames: 91918336. Throughput: 0: 12757.4. Samples: 91901173. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:59:02,390][62145] Avg episode reward: [(0, '643.505')] [2023-03-06 22:59:02,858][62475] Updated weights for policy 0, policy_version 89770 (0.0006) [2023-03-06 22:59:03,667][62475] Updated weights for policy 0, policy_version 89780 (0.0006) [2023-03-06 22:59:04,441][62475] Updated weights for policy 0, policy_version 89790 (0.0008) [2023-03-06 22:59:05,269][62475] Updated weights for policy 0, policy_version 89800 (0.0006) [2023-03-06 22:59:06,063][62475] Updated weights for policy 0, policy_version 89810 (0.0006) [2023-03-06 22:59:06,848][62475] Updated weights for policy 0, policy_version 89820 (0.0006) [2023-03-06 22:59:07,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12748.8, 300 sec: 12742.7). Total num frames: 91981824. Throughput: 0: 12747.7. Samples: 91977528. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:59:07,390][62145] Avg episode reward: [(0, '548.114')] [2023-03-06 22:59:07,667][62475] Updated weights for policy 0, policy_version 89830 (0.0006) [2023-03-06 22:59:08,473][62475] Updated weights for policy 0, policy_version 89840 (0.0006) [2023-03-06 22:59:09,292][62475] Updated weights for policy 0, policy_version 89850 (0.0006) [2023-03-06 22:59:10,085][62475] Updated weights for policy 0, policy_version 89860 (0.0006) [2023-03-06 22:59:10,881][62475] Updated weights for policy 0, policy_version 89870 (0.0007) [2023-03-06 22:59:11,690][62475] Updated weights for policy 0, policy_version 89880 (0.0007) [2023-03-06 22:59:12,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12748.8, 300 sec: 12742.7). Total num frames: 92045312. Throughput: 0: 12745.0. Samples: 92015737. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:59:12,390][62145] Avg episode reward: [(0, '797.966')] [2023-03-06 22:59:12,479][62475] Updated weights for policy 0, policy_version 89890 (0.0006) [2023-03-06 22:59:13,310][62475] Updated weights for policy 0, policy_version 89900 (0.0007) [2023-03-06 22:59:14,099][62475] Updated weights for policy 0, policy_version 89910 (0.0006) [2023-03-06 22:59:14,917][62475] Updated weights for policy 0, policy_version 89920 (0.0006) [2023-03-06 22:59:15,733][62475] Updated weights for policy 0, policy_version 89930 (0.0006) [2023-03-06 22:59:16,511][62475] Updated weights for policy 0, policy_version 89940 (0.0006) [2023-03-06 22:59:17,337][62475] Updated weights for policy 0, policy_version 89950 (0.0006) [2023-03-06 22:59:17,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12748.8, 300 sec: 12739.3). Total num frames: 92108800. Throughput: 0: 12737.6. Samples: 92091990. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:59:17,390][62145] Avg episode reward: [(0, '795.740')] [2023-03-06 22:59:18,130][62475] Updated weights for policy 0, policy_version 89960 (0.0007) [2023-03-06 22:59:18,938][62475] Updated weights for policy 0, policy_version 89970 (0.0006) [2023-03-06 22:59:19,738][62475] Updated weights for policy 0, policy_version 89980 (0.0006) [2023-03-06 22:59:20,537][62475] Updated weights for policy 0, policy_version 89990 (0.0006) [2023-03-06 22:59:21,348][62475] Updated weights for policy 0, policy_version 90000 (0.0006) [2023-03-06 22:59:22,166][62475] Updated weights for policy 0, policy_version 90010 (0.0006) [2023-03-06 22:59:22,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12748.8, 300 sec: 12742.7). Total num frames: 92173312. Throughput: 0: 12738.2. Samples: 92168247. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:59:22,400][62145] Avg episode reward: [(0, '809.760')] [2023-03-06 22:59:22,974][62475] Updated weights for policy 0, policy_version 90020 (0.0006) [2023-03-06 22:59:23,786][62475] Updated weights for policy 0, policy_version 90030 (0.0007) [2023-03-06 22:59:24,605][62475] Updated weights for policy 0, policy_version 90040 (0.0006) [2023-03-06 22:59:25,405][62475] Updated weights for policy 0, policy_version 90050 (0.0006) [2023-03-06 22:59:26,224][62475] Updated weights for policy 0, policy_version 90060 (0.0006) [2023-03-06 22:59:27,038][62475] Updated weights for policy 0, policy_version 90070 (0.0006) [2023-03-06 22:59:27,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12739.3). Total num frames: 92235776. Throughput: 0: 12733.5. Samples: 92206198. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:59:27,390][62145] Avg episode reward: [(0, '691.815')] [2023-03-06 22:59:27,833][62475] Updated weights for policy 0, policy_version 90080 (0.0006) [2023-03-06 22:59:28,618][62475] Updated weights for policy 0, policy_version 90090 (0.0006) [2023-03-06 22:59:29,422][62475] Updated weights for policy 0, policy_version 90100 (0.0006) [2023-03-06 22:59:30,225][62475] Updated weights for policy 0, policy_version 90110 (0.0006) [2023-03-06 22:59:31,039][62475] Updated weights for policy 0, policy_version 90120 (0.0007) [2023-03-06 22:59:31,837][62475] Updated weights for policy 0, policy_version 90130 (0.0006) [2023-03-06 22:59:32,389][62145] Fps is (10 sec: 12595.1, 60 sec: 12714.7, 300 sec: 12739.3). Total num frames: 92299264. Throughput: 0: 12731.2. Samples: 92282400. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:59:32,390][62145] Avg episode reward: [(0, '876.885')] [2023-03-06 22:59:32,639][62475] Updated weights for policy 0, policy_version 90140 (0.0006) [2023-03-06 22:59:33,446][62475] Updated weights for policy 0, policy_version 90150 (0.0006) [2023-03-06 22:59:34,277][62475] Updated weights for policy 0, policy_version 90160 (0.0006) [2023-03-06 22:59:35,072][62475] Updated weights for policy 0, policy_version 90170 (0.0006) [2023-03-06 22:59:35,874][62475] Updated weights for policy 0, policy_version 90180 (0.0006) [2023-03-06 22:59:36,683][62475] Updated weights for policy 0, policy_version 90190 (0.0006) [2023-03-06 22:59:37,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12739.3). Total num frames: 92362752. Throughput: 0: 12718.2. Samples: 92358744. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:59:37,390][62145] Avg episode reward: [(0, '609.997')] [2023-03-06 22:59:37,485][62475] Updated weights for policy 0, policy_version 90200 (0.0006) [2023-03-06 22:59:38,280][62475] Updated weights for policy 0, policy_version 90210 (0.0006) [2023-03-06 22:59:39,091][62475] Updated weights for policy 0, policy_version 90220 (0.0007) [2023-03-06 22:59:39,886][62475] Updated weights for policy 0, policy_version 90230 (0.0007) [2023-03-06 22:59:40,685][62475] Updated weights for policy 0, policy_version 90240 (0.0006) [2023-03-06 22:59:41,481][62475] Updated weights for policy 0, policy_version 90250 (0.0006) [2023-03-06 22:59:42,277][62475] Updated weights for policy 0, policy_version 90260 (0.0006) [2023-03-06 22:59:42,390][62145] Fps is (10 sec: 12799.8, 60 sec: 12731.7, 300 sec: 12742.7). Total num frames: 92427264. Throughput: 0: 12718.5. Samples: 92397092. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:59:42,390][62145] Avg episode reward: [(0, '615.242')] [2023-03-06 22:59:43,085][62475] Updated weights for policy 0, policy_version 90270 (0.0006) [2023-03-06 22:59:43,878][62475] Updated weights for policy 0, policy_version 90280 (0.0006) [2023-03-06 22:59:44,686][62475] Updated weights for policy 0, policy_version 90290 (0.0006) [2023-03-06 22:59:45,485][62475] Updated weights for policy 0, policy_version 90300 (0.0006) [2023-03-06 22:59:46,300][62475] Updated weights for policy 0, policy_version 90310 (0.0006) [2023-03-06 22:59:47,101][62475] Updated weights for policy 0, policy_version 90320 (0.0006) [2023-03-06 22:59:47,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12742.7). Total num frames: 92490752. Throughput: 0: 12728.4. Samples: 92473950. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 22:59:47,390][62145] Avg episode reward: [(0, '720.612')] [2023-03-06 22:59:47,909][62475] Updated weights for policy 0, policy_version 90330 (0.0006) [2023-03-06 22:59:48,702][62475] Updated weights for policy 0, policy_version 90340 (0.0006) [2023-03-06 22:59:49,527][62475] Updated weights for policy 0, policy_version 90350 (0.0006) [2023-03-06 22:59:50,317][62475] Updated weights for policy 0, policy_version 90360 (0.0006) [2023-03-06 22:59:51,101][62475] Updated weights for policy 0, policy_version 90370 (0.0006) [2023-03-06 22:59:51,916][62475] Updated weights for policy 0, policy_version 90380 (0.0006) [2023-03-06 22:59:52,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12739.3). Total num frames: 92554240. Throughput: 0: 12731.1. Samples: 92550428. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:59:52,390][62145] Avg episode reward: [(0, '816.033')] [2023-03-06 22:59:52,724][62475] Updated weights for policy 0, policy_version 90390 (0.0006) [2023-03-06 22:59:53,513][62475] Updated weights for policy 0, policy_version 90400 (0.0006) [2023-03-06 22:59:54,350][62475] Updated weights for policy 0, policy_version 90410 (0.0006) [2023-03-06 22:59:55,174][62475] Updated weights for policy 0, policy_version 90420 (0.0007) [2023-03-06 22:59:55,960][62475] Updated weights for policy 0, policy_version 90430 (0.0006) [2023-03-06 22:59:56,762][62475] Updated weights for policy 0, policy_version 90440 (0.0006) [2023-03-06 22:59:57,390][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12739.3). Total num frames: 92617728. Throughput: 0: 12722.6. Samples: 92588256. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 22:59:57,390][62145] Avg episode reward: [(0, '802.338')] [2023-03-06 22:59:57,559][62475] Updated weights for policy 0, policy_version 90450 (0.0006) [2023-03-06 22:59:58,374][62475] Updated weights for policy 0, policy_version 90460 (0.0006) [2023-03-06 22:59:59,165][62475] Updated weights for policy 0, policy_version 90470 (0.0006) [2023-03-06 22:59:59,974][62475] Updated weights for policy 0, policy_version 90480 (0.0006) [2023-03-06 23:00:00,766][62475] Updated weights for policy 0, policy_version 90490 (0.0006) [2023-03-06 23:00:01,567][62475] Updated weights for policy 0, policy_version 90500 (0.0007) [2023-03-06 23:00:02,384][62475] Updated weights for policy 0, policy_version 90510 (0.0006) [2023-03-06 23:00:02,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12742.7). Total num frames: 92682240. Throughput: 0: 12729.3. Samples: 92664808. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:00:02,390][62145] Avg episode reward: [(0, '539.600')] [2023-03-06 23:00:03,173][62475] Updated weights for policy 0, policy_version 90520 (0.0007) [2023-03-06 23:00:03,982][62475] Updated weights for policy 0, policy_version 90530 (0.0007) [2023-03-06 23:00:04,786][62475] Updated weights for policy 0, policy_version 90540 (0.0007) [2023-03-06 23:00:05,607][62475] Updated weights for policy 0, policy_version 90550 (0.0006) [2023-03-06 23:00:06,419][62475] Updated weights for policy 0, policy_version 90560 (0.0005) [2023-03-06 23:00:07,228][62475] Updated weights for policy 0, policy_version 90570 (0.0007) [2023-03-06 23:00:07,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12731.7, 300 sec: 12742.7). Total num frames: 92745728. Throughput: 0: 12725.7. Samples: 92740906. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:00:07,390][62145] Avg episode reward: [(0, '827.191')] [2023-03-06 23:00:08,038][62475] Updated weights for policy 0, policy_version 90580 (0.0006) [2023-03-06 23:00:08,847][62475] Updated weights for policy 0, policy_version 90590 (0.0006) [2023-03-06 23:00:09,660][62475] Updated weights for policy 0, policy_version 90600 (0.0006) [2023-03-06 23:00:10,467][62475] Updated weights for policy 0, policy_version 90610 (0.0008) [2023-03-06 23:00:11,259][62475] Updated weights for policy 0, policy_version 90620 (0.0006) [2023-03-06 23:00:12,082][62475] Updated weights for policy 0, policy_version 90630 (0.0006) [2023-03-06 23:00:12,390][62145] Fps is (10 sec: 12595.3, 60 sec: 12714.7, 300 sec: 12735.8). Total num frames: 92808192. Throughput: 0: 12725.7. Samples: 92778854. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:00:12,390][62145] Avg episode reward: [(0, '677.300')] [2023-03-06 23:00:12,893][62475] Updated weights for policy 0, policy_version 90640 (0.0006) [2023-03-06 23:00:13,694][62475] Updated weights for policy 0, policy_version 90650 (0.0006) [2023-03-06 23:00:14,502][62475] Updated weights for policy 0, policy_version 90660 (0.0007) [2023-03-06 23:00:15,305][62475] Updated weights for policy 0, policy_version 90670 (0.0006) [2023-03-06 23:00:16,104][62475] Updated weights for policy 0, policy_version 90680 (0.0006) [2023-03-06 23:00:16,916][62475] Updated weights for policy 0, policy_version 90690 (0.0006) [2023-03-06 23:00:17,389][62145] Fps is (10 sec: 12595.2, 60 sec: 12714.7, 300 sec: 12735.8). Total num frames: 92871680. Throughput: 0: 12727.1. Samples: 92855120. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:00:17,390][62145] Avg episode reward: [(0, '762.851')] [2023-03-06 23:00:17,732][62475] Updated weights for policy 0, policy_version 90700 (0.0007) [2023-03-06 23:00:18,541][62475] Updated weights for policy 0, policy_version 90710 (0.0007) [2023-03-06 23:00:19,347][62475] Updated weights for policy 0, policy_version 90720 (0.0006) [2023-03-06 23:00:20,133][62475] Updated weights for policy 0, policy_version 90730 (0.0007) [2023-03-06 23:00:20,941][62475] Updated weights for policy 0, policy_version 90740 (0.0005) [2023-03-06 23:00:21,774][62475] Updated weights for policy 0, policy_version 90750 (0.0006) [2023-03-06 23:00:22,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12732.3). Total num frames: 92935168. Throughput: 0: 12719.3. Samples: 92931110. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:00:22,390][62145] Avg episode reward: [(0, '836.975')] [2023-03-06 23:00:22,394][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000090757_92935168.pth... [2023-03-06 23:00:22,425][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000087773_89879552.pth [2023-03-06 23:00:22,588][62475] Updated weights for policy 0, policy_version 90760 (0.0007) [2023-03-06 23:00:23,381][62475] Updated weights for policy 0, policy_version 90770 (0.0006) [2023-03-06 23:00:24,211][62475] Updated weights for policy 0, policy_version 90780 (0.0007) [2023-03-06 23:00:25,004][62475] Updated weights for policy 0, policy_version 90790 (0.0007) [2023-03-06 23:00:25,805][62475] Updated weights for policy 0, policy_version 90800 (0.0006) [2023-03-06 23:00:26,619][62475] Updated weights for policy 0, policy_version 90810 (0.0006) [2023-03-06 23:00:27,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12732.3). Total num frames: 92998656. Throughput: 0: 12711.8. Samples: 92969120. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:00:27,400][62145] Avg episode reward: [(0, '759.080')] [2023-03-06 23:00:27,432][62475] Updated weights for policy 0, policy_version 90820 (0.0006) [2023-03-06 23:00:28,231][62475] Updated weights for policy 0, policy_version 90830 (0.0007) [2023-03-06 23:00:29,037][62475] Updated weights for policy 0, policy_version 90840 (0.0006) [2023-03-06 23:00:29,860][62475] Updated weights for policy 0, policy_version 90850 (0.0006) [2023-03-06 23:00:30,649][62475] Updated weights for policy 0, policy_version 90860 (0.0006) [2023-03-06 23:00:31,444][62475] Updated weights for policy 0, policy_version 90870 (0.0005) [2023-03-06 23:00:32,256][62475] Updated weights for policy 0, policy_version 90880 (0.0006) [2023-03-06 23:00:32,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12728.8). Total num frames: 93062144. Throughput: 0: 12696.6. Samples: 93045296. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:00:32,400][62145] Avg episode reward: [(0, '837.779')] [2023-03-06 23:00:33,053][62475] Updated weights for policy 0, policy_version 90890 (0.0006) [2023-03-06 23:00:33,846][62475] Updated weights for policy 0, policy_version 90900 (0.0006) [2023-03-06 23:00:34,664][62475] Updated weights for policy 0, policy_version 90910 (0.0006) [2023-03-06 23:00:35,476][62475] Updated weights for policy 0, policy_version 90920 (0.0007) [2023-03-06 23:00:36,267][62475] Updated weights for policy 0, policy_version 90930 (0.0006) [2023-03-06 23:00:37,080][62475] Updated weights for policy 0, policy_version 90940 (0.0007) [2023-03-06 23:00:37,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12728.8). Total num frames: 93125632. Throughput: 0: 12693.4. Samples: 93121629. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:00:37,400][62145] Avg episode reward: [(0, '779.470')] [2023-03-06 23:00:37,882][62475] Updated weights for policy 0, policy_version 90950 (0.0007) [2023-03-06 23:00:38,687][62475] Updated weights for policy 0, policy_version 90960 (0.0006) [2023-03-06 23:00:39,514][62475] Updated weights for policy 0, policy_version 90970 (0.0006) [2023-03-06 23:00:40,303][62475] Updated weights for policy 0, policy_version 90980 (0.0006) [2023-03-06 23:00:41,110][62475] Updated weights for policy 0, policy_version 90990 (0.0007) [2023-03-06 23:00:41,911][62475] Updated weights for policy 0, policy_version 91000 (0.0005) [2023-03-06 23:00:42,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12728.8). Total num frames: 93189120. Throughput: 0: 12699.7. Samples: 93159740. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:00:42,400][62145] Avg episode reward: [(0, '849.872')] [2023-03-06 23:00:42,739][62475] Updated weights for policy 0, policy_version 91010 (0.0006) [2023-03-06 23:00:43,515][62475] Updated weights for policy 0, policy_version 91020 (0.0006) [2023-03-06 23:00:44,305][62475] Updated weights for policy 0, policy_version 91030 (0.0007) [2023-03-06 23:00:45,129][62475] Updated weights for policy 0, policy_version 91040 (0.0006) [2023-03-06 23:00:45,949][62475] Updated weights for policy 0, policy_version 91050 (0.0006) [2023-03-06 23:00:46,749][62475] Updated weights for policy 0, policy_version 91060 (0.0006) [2023-03-06 23:00:47,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12728.8). Total num frames: 93252608. Throughput: 0: 12695.6. Samples: 93236109. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:00:47,401][62145] Avg episode reward: [(0, '767.304')] [2023-03-06 23:00:47,572][62475] Updated weights for policy 0, policy_version 91070 (0.0006) [2023-03-06 23:00:48,355][62475] Updated weights for policy 0, policy_version 91080 (0.0006) [2023-03-06 23:00:49,164][62475] Updated weights for policy 0, policy_version 91090 (0.0007) [2023-03-06 23:00:49,978][62475] Updated weights for policy 0, policy_version 91100 (0.0006) [2023-03-06 23:00:50,777][62475] Updated weights for policy 0, policy_version 91110 (0.0006) [2023-03-06 23:00:51,624][62475] Updated weights for policy 0, policy_version 91120 (0.0007) [2023-03-06 23:00:52,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12725.4). Total num frames: 93316096. Throughput: 0: 12693.2. Samples: 93312101. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:00:52,390][62145] Avg episode reward: [(0, '821.908')] [2023-03-06 23:00:52,397][62475] Updated weights for policy 0, policy_version 91130 (0.0006) [2023-03-06 23:00:53,204][62475] Updated weights for policy 0, policy_version 91140 (0.0006) [2023-03-06 23:00:54,009][62475] Updated weights for policy 0, policy_version 91150 (0.0007) [2023-03-06 23:00:54,822][62475] Updated weights for policy 0, policy_version 91160 (0.0005) [2023-03-06 23:00:55,617][62475] Updated weights for policy 0, policy_version 91170 (0.0006) [2023-03-06 23:00:56,433][62475] Updated weights for policy 0, policy_version 91180 (0.0006) [2023-03-06 23:00:57,229][62475] Updated weights for policy 0, policy_version 91190 (0.0007) [2023-03-06 23:00:57,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12728.8). Total num frames: 93379584. Throughput: 0: 12698.6. Samples: 93350289. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:00:57,390][62145] Avg episode reward: [(0, '701.622')] [2023-03-06 23:00:58,041][62475] Updated weights for policy 0, policy_version 91200 (0.0007) [2023-03-06 23:00:58,850][62475] Updated weights for policy 0, policy_version 91210 (0.0007) [2023-03-06 23:00:59,669][62475] Updated weights for policy 0, policy_version 91220 (0.0006) [2023-03-06 23:01:00,484][62475] Updated weights for policy 0, policy_version 91230 (0.0006) [2023-03-06 23:01:01,289][62475] Updated weights for policy 0, policy_version 91240 (0.0006) [2023-03-06 23:01:02,093][62475] Updated weights for policy 0, policy_version 91250 (0.0006) [2023-03-06 23:01:02,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12680.5, 300 sec: 12725.4). Total num frames: 93443072. Throughput: 0: 12690.3. Samples: 93426184. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:01:02,390][62145] Avg episode reward: [(0, '796.897')] [2023-03-06 23:01:02,910][62475] Updated weights for policy 0, policy_version 91260 (0.0006) [2023-03-06 23:01:03,708][62475] Updated weights for policy 0, policy_version 91270 (0.0006) [2023-03-06 23:01:04,510][62475] Updated weights for policy 0, policy_version 91280 (0.0006) [2023-03-06 23:01:05,322][62475] Updated weights for policy 0, policy_version 91290 (0.0006) [2023-03-06 23:01:06,123][62475] Updated weights for policy 0, policy_version 91300 (0.0006) [2023-03-06 23:01:06,937][62475] Updated weights for policy 0, policy_version 91310 (0.0006) [2023-03-06 23:01:07,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12725.4). Total num frames: 93506560. Throughput: 0: 12699.2. Samples: 93502576. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:01:07,390][62145] Avg episode reward: [(0, '803.191')] [2023-03-06 23:01:07,752][62475] Updated weights for policy 0, policy_version 91320 (0.0006) [2023-03-06 23:01:08,547][62475] Updated weights for policy 0, policy_version 91330 (0.0007) [2023-03-06 23:01:09,354][62475] Updated weights for policy 0, policy_version 91340 (0.0006) [2023-03-06 23:01:10,170][62475] Updated weights for policy 0, policy_version 91350 (0.0006) [2023-03-06 23:01:10,955][62475] Updated weights for policy 0, policy_version 91360 (0.0007) [2023-03-06 23:01:11,767][62475] Updated weights for policy 0, policy_version 91370 (0.0006) [2023-03-06 23:01:12,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12725.4). Total num frames: 93570048. Throughput: 0: 12697.3. Samples: 93540498. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:01:12,390][62145] Avg episode reward: [(0, '860.894')] [2023-03-06 23:01:12,593][62475] Updated weights for policy 0, policy_version 91380 (0.0006) [2023-03-06 23:01:13,393][62475] Updated weights for policy 0, policy_version 91390 (0.0007) [2023-03-06 23:01:14,201][62475] Updated weights for policy 0, policy_version 91400 (0.0006) [2023-03-06 23:01:15,014][62475] Updated weights for policy 0, policy_version 91410 (0.0006) [2023-03-06 23:01:15,810][62475] Updated weights for policy 0, policy_version 91420 (0.0005) [2023-03-06 23:01:16,637][62475] Updated weights for policy 0, policy_version 91430 (0.0006) [2023-03-06 23:01:17,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12725.4). Total num frames: 93633536. Throughput: 0: 12694.2. Samples: 93616538. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:01:17,390][62145] Avg episode reward: [(0, '812.832')] [2023-03-06 23:01:17,449][62475] Updated weights for policy 0, policy_version 91440 (0.0006) [2023-03-06 23:01:18,237][62475] Updated weights for policy 0, policy_version 91450 (0.0006) [2023-03-06 23:01:19,065][62475] Updated weights for policy 0, policy_version 91460 (0.0006) [2023-03-06 23:01:19,856][62475] Updated weights for policy 0, policy_version 91470 (0.0006) [2023-03-06 23:01:20,647][62475] Updated weights for policy 0, policy_version 91480 (0.0006) [2023-03-06 23:01:21,461][62475] Updated weights for policy 0, policy_version 91490 (0.0006) [2023-03-06 23:01:22,283][62475] Updated weights for policy 0, policy_version 91500 (0.0006) [2023-03-06 23:01:22,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12725.4). Total num frames: 93697024. Throughput: 0: 12686.6. Samples: 93692526. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:01:22,390][62145] Avg episode reward: [(0, '892.272')] [2023-03-06 23:01:23,077][62475] Updated weights for policy 0, policy_version 91510 (0.0006) [2023-03-06 23:01:23,897][62475] Updated weights for policy 0, policy_version 91520 (0.0006) [2023-03-06 23:01:24,711][62475] Updated weights for policy 0, policy_version 91530 (0.0006) [2023-03-06 23:01:25,531][62475] Updated weights for policy 0, policy_version 91540 (0.0006) [2023-03-06 23:01:26,326][62475] Updated weights for policy 0, policy_version 91550 (0.0007) [2023-03-06 23:01:27,133][62475] Updated weights for policy 0, policy_version 91560 (0.0007) [2023-03-06 23:01:27,390][62145] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12721.9). Total num frames: 93760512. Throughput: 0: 12685.0. Samples: 93730566. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:01:27,390][62145] Avg episode reward: [(0, '1097.543')] [2023-03-06 23:01:27,941][62475] Updated weights for policy 0, policy_version 91570 (0.0005) [2023-03-06 23:01:28,773][62475] Updated weights for policy 0, policy_version 91580 (0.0007) [2023-03-06 23:01:29,589][62475] Updated weights for policy 0, policy_version 91590 (0.0006) [2023-03-06 23:01:30,385][62475] Updated weights for policy 0, policy_version 91600 (0.0006) [2023-03-06 23:01:31,189][62475] Updated weights for policy 0, policy_version 91610 (0.0008) [2023-03-06 23:01:32,007][62475] Updated weights for policy 0, policy_version 91620 (0.0006) [2023-03-06 23:01:32,390][62145] Fps is (10 sec: 12595.3, 60 sec: 12680.5, 300 sec: 12718.4). Total num frames: 93822976. Throughput: 0: 12673.4. Samples: 93806414. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:01:32,390][62145] Avg episode reward: [(0, '690.806')] [2023-03-06 23:01:32,806][62475] Updated weights for policy 0, policy_version 91630 (0.0006) [2023-03-06 23:01:33,586][62475] Updated weights for policy 0, policy_version 91640 (0.0006) [2023-03-06 23:01:34,412][62475] Updated weights for policy 0, policy_version 91650 (0.0006) [2023-03-06 23:01:35,222][62475] Updated weights for policy 0, policy_version 91660 (0.0006) [2023-03-06 23:01:36,040][62475] Updated weights for policy 0, policy_version 91670 (0.0007) [2023-03-06 23:01:36,858][62475] Updated weights for policy 0, policy_version 91680 (0.0006) [2023-03-06 23:01:37,390][62145] Fps is (10 sec: 12595.2, 60 sec: 12680.5, 300 sec: 12718.4). Total num frames: 93886464. Throughput: 0: 12674.5. Samples: 93882455. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:01:37,390][62145] Avg episode reward: [(0, '758.889')] [2023-03-06 23:01:37,662][62475] Updated weights for policy 0, policy_version 91690 (0.0007) [2023-03-06 23:01:38,467][62475] Updated weights for policy 0, policy_version 91700 (0.0007) [2023-03-06 23:01:39,294][62475] Updated weights for policy 0, policy_version 91710 (0.0006) [2023-03-06 23:01:40,104][62475] Updated weights for policy 0, policy_version 91720 (0.0006) [2023-03-06 23:01:40,918][62475] Updated weights for policy 0, policy_version 91730 (0.0005) [2023-03-06 23:01:41,709][62475] Updated weights for policy 0, policy_version 91740 (0.0006) [2023-03-06 23:01:42,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12718.4). Total num frames: 93949952. Throughput: 0: 12662.5. Samples: 93920105. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:01:42,390][62145] Avg episode reward: [(0, '863.856')] [2023-03-06 23:01:42,521][62475] Updated weights for policy 0, policy_version 91750 (0.0006) [2023-03-06 23:01:43,318][62475] Updated weights for policy 0, policy_version 91760 (0.0006) [2023-03-06 23:01:44,117][62475] Updated weights for policy 0, policy_version 91770 (0.0006) [2023-03-06 23:01:44,932][62475] Updated weights for policy 0, policy_version 91780 (0.0006) [2023-03-06 23:01:45,722][62475] Updated weights for policy 0, policy_version 91790 (0.0006) [2023-03-06 23:01:46,520][62475] Updated weights for policy 0, policy_version 91800 (0.0006) [2023-03-06 23:01:47,336][62475] Updated weights for policy 0, policy_version 91810 (0.0006) [2023-03-06 23:01:47,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12715.0). Total num frames: 94013440. Throughput: 0: 12677.7. Samples: 93996679. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:01:47,390][62145] Avg episode reward: [(0, '613.939')] [2023-03-06 23:01:48,144][62475] Updated weights for policy 0, policy_version 91820 (0.0006) [2023-03-06 23:01:48,945][62475] Updated weights for policy 0, policy_version 91830 (0.0006) [2023-03-06 23:01:49,759][62475] Updated weights for policy 0, policy_version 91840 (0.0006) [2023-03-06 23:01:50,577][62475] Updated weights for policy 0, policy_version 91850 (0.0006) [2023-03-06 23:01:51,364][62475] Updated weights for policy 0, policy_version 91860 (0.0006) [2023-03-06 23:01:52,174][62475] Updated weights for policy 0, policy_version 91870 (0.0006) [2023-03-06 23:01:52,389][62145] Fps is (10 sec: 12697.8, 60 sec: 12680.5, 300 sec: 12715.0). Total num frames: 94076928. Throughput: 0: 12673.0. Samples: 94072861. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:01:52,390][62145] Avg episode reward: [(0, '685.255')] [2023-03-06 23:01:52,994][62475] Updated weights for policy 0, policy_version 91880 (0.0006) [2023-03-06 23:01:53,791][62475] Updated weights for policy 0, policy_version 91890 (0.0005) [2023-03-06 23:01:54,606][62475] Updated weights for policy 0, policy_version 91900 (0.0006) [2023-03-06 23:01:55,425][62475] Updated weights for policy 0, policy_version 91910 (0.0007) [2023-03-06 23:01:56,218][62475] Updated weights for policy 0, policy_version 91920 (0.0007) [2023-03-06 23:01:57,001][62475] Updated weights for policy 0, policy_version 91930 (0.0006) [2023-03-06 23:01:57,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12680.5, 300 sec: 12715.0). Total num frames: 94140416. Throughput: 0: 12672.5. Samples: 94110758. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:01:57,390][62145] Avg episode reward: [(0, '712.525')] [2023-03-06 23:01:57,830][62475] Updated weights for policy 0, policy_version 91940 (0.0006) [2023-03-06 23:01:58,628][62475] Updated weights for policy 0, policy_version 91950 (0.0007) [2023-03-06 23:01:59,434][62475] Updated weights for policy 0, policy_version 91960 (0.0006) [2023-03-06 23:02:00,238][62475] Updated weights for policy 0, policy_version 91970 (0.0006) [2023-03-06 23:02:01,029][62475] Updated weights for policy 0, policy_version 91980 (0.0005) [2023-03-06 23:02:01,817][62475] Updated weights for policy 0, policy_version 91990 (0.0006) [2023-03-06 23:02:02,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12715.0). Total num frames: 94203904. Throughput: 0: 12682.6. Samples: 94187254. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:02:02,390][62145] Avg episode reward: [(0, '794.147')] [2023-03-06 23:02:02,612][62475] Updated weights for policy 0, policy_version 92000 (0.0007) [2023-03-06 23:02:03,425][62475] Updated weights for policy 0, policy_version 92010 (0.0006) [2023-03-06 23:02:04,230][62475] Updated weights for policy 0, policy_version 92020 (0.0006) [2023-03-06 23:02:05,032][62475] Updated weights for policy 0, policy_version 92030 (0.0006) [2023-03-06 23:02:05,843][62475] Updated weights for policy 0, policy_version 92040 (0.0006) [2023-03-06 23:02:06,650][62475] Updated weights for policy 0, policy_version 92050 (0.0006) [2023-03-06 23:02:07,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12697.6, 300 sec: 12715.0). Total num frames: 94268416. Throughput: 0: 12693.1. Samples: 94263716. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:02:07,390][62145] Avg episode reward: [(0, '845.302')] [2023-03-06 23:02:07,472][62475] Updated weights for policy 0, policy_version 92060 (0.0006) [2023-03-06 23:02:08,279][62475] Updated weights for policy 0, policy_version 92070 (0.0006) [2023-03-06 23:02:09,077][62475] Updated weights for policy 0, policy_version 92080 (0.0007) [2023-03-06 23:02:09,889][62475] Updated weights for policy 0, policy_version 92090 (0.0006) [2023-03-06 23:02:10,696][62475] Updated weights for policy 0, policy_version 92100 (0.0006) [2023-03-06 23:02:11,510][62475] Updated weights for policy 0, policy_version 92110 (0.0006) [2023-03-06 23:02:12,346][62475] Updated weights for policy 0, policy_version 92120 (0.0006) [2023-03-06 23:02:12,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12715.0). Total num frames: 94330880. Throughput: 0: 12690.0. Samples: 94301616. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:02:12,390][62145] Avg episode reward: [(0, '801.790')] [2023-03-06 23:02:13,156][62475] Updated weights for policy 0, policy_version 92130 (0.0006) [2023-03-06 23:02:13,952][62475] Updated weights for policy 0, policy_version 92140 (0.0008) [2023-03-06 23:02:14,760][62475] Updated weights for policy 0, policy_version 92150 (0.0006) [2023-03-06 23:02:15,587][62475] Updated weights for policy 0, policy_version 92160 (0.0006) [2023-03-06 23:02:16,386][62475] Updated weights for policy 0, policy_version 92170 (0.0007) [2023-03-06 23:02:17,183][62475] Updated weights for policy 0, policy_version 92180 (0.0006) [2023-03-06 23:02:17,390][62145] Fps is (10 sec: 12595.2, 60 sec: 12680.5, 300 sec: 12711.5). Total num frames: 94394368. Throughput: 0: 12689.3. Samples: 94377434. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:02:17,390][62145] Avg episode reward: [(0, '901.158')] [2023-03-06 23:02:17,975][62475] Updated weights for policy 0, policy_version 92190 (0.0006) [2023-03-06 23:02:18,786][62475] Updated weights for policy 0, policy_version 92200 (0.0006) [2023-03-06 23:02:19,597][62475] Updated weights for policy 0, policy_version 92210 (0.0007) [2023-03-06 23:02:20,403][62475] Updated weights for policy 0, policy_version 92220 (0.0006) [2023-03-06 23:02:21,216][62475] Updated weights for policy 0, policy_version 92230 (0.0007) [2023-03-06 23:02:22,020][62475] Updated weights for policy 0, policy_version 92240 (0.0006) [2023-03-06 23:02:22,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12680.6, 300 sec: 12711.5). Total num frames: 94457856. Throughput: 0: 12694.0. Samples: 94453683. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:02:22,390][62145] Avg episode reward: [(0, '852.368')] [2023-03-06 23:02:22,394][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000092244_94457856.pth... [2023-03-06 23:02:22,425][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000089266_91408384.pth [2023-03-06 23:02:22,845][62475] Updated weights for policy 0, policy_version 92250 (0.0006) [2023-03-06 23:02:23,643][62475] Updated weights for policy 0, policy_version 92260 (0.0006) [2023-03-06 23:02:24,442][62475] Updated weights for policy 0, policy_version 92270 (0.0007) [2023-03-06 23:02:25,252][62475] Updated weights for policy 0, policy_version 92280 (0.0006) [2023-03-06 23:02:26,059][62475] Updated weights for policy 0, policy_version 92290 (0.0006) [2023-03-06 23:02:26,860][62475] Updated weights for policy 0, policy_version 92300 (0.0006) [2023-03-06 23:02:27,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12680.5, 300 sec: 12711.5). Total num frames: 94521344. Throughput: 0: 12703.2. Samples: 94491748. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:02:27,390][62145] Avg episode reward: [(0, '851.394')] [2023-03-06 23:02:27,667][62475] Updated weights for policy 0, policy_version 92310 (0.0006) [2023-03-06 23:02:28,474][62475] Updated weights for policy 0, policy_version 92320 (0.0006) [2023-03-06 23:02:29,287][62475] Updated weights for policy 0, policy_version 92330 (0.0006) [2023-03-06 23:02:30,100][62475] Updated weights for policy 0, policy_version 92340 (0.0006) [2023-03-06 23:02:30,907][62475] Updated weights for policy 0, policy_version 92350 (0.0006) [2023-03-06 23:02:31,710][62475] Updated weights for policy 0, policy_version 92360 (0.0006) [2023-03-06 23:02:32,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12711.5). Total num frames: 94584832. Throughput: 0: 12689.5. Samples: 94567708. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:02:32,390][62145] Avg episode reward: [(0, '767.490')] [2023-03-06 23:02:32,520][62475] Updated weights for policy 0, policy_version 92370 (0.0006) [2023-03-06 23:02:33,319][62475] Updated weights for policy 0, policy_version 92380 (0.0006) [2023-03-06 23:02:34,124][62475] Updated weights for policy 0, policy_version 92390 (0.0006) [2023-03-06 23:02:34,938][62475] Updated weights for policy 0, policy_version 92400 (0.0007) [2023-03-06 23:02:35,747][62475] Updated weights for policy 0, policy_version 92410 (0.0006) [2023-03-06 23:02:36,558][62475] Updated weights for policy 0, policy_version 92420 (0.0006) [2023-03-06 23:02:37,347][62475] Updated weights for policy 0, policy_version 92430 (0.0006) [2023-03-06 23:02:37,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12711.5). Total num frames: 94648320. Throughput: 0: 12691.1. Samples: 94643963. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:02:37,390][62145] Avg episode reward: [(0, '926.989')] [2023-03-06 23:02:38,159][62475] Updated weights for policy 0, policy_version 92440 (0.0006) [2023-03-06 23:02:38,958][62475] Updated weights for policy 0, policy_version 92450 (0.0006) [2023-03-06 23:02:39,773][62475] Updated weights for policy 0, policy_version 92460 (0.0005) [2023-03-06 23:02:40,587][62475] Updated weights for policy 0, policy_version 92470 (0.0006) [2023-03-06 23:02:41,377][62475] Updated weights for policy 0, policy_version 92480 (0.0006) [2023-03-06 23:02:42,189][62475] Updated weights for policy 0, policy_version 92490 (0.0006) [2023-03-06 23:02:42,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12711.5). Total num frames: 94711808. Throughput: 0: 12694.5. Samples: 94682011. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:02:42,390][62145] Avg episode reward: [(0, '746.640')] [2023-03-06 23:02:42,990][62475] Updated weights for policy 0, policy_version 92500 (0.0006) [2023-03-06 23:02:43,782][62475] Updated weights for policy 0, policy_version 92510 (0.0006) [2023-03-06 23:02:44,603][62475] Updated weights for policy 0, policy_version 92520 (0.0006) [2023-03-06 23:02:45,412][62475] Updated weights for policy 0, policy_version 92530 (0.0006) [2023-03-06 23:02:46,224][62475] Updated weights for policy 0, policy_version 92540 (0.0007) [2023-03-06 23:02:47,037][62475] Updated weights for policy 0, policy_version 92550 (0.0007) [2023-03-06 23:02:47,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12708.0). Total num frames: 94775296. Throughput: 0: 12686.6. Samples: 94758150. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:02:47,390][62145] Avg episode reward: [(0, '765.208')] [2023-03-06 23:02:47,866][62475] Updated weights for policy 0, policy_version 92560 (0.0006) [2023-03-06 23:02:48,667][62475] Updated weights for policy 0, policy_version 92570 (0.0007) [2023-03-06 23:02:49,459][62475] Updated weights for policy 0, policy_version 92580 (0.0006) [2023-03-06 23:02:50,282][62475] Updated weights for policy 0, policy_version 92590 (0.0006) [2023-03-06 23:02:51,076][62475] Updated weights for policy 0, policy_version 92600 (0.0006) [2023-03-06 23:02:51,882][62475] Updated weights for policy 0, policy_version 92610 (0.0006) [2023-03-06 23:02:52,389][62145] Fps is (10 sec: 12697.8, 60 sec: 12697.6, 300 sec: 12708.0). Total num frames: 94838784. Throughput: 0: 12679.5. Samples: 94834293. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:02:52,390][62145] Avg episode reward: [(0, '836.721')] [2023-03-06 23:02:52,694][62475] Updated weights for policy 0, policy_version 92620 (0.0006) [2023-03-06 23:02:53,496][62475] Updated weights for policy 0, policy_version 92630 (0.0007) [2023-03-06 23:02:54,297][62475] Updated weights for policy 0, policy_version 92640 (0.0006) [2023-03-06 23:02:55,102][62475] Updated weights for policy 0, policy_version 92650 (0.0006) [2023-03-06 23:02:55,897][62475] Updated weights for policy 0, policy_version 92660 (0.0006) [2023-03-06 23:02:56,701][62475] Updated weights for policy 0, policy_version 92670 (0.0006) [2023-03-06 23:02:57,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12708.0). Total num frames: 94902272. Throughput: 0: 12684.5. Samples: 94872419. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:02:57,390][62145] Avg episode reward: [(0, '825.747')] [2023-03-06 23:02:57,499][62475] Updated weights for policy 0, policy_version 92680 (0.0006) [2023-03-06 23:02:58,310][62475] Updated weights for policy 0, policy_version 92690 (0.0006) [2023-03-06 23:02:59,105][62475] Updated weights for policy 0, policy_version 92700 (0.0006) [2023-03-06 23:02:59,918][62475] Updated weights for policy 0, policy_version 92710 (0.0005) [2023-03-06 23:03:00,710][62475] Updated weights for policy 0, policy_version 92720 (0.0006) [2023-03-06 23:03:01,517][62475] Updated weights for policy 0, policy_version 92730 (0.0006) [2023-03-06 23:03:02,321][62475] Updated weights for policy 0, policy_version 92740 (0.0006) [2023-03-06 23:03:02,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12714.7, 300 sec: 12711.5). Total num frames: 94966784. Throughput: 0: 12701.1. Samples: 94948984. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:03:02,390][62145] Avg episode reward: [(0, '877.209')] [2023-03-06 23:03:03,134][62475] Updated weights for policy 0, policy_version 92750 (0.0006) [2023-03-06 23:03:03,939][62475] Updated weights for policy 0, policy_version 92760 (0.0005) [2023-03-06 23:03:04,744][62475] Updated weights for policy 0, policy_version 92770 (0.0006) [2023-03-06 23:03:05,565][62475] Updated weights for policy 0, policy_version 92780 (0.0006) [2023-03-06 23:03:06,361][62475] Updated weights for policy 0, policy_version 92790 (0.0007) [2023-03-06 23:03:07,160][62475] Updated weights for policy 0, policy_version 92800 (0.0006) [2023-03-06 23:03:07,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12680.6, 300 sec: 12708.0). Total num frames: 95029248. Throughput: 0: 12699.2. Samples: 95025149. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:03:07,390][62145] Avg episode reward: [(0, '965.089')] [2023-03-06 23:03:07,977][62475] Updated weights for policy 0, policy_version 92810 (0.0006) [2023-03-06 23:03:08,773][62475] Updated weights for policy 0, policy_version 92820 (0.0007) [2023-03-06 23:03:09,587][62475] Updated weights for policy 0, policy_version 92830 (0.0005) [2023-03-06 23:03:10,403][62475] Updated weights for policy 0, policy_version 92840 (0.0006) [2023-03-06 23:03:11,212][62475] Updated weights for policy 0, policy_version 92850 (0.0007) [2023-03-06 23:03:12,014][62475] Updated weights for policy 0, policy_version 92860 (0.0006) [2023-03-06 23:03:12,390][62145] Fps is (10 sec: 12595.1, 60 sec: 12697.6, 300 sec: 12708.0). Total num frames: 95092736. Throughput: 0: 12701.5. Samples: 95063318. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:03:12,390][62145] Avg episode reward: [(0, '834.481')] [2023-03-06 23:03:12,814][62475] Updated weights for policy 0, policy_version 92870 (0.0006) [2023-03-06 23:03:13,621][62475] Updated weights for policy 0, policy_version 92880 (0.0006) [2023-03-06 23:03:14,431][62475] Updated weights for policy 0, policy_version 92890 (0.0006) [2023-03-06 23:03:15,246][62475] Updated weights for policy 0, policy_version 92900 (0.0007) [2023-03-06 23:03:16,053][62475] Updated weights for policy 0, policy_version 92910 (0.0006) [2023-03-06 23:03:16,857][62475] Updated weights for policy 0, policy_version 92920 (0.0007) [2023-03-06 23:03:17,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12704.5). Total num frames: 95156224. Throughput: 0: 12701.8. Samples: 95139289. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:03:17,390][62145] Avg episode reward: [(0, '762.269')] [2023-03-06 23:03:17,653][62475] Updated weights for policy 0, policy_version 92930 (0.0006) [2023-03-06 23:03:18,471][62475] Updated weights for policy 0, policy_version 92940 (0.0006) [2023-03-06 23:03:19,284][62475] Updated weights for policy 0, policy_version 92950 (0.0007) [2023-03-06 23:03:20,076][62475] Updated weights for policy 0, policy_version 92960 (0.0006) [2023-03-06 23:03:20,880][62475] Updated weights for policy 0, policy_version 92970 (0.0006) [2023-03-06 23:03:21,685][62475] Updated weights for policy 0, policy_version 92980 (0.0006) [2023-03-06 23:03:22,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12714.7, 300 sec: 12708.0). Total num frames: 95220736. Throughput: 0: 12708.0. Samples: 95215822. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:03:22,390][62145] Avg episode reward: [(0, '815.769')] [2023-03-06 23:03:22,470][62475] Updated weights for policy 0, policy_version 92990 (0.0007) [2023-03-06 23:03:23,262][62475] Updated weights for policy 0, policy_version 93000 (0.0006) [2023-03-06 23:03:24,088][62475] Updated weights for policy 0, policy_version 93010 (0.0007) [2023-03-06 23:03:24,896][62475] Updated weights for policy 0, policy_version 93020 (0.0006) [2023-03-06 23:03:25,707][62475] Updated weights for policy 0, policy_version 93030 (0.0007) [2023-03-06 23:03:26,493][62475] Updated weights for policy 0, policy_version 93040 (0.0006) [2023-03-06 23:03:27,304][62475] Updated weights for policy 0, policy_version 93050 (0.0006) [2023-03-06 23:03:27,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12714.7, 300 sec: 12704.5). Total num frames: 95284224. Throughput: 0: 12709.6. Samples: 95253940. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:03:27,390][62145] Avg episode reward: [(0, '873.679')] [2023-03-06 23:03:28,124][62475] Updated weights for policy 0, policy_version 93060 (0.0006) [2023-03-06 23:03:28,927][62475] Updated weights for policy 0, policy_version 93070 (0.0006) [2023-03-06 23:03:29,743][62475] Updated weights for policy 0, policy_version 93080 (0.0006) [2023-03-06 23:03:30,545][62475] Updated weights for policy 0, policy_version 93090 (0.0006) [2023-03-06 23:03:31,344][62475] Updated weights for policy 0, policy_version 93100 (0.0006) [2023-03-06 23:03:32,155][62475] Updated weights for policy 0, policy_version 93110 (0.0006) [2023-03-06 23:03:32,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12704.5). Total num frames: 95347712. Throughput: 0: 12711.9. Samples: 95330189. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:03:32,390][62145] Avg episode reward: [(0, '903.977')] [2023-03-06 23:03:32,974][62475] Updated weights for policy 0, policy_version 93120 (0.0006) [2023-03-06 23:03:33,770][62475] Updated weights for policy 0, policy_version 93130 (0.0007) [2023-03-06 23:03:34,574][62475] Updated weights for policy 0, policy_version 93140 (0.0006) [2023-03-06 23:03:35,386][62475] Updated weights for policy 0, policy_version 93150 (0.0007) [2023-03-06 23:03:36,204][62475] Updated weights for policy 0, policy_version 93160 (0.0006) [2023-03-06 23:03:37,018][62475] Updated weights for policy 0, policy_version 93170 (0.0006) [2023-03-06 23:03:37,389][62145] Fps is (10 sec: 12595.2, 60 sec: 12697.6, 300 sec: 12701.1). Total num frames: 95410176. Throughput: 0: 12709.5. Samples: 95406221. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:03:37,390][62145] Avg episode reward: [(0, '740.410')] [2023-03-06 23:03:37,827][62475] Updated weights for policy 0, policy_version 93180 (0.0006) [2023-03-06 23:03:38,630][62475] Updated weights for policy 0, policy_version 93190 (0.0007) [2023-03-06 23:03:39,441][62475] Updated weights for policy 0, policy_version 93200 (0.0006) [2023-03-06 23:03:40,236][62475] Updated weights for policy 0, policy_version 93210 (0.0006) [2023-03-06 23:03:41,045][62475] Updated weights for policy 0, policy_version 93220 (0.0007) [2023-03-06 23:03:41,835][62475] Updated weights for policy 0, policy_version 93230 (0.0007) [2023-03-06 23:03:42,389][62145] Fps is (10 sec: 12697.8, 60 sec: 12714.7, 300 sec: 12704.5). Total num frames: 95474688. Throughput: 0: 12707.2. Samples: 95444241. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:03:42,390][62145] Avg episode reward: [(0, '901.213')] [2023-03-06 23:03:42,638][62475] Updated weights for policy 0, policy_version 93240 (0.0006) [2023-03-06 23:03:43,429][62475] Updated weights for policy 0, policy_version 93250 (0.0006) [2023-03-06 23:03:44,240][62475] Updated weights for policy 0, policy_version 93260 (0.0006) [2023-03-06 23:03:45,044][62475] Updated weights for policy 0, policy_version 93270 (0.0007) [2023-03-06 23:03:45,837][62475] Updated weights for policy 0, policy_version 93280 (0.0006) [2023-03-06 23:03:46,632][62475] Updated weights for policy 0, policy_version 93290 (0.0007) [2023-03-06 23:03:47,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12714.6, 300 sec: 12704.5). Total num frames: 95538176. Throughput: 0: 12708.6. Samples: 95520873. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:03:47,390][62145] Avg episode reward: [(0, '781.631')] [2023-03-06 23:03:47,445][62475] Updated weights for policy 0, policy_version 93300 (0.0006) [2023-03-06 23:03:48,255][62475] Updated weights for policy 0, policy_version 93310 (0.0006) [2023-03-06 23:03:49,051][62475] Updated weights for policy 0, policy_version 93320 (0.0006) [2023-03-06 23:03:49,843][62475] Updated weights for policy 0, policy_version 93330 (0.0007) [2023-03-06 23:03:50,650][62475] Updated weights for policy 0, policy_version 93340 (0.0006) [2023-03-06 23:03:51,454][62475] Updated weights for policy 0, policy_version 93350 (0.0006) [2023-03-06 23:03:52,265][62475] Updated weights for policy 0, policy_version 93360 (0.0006) [2023-03-06 23:03:52,389][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12701.1). Total num frames: 95601664. Throughput: 0: 12717.6. Samples: 95597441. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:03:52,390][62145] Avg episode reward: [(0, '686.893')] [2023-03-06 23:03:53,056][62475] Updated weights for policy 0, policy_version 93370 (0.0006) [2023-03-06 23:03:53,854][62475] Updated weights for policy 0, policy_version 93380 (0.0006) [2023-03-06 23:03:54,664][62475] Updated weights for policy 0, policy_version 93390 (0.0006) [2023-03-06 23:03:55,473][62475] Updated weights for policy 0, policy_version 93400 (0.0006) [2023-03-06 23:03:56,294][62475] Updated weights for policy 0, policy_version 93410 (0.0006) [2023-03-06 23:03:57,074][62475] Updated weights for policy 0, policy_version 93420 (0.0006) [2023-03-06 23:03:57,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12701.1). Total num frames: 95665152. Throughput: 0: 12717.4. Samples: 95635602. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:03:57,390][62145] Avg episode reward: [(0, '660.026')] [2023-03-06 23:03:57,901][62475] Updated weights for policy 0, policy_version 93430 (0.0006) [2023-03-06 23:03:58,692][62475] Updated weights for policy 0, policy_version 93440 (0.0006) [2023-03-06 23:03:59,507][62475] Updated weights for policy 0, policy_version 93450 (0.0005) [2023-03-06 23:04:00,311][62475] Updated weights for policy 0, policy_version 93460 (0.0006) [2023-03-06 23:04:01,124][62475] Updated weights for policy 0, policy_version 93470 (0.0006) [2023-03-06 23:04:01,905][62475] Updated weights for policy 0, policy_version 93480 (0.0006) [2023-03-06 23:04:02,390][62145] Fps is (10 sec: 12697.4, 60 sec: 12697.6, 300 sec: 12701.1). Total num frames: 95728640. Throughput: 0: 12725.8. Samples: 95711954. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:04:02,390][62145] Avg episode reward: [(0, '667.185')] [2023-03-06 23:04:02,742][62475] Updated weights for policy 0, policy_version 93490 (0.0006) [2023-03-06 23:04:03,545][62475] Updated weights for policy 0, policy_version 93500 (0.0006) [2023-03-06 23:04:04,353][62475] Updated weights for policy 0, policy_version 93510 (0.0006) [2023-03-06 23:04:05,170][62475] Updated weights for policy 0, policy_version 93520 (0.0006) [2023-03-06 23:04:05,953][62475] Updated weights for policy 0, policy_version 93530 (0.0006) [2023-03-06 23:04:06,763][62475] Updated weights for policy 0, policy_version 93540 (0.0006) [2023-03-06 23:04:07,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12701.1). Total num frames: 95792128. Throughput: 0: 12720.1. Samples: 95788226. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:04:07,390][62145] Avg episode reward: [(0, '704.488')] [2023-03-06 23:04:07,573][62475] Updated weights for policy 0, policy_version 93550 (0.0006) [2023-03-06 23:04:08,360][62475] Updated weights for policy 0, policy_version 93560 (0.0006) [2023-03-06 23:04:09,160][62475] Updated weights for policy 0, policy_version 93570 (0.0006) [2023-03-06 23:04:09,986][62475] Updated weights for policy 0, policy_version 93580 (0.0006) [2023-03-06 23:04:10,784][62475] Updated weights for policy 0, policy_version 93590 (0.0005) [2023-03-06 23:04:11,596][62475] Updated weights for policy 0, policy_version 93600 (0.0006) [2023-03-06 23:04:12,389][62145] Fps is (10 sec: 12697.8, 60 sec: 12714.7, 300 sec: 12701.1). Total num frames: 95855616. Throughput: 0: 12717.8. Samples: 95826241. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:04:12,390][62145] Avg episode reward: [(0, '824.732')] [2023-03-06 23:04:12,402][62475] Updated weights for policy 0, policy_version 93610 (0.0006) [2023-03-06 23:04:13,218][62475] Updated weights for policy 0, policy_version 93620 (0.0007) [2023-03-06 23:04:14,049][62475] Updated weights for policy 0, policy_version 93630 (0.0006) [2023-03-06 23:04:14,850][62475] Updated weights for policy 0, policy_version 93640 (0.0006) [2023-03-06 23:04:15,652][62475] Updated weights for policy 0, policy_version 93650 (0.0007) [2023-03-06 23:04:16,459][62475] Updated weights for policy 0, policy_version 93660 (0.0007) [2023-03-06 23:04:17,273][62475] Updated weights for policy 0, policy_version 93670 (0.0006) [2023-03-06 23:04:17,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12697.6). Total num frames: 95919104. Throughput: 0: 12710.9. Samples: 95902180. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:04:17,390][62145] Avg episode reward: [(0, '781.267')] [2023-03-06 23:04:18,079][62475] Updated weights for policy 0, policy_version 93680 (0.0006) [2023-03-06 23:04:18,879][62475] Updated weights for policy 0, policy_version 93690 (0.0006) [2023-03-06 23:04:19,693][62475] Updated weights for policy 0, policy_version 93700 (0.0006) [2023-03-06 23:04:20,493][62475] Updated weights for policy 0, policy_version 93710 (0.0006) [2023-03-06 23:04:21,300][62475] Updated weights for policy 0, policy_version 93720 (0.0006) [2023-03-06 23:04:22,097][62475] Updated weights for policy 0, policy_version 93730 (0.0006) [2023-03-06 23:04:22,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12701.1). Total num frames: 95982592. Throughput: 0: 12716.5. Samples: 95978464. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:04:22,390][62145] Avg episode reward: [(0, '803.467')] [2023-03-06 23:04:22,394][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000093733_95982592.pth... [2023-03-06 23:04:22,426][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000090757_92935168.pth [2023-03-06 23:04:22,909][62475] Updated weights for policy 0, policy_version 93740 (0.0007) [2023-03-06 23:04:23,721][62475] Updated weights for policy 0, policy_version 93750 (0.0007) [2023-03-06 23:04:24,521][62475] Updated weights for policy 0, policy_version 93760 (0.0006) [2023-03-06 23:04:25,329][62475] Updated weights for policy 0, policy_version 93770 (0.0006) [2023-03-06 23:04:26,130][62475] Updated weights for policy 0, policy_version 93780 (0.0006) [2023-03-06 23:04:26,949][62475] Updated weights for policy 0, policy_version 93790 (0.0006) [2023-03-06 23:04:27,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12701.1). Total num frames: 96046080. Throughput: 0: 12715.6. Samples: 96016443. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:04:27,390][62145] Avg episode reward: [(0, '766.704')] [2023-03-06 23:04:27,764][62475] Updated weights for policy 0, policy_version 93800 (0.0008) [2023-03-06 23:04:28,566][62475] Updated weights for policy 0, policy_version 93810 (0.0006) [2023-03-06 23:04:29,378][62475] Updated weights for policy 0, policy_version 93820 (0.0007) [2023-03-06 23:04:30,180][62475] Updated weights for policy 0, policy_version 93830 (0.0007) [2023-03-06 23:04:31,001][62475] Updated weights for policy 0, policy_version 93840 (0.0006) [2023-03-06 23:04:31,804][62475] Updated weights for policy 0, policy_version 93850 (0.0007) [2023-03-06 23:04:32,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12701.1). Total num frames: 96109568. Throughput: 0: 12699.4. Samples: 96092343. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:04:32,390][62145] Avg episode reward: [(0, '709.939')] [2023-03-06 23:04:32,610][62475] Updated weights for policy 0, policy_version 93860 (0.0006) [2023-03-06 23:04:33,418][62475] Updated weights for policy 0, policy_version 93870 (0.0006) [2023-03-06 23:04:34,224][62475] Updated weights for policy 0, policy_version 93880 (0.0006) [2023-03-06 23:04:35,013][62475] Updated weights for policy 0, policy_version 93890 (0.0006) [2023-03-06 23:04:35,823][62475] Updated weights for policy 0, policy_version 93900 (0.0007) [2023-03-06 23:04:36,616][62475] Updated weights for policy 0, policy_version 93910 (0.0006) [2023-03-06 23:04:37,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12697.6). Total num frames: 96173056. Throughput: 0: 12697.2. Samples: 96168816. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:04:37,390][62145] Avg episode reward: [(0, '684.705')] [2023-03-06 23:04:37,434][62475] Updated weights for policy 0, policy_version 93920 (0.0007) [2023-03-06 23:04:38,224][62475] Updated weights for policy 0, policy_version 93930 (0.0007) [2023-03-06 23:04:39,036][62475] Updated weights for policy 0, policy_version 93940 (0.0007) [2023-03-06 23:04:39,844][62475] Updated weights for policy 0, policy_version 93950 (0.0007) [2023-03-06 23:04:40,649][62475] Updated weights for policy 0, policy_version 93960 (0.0006) [2023-03-06 23:04:41,448][62475] Updated weights for policy 0, policy_version 93970 (0.0006) [2023-03-06 23:04:42,271][62475] Updated weights for policy 0, policy_version 93980 (0.0006) [2023-03-06 23:04:42,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12697.6). Total num frames: 96236544. Throughput: 0: 12696.2. Samples: 96206928. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:04:42,390][62145] Avg episode reward: [(0, '609.517')] [2023-03-06 23:04:43,078][62475] Updated weights for policy 0, policy_version 93990 (0.0007) [2023-03-06 23:04:43,893][62475] Updated weights for policy 0, policy_version 94000 (0.0006) [2023-03-06 23:04:44,686][62475] Updated weights for policy 0, policy_version 94010 (0.0007) [2023-03-06 23:04:45,486][62475] Updated weights for policy 0, policy_version 94020 (0.0006) [2023-03-06 23:04:46,286][62475] Updated weights for policy 0, policy_version 94030 (0.0006) [2023-03-06 23:04:47,095][62475] Updated weights for policy 0, policy_version 94040 (0.0007) [2023-03-06 23:04:47,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12697.6). Total num frames: 96300032. Throughput: 0: 12696.2. Samples: 96283279. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:04:47,390][62145] Avg episode reward: [(0, '568.699')] [2023-03-06 23:04:47,902][62475] Updated weights for policy 0, policy_version 94050 (0.0006) [2023-03-06 23:04:48,726][62475] Updated weights for policy 0, policy_version 94060 (0.0008) [2023-03-06 23:04:49,521][62475] Updated weights for policy 0, policy_version 94070 (0.0006) [2023-03-06 23:04:50,321][62475] Updated weights for policy 0, policy_version 94080 (0.0006) [2023-03-06 23:04:51,107][62475] Updated weights for policy 0, policy_version 94090 (0.0006) [2023-03-06 23:04:51,920][62475] Updated weights for policy 0, policy_version 94100 (0.0006) [2023-03-06 23:04:52,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12697.6). Total num frames: 96363520. Throughput: 0: 12696.9. Samples: 96359587. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:04:52,390][62145] Avg episode reward: [(0, '631.288')] [2023-03-06 23:04:52,724][62475] Updated weights for policy 0, policy_version 94110 (0.0007) [2023-03-06 23:04:53,519][62475] Updated weights for policy 0, policy_version 94120 (0.0006) [2023-03-06 23:04:54,330][62475] Updated weights for policy 0, policy_version 94130 (0.0006) [2023-03-06 23:04:55,140][62475] Updated weights for policy 0, policy_version 94140 (0.0006) [2023-03-06 23:04:55,951][62475] Updated weights for policy 0, policy_version 94150 (0.0006) [2023-03-06 23:04:56,758][62475] Updated weights for policy 0, policy_version 94160 (0.0007) [2023-03-06 23:04:57,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12714.7, 300 sec: 12697.6). Total num frames: 96428032. Throughput: 0: 12695.8. Samples: 96397551. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:04:57,401][62145] Avg episode reward: [(0, '596.686')] [2023-03-06 23:04:57,556][62475] Updated weights for policy 0, policy_version 94170 (0.0005) [2023-03-06 23:04:58,357][62475] Updated weights for policy 0, policy_version 94180 (0.0007) [2023-03-06 23:04:59,157][62475] Updated weights for policy 0, policy_version 94190 (0.0006) [2023-03-06 23:04:59,963][62475] Updated weights for policy 0, policy_version 94200 (0.0006) [2023-03-06 23:05:00,773][62475] Updated weights for policy 0, policy_version 94210 (0.0006) [2023-03-06 23:05:01,575][62475] Updated weights for policy 0, policy_version 94220 (0.0006) [2023-03-06 23:05:02,371][62475] Updated weights for policy 0, policy_version 94230 (0.0006) [2023-03-06 23:05:02,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12714.7, 300 sec: 12697.6). Total num frames: 96491520. Throughput: 0: 12710.9. Samples: 96474170. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:05:02,401][62145] Avg episode reward: [(0, '474.482')] [2023-03-06 23:05:03,184][62475] Updated weights for policy 0, policy_version 94240 (0.0006) [2023-03-06 23:05:03,993][62475] Updated weights for policy 0, policy_version 94250 (0.0006) [2023-03-06 23:05:04,785][62475] Updated weights for policy 0, policy_version 94260 (0.0007) [2023-03-06 23:05:05,591][62475] Updated weights for policy 0, policy_version 94270 (0.0006) [2023-03-06 23:05:06,391][62475] Updated weights for policy 0, policy_version 94280 (0.0006) [2023-03-06 23:05:07,200][62475] Updated weights for policy 0, policy_version 94290 (0.0006) [2023-03-06 23:05:07,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12701.1). Total num frames: 96555008. Throughput: 0: 12714.1. Samples: 96550600. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:05:07,390][62145] Avg episode reward: [(0, '495.079')] [2023-03-06 23:05:07,998][62475] Updated weights for policy 0, policy_version 94300 (0.0006) [2023-03-06 23:05:08,796][62475] Updated weights for policy 0, policy_version 94310 (0.0007) [2023-03-06 23:05:09,595][62475] Updated weights for policy 0, policy_version 94320 (0.0006) [2023-03-06 23:05:10,417][62475] Updated weights for policy 0, policy_version 94330 (0.0007) [2023-03-06 23:05:11,217][62475] Updated weights for policy 0, policy_version 94340 (0.0006) [2023-03-06 23:05:12,032][62475] Updated weights for policy 0, policy_version 94350 (0.0007) [2023-03-06 23:05:12,389][62145] Fps is (10 sec: 12697.8, 60 sec: 12714.7, 300 sec: 12701.1). Total num frames: 96618496. Throughput: 0: 12718.4. Samples: 96588773. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:05:12,390][62145] Avg episode reward: [(0, '672.575')] [2023-03-06 23:05:12,841][62475] Updated weights for policy 0, policy_version 94360 (0.0006) [2023-03-06 23:05:13,638][62475] Updated weights for policy 0, policy_version 94370 (0.0006) [2023-03-06 23:05:14,432][62475] Updated weights for policy 0, policy_version 94380 (0.0007) [2023-03-06 23:05:15,236][62475] Updated weights for policy 0, policy_version 94390 (0.0006) [2023-03-06 23:05:16,018][62475] Updated weights for policy 0, policy_version 94400 (0.0006) [2023-03-06 23:05:16,848][62475] Updated weights for policy 0, policy_version 94410 (0.0006) [2023-03-06 23:05:17,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12701.1). Total num frames: 96681984. Throughput: 0: 12731.8. Samples: 96665275. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:05:17,390][62145] Avg episode reward: [(0, '479.609')] [2023-03-06 23:05:17,647][62475] Updated weights for policy 0, policy_version 94420 (0.0006) [2023-03-06 23:05:18,449][62475] Updated weights for policy 0, policy_version 94430 (0.0006) [2023-03-06 23:05:19,265][62475] Updated weights for policy 0, policy_version 94440 (0.0006) [2023-03-06 23:05:20,061][62475] Updated weights for policy 0, policy_version 94450 (0.0005) [2023-03-06 23:05:20,883][62475] Updated weights for policy 0, policy_version 94460 (0.0006) [2023-03-06 23:05:21,678][62475] Updated weights for policy 0, policy_version 94470 (0.0006) [2023-03-06 23:05:22,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12701.1). Total num frames: 96745472. Throughput: 0: 12729.0. Samples: 96741621. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:05:22,390][62145] Avg episode reward: [(0, '518.828')] [2023-03-06 23:05:22,464][62475] Updated weights for policy 0, policy_version 94480 (0.0007) [2023-03-06 23:05:23,271][62475] Updated weights for policy 0, policy_version 94490 (0.0006) [2023-03-06 23:05:24,062][62475] Updated weights for policy 0, policy_version 94500 (0.0007) [2023-03-06 23:05:24,861][62475] Updated weights for policy 0, policy_version 94510 (0.0006) [2023-03-06 23:05:25,665][62475] Updated weights for policy 0, policy_version 94520 (0.0006) [2023-03-06 23:05:26,466][62475] Updated weights for policy 0, policy_version 94530 (0.0006) [2023-03-06 23:05:27,277][62475] Updated weights for policy 0, policy_version 94540 (0.0006) [2023-03-06 23:05:27,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12704.5). Total num frames: 96809984. Throughput: 0: 12734.6. Samples: 96779984. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:05:27,390][62145] Avg episode reward: [(0, '652.804')] [2023-03-06 23:05:28,077][62475] Updated weights for policy 0, policy_version 94550 (0.0006) [2023-03-06 23:05:28,898][62475] Updated weights for policy 0, policy_version 94560 (0.0006) [2023-03-06 23:05:29,698][62475] Updated weights for policy 0, policy_version 94570 (0.0006) [2023-03-06 23:05:30,493][62475] Updated weights for policy 0, policy_version 94580 (0.0006) [2023-03-06 23:05:31,297][62475] Updated weights for policy 0, policy_version 94590 (0.0006) [2023-03-06 23:05:32,112][62475] Updated weights for policy 0, policy_version 94600 (0.0006) [2023-03-06 23:05:32,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12704.5). Total num frames: 96873472. Throughput: 0: 12735.5. Samples: 96856377. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:05:32,390][62145] Avg episode reward: [(0, '524.385')] [2023-03-06 23:05:32,901][62475] Updated weights for policy 0, policy_version 94610 (0.0006) [2023-03-06 23:05:33,691][62475] Updated weights for policy 0, policy_version 94620 (0.0006) [2023-03-06 23:05:34,516][62475] Updated weights for policy 0, policy_version 94630 (0.0006) [2023-03-06 23:05:35,329][62475] Updated weights for policy 0, policy_version 94640 (0.0006) [2023-03-06 23:05:36,108][62475] Updated weights for policy 0, policy_version 94650 (0.0006) [2023-03-06 23:05:36,912][62475] Updated weights for policy 0, policy_version 94660 (0.0006) [2023-03-06 23:05:37,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12704.5). Total num frames: 96936960. Throughput: 0: 12744.7. Samples: 96933097. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:05:37,390][62145] Avg episode reward: [(0, '740.247')] [2023-03-06 23:05:37,725][62475] Updated weights for policy 0, policy_version 94670 (0.0006) [2023-03-06 23:05:38,511][62475] Updated weights for policy 0, policy_version 94680 (0.0006) [2023-03-06 23:05:39,313][62475] Updated weights for policy 0, policy_version 94690 (0.0006) [2023-03-06 23:05:40,138][62475] Updated weights for policy 0, policy_version 94700 (0.0007) [2023-03-06 23:05:40,926][62475] Updated weights for policy 0, policy_version 94710 (0.0006) [2023-03-06 23:05:41,715][62475] Updated weights for policy 0, policy_version 94720 (0.0007) [2023-03-06 23:05:42,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12708.0). Total num frames: 97001472. Throughput: 0: 12749.1. Samples: 96971259. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:05:42,390][62145] Avg episode reward: [(0, '636.340')] [2023-03-06 23:05:42,547][62475] Updated weights for policy 0, policy_version 94730 (0.0007) [2023-03-06 23:05:43,351][62475] Updated weights for policy 0, policy_version 94740 (0.0006) [2023-03-06 23:05:44,162][62475] Updated weights for policy 0, policy_version 94750 (0.0006) [2023-03-06 23:05:44,954][62475] Updated weights for policy 0, policy_version 94760 (0.0006) [2023-03-06 23:05:45,744][62475] Updated weights for policy 0, policy_version 94770 (0.0006) [2023-03-06 23:05:46,566][62475] Updated weights for policy 0, policy_version 94780 (0.0007) [2023-03-06 23:05:47,353][62475] Updated weights for policy 0, policy_version 94790 (0.0006) [2023-03-06 23:05:47,390][62145] Fps is (10 sec: 12799.8, 60 sec: 12748.8, 300 sec: 12708.0). Total num frames: 97064960. Throughput: 0: 12746.1. Samples: 97047746. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:05:47,390][62145] Avg episode reward: [(0, '413.185')] [2023-03-06 23:05:48,163][62475] Updated weights for policy 0, policy_version 94800 (0.0006) [2023-03-06 23:05:48,978][62475] Updated weights for policy 0, policy_version 94810 (0.0006) [2023-03-06 23:05:49,768][62475] Updated weights for policy 0, policy_version 94820 (0.0007) [2023-03-06 23:05:50,574][62475] Updated weights for policy 0, policy_version 94830 (0.0006) [2023-03-06 23:05:51,394][62475] Updated weights for policy 0, policy_version 94840 (0.0008) [2023-03-06 23:05:52,204][62475] Updated weights for policy 0, policy_version 94850 (0.0006) [2023-03-06 23:05:52,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12748.8, 300 sec: 12708.0). Total num frames: 97128448. Throughput: 0: 12743.5. Samples: 97124056. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:05:52,390][62145] Avg episode reward: [(0, '601.901')] [2023-03-06 23:05:53,010][62475] Updated weights for policy 0, policy_version 94860 (0.0006) [2023-03-06 23:05:53,801][62475] Updated weights for policy 0, policy_version 94870 (0.0006) [2023-03-06 23:05:54,628][62475] Updated weights for policy 0, policy_version 94880 (0.0006) [2023-03-06 23:05:55,429][62475] Updated weights for policy 0, policy_version 94890 (0.0006) [2023-03-06 23:05:56,229][62475] Updated weights for policy 0, policy_version 94900 (0.0006) [2023-03-06 23:05:57,036][62475] Updated weights for policy 0, policy_version 94910 (0.0006) [2023-03-06 23:05:57,389][62145] Fps is (10 sec: 12697.8, 60 sec: 12731.7, 300 sec: 12708.0). Total num frames: 97191936. Throughput: 0: 12737.5. Samples: 97161962. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:05:57,390][62145] Avg episode reward: [(0, '689.093')] [2023-03-06 23:05:57,835][62475] Updated weights for policy 0, policy_version 94920 (0.0006) [2023-03-06 23:05:58,619][62475] Updated weights for policy 0, policy_version 94930 (0.0007) [2023-03-06 23:05:59,451][62475] Updated weights for policy 0, policy_version 94940 (0.0006) [2023-03-06 23:06:00,233][62475] Updated weights for policy 0, policy_version 94950 (0.0007) [2023-03-06 23:06:01,036][62475] Updated weights for policy 0, policy_version 94960 (0.0006) [2023-03-06 23:06:01,844][62475] Updated weights for policy 0, policy_version 94970 (0.0005) [2023-03-06 23:06:02,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12731.8, 300 sec: 12708.0). Total num frames: 97255424. Throughput: 0: 12743.8. Samples: 97238745. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:06:02,390][62145] Avg episode reward: [(0, '687.000')] [2023-03-06 23:06:02,667][62475] Updated weights for policy 0, policy_version 94980 (0.0006) [2023-03-06 23:06:03,456][62475] Updated weights for policy 0, policy_version 94990 (0.0006) [2023-03-06 23:06:04,269][62475] Updated weights for policy 0, policy_version 95000 (0.0006) [2023-03-06 23:06:05,069][62475] Updated weights for policy 0, policy_version 95010 (0.0006) [2023-03-06 23:06:05,872][62475] Updated weights for policy 0, policy_version 95020 (0.0006) [2023-03-06 23:06:06,695][62475] Updated weights for policy 0, policy_version 95030 (0.0007) [2023-03-06 23:06:07,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12708.0). Total num frames: 97318912. Throughput: 0: 12737.0. Samples: 97314787. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:06:07,390][62145] Avg episode reward: [(0, '615.099')] [2023-03-06 23:06:07,500][62475] Updated weights for policy 0, policy_version 95040 (0.0006) [2023-03-06 23:06:08,291][62475] Updated weights for policy 0, policy_version 95050 (0.0006) [2023-03-06 23:06:09,101][62475] Updated weights for policy 0, policy_version 95060 (0.0007) [2023-03-06 23:06:09,898][62475] Updated weights for policy 0, policy_version 95070 (0.0006) [2023-03-06 23:06:10,702][62475] Updated weights for policy 0, policy_version 95080 (0.0006) [2023-03-06 23:06:10,931][62424] KL-divergence is very high: 1141.2214 [2023-03-06 23:06:11,510][62475] Updated weights for policy 0, policy_version 95090 (0.0006) [2023-03-06 23:06:12,311][62475] Updated weights for policy 0, policy_version 95100 (0.0005) [2023-03-06 23:06:12,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12708.0). Total num frames: 97382400. Throughput: 0: 12734.1. Samples: 97353020. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:06:12,390][62145] Avg episode reward: [(0, '408.023')] [2023-03-06 23:06:13,103][62475] Updated weights for policy 0, policy_version 95110 (0.0006) [2023-03-06 23:06:13,912][62475] Updated weights for policy 0, policy_version 95120 (0.0006) [2023-03-06 23:06:14,726][62475] Updated weights for policy 0, policy_version 95130 (0.0007) [2023-03-06 23:06:15,517][62475] Updated weights for policy 0, policy_version 95140 (0.0006) [2023-03-06 23:06:16,312][62475] Updated weights for policy 0, policy_version 95150 (0.0006) [2023-03-06 23:06:17,111][62475] Updated weights for policy 0, policy_version 95160 (0.0007) [2023-03-06 23:06:17,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12711.5). Total num frames: 97446912. Throughput: 0: 12740.9. Samples: 97429716. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:06:17,390][62145] Avg episode reward: [(0, '686.709')] [2023-03-06 23:06:17,901][62475] Updated weights for policy 0, policy_version 95170 (0.0006) [2023-03-06 23:06:18,720][62475] Updated weights for policy 0, policy_version 95180 (0.0007) [2023-03-06 23:06:19,529][62475] Updated weights for policy 0, policy_version 95190 (0.0006) [2023-03-06 23:06:20,339][62475] Updated weights for policy 0, policy_version 95200 (0.0006) [2023-03-06 23:06:21,127][62475] Updated weights for policy 0, policy_version 95210 (0.0007) [2023-03-06 23:06:21,959][62475] Updated weights for policy 0, policy_version 95220 (0.0007) [2023-03-06 23:06:22,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12711.5). Total num frames: 97510400. Throughput: 0: 12735.2. Samples: 97506183. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:06:22,390][62145] Avg episode reward: [(0, '637.358')] [2023-03-06 23:06:22,393][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000095225_97510400.pth... [2023-03-06 23:06:22,426][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000092244_94457856.pth [2023-03-06 23:06:22,767][62475] Updated weights for policy 0, policy_version 95230 (0.0005) [2023-03-06 23:06:23,553][62475] Updated weights for policy 0, policy_version 95240 (0.0006) [2023-03-06 23:06:24,362][62475] Updated weights for policy 0, policy_version 95250 (0.0006) [2023-03-06 23:06:25,173][62475] Updated weights for policy 0, policy_version 95260 (0.0006) [2023-03-06 23:06:25,967][62475] Updated weights for policy 0, policy_version 95270 (0.0007) [2023-03-06 23:06:26,790][62475] Updated weights for policy 0, policy_version 95280 (0.0007) [2023-03-06 23:06:27,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12715.0). Total num frames: 97573888. Throughput: 0: 12730.4. Samples: 97544129. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:06:27,390][62145] Avg episode reward: [(0, '794.902')] [2023-03-06 23:06:27,600][62475] Updated weights for policy 0, policy_version 95290 (0.0007) [2023-03-06 23:06:28,389][62475] Updated weights for policy 0, policy_version 95300 (0.0006) [2023-03-06 23:06:29,188][62475] Updated weights for policy 0, policy_version 95310 (0.0006) [2023-03-06 23:06:30,004][62475] Updated weights for policy 0, policy_version 95320 (0.0006) [2023-03-06 23:06:30,814][62475] Updated weights for policy 0, policy_version 95330 (0.0006) [2023-03-06 23:06:31,609][62475] Updated weights for policy 0, policy_version 95340 (0.0007) [2023-03-06 23:06:32,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12715.0). Total num frames: 97637376. Throughput: 0: 12730.3. Samples: 97620609. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:06:32,390][62145] Avg episode reward: [(0, '608.723')] [2023-03-06 23:06:32,406][62475] Updated weights for policy 0, policy_version 95350 (0.0006) [2023-03-06 23:06:33,197][62475] Updated weights for policy 0, policy_version 95360 (0.0006) [2023-03-06 23:06:34,001][62475] Updated weights for policy 0, policy_version 95370 (0.0006) [2023-03-06 23:06:34,810][62475] Updated weights for policy 0, policy_version 95380 (0.0006) [2023-03-06 23:06:35,612][62475] Updated weights for policy 0, policy_version 95390 (0.0007) [2023-03-06 23:06:36,396][62475] Updated weights for policy 0, policy_version 95400 (0.0006) [2023-03-06 23:06:37,216][62475] Updated weights for policy 0, policy_version 95410 (0.0006) [2023-03-06 23:06:37,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12718.4). Total num frames: 97701888. Throughput: 0: 12736.7. Samples: 97697207. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:06:37,390][62145] Avg episode reward: [(0, '590.370')] [2023-03-06 23:06:38,018][62475] Updated weights for policy 0, policy_version 95420 (0.0006) [2023-03-06 23:06:38,827][62475] Updated weights for policy 0, policy_version 95430 (0.0006) [2023-03-06 23:06:39,626][62475] Updated weights for policy 0, policy_version 95440 (0.0006) [2023-03-06 23:06:40,419][62475] Updated weights for policy 0, policy_version 95450 (0.0006) [2023-03-06 23:06:41,210][62475] Updated weights for policy 0, policy_version 95460 (0.0007) [2023-03-06 23:06:42,012][62475] Updated weights for policy 0, policy_version 95470 (0.0007) [2023-03-06 23:06:42,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12718.4). Total num frames: 97765376. Throughput: 0: 12743.6. Samples: 97735423. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:06:42,390][62145] Avg episode reward: [(0, '561.629')] [2023-03-06 23:06:42,814][62475] Updated weights for policy 0, policy_version 95480 (0.0007) [2023-03-06 23:06:43,601][62475] Updated weights for policy 0, policy_version 95490 (0.0006) [2023-03-06 23:06:44,399][62475] Updated weights for policy 0, policy_version 95500 (0.0005) [2023-03-06 23:06:45,228][62475] Updated weights for policy 0, policy_version 95510 (0.0006) [2023-03-06 23:06:46,010][62475] Updated weights for policy 0, policy_version 95520 (0.0006) [2023-03-06 23:06:46,829][62475] Updated weights for policy 0, policy_version 95530 (0.0006) [2023-03-06 23:06:47,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.8, 300 sec: 12718.4). Total num frames: 97828864. Throughput: 0: 12747.9. Samples: 97812399. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:06:47,390][62145] Avg episode reward: [(0, '608.554')] [2023-03-06 23:06:47,647][62475] Updated weights for policy 0, policy_version 95540 (0.0006) [2023-03-06 23:06:48,452][62475] Updated weights for policy 0, policy_version 95550 (0.0007) [2023-03-06 23:06:49,245][62475] Updated weights for policy 0, policy_version 95560 (0.0006) [2023-03-06 23:06:50,065][62475] Updated weights for policy 0, policy_version 95570 (0.0007) [2023-03-06 23:06:50,856][62475] Updated weights for policy 0, policy_version 95580 (0.0006) [2023-03-06 23:06:51,672][62475] Updated weights for policy 0, policy_version 95590 (0.0006) [2023-03-06 23:06:52,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12721.9). Total num frames: 97893376. Throughput: 0: 12748.9. Samples: 97888490. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:06:52,390][62145] Avg episode reward: [(0, '466.279')] [2023-03-06 23:06:52,482][62475] Updated weights for policy 0, policy_version 95600 (0.0006) [2023-03-06 23:06:53,273][62475] Updated weights for policy 0, policy_version 95610 (0.0006) [2023-03-06 23:06:54,073][62475] Updated weights for policy 0, policy_version 95620 (0.0006) [2023-03-06 23:06:54,883][62475] Updated weights for policy 0, policy_version 95630 (0.0006) [2023-03-06 23:06:55,687][62475] Updated weights for policy 0, policy_version 95640 (0.0007) [2023-03-06 23:06:56,491][62475] Updated weights for policy 0, policy_version 95650 (0.0006) [2023-03-06 23:06:56,658][62424] KL-divergence is very high: 8991443.0000 [2023-03-06 23:06:57,297][62475] Updated weights for policy 0, policy_version 95660 (0.0006) [2023-03-06 23:06:57,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12748.8, 300 sec: 12721.9). Total num frames: 97956864. Throughput: 0: 12750.2. Samples: 97926778. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:06:57,390][62145] Avg episode reward: [(0, '549.321')] [2023-03-06 23:06:58,080][62475] Updated weights for policy 0, policy_version 95670 (0.0007) [2023-03-06 23:06:58,912][62475] Updated weights for policy 0, policy_version 95680 (0.0007) [2023-03-06 23:06:59,694][62475] Updated weights for policy 0, policy_version 95690 (0.0006) [2023-03-06 23:06:59,856][62424] KL-divergence is very high: 1789.9911 [2023-03-06 23:07:00,490][62475] Updated weights for policy 0, policy_version 95700 (0.0006) [2023-03-06 23:07:01,314][62475] Updated weights for policy 0, policy_version 95710 (0.0006) [2023-03-06 23:07:02,114][62475] Updated weights for policy 0, policy_version 95720 (0.0006) [2023-03-06 23:07:02,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12748.8, 300 sec: 12718.4). Total num frames: 98020352. Throughput: 0: 12744.5. Samples: 98003218. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:07:02,390][62145] Avg episode reward: [(0, '550.073')] [2023-03-06 23:07:02,926][62475] Updated weights for policy 0, policy_version 95730 (0.0006) [2023-03-06 23:07:03,722][62475] Updated weights for policy 0, policy_version 95740 (0.0006) [2023-03-06 23:07:04,518][62475] Updated weights for policy 0, policy_version 95750 (0.0007) [2023-03-06 23:07:05,319][62475] Updated weights for policy 0, policy_version 95760 (0.0006) [2023-03-06 23:07:06,124][62475] Updated weights for policy 0, policy_version 95770 (0.0006) [2023-03-06 23:07:06,936][62475] Updated weights for policy 0, policy_version 95780 (0.0006) [2023-03-06 23:07:07,389][62145] Fps is (10 sec: 12697.5, 60 sec: 12748.8, 300 sec: 12721.9). Total num frames: 98083840. Throughput: 0: 12748.4. Samples: 98079859. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:07:07,390][62145] Avg episode reward: [(0, '427.731')] [2023-03-06 23:07:07,715][62475] Updated weights for policy 0, policy_version 95790 (0.0007) [2023-03-06 23:07:08,510][62475] Updated weights for policy 0, policy_version 95800 (0.0006) [2023-03-06 23:07:09,325][62475] Updated weights for policy 0, policy_version 95810 (0.0006) [2023-03-06 23:07:10,110][62475] Updated weights for policy 0, policy_version 95820 (0.0006) [2023-03-06 23:07:10,938][62475] Updated weights for policy 0, policy_version 95830 (0.0007) [2023-03-06 23:07:11,733][62475] Updated weights for policy 0, policy_version 95840 (0.0007) [2023-03-06 23:07:12,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12765.9, 300 sec: 12725.4). Total num frames: 98148352. Throughput: 0: 12758.1. Samples: 98118242. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:07:12,390][62145] Avg episode reward: [(0, '515.261')] [2023-03-06 23:07:12,533][62475] Updated weights for policy 0, policy_version 95850 (0.0006) [2023-03-06 23:07:13,359][62475] Updated weights for policy 0, policy_version 95860 (0.0006) [2023-03-06 23:07:14,154][62475] Updated weights for policy 0, policy_version 95870 (0.0006) [2023-03-06 23:07:14,940][62475] Updated weights for policy 0, policy_version 95880 (0.0007) [2023-03-06 23:07:15,779][62475] Updated weights for policy 0, policy_version 95890 (0.0006) [2023-03-06 23:07:16,565][62475] Updated weights for policy 0, policy_version 95900 (0.0006) [2023-03-06 23:07:17,369][62475] Updated weights for policy 0, policy_version 95910 (0.0008) [2023-03-06 23:07:17,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12725.4). Total num frames: 98211840. Throughput: 0: 12753.0. Samples: 98194495. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:07:17,390][62145] Avg episode reward: [(0, '726.816')] [2023-03-06 23:07:18,173][62475] Updated weights for policy 0, policy_version 95920 (0.0006) [2023-03-06 23:07:18,959][62475] Updated weights for policy 0, policy_version 95930 (0.0006) [2023-03-06 23:07:19,768][62475] Updated weights for policy 0, policy_version 95940 (0.0007) [2023-03-06 23:07:20,586][62475] Updated weights for policy 0, policy_version 95950 (0.0006) [2023-03-06 23:07:21,384][62475] Updated weights for policy 0, policy_version 95960 (0.0006) [2023-03-06 23:07:22,209][62475] Updated weights for policy 0, policy_version 95970 (0.0006) [2023-03-06 23:07:22,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12748.8, 300 sec: 12725.4). Total num frames: 98275328. Throughput: 0: 12745.4. Samples: 98270750. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:07:22,390][62145] Avg episode reward: [(0, '618.670')] [2023-03-06 23:07:23,012][62475] Updated weights for policy 0, policy_version 95980 (0.0005) [2023-03-06 23:07:23,821][62475] Updated weights for policy 0, policy_version 95990 (0.0006) [2023-03-06 23:07:24,608][62475] Updated weights for policy 0, policy_version 96000 (0.0006) [2023-03-06 23:07:25,427][62475] Updated weights for policy 0, policy_version 96010 (0.0007) [2023-03-06 23:07:26,245][62475] Updated weights for policy 0, policy_version 96020 (0.0006) [2023-03-06 23:07:27,053][62475] Updated weights for policy 0, policy_version 96030 (0.0007) [2023-03-06 23:07:27,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12748.8, 300 sec: 12725.4). Total num frames: 98338816. Throughput: 0: 12745.6. Samples: 98308973. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:07:27,390][62145] Avg episode reward: [(0, '465.953')] [2023-03-06 23:07:27,855][62475] Updated weights for policy 0, policy_version 96040 (0.0006) [2023-03-06 23:07:28,650][62475] Updated weights for policy 0, policy_version 96050 (0.0006) [2023-03-06 23:07:29,437][62475] Updated weights for policy 0, policy_version 96060 (0.0006) [2023-03-06 23:07:30,264][62475] Updated weights for policy 0, policy_version 96070 (0.0006) [2023-03-06 23:07:31,067][62475] Updated weights for policy 0, policy_version 96080 (0.0006) [2023-03-06 23:07:31,861][62475] Updated weights for policy 0, policy_version 96090 (0.0007) [2023-03-06 23:07:32,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12748.8, 300 sec: 12725.4). Total num frames: 98402304. Throughput: 0: 12731.0. Samples: 98385296. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:07:32,390][62145] Avg episode reward: [(0, '506.087')] [2023-03-06 23:07:32,670][62475] Updated weights for policy 0, policy_version 96100 (0.0006) [2023-03-06 23:07:33,486][62475] Updated weights for policy 0, policy_version 96110 (0.0006) [2023-03-06 23:07:34,285][62475] Updated weights for policy 0, policy_version 96120 (0.0006) [2023-03-06 23:07:35,106][62475] Updated weights for policy 0, policy_version 96130 (0.0006) [2023-03-06 23:07:35,890][62475] Updated weights for policy 0, policy_version 96140 (0.0006) [2023-03-06 23:07:36,702][62475] Updated weights for policy 0, policy_version 96150 (0.0006) [2023-03-06 23:07:37,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 98465792. Throughput: 0: 12733.9. Samples: 98461516. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:07:37,390][62145] Avg episode reward: [(0, '445.762')] [2023-03-06 23:07:37,501][62475] Updated weights for policy 0, policy_version 96160 (0.0006) [2023-03-06 23:07:38,310][62475] Updated weights for policy 0, policy_version 96170 (0.0006) [2023-03-06 23:07:39,117][62475] Updated weights for policy 0, policy_version 96180 (0.0006) [2023-03-06 23:07:39,910][62475] Updated weights for policy 0, policy_version 96190 (0.0006) [2023-03-06 23:07:40,725][62475] Updated weights for policy 0, policy_version 96200 (0.0006) [2023-03-06 23:07:41,533][62475] Updated weights for policy 0, policy_version 96210 (0.0006) [2023-03-06 23:07:42,317][62475] Updated weights for policy 0, policy_version 96220 (0.0007) [2023-03-06 23:07:42,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 98529280. Throughput: 0: 12732.0. Samples: 98499722. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:07:42,390][62145] Avg episode reward: [(0, '448.197')] [2023-03-06 23:07:43,142][62475] Updated weights for policy 0, policy_version 96230 (0.0006) [2023-03-06 23:07:43,936][62475] Updated weights for policy 0, policy_version 96240 (0.0006) [2023-03-06 23:07:44,729][62475] Updated weights for policy 0, policy_version 96250 (0.0006) [2023-03-06 23:07:45,548][62475] Updated weights for policy 0, policy_version 96260 (0.0006) [2023-03-06 23:07:46,341][62475] Updated weights for policy 0, policy_version 96270 (0.0006) [2023-03-06 23:07:47,149][62475] Updated weights for policy 0, policy_version 96280 (0.0007) [2023-03-06 23:07:47,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 98592768. Throughput: 0: 12737.5. Samples: 98576404. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:07:47,390][62145] Avg episode reward: [(0, '664.073')] [2023-03-06 23:07:47,952][62424] KL-divergence is very high: 420.0791 [2023-03-06 23:07:47,960][62475] Updated weights for policy 0, policy_version 96290 (0.0007) [2023-03-06 23:07:48,767][62475] Updated weights for policy 0, policy_version 96300 (0.0006) [2023-03-06 23:07:49,578][62475] Updated weights for policy 0, policy_version 96310 (0.0006) [2023-03-06 23:07:50,373][62475] Updated weights for policy 0, policy_version 96320 (0.0007) [2023-03-06 23:07:51,191][62475] Updated weights for policy 0, policy_version 96330 (0.0006) [2023-03-06 23:07:51,977][62475] Updated weights for policy 0, policy_version 96340 (0.0006) [2023-03-06 23:07:52,358][62424] KL-divergence is very high: 117954.9688 [2023-03-06 23:07:52,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 98657280. Throughput: 0: 12727.5. Samples: 98652595. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:07:52,390][62145] Avg episode reward: [(0, '496.315')] [2023-03-06 23:07:52,778][62475] Updated weights for policy 0, policy_version 96350 (0.0006) [2023-03-06 23:07:53,579][62475] Updated weights for policy 0, policy_version 96360 (0.0006) [2023-03-06 23:07:54,381][62475] Updated weights for policy 0, policy_version 96370 (0.0006) [2023-03-06 23:07:55,183][62475] Updated weights for policy 0, policy_version 96380 (0.0005) [2023-03-06 23:07:56,005][62475] Updated weights for policy 0, policy_version 96390 (0.0008) [2023-03-06 23:07:56,793][62475] Updated weights for policy 0, policy_version 96400 (0.0006) [2023-03-06 23:07:57,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 98720768. Throughput: 0: 12725.5. Samples: 98690890. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:07:57,390][62145] Avg episode reward: [(0, '536.369')] [2023-03-06 23:07:57,597][62475] Updated weights for policy 0, policy_version 96410 (0.0006) [2023-03-06 23:07:58,411][62475] Updated weights for policy 0, policy_version 96420 (0.0006) [2023-03-06 23:07:59,213][62475] Updated weights for policy 0, policy_version 96430 (0.0006) [2023-03-06 23:08:00,023][62475] Updated weights for policy 0, policy_version 96440 (0.0006) [2023-03-06 23:08:00,834][62475] Updated weights for policy 0, policy_version 96450 (0.0007) [2023-03-06 23:08:01,654][62475] Updated weights for policy 0, policy_version 96460 (0.0006) [2023-03-06 23:08:02,390][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 98784256. Throughput: 0: 12723.8. Samples: 98767065. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:08:02,390][62145] Avg episode reward: [(0, '599.523')] [2023-03-06 23:08:02,441][62475] Updated weights for policy 0, policy_version 96470 (0.0006) [2023-03-06 23:08:03,234][62475] Updated weights for policy 0, policy_version 96480 (0.0006) [2023-03-06 23:08:04,013][62475] Updated weights for policy 0, policy_version 96490 (0.0006) [2023-03-06 23:08:04,837][62475] Updated weights for policy 0, policy_version 96500 (0.0007) [2023-03-06 23:08:05,635][62475] Updated weights for policy 0, policy_version 96510 (0.0007) [2023-03-06 23:08:06,447][62475] Updated weights for policy 0, policy_version 96520 (0.0007) [2023-03-06 23:08:07,245][62475] Updated weights for policy 0, policy_version 96530 (0.0006) [2023-03-06 23:08:07,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 98847744. Throughput: 0: 12732.1. Samples: 98843693. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:08:07,390][62145] Avg episode reward: [(0, '726.965')] [2023-03-06 23:08:08,065][62475] Updated weights for policy 0, policy_version 96540 (0.0006) [2023-03-06 23:08:08,858][62475] Updated weights for policy 0, policy_version 96550 (0.0007) [2023-03-06 23:08:09,658][62475] Updated weights for policy 0, policy_version 96560 (0.0006) [2023-03-06 23:08:10,460][62475] Updated weights for policy 0, policy_version 96570 (0.0006) [2023-03-06 23:08:11,265][62475] Updated weights for policy 0, policy_version 96580 (0.0006) [2023-03-06 23:08:12,071][62475] Updated weights for policy 0, policy_version 96590 (0.0006) [2023-03-06 23:08:12,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12732.3). Total num frames: 98912256. Throughput: 0: 12735.2. Samples: 98882057. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:08:12,390][62145] Avg episode reward: [(0, '507.365')] [2023-03-06 23:08:12,877][62475] Updated weights for policy 0, policy_version 96600 (0.0006) [2023-03-06 23:08:13,684][62475] Updated weights for policy 0, policy_version 96610 (0.0007) [2023-03-06 23:08:14,481][62475] Updated weights for policy 0, policy_version 96620 (0.0006) [2023-03-06 23:08:15,286][62475] Updated weights for policy 0, policy_version 96630 (0.0006) [2023-03-06 23:08:16,096][62475] Updated weights for policy 0, policy_version 96640 (0.0006) [2023-03-06 23:08:16,885][62475] Updated weights for policy 0, policy_version 96650 (0.0006) [2023-03-06 23:08:17,389][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 98975744. Throughput: 0: 12734.9. Samples: 98958364. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:08:17,390][62145] Avg episode reward: [(0, '709.460')] [2023-03-06 23:08:17,695][62475] Updated weights for policy 0, policy_version 96660 (0.0006) [2023-03-06 23:08:18,506][62475] Updated weights for policy 0, policy_version 96670 (0.0006) [2023-03-06 23:08:19,313][62475] Updated weights for policy 0, policy_version 96680 (0.0006) [2023-03-06 23:08:20,105][62475] Updated weights for policy 0, policy_version 96690 (0.0006) [2023-03-06 23:08:20,895][62475] Updated weights for policy 0, policy_version 96700 (0.0006) [2023-03-06 23:08:21,711][62475] Updated weights for policy 0, policy_version 96710 (0.0006) [2023-03-06 23:08:22,390][62145] Fps is (10 sec: 12697.4, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 99039232. Throughput: 0: 12739.3. Samples: 99034785. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:08:22,390][62145] Avg episode reward: [(0, '701.209')] [2023-03-06 23:08:22,395][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000096718_99039232.pth... [2023-03-06 23:08:22,426][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000093733_95982592.pth [2023-03-06 23:08:22,533][62475] Updated weights for policy 0, policy_version 96720 (0.0007) [2023-03-06 23:08:23,324][62475] Updated weights for policy 0, policy_version 96730 (0.0006) [2023-03-06 23:08:24,122][62475] Updated weights for policy 0, policy_version 96740 (0.0006) [2023-03-06 23:08:24,959][62475] Updated weights for policy 0, policy_version 96750 (0.0006) [2023-03-06 23:08:25,746][62475] Updated weights for policy 0, policy_version 96760 (0.0006) [2023-03-06 23:08:26,554][62475] Updated weights for policy 0, policy_version 96770 (0.0006) [2023-03-06 23:08:27,346][62475] Updated weights for policy 0, policy_version 96780 (0.0006) [2023-03-06 23:08:27,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 99102720. Throughput: 0: 12736.8. Samples: 99072875. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:08:27,390][62145] Avg episode reward: [(0, '513.452')] [2023-03-06 23:08:28,151][62475] Updated weights for policy 0, policy_version 96790 (0.0006) [2023-03-06 23:08:28,952][62475] Updated weights for policy 0, policy_version 96800 (0.0006) [2023-03-06 23:08:29,744][62475] Updated weights for policy 0, policy_version 96810 (0.0006) [2023-03-06 23:08:30,566][62475] Updated weights for policy 0, policy_version 96820 (0.0006) [2023-03-06 23:08:31,355][62475] Updated weights for policy 0, policy_version 96830 (0.0006) [2023-03-06 23:08:32,160][62475] Updated weights for policy 0, policy_version 96840 (0.0007) [2023-03-06 23:08:32,390][62145] Fps is (10 sec: 12697.7, 60 sec: 12731.7, 300 sec: 12732.3). Total num frames: 99166208. Throughput: 0: 12731.1. Samples: 99149304. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:08:32,390][62145] Avg episode reward: [(0, '559.376')] [2023-03-06 23:08:32,979][62475] Updated weights for policy 0, policy_version 96850 (0.0007) [2023-03-06 23:08:33,781][62475] Updated weights for policy 0, policy_version 96860 (0.0007) [2023-03-06 23:08:34,582][62475] Updated weights for policy 0, policy_version 96870 (0.0006) [2023-03-06 23:08:35,389][62475] Updated weights for policy 0, policy_version 96880 (0.0006) [2023-03-06 23:08:36,180][62475] Updated weights for policy 0, policy_version 96890 (0.0006) [2023-03-06 23:08:36,988][62475] Updated weights for policy 0, policy_version 96900 (0.0006) [2023-03-06 23:08:37,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12732.3). Total num frames: 99230720. Throughput: 0: 12740.2. Samples: 99225906. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:08:37,390][62145] Avg episode reward: [(0, '509.062')] [2023-03-06 23:08:37,782][62475] Updated weights for policy 0, policy_version 96910 (0.0007) [2023-03-06 23:08:38,593][62475] Updated weights for policy 0, policy_version 96920 (0.0006) [2023-03-06 23:08:39,394][62475] Updated weights for policy 0, policy_version 96930 (0.0006) [2023-03-06 23:08:40,211][62475] Updated weights for policy 0, policy_version 96940 (0.0006) [2023-03-06 23:08:41,010][62475] Updated weights for policy 0, policy_version 96950 (0.0006) [2023-03-06 23:08:41,826][62475] Updated weights for policy 0, policy_version 96960 (0.0006) [2023-03-06 23:08:42,389][62145] Fps is (10 sec: 12800.1, 60 sec: 12748.8, 300 sec: 12732.3). Total num frames: 99294208. Throughput: 0: 12735.3. Samples: 99263977. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:08:42,390][62145] Avg episode reward: [(0, '414.191')] [2023-03-06 23:08:42,647][62475] Updated weights for policy 0, policy_version 96970 (0.0006) [2023-03-06 23:08:43,445][62475] Updated weights for policy 0, policy_version 96980 (0.0006) [2023-03-06 23:08:44,237][62475] Updated weights for policy 0, policy_version 96990 (0.0006) [2023-03-06 23:08:45,033][62475] Updated weights for policy 0, policy_version 97000 (0.0005) [2023-03-06 23:08:45,854][62475] Updated weights for policy 0, policy_version 97010 (0.0007) [2023-03-06 23:08:46,661][62475] Updated weights for policy 0, policy_version 97020 (0.0006) [2023-03-06 23:08:47,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12748.8, 300 sec: 12732.3). Total num frames: 99357696. Throughput: 0: 12736.3. Samples: 99340200. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:08:47,400][62145] Avg episode reward: [(0, '540.148')] [2023-03-06 23:08:47,460][62475] Updated weights for policy 0, policy_version 97030 (0.0006) [2023-03-06 23:08:48,267][62475] Updated weights for policy 0, policy_version 97040 (0.0007) [2023-03-06 23:08:49,073][62475] Updated weights for policy 0, policy_version 97050 (0.0006) [2023-03-06 23:08:49,858][62475] Updated weights for policy 0, policy_version 97060 (0.0007) [2023-03-06 23:08:50,682][62475] Updated weights for policy 0, policy_version 97070 (0.0006) [2023-03-06 23:08:51,475][62475] Updated weights for policy 0, policy_version 97080 (0.0006) [2023-03-06 23:08:52,264][62475] Updated weights for policy 0, policy_version 97090 (0.0006) [2023-03-06 23:08:52,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12732.3). Total num frames: 99421184. Throughput: 0: 12733.7. Samples: 99416709. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:08:52,390][62145] Avg episode reward: [(0, '594.250')] [2023-03-06 23:08:53,080][62475] Updated weights for policy 0, policy_version 97100 (0.0007) [2023-03-06 23:08:53,881][62475] Updated weights for policy 0, policy_version 97110 (0.0006) [2023-03-06 23:08:54,681][62475] Updated weights for policy 0, policy_version 97120 (0.0006) [2023-03-06 23:08:55,496][62475] Updated weights for policy 0, policy_version 97130 (0.0006) [2023-03-06 23:08:56,297][62475] Updated weights for policy 0, policy_version 97140 (0.0006) [2023-03-06 23:08:57,097][62475] Updated weights for policy 0, policy_version 97150 (0.0006) [2023-03-06 23:08:57,389][62145] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12732.3). Total num frames: 99484672. Throughput: 0: 12733.0. Samples: 99455041. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:08:57,390][62145] Avg episode reward: [(0, '430.527')] [2023-03-06 23:08:57,883][62475] Updated weights for policy 0, policy_version 97160 (0.0006) [2023-03-06 23:08:58,700][62475] Updated weights for policy 0, policy_version 97170 (0.0006) [2023-03-06 23:08:59,513][62475] Updated weights for policy 0, policy_version 97180 (0.0006) [2023-03-06 23:09:00,313][62475] Updated weights for policy 0, policy_version 97190 (0.0006) [2023-03-06 23:09:01,117][62475] Updated weights for policy 0, policy_version 97200 (0.0006) [2023-03-06 23:09:01,923][62475] Updated weights for policy 0, policy_version 97210 (0.0007) [2023-03-06 23:09:02,390][62145] Fps is (10 sec: 12697.4, 60 sec: 12731.7, 300 sec: 12732.3). Total num frames: 99548160. Throughput: 0: 12735.9. Samples: 99531483. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:09:02,390][62145] Avg episode reward: [(0, '439.803')] [2023-03-06 23:09:02,717][62475] Updated weights for policy 0, policy_version 97220 (0.0006) [2023-03-06 23:09:03,503][62475] Updated weights for policy 0, policy_version 97230 (0.0006) [2023-03-06 23:09:04,306][62475] Updated weights for policy 0, policy_version 97240 (0.0006) [2023-03-06 23:09:05,110][62475] Updated weights for policy 0, policy_version 97250 (0.0006) [2023-03-06 23:09:05,912][62475] Updated weights for policy 0, policy_version 97260 (0.0006) [2023-03-06 23:09:06,729][62475] Updated weights for policy 0, policy_version 97270 (0.0007) [2023-03-06 23:09:07,390][62145] Fps is (10 sec: 12799.9, 60 sec: 12748.8, 300 sec: 12735.8). Total num frames: 99612672. Throughput: 0: 12736.6. Samples: 99607930. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:09:07,390][62145] Avg episode reward: [(0, '493.430')] [2023-03-06 23:09:07,531][62475] Updated weights for policy 0, policy_version 97280 (0.0006) [2023-03-06 23:09:08,353][62475] Updated weights for policy 0, policy_version 97290 (0.0006) [2023-03-06 23:09:09,145][62475] Updated weights for policy 0, policy_version 97300 (0.0006) [2023-03-06 23:09:09,953][62475] Updated weights for policy 0, policy_version 97310 (0.0006) [2023-03-06 23:09:10,765][62475] Updated weights for policy 0, policy_version 97320 (0.0006) [2023-03-06 23:09:11,553][62475] Updated weights for policy 0, policy_version 97330 (0.0006) [2023-03-06 23:09:12,350][62475] Updated weights for policy 0, policy_version 97340 (0.0006) [2023-03-06 23:09:12,390][62145] Fps is (10 sec: 12800.1, 60 sec: 12731.7, 300 sec: 12735.8). Total num frames: 99676160. Throughput: 0: 12737.8. Samples: 99646075. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:09:12,390][62145] Avg episode reward: [(0, '451.843')] [2023-03-06 23:09:13,169][62475] Updated weights for policy 0, policy_version 97350 (0.0007) [2023-03-06 23:09:13,966][62475] Updated weights for policy 0, policy_version 97360 (0.0006) [2023-03-06 23:09:14,767][62475] Updated weights for policy 0, policy_version 97370 (0.0006) [2023-03-06 23:09:15,571][62475] Updated weights for policy 0, policy_version 97380 (0.0006) [2023-03-06 23:09:16,360][62475] Updated weights for policy 0, policy_version 97390 (0.0006) [2023-03-06 23:09:17,183][62475] Updated weights for policy 0, policy_version 97400 (0.0007) [2023-03-06 23:09:17,389][62145] Fps is (10 sec: 12697.7, 60 sec: 12731.7, 300 sec: 12735.8). Total num frames: 99739648. Throughput: 0: 12740.0. Samples: 99722604. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:09:17,390][62145] Avg episode reward: [(0, '644.660')] [2023-03-06 23:09:17,995][62475] Updated weights for policy 0, policy_version 97410 (0.0007) [2023-03-06 23:09:18,812][62475] Updated weights for policy 0, policy_version 97420 (0.0006) [2023-03-06 23:09:19,624][62475] Updated weights for policy 0, policy_version 97430 (0.0006) [2023-03-06 23:09:20,432][62475] Updated weights for policy 0, policy_version 97440 (0.0006) [2023-03-06 23:09:21,225][62475] Updated weights for policy 0, policy_version 97450 (0.0007) [2023-03-06 23:09:22,045][62475] Updated weights for policy 0, policy_version 97460 (0.0006) [2023-03-06 23:09:22,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12735.8). Total num frames: 99803136. Throughput: 0: 12731.1. Samples: 99798805. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:09:22,390][62145] Avg episode reward: [(0, '631.921')] [2023-03-06 23:09:22,844][62475] Updated weights for policy 0, policy_version 97470 (0.0006) [2023-03-06 23:09:23,647][62475] Updated weights for policy 0, policy_version 97480 (0.0006) [2023-03-06 23:09:24,424][62475] Updated weights for policy 0, policy_version 97490 (0.0006) [2023-03-06 23:09:25,243][62475] Updated weights for policy 0, policy_version 97500 (0.0007) [2023-03-06 23:09:26,058][62475] Updated weights for policy 0, policy_version 97510 (0.0008) [2023-03-06 23:09:26,837][62475] Updated weights for policy 0, policy_version 97520 (0.0006) [2023-03-06 23:09:27,390][62145] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12735.8). Total num frames: 99866624. Throughput: 0: 12739.6. Samples: 99837261. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:09:27,390][62145] Avg episode reward: [(0, '594.638')] [2023-03-06 23:09:27,635][62475] Updated weights for policy 0, policy_version 97530 (0.0006) [2023-03-06 23:09:28,453][62475] Updated weights for policy 0, policy_version 97540 (0.0005) [2023-03-06 23:09:29,274][62475] Updated weights for policy 0, policy_version 97550 (0.0006) [2023-03-06 23:09:30,071][62475] Updated weights for policy 0, policy_version 97560 (0.0007) [2023-03-06 23:09:30,899][62475] Updated weights for policy 0, policy_version 97570 (0.0006) [2023-03-06 23:09:31,709][62475] Updated weights for policy 0, policy_version 97580 (0.0006) [2023-03-06 23:09:32,389][62145] Fps is (10 sec: 12697.8, 60 sec: 12731.7, 300 sec: 12735.8). Total num frames: 99930112. Throughput: 0: 12733.2. Samples: 99913192. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:09:32,390][62145] Avg episode reward: [(0, '785.069')] [2023-03-06 23:09:32,512][62475] Updated weights for policy 0, policy_version 97590 (0.0006) [2023-03-06 23:09:33,322][62475] Updated weights for policy 0, policy_version 97600 (0.0006) [2023-03-06 23:09:34,123][62475] Updated weights for policy 0, policy_version 97610 (0.0005) [2023-03-06 23:09:34,930][62475] Updated weights for policy 0, policy_version 97620 (0.0006) [2023-03-06 23:09:35,730][62475] Updated weights for policy 0, policy_version 97630 (0.0006) [2023-03-06 23:09:36,513][62475] Updated weights for policy 0, policy_version 97640 (0.0006) [2023-03-06 23:09:37,309][62475] Updated weights for policy 0, policy_version 97650 (0.0006) [2023-03-06 23:09:37,390][62145] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12739.3). Total num frames: 99994624. Throughput: 0: 12732.1. Samples: 99989656. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:09:37,390][62145] Avg episode reward: [(0, '565.375')] [2023-03-06 23:09:37,945][62614] Stopping RolloutWorker_w6... [2023-03-06 23:09:37,945][62613] Stopping RolloutWorker_w17... [2023-03-06 23:09:37,945][62942] Stopping RolloutWorker_w29... [2023-03-06 23:09:37,945][62606] Stopping RolloutWorker_w11... [2023-03-06 23:09:37,945][62936] Stopping RolloutWorker_w23... [2023-03-06 23:09:37,945][62647] Stopping RolloutWorker_w10... [2023-03-06 23:09:37,945][62614] Loop rollout_proc6_evt_loop terminating... [2023-03-06 23:09:37,945][62603] Stopping RolloutWorker_w5... [2023-03-06 23:09:37,945][62615] Stopping RolloutWorker_w18... [2023-03-06 23:09:37,945][62935] Stopping RolloutWorker_w22... [2023-03-06 23:09:37,946][62942] Loop rollout_proc29_evt_loop terminating... [2023-03-06 23:09:37,946][62613] Loop rollout_proc17_evt_loop terminating... [2023-03-06 23:09:37,946][62606] Loop rollout_proc11_evt_loop terminating... [2023-03-06 23:09:37,945][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000097658_100001792.pth... [2023-03-06 23:09:37,946][62647] Loop rollout_proc10_evt_loop terminating... [2023-03-06 23:09:37,945][62605] Stopping RolloutWorker_w4... [2023-03-06 23:09:37,945][62609] Stopping RolloutWorker_w3... [2023-03-06 23:09:37,945][62611] Stopping RolloutWorker_w9... [2023-03-06 23:09:37,946][62936] Loop rollout_proc23_evt_loop terminating... [2023-03-06 23:09:37,946][62941] Stopping RolloutWorker_w28... [2023-03-06 23:09:37,946][62615] Loop rollout_proc18_evt_loop terminating... [2023-03-06 23:09:37,946][62935] Loop rollout_proc22_evt_loop terminating... [2023-03-06 23:09:37,946][62902] Stopping RolloutWorker_w21... [2023-03-06 23:09:37,946][62742] Stopping RolloutWorker_w8... [2023-03-06 23:09:37,946][62939] Stopping RolloutWorker_w26... [2023-03-06 23:09:37,946][62603] Loop rollout_proc5_evt_loop terminating... [2023-03-06 23:09:37,946][62937] Stopping RolloutWorker_w24... [2023-03-06 23:09:37,946][62609] Loop rollout_proc3_evt_loop terminating... [2023-03-06 23:09:37,946][62610] Stopping RolloutWorker_w14... [2023-03-06 23:09:37,946][62611] Loop rollout_proc9_evt_loop terminating... [2023-03-06 23:09:37,946][62941] Loop rollout_proc28_evt_loop terminating... [2023-03-06 23:09:37,946][62605] Loop rollout_proc4_evt_loop terminating... [2023-03-06 23:09:37,946][62477] Stopping RolloutWorker_w1... [2023-03-06 23:09:37,946][62901] Stopping RolloutWorker_w20... [2023-03-06 23:09:37,946][62840] Stopping RolloutWorker_w7... [2023-03-06 23:09:37,946][62939] Loop rollout_proc26_evt_loop terminating... [2023-03-06 23:09:37,946][62902] Loop rollout_proc21_evt_loop terminating... [2023-03-06 23:09:37,946][62742] Loop rollout_proc8_evt_loop terminating... [2023-03-06 23:09:37,946][62608] Stopping RolloutWorker_w15... [2023-03-06 23:09:37,946][62974] Stopping RolloutWorker_w30... [2023-03-06 23:09:37,946][62937] Loop rollout_proc24_evt_loop terminating... [2023-03-06 23:09:37,946][62610] Loop rollout_proc14_evt_loop terminating... [2023-03-06 23:09:37,946][62477] Loop rollout_proc1_evt_loop terminating... [2023-03-06 23:09:37,946][62940] Stopping RolloutWorker_w27... [2023-03-06 23:09:37,946][62607] Stopping RolloutWorker_w16... [2023-03-06 23:09:37,946][62901] Loop rollout_proc20_evt_loop terminating... [2023-03-06 23:09:37,946][62608] Loop rollout_proc15_evt_loop terminating... [2023-03-06 23:09:37,946][62840] Loop rollout_proc7_evt_loop terminating... [2023-03-06 23:09:37,946][62982] Stopping RolloutWorker_w31... [2023-03-06 23:09:37,946][62974] Loop rollout_proc30_evt_loop terminating... [2023-03-06 23:09:37,946][62940] Loop rollout_proc27_evt_loop terminating... [2023-03-06 23:09:37,946][62607] Loop rollout_proc16_evt_loop terminating... [2023-03-06 23:09:37,946][62982] Loop rollout_proc31_evt_loop terminating... [2023-03-06 23:09:37,946][62145] Component RolloutWorker_w6 stopped! [2023-03-06 23:09:37,948][62145] Component RolloutWorker_w17 stopped! [2023-03-06 23:09:37,948][62145] Component RolloutWorker_w29 stopped! [2023-03-06 23:09:37,949][62145] Component RolloutWorker_w11 stopped! [2023-03-06 23:09:37,949][62145] Component RolloutWorker_w23 stopped! [2023-03-06 23:09:37,950][62145] Component RolloutWorker_w5 stopped! [2023-03-06 23:09:37,950][62145] Component RolloutWorker_w18 stopped! [2023-03-06 23:09:37,951][62145] Component RolloutWorker_w10 stopped! [2023-03-06 23:09:37,951][62145] Component RolloutWorker_w22 stopped! [2023-03-06 23:09:37,952][62145] Component RolloutWorker_w4 stopped! [2023-03-06 23:09:37,946][62424] Stopping Batcher_0... [2023-03-06 23:09:37,952][62145] Component RolloutWorker_w9 stopped! [2023-03-06 23:09:37,953][62145] Component RolloutWorker_w3 stopped! [2023-03-06 23:09:37,953][62145] Component RolloutWorker_w21 stopped! [2023-03-06 23:09:37,954][62145] Component RolloutWorker_w8 stopped! [2023-03-06 23:09:37,954][62478] Stopping RolloutWorker_w2... [2023-03-06 23:09:37,954][62145] Component RolloutWorker_w28 stopped! [2023-03-06 23:09:37,955][62478] Loop rollout_proc2_evt_loop terminating... [2023-03-06 23:09:37,955][62145] Component RolloutWorker_w24 stopped! [2023-03-06 23:09:37,955][62145] Component Batcher_0 stopped! [2023-03-06 23:09:37,956][62145] Component RolloutWorker_w26 stopped! [2023-03-06 23:09:37,956][62145] Component RolloutWorker_w14 stopped! [2023-03-06 23:09:37,957][62145] Component RolloutWorker_w20 stopped! [2023-03-06 23:09:37,957][62145] Component RolloutWorker_w1 stopped! [2023-03-06 23:09:37,958][62145] Component RolloutWorker_w7 stopped! [2023-03-06 23:09:37,958][62145] Component RolloutWorker_w30 stopped! [2023-03-06 23:09:37,959][62145] Component RolloutWorker_w15 stopped! [2023-03-06 23:09:37,961][62775] Stopping RolloutWorker_w19... [2023-03-06 23:09:37,962][62775] Loop rollout_proc19_evt_loop terminating... [2023-03-06 23:09:37,962][62476] Stopping RolloutWorker_w0... [2023-03-06 23:09:37,963][62476] Loop rollout_proc0_evt_loop terminating... [2023-03-06 23:09:37,959][62145] Component RolloutWorker_w12 stopped! [2023-03-06 23:09:37,964][62145] Component RolloutWorker_w16 stopped! [2023-03-06 23:09:37,965][62145] Component RolloutWorker_w27 stopped! [2023-03-06 23:09:37,965][62145] Component RolloutWorker_w31 stopped! [2023-03-06 23:09:37,965][62145] Component RolloutWorker_w2 stopped! [2023-03-06 23:09:37,966][62145] Component RolloutWorker_w19 stopped! [2023-03-06 23:09:37,966][62145] Component RolloutWorker_w0 stopped! [2023-03-06 23:09:37,946][62604] Stopping RolloutWorker_w12... [2023-03-06 23:09:37,969][62424] Loop batcher_evt_loop terminating... [2023-03-06 23:09:37,969][62604] Loop rollout_proc12_evt_loop terminating... [2023-03-06 23:09:37,978][62938] Stopping RolloutWorker_w25... [2023-03-06 23:09:37,979][62938] Loop rollout_proc25_evt_loop terminating... [2023-03-06 23:09:37,978][62145] Component RolloutWorker_w25 stopped! [2023-03-06 23:09:37,994][62612] Stopping RolloutWorker_w13... [2023-03-06 23:09:37,995][62612] Loop rollout_proc13_evt_loop terminating... [2023-03-06 23:09:37,995][62145] Component RolloutWorker_w13 stopped! [2023-03-06 23:09:38,014][62475] Weights refcount: 2 0 [2023-03-06 23:09:38,017][62475] Stopping InferenceWorker_p0-w0... [2023-03-06 23:09:38,017][62475] Loop inference_proc0-0_evt_loop terminating... [2023-03-06 23:09:38,017][62145] Component InferenceWorker_p0-w0 stopped! [2023-03-06 23:09:38,054][62424] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000095225_97510400.pth [2023-03-06 23:09:38,072][62424] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/box-close-v2/checkpoint_p0/checkpoint_000097658_100001792.pth... [2023-03-06 23:09:38,164][62424] Stopping LearnerWorker_p0... [2023-03-06 23:09:38,164][62424] Loop learner_proc0_evt_loop terminating... [2023-03-06 23:09:38,164][62145] Component LearnerWorker_p0 stopped! [2023-03-06 23:09:38,165][62145] Waiting for process learner_proc0 to stop... [2023-03-06 23:09:39,346][62145] Waiting for process inference_proc0-0 to join... [2023-03-06 23:09:39,346][62145] Waiting for process rollout_proc0 to join... [2023-03-06 23:09:39,347][62145] Waiting for process rollout_proc1 to join... [2023-03-06 23:09:39,347][62145] Waiting for process rollout_proc2 to join... [2023-03-06 23:09:39,347][62145] Waiting for process rollout_proc3 to join... [2023-03-06 23:09:39,348][62145] Waiting for process rollout_proc4 to join... [2023-03-06 23:09:39,348][62145] Waiting for process rollout_proc5 to join... [2023-03-06 23:09:39,349][62145] Waiting for process rollout_proc6 to join... [2023-03-06 23:09:39,349][62145] Waiting for process rollout_proc7 to join... [2023-03-06 23:09:39,350][62145] Waiting for process rollout_proc8 to join... [2023-03-06 23:09:39,350][62145] Waiting for process rollout_proc9 to join... [2023-03-06 23:09:39,351][62145] Waiting for process rollout_proc10 to join... [2023-03-06 23:09:39,351][62145] Waiting for process rollout_proc11 to join... [2023-03-06 23:09:39,352][62145] Waiting for process rollout_proc12 to join... [2023-03-06 23:09:39,352][62145] Waiting for process rollout_proc13 to join... [2023-03-06 23:09:39,353][62145] Waiting for process rollout_proc14 to join... [2023-03-06 23:09:39,353][62145] Waiting for process rollout_proc15 to join... [2023-03-06 23:09:39,353][62145] Waiting for process rollout_proc16 to join... [2023-03-06 23:09:39,354][62145] Waiting for process rollout_proc17 to join... [2023-03-06 23:09:39,354][62145] Waiting for process rollout_proc18 to join... [2023-03-06 23:09:39,355][62145] Waiting for process rollout_proc19 to join... [2023-03-06 23:09:39,355][62145] Waiting for process rollout_proc20 to join... [2023-03-06 23:09:39,356][62145] Waiting for process rollout_proc21 to join... [2023-03-06 23:09:39,356][62145] Waiting for process rollout_proc22 to join... [2023-03-06 23:09:39,357][62145] Waiting for process rollout_proc23 to join... [2023-03-06 23:09:39,357][62145] Waiting for process rollout_proc24 to join... [2023-03-06 23:09:39,358][62145] Waiting for process rollout_proc25 to join... [2023-03-06 23:09:39,358][62145] Waiting for process rollout_proc26 to join... [2023-03-06 23:09:39,358][62145] Waiting for process rollout_proc27 to join... [2023-03-06 23:09:39,359][62145] Waiting for process rollout_proc28 to join... [2023-03-06 23:09:39,359][62145] Waiting for process rollout_proc29 to join... [2023-03-06 23:09:39,360][62145] Waiting for process rollout_proc30 to join... [2023-03-06 23:09:39,360][62145] Waiting for process rollout_proc31 to join... [2023-03-06 23:09:39,361][62145] Batcher 0 profile tree view: batching: 835.5145, releasing_batches: 1.6941 [2023-03-06 23:09:39,361][62145] InferenceWorker_p0-w0 profile tree view: wait_policy: 0.0001 wait_policy_total: 237.1795 update_model: 138.2463 weight_update: 0.0006 one_step: 0.0083 handle_policy_step: 7097.1472 deserialize: 211.5298, stack: 36.9308, obs_to_device_normalize: 1227.1497, forward: 3230.5210, send_messages: 1382.2808 prepare_outputs: 731.2408 to_cpu: 366.3219 [2023-03-06 23:09:39,362][62145] Learner 0 profile tree view: misc: 0.4723, prepare_batch: 387.7806 train: 891.4539 epoch_init: 0.3683, minibatch_init: 0.3991, losses_postprocess: 30.0777, kl_divergence: 35.1325, after_optimizer: 116.5761 calculate_losses: 294.1161 losses_init: 0.2069, forward_head: 15.8328, bptt_initial: 106.2391, tail: 59.6305, advantages_returns: 7.3354, losses: 27.7506 bptt: 68.3593 bptt_forward_core: 65.9575 update: 392.1942 clip: 54.6436 [2023-03-06 23:09:39,362][62145] RolloutWorker_w0 profile tree view: wait_for_trajectories: 3.4278, enqueue_policy_requests: 160.6742, env_step: 3178.0106, overhead: 135.3420, complete_rollouts: 8.4526 save_policy_outputs: 188.1453 split_output_tensors: 91.3774 [2023-03-06 23:09:39,362][62145] RolloutWorker_w31 profile tree view: wait_for_trajectories: 3.4261, enqueue_policy_requests: 161.2295, env_step: 3246.2485, overhead: 137.4922, complete_rollouts: 8.3154 save_policy_outputs: 188.5139 split_output_tensors: 91.2717 [2023-03-06 23:09:39,363][62145] Loop Runner_EvtLoop terminating... [2023-03-06 23:09:39,364][62145] Runner profile tree view: main_loop: 7873.0077 [2023-03-06 23:09:39,364][62145] Collected {0: 100001792}, FPS: 12701.9