[2023-03-07 03:22:38,976][117718] Saving configuration to /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/config.json...
[2023-03-07 03:22:38,990][117718] Rollout worker 0 uses device cpu
[2023-03-07 03:22:38,990][117718] Rollout worker 1 uses device cpu
[2023-03-07 03:22:38,991][117718] Rollout worker 2 uses device cpu
[2023-03-07 03:22:38,991][117718] Rollout worker 3 uses device cpu
[2023-03-07 03:22:38,991][117718] Rollout worker 4 uses device cpu
[2023-03-07 03:22:38,991][117718] Rollout worker 5 uses device cpu
[2023-03-07 03:22:38,991][117718] Rollout worker 6 uses device cpu
[2023-03-07 03:22:38,991][117718] Rollout worker 7 uses device cpu
[2023-03-07 03:22:38,991][117718] Rollout worker 8 uses device cpu
[2023-03-07 03:22:38,991][117718] Rollout worker 9 uses device cpu
[2023-03-07 03:22:38,992][117718] Rollout worker 10 uses device cpu
[2023-03-07 03:22:38,992][117718] Rollout worker 11 uses device cpu
[2023-03-07 03:22:38,992][117718] Rollout worker 12 uses device cpu
[2023-03-07 03:22:38,992][117718] Rollout worker 13 uses device cpu
[2023-03-07 03:22:38,992][117718] Rollout worker 14 uses device cpu
[2023-03-07 03:22:38,992][117718] Rollout worker 15 uses device cpu
[2023-03-07 03:22:38,992][117718] Rollout worker 16 uses device cpu
[2023-03-07 03:22:38,992][117718] Rollout worker 17 uses device cpu
[2023-03-07 03:22:38,992][117718] Rollout worker 18 uses device cpu
[2023-03-07 03:22:38,993][117718] Rollout worker 19 uses device cpu
[2023-03-07 03:22:38,993][117718] Rollout worker 20 uses device cpu
[2023-03-07 03:22:38,993][117718] Rollout worker 21 uses device cpu
[2023-03-07 03:22:38,993][117718] Rollout worker 22 uses device cpu
[2023-03-07 03:22:38,993][117718] Rollout worker 23 uses device cpu
[2023-03-07 03:22:38,993][117718] Rollout worker 24 uses device cpu
[2023-03-07 03:22:38,993][117718] Rollout worker 25 uses device cpu
[2023-03-07 03:22:38,993][117718] Rollout worker 26 uses device cpu
[2023-03-07 03:22:38,993][117718] Rollout worker 27 uses device cpu
[2023-03-07 03:22:38,994][117718] Rollout worker 28 uses device cpu
[2023-03-07 03:22:38,994][117718] Rollout worker 29 uses device cpu
[2023-03-07 03:22:38,994][117718] Rollout worker 30 uses device cpu
[2023-03-07 03:22:38,994][117718] Rollout worker 31 uses device cpu
[2023-03-07 03:22:39,009][117718] Using GPUs [0] for process 0 (actually maps to GPUs [0])
[2023-03-07 03:22:39,010][117718] InferenceWorker_p0-w0: min num requests: 10
[2023-03-07 03:22:39,091][117718] Starting all processes...
[2023-03-07 03:22:39,091][117718] Starting process learner_proc0
[2023-03-07 03:22:39,141][117718] Starting all processes...
[2023-03-07 03:22:39,208][117718] Starting process inference_proc0-0
[2023-03-07 03:22:39,208][117718] Starting process rollout_proc0
[2023-03-07 03:22:39,209][117718] Starting process rollout_proc1
[2023-03-07 03:22:39,209][117718] Starting process rollout_proc2
[2023-03-07 03:22:39,209][117718] Starting process rollout_proc3
[2023-03-07 03:22:39,210][117718] Starting process rollout_proc4
[2023-03-07 03:22:39,215][117718] Starting process rollout_proc5
[2023-03-07 03:22:39,217][117718] Starting process rollout_proc6
[2023-03-07 03:22:39,217][117718] Starting process rollout_proc7
[2023-03-07 03:22:39,218][117718] Starting process rollout_proc8
[2023-03-07 03:22:39,227][117718] Starting process rollout_proc9
[2023-03-07 03:22:39,227][117718] Starting process rollout_proc10
[2023-03-07 03:22:39,228][117718] Starting process rollout_proc11
[2023-03-07 03:22:39,228][117718] Starting process rollout_proc12
[2023-03-07 03:22:39,230][117718] Starting process rollout_proc13
[2023-03-07 03:22:39,230][117718] Starting process rollout_proc14
[2023-03-07 03:22:39,233][117718] Starting process rollout_proc15
[2023-03-07 03:22:39,234][117718] Starting process rollout_proc16
[2023-03-07 03:22:39,234][117718] Starting process rollout_proc17
[2023-03-07 03:22:39,234][117718] Starting process rollout_proc18
[2023-03-07 03:22:39,241][117718] Starting process rollout_proc19
[2023-03-07 03:22:39,342][117718] Starting process rollout_proc20
[2023-03-07 03:22:39,342][117718] Starting process rollout_proc21
[2023-03-07 03:22:39,354][117718] Starting process rollout_proc22
[2023-03-07 03:22:39,390][117718] Starting process rollout_proc23
[2023-03-07 03:22:39,399][117718] Starting process rollout_proc24
[2023-03-07 03:22:39,415][117718] Starting process rollout_proc25
[2023-03-07 03:22:39,415][117718] Starting process rollout_proc26
[2023-03-07 03:22:39,424][117718] Starting process rollout_proc27
[2023-03-07 03:22:39,434][117718] Starting process rollout_proc28
[2023-03-07 03:22:39,434][117718] Starting process rollout_proc29
[2023-03-07 03:22:39,436][117718] Starting process rollout_proc30
[2023-03-07 03:22:39,436][117718] Starting process rollout_proc31
[2023-03-07 03:22:41,018][117993] Using GPUs [0] for process 0 (actually maps to GPUs [0])
[2023-03-07 03:22:41,018][117993] Set environment var CUDA_VISIBLE_DEVICES to '0' (GPU indices [0]) for learning process 0
[2023-03-07 03:22:41,028][117993] Num visible devices: 1
[2023-03-07 03:22:41,073][117993] WARNING! It is generally recommended to enable Fixed KL loss (https://arxiv.org/pdf/1707.06347.pdf) for continuous action tasks to avoid potential numerical issues. I.e. set --kl_loss_coeff=0.1
[2023-03-07 03:22:41,073][117993] Starting seed is not provided
[2023-03-07 03:22:41,073][117993] Using GPUs [0] for process 0 (actually maps to GPUs [0])
[2023-03-07 03:22:41,073][117993] Initializing actor-critic model on device cuda:0
[2023-03-07 03:22:41,074][117993] RunningMeanStd input shape: (39,)
[2023-03-07 03:22:41,074][117993] RunningMeanStd input shape: (1,)
[2023-03-07 03:22:41,184][117993] Created Actor Critic model with architecture:
[2023-03-07 03:22:41,184][117993] ActorCriticSharedWeights(
  (obs_normalizer): ObservationNormalizer(
    (running_mean_std): RunningMeanStdDictInPlace(
      (running_mean_std): ModuleDict(
        (obs): RunningMeanStdInPlace()
      )
    )
  )
  (returns_normalizer): RecursiveScriptModule(original_name=RunningMeanStdInPlace)
  (encoder): MultiInputEncoder(
    (encoders): ModuleDict(
      (obs): MlpEncoder(
        (mlp_head): RecursiveScriptModule(
          original_name=Sequential
          (0): RecursiveScriptModule(original_name=Linear)
          (1): RecursiveScriptModule(original_name=ELU)
          (2): RecursiveScriptModule(original_name=Linear)
          (3): RecursiveScriptModule(original_name=ELU)
        )
      )
    )
  )
  (core): ModelCoreRNN(
    (core): GRU(512, 512)
  )
  (decoder): MlpDecoder(
    (mlp): Identity()
  )
  (critic_linear): Linear(in_features=512, out_features=1, bias=True)
  (action_parameterization): ActionParameterizationDefault(
    (distribution_linear): Linear(in_features=512, out_features=8, bias=True)
  )
)
[2023-03-07 03:22:41,206][118045] Worker 0 uses CPU cores [0]
[2023-03-07 03:22:41,314][118204] Worker 4 uses CPU cores [4]
[2023-03-07 03:22:41,408][118249] Worker 6 uses CPU cores [6]
[2023-03-07 03:22:41,595][118254] Worker 12 uses CPU cores [12]
[2023-03-07 03:22:41,602][118510] Worker 24 uses CPU cores [24]
[2023-03-07 03:22:41,799][118443] Worker 14 uses CPU cores [14]
[2023-03-07 03:22:41,858][118445] Worker 21 uses CPU cores [21]
[2023-03-07 03:22:42,087][118212] Worker 10 uses CPU cores [10]
[2023-03-07 03:22:42,167][118214] Worker 17 uses CPU cores [17]
[2023-03-07 03:22:42,193][118213] Worker 9 uses CPU cores [9]
[2023-03-07 03:22:42,411][118509] Worker 23 uses CPU cores [23]
[2023-03-07 03:22:42,532][118209] Worker 13 uses CPU cores [13]
[2023-03-07 03:22:42,635][118047] Worker 2 uses CPU cores [2]
[2023-03-07 03:22:42,754][118512] Worker 26 uses CPU cores [26]
[2023-03-07 03:22:42,769][117993] Using optimizer
[2023-03-07 03:22:42,769][117993] No checkpoints found
[2023-03-07 03:22:42,769][117993] Did not load from checkpoint, starting from scratch!
[2023-03-07 03:22:42,770][117993] Initialized policy 0 weights for model version 0
[2023-03-07 03:22:42,772][117993] Using GPUs [0] for process 0 (actually maps to GPUs [0])
[2023-03-07 03:22:42,775][117993] LearnerWorker_p0 finished initialization!
[2023-03-07 03:22:42,879][118546] Worker 29 uses CPU cores [29]
[2023-03-07 03:22:42,956][118508] Worker 22 uses CPU cores [22]
[2023-03-07 03:22:43,003][118216] Worker 16 uses CPU cores [16]
[2023-03-07 03:22:43,076][118046] Worker 1 uses CPU cores [1]
[2023-03-07 03:22:43,203][118545] Worker 28 uses CPU cores [28]
[2023-03-07 03:22:43,203][118210] Worker 18 uses CPU cores [18]
[2023-03-07 03:22:43,344][118048] Worker 3 uses CPU cores [3]
[2023-03-07 03:22:43,461][118205] Worker 5 uses CPU cores [5]
[2023-03-07 03:22:43,549][118513] Worker 27 uses CPU cores [27]
[2023-03-07 03:22:43,644][118044] Using GPUs [0] for process 0 (actually maps to GPUs [0])
[2023-03-07 03:22:43,655][118044] Set environment var CUDA_VISIBLE_DEVICES to '0' (GPU indices [0]) for inference process 0
[2023-03-07 03:22:43,664][118044] Num visible devices: 1
[2023-03-07 03:22:43,701][118511] Worker 25 uses CPU cores [25]
[2023-03-07 03:22:43,748][118044] RunningMeanStd input shape: (39,)
[2023-03-07 03:22:43,748][118044] RunningMeanStd input shape: (1,)
[2023-03-07 03:22:43,858][118444] Worker 8 uses CPU cores [8]
[2023-03-07 03:22:43,939][118207] Worker 15 uses CPU cores [15]
[2023-03-07 03:22:44,075][118640] Worker 31 uses CPU cores [31]
[2023-03-07 03:22:44,177][118248] Worker 19 uses CPU cores [19]
[2023-03-07 03:22:44,225][118641] Worker 30 uses CPU cores [30]
[2023-03-07 03:22:44,243][118208] Worker 7 uses CPU cores [7]
[2023-03-07 03:22:44,323][118206] Worker 11 uses CPU cores [11]
[2023-03-07 03:22:44,387][117718] Inference worker 0-0 is ready!
[2023-03-07 03:22:44,387][117718] All inference workers are ready! Signal rollout workers to start!
[2023-03-07 03:22:44,567][118296] Worker 20 uses CPU cores [20]
[2023-03-07 03:22:45,814][118204] Decorrelating experience for 0 frames...
[2023-03-07 03:22:45,868][118513] Decorrelating experience for 0 frames...
[2023-03-07 03:22:45,891][118509] Decorrelating experience for 0 frames...
[2023-03-07 03:22:45,902][118443] Decorrelating experience for 0 frames...
[2023-03-07 03:22:45,938][118254] Decorrelating experience for 0 frames...
[2023-03-07 03:22:45,978][118546] Decorrelating experience for 0 frames...
[2023-03-07 03:22:46,018][118048] Decorrelating experience for 0 frames...
[2023-03-07 03:22:46,075][118209] Decorrelating experience for 0 frames...
[2023-03-07 03:22:46,090][117718] Fps is (10 sec: nan, 60 sec: nan, 300 sec: nan). Total num frames: 0. Throughput: 0: nan. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2023-03-07 03:22:46,112][118444] Decorrelating experience for 0 frames...
[2023-03-07 03:22:46,112][118510] Decorrelating experience for 0 frames...
[2023-03-07 03:22:46,114][118445] Decorrelating experience for 0 frames...
[2023-03-07 03:22:46,115][118640] Decorrelating experience for 0 frames...
[2023-03-07 03:22:46,117][118205] Decorrelating experience for 0 frames...
[2023-03-07 03:22:46,123][118214] Decorrelating experience for 0 frames...
[2023-03-07 03:22:46,125][118212] Decorrelating experience for 0 frames...
[2023-03-07 03:22:46,126][118046] Decorrelating experience for 0 frames...
[2023-03-07 03:22:46,128][118512] Decorrelating experience for 0 frames...
[2023-03-07 03:22:46,134][118210] Decorrelating experience for 0 frames...
[2023-03-07 03:22:46,134][118508] Decorrelating experience for 0 frames...
[2023-03-07 03:22:46,134][118047] Decorrelating experience for 0 frames...
[2023-03-07 03:22:46,135][118249] Decorrelating experience for 0 frames...
[2023-03-07 03:22:46,136][118545] Decorrelating experience for 0 frames...
[2023-03-07 03:22:46,137][118213] Decorrelating experience for 0 frames...
[2023-03-07 03:22:46,139][118045] Decorrelating experience for 0 frames...
[2023-03-07 03:22:46,142][118216] Decorrelating experience for 0 frames...
[2023-03-07 03:22:46,145][118248] Decorrelating experience for 0 frames...
[2023-03-07 03:22:46,145][118511] Decorrelating experience for 0 frames...
[2023-03-07 03:22:46,172][118641] Decorrelating experience for 0 frames...
[2023-03-07 03:22:46,179][118207] Decorrelating experience for 0 frames...
[2023-03-07 03:22:46,185][118208] Decorrelating experience for 0 frames...
[2023-03-07 03:22:46,437][118206] Decorrelating experience for 0 frames...
[2023-03-07 03:22:46,622][118296] Decorrelating experience for 0 frames...
[2023-03-07 03:22:47,446][118204] Decorrelating experience for 32 frames...
[2023-03-07 03:22:47,539][118513] Decorrelating experience for 32 frames...
[2023-03-07 03:22:47,551][118509] Decorrelating experience for 32 frames...
[2023-03-07 03:22:47,565][118443] Decorrelating experience for 32 frames...
[2023-03-07 03:22:47,611][118254] Decorrelating experience for 32 frames...
[2023-03-07 03:22:47,641][118546] Decorrelating experience for 32 frames...
[2023-03-07 03:22:47,699][118048] Decorrelating experience for 32 frames...
[2023-03-07 03:22:47,704][118209] Decorrelating experience for 32 frames...
[2023-03-07 03:22:47,723][118208] Decorrelating experience for 32 frames...
[2023-03-07 03:22:47,726][118545] Decorrelating experience for 32 frames...
[2023-03-07 03:22:47,729][118641] Decorrelating experience for 32 frames...
[2023-03-07 03:22:47,770][118445] Decorrelating experience for 32 frames...
[2023-03-07 03:22:47,777][118248] Decorrelating experience for 32 frames...
[2023-03-07 03:22:47,783][118205] Decorrelating experience for 32 frames...
[2023-03-07 03:22:47,814][118640] Decorrelating experience for 32 frames...
[2023-03-07 03:22:47,821][118512] Decorrelating experience for 32 frames...
[2023-03-07 03:22:47,822][118212] Decorrelating experience for 32 frames...
[2023-03-07 03:22:47,825][118207] Decorrelating experience for 32 frames...
[2023-03-07 03:22:47,829][118444] Decorrelating experience for 32 frames...
[2023-03-07 03:22:47,829][118214] Decorrelating experience for 32 frames...
[2023-03-07 03:22:47,829][118046] Decorrelating experience for 32 frames...
[2023-03-07 03:22:47,830][118508] Decorrelating experience for 32 frames...
[2023-03-07 03:22:47,832][118510] Decorrelating experience for 32 frames...
[2023-03-07 03:22:47,832][118249] Decorrelating experience for 32 frames...
[2023-03-07 03:22:47,833][118047] Decorrelating experience for 32 frames...
[2023-03-07 03:22:47,833][118210] Decorrelating experience for 32 frames...
[2023-03-07 03:22:47,833][118213] Decorrelating experience for 32 frames...
[2023-03-07 03:22:47,836][118511] Decorrelating experience for 32 frames...
[2023-03-07 03:22:47,839][118216] Decorrelating experience for 32 frames...
[2023-03-07 03:22:47,840][118045] Decorrelating experience for 32 frames...
[2023-03-07 03:22:47,857][118206] Decorrelating experience for 32 frames...
[2023-03-07 03:22:48,032][118296] Decorrelating experience for 32 frames...
[2023-03-07 03:22:48,206][117993] Signal inference workers to stop experience collection...
[2023-03-07 03:22:48,210][118044] InferenceWorker_p0-w0: stopping experience collection
[2023-03-07 03:22:48,505][117993] Signal inference workers to resume experience collection...
[2023-03-07 03:22:48,506][118044] InferenceWorker_p0-w0: resuming experience collection
[2023-03-07 03:22:49,666][118044] Updated weights for policy 0, policy_version 10 (0.0214)
[2023-03-07 03:22:50,426][118044] Updated weights for policy 0, policy_version 20 (0.0005)
[2023-03-07 03:22:51,085][117718] Fps is (10 sec: 5943.8, 60 sec: 5943.8, 300 sec: 5943.8). Total num frames: 29696. Throughput: 0: 4372.2. Samples: 21844. Policy #0 lag: (min: 0.0, avg: 1.7, max: 3.0)
[2023-03-07 03:22:51,086][117718] Avg episode reward: [(0, '202.557')]
[2023-03-07 03:22:51,187][118044] Updated weights for policy 0, policy_version 30 (0.0007)
[2023-03-07 03:22:51,938][118044] Updated weights for policy 0, policy_version 40 (0.0006)
[2023-03-07 03:22:52,680][118044] Updated weights for policy 0, policy_version 50 (0.0006)
[2023-03-07 03:22:53,445][118044] Updated weights for policy 0, policy_version 60 (0.0006)
[2023-03-07 03:22:54,225][118044] Updated weights for policy 0, policy_version 70 (0.0005)
[2023-03-07 03:22:55,009][118044] Updated weights for policy 0, policy_version 80 (0.0006)
[2023-03-07 03:22:55,778][118044] Updated weights for policy 0, policy_version 90 (0.0006)
[2023-03-07 03:22:56,085][117718] Fps is (10 sec: 9526.9, 60 sec: 9526.9, 300 sec: 9526.9). Total num frames: 95232. Throughput: 0: 6186.7. Samples: 61843. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0)
[2023-03-07 03:22:56,086][117718] Avg episode reward: [(0, '363.075')]
[2023-03-07 03:22:56,558][118044] Updated weights for policy 0, policy_version 100 (0.0006)
[2023-03-07 03:22:57,328][118044] Updated weights for policy 0, policy_version 110 (0.0007)
[2023-03-07 03:22:58,119][118044] Updated weights for policy 0, policy_version 120 (0.0005)
[2023-03-07 03:22:58,896][118044] Updated weights for policy 0, policy_version 130 (0.0006)
[2023-03-07 03:22:59,003][117718] Heartbeat connected on Batcher_0
[2023-03-07 03:22:59,005][117718] Heartbeat connected on LearnerWorker_p0
[2023-03-07 03:22:59,013][117718] Heartbeat connected on InferenceWorker_p0-w0
[2023-03-07 03:22:59,015][117718] Heartbeat connected on RolloutWorker_w0
[2023-03-07 03:22:59,017][117718] Heartbeat connected on RolloutWorker_w1
[2023-03-07 03:22:59,021][117718] Heartbeat connected on RolloutWorker_w2
[2023-03-07 03:22:59,023][117718] Heartbeat connected on RolloutWorker_w3
[2023-03-07 03:22:59,026][117718] Heartbeat connected on RolloutWorker_w4
[2023-03-07 03:22:59,029][117718] Heartbeat connected on RolloutWorker_w5
[2023-03-07 03:22:59,030][117718] Heartbeat connected on RolloutWorker_w6
[2023-03-07 03:22:59,033][117718] Heartbeat connected on RolloutWorker_w7
[2023-03-07 03:22:59,034][117718] Heartbeat connected on RolloutWorker_w8
[2023-03-07 03:22:59,038][117718] Heartbeat connected on RolloutWorker_w10
[2023-03-07 03:22:59,039][117718] Heartbeat connected on RolloutWorker_w9
[2023-03-07 03:22:59,054][117718] Heartbeat connected on RolloutWorker_w11
[2023-03-07 03:22:59,056][117718] Heartbeat connected on RolloutWorker_w12
[2023-03-07 03:22:59,057][117718] Heartbeat connected on RolloutWorker_w13
[2023-03-07 03:22:59,059][117718] Heartbeat connected on RolloutWorker_w14
[2023-03-07 03:22:59,062][117718] Heartbeat connected on RolloutWorker_w15
[2023-03-07 03:22:59,064][117718] Heartbeat connected on RolloutWorker_w16
[2023-03-07 03:22:59,064][117718] Heartbeat connected on RolloutWorker_w17
[2023-03-07 03:22:59,067][117718] Heartbeat connected on RolloutWorker_w18
[2023-03-07 03:22:59,068][117718] Heartbeat connected on RolloutWorker_w19
[2023-03-07 03:22:59,069][117718] Heartbeat connected on RolloutWorker_w20
[2023-03-07 03:22:59,071][117718] Heartbeat connected on RolloutWorker_w21
[2023-03-07 03:22:59,073][117718] Heartbeat connected on RolloutWorker_w22
[2023-03-07 03:22:59,075][117718] Heartbeat connected on RolloutWorker_w23
[2023-03-07 03:22:59,079][117718] Heartbeat connected on RolloutWorker_w25
[2023-03-07 03:22:59,081][117718] Heartbeat connected on RolloutWorker_w26
[2023-03-07 03:22:59,082][117718] Heartbeat connected on RolloutWorker_w27
[2023-03-07 03:22:59,085][117718] Heartbeat connected on RolloutWorker_w28
[2023-03-07 03:22:59,085][117718] Heartbeat connected on RolloutWorker_w24
[2023-03-07 03:22:59,087][117718] Heartbeat connected on RolloutWorker_w29
[2023-03-07 03:22:59,088][117718] Heartbeat connected on RolloutWorker_w30
[2023-03-07 03:22:59,090][117718] Heartbeat connected on RolloutWorker_w31
[2023-03-07 03:22:59,658][118044] Updated weights for policy 0, policy_version 140 (0.0006)
[2023-03-07 03:23:00,419][118044] Updated weights for policy 0, policy_version 150 (0.0005)
[2023-03-07 03:23:01,086][117718] Fps is (10 sec: 13209.5, 60 sec: 10788.8, 300 sec: 10788.8). Total num frames: 161792. Throughput: 0: 9414.4. Samples: 141180. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0)
[2023-03-07 03:23:01,086][117718] Avg episode reward: [(0, '333.742')]
[2023-03-07 03:23:01,087][117993] Saving new best policy, reward=333.742!
[2023-03-07 03:23:01,212][118044] Updated weights for policy 0, policy_version 160 (0.0006)
[2023-03-07 03:23:01,979][118044] Updated weights for policy 0, policy_version 170 (0.0006)
[2023-03-07 03:23:02,741][118044] Updated weights for policy 0, policy_version 180 (0.0007)
[2023-03-07 03:23:03,533][118044] Updated weights for policy 0, policy_version 190 (0.0006)
[2023-03-07 03:23:04,289][118044] Updated weights for policy 0, policy_version 200 (0.0007)
[2023-03-07 03:23:05,046][118044] Updated weights for policy 0, policy_version 210 (0.0006)
[2023-03-07 03:23:05,821][118044] Updated weights for policy 0, policy_version 220 (0.0006)
[2023-03-07 03:23:06,086][117718] Fps is (10 sec: 13311.8, 60 sec: 11419.8, 300 sec: 11419.8). Total num frames: 228352. Throughput: 0: 11051.8. Samples: 220994. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0)
[2023-03-07 03:23:06,086][117718] Avg episode reward: [(0, '322.280')]
[2023-03-07 03:23:06,598][118044] Updated weights for policy 0, policy_version 230 (0.0006)
[2023-03-07 03:23:07,357][118044] Updated weights for policy 0, policy_version 240 (0.0006)
[2023-03-07 03:23:08,141][118044] Updated weights for policy 0, policy_version 250 (0.0006)
[2023-03-07 03:23:08,903][118044] Updated weights for policy 0, policy_version 260 (0.0005)
[2023-03-07 03:23:09,661][118044] Updated weights for policy 0, policy_version 270 (0.0006)
[2023-03-07 03:23:10,449][118044] Updated weights for policy 0, policy_version 280 (0.0006)
[2023-03-07 03:23:11,086][117718] Fps is (10 sec: 13312.0, 60 sec: 11798.2, 300 sec: 11798.2). Total num frames: 294912. Throughput: 0: 10424.3. Samples: 260568. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0)
[2023-03-07 03:23:11,086][117718] Avg episode reward: [(0, '472.276')]
[2023-03-07 03:23:11,087][117993] Saving new best policy, reward=472.276!
[2023-03-07 03:23:11,229][118044] Updated weights for policy 0, policy_version 290 (0.0006)
[2023-03-07 03:23:12,005][118044] Updated weights for policy 0, policy_version 300 (0.0006)
[2023-03-07 03:23:12,778][118044] Updated weights for policy 0, policy_version 310 (0.0006)
[2023-03-07 03:23:13,546][118044] Updated weights for policy 0, policy_version 320 (0.0006)
[2023-03-07 03:23:14,325][118044] Updated weights for policy 0, policy_version 330 (0.0006)
[2023-03-07 03:23:15,101][118044] Updated weights for policy 0, policy_version 340 (0.0006)
[2023-03-07 03:23:15,889][118044] Updated weights for policy 0, policy_version 350 (0.0007)
[2023-03-07 03:23:16,086][117718] Fps is (10 sec: 13209.6, 60 sec: 12016.5, 300 sec: 12016.5). Total num frames: 360448. Throughput: 0: 11339.8. Samples: 340150. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0)
[2023-03-07 03:23:16,086][117718] Avg episode reward: [(0, '659.541')]
[2023-03-07 03:23:16,090][117993] Saving new best policy, reward=659.541!
[2023-03-07 03:23:16,641][118044] Updated weights for policy 0, policy_version 360 (0.0007)
[2023-03-07 03:23:17,414][118044] Updated weights for policy 0, policy_version 370 (0.0007)
[2023-03-07 03:23:18,190][118044] Updated weights for policy 0, policy_version 380 (0.0006)
[2023-03-07 03:23:18,974][118044] Updated weights for policy 0, policy_version 390 (0.0006)
[2023-03-07 03:23:19,756][118044] Updated weights for policy 0, policy_version 400 (0.0006)
[2023-03-07 03:23:20,538][118044] Updated weights for policy 0, policy_version 410 (0.0005)
[2023-03-07 03:23:21,085][117718] Fps is (10 sec: 13209.8, 60 sec: 12201.6, 300 sec: 12201.6). Total num frames: 427008. Throughput: 0: 11974.1. Samples: 419046. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0)
[2023-03-07 03:23:21,086][117718] Avg episode reward: [(0, '837.105')]
[2023-03-07 03:23:21,087][117993] Saving new best policy, reward=837.105!
[2023-03-07 03:23:21,318][118044] Updated weights for policy 0, policy_version 420 (0.0007)
[2023-03-07 03:23:22,115][118044] Updated weights for policy 0, policy_version 430 (0.0006)
[2023-03-07 03:23:22,923][118044] Updated weights for policy 0, policy_version 440 (0.0006)
[2023-03-07 03:23:23,702][118044] Updated weights for policy 0, policy_version 450 (0.0006)
[2023-03-07 03:23:24,472][118044] Updated weights for policy 0, policy_version 460 (0.0007)
[2023-03-07 03:23:25,235][118044] Updated weights for policy 0, policy_version 470 (0.0006)
[2023-03-07 03:23:26,018][118044] Updated weights for policy 0, policy_version 480 (0.0006)
[2023-03-07 03:23:26,085][117718] Fps is (10 sec: 13107.2, 60 sec: 12289.2, 300 sec: 12289.2). Total num frames: 491520. Throughput: 0: 11454.7. Samples: 458145. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0)
[2023-03-07 03:23:26,087][117718] Avg episode reward: [(0, '1146.487')]
[2023-03-07 03:23:26,091][117993] Saving new best policy, reward=1146.487!
[2023-03-07 03:23:26,784][118044] Updated weights for policy 0, policy_version 490 (0.0006)
[2023-03-07 03:23:27,567][118044] Updated weights for policy 0, policy_version 500 (0.0006)
[2023-03-07 03:23:28,341][118044] Updated weights for policy 0, policy_version 510 (0.0006)
[2023-03-07 03:23:29,106][118044] Updated weights for policy 0, policy_version 520 (0.0006)
[2023-03-07 03:23:29,877][118044] Updated weights for policy 0, policy_version 530 (0.0006)
[2023-03-07 03:23:30,656][118044] Updated weights for policy 0, policy_version 540 (0.0006)
[2023-03-07 03:23:31,086][117718] Fps is (10 sec: 13107.0, 60 sec: 12402.8, 300 sec: 12402.8). Total num frames: 558080. Throughput: 0: 11946.1. Samples: 537528. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0)
[2023-03-07 03:23:31,086][117718] Avg episode reward: [(0, '1152.965')]
[2023-03-07 03:23:31,087][117993] Saving new best policy, reward=1152.965!
[2023-03-07 03:23:31,430][118044] Updated weights for policy 0, policy_version 550 (0.0006)
[2023-03-07 03:23:32,207][118044] Updated weights for policy 0, policy_version 560 (0.0007)
[2023-03-07 03:23:32,983][118044] Updated weights for policy 0, policy_version 570 (0.0005)
[2023-03-07 03:23:33,757][118044] Updated weights for policy 0, policy_version 580 (0.0006)
[2023-03-07 03:23:34,530][118044] Updated weights for policy 0, policy_version 590 (0.0006)
[2023-03-07 03:23:35,317][118044] Updated weights for policy 0, policy_version 600 (0.0006)
[2023-03-07 03:23:36,085][118044] Updated weights for policy 0, policy_version 610 (0.0006)
[2023-03-07 03:23:36,085][117718] Fps is (10 sec: 13312.0, 60 sec: 12493.8, 300 sec: 12493.8). Total num frames: 624640. Throughput: 0: 13219.5. Samples: 616721. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0)
[2023-03-07 03:23:36,086][117718] Avg episode reward: [(0, '1151.787')]
[2023-03-07 03:23:36,858][118044] Updated weights for policy 0, policy_version 620 (0.0006)
[2023-03-07 03:23:37,632][118044] Updated weights for policy 0, policy_version 630 (0.0006)
[2023-03-07 03:23:38,397][118044] Updated weights for policy 0, policy_version 640 (0.0006)
[2023-03-07 03:23:39,175][118044] Updated weights for policy 0, policy_version 650 (0.0007)
[2023-03-07 03:23:39,975][118044] Updated weights for policy 0, policy_version 660 (0.0006)
[2023-03-07 03:23:40,749][118044] Updated weights for policy 0, policy_version 670 (0.0006)
[2023-03-07 03:23:41,086][117718] Fps is (10 sec: 13209.6, 60 sec: 12549.5, 300 sec: 12549.5). Total num frames: 690176. Throughput: 0: 13214.3. Samples: 656487. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0)
[2023-03-07 03:23:41,086][117718] Avg episode reward: [(0, '1448.933')]
[2023-03-07 03:23:41,087][117993] Saving new best policy, reward=1448.933!
[2023-03-07 03:23:41,522][118044] Updated weights for policy 0, policy_version 680 (0.0007)
[2023-03-07 03:23:42,310][118044] Updated weights for policy 0, policy_version 690 (0.0006)
[2023-03-07 03:23:43,070][118044] Updated weights for policy 0, policy_version 700 (0.0006)
[2023-03-07 03:23:43,849][118044] Updated weights for policy 0, policy_version 710 (0.0007)
[2023-03-07 03:23:44,624][118044] Updated weights for policy 0, policy_version 720 (0.0006)
[2023-03-07 03:23:45,392][118044] Updated weights for policy 0, policy_version 730 (0.0006)
[2023-03-07 03:23:46,086][117718] Fps is (10 sec: 13107.0, 60 sec: 12596.0, 300 sec: 12596.0). Total num frames: 755712. Throughput: 0: 13205.1. Samples: 735409. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0)
[2023-03-07 03:23:46,086][117718] Avg episode reward: [(0, '1778.182')]
[2023-03-07 03:23:46,101][117993] Saving new best policy, reward=1778.182!
[2023-03-07 03:23:46,176][118044] Updated weights for policy 0, policy_version 740 (0.0006)
[2023-03-07 03:23:46,968][118044] Updated weights for policy 0, policy_version 750 (0.0005)
[2023-03-07 03:23:47,747][118044] Updated weights for policy 0, policy_version 760 (0.0006)
[2023-03-07 03:23:48,509][118044] Updated weights for policy 0, policy_version 770 (0.0006)
[2023-03-07 03:23:49,300][118044] Updated weights for policy 0, policy_version 780 (0.0006)
[2023-03-07 03:23:50,082][118044] Updated weights for policy 0, policy_version 790 (0.0006)
[2023-03-07 03:23:50,846][118044] Updated weights for policy 0, policy_version 800 (0.0006)
[2023-03-07 03:23:51,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13209.6, 300 sec: 12651.1). Total num frames: 822272. Throughput: 0: 13184.1. Samples: 814279. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0)
[2023-03-07 03:23:51,086][117718] Avg episode reward: [(0, '1815.817')]
[2023-03-07 03:23:51,086][117993] Saving new best policy, reward=1815.817!
[2023-03-07 03:23:51,615][118044] Updated weights for policy 0, policy_version 810 (0.0007)
[2023-03-07 03:23:52,410][118044] Updated weights for policy 0, policy_version 820 (0.0005)
[2023-03-07 03:23:53,181][118044] Updated weights for policy 0, policy_version 830 (0.0006)
[2023-03-07 03:23:53,958][118044] Updated weights for policy 0, policy_version 840 (0.0006)
[2023-03-07 03:23:54,728][118044] Updated weights for policy 0, policy_version 850 (0.0006)
[2023-03-07 03:23:55,515][118044] Updated weights for policy 0, policy_version 860 (0.0006)
[2023-03-07 03:23:56,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13209.6, 300 sec: 12683.6). Total num frames: 887808. Throughput: 0: 13185.4. Samples: 853913. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0)
[2023-03-07 03:23:56,086][117718] Avg episode reward: [(0, '1369.442')]
[2023-03-07 03:23:56,298][118044] Updated weights for policy 0, policy_version 870 (0.0006)
[2023-03-07 03:23:57,082][118044] Updated weights for policy 0, policy_version 880 (0.0006)
[2023-03-07 03:23:57,849][118044] Updated weights for policy 0, policy_version 890 (0.0006)
[2023-03-07 03:23:58,624][118044] Updated weights for policy 0, policy_version 900 (0.0006)
[2023-03-07 03:23:59,410][118044] Updated weights for policy 0, policy_version 910 (0.0006)
[2023-03-07 03:24:00,189][118044] Updated weights for policy 0, policy_version 920 (0.0006)
[2023-03-07 03:24:00,964][118044] Updated weights for policy 0, policy_version 930 (0.0006)
[2023-03-07 03:24:01,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13192.6, 300 sec: 12711.9). Total num frames: 953344. Throughput: 0: 13169.9. Samples: 932793. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0)
[2023-03-07 03:24:01,086][117718] Avg episode reward: [(0, '1847.687')]
[2023-03-07 03:24:01,087][117993] Saving new best policy, reward=1847.687!
[2023-03-07 03:24:01,756][118044] Updated weights for policy 0, policy_version 940 (0.0006)
[2023-03-07 03:24:02,534][118044] Updated weights for policy 0, policy_version 950 (0.0006)
[2023-03-07 03:24:03,294][118044] Updated weights for policy 0, policy_version 960 (0.0005)
[2023-03-07 03:24:04,085][118044] Updated weights for policy 0, policy_version 970 (0.0005)
[2023-03-07 03:24:04,867][118044] Updated weights for policy 0, policy_version 980 (0.0006)
[2023-03-07 03:24:05,641][118044] Updated weights for policy 0, policy_version 990 (0.0006)
[2023-03-07 03:24:06,086][117718] Fps is (10 sec: 13107.3, 60 sec: 13175.5, 300 sec: 12736.6). Total num frames: 1018880. Throughput: 0: 13173.3. Samples: 1011845. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0)
[2023-03-07 03:24:06,086][117718] Avg episode reward: [(0, '1546.811')]
[2023-03-07 03:24:06,423][118044] Updated weights for policy 0, policy_version 1000 (0.0007)
[2023-03-07 03:24:07,194][118044] Updated weights for policy 0, policy_version 1010 (0.0007)
[2023-03-07 03:24:07,964][118044] Updated weights for policy 0, policy_version 1020 (0.0006)
[2023-03-07 03:24:08,734][118044] Updated weights for policy 0, policy_version 1030 (0.0006)
[2023-03-07 03:24:09,509][118044] Updated weights for policy 0, policy_version 1040 (0.0006)
[2023-03-07 03:24:10,293][118044] Updated weights for policy 0, policy_version 1050 (0.0006)
[2023-03-07 03:24:11,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 12770.5). Total num frames: 1085440. Throughput: 0: 13184.3. Samples: 1051437. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0)
[2023-03-07 03:24:11,086][117718] Avg episode reward: [(0, '2468.842')]
[2023-03-07 03:24:11,086][117993] Saving new best policy, reward=2468.842!
[2023-03-07 03:24:11,087][118044] Updated weights for policy 0, policy_version 1060 (0.0006) [2023-03-07 03:24:11,855][118044] Updated weights for policy 0, policy_version 1070 (0.0006) [2023-03-07 03:24:12,633][118044] Updated weights for policy 0, policy_version 1080 (0.0006) [2023-03-07 03:24:13,420][118044] Updated weights for policy 0, policy_version 1090 (0.0007) [2023-03-07 03:24:14,204][118044] Updated weights for policy 0, policy_version 1100 (0.0006) [2023-03-07 03:24:14,963][118044] Updated weights for policy 0, policy_version 1110 (0.0006) [2023-03-07 03:24:15,736][118044] Updated weights for policy 0, policy_version 1120 (0.0006) [2023-03-07 03:24:16,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 12789.2). Total num frames: 1150976. Throughput: 0: 13173.8. Samples: 1130350. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:24:16,086][117718] Avg episode reward: [(0, '2463.886')] [2023-03-07 03:24:16,529][118044] Updated weights for policy 0, policy_version 1130 (0.0006) [2023-03-07 03:24:17,301][118044] Updated weights for policy 0, policy_version 1140 (0.0006) [2023-03-07 03:24:18,070][118044] Updated weights for policy 0, policy_version 1150 (0.0006) [2023-03-07 03:24:18,863][118044] Updated weights for policy 0, policy_version 1160 (0.0006) [2023-03-07 03:24:19,642][118044] Updated weights for policy 0, policy_version 1170 (0.0005) [2023-03-07 03:24:20,410][118044] Updated weights for policy 0, policy_version 1180 (0.0007) [2023-03-07 03:24:21,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 12805.9). Total num frames: 1216512. Throughput: 0: 13168.4. Samples: 1209297. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:24:21,086][117718] Avg episode reward: [(0, '2594.791')] [2023-03-07 03:24:21,087][117993] Saving new best policy, reward=2594.791! 
[2023-03-07 03:24:21,195][118044] Updated weights for policy 0, policy_version 1190 (0.0006) [2023-03-07 03:24:21,994][118044] Updated weights for policy 0, policy_version 1200 (0.0007) [2023-03-07 03:24:22,752][118044] Updated weights for policy 0, policy_version 1210 (0.0006) [2023-03-07 03:24:23,538][118044] Updated weights for policy 0, policy_version 1220 (0.0006) [2023-03-07 03:24:24,317][118044] Updated weights for policy 0, policy_version 1230 (0.0006) [2023-03-07 03:24:25,096][118044] Updated weights for policy 0, policy_version 1240 (0.0007) [2023-03-07 03:24:25,862][118044] Updated weights for policy 0, policy_version 1250 (0.0007) [2023-03-07 03:24:26,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13175.4, 300 sec: 12821.0). Total num frames: 1282048. Throughput: 0: 13158.1. Samples: 1248603. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:24:26,086][117718] Avg episode reward: [(0, '2477.902')] [2023-03-07 03:24:26,632][118044] Updated weights for policy 0, policy_version 1260 (0.0006) [2023-03-07 03:24:27,398][118044] Updated weights for policy 0, policy_version 1270 (0.0006) [2023-03-07 03:24:28,170][118044] Updated weights for policy 0, policy_version 1280 (0.0006) [2023-03-07 03:24:28,966][118044] Updated weights for policy 0, policy_version 1290 (0.0006) [2023-03-07 03:24:29,728][118044] Updated weights for policy 0, policy_version 1300 (0.0006) [2023-03-07 03:24:30,481][118044] Updated weights for policy 0, policy_version 1310 (0.0005) [2023-03-07 03:24:31,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 12844.4). Total num frames: 1348608. Throughput: 0: 13168.7. Samples: 1328000. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:24:31,086][117718] Avg episode reward: [(0, '2528.077')] [2023-03-07 03:24:31,278][118044] Updated weights for policy 0, policy_version 1320 (0.0006) [2023-03-07 03:24:32,048][118044] Updated weights for policy 0, policy_version 1330 (0.0006) [2023-03-07 03:24:32,828][118044] Updated weights for policy 0, policy_version 1340 (0.0005) [2023-03-07 03:24:33,610][118044] Updated weights for policy 0, policy_version 1350 (0.0006) [2023-03-07 03:24:34,377][118044] Updated weights for policy 0, policy_version 1360 (0.0006) [2023-03-07 03:24:35,161][118044] Updated weights for policy 0, policy_version 1370 (0.0006) [2023-03-07 03:24:35,936][118044] Updated weights for policy 0, policy_version 1380 (0.0006) [2023-03-07 03:24:36,085][117718] Fps is (10 sec: 13209.8, 60 sec: 13158.4, 300 sec: 12856.3). Total num frames: 1414144. Throughput: 0: 13174.1. Samples: 1407116. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:24:36,086][117718] Avg episode reward: [(0, '2829.049')] [2023-03-07 03:24:36,090][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000001382_1415168.pth... [2023-03-07 03:24:36,120][117993] Saving new best policy, reward=2829.049! [2023-03-07 03:24:36,717][118044] Updated weights for policy 0, policy_version 1390 (0.0006) [2023-03-07 03:24:37,506][118044] Updated weights for policy 0, policy_version 1400 (0.0006) [2023-03-07 03:24:38,259][118044] Updated weights for policy 0, policy_version 1410 (0.0006) [2023-03-07 03:24:39,047][118044] Updated weights for policy 0, policy_version 1420 (0.0007) [2023-03-07 03:24:39,826][118044] Updated weights for policy 0, policy_version 1430 (0.0006) [2023-03-07 03:24:40,598][118044] Updated weights for policy 0, policy_version 1440 (0.0006) [2023-03-07 03:24:41,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 12876.1). Total num frames: 1480704. Throughput: 0: 13171.8. 
Samples: 1446642. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:24:41,086][117718] Avg episode reward: [(0, '3041.397')] [2023-03-07 03:24:41,086][117993] Saving new best policy, reward=3041.397! [2023-03-07 03:24:41,370][118044] Updated weights for policy 0, policy_version 1450 (0.0006) [2023-03-07 03:24:42,147][118044] Updated weights for policy 0, policy_version 1460 (0.0006) [2023-03-07 03:24:42,930][118044] Updated weights for policy 0, policy_version 1470 (0.0007) [2023-03-07 03:24:43,710][118044] Updated weights for policy 0, policy_version 1480 (0.0006) [2023-03-07 03:24:44,483][118044] Updated weights for policy 0, policy_version 1490 (0.0007) [2023-03-07 03:24:45,269][118044] Updated weights for policy 0, policy_version 1500 (0.0006) [2023-03-07 03:24:46,022][118044] Updated weights for policy 0, policy_version 1510 (0.0006) [2023-03-07 03:24:46,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 12885.7). Total num frames: 1546240. Throughput: 0: 13179.2. Samples: 1525859. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:24:46,086][117718] Avg episode reward: [(0, '2974.441')] [2023-03-07 03:24:46,801][118044] Updated weights for policy 0, policy_version 1520 (0.0006) [2023-03-07 03:24:47,601][118044] Updated weights for policy 0, policy_version 1530 (0.0007) [2023-03-07 03:24:48,385][118044] Updated weights for policy 0, policy_version 1540 (0.0006) [2023-03-07 03:24:49,152][118044] Updated weights for policy 0, policy_version 1550 (0.0006) [2023-03-07 03:24:49,941][118044] Updated weights for policy 0, policy_version 1560 (0.0006) [2023-03-07 03:24:50,715][118044] Updated weights for policy 0, policy_version 1570 (0.0006) [2023-03-07 03:24:51,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 12894.6). Total num frames: 1611776. Throughput: 0: 13175.4. Samples: 1604736. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 03:24:51,086][117718] Avg episode reward: [(0, '3103.363')] [2023-03-07 03:24:51,086][117993] Saving new best policy, reward=3103.363! [2023-03-07 03:24:51,494][118044] Updated weights for policy 0, policy_version 1580 (0.0006) [2023-03-07 03:24:52,295][118044] Updated weights for policy 0, policy_version 1590 (0.0006) [2023-03-07 03:24:53,054][118044] Updated weights for policy 0, policy_version 1600 (0.0006) [2023-03-07 03:24:53,841][118044] Updated weights for policy 0, policy_version 1610 (0.0006) [2023-03-07 03:24:54,627][118044] Updated weights for policy 0, policy_version 1620 (0.0007) [2023-03-07 03:24:55,411][118044] Updated weights for policy 0, policy_version 1630 (0.0006) [2023-03-07 03:24:56,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 12902.8). Total num frames: 1677312. Throughput: 0: 13167.0. Samples: 1643956. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:24:56,097][117718] Avg episode reward: [(0, '3021.207')] [2023-03-07 03:24:56,200][118044] Updated weights for policy 0, policy_version 1640 (0.0007) [2023-03-07 03:24:56,960][118044] Updated weights for policy 0, policy_version 1650 (0.0007) [2023-03-07 03:24:57,724][118044] Updated weights for policy 0, policy_version 1660 (0.0006) [2023-03-07 03:24:58,498][118044] Updated weights for policy 0, policy_version 1670 (0.0006) [2023-03-07 03:24:59,276][118044] Updated weights for policy 0, policy_version 1680 (0.0006) [2023-03-07 03:25:00,054][118044] Updated weights for policy 0, policy_version 1690 (0.0006) [2023-03-07 03:25:00,820][118044] Updated weights for policy 0, policy_version 1700 (0.0007) [2023-03-07 03:25:01,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 12910.4). Total num frames: 1742848. Throughput: 0: 13170.3. Samples: 1723014. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:25:01,096][117718] Avg episode reward: [(0, '3174.085')] [2023-03-07 03:25:01,097][117993] Saving new best policy, reward=3174.085! [2023-03-07 03:25:01,599][118044] Updated weights for policy 0, policy_version 1710 (0.0006) [2023-03-07 03:25:02,384][118044] Updated weights for policy 0, policy_version 1720 (0.0006) [2023-03-07 03:25:03,145][118044] Updated weights for policy 0, policy_version 1730 (0.0006) [2023-03-07 03:25:03,928][118044] Updated weights for policy 0, policy_version 1740 (0.0006) [2023-03-07 03:25:04,694][118044] Updated weights for policy 0, policy_version 1750 (0.0006) [2023-03-07 03:25:05,484][118044] Updated weights for policy 0, policy_version 1760 (0.0007) [2023-03-07 03:25:06,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 12924.7). Total num frames: 1809408. Throughput: 0: 13175.5. Samples: 1802198. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:25:06,086][117718] Avg episode reward: [(0, '3169.805')] [2023-03-07 03:25:06,261][118044] Updated weights for policy 0, policy_version 1770 (0.0006) [2023-03-07 03:25:07,034][118044] Updated weights for policy 0, policy_version 1780 (0.0006) [2023-03-07 03:25:07,821][118044] Updated weights for policy 0, policy_version 1790 (0.0006) [2023-03-07 03:25:08,608][118044] Updated weights for policy 0, policy_version 1800 (0.0006) [2023-03-07 03:25:09,396][118044] Updated weights for policy 0, policy_version 1810 (0.0006) [2023-03-07 03:25:10,169][118044] Updated weights for policy 0, policy_version 1820 (0.0006) [2023-03-07 03:25:10,956][118044] Updated weights for policy 0, policy_version 1830 (0.0006) [2023-03-07 03:25:11,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 12931.0). Total num frames: 1874944. Throughput: 0: 13178.2. Samples: 1841621. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:25:11,086][117718] Avg episode reward: [(0, '2855.656')] [2023-03-07 03:25:11,734][118044] Updated weights for policy 0, policy_version 1840 (0.0005) [2023-03-07 03:25:12,497][118044] Updated weights for policy 0, policy_version 1850 (0.0007) [2023-03-07 03:25:13,258][118044] Updated weights for policy 0, policy_version 1860 (0.0006) [2023-03-07 03:25:14,040][118044] Updated weights for policy 0, policy_version 1870 (0.0006) [2023-03-07 03:25:14,819][118044] Updated weights for policy 0, policy_version 1880 (0.0006) [2023-03-07 03:25:15,580][118044] Updated weights for policy 0, policy_version 1890 (0.0006) [2023-03-07 03:25:16,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 12943.7). Total num frames: 1941504. Throughput: 0: 13172.7. Samples: 1920771. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:25:16,086][117718] Avg episode reward: [(0, '3059.445')] [2023-03-07 03:25:16,359][118044] Updated weights for policy 0, policy_version 1900 (0.0006) [2023-03-07 03:25:17,141][118044] Updated weights for policy 0, policy_version 1910 (0.0006) [2023-03-07 03:25:17,912][118044] Updated weights for policy 0, policy_version 1920 (0.0008) [2023-03-07 03:25:18,693][118044] Updated weights for policy 0, policy_version 1930 (0.0006) [2023-03-07 03:25:19,483][118044] Updated weights for policy 0, policy_version 1940 (0.0006) [2023-03-07 03:25:20,252][118044] Updated weights for policy 0, policy_version 1950 (0.0006) [2023-03-07 03:25:21,030][118044] Updated weights for policy 0, policy_version 1960 (0.0006) [2023-03-07 03:25:21,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 12949.0). Total num frames: 2007040. Throughput: 0: 13170.1. Samples: 1999769. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:25:21,086][117718] Avg episode reward: [(0, '3146.461')] [2023-03-07 03:25:21,830][118044] Updated weights for policy 0, policy_version 1970 (0.0007) [2023-03-07 03:25:22,606][118044] Updated weights for policy 0, policy_version 1980 (0.0006) [2023-03-07 03:25:23,370][118044] Updated weights for policy 0, policy_version 1990 (0.0006) [2023-03-07 03:25:24,155][118044] Updated weights for policy 0, policy_version 2000 (0.0006) [2023-03-07 03:25:24,938][118044] Updated weights for policy 0, policy_version 2010 (0.0006) [2023-03-07 03:25:25,716][118044] Updated weights for policy 0, policy_version 2020 (0.0006) [2023-03-07 03:25:26,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13175.5, 300 sec: 12953.9). Total num frames: 2072576. Throughput: 0: 13165.8. Samples: 2039106. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:25:26,086][117718] Avg episode reward: [(0, '3104.389')] [2023-03-07 03:25:26,487][118044] Updated weights for policy 0, policy_version 2030 (0.0006) [2023-03-07 03:25:27,269][118044] Updated weights for policy 0, policy_version 2040 (0.0006) [2023-03-07 03:25:28,051][118044] Updated weights for policy 0, policy_version 2050 (0.0006) [2023-03-07 03:25:28,825][118044] Updated weights for policy 0, policy_version 2060 (0.0006) [2023-03-07 03:25:29,586][118044] Updated weights for policy 0, policy_version 2070 (0.0006) [2023-03-07 03:25:30,368][118044] Updated weights for policy 0, policy_version 2080 (0.0006) [2023-03-07 03:25:31,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 12964.8). Total num frames: 2139136. Throughput: 0: 13165.4. Samples: 2118304. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:25:31,086][117718] Avg episode reward: [(0, '3079.275')] [2023-03-07 03:25:31,141][118044] Updated weights for policy 0, policy_version 2090 (0.0006) [2023-03-07 03:25:31,926][118044] Updated weights for policy 0, policy_version 2100 (0.0006) [2023-03-07 03:25:32,684][118044] Updated weights for policy 0, policy_version 2110 (0.0007) [2023-03-07 03:25:33,469][118044] Updated weights for policy 0, policy_version 2120 (0.0006) [2023-03-07 03:25:34,269][118044] Updated weights for policy 0, policy_version 2130 (0.0006) [2023-03-07 03:25:35,030][118044] Updated weights for policy 0, policy_version 2140 (0.0006) [2023-03-07 03:25:35,821][118044] Updated weights for policy 0, policy_version 2150 (0.0007) [2023-03-07 03:25:36,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 12969.0). Total num frames: 2204672. Throughput: 0: 13166.4. Samples: 2197222. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:25:36,086][117718] Avg episode reward: [(0, '3210.559')] [2023-03-07 03:25:36,089][117993] Saving new best policy, reward=3210.559! [2023-03-07 03:25:36,607][118044] Updated weights for policy 0, policy_version 2160 (0.0008) [2023-03-07 03:25:37,404][118044] Updated weights for policy 0, policy_version 2170 (0.0006) [2023-03-07 03:25:38,173][118044] Updated weights for policy 0, policy_version 2180 (0.0006) [2023-03-07 03:25:38,972][118044] Updated weights for policy 0, policy_version 2190 (0.0007) [2023-03-07 03:25:39,730][118044] Updated weights for policy 0, policy_version 2200 (0.0006) [2023-03-07 03:25:40,501][118044] Updated weights for policy 0, policy_version 2210 (0.0007) [2023-03-07 03:25:41,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 12972.9). Total num frames: 2270208. Throughput: 0: 13166.3. Samples: 2236439. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:25:41,086][117718] Avg episode reward: [(0, '3167.678')] [2023-03-07 03:25:41,294][118044] Updated weights for policy 0, policy_version 2220 (0.0005) [2023-03-07 03:25:42,072][118044] Updated weights for policy 0, policy_version 2230 (0.0006) [2023-03-07 03:25:42,846][118044] Updated weights for policy 0, policy_version 2240 (0.0006) [2023-03-07 03:25:43,624][118044] Updated weights for policy 0, policy_version 2250 (0.0006) [2023-03-07 03:25:44,401][118044] Updated weights for policy 0, policy_version 2260 (0.0006) [2023-03-07 03:25:45,172][118044] Updated weights for policy 0, policy_version 2270 (0.0006) [2023-03-07 03:25:45,958][118044] Updated weights for policy 0, policy_version 2280 (0.0006) [2023-03-07 03:25:46,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 12976.6). Total num frames: 2335744. Throughput: 0: 13165.7. Samples: 2315470. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:25:46,086][117718] Avg episode reward: [(0, '3251.622')] [2023-03-07 03:25:46,090][117993] Saving new best policy, reward=3251.622! [2023-03-07 03:25:46,758][118044] Updated weights for policy 0, policy_version 2290 (0.0006) [2023-03-07 03:25:47,534][118044] Updated weights for policy 0, policy_version 2300 (0.0006) [2023-03-07 03:25:48,318][118044] Updated weights for policy 0, policy_version 2310 (0.0006) [2023-03-07 03:25:49,090][118044] Updated weights for policy 0, policy_version 2320 (0.0006) [2023-03-07 03:25:49,878][118044] Updated weights for policy 0, policy_version 2330 (0.0006) [2023-03-07 03:25:50,645][118044] Updated weights for policy 0, policy_version 2340 (0.0006) [2023-03-07 03:25:51,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 12980.2). Total num frames: 2401280. Throughput: 0: 13154.4. Samples: 2394143. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:25:51,086][117718] Avg episode reward: [(0, '3282.577')] [2023-03-07 03:25:51,087][117993] Saving new best policy, reward=3282.577! [2023-03-07 03:25:51,421][118044] Updated weights for policy 0, policy_version 2350 (0.0006) [2023-03-07 03:25:52,221][118044] Updated weights for policy 0, policy_version 2360 (0.0006) [2023-03-07 03:25:52,987][118044] Updated weights for policy 0, policy_version 2370 (0.0006) [2023-03-07 03:25:53,763][118044] Updated weights for policy 0, policy_version 2380 (0.0007) [2023-03-07 03:25:54,546][118044] Updated weights for policy 0, policy_version 2390 (0.0007) [2023-03-07 03:25:55,331][118044] Updated weights for policy 0, policy_version 2400 (0.0007) [2023-03-07 03:25:56,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 12983.5). Total num frames: 2466816. Throughput: 0: 13151.8. Samples: 2433451. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:25:56,086][117718] Avg episode reward: [(0, '3281.545')] [2023-03-07 03:25:56,088][118044] Updated weights for policy 0, policy_version 2410 (0.0006) [2023-03-07 03:25:56,842][118044] Updated weights for policy 0, policy_version 2420 (0.0007) [2023-03-07 03:25:57,642][118044] Updated weights for policy 0, policy_version 2430 (0.0006) [2023-03-07 03:25:58,406][118044] Updated weights for policy 0, policy_version 2440 (0.0006) [2023-03-07 03:25:59,177][118044] Updated weights for policy 0, policy_version 2450 (0.0007) [2023-03-07 03:25:59,951][118044] Updated weights for policy 0, policy_version 2460 (0.0006) [2023-03-07 03:26:00,733][118044] Updated weights for policy 0, policy_version 2470 (0.0006) [2023-03-07 03:26:01,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13175.5, 300 sec: 12991.9). Total num frames: 2533376. Throughput: 0: 13153.8. Samples: 2512691. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:26:01,086][117718] Avg episode reward: [(0, '3174.099')] [2023-03-07 03:26:01,517][118044] Updated weights for policy 0, policy_version 2480 (0.0006) [2023-03-07 03:26:02,296][118044] Updated weights for policy 0, policy_version 2490 (0.0006) [2023-03-07 03:26:03,071][118044] Updated weights for policy 0, policy_version 2500 (0.0006) [2023-03-07 03:26:03,857][118044] Updated weights for policy 0, policy_version 2510 (0.0006) [2023-03-07 03:26:04,639][118044] Updated weights for policy 0, policy_version 2520 (0.0007) [2023-03-07 03:26:05,420][118044] Updated weights for policy 0, policy_version 2530 (0.0006) [2023-03-07 03:26:06,085][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 12994.8). Total num frames: 2598912. Throughput: 0: 13155.0. Samples: 2591744. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:26:06,086][117718] Avg episode reward: [(0, '3215.400')] [2023-03-07 03:26:06,197][118044] Updated weights for policy 0, policy_version 2540 (0.0007) [2023-03-07 03:26:06,988][118044] Updated weights for policy 0, policy_version 2550 (0.0007) [2023-03-07 03:26:07,767][118044] Updated weights for policy 0, policy_version 2560 (0.0006) [2023-03-07 03:26:08,533][118044] Updated weights for policy 0, policy_version 2570 (0.0007) [2023-03-07 03:26:09,311][118044] Updated weights for policy 0, policy_version 2580 (0.0006) [2023-03-07 03:26:10,092][118044] Updated weights for policy 0, policy_version 2590 (0.0006) [2023-03-07 03:26:10,871][118044] Updated weights for policy 0, policy_version 2600 (0.0007) [2023-03-07 03:26:11,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 12997.6). Total num frames: 2664448. Throughput: 0: 13154.9. Samples: 2631076. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:26:11,086][117718] Avg episode reward: [(0, '3290.147')] [2023-03-07 03:26:11,096][117993] Saving new best policy, reward=3290.147! 
[2023-03-07 03:26:11,640][118044] Updated weights for policy 0, policy_version 2610 (0.0006) [2023-03-07 03:26:12,395][118044] Updated weights for policy 0, policy_version 2620 (0.0006) [2023-03-07 03:26:13,184][118044] Updated weights for policy 0, policy_version 2630 (0.0006) [2023-03-07 03:26:13,957][118044] Updated weights for policy 0, policy_version 2640 (0.0006) [2023-03-07 03:26:14,759][118044] Updated weights for policy 0, policy_version 2650 (0.0007) [2023-03-07 03:26:15,539][118044] Updated weights for policy 0, policy_version 2660 (0.0006) [2023-03-07 03:26:16,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13005.0). Total num frames: 2731008. Throughput: 0: 13152.5. Samples: 2710165. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:26:16,086][117718] Avg episode reward: [(0, '3335.614')] [2023-03-07 03:26:16,090][117993] Saving new best policy, reward=3335.614! [2023-03-07 03:26:16,322][118044] Updated weights for policy 0, policy_version 2670 (0.0006) [2023-03-07 03:26:17,092][118044] Updated weights for policy 0, policy_version 2680 (0.0007) [2023-03-07 03:26:17,874][118044] Updated weights for policy 0, policy_version 2690 (0.0006) [2023-03-07 03:26:18,667][118044] Updated weights for policy 0, policy_version 2700 (0.0006) [2023-03-07 03:26:19,437][118044] Updated weights for policy 0, policy_version 2710 (0.0005) [2023-03-07 03:26:20,212][118044] Updated weights for policy 0, policy_version 2720 (0.0006) [2023-03-07 03:26:20,992][118044] Updated weights for policy 0, policy_version 2730 (0.0008) [2023-03-07 03:26:21,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13007.4). Total num frames: 2796544. Throughput: 0: 13147.7. Samples: 2788869. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:26:21,086][117718] Avg episode reward: [(0, '3232.725')] [2023-03-07 03:26:21,764][118044] Updated weights for policy 0, policy_version 2740 (0.0005) [2023-03-07 03:26:22,546][118044] Updated weights for policy 0, policy_version 2750 (0.0006) [2023-03-07 03:26:23,329][118044] Updated weights for policy 0, policy_version 2760 (0.0007) [2023-03-07 03:26:24,077][118044] Updated weights for policy 0, policy_version 2770 (0.0006) [2023-03-07 03:26:24,861][118044] Updated weights for policy 0, policy_version 2780 (0.0006) [2023-03-07 03:26:25,638][118044] Updated weights for policy 0, policy_version 2790 (0.0006) [2023-03-07 03:26:26,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13009.7). Total num frames: 2862080. Throughput: 0: 13156.1. Samples: 2828466. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:26:26,086][117718] Avg episode reward: [(0, '2951.100')] [2023-03-07 03:26:26,408][118044] Updated weights for policy 0, policy_version 2800 (0.0006) [2023-03-07 03:26:27,189][118044] Updated weights for policy 0, policy_version 2810 (0.0006) [2023-03-07 03:26:27,961][118044] Updated weights for policy 0, policy_version 2820 (0.0007) [2023-03-07 03:26:28,755][118044] Updated weights for policy 0, policy_version 2830 (0.0006) [2023-03-07 03:26:29,540][118044] Updated weights for policy 0, policy_version 2840 (0.0006) [2023-03-07 03:26:30,319][118044] Updated weights for policy 0, policy_version 2850 (0.0006) [2023-03-07 03:26:31,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13141.3, 300 sec: 13011.8). Total num frames: 2927616. Throughput: 0: 13155.1. Samples: 2907450. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:26:31,097][117718] Avg episode reward: [(0, '3090.617')] [2023-03-07 03:26:31,108][118044] Updated weights for policy 0, policy_version 2860 (0.0006) [2023-03-07 03:26:31,895][118044] Updated weights for policy 0, policy_version 2870 (0.0006) [2023-03-07 03:26:32,655][118044] Updated weights for policy 0, policy_version 2880 (0.0005) [2023-03-07 03:26:33,434][118044] Updated weights for policy 0, policy_version 2890 (0.0006) [2023-03-07 03:26:34,227][118044] Updated weights for policy 0, policy_version 2900 (0.0005) [2023-03-07 03:26:34,994][118044] Updated weights for policy 0, policy_version 2910 (0.0006) [2023-03-07 03:26:35,795][118044] Updated weights for policy 0, policy_version 2920 (0.0006) [2023-03-07 03:26:36,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13013.9). Total num frames: 2993152. Throughput: 0: 13152.4. Samples: 2986002. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:26:36,096][117718] Avg episode reward: [(0, '3094.851')] [2023-03-07 03:26:36,110][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000002924_2994176.pth... [2023-03-07 03:26:36,583][118044] Updated weights for policy 0, policy_version 2930 (0.0007) [2023-03-07 03:26:37,354][118044] Updated weights for policy 0, policy_version 2940 (0.0007) [2023-03-07 03:26:38,152][118044] Updated weights for policy 0, policy_version 2950 (0.0006) [2023-03-07 03:26:38,918][118044] Updated weights for policy 0, policy_version 2960 (0.0006) [2023-03-07 03:26:39,698][118044] Updated weights for policy 0, policy_version 2970 (0.0006) [2023-03-07 03:26:40,472][118044] Updated weights for policy 0, policy_version 2980 (0.0007) [2023-03-07 03:26:41,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.4, 300 sec: 13015.9). Total num frames: 3058688. Throughput: 0: 13151.4. Samples: 3025264. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:26:41,096][117718] Avg episode reward: [(0, '3212.610')] [2023-03-07 03:26:41,254][118044] Updated weights for policy 0, policy_version 2990 (0.0006) [2023-03-07 03:26:42,023][118044] Updated weights for policy 0, policy_version 3000 (0.0005) [2023-03-07 03:26:42,799][118044] Updated weights for policy 0, policy_version 3010 (0.0007) [2023-03-07 03:26:43,572][118044] Updated weights for policy 0, policy_version 3020 (0.0006) [2023-03-07 03:26:44,343][118044] Updated weights for policy 0, policy_version 3030 (0.0006) [2023-03-07 03:26:45,133][118044] Updated weights for policy 0, policy_version 3040 (0.0006) [2023-03-07 03:26:45,902][118044] Updated weights for policy 0, policy_version 3050 (0.0006) [2023-03-07 03:26:46,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13022.1). Total num frames: 3125248. Throughput: 0: 13155.8. Samples: 3104699. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:26:46,096][117718] Avg episode reward: [(0, '3211.292')] [2023-03-07 03:26:46,681][118044] Updated weights for policy 0, policy_version 3060 (0.0006) [2023-03-07 03:26:47,472][118044] Updated weights for policy 0, policy_version 3070 (0.0007) [2023-03-07 03:26:48,235][118044] Updated weights for policy 0, policy_version 3080 (0.0006) [2023-03-07 03:26:49,015][118044] Updated weights for policy 0, policy_version 3090 (0.0006) [2023-03-07 03:26:49,808][118044] Updated weights for policy 0, policy_version 3100 (0.0006) [2023-03-07 03:26:50,559][118044] Updated weights for policy 0, policy_version 3110 (0.0006) [2023-03-07 03:26:51,085][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13023.8). Total num frames: 3190784. Throughput: 0: 13154.0. Samples: 3183676. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:26:51,097][117718] Avg episode reward: [(0, '3297.744')] [2023-03-07 03:26:51,329][118044] Updated weights for policy 0, policy_version 3120 (0.0007) [2023-03-07 03:26:52,106][118044] Updated weights for policy 0, policy_version 3130 (0.0007) [2023-03-07 03:26:52,888][118044] Updated weights for policy 0, policy_version 3140 (0.0007) [2023-03-07 03:26:53,674][118044] Updated weights for policy 0, policy_version 3150 (0.0007) [2023-03-07 03:26:54,453][118044] Updated weights for policy 0, policy_version 3160 (0.0007) [2023-03-07 03:26:55,240][118044] Updated weights for policy 0, policy_version 3170 (0.0005) [2023-03-07 03:26:56,001][118044] Updated weights for policy 0, policy_version 3180 (0.0006) [2023-03-07 03:26:56,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13029.6). Total num frames: 3257344. Throughput: 0: 13160.6. Samples: 3223302. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 03:26:56,096][117718] Avg episode reward: [(0, '3297.701')] [2023-03-07 03:26:56,769][118044] Updated weights for policy 0, policy_version 3190 (0.0006) [2023-03-07 03:26:57,562][118044] Updated weights for policy 0, policy_version 3200 (0.0007) [2023-03-07 03:26:58,332][118044] Updated weights for policy 0, policy_version 3210 (0.0006) [2023-03-07 03:26:59,117][118044] Updated weights for policy 0, policy_version 3220 (0.0006) [2023-03-07 03:26:59,887][118044] Updated weights for policy 0, policy_version 3230 (0.0006) [2023-03-07 03:27:00,666][118044] Updated weights for policy 0, policy_version 3240 (0.0006) [2023-03-07 03:27:01,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13031.1). Total num frames: 3322880. Throughput: 0: 13159.1. Samples: 3302325. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:27:01,086][117718] Avg episode reward: [(0, '3191.384')] [2023-03-07 03:27:01,445][118044] Updated weights for policy 0, policy_version 3250 (0.0006) [2023-03-07 03:27:02,229][118044] Updated weights for policy 0, policy_version 3260 (0.0007) [2023-03-07 03:27:03,014][118044] Updated weights for policy 0, policy_version 3270 (0.0006) [2023-03-07 03:27:03,791][118044] Updated weights for policy 0, policy_version 3280 (0.0005) [2023-03-07 03:27:04,586][118044] Updated weights for policy 0, policy_version 3290 (0.0005) [2023-03-07 03:27:05,363][118044] Updated weights for policy 0, policy_version 3300 (0.0006) [2023-03-07 03:27:06,085][117718] Fps is (10 sec: 13107.0, 60 sec: 13158.4, 300 sec: 13032.6). Total num frames: 3388416. Throughput: 0: 13152.1. Samples: 3380714. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:27:06,086][117718] Avg episode reward: [(0, '3148.767')] [2023-03-07 03:27:06,126][118044] Updated weights for policy 0, policy_version 3310 (0.0006) [2023-03-07 03:27:06,922][118044] Updated weights for policy 0, policy_version 3320 (0.0006) [2023-03-07 03:27:07,722][118044] Updated weights for policy 0, policy_version 3330 (0.0006) [2023-03-07 03:27:08,502][118044] Updated weights for policy 0, policy_version 3340 (0.0007) [2023-03-07 03:27:09,271][118044] Updated weights for policy 0, policy_version 3350 (0.0006) [2023-03-07 03:27:10,058][118044] Updated weights for policy 0, policy_version 3360 (0.0006) [2023-03-07 03:27:10,828][118044] Updated weights for policy 0, policy_version 3370 (0.0006) [2023-03-07 03:27:11,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13034.0). Total num frames: 3453952. Throughput: 0: 13147.6. Samples: 3420105. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:27:11,086][117718] Avg episode reward: [(0, '3235.605')] [2023-03-07 03:27:11,593][118044] Updated weights for policy 0, policy_version 3380 (0.0006) [2023-03-07 03:27:12,368][118044] Updated weights for policy 0, policy_version 3390 (0.0006) [2023-03-07 03:27:13,137][118044] Updated weights for policy 0, policy_version 3400 (0.0005) [2023-03-07 03:27:13,924][118044] Updated weights for policy 0, policy_version 3410 (0.0006) [2023-03-07 03:27:14,710][118044] Updated weights for policy 0, policy_version 3420 (0.0006) [2023-03-07 03:27:15,487][118044] Updated weights for policy 0, policy_version 3430 (0.0007) [2023-03-07 03:27:16,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13035.3). Total num frames: 3519488. Throughput: 0: 13151.6. Samples: 3499273. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:27:16,086][117718] Avg episode reward: [(0, '3152.845')] [2023-03-07 03:27:16,253][118044] Updated weights for policy 0, policy_version 3440 (0.0007) [2023-03-07 03:27:17,016][118044] Updated weights for policy 0, policy_version 3450 (0.0006) [2023-03-07 03:27:17,817][118044] Updated weights for policy 0, policy_version 3460 (0.0006) [2023-03-07 03:27:18,577][118044] Updated weights for policy 0, policy_version 3470 (0.0006) [2023-03-07 03:27:19,370][118044] Updated weights for policy 0, policy_version 3480 (0.0007) [2023-03-07 03:27:20,141][118044] Updated weights for policy 0, policy_version 3490 (0.0006) [2023-03-07 03:27:20,916][118044] Updated weights for policy 0, policy_version 3500 (0.0007) [2023-03-07 03:27:21,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13040.4). Total num frames: 3586048. Throughput: 0: 13165.8. Samples: 3578463. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:27:21,086][117718] Avg episode reward: [(0, '3181.107')] [2023-03-07 03:27:21,688][118044] Updated weights for policy 0, policy_version 3510 (0.0006) [2023-03-07 03:27:22,476][118044] Updated weights for policy 0, policy_version 3520 (0.0006) [2023-03-07 03:27:23,247][118044] Updated weights for policy 0, policy_version 3530 (0.0005) [2023-03-07 03:27:24,029][118044] Updated weights for policy 0, policy_version 3540 (0.0006) [2023-03-07 03:27:24,806][118044] Updated weights for policy 0, policy_version 3550 (0.0006) [2023-03-07 03:27:25,579][118044] Updated weights for policy 0, policy_version 3560 (0.0006) [2023-03-07 03:27:26,086][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13041.5). Total num frames: 3651584. Throughput: 0: 13170.4. Samples: 3617932. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:27:26,086][117718] Avg episode reward: [(0, '3198.085')] [2023-03-07 03:27:26,365][118044] Updated weights for policy 0, policy_version 3570 (0.0006) [2023-03-07 03:27:27,135][118044] Updated weights for policy 0, policy_version 3580 (0.0006) [2023-03-07 03:27:27,911][118044] Updated weights for policy 0, policy_version 3590 (0.0006) [2023-03-07 03:27:28,692][118044] Updated weights for policy 0, policy_version 3600 (0.0006) [2023-03-07 03:27:29,489][118044] Updated weights for policy 0, policy_version 3610 (0.0006) [2023-03-07 03:27:30,274][118044] Updated weights for policy 0, policy_version 3620 (0.0006) [2023-03-07 03:27:31,048][118044] Updated weights for policy 0, policy_version 3630 (0.0005) [2023-03-07 03:27:31,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13042.7). Total num frames: 3717120. Throughput: 0: 13153.0. Samples: 3696585. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:27:31,086][117718] Avg episode reward: [(0, '3134.280')] [2023-03-07 03:27:31,820][118044] Updated weights for policy 0, policy_version 3640 (0.0006) [2023-03-07 03:27:32,574][118044] Updated weights for policy 0, policy_version 3650 (0.0006) [2023-03-07 03:27:33,357][118044] Updated weights for policy 0, policy_version 3660 (0.0005) [2023-03-07 03:27:34,134][118044] Updated weights for policy 0, policy_version 3670 (0.0006) [2023-03-07 03:27:34,915][118044] Updated weights for policy 0, policy_version 3680 (0.0007) [2023-03-07 03:27:35,722][118044] Updated weights for policy 0, policy_version 3690 (0.0006) [2023-03-07 03:27:36,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13043.8). Total num frames: 3782656. Throughput: 0: 13157.5. Samples: 3775765. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:27:36,086][117718] Avg episode reward: [(0, '3235.157')] [2023-03-07 03:27:36,502][118044] Updated weights for policy 0, policy_version 3700 (0.0006) [2023-03-07 03:27:37,290][118044] Updated weights for policy 0, policy_version 3710 (0.0007) [2023-03-07 03:27:38,057][118044] Updated weights for policy 0, policy_version 3720 (0.0006) [2023-03-07 03:27:38,853][118044] Updated weights for policy 0, policy_version 3730 (0.0006) [2023-03-07 03:27:39,626][118044] Updated weights for policy 0, policy_version 3740 (0.0007) [2023-03-07 03:27:40,417][118044] Updated weights for policy 0, policy_version 3750 (0.0007) [2023-03-07 03:27:41,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13044.9). Total num frames: 3848192. Throughput: 0: 13138.5. Samples: 3814536. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:27:41,086][117718] Avg episode reward: [(0, '3053.807')] [2023-03-07 03:27:41,213][118044] Updated weights for policy 0, policy_version 3760 (0.0005) [2023-03-07 03:27:42,003][118044] Updated weights for policy 0, policy_version 3770 (0.0006) [2023-03-07 03:27:42,790][118044] Updated weights for policy 0, policy_version 3780 (0.0005) [2023-03-07 03:27:43,587][118044] Updated weights for policy 0, policy_version 3790 (0.0006) [2023-03-07 03:27:44,360][118044] Updated weights for policy 0, policy_version 3800 (0.0006) [2023-03-07 03:27:45,126][118044] Updated weights for policy 0, policy_version 3810 (0.0006) [2023-03-07 03:27:45,892][118044] Updated weights for policy 0, policy_version 3820 (0.0006) [2023-03-07 03:27:46,085][117718] Fps is (10 sec: 13107.4, 60 sec: 13141.3, 300 sec: 13166.2). Total num frames: 3913728. Throughput: 0: 13125.0. Samples: 3892949. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:27:46,086][117718] Avg episode reward: [(0, '3132.186')] [2023-03-07 03:27:46,679][118044] Updated weights for policy 0, policy_version 3830 (0.0006) [2023-03-07 03:27:47,458][118044] Updated weights for policy 0, policy_version 3840 (0.0006) [2023-03-07 03:27:48,225][118044] Updated weights for policy 0, policy_version 3850 (0.0006) [2023-03-07 03:27:49,027][118044] Updated weights for policy 0, policy_version 3860 (0.0006) [2023-03-07 03:27:49,778][118044] Updated weights for policy 0, policy_version 3870 (0.0006) [2023-03-07 03:27:50,572][118044] Updated weights for policy 0, policy_version 3880 (0.0006) [2023-03-07 03:27:51,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13166.2). Total num frames: 3979264. Throughput: 0: 13139.3. Samples: 3971983. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:27:51,086][117718] Avg episode reward: [(0, '3069.241')] [2023-03-07 03:27:51,346][118044] Updated weights for policy 0, policy_version 3890 (0.0006) [2023-03-07 03:27:52,118][118044] Updated weights for policy 0, policy_version 3900 (0.0006) [2023-03-07 03:27:52,910][118044] Updated weights for policy 0, policy_version 3910 (0.0006) [2023-03-07 03:27:53,693][118044] Updated weights for policy 0, policy_version 3920 (0.0006) [2023-03-07 03:27:54,473][118044] Updated weights for policy 0, policy_version 3930 (0.0006) [2023-03-07 03:27:55,261][118044] Updated weights for policy 0, policy_version 3940 (0.0006) [2023-03-07 03:27:56,030][118044] Updated weights for policy 0, policy_version 3950 (0.0006) [2023-03-07 03:27:56,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13124.2, 300 sec: 13162.7). Total num frames: 4044800. Throughput: 0: 13135.7. Samples: 4011213. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:27:56,086][117718] Avg episode reward: [(0, '3204.597')] [2023-03-07 03:27:56,824][118044] Updated weights for policy 0, policy_version 3960 (0.0007) [2023-03-07 03:27:57,597][118044] Updated weights for policy 0, policy_version 3970 (0.0006) [2023-03-07 03:27:58,389][118044] Updated weights for policy 0, policy_version 3980 (0.0007) [2023-03-07 03:27:59,154][118044] Updated weights for policy 0, policy_version 3990 (0.0006) [2023-03-07 03:27:59,934][118044] Updated weights for policy 0, policy_version 4000 (0.0005) [2023-03-07 03:28:00,728][118044] Updated weights for policy 0, policy_version 4010 (0.0007) [2023-03-07 03:28:01,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13124.2, 300 sec: 13159.3). Total num frames: 4110336. Throughput: 0: 13132.0. Samples: 4090213. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:28:01,086][117718] Avg episode reward: [(0, '3035.668')] [2023-03-07 03:28:01,510][118044] Updated weights for policy 0, policy_version 4020 (0.0006) [2023-03-07 03:28:02,278][118044] Updated weights for policy 0, policy_version 4030 (0.0006) [2023-03-07 03:28:03,054][118044] Updated weights for policy 0, policy_version 4040 (0.0006) [2023-03-07 03:28:03,839][118044] Updated weights for policy 0, policy_version 4050 (0.0006) [2023-03-07 03:28:04,622][118044] Updated weights for policy 0, policy_version 4060 (0.0006) [2023-03-07 03:28:05,402][118044] Updated weights for policy 0, policy_version 4070 (0.0006) [2023-03-07 03:28:06,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13155.8). Total num frames: 4175872. Throughput: 0: 13121.8. Samples: 4168943. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:28:06,086][117718] Avg episode reward: [(0, '3039.186')] [2023-03-07 03:28:06,180][118044] Updated weights for policy 0, policy_version 4080 (0.0006) [2023-03-07 03:28:06,961][118044] Updated weights for policy 0, policy_version 4090 (0.0006) [2023-03-07 03:28:07,729][118044] Updated weights for policy 0, policy_version 4100 (0.0006) [2023-03-07 03:28:08,506][118044] Updated weights for policy 0, policy_version 4110 (0.0006) [2023-03-07 03:28:09,297][118044] Updated weights for policy 0, policy_version 4120 (0.0006) [2023-03-07 03:28:10,081][118044] Updated weights for policy 0, policy_version 4130 (0.0006) [2023-03-07 03:28:10,866][118044] Updated weights for policy 0, policy_version 4140 (0.0006) [2023-03-07 03:28:11,085][117718] Fps is (10 sec: 13209.8, 60 sec: 13141.3, 300 sec: 13159.3). Total num frames: 4242432. Throughput: 0: 13116.4. Samples: 4208170. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 03:28:11,086][117718] Avg episode reward: [(0, '2905.670')] [2023-03-07 03:28:11,653][118044] Updated weights for policy 0, policy_version 4150 (0.0007) [2023-03-07 03:28:12,436][118044] Updated weights for policy 0, policy_version 4160 (0.0005) [2023-03-07 03:28:13,212][118044] Updated weights for policy 0, policy_version 4170 (0.0006) [2023-03-07 03:28:13,969][118044] Updated weights for policy 0, policy_version 4180 (0.0006) [2023-03-07 03:28:14,746][118044] Updated weights for policy 0, policy_version 4190 (0.0007) [2023-03-07 03:28:15,542][118044] Updated weights for policy 0, policy_version 4200 (0.0006) [2023-03-07 03:28:16,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.4, 300 sec: 13155.8). Total num frames: 4307968. Throughput: 0: 13121.6. Samples: 4287059. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 03:28:16,086][117718] Avg episode reward: [(0, '3110.412')] [2023-03-07 03:28:16,305][118044] Updated weights for policy 0, policy_version 4210 (0.0006) [2023-03-07 03:28:17,067][118044] Updated weights for policy 0, policy_version 4220 (0.0007) [2023-03-07 03:28:17,873][118044] Updated weights for policy 0, policy_version 4230 (0.0006) [2023-03-07 03:28:18,640][118044] Updated weights for policy 0, policy_version 4240 (0.0006) [2023-03-07 03:28:19,449][118044] Updated weights for policy 0, policy_version 4250 (0.0006) [2023-03-07 03:28:20,218][118044] Updated weights for policy 0, policy_version 4260 (0.0006) [2023-03-07 03:28:21,000][118044] Updated weights for policy 0, policy_version 4270 (0.0006) [2023-03-07 03:28:21,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13124.3, 300 sec: 13159.3). Total num frames: 4373504. Throughput: 0: 13110.0. Samples: 4365714. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:28:21,086][117718] Avg episode reward: [(0, '3154.761')] [2023-03-07 03:28:21,762][118044] Updated weights for policy 0, policy_version 4280 (0.0007) [2023-03-07 03:28:22,567][118044] Updated weights for policy 0, policy_version 4290 (0.0006) [2023-03-07 03:28:23,350][118044] Updated weights for policy 0, policy_version 4300 (0.0005) [2023-03-07 03:28:24,142][118044] Updated weights for policy 0, policy_version 4310 (0.0006) [2023-03-07 03:28:24,928][118044] Updated weights for policy 0, policy_version 4320 (0.0007) [2023-03-07 03:28:25,682][118044] Updated weights for policy 0, policy_version 4330 (0.0007) [2023-03-07 03:28:26,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13155.8). Total num frames: 4439040. Throughput: 0: 13123.4. Samples: 4405087. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:28:26,086][117718] Avg episode reward: [(0, '3245.323')] [2023-03-07 03:28:26,451][118044] Updated weights for policy 0, policy_version 4340 (0.0006) [2023-03-07 03:28:27,243][118044] Updated weights for policy 0, policy_version 4350 (0.0007) [2023-03-07 03:28:28,003][118044] Updated weights for policy 0, policy_version 4360 (0.0006) [2023-03-07 03:28:28,793][118044] Updated weights for policy 0, policy_version 4370 (0.0007) [2023-03-07 03:28:29,569][118044] Updated weights for policy 0, policy_version 4380 (0.0006) [2023-03-07 03:28:30,333][118044] Updated weights for policy 0, policy_version 4390 (0.0006) [2023-03-07 03:28:31,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13124.2, 300 sec: 13152.3). Total num frames: 4504576. Throughput: 0: 13136.8. Samples: 4484106. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:28:31,086][117718] Avg episode reward: [(0, '3183.616')] [2023-03-07 03:28:31,113][118044] Updated weights for policy 0, policy_version 4400 (0.0006) [2023-03-07 03:28:31,897][118044] Updated weights for policy 0, policy_version 4410 (0.0006) [2023-03-07 03:28:32,690][118044] Updated weights for policy 0, policy_version 4420 (0.0006) [2023-03-07 03:28:33,461][118044] Updated weights for policy 0, policy_version 4430 (0.0006) [2023-03-07 03:28:34,228][118044] Updated weights for policy 0, policy_version 4440 (0.0007) [2023-03-07 03:28:35,012][118044] Updated weights for policy 0, policy_version 4450 (0.0006) [2023-03-07 03:28:35,784][118044] Updated weights for policy 0, policy_version 4460 (0.0006) [2023-03-07 03:28:36,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13124.3, 300 sec: 13152.3). Total num frames: 4570112. Throughput: 0: 13134.8. Samples: 4563051. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 03:28:36,086][117718] Avg episode reward: [(0, '3249.819')] [2023-03-07 03:28:36,093][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000004464_4571136.pth... 
[2023-03-07 03:28:36,122][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000001382_1415168.pth [2023-03-07 03:28:36,565][118044] Updated weights for policy 0, policy_version 4470 (0.0006) [2023-03-07 03:28:37,338][118044] Updated weights for policy 0, policy_version 4480 (0.0006) [2023-03-07 03:28:38,129][118044] Updated weights for policy 0, policy_version 4490 (0.0006) [2023-03-07 03:28:38,917][118044] Updated weights for policy 0, policy_version 4500 (0.0006) [2023-03-07 03:28:39,710][118044] Updated weights for policy 0, policy_version 4510 (0.0006) [2023-03-07 03:28:40,472][118044] Updated weights for policy 0, policy_version 4520 (0.0005) [2023-03-07 03:28:41,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13152.3). Total num frames: 4635648. Throughput: 0: 13136.7. Samples: 4602363. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:28:41,086][117718] Avg episode reward: [(0, '3209.407')] [2023-03-07 03:28:41,262][118044] Updated weights for policy 0, policy_version 4530 (0.0007) [2023-03-07 03:28:42,033][118044] Updated weights for policy 0, policy_version 4540 (0.0006) [2023-03-07 03:28:42,825][118044] Updated weights for policy 0, policy_version 4550 (0.0006) [2023-03-07 03:28:43,590][118044] Updated weights for policy 0, policy_version 4560 (0.0006) [2023-03-07 03:28:44,382][118044] Updated weights for policy 0, policy_version 4570 (0.0006) [2023-03-07 03:28:45,155][118044] Updated weights for policy 0, policy_version 4580 (0.0006) [2023-03-07 03:28:45,943][118044] Updated weights for policy 0, policy_version 4590 (0.0006) [2023-03-07 03:28:46,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13124.2, 300 sec: 13148.8). Total num frames: 4701184. Throughput: 0: 13127.9. Samples: 4680969. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:28:46,087][117718] Avg episode reward: [(0, '3189.593')] [2023-03-07 03:28:46,732][118044] Updated weights for policy 0, policy_version 4600 (0.0006) [2023-03-07 03:28:47,507][118044] Updated weights for policy 0, policy_version 4610 (0.0006) [2023-03-07 03:28:48,292][118044] Updated weights for policy 0, policy_version 4620 (0.0008) [2023-03-07 03:28:49,064][118044] Updated weights for policy 0, policy_version 4630 (0.0006) [2023-03-07 03:28:49,852][118044] Updated weights for policy 0, policy_version 4640 (0.0006) [2023-03-07 03:28:50,625][118044] Updated weights for policy 0, policy_version 4650 (0.0007) [2023-03-07 03:28:51,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13152.3). Total num frames: 4767744. Throughput: 0: 13130.3. Samples: 4759806. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:28:51,086][117718] Avg episode reward: [(0, '3155.840')] [2023-03-07 03:28:51,401][118044] Updated weights for policy 0, policy_version 4660 (0.0006) [2023-03-07 03:28:52,171][118044] Updated weights for policy 0, policy_version 4670 (0.0006) [2023-03-07 03:28:52,939][118044] Updated weights for policy 0, policy_version 4680 (0.0006) [2023-03-07 03:28:53,722][118044] Updated weights for policy 0, policy_version 4690 (0.0007) [2023-03-07 03:28:54,504][118044] Updated weights for policy 0, policy_version 4700 (0.0006) [2023-03-07 03:28:55,270][118044] Updated weights for policy 0, policy_version 4710 (0.0006) [2023-03-07 03:28:56,057][118044] Updated weights for policy 0, policy_version 4720 (0.0007) [2023-03-07 03:28:56,085][117718] Fps is (10 sec: 13209.9, 60 sec: 13141.4, 300 sec: 13152.3). Total num frames: 4833280. Throughput: 0: 13138.2. Samples: 4799390. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:28:56,086][117718] Avg episode reward: [(0, '3112.800')] [2023-03-07 03:28:56,840][118044] Updated weights for policy 0, policy_version 4730 (0.0006) [2023-03-07 03:28:57,617][118044] Updated weights for policy 0, policy_version 4740 (0.0006) [2023-03-07 03:28:58,395][118044] Updated weights for policy 0, policy_version 4750 (0.0006) [2023-03-07 03:28:59,169][118044] Updated weights for policy 0, policy_version 4760 (0.0006) [2023-03-07 03:28:59,938][118044] Updated weights for policy 0, policy_version 4770 (0.0007) [2023-03-07 03:29:00,729][118044] Updated weights for policy 0, policy_version 4780 (0.0006) [2023-03-07 03:29:01,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.4, 300 sec: 13152.3). Total num frames: 4898816. Throughput: 0: 13142.3. Samples: 4878462. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:29:01,086][117718] Avg episode reward: [(0, '3195.010')] [2023-03-07 03:29:01,514][118044] Updated weights for policy 0, policy_version 4790 (0.0006) [2023-03-07 03:29:02,275][118044] Updated weights for policy 0, policy_version 4800 (0.0006) [2023-03-07 03:29:03,059][118044] Updated weights for policy 0, policy_version 4810 (0.0006) [2023-03-07 03:29:03,856][118044] Updated weights for policy 0, policy_version 4820 (0.0006) [2023-03-07 03:29:04,638][118044] Updated weights for policy 0, policy_version 4830 (0.0006) [2023-03-07 03:29:05,427][118044] Updated weights for policy 0, policy_version 4840 (0.0006) [2023-03-07 03:29:06,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13148.8). Total num frames: 4964352. Throughput: 0: 13141.2. Samples: 4957071. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:29:06,086][117718] Avg episode reward: [(0, '3070.983')] [2023-03-07 03:29:06,219][118044] Updated weights for policy 0, policy_version 4850 (0.0006) [2023-03-07 03:29:07,002][118044] Updated weights for policy 0, policy_version 4860 (0.0007) [2023-03-07 03:29:07,776][118044] Updated weights for policy 0, policy_version 4870 (0.0006) [2023-03-07 03:29:08,546][118044] Updated weights for policy 0, policy_version 4880 (0.0006) [2023-03-07 03:29:09,314][118044] Updated weights for policy 0, policy_version 4890 (0.0006) [2023-03-07 03:29:10,094][118044] Updated weights for policy 0, policy_version 4900 (0.0006) [2023-03-07 03:29:10,881][118044] Updated weights for policy 0, policy_version 4910 (0.0006) [2023-03-07 03:29:11,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13124.2, 300 sec: 13148.9). Total num frames: 5029888. Throughput: 0: 13140.0. Samples: 4996386. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:29:11,086][117718] Avg episode reward: [(0, '3215.265')] [2023-03-07 03:29:11,650][118044] Updated weights for policy 0, policy_version 4920 (0.0006) [2023-03-07 03:29:12,432][118044] Updated weights for policy 0, policy_version 4930 (0.0006) [2023-03-07 03:29:13,213][118044] Updated weights for policy 0, policy_version 4940 (0.0006) [2023-03-07 03:29:13,998][118044] Updated weights for policy 0, policy_version 4950 (0.0006) [2023-03-07 03:29:14,774][118044] Updated weights for policy 0, policy_version 4960 (0.0006) [2023-03-07 03:29:15,548][118044] Updated weights for policy 0, policy_version 4970 (0.0006) [2023-03-07 03:29:16,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13148.9). Total num frames: 5095424. Throughput: 0: 13134.5. Samples: 5075157. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:29:16,086][117718] Avg episode reward: [(0, '3221.608')] [2023-03-07 03:29:16,331][118044] Updated weights for policy 0, policy_version 4980 (0.0006) [2023-03-07 03:29:17,106][118044] Updated weights for policy 0, policy_version 4990 (0.0006) [2023-03-07 03:29:17,882][118044] Updated weights for policy 0, policy_version 5000 (0.0006) [2023-03-07 03:29:18,669][118044] Updated weights for policy 0, policy_version 5010 (0.0006) [2023-03-07 03:29:19,448][118044] Updated weights for policy 0, policy_version 5020 (0.0006) [2023-03-07 03:29:20,230][118044] Updated weights for policy 0, policy_version 5030 (0.0006) [2023-03-07 03:29:21,019][118044] Updated weights for policy 0, policy_version 5040 (0.0006) [2023-03-07 03:29:21,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13148.9). Total num frames: 5160960. Throughput: 0: 13130.2. Samples: 5153909. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 03:29:21,086][117718] Avg episode reward: [(0, '3062.860')] [2023-03-07 03:29:21,788][118044] Updated weights for policy 0, policy_version 5050 (0.0006) [2023-03-07 03:29:22,581][118044] Updated weights for policy 0, policy_version 5060 (0.0006) [2023-03-07 03:29:23,356][118044] Updated weights for policy 0, policy_version 5070 (0.0006) [2023-03-07 03:29:24,127][118044] Updated weights for policy 0, policy_version 5080 (0.0006) [2023-03-07 03:29:24,908][118044] Updated weights for policy 0, policy_version 5090 (0.0006) [2023-03-07 03:29:25,686][118044] Updated weights for policy 0, policy_version 5100 (0.0006) [2023-03-07 03:29:26,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 5227520. Throughput: 0: 13133.4. Samples: 5193366. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 03:29:26,086][117718] Avg episode reward: [(0, '3130.927')] [2023-03-07 03:29:26,457][118044] Updated weights for policy 0, policy_version 5110 (0.0006) [2023-03-07 03:29:27,232][118044] Updated weights for policy 0, policy_version 5120 (0.0006) [2023-03-07 03:29:28,018][118044] Updated weights for policy 0, policy_version 5130 (0.0006) [2023-03-07 03:29:28,790][118044] Updated weights for policy 0, policy_version 5140 (0.0006) [2023-03-07 03:29:29,596][118044] Updated weights for policy 0, policy_version 5150 (0.0007) [2023-03-07 03:29:30,376][118044] Updated weights for policy 0, policy_version 5160 (0.0006) [2023-03-07 03:29:31,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 5293056. Throughput: 0: 13143.3. Samples: 5272417. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:29:31,086][117718] Avg episode reward: [(0, '3225.126')] [2023-03-07 03:29:31,155][118044] Updated weights for policy 0, policy_version 5170 (0.0005) [2023-03-07 03:29:31,929][118044] Updated weights for policy 0, policy_version 5180 (0.0006) [2023-03-07 03:29:32,719][118044] Updated weights for policy 0, policy_version 5190 (0.0006) [2023-03-07 03:29:33,517][118044] Updated weights for policy 0, policy_version 5200 (0.0006) [2023-03-07 03:29:34,282][118044] Updated weights for policy 0, policy_version 5210 (0.0006) [2023-03-07 03:29:35,078][118044] Updated weights for policy 0, policy_version 5220 (0.0006) [2023-03-07 03:29:35,841][118044] Updated weights for policy 0, policy_version 5230 (0.0006) [2023-03-07 03:29:36,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 5358592. Throughput: 0: 13134.3. Samples: 5350850. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:29:36,086][117718] Avg episode reward: [(0, '3160.911')] [2023-03-07 03:29:36,621][118044] Updated weights for policy 0, policy_version 5240 (0.0006) [2023-03-07 03:29:37,392][118044] Updated weights for policy 0, policy_version 5250 (0.0006) [2023-03-07 03:29:38,178][118044] Updated weights for policy 0, policy_version 5260 (0.0006) [2023-03-07 03:29:38,973][118044] Updated weights for policy 0, policy_version 5270 (0.0006) [2023-03-07 03:29:39,735][118044] Updated weights for policy 0, policy_version 5280 (0.0006) [2023-03-07 03:29:40,523][118044] Updated weights for policy 0, policy_version 5290 (0.0006) [2023-03-07 03:29:41,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 5424128. Throughput: 0: 13126.7. Samples: 5390092. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:29:41,086][117718] Avg episode reward: [(0, '3068.977')] [2023-03-07 03:29:41,320][118044] Updated weights for policy 0, policy_version 5300 (0.0007) [2023-03-07 03:29:42,103][118044] Updated weights for policy 0, policy_version 5310 (0.0006) [2023-03-07 03:29:42,864][118044] Updated weights for policy 0, policy_version 5320 (0.0005) [2023-03-07 03:29:43,644][118044] Updated weights for policy 0, policy_version 5330 (0.0006) [2023-03-07 03:29:44,421][118044] Updated weights for policy 0, policy_version 5340 (0.0006) [2023-03-07 03:29:45,198][118044] Updated weights for policy 0, policy_version 5350 (0.0006) [2023-03-07 03:29:45,978][118044] Updated weights for policy 0, policy_version 5360 (0.0006) [2023-03-07 03:29:46,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.4, 300 sec: 13145.4). Total num frames: 5489664. Throughput: 0: 13125.7. Samples: 5469121. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:29:46,086][117718] Avg episode reward: [(0, '3064.678')] [2023-03-07 03:29:46,765][118044] Updated weights for policy 0, policy_version 5370 (0.0006) [2023-03-07 03:29:47,556][118044] Updated weights for policy 0, policy_version 5380 (0.0007) [2023-03-07 03:29:48,344][118044] Updated weights for policy 0, policy_version 5390 (0.0007) [2023-03-07 03:29:49,135][118044] Updated weights for policy 0, policy_version 5400 (0.0006) [2023-03-07 03:29:49,927][118044] Updated weights for policy 0, policy_version 5410 (0.0007) [2023-03-07 03:29:50,712][118044] Updated weights for policy 0, policy_version 5420 (0.0007) [2023-03-07 03:29:51,086][117718] Fps is (10 sec: 13004.7, 60 sec: 13107.2, 300 sec: 13141.9). Total num frames: 5554176. Throughput: 0: 13112.6. Samples: 5547138. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 03:29:51,086][117718] Avg episode reward: [(0, '3016.496')] [2023-03-07 03:29:51,483][118044] Updated weights for policy 0, policy_version 5430 (0.0006) [2023-03-07 03:29:52,264][118044] Updated weights for policy 0, policy_version 5440 (0.0006) [2023-03-07 03:29:53,041][118044] Updated weights for policy 0, policy_version 5450 (0.0007) [2023-03-07 03:29:53,825][118044] Updated weights for policy 0, policy_version 5460 (0.0006) [2023-03-07 03:29:54,601][118044] Updated weights for policy 0, policy_version 5470 (0.0006) [2023-03-07 03:29:55,381][118044] Updated weights for policy 0, policy_version 5480 (0.0006) [2023-03-07 03:29:56,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13145.4). Total num frames: 5620736. Throughput: 0: 13116.6. Samples: 5586631. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:29:56,086][117718] Avg episode reward: [(0, '3148.385')] [2023-03-07 03:29:56,151][118044] Updated weights for policy 0, policy_version 5490 (0.0006) [2023-03-07 03:29:56,954][118044] Updated weights for policy 0, policy_version 5500 (0.0007) [2023-03-07 03:29:57,730][118044] Updated weights for policy 0, policy_version 5510 (0.0007) [2023-03-07 03:29:58,505][118044] Updated weights for policy 0, policy_version 5520 (0.0006) [2023-03-07 03:29:59,295][118044] Updated weights for policy 0, policy_version 5530 (0.0006) [2023-03-07 03:30:00,081][118044] Updated weights for policy 0, policy_version 5540 (0.0005) [2023-03-07 03:30:00,848][118044] Updated weights for policy 0, policy_version 5550 (0.0006) [2023-03-07 03:30:01,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13124.3, 300 sec: 13141.9). Total num frames: 5686272. Throughput: 0: 13115.0. Samples: 5665333. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:30:01,086][117718] Avg episode reward: [(0, '3116.889')] [2023-03-07 03:30:01,633][118044] Updated weights for policy 0, policy_version 5560 (0.0006) [2023-03-07 03:30:02,410][118044] Updated weights for policy 0, policy_version 5570 (0.0006) [2023-03-07 03:30:03,174][118044] Updated weights for policy 0, policy_version 5580 (0.0006) [2023-03-07 03:30:03,961][118044] Updated weights for policy 0, policy_version 5590 (0.0006) [2023-03-07 03:30:04,761][118044] Updated weights for policy 0, policy_version 5600 (0.0006) [2023-03-07 03:30:05,536][118044] Updated weights for policy 0, policy_version 5610 (0.0007) [2023-03-07 03:30:06,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13141.9). Total num frames: 5751808. Throughput: 0: 13114.6. Samples: 5744064. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:30:06,086][117718] Avg episode reward: [(0, '3267.416')] [2023-03-07 03:30:06,297][118044] Updated weights for policy 0, policy_version 5620 (0.0006) [2023-03-07 03:30:07,082][118044] Updated weights for policy 0, policy_version 5630 (0.0006) [2023-03-07 03:30:07,868][118044] Updated weights for policy 0, policy_version 5640 (0.0006) [2023-03-07 03:30:08,664][118044] Updated weights for policy 0, policy_version 5650 (0.0006) [2023-03-07 03:30:09,432][118044] Updated weights for policy 0, policy_version 5660 (0.0006) [2023-03-07 03:30:10,200][118044] Updated weights for policy 0, policy_version 5670 (0.0006) [2023-03-07 03:30:10,988][118044] Updated weights for policy 0, policy_version 5680 (0.0006) [2023-03-07 03:30:11,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13138.4). Total num frames: 5817344. Throughput: 0: 13112.6. Samples: 5783432. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:30:11,097][117718] Avg episode reward: [(0, '3235.058')] [2023-03-07 03:30:11,768][118044] Updated weights for policy 0, policy_version 5690 (0.0006) [2023-03-07 03:30:12,545][118044] Updated weights for policy 0, policy_version 5700 (0.0007) [2023-03-07 03:30:13,334][118044] Updated weights for policy 0, policy_version 5710 (0.0006) [2023-03-07 03:30:14,092][118044] Updated weights for policy 0, policy_version 5720 (0.0006) [2023-03-07 03:30:14,878][118044] Updated weights for policy 0, policy_version 5730 (0.0005) [2023-03-07 03:30:15,667][118044] Updated weights for policy 0, policy_version 5740 (0.0007) [2023-03-07 03:30:16,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13138.4). Total num frames: 5882880. Throughput: 0: 13112.3. Samples: 5862471. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:30:16,096][117718] Avg episode reward: [(0, '3248.869')] [2023-03-07 03:30:16,440][118044] Updated weights for policy 0, policy_version 5750 (0.0006) [2023-03-07 03:30:17,200][118044] Updated weights for policy 0, policy_version 5760 (0.0006) [2023-03-07 03:30:17,970][118044] Updated weights for policy 0, policy_version 5770 (0.0006) [2023-03-07 03:30:18,745][118044] Updated weights for policy 0, policy_version 5780 (0.0007) [2023-03-07 03:30:19,519][118044] Updated weights for policy 0, policy_version 5790 (0.0007) [2023-03-07 03:30:20,281][118044] Updated weights for policy 0, policy_version 5800 (0.0006) [2023-03-07 03:30:21,068][118044] Updated weights for policy 0, policy_version 5810 (0.0006) [2023-03-07 03:30:21,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 5949440. Throughput: 0: 13134.6. Samples: 5941905. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:30:21,097][117718] Avg episode reward: [(0, '3160.553')] [2023-03-07 03:30:21,861][118044] Updated weights for policy 0, policy_version 5820 (0.0006) [2023-03-07 03:30:22,639][118044] Updated weights for policy 0, policy_version 5830 (0.0005) [2023-03-07 03:30:23,429][118044] Updated weights for policy 0, policy_version 5840 (0.0007) [2023-03-07 03:30:24,210][118044] Updated weights for policy 0, policy_version 5850 (0.0006) [2023-03-07 03:30:24,986][118044] Updated weights for policy 0, policy_version 5860 (0.0007) [2023-03-07 03:30:25,781][118044] Updated weights for policy 0, policy_version 5870 (0.0006) [2023-03-07 03:30:26,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13135.0). Total num frames: 6013952. Throughput: 0: 13134.9. Samples: 5981163. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:30:26,096][117718] Avg episode reward: [(0, '3196.885')] [2023-03-07 03:30:26,542][118044] Updated weights for policy 0, policy_version 5880 (0.0007) [2023-03-07 03:30:27,325][118044] Updated weights for policy 0, policy_version 5890 (0.0006) [2023-03-07 03:30:28,103][118044] Updated weights for policy 0, policy_version 5900 (0.0006) [2023-03-07 03:30:28,874][118044] Updated weights for policy 0, policy_version 5910 (0.0006) [2023-03-07 03:30:29,653][118044] Updated weights for policy 0, policy_version 5920 (0.0005) [2023-03-07 03:30:30,428][118044] Updated weights for policy 0, policy_version 5930 (0.0006) [2023-03-07 03:30:31,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13138.4). Total num frames: 6080512. Throughput: 0: 13128.8. Samples: 6059915. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 03:30:31,096][117718] Avg episode reward: [(0, '3302.773')] [2023-03-07 03:30:31,201][118044] Updated weights for policy 0, policy_version 5940 (0.0006) [2023-03-07 03:30:31,979][118044] Updated weights for policy 0, policy_version 5950 (0.0006) [2023-03-07 03:30:32,760][118044] Updated weights for policy 0, policy_version 5960 (0.0006) [2023-03-07 03:30:33,548][118044] Updated weights for policy 0, policy_version 5970 (0.0007) [2023-03-07 03:30:34,340][118044] Updated weights for policy 0, policy_version 5980 (0.0006) [2023-03-07 03:30:35,128][118044] Updated weights for policy 0, policy_version 5990 (0.0006) [2023-03-07 03:30:35,906][118044] Updated weights for policy 0, policy_version 6000 (0.0006) [2023-03-07 03:30:36,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13124.3, 300 sec: 13138.4). Total num frames: 6146048. Throughput: 0: 13140.0. Samples: 6138437. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:30:36,097][117718] Avg episode reward: [(0, '3206.902')] [2023-03-07 03:30:36,101][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000006002_6146048.pth... [2023-03-07 03:30:36,131][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000002924_2994176.pth [2023-03-07 03:30:36,699][118044] Updated weights for policy 0, policy_version 6010 (0.0006) [2023-03-07 03:30:37,477][118044] Updated weights for policy 0, policy_version 6020 (0.0006) [2023-03-07 03:30:38,257][118044] Updated weights for policy 0, policy_version 6030 (0.0007) [2023-03-07 03:30:39,060][118044] Updated weights for policy 0, policy_version 6040 (0.0006) [2023-03-07 03:30:39,823][118044] Updated weights for policy 0, policy_version 6050 (0.0006) [2023-03-07 03:30:40,587][118044] Updated weights for policy 0, policy_version 6060 (0.0006) [2023-03-07 03:30:41,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13124.3, 300 sec: 13138.4). Total num frames: 6211584. Throughput: 0: 13135.6. Samples: 6177733. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:30:41,102][117718] Avg episode reward: [(0, '3114.231')] [2023-03-07 03:30:41,354][118044] Updated weights for policy 0, policy_version 6070 (0.0006) [2023-03-07 03:30:42,127][118044] Updated weights for policy 0, policy_version 6080 (0.0006) [2023-03-07 03:30:42,914][118044] Updated weights for policy 0, policy_version 6090 (0.0006) [2023-03-07 03:30:43,678][118044] Updated weights for policy 0, policy_version 6100 (0.0005) [2023-03-07 03:30:44,462][118044] Updated weights for policy 0, policy_version 6110 (0.0005) [2023-03-07 03:30:45,254][118044] Updated weights for policy 0, policy_version 6120 (0.0006) [2023-03-07 03:30:46,026][118044] Updated weights for policy 0, policy_version 6130 (0.0007) [2023-03-07 03:30:46,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13138.4). Total num frames: 6277120. Throughput: 0: 13145.8. Samples: 6256896. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:30:46,097][117718] Avg episode reward: [(0, '3049.804')] [2023-03-07 03:30:46,797][118044] Updated weights for policy 0, policy_version 6140 (0.0005) [2023-03-07 03:30:47,589][118044] Updated weights for policy 0, policy_version 6150 (0.0007) [2023-03-07 03:30:48,360][118044] Updated weights for policy 0, policy_version 6160 (0.0007) [2023-03-07 03:30:49,129][118044] Updated weights for policy 0, policy_version 6170 (0.0006) [2023-03-07 03:30:49,901][118044] Updated weights for policy 0, policy_version 6180 (0.0006) [2023-03-07 03:30:50,690][118044] Updated weights for policy 0, policy_version 6190 (0.0006) [2023-03-07 03:30:51,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13141.9). Total num frames: 6343680. Throughput: 0: 13150.4. Samples: 6335830. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 03:30:51,086][117718] Avg episode reward: [(0, '3078.761')] [2023-03-07 03:30:51,471][118044] Updated weights for policy 0, policy_version 6200 (0.0006) [2023-03-07 03:30:52,256][118044] Updated weights for policy 0, policy_version 6210 (0.0006) [2023-03-07 03:30:53,046][118044] Updated weights for policy 0, policy_version 6220 (0.0006) [2023-03-07 03:30:53,830][118044] Updated weights for policy 0, policy_version 6230 (0.0006) [2023-03-07 03:30:54,605][118044] Updated weights for policy 0, policy_version 6240 (0.0006) [2023-03-07 03:30:55,391][118044] Updated weights for policy 0, policy_version 6250 (0.0006) [2023-03-07 03:30:56,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13135.0). Total num frames: 6408192. Throughput: 0: 13146.7. Samples: 6375032. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:30:56,086][117718] Avg episode reward: [(0, '3209.278')] [2023-03-07 03:30:56,191][118044] Updated weights for policy 0, policy_version 6260 (0.0006) [2023-03-07 03:30:56,961][118044] Updated weights for policy 0, policy_version 6270 (0.0006) [2023-03-07 03:30:57,743][118044] Updated weights for policy 0, policy_version 6280 (0.0005) [2023-03-07 03:30:58,517][118044] Updated weights for policy 0, policy_version 6290 (0.0006) [2023-03-07 03:30:59,308][118044] Updated weights for policy 0, policy_version 6300 (0.0006) [2023-03-07 03:31:00,092][118044] Updated weights for policy 0, policy_version 6310 (0.0006) [2023-03-07 03:31:00,858][118044] Updated weights for policy 0, policy_version 6320 (0.0006) [2023-03-07 03:31:01,085][117718] Fps is (10 sec: 13004.8, 60 sec: 13124.3, 300 sec: 13135.0). Total num frames: 6473728. Throughput: 0: 13134.9. Samples: 6453542. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:31:01,086][117718] Avg episode reward: [(0, '3237.420')] [2023-03-07 03:31:01,636][118044] Updated weights for policy 0, policy_version 6330 (0.0006) [2023-03-07 03:31:02,421][118044] Updated weights for policy 0, policy_version 6340 (0.0006) [2023-03-07 03:31:03,186][118044] Updated weights for policy 0, policy_version 6350 (0.0007) [2023-03-07 03:31:03,962][118044] Updated weights for policy 0, policy_version 6360 (0.0006) [2023-03-07 03:31:04,733][118044] Updated weights for policy 0, policy_version 6370 (0.0006) [2023-03-07 03:31:05,509][118044] Updated weights for policy 0, policy_version 6380 (0.0007) [2023-03-07 03:31:06,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13138.4). Total num frames: 6540288. Throughput: 0: 13128.1. Samples: 6532668. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:31:06,086][117718] Avg episode reward: [(0, '3281.986')] [2023-03-07 03:31:06,289][118044] Updated weights for policy 0, policy_version 6390 (0.0006) [2023-03-07 03:31:07,061][118044] Updated weights for policy 0, policy_version 6400 (0.0006) [2023-03-07 03:31:07,842][118044] Updated weights for policy 0, policy_version 6410 (0.0006) [2023-03-07 03:31:08,629][118044] Updated weights for policy 0, policy_version 6420 (0.0006) [2023-03-07 03:31:09,399][118044] Updated weights for policy 0, policy_version 6430 (0.0006) [2023-03-07 03:31:10,186][118044] Updated weights for policy 0, policy_version 6440 (0.0006) [2023-03-07 03:31:10,964][118044] Updated weights for policy 0, policy_version 6450 (0.0006) [2023-03-07 03:31:11,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 6605824. Throughput: 0: 13132.7. Samples: 6572133. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:31:11,086][117718] Avg episode reward: [(0, '3303.151')] [2023-03-07 03:31:11,746][118044] Updated weights for policy 0, policy_version 6460 (0.0007) [2023-03-07 03:31:12,526][118044] Updated weights for policy 0, policy_version 6470 (0.0005) [2023-03-07 03:31:13,313][118044] Updated weights for policy 0, policy_version 6480 (0.0006) [2023-03-07 03:31:14,109][118044] Updated weights for policy 0, policy_version 6490 (0.0006) [2023-03-07 03:31:14,882][118044] Updated weights for policy 0, policy_version 6500 (0.0007) [2023-03-07 03:31:15,646][118044] Updated weights for policy 0, policy_version 6510 (0.0006) [2023-03-07 03:31:16,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 6671360. Throughput: 0: 13132.0. Samples: 6650859. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:31:16,086][117718] Avg episode reward: [(0, '3267.111')] [2023-03-07 03:31:16,426][118044] Updated weights for policy 0, policy_version 6520 (0.0005) [2023-03-07 03:31:17,202][118044] Updated weights for policy 0, policy_version 6530 (0.0006) [2023-03-07 03:31:17,985][118044] Updated weights for policy 0, policy_version 6540 (0.0005) [2023-03-07 03:31:18,770][118044] Updated weights for policy 0, policy_version 6550 (0.0006) [2023-03-07 03:31:19,561][118044] Updated weights for policy 0, policy_version 6560 (0.0006) [2023-03-07 03:31:20,337][118044] Updated weights for policy 0, policy_version 6570 (0.0006) [2023-03-07 03:31:21,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13135.0). Total num frames: 6736896. Throughput: 0: 13136.7. Samples: 6729590. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:31:21,086][117718] Avg episode reward: [(0, '3250.391')] [2023-03-07 03:31:21,115][118044] Updated weights for policy 0, policy_version 6580 (0.0006) [2023-03-07 03:31:21,892][118044] Updated weights for policy 0, policy_version 6590 (0.0006) [2023-03-07 03:31:22,661][118044] Updated weights for policy 0, policy_version 6600 (0.0006) [2023-03-07 03:31:23,466][118044] Updated weights for policy 0, policy_version 6610 (0.0006) [2023-03-07 03:31:24,232][118044] Updated weights for policy 0, policy_version 6620 (0.0007) [2023-03-07 03:31:25,021][118044] Updated weights for policy 0, policy_version 6630 (0.0007) [2023-03-07 03:31:25,784][118044] Updated weights for policy 0, policy_version 6640 (0.0006) [2023-03-07 03:31:26,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 6802432. Throughput: 0: 13140.0. Samples: 6769035. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:31:26,086][117718] Avg episode reward: [(0, '3300.881')] [2023-03-07 03:31:26,563][118044] Updated weights for policy 0, policy_version 6650 (0.0007) [2023-03-07 03:31:27,350][118044] Updated weights for policy 0, policy_version 6660 (0.0006) [2023-03-07 03:31:28,137][118044] Updated weights for policy 0, policy_version 6670 (0.0006) [2023-03-07 03:31:28,917][118044] Updated weights for policy 0, policy_version 6680 (0.0006) [2023-03-07 03:31:29,708][118044] Updated weights for policy 0, policy_version 6690 (0.0006) [2023-03-07 03:31:30,486][118044] Updated weights for policy 0, policy_version 6700 (0.0006) [2023-03-07 03:31:31,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.2, 300 sec: 13135.0). Total num frames: 6867968. Throughput: 0: 13129.7. Samples: 6847730. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:31:31,086][117718] Avg episode reward: [(0, '3246.507')] [2023-03-07 03:31:31,267][118044] Updated weights for policy 0, policy_version 6710 (0.0006) [2023-03-07 03:31:32,037][118044] Updated weights for policy 0, policy_version 6720 (0.0006) [2023-03-07 03:31:32,815][118044] Updated weights for policy 0, policy_version 6730 (0.0006) [2023-03-07 03:31:33,592][118044] Updated weights for policy 0, policy_version 6740 (0.0006) [2023-03-07 03:31:34,382][118044] Updated weights for policy 0, policy_version 6750 (0.0006) [2023-03-07 03:31:35,166][118044] Updated weights for policy 0, policy_version 6760 (0.0007) [2023-03-07 03:31:35,942][118044] Updated weights for policy 0, policy_version 6770 (0.0006) [2023-03-07 03:31:36,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13124.3, 300 sec: 13135.0). Total num frames: 6933504. Throughput: 0: 13126.4. Samples: 6926520. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:31:36,086][117718] Avg episode reward: [(0, '3247.097')] [2023-03-07 03:31:36,727][118044] Updated weights for policy 0, policy_version 6780 (0.0006) [2023-03-07 03:31:37,493][118044] Updated weights for policy 0, policy_version 6790 (0.0006) [2023-03-07 03:31:38,281][118044] Updated weights for policy 0, policy_version 6800 (0.0006) [2023-03-07 03:31:39,083][118044] Updated weights for policy 0, policy_version 6810 (0.0006) [2023-03-07 03:31:39,882][118044] Updated weights for policy 0, policy_version 6820 (0.0006) [2023-03-07 03:31:40,660][118044] Updated weights for policy 0, policy_version 6830 (0.0007) [2023-03-07 03:31:41,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13131.5). Total num frames: 6999040. Throughput: 0: 13123.9. Samples: 6965609. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:31:41,086][117718] Avg episode reward: [(0, '3286.067')] [2023-03-07 03:31:41,417][118044] Updated weights for policy 0, policy_version 6840 (0.0007) [2023-03-07 03:31:42,189][118044] Updated weights for policy 0, policy_version 6850 (0.0006) [2023-03-07 03:31:42,969][118044] Updated weights for policy 0, policy_version 6860 (0.0007) [2023-03-07 03:31:43,746][118044] Updated weights for policy 0, policy_version 6870 (0.0006) [2023-03-07 03:31:44,541][118044] Updated weights for policy 0, policy_version 6880 (0.0006) [2023-03-07 03:31:45,317][118044] Updated weights for policy 0, policy_version 6890 (0.0006) [2023-03-07 03:31:46,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13131.5). Total num frames: 7064576. Throughput: 0: 13131.2. Samples: 7044446. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:31:46,086][117718] Avg episode reward: [(0, '3103.665')] [2023-03-07 03:31:46,098][118044] Updated weights for policy 0, policy_version 6900 (0.0006) [2023-03-07 03:31:46,886][118044] Updated weights for policy 0, policy_version 6910 (0.0007) [2023-03-07 03:31:47,657][118044] Updated weights for policy 0, policy_version 6920 (0.0006) [2023-03-07 03:31:48,441][118044] Updated weights for policy 0, policy_version 6930 (0.0005) [2023-03-07 03:31:49,257][118044] Updated weights for policy 0, policy_version 6940 (0.0006) [2023-03-07 03:31:50,029][118044] Updated weights for policy 0, policy_version 6950 (0.0008) [2023-03-07 03:31:50,813][118044] Updated weights for policy 0, policy_version 6960 (0.0007) [2023-03-07 03:31:51,086][117718] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13128.0). Total num frames: 7130112. Throughput: 0: 13116.8. Samples: 7122926. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:31:51,086][117718] Avg episode reward: [(0, '3113.280')] [2023-03-07 03:31:51,595][118044] Updated weights for policy 0, policy_version 6970 (0.0007) [2023-03-07 03:31:52,363][118044] Updated weights for policy 0, policy_version 6980 (0.0007) [2023-03-07 03:31:53,163][118044] Updated weights for policy 0, policy_version 6990 (0.0005) [2023-03-07 03:31:53,942][118044] Updated weights for policy 0, policy_version 7000 (0.0006) [2023-03-07 03:31:54,717][118044] Updated weights for policy 0, policy_version 7010 (0.0006) [2023-03-07 03:31:55,515][118044] Updated weights for policy 0, policy_version 7020 (0.0006) [2023-03-07 03:31:56,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13128.0). Total num frames: 7195648. Throughput: 0: 13106.9. Samples: 7161941. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:31:56,086][117718] Avg episode reward: [(0, '3079.032')] [2023-03-07 03:31:56,300][118044] Updated weights for policy 0, policy_version 7030 (0.0006) [2023-03-07 03:31:57,067][118044] Updated weights for policy 0, policy_version 7040 (0.0006) [2023-03-07 03:31:57,863][118044] Updated weights for policy 0, policy_version 7050 (0.0006) [2023-03-07 03:31:58,658][118044] Updated weights for policy 0, policy_version 7060 (0.0006) [2023-03-07 03:31:59,438][118044] Updated weights for policy 0, policy_version 7070 (0.0006) [2023-03-07 03:32:00,202][118044] Updated weights for policy 0, policy_version 7080 (0.0006) [2023-03-07 03:32:00,984][118044] Updated weights for policy 0, policy_version 7090 (0.0006) [2023-03-07 03:32:01,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13128.0). Total num frames: 7261184. Throughput: 0: 13103.5. Samples: 7240514. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:32:01,086][117718] Avg episode reward: [(0, '3029.987')] [2023-03-07 03:32:01,759][118044] Updated weights for policy 0, policy_version 7100 (0.0007) [2023-03-07 03:32:02,543][118044] Updated weights for policy 0, policy_version 7110 (0.0006) [2023-03-07 03:32:03,335][118044] Updated weights for policy 0, policy_version 7120 (0.0006) [2023-03-07 03:32:04,120][118044] Updated weights for policy 0, policy_version 7130 (0.0007) [2023-03-07 03:32:04,916][118044] Updated weights for policy 0, policy_version 7140 (0.0006) [2023-03-07 03:32:05,670][118044] Updated weights for policy 0, policy_version 7150 (0.0006) [2023-03-07 03:32:06,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13128.0). Total num frames: 7326720. Throughput: 0: 13102.0. Samples: 7319180. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:32:06,086][117718] Avg episode reward: [(0, '2976.717')] [2023-03-07 03:32:06,454][118044] Updated weights for policy 0, policy_version 7160 (0.0006) [2023-03-07 03:32:07,246][118044] Updated weights for policy 0, policy_version 7170 (0.0006) [2023-03-07 03:32:08,022][118044] Updated weights for policy 0, policy_version 7180 (0.0006) [2023-03-07 03:32:08,818][118044] Updated weights for policy 0, policy_version 7190 (0.0007) [2023-03-07 03:32:09,598][118044] Updated weights for policy 0, policy_version 7200 (0.0006) [2023-03-07 03:32:10,381][118044] Updated weights for policy 0, policy_version 7210 (0.0006) [2023-03-07 03:32:11,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13128.0). Total num frames: 7392256. Throughput: 0: 13097.5. Samples: 7358420. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:32:11,086][117718] Avg episode reward: [(0, '2902.757')] [2023-03-07 03:32:11,142][118044] Updated weights for policy 0, policy_version 7220 (0.0006) [2023-03-07 03:32:11,922][118044] Updated weights for policy 0, policy_version 7230 (0.0006) [2023-03-07 03:32:12,716][118044] Updated weights for policy 0, policy_version 7240 (0.0007) [2023-03-07 03:32:13,478][118044] Updated weights for policy 0, policy_version 7250 (0.0006) [2023-03-07 03:32:14,264][118044] Updated weights for policy 0, policy_version 7260 (0.0006) [2023-03-07 03:32:15,048][118044] Updated weights for policy 0, policy_version 7270 (0.0007) [2023-03-07 03:32:15,838][118044] Updated weights for policy 0, policy_version 7280 (0.0006) [2023-03-07 03:32:16,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13124.6). Total num frames: 7457792. Throughput: 0: 13094.7. Samples: 7436990. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:32:16,086][117718] Avg episode reward: [(0, '3009.534')] [2023-03-07 03:32:16,646][118044] Updated weights for policy 0, policy_version 7290 (0.0006) [2023-03-07 03:32:17,402][118044] Updated weights for policy 0, policy_version 7300 (0.0006) [2023-03-07 03:32:18,183][118044] Updated weights for policy 0, policy_version 7310 (0.0006) [2023-03-07 03:32:18,973][118044] Updated weights for policy 0, policy_version 7320 (0.0006) [2023-03-07 03:32:19,752][118044] Updated weights for policy 0, policy_version 7330 (0.0007) [2023-03-07 03:32:20,541][118044] Updated weights for policy 0, policy_version 7340 (0.0006) [2023-03-07 03:32:21,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13124.6). Total num frames: 7523328. Throughput: 0: 13088.3. Samples: 7515490. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:32:21,086][117718] Avg episode reward: [(0, '2820.845')] [2023-03-07 03:32:21,316][118044] Updated weights for policy 0, policy_version 7350 (0.0006) [2023-03-07 03:32:22,090][118044] Updated weights for policy 0, policy_version 7360 (0.0006) [2023-03-07 03:32:22,886][118044] Updated weights for policy 0, policy_version 7370 (0.0006) [2023-03-07 03:32:23,649][118044] Updated weights for policy 0, policy_version 7380 (0.0005) [2023-03-07 03:32:24,435][118044] Updated weights for policy 0, policy_version 7390 (0.0007) [2023-03-07 03:32:25,204][118044] Updated weights for policy 0, policy_version 7400 (0.0007) [2023-03-07 03:32:25,989][118044] Updated weights for policy 0, policy_version 7410 (0.0006) [2023-03-07 03:32:26,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13124.6). Total num frames: 7588864. Throughput: 0: 13092.3. Samples: 7554762. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:32:26,086][117718] Avg episode reward: [(0, '2922.635')] [2023-03-07 03:32:26,792][118044] Updated weights for policy 0, policy_version 7420 (0.0006) [2023-03-07 03:32:27,567][118044] Updated weights for policy 0, policy_version 7430 (0.0006) [2023-03-07 03:32:28,342][118044] Updated weights for policy 0, policy_version 7440 (0.0007) [2023-03-07 03:32:29,122][118044] Updated weights for policy 0, policy_version 7450 (0.0006) [2023-03-07 03:32:29,892][118044] Updated weights for policy 0, policy_version 7460 (0.0006) [2023-03-07 03:32:30,670][118044] Updated weights for policy 0, policy_version 7470 (0.0005) [2023-03-07 03:32:31,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13124.6). Total num frames: 7654400. Throughput: 0: 13089.5. Samples: 7633473. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:32:31,086][117718] Avg episode reward: [(0, '2766.509')] [2023-03-07 03:32:31,469][118044] Updated weights for policy 0, policy_version 7480 (0.0007) [2023-03-07 03:32:32,253][118044] Updated weights for policy 0, policy_version 7490 (0.0007) [2023-03-07 03:32:33,017][118044] Updated weights for policy 0, policy_version 7500 (0.0006) [2023-03-07 03:32:33,810][118044] Updated weights for policy 0, policy_version 7510 (0.0006) [2023-03-07 03:32:34,582][118044] Updated weights for policy 0, policy_version 7520 (0.0007) [2023-03-07 03:32:35,358][118044] Updated weights for policy 0, policy_version 7530 (0.0006) [2023-03-07 03:32:36,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13124.6). Total num frames: 7719936. Throughput: 0: 13100.2. Samples: 7712433. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:32:36,086][117718] Avg episode reward: [(0, '2881.657')] [2023-03-07 03:32:36,089][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000007539_7719936.pth... 
[2023-03-07 03:32:36,119][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000004464_4571136.pth [2023-03-07 03:32:36,151][118044] Updated weights for policy 0, policy_version 7540 (0.0006) [2023-03-07 03:32:36,946][118044] Updated weights for policy 0, policy_version 7550 (0.0006) [2023-03-07 03:32:37,721][118044] Updated weights for policy 0, policy_version 7560 (0.0006) [2023-03-07 03:32:38,507][118044] Updated weights for policy 0, policy_version 7570 (0.0006) [2023-03-07 03:32:39,276][118044] Updated weights for policy 0, policy_version 7580 (0.0007) [2023-03-07 03:32:40,041][118044] Updated weights for policy 0, policy_version 7590 (0.0006) [2023-03-07 03:32:40,809][118044] Updated weights for policy 0, policy_version 7600 (0.0006) [2023-03-07 03:32:41,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13124.6). Total num frames: 7785472. Throughput: 0: 13103.8. Samples: 7751613. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:32:41,086][117718] Avg episode reward: [(0, '2898.896')] [2023-03-07 03:32:41,582][118044] Updated weights for policy 0, policy_version 7610 (0.0006) [2023-03-07 03:32:42,365][118044] Updated weights for policy 0, policy_version 7620 (0.0006) [2023-03-07 03:32:43,133][118044] Updated weights for policy 0, policy_version 7630 (0.0006) [2023-03-07 03:32:43,921][118044] Updated weights for policy 0, policy_version 7640 (0.0006) [2023-03-07 03:32:44,703][118044] Updated weights for policy 0, policy_version 7650 (0.0007) [2023-03-07 03:32:45,475][118044] Updated weights for policy 0, policy_version 7660 (0.0006) [2023-03-07 03:32:46,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13124.6). Total num frames: 7851008. Throughput: 0: 13117.0. Samples: 7830779. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:32:46,086][117718] Avg episode reward: [(0, '2980.455')] [2023-03-07 03:32:46,262][118044] Updated weights for policy 0, policy_version 7670 (0.0006) [2023-03-07 03:32:47,029][118044] Updated weights for policy 0, policy_version 7680 (0.0006) [2023-03-07 03:32:47,802][118044] Updated weights for policy 0, policy_version 7690 (0.0006) [2023-03-07 03:32:48,587][118044] Updated weights for policy 0, policy_version 7700 (0.0006) [2023-03-07 03:32:49,348][118044] Updated weights for policy 0, policy_version 7710 (0.0006) [2023-03-07 03:32:50,128][118044] Updated weights for policy 0, policy_version 7720 (0.0006) [2023-03-07 03:32:50,920][118044] Updated weights for policy 0, policy_version 7730 (0.0006) [2023-03-07 03:32:51,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13124.3, 300 sec: 13128.0). Total num frames: 7917568. Throughput: 0: 13128.6. Samples: 7909965. Policy #0 lag: (min: 0.0, avg: 1.2, max: 4.0) [2023-03-07 03:32:51,086][117718] Avg episode reward: [(0, '2926.212')] [2023-03-07 03:32:51,681][118044] Updated weights for policy 0, policy_version 7740 (0.0007) [2023-03-07 03:32:52,488][118044] Updated weights for policy 0, policy_version 7750 (0.0006) [2023-03-07 03:32:53,248][118044] Updated weights for policy 0, policy_version 7760 (0.0007) [2023-03-07 03:32:54,027][118044] Updated weights for policy 0, policy_version 7770 (0.0007) [2023-03-07 03:32:54,790][118044] Updated weights for policy 0, policy_version 7780 (0.0005) [2023-03-07 03:32:55,571][118044] Updated weights for policy 0, policy_version 7790 (0.0006) [2023-03-07 03:32:56,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13124.2, 300 sec: 13128.0). Total num frames: 7983104. Throughput: 0: 13134.2. Samples: 7949461. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:32:56,086][117718] Avg episode reward: [(0, '2869.620')] [2023-03-07 03:32:56,356][118044] Updated weights for policy 0, policy_version 7800 (0.0006) [2023-03-07 03:32:57,130][118044] Updated weights for policy 0, policy_version 7810 (0.0005) [2023-03-07 03:32:57,912][118044] Updated weights for policy 0, policy_version 7820 (0.0007) [2023-03-07 03:32:58,697][118044] Updated weights for policy 0, policy_version 7830 (0.0006) [2023-03-07 03:32:59,477][118044] Updated weights for policy 0, policy_version 7840 (0.0006) [2023-03-07 03:33:00,240][118044] Updated weights for policy 0, policy_version 7850 (0.0007) [2023-03-07 03:33:01,009][118044] Updated weights for policy 0, policy_version 7860 (0.0006) [2023-03-07 03:33:01,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13141.4, 300 sec: 13131.5). Total num frames: 8049664. Throughput: 0: 13141.7. Samples: 8028367. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:33:01,086][117718] Avg episode reward: [(0, '2965.621')] [2023-03-07 03:33:01,786][118044] Updated weights for policy 0, policy_version 7870 (0.0007) [2023-03-07 03:33:02,567][118044] Updated weights for policy 0, policy_version 7880 (0.0006) [2023-03-07 03:33:03,329][118044] Updated weights for policy 0, policy_version 7890 (0.0006) [2023-03-07 03:33:04,115][118044] Updated weights for policy 0, policy_version 7900 (0.0007) [2023-03-07 03:33:04,897][118044] Updated weights for policy 0, policy_version 7910 (0.0007) [2023-03-07 03:33:05,690][118044] Updated weights for policy 0, policy_version 7920 (0.0006) [2023-03-07 03:33:06,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13128.0). Total num frames: 8115200. Throughput: 0: 13158.1. Samples: 8107606. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:33:06,086][117718] Avg episode reward: [(0, '3011.363')] [2023-03-07 03:33:06,449][118044] Updated weights for policy 0, policy_version 7930 (0.0006) [2023-03-07 03:33:07,239][118044] Updated weights for policy 0, policy_version 7940 (0.0007) [2023-03-07 03:33:08,019][118044] Updated weights for policy 0, policy_version 7950 (0.0006) [2023-03-07 03:33:08,817][118044] Updated weights for policy 0, policy_version 7960 (0.0006) [2023-03-07 03:33:09,606][118044] Updated weights for policy 0, policy_version 7970 (0.0007) [2023-03-07 03:33:10,376][118044] Updated weights for policy 0, policy_version 7980 (0.0006) [2023-03-07 03:33:11,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13128.0). Total num frames: 8180736. Throughput: 0: 13159.2. Samples: 8146924. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:33:11,086][117718] Avg episode reward: [(0, '2943.693')] [2023-03-07 03:33:11,139][118044] Updated weights for policy 0, policy_version 7990 (0.0006) [2023-03-07 03:33:11,924][118044] Updated weights for policy 0, policy_version 8000 (0.0007) [2023-03-07 03:33:12,708][118044] Updated weights for policy 0, policy_version 8010 (0.0006) [2023-03-07 03:33:13,491][118044] Updated weights for policy 0, policy_version 8020 (0.0006) [2023-03-07 03:33:14,260][118044] Updated weights for policy 0, policy_version 8030 (0.0006) [2023-03-07 03:33:15,034][118044] Updated weights for policy 0, policy_version 8040 (0.0006) [2023-03-07 03:33:15,802][118044] Updated weights for policy 0, policy_version 8050 (0.0006) [2023-03-07 03:33:16,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13128.0). Total num frames: 8246272. Throughput: 0: 13161.2. Samples: 8225726. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:33:16,097][117718] Avg episode reward: [(0, '2976.148')] [2023-03-07 03:33:16,589][118044] Updated weights for policy 0, policy_version 8060 (0.0006) [2023-03-07 03:33:17,365][118044] Updated weights for policy 0, policy_version 8070 (0.0006) [2023-03-07 03:33:18,134][118044] Updated weights for policy 0, policy_version 8080 (0.0006) [2023-03-07 03:33:18,910][118044] Updated weights for policy 0, policy_version 8090 (0.0006) [2023-03-07 03:33:19,674][118044] Updated weights for policy 0, policy_version 8100 (0.0006) [2023-03-07 03:33:20,466][118044] Updated weights for policy 0, policy_version 8110 (0.0006) [2023-03-07 03:33:21,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13131.5). Total num frames: 8312832. Throughput: 0: 13166.4. Samples: 8304920. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:33:21,098][117718] Avg episode reward: [(0, '2944.316')] [2023-03-07 03:33:21,238][118044] Updated weights for policy 0, policy_version 8120 (0.0007) [2023-03-07 03:33:22,011][118044] Updated weights for policy 0, policy_version 8130 (0.0006) [2023-03-07 03:33:22,808][118044] Updated weights for policy 0, policy_version 8140 (0.0006) [2023-03-07 03:33:23,584][118044] Updated weights for policy 0, policy_version 8150 (0.0006) [2023-03-07 03:33:24,369][118044] Updated weights for policy 0, policy_version 8160 (0.0005) [2023-03-07 03:33:25,144][118044] Updated weights for policy 0, policy_version 8170 (0.0006) [2023-03-07 03:33:25,921][118044] Updated weights for policy 0, policy_version 8180 (0.0006) [2023-03-07 03:33:26,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13131.5). Total num frames: 8378368. Throughput: 0: 13170.9. Samples: 8344302. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:33:26,097][117718] Avg episode reward: [(0, '2919.705')] [2023-03-07 03:33:26,715][118044] Updated weights for policy 0, policy_version 8190 (0.0006) [2023-03-07 03:33:27,491][118044] Updated weights for policy 0, policy_version 8200 (0.0006) [2023-03-07 03:33:28,273][118044] Updated weights for policy 0, policy_version 8210 (0.0006) [2023-03-07 03:33:29,038][118044] Updated weights for policy 0, policy_version 8220 (0.0006) [2023-03-07 03:33:29,823][118044] Updated weights for policy 0, policy_version 8230 (0.0005) [2023-03-07 03:33:30,597][118044] Updated weights for policy 0, policy_version 8240 (0.0006) [2023-03-07 03:33:31,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13158.4, 300 sec: 13131.5). Total num frames: 8443904. Throughput: 0: 13166.0. Samples: 8423249. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:33:31,097][117718] Avg episode reward: [(0, '2970.781')] [2023-03-07 03:33:31,382][118044] Updated weights for policy 0, policy_version 8250 (0.0006) [2023-03-07 03:33:32,148][118044] Updated weights for policy 0, policy_version 8260 (0.0006) [2023-03-07 03:33:32,943][118044] Updated weights for policy 0, policy_version 8270 (0.0006) [2023-03-07 03:33:33,732][118044] Updated weights for policy 0, policy_version 8280 (0.0007) [2023-03-07 03:33:34,522][118044] Updated weights for policy 0, policy_version 8290 (0.0006) [2023-03-07 03:33:35,297][118044] Updated weights for policy 0, policy_version 8300 (0.0006) [2023-03-07 03:33:36,083][118044] Updated weights for policy 0, policy_version 8310 (0.0006) [2023-03-07 03:33:36,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13158.4, 300 sec: 13131.5). Total num frames: 8509440. Throughput: 0: 13151.9. Samples: 8501801. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:33:36,097][117718] Avg episode reward: [(0, '3023.843')] [2023-03-07 03:33:36,843][118044] Updated weights for policy 0, policy_version 8320 (0.0005) [2023-03-07 03:33:37,630][118044] Updated weights for policy 0, policy_version 8330 (0.0006) [2023-03-07 03:33:38,398][118044] Updated weights for policy 0, policy_version 8340 (0.0006) [2023-03-07 03:33:39,186][118044] Updated weights for policy 0, policy_version 8350 (0.0007) [2023-03-07 03:33:39,974][118044] Updated weights for policy 0, policy_version 8360 (0.0006) [2023-03-07 03:33:40,755][118044] Updated weights for policy 0, policy_version 8370 (0.0006) [2023-03-07 03:33:41,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13131.5). Total num frames: 8574976. Throughput: 0: 13148.0. Samples: 8541120. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:33:41,097][117718] Avg episode reward: [(0, '3067.768')] [2023-03-07 03:33:41,527][118044] Updated weights for policy 0, policy_version 8380 (0.0006) [2023-03-07 03:33:42,314][118044] Updated weights for policy 0, policy_version 8390 (0.0006) [2023-03-07 03:33:43,090][118044] Updated weights for policy 0, policy_version 8400 (0.0006) [2023-03-07 03:33:43,897][118044] Updated weights for policy 0, policy_version 8410 (0.0007) [2023-03-07 03:33:44,691][118044] Updated weights for policy 0, policy_version 8420 (0.0006) [2023-03-07 03:33:45,465][118044] Updated weights for policy 0, policy_version 8430 (0.0006) [2023-03-07 03:33:46,085][117718] Fps is (10 sec: 13107.4, 60 sec: 13158.4, 300 sec: 13128.0). Total num frames: 8640512. Throughput: 0: 13137.5. Samples: 8619553. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:33:46,096][117718] Avg episode reward: [(0, '3036.069')] [2023-03-07 03:33:46,236][118044] Updated weights for policy 0, policy_version 8440 (0.0006) [2023-03-07 03:33:47,004][118044] Updated weights for policy 0, policy_version 8450 (0.0006) [2023-03-07 03:33:47,781][118044] Updated weights for policy 0, policy_version 8460 (0.0007) [2023-03-07 03:33:48,558][118044] Updated weights for policy 0, policy_version 8470 (0.0006) [2023-03-07 03:33:49,338][118044] Updated weights for policy 0, policy_version 8480 (0.0006) [2023-03-07 03:33:50,113][118044] Updated weights for policy 0, policy_version 8490 (0.0006) [2023-03-07 03:33:50,882][118044] Updated weights for policy 0, policy_version 8500 (0.0006) [2023-03-07 03:33:51,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13128.0). Total num frames: 8706048. Throughput: 0: 13133.2. Samples: 8698600. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:33:51,097][117718] Avg episode reward: [(0, '3063.723')] [2023-03-07 03:33:51,666][118044] Updated weights for policy 0, policy_version 8510 (0.0006) [2023-03-07 03:33:52,469][118044] Updated weights for policy 0, policy_version 8520 (0.0007) [2023-03-07 03:33:53,241][118044] Updated weights for policy 0, policy_version 8530 (0.0006) [2023-03-07 03:33:54,015][118044] Updated weights for policy 0, policy_version 8540 (0.0007) [2023-03-07 03:33:54,800][118044] Updated weights for policy 0, policy_version 8550 (0.0006) [2023-03-07 03:33:55,584][118044] Updated weights for policy 0, policy_version 8560 (0.0006) [2023-03-07 03:33:56,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13128.0). Total num frames: 8771584. Throughput: 0: 13131.4. Samples: 8737839. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:33:56,086][117718] Avg episode reward: [(0, '3128.795')] [2023-03-07 03:33:56,354][118044] Updated weights for policy 0, policy_version 8570 (0.0006) [2023-03-07 03:33:57,149][118044] Updated weights for policy 0, policy_version 8580 (0.0006) [2023-03-07 03:33:57,917][118044] Updated weights for policy 0, policy_version 8590 (0.0006) [2023-03-07 03:33:58,695][118044] Updated weights for policy 0, policy_version 8600 (0.0006) [2023-03-07 03:33:59,482][118044] Updated weights for policy 0, policy_version 8610 (0.0006) [2023-03-07 03:34:00,257][118044] Updated weights for policy 0, policy_version 8620 (0.0007) [2023-03-07 03:34:01,041][118044] Updated weights for policy 0, policy_version 8630 (0.0006) [2023-03-07 03:34:01,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.2, 300 sec: 13128.0). Total num frames: 8837120. Throughput: 0: 13134.1. Samples: 8816759. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:34:01,086][117718] Avg episode reward: [(0, '3136.420')] [2023-03-07 03:34:01,824][118044] Updated weights for policy 0, policy_version 8640 (0.0008) [2023-03-07 03:34:02,586][118044] Updated weights for policy 0, policy_version 8650 (0.0006) [2023-03-07 03:34:03,385][118044] Updated weights for policy 0, policy_version 8660 (0.0006) [2023-03-07 03:34:04,133][118044] Updated weights for policy 0, policy_version 8670 (0.0006) [2023-03-07 03:34:04,927][118044] Updated weights for policy 0, policy_version 8680 (0.0006) [2023-03-07 03:34:05,707][118044] Updated weights for policy 0, policy_version 8690 (0.0006) [2023-03-07 03:34:06,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13124.2, 300 sec: 13128.0). Total num frames: 8902656. Throughput: 0: 13125.4. Samples: 8895566. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:34:06,086][117718] Avg episode reward: [(0, '3064.640')] [2023-03-07 03:34:06,478][118044] Updated weights for policy 0, policy_version 8700 (0.0007) [2023-03-07 03:34:07,260][118044] Updated weights for policy 0, policy_version 8710 (0.0006) [2023-03-07 03:34:08,061][118044] Updated weights for policy 0, policy_version 8720 (0.0006) [2023-03-07 03:34:08,818][118044] Updated weights for policy 0, policy_version 8730 (0.0006) [2023-03-07 03:34:09,617][118044] Updated weights for policy 0, policy_version 8740 (0.0006) [2023-03-07 03:34:10,391][118044] Updated weights for policy 0, policy_version 8750 (0.0005) [2023-03-07 03:34:11,086][117718] Fps is (10 sec: 13209.7, 60 sec: 13141.3, 300 sec: 13131.5). Total num frames: 8969216. Throughput: 0: 13129.1. Samples: 8935111. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 03:34:11,086][117718] Avg episode reward: [(0, '3161.751')] [2023-03-07 03:34:11,165][118044] Updated weights for policy 0, policy_version 8760 (0.0006) [2023-03-07 03:34:11,947][118044] Updated weights for policy 0, policy_version 8770 (0.0006) [2023-03-07 03:34:12,726][118044] Updated weights for policy 0, policy_version 8780 (0.0006) [2023-03-07 03:34:13,497][118044] Updated weights for policy 0, policy_version 8790 (0.0006) [2023-03-07 03:34:14,267][118044] Updated weights for policy 0, policy_version 8800 (0.0005) [2023-03-07 03:34:15,063][118044] Updated weights for policy 0, policy_version 8810 (0.0006) [2023-03-07 03:34:15,833][118044] Updated weights for policy 0, policy_version 8820 (0.0006) [2023-03-07 03:34:16,085][117718] Fps is (10 sec: 13209.9, 60 sec: 13141.3, 300 sec: 13131.5). Total num frames: 9034752. Throughput: 0: 13128.3. Samples: 9014021. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 03:34:16,086][117718] Avg episode reward: [(0, '3111.465')] [2023-03-07 03:34:16,604][118044] Updated weights for policy 0, policy_version 8830 (0.0006) [2023-03-07 03:34:17,389][118044] Updated weights for policy 0, policy_version 8840 (0.0006) [2023-03-07 03:34:18,168][118044] Updated weights for policy 0, policy_version 8850 (0.0006) [2023-03-07 03:34:18,956][118044] Updated weights for policy 0, policy_version 8860 (0.0005) [2023-03-07 03:34:19,718][118044] Updated weights for policy 0, policy_version 8870 (0.0006) [2023-03-07 03:34:20,504][118044] Updated weights for policy 0, policy_version 8880 (0.0006) [2023-03-07 03:34:21,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.2, 300 sec: 13128.0). Total num frames: 9100288. Throughput: 0: 13136.0. Samples: 9092919. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:34:21,086][117718] Avg episode reward: [(0, '3068.855')] [2023-03-07 03:34:21,276][118044] Updated weights for policy 0, policy_version 8890 (0.0005) [2023-03-07 03:34:22,050][118044] Updated weights for policy 0, policy_version 8900 (0.0006) [2023-03-07 03:34:22,841][118044] Updated weights for policy 0, policy_version 8910 (0.0006) [2023-03-07 03:34:23,614][118044] Updated weights for policy 0, policy_version 8920 (0.0007) [2023-03-07 03:34:24,395][118044] Updated weights for policy 0, policy_version 8930 (0.0006) [2023-03-07 03:34:25,191][118044] Updated weights for policy 0, policy_version 8940 (0.0006) [2023-03-07 03:34:25,974][118044] Updated weights for policy 0, policy_version 8950 (0.0006) [2023-03-07 03:34:26,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13128.0). Total num frames: 9165824. Throughput: 0: 13138.3. Samples: 9132343. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:34:26,086][117718] Avg episode reward: [(0, '3062.987')] [2023-03-07 03:34:26,749][118044] Updated weights for policy 0, policy_version 8960 (0.0006) [2023-03-07 03:34:27,526][118044] Updated weights for policy 0, policy_version 8970 (0.0006) [2023-03-07 03:34:28,313][118044] Updated weights for policy 0, policy_version 8980 (0.0006) [2023-03-07 03:34:29,122][118044] Updated weights for policy 0, policy_version 8990 (0.0006) [2023-03-07 03:34:29,899][118044] Updated weights for policy 0, policy_version 9000 (0.0006) [2023-03-07 03:34:30,683][118044] Updated weights for policy 0, policy_version 9010 (0.0006) [2023-03-07 03:34:31,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13124.3, 300 sec: 13128.0). Total num frames: 9231360. Throughput: 0: 13136.4. Samples: 9210694. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:34:31,086][117718] Avg episode reward: [(0, '3129.788')] [2023-03-07 03:34:31,469][118044] Updated weights for policy 0, policy_version 9020 (0.0006) [2023-03-07 03:34:32,234][118044] Updated weights for policy 0, policy_version 9030 (0.0006) [2023-03-07 03:34:33,032][118044] Updated weights for policy 0, policy_version 9040 (0.0006) [2023-03-07 03:34:33,805][118044] Updated weights for policy 0, policy_version 9050 (0.0006) [2023-03-07 03:34:34,593][118044] Updated weights for policy 0, policy_version 9060 (0.0007) [2023-03-07 03:34:35,356][118044] Updated weights for policy 0, policy_version 9070 (0.0006) [2023-03-07 03:34:36,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13128.0). Total num frames: 9296896. Throughput: 0: 13127.1. Samples: 9289317. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:34:36,086][117718] Avg episode reward: [(0, '3042.913')] [2023-03-07 03:34:36,091][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000009079_9296896.pth... 
[2023-03-07 03:34:36,122][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000006002_6146048.pth [2023-03-07 03:34:36,149][118044] Updated weights for policy 0, policy_version 9080 (0.0006) [2023-03-07 03:34:36,928][118044] Updated weights for policy 0, policy_version 9090 (0.0006) [2023-03-07 03:34:37,677][118044] Updated weights for policy 0, policy_version 9100 (0.0006) [2023-03-07 03:34:38,480][118044] Updated weights for policy 0, policy_version 9110 (0.0007) [2023-03-07 03:34:39,247][118044] Updated weights for policy 0, policy_version 9120 (0.0006) [2023-03-07 03:34:40,026][118044] Updated weights for policy 0, policy_version 9130 (0.0006) [2023-03-07 03:34:40,817][118044] Updated weights for policy 0, policy_version 9140 (0.0006) [2023-03-07 03:34:41,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13128.0). Total num frames: 9362432. Throughput: 0: 13131.2. Samples: 9328743. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:34:41,086][117718] Avg episode reward: [(0, '3122.392')] [2023-03-07 03:34:41,614][118044] Updated weights for policy 0, policy_version 9150 (0.0006) [2023-03-07 03:34:42,382][118044] Updated weights for policy 0, policy_version 9160 (0.0006) [2023-03-07 03:34:43,138][118044] Updated weights for policy 0, policy_version 9170 (0.0006) [2023-03-07 03:34:43,945][118044] Updated weights for policy 0, policy_version 9180 (0.0007) [2023-03-07 03:34:44,712][118044] Updated weights for policy 0, policy_version 9190 (0.0006) [2023-03-07 03:34:45,486][118044] Updated weights for policy 0, policy_version 9200 (0.0006) [2023-03-07 03:34:46,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13131.5). Total num frames: 9427968. Throughput: 0: 13130.7. Samples: 9407640. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:34:46,086][117718] Avg episode reward: [(0, '2900.100')] [2023-03-07 03:34:46,273][118044] Updated weights for policy 0, policy_version 9210 (0.0006) [2023-03-07 03:34:47,057][118044] Updated weights for policy 0, policy_version 9220 (0.0006) [2023-03-07 03:34:47,850][118044] Updated weights for policy 0, policy_version 9230 (0.0006) [2023-03-07 03:34:48,629][118044] Updated weights for policy 0, policy_version 9240 (0.0006) [2023-03-07 03:34:49,396][118044] Updated weights for policy 0, policy_version 9250 (0.0007) [2023-03-07 03:34:50,156][118044] Updated weights for policy 0, policy_version 9260 (0.0007) [2023-03-07 03:34:50,935][118044] Updated weights for policy 0, policy_version 9270 (0.0007) [2023-03-07 03:34:51,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13128.0). Total num frames: 9493504. Throughput: 0: 13131.6. Samples: 9486483. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:34:51,086][117718] Avg episode reward: [(0, '3015.473')] [2023-03-07 03:34:51,721][118044] Updated weights for policy 0, policy_version 9280 (0.0006) [2023-03-07 03:34:52,484][118044] Updated weights for policy 0, policy_version 9290 (0.0006) [2023-03-07 03:34:53,261][118044] Updated weights for policy 0, policy_version 9300 (0.0005) [2023-03-07 03:34:54,054][118044] Updated weights for policy 0, policy_version 9310 (0.0006) [2023-03-07 03:34:54,837][118044] Updated weights for policy 0, policy_version 9320 (0.0006) [2023-03-07 03:34:55,621][118044] Updated weights for policy 0, policy_version 9330 (0.0006) [2023-03-07 03:34:56,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13124.3, 300 sec: 13128.0). Total num frames: 9559040. Throughput: 0: 13133.1. Samples: 9526100. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:34:56,086][117718] Avg episode reward: [(0, '2972.622')] [2023-03-07 03:34:56,402][118044] Updated weights for policy 0, policy_version 9340 (0.0007) [2023-03-07 03:34:57,186][118044] Updated weights for policy 0, policy_version 9350 (0.0006) [2023-03-07 03:34:57,988][118044] Updated weights for policy 0, policy_version 9360 (0.0006) [2023-03-07 03:34:58,756][118044] Updated weights for policy 0, policy_version 9370 (0.0006) [2023-03-07 03:34:59,520][118044] Updated weights for policy 0, policy_version 9380 (0.0006) [2023-03-07 03:35:00,289][118044] Updated weights for policy 0, policy_version 9390 (0.0006) [2023-03-07 03:35:01,085][118044] Updated weights for policy 0, policy_version 9400 (0.0006) [2023-03-07 03:35:01,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13141.4, 300 sec: 13131.5). Total num frames: 9625600. Throughput: 0: 13125.7. Samples: 9604676. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:35:01,086][117718] Avg episode reward: [(0, '3005.783')] [2023-03-07 03:35:01,858][118044] Updated weights for policy 0, policy_version 9410 (0.0006) [2023-03-07 03:35:02,639][118044] Updated weights for policy 0, policy_version 9420 (0.0006) [2023-03-07 03:35:03,430][118044] Updated weights for policy 0, policy_version 9430 (0.0006) [2023-03-07 03:35:04,233][118044] Updated weights for policy 0, policy_version 9440 (0.0006) [2023-03-07 03:35:04,997][118044] Updated weights for policy 0, policy_version 9450 (0.0006) [2023-03-07 03:35:05,784][118044] Updated weights for policy 0, policy_version 9460 (0.0007) [2023-03-07 03:35:06,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13128.0). Total num frames: 9690112. Throughput: 0: 13119.3. Samples: 9683288. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:35:06,086][117718] Avg episode reward: [(0, '3037.846')] [2023-03-07 03:35:06,552][118044] Updated weights for policy 0, policy_version 9470 (0.0006) [2023-03-07 03:35:07,328][118044] Updated weights for policy 0, policy_version 9480 (0.0006) [2023-03-07 03:35:08,106][118044] Updated weights for policy 0, policy_version 9490 (0.0006) [2023-03-07 03:35:08,882][118044] Updated weights for policy 0, policy_version 9500 (0.0007) [2023-03-07 03:35:09,669][118044] Updated weights for policy 0, policy_version 9510 (0.0006) [2023-03-07 03:35:10,456][118044] Updated weights for policy 0, policy_version 9520 (0.0006) [2023-03-07 03:35:11,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13124.3, 300 sec: 13131.5). Total num frames: 9756672. Throughput: 0: 13122.4. Samples: 9722853. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:35:11,086][117718] Avg episode reward: [(0, '3043.532')] [2023-03-07 03:35:11,217][118044] Updated weights for policy 0, policy_version 9530 (0.0006) [2023-03-07 03:35:12,000][118044] Updated weights for policy 0, policy_version 9540 (0.0006) [2023-03-07 03:35:12,772][118044] Updated weights for policy 0, policy_version 9550 (0.0007) [2023-03-07 03:35:13,547][118044] Updated weights for policy 0, policy_version 9560 (0.0006) [2023-03-07 03:35:14,323][118044] Updated weights for policy 0, policy_version 9570 (0.0007) [2023-03-07 03:35:15,115][118044] Updated weights for policy 0, policy_version 9580 (0.0006) [2023-03-07 03:35:15,897][118044] Updated weights for policy 0, policy_version 9590 (0.0006) [2023-03-07 03:35:16,086][117718] Fps is (10 sec: 13209.4, 60 sec: 13124.2, 300 sec: 13128.0). Total num frames: 9822208. Throughput: 0: 13137.7. Samples: 9801894. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:35:16,087][117718] Avg episode reward: [(0, '3089.093')] [2023-03-07 03:35:16,670][118044] Updated weights for policy 0, policy_version 9600 (0.0006) [2023-03-07 03:35:17,454][118044] Updated weights for policy 0, policy_version 9610 (0.0006) [2023-03-07 03:35:18,238][118044] Updated weights for policy 0, policy_version 9620 (0.0007) [2023-03-07 03:35:19,021][118044] Updated weights for policy 0, policy_version 9630 (0.0006) [2023-03-07 03:35:19,781][118044] Updated weights for policy 0, policy_version 9640 (0.0005) [2023-03-07 03:35:20,570][118044] Updated weights for policy 0, policy_version 9650 (0.0007) [2023-03-07 03:35:21,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13124.3, 300 sec: 13131.5). Total num frames: 9887744. Throughput: 0: 13141.2. Samples: 9880671. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:35:21,086][117718] Avg episode reward: [(0, '3125.355')] [2023-03-07 03:35:21,343][118044] Updated weights for policy 0, policy_version 9660 (0.0007) [2023-03-07 03:35:22,114][118044] Updated weights for policy 0, policy_version 9670 (0.0006) [2023-03-07 03:35:22,906][118044] Updated weights for policy 0, policy_version 9680 (0.0006) [2023-03-07 03:35:23,674][118044] Updated weights for policy 0, policy_version 9690 (0.0006) [2023-03-07 03:35:24,447][118044] Updated weights for policy 0, policy_version 9700 (0.0006) [2023-03-07 03:35:25,227][118044] Updated weights for policy 0, policy_version 9710 (0.0006) [2023-03-07 03:35:26,040][118044] Updated weights for policy 0, policy_version 9720 (0.0006) [2023-03-07 03:35:26,086][117718] Fps is (10 sec: 13107.3, 60 sec: 13124.2, 300 sec: 13128.0). Total num frames: 9953280. Throughput: 0: 13140.1. Samples: 9920047. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:35:26,086][117718] Avg episode reward: [(0, '3006.120')] [2023-03-07 03:35:26,814][118044] Updated weights for policy 0, policy_version 9730 (0.0006) [2023-03-07 03:35:27,597][118044] Updated weights for policy 0, policy_version 9740 (0.0006) [2023-03-07 03:35:28,370][118044] Updated weights for policy 0, policy_version 9750 (0.0006) [2023-03-07 03:35:29,166][118044] Updated weights for policy 0, policy_version 9760 (0.0006) [2023-03-07 03:35:29,950][118044] Updated weights for policy 0, policy_version 9770 (0.0006) [2023-03-07 03:35:30,700][118044] Updated weights for policy 0, policy_version 9780 (0.0006) [2023-03-07 03:35:31,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13128.0). Total num frames: 10018816. Throughput: 0: 13129.9. Samples: 9998486. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:35:31,097][117718] Avg episode reward: [(0, '2931.330')] [2023-03-07 03:35:31,492][118044] Updated weights for policy 0, policy_version 9790 (0.0006) [2023-03-07 03:35:32,273][118044] Updated weights for policy 0, policy_version 9800 (0.0005) [2023-03-07 03:35:33,062][118044] Updated weights for policy 0, policy_version 9810 (0.0005) [2023-03-07 03:35:33,827][118044] Updated weights for policy 0, policy_version 9820 (0.0007) [2023-03-07 03:35:34,604][118044] Updated weights for policy 0, policy_version 9830 (0.0006) [2023-03-07 03:35:35,385][118044] Updated weights for policy 0, policy_version 9840 (0.0006) [2023-03-07 03:35:36,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13128.0). Total num frames: 10084352. Throughput: 0: 13133.5. Samples: 10077491. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:35:36,096][117718] Avg episode reward: [(0, '2873.264')] [2023-03-07 03:35:36,157][118044] Updated weights for policy 0, policy_version 9850 (0.0006) [2023-03-07 03:35:36,931][118044] Updated weights for policy 0, policy_version 9860 (0.0006) [2023-03-07 03:35:37,712][118044] Updated weights for policy 0, policy_version 9870 (0.0006) [2023-03-07 03:35:38,488][118044] Updated weights for policy 0, policy_version 9880 (0.0006) [2023-03-07 03:35:39,268][118044] Updated weights for policy 0, policy_version 9890 (0.0007) [2023-03-07 03:35:40,047][118044] Updated weights for policy 0, policy_version 9900 (0.0006) [2023-03-07 03:35:40,825][118044] Updated weights for policy 0, policy_version 9910 (0.0007) [2023-03-07 03:35:41,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13131.5). Total num frames: 10150912. Throughput: 0: 13135.0. Samples: 10117175. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 03:35:41,096][117718] Avg episode reward: [(0, '3055.150')] [2023-03-07 03:35:41,610][118044] Updated weights for policy 0, policy_version 9920 (0.0007) [2023-03-07 03:35:42,384][118044] Updated weights for policy 0, policy_version 9930 (0.0006) [2023-03-07 03:35:43,165][118044] Updated weights for policy 0, policy_version 9940 (0.0006) [2023-03-07 03:35:43,947][118044] Updated weights for policy 0, policy_version 9950 (0.0006) [2023-03-07 03:35:44,708][118044] Updated weights for policy 0, policy_version 9960 (0.0006) [2023-03-07 03:35:45,488][118044] Updated weights for policy 0, policy_version 9970 (0.0006) [2023-03-07 03:35:46,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13128.0). Total num frames: 10216448. Throughput: 0: 13142.8. Samples: 10196102. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 03:35:46,097][117718] Avg episode reward: [(0, '3043.969')] [2023-03-07 03:35:46,273][118044] Updated weights for policy 0, policy_version 9980 (0.0006) [2023-03-07 03:35:47,040][118044] Updated weights for policy 0, policy_version 9990 (0.0006) [2023-03-07 03:35:47,813][118044] Updated weights for policy 0, policy_version 10000 (0.0007) [2023-03-07 03:35:48,592][118044] Updated weights for policy 0, policy_version 10010 (0.0006) [2023-03-07 03:35:49,374][118044] Updated weights for policy 0, policy_version 10020 (0.0006) [2023-03-07 03:35:50,141][118044] Updated weights for policy 0, policy_version 10030 (0.0006) [2023-03-07 03:35:50,923][118044] Updated weights for policy 0, policy_version 10040 (0.0006) [2023-03-07 03:35:51,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13135.0). Total num frames: 10283008. Throughput: 0: 13153.5. Samples: 10275196. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:35:51,086][117718] Avg episode reward: [(0, '3014.742')] [2023-03-07 03:35:51,698][118044] Updated weights for policy 0, policy_version 10050 (0.0008) [2023-03-07 03:35:52,484][118044] Updated weights for policy 0, policy_version 10060 (0.0006) [2023-03-07 03:35:53,273][118044] Updated weights for policy 0, policy_version 10070 (0.0006) [2023-03-07 03:35:54,060][118044] Updated weights for policy 0, policy_version 10080 (0.0006) [2023-03-07 03:35:54,837][118044] Updated weights for policy 0, policy_version 10090 (0.0007) [2023-03-07 03:35:55,616][118044] Updated weights for policy 0, policy_version 10100 (0.0007) [2023-03-07 03:35:56,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13135.0). Total num frames: 10348544. Throughput: 0: 13148.0. Samples: 10314512. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:35:56,086][117718] Avg episode reward: [(0, '3101.885')] [2023-03-07 03:35:56,412][118044] Updated weights for policy 0, policy_version 10110 (0.0006) [2023-03-07 03:35:57,174][118044] Updated weights for policy 0, policy_version 10120 (0.0005) [2023-03-07 03:35:57,941][118044] Updated weights for policy 0, policy_version 10130 (0.0007) [2023-03-07 03:35:58,728][118044] Updated weights for policy 0, policy_version 10140 (0.0006) [2023-03-07 03:35:59,503][118044] Updated weights for policy 0, policy_version 10150 (0.0006) [2023-03-07 03:36:00,290][118044] Updated weights for policy 0, policy_version 10160 (0.0007) [2023-03-07 03:36:01,070][118044] Updated weights for policy 0, policy_version 10170 (0.0006) [2023-03-07 03:36:01,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13131.5). Total num frames: 10414080. Throughput: 0: 13145.1. Samples: 10393423. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:36:01,086][117718] Avg episode reward: [(0, '3002.497')] [2023-03-07 03:36:01,858][118044] Updated weights for policy 0, policy_version 10180 (0.0006) [2023-03-07 03:36:02,625][118044] Updated weights for policy 0, policy_version 10190 (0.0007) [2023-03-07 03:36:03,409][118044] Updated weights for policy 0, policy_version 10200 (0.0006) [2023-03-07 03:36:04,184][118044] Updated weights for policy 0, policy_version 10210 (0.0006) [2023-03-07 03:36:04,961][118044] Updated weights for policy 0, policy_version 10220 (0.0006) [2023-03-07 03:36:05,740][118044] Updated weights for policy 0, policy_version 10230 (0.0006) [2023-03-07 03:36:06,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13131.5). Total num frames: 10479616. Throughput: 0: 13145.9. Samples: 10472238. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:36:06,086][117718] Avg episode reward: [(0, '2993.828')] [2023-03-07 03:36:06,535][118044] Updated weights for policy 0, policy_version 10240 (0.0007) [2023-03-07 03:36:07,295][118044] Updated weights for policy 0, policy_version 10250 (0.0006) [2023-03-07 03:36:08,074][118044] Updated weights for policy 0, policy_version 10260 (0.0006) [2023-03-07 03:36:08,854][118044] Updated weights for policy 0, policy_version 10270 (0.0006) [2023-03-07 03:36:09,633][118044] Updated weights for policy 0, policy_version 10280 (0.0006) [2023-03-07 03:36:10,391][118044] Updated weights for policy 0, policy_version 10290 (0.0007) [2023-03-07 03:36:11,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13131.5). Total num frames: 10545152. Throughput: 0: 13148.8. Samples: 10511742. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:36:11,086][117718] Avg episode reward: [(0, '3006.796')] [2023-03-07 03:36:11,177][118044] Updated weights for policy 0, policy_version 10300 (0.0006) [2023-03-07 03:36:11,971][118044] Updated weights for policy 0, policy_version 10310 (0.0006) [2023-03-07 03:36:12,751][118044] Updated weights for policy 0, policy_version 10320 (0.0006) [2023-03-07 03:36:13,518][118044] Updated weights for policy 0, policy_version 10330 (0.0006) [2023-03-07 03:36:14,314][118044] Updated weights for policy 0, policy_version 10340 (0.0005) [2023-03-07 03:36:15,098][118044] Updated weights for policy 0, policy_version 10350 (0.0006) [2023-03-07 03:36:15,874][118044] Updated weights for policy 0, policy_version 10360 (0.0006) [2023-03-07 03:36:16,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.4, 300 sec: 13131.5). Total num frames: 10610688. Throughput: 0: 13155.4. Samples: 10590480. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:36:16,086][117718] Avg episode reward: [(0, '2957.376')] [2023-03-07 03:36:16,655][118044] Updated weights for policy 0, policy_version 10370 (0.0006) [2023-03-07 03:36:17,448][118044] Updated weights for policy 0, policy_version 10380 (0.0006) [2023-03-07 03:36:18,245][118044] Updated weights for policy 0, policy_version 10390 (0.0006) [2023-03-07 03:36:19,008][118044] Updated weights for policy 0, policy_version 10400 (0.0006) [2023-03-07 03:36:19,776][118044] Updated weights for policy 0, policy_version 10410 (0.0006) [2023-03-07 03:36:20,554][118044] Updated weights for policy 0, policy_version 10420 (0.0006) [2023-03-07 03:36:21,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.4, 300 sec: 13131.5). Total num frames: 10676224. Throughput: 0: 13149.9. Samples: 10669234. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:36:21,086][117718] Avg episode reward: [(0, '2840.187')] [2023-03-07 03:36:21,348][118044] Updated weights for policy 0, policy_version 10430 (0.0006) [2023-03-07 03:36:22,131][118044] Updated weights for policy 0, policy_version 10440 (0.0006) [2023-03-07 03:36:22,917][118044] Updated weights for policy 0, policy_version 10450 (0.0007) [2023-03-07 03:36:23,701][118044] Updated weights for policy 0, policy_version 10460 (0.0006) [2023-03-07 03:36:24,469][118044] Updated weights for policy 0, policy_version 10470 (0.0007) [2023-03-07 03:36:25,246][118044] Updated weights for policy 0, policy_version 10480 (0.0006) [2023-03-07 03:36:26,045][118044] Updated weights for policy 0, policy_version 10490 (0.0006) [2023-03-07 03:36:26,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13131.5). Total num frames: 10741760. Throughput: 0: 13140.9. Samples: 10708517. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:36:26,086][117718] Avg episode reward: [(0, '2958.621')] [2023-03-07 03:36:26,807][118044] Updated weights for policy 0, policy_version 10500 (0.0006) [2023-03-07 03:36:27,570][118044] Updated weights for policy 0, policy_version 10510 (0.0006) [2023-03-07 03:36:28,350][118044] Updated weights for policy 0, policy_version 10520 (0.0006) [2023-03-07 03:36:29,117][118044] Updated weights for policy 0, policy_version 10530 (0.0006) [2023-03-07 03:36:29,892][118044] Updated weights for policy 0, policy_version 10540 (0.0006) [2023-03-07 03:36:30,692][118044] Updated weights for policy 0, policy_version 10550 (0.0007) [2023-03-07 03:36:31,086][117718] Fps is (10 sec: 13209.4, 60 sec: 13158.4, 300 sec: 13135.0). Total num frames: 10808320. Throughput: 0: 13142.8. Samples: 10787529. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:36:31,086][117718] Avg episode reward: [(0, '2905.054')] [2023-03-07 03:36:31,465][118044] Updated weights for policy 0, policy_version 10560 (0.0006) [2023-03-07 03:36:32,245][118044] Updated weights for policy 0, policy_version 10570 (0.0006) [2023-03-07 03:36:33,010][118044] Updated weights for policy 0, policy_version 10580 (0.0005) [2023-03-07 03:36:33,793][118044] Updated weights for policy 0, policy_version 10590 (0.0006) [2023-03-07 03:36:34,569][118044] Updated weights for policy 0, policy_version 10600 (0.0008) [2023-03-07 03:36:35,355][118044] Updated weights for policy 0, policy_version 10610 (0.0006) [2023-03-07 03:36:36,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13135.0). Total num frames: 10873856. Throughput: 0: 13137.5. Samples: 10866381. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:36:36,086][117718] Avg episode reward: [(0, '2900.800')] [2023-03-07 03:36:36,090][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000010619_10873856.pth... 
[2023-03-07 03:36:36,120][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000007539_7719936.pth [2023-03-07 03:36:36,141][118044] Updated weights for policy 0, policy_version 10620 (0.0005) [2023-03-07 03:36:36,916][118044] Updated weights for policy 0, policy_version 10630 (0.0006) [2023-03-07 03:36:37,691][118044] Updated weights for policy 0, policy_version 10640 (0.0005) [2023-03-07 03:36:38,477][118044] Updated weights for policy 0, policy_version 10650 (0.0006) [2023-03-07 03:36:39,246][118044] Updated weights for policy 0, policy_version 10660 (0.0006) [2023-03-07 03:36:40,023][118044] Updated weights for policy 0, policy_version 10670 (0.0006) [2023-03-07 03:36:40,811][118044] Updated weights for policy 0, policy_version 10680 (0.0006) [2023-03-07 03:36:41,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 10939392. Throughput: 0: 13138.6. Samples: 10905749. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:36:41,086][117718] Avg episode reward: [(0, '2929.373')] [2023-03-07 03:36:41,602][118044] Updated weights for policy 0, policy_version 10690 (0.0007) [2023-03-07 03:36:42,384][118044] Updated weights for policy 0, policy_version 10700 (0.0006) [2023-03-07 03:36:43,171][118044] Updated weights for policy 0, policy_version 10710 (0.0006) [2023-03-07 03:36:43,949][118044] Updated weights for policy 0, policy_version 10720 (0.0006) [2023-03-07 03:36:44,741][118044] Updated weights for policy 0, policy_version 10730 (0.0006) [2023-03-07 03:36:45,530][118044] Updated weights for policy 0, policy_version 10740 (0.0006) [2023-03-07 03:36:46,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 11004928. Throughput: 0: 13131.7. Samples: 10984350. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:36:46,086][117718] Avg episode reward: [(0, '2897.984')] [2023-03-07 03:36:46,317][118044] Updated weights for policy 0, policy_version 10750 (0.0006) [2023-03-07 03:36:47,088][118044] Updated weights for policy 0, policy_version 10760 (0.0006) [2023-03-07 03:36:47,884][118044] Updated weights for policy 0, policy_version 10770 (0.0007) [2023-03-07 03:36:48,661][118044] Updated weights for policy 0, policy_version 10780 (0.0006) [2023-03-07 03:36:49,429][118044] Updated weights for policy 0, policy_version 10790 (0.0007) [2023-03-07 03:36:50,226][118044] Updated weights for policy 0, policy_version 10800 (0.0007) [2023-03-07 03:36:50,998][118044] Updated weights for policy 0, policy_version 10810 (0.0006) [2023-03-07 03:36:51,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13135.0). Total num frames: 11070464. Throughput: 0: 13120.9. Samples: 11062679. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:36:51,086][117718] Avg episode reward: [(0, '2742.938')] [2023-03-07 03:36:51,771][118044] Updated weights for policy 0, policy_version 10820 (0.0006) [2023-03-07 03:36:52,558][118044] Updated weights for policy 0, policy_version 10830 (0.0006) [2023-03-07 03:36:53,336][118044] Updated weights for policy 0, policy_version 10840 (0.0006) [2023-03-07 03:36:54,129][118044] Updated weights for policy 0, policy_version 10850 (0.0006) [2023-03-07 03:36:54,900][118044] Updated weights for policy 0, policy_version 10860 (0.0006) [2023-03-07 03:36:55,664][118044] Updated weights for policy 0, policy_version 10870 (0.0006) [2023-03-07 03:36:56,085][117718] Fps is (10 sec: 13107.4, 60 sec: 13124.3, 300 sec: 13135.0). Total num frames: 11136000. Throughput: 0: 13120.5. Samples: 11102163. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:36:56,086][117718] Avg episode reward: [(0, '2671.789')] [2023-03-07 03:36:56,447][118044] Updated weights for policy 0, policy_version 10880 (0.0007) [2023-03-07 03:36:57,218][118044] Updated weights for policy 0, policy_version 10890 (0.0006) [2023-03-07 03:36:58,009][118044] Updated weights for policy 0, policy_version 10900 (0.0006) [2023-03-07 03:36:58,791][118044] Updated weights for policy 0, policy_version 10910 (0.0006) [2023-03-07 03:36:59,563][118044] Updated weights for policy 0, policy_version 10920 (0.0006) [2023-03-07 03:37:00,354][118044] Updated weights for policy 0, policy_version 10930 (0.0005) [2023-03-07 03:37:01,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13124.2, 300 sec: 13135.0). Total num frames: 11201536. Throughput: 0: 13125.1. Samples: 11181109. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:37:01,086][117718] Avg episode reward: [(0, '2837.046')] [2023-03-07 03:37:01,117][118044] Updated weights for policy 0, policy_version 10940 (0.0007) [2023-03-07 03:37:01,885][118044] Updated weights for policy 0, policy_version 10950 (0.0006) [2023-03-07 03:37:02,681][118044] Updated weights for policy 0, policy_version 10960 (0.0006) [2023-03-07 03:37:03,463][118044] Updated weights for policy 0, policy_version 10970 (0.0007) [2023-03-07 03:37:04,247][118044] Updated weights for policy 0, policy_version 10980 (0.0006) [2023-03-07 03:37:05,021][118044] Updated weights for policy 0, policy_version 10990 (0.0007) [2023-03-07 03:37:05,789][118044] Updated weights for policy 0, policy_version 11000 (0.0006) [2023-03-07 03:37:06,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13135.0). Total num frames: 11267072. Throughput: 0: 13125.6. Samples: 11259886. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:37:06,086][117718] Avg episode reward: [(0, '2943.193')] [2023-03-07 03:37:06,574][118044] Updated weights for policy 0, policy_version 11010 (0.0006) [2023-03-07 03:37:07,361][118044] Updated weights for policy 0, policy_version 11020 (0.0006) [2023-03-07 03:37:08,131][118044] Updated weights for policy 0, policy_version 11030 (0.0006) [2023-03-07 03:37:08,917][118044] Updated weights for policy 0, policy_version 11040 (0.0006) [2023-03-07 03:37:09,689][118044] Updated weights for policy 0, policy_version 11050 (0.0006) [2023-03-07 03:37:10,462][118044] Updated weights for policy 0, policy_version 11060 (0.0006) [2023-03-07 03:37:11,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.2, 300 sec: 13135.0). Total num frames: 11332608. Throughput: 0: 13126.4. Samples: 11299207. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:37:11,086][117718] Avg episode reward: [(0, '2956.363')] [2023-03-07 03:37:11,250][118044] Updated weights for policy 0, policy_version 11070 (0.0006) [2023-03-07 03:37:12,034][118044] Updated weights for policy 0, policy_version 11080 (0.0006) [2023-03-07 03:37:12,818][118044] Updated weights for policy 0, policy_version 11090 (0.0006) [2023-03-07 03:37:13,601][118044] Updated weights for policy 0, policy_version 11100 (0.0006) [2023-03-07 03:37:14,363][118044] Updated weights for policy 0, policy_version 11110 (0.0006) [2023-03-07 03:37:15,161][118044] Updated weights for policy 0, policy_version 11120 (0.0006) [2023-03-07 03:37:15,930][118044] Updated weights for policy 0, policy_version 11130 (0.0006) [2023-03-07 03:37:16,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13138.4). Total num frames: 11399168. Throughput: 0: 13124.7. Samples: 11378141. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:37:16,086][117718] Avg episode reward: [(0, '2795.346')] [2023-03-07 03:37:16,724][118044] Updated weights for policy 0, policy_version 11140 (0.0006) [2023-03-07 03:37:17,498][118044] Updated weights for policy 0, policy_version 11150 (0.0007) [2023-03-07 03:37:18,273][118044] Updated weights for policy 0, policy_version 11160 (0.0006) [2023-03-07 03:37:19,068][118044] Updated weights for policy 0, policy_version 11170 (0.0007) [2023-03-07 03:37:19,847][118044] Updated weights for policy 0, policy_version 11180 (0.0006) [2023-03-07 03:37:20,617][118044] Updated weights for policy 0, policy_version 11190 (0.0007) [2023-03-07 03:37:21,086][117718] Fps is (10 sec: 13209.7, 60 sec: 13141.3, 300 sec: 13138.4). Total num frames: 11464704. Throughput: 0: 13121.0. Samples: 11456826. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:37:21,086][117718] Avg episode reward: [(0, '2854.168')] [2023-03-07 03:37:21,383][118044] Updated weights for policy 0, policy_version 11200 (0.0006) [2023-03-07 03:37:22,179][118044] Updated weights for policy 0, policy_version 11210 (0.0006) [2023-03-07 03:37:22,963][118044] Updated weights for policy 0, policy_version 11220 (0.0008) [2023-03-07 03:37:23,740][118044] Updated weights for policy 0, policy_version 11230 (0.0006) [2023-03-07 03:37:24,520][118044] Updated weights for policy 0, policy_version 11240 (0.0007) [2023-03-07 03:37:25,294][118044] Updated weights for policy 0, policy_version 11250 (0.0007) [2023-03-07 03:37:26,065][118044] Updated weights for policy 0, policy_version 11260 (0.0006) [2023-03-07 03:37:26,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.4, 300 sec: 13138.4). Total num frames: 11530240. Throughput: 0: 13119.8. Samples: 11496137. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:37:26,086][117718] Avg episode reward: [(0, '2728.562')] [2023-03-07 03:37:26,869][118044] Updated weights for policy 0, policy_version 11270 (0.0007) [2023-03-07 03:37:27,645][118044] Updated weights for policy 0, policy_version 11280 (0.0006) [2023-03-07 03:37:28,419][118044] Updated weights for policy 0, policy_version 11290 (0.0006) [2023-03-07 03:37:29,189][118044] Updated weights for policy 0, policy_version 11300 (0.0007) [2023-03-07 03:37:29,988][118044] Updated weights for policy 0, policy_version 11310 (0.0006) [2023-03-07 03:37:30,756][118044] Updated weights for policy 0, policy_version 11320 (0.0006) [2023-03-07 03:37:31,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13138.4). Total num frames: 11595776. Throughput: 0: 13125.7. Samples: 11575006. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:37:31,086][117718] Avg episode reward: [(0, '2903.072')] [2023-03-07 03:37:31,531][118044] Updated weights for policy 0, policy_version 11330 (0.0006) [2023-03-07 03:37:32,310][118044] Updated weights for policy 0, policy_version 11340 (0.0007) [2023-03-07 03:37:33,104][118044] Updated weights for policy 0, policy_version 11350 (0.0006) [2023-03-07 03:37:33,881][118044] Updated weights for policy 0, policy_version 11360 (0.0006) [2023-03-07 03:37:34,667][118044] Updated weights for policy 0, policy_version 11370 (0.0007) [2023-03-07 03:37:35,447][118044] Updated weights for policy 0, policy_version 11380 (0.0006) [2023-03-07 03:37:36,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13138.4). Total num frames: 11661312. Throughput: 0: 13133.1. Samples: 11653668. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 03:37:36,086][117718] Avg episode reward: [(0, '2729.201')] [2023-03-07 03:37:36,234][118044] Updated weights for policy 0, policy_version 11390 (0.0006) [2023-03-07 03:37:37,020][118044] Updated weights for policy 0, policy_version 11400 (0.0006) [2023-03-07 03:37:37,806][118044] Updated weights for policy 0, policy_version 11410 (0.0006) [2023-03-07 03:37:38,581][118044] Updated weights for policy 0, policy_version 11420 (0.0006) [2023-03-07 03:37:39,368][118044] Updated weights for policy 0, policy_version 11430 (0.0005) [2023-03-07 03:37:40,146][118044] Updated weights for policy 0, policy_version 11440 (0.0006) [2023-03-07 03:37:40,907][118044] Updated weights for policy 0, policy_version 11450 (0.0006) [2023-03-07 03:37:41,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13138.4). Total num frames: 11726848. Throughput: 0: 13124.2. Samples: 11692753. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 03:37:41,086][117718] Avg episode reward: [(0, '2694.473')] [2023-03-07 03:37:41,697][118044] Updated weights for policy 0, policy_version 11460 (0.0006) [2023-03-07 03:37:42,466][118044] Updated weights for policy 0, policy_version 11470 (0.0006) [2023-03-07 03:37:43,245][118044] Updated weights for policy 0, policy_version 11480 (0.0006) [2023-03-07 03:37:44,041][118044] Updated weights for policy 0, policy_version 11490 (0.0006) [2023-03-07 03:37:44,811][118044] Updated weights for policy 0, policy_version 11500 (0.0006) [2023-03-07 03:37:45,609][118044] Updated weights for policy 0, policy_version 11510 (0.0007) [2023-03-07 03:37:46,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13135.0). Total num frames: 11792384. Throughput: 0: 13123.8. Samples: 11771679. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:37:46,086][117718] Avg episode reward: [(0, '2724.940')] [2023-03-07 03:37:46,374][118044] Updated weights for policy 0, policy_version 11520 (0.0006) [2023-03-07 03:37:47,171][118044] Updated weights for policy 0, policy_version 11530 (0.0006) [2023-03-07 03:37:47,942][118044] Updated weights for policy 0, policy_version 11540 (0.0006) [2023-03-07 03:37:48,727][118044] Updated weights for policy 0, policy_version 11550 (0.0005) [2023-03-07 03:37:49,533][118044] Updated weights for policy 0, policy_version 11560 (0.0006) [2023-03-07 03:37:50,290][118044] Updated weights for policy 0, policy_version 11570 (0.0005) [2023-03-07 03:37:51,074][118044] Updated weights for policy 0, policy_version 11580 (0.0006) [2023-03-07 03:37:51,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13135.0). Total num frames: 11857920. Throughput: 0: 13120.7. Samples: 11850316. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:37:51,086][117718] Avg episode reward: [(0, '2680.624')] [2023-03-07 03:37:51,842][118044] Updated weights for policy 0, policy_version 11590 (0.0006) [2023-03-07 03:37:52,612][118044] Updated weights for policy 0, policy_version 11600 (0.0006) [2023-03-07 03:37:53,395][118044] Updated weights for policy 0, policy_version 11610 (0.0008) [2023-03-07 03:37:54,199][118044] Updated weights for policy 0, policy_version 11620 (0.0005) [2023-03-07 03:37:54,958][118044] Updated weights for policy 0, policy_version 11630 (0.0005) [2023-03-07 03:37:55,760][118044] Updated weights for policy 0, policy_version 11640 (0.0007) [2023-03-07 03:37:56,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13124.3, 300 sec: 13131.5). Total num frames: 11923456. Throughput: 0: 13122.0. Samples: 11889694. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:37:56,086][117718] Avg episode reward: [(0, '2693.834')] [2023-03-07 03:37:56,547][118044] Updated weights for policy 0, policy_version 11650 (0.0006) [2023-03-07 03:37:57,314][118044] Updated weights for policy 0, policy_version 11660 (0.0005) [2023-03-07 03:37:58,091][118044] Updated weights for policy 0, policy_version 11670 (0.0007) [2023-03-07 03:37:58,873][118044] Updated weights for policy 0, policy_version 11680 (0.0006) [2023-03-07 03:37:59,662][118044] Updated weights for policy 0, policy_version 11690 (0.0006) [2023-03-07 03:38:00,424][118044] Updated weights for policy 0, policy_version 11700 (0.0006) [2023-03-07 03:38:01,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13124.3, 300 sec: 13131.5). Total num frames: 11988992. Throughput: 0: 13113.5. Samples: 11968251. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:38:01,086][117718] Avg episode reward: [(0, '2844.188')] [2023-03-07 03:38:01,218][118044] Updated weights for policy 0, policy_version 11710 (0.0006) [2023-03-07 03:38:02,004][118044] Updated weights for policy 0, policy_version 11720 (0.0007) [2023-03-07 03:38:02,774][118044] Updated weights for policy 0, policy_version 11730 (0.0006) [2023-03-07 03:38:03,568][118044] Updated weights for policy 0, policy_version 11740 (0.0006) [2023-03-07 03:38:04,348][118044] Updated weights for policy 0, policy_version 11750 (0.0006) [2023-03-07 03:38:05,101][118044] Updated weights for policy 0, policy_version 11760 (0.0006) [2023-03-07 03:38:05,890][118044] Updated weights for policy 0, policy_version 11770 (0.0007) [2023-03-07 03:38:06,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13131.5). Total num frames: 12054528. Throughput: 0: 13120.5. Samples: 12047248. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:38:06,086][117718] Avg episode reward: [(0, '2952.846')] [2023-03-07 03:38:06,678][118044] Updated weights for policy 0, policy_version 11780 (0.0006) [2023-03-07 03:38:07,443][118044] Updated weights for policy 0, policy_version 11790 (0.0006) [2023-03-07 03:38:08,220][118044] Updated weights for policy 0, policy_version 11800 (0.0006) [2023-03-07 03:38:09,003][118044] Updated weights for policy 0, policy_version 11810 (0.0006) [2023-03-07 03:38:09,777][118044] Updated weights for policy 0, policy_version 11820 (0.0007) [2023-03-07 03:38:10,566][118044] Updated weights for policy 0, policy_version 11830 (0.0006) [2023-03-07 03:38:11,085][117718] Fps is (10 sec: 13107.4, 60 sec: 13124.3, 300 sec: 13131.5). Total num frames: 12120064. Throughput: 0: 13124.3. Samples: 12086732. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:38:11,086][117718] Avg episode reward: [(0, '3034.874')] [2023-03-07 03:38:11,345][118044] Updated weights for policy 0, policy_version 11840 (0.0006) [2023-03-07 03:38:12,122][118044] Updated weights for policy 0, policy_version 11850 (0.0006) [2023-03-07 03:38:12,891][118044] Updated weights for policy 0, policy_version 11860 (0.0006) [2023-03-07 03:38:13,673][118044] Updated weights for policy 0, policy_version 11870 (0.0006) [2023-03-07 03:38:14,447][118044] Updated weights for policy 0, policy_version 11880 (0.0006) [2023-03-07 03:38:15,206][118044] Updated weights for policy 0, policy_version 11890 (0.0006) [2023-03-07 03:38:15,986][118044] Updated weights for policy 0, policy_version 11900 (0.0006) [2023-03-07 03:38:16,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13124.3, 300 sec: 13131.5). Total num frames: 12186624. Throughput: 0: 13127.3. Samples: 12165733. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:38:16,086][117718] Avg episode reward: [(0, '2883.787')] [2023-03-07 03:38:16,750][118044] Updated weights for policy 0, policy_version 11910 (0.0006) [2023-03-07 03:38:17,536][118044] Updated weights for policy 0, policy_version 11920 (0.0006) [2023-03-07 03:38:18,307][118044] Updated weights for policy 0, policy_version 11930 (0.0006) [2023-03-07 03:38:19,078][118044] Updated weights for policy 0, policy_version 11940 (0.0007) [2023-03-07 03:38:19,856][118044] Updated weights for policy 0, policy_version 11950 (0.0006) [2023-03-07 03:38:20,641][118044] Updated weights for policy 0, policy_version 11960 (0.0006) [2023-03-07 03:38:21,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13124.3, 300 sec: 13131.5). Total num frames: 12252160. Throughput: 0: 13143.2. Samples: 12245115. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:38:21,086][117718] Avg episode reward: [(0, '2798.827')] [2023-03-07 03:38:21,409][118044] Updated weights for policy 0, policy_version 11970 (0.0006) [2023-03-07 03:38:22,182][118044] Updated weights for policy 0, policy_version 11980 (0.0006) [2023-03-07 03:38:22,974][118044] Updated weights for policy 0, policy_version 11990 (0.0006) [2023-03-07 03:38:23,747][118044] Updated weights for policy 0, policy_version 12000 (0.0006) [2023-03-07 03:38:24,526][118044] Updated weights for policy 0, policy_version 12010 (0.0006) [2023-03-07 03:38:25,317][118044] Updated weights for policy 0, policy_version 12020 (0.0006) [2023-03-07 03:38:26,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13131.5). Total num frames: 12317696. Throughput: 0: 13151.3. Samples: 12284560. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:38:26,086][117718] Avg episode reward: [(0, '2818.227')] [2023-03-07 03:38:26,090][118044] Updated weights for policy 0, policy_version 12030 (0.0007) [2023-03-07 03:38:26,878][118044] Updated weights for policy 0, policy_version 12040 (0.0007) [2023-03-07 03:38:27,627][118044] Updated weights for policy 0, policy_version 12050 (0.0006) [2023-03-07 03:38:28,423][118044] Updated weights for policy 0, policy_version 12060 (0.0007) [2023-03-07 03:38:29,206][118044] Updated weights for policy 0, policy_version 12070 (0.0006) [2023-03-07 03:38:29,987][118044] Updated weights for policy 0, policy_version 12080 (0.0006) [2023-03-07 03:38:30,760][118044] Updated weights for policy 0, policy_version 12090 (0.0006) [2023-03-07 03:38:31,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 12384256. Throughput: 0: 13150.1. Samples: 12363434. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:38:31,086][117718] Avg episode reward: [(0, '2909.989')] [2023-03-07 03:38:31,548][118044] Updated weights for policy 0, policy_version 12100 (0.0006) [2023-03-07 03:38:32,312][118044] Updated weights for policy 0, policy_version 12110 (0.0007) [2023-03-07 03:38:33,082][118044] Updated weights for policy 0, policy_version 12120 (0.0006) [2023-03-07 03:38:33,866][118044] Updated weights for policy 0, policy_version 12130 (0.0006) [2023-03-07 03:38:34,639][118044] Updated weights for policy 0, policy_version 12140 (0.0006) [2023-03-07 03:38:35,416][118044] Updated weights for policy 0, policy_version 12150 (0.0006) [2023-03-07 03:38:36,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 12449792. Throughput: 0: 13158.7. Samples: 12442459. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:38:36,086][117718] Avg episode reward: [(0, '2841.113')] [2023-03-07 03:38:36,091][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000012158_12449792.pth... [2023-03-07 03:38:36,121][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000009079_9296896.pth [2023-03-07 03:38:36,208][118044] Updated weights for policy 0, policy_version 12160 (0.0006) [2023-03-07 03:38:36,990][118044] Updated weights for policy 0, policy_version 12170 (0.0006) [2023-03-07 03:38:37,753][118044] Updated weights for policy 0, policy_version 12180 (0.0006) [2023-03-07 03:38:38,542][118044] Updated weights for policy 0, policy_version 12190 (0.0006) [2023-03-07 03:38:39,302][118044] Updated weights for policy 0, policy_version 12200 (0.0007) [2023-03-07 03:38:40,094][118044] Updated weights for policy 0, policy_version 12210 (0.0007) [2023-03-07 03:38:40,869][118044] Updated weights for policy 0, policy_version 12220 (0.0006) [2023-03-07 03:38:41,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13138.4). Total num frames: 12516352. Throughput: 0: 13158.9. Samples: 12481844. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 03:38:41,086][117718] Avg episode reward: [(0, '2866.194')] [2023-03-07 03:38:41,628][118044] Updated weights for policy 0, policy_version 12230 (0.0006) [2023-03-07 03:38:42,422][118044] Updated weights for policy 0, policy_version 12240 (0.0005) [2023-03-07 03:38:43,203][118044] Updated weights for policy 0, policy_version 12250 (0.0006) [2023-03-07 03:38:43,958][118044] Updated weights for policy 0, policy_version 12260 (0.0006) [2023-03-07 03:38:44,742][118044] Updated weights for policy 0, policy_version 12270 (0.0006) [2023-03-07 03:38:45,516][118044] Updated weights for policy 0, policy_version 12280 (0.0006) [2023-03-07 03:38:46,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13138.4). Total num frames: 12581888. Throughput: 0: 13176.5. Samples: 12561192. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 03:38:46,086][117718] Avg episode reward: [(0, '2821.815')] [2023-03-07 03:38:46,286][118044] Updated weights for policy 0, policy_version 12290 (0.0006) [2023-03-07 03:38:47,047][118044] Updated weights for policy 0, policy_version 12300 (0.0006) [2023-03-07 03:38:47,829][118044] Updated weights for policy 0, policy_version 12310 (0.0006) [2023-03-07 03:38:48,618][118044] Updated weights for policy 0, policy_version 12320 (0.0007) [2023-03-07 03:38:49,384][118044] Updated weights for policy 0, policy_version 12330 (0.0006) [2023-03-07 03:38:50,160][118044] Updated weights for policy 0, policy_version 12340 (0.0008) [2023-03-07 03:38:50,928][118044] Updated weights for policy 0, policy_version 12350 (0.0006) [2023-03-07 03:38:51,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13138.4). Total num frames: 12647424. Throughput: 0: 13181.5. Samples: 12640415. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:38:51,086][117718] Avg episode reward: [(0, '2816.391')] [2023-03-07 03:38:51,704][118044] Updated weights for policy 0, policy_version 12360 (0.0006) [2023-03-07 03:38:52,485][118044] Updated weights for policy 0, policy_version 12370 (0.0006) [2023-03-07 03:38:53,246][118044] Updated weights for policy 0, policy_version 12380 (0.0006) [2023-03-07 03:38:54,036][118044] Updated weights for policy 0, policy_version 12390 (0.0007) [2023-03-07 03:38:54,818][118044] Updated weights for policy 0, policy_version 12400 (0.0006) [2023-03-07 03:38:55,594][118044] Updated weights for policy 0, policy_version 12410 (0.0006) [2023-03-07 03:38:56,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13175.4, 300 sec: 13141.9). Total num frames: 12713984. Throughput: 0: 13186.9. Samples: 12680144. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:38:56,086][117718] Avg episode reward: [(0, '2806.783')] [2023-03-07 03:38:56,371][118044] Updated weights for policy 0, policy_version 12420 (0.0006) [2023-03-07 03:38:57,145][118044] Updated weights for policy 0, policy_version 12430 (0.0006) [2023-03-07 03:38:57,929][118044] Updated weights for policy 0, policy_version 12440 (0.0006) [2023-03-07 03:38:58,703][118044] Updated weights for policy 0, policy_version 12450 (0.0006) [2023-03-07 03:38:59,494][118044] Updated weights for policy 0, policy_version 12460 (0.0006) [2023-03-07 03:39:00,264][118044] Updated weights for policy 0, policy_version 12470 (0.0006) [2023-03-07 03:39:01,031][118044] Updated weights for policy 0, policy_version 12480 (0.0006) [2023-03-07 03:39:01,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13141.9). Total num frames: 12779520. Throughput: 0: 13184.5. Samples: 12759037. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:39:01,086][117718] Avg episode reward: [(0, '2817.819')] [2023-03-07 03:39:01,810][118044] Updated weights for policy 0, policy_version 12490 (0.0007) [2023-03-07 03:39:02,594][118044] Updated weights for policy 0, policy_version 12500 (0.0006) [2023-03-07 03:39:03,364][118044] Updated weights for policy 0, policy_version 12510 (0.0006) [2023-03-07 03:39:04,127][118044] Updated weights for policy 0, policy_version 12520 (0.0006) [2023-03-07 03:39:04,894][118044] Updated weights for policy 0, policy_version 12530 (0.0006) [2023-03-07 03:39:05,694][118044] Updated weights for policy 0, policy_version 12540 (0.0006) [2023-03-07 03:39:06,085][117718] Fps is (10 sec: 13209.8, 60 sec: 13192.5, 300 sec: 13141.9). Total num frames: 12846080. Throughput: 0: 13186.5. Samples: 12838505. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 03:39:06,086][117718] Avg episode reward: [(0, '2933.006')] [2023-03-07 03:39:06,456][118044] Updated weights for policy 0, policy_version 12550 (0.0006) [2023-03-07 03:39:07,262][118044] Updated weights for policy 0, policy_version 12560 (0.0006) [2023-03-07 03:39:08,041][118044] Updated weights for policy 0, policy_version 12570 (0.0007) [2023-03-07 03:39:08,794][118044] Updated weights for policy 0, policy_version 12580 (0.0006) [2023-03-07 03:39:09,586][118044] Updated weights for policy 0, policy_version 12590 (0.0006) [2023-03-07 03:39:10,357][118044] Updated weights for policy 0, policy_version 12600 (0.0006) [2023-03-07 03:39:11,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13141.9). Total num frames: 12911616. Throughput: 0: 13185.7. Samples: 12877917. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 03:39:11,096][117718] Avg episode reward: [(0, '2977.592')] [2023-03-07 03:39:11,128][118044] Updated weights for policy 0, policy_version 12610 (0.0006) [2023-03-07 03:39:11,918][118044] Updated weights for policy 0, policy_version 12620 (0.0006) [2023-03-07 03:39:12,703][118044] Updated weights for policy 0, policy_version 12630 (0.0006) [2023-03-07 03:39:13,468][118044] Updated weights for policy 0, policy_version 12640 (0.0005) [2023-03-07 03:39:14,265][118044] Updated weights for policy 0, policy_version 12650 (0.0006) [2023-03-07 03:39:15,046][118044] Updated weights for policy 0, policy_version 12660 (0.0006) [2023-03-07 03:39:15,825][118044] Updated weights for policy 0, policy_version 12670 (0.0006) [2023-03-07 03:39:16,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13175.5, 300 sec: 13141.9). Total num frames: 12977152. Throughput: 0: 13180.3. Samples: 12956545. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:39:16,097][117718] Avg episode reward: [(0, '2878.806')] [2023-03-07 03:39:16,615][118044] Updated weights for policy 0, policy_version 12680 (0.0006) [2023-03-07 03:39:17,392][118044] Updated weights for policy 0, policy_version 12690 (0.0006) [2023-03-07 03:39:18,165][118044] Updated weights for policy 0, policy_version 12700 (0.0006) [2023-03-07 03:39:18,945][118044] Updated weights for policy 0, policy_version 12710 (0.0006) [2023-03-07 03:39:19,724][118044] Updated weights for policy 0, policy_version 12720 (0.0006) [2023-03-07 03:39:20,493][118044] Updated weights for policy 0, policy_version 12730 (0.0007) [2023-03-07 03:39:21,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13175.5, 300 sec: 13141.9). Total num frames: 13042688. Throughput: 0: 13176.7. Samples: 13035410. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:39:21,086][117718] Avg episode reward: [(0, '2960.502')] [2023-03-07 03:39:21,282][118044] Updated weights for policy 0, policy_version 12740 (0.0006) [2023-03-07 03:39:22,074][118044] Updated weights for policy 0, policy_version 12750 (0.0006) [2023-03-07 03:39:22,850][118044] Updated weights for policy 0, policy_version 12760 (0.0006) [2023-03-07 03:39:23,625][118044] Updated weights for policy 0, policy_version 12770 (0.0006) [2023-03-07 03:39:24,407][118044] Updated weights for policy 0, policy_version 12780 (0.0006) [2023-03-07 03:39:25,178][118044] Updated weights for policy 0, policy_version 12790 (0.0005) [2023-03-07 03:39:25,957][118044] Updated weights for policy 0, policy_version 12800 (0.0006) [2023-03-07 03:39:26,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13175.4, 300 sec: 13141.9). Total num frames: 13108224. Throughput: 0: 13175.4. Samples: 13074738. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:39:26,086][117718] Avg episode reward: [(0, '2813.811')] [2023-03-07 03:39:26,744][118044] Updated weights for policy 0, policy_version 12810 (0.0006) [2023-03-07 03:39:27,527][118044] Updated weights for policy 0, policy_version 12820 (0.0006) [2023-03-07 03:39:28,281][118044] Updated weights for policy 0, policy_version 12830 (0.0006) [2023-03-07 03:39:29,079][118044] Updated weights for policy 0, policy_version 12840 (0.0006) [2023-03-07 03:39:29,849][118044] Updated weights for policy 0, policy_version 12850 (0.0006) [2023-03-07 03:39:30,627][118044] Updated weights for policy 0, policy_version 12860 (0.0007) [2023-03-07 03:39:31,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13141.9). Total num frames: 13173760. Throughput: 0: 13165.6. Samples: 13153645. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:39:31,086][117718] Avg episode reward: [(0, '2924.544')] [2023-03-07 03:39:31,395][118044] Updated weights for policy 0, policy_version 12870 (0.0006) [2023-03-07 03:39:32,175][118044] Updated weights for policy 0, policy_version 12880 (0.0007) [2023-03-07 03:39:32,950][118044] Updated weights for policy 0, policy_version 12890 (0.0006) [2023-03-07 03:39:33,734][118044] Updated weights for policy 0, policy_version 12900 (0.0006) [2023-03-07 03:39:34,503][118044] Updated weights for policy 0, policy_version 12910 (0.0007) [2023-03-07 03:39:35,290][118044] Updated weights for policy 0, policy_version 12920 (0.0006) [2023-03-07 03:39:36,058][118044] Updated weights for policy 0, policy_version 12930 (0.0006) [2023-03-07 03:39:36,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13145.4). Total num frames: 13240320. Throughput: 0: 13162.3. Samples: 13232717. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:39:36,086][117718] Avg episode reward: [(0, '2792.037')] [2023-03-07 03:39:36,849][118044] Updated weights for policy 0, policy_version 12940 (0.0006) [2023-03-07 03:39:37,625][118044] Updated weights for policy 0, policy_version 12950 (0.0006) [2023-03-07 03:39:38,407][118044] Updated weights for policy 0, policy_version 12960 (0.0006) [2023-03-07 03:39:39,181][118044] Updated weights for policy 0, policy_version 12970 (0.0006) [2023-03-07 03:39:39,955][118044] Updated weights for policy 0, policy_version 12980 (0.0006) [2023-03-07 03:39:40,739][118044] Updated weights for policy 0, policy_version 12990 (0.0006) [2023-03-07 03:39:41,085][117718] Fps is (10 sec: 13209.8, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 13305856. Throughput: 0: 13155.7. Samples: 13272151. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:39:41,086][117718] Avg episode reward: [(0, '2901.773')] [2023-03-07 03:39:41,523][118044] Updated weights for policy 0, policy_version 13000 (0.0006) [2023-03-07 03:39:42,305][118044] Updated weights for policy 0, policy_version 13010 (0.0006) [2023-03-07 03:39:43,086][118044] Updated weights for policy 0, policy_version 13020 (0.0006) [2023-03-07 03:39:43,879][118044] Updated weights for policy 0, policy_version 13030 (0.0006) [2023-03-07 03:39:44,658][118044] Updated weights for policy 0, policy_version 13040 (0.0006) [2023-03-07 03:39:45,441][118044] Updated weights for policy 0, policy_version 13050 (0.0006) [2023-03-07 03:39:46,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 13371392. Throughput: 0: 13146.6. Samples: 13350635. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:39:46,086][117718] Avg episode reward: [(0, '2896.971')] [2023-03-07 03:39:46,214][118044] Updated weights for policy 0, policy_version 13060 (0.0007) [2023-03-07 03:39:46,994][118044] Updated weights for policy 0, policy_version 13070 (0.0006) [2023-03-07 03:39:47,761][118044] Updated weights for policy 0, policy_version 13080 (0.0005) [2023-03-07 03:39:48,543][118044] Updated weights for policy 0, policy_version 13090 (0.0006) [2023-03-07 03:39:49,343][118044] Updated weights for policy 0, policy_version 13100 (0.0006) [2023-03-07 03:39:50,118][118044] Updated weights for policy 0, policy_version 13110 (0.0006) [2023-03-07 03:39:50,882][118044] Updated weights for policy 0, policy_version 13120 (0.0006) [2023-03-07 03:39:51,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 13436928. Throughput: 0: 13134.9. Samples: 13429578. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:39:51,086][117718] Avg episode reward: [(0, '2851.899')] [2023-03-07 03:39:51,673][118044] Updated weights for policy 0, policy_version 13130 (0.0007) [2023-03-07 03:39:52,446][118044] Updated weights for policy 0, policy_version 13140 (0.0005) [2023-03-07 03:39:53,225][118044] Updated weights for policy 0, policy_version 13150 (0.0005) [2023-03-07 03:39:54,003][118044] Updated weights for policy 0, policy_version 13160 (0.0006) [2023-03-07 03:39:54,760][118044] Updated weights for policy 0, policy_version 13170 (0.0006) [2023-03-07 03:39:55,546][118044] Updated weights for policy 0, policy_version 13180 (0.0006) [2023-03-07 03:39:56,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.4, 300 sec: 13141.9). Total num frames: 13502464. Throughput: 0: 13132.2. Samples: 13468868. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:39:56,086][117718] Avg episode reward: [(0, '2817.547')] [2023-03-07 03:39:56,338][118044] Updated weights for policy 0, policy_version 13190 (0.0006) [2023-03-07 03:39:57,113][118044] Updated weights for policy 0, policy_version 13200 (0.0006) [2023-03-07 03:39:57,889][118044] Updated weights for policy 0, policy_version 13210 (0.0006) [2023-03-07 03:39:58,681][118044] Updated weights for policy 0, policy_version 13220 (0.0006) [2023-03-07 03:39:59,449][118044] Updated weights for policy 0, policy_version 13230 (0.0006) [2023-03-07 03:40:00,230][118044] Updated weights for policy 0, policy_version 13240 (0.0006) [2023-03-07 03:40:00,999][118044] Updated weights for policy 0, policy_version 13250 (0.0006) [2023-03-07 03:40:01,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 13568000. Throughput: 0: 13142.8. Samples: 13547972. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:40:01,086][117718] Avg episode reward: [(0, '2853.849')] [2023-03-07 03:40:01,800][118044] Updated weights for policy 0, policy_version 13260 (0.0006) [2023-03-07 03:40:02,570][118044] Updated weights for policy 0, policy_version 13270 (0.0006) [2023-03-07 03:40:03,361][118044] Updated weights for policy 0, policy_version 13280 (0.0006) [2023-03-07 03:40:04,129][118044] Updated weights for policy 0, policy_version 13290 (0.0007) [2023-03-07 03:40:04,921][118044] Updated weights for policy 0, policy_version 13300 (0.0006) [2023-03-07 03:40:05,675][118044] Updated weights for policy 0, policy_version 13310 (0.0007) [2023-03-07 03:40:06,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 13634560. Throughput: 0: 13145.8. Samples: 13626971. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:40:06,086][117718] Avg episode reward: [(0, '2797.378')] [2023-03-07 03:40:06,460][118044] Updated weights for policy 0, policy_version 13320 (0.0006) [2023-03-07 03:40:07,237][118044] Updated weights for policy 0, policy_version 13330 (0.0007) [2023-03-07 03:40:08,011][118044] Updated weights for policy 0, policy_version 13340 (0.0006) [2023-03-07 03:40:08,807][118044] Updated weights for policy 0, policy_version 13350 (0.0006) [2023-03-07 03:40:09,565][118044] Updated weights for policy 0, policy_version 13360 (0.0007) [2023-03-07 03:40:10,369][118044] Updated weights for policy 0, policy_version 13370 (0.0005) [2023-03-07 03:40:11,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 13700096. Throughput: 0: 13149.1. Samples: 13666447. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:40:11,086][117718] Avg episode reward: [(0, '2942.115')] [2023-03-07 03:40:11,161][118044] Updated weights for policy 0, policy_version 13380 (0.0007) [2023-03-07 03:40:11,928][118044] Updated weights for policy 0, policy_version 13390 (0.0006) [2023-03-07 03:40:12,691][118044] Updated weights for policy 0, policy_version 13400 (0.0006) [2023-03-07 03:40:13,488][118044] Updated weights for policy 0, policy_version 13410 (0.0006) [2023-03-07 03:40:14,266][118044] Updated weights for policy 0, policy_version 13420 (0.0006) [2023-03-07 03:40:15,042][118044] Updated weights for policy 0, policy_version 13430 (0.0006) [2023-03-07 03:40:15,817][118044] Updated weights for policy 0, policy_version 13440 (0.0006) [2023-03-07 03:40:16,086][117718] Fps is (10 sec: 13106.9, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 13765632. Throughput: 0: 13145.2. Samples: 13745179. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:40:16,086][117718] Avg episode reward: [(0, '2902.140')] [2023-03-07 03:40:16,613][118044] Updated weights for policy 0, policy_version 13450 (0.0006) [2023-03-07 03:40:17,393][118044] Updated weights for policy 0, policy_version 13460 (0.0006) [2023-03-07 03:40:18,184][118044] Updated weights for policy 0, policy_version 13470 (0.0006) [2023-03-07 03:40:18,949][118044] Updated weights for policy 0, policy_version 13480 (0.0006) [2023-03-07 03:40:19,721][118044] Updated weights for policy 0, policy_version 13490 (0.0006) [2023-03-07 03:40:20,509][118044] Updated weights for policy 0, policy_version 13500 (0.0006) [2023-03-07 03:40:21,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 13831168. Throughput: 0: 13134.4. Samples: 13823764. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:40:21,086][117718] Avg episode reward: [(0, '2725.083')] [2023-03-07 03:40:21,289][118044] Updated weights for policy 0, policy_version 13510 (0.0007) [2023-03-07 03:40:22,070][118044] Updated weights for policy 0, policy_version 13520 (0.0006) [2023-03-07 03:40:22,845][118044] Updated weights for policy 0, policy_version 13530 (0.0006) [2023-03-07 03:40:23,618][118044] Updated weights for policy 0, policy_version 13540 (0.0006) [2023-03-07 03:40:24,401][118044] Updated weights for policy 0, policy_version 13550 (0.0007) [2023-03-07 03:40:25,188][118044] Updated weights for policy 0, policy_version 13560 (0.0006) [2023-03-07 03:40:25,965][118044] Updated weights for policy 0, policy_version 13570 (0.0006) [2023-03-07 03:40:26,086][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 13896704. Throughput: 0: 13138.5. Samples: 13863386. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:40:26,086][117718] Avg episode reward: [(0, '2817.481')] [2023-03-07 03:40:26,735][118044] Updated weights for policy 0, policy_version 13580 (0.0006) [2023-03-07 03:40:27,505][118044] Updated weights for policy 0, policy_version 13590 (0.0006) [2023-03-07 03:40:28,304][118044] Updated weights for policy 0, policy_version 13600 (0.0006) [2023-03-07 03:40:29,070][118044] Updated weights for policy 0, policy_version 13610 (0.0006) [2023-03-07 03:40:29,846][118044] Updated weights for policy 0, policy_version 13620 (0.0005) [2023-03-07 03:40:30,633][118044] Updated weights for policy 0, policy_version 13630 (0.0007) [2023-03-07 03:40:31,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 13962240. Throughput: 0: 13144.1. Samples: 13942119. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:40:31,086][117718] Avg episode reward: [(0, '2751.663')] [2023-03-07 03:40:31,403][118044] Updated weights for policy 0, policy_version 13640 (0.0006) [2023-03-07 03:40:32,170][118044] Updated weights for policy 0, policy_version 13650 (0.0006) [2023-03-07 03:40:32,947][118044] Updated weights for policy 0, policy_version 13660 (0.0006) [2023-03-07 03:40:33,737][118044] Updated weights for policy 0, policy_version 13670 (0.0007) [2023-03-07 03:40:34,507][118044] Updated weights for policy 0, policy_version 13680 (0.0006) [2023-03-07 03:40:35,296][118044] Updated weights for policy 0, policy_version 13690 (0.0005) [2023-03-07 03:40:36,086][117718] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13141.9). Total num frames: 14027776. Throughput: 0: 13142.7. Samples: 14021001. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:40:36,086][117718] Avg episode reward: [(0, '2652.685')] [2023-03-07 03:40:36,090][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000013700_14028800.pth... 
[2023-03-07 03:40:36,091][118044] Updated weights for policy 0, policy_version 13700 (0.0006) [2023-03-07 03:40:36,121][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000010619_10873856.pth [2023-03-07 03:40:36,857][118044] Updated weights for policy 0, policy_version 13710 (0.0007) [2023-03-07 03:40:37,661][118044] Updated weights for policy 0, policy_version 13720 (0.0006) [2023-03-07 03:40:38,449][118044] Updated weights for policy 0, policy_version 13730 (0.0006) [2023-03-07 03:40:39,230][118044] Updated weights for policy 0, policy_version 13740 (0.0006) [2023-03-07 03:40:40,004][118044] Updated weights for policy 0, policy_version 13750 (0.0006) [2023-03-07 03:40:40,798][118044] Updated weights for policy 0, policy_version 13760 (0.0006) [2023-03-07 03:40:41,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13141.9). Total num frames: 14093312. Throughput: 0: 13136.7. Samples: 14060021. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:40:41,086][117718] Avg episode reward: [(0, '2774.685')] [2023-03-07 03:40:41,570][118044] Updated weights for policy 0, policy_version 13770 (0.0007) [2023-03-07 03:40:42,369][118044] Updated weights for policy 0, policy_version 13780 (0.0006) [2023-03-07 03:40:43,151][118044] Updated weights for policy 0, policy_version 13790 (0.0006) [2023-03-07 03:40:43,935][118044] Updated weights for policy 0, policy_version 13800 (0.0006) [2023-03-07 03:40:44,723][118044] Updated weights for policy 0, policy_version 13810 (0.0006) [2023-03-07 03:40:45,515][118044] Updated weights for policy 0, policy_version 13820 (0.0006) [2023-03-07 03:40:46,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13138.4). Total num frames: 14158848. Throughput: 0: 13120.4. Samples: 14138392. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:40:46,086][117718] Avg episode reward: [(0, '2903.279')] [2023-03-07 03:40:46,294][118044] Updated weights for policy 0, policy_version 13830 (0.0007) [2023-03-07 03:40:47,065][118044] Updated weights for policy 0, policy_version 13840 (0.0006) [2023-03-07 03:40:47,859][118044] Updated weights for policy 0, policy_version 13850 (0.0005) [2023-03-07 03:40:48,629][118044] Updated weights for policy 0, policy_version 13860 (0.0006) [2023-03-07 03:40:49,405][118044] Updated weights for policy 0, policy_version 13870 (0.0005) [2023-03-07 03:40:50,175][118044] Updated weights for policy 0, policy_version 13880 (0.0005) [2023-03-07 03:40:50,953][118044] Updated weights for policy 0, policy_version 13890 (0.0006) [2023-03-07 03:40:51,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13124.3, 300 sec: 13138.4). Total num frames: 14224384. Throughput: 0: 13114.5. Samples: 14217123. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:40:51,086][117718] Avg episode reward: [(0, '2775.553')] [2023-03-07 03:40:51,756][118044] Updated weights for policy 0, policy_version 13900 (0.0006) [2023-03-07 03:40:52,530][118044] Updated weights for policy 0, policy_version 13910 (0.0006) [2023-03-07 03:40:53,303][118044] Updated weights for policy 0, policy_version 13920 (0.0006) [2023-03-07 03:40:54,076][118044] Updated weights for policy 0, policy_version 13930 (0.0006) [2023-03-07 03:40:54,865][118044] Updated weights for policy 0, policy_version 13940 (0.0007) [2023-03-07 03:40:55,649][118044] Updated weights for policy 0, policy_version 13950 (0.0007) [2023-03-07 03:40:56,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13138.4). Total num frames: 14289920. Throughput: 0: 13112.3. Samples: 14256501. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:40:56,086][117718] Avg episode reward: [(0, '2749.510')] [2023-03-07 03:40:56,441][118044] Updated weights for policy 0, policy_version 13960 (0.0006) [2023-03-07 03:40:57,214][118044] Updated weights for policy 0, policy_version 13970 (0.0006) [2023-03-07 03:40:57,990][118044] Updated weights for policy 0, policy_version 13980 (0.0006) [2023-03-07 03:40:58,762][118044] Updated weights for policy 0, policy_version 13990 (0.0006) [2023-03-07 03:40:59,542][118044] Updated weights for policy 0, policy_version 14000 (0.0006) [2023-03-07 03:41:00,329][118044] Updated weights for policy 0, policy_version 14010 (0.0006) [2023-03-07 03:41:01,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13124.2, 300 sec: 13138.4). Total num frames: 14355456. Throughput: 0: 13114.8. Samples: 14335343. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:41:01,086][117718] Avg episode reward: [(0, '2727.522')] [2023-03-07 03:41:01,096][118044] Updated weights for policy 0, policy_version 14020 (0.0006) [2023-03-07 03:41:01,870][118044] Updated weights for policy 0, policy_version 14030 (0.0005) [2023-03-07 03:41:02,654][118044] Updated weights for policy 0, policy_version 14040 (0.0005) [2023-03-07 03:41:03,438][118044] Updated weights for policy 0, policy_version 14050 (0.0005) [2023-03-07 03:41:04,235][118044] Updated weights for policy 0, policy_version 14060 (0.0006) [2023-03-07 03:41:05,009][118044] Updated weights for policy 0, policy_version 14070 (0.0008) [2023-03-07 03:41:05,782][118044] Updated weights for policy 0, policy_version 14080 (0.0006) [2023-03-07 03:41:06,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13138.4). Total num frames: 14420992. Throughput: 0: 13120.9. Samples: 14414205. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:41:06,086][117718] Avg episode reward: [(0, '2768.661')] [2023-03-07 03:41:06,536][118044] Updated weights for policy 0, policy_version 14090 (0.0006) [2023-03-07 03:41:07,336][118044] Updated weights for policy 0, policy_version 14100 (0.0006) [2023-03-07 03:41:08,102][118044] Updated weights for policy 0, policy_version 14110 (0.0006) [2023-03-07 03:41:08,881][118044] Updated weights for policy 0, policy_version 14120 (0.0006) [2023-03-07 03:41:09,664][118044] Updated weights for policy 0, policy_version 14130 (0.0007) [2023-03-07 03:41:10,440][118044] Updated weights for policy 0, policy_version 14140 (0.0006) [2023-03-07 03:41:11,085][117718] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13138.4). Total num frames: 14486528. Throughput: 0: 13118.4. Samples: 14453712. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:41:11,086][117718] Avg episode reward: [(0, '2799.982')] [2023-03-07 03:41:11,227][118044] Updated weights for policy 0, policy_version 14150 (0.0005) [2023-03-07 03:41:11,990][118044] Updated weights for policy 0, policy_version 14160 (0.0006) [2023-03-07 03:41:12,784][118044] Updated weights for policy 0, policy_version 14170 (0.0007) [2023-03-07 03:41:13,547][118044] Updated weights for policy 0, policy_version 14180 (0.0005) [2023-03-07 03:41:14,326][118044] Updated weights for policy 0, policy_version 14190 (0.0006) [2023-03-07 03:41:15,120][118044] Updated weights for policy 0, policy_version 14200 (0.0007) [2023-03-07 03:41:15,905][118044] Updated weights for policy 0, policy_version 14210 (0.0006) [2023-03-07 03:41:16,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13124.3, 300 sec: 13141.9). Total num frames: 14553088. Throughput: 0: 13121.6. Samples: 14532589. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:41:16,086][117718] Avg episode reward: [(0, '2813.529')] [2023-03-07 03:41:16,675][118044] Updated weights for policy 0, policy_version 14220 (0.0006) [2023-03-07 03:41:17,448][118044] Updated weights for policy 0, policy_version 14230 (0.0006) [2023-03-07 03:41:18,216][118044] Updated weights for policy 0, policy_version 14240 (0.0007) [2023-03-07 03:41:18,989][118044] Updated weights for policy 0, policy_version 14250 (0.0007) [2023-03-07 03:41:19,746][118044] Updated weights for policy 0, policy_version 14260 (0.0006) [2023-03-07 03:41:20,520][118044] Updated weights for policy 0, policy_version 14270 (0.0006) [2023-03-07 03:41:21,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13124.3, 300 sec: 13141.9). Total num frames: 14618624. Throughput: 0: 13131.9. Samples: 14611934. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:41:21,086][117718] Avg episode reward: [(0, '2837.447')] [2023-03-07 03:41:21,297][118044] Updated weights for policy 0, policy_version 14280 (0.0006) [2023-03-07 03:41:22,086][118044] Updated weights for policy 0, policy_version 14290 (0.0006) [2023-03-07 03:41:22,870][118044] Updated weights for policy 0, policy_version 14300 (0.0006) [2023-03-07 03:41:23,660][118044] Updated weights for policy 0, policy_version 14310 (0.0007) [2023-03-07 03:41:24,439][118044] Updated weights for policy 0, policy_version 14320 (0.0006) [2023-03-07 03:41:25,218][118044] Updated weights for policy 0, policy_version 14330 (0.0006) [2023-03-07 03:41:25,996][118044] Updated weights for policy 0, policy_version 14340 (0.0006) [2023-03-07 03:41:26,085][117718] Fps is (10 sec: 13209.5, 60 sec: 13141.4, 300 sec: 13141.9). Total num frames: 14685184. Throughput: 0: 13137.7. Samples: 14651216. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:41:26,086][117718] Avg episode reward: [(0, '2676.046')] [2023-03-07 03:41:26,771][118044] Updated weights for policy 0, policy_version 14350 (0.0006) [2023-03-07 03:41:27,546][118044] Updated weights for policy 0, policy_version 14360 (0.0006) [2023-03-07 03:41:28,314][118044] Updated weights for policy 0, policy_version 14370 (0.0005) [2023-03-07 03:41:29,094][118044] Updated weights for policy 0, policy_version 14380 (0.0006) [2023-03-07 03:41:29,873][118044] Updated weights for policy 0, policy_version 14390 (0.0006) [2023-03-07 03:41:30,661][118044] Updated weights for policy 0, policy_version 14400 (0.0005) [2023-03-07 03:41:31,086][117718] Fps is (10 sec: 13209.4, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 14750720. Throughput: 0: 13155.9. Samples: 14730407. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:41:31,097][117718] Avg episode reward: [(0, '2625.366')] [2023-03-07 03:41:31,436][118044] Updated weights for policy 0, policy_version 14410 (0.0005) [2023-03-07 03:41:32,207][118044] Updated weights for policy 0, policy_version 14420 (0.0007) [2023-03-07 03:41:32,986][118044] Updated weights for policy 0, policy_version 14430 (0.0006) [2023-03-07 03:41:33,761][118044] Updated weights for policy 0, policy_version 14440 (0.0006) [2023-03-07 03:41:34,537][118044] Updated weights for policy 0, policy_version 14450 (0.0006) [2023-03-07 03:41:35,310][118044] Updated weights for policy 0, policy_version 14460 (0.0006) [2023-03-07 03:41:36,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 14816256. Throughput: 0: 13159.8. Samples: 14809315. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:41:36,095][118044] Updated weights for policy 0, policy_version 14470 (0.0007) [2023-03-07 03:41:36,097][117718] Avg episode reward: [(0, '2660.918')] [2023-03-07 03:41:36,883][118044] Updated weights for policy 0, policy_version 14480 (0.0006) [2023-03-07 03:41:37,657][118044] Updated weights for policy 0, policy_version 14490 (0.0006) [2023-03-07 03:41:38,442][118044] Updated weights for policy 0, policy_version 14500 (0.0006) [2023-03-07 03:41:39,211][118044] Updated weights for policy 0, policy_version 14510 (0.0005) [2023-03-07 03:41:39,992][118044] Updated weights for policy 0, policy_version 14520 (0.0007) [2023-03-07 03:41:40,773][118044] Updated weights for policy 0, policy_version 14530 (0.0006) [2023-03-07 03:41:41,085][117718] Fps is (10 sec: 13209.9, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 14882816. Throughput: 0: 13161.4. Samples: 14848762. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:41:41,086][117718] Avg episode reward: [(0, '2772.335')] [2023-03-07 03:41:41,554][118044] Updated weights for policy 0, policy_version 14540 (0.0006) [2023-03-07 03:41:42,329][118044] Updated weights for policy 0, policy_version 14550 (0.0007) [2023-03-07 03:41:43,109][118044] Updated weights for policy 0, policy_version 14560 (0.0006) [2023-03-07 03:41:43,891][118044] Updated weights for policy 0, policy_version 14570 (0.0006) [2023-03-07 03:41:44,685][118044] Updated weights for policy 0, policy_version 14580 (0.0006) [2023-03-07 03:41:45,470][118044] Updated weights for policy 0, policy_version 14590 (0.0006) [2023-03-07 03:41:46,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 14948352. Throughput: 0: 13159.2. Samples: 14927505. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:41:46,086][117718] Avg episode reward: [(0, '2767.510')] [2023-03-07 03:41:46,249][118044] Updated weights for policy 0, policy_version 14600 (0.0005) [2023-03-07 03:41:47,029][118044] Updated weights for policy 0, policy_version 14610 (0.0007) [2023-03-07 03:41:47,824][118044] Updated weights for policy 0, policy_version 14620 (0.0006) [2023-03-07 03:41:48,599][118044] Updated weights for policy 0, policy_version 14630 (0.0005) [2023-03-07 03:41:49,391][118044] Updated weights for policy 0, policy_version 14640 (0.0006) [2023-03-07 03:41:50,188][118044] Updated weights for policy 0, policy_version 14650 (0.0006) [2023-03-07 03:41:50,969][118044] Updated weights for policy 0, policy_version 14660 (0.0006) [2023-03-07 03:41:51,086][117718] Fps is (10 sec: 13004.6, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 15012864. Throughput: 0: 13144.6. Samples: 15005712. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:41:51,086][117718] Avg episode reward: [(0, '2636.743')] [2023-03-07 03:41:51,737][118044] Updated weights for policy 0, policy_version 14670 (0.0006) [2023-03-07 03:41:52,519][118044] Updated weights for policy 0, policy_version 14680 (0.0006) [2023-03-07 03:41:53,284][118044] Updated weights for policy 0, policy_version 14690 (0.0006) [2023-03-07 03:41:54,069][118044] Updated weights for policy 0, policy_version 14700 (0.0006) [2023-03-07 03:41:54,851][118044] Updated weights for policy 0, policy_version 14710 (0.0007) [2023-03-07 03:41:55,644][118044] Updated weights for policy 0, policy_version 14720 (0.0006) [2023-03-07 03:41:56,086][117718] Fps is (10 sec: 13004.9, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 15078400. Throughput: 0: 13143.0. Samples: 15045148. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:41:56,086][117718] Avg episode reward: [(0, '2573.041')] [2023-03-07 03:41:56,426][118044] Updated weights for policy 0, policy_version 14730 (0.0007) [2023-03-07 03:41:57,195][118044] Updated weights for policy 0, policy_version 14740 (0.0006) [2023-03-07 03:41:57,973][118044] Updated weights for policy 0, policy_version 14750 (0.0007) [2023-03-07 03:41:58,759][118044] Updated weights for policy 0, policy_version 14760 (0.0006) [2023-03-07 03:41:59,524][118044] Updated weights for policy 0, policy_version 14770 (0.0007) [2023-03-07 03:42:00,309][118044] Updated weights for policy 0, policy_version 14780 (0.0005) [2023-03-07 03:42:01,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 15143936. Throughput: 0: 13137.5. Samples: 15123777. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:42:01,086][117718] Avg episode reward: [(0, '2825.256')] [2023-03-07 03:42:01,102][118044] Updated weights for policy 0, policy_version 14790 (0.0006) [2023-03-07 03:42:01,874][118044] Updated weights for policy 0, policy_version 14800 (0.0006) [2023-03-07 03:42:02,664][118044] Updated weights for policy 0, policy_version 14810 (0.0006) [2023-03-07 03:42:03,424][118044] Updated weights for policy 0, policy_version 14820 (0.0007) [2023-03-07 03:42:04,200][118044] Updated weights for policy 0, policy_version 14830 (0.0006) [2023-03-07 03:42:04,984][118044] Updated weights for policy 0, policy_version 14840 (0.0006) [2023-03-07 03:42:05,751][118044] Updated weights for policy 0, policy_version 14850 (0.0006) [2023-03-07 03:42:06,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 15210496. Throughput: 0: 13132.9. Samples: 15202914. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:42:06,086][117718] Avg episode reward: [(0, '2780.907')] [2023-03-07 03:42:06,517][118044] Updated weights for policy 0, policy_version 14860 (0.0006) [2023-03-07 03:42:07,307][118044] Updated weights for policy 0, policy_version 14870 (0.0006) [2023-03-07 03:42:08,090][118044] Updated weights for policy 0, policy_version 14880 (0.0006) [2023-03-07 03:42:08,850][118044] Updated weights for policy 0, policy_version 14890 (0.0005) [2023-03-07 03:42:09,650][118044] Updated weights for policy 0, policy_version 14900 (0.0006) [2023-03-07 03:42:10,407][118044] Updated weights for policy 0, policy_version 14910 (0.0007) [2023-03-07 03:42:11,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13141.9). Total num frames: 15276032. Throughput: 0: 13136.5. Samples: 15242359. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:42:11,086][117718] Avg episode reward: [(0, '2773.099')] [2023-03-07 03:42:11,190][118044] Updated weights for policy 0, policy_version 14920 (0.0006) [2023-03-07 03:42:11,964][118044] Updated weights for policy 0, policy_version 14930 (0.0006) [2023-03-07 03:42:12,738][118044] Updated weights for policy 0, policy_version 14940 (0.0006) [2023-03-07 03:42:13,521][118044] Updated weights for policy 0, policy_version 14950 (0.0006) [2023-03-07 03:42:14,304][118044] Updated weights for policy 0, policy_version 14960 (0.0007) [2023-03-07 03:42:15,122][118044] Updated weights for policy 0, policy_version 14970 (0.0006) [2023-03-07 03:42:15,881][118044] Updated weights for policy 0, policy_version 14980 (0.0007) [2023-03-07 03:42:16,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 15341568. Throughput: 0: 13130.9. Samples: 15321297. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:42:16,086][117718] Avg episode reward: [(0, '2725.518')] [2023-03-07 03:42:16,663][118044] Updated weights for policy 0, policy_version 14990 (0.0007) [2023-03-07 03:42:17,433][118044] Updated weights for policy 0, policy_version 15000 (0.0006) [2023-03-07 03:42:18,203][118044] Updated weights for policy 0, policy_version 15010 (0.0006) [2023-03-07 03:42:19,006][118044] Updated weights for policy 0, policy_version 15020 (0.0006) [2023-03-07 03:42:19,787][118044] Updated weights for policy 0, policy_version 15030 (0.0006) [2023-03-07 03:42:20,577][118044] Updated weights for policy 0, policy_version 15040 (0.0006) [2023-03-07 03:42:21,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 15407104. Throughput: 0: 13126.3. Samples: 15399999. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:42:21,086][117718] Avg episode reward: [(0, '2756.942')] [2023-03-07 03:42:21,343][118044] Updated weights for policy 0, policy_version 15050 (0.0006) [2023-03-07 03:42:22,148][118044] Updated weights for policy 0, policy_version 15060 (0.0006) [2023-03-07 03:42:22,912][118044] Updated weights for policy 0, policy_version 15070 (0.0006) [2023-03-07 03:42:23,678][118044] Updated weights for policy 0, policy_version 15080 (0.0006) [2023-03-07 03:42:24,463][118044] Updated weights for policy 0, policy_version 15090 (0.0006) [2023-03-07 03:42:25,250][118044] Updated weights for policy 0, policy_version 15100 (0.0006) [2023-03-07 03:42:26,010][118044] Updated weights for policy 0, policy_version 15110 (0.0006) [2023-03-07 03:42:26,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13124.2, 300 sec: 13141.9). Total num frames: 15472640. Throughput: 0: 13124.8. Samples: 15439381. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:42:26,086][117718] Avg episode reward: [(0, '2576.185')] [2023-03-07 03:42:26,798][118044] Updated weights for policy 0, policy_version 15120 (0.0006) [2023-03-07 03:42:27,585][118044] Updated weights for policy 0, policy_version 15130 (0.0007) [2023-03-07 03:42:28,352][118044] Updated weights for policy 0, policy_version 15140 (0.0006) [2023-03-07 03:42:29,129][118044] Updated weights for policy 0, policy_version 15150 (0.0005) [2023-03-07 03:42:29,903][118044] Updated weights for policy 0, policy_version 15160 (0.0006) [2023-03-07 03:42:30,683][118044] Updated weights for policy 0, policy_version 15170 (0.0006) [2023-03-07 03:42:31,085][117718] Fps is (10 sec: 13209.8, 60 sec: 13141.4, 300 sec: 13145.4). Total num frames: 15539200. Throughput: 0: 13127.2. Samples: 15518228. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:42:31,086][117718] Avg episode reward: [(0, '2638.170')] [2023-03-07 03:42:31,467][118044] Updated weights for policy 0, policy_version 15180 (0.0007) [2023-03-07 03:42:32,243][118044] Updated weights for policy 0, policy_version 15190 (0.0006) [2023-03-07 03:42:33,007][118044] Updated weights for policy 0, policy_version 15200 (0.0006) [2023-03-07 03:42:33,799][118044] Updated weights for policy 0, policy_version 15210 (0.0005) [2023-03-07 03:42:34,580][118044] Updated weights for policy 0, policy_version 15220 (0.0007) [2023-03-07 03:42:35,346][118044] Updated weights for policy 0, policy_version 15230 (0.0006) [2023-03-07 03:42:36,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 15604736. Throughput: 0: 13146.2. Samples: 15597291. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:42:36,086][117718] Avg episode reward: [(0, '2675.482')] [2023-03-07 03:42:36,090][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000015239_15604736.pth... 
[2023-03-07 03:42:36,120][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000012158_12449792.pth [2023-03-07 03:42:36,145][118044] Updated weights for policy 0, policy_version 15240 (0.0005) [2023-03-07 03:42:36,931][118044] Updated weights for policy 0, policy_version 15250 (0.0005) [2023-03-07 03:42:37,702][118044] Updated weights for policy 0, policy_version 15260 (0.0006) [2023-03-07 03:42:38,474][118044] Updated weights for policy 0, policy_version 15270 (0.0007) [2023-03-07 03:42:39,253][118044] Updated weights for policy 0, policy_version 15280 (0.0006) [2023-03-07 03:42:40,034][118044] Updated weights for policy 0, policy_version 15290 (0.0006) [2023-03-07 03:42:40,810][118044] Updated weights for policy 0, policy_version 15300 (0.0006) [2023-03-07 03:42:41,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13124.2, 300 sec: 13145.4). Total num frames: 15670272. Throughput: 0: 13143.3. Samples: 15636595. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:42:41,086][117718] Avg episode reward: [(0, '2606.874')] [2023-03-07 03:42:41,609][118044] Updated weights for policy 0, policy_version 15310 (0.0006) [2023-03-07 03:42:42,403][118044] Updated weights for policy 0, policy_version 15320 (0.0006) [2023-03-07 03:42:43,178][118044] Updated weights for policy 0, policy_version 15330 (0.0006) [2023-03-07 03:42:43,962][118044] Updated weights for policy 0, policy_version 15340 (0.0008) [2023-03-07 03:42:44,766][118044] Updated weights for policy 0, policy_version 15350 (0.0006) [2023-03-07 03:42:45,544][118044] Updated weights for policy 0, policy_version 15360 (0.0006) [2023-03-07 03:42:46,086][117718] Fps is (10 sec: 13004.8, 60 sec: 13107.2, 300 sec: 13141.9). Total num frames: 15734784. Throughput: 0: 13133.5. Samples: 15714784. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:42:46,086][117718] Avg episode reward: [(0, '2779.087')] [2023-03-07 03:42:46,309][118044] Updated weights for policy 0, policy_version 15370 (0.0006) [2023-03-07 03:42:47,108][118044] Updated weights for policy 0, policy_version 15380 (0.0006) [2023-03-07 03:42:47,869][118044] Updated weights for policy 0, policy_version 15390 (0.0006) [2023-03-07 03:42:48,645][118044] Updated weights for policy 0, policy_version 15400 (0.0006) [2023-03-07 03:42:49,442][118044] Updated weights for policy 0, policy_version 15410 (0.0005) [2023-03-07 03:42:50,222][118044] Updated weights for policy 0, policy_version 15420 (0.0007) [2023-03-07 03:42:51,012][118044] Updated weights for policy 0, policy_version 15430 (0.0006) [2023-03-07 03:42:51,085][117718] Fps is (10 sec: 13005.0, 60 sec: 13124.3, 300 sec: 13141.9). Total num frames: 15800320. Throughput: 0: 13123.1. Samples: 15793451. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:42:51,086][117718] Avg episode reward: [(0, '2650.724')] [2023-03-07 03:42:51,791][118044] Updated weights for policy 0, policy_version 15440 (0.0006) [2023-03-07 03:42:52,550][118044] Updated weights for policy 0, policy_version 15450 (0.0006) [2023-03-07 03:42:53,346][118044] Updated weights for policy 0, policy_version 15460 (0.0006) [2023-03-07 03:42:54,124][118044] Updated weights for policy 0, policy_version 15470 (0.0006) [2023-03-07 03:42:54,909][118044] Updated weights for policy 0, policy_version 15480 (0.0006) [2023-03-07 03:42:55,650][118044] Updated weights for policy 0, policy_version 15490 (0.0007) [2023-03-07 03:42:56,085][117718] Fps is (10 sec: 13209.8, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 15866880. Throughput: 0: 13122.2. Samples: 15832858. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:42:56,086][117718] Avg episode reward: [(0, '2716.801')] [2023-03-07 03:42:56,447][118044] Updated weights for policy 0, policy_version 15500 (0.0006) [2023-03-07 03:42:57,221][118044] Updated weights for policy 0, policy_version 15510 (0.0006) [2023-03-07 03:42:57,999][118044] Updated weights for policy 0, policy_version 15520 (0.0006) [2023-03-07 03:42:58,788][118044] Updated weights for policy 0, policy_version 15530 (0.0006) [2023-03-07 03:42:59,557][118044] Updated weights for policy 0, policy_version 15540 (0.0006) [2023-03-07 03:43:00,338][118044] Updated weights for policy 0, policy_version 15550 (0.0005) [2023-03-07 03:43:01,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.4, 300 sec: 13145.4). Total num frames: 15932416. Throughput: 0: 13127.5. Samples: 15912033. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:43:01,086][117718] Avg episode reward: [(0, '2621.337')] [2023-03-07 03:43:01,112][118044] Updated weights for policy 0, policy_version 15560 (0.0006) [2023-03-07 03:43:01,886][118044] Updated weights for policy 0, policy_version 15570 (0.0006) [2023-03-07 03:43:02,657][118044] Updated weights for policy 0, policy_version 15580 (0.0006) [2023-03-07 03:43:03,458][118044] Updated weights for policy 0, policy_version 15590 (0.0006) [2023-03-07 03:43:04,244][118044] Updated weights for policy 0, policy_version 15600 (0.0006) [2023-03-07 03:43:05,000][118044] Updated weights for policy 0, policy_version 15610 (0.0006) [2023-03-07 03:43:05,794][118044] Updated weights for policy 0, policy_version 15620 (0.0007) [2023-03-07 03:43:06,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13145.4). Total num frames: 15997952. Throughput: 0: 13132.1. Samples: 15990942. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:43:06,086][117718] Avg episode reward: [(0, '2623.845')] [2023-03-07 03:43:06,565][118044] Updated weights for policy 0, policy_version 15630 (0.0007) [2023-03-07 03:43:07,342][118044] Updated weights for policy 0, policy_version 15640 (0.0006) [2023-03-07 03:43:08,136][118044] Updated weights for policy 0, policy_version 15650 (0.0006) [2023-03-07 03:43:08,908][118044] Updated weights for policy 0, policy_version 15660 (0.0006) [2023-03-07 03:43:09,672][118044] Updated weights for policy 0, policy_version 15670 (0.0006) [2023-03-07 03:43:10,455][118044] Updated weights for policy 0, policy_version 15680 (0.0006) [2023-03-07 03:43:11,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 16064512. Throughput: 0: 13128.2. Samples: 16030149. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:43:11,086][117718] Avg episode reward: [(0, '2683.297')] [2023-03-07 03:43:11,247][118044] Updated weights for policy 0, policy_version 15690 (0.0006) [2023-03-07 03:43:12,040][118044] Updated weights for policy 0, policy_version 15700 (0.0006) [2023-03-07 03:43:12,821][118044] Updated weights for policy 0, policy_version 15710 (0.0006) [2023-03-07 03:43:13,593][118044] Updated weights for policy 0, policy_version 15720 (0.0006) [2023-03-07 03:43:14,367][118044] Updated weights for policy 0, policy_version 15730 (0.0006) [2023-03-07 03:43:15,147][118044] Updated weights for policy 0, policy_version 15740 (0.0006) [2023-03-07 03:43:15,926][118044] Updated weights for policy 0, policy_version 15750 (0.0006) [2023-03-07 03:43:16,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 16130048. Throughput: 0: 13129.4. Samples: 16109050. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:43:16,086][117718] Avg episode reward: [(0, '2739.209')] [2023-03-07 03:43:16,714][118044] Updated weights for policy 0, policy_version 15760 (0.0007) [2023-03-07 03:43:17,492][118044] Updated weights for policy 0, policy_version 15770 (0.0006) [2023-03-07 03:43:18,278][118044] Updated weights for policy 0, policy_version 15780 (0.0006) [2023-03-07 03:43:19,055][118044] Updated weights for policy 0, policy_version 15790 (0.0006) [2023-03-07 03:43:19,835][118044] Updated weights for policy 0, policy_version 15800 (0.0006) [2023-03-07 03:43:20,638][118044] Updated weights for policy 0, policy_version 15810 (0.0007) [2023-03-07 03:43:21,086][117718] Fps is (10 sec: 13004.6, 60 sec: 13124.3, 300 sec: 13141.9). Total num frames: 16194560. Throughput: 0: 13116.3. Samples: 16187524. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:43:21,086][117718] Avg episode reward: [(0, '2560.599')] [2023-03-07 03:43:21,421][118044] Updated weights for policy 0, policy_version 15820 (0.0007) [2023-03-07 03:43:22,201][118044] Updated weights for policy 0, policy_version 15830 (0.0006) [2023-03-07 03:43:22,985][118044] Updated weights for policy 0, policy_version 15840 (0.0006) [2023-03-07 03:43:23,749][118044] Updated weights for policy 0, policy_version 15850 (0.0007) [2023-03-07 03:43:24,527][118044] Updated weights for policy 0, policy_version 15860 (0.0006) [2023-03-07 03:43:25,323][118044] Updated weights for policy 0, policy_version 15870 (0.0007) [2023-03-07 03:43:26,086][117718] Fps is (10 sec: 13004.6, 60 sec: 13124.3, 300 sec: 13138.4). Total num frames: 16260096. Throughput: 0: 13119.1. Samples: 16226953. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:43:26,086][117718] Avg episode reward: [(0, '2618.101')] [2023-03-07 03:43:26,099][118044] Updated weights for policy 0, policy_version 15880 (0.0007) [2023-03-07 03:43:26,859][118044] Updated weights for policy 0, policy_version 15890 (0.0006) [2023-03-07 03:43:27,643][118044] Updated weights for policy 0, policy_version 15900 (0.0006) [2023-03-07 03:43:28,406][118044] Updated weights for policy 0, policy_version 15910 (0.0007) [2023-03-07 03:43:29,189][118044] Updated weights for policy 0, policy_version 15920 (0.0006) [2023-03-07 03:43:29,966][118044] Updated weights for policy 0, policy_version 15930 (0.0008) [2023-03-07 03:43:30,742][118044] Updated weights for policy 0, policy_version 15940 (0.0006) [2023-03-07 03:43:31,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13124.3, 300 sec: 13141.9). Total num frames: 16326656. Throughput: 0: 13135.5. Samples: 16305881. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:43:31,086][117718] Avg episode reward: [(0, '2711.161')] [2023-03-07 03:43:31,523][118044] Updated weights for policy 0, policy_version 15950 (0.0006) [2023-03-07 03:43:32,294][118044] Updated weights for policy 0, policy_version 15960 (0.0005) [2023-03-07 03:43:33,097][118044] Updated weights for policy 0, policy_version 15970 (0.0006) [2023-03-07 03:43:33,899][118044] Updated weights for policy 0, policy_version 15980 (0.0007) [2023-03-07 03:43:34,662][118044] Updated weights for policy 0, policy_version 15990 (0.0006) [2023-03-07 03:43:35,461][118044] Updated weights for policy 0, policy_version 16000 (0.0007) [2023-03-07 03:43:36,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13124.3, 300 sec: 13138.4). Total num frames: 16392192. Throughput: 0: 13134.7. Samples: 16384512. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 03:43:36,086][117718] Avg episode reward: [(0, '2805.948')] [2023-03-07 03:43:36,212][118044] Updated weights for policy 0, policy_version 16010 (0.0006) [2023-03-07 03:43:36,978][118044] Updated weights for policy 0, policy_version 16020 (0.0006) [2023-03-07 03:43:37,775][118044] Updated weights for policy 0, policy_version 16030 (0.0006) [2023-03-07 03:43:38,538][118044] Updated weights for policy 0, policy_version 16040 (0.0005) [2023-03-07 03:43:39,322][118044] Updated weights for policy 0, policy_version 16050 (0.0006) [2023-03-07 03:43:40,108][118044] Updated weights for policy 0, policy_version 16060 (0.0006) [2023-03-07 03:43:40,893][118044] Updated weights for policy 0, policy_version 16070 (0.0006) [2023-03-07 03:43:41,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13124.3, 300 sec: 13138.4). Total num frames: 16457728. Throughput: 0: 13141.2. Samples: 16424212. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 03:43:41,086][117718] Avg episode reward: [(0, '2735.717')] [2023-03-07 03:43:41,678][118044] Updated weights for policy 0, policy_version 16080 (0.0007) [2023-03-07 03:43:42,447][118044] Updated weights for policy 0, policy_version 16090 (0.0006) [2023-03-07 03:43:43,222][118044] Updated weights for policy 0, policy_version 16100 (0.0006) [2023-03-07 03:43:44,015][118044] Updated weights for policy 0, policy_version 16110 (0.0006) [2023-03-07 03:43:44,816][118044] Updated weights for policy 0, policy_version 16120 (0.0006) [2023-03-07 03:43:45,588][118044] Updated weights for policy 0, policy_version 16130 (0.0006) [2023-03-07 03:43:46,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.4, 300 sec: 13138.4). Total num frames: 16523264. Throughput: 0: 13129.5. Samples: 16502858. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:43:46,086][117718] Avg episode reward: [(0, '2772.491')] [2023-03-07 03:43:46,366][118044] Updated weights for policy 0, policy_version 16140 (0.0006) [2023-03-07 03:43:47,130][118044] Updated weights for policy 0, policy_version 16150 (0.0006) [2023-03-07 03:43:47,923][118044] Updated weights for policy 0, policy_version 16160 (0.0006) [2023-03-07 03:43:48,696][118044] Updated weights for policy 0, policy_version 16170 (0.0006) [2023-03-07 03:43:49,485][118044] Updated weights for policy 0, policy_version 16180 (0.0006) [2023-03-07 03:43:50,252][118044] Updated weights for policy 0, policy_version 16190 (0.0006) [2023-03-07 03:43:51,026][118044] Updated weights for policy 0, policy_version 16200 (0.0006) [2023-03-07 03:43:51,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 16588800. Throughput: 0: 13126.5. Samples: 16581637. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:43:51,086][117718] Avg episode reward: [(0, '2747.261')] [2023-03-07 03:43:51,819][118044] Updated weights for policy 0, policy_version 16210 (0.0006) [2023-03-07 03:43:52,587][118044] Updated weights for policy 0, policy_version 16220 (0.0006) [2023-03-07 03:43:53,350][118044] Updated weights for policy 0, policy_version 16230 (0.0006) [2023-03-07 03:43:54,142][118044] Updated weights for policy 0, policy_version 16240 (0.0005) [2023-03-07 03:43:54,925][118044] Updated weights for policy 0, policy_version 16250 (0.0006) [2023-03-07 03:43:55,706][118044] Updated weights for policy 0, policy_version 16260 (0.0006) [2023-03-07 03:43:56,085][117718] Fps is (10 sec: 13209.5, 60 sec: 13141.3, 300 sec: 13138.4). Total num frames: 16655360. Throughput: 0: 13136.0. Samples: 16621270. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:43:56,086][117718] Avg episode reward: [(0, '2845.962')] [2023-03-07 03:43:56,498][118044] Updated weights for policy 0, policy_version 16270 (0.0006) [2023-03-07 03:43:57,264][118044] Updated weights for policy 0, policy_version 16280 (0.0006) [2023-03-07 03:43:58,033][118044] Updated weights for policy 0, policy_version 16290 (0.0006) [2023-03-07 03:43:58,802][118044] Updated weights for policy 0, policy_version 16300 (0.0007) [2023-03-07 03:43:59,578][118044] Updated weights for policy 0, policy_version 16310 (0.0006) [2023-03-07 03:44:00,354][118044] Updated weights for policy 0, policy_version 16320 (0.0006) [2023-03-07 03:44:01,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 16720896. Throughput: 0: 13136.3. Samples: 16700185. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:44:01,086][117718] Avg episode reward: [(0, '2833.425')] [2023-03-07 03:44:01,138][118044] Updated weights for policy 0, policy_version 16330 (0.0006) [2023-03-07 03:44:01,908][118044] Updated weights for policy 0, policy_version 16340 (0.0005) [2023-03-07 03:44:02,696][118044] Updated weights for policy 0, policy_version 16350 (0.0007) [2023-03-07 03:44:03,455][118044] Updated weights for policy 0, policy_version 16360 (0.0006) [2023-03-07 03:44:04,256][118044] Updated weights for policy 0, policy_version 16370 (0.0007) [2023-03-07 03:44:05,036][118044] Updated weights for policy 0, policy_version 16380 (0.0006) [2023-03-07 03:44:05,835][118044] Updated weights for policy 0, policy_version 16390 (0.0006) [2023-03-07 03:44:06,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 16786432. Throughput: 0: 13147.2. Samples: 16779148. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:44:06,086][117718] Avg episode reward: [(0, '2724.366')] [2023-03-07 03:44:06,614][118044] Updated weights for policy 0, policy_version 16400 (0.0006) [2023-03-07 03:44:07,365][118044] Updated weights for policy 0, policy_version 16410 (0.0006) [2023-03-07 03:44:08,149][118044] Updated weights for policy 0, policy_version 16420 (0.0006) [2023-03-07 03:44:08,918][118044] Updated weights for policy 0, policy_version 16430 (0.0006) [2023-03-07 03:44:09,712][118044] Updated weights for policy 0, policy_version 16440 (0.0006) [2023-03-07 03:44:09,764][117993] KL-divergence is very high: 156.8039 [2023-03-07 03:44:10,466][118044] Updated weights for policy 0, policy_version 16450 (0.0007) [2023-03-07 03:44:11,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13135.0). Total num frames: 16851968. Throughput: 0: 13146.4. Samples: 16818540. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:44:11,086][117718] Avg episode reward: [(0, '2749.874')] [2023-03-07 03:44:11,261][118044] Updated weights for policy 0, policy_version 16460 (0.0006) [2023-03-07 03:44:12,043][118044] Updated weights for policy 0, policy_version 16470 (0.0006) [2023-03-07 03:44:12,812][118044] Updated weights for policy 0, policy_version 16480 (0.0006) [2023-03-07 03:44:13,601][118044] Updated weights for policy 0, policy_version 16490 (0.0006) [2023-03-07 03:44:14,372][118044] Updated weights for policy 0, policy_version 16500 (0.0005) [2023-03-07 03:44:15,156][118044] Updated weights for policy 0, policy_version 16510 (0.0007) [2023-03-07 03:44:15,942][118044] Updated weights for policy 0, policy_version 16520 (0.0006) [2023-03-07 03:44:16,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13135.0). Total num frames: 16917504. Throughput: 0: 13149.7. Samples: 16897618. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:44:16,086][117718] Avg episode reward: [(0, '2508.501')] [2023-03-07 03:44:16,709][118044] Updated weights for policy 0, policy_version 16530 (0.0006) [2023-03-07 03:44:17,494][118044] Updated weights for policy 0, policy_version 16540 (0.0006) [2023-03-07 03:44:18,263][118044] Updated weights for policy 0, policy_version 16550 (0.0006) [2023-03-07 03:44:19,050][118044] Updated weights for policy 0, policy_version 16560 (0.0006) [2023-03-07 03:44:19,830][118044] Updated weights for policy 0, policy_version 16570 (0.0006) [2023-03-07 03:44:20,599][118044] Updated weights for policy 0, policy_version 16580 (0.0006) [2023-03-07 03:44:21,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.4, 300 sec: 13135.0). Total num frames: 16983040. Throughput: 0: 13151.1. Samples: 16976312. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:44:21,086][117718] Avg episode reward: [(0, '2785.658')] [2023-03-07 03:44:21,401][118044] Updated weights for policy 0, policy_version 16590 (0.0007) [2023-03-07 03:44:22,181][118044] Updated weights for policy 0, policy_version 16600 (0.0007) [2023-03-07 03:44:22,974][118044] Updated weights for policy 0, policy_version 16610 (0.0007) [2023-03-07 03:44:23,756][118044] Updated weights for policy 0, policy_version 16620 (0.0006) [2023-03-07 03:44:24,532][118044] Updated weights for policy 0, policy_version 16630 (0.0007) [2023-03-07 03:44:25,302][118044] Updated weights for policy 0, policy_version 16640 (0.0007) [2023-03-07 03:44:26,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 17048576. Throughput: 0: 13137.5. Samples: 17015399. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:44:26,086][117718] Avg episode reward: [(0, '2648.465')] [2023-03-07 03:44:26,088][118044] Updated weights for policy 0, policy_version 16650 (0.0007) [2023-03-07 03:44:26,858][118044] Updated weights for policy 0, policy_version 16660 (0.0006) [2023-03-07 03:44:27,638][118044] Updated weights for policy 0, policy_version 16670 (0.0007) [2023-03-07 03:44:28,406][118044] Updated weights for policy 0, policy_version 16680 (0.0006) [2023-03-07 03:44:29,190][118044] Updated weights for policy 0, policy_version 16690 (0.0006) [2023-03-07 03:44:29,951][118044] Updated weights for policy 0, policy_version 16700 (0.0006) [2023-03-07 03:44:30,723][118044] Updated weights for policy 0, policy_version 16710 (0.0006) [2023-03-07 03:44:31,086][117718] Fps is (10 sec: 13209.4, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 17115136. Throughput: 0: 13149.8. Samples: 17094601. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:44:31,086][117718] Avg episode reward: [(0, '2694.555')] [2023-03-07 03:44:31,512][118044] Updated weights for policy 0, policy_version 16720 (0.0006) [2023-03-07 03:44:32,292][118044] Updated weights for policy 0, policy_version 16730 (0.0006) [2023-03-07 03:44:33,079][118044] Updated weights for policy 0, policy_version 16740 (0.0007) [2023-03-07 03:44:33,862][118044] Updated weights for policy 0, policy_version 16750 (0.0006) [2023-03-07 03:44:34,629][118044] Updated weights for policy 0, policy_version 16760 (0.0007) [2023-03-07 03:44:35,398][118044] Updated weights for policy 0, policy_version 16770 (0.0006) [2023-03-07 03:44:36,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 17180672. Throughput: 0: 13156.0. Samples: 17173659. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:44:36,086][117718] Avg episode reward: [(0, '2765.242')] [2023-03-07 03:44:36,093][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000016779_17181696.pth... [2023-03-07 03:44:36,125][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000013700_14028800.pth [2023-03-07 03:44:36,179][118044] Updated weights for policy 0, policy_version 16780 (0.0006) [2023-03-07 03:44:36,946][118044] Updated weights for policy 0, policy_version 16790 (0.0006) [2023-03-07 03:44:37,723][118044] Updated weights for policy 0, policy_version 16800 (0.0006) [2023-03-07 03:44:38,516][118044] Updated weights for policy 0, policy_version 16810 (0.0006) [2023-03-07 03:44:39,296][118044] Updated weights for policy 0, policy_version 16820 (0.0006) [2023-03-07 03:44:40,072][118044] Updated weights for policy 0, policy_version 16830 (0.0007) [2023-03-07 03:44:40,853][118044] Updated weights for policy 0, policy_version 16840 (0.0006) [2023-03-07 03:44:41,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13138.4). Total num frames: 17247232. Throughput: 0: 13154.1. Samples: 17213203. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:44:41,086][117718] Avg episode reward: [(0, '2685.737')] [2023-03-07 03:44:41,610][118044] Updated weights for policy 0, policy_version 16850 (0.0006) [2023-03-07 03:44:42,399][118044] Updated weights for policy 0, policy_version 16860 (0.0006) [2023-03-07 03:44:43,191][118044] Updated weights for policy 0, policy_version 16870 (0.0005) [2023-03-07 03:44:43,965][118044] Updated weights for policy 0, policy_version 16880 (0.0006) [2023-03-07 03:44:44,738][118044] Updated weights for policy 0, policy_version 16890 (0.0006) [2023-03-07 03:44:45,500][118044] Updated weights for policy 0, policy_version 16900 (0.0006) [2023-03-07 03:44:46,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13138.4). Total num frames: 17312768. Throughput: 0: 13152.6. Samples: 17292055. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:44:46,086][117718] Avg episode reward: [(0, '2882.887')] [2023-03-07 03:44:46,298][118044] Updated weights for policy 0, policy_version 16910 (0.0006) [2023-03-07 03:44:47,087][118044] Updated weights for policy 0, policy_version 16920 (0.0006) [2023-03-07 03:44:47,869][118044] Updated weights for policy 0, policy_version 16930 (0.0006) [2023-03-07 03:44:48,652][118044] Updated weights for policy 0, policy_version 16940 (0.0006) [2023-03-07 03:44:49,455][118044] Updated weights for policy 0, policy_version 16950 (0.0006) [2023-03-07 03:44:50,221][118044] Updated weights for policy 0, policy_version 16960 (0.0006) [2023-03-07 03:44:51,001][118044] Updated weights for policy 0, policy_version 16970 (0.0005) [2023-03-07 03:44:51,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13138.4). Total num frames: 17378304. Throughput: 0: 13141.1. Samples: 17370497. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:44:51,086][117718] Avg episode reward: [(0, '2866.753')] [2023-03-07 03:44:51,790][118044] Updated weights for policy 0, policy_version 16980 (0.0007) [2023-03-07 03:44:52,567][118044] Updated weights for policy 0, policy_version 16990 (0.0006) [2023-03-07 03:44:53,339][118044] Updated weights for policy 0, policy_version 17000 (0.0006) [2023-03-07 03:44:54,137][118044] Updated weights for policy 0, policy_version 17010 (0.0006) [2023-03-07 03:44:54,895][118044] Updated weights for policy 0, policy_version 17020 (0.0007) [2023-03-07 03:44:55,686][118044] Updated weights for policy 0, policy_version 17030 (0.0006) [2023-03-07 03:44:56,085][117718] Fps is (10 sec: 13107.4, 60 sec: 13141.3, 300 sec: 13138.4). Total num frames: 17443840. Throughput: 0: 13141.4. Samples: 17409903. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:44:56,086][117718] Avg episode reward: [(0, '2853.575')] [2023-03-07 03:44:56,465][118044] Updated weights for policy 0, policy_version 17040 (0.0006) [2023-03-07 03:44:57,248][118044] Updated weights for policy 0, policy_version 17050 (0.0006) [2023-03-07 03:44:58,024][118044] Updated weights for policy 0, policy_version 17060 (0.0006) [2023-03-07 03:44:58,797][118044] Updated weights for policy 0, policy_version 17070 (0.0006) [2023-03-07 03:44:59,584][118044] Updated weights for policy 0, policy_version 17080 (0.0007) [2023-03-07 03:45:00,358][118044] Updated weights for policy 0, policy_version 17090 (0.0007) [2023-03-07 03:45:01,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 17509376. Throughput: 0: 13137.3. Samples: 17488794. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:45:01,086][117718] Avg episode reward: [(0, '2960.570')] [2023-03-07 03:45:01,133][118044] Updated weights for policy 0, policy_version 17100 (0.0006) [2023-03-07 03:45:01,931][118044] Updated weights for policy 0, policy_version 17110 (0.0006) [2023-03-07 03:45:02,709][118044] Updated weights for policy 0, policy_version 17120 (0.0007) [2023-03-07 03:45:03,497][118044] Updated weights for policy 0, policy_version 17130 (0.0006) [2023-03-07 03:45:04,274][118044] Updated weights for policy 0, policy_version 17140 (0.0006) [2023-03-07 03:45:05,048][118044] Updated weights for policy 0, policy_version 17150 (0.0006) [2023-03-07 03:45:05,830][118044] Updated weights for policy 0, policy_version 17160 (0.0006) [2023-03-07 03:45:06,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 17574912. Throughput: 0: 13131.9. Samples: 17567248. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:45:06,086][117718] Avg episode reward: [(0, '2888.063')] [2023-03-07 03:45:06,606][118044] Updated weights for policy 0, policy_version 17170 (0.0006) [2023-03-07 03:45:07,384][118044] Updated weights for policy 0, policy_version 17180 (0.0006) [2023-03-07 03:45:08,178][118044] Updated weights for policy 0, policy_version 17190 (0.0006) [2023-03-07 03:45:08,943][118044] Updated weights for policy 0, policy_version 17200 (0.0006) [2023-03-07 03:45:09,746][118044] Updated weights for policy 0, policy_version 17210 (0.0006) [2023-03-07 03:45:10,509][118044] Updated weights for policy 0, policy_version 17220 (0.0006) [2023-03-07 03:45:11,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 17640448. Throughput: 0: 13136.5. Samples: 17606540. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:45:11,086][117718] Avg episode reward: [(0, '2952.555')] [2023-03-07 03:45:11,281][118044] Updated weights for policy 0, policy_version 17230 (0.0006) [2023-03-07 03:45:12,090][118044] Updated weights for policy 0, policy_version 17240 (0.0006) [2023-03-07 03:45:12,853][118044] Updated weights for policy 0, policy_version 17250 (0.0007) [2023-03-07 03:45:13,639][118044] Updated weights for policy 0, policy_version 17260 (0.0006) [2023-03-07 03:45:14,421][118044] Updated weights for policy 0, policy_version 17270 (0.0007) [2023-03-07 03:45:15,190][118044] Updated weights for policy 0, policy_version 17280 (0.0006) [2023-03-07 03:45:15,981][118044] Updated weights for policy 0, policy_version 17290 (0.0006) [2023-03-07 03:45:16,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 17705984. Throughput: 0: 13130.8. Samples: 17685485. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:45:16,086][117718] Avg episode reward: [(0, '2900.700')] [2023-03-07 03:45:16,752][118044] Updated weights for policy 0, policy_version 17300 (0.0006) [2023-03-07 03:45:17,543][118044] Updated weights for policy 0, policy_version 17310 (0.0007) [2023-03-07 03:45:18,320][118044] Updated weights for policy 0, policy_version 17320 (0.0006) [2023-03-07 03:45:19,095][118044] Updated weights for policy 0, policy_version 17330 (0.0007) [2023-03-07 03:45:19,865][118044] Updated weights for policy 0, policy_version 17340 (0.0006) [2023-03-07 03:45:20,650][118044] Updated weights for policy 0, policy_version 17350 (0.0006) [2023-03-07 03:45:21,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 17771520. Throughput: 0: 13127.0. Samples: 17764376. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:45:21,086][117718] Avg episode reward: [(0, '2806.253')] [2023-03-07 03:45:21,440][118044] Updated weights for policy 0, policy_version 17360 (0.0006) [2023-03-07 03:45:22,211][118044] Updated weights for policy 0, policy_version 17370 (0.0006) [2023-03-07 03:45:22,984][118044] Updated weights for policy 0, policy_version 17380 (0.0007) [2023-03-07 03:45:23,766][118044] Updated weights for policy 0, policy_version 17390 (0.0005) [2023-03-07 03:45:24,538][118044] Updated weights for policy 0, policy_version 17400 (0.0007) [2023-03-07 03:45:25,329][118044] Updated weights for policy 0, policy_version 17410 (0.0005) [2023-03-07 03:45:26,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 17837056. Throughput: 0: 13122.6. Samples: 17803722. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:45:26,097][117718] Avg episode reward: [(0, '2819.700')] [2023-03-07 03:45:26,122][118044] Updated weights for policy 0, policy_version 17420 (0.0007) [2023-03-07 03:45:26,900][118044] Updated weights for policy 0, policy_version 17430 (0.0005) [2023-03-07 03:45:27,674][118044] Updated weights for policy 0, policy_version 17440 (0.0006) [2023-03-07 03:45:28,440][118044] Updated weights for policy 0, policy_version 17450 (0.0007) [2023-03-07 03:45:29,212][118044] Updated weights for policy 0, policy_version 17460 (0.0006) [2023-03-07 03:45:29,984][118044] Updated weights for policy 0, policy_version 17470 (0.0008) [2023-03-07 03:45:30,757][118044] Updated weights for policy 0, policy_version 17480 (0.0006) [2023-03-07 03:45:31,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13141.4, 300 sec: 13138.4). Total num frames: 17903616. Throughput: 0: 13122.5. Samples: 17882566. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:45:31,086][117718] Avg episode reward: [(0, '2706.327')] [2023-03-07 03:45:31,549][118044] Updated weights for policy 0, policy_version 17490 (0.0006) [2023-03-07 03:45:32,321][118044] Updated weights for policy 0, policy_version 17500 (0.0008) [2023-03-07 03:45:33,095][118044] Updated weights for policy 0, policy_version 17510 (0.0006) [2023-03-07 03:45:33,885][118044] Updated weights for policy 0, policy_version 17520 (0.0005) [2023-03-07 03:45:34,674][118044] Updated weights for policy 0, policy_version 17530 (0.0006) [2023-03-07 03:45:35,437][118044] Updated weights for policy 0, policy_version 17540 (0.0006) [2023-03-07 03:45:36,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13138.4). Total num frames: 17969152. Throughput: 0: 13133.4. Samples: 17961504. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:45:36,086][117718] Avg episode reward: [(0, '2773.640')] [2023-03-07 03:45:36,216][118044] Updated weights for policy 0, policy_version 17550 (0.0006) [2023-03-07 03:45:37,002][118044] Updated weights for policy 0, policy_version 17560 (0.0006) [2023-03-07 03:45:37,793][118044] Updated weights for policy 0, policy_version 17570 (0.0006) [2023-03-07 03:45:38,581][118044] Updated weights for policy 0, policy_version 17580 (0.0006) [2023-03-07 03:45:39,359][118044] Updated weights for policy 0, policy_version 17590 (0.0006) [2023-03-07 03:45:40,142][118044] Updated weights for policy 0, policy_version 17600 (0.0007) [2023-03-07 03:45:40,940][118044] Updated weights for policy 0, policy_version 17610 (0.0006) [2023-03-07 03:45:41,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13138.4). Total num frames: 18034688. Throughput: 0: 13130.8. Samples: 18000788. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:45:41,086][117718] Avg episode reward: [(0, '2591.421')] [2023-03-07 03:45:41,721][118044] Updated weights for policy 0, policy_version 17620 (0.0006) [2023-03-07 03:45:42,489][118044] Updated weights for policy 0, policy_version 17630 (0.0005) [2023-03-07 03:45:43,262][118044] Updated weights for policy 0, policy_version 17640 (0.0006) [2023-03-07 03:45:44,046][118044] Updated weights for policy 0, policy_version 17650 (0.0006) [2023-03-07 03:45:44,832][118044] Updated weights for policy 0, policy_version 17660 (0.0006) [2023-03-07 03:45:45,598][118044] Updated weights for policy 0, policy_version 17670 (0.0007) [2023-03-07 03:45:46,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13124.2, 300 sec: 13138.4). Total num frames: 18100224. Throughput: 0: 13123.5. Samples: 18079355. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:45:46,087][117718] Avg episode reward: [(0, '2636.581')] [2023-03-07 03:45:46,394][118044] Updated weights for policy 0, policy_version 17680 (0.0006) [2023-03-07 03:45:47,182][118044] Updated weights for policy 0, policy_version 17690 (0.0006) [2023-03-07 03:45:47,942][118044] Updated weights for policy 0, policy_version 17700 (0.0005) [2023-03-07 03:45:48,742][118044] Updated weights for policy 0, policy_version 17710 (0.0005) [2023-03-07 03:45:49,511][118044] Updated weights for policy 0, policy_version 17720 (0.0007) [2023-03-07 03:45:50,302][118044] Updated weights for policy 0, policy_version 17730 (0.0006) [2023-03-07 03:45:51,073][118044] Updated weights for policy 0, policy_version 17740 (0.0006) [2023-03-07 03:45:51,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13138.4). Total num frames: 18165760. Throughput: 0: 13129.8. Samples: 18158089. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:45:51,086][117718] Avg episode reward: [(0, '2617.198')] [2023-03-07 03:45:51,845][118044] Updated weights for policy 0, policy_version 17750 (0.0005) [2023-03-07 03:45:52,628][118044] Updated weights for policy 0, policy_version 17760 (0.0007) [2023-03-07 03:45:53,405][118044] Updated weights for policy 0, policy_version 17770 (0.0006) [2023-03-07 03:45:54,171][118044] Updated weights for policy 0, policy_version 17780 (0.0006) [2023-03-07 03:45:54,950][118044] Updated weights for policy 0, policy_version 17790 (0.0006) [2023-03-07 03:45:55,730][118044] Updated weights for policy 0, policy_version 17800 (0.0007) [2023-03-07 03:45:56,085][117718] Fps is (10 sec: 13107.4, 60 sec: 13124.3, 300 sec: 13138.4). Total num frames: 18231296. Throughput: 0: 13133.6. Samples: 18197550. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:45:56,086][117718] Avg episode reward: [(0, '2775.333')] [2023-03-07 03:45:56,505][118044] Updated weights for policy 0, policy_version 17810 (0.0005) [2023-03-07 03:45:57,286][118044] Updated weights for policy 0, policy_version 17820 (0.0006) [2023-03-07 03:45:58,073][118044] Updated weights for policy 0, policy_version 17830 (0.0005) [2023-03-07 03:45:58,840][118044] Updated weights for policy 0, policy_version 17840 (0.0006) [2023-03-07 03:45:59,613][118044] Updated weights for policy 0, policy_version 17850 (0.0005) [2023-03-07 03:46:00,405][118044] Updated weights for policy 0, policy_version 17860 (0.0006) [2023-03-07 03:46:01,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13138.4). Total num frames: 18296832. Throughput: 0: 13137.2. Samples: 18276661. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:46:01,086][117718] Avg episode reward: [(0, '2670.737')] [2023-03-07 03:46:01,182][118044] Updated weights for policy 0, policy_version 17870 (0.0006) [2023-03-07 03:46:01,957][118044] Updated weights for policy 0, policy_version 17880 (0.0006) [2023-03-07 03:46:02,725][118044] Updated weights for policy 0, policy_version 17890 (0.0006) [2023-03-07 03:46:03,501][118044] Updated weights for policy 0, policy_version 17900 (0.0006) [2023-03-07 03:46:04,269][118044] Updated weights for policy 0, policy_version 17910 (0.0006) [2023-03-07 03:46:05,036][118044] Updated weights for policy 0, policy_version 17920 (0.0005) [2023-03-07 03:46:05,814][118044] Updated weights for policy 0, policy_version 17930 (0.0006) [2023-03-07 03:46:06,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 18363392. Throughput: 0: 13145.4. Samples: 18355918. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:46:06,086][117718] Avg episode reward: [(0, '2834.908')] [2023-03-07 03:46:06,598][118044] Updated weights for policy 0, policy_version 17940 (0.0006) [2023-03-07 03:46:07,393][118044] Updated weights for policy 0, policy_version 17950 (0.0007) [2023-03-07 03:46:08,151][118044] Updated weights for policy 0, policy_version 17960 (0.0006) [2023-03-07 03:46:08,925][118044] Updated weights for policy 0, policy_version 17970 (0.0007) [2023-03-07 03:46:09,705][118044] Updated weights for policy 0, policy_version 17980 (0.0006) [2023-03-07 03:46:10,492][118044] Updated weights for policy 0, policy_version 17990 (0.0007) [2023-03-07 03:46:11,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13138.4). Total num frames: 18428928. Throughput: 0: 13149.0. Samples: 18395427. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:46:11,086][117718] Avg episode reward: [(0, '2847.395')] [2023-03-07 03:46:11,272][118044] Updated weights for policy 0, policy_version 18000 (0.0006) [2023-03-07 03:46:12,062][118044] Updated weights for policy 0, policy_version 18010 (0.0006) [2023-03-07 03:46:12,825][118044] Updated weights for policy 0, policy_version 18020 (0.0006) [2023-03-07 03:46:13,599][118044] Updated weights for policy 0, policy_version 18030 (0.0006) [2023-03-07 03:46:14,401][118044] Updated weights for policy 0, policy_version 18040 (0.0006) [2023-03-07 03:46:15,158][118044] Updated weights for policy 0, policy_version 18050 (0.0006) [2023-03-07 03:46:15,936][118044] Updated weights for policy 0, policy_version 18060 (0.0007) [2023-03-07 03:46:16,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13138.4). Total num frames: 18494464. Throughput: 0: 13151.1. Samples: 18474367. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:46:16,086][117718] Avg episode reward: [(0, '2677.342')] [2023-03-07 03:46:16,705][118044] Updated weights for policy 0, policy_version 18070 (0.0005) [2023-03-07 03:46:17,506][118044] Updated weights for policy 0, policy_version 18080 (0.0006) [2023-03-07 03:46:18,294][118044] Updated weights for policy 0, policy_version 18090 (0.0006) [2023-03-07 03:46:19,065][118044] Updated weights for policy 0, policy_version 18100 (0.0006) [2023-03-07 03:46:19,859][118044] Updated weights for policy 0, policy_version 18110 (0.0006) [2023-03-07 03:46:20,623][118044] Updated weights for policy 0, policy_version 18120 (0.0006) [2023-03-07 03:46:21,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13138.4). Total num frames: 18561024. Throughput: 0: 13145.7. Samples: 18553062. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:46:21,086][117718] Avg episode reward: [(0, '2892.995')] [2023-03-07 03:46:21,401][118044] Updated weights for policy 0, policy_version 18130 (0.0007) [2023-03-07 03:46:22,181][118044] Updated weights for policy 0, policy_version 18140 (0.0007) [2023-03-07 03:46:22,967][118044] Updated weights for policy 0, policy_version 18150 (0.0006) [2023-03-07 03:46:23,731][118044] Updated weights for policy 0, policy_version 18160 (0.0006) [2023-03-07 03:46:24,520][118044] Updated weights for policy 0, policy_version 18170 (0.0005) [2023-03-07 03:46:25,292][118044] Updated weights for policy 0, policy_version 18180 (0.0006) [2023-03-07 03:46:26,073][118044] Updated weights for policy 0, policy_version 18190 (0.0006) [2023-03-07 03:46:26,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13138.4). Total num frames: 18626560. Throughput: 0: 13148.9. Samples: 18592488. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:46:26,086][117718] Avg episode reward: [(0, '2652.346')] [2023-03-07 03:46:26,846][118044] Updated weights for policy 0, policy_version 18200 (0.0006) [2023-03-07 03:46:27,641][118044] Updated weights for policy 0, policy_version 18210 (0.0006) [2023-03-07 03:46:28,419][118044] Updated weights for policy 0, policy_version 18220 (0.0006) [2023-03-07 03:46:29,194][118044] Updated weights for policy 0, policy_version 18230 (0.0006) [2023-03-07 03:46:29,967][118044] Updated weights for policy 0, policy_version 18240 (0.0006) [2023-03-07 03:46:30,752][118044] Updated weights for policy 0, policy_version 18250 (0.0006) [2023-03-07 03:46:31,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13138.4). Total num frames: 18692096. Throughput: 0: 13156.7. Samples: 18671402. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:46:31,086][117718] Avg episode reward: [(0, '2634.096')] [2023-03-07 03:46:31,516][118044] Updated weights for policy 0, policy_version 18260 (0.0006) [2023-03-07 03:46:32,299][118044] Updated weights for policy 0, policy_version 18270 (0.0006) [2023-03-07 03:46:33,070][118044] Updated weights for policy 0, policy_version 18280 (0.0006) [2023-03-07 03:46:33,853][118044] Updated weights for policy 0, policy_version 18290 (0.0007) [2023-03-07 03:46:34,625][118044] Updated weights for policy 0, policy_version 18300 (0.0006) [2023-03-07 03:46:35,391][118044] Updated weights for policy 0, policy_version 18310 (0.0007) [2023-03-07 03:46:36,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.4, 300 sec: 13135.0). Total num frames: 18757632. Throughput: 0: 13167.5. Samples: 18750629. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:46:36,086][117718] Avg episode reward: [(0, '2663.256')] [2023-03-07 03:46:36,090][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000018318_18757632.pth... 
[2023-03-07 03:46:36,122][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000015239_15604736.pth [2023-03-07 03:46:36,169][118044] Updated weights for policy 0, policy_version 18320 (0.0006) [2023-03-07 03:46:36,969][118044] Updated weights for policy 0, policy_version 18330 (0.0006) [2023-03-07 03:46:37,749][118044] Updated weights for policy 0, policy_version 18340 (0.0006) [2023-03-07 03:46:38,537][118044] Updated weights for policy 0, policy_version 18350 (0.0007) [2023-03-07 03:46:39,312][118044] Updated weights for policy 0, policy_version 18360 (0.0006) [2023-03-07 03:46:40,077][118044] Updated weights for policy 0, policy_version 18370 (0.0006) [2023-03-07 03:46:40,857][118044] Updated weights for policy 0, policy_version 18380 (0.0006) [2023-03-07 03:46:41,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 18823168. Throughput: 0: 13159.8. Samples: 18789743. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:46:41,086][117718] Avg episode reward: [(0, '2789.601')] [2023-03-07 03:46:41,648][118044] Updated weights for policy 0, policy_version 18390 (0.0006) [2023-03-07 03:46:42,413][118044] Updated weights for policy 0, policy_version 18400 (0.0006) [2023-03-07 03:46:43,183][118044] Updated weights for policy 0, policy_version 18410 (0.0006) [2023-03-07 03:46:43,966][118044] Updated weights for policy 0, policy_version 18420 (0.0006) [2023-03-07 03:46:44,750][118044] Updated weights for policy 0, policy_version 18430 (0.0006) [2023-03-07 03:46:45,516][118044] Updated weights for policy 0, policy_version 18440 (0.0006) [2023-03-07 03:46:46,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.5, 300 sec: 13141.9). Total num frames: 18889728. Throughput: 0: 13161.7. Samples: 18868938. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:46:46,086][117718] Avg episode reward: [(0, '2759.955')] [2023-03-07 03:46:46,307][118044] Updated weights for policy 0, policy_version 18450 (0.0007) [2023-03-07 03:46:47,086][118044] Updated weights for policy 0, policy_version 18460 (0.0006) [2023-03-07 03:46:47,862][118044] Updated weights for policy 0, policy_version 18470 (0.0006) [2023-03-07 03:46:48,641][118044] Updated weights for policy 0, policy_version 18480 (0.0007) [2023-03-07 03:46:49,403][118044] Updated weights for policy 0, policy_version 18490 (0.0006) [2023-03-07 03:46:50,199][118044] Updated weights for policy 0, policy_version 18500 (0.0006) [2023-03-07 03:46:50,971][118044] Updated weights for policy 0, policy_version 18510 (0.0007) [2023-03-07 03:46:51,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13141.9). Total num frames: 18955264. Throughput: 0: 13154.1. Samples: 18947852. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:46:51,086][117718] Avg episode reward: [(0, '2710.609')] [2023-03-07 03:46:51,758][118044] Updated weights for policy 0, policy_version 18520 (0.0006) [2023-03-07 03:46:52,537][118044] Updated weights for policy 0, policy_version 18530 (0.0007) [2023-03-07 03:46:53,313][118044] Updated weights for policy 0, policy_version 18540 (0.0007) [2023-03-07 03:46:54,098][118044] Updated weights for policy 0, policy_version 18550 (0.0005) [2023-03-07 03:46:54,874][118044] Updated weights for policy 0, policy_version 18560 (0.0006) [2023-03-07 03:46:55,645][118044] Updated weights for policy 0, policy_version 18570 (0.0007) [2023-03-07 03:46:56,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13141.9). Total num frames: 19020800. Throughput: 0: 13151.0. Samples: 18987222. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:46:56,086][117718] Avg episode reward: [(0, '2690.378')] [2023-03-07 03:46:56,419][118044] Updated weights for policy 0, policy_version 18580 (0.0007) [2023-03-07 03:46:57,210][118044] Updated weights for policy 0, policy_version 18590 (0.0006) [2023-03-07 03:46:57,974][118044] Updated weights for policy 0, policy_version 18600 (0.0006) [2023-03-07 03:46:58,757][118044] Updated weights for policy 0, policy_version 18610 (0.0006) [2023-03-07 03:46:59,536][118044] Updated weights for policy 0, policy_version 18620 (0.0007) [2023-03-07 03:47:00,315][118044] Updated weights for policy 0, policy_version 18630 (0.0006) [2023-03-07 03:47:01,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13138.4). Total num frames: 19086336. Throughput: 0: 13153.3. Samples: 19066263. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:47:01,086][117718] Avg episode reward: [(0, '2654.159')] [2023-03-07 03:47:01,090][118044] Updated weights for policy 0, policy_version 18640 (0.0006) [2023-03-07 03:47:01,869][118044] Updated weights for policy 0, policy_version 18650 (0.0007) [2023-03-07 03:47:02,646][118044] Updated weights for policy 0, policy_version 18660 (0.0006) [2023-03-07 03:47:03,430][118044] Updated weights for policy 0, policy_version 18670 (0.0008) [2023-03-07 03:47:04,208][118044] Updated weights for policy 0, policy_version 18680 (0.0007) [2023-03-07 03:47:04,984][118044] Updated weights for policy 0, policy_version 18690 (0.0006) [2023-03-07 03:47:05,761][118044] Updated weights for policy 0, policy_version 18700 (0.0006) [2023-03-07 03:47:06,085][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13141.9). Total num frames: 19152896. Throughput: 0: 13160.6. Samples: 19145290. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:47:06,086][117718] Avg episode reward: [(0, '2896.758')] [2023-03-07 03:47:06,556][118044] Updated weights for policy 0, policy_version 18710 (0.0006) [2023-03-07 03:47:07,326][118044] Updated weights for policy 0, policy_version 18720 (0.0006) [2023-03-07 03:47:08,134][118044] Updated weights for policy 0, policy_version 18730 (0.0006) [2023-03-07 03:47:08,909][118044] Updated weights for policy 0, policy_version 18740 (0.0006) [2023-03-07 03:47:09,693][118044] Updated weights for policy 0, policy_version 18750 (0.0008) [2023-03-07 03:47:10,466][118044] Updated weights for policy 0, policy_version 18760 (0.0006) [2023-03-07 03:47:11,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13138.4). Total num frames: 19217408. Throughput: 0: 13152.8. Samples: 19184365. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:47:11,086][117718] Avg episode reward: [(0, '2721.553')] [2023-03-07 03:47:11,243][118044] Updated weights for policy 0, policy_version 18770 (0.0006) [2023-03-07 03:47:12,020][118044] Updated weights for policy 0, policy_version 18780 (0.0006) [2023-03-07 03:47:12,797][118044] Updated weights for policy 0, policy_version 18790 (0.0006) [2023-03-07 03:47:13,571][118044] Updated weights for policy 0, policy_version 18800 (0.0006) [2023-03-07 03:47:14,352][118044] Updated weights for policy 0, policy_version 18810 (0.0006) [2023-03-07 03:47:15,132][118044] Updated weights for policy 0, policy_version 18820 (0.0006) [2023-03-07 03:47:15,929][118044] Updated weights for policy 0, policy_version 18830 (0.0007) [2023-03-07 03:47:16,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13141.9). Total num frames: 19283968. Throughput: 0: 13153.4. Samples: 19263304. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:47:16,086][117718] Avg episode reward: [(0, '2723.979')] [2023-03-07 03:47:16,701][118044] Updated weights for policy 0, policy_version 18840 (0.0007) [2023-03-07 03:47:17,490][118044] Updated weights for policy 0, policy_version 18850 (0.0006) [2023-03-07 03:47:18,254][118044] Updated weights for policy 0, policy_version 18860 (0.0007) [2023-03-07 03:47:19,022][118044] Updated weights for policy 0, policy_version 18870 (0.0007) [2023-03-07 03:47:19,820][118044] Updated weights for policy 0, policy_version 18880 (0.0006) [2023-03-07 03:47:20,585][118044] Updated weights for policy 0, policy_version 18890 (0.0006) [2023-03-07 03:47:21,085][117718] Fps is (10 sec: 13209.8, 60 sec: 13141.4, 300 sec: 13141.9). Total num frames: 19349504. Throughput: 0: 13145.6. Samples: 19342179. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:47:21,086][117718] Avg episode reward: [(0, '2764.095')] [2023-03-07 03:47:21,360][118044] Updated weights for policy 0, policy_version 18900 (0.0006) [2023-03-07 03:47:22,144][118044] Updated weights for policy 0, policy_version 18910 (0.0006) [2023-03-07 03:47:22,917][118044] Updated weights for policy 0, policy_version 18920 (0.0006) [2023-03-07 03:47:23,692][118044] Updated weights for policy 0, policy_version 18930 (0.0006) [2023-03-07 03:47:24,475][118044] Updated weights for policy 0, policy_version 18940 (0.0006) [2023-03-07 03:47:25,246][118044] Updated weights for policy 0, policy_version 18950 (0.0006) [2023-03-07 03:47:26,031][118044] Updated weights for policy 0, policy_version 18960 (0.0006) [2023-03-07 03:47:26,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13138.4). Total num frames: 19415040. Throughput: 0: 13153.4. Samples: 19381646. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:47:26,086][117718] Avg episode reward: [(0, '2641.925')] [2023-03-07 03:47:26,808][118044] Updated weights for policy 0, policy_version 18970 (0.0006) [2023-03-07 03:47:27,591][118044] Updated weights for policy 0, policy_version 18980 (0.0006) [2023-03-07 03:47:28,371][118044] Updated weights for policy 0, policy_version 18990 (0.0006) [2023-03-07 03:47:29,154][118044] Updated weights for policy 0, policy_version 19000 (0.0006) [2023-03-07 03:47:29,925][118044] Updated weights for policy 0, policy_version 19010 (0.0006) [2023-03-07 03:47:30,706][118044] Updated weights for policy 0, policy_version 19020 (0.0006) [2023-03-07 03:47:31,086][117718] Fps is (10 sec: 13106.9, 60 sec: 13141.3, 300 sec: 13138.4). Total num frames: 19480576. Throughput: 0: 13144.5. Samples: 19460441. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:47:31,086][117718] Avg episode reward: [(0, '2757.988')] [2023-03-07 03:47:31,468][118044] Updated weights for policy 0, policy_version 19030 (0.0006) [2023-03-07 03:47:32,255][118044] Updated weights for policy 0, policy_version 19040 (0.0006) [2023-03-07 03:47:33,038][118044] Updated weights for policy 0, policy_version 19050 (0.0007) [2023-03-07 03:47:33,830][118044] Updated weights for policy 0, policy_version 19060 (0.0006) [2023-03-07 03:47:34,605][118044] Updated weights for policy 0, policy_version 19070 (0.0006) [2023-03-07 03:47:35,373][118044] Updated weights for policy 0, policy_version 19080 (0.0006) [2023-03-07 03:47:36,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13138.4). Total num frames: 19546112. Throughput: 0: 13143.0. Samples: 19539288. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:47:36,086][117718] Avg episode reward: [(0, '2791.546')] [2023-03-07 03:47:36,147][118044] Updated weights for policy 0, policy_version 19090 (0.0006) [2023-03-07 03:47:36,913][118044] Updated weights for policy 0, policy_version 19100 (0.0006) [2023-03-07 03:47:37,713][118044] Updated weights for policy 0, policy_version 19110 (0.0006) [2023-03-07 03:47:38,489][118044] Updated weights for policy 0, policy_version 19120 (0.0006) [2023-03-07 03:47:39,261][118044] Updated weights for policy 0, policy_version 19130 (0.0006) [2023-03-07 03:47:40,069][118044] Updated weights for policy 0, policy_version 19140 (0.0006) [2023-03-07 03:47:40,843][118044] Updated weights for policy 0, policy_version 19150 (0.0006) [2023-03-07 03:47:41,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 19612672. Throughput: 0: 13147.4. Samples: 19578857. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:47:41,086][117718] Avg episode reward: [(0, '2670.078')] [2023-03-07 03:47:41,602][118044] Updated weights for policy 0, policy_version 19160 (0.0006) [2023-03-07 03:47:42,396][118044] Updated weights for policy 0, policy_version 19170 (0.0006) [2023-03-07 03:47:43,181][118044] Updated weights for policy 0, policy_version 19180 (0.0006) [2023-03-07 03:47:43,963][118044] Updated weights for policy 0, policy_version 19190 (0.0006) [2023-03-07 03:47:44,746][118044] Updated weights for policy 0, policy_version 19200 (0.0007) [2023-03-07 03:47:45,525][118044] Updated weights for policy 0, policy_version 19210 (0.0006) [2023-03-07 03:47:46,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 19678208. Throughput: 0: 13141.1. Samples: 19657613. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:47:46,086][117718] Avg episode reward: [(0, '2634.788')] [2023-03-07 03:47:46,315][118044] Updated weights for policy 0, policy_version 19220 (0.0006) [2023-03-07 03:47:47,092][118044] Updated weights for policy 0, policy_version 19230 (0.0006) [2023-03-07 03:47:47,876][118044] Updated weights for policy 0, policy_version 19240 (0.0005) [2023-03-07 03:47:48,645][118044] Updated weights for policy 0, policy_version 19250 (0.0005) [2023-03-07 03:47:49,436][118044] Updated weights for policy 0, policy_version 19260 (0.0006) [2023-03-07 03:47:50,213][118044] Updated weights for policy 0, policy_version 19270 (0.0007) [2023-03-07 03:47:50,987][118044] Updated weights for policy 0, policy_version 19280 (0.0006) [2023-03-07 03:47:51,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 19743744. Throughput: 0: 13131.1. Samples: 19736192. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:47:51,086][117718] Avg episode reward: [(0, '2543.896')] [2023-03-07 03:47:51,770][118044] Updated weights for policy 0, policy_version 19290 (0.0006) [2023-03-07 03:47:52,566][118044] Updated weights for policy 0, policy_version 19300 (0.0005) [2023-03-07 03:47:53,327][118044] Updated weights for policy 0, policy_version 19310 (0.0005) [2023-03-07 03:47:54,105][118044] Updated weights for policy 0, policy_version 19320 (0.0006) [2023-03-07 03:47:54,890][118044] Updated weights for policy 0, policy_version 19330 (0.0006) [2023-03-07 03:47:55,675][118044] Updated weights for policy 0, policy_version 19340 (0.0006) [2023-03-07 03:47:56,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 19809280. Throughput: 0: 13135.0. Samples: 19775440. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:47:56,086][117718] Avg episode reward: [(0, '2767.954')] [2023-03-07 03:47:56,449][118044] Updated weights for policy 0, policy_version 19350 (0.0007) [2023-03-07 03:47:57,239][118044] Updated weights for policy 0, policy_version 19360 (0.0006) [2023-03-07 03:47:58,006][118044] Updated weights for policy 0, policy_version 19370 (0.0006) [2023-03-07 03:47:58,792][118044] Updated weights for policy 0, policy_version 19380 (0.0006) [2023-03-07 03:47:59,561][118044] Updated weights for policy 0, policy_version 19390 (0.0006) [2023-03-07 03:48:00,330][118044] Updated weights for policy 0, policy_version 19400 (0.0006) [2023-03-07 03:48:01,085][117718] Fps is (10 sec: 13107.4, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 19874816. Throughput: 0: 13133.5. Samples: 19854312. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:48:01,086][117718] Avg episode reward: [(0, '2759.906')] [2023-03-07 03:48:01,125][118044] Updated weights for policy 0, policy_version 19410 (0.0007) [2023-03-07 03:48:01,905][118044] Updated weights for policy 0, policy_version 19420 (0.0006) [2023-03-07 03:48:02,699][118044] Updated weights for policy 0, policy_version 19430 (0.0006) [2023-03-07 03:48:03,488][118044] Updated weights for policy 0, policy_version 19440 (0.0006) [2023-03-07 03:48:04,245][118044] Updated weights for policy 0, policy_version 19450 (0.0006) [2023-03-07 03:48:05,030][118044] Updated weights for policy 0, policy_version 19460 (0.0007) [2023-03-07 03:48:05,819][118044] Updated weights for policy 0, policy_version 19470 (0.0007) [2023-03-07 03:48:06,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13124.2, 300 sec: 13138.4). Total num frames: 19940352. Throughput: 0: 13129.2. Samples: 19932995. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:48:06,086][117718] Avg episode reward: [(0, '2801.001')] [2023-03-07 03:48:06,596][118044] Updated weights for policy 0, policy_version 19480 (0.0006) [2023-03-07 03:48:07,366][118044] Updated weights for policy 0, policy_version 19490 (0.0006) [2023-03-07 03:48:08,148][118044] Updated weights for policy 0, policy_version 19500 (0.0006) [2023-03-07 03:48:08,949][118044] Updated weights for policy 0, policy_version 19510 (0.0006) [2023-03-07 03:48:09,715][118044] Updated weights for policy 0, policy_version 19520 (0.0006) [2023-03-07 03:48:10,479][118044] Updated weights for policy 0, policy_version 19530 (0.0006) [2023-03-07 03:48:11,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13138.4). Total num frames: 20005888. Throughput: 0: 13126.9. Samples: 19972357. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:48:11,086][117718] Avg episode reward: [(0, '2633.202')] [2023-03-07 03:48:11,261][118044] Updated weights for policy 0, policy_version 19540 (0.0007) [2023-03-07 03:48:12,054][118044] Updated weights for policy 0, policy_version 19550 (0.0006) [2023-03-07 03:48:12,834][118044] Updated weights for policy 0, policy_version 19560 (0.0006) [2023-03-07 03:48:13,613][118044] Updated weights for policy 0, policy_version 19570 (0.0007) [2023-03-07 03:48:14,374][118044] Updated weights for policy 0, policy_version 19580 (0.0006) [2023-03-07 03:48:15,152][118044] Updated weights for policy 0, policy_version 19590 (0.0006) [2023-03-07 03:48:15,930][118044] Updated weights for policy 0, policy_version 19600 (0.0006) [2023-03-07 03:48:16,085][117718] Fps is (10 sec: 13209.8, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 20072448. Throughput: 0: 13131.9. Samples: 20051376. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:48:16,086][117718] Avg episode reward: [(0, '2715.153')] [2023-03-07 03:48:16,694][118044] Updated weights for policy 0, policy_version 19610 (0.0006) [2023-03-07 03:48:17,490][118044] Updated weights for policy 0, policy_version 19620 (0.0007) [2023-03-07 03:48:18,264][118044] Updated weights for policy 0, policy_version 19630 (0.0006) [2023-03-07 03:48:19,043][118044] Updated weights for policy 0, policy_version 19640 (0.0006) [2023-03-07 03:48:19,807][118044] Updated weights for policy 0, policy_version 19650 (0.0006) [2023-03-07 03:48:20,590][118044] Updated weights for policy 0, policy_version 19660 (0.0006) [2023-03-07 03:48:21,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 20137984. Throughput: 0: 13139.1. Samples: 20130548. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:48:21,086][117718] Avg episode reward: [(0, '2723.344')] [2023-03-07 03:48:21,369][118044] Updated weights for policy 0, policy_version 19670 (0.0006) [2023-03-07 03:48:22,135][118044] Updated weights for policy 0, policy_version 19680 (0.0007) [2023-03-07 03:48:22,897][118044] Updated weights for policy 0, policy_version 19690 (0.0007) [2023-03-07 03:48:23,700][118044] Updated weights for policy 0, policy_version 19700 (0.0006) [2023-03-07 03:48:24,451][118044] Updated weights for policy 0, policy_version 19710 (0.0006) [2023-03-07 03:48:25,228][118044] Updated weights for policy 0, policy_version 19720 (0.0006) [2023-03-07 03:48:26,010][118044] Updated weights for policy 0, policy_version 19730 (0.0005) [2023-03-07 03:48:26,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 20203520. Throughput: 0: 13137.1. Samples: 20170027. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:48:26,086][117718] Avg episode reward: [(0, '2712.248')] [2023-03-07 03:48:26,791][118044] Updated weights for policy 0, policy_version 19740 (0.0006) [2023-03-07 03:48:27,565][118044] Updated weights for policy 0, policy_version 19750 (0.0007) [2023-03-07 03:48:28,323][118044] Updated weights for policy 0, policy_version 19760 (0.0006) [2023-03-07 03:48:29,084][118044] Updated weights for policy 0, policy_version 19770 (0.0006) [2023-03-07 03:48:29,870][118044] Updated weights for policy 0, policy_version 19780 (0.0005) [2023-03-07 03:48:30,652][118044] Updated weights for policy 0, policy_version 19790 (0.0006) [2023-03-07 03:48:31,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 20270080. Throughput: 0: 13157.7. Samples: 20249706. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:48:31,086][117718] Avg episode reward: [(0, '2613.069')] [2023-03-07 03:48:31,430][118044] Updated weights for policy 0, policy_version 19800 (0.0006) [2023-03-07 03:48:32,205][118044] Updated weights for policy 0, policy_version 19810 (0.0006) [2023-03-07 03:48:32,977][118044] Updated weights for policy 0, policy_version 19820 (0.0006) [2023-03-07 03:48:33,746][118044] Updated weights for policy 0, policy_version 19830 (0.0007) [2023-03-07 03:48:34,536][118044] Updated weights for policy 0, policy_version 19840 (0.0007) [2023-03-07 03:48:35,313][118044] Updated weights for policy 0, policy_version 19850 (0.0006) [2023-03-07 03:48:36,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 20335616. Throughput: 0: 13168.8. Samples: 20328789. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:48:36,086][117718] Avg episode reward: [(0, '2706.028')] [2023-03-07 03:48:36,090][118044] Updated weights for policy 0, policy_version 19860 (0.0006) [2023-03-07 03:48:36,090][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000019860_20336640.pth... [2023-03-07 03:48:36,123][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000016779_17181696.pth [2023-03-07 03:48:36,871][118044] Updated weights for policy 0, policy_version 19870 (0.0007) [2023-03-07 03:48:37,658][118044] Updated weights for policy 0, policy_version 19880 (0.0005) [2023-03-07 03:48:38,414][118044] Updated weights for policy 0, policy_version 19890 (0.0006) [2023-03-07 03:48:39,200][118044] Updated weights for policy 0, policy_version 19900 (0.0006) [2023-03-07 03:48:39,986][118044] Updated weights for policy 0, policy_version 19910 (0.0006) [2023-03-07 03:48:40,753][118044] Updated weights for policy 0, policy_version 19920 (0.0005) [2023-03-07 03:48:41,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 20402176. Throughput: 0: 13173.8. Samples: 20368261. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:48:41,086][117718] Avg episode reward: [(0, '2643.915')] [2023-03-07 03:48:41,532][118044] Updated weights for policy 0, policy_version 19930 (0.0006) [2023-03-07 03:48:42,292][118044] Updated weights for policy 0, policy_version 19940 (0.0006) [2023-03-07 03:48:43,076][118044] Updated weights for policy 0, policy_version 19950 (0.0009) [2023-03-07 03:48:43,888][118044] Updated weights for policy 0, policy_version 19960 (0.0006) [2023-03-07 03:48:44,667][118044] Updated weights for policy 0, policy_version 19970 (0.0006) [2023-03-07 03:48:45,482][118044] Updated weights for policy 0, policy_version 19980 (0.0006) [2023-03-07 03:48:46,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 20466688. Throughput: 0: 13169.4. Samples: 20446934. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:48:46,086][117718] Avg episode reward: [(0, '2641.036')] [2023-03-07 03:48:46,255][118044] Updated weights for policy 0, policy_version 19990 (0.0006) [2023-03-07 03:48:47,030][118044] Updated weights for policy 0, policy_version 20000 (0.0006) [2023-03-07 03:48:47,812][118044] Updated weights for policy 0, policy_version 20010 (0.0006) [2023-03-07 03:48:48,591][118044] Updated weights for policy 0, policy_version 20020 (0.0006) [2023-03-07 03:48:49,362][118044] Updated weights for policy 0, policy_version 20030 (0.0005) [2023-03-07 03:48:50,140][118044] Updated weights for policy 0, policy_version 20040 (0.0006) [2023-03-07 03:48:50,915][118044] Updated weights for policy 0, policy_version 20050 (0.0006) [2023-03-07 03:48:51,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 20533248. Throughput: 0: 13166.0. Samples: 20525464. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:48:51,086][117718] Avg episode reward: [(0, '2686.057')] [2023-03-07 03:48:51,705][118044] Updated weights for policy 0, policy_version 20060 (0.0006) [2023-03-07 03:48:52,470][118044] Updated weights for policy 0, policy_version 20070 (0.0006) [2023-03-07 03:48:53,240][118044] Updated weights for policy 0, policy_version 20080 (0.0006) [2023-03-07 03:48:54,017][118044] Updated weights for policy 0, policy_version 20090 (0.0005) [2023-03-07 03:48:54,781][118044] Updated weights for policy 0, policy_version 20100 (0.0006) [2023-03-07 03:48:55,562][118044] Updated weights for policy 0, policy_version 20110 (0.0005) [2023-03-07 03:48:56,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 20598784. Throughput: 0: 13176.2. Samples: 20565285. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:48:56,086][117718] Avg episode reward: [(0, '2590.403')] [2023-03-07 03:48:56,341][118044] Updated weights for policy 0, policy_version 20120 (0.0006) [2023-03-07 03:48:57,098][118044] Updated weights for policy 0, policy_version 20130 (0.0006) [2023-03-07 03:48:57,910][118044] Updated weights for policy 0, policy_version 20140 (0.0006) [2023-03-07 03:48:58,675][118044] Updated weights for policy 0, policy_version 20150 (0.0006) [2023-03-07 03:48:59,450][118044] Updated weights for policy 0, policy_version 20160 (0.0006) [2023-03-07 03:49:00,235][118044] Updated weights for policy 0, policy_version 20170 (0.0007) [2023-03-07 03:49:01,001][118044] Updated weights for policy 0, policy_version 20180 (0.0006) [2023-03-07 03:49:01,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13175.4, 300 sec: 13148.8). Total num frames: 20665344. Throughput: 0: 13177.4. Samples: 20644362. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:49:01,086][117718] Avg episode reward: [(0, '2694.293')] [2023-03-07 03:49:01,781][118044] Updated weights for policy 0, policy_version 20190 (0.0006) [2023-03-07 03:49:02,550][118044] Updated weights for policy 0, policy_version 20200 (0.0006) [2023-03-07 03:49:03,343][118044] Updated weights for policy 0, policy_version 20210 (0.0006) [2023-03-07 03:49:04,113][118044] Updated weights for policy 0, policy_version 20220 (0.0008) [2023-03-07 03:49:04,893][118044] Updated weights for policy 0, policy_version 20230 (0.0006) [2023-03-07 03:49:05,660][118044] Updated weights for policy 0, policy_version 20240 (0.0007) [2023-03-07 03:49:06,086][117718] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13148.9). Total num frames: 20730880. Throughput: 0: 13174.6. Samples: 20723404. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:49:06,086][117718] Avg episode reward: [(0, '2654.486')] [2023-03-07 03:49:06,437][118044] Updated weights for policy 0, policy_version 20250 (0.0006) [2023-03-07 03:49:07,207][118044] Updated weights for policy 0, policy_version 20260 (0.0006) [2023-03-07 03:49:07,979][118044] Updated weights for policy 0, policy_version 20270 (0.0006) [2023-03-07 03:49:08,786][118044] Updated weights for policy 0, policy_version 20280 (0.0006) [2023-03-07 03:49:09,542][118044] Updated weights for policy 0, policy_version 20290 (0.0006) [2023-03-07 03:49:10,334][118044] Updated weights for policy 0, policy_version 20300 (0.0006) [2023-03-07 03:49:11,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13175.4, 300 sec: 13148.8). Total num frames: 20796416. Throughput: 0: 13176.8. Samples: 20762984. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:49:11,086][117718] Avg episode reward: [(0, '2759.469')] [2023-03-07 03:49:11,102][118044] Updated weights for policy 0, policy_version 20310 (0.0005) [2023-03-07 03:49:11,875][118044] Updated weights for policy 0, policy_version 20320 (0.0006) [2023-03-07 03:49:12,646][118044] Updated weights for policy 0, policy_version 20330 (0.0006) [2023-03-07 03:49:13,435][118044] Updated weights for policy 0, policy_version 20340 (0.0007) [2023-03-07 03:49:14,202][118044] Updated weights for policy 0, policy_version 20350 (0.0006) [2023-03-07 03:49:14,999][118044] Updated weights for policy 0, policy_version 20360 (0.0007) [2023-03-07 03:49:15,758][118044] Updated weights for policy 0, policy_version 20370 (0.0006) [2023-03-07 03:49:16,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13175.4, 300 sec: 13152.3). Total num frames: 20862976. Throughput: 0: 13162.6. Samples: 20842024. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:49:16,097][117718] Avg episode reward: [(0, '2715.933')] [2023-03-07 03:49:16,547][118044] Updated weights for policy 0, policy_version 20380 (0.0006) [2023-03-07 03:49:17,348][118044] Updated weights for policy 0, policy_version 20390 (0.0006) [2023-03-07 03:49:18,129][118044] Updated weights for policy 0, policy_version 20400 (0.0006) [2023-03-07 03:49:18,908][118044] Updated weights for policy 0, policy_version 20410 (0.0005) [2023-03-07 03:49:19,692][118044] Updated weights for policy 0, policy_version 20420 (0.0007) [2023-03-07 03:49:20,460][118044] Updated weights for policy 0, policy_version 20430 (0.0007) [2023-03-07 03:49:21,086][117718] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13152.3). Total num frames: 20928512. Throughput: 0: 13152.0. Samples: 20920630. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:49:21,097][117718] Avg episode reward: [(0, '2695.689')] [2023-03-07 03:49:21,232][118044] Updated weights for policy 0, policy_version 20440 (0.0006) [2023-03-07 03:49:22,016][118044] Updated weights for policy 0, policy_version 20450 (0.0006) [2023-03-07 03:49:22,811][118044] Updated weights for policy 0, policy_version 20460 (0.0007) [2023-03-07 03:49:23,601][118044] Updated weights for policy 0, policy_version 20470 (0.0006) [2023-03-07 03:49:24,370][118044] Updated weights for policy 0, policy_version 20480 (0.0006) [2023-03-07 03:49:25,152][118044] Updated weights for policy 0, policy_version 20490 (0.0006) [2023-03-07 03:49:25,921][118044] Updated weights for policy 0, policy_version 20500 (0.0006) [2023-03-07 03:49:26,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13175.5, 300 sec: 13148.9). Total num frames: 20994048. Throughput: 0: 13149.9. Samples: 20960008. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:49:26,096][117718] Avg episode reward: [(0, '2756.834')] [2023-03-07 03:49:26,702][118044] Updated weights for policy 0, policy_version 20510 (0.0007) [2023-03-07 03:49:27,486][118044] Updated weights for policy 0, policy_version 20520 (0.0006) [2023-03-07 03:49:28,253][118044] Updated weights for policy 0, policy_version 20530 (0.0007) [2023-03-07 03:49:29,040][118044] Updated weights for policy 0, policy_version 20540 (0.0006) [2023-03-07 03:49:29,821][118044] Updated weights for policy 0, policy_version 20550 (0.0006) [2023-03-07 03:49:30,596][118044] Updated weights for policy 0, policy_version 20560 (0.0006) [2023-03-07 03:49:31,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 21059584. Throughput: 0: 13153.9. Samples: 21038858. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:49:31,096][117718] Avg episode reward: [(0, '2546.181')] [2023-03-07 03:49:31,355][118044] Updated weights for policy 0, policy_version 20570 (0.0005) [2023-03-07 03:49:32,132][118044] Updated weights for policy 0, policy_version 20580 (0.0007) [2023-03-07 03:49:32,907][118044] Updated weights for policy 0, policy_version 20590 (0.0006) [2023-03-07 03:49:33,705][118044] Updated weights for policy 0, policy_version 20600 (0.0007) [2023-03-07 03:49:34,493][118044] Updated weights for policy 0, policy_version 20610 (0.0006) [2023-03-07 03:49:35,255][118044] Updated weights for policy 0, policy_version 20620 (0.0006) [2023-03-07 03:49:36,046][118044] Updated weights for policy 0, policy_version 20630 (0.0006) [2023-03-07 03:49:36,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 21125120. Throughput: 0: 13165.8. Samples: 21117925. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:49:36,086][117718] Avg episode reward: [(0, '2546.723')] [2023-03-07 03:49:36,854][118044] Updated weights for policy 0, policy_version 20640 (0.0006) [2023-03-07 03:49:37,614][118044] Updated weights for policy 0, policy_version 20650 (0.0006) [2023-03-07 03:49:38,390][118044] Updated weights for policy 0, policy_version 20660 (0.0006) [2023-03-07 03:49:39,166][118044] Updated weights for policy 0, policy_version 20670 (0.0006) [2023-03-07 03:49:39,945][118044] Updated weights for policy 0, policy_version 20680 (0.0006) [2023-03-07 03:49:40,710][118044] Updated weights for policy 0, policy_version 20690 (0.0006) [2023-03-07 03:49:41,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 21190656. Throughput: 0: 13151.9. Samples: 21157118. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:49:41,086][117718] Avg episode reward: [(0, '2578.040')] [2023-03-07 03:49:41,490][118044] Updated weights for policy 0, policy_version 20700 (0.0006) [2023-03-07 03:49:42,253][118044] Updated weights for policy 0, policy_version 20710 (0.0006) [2023-03-07 03:49:43,038][118044] Updated weights for policy 0, policy_version 20720 (0.0006) [2023-03-07 03:49:43,822][118044] Updated weights for policy 0, policy_version 20730 (0.0006) [2023-03-07 03:49:44,611][118044] Updated weights for policy 0, policy_version 20740 (0.0007) [2023-03-07 03:49:45,398][118044] Updated weights for policy 0, policy_version 20750 (0.0007) [2023-03-07 03:49:46,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13175.5, 300 sec: 13148.9). Total num frames: 21257216. Throughput: 0: 13152.5. Samples: 21236224. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:49:46,086][117718] Avg episode reward: [(0, '2732.339')] [2023-03-07 03:49:46,162][118044] Updated weights for policy 0, policy_version 20760 (0.0006) [2023-03-07 03:49:46,949][118044] Updated weights for policy 0, policy_version 20770 (0.0007) [2023-03-07 03:49:47,723][118044] Updated weights for policy 0, policy_version 20780 (0.0006) [2023-03-07 03:49:48,511][118044] Updated weights for policy 0, policy_version 20790 (0.0006) [2023-03-07 03:49:49,297][118044] Updated weights for policy 0, policy_version 20800 (0.0006) [2023-03-07 03:49:50,064][118044] Updated weights for policy 0, policy_version 20810 (0.0006) [2023-03-07 03:49:50,840][118044] Updated weights for policy 0, policy_version 20820 (0.0006) [2023-03-07 03:49:51,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 21322752. Throughput: 0: 13146.9. Samples: 21315015. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:49:51,086][117718] Avg episode reward: [(0, '2753.294')] [2023-03-07 03:49:51,626][118044] Updated weights for policy 0, policy_version 20830 (0.0006) [2023-03-07 03:49:52,411][118044] Updated weights for policy 0, policy_version 20840 (0.0006) [2023-03-07 03:49:53,208][118044] Updated weights for policy 0, policy_version 20850 (0.0006) [2023-03-07 03:49:53,977][118044] Updated weights for policy 0, policy_version 20860 (0.0006) [2023-03-07 03:49:54,763][118044] Updated weights for policy 0, policy_version 20870 (0.0006) [2023-03-07 03:49:55,522][118044] Updated weights for policy 0, policy_version 20880 (0.0006) [2023-03-07 03:49:56,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 21388288. Throughput: 0: 13137.2. Samples: 21354154. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:49:56,086][117718] Avg episode reward: [(0, '2684.652')] [2023-03-07 03:49:56,301][118044] Updated weights for policy 0, policy_version 20890 (0.0006) [2023-03-07 03:49:57,073][118044] Updated weights for policy 0, policy_version 20900 (0.0005) [2023-03-07 03:49:57,836][118044] Updated weights for policy 0, policy_version 20910 (0.0006) [2023-03-07 03:49:58,609][118044] Updated weights for policy 0, policy_version 20920 (0.0007) [2023-03-07 03:49:59,414][118044] Updated weights for policy 0, policy_version 20930 (0.0006) [2023-03-07 03:50:00,174][118044] Updated weights for policy 0, policy_version 20940 (0.0006) [2023-03-07 03:50:00,961][118044] Updated weights for policy 0, policy_version 20950 (0.0006) [2023-03-07 03:50:01,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13141.3, 300 sec: 13148.8). Total num frames: 21453824. Throughput: 0: 13139.4. Samples: 21433300. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:50:01,086][117718] Avg episode reward: [(0, '2676.975')] [2023-03-07 03:50:01,723][118044] Updated weights for policy 0, policy_version 20960 (0.0006) [2023-03-07 03:50:02,516][118044] Updated weights for policy 0, policy_version 20970 (0.0006) [2023-03-07 03:50:03,287][118044] Updated weights for policy 0, policy_version 20980 (0.0006) [2023-03-07 03:50:04,065][118044] Updated weights for policy 0, policy_version 20990 (0.0006) [2023-03-07 03:50:04,834][118044] Updated weights for policy 0, policy_version 21000 (0.0007) [2023-03-07 03:50:05,630][118044] Updated weights for policy 0, policy_version 21010 (0.0006) [2023-03-07 03:50:06,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 21519360. Throughput: 0: 13152.4. Samples: 21512487. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:50:06,086][117718] Avg episode reward: [(0, '2579.733')] [2023-03-07 03:50:06,398][118044] Updated weights for policy 0, policy_version 21020 (0.0006) [2023-03-07 03:50:07,182][118044] Updated weights for policy 0, policy_version 21030 (0.0006) [2023-03-07 03:50:07,961][118044] Updated weights for policy 0, policy_version 21040 (0.0005) [2023-03-07 03:50:08,744][118044] Updated weights for policy 0, policy_version 21050 (0.0006) [2023-03-07 03:50:09,525][118044] Updated weights for policy 0, policy_version 21060 (0.0006) [2023-03-07 03:50:10,290][118044] Updated weights for policy 0, policy_version 21070 (0.0005) [2023-03-07 03:50:11,063][118044] Updated weights for policy 0, policy_version 21080 (0.0005) [2023-03-07 03:50:11,085][117718] Fps is (10 sec: 13209.8, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 21585920. Throughput: 0: 13152.2. Samples: 21551858. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:50:11,086][117718] Avg episode reward: [(0, '2550.943')] [2023-03-07 03:50:11,832][118044] Updated weights for policy 0, policy_version 21090 (0.0006) [2023-03-07 03:50:12,612][118044] Updated weights for policy 0, policy_version 21100 (0.0006) [2023-03-07 03:50:13,394][118044] Updated weights for policy 0, policy_version 21110 (0.0006) [2023-03-07 03:50:14,170][118044] Updated weights for policy 0, policy_version 21120 (0.0005) [2023-03-07 03:50:14,939][118044] Updated weights for policy 0, policy_version 21130 (0.0006) [2023-03-07 03:50:15,713][118044] Updated weights for policy 0, policy_version 21140 (0.0006) [2023-03-07 03:50:16,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13141.3, 300 sec: 13152.3). Total num frames: 21651456. Throughput: 0: 13159.3. Samples: 21631029. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:50:16,086][117718] Avg episode reward: [(0, '2500.773')] [2023-03-07 03:50:16,485][118044] Updated weights for policy 0, policy_version 21150 (0.0006) [2023-03-07 03:50:17,250][118044] Updated weights for policy 0, policy_version 21160 (0.0005) [2023-03-07 03:50:18,034][118044] Updated weights for policy 0, policy_version 21170 (0.0006) [2023-03-07 03:50:18,813][118044] Updated weights for policy 0, policy_version 21180 (0.0006) [2023-03-07 03:50:19,595][118044] Updated weights for policy 0, policy_version 21190 (0.0006) [2023-03-07 03:50:20,371][118044] Updated weights for policy 0, policy_version 21200 (0.0006) [2023-03-07 03:50:21,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13152.3). Total num frames: 21716992. Throughput: 0: 13161.8. Samples: 21710205. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:50:21,086][117718] Avg episode reward: [(0, '2682.789')] [2023-03-07 03:50:21,162][118044] Updated weights for policy 0, policy_version 21210 (0.0006) [2023-03-07 03:50:21,943][118044] Updated weights for policy 0, policy_version 21220 (0.0006) [2023-03-07 03:50:22,726][118044] Updated weights for policy 0, policy_version 21230 (0.0006) [2023-03-07 03:50:23,510][118044] Updated weights for policy 0, policy_version 21240 (0.0006) [2023-03-07 03:50:24,264][118044] Updated weights for policy 0, policy_version 21250 (0.0006) [2023-03-07 03:50:25,041][118044] Updated weights for policy 0, policy_version 21260 (0.0006) [2023-03-07 03:50:25,838][118044] Updated weights for policy 0, policy_version 21270 (0.0006) [2023-03-07 03:50:26,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 21783552. Throughput: 0: 13165.9. Samples: 21749585. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:50:26,086][117718] Avg episode reward: [(0, '2537.958')] [2023-03-07 03:50:26,613][118044] Updated weights for policy 0, policy_version 21280 (0.0007) [2023-03-07 03:50:27,400][118044] Updated weights for policy 0, policy_version 21290 (0.0006) [2023-03-07 03:50:28,197][118044] Updated weights for policy 0, policy_version 21300 (0.0006) [2023-03-07 03:50:28,978][118044] Updated weights for policy 0, policy_version 21310 (0.0006) [2023-03-07 03:50:29,740][118044] Updated weights for policy 0, policy_version 21320 (0.0006) [2023-03-07 03:50:30,528][118044] Updated weights for policy 0, policy_version 21330 (0.0007) [2023-03-07 03:50:31,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 21849088. Throughput: 0: 13155.8. Samples: 21828236. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:50:31,086][117718] Avg episode reward: [(0, '2441.674')] [2023-03-07 03:50:31,312][118044] Updated weights for policy 0, policy_version 21340 (0.0006) [2023-03-07 03:50:32,090][118044] Updated weights for policy 0, policy_version 21350 (0.0006) [2023-03-07 03:50:32,872][118044] Updated weights for policy 0, policy_version 21360 (0.0006) [2023-03-07 03:50:33,653][118044] Updated weights for policy 0, policy_version 21370 (0.0006) [2023-03-07 03:50:34,426][118044] Updated weights for policy 0, policy_version 21380 (0.0006) [2023-03-07 03:50:35,217][118044] Updated weights for policy 0, policy_version 21390 (0.0005) [2023-03-07 03:50:35,987][118044] Updated weights for policy 0, policy_version 21400 (0.0007) [2023-03-07 03:50:36,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 21914624. Throughput: 0: 13153.8. Samples: 21906937. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:50:36,086][117718] Avg episode reward: [(0, '2530.558')] [2023-03-07 03:50:36,093][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000021401_21914624.pth... 
[2023-03-07 03:50:36,122][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000018318_18757632.pth [2023-03-07 03:50:36,790][118044] Updated weights for policy 0, policy_version 21410 (0.0006) [2023-03-07 03:50:37,573][118044] Updated weights for policy 0, policy_version 21420 (0.0006) [2023-03-07 03:50:38,345][118044] Updated weights for policy 0, policy_version 21430 (0.0005) [2023-03-07 03:50:39,119][118044] Updated weights for policy 0, policy_version 21440 (0.0006) [2023-03-07 03:50:39,900][118044] Updated weights for policy 0, policy_version 21450 (0.0006) [2023-03-07 03:50:40,683][118044] Updated weights for policy 0, policy_version 21460 (0.0006) [2023-03-07 03:50:41,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 21980160. Throughput: 0: 13157.2. Samples: 21946227. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:50:41,086][117718] Avg episode reward: [(0, '2632.726')] [2023-03-07 03:50:41,466][118044] Updated weights for policy 0, policy_version 21470 (0.0005) [2023-03-07 03:50:42,241][118044] Updated weights for policy 0, policy_version 21480 (0.0006) [2023-03-07 03:50:43,024][118044] Updated weights for policy 0, policy_version 21490 (0.0005) [2023-03-07 03:50:43,803][118044] Updated weights for policy 0, policy_version 21500 (0.0006) [2023-03-07 03:50:44,577][118044] Updated weights for policy 0, policy_version 21510 (0.0006) [2023-03-07 03:50:45,353][118044] Updated weights for policy 0, policy_version 21520 (0.0005) [2023-03-07 03:50:46,085][117718] Fps is (10 sec: 13107.4, 60 sec: 13141.4, 300 sec: 13152.3). Total num frames: 22045696. Throughput: 0: 13150.8. Samples: 22025084. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:50:46,086][117718] Avg episode reward: [(0, '2488.978')] [2023-03-07 03:50:46,131][118044] Updated weights for policy 0, policy_version 21530 (0.0007) [2023-03-07 03:50:46,907][118044] Updated weights for policy 0, policy_version 21540 (0.0006) [2023-03-07 03:50:47,676][118044] Updated weights for policy 0, policy_version 21550 (0.0006) [2023-03-07 03:50:48,449][118044] Updated weights for policy 0, policy_version 21560 (0.0006) [2023-03-07 03:50:49,232][118044] Updated weights for policy 0, policy_version 21570 (0.0007) [2023-03-07 03:50:50,018][118044] Updated weights for policy 0, policy_version 21580 (0.0006) [2023-03-07 03:50:50,805][118044] Updated weights for policy 0, policy_version 21590 (0.0006) [2023-03-07 03:50:51,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13152.3). Total num frames: 22111232. Throughput: 0: 13143.0. Samples: 22103923. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:50:51,086][117718] Avg episode reward: [(0, '2533.714')] [2023-03-07 03:50:51,578][118044] Updated weights for policy 0, policy_version 21600 (0.0006) [2023-03-07 03:50:52,357][118044] Updated weights for policy 0, policy_version 21610 (0.0006) [2023-03-07 03:50:53,133][118044] Updated weights for policy 0, policy_version 21620 (0.0005) [2023-03-07 03:50:53,914][118044] Updated weights for policy 0, policy_version 21630 (0.0006) [2023-03-07 03:50:54,669][118044] Updated weights for policy 0, policy_version 21640 (0.0006) [2023-03-07 03:50:55,469][118044] Updated weights for policy 0, policy_version 21650 (0.0006) [2023-03-07 03:50:56,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13155.8). Total num frames: 22177792. Throughput: 0: 13146.9. Samples: 22143468. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 03:50:56,086][117718] Avg episode reward: [(0, '2465.233')] [2023-03-07 03:50:56,233][118044] Updated weights for policy 0, policy_version 21660 (0.0006) [2023-03-07 03:50:57,024][118044] Updated weights for policy 0, policy_version 21670 (0.0008) [2023-03-07 03:50:57,822][118044] Updated weights for policy 0, policy_version 21680 (0.0007) [2023-03-07 03:50:58,610][118044] Updated weights for policy 0, policy_version 21690 (0.0006) [2023-03-07 03:50:59,394][118044] Updated weights for policy 0, policy_version 21700 (0.0007) [2023-03-07 03:51:00,183][118044] Updated weights for policy 0, policy_version 21710 (0.0007) [2023-03-07 03:51:00,949][118044] Updated weights for policy 0, policy_version 21720 (0.0006) [2023-03-07 03:51:01,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.4, 300 sec: 13148.9). Total num frames: 22242304. Throughput: 0: 13134.3. Samples: 22222073. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 03:51:01,086][117718] Avg episode reward: [(0, '2441.207')] [2023-03-07 03:51:01,704][118044] Updated weights for policy 0, policy_version 21730 (0.0006) [2023-03-07 03:51:02,493][118044] Updated weights for policy 0, policy_version 21740 (0.0006) [2023-03-07 03:51:03,280][118044] Updated weights for policy 0, policy_version 21750 (0.0006) [2023-03-07 03:51:04,046][118044] Updated weights for policy 0, policy_version 21760 (0.0005) [2023-03-07 03:51:04,839][118044] Updated weights for policy 0, policy_version 21770 (0.0006) [2023-03-07 03:51:05,606][118044] Updated weights for policy 0, policy_version 21780 (0.0006) [2023-03-07 03:51:06,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 22308864. Throughput: 0: 13131.9. Samples: 22301141. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 03:51:06,086][117718] Avg episode reward: [(0, '2545.944')] [2023-03-07 03:51:06,378][118044] Updated weights for policy 0, policy_version 21790 (0.0006) [2023-03-07 03:51:07,178][118044] Updated weights for policy 0, policy_version 21800 (0.0006) [2023-03-07 03:51:07,954][118044] Updated weights for policy 0, policy_version 21810 (0.0006) [2023-03-07 03:51:08,736][118044] Updated weights for policy 0, policy_version 21820 (0.0006) [2023-03-07 03:51:09,519][118044] Updated weights for policy 0, policy_version 21830 (0.0006) [2023-03-07 03:51:10,280][118044] Updated weights for policy 0, policy_version 21840 (0.0006) [2023-03-07 03:51:11,062][118044] Updated weights for policy 0, policy_version 21850 (0.0006) [2023-03-07 03:51:11,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13141.3, 300 sec: 13152.3). Total num frames: 22374400. Throughput: 0: 13133.2. Samples: 22340578. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:51:11,086][117718] Avg episode reward: [(0, '2455.799')] [2023-03-07 03:51:11,836][118044] Updated weights for policy 0, policy_version 21860 (0.0006) [2023-03-07 03:51:12,615][118044] Updated weights for policy 0, policy_version 21870 (0.0006) [2023-03-07 03:51:13,378][118044] Updated weights for policy 0, policy_version 21880 (0.0006) [2023-03-07 03:51:14,172][118044] Updated weights for policy 0, policy_version 21890 (0.0006) [2023-03-07 03:51:14,956][118044] Updated weights for policy 0, policy_version 21900 (0.0007) [2023-03-07 03:51:15,757][118044] Updated weights for policy 0, policy_version 21910 (0.0006) [2023-03-07 03:51:16,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.4, 300 sec: 13148.9). Total num frames: 22439936. Throughput: 0: 13137.9. Samples: 22419440. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:51:16,086][117718] Avg episode reward: [(0, '2423.994')] [2023-03-07 03:51:16,527][118044] Updated weights for policy 0, policy_version 21920 (0.0006) [2023-03-07 03:51:17,319][118044] Updated weights for policy 0, policy_version 21930 (0.0006) [2023-03-07 03:51:18,119][118044] Updated weights for policy 0, policy_version 21940 (0.0006) [2023-03-07 03:51:18,885][118044] Updated weights for policy 0, policy_version 21950 (0.0005) [2023-03-07 03:51:19,671][118044] Updated weights for policy 0, policy_version 21960 (0.0006) [2023-03-07 03:51:20,442][118044] Updated weights for policy 0, policy_version 21970 (0.0006) [2023-03-07 03:51:21,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 22505472. Throughput: 0: 13132.7. Samples: 22497905. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:51:21,086][117718] Avg episode reward: [(0, '2509.657')] [2023-03-07 03:51:21,207][118044] Updated weights for policy 0, policy_version 21980 (0.0006) [2023-03-07 03:51:22,017][118044] Updated weights for policy 0, policy_version 21990 (0.0006) [2023-03-07 03:51:22,802][118044] Updated weights for policy 0, policy_version 22000 (0.0007) [2023-03-07 03:51:23,550][118044] Updated weights for policy 0, policy_version 22010 (0.0007) [2023-03-07 03:51:24,342][118044] Updated weights for policy 0, policy_version 22020 (0.0007) [2023-03-07 03:51:25,100][118044] Updated weights for policy 0, policy_version 22030 (0.0006) [2023-03-07 03:51:25,859][118044] Updated weights for policy 0, policy_version 22040 (0.0006) [2023-03-07 03:51:26,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13124.3, 300 sec: 13148.9). Total num frames: 22571008. Throughput: 0: 13135.6. Samples: 22537329. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:51:26,086][117718] Avg episode reward: [(0, '2499.877')] [2023-03-07 03:51:26,644][118044] Updated weights for policy 0, policy_version 22050 (0.0005) [2023-03-07 03:51:27,421][118044] Updated weights for policy 0, policy_version 22060 (0.0007) [2023-03-07 03:51:28,210][118044] Updated weights for policy 0, policy_version 22070 (0.0006) [2023-03-07 03:51:29,005][118044] Updated weights for policy 0, policy_version 22080 (0.0006) [2023-03-07 03:51:29,783][118044] Updated weights for policy 0, policy_version 22090 (0.0006) [2023-03-07 03:51:30,555][118044] Updated weights for policy 0, policy_version 22100 (0.0005) [2023-03-07 03:51:31,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13148.9). Total num frames: 22636544. Throughput: 0: 13138.4. Samples: 22616311. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:51:31,086][117718] Avg episode reward: [(0, '2616.571')] [2023-03-07 03:51:31,333][118044] Updated weights for policy 0, policy_version 22110 (0.0005) [2023-03-07 03:51:32,122][118044] Updated weights for policy 0, policy_version 22120 (0.0006) [2023-03-07 03:51:32,906][118044] Updated weights for policy 0, policy_version 22130 (0.0006) [2023-03-07 03:51:33,691][118044] Updated weights for policy 0, policy_version 22140 (0.0005) [2023-03-07 03:51:34,475][118044] Updated weights for policy 0, policy_version 22150 (0.0007) [2023-03-07 03:51:35,250][118044] Updated weights for policy 0, policy_version 22160 (0.0006) [2023-03-07 03:51:36,012][118044] Updated weights for policy 0, policy_version 22170 (0.0007) [2023-03-07 03:51:36,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13124.3, 300 sec: 13148.9). Total num frames: 22702080. Throughput: 0: 13133.4. Samples: 22694929. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:51:36,086][117718] Avg episode reward: [(0, '2590.196')] [2023-03-07 03:51:36,802][118044] Updated weights for policy 0, policy_version 22180 (0.0006) [2023-03-07 03:51:37,597][118044] Updated weights for policy 0, policy_version 22190 (0.0007) [2023-03-07 03:51:38,367][118044] Updated weights for policy 0, policy_version 22200 (0.0007) [2023-03-07 03:51:39,142][118044] Updated weights for policy 0, policy_version 22210 (0.0007) [2023-03-07 03:51:39,936][118044] Updated weights for policy 0, policy_version 22220 (0.0006) [2023-03-07 03:51:40,704][118044] Updated weights for policy 0, policy_version 22230 (0.0006) [2023-03-07 03:51:41,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13145.4). Total num frames: 22767616. Throughput: 0: 13128.7. Samples: 22734259. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:51:41,086][117718] Avg episode reward: [(0, '2517.617')] [2023-03-07 03:51:41,486][118044] Updated weights for policy 0, policy_version 22240 (0.0006) [2023-03-07 03:51:42,281][118044] Updated weights for policy 0, policy_version 22250 (0.0006) [2023-03-07 03:51:43,050][118044] Updated weights for policy 0, policy_version 22260 (0.0006) [2023-03-07 03:51:43,810][118044] Updated weights for policy 0, policy_version 22270 (0.0006) [2023-03-07 03:51:44,605][118044] Updated weights for policy 0, policy_version 22280 (0.0006) [2023-03-07 03:51:45,378][118044] Updated weights for policy 0, policy_version 22290 (0.0006) [2023-03-07 03:51:46,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 22834176. Throughput: 0: 13136.9. Samples: 22813231. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:51:46,086][117718] Avg episode reward: [(0, '2571.045')] [2023-03-07 03:51:46,159][118044] Updated weights for policy 0, policy_version 22300 (0.0006) [2023-03-07 03:51:46,946][118044] Updated weights for policy 0, policy_version 22310 (0.0006) [2023-03-07 03:51:47,712][118044] Updated weights for policy 0, policy_version 22320 (0.0005) [2023-03-07 03:51:48,510][118044] Updated weights for policy 0, policy_version 22330 (0.0006) [2023-03-07 03:51:49,269][118044] Updated weights for policy 0, policy_version 22340 (0.0006) [2023-03-07 03:51:50,019][118044] Updated weights for policy 0, policy_version 22350 (0.0005) [2023-03-07 03:51:50,810][118044] Updated weights for policy 0, policy_version 22360 (0.0006) [2023-03-07 03:51:51,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 22899712. Throughput: 0: 13136.3. Samples: 22892277. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:51:51,086][117718] Avg episode reward: [(0, '2488.293')] [2023-03-07 03:51:51,573][118044] Updated weights for policy 0, policy_version 22370 (0.0006) [2023-03-07 03:51:52,372][118044] Updated weights for policy 0, policy_version 22380 (0.0006) [2023-03-07 03:51:53,148][118044] Updated weights for policy 0, policy_version 22390 (0.0006) [2023-03-07 03:51:53,922][118044] Updated weights for policy 0, policy_version 22400 (0.0006) [2023-03-07 03:51:54,701][118044] Updated weights for policy 0, policy_version 22410 (0.0006) [2023-03-07 03:51:55,487][118044] Updated weights for policy 0, policy_version 22420 (0.0007) [2023-03-07 03:51:56,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13124.3, 300 sec: 13148.8). Total num frames: 22965248. Throughput: 0: 13139.6. Samples: 22931859. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:51:56,097][117718] Avg episode reward: [(0, '2511.638')] [2023-03-07 03:51:56,255][118044] Updated weights for policy 0, policy_version 22430 (0.0006) [2023-03-07 03:51:57,040][118044] Updated weights for policy 0, policy_version 22440 (0.0006) [2023-03-07 03:51:57,813][118044] Updated weights for policy 0, policy_version 22450 (0.0006) [2023-03-07 03:51:58,592][118044] Updated weights for policy 0, policy_version 22460 (0.0006) [2023-03-07 03:51:59,385][118044] Updated weights for policy 0, policy_version 22470 (0.0006) [2023-03-07 03:52:00,153][118044] Updated weights for policy 0, policy_version 22480 (0.0006) [2023-03-07 03:52:00,959][118044] Updated weights for policy 0, policy_version 22490 (0.0006) [2023-03-07 03:52:01,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 23030784. Throughput: 0: 13137.6. Samples: 23010635. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:52:01,096][117718] Avg episode reward: [(0, '2577.417')] [2023-03-07 03:52:01,735][118044] Updated weights for policy 0, policy_version 22500 (0.0007) [2023-03-07 03:52:02,517][118044] Updated weights for policy 0, policy_version 22510 (0.0006) [2023-03-07 03:52:03,308][118044] Updated weights for policy 0, policy_version 22520 (0.0006) [2023-03-07 03:52:04,078][118044] Updated weights for policy 0, policy_version 22530 (0.0005) [2023-03-07 03:52:04,868][118044] Updated weights for policy 0, policy_version 22540 (0.0008) [2023-03-07 03:52:05,640][118044] Updated weights for policy 0, policy_version 22550 (0.0006) [2023-03-07 03:52:06,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.2, 300 sec: 13148.9). Total num frames: 23096320. Throughput: 0: 13138.4. Samples: 23089132. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:52:06,097][117718] Avg episode reward: [(0, '2601.385')] [2023-03-07 03:52:06,415][118044] Updated weights for policy 0, policy_version 22560 (0.0006) [2023-03-07 03:52:07,192][118044] Updated weights for policy 0, policy_version 22570 (0.0007) [2023-03-07 03:52:07,971][118044] Updated weights for policy 0, policy_version 22580 (0.0006) [2023-03-07 03:52:08,734][118044] Updated weights for policy 0, policy_version 22590 (0.0006) [2023-03-07 03:52:09,514][118044] Updated weights for policy 0, policy_version 22600 (0.0006) [2023-03-07 03:52:10,305][118044] Updated weights for policy 0, policy_version 22610 (0.0007) [2023-03-07 03:52:11,070][118044] Updated weights for policy 0, policy_version 22620 (0.0007) [2023-03-07 03:52:11,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.4, 300 sec: 13148.9). Total num frames: 23162880. Throughput: 0: 13142.3. Samples: 23128732. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:52:11,096][117718] Avg episode reward: [(0, '2656.329')] [2023-03-07 03:52:11,838][118044] Updated weights for policy 0, policy_version 22630 (0.0006) [2023-03-07 03:52:12,622][118044] Updated weights for policy 0, policy_version 22640 (0.0006) [2023-03-07 03:52:13,396][118044] Updated weights for policy 0, policy_version 22650 (0.0006) [2023-03-07 03:52:14,179][118044] Updated weights for policy 0, policy_version 22660 (0.0005) [2023-03-07 03:52:14,951][118044] Updated weights for policy 0, policy_version 22670 (0.0006) [2023-03-07 03:52:15,734][118044] Updated weights for policy 0, policy_version 22680 (0.0006) [2023-03-07 03:52:16,086][117718] Fps is (10 sec: 13209.7, 60 sec: 13141.3, 300 sec: 13148.8). Total num frames: 23228416. Throughput: 0: 13147.7. Samples: 23207956. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:52:16,097][117718] Avg episode reward: [(0, '2570.745')] [2023-03-07 03:52:16,516][118044] Updated weights for policy 0, policy_version 22690 (0.0006) [2023-03-07 03:52:17,285][118044] Updated weights for policy 0, policy_version 22700 (0.0007) [2023-03-07 03:52:18,070][118044] Updated weights for policy 0, policy_version 22710 (0.0008) [2023-03-07 03:52:18,854][118044] Updated weights for policy 0, policy_version 22720 (0.0006) [2023-03-07 03:52:19,633][118044] Updated weights for policy 0, policy_version 22730 (0.0006) [2023-03-07 03:52:20,404][118044] Updated weights for policy 0, policy_version 22740 (0.0006) [2023-03-07 03:52:21,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 23293952. Throughput: 0: 13154.7. Samples: 23286892. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:52:21,086][117718] Avg episode reward: [(0, '2682.228')] [2023-03-07 03:52:21,165][118044] Updated weights for policy 0, policy_version 22750 (0.0006) [2023-03-07 03:52:21,939][118044] Updated weights for policy 0, policy_version 22760 (0.0006) [2023-03-07 03:52:22,725][118044] Updated weights for policy 0, policy_version 22770 (0.0006) [2023-03-07 03:52:23,489][118044] Updated weights for policy 0, policy_version 22780 (0.0006) [2023-03-07 03:52:24,262][118044] Updated weights for policy 0, policy_version 22790 (0.0006) [2023-03-07 03:52:25,054][118044] Updated weights for policy 0, policy_version 22800 (0.0006) [2023-03-07 03:52:25,836][118044] Updated weights for policy 0, policy_version 22810 (0.0007) [2023-03-07 03:52:26,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 23360512. Throughput: 0: 13161.5. Samples: 23326528. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:52:26,086][117718] Avg episode reward: [(0, '2654.512')] [2023-03-07 03:52:26,603][118044] Updated weights for policy 0, policy_version 22820 (0.0006) [2023-03-07 03:52:27,386][118044] Updated weights for policy 0, policy_version 22830 (0.0006) [2023-03-07 03:52:28,163][118044] Updated weights for policy 0, policy_version 22840 (0.0007) [2023-03-07 03:52:28,938][118044] Updated weights for policy 0, policy_version 22850 (0.0007) [2023-03-07 03:52:29,734][118044] Updated weights for policy 0, policy_version 22860 (0.0006) [2023-03-07 03:52:30,496][118044] Updated weights for policy 0, policy_version 22870 (0.0006) [2023-03-07 03:52:31,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 23426048. Throughput: 0: 13161.1. Samples: 23405481. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:52:31,086][117718] Avg episode reward: [(0, '2541.525')] [2023-03-07 03:52:31,284][118044] Updated weights for policy 0, policy_version 22880 (0.0007) [2023-03-07 03:52:32,061][118044] Updated weights for policy 0, policy_version 22890 (0.0007) [2023-03-07 03:52:32,845][118044] Updated weights for policy 0, policy_version 22900 (0.0007) [2023-03-07 03:52:33,629][118044] Updated weights for policy 0, policy_version 22910 (0.0006) [2023-03-07 03:52:34,422][118044] Updated weights for policy 0, policy_version 22920 (0.0006) [2023-03-07 03:52:35,198][118044] Updated weights for policy 0, policy_version 22930 (0.0006) [2023-03-07 03:52:35,983][118044] Updated weights for policy 0, policy_version 22940 (0.0006) [2023-03-07 03:52:36,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 23491584. Throughput: 0: 13154.9. Samples: 23484247. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:52:36,086][117718] Avg episode reward: [(0, '2523.720')] [2023-03-07 03:52:36,091][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000022941_23491584.pth... [2023-03-07 03:52:36,121][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000019860_20336640.pth [2023-03-07 03:52:36,767][118044] Updated weights for policy 0, policy_version 22950 (0.0006) [2023-03-07 03:52:37,524][118044] Updated weights for policy 0, policy_version 22960 (0.0006) [2023-03-07 03:52:38,316][118044] Updated weights for policy 0, policy_version 22970 (0.0006) [2023-03-07 03:52:39,104][118044] Updated weights for policy 0, policy_version 22980 (0.0006) [2023-03-07 03:52:39,881][118044] Updated weights for policy 0, policy_version 22990 (0.0006) [2023-03-07 03:52:40,648][118044] Updated weights for policy 0, policy_version 23000 (0.0006) [2023-03-07 03:52:41,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 23557120. Throughput: 0: 13150.5. Samples: 23523631. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 03:52:41,086][117718] Avg episode reward: [(0, '2586.280')] [2023-03-07 03:52:41,427][118044] Updated weights for policy 0, policy_version 23010 (0.0006) [2023-03-07 03:52:42,220][118044] Updated weights for policy 0, policy_version 23020 (0.0007) [2023-03-07 03:52:42,993][118044] Updated weights for policy 0, policy_version 23030 (0.0006) [2023-03-07 03:52:43,777][118044] Updated weights for policy 0, policy_version 23040 (0.0006) [2023-03-07 03:52:44,549][118044] Updated weights for policy 0, policy_version 23050 (0.0006) [2023-03-07 03:52:45,314][118044] Updated weights for policy 0, policy_version 23060 (0.0006) [2023-03-07 03:52:46,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 23622656. Throughput: 0: 13150.6. 
Samples: 23602413. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 03:52:46,086][117718] Avg episode reward: [(0, '2596.187')] [2023-03-07 03:52:46,089][118044] Updated weights for policy 0, policy_version 23070 (0.0006) [2023-03-07 03:52:46,871][118044] Updated weights for policy 0, policy_version 23080 (0.0006) [2023-03-07 03:52:47,656][118044] Updated weights for policy 0, policy_version 23090 (0.0006) [2023-03-07 03:52:48,433][118044] Updated weights for policy 0, policy_version 23100 (0.0006) [2023-03-07 03:52:49,213][118044] Updated weights for policy 0, policy_version 23110 (0.0006) [2023-03-07 03:52:49,990][118044] Updated weights for policy 0, policy_version 23120 (0.0006) [2023-03-07 03:52:50,766][118044] Updated weights for policy 0, policy_version 23130 (0.0006) [2023-03-07 03:52:51,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 23689216. Throughput: 0: 13161.7. Samples: 23681409. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 03:52:51,086][117718] Avg episode reward: [(0, '2498.569')] [2023-03-07 03:52:51,533][118044] Updated weights for policy 0, policy_version 23140 (0.0006) [2023-03-07 03:52:52,306][118044] Updated weights for policy 0, policy_version 23150 (0.0006) [2023-03-07 03:52:53,114][118044] Updated weights for policy 0, policy_version 23160 (0.0005) [2023-03-07 03:52:53,889][118044] Updated weights for policy 0, policy_version 23170 (0.0006) [2023-03-07 03:52:54,660][118044] Updated weights for policy 0, policy_version 23180 (0.0006) [2023-03-07 03:52:55,435][118044] Updated weights for policy 0, policy_version 23190 (0.0007) [2023-03-07 03:52:56,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 23754752. Throughput: 0: 13158.5. Samples: 23720865. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:52:56,086][117718] Avg episode reward: [(0, '2615.921')] [2023-03-07 03:52:56,210][118044] Updated weights for policy 0, policy_version 23200 (0.0006) [2023-03-07 03:52:56,985][118044] Updated weights for policy 0, policy_version 23210 (0.0006) [2023-03-07 03:52:57,757][118044] Updated weights for policy 0, policy_version 23220 (0.0006) [2023-03-07 03:52:58,529][118044] Updated weights for policy 0, policy_version 23230 (0.0006) [2023-03-07 03:52:59,317][118044] Updated weights for policy 0, policy_version 23240 (0.0006) [2023-03-07 03:53:00,102][118044] Updated weights for policy 0, policy_version 23250 (0.0005) [2023-03-07 03:53:00,868][118044] Updated weights for policy 0, policy_version 23260 (0.0006) [2023-03-07 03:53:01,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 23820288. Throughput: 0: 13155.8. Samples: 23799968. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:53:01,086][117718] Avg episode reward: [(0, '2745.087')] [2023-03-07 03:53:01,642][118044] Updated weights for policy 0, policy_version 23270 (0.0006) [2023-03-07 03:53:02,430][118044] Updated weights for policy 0, policy_version 23280 (0.0007) [2023-03-07 03:53:03,199][118044] Updated weights for policy 0, policy_version 23290 (0.0006) [2023-03-07 03:53:04,005][118044] Updated weights for policy 0, policy_version 23300 (0.0006) [2023-03-07 03:53:04,773][118044] Updated weights for policy 0, policy_version 23310 (0.0006) [2023-03-07 03:53:05,562][118044] Updated weights for policy 0, policy_version 23320 (0.0006) [2023-03-07 03:53:06,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 23885824. Throughput: 0: 13152.8. Samples: 23878771. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:53:06,086][117718] Avg episode reward: [(0, '2672.727')] [2023-03-07 03:53:06,357][118044] Updated weights for policy 0, policy_version 23330 (0.0006) [2023-03-07 03:53:07,138][118044] Updated weights for policy 0, policy_version 23340 (0.0006) [2023-03-07 03:53:07,924][118044] Updated weights for policy 0, policy_version 23350 (0.0006) [2023-03-07 03:53:08,690][118044] Updated weights for policy 0, policy_version 23360 (0.0007) [2023-03-07 03:53:09,461][118044] Updated weights for policy 0, policy_version 23370 (0.0006) [2023-03-07 03:53:10,239][118044] Updated weights for policy 0, policy_version 23380 (0.0006) [2023-03-07 03:53:11,020][118044] Updated weights for policy 0, policy_version 23390 (0.0006) [2023-03-07 03:53:11,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 23951360. Throughput: 0: 13144.1. Samples: 23918013. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:53:11,097][117718] Avg episode reward: [(0, '2635.322')] [2023-03-07 03:53:11,817][118044] Updated weights for policy 0, policy_version 23400 (0.0007) [2023-03-07 03:53:12,577][118044] Updated weights for policy 0, policy_version 23410 (0.0006) [2023-03-07 03:53:13,369][118044] Updated weights for policy 0, policy_version 23420 (0.0006) [2023-03-07 03:53:14,146][118044] Updated weights for policy 0, policy_version 23430 (0.0006) [2023-03-07 03:53:14,927][118044] Updated weights for policy 0, policy_version 23440 (0.0006) [2023-03-07 03:53:15,712][118044] Updated weights for policy 0, policy_version 23450 (0.0006) [2023-03-07 03:53:16,086][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 24017920. Throughput: 0: 13140.5. Samples: 23996805. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:53:16,097][117718] Avg episode reward: [(0, '2609.256')] [2023-03-07 03:53:16,482][118044] Updated weights for policy 0, policy_version 23460 (0.0006) [2023-03-07 03:53:17,261][118044] Updated weights for policy 0, policy_version 23470 (0.0006) [2023-03-07 03:53:18,054][118044] Updated weights for policy 0, policy_version 23480 (0.0006) [2023-03-07 03:53:18,830][118044] Updated weights for policy 0, policy_version 23490 (0.0006) [2023-03-07 03:53:19,614][118044] Updated weights for policy 0, policy_version 23500 (0.0005) [2023-03-07 03:53:20,387][118044] Updated weights for policy 0, policy_version 23510 (0.0006) [2023-03-07 03:53:21,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 24082432. Throughput: 0: 13138.0. Samples: 24075458. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:53:21,096][117718] Avg episode reward: [(0, '2650.840')] [2023-03-07 03:53:21,154][118044] Updated weights for policy 0, policy_version 23520 (0.0006) [2023-03-07 03:53:21,949][118044] Updated weights for policy 0, policy_version 23530 (0.0006) [2023-03-07 03:53:22,740][118044] Updated weights for policy 0, policy_version 23540 (0.0007) [2023-03-07 03:53:23,522][118044] Updated weights for policy 0, policy_version 23550 (0.0005) [2023-03-07 03:53:24,306][118044] Updated weights for policy 0, policy_version 23560 (0.0006) [2023-03-07 03:53:25,080][118044] Updated weights for policy 0, policy_version 23570 (0.0007) [2023-03-07 03:53:25,846][118044] Updated weights for policy 0, policy_version 23580 (0.0006) [2023-03-07 03:53:26,085][117718] Fps is (10 sec: 13004.8, 60 sec: 13124.3, 300 sec: 13145.4). Total num frames: 24147968. Throughput: 0: 13132.5. Samples: 24114594. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:53:26,097][117718] Avg episode reward: [(0, '2624.772')] [2023-03-07 03:53:26,632][118044] Updated weights for policy 0, policy_version 23590 (0.0006) [2023-03-07 03:53:27,394][118044] Updated weights for policy 0, policy_version 23600 (0.0006) [2023-03-07 03:53:28,186][118044] Updated weights for policy 0, policy_version 23610 (0.0006) [2023-03-07 03:53:28,943][118044] Updated weights for policy 0, policy_version 23620 (0.0006) [2023-03-07 03:53:29,720][118044] Updated weights for policy 0, policy_version 23630 (0.0006) [2023-03-07 03:53:30,494][118044] Updated weights for policy 0, policy_version 23640 (0.0006) [2023-03-07 03:53:31,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 24214528. Throughput: 0: 13147.2. Samples: 24194035. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:53:31,097][117718] Avg episode reward: [(0, '2667.583')] [2023-03-07 03:53:31,272][118044] Updated weights for policy 0, policy_version 23650 (0.0006) [2023-03-07 03:53:32,057][118044] Updated weights for policy 0, policy_version 23660 (0.0006) [2023-03-07 03:53:32,839][118044] Updated weights for policy 0, policy_version 23670 (0.0007) [2023-03-07 03:53:33,605][118044] Updated weights for policy 0, policy_version 23680 (0.0006) [2023-03-07 03:53:34,382][118044] Updated weights for policy 0, policy_version 23690 (0.0006) [2023-03-07 03:53:35,149][118044] Updated weights for policy 0, policy_version 23700 (0.0006) [2023-03-07 03:53:35,926][118044] Updated weights for policy 0, policy_version 23710 (0.0006) [2023-03-07 03:53:36,085][117718] Fps is (10 sec: 13312.1, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 24281088. Throughput: 0: 13151.0. Samples: 24273202. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:53:36,096][117718] Avg episode reward: [(0, '2601.958')] [2023-03-07 03:53:36,700][118044] Updated weights for policy 0, policy_version 23720 (0.0006) [2023-03-07 03:53:37,485][118044] Updated weights for policy 0, policy_version 23730 (0.0006) [2023-03-07 03:53:38,255][118044] Updated weights for policy 0, policy_version 23740 (0.0007) [2023-03-07 03:53:39,028][118044] Updated weights for policy 0, policy_version 23750 (0.0006) [2023-03-07 03:53:39,819][118044] Updated weights for policy 0, policy_version 23760 (0.0006) [2023-03-07 03:53:40,593][118044] Updated weights for policy 0, policy_version 23770 (0.0006) [2023-03-07 03:53:41,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 24346624. Throughput: 0: 13156.3. Samples: 24312895. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:53:41,086][117718] Avg episode reward: [(0, '2639.000')] [2023-03-07 03:53:41,381][118044] Updated weights for policy 0, policy_version 23780 (0.0006) [2023-03-07 03:53:42,159][118044] Updated weights for policy 0, policy_version 23790 (0.0006) [2023-03-07 03:53:42,936][118044] Updated weights for policy 0, policy_version 23800 (0.0006) [2023-03-07 03:53:43,708][118044] Updated weights for policy 0, policy_version 23810 (0.0006) [2023-03-07 03:53:44,487][118044] Updated weights for policy 0, policy_version 23820 (0.0006) [2023-03-07 03:53:45,283][118044] Updated weights for policy 0, policy_version 23830 (0.0006) [2023-03-07 03:53:46,040][118044] Updated weights for policy 0, policy_version 23840 (0.0006) [2023-03-07 03:53:46,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 24412160. Throughput: 0: 13147.0. Samples: 24391584. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:53:46,086][117718] Avg episode reward: [(0, '2615.856')] [2023-03-07 03:53:46,821][118044] Updated weights for policy 0, policy_version 23850 (0.0006) [2023-03-07 03:53:47,604][118044] Updated weights for policy 0, policy_version 23860 (0.0006) [2023-03-07 03:53:48,377][118044] Updated weights for policy 0, policy_version 23870 (0.0006) [2023-03-07 03:53:49,141][118044] Updated weights for policy 0, policy_version 23880 (0.0006) [2023-03-07 03:53:49,922][118044] Updated weights for policy 0, policy_version 23890 (0.0006) [2023-03-07 03:53:50,694][118044] Updated weights for policy 0, policy_version 23900 (0.0005) [2023-03-07 03:53:51,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 24478720. Throughput: 0: 13157.7. Samples: 24470868. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:53:51,086][117718] Avg episode reward: [(0, '2665.866')] [2023-03-07 03:53:51,469][118044] Updated weights for policy 0, policy_version 23910 (0.0006) [2023-03-07 03:53:52,257][118044] Updated weights for policy 0, policy_version 23920 (0.0006) [2023-03-07 03:53:53,023][118044] Updated weights for policy 0, policy_version 23930 (0.0007) [2023-03-07 03:53:53,796][118044] Updated weights for policy 0, policy_version 23940 (0.0005) [2023-03-07 03:53:54,570][118044] Updated weights for policy 0, policy_version 23950 (0.0006) [2023-03-07 03:53:55,341][118044] Updated weights for policy 0, policy_version 23960 (0.0006) [2023-03-07 03:53:56,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 24544256. Throughput: 0: 13167.6. Samples: 24510556. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:53:56,086][117718] Avg episode reward: [(0, '2672.127')] [2023-03-07 03:53:56,123][118044] Updated weights for policy 0, policy_version 23970 (0.0006) [2023-03-07 03:53:56,907][118044] Updated weights for policy 0, policy_version 23980 (0.0006) [2023-03-07 03:53:57,704][118044] Updated weights for policy 0, policy_version 23990 (0.0006) [2023-03-07 03:53:58,478][118044] Updated weights for policy 0, policy_version 24000 (0.0006) [2023-03-07 03:53:59,249][118044] Updated weights for policy 0, policy_version 24010 (0.0006) [2023-03-07 03:54:00,046][118044] Updated weights for policy 0, policy_version 24020 (0.0006) [2023-03-07 03:54:00,815][118044] Updated weights for policy 0, policy_version 24030 (0.0006) [2023-03-07 03:54:01,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 24609792. Throughput: 0: 13165.5. Samples: 24589251. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:54:01,086][117718] Avg episode reward: [(0, '2686.726')] [2023-03-07 03:54:01,592][118044] Updated weights for policy 0, policy_version 24040 (0.0006) [2023-03-07 03:54:02,369][118044] Updated weights for policy 0, policy_version 24050 (0.0006) [2023-03-07 03:54:03,156][118044] Updated weights for policy 0, policy_version 24060 (0.0006) [2023-03-07 03:54:03,926][118044] Updated weights for policy 0, policy_version 24070 (0.0006) [2023-03-07 03:54:04,699][118044] Updated weights for policy 0, policy_version 24080 (0.0006) [2023-03-07 03:54:05,496][118044] Updated weights for policy 0, policy_version 24090 (0.0006) [2023-03-07 03:54:06,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 24675328. Throughput: 0: 13168.1. Samples: 24668021. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:54:06,086][117718] Avg episode reward: [(0, '2690.532')] [2023-03-07 03:54:06,278][118044] Updated weights for policy 0, policy_version 24100 (0.0006) [2023-03-07 03:54:07,059][118044] Updated weights for policy 0, policy_version 24110 (0.0006) [2023-03-07 03:54:07,846][118044] Updated weights for policy 0, policy_version 24120 (0.0005) [2023-03-07 03:54:08,630][118044] Updated weights for policy 0, policy_version 24130 (0.0006) [2023-03-07 03:54:09,413][118044] Updated weights for policy 0, policy_version 24140 (0.0006) [2023-03-07 03:54:10,196][118044] Updated weights for policy 0, policy_version 24150 (0.0007) [2023-03-07 03:54:10,971][118044] Updated weights for policy 0, policy_version 24160 (0.0006) [2023-03-07 03:54:11,085][117718] Fps is (10 sec: 13107.4, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 24740864. Throughput: 0: 13167.3. Samples: 24707122. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:54:11,086][117718] Avg episode reward: [(0, '2719.064')] [2023-03-07 03:54:11,747][118044] Updated weights for policy 0, policy_version 24170 (0.0006) [2023-03-07 03:54:12,554][118044] Updated weights for policy 0, policy_version 24180 (0.0006) [2023-03-07 03:54:13,332][118044] Updated weights for policy 0, policy_version 24190 (0.0006) [2023-03-07 03:54:14,113][118044] Updated weights for policy 0, policy_version 24200 (0.0006) [2023-03-07 03:54:14,910][118044] Updated weights for policy 0, policy_version 24210 (0.0007) [2023-03-07 03:54:15,685][118044] Updated weights for policy 0, policy_version 24220 (0.0007) [2023-03-07 03:54:16,085][117718] Fps is (10 sec: 13004.9, 60 sec: 13124.3, 300 sec: 13141.9). Total num frames: 24805376. Throughput: 0: 13147.9. Samples: 24785689. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:54:16,086][117718] Avg episode reward: [(0, '2890.416')] [2023-03-07 03:54:16,457][118044] Updated weights for policy 0, policy_version 24230 (0.0006) [2023-03-07 03:54:17,234][118044] Updated weights for policy 0, policy_version 24240 (0.0005) [2023-03-07 03:54:18,019][118044] Updated weights for policy 0, policy_version 24250 (0.0006) [2023-03-07 03:54:18,806][118044] Updated weights for policy 0, policy_version 24260 (0.0006) [2023-03-07 03:54:19,593][118044] Updated weights for policy 0, policy_version 24270 (0.0006) [2023-03-07 03:54:20,357][118044] Updated weights for policy 0, policy_version 24280 (0.0006) [2023-03-07 03:54:21,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 24871936. Throughput: 0: 13136.1. Samples: 24864325. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:54:21,086][117718] Avg episode reward: [(0, '2858.311')] [2023-03-07 03:54:21,166][118044] Updated weights for policy 0, policy_version 24290 (0.0006) [2023-03-07 03:54:21,934][118044] Updated weights for policy 0, policy_version 24300 (0.0006) [2023-03-07 03:54:22,710][118044] Updated weights for policy 0, policy_version 24310 (0.0006) [2023-03-07 03:54:23,492][118044] Updated weights for policy 0, policy_version 24320 (0.0006) [2023-03-07 03:54:24,267][118044] Updated weights for policy 0, policy_version 24330 (0.0006) [2023-03-07 03:54:25,038][118044] Updated weights for policy 0, policy_version 24340 (0.0006) [2023-03-07 03:54:25,815][118044] Updated weights for policy 0, policy_version 24350 (0.0006) [2023-03-07 03:54:26,085][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 24937472. Throughput: 0: 13128.4. Samples: 24903675. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:54:26,086][117718] Avg episode reward: [(0, '2859.569')] [2023-03-07 03:54:26,593][118044] Updated weights for policy 0, policy_version 24360 (0.0006) [2023-03-07 03:54:27,372][118044] Updated weights for policy 0, policy_version 24370 (0.0006) [2023-03-07 03:54:28,137][118044] Updated weights for policy 0, policy_version 24380 (0.0006) [2023-03-07 03:54:28,925][118044] Updated weights for policy 0, policy_version 24390 (0.0007) [2023-03-07 03:54:29,711][118044] Updated weights for policy 0, policy_version 24400 (0.0006) [2023-03-07 03:54:30,475][118044] Updated weights for policy 0, policy_version 24410 (0.0006) [2023-03-07 03:54:31,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 25003008. Throughput: 0: 13136.6. Samples: 24982733. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:54:31,086][117718] Avg episode reward: [(0, '2929.258')] [2023-03-07 03:54:31,245][118044] Updated weights for policy 0, policy_version 24420 (0.0007) [2023-03-07 03:54:32,034][118044] Updated weights for policy 0, policy_version 24430 (0.0006) [2023-03-07 03:54:32,816][118044] Updated weights for policy 0, policy_version 24440 (0.0006) [2023-03-07 03:54:33,610][118044] Updated weights for policy 0, policy_version 24450 (0.0006) [2023-03-07 03:54:34,395][118044] Updated weights for policy 0, policy_version 24460 (0.0008) [2023-03-07 03:54:35,191][118044] Updated weights for policy 0, policy_version 24470 (0.0006) [2023-03-07 03:54:35,954][118044] Updated weights for policy 0, policy_version 24480 (0.0006) [2023-03-07 03:54:36,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13145.4). Total num frames: 25068544. Throughput: 0: 13122.5. Samples: 25061383. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:54:36,086][117718] Avg episode reward: [(0, '2872.556')] [2023-03-07 03:54:36,090][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000024481_25068544.pth... [2023-03-07 03:54:36,119][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000021401_21914624.pth [2023-03-07 03:54:36,745][118044] Updated weights for policy 0, policy_version 24490 (0.0006) [2023-03-07 03:54:37,518][118044] Updated weights for policy 0, policy_version 24500 (0.0006) [2023-03-07 03:54:38,290][118044] Updated weights for policy 0, policy_version 24510 (0.0006) [2023-03-07 03:54:39,075][118044] Updated weights for policy 0, policy_version 24520 (0.0006) [2023-03-07 03:54:39,842][118044] Updated weights for policy 0, policy_version 24530 (0.0005) [2023-03-07 03:54:40,614][118044] Updated weights for policy 0, policy_version 24540 (0.0006) [2023-03-07 03:54:41,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 25135104. Throughput: 0: 13119.4. Samples: 25100931. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:54:41,086][117718] Avg episode reward: [(0, '2774.245')] [2023-03-07 03:54:41,393][118044] Updated weights for policy 0, policy_version 24550 (0.0006) [2023-03-07 03:54:42,174][118044] Updated weights for policy 0, policy_version 24560 (0.0006) [2023-03-07 03:54:42,954][118044] Updated weights for policy 0, policy_version 24570 (0.0007) [2023-03-07 03:54:43,733][118044] Updated weights for policy 0, policy_version 24580 (0.0006) [2023-03-07 03:54:44,494][118044] Updated weights for policy 0, policy_version 24590 (0.0006) [2023-03-07 03:54:45,276][118044] Updated weights for policy 0, policy_version 24600 (0.0006) [2023-03-07 03:54:46,066][118044] Updated weights for policy 0, policy_version 24610 (0.0006) [2023-03-07 03:54:46,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 25200640. Throughput: 0: 13123.9. Samples: 25179824. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:54:46,086][117718] Avg episode reward: [(0, '2788.293')] [2023-03-07 03:54:46,847][118044] Updated weights for policy 0, policy_version 24620 (0.0007) [2023-03-07 03:54:47,629][118044] Updated weights for policy 0, policy_version 24630 (0.0006) [2023-03-07 03:54:48,421][118044] Updated weights for policy 0, policy_version 24640 (0.0007) [2023-03-07 03:54:49,192][118044] Updated weights for policy 0, policy_version 24650 (0.0005) [2023-03-07 03:54:49,969][118044] Updated weights for policy 0, policy_version 24660 (0.0005) [2023-03-07 03:54:50,757][118044] Updated weights for policy 0, policy_version 24670 (0.0006) [2023-03-07 03:54:51,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13145.4). Total num frames: 25266176. Throughput: 0: 13122.8. Samples: 25258549. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:54:51,086][117718] Avg episode reward: [(0, '2700.185')] [2023-03-07 03:54:51,540][118044] Updated weights for policy 0, policy_version 24680 (0.0006) [2023-03-07 03:54:52,321][118044] Updated weights for policy 0, policy_version 24690 (0.0005) [2023-03-07 03:54:53,094][118044] Updated weights for policy 0, policy_version 24700 (0.0006) [2023-03-07 03:54:53,901][118044] Updated weights for policy 0, policy_version 24710 (0.0006) [2023-03-07 03:54:54,680][118044] Updated weights for policy 0, policy_version 24720 (0.0007) [2023-03-07 03:54:55,449][118044] Updated weights for policy 0, policy_version 24730 (0.0006) [2023-03-07 03:54:56,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13124.3, 300 sec: 13145.4). Total num frames: 25331712. Throughput: 0: 13127.7. Samples: 25297871. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:54:56,086][117718] Avg episode reward: [(0, '2707.609')] [2023-03-07 03:54:56,221][118044] Updated weights for policy 0, policy_version 24740 (0.0006) [2023-03-07 03:54:56,998][118044] Updated weights for policy 0, policy_version 24750 (0.0006) [2023-03-07 03:54:57,769][118044] Updated weights for policy 0, policy_version 24760 (0.0006) [2023-03-07 03:54:58,550][118044] Updated weights for policy 0, policy_version 24770 (0.0006) [2023-03-07 03:54:59,322][118044] Updated weights for policy 0, policy_version 24780 (0.0006) [2023-03-07 03:55:00,125][118044] Updated weights for policy 0, policy_version 24790 (0.0006) [2023-03-07 03:55:00,897][118044] Updated weights for policy 0, policy_version 24800 (0.0006) [2023-03-07 03:55:01,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13145.4). Total num frames: 25397248. Throughput: 0: 13133.6. Samples: 25376702. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:55:01,086][117718] Avg episode reward: [(0, '2885.253')] [2023-03-07 03:55:01,697][118044] Updated weights for policy 0, policy_version 24810 (0.0006) [2023-03-07 03:55:02,476][118044] Updated weights for policy 0, policy_version 24820 (0.0006) [2023-03-07 03:55:03,244][118044] Updated weights for policy 0, policy_version 24830 (0.0006) [2023-03-07 03:55:04,029][118044] Updated weights for policy 0, policy_version 24840 (0.0006) [2023-03-07 03:55:04,794][118044] Updated weights for policy 0, policy_version 24850 (0.0006) [2023-03-07 03:55:05,568][118044] Updated weights for policy 0, policy_version 24860 (0.0006) [2023-03-07 03:55:06,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13124.2, 300 sec: 13141.9). Total num frames: 25462784. Throughput: 0: 13139.9. Samples: 25455624. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:55:06,086][117718] Avg episode reward: [(0, '2847.140')] [2023-03-07 03:55:06,349][118044] Updated weights for policy 0, policy_version 24870 (0.0006) [2023-03-07 03:55:07,124][118044] Updated weights for policy 0, policy_version 24880 (0.0006) [2023-03-07 03:55:07,893][118044] Updated weights for policy 0, policy_version 24890 (0.0006) [2023-03-07 03:55:08,676][118044] Updated weights for policy 0, policy_version 24900 (0.0006) [2023-03-07 03:55:09,449][118044] Updated weights for policy 0, policy_version 24910 (0.0006) [2023-03-07 03:55:10,234][118044] Updated weights for policy 0, policy_version 24920 (0.0005) [2023-03-07 03:55:10,994][118044] Updated weights for policy 0, policy_version 24930 (0.0007) [2023-03-07 03:55:11,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 25529344. Throughput: 0: 13144.4. Samples: 25495174. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:55:11,086][117718] Avg episode reward: [(0, '2880.520')] [2023-03-07 03:55:11,772][118044] Updated weights for policy 0, policy_version 24940 (0.0006) [2023-03-07 03:55:12,557][118044] Updated weights for policy 0, policy_version 24950 (0.0006) [2023-03-07 03:55:13,350][118044] Updated weights for policy 0, policy_version 24960 (0.0007) [2023-03-07 03:55:14,117][118044] Updated weights for policy 0, policy_version 24970 (0.0006) [2023-03-07 03:55:14,903][118044] Updated weights for policy 0, policy_version 24980 (0.0007) [2023-03-07 03:55:15,669][118044] Updated weights for policy 0, policy_version 24990 (0.0006) [2023-03-07 03:55:16,085][117718] Fps is (10 sec: 13209.8, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 25594880. Throughput: 0: 13144.2. Samples: 25574221. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:55:16,086][117718] Avg episode reward: [(0, '2720.012')] [2023-03-07 03:55:16,453][118044] Updated weights for policy 0, policy_version 25000 (0.0006) [2023-03-07 03:55:17,231][118044] Updated weights for policy 0, policy_version 25010 (0.0006) [2023-03-07 03:55:18,009][118044] Updated weights for policy 0, policy_version 25020 (0.0006) [2023-03-07 03:55:18,785][118044] Updated weights for policy 0, policy_version 25030 (0.0006) [2023-03-07 03:55:19,568][118044] Updated weights for policy 0, policy_version 25040 (0.0007) [2023-03-07 03:55:20,363][118044] Updated weights for policy 0, policy_version 25050 (0.0006) [2023-03-07 03:55:21,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 25660416. Throughput: 0: 13145.7. Samples: 25652940. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:55:21,086][117718] Avg episode reward: [(0, '2724.114')] [2023-03-07 03:55:21,128][118044] Updated weights for policy 0, policy_version 25060 (0.0006) [2023-03-07 03:55:21,927][118044] Updated weights for policy 0, policy_version 25070 (0.0006) [2023-03-07 03:55:22,703][118044] Updated weights for policy 0, policy_version 25080 (0.0006) [2023-03-07 03:55:23,472][118044] Updated weights for policy 0, policy_version 25090 (0.0006) [2023-03-07 03:55:24,275][118044] Updated weights for policy 0, policy_version 25100 (0.0006) [2023-03-07 03:55:25,045][118044] Updated weights for policy 0, policy_version 25110 (0.0005) [2023-03-07 03:55:25,814][118044] Updated weights for policy 0, policy_version 25120 (0.0006) [2023-03-07 03:55:26,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 25725952. Throughput: 0: 13142.3. Samples: 25692335. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:55:26,086][117718] Avg episode reward: [(0, '2753.338')] [2023-03-07 03:55:26,603][118044] Updated weights for policy 0, policy_version 25130 (0.0007) [2023-03-07 03:55:27,402][118044] Updated weights for policy 0, policy_version 25140 (0.0006) [2023-03-07 03:55:28,183][118044] Updated weights for policy 0, policy_version 25150 (0.0006) [2023-03-07 03:55:28,966][118044] Updated weights for policy 0, policy_version 25160 (0.0006) [2023-03-07 03:55:29,757][118044] Updated weights for policy 0, policy_version 25170 (0.0006) [2023-03-07 03:55:30,537][118044] Updated weights for policy 0, policy_version 25180 (0.0006) [2023-03-07 03:55:31,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 25791488. Throughput: 0: 13131.5. Samples: 25770740. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:55:31,086][117718] Avg episode reward: [(0, '2788.874')] [2023-03-07 03:55:31,313][118044] Updated weights for policy 0, policy_version 25190 (0.0006) [2023-03-07 03:55:32,094][118044] Updated weights for policy 0, policy_version 25200 (0.0006) [2023-03-07 03:55:32,876][118044] Updated weights for policy 0, policy_version 25210 (0.0006) [2023-03-07 03:55:33,655][118044] Updated weights for policy 0, policy_version 25220 (0.0006) [2023-03-07 03:55:34,441][118044] Updated weights for policy 0, policy_version 25230 (0.0007) [2023-03-07 03:55:35,229][118044] Updated weights for policy 0, policy_version 25240 (0.0006) [2023-03-07 03:55:36,002][118044] Updated weights for policy 0, policy_version 25250 (0.0006) [2023-03-07 03:55:36,086][117718] Fps is (10 sec: 13004.5, 60 sec: 13124.2, 300 sec: 13138.4). Total num frames: 25856000. Throughput: 0: 13125.4. Samples: 25849195. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:55:36,086][117718] Avg episode reward: [(0, '2781.482')] [2023-03-07 03:55:36,789][118044] Updated weights for policy 0, policy_version 25260 (0.0006) [2023-03-07 03:55:37,573][118044] Updated weights for policy 0, policy_version 25270 (0.0006) [2023-03-07 03:55:38,353][118044] Updated weights for policy 0, policy_version 25280 (0.0006) [2023-03-07 03:55:39,134][118044] Updated weights for policy 0, policy_version 25290 (0.0006) [2023-03-07 03:55:39,905][118044] Updated weights for policy 0, policy_version 25300 (0.0005) [2023-03-07 03:55:40,681][118044] Updated weights for policy 0, policy_version 25310 (0.0007) [2023-03-07 03:55:41,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13141.9). Total num frames: 25922560. Throughput: 0: 13121.9. Samples: 25888356. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:55:41,086][117718] Avg episode reward: [(0, '2727.685')] [2023-03-07 03:55:41,470][118044] Updated weights for policy 0, policy_version 25320 (0.0006) [2023-03-07 03:55:42,254][118044] Updated weights for policy 0, policy_version 25330 (0.0006) [2023-03-07 03:55:43,029][118044] Updated weights for policy 0, policy_version 25340 (0.0006) [2023-03-07 03:55:43,799][118044] Updated weights for policy 0, policy_version 25350 (0.0007) [2023-03-07 03:55:44,590][118044] Updated weights for policy 0, policy_version 25360 (0.0006) [2023-03-07 03:55:45,374][118044] Updated weights for policy 0, policy_version 25370 (0.0006) [2023-03-07 03:55:46,086][117718] Fps is (10 sec: 13209.8, 60 sec: 13124.3, 300 sec: 13141.9). Total num frames: 25988096. Throughput: 0: 13124.3. Samples: 25967297. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:55:46,086][117718] Avg episode reward: [(0, '2719.320')] [2023-03-07 03:55:46,133][118044] Updated weights for policy 0, policy_version 25380 (0.0006) [2023-03-07 03:55:46,922][118044] Updated weights for policy 0, policy_version 25390 (0.0006) [2023-03-07 03:55:47,708][118044] Updated weights for policy 0, policy_version 25400 (0.0006) [2023-03-07 03:55:48,495][118044] Updated weights for policy 0, policy_version 25410 (0.0006) [2023-03-07 03:55:49,255][118044] Updated weights for policy 0, policy_version 25420 (0.0006) [2023-03-07 03:55:50,046][118044] Updated weights for policy 0, policy_version 25430 (0.0006) [2023-03-07 03:55:50,834][118044] Updated weights for policy 0, policy_version 25440 (0.0006) [2023-03-07 03:55:51,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13138.4). Total num frames: 26053632. Throughput: 0: 13120.7. Samples: 26046052. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:55:51,086][117718] Avg episode reward: [(0, '2767.936')] [2023-03-07 03:55:51,608][118044] Updated weights for policy 0, policy_version 25450 (0.0005) [2023-03-07 03:55:52,393][118044] Updated weights for policy 0, policy_version 25460 (0.0006) [2023-03-07 03:55:53,168][118044] Updated weights for policy 0, policy_version 25470 (0.0007) [2023-03-07 03:55:53,946][118044] Updated weights for policy 0, policy_version 25480 (0.0006) [2023-03-07 03:55:54,718][118044] Updated weights for policy 0, policy_version 25490 (0.0006) [2023-03-07 03:55:55,483][118044] Updated weights for policy 0, policy_version 25500 (0.0006) [2023-03-07 03:55:56,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13141.9). Total num frames: 26119168. Throughput: 0: 13119.7. Samples: 26085563. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:55:56,086][117718] Avg episode reward: [(0, '2675.608')] [2023-03-07 03:55:56,271][118044] Updated weights for policy 0, policy_version 25510 (0.0006) [2023-03-07 03:55:57,049][118044] Updated weights for policy 0, policy_version 25520 (0.0006) [2023-03-07 03:55:57,822][118044] Updated weights for policy 0, policy_version 25530 (0.0007) [2023-03-07 03:55:58,599][118044] Updated weights for policy 0, policy_version 25540 (0.0006) [2023-03-07 03:55:59,381][118044] Updated weights for policy 0, policy_version 25550 (0.0006) [2023-03-07 03:56:00,154][118044] Updated weights for policy 0, policy_version 25560 (0.0006) [2023-03-07 03:56:00,947][118044] Updated weights for policy 0, policy_version 25570 (0.0006) [2023-03-07 03:56:01,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13124.3, 300 sec: 13138.4). Total num frames: 26184704. Throughput: 0: 13115.2. Samples: 26164408. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:56:01,086][117718] Avg episode reward: [(0, '2823.303')] [2023-03-07 03:56:01,730][118044] Updated weights for policy 0, policy_version 25580 (0.0007) [2023-03-07 03:56:02,510][118044] Updated weights for policy 0, policy_version 25590 (0.0007) [2023-03-07 03:56:03,281][118044] Updated weights for policy 0, policy_version 25600 (0.0006) [2023-03-07 03:56:04,082][118044] Updated weights for policy 0, policy_version 25610 (0.0006) [2023-03-07 03:56:04,850][118044] Updated weights for policy 0, policy_version 25620 (0.0006) [2023-03-07 03:56:05,622][118044] Updated weights for policy 0, policy_version 25630 (0.0006) [2023-03-07 03:56:06,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13124.3, 300 sec: 13138.4). Total num frames: 26250240. Throughput: 0: 13121.0. Samples: 26243386. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:56:06,086][117718] Avg episode reward: [(0, '2750.005')] [2023-03-07 03:56:06,408][118044] Updated weights for policy 0, policy_version 25640 (0.0006) [2023-03-07 03:56:07,176][118044] Updated weights for policy 0, policy_version 25650 (0.0006) [2023-03-07 03:56:07,958][118044] Updated weights for policy 0, policy_version 25660 (0.0006) [2023-03-07 03:56:08,732][118044] Updated weights for policy 0, policy_version 25670 (0.0006) [2023-03-07 03:56:09,509][118044] Updated weights for policy 0, policy_version 25680 (0.0007) [2023-03-07 03:56:10,271][118044] Updated weights for policy 0, policy_version 25690 (0.0007) [2023-03-07 03:56:11,062][118044] Updated weights for policy 0, policy_version 25700 (0.0006) [2023-03-07 03:56:11,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13124.3, 300 sec: 13141.9). Total num frames: 26316800. Throughput: 0: 13123.4. Samples: 26282889. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:56:11,086][117718] Avg episode reward: [(0, '2786.498')] [2023-03-07 03:56:11,831][118044] Updated weights for policy 0, policy_version 25710 (0.0006) [2023-03-07 03:56:12,592][118044] Updated weights for policy 0, policy_version 25720 (0.0006) [2023-03-07 03:56:13,382][118044] Updated weights for policy 0, policy_version 25730 (0.0006) [2023-03-07 03:56:14,161][118044] Updated weights for policy 0, policy_version 25740 (0.0007) [2023-03-07 03:56:14,938][118044] Updated weights for policy 0, policy_version 25750 (0.0005) [2023-03-07 03:56:15,717][118044] Updated weights for policy 0, policy_version 25760 (0.0006) [2023-03-07 03:56:16,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13124.2, 300 sec: 13141.9). Total num frames: 26382336. Throughput: 0: 13141.6. Samples: 26362113. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:56:16,086][117718] Avg episode reward: [(0, '2881.145')] [2023-03-07 03:56:16,497][118044] Updated weights for policy 0, policy_version 25770 (0.0006) [2023-03-07 03:56:17,272][118044] Updated weights for policy 0, policy_version 25780 (0.0005) [2023-03-07 03:56:18,035][118044] Updated weights for policy 0, policy_version 25790 (0.0006) [2023-03-07 03:56:18,827][118044] Updated weights for policy 0, policy_version 25800 (0.0006) [2023-03-07 03:56:19,603][118044] Updated weights for policy 0, policy_version 25810 (0.0006) [2023-03-07 03:56:20,394][118044] Updated weights for policy 0, policy_version 25820 (0.0007) [2023-03-07 03:56:21,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 26448896. Throughput: 0: 13150.5. Samples: 26440965. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:56:21,086][117718] Avg episode reward: [(0, '3012.295')] [2023-03-07 03:56:21,149][118044] Updated weights for policy 0, policy_version 25830 (0.0005) [2023-03-07 03:56:21,936][118044] Updated weights for policy 0, policy_version 25840 (0.0007) [2023-03-07 03:56:22,726][118044] Updated weights for policy 0, policy_version 25850 (0.0006) [2023-03-07 03:56:23,501][118044] Updated weights for policy 0, policy_version 25860 (0.0006) [2023-03-07 03:56:24,266][118044] Updated weights for policy 0, policy_version 25870 (0.0006) [2023-03-07 03:56:25,046][118044] Updated weights for policy 0, policy_version 25880 (0.0006) [2023-03-07 03:56:25,820][118044] Updated weights for policy 0, policy_version 25890 (0.0006) [2023-03-07 03:56:26,086][117718] Fps is (10 sec: 13209.7, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 26514432. Throughput: 0: 13159.3. Samples: 26480524. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:56:26,086][117718] Avg episode reward: [(0, '2882.721')] [2023-03-07 03:56:26,605][118044] Updated weights for policy 0, policy_version 25900 (0.0006) [2023-03-07 03:56:27,374][118044] Updated weights for policy 0, policy_version 25910 (0.0006) [2023-03-07 03:56:28,154][118044] Updated weights for policy 0, policy_version 25920 (0.0007) [2023-03-07 03:56:28,930][118044] Updated weights for policy 0, policy_version 25930 (0.0006) [2023-03-07 03:56:29,716][118044] Updated weights for policy 0, policy_version 25940 (0.0005) [2023-03-07 03:56:30,493][118044] Updated weights for policy 0, policy_version 25950 (0.0006) [2023-03-07 03:56:31,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.4, 300 sec: 13145.4). Total num frames: 26579968. Throughput: 0: 13160.2. Samples: 26559505. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:56:31,086][117718] Avg episode reward: [(0, '2791.121')] [2023-03-07 03:56:31,273][118044] Updated weights for policy 0, policy_version 25960 (0.0006) [2023-03-07 03:56:32,040][118044] Updated weights for policy 0, policy_version 25970 (0.0006) [2023-03-07 03:56:32,814][118044] Updated weights for policy 0, policy_version 25980 (0.0005) [2023-03-07 03:56:33,583][118044] Updated weights for policy 0, policy_version 25990 (0.0005) [2023-03-07 03:56:34,354][118044] Updated weights for policy 0, policy_version 26000 (0.0006) [2023-03-07 03:56:35,127][118044] Updated weights for policy 0, policy_version 26010 (0.0006) [2023-03-07 03:56:35,901][118044] Updated weights for policy 0, policy_version 26020 (0.0006) [2023-03-07 03:56:36,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13175.5, 300 sec: 13148.9). Total num frames: 26646528. Throughput: 0: 13176.1. Samples: 26638980. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:56:36,086][117718] Avg episode reward: [(0, '2774.079')] [2023-03-07 03:56:36,091][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000026022_26646528.pth... 
[2023-03-07 03:56:36,124][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000022941_23491584.pth [2023-03-07 03:56:36,670][118044] Updated weights for policy 0, policy_version 26030 (0.0006) [2023-03-07 03:56:37,468][118044] Updated weights for policy 0, policy_version 26040 (0.0005) [2023-03-07 03:56:38,250][118044] Updated weights for policy 0, policy_version 26050 (0.0005) [2023-03-07 03:56:39,025][118044] Updated weights for policy 0, policy_version 26060 (0.0006) [2023-03-07 03:56:39,814][118044] Updated weights for policy 0, policy_version 26070 (0.0007) [2023-03-07 03:56:40,589][118044] Updated weights for policy 0, policy_version 26080 (0.0006) [2023-03-07 03:56:41,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 26712064. Throughput: 0: 13172.5. Samples: 26678327. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:56:41,086][117718] Avg episode reward: [(0, '2776.754')] [2023-03-07 03:56:41,354][118044] Updated weights for policy 0, policy_version 26090 (0.0006) [2023-03-07 03:56:42,155][118044] Updated weights for policy 0, policy_version 26100 (0.0006) [2023-03-07 03:56:42,926][118044] Updated weights for policy 0, policy_version 26110 (0.0006) [2023-03-07 03:56:43,706][118044] Updated weights for policy 0, policy_version 26120 (0.0005) [2023-03-07 03:56:44,474][118044] Updated weights for policy 0, policy_version 26130 (0.0006) [2023-03-07 03:56:45,270][118044] Updated weights for policy 0, policy_version 26140 (0.0006) [2023-03-07 03:56:46,049][118044] Updated weights for policy 0, policy_version 26150 (0.0006) [2023-03-07 03:56:46,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 26777600. Throughput: 0: 13170.1. Samples: 26757063. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:56:46,086][117718] Avg episode reward: [(0, '2762.379')] [2023-03-07 03:56:46,829][118044] Updated weights for policy 0, policy_version 26160 (0.0006) [2023-03-07 03:56:47,590][118044] Updated weights for policy 0, policy_version 26170 (0.0006) [2023-03-07 03:56:48,385][118044] Updated weights for policy 0, policy_version 26180 (0.0007) [2023-03-07 03:56:49,164][118044] Updated weights for policy 0, policy_version 26190 (0.0005) [2023-03-07 03:56:49,930][118044] Updated weights for policy 0, policy_version 26200 (0.0006) [2023-03-07 03:56:50,713][118044] Updated weights for policy 0, policy_version 26210 (0.0007) [2023-03-07 03:56:51,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 26843136. Throughput: 0: 13171.8. Samples: 26836114. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:56:51,086][117718] Avg episode reward: [(0, '2686.152')] [2023-03-07 03:56:51,480][118044] Updated weights for policy 0, policy_version 26220 (0.0005) [2023-03-07 03:56:52,262][118044] Updated weights for policy 0, policy_version 26230 (0.0006) [2023-03-07 03:56:53,026][118044] Updated weights for policy 0, policy_version 26240 (0.0006) [2023-03-07 03:56:53,807][118044] Updated weights for policy 0, policy_version 26250 (0.0007) [2023-03-07 03:56:54,587][118044] Updated weights for policy 0, policy_version 26260 (0.0007) [2023-03-07 03:56:55,358][118044] Updated weights for policy 0, policy_version 26270 (0.0006) [2023-03-07 03:56:56,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13148.9). Total num frames: 26909696. Throughput: 0: 13176.6. Samples: 26875836. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:56:56,086][117718] Avg episode reward: [(0, '2787.242')] [2023-03-07 03:56:56,139][118044] Updated weights for policy 0, policy_version 26280 (0.0006) [2023-03-07 03:56:56,913][118044] Updated weights for policy 0, policy_version 26290 (0.0007) [2023-03-07 03:56:57,703][118044] Updated weights for policy 0, policy_version 26300 (0.0006) [2023-03-07 03:56:58,486][118044] Updated weights for policy 0, policy_version 26310 (0.0006) [2023-03-07 03:56:59,245][118044] Updated weights for policy 0, policy_version 26320 (0.0005) [2023-03-07 03:57:00,017][118044] Updated weights for policy 0, policy_version 26330 (0.0006) [2023-03-07 03:57:00,789][118044] Updated weights for policy 0, policy_version 26340 (0.0005) [2023-03-07 03:57:01,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13148.9). Total num frames: 26975232. Throughput: 0: 13174.9. Samples: 26954980. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:57:01,086][117718] Avg episode reward: [(0, '2686.610')] [2023-03-07 03:57:01,574][118044] Updated weights for policy 0, policy_version 26350 (0.0006) [2023-03-07 03:57:02,339][118044] Updated weights for policy 0, policy_version 26360 (0.0006) [2023-03-07 03:57:03,117][118044] Updated weights for policy 0, policy_version 26370 (0.0006) [2023-03-07 03:57:03,900][118044] Updated weights for policy 0, policy_version 26380 (0.0006) [2023-03-07 03:57:04,685][118044] Updated weights for policy 0, policy_version 26390 (0.0006) [2023-03-07 03:57:05,460][118044] Updated weights for policy 0, policy_version 26400 (0.0006) [2023-03-07 03:57:06,086][117718] Fps is (10 sec: 13209.7, 60 sec: 13192.6, 300 sec: 13148.9). Total num frames: 27041792. Throughput: 0: 13175.1. Samples: 27033846. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:57:06,086][117718] Avg episode reward: [(0, '2908.150')] [2023-03-07 03:57:06,245][118044] Updated weights for policy 0, policy_version 26410 (0.0006) [2023-03-07 03:57:07,025][118044] Updated weights for policy 0, policy_version 26420 (0.0007) [2023-03-07 03:57:07,834][118044] Updated weights for policy 0, policy_version 26430 (0.0006) [2023-03-07 03:57:08,602][118044] Updated weights for policy 0, policy_version 26440 (0.0006) [2023-03-07 03:57:09,395][118044] Updated weights for policy 0, policy_version 26450 (0.0005) [2023-03-07 03:57:10,159][118044] Updated weights for policy 0, policy_version 26460 (0.0006) [2023-03-07 03:57:10,944][118044] Updated weights for policy 0, policy_version 26470 (0.0006) [2023-03-07 03:57:11,086][117718] Fps is (10 sec: 13106.9, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 27106304. Throughput: 0: 13166.7. Samples: 27073030. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:57:11,087][117718] Avg episode reward: [(0, '2711.966')] [2023-03-07 03:57:11,721][118044] Updated weights for policy 0, policy_version 26480 (0.0007) [2023-03-07 03:57:12,498][118044] Updated weights for policy 0, policy_version 26490 (0.0007) [2023-03-07 03:57:13,290][118044] Updated weights for policy 0, policy_version 26500 (0.0005) [2023-03-07 03:57:14,064][118044] Updated weights for policy 0, policy_version 26510 (0.0006) [2023-03-07 03:57:14,837][118044] Updated weights for policy 0, policy_version 26520 (0.0006) [2023-03-07 03:57:15,602][118044] Updated weights for policy 0, policy_version 26530 (0.0006) [2023-03-07 03:57:16,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13175.5, 300 sec: 13148.9). Total num frames: 27172864. Throughput: 0: 13166.0. Samples: 27151974. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:57:16,086][117718] Avg episode reward: [(0, '2832.582')] [2023-03-07 03:57:16,374][118044] Updated weights for policy 0, policy_version 26540 (0.0006) [2023-03-07 03:57:17,155][118044] Updated weights for policy 0, policy_version 26550 (0.0006) [2023-03-07 03:57:17,924][118044] Updated weights for policy 0, policy_version 26560 (0.0006) [2023-03-07 03:57:18,703][118044] Updated weights for policy 0, policy_version 26570 (0.0005) [2023-03-07 03:57:19,477][118044] Updated weights for policy 0, policy_version 26580 (0.0006) [2023-03-07 03:57:20,265][118044] Updated weights for policy 0, policy_version 26590 (0.0006) [2023-03-07 03:57:21,043][118044] Updated weights for policy 0, policy_version 26600 (0.0006) [2023-03-07 03:57:21,085][117718] Fps is (10 sec: 13209.9, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 27238400. Throughput: 0: 13158.5. Samples: 27231111. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:57:21,086][117718] Avg episode reward: [(0, '2707.779')] [2023-03-07 03:57:21,816][118044] Updated weights for policy 0, policy_version 26610 (0.0007) [2023-03-07 03:57:22,586][118044] Updated weights for policy 0, policy_version 26620 (0.0007) [2023-03-07 03:57:23,381][118044] Updated weights for policy 0, policy_version 26630 (0.0006) [2023-03-07 03:57:24,160][118044] Updated weights for policy 0, policy_version 26640 (0.0006) [2023-03-07 03:57:24,953][118044] Updated weights for policy 0, policy_version 26650 (0.0006) [2023-03-07 03:57:25,720][118044] Updated weights for policy 0, policy_version 26660 (0.0006) [2023-03-07 03:57:26,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 27303936. Throughput: 0: 13162.3. Samples: 27270632. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:57:26,086][117718] Avg episode reward: [(0, '2775.439')] [2023-03-07 03:57:26,518][118044] Updated weights for policy 0, policy_version 26670 (0.0006) [2023-03-07 03:57:27,285][118044] Updated weights for policy 0, policy_version 26680 (0.0007) [2023-03-07 03:57:28,073][118044] Updated weights for policy 0, policy_version 26690 (0.0007) [2023-03-07 03:57:28,848][118044] Updated weights for policy 0, policy_version 26700 (0.0006) [2023-03-07 03:57:29,629][118044] Updated weights for policy 0, policy_version 26710 (0.0006) [2023-03-07 03:57:30,407][118044] Updated weights for policy 0, policy_version 26720 (0.0006) [2023-03-07 03:57:31,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 27369472. Throughput: 0: 13158.0. Samples: 27349171. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:57:31,086][117718] Avg episode reward: [(0, '2630.315')] [2023-03-07 03:57:31,177][118044] Updated weights for policy 0, policy_version 26730 (0.0007) [2023-03-07 03:57:31,971][118044] Updated weights for policy 0, policy_version 26740 (0.0006) [2023-03-07 03:57:32,745][118044] Updated weights for policy 0, policy_version 26750 (0.0006) [2023-03-07 03:57:33,529][118044] Updated weights for policy 0, policy_version 26760 (0.0006) [2023-03-07 03:57:34,308][118044] Updated weights for policy 0, policy_version 26770 (0.0006) [2023-03-07 03:57:35,083][118044] Updated weights for policy 0, policy_version 26780 (0.0005) [2023-03-07 03:57:35,867][118044] Updated weights for policy 0, policy_version 26790 (0.0005) [2023-03-07 03:57:36,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 27435008. Throughput: 0: 13155.4. Samples: 27428109. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:57:36,086][117718] Avg episode reward: [(0, '2623.457')] [2023-03-07 03:57:36,632][118044] Updated weights for policy 0, policy_version 26800 (0.0006) [2023-03-07 03:57:37,412][118044] Updated weights for policy 0, policy_version 26810 (0.0006) [2023-03-07 03:57:38,184][118044] Updated weights for policy 0, policy_version 26820 (0.0006) [2023-03-07 03:57:38,966][118044] Updated weights for policy 0, policy_version 26830 (0.0006) [2023-03-07 03:57:39,741][118044] Updated weights for policy 0, policy_version 26840 (0.0006) [2023-03-07 03:57:40,518][118044] Updated weights for policy 0, policy_version 26850 (0.0006) [2023-03-07 03:57:41,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 27501568. Throughput: 0: 13153.0. Samples: 27467720. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:57:41,086][117718] Avg episode reward: [(0, '2650.807')] [2023-03-07 03:57:41,274][118044] Updated weights for policy 0, policy_version 26860 (0.0007) [2023-03-07 03:57:42,063][118044] Updated weights for policy 0, policy_version 26870 (0.0006) [2023-03-07 03:57:42,842][118044] Updated weights for policy 0, policy_version 26880 (0.0006) [2023-03-07 03:57:43,615][118044] Updated weights for policy 0, policy_version 26890 (0.0006) [2023-03-07 03:57:44,407][118044] Updated weights for policy 0, policy_version 26900 (0.0006) [2023-03-07 03:57:45,175][118044] Updated weights for policy 0, policy_version 26910 (0.0006) [2023-03-07 03:57:45,947][118044] Updated weights for policy 0, policy_version 26920 (0.0006) [2023-03-07 03:57:46,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 27567104. Throughput: 0: 13150.9. Samples: 27546769. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:57:46,086][117718] Avg episode reward: [(0, '2865.414')] [2023-03-07 03:57:46,710][118044] Updated weights for policy 0, policy_version 26930 (0.0006) [2023-03-07 03:57:47,494][118044] Updated weights for policy 0, policy_version 26940 (0.0006) [2023-03-07 03:57:48,276][118044] Updated weights for policy 0, policy_version 26950 (0.0007) [2023-03-07 03:57:49,037][118044] Updated weights for policy 0, policy_version 26960 (0.0006) [2023-03-07 03:57:49,818][118044] Updated weights for policy 0, policy_version 26970 (0.0007) [2023-03-07 03:57:50,614][118044] Updated weights for policy 0, policy_version 26980 (0.0006) [2023-03-07 03:57:51,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 27632640. Throughput: 0: 13156.1. Samples: 27625872. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:57:51,086][117718] Avg episode reward: [(0, '2662.372')] [2023-03-07 03:57:51,398][118044] Updated weights for policy 0, policy_version 26990 (0.0006) [2023-03-07 03:57:52,169][118044] Updated weights for policy 0, policy_version 27000 (0.0006) [2023-03-07 03:57:52,931][118044] Updated weights for policy 0, policy_version 27010 (0.0006) [2023-03-07 03:57:53,730][118044] Updated weights for policy 0, policy_version 27020 (0.0006) [2023-03-07 03:57:54,496][118044] Updated weights for policy 0, policy_version 27030 (0.0006) [2023-03-07 03:57:55,286][118044] Updated weights for policy 0, policy_version 27040 (0.0005) [2023-03-07 03:57:56,063][118044] Updated weights for policy 0, policy_version 27050 (0.0005) [2023-03-07 03:57:56,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 27699200. Throughput: 0: 13164.5. Samples: 27665429. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:57:56,086][117718] Avg episode reward: [(0, '2786.594')] [2023-03-07 03:57:56,835][118044] Updated weights for policy 0, policy_version 27060 (0.0007) [2023-03-07 03:57:57,612][118044] Updated weights for policy 0, policy_version 27070 (0.0006) [2023-03-07 03:57:58,399][118044] Updated weights for policy 0, policy_version 27080 (0.0007) [2023-03-07 03:57:59,173][118044] Updated weights for policy 0, policy_version 27090 (0.0006) [2023-03-07 03:57:59,951][118044] Updated weights for policy 0, policy_version 27100 (0.0006) [2023-03-07 03:58:00,731][118044] Updated weights for policy 0, policy_version 27110 (0.0006) [2023-03-07 03:58:01,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 27764736. Throughput: 0: 13161.9. Samples: 27744259. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:58:01,086][117718] Avg episode reward: [(0, '2693.485')] [2023-03-07 03:58:01,514][118044] Updated weights for policy 0, policy_version 27120 (0.0006) [2023-03-07 03:58:02,295][118044] Updated weights for policy 0, policy_version 27130 (0.0006) [2023-03-07 03:58:03,078][118044] Updated weights for policy 0, policy_version 27140 (0.0006) [2023-03-07 03:58:03,857][118044] Updated weights for policy 0, policy_version 27150 (0.0006) [2023-03-07 03:58:04,629][118044] Updated weights for policy 0, policy_version 27160 (0.0006) [2023-03-07 03:58:05,401][118044] Updated weights for policy 0, policy_version 27170 (0.0006) [2023-03-07 03:58:06,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 27831296. Throughput: 0: 13160.6. Samples: 27823338. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:58:06,086][117718] Avg episode reward: [(0, '2576.198')] [2023-03-07 03:58:06,180][118044] Updated weights for policy 0, policy_version 27180 (0.0007) [2023-03-07 03:58:06,949][118044] Updated weights for policy 0, policy_version 27190 (0.0006) [2023-03-07 03:58:07,734][118044] Updated weights for policy 0, policy_version 27200 (0.0006) [2023-03-07 03:58:08,518][118044] Updated weights for policy 0, policy_version 27210 (0.0006) [2023-03-07 03:58:09,287][118044] Updated weights for policy 0, policy_version 27220 (0.0007) [2023-03-07 03:58:10,052][118044] Updated weights for policy 0, policy_version 27230 (0.0006) [2023-03-07 03:58:10,842][118044] Updated weights for policy 0, policy_version 27240 (0.0006) [2023-03-07 03:58:11,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13148.9). Total num frames: 27896832. Throughput: 0: 13155.4. Samples: 27862624. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:58:11,086][117718] Avg episode reward: [(0, '2724.287')] [2023-03-07 03:58:11,627][118044] Updated weights for policy 0, policy_version 27250 (0.0006) [2023-03-07 03:58:12,400][118044] Updated weights for policy 0, policy_version 27260 (0.0006) [2023-03-07 03:58:13,196][118044] Updated weights for policy 0, policy_version 27270 (0.0006) [2023-03-07 03:58:13,960][118044] Updated weights for policy 0, policy_version 27280 (0.0006) [2023-03-07 03:58:14,718][118044] Updated weights for policy 0, policy_version 27290 (0.0005) [2023-03-07 03:58:15,528][118044] Updated weights for policy 0, policy_version 27300 (0.0007) [2023-03-07 03:58:16,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 27962368. Throughput: 0: 13166.2. Samples: 27941649. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 03:58:16,086][117718] Avg episode reward: [(0, '2669.146')] [2023-03-07 03:58:16,300][118044] Updated weights for policy 0, policy_version 27310 (0.0006) [2023-03-07 03:58:17,073][118044] Updated weights for policy 0, policy_version 27320 (0.0008) [2023-03-07 03:58:17,853][118044] Updated weights for policy 0, policy_version 27330 (0.0006) [2023-03-07 03:58:18,623][118044] Updated weights for policy 0, policy_version 27340 (0.0007) [2023-03-07 03:58:19,398][118044] Updated weights for policy 0, policy_version 27350 (0.0006) [2023-03-07 03:58:20,183][118044] Updated weights for policy 0, policy_version 27360 (0.0005) [2023-03-07 03:58:20,958][118044] Updated weights for policy 0, policy_version 27370 (0.0007) [2023-03-07 03:58:21,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 28027904. Throughput: 0: 13169.8. Samples: 28020752. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 03:58:21,086][117718] Avg episode reward: [(0, '2632.268')] [2023-03-07 03:58:21,734][118044] Updated weights for policy 0, policy_version 27380 (0.0006) [2023-03-07 03:58:22,501][118044] Updated weights for policy 0, policy_version 27390 (0.0006) [2023-03-07 03:58:23,281][118044] Updated weights for policy 0, policy_version 27400 (0.0006) [2023-03-07 03:58:24,051][118044] Updated weights for policy 0, policy_version 27410 (0.0006) [2023-03-07 03:58:24,828][118044] Updated weights for policy 0, policy_version 27420 (0.0006) [2023-03-07 03:58:25,591][118044] Updated weights for policy 0, policy_version 27430 (0.0006) [2023-03-07 03:58:26,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13152.3). Total num frames: 28094464. Throughput: 0: 13171.3. Samples: 28060429. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 03:58:26,086][117718] Avg episode reward: [(0, '2666.544')] [2023-03-07 03:58:26,369][118044] Updated weights for policy 0, policy_version 27440 (0.0006) [2023-03-07 03:58:27,142][118044] Updated weights for policy 0, policy_version 27450 (0.0006) [2023-03-07 03:58:27,897][118044] Updated weights for policy 0, policy_version 27460 (0.0006) [2023-03-07 03:58:28,687][118044] Updated weights for policy 0, policy_version 27470 (0.0006) [2023-03-07 03:58:29,456][118044] Updated weights for policy 0, policy_version 27480 (0.0006) [2023-03-07 03:58:30,236][118044] Updated weights for policy 0, policy_version 27490 (0.0006) [2023-03-07 03:58:31,012][118044] Updated weights for policy 0, policy_version 27500 (0.0005) [2023-03-07 03:58:31,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13148.8). Total num frames: 28160000. Throughput: 0: 13180.3. Samples: 28139882. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:58:31,086][117718] Avg episode reward: [(0, '2601.300')] [2023-03-07 03:58:31,789][118044] Updated weights for policy 0, policy_version 27510 (0.0006) [2023-03-07 03:58:32,556][118044] Updated weights for policy 0, policy_version 27520 (0.0006) [2023-03-07 03:58:33,341][118044] Updated weights for policy 0, policy_version 27530 (0.0006) [2023-03-07 03:58:34,137][118044] Updated weights for policy 0, policy_version 27540 (0.0006) [2023-03-07 03:58:34,920][118044] Updated weights for policy 0, policy_version 27550 (0.0007) [2023-03-07 03:58:35,692][118044] Updated weights for policy 0, policy_version 27560 (0.0005) [2023-03-07 03:58:36,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13152.3). Total num frames: 28226560. Throughput: 0: 13172.8. Samples: 28218648. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:58:36,086][117718] Avg episode reward: [(0, '2621.395')] [2023-03-07 03:58:36,090][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000027565_28226560.pth... [2023-03-07 03:58:36,122][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000024481_25068544.pth [2023-03-07 03:58:36,474][118044] Updated weights for policy 0, policy_version 27570 (0.0007) [2023-03-07 03:58:37,263][118044] Updated weights for policy 0, policy_version 27580 (0.0006) [2023-03-07 03:58:38,041][118044] Updated weights for policy 0, policy_version 27590 (0.0005) [2023-03-07 03:58:38,806][118044] Updated weights for policy 0, policy_version 27600 (0.0005) [2023-03-07 03:58:39,590][118044] Updated weights for policy 0, policy_version 27610 (0.0006) [2023-03-07 03:58:40,357][118044] Updated weights for policy 0, policy_version 27620 (0.0006) [2023-03-07 03:58:41,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13152.3). Total num frames: 28292096. Throughput: 0: 13171.1. Samples: 28258129. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:58:41,086][117718] Avg episode reward: [(0, '2762.817')] [2023-03-07 03:58:41,136][118044] Updated weights for policy 0, policy_version 27630 (0.0006) [2023-03-07 03:58:41,894][118044] Updated weights for policy 0, policy_version 27640 (0.0006) [2023-03-07 03:58:42,662][118044] Updated weights for policy 0, policy_version 27650 (0.0006) [2023-03-07 03:58:43,434][118044] Updated weights for policy 0, policy_version 27660 (0.0006) [2023-03-07 03:58:44,218][118044] Updated weights for policy 0, policy_version 27670 (0.0007) [2023-03-07 03:58:44,971][118044] Updated weights for policy 0, policy_version 27680 (0.0006) [2023-03-07 03:58:45,764][118044] Updated weights for policy 0, policy_version 27690 (0.0007) [2023-03-07 03:58:46,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13192.5, 300 sec: 13152.3). Total num frames: 28358656. Throughput: 0: 13185.7. Samples: 28337615. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 03:58:46,086][117718] Avg episode reward: [(0, '2772.559')] [2023-03-07 03:58:46,541][118044] Updated weights for policy 0, policy_version 27700 (0.0006) [2023-03-07 03:58:47,314][118044] Updated weights for policy 0, policy_version 27710 (0.0006) [2023-03-07 03:58:48,084][118044] Updated weights for policy 0, policy_version 27720 (0.0006) [2023-03-07 03:58:48,879][118044] Updated weights for policy 0, policy_version 27730 (0.0006) [2023-03-07 03:58:49,664][118044] Updated weights for policy 0, policy_version 27740 (0.0006) [2023-03-07 03:58:50,418][118044] Updated weights for policy 0, policy_version 27750 (0.0006) [2023-03-07 03:58:51,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13152.3). Total num frames: 28424192. Throughput: 0: 13189.7. Samples: 28416876. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:58:51,086][117718] Avg episode reward: [(0, '2773.303')] [2023-03-07 03:58:51,226][118044] Updated weights for policy 0, policy_version 27760 (0.0006) [2023-03-07 03:58:52,002][118044] Updated weights for policy 0, policy_version 27770 (0.0007) [2023-03-07 03:58:52,781][118044] Updated weights for policy 0, policy_version 27780 (0.0006) [2023-03-07 03:58:53,544][118044] Updated weights for policy 0, policy_version 27790 (0.0006) [2023-03-07 03:58:54,321][118044] Updated weights for policy 0, policy_version 27800 (0.0006) [2023-03-07 03:58:55,077][118044] Updated weights for policy 0, policy_version 27810 (0.0006) [2023-03-07 03:58:55,855][118044] Updated weights for policy 0, policy_version 27820 (0.0006) [2023-03-07 03:58:56,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13175.5, 300 sec: 13152.3). Total num frames: 28489728. Throughput: 0: 13188.9. Samples: 28456123. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:58:56,086][117718] Avg episode reward: [(0, '2857.146')] [2023-03-07 03:58:56,637][118044] Updated weights for policy 0, policy_version 27830 (0.0006) [2023-03-07 03:58:57,414][118044] Updated weights for policy 0, policy_version 27840 (0.0005) [2023-03-07 03:58:58,211][118044] Updated weights for policy 0, policy_version 27850 (0.0007) [2023-03-07 03:58:58,991][118044] Updated weights for policy 0, policy_version 27860 (0.0006) [2023-03-07 03:58:59,766][118044] Updated weights for policy 0, policy_version 27870 (0.0006) [2023-03-07 03:59:00,553][118044] Updated weights for policy 0, policy_version 27880 (0.0006) [2023-03-07 03:59:01,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13175.5, 300 sec: 13152.3). Total num frames: 28555264. Throughput: 0: 13186.4. Samples: 28535040. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:59:01,087][117718] Avg episode reward: [(0, '2823.498')] [2023-03-07 03:59:01,337][118044] Updated weights for policy 0, policy_version 27890 (0.0006) [2023-03-07 03:59:02,117][118044] Updated weights for policy 0, policy_version 27900 (0.0006) [2023-03-07 03:59:02,886][118044] Updated weights for policy 0, policy_version 27910 (0.0006) [2023-03-07 03:59:03,673][118044] Updated weights for policy 0, policy_version 27920 (0.0006) [2023-03-07 03:59:04,450][118044] Updated weights for policy 0, policy_version 27930 (0.0006) [2023-03-07 03:59:05,240][118044] Updated weights for policy 0, policy_version 27940 (0.0007) [2023-03-07 03:59:06,025][118044] Updated weights for policy 0, policy_version 27950 (0.0006) [2023-03-07 03:59:06,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 28620800. Throughput: 0: 13180.2. Samples: 28613860. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:59:06,086][117718] Avg episode reward: [(0, '2854.954')] [2023-03-07 03:59:06,795][118044] Updated weights for policy 0, policy_version 27960 (0.0006) [2023-03-07 03:59:07,565][118044] Updated weights for policy 0, policy_version 27970 (0.0006) [2023-03-07 03:59:08,347][118044] Updated weights for policy 0, policy_version 27980 (0.0006) [2023-03-07 03:59:09,133][118044] Updated weights for policy 0, policy_version 27990 (0.0005) [2023-03-07 03:59:09,904][118044] Updated weights for policy 0, policy_version 28000 (0.0006) [2023-03-07 03:59:10,684][118044] Updated weights for policy 0, policy_version 28010 (0.0006) [2023-03-07 03:59:11,086][117718] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13159.3). Total num frames: 28687360. Throughput: 0: 13176.7. Samples: 28653380. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:59:11,086][117718] Avg episode reward: [(0, '2794.993')] [2023-03-07 03:59:11,466][118044] Updated weights for policy 0, policy_version 28020 (0.0005) [2023-03-07 03:59:12,234][118044] Updated weights for policy 0, policy_version 28030 (0.0006) [2023-03-07 03:59:13,013][118044] Updated weights for policy 0, policy_version 28040 (0.0006) [2023-03-07 03:59:13,774][118044] Updated weights for policy 0, policy_version 28050 (0.0006) [2023-03-07 03:59:14,560][118044] Updated weights for policy 0, policy_version 28060 (0.0007) [2023-03-07 03:59:15,341][118044] Updated weights for policy 0, policy_version 28070 (0.0006) [2023-03-07 03:59:16,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13155.8). Total num frames: 28752896. Throughput: 0: 13167.6. Samples: 28732425. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:59:16,086][117718] Avg episode reward: [(0, '2770.822')] [2023-03-07 03:59:16,124][118044] Updated weights for policy 0, policy_version 28080 (0.0006) [2023-03-07 03:59:16,901][118044] Updated weights for policy 0, policy_version 28090 (0.0006) [2023-03-07 03:59:17,680][118044] Updated weights for policy 0, policy_version 28100 (0.0007) [2023-03-07 03:59:18,474][118044] Updated weights for policy 0, policy_version 28110 (0.0006) [2023-03-07 03:59:19,250][118044] Updated weights for policy 0, policy_version 28120 (0.0006) [2023-03-07 03:59:20,031][118044] Updated weights for policy 0, policy_version 28130 (0.0006) [2023-03-07 03:59:20,811][118044] Updated weights for policy 0, policy_version 28140 (0.0006) [2023-03-07 03:59:21,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13175.5, 300 sec: 13155.8). Total num frames: 28818432. Throughput: 0: 13168.4. Samples: 28811227. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 03:59:21,097][117718] Avg episode reward: [(0, '2682.705')] [2023-03-07 03:59:21,586][118044] Updated weights for policy 0, policy_version 28150 (0.0006) [2023-03-07 03:59:22,368][118044] Updated weights for policy 0, policy_version 28160 (0.0006) [2023-03-07 03:59:23,153][118044] Updated weights for policy 0, policy_version 28170 (0.0006) [2023-03-07 03:59:23,968][118044] Updated weights for policy 0, policy_version 28180 (0.0006) [2023-03-07 03:59:24,741][118044] Updated weights for policy 0, policy_version 28190 (0.0006) [2023-03-07 03:59:25,517][118044] Updated weights for policy 0, policy_version 28200 (0.0006) [2023-03-07 03:59:26,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13155.8). Total num frames: 28883968. Throughput: 0: 13163.0. Samples: 28850466. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:59:26,097][117718] Avg episode reward: [(0, '2777.845')] [2023-03-07 03:59:26,297][118044] Updated weights for policy 0, policy_version 28210 (0.0006) [2023-03-07 03:59:27,073][118044] Updated weights for policy 0, policy_version 28220 (0.0006) [2023-03-07 03:59:27,850][118044] Updated weights for policy 0, policy_version 28230 (0.0006) [2023-03-07 03:59:28,626][118044] Updated weights for policy 0, policy_version 28240 (0.0006) [2023-03-07 03:59:29,409][118044] Updated weights for policy 0, policy_version 28250 (0.0006) [2023-03-07 03:59:30,203][118044] Updated weights for policy 0, policy_version 28260 (0.0006) [2023-03-07 03:59:30,967][118044] Updated weights for policy 0, policy_version 28270 (0.0006) [2023-03-07 03:59:31,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13155.8). Total num frames: 28949504. Throughput: 0: 13146.8. Samples: 28929218. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:59:31,086][117718] Avg episode reward: [(0, '2854.353')] [2023-03-07 03:59:31,755][118044] Updated weights for policy 0, policy_version 28280 (0.0006) [2023-03-07 03:59:32,541][118044] Updated weights for policy 0, policy_version 28290 (0.0007) [2023-03-07 03:59:33,310][118044] Updated weights for policy 0, policy_version 28300 (0.0006) [2023-03-07 03:59:34,089][118044] Updated weights for policy 0, policy_version 28310 (0.0006) [2023-03-07 03:59:34,871][118044] Updated weights for policy 0, policy_version 28320 (0.0006) [2023-03-07 03:59:35,653][118044] Updated weights for policy 0, policy_version 28330 (0.0006) [2023-03-07 03:59:36,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13152.3). Total num frames: 29015040. Throughput: 0: 13132.6. Samples: 29007841. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:59:36,086][117718] Avg episode reward: [(0, '2860.677')] [2023-03-07 03:59:36,430][118044] Updated weights for policy 0, policy_version 28340 (0.0006) [2023-03-07 03:59:37,212][118044] Updated weights for policy 0, policy_version 28350 (0.0005) [2023-03-07 03:59:37,998][118044] Updated weights for policy 0, policy_version 28360 (0.0006) [2023-03-07 03:59:38,774][118044] Updated weights for policy 0, policy_version 28370 (0.0006) [2023-03-07 03:59:39,558][118044] Updated weights for policy 0, policy_version 28380 (0.0006) [2023-03-07 03:59:40,332][118044] Updated weights for policy 0, policy_version 28390 (0.0006) [2023-03-07 03:59:41,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13152.3). Total num frames: 29080576. Throughput: 0: 13132.7. Samples: 29047097. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:59:41,086][117718] Avg episode reward: [(0, '2681.747')] [2023-03-07 03:59:41,129][118044] Updated weights for policy 0, policy_version 28400 (0.0007) [2023-03-07 03:59:41,913][118044] Updated weights for policy 0, policy_version 28410 (0.0006) [2023-03-07 03:59:42,694][118044] Updated weights for policy 0, policy_version 28420 (0.0006) [2023-03-07 03:59:43,495][118044] Updated weights for policy 0, policy_version 28430 (0.0007) [2023-03-07 03:59:44,263][118044] Updated weights for policy 0, policy_version 28440 (0.0006) [2023-03-07 03:59:45,053][118044] Updated weights for policy 0, policy_version 28450 (0.0006) [2023-03-07 03:59:45,831][118044] Updated weights for policy 0, policy_version 28460 (0.0005) [2023-03-07 03:59:46,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13152.3). Total num frames: 29146112. Throughput: 0: 13120.4. Samples: 29125454. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:59:46,086][117718] Avg episode reward: [(0, '2754.468')] [2023-03-07 03:59:46,616][118044] Updated weights for policy 0, policy_version 28470 (0.0006) [2023-03-07 03:59:47,380][118044] Updated weights for policy 0, policy_version 28480 (0.0006) [2023-03-07 03:59:48,167][118044] Updated weights for policy 0, policy_version 28490 (0.0006) [2023-03-07 03:59:48,942][118044] Updated weights for policy 0, policy_version 28500 (0.0006) [2023-03-07 03:59:49,737][118044] Updated weights for policy 0, policy_version 28510 (0.0007) [2023-03-07 03:59:50,514][118044] Updated weights for policy 0, policy_version 28520 (0.0006) [2023-03-07 03:59:51,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13152.3). Total num frames: 29211648. Throughput: 0: 13116.6. Samples: 29204109. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:59:51,086][117718] Avg episode reward: [(0, '2707.456')] [2023-03-07 03:59:51,299][118044] Updated weights for policy 0, policy_version 28530 (0.0006) [2023-03-07 03:59:52,063][118044] Updated weights for policy 0, policy_version 28540 (0.0006) [2023-03-07 03:59:52,873][118044] Updated weights for policy 0, policy_version 28550 (0.0006) [2023-03-07 03:59:53,653][118044] Updated weights for policy 0, policy_version 28560 (0.0006) [2023-03-07 03:59:54,435][118044] Updated weights for policy 0, policy_version 28570 (0.0006) [2023-03-07 03:59:55,230][118044] Updated weights for policy 0, policy_version 28580 (0.0007) [2023-03-07 03:59:55,985][118044] Updated weights for policy 0, policy_version 28590 (0.0006) [2023-03-07 03:59:56,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13124.3, 300 sec: 13152.3). Total num frames: 29277184. Throughput: 0: 13109.1. Samples: 29243288. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 03:59:56,086][117718] Avg episode reward: [(0, '2669.973')] [2023-03-07 03:59:56,763][118044] Updated weights for policy 0, policy_version 28600 (0.0007) [2023-03-07 03:59:57,531][118044] Updated weights for policy 0, policy_version 28610 (0.0006) [2023-03-07 03:59:58,327][118044] Updated weights for policy 0, policy_version 28620 (0.0006) [2023-03-07 03:59:59,107][118044] Updated weights for policy 0, policy_version 28630 (0.0006) [2023-03-07 03:59:59,874][118044] Updated weights for policy 0, policy_version 28640 (0.0007) [2023-03-07 04:00:00,640][118044] Updated weights for policy 0, policy_version 28650 (0.0006) [2023-03-07 04:00:01,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13152.3). Total num frames: 29342720. Throughput: 0: 13108.3. Samples: 29322299. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:00:01,086][117718] Avg episode reward: [(0, '2832.359')] [2023-03-07 04:00:01,414][118044] Updated weights for policy 0, policy_version 28660 (0.0006) [2023-03-07 04:00:02,193][118044] Updated weights for policy 0, policy_version 28670 (0.0006) [2023-03-07 04:00:02,967][118044] Updated weights for policy 0, policy_version 28680 (0.0006) [2023-03-07 04:00:03,731][118044] Updated weights for policy 0, policy_version 28690 (0.0006) [2023-03-07 04:00:04,517][118044] Updated weights for policy 0, policy_version 28700 (0.0005) [2023-03-07 04:00:05,282][118044] Updated weights for policy 0, policy_version 28710 (0.0006) [2023-03-07 04:00:06,067][118044] Updated weights for policy 0, policy_version 28720 (0.0006) [2023-03-07 04:00:06,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13141.3, 300 sec: 13152.3). Total num frames: 29409280. Throughput: 0: 13116.6. Samples: 29401472. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:00:06,086][117718] Avg episode reward: [(0, '2633.479')] [2023-03-07 04:00:06,870][118044] Updated weights for policy 0, policy_version 28730 (0.0006) [2023-03-07 04:00:07,658][118044] Updated weights for policy 0, policy_version 28740 (0.0006) [2023-03-07 04:00:08,437][118044] Updated weights for policy 0, policy_version 28750 (0.0006) [2023-03-07 04:00:09,217][118044] Updated weights for policy 0, policy_version 28760 (0.0007) [2023-03-07 04:00:09,984][118044] Updated weights for policy 0, policy_version 28770 (0.0006) [2023-03-07 04:00:10,777][118044] Updated weights for policy 0, policy_version 28780 (0.0005) [2023-03-07 04:00:11,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13148.8). Total num frames: 29473792. Throughput: 0: 13115.1. Samples: 29440645. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:00:11,086][117718] Avg episode reward: [(0, '2813.038')] [2023-03-07 04:00:11,563][118044] Updated weights for policy 0, policy_version 28790 (0.0006) [2023-03-07 04:00:12,329][118044] Updated weights for policy 0, policy_version 28800 (0.0006) [2023-03-07 04:00:13,106][118044] Updated weights for policy 0, policy_version 28810 (0.0006) [2023-03-07 04:00:13,872][118044] Updated weights for policy 0, policy_version 28820 (0.0007) [2023-03-07 04:00:14,644][118044] Updated weights for policy 0, policy_version 28830 (0.0006) [2023-03-07 04:00:15,426][118044] Updated weights for policy 0, policy_version 28840 (0.0007) [2023-03-07 04:00:16,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13152.3). Total num frames: 29540352. Throughput: 0: 13124.1. Samples: 29519804. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:00:16,086][117718] Avg episode reward: [(0, '2772.905')] [2023-03-07 04:00:16,215][118044] Updated weights for policy 0, policy_version 28850 (0.0007) [2023-03-07 04:00:17,011][118044] Updated weights for policy 0, policy_version 28860 (0.0006) [2023-03-07 04:00:17,789][118044] Updated weights for policy 0, policy_version 28870 (0.0006) [2023-03-07 04:00:18,582][118044] Updated weights for policy 0, policy_version 28880 (0.0006) [2023-03-07 04:00:19,353][118044] Updated weights for policy 0, policy_version 28890 (0.0006) [2023-03-07 04:00:20,110][118044] Updated weights for policy 0, policy_version 28900 (0.0006) [2023-03-07 04:00:20,884][118044] Updated weights for policy 0, policy_version 28910 (0.0006) [2023-03-07 04:00:21,086][117718] Fps is (10 sec: 13209.7, 60 sec: 13124.3, 300 sec: 13152.3). Total num frames: 29605888. Throughput: 0: 13127.2. Samples: 29598566. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:00:21,086][117718] Avg episode reward: [(0, '2722.834')] [2023-03-07 04:00:21,653][118044] Updated weights for policy 0, policy_version 28920 (0.0005) [2023-03-07 04:00:22,424][118044] Updated weights for policy 0, policy_version 28930 (0.0006) [2023-03-07 04:00:23,199][118044] Updated weights for policy 0, policy_version 28940 (0.0006) [2023-03-07 04:00:23,978][118044] Updated weights for policy 0, policy_version 28950 (0.0006) [2023-03-07 04:00:24,741][118044] Updated weights for policy 0, policy_version 28960 (0.0006) [2023-03-07 04:00:25,526][118044] Updated weights for policy 0, policy_version 28970 (0.0007) [2023-03-07 04:00:26,086][117718] Fps is (10 sec: 13209.4, 60 sec: 13141.3, 300 sec: 13155.8). Total num frames: 29672448. Throughput: 0: 13138.6. Samples: 29638336. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:00:26,086][117718] Avg episode reward: [(0, '2923.595')] [2023-03-07 04:00:26,295][118044] Updated weights for policy 0, policy_version 28980 (0.0006) [2023-03-07 04:00:27,062][118044] Updated weights for policy 0, policy_version 28990 (0.0006) [2023-03-07 04:00:27,844][118044] Updated weights for policy 0, policy_version 29000 (0.0005) [2023-03-07 04:00:28,612][118044] Updated weights for policy 0, policy_version 29010 (0.0006) [2023-03-07 04:00:29,375][118044] Updated weights for policy 0, policy_version 29020 (0.0006) [2023-03-07 04:00:30,158][118044] Updated weights for policy 0, policy_version 29030 (0.0005) [2023-03-07 04:00:30,931][118044] Updated weights for policy 0, policy_version 29040 (0.0005) [2023-03-07 04:00:31,085][117718] Fps is (10 sec: 13312.1, 60 sec: 13158.4, 300 sec: 13162.8). Total num frames: 29739008. Throughput: 0: 13164.5. Samples: 29717858. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:00:31,086][117718] Avg episode reward: [(0, '2758.669')] [2023-03-07 04:00:31,715][118044] Updated weights for policy 0, policy_version 29050 (0.0006) [2023-03-07 04:00:32,498][118044] Updated weights for policy 0, policy_version 29060 (0.0006) [2023-03-07 04:00:33,272][118044] Updated weights for policy 0, policy_version 29070 (0.0005) [2023-03-07 04:00:34,056][118044] Updated weights for policy 0, policy_version 29080 (0.0006) [2023-03-07 04:00:34,833][118044] Updated weights for policy 0, policy_version 29090 (0.0006) [2023-03-07 04:00:35,621][118044] Updated weights for policy 0, policy_version 29100 (0.0006) [2023-03-07 04:00:36,086][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13155.8). Total num frames: 29803520. Throughput: 0: 13166.6. Samples: 29796608. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:00:36,086][117718] Avg episode reward: [(0, '2871.242')] [2023-03-07 04:00:36,100][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000029106_29804544.pth... [2023-03-07 04:00:36,131][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000026022_26646528.pth [2023-03-07 04:00:36,399][118044] Updated weights for policy 0, policy_version 29110 (0.0006) [2023-03-07 04:00:37,193][118044] Updated weights for policy 0, policy_version 29120 (0.0006) [2023-03-07 04:00:37,975][118044] Updated weights for policy 0, policy_version 29130 (0.0006) [2023-03-07 04:00:38,753][118044] Updated weights for policy 0, policy_version 29140 (0.0006) [2023-03-07 04:00:39,526][118044] Updated weights for policy 0, policy_version 29150 (0.0006) [2023-03-07 04:00:40,339][118044] Updated weights for policy 0, policy_version 29160 (0.0006) [2023-03-07 04:00:41,086][117718] Fps is (10 sec: 13004.6, 60 sec: 13141.3, 300 sec: 13155.8). Total num frames: 29869056. Throughput: 0: 13171.3. 
Samples: 29835998. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:00:41,086][117718] Avg episode reward: [(0, '2678.414')] [2023-03-07 04:00:41,114][118044] Updated weights for policy 0, policy_version 29170 (0.0006) [2023-03-07 04:00:41,886][118044] Updated weights for policy 0, policy_version 29180 (0.0006) [2023-03-07 04:00:42,677][118044] Updated weights for policy 0, policy_version 29190 (0.0006) [2023-03-07 04:00:43,436][118044] Updated weights for policy 0, policy_version 29200 (0.0005) [2023-03-07 04:00:44,215][118044] Updated weights for policy 0, policy_version 29210 (0.0006) [2023-03-07 04:00:44,993][118044] Updated weights for policy 0, policy_version 29220 (0.0006) [2023-03-07 04:00:45,777][118044] Updated weights for policy 0, policy_version 29230 (0.0006) [2023-03-07 04:00:46,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13155.8). Total num frames: 29934592. Throughput: 0: 13160.6. Samples: 29914528. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:00:46,086][117718] Avg episode reward: [(0, '2678.657')] [2023-03-07 04:00:46,561][118044] Updated weights for policy 0, policy_version 29240 (0.0006) [2023-03-07 04:00:47,351][118044] Updated weights for policy 0, policy_version 29250 (0.0006) [2023-03-07 04:00:48,120][118044] Updated weights for policy 0, policy_version 29260 (0.0007) [2023-03-07 04:00:48,901][118044] Updated weights for policy 0, policy_version 29270 (0.0006) [2023-03-07 04:00:49,672][118044] Updated weights for policy 0, policy_version 29280 (0.0005) [2023-03-07 04:00:50,457][118044] Updated weights for policy 0, policy_version 29290 (0.0006) [2023-03-07 04:00:51,085][117718] Fps is (10 sec: 13209.8, 60 sec: 13158.4, 300 sec: 13159.3). Total num frames: 30001152. Throughput: 0: 13153.1. Samples: 29993361. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:00:51,086][117718] Avg episode reward: [(0, '2794.295')] [2023-03-07 04:00:51,241][118044] Updated weights for policy 0, policy_version 29300 (0.0006) [2023-03-07 04:00:52,005][118044] Updated weights for policy 0, policy_version 29310 (0.0005) [2023-03-07 04:00:52,781][118044] Updated weights for policy 0, policy_version 29320 (0.0007) [2023-03-07 04:00:53,574][118044] Updated weights for policy 0, policy_version 29330 (0.0006) [2023-03-07 04:00:54,355][118044] Updated weights for policy 0, policy_version 29340 (0.0006) [2023-03-07 04:00:55,132][118044] Updated weights for policy 0, policy_version 29350 (0.0006) [2023-03-07 04:00:55,901][118044] Updated weights for policy 0, policy_version 29360 (0.0006) [2023-03-07 04:00:56,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13159.3). Total num frames: 30066688. Throughput: 0: 13162.8. Samples: 30032968. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:00:56,086][117718] Avg episode reward: [(0, '2856.693')] [2023-03-07 04:00:56,670][118044] Updated weights for policy 0, policy_version 29370 (0.0007) [2023-03-07 04:00:57,488][118044] Updated weights for policy 0, policy_version 29380 (0.0006) [2023-03-07 04:00:58,244][118044] Updated weights for policy 0, policy_version 29390 (0.0006) [2023-03-07 04:00:59,019][118044] Updated weights for policy 0, policy_version 29400 (0.0007) [2023-03-07 04:00:59,820][118044] Updated weights for policy 0, policy_version 29410 (0.0006) [2023-03-07 04:01:00,590][118044] Updated weights for policy 0, policy_version 29420 (0.0006) [2023-03-07 04:01:01,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13159.3). Total num frames: 30132224. Throughput: 0: 13153.0. Samples: 30111690. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:01:01,086][117718] Avg episode reward: [(0, '2781.052')] [2023-03-07 04:01:01,362][118044] Updated weights for policy 0, policy_version 29430 (0.0006) [2023-03-07 04:01:02,140][118044] Updated weights for policy 0, policy_version 29440 (0.0006) [2023-03-07 04:01:02,932][118044] Updated weights for policy 0, policy_version 29450 (0.0006) [2023-03-07 04:01:03,705][118044] Updated weights for policy 0, policy_version 29460 (0.0007) [2023-03-07 04:01:04,496][118044] Updated weights for policy 0, policy_version 29470 (0.0007) [2023-03-07 04:01:05,255][118044] Updated weights for policy 0, policy_version 29480 (0.0006) [2023-03-07 04:01:06,044][118044] Updated weights for policy 0, policy_version 29490 (0.0006) [2023-03-07 04:01:06,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13155.8). Total num frames: 30197760. Throughput: 0: 13157.8. Samples: 30190667. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:01:06,086][117718] Avg episode reward: [(0, '2791.825')] [2023-03-07 04:01:06,828][118044] Updated weights for policy 0, policy_version 29500 (0.0006) [2023-03-07 04:01:07,595][118044] Updated weights for policy 0, policy_version 29510 (0.0005) [2023-03-07 04:01:08,371][118044] Updated weights for policy 0, policy_version 29520 (0.0006) [2023-03-07 04:01:09,148][118044] Updated weights for policy 0, policy_version 29530 (0.0006) [2023-03-07 04:01:09,923][118044] Updated weights for policy 0, policy_version 29540 (0.0006) [2023-03-07 04:01:10,707][118044] Updated weights for policy 0, policy_version 29550 (0.0006) [2023-03-07 04:01:11,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13155.8). Total num frames: 30263296. Throughput: 0: 13148.8. Samples: 30230029. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:01:11,086][117718] Avg episode reward: [(0, '2719.555')] [2023-03-07 04:01:11,487][118044] Updated weights for policy 0, policy_version 29560 (0.0006) [2023-03-07 04:01:12,280][118044] Updated weights for policy 0, policy_version 29570 (0.0005) [2023-03-07 04:01:13,033][118044] Updated weights for policy 0, policy_version 29580 (0.0006) [2023-03-07 04:01:13,813][118044] Updated weights for policy 0, policy_version 29590 (0.0006) [2023-03-07 04:01:14,599][118044] Updated weights for policy 0, policy_version 29600 (0.0006) [2023-03-07 04:01:15,402][118044] Updated weights for policy 0, policy_version 29610 (0.0005) [2023-03-07 04:01:16,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13152.3). Total num frames: 30328832. Throughput: 0: 13132.4. Samples: 30308819. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:01:16,086][117718] Avg episode reward: [(0, '2812.925')] [2023-03-07 04:01:16,171][118044] Updated weights for policy 0, policy_version 29620 (0.0005) [2023-03-07 04:01:16,967][118044] Updated weights for policy 0, policy_version 29630 (0.0006) [2023-03-07 04:01:17,742][118044] Updated weights for policy 0, policy_version 29640 (0.0007) [2023-03-07 04:01:18,520][118044] Updated weights for policy 0, policy_version 29650 (0.0006) [2023-03-07 04:01:19,307][118044] Updated weights for policy 0, policy_version 29660 (0.0006) [2023-03-07 04:01:20,078][118044] Updated weights for policy 0, policy_version 29670 (0.0006) [2023-03-07 04:01:20,846][118044] Updated weights for policy 0, policy_version 29680 (0.0007) [2023-03-07 04:01:21,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.4, 300 sec: 13152.3). Total num frames: 30394368. Throughput: 0: 13131.5. Samples: 30387526. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:01:21,086][117718] Avg episode reward: [(0, '2698.663')] [2023-03-07 04:01:21,617][118044] Updated weights for policy 0, policy_version 29690 (0.0007) [2023-03-07 04:01:22,387][118044] Updated weights for policy 0, policy_version 29700 (0.0006) [2023-03-07 04:01:23,156][118044] Updated weights for policy 0, policy_version 29710 (0.0006) [2023-03-07 04:01:23,923][118044] Updated weights for policy 0, policy_version 29720 (0.0005) [2023-03-07 04:01:24,713][118044] Updated weights for policy 0, policy_version 29730 (0.0006) [2023-03-07 04:01:25,478][118044] Updated weights for policy 0, policy_version 29740 (0.0006) [2023-03-07 04:01:26,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13141.4, 300 sec: 13155.8). Total num frames: 30460928. Throughput: 0: 13140.7. Samples: 30427328. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:01:26,086][117718] Avg episode reward: [(0, '2915.267')] [2023-03-07 04:01:26,263][118044] Updated weights for policy 0, policy_version 29750 (0.0006) [2023-03-07 04:01:27,078][118044] Updated weights for policy 0, policy_version 29760 (0.0007) [2023-03-07 04:01:27,841][118044] Updated weights for policy 0, policy_version 29770 (0.0005) [2023-03-07 04:01:28,615][118044] Updated weights for policy 0, policy_version 29780 (0.0007) [2023-03-07 04:01:29,391][118044] Updated weights for policy 0, policy_version 29790 (0.0006) [2023-03-07 04:01:30,168][118044] Updated weights for policy 0, policy_version 29800 (0.0007) [2023-03-07 04:01:30,950][118044] Updated weights for policy 0, policy_version 29810 (0.0006) [2023-03-07 04:01:31,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13124.3, 300 sec: 13152.3). Total num frames: 30526464. Throughput: 0: 13149.0. Samples: 30506232. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:01:31,086][117718] Avg episode reward: [(0, '2964.600')] [2023-03-07 04:01:31,733][118044] Updated weights for policy 0, policy_version 29820 (0.0006) [2023-03-07 04:01:32,506][118044] Updated weights for policy 0, policy_version 29830 (0.0005) [2023-03-07 04:01:33,301][118044] Updated weights for policy 0, policy_version 29840 (0.0006) [2023-03-07 04:01:34,096][118044] Updated weights for policy 0, policy_version 29850 (0.0006) [2023-03-07 04:01:34,861][118044] Updated weights for policy 0, policy_version 29860 (0.0005) [2023-03-07 04:01:35,638][118044] Updated weights for policy 0, policy_version 29870 (0.0006) [2023-03-07 04:01:36,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13152.3). Total num frames: 30592000. Throughput: 0: 13147.4. Samples: 30584997. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:01:36,086][117718] Avg episode reward: [(0, '2914.062')] [2023-03-07 04:01:36,440][118044] Updated weights for policy 0, policy_version 29880 (0.0006) [2023-03-07 04:01:37,227][118044] Updated weights for policy 0, policy_version 29890 (0.0006) [2023-03-07 04:01:37,991][118044] Updated weights for policy 0, policy_version 29900 (0.0006) [2023-03-07 04:01:38,776][118044] Updated weights for policy 0, policy_version 29910 (0.0006) [2023-03-07 04:01:39,573][118044] Updated weights for policy 0, policy_version 29920 (0.0006) [2023-03-07 04:01:40,333][118044] Updated weights for policy 0, policy_version 29930 (0.0006) [2023-03-07 04:01:41,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13141.3, 300 sec: 13152.3). Total num frames: 30657536. Throughput: 0: 13136.1. Samples: 30624092. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:01:41,086][117718] Avg episode reward: [(0, '2978.748')] [2023-03-07 04:01:41,107][118044] Updated weights for policy 0, policy_version 29940 (0.0006) [2023-03-07 04:01:41,879][118044] Updated weights for policy 0, policy_version 29950 (0.0005) [2023-03-07 04:01:42,682][118044] Updated weights for policy 0, policy_version 29960 (0.0006) [2023-03-07 04:01:43,465][118044] Updated weights for policy 0, policy_version 29970 (0.0006) [2023-03-07 04:01:44,234][118044] Updated weights for policy 0, policy_version 29980 (0.0006) [2023-03-07 04:01:44,994][118044] Updated weights for policy 0, policy_version 29990 (0.0006) [2023-03-07 04:01:45,770][118044] Updated weights for policy 0, policy_version 30000 (0.0006) [2023-03-07 04:01:46,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13155.8). Total num frames: 30724096. Throughput: 0: 13139.4. Samples: 30702965. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:01:46,086][117718] Avg episode reward: [(0, '2915.564')] [2023-03-07 04:01:46,555][118044] Updated weights for policy 0, policy_version 30010 (0.0007) [2023-03-07 04:01:47,345][118044] Updated weights for policy 0, policy_version 30020 (0.0006) [2023-03-07 04:01:48,109][118044] Updated weights for policy 0, policy_version 30030 (0.0006) [2023-03-07 04:01:48,896][118044] Updated weights for policy 0, policy_version 30040 (0.0006) [2023-03-07 04:01:49,667][118044] Updated weights for policy 0, policy_version 30050 (0.0007) [2023-03-07 04:01:50,442][118044] Updated weights for policy 0, policy_version 30060 (0.0006) [2023-03-07 04:01:51,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13141.3, 300 sec: 13152.3). Total num frames: 30789632. Throughput: 0: 13140.0. Samples: 30781968. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:01:51,086][117718] Avg episode reward: [(0, '2919.379')] [2023-03-07 04:01:51,211][118044] Updated weights for policy 0, policy_version 30070 (0.0006) [2023-03-07 04:01:51,986][118044] Updated weights for policy 0, policy_version 30080 (0.0006) [2023-03-07 04:01:52,763][118044] Updated weights for policy 0, policy_version 30090 (0.0006) [2023-03-07 04:01:53,533][118044] Updated weights for policy 0, policy_version 30100 (0.0006) [2023-03-07 04:01:54,320][118044] Updated weights for policy 0, policy_version 30110 (0.0006) [2023-03-07 04:01:55,119][118044] Updated weights for policy 0, policy_version 30120 (0.0006) [2023-03-07 04:01:55,878][118044] Updated weights for policy 0, policy_version 30130 (0.0006) [2023-03-07 04:01:56,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13152.3). Total num frames: 30855168. Throughput: 0: 13147.0. Samples: 30821644. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:01:56,086][117718] Avg episode reward: [(0, '3003.156')] [2023-03-07 04:01:56,641][118044] Updated weights for policy 0, policy_version 30140 (0.0006) [2023-03-07 04:01:57,442][118044] Updated weights for policy 0, policy_version 30150 (0.0006) [2023-03-07 04:01:58,227][118044] Updated weights for policy 0, policy_version 30160 (0.0006) [2023-03-07 04:01:58,983][118044] Updated weights for policy 0, policy_version 30170 (0.0005) [2023-03-07 04:01:59,778][118044] Updated weights for policy 0, policy_version 30180 (0.0006) [2023-03-07 04:02:00,561][118044] Updated weights for policy 0, policy_version 30190 (0.0006) [2023-03-07 04:02:01,086][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 30920704. Throughput: 0: 13150.6. Samples: 30900598. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:02:01,086][117718] Avg episode reward: [(0, '2930.314')] [2023-03-07 04:02:01,338][118044] Updated weights for policy 0, policy_version 30200 (0.0007) [2023-03-07 04:02:02,114][118044] Updated weights for policy 0, policy_version 30210 (0.0006) [2023-03-07 04:02:02,885][118044] Updated weights for policy 0, policy_version 30220 (0.0006) [2023-03-07 04:02:03,648][118044] Updated weights for policy 0, policy_version 30230 (0.0006) [2023-03-07 04:02:04,426][118044] Updated weights for policy 0, policy_version 30240 (0.0006) [2023-03-07 04:02:05,215][118044] Updated weights for policy 0, policy_version 30250 (0.0006) [2023-03-07 04:02:05,995][118044] Updated weights for policy 0, policy_version 30260 (0.0007) [2023-03-07 04:02:06,086][117718] Fps is (10 sec: 13209.4, 60 sec: 13158.4, 300 sec: 13155.8). Total num frames: 30987264. Throughput: 0: 13159.5. Samples: 30979706. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:02:06,086][117718] Avg episode reward: [(0, '2771.075')] [2023-03-07 04:02:06,769][118044] Updated weights for policy 0, policy_version 30270 (0.0006) [2023-03-07 04:02:07,529][118044] Updated weights for policy 0, policy_version 30280 (0.0006) [2023-03-07 04:02:08,313][118044] Updated weights for policy 0, policy_version 30290 (0.0006) [2023-03-07 04:02:09,089][118044] Updated weights for policy 0, policy_version 30300 (0.0006) [2023-03-07 04:02:09,881][118044] Updated weights for policy 0, policy_version 30310 (0.0005) [2023-03-07 04:02:10,657][118044] Updated weights for policy 0, policy_version 30320 (0.0006) [2023-03-07 04:02:11,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 31052800. Throughput: 0: 13153.1. Samples: 31019217. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:02:11,086][117718] Avg episode reward: [(0, '2883.476')] [2023-03-07 04:02:11,442][118044] Updated weights for policy 0, policy_version 30330 (0.0006) [2023-03-07 04:02:12,220][118044] Updated weights for policy 0, policy_version 30340 (0.0006) [2023-03-07 04:02:13,013][118044] Updated weights for policy 0, policy_version 30350 (0.0007) [2023-03-07 04:02:13,791][118044] Updated weights for policy 0, policy_version 30360 (0.0006) [2023-03-07 04:02:14,590][118044] Updated weights for policy 0, policy_version 30370 (0.0008) [2023-03-07 04:02:15,366][118044] Updated weights for policy 0, policy_version 30380 (0.0006) [2023-03-07 04:02:16,085][117718] Fps is (10 sec: 13107.4, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 31118336. Throughput: 0: 13142.5. Samples: 31097646. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:02:16,086][117718] Avg episode reward: [(0, '2907.344')] [2023-03-07 04:02:16,145][118044] Updated weights for policy 0, policy_version 30390 (0.0006) [2023-03-07 04:02:16,914][118044] Updated weights for policy 0, policy_version 30400 (0.0006) [2023-03-07 04:02:17,698][118044] Updated weights for policy 0, policy_version 30410 (0.0006) [2023-03-07 04:02:18,480][118044] Updated weights for policy 0, policy_version 30420 (0.0006) [2023-03-07 04:02:19,244][118044] Updated weights for policy 0, policy_version 30430 (0.0007) [2023-03-07 04:02:20,030][118044] Updated weights for policy 0, policy_version 30440 (0.0006) [2023-03-07 04:02:20,808][118044] Updated weights for policy 0, policy_version 30450 (0.0006) [2023-03-07 04:02:21,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 31183872. Throughput: 0: 13145.8. Samples: 31176558. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:02:21,086][117718] Avg episode reward: [(0, '2863.793')] [2023-03-07 04:02:21,589][118044] Updated weights for policy 0, policy_version 30460 (0.0006) [2023-03-07 04:02:22,367][118044] Updated weights for policy 0, policy_version 30470 (0.0007) [2023-03-07 04:02:23,139][118044] Updated weights for policy 0, policy_version 30480 (0.0006) [2023-03-07 04:02:23,941][118044] Updated weights for policy 0, policy_version 30490 (0.0006) [2023-03-07 04:02:24,710][118044] Updated weights for policy 0, policy_version 30500 (0.0006) [2023-03-07 04:02:25,489][118044] Updated weights for policy 0, policy_version 30510 (0.0006) [2023-03-07 04:02:26,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13152.3). Total num frames: 31249408. Throughput: 0: 13151.7. Samples: 31215920. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:02:26,086][117718] Avg episode reward: [(0, '2740.635')] [2023-03-07 04:02:26,270][118044] Updated weights for policy 0, policy_version 30520 (0.0006) [2023-03-07 04:02:27,049][118044] Updated weights for policy 0, policy_version 30530 (0.0007) [2023-03-07 04:02:27,828][118044] Updated weights for policy 0, policy_version 30540 (0.0007) [2023-03-07 04:02:28,600][118044] Updated weights for policy 0, policy_version 30550 (0.0005) [2023-03-07 04:02:29,373][118044] Updated weights for policy 0, policy_version 30560 (0.0006) [2023-03-07 04:02:30,170][118044] Updated weights for policy 0, policy_version 30570 (0.0005) [2023-03-07 04:02:30,928][118044] Updated weights for policy 0, policy_version 30580 (0.0008) [2023-03-07 04:02:31,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13155.8). Total num frames: 31315968. Throughput: 0: 13151.9. Samples: 31294800. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:02:31,086][117718] Avg episode reward: [(0, '2799.958')] [2023-03-07 04:02:31,729][118044] Updated weights for policy 0, policy_version 30590 (0.0006) [2023-03-07 04:02:32,509][118044] Updated weights for policy 0, policy_version 30600 (0.0006) [2023-03-07 04:02:33,262][118044] Updated weights for policy 0, policy_version 30610 (0.0006) [2023-03-07 04:02:34,050][118044] Updated weights for policy 0, policy_version 30620 (0.0006) [2023-03-07 04:02:34,809][118044] Updated weights for policy 0, policy_version 30630 (0.0006) [2023-03-07 04:02:35,586][118044] Updated weights for policy 0, policy_version 30640 (0.0007) [2023-03-07 04:02:36,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 31381504. Throughput: 0: 13153.6. Samples: 31373878. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:02:36,086][117718] Avg episode reward: [(0, '2796.387')] [2023-03-07 04:02:36,091][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000030646_31381504.pth... 
[2023-03-07 04:02:36,122][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000027565_28226560.pth [2023-03-07 04:02:36,383][118044] Updated weights for policy 0, policy_version 30650 (0.0006) [2023-03-07 04:02:37,149][118044] Updated weights for policy 0, policy_version 30660 (0.0007) [2023-03-07 04:02:37,910][118044] Updated weights for policy 0, policy_version 30670 (0.0006) [2023-03-07 04:02:38,694][118044] Updated weights for policy 0, policy_version 30680 (0.0006) [2023-03-07 04:02:39,466][118044] Updated weights for policy 0, policy_version 30690 (0.0006) [2023-03-07 04:02:40,241][118044] Updated weights for policy 0, policy_version 30700 (0.0006) [2023-03-07 04:02:41,040][118044] Updated weights for policy 0, policy_version 30710 (0.0006) [2023-03-07 04:02:41,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 31447040. Throughput: 0: 13154.3. Samples: 31413589. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:02:41,086][117718] Avg episode reward: [(0, '2763.628')] [2023-03-07 04:02:41,821][118044] Updated weights for policy 0, policy_version 30720 (0.0006) [2023-03-07 04:02:42,593][118044] Updated weights for policy 0, policy_version 30730 (0.0006) [2023-03-07 04:02:43,377][118044] Updated weights for policy 0, policy_version 30740 (0.0007) [2023-03-07 04:02:44,152][118044] Updated weights for policy 0, policy_version 30750 (0.0006) [2023-03-07 04:02:44,952][118044] Updated weights for policy 0, policy_version 30760 (0.0007) [2023-03-07 04:02:45,716][118044] Updated weights for policy 0, policy_version 30770 (0.0007) [2023-03-07 04:02:46,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13152.3). Total num frames: 31512576. Throughput: 0: 13148.1. Samples: 31492262. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:02:46,086][117718] Avg episode reward: [(0, '2937.948')] [2023-03-07 04:02:46,513][118044] Updated weights for policy 0, policy_version 30780 (0.0006) [2023-03-07 04:02:47,293][118044] Updated weights for policy 0, policy_version 30790 (0.0006) [2023-03-07 04:02:48,075][118044] Updated weights for policy 0, policy_version 30800 (0.0006) [2023-03-07 04:02:48,849][118044] Updated weights for policy 0, policy_version 30810 (0.0005) [2023-03-07 04:02:49,642][118044] Updated weights for policy 0, policy_version 30820 (0.0006) [2023-03-07 04:02:50,414][118044] Updated weights for policy 0, policy_version 30830 (0.0006) [2023-03-07 04:02:51,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.4, 300 sec: 13148.9). Total num frames: 31578112. Throughput: 0: 13135.1. Samples: 31570785. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:02:51,086][117718] Avg episode reward: [(0, '2783.110')] [2023-03-07 04:02:51,194][118044] Updated weights for policy 0, policy_version 30840 (0.0006) [2023-03-07 04:02:51,995][118044] Updated weights for policy 0, policy_version 30850 (0.0007) [2023-03-07 04:02:52,771][118044] Updated weights for policy 0, policy_version 30860 (0.0006) [2023-03-07 04:02:53,549][118044] Updated weights for policy 0, policy_version 30870 (0.0007) [2023-03-07 04:02:54,318][118044] Updated weights for policy 0, policy_version 30880 (0.0007) [2023-03-07 04:02:55,107][118044] Updated weights for policy 0, policy_version 30890 (0.0006) [2023-03-07 04:02:55,887][118044] Updated weights for policy 0, policy_version 30900 (0.0006) [2023-03-07 04:02:56,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 31643648. Throughput: 0: 13129.7. Samples: 31610056. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:02:56,086][117718] Avg episode reward: [(0, '2815.132')] [2023-03-07 04:02:56,662][118044] Updated weights for policy 0, policy_version 30910 (0.0005) [2023-03-07 04:02:57,433][118044] Updated weights for policy 0, policy_version 30920 (0.0006) [2023-03-07 04:02:58,223][118044] Updated weights for policy 0, policy_version 30930 (0.0006) [2023-03-07 04:02:59,001][118044] Updated weights for policy 0, policy_version 30940 (0.0006) [2023-03-07 04:02:59,764][118044] Updated weights for policy 0, policy_version 30950 (0.0006) [2023-03-07 04:03:00,539][118044] Updated weights for policy 0, policy_version 30960 (0.0006) [2023-03-07 04:03:01,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 31710208. Throughput: 0: 13144.5. Samples: 31689150. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:03:01,086][117718] Avg episode reward: [(0, '2801.878')] [2023-03-07 04:03:01,303][118044] Updated weights for policy 0, policy_version 30970 (0.0006) [2023-03-07 04:03:02,093][118044] Updated weights for policy 0, policy_version 30980 (0.0006) [2023-03-07 04:03:02,881][118044] Updated weights for policy 0, policy_version 30990 (0.0006) [2023-03-07 04:03:03,661][118044] Updated weights for policy 0, policy_version 31000 (0.0007) [2023-03-07 04:03:04,438][118044] Updated weights for policy 0, policy_version 31010 (0.0006) [2023-03-07 04:03:05,201][118044] Updated weights for policy 0, policy_version 31020 (0.0006) [2023-03-07 04:03:05,970][118044] Updated weights for policy 0, policy_version 31030 (0.0005) [2023-03-07 04:03:06,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13141.4, 300 sec: 13148.9). Total num frames: 31775744. Throughput: 0: 13151.0. Samples: 31768350. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:03:06,086][117718] Avg episode reward: [(0, '2676.649')] [2023-03-07 04:03:06,742][118044] Updated weights for policy 0, policy_version 31040 (0.0006) [2023-03-07 04:03:07,505][118044] Updated weights for policy 0, policy_version 31050 (0.0006) [2023-03-07 04:03:08,285][118044] Updated weights for policy 0, policy_version 31060 (0.0006) [2023-03-07 04:03:09,049][118044] Updated weights for policy 0, policy_version 31070 (0.0005) [2023-03-07 04:03:09,839][118044] Updated weights for policy 0, policy_version 31080 (0.0006) [2023-03-07 04:03:10,619][118044] Updated weights for policy 0, policy_version 31090 (0.0006) [2023-03-07 04:03:11,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 31841280. Throughput: 0: 13161.3. Samples: 31808179. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:03:11,097][117718] Avg episode reward: [(0, '2719.794')] [2023-03-07 04:03:11,400][118044] Updated weights for policy 0, policy_version 31100 (0.0006) [2023-03-07 04:03:12,176][118044] Updated weights for policy 0, policy_version 31110 (0.0006) [2023-03-07 04:03:12,942][118044] Updated weights for policy 0, policy_version 31120 (0.0006) [2023-03-07 04:03:13,736][118044] Updated weights for policy 0, policy_version 31130 (0.0007) [2023-03-07 04:03:14,502][118044] Updated weights for policy 0, policy_version 31140 (0.0005) [2023-03-07 04:03:15,277][118044] Updated weights for policy 0, policy_version 31150 (0.0007) [2023-03-07 04:03:16,076][118044] Updated weights for policy 0, policy_version 31160 (0.0006) [2023-03-07 04:03:16,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 31907840. Throughput: 0: 13164.2. Samples: 31887191. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:03:16,097][117718] Avg episode reward: [(0, '2758.531')] [2023-03-07 04:03:16,855][118044] Updated weights for policy 0, policy_version 31170 (0.0006) [2023-03-07 04:03:17,634][118044] Updated weights for policy 0, policy_version 31180 (0.0007) [2023-03-07 04:03:18,409][118044] Updated weights for policy 0, policy_version 31190 (0.0006) [2023-03-07 04:03:19,188][118044] Updated weights for policy 0, policy_version 31200 (0.0006) [2023-03-07 04:03:19,966][118044] Updated weights for policy 0, policy_version 31210 (0.0006) [2023-03-07 04:03:20,748][118044] Updated weights for policy 0, policy_version 31220 (0.0006) [2023-03-07 04:03:21,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 31973376. Throughput: 0: 13158.1. Samples: 31965992. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:03:21,097][117718] Avg episode reward: [(0, '2721.502')] [2023-03-07 04:03:21,519][118044] Updated weights for policy 0, policy_version 31230 (0.0007) [2023-03-07 04:03:22,301][118044] Updated weights for policy 0, policy_version 31240 (0.0006) [2023-03-07 04:03:23,088][118044] Updated weights for policy 0, policy_version 31250 (0.0006) [2023-03-07 04:03:23,889][118044] Updated weights for policy 0, policy_version 31260 (0.0006) [2023-03-07 04:03:24,664][118044] Updated weights for policy 0, policy_version 31270 (0.0006) [2023-03-07 04:03:25,436][118044] Updated weights for policy 0, policy_version 31280 (0.0006) [2023-03-07 04:03:26,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 32038912. Throughput: 0: 13149.3. Samples: 32005305. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:03:26,086][117718] Avg episode reward: [(0, '2929.365')] [2023-03-07 04:03:26,224][118044] Updated weights for policy 0, policy_version 31290 (0.0006) [2023-03-07 04:03:26,985][118044] Updated weights for policy 0, policy_version 31300 (0.0005) [2023-03-07 04:03:27,769][118044] Updated weights for policy 0, policy_version 31310 (0.0005) [2023-03-07 04:03:28,539][118044] Updated weights for policy 0, policy_version 31320 (0.0006) [2023-03-07 04:03:29,333][118044] Updated weights for policy 0, policy_version 31330 (0.0006) [2023-03-07 04:03:30,108][118044] Updated weights for policy 0, policy_version 31340 (0.0006) [2023-03-07 04:03:30,885][118044] Updated weights for policy 0, policy_version 31350 (0.0006) [2023-03-07 04:03:31,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 32104448. Throughput: 0: 13151.0. Samples: 32084056. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:03:31,086][117718] Avg episode reward: [(0, '2931.415')] [2023-03-07 04:03:31,657][118044] Updated weights for policy 0, policy_version 31360 (0.0006) [2023-03-07 04:03:32,452][118044] Updated weights for policy 0, policy_version 31370 (0.0006) [2023-03-07 04:03:33,222][118044] Updated weights for policy 0, policy_version 31380 (0.0007) [2023-03-07 04:03:33,986][118044] Updated weights for policy 0, policy_version 31390 (0.0006) [2023-03-07 04:03:34,779][118044] Updated weights for policy 0, policy_version 31400 (0.0006) [2023-03-07 04:03:35,565][118044] Updated weights for policy 0, policy_version 31410 (0.0006) [2023-03-07 04:03:36,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 32171008. Throughput: 0: 13162.0. Samples: 32163077. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:03:36,086][117718] Avg episode reward: [(0, '2755.461')] [2023-03-07 04:03:36,316][118044] Updated weights for policy 0, policy_version 31420 (0.0006) [2023-03-07 04:03:37,094][118044] Updated weights for policy 0, policy_version 31430 (0.0006) [2023-03-07 04:03:37,853][118044] Updated weights for policy 0, policy_version 31440 (0.0006) [2023-03-07 04:03:38,661][118044] Updated weights for policy 0, policy_version 31450 (0.0007) [2023-03-07 04:03:39,433][118044] Updated weights for policy 0, policy_version 31460 (0.0007) [2023-03-07 04:03:40,223][118044] Updated weights for policy 0, policy_version 31470 (0.0006) [2023-03-07 04:03:41,007][118044] Updated weights for policy 0, policy_version 31480 (0.0007) [2023-03-07 04:03:41,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 32236544. Throughput: 0: 13169.1. Samples: 32202667. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:03:41,086][117718] Avg episode reward: [(0, '2778.268')] [2023-03-07 04:03:41,779][118044] Updated weights for policy 0, policy_version 31490 (0.0006) [2023-03-07 04:03:42,544][118044] Updated weights for policy 0, policy_version 31500 (0.0006) [2023-03-07 04:03:43,330][118044] Updated weights for policy 0, policy_version 31510 (0.0006) [2023-03-07 04:03:44,128][118044] Updated weights for policy 0, policy_version 31520 (0.0006) [2023-03-07 04:03:44,906][118044] Updated weights for policy 0, policy_version 31530 (0.0006) [2023-03-07 04:03:45,677][118044] Updated weights for policy 0, policy_version 31540 (0.0006) [2023-03-07 04:03:46,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 32302080. Throughput: 0: 13162.0. Samples: 32281442. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:03:46,086][117718] Avg episode reward: [(0, '2690.830')] [2023-03-07 04:03:46,478][118044] Updated weights for policy 0, policy_version 31550 (0.0006) [2023-03-07 04:03:47,248][118044] Updated weights for policy 0, policy_version 31560 (0.0006) [2023-03-07 04:03:48,038][118044] Updated weights for policy 0, policy_version 31570 (0.0006) [2023-03-07 04:03:48,806][118044] Updated weights for policy 0, policy_version 31580 (0.0006) [2023-03-07 04:03:49,574][118044] Updated weights for policy 0, policy_version 31590 (0.0005) [2023-03-07 04:03:50,359][118044] Updated weights for policy 0, policy_version 31600 (0.0006) [2023-03-07 04:03:51,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 32367616. Throughput: 0: 13149.0. Samples: 32360052. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:03:51,086][117718] Avg episode reward: [(0, '2758.516')] [2023-03-07 04:03:51,156][118044] Updated weights for policy 0, policy_version 31610 (0.0006) [2023-03-07 04:03:51,931][118044] Updated weights for policy 0, policy_version 31620 (0.0007) [2023-03-07 04:03:52,705][118044] Updated weights for policy 0, policy_version 31630 (0.0006) [2023-03-07 04:03:53,490][118044] Updated weights for policy 0, policy_version 31640 (0.0006) [2023-03-07 04:03:54,259][118044] Updated weights for policy 0, policy_version 31650 (0.0006) [2023-03-07 04:03:55,014][118044] Updated weights for policy 0, policy_version 31660 (0.0006) [2023-03-07 04:03:55,798][118044] Updated weights for policy 0, policy_version 31670 (0.0006) [2023-03-07 04:03:56,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 32433152. Throughput: 0: 13139.4. Samples: 32399451. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:03:56,086][117718] Avg episode reward: [(0, '2767.629')] [2023-03-07 04:03:56,568][118044] Updated weights for policy 0, policy_version 31680 (0.0006) [2023-03-07 04:03:57,329][118044] Updated weights for policy 0, policy_version 31690 (0.0006) [2023-03-07 04:03:58,098][118044] Updated weights for policy 0, policy_version 31700 (0.0006) [2023-03-07 04:03:58,884][118044] Updated weights for policy 0, policy_version 31710 (0.0006) [2023-03-07 04:03:59,670][118044] Updated weights for policy 0, policy_version 31720 (0.0006) [2023-03-07 04:04:00,456][118044] Updated weights for policy 0, policy_version 31730 (0.0006) [2023-03-07 04:04:01,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 32498688. Throughput: 0: 13147.9. Samples: 32478848. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:04:01,086][117718] Avg episode reward: [(0, '2777.124')] [2023-03-07 04:04:01,242][118044] Updated weights for policy 0, policy_version 31740 (0.0006) [2023-03-07 04:04:02,023][118044] Updated weights for policy 0, policy_version 31750 (0.0005) [2023-03-07 04:04:02,796][118044] Updated weights for policy 0, policy_version 31760 (0.0006) [2023-03-07 04:04:03,570][118044] Updated weights for policy 0, policy_version 31770 (0.0005) [2023-03-07 04:04:04,355][118044] Updated weights for policy 0, policy_version 31780 (0.0005) [2023-03-07 04:04:05,122][118044] Updated weights for policy 0, policy_version 31790 (0.0006) [2023-03-07 04:04:05,893][118044] Updated weights for policy 0, policy_version 31800 (0.0006) [2023-03-07 04:04:06,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 32565248. Throughput: 0: 13151.1. Samples: 32557794. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:04:06,086][117718] Avg episode reward: [(0, '2774.854')] [2023-03-07 04:04:06,674][118044] Updated weights for policy 0, policy_version 31810 (0.0006) [2023-03-07 04:04:07,461][118044] Updated weights for policy 0, policy_version 31820 (0.0006) [2023-03-07 04:04:08,228][118044] Updated weights for policy 0, policy_version 31830 (0.0007) [2023-03-07 04:04:09,003][118044] Updated weights for policy 0, policy_version 31840 (0.0007) [2023-03-07 04:04:09,774][118044] Updated weights for policy 0, policy_version 31850 (0.0006) [2023-03-07 04:04:10,548][118044] Updated weights for policy 0, policy_version 31860 (0.0005) [2023-03-07 04:04:11,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 32630784. Throughput: 0: 13154.9. Samples: 32597276. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:04:11,086][117718] Avg episode reward: [(0, '2998.535')] [2023-03-07 04:04:11,328][118044] Updated weights for policy 0, policy_version 31870 (0.0007) [2023-03-07 04:04:12,107][118044] Updated weights for policy 0, policy_version 31880 (0.0006) [2023-03-07 04:04:12,897][118044] Updated weights for policy 0, policy_version 31890 (0.0006) [2023-03-07 04:04:13,686][118044] Updated weights for policy 0, policy_version 31900 (0.0006) [2023-03-07 04:04:14,476][118044] Updated weights for policy 0, policy_version 31910 (0.0007) [2023-03-07 04:04:15,246][118044] Updated weights for policy 0, policy_version 31920 (0.0006) [2023-03-07 04:04:16,035][118044] Updated weights for policy 0, policy_version 31930 (0.0007) [2023-03-07 04:04:16,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 32696320. Throughput: 0: 13157.4. Samples: 32676140. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:04:16,086][117718] Avg episode reward: [(0, '2926.982')] [2023-03-07 04:04:16,821][118044] Updated weights for policy 0, policy_version 31940 (0.0006) [2023-03-07 04:04:17,603][118044] Updated weights for policy 0, policy_version 31950 (0.0006) [2023-03-07 04:04:18,383][118044] Updated weights for policy 0, policy_version 31960 (0.0006) [2023-03-07 04:04:19,148][118044] Updated weights for policy 0, policy_version 31970 (0.0006) [2023-03-07 04:04:19,932][118044] Updated weights for policy 0, policy_version 31980 (0.0007) [2023-03-07 04:04:20,709][118044] Updated weights for policy 0, policy_version 31990 (0.0006) [2023-03-07 04:04:21,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 32761856. Throughput: 0: 13149.0. Samples: 32754783. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:04:21,086][117718] Avg episode reward: [(0, '2731.438')] [2023-03-07 04:04:21,507][118044] Updated weights for policy 0, policy_version 32000 (0.0005) [2023-03-07 04:04:22,285][118044] Updated weights for policy 0, policy_version 32010 (0.0006) [2023-03-07 04:04:23,058][118044] Updated weights for policy 0, policy_version 32020 (0.0006) [2023-03-07 04:04:23,824][118044] Updated weights for policy 0, policy_version 32030 (0.0006) [2023-03-07 04:04:24,600][118044] Updated weights for policy 0, policy_version 32040 (0.0006) [2023-03-07 04:04:25,373][118044] Updated weights for policy 0, policy_version 32050 (0.0007) [2023-03-07 04:04:26,085][117718] Fps is (10 sec: 13209.9, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 32828416. Throughput: 0: 13146.0. Samples: 32794237. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:04:26,086][117718] Avg episode reward: [(0, '2825.663')] [2023-03-07 04:04:26,141][118044] Updated weights for policy 0, policy_version 32060 (0.0006) [2023-03-07 04:04:26,934][118044] Updated weights for policy 0, policy_version 32070 (0.0006) [2023-03-07 04:04:27,701][118044] Updated weights for policy 0, policy_version 32080 (0.0006) [2023-03-07 04:04:28,475][118044] Updated weights for policy 0, policy_version 32090 (0.0005) [2023-03-07 04:04:29,256][118044] Updated weights for policy 0, policy_version 32100 (0.0005) [2023-03-07 04:04:30,033][118044] Updated weights for policy 0, policy_version 32110 (0.0006) [2023-03-07 04:04:30,813][118044] Updated weights for policy 0, policy_version 32120 (0.0006) [2023-03-07 04:04:31,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 32893952. Throughput: 0: 13157.7. Samples: 32873540. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:04:31,086][117718] Avg episode reward: [(0, '2663.223')] [2023-03-07 04:04:31,596][118044] Updated weights for policy 0, policy_version 32130 (0.0007) [2023-03-07 04:04:32,360][118044] Updated weights for policy 0, policy_version 32140 (0.0006) [2023-03-07 04:04:33,137][118044] Updated weights for policy 0, policy_version 32150 (0.0005) [2023-03-07 04:04:33,918][118044] Updated weights for policy 0, policy_version 32160 (0.0006) [2023-03-07 04:04:34,681][118044] Updated weights for policy 0, policy_version 32170 (0.0005) [2023-03-07 04:04:35,457][118044] Updated weights for policy 0, policy_version 32180 (0.0006) [2023-03-07 04:04:36,086][117718] Fps is (10 sec: 13209.4, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 32960512. Throughput: 0: 13170.5. Samples: 32952729. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:04:36,086][117718] Avg episode reward: [(0, '2769.344')] [2023-03-07 04:04:36,091][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000032188_32960512.pth... [2023-03-07 04:04:36,121][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000029106_29804544.pth [2023-03-07 04:04:36,232][118044] Updated weights for policy 0, policy_version 32190 (0.0006) [2023-03-07 04:04:37,021][118044] Updated weights for policy 0, policy_version 32200 (0.0006) [2023-03-07 04:04:37,800][118044] Updated weights for policy 0, policy_version 32210 (0.0007) [2023-03-07 04:04:38,570][118044] Updated weights for policy 0, policy_version 32220 (0.0006) [2023-03-07 04:04:39,345][118044] Updated weights for policy 0, policy_version 32230 (0.0007) [2023-03-07 04:04:40,126][118044] Updated weights for policy 0, policy_version 32240 (0.0006) [2023-03-07 04:04:40,896][118044] Updated weights for policy 0, policy_version 32250 (0.0006) [2023-03-07 04:04:41,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 33026048. Throughput: 0: 13174.3. Samples: 32992293. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:04:41,086][117718] Avg episode reward: [(0, '2684.118')] [2023-03-07 04:04:41,688][118044] Updated weights for policy 0, policy_version 32260 (0.0006) [2023-03-07 04:04:42,460][118044] Updated weights for policy 0, policy_version 32270 (0.0006) [2023-03-07 04:04:43,232][118044] Updated weights for policy 0, policy_version 32280 (0.0006) [2023-03-07 04:04:44,017][118044] Updated weights for policy 0, policy_version 32290 (0.0006) [2023-03-07 04:04:44,790][118044] Updated weights for policy 0, policy_version 32300 (0.0006) [2023-03-07 04:04:45,558][118044] Updated weights for policy 0, policy_version 32310 (0.0006) [2023-03-07 04:04:46,085][117718] Fps is (10 sec: 13107.5, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 33091584. Throughput: 0: 13164.0. Samples: 33071224. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:04:46,086][117718] Avg episode reward: [(0, '2806.255')] [2023-03-07 04:04:46,343][118044] Updated weights for policy 0, policy_version 32320 (0.0006) [2023-03-07 04:04:47,115][118044] Updated weights for policy 0, policy_version 32330 (0.0006) [2023-03-07 04:04:47,891][118044] Updated weights for policy 0, policy_version 32340 (0.0006) [2023-03-07 04:04:48,675][118044] Updated weights for policy 0, policy_version 32350 (0.0006) [2023-03-07 04:04:49,440][118044] Updated weights for policy 0, policy_version 32360 (0.0007) [2023-03-07 04:04:50,209][118044] Updated weights for policy 0, policy_version 32370 (0.0006) [2023-03-07 04:04:50,970][118044] Updated weights for policy 0, policy_version 32380 (0.0006) [2023-03-07 04:04:51,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13155.8). Total num frames: 33158144. Throughput: 0: 13172.2. Samples: 33150543. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:04:51,086][117718] Avg episode reward: [(0, '2809.202')] [2023-03-07 04:04:51,756][118044] Updated weights for policy 0, policy_version 32390 (0.0006) [2023-03-07 04:04:52,513][118044] Updated weights for policy 0, policy_version 32400 (0.0006) [2023-03-07 04:04:53,290][118044] Updated weights for policy 0, policy_version 32410 (0.0006) [2023-03-07 04:04:54,084][118044] Updated weights for policy 0, policy_version 32420 (0.0006) [2023-03-07 04:04:54,850][118044] Updated weights for policy 0, policy_version 32430 (0.0006) [2023-03-07 04:04:55,623][118044] Updated weights for policy 0, policy_version 32440 (0.0006) [2023-03-07 04:04:56,085][117718] Fps is (10 sec: 13311.9, 60 sec: 13192.5, 300 sec: 13159.3). Total num frames: 33224704. Throughput: 0: 13178.3. Samples: 33190296. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:04:56,086][117718] Avg episode reward: [(0, '2917.172')] [2023-03-07 04:04:56,393][118044] Updated weights for policy 0, policy_version 32450 (0.0006) [2023-03-07 04:04:57,174][118044] Updated weights for policy 0, policy_version 32460 (0.0006) [2023-03-07 04:04:57,967][118044] Updated weights for policy 0, policy_version 32470 (0.0007) [2023-03-07 04:04:58,749][118044] Updated weights for policy 0, policy_version 32480 (0.0006) [2023-03-07 04:04:59,533][118044] Updated weights for policy 0, policy_version 32490 (0.0007) [2023-03-07 04:05:00,309][118044] Updated weights for policy 0, policy_version 32500 (0.0006) [2023-03-07 04:05:01,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13175.5, 300 sec: 13152.3). Total num frames: 33289216. Throughput: 0: 13178.0. Samples: 33269148. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:05:01,086][117718] Avg episode reward: [(0, '2818.826')] [2023-03-07 04:05:01,104][118044] Updated weights for policy 0, policy_version 32510 (0.0007) [2023-03-07 04:05:01,858][118044] Updated weights for policy 0, policy_version 32520 (0.0006) [2023-03-07 04:05:02,636][118044] Updated weights for policy 0, policy_version 32530 (0.0007) [2023-03-07 04:05:03,423][118044] Updated weights for policy 0, policy_version 32540 (0.0006) [2023-03-07 04:05:04,189][118044] Updated weights for policy 0, policy_version 32550 (0.0006) [2023-03-07 04:05:04,967][118044] Updated weights for policy 0, policy_version 32560 (0.0006) [2023-03-07 04:05:05,744][118044] Updated weights for policy 0, policy_version 32570 (0.0006) [2023-03-07 04:05:06,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13175.5, 300 sec: 13159.3). Total num frames: 33355776. Throughput: 0: 13187.0. Samples: 33348201. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:05:06,086][117718] Avg episode reward: [(0, '2858.027')] [2023-03-07 04:05:06,533][118044] Updated weights for policy 0, policy_version 32580 (0.0005) [2023-03-07 04:05:07,314][118044] Updated weights for policy 0, policy_version 32590 (0.0006) [2023-03-07 04:05:08,081][118044] Updated weights for policy 0, policy_version 32600 (0.0006) [2023-03-07 04:05:08,854][118044] Updated weights for policy 0, policy_version 32610 (0.0007) [2023-03-07 04:05:09,630][118044] Updated weights for policy 0, policy_version 32620 (0.0006) [2023-03-07 04:05:10,407][118044] Updated weights for policy 0, policy_version 32630 (0.0006) [2023-03-07 04:05:11,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13175.5, 300 sec: 13155.8). Total num frames: 33421312. Throughput: 0: 13185.9. Samples: 33387602. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:05:11,086][117718] Avg episode reward: [(0, '2866.638')] [2023-03-07 04:05:11,209][118044] Updated weights for policy 0, policy_version 32640 (0.0007) [2023-03-07 04:05:11,982][118044] Updated weights for policy 0, policy_version 32650 (0.0006) [2023-03-07 04:05:12,764][118044] Updated weights for policy 0, policy_version 32660 (0.0007) [2023-03-07 04:05:13,554][118044] Updated weights for policy 0, policy_version 32670 (0.0006) [2023-03-07 04:05:14,328][118044] Updated weights for policy 0, policy_version 32680 (0.0006) [2023-03-07 04:05:15,111][118044] Updated weights for policy 0, policy_version 32690 (0.0007) [2023-03-07 04:05:15,899][118044] Updated weights for policy 0, policy_version 32700 (0.0006) [2023-03-07 04:05:16,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13175.5, 300 sec: 13155.8). Total num frames: 33486848. Throughput: 0: 13174.2. Samples: 33466378. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:05:16,086][117718] Avg episode reward: [(0, '2799.005')] [2023-03-07 04:05:16,674][118044] Updated weights for policy 0, policy_version 32710 (0.0006) [2023-03-07 04:05:17,441][118044] Updated weights for policy 0, policy_version 32720 (0.0006) [2023-03-07 04:05:18,230][118044] Updated weights for policy 0, policy_version 32730 (0.0006) [2023-03-07 04:05:19,004][118044] Updated weights for policy 0, policy_version 32740 (0.0006) [2023-03-07 04:05:19,788][118044] Updated weights for policy 0, policy_version 32750 (0.0006) [2023-03-07 04:05:20,571][118044] Updated weights for policy 0, policy_version 32760 (0.0006) [2023-03-07 04:05:21,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13175.4, 300 sec: 13152.3). Total num frames: 33552384. Throughput: 0: 13164.7. Samples: 33545142. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:05:21,086][117718] Avg episode reward: [(0, '2665.284')] [2023-03-07 04:05:21,354][118044] Updated weights for policy 0, policy_version 32770 (0.0006) [2023-03-07 04:05:22,140][118044] Updated weights for policy 0, policy_version 32780 (0.0006) [2023-03-07 04:05:22,936][118044] Updated weights for policy 0, policy_version 32790 (0.0007) [2023-03-07 04:05:23,710][118044] Updated weights for policy 0, policy_version 32800 (0.0006) [2023-03-07 04:05:24,494][118044] Updated weights for policy 0, policy_version 32810 (0.0005) [2023-03-07 04:05:25,276][118044] Updated weights for policy 0, policy_version 32820 (0.0006) [2023-03-07 04:05:26,058][118044] Updated weights for policy 0, policy_version 32830 (0.0006) [2023-03-07 04:05:26,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 33617920. Throughput: 0: 13155.8. Samples: 33584304. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:05:26,086][117718] Avg episode reward: [(0, '2707.480')] [2023-03-07 04:05:26,861][118044] Updated weights for policy 0, policy_version 32840 (0.0005) [2023-03-07 04:05:27,618][118044] Updated weights for policy 0, policy_version 32850 (0.0005) [2023-03-07 04:05:28,401][118044] Updated weights for policy 0, policy_version 32860 (0.0006) [2023-03-07 04:05:29,186][118044] Updated weights for policy 0, policy_version 32870 (0.0006) [2023-03-07 04:05:29,972][118044] Updated weights for policy 0, policy_version 32880 (0.0007) [2023-03-07 04:05:30,762][118044] Updated weights for policy 0, policy_version 32890 (0.0005) [2023-03-07 04:05:31,086][117718] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 33683456. Throughput: 0: 13151.0. Samples: 33663021. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:05:31,086][117718] Avg episode reward: [(0, '2770.491')] [2023-03-07 04:05:31,540][118044] Updated weights for policy 0, policy_version 32900 (0.0006) [2023-03-07 04:05:32,331][118044] Updated weights for policy 0, policy_version 32910 (0.0006) [2023-03-07 04:05:33,106][118044] Updated weights for policy 0, policy_version 32920 (0.0008) [2023-03-07 04:05:33,873][118044] Updated weights for policy 0, policy_version 32930 (0.0006) [2023-03-07 04:05:34,654][118044] Updated weights for policy 0, policy_version 32940 (0.0006) [2023-03-07 04:05:35,429][118044] Updated weights for policy 0, policy_version 32950 (0.0006) [2023-03-07 04:05:36,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.4, 300 sec: 13152.3). Total num frames: 33748992. Throughput: 0: 13135.4. Samples: 33741637. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:05:36,086][117718] Avg episode reward: [(0, '2811.874')] [2023-03-07 04:05:36,225][118044] Updated weights for policy 0, policy_version 32960 (0.0006) [2023-03-07 04:05:36,997][118044] Updated weights for policy 0, policy_version 32970 (0.0006) [2023-03-07 04:05:37,773][118044] Updated weights for policy 0, policy_version 32980 (0.0006) [2023-03-07 04:05:38,552][118044] Updated weights for policy 0, policy_version 32990 (0.0006) [2023-03-07 04:05:39,324][118044] Updated weights for policy 0, policy_version 33000 (0.0006) [2023-03-07 04:05:40,113][118044] Updated weights for policy 0, policy_version 33010 (0.0008) [2023-03-07 04:05:40,890][118044] Updated weights for policy 0, policy_version 33020 (0.0008) [2023-03-07 04:05:41,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13152.3). Total num frames: 33814528. Throughput: 0: 13125.6. Samples: 33780948. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:05:41,086][117718] Avg episode reward: [(0, '2744.674')] [2023-03-07 04:05:41,669][118044] Updated weights for policy 0, policy_version 33030 (0.0007) [2023-03-07 04:05:42,435][118044] Updated weights for policy 0, policy_version 33040 (0.0006) [2023-03-07 04:05:43,220][118044] Updated weights for policy 0, policy_version 33050 (0.0007) [2023-03-07 04:05:43,993][118044] Updated weights for policy 0, policy_version 33060 (0.0006) [2023-03-07 04:05:44,761][118044] Updated weights for policy 0, policy_version 33070 (0.0006) [2023-03-07 04:05:45,546][118044] Updated weights for policy 0, policy_version 33080 (0.0006) [2023-03-07 04:05:46,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 33881088. Throughput: 0: 13132.1. Samples: 33860095. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:05:46,086][117718] Avg episode reward: [(0, '2769.509')] [2023-03-07 04:05:46,321][118044] Updated weights for policy 0, policy_version 33090 (0.0007) [2023-03-07 04:05:47,092][118044] Updated weights for policy 0, policy_version 33100 (0.0006) [2023-03-07 04:05:47,878][118044] Updated weights for policy 0, policy_version 33110 (0.0006) [2023-03-07 04:05:48,652][118044] Updated weights for policy 0, policy_version 33120 (0.0007) [2023-03-07 04:05:49,437][118044] Updated weights for policy 0, policy_version 33130 (0.0006) [2023-03-07 04:05:50,242][118044] Updated weights for policy 0, policy_version 33140 (0.0006) [2023-03-07 04:05:51,019][118044] Updated weights for policy 0, policy_version 33150 (0.0006) [2023-03-07 04:05:51,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13148.9). Total num frames: 33945600. Throughput: 0: 13124.1. Samples: 33938786. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:05:51,086][117718] Avg episode reward: [(0, '2904.474')] [2023-03-07 04:05:51,797][118044] Updated weights for policy 0, policy_version 33160 (0.0006) [2023-03-07 04:05:52,581][118044] Updated weights for policy 0, policy_version 33170 (0.0007) [2023-03-07 04:05:53,356][118044] Updated weights for policy 0, policy_version 33180 (0.0007) [2023-03-07 04:05:54,125][118044] Updated weights for policy 0, policy_version 33190 (0.0006) [2023-03-07 04:05:54,895][118044] Updated weights for policy 0, policy_version 33200 (0.0006) [2023-03-07 04:05:55,675][118044] Updated weights for policy 0, policy_version 33210 (0.0005) [2023-03-07 04:05:56,085][117718] Fps is (10 sec: 13107.4, 60 sec: 13124.3, 300 sec: 13152.3). Total num frames: 34012160. Throughput: 0: 13123.9. Samples: 33978178. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:05:56,086][117718] Avg episode reward: [(0, '2864.927')] [2023-03-07 04:05:56,443][118044] Updated weights for policy 0, policy_version 33220 (0.0006) [2023-03-07 04:05:57,235][118044] Updated weights for policy 0, policy_version 33230 (0.0006) [2023-03-07 04:05:58,025][118044] Updated weights for policy 0, policy_version 33240 (0.0006) [2023-03-07 04:05:58,778][118044] Updated weights for policy 0, policy_version 33250 (0.0006) [2023-03-07 04:05:59,571][118044] Updated weights for policy 0, policy_version 33260 (0.0005) [2023-03-07 04:06:00,322][118044] Updated weights for policy 0, policy_version 33270 (0.0005) [2023-03-07 04:06:01,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13152.3). Total num frames: 34077696. Throughput: 0: 13130.9. Samples: 34057267. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:06:01,086][117718] Avg episode reward: [(0, '2972.870')] [2023-03-07 04:06:01,105][118044] Updated weights for policy 0, policy_version 33280 (0.0006) [2023-03-07 04:06:01,878][118044] Updated weights for policy 0, policy_version 33290 (0.0006) [2023-03-07 04:06:02,637][118044] Updated weights for policy 0, policy_version 33300 (0.0007) [2023-03-07 04:06:03,417][118044] Updated weights for policy 0, policy_version 33310 (0.0007) [2023-03-07 04:06:04,209][118044] Updated weights for policy 0, policy_version 33320 (0.0006) [2023-03-07 04:06:04,981][118044] Updated weights for policy 0, policy_version 33330 (0.0006) [2023-03-07 04:06:05,756][118044] Updated weights for policy 0, policy_version 33340 (0.0006) [2023-03-07 04:06:06,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.4, 300 sec: 13155.8). Total num frames: 34144256. Throughput: 0: 13142.1. Samples: 34136533. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:06:06,086][117718] Avg episode reward: [(0, '2908.684')] [2023-03-07 04:06:06,519][118044] Updated weights for policy 0, policy_version 33350 (0.0006) [2023-03-07 04:06:07,290][118044] Updated weights for policy 0, policy_version 33360 (0.0005) [2023-03-07 04:06:08,067][118044] Updated weights for policy 0, policy_version 33370 (0.0006) [2023-03-07 04:06:08,863][118044] Updated weights for policy 0, policy_version 33380 (0.0006) [2023-03-07 04:06:09,644][118044] Updated weights for policy 0, policy_version 33390 (0.0006) [2023-03-07 04:06:10,413][118044] Updated weights for policy 0, policy_version 33400 (0.0006) [2023-03-07 04:06:11,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13155.8). Total num frames: 34209792. Throughput: 0: 13155.4. Samples: 34176296. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:06:11,086][117718] Avg episode reward: [(0, '2821.099')] [2023-03-07 04:06:11,207][118044] Updated weights for policy 0, policy_version 33410 (0.0006) [2023-03-07 04:06:11,993][118044] Updated weights for policy 0, policy_version 33420 (0.0006) [2023-03-07 04:06:12,757][118044] Updated weights for policy 0, policy_version 33430 (0.0005) [2023-03-07 04:06:13,521][118044] Updated weights for policy 0, policy_version 33440 (0.0006) [2023-03-07 04:06:14,313][118044] Updated weights for policy 0, policy_version 33450 (0.0006) [2023-03-07 04:06:15,095][118044] Updated weights for policy 0, policy_version 33460 (0.0006) [2023-03-07 04:06:15,867][118044] Updated weights for policy 0, policy_version 33470 (0.0006) [2023-03-07 04:06:16,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13155.8). Total num frames: 34275328. Throughput: 0: 13156.5. Samples: 34255061. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:06:16,086][117718] Avg episode reward: [(0, '2701.545')] [2023-03-07 04:06:16,649][118044] Updated weights for policy 0, policy_version 33480 (0.0008) [2023-03-07 04:06:17,439][118044] Updated weights for policy 0, policy_version 33490 (0.0006) [2023-03-07 04:06:18,201][118044] Updated weights for policy 0, policy_version 33500 (0.0006) [2023-03-07 04:06:18,989][118044] Updated weights for policy 0, policy_version 33510 (0.0006) [2023-03-07 04:06:19,770][118044] Updated weights for policy 0, policy_version 33520 (0.0006) [2023-03-07 04:06:20,555][118044] Updated weights for policy 0, policy_version 33530 (0.0006) [2023-03-07 04:06:21,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13152.3). Total num frames: 34340864. Throughput: 0: 13159.3. Samples: 34333808. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:06:21,086][117718] Avg episode reward: [(0, '2787.983')] [2023-03-07 04:06:21,333][118044] Updated weights for policy 0, policy_version 33540 (0.0007) [2023-03-07 04:06:22,114][118044] Updated weights for policy 0, policy_version 33550 (0.0005) [2023-03-07 04:06:22,904][118044] Updated weights for policy 0, policy_version 33560 (0.0006) [2023-03-07 04:06:23,671][118044] Updated weights for policy 0, policy_version 33570 (0.0006) [2023-03-07 04:06:24,444][118044] Updated weights for policy 0, policy_version 33580 (0.0005) [2023-03-07 04:06:25,210][118044] Updated weights for policy 0, policy_version 33590 (0.0005) [2023-03-07 04:06:25,997][118044] Updated weights for policy 0, policy_version 33600 (0.0006) [2023-03-07 04:06:26,086][117718] Fps is (10 sec: 13209.4, 60 sec: 13158.4, 300 sec: 13155.8). Total num frames: 34407424. Throughput: 0: 13160.8. Samples: 34373187. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:06:26,086][117718] Avg episode reward: [(0, '2785.380')] [2023-03-07 04:06:26,777][118044] Updated weights for policy 0, policy_version 33610 (0.0006) [2023-03-07 04:06:27,554][118044] Updated weights for policy 0, policy_version 33620 (0.0006) [2023-03-07 04:06:28,338][118044] Updated weights for policy 0, policy_version 33630 (0.0007) [2023-03-07 04:06:29,115][118044] Updated weights for policy 0, policy_version 33640 (0.0005) [2023-03-07 04:06:29,896][118044] Updated weights for policy 0, policy_version 33650 (0.0007) [2023-03-07 04:06:30,666][118044] Updated weights for policy 0, policy_version 33660 (0.0005) [2023-03-07 04:06:31,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13155.8). Total num frames: 34472960. Throughput: 0: 13159.0. Samples: 34452248. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:06:31,086][117718] Avg episode reward: [(0, '2718.532')] [2023-03-07 04:06:31,449][118044] Updated weights for policy 0, policy_version 33670 (0.0007) [2023-03-07 04:06:32,224][118044] Updated weights for policy 0, policy_version 33680 (0.0007) [2023-03-07 04:06:32,997][118044] Updated weights for policy 0, policy_version 33690 (0.0007) [2023-03-07 04:06:33,794][118044] Updated weights for policy 0, policy_version 33700 (0.0007) [2023-03-07 04:06:34,590][118044] Updated weights for policy 0, policy_version 33710 (0.0005) [2023-03-07 04:06:35,363][118044] Updated weights for policy 0, policy_version 33720 (0.0007) [2023-03-07 04:06:36,086][117718] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13155.8). Total num frames: 34538496. Throughput: 0: 13158.7. Samples: 34530927. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:06:36,086][117718] Avg episode reward: [(0, '2851.737')] [2023-03-07 04:06:36,091][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000033729_34538496.pth... 
[2023-03-07 04:06:36,121][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000030646_31381504.pth [2023-03-07 04:06:36,141][118044] Updated weights for policy 0, policy_version 33730 (0.0006) [2023-03-07 04:06:36,937][118044] Updated weights for policy 0, policy_version 33740 (0.0007) [2023-03-07 04:06:37,710][118044] Updated weights for policy 0, policy_version 33750 (0.0006) [2023-03-07 04:06:38,491][118044] Updated weights for policy 0, policy_version 33760 (0.0006) [2023-03-07 04:06:39,270][118044] Updated weights for policy 0, policy_version 33770 (0.0005) [2023-03-07 04:06:40,037][118044] Updated weights for policy 0, policy_version 33780 (0.0006) [2023-03-07 04:06:40,830][118044] Updated weights for policy 0, policy_version 33790 (0.0007) [2023-03-07 04:06:41,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 34604032. Throughput: 0: 13156.7. Samples: 34570232. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:06:41,096][117718] Avg episode reward: [(0, '2856.969')] [2023-03-07 04:06:41,602][118044] Updated weights for policy 0, policy_version 33800 (0.0006) [2023-03-07 04:06:42,366][118044] Updated weights for policy 0, policy_version 33810 (0.0006) [2023-03-07 04:06:43,141][118044] Updated weights for policy 0, policy_version 33820 (0.0006) [2023-03-07 04:06:43,918][118044] Updated weights for policy 0, policy_version 33830 (0.0006) [2023-03-07 04:06:44,710][118044] Updated weights for policy 0, policy_version 33840 (0.0006) [2023-03-07 04:06:45,498][118044] Updated weights for policy 0, policy_version 33850 (0.0006) [2023-03-07 04:06:46,086][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13152.3). Total num frames: 34669568. Throughput: 0: 13157.7. Samples: 34649363. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:06:46,097][117718] Avg episode reward: [(0, '2821.742')] [2023-03-07 04:06:46,285][118044] Updated weights for policy 0, policy_version 33860 (0.0007) [2023-03-07 04:06:47,055][118044] Updated weights for policy 0, policy_version 33870 (0.0006) [2023-03-07 04:06:47,833][118044] Updated weights for policy 0, policy_version 33880 (0.0006) [2023-03-07 04:06:48,617][118044] Updated weights for policy 0, policy_version 33890 (0.0006) [2023-03-07 04:06:49,405][118044] Updated weights for policy 0, policy_version 33900 (0.0006) [2023-03-07 04:06:50,166][118044] Updated weights for policy 0, policy_version 33910 (0.0006) [2023-03-07 04:06:50,962][118044] Updated weights for policy 0, policy_version 33920 (0.0006) [2023-03-07 04:06:51,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 34735104. Throughput: 0: 13142.2. Samples: 34727931. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:06:51,096][117718] Avg episode reward: [(0, '2738.797')] [2023-03-07 04:06:51,757][118044] Updated weights for policy 0, policy_version 33930 (0.0006) [2023-03-07 04:06:52,524][118044] Updated weights for policy 0, policy_version 33940 (0.0007) [2023-03-07 04:06:53,309][118044] Updated weights for policy 0, policy_version 33950 (0.0006) [2023-03-07 04:06:54,092][118044] Updated weights for policy 0, policy_version 33960 (0.0006) [2023-03-07 04:06:54,874][118044] Updated weights for policy 0, policy_version 33970 (0.0006) [2023-03-07 04:06:55,643][118044] Updated weights for policy 0, policy_version 33980 (0.0006) [2023-03-07 04:06:56,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13152.3). Total num frames: 34800640. Throughput: 0: 13128.5. Samples: 34767080. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:06:56,097][117718] Avg episode reward: [(0, '2806.525')] [2023-03-07 04:06:56,421][118044] Updated weights for policy 0, policy_version 33990 (0.0006) [2023-03-07 04:06:57,210][118044] Updated weights for policy 0, policy_version 34000 (0.0007) [2023-03-07 04:06:57,989][118044] Updated weights for policy 0, policy_version 34010 (0.0007) [2023-03-07 04:06:58,757][118044] Updated weights for policy 0, policy_version 34020 (0.0006) [2023-03-07 04:06:59,545][118044] Updated weights for policy 0, policy_version 34030 (0.0006) [2023-03-07 04:07:00,320][118044] Updated weights for policy 0, policy_version 34040 (0.0005) [2023-03-07 04:07:01,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 34866176. Throughput: 0: 13131.4. Samples: 34845974. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:07:01,097][117718] Avg episode reward: [(0, '2765.471')] [2023-03-07 04:07:01,106][118044] Updated weights for policy 0, policy_version 34050 (0.0006) [2023-03-07 04:07:01,877][118044] Updated weights for policy 0, policy_version 34060 (0.0006) [2023-03-07 04:07:02,662][118044] Updated weights for policy 0, policy_version 34070 (0.0006) [2023-03-07 04:07:03,430][118044] Updated weights for policy 0, policy_version 34080 (0.0006) [2023-03-07 04:07:04,215][118044] Updated weights for policy 0, policy_version 34090 (0.0007) [2023-03-07 04:07:04,983][118044] Updated weights for policy 0, policy_version 34100 (0.0006) [2023-03-07 04:07:05,772][118044] Updated weights for policy 0, policy_version 34110 (0.0006) [2023-03-07 04:07:06,085][117718] Fps is (10 sec: 13209.8, 60 sec: 13141.3, 300 sec: 13152.3). Total num frames: 34932736. Throughput: 0: 13137.6. Samples: 34925000. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:07:06,097][117718] Avg episode reward: [(0, '2809.483')] [2023-03-07 04:07:06,547][118044] Updated weights for policy 0, policy_version 34120 (0.0006) [2023-03-07 04:07:07,322][118044] Updated weights for policy 0, policy_version 34130 (0.0006) [2023-03-07 04:07:08,107][118044] Updated weights for policy 0, policy_version 34140 (0.0006) [2023-03-07 04:07:08,875][118044] Updated weights for policy 0, policy_version 34150 (0.0006) [2023-03-07 04:07:09,659][118044] Updated weights for policy 0, policy_version 34160 (0.0006) [2023-03-07 04:07:10,456][118044] Updated weights for policy 0, policy_version 34170 (0.0006) [2023-03-07 04:07:11,085][117718] Fps is (10 sec: 13209.8, 60 sec: 13141.3, 300 sec: 13152.3). Total num frames: 34998272. Throughput: 0: 13137.9. Samples: 34964390. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:07:11,097][117718] Avg episode reward: [(0, '2823.390')] [2023-03-07 04:07:11,233][118044] Updated weights for policy 0, policy_version 34180 (0.0007) [2023-03-07 04:07:12,019][118044] Updated weights for policy 0, policy_version 34190 (0.0006) [2023-03-07 04:07:12,777][118044] Updated weights for policy 0, policy_version 34200 (0.0006) [2023-03-07 04:07:13,550][118044] Updated weights for policy 0, policy_version 34210 (0.0005) [2023-03-07 04:07:14,339][118044] Updated weights for policy 0, policy_version 34220 (0.0007) [2023-03-07 04:07:15,122][118044] Updated weights for policy 0, policy_version 34230 (0.0006) [2023-03-07 04:07:15,906][118044] Updated weights for policy 0, policy_version 34240 (0.0007) [2023-03-07 04:07:16,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13152.3). Total num frames: 35063808. Throughput: 0: 13134.4. Samples: 35043297. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:07:16,097][117718] Avg episode reward: [(0, '2976.394')] [2023-03-07 04:07:16,677][118044] Updated weights for policy 0, policy_version 34250 (0.0007) [2023-03-07 04:07:17,466][118044] Updated weights for policy 0, policy_version 34260 (0.0006) [2023-03-07 04:07:18,262][118044] Updated weights for policy 0, policy_version 34270 (0.0006) [2023-03-07 04:07:19,055][118044] Updated weights for policy 0, policy_version 34280 (0.0006) [2023-03-07 04:07:19,820][118044] Updated weights for policy 0, policy_version 34290 (0.0007) [2023-03-07 04:07:20,605][118044] Updated weights for policy 0, policy_version 34300 (0.0006) [2023-03-07 04:07:21,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.4, 300 sec: 13152.3). Total num frames: 35129344. Throughput: 0: 13128.2. Samples: 35121695. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:07:21,086][117718] Avg episode reward: [(0, '2867.489')] [2023-03-07 04:07:21,384][118044] Updated weights for policy 0, policy_version 34310 (0.0006) [2023-03-07 04:07:22,196][118044] Updated weights for policy 0, policy_version 34320 (0.0006) [2023-03-07 04:07:22,985][118044] Updated weights for policy 0, policy_version 34330 (0.0006) [2023-03-07 04:07:23,750][118044] Updated weights for policy 0, policy_version 34340 (0.0006) [2023-03-07 04:07:24,518][118044] Updated weights for policy 0, policy_version 34350 (0.0006) [2023-03-07 04:07:25,312][118044] Updated weights for policy 0, policy_version 34360 (0.0006) [2023-03-07 04:07:26,086][117718] Fps is (10 sec: 13004.7, 60 sec: 13107.2, 300 sec: 13145.4). Total num frames: 35193856. Throughput: 0: 13123.0. Samples: 35160768. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:07:26,086][117718] Avg episode reward: [(0, '3023.344')] [2023-03-07 04:07:26,094][118044] Updated weights for policy 0, policy_version 34370 (0.0005) [2023-03-07 04:07:26,878][118044] Updated weights for policy 0, policy_version 34380 (0.0005) [2023-03-07 04:07:27,653][118044] Updated weights for policy 0, policy_version 34390 (0.0006) [2023-03-07 04:07:28,438][118044] Updated weights for policy 0, policy_version 34400 (0.0006) [2023-03-07 04:07:29,236][118044] Updated weights for policy 0, policy_version 34410 (0.0006) [2023-03-07 04:07:30,015][118044] Updated weights for policy 0, policy_version 34420 (0.0006) [2023-03-07 04:07:30,786][118044] Updated weights for policy 0, policy_version 34430 (0.0007) [2023-03-07 04:07:31,086][117718] Fps is (10 sec: 13004.7, 60 sec: 13107.2, 300 sec: 13145.4). Total num frames: 35259392. Throughput: 0: 13109.4. Samples: 35239284. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 04:07:31,086][117718] Avg episode reward: [(0, '3052.593')] [2023-03-07 04:07:31,569][118044] Updated weights for policy 0, policy_version 34440 (0.0006) [2023-03-07 04:07:32,327][118044] Updated weights for policy 0, policy_version 34450 (0.0006) [2023-03-07 04:07:33,110][118044] Updated weights for policy 0, policy_version 34460 (0.0006) [2023-03-07 04:07:33,893][118044] Updated weights for policy 0, policy_version 34470 (0.0006) [2023-03-07 04:07:34,682][118044] Updated weights for policy 0, policy_version 34480 (0.0006) [2023-03-07 04:07:35,455][118044] Updated weights for policy 0, policy_version 34490 (0.0005) [2023-03-07 04:07:36,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13124.3, 300 sec: 13148.8). Total num frames: 35325952. Throughput: 0: 13116.7. Samples: 35318183. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 04:07:36,086][117718] Avg episode reward: [(0, '2824.446')] [2023-03-07 04:07:36,233][118044] Updated weights for policy 0, policy_version 34500 (0.0007) [2023-03-07 04:07:37,005][118044] Updated weights for policy 0, policy_version 34510 (0.0006) [2023-03-07 04:07:37,802][118044] Updated weights for policy 0, policy_version 34520 (0.0007) [2023-03-07 04:07:38,569][118044] Updated weights for policy 0, policy_version 34530 (0.0005) [2023-03-07 04:07:39,349][118044] Updated weights for policy 0, policy_version 34540 (0.0005) [2023-03-07 04:07:40,140][118044] Updated weights for policy 0, policy_version 34550 (0.0006) [2023-03-07 04:07:40,918][118044] Updated weights for policy 0, policy_version 34560 (0.0006) [2023-03-07 04:07:41,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13124.3, 300 sec: 13148.9). Total num frames: 35391488. Throughput: 0: 13120.2. Samples: 35357488. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 04:07:41,086][117718] Avg episode reward: [(0, '2986.602')] [2023-03-07 04:07:41,682][118044] Updated weights for policy 0, policy_version 34570 (0.0006) [2023-03-07 04:07:42,488][118044] Updated weights for policy 0, policy_version 34580 (0.0006) [2023-03-07 04:07:43,257][118044] Updated weights for policy 0, policy_version 34590 (0.0006) [2023-03-07 04:07:44,009][118044] Updated weights for policy 0, policy_version 34600 (0.0006) [2023-03-07 04:07:44,806][118044] Updated weights for policy 0, policy_version 34610 (0.0007) [2023-03-07 04:07:45,573][118044] Updated weights for policy 0, policy_version 34620 (0.0006) [2023-03-07 04:07:46,086][117718] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13148.9). Total num frames: 35457024. Throughput: 0: 13121.1. Samples: 35436422. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 04:07:46,086][117718] Avg episode reward: [(0, '2907.628')] [2023-03-07 04:07:46,345][118044] Updated weights for policy 0, policy_version 34630 (0.0006) [2023-03-07 04:07:47,145][118044] Updated weights for policy 0, policy_version 34640 (0.0006) [2023-03-07 04:07:47,933][118044] Updated weights for policy 0, policy_version 34650 (0.0006) [2023-03-07 04:07:48,732][118044] Updated weights for policy 0, policy_version 34660 (0.0008) [2023-03-07 04:07:49,499][118044] Updated weights for policy 0, policy_version 34670 (0.0008) [2023-03-07 04:07:50,277][118044] Updated weights for policy 0, policy_version 34680 (0.0006) [2023-03-07 04:07:51,048][118044] Updated weights for policy 0, policy_version 34690 (0.0005) [2023-03-07 04:07:51,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13148.9). Total num frames: 35522560. Throughput: 0: 13110.9. Samples: 35514990. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 04:07:51,086][117718] Avg episode reward: [(0, '3071.814')] [2023-03-07 04:07:51,839][118044] Updated weights for policy 0, policy_version 34700 (0.0006) [2023-03-07 04:07:52,627][118044] Updated weights for policy 0, policy_version 34710 (0.0006) [2023-03-07 04:07:53,415][118044] Updated weights for policy 0, policy_version 34720 (0.0007) [2023-03-07 04:07:54,194][118044] Updated weights for policy 0, policy_version 34730 (0.0006) [2023-03-07 04:07:54,957][118044] Updated weights for policy 0, policy_version 34740 (0.0006) [2023-03-07 04:07:55,750][118044] Updated weights for policy 0, policy_version 34750 (0.0006) [2023-03-07 04:07:56,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13145.4). Total num frames: 35588096. Throughput: 0: 13107.4. Samples: 35554223. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:07:56,086][117718] Avg episode reward: [(0, '2924.629')] [2023-03-07 04:07:56,502][118044] Updated weights for policy 0, policy_version 34760 (0.0006) [2023-03-07 04:07:57,279][118044] Updated weights for policy 0, policy_version 34770 (0.0006) [2023-03-07 04:07:58,103][118044] Updated weights for policy 0, policy_version 34780 (0.0006) [2023-03-07 04:07:58,873][118044] Updated weights for policy 0, policy_version 34790 (0.0008) [2023-03-07 04:07:59,660][118044] Updated weights for policy 0, policy_version 34800 (0.0007) [2023-03-07 04:08:00,439][118044] Updated weights for policy 0, policy_version 34810 (0.0006) [2023-03-07 04:08:01,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13145.4). Total num frames: 35653632. Throughput: 0: 13103.4. Samples: 35632950. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:08:01,087][117718] Avg episode reward: [(0, '2920.003')] [2023-03-07 04:08:01,223][118044] Updated weights for policy 0, policy_version 34820 (0.0006) [2023-03-07 04:08:01,998][118044] Updated weights for policy 0, policy_version 34830 (0.0006) [2023-03-07 04:08:02,777][118044] Updated weights for policy 0, policy_version 34840 (0.0006) [2023-03-07 04:08:03,574][118044] Updated weights for policy 0, policy_version 34850 (0.0006) [2023-03-07 04:08:04,348][118044] Updated weights for policy 0, policy_version 34860 (0.0006) [2023-03-07 04:08:05,116][118044] Updated weights for policy 0, policy_version 34870 (0.0006) [2023-03-07 04:08:05,917][118044] Updated weights for policy 0, policy_version 34880 (0.0007) [2023-03-07 04:08:06,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13145.4). Total num frames: 35719168. Throughput: 0: 13111.4. Samples: 35711706. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:08:06,086][117718] Avg episode reward: [(0, '2952.283')] [2023-03-07 04:08:06,685][118044] Updated weights for policy 0, policy_version 34890 (0.0006) [2023-03-07 04:08:07,476][118044] Updated weights for policy 0, policy_version 34900 (0.0006) [2023-03-07 04:08:08,237][118044] Updated weights for policy 0, policy_version 34910 (0.0006) [2023-03-07 04:08:09,035][118044] Updated weights for policy 0, policy_version 34920 (0.0006) [2023-03-07 04:08:09,812][118044] Updated weights for policy 0, policy_version 34930 (0.0006) [2023-03-07 04:08:10,600][118044] Updated weights for policy 0, policy_version 34940 (0.0006) [2023-03-07 04:08:11,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13141.9). Total num frames: 35784704. Throughput: 0: 13117.5. Samples: 35751053. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:08:11,086][117718] Avg episode reward: [(0, '3117.771')] [2023-03-07 04:08:11,361][118044] Updated weights for policy 0, policy_version 34950 (0.0007) [2023-03-07 04:08:12,164][118044] Updated weights for policy 0, policy_version 34960 (0.0006) [2023-03-07 04:08:12,939][118044] Updated weights for policy 0, policy_version 34970 (0.0006) [2023-03-07 04:08:13,715][118044] Updated weights for policy 0, policy_version 34980 (0.0007) [2023-03-07 04:08:14,495][118044] Updated weights for policy 0, policy_version 34990 (0.0006) [2023-03-07 04:08:15,282][118044] Updated weights for policy 0, policy_version 35000 (0.0007) [2023-03-07 04:08:16,068][118044] Updated weights for policy 0, policy_version 35010 (0.0006) [2023-03-07 04:08:16,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13141.9). Total num frames: 35850240. Throughput: 0: 13122.0. Samples: 35829772. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:08:16,086][117718] Avg episode reward: [(0, '3103.226')] [2023-03-07 04:08:16,833][118044] Updated weights for policy 0, policy_version 35020 (0.0006) [2023-03-07 04:08:17,620][118044] Updated weights for policy 0, policy_version 35030 (0.0007) [2023-03-07 04:08:18,396][118044] Updated weights for policy 0, policy_version 35040 (0.0006) [2023-03-07 04:08:19,161][118044] Updated weights for policy 0, policy_version 35050 (0.0006) [2023-03-07 04:08:19,931][118044] Updated weights for policy 0, policy_version 35060 (0.0007) [2023-03-07 04:08:20,722][118044] Updated weights for policy 0, policy_version 35070 (0.0006) [2023-03-07 04:08:21,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13141.9). Total num frames: 35915776. Throughput: 0: 13122.4. Samples: 35908688. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:08:21,086][117718] Avg episode reward: [(0, '2980.775')] [2023-03-07 04:08:21,495][118044] Updated weights for policy 0, policy_version 35080 (0.0006) [2023-03-07 04:08:22,271][118044] Updated weights for policy 0, policy_version 35090 (0.0006) [2023-03-07 04:08:23,038][118044] Updated weights for policy 0, policy_version 35100 (0.0005) [2023-03-07 04:08:23,831][118044] Updated weights for policy 0, policy_version 35110 (0.0006) [2023-03-07 04:08:24,611][118044] Updated weights for policy 0, policy_version 35120 (0.0005) [2023-03-07 04:08:25,391][118044] Updated weights for policy 0, policy_version 35130 (0.0006) [2023-03-07 04:08:26,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13141.9). Total num frames: 35981312. Throughput: 0: 13126.6. Samples: 35948184. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:08:26,086][117718] Avg episode reward: [(0, '2950.406')] [2023-03-07 04:08:26,174][118044] Updated weights for policy 0, policy_version 35140 (0.0006) [2023-03-07 04:08:26,950][118044] Updated weights for policy 0, policy_version 35150 (0.0006) [2023-03-07 04:08:27,725][118044] Updated weights for policy 0, policy_version 35160 (0.0006) [2023-03-07 04:08:28,502][118044] Updated weights for policy 0, policy_version 35170 (0.0006) [2023-03-07 04:08:29,293][118044] Updated weights for policy 0, policy_version 35180 (0.0005) [2023-03-07 04:08:30,080][118044] Updated weights for policy 0, policy_version 35190 (0.0006) [2023-03-07 04:08:30,849][118044] Updated weights for policy 0, policy_version 35200 (0.0007) [2023-03-07 04:08:31,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.4, 300 sec: 13141.9). Total num frames: 36047872. Throughput: 0: 13119.2. Samples: 36026783. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:08:31,086][117718] Avg episode reward: [(0, '2881.989')] [2023-03-07 04:08:31,626][118044] Updated weights for policy 0, policy_version 35210 (0.0006) [2023-03-07 04:08:32,406][118044] Updated weights for policy 0, policy_version 35220 (0.0006) [2023-03-07 04:08:33,178][118044] Updated weights for policy 0, policy_version 35230 (0.0006) [2023-03-07 04:08:33,948][118044] Updated weights for policy 0, policy_version 35240 (0.0006) [2023-03-07 04:08:34,749][118044] Updated weights for policy 0, policy_version 35250 (0.0006) [2023-03-07 04:08:35,515][118044] Updated weights for policy 0, policy_version 35260 (0.0006) [2023-03-07 04:08:36,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13124.3, 300 sec: 13141.9). Total num frames: 36113408. Throughput: 0: 13129.8. Samples: 36105832. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:08:36,086][117718] Avg episode reward: [(0, '3057.694')] [2023-03-07 04:08:36,091][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000035267_36113408.pth... [2023-03-07 04:08:36,121][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000032188_32960512.pth [2023-03-07 04:08:36,290][118044] Updated weights for policy 0, policy_version 35270 (0.0006) [2023-03-07 04:08:37,059][118044] Updated weights for policy 0, policy_version 35280 (0.0006) [2023-03-07 04:08:37,834][118044] Updated weights for policy 0, policy_version 35290 (0.0007) [2023-03-07 04:08:38,619][118044] Updated weights for policy 0, policy_version 35300 (0.0006) [2023-03-07 04:08:39,410][118044] Updated weights for policy 0, policy_version 35310 (0.0006) [2023-03-07 04:08:40,186][118044] Updated weights for policy 0, policy_version 35320 (0.0006) [2023-03-07 04:08:40,960][118044] Updated weights for policy 0, policy_version 35330 (0.0007) [2023-03-07 04:08:41,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13124.3, 300 sec: 13141.9). Total num frames: 36178944. Throughput: 0: 13138.6. Samples: 36145462. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:08:41,086][117718] Avg episode reward: [(0, '3030.799')] [2023-03-07 04:08:41,742][118044] Updated weights for policy 0, policy_version 35340 (0.0006) [2023-03-07 04:08:42,523][118044] Updated weights for policy 0, policy_version 35350 (0.0006) [2023-03-07 04:08:43,311][118044] Updated weights for policy 0, policy_version 35360 (0.0006) [2023-03-07 04:08:44,076][118044] Updated weights for policy 0, policy_version 35370 (0.0006) [2023-03-07 04:08:44,856][118044] Updated weights for policy 0, policy_version 35380 (0.0006) [2023-03-07 04:08:45,647][118044] Updated weights for policy 0, policy_version 35390 (0.0005) [2023-03-07 04:08:46,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13141.9). Total num frames: 36244480. Throughput: 0: 13137.1. Samples: 36224120. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:08:46,086][117718] Avg episode reward: [(0, '2935.607')] [2023-03-07 04:08:46,430][118044] Updated weights for policy 0, policy_version 35400 (0.0006) [2023-03-07 04:08:47,196][118044] Updated weights for policy 0, policy_version 35410 (0.0006) [2023-03-07 04:08:47,972][118044] Updated weights for policy 0, policy_version 35420 (0.0005) [2023-03-07 04:08:48,752][118044] Updated weights for policy 0, policy_version 35430 (0.0006) [2023-03-07 04:08:49,529][118044] Updated weights for policy 0, policy_version 35440 (0.0006) [2023-03-07 04:08:50,316][118044] Updated weights for policy 0, policy_version 35450 (0.0007) [2023-03-07 04:08:51,085][117718] Fps is (10 sec: 13107.4, 60 sec: 13124.3, 300 sec: 13141.9). Total num frames: 36310016. Throughput: 0: 13141.1. Samples: 36303056. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:08:51,086][117718] Avg episode reward: [(0, '2949.037')] [2023-03-07 04:08:51,098][118044] Updated weights for policy 0, policy_version 35460 (0.0007) [2023-03-07 04:08:51,862][118044] Updated weights for policy 0, policy_version 35470 (0.0006) [2023-03-07 04:08:52,633][118044] Updated weights for policy 0, policy_version 35480 (0.0006) [2023-03-07 04:08:53,429][118044] Updated weights for policy 0, policy_version 35490 (0.0006) [2023-03-07 04:08:54,221][118044] Updated weights for policy 0, policy_version 35500 (0.0006) [2023-03-07 04:08:54,981][118044] Updated weights for policy 0, policy_version 35510 (0.0006) [2023-03-07 04:08:55,766][118044] Updated weights for policy 0, policy_version 35520 (0.0006) [2023-03-07 04:08:56,086][117718] Fps is (10 sec: 13209.4, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 36376576. Throughput: 0: 13143.7. Samples: 36342520. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:08:56,086][117718] Avg episode reward: [(0, '2971.474')] [2023-03-07 04:08:56,532][118044] Updated weights for policy 0, policy_version 35530 (0.0006) [2023-03-07 04:08:57,315][118044] Updated weights for policy 0, policy_version 35540 (0.0007) [2023-03-07 04:08:58,083][118044] Updated weights for policy 0, policy_version 35550 (0.0005) [2023-03-07 04:08:58,861][118044] Updated weights for policy 0, policy_version 35560 (0.0006) [2023-03-07 04:08:59,644][118044] Updated weights for policy 0, policy_version 35570 (0.0007) [2023-03-07 04:09:00,420][118044] Updated weights for policy 0, policy_version 35580 (0.0006) [2023-03-07 04:09:01,085][117718] Fps is (10 sec: 13209.5, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 36442112. Throughput: 0: 13155.0. Samples: 36421748. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:09:01,086][117718] Avg episode reward: [(0, '3088.757')] [2023-03-07 04:09:01,204][118044] Updated weights for policy 0, policy_version 35590 (0.0005) [2023-03-07 04:09:01,974][118044] Updated weights for policy 0, policy_version 35600 (0.0008) [2023-03-07 04:09:02,744][118044] Updated weights for policy 0, policy_version 35610 (0.0006) [2023-03-07 04:09:03,532][118044] Updated weights for policy 0, policy_version 35620 (0.0006) [2023-03-07 04:09:04,307][118044] Updated weights for policy 0, policy_version 35630 (0.0005) [2023-03-07 04:09:05,083][118044] Updated weights for policy 0, policy_version 35640 (0.0007) [2023-03-07 04:09:05,880][118044] Updated weights for policy 0, policy_version 35650 (0.0006) [2023-03-07 04:09:06,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 36507648. Throughput: 0: 13152.6. Samples: 36500554. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:09:06,086][117718] Avg episode reward: [(0, '3080.170')] [2023-03-07 04:09:06,647][118044] Updated weights for policy 0, policy_version 35660 (0.0006) [2023-03-07 04:09:07,429][118044] Updated weights for policy 0, policy_version 35670 (0.0006) [2023-03-07 04:09:08,206][118044] Updated weights for policy 0, policy_version 35680 (0.0006) [2023-03-07 04:09:08,993][118044] Updated weights for policy 0, policy_version 35690 (0.0006) [2023-03-07 04:09:09,784][118044] Updated weights for policy 0, policy_version 35700 (0.0006) [2023-03-07 04:09:10,554][118044] Updated weights for policy 0, policy_version 35710 (0.0006) [2023-03-07 04:09:11,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 36573184. Throughput: 0: 13149.2. Samples: 36539897. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:09:11,096][117718] Avg episode reward: [(0, '3161.857')] [2023-03-07 04:09:11,349][118044] Updated weights for policy 0, policy_version 35720 (0.0007) [2023-03-07 04:09:12,136][118044] Updated weights for policy 0, policy_version 35730 (0.0007) [2023-03-07 04:09:12,906][118044] Updated weights for policy 0, policy_version 35740 (0.0006) [2023-03-07 04:09:13,693][118044] Updated weights for policy 0, policy_version 35750 (0.0007) [2023-03-07 04:09:14,467][118044] Updated weights for policy 0, policy_version 35760 (0.0006) [2023-03-07 04:09:15,233][118044] Updated weights for policy 0, policy_version 35770 (0.0006) [2023-03-07 04:09:16,012][118044] Updated weights for policy 0, policy_version 35780 (0.0006) [2023-03-07 04:09:16,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 36638720. Throughput: 0: 13148.8. Samples: 36618481. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:09:16,097][117718] Avg episode reward: [(0, '2990.533')] [2023-03-07 04:09:16,773][118044] Updated weights for policy 0, policy_version 35790 (0.0006) [2023-03-07 04:09:17,573][118044] Updated weights for policy 0, policy_version 35800 (0.0006) [2023-03-07 04:09:18,345][118044] Updated weights for policy 0, policy_version 35810 (0.0006) [2023-03-07 04:09:19,103][118044] Updated weights for policy 0, policy_version 35820 (0.0006) [2023-03-07 04:09:19,885][118044] Updated weights for policy 0, policy_version 35830 (0.0005) [2023-03-07 04:09:20,666][118044] Updated weights for policy 0, policy_version 35840 (0.0006) [2023-03-07 04:09:21,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13141.9). Total num frames: 36705280. Throughput: 0: 13152.4. Samples: 36697691. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:09:21,097][117718] Avg episode reward: [(0, '3016.397')] [2023-03-07 04:09:21,433][118044] Updated weights for policy 0, policy_version 35850 (0.0006) [2023-03-07 04:09:22,225][118044] Updated weights for policy 0, policy_version 35860 (0.0006) [2023-03-07 04:09:23,013][118044] Updated weights for policy 0, policy_version 35870 (0.0006) [2023-03-07 04:09:23,772][118044] Updated weights for policy 0, policy_version 35880 (0.0006) [2023-03-07 04:09:24,552][118044] Updated weights for policy 0, policy_version 35890 (0.0006) [2023-03-07 04:09:25,328][118044] Updated weights for policy 0, policy_version 35900 (0.0006) [2023-03-07 04:09:26,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13141.9). Total num frames: 36770816. Throughput: 0: 13152.2. Samples: 36737309. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:09:26,097][117718] Avg episode reward: [(0, '3050.386')] [2023-03-07 04:09:26,101][118044] Updated weights for policy 0, policy_version 35910 (0.0007) [2023-03-07 04:09:26,878][118044] Updated weights for policy 0, policy_version 35920 (0.0006) [2023-03-07 04:09:27,662][118044] Updated weights for policy 0, policy_version 35930 (0.0006) [2023-03-07 04:09:28,433][118044] Updated weights for policy 0, policy_version 35940 (0.0006) [2023-03-07 04:09:29,202][118044] Updated weights for policy 0, policy_version 35950 (0.0006) [2023-03-07 04:09:29,981][118044] Updated weights for policy 0, policy_version 35960 (0.0005) [2023-03-07 04:09:30,753][118044] Updated weights for policy 0, policy_version 35970 (0.0006) [2023-03-07 04:09:31,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13141.9). Total num frames: 36837376. Throughput: 0: 13166.5. Samples: 36816616. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:09:31,097][117718] Avg episode reward: [(0, '3075.834')] [2023-03-07 04:09:31,533][118044] Updated weights for policy 0, policy_version 35980 (0.0006) [2023-03-07 04:09:32,319][118044] Updated weights for policy 0, policy_version 35990 (0.0005) [2023-03-07 04:09:33,097][118044] Updated weights for policy 0, policy_version 36000 (0.0006) [2023-03-07 04:09:33,875][118044] Updated weights for policy 0, policy_version 36010 (0.0007) [2023-03-07 04:09:34,666][118044] Updated weights for policy 0, policy_version 36020 (0.0006) [2023-03-07 04:09:35,442][118044] Updated weights for policy 0, policy_version 36030 (0.0006) [2023-03-07 04:09:36,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13141.9). Total num frames: 36902912. Throughput: 0: 13164.9. Samples: 36895477. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:09:36,097][117718] Avg episode reward: [(0, '3105.745')] [2023-03-07 04:09:36,219][118044] Updated weights for policy 0, policy_version 36040 (0.0006) [2023-03-07 04:09:36,989][118044] Updated weights for policy 0, policy_version 36050 (0.0007) [2023-03-07 04:09:37,788][118044] Updated weights for policy 0, policy_version 36060 (0.0007) [2023-03-07 04:09:38,574][118044] Updated weights for policy 0, policy_version 36070 (0.0006) [2023-03-07 04:09:39,375][118044] Updated weights for policy 0, policy_version 36080 (0.0007) [2023-03-07 04:09:40,155][118044] Updated weights for policy 0, policy_version 36090 (0.0006) [2023-03-07 04:09:40,933][118044] Updated weights for policy 0, policy_version 36100 (0.0007) [2023-03-07 04:09:41,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13141.9). Total num frames: 36968448. Throughput: 0: 13158.2. Samples: 36934639. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:09:41,096][117718] Avg episode reward: [(0, '3021.155')] [2023-03-07 04:09:41,701][118044] Updated weights for policy 0, policy_version 36110 (0.0006) [2023-03-07 04:09:42,494][118044] Updated weights for policy 0, policy_version 36120 (0.0006) [2023-03-07 04:09:43,277][118044] Updated weights for policy 0, policy_version 36130 (0.0006) [2023-03-07 04:09:44,057][118044] Updated weights for policy 0, policy_version 36140 (0.0006) [2023-03-07 04:09:44,846][118044] Updated weights for policy 0, policy_version 36150 (0.0006) [2023-03-07 04:09:45,633][118044] Updated weights for policy 0, policy_version 36160 (0.0006) [2023-03-07 04:09:46,085][117718] Fps is (10 sec: 13004.9, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 37032960. Throughput: 0: 13136.3. Samples: 37012882. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:09:46,096][117718] Avg episode reward: [(0, '3010.866')] [2023-03-07 04:09:46,403][118044] Updated weights for policy 0, policy_version 36170 (0.0006) [2023-03-07 04:09:47,190][118044] Updated weights for policy 0, policy_version 36180 (0.0006) [2023-03-07 04:09:47,970][118044] Updated weights for policy 0, policy_version 36190 (0.0006) [2023-03-07 04:09:48,766][118044] Updated weights for policy 0, policy_version 36200 (0.0006) [2023-03-07 04:09:49,534][118044] Updated weights for policy 0, policy_version 36210 (0.0006) [2023-03-07 04:09:50,325][118044] Updated weights for policy 0, policy_version 36220 (0.0007) [2023-03-07 04:09:51,085][117718] Fps is (10 sec: 13004.8, 60 sec: 13141.3, 300 sec: 13131.5). Total num frames: 37098496. Throughput: 0: 13132.6. Samples: 37091520. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:09:51,097][117718] Avg episode reward: [(0, '2973.035')] [2023-03-07 04:09:51,117][118044] Updated weights for policy 0, policy_version 36230 (0.0006) [2023-03-07 04:09:51,889][118044] Updated weights for policy 0, policy_version 36240 (0.0006) [2023-03-07 04:09:52,659][118044] Updated weights for policy 0, policy_version 36250 (0.0007) [2023-03-07 04:09:53,446][118044] Updated weights for policy 0, policy_version 36260 (0.0006) [2023-03-07 04:09:54,230][118044] Updated weights for policy 0, policy_version 36270 (0.0006) [2023-03-07 04:09:55,009][118044] Updated weights for policy 0, policy_version 36280 (0.0007) [2023-03-07 04:09:55,777][118044] Updated weights for policy 0, policy_version 36290 (0.0006) [2023-03-07 04:09:56,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13135.0). Total num frames: 37164032. Throughput: 0: 13131.8. Samples: 37130828. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:09:56,096][117718] Avg episode reward: [(0, '2889.682')] [2023-03-07 04:09:56,573][118044] Updated weights for policy 0, policy_version 36300 (0.0007) [2023-03-07 04:09:57,347][118044] Updated weights for policy 0, policy_version 36310 (0.0006) [2023-03-07 04:09:58,120][118044] Updated weights for policy 0, policy_version 36320 (0.0006) [2023-03-07 04:09:58,914][118044] Updated weights for policy 0, policy_version 36330 (0.0006) [2023-03-07 04:09:59,686][118044] Updated weights for policy 0, policy_version 36340 (0.0006) [2023-03-07 04:10:00,459][118044] Updated weights for policy 0, policy_version 36350 (0.0006) [2023-03-07 04:10:01,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13124.2, 300 sec: 13131.5). Total num frames: 37229568. Throughput: 0: 13134.1. Samples: 37209518. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:10:01,097][117718] Avg episode reward: [(0, '2909.118')] [2023-03-07 04:10:01,246][118044] Updated weights for policy 0, policy_version 36360 (0.0006) [2023-03-07 04:10:02,014][118044] Updated weights for policy 0, policy_version 36370 (0.0006) [2023-03-07 04:10:02,799][118044] Updated weights for policy 0, policy_version 36380 (0.0007) [2023-03-07 04:10:03,569][118044] Updated weights for policy 0, policy_version 36390 (0.0007) [2023-03-07 04:10:04,368][118044] Updated weights for policy 0, policy_version 36400 (0.0006) [2023-03-07 04:10:05,147][118044] Updated weights for policy 0, policy_version 36410 (0.0005) [2023-03-07 04:10:05,911][118044] Updated weights for policy 0, policy_version 36420 (0.0006) [2023-03-07 04:10:06,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 37296128. Throughput: 0: 13125.8. Samples: 37288350. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:10:06,096][117718] Avg episode reward: [(0, '2956.981')] [2023-03-07 04:10:06,701][118044] Updated weights for policy 0, policy_version 36430 (0.0006) [2023-03-07 04:10:07,470][118044] Updated weights for policy 0, policy_version 36440 (0.0006) [2023-03-07 04:10:08,261][118044] Updated weights for policy 0, policy_version 36450 (0.0007) [2023-03-07 04:10:09,042][118044] Updated weights for policy 0, policy_version 36460 (0.0006) [2023-03-07 04:10:09,819][118044] Updated weights for policy 0, policy_version 36470 (0.0006) [2023-03-07 04:10:10,606][118044] Updated weights for policy 0, policy_version 36480 (0.0006) [2023-03-07 04:10:11,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 37361664. Throughput: 0: 13122.2. Samples: 37327810. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:10:11,097][117718] Avg episode reward: [(0, '3078.072')] [2023-03-07 04:10:11,402][118044] Updated weights for policy 0, policy_version 36490 (0.0006) [2023-03-07 04:10:12,167][118044] Updated weights for policy 0, policy_version 36500 (0.0007) [2023-03-07 04:10:12,954][118044] Updated weights for policy 0, policy_version 36510 (0.0006) [2023-03-07 04:10:13,729][118044] Updated weights for policy 0, policy_version 36520 (0.0006) [2023-03-07 04:10:14,509][118044] Updated weights for policy 0, policy_version 36530 (0.0006) [2023-03-07 04:10:15,279][118044] Updated weights for policy 0, policy_version 36540 (0.0006) [2023-03-07 04:10:16,081][118044] Updated weights for policy 0, policy_version 36550 (0.0006) [2023-03-07 04:10:16,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 37427200. Throughput: 0: 13108.2. Samples: 37406483. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:10:16,096][117718] Avg episode reward: [(0, '2970.246')] [2023-03-07 04:10:16,847][118044] Updated weights for policy 0, policy_version 36560 (0.0006) [2023-03-07 04:10:17,607][118044] Updated weights for policy 0, policy_version 36570 (0.0007) [2023-03-07 04:10:18,394][118044] Updated weights for policy 0, policy_version 36580 (0.0005) [2023-03-07 04:10:19,166][118044] Updated weights for policy 0, policy_version 36590 (0.0006) [2023-03-07 04:10:19,947][118044] Updated weights for policy 0, policy_version 36600 (0.0006) [2023-03-07 04:10:20,733][118044] Updated weights for policy 0, policy_version 36610 (0.0006) [2023-03-07 04:10:21,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13135.0). Total num frames: 37492736. Throughput: 0: 13107.0. Samples: 37485289. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:10:21,097][117718] Avg episode reward: [(0, '3037.947')] [2023-03-07 04:10:21,516][118044] Updated weights for policy 0, policy_version 36620 (0.0006) [2023-03-07 04:10:22,291][118044] Updated weights for policy 0, policy_version 36630 (0.0005) [2023-03-07 04:10:23,064][118044] Updated weights for policy 0, policy_version 36640 (0.0006) [2023-03-07 04:10:23,855][118044] Updated weights for policy 0, policy_version 36650 (0.0006) [2023-03-07 04:10:24,618][118044] Updated weights for policy 0, policy_version 36660 (0.0006) [2023-03-07 04:10:25,401][118044] Updated weights for policy 0, policy_version 36670 (0.0006) [2023-03-07 04:10:26,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13135.0). Total num frames: 37558272. Throughput: 0: 13115.4. Samples: 37524831. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:10:26,086][117718] Avg episode reward: [(0, '2906.612')] [2023-03-07 04:10:26,194][118044] Updated weights for policy 0, policy_version 36680 (0.0006) [2023-03-07 04:10:26,972][118044] Updated weights for policy 0, policy_version 36690 (0.0006) [2023-03-07 04:10:27,762][118044] Updated weights for policy 0, policy_version 36700 (0.0005) [2023-03-07 04:10:28,540][118044] Updated weights for policy 0, policy_version 36710 (0.0007) [2023-03-07 04:10:29,315][118044] Updated weights for policy 0, policy_version 36720 (0.0006) [2023-03-07 04:10:30,101][118044] Updated weights for policy 0, policy_version 36730 (0.0006) [2023-03-07 04:10:30,873][118044] Updated weights for policy 0, policy_version 36740 (0.0006) [2023-03-07 04:10:31,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13135.0). Total num frames: 37623808. Throughput: 0: 13126.1. Samples: 37603557. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:10:31,086][117718] Avg episode reward: [(0, '2867.703')] [2023-03-07 04:10:31,650][118044] Updated weights for policy 0, policy_version 36750 (0.0006) [2023-03-07 04:10:32,426][118044] Updated weights for policy 0, policy_version 36760 (0.0006) [2023-03-07 04:10:33,194][118044] Updated weights for policy 0, policy_version 36770 (0.0006) [2023-03-07 04:10:33,990][118044] Updated weights for policy 0, policy_version 36780 (0.0006) [2023-03-07 04:10:34,774][118044] Updated weights for policy 0, policy_version 36790 (0.0006) [2023-03-07 04:10:35,555][118044] Updated weights for policy 0, policy_version 36800 (0.0007) [2023-03-07 04:10:36,086][117718] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13135.0). Total num frames: 37689344. Throughput: 0: 13128.5. Samples: 37682307. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:10:36,086][117718] Avg episode reward: [(0, '3009.220')] [2023-03-07 04:10:36,091][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000036806_37689344.pth... 
[2023-03-07 04:10:36,122][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000033729_34538496.pth [2023-03-07 04:10:36,332][118044] Updated weights for policy 0, policy_version 36810 (0.0006) [2023-03-07 04:10:37,105][118044] Updated weights for policy 0, policy_version 36820 (0.0006) [2023-03-07 04:10:37,890][118044] Updated weights for policy 0, policy_version 36830 (0.0006) [2023-03-07 04:10:38,666][118044] Updated weights for policy 0, policy_version 36840 (0.0006) [2023-03-07 04:10:39,448][118044] Updated weights for policy 0, policy_version 36850 (0.0006) [2023-03-07 04:10:40,228][118044] Updated weights for policy 0, policy_version 36860 (0.0007) [2023-03-07 04:10:41,009][118044] Updated weights for policy 0, policy_version 36870 (0.0006) [2023-03-07 04:10:41,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13131.5). Total num frames: 37754880. Throughput: 0: 13132.2. Samples: 37721778. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:10:41,086][117718] Avg episode reward: [(0, '2965.974')] [2023-03-07 04:10:41,804][118044] Updated weights for policy 0, policy_version 36880 (0.0007) [2023-03-07 04:10:42,605][118044] Updated weights for policy 0, policy_version 36890 (0.0006) [2023-03-07 04:10:43,382][118044] Updated weights for policy 0, policy_version 36900 (0.0006) [2023-03-07 04:10:44,154][118044] Updated weights for policy 0, policy_version 36910 (0.0005) [2023-03-07 04:10:44,942][118044] Updated weights for policy 0, policy_version 36920 (0.0006) [2023-03-07 04:10:45,708][118044] Updated weights for policy 0, policy_version 36930 (0.0006) [2023-03-07 04:10:46,086][117718] Fps is (10 sec: 13107.4, 60 sec: 13124.3, 300 sec: 13135.0). Total num frames: 37820416. Throughput: 0: 13125.5. Samples: 37800165. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:10:46,086][117718] Avg episode reward: [(0, '3037.538')] [2023-03-07 04:10:46,489][118044] Updated weights for policy 0, policy_version 36940 (0.0006) [2023-03-07 04:10:47,282][118044] Updated weights for policy 0, policy_version 36950 (0.0006) [2023-03-07 04:10:48,067][118044] Updated weights for policy 0, policy_version 36960 (0.0006) [2023-03-07 04:10:48,869][118044] Updated weights for policy 0, policy_version 36970 (0.0006) [2023-03-07 04:10:49,649][118044] Updated weights for policy 0, policy_version 36980 (0.0006) [2023-03-07 04:10:50,437][118044] Updated weights for policy 0, policy_version 36990 (0.0007) [2023-03-07 04:10:51,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13131.5). Total num frames: 37885952. Throughput: 0: 13116.0. Samples: 37878572. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:10:51,086][117718] Avg episode reward: [(0, '2933.052')] [2023-03-07 04:10:51,209][118044] Updated weights for policy 0, policy_version 37000 (0.0006) [2023-03-07 04:10:51,978][118044] Updated weights for policy 0, policy_version 37010 (0.0007) [2023-03-07 04:10:52,765][118044] Updated weights for policy 0, policy_version 37020 (0.0006) [2023-03-07 04:10:53,546][118044] Updated weights for policy 0, policy_version 37030 (0.0006) [2023-03-07 04:10:54,336][118044] Updated weights for policy 0, policy_version 37040 (0.0006) [2023-03-07 04:10:55,120][118044] Updated weights for policy 0, policy_version 37050 (0.0006) [2023-03-07 04:10:55,899][118044] Updated weights for policy 0, policy_version 37060 (0.0005) [2023-03-07 04:10:56,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13124.2, 300 sec: 13131.5). Total num frames: 37951488. Throughput: 0: 13115.5. Samples: 37918006. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:10:56,086][117718] Avg episode reward: [(0, '2913.726')] [2023-03-07 04:10:56,698][118044] Updated weights for policy 0, policy_version 37070 (0.0007) [2023-03-07 04:10:57,472][118044] Updated weights for policy 0, policy_version 37080 (0.0006) [2023-03-07 04:10:58,266][118044] Updated weights for policy 0, policy_version 37090 (0.0006) [2023-03-07 04:10:59,040][118044] Updated weights for policy 0, policy_version 37100 (0.0008) [2023-03-07 04:10:59,811][118044] Updated weights for policy 0, policy_version 37110 (0.0006) [2023-03-07 04:11:00,593][118044] Updated weights for policy 0, policy_version 37120 (0.0006) [2023-03-07 04:11:01,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13128.0). Total num frames: 38017024. Throughput: 0: 13111.4. Samples: 37996499. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:11:01,086][117718] Avg episode reward: [(0, '2926.056')] [2023-03-07 04:11:01,389][118044] Updated weights for policy 0, policy_version 37130 (0.0005) [2023-03-07 04:11:02,161][118044] Updated weights for policy 0, policy_version 37140 (0.0005) [2023-03-07 04:11:02,943][118044] Updated weights for policy 0, policy_version 37150 (0.0006) [2023-03-07 04:11:03,714][118044] Updated weights for policy 0, policy_version 37160 (0.0006) [2023-03-07 04:11:04,489][118044] Updated weights for policy 0, policy_version 37170 (0.0007) [2023-03-07 04:11:05,276][118044] Updated weights for policy 0, policy_version 37180 (0.0006) [2023-03-07 04:11:06,059][118044] Updated weights for policy 0, policy_version 37190 (0.0007) [2023-03-07 04:11:06,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13128.0). Total num frames: 38082560. Throughput: 0: 13108.1. Samples: 38075154. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:11:06,086][117718] Avg episode reward: [(0, '2918.665')] [2023-03-07 04:11:06,823][118044] Updated weights for policy 0, policy_version 37200 (0.0006) [2023-03-07 04:11:07,613][118044] Updated weights for policy 0, policy_version 37210 (0.0005) [2023-03-07 04:11:08,381][118044] Updated weights for policy 0, policy_version 37220 (0.0006) [2023-03-07 04:11:09,172][118044] Updated weights for policy 0, policy_version 37230 (0.0005) [2023-03-07 04:11:09,946][118044] Updated weights for policy 0, policy_version 37240 (0.0006) [2023-03-07 04:11:10,728][118044] Updated weights for policy 0, policy_version 37250 (0.0006) [2023-03-07 04:11:11,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13128.0). Total num frames: 38148096. Throughput: 0: 13107.1. Samples: 38114651. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:11:11,086][117718] Avg episode reward: [(0, '2988.515')] [2023-03-07 04:11:11,522][118044] Updated weights for policy 0, policy_version 37260 (0.0006) [2023-03-07 04:11:12,296][118044] Updated weights for policy 0, policy_version 37270 (0.0007) [2023-03-07 04:11:13,072][118044] Updated weights for policy 0, policy_version 37280 (0.0006) [2023-03-07 04:11:13,848][118044] Updated weights for policy 0, policy_version 37290 (0.0006) [2023-03-07 04:11:14,618][118044] Updated weights for policy 0, policy_version 37300 (0.0006) [2023-03-07 04:11:15,408][118044] Updated weights for policy 0, policy_version 37310 (0.0006) [2023-03-07 04:11:16,086][117718] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13128.0). Total num frames: 38213632. Throughput: 0: 13108.2. Samples: 38193426. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:11:16,086][117718] Avg episode reward: [(0, '2766.655')] [2023-03-07 04:11:16,186][118044] Updated weights for policy 0, policy_version 37320 (0.0006) [2023-03-07 04:11:16,954][118044] Updated weights for policy 0, policy_version 37330 (0.0005) [2023-03-07 04:11:17,726][118044] Updated weights for policy 0, policy_version 37340 (0.0006) [2023-03-07 04:11:18,500][118044] Updated weights for policy 0, policy_version 37350 (0.0006) [2023-03-07 04:11:19,276][118044] Updated weights for policy 0, policy_version 37360 (0.0006) [2023-03-07 04:11:20,046][118044] Updated weights for policy 0, policy_version 37370 (0.0006) [2023-03-07 04:11:20,826][118044] Updated weights for policy 0, policy_version 37380 (0.0006) [2023-03-07 04:11:21,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13124.3, 300 sec: 13128.0). Total num frames: 38280192. Throughput: 0: 13118.2. Samples: 38272623. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:11:21,086][117718] Avg episode reward: [(0, '2787.811')] [2023-03-07 04:11:21,610][118044] Updated weights for policy 0, policy_version 37390 (0.0006) [2023-03-07 04:11:22,398][118044] Updated weights for policy 0, policy_version 37400 (0.0006) [2023-03-07 04:11:23,166][118044] Updated weights for policy 0, policy_version 37410 (0.0005) [2023-03-07 04:11:23,949][118044] Updated weights for policy 0, policy_version 37420 (0.0006) [2023-03-07 04:11:24,722][118044] Updated weights for policy 0, policy_version 37430 (0.0008) [2023-03-07 04:11:25,501][118044] Updated weights for policy 0, policy_version 37440 (0.0007) [2023-03-07 04:11:26,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13124.3, 300 sec: 13128.0). Total num frames: 38345728. Throughput: 0: 13118.1. Samples: 38312090. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:11:26,086][117718] Avg episode reward: [(0, '2777.282')] [2023-03-07 04:11:26,283][118044] Updated weights for policy 0, policy_version 37450 (0.0006) [2023-03-07 04:11:27,063][118044] Updated weights for policy 0, policy_version 37460 (0.0006) [2023-03-07 04:11:27,831][118044] Updated weights for policy 0, policy_version 37470 (0.0006) [2023-03-07 04:11:28,606][118044] Updated weights for policy 0, policy_version 37480 (0.0006) [2023-03-07 04:11:29,401][118044] Updated weights for policy 0, policy_version 37490 (0.0007) [2023-03-07 04:11:30,190][118044] Updated weights for policy 0, policy_version 37500 (0.0006) [2023-03-07 04:11:30,954][118044] Updated weights for policy 0, policy_version 37510 (0.0006) [2023-03-07 04:11:31,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13124.2, 300 sec: 13128.0). Total num frames: 38411264. Throughput: 0: 13125.8. Samples: 38390829. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:11:31,086][117718] Avg episode reward: [(0, '2739.017')] [2023-03-07 04:11:31,758][118044] Updated weights for policy 0, policy_version 37520 (0.0007) [2023-03-07 04:11:32,530][118044] Updated weights for policy 0, policy_version 37530 (0.0006) [2023-03-07 04:11:33,296][118044] Updated weights for policy 0, policy_version 37540 (0.0006) [2023-03-07 04:11:34,092][118044] Updated weights for policy 0, policy_version 37550 (0.0006) [2023-03-07 04:11:34,881][118044] Updated weights for policy 0, policy_version 37560 (0.0006) [2023-03-07 04:11:35,653][118044] Updated weights for policy 0, policy_version 37570 (0.0006) [2023-03-07 04:11:36,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13128.0). Total num frames: 38476800. Throughput: 0: 13131.4. Samples: 38469485. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:11:36,097][117718] Avg episode reward: [(0, '2800.060')] [2023-03-07 04:11:36,441][118044] Updated weights for policy 0, policy_version 37580 (0.0008) [2023-03-07 04:11:37,217][118044] Updated weights for policy 0, policy_version 37590 (0.0006) [2023-03-07 04:11:38,002][118044] Updated weights for policy 0, policy_version 37600 (0.0006) [2023-03-07 04:11:38,789][118044] Updated weights for policy 0, policy_version 37610 (0.0006) [2023-03-07 04:11:39,565][118044] Updated weights for policy 0, policy_version 37620 (0.0006) [2023-03-07 04:11:40,339][118044] Updated weights for policy 0, policy_version 37630 (0.0006) [2023-03-07 04:11:41,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13128.0). Total num frames: 38542336. Throughput: 0: 13129.5. Samples: 38508834. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:11:41,097][117718] Avg episode reward: [(0, '2887.262')] [2023-03-07 04:11:41,110][118044] Updated weights for policy 0, policy_version 37640 (0.0006) [2023-03-07 04:11:41,889][118044] Updated weights for policy 0, policy_version 37650 (0.0006) [2023-03-07 04:11:42,677][118044] Updated weights for policy 0, policy_version 37660 (0.0006) [2023-03-07 04:11:43,458][118044] Updated weights for policy 0, policy_version 37670 (0.0007) [2023-03-07 04:11:44,241][118044] Updated weights for policy 0, policy_version 37680 (0.0006) [2023-03-07 04:11:45,033][118044] Updated weights for policy 0, policy_version 37690 (0.0006) [2023-03-07 04:11:45,800][118044] Updated weights for policy 0, policy_version 37700 (0.0006) [2023-03-07 04:11:46,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13124.2, 300 sec: 13128.0). Total num frames: 38607872. Throughput: 0: 13135.3. Samples: 38587588. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:11:46,097][117718] Avg episode reward: [(0, '2879.542')] [2023-03-07 04:11:46,584][118044] Updated weights for policy 0, policy_version 37710 (0.0006) [2023-03-07 04:11:47,351][118044] Updated weights for policy 0, policy_version 37720 (0.0006) [2023-03-07 04:11:48,135][118044] Updated weights for policy 0, policy_version 37730 (0.0006) [2023-03-07 04:11:48,922][118044] Updated weights for policy 0, policy_version 37740 (0.0005) [2023-03-07 04:11:49,705][118044] Updated weights for policy 0, policy_version 37750 (0.0007) [2023-03-07 04:11:50,474][118044] Updated weights for policy 0, policy_version 37760 (0.0006) [2023-03-07 04:11:51,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13128.0). Total num frames: 38673408. Throughput: 0: 13139.0. Samples: 38666407. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:11:51,096][117718] Avg episode reward: [(0, '2828.214')] [2023-03-07 04:11:51,265][118044] Updated weights for policy 0, policy_version 37770 (0.0007) [2023-03-07 04:11:52,049][118044] Updated weights for policy 0, policy_version 37780 (0.0006) [2023-03-07 04:11:52,819][118044] Updated weights for policy 0, policy_version 37790 (0.0006) [2023-03-07 04:11:53,594][118044] Updated weights for policy 0, policy_version 37800 (0.0006) [2023-03-07 04:11:54,395][118044] Updated weights for policy 0, policy_version 37810 (0.0006) [2023-03-07 04:11:55,178][118044] Updated weights for policy 0, policy_version 37820 (0.0006) [2023-03-07 04:11:55,943][118044] Updated weights for policy 0, policy_version 37830 (0.0006) [2023-03-07 04:11:56,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13128.0). Total num frames: 38738944. Throughput: 0: 13134.9. Samples: 38705725. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:11:56,097][117718] Avg episode reward: [(0, '2908.432')] [2023-03-07 04:11:56,734][118044] Updated weights for policy 0, policy_version 37840 (0.0006) [2023-03-07 04:11:57,499][118044] Updated weights for policy 0, policy_version 37850 (0.0007) [2023-03-07 04:11:58,276][118044] Updated weights for policy 0, policy_version 37860 (0.0005) [2023-03-07 04:11:59,054][118044] Updated weights for policy 0, policy_version 37870 (0.0006) [2023-03-07 04:11:59,832][118044] Updated weights for policy 0, policy_version 37880 (0.0006) [2023-03-07 04:12:00,615][118044] Updated weights for policy 0, policy_version 37890 (0.0006) [2023-03-07 04:12:01,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.4, 300 sec: 13128.0). Total num frames: 38805504. Throughput: 0: 13139.7. Samples: 38784710. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:12:01,086][117718] Avg episode reward: [(0, '2880.235')] [2023-03-07 04:12:01,383][118044] Updated weights for policy 0, policy_version 37900 (0.0006) [2023-03-07 04:12:02,147][118044] Updated weights for policy 0, policy_version 37910 (0.0006) [2023-03-07 04:12:02,928][118044] Updated weights for policy 0, policy_version 37920 (0.0005) [2023-03-07 04:12:03,710][118044] Updated weights for policy 0, policy_version 37930 (0.0006) [2023-03-07 04:12:04,481][118044] Updated weights for policy 0, policy_version 37940 (0.0006) [2023-03-07 04:12:05,264][118044] Updated weights for policy 0, policy_version 37950 (0.0006) [2023-03-07 04:12:06,038][118044] Updated weights for policy 0, policy_version 37960 (0.0006) [2023-03-07 04:12:06,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13128.0). Total num frames: 38871040. Throughput: 0: 13136.3. Samples: 38863760. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:12:06,086][117718] Avg episode reward: [(0, '2769.724')] [2023-03-07 04:12:06,822][118044] Updated weights for policy 0, policy_version 37970 (0.0006) [2023-03-07 04:12:07,616][118044] Updated weights for policy 0, policy_version 37980 (0.0006) [2023-03-07 04:12:08,409][118044] Updated weights for policy 0, policy_version 37990 (0.0007) [2023-03-07 04:12:09,186][118044] Updated weights for policy 0, policy_version 38000 (0.0007) [2023-03-07 04:12:09,976][118044] Updated weights for policy 0, policy_version 38010 (0.0006) [2023-03-07 04:12:10,755][118044] Updated weights for policy 0, policy_version 38020 (0.0006) [2023-03-07 04:12:11,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13128.0). Total num frames: 38936576. Throughput: 0: 13123.2. Samples: 38902631. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:12:11,086][117718] Avg episode reward: [(0, '2844.176')] [2023-03-07 04:12:11,532][118044] Updated weights for policy 0, policy_version 38030 (0.0006) [2023-03-07 04:12:12,318][118044] Updated weights for policy 0, policy_version 38040 (0.0006) [2023-03-07 04:12:13,099][118044] Updated weights for policy 0, policy_version 38050 (0.0006) [2023-03-07 04:12:13,883][118044] Updated weights for policy 0, policy_version 38060 (0.0007) [2023-03-07 04:12:14,642][118044] Updated weights for policy 0, policy_version 38070 (0.0006) [2023-03-07 04:12:15,413][118044] Updated weights for policy 0, policy_version 38080 (0.0006) [2023-03-07 04:12:16,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13128.0). Total num frames: 39002112. Throughput: 0: 13129.1. Samples: 38981640. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:12:16,086][117718] Avg episode reward: [(0, '2869.662')] [2023-03-07 04:12:16,195][118044] Updated weights for policy 0, policy_version 38090 (0.0006) [2023-03-07 04:12:16,997][118044] Updated weights for policy 0, policy_version 38100 (0.0006) [2023-03-07 04:12:17,778][118044] Updated weights for policy 0, policy_version 38110 (0.0007) [2023-03-07 04:12:18,548][118044] Updated weights for policy 0, policy_version 38120 (0.0006) [2023-03-07 04:12:19,332][118044] Updated weights for policy 0, policy_version 38130 (0.0006) [2023-03-07 04:12:20,109][118044] Updated weights for policy 0, policy_version 38140 (0.0006) [2023-03-07 04:12:20,884][118044] Updated weights for policy 0, policy_version 38150 (0.0007) [2023-03-07 04:12:21,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13124.3, 300 sec: 13131.5). Total num frames: 39067648. Throughput: 0: 13132.8. Samples: 39060460. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:12:21,086][117718] Avg episode reward: [(0, '2815.869')] [2023-03-07 04:12:21,674][118044] Updated weights for policy 0, policy_version 38160 (0.0006) [2023-03-07 04:12:22,453][118044] Updated weights for policy 0, policy_version 38170 (0.0006) [2023-03-07 04:12:23,215][118044] Updated weights for policy 0, policy_version 38180 (0.0006) [2023-03-07 04:12:23,999][118044] Updated weights for policy 0, policy_version 38190 (0.0007) [2023-03-07 04:12:24,776][118044] Updated weights for policy 0, policy_version 38200 (0.0007) [2023-03-07 04:12:25,553][118044] Updated weights for policy 0, policy_version 38210 (0.0006) [2023-03-07 04:12:26,086][117718] Fps is (10 sec: 13107.3, 60 sec: 13124.2, 300 sec: 13131.5). Total num frames: 39133184. Throughput: 0: 13133.6. Samples: 39099848. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:12:26,086][117718] Avg episode reward: [(0, '2941.557')] [2023-03-07 04:12:26,322][118044] Updated weights for policy 0, policy_version 38220 (0.0005) [2023-03-07 04:12:27,099][118044] Updated weights for policy 0, policy_version 38230 (0.0006) [2023-03-07 04:12:27,873][118044] Updated weights for policy 0, policy_version 38240 (0.0007) [2023-03-07 04:12:28,669][118044] Updated weights for policy 0, policy_version 38250 (0.0006) [2023-03-07 04:12:29,419][118044] Updated weights for policy 0, policy_version 38260 (0.0006) [2023-03-07 04:12:30,180][118044] Updated weights for policy 0, policy_version 38270 (0.0006) [2023-03-07 04:12:30,952][118044] Updated weights for policy 0, policy_version 38280 (0.0006) [2023-03-07 04:12:31,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13141.4, 300 sec: 13131.5). Total num frames: 39199744. Throughput: 0: 13143.8. Samples: 39179055. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:12:31,086][117718] Avg episode reward: [(0, '2907.349')] [2023-03-07 04:12:31,746][118044] Updated weights for policy 0, policy_version 38290 (0.0006) [2023-03-07 04:12:32,508][118044] Updated weights for policy 0, policy_version 38300 (0.0007) [2023-03-07 04:12:33,300][118044] Updated weights for policy 0, policy_version 38310 (0.0007) [2023-03-07 04:12:34,082][118044] Updated weights for policy 0, policy_version 38320 (0.0007) [2023-03-07 04:12:34,872][118044] Updated weights for policy 0, policy_version 38330 (0.0006) [2023-03-07 04:12:35,643][118044] Updated weights for policy 0, policy_version 38340 (0.0007) [2023-03-07 04:12:36,085][117718] Fps is (10 sec: 13209.8, 60 sec: 13141.4, 300 sec: 13131.5). Total num frames: 39265280. Throughput: 0: 13145.4. Samples: 39257951. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:12:36,086][117718] Avg episode reward: [(0, '2924.430')] [2023-03-07 04:12:36,089][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000038345_39265280.pth... [2023-03-07 04:12:36,120][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000035267_36113408.pth [2023-03-07 04:12:36,430][118044] Updated weights for policy 0, policy_version 38350 (0.0007) [2023-03-07 04:12:37,209][118044] Updated weights for policy 0, policy_version 38360 (0.0006) [2023-03-07 04:12:37,999][118044] Updated weights for policy 0, policy_version 38370 (0.0006) [2023-03-07 04:12:38,778][118044] Updated weights for policy 0, policy_version 38380 (0.0006) [2023-03-07 04:12:39,570][118044] Updated weights for policy 0, policy_version 38390 (0.0008) [2023-03-07 04:12:40,362][118044] Updated weights for policy 0, policy_version 38400 (0.0006) [2023-03-07 04:12:41,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13131.5). Total num frames: 39330816. Throughput: 0: 13145.3. Samples: 39297260. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:12:41,086][117718] Avg episode reward: [(0, '2920.646')] [2023-03-07 04:12:41,130][118044] Updated weights for policy 0, policy_version 38410 (0.0007) [2023-03-07 04:12:41,911][118044] Updated weights for policy 0, policy_version 38420 (0.0006) [2023-03-07 04:12:42,683][118044] Updated weights for policy 0, policy_version 38430 (0.0006) [2023-03-07 04:12:43,483][118044] Updated weights for policy 0, policy_version 38440 (0.0007) [2023-03-07 04:12:44,270][118044] Updated weights for policy 0, policy_version 38450 (0.0006) [2023-03-07 04:12:45,057][118044] Updated weights for policy 0, policy_version 38460 (0.0006) [2023-03-07 04:12:45,835][118044] Updated weights for policy 0, policy_version 38470 (0.0006) [2023-03-07 04:12:46,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13141.3, 300 sec: 13131.5). Total num frames: 39396352. Throughput: 0: 13133.5. Samples: 39375719. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:12:46,086][117718] Avg episode reward: [(0, '2933.056')] [2023-03-07 04:12:46,606][118044] Updated weights for policy 0, policy_version 38480 (0.0006) [2023-03-07 04:12:47,372][118044] Updated weights for policy 0, policy_version 38490 (0.0006) [2023-03-07 04:12:48,152][118044] Updated weights for policy 0, policy_version 38500 (0.0007) [2023-03-07 04:12:48,939][118044] Updated weights for policy 0, policy_version 38510 (0.0006) [2023-03-07 04:12:49,713][118044] Updated weights for policy 0, policy_version 38520 (0.0006) [2023-03-07 04:12:50,478][118044] Updated weights for policy 0, policy_version 38530 (0.0006) [2023-03-07 04:12:51,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13141.3, 300 sec: 13131.5). Total num frames: 39461888. Throughput: 0: 13135.2. Samples: 39454847. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:12:51,086][117718] Avg episode reward: [(0, '2822.819')] [2023-03-07 04:12:51,268][118044] Updated weights for policy 0, policy_version 38540 (0.0006) [2023-03-07 04:12:52,047][118044] Updated weights for policy 0, policy_version 38550 (0.0006) [2023-03-07 04:12:52,829][118044] Updated weights for policy 0, policy_version 38560 (0.0006) [2023-03-07 04:12:53,614][118044] Updated weights for policy 0, policy_version 38570 (0.0006) [2023-03-07 04:12:54,388][118044] Updated weights for policy 0, policy_version 38580 (0.0006) [2023-03-07 04:12:55,166][118044] Updated weights for policy 0, policy_version 38590 (0.0006) [2023-03-07 04:12:55,956][118044] Updated weights for policy 0, policy_version 38600 (0.0007) [2023-03-07 04:12:56,085][117718] Fps is (10 sec: 13107.4, 60 sec: 13141.4, 300 sec: 13131.5). Total num frames: 39527424. Throughput: 0: 13145.6. Samples: 39494183. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:12:56,086][117718] Avg episode reward: [(0, '2932.150')] [2023-03-07 04:12:56,741][118044] Updated weights for policy 0, policy_version 38610 (0.0006) [2023-03-07 04:12:57,517][118044] Updated weights for policy 0, policy_version 38620 (0.0005) [2023-03-07 04:12:58,303][118044] Updated weights for policy 0, policy_version 38630 (0.0006) [2023-03-07 04:12:59,078][118044] Updated weights for policy 0, policy_version 38640 (0.0006) [2023-03-07 04:12:59,858][118044] Updated weights for policy 0, policy_version 38650 (0.0006) [2023-03-07 04:13:00,639][118044] Updated weights for policy 0, policy_version 38660 (0.0006) [2023-03-07 04:13:01,085][117718] Fps is (10 sec: 13107.4, 60 sec: 13124.3, 300 sec: 13131.5). Total num frames: 39592960. Throughput: 0: 13139.6. Samples: 39572919. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:13:01,086][117718] Avg episode reward: [(0, '2790.904')] [2023-03-07 04:13:01,415][118044] Updated weights for policy 0, policy_version 38670 (0.0006) [2023-03-07 04:13:02,191][118044] Updated weights for policy 0, policy_version 38680 (0.0006) [2023-03-07 04:13:02,974][118044] Updated weights for policy 0, policy_version 38690 (0.0006) [2023-03-07 04:13:03,756][118044] Updated weights for policy 0, policy_version 38700 (0.0006) [2023-03-07 04:13:04,533][118044] Updated weights for policy 0, policy_version 38710 (0.0006) [2023-03-07 04:13:05,309][118044] Updated weights for policy 0, policy_version 38720 (0.0006) [2023-03-07 04:13:06,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13124.3, 300 sec: 13131.5). Total num frames: 39658496. Throughput: 0: 13138.3. Samples: 39651684. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:13:06,086][117718] Avg episode reward: [(0, '2778.939')] [2023-03-07 04:13:06,097][118044] Updated weights for policy 0, policy_version 38730 (0.0005) [2023-03-07 04:13:06,863][118044] Updated weights for policy 0, policy_version 38740 (0.0006) [2023-03-07 04:13:07,649][118044] Updated weights for policy 0, policy_version 38750 (0.0006) [2023-03-07 04:13:08,430][118044] Updated weights for policy 0, policy_version 38760 (0.0007) [2023-03-07 04:13:09,206][118044] Updated weights for policy 0, policy_version 38770 (0.0007) [2023-03-07 04:13:09,985][118044] Updated weights for policy 0, policy_version 38780 (0.0006) [2023-03-07 04:13:10,760][118044] Updated weights for policy 0, policy_version 38790 (0.0007) [2023-03-07 04:13:11,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 39725056. Throughput: 0: 13138.5. Samples: 39691077. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:13:11,086][117718] Avg episode reward: [(0, '2742.664')] [2023-03-07 04:13:11,538][118044] Updated weights for policy 0, policy_version 38800 (0.0006) [2023-03-07 04:13:12,315][118044] Updated weights for policy 0, policy_version 38810 (0.0007) [2023-03-07 04:13:13,084][118044] Updated weights for policy 0, policy_version 38820 (0.0006) [2023-03-07 04:13:13,866][118044] Updated weights for policy 0, policy_version 38830 (0.0006) [2023-03-07 04:13:14,629][118044] Updated weights for policy 0, policy_version 38840 (0.0006) [2023-03-07 04:13:15,424][118044] Updated weights for policy 0, policy_version 38850 (0.0006) [2023-03-07 04:13:16,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.4, 300 sec: 13135.0). Total num frames: 39790592. Throughput: 0: 13135.1. Samples: 39770137. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:13:16,096][117718] Avg episode reward: [(0, '2853.466')] [2023-03-07 04:13:16,193][118044] Updated weights for policy 0, policy_version 38860 (0.0006) [2023-03-07 04:13:16,963][118044] Updated weights for policy 0, policy_version 38870 (0.0005) [2023-03-07 04:13:17,749][118044] Updated weights for policy 0, policy_version 38880 (0.0007) [2023-03-07 04:13:18,529][118044] Updated weights for policy 0, policy_version 38890 (0.0007) [2023-03-07 04:13:19,298][118044] Updated weights for policy 0, policy_version 38900 (0.0006) [2023-03-07 04:13:20,103][118044] Updated weights for policy 0, policy_version 38910 (0.0006) [2023-03-07 04:13:20,881][118044] Updated weights for policy 0, policy_version 38920 (0.0006) [2023-03-07 04:13:21,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 39856128. Throughput: 0: 13133.9. Samples: 39848978. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:13:21,097][117718] Avg episode reward: [(0, '2816.694')] [2023-03-07 04:13:21,641][118044] Updated weights for policy 0, policy_version 38930 (0.0006) [2023-03-07 04:13:22,423][118044] Updated weights for policy 0, policy_version 38940 (0.0006) [2023-03-07 04:13:23,185][118044] Updated weights for policy 0, policy_version 38950 (0.0006) [2023-03-07 04:13:23,954][118044] Updated weights for policy 0, policy_version 38960 (0.0006) [2023-03-07 04:13:24,741][118044] Updated weights for policy 0, policy_version 38970 (0.0006) [2023-03-07 04:13:25,515][118044] Updated weights for policy 0, policy_version 38980 (0.0007) [2023-03-07 04:13:26,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13135.0). Total num frames: 39922688. Throughput: 0: 13144.5. Samples: 39888764. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:13:26,096][117718] Avg episode reward: [(0, '2802.676')] [2023-03-07 04:13:26,291][118044] Updated weights for policy 0, policy_version 38990 (0.0006) [2023-03-07 04:13:27,073][118044] Updated weights for policy 0, policy_version 39000 (0.0006) [2023-03-07 04:13:27,850][118044] Updated weights for policy 0, policy_version 39010 (0.0006) [2023-03-07 04:13:28,632][118044] Updated weights for policy 0, policy_version 39020 (0.0006) [2023-03-07 04:13:29,429][118044] Updated weights for policy 0, policy_version 39030 (0.0006) [2023-03-07 04:13:30,186][118044] Updated weights for policy 0, policy_version 39040 (0.0008) [2023-03-07 04:13:30,973][118044] Updated weights for policy 0, policy_version 39050 (0.0006) [2023-03-07 04:13:31,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 39988224. Throughput: 0: 13155.5. Samples: 39967718. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:13:31,086][117718] Avg episode reward: [(0, '2869.397')] [2023-03-07 04:13:31,745][118044] Updated weights for policy 0, policy_version 39060 (0.0006) [2023-03-07 04:13:32,523][118044] Updated weights for policy 0, policy_version 39070 (0.0006) [2023-03-07 04:13:33,285][118044] Updated weights for policy 0, policy_version 39080 (0.0007) [2023-03-07 04:13:34,066][118044] Updated weights for policy 0, policy_version 39090 (0.0006) [2023-03-07 04:13:34,866][118044] Updated weights for policy 0, policy_version 39100 (0.0006) [2023-03-07 04:13:35,629][118044] Updated weights for policy 0, policy_version 39110 (0.0006) [2023-03-07 04:13:36,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 40053760. Throughput: 0: 13152.2. Samples: 40046692. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:13:36,086][117718] Avg episode reward: [(0, '2782.648')] [2023-03-07 04:13:36,426][118044] Updated weights for policy 0, policy_version 39120 (0.0006) [2023-03-07 04:13:37,201][118044] Updated weights for policy 0, policy_version 39130 (0.0006) [2023-03-07 04:13:37,970][118044] Updated weights for policy 0, policy_version 39140 (0.0006) [2023-03-07 04:13:38,756][118044] Updated weights for policy 0, policy_version 39150 (0.0007) [2023-03-07 04:13:39,552][118044] Updated weights for policy 0, policy_version 39160 (0.0006) [2023-03-07 04:13:40,318][118044] Updated weights for policy 0, policy_version 39170 (0.0006) [2023-03-07 04:13:41,081][118044] Updated weights for policy 0, policy_version 39180 (0.0006) [2023-03-07 04:13:41,086][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13138.4). Total num frames: 40120320. Throughput: 0: 13150.7. Samples: 40085964. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:13:41,086][117718] Avg episode reward: [(0, '2798.935')] [2023-03-07 04:13:41,862][118044] Updated weights for policy 0, policy_version 39190 (0.0006) [2023-03-07 04:13:42,638][118044] Updated weights for policy 0, policy_version 39200 (0.0007) [2023-03-07 04:13:43,419][118044] Updated weights for policy 0, policy_version 39210 (0.0006) [2023-03-07 04:13:44,217][118044] Updated weights for policy 0, policy_version 39220 (0.0006) [2023-03-07 04:13:45,002][118044] Updated weights for policy 0, policy_version 39230 (0.0006) [2023-03-07 04:13:45,788][118044] Updated weights for policy 0, policy_version 39240 (0.0006) [2023-03-07 04:13:46,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 40184832. Throughput: 0: 13155.7. Samples: 40164928. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:13:46,086][117718] Avg episode reward: [(0, '2680.719')] [2023-03-07 04:13:46,563][118044] Updated weights for policy 0, policy_version 39250 (0.0006) [2023-03-07 04:13:47,346][118044] Updated weights for policy 0, policy_version 39260 (0.0006) [2023-03-07 04:13:48,116][118044] Updated weights for policy 0, policy_version 39270 (0.0006) [2023-03-07 04:13:48,902][118044] Updated weights for policy 0, policy_version 39280 (0.0007) [2023-03-07 04:13:49,685][118044] Updated weights for policy 0, policy_version 39290 (0.0006) [2023-03-07 04:13:50,481][118044] Updated weights for policy 0, policy_version 39300 (0.0006) [2023-03-07 04:13:51,086][117718] Fps is (10 sec: 13004.8, 60 sec: 13141.4, 300 sec: 13131.5). Total num frames: 40250368. Throughput: 0: 13150.5. Samples: 40243456. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:13:51,086][117718] Avg episode reward: [(0, '2862.846')] [2023-03-07 04:13:51,253][118044] Updated weights for policy 0, policy_version 39310 (0.0006) [2023-03-07 04:13:52,034][118044] Updated weights for policy 0, policy_version 39320 (0.0005) [2023-03-07 04:13:52,830][118044] Updated weights for policy 0, policy_version 39330 (0.0006) [2023-03-07 04:13:53,596][118044] Updated weights for policy 0, policy_version 39340 (0.0006) [2023-03-07 04:13:54,380][118044] Updated weights for policy 0, policy_version 39350 (0.0006) [2023-03-07 04:13:55,170][118044] Updated weights for policy 0, policy_version 39360 (0.0006) [2023-03-07 04:13:55,964][118044] Updated weights for policy 0, policy_version 39370 (0.0006) [2023-03-07 04:13:56,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13131.5). Total num frames: 40315904. Throughput: 0: 13144.6. Samples: 40282587. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:13:56,086][117718] Avg episode reward: [(0, '2933.927')] [2023-03-07 04:13:56,750][118044] Updated weights for policy 0, policy_version 39380 (0.0006) [2023-03-07 04:13:57,517][118044] Updated weights for policy 0, policy_version 39390 (0.0006) [2023-03-07 04:13:58,302][118044] Updated weights for policy 0, policy_version 39400 (0.0006) [2023-03-07 04:13:59,088][118044] Updated weights for policy 0, policy_version 39410 (0.0006) [2023-03-07 04:13:59,868][118044] Updated weights for policy 0, policy_version 39420 (0.0006) [2023-03-07 04:14:00,623][118044] Updated weights for policy 0, policy_version 39430 (0.0006) [2023-03-07 04:14:01,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13131.5). Total num frames: 40381440. Throughput: 0: 13134.2. Samples: 40361174. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:14:01,086][117718] Avg episode reward: [(0, '2996.405')] [2023-03-07 04:14:01,430][118044] Updated weights for policy 0, policy_version 39440 (0.0005) [2023-03-07 04:14:02,204][118044] Updated weights for policy 0, policy_version 39450 (0.0005) [2023-03-07 04:14:03,003][118044] Updated weights for policy 0, policy_version 39460 (0.0007) [2023-03-07 04:14:03,782][118044] Updated weights for policy 0, policy_version 39470 (0.0006) [2023-03-07 04:14:04,542][118044] Updated weights for policy 0, policy_version 39480 (0.0005) [2023-03-07 04:14:05,313][118044] Updated weights for policy 0, policy_version 39490 (0.0006) [2023-03-07 04:14:06,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.4, 300 sec: 13131.5). Total num frames: 40446976. Throughput: 0: 13135.0. Samples: 40440052. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:14:06,086][117718] Avg episode reward: [(0, '2889.668')] [2023-03-07 04:14:06,094][118044] Updated weights for policy 0, policy_version 39500 (0.0006) [2023-03-07 04:14:06,865][118044] Updated weights for policy 0, policy_version 39510 (0.0006) [2023-03-07 04:14:07,636][118044] Updated weights for policy 0, policy_version 39520 (0.0006) [2023-03-07 04:14:08,401][118044] Updated weights for policy 0, policy_version 39530 (0.0006) [2023-03-07 04:14:09,181][118044] Updated weights for policy 0, policy_version 39540 (0.0006) [2023-03-07 04:14:09,979][118044] Updated weights for policy 0, policy_version 39550 (0.0006) [2023-03-07 04:14:10,756][118044] Updated weights for policy 0, policy_version 39560 (0.0005) [2023-03-07 04:14:11,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 40513536. Throughput: 0: 13135.2. Samples: 40479849. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:14:11,086][117718] Avg episode reward: [(0, '2968.179')] [2023-03-07 04:14:11,521][118044] Updated weights for policy 0, policy_version 39570 (0.0006) [2023-03-07 04:14:12,303][118044] Updated weights for policy 0, policy_version 39580 (0.0006) [2023-03-07 04:14:13,085][118044] Updated weights for policy 0, policy_version 39590 (0.0007) [2023-03-07 04:14:13,868][118044] Updated weights for policy 0, policy_version 39600 (0.0006) [2023-03-07 04:14:14,641][118044] Updated weights for policy 0, policy_version 39610 (0.0007) [2023-03-07 04:14:15,410][118044] Updated weights for policy 0, policy_version 39620 (0.0006) [2023-03-07 04:14:16,086][117718] Fps is (10 sec: 13209.4, 60 sec: 13141.3, 300 sec: 13131.5). Total num frames: 40579072. Throughput: 0: 13130.7. Samples: 40558599. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:14:16,086][117718] Avg episode reward: [(0, '2878.234')] [2023-03-07 04:14:16,186][118044] Updated weights for policy 0, policy_version 39630 (0.0006) [2023-03-07 04:14:16,959][118044] Updated weights for policy 0, policy_version 39640 (0.0006) [2023-03-07 04:14:17,743][118044] Updated weights for policy 0, policy_version 39650 (0.0006) [2023-03-07 04:14:18,529][118044] Updated weights for policy 0, policy_version 39660 (0.0006) [2023-03-07 04:14:19,313][118044] Updated weights for policy 0, policy_version 39670 (0.0007) [2023-03-07 04:14:20,099][118044] Updated weights for policy 0, policy_version 39680 (0.0006) [2023-03-07 04:14:20,899][118044] Updated weights for policy 0, policy_version 39690 (0.0006) [2023-03-07 04:14:21,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.4, 300 sec: 13131.5). Total num frames: 40644608. Throughput: 0: 13125.7. Samples: 40637347. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:14:21,086][117718] Avg episode reward: [(0, '2928.632')] [2023-03-07 04:14:21,659][118044] Updated weights for policy 0, policy_version 39700 (0.0006) [2023-03-07 04:14:22,450][118044] Updated weights for policy 0, policy_version 39710 (0.0006) [2023-03-07 04:14:23,218][118044] Updated weights for policy 0, policy_version 39720 (0.0006) [2023-03-07 04:14:24,002][118044] Updated weights for policy 0, policy_version 39730 (0.0006) [2023-03-07 04:14:24,779][118044] Updated weights for policy 0, policy_version 39740 (0.0006) [2023-03-07 04:14:25,555][118044] Updated weights for policy 0, policy_version 39750 (0.0006) [2023-03-07 04:14:26,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13128.0). Total num frames: 40710144. Throughput: 0: 13125.4. Samples: 40676606. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:14:26,086][117718] Avg episode reward: [(0, '2966.778')] [2023-03-07 04:14:26,328][118044] Updated weights for policy 0, policy_version 39760 (0.0006) [2023-03-07 04:14:27,116][118044] Updated weights for policy 0, policy_version 39770 (0.0006) [2023-03-07 04:14:27,893][118044] Updated weights for policy 0, policy_version 39780 (0.0007) [2023-03-07 04:14:28,661][118044] Updated weights for policy 0, policy_version 39790 (0.0005) [2023-03-07 04:14:29,425][118044] Updated weights for policy 0, policy_version 39800 (0.0006) [2023-03-07 04:14:30,213][118044] Updated weights for policy 0, policy_version 39810 (0.0007) [2023-03-07 04:14:30,986][118044] Updated weights for policy 0, policy_version 39820 (0.0006) [2023-03-07 04:14:31,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13141.4, 300 sec: 13131.5). Total num frames: 40776704. Throughput: 0: 13136.1. Samples: 40756051. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:14:31,086][117718] Avg episode reward: [(0, '2907.120')] [2023-03-07 04:14:31,773][118044] Updated weights for policy 0, policy_version 39830 (0.0006) [2023-03-07 04:14:32,542][118044] Updated weights for policy 0, policy_version 39840 (0.0005) [2023-03-07 04:14:33,324][118044] Updated weights for policy 0, policy_version 39850 (0.0006) [2023-03-07 04:14:34,103][118044] Updated weights for policy 0, policy_version 39860 (0.0005) [2023-03-07 04:14:34,882][118044] Updated weights for policy 0, policy_version 39870 (0.0006) [2023-03-07 04:14:35,665][118044] Updated weights for policy 0, policy_version 39880 (0.0007) [2023-03-07 04:14:36,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13131.5). Total num frames: 40842240. Throughput: 0: 13137.2. Samples: 40834631. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:14:36,086][117718] Avg episode reward: [(0, '2916.582')] [2023-03-07 04:14:36,091][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000039885_40842240.pth... [2023-03-07 04:14:36,122][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000036806_37689344.pth [2023-03-07 04:14:36,458][118044] Updated weights for policy 0, policy_version 39890 (0.0006) [2023-03-07 04:14:37,236][118044] Updated weights for policy 0, policy_version 39900 (0.0006) [2023-03-07 04:14:38,030][118044] Updated weights for policy 0, policy_version 39910 (0.0006) [2023-03-07 04:14:38,804][118044] Updated weights for policy 0, policy_version 39920 (0.0007) [2023-03-07 04:14:39,574][118044] Updated weights for policy 0, policy_version 39930 (0.0006) [2023-03-07 04:14:40,353][118044] Updated weights for policy 0, policy_version 39940 (0.0006) [2023-03-07 04:14:41,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13124.3, 300 sec: 13135.0). Total num frames: 40907776. Throughput: 0: 13142.5. 
Samples: 40874000. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:14:41,086][117718] Avg episode reward: [(0, '3024.017')] [2023-03-07 04:14:41,123][118044] Updated weights for policy 0, policy_version 39950 (0.0006) [2023-03-07 04:14:41,905][118044] Updated weights for policy 0, policy_version 39960 (0.0006) [2023-03-07 04:14:42,684][118044] Updated weights for policy 0, policy_version 39970 (0.0006) [2023-03-07 04:14:43,466][118044] Updated weights for policy 0, policy_version 39980 (0.0006) [2023-03-07 04:14:44,246][118044] Updated weights for policy 0, policy_version 39990 (0.0006) [2023-03-07 04:14:45,008][118044] Updated weights for policy 0, policy_version 40000 (0.0005) [2023-03-07 04:14:45,782][118044] Updated weights for policy 0, policy_version 40010 (0.0006) [2023-03-07 04:14:46,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.4, 300 sec: 13135.0). Total num frames: 40973312. Throughput: 0: 13155.5. Samples: 40953170. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:14:46,086][117718] Avg episode reward: [(0, '3003.325')] [2023-03-07 04:14:46,570][118044] Updated weights for policy 0, policy_version 40020 (0.0006) [2023-03-07 04:14:47,361][118044] Updated weights for policy 0, policy_version 40030 (0.0006) [2023-03-07 04:14:48,140][118044] Updated weights for policy 0, policy_version 40040 (0.0006) [2023-03-07 04:14:48,912][118044] Updated weights for policy 0, policy_version 40050 (0.0006) [2023-03-07 04:14:49,717][118044] Updated weights for policy 0, policy_version 40060 (0.0006) [2023-03-07 04:14:50,477][118044] Updated weights for policy 0, policy_version 40070 (0.0006) [2023-03-07 04:14:51,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 41038848. Throughput: 0: 13147.2. Samples: 41031679. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:14:51,086][117718] Avg episode reward: [(0, '3076.170')] [2023-03-07 04:14:51,269][118044] Updated weights for policy 0, policy_version 40080 (0.0006) [2023-03-07 04:14:52,021][118044] Updated weights for policy 0, policy_version 40090 (0.0005) [2023-03-07 04:14:52,801][118044] Updated weights for policy 0, policy_version 40100 (0.0007) [2023-03-07 04:14:53,568][118044] Updated weights for policy 0, policy_version 40110 (0.0006) [2023-03-07 04:14:54,354][118044] Updated weights for policy 0, policy_version 40120 (0.0006) [2023-03-07 04:14:55,133][118044] Updated weights for policy 0, policy_version 40130 (0.0007) [2023-03-07 04:14:55,917][118044] Updated weights for policy 0, policy_version 40140 (0.0006) [2023-03-07 04:14:56,085][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13138.4). Total num frames: 41105408. Throughput: 0: 13147.7. Samples: 41071496. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:14:56,086][117718] Avg episode reward: [(0, '2953.461')] [2023-03-07 04:14:56,710][118044] Updated weights for policy 0, policy_version 40150 (0.0007) [2023-03-07 04:14:57,482][118044] Updated weights for policy 0, policy_version 40160 (0.0006) [2023-03-07 04:14:58,266][118044] Updated weights for policy 0, policy_version 40170 (0.0006) [2023-03-07 04:14:59,041][118044] Updated weights for policy 0, policy_version 40180 (0.0006) [2023-03-07 04:14:59,811][118044] Updated weights for policy 0, policy_version 40190 (0.0006) [2023-03-07 04:15:00,577][118044] Updated weights for policy 0, policy_version 40200 (0.0006) [2023-03-07 04:15:01,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13135.0). Total num frames: 41170944. Throughput: 0: 13146.1. Samples: 41150172. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:15:01,086][117718] Avg episode reward: [(0, '3012.777')] [2023-03-07 04:15:01,366][118044] Updated weights for policy 0, policy_version 40210 (0.0006) [2023-03-07 04:15:02,136][118044] Updated weights for policy 0, policy_version 40220 (0.0006) [2023-03-07 04:15:02,918][118044] Updated weights for policy 0, policy_version 40230 (0.0006) [2023-03-07 04:15:03,711][118044] Updated weights for policy 0, policy_version 40240 (0.0005) [2023-03-07 04:15:04,483][118044] Updated weights for policy 0, policy_version 40250 (0.0006) [2023-03-07 04:15:05,249][118044] Updated weights for policy 0, policy_version 40260 (0.0006) [2023-03-07 04:15:06,022][118044] Updated weights for policy 0, policy_version 40270 (0.0005) [2023-03-07 04:15:06,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13175.4, 300 sec: 13138.4). Total num frames: 41237504. Throughput: 0: 13158.2. Samples: 41229465. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:15:06,086][117718] Avg episode reward: [(0, '2838.177')] [2023-03-07 04:15:06,798][118044] Updated weights for policy 0, policy_version 40280 (0.0006) [2023-03-07 04:15:07,573][118044] Updated weights for policy 0, policy_version 40290 (0.0006) [2023-03-07 04:15:08,350][118044] Updated weights for policy 0, policy_version 40300 (0.0007) [2023-03-07 04:15:09,136][118044] Updated weights for policy 0, policy_version 40310 (0.0006) [2023-03-07 04:15:09,901][118044] Updated weights for policy 0, policy_version 40320 (0.0006) [2023-03-07 04:15:10,663][118044] Updated weights for policy 0, policy_version 40330 (0.0007) [2023-03-07 04:15:11,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13138.4). Total num frames: 41303040. Throughput: 0: 13164.6. Samples: 41269012. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:15:11,086][117718] Avg episode reward: [(0, '2870.217')] [2023-03-07 04:15:11,432][118044] Updated weights for policy 0, policy_version 40340 (0.0005) [2023-03-07 04:15:12,210][118044] Updated weights for policy 0, policy_version 40350 (0.0006) [2023-03-07 04:15:12,992][118044] Updated weights for policy 0, policy_version 40360 (0.0006) [2023-03-07 04:15:13,762][118044] Updated weights for policy 0, policy_version 40370 (0.0006) [2023-03-07 04:15:14,527][118044] Updated weights for policy 0, policy_version 40380 (0.0006) [2023-03-07 04:15:15,323][118044] Updated weights for policy 0, policy_version 40390 (0.0006) [2023-03-07 04:15:16,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13138.4). Total num frames: 41368576. Throughput: 0: 13163.1. Samples: 41348391. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:15:16,086][117718] Avg episode reward: [(0, '2880.026')] [2023-03-07 04:15:16,098][118044] Updated weights for policy 0, policy_version 40400 (0.0006) [2023-03-07 04:15:16,869][118044] Updated weights for policy 0, policy_version 40410 (0.0006) [2023-03-07 04:15:17,673][118044] Updated weights for policy 0, policy_version 40420 (0.0006) [2023-03-07 04:15:18,458][118044] Updated weights for policy 0, policy_version 40430 (0.0006) [2023-03-07 04:15:19,233][118044] Updated weights for policy 0, policy_version 40440 (0.0006) [2023-03-07 04:15:20,002][118044] Updated weights for policy 0, policy_version 40450 (0.0007) [2023-03-07 04:15:20,784][118044] Updated weights for policy 0, policy_version 40460 (0.0006) [2023-03-07 04:15:21,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13138.4). Total num frames: 41434112. Throughput: 0: 13165.7. Samples: 41427090. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:15:21,086][117718] Avg episode reward: [(0, '2919.266')] [2023-03-07 04:15:21,576][118044] Updated weights for policy 0, policy_version 40470 (0.0006) [2023-03-07 04:15:22,360][118044] Updated weights for policy 0, policy_version 40480 (0.0006) [2023-03-07 04:15:23,135][118044] Updated weights for policy 0, policy_version 40490 (0.0006) [2023-03-07 04:15:23,914][118044] Updated weights for policy 0, policy_version 40500 (0.0007) [2023-03-07 04:15:24,705][118044] Updated weights for policy 0, policy_version 40510 (0.0005) [2023-03-07 04:15:25,480][118044] Updated weights for policy 0, policy_version 40520 (0.0007) [2023-03-07 04:15:26,085][117718] Fps is (10 sec: 13107.4, 60 sec: 13158.4, 300 sec: 13138.4). Total num frames: 41499648. Throughput: 0: 13163.7. Samples: 41466365. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:15:26,086][117718] Avg episode reward: [(0, '2846.707')] [2023-03-07 04:15:26,261][118044] Updated weights for policy 0, policy_version 40530 (0.0008) [2023-03-07 04:15:27,045][118044] Updated weights for policy 0, policy_version 40540 (0.0005) [2023-03-07 04:15:27,824][118044] Updated weights for policy 0, policy_version 40550 (0.0006) [2023-03-07 04:15:28,594][118044] Updated weights for policy 0, policy_version 40560 (0.0006) [2023-03-07 04:15:29,363][118044] Updated weights for policy 0, policy_version 40570 (0.0006) [2023-03-07 04:15:30,144][118044] Updated weights for policy 0, policy_version 40580 (0.0006) [2023-03-07 04:15:30,934][118044] Updated weights for policy 0, policy_version 40590 (0.0006) [2023-03-07 04:15:31,085][117718] Fps is (10 sec: 13209.8, 60 sec: 13158.4, 300 sec: 13141.9). Total num frames: 41566208. Throughput: 0: 13155.7. Samples: 41545177. 
Policy #0 lag: (min: 0.0, avg: 0.9, max: 2.0) [2023-03-07 04:15:31,086][117718] Avg episode reward: [(0, '2959.688')] [2023-03-07 04:15:31,709][118044] Updated weights for policy 0, policy_version 40600 (0.0007) [2023-03-07 04:15:32,502][118044] Updated weights for policy 0, policy_version 40610 (0.0006) [2023-03-07 04:15:33,288][118044] Updated weights for policy 0, policy_version 40620 (0.0006) [2023-03-07 04:15:34,061][118044] Updated weights for policy 0, policy_version 40630 (0.0006) [2023-03-07 04:15:34,832][118044] Updated weights for policy 0, policy_version 40640 (0.0006) [2023-03-07 04:15:35,623][118044] Updated weights for policy 0, policy_version 40650 (0.0007) [2023-03-07 04:15:36,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13138.4). Total num frames: 41630720. Throughput: 0: 13162.5. Samples: 41623991. Policy #0 lag: (min: 0.0, avg: 0.9, max: 2.0) [2023-03-07 04:15:36,086][117718] Avg episode reward: [(0, '2950.782')] [2023-03-07 04:15:36,399][118044] Updated weights for policy 0, policy_version 40660 (0.0006) [2023-03-07 04:15:37,172][118044] Updated weights for policy 0, policy_version 40670 (0.0006) [2023-03-07 04:15:37,945][118044] Updated weights for policy 0, policy_version 40680 (0.0006) [2023-03-07 04:15:38,728][118044] Updated weights for policy 0, policy_version 40690 (0.0006) [2023-03-07 04:15:39,512][118044] Updated weights for policy 0, policy_version 40700 (0.0005) [2023-03-07 04:15:40,282][118044] Updated weights for policy 0, policy_version 40710 (0.0006) [2023-03-07 04:15:41,049][118044] Updated weights for policy 0, policy_version 40720 (0.0006) [2023-03-07 04:15:41,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13141.9). Total num frames: 41697280. Throughput: 0: 13156.1. Samples: 41663518. 
Policy #0 lag: (min: 0.0, avg: 0.9, max: 2.0) [2023-03-07 04:15:41,086][117718] Avg episode reward: [(0, '2954.582')] [2023-03-07 04:15:41,828][118044] Updated weights for policy 0, policy_version 40730 (0.0006) [2023-03-07 04:15:42,583][118044] Updated weights for policy 0, policy_version 40740 (0.0006) [2023-03-07 04:15:43,367][118044] Updated weights for policy 0, policy_version 40750 (0.0006) [2023-03-07 04:15:44,148][118044] Updated weights for policy 0, policy_version 40760 (0.0007) [2023-03-07 04:15:44,939][118044] Updated weights for policy 0, policy_version 40770 (0.0006) [2023-03-07 04:15:45,705][118044] Updated weights for policy 0, policy_version 40780 (0.0006) [2023-03-07 04:15:46,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13141.9). Total num frames: 41762816. Throughput: 0: 13164.0. Samples: 41742553. Policy #0 lag: (min: 0.0, avg: 0.9, max: 2.0) [2023-03-07 04:15:46,086][117718] Avg episode reward: [(0, '3059.859')] [2023-03-07 04:15:46,510][118044] Updated weights for policy 0, policy_version 40790 (0.0006) [2023-03-07 04:15:47,283][118044] Updated weights for policy 0, policy_version 40800 (0.0006) [2023-03-07 04:15:48,051][118044] Updated weights for policy 0, policy_version 40810 (0.0006) [2023-03-07 04:15:48,837][118044] Updated weights for policy 0, policy_version 40820 (0.0007) [2023-03-07 04:15:49,614][118044] Updated weights for policy 0, policy_version 40830 (0.0007) [2023-03-07 04:15:50,382][118044] Updated weights for policy 0, policy_version 40840 (0.0006) [2023-03-07 04:15:51,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13158.4, 300 sec: 13141.9). Total num frames: 41828352. Throughput: 0: 13154.0. Samples: 41821394. 
Policy #0 lag: (min: 0.0, avg: 0.9, max: 2.0) [2023-03-07 04:15:51,086][117718] Avg episode reward: [(0, '2967.084')] [2023-03-07 04:15:51,173][118044] Updated weights for policy 0, policy_version 40850 (0.0006) [2023-03-07 04:15:51,934][118044] Updated weights for policy 0, policy_version 40860 (0.0006) [2023-03-07 04:15:52,727][118044] Updated weights for policy 0, policy_version 40870 (0.0006) [2023-03-07 04:15:53,484][118044] Updated weights for policy 0, policy_version 40880 (0.0006) [2023-03-07 04:15:54,272][118044] Updated weights for policy 0, policy_version 40890 (0.0006) [2023-03-07 04:15:55,036][118044] Updated weights for policy 0, policy_version 40900 (0.0006) [2023-03-07 04:15:55,823][118044] Updated weights for policy 0, policy_version 40910 (0.0006) [2023-03-07 04:15:56,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 41894912. Throughput: 0: 13156.1. Samples: 41861040. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:15:56,086][117718] Avg episode reward: [(0, '2943.851')] [2023-03-07 04:15:56,602][118044] Updated weights for policy 0, policy_version 40920 (0.0006) [2023-03-07 04:15:57,378][118044] Updated weights for policy 0, policy_version 40930 (0.0006) [2023-03-07 04:15:58,163][118044] Updated weights for policy 0, policy_version 40940 (0.0006) [2023-03-07 04:15:58,954][118044] Updated weights for policy 0, policy_version 40950 (0.0006) [2023-03-07 04:15:59,729][118044] Updated weights for policy 0, policy_version 40960 (0.0006) [2023-03-07 04:16:00,514][118044] Updated weights for policy 0, policy_version 40970 (0.0005) [2023-03-07 04:16:01,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 41960448. Throughput: 0: 13143.5. Samples: 41939850. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:16:01,087][117718] Avg episode reward: [(0, '2869.016')] [2023-03-07 04:16:01,275][118044] Updated weights for policy 0, policy_version 40980 (0.0006) [2023-03-07 04:16:02,066][118044] Updated weights for policy 0, policy_version 40990 (0.0006) [2023-03-07 04:16:02,849][118044] Updated weights for policy 0, policy_version 41000 (0.0006) [2023-03-07 04:16:03,631][118044] Updated weights for policy 0, policy_version 41010 (0.0006) [2023-03-07 04:16:04,420][118044] Updated weights for policy 0, policy_version 41020 (0.0006) [2023-03-07 04:16:05,199][118044] Updated weights for policy 0, policy_version 41030 (0.0005) [2023-03-07 04:16:05,969][118044] Updated weights for policy 0, policy_version 41040 (0.0006) [2023-03-07 04:16:06,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 42025984. Throughput: 0: 13145.1. Samples: 42018620. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:16:06,086][117718] Avg episode reward: [(0, '2935.576')] [2023-03-07 04:16:06,749][118044] Updated weights for policy 0, policy_version 41050 (0.0006) [2023-03-07 04:16:07,523][118044] Updated weights for policy 0, policy_version 41060 (0.0006) [2023-03-07 04:16:08,293][118044] Updated weights for policy 0, policy_version 41070 (0.0006) [2023-03-07 04:16:09,087][118044] Updated weights for policy 0, policy_version 41080 (0.0006) [2023-03-07 04:16:09,849][118044] Updated weights for policy 0, policy_version 41090 (0.0006) [2023-03-07 04:16:10,635][118044] Updated weights for policy 0, policy_version 41100 (0.0006) [2023-03-07 04:16:11,085][117718] Fps is (10 sec: 13107.4, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 42091520. Throughput: 0: 13150.6. Samples: 42058143. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:16:11,086][117718] Avg episode reward: [(0, '2988.395')] [2023-03-07 04:16:11,400][118044] Updated weights for policy 0, policy_version 41110 (0.0006) [2023-03-07 04:16:12,177][118044] Updated weights for policy 0, policy_version 41120 (0.0005) [2023-03-07 04:16:12,948][118044] Updated weights for policy 0, policy_version 41130 (0.0006) [2023-03-07 04:16:13,710][118044] Updated weights for policy 0, policy_version 41140 (0.0006) [2023-03-07 04:16:14,497][118044] Updated weights for policy 0, policy_version 41150 (0.0006) [2023-03-07 04:16:15,282][118044] Updated weights for policy 0, policy_version 41160 (0.0007) [2023-03-07 04:16:16,052][118044] Updated weights for policy 0, policy_version 41170 (0.0006) [2023-03-07 04:16:16,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 42158080. Throughput: 0: 13161.4. Samples: 42137438. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:16:16,086][117718] Avg episode reward: [(0, '3009.826')] [2023-03-07 04:16:16,821][118044] Updated weights for policy 0, policy_version 41180 (0.0007) [2023-03-07 04:16:17,601][118044] Updated weights for policy 0, policy_version 41190 (0.0007) [2023-03-07 04:16:18,384][118044] Updated weights for policy 0, policy_version 41200 (0.0006) [2023-03-07 04:16:19,173][118044] Updated weights for policy 0, policy_version 41210 (0.0006) [2023-03-07 04:16:19,977][118044] Updated weights for policy 0, policy_version 41220 (0.0007) [2023-03-07 04:16:20,745][118044] Updated weights for policy 0, policy_version 41230 (0.0007) [2023-03-07 04:16:21,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 42223616. Throughput: 0: 13157.1. Samples: 42216058. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:16:21,086][117718] Avg episode reward: [(0, '3040.486')] [2023-03-07 04:16:21,514][118044] Updated weights for policy 0, policy_version 41240 (0.0006) [2023-03-07 04:16:22,302][118044] Updated weights for policy 0, policy_version 41250 (0.0006) [2023-03-07 04:16:23,085][118044] Updated weights for policy 0, policy_version 41260 (0.0006) [2023-03-07 04:16:23,852][118044] Updated weights for policy 0, policy_version 41270 (0.0007) [2023-03-07 04:16:24,636][118044] Updated weights for policy 0, policy_version 41280 (0.0007) [2023-03-07 04:16:25,422][118044] Updated weights for policy 0, policy_version 41290 (0.0006) [2023-03-07 04:16:26,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 42289152. Throughput: 0: 13157.3. Samples: 42255599. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:16:26,086][117718] Avg episode reward: [(0, '2970.828')] [2023-03-07 04:16:26,186][118044] Updated weights for policy 0, policy_version 41300 (0.0006) [2023-03-07 04:16:26,979][118044] Updated weights for policy 0, policy_version 41310 (0.0006) [2023-03-07 04:16:27,752][118044] Updated weights for policy 0, policy_version 41320 (0.0006) [2023-03-07 04:16:28,527][118044] Updated weights for policy 0, policy_version 41330 (0.0006) [2023-03-07 04:16:29,313][118044] Updated weights for policy 0, policy_version 41340 (0.0007) [2023-03-07 04:16:30,099][118044] Updated weights for policy 0, policy_version 41350 (0.0006) [2023-03-07 04:16:30,873][118044] Updated weights for policy 0, policy_version 41360 (0.0006) [2023-03-07 04:16:31,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 42354688. Throughput: 0: 13152.2. Samples: 42334404. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:16:31,086][117718] Avg episode reward: [(0, '2900.543')] [2023-03-07 04:16:31,641][118044] Updated weights for policy 0, policy_version 41370 (0.0006) [2023-03-07 04:16:32,437][118044] Updated weights for policy 0, policy_version 41380 (0.0006) [2023-03-07 04:16:33,230][118044] Updated weights for policy 0, policy_version 41390 (0.0006) [2023-03-07 04:16:33,991][118044] Updated weights for policy 0, policy_version 41400 (0.0005) [2023-03-07 04:16:34,783][118044] Updated weights for policy 0, policy_version 41410 (0.0006) [2023-03-07 04:16:35,568][118044] Updated weights for policy 0, policy_version 41420 (0.0006) [2023-03-07 04:16:36,086][117718] Fps is (10 sec: 13106.9, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 42420224. Throughput: 0: 13148.6. Samples: 42413083. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:16:36,086][117718] Avg episode reward: [(0, '2938.946')] [2023-03-07 04:16:36,092][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000041426_42420224.pth... 
[2023-03-07 04:16:36,126][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000038345_39265280.pth [2023-03-07 04:16:36,325][118044] Updated weights for policy 0, policy_version 41430 (0.0006) [2023-03-07 04:16:37,125][118044] Updated weights for policy 0, policy_version 41440 (0.0005) [2023-03-07 04:16:37,932][118044] Updated weights for policy 0, policy_version 41450 (0.0006) [2023-03-07 04:16:38,693][118044] Updated weights for policy 0, policy_version 41460 (0.0005) [2023-03-07 04:16:39,476][118044] Updated weights for policy 0, policy_version 41470 (0.0008) [2023-03-07 04:16:40,243][118044] Updated weights for policy 0, policy_version 41480 (0.0006) [2023-03-07 04:16:41,031][118044] Updated weights for policy 0, policy_version 41490 (0.0006) [2023-03-07 04:16:41,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 42485760. Throughput: 0: 13139.2. Samples: 42452304. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:16:41,086][117718] Avg episode reward: [(0, '2997.415')] [2023-03-07 04:16:41,812][118044] Updated weights for policy 0, policy_version 41500 (0.0006) [2023-03-07 04:16:42,598][118044] Updated weights for policy 0, policy_version 41510 (0.0007) [2023-03-07 04:16:43,363][118044] Updated weights for policy 0, policy_version 41520 (0.0006) [2023-03-07 04:16:44,133][118044] Updated weights for policy 0, policy_version 41530 (0.0005) [2023-03-07 04:16:44,907][118044] Updated weights for policy 0, policy_version 41540 (0.0006) [2023-03-07 04:16:45,709][118044] Updated weights for policy 0, policy_version 41550 (0.0006) [2023-03-07 04:16:46,086][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13148.8). Total num frames: 42552320. Throughput: 0: 13143.6. Samples: 42531313. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:16:46,086][117718] Avg episode reward: [(0, '3009.144')] [2023-03-07 04:16:46,471][118044] Updated weights for policy 0, policy_version 41560 (0.0005) [2023-03-07 04:16:47,247][118044] Updated weights for policy 0, policy_version 41570 (0.0006) [2023-03-07 04:16:48,040][118044] Updated weights for policy 0, policy_version 41580 (0.0006) [2023-03-07 04:16:48,816][118044] Updated weights for policy 0, policy_version 41590 (0.0006) [2023-03-07 04:16:49,594][118044] Updated weights for policy 0, policy_version 41600 (0.0006) [2023-03-07 04:16:50,379][118044] Updated weights for policy 0, policy_version 41610 (0.0006) [2023-03-07 04:16:51,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.4, 300 sec: 13145.4). Total num frames: 42616832. Throughput: 0: 13143.6. Samples: 42610083. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:16:51,086][117718] Avg episode reward: [(0, '3010.211')] [2023-03-07 04:16:51,174][118044] Updated weights for policy 0, policy_version 41620 (0.0005) [2023-03-07 04:16:51,948][118044] Updated weights for policy 0, policy_version 41630 (0.0006) [2023-03-07 04:16:52,729][118044] Updated weights for policy 0, policy_version 41640 (0.0006) [2023-03-07 04:16:53,493][118044] Updated weights for policy 0, policy_version 41650 (0.0006) [2023-03-07 04:16:54,257][118044] Updated weights for policy 0, policy_version 41660 (0.0007) [2023-03-07 04:16:55,049][118044] Updated weights for policy 0, policy_version 41670 (0.0006) [2023-03-07 04:16:55,832][118044] Updated weights for policy 0, policy_version 41680 (0.0007) [2023-03-07 04:16:56,085][117718] Fps is (10 sec: 13107.4, 60 sec: 13141.4, 300 sec: 13145.4). Total num frames: 42683392. Throughput: 0: 13141.6. Samples: 42649513. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:16:56,086][117718] Avg episode reward: [(0, '2982.186')] [2023-03-07 04:16:56,605][118044] Updated weights for policy 0, policy_version 41690 (0.0006) [2023-03-07 04:16:57,383][118044] Updated weights for policy 0, policy_version 41700 (0.0006) [2023-03-07 04:16:58,177][118044] Updated weights for policy 0, policy_version 41710 (0.0007) [2023-03-07 04:16:58,946][118044] Updated weights for policy 0, policy_version 41720 (0.0006) [2023-03-07 04:16:59,724][118044] Updated weights for policy 0, policy_version 41730 (0.0007) [2023-03-07 04:17:00,487][118044] Updated weights for policy 0, policy_version 41740 (0.0006) [2023-03-07 04:17:01,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.4, 300 sec: 13145.4). Total num frames: 42748928. Throughput: 0: 13133.3. Samples: 42728437. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:17:01,086][117718] Avg episode reward: [(0, '2994.599')] [2023-03-07 04:17:01,258][118044] Updated weights for policy 0, policy_version 41750 (0.0006) [2023-03-07 04:17:02,050][118044] Updated weights for policy 0, policy_version 41760 (0.0006) [2023-03-07 04:17:02,844][118044] Updated weights for policy 0, policy_version 41770 (0.0006) [2023-03-07 04:17:03,600][118044] Updated weights for policy 0, policy_version 41780 (0.0006) [2023-03-07 04:17:04,381][118044] Updated weights for policy 0, policy_version 41790 (0.0006) [2023-03-07 04:17:05,170][118044] Updated weights for policy 0, policy_version 41800 (0.0006) [2023-03-07 04:17:05,937][118044] Updated weights for policy 0, policy_version 41810 (0.0006) [2023-03-07 04:17:06,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 42814464. Throughput: 0: 13141.2. Samples: 42807412. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:17:06,096][117718] Avg episode reward: [(0, '3029.454')] [2023-03-07 04:17:06,725][118044] Updated weights for policy 0, policy_version 41820 (0.0006) [2023-03-07 04:17:07,525][118044] Updated weights for policy 0, policy_version 41830 (0.0006) [2023-03-07 04:17:08,301][118044] Updated weights for policy 0, policy_version 41840 (0.0006) [2023-03-07 04:17:09,076][118044] Updated weights for policy 0, policy_version 41850 (0.0007) [2023-03-07 04:17:09,839][118044] Updated weights for policy 0, policy_version 41860 (0.0006) [2023-03-07 04:17:10,634][118044] Updated weights for policy 0, policy_version 41870 (0.0006) [2023-03-07 04:17:11,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 42880000. Throughput: 0: 13138.6. Samples: 42846837. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:17:11,097][117718] Avg episode reward: [(0, '2995.935')] [2023-03-07 04:17:11,422][118044] Updated weights for policy 0, policy_version 41880 (0.0006) [2023-03-07 04:17:12,199][118044] Updated weights for policy 0, policy_version 41890 (0.0006) [2023-03-07 04:17:12,971][118044] Updated weights for policy 0, policy_version 41900 (0.0007) [2023-03-07 04:17:13,743][118044] Updated weights for policy 0, policy_version 41910 (0.0006) [2023-03-07 04:17:14,518][118044] Updated weights for policy 0, policy_version 41920 (0.0006) [2023-03-07 04:17:15,289][118044] Updated weights for policy 0, policy_version 41930 (0.0006) [2023-03-07 04:17:16,071][118044] Updated weights for policy 0, policy_version 41940 (0.0006) [2023-03-07 04:17:16,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 42946560. Throughput: 0: 13140.0. Samples: 42925704. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:17:16,096][117718] Avg episode reward: [(0, '3030.191')] [2023-03-07 04:17:16,854][118044] Updated weights for policy 0, policy_version 41950 (0.0007) [2023-03-07 04:17:17,611][118044] Updated weights for policy 0, policy_version 41960 (0.0006) [2023-03-07 04:17:18,398][118044] Updated weights for policy 0, policy_version 41970 (0.0007) [2023-03-07 04:17:19,192][118044] Updated weights for policy 0, policy_version 41980 (0.0007) [2023-03-07 04:17:19,963][118044] Updated weights for policy 0, policy_version 41990 (0.0006) [2023-03-07 04:17:20,751][118044] Updated weights for policy 0, policy_version 42000 (0.0006) [2023-03-07 04:17:21,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 43012096. Throughput: 0: 13142.9. Samples: 43004509. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:17:21,097][117718] Avg episode reward: [(0, '3068.925')] [2023-03-07 04:17:21,525][118044] Updated weights for policy 0, policy_version 42010 (0.0006) [2023-03-07 04:17:22,306][118044] Updated weights for policy 0, policy_version 42020 (0.0007) [2023-03-07 04:17:23,098][118044] Updated weights for policy 0, policy_version 42030 (0.0006) [2023-03-07 04:17:23,876][118044] Updated weights for policy 0, policy_version 42040 (0.0007) [2023-03-07 04:17:24,658][118044] Updated weights for policy 0, policy_version 42050 (0.0006) [2023-03-07 04:17:25,415][118044] Updated weights for policy 0, policy_version 42060 (0.0005) [2023-03-07 04:17:26,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 43077632. Throughput: 0: 13144.4. Samples: 43043802. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:17:26,097][117718] Avg episode reward: [(0, '2993.119')] [2023-03-07 04:17:26,218][118044] Updated weights for policy 0, policy_version 42070 (0.0006) [2023-03-07 04:17:26,996][118044] Updated weights for policy 0, policy_version 42080 (0.0005) [2023-03-07 04:17:27,754][118044] Updated weights for policy 0, policy_version 42090 (0.0006) [2023-03-07 04:17:28,530][118044] Updated weights for policy 0, policy_version 42100 (0.0006) [2023-03-07 04:17:29,305][118044] Updated weights for policy 0, policy_version 42110 (0.0007) [2023-03-07 04:17:30,080][118044] Updated weights for policy 0, policy_version 42120 (0.0006) [2023-03-07 04:17:30,857][118044] Updated weights for policy 0, policy_version 42130 (0.0007) [2023-03-07 04:17:31,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 43143168. Throughput: 0: 13149.7. Samples: 43123048. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:17:31,097][117718] Avg episode reward: [(0, '3047.549')] [2023-03-07 04:17:31,626][118044] Updated weights for policy 0, policy_version 42140 (0.0006) [2023-03-07 04:17:32,419][118044] Updated weights for policy 0, policy_version 42150 (0.0006) [2023-03-07 04:17:33,208][118044] Updated weights for policy 0, policy_version 42160 (0.0006) [2023-03-07 04:17:33,991][118044] Updated weights for policy 0, policy_version 42170 (0.0006) [2023-03-07 04:17:34,759][118044] Updated weights for policy 0, policy_version 42180 (0.0007) [2023-03-07 04:17:35,550][118044] Updated weights for policy 0, policy_version 42190 (0.0006) [2023-03-07 04:17:36,085][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 43209728. Throughput: 0: 13151.1. Samples: 43201883. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:17:36,096][117718] Avg episode reward: [(0, '2979.469')] [2023-03-07 04:17:36,319][118044] Updated weights for policy 0, policy_version 42200 (0.0006) [2023-03-07 04:17:37,117][118044] Updated weights for policy 0, policy_version 42210 (0.0007) [2023-03-07 04:17:37,897][118044] Updated weights for policy 0, policy_version 42220 (0.0007) [2023-03-07 04:17:38,676][118044] Updated weights for policy 0, policy_version 42230 (0.0006) [2023-03-07 04:17:39,455][118044] Updated weights for policy 0, policy_version 42240 (0.0005) [2023-03-07 04:17:40,225][118044] Updated weights for policy 0, policy_version 42250 (0.0006) [2023-03-07 04:17:40,996][118044] Updated weights for policy 0, policy_version 42260 (0.0007) [2023-03-07 04:17:41,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 43275264. Throughput: 0: 13145.7. Samples: 43241071. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:17:41,097][117718] Avg episode reward: [(0, '3055.391')] [2023-03-07 04:17:41,802][118044] Updated weights for policy 0, policy_version 42270 (0.0006) [2023-03-07 04:17:42,570][118044] Updated weights for policy 0, policy_version 42280 (0.0006) [2023-03-07 04:17:43,353][118044] Updated weights for policy 0, policy_version 42290 (0.0006) [2023-03-07 04:17:44,119][118044] Updated weights for policy 0, policy_version 42300 (0.0006) [2023-03-07 04:17:44,883][118044] Updated weights for policy 0, policy_version 42310 (0.0006) [2023-03-07 04:17:45,672][118044] Updated weights for policy 0, policy_version 42320 (0.0006) [2023-03-07 04:17:46,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 43340800. Throughput: 0: 13149.1. Samples: 43320148. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:17:46,097][117718] Avg episode reward: [(0, '2833.497')] [2023-03-07 04:17:46,445][118044] Updated weights for policy 0, policy_version 42330 (0.0006) [2023-03-07 04:17:47,212][118044] Updated weights for policy 0, policy_version 42340 (0.0006) [2023-03-07 04:17:47,989][118044] Updated weights for policy 0, policy_version 42350 (0.0006) [2023-03-07 04:17:48,763][118044] Updated weights for policy 0, policy_version 42360 (0.0006) [2023-03-07 04:17:49,556][118044] Updated weights for policy 0, policy_version 42370 (0.0007) [2023-03-07 04:17:50,333][118044] Updated weights for policy 0, policy_version 42380 (0.0006) [2023-03-07 04:17:51,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 43406336. Throughput: 0: 13150.8. Samples: 43399198. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:17:51,096][117718] Avg episode reward: [(0, '3073.709')] [2023-03-07 04:17:51,106][118044] Updated weights for policy 0, policy_version 42390 (0.0006) [2023-03-07 04:17:51,898][118044] Updated weights for policy 0, policy_version 42400 (0.0005) [2023-03-07 04:17:52,670][118044] Updated weights for policy 0, policy_version 42410 (0.0006) [2023-03-07 04:17:53,457][118044] Updated weights for policy 0, policy_version 42420 (0.0006) [2023-03-07 04:17:54,229][118044] Updated weights for policy 0, policy_version 42430 (0.0006) [2023-03-07 04:17:55,011][118044] Updated weights for policy 0, policy_version 42440 (0.0006) [2023-03-07 04:17:55,815][118044] Updated weights for policy 0, policy_version 42450 (0.0006) [2023-03-07 04:17:56,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 43471872. Throughput: 0: 13147.0. Samples: 43438450. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:17:56,097][117718] Avg episode reward: [(0, '3039.492')] [2023-03-07 04:17:56,599][118044] Updated weights for policy 0, policy_version 42460 (0.0006) [2023-03-07 04:17:57,375][118044] Updated weights for policy 0, policy_version 42470 (0.0006) [2023-03-07 04:17:58,157][118044] Updated weights for policy 0, policy_version 42480 (0.0006) [2023-03-07 04:17:58,937][118044] Updated weights for policy 0, policy_version 42490 (0.0006) [2023-03-07 04:17:59,728][118044] Updated weights for policy 0, policy_version 42500 (0.0007) [2023-03-07 04:18:00,498][118044] Updated weights for policy 0, policy_version 42510 (0.0006) [2023-03-07 04:18:01,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 43537408. Throughput: 0: 13139.0. Samples: 43516960. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:18:01,097][117718] Avg episode reward: [(0, '2980.890')] [2023-03-07 04:18:01,264][118044] Updated weights for policy 0, policy_version 42520 (0.0007) [2023-03-07 04:18:02,040][118044] Updated weights for policy 0, policy_version 42530 (0.0006) [2023-03-07 04:18:02,835][118044] Updated weights for policy 0, policy_version 42540 (0.0006) [2023-03-07 04:18:03,611][118044] Updated weights for policy 0, policy_version 42550 (0.0007) [2023-03-07 04:18:04,390][118044] Updated weights for policy 0, policy_version 42560 (0.0007) [2023-03-07 04:18:05,157][118044] Updated weights for policy 0, policy_version 42570 (0.0007) [2023-03-07 04:18:05,935][118044] Updated weights for policy 0, policy_version 42580 (0.0007) [2023-03-07 04:18:06,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 43602944. Throughput: 0: 13143.7. Samples: 43595977. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:18:06,097][117718] Avg episode reward: [(0, '2888.188')] [2023-03-07 04:18:06,721][118044] Updated weights for policy 0, policy_version 42590 (0.0006) [2023-03-07 04:18:07,497][118044] Updated weights for policy 0, policy_version 42600 (0.0006) [2023-03-07 04:18:08,286][118044] Updated weights for policy 0, policy_version 42610 (0.0006) [2023-03-07 04:18:09,069][118044] Updated weights for policy 0, policy_version 42620 (0.0006) [2023-03-07 04:18:09,833][118044] Updated weights for policy 0, policy_version 42630 (0.0006) [2023-03-07 04:18:10,645][118044] Updated weights for policy 0, policy_version 42640 (0.0007) [2023-03-07 04:18:11,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 43668480. Throughput: 0: 13141.7. Samples: 43635182. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:18:11,097][117718] Avg episode reward: [(0, '2977.386')] [2023-03-07 04:18:11,421][118044] Updated weights for policy 0, policy_version 42650 (0.0005) [2023-03-07 04:18:12,199][118044] Updated weights for policy 0, policy_version 42660 (0.0006) [2023-03-07 04:18:12,969][118044] Updated weights for policy 0, policy_version 42670 (0.0006) [2023-03-07 04:18:13,753][118044] Updated weights for policy 0, policy_version 42680 (0.0006) [2023-03-07 04:18:14,527][118044] Updated weights for policy 0, policy_version 42690 (0.0006) [2023-03-07 04:18:15,297][118044] Updated weights for policy 0, policy_version 42700 (0.0006) [2023-03-07 04:18:16,084][118044] Updated weights for policy 0, policy_version 42710 (0.0006) [2023-03-07 04:18:16,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 43735040. Throughput: 0: 13136.7. Samples: 43714203. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:18:16,095][117718] Avg episode reward: [(0, '2977.356')] [2023-03-07 04:18:16,850][118044] Updated weights for policy 0, policy_version 42720 (0.0006) [2023-03-07 04:18:17,633][118044] Updated weights for policy 0, policy_version 42730 (0.0006) [2023-03-07 04:18:18,406][118044] Updated weights for policy 0, policy_version 42740 (0.0007) [2023-03-07 04:18:19,179][118044] Updated weights for policy 0, policy_version 42750 (0.0006) [2023-03-07 04:18:19,953][118044] Updated weights for policy 0, policy_version 42760 (0.0006) [2023-03-07 04:18:20,722][118044] Updated weights for policy 0, policy_version 42770 (0.0006) [2023-03-07 04:18:21,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 43800576. Throughput: 0: 13145.8. Samples: 43793445. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:18:21,097][117718] Avg episode reward: [(0, '2942.856')] [2023-03-07 04:18:21,489][118044] Updated weights for policy 0, policy_version 42780 (0.0007) [2023-03-07 04:18:22,263][118044] Updated weights for policy 0, policy_version 42790 (0.0006) [2023-03-07 04:18:23,060][118044] Updated weights for policy 0, policy_version 42800 (0.0006) [2023-03-07 04:18:23,820][118044] Updated weights for policy 0, policy_version 42810 (0.0006) [2023-03-07 04:18:24,574][118044] Updated weights for policy 0, policy_version 42820 (0.0006) [2023-03-07 04:18:25,368][118044] Updated weights for policy 0, policy_version 42830 (0.0006) [2023-03-07 04:18:26,086][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 43867136. Throughput: 0: 13155.0. Samples: 43833047. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:18:26,097][117718] Avg episode reward: [(0, '3163.262')] [2023-03-07 04:18:26,125][118044] Updated weights for policy 0, policy_version 42840 (0.0007) [2023-03-07 04:18:26,892][118044] Updated weights for policy 0, policy_version 42850 (0.0006) [2023-03-07 04:18:27,700][118044] Updated weights for policy 0, policy_version 42860 (0.0006) [2023-03-07 04:18:28,474][118044] Updated weights for policy 0, policy_version 42870 (0.0006) [2023-03-07 04:18:29,241][118044] Updated weights for policy 0, policy_version 42880 (0.0006) [2023-03-07 04:18:30,031][118044] Updated weights for policy 0, policy_version 42890 (0.0007) [2023-03-07 04:18:30,796][118044] Updated weights for policy 0, policy_version 42900 (0.0006) [2023-03-07 04:18:31,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13148.8). Total num frames: 43932672. Throughput: 0: 13161.7. Samples: 43912424. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:18:31,097][117718] Avg episode reward: [(0, '3065.525')] [2023-03-07 04:18:31,599][118044] Updated weights for policy 0, policy_version 42910 (0.0006) [2023-03-07 04:18:32,364][118044] Updated weights for policy 0, policy_version 42920 (0.0007) [2023-03-07 04:18:33,141][118044] Updated weights for policy 0, policy_version 42930 (0.0007) [2023-03-07 04:18:33,921][118044] Updated weights for policy 0, policy_version 42940 (0.0007) [2023-03-07 04:18:34,694][118044] Updated weights for policy 0, policy_version 42950 (0.0006) [2023-03-07 04:18:35,480][118044] Updated weights for policy 0, policy_version 42960 (0.0007) [2023-03-07 04:18:36,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 43998208. Throughput: 0: 13155.6. Samples: 43991203. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:18:36,097][117718] Avg episode reward: [(0, '3007.486')] [2023-03-07 04:18:36,107][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000042968_43999232.pth... [2023-03-07 04:18:36,137][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000039885_40842240.pth [2023-03-07 04:18:36,276][118044] Updated weights for policy 0, policy_version 42970 (0.0006) [2023-03-07 04:18:37,046][118044] Updated weights for policy 0, policy_version 42980 (0.0006) [2023-03-07 04:18:37,810][118044] Updated weights for policy 0, policy_version 42990 (0.0007) [2023-03-07 04:18:38,602][118044] Updated weights for policy 0, policy_version 43000 (0.0006) [2023-03-07 04:18:39,400][118044] Updated weights for policy 0, policy_version 43010 (0.0005) [2023-03-07 04:18:40,182][118044] Updated weights for policy 0, policy_version 43020 (0.0006) [2023-03-07 04:18:40,942][118044] Updated weights for policy 0, policy_version 43030 (0.0006) [2023-03-07 04:18:41,085][117718] Fps is (10 sec: 13107.4, 60 sec: 13141.4, 300 sec: 13148.9). Total num frames: 44063744. Throughput: 0: 13156.6. Samples: 44030496. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:18:41,086][117718] Avg episode reward: [(0, '2997.771')] [2023-03-07 04:18:41,718][118044] Updated weights for policy 0, policy_version 43040 (0.0006) [2023-03-07 04:18:42,520][118044] Updated weights for policy 0, policy_version 43050 (0.0006) [2023-03-07 04:18:43,278][118044] Updated weights for policy 0, policy_version 43060 (0.0006) [2023-03-07 04:18:44,044][118044] Updated weights for policy 0, policy_version 43070 (0.0006) [2023-03-07 04:18:44,827][118044] Updated weights for policy 0, policy_version 43080 (0.0006) [2023-03-07 04:18:45,607][118044] Updated weights for policy 0, policy_version 43090 (0.0007) [2023-03-07 04:18:46,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 44130304. Throughput: 0: 13162.7. Samples: 44109285. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:18:46,086][117718] Avg episode reward: [(0, '3026.191')] [2023-03-07 04:18:46,394][118044] Updated weights for policy 0, policy_version 43100 (0.0007) [2023-03-07 04:18:47,168][118044] Updated weights for policy 0, policy_version 43110 (0.0006) [2023-03-07 04:18:47,953][118044] Updated weights for policy 0, policy_version 43120 (0.0007) [2023-03-07 04:18:48,732][118044] Updated weights for policy 0, policy_version 43130 (0.0006) [2023-03-07 04:18:49,519][118044] Updated weights for policy 0, policy_version 43140 (0.0006) [2023-03-07 04:18:50,285][118044] Updated weights for policy 0, policy_version 43150 (0.0007) [2023-03-07 04:18:51,072][118044] Updated weights for policy 0, policy_version 43160 (0.0006) [2023-03-07 04:18:51,085][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 44195840. Throughput: 0: 13161.7. Samples: 44188252. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:18:51,086][117718] Avg episode reward: [(0, '3007.093')] [2023-03-07 04:18:51,844][118044] Updated weights for policy 0, policy_version 43170 (0.0006) [2023-03-07 04:18:52,641][118044] Updated weights for policy 0, policy_version 43180 (0.0006) [2023-03-07 04:18:53,402][118044] Updated weights for policy 0, policy_version 43190 (0.0006) [2023-03-07 04:18:54,175][118044] Updated weights for policy 0, policy_version 43200 (0.0006) [2023-03-07 04:18:54,945][118044] Updated weights for policy 0, policy_version 43210 (0.0006) [2023-03-07 04:18:55,719][118044] Updated weights for policy 0, policy_version 43220 (0.0006) [2023-03-07 04:18:56,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 44261376. Throughput: 0: 13165.4. Samples: 44227625. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:18:56,086][117718] Avg episode reward: [(0, '2953.367')] [2023-03-07 04:18:56,519][118044] Updated weights for policy 0, policy_version 43230 (0.0007) [2023-03-07 04:18:57,302][118044] Updated weights for policy 0, policy_version 43240 (0.0006) [2023-03-07 04:18:58,066][118044] Updated weights for policy 0, policy_version 43250 (0.0007) [2023-03-07 04:18:58,855][118044] Updated weights for policy 0, policy_version 43260 (0.0006) [2023-03-07 04:18:59,640][118044] Updated weights for policy 0, policy_version 43270 (0.0006) [2023-03-07 04:19:00,403][118044] Updated weights for policy 0, policy_version 43280 (0.0006) [2023-03-07 04:19:01,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 44326912. Throughput: 0: 13164.7. Samples: 44306611. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:19:01,086][117718] Avg episode reward: [(0, '3035.467')] [2023-03-07 04:19:01,193][118044] Updated weights for policy 0, policy_version 43290 (0.0006) [2023-03-07 04:19:01,954][118044] Updated weights for policy 0, policy_version 43300 (0.0006) [2023-03-07 04:19:02,744][118044] Updated weights for policy 0, policy_version 43310 (0.0005) [2023-03-07 04:19:03,522][118044] Updated weights for policy 0, policy_version 43320 (0.0006) [2023-03-07 04:19:04,307][118044] Updated weights for policy 0, policy_version 43330 (0.0006) [2023-03-07 04:19:05,081][118044] Updated weights for policy 0, policy_version 43340 (0.0006) [2023-03-07 04:19:05,865][118044] Updated weights for policy 0, policy_version 43350 (0.0006) [2023-03-07 04:19:06,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13152.3). Total num frames: 44393472. Throughput: 0: 13159.4. Samples: 44385618. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:19:06,086][117718] Avg episode reward: [(0, '3012.504')] [2023-03-07 04:19:06,637][118044] Updated weights for policy 0, policy_version 43360 (0.0006) [2023-03-07 04:19:07,410][118044] Updated weights for policy 0, policy_version 43370 (0.0006) [2023-03-07 04:19:08,184][118044] Updated weights for policy 0, policy_version 43380 (0.0007) [2023-03-07 04:19:08,957][118044] Updated weights for policy 0, policy_version 43390 (0.0006) [2023-03-07 04:19:09,743][118044] Updated weights for policy 0, policy_version 43400 (0.0006) [2023-03-07 04:19:10,540][118044] Updated weights for policy 0, policy_version 43410 (0.0006) [2023-03-07 04:19:11,085][117718] Fps is (10 sec: 13209.5, 60 sec: 13175.5, 300 sec: 13152.3). Total num frames: 44459008. Throughput: 0: 13158.9. Samples: 44425197. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:19:11,086][117718] Avg episode reward: [(0, '2985.031')] [2023-03-07 04:19:11,318][118044] Updated weights for policy 0, policy_version 43420 (0.0006) [2023-03-07 04:19:12,095][118044] Updated weights for policy 0, policy_version 43430 (0.0006) [2023-03-07 04:19:12,863][118044] Updated weights for policy 0, policy_version 43440 (0.0007) [2023-03-07 04:19:13,637][118044] Updated weights for policy 0, policy_version 43450 (0.0006) [2023-03-07 04:19:14,414][118044] Updated weights for policy 0, policy_version 43460 (0.0006) [2023-03-07 04:19:15,202][118044] Updated weights for policy 0, policy_version 43470 (0.0007) [2023-03-07 04:19:15,982][118044] Updated weights for policy 0, policy_version 43480 (0.0006) [2023-03-07 04:19:16,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 44524544. Throughput: 0: 13147.3. Samples: 44504053. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:19:16,087][117718] Avg episode reward: [(0, '3005.302')] [2023-03-07 04:19:16,754][118044] Updated weights for policy 0, policy_version 43490 (0.0006) [2023-03-07 04:19:17,526][118044] Updated weights for policy 0, policy_version 43500 (0.0006) [2023-03-07 04:19:18,320][118044] Updated weights for policy 0, policy_version 43510 (0.0006) [2023-03-07 04:19:19,089][118044] Updated weights for policy 0, policy_version 43520 (0.0007) [2023-03-07 04:19:19,863][118044] Updated weights for policy 0, policy_version 43530 (0.0006) [2023-03-07 04:19:20,661][118044] Updated weights for policy 0, policy_version 43540 (0.0006) [2023-03-07 04:19:21,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 44590080. Throughput: 0: 13146.1. Samples: 44582775. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:19:21,086][117718] Avg episode reward: [(0, '3001.067')] [2023-03-07 04:19:21,431][118044] Updated weights for policy 0, policy_version 43550 (0.0007) [2023-03-07 04:19:22,230][118044] Updated weights for policy 0, policy_version 43560 (0.0007) [2023-03-07 04:19:22,999][118044] Updated weights for policy 0, policy_version 43570 (0.0006) [2023-03-07 04:19:23,773][118044] Updated weights for policy 0, policy_version 43580 (0.0006) [2023-03-07 04:19:24,554][118044] Updated weights for policy 0, policy_version 43590 (0.0005) [2023-03-07 04:19:25,329][118044] Updated weights for policy 0, policy_version 43600 (0.0006) [2023-03-07 04:19:26,086][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 44655616. Throughput: 0: 13146.9. Samples: 44622110. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:19:26,086][117718] Avg episode reward: [(0, '2981.379')] [2023-03-07 04:19:26,113][118044] Updated weights for policy 0, policy_version 43610 (0.0006) [2023-03-07 04:19:26,905][118044] Updated weights for policy 0, policy_version 43620 (0.0007) [2023-03-07 04:19:27,687][118044] Updated weights for policy 0, policy_version 43630 (0.0006) [2023-03-07 04:19:28,461][118044] Updated weights for policy 0, policy_version 43640 (0.0006) [2023-03-07 04:19:29,225][118044] Updated weights for policy 0, policy_version 43650 (0.0006) [2023-03-07 04:19:30,017][118044] Updated weights for policy 0, policy_version 43660 (0.0006) [2023-03-07 04:19:30,775][118044] Updated weights for policy 0, policy_version 43670 (0.0006) [2023-03-07 04:19:31,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 44721152. Throughput: 0: 13148.7. Samples: 44700976. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:19:31,086][117718] Avg episode reward: [(0, '2985.245')] [2023-03-07 04:19:31,550][118044] Updated weights for policy 0, policy_version 43680 (0.0006) [2023-03-07 04:19:32,326][118044] Updated weights for policy 0, policy_version 43690 (0.0006) [2023-03-07 04:19:33,097][118044] Updated weights for policy 0, policy_version 43700 (0.0006) [2023-03-07 04:19:33,882][118044] Updated weights for policy 0, policy_version 43710 (0.0006) [2023-03-07 04:19:34,660][118044] Updated weights for policy 0, policy_version 43720 (0.0006) [2023-03-07 04:19:35,442][118044] Updated weights for policy 0, policy_version 43730 (0.0006) [2023-03-07 04:19:36,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 44787712. Throughput: 0: 13155.0. Samples: 44780226. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:19:36,086][117718] Avg episode reward: [(0, '2913.635')] [2023-03-07 04:19:36,236][118044] Updated weights for policy 0, policy_version 43740 (0.0007) [2023-03-07 04:19:37,006][118044] Updated weights for policy 0, policy_version 43750 (0.0006) [2023-03-07 04:19:37,765][118044] Updated weights for policy 0, policy_version 43760 (0.0006) [2023-03-07 04:19:38,573][118044] Updated weights for policy 0, policy_version 43770 (0.0006) [2023-03-07 04:19:39,344][118044] Updated weights for policy 0, policy_version 43780 (0.0006) [2023-03-07 04:19:40,134][118044] Updated weights for policy 0, policy_version 43790 (0.0006) [2023-03-07 04:19:40,927][118044] Updated weights for policy 0, policy_version 43800 (0.0006) [2023-03-07 04:19:41,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 44852224. Throughput: 0: 13154.5. Samples: 44819576. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:19:41,086][117718] Avg episode reward: [(0, '2853.408')] [2023-03-07 04:19:41,710][118044] Updated weights for policy 0, policy_version 43810 (0.0006) [2023-03-07 04:19:42,487][118044] Updated weights for policy 0, policy_version 43820 (0.0006) [2023-03-07 04:19:43,274][118044] Updated weights for policy 0, policy_version 43830 (0.0006) [2023-03-07 04:19:44,064][118044] Updated weights for policy 0, policy_version 43840 (0.0006) [2023-03-07 04:19:44,836][118044] Updated weights for policy 0, policy_version 43850 (0.0007) [2023-03-07 04:19:45,610][118044] Updated weights for policy 0, policy_version 43860 (0.0007) [2023-03-07 04:19:46,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13152.3). Total num frames: 44918784. Throughput: 0: 13138.8. Samples: 44897859. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:19:46,086][117718] Avg episode reward: [(0, '2910.783')] [2023-03-07 04:19:46,387][118044] Updated weights for policy 0, policy_version 43870 (0.0006) [2023-03-07 04:19:47,160][118044] Updated weights for policy 0, policy_version 43880 (0.0006) [2023-03-07 04:19:47,962][118044] Updated weights for policy 0, policy_version 43890 (0.0007) [2023-03-07 04:19:48,724][118044] Updated weights for policy 0, policy_version 43900 (0.0005) [2023-03-07 04:19:49,501][118044] Updated weights for policy 0, policy_version 43910 (0.0006) [2023-03-07 04:19:50,288][118044] Updated weights for policy 0, policy_version 43920 (0.0007) [2023-03-07 04:19:51,055][118044] Updated weights for policy 0, policy_version 43930 (0.0006) [2023-03-07 04:19:51,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 44984320. Throughput: 0: 13136.4. Samples: 44976757. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:19:51,086][117718] Avg episode reward: [(0, '3003.482')] [2023-03-07 04:19:51,844][118044] Updated weights for policy 0, policy_version 43940 (0.0005) [2023-03-07 04:19:52,616][118044] Updated weights for policy 0, policy_version 43950 (0.0008) [2023-03-07 04:19:53,403][118044] Updated weights for policy 0, policy_version 43960 (0.0006) [2023-03-07 04:19:54,185][118044] Updated weights for policy 0, policy_version 43970 (0.0006) [2023-03-07 04:19:54,964][118044] Updated weights for policy 0, policy_version 43980 (0.0006) [2023-03-07 04:19:55,745][118044] Updated weights for policy 0, policy_version 43990 (0.0006) [2023-03-07 04:19:56,086][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 45049856. Throughput: 0: 13134.3. Samples: 45016243. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:19:56,086][117718] Avg episode reward: [(0, '3074.444')] [2023-03-07 04:19:56,511][118044] Updated weights for policy 0, policy_version 44000 (0.0005) [2023-03-07 04:19:57,303][118044] Updated weights for policy 0, policy_version 44010 (0.0007) [2023-03-07 04:19:58,069][118044] Updated weights for policy 0, policy_version 44020 (0.0006) [2023-03-07 04:19:58,827][118044] Updated weights for policy 0, policy_version 44030 (0.0006) [2023-03-07 04:19:59,618][118044] Updated weights for policy 0, policy_version 44040 (0.0006) [2023-03-07 04:20:00,397][118044] Updated weights for policy 0, policy_version 44050 (0.0007) [2023-03-07 04:20:01,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 45116416. Throughput: 0: 13138.3. Samples: 45095276. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:20:01,086][117718] Avg episode reward: [(0, '2960.126')] [2023-03-07 04:20:01,155][118044] Updated weights for policy 0, policy_version 44060 (0.0007) [2023-03-07 04:20:01,940][118044] Updated weights for policy 0, policy_version 44070 (0.0006) [2023-03-07 04:20:02,717][118044] Updated weights for policy 0, policy_version 44080 (0.0006) [2023-03-07 04:20:03,484][118044] Updated weights for policy 0, policy_version 44090 (0.0006) [2023-03-07 04:20:04,262][118044] Updated weights for policy 0, policy_version 44100 (0.0006) [2023-03-07 04:20:05,039][118044] Updated weights for policy 0, policy_version 44110 (0.0007) [2023-03-07 04:20:05,814][118044] Updated weights for policy 0, policy_version 44120 (0.0006) [2023-03-07 04:20:06,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 45181952. Throughput: 0: 13153.7. Samples: 45174690. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:20:06,086][117718] Avg episode reward: [(0, '2921.371')] [2023-03-07 04:20:06,584][118044] Updated weights for policy 0, policy_version 44130 (0.0006) [2023-03-07 04:20:07,368][118044] Updated weights for policy 0, policy_version 44140 (0.0006) [2023-03-07 04:20:08,148][118044] Updated weights for policy 0, policy_version 44150 (0.0006) [2023-03-07 04:20:08,930][118044] Updated weights for policy 0, policy_version 44160 (0.0006) [2023-03-07 04:20:09,704][118044] Updated weights for policy 0, policy_version 44170 (0.0006) [2023-03-07 04:20:10,486][118044] Updated weights for policy 0, policy_version 44180 (0.0006) [2023-03-07 04:20:11,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 45247488. Throughput: 0: 13153.3. Samples: 45214010. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:20:11,086][117718] Avg episode reward: [(0, '2964.467')] [2023-03-07 04:20:11,295][118044] Updated weights for policy 0, policy_version 44190 (0.0006) [2023-03-07 04:20:12,067][118044] Updated weights for policy 0, policy_version 44200 (0.0006) [2023-03-07 04:20:12,835][118044] Updated weights for policy 0, policy_version 44210 (0.0005) [2023-03-07 04:20:13,615][118044] Updated weights for policy 0, policy_version 44220 (0.0006) [2023-03-07 04:20:14,405][118044] Updated weights for policy 0, policy_version 44230 (0.0006) [2023-03-07 04:20:15,195][118044] Updated weights for policy 0, policy_version 44240 (0.0007) [2023-03-07 04:20:15,972][118044] Updated weights for policy 0, policy_version 44250 (0.0006) [2023-03-07 04:20:16,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.4, 300 sec: 13148.9). Total num frames: 45313024. Throughput: 0: 13146.4. Samples: 45292565. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:20:16,086][117718] Avg episode reward: [(0, '2912.493')] [2023-03-07 04:20:16,746][118044] Updated weights for policy 0, policy_version 44260 (0.0006) [2023-03-07 04:20:17,521][118044] Updated weights for policy 0, policy_version 44270 (0.0006) [2023-03-07 04:20:18,302][118044] Updated weights for policy 0, policy_version 44280 (0.0006) [2023-03-07 04:20:19,085][118044] Updated weights for policy 0, policy_version 44290 (0.0006) [2023-03-07 04:20:19,856][118044] Updated weights for policy 0, policy_version 44300 (0.0006) [2023-03-07 04:20:20,626][118044] Updated weights for policy 0, policy_version 44310 (0.0006) [2023-03-07 04:20:21,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13148.8). Total num frames: 45378560. Throughput: 0: 13139.4. Samples: 45371501. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:20:21,086][117718] Avg episode reward: [(0, '2896.597')] [2023-03-07 04:20:21,406][118044] Updated weights for policy 0, policy_version 44320 (0.0006) [2023-03-07 04:20:22,195][118044] Updated weights for policy 0, policy_version 44330 (0.0006) [2023-03-07 04:20:22,992][118044] Updated weights for policy 0, policy_version 44340 (0.0006) [2023-03-07 04:20:23,771][118044] Updated weights for policy 0, policy_version 44350 (0.0007) [2023-03-07 04:20:24,562][118044] Updated weights for policy 0, policy_version 44360 (0.0006) [2023-03-07 04:20:25,338][118044] Updated weights for policy 0, policy_version 44370 (0.0006) [2023-03-07 04:20:26,086][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 45444096. Throughput: 0: 13135.2. Samples: 45410658. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:20:26,086][117718] Avg episode reward: [(0, '2824.613')] [2023-03-07 04:20:26,125][118044] Updated weights for policy 0, policy_version 44380 (0.0006) [2023-03-07 04:20:26,906][118044] Updated weights for policy 0, policy_version 44390 (0.0006) [2023-03-07 04:20:27,687][118044] Updated weights for policy 0, policy_version 44400 (0.0006) [2023-03-07 04:20:28,469][118044] Updated weights for policy 0, policy_version 44410 (0.0007) [2023-03-07 04:20:29,246][118044] Updated weights for policy 0, policy_version 44420 (0.0006) [2023-03-07 04:20:30,026][118044] Updated weights for policy 0, policy_version 44430 (0.0006) [2023-03-07 04:20:30,789][118044] Updated weights for policy 0, policy_version 44440 (0.0006) [2023-03-07 04:20:31,085][117718] Fps is (10 sec: 13107.4, 60 sec: 13141.4, 300 sec: 13148.9). Total num frames: 45509632. Throughput: 0: 13145.5. Samples: 45489406. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:20:31,086][117718] Avg episode reward: [(0, '2973.212')] [2023-03-07 04:20:31,577][118044] Updated weights for policy 0, policy_version 44450 (0.0007) [2023-03-07 04:20:32,362][118044] Updated weights for policy 0, policy_version 44460 (0.0006) [2023-03-07 04:20:33,142][118044] Updated weights for policy 0, policy_version 44470 (0.0006) [2023-03-07 04:20:33,940][118044] Updated weights for policy 0, policy_version 44480 (0.0006) [2023-03-07 04:20:34,709][118044] Updated weights for policy 0, policy_version 44490 (0.0006) [2023-03-07 04:20:35,481][118044] Updated weights for policy 0, policy_version 44500 (0.0006) [2023-03-07 04:20:36,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13145.4). Total num frames: 45575168. Throughput: 0: 13140.7. Samples: 45568091. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:20:36,086][117718] Avg episode reward: [(0, '2964.511')] [2023-03-07 04:20:36,090][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000044507_45575168.pth... 
[2023-03-07 04:20:36,121][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000041426_42420224.pth [2023-03-07 04:20:36,269][118044] Updated weights for policy 0, policy_version 44510 (0.0006) [2023-03-07 04:20:37,067][118044] Updated weights for policy 0, policy_version 44520 (0.0006) [2023-03-07 04:20:37,841][118044] Updated weights for policy 0, policy_version 44530 (0.0006) [2023-03-07 04:20:38,615][118044] Updated weights for policy 0, policy_version 44540 (0.0006) [2023-03-07 04:20:39,393][118044] Updated weights for policy 0, policy_version 44550 (0.0006) [2023-03-07 04:20:40,180][118044] Updated weights for policy 0, policy_version 44560 (0.0007) [2023-03-07 04:20:40,958][118044] Updated weights for policy 0, policy_version 44570 (0.0006) [2023-03-07 04:20:41,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 45640704. Throughput: 0: 13135.3. Samples: 45607330. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:20:41,086][117718] Avg episode reward: [(0, '2990.495')] [2023-03-07 04:20:41,739][118044] Updated weights for policy 0, policy_version 44580 (0.0006) [2023-03-07 04:20:42,509][118044] Updated weights for policy 0, policy_version 44590 (0.0006) [2023-03-07 04:20:43,301][118044] Updated weights for policy 0, policy_version 44600 (0.0006) [2023-03-07 04:20:44,087][118044] Updated weights for policy 0, policy_version 44610 (0.0007) [2023-03-07 04:20:44,853][118044] Updated weights for policy 0, policy_version 44620 (0.0007) [2023-03-07 04:20:45,639][118044] Updated weights for policy 0, policy_version 44630 (0.0006) [2023-03-07 04:20:46,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13124.3, 300 sec: 13145.4). Total num frames: 45706240. Throughput: 0: 13123.5. Samples: 45685835. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:20:46,086][117718] Avg episode reward: [(0, '2982.418')] [2023-03-07 04:20:46,409][118044] Updated weights for policy 0, policy_version 44640 (0.0006) [2023-03-07 04:20:47,198][118044] Updated weights for policy 0, policy_version 44650 (0.0006) [2023-03-07 04:20:47,970][118044] Updated weights for policy 0, policy_version 44660 (0.0006) [2023-03-07 04:20:48,770][118044] Updated weights for policy 0, policy_version 44670 (0.0006) [2023-03-07 04:20:49,537][118044] Updated weights for policy 0, policy_version 44680 (0.0006) [2023-03-07 04:20:50,305][118044] Updated weights for policy 0, policy_version 44690 (0.0007) [2023-03-07 04:20:51,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.2, 300 sec: 13141.9). Total num frames: 45771776. Throughput: 0: 13114.1. Samples: 45764824. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:20:51,086][117718] Avg episode reward: [(0, '2999.072')] [2023-03-07 04:20:51,094][118044] Updated weights for policy 0, policy_version 44700 (0.0007) [2023-03-07 04:20:51,886][118044] Updated weights for policy 0, policy_version 44710 (0.0006) [2023-03-07 04:20:52,658][118044] Updated weights for policy 0, policy_version 44720 (0.0006) [2023-03-07 04:20:53,454][118044] Updated weights for policy 0, policy_version 44730 (0.0006) [2023-03-07 04:20:54,220][118044] Updated weights for policy 0, policy_version 44740 (0.0007) [2023-03-07 04:20:55,010][118044] Updated weights for policy 0, policy_version 44750 (0.0008) [2023-03-07 04:20:55,788][118044] Updated weights for policy 0, policy_version 44760 (0.0005) [2023-03-07 04:20:56,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13141.9). Total num frames: 45837312. Throughput: 0: 13111.2. Samples: 45804016. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:20:56,086][117718] Avg episode reward: [(0, '2968.620')] [2023-03-07 04:20:56,559][118044] Updated weights for policy 0, policy_version 44770 (0.0006) [2023-03-07 04:20:57,340][118044] Updated weights for policy 0, policy_version 44780 (0.0006) [2023-03-07 04:20:58,107][118044] Updated weights for policy 0, policy_version 44790 (0.0005) [2023-03-07 04:20:58,894][118044] Updated weights for policy 0, policy_version 44800 (0.0006) [2023-03-07 04:20:59,685][118044] Updated weights for policy 0, policy_version 44810 (0.0006) [2023-03-07 04:21:00,458][118044] Updated weights for policy 0, policy_version 44820 (0.0006) [2023-03-07 04:21:01,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13124.3, 300 sec: 13145.4). Total num frames: 45903872. Throughput: 0: 13119.2. Samples: 45882930. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:21:01,086][117718] Avg episode reward: [(0, '2965.076')] [2023-03-07 04:21:01,222][118044] Updated weights for policy 0, policy_version 44830 (0.0007) [2023-03-07 04:21:01,998][118044] Updated weights for policy 0, policy_version 44840 (0.0007) [2023-03-07 04:21:02,791][118044] Updated weights for policy 0, policy_version 44850 (0.0007) [2023-03-07 04:21:03,585][118044] Updated weights for policy 0, policy_version 44860 (0.0006) [2023-03-07 04:21:04,339][118044] Updated weights for policy 0, policy_version 44870 (0.0006) [2023-03-07 04:21:05,121][118044] Updated weights for policy 0, policy_version 44880 (0.0006) [2023-03-07 04:21:05,900][118044] Updated weights for policy 0, policy_version 44890 (0.0007) [2023-03-07 04:21:06,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13124.3, 300 sec: 13145.4). Total num frames: 45969408. Throughput: 0: 13121.6. Samples: 45961973. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:21:06,086][117718] Avg episode reward: [(0, '3035.649')] [2023-03-07 04:21:06,694][118044] Updated weights for policy 0, policy_version 44900 (0.0006) [2023-03-07 04:21:07,455][118044] Updated weights for policy 0, policy_version 44910 (0.0006) [2023-03-07 04:21:08,256][118044] Updated weights for policy 0, policy_version 44920 (0.0006) [2023-03-07 04:21:09,026][118044] Updated weights for policy 0, policy_version 44930 (0.0006) [2023-03-07 04:21:09,802][118044] Updated weights for policy 0, policy_version 44940 (0.0005) [2023-03-07 04:21:10,575][118044] Updated weights for policy 0, policy_version 44950 (0.0007) [2023-03-07 04:21:11,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13124.3, 300 sec: 13141.9). Total num frames: 46034944. Throughput: 0: 13127.0. Samples: 46001375. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:21:11,086][117718] Avg episode reward: [(0, '3033.884')] [2023-03-07 04:21:11,344][118044] Updated weights for policy 0, policy_version 44960 (0.0006) [2023-03-07 04:21:12,124][118044] Updated weights for policy 0, policy_version 44970 (0.0006) [2023-03-07 04:21:12,905][118044] Updated weights for policy 0, policy_version 44980 (0.0007) [2023-03-07 04:21:13,687][118044] Updated weights for policy 0, policy_version 44990 (0.0006) [2023-03-07 04:21:14,469][118044] Updated weights for policy 0, policy_version 45000 (0.0005) [2023-03-07 04:21:15,245][118044] Updated weights for policy 0, policy_version 45010 (0.0006) [2023-03-07 04:21:16,038][118044] Updated weights for policy 0, policy_version 45020 (0.0006) [2023-03-07 04:21:16,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13124.3, 300 sec: 13141.9). Total num frames: 46100480. Throughput: 0: 13129.0. Samples: 46080211. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:21:16,086][117718] Avg episode reward: [(0, '3038.392')] [2023-03-07 04:21:16,818][118044] Updated weights for policy 0, policy_version 45030 (0.0006) [2023-03-07 04:21:17,599][118044] Updated weights for policy 0, policy_version 45040 (0.0006) [2023-03-07 04:21:18,365][118044] Updated weights for policy 0, policy_version 45050 (0.0008) [2023-03-07 04:21:19,154][118044] Updated weights for policy 0, policy_version 45060 (0.0006) [2023-03-07 04:21:19,938][118044] Updated weights for policy 0, policy_version 45070 (0.0006) [2023-03-07 04:21:20,704][118044] Updated weights for policy 0, policy_version 45080 (0.0006) [2023-03-07 04:21:21,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13141.9). Total num frames: 46166016. Throughput: 0: 13134.2. Samples: 46159133. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:21:21,086][117718] Avg episode reward: [(0, '3027.253')] [2023-03-07 04:21:21,487][118044] Updated weights for policy 0, policy_version 45090 (0.0007) [2023-03-07 04:21:22,267][118044] Updated weights for policy 0, policy_version 45100 (0.0006) [2023-03-07 04:21:23,040][118044] Updated weights for policy 0, policy_version 45110 (0.0006) [2023-03-07 04:21:23,809][118044] Updated weights for policy 0, policy_version 45120 (0.0007) [2023-03-07 04:21:24,581][118044] Updated weights for policy 0, policy_version 45130 (0.0006) [2023-03-07 04:21:25,373][118044] Updated weights for policy 0, policy_version 45140 (0.0006) [2023-03-07 04:21:26,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 46232576. Throughput: 0: 13137.4. Samples: 46198515. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:21:26,086][117718] Avg episode reward: [(0, '2951.723')] [2023-03-07 04:21:26,150][118044] Updated weights for policy 0, policy_version 45150 (0.0006) [2023-03-07 04:21:26,963][118044] Updated weights for policy 0, policy_version 45160 (0.0006) [2023-03-07 04:21:27,724][118044] Updated weights for policy 0, policy_version 45170 (0.0006) [2023-03-07 04:21:28,496][118044] Updated weights for policy 0, policy_version 45180 (0.0006) [2023-03-07 04:21:29,289][118044] Updated weights for policy 0, policy_version 45190 (0.0005) [2023-03-07 04:21:30,058][118044] Updated weights for policy 0, policy_version 45200 (0.0006) [2023-03-07 04:21:30,818][118044] Updated weights for policy 0, policy_version 45210 (0.0006) [2023-03-07 04:21:31,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 46298112. Throughput: 0: 13145.2. Samples: 46277367. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:21:31,086][117718] Avg episode reward: [(0, '3014.653')] [2023-03-07 04:21:31,601][118044] Updated weights for policy 0, policy_version 45220 (0.0005) [2023-03-07 04:21:32,385][118044] Updated weights for policy 0, policy_version 45230 (0.0006) [2023-03-07 04:21:33,169][118044] Updated weights for policy 0, policy_version 45240 (0.0007) [2023-03-07 04:21:33,945][118044] Updated weights for policy 0, policy_version 45250 (0.0006) [2023-03-07 04:21:34,741][118044] Updated weights for policy 0, policy_version 45260 (0.0006) [2023-03-07 04:21:35,523][118044] Updated weights for policy 0, policy_version 45270 (0.0006) [2023-03-07 04:21:36,085][117718] Fps is (10 sec: 13107.4, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 46363648. Throughput: 0: 13139.7. Samples: 46356110. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:21:36,086][117718] Avg episode reward: [(0, '2932.264')] [2023-03-07 04:21:36,289][118044] Updated weights for policy 0, policy_version 45280 (0.0007) [2023-03-07 04:21:37,073][118044] Updated weights for policy 0, policy_version 45290 (0.0006) [2023-03-07 04:21:37,838][118044] Updated weights for policy 0, policy_version 45300 (0.0005) [2023-03-07 04:21:38,612][118044] Updated weights for policy 0, policy_version 45310 (0.0006) [2023-03-07 04:21:39,389][118044] Updated weights for policy 0, policy_version 45320 (0.0006) [2023-03-07 04:21:40,170][118044] Updated weights for policy 0, policy_version 45330 (0.0006) [2023-03-07 04:21:40,959][118044] Updated weights for policy 0, policy_version 45340 (0.0007) [2023-03-07 04:21:41,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 46429184. Throughput: 0: 13147.1. Samples: 46395637. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:21:41,086][117718] Avg episode reward: [(0, '3080.883')] [2023-03-07 04:21:41,724][118044] Updated weights for policy 0, policy_version 45350 (0.0006) [2023-03-07 04:21:42,509][118044] Updated weights for policy 0, policy_version 45360 (0.0006) [2023-03-07 04:21:43,286][118044] Updated weights for policy 0, policy_version 45370 (0.0006) [2023-03-07 04:21:44,075][118044] Updated weights for policy 0, policy_version 45380 (0.0005) [2023-03-07 04:21:44,837][118044] Updated weights for policy 0, policy_version 45390 (0.0006) [2023-03-07 04:21:45,615][118044] Updated weights for policy 0, policy_version 45400 (0.0006) [2023-03-07 04:21:46,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 46495744. Throughput: 0: 13146.4. Samples: 46474516. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:21:46,086][117718] Avg episode reward: [(0, '2988.005')] [2023-03-07 04:21:46,392][118044] Updated weights for policy 0, policy_version 45410 (0.0006) [2023-03-07 04:21:47,160][118044] Updated weights for policy 0, policy_version 45420 (0.0005) [2023-03-07 04:21:47,933][118044] Updated weights for policy 0, policy_version 45430 (0.0006) [2023-03-07 04:21:48,715][118044] Updated weights for policy 0, policy_version 45440 (0.0006) [2023-03-07 04:21:49,507][118044] Updated weights for policy 0, policy_version 45450 (0.0005) [2023-03-07 04:21:50,257][118044] Updated weights for policy 0, policy_version 45460 (0.0006) [2023-03-07 04:21:51,046][118044] Updated weights for policy 0, policy_version 45470 (0.0006) [2023-03-07 04:21:51,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 46561280. Throughput: 0: 13154.8. Samples: 46553941. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:21:51,086][117718] Avg episode reward: [(0, '2914.493')] [2023-03-07 04:21:51,827][118044] Updated weights for policy 0, policy_version 45480 (0.0006) [2023-03-07 04:21:52,617][118044] Updated weights for policy 0, policy_version 45490 (0.0006) [2023-03-07 04:21:53,379][118044] Updated weights for policy 0, policy_version 45500 (0.0006) [2023-03-07 04:21:54,165][118044] Updated weights for policy 0, policy_version 45510 (0.0006) [2023-03-07 04:21:54,932][118044] Updated weights for policy 0, policy_version 45520 (0.0006) [2023-03-07 04:21:55,725][118044] Updated weights for policy 0, policy_version 45530 (0.0006) [2023-03-07 04:21:56,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 46626816. Throughput: 0: 13154.3. Samples: 46593319. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:21:56,086][117718] Avg episode reward: [(0, '2987.741')] [2023-03-07 04:21:56,477][118044] Updated weights for policy 0, policy_version 45540 (0.0006) [2023-03-07 04:21:57,274][118044] Updated weights for policy 0, policy_version 45550 (0.0008) [2023-03-07 04:21:58,045][118044] Updated weights for policy 0, policy_version 45560 (0.0006) [2023-03-07 04:21:58,817][118044] Updated weights for policy 0, policy_version 45570 (0.0006) [2023-03-07 04:21:59,593][118044] Updated weights for policy 0, policy_version 45580 (0.0006) [2023-03-07 04:22:00,385][118044] Updated weights for policy 0, policy_version 45590 (0.0006) [2023-03-07 04:22:01,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 46693376. Throughput: 0: 13163.5. Samples: 46672569. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:22:01,086][117718] Avg episode reward: [(0, '2955.237')] [2023-03-07 04:22:01,163][118044] Updated weights for policy 0, policy_version 45600 (0.0006) [2023-03-07 04:22:01,942][118044] Updated weights for policy 0, policy_version 45610 (0.0007) [2023-03-07 04:22:02,706][118044] Updated weights for policy 0, policy_version 45620 (0.0005) [2023-03-07 04:22:03,483][118044] Updated weights for policy 0, policy_version 45630 (0.0007) [2023-03-07 04:22:04,261][118044] Updated weights for policy 0, policy_version 45640 (0.0005) [2023-03-07 04:22:05,032][118044] Updated weights for policy 0, policy_version 45650 (0.0006) [2023-03-07 04:22:05,809][118044] Updated weights for policy 0, policy_version 45660 (0.0006) [2023-03-07 04:22:06,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 46758912. Throughput: 0: 13165.2. Samples: 46751568. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:22:06,086][117718] Avg episode reward: [(0, '2965.895')] [2023-03-07 04:22:06,613][118044] Updated weights for policy 0, policy_version 45670 (0.0006) [2023-03-07 04:22:07,403][118044] Updated weights for policy 0, policy_version 45680 (0.0006) [2023-03-07 04:22:08,170][118044] Updated weights for policy 0, policy_version 45690 (0.0006) [2023-03-07 04:22:08,958][118044] Updated weights for policy 0, policy_version 45700 (0.0006) [2023-03-07 04:22:09,747][118044] Updated weights for policy 0, policy_version 45710 (0.0006) [2023-03-07 04:22:10,518][118044] Updated weights for policy 0, policy_version 45720 (0.0008) [2023-03-07 04:22:11,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 46824448. Throughput: 0: 13162.1. Samples: 46790806. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:22:11,086][117718] Avg episode reward: [(0, '3106.576')] [2023-03-07 04:22:11,293][118044] Updated weights for policy 0, policy_version 45730 (0.0006) [2023-03-07 04:22:12,067][118044] Updated weights for policy 0, policy_version 45740 (0.0006) [2023-03-07 04:22:12,833][118044] Updated weights for policy 0, policy_version 45750 (0.0006) [2023-03-07 04:22:13,601][118044] Updated weights for policy 0, policy_version 45760 (0.0007) [2023-03-07 04:22:14,382][118044] Updated weights for policy 0, policy_version 45770 (0.0005) [2023-03-07 04:22:15,151][118044] Updated weights for policy 0, policy_version 45780 (0.0006) [2023-03-07 04:22:15,930][118044] Updated weights for policy 0, policy_version 45790 (0.0006) [2023-03-07 04:22:16,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13148.9). Total num frames: 46891008. Throughput: 0: 13165.3. Samples: 46869805. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:22:16,086][117718] Avg episode reward: [(0, '3000.511')] [2023-03-07 04:22:16,707][118044] Updated weights for policy 0, policy_version 45800 (0.0007) [2023-03-07 04:22:17,490][118044] Updated weights for policy 0, policy_version 45810 (0.0006) [2023-03-07 04:22:18,248][118044] Updated weights for policy 0, policy_version 45820 (0.0005) [2023-03-07 04:22:19,055][118044] Updated weights for policy 0, policy_version 45830 (0.0006) [2023-03-07 04:22:19,813][118044] Updated weights for policy 0, policy_version 45840 (0.0006) [2023-03-07 04:22:20,609][118044] Updated weights for policy 0, policy_version 45850 (0.0006) [2023-03-07 04:22:21,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13148.9). Total num frames: 46956544. Throughput: 0: 13174.6. Samples: 46948965. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:22:21,086][117718] Avg episode reward: [(0, '3030.079')] [2023-03-07 04:22:21,394][118044] Updated weights for policy 0, policy_version 45860 (0.0006) [2023-03-07 04:22:22,162][118044] Updated weights for policy 0, policy_version 45870 (0.0006) [2023-03-07 04:22:22,937][118044] Updated weights for policy 0, policy_version 45880 (0.0006) [2023-03-07 04:22:23,728][118044] Updated weights for policy 0, policy_version 45890 (0.0007) [2023-03-07 04:22:24,495][118044] Updated weights for policy 0, policy_version 45900 (0.0008) [2023-03-07 04:22:25,278][118044] Updated weights for policy 0, policy_version 45910 (0.0006) [2023-03-07 04:22:26,061][118044] Updated weights for policy 0, policy_version 45920 (0.0006) [2023-03-07 04:22:26,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 47022080. Throughput: 0: 13171.9. Samples: 46988370. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:22:26,086][117718] Avg episode reward: [(0, '2894.263')] [2023-03-07 04:22:26,845][118044] Updated weights for policy 0, policy_version 45930 (0.0006) [2023-03-07 04:22:27,633][118044] Updated weights for policy 0, policy_version 45940 (0.0006) [2023-03-07 04:22:28,388][118044] Updated weights for policy 0, policy_version 45950 (0.0006) [2023-03-07 04:22:29,161][118044] Updated weights for policy 0, policy_version 45960 (0.0006) [2023-03-07 04:22:29,937][118044] Updated weights for policy 0, policy_version 45970 (0.0006) [2023-03-07 04:22:30,711][118044] Updated weights for policy 0, policy_version 45980 (0.0007) [2023-03-07 04:22:31,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 47087616. Throughput: 0: 13171.1. Samples: 47067218. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:22:31,086][117718] Avg episode reward: [(0, '2982.979')] [2023-03-07 04:22:31,485][118044] Updated weights for policy 0, policy_version 45990 (0.0006) [2023-03-07 04:22:32,259][118044] Updated weights for policy 0, policy_version 46000 (0.0006) [2023-03-07 04:22:33,049][118044] Updated weights for policy 0, policy_version 46010 (0.0006) [2023-03-07 04:22:33,811][118044] Updated weights for policy 0, policy_version 46020 (0.0006) [2023-03-07 04:22:34,587][118044] Updated weights for policy 0, policy_version 46030 (0.0006) [2023-03-07 04:22:35,372][118044] Updated weights for policy 0, policy_version 46040 (0.0007) [2023-03-07 04:22:36,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13148.9). Total num frames: 47154176. Throughput: 0: 13165.8. Samples: 47146404. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:22:36,086][117718] Avg episode reward: [(0, '2994.306')] [2023-03-07 04:22:36,091][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000046049_47154176.pth... 
[2023-03-07 04:22:36,122][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000042968_43999232.pth [2023-03-07 04:22:36,148][118044] Updated weights for policy 0, policy_version 46050 (0.0005) [2023-03-07 04:22:36,919][118044] Updated weights for policy 0, policy_version 46060 (0.0006) [2023-03-07 04:22:37,694][118044] Updated weights for policy 0, policy_version 46070 (0.0006) [2023-03-07 04:22:38,477][118044] Updated weights for policy 0, policy_version 46080 (0.0005) [2023-03-07 04:22:39,242][118044] Updated weights for policy 0, policy_version 46090 (0.0006) [2023-03-07 04:22:40,016][118044] Updated weights for policy 0, policy_version 46100 (0.0006) [2023-03-07 04:22:40,790][118044] Updated weights for policy 0, policy_version 46110 (0.0006) [2023-03-07 04:22:41,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13175.5, 300 sec: 13148.9). Total num frames: 47219712. Throughput: 0: 13172.8. Samples: 47186098. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:22:41,086][117718] Avg episode reward: [(0, '2880.035')] [2023-03-07 04:22:41,566][118044] Updated weights for policy 0, policy_version 46120 (0.0006) [2023-03-07 04:22:42,352][118044] Updated weights for policy 0, policy_version 46130 (0.0006) [2023-03-07 04:22:43,142][118044] Updated weights for policy 0, policy_version 46140 (0.0006) [2023-03-07 04:22:43,928][118044] Updated weights for policy 0, policy_version 46150 (0.0006) [2023-03-07 04:22:44,705][118044] Updated weights for policy 0, policy_version 46160 (0.0006) [2023-03-07 04:22:45,473][118044] Updated weights for policy 0, policy_version 46170 (0.0006) [2023-03-07 04:22:46,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 47285248. Throughput: 0: 13165.3. Samples: 47265008. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:22:46,086][117718] Avg episode reward: [(0, '2933.561')] [2023-03-07 04:22:46,261][118044] Updated weights for policy 0, policy_version 46180 (0.0007) [2023-03-07 04:22:47,037][118044] Updated weights for policy 0, policy_version 46190 (0.0006) [2023-03-07 04:22:47,812][118044] Updated weights for policy 0, policy_version 46200 (0.0006) [2023-03-07 04:22:48,589][118044] Updated weights for policy 0, policy_version 46210 (0.0006) [2023-03-07 04:22:49,370][118044] Updated weights for policy 0, policy_version 46220 (0.0005) [2023-03-07 04:22:50,160][118044] Updated weights for policy 0, policy_version 46230 (0.0007) [2023-03-07 04:22:50,929][118044] Updated weights for policy 0, policy_version 46240 (0.0006) [2023-03-07 04:22:51,085][117718] Fps is (10 sec: 13209.8, 60 sec: 13175.5, 300 sec: 13152.3). Total num frames: 47351808. Throughput: 0: 13162.5. Samples: 47343880. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:22:51,086][117718] Avg episode reward: [(0, '2992.067')] [2023-03-07 04:22:51,731][118044] Updated weights for policy 0, policy_version 46250 (0.0006) [2023-03-07 04:22:52,498][118044] Updated weights for policy 0, policy_version 46260 (0.0006) [2023-03-07 04:22:53,267][118044] Updated weights for policy 0, policy_version 46270 (0.0007) [2023-03-07 04:22:54,036][118044] Updated weights for policy 0, policy_version 46280 (0.0005) [2023-03-07 04:22:54,830][118044] Updated weights for policy 0, policy_version 46290 (0.0007) [2023-03-07 04:22:55,602][118044] Updated weights for policy 0, policy_version 46300 (0.0006) [2023-03-07 04:22:56,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13175.5, 300 sec: 13152.3). Total num frames: 47417344. Throughput: 0: 13171.6. Samples: 47383528. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:22:56,086][117718] Avg episode reward: [(0, '2976.708')] [2023-03-07 04:22:56,390][118044] Updated weights for policy 0, policy_version 46310 (0.0006) [2023-03-07 04:22:57,141][118044] Updated weights for policy 0, policy_version 46320 (0.0006) [2023-03-07 04:22:57,931][118044] Updated weights for policy 0, policy_version 46330 (0.0007) [2023-03-07 04:22:58,711][118044] Updated weights for policy 0, policy_version 46340 (0.0006) [2023-03-07 04:22:59,477][118044] Updated weights for policy 0, policy_version 46350 (0.0006) [2023-03-07 04:23:00,249][118044] Updated weights for policy 0, policy_version 46360 (0.0006) [2023-03-07 04:23:01,030][118044] Updated weights for policy 0, policy_version 46370 (0.0007) [2023-03-07 04:23:01,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 47482880. Throughput: 0: 13172.5. Samples: 47462568. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:23:01,086][117718] Avg episode reward: [(0, '2906.630')] [2023-03-07 04:23:01,804][118044] Updated weights for policy 0, policy_version 46380 (0.0006) [2023-03-07 04:23:02,574][118044] Updated weights for policy 0, policy_version 46390 (0.0006) [2023-03-07 04:23:03,368][118044] Updated weights for policy 0, policy_version 46400 (0.0006) [2023-03-07 04:23:04,148][118044] Updated weights for policy 0, policy_version 46410 (0.0007) [2023-03-07 04:23:04,939][118044] Updated weights for policy 0, policy_version 46420 (0.0006) [2023-03-07 04:23:05,713][118044] Updated weights for policy 0, policy_version 46430 (0.0006) [2023-03-07 04:23:06,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13155.8). Total num frames: 47549440. Throughput: 0: 13170.0. Samples: 47541614. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:23:06,086][117718] Avg episode reward: [(0, '2866.521')] [2023-03-07 04:23:06,481][118044] Updated weights for policy 0, policy_version 46440 (0.0007) [2023-03-07 04:23:07,259][118044] Updated weights for policy 0, policy_version 46450 (0.0006) [2023-03-07 04:23:08,023][118044] Updated weights for policy 0, policy_version 46460 (0.0006) [2023-03-07 04:23:08,795][118044] Updated weights for policy 0, policy_version 46470 (0.0006) [2023-03-07 04:23:09,557][118044] Updated weights for policy 0, policy_version 46480 (0.0006) [2023-03-07 04:23:10,360][118044] Updated weights for policy 0, policy_version 46490 (0.0006) [2023-03-07 04:23:11,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13152.3). Total num frames: 47614976. Throughput: 0: 13173.5. Samples: 47581177. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:23:11,086][117718] Avg episode reward: [(0, '2873.129')] [2023-03-07 04:23:11,130][118044] Updated weights for policy 0, policy_version 46500 (0.0006) [2023-03-07 04:23:11,902][118044] Updated weights for policy 0, policy_version 46510 (0.0006) [2023-03-07 04:23:12,672][118044] Updated weights for policy 0, policy_version 46520 (0.0006) [2023-03-07 04:23:13,461][118044] Updated weights for policy 0, policy_version 46530 (0.0006) [2023-03-07 04:23:14,254][118044] Updated weights for policy 0, policy_version 46540 (0.0006) [2023-03-07 04:23:15,006][118044] Updated weights for policy 0, policy_version 46550 (0.0007) [2023-03-07 04:23:15,808][118044] Updated weights for policy 0, policy_version 46560 (0.0006) [2023-03-07 04:23:16,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 47680512. Throughput: 0: 13176.1. Samples: 47660143. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:23:16,086][117718] Avg episode reward: [(0, '2980.766')] [2023-03-07 04:23:16,587][118044] Updated weights for policy 0, policy_version 46570 (0.0007) [2023-03-07 04:23:17,374][118044] Updated weights for policy 0, policy_version 46580 (0.0006) [2023-03-07 04:23:18,148][118044] Updated weights for policy 0, policy_version 46590 (0.0006) [2023-03-07 04:23:18,925][118044] Updated weights for policy 0, policy_version 46600 (0.0005) [2023-03-07 04:23:19,712][118044] Updated weights for policy 0, policy_version 46610 (0.0008) [2023-03-07 04:23:20,470][118044] Updated weights for policy 0, policy_version 46620 (0.0006) [2023-03-07 04:23:21,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 47746048. Throughput: 0: 13166.2. Samples: 47738881. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:23:21,086][117718] Avg episode reward: [(0, '2918.442')] [2023-03-07 04:23:21,257][118044] Updated weights for policy 0, policy_version 46630 (0.0007) [2023-03-07 04:23:22,045][118044] Updated weights for policy 0, policy_version 46640 (0.0006) [2023-03-07 04:23:22,802][118044] Updated weights for policy 0, policy_version 46650 (0.0006) [2023-03-07 04:23:23,593][118044] Updated weights for policy 0, policy_version 46660 (0.0006) [2023-03-07 04:23:24,345][118044] Updated weights for policy 0, policy_version 46670 (0.0006) [2023-03-07 04:23:25,136][118044] Updated weights for policy 0, policy_version 46680 (0.0006) [2023-03-07 04:23:25,936][118044] Updated weights for policy 0, policy_version 46690 (0.0007) [2023-03-07 04:23:26,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 47811584. Throughput: 0: 13163.2. Samples: 47778444. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:23:26,086][117718] Avg episode reward: [(0, '3018.022')] [2023-03-07 04:23:26,721][118044] Updated weights for policy 0, policy_version 46700 (0.0006) [2023-03-07 04:23:27,498][118044] Updated weights for policy 0, policy_version 46710 (0.0006) [2023-03-07 04:23:28,287][118044] Updated weights for policy 0, policy_version 46720 (0.0006) [2023-03-07 04:23:29,046][118044] Updated weights for policy 0, policy_version 46730 (0.0006) [2023-03-07 04:23:29,834][118044] Updated weights for policy 0, policy_version 46740 (0.0006) [2023-03-07 04:23:30,634][118044] Updated weights for policy 0, policy_version 46750 (0.0007) [2023-03-07 04:23:31,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13152.3). Total num frames: 47878144. Throughput: 0: 13163.5. Samples: 47857365. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:23:31,086][117718] Avg episode reward: [(0, '2929.487')] [2023-03-07 04:23:31,397][118044] Updated weights for policy 0, policy_version 46760 (0.0006) [2023-03-07 04:23:32,178][118044] Updated weights for policy 0, policy_version 46770 (0.0007) [2023-03-07 04:23:32,969][118044] Updated weights for policy 0, policy_version 46780 (0.0008) [2023-03-07 04:23:33,749][118044] Updated weights for policy 0, policy_version 46790 (0.0007) [2023-03-07 04:23:34,514][118044] Updated weights for policy 0, policy_version 46800 (0.0006) [2023-03-07 04:23:35,301][118044] Updated weights for policy 0, policy_version 46810 (0.0006) [2023-03-07 04:23:36,085][117718] Fps is (10 sec: 13107.4, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 47942656. Throughput: 0: 13153.0. Samples: 47935764. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:23:36,086][117718] Avg episode reward: [(0, '3021.381')] [2023-03-07 04:23:36,094][118044] Updated weights for policy 0, policy_version 46820 (0.0006) [2023-03-07 04:23:36,865][118044] Updated weights for policy 0, policy_version 46830 (0.0006) [2023-03-07 04:23:37,650][118044] Updated weights for policy 0, policy_version 46840 (0.0006) [2023-03-07 04:23:38,424][118044] Updated weights for policy 0, policy_version 46850 (0.0008) [2023-03-07 04:23:39,208][118044] Updated weights for policy 0, policy_version 46860 (0.0006) [2023-03-07 04:23:39,990][118044] Updated weights for policy 0, policy_version 46870 (0.0006) [2023-03-07 04:23:40,767][118044] Updated weights for policy 0, policy_version 46880 (0.0006) [2023-03-07 04:23:41,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 48009216. Throughput: 0: 13149.2. Samples: 47975244. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:23:41,086][117718] Avg episode reward: [(0, '2974.944')] [2023-03-07 04:23:41,537][118044] Updated weights for policy 0, policy_version 46890 (0.0006) [2023-03-07 04:23:42,312][118044] Updated weights for policy 0, policy_version 46900 (0.0006) [2023-03-07 04:23:43,094][118044] Updated weights for policy 0, policy_version 46910 (0.0007) [2023-03-07 04:23:43,885][118044] Updated weights for policy 0, policy_version 46920 (0.0006) [2023-03-07 04:23:44,659][118044] Updated weights for policy 0, policy_version 46930 (0.0007) [2023-03-07 04:23:45,455][118044] Updated weights for policy 0, policy_version 46940 (0.0006) [2023-03-07 04:23:46,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 48074752. Throughput: 0: 13145.0. Samples: 48054094. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:23:46,086][117718] Avg episode reward: [(0, '2840.124')] [2023-03-07 04:23:46,222][118044] Updated weights for policy 0, policy_version 46950 (0.0007) [2023-03-07 04:23:46,999][118044] Updated weights for policy 0, policy_version 46960 (0.0007) [2023-03-07 04:23:47,762][118044] Updated weights for policy 0, policy_version 46970 (0.0006) [2023-03-07 04:23:48,544][118044] Updated weights for policy 0, policy_version 46980 (0.0005) [2023-03-07 04:23:49,337][118044] Updated weights for policy 0, policy_version 46990 (0.0006) [2023-03-07 04:23:50,111][118044] Updated weights for policy 0, policy_version 47000 (0.0006) [2023-03-07 04:23:50,891][118044] Updated weights for policy 0, policy_version 47010 (0.0006) [2023-03-07 04:23:51,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 48140288. Throughput: 0: 13140.2. Samples: 48132921. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:23:51,086][117718] Avg episode reward: [(0, '2962.443')] [2023-03-07 04:23:51,681][118044] Updated weights for policy 0, policy_version 47020 (0.0006) [2023-03-07 04:23:52,480][118044] Updated weights for policy 0, policy_version 47030 (0.0006) [2023-03-07 04:23:53,245][118044] Updated weights for policy 0, policy_version 47040 (0.0006) [2023-03-07 04:23:54,002][118044] Updated weights for policy 0, policy_version 47050 (0.0006) [2023-03-07 04:23:54,818][118044] Updated weights for policy 0, policy_version 47060 (0.0006) [2023-03-07 04:23:55,570][118044] Updated weights for policy 0, policy_version 47070 (0.0006) [2023-03-07 04:23:56,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 48205824. Throughput: 0: 13132.2. Samples: 48172124. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:23:56,086][117718] Avg episode reward: [(0, '2994.738')] [2023-03-07 04:23:56,323][118044] Updated weights for policy 0, policy_version 47080 (0.0007) [2023-03-07 04:23:57,122][118044] Updated weights for policy 0, policy_version 47090 (0.0005) [2023-03-07 04:23:57,878][118044] Updated weights for policy 0, policy_version 47100 (0.0006) [2023-03-07 04:23:58,658][118044] Updated weights for policy 0, policy_version 47110 (0.0006) [2023-03-07 04:23:59,454][118044] Updated weights for policy 0, policy_version 47120 (0.0006) [2023-03-07 04:24:00,220][118044] Updated weights for policy 0, policy_version 47130 (0.0006) [2023-03-07 04:24:00,978][118044] Updated weights for policy 0, policy_version 47140 (0.0007) [2023-03-07 04:24:01,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 48272384. Throughput: 0: 13141.6. Samples: 48251516. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:24:01,086][117718] Avg episode reward: [(0, '3003.727')] [2023-03-07 04:24:01,773][118044] Updated weights for policy 0, policy_version 47150 (0.0006) [2023-03-07 04:24:02,543][118044] Updated weights for policy 0, policy_version 47160 (0.0006) [2023-03-07 04:24:03,326][118044] Updated weights for policy 0, policy_version 47170 (0.0006) [2023-03-07 04:24:04,100][118044] Updated weights for policy 0, policy_version 47180 (0.0005) [2023-03-07 04:24:04,871][118044] Updated weights for policy 0, policy_version 47190 (0.0006) [2023-03-07 04:24:05,657][118044] Updated weights for policy 0, policy_version 47200 (0.0006) [2023-03-07 04:24:06,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 48337920. Throughput: 0: 13148.9. Samples: 48330583. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:24:06,086][117718] Avg episode reward: [(0, '2832.669')] [2023-03-07 04:24:06,451][118044] Updated weights for policy 0, policy_version 47210 (0.0006) [2023-03-07 04:24:07,211][118044] Updated weights for policy 0, policy_version 47220 (0.0006) [2023-03-07 04:24:07,982][118044] Updated weights for policy 0, policy_version 47230 (0.0007) [2023-03-07 04:24:08,769][118044] Updated weights for policy 0, policy_version 47240 (0.0006) [2023-03-07 04:24:09,540][118044] Updated weights for policy 0, policy_version 47250 (0.0006) [2023-03-07 04:24:10,317][118044] Updated weights for policy 0, policy_version 47260 (0.0006) [2023-03-07 04:24:11,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 48403456. Throughput: 0: 13148.6. Samples: 48370131. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:24:11,086][117718] Avg episode reward: [(0, '2996.433')] [2023-03-07 04:24:11,102][118044] Updated weights for policy 0, policy_version 47270 (0.0005) [2023-03-07 04:24:11,868][118044] Updated weights for policy 0, policy_version 47280 (0.0006) [2023-03-07 04:24:12,669][118044] Updated weights for policy 0, policy_version 47290 (0.0006) [2023-03-07 04:24:13,437][118044] Updated weights for policy 0, policy_version 47300 (0.0006) [2023-03-07 04:24:14,213][118044] Updated weights for policy 0, policy_version 47310 (0.0006) [2023-03-07 04:24:15,005][118044] Updated weights for policy 0, policy_version 47320 (0.0006) [2023-03-07 04:24:15,755][118044] Updated weights for policy 0, policy_version 47330 (0.0006) [2023-03-07 04:24:16,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 48470016. Throughput: 0: 13147.6. Samples: 48449005. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:24:16,086][117718] Avg episode reward: [(0, '2999.196')] [2023-03-07 04:24:16,542][118044] Updated weights for policy 0, policy_version 47340 (0.0007) [2023-03-07 04:24:17,321][118044] Updated weights for policy 0, policy_version 47350 (0.0006) [2023-03-07 04:24:18,094][118044] Updated weights for policy 0, policy_version 47360 (0.0006) [2023-03-07 04:24:18,878][118044] Updated weights for policy 0, policy_version 47370 (0.0007) [2023-03-07 04:24:19,650][118044] Updated weights for policy 0, policy_version 47380 (0.0006) [2023-03-07 04:24:20,437][118044] Updated weights for policy 0, policy_version 47390 (0.0007) [2023-03-07 04:24:21,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 48535552. Throughput: 0: 13164.4. Samples: 48528163. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:24:21,086][117718] Avg episode reward: [(0, '2961.474')] [2023-03-07 04:24:21,208][118044] Updated weights for policy 0, policy_version 47400 (0.0006) [2023-03-07 04:24:21,987][118044] Updated weights for policy 0, policy_version 47410 (0.0007) [2023-03-07 04:24:22,776][118044] Updated weights for policy 0, policy_version 47420 (0.0007) [2023-03-07 04:24:23,551][118044] Updated weights for policy 0, policy_version 47430 (0.0006) [2023-03-07 04:24:24,329][118044] Updated weights for policy 0, policy_version 47440 (0.0006) [2023-03-07 04:24:25,108][118044] Updated weights for policy 0, policy_version 47450 (0.0007) [2023-03-07 04:24:25,892][118044] Updated weights for policy 0, policy_version 47460 (0.0007) [2023-03-07 04:24:26,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 48601088. Throughput: 0: 13158.7. Samples: 48567384. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:24:26,086][117718] Avg episode reward: [(0, '2801.886')] [2023-03-07 04:24:26,667][118044] Updated weights for policy 0, policy_version 47470 (0.0007) [2023-03-07 04:24:27,439][118044] Updated weights for policy 0, policy_version 47480 (0.0006) [2023-03-07 04:24:28,213][118044] Updated weights for policy 0, policy_version 47490 (0.0005) [2023-03-07 04:24:28,993][118044] Updated weights for policy 0, policy_version 47500 (0.0006) [2023-03-07 04:24:29,762][118044] Updated weights for policy 0, policy_version 47510 (0.0006) [2023-03-07 04:24:30,538][118044] Updated weights for policy 0, policy_version 47520 (0.0007) [2023-03-07 04:24:31,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 48666624. Throughput: 0: 13166.4. Samples: 48646581. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:24:31,086][117718] Avg episode reward: [(0, '3096.049')] [2023-03-07 04:24:31,324][118044] Updated weights for policy 0, policy_version 47530 (0.0007) [2023-03-07 04:24:32,100][118044] Updated weights for policy 0, policy_version 47540 (0.0006) [2023-03-07 04:24:32,888][118044] Updated weights for policy 0, policy_version 47550 (0.0006) [2023-03-07 04:24:33,670][118044] Updated weights for policy 0, policy_version 47560 (0.0007) [2023-03-07 04:24:34,443][118044] Updated weights for policy 0, policy_version 47570 (0.0006) [2023-03-07 04:24:35,212][118044] Updated weights for policy 0, policy_version 47580 (0.0006) [2023-03-07 04:24:35,997][118044] Updated weights for policy 0, policy_version 47590 (0.0006) [2023-03-07 04:24:36,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 48732160. Throughput: 0: 13165.8. Samples: 48725383. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:24:36,086][117718] Avg episode reward: [(0, '3085.283')] [2023-03-07 04:24:36,091][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000047591_48733184.pth... [2023-03-07 04:24:36,122][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000044507_45575168.pth [2023-03-07 04:24:36,771][118044] Updated weights for policy 0, policy_version 47600 (0.0005) [2023-03-07 04:24:37,555][118044] Updated weights for policy 0, policy_version 47610 (0.0006) [2023-03-07 04:24:38,329][118044] Updated weights for policy 0, policy_version 47620 (0.0006) [2023-03-07 04:24:39,132][118044] Updated weights for policy 0, policy_version 47630 (0.0006) [2023-03-07 04:24:39,894][118044] Updated weights for policy 0, policy_version 47640 (0.0006) [2023-03-07 04:24:40,667][118044] Updated weights for policy 0, policy_version 47650 (0.0006) [2023-03-07 04:24:41,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 48798720. Throughput: 0: 13173.3. Samples: 48764922. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:24:41,086][117718] Avg episode reward: [(0, '3103.911')] [2023-03-07 04:24:41,434][118044] Updated weights for policy 0, policy_version 47660 (0.0006) [2023-03-07 04:24:42,238][118044] Updated weights for policy 0, policy_version 47670 (0.0006) [2023-03-07 04:24:43,010][118044] Updated weights for policy 0, policy_version 47680 (0.0007) [2023-03-07 04:24:43,778][118044] Updated weights for policy 0, policy_version 47690 (0.0006) [2023-03-07 04:24:44,575][118044] Updated weights for policy 0, policy_version 47700 (0.0007) [2023-03-07 04:24:45,368][118044] Updated weights for policy 0, policy_version 47710 (0.0005) [2023-03-07 04:24:46,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 48864256. Throughput: 0: 13159.0. 
Samples: 48843670. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:24:46,086][117718] Avg episode reward: [(0, '2990.323')] [2023-03-07 04:24:46,144][118044] Updated weights for policy 0, policy_version 47720 (0.0007) [2023-03-07 04:24:46,943][118044] Updated weights for policy 0, policy_version 47730 (0.0006) [2023-03-07 04:24:47,724][118044] Updated weights for policy 0, policy_version 47740 (0.0007) [2023-03-07 04:24:48,490][118044] Updated weights for policy 0, policy_version 47750 (0.0006) [2023-03-07 04:24:49,274][118044] Updated weights for policy 0, policy_version 47760 (0.0006) [2023-03-07 04:24:50,050][118044] Updated weights for policy 0, policy_version 47770 (0.0006) [2023-03-07 04:24:50,845][118044] Updated weights for policy 0, policy_version 47780 (0.0006) [2023-03-07 04:24:51,085][117718] Fps is (10 sec: 13004.8, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 48928768. Throughput: 0: 13143.2. Samples: 48922025. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:24:51,086][117718] Avg episode reward: [(0, '2930.378')] [2023-03-07 04:24:51,627][118044] Updated weights for policy 0, policy_version 47790 (0.0006) [2023-03-07 04:24:52,401][118044] Updated weights for policy 0, policy_version 47800 (0.0006) [2023-03-07 04:24:53,187][118044] Updated weights for policy 0, policy_version 47810 (0.0006) [2023-03-07 04:24:53,974][118044] Updated weights for policy 0, policy_version 47820 (0.0006) [2023-03-07 04:24:54,749][118044] Updated weights for policy 0, policy_version 47830 (0.0007) [2023-03-07 04:24:55,524][118044] Updated weights for policy 0, policy_version 47840 (0.0006) [2023-03-07 04:24:56,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 48995328. Throughput: 0: 13140.3. Samples: 48961444. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:24:56,086][117718] Avg episode reward: [(0, '3063.309')] [2023-03-07 04:24:56,290][118044] Updated weights for policy 0, policy_version 47850 (0.0006) [2023-03-07 04:24:57,061][118044] Updated weights for policy 0, policy_version 47860 (0.0006) [2023-03-07 04:24:57,843][118044] Updated weights for policy 0, policy_version 47870 (0.0007) [2023-03-07 04:24:58,637][118044] Updated weights for policy 0, policy_version 47880 (0.0007) [2023-03-07 04:24:59,424][118044] Updated weights for policy 0, policy_version 47890 (0.0007) [2023-03-07 04:25:00,205][118044] Updated weights for policy 0, policy_version 47900 (0.0006) [2023-03-07 04:25:00,991][118044] Updated weights for policy 0, policy_version 47910 (0.0006) [2023-03-07 04:25:01,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 49060864. Throughput: 0: 13136.5. Samples: 49040149. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:25:01,086][117718] Avg episode reward: [(0, '3026.566')] [2023-03-07 04:25:01,780][118044] Updated weights for policy 0, policy_version 47920 (0.0005) [2023-03-07 04:25:02,563][118044] Updated weights for policy 0, policy_version 47930 (0.0006) [2023-03-07 04:25:03,331][118044] Updated weights for policy 0, policy_version 47940 (0.0006) [2023-03-07 04:25:04,111][118044] Updated weights for policy 0, policy_version 47950 (0.0006) [2023-03-07 04:25:04,893][118044] Updated weights for policy 0, policy_version 47960 (0.0006) [2023-03-07 04:25:05,676][118044] Updated weights for policy 0, policy_version 47970 (0.0005) [2023-03-07 04:25:06,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 49126400. Throughput: 0: 13126.8. Samples: 49118868. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:25:06,086][117718] Avg episode reward: [(0, '2980.598')] [2023-03-07 04:25:06,456][118044] Updated weights for policy 0, policy_version 47980 (0.0007) [2023-03-07 04:25:07,235][118044] Updated weights for policy 0, policy_version 47990 (0.0006) [2023-03-07 04:25:08,018][118044] Updated weights for policy 0, policy_version 48000 (0.0006) [2023-03-07 04:25:08,778][118044] Updated weights for policy 0, policy_version 48010 (0.0006) [2023-03-07 04:25:09,570][118044] Updated weights for policy 0, policy_version 48020 (0.0006) [2023-03-07 04:25:10,342][118044] Updated weights for policy 0, policy_version 48030 (0.0006) [2023-03-07 04:25:11,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 49191936. Throughput: 0: 13131.9. Samples: 49158320. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:25:11,086][117718] Avg episode reward: [(0, '2967.425')] [2023-03-07 04:25:11,117][118044] Updated weights for policy 0, policy_version 48040 (0.0006) [2023-03-07 04:25:11,900][118044] Updated weights for policy 0, policy_version 48050 (0.0007) [2023-03-07 04:25:12,686][118044] Updated weights for policy 0, policy_version 48060 (0.0007) [2023-03-07 04:25:13,447][118044] Updated weights for policy 0, policy_version 48070 (0.0006) [2023-03-07 04:25:14,224][118044] Updated weights for policy 0, policy_version 48080 (0.0006) [2023-03-07 04:25:15,018][118044] Updated weights for policy 0, policy_version 48090 (0.0006) [2023-03-07 04:25:15,801][118044] Updated weights for policy 0, policy_version 48100 (0.0006) [2023-03-07 04:25:16,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13148.9). Total num frames: 49257472. Throughput: 0: 13125.7. Samples: 49237235. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:25:16,086][117718] Avg episode reward: [(0, '2922.623')] [2023-03-07 04:25:16,574][118044] Updated weights for policy 0, policy_version 48110 (0.0006) [2023-03-07 04:25:17,363][118044] Updated weights for policy 0, policy_version 48120 (0.0007) [2023-03-07 04:25:18,146][118044] Updated weights for policy 0, policy_version 48130 (0.0006) [2023-03-07 04:25:18,938][118044] Updated weights for policy 0, policy_version 48140 (0.0008) [2023-03-07 04:25:19,726][118044] Updated weights for policy 0, policy_version 48150 (0.0006) [2023-03-07 04:25:20,493][118044] Updated weights for policy 0, policy_version 48160 (0.0007) [2023-03-07 04:25:21,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13148.9). Total num frames: 49323008. Throughput: 0: 13117.5. Samples: 49315669. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:25:21,086][117718] Avg episode reward: [(0, '2958.558')] [2023-03-07 04:25:21,270][118044] Updated weights for policy 0, policy_version 48170 (0.0006) [2023-03-07 04:25:22,053][118044] Updated weights for policy 0, policy_version 48180 (0.0006) [2023-03-07 04:25:22,838][118044] Updated weights for policy 0, policy_version 48190 (0.0006) [2023-03-07 04:25:23,626][118044] Updated weights for policy 0, policy_version 48200 (0.0007) [2023-03-07 04:25:24,401][118044] Updated weights for policy 0, policy_version 48210 (0.0007) [2023-03-07 04:25:25,181][118044] Updated weights for policy 0, policy_version 48220 (0.0005) [2023-03-07 04:25:25,965][118044] Updated weights for policy 0, policy_version 48230 (0.0006) [2023-03-07 04:25:26,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13148.9). Total num frames: 49388544. Throughput: 0: 13113.5. Samples: 49355029. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:25:26,086][117718] Avg episode reward: [(0, '2939.004')] [2023-03-07 04:25:26,740][118044] Updated weights for policy 0, policy_version 48240 (0.0007) [2023-03-07 04:25:27,521][118044] Updated weights for policy 0, policy_version 48250 (0.0005) [2023-03-07 04:25:28,293][118044] Updated weights for policy 0, policy_version 48260 (0.0007) [2023-03-07 04:25:29,065][118044] Updated weights for policy 0, policy_version 48270 (0.0005) [2023-03-07 04:25:29,862][118044] Updated weights for policy 0, policy_version 48280 (0.0006) [2023-03-07 04:25:30,634][118044] Updated weights for policy 0, policy_version 48290 (0.0007) [2023-03-07 04:25:31,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13148.9). Total num frames: 49454080. Throughput: 0: 13114.6. Samples: 49433828. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:25:31,086][117718] Avg episode reward: [(0, '2855.712')] [2023-03-07 04:25:31,420][118044] Updated weights for policy 0, policy_version 48300 (0.0006) [2023-03-07 04:25:32,211][118044] Updated weights for policy 0, policy_version 48310 (0.0006) [2023-03-07 04:25:33,005][118044] Updated weights for policy 0, policy_version 48320 (0.0005) [2023-03-07 04:25:33,779][118044] Updated weights for policy 0, policy_version 48330 (0.0006) [2023-03-07 04:25:34,540][118044] Updated weights for policy 0, policy_version 48340 (0.0006) [2023-03-07 04:25:35,322][118044] Updated weights for policy 0, policy_version 48350 (0.0006) [2023-03-07 04:25:36,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13124.3, 300 sec: 13148.8). Total num frames: 49519616. Throughput: 0: 13122.7. Samples: 49512547. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:25:36,086][117718] Avg episode reward: [(0, '2958.703')] [2023-03-07 04:25:36,117][118044] Updated weights for policy 0, policy_version 48360 (0.0007) [2023-03-07 04:25:36,894][118044] Updated weights for policy 0, policy_version 48370 (0.0006) [2023-03-07 04:25:37,650][118044] Updated weights for policy 0, policy_version 48380 (0.0006) [2023-03-07 04:25:38,425][118044] Updated weights for policy 0, policy_version 48390 (0.0006) [2023-03-07 04:25:39,209][118044] Updated weights for policy 0, policy_version 48400 (0.0006) [2023-03-07 04:25:39,967][118044] Updated weights for policy 0, policy_version 48410 (0.0006) [2023-03-07 04:25:40,757][118044] Updated weights for policy 0, policy_version 48420 (0.0006) [2023-03-07 04:25:41,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13124.3, 300 sec: 13152.3). Total num frames: 49586176. Throughput: 0: 13126.5. Samples: 49552136. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:25:41,086][117718] Avg episode reward: [(0, '2992.281')] [2023-03-07 04:25:41,533][118044] Updated weights for policy 0, policy_version 48430 (0.0006) [2023-03-07 04:25:42,317][118044] Updated weights for policy 0, policy_version 48440 (0.0007) [2023-03-07 04:25:43,101][118044] Updated weights for policy 0, policy_version 48450 (0.0006) [2023-03-07 04:25:43,873][118044] Updated weights for policy 0, policy_version 48460 (0.0007) [2023-03-07 04:25:44,646][118044] Updated weights for policy 0, policy_version 48470 (0.0006) [2023-03-07 04:25:45,424][118044] Updated weights for policy 0, policy_version 48480 (0.0006) [2023-03-07 04:25:46,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13124.2, 300 sec: 13152.3). Total num frames: 49651712. Throughput: 0: 13137.5. Samples: 49631340. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:25:46,086][117718] Avg episode reward: [(0, '2905.816')] [2023-03-07 04:25:46,205][118044] Updated weights for policy 0, policy_version 48490 (0.0006) [2023-03-07 04:25:46,988][118044] Updated weights for policy 0, policy_version 48500 (0.0006) [2023-03-07 04:25:47,733][118044] Updated weights for policy 0, policy_version 48510 (0.0006) [2023-03-07 04:25:48,518][118044] Updated weights for policy 0, policy_version 48520 (0.0006) [2023-03-07 04:25:49,300][118044] Updated weights for policy 0, policy_version 48530 (0.0006) [2023-03-07 04:25:50,082][118044] Updated weights for policy 0, policy_version 48540 (0.0006) [2023-03-07 04:25:50,882][118044] Updated weights for policy 0, policy_version 48550 (0.0006) [2023-03-07 04:25:51,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13152.3). Total num frames: 49717248. Throughput: 0: 13140.7. Samples: 49710201. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:25:51,086][117718] Avg episode reward: [(0, '2956.980')] [2023-03-07 04:25:51,647][118044] Updated weights for policy 0, policy_version 48560 (0.0006) [2023-03-07 04:25:52,416][118044] Updated weights for policy 0, policy_version 48570 (0.0006) [2023-03-07 04:25:53,189][118044] Updated weights for policy 0, policy_version 48580 (0.0006) [2023-03-07 04:25:53,968][118044] Updated weights for policy 0, policy_version 48590 (0.0006) [2023-03-07 04:25:54,740][118044] Updated weights for policy 0, policy_version 48600 (0.0006) [2023-03-07 04:25:55,521][118044] Updated weights for policy 0, policy_version 48610 (0.0006) [2023-03-07 04:25:56,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13152.3). Total num frames: 49783808. Throughput: 0: 13144.6. Samples: 49749829. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:25:56,086][117718] Avg episode reward: [(0, '2916.275')] [2023-03-07 04:25:56,312][118044] Updated weights for policy 0, policy_version 48620 (0.0006) [2023-03-07 04:25:57,074][118044] Updated weights for policy 0, policy_version 48630 (0.0007) [2023-03-07 04:25:57,873][118044] Updated weights for policy 0, policy_version 48640 (0.0006) [2023-03-07 04:25:58,629][118044] Updated weights for policy 0, policy_version 48650 (0.0006) [2023-03-07 04:25:59,430][118044] Updated weights for policy 0, policy_version 48660 (0.0006) [2023-03-07 04:26:00,219][118044] Updated weights for policy 0, policy_version 48670 (0.0006) [2023-03-07 04:26:01,013][118044] Updated weights for policy 0, policy_version 48680 (0.0006) [2023-03-07 04:26:01,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13124.3, 300 sec: 13148.9). Total num frames: 49848320. Throughput: 0: 13139.7. Samples: 49828522. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:26:01,086][117718] Avg episode reward: [(0, '3040.124')] [2023-03-07 04:26:01,791][118044] Updated weights for policy 0, policy_version 48690 (0.0006) [2023-03-07 04:26:02,562][118044] Updated weights for policy 0, policy_version 48700 (0.0007) [2023-03-07 04:26:03,345][118044] Updated weights for policy 0, policy_version 48710 (0.0006) [2023-03-07 04:26:04,117][118044] Updated weights for policy 0, policy_version 48720 (0.0006) [2023-03-07 04:26:04,878][118044] Updated weights for policy 0, policy_version 48730 (0.0006) [2023-03-07 04:26:05,656][118044] Updated weights for policy 0, policy_version 48740 (0.0006) [2023-03-07 04:26:06,086][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13152.3). Total num frames: 49914880. Throughput: 0: 13149.6. Samples: 49907402. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:26:06,086][117718] Avg episode reward: [(0, '3000.755')] [2023-03-07 04:26:06,416][118044] Updated weights for policy 0, policy_version 48750 (0.0006) [2023-03-07 04:26:07,208][118044] Updated weights for policy 0, policy_version 48760 (0.0007) [2023-03-07 04:26:07,974][118044] Updated weights for policy 0, policy_version 48770 (0.0006) [2023-03-07 04:26:08,758][118044] Updated weights for policy 0, policy_version 48780 (0.0006) [2023-03-07 04:26:09,532][118044] Updated weights for policy 0, policy_version 48790 (0.0006) [2023-03-07 04:26:10,309][118044] Updated weights for policy 0, policy_version 48800 (0.0007) [2023-03-07 04:26:11,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13141.3, 300 sec: 13152.3). Total num frames: 49980416. Throughput: 0: 13159.0. Samples: 49947185. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:26:11,086][117718] Avg episode reward: [(0, '2929.484')] [2023-03-07 04:26:11,089][118044] Updated weights for policy 0, policy_version 48810 (0.0006) [2023-03-07 04:26:11,875][118044] Updated weights for policy 0, policy_version 48820 (0.0006) [2023-03-07 04:26:12,649][118044] Updated weights for policy 0, policy_version 48830 (0.0007) [2023-03-07 04:26:13,436][118044] Updated weights for policy 0, policy_version 48840 (0.0007) [2023-03-07 04:26:14,213][118044] Updated weights for policy 0, policy_version 48850 (0.0006) [2023-03-07 04:26:15,000][118044] Updated weights for policy 0, policy_version 48860 (0.0006) [2023-03-07 04:26:15,777][118044] Updated weights for policy 0, policy_version 48870 (0.0006) [2023-03-07 04:26:16,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13155.8). Total num frames: 50046976. Throughput: 0: 13159.0. Samples: 50025981. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:26:16,086][117718] Avg episode reward: [(0, '2804.204')] [2023-03-07 04:26:16,548][118044] Updated weights for policy 0, policy_version 48880 (0.0006) [2023-03-07 04:26:17,337][118044] Updated weights for policy 0, policy_version 48890 (0.0006) [2023-03-07 04:26:18,109][118044] Updated weights for policy 0, policy_version 48900 (0.0006) [2023-03-07 04:26:18,883][118044] Updated weights for policy 0, policy_version 48910 (0.0006) [2023-03-07 04:26:19,668][118044] Updated weights for policy 0, policy_version 48920 (0.0007) [2023-03-07 04:26:20,444][118044] Updated weights for policy 0, policy_version 48930 (0.0006) [2023-03-07 04:26:21,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 50112512. Throughput: 0: 13162.3. Samples: 50104850. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:26:21,086][117718] Avg episode reward: [(0, '2818.123')] [2023-03-07 04:26:21,224][118044] Updated weights for policy 0, policy_version 48940 (0.0007) [2023-03-07 04:26:22,003][118044] Updated weights for policy 0, policy_version 48950 (0.0006) [2023-03-07 04:26:22,765][118044] Updated weights for policy 0, policy_version 48960 (0.0005) [2023-03-07 04:26:23,540][118044] Updated weights for policy 0, policy_version 48970 (0.0007) [2023-03-07 04:26:24,318][118044] Updated weights for policy 0, policy_version 48980 (0.0006) [2023-03-07 04:26:25,097][118044] Updated weights for policy 0, policy_version 48990 (0.0006) [2023-03-07 04:26:25,879][118044] Updated weights for policy 0, policy_version 49000 (0.0006) [2023-03-07 04:26:26,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 50178048. Throughput: 0: 13166.2. Samples: 50144617. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:26:26,086][117718] Avg episode reward: [(0, '2830.884')] [2023-03-07 04:26:26,661][118044] Updated weights for policy 0, policy_version 49010 (0.0007) [2023-03-07 04:26:27,429][118044] Updated weights for policy 0, policy_version 49020 (0.0006) [2023-03-07 04:26:28,219][118044] Updated weights for policy 0, policy_version 49030 (0.0006) [2023-03-07 04:26:29,004][118044] Updated weights for policy 0, policy_version 49040 (0.0006) [2023-03-07 04:26:29,771][118044] Updated weights for policy 0, policy_version 49050 (0.0006) [2023-03-07 04:26:30,567][118044] Updated weights for policy 0, policy_version 49060 (0.0006) [2023-03-07 04:26:31,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 50243584. Throughput: 0: 13155.8. Samples: 50223352. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:26:31,086][117718] Avg episode reward: [(0, '3031.282')] [2023-03-07 04:26:31,337][118044] Updated weights for policy 0, policy_version 49070 (0.0006) [2023-03-07 04:26:32,121][118044] Updated weights for policy 0, policy_version 49080 (0.0006) [2023-03-07 04:26:32,895][118044] Updated weights for policy 0, policy_version 49090 (0.0006) [2023-03-07 04:26:33,670][118044] Updated weights for policy 0, policy_version 49100 (0.0007) [2023-03-07 04:26:34,441][118044] Updated weights for policy 0, policy_version 49110 (0.0006) [2023-03-07 04:26:35,213][118044] Updated weights for policy 0, policy_version 49120 (0.0006) [2023-03-07 04:26:35,982][118044] Updated weights for policy 0, policy_version 49130 (0.0007) [2023-03-07 04:26:36,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13155.8). Total num frames: 50310144. Throughput: 0: 13158.2. Samples: 50302322. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:26:36,086][117718] Avg episode reward: [(0, '2956.978')] [2023-03-07 04:26:36,092][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000049131_50310144.pth... [2023-03-07 04:26:36,121][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000046049_47154176.pth [2023-03-07 04:26:36,768][118044] Updated weights for policy 0, policy_version 49140 (0.0006) [2023-03-07 04:26:37,565][118044] Updated weights for policy 0, policy_version 49150 (0.0007) [2023-03-07 04:26:38,327][118044] Updated weights for policy 0, policy_version 49160 (0.0006) [2023-03-07 04:26:39,112][118044] Updated weights for policy 0, policy_version 49170 (0.0006) [2023-03-07 04:26:39,899][118044] Updated weights for policy 0, policy_version 49180 (0.0005) [2023-03-07 04:26:40,684][118044] Updated weights for policy 0, policy_version 49190 (0.0005) [2023-03-07 04:26:41,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 50374656. Throughput: 0: 13154.8. Samples: 50341796. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:26:41,086][117718] Avg episode reward: [(0, '3003.873')] [2023-03-07 04:26:41,464][118044] Updated weights for policy 0, policy_version 49200 (0.0006) [2023-03-07 04:26:42,264][118044] Updated weights for policy 0, policy_version 49210 (0.0006) [2023-03-07 04:26:43,041][118044] Updated weights for policy 0, policy_version 49220 (0.0006) [2023-03-07 04:26:43,836][118044] Updated weights for policy 0, policy_version 49230 (0.0006) [2023-03-07 04:26:44,595][118044] Updated weights for policy 0, policy_version 49240 (0.0006) [2023-03-07 04:26:45,375][118044] Updated weights for policy 0, policy_version 49250 (0.0007) [2023-03-07 04:26:46,086][117718] Fps is (10 sec: 13004.8, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 50440192. Throughput: 0: 13148.6. 
Samples: 50420208. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:26:46,086][117718] Avg episode reward: [(0, '2835.438')] [2023-03-07 04:26:46,162][118044] Updated weights for policy 0, policy_version 49260 (0.0006) [2023-03-07 04:26:46,936][118044] Updated weights for policy 0, policy_version 49270 (0.0006) [2023-03-07 04:26:47,733][118044] Updated weights for policy 0, policy_version 49280 (0.0006) [2023-03-07 04:26:48,506][118044] Updated weights for policy 0, policy_version 49290 (0.0005) [2023-03-07 04:26:49,274][118044] Updated weights for policy 0, policy_version 49300 (0.0006) [2023-03-07 04:26:50,059][118044] Updated weights for policy 0, policy_version 49310 (0.0007) [2023-03-07 04:26:50,854][118044] Updated weights for policy 0, policy_version 49320 (0.0006) [2023-03-07 04:26:51,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 50505728. Throughput: 0: 13144.8. Samples: 50498920. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:26:51,086][117718] Avg episode reward: [(0, '2828.710')] [2023-03-07 04:26:51,617][118044] Updated weights for policy 0, policy_version 49330 (0.0007) [2023-03-07 04:26:52,394][118044] Updated weights for policy 0, policy_version 49340 (0.0006) [2023-03-07 04:26:53,171][118044] Updated weights for policy 0, policy_version 49350 (0.0006) [2023-03-07 04:26:53,962][118044] Updated weights for policy 0, policy_version 49360 (0.0006) [2023-03-07 04:26:54,744][118044] Updated weights for policy 0, policy_version 49370 (0.0008) [2023-03-07 04:26:55,534][118044] Updated weights for policy 0, policy_version 49380 (0.0006) [2023-03-07 04:26:56,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13148.8). Total num frames: 50572288. Throughput: 0: 13142.9. Samples: 50538616. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:26:56,086][117718] Avg episode reward: [(0, '2766.657')] [2023-03-07 04:26:56,302][118044] Updated weights for policy 0, policy_version 49390 (0.0006) [2023-03-07 04:26:57,098][118044] Updated weights for policy 0, policy_version 49400 (0.0006) [2023-03-07 04:26:57,861][118044] Updated weights for policy 0, policy_version 49410 (0.0007) [2023-03-07 04:26:58,658][118044] Updated weights for policy 0, policy_version 49420 (0.0006) [2023-03-07 04:26:59,447][118044] Updated weights for policy 0, policy_version 49430 (0.0006) [2023-03-07 04:27:00,220][118044] Updated weights for policy 0, policy_version 49440 (0.0008) [2023-03-07 04:27:01,002][118044] Updated weights for policy 0, policy_version 49450 (0.0006) [2023-03-07 04:27:01,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 50637824. Throughput: 0: 13135.9. Samples: 50617099. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:27:01,086][117718] Avg episode reward: [(0, '2978.298')] [2023-03-07 04:27:01,786][118044] Updated weights for policy 0, policy_version 49460 (0.0007) [2023-03-07 04:27:02,543][118044] Updated weights for policy 0, policy_version 49470 (0.0006) [2023-03-07 04:27:03,316][118044] Updated weights for policy 0, policy_version 49480 (0.0007) [2023-03-07 04:27:04,089][118044] Updated weights for policy 0, policy_version 49490 (0.0007) [2023-03-07 04:27:04,848][118044] Updated weights for policy 0, policy_version 49500 (0.0008) [2023-03-07 04:27:05,613][118044] Updated weights for policy 0, policy_version 49510 (0.0006) [2023-03-07 04:27:06,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 50703360. Throughput: 0: 13145.7. Samples: 50696409. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:27:06,086][117718] Avg episode reward: [(0, '2951.971')] [2023-03-07 04:27:06,396][118044] Updated weights for policy 0, policy_version 49520 (0.0007) [2023-03-07 04:27:07,188][118044] Updated weights for policy 0, policy_version 49530 (0.0007) [2023-03-07 04:27:07,957][118044] Updated weights for policy 0, policy_version 49540 (0.0006) [2023-03-07 04:27:08,755][118044] Updated weights for policy 0, policy_version 49550 (0.0006) [2023-03-07 04:27:09,524][118044] Updated weights for policy 0, policy_version 49560 (0.0006) [2023-03-07 04:27:10,303][118044] Updated weights for policy 0, policy_version 49570 (0.0006) [2023-03-07 04:27:11,082][118044] Updated weights for policy 0, policy_version 49580 (0.0007) [2023-03-07 04:27:11,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 50769920. Throughput: 0: 13136.3. Samples: 50735747. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:27:11,086][117718] Avg episode reward: [(0, '2966.818')] [2023-03-07 04:27:11,850][118044] Updated weights for policy 0, policy_version 49590 (0.0006) [2023-03-07 04:27:12,629][118044] Updated weights for policy 0, policy_version 49600 (0.0006) [2023-03-07 04:27:13,422][118044] Updated weights for policy 0, policy_version 49610 (0.0007) [2023-03-07 04:27:14,189][118044] Updated weights for policy 0, policy_version 49620 (0.0006) [2023-03-07 04:27:14,978][118044] Updated weights for policy 0, policy_version 49630 (0.0006) [2023-03-07 04:27:15,748][118044] Updated weights for policy 0, policy_version 49640 (0.0006) [2023-03-07 04:27:16,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 50835456. Throughput: 0: 13143.6. Samples: 50814814. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:27:16,086][117718] Avg episode reward: [(0, '3002.414')] [2023-03-07 04:27:16,529][118044] Updated weights for policy 0, policy_version 49650 (0.0006) [2023-03-07 04:27:17,301][118044] Updated weights for policy 0, policy_version 49660 (0.0006) [2023-03-07 04:27:18,078][118044] Updated weights for policy 0, policy_version 49670 (0.0007) [2023-03-07 04:27:18,852][118044] Updated weights for policy 0, policy_version 49680 (0.0006) [2023-03-07 04:27:19,628][118044] Updated weights for policy 0, policy_version 49690 (0.0006) [2023-03-07 04:27:20,395][118044] Updated weights for policy 0, policy_version 49700 (0.0006) [2023-03-07 04:27:21,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.4, 300 sec: 13148.9). Total num frames: 50900992. Throughput: 0: 13147.6. Samples: 50893962. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:27:21,086][117718] Avg episode reward: [(0, '2934.271')] [2023-03-07 04:27:21,168][118044] Updated weights for policy 0, policy_version 49710 (0.0006) [2023-03-07 04:27:21,974][118044] Updated weights for policy 0, policy_version 49720 (0.0006) [2023-03-07 04:27:22,740][118044] Updated weights for policy 0, policy_version 49730 (0.0006) [2023-03-07 04:27:23,514][118044] Updated weights for policy 0, policy_version 49740 (0.0005) [2023-03-07 04:27:24,315][118044] Updated weights for policy 0, policy_version 49750 (0.0006) [2023-03-07 04:27:25,082][118044] Updated weights for policy 0, policy_version 49760 (0.0006) [2023-03-07 04:27:25,868][118044] Updated weights for policy 0, policy_version 49770 (0.0006) [2023-03-07 04:27:26,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 50966528. Throughput: 0: 13142.5. Samples: 50933208. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:27:26,086][117718] Avg episode reward: [(0, '2899.049')] [2023-03-07 04:27:26,650][118044] Updated weights for policy 0, policy_version 49780 (0.0006) [2023-03-07 04:27:27,425][118044] Updated weights for policy 0, policy_version 49790 (0.0007) [2023-03-07 04:27:28,197][118044] Updated weights for policy 0, policy_version 49800 (0.0007) [2023-03-07 04:27:28,998][118044] Updated weights for policy 0, policy_version 49810 (0.0007) [2023-03-07 04:27:29,761][118044] Updated weights for policy 0, policy_version 49820 (0.0007) [2023-03-07 04:27:30,536][118044] Updated weights for policy 0, policy_version 49830 (0.0006) [2023-03-07 04:27:31,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 51033088. Throughput: 0: 13154.5. Samples: 51012159. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:27:31,086][117718] Avg episode reward: [(0, '2883.255')] [2023-03-07 04:27:31,323][118044] Updated weights for policy 0, policy_version 49840 (0.0006) [2023-03-07 04:27:32,095][118044] Updated weights for policy 0, policy_version 49850 (0.0006) [2023-03-07 04:27:32,868][118044] Updated weights for policy 0, policy_version 49860 (0.0006) [2023-03-07 04:27:33,651][118044] Updated weights for policy 0, policy_version 49870 (0.0006) [2023-03-07 04:27:34,432][118044] Updated weights for policy 0, policy_version 49880 (0.0006) [2023-03-07 04:27:35,194][118044] Updated weights for policy 0, policy_version 49890 (0.0007) [2023-03-07 04:27:35,971][118044] Updated weights for policy 0, policy_version 49900 (0.0005) [2023-03-07 04:27:36,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 51098624. Throughput: 0: 13160.5. Samples: 51091144. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:27:36,086][117718] Avg episode reward: [(0, '2857.909')] [2023-03-07 04:27:36,751][118044] Updated weights for policy 0, policy_version 49910 (0.0007) [2023-03-07 04:27:37,524][118044] Updated weights for policy 0, policy_version 49920 (0.0006) [2023-03-07 04:27:38,292][118044] Updated weights for policy 0, policy_version 49930 (0.0006) [2023-03-07 04:27:39,071][118044] Updated weights for policy 0, policy_version 49940 (0.0006) [2023-03-07 04:27:39,848][118044] Updated weights for policy 0, policy_version 49950 (0.0007) [2023-03-07 04:27:40,620][118044] Updated weights for policy 0, policy_version 49960 (0.0007) [2023-03-07 04:27:41,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 51164160. Throughput: 0: 13157.3. Samples: 51130697. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:27:41,086][117718] Avg episode reward: [(0, '2880.624')] [2023-03-07 04:27:41,411][118044] Updated weights for policy 0, policy_version 49970 (0.0007) [2023-03-07 04:27:42,192][118044] Updated weights for policy 0, policy_version 49980 (0.0006) [2023-03-07 04:27:42,953][118044] Updated weights for policy 0, policy_version 49990 (0.0006) [2023-03-07 04:27:43,742][118044] Updated weights for policy 0, policy_version 50000 (0.0006) [2023-03-07 04:27:44,522][118044] Updated weights for policy 0, policy_version 50010 (0.0006) [2023-03-07 04:27:45,310][118044] Updated weights for policy 0, policy_version 50020 (0.0006) [2023-03-07 04:27:46,074][118044] Updated weights for policy 0, policy_version 50030 (0.0006) [2023-03-07 04:27:46,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13148.8). Total num frames: 51230720. Throughput: 0: 13168.8. Samples: 51209697. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:27:46,086][117718] Avg episode reward: [(0, '2913.935')] [2023-03-07 04:27:46,844][118044] Updated weights for policy 0, policy_version 50040 (0.0006) [2023-03-07 04:27:47,617][118044] Updated weights for policy 0, policy_version 50050 (0.0006) [2023-03-07 04:27:48,402][118044] Updated weights for policy 0, policy_version 50060 (0.0005) [2023-03-07 04:27:49,170][118044] Updated weights for policy 0, policy_version 50070 (0.0005) [2023-03-07 04:27:49,945][118044] Updated weights for policy 0, policy_version 50080 (0.0006) [2023-03-07 04:27:50,738][118044] Updated weights for policy 0, policy_version 50090 (0.0007) [2023-03-07 04:27:51,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13148.9). Total num frames: 51296256. Throughput: 0: 13169.2. Samples: 51289025. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:27:51,086][117718] Avg episode reward: [(0, '2959.346')] [2023-03-07 04:27:51,497][118044] Updated weights for policy 0, policy_version 50100 (0.0006) [2023-03-07 04:27:52,260][118044] Updated weights for policy 0, policy_version 50110 (0.0006) [2023-03-07 04:27:53,040][118044] Updated weights for policy 0, policy_version 50120 (0.0007) [2023-03-07 04:27:53,817][118044] Updated weights for policy 0, policy_version 50130 (0.0006) [2023-03-07 04:27:54,595][118044] Updated weights for policy 0, policy_version 50140 (0.0006) [2023-03-07 04:27:55,379][118044] Updated weights for policy 0, policy_version 50150 (0.0006) [2023-03-07 04:27:56,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13152.3). Total num frames: 51362816. Throughput: 0: 13177.6. Samples: 51328742. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:27:56,086][117718] Avg episode reward: [(0, '2995.925')] [2023-03-07 04:27:56,155][118044] Updated weights for policy 0, policy_version 50160 (0.0006) [2023-03-07 04:27:56,938][118044] Updated weights for policy 0, policy_version 50170 (0.0006) [2023-03-07 04:27:57,722][118044] Updated weights for policy 0, policy_version 50180 (0.0006) [2023-03-07 04:27:58,502][118044] Updated weights for policy 0, policy_version 50190 (0.0006) [2023-03-07 04:27:59,280][118044] Updated weights for policy 0, policy_version 50200 (0.0006) [2023-03-07 04:28:00,051][118044] Updated weights for policy 0, policy_version 50210 (0.0006) [2023-03-07 04:28:00,822][118044] Updated weights for policy 0, policy_version 50220 (0.0006) [2023-03-07 04:28:01,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13148.9). Total num frames: 51428352. Throughput: 0: 13172.8. Samples: 51407588. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:28:01,086][117718] Avg episode reward: [(0, '2899.933')] [2023-03-07 04:28:01,610][118044] Updated weights for policy 0, policy_version 50230 (0.0007) [2023-03-07 04:28:02,377][118044] Updated weights for policy 0, policy_version 50240 (0.0007) [2023-03-07 04:28:03,145][118044] Updated weights for policy 0, policy_version 50250 (0.0005) [2023-03-07 04:28:03,918][118044] Updated weights for policy 0, policy_version 50260 (0.0006) [2023-03-07 04:28:04,709][118044] Updated weights for policy 0, policy_version 50270 (0.0006) [2023-03-07 04:28:05,473][118044] Updated weights for policy 0, policy_version 50280 (0.0006) [2023-03-07 04:28:06,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13175.5, 300 sec: 13148.9). Total num frames: 51493888. Throughput: 0: 13171.2. Samples: 51486667. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:28:06,086][117718] Avg episode reward: [(0, '2929.160')] [2023-03-07 04:28:06,271][118044] Updated weights for policy 0, policy_version 50290 (0.0007) [2023-03-07 04:28:07,053][118044] Updated weights for policy 0, policy_version 50300 (0.0006) [2023-03-07 04:28:07,818][118044] Updated weights for policy 0, policy_version 50310 (0.0007) [2023-03-07 04:28:08,606][118044] Updated weights for policy 0, policy_version 50320 (0.0006) [2023-03-07 04:28:09,382][118044] Updated weights for policy 0, policy_version 50330 (0.0007) [2023-03-07 04:28:10,177][118044] Updated weights for policy 0, policy_version 50340 (0.0006) [2023-03-07 04:28:10,939][118044] Updated weights for policy 0, policy_version 50350 (0.0006) [2023-03-07 04:28:11,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 51559424. Throughput: 0: 13176.1. Samples: 51526130. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:28:11,086][117718] Avg episode reward: [(0, '3006.178')] [2023-03-07 04:28:11,723][118044] Updated weights for policy 0, policy_version 50360 (0.0006) [2023-03-07 04:28:12,489][118044] Updated weights for policy 0, policy_version 50370 (0.0006) [2023-03-07 04:28:13,265][118044] Updated weights for policy 0, policy_version 50380 (0.0006) [2023-03-07 04:28:14,057][118044] Updated weights for policy 0, policy_version 50390 (0.0006) [2023-03-07 04:28:14,837][118044] Updated weights for policy 0, policy_version 50400 (0.0007) [2023-03-07 04:28:15,613][118044] Updated weights for policy 0, policy_version 50410 (0.0006) [2023-03-07 04:28:16,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13152.3). Total num frames: 51625984. Throughput: 0: 13175.0. Samples: 51605037. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:28:16,086][117718] Avg episode reward: [(0, '2859.084')] [2023-03-07 04:28:16,386][118044] Updated weights for policy 0, policy_version 50420 (0.0006) [2023-03-07 04:28:17,146][118044] Updated weights for policy 0, policy_version 50430 (0.0005) [2023-03-07 04:28:17,937][118044] Updated weights for policy 0, policy_version 50440 (0.0006) [2023-03-07 04:28:18,724][118044] Updated weights for policy 0, policy_version 50450 (0.0006) [2023-03-07 04:28:19,485][118044] Updated weights for policy 0, policy_version 50460 (0.0005) [2023-03-07 04:28:20,263][118044] Updated weights for policy 0, policy_version 50470 (0.0006) [2023-03-07 04:28:21,017][118044] Updated weights for policy 0, policy_version 50480 (0.0006) [2023-03-07 04:28:21,085][117718] Fps is (10 sec: 13209.5, 60 sec: 13175.5, 300 sec: 13152.3). Total num frames: 51691520. Throughput: 0: 13182.7. Samples: 51684366. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:28:21,086][117718] Avg episode reward: [(0, '2920.509')] [2023-03-07 04:28:21,817][118044] Updated weights for policy 0, policy_version 50490 (0.0006) [2023-03-07 04:28:22,602][118044] Updated weights for policy 0, policy_version 50500 (0.0006) [2023-03-07 04:28:23,387][118044] Updated weights for policy 0, policy_version 50510 (0.0006) [2023-03-07 04:28:24,182][118044] Updated weights for policy 0, policy_version 50520 (0.0006) [2023-03-07 04:28:24,962][118044] Updated weights for policy 0, policy_version 50530 (0.0006) [2023-03-07 04:28:25,738][118044] Updated weights for policy 0, policy_version 50540 (0.0006) [2023-03-07 04:28:26,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13175.4, 300 sec: 13148.8). Total num frames: 51757056. Throughput: 0: 13174.7. Samples: 51723557. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:28:26,086][117718] Avg episode reward: [(0, '2888.918')] [2023-03-07 04:28:26,529][118044] Updated weights for policy 0, policy_version 50550 (0.0006) [2023-03-07 04:28:27,305][118044] Updated weights for policy 0, policy_version 50560 (0.0007) [2023-03-07 04:28:28,089][118044] Updated weights for policy 0, policy_version 50570 (0.0007) [2023-03-07 04:28:28,856][118044] Updated weights for policy 0, policy_version 50580 (0.0006) [2023-03-07 04:28:29,644][118044] Updated weights for policy 0, policy_version 50590 (0.0006) [2023-03-07 04:28:30,438][118044] Updated weights for policy 0, policy_version 50600 (0.0006) [2023-03-07 04:28:31,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 51822592. Throughput: 0: 13164.0. Samples: 51802075. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:28:31,097][117718] Avg episode reward: [(0, '2909.713')] [2023-03-07 04:28:31,229][118044] Updated weights for policy 0, policy_version 50610 (0.0006) [2023-03-07 04:28:32,003][118044] Updated weights for policy 0, policy_version 50620 (0.0006) [2023-03-07 04:28:32,786][118044] Updated weights for policy 0, policy_version 50630 (0.0006) [2023-03-07 04:28:33,547][118044] Updated weights for policy 0, policy_version 50640 (0.0006) [2023-03-07 04:28:34,345][118044] Updated weights for policy 0, policy_version 50650 (0.0006) [2023-03-07 04:28:35,109][118044] Updated weights for policy 0, policy_version 50660 (0.0005) [2023-03-07 04:28:35,900][118044] Updated weights for policy 0, policy_version 50670 (0.0006) [2023-03-07 04:28:36,085][117718] Fps is (10 sec: 13107.4, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 51888128. Throughput: 0: 13150.7. Samples: 51880807. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:28:36,096][117718] Avg episode reward: [(0, '3081.100')] [2023-03-07 04:28:36,100][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000050672_51888128.pth... [2023-03-07 04:28:36,130][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000047591_48733184.pth [2023-03-07 04:28:36,674][118044] Updated weights for policy 0, policy_version 50680 (0.0006) [2023-03-07 04:28:37,459][118044] Updated weights for policy 0, policy_version 50690 (0.0006) [2023-03-07 04:28:38,226][118044] Updated weights for policy 0, policy_version 50700 (0.0006) [2023-03-07 04:28:39,007][118044] Updated weights for policy 0, policy_version 50710 (0.0006) [2023-03-07 04:28:39,790][118044] Updated weights for policy 0, policy_version 50720 (0.0006) [2023-03-07 04:28:40,544][118044] Updated weights for policy 0, policy_version 50730 (0.0006) [2023-03-07 04:28:41,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 51953664. Throughput: 0: 13143.7. Samples: 51920209. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:28:41,086][117718] Avg episode reward: [(0, '2983.185')] [2023-03-07 04:28:41,327][118044] Updated weights for policy 0, policy_version 50740 (0.0006) [2023-03-07 04:28:42,109][118044] Updated weights for policy 0, policy_version 50750 (0.0005) [2023-03-07 04:28:42,873][118044] Updated weights for policy 0, policy_version 50760 (0.0005) [2023-03-07 04:28:43,651][118044] Updated weights for policy 0, policy_version 50770 (0.0006) [2023-03-07 04:28:44,436][118044] Updated weights for policy 0, policy_version 50780 (0.0006) [2023-03-07 04:28:45,205][118044] Updated weights for policy 0, policy_version 50790 (0.0006) [2023-03-07 04:28:45,966][118044] Updated weights for policy 0, policy_version 50800 (0.0006) [2023-03-07 04:28:46,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 52020224. Throughput: 0: 13149.9. Samples: 51999335. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 04:28:46,086][117718] Avg episode reward: [(0, '2984.820')] [2023-03-07 04:28:46,756][118044] Updated weights for policy 0, policy_version 50810 (0.0006) [2023-03-07 04:28:47,533][118044] Updated weights for policy 0, policy_version 50820 (0.0006) [2023-03-07 04:28:48,332][118044] Updated weights for policy 0, policy_version 50830 (0.0006) [2023-03-07 04:28:49,103][118044] Updated weights for policy 0, policy_version 50840 (0.0006) [2023-03-07 04:28:49,875][118044] Updated weights for policy 0, policy_version 50850 (0.0006) [2023-03-07 04:28:50,647][118044] Updated weights for policy 0, policy_version 50860 (0.0006) [2023-03-07 04:28:51,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 52085760. Throughput: 0: 13154.3. Samples: 52078608. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 04:28:51,086][117718] Avg episode reward: [(0, '2996.684')] [2023-03-07 04:28:51,424][118044] Updated weights for policy 0, policy_version 50870 (0.0005) [2023-03-07 04:28:52,202][118044] Updated weights for policy 0, policy_version 50880 (0.0006) [2023-03-07 04:28:52,990][118044] Updated weights for policy 0, policy_version 50890 (0.0006) [2023-03-07 04:28:53,763][118044] Updated weights for policy 0, policy_version 50900 (0.0006) [2023-03-07 04:28:54,541][118044] Updated weights for policy 0, policy_version 50910 (0.0006) [2023-03-07 04:28:55,312][118044] Updated weights for policy 0, policy_version 50920 (0.0006) [2023-03-07 04:28:56,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 52151296. Throughput: 0: 13152.2. Samples: 52117977. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 04:28:56,086][117718] Avg episode reward: [(0, '2973.488')] [2023-03-07 04:28:56,087][118044] Updated weights for policy 0, policy_version 50930 (0.0006) [2023-03-07 04:28:56,855][118044] Updated weights for policy 0, policy_version 50940 (0.0006) [2023-03-07 04:28:57,627][118044] Updated weights for policy 0, policy_version 50950 (0.0006) [2023-03-07 04:28:58,420][118044] Updated weights for policy 0, policy_version 50960 (0.0006) [2023-03-07 04:28:59,193][118044] Updated weights for policy 0, policy_version 50970 (0.0007) [2023-03-07 04:28:59,973][118044] Updated weights for policy 0, policy_version 50980 (0.0006) [2023-03-07 04:29:00,763][118044] Updated weights for policy 0, policy_version 50990 (0.0006) [2023-03-07 04:29:01,086][117718] Fps is (10 sec: 13209.4, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 52217856. Throughput: 0: 13158.6. Samples: 52197173. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 04:29:01,086][117718] Avg episode reward: [(0, '2972.093')] [2023-03-07 04:29:01,542][118044] Updated weights for policy 0, policy_version 51000 (0.0006) [2023-03-07 04:29:02,294][118044] Updated weights for policy 0, policy_version 51010 (0.0007) [2023-03-07 04:29:03,074][118044] Updated weights for policy 0, policy_version 51020 (0.0006) [2023-03-07 04:29:03,857][118044] Updated weights for policy 0, policy_version 51030 (0.0006) [2023-03-07 04:29:04,641][118044] Updated weights for policy 0, policy_version 51040 (0.0007) [2023-03-07 04:29:05,399][118044] Updated weights for policy 0, policy_version 51050 (0.0007) [2023-03-07 04:29:06,086][117718] Fps is (10 sec: 13311.9, 60 sec: 13175.5, 300 sec: 13155.8). Total num frames: 52284416. Throughput: 0: 13154.8. Samples: 52276334. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 04:29:06,086][117718] Avg episode reward: [(0, '2898.372')] [2023-03-07 04:29:06,185][118044] Updated weights for policy 0, policy_version 51060 (0.0006) [2023-03-07 04:29:06,944][118044] Updated weights for policy 0, policy_version 51070 (0.0005) [2023-03-07 04:29:07,713][118044] Updated weights for policy 0, policy_version 51080 (0.0006) [2023-03-07 04:29:08,498][118044] Updated weights for policy 0, policy_version 51090 (0.0006) [2023-03-07 04:29:09,285][118044] Updated weights for policy 0, policy_version 51100 (0.0006) [2023-03-07 04:29:10,066][118044] Updated weights for policy 0, policy_version 51110 (0.0006) [2023-03-07 04:29:10,841][118044] Updated weights for policy 0, policy_version 51120 (0.0007) [2023-03-07 04:29:11,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13175.4, 300 sec: 13152.3). Total num frames: 52349952. Throughput: 0: 13166.6. Samples: 52316054. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 04:29:11,086][117718] Avg episode reward: [(0, '2892.603')] [2023-03-07 04:29:11,615][118044] Updated weights for policy 0, policy_version 51130 (0.0006) [2023-03-07 04:29:12,390][118044] Updated weights for policy 0, policy_version 51140 (0.0006) [2023-03-07 04:29:13,166][118044] Updated weights for policy 0, policy_version 51150 (0.0007) [2023-03-07 04:29:13,957][118044] Updated weights for policy 0, policy_version 51160 (0.0006) [2023-03-07 04:29:14,729][118044] Updated weights for policy 0, policy_version 51170 (0.0006) [2023-03-07 04:29:15,500][118044] Updated weights for policy 0, policy_version 51180 (0.0006) [2023-03-07 04:29:16,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 52415488. Throughput: 0: 13175.4. Samples: 52394971. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 04:29:16,086][117718] Avg episode reward: [(0, '3060.708')] [2023-03-07 04:29:16,289][118044] Updated weights for policy 0, policy_version 51190 (0.0006) [2023-03-07 04:29:17,046][118044] Updated weights for policy 0, policy_version 51200 (0.0006) [2023-03-07 04:29:17,826][118044] Updated weights for policy 0, policy_version 51210 (0.0006) [2023-03-07 04:29:18,623][118044] Updated weights for policy 0, policy_version 51220 (0.0006) [2023-03-07 04:29:19,399][118044] Updated weights for policy 0, policy_version 51230 (0.0006) [2023-03-07 04:29:20,183][118044] Updated weights for policy 0, policy_version 51240 (0.0006) [2023-03-07 04:29:20,943][118044] Updated weights for policy 0, policy_version 51250 (0.0006) [2023-03-07 04:29:21,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 52481024. Throughput: 0: 13181.5. Samples: 52473975. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:29:21,086][117718] Avg episode reward: [(0, '2899.707')] [2023-03-07 04:29:21,734][118044] Updated weights for policy 0, policy_version 51260 (0.0006) [2023-03-07 04:29:22,500][118044] Updated weights for policy 0, policy_version 51270 (0.0006) [2023-03-07 04:29:23,281][118044] Updated weights for policy 0, policy_version 51280 (0.0006) [2023-03-07 04:29:24,052][118044] Updated weights for policy 0, policy_version 51290 (0.0006) [2023-03-07 04:29:24,827][118044] Updated weights for policy 0, policy_version 51300 (0.0006) [2023-03-07 04:29:25,598][118044] Updated weights for policy 0, policy_version 51310 (0.0006) [2023-03-07 04:29:26,085][117718] Fps is (10 sec: 13209.8, 60 sec: 13175.5, 300 sec: 13155.8). Total num frames: 52547584. Throughput: 0: 13185.8. Samples: 52513567. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:29:26,086][117718] Avg episode reward: [(0, '3036.125')] [2023-03-07 04:29:26,386][118044] Updated weights for policy 0, policy_version 51320 (0.0006) [2023-03-07 04:29:27,166][118044] Updated weights for policy 0, policy_version 51330 (0.0006) [2023-03-07 04:29:27,954][118044] Updated weights for policy 0, policy_version 51340 (0.0006) [2023-03-07 04:29:28,739][118044] Updated weights for policy 0, policy_version 51350 (0.0006) [2023-03-07 04:29:29,505][118044] Updated weights for policy 0, policy_version 51360 (0.0006) [2023-03-07 04:29:30,297][118044] Updated weights for policy 0, policy_version 51370 (0.0006) [2023-03-07 04:29:31,074][118044] Updated weights for policy 0, policy_version 51380 (0.0006) [2023-03-07 04:29:31,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13155.8). Total num frames: 52613120. Throughput: 0: 13175.7. Samples: 52592241. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:29:31,086][117718] Avg episode reward: [(0, '2913.277')] [2023-03-07 04:29:31,856][118044] Updated weights for policy 0, policy_version 51390 (0.0006) [2023-03-07 04:29:32,630][118044] Updated weights for policy 0, policy_version 51400 (0.0006) [2023-03-07 04:29:33,401][118044] Updated weights for policy 0, policy_version 51410 (0.0006) [2023-03-07 04:29:34,182][118044] Updated weights for policy 0, policy_version 51420 (0.0007) [2023-03-07 04:29:34,956][118044] Updated weights for policy 0, policy_version 51430 (0.0006) [2023-03-07 04:29:35,730][118044] Updated weights for policy 0, policy_version 51440 (0.0007) [2023-03-07 04:29:36,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13175.4, 300 sec: 13152.3). Total num frames: 52678656. Throughput: 0: 13172.1. Samples: 52671355. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:29:36,086][117718] Avg episode reward: [(0, '2964.666')] [2023-03-07 04:29:36,508][118044] Updated weights for policy 0, policy_version 51450 (0.0006) [2023-03-07 04:29:37,287][118044] Updated weights for policy 0, policy_version 51460 (0.0005) [2023-03-07 04:29:38,058][118044] Updated weights for policy 0, policy_version 51470 (0.0006) [2023-03-07 04:29:38,858][118044] Updated weights for policy 0, policy_version 51480 (0.0006) [2023-03-07 04:29:39,624][118044] Updated weights for policy 0, policy_version 51490 (0.0006) [2023-03-07 04:29:40,403][118044] Updated weights for policy 0, policy_version 51500 (0.0006) [2023-03-07 04:29:41,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13175.5, 300 sec: 13152.3). Total num frames: 52744192. Throughput: 0: 13169.2. Samples: 52710594. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:29:41,086][117718] Avg episode reward: [(0, '3007.705')] [2023-03-07 04:29:41,184][118044] Updated weights for policy 0, policy_version 51510 (0.0006) [2023-03-07 04:29:41,965][118044] Updated weights for policy 0, policy_version 51520 (0.0006) [2023-03-07 04:29:42,756][118044] Updated weights for policy 0, policy_version 51530 (0.0005) [2023-03-07 04:29:43,532][118044] Updated weights for policy 0, policy_version 51540 (0.0007) [2023-03-07 04:29:44,298][118044] Updated weights for policy 0, policy_version 51550 (0.0006) [2023-03-07 04:29:45,082][118044] Updated weights for policy 0, policy_version 51560 (0.0006) [2023-03-07 04:29:45,847][118044] Updated weights for policy 0, policy_version 51570 (0.0006) [2023-03-07 04:29:46,085][117718] Fps is (10 sec: 13209.8, 60 sec: 13175.5, 300 sec: 13159.3). Total num frames: 52810752. Throughput: 0: 13166.8. Samples: 52789677. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:29:46,086][117718] Avg episode reward: [(0, '2924.034')] [2023-03-07 04:29:46,624][118044] Updated weights for policy 0, policy_version 51580 (0.0006) [2023-03-07 04:29:47,410][118044] Updated weights for policy 0, policy_version 51590 (0.0006) [2023-03-07 04:29:48,176][118044] Updated weights for policy 0, policy_version 51600 (0.0006) [2023-03-07 04:29:48,973][118044] Updated weights for policy 0, policy_version 51610 (0.0006) [2023-03-07 04:29:49,746][118044] Updated weights for policy 0, policy_version 51620 (0.0006) [2023-03-07 04:29:50,513][118044] Updated weights for policy 0, policy_version 51630 (0.0006) [2023-03-07 04:29:51,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13155.8). Total num frames: 52876288. Throughput: 0: 13167.3. Samples: 52868864. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:29:51,086][117718] Avg episode reward: [(0, '2894.362')] [2023-03-07 04:29:51,291][118044] Updated weights for policy 0, policy_version 51640 (0.0006) [2023-03-07 04:29:52,062][118044] Updated weights for policy 0, policy_version 51650 (0.0007) [2023-03-07 04:29:52,826][118044] Updated weights for policy 0, policy_version 51660 (0.0006) [2023-03-07 04:29:53,618][118044] Updated weights for policy 0, policy_version 51670 (0.0006) [2023-03-07 04:29:54,377][118044] Updated weights for policy 0, policy_version 51680 (0.0006) [2023-03-07 04:29:55,142][118044] Updated weights for policy 0, policy_version 51690 (0.0006) [2023-03-07 04:29:55,956][118044] Updated weights for policy 0, policy_version 51700 (0.0007) [2023-03-07 04:29:56,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13175.4, 300 sec: 13155.8). Total num frames: 52941824. Throughput: 0: 13164.7. Samples: 52908465. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:29:56,086][117718] Avg episode reward: [(0, '2905.108')] [2023-03-07 04:29:56,736][118044] Updated weights for policy 0, policy_version 51710 (0.0006) [2023-03-07 04:29:57,496][118044] Updated weights for policy 0, policy_version 51720 (0.0006) [2023-03-07 04:29:58,273][118044] Updated weights for policy 0, policy_version 51730 (0.0007) [2023-03-07 04:29:59,053][118044] Updated weights for policy 0, policy_version 51740 (0.0006) [2023-03-07 04:29:59,842][118044] Updated weights for policy 0, policy_version 51750 (0.0007) [2023-03-07 04:30:00,617][118044] Updated weights for policy 0, policy_version 51760 (0.0006) [2023-03-07 04:30:01,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13159.3). Total num frames: 53008384. Throughput: 0: 13167.1. Samples: 52987487. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:30:01,086][117718] Avg episode reward: [(0, '2964.139')] [2023-03-07 04:30:01,406][118044] Updated weights for policy 0, policy_version 51770 (0.0006) [2023-03-07 04:30:02,177][118044] Updated weights for policy 0, policy_version 51780 (0.0006) [2023-03-07 04:30:02,953][118044] Updated weights for policy 0, policy_version 51790 (0.0006) [2023-03-07 04:30:03,722][118044] Updated weights for policy 0, policy_version 51800 (0.0006) [2023-03-07 04:30:04,498][118044] Updated weights for policy 0, policy_version 51810 (0.0007) [2023-03-07 04:30:05,277][118044] Updated weights for policy 0, policy_version 51820 (0.0006) [2023-03-07 04:30:06,049][118044] Updated weights for policy 0, policy_version 51830 (0.0005) [2023-03-07 04:30:06,085][117718] Fps is (10 sec: 13209.8, 60 sec: 13158.4, 300 sec: 13159.3). Total num frames: 53073920. Throughput: 0: 13167.7. Samples: 53066522. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:30:06,086][117718] Avg episode reward: [(0, '2839.009')] [2023-03-07 04:30:06,829][118044] Updated weights for policy 0, policy_version 51840 (0.0007) [2023-03-07 04:30:07,612][118044] Updated weights for policy 0, policy_version 51850 (0.0006) [2023-03-07 04:30:08,366][118044] Updated weights for policy 0, policy_version 51860 (0.0006) [2023-03-07 04:30:09,159][118044] Updated weights for policy 0, policy_version 51870 (0.0007) [2023-03-07 04:30:09,942][118044] Updated weights for policy 0, policy_version 51880 (0.0005) [2023-03-07 04:30:10,717][118044] Updated weights for policy 0, policy_version 51890 (0.0006) [2023-03-07 04:30:11,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13159.3). Total num frames: 53139456. Throughput: 0: 13164.0. Samples: 53105946. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:30:11,086][117718] Avg episode reward: [(0, '2797.155')] [2023-03-07 04:30:11,487][118044] Updated weights for policy 0, policy_version 51900 (0.0006) [2023-03-07 04:30:12,267][118044] Updated weights for policy 0, policy_version 51910 (0.0007) [2023-03-07 04:30:13,041][118044] Updated weights for policy 0, policy_version 51920 (0.0007) [2023-03-07 04:30:13,813][118044] Updated weights for policy 0, policy_version 51930 (0.0005) [2023-03-07 04:30:14,592][118044] Updated weights for policy 0, policy_version 51940 (0.0006) [2023-03-07 04:30:15,365][118044] Updated weights for policy 0, policy_version 51950 (0.0007) [2023-03-07 04:30:16,086][117718] Fps is (10 sec: 13209.4, 60 sec: 13175.5, 300 sec: 13162.7). Total num frames: 53206016. Throughput: 0: 13177.8. Samples: 53185243. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:30:16,097][117718] Avg episode reward: [(0, '2938.730')] [2023-03-07 04:30:16,137][118044] Updated weights for policy 0, policy_version 51960 (0.0006) [2023-03-07 04:30:16,917][118044] Updated weights for policy 0, policy_version 51970 (0.0006) [2023-03-07 04:30:17,711][118044] Updated weights for policy 0, policy_version 51980 (0.0006) [2023-03-07 04:30:18,470][118044] Updated weights for policy 0, policy_version 51990 (0.0006) [2023-03-07 04:30:19,252][118044] Updated weights for policy 0, policy_version 52000 (0.0006) [2023-03-07 04:30:20,027][118044] Updated weights for policy 0, policy_version 52010 (0.0006) [2023-03-07 04:30:20,787][118044] Updated weights for policy 0, policy_version 52020 (0.0006) [2023-03-07 04:30:21,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13162.7). Total num frames: 53271552. Throughput: 0: 13179.1. Samples: 53264413. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:30:21,086][117718] Avg episode reward: [(0, '2832.949')] [2023-03-07 04:30:21,601][118044] Updated weights for policy 0, policy_version 52030 (0.0006) [2023-03-07 04:30:22,368][118044] Updated weights for policy 0, policy_version 52040 (0.0007) [2023-03-07 04:30:23,169][118044] Updated weights for policy 0, policy_version 52050 (0.0007) [2023-03-07 04:30:23,959][118044] Updated weights for policy 0, policy_version 52060 (0.0006) [2023-03-07 04:30:24,726][118044] Updated weights for policy 0, policy_version 52070 (0.0006) [2023-03-07 04:30:25,528][118044] Updated weights for policy 0, policy_version 52080 (0.0005) [2023-03-07 04:30:26,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13162.7). Total num frames: 53337088. Throughput: 0: 13174.0. Samples: 53303425. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:30:26,086][117718] Avg episode reward: [(0, '2923.500')] [2023-03-07 04:30:26,318][118044] Updated weights for policy 0, policy_version 52090 (0.0006) [2023-03-07 04:30:27,106][118044] Updated weights for policy 0, policy_version 52100 (0.0006) [2023-03-07 04:30:27,880][118044] Updated weights for policy 0, policy_version 52110 (0.0006) [2023-03-07 04:30:28,655][118044] Updated weights for policy 0, policy_version 52120 (0.0006) [2023-03-07 04:30:29,427][118044] Updated weights for policy 0, policy_version 52130 (0.0006) [2023-03-07 04:30:30,201][118044] Updated weights for policy 0, policy_version 52140 (0.0006) [2023-03-07 04:30:30,976][118044] Updated weights for policy 0, policy_version 52150 (0.0006) [2023-03-07 04:30:31,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13162.7). Total num frames: 53402624. Throughput: 0: 13165.5. Samples: 53382126. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:30:31,086][117718] Avg episode reward: [(0, '2998.066')] [2023-03-07 04:30:31,750][118044] Updated weights for policy 0, policy_version 52160 (0.0006) [2023-03-07 04:30:32,524][118044] Updated weights for policy 0, policy_version 52170 (0.0006) [2023-03-07 04:30:33,288][118044] Updated weights for policy 0, policy_version 52180 (0.0005) [2023-03-07 04:30:34,075][118044] Updated weights for policy 0, policy_version 52190 (0.0006) [2023-03-07 04:30:34,849][118044] Updated weights for policy 0, policy_version 52200 (0.0006) [2023-03-07 04:30:35,625][118044] Updated weights for policy 0, policy_version 52210 (0.0006) [2023-03-07 04:30:36,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13159.3). Total num frames: 53468160. Throughput: 0: 13164.8. Samples: 53461279. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:30:36,086][117718] Avg episode reward: [(0, '2906.798')] [2023-03-07 04:30:36,101][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000052216_53469184.pth... 
[2023-03-07 04:30:36,131][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000049131_50310144.pth [2023-03-07 04:30:36,414][118044] Updated weights for policy 0, policy_version 52220 (0.0006) [2023-03-07 04:30:37,182][118044] Updated weights for policy 0, policy_version 52230 (0.0005) [2023-03-07 04:30:37,949][118044] Updated weights for policy 0, policy_version 52240 (0.0005) [2023-03-07 04:30:38,729][118044] Updated weights for policy 0, policy_version 52250 (0.0006) [2023-03-07 04:30:39,493][118044] Updated weights for policy 0, policy_version 52260 (0.0007) [2023-03-07 04:30:40,276][118044] Updated weights for policy 0, policy_version 52270 (0.0006) [2023-03-07 04:30:41,053][118044] Updated weights for policy 0, policy_version 52280 (0.0007) [2023-03-07 04:30:41,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13162.7). Total num frames: 53534720. Throughput: 0: 13163.4. Samples: 53500817. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:30:41,086][117718] Avg episode reward: [(0, '3050.292')] [2023-03-07 04:30:41,826][118044] Updated weights for policy 0, policy_version 52290 (0.0006) [2023-03-07 04:30:42,598][118044] Updated weights for policy 0, policy_version 52300 (0.0006) [2023-03-07 04:30:43,374][118044] Updated weights for policy 0, policy_version 52310 (0.0006) [2023-03-07 04:30:44,163][118044] Updated weights for policy 0, policy_version 52320 (0.0006) [2023-03-07 04:30:44,941][118044] Updated weights for policy 0, policy_version 52330 (0.0006) [2023-03-07 04:30:45,720][118044] Updated weights for policy 0, policy_version 52340 (0.0006) [2023-03-07 04:30:46,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13162.7). Total num frames: 53600256. Throughput: 0: 13164.4. Samples: 53579886. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:30:46,086][117718] Avg episode reward: [(0, '2985.008')] [2023-03-07 04:30:46,500][118044] Updated weights for policy 0, policy_version 52350 (0.0006) [2023-03-07 04:30:47,273][118044] Updated weights for policy 0, policy_version 52360 (0.0006) [2023-03-07 04:30:48,035][118044] Updated weights for policy 0, policy_version 52370 (0.0006) [2023-03-07 04:30:48,822][118044] Updated weights for policy 0, policy_version 52380 (0.0006) [2023-03-07 04:30:49,592][118044] Updated weights for policy 0, policy_version 52390 (0.0008) [2023-03-07 04:30:50,352][118044] Updated weights for policy 0, policy_version 52400 (0.0007) [2023-03-07 04:30:51,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13162.7). Total num frames: 53666816. Throughput: 0: 13168.6. Samples: 53659109. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:30:51,086][117718] Avg episode reward: [(0, '2946.067')] [2023-03-07 04:30:51,147][118044] Updated weights for policy 0, policy_version 52410 (0.0006) [2023-03-07 04:30:51,930][118044] Updated weights for policy 0, policy_version 52420 (0.0006) [2023-03-07 04:30:52,698][118044] Updated weights for policy 0, policy_version 52430 (0.0006) [2023-03-07 04:30:53,489][118044] Updated weights for policy 0, policy_version 52440 (0.0006) [2023-03-07 04:30:54,278][118044] Updated weights for policy 0, policy_version 52450 (0.0006) [2023-03-07 04:30:55,037][118044] Updated weights for policy 0, policy_version 52460 (0.0006) [2023-03-07 04:30:55,813][118044] Updated weights for policy 0, policy_version 52470 (0.0007) [2023-03-07 04:30:56,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13166.2). Total num frames: 53732352. Throughput: 0: 13167.8. Samples: 53698497. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:30:56,086][117718] Avg episode reward: [(0, '2902.831')] [2023-03-07 04:30:56,580][118044] Updated weights for policy 0, policy_version 52480 (0.0007) [2023-03-07 04:30:57,374][118044] Updated weights for policy 0, policy_version 52490 (0.0006) [2023-03-07 04:30:58,159][118044] Updated weights for policy 0, policy_version 52500 (0.0006) [2023-03-07 04:30:58,944][118044] Updated weights for policy 0, policy_version 52510 (0.0006) [2023-03-07 04:30:59,714][118044] Updated weights for policy 0, policy_version 52520 (0.0006) [2023-03-07 04:31:00,475][118044] Updated weights for policy 0, policy_version 52530 (0.0006) [2023-03-07 04:31:01,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13162.7). Total num frames: 53797888. Throughput: 0: 13162.0. Samples: 53777530. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:31:01,086][117718] Avg episode reward: [(0, '2877.640')] [2023-03-07 04:31:01,239][118044] Updated weights for policy 0, policy_version 52540 (0.0006) [2023-03-07 04:31:02,014][118044] Updated weights for policy 0, policy_version 52550 (0.0006) [2023-03-07 04:31:02,794][118044] Updated weights for policy 0, policy_version 52560 (0.0006) [2023-03-07 04:31:03,559][118044] Updated weights for policy 0, policy_version 52570 (0.0006) [2023-03-07 04:31:04,328][118044] Updated weights for policy 0, policy_version 52580 (0.0007) [2023-03-07 04:31:05,094][118044] Updated weights for policy 0, policy_version 52590 (0.0006) [2023-03-07 04:31:05,874][118044] Updated weights for policy 0, policy_version 52600 (0.0006) [2023-03-07 04:31:06,086][117718] Fps is (10 sec: 13209.4, 60 sec: 13175.4, 300 sec: 13166.2). Total num frames: 53864448. Throughput: 0: 13175.7. Samples: 53857323. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:31:06,086][117718] Avg episode reward: [(0, '2862.817')] [2023-03-07 04:31:06,652][118044] Updated weights for policy 0, policy_version 52610 (0.0006) [2023-03-07 04:31:07,434][118044] Updated weights for policy 0, policy_version 52620 (0.0006) [2023-03-07 04:31:08,199][118044] Updated weights for policy 0, policy_version 52630 (0.0006) [2023-03-07 04:31:08,973][118044] Updated weights for policy 0, policy_version 52640 (0.0006) [2023-03-07 04:31:09,770][118044] Updated weights for policy 0, policy_version 52650 (0.0007) [2023-03-07 04:31:10,551][118044] Updated weights for policy 0, policy_version 52660 (0.0005) [2023-03-07 04:31:11,085][117718] Fps is (10 sec: 13312.0, 60 sec: 13192.5, 300 sec: 13166.2). Total num frames: 53931008. Throughput: 0: 13190.5. Samples: 53896996. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:31:11,086][117718] Avg episode reward: [(0, '2939.282')] [2023-03-07 04:31:11,326][118044] Updated weights for policy 0, policy_version 52670 (0.0006) [2023-03-07 04:31:12,097][118044] Updated weights for policy 0, policy_version 52680 (0.0006) [2023-03-07 04:31:12,870][118044] Updated weights for policy 0, policy_version 52690 (0.0006) [2023-03-07 04:31:13,647][118044] Updated weights for policy 0, policy_version 52700 (0.0007) [2023-03-07 04:31:14,424][118044] Updated weights for policy 0, policy_version 52710 (0.0006) [2023-03-07 04:31:15,204][118044] Updated weights for policy 0, policy_version 52720 (0.0006) [2023-03-07 04:31:15,957][118044] Updated weights for policy 0, policy_version 52730 (0.0006) [2023-03-07 04:31:16,086][117718] Fps is (10 sec: 13209.8, 60 sec: 13175.5, 300 sec: 13166.2). Total num frames: 53996544. Throughput: 0: 13195.3. Samples: 53975915. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:31:16,086][117718] Avg episode reward: [(0, '2900.140')] [2023-03-07 04:31:16,752][118044] Updated weights for policy 0, policy_version 52740 (0.0006) [2023-03-07 04:31:17,517][118044] Updated weights for policy 0, policy_version 52750 (0.0006) [2023-03-07 04:31:18,285][118044] Updated weights for policy 0, policy_version 52760 (0.0006) [2023-03-07 04:31:19,073][118044] Updated weights for policy 0, policy_version 52770 (0.0006) [2023-03-07 04:31:19,856][118044] Updated weights for policy 0, policy_version 52780 (0.0006) [2023-03-07 04:31:20,625][118044] Updated weights for policy 0, policy_version 52790 (0.0006) [2023-03-07 04:31:21,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13175.4, 300 sec: 13166.2). Total num frames: 54062080. Throughput: 0: 13197.2. Samples: 54055154. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:31:21,086][117718] Avg episode reward: [(0, '2898.205')] [2023-03-07 04:31:21,425][118044] Updated weights for policy 0, policy_version 52800 (0.0006) [2023-03-07 04:31:22,207][118044] Updated weights for policy 0, policy_version 52810 (0.0006) [2023-03-07 04:31:22,966][118044] Updated weights for policy 0, policy_version 52820 (0.0006) [2023-03-07 04:31:23,733][118044] Updated weights for policy 0, policy_version 52830 (0.0007) [2023-03-07 04:31:24,511][118044] Updated weights for policy 0, policy_version 52840 (0.0006) [2023-03-07 04:31:25,298][118044] Updated weights for policy 0, policy_version 52850 (0.0006) [2023-03-07 04:31:26,061][118044] Updated weights for policy 0, policy_version 52860 (0.0006) [2023-03-07 04:31:26,086][117718] Fps is (10 sec: 13209.4, 60 sec: 13192.5, 300 sec: 13169.7). Total num frames: 54128640. Throughput: 0: 13194.8. Samples: 54094586. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:31:26,086][117718] Avg episode reward: [(0, '2862.296')] [2023-03-07 04:31:26,848][118044] Updated weights for policy 0, policy_version 52870 (0.0006) [2023-03-07 04:31:27,630][118044] Updated weights for policy 0, policy_version 52880 (0.0006) [2023-03-07 04:31:28,412][118044] Updated weights for policy 0, policy_version 52890 (0.0006) [2023-03-07 04:31:29,168][118044] Updated weights for policy 0, policy_version 52900 (0.0005) [2023-03-07 04:31:29,954][118044] Updated weights for policy 0, policy_version 52910 (0.0006) [2023-03-07 04:31:30,719][118044] Updated weights for policy 0, policy_version 52920 (0.0006) [2023-03-07 04:31:31,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13192.5, 300 sec: 13166.2). Total num frames: 54194176. Throughput: 0: 13194.5. Samples: 54173637. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:31:31,086][117718] Avg episode reward: [(0, '2985.575')] [2023-03-07 04:31:31,501][118044] Updated weights for policy 0, policy_version 52930 (0.0006) [2023-03-07 04:31:32,286][118044] Updated weights for policy 0, policy_version 52940 (0.0006) [2023-03-07 04:31:33,055][118044] Updated weights for policy 0, policy_version 52950 (0.0006) [2023-03-07 04:31:33,827][118044] Updated weights for policy 0, policy_version 52960 (0.0006) [2023-03-07 04:31:34,628][118044] Updated weights for policy 0, policy_version 52970 (0.0006) [2023-03-07 04:31:35,400][118044] Updated weights for policy 0, policy_version 52980 (0.0006) [2023-03-07 04:31:36,086][117718] Fps is (10 sec: 13107.3, 60 sec: 13192.5, 300 sec: 13169.7). Total num frames: 54259712. Throughput: 0: 13192.3. Samples: 54252766. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:31:36,086][117718] Avg episode reward: [(0, '2907.557')] [2023-03-07 04:31:36,178][118044] Updated weights for policy 0, policy_version 52990 (0.0007) [2023-03-07 04:31:36,949][118044] Updated weights for policy 0, policy_version 53000 (0.0005) [2023-03-07 04:31:37,727][118044] Updated weights for policy 0, policy_version 53010 (0.0006) [2023-03-07 04:31:38,500][118044] Updated weights for policy 0, policy_version 53020 (0.0007) [2023-03-07 04:31:39,275][118044] Updated weights for policy 0, policy_version 53030 (0.0005) [2023-03-07 04:31:40,053][118044] Updated weights for policy 0, policy_version 53040 (0.0006) [2023-03-07 04:31:40,829][118044] Updated weights for policy 0, policy_version 53050 (0.0007) [2023-03-07 04:31:41,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13173.2). Total num frames: 54326272. Throughput: 0: 13194.3. Samples: 54292241. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:31:41,086][117718] Avg episode reward: [(0, '2886.321')] [2023-03-07 04:31:41,608][118044] Updated weights for policy 0, policy_version 53060 (0.0006) [2023-03-07 04:31:42,382][118044] Updated weights for policy 0, policy_version 53070 (0.0006) [2023-03-07 04:31:43,137][118044] Updated weights for policy 0, policy_version 53080 (0.0005) [2023-03-07 04:31:43,912][118044] Updated weights for policy 0, policy_version 53090 (0.0007) [2023-03-07 04:31:44,689][118044] Updated weights for policy 0, policy_version 53100 (0.0006) [2023-03-07 04:31:45,469][118044] Updated weights for policy 0, policy_version 53110 (0.0006) [2023-03-07 04:31:46,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13192.5, 300 sec: 13173.2). Total num frames: 54391808. Throughput: 0: 13202.4. Samples: 54371638. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:31:46,086][117718] Avg episode reward: [(0, '2907.227')] [2023-03-07 04:31:46,243][118044] Updated weights for policy 0, policy_version 53120 (0.0006) [2023-03-07 04:31:47,029][118044] Updated weights for policy 0, policy_version 53130 (0.0006) [2023-03-07 04:31:47,797][118044] Updated weights for policy 0, policy_version 53140 (0.0006) [2023-03-07 04:31:48,582][118044] Updated weights for policy 0, policy_version 53150 (0.0006) [2023-03-07 04:31:49,371][118044] Updated weights for policy 0, policy_version 53160 (0.0006) [2023-03-07 04:31:50,148][118044] Updated weights for policy 0, policy_version 53170 (0.0008) [2023-03-07 04:31:50,944][118044] Updated weights for policy 0, policy_version 53180 (0.0007) [2023-03-07 04:31:51,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13175.5, 300 sec: 13169.7). Total num frames: 54457344. Throughput: 0: 13185.4. Samples: 54450665. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:31:51,086][117718] Avg episode reward: [(0, '2853.583')] [2023-03-07 04:31:51,702][118044] Updated weights for policy 0, policy_version 53190 (0.0005) [2023-03-07 04:31:52,479][118044] Updated weights for policy 0, policy_version 53200 (0.0006) [2023-03-07 04:31:53,244][118044] Updated weights for policy 0, policy_version 53210 (0.0006) [2023-03-07 04:31:54,018][118044] Updated weights for policy 0, policy_version 53220 (0.0006) [2023-03-07 04:31:54,796][118044] Updated weights for policy 0, policy_version 53230 (0.0006) [2023-03-07 04:31:55,564][118044] Updated weights for policy 0, policy_version 53240 (0.0006) [2023-03-07 04:31:56,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13192.5, 300 sec: 13173.2). Total num frames: 54523904. Throughput: 0: 13186.5. Samples: 54490391. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:31:56,086][117718] Avg episode reward: [(0, '2908.242')] [2023-03-07 04:31:56,366][118044] Updated weights for policy 0, policy_version 53250 (0.0006) [2023-03-07 04:31:57,127][118044] Updated weights for policy 0, policy_version 53260 (0.0006) [2023-03-07 04:31:57,913][118044] Updated weights for policy 0, policy_version 53270 (0.0006) [2023-03-07 04:31:58,682][118044] Updated weights for policy 0, policy_version 53280 (0.0006) [2023-03-07 04:31:59,474][118044] Updated weights for policy 0, policy_version 53290 (0.0006) [2023-03-07 04:32:00,260][118044] Updated weights for policy 0, policy_version 53300 (0.0006) [2023-03-07 04:32:01,031][118044] Updated weights for policy 0, policy_version 53310 (0.0007) [2023-03-07 04:32:01,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13173.2). Total num frames: 54589440. Throughput: 0: 13182.8. Samples: 54569141. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:32:01,086][117718] Avg episode reward: [(0, '3003.751')] [2023-03-07 04:32:01,799][118044] Updated weights for policy 0, policy_version 53320 (0.0007) [2023-03-07 04:32:02,578][118044] Updated weights for policy 0, policy_version 53330 (0.0006) [2023-03-07 04:32:03,364][118044] Updated weights for policy 0, policy_version 53340 (0.0006) [2023-03-07 04:32:04,115][118044] Updated weights for policy 0, policy_version 53350 (0.0006) [2023-03-07 04:32:04,897][118044] Updated weights for policy 0, policy_version 53360 (0.0006) [2023-03-07 04:32:05,667][118044] Updated weights for policy 0, policy_version 53370 (0.0006) [2023-03-07 04:32:06,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13192.6, 300 sec: 13173.1). Total num frames: 54656000. Throughput: 0: 13178.8. Samples: 54648201. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:32:06,086][117718] Avg episode reward: [(0, '3124.408')] [2023-03-07 04:32:06,447][118044] Updated weights for policy 0, policy_version 53380 (0.0007) [2023-03-07 04:32:07,222][118044] Updated weights for policy 0, policy_version 53390 (0.0006) [2023-03-07 04:32:08,003][118044] Updated weights for policy 0, policy_version 53400 (0.0006) [2023-03-07 04:32:08,776][118044] Updated weights for policy 0, policy_version 53410 (0.0006) [2023-03-07 04:32:09,539][118044] Updated weights for policy 0, policy_version 53420 (0.0006) [2023-03-07 04:32:10,348][118044] Updated weights for policy 0, policy_version 53430 (0.0007) [2023-03-07 04:32:11,086][117718] Fps is (10 sec: 13209.4, 60 sec: 13175.4, 300 sec: 13173.1). Total num frames: 54721536. Throughput: 0: 13187.3. Samples: 54688013. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:32:11,086][117718] Avg episode reward: [(0, '3035.778')] [2023-03-07 04:32:11,105][118044] Updated weights for policy 0, policy_version 53440 (0.0006) [2023-03-07 04:32:11,875][118044] Updated weights for policy 0, policy_version 53450 (0.0006) [2023-03-07 04:32:12,666][118044] Updated weights for policy 0, policy_version 53460 (0.0007) [2023-03-07 04:32:13,433][118044] Updated weights for policy 0, policy_version 53470 (0.0006) [2023-03-07 04:32:14,204][118044] Updated weights for policy 0, policy_version 53480 (0.0006) [2023-03-07 04:32:14,977][118044] Updated weights for policy 0, policy_version 53490 (0.0006) [2023-03-07 04:32:15,749][118044] Updated weights for policy 0, policy_version 53500 (0.0006) [2023-03-07 04:32:16,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13192.5, 300 sec: 13176.6). Total num frames: 54788096. Throughput: 0: 13191.1. Samples: 54767241. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:32:16,086][117718] Avg episode reward: [(0, '3057.802')] [2023-03-07 04:32:16,525][118044] Updated weights for policy 0, policy_version 53510 (0.0006) [2023-03-07 04:32:17,306][118044] Updated weights for policy 0, policy_version 53520 (0.0006) [2023-03-07 04:32:18,093][118044] Updated weights for policy 0, policy_version 53530 (0.0006) [2023-03-07 04:32:18,870][118044] Updated weights for policy 0, policy_version 53540 (0.0005) [2023-03-07 04:32:19,640][118044] Updated weights for policy 0, policy_version 53550 (0.0006) [2023-03-07 04:32:20,430][118044] Updated weights for policy 0, policy_version 53560 (0.0005) [2023-03-07 04:32:21,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13176.6). Total num frames: 54853632. Throughput: 0: 13188.4. Samples: 54846244. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:32:21,086][117718] Avg episode reward: [(0, '2957.582')] [2023-03-07 04:32:21,213][118044] Updated weights for policy 0, policy_version 53570 (0.0007) [2023-03-07 04:32:21,971][118044] Updated weights for policy 0, policy_version 53580 (0.0006) [2023-03-07 04:32:22,770][118044] Updated weights for policy 0, policy_version 53590 (0.0006) [2023-03-07 04:32:23,545][118044] Updated weights for policy 0, policy_version 53600 (0.0005) [2023-03-07 04:32:24,297][118044] Updated weights for policy 0, policy_version 53610 (0.0005) [2023-03-07 04:32:25,085][118044] Updated weights for policy 0, policy_version 53620 (0.0006) [2023-03-07 04:32:25,865][118044] Updated weights for policy 0, policy_version 53630 (0.0006) [2023-03-07 04:32:26,086][117718] Fps is (10 sec: 13107.3, 60 sec: 13175.5, 300 sec: 13173.1). Total num frames: 54919168. Throughput: 0: 13188.6. Samples: 54885730. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:32:26,086][117718] Avg episode reward: [(0, '3089.981')] [2023-03-07 04:32:26,648][118044] Updated weights for policy 0, policy_version 53640 (0.0006) [2023-03-07 04:32:27,421][118044] Updated weights for policy 0, policy_version 53650 (0.0006) [2023-03-07 04:32:28,224][118044] Updated weights for policy 0, policy_version 53660 (0.0007) [2023-03-07 04:32:28,998][118044] Updated weights for policy 0, policy_version 53670 (0.0006) [2023-03-07 04:32:29,783][118044] Updated weights for policy 0, policy_version 53680 (0.0006) [2023-03-07 04:32:30,565][118044] Updated weights for policy 0, policy_version 53690 (0.0005) [2023-03-07 04:32:31,085][117718] Fps is (10 sec: 13107.4, 60 sec: 13175.5, 300 sec: 13173.2). Total num frames: 54984704. Throughput: 0: 13174.0. Samples: 54964469. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:32:31,086][117718] Avg episode reward: [(0, '3061.034')] [2023-03-07 04:32:31,337][118044] Updated weights for policy 0, policy_version 53700 (0.0006) [2023-03-07 04:32:32,102][118044] Updated weights for policy 0, policy_version 53710 (0.0006) [2023-03-07 04:32:32,867][118044] Updated weights for policy 0, policy_version 53720 (0.0006) [2023-03-07 04:32:33,649][118044] Updated weights for policy 0, policy_version 53730 (0.0007) [2023-03-07 04:32:34,412][118044] Updated weights for policy 0, policy_version 53740 (0.0006) [2023-03-07 04:32:35,206][118044] Updated weights for policy 0, policy_version 53750 (0.0006) [2023-03-07 04:32:35,981][118044] Updated weights for policy 0, policy_version 53760 (0.0006) [2023-03-07 04:32:36,085][117718] Fps is (10 sec: 13209.8, 60 sec: 13192.6, 300 sec: 13176.6). Total num frames: 55051264. Throughput: 0: 13180.7. Samples: 55043796. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:32:36,086][117718] Avg episode reward: [(0, '3009.322')] [2023-03-07 04:32:36,090][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000053761_55051264.pth... [2023-03-07 04:32:36,121][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000050672_51888128.pth [2023-03-07 04:32:36,745][118044] Updated weights for policy 0, policy_version 53770 (0.0005) [2023-03-07 04:32:37,538][118044] Updated weights for policy 0, policy_version 53780 (0.0006) [2023-03-07 04:32:38,317][118044] Updated weights for policy 0, policy_version 53790 (0.0007) [2023-03-07 04:32:39,085][118044] Updated weights for policy 0, policy_version 53800 (0.0006) [2023-03-07 04:32:39,859][118044] Updated weights for policy 0, policy_version 53810 (0.0006) [2023-03-07 04:32:40,639][118044] Updated weights for policy 0, policy_version 53820 (0.0007) [2023-03-07 04:32:41,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13173.2). Total num frames: 55116800. Throughput: 0: 13174.0. Samples: 55083219. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:32:41,086][117718] Avg episode reward: [(0, '2999.716')] [2023-03-07 04:32:41,392][118044] Updated weights for policy 0, policy_version 53830 (0.0006) [2023-03-07 04:32:42,180][118044] Updated weights for policy 0, policy_version 53840 (0.0006) [2023-03-07 04:32:42,949][118044] Updated weights for policy 0, policy_version 53850 (0.0006) [2023-03-07 04:32:43,721][118044] Updated weights for policy 0, policy_version 53860 (0.0006) [2023-03-07 04:32:44,502][118044] Updated weights for policy 0, policy_version 53870 (0.0006) [2023-03-07 04:32:45,263][118044] Updated weights for policy 0, policy_version 53880 (0.0007) [2023-03-07 04:32:46,038][118044] Updated weights for policy 0, policy_version 53890 (0.0006) [2023-03-07 04:32:46,086][117718] Fps is (10 sec: 13209.4, 60 sec: 13192.5, 300 sec: 13176.6). Total num frames: 55183360. Throughput: 0: 13190.3. Samples: 55162705. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:32:46,086][117718] Avg episode reward: [(0, '2984.580')] [2023-03-07 04:32:46,821][118044] Updated weights for policy 0, policy_version 53900 (0.0006) [2023-03-07 04:32:47,591][118044] Updated weights for policy 0, policy_version 53910 (0.0006) [2023-03-07 04:32:48,367][118044] Updated weights for policy 0, policy_version 53920 (0.0006) [2023-03-07 04:32:49,154][118044] Updated weights for policy 0, policy_version 53930 (0.0006) [2023-03-07 04:32:49,937][118044] Updated weights for policy 0, policy_version 53940 (0.0006) [2023-03-07 04:32:50,710][118044] Updated weights for policy 0, policy_version 53950 (0.0006) [2023-03-07 04:32:51,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13192.5, 300 sec: 13173.2). Total num frames: 55248896. Throughput: 0: 13191.7. Samples: 55241825. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:32:51,086][117718] Avg episode reward: [(0, '3047.270')] [2023-03-07 04:32:51,494][118044] Updated weights for policy 0, policy_version 53960 (0.0007) [2023-03-07 04:32:52,272][118044] Updated weights for policy 0, policy_version 53970 (0.0008) [2023-03-07 04:32:53,042][118044] Updated weights for policy 0, policy_version 53980 (0.0009) [2023-03-07 04:32:53,838][118044] Updated weights for policy 0, policy_version 53990 (0.0006) [2023-03-07 04:32:54,625][118044] Updated weights for policy 0, policy_version 54000 (0.0005) [2023-03-07 04:32:55,398][118044] Updated weights for policy 0, policy_version 54010 (0.0006) [2023-03-07 04:32:56,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13175.5, 300 sec: 13173.2). Total num frames: 55314432. Throughput: 0: 13182.8. Samples: 55281237. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:32:56,086][117718] Avg episode reward: [(0, '2970.679')] [2023-03-07 04:32:56,190][118044] Updated weights for policy 0, policy_version 54020 (0.0006) [2023-03-07 04:32:56,974][118044] Updated weights for policy 0, policy_version 54030 (0.0006) [2023-03-07 04:32:57,732][118044] Updated weights for policy 0, policy_version 54040 (0.0006) [2023-03-07 04:32:58,499][118044] Updated weights for policy 0, policy_version 54050 (0.0006) [2023-03-07 04:32:59,305][118044] Updated weights for policy 0, policy_version 54060 (0.0006) [2023-03-07 04:33:00,083][118044] Updated weights for policy 0, policy_version 54070 (0.0006) [2023-03-07 04:33:00,863][118044] Updated weights for policy 0, policy_version 54080 (0.0006) [2023-03-07 04:33:01,085][117718] Fps is (10 sec: 13209.8, 60 sec: 13192.5, 300 sec: 13176.6). Total num frames: 55380992. Throughput: 0: 13171.1. Samples: 55359936. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:33:01,086][117718] Avg episode reward: [(0, '2895.513')] [2023-03-07 04:33:01,630][118044] Updated weights for policy 0, policy_version 54090 (0.0006) [2023-03-07 04:33:02,407][118044] Updated weights for policy 0, policy_version 54100 (0.0006) [2023-03-07 04:33:03,197][118044] Updated weights for policy 0, policy_version 54110 (0.0006) [2023-03-07 04:33:03,996][118044] Updated weights for policy 0, policy_version 54120 (0.0006) [2023-03-07 04:33:04,769][118044] Updated weights for policy 0, policy_version 54130 (0.0006) [2023-03-07 04:33:05,548][118044] Updated weights for policy 0, policy_version 54140 (0.0006) [2023-03-07 04:33:06,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13176.6). Total num frames: 55446528. Throughput: 0: 13166.1. Samples: 55438718. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:33:06,086][117718] Avg episode reward: [(0, '3007.850')] [2023-03-07 04:33:06,309][118044] Updated weights for policy 0, policy_version 54150 (0.0006) [2023-03-07 04:33:07,067][118044] Updated weights for policy 0, policy_version 54160 (0.0006) [2023-03-07 04:33:07,851][118044] Updated weights for policy 0, policy_version 54170 (0.0006) [2023-03-07 04:33:08,636][118044] Updated weights for policy 0, policy_version 54180 (0.0006) [2023-03-07 04:33:09,419][118044] Updated weights for policy 0, policy_version 54190 (0.0006) [2023-03-07 04:33:10,196][118044] Updated weights for policy 0, policy_version 54200 (0.0007) [2023-03-07 04:33:10,995][118044] Updated weights for policy 0, policy_version 54210 (0.0006) [2023-03-07 04:33:11,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13175.5, 300 sec: 13173.2). Total num frames: 55512064. Throughput: 0: 13169.7. Samples: 55478364. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:33:11,086][117718] Avg episode reward: [(0, '2908.451')] [2023-03-07 04:33:11,764][118044] Updated weights for policy 0, policy_version 54220 (0.0005) [2023-03-07 04:33:12,558][118044] Updated weights for policy 0, policy_version 54230 (0.0007) [2023-03-07 04:33:13,319][118044] Updated weights for policy 0, policy_version 54240 (0.0006) [2023-03-07 04:33:14,104][118044] Updated weights for policy 0, policy_version 54250 (0.0006) [2023-03-07 04:33:14,885][118044] Updated weights for policy 0, policy_version 54260 (0.0006) [2023-03-07 04:33:15,675][118044] Updated weights for policy 0, policy_version 54270 (0.0007) [2023-03-07 04:33:16,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13173.2). Total num frames: 55577600. Throughput: 0: 13167.2. Samples: 55556993. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:33:16,086][117718] Avg episode reward: [(0, '2915.801')] [2023-03-07 04:33:16,457][118044] Updated weights for policy 0, policy_version 54280 (0.0006) [2023-03-07 04:33:17,240][118044] Updated weights for policy 0, policy_version 54290 (0.0006) [2023-03-07 04:33:18,005][118044] Updated weights for policy 0, policy_version 54300 (0.0006) [2023-03-07 04:33:18,762][118044] Updated weights for policy 0, policy_version 54310 (0.0007) [2023-03-07 04:33:19,538][118044] Updated weights for policy 0, policy_version 54320 (0.0006) [2023-03-07 04:33:20,309][118044] Updated weights for policy 0, policy_version 54330 (0.0006) [2023-03-07 04:33:21,079][118044] Updated weights for policy 0, policy_version 54340 (0.0006) [2023-03-07 04:33:21,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13176.6). Total num frames: 55644160. Throughput: 0: 13165.5. Samples: 55636245. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:33:21,086][117718] Avg episode reward: [(0, '2913.555')] [2023-03-07 04:33:21,870][118044] Updated weights for policy 0, policy_version 54350 (0.0006) [2023-03-07 04:33:22,646][118044] Updated weights for policy 0, policy_version 54360 (0.0007) [2023-03-07 04:33:23,426][118044] Updated weights for policy 0, policy_version 54370 (0.0007) [2023-03-07 04:33:24,194][118044] Updated weights for policy 0, policy_version 54380 (0.0005) [2023-03-07 04:33:24,975][118044] Updated weights for policy 0, policy_version 54390 (0.0005) [2023-03-07 04:33:25,738][118044] Updated weights for policy 0, policy_version 54400 (0.0005) [2023-03-07 04:33:26,086][117718] Fps is (10 sec: 13209.4, 60 sec: 13175.5, 300 sec: 13176.6). Total num frames: 55709696. Throughput: 0: 13168.7. Samples: 55675815. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:33:26,086][117718] Avg episode reward: [(0, '2934.215')] [2023-03-07 04:33:26,510][118044] Updated weights for policy 0, policy_version 54410 (0.0006) [2023-03-07 04:33:27,299][118044] Updated weights for policy 0, policy_version 54420 (0.0006) [2023-03-07 04:33:28,082][118044] Updated weights for policy 0, policy_version 54430 (0.0006) [2023-03-07 04:33:28,850][118044] Updated weights for policy 0, policy_version 54440 (0.0006) [2023-03-07 04:33:29,633][118044] Updated weights for policy 0, policy_version 54450 (0.0006) [2023-03-07 04:33:30,413][118044] Updated weights for policy 0, policy_version 54460 (0.0006) [2023-03-07 04:33:31,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13175.5, 300 sec: 13176.6). Total num frames: 55775232. Throughput: 0: 13159.8. Samples: 55754896. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:33:31,086][117718] Avg episode reward: [(0, '2914.092')] [2023-03-07 04:33:31,195][118044] Updated weights for policy 0, policy_version 54470 (0.0006) [2023-03-07 04:33:31,973][118044] Updated weights for policy 0, policy_version 54480 (0.0006) [2023-03-07 04:33:32,767][118044] Updated weights for policy 0, policy_version 54490 (0.0006) [2023-03-07 04:33:33,538][118044] Updated weights for policy 0, policy_version 54500 (0.0006) [2023-03-07 04:33:34,333][118044] Updated weights for policy 0, policy_version 54510 (0.0006) [2023-03-07 04:33:35,101][118044] Updated weights for policy 0, policy_version 54520 (0.0006) [2023-03-07 04:33:35,877][118044] Updated weights for policy 0, policy_version 54530 (0.0006) [2023-03-07 04:33:36,085][117718] Fps is (10 sec: 13107.4, 60 sec: 13158.4, 300 sec: 13176.6). Total num frames: 55840768. Throughput: 0: 13150.2. Samples: 55833582. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:33:36,086][117718] Avg episode reward: [(0, '3042.582')] [2023-03-07 04:33:36,653][118044] Updated weights for policy 0, policy_version 54540 (0.0007) [2023-03-07 04:33:37,445][118044] Updated weights for policy 0, policy_version 54550 (0.0006) [2023-03-07 04:33:38,208][118044] Updated weights for policy 0, policy_version 54560 (0.0006) [2023-03-07 04:33:38,984][118044] Updated weights for policy 0, policy_version 54570 (0.0006) [2023-03-07 04:33:39,769][118044] Updated weights for policy 0, policy_version 54580 (0.0006) [2023-03-07 04:33:40,552][118044] Updated weights for policy 0, policy_version 54590 (0.0006) [2023-03-07 04:33:41,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13173.2). Total num frames: 55906304. Throughput: 0: 13154.3. Samples: 55873179. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:33:41,086][117718] Avg episode reward: [(0, '2957.716')] [2023-03-07 04:33:41,328][118044] Updated weights for policy 0, policy_version 54600 (0.0007) [2023-03-07 04:33:42,107][118044] Updated weights for policy 0, policy_version 54610 (0.0007) [2023-03-07 04:33:42,876][118044] Updated weights for policy 0, policy_version 54620 (0.0006) [2023-03-07 04:33:43,657][118044] Updated weights for policy 0, policy_version 54630 (0.0006) [2023-03-07 04:33:44,451][118044] Updated weights for policy 0, policy_version 54640 (0.0006) [2023-03-07 04:33:45,221][118044] Updated weights for policy 0, policy_version 54650 (0.0007) [2023-03-07 04:33:46,005][118044] Updated weights for policy 0, policy_version 54660 (0.0006) [2023-03-07 04:33:46,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13176.6). Total num frames: 55972864. Throughput: 0: 13158.2. Samples: 55952055. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:33:46,097][117718] Avg episode reward: [(0, '3022.045')] [2023-03-07 04:33:46,769][118044] Updated weights for policy 0, policy_version 54670 (0.0007) [2023-03-07 04:33:47,541][118044] Updated weights for policy 0, policy_version 54680 (0.0006) [2023-03-07 04:33:48,316][118044] Updated weights for policy 0, policy_version 54690 (0.0007) [2023-03-07 04:33:49,094][118044] Updated weights for policy 0, policy_version 54700 (0.0007) [2023-03-07 04:33:49,866][118044] Updated weights for policy 0, policy_version 54710 (0.0006) [2023-03-07 04:33:50,624][118044] Updated weights for policy 0, policy_version 54720 (0.0006) [2023-03-07 04:33:51,085][117718] Fps is (10 sec: 13209.8, 60 sec: 13158.4, 300 sec: 13176.6). Total num frames: 56038400. Throughput: 0: 13168.1. Samples: 56031280. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:33:51,086][117718] Avg episode reward: [(0, '3086.586')] [2023-03-07 04:33:51,415][118044] Updated weights for policy 0, policy_version 54730 (0.0006) [2023-03-07 04:33:52,193][118044] Updated weights for policy 0, policy_version 54740 (0.0006) [2023-03-07 04:33:52,959][118044] Updated weights for policy 0, policy_version 54750 (0.0006) [2023-03-07 04:33:53,736][118044] Updated weights for policy 0, policy_version 54760 (0.0006) [2023-03-07 04:33:54,485][118044] Updated weights for policy 0, policy_version 54770 (0.0006) [2023-03-07 04:33:55,265][118044] Updated weights for policy 0, policy_version 54780 (0.0007) [2023-03-07 04:33:56,026][118044] Updated weights for policy 0, policy_version 54790 (0.0006) [2023-03-07 04:33:56,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13176.6). Total num frames: 56104960. Throughput: 0: 13170.3. Samples: 56071030. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:33:56,086][117718] Avg episode reward: [(0, '2912.034')] [2023-03-07 04:33:56,811][118044] Updated weights for policy 0, policy_version 54800 (0.0006) [2023-03-07 04:33:57,586][118044] Updated weights for policy 0, policy_version 54810 (0.0006) [2023-03-07 04:33:58,362][118044] Updated weights for policy 0, policy_version 54820 (0.0007) [2023-03-07 04:33:59,142][118044] Updated weights for policy 0, policy_version 54830 (0.0006) [2023-03-07 04:33:59,928][118044] Updated weights for policy 0, policy_version 54840 (0.0006) [2023-03-07 04:34:00,698][118044] Updated weights for policy 0, policy_version 54850 (0.0007) [2023-03-07 04:34:01,085][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13173.2). Total num frames: 56170496. Throughput: 0: 13183.0. Samples: 56150230. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:34:01,086][117718] Avg episode reward: [(0, '3052.658')] [2023-03-07 04:34:01,487][118044] Updated weights for policy 0, policy_version 54860 (0.0006) [2023-03-07 04:34:02,258][118044] Updated weights for policy 0, policy_version 54870 (0.0007) [2023-03-07 04:34:03,053][118044] Updated weights for policy 0, policy_version 54880 (0.0006) [2023-03-07 04:34:03,829][118044] Updated weights for policy 0, policy_version 54890 (0.0007) [2023-03-07 04:34:04,602][118044] Updated weights for policy 0, policy_version 54900 (0.0006) [2023-03-07 04:34:05,387][118044] Updated weights for policy 0, policy_version 54910 (0.0005) [2023-03-07 04:34:06,085][117718] Fps is (10 sec: 13209.8, 60 sec: 13175.5, 300 sec: 13176.6). Total num frames: 56237056. Throughput: 0: 13174.9. Samples: 56229116. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:34:06,086][117718] Avg episode reward: [(0, '3126.758')] [2023-03-07 04:34:06,158][118044] Updated weights for policy 0, policy_version 54920 (0.0006) [2023-03-07 04:34:06,942][118044] Updated weights for policy 0, policy_version 54930 (0.0006) [2023-03-07 04:34:07,698][118044] Updated weights for policy 0, policy_version 54940 (0.0005) [2023-03-07 04:34:08,472][118044] Updated weights for policy 0, policy_version 54950 (0.0006) [2023-03-07 04:34:09,261][118044] Updated weights for policy 0, policy_version 54960 (0.0006) [2023-03-07 04:34:10,021][118044] Updated weights for policy 0, policy_version 54970 (0.0006) [2023-03-07 04:34:10,804][118044] Updated weights for policy 0, policy_version 54980 (0.0006) [2023-03-07 04:34:11,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13175.5, 300 sec: 13176.6). Total num frames: 56302592. Throughput: 0: 13181.4. Samples: 56268977. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:34:11,086][117718] Avg episode reward: [(0, '3011.070')] [2023-03-07 04:34:11,592][118044] Updated weights for policy 0, policy_version 54990 (0.0006) [2023-03-07 04:34:12,348][118044] Updated weights for policy 0, policy_version 55000 (0.0005) [2023-03-07 04:34:13,146][118044] Updated weights for policy 0, policy_version 55010 (0.0006) [2023-03-07 04:34:13,931][118044] Updated weights for policy 0, policy_version 55020 (0.0006) [2023-03-07 04:34:14,700][118044] Updated weights for policy 0, policy_version 55030 (0.0006) [2023-03-07 04:34:15,469][118044] Updated weights for policy 0, policy_version 55040 (0.0007) [2023-03-07 04:34:16,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13175.5, 300 sec: 13176.6). Total num frames: 56368128. Throughput: 0: 13179.3. Samples: 56347965. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:34:16,086][117718] Avg episode reward: [(0, '3072.629')] [2023-03-07 04:34:16,281][118044] Updated weights for policy 0, policy_version 55050 (0.0006) [2023-03-07 04:34:17,045][118044] Updated weights for policy 0, policy_version 55060 (0.0006) [2023-03-07 04:34:17,832][118044] Updated weights for policy 0, policy_version 55070 (0.0006) [2023-03-07 04:34:18,609][118044] Updated weights for policy 0, policy_version 55080 (0.0006) [2023-03-07 04:34:19,390][118044] Updated weights for policy 0, policy_version 55090 (0.0006) [2023-03-07 04:34:20,169][118044] Updated weights for policy 0, policy_version 55100 (0.0006) [2023-03-07 04:34:20,954][118044] Updated weights for policy 0, policy_version 55110 (0.0006) [2023-03-07 04:34:21,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13173.1). Total num frames: 56433664. Throughput: 0: 13177.5. Samples: 56426570. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:34:21,086][117718] Avg episode reward: [(0, '3044.434')] [2023-03-07 04:34:21,721][118044] Updated weights for policy 0, policy_version 55120 (0.0006) [2023-03-07 04:34:22,502][118044] Updated weights for policy 0, policy_version 55130 (0.0006) [2023-03-07 04:34:23,281][118044] Updated weights for policy 0, policy_version 55140 (0.0007) [2023-03-07 04:34:24,051][118044] Updated weights for policy 0, policy_version 55150 (0.0006) [2023-03-07 04:34:24,831][118044] Updated weights for policy 0, policy_version 55160 (0.0006) [2023-03-07 04:34:25,619][118044] Updated weights for policy 0, policy_version 55170 (0.0005) [2023-03-07 04:34:26,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13173.2). Total num frames: 56499200. Throughput: 0: 13175.4. Samples: 56466071. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:34:26,086][117718] Avg episode reward: [(0, '3120.487')] [2023-03-07 04:34:26,415][118044] Updated weights for policy 0, policy_version 55180 (0.0007) [2023-03-07 04:34:27,175][118044] Updated weights for policy 0, policy_version 55190 (0.0006) [2023-03-07 04:34:27,958][118044] Updated weights for policy 0, policy_version 55200 (0.0006) [2023-03-07 04:34:28,755][118044] Updated weights for policy 0, policy_version 55210 (0.0006) [2023-03-07 04:34:29,527][118044] Updated weights for policy 0, policy_version 55220 (0.0006) [2023-03-07 04:34:30,280][118044] Updated weights for policy 0, policy_version 55230 (0.0007) [2023-03-07 04:34:31,073][118044] Updated weights for policy 0, policy_version 55240 (0.0006) [2023-03-07 04:34:31,085][117718] Fps is (10 sec: 13209.8, 60 sec: 13175.5, 300 sec: 13176.6). Total num frames: 56565760. Throughput: 0: 13172.2. Samples: 56544801. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:34:31,086][117718] Avg episode reward: [(0, '3070.095')] [2023-03-07 04:34:31,853][118044] Updated weights for policy 0, policy_version 55250 (0.0006) [2023-03-07 04:34:32,624][118044] Updated weights for policy 0, policy_version 55260 (0.0007) [2023-03-07 04:34:33,410][118044] Updated weights for policy 0, policy_version 55270 (0.0006) [2023-03-07 04:34:34,186][118044] Updated weights for policy 0, policy_version 55280 (0.0006) [2023-03-07 04:34:34,959][118044] Updated weights for policy 0, policy_version 55290 (0.0006) [2023-03-07 04:34:35,747][118044] Updated weights for policy 0, policy_version 55300 (0.0006) [2023-03-07 04:34:36,086][117718] Fps is (10 sec: 13209.4, 60 sec: 13175.4, 300 sec: 13176.6). Total num frames: 56631296. Throughput: 0: 13166.2. Samples: 56623763. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:34:36,086][117718] Avg episode reward: [(0, '3140.314')] [2023-03-07 04:34:36,090][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000055304_56631296.pth... [2023-03-07 04:34:36,120][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000052216_53469184.pth [2023-03-07 04:34:36,526][118044] Updated weights for policy 0, policy_version 55310 (0.0006) [2023-03-07 04:34:37,296][118044] Updated weights for policy 0, policy_version 55320 (0.0006) [2023-03-07 04:34:38,096][118044] Updated weights for policy 0, policy_version 55330 (0.0006) [2023-03-07 04:34:38,865][118044] Updated weights for policy 0, policy_version 55340 (0.0006) [2023-03-07 04:34:39,635][118044] Updated weights for policy 0, policy_version 55350 (0.0007) [2023-03-07 04:34:40,410][118044] Updated weights for policy 0, policy_version 55360 (0.0006) [2023-03-07 04:34:41,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13175.5, 300 sec: 13173.1). Total num frames: 56696832. Throughput: 0: 13158.6. 
Samples: 56663166. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:34:41,086][117718] Avg episode reward: [(0, '3164.754')] [2023-03-07 04:34:41,197][118044] Updated weights for policy 0, policy_version 55370 (0.0006) [2023-03-07 04:34:41,963][118044] Updated weights for policy 0, policy_version 55380 (0.0005) [2023-03-07 04:34:42,730][118044] Updated weights for policy 0, policy_version 55390 (0.0006) [2023-03-07 04:34:43,535][118044] Updated weights for policy 0, policy_version 55400 (0.0006) [2023-03-07 04:34:44,301][118044] Updated weights for policy 0, policy_version 55410 (0.0006) [2023-03-07 04:34:45,084][118044] Updated weights for policy 0, policy_version 55420 (0.0006) [2023-03-07 04:34:45,860][118044] Updated weights for policy 0, policy_version 55430 (0.0006) [2023-03-07 04:34:46,086][117718] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13173.1). Total num frames: 56762368. Throughput: 0: 13154.1. Samples: 56742164. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:34:46,086][117718] Avg episode reward: [(0, '3027.205')] [2023-03-07 04:34:46,651][118044] Updated weights for policy 0, policy_version 55440 (0.0007) [2023-03-07 04:34:47,429][118044] Updated weights for policy 0, policy_version 55450 (0.0006) [2023-03-07 04:34:48,215][118044] Updated weights for policy 0, policy_version 55460 (0.0006) [2023-03-07 04:34:48,967][118044] Updated weights for policy 0, policy_version 55470 (0.0006) [2023-03-07 04:34:49,763][118044] Updated weights for policy 0, policy_version 55480 (0.0006) [2023-03-07 04:34:50,534][118044] Updated weights for policy 0, policy_version 55490 (0.0007) [2023-03-07 04:34:51,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13175.4, 300 sec: 13176.6). Total num frames: 56828928. Throughput: 0: 13155.9. Samples: 56821134. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:34:51,086][117718] Avg episode reward: [(0, '3076.887')] [2023-03-07 04:34:51,310][118044] Updated weights for policy 0, policy_version 55500 (0.0006) [2023-03-07 04:34:52,089][118044] Updated weights for policy 0, policy_version 55510 (0.0006) [2023-03-07 04:34:52,865][118044] Updated weights for policy 0, policy_version 55520 (0.0007) [2023-03-07 04:34:53,644][118044] Updated weights for policy 0, policy_version 55530 (0.0007) [2023-03-07 04:34:54,425][118044] Updated weights for policy 0, policy_version 55540 (0.0007) [2023-03-07 04:34:55,214][118044] Updated weights for policy 0, policy_version 55550 (0.0007) [2023-03-07 04:34:55,996][118044] Updated weights for policy 0, policy_version 55560 (0.0007) [2023-03-07 04:34:56,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13173.1). Total num frames: 56894464. Throughput: 0: 13148.7. Samples: 56860668. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:34:56,086][117718] Avg episode reward: [(0, '3104.875')] [2023-03-07 04:34:56,769][118044] Updated weights for policy 0, policy_version 55570 (0.0006) [2023-03-07 04:34:57,541][118044] Updated weights for policy 0, policy_version 55580 (0.0006) [2023-03-07 04:34:58,325][118044] Updated weights for policy 0, policy_version 55590 (0.0006) [2023-03-07 04:34:59,098][118044] Updated weights for policy 0, policy_version 55600 (0.0007) [2023-03-07 04:34:59,872][118044] Updated weights for policy 0, policy_version 55610 (0.0006) [2023-03-07 04:35:00,639][118044] Updated weights for policy 0, policy_version 55620 (0.0006) [2023-03-07 04:35:01,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13173.1). Total num frames: 56960000. Throughput: 0: 13146.8. Samples: 56939571. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:35:01,086][117718] Avg episode reward: [(0, '3124.918')] [2023-03-07 04:35:01,418][118044] Updated weights for policy 0, policy_version 55630 (0.0006) [2023-03-07 04:35:02,201][118044] Updated weights for policy 0, policy_version 55640 (0.0006) [2023-03-07 04:35:02,994][118044] Updated weights for policy 0, policy_version 55650 (0.0006) [2023-03-07 04:35:03,777][118044] Updated weights for policy 0, policy_version 55660 (0.0006) [2023-03-07 04:35:04,557][118044] Updated weights for policy 0, policy_version 55670 (0.0006) [2023-03-07 04:35:05,337][118044] Updated weights for policy 0, policy_version 55680 (0.0007) [2023-03-07 04:35:06,085][117718] Fps is (10 sec: 13107.4, 60 sec: 13141.3, 300 sec: 13173.2). Total num frames: 57025536. Throughput: 0: 13152.8. Samples: 57018443. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:35:06,086][117718] Avg episode reward: [(0, '3040.123')] [2023-03-07 04:35:06,107][118044] Updated weights for policy 0, policy_version 55690 (0.0005) [2023-03-07 04:35:06,887][118044] Updated weights for policy 0, policy_version 55700 (0.0007) [2023-03-07 04:35:07,664][118044] Updated weights for policy 0, policy_version 55710 (0.0006) [2023-03-07 04:35:08,440][118044] Updated weights for policy 0, policy_version 55720 (0.0006) [2023-03-07 04:35:09,224][118044] Updated weights for policy 0, policy_version 55730 (0.0006) [2023-03-07 04:35:09,982][118044] Updated weights for policy 0, policy_version 55740 (0.0006) [2023-03-07 04:35:10,769][118044] Updated weights for policy 0, policy_version 55750 (0.0007) [2023-03-07 04:35:11,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13173.2). Total num frames: 57092096. Throughput: 0: 13153.5. Samples: 57057976. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:35:11,086][117718] Avg episode reward: [(0, '3111.543')] [2023-03-07 04:35:11,522][118044] Updated weights for policy 0, policy_version 55760 (0.0006) [2023-03-07 04:35:12,307][118044] Updated weights for policy 0, policy_version 55770 (0.0005) [2023-03-07 04:35:13,062][118044] Updated weights for policy 0, policy_version 55780 (0.0006) [2023-03-07 04:35:13,854][118044] Updated weights for policy 0, policy_version 55790 (0.0006) [2023-03-07 04:35:14,631][118044] Updated weights for policy 0, policy_version 55800 (0.0006) [2023-03-07 04:35:15,433][118044] Updated weights for policy 0, policy_version 55810 (0.0005) [2023-03-07 04:35:16,085][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13173.1). Total num frames: 57157632. Throughput: 0: 13166.9. Samples: 57137314. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:35:16,086][117718] Avg episode reward: [(0, '3175.785')] [2023-03-07 04:35:16,185][118044] Updated weights for policy 0, policy_version 55820 (0.0006) [2023-03-07 04:35:16,982][118044] Updated weights for policy 0, policy_version 55830 (0.0006) [2023-03-07 04:35:17,770][118044] Updated weights for policy 0, policy_version 55840 (0.0006) [2023-03-07 04:35:18,545][118044] Updated weights for policy 0, policy_version 55850 (0.0006) [2023-03-07 04:35:19,321][118044] Updated weights for policy 0, policy_version 55860 (0.0006) [2023-03-07 04:35:20,104][118044] Updated weights for policy 0, policy_version 55870 (0.0005) [2023-03-07 04:35:20,861][118044] Updated weights for policy 0, policy_version 55880 (0.0007) [2023-03-07 04:35:21,085][117718] Fps is (10 sec: 13209.8, 60 sec: 13175.5, 300 sec: 13176.6). Total num frames: 57224192. Throughput: 0: 13166.4. Samples: 57216248. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:35:21,086][117718] Avg episode reward: [(0, '3125.688')] [2023-03-07 04:35:21,645][118044] Updated weights for policy 0, policy_version 55890 (0.0006) [2023-03-07 04:35:22,431][118044] Updated weights for policy 0, policy_version 55900 (0.0007) [2023-03-07 04:35:23,211][118044] Updated weights for policy 0, policy_version 55910 (0.0006) [2023-03-07 04:35:24,006][118044] Updated weights for policy 0, policy_version 55920 (0.0006) [2023-03-07 04:35:24,777][118044] Updated weights for policy 0, policy_version 55930 (0.0005) [2023-03-07 04:35:25,547][118044] Updated weights for policy 0, policy_version 55940 (0.0006) [2023-03-07 04:35:26,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13173.1). Total num frames: 57288704. Throughput: 0: 13163.8. Samples: 57255539. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:35:26,086][117718] Avg episode reward: [(0, '3191.309')] [2023-03-07 04:35:26,323][118044] Updated weights for policy 0, policy_version 55950 (0.0006) [2023-03-07 04:35:27,094][118044] Updated weights for policy 0, policy_version 55960 (0.0006) [2023-03-07 04:35:27,874][118044] Updated weights for policy 0, policy_version 55970 (0.0006) [2023-03-07 04:35:28,655][118044] Updated weights for policy 0, policy_version 55980 (0.0005) [2023-03-07 04:35:29,437][118044] Updated weights for policy 0, policy_version 55990 (0.0007) [2023-03-07 04:35:30,215][118044] Updated weights for policy 0, policy_version 56000 (0.0006) [2023-03-07 04:35:30,999][118044] Updated weights for policy 0, policy_version 56010 (0.0006) [2023-03-07 04:35:31,086][117718] Fps is (10 sec: 13106.9, 60 sec: 13158.4, 300 sec: 13176.6). Total num frames: 57355264. Throughput: 0: 13160.3. Samples: 57334377. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:35:31,086][117718] Avg episode reward: [(0, '3149.451')] [2023-03-07 04:35:31,774][118044] Updated weights for policy 0, policy_version 56020 (0.0006) [2023-03-07 04:35:32,571][118044] Updated weights for policy 0, policy_version 56030 (0.0007) [2023-03-07 04:35:33,326][118044] Updated weights for policy 0, policy_version 56040 (0.0006) [2023-03-07 04:35:34,078][118044] Updated weights for policy 0, policy_version 56050 (0.0006) [2023-03-07 04:35:34,863][118044] Updated weights for policy 0, policy_version 56060 (0.0006) [2023-03-07 04:35:35,629][118044] Updated weights for policy 0, policy_version 56070 (0.0006) [2023-03-07 04:35:36,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13173.2). Total num frames: 57420800. Throughput: 0: 13167.7. Samples: 57413679. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:35:36,086][117718] Avg episode reward: [(0, '3111.126')] [2023-03-07 04:35:36,414][118044] Updated weights for policy 0, policy_version 56080 (0.0006) [2023-03-07 04:35:37,199][118044] Updated weights for policy 0, policy_version 56090 (0.0006) [2023-03-07 04:35:37,982][118044] Updated weights for policy 0, policy_version 56100 (0.0006) [2023-03-07 04:35:38,794][118044] Updated weights for policy 0, policy_version 56110 (0.0006) [2023-03-07 04:35:39,557][118044] Updated weights for policy 0, policy_version 56120 (0.0008) [2023-03-07 04:35:40,333][118044] Updated weights for policy 0, policy_version 56130 (0.0006) [2023-03-07 04:35:41,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13173.2). Total num frames: 57486336. Throughput: 0: 13158.6. Samples: 57452805. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:35:41,086][117718] Avg episode reward: [(0, '3077.761')] [2023-03-07 04:35:41,113][118044] Updated weights for policy 0, policy_version 56140 (0.0007) [2023-03-07 04:35:41,888][118044] Updated weights for policy 0, policy_version 56150 (0.0007) [2023-03-07 04:35:42,669][118044] Updated weights for policy 0, policy_version 56160 (0.0007) [2023-03-07 04:35:43,445][118044] Updated weights for policy 0, policy_version 56170 (0.0007) [2023-03-07 04:35:44,220][118044] Updated weights for policy 0, policy_version 56180 (0.0006) [2023-03-07 04:35:44,992][118044] Updated weights for policy 0, policy_version 56190 (0.0007) [2023-03-07 04:35:45,781][118044] Updated weights for policy 0, policy_version 56200 (0.0006) [2023-03-07 04:35:46,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13169.7). Total num frames: 57551872. Throughput: 0: 13158.1. Samples: 57531685. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:35:46,086][117718] Avg episode reward: [(0, '3126.219')] [2023-03-07 04:35:46,569][118044] Updated weights for policy 0, policy_version 56210 (0.0007) [2023-03-07 04:35:47,360][118044] Updated weights for policy 0, policy_version 56220 (0.0007) [2023-03-07 04:35:48,132][118044] Updated weights for policy 0, policy_version 56230 (0.0006) [2023-03-07 04:35:48,909][118044] Updated weights for policy 0, policy_version 56240 (0.0006) [2023-03-07 04:35:49,701][118044] Updated weights for policy 0, policy_version 56250 (0.0006) [2023-03-07 04:35:50,462][118044] Updated weights for policy 0, policy_version 56260 (0.0005) [2023-03-07 04:35:51,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13173.2). Total num frames: 57618432. Throughput: 0: 13159.6. Samples: 57610626. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:35:51,086][117718] Avg episode reward: [(0, '3134.537')] [2023-03-07 04:35:51,242][118044] Updated weights for policy 0, policy_version 56270 (0.0006) [2023-03-07 04:35:52,004][118044] Updated weights for policy 0, policy_version 56280 (0.0006) [2023-03-07 04:35:52,767][118044] Updated weights for policy 0, policy_version 56290 (0.0006) [2023-03-07 04:35:53,569][118044] Updated weights for policy 0, policy_version 56300 (0.0006) [2023-03-07 04:35:54,354][118044] Updated weights for policy 0, policy_version 56310 (0.0007) [2023-03-07 04:35:55,130][118044] Updated weights for policy 0, policy_version 56320 (0.0006) [2023-03-07 04:35:55,921][118044] Updated weights for policy 0, policy_version 56330 (0.0007) [2023-03-07 04:35:56,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13173.2). Total num frames: 57683968. Throughput: 0: 13158.7. Samples: 57650116. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:35:56,086][117718] Avg episode reward: [(0, '3137.339')] [2023-03-07 04:35:56,693][118044] Updated weights for policy 0, policy_version 56340 (0.0006) [2023-03-07 04:35:57,470][118044] Updated weights for policy 0, policy_version 56350 (0.0006) [2023-03-07 04:35:58,255][118044] Updated weights for policy 0, policy_version 56360 (0.0007) [2023-03-07 04:35:59,026][118044] Updated weights for policy 0, policy_version 56370 (0.0006) [2023-03-07 04:35:59,819][118044] Updated weights for policy 0, policy_version 56380 (0.0006) [2023-03-07 04:36:00,602][118044] Updated weights for policy 0, policy_version 56390 (0.0007) [2023-03-07 04:36:01,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13169.7). Total num frames: 57749504. Throughput: 0: 13143.0. Samples: 57728750. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:36:01,086][117718] Avg episode reward: [(0, '3087.610')] [2023-03-07 04:36:01,405][118044] Updated weights for policy 0, policy_version 56400 (0.0006) [2023-03-07 04:36:02,163][118044] Updated weights for policy 0, policy_version 56410 (0.0006) [2023-03-07 04:36:02,961][118044] Updated weights for policy 0, policy_version 56420 (0.0006) [2023-03-07 04:36:03,741][118044] Updated weights for policy 0, policy_version 56430 (0.0006) [2023-03-07 04:36:04,524][118044] Updated weights for policy 0, policy_version 56440 (0.0007) [2023-03-07 04:36:05,302][118044] Updated weights for policy 0, policy_version 56450 (0.0006) [2023-03-07 04:36:06,085][117718] Fps is (10 sec: 13004.8, 60 sec: 13141.3, 300 sec: 13162.7). Total num frames: 57814016. Throughput: 0: 13131.5. Samples: 57807167. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:36:06,086][117718] Avg episode reward: [(0, '3127.620')] [2023-03-07 04:36:06,087][118044] Updated weights for policy 0, policy_version 56460 (0.0006) [2023-03-07 04:36:06,869][118044] Updated weights for policy 0, policy_version 56470 (0.0006) [2023-03-07 04:36:07,641][118044] Updated weights for policy 0, policy_version 56480 (0.0006) [2023-03-07 04:36:08,417][118044] Updated weights for policy 0, policy_version 56490 (0.0006) [2023-03-07 04:36:09,197][118044] Updated weights for policy 0, policy_version 56500 (0.0006) [2023-03-07 04:36:09,961][118044] Updated weights for policy 0, policy_version 56510 (0.0007) [2023-03-07 04:36:10,735][118044] Updated weights for policy 0, policy_version 56520 (0.0005) [2023-03-07 04:36:11,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13166.2). Total num frames: 57880576. Throughput: 0: 13137.5. Samples: 57846728. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:36:11,086][117718] Avg episode reward: [(0, '3023.246')] [2023-03-07 04:36:11,508][118044] Updated weights for policy 0, policy_version 56530 (0.0006) [2023-03-07 04:36:12,294][118044] Updated weights for policy 0, policy_version 56540 (0.0007) [2023-03-07 04:36:13,061][118044] Updated weights for policy 0, policy_version 56550 (0.0006) [2023-03-07 04:36:13,834][118044] Updated weights for policy 0, policy_version 56560 (0.0006) [2023-03-07 04:36:14,613][118044] Updated weights for policy 0, policy_version 56570 (0.0006) [2023-03-07 04:36:15,389][118044] Updated weights for policy 0, policy_version 56580 (0.0006) [2023-03-07 04:36:16,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13141.3, 300 sec: 13166.2). Total num frames: 57946112. Throughput: 0: 13150.5. Samples: 57926150. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:36:16,086][117718] Avg episode reward: [(0, '2993.185')] [2023-03-07 04:36:16,153][118044] Updated weights for policy 0, policy_version 56590 (0.0006) [2023-03-07 04:36:16,925][118044] Updated weights for policy 0, policy_version 56600 (0.0006) [2023-03-07 04:36:17,698][118044] Updated weights for policy 0, policy_version 56610 (0.0006) [2023-03-07 04:36:18,474][118044] Updated weights for policy 0, policy_version 56620 (0.0006) [2023-03-07 04:36:19,261][118044] Updated weights for policy 0, policy_version 56630 (0.0006) [2023-03-07 04:36:20,050][118044] Updated weights for policy 0, policy_version 56640 (0.0006) [2023-03-07 04:36:20,821][118044] Updated weights for policy 0, policy_version 56650 (0.0007) [2023-03-07 04:36:21,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13166.2). Total num frames: 58012672. Throughput: 0: 13145.0. Samples: 58005205. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:36:21,086][117718] Avg episode reward: [(0, '2972.417')] [2023-03-07 04:36:21,595][118044] Updated weights for policy 0, policy_version 56660 (0.0006) [2023-03-07 04:36:22,361][118044] Updated weights for policy 0, policy_version 56670 (0.0006) [2023-03-07 04:36:23,146][118044] Updated weights for policy 0, policy_version 56680 (0.0008) [2023-03-07 04:36:23,933][118044] Updated weights for policy 0, policy_version 56690 (0.0006) [2023-03-07 04:36:24,727][118044] Updated weights for policy 0, policy_version 56700 (0.0006) [2023-03-07 04:36:25,499][118044] Updated weights for policy 0, policy_version 56710 (0.0005) [2023-03-07 04:36:26,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13166.2). Total num frames: 58078208. Throughput: 0: 13157.4. Samples: 58044889. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:36:26,086][117718] Avg episode reward: [(0, '2971.371')] [2023-03-07 04:36:26,275][118044] Updated weights for policy 0, policy_version 56720 (0.0007) [2023-03-07 04:36:27,057][118044] Updated weights for policy 0, policy_version 56730 (0.0005) [2023-03-07 04:36:27,823][118044] Updated weights for policy 0, policy_version 56740 (0.0006) [2023-03-07 04:36:28,580][118044] Updated weights for policy 0, policy_version 56750 (0.0006) [2023-03-07 04:36:29,346][118044] Updated weights for policy 0, policy_version 56760 (0.0006) [2023-03-07 04:36:30,121][118044] Updated weights for policy 0, policy_version 56770 (0.0007) [2023-03-07 04:36:30,905][118044] Updated weights for policy 0, policy_version 56780 (0.0006) [2023-03-07 04:36:31,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13169.7). Total num frames: 58144768. Throughput: 0: 13164.3. Samples: 58124080. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:36:31,086][117718] Avg episode reward: [(0, '3076.305')] [2023-03-07 04:36:31,674][118044] Updated weights for policy 0, policy_version 56790 (0.0006) [2023-03-07 04:36:32,473][118044] Updated weights for policy 0, policy_version 56800 (0.0006) [2023-03-07 04:36:33,241][118044] Updated weights for policy 0, policy_version 56810 (0.0007) [2023-03-07 04:36:34,031][118044] Updated weights for policy 0, policy_version 56820 (0.0007) [2023-03-07 04:36:34,813][118044] Updated weights for policy 0, policy_version 56830 (0.0006) [2023-03-07 04:36:35,586][118044] Updated weights for policy 0, policy_version 56840 (0.0006) [2023-03-07 04:36:36,086][117718] Fps is (10 sec: 13209.3, 60 sec: 13158.4, 300 sec: 13166.2). Total num frames: 58210304. Throughput: 0: 13162.1. Samples: 58202923. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:36:36,087][117718] Avg episode reward: [(0, '3071.779')] [2023-03-07 04:36:36,091][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000056846_58210304.pth... 
[2023-03-07 04:36:36,121][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000053761_55051264.pth [2023-03-07 04:36:36,365][118044] Updated weights for policy 0, policy_version 56850 (0.0006) [2023-03-07 04:36:37,137][118044] Updated weights for policy 0, policy_version 56860 (0.0006) [2023-03-07 04:36:37,919][118044] Updated weights for policy 0, policy_version 56870 (0.0006) [2023-03-07 04:36:38,718][118044] Updated weights for policy 0, policy_version 56880 (0.0006) [2023-03-07 04:36:39,489][118044] Updated weights for policy 0, policy_version 56890 (0.0006) [2023-03-07 04:36:40,260][118044] Updated weights for policy 0, policy_version 56900 (0.0006) [2023-03-07 04:36:41,039][118044] Updated weights for policy 0, policy_version 56910 (0.0007) [2023-03-07 04:36:41,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13158.4, 300 sec: 13166.2). Total num frames: 58275840. Throughput: 0: 13159.9. Samples: 58242313. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:36:41,086][117718] Avg episode reward: [(0, '2961.748')] [2023-03-07 04:36:41,830][118044] Updated weights for policy 0, policy_version 56920 (0.0007) [2023-03-07 04:36:42,603][118044] Updated weights for policy 0, policy_version 56930 (0.0006) [2023-03-07 04:36:43,382][118044] Updated weights for policy 0, policy_version 56940 (0.0006) [2023-03-07 04:36:44,152][118044] Updated weights for policy 0, policy_version 56950 (0.0006) [2023-03-07 04:36:44,933][118044] Updated weights for policy 0, policy_version 56960 (0.0006) [2023-03-07 04:36:45,716][118044] Updated weights for policy 0, policy_version 56970 (0.0006) [2023-03-07 04:36:46,085][117718] Fps is (10 sec: 13107.5, 60 sec: 13158.4, 300 sec: 13166.2). Total num frames: 58341376. Throughput: 0: 13166.2. Samples: 58321231. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:36:46,097][117718] Avg episode reward: [(0, '2976.909')] [2023-03-07 04:36:46,505][118044] Updated weights for policy 0, policy_version 56980 (0.0006) [2023-03-07 04:36:47,271][118044] Updated weights for policy 0, policy_version 56990 (0.0006) [2023-03-07 04:36:48,038][118044] Updated weights for policy 0, policy_version 57000 (0.0006) [2023-03-07 04:36:48,826][118044] Updated weights for policy 0, policy_version 57010 (0.0006) [2023-03-07 04:36:49,609][118044] Updated weights for policy 0, policy_version 57020 (0.0006) [2023-03-07 04:36:50,394][118044] Updated weights for policy 0, policy_version 57030 (0.0005) [2023-03-07 04:36:51,085][117718] Fps is (10 sec: 13209.9, 60 sec: 13158.4, 300 sec: 13166.2). Total num frames: 58407936. Throughput: 0: 13173.0. Samples: 58399952. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:36:51,096][117718] Avg episode reward: [(0, '3121.006')] [2023-03-07 04:36:51,180][118044] Updated weights for policy 0, policy_version 57040 (0.0006) [2023-03-07 04:36:51,968][118044] Updated weights for policy 0, policy_version 57050 (0.0007) [2023-03-07 04:36:52,740][118044] Updated weights for policy 0, policy_version 57060 (0.0006) [2023-03-07 04:36:53,518][118044] Updated weights for policy 0, policy_version 57070 (0.0006) [2023-03-07 04:36:54,291][118044] Updated weights for policy 0, policy_version 57080 (0.0006) [2023-03-07 04:36:55,065][118044] Updated weights for policy 0, policy_version 57090 (0.0006) [2023-03-07 04:36:55,843][118044] Updated weights for policy 0, policy_version 57100 (0.0005) [2023-03-07 04:36:56,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13166.2). Total num frames: 58473472. Throughput: 0: 13167.7. Samples: 58439276. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:36:56,096][117718] Avg episode reward: [(0, '2941.869')] [2023-03-07 04:36:56,633][118044] Updated weights for policy 0, policy_version 57110 (0.0006) [2023-03-07 04:36:57,386][118044] Updated weights for policy 0, policy_version 57120 (0.0006) [2023-03-07 04:36:58,170][118044] Updated weights for policy 0, policy_version 57130 (0.0006) [2023-03-07 04:36:58,952][118044] Updated weights for policy 0, policy_version 57140 (0.0006) [2023-03-07 04:36:59,723][118044] Updated weights for policy 0, policy_version 57150 (0.0006) [2023-03-07 04:37:00,501][118044] Updated weights for policy 0, policy_version 57160 (0.0006) [2023-03-07 04:37:01,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13162.7). Total num frames: 58539008. Throughput: 0: 13163.6. Samples: 58518511. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:37:01,097][117718] Avg episode reward: [(0, '3048.595')] [2023-03-07 04:37:01,290][118044] Updated weights for policy 0, policy_version 57170 (0.0006) [2023-03-07 04:37:02,093][118044] Updated weights for policy 0, policy_version 57180 (0.0006) [2023-03-07 04:37:02,869][118044] Updated weights for policy 0, policy_version 57190 (0.0006) [2023-03-07 04:37:03,617][118044] Updated weights for policy 0, policy_version 57200 (0.0006) [2023-03-07 04:37:04,406][118044] Updated weights for policy 0, policy_version 57210 (0.0007) [2023-03-07 04:37:05,186][118044] Updated weights for policy 0, policy_version 57220 (0.0006) [2023-03-07 04:37:05,961][118044] Updated weights for policy 0, policy_version 57230 (0.0006) [2023-03-07 04:37:06,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13175.4, 300 sec: 13162.7). Total num frames: 58604544. Throughput: 0: 13158.0. Samples: 58597314. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:37:06,097][117718] Avg episode reward: [(0, '3017.310')] [2023-03-07 04:37:06,745][118044] Updated weights for policy 0, policy_version 57240 (0.0006) [2023-03-07 04:37:07,522][118044] Updated weights for policy 0, policy_version 57250 (0.0007) [2023-03-07 04:37:08,307][118044] Updated weights for policy 0, policy_version 57260 (0.0006) [2023-03-07 04:37:09,078][118044] Updated weights for policy 0, policy_version 57270 (0.0007) [2023-03-07 04:37:09,858][118044] Updated weights for policy 0, policy_version 57280 (0.0007) [2023-03-07 04:37:10,633][118044] Updated weights for policy 0, policy_version 57290 (0.0006) [2023-03-07 04:37:11,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13159.3). Total num frames: 58670080. Throughput: 0: 13151.7. Samples: 58636717. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:37:11,086][117718] Avg episode reward: [(0, '3132.748')] [2023-03-07 04:37:11,401][118044] Updated weights for policy 0, policy_version 57300 (0.0006) [2023-03-07 04:37:12,201][118044] Updated weights for policy 0, policy_version 57310 (0.0006) [2023-03-07 04:37:12,965][118044] Updated weights for policy 0, policy_version 57320 (0.0006) [2023-03-07 04:37:13,745][118044] Updated weights for policy 0, policy_version 57330 (0.0006) [2023-03-07 04:37:14,522][118044] Updated weights for policy 0, policy_version 57340 (0.0006) [2023-03-07 04:37:15,314][118044] Updated weights for policy 0, policy_version 57350 (0.0006) [2023-03-07 04:37:16,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13159.3). Total num frames: 58735616. Throughput: 0: 13143.4. Samples: 58715535. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:37:16,097][117718] Avg episode reward: [(0, '3204.116')] [2023-03-07 04:37:16,099][118044] Updated weights for policy 0, policy_version 57360 (0.0006) [2023-03-07 04:37:16,878][118044] Updated weights for policy 0, policy_version 57370 (0.0007) [2023-03-07 04:37:17,631][118044] Updated weights for policy 0, policy_version 57380 (0.0006) [2023-03-07 04:37:18,419][118044] Updated weights for policy 0, policy_version 57390 (0.0005) [2023-03-07 04:37:19,201][118044] Updated weights for policy 0, policy_version 57400 (0.0006) [2023-03-07 04:37:19,989][118044] Updated weights for policy 0, policy_version 57410 (0.0006) [2023-03-07 04:37:20,757][118044] Updated weights for policy 0, policy_version 57420 (0.0006) [2023-03-07 04:37:21,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13162.7). Total num frames: 58802176. Throughput: 0: 13148.2. Samples: 58794590. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:37:21,086][117718] Avg episode reward: [(0, '3148.250')] [2023-03-07 04:37:21,524][118044] Updated weights for policy 0, policy_version 57430 (0.0007) [2023-03-07 04:37:22,309][118044] Updated weights for policy 0, policy_version 57440 (0.0006) [2023-03-07 04:37:23,082][118044] Updated weights for policy 0, policy_version 57450 (0.0006) [2023-03-07 04:37:23,853][118044] Updated weights for policy 0, policy_version 57460 (0.0007) [2023-03-07 04:37:24,617][118044] Updated weights for policy 0, policy_version 57470 (0.0006) [2023-03-07 04:37:25,408][118044] Updated weights for policy 0, policy_version 57480 (0.0006) [2023-03-07 04:37:26,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13162.7). Total num frames: 58867712. Throughput: 0: 13154.7. Samples: 58834274. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:37:26,086][117718] Avg episode reward: [(0, '2998.555')] [2023-03-07 04:37:26,177][118044] Updated weights for policy 0, policy_version 57490 (0.0006) [2023-03-07 04:37:26,966][118044] Updated weights for policy 0, policy_version 57500 (0.0006) [2023-03-07 04:37:27,730][118044] Updated weights for policy 0, policy_version 57510 (0.0006) [2023-03-07 04:37:28,497][118044] Updated weights for policy 0, policy_version 57520 (0.0007) [2023-03-07 04:37:29,290][118044] Updated weights for policy 0, policy_version 57530 (0.0006) [2023-03-07 04:37:30,060][118044] Updated weights for policy 0, policy_version 57540 (0.0006) [2023-03-07 04:37:30,838][118044] Updated weights for policy 0, policy_version 57550 (0.0006) [2023-03-07 04:37:31,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13162.7). Total num frames: 58934272. Throughput: 0: 13156.5. Samples: 58913273. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:37:31,086][117718] Avg episode reward: [(0, '3007.925')] [2023-03-07 04:37:31,613][118044] Updated weights for policy 0, policy_version 57560 (0.0006) [2023-03-07 04:37:32,397][118044] Updated weights for policy 0, policy_version 57570 (0.0006) [2023-03-07 04:37:33,198][118044] Updated weights for policy 0, policy_version 57580 (0.0006) [2023-03-07 04:37:33,971][118044] Updated weights for policy 0, policy_version 57590 (0.0007) [2023-03-07 04:37:34,759][118044] Updated weights for policy 0, policy_version 57600 (0.0006) [2023-03-07 04:37:35,543][118044] Updated weights for policy 0, policy_version 57610 (0.0007) [2023-03-07 04:37:36,085][117718] Fps is (10 sec: 13209.8, 60 sec: 13158.5, 300 sec: 13162.7). Total num frames: 58999808. Throughput: 0: 13157.6. Samples: 58992043. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:37:36,086][117718] Avg episode reward: [(0, '2961.760')] [2023-03-07 04:37:36,325][118044] Updated weights for policy 0, policy_version 57620 (0.0007) [2023-03-07 04:37:37,099][118044] Updated weights for policy 0, policy_version 57630 (0.0007) [2023-03-07 04:37:37,875][118044] Updated weights for policy 0, policy_version 57640 (0.0006) [2023-03-07 04:37:38,657][118044] Updated weights for policy 0, policy_version 57650 (0.0006) [2023-03-07 04:37:39,433][118044] Updated weights for policy 0, policy_version 57660 (0.0006) [2023-03-07 04:37:40,210][118044] Updated weights for policy 0, policy_version 57670 (0.0006) [2023-03-07 04:37:40,993][118044] Updated weights for policy 0, policy_version 57680 (0.0007) [2023-03-07 04:37:41,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13159.3). Total num frames: 59065344. Throughput: 0: 13157.3. Samples: 59031355. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:37:41,086][117718] Avg episode reward: [(0, '3015.840')] [2023-03-07 04:37:41,766][118044] Updated weights for policy 0, policy_version 57690 (0.0006) [2023-03-07 04:37:42,552][118044] Updated weights for policy 0, policy_version 57700 (0.0006) [2023-03-07 04:37:43,321][118044] Updated weights for policy 0, policy_version 57710 (0.0006) [2023-03-07 04:37:44,102][118044] Updated weights for policy 0, policy_version 57720 (0.0006) [2023-03-07 04:37:44,871][118044] Updated weights for policy 0, policy_version 57730 (0.0006) [2023-03-07 04:37:45,644][118044] Updated weights for policy 0, policy_version 57740 (0.0006) [2023-03-07 04:37:46,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13158.4, 300 sec: 13159.3). Total num frames: 59130880. Throughput: 0: 13154.7. Samples: 59110476. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:37:46,086][117718] Avg episode reward: [(0, '3033.348')] [2023-03-07 04:37:46,416][118044] Updated weights for policy 0, policy_version 57750 (0.0007) [2023-03-07 04:37:47,213][118044] Updated weights for policy 0, policy_version 57760 (0.0005) [2023-03-07 04:37:47,979][118044] Updated weights for policy 0, policy_version 57770 (0.0007) [2023-03-07 04:37:48,767][118044] Updated weights for policy 0, policy_version 57780 (0.0006) [2023-03-07 04:37:49,550][118044] Updated weights for policy 0, policy_version 57790 (0.0007) [2023-03-07 04:37:50,324][118044] Updated weights for policy 0, policy_version 57800 (0.0007) [2023-03-07 04:37:51,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13159.3). Total num frames: 59196416. Throughput: 0: 13158.2. Samples: 59189431. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:37:51,086][117718] Avg episode reward: [(0, '3027.080')] [2023-03-07 04:37:51,111][118044] Updated weights for policy 0, policy_version 57810 (0.0007) [2023-03-07 04:37:51,888][118044] Updated weights for policy 0, policy_version 57820 (0.0006) [2023-03-07 04:37:52,680][118044] Updated weights for policy 0, policy_version 57830 (0.0005) [2023-03-07 04:37:53,458][118044] Updated weights for policy 0, policy_version 57840 (0.0007) [2023-03-07 04:37:54,235][118044] Updated weights for policy 0, policy_version 57850 (0.0007) [2023-03-07 04:37:55,016][118044] Updated weights for policy 0, policy_version 57860 (0.0007) [2023-03-07 04:37:55,800][118044] Updated weights for policy 0, policy_version 57870 (0.0006) [2023-03-07 04:37:56,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13155.8). Total num frames: 59261952. Throughput: 0: 13152.5. Samples: 59228578. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:37:56,086][117718] Avg episode reward: [(0, '3093.641')] [2023-03-07 04:37:56,582][118044] Updated weights for policy 0, policy_version 57880 (0.0006) [2023-03-07 04:37:57,353][118044] Updated weights for policy 0, policy_version 57890 (0.0006) [2023-03-07 04:37:58,126][118044] Updated weights for policy 0, policy_version 57900 (0.0006) [2023-03-07 04:37:58,910][118044] Updated weights for policy 0, policy_version 57910 (0.0006) [2023-03-07 04:37:59,683][118044] Updated weights for policy 0, policy_version 57920 (0.0006) [2023-03-07 04:38:00,482][118044] Updated weights for policy 0, policy_version 57930 (0.0007) [2023-03-07 04:38:01,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13155.8). Total num frames: 59327488. Throughput: 0: 13155.2. Samples: 59307518. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:38:01,086][117718] Avg episode reward: [(0, '3132.549')] [2023-03-07 04:38:01,250][118044] Updated weights for policy 0, policy_version 57940 (0.0006) [2023-03-07 04:38:02,034][118044] Updated weights for policy 0, policy_version 57950 (0.0005) [2023-03-07 04:38:02,790][118044] Updated weights for policy 0, policy_version 57960 (0.0005) [2023-03-07 04:38:03,569][118044] Updated weights for policy 0, policy_version 57970 (0.0006) [2023-03-07 04:38:04,366][118044] Updated weights for policy 0, policy_version 57980 (0.0005) [2023-03-07 04:38:05,140][118044] Updated weights for policy 0, policy_version 57990 (0.0006) [2023-03-07 04:38:05,927][118044] Updated weights for policy 0, policy_version 58000 (0.0006) [2023-03-07 04:38:06,086][117718] Fps is (10 sec: 13209.4, 60 sec: 13158.4, 300 sec: 13159.3). Total num frames: 59394048. Throughput: 0: 13152.9. Samples: 59386474. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:38:06,086][117718] Avg episode reward: [(0, '3118.245')] [2023-03-07 04:38:06,697][118044] Updated weights for policy 0, policy_version 58010 (0.0005) [2023-03-07 04:38:07,460][118044] Updated weights for policy 0, policy_version 58020 (0.0006) [2023-03-07 04:38:08,230][118044] Updated weights for policy 0, policy_version 58030 (0.0005) [2023-03-07 04:38:09,027][118044] Updated weights for policy 0, policy_version 58040 (0.0006) [2023-03-07 04:38:09,781][118044] Updated weights for policy 0, policy_version 58050 (0.0006) [2023-03-07 04:38:10,573][118044] Updated weights for policy 0, policy_version 58060 (0.0005) [2023-03-07 04:38:11,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13159.3). Total num frames: 59459584. Throughput: 0: 13151.6. Samples: 59426096. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:38:11,086][117718] Avg episode reward: [(0, '3127.583')] [2023-03-07 04:38:11,340][118044] Updated weights for policy 0, policy_version 58070 (0.0006) [2023-03-07 04:38:12,109][118044] Updated weights for policy 0, policy_version 58080 (0.0006) [2023-03-07 04:38:12,914][118044] Updated weights for policy 0, policy_version 58090 (0.0006) [2023-03-07 04:38:13,700][118044] Updated weights for policy 0, policy_version 58100 (0.0006) [2023-03-07 04:38:14,469][118044] Updated weights for policy 0, policy_version 58110 (0.0007) [2023-03-07 04:38:15,246][118044] Updated weights for policy 0, policy_version 58120 (0.0006) [2023-03-07 04:38:15,999][118044] Updated weights for policy 0, policy_version 58130 (0.0007) [2023-03-07 04:38:16,086][117718] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13155.8). Total num frames: 59525120. Throughput: 0: 13144.7. Samples: 59504787. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:38:16,086][117718] Avg episode reward: [(0, '2963.660')] [2023-03-07 04:38:16,803][118044] Updated weights for policy 0, policy_version 58140 (0.0006) [2023-03-07 04:38:17,578][118044] Updated weights for policy 0, policy_version 58150 (0.0006) [2023-03-07 04:38:18,367][118044] Updated weights for policy 0, policy_version 58160 (0.0006) [2023-03-07 04:38:19,144][118044] Updated weights for policy 0, policy_version 58170 (0.0006) [2023-03-07 04:38:19,927][118044] Updated weights for policy 0, policy_version 58180 (0.0006) [2023-03-07 04:38:20,719][118044] Updated weights for policy 0, policy_version 58190 (0.0006) [2023-03-07 04:38:21,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13155.8). Total num frames: 59590656. Throughput: 0: 13143.6. Samples: 59583504. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:38:21,086][117718] Avg episode reward: [(0, '3069.802')] [2023-03-07 04:38:21,498][118044] Updated weights for policy 0, policy_version 58200 (0.0006) [2023-03-07 04:38:22,280][118044] Updated weights for policy 0, policy_version 58210 (0.0006) [2023-03-07 04:38:23,059][118044] Updated weights for policy 0, policy_version 58220 (0.0006) [2023-03-07 04:38:23,845][118044] Updated weights for policy 0, policy_version 58230 (0.0007) [2023-03-07 04:38:24,617][118044] Updated weights for policy 0, policy_version 58240 (0.0006) [2023-03-07 04:38:25,379][118044] Updated weights for policy 0, policy_version 58250 (0.0005) [2023-03-07 04:38:26,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.4, 300 sec: 13155.8). Total num frames: 59656192. Throughput: 0: 13144.3. Samples: 59622848. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:38:26,086][117718] Avg episode reward: [(0, '3129.211')] [2023-03-07 04:38:26,169][118044] Updated weights for policy 0, policy_version 58260 (0.0006) [2023-03-07 04:38:26,944][118044] Updated weights for policy 0, policy_version 58270 (0.0006) [2023-03-07 04:38:27,735][118044] Updated weights for policy 0, policy_version 58280 (0.0006) [2023-03-07 04:38:28,497][118044] Updated weights for policy 0, policy_version 58290 (0.0006) [2023-03-07 04:38:29,297][118044] Updated weights for policy 0, policy_version 58300 (0.0006) [2023-03-07 04:38:30,063][118044] Updated weights for policy 0, policy_version 58310 (0.0008) [2023-03-07 04:38:30,822][118044] Updated weights for policy 0, policy_version 58320 (0.0006) [2023-03-07 04:38:31,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13159.3). Total num frames: 59722752. Throughput: 0: 13140.9. Samples: 59701817. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:38:31,086][117718] Avg episode reward: [(0, '3000.532')] [2023-03-07 04:38:31,617][118044] Updated weights for policy 0, policy_version 58330 (0.0007) [2023-03-07 04:38:32,401][118044] Updated weights for policy 0, policy_version 58340 (0.0006) [2023-03-07 04:38:33,165][118044] Updated weights for policy 0, policy_version 58350 (0.0006) [2023-03-07 04:38:33,944][118044] Updated weights for policy 0, policy_version 58360 (0.0006) [2023-03-07 04:38:34,725][118044] Updated weights for policy 0, policy_version 58370 (0.0006) [2023-03-07 04:38:35,511][118044] Updated weights for policy 0, policy_version 58380 (0.0007) [2023-03-07 04:38:36,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13159.3). Total num frames: 59788288. Throughput: 0: 13143.8. Samples: 59780903. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:38:36,086][117718] Avg episode reward: [(0, '2961.948')] [2023-03-07 04:38:36,090][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000058387_59788288.pth... [2023-03-07 04:38:36,118][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000055304_56631296.pth [2023-03-07 04:38:36,285][118044] Updated weights for policy 0, policy_version 58390 (0.0006) [2023-03-07 04:38:37,057][118044] Updated weights for policy 0, policy_version 58400 (0.0006) [2023-03-07 04:38:37,837][118044] Updated weights for policy 0, policy_version 58410 (0.0006) [2023-03-07 04:38:38,609][118044] Updated weights for policy 0, policy_version 58420 (0.0007) [2023-03-07 04:38:39,404][118044] Updated weights for policy 0, policy_version 58430 (0.0006) [2023-03-07 04:38:40,203][118044] Updated weights for policy 0, policy_version 58440 (0.0006) [2023-03-07 04:38:40,984][118044] Updated weights for policy 0, policy_version 58450 (0.0006) [2023-03-07 04:38:41,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13141.3, 300 sec: 13155.8). Total num frames: 59853824. Throughput: 0: 13154.1. Samples: 59820515. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:38:41,086][117718] Avg episode reward: [(0, '2985.042')] [2023-03-07 04:38:41,748][118044] Updated weights for policy 0, policy_version 58460 (0.0007) [2023-03-07 04:38:42,528][118044] Updated weights for policy 0, policy_version 58470 (0.0006) [2023-03-07 04:38:43,291][118044] Updated weights for policy 0, policy_version 58480 (0.0006) [2023-03-07 04:38:44,089][118044] Updated weights for policy 0, policy_version 58490 (0.0006) [2023-03-07 04:38:44,850][118044] Updated weights for policy 0, policy_version 58500 (0.0006) [2023-03-07 04:38:45,646][118044] Updated weights for policy 0, policy_version 58510 (0.0006) [2023-03-07 04:38:46,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.4, 300 sec: 13155.8). Total num frames: 59919360. Throughput: 0: 13145.9. Samples: 59899083. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:38:46,086][117718] Avg episode reward: [(0, '3157.775')] [2023-03-07 04:38:46,432][118044] Updated weights for policy 0, policy_version 58520 (0.0006) [2023-03-07 04:38:47,200][118044] Updated weights for policy 0, policy_version 58530 (0.0006) [2023-03-07 04:38:47,969][118044] Updated weights for policy 0, policy_version 58540 (0.0006) [2023-03-07 04:38:48,752][118044] Updated weights for policy 0, policy_version 58550 (0.0006) [2023-03-07 04:38:49,522][118044] Updated weights for policy 0, policy_version 58560 (0.0006) [2023-03-07 04:38:50,289][118044] Updated weights for policy 0, policy_version 58570 (0.0006) [2023-03-07 04:38:51,085][117718] Fps is (10 sec: 13107.4, 60 sec: 13141.3, 300 sec: 13152.3). Total num frames: 59984896. Throughput: 0: 13146.0. Samples: 59978040. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:38:51,086][117718] Avg episode reward: [(0, '3031.879')] [2023-03-07 04:38:51,091][118044] Updated weights for policy 0, policy_version 58580 (0.0007) [2023-03-07 04:38:51,894][118044] Updated weights for policy 0, policy_version 58590 (0.0007) [2023-03-07 04:38:52,666][118044] Updated weights for policy 0, policy_version 58600 (0.0006) [2023-03-07 04:38:53,442][118044] Updated weights for policy 0, policy_version 58610 (0.0006) [2023-03-07 04:38:54,214][118044] Updated weights for policy 0, policy_version 58620 (0.0005) [2023-03-07 04:38:54,989][118044] Updated weights for policy 0, policy_version 58630 (0.0006) [2023-03-07 04:38:55,762][118044] Updated weights for policy 0, policy_version 58640 (0.0005) [2023-03-07 04:38:56,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13155.8). Total num frames: 60051456. Throughput: 0: 13136.4. Samples: 60017234. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:38:56,086][117718] Avg episode reward: [(0, '3031.741')] [2023-03-07 04:38:56,538][118044] Updated weights for policy 0, policy_version 58650 (0.0006) [2023-03-07 04:38:57,335][118044] Updated weights for policy 0, policy_version 58660 (0.0006) [2023-03-07 04:38:58,101][118044] Updated weights for policy 0, policy_version 58670 (0.0006) [2023-03-07 04:38:58,889][118044] Updated weights for policy 0, policy_version 58680 (0.0006) [2023-03-07 04:38:59,654][118044] Updated weights for policy 0, policy_version 58690 (0.0006) [2023-03-07 04:39:00,432][118044] Updated weights for policy 0, policy_version 58700 (0.0006) [2023-03-07 04:39:01,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 60116992. Throughput: 0: 13147.6. Samples: 60096429. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:39:01,086][117718] Avg episode reward: [(0, '2962.931')] [2023-03-07 04:39:01,216][118044] Updated weights for policy 0, policy_version 58710 (0.0006) [2023-03-07 04:39:01,980][118044] Updated weights for policy 0, policy_version 58720 (0.0007) [2023-03-07 04:39:02,762][118044] Updated weights for policy 0, policy_version 58730 (0.0006) [2023-03-07 04:39:03,531][118044] Updated weights for policy 0, policy_version 58740 (0.0006) [2023-03-07 04:39:04,306][118044] Updated weights for policy 0, policy_version 58750 (0.0006) [2023-03-07 04:39:05,084][118044] Updated weights for policy 0, policy_version 58760 (0.0006) [2023-03-07 04:39:05,870][118044] Updated weights for policy 0, policy_version 58770 (0.0006) [2023-03-07 04:39:06,085][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13155.8). Total num frames: 60183552. Throughput: 0: 13160.0. Samples: 60175703. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:39:06,097][117718] Avg episode reward: [(0, '3003.043')] [2023-03-07 04:39:06,650][118044] Updated weights for policy 0, policy_version 58780 (0.0006) [2023-03-07 04:39:07,433][118044] Updated weights for policy 0, policy_version 58790 (0.0006) [2023-03-07 04:39:08,209][118044] Updated weights for policy 0, policy_version 58800 (0.0007) [2023-03-07 04:39:08,992][118044] Updated weights for policy 0, policy_version 58810 (0.0006) [2023-03-07 04:39:09,765][118044] Updated weights for policy 0, policy_version 58820 (0.0006) [2023-03-07 04:39:10,545][118044] Updated weights for policy 0, policy_version 58830 (0.0006) [2023-03-07 04:39:11,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13155.8). Total num frames: 60249088. Throughput: 0: 13158.4. Samples: 60214975. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:39:11,096][117718] Avg episode reward: [(0, '3033.401')] [2023-03-07 04:39:11,320][118044] Updated weights for policy 0, policy_version 58840 (0.0007) [2023-03-07 04:39:12,086][118044] Updated weights for policy 0, policy_version 58850 (0.0006) [2023-03-07 04:39:12,865][118044] Updated weights for policy 0, policy_version 58860 (0.0006) [2023-03-07 04:39:13,631][118044] Updated weights for policy 0, policy_version 58870 (0.0007) [2023-03-07 04:39:14,394][118044] Updated weights for policy 0, policy_version 58880 (0.0005) [2023-03-07 04:39:15,177][118044] Updated weights for policy 0, policy_version 58890 (0.0006) [2023-03-07 04:39:15,951][118044] Updated weights for policy 0, policy_version 58900 (0.0006) [2023-03-07 04:39:16,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13155.8). Total num frames: 60314624. Throughput: 0: 13163.3. Samples: 60294165. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:39:16,097][117718] Avg episode reward: [(0, '3026.035')] [2023-03-07 04:39:16,736][118044] Updated weights for policy 0, policy_version 58910 (0.0008) [2023-03-07 04:39:17,509][118044] Updated weights for policy 0, policy_version 58920 (0.0006) [2023-03-07 04:39:18,282][118044] Updated weights for policy 0, policy_version 58930 (0.0006) [2023-03-07 04:39:19,057][118044] Updated weights for policy 0, policy_version 58940 (0.0007) [2023-03-07 04:39:19,829][118044] Updated weights for policy 0, policy_version 58950 (0.0007) [2023-03-07 04:39:20,622][118044] Updated weights for policy 0, policy_version 58960 (0.0006) [2023-03-07 04:39:21,086][117718] Fps is (10 sec: 13209.4, 60 sec: 13175.5, 300 sec: 13159.3). Total num frames: 60381184. Throughput: 0: 13163.7. Samples: 60373271. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:39:21,097][117718] Avg episode reward: [(0, '2921.121')] [2023-03-07 04:39:21,379][118044] Updated weights for policy 0, policy_version 58970 (0.0006) [2023-03-07 04:39:22,172][118044] Updated weights for policy 0, policy_version 58980 (0.0006) [2023-03-07 04:39:22,961][118044] Updated weights for policy 0, policy_version 58990 (0.0006) [2023-03-07 04:39:23,725][118044] Updated weights for policy 0, policy_version 59000 (0.0006) [2023-03-07 04:39:24,514][118044] Updated weights for policy 0, policy_version 59010 (0.0006) [2023-03-07 04:39:25,288][118044] Updated weights for policy 0, policy_version 59020 (0.0006) [2023-03-07 04:39:26,062][118044] Updated weights for policy 0, policy_version 59030 (0.0006) [2023-03-07 04:39:26,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13155.8). Total num frames: 60446720. Throughput: 0: 13156.8. Samples: 60412569. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:39:26,097][117718] Avg episode reward: [(0, '3010.563')] [2023-03-07 04:39:26,851][118044] Updated weights for policy 0, policy_version 59040 (0.0006) [2023-03-07 04:39:27,626][118044] Updated weights for policy 0, policy_version 59050 (0.0007) [2023-03-07 04:39:28,396][118044] Updated weights for policy 0, policy_version 59060 (0.0006) [2023-03-07 04:39:29,163][118044] Updated weights for policy 0, policy_version 59070 (0.0006) [2023-03-07 04:39:29,942][118044] Updated weights for policy 0, policy_version 59080 (0.0007) [2023-03-07 04:39:30,715][118044] Updated weights for policy 0, policy_version 59090 (0.0006) [2023-03-07 04:39:31,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13155.8). Total num frames: 60512256. Throughput: 0: 13172.1. Samples: 60491828. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:39:31,097][117718] Avg episode reward: [(0, '2909.201')] [2023-03-07 04:39:31,487][118044] Updated weights for policy 0, policy_version 59100 (0.0006) [2023-03-07 04:39:32,275][118044] Updated weights for policy 0, policy_version 59110 (0.0006) [2023-03-07 04:39:33,080][118044] Updated weights for policy 0, policy_version 59120 (0.0007) [2023-03-07 04:39:33,852][118044] Updated weights for policy 0, policy_version 59130 (0.0006) [2023-03-07 04:39:34,637][118044] Updated weights for policy 0, policy_version 59140 (0.0006) [2023-03-07 04:39:35,420][118044] Updated weights for policy 0, policy_version 59150 (0.0006) [2023-03-07 04:39:36,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13155.8). Total num frames: 60577792. Throughput: 0: 13165.4. Samples: 60570487. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:39:36,097][117718] Avg episode reward: [(0, '2889.077')] [2023-03-07 04:39:36,205][118044] Updated weights for policy 0, policy_version 59160 (0.0007) [2023-03-07 04:39:36,978][118044] Updated weights for policy 0, policy_version 59170 (0.0006) [2023-03-07 04:39:37,741][118044] Updated weights for policy 0, policy_version 59180 (0.0006) [2023-03-07 04:39:38,526][118044] Updated weights for policy 0, policy_version 59190 (0.0006) [2023-03-07 04:39:39,301][118044] Updated weights for policy 0, policy_version 59200 (0.0006) [2023-03-07 04:39:40,079][118044] Updated weights for policy 0, policy_version 59210 (0.0006) [2023-03-07 04:39:40,848][118044] Updated weights for policy 0, policy_version 59220 (0.0006) [2023-03-07 04:39:41,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13155.8). Total num frames: 60643328. Throughput: 0: 13176.0. Samples: 60610158. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:39:41,097][117718] Avg episode reward: [(0, '2899.877')] [2023-03-07 04:39:41,628][118044] Updated weights for policy 0, policy_version 59230 (0.0006) [2023-03-07 04:39:42,416][118044] Updated weights for policy 0, policy_version 59240 (0.0006) [2023-03-07 04:39:43,178][118044] Updated weights for policy 0, policy_version 59250 (0.0006) [2023-03-07 04:39:43,966][118044] Updated weights for policy 0, policy_version 59260 (0.0007) [2023-03-07 04:39:44,749][118044] Updated weights for policy 0, policy_version 59270 (0.0007) [2023-03-07 04:39:45,530][118044] Updated weights for policy 0, policy_version 59280 (0.0007) [2023-03-07 04:39:46,085][117718] Fps is (10 sec: 13209.8, 60 sec: 13175.5, 300 sec: 13155.8). Total num frames: 60709888. Throughput: 0: 13170.1. Samples: 60689084. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:39:46,097][117718] Avg episode reward: [(0, '2932.668')] [2023-03-07 04:39:46,314][118044] Updated weights for policy 0, policy_version 59290 (0.0006) [2023-03-07 04:39:47,081][118044] Updated weights for policy 0, policy_version 59300 (0.0006) [2023-03-07 04:39:47,847][118044] Updated weights for policy 0, policy_version 59310 (0.0005) [2023-03-07 04:39:48,626][118044] Updated weights for policy 0, policy_version 59320 (0.0006) [2023-03-07 04:39:49,403][118044] Updated weights for policy 0, policy_version 59330 (0.0006) [2023-03-07 04:39:50,172][118044] Updated weights for policy 0, policy_version 59340 (0.0006) [2023-03-07 04:39:50,949][118044] Updated weights for policy 0, policy_version 59350 (0.0007) [2023-03-07 04:39:51,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13155.8). Total num frames: 60775424. Throughput: 0: 13170.4. Samples: 60768371. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:39:51,096][117718] Avg episode reward: [(0, '3009.250')] [2023-03-07 04:39:51,738][118044] Updated weights for policy 0, policy_version 59360 (0.0007) [2023-03-07 04:39:52,512][118044] Updated weights for policy 0, policy_version 59370 (0.0006) [2023-03-07 04:39:53,274][118044] Updated weights for policy 0, policy_version 59380 (0.0005) [2023-03-07 04:39:54,045][118044] Updated weights for policy 0, policy_version 59390 (0.0007) [2023-03-07 04:39:54,851][118044] Updated weights for policy 0, policy_version 59400 (0.0006) [2023-03-07 04:39:55,639][118044] Updated weights for policy 0, policy_version 59410 (0.0007) [2023-03-07 04:39:56,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13155.8). Total num frames: 60840960. Throughput: 0: 13173.9. Samples: 60807803. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:39:56,097][117718] Avg episode reward: [(0, '2998.308')] [2023-03-07 04:39:56,412][118044] Updated weights for policy 0, policy_version 59420 (0.0006) [2023-03-07 04:39:57,182][118044] Updated weights for policy 0, policy_version 59430 (0.0006) [2023-03-07 04:39:57,963][118044] Updated weights for policy 0, policy_version 59440 (0.0007) [2023-03-07 04:39:58,741][118044] Updated weights for policy 0, policy_version 59450 (0.0007) [2023-03-07 04:39:59,506][118044] Updated weights for policy 0, policy_version 59460 (0.0006) [2023-03-07 04:40:00,297][118044] Updated weights for policy 0, policy_version 59470 (0.0006) [2023-03-07 04:40:01,073][118044] Updated weights for policy 0, policy_version 59480 (0.0008) [2023-03-07 04:40:01,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13175.4, 300 sec: 13159.3). Total num frames: 60907520. Throughput: 0: 13164.0. Samples: 60886546. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:40:01,097][117718] Avg episode reward: [(0, '2927.688')] [2023-03-07 04:40:01,849][118044] Updated weights for policy 0, policy_version 59490 (0.0007) [2023-03-07 04:40:02,649][118044] Updated weights for policy 0, policy_version 59500 (0.0006) [2023-03-07 04:40:03,422][118044] Updated weights for policy 0, policy_version 59510 (0.0006) [2023-03-07 04:40:04,200][118044] Updated weights for policy 0, policy_version 59520 (0.0006) [2023-03-07 04:40:04,985][118044] Updated weights for policy 0, policy_version 59530 (0.0005) [2023-03-07 04:40:05,738][118044] Updated weights for policy 0, policy_version 59540 (0.0007) [2023-03-07 04:40:06,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13155.8). Total num frames: 60973056. Throughput: 0: 13161.9. Samples: 60965557. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:40:06,097][117718] Avg episode reward: [(0, '2770.907')] [2023-03-07 04:40:06,515][118044] Updated weights for policy 0, policy_version 59550 (0.0006) [2023-03-07 04:40:07,292][118044] Updated weights for policy 0, policy_version 59560 (0.0005) [2023-03-07 04:40:08,054][118044] Updated weights for policy 0, policy_version 59570 (0.0006) [2023-03-07 04:40:08,847][118044] Updated weights for policy 0, policy_version 59580 (0.0006) [2023-03-07 04:40:09,613][118044] Updated weights for policy 0, policy_version 59590 (0.0005) [2023-03-07 04:40:10,385][118044] Updated weights for policy 0, policy_version 59600 (0.0006) [2023-03-07 04:40:11,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13159.3). Total num frames: 61039616. Throughput: 0: 13169.8. Samples: 61005209. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:40:11,096][117718] Avg episode reward: [(0, '2876.023')] [2023-03-07 04:40:11,175][118044] Updated weights for policy 0, policy_version 59610 (0.0006) [2023-03-07 04:40:11,953][118044] Updated weights for policy 0, policy_version 59620 (0.0006) [2023-03-07 04:40:12,727][118044] Updated weights for policy 0, policy_version 59630 (0.0007) [2023-03-07 04:40:13,504][118044] Updated weights for policy 0, policy_version 59640 (0.0006) [2023-03-07 04:40:14,284][118044] Updated weights for policy 0, policy_version 59650 (0.0006) [2023-03-07 04:40:15,055][118044] Updated weights for policy 0, policy_version 59660 (0.0006) [2023-03-07 04:40:15,830][118044] Updated weights for policy 0, policy_version 59670 (0.0006) [2023-03-07 04:40:16,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13155.8). Total num frames: 61105152. Throughput: 0: 13167.9. Samples: 61084382. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:40:16,096][117718] Avg episode reward: [(0, '2889.139')] [2023-03-07 04:40:16,597][118044] Updated weights for policy 0, policy_version 59680 (0.0006) [2023-03-07 04:40:17,378][118044] Updated weights for policy 0, policy_version 59690 (0.0006) [2023-03-07 04:40:18,168][118044] Updated weights for policy 0, policy_version 59700 (0.0006) [2023-03-07 04:40:18,942][118044] Updated weights for policy 0, policy_version 59710 (0.0006) [2023-03-07 04:40:19,734][118044] Updated weights for policy 0, policy_version 59720 (0.0006) [2023-03-07 04:40:20,495][118044] Updated weights for policy 0, policy_version 59730 (0.0006) [2023-03-07 04:40:21,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13159.3). Total num frames: 61170688. Throughput: 0: 13175.7. Samples: 61163390. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:40:21,097][117718] Avg episode reward: [(0, '2806.894')] [2023-03-07 04:40:21,280][118044] Updated weights for policy 0, policy_version 59740 (0.0006) [2023-03-07 04:40:22,060][118044] Updated weights for policy 0, policy_version 59750 (0.0006) [2023-03-07 04:40:22,832][118044] Updated weights for policy 0, policy_version 59760 (0.0006) [2023-03-07 04:40:23,616][118044] Updated weights for policy 0, policy_version 59770 (0.0008) [2023-03-07 04:40:24,393][118044] Updated weights for policy 0, policy_version 59780 (0.0005) [2023-03-07 04:40:25,161][118044] Updated weights for policy 0, policy_version 59790 (0.0006) [2023-03-07 04:40:25,915][118044] Updated weights for policy 0, policy_version 59800 (0.0006) [2023-03-07 04:40:26,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13159.3). Total num frames: 61237248. Throughput: 0: 13172.8. Samples: 61202932. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:40:26,096][117718] Avg episode reward: [(0, '2945.684')] [2023-03-07 04:40:26,709][118044] Updated weights for policy 0, policy_version 59810 (0.0006) [2023-03-07 04:40:27,476][118044] Updated weights for policy 0, policy_version 59820 (0.0005) [2023-03-07 04:40:28,257][118044] Updated weights for policy 0, policy_version 59830 (0.0006) [2023-03-07 04:40:29,040][118044] Updated weights for policy 0, policy_version 59840 (0.0007) [2023-03-07 04:40:29,810][118044] Updated weights for policy 0, policy_version 59850 (0.0005) [2023-03-07 04:40:30,593][118044] Updated weights for policy 0, policy_version 59860 (0.0006) [2023-03-07 04:40:31,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13159.3). Total num frames: 61302784. Throughput: 0: 13180.0. Samples: 61282185. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:40:31,096][117718] Avg episode reward: [(0, '2896.488')] [2023-03-07 04:40:31,378][118044] Updated weights for policy 0, policy_version 59870 (0.0006) [2023-03-07 04:40:32,143][118044] Updated weights for policy 0, policy_version 59880 (0.0006) [2023-03-07 04:40:32,920][118044] Updated weights for policy 0, policy_version 59890 (0.0006) [2023-03-07 04:40:33,690][118044] Updated weights for policy 0, policy_version 59900 (0.0006) [2023-03-07 04:40:34,482][118044] Updated weights for policy 0, policy_version 59910 (0.0006) [2023-03-07 04:40:35,256][118044] Updated weights for policy 0, policy_version 59920 (0.0006) [2023-03-07 04:40:36,026][118044] Updated weights for policy 0, policy_version 59930 (0.0006) [2023-03-07 04:40:36,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13175.5, 300 sec: 13159.3). Total num frames: 61368320. Throughput: 0: 13171.8. Samples: 61361102. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:40:36,097][117718] Avg episode reward: [(0, '2872.651')] [2023-03-07 04:40:36,101][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000059930_61368320.pth... 
[2023-03-07 04:40:36,132][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000056846_58210304.pth [2023-03-07 04:40:36,805][118044] Updated weights for policy 0, policy_version 59940 (0.0006) [2023-03-07 04:40:37,579][118044] Updated weights for policy 0, policy_version 59950 (0.0006) [2023-03-07 04:40:38,382][118044] Updated weights for policy 0, policy_version 59960 (0.0006) [2023-03-07 04:40:39,154][118044] Updated weights for policy 0, policy_version 59970 (0.0006) [2023-03-07 04:40:39,938][118044] Updated weights for policy 0, policy_version 59980 (0.0006) [2023-03-07 04:40:40,694][118044] Updated weights for policy 0, policy_version 59990 (0.0006) [2023-03-07 04:40:41,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13175.5, 300 sec: 13159.3). Total num frames: 61433856. Throughput: 0: 13168.0. Samples: 61400362. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:40:41,096][117718] Avg episode reward: [(0, '2910.110')] [2023-03-07 04:40:41,475][118044] Updated weights for policy 0, policy_version 60000 (0.0006) [2023-03-07 04:40:42,241][118044] Updated weights for policy 0, policy_version 60010 (0.0006) [2023-03-07 04:40:43,005][118044] Updated weights for policy 0, policy_version 60020 (0.0006) [2023-03-07 04:40:43,813][118044] Updated weights for policy 0, policy_version 60030 (0.0006) [2023-03-07 04:40:44,600][118044] Updated weights for policy 0, policy_version 60040 (0.0005) [2023-03-07 04:40:45,377][118044] Updated weights for policy 0, policy_version 60050 (0.0007) [2023-03-07 04:40:46,085][117718] Fps is (10 sec: 13209.9, 60 sec: 13175.5, 300 sec: 13159.3). Total num frames: 61500416. Throughput: 0: 13176.8. Samples: 61479498. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:40:46,096][117718] Avg episode reward: [(0, '3067.069')] [2023-03-07 04:40:46,163][118044] Updated weights for policy 0, policy_version 60060 (0.0006) [2023-03-07 04:40:46,932][118044] Updated weights for policy 0, policy_version 60070 (0.0006) [2023-03-07 04:40:47,709][118044] Updated weights for policy 0, policy_version 60080 (0.0006) [2023-03-07 04:40:48,493][118044] Updated weights for policy 0, policy_version 60090 (0.0006) [2023-03-07 04:40:49,271][118044] Updated weights for policy 0, policy_version 60100 (0.0006) [2023-03-07 04:40:50,051][118044] Updated weights for policy 0, policy_version 60110 (0.0007) [2023-03-07 04:40:50,827][118044] Updated weights for policy 0, policy_version 60120 (0.0006) [2023-03-07 04:40:51,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13159.3). Total num frames: 61565952. Throughput: 0: 13175.9. Samples: 61558471. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:40:51,086][117718] Avg episode reward: [(0, '3009.524')] [2023-03-07 04:40:51,601][118044] Updated weights for policy 0, policy_version 60130 (0.0007) [2023-03-07 04:40:52,363][118044] Updated weights for policy 0, policy_version 60140 (0.0006) [2023-03-07 04:40:53,132][118044] Updated weights for policy 0, policy_version 60150 (0.0006) [2023-03-07 04:40:53,902][118044] Updated weights for policy 0, policy_version 60160 (0.0007) [2023-03-07 04:40:54,687][118044] Updated weights for policy 0, policy_version 60170 (0.0006) [2023-03-07 04:40:55,454][118044] Updated weights for policy 0, policy_version 60180 (0.0006) [2023-03-07 04:40:56,086][117718] Fps is (10 sec: 13209.4, 60 sec: 13192.5, 300 sec: 13162.7). Total num frames: 61632512. Throughput: 0: 13181.9. Samples: 61598397. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:40:56,086][117718] Avg episode reward: [(0, '3025.128')] [2023-03-07 04:40:56,232][118044] Updated weights for policy 0, policy_version 60190 (0.0006) [2023-03-07 04:40:57,009][118044] Updated weights for policy 0, policy_version 60200 (0.0006) [2023-03-07 04:40:57,790][118044] Updated weights for policy 0, policy_version 60210 (0.0006) [2023-03-07 04:40:58,559][118044] Updated weights for policy 0, policy_version 60220 (0.0006) [2023-03-07 04:40:59,341][118044] Updated weights for policy 0, policy_version 60230 (0.0006) [2023-03-07 04:41:00,133][118044] Updated weights for policy 0, policy_version 60240 (0.0006) [2023-03-07 04:41:00,899][118044] Updated weights for policy 0, policy_version 60250 (0.0006) [2023-03-07 04:41:01,085][117718] Fps is (10 sec: 13209.5, 60 sec: 13175.5, 300 sec: 13166.2). Total num frames: 61698048. Throughput: 0: 13181.3. Samples: 61677541. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:41:01,086][117718] Avg episode reward: [(0, '2991.861')] [2023-03-07 04:41:01,672][118044] Updated weights for policy 0, policy_version 60260 (0.0006) [2023-03-07 04:41:02,461][118044] Updated weights for policy 0, policy_version 60270 (0.0008) [2023-03-07 04:41:03,231][118044] Updated weights for policy 0, policy_version 60280 (0.0006) [2023-03-07 04:41:04,001][118044] Updated weights for policy 0, policy_version 60290 (0.0007) [2023-03-07 04:41:04,799][118044] Updated weights for policy 0, policy_version 60300 (0.0008) [2023-03-07 04:41:05,582][118044] Updated weights for policy 0, policy_version 60310 (0.0006) [2023-03-07 04:41:06,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13175.5, 300 sec: 13162.7). Total num frames: 61763584. Throughput: 0: 13177.3. Samples: 61756369. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:41:06,086][117718] Avg episode reward: [(0, '3094.532')] [2023-03-07 04:41:06,361][118044] Updated weights for policy 0, policy_version 60320 (0.0007) [2023-03-07 04:41:07,135][118044] Updated weights for policy 0, policy_version 60330 (0.0006) [2023-03-07 04:41:07,906][118044] Updated weights for policy 0, policy_version 60340 (0.0006) [2023-03-07 04:41:08,687][118044] Updated weights for policy 0, policy_version 60350 (0.0006) [2023-03-07 04:41:09,454][118044] Updated weights for policy 0, policy_version 60360 (0.0006) [2023-03-07 04:41:10,233][118044] Updated weights for policy 0, policy_version 60370 (0.0006) [2023-03-07 04:41:11,003][118044] Updated weights for policy 0, policy_version 60380 (0.0006) [2023-03-07 04:41:11,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13166.2). Total num frames: 61830144. Throughput: 0: 13177.7. Samples: 61795928. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:41:11,086][117718] Avg episode reward: [(0, '2949.859')] [2023-03-07 04:41:11,781][118044] Updated weights for policy 0, policy_version 60390 (0.0007) [2023-03-07 04:41:12,563][118044] Updated weights for policy 0, policy_version 60400 (0.0007) [2023-03-07 04:41:13,345][118044] Updated weights for policy 0, policy_version 60410 (0.0005) [2023-03-07 04:41:14,125][118044] Updated weights for policy 0, policy_version 60420 (0.0005) [2023-03-07 04:41:14,894][118044] Updated weights for policy 0, policy_version 60430 (0.0006) [2023-03-07 04:41:15,680][118044] Updated weights for policy 0, policy_version 60440 (0.0006) [2023-03-07 04:41:16,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13162.7). Total num frames: 61895680. Throughput: 0: 13171.7. Samples: 61874910. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:41:16,086][117718] Avg episode reward: [(0, '3089.502')] [2023-03-07 04:41:16,452][118044] Updated weights for policy 0, policy_version 60450 (0.0005) [2023-03-07 04:41:17,222][118044] Updated weights for policy 0, policy_version 60460 (0.0006) [2023-03-07 04:41:18,014][118044] Updated weights for policy 0, policy_version 60470 (0.0006) [2023-03-07 04:41:18,789][118044] Updated weights for policy 0, policy_version 60480 (0.0006) [2023-03-07 04:41:19,570][118044] Updated weights for policy 0, policy_version 60490 (0.0006) [2023-03-07 04:41:20,358][118044] Updated weights for policy 0, policy_version 60500 (0.0006) [2023-03-07 04:41:21,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13175.5, 300 sec: 13162.7). Total num frames: 61961216. Throughput: 0: 13172.0. Samples: 61953838. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:41:21,086][117718] Avg episode reward: [(0, '3025.261')] [2023-03-07 04:41:21,147][118044] Updated weights for policy 0, policy_version 60510 (0.0006) [2023-03-07 04:41:21,922][118044] Updated weights for policy 0, policy_version 60520 (0.0006) [2023-03-07 04:41:22,690][118044] Updated weights for policy 0, policy_version 60530 (0.0006) [2023-03-07 04:41:23,477][118044] Updated weights for policy 0, policy_version 60540 (0.0007) [2023-03-07 04:41:24,259][118044] Updated weights for policy 0, policy_version 60550 (0.0007) [2023-03-07 04:41:25,029][118044] Updated weights for policy 0, policy_version 60560 (0.0006) [2023-03-07 04:41:25,803][118044] Updated weights for policy 0, policy_version 60570 (0.0006) [2023-03-07 04:41:26,086][117718] Fps is (10 sec: 13106.9, 60 sec: 13158.4, 300 sec: 13159.3). Total num frames: 62026752. Throughput: 0: 13174.6. Samples: 61993220. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:41:26,086][117718] Avg episode reward: [(0, '2967.096')] [2023-03-07 04:41:26,564][118044] Updated weights for policy 0, policy_version 60580 (0.0007) [2023-03-07 04:41:27,347][118044] Updated weights for policy 0, policy_version 60590 (0.0007) [2023-03-07 04:41:28,111][118044] Updated weights for policy 0, policy_version 60600 (0.0007) [2023-03-07 04:41:28,887][118044] Updated weights for policy 0, policy_version 60610 (0.0007) [2023-03-07 04:41:29,649][118044] Updated weights for policy 0, policy_version 60620 (0.0006) [2023-03-07 04:41:30,418][118044] Updated weights for policy 0, policy_version 60630 (0.0006) [2023-03-07 04:41:31,085][117718] Fps is (10 sec: 13209.5, 60 sec: 13175.5, 300 sec: 13162.7). Total num frames: 62093312. Throughput: 0: 13183.1. Samples: 62072740. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:41:31,086][117718] Avg episode reward: [(0, '2935.488')] [2023-03-07 04:41:31,198][118044] Updated weights for policy 0, policy_version 60640 (0.0006) [2023-03-07 04:41:31,973][118044] Updated weights for policy 0, policy_version 60650 (0.0006) [2023-03-07 04:41:32,735][118044] Updated weights for policy 0, policy_version 60660 (0.0007) [2023-03-07 04:41:33,517][118044] Updated weights for policy 0, policy_version 60670 (0.0006) [2023-03-07 04:41:34,303][118044] Updated weights for policy 0, policy_version 60680 (0.0007) [2023-03-07 04:41:35,077][118044] Updated weights for policy 0, policy_version 60690 (0.0006) [2023-03-07 04:41:35,854][118044] Updated weights for policy 0, policy_version 60700 (0.0006) [2023-03-07 04:41:36,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13162.7). Total num frames: 62158848. Throughput: 0: 13186.6. Samples: 62151870. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:41:36,097][117718] Avg episode reward: [(0, '2986.807')] [2023-03-07 04:41:36,625][118044] Updated weights for policy 0, policy_version 60710 (0.0006) [2023-03-07 04:41:37,390][118044] Updated weights for policy 0, policy_version 60720 (0.0005) [2023-03-07 04:41:38,165][118044] Updated weights for policy 0, policy_version 60730 (0.0006) [2023-03-07 04:41:38,956][118044] Updated weights for policy 0, policy_version 60740 (0.0006) [2023-03-07 04:41:39,726][118044] Updated weights for policy 0, policy_version 60750 (0.0006) [2023-03-07 04:41:40,513][118044] Updated weights for policy 0, policy_version 60760 (0.0006) [2023-03-07 04:41:41,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13166.2). Total num frames: 62225408. Throughput: 0: 13184.5. Samples: 62191699. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:41:41,097][117718] Avg episode reward: [(0, '3036.377')] [2023-03-07 04:41:41,281][118044] Updated weights for policy 0, policy_version 60770 (0.0006) [2023-03-07 04:41:42,064][118044] Updated weights for policy 0, policy_version 60780 (0.0005) [2023-03-07 04:41:42,847][118044] Updated weights for policy 0, policy_version 60790 (0.0006) [2023-03-07 04:41:43,616][118044] Updated weights for policy 0, policy_version 60800 (0.0006) [2023-03-07 04:41:44,389][118044] Updated weights for policy 0, policy_version 60810 (0.0006) [2023-03-07 04:41:45,177][118044] Updated weights for policy 0, policy_version 60820 (0.0006) [2023-03-07 04:41:45,946][118044] Updated weights for policy 0, policy_version 60830 (0.0006) [2023-03-07 04:41:46,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13162.7). Total num frames: 62290944. Throughput: 0: 13184.7. Samples: 62270854. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:41:46,096][117718] Avg episode reward: [(0, '2813.872')] [2023-03-07 04:41:46,739][118044] Updated weights for policy 0, policy_version 60840 (0.0006) [2023-03-07 04:41:47,517][118044] Updated weights for policy 0, policy_version 60850 (0.0007) [2023-03-07 04:41:48,316][118044] Updated weights for policy 0, policy_version 60860 (0.0005) [2023-03-07 04:41:49,093][118044] Updated weights for policy 0, policy_version 60870 (0.0007) [2023-03-07 04:41:49,860][118044] Updated weights for policy 0, policy_version 60880 (0.0006) [2023-03-07 04:41:50,626][118044] Updated weights for policy 0, policy_version 60890 (0.0009) [2023-03-07 04:41:51,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13175.4, 300 sec: 13162.7). Total num frames: 62356480. Throughput: 0: 13180.9. Samples: 62349510. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:41:51,097][117718] Avg episode reward: [(0, '2968.714')] [2023-03-07 04:41:51,414][118044] Updated weights for policy 0, policy_version 60900 (0.0006) [2023-03-07 04:41:52,187][118044] Updated weights for policy 0, policy_version 60910 (0.0006) [2023-03-07 04:41:52,951][118044] Updated weights for policy 0, policy_version 60920 (0.0006) [2023-03-07 04:41:53,721][118044] Updated weights for policy 0, policy_version 60930 (0.0005) [2023-03-07 04:41:54,507][118044] Updated weights for policy 0, policy_version 60940 (0.0006) [2023-03-07 04:41:55,269][118044] Updated weights for policy 0, policy_version 60950 (0.0006) [2023-03-07 04:41:56,055][118044] Updated weights for policy 0, policy_version 60960 (0.0006) [2023-03-07 04:41:56,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13166.2). Total num frames: 62423040. Throughput: 0: 13182.0. Samples: 62389117. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:41:56,096][117718] Avg episode reward: [(0, '2998.354')] [2023-03-07 04:41:56,814][118044] Updated weights for policy 0, policy_version 60970 (0.0006) [2023-03-07 04:41:57,586][118044] Updated weights for policy 0, policy_version 60980 (0.0006) [2023-03-07 04:41:58,389][118044] Updated weights for policy 0, policy_version 60990 (0.0006) [2023-03-07 04:41:59,167][118044] Updated weights for policy 0, policy_version 61000 (0.0006) [2023-03-07 04:41:59,934][118044] Updated weights for policy 0, policy_version 61010 (0.0006) [2023-03-07 04:42:00,716][118044] Updated weights for policy 0, policy_version 61020 (0.0007) [2023-03-07 04:42:01,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13166.2). Total num frames: 62488576. Throughput: 0: 13186.5. Samples: 62468303. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:42:01,096][117718] Avg episode reward: [(0, '2919.171')] [2023-03-07 04:42:01,493][118044] Updated weights for policy 0, policy_version 61030 (0.0006) [2023-03-07 04:42:02,259][118044] Updated weights for policy 0, policy_version 61040 (0.0006) [2023-03-07 04:42:03,049][118044] Updated weights for policy 0, policy_version 61050 (0.0006) [2023-03-07 04:42:03,835][118044] Updated weights for policy 0, policy_version 61060 (0.0007) [2023-03-07 04:42:04,614][118044] Updated weights for policy 0, policy_version 61070 (0.0007) [2023-03-07 04:42:05,391][118044] Updated weights for policy 0, policy_version 61080 (0.0007) [2023-03-07 04:42:06,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13175.5, 300 sec: 13166.2). Total num frames: 62554112. Throughput: 0: 13185.9. Samples: 62547206. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:42:06,097][117718] Avg episode reward: [(0, '3048.189')] [2023-03-07 04:42:06,190][118044] Updated weights for policy 0, policy_version 61090 (0.0006) [2023-03-07 04:42:06,971][118044] Updated weights for policy 0, policy_version 61100 (0.0006) [2023-03-07 04:42:07,734][118044] Updated weights for policy 0, policy_version 61110 (0.0006) [2023-03-07 04:42:08,517][118044] Updated weights for policy 0, policy_version 61120 (0.0006) [2023-03-07 04:42:09,305][118044] Updated weights for policy 0, policy_version 61130 (0.0006) [2023-03-07 04:42:10,092][118044] Updated weights for policy 0, policy_version 61140 (0.0006) [2023-03-07 04:42:10,868][118044] Updated weights for policy 0, policy_version 61150 (0.0006) [2023-03-07 04:42:11,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13166.2). Total num frames: 62619648. Throughput: 0: 13181.9. Samples: 62586401. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:42:11,096][117718] Avg episode reward: [(0, '2824.276')] [2023-03-07 04:42:11,631][118044] Updated weights for policy 0, policy_version 61160 (0.0006) [2023-03-07 04:42:12,425][118044] Updated weights for policy 0, policy_version 61170 (0.0007) [2023-03-07 04:42:13,210][118044] Updated weights for policy 0, policy_version 61180 (0.0006) [2023-03-07 04:42:13,985][118044] Updated weights for policy 0, policy_version 61190 (0.0006) [2023-03-07 04:42:14,758][118044] Updated weights for policy 0, policy_version 61200 (0.0005) [2023-03-07 04:42:15,550][118044] Updated weights for policy 0, policy_version 61210 (0.0005) [2023-03-07 04:42:16,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13175.4, 300 sec: 13166.2). Total num frames: 62686208. Throughput: 0: 13166.9. Samples: 62665251. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:42:16,096][117718] Avg episode reward: [(0, '2901.453')] [2023-03-07 04:42:16,312][118044] Updated weights for policy 0, policy_version 61220 (0.0006) [2023-03-07 04:42:17,079][118044] Updated weights for policy 0, policy_version 61230 (0.0007) [2023-03-07 04:42:17,874][118044] Updated weights for policy 0, policy_version 61240 (0.0006) [2023-03-07 04:42:18,652][118044] Updated weights for policy 0, policy_version 61250 (0.0005) [2023-03-07 04:42:19,429][118044] Updated weights for policy 0, policy_version 61260 (0.0007) [2023-03-07 04:42:20,221][118044] Updated weights for policy 0, policy_version 61270 (0.0008) [2023-03-07 04:42:21,001][118044] Updated weights for policy 0, policy_version 61280 (0.0006) [2023-03-07 04:42:21,085][117718] Fps is (10 sec: 13209.5, 60 sec: 13175.5, 300 sec: 13166.2). Total num frames: 62751744. Throughput: 0: 13159.4. Samples: 62744041. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:42:21,097][117718] Avg episode reward: [(0, '2862.786')] [2023-03-07 04:42:21,798][118044] Updated weights for policy 0, policy_version 61290 (0.0006) [2023-03-07 04:42:22,564][118044] Updated weights for policy 0, policy_version 61300 (0.0006) [2023-03-07 04:42:23,357][118044] Updated weights for policy 0, policy_version 61310 (0.0006) [2023-03-07 04:42:24,126][118044] Updated weights for policy 0, policy_version 61320 (0.0006) [2023-03-07 04:42:24,886][118044] Updated weights for policy 0, policy_version 61330 (0.0005) [2023-03-07 04:42:25,674][118044] Updated weights for policy 0, policy_version 61340 (0.0006) [2023-03-07 04:42:26,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13175.5, 300 sec: 13162.7). Total num frames: 62817280. Throughput: 0: 13146.9. Samples: 62783311. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:42:26,096][117718] Avg episode reward: [(0, '2903.164')] [2023-03-07 04:42:26,466][118044] Updated weights for policy 0, policy_version 61350 (0.0006) [2023-03-07 04:42:27,240][118044] Updated weights for policy 0, policy_version 61360 (0.0006) [2023-03-07 04:42:28,005][118044] Updated weights for policy 0, policy_version 61370 (0.0006) [2023-03-07 04:42:28,806][118044] Updated weights for policy 0, policy_version 61380 (0.0006) [2023-03-07 04:42:29,593][118044] Updated weights for policy 0, policy_version 61390 (0.0006) [2023-03-07 04:42:30,382][118044] Updated weights for policy 0, policy_version 61400 (0.0006) [2023-03-07 04:42:31,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13162.7). Total num frames: 62882816. Throughput: 0: 13139.4. Samples: 62862127. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:42:31,097][117718] Avg episode reward: [(0, '2869.318')] [2023-03-07 04:42:31,159][118044] Updated weights for policy 0, policy_version 61410 (0.0006) [2023-03-07 04:42:31,934][118044] Updated weights for policy 0, policy_version 61420 (0.0006) [2023-03-07 04:42:32,707][118044] Updated weights for policy 0, policy_version 61430 (0.0007) [2023-03-07 04:42:33,481][118044] Updated weights for policy 0, policy_version 61440 (0.0005) [2023-03-07 04:42:34,268][118044] Updated weights for policy 0, policy_version 61450 (0.0006) [2023-03-07 04:42:35,049][118044] Updated weights for policy 0, policy_version 61460 (0.0006) [2023-03-07 04:42:35,838][118044] Updated weights for policy 0, policy_version 61470 (0.0006) [2023-03-07 04:42:36,086][117718] Fps is (10 sec: 13004.7, 60 sec: 13141.3, 300 sec: 13159.3). Total num frames: 62947328. Throughput: 0: 13134.6. Samples: 62940569. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:42:36,086][117718] Avg episode reward: [(0, '2817.427')] [2023-03-07 04:42:36,091][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000061473_62948352.pth... [2023-03-07 04:42:36,121][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000058387_59788288.pth [2023-03-07 04:42:36,625][118044] Updated weights for policy 0, policy_version 61480 (0.0005) [2023-03-07 04:42:37,387][118044] Updated weights for policy 0, policy_version 61490 (0.0006) [2023-03-07 04:42:38,144][118044] Updated weights for policy 0, policy_version 61500 (0.0007) [2023-03-07 04:42:38,933][118044] Updated weights for policy 0, policy_version 61510 (0.0006) [2023-03-07 04:42:39,692][118044] Updated weights for policy 0, policy_version 61520 (0.0006) [2023-03-07 04:42:40,469][118044] Updated weights for policy 0, policy_version 61530 (0.0006) [2023-03-07 04:42:41,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13162.7). Total num frames: 63013888. Throughput: 0: 13136.9. Samples: 62980276. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:42:41,086][117718] Avg episode reward: [(0, '2771.661')] [2023-03-07 04:42:41,252][118044] Updated weights for policy 0, policy_version 61540 (0.0007) [2023-03-07 04:42:42,026][118044] Updated weights for policy 0, policy_version 61550 (0.0006) [2023-03-07 04:42:42,812][118044] Updated weights for policy 0, policy_version 61560 (0.0006) [2023-03-07 04:42:43,582][118044] Updated weights for policy 0, policy_version 61570 (0.0006) [2023-03-07 04:42:44,363][118044] Updated weights for policy 0, policy_version 61580 (0.0005) [2023-03-07 04:42:45,135][118044] Updated weights for policy 0, policy_version 61590 (0.0007) [2023-03-07 04:42:45,899][118044] Updated weights for policy 0, policy_version 61600 (0.0007) [2023-03-07 04:42:46,086][117718] Fps is (10 sec: 13311.9, 60 sec: 13158.4, 300 sec: 13166.2). Total num frames: 63080448. Throughput: 0: 13138.2. Samples: 63059523. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:42:46,086][117718] Avg episode reward: [(0, '2929.713')] [2023-03-07 04:42:46,688][118044] Updated weights for policy 0, policy_version 61610 (0.0007) [2023-03-07 04:42:47,451][118044] Updated weights for policy 0, policy_version 61620 (0.0006) [2023-03-07 04:42:48,229][118044] Updated weights for policy 0, policy_version 61630 (0.0006) [2023-03-07 04:42:49,004][118044] Updated weights for policy 0, policy_version 61640 (0.0006) [2023-03-07 04:42:49,777][118044] Updated weights for policy 0, policy_version 61650 (0.0006) [2023-03-07 04:42:50,562][118044] Updated weights for policy 0, policy_version 61660 (0.0006) [2023-03-07 04:42:51,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13166.2). Total num frames: 63145984. Throughput: 0: 13149.3. Samples: 63138925. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:42:51,086][117718] Avg episode reward: [(0, '2756.302')] [2023-03-07 04:42:51,341][118044] Updated weights for policy 0, policy_version 61670 (0.0006) [2023-03-07 04:42:52,117][118044] Updated weights for policy 0, policy_version 61680 (0.0006) [2023-03-07 04:42:52,877][118044] Updated weights for policy 0, policy_version 61690 (0.0006) [2023-03-07 04:42:53,667][118044] Updated weights for policy 0, policy_version 61700 (0.0006) [2023-03-07 04:42:54,442][118044] Updated weights for policy 0, policy_version 61710 (0.0006) [2023-03-07 04:42:55,206][118044] Updated weights for policy 0, policy_version 61720 (0.0006) [2023-03-07 04:42:55,981][118044] Updated weights for policy 0, policy_version 61730 (0.0006) [2023-03-07 04:42:56,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13169.7). Total num frames: 63212544. Throughput: 0: 13155.6. Samples: 63178405. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:42:56,086][117718] Avg episode reward: [(0, '2929.995')] [2023-03-07 04:42:56,761][118044] Updated weights for policy 0, policy_version 61740 (0.0006) [2023-03-07 04:42:57,546][118044] Updated weights for policy 0, policy_version 61750 (0.0006) [2023-03-07 04:42:58,303][118044] Updated weights for policy 0, policy_version 61760 (0.0007) [2023-03-07 04:42:59,087][118044] Updated weights for policy 0, policy_version 61770 (0.0006) [2023-03-07 04:42:59,862][118044] Updated weights for policy 0, policy_version 61780 (0.0007) [2023-03-07 04:43:00,648][118044] Updated weights for policy 0, policy_version 61790 (0.0008) [2023-03-07 04:43:01,086][117718] Fps is (10 sec: 13209.4, 60 sec: 13158.4, 300 sec: 13166.2). Total num frames: 63278080. Throughput: 0: 13165.4. Samples: 63257696. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:43:01,086][117718] Avg episode reward: [(0, '2917.721')] [2023-03-07 04:43:01,432][118044] Updated weights for policy 0, policy_version 61800 (0.0006) [2023-03-07 04:43:02,215][118044] Updated weights for policy 0, policy_version 61810 (0.0006) [2023-03-07 04:43:03,011][118044] Updated weights for policy 0, policy_version 61820 (0.0006) [2023-03-07 04:43:03,770][118044] Updated weights for policy 0, policy_version 61830 (0.0006) [2023-03-07 04:43:04,563][118044] Updated weights for policy 0, policy_version 61840 (0.0006) [2023-03-07 04:43:05,333][118044] Updated weights for policy 0, policy_version 61850 (0.0006) [2023-03-07 04:43:06,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13166.2). Total num frames: 63343616. Throughput: 0: 13164.0. Samples: 63336420. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:43:06,086][117718] Avg episode reward: [(0, '2822.968')] [2023-03-07 04:43:06,134][118044] Updated weights for policy 0, policy_version 61860 (0.0005) [2023-03-07 04:43:06,880][118044] Updated weights for policy 0, policy_version 61870 (0.0005) [2023-03-07 04:43:07,649][118044] Updated weights for policy 0, policy_version 61880 (0.0007) [2023-03-07 04:43:08,428][118044] Updated weights for policy 0, policy_version 61890 (0.0006) [2023-03-07 04:43:09,202][118044] Updated weights for policy 0, policy_version 61900 (0.0007) [2023-03-07 04:43:09,984][118044] Updated weights for policy 0, policy_version 61910 (0.0006) [2023-03-07 04:43:10,761][118044] Updated weights for policy 0, policy_version 61920 (0.0007) [2023-03-07 04:43:11,085][117718] Fps is (10 sec: 13209.8, 60 sec: 13175.5, 300 sec: 13169.7). Total num frames: 63410176. Throughput: 0: 13175.4. Samples: 63376203. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:43:11,086][117718] Avg episode reward: [(0, '2899.964')] [2023-03-07 04:43:11,545][118044] Updated weights for policy 0, policy_version 61930 (0.0007) [2023-03-07 04:43:12,318][118044] Updated weights for policy 0, policy_version 61940 (0.0006) [2023-03-07 04:43:13,077][118044] Updated weights for policy 0, policy_version 61950 (0.0006) [2023-03-07 04:43:13,857][118044] Updated weights for policy 0, policy_version 61960 (0.0006) [2023-03-07 04:43:14,644][118044] Updated weights for policy 0, policy_version 61970 (0.0005) [2023-03-07 04:43:15,436][118044] Updated weights for policy 0, policy_version 61980 (0.0006) [2023-03-07 04:43:16,085][117718] Fps is (10 sec: 13209.8, 60 sec: 13158.4, 300 sec: 13169.7). Total num frames: 63475712. Throughput: 0: 13177.8. Samples: 63455126. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:43:16,086][117718] Avg episode reward: [(0, '2872.015')] [2023-03-07 04:43:16,201][118044] Updated weights for policy 0, policy_version 61990 (0.0006) [2023-03-07 04:43:16,989][118044] Updated weights for policy 0, policy_version 62000 (0.0007) [2023-03-07 04:43:17,744][118044] Updated weights for policy 0, policy_version 62010 (0.0006) [2023-03-07 04:43:18,519][118044] Updated weights for policy 0, policy_version 62020 (0.0005) [2023-03-07 04:43:19,304][118044] Updated weights for policy 0, policy_version 62030 (0.0006) [2023-03-07 04:43:20,082][118044] Updated weights for policy 0, policy_version 62040 (0.0006) [2023-03-07 04:43:20,865][118044] Updated weights for policy 0, policy_version 62050 (0.0007) [2023-03-07 04:43:21,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13169.7). Total num frames: 63541248. Throughput: 0: 13194.3. Samples: 63534312. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:43:21,086][117718] Avg episode reward: [(0, '2843.013')] [2023-03-07 04:43:21,634][118044] Updated weights for policy 0, policy_version 62060 (0.0006) [2023-03-07 04:43:22,412][118044] Updated weights for policy 0, policy_version 62070 (0.0005) [2023-03-07 04:43:23,190][118044] Updated weights for policy 0, policy_version 62080 (0.0006) [2023-03-07 04:43:23,966][118044] Updated weights for policy 0, policy_version 62090 (0.0006) [2023-03-07 04:43:24,716][118044] Updated weights for policy 0, policy_version 62100 (0.0007) [2023-03-07 04:43:25,496][118044] Updated weights for policy 0, policy_version 62110 (0.0005) [2023-03-07 04:43:26,086][117718] Fps is (10 sec: 13209.3, 60 sec: 13175.4, 300 sec: 13169.7). Total num frames: 63607808. Throughput: 0: 13193.2. Samples: 63573972. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:43:26,087][117718] Avg episode reward: [(0, '2859.488')] [2023-03-07 04:43:26,282][118044] Updated weights for policy 0, policy_version 62120 (0.0006) [2023-03-07 04:43:27,081][118044] Updated weights for policy 0, policy_version 62130 (0.0007) [2023-03-07 04:43:27,847][118044] Updated weights for policy 0, policy_version 62140 (0.0007) [2023-03-07 04:43:28,636][118044] Updated weights for policy 0, policy_version 62150 (0.0006) [2023-03-07 04:43:29,420][118044] Updated weights for policy 0, policy_version 62160 (0.0006) [2023-03-07 04:43:30,178][118044] Updated weights for policy 0, policy_version 62170 (0.0005) [2023-03-07 04:43:30,946][118044] Updated weights for policy 0, policy_version 62180 (0.0006) [2023-03-07 04:43:31,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13169.7). Total num frames: 63673344. Throughput: 0: 13187.4. Samples: 63652955. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:43:31,086][117718] Avg episode reward: [(0, '2989.663')] [2023-03-07 04:43:31,720][118044] Updated weights for policy 0, policy_version 62190 (0.0006) [2023-03-07 04:43:32,498][118044] Updated weights for policy 0, policy_version 62200 (0.0006) [2023-03-07 04:43:33,273][118044] Updated weights for policy 0, policy_version 62210 (0.0006) [2023-03-07 04:43:34,061][118044] Updated weights for policy 0, policy_version 62220 (0.0007) [2023-03-07 04:43:34,822][118044] Updated weights for policy 0, policy_version 62230 (0.0006) [2023-03-07 04:43:35,613][118044] Updated weights for policy 0, policy_version 62240 (0.0007) [2023-03-07 04:43:36,085][117718] Fps is (10 sec: 13209.8, 60 sec: 13209.6, 300 sec: 13173.2). Total num frames: 63739904. Throughput: 0: 13182.9. Samples: 63732158. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:43:36,086][117718] Avg episode reward: [(0, '2854.745')] [2023-03-07 04:43:36,385][118044] Updated weights for policy 0, policy_version 62250 (0.0006) [2023-03-07 04:43:37,156][118044] Updated weights for policy 0, policy_version 62260 (0.0007) [2023-03-07 04:43:37,941][118044] Updated weights for policy 0, policy_version 62270 (0.0006) [2023-03-07 04:43:38,726][118044] Updated weights for policy 0, policy_version 62280 (0.0007) [2023-03-07 04:43:39,502][118044] Updated weights for policy 0, policy_version 62290 (0.0007) [2023-03-07 04:43:40,286][118044] Updated weights for policy 0, policy_version 62300 (0.0006) [2023-03-07 04:43:41,064][118044] Updated weights for policy 0, policy_version 62310 (0.0006) [2023-03-07 04:43:41,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13173.2). Total num frames: 63805440. Throughput: 0: 13183.4. Samples: 63771659. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:43:41,086][117718] Avg episode reward: [(0, '3007.584')] [2023-03-07 04:43:41,842][118044] Updated weights for policy 0, policy_version 62320 (0.0007) [2023-03-07 04:43:42,632][118044] Updated weights for policy 0, policy_version 62330 (0.0007) [2023-03-07 04:43:43,397][118044] Updated weights for policy 0, policy_version 62340 (0.0006) [2023-03-07 04:43:44,173][118044] Updated weights for policy 0, policy_version 62350 (0.0006) [2023-03-07 04:43:44,957][118044] Updated weights for policy 0, policy_version 62360 (0.0006) [2023-03-07 04:43:45,726][118044] Updated weights for policy 0, policy_version 62370 (0.0006) [2023-03-07 04:43:46,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13175.5, 300 sec: 13173.1). Total num frames: 63870976. Throughput: 0: 13175.2. Samples: 63850581. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:43:46,086][117718] Avg episode reward: [(0, '2916.530')] [2023-03-07 04:43:46,513][118044] Updated weights for policy 0, policy_version 62380 (0.0006) [2023-03-07 04:43:47,292][118044] Updated weights for policy 0, policy_version 62390 (0.0007) [2023-03-07 04:43:48,063][118044] Updated weights for policy 0, policy_version 62400 (0.0007) [2023-03-07 04:43:48,853][118044] Updated weights for policy 0, policy_version 62410 (0.0006) [2023-03-07 04:43:49,631][118044] Updated weights for policy 0, policy_version 62420 (0.0006) [2023-03-07 04:43:50,420][118044] Updated weights for policy 0, policy_version 62430 (0.0006) [2023-03-07 04:43:51,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13175.5, 300 sec: 13169.7). Total num frames: 63936512. Throughput: 0: 13176.7. Samples: 63929369. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:43:51,086][117718] Avg episode reward: [(0, '2881.615')] [2023-03-07 04:43:51,208][118044] Updated weights for policy 0, policy_version 62440 (0.0006) [2023-03-07 04:43:51,972][118044] Updated weights for policy 0, policy_version 62450 (0.0006) [2023-03-07 04:43:52,753][118044] Updated weights for policy 0, policy_version 62460 (0.0006) [2023-03-07 04:43:53,539][118044] Updated weights for policy 0, policy_version 62470 (0.0006) [2023-03-07 04:43:54,323][118044] Updated weights for policy 0, policy_version 62480 (0.0006) [2023-03-07 04:43:55,113][118044] Updated weights for policy 0, policy_version 62490 (0.0006) [2023-03-07 04:43:55,871][118044] Updated weights for policy 0, policy_version 62500 (0.0006) [2023-03-07 04:43:56,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13169.7). Total num frames: 64002048. Throughput: 0: 13165.5. Samples: 63968649. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:43:56,086][117718] Avg episode reward: [(0, '2919.784')] [2023-03-07 04:43:56,654][118044] Updated weights for policy 0, policy_version 62510 (0.0006) [2023-03-07 04:43:57,447][118044] Updated weights for policy 0, policy_version 62520 (0.0006) [2023-03-07 04:43:58,225][118044] Updated weights for policy 0, policy_version 62530 (0.0005) [2023-03-07 04:43:59,011][118044] Updated weights for policy 0, policy_version 62540 (0.0005) [2023-03-07 04:43:59,790][118044] Updated weights for policy 0, policy_version 62550 (0.0006) [2023-03-07 04:44:00,562][118044] Updated weights for policy 0, policy_version 62560 (0.0006) [2023-03-07 04:44:01,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13166.2). Total num frames: 64067584. Throughput: 0: 13160.3. Samples: 64047339. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:44:01,086][117718] Avg episode reward: [(0, '2956.659')] [2023-03-07 04:44:01,325][118044] Updated weights for policy 0, policy_version 62570 (0.0006) [2023-03-07 04:44:02,108][118044] Updated weights for policy 0, policy_version 62580 (0.0006) [2023-03-07 04:44:02,885][118044] Updated weights for policy 0, policy_version 62590 (0.0006) [2023-03-07 04:44:03,662][118044] Updated weights for policy 0, policy_version 62600 (0.0006) [2023-03-07 04:44:04,451][118044] Updated weights for policy 0, policy_version 62610 (0.0006) [2023-03-07 04:44:05,225][118044] Updated weights for policy 0, policy_version 62620 (0.0007) [2023-03-07 04:44:06,021][118044] Updated weights for policy 0, policy_version 62630 (0.0006) [2023-03-07 04:44:06,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13166.2). Total num frames: 64133120. Throughput: 0: 13153.6. Samples: 64126223. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:44:06,086][117718] Avg episode reward: [(0, '3022.639')] [2023-03-07 04:44:06,781][118044] Updated weights for policy 0, policy_version 62640 (0.0007) [2023-03-07 04:44:07,184][117993] KL-divergence is very high: 109.9095 [2023-03-07 04:44:07,235][117993] KL-divergence is very high: 662.2933 [2023-03-07 04:44:07,553][118044] Updated weights for policy 0, policy_version 62650 (0.0006) [2023-03-07 04:44:08,349][118044] Updated weights for policy 0, policy_version 62660 (0.0006) [2023-03-07 04:44:09,127][118044] Updated weights for policy 0, policy_version 62670 (0.0006) [2023-03-07 04:44:09,909][118044] Updated weights for policy 0, policy_version 62680 (0.0006) [2023-03-07 04:44:10,702][118044] Updated weights for policy 0, policy_version 62690 (0.0006) [2023-03-07 04:44:11,085][117718] Fps is (10 sec: 13209.8, 60 sec: 13158.4, 300 sec: 13169.7). Total num frames: 64199680. Throughput: 0: 13148.0. Samples: 64165630. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:44:11,086][117718] Avg episode reward: [(0, '3070.215')] [2023-03-07 04:44:11,477][118044] Updated weights for policy 0, policy_version 62700 (0.0006) [2023-03-07 04:44:12,283][118044] Updated weights for policy 0, policy_version 62710 (0.0006) [2023-03-07 04:44:13,063][118044] Updated weights for policy 0, policy_version 62720 (0.0006) [2023-03-07 04:44:13,832][118044] Updated weights for policy 0, policy_version 62730 (0.0006) [2023-03-07 04:44:14,612][118044] Updated weights for policy 0, policy_version 62740 (0.0005) [2023-03-07 04:44:15,396][118044] Updated weights for policy 0, policy_version 62750 (0.0006) [2023-03-07 04:44:16,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13162.7). Total num frames: 64264192. Throughput: 0: 13136.3. Samples: 64244091. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:44:16,086][117718] Avg episode reward: [(0, '3050.037')] [2023-03-07 04:44:16,166][118044] Updated weights for policy 0, policy_version 62760 (0.0006) [2023-03-07 04:44:16,940][118044] Updated weights for policy 0, policy_version 62770 (0.0006) [2023-03-07 04:44:17,723][118044] Updated weights for policy 0, policy_version 62780 (0.0007) [2023-03-07 04:44:18,496][118044] Updated weights for policy 0, policy_version 62790 (0.0005) [2023-03-07 04:44:19,286][118044] Updated weights for policy 0, policy_version 62800 (0.0007) [2023-03-07 04:44:20,073][118044] Updated weights for policy 0, policy_version 62810 (0.0006) [2023-03-07 04:44:20,835][118044] Updated weights for policy 0, policy_version 62820 (0.0006) [2023-03-07 04:44:21,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13166.2). Total num frames: 64330752. Throughput: 0: 13127.0. Samples: 64322874. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:44:21,086][117718] Avg episode reward: [(0, '2924.385')] [2023-03-07 04:44:21,625][118044] Updated weights for policy 0, policy_version 62830 (0.0006) [2023-03-07 04:44:22,402][118044] Updated weights for policy 0, policy_version 62840 (0.0006) [2023-03-07 04:44:23,188][118044] Updated weights for policy 0, policy_version 62850 (0.0006) [2023-03-07 04:44:23,967][118044] Updated weights for policy 0, policy_version 62860 (0.0006) [2023-03-07 04:44:24,746][118044] Updated weights for policy 0, policy_version 62870 (0.0006) [2023-03-07 04:44:25,507][118044] Updated weights for policy 0, policy_version 62880 (0.0006) [2023-03-07 04:44:26,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.4, 300 sec: 13166.2). Total num frames: 64396288. Throughput: 0: 13126.1. Samples: 64362336. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:44:26,086][117718] Avg episode reward: [(0, '2928.308')] [2023-03-07 04:44:26,283][118044] Updated weights for policy 0, policy_version 62890 (0.0006) [2023-03-07 04:44:27,069][118044] Updated weights for policy 0, policy_version 62900 (0.0007) [2023-03-07 04:44:27,838][118044] Updated weights for policy 0, policy_version 62910 (0.0006) [2023-03-07 04:44:28,627][118044] Updated weights for policy 0, policy_version 62920 (0.0006) [2023-03-07 04:44:29,397][118044] Updated weights for policy 0, policy_version 62930 (0.0006) [2023-03-07 04:44:30,158][118044] Updated weights for policy 0, policy_version 62940 (0.0005) [2023-03-07 04:44:30,940][118044] Updated weights for policy 0, policy_version 62950 (0.0006) [2023-03-07 04:44:31,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13166.2). Total num frames: 64461824. Throughput: 0: 13130.7. Samples: 64441461. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:44:31,086][117718] Avg episode reward: [(0, '2820.464')] [2023-03-07 04:44:31,724][118044] Updated weights for policy 0, policy_version 62960 (0.0005) [2023-03-07 04:44:32,498][118044] Updated weights for policy 0, policy_version 62970 (0.0006) [2023-03-07 04:44:33,269][118044] Updated weights for policy 0, policy_version 62980 (0.0006) [2023-03-07 04:44:34,065][118044] Updated weights for policy 0, policy_version 62990 (0.0006) [2023-03-07 04:44:34,834][118044] Updated weights for policy 0, policy_version 63000 (0.0006) [2023-03-07 04:44:35,613][118044] Updated weights for policy 0, policy_version 63010 (0.0006) [2023-03-07 04:44:36,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13169.7). Total num frames: 64528384. Throughput: 0: 13138.5. Samples: 64520600. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:44:36,086][117718] Avg episode reward: [(0, '2926.866')] [2023-03-07 04:44:36,091][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000063016_64528384.pth... 
[2023-03-07 04:44:36,121][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000059930_61368320.pth [2023-03-07 04:44:36,400][118044] Updated weights for policy 0, policy_version 63020 (0.0006) [2023-03-07 04:44:37,174][118044] Updated weights for policy 0, policy_version 63030 (0.0007) [2023-03-07 04:44:37,961][118044] Updated weights for policy 0, policy_version 63040 (0.0006) [2023-03-07 04:44:38,744][118044] Updated weights for policy 0, policy_version 63050 (0.0006) [2023-03-07 04:44:39,528][118044] Updated weights for policy 0, policy_version 63060 (0.0006) [2023-03-07 04:44:40,309][118044] Updated weights for policy 0, policy_version 63070 (0.0006) [2023-03-07 04:44:41,077][118044] Updated weights for policy 0, policy_version 63080 (0.0006) [2023-03-07 04:44:41,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13166.2). Total num frames: 64593920. Throughput: 0: 13139.3. Samples: 64559916. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:44:41,086][117718] Avg episode reward: [(0, '2901.275')] [2023-03-07 04:44:41,864][118044] Updated weights for policy 0, policy_version 63090 (0.0006) [2023-03-07 04:44:42,650][118044] Updated weights for policy 0, policy_version 63100 (0.0007) [2023-03-07 04:44:43,432][118044] Updated weights for policy 0, policy_version 63110 (0.0007) [2023-03-07 04:44:44,206][118044] Updated weights for policy 0, policy_version 63120 (0.0006) [2023-03-07 04:44:44,981][118044] Updated weights for policy 0, policy_version 63130 (0.0006) [2023-03-07 04:44:45,760][118044] Updated weights for policy 0, policy_version 63140 (0.0007) [2023-03-07 04:44:46,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13141.3, 300 sec: 13166.2). Total num frames: 64659456. Throughput: 0: 13136.2. Samples: 64638469. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:44:46,087][117718] Avg episode reward: [(0, '3013.441')] [2023-03-07 04:44:46,546][118044] Updated weights for policy 0, policy_version 63150 (0.0006) [2023-03-07 04:44:47,324][118044] Updated weights for policy 0, policy_version 63160 (0.0006) [2023-03-07 04:44:48,096][118044] Updated weights for policy 0, policy_version 63170 (0.0007) [2023-03-07 04:44:48,859][118044] Updated weights for policy 0, policy_version 63180 (0.0006) [2023-03-07 04:44:49,640][118044] Updated weights for policy 0, policy_version 63190 (0.0006) [2023-03-07 04:44:50,425][118044] Updated weights for policy 0, policy_version 63200 (0.0006) [2023-03-07 04:44:51,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13166.2). Total num frames: 64724992. Throughput: 0: 13141.9. Samples: 64717607. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:44:51,086][117718] Avg episode reward: [(0, '2995.119')] [2023-03-07 04:44:51,198][118044] Updated weights for policy 0, policy_version 63210 (0.0006) [2023-03-07 04:44:51,990][118044] Updated weights for policy 0, policy_version 63220 (0.0007) [2023-03-07 04:44:52,751][118044] Updated weights for policy 0, policy_version 63230 (0.0006) [2023-03-07 04:44:53,534][118044] Updated weights for policy 0, policy_version 63240 (0.0006) [2023-03-07 04:44:54,297][118044] Updated weights for policy 0, policy_version 63250 (0.0006) [2023-03-07 04:44:55,078][118044] Updated weights for policy 0, policy_version 63260 (0.0006) [2023-03-07 04:44:55,858][118044] Updated weights for policy 0, policy_version 63270 (0.0007) [2023-03-07 04:44:56,085][117718] Fps is (10 sec: 13209.8, 60 sec: 13158.4, 300 sec: 13166.2). Total num frames: 64791552. Throughput: 0: 13146.1. Samples: 64757206. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:44:56,086][117718] Avg episode reward: [(0, '2943.743')] [2023-03-07 04:44:56,639][118044] Updated weights for policy 0, policy_version 63280 (0.0006) [2023-03-07 04:44:57,438][118044] Updated weights for policy 0, policy_version 63290 (0.0006) [2023-03-07 04:44:58,212][118044] Updated weights for policy 0, policy_version 63300 (0.0006) [2023-03-07 04:44:58,997][118044] Updated weights for policy 0, policy_version 63310 (0.0006) [2023-03-07 04:44:59,768][118044] Updated weights for policy 0, policy_version 63320 (0.0006) [2023-03-07 04:45:00,540][118044] Updated weights for policy 0, policy_version 63330 (0.0006) [2023-03-07 04:45:01,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13166.2). Total num frames: 64857088. Throughput: 0: 13151.6. Samples: 64835910. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:45:01,086][117718] Avg episode reward: [(0, '3083.951')] [2023-03-07 04:45:01,319][118044] Updated weights for policy 0, policy_version 63340 (0.0006) [2023-03-07 04:45:02,097][118044] Updated weights for policy 0, policy_version 63350 (0.0006) [2023-03-07 04:45:02,856][118044] Updated weights for policy 0, policy_version 63360 (0.0005) [2023-03-07 04:45:03,634][118044] Updated weights for policy 0, policy_version 63370 (0.0006) [2023-03-07 04:45:04,406][118044] Updated weights for policy 0, policy_version 63380 (0.0006) [2023-03-07 04:45:05,183][118044] Updated weights for policy 0, policy_version 63390 (0.0006) [2023-03-07 04:45:05,970][118044] Updated weights for policy 0, policy_version 63400 (0.0006) [2023-03-07 04:45:06,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13162.7). Total num frames: 64922624. Throughput: 0: 13164.4. Samples: 64915273. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:45:06,086][117718] Avg episode reward: [(0, '2949.154')] [2023-03-07 04:45:06,750][118044] Updated weights for policy 0, policy_version 63410 (0.0006) [2023-03-07 04:45:07,559][118044] Updated weights for policy 0, policy_version 63420 (0.0006) [2023-03-07 04:45:08,336][118044] Updated weights for policy 0, policy_version 63430 (0.0007) [2023-03-07 04:45:09,092][118044] Updated weights for policy 0, policy_version 63440 (0.0006) [2023-03-07 04:45:09,886][118044] Updated weights for policy 0, policy_version 63450 (0.0006) [2023-03-07 04:45:10,645][118044] Updated weights for policy 0, policy_version 63460 (0.0006) [2023-03-07 04:45:11,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13141.3, 300 sec: 13162.7). Total num frames: 64988160. Throughput: 0: 13158.3. Samples: 64954459. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:45:11,086][117718] Avg episode reward: [(0, '3016.983')] [2023-03-07 04:45:11,407][118044] Updated weights for policy 0, policy_version 63470 (0.0006) [2023-03-07 04:45:12,205][118044] Updated weights for policy 0, policy_version 63480 (0.0006) [2023-03-07 04:45:12,977][118044] Updated weights for policy 0, policy_version 63490 (0.0006) [2023-03-07 04:45:13,738][118044] Updated weights for policy 0, policy_version 63500 (0.0006) [2023-03-07 04:45:14,531][118044] Updated weights for policy 0, policy_version 63510 (0.0006) [2023-03-07 04:45:15,312][118044] Updated weights for policy 0, policy_version 63520 (0.0005) [2023-03-07 04:45:16,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13162.7). Total num frames: 65053696. Throughput: 0: 13156.0. Samples: 65033485. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:45:16,086][117718] Avg episode reward: [(0, '2892.786')] [2023-03-07 04:45:16,103][118044] Updated weights for policy 0, policy_version 63530 (0.0006) [2023-03-07 04:45:16,882][118044] Updated weights for policy 0, policy_version 63540 (0.0007) [2023-03-07 04:45:17,650][118044] Updated weights for policy 0, policy_version 63550 (0.0006) [2023-03-07 04:45:18,424][118044] Updated weights for policy 0, policy_version 63560 (0.0005) [2023-03-07 04:45:19,195][118044] Updated weights for policy 0, policy_version 63570 (0.0006) [2023-03-07 04:45:19,960][118044] Updated weights for policy 0, policy_version 63580 (0.0007) [2023-03-07 04:45:20,742][118044] Updated weights for policy 0, policy_version 63590 (0.0007) [2023-03-07 04:45:21,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13162.7). Total num frames: 65120256. Throughput: 0: 13157.4. Samples: 65112681. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:45:21,086][117718] Avg episode reward: [(0, '2837.146')] [2023-03-07 04:45:21,524][118044] Updated weights for policy 0, policy_version 63600 (0.0006) [2023-03-07 04:45:22,323][118044] Updated weights for policy 0, policy_version 63610 (0.0005) [2023-03-07 04:45:23,108][118044] Updated weights for policy 0, policy_version 63620 (0.0006) [2023-03-07 04:45:23,882][118044] Updated weights for policy 0, policy_version 63630 (0.0006) [2023-03-07 04:45:24,660][118044] Updated weights for policy 0, policy_version 63640 (0.0006) [2023-03-07 04:45:25,446][118044] Updated weights for policy 0, policy_version 63650 (0.0006) [2023-03-07 04:45:26,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13162.7). Total num frames: 65185792. Throughput: 0: 13154.1. Samples: 65151852. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:45:26,086][117718] Avg episode reward: [(0, '3015.801')] [2023-03-07 04:45:26,205][118044] Updated weights for policy 0, policy_version 63660 (0.0006) [2023-03-07 04:45:26,998][118044] Updated weights for policy 0, policy_version 63670 (0.0005) [2023-03-07 04:45:27,769][118044] Updated weights for policy 0, policy_version 63680 (0.0006) [2023-03-07 04:45:28,535][118044] Updated weights for policy 0, policy_version 63690 (0.0006) [2023-03-07 04:45:29,316][118044] Updated weights for policy 0, policy_version 63700 (0.0007) [2023-03-07 04:45:30,077][118044] Updated weights for policy 0, policy_version 63710 (0.0006) [2023-03-07 04:45:30,853][118044] Updated weights for policy 0, policy_version 63720 (0.0006) [2023-03-07 04:45:31,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13166.2). Total num frames: 65252352. Throughput: 0: 13172.2. Samples: 65231217. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:45:31,086][117718] Avg episode reward: [(0, '2986.670')] [2023-03-07 04:45:31,630][118044] Updated weights for policy 0, policy_version 63730 (0.0006) [2023-03-07 04:45:32,423][118044] Updated weights for policy 0, policy_version 63740 (0.0006) [2023-03-07 04:45:33,194][118044] Updated weights for policy 0, policy_version 63750 (0.0006) [2023-03-07 04:45:33,961][118044] Updated weights for policy 0, policy_version 63760 (0.0006) [2023-03-07 04:45:34,752][118044] Updated weights for policy 0, policy_version 63770 (0.0006) [2023-03-07 04:45:35,516][118044] Updated weights for policy 0, policy_version 63780 (0.0007) [2023-03-07 04:45:36,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13166.2). Total num frames: 65317888. Throughput: 0: 13171.3. Samples: 65310316. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:45:36,086][117718] Avg episode reward: [(0, '2953.301')] [2023-03-07 04:45:36,293][118044] Updated weights for policy 0, policy_version 63790 (0.0007) [2023-03-07 04:45:37,072][118044] Updated weights for policy 0, policy_version 63800 (0.0006) [2023-03-07 04:45:37,848][118044] Updated weights for policy 0, policy_version 63810 (0.0006) [2023-03-07 04:45:38,642][118044] Updated weights for policy 0, policy_version 63820 (0.0006) [2023-03-07 04:45:39,413][118044] Updated weights for policy 0, policy_version 63830 (0.0006) [2023-03-07 04:45:40,197][118044] Updated weights for policy 0, policy_version 63840 (0.0006) [2023-03-07 04:45:40,990][118044] Updated weights for policy 0, policy_version 63850 (0.0006) [2023-03-07 04:45:41,086][117718] Fps is (10 sec: 13004.7, 60 sec: 13141.3, 300 sec: 13159.3). Total num frames: 65382400. Throughput: 0: 13168.2. Samples: 65349776. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:45:41,086][117718] Avg episode reward: [(0, '2967.805')] [2023-03-07 04:45:41,768][118044] Updated weights for policy 0, policy_version 63860 (0.0007) [2023-03-07 04:45:42,530][118044] Updated weights for policy 0, policy_version 63870 (0.0006) [2023-03-07 04:45:43,308][118044] Updated weights for policy 0, policy_version 63880 (0.0007) [2023-03-07 04:45:44,081][118044] Updated weights for policy 0, policy_version 63890 (0.0006) [2023-03-07 04:45:44,854][118044] Updated weights for policy 0, policy_version 63900 (0.0006) [2023-03-07 04:45:45,628][118044] Updated weights for policy 0, policy_version 63910 (0.0006) [2023-03-07 04:45:46,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13162.7). Total num frames: 65448960. Throughput: 0: 13171.8. Samples: 65428643. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:45:46,086][117718] Avg episode reward: [(0, '3066.740')] [2023-03-07 04:45:46,404][118044] Updated weights for policy 0, policy_version 63920 (0.0006) [2023-03-07 04:45:47,181][118044] Updated weights for policy 0, policy_version 63930 (0.0006) [2023-03-07 04:45:47,971][118044] Updated weights for policy 0, policy_version 63940 (0.0007) [2023-03-07 04:45:48,736][118044] Updated weights for policy 0, policy_version 63950 (0.0006) [2023-03-07 04:45:49,533][118044] Updated weights for policy 0, policy_version 63960 (0.0007) [2023-03-07 04:45:50,301][118044] Updated weights for policy 0, policy_version 63970 (0.0007) [2023-03-07 04:45:51,069][118044] Updated weights for policy 0, policy_version 63980 (0.0007) [2023-03-07 04:45:51,085][117718] Fps is (10 sec: 13312.1, 60 sec: 13175.5, 300 sec: 13162.7). Total num frames: 65515520. Throughput: 0: 13168.1. Samples: 65507837. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 04:45:51,086][117718] Avg episode reward: [(0, '2867.367')] [2023-03-07 04:45:51,861][118044] Updated weights for policy 0, policy_version 63990 (0.0006) [2023-03-07 04:45:52,636][118044] Updated weights for policy 0, policy_version 64000 (0.0006) [2023-03-07 04:45:53,426][118044] Updated weights for policy 0, policy_version 64010 (0.0006) [2023-03-07 04:45:54,203][118044] Updated weights for policy 0, policy_version 64020 (0.0008) [2023-03-07 04:45:54,984][118044] Updated weights for policy 0, policy_version 64030 (0.0006) [2023-03-07 04:45:55,747][118044] Updated weights for policy 0, policy_version 64040 (0.0006) [2023-03-07 04:45:56,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13162.7). Total num frames: 65581056. Throughput: 0: 13171.7. Samples: 65547185. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 04:45:56,086][117718] Avg episode reward: [(0, '2901.295')] [2023-03-07 04:45:56,541][118044] Updated weights for policy 0, policy_version 64050 (0.0006) [2023-03-07 04:45:57,330][118044] Updated weights for policy 0, policy_version 64060 (0.0006) [2023-03-07 04:45:58,123][118044] Updated weights for policy 0, policy_version 64070 (0.0007) [2023-03-07 04:45:58,898][118044] Updated weights for policy 0, policy_version 64080 (0.0006) [2023-03-07 04:45:59,685][118044] Updated weights for policy 0, policy_version 64090 (0.0006) [2023-03-07 04:46:00,477][118044] Updated weights for policy 0, policy_version 64100 (0.0006) [2023-03-07 04:46:01,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13162.7). Total num frames: 65646592. Throughput: 0: 13156.2. Samples: 65625513. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 04:46:01,086][117718] Avg episode reward: [(0, '2851.532')] [2023-03-07 04:46:01,239][118044] Updated weights for policy 0, policy_version 64110 (0.0006) [2023-03-07 04:46:02,021][118044] Updated weights for policy 0, policy_version 64120 (0.0007) [2023-03-07 04:46:02,799][118044] Updated weights for policy 0, policy_version 64130 (0.0007) [2023-03-07 04:46:03,580][118044] Updated weights for policy 0, policy_version 64140 (0.0006) [2023-03-07 04:46:04,344][118044] Updated weights for policy 0, policy_version 64150 (0.0006) [2023-03-07 04:46:05,125][118044] Updated weights for policy 0, policy_version 64160 (0.0006) [2023-03-07 04:46:05,917][118044] Updated weights for policy 0, policy_version 64170 (0.0006) [2023-03-07 04:46:06,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13159.3). Total num frames: 65712128. Throughput: 0: 13151.4. Samples: 65704496. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 04:46:06,086][117718] Avg episode reward: [(0, '2802.703')] [2023-03-07 04:46:06,669][118044] Updated weights for policy 0, policy_version 64180 (0.0006) [2023-03-07 04:46:07,470][118044] Updated weights for policy 0, policy_version 64190 (0.0007) [2023-03-07 04:46:08,239][118044] Updated weights for policy 0, policy_version 64200 (0.0007) [2023-03-07 04:46:09,024][118044] Updated weights for policy 0, policy_version 64210 (0.0005) [2023-03-07 04:46:09,788][118044] Updated weights for policy 0, policy_version 64220 (0.0006) [2023-03-07 04:46:10,574][118044] Updated weights for policy 0, policy_version 64230 (0.0006) [2023-03-07 04:46:11,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13159.3). Total num frames: 65777664. Throughput: 0: 13159.6. Samples: 65744035. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 04:46:11,086][117718] Avg episode reward: [(0, '2971.328')] [2023-03-07 04:46:11,351][118044] Updated weights for policy 0, policy_version 64240 (0.0006) [2023-03-07 04:46:12,146][118044] Updated weights for policy 0, policy_version 64250 (0.0006) [2023-03-07 04:46:12,908][118044] Updated weights for policy 0, policy_version 64260 (0.0006) [2023-03-07 04:46:13,692][118044] Updated weights for policy 0, policy_version 64270 (0.0006) [2023-03-07 04:46:14,472][118044] Updated weights for policy 0, policy_version 64280 (0.0006) [2023-03-07 04:46:15,239][118044] Updated weights for policy 0, policy_version 64290 (0.0005) [2023-03-07 04:46:16,014][118044] Updated weights for policy 0, policy_version 64300 (0.0006) [2023-03-07 04:46:16,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13159.3). Total num frames: 65843200. Throughput: 0: 13149.3. Samples: 65822937. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 04:46:16,086][117718] Avg episode reward: [(0, '2837.669')] [2023-03-07 04:46:16,797][118044] Updated weights for policy 0, policy_version 64310 (0.0005) [2023-03-07 04:46:17,592][118044] Updated weights for policy 0, policy_version 64320 (0.0006) [2023-03-07 04:46:18,373][118044] Updated weights for policy 0, policy_version 64330 (0.0006) [2023-03-07 04:46:19,139][118044] Updated weights for policy 0, policy_version 64340 (0.0006) [2023-03-07 04:46:19,921][118044] Updated weights for policy 0, policy_version 64350 (0.0006) [2023-03-07 04:46:20,709][118044] Updated weights for policy 0, policy_version 64360 (0.0006) [2023-03-07 04:46:21,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13162.7). Total num frames: 65909760. Throughput: 0: 13146.2. Samples: 65901895. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 04:46:21,086][117718] Avg episode reward: [(0, '2817.419')] [2023-03-07 04:46:21,469][118044] Updated weights for policy 0, policy_version 64370 (0.0006) [2023-03-07 04:46:22,241][118044] Updated weights for policy 0, policy_version 64380 (0.0006) [2023-03-07 04:46:23,022][118044] Updated weights for policy 0, policy_version 64390 (0.0006) [2023-03-07 04:46:23,810][118044] Updated weights for policy 0, policy_version 64400 (0.0006) [2023-03-07 04:46:24,584][118044] Updated weights for policy 0, policy_version 64410 (0.0007) [2023-03-07 04:46:25,367][118044] Updated weights for policy 0, policy_version 64420 (0.0006) [2023-03-07 04:46:26,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13159.3). Total num frames: 65975296. Throughput: 0: 13146.7. Samples: 65941374. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 04:46:26,086][117718] Avg episode reward: [(0, '2951.076')] [2023-03-07 04:46:26,158][118044] Updated weights for policy 0, policy_version 64430 (0.0007) [2023-03-07 04:46:26,926][118044] Updated weights for policy 0, policy_version 64440 (0.0006) [2023-03-07 04:46:27,689][118044] Updated weights for policy 0, policy_version 64450 (0.0006) [2023-03-07 04:46:28,474][118044] Updated weights for policy 0, policy_version 64460 (0.0006) [2023-03-07 04:46:29,250][118044] Updated weights for policy 0, policy_version 64470 (0.0007) [2023-03-07 04:46:30,037][118044] Updated weights for policy 0, policy_version 64480 (0.0007) [2023-03-07 04:46:30,815][118044] Updated weights for policy 0, policy_version 64490 (0.0006) [2023-03-07 04:46:31,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13159.3). Total num frames: 66040832. Throughput: 0: 13148.5. Samples: 66020324. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:46:31,086][117718] Avg episode reward: [(0, '2849.552')] [2023-03-07 04:46:31,599][118044] Updated weights for policy 0, policy_version 64500 (0.0006) [2023-03-07 04:46:32,367][118044] Updated weights for policy 0, policy_version 64510 (0.0006) [2023-03-07 04:46:33,157][118044] Updated weights for policy 0, policy_version 64520 (0.0006) [2023-03-07 04:46:33,927][118044] Updated weights for policy 0, policy_version 64530 (0.0006) [2023-03-07 04:46:34,715][118044] Updated weights for policy 0, policy_version 64540 (0.0006) [2023-03-07 04:46:35,490][118044] Updated weights for policy 0, policy_version 64550 (0.0006) [2023-03-07 04:46:36,086][117718] Fps is (10 sec: 13106.9, 60 sec: 13141.3, 300 sec: 13155.8). Total num frames: 66106368. Throughput: 0: 13143.0. Samples: 66099273. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:46:36,086][117718] Avg episode reward: [(0, '2789.036')] [2023-03-07 04:46:36,091][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000064557_66106368.pth... [2023-03-07 04:46:36,124][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000061473_62948352.pth [2023-03-07 04:46:36,276][118044] Updated weights for policy 0, policy_version 64560 (0.0007) [2023-03-07 04:46:37,049][118044] Updated weights for policy 0, policy_version 64570 (0.0007) [2023-03-07 04:46:37,820][118044] Updated weights for policy 0, policy_version 64580 (0.0006) [2023-03-07 04:46:38,608][118044] Updated weights for policy 0, policy_version 64590 (0.0006) [2023-03-07 04:46:39,390][118044] Updated weights for policy 0, policy_version 64600 (0.0006) [2023-03-07 04:46:40,153][118044] Updated weights for policy 0, policy_version 64610 (0.0006) [2023-03-07 04:46:40,930][118044] Updated weights for policy 0, policy_version 64620 (0.0006) [2023-03-07 04:46:41,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13159.3). Total num frames: 66172928. Throughput: 0: 13144.8. Samples: 66138700. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:46:41,086][117718] Avg episode reward: [(0, '2953.839')] [2023-03-07 04:46:41,713][118044] Updated weights for policy 0, policy_version 64630 (0.0007) [2023-03-07 04:46:42,484][118044] Updated weights for policy 0, policy_version 64640 (0.0005) [2023-03-07 04:46:43,276][118044] Updated weights for policy 0, policy_version 64650 (0.0007) [2023-03-07 04:46:44,051][118044] Updated weights for policy 0, policy_version 64660 (0.0006) [2023-03-07 04:46:44,837][118044] Updated weights for policy 0, policy_version 64670 (0.0006) [2023-03-07 04:46:45,613][118044] Updated weights for policy 0, policy_version 64680 (0.0008) [2023-03-07 04:46:46,085][117718] Fps is (10 sec: 13209.8, 60 sec: 13158.4, 300 sec: 13159.3). Total num frames: 66238464. Throughput: 0: 13160.1. Samples: 66217718. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:46:46,086][117718] Avg episode reward: [(0, '2961.082')] [2023-03-07 04:46:46,388][118044] Updated weights for policy 0, policy_version 64690 (0.0006) [2023-03-07 04:46:47,147][118044] Updated weights for policy 0, policy_version 64700 (0.0006) [2023-03-07 04:46:47,938][118044] Updated weights for policy 0, policy_version 64710 (0.0007) [2023-03-07 04:46:48,715][118044] Updated weights for policy 0, policy_version 64720 (0.0005) [2023-03-07 04:46:49,482][118044] Updated weights for policy 0, policy_version 64730 (0.0007) [2023-03-07 04:46:50,256][118044] Updated weights for policy 0, policy_version 64740 (0.0008) [2023-03-07 04:46:51,035][118044] Updated weights for policy 0, policy_version 64750 (0.0006) [2023-03-07 04:46:51,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13155.8). Total num frames: 66304000. Throughput: 0: 13159.6. Samples: 66296676. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:46:51,086][117718] Avg episode reward: [(0, '3043.051')] [2023-03-07 04:46:51,823][118044] Updated weights for policy 0, policy_version 64760 (0.0006) [2023-03-07 04:46:52,614][118044] Updated weights for policy 0, policy_version 64770 (0.0006) [2023-03-07 04:46:53,404][118044] Updated weights for policy 0, policy_version 64780 (0.0006) [2023-03-07 04:46:54,183][118044] Updated weights for policy 0, policy_version 64790 (0.0006) [2023-03-07 04:46:54,960][118044] Updated weights for policy 0, policy_version 64800 (0.0006) [2023-03-07 04:46:55,729][118044] Updated weights for policy 0, policy_version 64810 (0.0007) [2023-03-07 04:46:56,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13155.8). Total num frames: 66369536. Throughput: 0: 13153.9. Samples: 66335958. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:46:56,086][117718] Avg episode reward: [(0, '2981.410')] [2023-03-07 04:46:56,524][118044] Updated weights for policy 0, policy_version 64820 (0.0006) [2023-03-07 04:46:57,305][118044] Updated weights for policy 0, policy_version 64830 (0.0006) [2023-03-07 04:46:58,092][118044] Updated weights for policy 0, policy_version 64840 (0.0006) [2023-03-07 04:46:58,863][118044] Updated weights for policy 0, policy_version 64850 (0.0006) [2023-03-07 04:46:59,640][118044] Updated weights for policy 0, policy_version 64860 (0.0006) [2023-03-07 04:47:00,410][118044] Updated weights for policy 0, policy_version 64870 (0.0006) [2023-03-07 04:47:01,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13141.3, 300 sec: 13155.8). Total num frames: 66435072. Throughput: 0: 13147.1. Samples: 66414559. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:47:01,086][117718] Avg episode reward: [(0, '2932.352')] [2023-03-07 04:47:01,202][118044] Updated weights for policy 0, policy_version 64880 (0.0006) [2023-03-07 04:47:01,991][118044] Updated weights for policy 0, policy_version 64890 (0.0007) [2023-03-07 04:47:02,758][118044] Updated weights for policy 0, policy_version 64900 (0.0007) [2023-03-07 04:47:03,539][118044] Updated weights for policy 0, policy_version 64910 (0.0006) [2023-03-07 04:47:04,315][118044] Updated weights for policy 0, policy_version 64920 (0.0006) [2023-03-07 04:47:05,080][118044] Updated weights for policy 0, policy_version 64930 (0.0006) [2023-03-07 04:47:05,875][118044] Updated weights for policy 0, policy_version 64940 (0.0006) [2023-03-07 04:47:06,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.4, 300 sec: 13155.8). Total num frames: 66500608. Throughput: 0: 13143.7. Samples: 66493363. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:47:06,086][117718] Avg episode reward: [(0, '2904.250')] [2023-03-07 04:47:06,663][118044] Updated weights for policy 0, policy_version 64950 (0.0007) [2023-03-07 04:47:07,467][118044] Updated weights for policy 0, policy_version 64960 (0.0006) [2023-03-07 04:47:08,261][118044] Updated weights for policy 0, policy_version 64970 (0.0006) [2023-03-07 04:47:09,036][118044] Updated weights for policy 0, policy_version 64980 (0.0007) [2023-03-07 04:47:09,830][118044] Updated weights for policy 0, policy_version 64990 (0.0006) [2023-03-07 04:47:10,619][118044] Updated weights for policy 0, policy_version 65000 (0.0007) [2023-03-07 04:47:11,086][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13152.3). Total num frames: 66566144. Throughput: 0: 13135.4. Samples: 66532468. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:47:11,086][117718] Avg episode reward: [(0, '2889.463')] [2023-03-07 04:47:11,391][118044] Updated weights for policy 0, policy_version 65010 (0.0006) [2023-03-07 04:47:12,158][118044] Updated weights for policy 0, policy_version 65020 (0.0007) [2023-03-07 04:47:12,951][118044] Updated weights for policy 0, policy_version 65030 (0.0006) [2023-03-07 04:47:13,730][118044] Updated weights for policy 0, policy_version 65040 (0.0007) [2023-03-07 04:47:14,518][118044] Updated weights for policy 0, policy_version 65050 (0.0006) [2023-03-07 04:47:15,284][118044] Updated weights for policy 0, policy_version 65060 (0.0006) [2023-03-07 04:47:16,065][118044] Updated weights for policy 0, policy_version 65070 (0.0006) [2023-03-07 04:47:16,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13152.3). Total num frames: 66631680. Throughput: 0: 13125.9. Samples: 66610989. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:47:16,086][117718] Avg episode reward: [(0, '3035.164')] [2023-03-07 04:47:16,848][118044] Updated weights for policy 0, policy_version 65080 (0.0006) [2023-03-07 04:47:17,628][118044] Updated weights for policy 0, policy_version 65090 (0.0006) [2023-03-07 04:47:18,405][118044] Updated weights for policy 0, policy_version 65100 (0.0006) [2023-03-07 04:47:19,182][118044] Updated weights for policy 0, policy_version 65110 (0.0005) [2023-03-07 04:47:19,964][118044] Updated weights for policy 0, policy_version 65120 (0.0006) [2023-03-07 04:47:20,750][118044] Updated weights for policy 0, policy_version 65130 (0.0006) [2023-03-07 04:47:21,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13152.3). Total num frames: 66697216. Throughput: 0: 13119.3. Samples: 66689641. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:47:21,086][117718] Avg episode reward: [(0, '2961.338')] [2023-03-07 04:47:21,544][118044] Updated weights for policy 0, policy_version 65140 (0.0006) [2023-03-07 04:47:22,309][118044] Updated weights for policy 0, policy_version 65150 (0.0006) [2023-03-07 04:47:23,105][118044] Updated weights for policy 0, policy_version 65160 (0.0006) [2023-03-07 04:47:23,880][118044] Updated weights for policy 0, policy_version 65170 (0.0006) [2023-03-07 04:47:24,654][118044] Updated weights for policy 0, policy_version 65180 (0.0006) [2023-03-07 04:47:25,421][118044] Updated weights for policy 0, policy_version 65190 (0.0007) [2023-03-07 04:47:26,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13124.2, 300 sec: 13152.3). Total num frames: 66762752. Throughput: 0: 13116.1. Samples: 66728925. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:47:26,086][117718] Avg episode reward: [(0, '2958.425')] [2023-03-07 04:47:26,184][118044] Updated weights for policy 0, policy_version 65200 (0.0006) [2023-03-07 04:47:26,973][118044] Updated weights for policy 0, policy_version 65210 (0.0006) [2023-03-07 04:47:27,751][118044] Updated weights for policy 0, policy_version 65220 (0.0006) [2023-03-07 04:47:28,537][118044] Updated weights for policy 0, policy_version 65230 (0.0006) [2023-03-07 04:47:29,312][118044] Updated weights for policy 0, policy_version 65240 (0.0007) [2023-03-07 04:47:30,094][118044] Updated weights for policy 0, policy_version 65250 (0.0006) [2023-03-07 04:47:30,870][118044] Updated weights for policy 0, policy_version 65260 (0.0006) [2023-03-07 04:47:31,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13124.3, 300 sec: 13155.8). Total num frames: 66828288. Throughput: 0: 13119.5. Samples: 66808097. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:47:31,086][117718] Avg episode reward: [(0, '2956.683')] [2023-03-07 04:47:31,652][118044] Updated weights for policy 0, policy_version 65270 (0.0007) [2023-03-07 04:47:32,428][118044] Updated weights for policy 0, policy_version 65280 (0.0006) [2023-03-07 04:47:33,220][118044] Updated weights for policy 0, policy_version 65290 (0.0006) [2023-03-07 04:47:34,001][118044] Updated weights for policy 0, policy_version 65300 (0.0006) [2023-03-07 04:47:34,780][118044] Updated weights for policy 0, policy_version 65310 (0.0007) [2023-03-07 04:47:35,561][118044] Updated weights for policy 0, policy_version 65320 (0.0006) [2023-03-07 04:47:36,085][117718] Fps is (10 sec: 13107.4, 60 sec: 13124.3, 300 sec: 13152.3). Total num frames: 66893824. Throughput: 0: 13113.3. Samples: 66886776. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:47:36,086][117718] Avg episode reward: [(0, '2923.494')] [2023-03-07 04:47:36,326][118044] Updated weights for policy 0, policy_version 65330 (0.0006) [2023-03-07 04:47:37,107][118044] Updated weights for policy 0, policy_version 65340 (0.0007) [2023-03-07 04:47:37,897][118044] Updated weights for policy 0, policy_version 65350 (0.0006) [2023-03-07 04:47:38,686][118044] Updated weights for policy 0, policy_version 65360 (0.0007) [2023-03-07 04:47:39,465][118044] Updated weights for policy 0, policy_version 65370 (0.0006) [2023-03-07 04:47:40,230][118044] Updated weights for policy 0, policy_version 65380 (0.0006) [2023-03-07 04:47:41,022][118044] Updated weights for policy 0, policy_version 65390 (0.0006) [2023-03-07 04:47:41,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13124.3, 300 sec: 13152.3). Total num frames: 66960384. Throughput: 0: 13115.1. Samples: 66926137. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:47:41,086][117718] Avg episode reward: [(0, '2826.787')] [2023-03-07 04:47:41,794][118044] Updated weights for policy 0, policy_version 65400 (0.0008) [2023-03-07 04:47:42,577][118044] Updated weights for policy 0, policy_version 65410 (0.0007) [2023-03-07 04:47:43,345][118044] Updated weights for policy 0, policy_version 65420 (0.0006) [2023-03-07 04:47:44,128][118044] Updated weights for policy 0, policy_version 65430 (0.0006) [2023-03-07 04:47:44,913][118044] Updated weights for policy 0, policy_version 65440 (0.0006) [2023-03-07 04:47:45,674][118044] Updated weights for policy 0, policy_version 65450 (0.0006) [2023-03-07 04:47:46,086][117718] Fps is (10 sec: 13209.3, 60 sec: 13124.2, 300 sec: 13152.3). Total num frames: 67025920. Throughput: 0: 13125.1. Samples: 67005187. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:47:46,086][117718] Avg episode reward: [(0, '2972.466')] [2023-03-07 04:47:46,439][118044] Updated weights for policy 0, policy_version 65460 (0.0005) [2023-03-07 04:47:47,220][118044] Updated weights for policy 0, policy_version 65470 (0.0007) [2023-03-07 04:47:48,020][118044] Updated weights for policy 0, policy_version 65480 (0.0008) [2023-03-07 04:47:48,815][118044] Updated weights for policy 0, policy_version 65490 (0.0006) [2023-03-07 04:47:49,576][118044] Updated weights for policy 0, policy_version 65500 (0.0005) [2023-03-07 04:47:50,377][118044] Updated weights for policy 0, policy_version 65510 (0.0006) [2023-03-07 04:47:51,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13124.2, 300 sec: 13148.9). Total num frames: 67091456. Throughput: 0: 13118.7. Samples: 67083706. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:47:51,086][117718] Avg episode reward: [(0, '2956.779')] [2023-03-07 04:47:51,145][118044] Updated weights for policy 0, policy_version 65520 (0.0006) [2023-03-07 04:47:51,938][118044] Updated weights for policy 0, policy_version 65530 (0.0006) [2023-03-07 04:47:52,699][118044] Updated weights for policy 0, policy_version 65540 (0.0006) [2023-03-07 04:47:53,491][118044] Updated weights for policy 0, policy_version 65550 (0.0007) [2023-03-07 04:47:54,254][118044] Updated weights for policy 0, policy_version 65560 (0.0006) [2023-03-07 04:47:55,028][118044] Updated weights for policy 0, policy_version 65570 (0.0006) [2023-03-07 04:47:55,815][118044] Updated weights for policy 0, policy_version 65580 (0.0006) [2023-03-07 04:47:56,086][117718] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13148.9). Total num frames: 67156992. Throughput: 0: 13126.7. Samples: 67123170. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:47:56,086][117718] Avg episode reward: [(0, '2982.414')] [2023-03-07 04:47:56,593][118044] Updated weights for policy 0, policy_version 65590 (0.0008) [2023-03-07 04:47:57,362][118044] Updated weights for policy 0, policy_version 65600 (0.0007) [2023-03-07 04:47:58,131][118044] Updated weights for policy 0, policy_version 65610 (0.0005) [2023-03-07 04:47:58,912][118044] Updated weights for policy 0, policy_version 65620 (0.0006) [2023-03-07 04:47:59,698][118044] Updated weights for policy 0, policy_version 65630 (0.0006) [2023-03-07 04:48:00,472][118044] Updated weights for policy 0, policy_version 65640 (0.0007) [2023-03-07 04:48:01,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13124.3, 300 sec: 13148.9). Total num frames: 67222528. Throughput: 0: 13143.7. Samples: 67202459. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:48:01,086][117718] Avg episode reward: [(0, '2896.875')] [2023-03-07 04:48:01,279][118044] Updated weights for policy 0, policy_version 65650 (0.0006) [2023-03-07 04:48:02,051][118044] Updated weights for policy 0, policy_version 65660 (0.0005) [2023-03-07 04:48:02,814][118044] Updated weights for policy 0, policy_version 65670 (0.0006) [2023-03-07 04:48:03,606][118044] Updated weights for policy 0, policy_version 65680 (0.0006) [2023-03-07 04:48:04,396][118044] Updated weights for policy 0, policy_version 65690 (0.0006) [2023-03-07 04:48:05,158][118044] Updated weights for policy 0, policy_version 65700 (0.0006) [2023-03-07 04:48:05,936][118044] Updated weights for policy 0, policy_version 65710 (0.0006) [2023-03-07 04:48:06,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13145.4). Total num frames: 67288064. Throughput: 0: 13143.3. Samples: 67281088. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:48:06,086][117718] Avg episode reward: [(0, '2861.452')] [2023-03-07 04:48:06,705][118044] Updated weights for policy 0, policy_version 65720 (0.0006) [2023-03-07 04:48:07,485][118044] Updated weights for policy 0, policy_version 65730 (0.0007) [2023-03-07 04:48:08,249][118044] Updated weights for policy 0, policy_version 65740 (0.0006) [2023-03-07 04:48:09,030][118044] Updated weights for policy 0, policy_version 65750 (0.0006) [2023-03-07 04:48:09,800][118044] Updated weights for policy 0, policy_version 65760 (0.0006) [2023-03-07 04:48:10,584][118044] Updated weights for policy 0, policy_version 65770 (0.0006) [2023-03-07 04:48:11,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13148.8). Total num frames: 67354624. Throughput: 0: 13153.5. Samples: 67320831. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:48:11,086][117718] Avg episode reward: [(0, '2904.657')] [2023-03-07 04:48:11,361][118044] Updated weights for policy 0, policy_version 65780 (0.0006) [2023-03-07 04:48:12,150][118044] Updated weights for policy 0, policy_version 65790 (0.0005) [2023-03-07 04:48:12,938][118044] Updated weights for policy 0, policy_version 65800 (0.0006) [2023-03-07 04:48:13,708][118044] Updated weights for policy 0, policy_version 65810 (0.0006) [2023-03-07 04:48:14,498][118044] Updated weights for policy 0, policy_version 65820 (0.0006) [2023-03-07 04:48:15,285][118044] Updated weights for policy 0, policy_version 65830 (0.0006) [2023-03-07 04:48:16,052][118044] Updated weights for policy 0, policy_version 65840 (0.0007) [2023-03-07 04:48:16,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13141.3, 300 sec: 13148.8). Total num frames: 67420160. Throughput: 0: 13144.8. Samples: 67399615. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:48:16,086][117718] Avg episode reward: [(0, '2769.673')] [2023-03-07 04:48:16,833][118044] Updated weights for policy 0, policy_version 65850 (0.0006) [2023-03-07 04:48:17,618][118044] Updated weights for policy 0, policy_version 65860 (0.0006) [2023-03-07 04:48:18,394][118044] Updated weights for policy 0, policy_version 65870 (0.0006) [2023-03-07 04:48:19,172][118044] Updated weights for policy 0, policy_version 65880 (0.0006) [2023-03-07 04:48:19,939][118044] Updated weights for policy 0, policy_version 65890 (0.0007) [2023-03-07 04:48:20,721][118044] Updated weights for policy 0, policy_version 65900 (0.0006) [2023-03-07 04:48:21,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 67485696. Throughput: 0: 13152.3. Samples: 67478632. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:48:21,086][117718] Avg episode reward: [(0, '2860.045')] [2023-03-07 04:48:21,491][118044] Updated weights for policy 0, policy_version 65910 (0.0006) [2023-03-07 04:48:22,263][118044] Updated weights for policy 0, policy_version 65920 (0.0006) [2023-03-07 04:48:23,036][118044] Updated weights for policy 0, policy_version 65930 (0.0006) [2023-03-07 04:48:23,809][118044] Updated weights for policy 0, policy_version 65940 (0.0006) [2023-03-07 04:48:24,590][118044] Updated weights for policy 0, policy_version 65950 (0.0006) [2023-03-07 04:48:25,401][118044] Updated weights for policy 0, policy_version 65960 (0.0006) [2023-03-07 04:48:26,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 67551232. Throughput: 0: 13158.9. Samples: 67518290. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:48:26,086][117718] Avg episode reward: [(0, '2842.744')] [2023-03-07 04:48:26,168][118044] Updated weights for policy 0, policy_version 65970 (0.0006) [2023-03-07 04:48:26,960][118044] Updated weights for policy 0, policy_version 65980 (0.0006) [2023-03-07 04:48:27,742][118044] Updated weights for policy 0, policy_version 65990 (0.0006) [2023-03-07 04:48:28,524][118044] Updated weights for policy 0, policy_version 66000 (0.0007) [2023-03-07 04:48:29,289][118044] Updated weights for policy 0, policy_version 66010 (0.0006) [2023-03-07 04:48:30,068][118044] Updated weights for policy 0, policy_version 66020 (0.0008) [2023-03-07 04:48:30,844][118044] Updated weights for policy 0, policy_version 66030 (0.0006) [2023-03-07 04:48:31,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 67617792. Throughput: 0: 13149.8. Samples: 67596929. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:48:31,086][117718] Avg episode reward: [(0, '2914.631')] [2023-03-07 04:48:31,635][118044] Updated weights for policy 0, policy_version 66040 (0.0006) [2023-03-07 04:48:32,403][118044] Updated weights for policy 0, policy_version 66050 (0.0006) [2023-03-07 04:48:33,169][118044] Updated weights for policy 0, policy_version 66060 (0.0006) [2023-03-07 04:48:33,973][118044] Updated weights for policy 0, policy_version 66070 (0.0007) [2023-03-07 04:48:34,741][118044] Updated weights for policy 0, policy_version 66080 (0.0006) [2023-03-07 04:48:35,519][118044] Updated weights for policy 0, policy_version 66090 (0.0006) [2023-03-07 04:48:36,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 67683328. Throughput: 0: 13154.5. Samples: 67675657. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:48:36,086][117718] Avg episode reward: [(0, '2782.828')] [2023-03-07 04:48:36,092][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000066097_67683328.pth... 
[2023-03-07 04:48:36,121][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000063016_64528384.pth [2023-03-07 04:48:36,306][118044] Updated weights for policy 0, policy_version 66100 (0.0005) [2023-03-07 04:48:37,078][118044] Updated weights for policy 0, policy_version 66110 (0.0006) [2023-03-07 04:48:37,849][118044] Updated weights for policy 0, policy_version 66120 (0.0006) [2023-03-07 04:48:38,625][118044] Updated weights for policy 0, policy_version 66130 (0.0007) [2023-03-07 04:48:39,407][118044] Updated weights for policy 0, policy_version 66140 (0.0007) [2023-03-07 04:48:40,189][118044] Updated weights for policy 0, policy_version 66150 (0.0006) [2023-03-07 04:48:40,967][118044] Updated weights for policy 0, policy_version 66160 (0.0006) [2023-03-07 04:48:41,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 67748864. Throughput: 0: 13158.7. Samples: 67715313. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:48:41,086][117718] Avg episode reward: [(0, '2775.659')] [2023-03-07 04:48:41,729][118044] Updated weights for policy 0, policy_version 66170 (0.0006) [2023-03-07 04:48:42,507][118044] Updated weights for policy 0, policy_version 66180 (0.0007) [2023-03-07 04:48:43,293][118044] Updated weights for policy 0, policy_version 66190 (0.0006) [2023-03-07 04:48:44,071][118044] Updated weights for policy 0, policy_version 66200 (0.0007) [2023-03-07 04:48:44,858][118044] Updated weights for policy 0, policy_version 66210 (0.0006) [2023-03-07 04:48:45,639][118044] Updated weights for policy 0, policy_version 66220 (0.0008) [2023-03-07 04:48:46,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.4, 300 sec: 13145.4). Total num frames: 67814400. Throughput: 0: 13151.0. Samples: 67794252. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:48:46,086][117718] Avg episode reward: [(0, '2856.158')] [2023-03-07 04:48:46,422][118044] Updated weights for policy 0, policy_version 66230 (0.0006) [2023-03-07 04:48:47,199][118044] Updated weights for policy 0, policy_version 66240 (0.0006) [2023-03-07 04:48:47,985][118044] Updated weights for policy 0, policy_version 66250 (0.0006) [2023-03-07 04:48:48,772][118044] Updated weights for policy 0, policy_version 66260 (0.0006) [2023-03-07 04:48:49,544][118044] Updated weights for policy 0, policy_version 66270 (0.0006) [2023-03-07 04:48:50,329][118044] Updated weights for policy 0, policy_version 66280 (0.0006) [2023-03-07 04:48:51,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 67879936. Throughput: 0: 13151.6. Samples: 67872910. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:48:51,086][117718] Avg episode reward: [(0, '2867.205')] [2023-03-07 04:48:51,099][118044] Updated weights for policy 0, policy_version 66290 (0.0007) [2023-03-07 04:48:51,891][118044] Updated weights for policy 0, policy_version 66300 (0.0007) [2023-03-07 04:48:52,660][118044] Updated weights for policy 0, policy_version 66310 (0.0006) [2023-03-07 04:48:53,448][118044] Updated weights for policy 0, policy_version 66320 (0.0006) [2023-03-07 04:48:54,254][118044] Updated weights for policy 0, policy_version 66330 (0.0006) [2023-03-07 04:48:55,030][118044] Updated weights for policy 0, policy_version 66340 (0.0006) [2023-03-07 04:48:55,806][118044] Updated weights for policy 0, policy_version 66350 (0.0006) [2023-03-07 04:48:56,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 67945472. Throughput: 0: 13140.2. Samples: 67912139. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:48:56,086][117718] Avg episode reward: [(0, '2773.854')] [2023-03-07 04:48:56,583][118044] Updated weights for policy 0, policy_version 66360 (0.0006) [2023-03-07 04:48:57,357][118044] Updated weights for policy 0, policy_version 66370 (0.0007) [2023-03-07 04:48:58,125][118044] Updated weights for policy 0, policy_version 66380 (0.0006) [2023-03-07 04:48:58,914][118044] Updated weights for policy 0, policy_version 66390 (0.0006) [2023-03-07 04:48:59,700][118044] Updated weights for policy 0, policy_version 66400 (0.0006) [2023-03-07 04:49:00,482][118044] Updated weights for policy 0, policy_version 66410 (0.0006) [2023-03-07 04:49:01,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.4, 300 sec: 13145.4). Total num frames: 68011008. Throughput: 0: 13140.1. Samples: 67990918. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:49:01,086][117718] Avg episode reward: [(0, '2894.821')] [2023-03-07 04:49:01,253][118044] Updated weights for policy 0, policy_version 66420 (0.0006) [2023-03-07 04:49:02,031][118044] Updated weights for policy 0, policy_version 66430 (0.0007) [2023-03-07 04:49:02,817][118044] Updated weights for policy 0, policy_version 66440 (0.0005) [2023-03-07 04:49:03,602][118044] Updated weights for policy 0, policy_version 66450 (0.0007) [2023-03-07 04:49:04,388][118044] Updated weights for policy 0, policy_version 66460 (0.0006) [2023-03-07 04:49:05,149][118044] Updated weights for policy 0, policy_version 66470 (0.0007) [2023-03-07 04:49:05,917][118044] Updated weights for policy 0, policy_version 66480 (0.0006) [2023-03-07 04:49:06,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 68077568. Throughput: 0: 13134.5. Samples: 68069683. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:49:06,086][117718] Avg episode reward: [(0, '2735.153')] [2023-03-07 04:49:06,692][118044] Updated weights for policy 0, policy_version 66490 (0.0006) [2023-03-07 04:49:07,474][118044] Updated weights for policy 0, policy_version 66500 (0.0006) [2023-03-07 04:49:08,240][118044] Updated weights for policy 0, policy_version 66510 (0.0007) [2023-03-07 04:49:09,012][118044] Updated weights for policy 0, policy_version 66520 (0.0006) [2023-03-07 04:49:09,789][118044] Updated weights for policy 0, policy_version 66530 (0.0006) [2023-03-07 04:49:10,586][118044] Updated weights for policy 0, policy_version 66540 (0.0006) [2023-03-07 04:49:11,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.4, 300 sec: 13148.9). Total num frames: 68143104. Throughput: 0: 13136.1. Samples: 68109412. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:49:11,086][117718] Avg episode reward: [(0, '2853.788')] [2023-03-07 04:49:11,354][118044] Updated weights for policy 0, policy_version 66550 (0.0006) [2023-03-07 04:49:12,126][118044] Updated weights for policy 0, policy_version 66560 (0.0007) [2023-03-07 04:49:12,891][118044] Updated weights for policy 0, policy_version 66570 (0.0006) [2023-03-07 04:49:13,669][118044] Updated weights for policy 0, policy_version 66580 (0.0006) [2023-03-07 04:49:14,453][118044] Updated weights for policy 0, policy_version 66590 (0.0006) [2023-03-07 04:49:15,224][118044] Updated weights for policy 0, policy_version 66600 (0.0006) [2023-03-07 04:49:16,011][118044] Updated weights for policy 0, policy_version 66610 (0.0005) [2023-03-07 04:49:16,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 68208640. Throughput: 0: 13148.7. Samples: 68188620. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:49:16,086][117718] Avg episode reward: [(0, '2929.708')] [2023-03-07 04:49:16,791][118044] Updated weights for policy 0, policy_version 66620 (0.0006) [2023-03-07 04:49:17,583][118044] Updated weights for policy 0, policy_version 66630 (0.0006) [2023-03-07 04:49:18,378][118044] Updated weights for policy 0, policy_version 66640 (0.0006) [2023-03-07 04:49:19,161][118044] Updated weights for policy 0, policy_version 66650 (0.0005) [2023-03-07 04:49:19,918][118044] Updated weights for policy 0, policy_version 66660 (0.0008) [2023-03-07 04:49:20,700][118044] Updated weights for policy 0, policy_version 66670 (0.0006) [2023-03-07 04:49:21,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 68275200. Throughput: 0: 13147.7. Samples: 68267301. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:49:21,086][117718] Avg episode reward: [(0, '2997.701')] [2023-03-07 04:49:21,473][118044] Updated weights for policy 0, policy_version 66680 (0.0006) [2023-03-07 04:49:22,241][118044] Updated weights for policy 0, policy_version 66690 (0.0007) [2023-03-07 04:49:23,013][118044] Updated weights for policy 0, policy_version 66700 (0.0006) [2023-03-07 04:49:23,819][118044] Updated weights for policy 0, policy_version 66710 (0.0006) [2023-03-07 04:49:24,574][118044] Updated weights for policy 0, policy_version 66720 (0.0006) [2023-03-07 04:49:25,343][118044] Updated weights for policy 0, policy_version 66730 (0.0006) [2023-03-07 04:49:26,085][117718] Fps is (10 sec: 13209.8, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 68340736. Throughput: 0: 13149.9. Samples: 68307055. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:49:26,097][117718] Avg episode reward: [(0, '2827.847')] [2023-03-07 04:49:26,138][118044] Updated weights for policy 0, policy_version 66740 (0.0006) [2023-03-07 04:49:26,911][118044] Updated weights for policy 0, policy_version 66750 (0.0006) [2023-03-07 04:49:27,705][118044] Updated weights for policy 0, policy_version 66760 (0.0006) [2023-03-07 04:49:28,484][118044] Updated weights for policy 0, policy_version 66770 (0.0006) [2023-03-07 04:49:29,261][118044] Updated weights for policy 0, policy_version 66780 (0.0006) [2023-03-07 04:49:30,058][118044] Updated weights for policy 0, policy_version 66790 (0.0006) [2023-03-07 04:49:30,822][118044] Updated weights for policy 0, policy_version 66800 (0.0006) [2023-03-07 04:49:31,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 68406272. Throughput: 0: 13144.1. Samples: 68385738. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:49:31,094][117718] Avg episode reward: [(0, '2908.449')] [2023-03-07 04:49:31,611][118044] Updated weights for policy 0, policy_version 66810 (0.0007) [2023-03-07 04:49:32,393][118044] Updated weights for policy 0, policy_version 66820 (0.0006) [2023-03-07 04:49:33,173][118044] Updated weights for policy 0, policy_version 66830 (0.0006) [2023-03-07 04:49:33,946][118044] Updated weights for policy 0, policy_version 66840 (0.0007) [2023-03-07 04:49:34,729][118044] Updated weights for policy 0, policy_version 66850 (0.0007) [2023-03-07 04:49:35,492][118044] Updated weights for policy 0, policy_version 66860 (0.0006) [2023-03-07 04:49:36,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 68471808. Throughput: 0: 13148.3. Samples: 68464584. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:49:36,097][117718] Avg episode reward: [(0, '2902.295')] [2023-03-07 04:49:36,283][118044] Updated weights for policy 0, policy_version 66870 (0.0006) [2023-03-07 04:49:37,046][118044] Updated weights for policy 0, policy_version 66880 (0.0007) [2023-03-07 04:49:37,806][118044] Updated weights for policy 0, policy_version 66890 (0.0006) [2023-03-07 04:49:38,604][118044] Updated weights for policy 0, policy_version 66900 (0.0007) [2023-03-07 04:49:39,378][118044] Updated weights for policy 0, policy_version 66910 (0.0006) [2023-03-07 04:49:40,166][118044] Updated weights for policy 0, policy_version 66920 (0.0005) [2023-03-07 04:49:40,938][118044] Updated weights for policy 0, policy_version 66930 (0.0006) [2023-03-07 04:49:41,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.4, 300 sec: 13145.4). Total num frames: 68537344. Throughput: 0: 13153.6. Samples: 68504051. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:49:41,086][117718] Avg episode reward: [(0, '2860.679')] [2023-03-07 04:49:41,716][118044] Updated weights for policy 0, policy_version 66940 (0.0005) [2023-03-07 04:49:42,499][118044] Updated weights for policy 0, policy_version 66950 (0.0006) [2023-03-07 04:49:43,283][118044] Updated weights for policy 0, policy_version 66960 (0.0007) [2023-03-07 04:49:44,072][118044] Updated weights for policy 0, policy_version 66970 (0.0006) [2023-03-07 04:49:44,851][118044] Updated weights for policy 0, policy_version 66980 (0.0006) [2023-03-07 04:49:45,621][118044] Updated weights for policy 0, policy_version 66990 (0.0006) [2023-03-07 04:49:46,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 68602880. Throughput: 0: 13152.8. Samples: 68582794. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:49:46,086][117718] Avg episode reward: [(0, '3044.844')] [2023-03-07 04:49:46,400][118044] Updated weights for policy 0, policy_version 67000 (0.0006) [2023-03-07 04:49:47,183][118044] Updated weights for policy 0, policy_version 67010 (0.0006) [2023-03-07 04:49:47,964][118044] Updated weights for policy 0, policy_version 67020 (0.0006) [2023-03-07 04:49:48,738][118044] Updated weights for policy 0, policy_version 67030 (0.0007) [2023-03-07 04:49:49,517][118044] Updated weights for policy 0, policy_version 67040 (0.0006) [2023-03-07 04:49:50,296][118044] Updated weights for policy 0, policy_version 67050 (0.0006) [2023-03-07 04:49:51,082][118044] Updated weights for policy 0, policy_version 67060 (0.0006) [2023-03-07 04:49:51,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 68669440. Throughput: 0: 13158.0. Samples: 68661790. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:49:51,086][117718] Avg episode reward: [(0, '3042.300')] [2023-03-07 04:49:51,849][118044] Updated weights for policy 0, policy_version 67070 (0.0005) [2023-03-07 04:49:52,652][118044] Updated weights for policy 0, policy_version 67080 (0.0007) [2023-03-07 04:49:53,428][118044] Updated weights for policy 0, policy_version 67090 (0.0006) [2023-03-07 04:49:54,204][118044] Updated weights for policy 0, policy_version 67100 (0.0006) [2023-03-07 04:49:54,968][118044] Updated weights for policy 0, policy_version 67110 (0.0006) [2023-03-07 04:49:55,760][118044] Updated weights for policy 0, policy_version 67120 (0.0007) [2023-03-07 04:49:56,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 68734976. Throughput: 0: 13147.8. Samples: 68701063. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:49:56,086][117718] Avg episode reward: [(0, '3047.419')] [2023-03-07 04:49:56,533][118044] Updated weights for policy 0, policy_version 67130 (0.0007) [2023-03-07 04:49:57,322][118044] Updated weights for policy 0, policy_version 67140 (0.0008) [2023-03-07 04:49:58,093][118044] Updated weights for policy 0, policy_version 67150 (0.0006) [2023-03-07 04:49:58,868][118044] Updated weights for policy 0, policy_version 67160 (0.0006) [2023-03-07 04:49:59,641][118044] Updated weights for policy 0, policy_version 67170 (0.0006) [2023-03-07 04:50:00,448][118044] Updated weights for policy 0, policy_version 67180 (0.0006) [2023-03-07 04:50:01,086][117718] Fps is (10 sec: 13004.7, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 68799488. Throughput: 0: 13141.4. Samples: 68779982. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:50:01,086][117718] Avg episode reward: [(0, '2945.142')] [2023-03-07 04:50:01,241][118044] Updated weights for policy 0, policy_version 67190 (0.0006) [2023-03-07 04:50:02,014][118044] Updated weights for policy 0, policy_version 67200 (0.0006) [2023-03-07 04:50:02,782][118044] Updated weights for policy 0, policy_version 67210 (0.0005) [2023-03-07 04:50:03,583][118044] Updated weights for policy 0, policy_version 67220 (0.0006) [2023-03-07 04:50:04,363][118044] Updated weights for policy 0, policy_version 67230 (0.0006) [2023-03-07 04:50:05,147][118044] Updated weights for policy 0, policy_version 67240 (0.0007) [2023-03-07 04:50:05,957][118044] Updated weights for policy 0, policy_version 67250 (0.0006) [2023-03-07 04:50:06,085][117718] Fps is (10 sec: 13004.8, 60 sec: 13124.3, 300 sec: 13141.9). Total num frames: 68865024. Throughput: 0: 13128.1. Samples: 68858067. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:50:06,086][117718] Avg episode reward: [(0, '2901.013')] [2023-03-07 04:50:06,720][118044] Updated weights for policy 0, policy_version 67260 (0.0005) [2023-03-07 04:50:07,503][118044] Updated weights for policy 0, policy_version 67270 (0.0007) [2023-03-07 04:50:08,280][118044] Updated weights for policy 0, policy_version 67280 (0.0006) [2023-03-07 04:50:09,043][118044] Updated weights for policy 0, policy_version 67290 (0.0008) [2023-03-07 04:50:09,802][118044] Updated weights for policy 0, policy_version 67300 (0.0006) [2023-03-07 04:50:10,617][118044] Updated weights for policy 0, policy_version 67310 (0.0006) [2023-03-07 04:50:11,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 68931584. Throughput: 0: 13125.5. Samples: 68897703. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:50:11,086][117718] Avg episode reward: [(0, '2956.521')] [2023-03-07 04:50:11,378][118044] Updated weights for policy 0, policy_version 67320 (0.0006) [2023-03-07 04:50:12,160][118044] Updated weights for policy 0, policy_version 67330 (0.0006) [2023-03-07 04:50:12,950][118044] Updated weights for policy 0, policy_version 67340 (0.0006) [2023-03-07 04:50:13,711][118044] Updated weights for policy 0, policy_version 67350 (0.0006) [2023-03-07 04:50:14,480][118044] Updated weights for policy 0, policy_version 67360 (0.0006) [2023-03-07 04:50:15,254][118044] Updated weights for policy 0, policy_version 67370 (0.0006) [2023-03-07 04:50:16,041][118044] Updated weights for policy 0, policy_version 67380 (0.0007) [2023-03-07 04:50:16,085][117718] Fps is (10 sec: 13209.5, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 68997120. Throughput: 0: 13131.5. Samples: 68976653. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:50:16,086][117718] Avg episode reward: [(0, '3079.222')] [2023-03-07 04:50:16,845][118044] Updated weights for policy 0, policy_version 67390 (0.0006) [2023-03-07 04:50:17,605][118044] Updated weights for policy 0, policy_version 67400 (0.0005) [2023-03-07 04:50:18,381][118044] Updated weights for policy 0, policy_version 67410 (0.0006) [2023-03-07 04:50:19,146][118044] Updated weights for policy 0, policy_version 67420 (0.0007) [2023-03-07 04:50:19,927][118044] Updated weights for policy 0, policy_version 67430 (0.0006) [2023-03-07 04:50:20,721][118044] Updated weights for policy 0, policy_version 67440 (0.0007) [2023-03-07 04:50:21,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13141.9). Total num frames: 69062656. Throughput: 0: 13135.1. Samples: 69055664. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:50:21,086][117718] Avg episode reward: [(0, '3053.429')] [2023-03-07 04:50:21,486][118044] Updated weights for policy 0, policy_version 67450 (0.0006) [2023-03-07 04:50:22,269][118044] Updated weights for policy 0, policy_version 67460 (0.0006) [2023-03-07 04:50:23,068][118044] Updated weights for policy 0, policy_version 67470 (0.0007) [2023-03-07 04:50:23,824][118044] Updated weights for policy 0, policy_version 67480 (0.0006) [2023-03-07 04:50:24,588][118044] Updated weights for policy 0, policy_version 67490 (0.0006) [2023-03-07 04:50:25,377][118044] Updated weights for policy 0, policy_version 67500 (0.0005) [2023-03-07 04:50:26,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 69129216. Throughput: 0: 13133.4. Samples: 69095053. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:50:26,086][117718] Avg episode reward: [(0, '2993.830')] [2023-03-07 04:50:26,136][118044] Updated weights for policy 0, policy_version 67510 (0.0006) [2023-03-07 04:50:26,924][118044] Updated weights for policy 0, policy_version 67520 (0.0006) [2023-03-07 04:50:27,694][118044] Updated weights for policy 0, policy_version 67530 (0.0006) [2023-03-07 04:50:28,493][118044] Updated weights for policy 0, policy_version 67540 (0.0006) [2023-03-07 04:50:29,290][118044] Updated weights for policy 0, policy_version 67550 (0.0007) [2023-03-07 04:50:30,092][118044] Updated weights for policy 0, policy_version 67560 (0.0006) [2023-03-07 04:50:30,857][118044] Updated weights for policy 0, policy_version 67570 (0.0006) [2023-03-07 04:50:31,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13138.4). Total num frames: 69193728. Throughput: 0: 13132.2. Samples: 69173742. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:50:31,086][117718] Avg episode reward: [(0, '3021.634')] [2023-03-07 04:50:31,650][118044] Updated weights for policy 0, policy_version 67580 (0.0006) [2023-03-07 04:50:32,438][118044] Updated weights for policy 0, policy_version 67590 (0.0006) [2023-03-07 04:50:33,218][118044] Updated weights for policy 0, policy_version 67600 (0.0006) [2023-03-07 04:50:33,999][118044] Updated weights for policy 0, policy_version 67610 (0.0006) [2023-03-07 04:50:34,764][118044] Updated weights for policy 0, policy_version 67620 (0.0006) [2023-03-07 04:50:35,530][118044] Updated weights for policy 0, policy_version 67630 (0.0006) [2023-03-07 04:50:36,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 69260288. Throughput: 0: 13124.4. Samples: 69252389. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:50:36,086][117718] Avg episode reward: [(0, '3013.355')] [2023-03-07 04:50:36,090][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000067637_69260288.pth... [2023-03-07 04:50:36,120][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000064557_66106368.pth [2023-03-07 04:50:36,311][118044] Updated weights for policy 0, policy_version 67640 (0.0006) [2023-03-07 04:50:37,116][118044] Updated weights for policy 0, policy_version 67650 (0.0007) [2023-03-07 04:50:37,893][118044] Updated weights for policy 0, policy_version 67660 (0.0006) [2023-03-07 04:50:38,665][118044] Updated weights for policy 0, policy_version 67670 (0.0007) [2023-03-07 04:50:39,450][118044] Updated weights for policy 0, policy_version 67680 (0.0006) [2023-03-07 04:50:40,229][118044] Updated weights for policy 0, policy_version 67690 (0.0007) [2023-03-07 04:50:41,005][118044] Updated weights for policy 0, policy_version 67700 (0.0006) [2023-03-07 04:50:41,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 69325824. Throughput: 0: 13125.9. Samples: 69291726. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:50:41,086][117718] Avg episode reward: [(0, '3075.162')] [2023-03-07 04:50:41,803][118044] Updated weights for policy 0, policy_version 67710 (0.0006) [2023-03-07 04:50:42,570][118044] Updated weights for policy 0, policy_version 67720 (0.0005) [2023-03-07 04:50:43,332][118044] Updated weights for policy 0, policy_version 67730 (0.0005) [2023-03-07 04:50:44,099][118044] Updated weights for policy 0, policy_version 67740 (0.0006) [2023-03-07 04:50:44,873][118044] Updated weights for policy 0, policy_version 67750 (0.0005) [2023-03-07 04:50:45,648][118044] Updated weights for policy 0, policy_version 67760 (0.0006) [2023-03-07 04:50:46,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13138.4). Total num frames: 69391360. Throughput: 0: 13131.5. Samples: 69370901. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:50:46,086][117718] Avg episode reward: [(0, '3089.042')] [2023-03-07 04:50:46,431][118044] Updated weights for policy 0, policy_version 67770 (0.0007) [2023-03-07 04:50:47,217][118044] Updated weights for policy 0, policy_version 67780 (0.0005) [2023-03-07 04:50:47,993][118044] Updated weights for policy 0, policy_version 67790 (0.0007) [2023-03-07 04:50:48,780][118044] Updated weights for policy 0, policy_version 67800 (0.0006) [2023-03-07 04:50:49,568][118044] Updated weights for policy 0, policy_version 67810 (0.0007) [2023-03-07 04:50:50,343][118044] Updated weights for policy 0, policy_version 67820 (0.0006) [2023-03-07 04:50:51,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13124.3, 300 sec: 13138.4). Total num frames: 69456896. Throughput: 0: 13146.0. Samples: 69449636. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:50:51,086][117718] Avg episode reward: [(0, '2981.903')] [2023-03-07 04:50:51,112][118044] Updated weights for policy 0, policy_version 67830 (0.0006) [2023-03-07 04:50:51,887][118044] Updated weights for policy 0, policy_version 67840 (0.0006) [2023-03-07 04:50:52,664][118044] Updated weights for policy 0, policy_version 67850 (0.0007) [2023-03-07 04:50:53,437][118044] Updated weights for policy 0, policy_version 67860 (0.0006) [2023-03-07 04:50:54,226][118044] Updated weights for policy 0, policy_version 67870 (0.0006) [2023-03-07 04:50:55,001][118044] Updated weights for policy 0, policy_version 67880 (0.0006) [2023-03-07 04:50:55,767][118044] Updated weights for policy 0, policy_version 67890 (0.0006) [2023-03-07 04:50:56,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 69523456. Throughput: 0: 13146.8. Samples: 69489308. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:50:56,086][117718] Avg episode reward: [(0, '3017.769')] [2023-03-07 04:50:56,538][118044] Updated weights for policy 0, policy_version 67900 (0.0006) [2023-03-07 04:50:57,309][118044] Updated weights for policy 0, policy_version 67910 (0.0006) [2023-03-07 04:50:58,094][118044] Updated weights for policy 0, policy_version 67920 (0.0007) [2023-03-07 04:50:58,883][118044] Updated weights for policy 0, policy_version 67930 (0.0006) [2023-03-07 04:50:59,665][118044] Updated weights for policy 0, policy_version 67940 (0.0006) [2023-03-07 04:51:00,447][118044] Updated weights for policy 0, policy_version 67950 (0.0006) [2023-03-07 04:51:01,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13141.9). Total num frames: 69588992. Throughput: 0: 13147.6. Samples: 69568296. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:51:01,086][117718] Avg episode reward: [(0, '2985.356')] [2023-03-07 04:51:01,243][118044] Updated weights for policy 0, policy_version 67960 (0.0006) [2023-03-07 04:51:02,017][118044] Updated weights for policy 0, policy_version 67970 (0.0006) [2023-03-07 04:51:02,779][118044] Updated weights for policy 0, policy_version 67980 (0.0006) [2023-03-07 04:51:03,551][118044] Updated weights for policy 0, policy_version 67990 (0.0006) [2023-03-07 04:51:04,356][118044] Updated weights for policy 0, policy_version 68000 (0.0006) [2023-03-07 04:51:05,131][118044] Updated weights for policy 0, policy_version 68010 (0.0006) [2023-03-07 04:51:05,901][118044] Updated weights for policy 0, policy_version 68020 (0.0006) [2023-03-07 04:51:06,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13141.9). Total num frames: 69654528. Throughput: 0: 13138.4. Samples: 69646893. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:51:06,086][117718] Avg episode reward: [(0, '2900.181')] [2023-03-07 04:51:06,697][118044] Updated weights for policy 0, policy_version 68030 (0.0006) [2023-03-07 04:51:07,482][118044] Updated weights for policy 0, policy_version 68040 (0.0007) [2023-03-07 04:51:08,242][118044] Updated weights for policy 0, policy_version 68050 (0.0006) [2023-03-07 04:51:09,028][118044] Updated weights for policy 0, policy_version 68060 (0.0007) [2023-03-07 04:51:09,805][118044] Updated weights for policy 0, policy_version 68070 (0.0006) [2023-03-07 04:51:10,589][118044] Updated weights for policy 0, policy_version 68080 (0.0005) [2023-03-07 04:51:11,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 69720064. Throughput: 0: 13139.2. Samples: 69686318. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:51:11,086][117718] Avg episode reward: [(0, '2946.968')] [2023-03-07 04:51:11,369][118044] Updated weights for policy 0, policy_version 68090 (0.0006) [2023-03-07 04:51:12,157][118044] Updated weights for policy 0, policy_version 68100 (0.0006) [2023-03-07 04:51:12,921][118044] Updated weights for policy 0, policy_version 68110 (0.0006) [2023-03-07 04:51:13,695][118044] Updated weights for policy 0, policy_version 68120 (0.0007) [2023-03-07 04:51:14,485][118044] Updated weights for policy 0, policy_version 68130 (0.0006) [2023-03-07 04:51:15,265][118044] Updated weights for policy 0, policy_version 68140 (0.0006) [2023-03-07 04:51:16,050][118044] Updated weights for policy 0, policy_version 68150 (0.0006) [2023-03-07 04:51:16,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13138.4). Total num frames: 69785600. Throughput: 0: 13141.7. Samples: 69765118. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:51:16,086][117718] Avg episode reward: [(0, '2909.778')] [2023-03-07 04:51:16,826][118044] Updated weights for policy 0, policy_version 68160 (0.0006) [2023-03-07 04:51:17,606][118044] Updated weights for policy 0, policy_version 68170 (0.0006) [2023-03-07 04:51:18,380][118044] Updated weights for policy 0, policy_version 68180 (0.0006) [2023-03-07 04:51:19,150][118044] Updated weights for policy 0, policy_version 68190 (0.0006) [2023-03-07 04:51:19,937][118044] Updated weights for policy 0, policy_version 68200 (0.0006) [2023-03-07 04:51:20,711][118044] Updated weights for policy 0, policy_version 68210 (0.0006) [2023-03-07 04:51:21,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13138.4). Total num frames: 69851136. Throughput: 0: 13147.7. Samples: 69844036. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:51:21,086][117718] Avg episode reward: [(0, '2913.443')] [2023-03-07 04:51:21,485][118044] Updated weights for policy 0, policy_version 68220 (0.0007) [2023-03-07 04:51:22,250][118044] Updated weights for policy 0, policy_version 68230 (0.0006) [2023-03-07 04:51:23,017][118044] Updated weights for policy 0, policy_version 68240 (0.0006) [2023-03-07 04:51:23,798][118044] Updated weights for policy 0, policy_version 68250 (0.0006) [2023-03-07 04:51:24,565][118044] Updated weights for policy 0, policy_version 68260 (0.0006) [2023-03-07 04:51:25,342][118044] Updated weights for policy 0, policy_version 68270 (0.0005) [2023-03-07 04:51:26,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 69917696. Throughput: 0: 13158.9. Samples: 69883878. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:51:26,086][117718] Avg episode reward: [(0, '2981.845')] [2023-03-07 04:51:26,121][118044] Updated weights for policy 0, policy_version 68280 (0.0006) [2023-03-07 04:51:26,883][118044] Updated weights for policy 0, policy_version 68290 (0.0006) [2023-03-07 04:51:27,654][118044] Updated weights for policy 0, policy_version 68300 (0.0006) [2023-03-07 04:51:28,429][118044] Updated weights for policy 0, policy_version 68310 (0.0007) [2023-03-07 04:51:29,208][118044] Updated weights for policy 0, policy_version 68320 (0.0006) [2023-03-07 04:51:30,003][118044] Updated weights for policy 0, policy_version 68330 (0.0007) [2023-03-07 04:51:30,784][118044] Updated weights for policy 0, policy_version 68340 (0.0006) [2023-03-07 04:51:31,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13141.9). Total num frames: 69983232. Throughput: 0: 13162.1. Samples: 69963192. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:51:31,086][117718] Avg episode reward: [(0, '2990.695')] [2023-03-07 04:51:31,550][118044] Updated weights for policy 0, policy_version 68350 (0.0006) [2023-03-07 04:51:32,336][118044] Updated weights for policy 0, policy_version 68360 (0.0006) [2023-03-07 04:51:33,121][118044] Updated weights for policy 0, policy_version 68370 (0.0006) [2023-03-07 04:51:33,879][118044] Updated weights for policy 0, policy_version 68380 (0.0007) [2023-03-07 04:51:34,653][118044] Updated weights for policy 0, policy_version 68390 (0.0006) [2023-03-07 04:51:35,438][118044] Updated weights for policy 0, policy_version 68400 (0.0006) [2023-03-07 04:51:36,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13141.9). Total num frames: 70049792. Throughput: 0: 13173.5. Samples: 70042443. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:51:36,086][117718] Avg episode reward: [(0, '3006.457')] [2023-03-07 04:51:36,198][118044] Updated weights for policy 0, policy_version 68410 (0.0006) [2023-03-07 04:51:36,981][118044] Updated weights for policy 0, policy_version 68420 (0.0007) [2023-03-07 04:51:37,758][118044] Updated weights for policy 0, policy_version 68430 (0.0006) [2023-03-07 04:51:38,530][118044] Updated weights for policy 0, policy_version 68440 (0.0007) [2023-03-07 04:51:39,301][118044] Updated weights for policy 0, policy_version 68450 (0.0005) [2023-03-07 04:51:40,084][118044] Updated weights for policy 0, policy_version 68460 (0.0007) [2023-03-07 04:51:40,846][118044] Updated weights for policy 0, policy_version 68470 (0.0006) [2023-03-07 04:51:41,085][117718] Fps is (10 sec: 13312.0, 60 sec: 13175.5, 300 sec: 13145.4). Total num frames: 70116352. Throughput: 0: 13166.8. Samples: 70081812. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:51:41,086][117718] Avg episode reward: [(0, '2983.202')] [2023-03-07 04:51:41,631][118044] Updated weights for policy 0, policy_version 68480 (0.0006) [2023-03-07 04:51:42,391][118044] Updated weights for policy 0, policy_version 68490 (0.0006) [2023-03-07 04:51:43,174][118044] Updated weights for policy 0, policy_version 68500 (0.0006) [2023-03-07 04:51:43,949][118044] Updated weights for policy 0, policy_version 68510 (0.0006) [2023-03-07 04:51:44,724][118044] Updated weights for policy 0, policy_version 68520 (0.0007) [2023-03-07 04:51:45,508][118044] Updated weights for policy 0, policy_version 68530 (0.0006) [2023-03-07 04:51:46,085][117718] Fps is (10 sec: 13209.5, 60 sec: 13175.5, 300 sec: 13145.4). Total num frames: 70181888. Throughput: 0: 13178.3. Samples: 70161319. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:51:46,086][117718] Avg episode reward: [(0, '2891.883')] [2023-03-07 04:51:46,296][118044] Updated weights for policy 0, policy_version 68540 (0.0007) [2023-03-07 04:51:47,051][118044] Updated weights for policy 0, policy_version 68550 (0.0006) [2023-03-07 04:51:47,821][118044] Updated weights for policy 0, policy_version 68560 (0.0006) [2023-03-07 04:51:48,610][118044] Updated weights for policy 0, policy_version 68570 (0.0007) [2023-03-07 04:51:49,396][118044] Updated weights for policy 0, policy_version 68580 (0.0006) [2023-03-07 04:51:50,178][118044] Updated weights for policy 0, policy_version 68590 (0.0006) [2023-03-07 04:51:50,952][118044] Updated weights for policy 0, policy_version 68600 (0.0006) [2023-03-07 04:51:51,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13175.5, 300 sec: 13145.4). Total num frames: 70247424. Throughput: 0: 13186.2. Samples: 70240269. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:51:51,086][117718] Avg episode reward: [(0, '2943.533')] [2023-03-07 04:51:51,751][118044] Updated weights for policy 0, policy_version 68610 (0.0006) [2023-03-07 04:51:52,528][118044] Updated weights for policy 0, policy_version 68620 (0.0006) [2023-03-07 04:51:53,299][118044] Updated weights for policy 0, policy_version 68630 (0.0006) [2023-03-07 04:51:54,079][118044] Updated weights for policy 0, policy_version 68640 (0.0006) [2023-03-07 04:51:54,850][118044] Updated weights for policy 0, policy_version 68650 (0.0006) [2023-03-07 04:51:55,625][118044] Updated weights for policy 0, policy_version 68660 (0.0006) [2023-03-07 04:51:56,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13148.9). Total num frames: 70313984. Throughput: 0: 13178.9. Samples: 70279366. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:51:56,086][117718] Avg episode reward: [(0, '2937.368')] [2023-03-07 04:51:56,402][118044] Updated weights for policy 0, policy_version 68670 (0.0006) [2023-03-07 04:51:57,188][118044] Updated weights for policy 0, policy_version 68680 (0.0007) [2023-03-07 04:51:57,966][118044] Updated weights for policy 0, policy_version 68690 (0.0006) [2023-03-07 04:51:58,741][118044] Updated weights for policy 0, policy_version 68700 (0.0006) [2023-03-07 04:51:59,532][118044] Updated weights for policy 0, policy_version 68710 (0.0006) [2023-03-07 04:52:00,301][118044] Updated weights for policy 0, policy_version 68720 (0.0006) [2023-03-07 04:52:01,073][118044] Updated weights for policy 0, policy_version 68730 (0.0006) [2023-03-07 04:52:01,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13148.9). Total num frames: 70379520. Throughput: 0: 13187.8. Samples: 70358569. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:52:01,086][117718] Avg episode reward: [(0, '2882.726')] [2023-03-07 04:52:01,857][118044] Updated weights for policy 0, policy_version 68740 (0.0006) [2023-03-07 04:52:02,654][118044] Updated weights for policy 0, policy_version 68750 (0.0006) [2023-03-07 04:52:03,429][118044] Updated weights for policy 0, policy_version 68760 (0.0006) [2023-03-07 04:52:04,218][118044] Updated weights for policy 0, policy_version 68770 (0.0006) [2023-03-07 04:52:04,989][118044] Updated weights for policy 0, policy_version 68780 (0.0006) [2023-03-07 04:52:05,766][118044] Updated weights for policy 0, policy_version 68790 (0.0006) [2023-03-07 04:52:06,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13175.5, 300 sec: 13148.9). Total num frames: 70445056. Throughput: 0: 13185.9. Samples: 70437400. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:52:06,086][117718] Avg episode reward: [(0, '2938.926')] [2023-03-07 04:52:06,545][118044] Updated weights for policy 0, policy_version 68800 (0.0006) [2023-03-07 04:52:07,314][118044] Updated weights for policy 0, policy_version 68810 (0.0006) [2023-03-07 04:52:08,087][118044] Updated weights for policy 0, policy_version 68820 (0.0006) [2023-03-07 04:52:08,873][118044] Updated weights for policy 0, policy_version 68830 (0.0006) [2023-03-07 04:52:09,641][118044] Updated weights for policy 0, policy_version 68840 (0.0006) [2023-03-07 04:52:10,415][118044] Updated weights for policy 0, policy_version 68850 (0.0006) [2023-03-07 04:52:11,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13175.5, 300 sec: 13148.9). Total num frames: 70510592. Throughput: 0: 13180.4. Samples: 70476995. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:52:11,086][117718] Avg episode reward: [(0, '2967.788')] [2023-03-07 04:52:11,190][118044] Updated weights for policy 0, policy_version 68860 (0.0006) [2023-03-07 04:52:11,957][118044] Updated weights for policy 0, policy_version 68870 (0.0006) [2023-03-07 04:52:12,776][118044] Updated weights for policy 0, policy_version 68880 (0.0007) [2023-03-07 04:52:13,541][118044] Updated weights for policy 0, policy_version 68890 (0.0006) [2023-03-07 04:52:14,319][118044] Updated weights for policy 0, policy_version 68900 (0.0006) [2023-03-07 04:52:15,098][118044] Updated weights for policy 0, policy_version 68910 (0.0006) [2023-03-07 04:52:15,882][118044] Updated weights for policy 0, policy_version 68920 (0.0006) [2023-03-07 04:52:16,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13175.4, 300 sec: 13148.8). Total num frames: 70576128. Throughput: 0: 13168.6. Samples: 70555780. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:52:16,086][117718] Avg episode reward: [(0, '2944.941')] [2023-03-07 04:52:16,660][118044] Updated weights for policy 0, policy_version 68930 (0.0007) [2023-03-07 04:52:17,445][118044] Updated weights for policy 0, policy_version 68940 (0.0006) [2023-03-07 04:52:18,206][118044] Updated weights for policy 0, policy_version 68950 (0.0006) [2023-03-07 04:52:18,988][118044] Updated weights for policy 0, policy_version 68960 (0.0006) [2023-03-07 04:52:19,772][118044] Updated weights for policy 0, policy_version 68970 (0.0006) [2023-03-07 04:52:20,543][118044] Updated weights for policy 0, policy_version 68980 (0.0006) [2023-03-07 04:52:21,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13175.5, 300 sec: 13148.9). Total num frames: 70641664. Throughput: 0: 13159.5. Samples: 70634619. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:52:21,086][117718] Avg episode reward: [(0, '2858.277')] [2023-03-07 04:52:21,336][118044] Updated weights for policy 0, policy_version 68990 (0.0006) [2023-03-07 04:52:22,081][118044] Updated weights for policy 0, policy_version 69000 (0.0007) [2023-03-07 04:52:22,878][118044] Updated weights for policy 0, policy_version 69010 (0.0006) [2023-03-07 04:52:23,648][118044] Updated weights for policy 0, policy_version 69020 (0.0006) [2023-03-07 04:52:24,437][118044] Updated weights for policy 0, policy_version 69030 (0.0006) [2023-03-07 04:52:25,231][118044] Updated weights for policy 0, policy_version 69040 (0.0006) [2023-03-07 04:52:26,016][118044] Updated weights for policy 0, policy_version 69050 (0.0006) [2023-03-07 04:52:26,086][117718] Fps is (10 sec: 13209.7, 60 sec: 13175.4, 300 sec: 13152.3). Total num frames: 70708224. Throughput: 0: 13164.8. Samples: 70674228. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:52:26,086][117718] Avg episode reward: [(0, '2832.483')] [2023-03-07 04:52:26,796][118044] Updated weights for policy 0, policy_version 69060 (0.0007) [2023-03-07 04:52:27,579][118044] Updated weights for policy 0, policy_version 69070 (0.0006) [2023-03-07 04:52:28,342][118044] Updated weights for policy 0, policy_version 69080 (0.0006) [2023-03-07 04:52:29,130][118044] Updated weights for policy 0, policy_version 69090 (0.0006) [2023-03-07 04:52:29,927][118044] Updated weights for policy 0, policy_version 69100 (0.0006) [2023-03-07 04:52:30,680][118044] Updated weights for policy 0, policy_version 69110 (0.0006) [2023-03-07 04:52:31,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13152.3). Total num frames: 70773760. Throughput: 0: 13143.8. Samples: 70752789. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:52:31,086][117718] Avg episode reward: [(0, '2889.225')] [2023-03-07 04:52:31,462][118044] Updated weights for policy 0, policy_version 69120 (0.0005) [2023-03-07 04:52:32,249][118044] Updated weights for policy 0, policy_version 69130 (0.0006) [2023-03-07 04:52:33,027][118044] Updated weights for policy 0, policy_version 69140 (0.0007) [2023-03-07 04:52:33,807][118044] Updated weights for policy 0, policy_version 69150 (0.0007) [2023-03-07 04:52:34,591][118044] Updated weights for policy 0, policy_version 69160 (0.0006) [2023-03-07 04:52:35,361][118044] Updated weights for policy 0, policy_version 69170 (0.0006) [2023-03-07 04:52:36,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13148.8). Total num frames: 70839296. Throughput: 0: 13142.0. Samples: 70831662. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:52:36,086][117718] Avg episode reward: [(0, '3019.859')] [2023-03-07 04:52:36,090][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000069179_70839296.pth... 
[2023-03-07 04:52:36,120][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000066097_67683328.pth [2023-03-07 04:52:36,154][118044] Updated weights for policy 0, policy_version 69180 (0.0006) [2023-03-07 04:52:36,922][118044] Updated weights for policy 0, policy_version 69190 (0.0006) [2023-03-07 04:52:37,707][118044] Updated weights for policy 0, policy_version 69200 (0.0006) [2023-03-07 04:52:38,478][118044] Updated weights for policy 0, policy_version 69210 (0.0006) [2023-03-07 04:52:39,253][118044] Updated weights for policy 0, policy_version 69220 (0.0006) [2023-03-07 04:52:40,026][118044] Updated weights for policy 0, policy_version 69230 (0.0006) [2023-03-07 04:52:40,807][118044] Updated weights for policy 0, policy_version 69240 (0.0006) [2023-03-07 04:52:41,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 70904832. Throughput: 0: 13148.9. Samples: 70871067. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:52:41,086][117718] Avg episode reward: [(0, '2878.675')] [2023-03-07 04:52:41,598][118044] Updated weights for policy 0, policy_version 69250 (0.0007) [2023-03-07 04:52:42,366][118044] Updated weights for policy 0, policy_version 69260 (0.0006) [2023-03-07 04:52:43,146][118044] Updated weights for policy 0, policy_version 69270 (0.0007) [2023-03-07 04:52:43,916][118044] Updated weights for policy 0, policy_version 69280 (0.0007) [2023-03-07 04:52:44,691][118044] Updated weights for policy 0, policy_version 69290 (0.0007) [2023-03-07 04:52:45,477][118044] Updated weights for policy 0, policy_version 69300 (0.0007) [2023-03-07 04:52:46,086][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 70970368. Throughput: 0: 13147.3. Samples: 70950197. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:52:46,086][117718] Avg episode reward: [(0, '2909.951')] [2023-03-07 04:52:46,247][118044] Updated weights for policy 0, policy_version 69310 (0.0006) [2023-03-07 04:52:47,014][118044] Updated weights for policy 0, policy_version 69320 (0.0006) [2023-03-07 04:52:47,802][118044] Updated weights for policy 0, policy_version 69330 (0.0006) [2023-03-07 04:52:48,594][118044] Updated weights for policy 0, policy_version 69340 (0.0007) [2023-03-07 04:52:49,364][118044] Updated weights for policy 0, policy_version 69350 (0.0006) [2023-03-07 04:52:50,152][118044] Updated weights for policy 0, policy_version 69360 (0.0007) [2023-03-07 04:52:50,925][118044] Updated weights for policy 0, policy_version 69370 (0.0006) [2023-03-07 04:52:51,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 71036928. Throughput: 0: 13147.5. Samples: 71029038. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:52:51,086][117718] Avg episode reward: [(0, '2971.178')] [2023-03-07 04:52:51,699][118044] Updated weights for policy 0, policy_version 69380 (0.0007) [2023-03-07 04:52:52,502][118044] Updated weights for policy 0, policy_version 69390 (0.0007) [2023-03-07 04:52:53,274][118044] Updated weights for policy 0, policy_version 69400 (0.0006) [2023-03-07 04:52:54,054][118044] Updated weights for policy 0, policy_version 69410 (0.0007) [2023-03-07 04:52:54,851][118044] Updated weights for policy 0, policy_version 69420 (0.0005) [2023-03-07 04:52:55,639][118044] Updated weights for policy 0, policy_version 69430 (0.0006) [2023-03-07 04:52:56,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.2, 300 sec: 13148.9). Total num frames: 71101440. Throughput: 0: 13140.7. Samples: 71068329. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:52:56,086][117718] Avg episode reward: [(0, '2972.495')] [2023-03-07 04:52:56,440][118044] Updated weights for policy 0, policy_version 69440 (0.0006) [2023-03-07 04:52:57,208][118044] Updated weights for policy 0, policy_version 69450 (0.0006) [2023-03-07 04:52:57,996][118044] Updated weights for policy 0, policy_version 69460 (0.0006) [2023-03-07 04:52:58,767][118044] Updated weights for policy 0, policy_version 69470 (0.0006) [2023-03-07 04:52:59,537][118044] Updated weights for policy 0, policy_version 69480 (0.0005) [2023-03-07 04:53:00,325][118044] Updated weights for policy 0, policy_version 69490 (0.0006) [2023-03-07 04:53:01,086][117718] Fps is (10 sec: 13004.7, 60 sec: 13124.2, 300 sec: 13148.9). Total num frames: 71166976. Throughput: 0: 13137.4. Samples: 71146963. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:53:01,086][117718] Avg episode reward: [(0, '2843.339')] [2023-03-07 04:53:01,101][118044] Updated weights for policy 0, policy_version 69500 (0.0006) [2023-03-07 04:53:01,873][118044] Updated weights for policy 0, policy_version 69510 (0.0006) [2023-03-07 04:53:02,661][118044] Updated weights for policy 0, policy_version 69520 (0.0006) [2023-03-07 04:53:03,431][118044] Updated weights for policy 0, policy_version 69530 (0.0006) [2023-03-07 04:53:04,219][118044] Updated weights for policy 0, policy_version 69540 (0.0006) [2023-03-07 04:53:04,993][118044] Updated weights for policy 0, policy_version 69550 (0.0006) [2023-03-07 04:53:05,766][118044] Updated weights for policy 0, policy_version 69560 (0.0006) [2023-03-07 04:53:06,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 71233536. Throughput: 0: 13133.7. Samples: 71225635. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:53:06,096][117718] Avg episode reward: [(0, '2953.295')] [2023-03-07 04:53:06,544][118044] Updated weights for policy 0, policy_version 69570 (0.0007) [2023-03-07 04:53:07,330][118044] Updated weights for policy 0, policy_version 69580 (0.0006) [2023-03-07 04:53:08,099][118044] Updated weights for policy 0, policy_version 69590 (0.0007) [2023-03-07 04:53:08,893][118044] Updated weights for policy 0, policy_version 69600 (0.0006) [2023-03-07 04:53:09,672][118044] Updated weights for policy 0, policy_version 69610 (0.0006) [2023-03-07 04:53:10,447][118044] Updated weights for policy 0, policy_version 69620 (0.0006) [2023-03-07 04:53:11,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 71299072. Throughput: 0: 13133.4. Samples: 71265231. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:53:11,086][117718] Avg episode reward: [(0, '2941.978')] [2023-03-07 04:53:11,238][118044] Updated weights for policy 0, policy_version 69630 (0.0005) [2023-03-07 04:53:12,005][118044] Updated weights for policy 0, policy_version 69640 (0.0006) [2023-03-07 04:53:12,783][118044] Updated weights for policy 0, policy_version 69650 (0.0007) [2023-03-07 04:53:13,550][118044] Updated weights for policy 0, policy_version 69660 (0.0006) [2023-03-07 04:53:14,315][118044] Updated weights for policy 0, policy_version 69670 (0.0006) [2023-03-07 04:53:15,101][118044] Updated weights for policy 0, policy_version 69680 (0.0006) [2023-03-07 04:53:15,862][118044] Updated weights for policy 0, policy_version 69690 (0.0006) [2023-03-07 04:53:16,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.4, 300 sec: 13148.9). Total num frames: 71364608. Throughput: 0: 13145.1. Samples: 71344317. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:53:16,086][117718] Avg episode reward: [(0, '2874.780')] [2023-03-07 04:53:16,646][118044] Updated weights for policy 0, policy_version 69700 (0.0006) [2023-03-07 04:53:17,425][118044] Updated weights for policy 0, policy_version 69710 (0.0006) [2023-03-07 04:53:18,192][118044] Updated weights for policy 0, policy_version 69720 (0.0007) [2023-03-07 04:53:18,954][118044] Updated weights for policy 0, policy_version 69730 (0.0005) [2023-03-07 04:53:19,739][118044] Updated weights for policy 0, policy_version 69740 (0.0006) [2023-03-07 04:53:20,522][118044] Updated weights for policy 0, policy_version 69750 (0.0006) [2023-03-07 04:53:21,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 71431168. Throughput: 0: 13158.0. Samples: 71423771. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:53:21,086][117718] Avg episode reward: [(0, '2957.211')] [2023-03-07 04:53:21,290][118044] Updated weights for policy 0, policy_version 69760 (0.0006) [2023-03-07 04:53:22,051][118044] Updated weights for policy 0, policy_version 69770 (0.0005) [2023-03-07 04:53:22,829][118044] Updated weights for policy 0, policy_version 69780 (0.0006) [2023-03-07 04:53:23,614][118044] Updated weights for policy 0, policy_version 69790 (0.0006) [2023-03-07 04:53:24,398][118044] Updated weights for policy 0, policy_version 69800 (0.0006) [2023-03-07 04:53:25,170][118044] Updated weights for policy 0, policy_version 69810 (0.0006) [2023-03-07 04:53:25,952][118044] Updated weights for policy 0, policy_version 69820 (0.0006) [2023-03-07 04:53:26,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 71496704. Throughput: 0: 13160.1. Samples: 71463272. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:53:26,086][117718] Avg episode reward: [(0, '3022.308')] [2023-03-07 04:53:26,737][118044] Updated weights for policy 0, policy_version 69830 (0.0006) [2023-03-07 04:53:27,504][118044] Updated weights for policy 0, policy_version 69840 (0.0006) [2023-03-07 04:53:28,284][118044] Updated weights for policy 0, policy_version 69850 (0.0006) [2023-03-07 04:53:29,073][118044] Updated weights for policy 0, policy_version 69860 (0.0006) [2023-03-07 04:53:29,848][118044] Updated weights for policy 0, policy_version 69870 (0.0006) [2023-03-07 04:53:30,618][118044] Updated weights for policy 0, policy_version 69880 (0.0007) [2023-03-07 04:53:31,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 71562240. Throughput: 0: 13154.1. Samples: 71542130. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:53:31,086][117718] Avg episode reward: [(0, '2885.798')] [2023-03-07 04:53:31,425][118044] Updated weights for policy 0, policy_version 69890 (0.0006) [2023-03-07 04:53:32,209][118044] Updated weights for policy 0, policy_version 69900 (0.0006) [2023-03-07 04:53:32,984][118044] Updated weights for policy 0, policy_version 69910 (0.0006) [2023-03-07 04:53:33,746][118044] Updated weights for policy 0, policy_version 69920 (0.0006) [2023-03-07 04:53:34,536][118044] Updated weights for policy 0, policy_version 69930 (0.0007) [2023-03-07 04:53:35,309][118044] Updated weights for policy 0, policy_version 69940 (0.0006) [2023-03-07 04:53:36,071][118044] Updated weights for policy 0, policy_version 69950 (0.0007) [2023-03-07 04:53:36,085][117718] Fps is (10 sec: 13209.8, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 71628800. Throughput: 0: 13154.4. Samples: 71620984. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:53:36,086][117718] Avg episode reward: [(0, '3023.593')] [2023-03-07 04:53:36,873][118044] Updated weights for policy 0, policy_version 69960 (0.0006) [2023-03-07 04:53:37,646][118044] Updated weights for policy 0, policy_version 69970 (0.0005) [2023-03-07 04:53:38,425][118044] Updated weights for policy 0, policy_version 69980 (0.0006) [2023-03-07 04:53:39,198][118044] Updated weights for policy 0, policy_version 69990 (0.0006) [2023-03-07 04:53:39,983][118044] Updated weights for policy 0, policy_version 70000 (0.0006) [2023-03-07 04:53:40,756][118044] Updated weights for policy 0, policy_version 70010 (0.0006) [2023-03-07 04:53:41,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 71694336. Throughput: 0: 13156.0. Samples: 71660347. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:53:41,086][117718] Avg episode reward: [(0, '2925.771')] [2023-03-07 04:53:41,538][118044] Updated weights for policy 0, policy_version 70020 (0.0006) [2023-03-07 04:53:42,307][118044] Updated weights for policy 0, policy_version 70030 (0.0006) [2023-03-07 04:53:43,084][118044] Updated weights for policy 0, policy_version 70040 (0.0007) [2023-03-07 04:53:43,852][118044] Updated weights for policy 0, policy_version 70050 (0.0007) [2023-03-07 04:53:44,640][118044] Updated weights for policy 0, policy_version 70060 (0.0005) [2023-03-07 04:53:45,412][118044] Updated weights for policy 0, policy_version 70070 (0.0005) [2023-03-07 04:53:46,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 71759872. Throughput: 0: 13165.6. Samples: 71739416. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:53:46,086][117718] Avg episode reward: [(0, '2861.996')] [2023-03-07 04:53:46,181][118044] Updated weights for policy 0, policy_version 70080 (0.0007) [2023-03-07 04:53:46,978][118044] Updated weights for policy 0, policy_version 70090 (0.0006) [2023-03-07 04:53:47,758][118044] Updated weights for policy 0, policy_version 70100 (0.0006) [2023-03-07 04:53:48,546][118044] Updated weights for policy 0, policy_version 70110 (0.0006) [2023-03-07 04:53:49,338][118044] Updated weights for policy 0, policy_version 70120 (0.0006) [2023-03-07 04:53:50,107][118044] Updated weights for policy 0, policy_version 70130 (0.0006) [2023-03-07 04:53:50,878][118044] Updated weights for policy 0, policy_version 70140 (0.0006) [2023-03-07 04:53:51,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13152.3). Total num frames: 71825408. Throughput: 0: 13170.4. Samples: 71818305. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:53:51,086][117718] Avg episode reward: [(0, '2982.733')] [2023-03-07 04:53:51,663][118044] Updated weights for policy 0, policy_version 70150 (0.0006) [2023-03-07 04:53:52,437][118044] Updated weights for policy 0, policy_version 70160 (0.0006) [2023-03-07 04:53:53,191][118044] Updated weights for policy 0, policy_version 70170 (0.0005) [2023-03-07 04:53:53,989][118044] Updated weights for policy 0, policy_version 70180 (0.0006) [2023-03-07 04:53:54,755][118044] Updated weights for policy 0, policy_version 70190 (0.0006) [2023-03-07 04:53:55,563][118044] Updated weights for policy 0, policy_version 70200 (0.0006) [2023-03-07 04:53:56,085][117718] Fps is (10 sec: 13209.8, 60 sec: 13175.5, 300 sec: 13155.8). Total num frames: 71891968. Throughput: 0: 13169.1. Samples: 71857841. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:53:56,086][117718] Avg episode reward: [(0, '3002.979')] [2023-03-07 04:53:56,334][118044] Updated weights for policy 0, policy_version 70210 (0.0006) [2023-03-07 04:53:57,110][118044] Updated weights for policy 0, policy_version 70220 (0.0006) [2023-03-07 04:53:57,884][118044] Updated weights for policy 0, policy_version 70230 (0.0006) [2023-03-07 04:53:58,659][118044] Updated weights for policy 0, policy_version 70240 (0.0006) [2023-03-07 04:53:59,438][118044] Updated weights for policy 0, policy_version 70250 (0.0006) [2023-03-07 04:54:00,217][118044] Updated weights for policy 0, policy_version 70260 (0.0006) [2023-03-07 04:54:01,014][118044] Updated weights for policy 0, policy_version 70270 (0.0006) [2023-03-07 04:54:01,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 71956480. Throughput: 0: 13163.2. Samples: 71936661. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 04:54:01,086][117718] Avg episode reward: [(0, '2966.240')] [2023-03-07 04:54:01,793][118044] Updated weights for policy 0, policy_version 70280 (0.0007) [2023-03-07 04:54:02,551][118044] Updated weights for policy 0, policy_version 70290 (0.0006) [2023-03-07 04:54:03,335][118044] Updated weights for policy 0, policy_version 70300 (0.0006) [2023-03-07 04:54:04,146][118044] Updated weights for policy 0, policy_version 70310 (0.0006) [2023-03-07 04:54:04,913][118044] Updated weights for policy 0, policy_version 70320 (0.0006) [2023-03-07 04:54:05,692][118044] Updated weights for policy 0, policy_version 70330 (0.0006) [2023-03-07 04:54:06,086][117718] Fps is (10 sec: 13004.6, 60 sec: 13141.3, 300 sec: 13148.8). Total num frames: 72022016. Throughput: 0: 13142.6. Samples: 72015190. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:54:06,086][117718] Avg episode reward: [(0, '2950.482')] [2023-03-07 04:54:06,478][118044] Updated weights for policy 0, policy_version 70340 (0.0006) [2023-03-07 04:54:07,262][118044] Updated weights for policy 0, policy_version 70350 (0.0006) [2023-03-07 04:54:08,059][118044] Updated weights for policy 0, policy_version 70360 (0.0006) [2023-03-07 04:54:08,831][118044] Updated weights for policy 0, policy_version 70370 (0.0006) [2023-03-07 04:54:09,616][118044] Updated weights for policy 0, policy_version 70380 (0.0006) [2023-03-07 04:54:10,397][118044] Updated weights for policy 0, policy_version 70390 (0.0006) [2023-03-07 04:54:11,085][117718] Fps is (10 sec: 13107.0, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 72087552. Throughput: 0: 13134.9. Samples: 72054341. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:54:11,086][117718] Avg episode reward: [(0, '2952.976')] [2023-03-07 04:54:11,198][118044] Updated weights for policy 0, policy_version 70400 (0.0006) [2023-03-07 04:54:11,984][118044] Updated weights for policy 0, policy_version 70410 (0.0005) [2023-03-07 04:54:12,752][118044] Updated weights for policy 0, policy_version 70420 (0.0006) [2023-03-07 04:54:13,541][118044] Updated weights for policy 0, policy_version 70430 (0.0006) [2023-03-07 04:54:14,311][118044] Updated weights for policy 0, policy_version 70440 (0.0007) [2023-03-07 04:54:15,074][118044] Updated weights for policy 0, policy_version 70450 (0.0006) [2023-03-07 04:54:15,865][118044] Updated weights for policy 0, policy_version 70460 (0.0007) [2023-03-07 04:54:16,085][117718] Fps is (10 sec: 13209.8, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 72154112. Throughput: 0: 13129.2. Samples: 72132943. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:54:16,086][117718] Avg episode reward: [(0, '2957.649')] [2023-03-07 04:54:16,656][118044] Updated weights for policy 0, policy_version 70470 (0.0006) [2023-03-07 04:54:17,437][118044] Updated weights for policy 0, policy_version 70480 (0.0006) [2023-03-07 04:54:18,210][118044] Updated weights for policy 0, policy_version 70490 (0.0006) [2023-03-07 04:54:18,999][118044] Updated weights for policy 0, policy_version 70500 (0.0006) [2023-03-07 04:54:19,757][118044] Updated weights for policy 0, policy_version 70510 (0.0006) [2023-03-07 04:54:20,535][118044] Updated weights for policy 0, policy_version 70520 (0.0006) [2023-03-07 04:54:21,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13145.4). Total num frames: 72218624. Throughput: 0: 13127.0. Samples: 72211698. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:54:21,086][117718] Avg episode reward: [(0, '2902.887')] [2023-03-07 04:54:21,310][118044] Updated weights for policy 0, policy_version 70530 (0.0006) [2023-03-07 04:54:22,081][118044] Updated weights for policy 0, policy_version 70540 (0.0006) [2023-03-07 04:54:22,858][118044] Updated weights for policy 0, policy_version 70550 (0.0006) [2023-03-07 04:54:23,628][118044] Updated weights for policy 0, policy_version 70560 (0.0006) [2023-03-07 04:54:24,405][118044] Updated weights for policy 0, policy_version 70570 (0.0005) [2023-03-07 04:54:25,183][118044] Updated weights for policy 0, policy_version 70580 (0.0006) [2023-03-07 04:54:25,962][118044] Updated weights for policy 0, policy_version 70590 (0.0006) [2023-03-07 04:54:26,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.4, 300 sec: 13148.9). Total num frames: 72285184. Throughput: 0: 13136.1. Samples: 72251473. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:54:26,086][117718] Avg episode reward: [(0, '2851.007')] [2023-03-07 04:54:26,741][118044] Updated weights for policy 0, policy_version 70600 (0.0006) [2023-03-07 04:54:27,525][118044] Updated weights for policy 0, policy_version 70610 (0.0006) [2023-03-07 04:54:28,305][118044] Updated weights for policy 0, policy_version 70620 (0.0006) [2023-03-07 04:54:29,090][118044] Updated weights for policy 0, policy_version 70630 (0.0007) [2023-03-07 04:54:29,881][118044] Updated weights for policy 0, policy_version 70640 (0.0006) [2023-03-07 04:54:30,662][118044] Updated weights for policy 0, policy_version 70650 (0.0006) [2023-03-07 04:54:31,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13141.4, 300 sec: 13148.9). Total num frames: 72350720. Throughput: 0: 13133.1. Samples: 72330405. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:54:31,086][117718] Avg episode reward: [(0, '2868.574')] [2023-03-07 04:54:31,447][118044] Updated weights for policy 0, policy_version 70660 (0.0007) [2023-03-07 04:54:32,218][118044] Updated weights for policy 0, policy_version 70670 (0.0006) [2023-03-07 04:54:33,000][118044] Updated weights for policy 0, policy_version 70680 (0.0006) [2023-03-07 04:54:33,770][118044] Updated weights for policy 0, policy_version 70690 (0.0006) [2023-03-07 04:54:34,548][118044] Updated weights for policy 0, policy_version 70700 (0.0005) [2023-03-07 04:54:35,325][118044] Updated weights for policy 0, policy_version 70710 (0.0005) [2023-03-07 04:54:36,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13148.9). Total num frames: 72416256. Throughput: 0: 13130.4. Samples: 72409174. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:54:36,086][117718] Avg episode reward: [(0, '2943.829')] [2023-03-07 04:54:36,090][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000070719_72416256.pth... 
[2023-03-07 04:54:36,121][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000067637_69260288.pth [2023-03-07 04:54:36,146][118044] Updated weights for policy 0, policy_version 70720 (0.0006) [2023-03-07 04:54:36,887][118044] Updated weights for policy 0, policy_version 70730 (0.0006) [2023-03-07 04:54:37,665][118044] Updated weights for policy 0, policy_version 70740 (0.0007) [2023-03-07 04:54:38,457][118044] Updated weights for policy 0, policy_version 70750 (0.0006) [2023-03-07 04:54:39,235][118044] Updated weights for policy 0, policy_version 70760 (0.0006) [2023-03-07 04:54:39,998][118044] Updated weights for policy 0, policy_version 70770 (0.0006) [2023-03-07 04:54:40,777][118044] Updated weights for policy 0, policy_version 70780 (0.0006) [2023-03-07 04:54:41,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13124.2, 300 sec: 13148.9). Total num frames: 72481792. Throughput: 0: 13123.7. Samples: 72448408. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:54:41,086][117718] Avg episode reward: [(0, '2982.871')] [2023-03-07 04:54:41,569][118044] Updated weights for policy 0, policy_version 70790 (0.0007) [2023-03-07 04:54:42,345][118044] Updated weights for policy 0, policy_version 70800 (0.0006) [2023-03-07 04:54:43,125][118044] Updated weights for policy 0, policy_version 70810 (0.0006) [2023-03-07 04:54:43,918][118044] Updated weights for policy 0, policy_version 70820 (0.0006) [2023-03-07 04:54:44,682][118044] Updated weights for policy 0, policy_version 70830 (0.0006) [2023-03-07 04:54:45,470][118044] Updated weights for policy 0, policy_version 70840 (0.0006) [2023-03-07 04:54:46,086][117718] Fps is (10 sec: 13106.9, 60 sec: 13124.2, 300 sec: 13145.4). Total num frames: 72547328. Throughput: 0: 13123.6. Samples: 72527229. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:54:46,086][117718] Avg episode reward: [(0, '2868.035')] [2023-03-07 04:54:46,248][118044] Updated weights for policy 0, policy_version 70850 (0.0006) [2023-03-07 04:54:47,024][118044] Updated weights for policy 0, policy_version 70860 (0.0006) [2023-03-07 04:54:47,810][118044] Updated weights for policy 0, policy_version 70870 (0.0006) [2023-03-07 04:54:48,588][118044] Updated weights for policy 0, policy_version 70880 (0.0006) [2023-03-07 04:54:49,361][118044] Updated weights for policy 0, policy_version 70890 (0.0005) [2023-03-07 04:54:50,133][118044] Updated weights for policy 0, policy_version 70900 (0.0006) [2023-03-07 04:54:50,900][118044] Updated weights for policy 0, policy_version 70910 (0.0006) [2023-03-07 04:54:51,085][117718] Fps is (10 sec: 13209.8, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 72613888. Throughput: 0: 13137.7. Samples: 72606383. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:54:51,086][117718] Avg episode reward: [(0, '2864.797')] [2023-03-07 04:54:51,671][118044] Updated weights for policy 0, policy_version 70920 (0.0006) [2023-03-07 04:54:52,442][118044] Updated weights for policy 0, policy_version 70930 (0.0006) [2023-03-07 04:54:53,227][118044] Updated weights for policy 0, policy_version 70940 (0.0006) [2023-03-07 04:54:54,011][118044] Updated weights for policy 0, policy_version 70950 (0.0006) [2023-03-07 04:54:54,782][118044] Updated weights for policy 0, policy_version 70960 (0.0006) [2023-03-07 04:54:55,582][118044] Updated weights for policy 0, policy_version 70970 (0.0006) [2023-03-07 04:54:56,086][117718] Fps is (10 sec: 13209.7, 60 sec: 13124.2, 300 sec: 13152.3). Total num frames: 72679424. Throughput: 0: 13148.2. Samples: 72646009. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:54:56,086][117718] Avg episode reward: [(0, '2871.371')] [2023-03-07 04:54:56,355][118044] Updated weights for policy 0, policy_version 70980 (0.0006) [2023-03-07 04:54:57,126][118044] Updated weights for policy 0, policy_version 70990 (0.0005) [2023-03-07 04:54:57,899][118044] Updated weights for policy 0, policy_version 71000 (0.0006) [2023-03-07 04:54:58,691][118044] Updated weights for policy 0, policy_version 71010 (0.0006) [2023-03-07 04:54:59,468][118044] Updated weights for policy 0, policy_version 71020 (0.0006) [2023-03-07 04:55:00,249][118044] Updated weights for policy 0, policy_version 71030 (0.0007) [2023-03-07 04:55:01,024][118044] Updated weights for policy 0, policy_version 71040 (0.0007) [2023-03-07 04:55:01,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13152.3). Total num frames: 72744960. Throughput: 0: 13150.0. Samples: 72724695. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:55:01,086][117718] Avg episode reward: [(0, '2790.906')] [2023-03-07 04:55:01,811][118044] Updated weights for policy 0, policy_version 71050 (0.0006) [2023-03-07 04:55:02,590][118044] Updated weights for policy 0, policy_version 71060 (0.0006) [2023-03-07 04:55:03,358][118044] Updated weights for policy 0, policy_version 71070 (0.0006) [2023-03-07 04:55:04,129][118044] Updated weights for policy 0, policy_version 71080 (0.0006) [2023-03-07 04:55:04,923][118044] Updated weights for policy 0, policy_version 71090 (0.0006) [2023-03-07 04:55:05,677][118044] Updated weights for policy 0, policy_version 71100 (0.0006) [2023-03-07 04:55:06,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 72811520. Throughput: 0: 13159.0. Samples: 72803853. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:55:06,086][117718] Avg episode reward: [(0, '2883.380')] [2023-03-07 04:55:06,449][118044] Updated weights for policy 0, policy_version 71110 (0.0007) [2023-03-07 04:55:07,234][118044] Updated weights for policy 0, policy_version 71120 (0.0007) [2023-03-07 04:55:07,995][118044] Updated weights for policy 0, policy_version 71130 (0.0007) [2023-03-07 04:55:08,785][118044] Updated weights for policy 0, policy_version 71140 (0.0006) [2023-03-07 04:55:09,566][118044] Updated weights for policy 0, policy_version 71150 (0.0005) [2023-03-07 04:55:10,355][118044] Updated weights for policy 0, policy_version 71160 (0.0006) [2023-03-07 04:55:11,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 72877056. Throughput: 0: 13154.6. Samples: 72843429. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:55:11,086][117718] Avg episode reward: [(0, '2872.780')] [2023-03-07 04:55:11,132][118044] Updated weights for policy 0, policy_version 71170 (0.0008) [2023-03-07 04:55:11,922][118044] Updated weights for policy 0, policy_version 71180 (0.0006) [2023-03-07 04:55:12,698][118044] Updated weights for policy 0, policy_version 71190 (0.0006) [2023-03-07 04:55:13,474][118044] Updated weights for policy 0, policy_version 71200 (0.0006) [2023-03-07 04:55:14,253][118044] Updated weights for policy 0, policy_version 71210 (0.0006) [2023-03-07 04:55:15,031][118044] Updated weights for policy 0, policy_version 71220 (0.0006) [2023-03-07 04:55:15,806][118044] Updated weights for policy 0, policy_version 71230 (0.0006) [2023-03-07 04:55:16,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13152.3). Total num frames: 72942592. Throughput: 0: 13148.3. Samples: 72922081. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:55:16,086][117718] Avg episode reward: [(0, '2706.255')] [2023-03-07 04:55:16,588][118044] Updated weights for policy 0, policy_version 71240 (0.0007) [2023-03-07 04:55:17,347][118044] Updated weights for policy 0, policy_version 71250 (0.0006) [2023-03-07 04:55:18,120][118044] Updated weights for policy 0, policy_version 71260 (0.0007) [2023-03-07 04:55:18,928][118044] Updated weights for policy 0, policy_version 71270 (0.0007) [2023-03-07 04:55:19,694][118044] Updated weights for policy 0, policy_version 71280 (0.0006) [2023-03-07 04:55:20,478][118044] Updated weights for policy 0, policy_version 71290 (0.0006) [2023-03-07 04:55:21,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13148.8). Total num frames: 73008128. Throughput: 0: 13154.7. Samples: 73001138. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:55:21,086][117718] Avg episode reward: [(0, '2767.882')] [2023-03-07 04:55:21,263][118044] Updated weights for policy 0, policy_version 71300 (0.0006) [2023-03-07 04:55:22,046][118044] Updated weights for policy 0, policy_version 71310 (0.0005) [2023-03-07 04:55:22,819][118044] Updated weights for policy 0, policy_version 71320 (0.0008) [2023-03-07 04:55:23,588][118044] Updated weights for policy 0, policy_version 71330 (0.0006) [2023-03-07 04:55:24,369][118044] Updated weights for policy 0, policy_version 71340 (0.0006) [2023-03-07 04:55:25,159][118044] Updated weights for policy 0, policy_version 71350 (0.0006) [2023-03-07 04:55:25,934][118044] Updated weights for policy 0, policy_version 71360 (0.0006) [2023-03-07 04:55:26,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13155.8). Total num frames: 73074688. Throughput: 0: 13159.6. Samples: 73040588. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:55:26,086][117718] Avg episode reward: [(0, '2712.576')] [2023-03-07 04:55:26,691][118044] Updated weights for policy 0, policy_version 71370 (0.0006) [2023-03-07 04:55:27,487][118044] Updated weights for policy 0, policy_version 71380 (0.0006) [2023-03-07 04:55:28,263][118044] Updated weights for policy 0, policy_version 71390 (0.0005) [2023-03-07 04:55:29,036][118044] Updated weights for policy 0, policy_version 71400 (0.0007) [2023-03-07 04:55:29,832][118044] Updated weights for policy 0, policy_version 71410 (0.0006) [2023-03-07 04:55:30,602][118044] Updated weights for policy 0, policy_version 71420 (0.0006) [2023-03-07 04:55:31,085][117718] Fps is (10 sec: 13209.8, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 73140224. Throughput: 0: 13162.2. Samples: 73119526. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:55:31,086][117718] Avg episode reward: [(0, '2683.101')] [2023-03-07 04:55:31,389][118044] Updated weights for policy 0, policy_version 71430 (0.0006) [2023-03-07 04:55:32,151][118044] Updated weights for policy 0, policy_version 71440 (0.0006) [2023-03-07 04:55:32,945][118044] Updated weights for policy 0, policy_version 71450 (0.0006) [2023-03-07 04:55:33,734][118044] Updated weights for policy 0, policy_version 71460 (0.0007) [2023-03-07 04:55:34,505][118044] Updated weights for policy 0, policy_version 71470 (0.0006) [2023-03-07 04:55:35,279][118044] Updated weights for policy 0, policy_version 71480 (0.0006) [2023-03-07 04:55:36,068][118044] Updated weights for policy 0, policy_version 71490 (0.0007) [2023-03-07 04:55:36,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 73205760. Throughput: 0: 13153.6. Samples: 73198298. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:55:36,086][117718] Avg episode reward: [(0, '2806.803')] [2023-03-07 04:55:36,844][118044] Updated weights for policy 0, policy_version 71500 (0.0006) [2023-03-07 04:55:37,609][118044] Updated weights for policy 0, policy_version 71510 (0.0006) [2023-03-07 04:55:38,393][118044] Updated weights for policy 0, policy_version 71520 (0.0007) [2023-03-07 04:55:39,158][118044] Updated weights for policy 0, policy_version 71530 (0.0006) [2023-03-07 04:55:39,943][118044] Updated weights for policy 0, policy_version 71540 (0.0006) [2023-03-07 04:55:40,720][118044] Updated weights for policy 0, policy_version 71550 (0.0006) [2023-03-07 04:55:41,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 73271296. Throughput: 0: 13149.5. Samples: 73237733. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:55:41,086][117718] Avg episode reward: [(0, '2712.218')] [2023-03-07 04:55:41,506][118044] Updated weights for policy 0, policy_version 71560 (0.0006) [2023-03-07 04:55:42,262][118044] Updated weights for policy 0, policy_version 71570 (0.0006) [2023-03-07 04:55:43,049][118044] Updated weights for policy 0, policy_version 71580 (0.0006) [2023-03-07 04:55:43,814][118044] Updated weights for policy 0, policy_version 71590 (0.0006) [2023-03-07 04:55:44,589][118044] Updated weights for policy 0, policy_version 71600 (0.0006) [2023-03-07 04:55:45,389][118044] Updated weights for policy 0, policy_version 71610 (0.0006) [2023-03-07 04:55:46,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 73336832. Throughput: 0: 13160.4. Samples: 73316913. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:55:46,086][117718] Avg episode reward: [(0, '2734.066')] [2023-03-07 04:55:46,146][118044] Updated weights for policy 0, policy_version 71620 (0.0006) [2023-03-07 04:55:46,941][118044] Updated weights for policy 0, policy_version 71630 (0.0007) [2023-03-07 04:55:47,725][118044] Updated weights for policy 0, policy_version 71640 (0.0006) [2023-03-07 04:55:48,498][118044] Updated weights for policy 0, policy_version 71650 (0.0005) [2023-03-07 04:55:49,273][118044] Updated weights for policy 0, policy_version 71660 (0.0006) [2023-03-07 04:55:50,048][118044] Updated weights for policy 0, policy_version 71670 (0.0007) [2023-03-07 04:55:50,815][118044] Updated weights for policy 0, policy_version 71680 (0.0006) [2023-03-07 04:55:51,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 73403392. Throughput: 0: 13157.1. Samples: 73395923. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:55:51,086][117718] Avg episode reward: [(0, '2795.157')] [2023-03-07 04:55:51,612][118044] Updated weights for policy 0, policy_version 71690 (0.0007) [2023-03-07 04:55:52,389][118044] Updated weights for policy 0, policy_version 71700 (0.0006) [2023-03-07 04:55:53,173][118044] Updated weights for policy 0, policy_version 71710 (0.0006) [2023-03-07 04:55:53,957][118044] Updated weights for policy 0, policy_version 71720 (0.0006) [2023-03-07 04:55:54,732][118044] Updated weights for policy 0, policy_version 71730 (0.0006) [2023-03-07 04:55:55,496][118044] Updated weights for policy 0, policy_version 71740 (0.0005) [2023-03-07 04:55:56,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 73468928. Throughput: 0: 13153.2. Samples: 73435322. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:55:56,086][117718] Avg episode reward: [(0, '2726.218')] [2023-03-07 04:55:56,275][118044] Updated weights for policy 0, policy_version 71750 (0.0006) [2023-03-07 04:55:57,041][118044] Updated weights for policy 0, policy_version 71760 (0.0006) [2023-03-07 04:55:57,821][118044] Updated weights for policy 0, policy_version 71770 (0.0006) [2023-03-07 04:55:58,605][118044] Updated weights for policy 0, policy_version 71780 (0.0006) [2023-03-07 04:55:59,397][118044] Updated weights for policy 0, policy_version 71790 (0.0006) [2023-03-07 04:56:00,180][118044] Updated weights for policy 0, policy_version 71800 (0.0006) [2023-03-07 04:56:00,937][118044] Updated weights for policy 0, policy_version 71810 (0.0007) [2023-03-07 04:56:01,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 73534464. Throughput: 0: 13156.0. Samples: 73514099. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:56:01,086][117718] Avg episode reward: [(0, '2621.417')] [2023-03-07 04:56:01,722][118044] Updated weights for policy 0, policy_version 71820 (0.0006) [2023-03-07 04:56:02,499][118044] Updated weights for policy 0, policy_version 71830 (0.0006) [2023-03-07 04:56:03,267][118044] Updated weights for policy 0, policy_version 71840 (0.0006) [2023-03-07 04:56:04,045][118044] Updated weights for policy 0, policy_version 71850 (0.0007) [2023-03-07 04:56:04,823][118044] Updated weights for policy 0, policy_version 71860 (0.0006) [2023-03-07 04:56:05,610][118044] Updated weights for policy 0, policy_version 71870 (0.0006) [2023-03-07 04:56:06,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13155.8). Total num frames: 73601024. Throughput: 0: 13159.6. Samples: 73593315. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:56:06,086][117718] Avg episode reward: [(0, '2872.586')] [2023-03-07 04:56:06,388][118044] Updated weights for policy 0, policy_version 71880 (0.0006) [2023-03-07 04:56:07,147][118044] Updated weights for policy 0, policy_version 71890 (0.0006) [2023-03-07 04:56:07,918][118044] Updated weights for policy 0, policy_version 71900 (0.0005) [2023-03-07 04:56:08,700][118044] Updated weights for policy 0, policy_version 71910 (0.0007) [2023-03-07 04:56:09,469][118044] Updated weights for policy 0, policy_version 71920 (0.0007) [2023-03-07 04:56:10,245][118044] Updated weights for policy 0, policy_version 71930 (0.0006) [2023-03-07 04:56:11,046][118044] Updated weights for policy 0, policy_version 71940 (0.0006) [2023-03-07 04:56:11,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13155.8). Total num frames: 73666560. Throughput: 0: 13168.4. Samples: 73633168. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:56:11,086][117718] Avg episode reward: [(0, '2822.353')] [2023-03-07 04:56:11,828][118044] Updated weights for policy 0, policy_version 71950 (0.0006) [2023-03-07 04:56:12,616][118044] Updated weights for policy 0, policy_version 71960 (0.0006) [2023-03-07 04:56:13,373][118044] Updated weights for policy 0, policy_version 71970 (0.0006) [2023-03-07 04:56:14,147][118044] Updated weights for policy 0, policy_version 71980 (0.0006) [2023-03-07 04:56:14,918][118044] Updated weights for policy 0, policy_version 71990 (0.0006) [2023-03-07 04:56:15,700][118044] Updated weights for policy 0, policy_version 72000 (0.0006) [2023-03-07 04:56:16,086][117718] Fps is (10 sec: 13209.2, 60 sec: 13175.4, 300 sec: 13159.3). Total num frames: 73733120. Throughput: 0: 13164.6. Samples: 73711936. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:56:16,086][117718] Avg episode reward: [(0, '2825.695')] [2023-03-07 04:56:16,460][118044] Updated weights for policy 0, policy_version 72010 (0.0007) [2023-03-07 04:56:17,233][118044] Updated weights for policy 0, policy_version 72020 (0.0006) [2023-03-07 04:56:18,017][118044] Updated weights for policy 0, policy_version 72030 (0.0007) [2023-03-07 04:56:18,793][118044] Updated weights for policy 0, policy_version 72040 (0.0006) [2023-03-07 04:56:19,565][118044] Updated weights for policy 0, policy_version 72050 (0.0006) [2023-03-07 04:56:20,349][118044] Updated weights for policy 0, policy_version 72060 (0.0005) [2023-03-07 04:56:21,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13155.8). Total num frames: 73798656. Throughput: 0: 13180.9. Samples: 73791436. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:56:21,086][117718] Avg episode reward: [(0, '2729.947')] [2023-03-07 04:56:21,107][118044] Updated weights for policy 0, policy_version 72070 (0.0006) [2023-03-07 04:56:21,884][118044] Updated weights for policy 0, policy_version 72080 (0.0005) [2023-03-07 04:56:22,670][118044] Updated weights for policy 0, policy_version 72090 (0.0006) [2023-03-07 04:56:23,453][118044] Updated weights for policy 0, policy_version 72100 (0.0006) [2023-03-07 04:56:24,235][118044] Updated weights for policy 0, policy_version 72110 (0.0006) [2023-03-07 04:56:25,023][118044] Updated weights for policy 0, policy_version 72120 (0.0007) [2023-03-07 04:56:25,821][118044] Updated weights for policy 0, policy_version 72130 (0.0006) [2023-03-07 04:56:26,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13155.8). Total num frames: 73864192. Throughput: 0: 13182.6. Samples: 73830954. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:56:26,087][117718] Avg episode reward: [(0, '2703.355')] [2023-03-07 04:56:26,588][118044] Updated weights for policy 0, policy_version 72140 (0.0006) [2023-03-07 04:56:27,354][118044] Updated weights for policy 0, policy_version 72150 (0.0006) [2023-03-07 04:56:28,117][118044] Updated weights for policy 0, policy_version 72160 (0.0005) [2023-03-07 04:56:28,919][118044] Updated weights for policy 0, policy_version 72170 (0.0006) [2023-03-07 04:56:29,705][118044] Updated weights for policy 0, policy_version 72180 (0.0006) [2023-03-07 04:56:30,465][118044] Updated weights for policy 0, policy_version 72190 (0.0005) [2023-03-07 04:56:31,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 73929728. Throughput: 0: 13168.5. Samples: 73909498. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:56:31,086][117718] Avg episode reward: [(0, '2659.617')] [2023-03-07 04:56:31,253][118044] Updated weights for policy 0, policy_version 72200 (0.0006) [2023-03-07 04:56:32,026][118044] Updated weights for policy 0, policy_version 72210 (0.0007) [2023-03-07 04:56:32,802][118044] Updated weights for policy 0, policy_version 72220 (0.0007) [2023-03-07 04:56:33,574][118044] Updated weights for policy 0, policy_version 72230 (0.0006) [2023-03-07 04:56:34,361][118044] Updated weights for policy 0, policy_version 72240 (0.0007) [2023-03-07 04:56:35,126][118044] Updated weights for policy 0, policy_version 72250 (0.0006) [2023-03-07 04:56:35,928][118044] Updated weights for policy 0, policy_version 72260 (0.0006) [2023-03-07 04:56:36,086][117718] Fps is (10 sec: 13209.8, 60 sec: 13175.5, 300 sec: 13152.3). Total num frames: 73996288. Throughput: 0: 13163.8. Samples: 73988296. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:56:36,086][117718] Avg episode reward: [(0, '2639.804')] [2023-03-07 04:56:36,090][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000072262_73996288.pth... [2023-03-07 04:56:36,120][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000069179_70839296.pth [2023-03-07 04:56:36,714][118044] Updated weights for policy 0, policy_version 72270 (0.0006) [2023-03-07 04:56:37,491][118044] Updated weights for policy 0, policy_version 72280 (0.0006) [2023-03-07 04:56:38,271][118044] Updated weights for policy 0, policy_version 72290 (0.0006) [2023-03-07 04:56:39,057][118044] Updated weights for policy 0, policy_version 72300 (0.0007) [2023-03-07 04:56:39,847][118044] Updated weights for policy 0, policy_version 72310 (0.0006) [2023-03-07 04:56:40,635][118044] Updated weights for policy 0, policy_version 72320 (0.0006) [2023-03-07 04:56:41,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 74060800. Throughput: 0: 13167.7. Samples: 74027869. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:56:41,086][117718] Avg episode reward: [(0, '2759.522')] [2023-03-07 04:56:41,404][118044] Updated weights for policy 0, policy_version 72330 (0.0007) [2023-03-07 04:56:42,195][118044] Updated weights for policy 0, policy_version 72340 (0.0006) [2023-03-07 04:56:42,972][118044] Updated weights for policy 0, policy_version 72350 (0.0006) [2023-03-07 04:56:43,747][118044] Updated weights for policy 0, policy_version 72360 (0.0006) [2023-03-07 04:56:44,512][118044] Updated weights for policy 0, policy_version 72370 (0.0005) [2023-03-07 04:56:45,297][118044] Updated weights for policy 0, policy_version 72380 (0.0006) [2023-03-07 04:56:46,079][118044] Updated weights for policy 0, policy_version 72390 (0.0006) [2023-03-07 04:56:46,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13175.4, 300 sec: 13152.3). Total num frames: 74127360. Throughput: 0: 13166.9. Samples: 74106612. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:56:46,086][117718] Avg episode reward: [(0, '2834.527')] [2023-03-07 04:56:46,841][118044] Updated weights for policy 0, policy_version 72400 (0.0007) [2023-03-07 04:56:47,634][118044] Updated weights for policy 0, policy_version 72410 (0.0006) [2023-03-07 04:56:48,416][118044] Updated weights for policy 0, policy_version 72420 (0.0006) [2023-03-07 04:56:49,206][118044] Updated weights for policy 0, policy_version 72430 (0.0006) [2023-03-07 04:56:49,980][118044] Updated weights for policy 0, policy_version 72440 (0.0007) [2023-03-07 04:56:50,774][118044] Updated weights for policy 0, policy_version 72450 (0.0005) [2023-03-07 04:56:51,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 74192896. Throughput: 0: 13151.1. Samples: 74185116. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:56:51,086][117718] Avg episode reward: [(0, '2798.723')] [2023-03-07 04:56:51,554][118044] Updated weights for policy 0, policy_version 72460 (0.0006) [2023-03-07 04:56:52,353][118044] Updated weights for policy 0, policy_version 72470 (0.0006) [2023-03-07 04:56:53,109][118044] Updated weights for policy 0, policy_version 72480 (0.0006) [2023-03-07 04:56:53,893][118044] Updated weights for policy 0, policy_version 72490 (0.0006) [2023-03-07 04:56:54,672][118044] Updated weights for policy 0, policy_version 72500 (0.0006) [2023-03-07 04:56:55,446][118044] Updated weights for policy 0, policy_version 72510 (0.0006) [2023-03-07 04:56:56,085][117718] Fps is (10 sec: 13107.4, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 74258432. Throughput: 0: 13137.6. Samples: 74224358. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:56:56,086][117718] Avg episode reward: [(0, '2819.377')] [2023-03-07 04:56:56,244][118044] Updated weights for policy 0, policy_version 72520 (0.0006) [2023-03-07 04:56:57,013][118044] Updated weights for policy 0, policy_version 72530 (0.0006) [2023-03-07 04:56:57,795][118044] Updated weights for policy 0, policy_version 72540 (0.0006) [2023-03-07 04:56:58,563][118044] Updated weights for policy 0, policy_version 72550 (0.0007) [2023-03-07 04:56:59,348][118044] Updated weights for policy 0, policy_version 72560 (0.0006) [2023-03-07 04:57:00,127][118044] Updated weights for policy 0, policy_version 72570 (0.0007) [2023-03-07 04:57:00,920][118044] Updated weights for policy 0, policy_version 72580 (0.0007) [2023-03-07 04:57:01,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13158.4, 300 sec: 13148.8). Total num frames: 74323968. Throughput: 0: 13142.2. Samples: 74303335. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:57:01,086][117718] Avg episode reward: [(0, '2762.293')] [2023-03-07 04:57:01,707][118044] Updated weights for policy 0, policy_version 72590 (0.0006) [2023-03-07 04:57:02,470][118044] Updated weights for policy 0, policy_version 72600 (0.0006) [2023-03-07 04:57:03,251][118044] Updated weights for policy 0, policy_version 72610 (0.0006) [2023-03-07 04:57:04,021][118044] Updated weights for policy 0, policy_version 72620 (0.0007) [2023-03-07 04:57:04,819][118044] Updated weights for policy 0, policy_version 72630 (0.0006) [2023-03-07 04:57:05,589][118044] Updated weights for policy 0, policy_version 72640 (0.0006) [2023-03-07 04:57:06,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 74389504. Throughput: 0: 13127.0. Samples: 74382152. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:57:06,086][117718] Avg episode reward: [(0, '2802.991')] [2023-03-07 04:57:06,363][118044] Updated weights for policy 0, policy_version 72650 (0.0006) [2023-03-07 04:57:07,153][118044] Updated weights for policy 0, policy_version 72660 (0.0006) [2023-03-07 04:57:07,933][118044] Updated weights for policy 0, policy_version 72670 (0.0006) [2023-03-07 04:57:08,719][118044] Updated weights for policy 0, policy_version 72680 (0.0007) [2023-03-07 04:57:09,505][118044] Updated weights for policy 0, policy_version 72690 (0.0006) [2023-03-07 04:57:10,275][118044] Updated weights for policy 0, policy_version 72700 (0.0006) [2023-03-07 04:57:11,049][118044] Updated weights for policy 0, policy_version 72710 (0.0005) [2023-03-07 04:57:11,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 74455040. Throughput: 0: 13121.4. Samples: 74421417. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:57:11,086][117718] Avg episode reward: [(0, '2901.554')] [2023-03-07 04:57:11,837][118044] Updated weights for policy 0, policy_version 72720 (0.0006) [2023-03-07 04:57:12,623][118044] Updated weights for policy 0, policy_version 72730 (0.0006) [2023-03-07 04:57:13,395][118044] Updated weights for policy 0, policy_version 72740 (0.0006) [2023-03-07 04:57:14,168][118044] Updated weights for policy 0, policy_version 72750 (0.0006) [2023-03-07 04:57:14,949][118044] Updated weights for policy 0, policy_version 72760 (0.0005) [2023-03-07 04:57:15,735][118044] Updated weights for policy 0, policy_version 72770 (0.0006) [2023-03-07 04:57:16,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13124.3, 300 sec: 13148.8). Total num frames: 74520576. Throughput: 0: 13123.4. Samples: 74500051. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:57:16,086][117718] Avg episode reward: [(0, '2885.508')] [2023-03-07 04:57:16,520][118044] Updated weights for policy 0, policy_version 72780 (0.0006) [2023-03-07 04:57:17,314][118044] Updated weights for policy 0, policy_version 72790 (0.0006) [2023-03-07 04:57:18,073][118044] Updated weights for policy 0, policy_version 72800 (0.0007) [2023-03-07 04:57:18,862][118044] Updated weights for policy 0, policy_version 72810 (0.0006) [2023-03-07 04:57:19,637][118044] Updated weights for policy 0, policy_version 72820 (0.0006) [2023-03-07 04:57:20,419][118044] Updated weights for policy 0, policy_version 72830 (0.0006) [2023-03-07 04:57:21,085][117718] Fps is (10 sec: 13107.4, 60 sec: 13124.3, 300 sec: 13145.4). Total num frames: 74586112. Throughput: 0: 13119.8. Samples: 74578687. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:57:21,086][117718] Avg episode reward: [(0, '2919.430')] [2023-03-07 04:57:21,208][118044] Updated weights for policy 0, policy_version 72840 (0.0006) [2023-03-07 04:57:21,979][118044] Updated weights for policy 0, policy_version 72850 (0.0006) [2023-03-07 04:57:22,778][118044] Updated weights for policy 0, policy_version 72860 (0.0006) [2023-03-07 04:57:23,558][118044] Updated weights for policy 0, policy_version 72870 (0.0006) [2023-03-07 04:57:24,345][118044] Updated weights for policy 0, policy_version 72880 (0.0006) [2023-03-07 04:57:25,131][118044] Updated weights for policy 0, policy_version 72890 (0.0006) [2023-03-07 04:57:25,904][118044] Updated weights for policy 0, policy_version 72900 (0.0006) [2023-03-07 04:57:26,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13145.4). Total num frames: 74651648. Throughput: 0: 13111.9. Samples: 74617904. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:57:26,086][117718] Avg episode reward: [(0, '2923.303')] [2023-03-07 04:57:26,689][118044] Updated weights for policy 0, policy_version 72910 (0.0006) [2023-03-07 04:57:27,478][118044] Updated weights for policy 0, policy_version 72920 (0.0006) [2023-03-07 04:57:28,253][118044] Updated weights for policy 0, policy_version 72930 (0.0007) [2023-03-07 04:57:29,018][118044] Updated weights for policy 0, policy_version 72940 (0.0006) [2023-03-07 04:57:29,801][118044] Updated weights for policy 0, policy_version 72950 (0.0007) [2023-03-07 04:57:30,591][118044] Updated weights for policy 0, policy_version 72960 (0.0006) [2023-03-07 04:57:31,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13145.4). Total num frames: 74717184. Throughput: 0: 13110.6. Samples: 74696586. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:57:31,086][117718] Avg episode reward: [(0, '2890.288')] [2023-03-07 04:57:31,383][118044] Updated weights for policy 0, policy_version 72970 (0.0006) [2023-03-07 04:57:32,154][118044] Updated weights for policy 0, policy_version 72980 (0.0006) [2023-03-07 04:57:32,945][118044] Updated weights for policy 0, policy_version 72990 (0.0006) [2023-03-07 04:57:33,723][118044] Updated weights for policy 0, policy_version 73000 (0.0006) [2023-03-07 04:57:34,503][118044] Updated weights for policy 0, policy_version 73010 (0.0007) [2023-03-07 04:57:35,262][118044] Updated weights for policy 0, policy_version 73020 (0.0006) [2023-03-07 04:57:36,053][118044] Updated weights for policy 0, policy_version 73030 (0.0007) [2023-03-07 04:57:36,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13145.4). Total num frames: 74782720. Throughput: 0: 13117.3. Samples: 74775396. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:57:36,086][117718] Avg episode reward: [(0, '2863.234')] [2023-03-07 04:57:36,820][118044] Updated weights for policy 0, policy_version 73040 (0.0006) [2023-03-07 04:57:37,610][118044] Updated weights for policy 0, policy_version 73050 (0.0006) [2023-03-07 04:57:38,393][118044] Updated weights for policy 0, policy_version 73060 (0.0006) [2023-03-07 04:57:39,182][118044] Updated weights for policy 0, policy_version 73070 (0.0006) [2023-03-07 04:57:39,963][118044] Updated weights for policy 0, policy_version 73080 (0.0007) [2023-03-07 04:57:40,745][118044] Updated weights for policy 0, policy_version 73090 (0.0006) [2023-03-07 04:57:41,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13124.2, 300 sec: 13145.4). Total num frames: 74848256. Throughput: 0: 13115.9. Samples: 74814575. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:57:41,086][117718] Avg episode reward: [(0, '2945.920')] [2023-03-07 04:57:41,524][118044] Updated weights for policy 0, policy_version 73100 (0.0006) [2023-03-07 04:57:42,319][118044] Updated weights for policy 0, policy_version 73110 (0.0006) [2023-03-07 04:57:43,086][118044] Updated weights for policy 0, policy_version 73120 (0.0006) [2023-03-07 04:57:43,882][118044] Updated weights for policy 0, policy_version 73130 (0.0006) [2023-03-07 04:57:44,655][118044] Updated weights for policy 0, policy_version 73140 (0.0006) [2023-03-07 04:57:45,438][118044] Updated weights for policy 0, policy_version 73150 (0.0006) [2023-03-07 04:57:46,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13141.9). Total num frames: 74913792. Throughput: 0: 13106.9. Samples: 74893144. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:57:46,086][117718] Avg episode reward: [(0, '2932.263')] [2023-03-07 04:57:46,206][118044] Updated weights for policy 0, policy_version 73160 (0.0006) [2023-03-07 04:57:46,970][118044] Updated weights for policy 0, policy_version 73170 (0.0007) [2023-03-07 04:57:47,757][118044] Updated weights for policy 0, policy_version 73180 (0.0006) [2023-03-07 04:57:48,552][118044] Updated weights for policy 0, policy_version 73190 (0.0006) [2023-03-07 04:57:49,310][118044] Updated weights for policy 0, policy_version 73200 (0.0005) [2023-03-07 04:57:50,119][118044] Updated weights for policy 0, policy_version 73210 (0.0006) [2023-03-07 04:57:50,894][118044] Updated weights for policy 0, policy_version 73220 (0.0006) [2023-03-07 04:57:51,085][117718] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13145.4). Total num frames: 74979328. Throughput: 0: 13107.0. Samples: 74971967. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:57:51,086][117718] Avg episode reward: [(0, '2941.525')] [2023-03-07 04:57:51,666][118044] Updated weights for policy 0, policy_version 73230 (0.0007) [2023-03-07 04:57:52,460][118044] Updated weights for policy 0, policy_version 73240 (0.0006) [2023-03-07 04:57:53,230][118044] Updated weights for policy 0, policy_version 73250 (0.0006) [2023-03-07 04:57:54,012][118044] Updated weights for policy 0, policy_version 73260 (0.0006) [2023-03-07 04:57:54,789][118044] Updated weights for policy 0, policy_version 73270 (0.0007) [2023-03-07 04:57:55,562][118044] Updated weights for policy 0, policy_version 73280 (0.0006) [2023-03-07 04:57:56,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13145.4). Total num frames: 75044864. Throughput: 0: 13108.4. Samples: 75011296. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:57:56,086][117718] Avg episode reward: [(0, '2835.634')] [2023-03-07 04:57:56,342][118044] Updated weights for policy 0, policy_version 73290 (0.0007) [2023-03-07 04:57:57,130][118044] Updated weights for policy 0, policy_version 73300 (0.0006) [2023-03-07 04:57:57,895][118044] Updated weights for policy 0, policy_version 73310 (0.0006) [2023-03-07 04:57:58,679][118044] Updated weights for policy 0, policy_version 73320 (0.0006) [2023-03-07 04:57:59,478][118044] Updated weights for policy 0, policy_version 73330 (0.0005) [2023-03-07 04:58:00,264][118044] Updated weights for policy 0, policy_version 73340 (0.0006) [2023-03-07 04:58:01,061][118044] Updated weights for policy 0, policy_version 73350 (0.0006) [2023-03-07 04:58:01,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13141.9). Total num frames: 75110400. Throughput: 0: 13111.6. Samples: 75090071. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:58:01,097][117718] Avg episode reward: [(0, '3017.426')] [2023-03-07 04:58:01,826][118044] Updated weights for policy 0, policy_version 73360 (0.0005) [2023-03-07 04:58:02,609][118044] Updated weights for policy 0, policy_version 73370 (0.0006) [2023-03-07 04:58:03,385][118044] Updated weights for policy 0, policy_version 73380 (0.0007) [2023-03-07 04:58:04,172][118044] Updated weights for policy 0, policy_version 73390 (0.0006) [2023-03-07 04:58:04,961][118044] Updated weights for policy 0, policy_version 73400 (0.0005) [2023-03-07 04:58:05,720][118044] Updated weights for policy 0, policy_version 73410 (0.0007) [2023-03-07 04:58:06,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13141.9). Total num frames: 75175936. Throughput: 0: 13113.3. Samples: 75168785. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:58:06,097][117718] Avg episode reward: [(0, '2886.294')] [2023-03-07 04:58:06,493][118044] Updated weights for policy 0, policy_version 73420 (0.0006) [2023-03-07 04:58:07,293][118044] Updated weights for policy 0, policy_version 73430 (0.0006) [2023-03-07 04:58:08,078][118044] Updated weights for policy 0, policy_version 73440 (0.0006) [2023-03-07 04:58:08,824][118044] Updated weights for policy 0, policy_version 73450 (0.0006) [2023-03-07 04:58:09,601][118044] Updated weights for policy 0, policy_version 73460 (0.0005) [2023-03-07 04:58:10,390][118044] Updated weights for policy 0, policy_version 73470 (0.0006) [2023-03-07 04:58:11,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13141.9). Total num frames: 75241472. Throughput: 0: 13116.7. Samples: 75208157. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:58:11,097][117718] Avg episode reward: [(0, '2826.015')] [2023-03-07 04:58:11,180][118044] Updated weights for policy 0, policy_version 73480 (0.0007) [2023-03-07 04:58:11,951][118044] Updated weights for policy 0, policy_version 73490 (0.0006) [2023-03-07 04:58:12,724][118044] Updated weights for policy 0, policy_version 73500 (0.0006) [2023-03-07 04:58:13,490][118044] Updated weights for policy 0, policy_version 73510 (0.0006) [2023-03-07 04:58:14,262][118044] Updated weights for policy 0, policy_version 73520 (0.0006) [2023-03-07 04:58:15,046][118044] Updated weights for policy 0, policy_version 73530 (0.0006) [2023-03-07 04:58:15,833][118044] Updated weights for policy 0, policy_version 73540 (0.0006) [2023-03-07 04:58:16,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13124.3, 300 sec: 13141.9). Total num frames: 75308032. Throughput: 0: 13131.6. Samples: 75287508. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:58:16,097][117718] Avg episode reward: [(0, '2843.811')] [2023-03-07 04:58:16,625][118044] Updated weights for policy 0, policy_version 73550 (0.0006) [2023-03-07 04:58:17,405][118044] Updated weights for policy 0, policy_version 73560 (0.0007) [2023-03-07 04:58:18,176][118044] Updated weights for policy 0, policy_version 73570 (0.0006) [2023-03-07 04:58:18,954][118044] Updated weights for policy 0, policy_version 73580 (0.0006) [2023-03-07 04:58:19,730][118044] Updated weights for policy 0, policy_version 73590 (0.0006) [2023-03-07 04:58:20,487][118044] Updated weights for policy 0, policy_version 73600 (0.0006) [2023-03-07 04:58:21,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13124.3, 300 sec: 13141.9). Total num frames: 75373568. Throughput: 0: 13128.7. Samples: 75366185. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:58:21,096][117718] Avg episode reward: [(0, '2796.229')] [2023-03-07 04:58:21,260][118044] Updated weights for policy 0, policy_version 73610 (0.0006) [2023-03-07 04:58:22,038][118044] Updated weights for policy 0, policy_version 73620 (0.0006) [2023-03-07 04:58:22,810][118044] Updated weights for policy 0, policy_version 73630 (0.0006) [2023-03-07 04:58:23,579][118044] Updated weights for policy 0, policy_version 73640 (0.0006) [2023-03-07 04:58:24,372][118044] Updated weights for policy 0, policy_version 73650 (0.0006) [2023-03-07 04:58:25,143][118044] Updated weights for policy 0, policy_version 73660 (0.0006) [2023-03-07 04:58:25,927][118044] Updated weights for policy 0, policy_version 73670 (0.0006) [2023-03-07 04:58:26,085][117718] Fps is (10 sec: 13209.8, 60 sec: 13141.4, 300 sec: 13145.4). Total num frames: 75440128. Throughput: 0: 13143.6. Samples: 75406035. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 04:58:26,096][117718] Avg episode reward: [(0, '2886.259')] [2023-03-07 04:58:26,709][118044] Updated weights for policy 0, policy_version 73680 (0.0006) [2023-03-07 04:58:27,487][118044] Updated weights for policy 0, policy_version 73690 (0.0006) [2023-03-07 04:58:28,280][118044] Updated weights for policy 0, policy_version 73700 (0.0006) [2023-03-07 04:58:29,050][118044] Updated weights for policy 0, policy_version 73710 (0.0006) [2023-03-07 04:58:29,844][118044] Updated weights for policy 0, policy_version 73720 (0.0006) [2023-03-07 04:58:30,614][118044] Updated weights for policy 0, policy_version 73730 (0.0005) [2023-03-07 04:58:31,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 75505664. Throughput: 0: 13146.9. Samples: 75484754. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:58:31,096][117718] Avg episode reward: [(0, '2784.426')] [2023-03-07 04:58:31,383][118044] Updated weights for policy 0, policy_version 73740 (0.0005) [2023-03-07 04:58:32,167][118044] Updated weights for policy 0, policy_version 73750 (0.0006) [2023-03-07 04:58:32,946][118044] Updated weights for policy 0, policy_version 73760 (0.0006) [2023-03-07 04:58:33,718][118044] Updated weights for policy 0, policy_version 73770 (0.0006) [2023-03-07 04:58:34,494][118044] Updated weights for policy 0, policy_version 73780 (0.0006) [2023-03-07 04:58:35,265][118044] Updated weights for policy 0, policy_version 73790 (0.0007) [2023-03-07 04:58:36,051][118044] Updated weights for policy 0, policy_version 73800 (0.0006) [2023-03-07 04:58:36,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 75571200. Throughput: 0: 13151.9. Samples: 75563802. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:58:36,097][117718] Avg episode reward: [(0, '2900.258')] [2023-03-07 04:58:36,102][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000073800_75571200.pth... 
[2023-03-07 04:58:36,134][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000070719_72416256.pth [2023-03-07 04:58:36,841][118044] Updated weights for policy 0, policy_version 73810 (0.0006) [2023-03-07 04:58:37,622][118044] Updated weights for policy 0, policy_version 73820 (0.0007) [2023-03-07 04:58:38,404][118044] Updated weights for policy 0, policy_version 73830 (0.0006) [2023-03-07 04:58:39,181][118044] Updated weights for policy 0, policy_version 73840 (0.0006) [2023-03-07 04:58:39,963][118044] Updated weights for policy 0, policy_version 73850 (0.0006) [2023-03-07 04:58:40,738][118044] Updated weights for policy 0, policy_version 73860 (0.0006) [2023-03-07 04:58:41,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.4, 300 sec: 13141.9). Total num frames: 75636736. Throughput: 0: 13151.1. Samples: 75603095. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:58:41,086][117718] Avg episode reward: [(0, '2871.519')] [2023-03-07 04:58:41,506][118044] Updated weights for policy 0, policy_version 73870 (0.0006) [2023-03-07 04:58:42,272][118044] Updated weights for policy 0, policy_version 73880 (0.0006) [2023-03-07 04:58:43,054][118044] Updated weights for policy 0, policy_version 73890 (0.0007) [2023-03-07 04:58:43,837][118044] Updated weights for policy 0, policy_version 73900 (0.0006) [2023-03-07 04:58:44,611][118044] Updated weights for policy 0, policy_version 73910 (0.0007) [2023-03-07 04:58:45,396][118044] Updated weights for policy 0, policy_version 73920 (0.0008) [2023-03-07 04:58:46,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 75702272. Throughput: 0: 13157.6. Samples: 75682161. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:58:46,086][117718] Avg episode reward: [(0, '2843.547')] [2023-03-07 04:58:46,173][118044] Updated weights for policy 0, policy_version 73930 (0.0006) [2023-03-07 04:58:46,954][118044] Updated weights for policy 0, policy_version 73940 (0.0006) [2023-03-07 04:58:47,726][118044] Updated weights for policy 0, policy_version 73950 (0.0006) [2023-03-07 04:58:48,511][118044] Updated weights for policy 0, policy_version 73960 (0.0005) [2023-03-07 04:58:49,290][118044] Updated weights for policy 0, policy_version 73970 (0.0006) [2023-03-07 04:58:50,064][118044] Updated weights for policy 0, policy_version 73980 (0.0007) [2023-03-07 04:58:50,875][118044] Updated weights for policy 0, policy_version 73990 (0.0006) [2023-03-07 04:58:51,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13138.4). Total num frames: 75767808. Throughput: 0: 13156.4. Samples: 75760824. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:58:51,086][117718] Avg episode reward: [(0, '2834.735')] [2023-03-07 04:58:51,642][118044] Updated weights for policy 0, policy_version 74000 (0.0007) [2023-03-07 04:58:52,432][118044] Updated weights for policy 0, policy_version 74010 (0.0006) [2023-03-07 04:58:53,236][118044] Updated weights for policy 0, policy_version 74020 (0.0005) [2023-03-07 04:58:54,023][118044] Updated weights for policy 0, policy_version 74030 (0.0006) [2023-03-07 04:58:54,790][118044] Updated weights for policy 0, policy_version 74040 (0.0006) [2023-03-07 04:58:55,548][118044] Updated weights for policy 0, policy_version 74050 (0.0007) [2023-03-07 04:58:56,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 75833344. Throughput: 0: 13150.7. Samples: 75799940. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:58:56,086][117718] Avg episode reward: [(0, '2878.297')] [2023-03-07 04:58:56,353][118044] Updated weights for policy 0, policy_version 74060 (0.0007) [2023-03-07 04:58:57,124][118044] Updated weights for policy 0, policy_version 74070 (0.0006) [2023-03-07 04:58:57,910][118044] Updated weights for policy 0, policy_version 74080 (0.0006) [2023-03-07 04:58:58,709][118044] Updated weights for policy 0, policy_version 74090 (0.0007) [2023-03-07 04:58:59,484][118044] Updated weights for policy 0, policy_version 74100 (0.0007) [2023-03-07 04:59:00,265][118044] Updated weights for policy 0, policy_version 74110 (0.0006) [2023-03-07 04:59:01,047][118044] Updated weights for policy 0, policy_version 74120 (0.0006) [2023-03-07 04:59:01,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.4, 300 sec: 13141.9). Total num frames: 75898880. Throughput: 0: 13134.0. Samples: 75878538. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:59:01,086][117718] Avg episode reward: [(0, '2881.446')] [2023-03-07 04:59:01,835][118044] Updated weights for policy 0, policy_version 74130 (0.0007) [2023-03-07 04:59:02,604][118044] Updated weights for policy 0, policy_version 74140 (0.0007) [2023-03-07 04:59:03,385][118044] Updated weights for policy 0, policy_version 74150 (0.0006) [2023-03-07 04:59:04,164][118044] Updated weights for policy 0, policy_version 74160 (0.0006) [2023-03-07 04:59:04,933][118044] Updated weights for policy 0, policy_version 74170 (0.0006) [2023-03-07 04:59:05,713][118044] Updated weights for policy 0, policy_version 74180 (0.0006) [2023-03-07 04:59:06,085][117718] Fps is (10 sec: 13107.4, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 75964416. Throughput: 0: 13137.4. Samples: 75957368. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:59:06,086][117718] Avg episode reward: [(0, '2822.837')] [2023-03-07 04:59:06,475][118044] Updated weights for policy 0, policy_version 74190 (0.0005) [2023-03-07 04:59:07,250][118044] Updated weights for policy 0, policy_version 74200 (0.0006) [2023-03-07 04:59:08,013][118044] Updated weights for policy 0, policy_version 74210 (0.0006) [2023-03-07 04:59:08,804][118044] Updated weights for policy 0, policy_version 74220 (0.0006) [2023-03-07 04:59:09,586][118044] Updated weights for policy 0, policy_version 74230 (0.0007) [2023-03-07 04:59:10,369][118044] Updated weights for policy 0, policy_version 74240 (0.0006) [2023-03-07 04:59:11,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13141.9). Total num frames: 76030976. Throughput: 0: 13137.9. Samples: 75997243. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:59:11,086][117718] Avg episode reward: [(0, '2764.812')] [2023-03-07 04:59:11,134][118044] Updated weights for policy 0, policy_version 74250 (0.0005) [2023-03-07 04:59:11,901][118044] Updated weights for policy 0, policy_version 74260 (0.0006) [2023-03-07 04:59:12,679][118044] Updated weights for policy 0, policy_version 74270 (0.0006) [2023-03-07 04:59:13,451][118044] Updated weights for policy 0, policy_version 74280 (0.0006) [2023-03-07 04:59:14,218][118044] Updated weights for policy 0, policy_version 74290 (0.0006) [2023-03-07 04:59:15,001][118044] Updated weights for policy 0, policy_version 74300 (0.0007) [2023-03-07 04:59:15,789][118044] Updated weights for policy 0, policy_version 74310 (0.0006) [2023-03-07 04:59:16,086][117718] Fps is (10 sec: 13209.4, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 76096512. Throughput: 0: 13149.1. Samples: 76076466. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:59:16,086][117718] Avg episode reward: [(0, '2955.472')] [2023-03-07 04:59:16,564][118044] Updated weights for policy 0, policy_version 74320 (0.0006) [2023-03-07 04:59:17,347][118044] Updated weights for policy 0, policy_version 74330 (0.0006) [2023-03-07 04:59:18,134][118044] Updated weights for policy 0, policy_version 74340 (0.0005) [2023-03-07 04:59:18,916][118044] Updated weights for policy 0, policy_version 74350 (0.0006) [2023-03-07 04:59:19,685][118044] Updated weights for policy 0, policy_version 74360 (0.0007) [2023-03-07 04:59:20,473][118044] Updated weights for policy 0, policy_version 74370 (0.0006) [2023-03-07 04:59:21,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 76162048. Throughput: 0: 13140.7. Samples: 76155132. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:59:21,086][117718] Avg episode reward: [(0, '2902.482')] [2023-03-07 04:59:21,247][118044] Updated weights for policy 0, policy_version 74380 (0.0006) [2023-03-07 04:59:22,034][118044] Updated weights for policy 0, policy_version 74390 (0.0006) [2023-03-07 04:59:22,813][118044] Updated weights for policy 0, policy_version 74400 (0.0006) [2023-03-07 04:59:23,577][118044] Updated weights for policy 0, policy_version 74410 (0.0007) [2023-03-07 04:59:24,350][118044] Updated weights for policy 0, policy_version 74420 (0.0006) [2023-03-07 04:59:25,132][118044] Updated weights for policy 0, policy_version 74430 (0.0006) [2023-03-07 04:59:25,898][118044] Updated weights for policy 0, policy_version 74440 (0.0006) [2023-03-07 04:59:26,086][117718] Fps is (10 sec: 13209.7, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 76228608. Throughput: 0: 13145.7. Samples: 76194652. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:59:26,086][117718] Avg episode reward: [(0, '2749.622')] [2023-03-07 04:59:26,657][118044] Updated weights for policy 0, policy_version 74450 (0.0006) [2023-03-07 04:59:27,447][118044] Updated weights for policy 0, policy_version 74460 (0.0007) [2023-03-07 04:59:28,233][118044] Updated weights for policy 0, policy_version 74470 (0.0006) [2023-03-07 04:59:29,009][118044] Updated weights for policy 0, policy_version 74480 (0.0007) [2023-03-07 04:59:29,775][118044] Updated weights for policy 0, policy_version 74490 (0.0006) [2023-03-07 04:59:30,566][118044] Updated weights for policy 0, policy_version 74500 (0.0006) [2023-03-07 04:59:31,086][117718] Fps is (10 sec: 13209.4, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 76294144. Throughput: 0: 13150.7. Samples: 76273941. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:59:31,086][117718] Avg episode reward: [(0, '2868.767')] [2023-03-07 04:59:31,339][118044] Updated weights for policy 0, policy_version 74510 (0.0006) [2023-03-07 04:59:32,133][118044] Updated weights for policy 0, policy_version 74520 (0.0006) [2023-03-07 04:59:32,911][118044] Updated weights for policy 0, policy_version 74530 (0.0006) [2023-03-07 04:59:33,697][118044] Updated weights for policy 0, policy_version 74540 (0.0006) [2023-03-07 04:59:34,466][118044] Updated weights for policy 0, policy_version 74550 (0.0007) [2023-03-07 04:59:35,234][118044] Updated weights for policy 0, policy_version 74560 (0.0006) [2023-03-07 04:59:36,011][118044] Updated weights for policy 0, policy_version 74570 (0.0006) [2023-03-07 04:59:36,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 76360704. Throughput: 0: 13154.9. Samples: 76352793. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:59:36,086][117718] Avg episode reward: [(0, '2747.446')] [2023-03-07 04:59:36,793][118044] Updated weights for policy 0, policy_version 74580 (0.0005) [2023-03-07 04:59:37,561][118044] Updated weights for policy 0, policy_version 74590 (0.0006) [2023-03-07 04:59:38,328][118044] Updated weights for policy 0, policy_version 74600 (0.0005) [2023-03-07 04:59:39,103][118044] Updated weights for policy 0, policy_version 74610 (0.0006) [2023-03-07 04:59:39,885][118044] Updated weights for policy 0, policy_version 74620 (0.0006) [2023-03-07 04:59:40,659][118044] Updated weights for policy 0, policy_version 74630 (0.0006) [2023-03-07 04:59:41,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 76426240. Throughput: 0: 13167.2. Samples: 76392463. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:59:41,086][117718] Avg episode reward: [(0, '2850.596')] [2023-03-07 04:59:41,454][118044] Updated weights for policy 0, policy_version 74640 (0.0007) [2023-03-07 04:59:42,230][118044] Updated weights for policy 0, policy_version 74650 (0.0006) [2023-03-07 04:59:43,004][118044] Updated weights for policy 0, policy_version 74660 (0.0006) [2023-03-07 04:59:43,787][118044] Updated weights for policy 0, policy_version 74670 (0.0007) [2023-03-07 04:59:44,578][118044] Updated weights for policy 0, policy_version 74680 (0.0006) [2023-03-07 04:59:45,345][118044] Updated weights for policy 0, policy_version 74690 (0.0007) [2023-03-07 04:59:46,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 76491776. Throughput: 0: 13169.1. Samples: 76471148. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:59:46,086][117718] Avg episode reward: [(0, '2841.240')] [2023-03-07 04:59:46,128][118044] Updated weights for policy 0, policy_version 74700 (0.0005) [2023-03-07 04:59:46,898][118044] Updated weights for policy 0, policy_version 74710 (0.0006) [2023-03-07 04:59:47,664][118044] Updated weights for policy 0, policy_version 74720 (0.0007) [2023-03-07 04:59:48,458][118044] Updated weights for policy 0, policy_version 74730 (0.0006) [2023-03-07 04:59:49,250][118044] Updated weights for policy 0, policy_version 74740 (0.0006) [2023-03-07 04:59:50,028][118044] Updated weights for policy 0, policy_version 74750 (0.0006) [2023-03-07 04:59:50,811][118044] Updated weights for policy 0, policy_version 74760 (0.0006) [2023-03-07 04:59:51,086][117718] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 76557312. Throughput: 0: 13168.4. Samples: 76549947. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:59:51,086][117718] Avg episode reward: [(0, '2809.584')] [2023-03-07 04:59:51,584][118044] Updated weights for policy 0, policy_version 74770 (0.0005) [2023-03-07 04:59:52,376][118044] Updated weights for policy 0, policy_version 74780 (0.0006) [2023-03-07 04:59:53,149][118044] Updated weights for policy 0, policy_version 74790 (0.0006) [2023-03-07 04:59:53,945][118044] Updated weights for policy 0, policy_version 74800 (0.0006) [2023-03-07 04:59:54,742][118044] Updated weights for policy 0, policy_version 74810 (0.0006) [2023-03-07 04:59:55,493][118044] Updated weights for policy 0, policy_version 74820 (0.0006) [2023-03-07 04:59:56,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 76622848. Throughput: 0: 13156.9. Samples: 76589307. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 04:59:56,086][117718] Avg episode reward: [(0, '2837.959')] [2023-03-07 04:59:56,275][118044] Updated weights for policy 0, policy_version 74830 (0.0007) [2023-03-07 04:59:57,047][118044] Updated weights for policy 0, policy_version 74840 (0.0006) [2023-03-07 04:59:57,841][118044] Updated weights for policy 0, policy_version 74850 (0.0006) [2023-03-07 04:59:58,615][118044] Updated weights for policy 0, policy_version 74860 (0.0005) [2023-03-07 04:59:59,397][118044] Updated weights for policy 0, policy_version 74870 (0.0006) [2023-03-07 05:00:00,163][118044] Updated weights for policy 0, policy_version 74880 (0.0006) [2023-03-07 05:00:00,939][118044] Updated weights for policy 0, policy_version 74890 (0.0006) [2023-03-07 05:00:01,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13141.9). Total num frames: 76688384. Throughput: 0: 13148.9. Samples: 76668165. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:00:01,086][117718] Avg episode reward: [(0, '2896.032')] [2023-03-07 05:00:01,707][118044] Updated weights for policy 0, policy_version 74900 (0.0006) [2023-03-07 05:00:02,489][118044] Updated weights for policy 0, policy_version 74910 (0.0006) [2023-03-07 05:00:03,259][118044] Updated weights for policy 0, policy_version 74920 (0.0006) [2023-03-07 05:00:04,027][118044] Updated weights for policy 0, policy_version 74930 (0.0005) [2023-03-07 05:00:04,804][118044] Updated weights for policy 0, policy_version 74940 (0.0005) [2023-03-07 05:00:05,580][118044] Updated weights for policy 0, policy_version 74950 (0.0005) [2023-03-07 05:00:06,086][117718] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13145.4). Total num frames: 76754944. Throughput: 0: 13168.3. Samples: 76747706. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:00:06,086][117718] Avg episode reward: [(0, '2922.857')] [2023-03-07 05:00:06,356][118044] Updated weights for policy 0, policy_version 74960 (0.0007) [2023-03-07 05:00:07,132][118044] Updated weights for policy 0, policy_version 74970 (0.0006) [2023-03-07 05:00:07,926][118044] Updated weights for policy 0, policy_version 74980 (0.0006) [2023-03-07 05:00:08,730][118044] Updated weights for policy 0, policy_version 74990 (0.0006) [2023-03-07 05:00:09,504][118044] Updated weights for policy 0, policy_version 75000 (0.0006) [2023-03-07 05:00:10,276][118044] Updated weights for policy 0, policy_version 75010 (0.0006) [2023-03-07 05:00:11,056][118044] Updated weights for policy 0, policy_version 75020 (0.0006) [2023-03-07 05:00:11,085][117718] Fps is (10 sec: 13209.8, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 76820480. Throughput: 0: 13156.5. Samples: 76786694. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:00:11,086][117718] Avg episode reward: [(0, '2899.114')] [2023-03-07 05:00:11,827][118044] Updated weights for policy 0, policy_version 75030 (0.0006) [2023-03-07 05:00:12,607][118044] Updated weights for policy 0, policy_version 75040 (0.0007) [2023-03-07 05:00:13,401][118044] Updated weights for policy 0, policy_version 75050 (0.0006) [2023-03-07 05:00:14,175][118044] Updated weights for policy 0, policy_version 75060 (0.0005) [2023-03-07 05:00:14,966][118044] Updated weights for policy 0, policy_version 75070 (0.0006) [2023-03-07 05:00:15,752][118044] Updated weights for policy 0, policy_version 75080 (0.0006) [2023-03-07 05:00:16,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 76886016. Throughput: 0: 13147.7. Samples: 76865585. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:00:16,097][117718] Avg episode reward: [(0, '2966.749')] [2023-03-07 05:00:16,535][118044] Updated weights for policy 0, policy_version 75090 (0.0007) [2023-03-07 05:00:17,300][118044] Updated weights for policy 0, policy_version 75100 (0.0006) [2023-03-07 05:00:18,052][118044] Updated weights for policy 0, policy_version 75110 (0.0006) [2023-03-07 05:00:18,829][118044] Updated weights for policy 0, policy_version 75120 (0.0006) [2023-03-07 05:00:19,614][118044] Updated weights for policy 0, policy_version 75130 (0.0006) [2023-03-07 05:00:20,372][118044] Updated weights for policy 0, policy_version 75140 (0.0006) [2023-03-07 05:00:21,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13145.4). Total num frames: 76952576. Throughput: 0: 13155.3. Samples: 76944780. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:00:21,096][117718] Avg episode reward: [(0, '2966.078')] [2023-03-07 05:00:21,139][118044] Updated weights for policy 0, policy_version 75150 (0.0006) [2023-03-07 05:00:21,934][118044] Updated weights for policy 0, policy_version 75160 (0.0007) [2023-03-07 05:00:22,719][118044] Updated weights for policy 0, policy_version 75170 (0.0006) [2023-03-07 05:00:23,500][118044] Updated weights for policy 0, policy_version 75180 (0.0006) [2023-03-07 05:00:24,272][118044] Updated weights for policy 0, policy_version 75190 (0.0006) [2023-03-07 05:00:25,049][118044] Updated weights for policy 0, policy_version 75200 (0.0007) [2023-03-07 05:00:25,826][118044] Updated weights for policy 0, policy_version 75210 (0.0006) [2023-03-07 05:00:26,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 77018112. Throughput: 0: 13150.2. Samples: 76984224. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:00:26,097][117718] Avg episode reward: [(0, '2877.709')] [2023-03-07 05:00:26,592][118044] Updated weights for policy 0, policy_version 75220 (0.0006) [2023-03-07 05:00:27,377][118044] Updated weights for policy 0, policy_version 75230 (0.0006) [2023-03-07 05:00:28,171][118044] Updated weights for policy 0, policy_version 75240 (0.0007) [2023-03-07 05:00:28,953][118044] Updated weights for policy 0, policy_version 75250 (0.0007) [2023-03-07 05:00:29,727][118044] Updated weights for policy 0, policy_version 75260 (0.0006) [2023-03-07 05:00:30,510][118044] Updated weights for policy 0, policy_version 75270 (0.0006) [2023-03-07 05:00:31,086][117718] Fps is (10 sec: 13106.9, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 77083648. Throughput: 0: 13157.9. Samples: 77063255. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:00:31,086][117718] Avg episode reward: [(0, '2892.640')] [2023-03-07 05:00:31,279][118044] Updated weights for policy 0, policy_version 75280 (0.0006) [2023-03-07 05:00:32,046][118044] Updated weights for policy 0, policy_version 75290 (0.0006) [2023-03-07 05:00:32,829][118044] Updated weights for policy 0, policy_version 75300 (0.0006) [2023-03-07 05:00:33,598][118044] Updated weights for policy 0, policy_version 75310 (0.0006) [2023-03-07 05:00:34,369][118044] Updated weights for policy 0, policy_version 75320 (0.0006) [2023-03-07 05:00:35,172][118044] Updated weights for policy 0, policy_version 75330 (0.0006) [2023-03-07 05:00:35,952][118044] Updated weights for policy 0, policy_version 75340 (0.0005) [2023-03-07 05:00:36,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 77149184. Throughput: 0: 13158.9. Samples: 77142098. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:00:36,086][117718] Avg episode reward: [(0, '2805.729')] [2023-03-07 05:00:36,101][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000075342_77150208.pth... [2023-03-07 05:00:36,131][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000072262_73996288.pth [2023-03-07 05:00:36,721][118044] Updated weights for policy 0, policy_version 75350 (0.0006) [2023-03-07 05:00:37,505][118044] Updated weights for policy 0, policy_version 75360 (0.0006) [2023-03-07 05:00:38,298][118044] Updated weights for policy 0, policy_version 75370 (0.0006) [2023-03-07 05:00:39,074][118044] Updated weights for policy 0, policy_version 75380 (0.0006) [2023-03-07 05:00:39,850][118044] Updated weights for policy 0, policy_version 75390 (0.0006) [2023-03-07 05:00:40,653][118044] Updated weights for policy 0, policy_version 75400 (0.0006) [2023-03-07 05:00:41,085][117718] Fps is (10 sec: 13107.4, 60 sec: 13141.4, 300 sec: 13145.4). Total num frames: 77214720. Throughput: 0: 13156.6. Samples: 77181350. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:00:41,086][117718] Avg episode reward: [(0, '2936.528')] [2023-03-07 05:00:41,435][118044] Updated weights for policy 0, policy_version 75410 (0.0006) [2023-03-07 05:00:42,216][118044] Updated weights for policy 0, policy_version 75420 (0.0006) [2023-03-07 05:00:43,026][118044] Updated weights for policy 0, policy_version 75430 (0.0006) [2023-03-07 05:00:43,813][118044] Updated weights for policy 0, policy_version 75440 (0.0006) [2023-03-07 05:00:44,590][118044] Updated weights for policy 0, policy_version 75450 (0.0007) [2023-03-07 05:00:45,359][118044] Updated weights for policy 0, policy_version 75460 (0.0006) [2023-03-07 05:00:46,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 77280256. Throughput: 0: 13143.6. 
Samples: 77259628. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:00:46,086][117718] Avg episode reward: [(0, '2852.526')] [2023-03-07 05:00:46,137][118044] Updated weights for policy 0, policy_version 75470 (0.0007) [2023-03-07 05:00:46,908][118044] Updated weights for policy 0, policy_version 75480 (0.0007) [2023-03-07 05:00:47,704][118044] Updated weights for policy 0, policy_version 75490 (0.0006) [2023-03-07 05:00:48,475][118044] Updated weights for policy 0, policy_version 75500 (0.0005) [2023-03-07 05:00:49,266][118044] Updated weights for policy 0, policy_version 75510 (0.0006) [2023-03-07 05:00:50,052][118044] Updated weights for policy 0, policy_version 75520 (0.0006) [2023-03-07 05:00:50,822][118044] Updated weights for policy 0, policy_version 75530 (0.0006) [2023-03-07 05:00:51,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 77345792. Throughput: 0: 13127.0. Samples: 77338421. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:00:51,086][117718] Avg episode reward: [(0, '2896.122')] [2023-03-07 05:00:51,589][118044] Updated weights for policy 0, policy_version 75540 (0.0006) [2023-03-07 05:00:52,358][118044] Updated weights for policy 0, policy_version 75550 (0.0006) [2023-03-07 05:00:53,136][118044] Updated weights for policy 0, policy_version 75560 (0.0007) [2023-03-07 05:00:53,897][118044] Updated weights for policy 0, policy_version 75570 (0.0007) [2023-03-07 05:00:54,681][118044] Updated weights for policy 0, policy_version 75580 (0.0006) [2023-03-07 05:00:55,444][118044] Updated weights for policy 0, policy_version 75590 (0.0006) [2023-03-07 05:00:56,086][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 77412352. Throughput: 0: 13144.0. Samples: 77378177. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:00:56,086][117718] Avg episode reward: [(0, '3001.344')] [2023-03-07 05:00:56,222][118044] Updated weights for policy 0, policy_version 75600 (0.0006) [2023-03-07 05:00:57,002][118044] Updated weights for policy 0, policy_version 75610 (0.0005) [2023-03-07 05:00:57,800][118044] Updated weights for policy 0, policy_version 75620 (0.0007) [2023-03-07 05:00:58,558][118044] Updated weights for policy 0, policy_version 75630 (0.0006) [2023-03-07 05:00:59,350][118044] Updated weights for policy 0, policy_version 75640 (0.0007) [2023-03-07 05:01:00,134][118044] Updated weights for policy 0, policy_version 75650 (0.0006) [2023-03-07 05:01:00,901][118044] Updated weights for policy 0, policy_version 75660 (0.0006) [2023-03-07 05:01:01,086][117718] Fps is (10 sec: 13209.4, 60 sec: 13158.4, 300 sec: 13141.9). Total num frames: 77477888. Throughput: 0: 13148.1. Samples: 77457252. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:01:01,086][117718] Avg episode reward: [(0, '2920.380')] [2023-03-07 05:01:01,676][118044] Updated weights for policy 0, policy_version 75670 (0.0006) [2023-03-07 05:01:02,461][118044] Updated weights for policy 0, policy_version 75680 (0.0006) [2023-03-07 05:01:03,226][118044] Updated weights for policy 0, policy_version 75690 (0.0005) [2023-03-07 05:01:04,023][118044] Updated weights for policy 0, policy_version 75700 (0.0006) [2023-03-07 05:01:04,809][118044] Updated weights for policy 0, policy_version 75710 (0.0006) [2023-03-07 05:01:05,566][118044] Updated weights for policy 0, policy_version 75720 (0.0006) [2023-03-07 05:01:06,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 77543424. Throughput: 0: 13143.4. Samples: 77536236. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:01:06,086][117718] Avg episode reward: [(0, '2908.036')] [2023-03-07 05:01:06,363][118044] Updated weights for policy 0, policy_version 75730 (0.0006) [2023-03-07 05:01:07,123][118044] Updated weights for policy 0, policy_version 75740 (0.0007) [2023-03-07 05:01:07,900][118044] Updated weights for policy 0, policy_version 75750 (0.0006) [2023-03-07 05:01:08,681][118044] Updated weights for policy 0, policy_version 75760 (0.0006) [2023-03-07 05:01:09,460][118044] Updated weights for policy 0, policy_version 75770 (0.0007) [2023-03-07 05:01:10,219][118044] Updated weights for policy 0, policy_version 75780 (0.0006) [2023-03-07 05:01:10,989][118044] Updated weights for policy 0, policy_version 75790 (0.0007) [2023-03-07 05:01:11,086][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13138.4). Total num frames: 77608960. Throughput: 0: 13145.3. Samples: 77575762. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:01:11,086][117718] Avg episode reward: [(0, '2851.632')] [2023-03-07 05:01:11,769][118044] Updated weights for policy 0, policy_version 75800 (0.0006) [2023-03-07 05:01:12,535][118044] Updated weights for policy 0, policy_version 75810 (0.0006) [2023-03-07 05:01:13,307][118044] Updated weights for policy 0, policy_version 75820 (0.0006) [2023-03-07 05:01:14,109][118044] Updated weights for policy 0, policy_version 75830 (0.0005) [2023-03-07 05:01:14,888][118044] Updated weights for policy 0, policy_version 75840 (0.0006) [2023-03-07 05:01:15,665][118044] Updated weights for policy 0, policy_version 75850 (0.0006) [2023-03-07 05:01:16,086][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13141.9). Total num frames: 77675520. Throughput: 0: 13147.4. Samples: 77654887. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:01:16,086][117718] Avg episode reward: [(0, '2882.580')] [2023-03-07 05:01:16,445][118044] Updated weights for policy 0, policy_version 75860 (0.0005) [2023-03-07 05:01:17,206][118044] Updated weights for policy 0, policy_version 75870 (0.0006) [2023-03-07 05:01:18,021][118044] Updated weights for policy 0, policy_version 75880 (0.0007) [2023-03-07 05:01:18,785][118044] Updated weights for policy 0, policy_version 75890 (0.0006) [2023-03-07 05:01:19,566][118044] Updated weights for policy 0, policy_version 75900 (0.0006) [2023-03-07 05:01:20,334][118044] Updated weights for policy 0, policy_version 75910 (0.0006) [2023-03-07 05:01:21,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 77741056. Throughput: 0: 13148.9. Samples: 77733799. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:01:21,086][117718] Avg episode reward: [(0, '2778.851')] [2023-03-07 05:01:21,131][118044] Updated weights for policy 0, policy_version 75920 (0.0006) [2023-03-07 05:01:21,918][118044] Updated weights for policy 0, policy_version 75930 (0.0006) [2023-03-07 05:01:22,686][118044] Updated weights for policy 0, policy_version 75940 (0.0005) [2023-03-07 05:01:23,462][118044] Updated weights for policy 0, policy_version 75950 (0.0007) [2023-03-07 05:01:24,248][118044] Updated weights for policy 0, policy_version 75960 (0.0006) [2023-03-07 05:01:25,013][118044] Updated weights for policy 0, policy_version 75970 (0.0006) [2023-03-07 05:01:25,777][118044] Updated weights for policy 0, policy_version 75980 (0.0007) [2023-03-07 05:01:26,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.4, 300 sec: 13141.9). Total num frames: 77806592. Throughput: 0: 13151.8. Samples: 77773183. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:01:26,086][117718] Avg episode reward: [(0, '2770.511')] [2023-03-07 05:01:26,562][118044] Updated weights for policy 0, policy_version 75990 (0.0006) [2023-03-07 05:01:27,341][118044] Updated weights for policy 0, policy_version 76000 (0.0006) [2023-03-07 05:01:28,115][118044] Updated weights for policy 0, policy_version 76010 (0.0006) [2023-03-07 05:01:28,883][118044] Updated weights for policy 0, policy_version 76020 (0.0005) [2023-03-07 05:01:29,689][118044] Updated weights for policy 0, policy_version 76030 (0.0007) [2023-03-07 05:01:30,462][118044] Updated weights for policy 0, policy_version 76040 (0.0006) [2023-03-07 05:01:31,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13141.9). Total num frames: 77873152. Throughput: 0: 13167.8. Samples: 77852179. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:01:31,097][117718] Avg episode reward: [(0, '2929.282')] [2023-03-07 05:01:31,213][118044] Updated weights for policy 0, policy_version 76050 (0.0006) [2023-03-07 05:01:31,993][118044] Updated weights for policy 0, policy_version 76060 (0.0006) [2023-03-07 05:01:32,774][118044] Updated weights for policy 0, policy_version 76070 (0.0006) [2023-03-07 05:01:33,561][118044] Updated weights for policy 0, policy_version 76080 (0.0005) [2023-03-07 05:01:34,335][118044] Updated weights for policy 0, policy_version 76090 (0.0006) [2023-03-07 05:01:35,104][118044] Updated weights for policy 0, policy_version 76100 (0.0005) [2023-03-07 05:01:35,872][118044] Updated weights for policy 0, policy_version 76110 (0.0006) [2023-03-07 05:01:36,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 77938688. Throughput: 0: 13180.6. Samples: 77931547. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:01:36,096][117718] Avg episode reward: [(0, '2933.217')] [2023-03-07 05:01:36,644][118044] Updated weights for policy 0, policy_version 76120 (0.0006) [2023-03-07 05:01:37,413][118044] Updated weights for policy 0, policy_version 76130 (0.0006) [2023-03-07 05:01:38,230][118044] Updated weights for policy 0, policy_version 76140 (0.0006) [2023-03-07 05:01:38,976][118044] Updated weights for policy 0, policy_version 76150 (0.0007) [2023-03-07 05:01:39,779][118044] Updated weights for policy 0, policy_version 76160 (0.0006) [2023-03-07 05:01:40,537][118044] Updated weights for policy 0, policy_version 76170 (0.0006) [2023-03-07 05:01:41,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13141.9). Total num frames: 78004224. Throughput: 0: 13172.3. Samples: 77970931. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:01:41,096][117718] Avg episode reward: [(0, '2971.265')] [2023-03-07 05:01:41,322][118044] Updated weights for policy 0, policy_version 76180 (0.0006) [2023-03-07 05:01:42,121][118044] Updated weights for policy 0, policy_version 76190 (0.0006) [2023-03-07 05:01:42,887][118044] Updated weights for policy 0, policy_version 76200 (0.0006) [2023-03-07 05:01:43,665][118044] Updated weights for policy 0, policy_version 76210 (0.0006) [2023-03-07 05:01:44,441][118044] Updated weights for policy 0, policy_version 76220 (0.0006) [2023-03-07 05:01:45,225][118044] Updated weights for policy 0, policy_version 76230 (0.0006) [2023-03-07 05:01:46,013][118044] Updated weights for policy 0, policy_version 76240 (0.0006) [2023-03-07 05:01:46,086][117718] Fps is (10 sec: 13209.4, 60 sec: 13175.5, 300 sec: 13145.4). Total num frames: 78070784. Throughput: 0: 13171.2. Samples: 78049956. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:01:46,097][117718] Avg episode reward: [(0, '2775.479')] [2023-03-07 05:01:46,804][118044] Updated weights for policy 0, policy_version 76250 (0.0006) [2023-03-07 05:01:47,568][118044] Updated weights for policy 0, policy_version 76260 (0.0007) [2023-03-07 05:01:48,368][118044] Updated weights for policy 0, policy_version 76270 (0.0006) [2023-03-07 05:01:49,127][118044] Updated weights for policy 0, policy_version 76280 (0.0006) [2023-03-07 05:01:49,896][118044] Updated weights for policy 0, policy_version 76290 (0.0006) [2023-03-07 05:01:50,666][118044] Updated weights for policy 0, policy_version 76300 (0.0006) [2023-03-07 05:01:51,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13145.4). Total num frames: 78136320. Throughput: 0: 13169.1. Samples: 78128843. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:01:51,097][117718] Avg episode reward: [(0, '2889.309')] [2023-03-07 05:01:51,444][118044] Updated weights for policy 0, policy_version 76310 (0.0005) [2023-03-07 05:01:52,226][118044] Updated weights for policy 0, policy_version 76320 (0.0006) [2023-03-07 05:01:53,010][118044] Updated weights for policy 0, policy_version 76330 (0.0006) [2023-03-07 05:01:53,798][118044] Updated weights for policy 0, policy_version 76340 (0.0006) [2023-03-07 05:01:54,557][118044] Updated weights for policy 0, policy_version 76350 (0.0006) [2023-03-07 05:01:55,327][118044] Updated weights for policy 0, policy_version 76360 (0.0006) [2023-03-07 05:01:56,086][117718] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 78201856. Throughput: 0: 13164.4. Samples: 78168160. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:01:56,091][118044] Updated weights for policy 0, policy_version 76370 (0.0006) [2023-03-07 05:01:56,096][117718] Avg episode reward: [(0, '2988.106')] [2023-03-07 05:01:56,860][118044] Updated weights for policy 0, policy_version 76380 (0.0006) [2023-03-07 05:01:57,647][118044] Updated weights for policy 0, policy_version 76390 (0.0006) [2023-03-07 05:01:58,419][118044] Updated weights for policy 0, policy_version 76400 (0.0006) [2023-03-07 05:01:59,201][118044] Updated weights for policy 0, policy_version 76410 (0.0006) [2023-03-07 05:01:59,987][118044] Updated weights for policy 0, policy_version 76420 (0.0005) [2023-03-07 05:02:00,762][118044] Updated weights for policy 0, policy_version 76430 (0.0006) [2023-03-07 05:02:01,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13175.5, 300 sec: 13148.9). Total num frames: 78268416. Throughput: 0: 13170.6. Samples: 78247563. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:02:01,097][117718] Avg episode reward: [(0, '2905.505')] [2023-03-07 05:02:01,557][118044] Updated weights for policy 0, policy_version 76440 (0.0006) [2023-03-07 05:02:02,322][118044] Updated weights for policy 0, policy_version 76450 (0.0007) [2023-03-07 05:02:03,120][118044] Updated weights for policy 0, policy_version 76460 (0.0006) [2023-03-07 05:02:03,914][118044] Updated weights for policy 0, policy_version 76470 (0.0006) [2023-03-07 05:02:04,687][118044] Updated weights for policy 0, policy_version 76480 (0.0006) [2023-03-07 05:02:05,470][118044] Updated weights for policy 0, policy_version 76490 (0.0005) [2023-03-07 05:02:06,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13175.5, 300 sec: 13148.9). Total num frames: 78333952. Throughput: 0: 13159.9. Samples: 78325993. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:02:06,097][117718] Avg episode reward: [(0, '2927.090')] [2023-03-07 05:02:06,235][118044] Updated weights for policy 0, policy_version 76500 (0.0005) [2023-03-07 05:02:07,041][118044] Updated weights for policy 0, policy_version 76510 (0.0006) [2023-03-07 05:02:07,799][118044] Updated weights for policy 0, policy_version 76520 (0.0005) [2023-03-07 05:02:08,570][118044] Updated weights for policy 0, policy_version 76530 (0.0006) [2023-03-07 05:02:09,365][118044] Updated weights for policy 0, policy_version 76540 (0.0006) [2023-03-07 05:02:10,145][118044] Updated weights for policy 0, policy_version 76550 (0.0006) [2023-03-07 05:02:10,914][118044] Updated weights for policy 0, policy_version 76560 (0.0007) [2023-03-07 05:02:11,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13175.5, 300 sec: 13148.9). Total num frames: 78399488. Throughput: 0: 13161.1. Samples: 78365430. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:02:11,096][117718] Avg episode reward: [(0, '2851.725')] [2023-03-07 05:02:11,705][118044] Updated weights for policy 0, policy_version 76570 (0.0006) [2023-03-07 05:02:12,483][118044] Updated weights for policy 0, policy_version 76580 (0.0007) [2023-03-07 05:02:13,269][118044] Updated weights for policy 0, policy_version 76590 (0.0007) [2023-03-07 05:02:14,054][118044] Updated weights for policy 0, policy_version 76600 (0.0006) [2023-03-07 05:02:14,824][118044] Updated weights for policy 0, policy_version 76610 (0.0006) [2023-03-07 05:02:15,629][118044] Updated weights for policy 0, policy_version 76620 (0.0006) [2023-03-07 05:02:16,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13148.8). Total num frames: 78465024. Throughput: 0: 13154.4. Samples: 78444126. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 05:02:16,086][117718] Avg episode reward: [(0, '2964.348')] [2023-03-07 05:02:16,406][118044] Updated weights for policy 0, policy_version 76630 (0.0005) [2023-03-07 05:02:17,159][118044] Updated weights for policy 0, policy_version 76640 (0.0006) [2023-03-07 05:02:17,954][118044] Updated weights for policy 0, policy_version 76650 (0.0006) [2023-03-07 05:02:18,727][118044] Updated weights for policy 0, policy_version 76660 (0.0006) [2023-03-07 05:02:19,502][118044] Updated weights for policy 0, policy_version 76670 (0.0006) [2023-03-07 05:02:20,274][118044] Updated weights for policy 0, policy_version 76680 (0.0006) [2023-03-07 05:02:21,062][118044] Updated weights for policy 0, policy_version 76690 (0.0005) [2023-03-07 05:02:21,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 78530560. Throughput: 0: 13144.3. Samples: 78523041. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 05:02:21,086][117718] Avg episode reward: [(0, '2946.680')] [2023-03-07 05:02:21,841][118044] Updated weights for policy 0, policy_version 76700 (0.0006) [2023-03-07 05:02:22,607][118044] Updated weights for policy 0, policy_version 76710 (0.0006) [2023-03-07 05:02:23,374][118044] Updated weights for policy 0, policy_version 76720 (0.0006) [2023-03-07 05:02:24,182][118044] Updated weights for policy 0, policy_version 76730 (0.0006) [2023-03-07 05:02:24,963][118044] Updated weights for policy 0, policy_version 76740 (0.0006) [2023-03-07 05:02:25,751][118044] Updated weights for policy 0, policy_version 76750 (0.0005) [2023-03-07 05:02:26,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13148.8). Total num frames: 78596096. Throughput: 0: 13146.5. Samples: 78562525. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 05:02:26,086][117718] Avg episode reward: [(0, '2869.981')] [2023-03-07 05:02:26,533][118044] Updated weights for policy 0, policy_version 76760 (0.0006) [2023-03-07 05:02:27,321][118044] Updated weights for policy 0, policy_version 76770 (0.0006) [2023-03-07 05:02:28,070][118044] Updated weights for policy 0, policy_version 76780 (0.0007) [2023-03-07 05:02:28,865][118044] Updated weights for policy 0, policy_version 76790 (0.0007) [2023-03-07 05:02:29,650][118044] Updated weights for policy 0, policy_version 76800 (0.0005) [2023-03-07 05:02:30,427][118044] Updated weights for policy 0, policy_version 76810 (0.0006) [2023-03-07 05:02:31,085][117718] Fps is (10 sec: 13107.4, 60 sec: 13141.4, 300 sec: 13148.9). Total num frames: 78661632. Throughput: 0: 13137.0. Samples: 78641117. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 05:02:31,086][117718] Avg episode reward: [(0, '2887.375')] [2023-03-07 05:02:31,205][118044] Updated weights for policy 0, policy_version 76820 (0.0005) [2023-03-07 05:02:31,979][118044] Updated weights for policy 0, policy_version 76830 (0.0006) [2023-03-07 05:02:32,763][118044] Updated weights for policy 0, policy_version 76840 (0.0006) [2023-03-07 05:02:33,554][118044] Updated weights for policy 0, policy_version 76850 (0.0007) [2023-03-07 05:02:34,312][118044] Updated weights for policy 0, policy_version 76860 (0.0006) [2023-03-07 05:02:35,088][118044] Updated weights for policy 0, policy_version 76870 (0.0006) [2023-03-07 05:02:35,877][118044] Updated weights for policy 0, policy_version 76880 (0.0007) [2023-03-07 05:02:36,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 78727168. Throughput: 0: 13136.4. Samples: 78719981. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 05:02:36,086][117718] Avg episode reward: [(0, '2805.663')] [2023-03-07 05:02:36,091][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000076882_78727168.pth... [2023-03-07 05:02:36,121][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000073800_75571200.pth [2023-03-07 05:02:36,645][118044] Updated weights for policy 0, policy_version 76890 (0.0005) [2023-03-07 05:02:37,419][118044] Updated weights for policy 0, policy_version 76900 (0.0006) [2023-03-07 05:02:38,194][118044] Updated weights for policy 0, policy_version 76910 (0.0007) [2023-03-07 05:02:38,970][118044] Updated weights for policy 0, policy_version 76920 (0.0006) [2023-03-07 05:02:39,755][118044] Updated weights for policy 0, policy_version 76930 (0.0006) [2023-03-07 05:02:40,521][118044] Updated weights for policy 0, policy_version 76940 (0.0006) [2023-03-07 05:02:41,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 78792704. Throughput: 0: 13143.7. Samples: 78759627. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 05:02:41,086][117718] Avg episode reward: [(0, '2766.138')] [2023-03-07 05:02:41,302][118044] Updated weights for policy 0, policy_version 76950 (0.0005) [2023-03-07 05:02:42,079][118044] Updated weights for policy 0, policy_version 76960 (0.0006) [2023-03-07 05:02:42,853][118044] Updated weights for policy 0, policy_version 76970 (0.0006) [2023-03-07 05:02:43,630][118044] Updated weights for policy 0, policy_version 76980 (0.0006) [2023-03-07 05:02:44,394][118044] Updated weights for policy 0, policy_version 76990 (0.0006) [2023-03-07 05:02:45,168][118044] Updated weights for policy 0, policy_version 77000 (0.0006) [2023-03-07 05:02:45,952][118044] Updated weights for policy 0, policy_version 77010 (0.0006) [2023-03-07 05:02:46,085][117718] Fps is (10 sec: 13209.8, 60 sec: 13141.4, 300 sec: 13152.3). Total num frames: 78859264. Throughput: 0: 13142.7. Samples: 78838985. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 05:02:46,086][117718] Avg episode reward: [(0, '2979.659')] [2023-03-07 05:02:46,698][118044] Updated weights for policy 0, policy_version 77020 (0.0005) [2023-03-07 05:02:47,477][118044] Updated weights for policy 0, policy_version 77030 (0.0006) [2023-03-07 05:02:48,257][118044] Updated weights for policy 0, policy_version 77040 (0.0006) [2023-03-07 05:02:49,035][118044] Updated weights for policy 0, policy_version 77050 (0.0006) [2023-03-07 05:02:49,807][118044] Updated weights for policy 0, policy_version 77060 (0.0006) [2023-03-07 05:02:50,590][118044] Updated weights for policy 0, policy_version 77070 (0.0006) [2023-03-07 05:02:51,086][117718] Fps is (10 sec: 13311.8, 60 sec: 13158.4, 300 sec: 13155.8). Total num frames: 78925824. Throughput: 0: 13162.4. Samples: 78918301. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 05:02:51,086][117718] Avg episode reward: [(0, '2812.785')] [2023-03-07 05:02:51,377][118044] Updated weights for policy 0, policy_version 77080 (0.0007) [2023-03-07 05:02:52,155][118044] Updated weights for policy 0, policy_version 77090 (0.0006) [2023-03-07 05:02:52,924][118044] Updated weights for policy 0, policy_version 77100 (0.0006) [2023-03-07 05:02:53,701][118044] Updated weights for policy 0, policy_version 77110 (0.0006) [2023-03-07 05:02:54,476][118044] Updated weights for policy 0, policy_version 77120 (0.0005) [2023-03-07 05:02:55,249][118044] Updated weights for policy 0, policy_version 77130 (0.0006) [2023-03-07 05:02:56,029][118044] Updated weights for policy 0, policy_version 77140 (0.0007) [2023-03-07 05:02:56,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13155.8). Total num frames: 78991360. Throughput: 0: 13164.9. Samples: 78957853. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 05:02:56,086][117718] Avg episode reward: [(0, '2890.943')] [2023-03-07 05:02:56,824][118044] Updated weights for policy 0, policy_version 77150 (0.0006) [2023-03-07 05:02:57,605][118044] Updated weights for policy 0, policy_version 77160 (0.0006) [2023-03-07 05:02:58,393][118044] Updated weights for policy 0, policy_version 77170 (0.0006) [2023-03-07 05:02:59,157][118044] Updated weights for policy 0, policy_version 77180 (0.0006) [2023-03-07 05:02:59,933][118044] Updated weights for policy 0, policy_version 77190 (0.0007) [2023-03-07 05:03:00,706][118044] Updated weights for policy 0, policy_version 77200 (0.0006) [2023-03-07 05:03:01,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13155.8). Total num frames: 79056896. Throughput: 0: 13166.6. Samples: 79036623. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 05:03:01,086][117718] Avg episode reward: [(0, '2790.796')] [2023-03-07 05:03:01,496][118044] Updated weights for policy 0, policy_version 77210 (0.0006) [2023-03-07 05:03:02,260][118044] Updated weights for policy 0, policy_version 77220 (0.0005) [2023-03-07 05:03:03,047][118044] Updated weights for policy 0, policy_version 77230 (0.0006) [2023-03-07 05:03:03,838][118044] Updated weights for policy 0, policy_version 77240 (0.0006) [2023-03-07 05:03:04,607][118044] Updated weights for policy 0, policy_version 77250 (0.0006) [2023-03-07 05:03:05,376][118044] Updated weights for policy 0, policy_version 77260 (0.0006) [2023-03-07 05:03:06,085][117718] Fps is (10 sec: 13209.8, 60 sec: 13158.4, 300 sec: 13159.3). Total num frames: 79123456. Throughput: 0: 13169.3. Samples: 79115659. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:03:06,086][117718] Avg episode reward: [(0, '2770.022')] [2023-03-07 05:03:06,145][118044] Updated weights for policy 0, policy_version 77270 (0.0007) [2023-03-07 05:03:06,928][118044] Updated weights for policy 0, policy_version 77280 (0.0007) [2023-03-07 05:03:07,711][118044] Updated weights for policy 0, policy_version 77290 (0.0007) [2023-03-07 05:03:08,492][118044] Updated weights for policy 0, policy_version 77300 (0.0006) [2023-03-07 05:03:09,289][118044] Updated weights for policy 0, policy_version 77310 (0.0006) [2023-03-07 05:03:10,062][118044] Updated weights for policy 0, policy_version 77320 (0.0006) [2023-03-07 05:03:10,851][118044] Updated weights for policy 0, policy_version 77330 (0.0007) [2023-03-07 05:03:11,085][117718] Fps is (10 sec: 13209.8, 60 sec: 13158.4, 300 sec: 13155.8). Total num frames: 79188992. Throughput: 0: 13167.5. Samples: 79155059. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:03:11,086][117718] Avg episode reward: [(0, '2825.844')] [2023-03-07 05:03:11,629][118044] Updated weights for policy 0, policy_version 77340 (0.0006) [2023-03-07 05:03:12,407][118044] Updated weights for policy 0, policy_version 77350 (0.0006) [2023-03-07 05:03:13,188][118044] Updated weights for policy 0, policy_version 77360 (0.0006) [2023-03-07 05:03:13,965][118044] Updated weights for policy 0, policy_version 77370 (0.0006) [2023-03-07 05:03:14,734][118044] Updated weights for policy 0, policy_version 77380 (0.0007) [2023-03-07 05:03:15,510][118044] Updated weights for policy 0, policy_version 77390 (0.0006) [2023-03-07 05:03:16,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13155.8). Total num frames: 79254528. Throughput: 0: 13171.5. Samples: 79233835. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:03:16,086][117718] Avg episode reward: [(0, '2807.710')] [2023-03-07 05:03:16,288][118044] Updated weights for policy 0, policy_version 77400 (0.0006) [2023-03-07 05:03:17,053][118044] Updated weights for policy 0, policy_version 77410 (0.0007) [2023-03-07 05:03:17,844][118044] Updated weights for policy 0, policy_version 77420 (0.0006) [2023-03-07 05:03:18,616][118044] Updated weights for policy 0, policy_version 77430 (0.0007) [2023-03-07 05:03:19,401][118044] Updated weights for policy 0, policy_version 77440 (0.0007) [2023-03-07 05:03:20,193][118044] Updated weights for policy 0, policy_version 77450 (0.0006) [2023-03-07 05:03:20,964][118044] Updated weights for policy 0, policy_version 77460 (0.0007) [2023-03-07 05:03:21,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 79320064. Throughput: 0: 13170.9. Samples: 79312667. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:03:21,086][117718] Avg episode reward: [(0, '2784.516')] [2023-03-07 05:03:21,722][118044] Updated weights for policy 0, policy_version 77470 (0.0006) [2023-03-07 05:03:22,528][118044] Updated weights for policy 0, policy_version 77480 (0.0006) [2023-03-07 05:03:23,319][118044] Updated weights for policy 0, policy_version 77490 (0.0006) [2023-03-07 05:03:24,093][118044] Updated weights for policy 0, policy_version 77500 (0.0006) [2023-03-07 05:03:24,855][118044] Updated weights for policy 0, policy_version 77510 (0.0007) [2023-03-07 05:03:25,631][118044] Updated weights for policy 0, policy_version 77520 (0.0006) [2023-03-07 05:03:26,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 79385600. Throughput: 0: 13164.3. Samples: 79352022. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:03:26,086][117718] Avg episode reward: [(0, '2872.217')] [2023-03-07 05:03:26,411][118044] Updated weights for policy 0, policy_version 77530 (0.0006) [2023-03-07 05:03:27,191][118044] Updated weights for policy 0, policy_version 77540 (0.0006) [2023-03-07 05:03:27,975][118044] Updated weights for policy 0, policy_version 77550 (0.0006) [2023-03-07 05:03:28,746][118044] Updated weights for policy 0, policy_version 77560 (0.0007) [2023-03-07 05:03:29,526][118044] Updated weights for policy 0, policy_version 77570 (0.0006) [2023-03-07 05:03:30,309][118044] Updated weights for policy 0, policy_version 77580 (0.0006) [2023-03-07 05:03:31,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13175.4, 300 sec: 13155.8). Total num frames: 79452160. Throughput: 0: 13160.8. Samples: 79431223. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:03:31,086][117718] Avg episode reward: [(0, '2864.003')] [2023-03-07 05:03:31,087][118044] Updated weights for policy 0, policy_version 77590 (0.0006) [2023-03-07 05:03:31,865][118044] Updated weights for policy 0, policy_version 77600 (0.0006) [2023-03-07 05:03:32,657][118044] Updated weights for policy 0, policy_version 77610 (0.0006) [2023-03-07 05:03:33,427][118044] Updated weights for policy 0, policy_version 77620 (0.0007) [2023-03-07 05:03:34,209][118044] Updated weights for policy 0, policy_version 77630 (0.0006) [2023-03-07 05:03:35,008][118044] Updated weights for policy 0, policy_version 77640 (0.0007) [2023-03-07 05:03:35,791][118044] Updated weights for policy 0, policy_version 77650 (0.0006) [2023-03-07 05:03:36,086][117718] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 79516672. Throughput: 0: 13142.7. Samples: 79509723. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:03:36,086][117718] Avg episode reward: [(0, '2763.057')] [2023-03-07 05:03:36,564][118044] Updated weights for policy 0, policy_version 77660 (0.0006) [2023-03-07 05:03:37,341][118044] Updated weights for policy 0, policy_version 77670 (0.0005) [2023-03-07 05:03:38,117][118044] Updated weights for policy 0, policy_version 77680 (0.0006) [2023-03-07 05:03:38,877][118044] Updated weights for policy 0, policy_version 77690 (0.0006) [2023-03-07 05:03:39,662][118044] Updated weights for policy 0, policy_version 77700 (0.0006) [2023-03-07 05:03:40,455][118044] Updated weights for policy 0, policy_version 77710 (0.0006) [2023-03-07 05:03:41,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13175.5, 300 sec: 13155.8). Total num frames: 79583232. Throughput: 0: 13140.8. Samples: 79549187. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:03:41,086][117718] Avg episode reward: [(0, '2828.496')] [2023-03-07 05:03:41,226][118044] Updated weights for policy 0, policy_version 77720 (0.0006) [2023-03-07 05:03:41,987][118044] Updated weights for policy 0, policy_version 77730 (0.0006) [2023-03-07 05:03:42,786][118044] Updated weights for policy 0, policy_version 77740 (0.0006) [2023-03-07 05:03:43,542][118044] Updated weights for policy 0, policy_version 77750 (0.0005) [2023-03-07 05:03:44,321][118044] Updated weights for policy 0, policy_version 77760 (0.0006) [2023-03-07 05:03:45,114][118044] Updated weights for policy 0, policy_version 77770 (0.0006) [2023-03-07 05:03:45,875][118044] Updated weights for policy 0, policy_version 77780 (0.0005) [2023-03-07 05:03:46,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13155.8). Total num frames: 79648768. Throughput: 0: 13149.2. Samples: 79628335. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:03:46,086][117718] Avg episode reward: [(0, '2874.074')] [2023-03-07 05:03:46,666][118044] Updated weights for policy 0, policy_version 77790 (0.0006) [2023-03-07 05:03:47,429][118044] Updated weights for policy 0, policy_version 77800 (0.0006) [2023-03-07 05:03:48,222][118044] Updated weights for policy 0, policy_version 77810 (0.0006) [2023-03-07 05:03:48,995][118044] Updated weights for policy 0, policy_version 77820 (0.0006) [2023-03-07 05:03:49,772][118044] Updated weights for policy 0, policy_version 77830 (0.0006) [2023-03-07 05:03:50,557][118044] Updated weights for policy 0, policy_version 77840 (0.0007) [2023-03-07 05:03:51,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13159.3). Total num frames: 79715328. Throughput: 0: 13148.5. Samples: 79707344. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:03:51,086][117718] Avg episode reward: [(0, '2904.672')] [2023-03-07 05:03:51,349][118044] Updated weights for policy 0, policy_version 77850 (0.0005) [2023-03-07 05:03:52,118][118044] Updated weights for policy 0, policy_version 77860 (0.0006) [2023-03-07 05:03:52,910][118044] Updated weights for policy 0, policy_version 77870 (0.0006) [2023-03-07 05:03:53,674][118044] Updated weights for policy 0, policy_version 77880 (0.0006) [2023-03-07 05:03:54,472][118044] Updated weights for policy 0, policy_version 77890 (0.0006) [2023-03-07 05:03:55,244][118044] Updated weights for policy 0, policy_version 77900 (0.0007) [2023-03-07 05:03:56,001][118044] Updated weights for policy 0, policy_version 77910 (0.0006) [2023-03-07 05:03:56,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13159.3). Total num frames: 79780864. Throughput: 0: 13148.8. Samples: 79746757. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:03:56,086][117718] Avg episode reward: [(0, '2814.284')] [2023-03-07 05:03:56,771][118044] Updated weights for policy 0, policy_version 77920 (0.0006) [2023-03-07 05:03:57,557][118044] Updated weights for policy 0, policy_version 77930 (0.0005) [2023-03-07 05:03:58,327][118044] Updated weights for policy 0, policy_version 77940 (0.0005) [2023-03-07 05:03:59,116][118044] Updated weights for policy 0, policy_version 77950 (0.0006) [2023-03-07 05:03:59,884][118044] Updated weights for policy 0, policy_version 77960 (0.0007) [2023-03-07 05:04:00,639][118044] Updated weights for policy 0, policy_version 77970 (0.0006) [2023-03-07 05:04:01,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13159.3). Total num frames: 79846400. Throughput: 0: 13157.3. Samples: 79825911. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:04:01,086][117718] Avg episode reward: [(0, '2681.452')] [2023-03-07 05:04:01,417][118044] Updated weights for policy 0, policy_version 77980 (0.0006) [2023-03-07 05:04:02,196][118044] Updated weights for policy 0, policy_version 77990 (0.0007) [2023-03-07 05:04:02,982][118044] Updated weights for policy 0, policy_version 78000 (0.0006) [2023-03-07 05:04:03,757][118044] Updated weights for policy 0, policy_version 78010 (0.0006) [2023-03-07 05:04:04,550][118044] Updated weights for policy 0, policy_version 78020 (0.0007) [2023-03-07 05:04:05,341][118044] Updated weights for policy 0, policy_version 78030 (0.0006) [2023-03-07 05:04:06,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13155.8). Total num frames: 79911936. Throughput: 0: 13157.6. Samples: 79904760. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:04:06,087][117718] Avg episode reward: [(0, '2717.537')] [2023-03-07 05:04:06,115][118044] Updated weights for policy 0, policy_version 78040 (0.0006) [2023-03-07 05:04:06,893][118044] Updated weights for policy 0, policy_version 78050 (0.0006) [2023-03-07 05:04:07,670][118044] Updated weights for policy 0, policy_version 78060 (0.0006) [2023-03-07 05:04:08,452][118044] Updated weights for policy 0, policy_version 78070 (0.0006) [2023-03-07 05:04:09,248][118044] Updated weights for policy 0, policy_version 78080 (0.0005) [2023-03-07 05:04:10,025][118044] Updated weights for policy 0, policy_version 78090 (0.0006) [2023-03-07 05:04:10,797][118044] Updated weights for policy 0, policy_version 78100 (0.0006) [2023-03-07 05:04:11,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13155.8). Total num frames: 79977472. Throughput: 0: 13158.4. Samples: 79944149. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:04:11,086][117718] Avg episode reward: [(0, '2697.304')] [2023-03-07 05:04:11,561][118044] Updated weights for policy 0, policy_version 78110 (0.0007) [2023-03-07 05:04:12,374][118044] Updated weights for policy 0, policy_version 78120 (0.0006) [2023-03-07 05:04:13,146][118044] Updated weights for policy 0, policy_version 78130 (0.0006) [2023-03-07 05:04:13,934][118044] Updated weights for policy 0, policy_version 78140 (0.0006) [2023-03-07 05:04:14,710][118044] Updated weights for policy 0, policy_version 78150 (0.0006) [2023-03-07 05:04:15,478][118044] Updated weights for policy 0, policy_version 78160 (0.0006) [2023-03-07 05:04:16,085][117718] Fps is (10 sec: 13107.4, 60 sec: 13141.3, 300 sec: 13155.8). Total num frames: 80043008. Throughput: 0: 13143.6. Samples: 80022684. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:04:16,086][117718] Avg episode reward: [(0, '2836.353')] [2023-03-07 05:04:16,254][118044] Updated weights for policy 0, policy_version 78170 (0.0006) [2023-03-07 05:04:17,014][118044] Updated weights for policy 0, policy_version 78180 (0.0006) [2023-03-07 05:04:17,784][118044] Updated weights for policy 0, policy_version 78190 (0.0007) [2023-03-07 05:04:18,581][118044] Updated weights for policy 0, policy_version 78200 (0.0006) [2023-03-07 05:04:19,338][118044] Updated weights for policy 0, policy_version 78210 (0.0006) [2023-03-07 05:04:20,116][118044] Updated weights for policy 0, policy_version 78220 (0.0007) [2023-03-07 05:04:20,897][118044] Updated weights for policy 0, policy_version 78230 (0.0006) [2023-03-07 05:04:21,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13155.8). Total num frames: 80109568. Throughput: 0: 13163.2. Samples: 80102065. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:04:21,086][117718] Avg episode reward: [(0, '2881.435')] [2023-03-07 05:04:21,700][118044] Updated weights for policy 0, policy_version 78240 (0.0006) [2023-03-07 05:04:22,488][118044] Updated weights for policy 0, policy_version 78250 (0.0007) [2023-03-07 05:04:23,277][118044] Updated weights for policy 0, policy_version 78260 (0.0006) [2023-03-07 05:04:24,052][118044] Updated weights for policy 0, policy_version 78270 (0.0006) [2023-03-07 05:04:24,814][118044] Updated weights for policy 0, policy_version 78280 (0.0006) [2023-03-07 05:04:25,626][118044] Updated weights for policy 0, policy_version 78290 (0.0006) [2023-03-07 05:04:26,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13155.8). Total num frames: 80175104. Throughput: 0: 13156.0. Samples: 80141207. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:04:26,086][117718] Avg episode reward: [(0, '2876.522')] [2023-03-07 05:04:26,390][118044] Updated weights for policy 0, policy_version 78300 (0.0006) [2023-03-07 05:04:27,168][118044] Updated weights for policy 0, policy_version 78310 (0.0006) [2023-03-07 05:04:27,945][118044] Updated weights for policy 0, policy_version 78320 (0.0006) [2023-03-07 05:04:28,749][118044] Updated weights for policy 0, policy_version 78330 (0.0006) [2023-03-07 05:04:29,529][118044] Updated weights for policy 0, policy_version 78340 (0.0006) [2023-03-07 05:04:30,308][118044] Updated weights for policy 0, policy_version 78350 (0.0007) [2023-03-07 05:04:31,077][118044] Updated weights for policy 0, policy_version 78360 (0.0006) [2023-03-07 05:04:31,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.4, 300 sec: 13152.3). Total num frames: 80240640. Throughput: 0: 13140.3. Samples: 80219649. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:04:31,086][117718] Avg episode reward: [(0, '2967.192')] [2023-03-07 05:04:31,850][118044] Updated weights for policy 0, policy_version 78370 (0.0006) [2023-03-07 05:04:32,613][118044] Updated weights for policy 0, policy_version 78380 (0.0006) [2023-03-07 05:04:33,405][118044] Updated weights for policy 0, policy_version 78390 (0.0006) [2023-03-07 05:04:34,169][118044] Updated weights for policy 0, policy_version 78400 (0.0006) [2023-03-07 05:04:34,958][118044] Updated weights for policy 0, policy_version 78410 (0.0007) [2023-03-07 05:04:35,732][118044] Updated weights for policy 0, policy_version 78420 (0.0005) [2023-03-07 05:04:36,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 80306176. Throughput: 0: 13142.4. Samples: 80298750. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:04:36,086][117718] Avg episode reward: [(0, '2846.797')] [2023-03-07 05:04:36,091][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000078424_80306176.pth... [2023-03-07 05:04:36,124][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000075342_77150208.pth [2023-03-07 05:04:36,517][118044] Updated weights for policy 0, policy_version 78430 (0.0006) [2023-03-07 05:04:37,296][118044] Updated weights for policy 0, policy_version 78440 (0.0006) [2023-03-07 05:04:38,093][118044] Updated weights for policy 0, policy_version 78450 (0.0006) [2023-03-07 05:04:38,852][118044] Updated weights for policy 0, policy_version 78460 (0.0006) [2023-03-07 05:04:39,637][118044] Updated weights for policy 0, policy_version 78470 (0.0006) [2023-03-07 05:04:40,407][118044] Updated weights for policy 0, policy_version 78480 (0.0006) [2023-03-07 05:04:41,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13152.3). Total num frames: 80371712. Throughput: 0: 13142.3. 
Samples: 80338159. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 05:04:41,086][117718] Avg episode reward: [(0, '2807.103')] [2023-03-07 05:04:41,200][118044] Updated weights for policy 0, policy_version 78490 (0.0007) [2023-03-07 05:04:41,982][118044] Updated weights for policy 0, policy_version 78500 (0.0006) [2023-03-07 05:04:42,760][118044] Updated weights for policy 0, policy_version 78510 (0.0006) [2023-03-07 05:04:43,534][118044] Updated weights for policy 0, policy_version 78520 (0.0006) [2023-03-07 05:04:44,319][118044] Updated weights for policy 0, policy_version 78530 (0.0006) [2023-03-07 05:04:45,101][118044] Updated weights for policy 0, policy_version 78540 (0.0006) [2023-03-07 05:04:45,915][118044] Updated weights for policy 0, policy_version 78550 (0.0006) [2023-03-07 05:04:46,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13152.3). Total num frames: 80437248. Throughput: 0: 13134.3. Samples: 80416955. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 05:04:46,086][117718] Avg episode reward: [(0, '2897.517')] [2023-03-07 05:04:46,693][118044] Updated weights for policy 0, policy_version 78560 (0.0007) [2023-03-07 05:04:47,502][118044] Updated weights for policy 0, policy_version 78570 (0.0006) [2023-03-07 05:04:48,267][118044] Updated weights for policy 0, policy_version 78580 (0.0006) [2023-03-07 05:04:49,037][118044] Updated weights for policy 0, policy_version 78590 (0.0006) [2023-03-07 05:04:49,814][118044] Updated weights for policy 0, policy_version 78600 (0.0006) [2023-03-07 05:04:50,601][118044] Updated weights for policy 0, policy_version 78610 (0.0006) [2023-03-07 05:04:51,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13152.3). Total num frames: 80502784. Throughput: 0: 13117.7. Samples: 80495053. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 05:04:51,086][117718] Avg episode reward: [(0, '2911.314')] [2023-03-07 05:04:51,397][118044] Updated weights for policy 0, policy_version 78620 (0.0006) [2023-03-07 05:04:52,158][118044] Updated weights for policy 0, policy_version 78630 (0.0006) [2023-03-07 05:04:52,957][118044] Updated weights for policy 0, policy_version 78640 (0.0006) [2023-03-07 05:04:53,722][118044] Updated weights for policy 0, policy_version 78650 (0.0006) [2023-03-07 05:04:54,518][118044] Updated weights for policy 0, policy_version 78660 (0.0005) [2023-03-07 05:04:55,293][118044] Updated weights for policy 0, policy_version 78670 (0.0007) [2023-03-07 05:04:56,084][118044] Updated weights for policy 0, policy_version 78680 (0.0006) [2023-03-07 05:04:56,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13152.3). Total num frames: 80568320. Throughput: 0: 13115.1. Samples: 80534328. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 05:04:56,086][117718] Avg episode reward: [(0, '2809.074')] [2023-03-07 05:04:56,853][118044] Updated weights for policy 0, policy_version 78690 (0.0006) [2023-03-07 05:04:57,625][118044] Updated weights for policy 0, policy_version 78700 (0.0006) [2023-03-07 05:04:58,421][118044] Updated weights for policy 0, policy_version 78710 (0.0006) [2023-03-07 05:04:59,208][118044] Updated weights for policy 0, policy_version 78720 (0.0006) [2023-03-07 05:04:59,980][118044] Updated weights for policy 0, policy_version 78730 (0.0007) [2023-03-07 05:05:00,768][118044] Updated weights for policy 0, policy_version 78740 (0.0006) [2023-03-07 05:05:01,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13148.9). Total num frames: 80633856. Throughput: 0: 13120.4. Samples: 80613099. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 05:05:01,086][117718] Avg episode reward: [(0, '2866.620')] [2023-03-07 05:05:01,547][118044] Updated weights for policy 0, policy_version 78750 (0.0007) [2023-03-07 05:05:02,315][118044] Updated weights for policy 0, policy_version 78760 (0.0005) [2023-03-07 05:05:03,096][118044] Updated weights for policy 0, policy_version 78770 (0.0006) [2023-03-07 05:05:03,891][118044] Updated weights for policy 0, policy_version 78780 (0.0006) [2023-03-07 05:05:04,663][118044] Updated weights for policy 0, policy_version 78790 (0.0006) [2023-03-07 05:05:05,454][118044] Updated weights for policy 0, policy_version 78800 (0.0006) [2023-03-07 05:05:06,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13124.3, 300 sec: 13148.9). Total num frames: 80699392. Throughput: 0: 13100.2. Samples: 80691573. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 05:05:06,086][117718] Avg episode reward: [(0, '2950.310')] [2023-03-07 05:05:06,239][118044] Updated weights for policy 0, policy_version 78810 (0.0006) [2023-03-07 05:05:07,007][118044] Updated weights for policy 0, policy_version 78820 (0.0006) [2023-03-07 05:05:07,791][118044] Updated weights for policy 0, policy_version 78830 (0.0006) [2023-03-07 05:05:08,573][118044] Updated weights for policy 0, policy_version 78840 (0.0006) [2023-03-07 05:05:09,354][118044] Updated weights for policy 0, policy_version 78850 (0.0006) [2023-03-07 05:05:10,138][118044] Updated weights for policy 0, policy_version 78860 (0.0006) [2023-03-07 05:05:10,908][118044] Updated weights for policy 0, policy_version 78870 (0.0006) [2023-03-07 05:05:11,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13148.9). Total num frames: 80764928. Throughput: 0: 13108.5. Samples: 80731088. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 05:05:11,086][117718] Avg episode reward: [(0, '2963.493')] [2023-03-07 05:05:11,698][118044] Updated weights for policy 0, policy_version 78880 (0.0007) [2023-03-07 05:05:12,486][118044] Updated weights for policy 0, policy_version 78890 (0.0007) [2023-03-07 05:05:13,277][118044] Updated weights for policy 0, policy_version 78900 (0.0006) [2023-03-07 05:05:14,040][118044] Updated weights for policy 0, policy_version 78910 (0.0006) [2023-03-07 05:05:14,833][118044] Updated weights for policy 0, policy_version 78920 (0.0006) [2023-03-07 05:05:15,618][118044] Updated weights for policy 0, policy_version 78930 (0.0006) [2023-03-07 05:05:16,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13145.4). Total num frames: 80830464. Throughput: 0: 13109.5. Samples: 80809579. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 05:05:16,086][117718] Avg episode reward: [(0, '2884.018')] [2023-03-07 05:05:16,391][118044] Updated weights for policy 0, policy_version 78940 (0.0006) [2023-03-07 05:05:17,182][118044] Updated weights for policy 0, policy_version 78950 (0.0006) [2023-03-07 05:05:17,965][118044] Updated weights for policy 0, policy_version 78960 (0.0006) [2023-03-07 05:05:18,727][118044] Updated weights for policy 0, policy_version 78970 (0.0006) [2023-03-07 05:05:19,510][118044] Updated weights for policy 0, policy_version 78980 (0.0006) [2023-03-07 05:05:20,294][118044] Updated weights for policy 0, policy_version 78990 (0.0006) [2023-03-07 05:05:21,072][118044] Updated weights for policy 0, policy_version 79000 (0.0008) [2023-03-07 05:05:21,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13145.4). Total num frames: 80896000. Throughput: 0: 13103.2. Samples: 80888394. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 05:05:21,086][117718] Avg episode reward: [(0, '2811.882')] [2023-03-07 05:05:21,852][118044] Updated weights for policy 0, policy_version 79010 (0.0007) [2023-03-07 05:05:22,609][118044] Updated weights for policy 0, policy_version 79020 (0.0005) [2023-03-07 05:05:23,389][118044] Updated weights for policy 0, policy_version 79030 (0.0007) [2023-03-07 05:05:24,180][118044] Updated weights for policy 0, policy_version 79040 (0.0006) [2023-03-07 05:05:24,966][118044] Updated weights for policy 0, policy_version 79050 (0.0006) [2023-03-07 05:05:25,743][118044] Updated weights for policy 0, policy_version 79060 (0.0006) [2023-03-07 05:05:26,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13145.4). Total num frames: 80961536. Throughput: 0: 13102.8. Samples: 80927784. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:05:26,086][117718] Avg episode reward: [(0, '2870.183')] [2023-03-07 05:05:26,509][118044] Updated weights for policy 0, policy_version 79070 (0.0006) [2023-03-07 05:05:27,301][118044] Updated weights for policy 0, policy_version 79080 (0.0006) [2023-03-07 05:05:28,093][118044] Updated weights for policy 0, policy_version 79090 (0.0006) [2023-03-07 05:05:28,859][118044] Updated weights for policy 0, policy_version 79100 (0.0007) [2023-03-07 05:05:29,630][118044] Updated weights for policy 0, policy_version 79110 (0.0006) [2023-03-07 05:05:30,401][118044] Updated weights for policy 0, policy_version 79120 (0.0006) [2023-03-07 05:05:31,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13145.4). Total num frames: 81027072. Throughput: 0: 13106.4. Samples: 81006742. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:05:31,086][117718] Avg episode reward: [(0, '2929.938')] [2023-03-07 05:05:31,179][118044] Updated weights for policy 0, policy_version 79130 (0.0006) [2023-03-07 05:05:31,967][118044] Updated weights for policy 0, policy_version 79140 (0.0006) [2023-03-07 05:05:32,779][118044] Updated weights for policy 0, policy_version 79150 (0.0006) [2023-03-07 05:05:33,549][118044] Updated weights for policy 0, policy_version 79160 (0.0006) [2023-03-07 05:05:34,312][118044] Updated weights for policy 0, policy_version 79170 (0.0005) [2023-03-07 05:05:35,096][118044] Updated weights for policy 0, policy_version 79180 (0.0006) [2023-03-07 05:05:35,869][118044] Updated weights for policy 0, policy_version 79190 (0.0005) [2023-03-07 05:05:36,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13124.3, 300 sec: 13148.9). Total num frames: 81093632. Throughput: 0: 13124.1. Samples: 81085639. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:05:36,086][117718] Avg episode reward: [(0, '2903.958')] [2023-03-07 05:05:36,647][118044] Updated weights for policy 0, policy_version 79200 (0.0006) [2023-03-07 05:05:37,408][118044] Updated weights for policy 0, policy_version 79210 (0.0006) [2023-03-07 05:05:38,194][118044] Updated weights for policy 0, policy_version 79220 (0.0006) [2023-03-07 05:05:38,966][118044] Updated weights for policy 0, policy_version 79230 (0.0006) [2023-03-07 05:05:39,721][118044] Updated weights for policy 0, policy_version 79240 (0.0006) [2023-03-07 05:05:40,508][118044] Updated weights for policy 0, policy_version 79250 (0.0006) [2023-03-07 05:05:41,086][117718] Fps is (10 sec: 13209.7, 60 sec: 13124.3, 300 sec: 13148.9). Total num frames: 81159168. Throughput: 0: 13135.3. Samples: 81125416. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:05:41,086][117718] Avg episode reward: [(0, '2844.742')] [2023-03-07 05:05:41,290][118044] Updated weights for policy 0, policy_version 79260 (0.0007) [2023-03-07 05:05:42,073][118044] Updated weights for policy 0, policy_version 79270 (0.0007) [2023-03-07 05:05:42,849][118044] Updated weights for policy 0, policy_version 79280 (0.0006) [2023-03-07 05:05:43,630][118044] Updated weights for policy 0, policy_version 79290 (0.0007) [2023-03-07 05:05:44,400][118044] Updated weights for policy 0, policy_version 79300 (0.0006) [2023-03-07 05:05:45,185][118044] Updated weights for policy 0, policy_version 79310 (0.0007) [2023-03-07 05:05:45,950][118044] Updated weights for policy 0, policy_version 79320 (0.0006) [2023-03-07 05:05:46,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13124.3, 300 sec: 13148.9). Total num frames: 81224704. Throughput: 0: 13140.4. Samples: 81204419. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:05:46,086][117718] Avg episode reward: [(0, '2690.981')] [2023-03-07 05:05:46,720][118044] Updated weights for policy 0, policy_version 79330 (0.0006) [2023-03-07 05:05:47,496][118044] Updated weights for policy 0, policy_version 79340 (0.0006) [2023-03-07 05:05:48,277][118044] Updated weights for policy 0, policy_version 79350 (0.0006) [2023-03-07 05:05:49,043][118044] Updated weights for policy 0, policy_version 79360 (0.0007) [2023-03-07 05:05:49,833][118044] Updated weights for policy 0, policy_version 79370 (0.0006) [2023-03-07 05:05:50,609][118044] Updated weights for policy 0, policy_version 79380 (0.0006) [2023-03-07 05:05:51,085][117718] Fps is (10 sec: 13209.8, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 81291264. Throughput: 0: 13154.1. Samples: 81283506. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:05:51,086][117718] Avg episode reward: [(0, '2726.175')] [2023-03-07 05:05:51,394][118044] Updated weights for policy 0, policy_version 79390 (0.0006) [2023-03-07 05:05:52,193][118044] Updated weights for policy 0, policy_version 79400 (0.0006) [2023-03-07 05:05:52,967][118044] Updated weights for policy 0, policy_version 79410 (0.0006) [2023-03-07 05:05:53,748][118044] Updated weights for policy 0, policy_version 79420 (0.0006) [2023-03-07 05:05:54,541][118044] Updated weights for policy 0, policy_version 79430 (0.0006) [2023-03-07 05:05:55,333][118044] Updated weights for policy 0, policy_version 79440 (0.0006) [2023-03-07 05:05:56,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13124.2, 300 sec: 13145.4). Total num frames: 81355776. Throughput: 0: 13148.0. Samples: 81322751. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:05:56,086][117718] Avg episode reward: [(0, '2762.410')] [2023-03-07 05:05:56,099][118044] Updated weights for policy 0, policy_version 79450 (0.0006) [2023-03-07 05:05:56,903][118044] Updated weights for policy 0, policy_version 79460 (0.0006) [2023-03-07 05:05:57,681][118044] Updated weights for policy 0, policy_version 79470 (0.0006) [2023-03-07 05:05:58,447][118044] Updated weights for policy 0, policy_version 79480 (0.0005) [2023-03-07 05:05:59,238][118044] Updated weights for policy 0, policy_version 79490 (0.0006) [2023-03-07 05:06:00,008][118044] Updated weights for policy 0, policy_version 79500 (0.0006) [2023-03-07 05:06:00,778][118044] Updated weights for policy 0, policy_version 79510 (0.0006) [2023-03-07 05:06:01,085][117718] Fps is (10 sec: 13004.7, 60 sec: 13124.3, 300 sec: 13145.4). Total num frames: 81421312. Throughput: 0: 13143.3. Samples: 81401027. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:06:01,086][117718] Avg episode reward: [(0, '2752.919')] [2023-03-07 05:06:01,558][118044] Updated weights for policy 0, policy_version 79520 (0.0006) [2023-03-07 05:06:02,337][118044] Updated weights for policy 0, policy_version 79530 (0.0006) [2023-03-07 05:06:03,115][118044] Updated weights for policy 0, policy_version 79540 (0.0006) [2023-03-07 05:06:03,897][118044] Updated weights for policy 0, policy_version 79550 (0.0006) [2023-03-07 05:06:04,679][118044] Updated weights for policy 0, policy_version 79560 (0.0006) [2023-03-07 05:06:05,462][118044] Updated weights for policy 0, policy_version 79570 (0.0006) [2023-03-07 05:06:06,085][117718] Fps is (10 sec: 13209.8, 60 sec: 13141.4, 300 sec: 13148.9). Total num frames: 81487872. Throughput: 0: 13148.0. Samples: 81480053. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:06:06,086][117718] Avg episode reward: [(0, '2721.742')] [2023-03-07 05:06:06,254][118044] Updated weights for policy 0, policy_version 79580 (0.0007) [2023-03-07 05:06:07,023][118044] Updated weights for policy 0, policy_version 79590 (0.0006) [2023-03-07 05:06:07,806][118044] Updated weights for policy 0, policy_version 79600 (0.0006) [2023-03-07 05:06:08,434][117993] KL-divergence is very high: 5087.2388 [2023-03-07 05:06:08,594][118044] Updated weights for policy 0, policy_version 79610 (0.0006) [2023-03-07 05:06:09,376][118044] Updated weights for policy 0, policy_version 79620 (0.0006) [2023-03-07 05:06:10,144][118044] Updated weights for policy 0, policy_version 79630 (0.0007) [2023-03-07 05:06:10,908][118044] Updated weights for policy 0, policy_version 79640 (0.0006) [2023-03-07 05:06:11,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 81553408. Throughput: 0: 13144.2. Samples: 81519274. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:06:11,086][117718] Avg episode reward: [(0, '2659.773')] [2023-03-07 05:06:11,711][118044] Updated weights for policy 0, policy_version 79650 (0.0006) [2023-03-07 05:06:12,481][118044] Updated weights for policy 0, policy_version 79660 (0.0006) [2023-03-07 05:06:13,247][118044] Updated weights for policy 0, policy_version 79670 (0.0005) [2023-03-07 05:06:14,029][118044] Updated weights for policy 0, policy_version 79680 (0.0006) [2023-03-07 05:06:14,804][118044] Updated weights for policy 0, policy_version 79690 (0.0006) [2023-03-07 05:06:15,601][118044] Updated weights for policy 0, policy_version 79700 (0.0007) [2023-03-07 05:06:16,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 81618944. Throughput: 0: 13147.7. Samples: 81598387. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:06:16,086][117718] Avg episode reward: [(0, '2739.701')] [2023-03-07 05:06:16,371][118044] Updated weights for policy 0, policy_version 79710 (0.0006) [2023-03-07 05:06:17,157][118044] Updated weights for policy 0, policy_version 79720 (0.0007) [2023-03-07 05:06:17,937][118044] Updated weights for policy 0, policy_version 79730 (0.0006) [2023-03-07 05:06:18,698][118044] Updated weights for policy 0, policy_version 79740 (0.0006) [2023-03-07 05:06:19,467][118044] Updated weights for policy 0, policy_version 79750 (0.0006) [2023-03-07 05:06:20,253][118044] Updated weights for policy 0, policy_version 79760 (0.0005) [2023-03-07 05:06:21,040][118044] Updated weights for policy 0, policy_version 79770 (0.0006) [2023-03-07 05:06:21,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.4, 300 sec: 13145.4). Total num frames: 81684480. Throughput: 0: 13147.9. Samples: 81677294. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:06:21,086][117718] Avg episode reward: [(0, '2700.152')] [2023-03-07 05:06:21,830][118044] Updated weights for policy 0, policy_version 79780 (0.0006) [2023-03-07 05:06:22,601][118044] Updated weights for policy 0, policy_version 79790 (0.0005) [2023-03-07 05:06:23,383][118044] Updated weights for policy 0, policy_version 79800 (0.0007) [2023-03-07 05:06:24,164][118044] Updated weights for policy 0, policy_version 79810 (0.0007) [2023-03-07 05:06:24,963][118044] Updated weights for policy 0, policy_version 79820 (0.0007) [2023-03-07 05:06:25,749][118044] Updated weights for policy 0, policy_version 79830 (0.0006) [2023-03-07 05:06:26,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 81750016. Throughput: 0: 13137.7. Samples: 81716613. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:06:26,086][117718] Avg episode reward: [(0, '2620.951')] [2023-03-07 05:06:26,525][118044] Updated weights for policy 0, policy_version 79840 (0.0006) [2023-03-07 05:06:27,311][118044] Updated weights for policy 0, policy_version 79850 (0.0006) [2023-03-07 05:06:28,084][118044] Updated weights for policy 0, policy_version 79860 (0.0007) [2023-03-07 05:06:28,866][118044] Updated weights for policy 0, policy_version 79870 (0.0007) [2023-03-07 05:06:29,642][118044] Updated weights for policy 0, policy_version 79880 (0.0006) [2023-03-07 05:06:30,415][118044] Updated weights for policy 0, policy_version 79890 (0.0006) [2023-03-07 05:06:31,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.4, 300 sec: 13141.9). Total num frames: 81815552. Throughput: 0: 13125.6. Samples: 81795070. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:06:31,086][117718] Avg episode reward: [(0, '2628.267')] [2023-03-07 05:06:31,204][118044] Updated weights for policy 0, policy_version 79900 (0.0006) [2023-03-07 05:06:31,981][118044] Updated weights for policy 0, policy_version 79910 (0.0006) [2023-03-07 05:06:32,750][118044] Updated weights for policy 0, policy_version 79920 (0.0006) [2023-03-07 05:06:33,533][118044] Updated weights for policy 0, policy_version 79930 (0.0007) [2023-03-07 05:06:34,328][118044] Updated weights for policy 0, policy_version 79940 (0.0006) [2023-03-07 05:06:35,098][118044] Updated weights for policy 0, policy_version 79950 (0.0006) [2023-03-07 05:06:35,872][118044] Updated weights for policy 0, policy_version 79960 (0.0006) [2023-03-07 05:06:36,085][117718] Fps is (10 sec: 13107.4, 60 sec: 13124.3, 300 sec: 13141.9). Total num frames: 81881088. Throughput: 0: 13123.6. Samples: 81874067. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:06:36,086][117718] Avg episode reward: [(0, '2633.905')] [2023-03-07 05:06:36,090][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000079962_81881088.pth... 
[2023-03-07 05:06:36,121][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000076882_78727168.pth [2023-03-07 05:06:36,648][118044] Updated weights for policy 0, policy_version 79970 (0.0007) [2023-03-07 05:06:37,430][118044] Updated weights for policy 0, policy_version 79980 (0.0006) [2023-03-07 05:06:38,214][118044] Updated weights for policy 0, policy_version 79990 (0.0006) [2023-03-07 05:06:39,002][118044] Updated weights for policy 0, policy_version 80000 (0.0006) [2023-03-07 05:06:39,770][118044] Updated weights for policy 0, policy_version 80010 (0.0006) [2023-03-07 05:06:40,537][118044] Updated weights for policy 0, policy_version 80020 (0.0005) [2023-03-07 05:06:41,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13138.5). Total num frames: 81946624. Throughput: 0: 13123.7. Samples: 81913314. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:06:41,086][117718] Avg episode reward: [(0, '2594.135')] [2023-03-07 05:06:41,318][118044] Updated weights for policy 0, policy_version 80030 (0.0006) [2023-03-07 05:06:42,093][118044] Updated weights for policy 0, policy_version 80040 (0.0007) [2023-03-07 05:06:42,879][118044] Updated weights for policy 0, policy_version 80050 (0.0006) [2023-03-07 05:06:43,671][118044] Updated weights for policy 0, policy_version 80060 (0.0006) [2023-03-07 05:06:44,427][118044] Updated weights for policy 0, policy_version 80070 (0.0006) [2023-03-07 05:06:45,216][118044] Updated weights for policy 0, policy_version 80080 (0.0006) [2023-03-07 05:06:45,994][118044] Updated weights for policy 0, policy_version 80090 (0.0006) [2023-03-07 05:06:46,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13124.2, 300 sec: 13138.4). Total num frames: 82012160. Throughput: 0: 13136.4. Samples: 81992168. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:06:46,086][117718] Avg episode reward: [(0, '2664.394')] [2023-03-07 05:06:46,772][118044] Updated weights for policy 0, policy_version 80100 (0.0006) [2023-03-07 05:06:47,553][118044] Updated weights for policy 0, policy_version 80110 (0.0005) [2023-03-07 05:06:48,328][118044] Updated weights for policy 0, policy_version 80120 (0.0006) [2023-03-07 05:06:49,118][118044] Updated weights for policy 0, policy_version 80130 (0.0007) [2023-03-07 05:06:49,902][118044] Updated weights for policy 0, policy_version 80140 (0.0006) [2023-03-07 05:06:50,672][118044] Updated weights for policy 0, policy_version 80150 (0.0006) [2023-03-07 05:06:51,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13124.3, 300 sec: 13141.9). Total num frames: 82078720. Throughput: 0: 13138.2. Samples: 82071270. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:06:51,086][117718] Avg episode reward: [(0, '2721.416')] [2023-03-07 05:06:51,453][118044] Updated weights for policy 0, policy_version 80160 (0.0006) [2023-03-07 05:06:52,226][118044] Updated weights for policy 0, policy_version 80170 (0.0006) [2023-03-07 05:06:53,013][118044] Updated weights for policy 0, policy_version 80180 (0.0006) [2023-03-07 05:06:53,794][118044] Updated weights for policy 0, policy_version 80190 (0.0006) [2023-03-07 05:06:54,568][118044] Updated weights for policy 0, policy_version 80200 (0.0007) [2023-03-07 05:06:55,359][118044] Updated weights for policy 0, policy_version 80210 (0.0006) [2023-03-07 05:06:56,085][117718] Fps is (10 sec: 13209.8, 60 sec: 13141.4, 300 sec: 13138.4). Total num frames: 82144256. Throughput: 0: 13137.5. Samples: 82110462. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:06:56,086][117718] Avg episode reward: [(0, '2595.624')] [2023-03-07 05:06:56,142][118044] Updated weights for policy 0, policy_version 80220 (0.0006) [2023-03-07 05:06:56,915][118044] Updated weights for policy 0, policy_version 80230 (0.0007) [2023-03-07 05:06:57,684][118044] Updated weights for policy 0, policy_version 80240 (0.0006) [2023-03-07 05:06:58,468][118044] Updated weights for policy 0, policy_version 80250 (0.0005) [2023-03-07 05:06:59,245][118044] Updated weights for policy 0, policy_version 80260 (0.0007) [2023-03-07 05:07:00,022][118044] Updated weights for policy 0, policy_version 80270 (0.0005) [2023-03-07 05:07:00,810][118044] Updated weights for policy 0, policy_version 80280 (0.0005) [2023-03-07 05:07:01,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13138.4). Total num frames: 82209792. Throughput: 0: 13129.1. Samples: 82189197. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:07:01,086][117718] Avg episode reward: [(0, '2642.875')] [2023-03-07 05:07:01,595][118044] Updated weights for policy 0, policy_version 80290 (0.0006) [2023-03-07 05:07:02,370][118044] Updated weights for policy 0, policy_version 80300 (0.0006) [2023-03-07 05:07:03,152][118044] Updated weights for policy 0, policy_version 80310 (0.0006) [2023-03-07 05:07:03,919][118044] Updated weights for policy 0, policy_version 80320 (0.0006) [2023-03-07 05:07:04,706][118044] Updated weights for policy 0, policy_version 80330 (0.0006) [2023-03-07 05:07:05,509][118044] Updated weights for policy 0, policy_version 80340 (0.0007) [2023-03-07 05:07:06,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13124.2, 300 sec: 13138.4). Total num frames: 82275328. Throughput: 0: 13126.5. Samples: 82267990. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 4.0) [2023-03-07 05:07:06,086][117718] Avg episode reward: [(0, '2626.328')] [2023-03-07 05:07:06,292][118044] Updated weights for policy 0, policy_version 80350 (0.0006) [2023-03-07 05:07:07,090][118044] Updated weights for policy 0, policy_version 80360 (0.0006) [2023-03-07 05:07:07,877][118044] Updated weights for policy 0, policy_version 80370 (0.0006) [2023-03-07 05:07:08,649][118044] Updated weights for policy 0, policy_version 80380 (0.0006) [2023-03-07 05:07:09,437][118044] Updated weights for policy 0, policy_version 80390 (0.0007) [2023-03-07 05:07:10,220][118044] Updated weights for policy 0, policy_version 80400 (0.0007) [2023-03-07 05:07:10,996][118044] Updated weights for policy 0, policy_version 80410 (0.0007) [2023-03-07 05:07:11,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13138.4). Total num frames: 82340864. Throughput: 0: 13121.5. Samples: 82307076. Policy #0 lag: (min: 0.0, avg: 1.1, max: 4.0) [2023-03-07 05:07:11,086][117718] Avg episode reward: [(0, '2592.740')] [2023-03-07 05:07:11,782][118044] Updated weights for policy 0, policy_version 80420 (0.0007) [2023-03-07 05:07:12,548][118044] Updated weights for policy 0, policy_version 80430 (0.0006) [2023-03-07 05:07:13,328][118044] Updated weights for policy 0, policy_version 80440 (0.0006) [2023-03-07 05:07:14,103][118044] Updated weights for policy 0, policy_version 80450 (0.0007) [2023-03-07 05:07:14,893][118044] Updated weights for policy 0, policy_version 80460 (0.0006) [2023-03-07 05:07:15,660][118044] Updated weights for policy 0, policy_version 80470 (0.0007) [2023-03-07 05:07:16,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13138.4). Total num frames: 82406400. Throughput: 0: 13125.1. Samples: 82385702. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 4.0) [2023-03-07 05:07:16,086][117718] Avg episode reward: [(0, '2578.276')] [2023-03-07 05:07:16,441][118044] Updated weights for policy 0, policy_version 80480 (0.0006) [2023-03-07 05:07:17,230][118044] Updated weights for policy 0, policy_version 80490 (0.0006) [2023-03-07 05:07:17,993][118044] Updated weights for policy 0, policy_version 80500 (0.0006) [2023-03-07 05:07:18,769][118044] Updated weights for policy 0, policy_version 80510 (0.0006) [2023-03-07 05:07:19,546][118044] Updated weights for policy 0, policy_version 80520 (0.0007) [2023-03-07 05:07:20,327][118044] Updated weights for policy 0, policy_version 80530 (0.0006) [2023-03-07 05:07:21,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13138.5). Total num frames: 82471936. Throughput: 0: 13127.8. Samples: 82464819. Policy #0 lag: (min: 0.0, avg: 1.1, max: 4.0) [2023-03-07 05:07:21,086][117718] Avg episode reward: [(0, '2574.892')] [2023-03-07 05:07:21,120][118044] Updated weights for policy 0, policy_version 80540 (0.0006) [2023-03-07 05:07:21,905][118044] Updated weights for policy 0, policy_version 80550 (0.0005) [2023-03-07 05:07:22,682][118044] Updated weights for policy 0, policy_version 80560 (0.0006) [2023-03-07 05:07:23,471][118044] Updated weights for policy 0, policy_version 80570 (0.0007) [2023-03-07 05:07:24,242][118044] Updated weights for policy 0, policy_version 80580 (0.0006) [2023-03-07 05:07:25,034][118044] Updated weights for policy 0, policy_version 80590 (0.0006) [2023-03-07 05:07:25,801][118044] Updated weights for policy 0, policy_version 80600 (0.0007) [2023-03-07 05:07:26,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13138.4). Total num frames: 82537472. Throughput: 0: 13128.4. Samples: 82504093. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 4.0) [2023-03-07 05:07:26,086][117718] Avg episode reward: [(0, '2789.281')] [2023-03-07 05:07:26,572][118044] Updated weights for policy 0, policy_version 80610 (0.0006) [2023-03-07 05:07:27,364][118044] Updated weights for policy 0, policy_version 80620 (0.0006) [2023-03-07 05:07:28,141][118044] Updated weights for policy 0, policy_version 80630 (0.0006) [2023-03-07 05:07:28,930][118044] Updated weights for policy 0, policy_version 80640 (0.0006) [2023-03-07 05:07:29,707][118044] Updated weights for policy 0, policy_version 80650 (0.0006) [2023-03-07 05:07:30,478][118044] Updated weights for policy 0, policy_version 80660 (0.0006) [2023-03-07 05:07:31,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13124.2, 300 sec: 13138.4). Total num frames: 82603008. Throughput: 0: 13127.2. Samples: 82582893. Policy #0 lag: (min: 0.0, avg: 1.1, max: 4.0) [2023-03-07 05:07:31,086][117718] Avg episode reward: [(0, '2674.446')] [2023-03-07 05:07:31,261][118044] Updated weights for policy 0, policy_version 80670 (0.0007) [2023-03-07 05:07:32,029][118044] Updated weights for policy 0, policy_version 80680 (0.0006) [2023-03-07 05:07:32,817][118044] Updated weights for policy 0, policy_version 80690 (0.0006) [2023-03-07 05:07:33,606][118044] Updated weights for policy 0, policy_version 80700 (0.0006) [2023-03-07 05:07:34,369][118044] Updated weights for policy 0, policy_version 80710 (0.0005) [2023-03-07 05:07:35,141][118044] Updated weights for policy 0, policy_version 80720 (0.0006) [2023-03-07 05:07:35,905][118044] Updated weights for policy 0, policy_version 80730 (0.0006) [2023-03-07 05:07:36,085][117718] Fps is (10 sec: 13209.5, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 82669568. Throughput: 0: 13127.8. Samples: 82662022. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 4.0) [2023-03-07 05:07:36,086][117718] Avg episode reward: [(0, '2704.090')] [2023-03-07 05:07:36,681][118044] Updated weights for policy 0, policy_version 80740 (0.0006) [2023-03-07 05:07:37,472][118044] Updated weights for policy 0, policy_version 80750 (0.0007) [2023-03-07 05:07:38,231][118044] Updated weights for policy 0, policy_version 80760 (0.0006) [2023-03-07 05:07:38,997][118044] Updated weights for policy 0, policy_version 80770 (0.0006) [2023-03-07 05:07:39,777][118044] Updated weights for policy 0, policy_version 80780 (0.0006) [2023-03-07 05:07:40,557][118044] Updated weights for policy 0, policy_version 80790 (0.0006) [2023-03-07 05:07:41,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13141.3, 300 sec: 13138.4). Total num frames: 82735104. Throughput: 0: 13134.9. Samples: 82701534. Policy #0 lag: (min: 0.0, avg: 1.1, max: 4.0) [2023-03-07 05:07:41,086][117718] Avg episode reward: [(0, '2671.266')] [2023-03-07 05:07:41,346][118044] Updated weights for policy 0, policy_version 80800 (0.0007) [2023-03-07 05:07:42,126][118044] Updated weights for policy 0, policy_version 80810 (0.0006) [2023-03-07 05:07:42,903][118044] Updated weights for policy 0, policy_version 80820 (0.0006) [2023-03-07 05:07:43,688][118044] Updated weights for policy 0, policy_version 80830 (0.0007) [2023-03-07 05:07:44,465][118044] Updated weights for policy 0, policy_version 80840 (0.0007) [2023-03-07 05:07:45,240][118044] Updated weights for policy 0, policy_version 80850 (0.0006) [2023-03-07 05:07:46,006][118044] Updated weights for policy 0, policy_version 80860 (0.0006) [2023-03-07 05:07:46,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.4, 300 sec: 13135.0). Total num frames: 82800640. Throughput: 0: 13138.3. Samples: 82780422. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 4.0) [2023-03-07 05:07:46,096][117718] Avg episode reward: [(0, '2659.447')] [2023-03-07 05:07:46,782][118044] Updated weights for policy 0, policy_version 80870 (0.0006) [2023-03-07 05:07:47,577][118044] Updated weights for policy 0, policy_version 80880 (0.0006) [2023-03-07 05:07:48,337][118044] Updated weights for policy 0, policy_version 80890 (0.0006) [2023-03-07 05:07:49,117][118044] Updated weights for policy 0, policy_version 80900 (0.0006) [2023-03-07 05:07:49,905][118044] Updated weights for policy 0, policy_version 80910 (0.0006) [2023-03-07 05:07:50,688][118044] Updated weights for policy 0, policy_version 80920 (0.0006) [2023-03-07 05:07:51,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13135.0). Total num frames: 82866176. Throughput: 0: 13141.8. Samples: 82859371. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:07:51,086][117718] Avg episode reward: [(0, '2655.977')] [2023-03-07 05:07:51,454][118044] Updated weights for policy 0, policy_version 80930 (0.0006) [2023-03-07 05:07:52,258][118044] Updated weights for policy 0, policy_version 80940 (0.0006) [2023-03-07 05:07:53,031][118044] Updated weights for policy 0, policy_version 80950 (0.0007) [2023-03-07 05:07:53,806][118044] Updated weights for policy 0, policy_version 80960 (0.0006) [2023-03-07 05:07:54,585][118044] Updated weights for policy 0, policy_version 80970 (0.0006) [2023-03-07 05:07:55,377][118044] Updated weights for policy 0, policy_version 80980 (0.0007) [2023-03-07 05:07:56,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13141.3, 300 sec: 13138.4). Total num frames: 82932736. Throughput: 0: 13149.5. Samples: 82898806. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:07:56,086][117718] Avg episode reward: [(0, '2674.639')] [2023-03-07 05:07:56,137][118044] Updated weights for policy 0, policy_version 80990 (0.0006) [2023-03-07 05:07:56,910][118044] Updated weights for policy 0, policy_version 81000 (0.0006) [2023-03-07 05:07:57,690][118044] Updated weights for policy 0, policy_version 81010 (0.0006) [2023-03-07 05:07:58,468][118044] Updated weights for policy 0, policy_version 81020 (0.0006) [2023-03-07 05:07:59,250][118044] Updated weights for policy 0, policy_version 81030 (0.0006) [2023-03-07 05:08:00,023][118044] Updated weights for policy 0, policy_version 81040 (0.0006) [2023-03-07 05:08:00,814][118044] Updated weights for policy 0, policy_version 81050 (0.0006) [2023-03-07 05:08:01,086][117718] Fps is (10 sec: 13209.4, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 82998272. Throughput: 0: 13159.6. Samples: 82977886. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:08:01,086][117718] Avg episode reward: [(0, '2628.231')] [2023-03-07 05:08:01,599][118044] Updated weights for policy 0, policy_version 81060 (0.0006) [2023-03-07 05:08:02,360][118044] Updated weights for policy 0, policy_version 81070 (0.0007) [2023-03-07 05:08:03,137][118044] Updated weights for policy 0, policy_version 81080 (0.0007) [2023-03-07 05:08:03,922][118044] Updated weights for policy 0, policy_version 81090 (0.0006) [2023-03-07 05:08:04,688][118044] Updated weights for policy 0, policy_version 81100 (0.0007) [2023-03-07 05:08:05,463][118044] Updated weights for policy 0, policy_version 81110 (0.0007) [2023-03-07 05:08:06,085][117718] Fps is (10 sec: 13209.8, 60 sec: 13158.4, 300 sec: 13138.4). Total num frames: 83064832. Throughput: 0: 13157.4. Samples: 83056903. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:08:06,086][117718] Avg episode reward: [(0, '2734.974')] [2023-03-07 05:08:06,230][118044] Updated weights for policy 0, policy_version 81120 (0.0006) [2023-03-07 05:08:07,018][118044] Updated weights for policy 0, policy_version 81130 (0.0006) [2023-03-07 05:08:07,805][118044] Updated weights for policy 0, policy_version 81140 (0.0006) [2023-03-07 05:08:08,574][118044] Updated weights for policy 0, policy_version 81150 (0.0006) [2023-03-07 05:08:09,338][118044] Updated weights for policy 0, policy_version 81160 (0.0006) [2023-03-07 05:08:10,127][118044] Updated weights for policy 0, policy_version 81170 (0.0006) [2023-03-07 05:08:10,901][118044] Updated weights for policy 0, policy_version 81180 (0.0006) [2023-03-07 05:08:11,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13138.4). Total num frames: 83130368. Throughput: 0: 13162.0. Samples: 83096384. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:08:11,086][117718] Avg episode reward: [(0, '2752.054')] [2023-03-07 05:08:11,673][118044] Updated weights for policy 0, policy_version 81190 (0.0006) [2023-03-07 05:08:12,456][118044] Updated weights for policy 0, policy_version 81200 (0.0006) [2023-03-07 05:08:13,227][118044] Updated weights for policy 0, policy_version 81210 (0.0006) [2023-03-07 05:08:14,007][118044] Updated weights for policy 0, policy_version 81220 (0.0006) [2023-03-07 05:08:14,789][118044] Updated weights for policy 0, policy_version 81230 (0.0006) [2023-03-07 05:08:15,561][118044] Updated weights for policy 0, policy_version 81240 (0.0006) [2023-03-07 05:08:16,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13158.4, 300 sec: 13138.4). Total num frames: 83195904. Throughput: 0: 13169.6. Samples: 83175525. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:08:16,086][117718] Avg episode reward: [(0, '2582.645')] [2023-03-07 05:08:16,346][118044] Updated weights for policy 0, policy_version 81250 (0.0006) [2023-03-07 05:08:17,118][118044] Updated weights for policy 0, policy_version 81260 (0.0006) [2023-03-07 05:08:17,885][118044] Updated weights for policy 0, policy_version 81270 (0.0006) [2023-03-07 05:08:18,683][118044] Updated weights for policy 0, policy_version 81280 (0.0006) [2023-03-07 05:08:19,465][118044] Updated weights for policy 0, policy_version 81290 (0.0006) [2023-03-07 05:08:20,250][118044] Updated weights for policy 0, policy_version 81300 (0.0006) [2023-03-07 05:08:21,033][118044] Updated weights for policy 0, policy_version 81310 (0.0007) [2023-03-07 05:08:21,085][117718] Fps is (10 sec: 13107.4, 60 sec: 13158.4, 300 sec: 13138.4). Total num frames: 83261440. Throughput: 0: 13162.3. Samples: 83254326. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:08:21,086][117718] Avg episode reward: [(0, '2642.792')] [2023-03-07 05:08:21,816][118044] Updated weights for policy 0, policy_version 81320 (0.0007) [2023-03-07 05:08:22,601][118044] Updated weights for policy 0, policy_version 81330 (0.0006) [2023-03-07 05:08:23,385][118044] Updated weights for policy 0, policy_version 81340 (0.0006) [2023-03-07 05:08:24,153][118044] Updated weights for policy 0, policy_version 81350 (0.0006) [2023-03-07 05:08:24,947][118044] Updated weights for policy 0, policy_version 81360 (0.0006) [2023-03-07 05:08:25,713][118044] Updated weights for policy 0, policy_version 81370 (0.0006) [2023-03-07 05:08:26,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13135.0). Total num frames: 83326976. Throughput: 0: 13154.4. Samples: 83293484. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:08:26,086][117718] Avg episode reward: [(0, '2656.109')] [2023-03-07 05:08:26,506][118044] Updated weights for policy 0, policy_version 81380 (0.0006) [2023-03-07 05:08:27,282][118044] Updated weights for policy 0, policy_version 81390 (0.0006) [2023-03-07 05:08:28,045][118044] Updated weights for policy 0, policy_version 81400 (0.0007) [2023-03-07 05:08:28,843][118044] Updated weights for policy 0, policy_version 81410 (0.0007) [2023-03-07 05:08:29,618][118044] Updated weights for policy 0, policy_version 81420 (0.0007) [2023-03-07 05:08:30,397][118044] Updated weights for policy 0, policy_version 81430 (0.0006) [2023-03-07 05:08:31,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13138.4). Total num frames: 83392512. Throughput: 0: 13153.3. Samples: 83372322. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:08:31,086][117718] Avg episode reward: [(0, '2686.662')] [2023-03-07 05:08:31,195][118044] Updated weights for policy 0, policy_version 81440 (0.0006) [2023-03-07 05:08:31,976][118044] Updated weights for policy 0, policy_version 81450 (0.0006) [2023-03-07 05:08:32,750][118044] Updated weights for policy 0, policy_version 81460 (0.0006) [2023-03-07 05:08:33,541][118044] Updated weights for policy 0, policy_version 81470 (0.0007) [2023-03-07 05:08:34,321][118044] Updated weights for policy 0, policy_version 81480 (0.0007) [2023-03-07 05:08:35,090][118044] Updated weights for policy 0, policy_version 81490 (0.0005) [2023-03-07 05:08:35,873][118044] Updated weights for policy 0, policy_version 81500 (0.0006) [2023-03-07 05:08:36,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 83458048. Throughput: 0: 13146.9. Samples: 83450982. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:08:36,086][117718] Avg episode reward: [(0, '2691.085')] [2023-03-07 05:08:36,097][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000081503_83459072.pth... [2023-03-07 05:08:36,128][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000078424_80306176.pth [2023-03-07 05:08:36,637][118044] Updated weights for policy 0, policy_version 81510 (0.0006) [2023-03-07 05:08:37,407][118044] Updated weights for policy 0, policy_version 81520 (0.0005) [2023-03-07 05:08:38,205][118044] Updated weights for policy 0, policy_version 81530 (0.0006) [2023-03-07 05:08:38,981][118044] Updated weights for policy 0, policy_version 81540 (0.0007) [2023-03-07 05:08:39,764][118044] Updated weights for policy 0, policy_version 81550 (0.0006) [2023-03-07 05:08:40,558][118044] Updated weights for policy 0, policy_version 81560 (0.0006) [2023-03-07 05:08:41,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 83523584. Throughput: 0: 13148.5. Samples: 83490488. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) [2023-03-07 05:08:41,086][117718] Avg episode reward: [(0, '2701.847')] [2023-03-07 05:08:41,313][118044] Updated weights for policy 0, policy_version 81570 (0.0006) [2023-03-07 05:08:42,105][118044] Updated weights for policy 0, policy_version 81580 (0.0007) [2023-03-07 05:08:42,885][118044] Updated weights for policy 0, policy_version 81590 (0.0007) [2023-03-07 05:08:43,645][118044] Updated weights for policy 0, policy_version 81600 (0.0006) [2023-03-07 05:08:44,434][118044] Updated weights for policy 0, policy_version 81610 (0.0007) [2023-03-07 05:08:45,205][118044] Updated weights for policy 0, policy_version 81620 (0.0006) [2023-03-07 05:08:45,990][118044] Updated weights for policy 0, policy_version 81630 (0.0007) [2023-03-07 05:08:46,085][117718] Fps is (10 sec: 13209.8, 60 sec: 13158.4, 300 sec: 13135.0). Total num frames: 83590144. Throughput: 0: 13142.8. Samples: 83569311. Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) [2023-03-07 05:08:46,086][117718] Avg episode reward: [(0, '2742.325')] [2023-03-07 05:08:46,765][118044] Updated weights for policy 0, policy_version 81640 (0.0006) [2023-03-07 05:08:47,549][118044] Updated weights for policy 0, policy_version 81650 (0.0006) [2023-03-07 05:08:48,324][118044] Updated weights for policy 0, policy_version 81660 (0.0006) [2023-03-07 05:08:49,099][118044] Updated weights for policy 0, policy_version 81670 (0.0005) [2023-03-07 05:08:49,869][118044] Updated weights for policy 0, policy_version 81680 (0.0006) [2023-03-07 05:08:50,650][118044] Updated weights for policy 0, policy_version 81690 (0.0006) [2023-03-07 05:08:51,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13135.0). Total num frames: 83655680. Throughput: 0: 13146.4. Samples: 83648491. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) [2023-03-07 05:08:51,086][117718] Avg episode reward: [(0, '2765.689')] [2023-03-07 05:08:51,426][118044] Updated weights for policy 0, policy_version 81700 (0.0007) [2023-03-07 05:08:52,224][118044] Updated weights for policy 0, policy_version 81710 (0.0007) [2023-03-07 05:08:52,998][118044] Updated weights for policy 0, policy_version 81720 (0.0005) [2023-03-07 05:08:53,782][118044] Updated weights for policy 0, policy_version 81730 (0.0006) [2023-03-07 05:08:54,565][118044] Updated weights for policy 0, policy_version 81740 (0.0006) [2023-03-07 05:08:55,346][118044] Updated weights for policy 0, policy_version 81750 (0.0005) [2023-03-07 05:08:56,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.4, 300 sec: 13135.0). Total num frames: 83721216. Throughput: 0: 13136.8. Samples: 83687540. Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) [2023-03-07 05:08:56,086][117718] Avg episode reward: [(0, '2707.174')] [2023-03-07 05:08:56,119][118044] Updated weights for policy 0, policy_version 81760 (0.0006) [2023-03-07 05:08:56,906][118044] Updated weights for policy 0, policy_version 81770 (0.0006) [2023-03-07 05:08:57,675][118044] Updated weights for policy 0, policy_version 81780 (0.0006) [2023-03-07 05:08:58,476][118044] Updated weights for policy 0, policy_version 81790 (0.0006) [2023-03-07 05:08:59,258][118044] Updated weights for policy 0, policy_version 81800 (0.0006) [2023-03-07 05:09:00,053][118044] Updated weights for policy 0, policy_version 81810 (0.0006) [2023-03-07 05:09:00,840][118044] Updated weights for policy 0, policy_version 81820 (0.0006) [2023-03-07 05:09:01,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 83786752. Throughput: 0: 13127.3. Samples: 83766256. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) [2023-03-07 05:09:01,086][117718] Avg episode reward: [(0, '2687.127')] [2023-03-07 05:09:01,609][118044] Updated weights for policy 0, policy_version 81830 (0.0006) [2023-03-07 05:09:02,384][118044] Updated weights for policy 0, policy_version 81840 (0.0006) [2023-03-07 05:09:03,158][118044] Updated weights for policy 0, policy_version 81850 (0.0006) [2023-03-07 05:09:03,961][118044] Updated weights for policy 0, policy_version 81860 (0.0007) [2023-03-07 05:09:04,731][118044] Updated weights for policy 0, policy_version 81870 (0.0006) [2023-03-07 05:09:05,508][118044] Updated weights for policy 0, policy_version 81880 (0.0008) [2023-03-07 05:09:06,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13135.0). Total num frames: 83852288. Throughput: 0: 13124.5. Samples: 83844930. Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) [2023-03-07 05:09:06,096][117718] Avg episode reward: [(0, '2675.817')] [2023-03-07 05:09:06,278][118044] Updated weights for policy 0, policy_version 81890 (0.0006) [2023-03-07 05:09:07,060][118044] Updated weights for policy 0, policy_version 81900 (0.0006) [2023-03-07 05:09:07,847][118044] Updated weights for policy 0, policy_version 81910 (0.0006) [2023-03-07 05:09:08,609][118044] Updated weights for policy 0, policy_version 81920 (0.0007) [2023-03-07 05:09:09,377][118044] Updated weights for policy 0, policy_version 81930 (0.0006) [2023-03-07 05:09:10,160][118044] Updated weights for policy 0, policy_version 81940 (0.0006) [2023-03-07 05:09:10,938][118044] Updated weights for policy 0, policy_version 81950 (0.0007) [2023-03-07 05:09:11,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13135.0). Total num frames: 83917824. Throughput: 0: 13130.1. Samples: 83884337. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) [2023-03-07 05:09:11,097][117718] Avg episode reward: [(0, '2693.698')] [2023-03-07 05:09:11,708][118044] Updated weights for policy 0, policy_version 81960 (0.0006) [2023-03-07 05:09:12,488][118044] Updated weights for policy 0, policy_version 81970 (0.0006) [2023-03-07 05:09:13,270][118044] Updated weights for policy 0, policy_version 81980 (0.0006) [2023-03-07 05:09:14,039][118044] Updated weights for policy 0, policy_version 81990 (0.0007) [2023-03-07 05:09:14,824][118044] Updated weights for policy 0, policy_version 82000 (0.0006) [2023-03-07 05:09:15,617][118044] Updated weights for policy 0, policy_version 82010 (0.0006) [2023-03-07 05:09:16,085][117718] Fps is (10 sec: 13209.5, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 83984384. Throughput: 0: 13138.6. Samples: 83963559. Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) [2023-03-07 05:09:16,097][117718] Avg episode reward: [(0, '2645.399')] [2023-03-07 05:09:16,390][118044] Updated weights for policy 0, policy_version 82020 (0.0006) [2023-03-07 05:09:17,161][118044] Updated weights for policy 0, policy_version 82030 (0.0006) [2023-03-07 05:09:17,925][118044] Updated weights for policy 0, policy_version 82040 (0.0006) [2023-03-07 05:09:18,713][118044] Updated weights for policy 0, policy_version 82050 (0.0006) [2023-03-07 05:09:19,473][118044] Updated weights for policy 0, policy_version 82060 (0.0006) [2023-03-07 05:09:20,248][118044] Updated weights for policy 0, policy_version 82070 (0.0006) [2023-03-07 05:09:21,002][118044] Updated weights for policy 0, policy_version 82080 (0.0006) [2023-03-07 05:09:21,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 84049920. Throughput: 0: 13153.9. Samples: 84042904. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) [2023-03-07 05:09:21,096][117718] Avg episode reward: [(0, '2571.514')] [2023-03-07 05:09:21,797][118044] Updated weights for policy 0, policy_version 82090 (0.0006) [2023-03-07 05:09:22,566][118044] Updated weights for policy 0, policy_version 82100 (0.0006) [2023-03-07 05:09:23,351][118044] Updated weights for policy 0, policy_version 82110 (0.0007) [2023-03-07 05:09:24,127][118044] Updated weights for policy 0, policy_version 82120 (0.0006) [2023-03-07 05:09:24,907][118044] Updated weights for policy 0, policy_version 82130 (0.0007) [2023-03-07 05:09:25,693][118044] Updated weights for policy 0, policy_version 82140 (0.0006) [2023-03-07 05:09:26,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13138.4). Total num frames: 84116480. Throughput: 0: 13156.4. Samples: 84082528. Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) [2023-03-07 05:09:26,096][117718] Avg episode reward: [(0, '2632.877')] [2023-03-07 05:09:26,470][118044] Updated weights for policy 0, policy_version 82150 (0.0007) [2023-03-07 05:09:27,244][118044] Updated weights for policy 0, policy_version 82160 (0.0008) [2023-03-07 05:09:28,017][118044] Updated weights for policy 0, policy_version 82170 (0.0007) [2023-03-07 05:09:28,789][118044] Updated weights for policy 0, policy_version 82180 (0.0006) [2023-03-07 05:09:29,565][118044] Updated weights for policy 0, policy_version 82190 (0.0005) [2023-03-07 05:09:30,362][118044] Updated weights for policy 0, policy_version 82200 (0.0006) [2023-03-07 05:09:31,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13138.4). Total num frames: 84182016. Throughput: 0: 13159.3. Samples: 84161477. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 05:09:31,096][117718] Avg episode reward: [(0, '2566.810')] [2023-03-07 05:09:31,117][118044] Updated weights for policy 0, policy_version 82210 (0.0006) [2023-03-07 05:09:31,909][118044] Updated weights for policy 0, policy_version 82220 (0.0006) [2023-03-07 05:09:32,693][118044] Updated weights for policy 0, policy_version 82230 (0.0006) [2023-03-07 05:09:33,451][118044] Updated weights for policy 0, policy_version 82240 (0.0006) [2023-03-07 05:09:34,250][118044] Updated weights for policy 0, policy_version 82250 (0.0005) [2023-03-07 05:09:35,041][118044] Updated weights for policy 0, policy_version 82260 (0.0006) [2023-03-07 05:09:35,826][118044] Updated weights for policy 0, policy_version 82270 (0.0007) [2023-03-07 05:09:36,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13138.4). Total num frames: 84247552. Throughput: 0: 13148.9. Samples: 84240193. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 05:09:36,096][117718] Avg episode reward: [(0, '2708.416')] [2023-03-07 05:09:36,609][118044] Updated weights for policy 0, policy_version 82280 (0.0007) [2023-03-07 05:09:37,373][118044] Updated weights for policy 0, policy_version 82290 (0.0006) [2023-03-07 05:09:38,165][118044] Updated weights for policy 0, policy_version 82300 (0.0006) [2023-03-07 05:09:38,945][118044] Updated weights for policy 0, policy_version 82310 (0.0006) [2023-03-07 05:09:39,716][118044] Updated weights for policy 0, policy_version 82320 (0.0006) [2023-03-07 05:09:40,499][118044] Updated weights for policy 0, policy_version 82330 (0.0006) [2023-03-07 05:09:41,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13138.4). Total num frames: 84313088. Throughput: 0: 13155.5. Samples: 84279536. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 05:09:41,086][117718] Avg episode reward: [(0, '2727.194')] [2023-03-07 05:09:41,266][118044] Updated weights for policy 0, policy_version 82340 (0.0006) [2023-03-07 05:09:42,055][118044] Updated weights for policy 0, policy_version 82350 (0.0006) [2023-03-07 05:09:42,840][118044] Updated weights for policy 0, policy_version 82360 (0.0007) [2023-03-07 05:09:43,610][118044] Updated weights for policy 0, policy_version 82370 (0.0006) [2023-03-07 05:09:44,380][118044] Updated weights for policy 0, policy_version 82380 (0.0006) [2023-03-07 05:09:45,155][118044] Updated weights for policy 0, policy_version 82390 (0.0006) [2023-03-07 05:09:45,950][118044] Updated weights for policy 0, policy_version 82400 (0.0006) [2023-03-07 05:09:46,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13141.3, 300 sec: 13138.4). Total num frames: 84378624. Throughput: 0: 13159.3. Samples: 84358425. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 05:09:46,086][117718] Avg episode reward: [(0, '2646.527')] [2023-03-07 05:09:46,703][118044] Updated weights for policy 0, policy_version 82410 (0.0006) [2023-03-07 05:09:47,492][118044] Updated weights for policy 0, policy_version 82420 (0.0006) [2023-03-07 05:09:48,270][118044] Updated weights for policy 0, policy_version 82430 (0.0006) [2023-03-07 05:09:49,055][118044] Updated weights for policy 0, policy_version 82440 (0.0006) [2023-03-07 05:09:49,828][118044] Updated weights for policy 0, policy_version 82450 (0.0006) [2023-03-07 05:09:50,590][118044] Updated weights for policy 0, policy_version 82460 (0.0006) [2023-03-07 05:09:51,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13141.9). Total num frames: 84445184. Throughput: 0: 13169.7. Samples: 84437568. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 05:09:51,086][117718] Avg episode reward: [(0, '2632.962')] [2023-03-07 05:09:51,379][118044] Updated weights for policy 0, policy_version 82470 (0.0006) [2023-03-07 05:09:52,150][118044] Updated weights for policy 0, policy_version 82480 (0.0006) [2023-03-07 05:09:52,917][118044] Updated weights for policy 0, policy_version 82490 (0.0007) [2023-03-07 05:09:53,693][118044] Updated weights for policy 0, policy_version 82500 (0.0006) [2023-03-07 05:09:54,467][118044] Updated weights for policy 0, policy_version 82510 (0.0006) [2023-03-07 05:09:55,243][118044] Updated weights for policy 0, policy_version 82520 (0.0006) [2023-03-07 05:09:56,034][118044] Updated weights for policy 0, policy_version 82530 (0.0006) [2023-03-07 05:09:56,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13141.9). Total num frames: 84510720. Throughput: 0: 13173.9. Samples: 84477162. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 05:09:56,086][117718] Avg episode reward: [(0, '2747.866')] [2023-03-07 05:09:56,792][118044] Updated weights for policy 0, policy_version 82540 (0.0007) [2023-03-07 05:09:57,561][118044] Updated weights for policy 0, policy_version 82550 (0.0006) [2023-03-07 05:09:58,351][118044] Updated weights for policy 0, policy_version 82560 (0.0006) [2023-03-07 05:09:59,146][118044] Updated weights for policy 0, policy_version 82570 (0.0006) [2023-03-07 05:09:59,922][118044] Updated weights for policy 0, policy_version 82580 (0.0006) [2023-03-07 05:10:00,698][118044] Updated weights for policy 0, policy_version 82590 (0.0007) [2023-03-07 05:10:01,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13158.4, 300 sec: 13141.9). Total num frames: 84576256. Throughput: 0: 13168.3. Samples: 84556131. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 05:10:01,086][117718] Avg episode reward: [(0, '2663.152')] [2023-03-07 05:10:01,479][118044] Updated weights for policy 0, policy_version 82600 (0.0005) [2023-03-07 05:10:02,244][118044] Updated weights for policy 0, policy_version 82610 (0.0007) [2023-03-07 05:10:03,025][118044] Updated weights for policy 0, policy_version 82620 (0.0006) [2023-03-07 05:10:03,795][118044] Updated weights for policy 0, policy_version 82630 (0.0006) [2023-03-07 05:10:04,575][118044] Updated weights for policy 0, policy_version 82640 (0.0006) [2023-03-07 05:10:05,341][118044] Updated weights for policy 0, policy_version 82650 (0.0006) [2023-03-07 05:10:06,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13145.4). Total num frames: 84642816. Throughput: 0: 13165.7. Samples: 84635361. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 05:10:06,086][117718] Avg episode reward: [(0, '2660.001')] [2023-03-07 05:10:06,117][118044] Updated weights for policy 0, policy_version 82660 (0.0006) [2023-03-07 05:10:06,892][118044] Updated weights for policy 0, policy_version 82670 (0.0006) [2023-03-07 05:10:07,677][118044] Updated weights for policy 0, policy_version 82680 (0.0005) [2023-03-07 05:10:08,458][118044] Updated weights for policy 0, policy_version 82690 (0.0006) [2023-03-07 05:10:09,229][118044] Updated weights for policy 0, policy_version 82700 (0.0006) [2023-03-07 05:10:10,023][118044] Updated weights for policy 0, policy_version 82710 (0.0007) [2023-03-07 05:10:10,790][118044] Updated weights for policy 0, policy_version 82720 (0.0006) [2023-03-07 05:10:11,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13145.4). Total num frames: 84708352. Throughput: 0: 13165.2. Samples: 84674961. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 05:10:11,086][117718] Avg episode reward: [(0, '2629.953')] [2023-03-07 05:10:11,554][118044] Updated weights for policy 0, policy_version 82730 (0.0007) [2023-03-07 05:10:12,346][118044] Updated weights for policy 0, policy_version 82740 (0.0006) [2023-03-07 05:10:13,126][118044] Updated weights for policy 0, policy_version 82750 (0.0007) [2023-03-07 05:10:13,910][118044] Updated weights for policy 0, policy_version 82760 (0.0005) [2023-03-07 05:10:14,699][118044] Updated weights for policy 0, policy_version 82770 (0.0006) [2023-03-07 05:10:15,464][118044] Updated weights for policy 0, policy_version 82780 (0.0007) [2023-03-07 05:10:16,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13175.5, 300 sec: 13148.8). Total num frames: 84774912. Throughput: 0: 13163.3. Samples: 84753826. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 05:10:16,086][117718] Avg episode reward: [(0, '2690.126')] [2023-03-07 05:10:16,242][118044] Updated weights for policy 0, policy_version 82790 (0.0006) [2023-03-07 05:10:17,028][118044] Updated weights for policy 0, policy_version 82800 (0.0006) [2023-03-07 05:10:17,828][118044] Updated weights for policy 0, policy_version 82810 (0.0005) [2023-03-07 05:10:18,590][118044] Updated weights for policy 0, policy_version 82820 (0.0006) [2023-03-07 05:10:19,365][118044] Updated weights for policy 0, policy_version 82830 (0.0006) [2023-03-07 05:10:20,157][118044] Updated weights for policy 0, policy_version 82840 (0.0007) [2023-03-07 05:10:20,939][118044] Updated weights for policy 0, policy_version 82850 (0.0006) [2023-03-07 05:10:21,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 84839424. Throughput: 0: 13163.9. Samples: 84832570. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:10:21,086][117718] Avg episode reward: [(0, '2702.942')] [2023-03-07 05:10:21,728][118044] Updated weights for policy 0, policy_version 82860 (0.0006) [2023-03-07 05:10:22,509][118044] Updated weights for policy 0, policy_version 82870 (0.0006) [2023-03-07 05:10:23,295][118044] Updated weights for policy 0, policy_version 82880 (0.0006) [2023-03-07 05:10:24,076][118044] Updated weights for policy 0, policy_version 82890 (0.0006) [2023-03-07 05:10:24,844][118044] Updated weights for policy 0, policy_version 82900 (0.0006) [2023-03-07 05:10:25,627][118044] Updated weights for policy 0, policy_version 82910 (0.0006) [2023-03-07 05:10:26,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 84905984. Throughput: 0: 13159.6. Samples: 84871717. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:10:26,086][117718] Avg episode reward: [(0, '2611.479')] [2023-03-07 05:10:26,397][118044] Updated weights for policy 0, policy_version 82920 (0.0006) [2023-03-07 05:10:27,180][118044] Updated weights for policy 0, policy_version 82930 (0.0006) [2023-03-07 05:10:27,972][118044] Updated weights for policy 0, policy_version 82940 (0.0006) [2023-03-07 05:10:28,738][118044] Updated weights for policy 0, policy_version 82950 (0.0005) [2023-03-07 05:10:29,521][118044] Updated weights for policy 0, policy_version 82960 (0.0007) [2023-03-07 05:10:30,309][118044] Updated weights for policy 0, policy_version 82970 (0.0007) [2023-03-07 05:10:31,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 84970496. Throughput: 0: 13159.6. Samples: 84950609. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:10:31,086][117718] Avg episode reward: [(0, '2614.157')] [2023-03-07 05:10:31,092][118044] Updated weights for policy 0, policy_version 82980 (0.0006) [2023-03-07 05:10:31,865][118044] Updated weights for policy 0, policy_version 82990 (0.0007) [2023-03-07 05:10:32,643][118044] Updated weights for policy 0, policy_version 83000 (0.0006) [2023-03-07 05:10:33,434][118044] Updated weights for policy 0, policy_version 83010 (0.0006) [2023-03-07 05:10:34,209][118044] Updated weights for policy 0, policy_version 83020 (0.0006) [2023-03-07 05:10:34,996][118044] Updated weights for policy 0, policy_version 83030 (0.0006) [2023-03-07 05:10:35,775][118044] Updated weights for policy 0, policy_version 83040 (0.0006) [2023-03-07 05:10:36,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13158.3, 300 sec: 13145.4). Total num frames: 85037056. Throughput: 0: 13147.0. Samples: 85029187. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:10:36,086][117718] Avg episode reward: [(0, '2663.186')] [2023-03-07 05:10:36,092][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000083044_85037056.pth... 
[2023-03-07 05:10:36,122][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000079962_81881088.pth [2023-03-07 05:10:36,560][118044] Updated weights for policy 0, policy_version 83050 (0.0006) [2023-03-07 05:10:37,338][118044] Updated weights for policy 0, policy_version 83060 (0.0006) [2023-03-07 05:10:38,104][118044] Updated weights for policy 0, policy_version 83070 (0.0006) [2023-03-07 05:10:38,889][118044] Updated weights for policy 0, policy_version 83080 (0.0006) [2023-03-07 05:10:39,669][118044] Updated weights for policy 0, policy_version 83090 (0.0005) [2023-03-07 05:10:40,480][118044] Updated weights for policy 0, policy_version 83100 (0.0006) [2023-03-07 05:10:41,085][117718] Fps is (10 sec: 13107.4, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 85101568. Throughput: 0: 13144.5. Samples: 85068666. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:10:41,086][117718] Avg episode reward: [(0, '2672.458')] [2023-03-07 05:10:41,261][118044] Updated weights for policy 0, policy_version 83110 (0.0005) [2023-03-07 05:10:42,046][118044] Updated weights for policy 0, policy_version 83120 (0.0006) [2023-03-07 05:10:42,819][118044] Updated weights for policy 0, policy_version 83130 (0.0006) [2023-03-07 05:10:43,617][118044] Updated weights for policy 0, policy_version 83140 (0.0006) [2023-03-07 05:10:44,382][118044] Updated weights for policy 0, policy_version 83150 (0.0006) [2023-03-07 05:10:45,172][118044] Updated weights for policy 0, policy_version 83160 (0.0006) [2023-03-07 05:10:45,952][118044] Updated weights for policy 0, policy_version 83170 (0.0006) [2023-03-07 05:10:46,085][117718] Fps is (10 sec: 13005.1, 60 sec: 13141.4, 300 sec: 13138.4). Total num frames: 85167104. Throughput: 0: 13127.0. Samples: 85146846. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:10:46,086][117718] Avg episode reward: [(0, '2729.039')] [2023-03-07 05:10:46,737][118044] Updated weights for policy 0, policy_version 83180 (0.0006) [2023-03-07 05:10:47,526][118044] Updated weights for policy 0, policy_version 83190 (0.0005) [2023-03-07 05:10:48,310][118044] Updated weights for policy 0, policy_version 83200 (0.0006) [2023-03-07 05:10:49,081][118044] Updated weights for policy 0, policy_version 83210 (0.0006) [2023-03-07 05:10:49,873][118044] Updated weights for policy 0, policy_version 83220 (0.0006) [2023-03-07 05:10:50,651][118044] Updated weights for policy 0, policy_version 83230 (0.0007) [2023-03-07 05:10:51,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13141.9). Total num frames: 85232640. Throughput: 0: 13114.8. Samples: 85225529. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:10:51,086][117718] Avg episode reward: [(0, '2809.560')] [2023-03-07 05:10:51,435][118044] Updated weights for policy 0, policy_version 83240 (0.0006) [2023-03-07 05:10:52,213][118044] Updated weights for policy 0, policy_version 83250 (0.0006) [2023-03-07 05:10:52,995][118044] Updated weights for policy 0, policy_version 83260 (0.0006) [2023-03-07 05:10:53,765][118044] Updated weights for policy 0, policy_version 83270 (0.0006) [2023-03-07 05:10:54,553][118044] Updated weights for policy 0, policy_version 83280 (0.0005) [2023-03-07 05:10:55,310][118044] Updated weights for policy 0, policy_version 83290 (0.0006) [2023-03-07 05:10:56,083][118044] Updated weights for policy 0, policy_version 83300 (0.0007) [2023-03-07 05:10:56,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 85299200. Throughput: 0: 13108.6. Samples: 85264849. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:10:56,086][117718] Avg episode reward: [(0, '2810.623')] [2023-03-07 05:10:56,888][118044] Updated weights for policy 0, policy_version 83310 (0.0006) [2023-03-07 05:10:57,660][118044] Updated weights for policy 0, policy_version 83320 (0.0006) [2023-03-07 05:10:58,432][118044] Updated weights for policy 0, policy_version 83330 (0.0006) [2023-03-07 05:10:59,216][118044] Updated weights for policy 0, policy_version 83340 (0.0006) [2023-03-07 05:11:00,007][118044] Updated weights for policy 0, policy_version 83350 (0.0006) [2023-03-07 05:11:00,781][118044] Updated weights for policy 0, policy_version 83360 (0.0005) [2023-03-07 05:11:01,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13141.4, 300 sec: 13141.9). Total num frames: 85364736. Throughput: 0: 13107.0. Samples: 85343639. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:11:01,086][117718] Avg episode reward: [(0, '2842.441')] [2023-03-07 05:11:01,567][118044] Updated weights for policy 0, policy_version 83370 (0.0007) [2023-03-07 05:11:02,348][118044] Updated weights for policy 0, policy_version 83380 (0.0006) [2023-03-07 05:11:03,134][118044] Updated weights for policy 0, policy_version 83390 (0.0005) [2023-03-07 05:11:03,927][118044] Updated weights for policy 0, policy_version 83400 (0.0006) [2023-03-07 05:11:04,698][118044] Updated weights for policy 0, policy_version 83410 (0.0006) [2023-03-07 05:11:05,481][118044] Updated weights for policy 0, policy_version 83420 (0.0006) [2023-03-07 05:11:06,086][117718] Fps is (10 sec: 13004.6, 60 sec: 13107.2, 300 sec: 13138.4). Total num frames: 85429248. Throughput: 0: 13102.4. Samples: 85422179. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:11:06,086][117718] Avg episode reward: [(0, '2707.500')] [2023-03-07 05:11:06,253][118044] Updated weights for policy 0, policy_version 83430 (0.0006) [2023-03-07 05:11:07,032][118044] Updated weights for policy 0, policy_version 83440 (0.0005) [2023-03-07 05:11:07,800][118044] Updated weights for policy 0, policy_version 83450 (0.0006) [2023-03-07 05:11:08,586][118044] Updated weights for policy 0, policy_version 83460 (0.0006) [2023-03-07 05:11:09,370][118044] Updated weights for policy 0, policy_version 83470 (0.0007) [2023-03-07 05:11:10,155][118044] Updated weights for policy 0, policy_version 83480 (0.0006) [2023-03-07 05:11:10,949][118044] Updated weights for policy 0, policy_version 83490 (0.0006) [2023-03-07 05:11:11,085][117718] Fps is (10 sec: 13004.7, 60 sec: 13107.2, 300 sec: 13138.4). Total num frames: 85494784. Throughput: 0: 13112.8. Samples: 85461793. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:11:11,086][117718] Avg episode reward: [(0, '2804.654')] [2023-03-07 05:11:11,715][118044] Updated weights for policy 0, policy_version 83500 (0.0006) [2023-03-07 05:11:12,494][118044] Updated weights for policy 0, policy_version 83510 (0.0006) [2023-03-07 05:11:13,261][118044] Updated weights for policy 0, policy_version 83520 (0.0005) [2023-03-07 05:11:14,043][118044] Updated weights for policy 0, policy_version 83530 (0.0005) [2023-03-07 05:11:14,836][118044] Updated weights for policy 0, policy_version 83540 (0.0006) [2023-03-07 05:11:15,598][118044] Updated weights for policy 0, policy_version 83550 (0.0006) [2023-03-07 05:11:16,085][117718] Fps is (10 sec: 13209.8, 60 sec: 13107.2, 300 sec: 13141.9). Total num frames: 85561344. Throughput: 0: 13109.4. Samples: 85540530. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:11:16,086][117718] Avg episode reward: [(0, '2818.598')] [2023-03-07 05:11:16,395][118044] Updated weights for policy 0, policy_version 83560 (0.0007) [2023-03-07 05:11:17,177][118044] Updated weights for policy 0, policy_version 83570 (0.0006) [2023-03-07 05:11:17,950][118044] Updated weights for policy 0, policy_version 83580 (0.0005) [2023-03-07 05:11:18,739][118044] Updated weights for policy 0, policy_version 83590 (0.0006) [2023-03-07 05:11:19,546][118044] Updated weights for policy 0, policy_version 83600 (0.0006) [2023-03-07 05:11:20,325][118044] Updated weights for policy 0, policy_version 83610 (0.0006) [2023-03-07 05:11:21,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13138.4). Total num frames: 85625856. Throughput: 0: 13104.4. Samples: 85618880. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:11:21,086][117718] Avg episode reward: [(0, '2821.438')] [2023-03-07 05:11:21,101][118044] Updated weights for policy 0, policy_version 83620 (0.0006) [2023-03-07 05:11:21,880][118044] Updated weights for policy 0, policy_version 83630 (0.0006) [2023-03-07 05:11:22,642][118044] Updated weights for policy 0, policy_version 83640 (0.0006) [2023-03-07 05:11:23,421][118044] Updated weights for policy 0, policy_version 83650 (0.0006) [2023-03-07 05:11:24,201][118044] Updated weights for policy 0, policy_version 83660 (0.0007) [2023-03-07 05:11:24,981][118044] Updated weights for policy 0, policy_version 83670 (0.0007) [2023-03-07 05:11:25,749][118044] Updated weights for policy 0, policy_version 83680 (0.0006) [2023-03-07 05:11:26,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13141.9). Total num frames: 85692416. Throughput: 0: 13106.7. Samples: 85658468. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:11:26,086][117718] Avg episode reward: [(0, '2654.673')] [2023-03-07 05:11:26,515][118044] Updated weights for policy 0, policy_version 83690 (0.0007) [2023-03-07 05:11:27,304][118044] Updated weights for policy 0, policy_version 83700 (0.0006) [2023-03-07 05:11:28,085][118044] Updated weights for policy 0, policy_version 83710 (0.0006) [2023-03-07 05:11:28,860][118044] Updated weights for policy 0, policy_version 83720 (0.0006) [2023-03-07 05:11:29,620][118044] Updated weights for policy 0, policy_version 83730 (0.0006) [2023-03-07 05:11:30,399][118044] Updated weights for policy 0, policy_version 83740 (0.0007) [2023-03-07 05:11:31,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13124.3, 300 sec: 13141.9). Total num frames: 85757952. Throughput: 0: 13131.3. Samples: 85737753. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:11:31,086][117718] Avg episode reward: [(0, '2731.001')] [2023-03-07 05:11:31,193][118044] Updated weights for policy 0, policy_version 83750 (0.0006) [2023-03-07 05:11:31,978][118044] Updated weights for policy 0, policy_version 83760 (0.0006) [2023-03-07 05:11:32,776][118044] Updated weights for policy 0, policy_version 83770 (0.0006) [2023-03-07 05:11:33,561][118044] Updated weights for policy 0, policy_version 83780 (0.0006) [2023-03-07 05:11:34,353][118044] Updated weights for policy 0, policy_version 83790 (0.0006) [2023-03-07 05:11:35,102][118044] Updated weights for policy 0, policy_version 83800 (0.0005) [2023-03-07 05:11:35,902][118044] Updated weights for policy 0, policy_version 83810 (0.0006) [2023-03-07 05:11:36,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13107.3, 300 sec: 13141.9). Total num frames: 85823488. Throughput: 0: 13124.2. Samples: 85816118. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:11:36,086][117718] Avg episode reward: [(0, '2823.580')] [2023-03-07 05:11:36,684][118044] Updated weights for policy 0, policy_version 83820 (0.0006) [2023-03-07 05:11:37,453][118044] Updated weights for policy 0, policy_version 83830 (0.0006) [2023-03-07 05:11:38,222][118044] Updated weights for policy 0, policy_version 83840 (0.0006) [2023-03-07 05:11:39,016][118044] Updated weights for policy 0, policy_version 83850 (0.0006) [2023-03-07 05:11:39,768][118044] Updated weights for policy 0, policy_version 83860 (0.0006) [2023-03-07 05:11:40,547][118044] Updated weights for policy 0, policy_version 83870 (0.0006) [2023-03-07 05:11:41,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13141.9). Total num frames: 85889024. Throughput: 0: 13124.7. Samples: 85855460. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:11:41,086][117718] Avg episode reward: [(0, '2734.227')] [2023-03-07 05:11:41,336][118044] Updated weights for policy 0, policy_version 83880 (0.0006) [2023-03-07 05:11:42,109][118044] Updated weights for policy 0, policy_version 83890 (0.0006) [2023-03-07 05:11:42,895][118044] Updated weights for policy 0, policy_version 83900 (0.0005) [2023-03-07 05:11:43,665][118044] Updated weights for policy 0, policy_version 83910 (0.0005) [2023-03-07 05:11:44,436][118044] Updated weights for policy 0, policy_version 83920 (0.0006) [2023-03-07 05:11:45,234][118044] Updated weights for policy 0, policy_version 83930 (0.0006) [2023-03-07 05:11:45,992][118044] Updated weights for policy 0, policy_version 83940 (0.0006) [2023-03-07 05:11:46,085][117718] Fps is (10 sec: 13209.5, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 85955584. Throughput: 0: 13132.9. Samples: 85934623. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:11:46,086][117718] Avg episode reward: [(0, '2797.331')] [2023-03-07 05:11:46,777][118044] Updated weights for policy 0, policy_version 83950 (0.0007) [2023-03-07 05:11:47,562][118044] Updated weights for policy 0, policy_version 83960 (0.0006) [2023-03-07 05:11:48,348][118044] Updated weights for policy 0, policy_version 83970 (0.0007) [2023-03-07 05:11:49,130][118044] Updated weights for policy 0, policy_version 83980 (0.0006) [2023-03-07 05:11:49,907][118044] Updated weights for policy 0, policy_version 83990 (0.0007) [2023-03-07 05:11:50,701][118044] Updated weights for policy 0, policy_version 84000 (0.0006) [2023-03-07 05:11:51,085][117718] Fps is (10 sec: 13209.5, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 86021120. Throughput: 0: 13137.3. Samples: 86013357. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:11:51,086][117718] Avg episode reward: [(0, '2803.249')] [2023-03-07 05:11:51,487][118044] Updated weights for policy 0, policy_version 84010 (0.0006) [2023-03-07 05:11:52,255][118044] Updated weights for policy 0, policy_version 84020 (0.0007) [2023-03-07 05:11:53,017][118044] Updated weights for policy 0, policy_version 84030 (0.0005) [2023-03-07 05:11:53,795][118044] Updated weights for policy 0, policy_version 84040 (0.0006) [2023-03-07 05:11:54,571][118044] Updated weights for policy 0, policy_version 84050 (0.0006) [2023-03-07 05:11:55,359][118044] Updated weights for policy 0, policy_version 84060 (0.0006) [2023-03-07 05:11:56,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13141.9). Total num frames: 86086656. Throughput: 0: 13133.9. Samples: 86052820. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:11:56,086][117718] Avg episode reward: [(0, '2764.803')] [2023-03-07 05:11:56,132][118044] Updated weights for policy 0, policy_version 84070 (0.0006) [2023-03-07 05:11:56,920][118044] Updated weights for policy 0, policy_version 84080 (0.0006) [2023-03-07 05:11:57,726][118044] Updated weights for policy 0, policy_version 84090 (0.0007) [2023-03-07 05:11:58,495][118044] Updated weights for policy 0, policy_version 84100 (0.0006) [2023-03-07 05:11:59,271][118044] Updated weights for policy 0, policy_version 84110 (0.0006) [2023-03-07 05:12:00,054][118044] Updated weights for policy 0, policy_version 84120 (0.0006) [2023-03-07 05:12:00,836][118044] Updated weights for policy 0, policy_version 84130 (0.0006) [2023-03-07 05:12:01,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.2, 300 sec: 13141.9). Total num frames: 86152192. Throughput: 0: 13134.8. Samples: 86131596. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:12:01,086][117718] Avg episode reward: [(0, '2688.921')] [2023-03-07 05:12:01,590][118044] Updated weights for policy 0, policy_version 84140 (0.0006) [2023-03-07 05:12:02,381][118044] Updated weights for policy 0, policy_version 84150 (0.0007) [2023-03-07 05:12:03,159][118044] Updated weights for policy 0, policy_version 84160 (0.0006) [2023-03-07 05:12:03,937][118044] Updated weights for policy 0, policy_version 84170 (0.0006) [2023-03-07 05:12:04,714][118044] Updated weights for policy 0, policy_version 84180 (0.0006) [2023-03-07 05:12:05,485][118044] Updated weights for policy 0, policy_version 84190 (0.0006) [2023-03-07 05:12:06,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.4, 300 sec: 13141.9). Total num frames: 86217728. Throughput: 0: 13146.3. Samples: 86210464. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:12:06,086][117718] Avg episode reward: [(0, '2726.503')] [2023-03-07 05:12:06,286][118044] Updated weights for policy 0, policy_version 84200 (0.0006) [2023-03-07 05:12:07,063][118044] Updated weights for policy 0, policy_version 84210 (0.0006) [2023-03-07 05:12:07,820][118044] Updated weights for policy 0, policy_version 84220 (0.0007) [2023-03-07 05:12:08,587][118044] Updated weights for policy 0, policy_version 84230 (0.0007) [2023-03-07 05:12:09,385][118044] Updated weights for policy 0, policy_version 84240 (0.0006) [2023-03-07 05:12:10,154][118044] Updated weights for policy 0, policy_version 84250 (0.0006) [2023-03-07 05:12:10,940][118044] Updated weights for policy 0, policy_version 84260 (0.0006) [2023-03-07 05:12:11,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 86283264. Throughput: 0: 13142.3. Samples: 86249873. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:12:11,086][117718] Avg episode reward: [(0, '2718.549')] [2023-03-07 05:12:11,735][118044] Updated weights for policy 0, policy_version 84270 (0.0006) [2023-03-07 05:12:12,526][118044] Updated weights for policy 0, policy_version 84280 (0.0006) [2023-03-07 05:12:13,315][118044] Updated weights for policy 0, policy_version 84290 (0.0006) [2023-03-07 05:12:14,094][118044] Updated weights for policy 0, policy_version 84300 (0.0006) [2023-03-07 05:12:14,870][118044] Updated weights for policy 0, policy_version 84310 (0.0006) [2023-03-07 05:12:15,639][118044] Updated weights for policy 0, policy_version 84320 (0.0006) [2023-03-07 05:12:16,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13141.9). Total num frames: 86348800. Throughput: 0: 13130.4. Samples: 86328622. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:12:16,086][117718] Avg episode reward: [(0, '2780.766')] [2023-03-07 05:12:16,412][118044] Updated weights for policy 0, policy_version 84330 (0.0006) [2023-03-07 05:12:17,187][118044] Updated weights for policy 0, policy_version 84340 (0.0007) [2023-03-07 05:12:17,960][118044] Updated weights for policy 0, policy_version 84350 (0.0005) [2023-03-07 05:12:18,733][118044] Updated weights for policy 0, policy_version 84360 (0.0006) [2023-03-07 05:12:19,526][118044] Updated weights for policy 0, policy_version 84370 (0.0006) [2023-03-07 05:12:20,331][118044] Updated weights for policy 0, policy_version 84380 (0.0006) [2023-03-07 05:12:21,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 86414336. Throughput: 0: 13139.0. Samples: 86407374. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:12:21,086][117718] Avg episode reward: [(0, '2710.902')] [2023-03-07 05:12:21,114][118044] Updated weights for policy 0, policy_version 84390 (0.0006) [2023-03-07 05:12:21,894][118044] Updated weights for policy 0, policy_version 84400 (0.0007) [2023-03-07 05:12:22,661][118044] Updated weights for policy 0, policy_version 84410 (0.0006) [2023-03-07 05:12:23,421][118044] Updated weights for policy 0, policy_version 84420 (0.0006) [2023-03-07 05:12:24,198][118044] Updated weights for policy 0, policy_version 84430 (0.0006) [2023-03-07 05:12:24,981][118044] Updated weights for policy 0, policy_version 84440 (0.0006) [2023-03-07 05:12:25,760][118044] Updated weights for policy 0, policy_version 84450 (0.0006) [2023-03-07 05:12:26,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 86480896. Throughput: 0: 13139.6. Samples: 86446742. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:12:26,086][117718] Avg episode reward: [(0, '2816.461')] [2023-03-07 05:12:26,545][118044] Updated weights for policy 0, policy_version 84460 (0.0007) [2023-03-07 05:12:27,318][118044] Updated weights for policy 0, policy_version 84470 (0.0006) [2023-03-07 05:12:28,113][118044] Updated weights for policy 0, policy_version 84480 (0.0006) [2023-03-07 05:12:28,870][118044] Updated weights for policy 0, policy_version 84490 (0.0006) [2023-03-07 05:12:29,674][118044] Updated weights for policy 0, policy_version 84500 (0.0007) [2023-03-07 05:12:30,450][118044] Updated weights for policy 0, policy_version 84510 (0.0006) [2023-03-07 05:12:31,086][117718] Fps is (10 sec: 13209.4, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 86546432. Throughput: 0: 13136.3. Samples: 86525758. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:12:31,086][117718] Avg episode reward: [(0, '2707.798')] [2023-03-07 05:12:31,230][118044] Updated weights for policy 0, policy_version 84520 (0.0006) [2023-03-07 05:12:32,016][118044] Updated weights for policy 0, policy_version 84530 (0.0008) [2023-03-07 05:12:32,798][118044] Updated weights for policy 0, policy_version 84540 (0.0006) [2023-03-07 05:12:33,571][118044] Updated weights for policy 0, policy_version 84550 (0.0005) [2023-03-07 05:12:34,344][118044] Updated weights for policy 0, policy_version 84560 (0.0006) [2023-03-07 05:12:35,141][118044] Updated weights for policy 0, policy_version 84570 (0.0006) [2023-03-07 05:12:35,917][118044] Updated weights for policy 0, policy_version 84580 (0.0007) [2023-03-07 05:12:36,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 86611968. Throughput: 0: 13132.9. Samples: 86604338. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:12:36,086][117718] Avg episode reward: [(0, '2785.047')] [2023-03-07 05:12:36,091][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000084582_86611968.pth... [2023-03-07 05:12:36,122][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000081503_83459072.pth [2023-03-07 05:12:36,698][118044] Updated weights for policy 0, policy_version 84590 (0.0006) [2023-03-07 05:12:37,468][118044] Updated weights for policy 0, policy_version 84600 (0.0006) [2023-03-07 05:12:38,245][118044] Updated weights for policy 0, policy_version 84610 (0.0006) [2023-03-07 05:12:39,018][118044] Updated weights for policy 0, policy_version 84620 (0.0006) [2023-03-07 05:12:39,786][118044] Updated weights for policy 0, policy_version 84630 (0.0006) [2023-03-07 05:12:40,566][118044] Updated weights for policy 0, policy_version 84640 (0.0006) [2023-03-07 05:12:41,085][117718] Fps is (10 sec: 13107.4, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 86677504. Throughput: 0: 13133.7. Samples: 86643838. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:12:41,086][117718] Avg episode reward: [(0, '2725.609')] [2023-03-07 05:12:41,342][118044] Updated weights for policy 0, policy_version 84650 (0.0006) [2023-03-07 05:12:42,118][118044] Updated weights for policy 0, policy_version 84660 (0.0005) [2023-03-07 05:12:42,893][118044] Updated weights for policy 0, policy_version 84670 (0.0006) [2023-03-07 05:12:43,677][118044] Updated weights for policy 0, policy_version 84680 (0.0006) [2023-03-07 05:12:44,440][118044] Updated weights for policy 0, policy_version 84690 (0.0005) [2023-03-07 05:12:45,228][118044] Updated weights for policy 0, policy_version 84700 (0.0006) [2023-03-07 05:12:46,017][118044] Updated weights for policy 0, policy_version 84710 (0.0006) [2023-03-07 05:12:46,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13141.9). Total num frames: 86743040. Throughput: 0: 13144.1. Samples: 86723078. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:12:46,086][117718] Avg episode reward: [(0, '2661.618')] [2023-03-07 05:12:46,778][118044] Updated weights for policy 0, policy_version 84720 (0.0006) [2023-03-07 05:12:47,554][118044] Updated weights for policy 0, policy_version 84730 (0.0006) [2023-03-07 05:12:48,326][118044] Updated weights for policy 0, policy_version 84740 (0.0006) [2023-03-07 05:12:49,099][118044] Updated weights for policy 0, policy_version 84750 (0.0006) [2023-03-07 05:12:49,866][118044] Updated weights for policy 0, policy_version 84760 (0.0006) [2023-03-07 05:12:50,648][118044] Updated weights for policy 0, policy_version 84770 (0.0006) [2023-03-07 05:12:51,085][117718] Fps is (10 sec: 13209.5, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 86809600. Throughput: 0: 13152.3. Samples: 86802316. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:12:51,086][117718] Avg episode reward: [(0, '2643.994')] [2023-03-07 05:12:51,437][118044] Updated weights for policy 0, policy_version 84780 (0.0006) [2023-03-07 05:12:52,215][118044] Updated weights for policy 0, policy_version 84790 (0.0005) [2023-03-07 05:12:53,009][118044] Updated weights for policy 0, policy_version 84800 (0.0006) [2023-03-07 05:12:53,785][118044] Updated weights for policy 0, policy_version 84810 (0.0006) [2023-03-07 05:12:54,553][118044] Updated weights for policy 0, policy_version 84820 (0.0006) [2023-03-07 05:12:55,337][118044] Updated weights for policy 0, policy_version 84830 (0.0006) [2023-03-07 05:12:56,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 86875136. Throughput: 0: 13149.1. Samples: 86841583. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:12:56,086][117718] Avg episode reward: [(0, '2630.176')] [2023-03-07 05:12:56,121][118044] Updated weights for policy 0, policy_version 84840 (0.0006) [2023-03-07 05:12:56,885][118044] Updated weights for policy 0, policy_version 84850 (0.0005) [2023-03-07 05:12:57,661][118044] Updated weights for policy 0, policy_version 84860 (0.0006) [2023-03-07 05:12:58,428][118044] Updated weights for policy 0, policy_version 84870 (0.0007) [2023-03-07 05:12:59,196][118044] Updated weights for policy 0, policy_version 84880 (0.0006) [2023-03-07 05:12:59,977][118044] Updated weights for policy 0, policy_version 84890 (0.0006) [2023-03-07 05:13:00,765][118044] Updated weights for policy 0, policy_version 84900 (0.0006) [2023-03-07 05:13:01,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13141.9). Total num frames: 86941696. Throughput: 0: 13158.8. Samples: 86920770. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:13:01,086][117718] Avg episode reward: [(0, '2613.809')] [2023-03-07 05:13:01,538][118044] Updated weights for policy 0, policy_version 84910 (0.0006) [2023-03-07 05:13:02,322][118044] Updated weights for policy 0, policy_version 84920 (0.0006) [2023-03-07 05:13:03,123][118044] Updated weights for policy 0, policy_version 84930 (0.0006) [2023-03-07 05:13:03,898][118044] Updated weights for policy 0, policy_version 84940 (0.0006) [2023-03-07 05:13:04,687][118044] Updated weights for policy 0, policy_version 84950 (0.0006) [2023-03-07 05:13:05,465][118044] Updated weights for policy 0, policy_version 84960 (0.0006) [2023-03-07 05:13:06,086][117718] Fps is (10 sec: 13106.9, 60 sec: 13141.3, 300 sec: 13138.4). Total num frames: 87006208. Throughput: 0: 13153.8. Samples: 86999299. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:13:06,086][117718] Avg episode reward: [(0, '2611.075')] [2023-03-07 05:13:06,248][118044] Updated weights for policy 0, policy_version 84970 (0.0006) [2023-03-07 05:13:07,027][118044] Updated weights for policy 0, policy_version 84980 (0.0007) [2023-03-07 05:13:07,800][118044] Updated weights for policy 0, policy_version 84990 (0.0006) [2023-03-07 05:13:08,585][118044] Updated weights for policy 0, policy_version 85000 (0.0006) [2023-03-07 05:13:09,349][118044] Updated weights for policy 0, policy_version 85010 (0.0006) [2023-03-07 05:13:10,145][118044] Updated weights for policy 0, policy_version 85020 (0.0006) [2023-03-07 05:13:10,921][118044] Updated weights for policy 0, policy_version 85030 (0.0006) [2023-03-07 05:13:11,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13141.9). Total num frames: 87072768. Throughput: 0: 13153.0. Samples: 87038629. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:13:11,086][117718] Avg episode reward: [(0, '2736.953')] [2023-03-07 05:13:11,695][118044] Updated weights for policy 0, policy_version 85040 (0.0006) [2023-03-07 05:13:12,495][118044] Updated weights for policy 0, policy_version 85050 (0.0006) [2023-03-07 05:13:13,275][118044] Updated weights for policy 0, policy_version 85060 (0.0006) [2023-03-07 05:13:14,037][118044] Updated weights for policy 0, policy_version 85070 (0.0006) [2023-03-07 05:13:14,802][118044] Updated weights for policy 0, policy_version 85080 (0.0006) [2023-03-07 05:13:15,577][118044] Updated weights for policy 0, policy_version 85090 (0.0006) [2023-03-07 05:13:16,085][117718] Fps is (10 sec: 13209.8, 60 sec: 13158.4, 300 sec: 13141.9). Total num frames: 87138304. Throughput: 0: 13157.2. Samples: 87117831. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:13:16,086][117718] Avg episode reward: [(0, '2620.413')] [2023-03-07 05:13:16,368][118044] Updated weights for policy 0, policy_version 85100 (0.0006) [2023-03-07 05:13:17,132][118044] Updated weights for policy 0, policy_version 85110 (0.0006) [2023-03-07 05:13:17,608][117993] KL-divergence is very high: 153.5062 [2023-03-07 05:13:17,919][118044] Updated weights for policy 0, policy_version 85120 (0.0006) [2023-03-07 05:13:18,705][118044] Updated weights for policy 0, policy_version 85130 (0.0007) [2023-03-07 05:13:19,491][118044] Updated weights for policy 0, policy_version 85140 (0.0006) [2023-03-07 05:13:20,259][118044] Updated weights for policy 0, policy_version 85150 (0.0006) [2023-03-07 05:13:21,052][118044] Updated weights for policy 0, policy_version 85160 (0.0006) [2023-03-07 05:13:21,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13141.9). Total num frames: 87203840. Throughput: 0: 13164.1. Samples: 87196724. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:13:21,086][117718] Avg episode reward: [(0, '2712.583')] [2023-03-07 05:13:21,809][118044] Updated weights for policy 0, policy_version 85170 (0.0006) [2023-03-07 05:13:22,582][118044] Updated weights for policy 0, policy_version 85180 (0.0006) [2023-03-07 05:13:22,969][117993] KL-divergence is very high: 111.4939 [2023-03-07 05:13:23,356][118044] Updated weights for policy 0, policy_version 85190 (0.0006) [2023-03-07 05:13:24,130][118044] Updated weights for policy 0, policy_version 85200 (0.0006) [2023-03-07 05:13:24,924][118044] Updated weights for policy 0, policy_version 85210 (0.0005) [2023-03-07 05:13:25,692][118044] Updated weights for policy 0, policy_version 85220 (0.0006) [2023-03-07 05:13:26,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 87270400. Throughput: 0: 13164.4. Samples: 87236239. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:13:26,086][117718] Avg episode reward: [(0, '2603.932')] [2023-03-07 05:13:26,463][118044] Updated weights for policy 0, policy_version 85230 (0.0005) [2023-03-07 05:13:27,245][118044] Updated weights for policy 0, policy_version 85240 (0.0006) [2023-03-07 05:13:28,013][118044] Updated weights for policy 0, policy_version 85250 (0.0006) [2023-03-07 05:13:28,805][118044] Updated weights for policy 0, policy_version 85260 (0.0006) [2023-03-07 05:13:29,579][118044] Updated weights for policy 0, policy_version 85270 (0.0006) [2023-03-07 05:13:30,369][118044] Updated weights for policy 0, policy_version 85280 (0.0006) [2023-03-07 05:13:31,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 87335936. Throughput: 0: 13161.1. Samples: 87315328. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:13:31,086][117718] Avg episode reward: [(0, '2665.861')] [2023-03-07 05:13:31,137][118044] Updated weights for policy 0, policy_version 85290 (0.0006) [2023-03-07 05:13:31,909][118044] Updated weights for policy 0, policy_version 85300 (0.0006) [2023-03-07 05:13:32,686][118044] Updated weights for policy 0, policy_version 85310 (0.0006) [2023-03-07 05:13:33,474][118044] Updated weights for policy 0, policy_version 85320 (0.0006) [2023-03-07 05:13:34,252][118044] Updated weights for policy 0, policy_version 85330 (0.0006) [2023-03-07 05:13:35,019][118044] Updated weights for policy 0, policy_version 85340 (0.0006) [2023-03-07 05:13:35,793][118044] Updated weights for policy 0, policy_version 85350 (0.0006) [2023-03-07 05:13:36,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 87401472. Throughput: 0: 13157.6. Samples: 87394411. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:13:36,086][117718] Avg episode reward: [(0, '2648.129')] [2023-03-07 05:13:36,584][118044] Updated weights for policy 0, policy_version 85360 (0.0006) [2023-03-07 05:13:37,360][118044] Updated weights for policy 0, policy_version 85370 (0.0007) [2023-03-07 05:13:38,140][118044] Updated weights for policy 0, policy_version 85380 (0.0006) [2023-03-07 05:13:38,917][118044] Updated weights for policy 0, policy_version 85390 (0.0006) [2023-03-07 05:13:39,700][118044] Updated weights for policy 0, policy_version 85400 (0.0006) [2023-03-07 05:13:40,462][118044] Updated weights for policy 0, policy_version 85410 (0.0006) [2023-03-07 05:13:41,086][117718] Fps is (10 sec: 13209.4, 60 sec: 13175.4, 300 sec: 13145.4). Total num frames: 87468032. Throughput: 0: 13160.7. Samples: 87433818. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:13:41,086][117718] Avg episode reward: [(0, '2647.202')] [2023-03-07 05:13:41,234][118044] Updated weights for policy 0, policy_version 85420 (0.0006) [2023-03-07 05:13:42,018][118044] Updated weights for policy 0, policy_version 85430 (0.0006) [2023-03-07 05:13:42,782][118044] Updated weights for policy 0, policy_version 85440 (0.0006) [2023-03-07 05:13:43,563][118044] Updated weights for policy 0, policy_version 85450 (0.0006) [2023-03-07 05:13:44,367][118044] Updated weights for policy 0, policy_version 85460 (0.0006) [2023-03-07 05:13:45,140][118044] Updated weights for policy 0, policy_version 85470 (0.0006) [2023-03-07 05:13:45,932][118044] Updated weights for policy 0, policy_version 85480 (0.0006) [2023-03-07 05:13:46,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13145.4). Total num frames: 87533568. Throughput: 0: 13155.6. Samples: 87512774. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:13:46,086][117718] Avg episode reward: [(0, '2583.873')] [2023-03-07 05:13:46,710][118044] Updated weights for policy 0, policy_version 85490 (0.0005) [2023-03-07 05:13:47,473][118044] Updated weights for policy 0, policy_version 85500 (0.0006) [2023-03-07 05:13:48,265][118044] Updated weights for policy 0, policy_version 85510 (0.0006) [2023-03-07 05:13:49,037][118044] Updated weights for policy 0, policy_version 85520 (0.0006) [2023-03-07 05:13:49,810][118044] Updated weights for policy 0, policy_version 85530 (0.0006) [2023-03-07 05:13:50,578][118044] Updated weights for policy 0, policy_version 85540 (0.0006) [2023-03-07 05:13:51,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 87599104. Throughput: 0: 13164.2. Samples: 87591684. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:13:51,086][117718] Avg episode reward: [(0, '2663.979')] [2023-03-07 05:13:51,355][118044] Updated weights for policy 0, policy_version 85550 (0.0006) [2023-03-07 05:13:52,134][118044] Updated weights for policy 0, policy_version 85560 (0.0006) [2023-03-07 05:13:52,920][118044] Updated weights for policy 0, policy_version 85570 (0.0008) [2023-03-07 05:13:53,682][118044] Updated weights for policy 0, policy_version 85580 (0.0006) [2023-03-07 05:13:54,474][118044] Updated weights for policy 0, policy_version 85590 (0.0006) [2023-03-07 05:13:55,249][118044] Updated weights for policy 0, policy_version 85600 (0.0006) [2023-03-07 05:13:56,043][118044] Updated weights for policy 0, policy_version 85610 (0.0006) [2023-03-07 05:13:56,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 87664640. Throughput: 0: 13171.7. Samples: 87631353. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:13:56,086][117718] Avg episode reward: [(0, '2745.018')] [2023-03-07 05:13:56,823][118044] Updated weights for policy 0, policy_version 85620 (0.0007) [2023-03-07 05:13:57,613][118044] Updated weights for policy 0, policy_version 85630 (0.0007) [2023-03-07 05:13:58,377][118044] Updated weights for policy 0, policy_version 85640 (0.0006) [2023-03-07 05:13:59,153][118044] Updated weights for policy 0, policy_version 85650 (0.0006) [2023-03-07 05:13:59,923][118044] Updated weights for policy 0, policy_version 85660 (0.0006) [2023-03-07 05:14:00,703][118044] Updated weights for policy 0, policy_version 85670 (0.0006) [2023-03-07 05:14:01,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 87730176. Throughput: 0: 13157.5. Samples: 87709918. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:14:01,086][117718] Avg episode reward: [(0, '2799.513')] [2023-03-07 05:14:01,469][118044] Updated weights for policy 0, policy_version 85680 (0.0006) [2023-03-07 05:14:02,267][118044] Updated weights for policy 0, policy_version 85690 (0.0006) [2023-03-07 05:14:03,035][118044] Updated weights for policy 0, policy_version 85700 (0.0006) [2023-03-07 05:14:03,809][118044] Updated weights for policy 0, policy_version 85710 (0.0006) [2023-03-07 05:14:04,607][118044] Updated weights for policy 0, policy_version 85720 (0.0006) [2023-03-07 05:14:05,362][118044] Updated weights for policy 0, policy_version 85730 (0.0006) [2023-03-07 05:14:06,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13148.9). Total num frames: 87796736. Throughput: 0: 13163.8. Samples: 87789092. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:14:06,086][117718] Avg episode reward: [(0, '2782.576')] [2023-03-07 05:14:06,162][118044] Updated weights for policy 0, policy_version 85740 (0.0006) [2023-03-07 05:14:06,924][118044] Updated weights for policy 0, policy_version 85750 (0.0006) [2023-03-07 05:14:07,705][118044] Updated weights for policy 0, policy_version 85760 (0.0006) [2023-03-07 05:14:08,490][118044] Updated weights for policy 0, policy_version 85770 (0.0006) [2023-03-07 05:14:09,274][118044] Updated weights for policy 0, policy_version 85780 (0.0005) [2023-03-07 05:14:10,034][118044] Updated weights for policy 0, policy_version 85790 (0.0006) [2023-03-07 05:14:10,810][118044] Updated weights for policy 0, policy_version 85800 (0.0007) [2023-03-07 05:14:11,085][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 87862272. Throughput: 0: 13160.6. Samples: 87828465. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:14:11,086][117718] Avg episode reward: [(0, '2787.296')] [2023-03-07 05:14:11,588][118044] Updated weights for policy 0, policy_version 85810 (0.0006) [2023-03-07 05:14:12,357][118044] Updated weights for policy 0, policy_version 85820 (0.0005) [2023-03-07 05:14:13,135][118044] Updated weights for policy 0, policy_version 85830 (0.0007) [2023-03-07 05:14:13,919][118044] Updated weights for policy 0, policy_version 85840 (0.0007) [2023-03-07 05:14:14,673][118044] Updated weights for policy 0, policy_version 85850 (0.0006) [2023-03-07 05:14:15,459][118044] Updated weights for policy 0, policy_version 85860 (0.0006) [2023-03-07 05:14:16,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 87927808. Throughput: 0: 13167.6. Samples: 87907871. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:14:16,086][117718] Avg episode reward: [(0, '2724.201')] [2023-03-07 05:14:16,259][118044] Updated weights for policy 0, policy_version 85870 (0.0006) [2023-03-07 05:14:17,032][118044] Updated weights for policy 0, policy_version 85880 (0.0006) [2023-03-07 05:14:17,788][118044] Updated weights for policy 0, policy_version 85890 (0.0007) [2023-03-07 05:14:18,574][118044] Updated weights for policy 0, policy_version 85900 (0.0006) [2023-03-07 05:14:19,360][118044] Updated weights for policy 0, policy_version 85910 (0.0006) [2023-03-07 05:14:20,124][118044] Updated weights for policy 0, policy_version 85920 (0.0006) [2023-03-07 05:14:20,907][118044] Updated weights for policy 0, policy_version 85930 (0.0006) [2023-03-07 05:14:21,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13145.4). Total num frames: 87994368. Throughput: 0: 13165.9. Samples: 87986876. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:14:21,086][117718] Avg episode reward: [(0, '2819.817')] [2023-03-07 05:14:21,678][118044] Updated weights for policy 0, policy_version 85940 (0.0005) [2023-03-07 05:14:22,458][118044] Updated weights for policy 0, policy_version 85950 (0.0006) [2023-03-07 05:14:23,232][118044] Updated weights for policy 0, policy_version 85960 (0.0007) [2023-03-07 05:14:23,999][118044] Updated weights for policy 0, policy_version 85970 (0.0006) [2023-03-07 05:14:24,761][118044] Updated weights for policy 0, policy_version 85980 (0.0005) [2023-03-07 05:14:25,537][118044] Updated weights for policy 0, policy_version 85990 (0.0006) [2023-03-07 05:14:26,085][117718] Fps is (10 sec: 13312.0, 60 sec: 13175.5, 300 sec: 13148.9). Total num frames: 88060928. Throughput: 0: 13169.4. Samples: 88026440. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:14:26,086][117718] Avg episode reward: [(0, '2725.687')] [2023-03-07 05:14:26,295][118044] Updated weights for policy 0, policy_version 86000 (0.0006) [2023-03-07 05:14:27,082][118044] Updated weights for policy 0, policy_version 86010 (0.0006) [2023-03-07 05:14:27,857][118044] Updated weights for policy 0, policy_version 86020 (0.0006) [2023-03-07 05:14:28,627][118044] Updated weights for policy 0, policy_version 86030 (0.0006) [2023-03-07 05:14:29,410][118044] Updated weights for policy 0, policy_version 86040 (0.0006) [2023-03-07 05:14:30,195][118044] Updated weights for policy 0, policy_version 86050 (0.0006) [2023-03-07 05:14:30,970][118044] Updated weights for policy 0, policy_version 86060 (0.0006) [2023-03-07 05:14:31,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13148.9). Total num frames: 88126464. Throughput: 0: 13181.9. Samples: 88105960. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:14:31,086][117718] Avg episode reward: [(0, '2826.983')] [2023-03-07 05:14:31,761][118044] Updated weights for policy 0, policy_version 86070 (0.0006) [2023-03-07 05:14:32,520][118044] Updated weights for policy 0, policy_version 86080 (0.0006) [2023-03-07 05:14:33,289][118044] Updated weights for policy 0, policy_version 86090 (0.0006) [2023-03-07 05:14:34,075][118044] Updated weights for policy 0, policy_version 86100 (0.0006) [2023-03-07 05:14:34,853][118044] Updated weights for policy 0, policy_version 86110 (0.0006) [2023-03-07 05:14:35,623][118044] Updated weights for policy 0, policy_version 86120 (0.0005) [2023-03-07 05:14:36,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13175.5, 300 sec: 13148.9). Total num frames: 88192000. Throughput: 0: 13185.8. Samples: 88185043. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:14:36,086][117718] Avg episode reward: [(0, '2800.689')] [2023-03-07 05:14:36,096][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000086126_88193024.pth... 
[2023-03-07 05:14:36,125][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000083044_85037056.pth [2023-03-07 05:14:36,413][118044] Updated weights for policy 0, policy_version 86130 (0.0006) [2023-03-07 05:14:37,179][118044] Updated weights for policy 0, policy_version 86140 (0.0006) [2023-03-07 05:14:37,965][118044] Updated weights for policy 0, policy_version 86150 (0.0007) [2023-03-07 05:14:38,726][118044] Updated weights for policy 0, policy_version 86160 (0.0005) [2023-03-07 05:14:39,487][118044] Updated weights for policy 0, policy_version 86170 (0.0006) [2023-03-07 05:14:40,271][118044] Updated weights for policy 0, policy_version 86180 (0.0008) [2023-03-07 05:14:41,073][118044] Updated weights for policy 0, policy_version 86190 (0.0006) [2023-03-07 05:14:41,085][117718] Fps is (10 sec: 13209.5, 60 sec: 13175.5, 300 sec: 13152.3). Total num frames: 88258560. Throughput: 0: 13182.7. Samples: 88224576. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:14:41,086][117718] Avg episode reward: [(0, '2570.314')] [2023-03-07 05:14:41,860][118044] Updated weights for policy 0, policy_version 86200 (0.0006) [2023-03-07 05:14:42,631][118044] Updated weights for policy 0, policy_version 86210 (0.0006) [2023-03-07 05:14:43,397][118044] Updated weights for policy 0, policy_version 86220 (0.0007) [2023-03-07 05:14:44,187][118044] Updated weights for policy 0, policy_version 86230 (0.0006) [2023-03-07 05:14:44,969][118044] Updated weights for policy 0, policy_version 86240 (0.0006) [2023-03-07 05:14:45,738][118044] Updated weights for policy 0, policy_version 86250 (0.0007) [2023-03-07 05:14:46,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13148.9). Total num frames: 88324096. Throughput: 0: 13189.5. Samples: 88303448. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:14:46,086][117718] Avg episode reward: [(0, '2739.330')] [2023-03-07 05:14:46,533][118044] Updated weights for policy 0, policy_version 86260 (0.0006) [2023-03-07 05:14:47,305][118044] Updated weights for policy 0, policy_version 86270 (0.0006) [2023-03-07 05:14:48,089][118044] Updated weights for policy 0, policy_version 86280 (0.0006) [2023-03-07 05:14:48,876][118044] Updated weights for policy 0, policy_version 86290 (0.0006) [2023-03-07 05:14:49,656][118044] Updated weights for policy 0, policy_version 86300 (0.0007) [2023-03-07 05:14:50,431][118044] Updated weights for policy 0, policy_version 86310 (0.0005) [2023-03-07 05:14:51,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13175.5, 300 sec: 13148.9). Total num frames: 88389632. Throughput: 0: 13179.7. Samples: 88382177. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:14:51,086][117718] Avg episode reward: [(0, '2750.234')] [2023-03-07 05:14:51,209][118044] Updated weights for policy 0, policy_version 86320 (0.0006) [2023-03-07 05:14:52,005][118044] Updated weights for policy 0, policy_version 86330 (0.0006) [2023-03-07 05:14:52,787][118044] Updated weights for policy 0, policy_version 86340 (0.0006) [2023-03-07 05:14:53,570][118044] Updated weights for policy 0, policy_version 86350 (0.0006) [2023-03-07 05:14:54,332][118044] Updated weights for policy 0, policy_version 86360 (0.0006) [2023-03-07 05:14:55,106][118044] Updated weights for policy 0, policy_version 86370 (0.0006) [2023-03-07 05:14:55,881][118044] Updated weights for policy 0, policy_version 86380 (0.0006) [2023-03-07 05:14:56,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13175.4, 300 sec: 13148.9). Total num frames: 88455168. Throughput: 0: 13181.5. Samples: 88421635. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:14:56,086][117718] Avg episode reward: [(0, '2627.810')] [2023-03-07 05:14:56,654][118044] Updated weights for policy 0, policy_version 86390 (0.0007) [2023-03-07 05:14:57,436][118044] Updated weights for policy 0, policy_version 86400 (0.0006) [2023-03-07 05:14:58,220][118044] Updated weights for policy 0, policy_version 86410 (0.0006) [2023-03-07 05:14:59,017][118044] Updated weights for policy 0, policy_version 86420 (0.0007) [2023-03-07 05:14:59,789][118044] Updated weights for policy 0, policy_version 86430 (0.0006) [2023-03-07 05:15:00,561][118044] Updated weights for policy 0, policy_version 86440 (0.0006) [2023-03-07 05:15:01,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13175.5, 300 sec: 13145.4). Total num frames: 88520704. Throughput: 0: 13168.7. Samples: 88500464. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:15:01,086][117718] Avg episode reward: [(0, '2726.354')] [2023-03-07 05:15:01,355][118044] Updated weights for policy 0, policy_version 86450 (0.0006) [2023-03-07 05:15:02,121][118044] Updated weights for policy 0, policy_version 86460 (0.0006) [2023-03-07 05:15:02,898][118044] Updated weights for policy 0, policy_version 86470 (0.0006) [2023-03-07 05:15:03,675][118044] Updated weights for policy 0, policy_version 86480 (0.0006) [2023-03-07 05:15:04,458][118044] Updated weights for policy 0, policy_version 86490 (0.0006) [2023-03-07 05:15:05,229][118044] Updated weights for policy 0, policy_version 86500 (0.0006) [2023-03-07 05:15:06,009][118044] Updated weights for policy 0, policy_version 86510 (0.0006) [2023-03-07 05:15:06,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 88586240. Throughput: 0: 13165.0. Samples: 88579301. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:15:06,086][117718] Avg episode reward: [(0, '2734.867')] [2023-03-07 05:15:06,796][118044] Updated weights for policy 0, policy_version 86520 (0.0006) [2023-03-07 05:15:07,581][118044] Updated weights for policy 0, policy_version 86530 (0.0007) [2023-03-07 05:15:08,361][118044] Updated weights for policy 0, policy_version 86540 (0.0006) [2023-03-07 05:15:09,131][118044] Updated weights for policy 0, policy_version 86550 (0.0006) [2023-03-07 05:15:09,937][118044] Updated weights for policy 0, policy_version 86560 (0.0006) [2023-03-07 05:15:10,710][118044] Updated weights for policy 0, policy_version 86570 (0.0006) [2023-03-07 05:15:11,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13158.4, 300 sec: 13141.9). Total num frames: 88651776. Throughput: 0: 13158.6. Samples: 88618577. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:15:11,086][117718] Avg episode reward: [(0, '2729.482')] [2023-03-07 05:15:11,489][118044] Updated weights for policy 0, policy_version 86580 (0.0006) [2023-03-07 05:15:12,266][118044] Updated weights for policy 0, policy_version 86590 (0.0005) [2023-03-07 05:15:13,046][118044] Updated weights for policy 0, policy_version 86600 (0.0006) [2023-03-07 05:15:13,843][118044] Updated weights for policy 0, policy_version 86610 (0.0006) [2023-03-07 05:15:14,621][118044] Updated weights for policy 0, policy_version 86620 (0.0006) [2023-03-07 05:15:15,397][118044] Updated weights for policy 0, policy_version 86630 (0.0006) [2023-03-07 05:15:16,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 88717312. Throughput: 0: 13137.2. Samples: 88697134. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:15:16,086][117718] Avg episode reward: [(0, '2681.101')] [2023-03-07 05:15:16,187][118044] Updated weights for policy 0, policy_version 86640 (0.0006) [2023-03-07 05:15:16,982][118044] Updated weights for policy 0, policy_version 86650 (0.0006) [2023-03-07 05:15:17,756][118044] Updated weights for policy 0, policy_version 86660 (0.0006) [2023-03-07 05:15:18,551][118044] Updated weights for policy 0, policy_version 86670 (0.0006) [2023-03-07 05:15:19,349][118044] Updated weights for policy 0, policy_version 86680 (0.0006) [2023-03-07 05:15:20,125][118044] Updated weights for policy 0, policy_version 86690 (0.0006) [2023-03-07 05:15:20,916][118044] Updated weights for policy 0, policy_version 86700 (0.0006) [2023-03-07 05:15:21,085][117718] Fps is (10 sec: 13107.4, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 88782848. Throughput: 0: 13117.6. Samples: 88775333. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:15:21,086][117718] Avg episode reward: [(0, '2756.982')] [2023-03-07 05:15:21,683][118044] Updated weights for policy 0, policy_version 86710 (0.0006) [2023-03-07 05:15:22,465][118044] Updated weights for policy 0, policy_version 86720 (0.0006) [2023-03-07 05:15:23,245][118044] Updated weights for policy 0, policy_version 86730 (0.0006) [2023-03-07 05:15:24,036][118044] Updated weights for policy 0, policy_version 86740 (0.0006) [2023-03-07 05:15:24,834][118044] Updated weights for policy 0, policy_version 86750 (0.0006) [2023-03-07 05:15:25,594][118044] Updated weights for policy 0, policy_version 86760 (0.0006) [2023-03-07 05:15:26,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.2, 300 sec: 13145.4). Total num frames: 88848384. Throughput: 0: 13113.1. Samples: 88814668. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:15:26,086][117718] Avg episode reward: [(0, '2652.912')] [2023-03-07 05:15:26,362][118044] Updated weights for policy 0, policy_version 86770 (0.0006) [2023-03-07 05:15:27,146][118044] Updated weights for policy 0, policy_version 86780 (0.0006) [2023-03-07 05:15:27,951][118044] Updated weights for policy 0, policy_version 86790 (0.0006) [2023-03-07 05:15:28,723][118044] Updated weights for policy 0, policy_version 86800 (0.0006) [2023-03-07 05:15:29,518][118044] Updated weights for policy 0, policy_version 86810 (0.0006) [2023-03-07 05:15:30,287][118044] Updated weights for policy 0, policy_version 86820 (0.0005) [2023-03-07 05:15:31,065][118044] Updated weights for policy 0, policy_version 86830 (0.0006) [2023-03-07 05:15:31,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13141.9). Total num frames: 88913920. Throughput: 0: 13104.4. Samples: 88893145. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:15:31,086][117718] Avg episode reward: [(0, '2705.746')] [2023-03-07 05:15:31,838][118044] Updated weights for policy 0, policy_version 86840 (0.0006) [2023-03-07 05:15:32,626][118044] Updated weights for policy 0, policy_version 86850 (0.0006) [2023-03-07 05:15:33,416][118044] Updated weights for policy 0, policy_version 86860 (0.0006) [2023-03-07 05:15:34,189][118044] Updated weights for policy 0, policy_version 86870 (0.0006) [2023-03-07 05:15:34,981][118044] Updated weights for policy 0, policy_version 86880 (0.0006) [2023-03-07 05:15:35,764][118044] Updated weights for policy 0, policy_version 86890 (0.0006) [2023-03-07 05:15:36,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13145.4). Total num frames: 88979456. Throughput: 0: 13103.7. Samples: 88971844. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:15:36,086][117718] Avg episode reward: [(0, '2774.676')] [2023-03-07 05:15:36,529][118044] Updated weights for policy 0, policy_version 86900 (0.0006) [2023-03-07 05:15:37,317][118044] Updated weights for policy 0, policy_version 86910 (0.0007) [2023-03-07 05:15:38,097][118044] Updated weights for policy 0, policy_version 86920 (0.0006) [2023-03-07 05:15:38,862][118044] Updated weights for policy 0, policy_version 86930 (0.0006) [2023-03-07 05:15:39,657][118044] Updated weights for policy 0, policy_version 86940 (0.0006) [2023-03-07 05:15:40,444][118044] Updated weights for policy 0, policy_version 86950 (0.0006) [2023-03-07 05:15:41,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13145.4). Total num frames: 89044992. Throughput: 0: 13105.6. Samples: 89011386. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:15:41,086][117718] Avg episode reward: [(0, '2746.833')] [2023-03-07 05:15:41,204][118044] Updated weights for policy 0, policy_version 86960 (0.0006) [2023-03-07 05:15:41,989][118044] Updated weights for policy 0, policy_version 86970 (0.0006) [2023-03-07 05:15:42,785][118044] Updated weights for policy 0, policy_version 86980 (0.0006) [2023-03-07 05:15:43,555][118044] Updated weights for policy 0, policy_version 86990 (0.0006) [2023-03-07 05:15:44,335][118044] Updated weights for policy 0, policy_version 87000 (0.0006) [2023-03-07 05:15:45,102][118044] Updated weights for policy 0, policy_version 87010 (0.0007) [2023-03-07 05:15:45,865][118044] Updated weights for policy 0, policy_version 87020 (0.0005) [2023-03-07 05:15:46,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13145.4). Total num frames: 89110528. Throughput: 0: 13103.1. Samples: 89090106. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:15:46,086][117718] Avg episode reward: [(0, '2782.021')] [2023-03-07 05:15:46,658][118044] Updated weights for policy 0, policy_version 87030 (0.0006) [2023-03-07 05:15:47,441][118044] Updated weights for policy 0, policy_version 87040 (0.0006) [2023-03-07 05:15:48,219][118044] Updated weights for policy 0, policy_version 87050 (0.0006) [2023-03-07 05:15:48,992][118044] Updated weights for policy 0, policy_version 87060 (0.0007) [2023-03-07 05:15:49,765][118044] Updated weights for policy 0, policy_version 87070 (0.0006) [2023-03-07 05:15:50,526][118044] Updated weights for policy 0, policy_version 87080 (0.0006) [2023-03-07 05:15:51,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13141.9). Total num frames: 89176064. Throughput: 0: 13107.6. Samples: 89169143. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:15:51,086][117718] Avg episode reward: [(0, '2741.841')] [2023-03-07 05:15:51,313][118044] Updated weights for policy 0, policy_version 87090 (0.0006) [2023-03-07 05:15:52,103][118044] Updated weights for policy 0, policy_version 87100 (0.0006) [2023-03-07 05:15:52,871][118044] Updated weights for policy 0, policy_version 87110 (0.0006) [2023-03-07 05:15:53,642][118044] Updated weights for policy 0, policy_version 87120 (0.0006) [2023-03-07 05:15:54,412][118044] Updated weights for policy 0, policy_version 87130 (0.0006) [2023-03-07 05:15:55,202][118044] Updated weights for policy 0, policy_version 87140 (0.0006) [2023-03-07 05:15:55,975][118044] Updated weights for policy 0, policy_version 87150 (0.0006) [2023-03-07 05:15:56,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13124.3, 300 sec: 13145.4). Total num frames: 89242624. Throughput: 0: 13118.1. Samples: 89208889. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:15:56,086][117718] Avg episode reward: [(0, '2715.604')] [2023-03-07 05:15:56,752][118044] Updated weights for policy 0, policy_version 87160 (0.0006) [2023-03-07 05:15:57,533][118044] Updated weights for policy 0, policy_version 87170 (0.0006) [2023-03-07 05:15:58,286][118044] Updated weights for policy 0, policy_version 87180 (0.0006) [2023-03-07 05:15:59,105][118044] Updated weights for policy 0, policy_version 87190 (0.0006) [2023-03-07 05:15:59,884][118044] Updated weights for policy 0, policy_version 87200 (0.0006) [2023-03-07 05:16:00,661][118044] Updated weights for policy 0, policy_version 87210 (0.0006) [2023-03-07 05:16:01,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13124.3, 300 sec: 13148.9). Total num frames: 89308160. Throughput: 0: 13126.8. Samples: 89287838. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:16:01,086][117718] Avg episode reward: [(0, '2795.986')] [2023-03-07 05:16:01,454][118044] Updated weights for policy 0, policy_version 87220 (0.0006) [2023-03-07 05:16:02,226][118044] Updated weights for policy 0, policy_version 87230 (0.0006) [2023-03-07 05:16:02,994][118044] Updated weights for policy 0, policy_version 87240 (0.0006) [2023-03-07 05:16:03,768][118044] Updated weights for policy 0, policy_version 87250 (0.0006) [2023-03-07 05:16:04,557][118044] Updated weights for policy 0, policy_version 87260 (0.0006) [2023-03-07 05:16:05,322][118044] Updated weights for policy 0, policy_version 87270 (0.0006) [2023-03-07 05:16:06,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13124.3, 300 sec: 13148.9). Total num frames: 89373696. Throughput: 0: 13138.6. Samples: 89366572. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:16:06,086][117718] Avg episode reward: [(0, '2786.890')] [2023-03-07 05:16:06,109][118044] Updated weights for policy 0, policy_version 87280 (0.0006) [2023-03-07 05:16:06,885][118044] Updated weights for policy 0, policy_version 87290 (0.0006) [2023-03-07 05:16:07,667][118044] Updated weights for policy 0, policy_version 87300 (0.0006) [2023-03-07 05:16:08,449][118044] Updated weights for policy 0, policy_version 87310 (0.0005) [2023-03-07 05:16:09,257][118044] Updated weights for policy 0, policy_version 87320 (0.0007) [2023-03-07 05:16:10,046][118044] Updated weights for policy 0, policy_version 87330 (0.0006) [2023-03-07 05:16:10,818][118044] Updated weights for policy 0, policy_version 87340 (0.0007) [2023-03-07 05:16:11,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13124.3, 300 sec: 13145.4). Total num frames: 89439232. Throughput: 0: 13138.5. Samples: 89405900. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:16:11,086][117718] Avg episode reward: [(0, '2800.026')] [2023-03-07 05:16:11,609][118044] Updated weights for policy 0, policy_version 87350 (0.0006) [2023-03-07 05:16:12,392][118044] Updated weights for policy 0, policy_version 87360 (0.0006) [2023-03-07 05:16:13,146][118044] Updated weights for policy 0, policy_version 87370 (0.0006) [2023-03-07 05:16:13,933][118044] Updated weights for policy 0, policy_version 87380 (0.0007) [2023-03-07 05:16:14,713][118044] Updated weights for policy 0, policy_version 87390 (0.0006) [2023-03-07 05:16:15,481][118044] Updated weights for policy 0, policy_version 87400 (0.0006) [2023-03-07 05:16:16,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13124.2, 300 sec: 13148.8). Total num frames: 89504768. Throughput: 0: 13136.0. Samples: 89484267. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:16:16,086][117718] Avg episode reward: [(0, '2733.999')] [2023-03-07 05:16:16,279][118044] Updated weights for policy 0, policy_version 87410 (0.0006) [2023-03-07 05:16:17,075][118044] Updated weights for policy 0, policy_version 87420 (0.0006) [2023-03-07 05:16:17,852][118044] Updated weights for policy 0, policy_version 87430 (0.0006) [2023-03-07 05:16:18,630][118044] Updated weights for policy 0, policy_version 87440 (0.0006) [2023-03-07 05:16:19,407][118044] Updated weights for policy 0, policy_version 87450 (0.0006) [2023-03-07 05:16:20,194][118044] Updated weights for policy 0, policy_version 87460 (0.0007) [2023-03-07 05:16:20,984][118044] Updated weights for policy 0, policy_version 87470 (0.0006) [2023-03-07 05:16:21,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.2, 300 sec: 13145.4). Total num frames: 89570304. Throughput: 0: 13133.4. Samples: 89562847. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:16:21,086][117718] Avg episode reward: [(0, '2741.384')] [2023-03-07 05:16:21,759][118044] Updated weights for policy 0, policy_version 87480 (0.0006) [2023-03-07 05:16:22,525][118044] Updated weights for policy 0, policy_version 87490 (0.0006) [2023-03-07 05:16:23,320][118044] Updated weights for policy 0, policy_version 87500 (0.0007) [2023-03-07 05:16:24,103][118044] Updated weights for policy 0, policy_version 87510 (0.0006) [2023-03-07 05:16:24,870][118044] Updated weights for policy 0, policy_version 87520 (0.0006) [2023-03-07 05:16:25,670][118044] Updated weights for policy 0, policy_version 87530 (0.0006) [2023-03-07 05:16:26,085][117718] Fps is (10 sec: 13107.4, 60 sec: 13124.3, 300 sec: 13145.4). Total num frames: 89635840. Throughput: 0: 13128.9. Samples: 89602185. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:16:26,086][117718] Avg episode reward: [(0, '2843.937')] [2023-03-07 05:16:26,450][118044] Updated weights for policy 0, policy_version 87540 (0.0006) [2023-03-07 05:16:27,215][118044] Updated weights for policy 0, policy_version 87550 (0.0006) [2023-03-07 05:16:27,989][118044] Updated weights for policy 0, policy_version 87560 (0.0006) [2023-03-07 05:16:28,768][118044] Updated weights for policy 0, policy_version 87570 (0.0007) [2023-03-07 05:16:29,565][118044] Updated weights for policy 0, policy_version 87580 (0.0006) [2023-03-07 05:16:30,323][118044] Updated weights for policy 0, policy_version 87590 (0.0006) [2023-03-07 05:16:31,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.2, 300 sec: 13145.4). Total num frames: 89701376. Throughput: 0: 13133.1. Samples: 89681097. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:16:31,086][117718] Avg episode reward: [(0, '2726.769')] [2023-03-07 05:16:31,129][118044] Updated weights for policy 0, policy_version 87600 (0.0006) [2023-03-07 05:16:31,899][118044] Updated weights for policy 0, policy_version 87610 (0.0006) [2023-03-07 05:16:32,670][118044] Updated weights for policy 0, policy_version 87620 (0.0007) [2023-03-07 05:16:33,450][118044] Updated weights for policy 0, policy_version 87630 (0.0005) [2023-03-07 05:16:34,223][118044] Updated weights for policy 0, policy_version 87640 (0.0006) [2023-03-07 05:16:35,005][118044] Updated weights for policy 0, policy_version 87650 (0.0006) [2023-03-07 05:16:35,789][118044] Updated weights for policy 0, policy_version 87660 (0.0006) [2023-03-07 05:16:36,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13124.3, 300 sec: 13145.4). Total num frames: 89766912. Throughput: 0: 13130.2. Samples: 89760004. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:16:36,086][117718] Avg episode reward: [(0, '2812.164')] [2023-03-07 05:16:36,092][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000087664_89767936.pth... [2023-03-07 05:16:36,122][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000084582_86611968.pth [2023-03-07 05:16:36,553][118044] Updated weights for policy 0, policy_version 87670 (0.0006) [2023-03-07 05:16:37,314][118044] Updated weights for policy 0, policy_version 87680 (0.0006) [2023-03-07 05:16:38,102][118044] Updated weights for policy 0, policy_version 87690 (0.0006) [2023-03-07 05:16:38,882][118044] Updated weights for policy 0, policy_version 87700 (0.0007) [2023-03-07 05:16:39,676][118044] Updated weights for policy 0, policy_version 87710 (0.0006) [2023-03-07 05:16:40,465][118044] Updated weights for policy 0, policy_version 87720 (0.0006) [2023-03-07 05:16:41,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13141.4, 300 sec: 13145.4). Total num frames: 89833472. Throughput: 0: 13128.5. Samples: 89799670. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:16:41,086][117718] Avg episode reward: [(0, '2702.478')] [2023-03-07 05:16:41,226][118044] Updated weights for policy 0, policy_version 87730 (0.0006) [2023-03-07 05:16:41,994][118044] Updated weights for policy 0, policy_version 87740 (0.0006) [2023-03-07 05:16:42,781][118044] Updated weights for policy 0, policy_version 87750 (0.0006) [2023-03-07 05:16:43,541][118044] Updated weights for policy 0, policy_version 87760 (0.0007) [2023-03-07 05:16:44,309][118044] Updated weights for policy 0, policy_version 87770 (0.0006) [2023-03-07 05:16:45,110][118044] Updated weights for policy 0, policy_version 87780 (0.0006) [2023-03-07 05:16:45,889][118044] Updated weights for policy 0, policy_version 87790 (0.0006) [2023-03-07 05:16:46,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 89899008. Throughput: 0: 13126.8. Samples: 89878544. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:16:46,096][117718] Avg episode reward: [(0, '2855.948')] [2023-03-07 05:16:46,654][118044] Updated weights for policy 0, policy_version 87800 (0.0006) [2023-03-07 05:16:47,442][118044] Updated weights for policy 0, policy_version 87810 (0.0006) [2023-03-07 05:16:48,211][118044] Updated weights for policy 0, policy_version 87820 (0.0007) [2023-03-07 05:16:48,975][118044] Updated weights for policy 0, policy_version 87830 (0.0006) [2023-03-07 05:16:49,769][118044] Updated weights for policy 0, policy_version 87840 (0.0006) [2023-03-07 05:16:50,551][118044] Updated weights for policy 0, policy_version 87850 (0.0006) [2023-03-07 05:16:51,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 89965568. Throughput: 0: 13135.3. Samples: 89957657. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:16:51,096][117718] Avg episode reward: [(0, '2896.075')] [2023-03-07 05:16:51,321][118044] Updated weights for policy 0, policy_version 87860 (0.0006) [2023-03-07 05:16:52,101][118044] Updated weights for policy 0, policy_version 87870 (0.0006) [2023-03-07 05:16:52,860][118044] Updated weights for policy 0, policy_version 87880 (0.0006) [2023-03-07 05:16:53,624][118044] Updated weights for policy 0, policy_version 87890 (0.0007) [2023-03-07 05:16:54,395][118044] Updated weights for policy 0, policy_version 87900 (0.0006) [2023-03-07 05:16:55,176][118044] Updated weights for policy 0, policy_version 87910 (0.0007) [2023-03-07 05:16:55,964][118044] Updated weights for policy 0, policy_version 87920 (0.0007) [2023-03-07 05:16:56,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 90031104. Throughput: 0: 13148.2. Samples: 89997569. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:16:56,096][117718] Avg episode reward: [(0, '2806.233')] [2023-03-07 05:16:56,741][118044] Updated weights for policy 0, policy_version 87930 (0.0006) [2023-03-07 05:16:57,533][118044] Updated weights for policy 0, policy_version 87940 (0.0006) [2023-03-07 05:16:58,313][118044] Updated weights for policy 0, policy_version 87950 (0.0006) [2023-03-07 05:16:59,075][118044] Updated weights for policy 0, policy_version 87960 (0.0007) [2023-03-07 05:16:59,862][118044] Updated weights for policy 0, policy_version 87970 (0.0006) [2023-03-07 05:17:00,661][118044] Updated weights for policy 0, policy_version 87980 (0.0006) [2023-03-07 05:17:01,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 90096640. Throughput: 0: 13155.7. Samples: 90076272. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:17:01,087][117718] Avg episode reward: [(0, '2760.006')] [2023-03-07 05:17:01,422][118044] Updated weights for policy 0, policy_version 87990 (0.0006) [2023-03-07 05:17:02,216][118044] Updated weights for policy 0, policy_version 88000 (0.0006) [2023-03-07 05:17:02,970][118044] Updated weights for policy 0, policy_version 88010 (0.0006) [2023-03-07 05:17:03,752][118044] Updated weights for policy 0, policy_version 88020 (0.0007) [2023-03-07 05:17:04,521][118044] Updated weights for policy 0, policy_version 88030 (0.0006) [2023-03-07 05:17:05,312][118044] Updated weights for policy 0, policy_version 88040 (0.0006) [2023-03-07 05:17:06,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 90162176. Throughput: 0: 13164.2. Samples: 90155235. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:17:06,086][117718] Avg episode reward: [(0, '2820.094')] [2023-03-07 05:17:06,093][118044] Updated weights for policy 0, policy_version 88050 (0.0006) [2023-03-07 05:17:06,871][118044] Updated weights for policy 0, policy_version 88060 (0.0006) [2023-03-07 05:17:07,647][118044] Updated weights for policy 0, policy_version 88070 (0.0005) [2023-03-07 05:17:08,430][118044] Updated weights for policy 0, policy_version 88080 (0.0006) [2023-03-07 05:17:09,203][118044] Updated weights for policy 0, policy_version 88090 (0.0006) [2023-03-07 05:17:09,975][118044] Updated weights for policy 0, policy_version 88100 (0.0006) [2023-03-07 05:17:10,755][118044] Updated weights for policy 0, policy_version 88110 (0.0007) [2023-03-07 05:17:11,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 90228736. Throughput: 0: 13165.2. Samples: 90194621. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:17:11,086][117718] Avg episode reward: [(0, '2785.770')] [2023-03-07 05:17:11,530][118044] Updated weights for policy 0, policy_version 88120 (0.0006) [2023-03-07 05:17:12,301][118044] Updated weights for policy 0, policy_version 88130 (0.0006) [2023-03-07 05:17:13,085][118044] Updated weights for policy 0, policy_version 88140 (0.0005) [2023-03-07 05:17:13,868][118044] Updated weights for policy 0, policy_version 88150 (0.0006) [2023-03-07 05:17:14,652][118044] Updated weights for policy 0, policy_version 88160 (0.0006) [2023-03-07 05:17:15,421][118044] Updated weights for policy 0, policy_version 88170 (0.0006) [2023-03-07 05:17:16,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 90294272. Throughput: 0: 13167.3. Samples: 90273624. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 05:17:16,086][117718] Avg episode reward: [(0, '2838.896')] [2023-03-07 05:17:16,198][118044] Updated weights for policy 0, policy_version 88180 (0.0005) [2023-03-07 05:17:16,974][118044] Updated weights for policy 0, policy_version 88190 (0.0006) [2023-03-07 05:17:17,775][118044] Updated weights for policy 0, policy_version 88200 (0.0007) [2023-03-07 05:17:18,564][118044] Updated weights for policy 0, policy_version 88210 (0.0007) [2023-03-07 05:17:19,341][118044] Updated weights for policy 0, policy_version 88220 (0.0006) [2023-03-07 05:17:20,123][118044] Updated weights for policy 0, policy_version 88230 (0.0006) [2023-03-07 05:17:20,903][118044] Updated weights for policy 0, policy_version 88240 (0.0006) [2023-03-07 05:17:21,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13148.8). Total num frames: 90359808. Throughput: 0: 13163.3. Samples: 90352356. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 05:17:21,086][117718] Avg episode reward: [(0, '2821.759')] [2023-03-07 05:17:21,684][118044] Updated weights for policy 0, policy_version 88250 (0.0008) [2023-03-07 05:17:22,464][118044] Updated weights for policy 0, policy_version 88260 (0.0006) [2023-03-07 05:17:23,247][118044] Updated weights for policy 0, policy_version 88270 (0.0006) [2023-03-07 05:17:24,041][118044] Updated weights for policy 0, policy_version 88280 (0.0006) [2023-03-07 05:17:24,813][118044] Updated weights for policy 0, policy_version 88290 (0.0007) [2023-03-07 05:17:25,579][118044] Updated weights for policy 0, policy_version 88300 (0.0006) [2023-03-07 05:17:26,085][117718] Fps is (10 sec: 13107.4, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 90425344. Throughput: 0: 13155.3. Samples: 90391660. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 05:17:26,086][117718] Avg episode reward: [(0, '2877.265')] [2023-03-07 05:17:26,397][118044] Updated weights for policy 0, policy_version 88310 (0.0006) [2023-03-07 05:17:27,177][118044] Updated weights for policy 0, policy_version 88320 (0.0006) [2023-03-07 05:17:27,938][118044] Updated weights for policy 0, policy_version 88330 (0.0006) [2023-03-07 05:17:28,701][118044] Updated weights for policy 0, policy_version 88340 (0.0006) [2023-03-07 05:17:29,490][118044] Updated weights for policy 0, policy_version 88350 (0.0005) [2023-03-07 05:17:30,260][118044] Updated weights for policy 0, policy_version 88360 (0.0006) [2023-03-07 05:17:31,023][118044] Updated weights for policy 0, policy_version 88370 (0.0006) [2023-03-07 05:17:31,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 90490880. Throughput: 0: 13152.6. Samples: 90470411. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 05:17:31,086][117718] Avg episode reward: [(0, '2802.189')] [2023-03-07 05:17:31,803][118044] Updated weights for policy 0, policy_version 88380 (0.0007) [2023-03-07 05:17:32,583][118044] Updated weights for policy 0, policy_version 88390 (0.0006) [2023-03-07 05:17:33,358][118044] Updated weights for policy 0, policy_version 88400 (0.0006) [2023-03-07 05:17:34,133][118044] Updated weights for policy 0, policy_version 88410 (0.0007) [2023-03-07 05:17:34,924][118044] Updated weights for policy 0, policy_version 88420 (0.0007) [2023-03-07 05:17:35,707][118044] Updated weights for policy 0, policy_version 88430 (0.0006) [2023-03-07 05:17:36,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13158.4, 300 sec: 13148.8). Total num frames: 90556416. Throughput: 0: 13149.5. Samples: 90549387. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 05:17:36,086][117718] Avg episode reward: [(0, '2811.473')] [2023-03-07 05:17:36,501][118044] Updated weights for policy 0, policy_version 88440 (0.0006) [2023-03-07 05:17:37,280][118044] Updated weights for policy 0, policy_version 88450 (0.0006) [2023-03-07 05:17:38,050][118044] Updated weights for policy 0, policy_version 88460 (0.0006) [2023-03-07 05:17:38,835][118044] Updated weights for policy 0, policy_version 88470 (0.0006) [2023-03-07 05:17:39,610][118044] Updated weights for policy 0, policy_version 88480 (0.0007) [2023-03-07 05:17:40,373][118044] Updated weights for policy 0, policy_version 88490 (0.0006) [2023-03-07 05:17:41,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 90622976. Throughput: 0: 13135.6. Samples: 90588672. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 05:17:41,086][117718] Avg episode reward: [(0, '2821.429')] [2023-03-07 05:17:41,154][118044] Updated weights for policy 0, policy_version 88500 (0.0006) [2023-03-07 05:17:41,919][118044] Updated weights for policy 0, policy_version 88510 (0.0006) [2023-03-07 05:17:42,682][118044] Updated weights for policy 0, policy_version 88520 (0.0006) [2023-03-07 05:17:43,477][118044] Updated weights for policy 0, policy_version 88530 (0.0006) [2023-03-07 05:17:44,286][118044] Updated weights for policy 0, policy_version 88540 (0.0007) [2023-03-07 05:17:45,070][118044] Updated weights for policy 0, policy_version 88550 (0.0006) [2023-03-07 05:17:45,855][118044] Updated weights for policy 0, policy_version 88560 (0.0006) [2023-03-07 05:17:46,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 90688512. Throughput: 0: 13141.8. Samples: 90667654. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 05:17:46,086][117718] Avg episode reward: [(0, '2828.094')] [2023-03-07 05:17:46,646][118044] Updated weights for policy 0, policy_version 88570 (0.0006) [2023-03-07 05:17:47,416][118044] Updated weights for policy 0, policy_version 88580 (0.0006) [2023-03-07 05:17:48,202][118044] Updated weights for policy 0, policy_version 88590 (0.0006) [2023-03-07 05:17:48,974][118044] Updated weights for policy 0, policy_version 88600 (0.0006) [2023-03-07 05:17:49,750][118044] Updated weights for policy 0, policy_version 88610 (0.0006) [2023-03-07 05:17:50,529][118044] Updated weights for policy 0, policy_version 88620 (0.0006) [2023-03-07 05:17:51,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 90754048. Throughput: 0: 13137.8. Samples: 90746436. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 05:17:51,086][117718] Avg episode reward: [(0, '2867.609')] [2023-03-07 05:17:51,298][118044] Updated weights for policy 0, policy_version 88630 (0.0006) [2023-03-07 05:17:52,067][118044] Updated weights for policy 0, policy_version 88640 (0.0006) [2023-03-07 05:17:52,845][118044] Updated weights for policy 0, policy_version 88650 (0.0007) [2023-03-07 05:17:53,617][118044] Updated weights for policy 0, policy_version 88660 (0.0006) [2023-03-07 05:17:54,395][118044] Updated weights for policy 0, policy_version 88670 (0.0006) [2023-03-07 05:17:55,166][118044] Updated weights for policy 0, policy_version 88680 (0.0006) [2023-03-07 05:17:55,959][118044] Updated weights for policy 0, policy_version 88690 (0.0006) [2023-03-07 05:17:56,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 90819584. Throughput: 0: 13141.7. Samples: 90786000. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 05:17:56,086][117718] Avg episode reward: [(0, '2780.473')] [2023-03-07 05:17:56,744][118044] Updated weights for policy 0, policy_version 88700 (0.0006) [2023-03-07 05:17:57,501][118044] Updated weights for policy 0, policy_version 88710 (0.0006) [2023-03-07 05:17:58,298][118044] Updated weights for policy 0, policy_version 88720 (0.0006) [2023-03-07 05:17:59,072][118044] Updated weights for policy 0, policy_version 88730 (0.0006) [2023-03-07 05:17:59,847][118044] Updated weights for policy 0, policy_version 88740 (0.0006) [2023-03-07 05:18:00,647][118044] Updated weights for policy 0, policy_version 88750 (0.0006) [2023-03-07 05:18:01,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 90885120. Throughput: 0: 13140.2. Samples: 90864933. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 05:18:01,086][117718] Avg episode reward: [(0, '2775.017')] [2023-03-07 05:18:01,413][118044] Updated weights for policy 0, policy_version 88760 (0.0006) [2023-03-07 05:18:02,177][118044] Updated weights for policy 0, policy_version 88770 (0.0006) [2023-03-07 05:18:02,950][118044] Updated weights for policy 0, policy_version 88780 (0.0006) [2023-03-07 05:18:03,724][118044] Updated weights for policy 0, policy_version 88790 (0.0006) [2023-03-07 05:18:04,501][118044] Updated weights for policy 0, policy_version 88800 (0.0006) [2023-03-07 05:18:05,285][118044] Updated weights for policy 0, policy_version 88810 (0.0006) [2023-03-07 05:18:06,065][118044] Updated weights for policy 0, policy_version 88820 (0.0006) [2023-03-07 05:18:06,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 90951680. Throughput: 0: 13150.3. Samples: 90944117. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 05:18:06,086][117718] Avg episode reward: [(0, '2744.544')] [2023-03-07 05:18:06,845][118044] Updated weights for policy 0, policy_version 88830 (0.0007) [2023-03-07 05:18:07,610][118044] Updated weights for policy 0, policy_version 88840 (0.0007) [2023-03-07 05:18:08,404][118044] Updated weights for policy 0, policy_version 88850 (0.0006) [2023-03-07 05:18:09,170][118044] Updated weights for policy 0, policy_version 88860 (0.0006) [2023-03-07 05:18:09,959][118044] Updated weights for policy 0, policy_version 88870 (0.0007) [2023-03-07 05:18:10,727][118044] Updated weights for policy 0, policy_version 88880 (0.0006) [2023-03-07 05:18:11,086][117718] Fps is (10 sec: 13209.4, 60 sec: 13141.3, 300 sec: 13148.8). Total num frames: 91017216. Throughput: 0: 13150.3. Samples: 90983428. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:18:11,086][117718] Avg episode reward: [(0, '2756.958')] [2023-03-07 05:18:11,508][118044] Updated weights for policy 0, policy_version 88890 (0.0007) [2023-03-07 05:18:12,284][118044] Updated weights for policy 0, policy_version 88900 (0.0006) [2023-03-07 05:18:13,054][118044] Updated weights for policy 0, policy_version 88910 (0.0007) [2023-03-07 05:18:13,855][118044] Updated weights for policy 0, policy_version 88920 (0.0006) [2023-03-07 05:18:14,628][118044] Updated weights for policy 0, policy_version 88930 (0.0006) [2023-03-07 05:18:15,415][118044] Updated weights for policy 0, policy_version 88940 (0.0007) [2023-03-07 05:18:16,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.4, 300 sec: 13148.9). Total num frames: 91082752. Throughput: 0: 13156.6. Samples: 91062456. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:18:16,086][117718] Avg episode reward: [(0, '2618.045')] [2023-03-07 05:18:16,180][118044] Updated weights for policy 0, policy_version 88950 (0.0006) [2023-03-07 05:18:16,959][118044] Updated weights for policy 0, policy_version 88960 (0.0006) [2023-03-07 05:18:17,751][118044] Updated weights for policy 0, policy_version 88970 (0.0006) [2023-03-07 05:18:18,522][118044] Updated weights for policy 0, policy_version 88980 (0.0007) [2023-03-07 05:18:19,314][118044] Updated weights for policy 0, policy_version 88990 (0.0006) [2023-03-07 05:18:20,094][118044] Updated weights for policy 0, policy_version 89000 (0.0006) [2023-03-07 05:18:20,878][118044] Updated weights for policy 0, policy_version 89010 (0.0006) [2023-03-07 05:18:21,086][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 91148288. Throughput: 0: 13151.2. Samples: 91141190. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:18:21,086][117718] Avg episode reward: [(0, '2641.609')] [2023-03-07 05:18:21,641][118044] Updated weights for policy 0, policy_version 89020 (0.0007) [2023-03-07 05:18:22,424][118044] Updated weights for policy 0, policy_version 89030 (0.0006) [2023-03-07 05:18:23,196][118044] Updated weights for policy 0, policy_version 89040 (0.0006) [2023-03-07 05:18:23,977][118044] Updated weights for policy 0, policy_version 89050 (0.0006) [2023-03-07 05:18:24,762][118044] Updated weights for policy 0, policy_version 89060 (0.0006) [2023-03-07 05:18:25,553][118044] Updated weights for policy 0, policy_version 89070 (0.0006) [2023-03-07 05:18:26,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 91214848. Throughput: 0: 13159.1. Samples: 91180830. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:18:26,096][117718] Avg episode reward: [(0, '2644.551')] [2023-03-07 05:18:26,305][118044] Updated weights for policy 0, policy_version 89080 (0.0005) [2023-03-07 05:18:27,104][118044] Updated weights for policy 0, policy_version 89090 (0.0005) [2023-03-07 05:18:27,878][118044] Updated weights for policy 0, policy_version 89100 (0.0005) [2023-03-07 05:18:28,660][118044] Updated weights for policy 0, policy_version 89110 (0.0006) [2023-03-07 05:18:29,438][118044] Updated weights for policy 0, policy_version 89120 (0.0006) [2023-03-07 05:18:30,213][118044] Updated weights for policy 0, policy_version 89130 (0.0007) [2023-03-07 05:18:30,996][118044] Updated weights for policy 0, policy_version 89140 (0.0007) [2023-03-07 05:18:31,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 91280384. Throughput: 0: 13160.2. Samples: 91259865. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:18:31,097][117718] Avg episode reward: [(0, '2696.558')] [2023-03-07 05:18:31,761][118044] Updated weights for policy 0, policy_version 89150 (0.0006) [2023-03-07 05:18:32,533][118044] Updated weights for policy 0, policy_version 89160 (0.0006) [2023-03-07 05:18:33,321][118044] Updated weights for policy 0, policy_version 89170 (0.0006) [2023-03-07 05:18:34,086][118044] Updated weights for policy 0, policy_version 89180 (0.0007) [2023-03-07 05:18:34,869][118044] Updated weights for policy 0, policy_version 89190 (0.0006) [2023-03-07 05:18:35,646][118044] Updated weights for policy 0, policy_version 89200 (0.0006) [2023-03-07 05:18:36,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 91345920. Throughput: 0: 13162.2. Samples: 91338735. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:18:36,097][117718] Avg episode reward: [(0, '2660.113')] [2023-03-07 05:18:36,102][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000089205_91345920.pth... 
[2023-03-07 05:18:36,131][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000086126_88193024.pth [2023-03-07 05:18:36,423][118044] Updated weights for policy 0, policy_version 89210 (0.0006) [2023-03-07 05:18:37,222][118044] Updated weights for policy 0, policy_version 89220 (0.0006) [2023-03-07 05:18:37,990][118044] Updated weights for policy 0, policy_version 89230 (0.0006) [2023-03-07 05:18:38,776][118044] Updated weights for policy 0, policy_version 89240 (0.0006) [2023-03-07 05:18:39,559][118044] Updated weights for policy 0, policy_version 89250 (0.0007) [2023-03-07 05:18:40,324][118044] Updated weights for policy 0, policy_version 89260 (0.0006) [2023-03-07 05:18:41,082][118044] Updated weights for policy 0, policy_version 89270 (0.0007) [2023-03-07 05:18:41,085][117718] Fps is (10 sec: 13209.8, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 91412480. Throughput: 0: 13154.6. Samples: 91377955. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:18:41,096][117718] Avg episode reward: [(0, '2802.194')] [2023-03-07 05:18:41,879][118044] Updated weights for policy 0, policy_version 89280 (0.0006) [2023-03-07 05:18:42,653][118044] Updated weights for policy 0, policy_version 89290 (0.0007) [2023-03-07 05:18:43,434][118044] Updated weights for policy 0, policy_version 89300 (0.0006) [2023-03-07 05:18:44,205][118044] Updated weights for policy 0, policy_version 89310 (0.0006) [2023-03-07 05:18:45,007][118044] Updated weights for policy 0, policy_version 89320 (0.0007) [2023-03-07 05:18:45,777][118044] Updated weights for policy 0, policy_version 89330 (0.0006) [2023-03-07 05:18:46,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 91476992. Throughput: 0: 13155.2. Samples: 91456915. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:18:46,096][117718] Avg episode reward: [(0, '2745.483')] [2023-03-07 05:18:46,562][118044] Updated weights for policy 0, policy_version 89340 (0.0006) [2023-03-07 05:18:47,366][118044] Updated weights for policy 0, policy_version 89350 (0.0006) [2023-03-07 05:18:48,125][118044] Updated weights for policy 0, policy_version 89360 (0.0006) [2023-03-07 05:18:48,896][118044] Updated weights for policy 0, policy_version 89370 (0.0006) [2023-03-07 05:18:49,670][118044] Updated weights for policy 0, policy_version 89380 (0.0006) [2023-03-07 05:18:50,449][118044] Updated weights for policy 0, policy_version 89390 (0.0006) [2023-03-07 05:18:51,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 91543552. Throughput: 0: 13149.1. Samples: 91535826. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:18:51,096][117718] Avg episode reward: [(0, '2798.288')] [2023-03-07 05:18:51,243][118044] Updated weights for policy 0, policy_version 89400 (0.0006) [2023-03-07 05:18:51,995][118044] Updated weights for policy 0, policy_version 89410 (0.0006) [2023-03-07 05:18:52,787][118044] Updated weights for policy 0, policy_version 89420 (0.0007) [2023-03-07 05:18:53,592][118044] Updated weights for policy 0, policy_version 89430 (0.0006) [2023-03-07 05:18:54,351][118044] Updated weights for policy 0, policy_version 89440 (0.0006) [2023-03-07 05:18:55,130][118044] Updated weights for policy 0, policy_version 89450 (0.0006) [2023-03-07 05:18:55,905][118044] Updated weights for policy 0, policy_version 89460 (0.0007) [2023-03-07 05:18:56,085][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 91609088. Throughput: 0: 13150.4. Samples: 91575192. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:18:56,086][117718] Avg episode reward: [(0, '2896.135')] [2023-03-07 05:18:56,668][118044] Updated weights for policy 0, policy_version 89470 (0.0006) [2023-03-07 05:18:57,437][118044] Updated weights for policy 0, policy_version 89480 (0.0005) [2023-03-07 05:18:58,223][118044] Updated weights for policy 0, policy_version 89490 (0.0006) [2023-03-07 05:18:59,000][118044] Updated weights for policy 0, policy_version 89500 (0.0006) [2023-03-07 05:18:59,780][118044] Updated weights for policy 0, policy_version 89510 (0.0008) [2023-03-07 05:19:00,552][118044] Updated weights for policy 0, policy_version 89520 (0.0006) [2023-03-07 05:19:01,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 91674624. Throughput: 0: 13155.8. Samples: 91654465. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:19:01,086][117718] Avg episode reward: [(0, '2906.971')] [2023-03-07 05:19:01,318][118044] Updated weights for policy 0, policy_version 89530 (0.0007) [2023-03-07 05:19:02,117][118044] Updated weights for policy 0, policy_version 89540 (0.0007) [2023-03-07 05:19:02,898][118044] Updated weights for policy 0, policy_version 89550 (0.0005) [2023-03-07 05:19:03,275][117993] KL-divergence is very high: 2006.1792 [2023-03-07 05:19:03,679][118044] Updated weights for policy 0, policy_version 89560 (0.0006) [2023-03-07 05:19:04,449][118044] Updated weights for policy 0, policy_version 89570 (0.0007) [2023-03-07 05:19:05,225][118044] Updated weights for policy 0, policy_version 89580 (0.0006) [2023-03-07 05:19:05,997][118044] Updated weights for policy 0, policy_version 89590 (0.0006) [2023-03-07 05:19:06,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 91741184. Throughput: 0: 13161.4. Samples: 91733452. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:19:06,086][117718] Avg episode reward: [(0, '2895.953')] [2023-03-07 05:19:06,783][118044] Updated weights for policy 0, policy_version 89600 (0.0006) [2023-03-07 05:19:07,555][118044] Updated weights for policy 0, policy_version 89610 (0.0006) [2023-03-07 05:19:08,307][118044] Updated weights for policy 0, policy_version 89620 (0.0006) [2023-03-07 05:19:08,529][117993] KL-divergence is very high: 173.1451 [2023-03-07 05:19:09,078][118044] Updated weights for policy 0, policy_version 89630 (0.0006) [2023-03-07 05:19:09,840][118044] Updated weights for policy 0, policy_version 89640 (0.0006) [2023-03-07 05:19:10,622][118044] Updated weights for policy 0, policy_version 89650 (0.0006) [2023-03-07 05:19:11,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 91806720. Throughput: 0: 13163.7. Samples: 91773197. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:19:11,086][117718] Avg episode reward: [(0, '2820.536')] [2023-03-07 05:19:11,411][118044] Updated weights for policy 0, policy_version 89660 (0.0007) [2023-03-07 05:19:12,197][118044] Updated weights for policy 0, policy_version 89670 (0.0006) [2023-03-07 05:19:12,976][118044] Updated weights for policy 0, policy_version 89680 (0.0007) [2023-03-07 05:19:13,738][118044] Updated weights for policy 0, policy_version 89690 (0.0006) [2023-03-07 05:19:14,529][118044] Updated weights for policy 0, policy_version 89700 (0.0006) [2023-03-07 05:19:15,300][118044] Updated weights for policy 0, policy_version 89710 (0.0005) [2023-03-07 05:19:16,085][118044] Updated weights for policy 0, policy_version 89720 (0.0006) [2023-03-07 05:19:16,085][117718] Fps is (10 sec: 13209.5, 60 sec: 13175.5, 300 sec: 13148.9). Total num frames: 91873280. Throughput: 0: 13163.7. Samples: 91852231. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:19:16,086][117718] Avg episode reward: [(0, '2877.474')] [2023-03-07 05:19:16,849][118044] Updated weights for policy 0, policy_version 89730 (0.0006) [2023-03-07 05:19:17,635][118044] Updated weights for policy 0, policy_version 89740 (0.0006) [2023-03-07 05:19:18,426][118044] Updated weights for policy 0, policy_version 89750 (0.0006) [2023-03-07 05:19:19,202][118044] Updated weights for policy 0, policy_version 89760 (0.0006) [2023-03-07 05:19:19,984][118044] Updated weights for policy 0, policy_version 89770 (0.0006) [2023-03-07 05:19:20,761][118044] Updated weights for policy 0, policy_version 89780 (0.0006) [2023-03-07 05:19:21,086][117718] Fps is (10 sec: 13209.4, 60 sec: 13175.5, 300 sec: 13145.4). Total num frames: 91938816. Throughput: 0: 13162.6. Samples: 91931052. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:19:21,086][117718] Avg episode reward: [(0, '2782.453')] [2023-03-07 05:19:21,538][118044] Updated weights for policy 0, policy_version 89790 (0.0006) [2023-03-07 05:19:22,328][118044] Updated weights for policy 0, policy_version 89800 (0.0006) [2023-03-07 05:19:23,081][118044] Updated weights for policy 0, policy_version 89810 (0.0006) [2023-03-07 05:19:23,869][118044] Updated weights for policy 0, policy_version 89820 (0.0006) [2023-03-07 05:19:24,645][118044] Updated weights for policy 0, policy_version 89830 (0.0007) [2023-03-07 05:19:25,435][118044] Updated weights for policy 0, policy_version 89840 (0.0006) [2023-03-07 05:19:26,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13158.3, 300 sec: 13145.4). Total num frames: 92004352. Throughput: 0: 13170.6. Samples: 91970636. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:19:26,086][117718] Avg episode reward: [(0, '2799.702')] [2023-03-07 05:19:26,221][118044] Updated weights for policy 0, policy_version 89850 (0.0006) [2023-03-07 05:19:26,991][118044] Updated weights for policy 0, policy_version 89860 (0.0006) [2023-03-07 05:19:27,790][118044] Updated weights for policy 0, policy_version 89870 (0.0006) [2023-03-07 05:19:28,575][118044] Updated weights for policy 0, policy_version 89880 (0.0006) [2023-03-07 05:19:29,345][118044] Updated weights for policy 0, policy_version 89890 (0.0007) [2023-03-07 05:19:30,140][118044] Updated weights for policy 0, policy_version 89900 (0.0006) [2023-03-07 05:19:30,922][118044] Updated weights for policy 0, policy_version 89910 (0.0005) [2023-03-07 05:19:31,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 92069888. Throughput: 0: 13158.2. Samples: 92049034. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:19:31,086][117718] Avg episode reward: [(0, '2850.008')] [2023-03-07 05:19:31,713][118044] Updated weights for policy 0, policy_version 89920 (0.0006) [2023-03-07 05:19:32,503][118044] Updated weights for policy 0, policy_version 89930 (0.0005) [2023-03-07 05:19:33,270][118044] Updated weights for policy 0, policy_version 89940 (0.0006) [2023-03-07 05:19:34,039][118044] Updated weights for policy 0, policy_version 89950 (0.0006) [2023-03-07 05:19:34,821][118044] Updated weights for policy 0, policy_version 89960 (0.0006) [2023-03-07 05:19:35,597][118044] Updated weights for policy 0, policy_version 89970 (0.0006) [2023-03-07 05:19:36,085][117718] Fps is (10 sec: 13107.4, 60 sec: 13158.4, 300 sec: 13141.9). Total num frames: 92135424. Throughput: 0: 13155.5. Samples: 92127822. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:19:36,086][117718] Avg episode reward: [(0, '2909.305')] [2023-03-07 05:19:36,375][118044] Updated weights for policy 0, policy_version 89980 (0.0007) [2023-03-07 05:19:37,162][118044] Updated weights for policy 0, policy_version 89990 (0.0006) [2023-03-07 05:19:37,936][118044] Updated weights for policy 0, policy_version 90000 (0.0005) [2023-03-07 05:19:38,732][118044] Updated weights for policy 0, policy_version 90010 (0.0006) [2023-03-07 05:19:39,503][118044] Updated weights for policy 0, policy_version 90020 (0.0006) [2023-03-07 05:19:40,280][118044] Updated weights for policy 0, policy_version 90030 (0.0005) [2023-03-07 05:19:41,053][118044] Updated weights for policy 0, policy_version 90040 (0.0006) [2023-03-07 05:19:41,085][117718] Fps is (10 sec: 13107.4, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 92200960. Throughput: 0: 13154.3. Samples: 92167133. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:19:41,086][117718] Avg episode reward: [(0, '2885.995')] [2023-03-07 05:19:41,847][118044] Updated weights for policy 0, policy_version 90050 (0.0006) [2023-03-07 05:19:42,630][118044] Updated weights for policy 0, policy_version 90060 (0.0006) [2023-03-07 05:19:43,404][118044] Updated weights for policy 0, policy_version 90070 (0.0007) [2023-03-07 05:19:44,187][118044] Updated weights for policy 0, policy_version 90080 (0.0007) [2023-03-07 05:19:44,971][118044] Updated weights for policy 0, policy_version 90090 (0.0007) [2023-03-07 05:19:45,741][118044] Updated weights for policy 0, policy_version 90100 (0.0005) [2023-03-07 05:19:46,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13141.9). Total num frames: 92266496. Throughput: 0: 13143.6. Samples: 92245928. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:19:46,086][117718] Avg episode reward: [(0, '2996.013')] [2023-03-07 05:19:46,529][118044] Updated weights for policy 0, policy_version 90110 (0.0006) [2023-03-07 05:19:47,313][118044] Updated weights for policy 0, policy_version 90120 (0.0006) [2023-03-07 05:19:48,074][118044] Updated weights for policy 0, policy_version 90130 (0.0006) [2023-03-07 05:19:48,858][118044] Updated weights for policy 0, policy_version 90140 (0.0006) [2023-03-07 05:19:49,625][118044] Updated weights for policy 0, policy_version 90150 (0.0006) [2023-03-07 05:19:50,418][118044] Updated weights for policy 0, policy_version 90160 (0.0006) [2023-03-07 05:19:51,085][117718] Fps is (10 sec: 13107.0, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 92332032. Throughput: 0: 13141.0. Samples: 92324800. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:19:51,086][117718] Avg episode reward: [(0, '2883.130')] [2023-03-07 05:19:51,194][118044] Updated weights for policy 0, policy_version 90170 (0.0007) [2023-03-07 05:19:51,964][118044] Updated weights for policy 0, policy_version 90180 (0.0005) [2023-03-07 05:19:52,748][118044] Updated weights for policy 0, policy_version 90190 (0.0006) [2023-03-07 05:19:53,529][118044] Updated weights for policy 0, policy_version 90200 (0.0006) [2023-03-07 05:19:54,322][118044] Updated weights for policy 0, policy_version 90210 (0.0006) [2023-03-07 05:19:55,106][118044] Updated weights for policy 0, policy_version 90220 (0.0006) [2023-03-07 05:19:55,882][118044] Updated weights for policy 0, policy_version 90230 (0.0006) [2023-03-07 05:19:56,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 92397568. Throughput: 0: 13131.7. Samples: 92364125. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:19:56,086][117718] Avg episode reward: [(0, '3001.568')] [2023-03-07 05:19:56,649][118044] Updated weights for policy 0, policy_version 90240 (0.0006) [2023-03-07 05:19:57,435][118044] Updated weights for policy 0, policy_version 90250 (0.0006) [2023-03-07 05:19:58,206][118044] Updated weights for policy 0, policy_version 90260 (0.0005) [2023-03-07 05:19:58,983][118044] Updated weights for policy 0, policy_version 90270 (0.0006) [2023-03-07 05:19:59,756][118044] Updated weights for policy 0, policy_version 90280 (0.0006) [2023-03-07 05:20:00,535][118044] Updated weights for policy 0, policy_version 90290 (0.0006) [2023-03-07 05:20:01,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 92464128. Throughput: 0: 13130.1. Samples: 92443087. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 05:20:01,086][117718] Avg episode reward: [(0, '2909.178')] [2023-03-07 05:20:01,307][118044] Updated weights for policy 0, policy_version 90300 (0.0006) [2023-03-07 05:20:02,096][118044] Updated weights for policy 0, policy_version 90310 (0.0006) [2023-03-07 05:20:02,875][118044] Updated weights for policy 0, policy_version 90320 (0.0007) [2023-03-07 05:20:03,661][118044] Updated weights for policy 0, policy_version 90330 (0.0007) [2023-03-07 05:20:04,441][118044] Updated weights for policy 0, policy_version 90340 (0.0006) [2023-03-07 05:20:05,219][118044] Updated weights for policy 0, policy_version 90350 (0.0006) [2023-03-07 05:20:05,996][118044] Updated weights for policy 0, policy_version 90360 (0.0006) [2023-03-07 05:20:06,086][117718] Fps is (10 sec: 13209.4, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 92529664. Throughput: 0: 13131.8. Samples: 92521982. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 05:20:06,086][117718] Avg episode reward: [(0, '2863.334')] [2023-03-07 05:20:06,778][118044] Updated weights for policy 0, policy_version 90370 (0.0006) [2023-03-07 05:20:07,572][118044] Updated weights for policy 0, policy_version 90380 (0.0007) [2023-03-07 05:20:08,346][118044] Updated weights for policy 0, policy_version 90390 (0.0006) [2023-03-07 05:20:09,130][118044] Updated weights for policy 0, policy_version 90400 (0.0005) [2023-03-07 05:20:09,923][118044] Updated weights for policy 0, policy_version 90410 (0.0006) [2023-03-07 05:20:10,693][118044] Updated weights for policy 0, policy_version 90420 (0.0006) [2023-03-07 05:20:11,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 92595200. Throughput: 0: 13125.5. Samples: 92561284. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 05:20:11,086][117718] Avg episode reward: [(0, '2834.079')] [2023-03-07 05:20:11,468][118044] Updated weights for policy 0, policy_version 90430 (0.0007) [2023-03-07 05:20:12,237][118044] Updated weights for policy 0, policy_version 90440 (0.0006) [2023-03-07 05:20:13,012][118044] Updated weights for policy 0, policy_version 90450 (0.0006) [2023-03-07 05:20:13,804][118044] Updated weights for policy 0, policy_version 90460 (0.0006) [2023-03-07 05:20:14,594][118044] Updated weights for policy 0, policy_version 90470 (0.0006) [2023-03-07 05:20:15,361][118044] Updated weights for policy 0, policy_version 90480 (0.0006) [2023-03-07 05:20:16,085][117718] Fps is (10 sec: 13005.0, 60 sec: 13107.2, 300 sec: 13141.9). Total num frames: 92659712. Throughput: 0: 13131.7. Samples: 92639958. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 05:20:16,086][117718] Avg episode reward: [(0, '2768.131')] [2023-03-07 05:20:16,142][118044] Updated weights for policy 0, policy_version 90490 (0.0006) [2023-03-07 05:20:16,937][118044] Updated weights for policy 0, policy_version 90500 (0.0006) [2023-03-07 05:20:17,698][118044] Updated weights for policy 0, policy_version 90510 (0.0007) [2023-03-07 05:20:18,475][118044] Updated weights for policy 0, policy_version 90520 (0.0006) [2023-03-07 05:20:19,252][118044] Updated weights for policy 0, policy_version 90530 (0.0005) [2023-03-07 05:20:20,032][118044] Updated weights for policy 0, policy_version 90540 (0.0006) [2023-03-07 05:20:20,827][118044] Updated weights for policy 0, policy_version 90550 (0.0006) [2023-03-07 05:20:21,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13145.4). Total num frames: 92726272. Throughput: 0: 13135.0. Samples: 92718897. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 05:20:21,086][117718] Avg episode reward: [(0, '2849.451')] [2023-03-07 05:20:21,603][118044] Updated weights for policy 0, policy_version 90560 (0.0006) [2023-03-07 05:20:22,393][118044] Updated weights for policy 0, policy_version 90570 (0.0006) [2023-03-07 05:20:23,165][118044] Updated weights for policy 0, policy_version 90580 (0.0006) [2023-03-07 05:20:23,963][118044] Updated weights for policy 0, policy_version 90590 (0.0006) [2023-03-07 05:20:24,752][118044] Updated weights for policy 0, policy_version 90600 (0.0006) [2023-03-07 05:20:25,510][118044] Updated weights for policy 0, policy_version 90610 (0.0006) [2023-03-07 05:20:26,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13124.3, 300 sec: 13145.4). Total num frames: 92791808. Throughput: 0: 13130.5. Samples: 92758005. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 05:20:26,086][117718] Avg episode reward: [(0, '2769.417')] [2023-03-07 05:20:26,278][118044] Updated weights for policy 0, policy_version 90620 (0.0005) [2023-03-07 05:20:27,047][118044] Updated weights for policy 0, policy_version 90630 (0.0007) [2023-03-07 05:20:27,829][118044] Updated weights for policy 0, policy_version 90640 (0.0006) [2023-03-07 05:20:28,601][118044] Updated weights for policy 0, policy_version 90650 (0.0006) [2023-03-07 05:20:29,376][118044] Updated weights for policy 0, policy_version 90660 (0.0006) [2023-03-07 05:20:30,160][118044] Updated weights for policy 0, policy_version 90670 (0.0006) [2023-03-07 05:20:30,935][118044] Updated weights for policy 0, policy_version 90680 (0.0006) [2023-03-07 05:20:31,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13145.4). Total num frames: 92857344. Throughput: 0: 13138.8. Samples: 92837172. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 05:20:31,086][117718] Avg episode reward: [(0, '2853.146')] [2023-03-07 05:20:31,709][118044] Updated weights for policy 0, policy_version 90690 (0.0005) [2023-03-07 05:20:32,510][118044] Updated weights for policy 0, policy_version 90700 (0.0006) [2023-03-07 05:20:33,267][118044] Updated weights for policy 0, policy_version 90710 (0.0006) [2023-03-07 05:20:34,065][118044] Updated weights for policy 0, policy_version 90720 (0.0005) [2023-03-07 05:20:34,854][118044] Updated weights for policy 0, policy_version 90730 (0.0005) [2023-03-07 05:20:35,613][118044] Updated weights for policy 0, policy_version 90740 (0.0006) [2023-03-07 05:20:36,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 92923904. Throughput: 0: 13136.6. Samples: 92915944. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 05:20:36,086][117718] Avg episode reward: [(0, '2842.052')] [2023-03-07 05:20:36,089][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000090746_92923904.pth... [2023-03-07 05:20:36,120][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000087664_89767936.pth [2023-03-07 05:20:36,385][118044] Updated weights for policy 0, policy_version 90750 (0.0006) [2023-03-07 05:20:37,169][118044] Updated weights for policy 0, policy_version 90760 (0.0006) [2023-03-07 05:20:37,963][118044] Updated weights for policy 0, policy_version 90770 (0.0006) [2023-03-07 05:20:38,737][118044] Updated weights for policy 0, policy_version 90780 (0.0007) [2023-03-07 05:20:39,499][118044] Updated weights for policy 0, policy_version 90790 (0.0006) [2023-03-07 05:20:40,280][118044] Updated weights for policy 0, policy_version 90800 (0.0006) [2023-03-07 05:20:41,049][118044] Updated weights for policy 0, policy_version 90810 (0.0006) [2023-03-07 05:20:41,085][117718] Fps is (10 sec: 13209.5, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 92989440. Throughput: 0: 13139.4. Samples: 92955400. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 05:20:41,097][117718] Avg episode reward: [(0, '2871.042')] [2023-03-07 05:20:41,850][118044] Updated weights for policy 0, policy_version 90820 (0.0006) [2023-03-07 05:20:42,602][118044] Updated weights for policy 0, policy_version 90830 (0.0006) [2023-03-07 05:20:43,388][118044] Updated weights for policy 0, policy_version 90840 (0.0006) [2023-03-07 05:20:44,154][118044] Updated weights for policy 0, policy_version 90850 (0.0006) [2023-03-07 05:20:44,925][118044] Updated weights for policy 0, policy_version 90860 (0.0007) [2023-03-07 05:20:45,719][118044] Updated weights for policy 0, policy_version 90870 (0.0006) [2023-03-07 05:20:46,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 93054976. Throughput: 0: 13147.6. Samples: 93034728. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 05:20:46,096][117718] Avg episode reward: [(0, '2855.880')] [2023-03-07 05:20:46,501][118044] Updated weights for policy 0, policy_version 90880 (0.0007) [2023-03-07 05:20:47,274][118044] Updated weights for policy 0, policy_version 90890 (0.0006) [2023-03-07 05:20:48,057][118044] Updated weights for policy 0, policy_version 90900 (0.0006) [2023-03-07 05:20:48,826][118044] Updated weights for policy 0, policy_version 90910 (0.0006) [2023-03-07 05:20:49,589][118044] Updated weights for policy 0, policy_version 90920 (0.0007) [2023-03-07 05:20:50,377][118044] Updated weights for policy 0, policy_version 90930 (0.0007) [2023-03-07 05:20:51,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 93121536. Throughput: 0: 13148.4. Samples: 93113658. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 05:20:51,086][117718] Avg episode reward: [(0, '2830.583')] [2023-03-07 05:20:51,155][118044] Updated weights for policy 0, policy_version 90940 (0.0007) [2023-03-07 05:20:51,924][118044] Updated weights for policy 0, policy_version 90950 (0.0006) [2023-03-07 05:20:52,713][118044] Updated weights for policy 0, policy_version 90960 (0.0006) [2023-03-07 05:20:53,488][118044] Updated weights for policy 0, policy_version 90970 (0.0005) [2023-03-07 05:20:54,289][118044] Updated weights for policy 0, policy_version 90980 (0.0006) [2023-03-07 05:20:55,064][118044] Updated weights for policy 0, policy_version 90990 (0.0006) [2023-03-07 05:20:55,829][118044] Updated weights for policy 0, policy_version 91000 (0.0006) [2023-03-07 05:20:56,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 93187072. Throughput: 0: 13151.8. Samples: 93153114. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:20:56,086][117718] Avg episode reward: [(0, '2786.933')] [2023-03-07 05:20:56,611][118044] Updated weights for policy 0, policy_version 91010 (0.0006) [2023-03-07 05:20:57,392][118044] Updated weights for policy 0, policy_version 91020 (0.0007) [2023-03-07 05:20:58,157][118044] Updated weights for policy 0, policy_version 91030 (0.0006) [2023-03-07 05:20:58,956][118044] Updated weights for policy 0, policy_version 91040 (0.0006) [2023-03-07 05:20:59,757][118044] Updated weights for policy 0, policy_version 91050 (0.0006) [2023-03-07 05:21:00,514][118044] Updated weights for policy 0, policy_version 91060 (0.0005) [2023-03-07 05:21:01,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 93252608. Throughput: 0: 13152.9. Samples: 93231842. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:21:01,086][117718] Avg episode reward: [(0, '2826.839')] [2023-03-07 05:21:01,286][118044] Updated weights for policy 0, policy_version 91070 (0.0006) [2023-03-07 05:21:02,065][118044] Updated weights for policy 0, policy_version 91080 (0.0005) [2023-03-07 05:21:02,830][118044] Updated weights for policy 0, policy_version 91090 (0.0006) [2023-03-07 05:21:03,617][118044] Updated weights for policy 0, policy_version 91100 (0.0008) [2023-03-07 05:21:04,401][118044] Updated weights for policy 0, policy_version 91110 (0.0006) [2023-03-07 05:21:05,190][118044] Updated weights for policy 0, policy_version 91120 (0.0007) [2023-03-07 05:21:05,952][118044] Updated weights for policy 0, policy_version 91130 (0.0007) [2023-03-07 05:21:06,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.4, 300 sec: 13148.9). Total num frames: 93318144. Throughput: 0: 13152.6. Samples: 93310763. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:21:06,086][117718] Avg episode reward: [(0, '2814.664')] [2023-03-07 05:21:06,737][118044] Updated weights for policy 0, policy_version 91140 (0.0007) [2023-03-07 05:21:07,520][118044] Updated weights for policy 0, policy_version 91150 (0.0005) [2023-03-07 05:21:08,298][118044] Updated weights for policy 0, policy_version 91160 (0.0007) [2023-03-07 05:21:09,066][118044] Updated weights for policy 0, policy_version 91170 (0.0007) [2023-03-07 05:21:09,861][118044] Updated weights for policy 0, policy_version 91180 (0.0007) [2023-03-07 05:21:10,633][118044] Updated weights for policy 0, policy_version 91190 (0.0007) [2023-03-07 05:21:11,085][117718] Fps is (10 sec: 13107.4, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 93383680. Throughput: 0: 13163.0. Samples: 93350340. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:21:11,086][117718] Avg episode reward: [(0, '2770.772')] [2023-03-07 05:21:11,403][118044] Updated weights for policy 0, policy_version 91200 (0.0005) [2023-03-07 05:21:12,181][118044] Updated weights for policy 0, policy_version 91210 (0.0006) [2023-03-07 05:21:12,965][118044] Updated weights for policy 0, policy_version 91220 (0.0006) [2023-03-07 05:21:13,739][118044] Updated weights for policy 0, policy_version 91230 (0.0006) [2023-03-07 05:21:14,519][118044] Updated weights for policy 0, policy_version 91240 (0.0006) [2023-03-07 05:21:15,301][118044] Updated weights for policy 0, policy_version 91250 (0.0006) [2023-03-07 05:21:16,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 93449216. Throughput: 0: 13158.9. Samples: 93429323. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:21:16,086][117718] Avg episode reward: [(0, '2769.173')] [2023-03-07 05:21:16,089][118044] Updated weights for policy 0, policy_version 91260 (0.0006) [2023-03-07 05:21:16,862][118044] Updated weights for policy 0, policy_version 91270 (0.0006) [2023-03-07 05:21:17,644][118044] Updated weights for policy 0, policy_version 91280 (0.0006) [2023-03-07 05:21:18,430][118044] Updated weights for policy 0, policy_version 91290 (0.0006) [2023-03-07 05:21:19,213][118044] Updated weights for policy 0, policy_version 91300 (0.0006) [2023-03-07 05:21:19,986][118044] Updated weights for policy 0, policy_version 91310 (0.0006) [2023-03-07 05:21:20,784][118044] Updated weights for policy 0, policy_version 91320 (0.0006) [2023-03-07 05:21:21,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 93514752. Throughput: 0: 13154.8. Samples: 93507911. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:21:21,086][117718] Avg episode reward: [(0, '2806.600')] [2023-03-07 05:21:21,545][118044] Updated weights for policy 0, policy_version 91330 (0.0006) [2023-03-07 05:21:22,330][118044] Updated weights for policy 0, policy_version 91340 (0.0006) [2023-03-07 05:21:23,105][118044] Updated weights for policy 0, policy_version 91350 (0.0007) [2023-03-07 05:21:23,886][118044] Updated weights for policy 0, policy_version 91360 (0.0006) [2023-03-07 05:21:24,675][118044] Updated weights for policy 0, policy_version 91370 (0.0006) [2023-03-07 05:21:25,445][118044] Updated weights for policy 0, policy_version 91380 (0.0006) [2023-03-07 05:21:26,085][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 93581312. Throughput: 0: 13156.6. Samples: 93547449. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:21:26,086][117718] Avg episode reward: [(0, '2862.189')] [2023-03-07 05:21:26,196][118044] Updated weights for policy 0, policy_version 91390 (0.0006) [2023-03-07 05:21:26,987][118044] Updated weights for policy 0, policy_version 91400 (0.0006) [2023-03-07 05:21:27,767][118044] Updated weights for policy 0, policy_version 91410 (0.0006) [2023-03-07 05:21:28,549][118044] Updated weights for policy 0, policy_version 91420 (0.0006) [2023-03-07 05:21:29,312][118044] Updated weights for policy 0, policy_version 91430 (0.0006) [2023-03-07 05:21:30,095][118044] Updated weights for policy 0, policy_version 91440 (0.0006) [2023-03-07 05:21:30,863][118044] Updated weights for policy 0, policy_version 91450 (0.0006) [2023-03-07 05:21:31,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 93646848. Throughput: 0: 13153.3. Samples: 93626623. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:21:31,086][117718] Avg episode reward: [(0, '2792.782')] [2023-03-07 05:21:31,641][118044] Updated weights for policy 0, policy_version 91460 (0.0006) [2023-03-07 05:21:32,429][118044] Updated weights for policy 0, policy_version 91470 (0.0006) [2023-03-07 05:21:33,199][118044] Updated weights for policy 0, policy_version 91480 (0.0005) [2023-03-07 05:21:33,973][118044] Updated weights for policy 0, policy_version 91490 (0.0006) [2023-03-07 05:21:34,745][118044] Updated weights for policy 0, policy_version 91500 (0.0006) [2023-03-07 05:21:35,527][118044] Updated weights for policy 0, policy_version 91510 (0.0007) [2023-03-07 05:21:36,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 93713408. Throughput: 0: 13159.5. Samples: 93705835. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:21:36,086][117718] Avg episode reward: [(0, '2782.278')] [2023-03-07 05:21:36,296][118044] Updated weights for policy 0, policy_version 91520 (0.0006) [2023-03-07 05:21:37,080][118044] Updated weights for policy 0, policy_version 91530 (0.0007) [2023-03-07 05:21:37,855][118044] Updated weights for policy 0, policy_version 91540 (0.0006) [2023-03-07 05:21:38,636][118044] Updated weights for policy 0, policy_version 91550 (0.0007) [2023-03-07 05:21:39,418][118044] Updated weights for policy 0, policy_version 91560 (0.0006) [2023-03-07 05:21:40,193][118044] Updated weights for policy 0, policy_version 91570 (0.0006) [2023-03-07 05:21:40,975][118044] Updated weights for policy 0, policy_version 91580 (0.0006) [2023-03-07 05:21:41,085][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 93778944. Throughput: 0: 13158.9. Samples: 93745266. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:21:41,086][117718] Avg episode reward: [(0, '2876.269')] [2023-03-07 05:21:41,770][118044] Updated weights for policy 0, policy_version 91590 (0.0006) [2023-03-07 05:21:42,545][118044] Updated weights for policy 0, policy_version 91600 (0.0006) [2023-03-07 05:21:43,325][118044] Updated weights for policy 0, policy_version 91610 (0.0006) [2023-03-07 05:21:44,118][118044] Updated weights for policy 0, policy_version 91620 (0.0006) [2023-03-07 05:21:44,882][118044] Updated weights for policy 0, policy_version 91630 (0.0006) [2023-03-07 05:21:45,648][118044] Updated weights for policy 0, policy_version 91640 (0.0007) [2023-03-07 05:21:46,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13148.8). Total num frames: 93844480. Throughput: 0: 13158.9. Samples: 93823991. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:21:46,086][117718] Avg episode reward: [(0, '2886.261')] [2023-03-07 05:21:46,451][118044] Updated weights for policy 0, policy_version 91650 (0.0005) [2023-03-07 05:21:47,218][118044] Updated weights for policy 0, policy_version 91660 (0.0006) [2023-03-07 05:21:48,013][118044] Updated weights for policy 0, policy_version 91670 (0.0006) [2023-03-07 05:21:48,781][118044] Updated weights for policy 0, policy_version 91680 (0.0006) [2023-03-07 05:21:49,558][118044] Updated weights for policy 0, policy_version 91690 (0.0006) [2023-03-07 05:21:50,349][118044] Updated weights for policy 0, policy_version 91700 (0.0007) [2023-03-07 05:21:51,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 93910016. Throughput: 0: 13156.1. Samples: 93902788. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 05:21:51,097][117718] Avg episode reward: [(0, '2888.220')] [2023-03-07 05:21:51,126][118044] Updated weights for policy 0, policy_version 91710 (0.0006) [2023-03-07 05:21:51,894][118044] Updated weights for policy 0, policy_version 91720 (0.0007) [2023-03-07 05:21:52,686][118044] Updated weights for policy 0, policy_version 91730 (0.0007) [2023-03-07 05:21:53,471][118044] Updated weights for policy 0, policy_version 91740 (0.0006) [2023-03-07 05:21:54,239][118044] Updated weights for policy 0, policy_version 91750 (0.0006) [2023-03-07 05:21:55,019][118044] Updated weights for policy 0, policy_version 91760 (0.0006) [2023-03-07 05:21:55,806][118044] Updated weights for policy 0, policy_version 91770 (0.0007) [2023-03-07 05:21:56,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 93975552. Throughput: 0: 13147.9. Samples: 93941996. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 05:21:56,096][117718] Avg episode reward: [(0, '2917.374')] [2023-03-07 05:21:56,589][118044] Updated weights for policy 0, policy_version 91780 (0.0006) [2023-03-07 05:21:57,375][118044] Updated weights for policy 0, policy_version 91790 (0.0007) [2023-03-07 05:21:58,161][118044] Updated weights for policy 0, policy_version 91800 (0.0006) [2023-03-07 05:21:58,953][118044] Updated weights for policy 0, policy_version 91810 (0.0006) [2023-03-07 05:21:59,724][118044] Updated weights for policy 0, policy_version 91820 (0.0006) [2023-03-07 05:22:00,517][118044] Updated weights for policy 0, policy_version 91830 (0.0006) [2023-03-07 05:22:01,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 94041088. Throughput: 0: 13139.3. Samples: 94020594. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 05:22:01,097][117718] Avg episode reward: [(0, '2939.333')] [2023-03-07 05:22:01,291][118044] Updated weights for policy 0, policy_version 91840 (0.0006) [2023-03-07 05:22:02,077][118044] Updated weights for policy 0, policy_version 91850 (0.0006) [2023-03-07 05:22:02,855][118044] Updated weights for policy 0, policy_version 91860 (0.0007) [2023-03-07 05:22:03,649][118044] Updated weights for policy 0, policy_version 91870 (0.0006) [2023-03-07 05:22:04,410][118044] Updated weights for policy 0, policy_version 91880 (0.0008) [2023-03-07 05:22:05,196][118044] Updated weights for policy 0, policy_version 91890 (0.0007) [2023-03-07 05:22:05,987][118044] Updated weights for policy 0, policy_version 91900 (0.0006) [2023-03-07 05:22:06,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 94106624. Throughput: 0: 13135.8. Samples: 94099024. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 05:22:06,086][117718] Avg episode reward: [(0, '2916.961')] [2023-03-07 05:22:06,763][118044] Updated weights for policy 0, policy_version 91910 (0.0006) [2023-03-07 05:22:07,540][118044] Updated weights for policy 0, policy_version 91920 (0.0006) [2023-03-07 05:22:08,320][118044] Updated weights for policy 0, policy_version 91930 (0.0006) [2023-03-07 05:22:09,084][118044] Updated weights for policy 0, policy_version 91940 (0.0006) [2023-03-07 05:22:09,876][118044] Updated weights for policy 0, policy_version 91950 (0.0007) [2023-03-07 05:22:10,643][118044] Updated weights for policy 0, policy_version 91960 (0.0006) [2023-03-07 05:22:11,085][117718] Fps is (10 sec: 13107.4, 60 sec: 13141.4, 300 sec: 13145.4). Total num frames: 94172160. Throughput: 0: 13135.6. Samples: 94138551. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 05:22:11,086][117718] Avg episode reward: [(0, '2891.796')] [2023-03-07 05:22:11,427][118044] Updated weights for policy 0, policy_version 91970 (0.0006) [2023-03-07 05:22:12,188][118044] Updated weights for policy 0, policy_version 91980 (0.0006) [2023-03-07 05:22:12,966][118044] Updated weights for policy 0, policy_version 91990 (0.0006) [2023-03-07 05:22:13,745][118044] Updated weights for policy 0, policy_version 92000 (0.0006) [2023-03-07 05:22:14,518][118044] Updated weights for policy 0, policy_version 92010 (0.0006) [2023-03-07 05:22:15,286][118044] Updated weights for policy 0, policy_version 92020 (0.0006) [2023-03-07 05:22:16,073][118044] Updated weights for policy 0, policy_version 92030 (0.0006) [2023-03-07 05:22:16,085][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 94238720. Throughput: 0: 13140.4. Samples: 94217944. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 05:22:16,086][117718] Avg episode reward: [(0, '2854.792')] [2023-03-07 05:22:16,831][118044] Updated weights for policy 0, policy_version 92040 (0.0006) [2023-03-07 05:22:17,604][118044] Updated weights for policy 0, policy_version 92050 (0.0006) [2023-03-07 05:22:18,377][118044] Updated weights for policy 0, policy_version 92060 (0.0006) [2023-03-07 05:22:19,178][118044] Updated weights for policy 0, policy_version 92070 (0.0006) [2023-03-07 05:22:19,953][118044] Updated weights for policy 0, policy_version 92080 (0.0006) [2023-03-07 05:22:20,721][118044] Updated weights for policy 0, policy_version 92090 (0.0006) [2023-03-07 05:22:21,085][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 94304256. Throughput: 0: 13139.0. Samples: 94297091. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 05:22:21,086][117718] Avg episode reward: [(0, '2812.150')] [2023-03-07 05:22:21,503][118044] Updated weights for policy 0, policy_version 92100 (0.0007) [2023-03-07 05:22:22,277][118044] Updated weights for policy 0, policy_version 92110 (0.0005) [2023-03-07 05:22:23,078][118044] Updated weights for policy 0, policy_version 92120 (0.0006) [2023-03-07 05:22:23,855][118044] Updated weights for policy 0, policy_version 92130 (0.0006) [2023-03-07 05:22:24,624][118044] Updated weights for policy 0, policy_version 92140 (0.0005) [2023-03-07 05:22:25,389][118044] Updated weights for policy 0, policy_version 92150 (0.0006) [2023-03-07 05:22:26,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 94369792. Throughput: 0: 13136.4. Samples: 94336406. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 05:22:26,086][117718] Avg episode reward: [(0, '2928.726')] [2023-03-07 05:22:26,173][118044] Updated weights for policy 0, policy_version 92160 (0.0006) [2023-03-07 05:22:26,948][118044] Updated weights for policy 0, policy_version 92170 (0.0006) [2023-03-07 05:22:27,717][118044] Updated weights for policy 0, policy_version 92180 (0.0006) [2023-03-07 05:22:28,498][118044] Updated weights for policy 0, policy_version 92190 (0.0005) [2023-03-07 05:22:29,286][118044] Updated weights for policy 0, policy_version 92200 (0.0006) [2023-03-07 05:22:30,069][118044] Updated weights for policy 0, policy_version 92210 (0.0006) [2023-03-07 05:22:30,841][118044] Updated weights for policy 0, policy_version 92220 (0.0006) [2023-03-07 05:22:31,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 94436352. Throughput: 0: 13144.9. Samples: 94415510. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 05:22:31,086][117718] Avg episode reward: [(0, '2873.724')] [2023-03-07 05:22:31,628][118044] Updated weights for policy 0, policy_version 92230 (0.0007) [2023-03-07 05:22:32,417][118044] Updated weights for policy 0, policy_version 92240 (0.0006) [2023-03-07 05:22:33,193][118044] Updated weights for policy 0, policy_version 92250 (0.0006) [2023-03-07 05:22:33,977][118044] Updated weights for policy 0, policy_version 92260 (0.0006) [2023-03-07 05:22:34,753][118044] Updated weights for policy 0, policy_version 92270 (0.0007) [2023-03-07 05:22:35,514][118044] Updated weights for policy 0, policy_version 92280 (0.0006) [2023-03-07 05:22:36,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 94501888. Throughput: 0: 13143.4. Samples: 94494242. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 05:22:36,086][117718] Avg episode reward: [(0, '2872.316')] [2023-03-07 05:22:36,091][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000092287_94501888.pth... 
[2023-03-07 05:22:36,120][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000089205_91345920.pth [2023-03-07 05:22:36,276][118044] Updated weights for policy 0, policy_version 92290 (0.0007) [2023-03-07 05:22:37,078][118044] Updated weights for policy 0, policy_version 92300 (0.0006) [2023-03-07 05:22:37,860][118044] Updated weights for policy 0, policy_version 92310 (0.0006) [2023-03-07 05:22:38,639][118044] Updated weights for policy 0, policy_version 92320 (0.0006) [2023-03-07 05:22:39,414][118044] Updated weights for policy 0, policy_version 92330 (0.0006) [2023-03-07 05:22:40,202][118044] Updated weights for policy 0, policy_version 92340 (0.0006) [2023-03-07 05:22:40,961][118044] Updated weights for policy 0, policy_version 92350 (0.0006) [2023-03-07 05:22:41,085][117718] Fps is (10 sec: 13107.4, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 94567424. Throughput: 0: 13150.2. Samples: 94533757. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 05:22:41,086][117718] Avg episode reward: [(0, '2841.096')] [2023-03-07 05:22:41,738][118044] Updated weights for policy 0, policy_version 92360 (0.0006) [2023-03-07 05:22:42,517][118044] Updated weights for policy 0, policy_version 92370 (0.0006) [2023-03-07 05:22:43,303][118044] Updated weights for policy 0, policy_version 92380 (0.0006) [2023-03-07 05:22:44,092][118044] Updated weights for policy 0, policy_version 92390 (0.0006) [2023-03-07 05:22:44,877][118044] Updated weights for policy 0, policy_version 92400 (0.0006) [2023-03-07 05:22:45,646][118044] Updated weights for policy 0, policy_version 92410 (0.0006) [2023-03-07 05:22:46,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.4, 300 sec: 13148.9). Total num frames: 94632960. Throughput: 0: 13157.7. Samples: 94612691. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:22:46,086][117718] Avg episode reward: [(0, '2882.979')] [2023-03-07 05:22:46,421][118044] Updated weights for policy 0, policy_version 92420 (0.0006) [2023-03-07 05:22:47,205][118044] Updated weights for policy 0, policy_version 92430 (0.0007) [2023-03-07 05:22:47,990][118044] Updated weights for policy 0, policy_version 92440 (0.0006) [2023-03-07 05:22:48,764][118044] Updated weights for policy 0, policy_version 92450 (0.0006) [2023-03-07 05:22:49,561][118044] Updated weights for policy 0, policy_version 92460 (0.0006) [2023-03-07 05:22:50,346][118044] Updated weights for policy 0, policy_version 92470 (0.0006) [2023-03-07 05:22:51,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 94698496. Throughput: 0: 13157.8. Samples: 94691125. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:22:51,086][117718] Avg episode reward: [(0, '2839.478')] [2023-03-07 05:22:51,114][118044] Updated weights for policy 0, policy_version 92480 (0.0006) [2023-03-07 05:22:51,887][118044] Updated weights for policy 0, policy_version 92490 (0.0006) [2023-03-07 05:22:52,667][118044] Updated weights for policy 0, policy_version 92500 (0.0006) [2023-03-07 05:22:53,440][118044] Updated weights for policy 0, policy_version 92510 (0.0006) [2023-03-07 05:22:54,224][118044] Updated weights for policy 0, policy_version 92520 (0.0005) [2023-03-07 05:22:55,005][118044] Updated weights for policy 0, policy_version 92530 (0.0006) [2023-03-07 05:22:55,775][118044] Updated weights for policy 0, policy_version 92540 (0.0007) [2023-03-07 05:22:56,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 94764032. Throughput: 0: 13158.8. Samples: 94730698. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:22:56,086][117718] Avg episode reward: [(0, '2801.457')] [2023-03-07 05:22:56,567][118044] Updated weights for policy 0, policy_version 92550 (0.0006) [2023-03-07 05:22:57,338][118044] Updated weights for policy 0, policy_version 92560 (0.0007) [2023-03-07 05:22:58,101][118044] Updated weights for policy 0, policy_version 92570 (0.0006) [2023-03-07 05:22:58,886][118044] Updated weights for policy 0, policy_version 92580 (0.0008) [2023-03-07 05:22:59,657][118044] Updated weights for policy 0, policy_version 92590 (0.0007) [2023-03-07 05:23:00,438][118044] Updated weights for policy 0, policy_version 92600 (0.0005) [2023-03-07 05:23:01,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 94830592. Throughput: 0: 13158.9. Samples: 94810093. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:23:01,086][117718] Avg episode reward: [(0, '2824.975')] [2023-03-07 05:23:01,223][118044] Updated weights for policy 0, policy_version 92610 (0.0006) [2023-03-07 05:23:01,982][118044] Updated weights for policy 0, policy_version 92620 (0.0005) [2023-03-07 05:23:02,774][118044] Updated weights for policy 0, policy_version 92630 (0.0005) [2023-03-07 05:23:03,542][118044] Updated weights for policy 0, policy_version 92640 (0.0006) [2023-03-07 05:23:04,321][118044] Updated weights for policy 0, policy_version 92650 (0.0006) [2023-03-07 05:23:05,121][118044] Updated weights for policy 0, policy_version 92660 (0.0006) [2023-03-07 05:23:05,901][118044] Updated weights for policy 0, policy_version 92670 (0.0006) [2023-03-07 05:23:06,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 94896128. Throughput: 0: 13149.2. Samples: 94888805. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:23:06,086][117718] Avg episode reward: [(0, '2830.127')] [2023-03-07 05:23:06,681][118044] Updated weights for policy 0, policy_version 92680 (0.0006) [2023-03-07 05:23:07,453][118044] Updated weights for policy 0, policy_version 92690 (0.0005) [2023-03-07 05:23:08,233][118044] Updated weights for policy 0, policy_version 92700 (0.0006) [2023-03-07 05:23:09,026][118044] Updated weights for policy 0, policy_version 92710 (0.0006) [2023-03-07 05:23:09,792][118044] Updated weights for policy 0, policy_version 92720 (0.0006) [2023-03-07 05:23:10,577][118044] Updated weights for policy 0, policy_version 92730 (0.0006) [2023-03-07 05:23:11,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 94961664. Throughput: 0: 13152.1. Samples: 94928252. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:23:11,086][117718] Avg episode reward: [(0, '2858.303')] [2023-03-07 05:23:11,352][118044] Updated weights for policy 0, policy_version 92740 (0.0007) [2023-03-07 05:23:12,144][118044] Updated weights for policy 0, policy_version 92750 (0.0006) [2023-03-07 05:23:12,919][118044] Updated weights for policy 0, policy_version 92760 (0.0007) [2023-03-07 05:23:13,709][118044] Updated weights for policy 0, policy_version 92770 (0.0006) [2023-03-07 05:23:14,505][118044] Updated weights for policy 0, policy_version 92780 (0.0006) [2023-03-07 05:23:15,272][118044] Updated weights for policy 0, policy_version 92790 (0.0006) [2023-03-07 05:23:16,033][118044] Updated weights for policy 0, policy_version 92800 (0.0006) [2023-03-07 05:23:16,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 95027200. Throughput: 0: 13137.8. Samples: 95006709. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:23:16,086][117718] Avg episode reward: [(0, '2859.662')] [2023-03-07 05:23:16,825][118044] Updated weights for policy 0, policy_version 92810 (0.0007) [2023-03-07 05:23:17,602][118044] Updated weights for policy 0, policy_version 92820 (0.0006) [2023-03-07 05:23:18,381][118044] Updated weights for policy 0, policy_version 92830 (0.0006) [2023-03-07 05:23:19,149][118044] Updated weights for policy 0, policy_version 92840 (0.0006) [2023-03-07 05:23:19,934][118044] Updated weights for policy 0, policy_version 92850 (0.0006) [2023-03-07 05:23:20,716][118044] Updated weights for policy 0, policy_version 92860 (0.0006) [2023-03-07 05:23:21,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 95092736. Throughput: 0: 13141.7. Samples: 95085619. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:23:21,086][117718] Avg episode reward: [(0, '2883.879')] [2023-03-07 05:23:21,510][118044] Updated weights for policy 0, policy_version 92870 (0.0006) [2023-03-07 05:23:22,274][118044] Updated weights for policy 0, policy_version 92880 (0.0006) [2023-03-07 05:23:23,033][118044] Updated weights for policy 0, policy_version 92890 (0.0006) [2023-03-07 05:23:23,834][118044] Updated weights for policy 0, policy_version 92900 (0.0006) [2023-03-07 05:23:24,609][118044] Updated weights for policy 0, policy_version 92910 (0.0006) [2023-03-07 05:23:25,382][118044] Updated weights for policy 0, policy_version 92920 (0.0006) [2023-03-07 05:23:26,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 95158272. Throughput: 0: 13138.4. Samples: 95124986. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:23:26,086][117718] Avg episode reward: [(0, '2849.505')] [2023-03-07 05:23:26,173][118044] Updated weights for policy 0, policy_version 92930 (0.0006) [2023-03-07 05:23:26,954][118044] Updated weights for policy 0, policy_version 92940 (0.0006) [2023-03-07 05:23:27,743][118044] Updated weights for policy 0, policy_version 92950 (0.0006) [2023-03-07 05:23:28,527][118044] Updated weights for policy 0, policy_version 92960 (0.0005) [2023-03-07 05:23:29,319][118044] Updated weights for policy 0, policy_version 92970 (0.0007) [2023-03-07 05:23:30,120][118044] Updated weights for policy 0, policy_version 92980 (0.0006) [2023-03-07 05:23:30,892][118044] Updated weights for policy 0, policy_version 92990 (0.0006) [2023-03-07 05:23:31,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13145.4). Total num frames: 95223808. Throughput: 0: 13130.1. Samples: 95203544. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:23:31,086][117718] Avg episode reward: [(0, '2862.780')] [2023-03-07 05:23:31,692][118044] Updated weights for policy 0, policy_version 93000 (0.0006) [2023-03-07 05:23:32,477][118044] Updated weights for policy 0, policy_version 93010 (0.0005) [2023-03-07 05:23:33,234][118044] Updated weights for policy 0, policy_version 93020 (0.0006) [2023-03-07 05:23:34,034][118044] Updated weights for policy 0, policy_version 93030 (0.0006) [2023-03-07 05:23:34,820][118044] Updated weights for policy 0, policy_version 93040 (0.0006) [2023-03-07 05:23:35,597][118044] Updated weights for policy 0, policy_version 93050 (0.0006) [2023-03-07 05:23:36,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13124.3, 300 sec: 13141.9). Total num frames: 95289344. Throughput: 0: 13128.4. Samples: 95281904. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:23:36,086][117718] Avg episode reward: [(0, '2901.177')] [2023-03-07 05:23:36,378][118044] Updated weights for policy 0, policy_version 93060 (0.0007) [2023-03-07 05:23:37,165][118044] Updated weights for policy 0, policy_version 93070 (0.0007) [2023-03-07 05:23:37,953][118044] Updated weights for policy 0, policy_version 93080 (0.0006) [2023-03-07 05:23:38,742][118044] Updated weights for policy 0, policy_version 93090 (0.0006) [2023-03-07 05:23:39,513][118044] Updated weights for policy 0, policy_version 93100 (0.0007) [2023-03-07 05:23:40,296][118044] Updated weights for policy 0, policy_version 93110 (0.0006) [2023-03-07 05:23:41,068][118044] Updated weights for policy 0, policy_version 93120 (0.0006) [2023-03-07 05:23:41,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13145.4). Total num frames: 95354880. Throughput: 0: 13118.9. Samples: 95321050. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:23:41,086][117718] Avg episode reward: [(0, '2822.767')] [2023-03-07 05:23:41,857][118044] Updated weights for policy 0, policy_version 93130 (0.0006) [2023-03-07 05:23:42,651][118044] Updated weights for policy 0, policy_version 93140 (0.0006) [2023-03-07 05:23:43,423][118044] Updated weights for policy 0, policy_version 93150 (0.0006) [2023-03-07 05:23:44,205][118044] Updated weights for policy 0, policy_version 93160 (0.0006) [2023-03-07 05:23:44,994][118044] Updated weights for policy 0, policy_version 93170 (0.0006) [2023-03-07 05:23:45,778][118044] Updated weights for policy 0, policy_version 93180 (0.0007) [2023-03-07 05:23:46,085][117718] Fps is (10 sec: 13005.0, 60 sec: 13107.2, 300 sec: 13138.4). Total num frames: 95419392. Throughput: 0: 13099.6. Samples: 95399576. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:23:46,086][117718] Avg episode reward: [(0, '2852.969')] [2023-03-07 05:23:46,566][118044] Updated weights for policy 0, policy_version 93190 (0.0006) [2023-03-07 05:23:47,343][118044] Updated weights for policy 0, policy_version 93200 (0.0006) [2023-03-07 05:23:48,135][118044] Updated weights for policy 0, policy_version 93210 (0.0006) [2023-03-07 05:23:48,898][118044] Updated weights for policy 0, policy_version 93220 (0.0006) [2023-03-07 05:23:49,682][118044] Updated weights for policy 0, policy_version 93230 (0.0005) [2023-03-07 05:23:50,464][118044] Updated weights for policy 0, policy_version 93240 (0.0006) [2023-03-07 05:23:51,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13141.9). Total num frames: 95485952. Throughput: 0: 13095.1. Samples: 95478086. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:23:51,086][117718] Avg episode reward: [(0, '2864.976')] [2023-03-07 05:23:51,238][118044] Updated weights for policy 0, policy_version 93250 (0.0006) [2023-03-07 05:23:52,021][118044] Updated weights for policy 0, policy_version 93260 (0.0006) [2023-03-07 05:23:52,821][118044] Updated weights for policy 0, policy_version 93270 (0.0007) [2023-03-07 05:23:53,610][118044] Updated weights for policy 0, policy_version 93280 (0.0006) [2023-03-07 05:23:54,394][118044] Updated weights for policy 0, policy_version 93290 (0.0007) [2023-03-07 05:23:55,171][118044] Updated weights for policy 0, policy_version 93300 (0.0006) [2023-03-07 05:23:55,958][118044] Updated weights for policy 0, policy_version 93310 (0.0006) [2023-03-07 05:23:56,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13138.4). Total num frames: 95550464. Throughput: 0: 13086.9. Samples: 95517161. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:23:56,086][117718] Avg episode reward: [(0, '2896.164')] [2023-03-07 05:23:56,765][118044] Updated weights for policy 0, policy_version 93320 (0.0006) [2023-03-07 05:23:57,546][118044] Updated weights for policy 0, policy_version 93330 (0.0006) [2023-03-07 05:23:58,315][118044] Updated weights for policy 0, policy_version 93340 (0.0006) [2023-03-07 05:23:59,094][118044] Updated weights for policy 0, policy_version 93350 (0.0006) [2023-03-07 05:23:59,878][118044] Updated weights for policy 0, policy_version 93360 (0.0006) [2023-03-07 05:24:00,648][118044] Updated weights for policy 0, policy_version 93370 (0.0006) [2023-03-07 05:24:01,085][117718] Fps is (10 sec: 13004.8, 60 sec: 13090.1, 300 sec: 13135.0). Total num frames: 95616000. Throughput: 0: 13085.5. Samples: 95595555. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:24:01,086][117718] Avg episode reward: [(0, '2827.087')] [2023-03-07 05:24:01,436][118044] Updated weights for policy 0, policy_version 93380 (0.0006) [2023-03-07 05:24:02,202][118044] Updated weights for policy 0, policy_version 93390 (0.0006) [2023-03-07 05:24:02,972][118044] Updated weights for policy 0, policy_version 93400 (0.0005) [2023-03-07 05:24:03,765][118044] Updated weights for policy 0, policy_version 93410 (0.0006) [2023-03-07 05:24:04,542][118044] Updated weights for policy 0, policy_version 93420 (0.0006) [2023-03-07 05:24:05,335][118044] Updated weights for policy 0, policy_version 93430 (0.0006) [2023-03-07 05:24:06,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13090.1, 300 sec: 13135.0). Total num frames: 95681536. Throughput: 0: 13085.7. Samples: 95674476. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:24:06,086][117718] Avg episode reward: [(0, '2882.390')] [2023-03-07 05:24:06,106][118044] Updated weights for policy 0, policy_version 93440 (0.0006) [2023-03-07 05:24:06,893][118044] Updated weights for policy 0, policy_version 93450 (0.0006) [2023-03-07 05:24:07,661][118044] Updated weights for policy 0, policy_version 93460 (0.0006) [2023-03-07 05:24:08,446][118044] Updated weights for policy 0, policy_version 93470 (0.0006) [2023-03-07 05:24:09,218][118044] Updated weights for policy 0, policy_version 93480 (0.0007) [2023-03-07 05:24:10,001][118044] Updated weights for policy 0, policy_version 93490 (0.0006) [2023-03-07 05:24:10,769][118044] Updated weights for policy 0, policy_version 93500 (0.0006) [2023-03-07 05:24:11,085][117718] Fps is (10 sec: 13209.5, 60 sec: 13107.2, 300 sec: 13135.0). Total num frames: 95748096. Throughput: 0: 13086.1. Samples: 95713862. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:24:11,086][117718] Avg episode reward: [(0, '2854.363')] [2023-03-07 05:24:11,553][118044] Updated weights for policy 0, policy_version 93510 (0.0006) [2023-03-07 05:24:12,335][118044] Updated weights for policy 0, policy_version 93520 (0.0006) [2023-03-07 05:24:13,098][118044] Updated weights for policy 0, policy_version 93530 (0.0006) [2023-03-07 05:24:13,901][118044] Updated weights for policy 0, policy_version 93540 (0.0006) [2023-03-07 05:24:14,644][118044] Updated weights for policy 0, policy_version 93550 (0.0007) [2023-03-07 05:24:15,431][118044] Updated weights for policy 0, policy_version 93560 (0.0006) [2023-03-07 05:24:16,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13107.2, 300 sec: 13135.0). Total num frames: 95813632. Throughput: 0: 13100.8. Samples: 95793078. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:24:16,097][117718] Avg episode reward: [(0, '2863.775')] [2023-03-07 05:24:16,203][118044] Updated weights for policy 0, policy_version 93570 (0.0006) [2023-03-07 05:24:16,973][118044] Updated weights for policy 0, policy_version 93580 (0.0006) [2023-03-07 05:24:17,746][118044] Updated weights for policy 0, policy_version 93590 (0.0006) [2023-03-07 05:24:18,518][118044] Updated weights for policy 0, policy_version 93600 (0.0007) [2023-03-07 05:24:19,310][118044] Updated weights for policy 0, policy_version 93610 (0.0006) [2023-03-07 05:24:20,082][118044] Updated weights for policy 0, policy_version 93620 (0.0006) [2023-03-07 05:24:20,862][118044] Updated weights for policy 0, policy_version 93630 (0.0006) [2023-03-07 05:24:21,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13135.0). Total num frames: 95879168. Throughput: 0: 13113.3. Samples: 95872002. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:24:21,097][117718] Avg episode reward: [(0, '2901.104')] [2023-03-07 05:24:21,635][118044] Updated weights for policy 0, policy_version 93640 (0.0005) [2023-03-07 05:24:22,426][118044] Updated weights for policy 0, policy_version 93650 (0.0005) [2023-03-07 05:24:23,198][118044] Updated weights for policy 0, policy_version 93660 (0.0006) [2023-03-07 05:24:23,982][118044] Updated weights for policy 0, policy_version 93670 (0.0006) [2023-03-07 05:24:24,764][118044] Updated weights for policy 0, policy_version 93680 (0.0006) [2023-03-07 05:24:25,551][118044] Updated weights for policy 0, policy_version 93690 (0.0006) [2023-03-07 05:24:26,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13124.3, 300 sec: 13138.4). Total num frames: 95945728. Throughput: 0: 13123.6. Samples: 95911610. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:24:26,096][117718] Avg episode reward: [(0, '2909.213')] [2023-03-07 05:24:26,331][118044] Updated weights for policy 0, policy_version 93700 (0.0006) [2023-03-07 05:24:27,089][118044] Updated weights for policy 0, policy_version 93710 (0.0007) [2023-03-07 05:24:27,859][118044] Updated weights for policy 0, policy_version 93720 (0.0006) [2023-03-07 05:24:28,637][118044] Updated weights for policy 0, policy_version 93730 (0.0006) [2023-03-07 05:24:29,404][118044] Updated weights for policy 0, policy_version 93740 (0.0006) [2023-03-07 05:24:30,193][118044] Updated weights for policy 0, policy_version 93750 (0.0006) [2023-03-07 05:24:30,962][118044] Updated weights for policy 0, policy_version 93760 (0.0006) [2023-03-07 05:24:31,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13124.3, 300 sec: 13138.4). Total num frames: 96011264. Throughput: 0: 13138.0. Samples: 95990784. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:24:31,096][117718] Avg episode reward: [(0, '2909.121')] [2023-03-07 05:24:31,745][118044] Updated weights for policy 0, policy_version 93770 (0.0006) [2023-03-07 05:24:32,522][118044] Updated weights for policy 0, policy_version 93780 (0.0006) [2023-03-07 05:24:33,310][118044] Updated weights for policy 0, policy_version 93790 (0.0006) [2023-03-07 05:24:34,093][118044] Updated weights for policy 0, policy_version 93800 (0.0006) [2023-03-07 05:24:34,855][118044] Updated weights for policy 0, policy_version 93810 (0.0006) [2023-03-07 05:24:35,612][118044] Updated weights for policy 0, policy_version 93820 (0.0005) [2023-03-07 05:24:36,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13138.4). Total num frames: 96076800. Throughput: 0: 13150.2. Samples: 96069845. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:24:36,086][117718] Avg episode reward: [(0, '2956.335')] [2023-03-07 05:24:36,096][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000093826_96077824.pth... [2023-03-07 05:24:36,126][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000090746_92923904.pth [2023-03-07 05:24:36,415][118044] Updated weights for policy 0, policy_version 93830 (0.0006) [2023-03-07 05:24:37,187][118044] Updated weights for policy 0, policy_version 93840 (0.0006) [2023-03-07 05:24:37,970][118044] Updated weights for policy 0, policy_version 93850 (0.0006) [2023-03-07 05:24:38,739][118044] Updated weights for policy 0, policy_version 93860 (0.0006) [2023-03-07 05:24:39,522][118044] Updated weights for policy 0, policy_version 93870 (0.0006) [2023-03-07 05:24:40,308][118044] Updated weights for policy 0, policy_version 93880 (0.0006) [2023-03-07 05:24:41,078][118044] Updated weights for policy 0, policy_version 93890 (0.0006) [2023-03-07 05:24:41,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 96143360. Throughput: 0: 13156.3. Samples: 96109196. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:24:41,086][117718] Avg episode reward: [(0, '2990.717')] [2023-03-07 05:24:41,885][118044] Updated weights for policy 0, policy_version 93900 (0.0007) [2023-03-07 05:24:42,642][118044] Updated weights for policy 0, policy_version 93910 (0.0006) [2023-03-07 05:24:43,433][118044] Updated weights for policy 0, policy_version 93920 (0.0006) [2023-03-07 05:24:44,221][118044] Updated weights for policy 0, policy_version 93930 (0.0006) [2023-03-07 05:24:45,008][118044] Updated weights for policy 0, policy_version 93940 (0.0006) [2023-03-07 05:24:45,797][118044] Updated weights for policy 0, policy_version 93950 (0.0006) [2023-03-07 05:24:46,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13138.4). Total num frames: 96207872. Throughput: 0: 13163.5. Samples: 96187915. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:24:46,086][117718] Avg episode reward: [(0, '2918.297')] [2023-03-07 05:24:46,569][118044] Updated weights for policy 0, policy_version 93960 (0.0007) [2023-03-07 05:24:47,335][118044] Updated weights for policy 0, policy_version 93970 (0.0006) [2023-03-07 05:24:48,126][118044] Updated weights for policy 0, policy_version 93980 (0.0006) [2023-03-07 05:24:48,925][118044] Updated weights for policy 0, policy_version 93990 (0.0006) [2023-03-07 05:24:49,710][118044] Updated weights for policy 0, policy_version 94000 (0.0006) [2023-03-07 05:24:50,480][118044] Updated weights for policy 0, policy_version 94010 (0.0006) [2023-03-07 05:24:51,085][117718] Fps is (10 sec: 13004.8, 60 sec: 13124.3, 300 sec: 13138.4). Total num frames: 96273408. Throughput: 0: 13152.9. Samples: 96266356. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:24:51,086][117718] Avg episode reward: [(0, '2946.404')] [2023-03-07 05:24:51,247][118044] Updated weights for policy 0, policy_version 94020 (0.0006) [2023-03-07 05:24:52,038][118044] Updated weights for policy 0, policy_version 94030 (0.0006) [2023-03-07 05:24:52,812][118044] Updated weights for policy 0, policy_version 94040 (0.0006) [2023-03-07 05:24:53,601][118044] Updated weights for policy 0, policy_version 94050 (0.0007) [2023-03-07 05:24:54,393][118044] Updated weights for policy 0, policy_version 94060 (0.0006) [2023-03-07 05:24:55,171][118044] Updated weights for policy 0, policy_version 94070 (0.0006) [2023-03-07 05:24:55,962][118044] Updated weights for policy 0, policy_version 94080 (0.0007) [2023-03-07 05:24:56,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 96338944. Throughput: 0: 13149.0. Samples: 96305565. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:24:56,086][117718] Avg episode reward: [(0, '2957.005')] [2023-03-07 05:24:56,747][118044] Updated weights for policy 0, policy_version 94090 (0.0006) [2023-03-07 05:24:57,527][118044] Updated weights for policy 0, policy_version 94100 (0.0007) [2023-03-07 05:24:58,311][118044] Updated weights for policy 0, policy_version 94110 (0.0006) [2023-03-07 05:24:59,086][118044] Updated weights for policy 0, policy_version 94120 (0.0005) [2023-03-07 05:24:59,865][118044] Updated weights for policy 0, policy_version 94130 (0.0006) [2023-03-07 05:25:00,634][118044] Updated weights for policy 0, policy_version 94140 (0.0006) [2023-03-07 05:25:01,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 96404480. Throughput: 0: 13134.7. Samples: 96384137. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:25:01,086][117718] Avg episode reward: [(0, '2912.271')] [2023-03-07 05:25:01,414][118044] Updated weights for policy 0, policy_version 94150 (0.0006) [2023-03-07 05:25:02,207][118044] Updated weights for policy 0, policy_version 94160 (0.0005) [2023-03-07 05:25:02,994][118044] Updated weights for policy 0, policy_version 94170 (0.0007) [2023-03-07 05:25:03,299][117993] KL-divergence is very high: 115.6019 [2023-03-07 05:25:03,768][118044] Updated weights for policy 0, policy_version 94180 (0.0006) [2023-03-07 05:25:04,553][118044] Updated weights for policy 0, policy_version 94190 (0.0005) [2023-03-07 05:25:05,328][118044] Updated weights for policy 0, policy_version 94200 (0.0006) [2023-03-07 05:25:06,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 96470016. Throughput: 0: 13132.5. Samples: 96462965. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:25:06,086][117718] Avg episode reward: [(0, '2926.133')] [2023-03-07 05:25:06,115][118044] Updated weights for policy 0, policy_version 94210 (0.0006) [2023-03-07 05:25:06,886][118044] Updated weights for policy 0, policy_version 94220 (0.0006) [2023-03-07 05:25:07,681][118044] Updated weights for policy 0, policy_version 94230 (0.0006) [2023-03-07 05:25:08,455][118044] Updated weights for policy 0, policy_version 94240 (0.0006) [2023-03-07 05:25:09,219][118044] Updated weights for policy 0, policy_version 94250 (0.0006) [2023-03-07 05:25:09,997][118044] Updated weights for policy 0, policy_version 94260 (0.0006) [2023-03-07 05:25:10,777][118044] Updated weights for policy 0, policy_version 94270 (0.0006) [2023-03-07 05:25:11,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13138.4). Total num frames: 96535552. Throughput: 0: 13124.5. Samples: 96502214. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:25:11,086][117718] Avg episode reward: [(0, '2857.347')] [2023-03-07 05:25:11,550][118044] Updated weights for policy 0, policy_version 94280 (0.0006) [2023-03-07 05:25:12,333][118044] Updated weights for policy 0, policy_version 94290 (0.0006) [2023-03-07 05:25:13,120][118044] Updated weights for policy 0, policy_version 94300 (0.0006) [2023-03-07 05:25:13,896][118044] Updated weights for policy 0, policy_version 94310 (0.0007) [2023-03-07 05:25:14,674][118044] Updated weights for policy 0, policy_version 94320 (0.0006) [2023-03-07 05:25:15,467][118044] Updated weights for policy 0, policy_version 94330 (0.0006) [2023-03-07 05:25:16,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13135.0). Total num frames: 96601088. Throughput: 0: 13120.2. Samples: 96581192. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:25:16,086][117718] Avg episode reward: [(0, '2843.937')] [2023-03-07 05:25:16,243][118044] Updated weights for policy 0, policy_version 94340 (0.0006) [2023-03-07 05:25:16,996][118044] Updated weights for policy 0, policy_version 94350 (0.0006) [2023-03-07 05:25:17,793][118044] Updated weights for policy 0, policy_version 94360 (0.0006) [2023-03-07 05:25:18,588][118044] Updated weights for policy 0, policy_version 94370 (0.0006) [2023-03-07 05:25:19,366][118044] Updated weights for policy 0, policy_version 94380 (0.0006) [2023-03-07 05:25:20,165][118044] Updated weights for policy 0, policy_version 94390 (0.0007) [2023-03-07 05:25:20,952][118044] Updated weights for policy 0, policy_version 94400 (0.0006) [2023-03-07 05:25:21,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13124.3, 300 sec: 13135.0). Total num frames: 96666624. Throughput: 0: 13106.6. Samples: 96659643. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:25:21,086][117718] Avg episode reward: [(0, '2859.781')] [2023-03-07 05:25:21,705][118044] Updated weights for policy 0, policy_version 94410 (0.0006) [2023-03-07 05:25:22,485][118044] Updated weights for policy 0, policy_version 94420 (0.0006) [2023-03-07 05:25:23,257][118044] Updated weights for policy 0, policy_version 94430 (0.0006) [2023-03-07 05:25:24,033][118044] Updated weights for policy 0, policy_version 94440 (0.0006) [2023-03-07 05:25:24,829][118044] Updated weights for policy 0, policy_version 94450 (0.0006) [2023-03-07 05:25:25,601][118044] Updated weights for policy 0, policy_version 94460 (0.0006) [2023-03-07 05:25:26,085][117718] Fps is (10 sec: 13209.5, 60 sec: 13124.3, 300 sec: 13138.4). Total num frames: 96733184. Throughput: 0: 13114.4. Samples: 96699346. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:25:26,086][117718] Avg episode reward: [(0, '2769.677')] [2023-03-07 05:25:26,374][118044] Updated weights for policy 0, policy_version 94470 (0.0006) [2023-03-07 05:25:27,162][118044] Updated weights for policy 0, policy_version 94480 (0.0007) [2023-03-07 05:25:27,959][118044] Updated weights for policy 0, policy_version 94490 (0.0006) [2023-03-07 05:25:28,725][118044] Updated weights for policy 0, policy_version 94500 (0.0006) [2023-03-07 05:25:29,509][118044] Updated weights for policy 0, policy_version 94510 (0.0006) [2023-03-07 05:25:30,274][118044] Updated weights for policy 0, policy_version 94520 (0.0006) [2023-03-07 05:25:31,045][118044] Updated weights for policy 0, policy_version 94530 (0.0007) [2023-03-07 05:25:31,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13124.3, 300 sec: 13135.0). Total num frames: 96798720. Throughput: 0: 13115.0. Samples: 96778089. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:25:31,086][117718] Avg episode reward: [(0, '2646.793')] [2023-03-07 05:25:31,121][117993] KL-divergence is very high: 930546.3750 [2023-03-07 05:25:31,838][118044] Updated weights for policy 0, policy_version 94540 (0.0006) [2023-03-07 05:25:32,600][118044] Updated weights for policy 0, policy_version 94550 (0.0006) [2023-03-07 05:25:33,390][118044] Updated weights for policy 0, policy_version 94560 (0.0006) [2023-03-07 05:25:34,157][118044] Updated weights for policy 0, policy_version 94570 (0.0006) [2023-03-07 05:25:34,927][118044] Updated weights for policy 0, policy_version 94580 (0.0007) [2023-03-07 05:25:35,717][118044] Updated weights for policy 0, policy_version 94590 (0.0006) [2023-03-07 05:25:36,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13135.0). Total num frames: 96864256. Throughput: 0: 13129.9. Samples: 96857204. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:25:36,086][117718] Avg episode reward: [(0, '2717.986')] [2023-03-07 05:25:36,488][118044] Updated weights for policy 0, policy_version 94600 (0.0005) [2023-03-07 05:25:37,266][118044] Updated weights for policy 0, policy_version 94610 (0.0006) [2023-03-07 05:25:38,053][118044] Updated weights for policy 0, policy_version 94620 (0.0007) [2023-03-07 05:25:38,825][118044] Updated weights for policy 0, policy_version 94630 (0.0006) [2023-03-07 05:25:39,598][118044] Updated weights for policy 0, policy_version 94640 (0.0006) [2023-03-07 05:25:40,377][118044] Updated weights for policy 0, policy_version 94650 (0.0006) [2023-03-07 05:25:41,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13124.2, 300 sec: 13138.4). Total num frames: 96930816. Throughput: 0: 13133.4. Samples: 96896568. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:25:41,086][117718] Avg episode reward: [(0, '2782.510')] [2023-03-07 05:25:41,155][118044] Updated weights for policy 0, policy_version 94660 (0.0006) [2023-03-07 05:25:41,925][118044] Updated weights for policy 0, policy_version 94670 (0.0006) [2023-03-07 05:25:42,714][118044] Updated weights for policy 0, policy_version 94680 (0.0006) [2023-03-07 05:25:43,489][118044] Updated weights for policy 0, policy_version 94690 (0.0006) [2023-03-07 05:25:44,261][118044] Updated weights for policy 0, policy_version 94700 (0.0006) [2023-03-07 05:25:45,051][118044] Updated weights for policy 0, policy_version 94710 (0.0006) [2023-03-07 05:25:45,814][118044] Updated weights for policy 0, policy_version 94720 (0.0007) [2023-03-07 05:25:46,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 96996352. Throughput: 0: 13149.5. Samples: 96975866. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:25:46,086][117718] Avg episode reward: [(0, '2682.738')] [2023-03-07 05:25:46,597][118044] Updated weights for policy 0, policy_version 94730 (0.0006) [2023-03-07 05:25:47,387][118044] Updated weights for policy 0, policy_version 94740 (0.0006) [2023-03-07 05:25:48,161][118044] Updated weights for policy 0, policy_version 94750 (0.0007) [2023-03-07 05:25:48,949][118044] Updated weights for policy 0, policy_version 94760 (0.0006) [2023-03-07 05:25:49,725][118044] Updated weights for policy 0, policy_version 94770 (0.0006) [2023-03-07 05:25:50,510][118044] Updated weights for policy 0, policy_version 94780 (0.0006) [2023-03-07 05:25:51,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 97061888. Throughput: 0: 13145.2. Samples: 97054497. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:25:51,086][117718] Avg episode reward: [(0, '2641.897')] [2023-03-07 05:25:51,266][118044] Updated weights for policy 0, policy_version 94790 (0.0006) [2023-03-07 05:25:52,068][118044] Updated weights for policy 0, policy_version 94800 (0.0006) [2023-03-07 05:25:52,842][118044] Updated weights for policy 0, policy_version 94810 (0.0006) [2023-03-07 05:25:53,618][118044] Updated weights for policy 0, policy_version 94820 (0.0006) [2023-03-07 05:25:54,425][118044] Updated weights for policy 0, policy_version 94830 (0.0006) [2023-03-07 05:25:55,202][118044] Updated weights for policy 0, policy_version 94840 (0.0006) [2023-03-07 05:25:55,978][118044] Updated weights for policy 0, policy_version 94850 (0.0006) [2023-03-07 05:25:56,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 97127424. Throughput: 0: 13149.1. Samples: 97093922. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:25:56,086][117718] Avg episode reward: [(0, '2553.116')] [2023-03-07 05:25:56,757][118044] Updated weights for policy 0, policy_version 94860 (0.0007) [2023-03-07 05:25:57,543][118044] Updated weights for policy 0, policy_version 94870 (0.0007) [2023-03-07 05:25:58,324][118044] Updated weights for policy 0, policy_version 94880 (0.0006) [2023-03-07 05:25:59,116][118044] Updated weights for policy 0, policy_version 94890 (0.0006) [2023-03-07 05:25:59,892][118044] Updated weights for policy 0, policy_version 94900 (0.0006) [2023-03-07 05:26:00,666][118044] Updated weights for policy 0, policy_version 94910 (0.0006) [2023-03-07 05:26:01,085][117718] Fps is (10 sec: 13107.4, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 97192960. Throughput: 0: 13135.6. Samples: 97172293. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:26:01,086][117718] Avg episode reward: [(0, '2710.646')] [2023-03-07 05:26:01,448][118044] Updated weights for policy 0, policy_version 94920 (0.0006) [2023-03-07 05:26:02,223][118044] Updated weights for policy 0, policy_version 94930 (0.0006) [2023-03-07 05:26:02,982][118044] Updated weights for policy 0, policy_version 94940 (0.0006) [2023-03-07 05:26:03,759][118044] Updated weights for policy 0, policy_version 94950 (0.0006) [2023-03-07 05:26:04,545][118044] Updated weights for policy 0, policy_version 94960 (0.0006) [2023-03-07 05:26:05,332][118044] Updated weights for policy 0, policy_version 94970 (0.0007) [2023-03-07 05:26:06,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.4, 300 sec: 13135.0). Total num frames: 97258496. Throughput: 0: 13147.7. Samples: 97251287. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:26:06,086][117718] Avg episode reward: [(0, '2719.024')] [2023-03-07 05:26:06,128][118044] Updated weights for policy 0, policy_version 94980 (0.0007) [2023-03-07 05:26:06,912][118044] Updated weights for policy 0, policy_version 94990 (0.0006) [2023-03-07 05:26:07,714][118044] Updated weights for policy 0, policy_version 95000 (0.0006) [2023-03-07 05:26:08,482][118044] Updated weights for policy 0, policy_version 95010 (0.0006) [2023-03-07 05:26:08,714][117993] KL-divergence is very high: 111.4439 [2023-03-07 05:26:09,278][118044] Updated weights for policy 0, policy_version 95020 (0.0006) [2023-03-07 05:26:10,065][118044] Updated weights for policy 0, policy_version 95030 (0.0006) [2023-03-07 05:26:10,846][118044] Updated weights for policy 0, policy_version 95040 (0.0007) [2023-03-07 05:26:11,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 97324032. Throughput: 0: 13132.9. Samples: 97290327. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:26:11,086][117718] Avg episode reward: [(0, '2586.247')] [2023-03-07 05:26:11,623][118044] Updated weights for policy 0, policy_version 95050 (0.0008) [2023-03-07 05:26:12,393][118044] Updated weights for policy 0, policy_version 95060 (0.0006) [2023-03-07 05:26:13,161][118044] Updated weights for policy 0, policy_version 95070 (0.0006) [2023-03-07 05:26:13,950][118044] Updated weights for policy 0, policy_version 95080 (0.0006) [2023-03-07 05:26:14,713][118044] Updated weights for policy 0, policy_version 95090 (0.0006) [2023-03-07 05:26:15,488][118044] Updated weights for policy 0, policy_version 95100 (0.0006) [2023-03-07 05:26:16,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 97389568. Throughput: 0: 13132.5. Samples: 97369052. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:26:16,086][117718] Avg episode reward: [(0, '2658.268')] [2023-03-07 05:26:16,276][118044] Updated weights for policy 0, policy_version 95110 (0.0006) [2023-03-07 05:26:17,049][118044] Updated weights for policy 0, policy_version 95120 (0.0006) [2023-03-07 05:26:17,813][118044] Updated weights for policy 0, policy_version 95130 (0.0006) [2023-03-07 05:26:18,605][118044] Updated weights for policy 0, policy_version 95140 (0.0005) [2023-03-07 05:26:19,393][118044] Updated weights for policy 0, policy_version 95150 (0.0006) [2023-03-07 05:26:20,160][118044] Updated weights for policy 0, policy_version 95160 (0.0006) [2023-03-07 05:26:20,933][118044] Updated weights for policy 0, policy_version 95170 (0.0006) [2023-03-07 05:26:21,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13131.5). Total num frames: 97455104. Throughput: 0: 13133.4. Samples: 97448208. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:26:21,086][117718] Avg episode reward: [(0, '2733.357')] [2023-03-07 05:26:21,716][118044] Updated weights for policy 0, policy_version 95180 (0.0006) [2023-03-07 05:26:22,481][118044] Updated weights for policy 0, policy_version 95190 (0.0006) [2023-03-07 05:26:23,253][118044] Updated weights for policy 0, policy_version 95200 (0.0006) [2023-03-07 05:26:24,037][118044] Updated weights for policy 0, policy_version 95210 (0.0006) [2023-03-07 05:26:24,808][118044] Updated weights for policy 0, policy_version 95220 (0.0006) [2023-03-07 05:26:25,583][118044] Updated weights for policy 0, policy_version 95230 (0.0006) [2023-03-07 05:26:26,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 97521664. Throughput: 0: 13141.4. Samples: 97487932. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:26:26,086][117718] Avg episode reward: [(0, '2650.468')] [2023-03-07 05:26:26,372][118044] Updated weights for policy 0, policy_version 95240 (0.0006) [2023-03-07 05:26:27,154][118044] Updated weights for policy 0, policy_version 95250 (0.0006) [2023-03-07 05:26:27,955][118044] Updated weights for policy 0, policy_version 95260 (0.0006) [2023-03-07 05:26:28,721][118044] Updated weights for policy 0, policy_version 95270 (0.0006) [2023-03-07 05:26:29,503][118044] Updated weights for policy 0, policy_version 95280 (0.0007) [2023-03-07 05:26:30,263][118044] Updated weights for policy 0, policy_version 95290 (0.0007) [2023-03-07 05:26:31,053][118044] Updated weights for policy 0, policy_version 95300 (0.0006) [2023-03-07 05:26:31,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13141.3, 300 sec: 13131.5). Total num frames: 97587200. Throughput: 0: 13125.1. Samples: 97566495. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:26:31,086][117718] Avg episode reward: [(0, '2627.899')] [2023-03-07 05:26:31,826][118044] Updated weights for policy 0, policy_version 95310 (0.0006) [2023-03-07 05:26:32,609][118044] Updated weights for policy 0, policy_version 95320 (0.0007) [2023-03-07 05:26:33,410][118044] Updated weights for policy 0, policy_version 95330 (0.0006) [2023-03-07 05:26:34,182][118044] Updated weights for policy 0, policy_version 95340 (0.0006) [2023-03-07 05:26:34,977][118044] Updated weights for policy 0, policy_version 95350 (0.0006) [2023-03-07 05:26:35,754][118044] Updated weights for policy 0, policy_version 95360 (0.0006) [2023-03-07 05:26:36,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13131.5). Total num frames: 97652736. Throughput: 0: 13128.5. Samples: 97645279. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:26:36,086][117718] Avg episode reward: [(0, '2629.868')] [2023-03-07 05:26:36,090][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000095364_97652736.pth... [2023-03-07 05:26:36,119][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000092287_94501888.pth [2023-03-07 05:26:36,519][118044] Updated weights for policy 0, policy_version 95370 (0.0006) [2023-03-07 05:26:37,306][118044] Updated weights for policy 0, policy_version 95380 (0.0005) [2023-03-07 05:26:38,082][118044] Updated weights for policy 0, policy_version 95390 (0.0006) [2023-03-07 05:26:38,851][118044] Updated weights for policy 0, policy_version 95400 (0.0006) [2023-03-07 05:26:39,620][118044] Updated weights for policy 0, policy_version 95410 (0.0006) [2023-03-07 05:26:40,403][118044] Updated weights for policy 0, policy_version 95420 (0.0006) [2023-03-07 05:26:41,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13124.3, 300 sec: 13131.5). Total num frames: 97718272. Throughput: 0: 13129.7. 
Samples: 97684759. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:26:41,086][117718] Avg episode reward: [(0, '2809.572')] [2023-03-07 05:26:41,185][118044] Updated weights for policy 0, policy_version 95430 (0.0006) [2023-03-07 05:26:41,949][118044] Updated weights for policy 0, policy_version 95440 (0.0006) [2023-03-07 05:26:42,723][118044] Updated weights for policy 0, policy_version 95450 (0.0006) [2023-03-07 05:26:43,525][118044] Updated weights for policy 0, policy_version 95460 (0.0006) [2023-03-07 05:26:44,293][118044] Updated weights for policy 0, policy_version 95470 (0.0006) [2023-03-07 05:26:45,069][118044] Updated weights for policy 0, policy_version 95480 (0.0006) [2023-03-07 05:26:45,869][118044] Updated weights for policy 0, policy_version 95490 (0.0006) [2023-03-07 05:26:46,085][117718] Fps is (10 sec: 13209.8, 60 sec: 13141.4, 300 sec: 13135.0). Total num frames: 97784832. Throughput: 0: 13145.1. Samples: 97763823. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:26:46,086][117718] Avg episode reward: [(0, '2644.755')] [2023-03-07 05:26:46,641][118044] Updated weights for policy 0, policy_version 95500 (0.0007) [2023-03-07 05:26:47,418][118044] Updated weights for policy 0, policy_version 95510 (0.0006) [2023-03-07 05:26:48,207][118044] Updated weights for policy 0, policy_version 95520 (0.0007) [2023-03-07 05:26:48,966][118044] Updated weights for policy 0, policy_version 95530 (0.0005) [2023-03-07 05:26:49,750][118044] Updated weights for policy 0, policy_version 95540 (0.0007) [2023-03-07 05:26:50,524][118044] Updated weights for policy 0, policy_version 95550 (0.0006) [2023-03-07 05:26:51,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13131.5). Total num frames: 97849344. Throughput: 0: 13136.7. Samples: 97842438. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:26:51,086][117718] Avg episode reward: [(0, '2700.454')] [2023-03-07 05:26:51,301][118044] Updated weights for policy 0, policy_version 95560 (0.0005) [2023-03-07 05:26:52,097][118044] Updated weights for policy 0, policy_version 95570 (0.0006) [2023-03-07 05:26:52,864][118044] Updated weights for policy 0, policy_version 95580 (0.0006) [2023-03-07 05:26:53,630][118044] Updated weights for policy 0, policy_version 95590 (0.0006) [2023-03-07 05:26:54,414][118044] Updated weights for policy 0, policy_version 95600 (0.0006) [2023-03-07 05:26:55,210][118044] Updated weights for policy 0, policy_version 95610 (0.0005) [2023-03-07 05:26:55,989][118044] Updated weights for policy 0, policy_version 95620 (0.0007) [2023-03-07 05:26:56,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 97915904. Throughput: 0: 13149.7. Samples: 97882064. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:26:56,086][117718] Avg episode reward: [(0, '2727.596')] [2023-03-07 05:26:56,773][118044] Updated weights for policy 0, policy_version 95630 (0.0006) [2023-03-07 05:26:57,548][118044] Updated weights for policy 0, policy_version 95640 (0.0006) [2023-03-07 05:26:58,340][118044] Updated weights for policy 0, policy_version 95650 (0.0006) [2023-03-07 05:26:59,118][118044] Updated weights for policy 0, policy_version 95660 (0.0006) [2023-03-07 05:26:59,899][118044] Updated weights for policy 0, policy_version 95670 (0.0005) [2023-03-07 05:27:00,694][118044] Updated weights for policy 0, policy_version 95680 (0.0006) [2023-03-07 05:27:01,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 97981440. Throughput: 0: 13146.5. Samples: 97960642. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:27:01,086][117718] Avg episode reward: [(0, '2708.182')] [2023-03-07 05:27:01,446][118044] Updated weights for policy 0, policy_version 95690 (0.0006) [2023-03-07 05:27:02,219][118044] Updated weights for policy 0, policy_version 95700 (0.0006) [2023-03-07 05:27:03,007][118044] Updated weights for policy 0, policy_version 95710 (0.0007) [2023-03-07 05:27:03,778][118044] Updated weights for policy 0, policy_version 95720 (0.0007) [2023-03-07 05:27:04,549][118044] Updated weights for policy 0, policy_version 95730 (0.0006) [2023-03-07 05:27:05,342][118044] Updated weights for policy 0, policy_version 95740 (0.0007) [2023-03-07 05:27:06,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 98046976. Throughput: 0: 13146.9. Samples: 98039818. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:27:06,086][117718] Avg episode reward: [(0, '2739.455')] [2023-03-07 05:27:06,129][118044] Updated weights for policy 0, policy_version 95750 (0.0007) [2023-03-07 05:27:06,890][118044] Updated weights for policy 0, policy_version 95760 (0.0006) [2023-03-07 05:27:07,664][118044] Updated weights for policy 0, policy_version 95770 (0.0006) [2023-03-07 05:27:08,449][118044] Updated weights for policy 0, policy_version 95780 (0.0006) [2023-03-07 05:27:09,221][118044] Updated weights for policy 0, policy_version 95790 (0.0006) [2023-03-07 05:27:10,023][118044] Updated weights for policy 0, policy_version 95800 (0.0006) [2023-03-07 05:27:10,805][118044] Updated weights for policy 0, policy_version 95810 (0.0006) [2023-03-07 05:27:11,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13131.5). Total num frames: 98112512. Throughput: 0: 13138.7. Samples: 98079174. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:27:11,086][117718] Avg episode reward: [(0, '2713.671')] [2023-03-07 05:27:11,586][118044] Updated weights for policy 0, policy_version 95820 (0.0007) [2023-03-07 05:27:12,359][118044] Updated weights for policy 0, policy_version 95830 (0.0006) [2023-03-07 05:27:13,131][118044] Updated weights for policy 0, policy_version 95840 (0.0006) [2023-03-07 05:27:13,905][118044] Updated weights for policy 0, policy_version 95850 (0.0007) [2023-03-07 05:27:14,678][118044] Updated weights for policy 0, policy_version 95860 (0.0006) [2023-03-07 05:27:15,459][118044] Updated weights for policy 0, policy_version 95870 (0.0006) [2023-03-07 05:27:16,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13135.0). Total num frames: 98179072. Throughput: 0: 13149.6. Samples: 98158229. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:27:16,086][117718] Avg episode reward: [(0, '2698.561')] [2023-03-07 05:27:16,238][118044] Updated weights for policy 0, policy_version 95880 (0.0006) [2023-03-07 05:27:17,013][118044] Updated weights for policy 0, policy_version 95890 (0.0006) [2023-03-07 05:27:17,790][118044] Updated weights for policy 0, policy_version 95900 (0.0005) [2023-03-07 05:27:18,562][118044] Updated weights for policy 0, policy_version 95910 (0.0006) [2023-03-07 05:27:19,337][118044] Updated weights for policy 0, policy_version 95920 (0.0006) [2023-03-07 05:27:20,127][118044] Updated weights for policy 0, policy_version 95930 (0.0007) [2023-03-07 05:27:20,894][118044] Updated weights for policy 0, policy_version 95940 (0.0006) [2023-03-07 05:27:21,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13135.0). Total num frames: 98244608. Throughput: 0: 13151.6. Samples: 98237101. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:27:21,086][117718] Avg episode reward: [(0, '2656.413')] [2023-03-07 05:27:21,663][118044] Updated weights for policy 0, policy_version 95950 (0.0007) [2023-03-07 05:27:22,442][118044] Updated weights for policy 0, policy_version 95960 (0.0006) [2023-03-07 05:27:23,223][118044] Updated weights for policy 0, policy_version 95970 (0.0005) [2023-03-07 05:27:24,004][118044] Updated weights for policy 0, policy_version 95980 (0.0006) [2023-03-07 05:27:24,781][118044] Updated weights for policy 0, policy_version 95990 (0.0006) [2023-03-07 05:27:25,568][118044] Updated weights for policy 0, policy_version 96000 (0.0006) [2023-03-07 05:27:26,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.4, 300 sec: 13131.5). Total num frames: 98310144. Throughput: 0: 13152.4. Samples: 98276616. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:27:26,086][117718] Avg episode reward: [(0, '2574.198')] [2023-03-07 05:27:26,339][118044] Updated weights for policy 0, policy_version 96010 (0.0006) [2023-03-07 05:27:27,134][118044] Updated weights for policy 0, policy_version 96020 (0.0006) [2023-03-07 05:27:27,920][118044] Updated weights for policy 0, policy_version 96030 (0.0006) [2023-03-07 05:27:28,690][118044] Updated weights for policy 0, policy_version 96040 (0.0007) [2023-03-07 05:27:29,465][118044] Updated weights for policy 0, policy_version 96050 (0.0006) [2023-03-07 05:27:30,270][118044] Updated weights for policy 0, policy_version 96060 (0.0006) [2023-03-07 05:27:31,050][118044] Updated weights for policy 0, policy_version 96070 (0.0006) [2023-03-07 05:27:31,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13131.5). Total num frames: 98375680. Throughput: 0: 13146.1. Samples: 98355399. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 05:27:31,086][117718] Avg episode reward: [(0, '2678.678')] [2023-03-07 05:27:31,805][118044] Updated weights for policy 0, policy_version 96080 (0.0006) [2023-03-07 05:27:32,597][118044] Updated weights for policy 0, policy_version 96090 (0.0006) [2023-03-07 05:27:33,365][118044] Updated weights for policy 0, policy_version 96100 (0.0007) [2023-03-07 05:27:34,135][118044] Updated weights for policy 0, policy_version 96110 (0.0006) [2023-03-07 05:27:34,921][118044] Updated weights for policy 0, policy_version 96120 (0.0005) [2023-03-07 05:27:35,698][118044] Updated weights for policy 0, policy_version 96130 (0.0005) [2023-03-07 05:27:36,085][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13135.0). Total num frames: 98442240. Throughput: 0: 13154.2. Samples: 98434377. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 05:27:36,086][117718] Avg episode reward: [(0, '2711.150')] [2023-03-07 05:27:36,481][118044] Updated weights for policy 0, policy_version 96140 (0.0006) [2023-03-07 05:27:37,265][118044] Updated weights for policy 0, policy_version 96150 (0.0006) [2023-03-07 05:27:38,042][118044] Updated weights for policy 0, policy_version 96160 (0.0006) [2023-03-07 05:27:38,813][118044] Updated weights for policy 0, policy_version 96170 (0.0006) [2023-03-07 05:27:39,598][118044] Updated weights for policy 0, policy_version 96180 (0.0006) [2023-03-07 05:27:40,378][118044] Updated weights for policy 0, policy_version 96190 (0.0006) [2023-03-07 05:27:41,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13135.0). Total num frames: 98507776. Throughput: 0: 13146.3. Samples: 98473646. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 05:27:41,086][117718] Avg episode reward: [(0, '2689.002')] [2023-03-07 05:27:41,152][118044] Updated weights for policy 0, policy_version 96200 (0.0007) [2023-03-07 05:27:41,956][118044] Updated weights for policy 0, policy_version 96210 (0.0007) [2023-03-07 05:27:42,718][118044] Updated weights for policy 0, policy_version 96220 (0.0005) [2023-03-07 05:27:43,499][118044] Updated weights for policy 0, policy_version 96230 (0.0005) [2023-03-07 05:27:44,274][118044] Updated weights for policy 0, policy_version 96240 (0.0006) [2023-03-07 05:27:45,057][118044] Updated weights for policy 0, policy_version 96250 (0.0006) [2023-03-07 05:27:45,831][118044] Updated weights for policy 0, policy_version 96260 (0.0006) [2023-03-07 05:27:46,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 98573312. Throughput: 0: 13155.6. Samples: 98552646. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 05:27:46,086][117718] Avg episode reward: [(0, '2545.704')] [2023-03-07 05:27:46,611][118044] Updated weights for policy 0, policy_version 96270 (0.0006) [2023-03-07 05:27:47,393][118044] Updated weights for policy 0, policy_version 96280 (0.0006) [2023-03-07 05:27:48,152][118044] Updated weights for policy 0, policy_version 96290 (0.0006) [2023-03-07 05:27:48,932][118044] Updated weights for policy 0, policy_version 96300 (0.0006) [2023-03-07 05:27:49,712][118044] Updated weights for policy 0, policy_version 96310 (0.0006) [2023-03-07 05:27:50,509][118044] Updated weights for policy 0, policy_version 96320 (0.0007) [2023-03-07 05:27:51,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13135.0). Total num frames: 98638848. Throughput: 0: 13151.6. Samples: 98631639. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 05:27:51,086][117718] Avg episode reward: [(0, '2825.681')] [2023-03-07 05:27:51,285][118044] Updated weights for policy 0, policy_version 96330 (0.0006) [2023-03-07 05:27:52,055][118044] Updated weights for policy 0, policy_version 96340 (0.0006) [2023-03-07 05:27:52,851][118044] Updated weights for policy 0, policy_version 96350 (0.0007) [2023-03-07 05:27:53,620][118044] Updated weights for policy 0, policy_version 96360 (0.0006) [2023-03-07 05:27:54,396][118044] Updated weights for policy 0, policy_version 96370 (0.0006) [2023-03-07 05:27:55,182][118044] Updated weights for policy 0, policy_version 96380 (0.0006) [2023-03-07 05:27:55,956][118044] Updated weights for policy 0, policy_version 96390 (0.0006) [2023-03-07 05:27:56,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13131.5). Total num frames: 98704384. Throughput: 0: 13148.8. Samples: 98670870. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 05:27:56,086][117718] Avg episode reward: [(0, '2659.262')] [2023-03-07 05:27:56,739][118044] Updated weights for policy 0, policy_version 96400 (0.0005) [2023-03-07 05:27:57,537][118044] Updated weights for policy 0, policy_version 96410 (0.0006) [2023-03-07 05:27:58,321][118044] Updated weights for policy 0, policy_version 96420 (0.0006) [2023-03-07 05:27:59,094][118044] Updated weights for policy 0, policy_version 96430 (0.0006) [2023-03-07 05:27:59,869][118044] Updated weights for policy 0, policy_version 96440 (0.0007) [2023-03-07 05:28:00,656][118044] Updated weights for policy 0, policy_version 96450 (0.0006) [2023-03-07 05:28:01,085][117718] Fps is (10 sec: 13107.0, 60 sec: 13141.3, 300 sec: 13131.5). Total num frames: 98769920. Throughput: 0: 13141.8. Samples: 98749613. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 05:28:01,086][117718] Avg episode reward: [(0, '2794.395')] [2023-03-07 05:28:01,430][118044] Updated weights for policy 0, policy_version 96460 (0.0006) [2023-03-07 05:28:02,221][118044] Updated weights for policy 0, policy_version 96470 (0.0006) [2023-03-07 05:28:03,004][118044] Updated weights for policy 0, policy_version 96480 (0.0007) [2023-03-07 05:28:03,770][118044] Updated weights for policy 0, policy_version 96490 (0.0006) [2023-03-07 05:28:04,559][118044] Updated weights for policy 0, policy_version 96500 (0.0006) [2023-03-07 05:28:05,321][118044] Updated weights for policy 0, policy_version 96510 (0.0007) [2023-03-07 05:28:06,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.4, 300 sec: 13131.5). Total num frames: 98835456. Throughput: 0: 13141.8. Samples: 98828482. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 05:28:06,086][117718] Avg episode reward: [(0, '2741.758')] [2023-03-07 05:28:06,094][118044] Updated weights for policy 0, policy_version 96520 (0.0006) [2023-03-07 05:28:06,871][118044] Updated weights for policy 0, policy_version 96530 (0.0007) [2023-03-07 05:28:07,654][118044] Updated weights for policy 0, policy_version 96540 (0.0007) [2023-03-07 05:28:08,428][118044] Updated weights for policy 0, policy_version 96550 (0.0006) [2023-03-07 05:28:09,213][118044] Updated weights for policy 0, policy_version 96560 (0.0006) [2023-03-07 05:28:09,989][118044] Updated weights for policy 0, policy_version 96570 (0.0006) [2023-03-07 05:28:10,789][118044] Updated weights for policy 0, policy_version 96580 (0.0006) [2023-03-07 05:28:11,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13131.5). Total num frames: 98900992. Throughput: 0: 13138.3. Samples: 98867840. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 05:28:11,086][117718] Avg episode reward: [(0, '2652.135')] [2023-03-07 05:28:11,576][118044] Updated weights for policy 0, policy_version 96590 (0.0006) [2023-03-07 05:28:12,346][118044] Updated weights for policy 0, policy_version 96600 (0.0006) [2023-03-07 05:28:13,110][118044] Updated weights for policy 0, policy_version 96610 (0.0006) [2023-03-07 05:28:13,905][118044] Updated weights for policy 0, policy_version 96620 (0.0006) [2023-03-07 05:28:14,683][118044] Updated weights for policy 0, policy_version 96630 (0.0006) [2023-03-07 05:28:15,446][118044] Updated weights for policy 0, policy_version 96640 (0.0006) [2023-03-07 05:28:16,086][117718] Fps is (10 sec: 13209.4, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 98967552. Throughput: 0: 13135.4. Samples: 98946494. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 05:28:16,086][117718] Avg episode reward: [(0, '2639.885')] [2023-03-07 05:28:16,233][118044] Updated weights for policy 0, policy_version 96650 (0.0006) [2023-03-07 05:28:17,008][118044] Updated weights for policy 0, policy_version 96660 (0.0006) [2023-03-07 05:28:17,779][118044] Updated weights for policy 0, policy_version 96670 (0.0006) [2023-03-07 05:28:18,563][118044] Updated weights for policy 0, policy_version 96680 (0.0006) [2023-03-07 05:28:19,340][118044] Updated weights for policy 0, policy_version 96690 (0.0006) [2023-03-07 05:28:20,124][118044] Updated weights for policy 0, policy_version 96700 (0.0007) [2023-03-07 05:28:20,916][118044] Updated weights for policy 0, policy_version 96710 (0.0006) [2023-03-07 05:28:21,085][117718] Fps is (10 sec: 13209.7, 60 sec: 13141.4, 300 sec: 13135.0). Total num frames: 99033088. Throughput: 0: 13136.5. Samples: 99025519. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 05:28:21,086][117718] Avg episode reward: [(0, '2663.990')] [2023-03-07 05:28:21,694][118044] Updated weights for policy 0, policy_version 96720 (0.0006) [2023-03-07 05:28:22,463][118044] Updated weights for policy 0, policy_version 96730 (0.0006) [2023-03-07 05:28:23,264][118044] Updated weights for policy 0, policy_version 96740 (0.0007) [2023-03-07 05:28:24,026][118044] Updated weights for policy 0, policy_version 96750 (0.0007) [2023-03-07 05:28:24,813][118044] Updated weights for policy 0, policy_version 96760 (0.0006) [2023-03-07 05:28:25,599][118044] Updated weights for policy 0, policy_version 96770 (0.0007) [2023-03-07 05:28:26,086][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 99098624. Throughput: 0: 13139.7. Samples: 99064934. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:28:26,086][117718] Avg episode reward: [(0, '2657.049')] [2023-03-07 05:28:26,352][118044] Updated weights for policy 0, policy_version 96780 (0.0006) [2023-03-07 05:28:27,127][118044] Updated weights for policy 0, policy_version 96790 (0.0006) [2023-03-07 05:28:27,910][118044] Updated weights for policy 0, policy_version 96800 (0.0007) [2023-03-07 05:28:28,675][118044] Updated weights for policy 0, policy_version 96810 (0.0006) [2023-03-07 05:28:29,464][118044] Updated weights for policy 0, policy_version 96820 (0.0006) [2023-03-07 05:28:30,242][118044] Updated weights for policy 0, policy_version 96830 (0.0005) [2023-03-07 05:28:31,004][118044] Updated weights for policy 0, policy_version 96840 (0.0006) [2023-03-07 05:28:31,085][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 99164160. Throughput: 0: 13138.9. Samples: 99143895. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:28:31,086][117718] Avg episode reward: [(0, '2635.686')] [2023-03-07 05:28:31,789][118044] Updated weights for policy 0, policy_version 96850 (0.0006) [2023-03-07 05:28:32,569][118044] Updated weights for policy 0, policy_version 96860 (0.0006) [2023-03-07 05:28:33,341][118044] Updated weights for policy 0, policy_version 96870 (0.0006) [2023-03-07 05:28:34,133][118044] Updated weights for policy 0, policy_version 96880 (0.0006) [2023-03-07 05:28:34,905][118044] Updated weights for policy 0, policy_version 96890 (0.0006) [2023-03-07 05:28:35,704][118044] Updated weights for policy 0, policy_version 96900 (0.0006) [2023-03-07 05:28:36,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13138.4). Total num frames: 99230720. Throughput: 0: 13137.2. Samples: 99222816. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:28:36,086][117718] Avg episode reward: [(0, '2598.386')] [2023-03-07 05:28:36,090][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000096905_99230720.pth... [2023-03-07 05:28:36,121][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000093826_96077824.pth [2023-03-07 05:28:36,473][118044] Updated weights for policy 0, policy_version 96910 (0.0006) [2023-03-07 05:28:37,228][118044] Updated weights for policy 0, policy_version 96920 (0.0007) [2023-03-07 05:28:38,014][118044] Updated weights for policy 0, policy_version 96930 (0.0006) [2023-03-07 05:28:38,794][118044] Updated weights for policy 0, policy_version 96940 (0.0006) [2023-03-07 05:28:39,566][118044] Updated weights for policy 0, policy_version 96950 (0.0006) [2023-03-07 05:28:40,369][118044] Updated weights for policy 0, policy_version 96960 (0.0006) [2023-03-07 05:28:41,085][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 99296256. Throughput: 0: 13146.1. 
Samples: 99262442. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:28:41,086][117718] Avg episode reward: [(0, '2432.085')] [2023-03-07 05:28:41,150][118044] Updated weights for policy 0, policy_version 96970 (0.0007) [2023-03-07 05:28:41,933][118044] Updated weights for policy 0, policy_version 96980 (0.0007) [2023-03-07 05:28:42,719][118044] Updated weights for policy 0, policy_version 96990 (0.0006) [2023-03-07 05:28:43,486][118044] Updated weights for policy 0, policy_version 97000 (0.0006) [2023-03-07 05:28:44,280][118044] Updated weights for policy 0, policy_version 97010 (0.0006) [2023-03-07 05:28:45,055][118044] Updated weights for policy 0, policy_version 97020 (0.0006) [2023-03-07 05:28:45,829][118044] Updated weights for policy 0, policy_version 97030 (0.0007) [2023-03-07 05:28:46,085][117718] Fps is (10 sec: 13107.4, 60 sec: 13141.3, 300 sec: 13138.4). Total num frames: 99361792. Throughput: 0: 13144.9. Samples: 99341134. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:28:46,086][117718] Avg episode reward: [(0, '2447.689')] [2023-03-07 05:28:46,597][118044] Updated weights for policy 0, policy_version 97040 (0.0007) [2023-03-07 05:28:47,371][118044] Updated weights for policy 0, policy_version 97050 (0.0006) [2023-03-07 05:28:48,170][118044] Updated weights for policy 0, policy_version 97060 (0.0007) [2023-03-07 05:28:48,933][118044] Updated weights for policy 0, policy_version 97070 (0.0006) [2023-03-07 05:28:49,716][118044] Updated weights for policy 0, policy_version 97080 (0.0006) [2023-03-07 05:28:50,491][118044] Updated weights for policy 0, policy_version 97090 (0.0006) [2023-03-07 05:28:51,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 99427328. Throughput: 0: 13144.4. Samples: 99419981. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:28:51,086][117718] Avg episode reward: [(0, '2455.622')] [2023-03-07 05:28:51,301][118044] Updated weights for policy 0, policy_version 97100 (0.0007) [2023-03-07 05:28:52,073][118044] Updated weights for policy 0, policy_version 97110 (0.0006) [2023-03-07 05:28:52,865][118044] Updated weights for policy 0, policy_version 97120 (0.0006) [2023-03-07 05:28:53,642][118044] Updated weights for policy 0, policy_version 97130 (0.0006) [2023-03-07 05:28:54,416][118044] Updated weights for policy 0, policy_version 97140 (0.0006) [2023-03-07 05:28:55,204][118044] Updated weights for policy 0, policy_version 97150 (0.0006) [2023-03-07 05:28:55,979][118044] Updated weights for policy 0, policy_version 97160 (0.0007) [2023-03-07 05:28:56,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 99492864. Throughput: 0: 13139.8. Samples: 99459132. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:28:56,086][117718] Avg episode reward: [(0, '2441.716')] [2023-03-07 05:28:56,760][118044] Updated weights for policy 0, policy_version 97170 (0.0006) [2023-03-07 05:28:57,544][118044] Updated weights for policy 0, policy_version 97180 (0.0007) [2023-03-07 05:28:58,320][118044] Updated weights for policy 0, policy_version 97190 (0.0006) [2023-03-07 05:28:59,091][118044] Updated weights for policy 0, policy_version 97200 (0.0006) [2023-03-07 05:28:59,871][118044] Updated weights for policy 0, policy_version 97210 (0.0006) [2023-03-07 05:29:00,641][118044] Updated weights for policy 0, policy_version 97220 (0.0006) [2023-03-07 05:29:01,085][117718] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 99558400. Throughput: 0: 13145.5. Samples: 99538041. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:29:01,086][117718] Avg episode reward: [(0, '2525.623')] [2023-03-07 05:29:01,422][118044] Updated weights for policy 0, policy_version 97230 (0.0006) [2023-03-07 05:29:02,204][118044] Updated weights for policy 0, policy_version 97240 (0.0006) [2023-03-07 05:29:02,994][118044] Updated weights for policy 0, policy_version 97250 (0.0006) [2023-03-07 05:29:03,766][118044] Updated weights for policy 0, policy_version 97260 (0.0006) [2023-03-07 05:29:04,538][118044] Updated weights for policy 0, policy_version 97270 (0.0006) [2023-03-07 05:29:05,326][118044] Updated weights for policy 0, policy_version 97280 (0.0006) [2023-03-07 05:29:06,086][117718] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13138.4). Total num frames: 99623936. Throughput: 0: 13144.1. Samples: 99617004. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:29:06,086][117718] Avg episode reward: [(0, '2458.195')] [2023-03-07 05:29:06,102][118044] Updated weights for policy 0, policy_version 97290 (0.0006) [2023-03-07 05:29:06,867][118044] Updated weights for policy 0, policy_version 97300 (0.0006) [2023-03-07 05:29:07,652][118044] Updated weights for policy 0, policy_version 97310 (0.0006) [2023-03-07 05:29:08,434][118044] Updated weights for policy 0, policy_version 97320 (0.0007) [2023-03-07 05:29:09,222][118044] Updated weights for policy 0, policy_version 97330 (0.0006) [2023-03-07 05:29:09,995][118044] Updated weights for policy 0, policy_version 97340 (0.0006) [2023-03-07 05:29:10,766][118044] Updated weights for policy 0, policy_version 97350 (0.0006) [2023-03-07 05:29:11,086][117718] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13141.9). Total num frames: 99690496. Throughput: 0: 13146.3. Samples: 99656516. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:29:11,086][117718] Avg episode reward: [(0, '2579.944')] [2023-03-07 05:29:11,540][118044] Updated weights for policy 0, policy_version 97360 (0.0006) [2023-03-07 05:29:12,321][118044] Updated weights for policy 0, policy_version 97370 (0.0007) [2023-03-07 05:29:13,102][118044] Updated weights for policy 0, policy_version 97380 (0.0006) [2023-03-07 05:29:13,879][118044] Updated weights for policy 0, policy_version 97390 (0.0006) [2023-03-07 05:29:14,662][118044] Updated weights for policy 0, policy_version 97400 (0.0006) [2023-03-07 05:29:15,459][118044] Updated weights for policy 0, policy_version 97410 (0.0006) [2023-03-07 05:29:16,086][117718] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 99756032. Throughput: 0: 13141.1. Samples: 99735244. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:29:16,086][117718] Avg episode reward: [(0, '2437.649')] [2023-03-07 05:29:16,235][118044] Updated weights for policy 0, policy_version 97420 (0.0006) [2023-03-07 05:29:17,021][118044] Updated weights for policy 0, policy_version 97430 (0.0006) [2023-03-07 05:29:17,793][118044] Updated weights for policy 0, policy_version 97440 (0.0006) [2023-03-07 05:29:18,579][118044] Updated weights for policy 0, policy_version 97450 (0.0006) [2023-03-07 05:29:19,359][118044] Updated weights for policy 0, policy_version 97460 (0.0006) [2023-03-07 05:29:20,140][118044] Updated weights for policy 0, policy_version 97470 (0.0006) [2023-03-07 05:29:20,915][118044] Updated weights for policy 0, policy_version 97480 (0.0006) [2023-03-07 05:29:21,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13138.4). Total num frames: 99821568. Throughput: 0: 13132.5. Samples: 99813778. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 05:29:21,086][117718] Avg episode reward: [(0, '2512.302')] [2023-03-07 05:29:21,706][118044] Updated weights for policy 0, policy_version 97490 (0.0006) [2023-03-07 05:29:22,493][118044] Updated weights for policy 0, policy_version 97500 (0.0006) [2023-03-07 05:29:23,271][118044] Updated weights for policy 0, policy_version 97510 (0.0006) [2023-03-07 05:29:24,039][118044] Updated weights for policy 0, policy_version 97520 (0.0006) [2023-03-07 05:29:24,827][118044] Updated weights for policy 0, policy_version 97530 (0.0006) [2023-03-07 05:29:25,598][118044] Updated weights for policy 0, policy_version 97540 (0.0006) [2023-03-07 05:29:26,085][117718] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13138.4). Total num frames: 99887104. Throughput: 0: 13125.9. Samples: 99853110. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:29:26,086][117718] Avg episode reward: [(0, '2531.855')] [2023-03-07 05:29:26,377][118044] Updated weights for policy 0, policy_version 97550 (0.0006) [2023-03-07 05:29:27,167][118044] Updated weights for policy 0, policy_version 97560 (0.0006) [2023-03-07 05:29:27,968][118044] Updated weights for policy 0, policy_version 97570 (0.0006) [2023-03-07 05:29:28,734][118044] Updated weights for policy 0, policy_version 97580 (0.0006) [2023-03-07 05:29:29,533][118044] Updated weights for policy 0, policy_version 97590 (0.0005) [2023-03-07 05:29:30,310][118044] Updated weights for policy 0, policy_version 97600 (0.0007) [2023-03-07 05:29:31,082][118044] Updated weights for policy 0, policy_version 97610 (0.0006) [2023-03-07 05:29:31,086][117718] Fps is (10 sec: 13107.0, 60 sec: 13141.3, 300 sec: 13138.4). Total num frames: 99952640. Throughput: 0: 13122.1. Samples: 99931631. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 05:29:31,086][117718] Avg episode reward: [(0, '2575.127')] [2023-03-07 05:29:31,847][118044] Updated weights for policy 0, policy_version 97620 (0.0005) [2023-03-07 05:29:32,656][118044] Updated weights for policy 0, policy_version 97630 (0.0006) [2023-03-07 05:29:33,425][118044] Updated weights for policy 0, policy_version 97640 (0.0006) [2023-03-07 05:29:34,203][118044] Updated weights for policy 0, policy_version 97650 (0.0007) [2023-03-07 05:29:34,839][118214] Stopping RolloutWorker_w17... [2023-03-07 05:29:34,839][118510] Stopping RolloutWorker_w24... [2023-03-07 05:29:34,839][118509] Stopping RolloutWorker_w23... [2023-03-07 05:29:34,839][118204] Stopping RolloutWorker_w4... [2023-03-07 05:29:34,839][118546] Stopping RolloutWorker_w29... [2023-03-07 05:29:34,839][118641] Stopping RolloutWorker_w30... [2023-03-07 05:29:34,839][118206] Stopping RolloutWorker_w11... [2023-03-07 05:29:34,839][118045] Stopping RolloutWorker_w0... [2023-03-07 05:29:34,839][118212] Stopping RolloutWorker_w10... [2023-03-07 05:29:34,839][118508] Stopping RolloutWorker_w22... [2023-03-07 05:29:34,839][118214] Loop rollout_proc17_evt_loop terminating... [2023-03-07 05:29:34,839][118048] Stopping RolloutWorker_w3... [2023-03-07 05:29:34,839][118210] Stopping RolloutWorker_w18... [2023-03-07 05:29:34,839][118513] Stopping RolloutWorker_w27... [2023-03-07 05:29:34,839][117993] Stopping Batcher_0... [2023-03-07 05:29:34,839][118213] Stopping RolloutWorker_w9... [2023-03-07 05:29:34,839][118296] Stopping RolloutWorker_w20... [2023-03-07 05:29:34,839][118445] Stopping RolloutWorker_w21... [2023-03-07 05:29:34,839][118216] Stopping RolloutWorker_w16... [2023-03-07 05:29:34,839][118046] Stopping RolloutWorker_w1... [2023-03-07 05:29:34,839][118444] Stopping RolloutWorker_w8... [2023-03-07 05:29:34,839][118512] Stopping RolloutWorker_w26... [2023-03-07 05:29:34,839][118443] Stopping RolloutWorker_w14... 
[2023-03-07 05:29:34,839][118510] Loop rollout_proc24_evt_loop terminating... [2023-03-07 05:29:34,839][118511] Stopping RolloutWorker_w25... [2023-03-07 05:29:34,839][118509] Loop rollout_proc23_evt_loop terminating... [2023-03-07 05:29:34,839][118208] Stopping RolloutWorker_w7... [2023-03-07 05:29:34,839][118204] Loop rollout_proc4_evt_loop terminating... [2023-03-07 05:29:34,839][118048] Loop rollout_proc3_evt_loop terminating... [2023-03-07 05:29:34,839][118546] Loop rollout_proc29_evt_loop terminating... [2023-03-07 05:29:34,839][118641] Loop rollout_proc30_evt_loop terminating... [2023-03-07 05:29:34,839][118047] Stopping RolloutWorker_w2... [2023-03-07 05:29:34,839][118209] Stopping RolloutWorker_w13... [2023-03-07 05:29:34,839][118045] Loop rollout_proc0_evt_loop terminating... [2023-03-07 05:29:34,839][118640] Stopping RolloutWorker_w31... [2023-03-07 05:29:34,839][118296] Loop rollout_proc20_evt_loop terminating... [2023-03-07 05:29:34,839][118206] Loop rollout_proc11_evt_loop terminating... [2023-03-07 05:29:34,839][118508] Loop rollout_proc22_evt_loop terminating... [2023-03-07 05:29:34,839][118212] Loop rollout_proc10_evt_loop terminating... [2023-03-07 05:29:34,839][118046] Loop rollout_proc1_evt_loop terminating... [2023-03-07 05:29:34,839][118213] Loop rollout_proc9_evt_loop terminating... [2023-03-07 05:29:34,839][118513] Loop rollout_proc27_evt_loop terminating... [2023-03-07 05:29:34,839][118445] Loop rollout_proc21_evt_loop terminating... [2023-03-07 05:29:34,839][117993] Loop batcher_evt_loop terminating... [2023-03-07 05:29:34,839][118216] Loop rollout_proc16_evt_loop terminating... [2023-03-07 05:29:34,839][118210] Loop rollout_proc18_evt_loop terminating... [2023-03-07 05:29:34,839][118444] Loop rollout_proc8_evt_loop terminating... [2023-03-07 05:29:34,839][118248] Stopping RolloutWorker_w19... [2023-03-07 05:29:34,839][118512] Loop rollout_proc26_evt_loop terminating... 
[2023-03-07 05:29:34,839][118443] Loop rollout_proc14_evt_loop terminating... [2023-03-07 05:29:34,840][118640] Loop rollout_proc31_evt_loop terminating... [2023-03-07 05:29:34,839][118208] Loop rollout_proc7_evt_loop terminating... [2023-03-07 05:29:34,839][118511] Loop rollout_proc25_evt_loop terminating... [2023-03-07 05:29:34,840][118209] Loop rollout_proc13_evt_loop terminating... [2023-03-07 05:29:34,840][118047] Loop rollout_proc2_evt_loop terminating... [2023-03-07 05:29:34,840][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000097658_100001792.pth... [2023-03-07 05:29:34,839][117718] Component RolloutWorker_w24 stopped! [2023-03-07 05:29:34,840][118207] Stopping RolloutWorker_w15... [2023-03-07 05:29:34,840][118207] Loop rollout_proc15_evt_loop terminating... [2023-03-07 05:29:34,840][117718] Component RolloutWorker_w17 stopped! [2023-03-07 05:29:34,841][117718] Component RolloutWorker_w23 stopped! [2023-03-07 05:29:34,841][117718] Component RolloutWorker_w4 stopped! [2023-03-07 05:29:34,841][117718] Component RolloutWorker_w29 stopped! [2023-03-07 05:29:34,841][117718] Component RolloutWorker_w11 stopped! [2023-03-07 05:29:34,842][117718] Component RolloutWorker_w30 stopped! [2023-03-07 05:29:34,842][117718] Component RolloutWorker_w0 stopped! [2023-03-07 05:29:34,842][117718] Component RolloutWorker_w10 stopped! [2023-03-07 05:29:34,842][117718] Component RolloutWorker_w18 stopped! [2023-03-07 05:29:34,842][118254] Stopping RolloutWorker_w12... [2023-03-07 05:29:34,842][117718] Component RolloutWorker_w22 stopped! [2023-03-07 05:29:34,843][118254] Loop rollout_proc12_evt_loop terminating... [2023-03-07 05:29:34,843][117718] Component Batcher_0 stopped! [2023-03-07 05:29:34,843][117718] Component RolloutWorker_w3 stopped! [2023-03-07 05:29:34,843][117718] Component RolloutWorker_w27 stopped! [2023-03-07 05:29:34,843][117718] Component RolloutWorker_w9 stopped! 
[2023-03-07 05:29:34,844][117718] Component RolloutWorker_w21 stopped! [2023-03-07 05:29:34,844][117718] Component RolloutWorker_w16 stopped! [2023-03-07 05:29:34,844][117718] Component RolloutWorker_w20 stopped! [2023-03-07 05:29:34,844][117718] Component RolloutWorker_w8 stopped! [2023-03-07 05:29:34,844][118249] Stopping RolloutWorker_w6... [2023-03-07 05:29:34,845][118249] Loop rollout_proc6_evt_loop terminating... [2023-03-07 05:29:34,845][117718] Component RolloutWorker_w1 stopped! [2023-03-07 05:29:34,845][117718] Component RolloutWorker_w26 stopped! [2023-03-07 05:29:34,846][117718] Component RolloutWorker_w14 stopped! [2023-03-07 05:29:34,846][117718] Component RolloutWorker_w2 stopped! [2023-03-07 05:29:34,847][117718] Component RolloutWorker_w25 stopped! [2023-03-07 05:29:34,847][117718] Component RolloutWorker_w7 stopped! [2023-03-07 05:29:34,847][117718] Component RolloutWorker_w13 stopped! [2023-03-07 05:29:34,847][117718] Component RolloutWorker_w19 stopped! [2023-03-07 05:29:34,848][117718] Component RolloutWorker_w31 stopped! [2023-03-07 05:29:34,848][117718] Component RolloutWorker_w15 stopped! [2023-03-07 05:29:34,848][117718] Component RolloutWorker_w12 stopped! [2023-03-07 05:29:34,849][117718] Component RolloutWorker_w6 stopped! [2023-03-07 05:29:34,852][117718] Component RolloutWorker_w5 stopped! [2023-03-07 05:29:34,852][118205] Stopping RolloutWorker_w5... [2023-03-07 05:29:34,853][118205] Loop rollout_proc5_evt_loop terminating... [2023-03-07 05:29:34,854][117718] Component RolloutWorker_w28 stopped! [2023-03-07 05:29:34,855][118545] Stopping RolloutWorker_w28... [2023-03-07 05:29:34,855][118545] Loop rollout_proc28_evt_loop terminating... [2023-03-07 05:29:34,864][118248] Loop rollout_proc19_evt_loop terminating... [2023-03-07 05:29:34,911][118044] Weights refcount: 2 0 [2023-03-07 05:29:34,914][118044] Stopping InferenceWorker_p0-w0... [2023-03-07 05:29:34,914][118044] Loop inference_proc0-0_evt_loop terminating... 
[2023-03-07 05:29:34,914][117718] Component InferenceWorker_p0-w0 stopped! [2023-03-07 05:29:34,949][117993] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000095364_97652736.pth [2023-03-07 05:29:34,958][117993] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-v2/checkpoint_p0/checkpoint_000097658_100001792.pth... [2023-03-07 05:29:35,055][117993] Stopping LearnerWorker_p0... [2023-03-07 05:29:35,055][117993] Loop learner_proc0_evt_loop terminating... [2023-03-07 05:29:35,055][117718] Component LearnerWorker_p0 stopped! [2023-03-07 05:29:35,056][117718] Waiting for process learner_proc0 to stop... [2023-03-07 05:29:36,214][117718] Waiting for process inference_proc0-0 to join... [2023-03-07 05:29:36,214][117718] Waiting for process rollout_proc0 to join... [2023-03-07 05:29:36,215][117718] Waiting for process rollout_proc1 to join... [2023-03-07 05:29:36,215][117718] Waiting for process rollout_proc2 to join... [2023-03-07 05:29:36,215][117718] Waiting for process rollout_proc3 to join... [2023-03-07 05:29:36,216][117718] Waiting for process rollout_proc4 to join... [2023-03-07 05:29:36,216][117718] Waiting for process rollout_proc5 to join... [2023-03-07 05:29:36,216][117718] Waiting for process rollout_proc6 to join... [2023-03-07 05:29:36,217][117718] Waiting for process rollout_proc7 to join... [2023-03-07 05:29:36,217][117718] Waiting for process rollout_proc8 to join... [2023-03-07 05:29:36,217][117718] Waiting for process rollout_proc9 to join... [2023-03-07 05:29:36,218][117718] Waiting for process rollout_proc10 to join... [2023-03-07 05:29:36,218][117718] Waiting for process rollout_proc11 to join... [2023-03-07 05:29:36,218][117718] Waiting for process rollout_proc12 to join... [2023-03-07 05:29:36,219][117718] Waiting for process rollout_proc13 to join... [2023-03-07 05:29:36,219][117718] Waiting for process rollout_proc14 to join... 
[2023-03-07 05:29:36,220][117718] Waiting for process rollout_proc15 to join... [2023-03-07 05:29:36,220][117718] Waiting for process rollout_proc16 to join... [2023-03-07 05:29:36,220][117718] Waiting for process rollout_proc17 to join... [2023-03-07 05:29:36,221][117718] Waiting for process rollout_proc18 to join... [2023-03-07 05:29:36,221][117718] Waiting for process rollout_proc19 to join... [2023-03-07 05:29:36,221][117718] Waiting for process rollout_proc20 to join... [2023-03-07 05:29:36,222][117718] Waiting for process rollout_proc21 to join... [2023-03-07 05:29:36,222][117718] Waiting for process rollout_proc22 to join... [2023-03-07 05:29:36,222][117718] Waiting for process rollout_proc23 to join... [2023-03-07 05:29:36,223][117718] Waiting for process rollout_proc24 to join... [2023-03-07 05:29:36,223][117718] Waiting for process rollout_proc25 to join... [2023-03-07 05:29:36,223][117718] Waiting for process rollout_proc26 to join... [2023-03-07 05:29:36,224][117718] Waiting for process rollout_proc27 to join... [2023-03-07 05:29:36,224][117718] Waiting for process rollout_proc28 to join... [2023-03-07 05:29:36,225][117718] Waiting for process rollout_proc29 to join... [2023-03-07 05:29:36,225][117718] Waiting for process rollout_proc30 to join... [2023-03-07 05:29:36,225][117718] Waiting for process rollout_proc31 to join... 
[2023-03-07 05:29:36,226][117718] Batcher 0 profile tree view:
batching: 833.7163, releasing_batches: 1.6474
[2023-03-07 05:29:36,226][117718] InferenceWorker_p0-w0 profile tree view:
wait_policy: 0.0001
  wait_policy_total: 249.6478
update_model: 135.4721
  weight_update: 0.0007
one_step: 0.0060
  handle_policy_step: 6842.3404
    deserialize: 214.4844, stack: 36.4307, obs_to_device_normalize: 1208.6141, forward: 3061.8228, send_messages: 1335.6389
    prepare_outputs: 715.3515
      to_cpu: 361.7069
[2023-03-07 05:29:36,226][117718] Learner 0 profile tree view:
misc: 0.4825, prepare_batch: 418.9541
train: 905.9831
  epoch_init: 0.3772, minibatch_init: 0.4017, losses_postprocess: 30.1376, kl_divergence: 35.4884, after_optimizer: 100.2419
  calculate_losses: 299.9134
    losses_init: 0.2110, forward_head: 16.5109, bptt_initial: 109.3763, tail: 60.1101, advantages_returns: 7.4198, losses: 28.1246
    bptt: 69.2444
      bptt_forward_core: 66.7965
  update: 417.0908
    clip: 54.4586
[2023-03-07 05:29:36,227][117718] RolloutWorker_w0 profile tree view:
wait_for_trajectories: 4.0064, enqueue_policy_requests: 185.0065, env_step: 2956.7711, overhead: 158.4219, complete_rollouts: 9.4728
save_policy_outputs: 223.9348
  split_output_tensors: 108.9220
[2023-03-07 05:29:36,227][117718] RolloutWorker_w31 profile tree view:
wait_for_trajectories: 4.0257, enqueue_policy_requests: 182.1478, env_step: 3001.8462, overhead: 163.7233, complete_rollouts: 9.4758
save_policy_outputs: 224.2406
  split_output_tensors: 109.8690
[2023-03-07 05:29:36,227][117718] Loop Runner_EvtLoop terminating...
[2023-03-07 05:29:36,228][117718] Runner profile tree view:
main_loop: 7617.1370
[2023-03-07 05:29:36,228][117718] Collected {0: 100001792}, FPS: 13128.5