[2023-03-07 14:06:08,913][213445] Saving configuration to /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/config.json... [2023-03-07 14:06:08,928][213445] Rollout worker 0 uses device cpu [2023-03-07 14:06:08,929][213445] Rollout worker 1 uses device cpu [2023-03-07 14:06:08,929][213445] Rollout worker 2 uses device cpu [2023-03-07 14:06:08,929][213445] Rollout worker 3 uses device cpu [2023-03-07 14:06:08,929][213445] Rollout worker 4 uses device cpu [2023-03-07 14:06:08,929][213445] Rollout worker 5 uses device cpu [2023-03-07 14:06:08,929][213445] Rollout worker 6 uses device cpu [2023-03-07 14:06:08,930][213445] Rollout worker 7 uses device cpu [2023-03-07 14:06:08,930][213445] Rollout worker 8 uses device cpu [2023-03-07 14:06:08,930][213445] Rollout worker 9 uses device cpu [2023-03-07 14:06:08,930][213445] Rollout worker 10 uses device cpu [2023-03-07 14:06:08,930][213445] Rollout worker 11 uses device cpu [2023-03-07 14:06:08,930][213445] Rollout worker 12 uses device cpu [2023-03-07 14:06:08,931][213445] Rollout worker 13 uses device cpu [2023-03-07 14:06:08,931][213445] Rollout worker 14 uses device cpu [2023-03-07 14:06:08,931][213445] Rollout worker 15 uses device cpu [2023-03-07 14:06:08,931][213445] Rollout worker 16 uses device cpu [2023-03-07 14:06:08,931][213445] Rollout worker 17 uses device cpu [2023-03-07 14:06:08,931][213445] Rollout worker 18 uses device cpu [2023-03-07 14:06:08,931][213445] Rollout worker 19 uses device cpu [2023-03-07 14:06:08,931][213445] Rollout worker 20 uses device cpu [2023-03-07 14:06:08,932][213445] Rollout worker 21 uses device cpu [2023-03-07 14:06:08,932][213445] Rollout worker 22 uses device cpu [2023-03-07 14:06:08,932][213445] Rollout worker 23 uses device cpu [2023-03-07 14:06:08,932][213445] Rollout worker 24 uses device cpu [2023-03-07 14:06:08,932][213445] Rollout worker 25 uses device cpu [2023-03-07 14:06:08,932][213445] Rollout worker 26 uses device cpu [2023-03-07 14:06:08,932][213445] Rollout worker 27 uses device cpu [2023-03-07 14:06:08,932][213445] Rollout worker 28 uses device cpu [2023-03-07 14:06:08,932][213445] Rollout worker 29 uses device cpu [2023-03-07 14:06:08,933][213445] Rollout worker 30 uses device cpu [2023-03-07 14:06:08,933][213445] Rollout worker 31 uses device cpu [2023-03-07 14:06:08,946][213445] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-03-07 14:06:08,946][213445] InferenceWorker_p0-w0: min num requests: 10 [2023-03-07 14:06:09,028][213445] Starting all processes... [2023-03-07 14:06:09,029][213445] Starting process learner_proc0 [2023-03-07 14:06:09,078][213445] Starting all processes... [2023-03-07 14:06:09,126][213445] Starting process inference_proc0-0 [2023-03-07 14:06:09,135][213445] Starting process rollout_proc0 [2023-03-07 14:06:09,135][213445] Starting process rollout_proc1 [2023-03-07 14:06:09,136][213445] Starting process rollout_proc2 [2023-03-07 14:06:09,136][213445] Starting process rollout_proc3 [2023-03-07 14:06:09,136][213445] Starting process rollout_proc4 [2023-03-07 14:06:09,136][213445] Starting process rollout_proc5 [2023-03-07 14:06:09,136][213445] Starting process rollout_proc6 [2023-03-07 14:06:09,137][213445] Starting process rollout_proc7 [2023-03-07 14:06:09,139][213445] Starting process rollout_proc8 [2023-03-07 14:06:09,139][213445] Starting process rollout_proc9 [2023-03-07 14:06:09,147][213445] Starting process rollout_proc10 [2023-03-07 14:06:09,150][213445] Starting process rollout_proc11 [2023-03-07 14:06:09,155][213445] Starting process rollout_proc12 [2023-03-07 14:06:09,155][213445] Starting process rollout_proc13 [2023-03-07 14:06:09,156][213445] Starting process rollout_proc14 [2023-03-07 14:06:09,163][213445] Starting process rollout_proc15 [2023-03-07 14:06:09,164][213445] Starting process rollout_proc16 [2023-03-07 14:06:09,164][213445] Starting process rollout_proc17 [2023-03-07 14:06:09,165][213445] Starting process rollout_proc18 [2023-03-07 14:06:09,172][213445] Starting process rollout_proc19 [2023-03-07 14:06:09,183][213445] Starting process rollout_proc20 [2023-03-07 14:06:09,189][213445] Starting process rollout_proc21 [2023-03-07 14:06:09,313][213445] Starting process rollout_proc22 [2023-03-07 14:06:09,323][213445] Starting process rollout_proc23 [2023-03-07 14:06:09,365][213445] Starting process rollout_proc24 [2023-03-07 14:06:09,370][213445] Starting process rollout_proc25 [2023-03-07 14:06:09,375][213445] Starting process rollout_proc26 [2023-03-07 14:06:09,375][213445] Starting process rollout_proc27 [2023-03-07 14:06:09,375][213445] Starting process rollout_proc28 [2023-03-07 14:06:09,376][213445] Starting process rollout_proc29 [2023-03-07 14:06:09,376][213445] Starting process rollout_proc30 [2023-03-07 14:06:09,376][213445] Starting process rollout_proc31 [2023-03-07 14:06:11,006][213720] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-03-07 14:06:11,006][213720] Set environment var CUDA_VISIBLE_DEVICES to '0' (GPU indices [0]) for learning process 0 [2023-03-07 14:06:11,015][213720] Num visible devices: 1 [2023-03-07 14:06:11,033][213720] WARNING! It is generally recommended to enable Fixed KL loss (https://arxiv.org/pdf/1707.06347.pdf) for continuous action tasks to avoid potential numerical issues. I.e. set --kl_loss_coeff=0.1 [2023-03-07 14:06:11,033][213720] Starting seed is not provided [2023-03-07 14:06:11,033][213720] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-03-07 14:06:11,033][213720] Initializing actor-critic model on device cuda:0 [2023-03-07 14:06:11,034][213720] RunningMeanStd input shape: (39,) [2023-03-07 14:06:11,034][213720] RunningMeanStd input shape: (1,) [2023-03-07 14:06:11,056][213771] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-03-07 14:06:11,056][213771] Set environment var CUDA_VISIBLE_DEVICES to '0' (GPU indices [0]) for inference process 0 [2023-03-07 14:06:11,070][213771] Num visible devices: 1 [2023-03-07 14:06:11,130][213774] Worker 0 uses CPU cores [0] [2023-03-07 14:06:11,133][213720] Created Actor Critic model with architecture: [2023-03-07 14:06:11,133][213720] ActorCriticSharedWeights( (obs_normalizer): ObservationNormalizer( (running_mean_std): RunningMeanStdDictInPlace( (running_mean_std): ModuleDict( (obs): RunningMeanStdInPlace() ) ) ) (returns_normalizer): RecursiveScriptModule(original_name=RunningMeanStdInPlace) (encoder): MultiInputEncoder( (encoders): ModuleDict( (obs): MlpEncoder( (mlp_head): RecursiveScriptModule( original_name=Sequential (0): RecursiveScriptModule(original_name=Linear) (1): RecursiveScriptModule(original_name=ELU) (2): RecursiveScriptModule(original_name=Linear) (3): RecursiveScriptModule(original_name=ELU) ) ) ) ) (core): ModelCoreRNN( (core): GRU(512, 512) ) (decoder): MlpDecoder( (mlp): Identity() ) (critic_linear): Linear(in_features=512, out_features=1, bias=True) (action_parameterization): ActionParameterizationDefault( (distribution_linear): Linear(in_features=512, out_features=8, bias=True) ) ) [2023-03-07 14:06:11,223][213937] Worker 21 uses CPU cores [21] [2023-03-07 14:06:11,341][213775] Worker 3 uses CPU cores [3] [2023-03-07 14:06:11,619][214074] Worker 23 uses CPU cores [23] [2023-03-07 14:06:11,622][214071] Worker 12 uses CPU cores [12] [2023-03-07 14:06:11,771][214198] Worker 27 uses CPU cores [27] [2023-03-07 14:06:11,893][214072] Worker 20 uses CPU cores [20] [2023-03-07 14:06:12,059][213772] Worker 1 uses CPU cores [1] [2023-03-07 14:06:12,162][213839] Worker 5 uses CPU cores [5] [2023-03-07 14:06:12,319][214170] Worker 25 uses CPU cores [25] [2023-03-07 14:06:12,387][214036] Worker 8 uses CPU cores [8] [2023-03-07 14:06:12,434][213934] Worker 6 uses CPU cores [6] [2023-03-07 14:06:12,604][214069] Worker 17 uses CPU cores [17] [2023-03-07 14:06:12,671][213720] Using optimizer [2023-03-07 14:06:12,671][213720] No checkpoints found [2023-03-07 14:06:12,682][213720] Did not load from checkpoint, starting from scratch! [2023-03-07 14:06:12,682][213720] Initialized policy 0 weights for model version 0 [2023-03-07 14:06:12,687][213720] LearnerWorker_p0 finished initialization! [2023-03-07 14:06:12,687][213720] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-03-07 14:06:12,745][214070] Worker 19 uses CPU cores [19] [2023-03-07 14:06:12,767][213970] Worker 16 uses CPU cores [16] [2023-03-07 14:06:12,770][213771] RunningMeanStd input shape: (39,) [2023-03-07 14:06:12,770][213771] RunningMeanStd input shape: (1,) [2023-03-07 14:06:12,943][214206] Worker 29 uses CPU cores [29] [2023-03-07 14:06:13,043][213971] Worker 13 uses CPU cores [13] [2023-03-07 14:06:13,112][213973] Worker 10 uses CPU cores [10] [2023-03-07 14:06:13,235][213773] Worker 2 uses CPU cores [2] [2023-03-07 14:06:13,335][214205] Worker 28 uses CPU cores [28] [2023-03-07 14:06:13,541][213936] Worker 7 uses CPU cores [7] [2023-03-07 14:06:13,733][213445] Inference worker 0-0 is ready! [2023-03-07 14:06:13,734][213445] All inference workers are ready! Signal rollout workers to start! [2023-03-07 14:06:13,807][213972] Worker 14 uses CPU cores [14] [2023-03-07 14:06:13,882][214239] Worker 30 uses CPU cores [30] [2023-03-07 14:06:13,895][213933] Worker 11 uses CPU cores [11] [2023-03-07 14:06:14,144][214197] Worker 26 uses CPU cores [26] [2023-03-07 14:06:14,175][214254] Worker 31 uses CPU cores [31] [2023-03-07 14:06:14,215][214073] Worker 15 uses CPU cores [15] [2023-03-07 14:06:14,480][214204] Worker 22 uses CPU cores [22] [2023-03-07 14:06:14,555][214139] Worker 24 uses CPU cores [24] [2023-03-07 14:06:14,806][213807] Worker 4 uses CPU cores [4] [2023-03-07 14:06:14,921][214170] Decorrelating experience for 0 frames... [2023-03-07 14:06:15,023][213935] Worker 18 uses CPU cores [18] [2023-03-07 14:06:15,028][213973] Decorrelating experience for 0 frames... [2023-03-07 14:06:15,038][213934] Decorrelating experience for 0 frames... [2023-03-07 14:06:15,056][214072] Decorrelating experience for 0 frames... [2023-03-07 14:06:15,061][213773] Decorrelating experience for 0 frames... [2023-03-07 14:06:15,151][214198] Decorrelating experience for 0 frames... [2023-03-07 14:06:15,162][214036] Decorrelating experience for 0 frames... [2023-03-07 14:06:15,235][213772] Decorrelating experience for 0 frames... [2023-03-07 14:06:15,282][213971] Decorrelating experience for 0 frames... [2023-03-07 14:06:15,286][213936] Decorrelating experience for 0 frames... [2023-03-07 14:06:15,288][213775] Decorrelating experience for 0 frames... [2023-03-07 14:06:15,291][213774] Decorrelating experience for 0 frames... [2023-03-07 14:06:15,292][213970] Decorrelating experience for 0 frames... [2023-03-07 14:06:15,292][213937] Decorrelating experience for 0 frames... [2023-03-07 14:06:15,292][213839] Decorrelating experience for 0 frames... [2023-03-07 14:06:15,296][214206] Decorrelating experience for 0 frames... [2023-03-07 14:06:15,298][214070] Decorrelating experience for 0 frames... [2023-03-07 14:06:15,313][213969] Worker 9 uses CPU cores [9] [2023-03-07 14:06:15,325][214074] Decorrelating experience for 0 frames... [2023-03-07 14:06:15,326][214069] Decorrelating experience for 0 frames... [2023-03-07 14:06:15,402][214239] Decorrelating experience for 0 frames... [2023-03-07 14:06:15,441][213933] Decorrelating experience for 0 frames... [2023-03-07 14:06:15,474][214071] Decorrelating experience for 0 frames... [2023-03-07 14:06:15,476][214205] Decorrelating experience for 0 frames... [2023-03-07 14:06:15,614][213972] Decorrelating experience for 0 frames... [2023-03-07 14:06:15,813][214197] Decorrelating experience for 0 frames... [2023-03-07 14:06:15,854][214073] Decorrelating experience for 0 frames... [2023-03-07 14:06:15,889][214254] Decorrelating experience for 0 frames... [2023-03-07 14:06:16,106][213445] Fps is (10 sec: nan, 60 sec: nan, 300 sec: nan). Total num frames: 0. Throughput: 0: nan. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0) [2023-03-07 14:06:16,137][214170] Decorrelating experience for 32 frames... [2023-03-07 14:06:16,169][214139] Decorrelating experience for 0 frames... [2023-03-07 14:06:16,363][214204] Decorrelating experience for 0 frames... [2023-03-07 14:06:16,492][213973] Decorrelating experience for 32 frames... [2023-03-07 14:06:16,537][213934] Decorrelating experience for 32 frames... [2023-03-07 14:06:16,538][214072] Decorrelating experience for 32 frames... [2023-03-07 14:06:16,600][214198] Decorrelating experience for 32 frames... [2023-03-07 14:06:16,613][214036] Decorrelating experience for 32 frames... [2023-03-07 14:06:16,631][213807] Decorrelating experience for 0 frames... [2023-03-07 14:06:16,672][213935] Decorrelating experience for 0 frames... [2023-03-07 14:06:16,672][213773] Decorrelating experience for 32 frames... [2023-03-07 14:06:16,735][213772] Decorrelating experience for 32 frames... [2023-03-07 14:06:16,777][214206] Decorrelating experience for 32 frames... [2023-03-07 14:06:16,791][214069] Decorrelating experience for 32 frames... [2023-03-07 14:06:16,798][213933] Decorrelating experience for 32 frames... [2023-03-07 14:06:16,817][213936] Decorrelating experience for 32 frames... [2023-03-07 14:06:16,833][213839] Decorrelating experience for 32 frames... [2023-03-07 14:06:16,837][213937] Decorrelating experience for 32 frames... [2023-03-07 14:06:16,840][214070] Decorrelating experience for 32 frames... [2023-03-07 14:06:16,841][213775] Decorrelating experience for 32 frames... [2023-03-07 14:06:16,842][214239] Decorrelating experience for 32 frames... [2023-03-07 14:06:16,848][213970] Decorrelating experience for 32 frames... [2023-03-07 14:06:16,848][213774] Decorrelating experience for 32 frames... [2023-03-07 14:06:16,872][214074] Decorrelating experience for 32 frames... [2023-03-07 14:06:16,943][213969] Decorrelating experience for 0 frames... [2023-03-07 14:06:16,946][214071] Decorrelating experience for 32 frames... [2023-03-07 14:06:16,947][214205] Decorrelating experience for 32 frames... [2023-03-07 14:06:16,986][213971] Decorrelating experience for 32 frames... [2023-03-07 14:06:16,992][213972] Decorrelating experience for 32 frames... [2023-03-07 14:06:17,030][214197] Decorrelating experience for 32 frames... [2023-03-07 14:06:17,112][213720] Signal inference workers to stop experience collection... [2023-03-07 14:06:17,117][213771] InferenceWorker_p0-w0: stopping experience collection [2023-03-07 14:06:17,255][214073] Decorrelating experience for 32 frames... [2023-03-07 14:06:17,270][214139] Decorrelating experience for 32 frames... [2023-03-07 14:06:17,275][214254] Decorrelating experience for 32 frames... [2023-03-07 14:06:17,350][214204] Decorrelating experience for 32 frames... [2023-03-07 14:06:17,431][213720] Signal inference workers to resume experience collection... [2023-03-07 14:06:17,432][213771] InferenceWorker_p0-w0: resuming experience collection [2023-03-07 14:06:17,537][213807] Decorrelating experience for 32 frames... [2023-03-07 14:06:17,579][213935] Decorrelating experience for 32 frames... [2023-03-07 14:06:17,808][213969] Decorrelating experience for 32 frames... [2023-03-07 14:06:18,567][213771] Updated weights for policy 0, policy_version 10 (0.0214) [2023-03-07 14:06:19,324][213771] Updated weights for policy 0, policy_version 20 (0.0006) [2023-03-07 14:06:20,104][213771] Updated weights for policy 0, policy_version 30 (0.0006) [2023-03-07 14:06:20,871][213771] Updated weights for policy 0, policy_version 40 (0.0006) [2023-03-07 14:06:21,105][213445] Fps is (10 sec: 8602.4, 60 sec: 8602.4, 300 sec: 8602.4). Total num frames: 43008. Throughput: 0: 5448.3. Samples: 27239. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:06:21,106][213445] Avg episode reward: [(0, '58.779')] [2023-03-07 14:06:21,645][213771] Updated weights for policy 0, policy_version 50 (0.0007) [2023-03-07 14:06:22,416][213771] Updated weights for policy 0, policy_version 60 (0.0006) [2023-03-07 14:06:23,193][213771] Updated weights for policy 0, policy_version 70 (0.0006) [2023-03-07 14:06:23,951][213771] Updated weights for policy 0, policy_version 80 (0.0006) [2023-03-07 14:06:24,734][213771] Updated weights for policy 0, policy_version 90 (0.0006) [2023-03-07 14:06:25,515][213771] Updated weights for policy 0, policy_version 100 (0.0007) [2023-03-07 14:06:26,105][213445] Fps is (10 sec: 10957.2, 60 sec: 10957.2, 300 sec: 10957.2). Total num frames: 109568. Throughput: 0: 10648.5. Samples: 106481. Policy #0 lag: (min: 0.0, avg: 0.9, max: 2.0) [2023-03-07 14:06:26,106][213445] Avg episode reward: [(0, '182.505')] [2023-03-07 14:06:26,110][213720] Saving new best policy, reward=182.505! [2023-03-07 14:06:26,305][213771] Updated weights for policy 0, policy_version 110 (0.0006) [2023-03-07 14:06:27,071][213771] Updated weights for policy 0, policy_version 120 (0.0007) [2023-03-07 14:06:27,839][213771] Updated weights for policy 0, policy_version 130 (0.0006) [2023-03-07 14:06:28,584][213771] Updated weights for policy 0, policy_version 140 (0.0007) [2023-03-07 14:06:28,942][213445] Heartbeat connected on Batcher_0 [2023-03-07 14:06:28,944][213445] Heartbeat connected on LearnerWorker_p0 [2023-03-07 14:06:28,949][213445] Heartbeat connected on RolloutWorker_w0 [2023-03-07 14:06:28,950][213445] Heartbeat connected on InferenceWorker_p0-w0 [2023-03-07 14:06:28,951][213445] Heartbeat connected on RolloutWorker_w1 [2023-03-07 14:06:28,953][213445] Heartbeat connected on RolloutWorker_w2 [2023-03-07 14:06:28,955][213445] Heartbeat connected on RolloutWorker_w3 [2023-03-07 14:06:28,957][213445] Heartbeat connected on RolloutWorker_w4 [2023-03-07 14:06:28,961][213445] Heartbeat connected on RolloutWorker_w5 [2023-03-07 14:06:28,962][213445] Heartbeat connected on RolloutWorker_w6 [2023-03-07 14:06:28,964][213445] Heartbeat connected on RolloutWorker_w8 [2023-03-07 14:06:28,967][213445] Heartbeat connected on RolloutWorker_w9 [2023-03-07 14:06:28,968][213445] Heartbeat connected on RolloutWorker_w10 [2023-03-07 14:06:28,969][213445] Heartbeat connected on RolloutWorker_w7 [2023-03-07 14:06:28,990][213445] Heartbeat connected on RolloutWorker_w11 [2023-03-07 14:06:28,991][213445] Heartbeat connected on RolloutWorker_w12 [2023-03-07 14:06:28,994][213445] Heartbeat connected on RolloutWorker_w13 [2023-03-07 14:06:28,995][213445] Heartbeat connected on RolloutWorker_w14 [2023-03-07 14:06:28,997][213445] Heartbeat connected on RolloutWorker_w15 [2023-03-07 14:06:29,000][213445] Heartbeat connected on RolloutWorker_w16 [2023-03-07 14:06:29,001][213445] Heartbeat connected on RolloutWorker_w17 [2023-03-07 14:06:29,002][213445] Heartbeat connected on RolloutWorker_w18 [2023-03-07 14:06:29,004][213445] Heartbeat connected on RolloutWorker_w19 [2023-03-07 14:06:29,006][213445] Heartbeat connected on RolloutWorker_w20 [2023-03-07 14:06:29,008][213445] Heartbeat connected on RolloutWorker_w21 [2023-03-07 14:06:29,010][213445] Heartbeat connected on RolloutWorker_w22 [2023-03-07 14:06:29,012][213445] Heartbeat connected on RolloutWorker_w23 [2023-03-07 14:06:29,014][213445] Heartbeat connected on RolloutWorker_w24 [2023-03-07 14:06:29,017][213445] Heartbeat connected on RolloutWorker_w25 [2023-03-07 14:06:29,019][213445] Heartbeat connected on RolloutWorker_w26 [2023-03-07 14:06:29,019][213445] Heartbeat connected on RolloutWorker_w27 [2023-03-07 14:06:29,021][213445] Heartbeat connected on RolloutWorker_w28 [2023-03-07 14:06:29,023][213445] Heartbeat connected on RolloutWorker_w29 [2023-03-07 14:06:29,025][213445] Heartbeat connected on RolloutWorker_w30 [2023-03-07 14:06:29,027][213445] Heartbeat connected on RolloutWorker_w31 [2023-03-07 14:06:29,367][213771] Updated weights for policy 0, policy_version 150 (0.0006) [2023-03-07 14:06:30,149][213771] Updated weights for policy 0, policy_version 160 (0.0006) [2023-03-07 14:06:30,910][213771] Updated weights for policy 0, policy_version 170 (0.0006) [2023-03-07 14:06:31,105][213445] Fps is (10 sec: 13311.8, 60 sec: 11742.1, 300 sec: 11742.1). Total num frames: 176128. Throughput: 0: 9763.0. Samples: 146442. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:06:31,106][213445] Avg episode reward: [(0, '311.254')] [2023-03-07 14:06:31,107][213720] Saving new best policy, reward=311.254! [2023-03-07 14:06:31,683][213771] Updated weights for policy 0, policy_version 180 (0.0006) [2023-03-07 14:06:32,475][213771] Updated weights for policy 0, policy_version 190 (0.0006) [2023-03-07 14:06:33,248][213771] Updated weights for policy 0, policy_version 200 (0.0006) [2023-03-07 14:06:34,018][213771] Updated weights for policy 0, policy_version 210 (0.0006) [2023-03-07 14:06:34,818][213771] Updated weights for policy 0, policy_version 220 (0.0006) [2023-03-07 14:06:35,587][213771] Updated weights for policy 0, policy_version 230 (0.0006) [2023-03-07 14:06:36,105][213445] Fps is (10 sec: 13209.5, 60 sec: 12083.4, 300 sec: 12083.4). Total num frames: 241664. Throughput: 0: 11277.2. Samples: 225541. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:06:36,106][213445] Avg episode reward: [(0, '522.844')] [2023-03-07 14:06:36,110][213720] Saving new best policy, reward=522.844! [2023-03-07 14:06:36,351][213771] Updated weights for policy 0, policy_version 240 (0.0006) [2023-03-07 14:06:37,146][213771] Updated weights for policy 0, policy_version 250 (0.0006) [2023-03-07 14:06:37,910][213771] Updated weights for policy 0, policy_version 260 (0.0006) [2023-03-07 14:06:38,689][213771] Updated weights for policy 0, policy_version 270 (0.0006) [2023-03-07 14:06:39,482][213771] Updated weights for policy 0, policy_version 280 (0.0006) [2023-03-07 14:06:40,273][213771] Updated weights for policy 0, policy_version 290 (0.0006) [2023-03-07 14:06:41,026][213771] Updated weights for policy 0, policy_version 300 (0.0005) [2023-03-07 14:06:41,105][213445] Fps is (10 sec: 13107.2, 60 sec: 12288.1, 300 sec: 12288.1). Total num frames: 307200. Throughput: 0: 12176.4. Samples: 304407. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:06:41,106][213445] Avg episode reward: [(0, '952.879')] [2023-03-07 14:06:41,107][213720] Saving new best policy, reward=952.879! [2023-03-07 14:06:41,820][213771] Updated weights for policy 0, policy_version 310 (0.0006) [2023-03-07 14:06:42,616][213771] Updated weights for policy 0, policy_version 320 (0.0006) [2023-03-07 14:06:43,391][213771] Updated weights for policy 0, policy_version 330 (0.0006) [2023-03-07 14:06:44,174][213771] Updated weights for policy 0, policy_version 340 (0.0006) [2023-03-07 14:06:44,948][213771] Updated weights for policy 0, policy_version 350 (0.0006) [2023-03-07 14:06:45,718][213771] Updated weights for policy 0, policy_version 360 (0.0006) [2023-03-07 14:06:46,105][213445] Fps is (10 sec: 13209.8, 60 sec: 12458.8, 300 sec: 12458.8). Total num frames: 373760. Throughput: 0: 11466.6. Samples: 343994. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:06:46,106][213445] Avg episode reward: [(0, '787.131')] [2023-03-07 14:06:46,486][213771] Updated weights for policy 0, policy_version 370 (0.0006) [2023-03-07 14:06:47,291][213771] Updated weights for policy 0, policy_version 380 (0.0006) [2023-03-07 14:06:48,058][213771] Updated weights for policy 0, policy_version 390 (0.0006) [2023-03-07 14:06:48,826][213771] Updated weights for policy 0, policy_version 400 (0.0006) [2023-03-07 14:06:49,614][213771] Updated weights for policy 0, policy_version 410 (0.0006) [2023-03-07 14:06:50,387][213771] Updated weights for policy 0, policy_version 420 (0.0007) [2023-03-07 14:06:51,105][213445] Fps is (10 sec: 13209.8, 60 sec: 12551.5, 300 sec: 12551.5). Total num frames: 439296. Throughput: 0: 12078.4. Samples: 422740. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:06:51,106][213445] Avg episode reward: [(0, '899.442')] [2023-03-07 14:06:51,152][213771] Updated weights for policy 0, policy_version 430 (0.0006) [2023-03-07 14:06:51,934][213771] Updated weights for policy 0, policy_version 440 (0.0005) [2023-03-07 14:06:52,700][213771] Updated weights for policy 0, policy_version 450 (0.0006) [2023-03-07 14:06:53,477][213771] Updated weights for policy 0, policy_version 460 (0.0006) [2023-03-07 14:06:54,253][213771] Updated weights for policy 0, policy_version 470 (0.0006) [2023-03-07 14:06:55,020][213771] Updated weights for policy 0, policy_version 480 (0.0006) [2023-03-07 14:06:55,782][213771] Updated weights for policy 0, policy_version 490 (0.0007) [2023-03-07 14:06:56,105][213445] Fps is (10 sec: 13209.6, 60 sec: 12646.5, 300 sec: 12646.5). Total num frames: 505856. Throughput: 0: 12561.0. Samples: 502434. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:06:56,106][213445] Avg episode reward: [(0, '1233.680')] [2023-03-07 14:06:56,110][213720] Saving new best policy, reward=1233.680! [2023-03-07 14:06:56,567][213771] Updated weights for policy 0, policy_version 500 (0.0005) [2023-03-07 14:06:57,358][213771] Updated weights for policy 0, policy_version 510 (0.0006) [2023-03-07 14:06:58,130][213771] Updated weights for policy 0, policy_version 520 (0.0006) [2023-03-07 14:06:58,910][213771] Updated weights for policy 0, policy_version 530 (0.0006) [2023-03-07 14:06:59,682][213771] Updated weights for policy 0, policy_version 540 (0.0007) [2023-03-07 14:07:00,440][213771] Updated weights for policy 0, policy_version 550 (0.0006) [2023-03-07 14:07:01,105][213445] Fps is (10 sec: 13209.7, 60 sec: 12697.7, 300 sec: 12697.7). Total num frames: 571392. Throughput: 0: 12039.9. Samples: 541787. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:07:01,106][213445] Avg episode reward: [(0, '1479.727')] [2023-03-07 14:07:01,106][213720] Saving new best policy, reward=1479.727! [2023-03-07 14:07:01,226][213771] Updated weights for policy 0, policy_version 560 (0.0006) [2023-03-07 14:07:02,003][213771] Updated weights for policy 0, policy_version 570 (0.0006) [2023-03-07 14:07:02,762][213771] Updated weights for policy 0, policy_version 580 (0.0006) [2023-03-07 14:07:03,548][213771] Updated weights for policy 0, policy_version 590 (0.0007) [2023-03-07 14:07:04,331][213771] Updated weights for policy 0, policy_version 600 (0.0006) [2023-03-07 14:07:05,101][213771] Updated weights for policy 0, policy_version 610 (0.0006) [2023-03-07 14:07:05,873][213771] Updated weights for policy 0, policy_version 620 (0.0006) [2023-03-07 14:07:06,105][213445] Fps is (10 sec: 13107.4, 60 sec: 12738.7, 300 sec: 12738.7). Total num frames: 636928. Throughput: 0: 13194.9. Samples: 621010. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:07:06,105][213445] Avg episode reward: [(0, '1769.815')] [2023-03-07 14:07:06,108][213720] Saving new best policy, reward=1769.815! [2023-03-07 14:07:06,664][213771] Updated weights for policy 0, policy_version 630 (0.0006) [2023-03-07 14:07:07,420][213771] Updated weights for policy 0, policy_version 640 (0.0006) [2023-03-07 14:07:08,190][213771] Updated weights for policy 0, policy_version 650 (0.0006) [2023-03-07 14:07:08,975][213771] Updated weights for policy 0, policy_version 660 (0.0006) [2023-03-07 14:07:09,757][213771] Updated weights for policy 0, policy_version 670 (0.0006) [2023-03-07 14:07:10,530][213771] Updated weights for policy 0, policy_version 680 (0.0006) [2023-03-07 14:07:11,105][213445] Fps is (10 sec: 13209.6, 60 sec: 12790.8, 300 sec: 12790.8). Total num frames: 703488. Throughput: 0: 13196.3. Samples: 700312. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:07:11,106][213445] Avg episode reward: [(0, '2482.088')] [2023-03-07 14:07:11,106][213720] Saving new best policy, reward=2482.088! [2023-03-07 14:07:11,305][213771] Updated weights for policy 0, policy_version 690 (0.0006) [2023-03-07 14:07:12,093][213771] Updated weights for policy 0, policy_version 700 (0.0006) [2023-03-07 14:07:12,840][213771] Updated weights for policy 0, policy_version 710 (0.0007) [2023-03-07 14:07:13,611][213771] Updated weights for policy 0, policy_version 720 (0.0006) [2023-03-07 14:07:14,387][213771] Updated weights for policy 0, policy_version 730 (0.0006) [2023-03-07 14:07:15,169][213771] Updated weights for policy 0, policy_version 740 (0.0006) [2023-03-07 14:07:15,952][213771] Updated weights for policy 0, policy_version 750 (0.0007) [2023-03-07 14:07:16,105][213445] Fps is (10 sec: 13209.6, 60 sec: 12817.2, 300 sec: 12817.2). Total num frames: 769024. Throughput: 0: 13190.2. Samples: 739997. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:07:16,105][213445] Avg episode reward: [(0, '2562.022')] [2023-03-07 14:07:16,110][213720] Saving new best policy, reward=2562.022! [2023-03-07 14:07:16,741][213771] Updated weights for policy 0, policy_version 760 (0.0006) [2023-03-07 14:07:17,517][213771] Updated weights for policy 0, policy_version 770 (0.0007) [2023-03-07 14:07:18,284][213771] Updated weights for policy 0, policy_version 780 (0.0006) [2023-03-07 14:07:19,064][213771] Updated weights for policy 0, policy_version 790 (0.0006) [2023-03-07 14:07:19,830][213771] Updated weights for policy 0, policy_version 800 (0.0006) [2023-03-07 14:07:20,611][213771] Updated weights for policy 0, policy_version 810 (0.0006) [2023-03-07 14:07:21,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13209.6, 300 sec: 12855.2). Total num frames: 835584. Throughput: 0: 13193.0. Samples: 819224. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:07:21,106][213445] Avg episode reward: [(0, '2233.776')] [2023-03-07 14:07:21,396][213771] Updated weights for policy 0, policy_version 820 (0.0006) [2023-03-07 14:07:22,162][213771] Updated weights for policy 0, policy_version 830 (0.0005) [2023-03-07 14:07:22,948][213771] Updated weights for policy 0, policy_version 840 (0.0006) [2023-03-07 14:07:23,725][213771] Updated weights for policy 0, policy_version 850 (0.0006) [2023-03-07 14:07:24,478][213771] Updated weights for policy 0, policy_version 860 (0.0006) [2023-03-07 14:07:25,261][213771] Updated weights for policy 0, policy_version 870 (0.0006) [2023-03-07 14:07:26,056][213771] Updated weights for policy 0, policy_version 880 (0.0005) [2023-03-07 14:07:26,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13192.5, 300 sec: 12873.2). Total num frames: 901120. Throughput: 0: 13196.2. Samples: 898236. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:07:26,106][213445] Avg episode reward: [(0, '2088.733')] [2023-03-07 14:07:26,829][213771] Updated weights for policy 0, policy_version 890 (0.0007) [2023-03-07 14:07:27,596][213771] Updated weights for policy 0, policy_version 900 (0.0006) [2023-03-07 14:07:28,381][213771] Updated weights for policy 0, policy_version 910 (0.0006) [2023-03-07 14:07:29,153][213771] Updated weights for policy 0, policy_version 920 (0.0007) [2023-03-07 14:07:29,928][213771] Updated weights for policy 0, policy_version 930 (0.0006) [2023-03-07 14:07:30,719][213771] Updated weights for policy 0, policy_version 940 (0.0006) [2023-03-07 14:07:31,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13192.5, 300 sec: 12902.5). Total num frames: 967680. Throughput: 0: 13196.7. Samples: 937848. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 14:07:31,106][213445] Avg episode reward: [(0, '2356.526')] [2023-03-07 14:07:31,486][213771] Updated weights for policy 0, policy_version 950 (0.0006) [2023-03-07 14:07:32,264][213771] Updated weights for policy 0, policy_version 960 (0.0007) [2023-03-07 14:07:33,044][213771] Updated weights for policy 0, policy_version 970 (0.0007) [2023-03-07 14:07:33,825][213771] Updated weights for policy 0, policy_version 980 (0.0006) [2023-03-07 14:07:34,585][213771] Updated weights for policy 0, policy_version 990 (0.0006) [2023-03-07 14:07:35,367][213771] Updated weights for policy 0, policy_version 1000 (0.0006) [2023-03-07 14:07:36,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13192.6, 300 sec: 12915.3). Total num frames: 1033216. Throughput: 0: 13210.3. Samples: 1017203. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:07:36,106][213445] Avg episode reward: [(0, '2566.121')] [2023-03-07 14:07:36,110][213720] Saving new best policy, reward=2566.121! [2023-03-07 14:07:36,162][213771] Updated weights for policy 0, policy_version 1010 (0.0006) [2023-03-07 14:07:36,891][213771] Updated weights for policy 0, policy_version 1020 (0.0006) [2023-03-07 14:07:37,675][213771] Updated weights for policy 0, policy_version 1030 (0.0006) [2023-03-07 14:07:38,446][213771] Updated weights for policy 0, policy_version 1040 (0.0006) [2023-03-07 14:07:39,208][213771] Updated weights for policy 0, policy_version 1050 (0.0006) [2023-03-07 14:07:40,000][213771] Updated weights for policy 0, policy_version 1060 (0.0006) [2023-03-07 14:07:40,763][213771] Updated weights for policy 0, policy_version 1070 (0.0006) [2023-03-07 14:07:41,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13209.6, 300 sec: 12938.6). Total num frames: 1099776. Throughput: 0: 13203.7. Samples: 1096600. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:07:41,106][213445] Avg episode reward: [(0, '3039.701')] [2023-03-07 14:07:41,106][213720] Saving new best policy, reward=3039.701! [2023-03-07 14:07:41,534][213771] Updated weights for policy 0, policy_version 1080 (0.0006) [2023-03-07 14:07:42,300][213771] Updated weights for policy 0, policy_version 1090 (0.0006) [2023-03-07 14:07:43,089][213771] Updated weights for policy 0, policy_version 1100 (0.0006) [2023-03-07 14:07:43,869][213771] Updated weights for policy 0, policy_version 1110 (0.0006) [2023-03-07 14:07:44,638][213771] Updated weights for policy 0, policy_version 1120 (0.0007) [2023-03-07 14:07:45,420][213771] Updated weights for policy 0, policy_version 1130 (0.0005) [2023-03-07 14:07:46,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 12948.0). Total num frames: 1165312. Throughput: 0: 13207.8. Samples: 1136137. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:07:46,106][213445] Avg episode reward: [(0, '3151.998')] [2023-03-07 14:07:46,118][213720] Saving new best policy, reward=3151.998! [2023-03-07 14:07:46,187][213771] Updated weights for policy 0, policy_version 1140 (0.0007) [2023-03-07 14:07:46,952][213771] Updated weights for policy 0, policy_version 1150 (0.0006) [2023-03-07 14:07:47,742][213771] Updated weights for policy 0, policy_version 1160 (0.0006) [2023-03-07 14:07:48,518][213771] Updated weights for policy 0, policy_version 1170 (0.0007) [2023-03-07 14:07:49,293][213771] Updated weights for policy 0, policy_version 1180 (0.0006) [2023-03-07 14:07:50,064][213771] Updated weights for policy 0, policy_version 1190 (0.0006) [2023-03-07 14:07:50,845][213771] Updated weights for policy 0, policy_version 1200 (0.0007) [2023-03-07 14:07:51,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13209.6, 300 sec: 12967.1). Total num frames: 1231872. Throughput: 0: 13209.9. Samples: 1215456. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:07:51,106][213445] Avg episode reward: [(0, '3295.790')] [2023-03-07 14:07:51,106][213720] Saving new best policy, reward=3295.790! [2023-03-07 14:07:51,607][213771] Updated weights for policy 0, policy_version 1210 (0.0006) [2023-03-07 14:07:52,418][213771] Updated weights for policy 0, policy_version 1220 (0.0006) [2023-03-07 14:07:53,190][213771] Updated weights for policy 0, policy_version 1230 (0.0005) [2023-03-07 14:07:53,952][213771] Updated weights for policy 0, policy_version 1240 (0.0006) [2023-03-07 14:07:54,723][213771] Updated weights for policy 0, policy_version 1250 (0.0006) [2023-03-07 14:07:55,503][213771] Updated weights for policy 0, policy_version 1260 (0.0006) [2023-03-07 14:07:56,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13192.5, 300 sec: 12974.1). Total num frames: 1297408. Throughput: 0: 13210.0. Samples: 1294765. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 14:07:56,106][213445] Avg episode reward: [(0, '3132.131')] [2023-03-07 14:07:56,261][213771] Updated weights for policy 0, policy_version 1270 (0.0006) [2023-03-07 14:07:57,037][213771] Updated weights for policy 0, policy_version 1280 (0.0006) [2023-03-07 14:07:57,815][213771] Updated weights for policy 0, policy_version 1290 (0.0006) [2023-03-07 14:07:58,577][213771] Updated weights for policy 0, policy_version 1300 (0.0005) [2023-03-07 14:07:59,358][213771] Updated weights for policy 0, policy_version 1310 (0.0006) [2023-03-07 14:08:00,132][213771] Updated weights for policy 0, policy_version 1320 (0.0006) [2023-03-07 14:08:00,914][213771] Updated weights for policy 0, policy_version 1330 (0.0006) [2023-03-07 14:08:01,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13209.6, 300 sec: 12990.2). Total num frames: 1363968. Throughput: 0: 13213.9. Samples: 1334625. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:08:01,106][213445] Avg episode reward: [(0, '2847.515')] [2023-03-07 14:08:01,699][213771] Updated weights for policy 0, policy_version 1340 (0.0006) [2023-03-07 14:08:02,467][213771] Updated weights for policy 0, policy_version 1350 (0.0007) [2023-03-07 14:08:03,242][213771] Updated weights for policy 0, policy_version 1360 (0.0006) [2023-03-07 14:08:04,012][213771] Updated weights for policy 0, policy_version 1370 (0.0007) [2023-03-07 14:08:04,786][213771] Updated weights for policy 0, policy_version 1380 (0.0005) [2023-03-07 14:08:05,558][213771] Updated weights for policy 0, policy_version 1390 (0.0006) [2023-03-07 14:08:06,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13226.6, 300 sec: 13004.8). Total num frames: 1430528. Throughput: 0: 13213.1. Samples: 1413813. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:08:06,106][213445] Avg episode reward: [(0, '2973.114')] [2023-03-07 14:08:06,110][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000001397_1430528.pth... [2023-03-07 14:08:06,330][213771] Updated weights for policy 0, policy_version 1400 (0.0006) [2023-03-07 14:08:07,106][213771] Updated weights for policy 0, policy_version 1410 (0.0006) [2023-03-07 14:08:07,887][213771] Updated weights for policy 0, policy_version 1420 (0.0006) [2023-03-07 14:08:08,646][213771] Updated weights for policy 0, policy_version 1430 (0.0006) [2023-03-07 14:08:09,429][213771] Updated weights for policy 0, policy_version 1440 (0.0007) [2023-03-07 14:08:10,202][213771] Updated weights for policy 0, policy_version 1450 (0.0006) [2023-03-07 14:08:10,973][213771] Updated weights for policy 0, policy_version 1460 (0.0007) [2023-03-07 14:08:11,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13209.6, 300 sec: 13009.3). Total num frames: 1496064. Throughput: 0: 13224.9. Samples: 1493354. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:08:11,106][213445] Avg episode reward: [(0, '3024.573')] [2023-03-07 14:08:11,751][213771] Updated weights for policy 0, policy_version 1470 (0.0007) [2023-03-07 14:08:12,527][213771] Updated weights for policy 0, policy_version 1480 (0.0005) [2023-03-07 14:08:13,303][213771] Updated weights for policy 0, policy_version 1490 (0.0006) [2023-03-07 14:08:14,085][213771] Updated weights for policy 0, policy_version 1500 (0.0006) [2023-03-07 14:08:14,855][213771] Updated weights for policy 0, policy_version 1510 (0.0006) [2023-03-07 14:08:15,626][213771] Updated weights for policy 0, policy_version 1520 (0.0006) [2023-03-07 14:08:16,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.6, 300 sec: 13021.9). Total num frames: 1562624. Throughput: 0: 13221.4. Samples: 1532810. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:08:16,106][213445] Avg episode reward: [(0, '2917.191')] [2023-03-07 14:08:16,415][213771] Updated weights for policy 0, policy_version 1530 (0.0006) [2023-03-07 14:08:17,179][213771] Updated weights for policy 0, policy_version 1540 (0.0005) [2023-03-07 14:08:17,937][213771] Updated weights for policy 0, policy_version 1550 (0.0006) [2023-03-07 14:08:18,712][213771] Updated weights for policy 0, policy_version 1560 (0.0007) [2023-03-07 14:08:19,489][213771] Updated weights for policy 0, policy_version 1570 (0.0006) [2023-03-07 14:08:20,254][213771] Updated weights for policy 0, policy_version 1580 (0.0005) [2023-03-07 14:08:21,041][213771] Updated weights for policy 0, policy_version 1590 (0.0007) [2023-03-07 14:08:21,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13209.6, 300 sec: 13025.3). Total num frames: 1628160. Throughput: 0: 13224.9. Samples: 1612321. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:08:21,106][213445] Avg episode reward: [(0, '3226.855')] [2023-03-07 14:08:21,812][213771] Updated weights for policy 0, policy_version 1600 (0.0006) [2023-03-07 14:08:22,565][213771] Updated weights for policy 0, policy_version 1610 (0.0006) [2023-03-07 14:08:23,353][213771] Updated weights for policy 0, policy_version 1620 (0.0006) [2023-03-07 14:08:24,126][213771] Updated weights for policy 0, policy_version 1630 (0.0006) [2023-03-07 14:08:24,890][213771] Updated weights for policy 0, policy_version 1640 (0.0006) [2023-03-07 14:08:25,658][213771] Updated weights for policy 0, policy_version 1650 (0.0006) [2023-03-07 14:08:26,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13036.4). Total num frames: 1694720. Throughput: 0: 13226.9. Samples: 1691810. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:08:26,106][213445] Avg episode reward: [(0, '2958.517')] [2023-03-07 14:08:26,453][213771] Updated weights for policy 0, policy_version 1660 (0.0007) [2023-03-07 14:08:27,218][213771] Updated weights for policy 0, policy_version 1670 (0.0006) [2023-03-07 14:08:27,979][213771] Updated weights for policy 0, policy_version 1680 (0.0007) [2023-03-07 14:08:28,782][213771] Updated weights for policy 0, policy_version 1690 (0.0006) [2023-03-07 14:08:29,556][213771] Updated weights for policy 0, policy_version 1700 (0.0006) [2023-03-07 14:08:30,321][213771] Updated weights for policy 0, policy_version 1710 (0.0006) [2023-03-07 14:08:31,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13209.6, 300 sec: 13039.0). Total num frames: 1760256. Throughput: 0: 13220.8. Samples: 1731073. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 14:08:31,105][213445] Avg episode reward: [(0, '3153.259')] [2023-03-07 14:08:31,109][213771] Updated weights for policy 0, policy_version 1720 (0.0007) [2023-03-07 14:08:31,886][213771] Updated weights for policy 0, policy_version 1730 (0.0006) [2023-03-07 14:08:32,660][213771] Updated weights for policy 0, policy_version 1740 (0.0006) [2023-03-07 14:08:33,457][213771] Updated weights for policy 0, policy_version 1750 (0.0006) [2023-03-07 14:08:34,233][213771] Updated weights for policy 0, policy_version 1760 (0.0006) [2023-03-07 14:08:35,013][213771] Updated weights for policy 0, policy_version 1770 (0.0007) [2023-03-07 14:08:35,784][213771] Updated weights for policy 0, policy_version 1780 (0.0006) [2023-03-07 14:08:36,105][213445] Fps is (10 sec: 13209.3, 60 sec: 13226.6, 300 sec: 13048.7). Total num frames: 1826816. Throughput: 0: 13217.9. Samples: 1810265. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:08:36,106][213445] Avg episode reward: [(0, '3328.381')] [2023-03-07 14:08:36,111][213720] Saving new best policy, reward=3328.381! [2023-03-07 14:08:36,551][213771] Updated weights for policy 0, policy_version 1790 (0.0008) [2023-03-07 14:08:37,331][213771] Updated weights for policy 0, policy_version 1800 (0.0006) [2023-03-07 14:08:38,117][213771] Updated weights for policy 0, policy_version 1810 (0.0006) [2023-03-07 14:08:38,886][213771] Updated weights for policy 0, policy_version 1820 (0.0006) [2023-03-07 14:08:39,661][213771] Updated weights for policy 0, policy_version 1830 (0.0006) [2023-03-07 14:08:40,455][213771] Updated weights for policy 0, policy_version 1840 (0.0006) [2023-03-07 14:08:41,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13209.6, 300 sec: 13050.7). Total num frames: 1892352. Throughput: 0: 13214.0. Samples: 1889393. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:08:41,106][213445] Avg episode reward: [(0, '3630.894')] [2023-03-07 14:08:41,107][213720] Saving new best policy, reward=3630.894! [2023-03-07 14:08:41,209][213771] Updated weights for policy 0, policy_version 1850 (0.0006) [2023-03-07 14:08:41,985][213771] Updated weights for policy 0, policy_version 1860 (0.0007) [2023-03-07 14:08:42,762][213771] Updated weights for policy 0, policy_version 1870 (0.0006) [2023-03-07 14:08:43,542][213771] Updated weights for policy 0, policy_version 1880 (0.0006) [2023-03-07 14:08:44,324][213771] Updated weights for policy 0, policy_version 1890 (0.0006) [2023-03-07 14:08:45,110][213771] Updated weights for policy 0, policy_version 1900 (0.0006) [2023-03-07 14:08:45,881][213771] Updated weights for policy 0, policy_version 1910 (0.0006) [2023-03-07 14:08:46,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13226.7, 300 sec: 13059.5). Total num frames: 1958912. Throughput: 0: 13207.4. Samples: 1928957. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:08:46,106][213445] Avg episode reward: [(0, '3696.357')] [2023-03-07 14:08:46,111][213720] Saving new best policy, reward=3696.357! [2023-03-07 14:08:46,660][213771] Updated weights for policy 0, policy_version 1920 (0.0006) [2023-03-07 14:08:47,433][213771] Updated weights for policy 0, policy_version 1930 (0.0006) [2023-03-07 14:08:48,218][213771] Updated weights for policy 0, policy_version 1940 (0.0006) [2023-03-07 14:08:48,965][213771] Updated weights for policy 0, policy_version 1950 (0.0006) [2023-03-07 14:08:49,738][213771] Updated weights for policy 0, policy_version 1960 (0.0006) [2023-03-07 14:08:50,521][213771] Updated weights for policy 0, policy_version 1970 (0.0006) [2023-03-07 14:08:51,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13209.6, 300 sec: 13061.0). Total num frames: 2024448. Throughput: 0: 13209.6. Samples: 2008244. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:08:51,106][213445] Avg episode reward: [(0, '3751.947')] [2023-03-07 14:08:51,107][213720] Saving new best policy, reward=3751.947! [2023-03-07 14:08:51,301][213771] Updated weights for policy 0, policy_version 1980 (0.0005) [2023-03-07 14:08:52,068][213771] Updated weights for policy 0, policy_version 1990 (0.0007) [2023-03-07 14:08:52,839][213771] Updated weights for policy 0, policy_version 2000 (0.0006) [2023-03-07 14:08:53,611][213771] Updated weights for policy 0, policy_version 2010 (0.0006) [2023-03-07 14:08:54,377][213771] Updated weights for policy 0, policy_version 2020 (0.0006) [2023-03-07 14:08:55,155][213771] Updated weights for policy 0, policy_version 2030 (0.0006) [2023-03-07 14:08:55,933][213771] Updated weights for policy 0, policy_version 2040 (0.0007) [2023-03-07 14:08:56,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13068.8). Total num frames: 2091008. Throughput: 0: 13206.8. Samples: 2087662. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:08:56,106][213445] Avg episode reward: [(0, '3528.654')] [2023-03-07 14:08:56,700][213771] Updated weights for policy 0, policy_version 2050 (0.0006) [2023-03-07 14:08:57,482][213771] Updated weights for policy 0, policy_version 2060 (0.0006) [2023-03-07 14:08:58,269][213771] Updated weights for policy 0, policy_version 2070 (0.0006) [2023-03-07 14:08:59,032][213771] Updated weights for policy 0, policy_version 2080 (0.0006) [2023-03-07 14:08:59,795][213771] Updated weights for policy 0, policy_version 2090 (0.0006) [2023-03-07 14:09:00,585][213771] Updated weights for policy 0, policy_version 2100 (0.0007) [2023-03-07 14:09:01,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13226.7, 300 sec: 13076.2). Total num frames: 2157568. Throughput: 0: 13213.5. Samples: 2127419. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:09:01,106][213445] Avg episode reward: [(0, '3582.455')] [2023-03-07 14:09:01,350][213771] Updated weights for policy 0, policy_version 2110 (0.0006) [2023-03-07 14:09:02,109][213771] Updated weights for policy 0, policy_version 2120 (0.0006) [2023-03-07 14:09:02,901][213771] Updated weights for policy 0, policy_version 2130 (0.0006) [2023-03-07 14:09:03,661][213771] Updated weights for policy 0, policy_version 2140 (0.0006) [2023-03-07 14:09:04,438][213771] Updated weights for policy 0, policy_version 2150 (0.0006) [2023-03-07 14:09:05,212][213771] Updated weights for policy 0, policy_version 2160 (0.0006) [2023-03-07 14:09:05,994][213771] Updated weights for policy 0, policy_version 2170 (0.0006) [2023-03-07 14:09:06,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13209.6, 300 sec: 13077.1). Total num frames: 2223104. Throughput: 0: 13209.2. Samples: 2206734. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:09:06,105][213445] Avg episode reward: [(0, '3658.421')] [2023-03-07 14:09:06,771][213771] Updated weights for policy 0, policy_version 2180 (0.0006) [2023-03-07 14:09:07,535][213771] Updated weights for policy 0, policy_version 2190 (0.0006) [2023-03-07 14:09:08,302][213771] Updated weights for policy 0, policy_version 2200 (0.0006) [2023-03-07 14:09:09,071][213771] Updated weights for policy 0, policy_version 2210 (0.0006) [2023-03-07 14:09:09,841][213771] Updated weights for policy 0, policy_version 2220 (0.0006) [2023-03-07 14:09:10,621][213771] Updated weights for policy 0, policy_version 2230 (0.0006) [2023-03-07 14:09:11,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13083.8). Total num frames: 2289664. Throughput: 0: 13213.6. Samples: 2286422. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:09:11,106][213445] Avg episode reward: [(0, '4035.897')] [2023-03-07 14:09:11,106][213720] Saving new best policy, reward=4035.897! [2023-03-07 14:09:11,409][213771] Updated weights for policy 0, policy_version 2240 (0.0006) [2023-03-07 14:09:12,177][213771] Updated weights for policy 0, policy_version 2250 (0.0005) [2023-03-07 14:09:12,943][213771] Updated weights for policy 0, policy_version 2260 (0.0006) [2023-03-07 14:09:13,711][213771] Updated weights for policy 0, policy_version 2270 (0.0006) [2023-03-07 14:09:14,482][213771] Updated weights for policy 0, policy_version 2280 (0.0006) [2023-03-07 14:09:15,259][213771] Updated weights for policy 0, policy_version 2290 (0.0007) [2023-03-07 14:09:16,040][213771] Updated weights for policy 0, policy_version 2300 (0.0006) [2023-03-07 14:09:16,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13209.6, 300 sec: 13084.5). Total num frames: 2355200. Throughput: 0: 13222.3. Samples: 2326078. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:09:16,106][213445] Avg episode reward: [(0, '3889.645')] [2023-03-07 14:09:16,800][213771] Updated weights for policy 0, policy_version 2310 (0.0006) [2023-03-07 14:09:17,562][213771] Updated weights for policy 0, policy_version 2320 (0.0005) [2023-03-07 14:09:18,333][213771] Updated weights for policy 0, policy_version 2330 (0.0005) [2023-03-07 14:09:19,097][213771] Updated weights for policy 0, policy_version 2340 (0.0006) [2023-03-07 14:09:19,880][213771] Updated weights for policy 0, policy_version 2350 (0.0007) [2023-03-07 14:09:20,661][213771] Updated weights for policy 0, policy_version 2360 (0.0006) [2023-03-07 14:09:21,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13226.7, 300 sec: 13090.6). Total num frames: 2421760. Throughput: 0: 13231.0. Samples: 2405659. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:09:21,106][213445] Avg episode reward: [(0, '3932.236')] [2023-03-07 14:09:21,418][213771] Updated weights for policy 0, policy_version 2370 (0.0005) [2023-03-07 14:09:22,213][213771] Updated weights for policy 0, policy_version 2380 (0.0006) [2023-03-07 14:09:22,973][213771] Updated weights for policy 0, policy_version 2390 (0.0006) [2023-03-07 14:09:23,748][213771] Updated weights for policy 0, policy_version 2400 (0.0006) [2023-03-07 14:09:24,526][213771] Updated weights for policy 0, policy_version 2410 (0.0006) [2023-03-07 14:09:25,298][213771] Updated weights for policy 0, policy_version 2420 (0.0006) [2023-03-07 14:09:26,069][213771] Updated weights for policy 0, policy_version 2430 (0.0006) [2023-03-07 14:09:26,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13226.7, 300 sec: 13096.5). Total num frames: 2488320. Throughput: 0: 13238.8. Samples: 2485136. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:09:26,106][213445] Avg episode reward: [(0, '3844.673')] [2023-03-07 14:09:26,845][213771] Updated weights for policy 0, policy_version 2440 (0.0007) [2023-03-07 14:09:27,612][213771] Updated weights for policy 0, policy_version 2450 (0.0005) [2023-03-07 14:09:28,373][213771] Updated weights for policy 0, policy_version 2460 (0.0007) [2023-03-07 14:09:29,156][213771] Updated weights for policy 0, policy_version 2470 (0.0006) [2023-03-07 14:09:29,945][213771] Updated weights for policy 0, policy_version 2480 (0.0006) [2023-03-07 14:09:30,720][213771] Updated weights for policy 0, policy_version 2490 (0.0006) [2023-03-07 14:09:31,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13096.7). Total num frames: 2553856. Throughput: 0: 13245.0. Samples: 2524980. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:09:31,106][213445] Avg episode reward: [(0, '3756.255')] [2023-03-07 14:09:31,485][213771] Updated weights for policy 0, policy_version 2500 (0.0007) [2023-03-07 14:09:32,269][213771] Updated weights for policy 0, policy_version 2510 (0.0006) [2023-03-07 14:09:33,040][213771] Updated weights for policy 0, policy_version 2520 (0.0006) [2023-03-07 14:09:33,816][213771] Updated weights for policy 0, policy_version 2530 (0.0006) [2023-03-07 14:09:34,598][213771] Updated weights for policy 0, policy_version 2540 (0.0006) [2023-03-07 14:09:35,364][213771] Updated weights for policy 0, policy_version 2550 (0.0007) [2023-03-07 14:09:36,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13102.1). Total num frames: 2620416. Throughput: 0: 13242.1. Samples: 2604139. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:09:36,106][213445] Avg episode reward: [(0, '4043.737')] [2023-03-07 14:09:36,108][213720] Saving new best policy, reward=4043.737! [2023-03-07 14:09:36,160][213771] Updated weights for policy 0, policy_version 2560 (0.0006) [2023-03-07 14:09:36,935][213771] Updated weights for policy 0, policy_version 2570 (0.0006) [2023-03-07 14:09:37,721][213771] Updated weights for policy 0, policy_version 2580 (0.0006) [2023-03-07 14:09:38,484][213771] Updated weights for policy 0, policy_version 2590 (0.0006) [2023-03-07 14:09:39,265][213771] Updated weights for policy 0, policy_version 2600 (0.0005) [2023-03-07 14:09:40,025][213771] Updated weights for policy 0, policy_version 2610 (0.0006) [2023-03-07 14:09:40,834][213771] Updated weights for policy 0, policy_version 2620 (0.0006) [2023-03-07 14:09:41,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13102.2). Total num frames: 2685952. Throughput: 0: 13227.8. Samples: 2682915. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:09:41,106][213445] Avg episode reward: [(0, '3955.312')] [2023-03-07 14:09:41,611][213771] Updated weights for policy 0, policy_version 2630 (0.0006) [2023-03-07 14:09:42,371][213771] Updated weights for policy 0, policy_version 2640 (0.0005) [2023-03-07 14:09:43,132][213771] Updated weights for policy 0, policy_version 2650 (0.0006) [2023-03-07 14:09:43,924][213771] Updated weights for policy 0, policy_version 2660 (0.0006) [2023-03-07 14:09:44,681][213771] Updated weights for policy 0, policy_version 2670 (0.0006) [2023-03-07 14:09:45,471][213771] Updated weights for policy 0, policy_version 2680 (0.0008) [2023-03-07 14:09:46,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13107.2). Total num frames: 2752512. Throughput: 0: 13228.3. Samples: 2722691. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:09:46,106][213445] Avg episode reward: [(0, '4060.001')] [2023-03-07 14:09:46,110][213720] Saving new best policy, reward=4060.001! [2023-03-07 14:09:46,249][213771] Updated weights for policy 0, policy_version 2690 (0.0006) [2023-03-07 14:09:47,005][213771] Updated weights for policy 0, policy_version 2700 (0.0006) [2023-03-07 14:09:47,789][213771] Updated weights for policy 0, policy_version 2710 (0.0005) [2023-03-07 14:09:48,544][213771] Updated weights for policy 0, policy_version 2720 (0.0007) [2023-03-07 14:09:49,321][213771] Updated weights for policy 0, policy_version 2730 (0.0006) [2023-03-07 14:09:50,115][213771] Updated weights for policy 0, policy_version 2740 (0.0006) [2023-03-07 14:09:50,880][213771] Updated weights for policy 0, policy_version 2750 (0.0006) [2023-03-07 14:09:51,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13107.2). Total num frames: 2818048. Throughput: 0: 13231.4. Samples: 2802146. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:09:51,106][213445] Avg episode reward: [(0, '3811.224')] [2023-03-07 14:09:51,661][213771] Updated weights for policy 0, policy_version 2760 (0.0008) [2023-03-07 14:09:52,437][213771] Updated weights for policy 0, policy_version 2770 (0.0005) [2023-03-07 14:09:53,213][213771] Updated weights for policy 0, policy_version 2780 (0.0006) [2023-03-07 14:09:53,961][213771] Updated weights for policy 0, policy_version 2790 (0.0006) [2023-03-07 14:09:54,728][213771] Updated weights for policy 0, policy_version 2800 (0.0006) [2023-03-07 14:09:55,509][213771] Updated weights for policy 0, policy_version 2810 (0.0006) [2023-03-07 14:09:56,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13111.9). Total num frames: 2884608. Throughput: 0: 13227.7. Samples: 2881669. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:09:56,106][213445] Avg episode reward: [(0, '3905.500')] [2023-03-07 14:09:56,307][213771] Updated weights for policy 0, policy_version 2820 (0.0005) [2023-03-07 14:09:57,074][213771] Updated weights for policy 0, policy_version 2830 (0.0007) [2023-03-07 14:09:57,852][213771] Updated weights for policy 0, policy_version 2840 (0.0006) [2023-03-07 14:09:58,609][213771] Updated weights for policy 0, policy_version 2850 (0.0006) [2023-03-07 14:09:59,388][213771] Updated weights for policy 0, policy_version 2860 (0.0006) [2023-03-07 14:10:00,166][213771] Updated weights for policy 0, policy_version 2870 (0.0006) [2023-03-07 14:10:00,926][213771] Updated weights for policy 0, policy_version 2880 (0.0006) [2023-03-07 14:10:01,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13226.7, 300 sec: 13116.3). Total num frames: 2951168. Throughput: 0: 13229.8. Samples: 2921419. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:10:01,106][213445] Avg episode reward: [(0, '3839.785')] [2023-03-07 14:10:01,689][213771] Updated weights for policy 0, policy_version 2890 (0.0006) [2023-03-07 14:10:02,461][213771] Updated weights for policy 0, policy_version 2900 (0.0006) [2023-03-07 14:10:03,243][213771] Updated weights for policy 0, policy_version 2910 (0.0006) [2023-03-07 14:10:04,002][213771] Updated weights for policy 0, policy_version 2920 (0.0006) [2023-03-07 14:10:04,783][213771] Updated weights for policy 0, policy_version 2930 (0.0006) [2023-03-07 14:10:05,541][213771] Updated weights for policy 0, policy_version 2940 (0.0006) [2023-03-07 14:10:06,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13243.7, 300 sec: 13120.6). Total num frames: 3017728. Throughput: 0: 13229.5. Samples: 3000986. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 14:10:06,106][213445] Avg episode reward: [(0, '4135.997')] [2023-03-07 14:10:06,111][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000002947_3017728.pth... [2023-03-07 14:10:06,140][213720] Saving new best policy, reward=4135.997! [2023-03-07 14:10:06,323][213771] Updated weights for policy 0, policy_version 2950 (0.0006) [2023-03-07 14:10:07,092][213771] Updated weights for policy 0, policy_version 2960 (0.0006) [2023-03-07 14:10:07,874][213771] Updated weights for policy 0, policy_version 2970 (0.0006) [2023-03-07 14:10:08,650][213771] Updated weights for policy 0, policy_version 2980 (0.0005) [2023-03-07 14:10:09,415][213771] Updated weights for policy 0, policy_version 2990 (0.0006) [2023-03-07 14:10:10,185][213771] Updated weights for policy 0, policy_version 3000 (0.0006) [2023-03-07 14:10:10,944][213771] Updated weights for policy 0, policy_version 3010 (0.0006) [2023-03-07 14:10:11,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13243.7, 300 sec: 13124.7). Total num frames: 3084288. Throughput: 0: 13234.0. Samples: 3080668. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 14:10:11,106][213445] Avg episode reward: [(0, '3924.790')] [2023-03-07 14:10:11,721][213771] Updated weights for policy 0, policy_version 3020 (0.0006) [2023-03-07 14:10:12,479][213771] Updated weights for policy 0, policy_version 3030 (0.0007) [2023-03-07 14:10:13,263][213771] Updated weights for policy 0, policy_version 3040 (0.0006) [2023-03-07 14:10:14,036][213771] Updated weights for policy 0, policy_version 3050 (0.0005) [2023-03-07 14:10:14,819][213771] Updated weights for policy 0, policy_version 3060 (0.0006) [2023-03-07 14:10:15,607][213771] Updated weights for policy 0, policy_version 3070 (0.0006) [2023-03-07 14:10:16,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13124.3). Total num frames: 3149824. Throughput: 0: 13231.0. Samples: 3120375. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:10:16,106][213445] Avg episode reward: [(0, '3904.696')] [2023-03-07 14:10:16,377][213771] Updated weights for policy 0, policy_version 3080 (0.0006) [2023-03-07 14:10:17,159][213771] Updated weights for policy 0, policy_version 3090 (0.0007) [2023-03-07 14:10:17,946][213771] Updated weights for policy 0, policy_version 3100 (0.0006) [2023-03-07 14:10:18,693][213771] Updated weights for policy 0, policy_version 3110 (0.0005) [2023-03-07 14:10:19,452][213771] Updated weights for policy 0, policy_version 3120 (0.0005) [2023-03-07 14:10:20,222][213771] Updated weights for policy 0, policy_version 3130 (0.0007) [2023-03-07 14:10:20,985][213771] Updated weights for policy 0, policy_version 3140 (0.0006) [2023-03-07 14:10:21,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13128.1). Total num frames: 3216384. Throughput: 0: 13245.0. Samples: 3200162. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 14:10:21,106][213445] Avg episode reward: [(0, '3963.823')] [2023-03-07 14:10:21,755][213771] Updated weights for policy 0, policy_version 3150 (0.0006) [2023-03-07 14:10:22,523][213771] Updated weights for policy 0, policy_version 3160 (0.0006) [2023-03-07 14:10:23,314][213771] Updated weights for policy 0, policy_version 3170 (0.0007) [2023-03-07 14:10:24,070][213771] Updated weights for policy 0, policy_version 3180 (0.0006) [2023-03-07 14:10:24,877][213771] Updated weights for policy 0, policy_version 3190 (0.0006) [2023-03-07 14:10:25,622][213771] Updated weights for policy 0, policy_version 3200 (0.0006) [2023-03-07 14:10:26,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13243.7, 300 sec: 13131.8). Total num frames: 3282944. Throughput: 0: 13256.0. Samples: 3279436. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:10:26,106][213445] Avg episode reward: [(0, '3910.946')] [2023-03-07 14:10:26,382][213771] Updated weights for policy 0, policy_version 3210 (0.0006) [2023-03-07 14:10:27,164][213771] Updated weights for policy 0, policy_version 3220 (0.0007) [2023-03-07 14:10:27,925][213771] Updated weights for policy 0, policy_version 3230 (0.0006) [2023-03-07 14:10:28,701][213771] Updated weights for policy 0, policy_version 3240 (0.0007) [2023-03-07 14:10:29,470][213771] Updated weights for policy 0, policy_version 3250 (0.0005) [2023-03-07 14:10:30,232][213771] Updated weights for policy 0, policy_version 3260 (0.0006) [2023-03-07 14:10:30,987][213771] Updated weights for policy 0, policy_version 3270 (0.0006) [2023-03-07 14:10:31,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13260.8, 300 sec: 13135.3). Total num frames: 3349504. Throughput: 0: 13261.6. Samples: 3319464. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:10:31,106][213445] Avg episode reward: [(0, '4122.382')] [2023-03-07 14:10:31,757][213771] Updated weights for policy 0, policy_version 3280 (0.0005) [2023-03-07 14:10:32,531][213771] Updated weights for policy 0, policy_version 3290 (0.0006) [2023-03-07 14:10:33,297][213771] Updated weights for policy 0, policy_version 3300 (0.0007) [2023-03-07 14:10:34,071][213771] Updated weights for policy 0, policy_version 3310 (0.0006) [2023-03-07 14:10:34,862][213771] Updated weights for policy 0, policy_version 3320 (0.0006) [2023-03-07 14:10:35,620][213771] Updated weights for policy 0, policy_version 3330 (0.0006) [2023-03-07 14:10:36,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13260.8, 300 sec: 13138.7). Total num frames: 3416064. Throughput: 0: 13274.8. Samples: 3399511. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:10:36,106][213445] Avg episode reward: [(0, '4036.246')] [2023-03-07 14:10:36,402][213771] Updated weights for policy 0, policy_version 3340 (0.0006) [2023-03-07 14:10:37,182][213771] Updated weights for policy 0, policy_version 3350 (0.0006) [2023-03-07 14:10:37,934][213771] Updated weights for policy 0, policy_version 3360 (0.0007) [2023-03-07 14:10:38,714][213771] Updated weights for policy 0, policy_version 3370 (0.0006) [2023-03-07 14:10:39,486][213771] Updated weights for policy 0, policy_version 3380 (0.0006) [2023-03-07 14:10:40,260][213771] Updated weights for policy 0, policy_version 3390 (0.0005) [2023-03-07 14:10:41,030][213771] Updated weights for policy 0, policy_version 3400 (0.0006) [2023-03-07 14:10:41,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13277.9, 300 sec: 13142.0). Total num frames: 3482624. Throughput: 0: 13272.2. Samples: 3478919. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:10:41,106][213445] Avg episode reward: [(0, '4132.906')] [2023-03-07 14:10:41,794][213771] Updated weights for policy 0, policy_version 3410 (0.0006) [2023-03-07 14:10:42,588][213771] Updated weights for policy 0, policy_version 3420 (0.0006) [2023-03-07 14:10:43,349][213771] Updated weights for policy 0, policy_version 3430 (0.0006) [2023-03-07 14:10:44,131][213771] Updated weights for policy 0, policy_version 3440 (0.0006) [2023-03-07 14:10:44,914][213771] Updated weights for policy 0, policy_version 3450 (0.0006) [2023-03-07 14:10:45,681][213771] Updated weights for policy 0, policy_version 3460 (0.0006) [2023-03-07 14:10:46,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13141.4). Total num frames: 3548160. Throughput: 0: 13272.5. Samples: 3518684. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:10:46,106][213445] Avg episode reward: [(0, '3721.626')] [2023-03-07 14:10:46,435][213771] Updated weights for policy 0, policy_version 3470 (0.0006) [2023-03-07 14:10:47,205][213771] Updated weights for policy 0, policy_version 3480 (0.0005) [2023-03-07 14:10:47,982][213771] Updated weights for policy 0, policy_version 3490 (0.0007) [2023-03-07 14:10:48,737][213771] Updated weights for policy 0, policy_version 3500 (0.0006) [2023-03-07 14:10:49,522][213771] Updated weights for policy 0, policy_version 3510 (0.0006) [2023-03-07 14:10:50,294][213771] Updated weights for policy 0, policy_version 3520 (0.0006) [2023-03-07 14:10:51,090][213771] Updated weights for policy 0, policy_version 3530 (0.0007) [2023-03-07 14:10:51,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13277.9, 300 sec: 13144.5). Total num frames: 3614720. Throughput: 0: 13275.5. Samples: 3598380. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:10:51,106][213445] Avg episode reward: [(0, '3821.422')] [2023-03-07 14:10:51,850][213771] Updated weights for policy 0, policy_version 3540 (0.0006) [2023-03-07 14:10:52,638][213771] Updated weights for policy 0, policy_version 3550 (0.0005) [2023-03-07 14:10:53,421][213771] Updated weights for policy 0, policy_version 3560 (0.0006) [2023-03-07 14:10:54,203][213771] Updated weights for policy 0, policy_version 3570 (0.0006) [2023-03-07 14:10:54,972][213771] Updated weights for policy 0, policy_version 3580 (0.0006) [2023-03-07 14:10:55,746][213771] Updated weights for policy 0, policy_version 3590 (0.0007) [2023-03-07 14:10:56,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13260.8, 300 sec: 13143.8). Total num frames: 3680256. Throughput: 0: 13260.6. Samples: 3677392. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 14:10:56,105][213445] Avg episode reward: [(0, '4086.919')] [2023-03-07 14:10:56,544][213771] Updated weights for policy 0, policy_version 3600 (0.0006) [2023-03-07 14:10:57,308][213771] Updated weights for policy 0, policy_version 3610 (0.0006) [2023-03-07 14:10:58,097][213771] Updated weights for policy 0, policy_version 3620 (0.0007) [2023-03-07 14:10:58,863][213771] Updated weights for policy 0, policy_version 3630 (0.0006) [2023-03-07 14:10:59,639][213771] Updated weights for policy 0, policy_version 3640 (0.0006) [2023-03-07 14:11:00,392][213771] Updated weights for policy 0, policy_version 3650 (0.0007) [2023-03-07 14:11:01,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13146.7). Total num frames: 3746816. Throughput: 0: 13257.7. Samples: 3716974. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:11:01,106][213445] Avg episode reward: [(0, '3978.052')] [2023-03-07 14:11:01,182][213771] Updated weights for policy 0, policy_version 3660 (0.0006) [2023-03-07 14:11:02,000][213771] Updated weights for policy 0, policy_version 3670 (0.0006) [2023-03-07 14:11:02,773][213771] Updated weights for policy 0, policy_version 3680 (0.0006) [2023-03-07 14:11:03,557][213771] Updated weights for policy 0, policy_version 3690 (0.0006) [2023-03-07 14:11:04,331][213771] Updated weights for policy 0, policy_version 3700 (0.0007) [2023-03-07 14:11:05,105][213771] Updated weights for policy 0, policy_version 3710 (0.0006) [2023-03-07 14:11:05,884][213771] Updated weights for policy 0, policy_version 3720 (0.0007) [2023-03-07 14:11:06,105][213445] Fps is (10 sec: 13209.3, 60 sec: 13243.7, 300 sec: 13146.1). Total num frames: 3812352. Throughput: 0: 13227.2. Samples: 3795389. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:11:06,106][213445] Avg episode reward: [(0, '3716.569')] [2023-03-07 14:11:06,661][213771] Updated weights for policy 0, policy_version 3730 (0.0005) [2023-03-07 14:11:07,437][213771] Updated weights for policy 0, policy_version 3740 (0.0007) [2023-03-07 14:11:08,231][213771] Updated weights for policy 0, policy_version 3750 (0.0006) [2023-03-07 14:11:09,005][213771] Updated weights for policy 0, policy_version 3760 (0.0006) [2023-03-07 14:11:09,769][213771] Updated weights for policy 0, policy_version 3770 (0.0006) [2023-03-07 14:11:10,537][213771] Updated weights for policy 0, policy_version 3780 (0.0005) [2023-03-07 14:11:11,105][213445] Fps is (10 sec: 13107.2, 60 sec: 13226.6, 300 sec: 13145.4). Total num frames: 3877888. Throughput: 0: 13229.1. Samples: 3874745. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 14:11:11,106][213445] Avg episode reward: [(0, '4044.219')] [2023-03-07 14:11:11,301][213771] Updated weights for policy 0, policy_version 3790 (0.0006) [2023-03-07 14:11:12,090][213771] Updated weights for policy 0, policy_version 3800 (0.0006) [2023-03-07 14:11:12,868][213771] Updated weights for policy 0, policy_version 3810 (0.0006) [2023-03-07 14:11:13,624][213771] Updated weights for policy 0, policy_version 3820 (0.0007) [2023-03-07 14:11:14,413][213771] Updated weights for policy 0, policy_version 3830 (0.0005) [2023-03-07 14:11:15,183][213771] Updated weights for policy 0, policy_version 3840 (0.0006) [2023-03-07 14:11:15,961][213771] Updated weights for policy 0, policy_version 3850 (0.0006) [2023-03-07 14:11:16,105][213445] Fps is (10 sec: 13107.5, 60 sec: 13226.7, 300 sec: 13221.8). Total num frames: 3943424. Throughput: 0: 13218.8. Samples: 3914311. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:11:16,106][213445] Avg episode reward: [(0, '4019.176')] [2023-03-07 14:11:16,740][213771] Updated weights for policy 0, policy_version 3860 (0.0005) [2023-03-07 14:11:17,514][213771] Updated weights for policy 0, policy_version 3870 (0.0006) [2023-03-07 14:11:18,272][213771] Updated weights for policy 0, policy_version 3880 (0.0005) [2023-03-07 14:11:19,049][213771] Updated weights for policy 0, policy_version 3890 (0.0006) [2023-03-07 14:11:19,821][213771] Updated weights for policy 0, policy_version 3900 (0.0006) [2023-03-07 14:11:20,576][213771] Updated weights for policy 0, policy_version 3910 (0.0006) [2023-03-07 14:11:21,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.6, 300 sec: 13221.8). Total num frames: 4009984. Throughput: 0: 13207.9. Samples: 3993864. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:11:21,106][213445] Avg episode reward: [(0, '3962.164')] [2023-03-07 14:11:21,357][213771] Updated weights for policy 0, policy_version 3920 (0.0006) [2023-03-07 14:11:22,130][213771] Updated weights for policy 0, policy_version 3930 (0.0006) [2023-03-07 14:11:22,910][213771] Updated weights for policy 0, policy_version 3940 (0.0006) [2023-03-07 14:11:23,702][213771] Updated weights for policy 0, policy_version 3950 (0.0006) [2023-03-07 14:11:24,459][213771] Updated weights for policy 0, policy_version 3960 (0.0006) [2023-03-07 14:11:25,236][213771] Updated weights for policy 0, policy_version 3970 (0.0007) [2023-03-07 14:11:26,019][213771] Updated weights for policy 0, policy_version 3980 (0.0005) [2023-03-07 14:11:26,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13226.6, 300 sec: 13221.8). Total num frames: 4076544. Throughput: 0: 13211.4. Samples: 4073433. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:11:26,106][213445] Avg episode reward: [(0, '4088.966')] [2023-03-07 14:11:26,762][213771] Updated weights for policy 0, policy_version 3990 (0.0006) [2023-03-07 14:11:27,536][213771] Updated weights for policy 0, policy_version 4000 (0.0007) [2023-03-07 14:11:28,313][213771] Updated weights for policy 0, policy_version 4010 (0.0007) [2023-03-07 14:11:29,071][213771] Updated weights for policy 0, policy_version 4020 (0.0006) [2023-03-07 14:11:29,842][213771] Updated weights for policy 0, policy_version 4030 (0.0007) [2023-03-07 14:11:30,635][213771] Updated weights for policy 0, policy_version 4040 (0.0006) [2023-03-07 14:11:31,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13226.7, 300 sec: 13225.2). Total num frames: 4143104. Throughput: 0: 13217.6. Samples: 4113474. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:11:31,106][213445] Avg episode reward: [(0, '4160.789')] [2023-03-07 14:11:31,106][213720] Saving new best policy, reward=4160.789! [2023-03-07 14:11:31,399][213771] Updated weights for policy 0, policy_version 4050 (0.0007) [2023-03-07 14:11:32,176][213771] Updated weights for policy 0, policy_version 4060 (0.0007) [2023-03-07 14:11:32,961][213771] Updated weights for policy 0, policy_version 4070 (0.0006) [2023-03-07 14:11:33,731][213771] Updated weights for policy 0, policy_version 4080 (0.0006) [2023-03-07 14:11:34,499][213771] Updated weights for policy 0, policy_version 4090 (0.0007) [2023-03-07 14:11:35,265][213771] Updated weights for policy 0, policy_version 4100 (0.0006) [2023-03-07 14:11:36,044][213771] Updated weights for policy 0, policy_version 4110 (0.0007) [2023-03-07 14:11:36,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13209.6, 300 sec: 13225.2). Total num frames: 4208640. Throughput: 0: 13205.2. Samples: 4192612. Policy #0 lag: (min: 0.0, avg: 1.3, max: 2.0) [2023-03-07 14:11:36,105][213445] Avg episode reward: [(0, '4098.191')] [2023-03-07 14:11:36,818][213771] Updated weights for policy 0, policy_version 4120 (0.0006) [2023-03-07 14:11:37,591][213771] Updated weights for policy 0, policy_version 4130 (0.0006) [2023-03-07 14:11:38,374][213771] Updated weights for policy 0, policy_version 4140 (0.0006) [2023-03-07 14:11:39,140][213771] Updated weights for policy 0, policy_version 4150 (0.0007) [2023-03-07 14:11:39,906][213771] Updated weights for policy 0, policy_version 4160 (0.0007) [2023-03-07 14:11:40,683][213771] Updated weights for policy 0, policy_version 4170 (0.0007) [2023-03-07 14:11:41,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13209.6, 300 sec: 13225.2). Total num frames: 4275200. Throughput: 0: 13217.5. Samples: 4272184. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:11:41,106][213445] Avg episode reward: [(0, '4163.617')] [2023-03-07 14:11:41,107][213720] Saving new best policy, reward=4163.617! [2023-03-07 14:11:41,441][213771] Updated weights for policy 0, policy_version 4180 (0.0006) [2023-03-07 14:11:42,220][213771] Updated weights for policy 0, policy_version 4190 (0.0007) [2023-03-07 14:11:43,001][213771] Updated weights for policy 0, policy_version 4200 (0.0006) [2023-03-07 14:11:43,777][213771] Updated weights for policy 0, policy_version 4210 (0.0006) [2023-03-07 14:11:44,540][213771] Updated weights for policy 0, policy_version 4220 (0.0006) [2023-03-07 14:11:45,324][213771] Updated weights for policy 0, policy_version 4230 (0.0008) [2023-03-07 14:11:46,096][213771] Updated weights for policy 0, policy_version 4240 (0.0006) [2023-03-07 14:11:46,105][213445] Fps is (10 sec: 13311.7, 60 sec: 13226.7, 300 sec: 13228.7). Total num frames: 4341760. Throughput: 0: 13220.8. Samples: 4311909. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:11:46,106][213445] Avg episode reward: [(0, '4219.708')] [2023-03-07 14:11:46,109][213720] Saving new best policy, reward=4219.708! [2023-03-07 14:11:46,881][213771] Updated weights for policy 0, policy_version 4250 (0.0006) [2023-03-07 14:11:47,652][213771] Updated weights for policy 0, policy_version 4260 (0.0006) [2023-03-07 14:11:48,419][213771] Updated weights for policy 0, policy_version 4270 (0.0006) [2023-03-07 14:11:49,180][213771] Updated weights for policy 0, policy_version 4280 (0.0006) [2023-03-07 14:11:49,978][213771] Updated weights for policy 0, policy_version 4290 (0.0006) [2023-03-07 14:11:50,747][213771] Updated weights for policy 0, policy_version 4300 (0.0006) [2023-03-07 14:11:51,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13209.6, 300 sec: 13225.2). Total num frames: 4407296. Throughput: 0: 13240.0. Samples: 4391188. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:11:51,106][213445] Avg episode reward: [(0, '4053.578')] [2023-03-07 14:11:51,502][213771] Updated weights for policy 0, policy_version 4310 (0.0006) [2023-03-07 14:11:52,283][213771] Updated weights for policy 0, policy_version 4320 (0.0006) [2023-03-07 14:11:53,063][213771] Updated weights for policy 0, policy_version 4330 (0.0006) [2023-03-07 14:11:53,828][213771] Updated weights for policy 0, policy_version 4340 (0.0006) [2023-03-07 14:11:54,601][213771] Updated weights for policy 0, policy_version 4350 (0.0006) [2023-03-07 14:11:55,388][213771] Updated weights for policy 0, policy_version 4360 (0.0007) [2023-03-07 14:11:56,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13226.7, 300 sec: 13228.7). Total num frames: 4473856. Throughput: 0: 13239.5. Samples: 4470521. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:11:56,105][213445] Avg episode reward: [(0, '4139.515')] [2023-03-07 14:11:56,157][213771] Updated weights for policy 0, policy_version 4370 (0.0006) [2023-03-07 14:11:56,937][213771] Updated weights for policy 0, policy_version 4380 (0.0006) [2023-03-07 14:11:57,711][213771] Updated weights for policy 0, policy_version 4390 (0.0006) [2023-03-07 14:11:58,481][213771] Updated weights for policy 0, policy_version 4400 (0.0006) [2023-03-07 14:11:59,265][213771] Updated weights for policy 0, policy_version 4410 (0.0006) [2023-03-07 14:12:00,047][213771] Updated weights for policy 0, policy_version 4420 (0.0006) [2023-03-07 14:12:00,817][213771] Updated weights for policy 0, policy_version 4430 (0.0006) [2023-03-07 14:12:01,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13209.6, 300 sec: 13228.7). Total num frames: 4539392. Throughput: 0: 13239.1. Samples: 4510073. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:12:01,106][213445] Avg episode reward: [(0, '4224.395')] [2023-03-07 14:12:01,109][213720] Saving new best policy, reward=4224.395! [2023-03-07 14:12:01,557][213771] Updated weights for policy 0, policy_version 4440 (0.0006) [2023-03-07 14:12:02,353][213771] Updated weights for policy 0, policy_version 4450 (0.0006) [2023-03-07 14:12:03,121][213771] Updated weights for policy 0, policy_version 4460 (0.0006) [2023-03-07 14:12:03,904][213771] Updated weights for policy 0, policy_version 4470 (0.0008) [2023-03-07 14:12:04,672][213771] Updated weights for policy 0, policy_version 4480 (0.0007) [2023-03-07 14:12:05,446][213771] Updated weights for policy 0, policy_version 4490 (0.0006) [2023-03-07 14:12:06,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13226.7, 300 sec: 13228.7). Total num frames: 4605952. Throughput: 0: 13241.7. Samples: 4589738. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:12:06,106][213445] Avg episode reward: [(0, '4116.643')] [2023-03-07 14:12:06,111][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000004498_4605952.pth... [2023-03-07 14:12:06,142][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000001397_1430528.pth [2023-03-07 14:12:06,204][213771] Updated weights for policy 0, policy_version 4500 (0.0006) [2023-03-07 14:12:06,988][213771] Updated weights for policy 0, policy_version 4510 (0.0006) [2023-03-07 14:12:07,767][213771] Updated weights for policy 0, policy_version 4520 (0.0007) [2023-03-07 14:12:08,545][213771] Updated weights for policy 0, policy_version 4530 (0.0006) [2023-03-07 14:12:09,347][213771] Updated weights for policy 0, policy_version 4540 (0.0006) [2023-03-07 14:12:10,126][213771] Updated weights for policy 0, policy_version 4550 (0.0006) [2023-03-07 14:12:10,902][213771] Updated weights for policy 0, policy_version 4560 (0.0006) [2023-03-07 14:12:11,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13228.7). Total num frames: 4671488. Throughput: 0: 13228.5. Samples: 4668714. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:12:11,106][213445] Avg episode reward: [(0, '3994.816')] [2023-03-07 14:12:11,659][213771] Updated weights for policy 0, policy_version 4570 (0.0006) [2023-03-07 14:12:12,443][213771] Updated weights for policy 0, policy_version 4580 (0.0006) [2023-03-07 14:12:13,214][213771] Updated weights for policy 0, policy_version 4590 (0.0007) [2023-03-07 14:12:13,998][213771] Updated weights for policy 0, policy_version 4600 (0.0005) [2023-03-07 14:12:14,774][213771] Updated weights for policy 0, policy_version 4610 (0.0007) [2023-03-07 14:12:15,538][213771] Updated weights for policy 0, policy_version 4620 (0.0007) [2023-03-07 14:12:16,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13228.7). Total num frames: 4738048. Throughput: 0: 13218.4. Samples: 4708303. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:12:16,106][213445] Avg episode reward: [(0, '3970.752')] [2023-03-07 14:12:16,318][213771] Updated weights for policy 0, policy_version 4630 (0.0006) [2023-03-07 14:12:17,097][213771] Updated weights for policy 0, policy_version 4640 (0.0006) [2023-03-07 14:12:17,860][213771] Updated weights for policy 0, policy_version 4650 (0.0006) [2023-03-07 14:12:18,631][213771] Updated weights for policy 0, policy_version 4660 (0.0006) [2023-03-07 14:12:19,421][213771] Updated weights for policy 0, policy_version 4670 (0.0006) [2023-03-07 14:12:20,187][213771] Updated weights for policy 0, policy_version 4680 (0.0007) [2023-03-07 14:12:20,953][213771] Updated weights for policy 0, policy_version 4690 (0.0007) [2023-03-07 14:12:21,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13226.7, 300 sec: 13228.7). Total num frames: 4803584. Throughput: 0: 13224.0. Samples: 4787696. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:12:21,106][213445] Avg episode reward: [(0, '4174.362')] [2023-03-07 14:12:21,741][213771] Updated weights for policy 0, policy_version 4700 (0.0006) [2023-03-07 14:12:22,502][213771] Updated weights for policy 0, policy_version 4710 (0.0007) [2023-03-07 14:12:23,281][213771] Updated weights for policy 0, policy_version 4720 (0.0006) [2023-03-07 14:12:24,063][213771] Updated weights for policy 0, policy_version 4730 (0.0005) [2023-03-07 14:12:24,826][213771] Updated weights for policy 0, policy_version 4740 (0.0006) [2023-03-07 14:12:25,605][213771] Updated weights for policy 0, policy_version 4750 (0.0006) [2023-03-07 14:12:26,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13228.7). Total num frames: 4870144. Throughput: 0: 13219.3. Samples: 4867050. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:12:26,106][213445] Avg episode reward: [(0, '4032.562')] [2023-03-07 14:12:26,378][213771] Updated weights for policy 0, policy_version 4760 (0.0008) [2023-03-07 14:12:27,145][213771] Updated weights for policy 0, policy_version 4770 (0.0005) [2023-03-07 14:12:27,939][213771] Updated weights for policy 0, policy_version 4780 (0.0006) [2023-03-07 14:12:28,713][213771] Updated weights for policy 0, policy_version 4790 (0.0006) [2023-03-07 14:12:29,493][213771] Updated weights for policy 0, policy_version 4800 (0.0006) [2023-03-07 14:12:30,262][213771] Updated weights for policy 0, policy_version 4810 (0.0007) [2023-03-07 14:12:31,052][213771] Updated weights for policy 0, policy_version 4820 (0.0007) [2023-03-07 14:12:31,105][213445] Fps is (10 sec: 13209.9, 60 sec: 13209.6, 300 sec: 13228.7). Total num frames: 4935680. Throughput: 0: 13211.5. Samples: 4906423. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:12:31,106][213445] Avg episode reward: [(0, '3847.309')] [2023-03-07 14:12:31,822][213771] Updated weights for policy 0, policy_version 4830 (0.0006) [2023-03-07 14:12:32,590][213771] Updated weights for policy 0, policy_version 4840 (0.0006) [2023-03-07 14:12:33,369][213771] Updated weights for policy 0, policy_version 4850 (0.0006) [2023-03-07 14:12:34,149][213771] Updated weights for policy 0, policy_version 4860 (0.0008) [2023-03-07 14:12:34,929][213771] Updated weights for policy 0, policy_version 4870 (0.0007) [2023-03-07 14:12:35,690][213771] Updated weights for policy 0, policy_version 4880 (0.0006) [2023-03-07 14:12:36,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13228.7). Total num frames: 5002240. Throughput: 0: 13212.7. Samples: 4985757. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:12:36,106][213445] Avg episode reward: [(0, '3753.872')] [2023-03-07 14:12:36,486][213771] Updated weights for policy 0, policy_version 4890 (0.0007) [2023-03-07 14:12:37,258][213771] Updated weights for policy 0, policy_version 4900 (0.0006) [2023-03-07 14:12:38,030][213771] Updated weights for policy 0, policy_version 4910 (0.0006) [2023-03-07 14:12:38,839][213771] Updated weights for policy 0, policy_version 4920 (0.0006) [2023-03-07 14:12:39,587][213771] Updated weights for policy 0, policy_version 4930 (0.0006) [2023-03-07 14:12:40,357][213771] Updated weights for policy 0, policy_version 4940 (0.0007) [2023-03-07 14:12:41,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13209.6, 300 sec: 13228.7). Total num frames: 5067776. Throughput: 0: 13206.7. Samples: 5064824. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:12:41,106][213445] Avg episode reward: [(0, '3955.435')] [2023-03-07 14:12:41,140][213771] Updated weights for policy 0, policy_version 4950 (0.0006) [2023-03-07 14:12:41,929][213771] Updated weights for policy 0, policy_version 4960 (0.0007) [2023-03-07 14:12:42,698][213771] Updated weights for policy 0, policy_version 4970 (0.0007) [2023-03-07 14:12:43,483][213771] Updated weights for policy 0, policy_version 4980 (0.0006) [2023-03-07 14:12:44,253][213771] Updated weights for policy 0, policy_version 4990 (0.0006) [2023-03-07 14:12:45,041][213771] Updated weights for policy 0, policy_version 5000 (0.0006) [2023-03-07 14:12:45,797][213771] Updated weights for policy 0, policy_version 5010 (0.0006) [2023-03-07 14:12:46,105][213445] Fps is (10 sec: 13107.2, 60 sec: 13192.6, 300 sec: 13225.2). Total num frames: 5133312. Throughput: 0: 13204.8. Samples: 5104288. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:12:46,106][213445] Avg episode reward: [(0, '3976.125')] [2023-03-07 14:12:46,579][213771] Updated weights for policy 0, policy_version 5020 (0.0006) [2023-03-07 14:12:47,344][213771] Updated weights for policy 0, policy_version 5030 (0.0006) [2023-03-07 14:12:48,122][213771] Updated weights for policy 0, policy_version 5040 (0.0007) [2023-03-07 14:12:48,889][213771] Updated weights for policy 0, policy_version 5050 (0.0005) [2023-03-07 14:12:49,667][213771] Updated weights for policy 0, policy_version 5060 (0.0006) [2023-03-07 14:12:50,425][213771] Updated weights for policy 0, policy_version 5070 (0.0006) [2023-03-07 14:12:51,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13209.6, 300 sec: 13228.7). Total num frames: 5199872. Throughput: 0: 13203.0. Samples: 5183871. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:12:51,106][213445] Avg episode reward: [(0, '3721.481')] [2023-03-07 14:12:51,213][213771] Updated weights for policy 0, policy_version 5080 (0.0006) [2023-03-07 14:12:51,968][213771] Updated weights for policy 0, policy_version 5090 (0.0006) [2023-03-07 14:12:52,742][213771] Updated weights for policy 0, policy_version 5100 (0.0006) [2023-03-07 14:12:53,523][213771] Updated weights for policy 0, policy_version 5110 (0.0007) [2023-03-07 14:12:54,282][213771] Updated weights for policy 0, policy_version 5120 (0.0006) [2023-03-07 14:12:55,058][213771] Updated weights for policy 0, policy_version 5130 (0.0006) [2023-03-07 14:12:55,831][213771] Updated weights for policy 0, policy_version 5140 (0.0005) [2023-03-07 14:12:56,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13209.6, 300 sec: 13228.7). Total num frames: 5266432. Throughput: 0: 13212.9. Samples: 5263293. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:12:56,105][213445] Avg episode reward: [(0, '3873.260')] [2023-03-07 14:12:56,612][213771] Updated weights for policy 0, policy_version 5150 (0.0007) [2023-03-07 14:12:57,401][213771] Updated weights for policy 0, policy_version 5160 (0.0006) [2023-03-07 14:12:58,169][213771] Updated weights for policy 0, policy_version 5170 (0.0006) [2023-03-07 14:12:58,949][213771] Updated weights for policy 0, policy_version 5180 (0.0006) [2023-03-07 14:12:59,723][213771] Updated weights for policy 0, policy_version 5190 (0.0007) [2023-03-07 14:13:00,510][213771] Updated weights for policy 0, policy_version 5200 (0.0006) [2023-03-07 14:13:01,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13209.6, 300 sec: 13225.2). Total num frames: 5331968. Throughput: 0: 13211.7. Samples: 5302833. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:13:01,106][213445] Avg episode reward: [(0, '3752.332')] [2023-03-07 14:13:01,281][213771] Updated weights for policy 0, policy_version 5210 (0.0006) [2023-03-07 14:13:02,040][213771] Updated weights for policy 0, policy_version 5220 (0.0005) [2023-03-07 14:13:02,815][213771] Updated weights for policy 0, policy_version 5230 (0.0006) [2023-03-07 14:13:03,593][213771] Updated weights for policy 0, policy_version 5240 (0.0006) [2023-03-07 14:13:04,367][213771] Updated weights for policy 0, policy_version 5250 (0.0006) [2023-03-07 14:13:05,146][213771] Updated weights for policy 0, policy_version 5260 (0.0007) [2023-03-07 14:13:05,922][213771] Updated weights for policy 0, policy_version 5270 (0.0006) [2023-03-07 14:13:06,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13209.6, 300 sec: 13228.7). Total num frames: 5398528. Throughput: 0: 13211.9. Samples: 5382229. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:13:06,106][213445] Avg episode reward: [(0, '3967.173')] [2023-03-07 14:13:06,681][213771] Updated weights for policy 0, policy_version 5280 (0.0006) [2023-03-07 14:13:07,464][213771] Updated weights for policy 0, policy_version 5290 (0.0006) [2023-03-07 14:13:08,248][213771] Updated weights for policy 0, policy_version 5300 (0.0005) [2023-03-07 14:13:09,022][213771] Updated weights for policy 0, policy_version 5310 (0.0006) [2023-03-07 14:13:09,792][213771] Updated weights for policy 0, policy_version 5320 (0.0006) [2023-03-07 14:13:10,568][213771] Updated weights for policy 0, policy_version 5330 (0.0008) [2023-03-07 14:13:11,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13209.6, 300 sec: 13225.2). Total num frames: 5464064. Throughput: 0: 13207.8. Samples: 5461403. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:13:11,106][213445] Avg episode reward: [(0, '3861.300')] [2023-03-07 14:13:11,348][213771] Updated weights for policy 0, policy_version 5340 (0.0007) [2023-03-07 14:13:12,120][213771] Updated weights for policy 0, policy_version 5350 (0.0006) [2023-03-07 14:13:12,898][213771] Updated weights for policy 0, policy_version 5360 (0.0006) [2023-03-07 14:13:13,681][213771] Updated weights for policy 0, policy_version 5370 (0.0007) [2023-03-07 14:13:14,468][213771] Updated weights for policy 0, policy_version 5380 (0.0006) [2023-03-07 14:13:15,238][213771] Updated weights for policy 0, policy_version 5390 (0.0006) [2023-03-07 14:13:15,993][213771] Updated weights for policy 0, policy_version 5400 (0.0006) [2023-03-07 14:13:16,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13209.6, 300 sec: 13228.7). Total num frames: 5530624. Throughput: 0: 13211.4. Samples: 5500938. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:13:16,106][213445] Avg episode reward: [(0, '4040.010')] [2023-03-07 14:13:16,756][213771] Updated weights for policy 0, policy_version 5410 (0.0005) [2023-03-07 14:13:17,542][213771] Updated weights for policy 0, policy_version 5420 (0.0006) [2023-03-07 14:13:18,322][213771] Updated weights for policy 0, policy_version 5430 (0.0006) [2023-03-07 14:13:19,074][213771] Updated weights for policy 0, policy_version 5440 (0.0006) [2023-03-07 14:13:19,865][213771] Updated weights for policy 0, policy_version 5450 (0.0006) [2023-03-07 14:13:20,645][213771] Updated weights for policy 0, policy_version 5460 (0.0006) [2023-03-07 14:13:21,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13226.7, 300 sec: 13228.7). Total num frames: 5597184. Throughput: 0: 13213.8. Samples: 5580376. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:13:21,105][213445] Avg episode reward: [(0, '4070.192')] [2023-03-07 14:13:21,409][213771] Updated weights for policy 0, policy_version 5470 (0.0006) [2023-03-07 14:13:22,196][213771] Updated weights for policy 0, policy_version 5480 (0.0006) [2023-03-07 14:13:22,969][213771] Updated weights for policy 0, policy_version 5490 (0.0005) [2023-03-07 14:13:23,729][213771] Updated weights for policy 0, policy_version 5500 (0.0007) [2023-03-07 14:13:24,506][213771] Updated weights for policy 0, policy_version 5510 (0.0006) [2023-03-07 14:13:25,280][213771] Updated weights for policy 0, policy_version 5520 (0.0006) [2023-03-07 14:13:26,058][213771] Updated weights for policy 0, policy_version 5530 (0.0006) [2023-03-07 14:13:26,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13209.6, 300 sec: 13228.7). Total num frames: 5662720. Throughput: 0: 13221.2. Samples: 5659776. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:13:26,105][213445] Avg episode reward: [(0, '4084.326')] [2023-03-07 14:13:26,837][213771] Updated weights for policy 0, policy_version 5540 (0.0006) [2023-03-07 14:13:27,595][213771] Updated weights for policy 0, policy_version 5550 (0.0005) [2023-03-07 14:13:28,364][213771] Updated weights for policy 0, policy_version 5560 (0.0006) [2023-03-07 14:13:29,144][213771] Updated weights for policy 0, policy_version 5570 (0.0007) [2023-03-07 14:13:29,918][213771] Updated weights for policy 0, policy_version 5580 (0.0006) [2023-03-07 14:13:30,692][213771] Updated weights for policy 0, policy_version 5590 (0.0006) [2023-03-07 14:13:31,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13226.6, 300 sec: 13228.7). Total num frames: 5729280. Throughput: 0: 13229.7. Samples: 5699626. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:13:31,106][213445] Avg episode reward: [(0, '4140.670')] [2023-03-07 14:13:31,472][213771] Updated weights for policy 0, policy_version 5600 (0.0006) [2023-03-07 14:13:32,241][213771] Updated weights for policy 0, policy_version 5610 (0.0006) [2023-03-07 14:13:33,003][213771] Updated weights for policy 0, policy_version 5620 (0.0006) [2023-03-07 14:13:33,781][213771] Updated weights for policy 0, policy_version 5630 (0.0006) [2023-03-07 14:13:34,545][213771] Updated weights for policy 0, policy_version 5640 (0.0006) [2023-03-07 14:13:35,312][213771] Updated weights for policy 0, policy_version 5650 (0.0006) [2023-03-07 14:13:36,088][213771] Updated weights for policy 0, policy_version 5660 (0.0006) [2023-03-07 14:13:36,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13226.7, 300 sec: 13232.2). Total num frames: 5795840. Throughput: 0: 13230.9. Samples: 5779259. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 14:13:36,105][213445] Avg episode reward: [(0, '4064.387')] [2023-03-07 14:13:36,849][213771] Updated weights for policy 0, policy_version 5670 (0.0006) [2023-03-07 14:13:37,621][213771] Updated weights for policy 0, policy_version 5680 (0.0006) [2023-03-07 14:13:38,390][213771] Updated weights for policy 0, policy_version 5690 (0.0007) [2023-03-07 14:13:39,156][213771] Updated weights for policy 0, policy_version 5700 (0.0006) [2023-03-07 14:13:39,947][213771] Updated weights for policy 0, policy_version 5710 (0.0005) [2023-03-07 14:13:40,730][213771] Updated weights for policy 0, policy_version 5720 (0.0006) [2023-03-07 14:13:41,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13226.7, 300 sec: 13228.7). Total num frames: 5861376. Throughput: 0: 13232.0. Samples: 5858731. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:13:41,105][213445] Avg episode reward: [(0, '4044.773')] [2023-03-07 14:13:41,483][213771] Updated weights for policy 0, policy_version 5730 (0.0006) [2023-03-07 14:13:42,249][213771] Updated weights for policy 0, policy_version 5740 (0.0006) [2023-03-07 14:13:43,030][213771] Updated weights for policy 0, policy_version 5750 (0.0006) [2023-03-07 14:13:43,813][213771] Updated weights for policy 0, policy_version 5760 (0.0006) [2023-03-07 14:13:44,558][213771] Updated weights for policy 0, policy_version 5770 (0.0006) [2023-03-07 14:13:45,364][213771] Updated weights for policy 0, policy_version 5780 (0.0006) [2023-03-07 14:13:46,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13232.2). Total num frames: 5927936. Throughput: 0: 13242.1. Samples: 5898725. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:13:46,106][213445] Avg episode reward: [(0, '4016.156')] [2023-03-07 14:13:46,136][213771] Updated weights for policy 0, policy_version 5790 (0.0006) [2023-03-07 14:13:46,908][213771] Updated weights for policy 0, policy_version 5800 (0.0006) [2023-03-07 14:13:47,681][213771] Updated weights for policy 0, policy_version 5810 (0.0006) [2023-03-07 14:13:48,484][213771] Updated weights for policy 0, policy_version 5820 (0.0006) [2023-03-07 14:13:49,245][213771] Updated weights for policy 0, policy_version 5830 (0.0006) [2023-03-07 14:13:50,029][213771] Updated weights for policy 0, policy_version 5840 (0.0006) [2023-03-07 14:13:50,794][213771] Updated weights for policy 0, policy_version 5850 (0.0006) [2023-03-07 14:13:51,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13226.7, 300 sec: 13228.7). Total num frames: 5993472. Throughput: 0: 13230.4. Samples: 5977597. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 14:13:51,106][213445] Avg episode reward: [(0, '4143.691')] [2023-03-07 14:13:51,575][213771] Updated weights for policy 0, policy_version 5860 (0.0008) [2023-03-07 14:13:52,345][213771] Updated weights for policy 0, policy_version 5870 (0.0006) [2023-03-07 14:13:53,117][213771] Updated weights for policy 0, policy_version 5880 (0.0006) [2023-03-07 14:13:53,890][213771] Updated weights for policy 0, policy_version 5890 (0.0006) [2023-03-07 14:13:54,653][213771] Updated weights for policy 0, policy_version 5900 (0.0006) [2023-03-07 14:13:55,429][213771] Updated weights for policy 0, policy_version 5910 (0.0006) [2023-03-07 14:13:56,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13228.7). Total num frames: 6060032. Throughput: 0: 13241.8. Samples: 6057286. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:13:56,106][213445] Avg episode reward: [(0, '4180.774')] [2023-03-07 14:13:56,191][213771] Updated weights for policy 0, policy_version 5920 (0.0005) [2023-03-07 14:13:56,964][213771] Updated weights for policy 0, policy_version 5930 (0.0005) [2023-03-07 14:13:57,729][213771] Updated weights for policy 0, policy_version 5940 (0.0005) [2023-03-07 14:13:58,511][213771] Updated weights for policy 0, policy_version 5950 (0.0006) [2023-03-07 14:13:59,272][213771] Updated weights for policy 0, policy_version 5960 (0.0006) [2023-03-07 14:14:00,046][213771] Updated weights for policy 0, policy_version 5970 (0.0008) [2023-03-07 14:14:00,837][213771] Updated weights for policy 0, policy_version 5980 (0.0005) [2023-03-07 14:14:01,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13243.8, 300 sec: 13232.2). Total num frames: 6126592. Throughput: 0: 13247.9. Samples: 6097091. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:14:01,106][213445] Avg episode reward: [(0, '4068.439')] [2023-03-07 14:14:01,602][213771] Updated weights for policy 0, policy_version 5990 (0.0006) [2023-03-07 14:14:02,341][213771] Updated weights for policy 0, policy_version 6000 (0.0006) [2023-03-07 14:14:03,119][213771] Updated weights for policy 0, policy_version 6010 (0.0007) [2023-03-07 14:14:03,888][213771] Updated weights for policy 0, policy_version 6020 (0.0007) [2023-03-07 14:14:04,654][213771] Updated weights for policy 0, policy_version 6030 (0.0007) [2023-03-07 14:14:05,441][213771] Updated weights for policy 0, policy_version 6040 (0.0007) [2023-03-07 14:14:06,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13243.8, 300 sec: 13232.2). Total num frames: 6193152. Throughput: 0: 13257.4. Samples: 6176959. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 14:14:06,105][213445] Avg episode reward: [(0, '4203.221')] [2023-03-07 14:14:06,110][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000006048_6193152.pth... [2023-03-07 14:14:06,140][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000002947_3017728.pth [2023-03-07 14:14:06,214][213771] Updated weights for policy 0, policy_version 6050 (0.0007) [2023-03-07 14:14:06,982][213771] Updated weights for policy 0, policy_version 6060 (0.0006) [2023-03-07 14:14:07,764][213771] Updated weights for policy 0, policy_version 6070 (0.0006) [2023-03-07 14:14:08,520][213771] Updated weights for policy 0, policy_version 6080 (0.0006) [2023-03-07 14:14:09,282][213771] Updated weights for policy 0, policy_version 6090 (0.0006) [2023-03-07 14:14:10,066][213771] Updated weights for policy 0, policy_version 6100 (0.0006) [2023-03-07 14:14:10,825][213771] Updated weights for policy 0, policy_version 6110 (0.0007) [2023-03-07 14:14:11,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13235.6). Total num frames: 6259712. Throughput: 0: 13268.0. Samples: 6256838. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:14:11,106][213445] Avg episode reward: [(0, '4246.429')] [2023-03-07 14:14:11,106][213720] Saving new best policy, reward=4246.429! [2023-03-07 14:14:11,595][213771] Updated weights for policy 0, policy_version 6120 (0.0006) [2023-03-07 14:14:12,370][213771] Updated weights for policy 0, policy_version 6130 (0.0006) [2023-03-07 14:14:13,152][213771] Updated weights for policy 0, policy_version 6140 (0.0006) [2023-03-07 14:14:13,910][213771] Updated weights for policy 0, policy_version 6150 (0.0006) [2023-03-07 14:14:14,690][213771] Updated weights for policy 0, policy_version 6160 (0.0006) [2023-03-07 14:14:15,456][213771] Updated weights for policy 0, policy_version 6170 (0.0006) [2023-03-07 14:14:16,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13260.8, 300 sec: 13235.6). Total num frames: 6326272. Throughput: 0: 13264.5. Samples: 6296528. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:14:16,106][213445] Avg episode reward: [(0, '4177.185')] [2023-03-07 14:14:16,234][213771] Updated weights for policy 0, policy_version 6180 (0.0006) [2023-03-07 14:14:17,024][213771] Updated weights for policy 0, policy_version 6190 (0.0006) [2023-03-07 14:14:17,798][213771] Updated weights for policy 0, policy_version 6200 (0.0006) [2023-03-07 14:14:18,569][213771] Updated weights for policy 0, policy_version 6210 (0.0006) [2023-03-07 14:14:19,340][213771] Updated weights for policy 0, policy_version 6220 (0.0006) [2023-03-07 14:14:20,114][213771] Updated weights for policy 0, policy_version 6230 (0.0006) [2023-03-07 14:14:20,901][213771] Updated weights for policy 0, policy_version 6240 (0.0006) [2023-03-07 14:14:21,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13232.2). Total num frames: 6391808. Throughput: 0: 13256.5. Samples: 6375805. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:14:21,106][213445] Avg episode reward: [(0, '4130.916')] [2023-03-07 14:14:21,648][213771] Updated weights for policy 0, policy_version 6250 (0.0006) [2023-03-07 14:14:22,449][213771] Updated weights for policy 0, policy_version 6260 (0.0006) [2023-03-07 14:14:23,218][213771] Updated weights for policy 0, policy_version 6270 (0.0006) [2023-03-07 14:14:23,997][213771] Updated weights for policy 0, policy_version 6280 (0.0006) [2023-03-07 14:14:24,759][213771] Updated weights for policy 0, policy_version 6290 (0.0006) [2023-03-07 14:14:25,529][213771] Updated weights for policy 0, policy_version 6300 (0.0007) [2023-03-07 14:14:26,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13260.8, 300 sec: 13235.6). Total num frames: 6458368. Throughput: 0: 13256.3. Samples: 6455265. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:14:26,106][213445] Avg episode reward: [(0, '4170.463')] [2023-03-07 14:14:26,297][213771] Updated weights for policy 0, policy_version 6310 (0.0006) [2023-03-07 14:14:27,056][213771] Updated weights for policy 0, policy_version 6320 (0.0005) [2023-03-07 14:14:27,846][213771] Updated weights for policy 0, policy_version 6330 (0.0006) [2023-03-07 14:14:28,612][213771] Updated weights for policy 0, policy_version 6340 (0.0006) [2023-03-07 14:14:29,393][213771] Updated weights for policy 0, policy_version 6350 (0.0006) [2023-03-07 14:14:30,182][213771] Updated weights for policy 0, policy_version 6360 (0.0006) [2023-03-07 14:14:30,944][213771] Updated weights for policy 0, policy_version 6370 (0.0007) [2023-03-07 14:14:31,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13260.8, 300 sec: 13235.6). Total num frames: 6524928. Throughput: 0: 13252.8. Samples: 6495101. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:14:31,105][213445] Avg episode reward: [(0, '4252.224')] [2023-03-07 14:14:31,106][213720] Saving new best policy, reward=4252.224! [2023-03-07 14:14:31,726][213771] Updated weights for policy 0, policy_version 6380 (0.0006) [2023-03-07 14:14:32,516][213771] Updated weights for policy 0, policy_version 6390 (0.0007) [2023-03-07 14:14:33,278][213771] Updated weights for policy 0, policy_version 6400 (0.0006) [2023-03-07 14:14:34,060][213771] Updated weights for policy 0, policy_version 6410 (0.0008) [2023-03-07 14:14:34,842][213771] Updated weights for policy 0, policy_version 6420 (0.0007) [2023-03-07 14:14:35,618][213771] Updated weights for policy 0, policy_version 6430 (0.0006) [2023-03-07 14:14:36,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13235.6). Total num frames: 6590464. Throughput: 0: 13255.0. Samples: 6574071. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 14:14:36,106][213445] Avg episode reward: [(0, '4238.621')] [2023-03-07 14:14:36,386][213771] Updated weights for policy 0, policy_version 6440 (0.0006) [2023-03-07 14:14:37,163][213771] Updated weights for policy 0, policy_version 6450 (0.0006) [2023-03-07 14:14:37,932][213771] Updated weights for policy 0, policy_version 6460 (0.0006) [2023-03-07 14:14:38,707][213771] Updated weights for policy 0, policy_version 6470 (0.0006) [2023-03-07 14:14:39,469][213771] Updated weights for policy 0, policy_version 6480 (0.0006) [2023-03-07 14:14:40,230][213771] Updated weights for policy 0, policy_version 6490 (0.0006) [2023-03-07 14:14:41,016][213771] Updated weights for policy 0, policy_version 6500 (0.0006) [2023-03-07 14:14:41,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13260.8, 300 sec: 13235.6). Total num frames: 6657024. Throughput: 0: 13254.4. Samples: 6653735. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:14:41,116][213445] Avg episode reward: [(0, '4287.547')] [2023-03-07 14:14:41,118][213720] Saving new best policy, reward=4287.547! [2023-03-07 14:14:41,773][213771] Updated weights for policy 0, policy_version 6510 (0.0006) [2023-03-07 14:14:42,545][213771] Updated weights for policy 0, policy_version 6520 (0.0006) [2023-03-07 14:14:43,333][213771] Updated weights for policy 0, policy_version 6530 (0.0006) [2023-03-07 14:14:44,122][213771] Updated weights for policy 0, policy_version 6540 (0.0006) [2023-03-07 14:14:44,883][213771] Updated weights for policy 0, policy_version 6550 (0.0006) [2023-03-07 14:14:45,666][213771] Updated weights for policy 0, policy_version 6560 (0.0007) [2023-03-07 14:14:46,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13235.6). Total num frames: 6722560. Throughput: 0: 13247.5. Samples: 6693227. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:14:46,116][213445] Avg episode reward: [(0, '4308.280')] [2023-03-07 14:14:46,122][213720] Saving new best policy, reward=4308.280! [2023-03-07 14:14:46,473][213771] Updated weights for policy 0, policy_version 6570 (0.0007) [2023-03-07 14:14:47,234][213771] Updated weights for policy 0, policy_version 6580 (0.0006) [2023-03-07 14:14:48,002][213771] Updated weights for policy 0, policy_version 6590 (0.0007) [2023-03-07 14:14:48,763][213771] Updated weights for policy 0, policy_version 6600 (0.0006) [2023-03-07 14:14:49,540][213771] Updated weights for policy 0, policy_version 6610 (0.0007) [2023-03-07 14:14:50,319][213771] Updated weights for policy 0, policy_version 6620 (0.0007) [2023-03-07 14:14:51,104][213771] Updated weights for policy 0, policy_version 6630 (0.0006) [2023-03-07 14:14:51,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13260.8, 300 sec: 13235.6). Total num frames: 6789120. Throughput: 0: 13235.8. Samples: 6772569. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:14:51,115][213445] Avg episode reward: [(0, '4271.437')] [2023-03-07 14:14:51,889][213771] Updated weights for policy 0, policy_version 6640 (0.0006) [2023-03-07 14:14:52,664][213771] Updated weights for policy 0, policy_version 6650 (0.0006) [2023-03-07 14:14:53,435][213771] Updated weights for policy 0, policy_version 6660 (0.0006) [2023-03-07 14:14:54,205][213771] Updated weights for policy 0, policy_version 6670 (0.0006) [2023-03-07 14:14:54,982][213771] Updated weights for policy 0, policy_version 6680 (0.0006) [2023-03-07 14:14:55,764][213771] Updated weights for policy 0, policy_version 6690 (0.0006) [2023-03-07 14:14:56,105][213445] Fps is (10 sec: 13209.3, 60 sec: 13243.7, 300 sec: 13232.2). Total num frames: 6854656. Throughput: 0: 13214.6. Samples: 6851497. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:14:56,117][213445] Avg episode reward: [(0, '4027.808')] [2023-03-07 14:14:56,540][213771] Updated weights for policy 0, policy_version 6700 (0.0006) [2023-03-07 14:14:57,305][213771] Updated weights for policy 0, policy_version 6710 (0.0006) [2023-03-07 14:14:58,088][213771] Updated weights for policy 0, policy_version 6720 (0.0006) [2023-03-07 14:14:58,864][213771] Updated weights for policy 0, policy_version 6730 (0.0007) [2023-03-07 14:14:59,640][213771] Updated weights for policy 0, policy_version 6740 (0.0006) [2023-03-07 14:15:00,418][213771] Updated weights for policy 0, policy_version 6750 (0.0007) [2023-03-07 14:15:01,105][213445] Fps is (10 sec: 13106.9, 60 sec: 13226.6, 300 sec: 13228.7). Total num frames: 6920192. Throughput: 0: 13212.8. Samples: 6891104. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:15:01,117][213445] Avg episode reward: [(0, '4218.235')] [2023-03-07 14:15:01,198][213771] Updated weights for policy 0, policy_version 6760 (0.0006) [2023-03-07 14:15:01,965][213771] Updated weights for policy 0, policy_version 6770 (0.0005) [2023-03-07 14:15:02,737][213771] Updated weights for policy 0, policy_version 6780 (0.0006) [2023-03-07 14:15:03,529][213771] Updated weights for policy 0, policy_version 6790 (0.0006) [2023-03-07 14:15:04,293][213771] Updated weights for policy 0, policy_version 6800 (0.0006) [2023-03-07 14:15:05,077][213771] Updated weights for policy 0, policy_version 6810 (0.0008) [2023-03-07 14:15:05,859][213771] Updated weights for policy 0, policy_version 6820 (0.0006) [2023-03-07 14:15:06,105][213445] Fps is (10 sec: 13107.5, 60 sec: 13209.6, 300 sec: 13225.2). Total num frames: 6985728. Throughput: 0: 13206.7. Samples: 6970104. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:15:06,116][213445] Avg episode reward: [(0, '4212.313')] [2023-03-07 14:15:06,637][213771] Updated weights for policy 0, policy_version 6830 (0.0006) [2023-03-07 14:15:07,414][213771] Updated weights for policy 0, policy_version 6840 (0.0006) [2023-03-07 14:15:08,202][213771] Updated weights for policy 0, policy_version 6850 (0.0006) [2023-03-07 14:15:08,970][213771] Updated weights for policy 0, policy_version 6860 (0.0006) [2023-03-07 14:15:09,730][213771] Updated weights for policy 0, policy_version 6870 (0.0006) [2023-03-07 14:15:10,539][213771] Updated weights for policy 0, policy_version 6880 (0.0007) [2023-03-07 14:15:11,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13209.6, 300 sec: 13228.7). Total num frames: 7052288. Throughput: 0: 13199.2. Samples: 7049228. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 14:15:11,116][213445] Avg episode reward: [(0, '4094.320')] [2023-03-07 14:15:11,300][213771] Updated weights for policy 0, policy_version 6890 (0.0007) [2023-03-07 14:15:12,050][213771] Updated weights for policy 0, policy_version 6900 (0.0007) [2023-03-07 14:15:12,815][213771] Updated weights for policy 0, policy_version 6910 (0.0006) [2023-03-07 14:15:13,586][213771] Updated weights for policy 0, policy_version 6920 (0.0006) [2023-03-07 14:15:14,356][213771] Updated weights for policy 0, policy_version 6930 (0.0005) [2023-03-07 14:15:15,127][213771] Updated weights for policy 0, policy_version 6940 (0.0006) [2023-03-07 14:15:15,906][213771] Updated weights for policy 0, policy_version 6950 (0.0006) [2023-03-07 14:15:16,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13209.6, 300 sec: 13228.7). Total num frames: 7118848. Throughput: 0: 13206.5. Samples: 7089397. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 14:15:16,116][213445] Avg episode reward: [(0, '4301.254')] [2023-03-07 14:15:16,666][213771] Updated weights for policy 0, policy_version 6960 (0.0006) [2023-03-07 14:15:17,469][213771] Updated weights for policy 0, policy_version 6970 (0.0006) [2023-03-07 14:15:18,228][213771] Updated weights for policy 0, policy_version 6980 (0.0005) [2023-03-07 14:15:18,993][213771] Updated weights for policy 0, policy_version 6990 (0.0006) [2023-03-07 14:15:19,773][213771] Updated weights for policy 0, policy_version 7000 (0.0006) [2023-03-07 14:15:20,560][213771] Updated weights for policy 0, policy_version 7010 (0.0006) [2023-03-07 14:15:21,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13226.7, 300 sec: 13228.7). Total num frames: 7185408. Throughput: 0: 13214.4. Samples: 7168716. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:15:21,116][213445] Avg episode reward: [(0, '4036.130')] [2023-03-07 14:15:21,336][213771] Updated weights for policy 0, policy_version 7020 (0.0007) [2023-03-07 14:15:22,121][213771] Updated weights for policy 0, policy_version 7030 (0.0006) [2023-03-07 14:15:22,898][213771] Updated weights for policy 0, policy_version 7040 (0.0006) [2023-03-07 14:15:23,681][213771] Updated weights for policy 0, policy_version 7050 (0.0006) [2023-03-07 14:15:24,458][213771] Updated weights for policy 0, policy_version 7060 (0.0006) [2023-03-07 14:15:25,231][213771] Updated weights for policy 0, policy_version 7070 (0.0006) [2023-03-07 14:15:26,010][213771] Updated weights for policy 0, policy_version 7080 (0.0006) [2023-03-07 14:15:26,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13209.6, 300 sec: 13225.2). Total num frames: 7250944. Throughput: 0: 13199.2. Samples: 7247698. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:15:26,116][213445] Avg episode reward: [(0, '4127.799')] [2023-03-07 14:15:26,778][213771] Updated weights for policy 0, policy_version 7090 (0.0006) [2023-03-07 14:15:27,548][213771] Updated weights for policy 0, policy_version 7100 (0.0006) [2023-03-07 14:15:28,329][213771] Updated weights for policy 0, policy_version 7110 (0.0006) [2023-03-07 14:15:29,083][213771] Updated weights for policy 0, policy_version 7120 (0.0005) [2023-03-07 14:15:29,868][213771] Updated weights for policy 0, policy_version 7130 (0.0006) [2023-03-07 14:15:30,631][213771] Updated weights for policy 0, policy_version 7140 (0.0006) [2023-03-07 14:15:31,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13209.6, 300 sec: 13225.2). Total num frames: 7317504. Throughput: 0: 13202.4. Samples: 7287338. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:15:31,106][213445] Avg episode reward: [(0, '3960.298')] [2023-03-07 14:15:31,407][213771] Updated weights for policy 0, policy_version 7150 (0.0007) [2023-03-07 14:15:32,187][213771] Updated weights for policy 0, policy_version 7160 (0.0006) [2023-03-07 14:15:32,966][213771] Updated weights for policy 0, policy_version 7170 (0.0006) [2023-03-07 14:15:33,748][213771] Updated weights for policy 0, policy_version 7180 (0.0006) [2023-03-07 14:15:34,530][213771] Updated weights for policy 0, policy_version 7190 (0.0006) [2023-03-07 14:15:35,306][213771] Updated weights for policy 0, policy_version 7200 (0.0006) [2023-03-07 14:15:36,074][213771] Updated weights for policy 0, policy_version 7210 (0.0006) [2023-03-07 14:15:36,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13209.6, 300 sec: 13221.8). Total num frames: 7383040. Throughput: 0: 13196.0. Samples: 7366389. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:15:36,106][213445] Avg episode reward: [(0, '4113.950')] [2023-03-07 14:15:36,865][213771] Updated weights for policy 0, policy_version 7220 (0.0005) [2023-03-07 14:15:37,661][213771] Updated weights for policy 0, policy_version 7230 (0.0006) [2023-03-07 14:15:38,413][213771] Updated weights for policy 0, policy_version 7240 (0.0006) [2023-03-07 14:15:39,181][213771] Updated weights for policy 0, policy_version 7250 (0.0006) [2023-03-07 14:15:39,958][213771] Updated weights for policy 0, policy_version 7260 (0.0006) [2023-03-07 14:15:40,723][213771] Updated weights for policy 0, policy_version 7270 (0.0006) [2023-03-07 14:15:41,105][213445] Fps is (10 sec: 13107.3, 60 sec: 13192.5, 300 sec: 13221.8). Total num frames: 7448576. Throughput: 0: 13207.9. Samples: 7445849. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:15:41,106][213445] Avg episode reward: [(0, '4292.115')] [2023-03-07 14:15:41,500][213771] Updated weights for policy 0, policy_version 7280 (0.0006) [2023-03-07 14:15:42,264][213771] Updated weights for policy 0, policy_version 7290 (0.0006) [2023-03-07 14:15:43,052][213771] Updated weights for policy 0, policy_version 7300 (0.0006) [2023-03-07 14:15:43,806][213771] Updated weights for policy 0, policy_version 7310 (0.0006) [2023-03-07 14:15:44,588][213771] Updated weights for policy 0, policy_version 7320 (0.0007) [2023-03-07 14:15:45,357][213771] Updated weights for policy 0, policy_version 7330 (0.0006) [2023-03-07 14:15:46,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13209.6, 300 sec: 13221.8). Total num frames: 7515136. Throughput: 0: 13213.0. Samples: 7485688. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:15:46,106][213445] Avg episode reward: [(0, '4240.986')] [2023-03-07 14:15:46,145][213771] Updated weights for policy 0, policy_version 7340 (0.0007) [2023-03-07 14:15:46,905][213771] Updated weights for policy 0, policy_version 7350 (0.0006) [2023-03-07 14:15:47,670][213771] Updated weights for policy 0, policy_version 7360 (0.0006) [2023-03-07 14:15:48,443][213771] Updated weights for policy 0, policy_version 7370 (0.0006) [2023-03-07 14:15:49,210][213771] Updated weights for policy 0, policy_version 7380 (0.0005) [2023-03-07 14:15:49,992][213771] Updated weights for policy 0, policy_version 7390 (0.0006) [2023-03-07 14:15:50,765][213771] Updated weights for policy 0, policy_version 7400 (0.0006) [2023-03-07 14:15:51,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13209.6, 300 sec: 13225.2). Total num frames: 7581696. Throughput: 0: 13222.2. Samples: 7565105. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:15:51,106][213445] Avg episode reward: [(0, '4250.099')] [2023-03-07 14:15:51,535][213771] Updated weights for policy 0, policy_version 7410 (0.0006) [2023-03-07 14:15:52,304][213771] Updated weights for policy 0, policy_version 7420 (0.0006) [2023-03-07 14:15:53,098][213771] Updated weights for policy 0, policy_version 7430 (0.0006) [2023-03-07 14:15:53,859][213771] Updated weights for policy 0, policy_version 7440 (0.0006) [2023-03-07 14:15:54,627][213771] Updated weights for policy 0, policy_version 7450 (0.0006) [2023-03-07 14:15:55,408][213771] Updated weights for policy 0, policy_version 7460 (0.0006) [2023-03-07 14:15:56,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13226.7, 300 sec: 13225.2). Total num frames: 7648256. Throughput: 0: 13232.6. Samples: 7644696. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:15:56,106][213445] Avg episode reward: [(0, '4070.706')] [2023-03-07 14:15:56,184][213771] Updated weights for policy 0, policy_version 7470 (0.0005) [2023-03-07 14:15:56,946][213771] Updated weights for policy 0, policy_version 7480 (0.0006) [2023-03-07 14:15:57,728][213771] Updated weights for policy 0, policy_version 7490 (0.0006) [2023-03-07 14:15:58,492][213771] Updated weights for policy 0, policy_version 7500 (0.0006) [2023-03-07 14:15:59,269][213771] Updated weights for policy 0, policy_version 7510 (0.0006) [2023-03-07 14:16:00,031][213771] Updated weights for policy 0, policy_version 7520 (0.0007) [2023-03-07 14:16:00,810][213771] Updated weights for policy 0, policy_version 7530 (0.0005) [2023-03-07 14:16:01,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13225.2). Total num frames: 7713792. Throughput: 0: 13225.4. Samples: 7684539. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:16:01,106][213445] Avg episode reward: [(0, '4092.080')] [2023-03-07 14:16:01,574][213771] Updated weights for policy 0, policy_version 7540 (0.0006) [2023-03-07 14:16:02,343][213771] Updated weights for policy 0, policy_version 7550 (0.0006) [2023-03-07 14:16:03,113][213771] Updated weights for policy 0, policy_version 7560 (0.0006) [2023-03-07 14:16:03,867][213771] Updated weights for policy 0, policy_version 7570 (0.0006) [2023-03-07 14:16:04,655][213771] Updated weights for policy 0, policy_version 7580 (0.0006) [2023-03-07 14:16:05,422][213771] Updated weights for policy 0, policy_version 7590 (0.0006) [2023-03-07 14:16:06,105][213445] Fps is (10 sec: 13209.3, 60 sec: 13243.7, 300 sec: 13228.7). Total num frames: 7780352. Throughput: 0: 13235.6. Samples: 7764322. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:16:06,106][213445] Avg episode reward: [(0, '4131.007')] [2023-03-07 14:16:06,128][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000007599_7781376.pth... [2023-03-07 14:16:06,159][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000004498_4605952.pth [2023-03-07 14:16:06,192][213771] Updated weights for policy 0, policy_version 7600 (0.0005) [2023-03-07 14:16:06,969][213771] Updated weights for policy 0, policy_version 7610 (0.0006) [2023-03-07 14:16:07,744][213771] Updated weights for policy 0, policy_version 7620 (0.0007) [2023-03-07 14:16:08,510][213771] Updated weights for policy 0, policy_version 7630 (0.0006) [2023-03-07 14:16:09,301][213771] Updated weights for policy 0, policy_version 7640 (0.0006) [2023-03-07 14:16:10,091][213771] Updated weights for policy 0, policy_version 7650 (0.0007) [2023-03-07 14:16:10,866][213771] Updated weights for policy 0, policy_version 7660 (0.0006) [2023-03-07 14:16:11,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13243.7, 300 sec: 13232.2). Total num frames: 7846912. Throughput: 0: 13243.1. Samples: 7843638. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:16:11,106][213445] Avg episode reward: [(0, '3909.138')] [2023-03-07 14:16:11,640][213771] Updated weights for policy 0, policy_version 7670 (0.0007) [2023-03-07 14:16:12,401][213771] Updated weights for policy 0, policy_version 7680 (0.0006) [2023-03-07 14:16:13,182][213771] Updated weights for policy 0, policy_version 7690 (0.0006) [2023-03-07 14:16:13,953][213771] Updated weights for policy 0, policy_version 7700 (0.0006) [2023-03-07 14:16:14,718][213771] Updated weights for policy 0, policy_version 7710 (0.0006) [2023-03-07 14:16:15,498][213771] Updated weights for policy 0, policy_version 7720 (0.0005) [2023-03-07 14:16:16,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13226.7, 300 sec: 13228.7). Total num frames: 7912448. Throughput: 0: 13243.3. Samples: 7883284. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:16:16,106][213445] Avg episode reward: [(0, '3354.355')] [2023-03-07 14:16:16,275][213771] Updated weights for policy 0, policy_version 7730 (0.0006) [2023-03-07 14:16:17,053][213771] Updated weights for policy 0, policy_version 7740 (0.0006) [2023-03-07 14:16:17,808][213771] Updated weights for policy 0, policy_version 7750 (0.0006) [2023-03-07 14:16:18,593][213771] Updated weights for policy 0, policy_version 7760 (0.0006) [2023-03-07 14:16:19,382][213771] Updated weights for policy 0, policy_version 7770 (0.0007) [2023-03-07 14:16:20,150][213771] Updated weights for policy 0, policy_version 7780 (0.0006) [2023-03-07 14:16:20,918][213771] Updated weights for policy 0, policy_version 7790 (0.0006) [2023-03-07 14:16:21,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13228.7). Total num frames: 7979008. Throughput: 0: 13249.4. Samples: 7962612. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:16:21,106][213445] Avg episode reward: [(0, '4122.896')] [2023-03-07 14:16:21,696][213771] Updated weights for policy 0, policy_version 7800 (0.0006) [2023-03-07 14:16:22,470][213771] Updated weights for policy 0, policy_version 7810 (0.0006) [2023-03-07 14:16:23,244][213771] Updated weights for policy 0, policy_version 7820 (0.0006) [2023-03-07 14:16:24,013][213771] Updated weights for policy 0, policy_version 7830 (0.0006) [2023-03-07 14:16:24,791][213771] Updated weights for policy 0, policy_version 7840 (0.0006) [2023-03-07 14:16:25,562][213771] Updated weights for policy 0, policy_version 7850 (0.0007) [2023-03-07 14:16:26,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13243.7, 300 sec: 13228.7). Total num frames: 8045568. Throughput: 0: 13247.1. Samples: 8041969. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:16:26,106][213445] Avg episode reward: [(0, '4206.834')] [2023-03-07 14:16:26,345][213771] Updated weights for policy 0, policy_version 7860 (0.0007) [2023-03-07 14:16:27,102][213771] Updated weights for policy 0, policy_version 7870 (0.0007) [2023-03-07 14:16:27,879][213771] Updated weights for policy 0, policy_version 7880 (0.0008) [2023-03-07 14:16:28,654][213771] Updated weights for policy 0, policy_version 7890 (0.0007) [2023-03-07 14:16:29,407][213771] Updated weights for policy 0, policy_version 7900 (0.0006) [2023-03-07 14:16:30,192][213771] Updated weights for policy 0, policy_version 7910 (0.0006) [2023-03-07 14:16:30,959][213771] Updated weights for policy 0, policy_version 7920 (0.0006) [2023-03-07 14:16:31,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13228.7). Total num frames: 8111104. Throughput: 0: 13246.8. Samples: 8081793. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:16:31,106][213445] Avg episode reward: [(0, '4288.205')] [2023-03-07 14:16:31,726][213771] Updated weights for policy 0, policy_version 7930 (0.0006) [2023-03-07 14:16:32,502][213771] Updated weights for policy 0, policy_version 7940 (0.0006) [2023-03-07 14:16:33,254][213771] Updated weights for policy 0, policy_version 7950 (0.0006) [2023-03-07 14:16:34,057][213771] Updated weights for policy 0, policy_version 7960 (0.0006) [2023-03-07 14:16:34,826][213771] Updated weights for policy 0, policy_version 7970 (0.0006) [2023-03-07 14:16:35,593][213771] Updated weights for policy 0, policy_version 7980 (0.0006) [2023-03-07 14:16:36,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13228.7). Total num frames: 8177664. Throughput: 0: 13252.6. Samples: 8161473. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:16:36,106][213445] Avg episode reward: [(0, '4265.949')] [2023-03-07 14:16:36,360][213771] Updated weights for policy 0, policy_version 7990 (0.0006) [2023-03-07 14:16:37,133][213771] Updated weights for policy 0, policy_version 8000 (0.0007) [2023-03-07 14:16:37,885][213771] Updated weights for policy 0, policy_version 8010 (0.0005) [2023-03-07 14:16:38,641][213771] Updated weights for policy 0, policy_version 8020 (0.0006) [2023-03-07 14:16:39,417][213771] Updated weights for policy 0, policy_version 8030 (0.0006) [2023-03-07 14:16:40,198][213771] Updated weights for policy 0, policy_version 8040 (0.0007) [2023-03-07 14:16:40,974][213771] Updated weights for policy 0, policy_version 8050 (0.0006) [2023-03-07 14:16:41,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13228.7). Total num frames: 8244224. Throughput: 0: 13258.1. Samples: 8241313. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:16:41,106][213445] Avg episode reward: [(0, '4007.365')] [2023-03-07 14:16:41,753][213771] Updated weights for policy 0, policy_version 8060 (0.0006) [2023-03-07 14:16:42,502][213771] Updated weights for policy 0, policy_version 8070 (0.0006) [2023-03-07 14:16:43,282][213771] Updated weights for policy 0, policy_version 8080 (0.0006) [2023-03-07 14:16:44,058][213771] Updated weights for policy 0, policy_version 8090 (0.0006) [2023-03-07 14:16:44,845][213771] Updated weights for policy 0, policy_version 8100 (0.0006) [2023-03-07 14:16:45,618][213771] Updated weights for policy 0, policy_version 8110 (0.0006) [2023-03-07 14:16:46,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13260.8, 300 sec: 13232.2). Total num frames: 8310784. Throughput: 0: 13259.0. Samples: 8281196. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:16:46,106][213445] Avg episode reward: [(0, '3698.127')] [2023-03-07 14:16:46,392][213771] Updated weights for policy 0, policy_version 8120 (0.0007) [2023-03-07 14:16:47,157][213771] Updated weights for policy 0, policy_version 8130 (0.0006) [2023-03-07 14:16:47,931][213771] Updated weights for policy 0, policy_version 8140 (0.0005) [2023-03-07 14:16:48,706][213771] Updated weights for policy 0, policy_version 8150 (0.0007) [2023-03-07 14:16:49,477][213771] Updated weights for policy 0, policy_version 8160 (0.0006) [2023-03-07 14:16:50,246][213771] Updated weights for policy 0, policy_version 8170 (0.0007) [2023-03-07 14:16:51,019][213771] Updated weights for policy 0, policy_version 8180 (0.0006) [2023-03-07 14:16:51,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13243.8, 300 sec: 13228.7). Total num frames: 8376320. Throughput: 0: 13248.3. Samples: 8360492. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:16:51,105][213445] Avg episode reward: [(0, '3784.788')] [2023-03-07 14:16:51,789][213771] Updated weights for policy 0, policy_version 8190 (0.0005) [2023-03-07 14:16:52,555][213771] Updated weights for policy 0, policy_version 8200 (0.0005) [2023-03-07 14:16:53,351][213771] Updated weights for policy 0, policy_version 8210 (0.0006) [2023-03-07 14:16:54,111][213771] Updated weights for policy 0, policy_version 8220 (0.0006) [2023-03-07 14:16:54,877][213771] Updated weights for policy 0, policy_version 8230 (0.0006) [2023-03-07 14:16:55,633][213771] Updated weights for policy 0, policy_version 8240 (0.0005) [2023-03-07 14:16:56,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13232.2). Total num frames: 8442880. Throughput: 0: 13258.1. Samples: 8440252. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:16:56,106][213445] Avg episode reward: [(0, '3812.239')] [2023-03-07 14:16:56,417][213771] Updated weights for policy 0, policy_version 8250 (0.0008) [2023-03-07 14:16:57,193][213771] Updated weights for policy 0, policy_version 8260 (0.0006) [2023-03-07 14:16:57,971][213771] Updated weights for policy 0, policy_version 8270 (0.0007) [2023-03-07 14:16:58,729][213771] Updated weights for policy 0, policy_version 8280 (0.0007) [2023-03-07 14:16:59,508][213771] Updated weights for policy 0, policy_version 8290 (0.0006) [2023-03-07 14:17:00,267][213771] Updated weights for policy 0, policy_version 8300 (0.0006) [2023-03-07 14:17:01,029][213771] Updated weights for policy 0, policy_version 8310 (0.0007) [2023-03-07 14:17:01,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13260.8, 300 sec: 13232.2). Total num frames: 8509440. Throughput: 0: 13260.4. Samples: 8480005. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:17:01,106][213445] Avg episode reward: [(0, '3865.316')] [2023-03-07 14:17:01,817][213771] Updated weights for policy 0, policy_version 8320 (0.0007) [2023-03-07 14:17:02,593][213771] Updated weights for policy 0, policy_version 8330 (0.0006) [2023-03-07 14:17:03,365][213771] Updated weights for policy 0, policy_version 8340 (0.0008) [2023-03-07 14:17:04,158][213771] Updated weights for policy 0, policy_version 8350 (0.0005) [2023-03-07 14:17:04,912][213771] Updated weights for policy 0, policy_version 8360 (0.0006) [2023-03-07 14:17:05,686][213771] Updated weights for policy 0, policy_version 8370 (0.0007) [2023-03-07 14:17:06,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13235.6). Total num frames: 8576000. Throughput: 0: 13268.9. Samples: 8559712. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:17:06,105][213445] Avg episode reward: [(0, '4119.978')] [2023-03-07 14:17:06,473][213771] Updated weights for policy 0, policy_version 8380 (0.0006) [2023-03-07 14:17:07,241][213771] Updated weights for policy 0, policy_version 8390 (0.0006) [2023-03-07 14:17:08,018][213771] Updated weights for policy 0, policy_version 8400 (0.0006) [2023-03-07 14:17:08,789][213771] Updated weights for policy 0, policy_version 8410 (0.0005) [2023-03-07 14:17:09,563][213771] Updated weights for policy 0, policy_version 8420 (0.0005) [2023-03-07 14:17:10,333][213771] Updated weights for policy 0, policy_version 8430 (0.0006) [2023-03-07 14:17:11,093][213771] Updated weights for policy 0, policy_version 8440 (0.0006) [2023-03-07 14:17:11,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13235.6). Total num frames: 8642560. Throughput: 0: 13266.5. Samples: 8638961. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:17:11,106][213445] Avg episode reward: [(0, '4176.509')] [2023-03-07 14:17:11,867][213771] Updated weights for policy 0, policy_version 8450 (0.0006) [2023-03-07 14:17:12,646][213771] Updated weights for policy 0, policy_version 8460 (0.0006) [2023-03-07 14:17:13,419][213771] Updated weights for policy 0, policy_version 8470 (0.0006) [2023-03-07 14:17:14,186][213771] Updated weights for policy 0, policy_version 8480 (0.0006) [2023-03-07 14:17:14,973][213771] Updated weights for policy 0, policy_version 8490 (0.0007) [2023-03-07 14:17:15,740][213771] Updated weights for policy 0, policy_version 8500 (0.0007) [2023-03-07 14:17:16,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13235.6). Total num frames: 8708096. Throughput: 0: 13263.6. Samples: 8678654. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:17:16,106][213445] Avg episode reward: [(0, '4174.501')] [2023-03-07 14:17:16,526][213771] Updated weights for policy 0, policy_version 8510 (0.0006) [2023-03-07 14:17:17,290][213771] Updated weights for policy 0, policy_version 8520 (0.0006) [2023-03-07 14:17:18,058][213771] Updated weights for policy 0, policy_version 8530 (0.0007) [2023-03-07 14:17:18,824][213771] Updated weights for policy 0, policy_version 8540 (0.0006) [2023-03-07 14:17:19,609][213771] Updated weights for policy 0, policy_version 8550 (0.0006) [2023-03-07 14:17:20,389][213771] Updated weights for policy 0, policy_version 8560 (0.0008) [2023-03-07 14:17:21,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13260.8, 300 sec: 13235.6). Total num frames: 8774656. Throughput: 0: 13257.8. Samples: 8758074. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:17:21,106][213445] Avg episode reward: [(0, '4180.748')] [2023-03-07 14:17:21,159][213771] Updated weights for policy 0, policy_version 8570 (0.0006) [2023-03-07 14:17:21,922][213771] Updated weights for policy 0, policy_version 8580 (0.0007) [2023-03-07 14:17:22,705][213771] Updated weights for policy 0, policy_version 8590 (0.0006) [2023-03-07 14:17:23,477][213771] Updated weights for policy 0, policy_version 8600 (0.0006) [2023-03-07 14:17:24,253][213771] Updated weights for policy 0, policy_version 8610 (0.0006) [2023-03-07 14:17:25,021][213771] Updated weights for policy 0, policy_version 8620 (0.0006) [2023-03-07 14:17:25,775][213771] Updated weights for policy 0, policy_version 8630 (0.0006) [2023-03-07 14:17:26,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13260.8, 300 sec: 13239.1). Total num frames: 8841216. Throughput: 0: 13253.8. Samples: 8837737. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:17:26,106][213445] Avg episode reward: [(0, '4321.501')] [2023-03-07 14:17:26,110][213720] Saving new best policy, reward=4321.501! [2023-03-07 14:17:26,551][213771] Updated weights for policy 0, policy_version 8640 (0.0005) [2023-03-07 14:17:27,337][213771] Updated weights for policy 0, policy_version 8650 (0.0006) [2023-03-07 14:17:28,080][213771] Updated weights for policy 0, policy_version 8660 (0.0006) [2023-03-07 14:17:28,847][213771] Updated weights for policy 0, policy_version 8670 (0.0006) [2023-03-07 14:17:29,647][213771] Updated weights for policy 0, policy_version 8680 (0.0006) [2023-03-07 14:17:30,414][213771] Updated weights for policy 0, policy_version 8690 (0.0007) [2023-03-07 14:17:31,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13260.8, 300 sec: 13235.6). Total num frames: 8906752. Throughput: 0: 13254.5. Samples: 8877649. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:17:31,106][213445] Avg episode reward: [(0, '4315.773')] [2023-03-07 14:17:31,183][213771] Updated weights for policy 0, policy_version 8700 (0.0006) [2023-03-07 14:17:31,964][213771] Updated weights for policy 0, policy_version 8710 (0.0007) [2023-03-07 14:17:32,726][213771] Updated weights for policy 0, policy_version 8720 (0.0006) [2023-03-07 14:17:33,508][213771] Updated weights for policy 0, policy_version 8730 (0.0006) [2023-03-07 14:17:34,305][213771] Updated weights for policy 0, policy_version 8740 (0.0006) [2023-03-07 14:17:35,074][213771] Updated weights for policy 0, policy_version 8750 (0.0006) [2023-03-07 14:17:35,837][213771] Updated weights for policy 0, policy_version 8760 (0.0006) [2023-03-07 14:17:36,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13239.1). Total num frames: 8973312. Throughput: 0: 13253.6. Samples: 8956906. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:17:36,106][213445] Avg episode reward: [(0, '4189.906')] [2023-03-07 14:17:36,605][213771] Updated weights for policy 0, policy_version 8770 (0.0006) [2023-03-07 14:17:37,377][213771] Updated weights for policy 0, policy_version 8780 (0.0005) [2023-03-07 14:17:38,150][213771] Updated weights for policy 0, policy_version 8790 (0.0006) [2023-03-07 14:17:38,930][213771] Updated weights for policy 0, policy_version 8800 (0.0006) [2023-03-07 14:17:39,699][213771] Updated weights for policy 0, policy_version 8810 (0.0006) [2023-03-07 14:17:40,473][213771] Updated weights for policy 0, policy_version 8820 (0.0007) [2023-03-07 14:17:41,105][213445] Fps is (10 sec: 13312.3, 60 sec: 13260.8, 300 sec: 13242.6). Total num frames: 9039872. Throughput: 0: 13249.9. Samples: 9036497. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:17:41,106][213445] Avg episode reward: [(0, '4171.983')] [2023-03-07 14:17:41,254][213771] Updated weights for policy 0, policy_version 8830 (0.0005) [2023-03-07 14:17:42,012][213771] Updated weights for policy 0, policy_version 8840 (0.0006) [2023-03-07 14:17:42,800][213771] Updated weights for policy 0, policy_version 8850 (0.0005) [2023-03-07 14:17:43,569][213771] Updated weights for policy 0, policy_version 8860 (0.0005) [2023-03-07 14:17:44,353][213771] Updated weights for policy 0, policy_version 8870 (0.0006) [2023-03-07 14:17:45,101][213771] Updated weights for policy 0, policy_version 8880 (0.0006) [2023-03-07 14:17:45,878][213771] Updated weights for policy 0, policy_version 8890 (0.0006) [2023-03-07 14:17:46,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13243.8, 300 sec: 13239.1). Total num frames: 9105408. Throughput: 0: 13247.8. Samples: 9076155. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:17:46,106][213445] Avg episode reward: [(0, '4357.619')] [2023-03-07 14:17:46,114][213720] Saving new best policy, reward=4357.619! [2023-03-07 14:17:46,659][213771] Updated weights for policy 0, policy_version 8900 (0.0007) [2023-03-07 14:17:47,427][213771] Updated weights for policy 0, policy_version 8910 (0.0005) [2023-03-07 14:17:48,197][213771] Updated weights for policy 0, policy_version 8920 (0.0006) [2023-03-07 14:17:48,953][213771] Updated weights for policy 0, policy_version 8930 (0.0006) [2023-03-07 14:17:49,749][213771] Updated weights for policy 0, policy_version 8940 (0.0006) [2023-03-07 14:17:50,523][213771] Updated weights for policy 0, policy_version 8950 (0.0005) [2023-03-07 14:17:51,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13260.8, 300 sec: 13239.1). Total num frames: 9171968. Throughput: 0: 13246.4. Samples: 9155802. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 14:17:51,106][213445] Avg episode reward: [(0, '4232.600')] [2023-03-07 14:17:51,282][213771] Updated weights for policy 0, policy_version 8960 (0.0006) [2023-03-07 14:17:52,051][213771] Updated weights for policy 0, policy_version 8970 (0.0006) [2023-03-07 14:17:52,850][213771] Updated weights for policy 0, policy_version 8980 (0.0006) [2023-03-07 14:17:53,615][213771] Updated weights for policy 0, policy_version 8990 (0.0007) [2023-03-07 14:17:54,387][213771] Updated weights for policy 0, policy_version 9000 (0.0006) [2023-03-07 14:17:55,175][213771] Updated weights for policy 0, policy_version 9010 (0.0006) [2023-03-07 14:17:55,935][213771] Updated weights for policy 0, policy_version 9020 (0.0005) [2023-03-07 14:17:56,105][213445] Fps is (10 sec: 13311.7, 60 sec: 13260.8, 300 sec: 13242.6). Total num frames: 9238528. Throughput: 0: 13247.4. Samples: 9235095. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 14:17:56,106][213445] Avg episode reward: [(0, '4338.191')] [2023-03-07 14:17:56,717][213771] Updated weights for policy 0, policy_version 9030 (0.0006) [2023-03-07 14:17:57,476][213771] Updated weights for policy 0, policy_version 9040 (0.0005) [2023-03-07 14:17:58,239][213771] Updated weights for policy 0, policy_version 9050 (0.0005) [2023-03-07 14:17:59,018][213771] Updated weights for policy 0, policy_version 9060 (0.0007) [2023-03-07 14:17:59,775][213771] Updated weights for policy 0, policy_version 9070 (0.0007) [2023-03-07 14:18:00,577][213771] Updated weights for policy 0, policy_version 9080 (0.0007) [2023-03-07 14:18:01,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13239.1). Total num frames: 9304064. Throughput: 0: 13250.4. Samples: 9274923. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:18:01,106][213445] Avg episode reward: [(0, '4364.054')] [2023-03-07 14:18:01,114][213720] Saving new best policy, reward=4364.054! [2023-03-07 14:18:01,349][213771] Updated weights for policy 0, policy_version 9090 (0.0006) [2023-03-07 14:18:02,126][213771] Updated weights for policy 0, policy_version 9100 (0.0007) [2023-03-07 14:18:02,893][213771] Updated weights for policy 0, policy_version 9110 (0.0006) [2023-03-07 14:18:03,657][213771] Updated weights for policy 0, policy_version 9120 (0.0006) [2023-03-07 14:18:04,437][213771] Updated weights for policy 0, policy_version 9130 (0.0006) [2023-03-07 14:18:05,221][213771] Updated weights for policy 0, policy_version 9140 (0.0006) [2023-03-07 14:18:05,976][213771] Updated weights for policy 0, policy_version 9150 (0.0006) [2023-03-07 14:18:06,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 9370624. Throughput: 0: 13249.7. Samples: 9354313. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:18:06,106][213445] Avg episode reward: [(0, '4374.453')] [2023-03-07 14:18:06,125][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000009152_9371648.pth... [2023-03-07 14:18:06,153][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000006048_6193152.pth [2023-03-07 14:18:06,156][213720] Saving new best policy, reward=4374.453! [2023-03-07 14:18:06,755][213771] Updated weights for policy 0, policy_version 9160 (0.0005) [2023-03-07 14:18:07,535][213771] Updated weights for policy 0, policy_version 9170 (0.0005) [2023-03-07 14:18:08,321][213771] Updated weights for policy 0, policy_version 9180 (0.0007) [2023-03-07 14:18:09,097][213771] Updated weights for policy 0, policy_version 9190 (0.0005) [2023-03-07 14:18:09,872][213771] Updated weights for policy 0, policy_version 9200 (0.0006) [2023-03-07 14:18:10,653][213771] Updated weights for policy 0, policy_version 9210 (0.0007) [2023-03-07 14:18:11,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13226.7, 300 sec: 13239.1). Total num frames: 9436160. Throughput: 0: 13235.9. Samples: 9433354. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:18:11,106][213445] Avg episode reward: [(0, '4300.809')] [2023-03-07 14:18:11,429][213771] Updated weights for policy 0, policy_version 9220 (0.0006) [2023-03-07 14:18:12,193][213771] Updated weights for policy 0, policy_version 9230 (0.0005) [2023-03-07 14:18:12,984][213771] Updated weights for policy 0, policy_version 9240 (0.0006) [2023-03-07 14:18:13,741][213771] Updated weights for policy 0, policy_version 9250 (0.0006) [2023-03-07 14:18:14,534][213771] Updated weights for policy 0, policy_version 9260 (0.0006) [2023-03-07 14:18:15,306][213771] Updated weights for policy 0, policy_version 9270 (0.0006) [2023-03-07 14:18:16,073][213771] Updated weights for policy 0, policy_version 9280 (0.0008) [2023-03-07 14:18:16,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13239.1). Total num frames: 9502720. Throughput: 0: 13233.8. Samples: 9473167. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:18:16,106][213445] Avg episode reward: [(0, '4154.034')] [2023-03-07 14:18:16,838][213771] Updated weights for policy 0, policy_version 9290 (0.0006) [2023-03-07 14:18:17,621][213771] Updated weights for policy 0, policy_version 9300 (0.0006) [2023-03-07 14:18:18,373][213771] Updated weights for policy 0, policy_version 9310 (0.0007) [2023-03-07 14:18:19,141][213771] Updated weights for policy 0, policy_version 9320 (0.0006) [2023-03-07 14:18:19,922][213771] Updated weights for policy 0, policy_version 9330 (0.0007) [2023-03-07 14:18:20,697][213771] Updated weights for policy 0, policy_version 9340 (0.0006) [2023-03-07 14:18:21,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 9569280. Throughput: 0: 13242.5. Samples: 9552819. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:18:21,106][213445] Avg episode reward: [(0, '4187.863')] [2023-03-07 14:18:21,452][213771] Updated weights for policy 0, policy_version 9350 (0.0006) [2023-03-07 14:18:22,223][213771] Updated weights for policy 0, policy_version 9360 (0.0006) [2023-03-07 14:18:23,022][213771] Updated weights for policy 0, policy_version 9370 (0.0006) [2023-03-07 14:18:23,790][213771] Updated weights for policy 0, policy_version 9380 (0.0006) [2023-03-07 14:18:24,564][213771] Updated weights for policy 0, policy_version 9390 (0.0006) [2023-03-07 14:18:25,339][213771] Updated weights for policy 0, policy_version 9400 (0.0007) [2023-03-07 14:18:26,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13239.1). Total num frames: 9634816. Throughput: 0: 13239.5. Samples: 9632274. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:18:26,106][213445] Avg episode reward: [(0, '4325.593')] [2023-03-07 14:18:26,108][213771] Updated weights for policy 0, policy_version 9410 (0.0006) [2023-03-07 14:18:26,880][213771] Updated weights for policy 0, policy_version 9420 (0.0006) [2023-03-07 14:18:27,641][213771] Updated weights for policy 0, policy_version 9430 (0.0006) [2023-03-07 14:18:28,423][213771] Updated weights for policy 0, policy_version 9440 (0.0006) [2023-03-07 14:18:29,211][213771] Updated weights for policy 0, policy_version 9450 (0.0006) [2023-03-07 14:18:29,973][213771] Updated weights for policy 0, policy_version 9460 (0.0006) [2023-03-07 14:18:30,738][213771] Updated weights for policy 0, policy_version 9470 (0.0007) [2023-03-07 14:18:31,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13243.7, 300 sec: 13239.1). Total num frames: 9701376. Throughput: 0: 13239.4. Samples: 9671929. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:18:31,106][213445] Avg episode reward: [(0, '4326.342')] [2023-03-07 14:18:31,506][213771] Updated weights for policy 0, policy_version 9480 (0.0006) [2023-03-07 14:18:32,281][213771] Updated weights for policy 0, policy_version 9490 (0.0005) [2023-03-07 14:18:33,055][213771] Updated weights for policy 0, policy_version 9500 (0.0006) [2023-03-07 14:18:33,824][213771] Updated weights for policy 0, policy_version 9510 (0.0006) [2023-03-07 14:18:34,598][213771] Updated weights for policy 0, policy_version 9520 (0.0006) [2023-03-07 14:18:35,389][213771] Updated weights for policy 0, policy_version 9530 (0.0006) [2023-03-07 14:18:36,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 9767936. Throughput: 0: 13240.3. Samples: 9751617. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:18:36,106][213445] Avg episode reward: [(0, '4224.188')] [2023-03-07 14:18:36,152][213771] Updated weights for policy 0, policy_version 9540 (0.0006) [2023-03-07 14:18:36,926][213771] Updated weights for policy 0, policy_version 9550 (0.0006) [2023-03-07 14:18:37,696][213771] Updated weights for policy 0, policy_version 9560 (0.0006) [2023-03-07 14:18:38,453][213771] Updated weights for policy 0, policy_version 9570 (0.0006) [2023-03-07 14:18:39,227][213771] Updated weights for policy 0, policy_version 9580 (0.0006) [2023-03-07 14:18:40,008][213771] Updated weights for policy 0, policy_version 9590 (0.0007) [2023-03-07 14:18:40,788][213771] Updated weights for policy 0, policy_version 9600 (0.0006) [2023-03-07 14:18:41,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 9834496. Throughput: 0: 13239.7. Samples: 9830879. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:18:41,106][213445] Avg episode reward: [(0, '4282.936')] [2023-03-07 14:18:41,566][213771] Updated weights for policy 0, policy_version 9610 (0.0007) [2023-03-07 14:18:42,341][213771] Updated weights for policy 0, policy_version 9620 (0.0006) [2023-03-07 14:18:43,116][213771] Updated weights for policy 0, policy_version 9630 (0.0007) [2023-03-07 14:18:43,899][213771] Updated weights for policy 0, policy_version 9640 (0.0006) [2023-03-07 14:18:44,667][213771] Updated weights for policy 0, policy_version 9650 (0.0007) [2023-03-07 14:18:45,446][213771] Updated weights for policy 0, policy_version 9660 (0.0007) [2023-03-07 14:18:46,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 9900032. Throughput: 0: 13238.1. Samples: 9870637. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:18:46,106][213445] Avg episode reward: [(0, '4270.548')] [2023-03-07 14:18:46,225][213771] Updated weights for policy 0, policy_version 9670 (0.0006) [2023-03-07 14:18:46,989][213771] Updated weights for policy 0, policy_version 9680 (0.0006) [2023-03-07 14:18:47,767][213771] Updated weights for policy 0, policy_version 9690 (0.0006) [2023-03-07 14:18:48,543][213771] Updated weights for policy 0, policy_version 9700 (0.0006) [2023-03-07 14:18:49,318][213771] Updated weights for policy 0, policy_version 9710 (0.0006) [2023-03-07 14:18:50,102][213771] Updated weights for policy 0, policy_version 9720 (0.0006) [2023-03-07 14:18:50,852][213771] Updated weights for policy 0, policy_version 9730 (0.0006) [2023-03-07 14:18:51,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 9966592. Throughput: 0: 13236.7. Samples: 9949966. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:18:51,106][213445] Avg episode reward: [(0, '4159.590')] [2023-03-07 14:18:51,627][213771] Updated weights for policy 0, policy_version 9740 (0.0005) [2023-03-07 14:18:52,398][213771] Updated weights for policy 0, policy_version 9750 (0.0006) [2023-03-07 14:18:53,161][213771] Updated weights for policy 0, policy_version 9760 (0.0006) [2023-03-07 14:18:53,942][213771] Updated weights for policy 0, policy_version 9770 (0.0006) [2023-03-07 14:18:54,732][213771] Updated weights for policy 0, policy_version 9780 (0.0006) [2023-03-07 14:18:55,506][213771] Updated weights for policy 0, policy_version 9790 (0.0006) [2023-03-07 14:18:56,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13239.1). Total num frames: 10032128. Throughput: 0: 13247.7. Samples: 10029498. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:18:56,106][213445] Avg episode reward: [(0, '4306.326')] [2023-03-07 14:18:56,278][213771] Updated weights for policy 0, policy_version 9800 (0.0005) [2023-03-07 14:18:57,037][213771] Updated weights for policy 0, policy_version 9810 (0.0006) [2023-03-07 14:18:57,786][213771] Updated weights for policy 0, policy_version 9820 (0.0006) [2023-03-07 14:18:58,570][213771] Updated weights for policy 0, policy_version 9830 (0.0006) [2023-03-07 14:18:59,337][213771] Updated weights for policy 0, policy_version 9840 (0.0006) [2023-03-07 14:19:00,105][213771] Updated weights for policy 0, policy_version 9850 (0.0007) [2023-03-07 14:19:00,875][213771] Updated weights for policy 0, policy_version 9860 (0.0006) [2023-03-07 14:19:01,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13260.8, 300 sec: 13242.6). Total num frames: 10099712. Throughput: 0: 13252.3. Samples: 10069519. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:19:01,105][213445] Avg episode reward: [(0, '4119.891')] [2023-03-07 14:19:01,668][213771] Updated weights for policy 0, policy_version 9870 (0.0005) [2023-03-07 14:19:02,436][213771] Updated weights for policy 0, policy_version 9880 (0.0006) [2023-03-07 14:19:03,198][213771] Updated weights for policy 0, policy_version 9890 (0.0007) [2023-03-07 14:19:03,961][213771] Updated weights for policy 0, policy_version 9900 (0.0006) [2023-03-07 14:19:04,731][213771] Updated weights for policy 0, policy_version 9910 (0.0006) [2023-03-07 14:19:05,495][213771] Updated weights for policy 0, policy_version 9920 (0.0006) [2023-03-07 14:19:06,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13243.7, 300 sec: 13239.1). Total num frames: 10165248. Throughput: 0: 13255.1. Samples: 10149300. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:19:06,106][213445] Avg episode reward: [(0, '4403.614')] [2023-03-07 14:19:06,111][213720] Saving new best policy, reward=4403.614! [2023-03-07 14:19:06,282][213771] Updated weights for policy 0, policy_version 9930 (0.0007) [2023-03-07 14:19:07,040][213771] Updated weights for policy 0, policy_version 9940 (0.0006) [2023-03-07 14:19:07,813][213771] Updated weights for policy 0, policy_version 9950 (0.0006) [2023-03-07 14:19:08,586][213771] Updated weights for policy 0, policy_version 9960 (0.0007) [2023-03-07 14:19:09,363][213771] Updated weights for policy 0, policy_version 9970 (0.0005) [2023-03-07 14:19:10,137][213771] Updated weights for policy 0, policy_version 9980 (0.0006) [2023-03-07 14:19:10,904][213771] Updated weights for policy 0, policy_version 9990 (0.0006) [2023-03-07 14:19:11,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13260.8, 300 sec: 13239.1). Total num frames: 10231808. Throughput: 0: 13257.4. Samples: 10228859. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:19:11,106][213445] Avg episode reward: [(0, '4418.489')] [2023-03-07 14:19:11,106][213720] Saving new best policy, reward=4418.489! [2023-03-07 14:19:11,673][213771] Updated weights for policy 0, policy_version 10000 (0.0005) [2023-03-07 14:19:12,457][213771] Updated weights for policy 0, policy_version 10010 (0.0007) [2023-03-07 14:19:13,224][213771] Updated weights for policy 0, policy_version 10020 (0.0006) [2023-03-07 14:19:14,002][213771] Updated weights for policy 0, policy_version 10030 (0.0007) [2023-03-07 14:19:14,793][213771] Updated weights for policy 0, policy_version 10040 (0.0006) [2023-03-07 14:19:15,557][213771] Updated weights for policy 0, policy_version 10050 (0.0006) [2023-03-07 14:19:16,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13239.1). Total num frames: 10297344. Throughput: 0: 13256.0. Samples: 10268449. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 14:19:16,116][213445] Avg episode reward: [(0, '4390.514')] [2023-03-07 14:19:16,325][213771] Updated weights for policy 0, policy_version 10060 (0.0006) [2023-03-07 14:19:17,097][213771] Updated weights for policy 0, policy_version 10070 (0.0006) [2023-03-07 14:19:17,880][213771] Updated weights for policy 0, policy_version 10080 (0.0006) [2023-03-07 14:19:18,646][213771] Updated weights for policy 0, policy_version 10090 (0.0006) [2023-03-07 14:19:19,438][213771] Updated weights for policy 0, policy_version 10100 (0.0006) [2023-03-07 14:19:20,222][213771] Updated weights for policy 0, policy_version 10110 (0.0006) [2023-03-07 14:19:21,009][213771] Updated weights for policy 0, policy_version 10120 (0.0006) [2023-03-07 14:19:21,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13239.1). Total num frames: 10363904. Throughput: 0: 13246.0. Samples: 10347687. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 14:19:21,116][213445] Avg episode reward: [(0, '4357.805')] [2023-03-07 14:19:21,773][213771] Updated weights for policy 0, policy_version 10130 (0.0006) [2023-03-07 14:19:22,548][213771] Updated weights for policy 0, policy_version 10140 (0.0007) [2023-03-07 14:19:23,334][213771] Updated weights for policy 0, policy_version 10150 (0.0006) [2023-03-07 14:19:24,101][213771] Updated weights for policy 0, policy_version 10160 (0.0006) [2023-03-07 14:19:24,870][213771] Updated weights for policy 0, policy_version 10170 (0.0006) [2023-03-07 14:19:25,663][213771] Updated weights for policy 0, policy_version 10180 (0.0006) [2023-03-07 14:19:26,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13235.6). Total num frames: 10429440. Throughput: 0: 13242.3. Samples: 10426784. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:19:26,116][213445] Avg episode reward: [(0, '4349.053')] [2023-03-07 14:19:26,437][213771] Updated weights for policy 0, policy_version 10190 (0.0007) [2023-03-07 14:19:27,195][213771] Updated weights for policy 0, policy_version 10200 (0.0007) [2023-03-07 14:19:27,958][213771] Updated weights for policy 0, policy_version 10210 (0.0006) [2023-03-07 14:19:28,722][213771] Updated weights for policy 0, policy_version 10220 (0.0006) [2023-03-07 14:19:29,521][213771] Updated weights for policy 0, policy_version 10230 (0.0005) [2023-03-07 14:19:30,262][213771] Updated weights for policy 0, policy_version 10240 (0.0006) [2023-03-07 14:19:31,040][213771] Updated weights for policy 0, policy_version 10250 (0.0006) [2023-03-07 14:19:31,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13242.6). Total num frames: 10497024. Throughput: 0: 13247.4. Samples: 10466769. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:19:31,106][213445] Avg episode reward: [(0, '4370.500')] [2023-03-07 14:19:31,801][213771] Updated weights for policy 0, policy_version 10260 (0.0006) [2023-03-07 14:19:32,575][213771] Updated weights for policy 0, policy_version 10270 (0.0006) [2023-03-07 14:19:33,352][213771] Updated weights for policy 0, policy_version 10280 (0.0006) [2023-03-07 14:19:34,118][213771] Updated weights for policy 0, policy_version 10290 (0.0005) [2023-03-07 14:19:34,892][213771] Updated weights for policy 0, policy_version 10300 (0.0006) [2023-03-07 14:19:35,647][213771] Updated weights for policy 0, policy_version 10310 (0.0006) [2023-03-07 14:19:36,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13243.7, 300 sec: 13239.1). Total num frames: 10562560. Throughput: 0: 13256.7. Samples: 10546515. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:19:36,106][213445] Avg episode reward: [(0, '4257.408')] [2023-03-07 14:19:36,428][213771] Updated weights for policy 0, policy_version 10320 (0.0006) [2023-03-07 14:19:37,206][213771] Updated weights for policy 0, policy_version 10330 (0.0007) [2023-03-07 14:19:37,981][213771] Updated weights for policy 0, policy_version 10340 (0.0007) [2023-03-07 14:19:38,764][213771] Updated weights for policy 0, policy_version 10350 (0.0006) [2023-03-07 14:19:39,521][213771] Updated weights for policy 0, policy_version 10360 (0.0007) [2023-03-07 14:19:40,289][213771] Updated weights for policy 0, policy_version 10370 (0.0006) [2023-03-07 14:19:41,072][213771] Updated weights for policy 0, policy_version 10380 (0.0006) [2023-03-07 14:19:41,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 10629120. Throughput: 0: 13255.6. Samples: 10626004. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:19:41,106][213445] Avg episode reward: [(0, '4128.853')] [2023-03-07 14:19:41,840][213771] Updated weights for policy 0, policy_version 10390 (0.0006) [2023-03-07 14:19:42,613][213771] Updated weights for policy 0, policy_version 10400 (0.0006) [2023-03-07 14:19:43,365][213771] Updated weights for policy 0, policy_version 10410 (0.0006) [2023-03-07 14:19:44,146][213771] Updated weights for policy 0, policy_version 10420 (0.0005) [2023-03-07 14:19:44,914][213771] Updated weights for policy 0, policy_version 10430 (0.0007) [2023-03-07 14:19:45,683][213771] Updated weights for policy 0, policy_version 10440 (0.0006) [2023-03-07 14:19:46,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13242.6). Total num frames: 10695680. Throughput: 0: 13254.3. Samples: 10665961. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:19:46,106][213445] Avg episode reward: [(0, '4269.302')] [2023-03-07 14:19:46,450][213771] Updated weights for policy 0, policy_version 10450 (0.0005) [2023-03-07 14:19:47,215][213771] Updated weights for policy 0, policy_version 10460 (0.0007) [2023-03-07 14:19:48,005][213771] Updated weights for policy 0, policy_version 10470 (0.0006) [2023-03-07 14:19:48,788][213771] Updated weights for policy 0, policy_version 10480 (0.0005) [2023-03-07 14:19:49,557][213771] Updated weights for policy 0, policy_version 10490 (0.0006) [2023-03-07 14:19:50,335][213771] Updated weights for policy 0, policy_version 10500 (0.0006) [2023-03-07 14:19:51,100][213771] Updated weights for policy 0, policy_version 10510 (0.0006) [2023-03-07 14:19:51,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13246.1). Total num frames: 10762240. Throughput: 0: 13249.5. Samples: 10745529. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:19:51,106][213445] Avg episode reward: [(0, '4285.957')] [2023-03-07 14:19:51,885][213771] Updated weights for policy 0, policy_version 10520 (0.0006) [2023-03-07 14:19:52,657][213771] Updated weights for policy 0, policy_version 10530 (0.0006) [2023-03-07 14:19:53,439][213771] Updated weights for policy 0, policy_version 10540 (0.0006) [2023-03-07 14:19:54,198][213771] Updated weights for policy 0, policy_version 10550 (0.0005) [2023-03-07 14:19:54,974][213771] Updated weights for policy 0, policy_version 10560 (0.0006) [2023-03-07 14:19:55,738][213771] Updated weights for policy 0, policy_version 10570 (0.0006) [2023-03-07 14:19:56,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13246.1). Total num frames: 10827776. Throughput: 0: 13247.5. Samples: 10824994. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:19:56,106][213445] Avg episode reward: [(0, '4272.509')] [2023-03-07 14:19:56,510][213771] Updated weights for policy 0, policy_version 10580 (0.0006) [2023-03-07 14:19:57,287][213771] Updated weights for policy 0, policy_version 10590 (0.0006) [2023-03-07 14:19:58,037][213771] Updated weights for policy 0, policy_version 10600 (0.0006) [2023-03-07 14:19:58,816][213771] Updated weights for policy 0, policy_version 10610 (0.0007) [2023-03-07 14:19:59,627][213771] Updated weights for policy 0, policy_version 10620 (0.0006) [2023-03-07 14:20:00,408][213771] Updated weights for policy 0, policy_version 10630 (0.0006) [2023-03-07 14:20:01,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 10894336. Throughput: 0: 13251.7. Samples: 10864773. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:20:01,105][213445] Avg episode reward: [(0, '4252.530')] [2023-03-07 14:20:01,177][213771] Updated weights for policy 0, policy_version 10640 (0.0005) [2023-03-07 14:20:01,934][213771] Updated weights for policy 0, policy_version 10650 (0.0006) [2023-03-07 14:20:02,708][213771] Updated weights for policy 0, policy_version 10660 (0.0005) [2023-03-07 14:20:03,478][213771] Updated weights for policy 0, policy_version 10670 (0.0006) [2023-03-07 14:20:04,262][213771] Updated weights for policy 0, policy_version 10680 (0.0006) [2023-03-07 14:20:05,024][213771] Updated weights for policy 0, policy_version 10690 (0.0005) [2023-03-07 14:20:05,817][213771] Updated weights for policy 0, policy_version 10700 (0.0006) [2023-03-07 14:20:06,105][213445] Fps is (10 sec: 13209.2, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 10959872. Throughput: 0: 13248.2. Samples: 10943858. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:20:06,106][213445] Avg episode reward: [(0, '4297.952')] [2023-03-07 14:20:06,111][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000010703_10959872.pth... [2023-03-07 14:20:06,142][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000007599_7781376.pth [2023-03-07 14:20:06,581][213771] Updated weights for policy 0, policy_version 10710 (0.0006) [2023-03-07 14:20:07,362][213771] Updated weights for policy 0, policy_version 10720 (0.0006) [2023-03-07 14:20:08,146][213771] Updated weights for policy 0, policy_version 10730 (0.0006) [2023-03-07 14:20:08,919][213771] Updated weights for policy 0, policy_version 10740 (0.0005) [2023-03-07 14:20:09,693][213771] Updated weights for policy 0, policy_version 10750 (0.0006) [2023-03-07 14:20:10,457][213771] Updated weights for policy 0, policy_version 10760 (0.0006) [2023-03-07 14:20:11,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13246.1). Total num frames: 11026432. Throughput: 0: 13252.5. Samples: 11023148. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:20:11,106][213445] Avg episode reward: [(0, '4337.841')] [2023-03-07 14:20:11,245][213771] Updated weights for policy 0, policy_version 10770 (0.0006) [2023-03-07 14:20:12,029][213771] Updated weights for policy 0, policy_version 10780 (0.0007) [2023-03-07 14:20:12,793][213771] Updated weights for policy 0, policy_version 10790 (0.0006) [2023-03-07 14:20:13,578][213771] Updated weights for policy 0, policy_version 10800 (0.0007) [2023-03-07 14:20:14,342][213771] Updated weights for policy 0, policy_version 10810 (0.0006) [2023-03-07 14:20:15,128][213771] Updated weights for policy 0, policy_version 10820 (0.0007) [2023-03-07 14:20:15,894][213771] Updated weights for policy 0, policy_version 10830 (0.0006) [2023-03-07 14:20:16,105][213445] Fps is (10 sec: 13209.9, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 11091968. Throughput: 0: 13242.5. Samples: 11062684. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:20:16,106][213445] Avg episode reward: [(0, '4344.483')] [2023-03-07 14:20:16,662][213771] Updated weights for policy 0, policy_version 10840 (0.0006) [2023-03-07 14:20:17,441][213771] Updated weights for policy 0, policy_version 10850 (0.0006) [2023-03-07 14:20:18,217][213771] Updated weights for policy 0, policy_version 10860 (0.0006) [2023-03-07 14:20:18,996][213771] Updated weights for policy 0, policy_version 10870 (0.0006) [2023-03-07 14:20:19,754][213771] Updated weights for policy 0, policy_version 10880 (0.0007) [2023-03-07 14:20:20,557][213771] Updated weights for policy 0, policy_version 10890 (0.0005) [2023-03-07 14:20:21,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 11158528. Throughput: 0: 13233.3. Samples: 11142014. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:20:21,106][213445] Avg episode reward: [(0, '4294.947')] [2023-03-07 14:20:21,329][213771] Updated weights for policy 0, policy_version 10900 (0.0006) [2023-03-07 14:20:22,105][213771] Updated weights for policy 0, policy_version 10910 (0.0006) [2023-03-07 14:20:22,883][213771] Updated weights for policy 0, policy_version 10920 (0.0007) [2023-03-07 14:20:23,648][213771] Updated weights for policy 0, policy_version 10930 (0.0006) [2023-03-07 14:20:24,429][213771] Updated weights for policy 0, policy_version 10940 (0.0006) [2023-03-07 14:20:25,202][213771] Updated weights for policy 0, policy_version 10950 (0.0006) [2023-03-07 14:20:25,974][213771] Updated weights for policy 0, policy_version 10960 (0.0007) [2023-03-07 14:20:26,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.8, 300 sec: 13242.6). Total num frames: 11224064. Throughput: 0: 13228.0. Samples: 11221263. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:20:26,106][213445] Avg episode reward: [(0, '4275.715')] [2023-03-07 14:20:26,743][213771] Updated weights for policy 0, policy_version 10970 (0.0006) [2023-03-07 14:20:27,509][213771] Updated weights for policy 0, policy_version 10980 (0.0006) [2023-03-07 14:20:28,284][213771] Updated weights for policy 0, policy_version 10990 (0.0006) [2023-03-07 14:20:29,047][213771] Updated weights for policy 0, policy_version 11000 (0.0006) [2023-03-07 14:20:29,806][213771] Updated weights for policy 0, policy_version 11010 (0.0005) [2023-03-07 14:20:30,594][213771] Updated weights for policy 0, policy_version 11020 (0.0006) [2023-03-07 14:20:31,105][213445] Fps is (10 sec: 13209.9, 60 sec: 13226.7, 300 sec: 13246.1). Total num frames: 11290624. Throughput: 0: 13227.0. Samples: 11261174. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:20:31,105][213445] Avg episode reward: [(0, '4295.979')] [2023-03-07 14:20:31,370][213771] Updated weights for policy 0, policy_version 11030 (0.0006) [2023-03-07 14:20:32,146][213771] Updated weights for policy 0, policy_version 11040 (0.0005) [2023-03-07 14:20:32,929][213771] Updated weights for policy 0, policy_version 11050 (0.0006) [2023-03-07 14:20:33,690][213771] Updated weights for policy 0, policy_version 11060 (0.0006) [2023-03-07 14:20:34,478][213771] Updated weights for policy 0, policy_version 11070 (0.0006) [2023-03-07 14:20:35,253][213771] Updated weights for policy 0, policy_version 11080 (0.0007) [2023-03-07 14:20:36,022][213771] Updated weights for policy 0, policy_version 11090 (0.0006) [2023-03-07 14:20:36,105][213445] Fps is (10 sec: 13311.7, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 11357184. Throughput: 0: 13223.6. Samples: 11340590. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:20:36,106][213445] Avg episode reward: [(0, '4257.235')] [2023-03-07 14:20:36,794][213771] Updated weights for policy 0, policy_version 11100 (0.0006) [2023-03-07 14:20:37,558][213771] Updated weights for policy 0, policy_version 11110 (0.0008) [2023-03-07 14:20:38,339][213771] Updated weights for policy 0, policy_version 11120 (0.0006) [2023-03-07 14:20:39,106][213771] Updated weights for policy 0, policy_version 11130 (0.0006) [2023-03-07 14:20:39,880][213771] Updated weights for policy 0, policy_version 11140 (0.0006) [2023-03-07 14:20:40,653][213771] Updated weights for policy 0, policy_version 11150 (0.0006) [2023-03-07 14:20:41,105][213445] Fps is (10 sec: 13209.3, 60 sec: 13226.7, 300 sec: 13246.0). Total num frames: 11422720. Throughput: 0: 13220.5. Samples: 11419917. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:20:41,106][213445] Avg episode reward: [(0, '4199.863')] [2023-03-07 14:20:41,421][213771] Updated weights for policy 0, policy_version 11160 (0.0006) [2023-03-07 14:20:42,192][213771] Updated weights for policy 0, policy_version 11170 (0.0007) [2023-03-07 14:20:42,958][213771] Updated weights for policy 0, policy_version 11180 (0.0006) [2023-03-07 14:20:43,718][213771] Updated weights for policy 0, policy_version 11190 (0.0006) [2023-03-07 14:20:44,478][213771] Updated weights for policy 0, policy_version 11200 (0.0006) [2023-03-07 14:20:45,241][213771] Updated weights for policy 0, policy_version 11210 (0.0007) [2023-03-07 14:20:46,026][213771] Updated weights for policy 0, policy_version 11220 (0.0006) [2023-03-07 14:20:46,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 11490304. Throughput: 0: 13226.4. Samples: 11459964. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:20:46,106][213445] Avg episode reward: [(0, '4173.789')] [2023-03-07 14:20:46,790][213771] Updated weights for policy 0, policy_version 11230 (0.0007) [2023-03-07 14:20:47,570][213771] Updated weights for policy 0, policy_version 11240 (0.0006) [2023-03-07 14:20:48,343][213771] Updated weights for policy 0, policy_version 11250 (0.0007) [2023-03-07 14:20:49,104][213771] Updated weights for policy 0, policy_version 11260 (0.0006) [2023-03-07 14:20:49,877][213771] Updated weights for policy 0, policy_version 11270 (0.0006) [2023-03-07 14:20:50,651][213771] Updated weights for policy 0, policy_version 11280 (0.0006) [2023-03-07 14:20:51,105][213445] Fps is (10 sec: 13312.3, 60 sec: 13226.7, 300 sec: 13246.0). Total num frames: 11555840. Throughput: 0: 13242.2. Samples: 11539753. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:20:51,106][213445] Avg episode reward: [(0, '4270.624')] [2023-03-07 14:20:51,419][213771] Updated weights for policy 0, policy_version 11290 (0.0006) [2023-03-07 14:20:52,184][213771] Updated weights for policy 0, policy_version 11300 (0.0008) [2023-03-07 14:20:52,955][213771] Updated weights for policy 0, policy_version 11310 (0.0007) [2023-03-07 14:20:53,722][213771] Updated weights for policy 0, policy_version 11320 (0.0006) [2023-03-07 14:20:54,499][213771] Updated weights for policy 0, policy_version 11330 (0.0006) [2023-03-07 14:20:55,265][213771] Updated weights for policy 0, policy_version 11340 (0.0006) [2023-03-07 14:20:56,033][213771] Updated weights for policy 0, policy_version 11350 (0.0006) [2023-03-07 14:20:56,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 11622400. Throughput: 0: 13257.3. Samples: 11619725. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:20:56,106][213445] Avg episode reward: [(0, '4282.403')] [2023-03-07 14:20:56,794][213771] Updated weights for policy 0, policy_version 11360 (0.0007) [2023-03-07 14:20:57,570][213771] Updated weights for policy 0, policy_version 11370 (0.0006) [2023-03-07 14:20:58,345][213771] Updated weights for policy 0, policy_version 11380 (0.0006) [2023-03-07 14:20:59,114][213771] Updated weights for policy 0, policy_version 11390 (0.0006) [2023-03-07 14:20:59,903][213771] Updated weights for policy 0, policy_version 11400 (0.0006) [2023-03-07 14:21:00,654][213771] Updated weights for policy 0, policy_version 11410 (0.0006) [2023-03-07 14:21:01,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 11688960. Throughput: 0: 13267.3. Samples: 11659712. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:21:01,105][213445] Avg episode reward: [(0, '4079.285')] [2023-03-07 14:21:01,425][213771] Updated weights for policy 0, policy_version 11420 (0.0006) [2023-03-07 14:21:02,189][213771] Updated weights for policy 0, policy_version 11430 (0.0006) [2023-03-07 14:21:02,955][213771] Updated weights for policy 0, policy_version 11440 (0.0006) [2023-03-07 14:21:03,730][213771] Updated weights for policy 0, policy_version 11450 (0.0006) [2023-03-07 14:21:04,519][213771] Updated weights for policy 0, policy_version 11460 (0.0007) [2023-03-07 14:21:05,282][213771] Updated weights for policy 0, policy_version 11470 (0.0006) [2023-03-07 14:21:06,047][213771] Updated weights for policy 0, policy_version 11480 (0.0006) [2023-03-07 14:21:06,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13260.9, 300 sec: 13249.5). Total num frames: 11755520. Throughput: 0: 13273.6. Samples: 11739326. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:21:06,106][213445] Avg episode reward: [(0, '4153.104')] [2023-03-07 14:21:06,829][213771] Updated weights for policy 0, policy_version 11490 (0.0007) [2023-03-07 14:21:07,569][213771] Updated weights for policy 0, policy_version 11500 (0.0006) [2023-03-07 14:21:08,361][213771] Updated weights for policy 0, policy_version 11510 (0.0006) [2023-03-07 14:21:09,120][213771] Updated weights for policy 0, policy_version 11520 (0.0007) [2023-03-07 14:21:09,901][213771] Updated weights for policy 0, policy_version 11530 (0.0006) [2023-03-07 14:21:10,679][213771] Updated weights for policy 0, policy_version 11540 (0.0006) [2023-03-07 14:21:11,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 11822080. Throughput: 0: 13282.2. Samples: 11818963. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:21:11,106][213445] Avg episode reward: [(0, '4157.126')] [2023-03-07 14:21:11,460][213771] Updated weights for policy 0, policy_version 11550 (0.0007) [2023-03-07 14:21:12,228][213771] Updated weights for policy 0, policy_version 11560 (0.0007) [2023-03-07 14:21:12,998][213771] Updated weights for policy 0, policy_version 11570 (0.0006) [2023-03-07 14:21:13,779][213771] Updated weights for policy 0, policy_version 11580 (0.0007) [2023-03-07 14:21:14,564][213771] Updated weights for policy 0, policy_version 11590 (0.0007) [2023-03-07 14:21:15,337][213771] Updated weights for policy 0, policy_version 11600 (0.0006) [2023-03-07 14:21:16,096][213771] Updated weights for policy 0, policy_version 11610 (0.0006) [2023-03-07 14:21:16,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13277.9, 300 sec: 13253.0). Total num frames: 11888640. Throughput: 0: 13275.3. Samples: 11858563. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:21:16,106][213445] Avg episode reward: [(0, '4220.715')] [2023-03-07 14:21:16,862][213771] Updated weights for policy 0, policy_version 11620 (0.0006) [2023-03-07 14:21:17,641][213771] Updated weights for policy 0, policy_version 11630 (0.0006) [2023-03-07 14:21:18,422][213771] Updated weights for policy 0, policy_version 11640 (0.0006) [2023-03-07 14:21:19,189][213771] Updated weights for policy 0, policy_version 11650 (0.0007) [2023-03-07 14:21:19,983][213771] Updated weights for policy 0, policy_version 11660 (0.0005) [2023-03-07 14:21:20,746][213771] Updated weights for policy 0, policy_version 11670 (0.0006) [2023-03-07 14:21:21,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 11954176. Throughput: 0: 13275.5. Samples: 11937983. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:21:21,105][213445] Avg episode reward: [(0, '4317.464')] [2023-03-07 14:21:21,539][213771] Updated weights for policy 0, policy_version 11680 (0.0006) [2023-03-07 14:21:22,309][213771] Updated weights for policy 0, policy_version 11690 (0.0006) [2023-03-07 14:21:23,071][213771] Updated weights for policy 0, policy_version 11700 (0.0005) [2023-03-07 14:21:23,865][213771] Updated weights for policy 0, policy_version 11710 (0.0006) [2023-03-07 14:21:24,627][213771] Updated weights for policy 0, policy_version 11720 (0.0006) [2023-03-07 14:21:25,389][213771] Updated weights for policy 0, policy_version 11730 (0.0006) [2023-03-07 14:21:26,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13277.8, 300 sec: 13253.0). Total num frames: 12020736. Throughput: 0: 13280.0. Samples: 12017515. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:21:26,106][213445] Avg episode reward: [(0, '4298.168')] [2023-03-07 14:21:26,159][213771] Updated weights for policy 0, policy_version 11740 (0.0006) [2023-03-07 14:21:26,921][213771] Updated weights for policy 0, policy_version 11750 (0.0007) [2023-03-07 14:21:27,701][213771] Updated weights for policy 0, policy_version 11760 (0.0006) [2023-03-07 14:21:28,471][213771] Updated weights for policy 0, policy_version 11770 (0.0006) [2023-03-07 14:21:29,250][213771] Updated weights for policy 0, policy_version 11780 (0.0007) [2023-03-07 14:21:30,025][213771] Updated weights for policy 0, policy_version 11790 (0.0006) [2023-03-07 14:21:30,790][213771] Updated weights for policy 0, policy_version 11800 (0.0007) [2023-03-07 14:21:31,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13277.8, 300 sec: 13253.0). Total num frames: 12087296. Throughput: 0: 13271.4. Samples: 12057176. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:21:31,106][213445] Avg episode reward: [(0, '4223.897')] [2023-03-07 14:21:31,549][213771] Updated weights for policy 0, policy_version 11810 (0.0006) [2023-03-07 14:21:32,330][213771] Updated weights for policy 0, policy_version 11820 (0.0006) [2023-03-07 14:21:33,106][213771] Updated weights for policy 0, policy_version 11830 (0.0005) [2023-03-07 14:21:33,894][213771] Updated weights for policy 0, policy_version 11840 (0.0006) [2023-03-07 14:21:34,667][213771] Updated weights for policy 0, policy_version 11850 (0.0007) [2023-03-07 14:21:35,426][213771] Updated weights for policy 0, policy_version 11860 (0.0006) [2023-03-07 14:21:36,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 12152832. Throughput: 0: 13267.5. Samples: 12136789. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:21:36,105][213445] Avg episode reward: [(0, '4213.185')] [2023-03-07 14:21:36,215][213771] Updated weights for policy 0, policy_version 11870 (0.0007) [2023-03-07 14:21:36,985][213771] Updated weights for policy 0, policy_version 11880 (0.0006) [2023-03-07 14:21:37,754][213771] Updated weights for policy 0, policy_version 11890 (0.0006) [2023-03-07 14:21:38,549][213771] Updated weights for policy 0, policy_version 11900 (0.0007) [2023-03-07 14:21:39,312][213771] Updated weights for policy 0, policy_version 11910 (0.0006) [2023-03-07 14:21:40,070][213771] Updated weights for policy 0, policy_version 11920 (0.0005) [2023-03-07 14:21:40,849][213771] Updated weights for policy 0, policy_version 11930 (0.0006) [2023-03-07 14:21:41,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13277.9, 300 sec: 13249.5). Total num frames: 12219392. Throughput: 0: 13254.3. Samples: 12216166. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:21:41,116][213445] Avg episode reward: [(0, '4195.065')] [2023-03-07 14:21:41,620][213771] Updated weights for policy 0, policy_version 11940 (0.0006) [2023-03-07 14:21:42,394][213771] Updated weights for policy 0, policy_version 11950 (0.0006) [2023-03-07 14:21:43,179][213771] Updated weights for policy 0, policy_version 11960 (0.0007) [2023-03-07 14:21:43,962][213771] Updated weights for policy 0, policy_version 11970 (0.0007) [2023-03-07 14:21:44,739][213771] Updated weights for policy 0, policy_version 11980 (0.0007) [2023-03-07 14:21:45,527][213771] Updated weights for policy 0, policy_version 11990 (0.0006) [2023-03-07 14:21:46,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 12284928. Throughput: 0: 13240.5. Samples: 12255538. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:21:46,106][213445] Avg episode reward: [(0, '4249.646')] [2023-03-07 14:21:46,308][213771] Updated weights for policy 0, policy_version 12000 (0.0006) [2023-03-07 14:21:47,081][213771] Updated weights for policy 0, policy_version 12010 (0.0006) [2023-03-07 14:21:47,853][213771] Updated weights for policy 0, policy_version 12020 (0.0007) [2023-03-07 14:21:48,623][213771] Updated weights for policy 0, policy_version 12030 (0.0006) [2023-03-07 14:21:49,380][213771] Updated weights for policy 0, policy_version 12040 (0.0007) [2023-03-07 14:21:50,181][213771] Updated weights for policy 0, policy_version 12050 (0.0006) [2023-03-07 14:21:50,932][213771] Updated weights for policy 0, policy_version 12060 (0.0006) [2023-03-07 14:21:51,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 12351488. Throughput: 0: 13229.9. Samples: 12334675. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:21:51,106][213445] Avg episode reward: [(0, '4302.718')] [2023-03-07 14:21:51,725][213771] Updated weights for policy 0, policy_version 12070 (0.0007) [2023-03-07 14:21:52,486][213771] Updated weights for policy 0, policy_version 12080 (0.0005) [2023-03-07 14:21:53,267][213771] Updated weights for policy 0, policy_version 12090 (0.0007) [2023-03-07 14:21:54,018][213771] Updated weights for policy 0, policy_version 12100 (0.0007) [2023-03-07 14:21:54,797][213771] Updated weights for policy 0, policy_version 12110 (0.0006) [2023-03-07 14:21:55,571][213771] Updated weights for policy 0, policy_version 12120 (0.0007) [2023-03-07 14:21:56,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 12417024. Throughput: 0: 13231.5. Samples: 12414379. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:21:56,106][213445] Avg episode reward: [(0, '4302.368')] [2023-03-07 14:21:56,357][213771] Updated weights for policy 0, policy_version 12130 (0.0005) [2023-03-07 14:21:57,141][213771] Updated weights for policy 0, policy_version 12140 (0.0006) [2023-03-07 14:21:57,912][213771] Updated weights for policy 0, policy_version 12150 (0.0006) [2023-03-07 14:21:58,684][213771] Updated weights for policy 0, policy_version 12160 (0.0007) [2023-03-07 14:21:59,445][213771] Updated weights for policy 0, policy_version 12170 (0.0006) [2023-03-07 14:22:00,221][213771] Updated weights for policy 0, policy_version 12180 (0.0007) [2023-03-07 14:22:01,005][213771] Updated weights for policy 0, policy_version 12190 (0.0008) [2023-03-07 14:22:01,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 12483584. Throughput: 0: 13231.6. Samples: 12453983. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:22:01,116][213445] Avg episode reward: [(0, '4361.533')] [2023-03-07 14:22:01,781][213771] Updated weights for policy 0, policy_version 12200 (0.0006) [2023-03-07 14:22:02,563][213771] Updated weights for policy 0, policy_version 12210 (0.0006) [2023-03-07 14:22:03,337][213771] Updated weights for policy 0, policy_version 12220 (0.0006) [2023-03-07 14:22:04,111][213771] Updated weights for policy 0, policy_version 12230 (0.0007) [2023-03-07 14:22:04,881][213771] Updated weights for policy 0, policy_version 12240 (0.0006) [2023-03-07 14:22:05,642][213771] Updated weights for policy 0, policy_version 12250 (0.0005) [2023-03-07 14:22:06,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13226.6, 300 sec: 13242.6). Total num frames: 12549120. Throughput: 0: 13227.9. Samples: 12533244. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:22:06,116][213445] Avg episode reward: [(0, '4189.680')] [2023-03-07 14:22:06,122][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000012256_12550144.pth... [2023-03-07 14:22:06,150][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000009152_9371648.pth [2023-03-07 14:22:06,428][213771] Updated weights for policy 0, policy_version 12260 (0.0006) [2023-03-07 14:22:07,214][213771] Updated weights for policy 0, policy_version 12270 (0.0006) [2023-03-07 14:22:07,974][213771] Updated weights for policy 0, policy_version 12280 (0.0006) [2023-03-07 14:22:08,748][213771] Updated weights for policy 0, policy_version 12290 (0.0006) [2023-03-07 14:22:09,511][213771] Updated weights for policy 0, policy_version 12300 (0.0006) [2023-03-07 14:22:10,289][213771] Updated weights for policy 0, policy_version 12310 (0.0006) [2023-03-07 14:22:11,066][213771] Updated weights for policy 0, policy_version 12320 (0.0006) [2023-03-07 14:22:11,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13246.1). Total num frames: 12615680. Throughput: 0: 13226.4. Samples: 12612701. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:22:11,116][213445] Avg episode reward: [(0, '4000.892')] [2023-03-07 14:22:11,829][213771] Updated weights for policy 0, policy_version 12330 (0.0006) [2023-03-07 14:22:12,615][213771] Updated weights for policy 0, policy_version 12340 (0.0006) [2023-03-07 14:22:13,385][213771] Updated weights for policy 0, policy_version 12350 (0.0006) [2023-03-07 14:22:14,159][213771] Updated weights for policy 0, policy_version 12360 (0.0006) [2023-03-07 14:22:14,929][213771] Updated weights for policy 0, policy_version 12370 (0.0006) [2023-03-07 14:22:15,700][213771] Updated weights for policy 0, policy_version 12380 (0.0006) [2023-03-07 14:22:16,105][213445] Fps is (10 sec: 13312.3, 60 sec: 13226.7, 300 sec: 13246.0). Total num frames: 12682240. Throughput: 0: 13225.6. Samples: 12652326. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:22:16,116][213445] Avg episode reward: [(0, '4102.871')] [2023-03-07 14:22:16,480][213771] Updated weights for policy 0, policy_version 12390 (0.0007) [2023-03-07 14:22:17,247][213771] Updated weights for policy 0, policy_version 12400 (0.0007) [2023-03-07 14:22:18,031][213771] Updated weights for policy 0, policy_version 12410 (0.0007) [2023-03-07 14:22:18,794][213771] Updated weights for policy 0, policy_version 12420 (0.0006) [2023-03-07 14:22:19,559][213771] Updated weights for policy 0, policy_version 12430 (0.0006) [2023-03-07 14:22:20,346][213771] Updated weights for policy 0, policy_version 12440 (0.0006) [2023-03-07 14:22:21,099][213771] Updated weights for policy 0, policy_version 12450 (0.0008) [2023-03-07 14:22:21,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 12748800. Throughput: 0: 13226.0. Samples: 12731962. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:22:21,116][213445] Avg episode reward: [(0, '4140.461')] [2023-03-07 14:22:21,881][213771] Updated weights for policy 0, policy_version 12460 (0.0006) [2023-03-07 14:22:22,653][213771] Updated weights for policy 0, policy_version 12470 (0.0006) [2023-03-07 14:22:23,433][213771] Updated weights for policy 0, policy_version 12480 (0.0006) [2023-03-07 14:22:24,194][213771] Updated weights for policy 0, policy_version 12490 (0.0006) [2023-03-07 14:22:24,959][213771] Updated weights for policy 0, policy_version 12500 (0.0006) [2023-03-07 14:22:25,730][213771] Updated weights for policy 0, policy_version 12510 (0.0006) [2023-03-07 14:22:26,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13226.7, 300 sec: 13246.1). Total num frames: 12814336. Throughput: 0: 13231.2. Samples: 12811571. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:22:26,116][213445] Avg episode reward: [(0, '4181.161')] [2023-03-07 14:22:26,513][213771] Updated weights for policy 0, policy_version 12520 (0.0006) [2023-03-07 14:22:27,283][213771] Updated weights for policy 0, policy_version 12530 (0.0006) [2023-03-07 14:22:28,058][213771] Updated weights for policy 0, policy_version 12540 (0.0006) [2023-03-07 14:22:28,820][213771] Updated weights for policy 0, policy_version 12550 (0.0007) [2023-03-07 14:22:29,593][213771] Updated weights for policy 0, policy_version 12560 (0.0005) [2023-03-07 14:22:30,365][213771] Updated weights for policy 0, policy_version 12570 (0.0006) [2023-03-07 14:22:31,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.6, 300 sec: 13246.0). Total num frames: 12880896. Throughput: 0: 13238.7. Samples: 12851282. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:22:31,116][213445] Avg episode reward: [(0, '4218.101')] [2023-03-07 14:22:31,128][213771] Updated weights for policy 0, policy_version 12580 (0.0007) [2023-03-07 14:22:31,903][213771] Updated weights for policy 0, policy_version 12590 (0.0006) [2023-03-07 14:22:32,682][213771] Updated weights for policy 0, policy_version 12600 (0.0006) [2023-03-07 14:22:33,450][213771] Updated weights for policy 0, policy_version 12610 (0.0007) [2023-03-07 14:22:34,224][213771] Updated weights for policy 0, policy_version 12620 (0.0007) [2023-03-07 14:22:35,003][213771] Updated weights for policy 0, policy_version 12630 (0.0006) [2023-03-07 14:22:35,778][213771] Updated weights for policy 0, policy_version 12640 (0.0006) [2023-03-07 14:22:36,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 12947456. Throughput: 0: 13252.7. Samples: 12931045. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:22:36,116][213445] Avg episode reward: [(0, '4267.434')] [2023-03-07 14:22:36,532][213771] Updated weights for policy 0, policy_version 12650 (0.0007) [2023-03-07 14:22:37,312][213771] Updated weights for policy 0, policy_version 12660 (0.0006) [2023-03-07 14:22:38,072][213771] Updated weights for policy 0, policy_version 12670 (0.0006) [2023-03-07 14:22:38,840][213771] Updated weights for policy 0, policy_version 12680 (0.0006) [2023-03-07 14:22:39,639][213771] Updated weights for policy 0, policy_version 12690 (0.0007) [2023-03-07 14:22:40,436][213771] Updated weights for policy 0, policy_version 12700 (0.0006) [2023-03-07 14:22:41,105][213445] Fps is (10 sec: 13312.3, 60 sec: 13243.8, 300 sec: 13249.5). Total num frames: 13014016. Throughput: 0: 13242.5. Samples: 13010291. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 14:22:41,116][213445] Avg episode reward: [(0, '4218.881')] [2023-03-07 14:22:41,188][213771] Updated weights for policy 0, policy_version 12710 (0.0006) [2023-03-07 14:22:41,957][213771] Updated weights for policy 0, policy_version 12720 (0.0006) [2023-03-07 14:22:42,749][213771] Updated weights for policy 0, policy_version 12730 (0.0006) [2023-03-07 14:22:43,506][213771] Updated weights for policy 0, policy_version 12740 (0.0006) [2023-03-07 14:22:44,258][213771] Updated weights for policy 0, policy_version 12750 (0.0006) [2023-03-07 14:22:45,039][213771] Updated weights for policy 0, policy_version 12760 (0.0006) [2023-03-07 14:22:45,808][213771] Updated weights for policy 0, policy_version 12770 (0.0005) [2023-03-07 14:22:46,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13243.7, 300 sec: 13246.1). Total num frames: 13079552. Throughput: 0: 13248.5. Samples: 13050165. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 14:22:46,116][213445] Avg episode reward: [(0, '4130.951')] [2023-03-07 14:22:46,578][213771] Updated weights for policy 0, policy_version 12780 (0.0007) [2023-03-07 14:22:47,358][213771] Updated weights for policy 0, policy_version 12790 (0.0006) [2023-03-07 14:22:48,127][213771] Updated weights for policy 0, policy_version 12800 (0.0006) [2023-03-07 14:22:48,906][213771] Updated weights for policy 0, policy_version 12810 (0.0005) [2023-03-07 14:22:49,666][213771] Updated weights for policy 0, policy_version 12820 (0.0006) [2023-03-07 14:22:50,437][213771] Updated weights for policy 0, policy_version 12830 (0.0006) [2023-03-07 14:22:51,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13243.8, 300 sec: 13246.1). Total num frames: 13146112. Throughput: 0: 13258.8. Samples: 13129886. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:22:51,116][213445] Avg episode reward: [(0, '4091.639')] [2023-03-07 14:22:51,202][213771] Updated weights for policy 0, policy_version 12840 (0.0007) [2023-03-07 14:22:51,978][213771] Updated weights for policy 0, policy_version 12850 (0.0006) [2023-03-07 14:22:52,746][213771] Updated weights for policy 0, policy_version 12860 (0.0006) [2023-03-07 14:22:53,517][213771] Updated weights for policy 0, policy_version 12870 (0.0006) [2023-03-07 14:22:54,295][213771] Updated weights for policy 0, policy_version 12880 (0.0006) [2023-03-07 14:22:55,068][213771] Updated weights for policy 0, policy_version 12890 (0.0006) [2023-03-07 14:22:55,834][213771] Updated weights for policy 0, policy_version 12900 (0.0006) [2023-03-07 14:22:56,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 13212672. Throughput: 0: 13264.1. Samples: 13209586. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:22:56,116][213445] Avg episode reward: [(0, '4314.764')] [2023-03-07 14:22:56,608][213771] Updated weights for policy 0, policy_version 12910 (0.0005) [2023-03-07 14:22:57,394][213771] Updated weights for policy 0, policy_version 12920 (0.0006) [2023-03-07 14:22:58,166][213771] Updated weights for policy 0, policy_version 12930 (0.0006) [2023-03-07 14:22:58,930][213771] Updated weights for policy 0, policy_version 12940 (0.0006) [2023-03-07 14:22:59,728][213771] Updated weights for policy 0, policy_version 12950 (0.0006) [2023-03-07 14:23:00,489][213771] Updated weights for policy 0, policy_version 12960 (0.0006) [2023-03-07 14:23:01,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 13279232. Throughput: 0: 13262.4. Samples: 13249137. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:23:01,117][213445] Avg episode reward: [(0, '4081.744')] [2023-03-07 14:23:01,255][213771] Updated weights for policy 0, policy_version 12970 (0.0006) [2023-03-07 14:23:02,034][213771] Updated weights for policy 0, policy_version 12980 (0.0007) [2023-03-07 14:23:02,796][213771] Updated weights for policy 0, policy_version 12990 (0.0006) [2023-03-07 14:23:03,562][213771] Updated weights for policy 0, policy_version 13000 (0.0006) [2023-03-07 14:23:04,334][213771] Updated weights for policy 0, policy_version 13010 (0.0006) [2023-03-07 14:23:05,088][213771] Updated weights for policy 0, policy_version 13020 (0.0006) [2023-03-07 14:23:05,864][213771] Updated weights for policy 0, policy_version 13030 (0.0005) [2023-03-07 14:23:06,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13277.9, 300 sec: 13253.0). Total num frames: 13345792. Throughput: 0: 13266.5. Samples: 13328953. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:23:06,116][213445] Avg episode reward: [(0, '4260.302')] [2023-03-07 14:23:06,629][213771] Updated weights for policy 0, policy_version 13040 (0.0005) [2023-03-07 14:23:07,405][213771] Updated weights for policy 0, policy_version 13050 (0.0005) [2023-03-07 14:23:08,173][213771] Updated weights for policy 0, policy_version 13060 (0.0006) [2023-03-07 14:23:08,946][213771] Updated weights for policy 0, policy_version 13070 (0.0006) [2023-03-07 14:23:09,715][213771] Updated weights for policy 0, policy_version 13080 (0.0006) [2023-03-07 14:23:10,501][213771] Updated weights for policy 0, policy_version 13090 (0.0006) [2023-03-07 14:23:11,105][213445] Fps is (10 sec: 13209.9, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 13411328. Throughput: 0: 13265.5. Samples: 13408519. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:23:11,106][213445] Avg episode reward: [(0, '4282.440')] [2023-03-07 14:23:11,277][213771] Updated weights for policy 0, policy_version 13100 (0.0006) [2023-03-07 14:23:12,062][213771] Updated weights for policy 0, policy_version 13110 (0.0006) [2023-03-07 14:23:12,827][213771] Updated weights for policy 0, policy_version 13120 (0.0006) [2023-03-07 14:23:13,589][213771] Updated weights for policy 0, policy_version 13130 (0.0006) [2023-03-07 14:23:14,366][213771] Updated weights for policy 0, policy_version 13140 (0.0006) [2023-03-07 14:23:15,141][213771] Updated weights for policy 0, policy_version 13150 (0.0006) [2023-03-07 14:23:15,905][213771] Updated weights for policy 0, policy_version 13160 (0.0006) [2023-03-07 14:23:16,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 13477888. Throughput: 0: 13269.2. Samples: 13448396. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:23:16,114][213445] Avg episode reward: [(0, '4322.087')] [2023-03-07 14:23:16,677][213771] Updated weights for policy 0, policy_version 13170 (0.0005) [2023-03-07 14:23:17,441][213771] Updated weights for policy 0, policy_version 13180 (0.0007) [2023-03-07 14:23:18,202][213771] Updated weights for policy 0, policy_version 13190 (0.0007) [2023-03-07 14:23:18,998][213771] Updated weights for policy 0, policy_version 13200 (0.0008) [2023-03-07 14:23:19,775][213771] Updated weights for policy 0, policy_version 13210 (0.0005) [2023-03-07 14:23:20,546][213771] Updated weights for policy 0, policy_version 13220 (0.0007) [2023-03-07 14:23:21,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 13544448. Throughput: 0: 13262.8. Samples: 13527871. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:23:21,116][213445] Avg episode reward: [(0, '4336.573')] [2023-03-07 14:23:21,321][213771] Updated weights for policy 0, policy_version 13230 (0.0006) [2023-03-07 14:23:22,087][213771] Updated weights for policy 0, policy_version 13240 (0.0006) [2023-03-07 14:23:22,851][213771] Updated weights for policy 0, policy_version 13250 (0.0006) [2023-03-07 14:23:23,629][213771] Updated weights for policy 0, policy_version 13260 (0.0006) [2023-03-07 14:23:24,384][213771] Updated weights for policy 0, policy_version 13270 (0.0005) [2023-03-07 14:23:25,161][213771] Updated weights for policy 0, policy_version 13280 (0.0006) [2023-03-07 14:23:25,946][213771] Updated weights for policy 0, policy_version 13290 (0.0006) [2023-03-07 14:23:26,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13277.9, 300 sec: 13253.0). Total num frames: 13611008. Throughput: 0: 13271.1. Samples: 13607493. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:23:26,116][213445] Avg episode reward: [(0, '4236.210')] [2023-03-07 14:23:26,715][213771] Updated weights for policy 0, policy_version 13300 (0.0006) [2023-03-07 14:23:27,498][213771] Updated weights for policy 0, policy_version 13310 (0.0006) [2023-03-07 14:23:28,264][213771] Updated weights for policy 0, policy_version 13320 (0.0005) [2023-03-07 14:23:29,057][213771] Updated weights for policy 0, policy_version 13330 (0.0007) [2023-03-07 14:23:29,836][213771] Updated weights for policy 0, policy_version 13340 (0.0006) [2023-03-07 14:23:30,606][213771] Updated weights for policy 0, policy_version 13350 (0.0007) [2023-03-07 14:23:31,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 13676544. Throughput: 0: 13268.3. Samples: 13647236. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:23:31,116][213445] Avg episode reward: [(0, '4265.860')] [2023-03-07 14:23:31,377][213771] Updated weights for policy 0, policy_version 13360 (0.0006) [2023-03-07 14:23:32,154][213771] Updated weights for policy 0, policy_version 13370 (0.0006) [2023-03-07 14:23:32,906][213771] Updated weights for policy 0, policy_version 13380 (0.0006) [2023-03-07 14:23:33,702][213771] Updated weights for policy 0, policy_version 13390 (0.0006) [2023-03-07 14:23:34,472][213771] Updated weights for policy 0, policy_version 13400 (0.0006) [2023-03-07 14:23:35,238][213771] Updated weights for policy 0, policy_version 13410 (0.0005) [2023-03-07 14:23:36,031][213771] Updated weights for policy 0, policy_version 13420 (0.0006) [2023-03-07 14:23:36,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 13743104. Throughput: 0: 13257.6. Samples: 13726480. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:23:36,117][213445] Avg episode reward: [(0, '4130.599')] [2023-03-07 14:23:36,798][213771] Updated weights for policy 0, policy_version 13430 (0.0006) [2023-03-07 14:23:37,564][213771] Updated weights for policy 0, policy_version 13440 (0.0007) [2023-03-07 14:23:38,341][213771] Updated weights for policy 0, policy_version 13450 (0.0006) [2023-03-07 14:23:39,105][213771] Updated weights for policy 0, policy_version 13460 (0.0005) [2023-03-07 14:23:39,884][213771] Updated weights for policy 0, policy_version 13470 (0.0005) [2023-03-07 14:23:40,665][213771] Updated weights for policy 0, policy_version 13480 (0.0005) [2023-03-07 14:23:41,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 13808640. Throughput: 0: 13248.6. Samples: 13805774. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:23:41,116][213445] Avg episode reward: [(0, '4239.180')] [2023-03-07 14:23:41,424][213771] Updated weights for policy 0, policy_version 13490 (0.0006) [2023-03-07 14:23:42,191][213771] Updated weights for policy 0, policy_version 13500 (0.0005) [2023-03-07 14:23:42,976][213771] Updated weights for policy 0, policy_version 13510 (0.0006) [2023-03-07 14:23:43,742][213771] Updated weights for policy 0, policy_version 13520 (0.0006) [2023-03-07 14:23:44,516][213771] Updated weights for policy 0, policy_version 13530 (0.0006) [2023-03-07 14:23:45,293][213771] Updated weights for policy 0, policy_version 13540 (0.0007) [2023-03-07 14:23:46,047][213771] Updated weights for policy 0, policy_version 13550 (0.0006) [2023-03-07 14:23:46,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 13875200. Throughput: 0: 13257.7. Samples: 13845733. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:23:46,116][213445] Avg episode reward: [(0, '4296.829')] [2023-03-07 14:23:46,845][213771] Updated weights for policy 0, policy_version 13560 (0.0006) [2023-03-07 14:23:47,614][213771] Updated weights for policy 0, policy_version 13570 (0.0006) [2023-03-07 14:23:48,364][213771] Updated weights for policy 0, policy_version 13580 (0.0006) [2023-03-07 14:23:49,146][213771] Updated weights for policy 0, policy_version 13590 (0.0005) [2023-03-07 14:23:49,926][213771] Updated weights for policy 0, policy_version 13600 (0.0006) [2023-03-07 14:23:50,709][213771] Updated weights for policy 0, policy_version 13610 (0.0005) [2023-03-07 14:23:51,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 13941760. Throughput: 0: 13251.8. Samples: 13925282. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 14:23:51,116][213445] Avg episode reward: [(0, '4089.557')] [2023-03-07 14:23:51,487][213771] Updated weights for policy 0, policy_version 13620 (0.0006) [2023-03-07 14:23:52,238][213771] Updated weights for policy 0, policy_version 13630 (0.0006) [2023-03-07 14:23:53,018][213771] Updated weights for policy 0, policy_version 13640 (0.0006) [2023-03-07 14:23:53,786][213771] Updated weights for policy 0, policy_version 13650 (0.0006) [2023-03-07 14:23:54,542][213771] Updated weights for policy 0, policy_version 13660 (0.0007) [2023-03-07 14:23:55,327][213771] Updated weights for policy 0, policy_version 13670 (0.0005) [2023-03-07 14:23:56,087][213771] Updated weights for policy 0, policy_version 13680 (0.0006) [2023-03-07 14:23:56,105][213445] Fps is (10 sec: 13312.3, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 14008320. Throughput: 0: 13249.9. Samples: 14004766. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 14:23:56,116][213445] Avg episode reward: [(0, '3996.141')] [2023-03-07 14:23:56,869][213771] Updated weights for policy 0, policy_version 13690 (0.0007) [2023-03-07 14:23:57,643][213771] Updated weights for policy 0, policy_version 13700 (0.0007) [2023-03-07 14:23:58,427][213771] Updated weights for policy 0, policy_version 13710 (0.0006) [2023-03-07 14:23:59,190][213771] Updated weights for policy 0, policy_version 13720 (0.0006) [2023-03-07 14:23:59,952][213771] Updated weights for policy 0, policy_version 13730 (0.0006) [2023-03-07 14:24:00,756][213771] Updated weights for policy 0, policy_version 13740 (0.0006) [2023-03-07 14:24:01,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.8, 300 sec: 13249.5). Total num frames: 14073856. Throughput: 0: 13244.8. Samples: 14044413. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:24:01,116][213445] Avg episode reward: [(0, '4111.393')] [2023-03-07 14:24:01,530][213771] Updated weights for policy 0, policy_version 13750 (0.0006) [2023-03-07 14:24:02,308][213771] Updated weights for policy 0, policy_version 13760 (0.0007) [2023-03-07 14:24:03,085][213771] Updated weights for policy 0, policy_version 13770 (0.0006) [2023-03-07 14:24:03,863][213771] Updated weights for policy 0, policy_version 13780 (0.0006) [2023-03-07 14:24:04,630][213771] Updated weights for policy 0, policy_version 13790 (0.0005) [2023-03-07 14:24:05,409][213771] Updated weights for policy 0, policy_version 13800 (0.0006) [2023-03-07 14:24:06,105][213445] Fps is (10 sec: 13209.3, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 14140416. Throughput: 0: 13242.8. Samples: 14123798. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:24:06,106][213445] Avg episode reward: [(0, '4235.720')] [2023-03-07 14:24:06,111][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000013809_14140416.pth... [2023-03-07 14:24:06,140][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000010703_10959872.pth [2023-03-07 14:24:06,165][213771] Updated weights for policy 0, policy_version 13810 (0.0006) [2023-03-07 14:24:06,935][213771] Updated weights for policy 0, policy_version 13820 (0.0005) [2023-03-07 14:24:07,716][213771] Updated weights for policy 0, policy_version 13830 (0.0006) [2023-03-07 14:24:08,490][213771] Updated weights for policy 0, policy_version 13840 (0.0005) [2023-03-07 14:24:09,261][213771] Updated weights for policy 0, policy_version 13850 (0.0006) [2023-03-07 14:24:10,046][213771] Updated weights for policy 0, policy_version 13860 (0.0005) [2023-03-07 14:24:10,825][213771] Updated weights for policy 0, policy_version 13870 (0.0005) [2023-03-07 14:24:11,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 14205952. Throughput: 0: 13234.3. Samples: 14203037. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:24:11,116][213445] Avg episode reward: [(0, '4380.852')] [2023-03-07 14:24:11,589][213771] Updated weights for policy 0, policy_version 13880 (0.0006) [2023-03-07 14:24:12,382][213771] Updated weights for policy 0, policy_version 13890 (0.0007) [2023-03-07 14:24:13,143][213771] Updated weights for policy 0, policy_version 13900 (0.0005) [2023-03-07 14:24:13,901][213771] Updated weights for policy 0, policy_version 13910 (0.0007) [2023-03-07 14:24:14,682][213771] Updated weights for policy 0, policy_version 13920 (0.0006) [2023-03-07 14:24:15,458][213771] Updated weights for policy 0, policy_version 13930 (0.0006) [2023-03-07 14:24:16,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 14272512. Throughput: 0: 13236.2. Samples: 14242865. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:24:16,116][213445] Avg episode reward: [(0, '4283.566')] [2023-03-07 14:24:16,258][213771] Updated weights for policy 0, policy_version 13940 (0.0006) [2023-03-07 14:24:17,035][213771] Updated weights for policy 0, policy_version 13950 (0.0006) [2023-03-07 14:24:17,798][213771] Updated weights for policy 0, policy_version 13960 (0.0006) [2023-03-07 14:24:18,565][213771] Updated weights for policy 0, policy_version 13970 (0.0006) [2023-03-07 14:24:19,345][213771] Updated weights for policy 0, policy_version 13980 (0.0006) [2023-03-07 14:24:20,121][213771] Updated weights for policy 0, policy_version 13990 (0.0005) [2023-03-07 14:24:20,900][213771] Updated weights for policy 0, policy_version 14000 (0.0006) [2023-03-07 14:24:21,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13249.5). Total num frames: 14338048. Throughput: 0: 13233.3. Samples: 14321980. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:24:21,116][213445] Avg episode reward: [(0, '4352.884')] [2023-03-07 14:24:21,701][213771] Updated weights for policy 0, policy_version 14010 (0.0005) [2023-03-07 14:24:22,452][213771] Updated weights for policy 0, policy_version 14020 (0.0005) [2023-03-07 14:24:23,222][213771] Updated weights for policy 0, policy_version 14030 (0.0006) [2023-03-07 14:24:24,006][213771] Updated weights for policy 0, policy_version 14040 (0.0005) [2023-03-07 14:24:24,779][213771] Updated weights for policy 0, policy_version 14050 (0.0006) [2023-03-07 14:24:25,561][213771] Updated weights for policy 0, policy_version 14060 (0.0006) [2023-03-07 14:24:26,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13246.0). Total num frames: 14404608. Throughput: 0: 13230.9. Samples: 14401165. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:24:26,116][213445] Avg episode reward: [(0, '4299.211')] [2023-03-07 14:24:26,326][213771] Updated weights for policy 0, policy_version 14070 (0.0006) [2023-03-07 14:24:27,093][213771] Updated weights for policy 0, policy_version 14080 (0.0006) [2023-03-07 14:24:27,870][213771] Updated weights for policy 0, policy_version 14090 (0.0008) [2023-03-07 14:24:28,652][213771] Updated weights for policy 0, policy_version 14100 (0.0006) [2023-03-07 14:24:29,427][213771] Updated weights for policy 0, policy_version 14110 (0.0005) [2023-03-07 14:24:30,202][213771] Updated weights for policy 0, policy_version 14120 (0.0007) [2023-03-07 14:24:30,973][213771] Updated weights for policy 0, policy_version 14130 (0.0006) [2023-03-07 14:24:31,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13246.0). Total num frames: 14470144. Throughput: 0: 13226.6. Samples: 14440927. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:24:31,116][213445] Avg episode reward: [(0, '4225.752')] [2023-03-07 14:24:31,741][213771] Updated weights for policy 0, policy_version 14140 (0.0006) [2023-03-07 14:24:32,524][213771] Updated weights for policy 0, policy_version 14150 (0.0005) [2023-03-07 14:24:33,281][213771] Updated weights for policy 0, policy_version 14160 (0.0005) [2023-03-07 14:24:34,064][213771] Updated weights for policy 0, policy_version 14170 (0.0005) [2023-03-07 14:24:34,845][213771] Updated weights for policy 0, policy_version 14180 (0.0007) [2023-03-07 14:24:35,634][213771] Updated weights for policy 0, policy_version 14190 (0.0006) [2023-03-07 14:24:36,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13226.7, 300 sec: 13246.1). Total num frames: 14536704. Throughput: 0: 13221.0. Samples: 14520228. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:24:36,116][213445] Avg episode reward: [(0, '4324.937')] [2023-03-07 14:24:36,407][213771] Updated weights for policy 0, policy_version 14200 (0.0006) [2023-03-07 14:24:37,189][213771] Updated weights for policy 0, policy_version 14210 (0.0006) [2023-03-07 14:24:37,942][213771] Updated weights for policy 0, policy_version 14220 (0.0006) [2023-03-07 14:24:38,710][213771] Updated weights for policy 0, policy_version 14230 (0.0005) [2023-03-07 14:24:39,502][213771] Updated weights for policy 0, policy_version 14240 (0.0006) [2023-03-07 14:24:40,252][213771] Updated weights for policy 0, policy_version 14250 (0.0007) [2023-03-07 14:24:41,025][213771] Updated weights for policy 0, policy_version 14260 (0.0006) [2023-03-07 14:24:41,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13243.8, 300 sec: 13246.0). Total num frames: 14603264. Throughput: 0: 13218.7. Samples: 14599609. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:24:41,116][213445] Avg episode reward: [(0, '4308.474')] [2023-03-07 14:24:41,815][213771] Updated weights for policy 0, policy_version 14270 (0.0007) [2023-03-07 14:24:42,569][213771] Updated weights for policy 0, policy_version 14280 (0.0006) [2023-03-07 14:24:43,347][213771] Updated weights for policy 0, policy_version 14290 (0.0007) [2023-03-07 14:24:44,114][213771] Updated weights for policy 0, policy_version 14300 (0.0005) [2023-03-07 14:24:44,906][213771] Updated weights for policy 0, policy_version 14310 (0.0005) [2023-03-07 14:24:45,670][213771] Updated weights for policy 0, policy_version 14320 (0.0006) [2023-03-07 14:24:46,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13242.6). Total num frames: 14668800. Throughput: 0: 13221.2. Samples: 14639368. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:24:46,116][213445] Avg episode reward: [(0, '4218.505')] [2023-03-07 14:24:46,455][213771] Updated weights for policy 0, policy_version 14330 (0.0007) [2023-03-07 14:24:47,229][213771] Updated weights for policy 0, policy_version 14340 (0.0006) [2023-03-07 14:24:48,003][213771] Updated weights for policy 0, policy_version 14350 (0.0006) [2023-03-07 14:24:48,760][213771] Updated weights for policy 0, policy_version 14360 (0.0007) [2023-03-07 14:24:49,526][213771] Updated weights for policy 0, policy_version 14370 (0.0006) [2023-03-07 14:24:50,306][213771] Updated weights for policy 0, policy_version 14380 (0.0006) [2023-03-07 14:24:51,068][213771] Updated weights for policy 0, policy_version 14390 (0.0006) [2023-03-07 14:24:51,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13226.7, 300 sec: 13246.0). Total num frames: 14735360. Throughput: 0: 13223.8. Samples: 14718868. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:24:51,116][213445] Avg episode reward: [(0, '4227.755')] [2023-03-07 14:24:51,844][213771] Updated weights for policy 0, policy_version 14400 (0.0005) [2023-03-07 14:24:52,614][213771] Updated weights for policy 0, policy_version 14410 (0.0007) [2023-03-07 14:24:53,396][213771] Updated weights for policy 0, policy_version 14420 (0.0006) [2023-03-07 14:24:54,169][213771] Updated weights for policy 0, policy_version 14430 (0.0006) [2023-03-07 14:24:54,946][213771] Updated weights for policy 0, policy_version 14440 (0.0007) [2023-03-07 14:24:55,718][213771] Updated weights for policy 0, policy_version 14450 (0.0006) [2023-03-07 14:24:56,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13226.6, 300 sec: 13246.0). Total num frames: 14801920. Throughput: 0: 13231.9. Samples: 14798473. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:24:56,116][213445] Avg episode reward: [(0, '4243.966')] [2023-03-07 14:24:56,510][213771] Updated weights for policy 0, policy_version 14460 (0.0007) [2023-03-07 14:24:57,281][213771] Updated weights for policy 0, policy_version 14470 (0.0006) [2023-03-07 14:24:58,052][213771] Updated weights for policy 0, policy_version 14480 (0.0006) [2023-03-07 14:24:58,829][213771] Updated weights for policy 0, policy_version 14490 (0.0005) [2023-03-07 14:24:59,599][213771] Updated weights for policy 0, policy_version 14500 (0.0006) [2023-03-07 14:25:00,379][213771] Updated weights for policy 0, policy_version 14510 (0.0005) [2023-03-07 14:25:01,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13246.1). Total num frames: 14867456. Throughput: 0: 13223.4. Samples: 14837918. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:25:01,116][213445] Avg episode reward: [(0, '4230.986')] [2023-03-07 14:25:01,138][213771] Updated weights for policy 0, policy_version 14520 (0.0006) [2023-03-07 14:25:01,919][213771] Updated weights for policy 0, policy_version 14530 (0.0006) [2023-03-07 14:25:02,677][213771] Updated weights for policy 0, policy_version 14540 (0.0006) [2023-03-07 14:25:03,453][213771] Updated weights for policy 0, policy_version 14550 (0.0005) [2023-03-07 14:25:04,236][213771] Updated weights for policy 0, policy_version 14560 (0.0007) [2023-03-07 14:25:05,013][213771] Updated weights for policy 0, policy_version 14570 (0.0006) [2023-03-07 14:25:05,780][213771] Updated weights for policy 0, policy_version 14580 (0.0006) [2023-03-07 14:25:06,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13246.0). Total num frames: 14934016. Throughput: 0: 13228.1. Samples: 14917245. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:25:06,116][213445] Avg episode reward: [(0, '4124.187')] [2023-03-07 14:25:06,568][213771] Updated weights for policy 0, policy_version 14590 (0.0007) [2023-03-07 14:25:07,362][213771] Updated weights for policy 0, policy_version 14600 (0.0006) [2023-03-07 14:25:08,125][213771] Updated weights for policy 0, policy_version 14610 (0.0006) [2023-03-07 14:25:08,918][213771] Updated weights for policy 0, policy_version 14620 (0.0007) [2023-03-07 14:25:09,681][213771] Updated weights for policy 0, policy_version 14630 (0.0006) [2023-03-07 14:25:10,457][213771] Updated weights for policy 0, policy_version 14640 (0.0006) [2023-03-07 14:25:11,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13246.0). Total num frames: 14999552. Throughput: 0: 13228.1. Samples: 14996431. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:25:11,116][213445] Avg episode reward: [(0, '4262.994')] [2023-03-07 14:25:11,239][213771] Updated weights for policy 0, policy_version 14650 (0.0006) [2023-03-07 14:25:12,009][213771] Updated weights for policy 0, policy_version 14660 (0.0006) [2023-03-07 14:25:12,770][213771] Updated weights for policy 0, policy_version 14670 (0.0006) [2023-03-07 14:25:13,541][213771] Updated weights for policy 0, policy_version 14680 (0.0006) [2023-03-07 14:25:14,316][213771] Updated weights for policy 0, policy_version 14690 (0.0006) [2023-03-07 14:25:15,086][213771] Updated weights for policy 0, policy_version 14700 (0.0006) [2023-03-07 14:25:15,858][213771] Updated weights for policy 0, policy_version 14710 (0.0005) [2023-03-07 14:25:16,105][213445] Fps is (10 sec: 13107.3, 60 sec: 13209.6, 300 sec: 13242.6). Total num frames: 15065088. Throughput: 0: 13221.5. Samples: 15035893. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:25:16,116][213445] Avg episode reward: [(0, '4384.435')] [2023-03-07 14:25:16,628][213771] Updated weights for policy 0, policy_version 14720 (0.0006) [2023-03-07 14:25:17,394][213771] Updated weights for policy 0, policy_version 14730 (0.0006) [2023-03-07 14:25:18,176][213771] Updated weights for policy 0, policy_version 14740 (0.0006) [2023-03-07 14:25:18,960][213771] Updated weights for policy 0, policy_version 14750 (0.0007) [2023-03-07 14:25:19,721][213771] Updated weights for policy 0, policy_version 14760 (0.0006) [2023-03-07 14:25:20,489][213771] Updated weights for policy 0, policy_version 14770 (0.0007) [2023-03-07 14:25:21,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 15132672. Throughput: 0: 13234.5. Samples: 15115780. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:25:21,106][213445] Avg episode reward: [(0, '4330.381')] [2023-03-07 14:25:21,251][213771] Updated weights for policy 0, policy_version 14780 (0.0006) [2023-03-07 14:25:22,025][213771] Updated weights for policy 0, policy_version 14790 (0.0006) [2023-03-07 14:25:22,791][213771] Updated weights for policy 0, policy_version 14800 (0.0005) [2023-03-07 14:25:23,573][213771] Updated weights for policy 0, policy_version 14810 (0.0006) [2023-03-07 14:25:24,360][213771] Updated weights for policy 0, policy_version 14820 (0.0006) [2023-03-07 14:25:25,127][213771] Updated weights for policy 0, policy_version 14830 (0.0007) [2023-03-07 14:25:25,896][213771] Updated weights for policy 0, policy_version 14840 (0.0006) [2023-03-07 14:25:26,105][213445] Fps is (10 sec: 13311.7, 60 sec: 13226.6, 300 sec: 13246.0). Total num frames: 15198208. Throughput: 0: 13239.0. Samples: 15195367. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:25:26,106][213445] Avg episode reward: [(0, '4358.884')] [2023-03-07 14:25:26,643][213771] Updated weights for policy 0, policy_version 14850 (0.0006) [2023-03-07 14:25:27,404][213771] Updated weights for policy 0, policy_version 14860 (0.0005) [2023-03-07 14:25:28,184][213771] Updated weights for policy 0, policy_version 14870 (0.0006) [2023-03-07 14:25:28,982][213771] Updated weights for policy 0, policy_version 14880 (0.0005) [2023-03-07 14:25:29,749][213771] Updated weights for policy 0, policy_version 14890 (0.0005) [2023-03-07 14:25:30,516][213771] Updated weights for policy 0, policy_version 14900 (0.0006) [2023-03-07 14:25:31,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13246.1). Total num frames: 15264768. Throughput: 0: 13243.1. Samples: 15235309. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:25:31,106][213445] Avg episode reward: [(0, '4162.337')] [2023-03-07 14:25:31,275][213771] Updated weights for policy 0, policy_version 14910 (0.0005) [2023-03-07 14:25:32,050][213771] Updated weights for policy 0, policy_version 14920 (0.0006) [2023-03-07 14:25:32,832][213771] Updated weights for policy 0, policy_version 14930 (0.0007) [2023-03-07 14:25:33,610][213771] Updated weights for policy 0, policy_version 14940 (0.0006) [2023-03-07 14:25:34,388][213771] Updated weights for policy 0, policy_version 14950 (0.0006) [2023-03-07 14:25:35,153][213771] Updated weights for policy 0, policy_version 14960 (0.0007) [2023-03-07 14:25:35,929][213771] Updated weights for policy 0, policy_version 14970 (0.0007) [2023-03-07 14:25:36,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 15331328. Throughput: 0: 13240.8. Samples: 15314706. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:25:36,106][213445] Avg episode reward: [(0, '4251.215')] [2023-03-07 14:25:36,701][213771] Updated weights for policy 0, policy_version 14980 (0.0007) [2023-03-07 14:25:37,461][213771] Updated weights for policy 0, policy_version 14990 (0.0006) [2023-03-07 14:25:38,231][213771] Updated weights for policy 0, policy_version 15000 (0.0006) [2023-03-07 14:25:38,995][213771] Updated weights for policy 0, policy_version 15010 (0.0006) [2023-03-07 14:25:39,774][213771] Updated weights for policy 0, policy_version 15020 (0.0006) [2023-03-07 14:25:40,522][213771] Updated weights for policy 0, policy_version 15030 (0.0006) [2023-03-07 14:25:41,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13243.7, 300 sec: 13246.1). Total num frames: 15397888. Throughput: 0: 13249.9. Samples: 15394717. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:25:41,106][213445] Avg episode reward: [(0, '4234.227')] [2023-03-07 14:25:41,305][213771] Updated weights for policy 0, policy_version 15040 (0.0006) [2023-03-07 14:25:42,054][213771] Updated weights for policy 0, policy_version 15050 (0.0005) [2023-03-07 14:25:42,833][213771] Updated weights for policy 0, policy_version 15060 (0.0006) [2023-03-07 14:25:43,625][213771] Updated weights for policy 0, policy_version 15070 (0.0006) [2023-03-07 14:25:44,372][213771] Updated weights for policy 0, policy_version 15080 (0.0005) [2023-03-07 14:25:45,144][213771] Updated weights for policy 0, policy_version 15090 (0.0006) [2023-03-07 14:25:45,913][213771] Updated weights for policy 0, policy_version 15100 (0.0005) [2023-03-07 14:25:46,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 15464448. Throughput: 0: 13258.2. Samples: 15434536. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:25:46,106][213445] Avg episode reward: [(0, '4350.583')] [2023-03-07 14:25:46,680][213771] Updated weights for policy 0, policy_version 15110 (0.0006) [2023-03-07 14:25:47,458][213771] Updated weights for policy 0, policy_version 15120 (0.0006) [2023-03-07 14:25:48,218][213771] Updated weights for policy 0, policy_version 15130 (0.0006) [2023-03-07 14:25:48,981][213771] Updated weights for policy 0, policy_version 15140 (0.0006) [2023-03-07 14:25:49,762][213771] Updated weights for policy 0, policy_version 15150 (0.0006) [2023-03-07 14:25:50,511][213771] Updated weights for policy 0, policy_version 15160 (0.0007) [2023-03-07 14:25:51,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 15531008. Throughput: 0: 13274.9. Samples: 15514615. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:25:51,106][213445] Avg episode reward: [(0, '4042.840')] [2023-03-07 14:25:51,280][213771] Updated weights for policy 0, policy_version 15170 (0.0006) [2023-03-07 14:25:52,073][213771] Updated weights for policy 0, policy_version 15180 (0.0007) [2023-03-07 14:25:52,832][213771] Updated weights for policy 0, policy_version 15190 (0.0006) [2023-03-07 14:25:53,601][213771] Updated weights for policy 0, policy_version 15200 (0.0007) [2023-03-07 14:25:54,365][213771] Updated weights for policy 0, policy_version 15210 (0.0006) [2023-03-07 14:25:55,149][213771] Updated weights for policy 0, policy_version 15220 (0.0006) [2023-03-07 14:25:55,900][213771] Updated weights for policy 0, policy_version 15230 (0.0006) [2023-03-07 14:25:56,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 15597568. Throughput: 0: 13290.0. Samples: 15594484. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 14:25:56,106][213445] Avg episode reward: [(0, '4134.659')] [2023-03-07 14:25:56,680][213771] Updated weights for policy 0, policy_version 15240 (0.0006) [2023-03-07 14:25:57,455][213771] Updated weights for policy 0, policy_version 15250 (0.0006) [2023-03-07 14:25:58,215][213771] Updated weights for policy 0, policy_version 15260 (0.0007) [2023-03-07 14:25:58,986][213771] Updated weights for policy 0, policy_version 15270 (0.0006) [2023-03-07 14:25:59,757][213771] Updated weights for policy 0, policy_version 15280 (0.0006) [2023-03-07 14:26:00,541][213771] Updated weights for policy 0, policy_version 15290 (0.0006) [2023-03-07 14:26:01,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13277.9, 300 sec: 13249.5). Total num frames: 15664128. Throughput: 0: 13301.0. Samples: 15634440. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 14:26:01,106][213445] Avg episode reward: [(0, '4208.674')] [2023-03-07 14:26:01,325][213771] Updated weights for policy 0, policy_version 15300 (0.0007) [2023-03-07 14:26:02,083][213771] Updated weights for policy 0, policy_version 15310 (0.0006) [2023-03-07 14:26:02,854][213771] Updated weights for policy 0, policy_version 15320 (0.0006) [2023-03-07 14:26:03,643][213771] Updated weights for policy 0, policy_version 15330 (0.0006) [2023-03-07 14:26:04,409][213771] Updated weights for policy 0, policy_version 15340 (0.0006) [2023-03-07 14:26:05,185][213771] Updated weights for policy 0, policy_version 15350 (0.0006) [2023-03-07 14:26:05,969][213771] Updated weights for policy 0, policy_version 15360 (0.0006) [2023-03-07 14:26:06,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13260.8, 300 sec: 13246.0). Total num frames: 15729664. Throughput: 0: 13285.2. Samples: 15713617. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 14:26:06,106][213445] Avg episode reward: [(0, '3952.548')] [2023-03-07 14:26:06,119][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000015362_15730688.pth... [2023-03-07 14:26:06,147][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000012256_12550144.pth [2023-03-07 14:26:06,733][213771] Updated weights for policy 0, policy_version 15370 (0.0006) [2023-03-07 14:26:07,507][213771] Updated weights for policy 0, policy_version 15380 (0.0005) [2023-03-07 14:26:08,271][213771] Updated weights for policy 0, policy_version 15390 (0.0007) [2023-03-07 14:26:09,049][213771] Updated weights for policy 0, policy_version 15400 (0.0006) [2023-03-07 14:26:09,819][213771] Updated weights for policy 0, policy_version 15410 (0.0006) [2023-03-07 14:26:10,587][213771] Updated weights for policy 0, policy_version 15420 (0.0006) [2023-03-07 14:26:11,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13277.8, 300 sec: 13246.0). Total num frames: 15796224. Throughput: 0: 13285.5. Samples: 15793211. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 14:26:11,106][213445] Avg episode reward: [(0, '4138.965')] [2023-03-07 14:26:11,378][213771] Updated weights for policy 0, policy_version 15430 (0.0006) [2023-03-07 14:26:12,148][213771] Updated weights for policy 0, policy_version 15440 (0.0007) [2023-03-07 14:26:12,924][213771] Updated weights for policy 0, policy_version 15450 (0.0006) [2023-03-07 14:26:13,684][213771] Updated weights for policy 0, policy_version 15460 (0.0006) [2023-03-07 14:26:14,485][213771] Updated weights for policy 0, policy_version 15470 (0.0006) [2023-03-07 14:26:15,249][213771] Updated weights for policy 0, policy_version 15480 (0.0006) [2023-03-07 14:26:16,017][213771] Updated weights for policy 0, policy_version 15490 (0.0006) [2023-03-07 14:26:16,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13294.9, 300 sec: 13249.5). Total num frames: 15862784. Throughput: 0: 13280.8. Samples: 15832948. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 14:26:16,106][213445] Avg episode reward: [(0, '4052.911')] [2023-03-07 14:26:16,792][213771] Updated weights for policy 0, policy_version 15500 (0.0007) [2023-03-07 14:26:17,575][213771] Updated weights for policy 0, policy_version 15510 (0.0006) [2023-03-07 14:26:18,345][213771] Updated weights for policy 0, policy_version 15520 (0.0006) [2023-03-07 14:26:19,108][213771] Updated weights for policy 0, policy_version 15530 (0.0007) [2023-03-07 14:26:19,869][213771] Updated weights for policy 0, policy_version 15540 (0.0006) [2023-03-07 14:26:20,646][213771] Updated weights for policy 0, policy_version 15550 (0.0006) [2023-03-07 14:26:21,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13260.8, 300 sec: 13246.1). Total num frames: 15928320. Throughput: 0: 13281.6. Samples: 15912377. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 14:26:21,106][213445] Avg episode reward: [(0, '4029.443')] [2023-03-07 14:26:21,422][213771] Updated weights for policy 0, policy_version 15560 (0.0006) [2023-03-07 14:26:22,185][213771] Updated weights for policy 0, policy_version 15570 (0.0006) [2023-03-07 14:26:22,957][213771] Updated weights for policy 0, policy_version 15580 (0.0006) [2023-03-07 14:26:23,737][213771] Updated weights for policy 0, policy_version 15590 (0.0006) [2023-03-07 14:26:24,505][213771] Updated weights for policy 0, policy_version 15600 (0.0009) [2023-03-07 14:26:25,268][213771] Updated weights for policy 0, policy_version 15610 (0.0006) [2023-03-07 14:26:26,053][213771] Updated weights for policy 0, policy_version 15620 (0.0007) [2023-03-07 14:26:26,105][213445] Fps is (10 sec: 13209.9, 60 sec: 13277.9, 300 sec: 13246.0). Total num frames: 15994880. Throughput: 0: 13278.0. Samples: 15992227. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 14:26:26,105][213445] Avg episode reward: [(0, '3977.556')] [2023-03-07 14:26:26,813][213771] Updated weights for policy 0, policy_version 15630 (0.0006) [2023-03-07 14:26:27,583][213771] Updated weights for policy 0, policy_version 15640 (0.0006) [2023-03-07 14:26:28,360][213771] Updated weights for policy 0, policy_version 15650 (0.0006) [2023-03-07 14:26:29,120][213771] Updated weights for policy 0, policy_version 15660 (0.0006) [2023-03-07 14:26:29,889][213771] Updated weights for policy 0, policy_version 15670 (0.0005) [2023-03-07 14:26:30,667][213771] Updated weights for policy 0, policy_version 15680 (0.0006) [2023-03-07 14:26:31,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13277.9, 300 sec: 13249.5). Total num frames: 16061440. Throughput: 0: 13277.4. Samples: 16032017. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:26:31,105][213445] Avg episode reward: [(0, '4046.546')] [2023-03-07 14:26:31,441][213771] Updated weights for policy 0, policy_version 15690 (0.0006) [2023-03-07 14:26:32,209][213771] Updated weights for policy 0, policy_version 15700 (0.0006) [2023-03-07 14:26:32,969][213771] Updated weights for policy 0, policy_version 15710 (0.0006) [2023-03-07 14:26:33,758][213771] Updated weights for policy 0, policy_version 15720 (0.0006) [2023-03-07 14:26:34,525][213771] Updated weights for policy 0, policy_version 15730 (0.0007) [2023-03-07 14:26:35,304][213771] Updated weights for policy 0, policy_version 15740 (0.0006) [2023-03-07 14:26:36,071][213771] Updated weights for policy 0, policy_version 15750 (0.0006) [2023-03-07 14:26:36,105][213445] Fps is (10 sec: 13311.7, 60 sec: 13277.9, 300 sec: 13249.5). Total num frames: 16128000. Throughput: 0: 13266.8. Samples: 16111625. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:26:36,106][213445] Avg episode reward: [(0, '4057.551')] [2023-03-07 14:26:36,837][213771] Updated weights for policy 0, policy_version 15760 (0.0007) [2023-03-07 14:26:37,595][213771] Updated weights for policy 0, policy_version 15770 (0.0006) [2023-03-07 14:26:38,369][213771] Updated weights for policy 0, policy_version 15780 (0.0006) [2023-03-07 14:26:39,129][213771] Updated weights for policy 0, policy_version 15790 (0.0006) [2023-03-07 14:26:39,923][213771] Updated weights for policy 0, policy_version 15800 (0.0007) [2023-03-07 14:26:40,682][213771] Updated weights for policy 0, policy_version 15810 (0.0006) [2023-03-07 14:26:41,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13277.9, 300 sec: 13253.0). Total num frames: 16194560. Throughput: 0: 13261.7. Samples: 16191259. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:26:41,105][213445] Avg episode reward: [(0, '4198.645')] [2023-03-07 14:26:41,463][213771] Updated weights for policy 0, policy_version 15820 (0.0006) [2023-03-07 14:26:42,236][213771] Updated weights for policy 0, policy_version 15830 (0.0006) [2023-03-07 14:26:43,006][213771] Updated weights for policy 0, policy_version 15840 (0.0006) [2023-03-07 14:26:43,787][213771] Updated weights for policy 0, policy_version 15850 (0.0006) [2023-03-07 14:26:44,545][213771] Updated weights for policy 0, policy_version 15860 (0.0006) [2023-03-07 14:26:45,328][213771] Updated weights for policy 0, policy_version 15870 (0.0007) [2023-03-07 14:26:46,090][213771] Updated weights for policy 0, policy_version 15880 (0.0006) [2023-03-07 14:26:46,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13277.9, 300 sec: 13253.0). Total num frames: 16261120. Throughput: 0: 13258.7. Samples: 16231081. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:26:46,106][213445] Avg episode reward: [(0, '4182.165')] [2023-03-07 14:26:46,864][213771] Updated weights for policy 0, policy_version 15890 (0.0007) [2023-03-07 14:26:47,641][213771] Updated weights for policy 0, policy_version 15900 (0.0005) [2023-03-07 14:26:48,405][213771] Updated weights for policy 0, policy_version 15910 (0.0006) [2023-03-07 14:26:49,171][213771] Updated weights for policy 0, policy_version 15920 (0.0006) [2023-03-07 14:26:49,925][213771] Updated weights for policy 0, policy_version 15930 (0.0006) [2023-03-07 14:26:50,706][213771] Updated weights for policy 0, policy_version 15940 (0.0006) [2023-03-07 14:26:51,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 16326656. Throughput: 0: 13272.2. Samples: 16310864. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:26:51,106][213445] Avg episode reward: [(0, '4119.307')] [2023-03-07 14:26:51,490][213771] Updated weights for policy 0, policy_version 15950 (0.0006) [2023-03-07 14:26:52,259][213771] Updated weights for policy 0, policy_version 15960 (0.0006) [2023-03-07 14:26:53,037][213771] Updated weights for policy 0, policy_version 15970 (0.0007) [2023-03-07 14:26:53,804][213771] Updated weights for policy 0, policy_version 15980 (0.0007) [2023-03-07 14:26:54,584][213771] Updated weights for policy 0, policy_version 15990 (0.0006) [2023-03-07 14:26:55,356][213771] Updated weights for policy 0, policy_version 16000 (0.0006) [2023-03-07 14:26:56,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 16393216. Throughput: 0: 13270.5. Samples: 16390383. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:26:56,106][213445] Avg episode reward: [(0, '4178.217')] [2023-03-07 14:26:56,125][213771] Updated weights for policy 0, policy_version 16010 (0.0006) [2023-03-07 14:26:56,898][213771] Updated weights for policy 0, policy_version 16020 (0.0006) [2023-03-07 14:26:57,713][213771] Updated weights for policy 0, policy_version 16030 (0.0006) [2023-03-07 14:26:58,499][213771] Updated weights for policy 0, policy_version 16040 (0.0006) [2023-03-07 14:26:59,260][213771] Updated weights for policy 0, policy_version 16050 (0.0007) [2023-03-07 14:27:00,039][213771] Updated weights for policy 0, policy_version 16060 (0.0005) [2023-03-07 14:27:00,798][213771] Updated weights for policy 0, policy_version 16070 (0.0006) [2023-03-07 14:27:01,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13256.5). Total num frames: 16459776. Throughput: 0: 13257.1. Samples: 16429513. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:27:01,105][213445] Avg episode reward: [(0, '4202.424')] [2023-03-07 14:27:01,591][213771] Updated weights for policy 0, policy_version 16080 (0.0006) [2023-03-07 14:27:02,354][213771] Updated weights for policy 0, policy_version 16090 (0.0005) [2023-03-07 14:27:03,114][213771] Updated weights for policy 0, policy_version 16100 (0.0008) [2023-03-07 14:27:03,905][213771] Updated weights for policy 0, policy_version 16110 (0.0005) [2023-03-07 14:27:04,650][213771] Updated weights for policy 0, policy_version 16120 (0.0006) [2023-03-07 14:27:05,433][213771] Updated weights for policy 0, policy_version 16130 (0.0006) [2023-03-07 14:27:06,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 16525312. Throughput: 0: 13263.4. Samples: 16509231. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:27:06,106][213445] Avg episode reward: [(0, '4203.450')] [2023-03-07 14:27:06,216][213771] Updated weights for policy 0, policy_version 16140 (0.0006) [2023-03-07 14:27:06,968][213771] Updated weights for policy 0, policy_version 16150 (0.0005) [2023-03-07 14:27:07,757][213771] Updated weights for policy 0, policy_version 16160 (0.0006) [2023-03-07 14:27:08,516][213771] Updated weights for policy 0, policy_version 16170 (0.0006) [2023-03-07 14:27:09,286][213771] Updated weights for policy 0, policy_version 16180 (0.0006) [2023-03-07 14:27:10,038][213771] Updated weights for policy 0, policy_version 16190 (0.0006) [2023-03-07 14:27:10,821][213771] Updated weights for policy 0, policy_version 16200 (0.0006) [2023-03-07 14:27:11,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 16591872. Throughput: 0: 13260.3. Samples: 16588941. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:27:11,106][213445] Avg episode reward: [(0, '4279.312')] [2023-03-07 14:27:11,581][213771] Updated weights for policy 0, policy_version 16210 (0.0006) [2023-03-07 14:27:12,347][213771] Updated weights for policy 0, policy_version 16220 (0.0006) [2023-03-07 14:27:13,133][213771] Updated weights for policy 0, policy_version 16230 (0.0006) [2023-03-07 14:27:13,905][213771] Updated weights for policy 0, policy_version 16240 (0.0008) [2023-03-07 14:27:14,682][213771] Updated weights for policy 0, policy_version 16250 (0.0006) [2023-03-07 14:27:15,454][213771] Updated weights for policy 0, policy_version 16260 (0.0006) [2023-03-07 14:27:16,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 16658432. Throughput: 0: 13260.4. Samples: 16628736. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:27:16,106][213445] Avg episode reward: [(0, '4308.262')] [2023-03-07 14:27:16,229][213771] Updated weights for policy 0, policy_version 16270 (0.0006) [2023-03-07 14:27:17,009][213771] Updated weights for policy 0, policy_version 16280 (0.0006) [2023-03-07 14:27:17,782][213771] Updated weights for policy 0, policy_version 16290 (0.0006) [2023-03-07 14:27:18,554][213771] Updated weights for policy 0, policy_version 16300 (0.0006) [2023-03-07 14:27:19,342][213771] Updated weights for policy 0, policy_version 16310 (0.0006) [2023-03-07 14:27:20,106][213771] Updated weights for policy 0, policy_version 16320 (0.0007) [2023-03-07 14:27:20,870][213771] Updated weights for policy 0, policy_version 16330 (0.0007) [2023-03-07 14:27:21,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13277.9, 300 sec: 13256.5). Total num frames: 16724992. Throughput: 0: 13254.1. Samples: 16708056. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:27:21,106][213445] Avg episode reward: [(0, '4328.955')] [2023-03-07 14:27:21,645][213771] Updated weights for policy 0, policy_version 16340 (0.0006) [2023-03-07 14:27:22,412][213771] Updated weights for policy 0, policy_version 16350 (0.0006) [2023-03-07 14:27:23,180][213771] Updated weights for policy 0, policy_version 16360 (0.0006) [2023-03-07 14:27:23,962][213771] Updated weights for policy 0, policy_version 16370 (0.0005) [2023-03-07 14:27:24,738][213771] Updated weights for policy 0, policy_version 16380 (0.0006) [2023-03-07 14:27:25,486][213771] Updated weights for policy 0, policy_version 16390 (0.0006) [2023-03-07 14:27:26,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 16790528. Throughput: 0: 13255.2. Samples: 16787743. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:27:26,106][213445] Avg episode reward: [(0, '4251.621')] [2023-03-07 14:27:26,263][213771] Updated weights for policy 0, policy_version 16400 (0.0007) [2023-03-07 14:27:27,047][213771] Updated weights for policy 0, policy_version 16410 (0.0007) [2023-03-07 14:27:27,827][213771] Updated weights for policy 0, policy_version 16420 (0.0006) [2023-03-07 14:27:28,617][213771] Updated weights for policy 0, policy_version 16430 (0.0007) [2023-03-07 14:27:29,383][213771] Updated weights for policy 0, policy_version 16440 (0.0006) [2023-03-07 14:27:30,150][213771] Updated weights for policy 0, policy_version 16450 (0.0006) [2023-03-07 14:27:30,941][213771] Updated weights for policy 0, policy_version 16460 (0.0006) [2023-03-07 14:27:31,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 16857088. Throughput: 0: 13248.5. Samples: 16827263. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:27:31,106][213445] Avg episode reward: [(0, '4214.428')] [2023-03-07 14:27:31,721][213771] Updated weights for policy 0, policy_version 16470 (0.0006) [2023-03-07 14:27:32,487][213771] Updated weights for policy 0, policy_version 16480 (0.0006) [2023-03-07 14:27:33,278][213771] Updated weights for policy 0, policy_version 16490 (0.0006) [2023-03-07 14:27:34,041][213771] Updated weights for policy 0, policy_version 16500 (0.0006) [2023-03-07 14:27:34,802][213771] Updated weights for policy 0, policy_version 16510 (0.0007) [2023-03-07 14:27:35,594][213771] Updated weights for policy 0, policy_version 16520 (0.0006) [2023-03-07 14:27:36,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.8, 300 sec: 13249.5). Total num frames: 16922624. Throughput: 0: 13236.9. Samples: 16906523. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:27:36,105][213445] Avg episode reward: [(0, '4099.622')] [2023-03-07 14:27:36,367][213771] Updated weights for policy 0, policy_version 16530 (0.0007) [2023-03-07 14:27:37,136][213771] Updated weights for policy 0, policy_version 16540 (0.0007) [2023-03-07 14:27:37,928][213771] Updated weights for policy 0, policy_version 16550 (0.0007) [2023-03-07 14:27:38,690][213771] Updated weights for policy 0, policy_version 16560 (0.0006) [2023-03-07 14:27:39,460][213771] Updated weights for policy 0, policy_version 16570 (0.0006) [2023-03-07 14:27:40,223][213771] Updated weights for policy 0, policy_version 16580 (0.0006) [2023-03-07 14:27:41,016][213771] Updated weights for policy 0, policy_version 16590 (0.0006) [2023-03-07 14:27:41,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13253.0). Total num frames: 16989184. Throughput: 0: 13231.0. Samples: 16985777. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:27:41,106][213445] Avg episode reward: [(0, '4177.273')] [2023-03-07 14:27:41,797][213771] Updated weights for policy 0, policy_version 16600 (0.0006) [2023-03-07 14:27:42,572][213771] Updated weights for policy 0, policy_version 16610 (0.0006) [2023-03-07 14:27:43,342][213771] Updated weights for policy 0, policy_version 16620 (0.0005) [2023-03-07 14:27:44,128][213771] Updated weights for policy 0, policy_version 16630 (0.0006) [2023-03-07 14:27:44,878][213771] Updated weights for policy 0, policy_version 16640 (0.0006) [2023-03-07 14:27:45,667][213771] Updated weights for policy 0, policy_version 16650 (0.0007) [2023-03-07 14:27:46,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13226.7, 300 sec: 13249.5). Total num frames: 17054720. Throughput: 0: 13242.0. Samples: 17025403. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:27:46,106][213445] Avg episode reward: [(0, '4307.322')] [2023-03-07 14:27:46,440][213771] Updated weights for policy 0, policy_version 16660 (0.0006) [2023-03-07 14:27:47,210][213771] Updated weights for policy 0, policy_version 16670 (0.0006) [2023-03-07 14:27:47,975][213771] Updated weights for policy 0, policy_version 16680 (0.0006) [2023-03-07 14:27:48,758][213771] Updated weights for policy 0, policy_version 16690 (0.0007) [2023-03-07 14:27:49,531][213771] Updated weights for policy 0, policy_version 16700 (0.0005) [2023-03-07 14:27:50,306][213771] Updated weights for policy 0, policy_version 16710 (0.0006) [2023-03-07 14:27:51,074][213771] Updated weights for policy 0, policy_version 16720 (0.0006) [2023-03-07 14:27:51,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 17121280. Throughput: 0: 13234.6. Samples: 17104788. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:27:51,106][213445] Avg episode reward: [(0, '4261.506')] [2023-03-07 14:27:51,855][213771] Updated weights for policy 0, policy_version 16730 (0.0006) [2023-03-07 14:27:52,610][213771] Updated weights for policy 0, policy_version 16740 (0.0007) [2023-03-07 14:27:53,380][213771] Updated weights for policy 0, policy_version 16750 (0.0006) [2023-03-07 14:27:54,146][213771] Updated weights for policy 0, policy_version 16760 (0.0006) [2023-03-07 14:27:54,921][213771] Updated weights for policy 0, policy_version 16770 (0.0006) [2023-03-07 14:27:55,696][213771] Updated weights for policy 0, policy_version 16780 (0.0006) [2023-03-07 14:27:56,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 17187840. Throughput: 0: 13239.8. Samples: 17184733. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:27:56,106][213445] Avg episode reward: [(0, '4272.363')] [2023-03-07 14:27:56,462][213771] Updated weights for policy 0, policy_version 16790 (0.0006) [2023-03-07 14:27:57,242][213771] Updated weights for policy 0, policy_version 16800 (0.0006) [2023-03-07 14:27:58,024][213771] Updated weights for policy 0, policy_version 16810 (0.0006) [2023-03-07 14:27:58,794][213771] Updated weights for policy 0, policy_version 16820 (0.0006) [2023-03-07 14:27:59,576][213771] Updated weights for policy 0, policy_version 16830 (0.0006) [2023-03-07 14:28:00,332][213771] Updated weights for policy 0, policy_version 16840 (0.0006) [2023-03-07 14:28:01,102][213771] Updated weights for policy 0, policy_version 16850 (0.0006) [2023-03-07 14:28:01,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 17254400. Throughput: 0: 13233.5. Samples: 17224243. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 14:28:01,106][213445] Avg episode reward: [(0, '4275.553')] [2023-03-07 14:28:01,872][213771] Updated weights for policy 0, policy_version 16860 (0.0005) [2023-03-07 14:28:02,654][213771] Updated weights for policy 0, policy_version 16870 (0.0006) [2023-03-07 14:28:03,419][213771] Updated weights for policy 0, policy_version 16880 (0.0006) [2023-03-07 14:28:04,200][213771] Updated weights for policy 0, policy_version 16890 (0.0007) [2023-03-07 14:28:04,979][213771] Updated weights for policy 0, policy_version 16900 (0.0007) [2023-03-07 14:28:05,731][213771] Updated weights for policy 0, policy_version 16910 (0.0006) [2023-03-07 14:28:06,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 17319936. Throughput: 0: 13238.5. Samples: 17303788. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 14:28:06,106][213445] Avg episode reward: [(0, '4177.865')] [2023-03-07 14:28:06,121][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000016915_17320960.pth... [2023-03-07 14:28:06,149][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000013809_14140416.pth [2023-03-07 14:28:06,513][213771] Updated weights for policy 0, policy_version 16920 (0.0006) [2023-03-07 14:28:07,285][213771] Updated weights for policy 0, policy_version 16930 (0.0006) [2023-03-07 14:28:08,069][213771] Updated weights for policy 0, policy_version 16940 (0.0006) [2023-03-07 14:28:08,852][213771] Updated weights for policy 0, policy_version 16950 (0.0007) [2023-03-07 14:28:09,617][213771] Updated weights for policy 0, policy_version 16960 (0.0007) [2023-03-07 14:28:10,378][213771] Updated weights for policy 0, policy_version 16970 (0.0006) [2023-03-07 14:28:11,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 17386496. Throughput: 0: 13232.1. Samples: 17383186. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:28:11,106][213445] Avg episode reward: [(0, '4216.284')] [2023-03-07 14:28:11,158][213771] Updated weights for policy 0, policy_version 16980 (0.0006) [2023-03-07 14:28:11,913][213771] Updated weights for policy 0, policy_version 16990 (0.0006) [2023-03-07 14:28:12,714][213771] Updated weights for policy 0, policy_version 17000 (0.0005) [2023-03-07 14:28:13,492][213771] Updated weights for policy 0, policy_version 17010 (0.0006) [2023-03-07 14:28:14,255][213771] Updated weights for policy 0, policy_version 17020 (0.0006) [2023-03-07 14:28:15,007][213771] Updated weights for policy 0, policy_version 17030 (0.0005) [2023-03-07 14:28:15,796][213771] Updated weights for policy 0, policy_version 17040 (0.0006) [2023-03-07 14:28:16,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13243.8, 300 sec: 13249.5). Total num frames: 17453056. Throughput: 0: 13236.7. Samples: 17422913. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:28:16,105][213445] Avg episode reward: [(0, '4334.186')] [2023-03-07 14:28:16,560][213771] Updated weights for policy 0, policy_version 17050 (0.0006) [2023-03-07 14:28:17,319][213771] Updated weights for policy 0, policy_version 17060 (0.0005) [2023-03-07 14:28:18,102][213771] Updated weights for policy 0, policy_version 17070 (0.0006) [2023-03-07 14:28:18,871][213771] Updated weights for policy 0, policy_version 17080 (0.0006) [2023-03-07 14:28:19,636][213771] Updated weights for policy 0, policy_version 17090 (0.0006) [2023-03-07 14:28:20,410][213771] Updated weights for policy 0, policy_version 17100 (0.0007) [2023-03-07 14:28:21,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 17519616. Throughput: 0: 13252.8. Samples: 17502900. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:28:21,106][213445] Avg episode reward: [(0, '4340.495')] [2023-03-07 14:28:21,180][213771] Updated weights for policy 0, policy_version 17110 (0.0006) [2023-03-07 14:28:21,949][213771] Updated weights for policy 0, policy_version 17120 (0.0006) [2023-03-07 14:28:22,714][213771] Updated weights for policy 0, policy_version 17130 (0.0006) [2023-03-07 14:28:23,486][213771] Updated weights for policy 0, policy_version 17140 (0.0006) [2023-03-07 14:28:24,271][213771] Updated weights for policy 0, policy_version 17150 (0.0006) [2023-03-07 14:28:25,040][213771] Updated weights for policy 0, policy_version 17160 (0.0007) [2023-03-07 14:28:25,816][213771] Updated weights for policy 0, policy_version 17170 (0.0005) [2023-03-07 14:28:26,105][213445] Fps is (10 sec: 13209.2, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 17585152. Throughput: 0: 13257.3. Samples: 17582359. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:28:26,106][213445] Avg episode reward: [(0, '4266.288')] [2023-03-07 14:28:26,584][213771] Updated weights for policy 0, policy_version 17180 (0.0007) [2023-03-07 14:28:27,363][213771] Updated weights for policy 0, policy_version 17190 (0.0006) [2023-03-07 14:28:28,152][213771] Updated weights for policy 0, policy_version 17200 (0.0007) [2023-03-07 14:28:28,906][213771] Updated weights for policy 0, policy_version 17210 (0.0006) [2023-03-07 14:28:29,682][213771] Updated weights for policy 0, policy_version 17220 (0.0007) [2023-03-07 14:28:30,465][213771] Updated weights for policy 0, policy_version 17230 (0.0007) [2023-03-07 14:28:31,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 17651712. Throughput: 0: 13260.6. Samples: 17622129. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:28:31,106][213445] Avg episode reward: [(0, '4326.855')] [2023-03-07 14:28:31,244][213771] Updated weights for policy 0, policy_version 17240 (0.0007) [2023-03-07 14:28:31,999][213771] Updated weights for policy 0, policy_version 17250 (0.0006) [2023-03-07 14:28:32,776][213771] Updated weights for policy 0, policy_version 17260 (0.0005) [2023-03-07 14:28:33,545][213771] Updated weights for policy 0, policy_version 17270 (0.0005) [2023-03-07 14:28:34,317][213771] Updated weights for policy 0, policy_version 17280 (0.0006) [2023-03-07 14:28:35,084][213771] Updated weights for policy 0, policy_version 17290 (0.0006) [2023-03-07 14:28:35,871][213771] Updated weights for policy 0, policy_version 17300 (0.0006) [2023-03-07 14:28:36,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 17718272. Throughput: 0: 13265.4. Samples: 17701732. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:28:36,106][213445] Avg episode reward: [(0, '4266.018')] [2023-03-07 14:28:36,642][213771] Updated weights for policy 0, policy_version 17310 (0.0007) [2023-03-07 14:28:37,424][213771] Updated weights for policy 0, policy_version 17320 (0.0005) [2023-03-07 14:28:38,190][213771] Updated weights for policy 0, policy_version 17330 (0.0006) [2023-03-07 14:28:38,954][213771] Updated weights for policy 0, policy_version 17340 (0.0006) [2023-03-07 14:28:39,729][213771] Updated weights for policy 0, policy_version 17350 (0.0006) [2023-03-07 14:28:40,481][213771] Updated weights for policy 0, policy_version 17360 (0.0006) [2023-03-07 14:28:41,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 17784832. Throughput: 0: 13258.3. Samples: 17781355. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:28:41,106][213445] Avg episode reward: [(0, '4353.382')] [2023-03-07 14:28:41,252][213771] Updated weights for policy 0, policy_version 17370 (0.0006) [2023-03-07 14:28:42,026][213771] Updated weights for policy 0, policy_version 17380 (0.0006) [2023-03-07 14:28:42,794][213771] Updated weights for policy 0, policy_version 17390 (0.0005) [2023-03-07 14:28:43,558][213771] Updated weights for policy 0, policy_version 17400 (0.0006) [2023-03-07 14:28:44,340][213771] Updated weights for policy 0, policy_version 17410 (0.0006) [2023-03-07 14:28:45,111][213771] Updated weights for policy 0, policy_version 17420 (0.0006) [2023-03-07 14:28:45,870][213771] Updated weights for policy 0, policy_version 17430 (0.0006) [2023-03-07 14:28:46,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 17850368. Throughput: 0: 13265.1. Samples: 17821172. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:28:46,106][213445] Avg episode reward: [(0, '4266.907')] [2023-03-07 14:28:46,650][213771] Updated weights for policy 0, policy_version 17440 (0.0007) [2023-03-07 14:28:47,414][213771] Updated weights for policy 0, policy_version 17450 (0.0006) [2023-03-07 14:28:48,205][213771] Updated weights for policy 0, policy_version 17460 (0.0006) [2023-03-07 14:28:48,976][213771] Updated weights for policy 0, policy_version 17470 (0.0006) [2023-03-07 14:28:49,749][213771] Updated weights for policy 0, policy_version 17480 (0.0006) [2023-03-07 14:28:50,532][213771] Updated weights for policy 0, policy_version 17490 (0.0007) [2023-03-07 14:28:51,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 17916928. Throughput: 0: 13263.0. Samples: 17900623. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:28:51,106][213445] Avg episode reward: [(0, '4310.342')] [2023-03-07 14:28:51,294][213771] Updated weights for policy 0, policy_version 17500 (0.0006) [2023-03-07 14:28:52,063][213771] Updated weights for policy 0, policy_version 17510 (0.0006) [2023-03-07 14:28:52,836][213771] Updated weights for policy 0, policy_version 17520 (0.0006) [2023-03-07 14:28:53,604][213771] Updated weights for policy 0, policy_version 17530 (0.0006) [2023-03-07 14:28:54,389][213771] Updated weights for policy 0, policy_version 17540 (0.0006) [2023-03-07 14:28:55,173][213771] Updated weights for policy 0, policy_version 17550 (0.0006) [2023-03-07 14:28:55,947][213771] Updated weights for policy 0, policy_version 17560 (0.0006) [2023-03-07 14:28:56,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13243.8, 300 sec: 13249.5). Total num frames: 17982464. Throughput: 0: 13261.5. Samples: 17979953. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:28:56,105][213445] Avg episode reward: [(0, '4306.682')] [2023-03-07 14:28:56,724][213771] Updated weights for policy 0, policy_version 17570 (0.0007) [2023-03-07 14:28:57,483][213771] Updated weights for policy 0, policy_version 17580 (0.0006) [2023-03-07 14:28:58,256][213771] Updated weights for policy 0, policy_version 17590 (0.0006) [2023-03-07 14:28:59,041][213771] Updated weights for policy 0, policy_version 17600 (0.0006) [2023-03-07 14:28:59,803][213771] Updated weights for policy 0, policy_version 17610 (0.0006) [2023-03-07 14:29:00,575][213771] Updated weights for policy 0, policy_version 17620 (0.0005) [2023-03-07 14:29:01,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 18049024. Throughput: 0: 13263.9. Samples: 18019792. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:29:01,106][213445] Avg episode reward: [(0, '4322.203')] [2023-03-07 14:29:01,365][213771] Updated weights for policy 0, policy_version 17630 (0.0006) [2023-03-07 14:29:02,137][213771] Updated weights for policy 0, policy_version 17640 (0.0006) [2023-03-07 14:29:02,897][213771] Updated weights for policy 0, policy_version 17650 (0.0005) [2023-03-07 14:29:03,669][213771] Updated weights for policy 0, policy_version 17660 (0.0007) [2023-03-07 14:29:04,447][213771] Updated weights for policy 0, policy_version 17670 (0.0006) [2023-03-07 14:29:05,225][213771] Updated weights for policy 0, policy_version 17680 (0.0006) [2023-03-07 14:29:06,010][213771] Updated weights for policy 0, policy_version 17690 (0.0006) [2023-03-07 14:29:06,105][213445] Fps is (10 sec: 13311.7, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 18115584. Throughput: 0: 13249.9. Samples: 18099148. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:29:06,106][213445] Avg episode reward: [(0, '4297.755')] [2023-03-07 14:29:06,773][213771] Updated weights for policy 0, policy_version 17700 (0.0007) [2023-03-07 14:29:07,554][213771] Updated weights for policy 0, policy_version 17710 (0.0006) [2023-03-07 14:29:08,333][213771] Updated weights for policy 0, policy_version 17720 (0.0006) [2023-03-07 14:29:09,105][213771] Updated weights for policy 0, policy_version 17730 (0.0006) [2023-03-07 14:29:09,878][213771] Updated weights for policy 0, policy_version 17740 (0.0005) [2023-03-07 14:29:10,644][213771] Updated weights for policy 0, policy_version 17750 (0.0006) [2023-03-07 14:29:11,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 18181120. Throughput: 0: 13247.4. Samples: 18178493. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:29:11,106][213445] Avg episode reward: [(0, '4252.398')] [2023-03-07 14:29:11,433][213771] Updated weights for policy 0, policy_version 17760 (0.0006) [2023-03-07 14:29:12,213][213771] Updated weights for policy 0, policy_version 17770 (0.0007) [2023-03-07 14:29:12,977][213771] Updated weights for policy 0, policy_version 17780 (0.0006) [2023-03-07 14:29:13,773][213771] Updated weights for policy 0, policy_version 17790 (0.0006) [2023-03-07 14:29:14,553][213771] Updated weights for policy 0, policy_version 17800 (0.0006) [2023-03-07 14:29:15,313][213771] Updated weights for policy 0, policy_version 17810 (0.0006) [2023-03-07 14:29:16,092][213771] Updated weights for policy 0, policy_version 17820 (0.0005) [2023-03-07 14:29:16,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13253.0). Total num frames: 18247680. Throughput: 0: 13241.6. Samples: 18218003. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 14:29:16,106][213445] Avg episode reward: [(0, '4254.738')] [2023-03-07 14:29:16,873][213771] Updated weights for policy 0, policy_version 17830 (0.0006) [2023-03-07 14:29:17,646][213771] Updated weights for policy 0, policy_version 17840 (0.0007) [2023-03-07 14:29:18,429][213771] Updated weights for policy 0, policy_version 17850 (0.0007) [2023-03-07 14:29:19,207][213771] Updated weights for policy 0, policy_version 17860 (0.0006) [2023-03-07 14:29:19,984][213771] Updated weights for policy 0, policy_version 17870 (0.0007) [2023-03-07 14:29:20,753][213771] Updated weights for policy 0, policy_version 17880 (0.0006) [2023-03-07 14:29:21,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13249.5). Total num frames: 18313216. Throughput: 0: 13227.4. Samples: 18296967. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 14:29:21,106][213445] Avg episode reward: [(0, '4310.461')] [2023-03-07 14:29:21,540][213771] Updated weights for policy 0, policy_version 17890 (0.0006) [2023-03-07 14:29:22,318][213771] Updated weights for policy 0, policy_version 17900 (0.0006) [2023-03-07 14:29:23,072][213771] Updated weights for policy 0, policy_version 17910 (0.0007) [2023-03-07 14:29:23,868][213771] Updated weights for policy 0, policy_version 17920 (0.0006) [2023-03-07 14:29:24,635][213771] Updated weights for policy 0, policy_version 17930 (0.0007) [2023-03-07 14:29:25,409][213771] Updated weights for policy 0, policy_version 17940 (0.0006) [2023-03-07 14:29:26,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.8, 300 sec: 13253.0). Total num frames: 18379776. Throughput: 0: 13217.7. Samples: 18376151. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:29:26,106][213445] Avg episode reward: [(0, '4297.076')] [2023-03-07 14:29:26,173][213771] Updated weights for policy 0, policy_version 17950 (0.0006) [2023-03-07 14:29:26,943][213771] Updated weights for policy 0, policy_version 17960 (0.0006) [2023-03-07 14:29:27,726][213771] Updated weights for policy 0, policy_version 17970 (0.0006) [2023-03-07 14:29:28,489][213771] Updated weights for policy 0, policy_version 17980 (0.0006) [2023-03-07 14:29:29,282][213771] Updated weights for policy 0, policy_version 17990 (0.0007) [2023-03-07 14:29:30,050][213771] Updated weights for policy 0, policy_version 18000 (0.0006) [2023-03-07 14:29:30,802][213771] Updated weights for policy 0, policy_version 18010 (0.0006) [2023-03-07 14:29:31,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13249.5). Total num frames: 18445312. Throughput: 0: 13215.1. Samples: 18415853. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:29:31,106][213445] Avg episode reward: [(0, '4375.853')] [2023-03-07 14:29:31,578][213771] Updated weights for policy 0, policy_version 18020 (0.0006) [2023-03-07 14:29:32,349][213771] Updated weights for policy 0, policy_version 18030 (0.0006) [2023-03-07 14:29:33,117][213771] Updated weights for policy 0, policy_version 18040 (0.0006) [2023-03-07 14:29:33,886][213771] Updated weights for policy 0, policy_version 18050 (0.0005) [2023-03-07 14:29:34,657][213771] Updated weights for policy 0, policy_version 18060 (0.0007) [2023-03-07 14:29:35,437][213771] Updated weights for policy 0, policy_version 18070 (0.0007) [2023-03-07 14:29:36,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13249.5). Total num frames: 18511872. Throughput: 0: 13222.8. Samples: 18495651. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:29:36,106][213445] Avg episode reward: [(0, '4320.686')] [2023-03-07 14:29:36,204][213771] Updated weights for policy 0, policy_version 18080 (0.0005) [2023-03-07 14:29:36,976][213771] Updated weights for policy 0, policy_version 18090 (0.0005) [2023-03-07 14:29:37,775][213771] Updated weights for policy 0, policy_version 18100 (0.0007) [2023-03-07 14:29:38,520][213771] Updated weights for policy 0, policy_version 18110 (0.0006) [2023-03-07 14:29:39,312][213771] Updated weights for policy 0, policy_version 18120 (0.0005) [2023-03-07 14:29:40,080][213771] Updated weights for policy 0, policy_version 18130 (0.0006) [2023-03-07 14:29:40,854][213771] Updated weights for policy 0, policy_version 18140 (0.0006) [2023-03-07 14:29:41,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13226.7, 300 sec: 13253.0). Total num frames: 18578432. Throughput: 0: 13224.3. Samples: 18575048. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:29:41,106][213445] Avg episode reward: [(0, '4372.882')] [2023-03-07 14:29:41,654][213771] Updated weights for policy 0, policy_version 18150 (0.0006) [2023-03-07 14:29:42,431][213771] Updated weights for policy 0, policy_version 18160 (0.0006) [2023-03-07 14:29:43,195][213771] Updated weights for policy 0, policy_version 18170 (0.0008) [2023-03-07 14:29:43,985][213771] Updated weights for policy 0, policy_version 18180 (0.0007) [2023-03-07 14:29:44,768][213771] Updated weights for policy 0, policy_version 18190 (0.0006) [2023-03-07 14:29:45,525][213771] Updated weights for policy 0, policy_version 18200 (0.0006) [2023-03-07 14:29:46,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13249.5). Total num frames: 18643968. Throughput: 0: 13216.5. Samples: 18614534. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:29:46,106][213445] Avg episode reward: [(0, '4182.618')] [2023-03-07 14:29:46,309][213771] Updated weights for policy 0, policy_version 18210 (0.0006) [2023-03-07 14:29:47,080][213771] Updated weights for policy 0, policy_version 18220 (0.0007) [2023-03-07 14:29:47,855][213771] Updated weights for policy 0, policy_version 18230 (0.0006) [2023-03-07 14:29:48,617][213771] Updated weights for policy 0, policy_version 18240 (0.0006) [2023-03-07 14:29:49,409][213771] Updated weights for policy 0, policy_version 18250 (0.0007) [2023-03-07 14:29:50,177][213771] Updated weights for policy 0, policy_version 18260 (0.0006) [2023-03-07 14:29:50,966][213771] Updated weights for policy 0, policy_version 18270 (0.0006) [2023-03-07 14:29:51,105][213445] Fps is (10 sec: 13107.1, 60 sec: 13209.6, 300 sec: 13246.1). Total num frames: 18709504. Throughput: 0: 13207.7. Samples: 18693491. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:29:51,106][213445] Avg episode reward: [(0, '4046.986')] [2023-03-07 14:29:51,725][213771] Updated weights for policy 0, policy_version 18280 (0.0006) [2023-03-07 14:29:52,521][213771] Updated weights for policy 0, policy_version 18290 (0.0007) [2023-03-07 14:29:53,275][213771] Updated weights for policy 0, policy_version 18300 (0.0006) [2023-03-07 14:29:54,051][213771] Updated weights for policy 0, policy_version 18310 (0.0007) [2023-03-07 14:29:54,811][213771] Updated weights for policy 0, policy_version 18320 (0.0006) [2023-03-07 14:29:55,585][213771] Updated weights for policy 0, policy_version 18330 (0.0007) [2023-03-07 14:29:56,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13243.7, 300 sec: 13253.0). Total num frames: 18777088. Throughput: 0: 13221.7. Samples: 18773466. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:29:56,106][213445] Avg episode reward: [(0, '4264.766')] [2023-03-07 14:29:56,349][213771] Updated weights for policy 0, policy_version 18340 (0.0005) [2023-03-07 14:29:57,124][213771] Updated weights for policy 0, policy_version 18350 (0.0006) [2023-03-07 14:29:57,902][213771] Updated weights for policy 0, policy_version 18360 (0.0006) [2023-03-07 14:29:58,653][213771] Updated weights for policy 0, policy_version 18370 (0.0006) [2023-03-07 14:29:59,443][213771] Updated weights for policy 0, policy_version 18380 (0.0005) [2023-03-07 14:30:00,203][213771] Updated weights for policy 0, policy_version 18390 (0.0006) [2023-03-07 14:30:00,976][213771] Updated weights for policy 0, policy_version 18400 (0.0007) [2023-03-07 14:30:01,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13226.7, 300 sec: 13249.5). Total num frames: 18842624. Throughput: 0: 13227.8. Samples: 18813252. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:30:01,106][213445] Avg episode reward: [(0, '4249.342')] [2023-03-07 14:30:01,750][213771] Updated weights for policy 0, policy_version 18410 (0.0006) [2023-03-07 14:30:02,518][213771] Updated weights for policy 0, policy_version 18420 (0.0007) [2023-03-07 14:30:03,290][213771] Updated weights for policy 0, policy_version 18430 (0.0006) [2023-03-07 14:30:04,058][213771] Updated weights for policy 0, policy_version 18440 (0.0006) [2023-03-07 14:30:04,841][213771] Updated weights for policy 0, policy_version 18450 (0.0006) [2023-03-07 14:30:05,606][213771] Updated weights for policy 0, policy_version 18460 (0.0005) [2023-03-07 14:30:06,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13226.7, 300 sec: 13253.0). Total num frames: 18909184. Throughput: 0: 13241.6. Samples: 18892841. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:30:06,106][213445] Avg episode reward: [(0, '4274.758')] [2023-03-07 14:30:06,112][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000018466_18909184.pth... [2023-03-07 14:30:06,141][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000015362_15730688.pth [2023-03-07 14:30:06,363][213771] Updated weights for policy 0, policy_version 18470 (0.0006) [2023-03-07 14:30:07,155][213771] Updated weights for policy 0, policy_version 18480 (0.0005) [2023-03-07 14:30:07,927][213771] Updated weights for policy 0, policy_version 18490 (0.0006) [2023-03-07 14:30:08,694][213771] Updated weights for policy 0, policy_version 18500 (0.0007) [2023-03-07 14:30:09,468][213771] Updated weights for policy 0, policy_version 18510 (0.0007) [2023-03-07 14:30:10,250][213771] Updated weights for policy 0, policy_version 18520 (0.0006) [2023-03-07 14:30:11,026][213771] Updated weights for policy 0, policy_version 18530 (0.0006) [2023-03-07 14:30:11,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13253.0). Total num frames: 18974720. Throughput: 0: 13244.3. Samples: 18972143. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:30:11,106][213445] Avg episode reward: [(0, '4323.425')] [2023-03-07 14:30:11,813][213771] Updated weights for policy 0, policy_version 18540 (0.0006) [2023-03-07 14:30:12,579][213771] Updated weights for policy 0, policy_version 18550 (0.0007) [2023-03-07 14:30:13,358][213771] Updated weights for policy 0, policy_version 18560 (0.0006) [2023-03-07 14:30:14,152][213771] Updated weights for policy 0, policy_version 18570 (0.0006) [2023-03-07 14:30:14,912][213771] Updated weights for policy 0, policy_version 18580 (0.0006) [2023-03-07 14:30:15,685][213771] Updated weights for policy 0, policy_version 18590 (0.0006) [2023-03-07 14:30:16,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13226.7, 300 sec: 13249.5). Total num frames: 19041280. Throughput: 0: 13240.0. Samples: 19011651. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:30:16,106][213445] Avg episode reward: [(0, '4293.242')] [2023-03-07 14:30:16,455][213771] Updated weights for policy 0, policy_version 18600 (0.0006) [2023-03-07 14:30:17,212][213771] Updated weights for policy 0, policy_version 18610 (0.0006) [2023-03-07 14:30:17,991][213771] Updated weights for policy 0, policy_version 18620 (0.0005) [2023-03-07 14:30:18,776][213771] Updated weights for policy 0, policy_version 18630 (0.0007) [2023-03-07 14:30:19,555][213771] Updated weights for policy 0, policy_version 18640 (0.0006) [2023-03-07 14:30:20,333][213771] Updated weights for policy 0, policy_version 18650 (0.0006) [2023-03-07 14:30:21,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13249.5). Total num frames: 19106816. Throughput: 0: 13232.6. Samples: 19091116. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:30:21,105][213445] Avg episode reward: [(0, '4389.587')] [2023-03-07 14:30:21,108][213771] Updated weights for policy 0, policy_version 18660 (0.0007) [2023-03-07 14:30:21,868][213771] Updated weights for policy 0, policy_version 18670 (0.0006) [2023-03-07 14:30:22,638][213771] Updated weights for policy 0, policy_version 18680 (0.0006) [2023-03-07 14:30:23,418][213771] Updated weights for policy 0, policy_version 18690 (0.0006) [2023-03-07 14:30:24,189][213771] Updated weights for policy 0, policy_version 18700 (0.0005) [2023-03-07 14:30:24,963][213771] Updated weights for policy 0, policy_version 18710 (0.0006) [2023-03-07 14:30:25,739][213771] Updated weights for policy 0, policy_version 18720 (0.0006) [2023-03-07 14:30:26,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13249.5). Total num frames: 19173376. Throughput: 0: 13231.3. Samples: 19170459. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:30:26,105][213445] Avg episode reward: [(0, '4135.404')] [2023-03-07 14:30:26,514][213771] Updated weights for policy 0, policy_version 18730 (0.0007) [2023-03-07 14:30:27,294][213771] Updated weights for policy 0, policy_version 18740 (0.0006) [2023-03-07 14:30:28,071][213771] Updated weights for policy 0, policy_version 18750 (0.0005) [2023-03-07 14:30:28,842][213771] Updated weights for policy 0, policy_version 18760 (0.0008) [2023-03-07 14:30:29,606][213771] Updated weights for policy 0, policy_version 18770 (0.0006) [2023-03-07 14:30:30,387][213771] Updated weights for policy 0, policy_version 18780 (0.0007) [2023-03-07 14:30:31,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13243.8, 300 sec: 13249.5). Total num frames: 19239936. Throughput: 0: 13236.0. Samples: 19210155. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:30:31,105][213445] Avg episode reward: [(0, '4348.149')] [2023-03-07 14:30:31,156][213771] Updated weights for policy 0, policy_version 18790 (0.0006) [2023-03-07 14:30:31,925][213771] Updated weights for policy 0, policy_version 18800 (0.0006) [2023-03-07 14:30:32,682][213771] Updated weights for policy 0, policy_version 18810 (0.0006) [2023-03-07 14:30:33,483][213771] Updated weights for policy 0, policy_version 18820 (0.0006) [2023-03-07 14:30:34,231][213771] Updated weights for policy 0, policy_version 18830 (0.0006) [2023-03-07 14:30:34,999][213771] Updated weights for policy 0, policy_version 18840 (0.0007) [2023-03-07 14:30:35,772][213771] Updated weights for policy 0, policy_version 18850 (0.0006) [2023-03-07 14:30:36,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 19306496. Throughput: 0: 13254.1. Samples: 19289928. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:30:36,106][213445] Avg episode reward: [(0, '4326.592')] [2023-03-07 14:30:36,541][213771] Updated weights for policy 0, policy_version 18860 (0.0006) [2023-03-07 14:30:37,310][213771] Updated weights for policy 0, policy_version 18870 (0.0006) [2023-03-07 14:30:38,080][213771] Updated weights for policy 0, policy_version 18880 (0.0006) [2023-03-07 14:30:38,881][213771] Updated weights for policy 0, policy_version 18890 (0.0006) [2023-03-07 14:30:39,651][213771] Updated weights for policy 0, policy_version 18900 (0.0006) [2023-03-07 14:30:40,406][213771] Updated weights for policy 0, policy_version 18910 (0.0006) [2023-03-07 14:30:41,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13246.1). Total num frames: 19372032. Throughput: 0: 13243.2. Samples: 19369409. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:30:41,105][213445] Avg episode reward: [(0, '4203.628')] [2023-03-07 14:30:41,187][213771] Updated weights for policy 0, policy_version 18920 (0.0005) [2023-03-07 14:30:41,957][213771] Updated weights for policy 0, policy_version 18930 (0.0007) [2023-03-07 14:30:42,749][213771] Updated weights for policy 0, policy_version 18940 (0.0006) [2023-03-07 14:30:43,517][213771] Updated weights for policy 0, policy_version 18950 (0.0006) [2023-03-07 14:30:44,286][213771] Updated weights for policy 0, policy_version 18960 (0.0006) [2023-03-07 14:30:45,063][213771] Updated weights for policy 0, policy_version 18970 (0.0006) [2023-03-07 14:30:45,850][213771] Updated weights for policy 0, policy_version 18980 (0.0007) [2023-03-07 14:30:46,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 19438592. Throughput: 0: 13237.5. Samples: 19408941. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:30:46,106][213445] Avg episode reward: [(0, '4226.481')] [2023-03-07 14:30:46,610][213771] Updated weights for policy 0, policy_version 18990 (0.0006) [2023-03-07 14:30:47,391][213771] Updated weights for policy 0, policy_version 19000 (0.0007) [2023-03-07 14:30:48,167][213771] Updated weights for policy 0, policy_version 19010 (0.0006) [2023-03-07 14:30:48,929][213771] Updated weights for policy 0, policy_version 19020 (0.0006) [2023-03-07 14:30:49,706][213771] Updated weights for policy 0, policy_version 19030 (0.0006) [2023-03-07 14:30:50,473][213771] Updated weights for policy 0, policy_version 19040 (0.0005) [2023-03-07 14:30:51,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13260.8, 300 sec: 13246.0). Total num frames: 19505152. Throughput: 0: 13233.5. Samples: 19488344. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:30:51,106][213445] Avg episode reward: [(0, '4150.594')] [2023-03-07 14:30:51,242][213771] Updated weights for policy 0, policy_version 19050 (0.0006) [2023-03-07 14:30:52,010][213771] Updated weights for policy 0, policy_version 19060 (0.0006) [2023-03-07 14:30:52,801][213771] Updated weights for policy 0, policy_version 19070 (0.0006) [2023-03-07 14:30:53,561][213771] Updated weights for policy 0, policy_version 19080 (0.0006) [2023-03-07 14:30:54,339][213771] Updated weights for policy 0, policy_version 19090 (0.0005) [2023-03-07 14:30:55,109][213771] Updated weights for policy 0, policy_version 19100 (0.0006) [2023-03-07 14:30:55,889][213771] Updated weights for policy 0, policy_version 19110 (0.0005) [2023-03-07 14:30:56,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13242.6). Total num frames: 19570688. Throughput: 0: 13239.7. Samples: 19567931. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:30:56,106][213445] Avg episode reward: [(0, '4314.257')] [2023-03-07 14:30:56,653][213771] Updated weights for policy 0, policy_version 19120 (0.0005) [2023-03-07 14:30:57,420][213771] Updated weights for policy 0, policy_version 19130 (0.0007) [2023-03-07 14:30:58,205][213771] Updated weights for policy 0, policy_version 19140 (0.0005) [2023-03-07 14:30:58,974][213771] Updated weights for policy 0, policy_version 19150 (0.0006) [2023-03-07 14:30:59,744][213771] Updated weights for policy 0, policy_version 19160 (0.0007) [2023-03-07 14:31:00,501][213771] Updated weights for policy 0, policy_version 19170 (0.0006) [2023-03-07 14:31:01,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13246.1). Total num frames: 19637248. Throughput: 0: 13247.7. Samples: 19607795. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:31:01,105][213445] Avg episode reward: [(0, '4332.391')] [2023-03-07 14:31:01,277][213771] Updated weights for policy 0, policy_version 19180 (0.0007) [2023-03-07 14:31:02,046][213771] Updated weights for policy 0, policy_version 19190 (0.0006) [2023-03-07 14:31:02,817][213771] Updated weights for policy 0, policy_version 19200 (0.0006) [2023-03-07 14:31:03,595][213771] Updated weights for policy 0, policy_version 19210 (0.0006) [2023-03-07 14:31:04,349][213771] Updated weights for policy 0, policy_version 19220 (0.0007) [2023-03-07 14:31:05,126][213771] Updated weights for policy 0, policy_version 19230 (0.0007) [2023-03-07 14:31:05,895][213771] Updated weights for policy 0, policy_version 19240 (0.0006) [2023-03-07 14:31:06,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13243.8, 300 sec: 13246.1). Total num frames: 19703808. Throughput: 0: 13252.5. Samples: 19687479. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:31:06,106][213445] Avg episode reward: [(0, '4379.211')] [2023-03-07 14:31:06,666][213771] Updated weights for policy 0, policy_version 19250 (0.0006) [2023-03-07 14:31:07,446][213771] Updated weights for policy 0, policy_version 19260 (0.0007) [2023-03-07 14:31:08,252][213771] Updated weights for policy 0, policy_version 19270 (0.0006) [2023-03-07 14:31:09,004][213771] Updated weights for policy 0, policy_version 19280 (0.0007) [2023-03-07 14:31:09,795][213771] Updated weights for policy 0, policy_version 19290 (0.0007) [2023-03-07 14:31:10,561][213771] Updated weights for policy 0, policy_version 19300 (0.0007) [2023-03-07 14:31:11,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13246.1). Total num frames: 19770368. Throughput: 0: 13250.9. Samples: 19766751. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:31:11,105][213445] Avg episode reward: [(0, '4436.400')] [2023-03-07 14:31:11,106][213720] Saving new best policy, reward=4436.400! [2023-03-07 14:31:11,331][213771] Updated weights for policy 0, policy_version 19310 (0.0006) [2023-03-07 14:31:12,107][213771] Updated weights for policy 0, policy_version 19320 (0.0006) [2023-03-07 14:31:12,877][213771] Updated weights for policy 0, policy_version 19330 (0.0006) [2023-03-07 14:31:13,667][213771] Updated weights for policy 0, policy_version 19340 (0.0006) [2023-03-07 14:31:14,443][213771] Updated weights for policy 0, policy_version 19350 (0.0006) [2023-03-07 14:31:15,225][213771] Updated weights for policy 0, policy_version 19360 (0.0006) [2023-03-07 14:31:15,999][213771] Updated weights for policy 0, policy_version 19370 (0.0007) [2023-03-07 14:31:16,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.8, 300 sec: 13246.0). Total num frames: 19835904. Throughput: 0: 13249.2. Samples: 19806368. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:31:16,105][213445] Avg episode reward: [(0, '4419.870')] [2023-03-07 14:31:16,778][213771] Updated weights for policy 0, policy_version 19380 (0.0006) [2023-03-07 14:31:17,558][213771] Updated weights for policy 0, policy_version 19390 (0.0006) [2023-03-07 14:31:18,326][213771] Updated weights for policy 0, policy_version 19400 (0.0005) [2023-03-07 14:31:19,110][213771] Updated weights for policy 0, policy_version 19410 (0.0005) [2023-03-07 14:31:19,890][213771] Updated weights for policy 0, policy_version 19420 (0.0006) [2023-03-07 14:31:20,666][213771] Updated weights for policy 0, policy_version 19430 (0.0006) [2023-03-07 14:31:21,105][213445] Fps is (10 sec: 13107.1, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 19901440. Throughput: 0: 13233.4. Samples: 19885429. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:31:21,106][213445] Avg episode reward: [(0, '4446.361')] [2023-03-07 14:31:21,106][213720] Saving new best policy, reward=4446.361! [2023-03-07 14:31:21,447][213771] Updated weights for policy 0, policy_version 19440 (0.0005) [2023-03-07 14:31:22,215][213771] Updated weights for policy 0, policy_version 19450 (0.0006) [2023-03-07 14:31:22,983][213771] Updated weights for policy 0, policy_version 19460 (0.0006) [2023-03-07 14:31:23,753][213771] Updated weights for policy 0, policy_version 19470 (0.0005) [2023-03-07 14:31:24,542][213771] Updated weights for policy 0, policy_version 19480 (0.0006) [2023-03-07 14:31:25,307][213771] Updated weights for policy 0, policy_version 19490 (0.0005) [2023-03-07 14:31:26,065][213771] Updated weights for policy 0, policy_version 19500 (0.0006) [2023-03-07 14:31:26,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 19968000. Throughput: 0: 13232.0. Samples: 19964847. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:31:26,105][213445] Avg episode reward: [(0, '4500.775')] [2023-03-07 14:31:26,110][213720] Saving new best policy, reward=4500.775! [2023-03-07 14:31:26,856][213771] Updated weights for policy 0, policy_version 19510 (0.0005) [2023-03-07 14:31:27,614][213771] Updated weights for policy 0, policy_version 19520 (0.0007) [2023-03-07 14:31:28,378][213771] Updated weights for policy 0, policy_version 19530 (0.0006) [2023-03-07 14:31:29,156][213771] Updated weights for policy 0, policy_version 19540 (0.0007) [2023-03-07 14:31:29,938][213771] Updated weights for policy 0, policy_version 19550 (0.0006) [2023-03-07 14:31:30,708][213771] Updated weights for policy 0, policy_version 19560 (0.0006) [2023-03-07 14:31:31,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 20034560. Throughput: 0: 13239.1. Samples: 20004699. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:31:31,106][213445] Avg episode reward: [(0, '4463.064')] [2023-03-07 14:31:31,483][213771] Updated weights for policy 0, policy_version 19570 (0.0009) [2023-03-07 14:31:32,249][213771] Updated weights for policy 0, policy_version 19580 (0.0006) [2023-03-07 14:31:33,019][213771] Updated weights for policy 0, policy_version 19590 (0.0006) [2023-03-07 14:31:33,788][213771] Updated weights for policy 0, policy_version 19600 (0.0007) [2023-03-07 14:31:34,556][213771] Updated weights for policy 0, policy_version 19610 (0.0006) [2023-03-07 14:31:35,337][213771] Updated weights for policy 0, policy_version 19620 (0.0006) [2023-03-07 14:31:36,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13226.7, 300 sec: 13239.1). Total num frames: 20100096. Throughput: 0: 13241.5. Samples: 20084212. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:31:36,106][213445] Avg episode reward: [(0, '4413.958')] [2023-03-07 14:31:36,115][213771] Updated weights for policy 0, policy_version 19630 (0.0005) [2023-03-07 14:31:36,878][213771] Updated weights for policy 0, policy_version 19640 (0.0005) [2023-03-07 14:31:37,661][213771] Updated weights for policy 0, policy_version 19650 (0.0006) [2023-03-07 14:31:38,437][213771] Updated weights for policy 0, policy_version 19660 (0.0006) [2023-03-07 14:31:39,218][213771] Updated weights for policy 0, policy_version 19670 (0.0006) [2023-03-07 14:31:39,977][213771] Updated weights for policy 0, policy_version 19680 (0.0006) [2023-03-07 14:31:40,757][213771] Updated weights for policy 0, policy_version 19690 (0.0005) [2023-03-07 14:31:41,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13239.1). Total num frames: 20166656. Throughput: 0: 13236.7. Samples: 20163585. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:31:41,106][213445] Avg episode reward: [(0, '4411.558')] [2023-03-07 14:31:41,547][213771] Updated weights for policy 0, policy_version 19700 (0.0006) [2023-03-07 14:31:42,306][213771] Updated weights for policy 0, policy_version 19710 (0.0006) [2023-03-07 14:31:43,070][213771] Updated weights for policy 0, policy_version 19720 (0.0006) [2023-03-07 14:31:43,832][213771] Updated weights for policy 0, policy_version 19730 (0.0006) [2023-03-07 14:31:44,615][213771] Updated weights for policy 0, policy_version 19740 (0.0006) [2023-03-07 14:31:45,400][213771] Updated weights for policy 0, policy_version 19750 (0.0005) [2023-03-07 14:31:46,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 20233216. Throughput: 0: 13238.5. Samples: 20203529. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:31:46,106][213445] Avg episode reward: [(0, '4402.007')] [2023-03-07 14:31:46,163][213771] Updated weights for policy 0, policy_version 19760 (0.0005) [2023-03-07 14:31:46,929][213771] Updated weights for policy 0, policy_version 19770 (0.0005) [2023-03-07 14:31:47,721][213771] Updated weights for policy 0, policy_version 19780 (0.0006) [2023-03-07 14:31:48,502][213771] Updated weights for policy 0, policy_version 19790 (0.0006) [2023-03-07 14:31:49,278][213771] Updated weights for policy 0, policy_version 19800 (0.0005) [2023-03-07 14:31:50,044][213771] Updated weights for policy 0, policy_version 19810 (0.0006) [2023-03-07 14:31:50,807][213771] Updated weights for policy 0, policy_version 19820 (0.0006) [2023-03-07 14:31:51,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13226.6, 300 sec: 13239.1). Total num frames: 20298752. Throughput: 0: 13227.2. Samples: 20282706. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:31:51,106][213445] Avg episode reward: [(0, '4430.197')] [2023-03-07 14:31:51,577][213771] Updated weights for policy 0, policy_version 19830 (0.0006) [2023-03-07 14:31:52,358][213771] Updated weights for policy 0, policy_version 19840 (0.0007) [2023-03-07 14:31:53,133][213771] Updated weights for policy 0, policy_version 19850 (0.0006) [2023-03-07 14:31:53,911][213771] Updated weights for policy 0, policy_version 19860 (0.0006) [2023-03-07 14:31:54,697][213771] Updated weights for policy 0, policy_version 19870 (0.0005) [2023-03-07 14:31:55,462][213771] Updated weights for policy 0, policy_version 19880 (0.0006) [2023-03-07 14:31:56,105][213445] Fps is (10 sec: 13209.3, 60 sec: 13243.7, 300 sec: 13239.1). Total num frames: 20365312. Throughput: 0: 13226.8. Samples: 20361959. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:31:56,106][213445] Avg episode reward: [(0, '4428.350')] [2023-03-07 14:31:56,245][213771] Updated weights for policy 0, policy_version 19890 (0.0006) [2023-03-07 14:31:57,012][213771] Updated weights for policy 0, policy_version 19900 (0.0006) [2023-03-07 14:31:57,792][213771] Updated weights for policy 0, policy_version 19910 (0.0007) [2023-03-07 14:31:58,567][213771] Updated weights for policy 0, policy_version 19920 (0.0006) [2023-03-07 14:31:59,345][213771] Updated weights for policy 0, policy_version 19930 (0.0006) [2023-03-07 14:32:00,140][213771] Updated weights for policy 0, policy_version 19940 (0.0006) [2023-03-07 14:32:00,892][213771] Updated weights for policy 0, policy_version 19950 (0.0006) [2023-03-07 14:32:01,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13226.7, 300 sec: 13239.1). Total num frames: 20430848. Throughput: 0: 13225.0. Samples: 20401493. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:32:01,106][213445] Avg episode reward: [(0, '4391.499')] [2023-03-07 14:32:01,675][213771] Updated weights for policy 0, policy_version 19960 (0.0005) [2023-03-07 14:32:02,447][213771] Updated weights for policy 0, policy_version 19970 (0.0006) [2023-03-07 14:32:03,202][213771] Updated weights for policy 0, policy_version 19980 (0.0006) [2023-03-07 14:32:03,965][213771] Updated weights for policy 0, policy_version 19990 (0.0006) [2023-03-07 14:32:04,739][213771] Updated weights for policy 0, policy_version 20000 (0.0006) [2023-03-07 14:32:05,500][213771] Updated weights for policy 0, policy_version 20010 (0.0006) [2023-03-07 14:32:06,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13226.7, 300 sec: 13239.1). Total num frames: 20497408. Throughput: 0: 13239.3. Samples: 20481195. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:32:06,105][213445] Avg episode reward: [(0, '4409.693')] [2023-03-07 14:32:06,121][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000020018_20498432.pth... [2023-03-07 14:32:06,149][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000016915_17320960.pth [2023-03-07 14:32:06,274][213771] Updated weights for policy 0, policy_version 20020 (0.0006) [2023-03-07 14:32:07,054][213771] Updated weights for policy 0, policy_version 20030 (0.0005) [2023-03-07 14:32:07,834][213771] Updated weights for policy 0, policy_version 20040 (0.0006) [2023-03-07 14:32:08,594][213771] Updated weights for policy 0, policy_version 20050 (0.0007) [2023-03-07 14:32:09,360][213771] Updated weights for policy 0, policy_version 20060 (0.0006) [2023-03-07 14:32:10,146][213771] Updated weights for policy 0, policy_version 20070 (0.0006) [2023-03-07 14:32:10,923][213771] Updated weights for policy 0, policy_version 20080 (0.0006) [2023-03-07 14:32:11,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13226.6, 300 sec: 13239.1). Total num frames: 20563968. Throughput: 0: 13243.7. Samples: 20560816. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:32:11,106][213445] Avg episode reward: [(0, '4455.535')] [2023-03-07 14:32:11,697][213771] Updated weights for policy 0, policy_version 20090 (0.0006) [2023-03-07 14:32:12,477][213771] Updated weights for policy 0, policy_version 20100 (0.0007) [2023-03-07 14:32:13,263][213771] Updated weights for policy 0, policy_version 20110 (0.0006) [2023-03-07 14:32:14,021][213771] Updated weights for policy 0, policy_version 20120 (0.0006) [2023-03-07 14:32:14,794][213771] Updated weights for policy 0, policy_version 20130 (0.0006) [2023-03-07 14:32:15,566][213771] Updated weights for policy 0, policy_version 20140 (0.0006) [2023-03-07 14:32:16,105][213445] Fps is (10 sec: 13311.7, 60 sec: 13243.7, 300 sec: 13239.1). Total num frames: 20630528. Throughput: 0: 13237.9. Samples: 20600409. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:32:16,106][213445] Avg episode reward: [(0, '4434.577')] [2023-03-07 14:32:16,318][213771] Updated weights for policy 0, policy_version 20150 (0.0005) [2023-03-07 14:32:17,085][213771] Updated weights for policy 0, policy_version 20160 (0.0006) [2023-03-07 14:32:17,853][213771] Updated weights for policy 0, policy_version 20170 (0.0006) [2023-03-07 14:32:18,629][213771] Updated weights for policy 0, policy_version 20180 (0.0006) [2023-03-07 14:32:19,420][213771] Updated weights for policy 0, policy_version 20190 (0.0006) [2023-03-07 14:32:20,183][213771] Updated weights for policy 0, policy_version 20200 (0.0005) [2023-03-07 14:32:20,943][213771] Updated weights for policy 0, policy_version 20210 (0.0006) [2023-03-07 14:32:21,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13242.6). Total num frames: 20697088. Throughput: 0: 13244.7. Samples: 20680223. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:32:21,106][213445] Avg episode reward: [(0, '4415.610')] [2023-03-07 14:32:21,693][213771] Updated weights for policy 0, policy_version 20220 (0.0007) [2023-03-07 14:32:22,483][213771] Updated weights for policy 0, policy_version 20230 (0.0006) [2023-03-07 14:32:23,255][213771] Updated weights for policy 0, policy_version 20240 (0.0005) [2023-03-07 14:32:24,028][213771] Updated weights for policy 0, policy_version 20250 (0.0006) [2023-03-07 14:32:24,791][213771] Updated weights for policy 0, policy_version 20260 (0.0008) [2023-03-07 14:32:25,584][213771] Updated weights for policy 0, policy_version 20270 (0.0005) [2023-03-07 14:32:26,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13239.1). Total num frames: 20762624. Throughput: 0: 13253.0. Samples: 20759972. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:32:26,106][213445] Avg episode reward: [(0, '4446.277')] [2023-03-07 14:32:26,357][213771] Updated weights for policy 0, policy_version 20280 (0.0006) [2023-03-07 14:32:27,133][213771] Updated weights for policy 0, policy_version 20290 (0.0006) [2023-03-07 14:32:27,891][213771] Updated weights for policy 0, policy_version 20300 (0.0006) [2023-03-07 14:32:28,674][213771] Updated weights for policy 0, policy_version 20310 (0.0006) [2023-03-07 14:32:29,447][213771] Updated weights for policy 0, policy_version 20320 (0.0006) [2023-03-07 14:32:30,230][213771] Updated weights for policy 0, policy_version 20330 (0.0006) [2023-03-07 14:32:30,993][213771] Updated weights for policy 0, policy_version 20340 (0.0006) [2023-03-07 14:32:31,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 20829184. Throughput: 0: 13244.9. Samples: 20799551. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:32:31,106][213445] Avg episode reward: [(0, '4433.326')] [2023-03-07 14:32:31,775][213771] Updated weights for policy 0, policy_version 20350 (0.0006) [2023-03-07 14:32:32,539][213771] Updated weights for policy 0, policy_version 20360 (0.0006) [2023-03-07 14:32:33,325][213771] Updated weights for policy 0, policy_version 20370 (0.0006) [2023-03-07 14:32:34,102][213771] Updated weights for policy 0, policy_version 20380 (0.0006) [2023-03-07 14:32:34,874][213771] Updated weights for policy 0, policy_version 20390 (0.0006) [2023-03-07 14:32:35,643][213771] Updated weights for policy 0, policy_version 20400 (0.0005) [2023-03-07 14:32:36,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13239.1). Total num frames: 20894720. Throughput: 0: 13246.6. Samples: 20878800. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:32:36,106][213445] Avg episode reward: [(0, '4429.078')] [2023-03-07 14:32:36,413][213771] Updated weights for policy 0, policy_version 20410 (0.0007) [2023-03-07 14:32:37,181][213771] Updated weights for policy 0, policy_version 20420 (0.0006) [2023-03-07 14:32:37,942][213771] Updated weights for policy 0, policy_version 20430 (0.0005) [2023-03-07 14:32:38,741][213771] Updated weights for policy 0, policy_version 20440 (0.0006) [2023-03-07 14:32:39,509][213771] Updated weights for policy 0, policy_version 20450 (0.0005) [2023-03-07 14:32:40,271][213771] Updated weights for policy 0, policy_version 20460 (0.0006) [2023-03-07 14:32:41,056][213771] Updated weights for policy 0, policy_version 20470 (0.0006) [2023-03-07 14:32:41,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 20961280. Throughput: 0: 13254.6. Samples: 20958413. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:32:41,106][213445] Avg episode reward: [(0, '4403.937')] [2023-03-07 14:32:41,812][213771] Updated weights for policy 0, policy_version 20480 (0.0006) [2023-03-07 14:32:42,590][213771] Updated weights for policy 0, policy_version 20490 (0.0006) [2023-03-07 14:32:43,367][213771] Updated weights for policy 0, policy_version 20500 (0.0006) [2023-03-07 14:32:44,150][213771] Updated weights for policy 0, policy_version 20510 (0.0007) [2023-03-07 14:32:44,925][213771] Updated weights for policy 0, policy_version 20520 (0.0006) [2023-03-07 14:32:45,680][213771] Updated weights for policy 0, policy_version 20530 (0.0006) [2023-03-07 14:32:46,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 21027840. Throughput: 0: 13260.9. Samples: 20998234. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:32:46,106][213445] Avg episode reward: [(0, '4353.472')] [2023-03-07 14:32:46,461][213771] Updated weights for policy 0, policy_version 20540 (0.0006) [2023-03-07 14:32:47,222][213771] Updated weights for policy 0, policy_version 20550 (0.0005) [2023-03-07 14:32:47,980][213771] Updated weights for policy 0, policy_version 20560 (0.0005) [2023-03-07 14:32:48,762][213771] Updated weights for policy 0, policy_version 20570 (0.0006) [2023-03-07 14:32:49,527][213771] Updated weights for policy 0, policy_version 20580 (0.0006) [2023-03-07 14:32:50,296][213771] Updated weights for policy 0, policy_version 20590 (0.0007) [2023-03-07 14:32:51,070][213771] Updated weights for policy 0, policy_version 20600 (0.0006) [2023-03-07 14:32:51,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13260.8, 300 sec: 13242.6). Total num frames: 21094400. Throughput: 0: 13261.9. Samples: 21077983. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:32:51,106][213445] Avg episode reward: [(0, '4425.059')] [2023-03-07 14:32:51,840][213771] Updated weights for policy 0, policy_version 20610 (0.0007) [2023-03-07 14:32:52,606][213771] Updated weights for policy 0, policy_version 20620 (0.0005) [2023-03-07 14:32:53,391][213771] Updated weights for policy 0, policy_version 20630 (0.0007) [2023-03-07 14:32:54,170][213771] Updated weights for policy 0, policy_version 20640 (0.0006) [2023-03-07 14:32:54,940][213771] Updated weights for policy 0, policy_version 20650 (0.0006) [2023-03-07 14:32:55,711][213771] Updated weights for policy 0, policy_version 20660 (0.0006) [2023-03-07 14:32:56,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13242.6). Total num frames: 21160960. Throughput: 0: 13258.8. Samples: 21157464. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:32:56,106][213445] Avg episode reward: [(0, '4409.769')] [2023-03-07 14:32:56,488][213771] Updated weights for policy 0, policy_version 20670 (0.0008) [2023-03-07 14:32:57,258][213771] Updated weights for policy 0, policy_version 20680 (0.0006) [2023-03-07 14:32:58,042][213771] Updated weights for policy 0, policy_version 20690 (0.0006) [2023-03-07 14:32:58,829][213771] Updated weights for policy 0, policy_version 20700 (0.0006) [2023-03-07 14:32:59,606][213771] Updated weights for policy 0, policy_version 20710 (0.0006) [2023-03-07 14:33:00,397][213771] Updated weights for policy 0, policy_version 20720 (0.0006) [2023-03-07 14:33:01,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13260.8, 300 sec: 13242.6). Total num frames: 21226496. Throughput: 0: 13254.8. Samples: 21196872. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:33:01,106][213445] Avg episode reward: [(0, '4281.110')] [2023-03-07 14:33:01,157][213771] Updated weights for policy 0, policy_version 20730 (0.0007) [2023-03-07 14:33:01,942][213771] Updated weights for policy 0, policy_version 20740 (0.0005) [2023-03-07 14:33:02,714][213771] Updated weights for policy 0, policy_version 20750 (0.0006) [2023-03-07 14:33:03,478][213771] Updated weights for policy 0, policy_version 20760 (0.0005) [2023-03-07 14:33:04,244][213771] Updated weights for policy 0, policy_version 20770 (0.0005) [2023-03-07 14:33:05,032][213771] Updated weights for policy 0, policy_version 20780 (0.0006) [2023-03-07 14:33:05,799][213771] Updated weights for policy 0, policy_version 20790 (0.0005) [2023-03-07 14:33:06,105][213445] Fps is (10 sec: 13107.3, 60 sec: 13243.7, 300 sec: 13239.1). Total num frames: 21292032. Throughput: 0: 13245.3. Samples: 21276260. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:33:06,106][213445] Avg episode reward: [(0, '4392.606')] [2023-03-07 14:33:06,582][213771] Updated weights for policy 0, policy_version 20800 (0.0006) [2023-03-07 14:33:07,339][213771] Updated weights for policy 0, policy_version 20810 (0.0006) [2023-03-07 14:33:08,125][213771] Updated weights for policy 0, policy_version 20820 (0.0006) [2023-03-07 14:33:08,887][213771] Updated weights for policy 0, policy_version 20830 (0.0006) [2023-03-07 14:33:09,659][213771] Updated weights for policy 0, policy_version 20840 (0.0006) [2023-03-07 14:33:10,438][213771] Updated weights for policy 0, policy_version 20850 (0.0006) [2023-03-07 14:33:11,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13239.1). Total num frames: 21358592. Throughput: 0: 13236.9. Samples: 21355635. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:33:11,106][213445] Avg episode reward: [(0, '4335.191')] [2023-03-07 14:33:11,208][213771] Updated weights for policy 0, policy_version 20860 (0.0007) [2023-03-07 14:33:11,998][213771] Updated weights for policy 0, policy_version 20870 (0.0006) [2023-03-07 14:33:12,774][213771] Updated weights for policy 0, policy_version 20880 (0.0006) [2023-03-07 14:33:13,558][213771] Updated weights for policy 0, policy_version 20890 (0.0008) [2023-03-07 14:33:14,341][213771] Updated weights for policy 0, policy_version 20900 (0.0006) [2023-03-07 14:33:15,113][213771] Updated weights for policy 0, policy_version 20910 (0.0006) [2023-03-07 14:33:15,869][213771] Updated weights for policy 0, policy_version 20920 (0.0007) [2023-03-07 14:33:16,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13243.7, 300 sec: 13239.1). Total num frames: 21425152. Throughput: 0: 13235.1. Samples: 21395130. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:33:16,106][213445] Avg episode reward: [(0, '4423.499')] [2023-03-07 14:33:16,650][213771] Updated weights for policy 0, policy_version 20930 (0.0006) [2023-03-07 14:33:17,443][213771] Updated weights for policy 0, policy_version 20940 (0.0006) [2023-03-07 14:33:18,210][213771] Updated weights for policy 0, policy_version 20950 (0.0006) [2023-03-07 14:33:18,979][213771] Updated weights for policy 0, policy_version 20960 (0.0007) [2023-03-07 14:33:19,773][213771] Updated weights for policy 0, policy_version 20970 (0.0008) [2023-03-07 14:33:20,541][213771] Updated weights for policy 0, policy_version 20980 (0.0006) [2023-03-07 14:33:21,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13239.1). Total num frames: 21490688. Throughput: 0: 13235.5. Samples: 21474399. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:33:21,106][213445] Avg episode reward: [(0, '4392.970')] [2023-03-07 14:33:21,319][213771] Updated weights for policy 0, policy_version 20990 (0.0006) [2023-03-07 14:33:22,085][213771] Updated weights for policy 0, policy_version 21000 (0.0007) [2023-03-07 14:33:22,858][213771] Updated weights for policy 0, policy_version 21010 (0.0006) [2023-03-07 14:33:23,617][213771] Updated weights for policy 0, policy_version 21020 (0.0006) [2023-03-07 14:33:24,371][213771] Updated weights for policy 0, policy_version 21030 (0.0006) [2023-03-07 14:33:25,155][213771] Updated weights for policy 0, policy_version 21040 (0.0006) [2023-03-07 14:33:25,933][213771] Updated weights for policy 0, policy_version 21050 (0.0006) [2023-03-07 14:33:26,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.8, 300 sec: 13239.1). Total num frames: 21557248. Throughput: 0: 13232.1. Samples: 21553859. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:33:26,106][213445] Avg episode reward: [(0, '4383.250')] [2023-03-07 14:33:26,718][213771] Updated weights for policy 0, policy_version 21060 (0.0006) [2023-03-07 14:33:27,500][213771] Updated weights for policy 0, policy_version 21070 (0.0007) [2023-03-07 14:33:28,282][213771] Updated weights for policy 0, policy_version 21080 (0.0007) [2023-03-07 14:33:29,036][213771] Updated weights for policy 0, policy_version 21090 (0.0006) [2023-03-07 14:33:29,807][213771] Updated weights for policy 0, policy_version 21100 (0.0007) [2023-03-07 14:33:30,574][213771] Updated weights for policy 0, policy_version 21110 (0.0006) [2023-03-07 14:33:31,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13226.7, 300 sec: 13235.6). Total num frames: 21622784. Throughput: 0: 13226.8. Samples: 21593438. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:33:31,105][213445] Avg episode reward: [(0, '4379.505')] [2023-03-07 14:33:31,343][213771] Updated weights for policy 0, policy_version 21120 (0.0006) [2023-03-07 14:33:32,120][213771] Updated weights for policy 0, policy_version 21130 (0.0007) [2023-03-07 14:33:32,883][213771] Updated weights for policy 0, policy_version 21140 (0.0006) [2023-03-07 14:33:33,651][213771] Updated weights for policy 0, policy_version 21150 (0.0006) [2023-03-07 14:33:34,422][213771] Updated weights for policy 0, policy_version 21160 (0.0006) [2023-03-07 14:33:35,186][213771] Updated weights for policy 0, policy_version 21170 (0.0006) [2023-03-07 14:33:35,958][213771] Updated weights for policy 0, policy_version 21180 (0.0006) [2023-03-07 14:33:36,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13243.7, 300 sec: 13235.6). Total num frames: 21689344. Throughput: 0: 13229.4. Samples: 21673308. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:33:36,106][213445] Avg episode reward: [(0, '4382.666')] [2023-03-07 14:33:36,736][213771] Updated weights for policy 0, policy_version 21190 (0.0006) [2023-03-07 14:33:37,517][213771] Updated weights for policy 0, policy_version 21200 (0.0007) [2023-03-07 14:33:38,308][213771] Updated weights for policy 0, policy_version 21210 (0.0006) [2023-03-07 14:33:39,082][213771] Updated weights for policy 0, policy_version 21220 (0.0005) [2023-03-07 14:33:39,846][213771] Updated weights for policy 0, policy_version 21230 (0.0006) [2023-03-07 14:33:40,618][213771] Updated weights for policy 0, policy_version 21240 (0.0006) [2023-03-07 14:33:41,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13243.7, 300 sec: 13239.1). Total num frames: 21755904. Throughput: 0: 13226.7. Samples: 21752664. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:33:41,106][213445] Avg episode reward: [(0, '4400.359')] [2023-03-07 14:33:41,394][213771] Updated weights for policy 0, policy_version 21250 (0.0005) [2023-03-07 14:33:42,171][213771] Updated weights for policy 0, policy_version 21260 (0.0006) [2023-03-07 14:33:42,926][213771] Updated weights for policy 0, policy_version 21270 (0.0006) [2023-03-07 14:33:43,705][213771] Updated weights for policy 0, policy_version 21280 (0.0006) [2023-03-07 14:33:44,469][213771] Updated weights for policy 0, policy_version 21290 (0.0006) [2023-03-07 14:33:45,225][213771] Updated weights for policy 0, policy_version 21300 (0.0006) [2023-03-07 14:33:45,998][213771] Updated weights for policy 0, policy_version 21310 (0.0006) [2023-03-07 14:33:46,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13243.8, 300 sec: 13239.1). Total num frames: 21822464. Throughput: 0: 13238.1. Samples: 21792587. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:33:46,106][213445] Avg episode reward: [(0, '4411.834')] [2023-03-07 14:33:46,787][213771] Updated weights for policy 0, policy_version 21320 (0.0006) [2023-03-07 14:33:47,557][213771] Updated weights for policy 0, policy_version 21330 (0.0007) [2023-03-07 14:33:48,323][213771] Updated weights for policy 0, policy_version 21340 (0.0006) [2023-03-07 14:33:49,100][213771] Updated weights for policy 0, policy_version 21350 (0.0006) [2023-03-07 14:33:49,851][213771] Updated weights for policy 0, policy_version 21360 (0.0006) [2023-03-07 14:33:50,632][213771] Updated weights for policy 0, policy_version 21370 (0.0006) [2023-03-07 14:33:51,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 21889024. Throughput: 0: 13242.8. Samples: 21872187. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:33:51,106][213445] Avg episode reward: [(0, '4296.083')] [2023-03-07 14:33:51,405][213771] Updated weights for policy 0, policy_version 21380 (0.0006) [2023-03-07 14:33:52,155][213771] Updated weights for policy 0, policy_version 21390 (0.0007) [2023-03-07 14:33:52,937][213771] Updated weights for policy 0, policy_version 21400 (0.0006) [2023-03-07 14:33:53,723][213771] Updated weights for policy 0, policy_version 21410 (0.0005) [2023-03-07 14:33:54,477][213771] Updated weights for policy 0, policy_version 21420 (0.0006) [2023-03-07 14:33:55,252][213771] Updated weights for policy 0, policy_version 21430 (0.0006) [2023-03-07 14:33:56,027][213771] Updated weights for policy 0, policy_version 21440 (0.0007) [2023-03-07 14:33:56,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13243.8, 300 sec: 13242.6). Total num frames: 21955584. Throughput: 0: 13252.7. Samples: 21952005. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:33:56,106][213445] Avg episode reward: [(0, '4326.567')] [2023-03-07 14:33:56,806][213771] Updated weights for policy 0, policy_version 21450 (0.0006) [2023-03-07 14:33:57,571][213771] Updated weights for policy 0, policy_version 21460 (0.0006) [2023-03-07 14:33:58,322][213771] Updated weights for policy 0, policy_version 21470 (0.0006) [2023-03-07 14:33:59,132][213771] Updated weights for policy 0, policy_version 21480 (0.0006) [2023-03-07 14:33:59,899][213771] Updated weights for policy 0, policy_version 21490 (0.0007) [2023-03-07 14:34:00,677][213771] Updated weights for policy 0, policy_version 21500 (0.0006) [2023-03-07 14:34:01,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13239.1). Total num frames: 22021120. Throughput: 0: 13259.2. Samples: 21991791. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:34:01,106][213445] Avg episode reward: [(0, '4161.468')] [2023-03-07 14:34:01,425][213771] Updated weights for policy 0, policy_version 21510 (0.0006) [2023-03-07 14:34:02,214][213771] Updated weights for policy 0, policy_version 21520 (0.0006) [2023-03-07 14:34:02,987][213771] Updated weights for policy 0, policy_version 21530 (0.0005) [2023-03-07 14:34:03,765][213771] Updated weights for policy 0, policy_version 21540 (0.0007) [2023-03-07 14:34:04,535][213771] Updated weights for policy 0, policy_version 21550 (0.0007) [2023-03-07 14:34:05,328][213771] Updated weights for policy 0, policy_version 21560 (0.0006) [2023-03-07 14:34:06,101][213771] Updated weights for policy 0, policy_version 21570 (0.0005) [2023-03-07 14:34:06,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13242.6). Total num frames: 22087680. Throughput: 0: 13257.9. Samples: 22071002. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:34:06,106][213445] Avg episode reward: [(0, '4358.941')] [2023-03-07 14:34:06,113][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000021570_22087680.pth... [2023-03-07 14:34:06,145][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000018466_18909184.pth [2023-03-07 14:34:06,875][213771] Updated weights for policy 0, policy_version 21580 (0.0007) [2023-03-07 14:34:07,638][213771] Updated weights for policy 0, policy_version 21590 (0.0006) [2023-03-07 14:34:08,404][213771] Updated weights for policy 0, policy_version 21600 (0.0006) [2023-03-07 14:34:09,199][213771] Updated weights for policy 0, policy_version 21610 (0.0006) [2023-03-07 14:34:09,976][213771] Updated weights for policy 0, policy_version 21620 (0.0006) [2023-03-07 14:34:10,745][213771] Updated weights for policy 0, policy_version 21630 (0.0007) [2023-03-07 14:34:11,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.8, 300 sec: 13239.1). Total num frames: 22153216. Throughput: 0: 13255.4. Samples: 22150354. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:34:11,106][213445] Avg episode reward: [(0, '4422.636')] [2023-03-07 14:34:11,531][213771] Updated weights for policy 0, policy_version 21640 (0.0006) [2023-03-07 14:34:12,308][213771] Updated weights for policy 0, policy_version 21650 (0.0006) [2023-03-07 14:34:13,078][213771] Updated weights for policy 0, policy_version 21660 (0.0007) [2023-03-07 14:34:13,860][213771] Updated weights for policy 0, policy_version 21670 (0.0006) [2023-03-07 14:34:14,641][213771] Updated weights for policy 0, policy_version 21680 (0.0006) [2023-03-07 14:34:15,402][213771] Updated weights for policy 0, policy_version 21690 (0.0007) [2023-03-07 14:34:16,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.8, 300 sec: 13242.6). Total num frames: 22219776. Throughput: 0: 13252.4. Samples: 22189798. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:34:16,106][213445] Avg episode reward: [(0, '4429.199')] [2023-03-07 14:34:16,194][213771] Updated weights for policy 0, policy_version 21700 (0.0006) [2023-03-07 14:34:16,957][213771] Updated weights for policy 0, policy_version 21710 (0.0006) [2023-03-07 14:34:17,720][213771] Updated weights for policy 0, policy_version 21720 (0.0006) [2023-03-07 14:34:18,485][213771] Updated weights for policy 0, policy_version 21730 (0.0006) [2023-03-07 14:34:19,253][213771] Updated weights for policy 0, policy_version 21740 (0.0006) [2023-03-07 14:34:20,029][213771] Updated weights for policy 0, policy_version 21750 (0.0006) [2023-03-07 14:34:20,793][213771] Updated weights for policy 0, policy_version 21760 (0.0006) [2023-03-07 14:34:21,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13242.6). Total num frames: 22286336. Throughput: 0: 13248.7. Samples: 22269498. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:34:21,106][213445] Avg episode reward: [(0, '4380.593')] [2023-03-07 14:34:21,559][213771] Updated weights for policy 0, policy_version 21770 (0.0006) [2023-03-07 14:34:22,330][213771] Updated weights for policy 0, policy_version 21780 (0.0005) [2023-03-07 14:34:23,109][213771] Updated weights for policy 0, policy_version 21790 (0.0006) [2023-03-07 14:34:23,885][213771] Updated weights for policy 0, policy_version 21800 (0.0007) [2023-03-07 14:34:24,634][213771] Updated weights for policy 0, policy_version 21810 (0.0006) [2023-03-07 14:34:25,413][213771] Updated weights for policy 0, policy_version 21820 (0.0006) [2023-03-07 14:34:26,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 22351872. Throughput: 0: 13256.3. Samples: 22349201. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:34:26,106][213445] Avg episode reward: [(0, '4407.810')] [2023-03-07 14:34:26,186][213771] Updated weights for policy 0, policy_version 21830 (0.0006) [2023-03-07 14:34:26,957][213771] Updated weights for policy 0, policy_version 21840 (0.0006) [2023-03-07 14:34:27,721][213771] Updated weights for policy 0, policy_version 21850 (0.0006) [2023-03-07 14:34:28,495][213771] Updated weights for policy 0, policy_version 21860 (0.0006) [2023-03-07 14:34:29,264][213771] Updated weights for policy 0, policy_version 21870 (0.0007) [2023-03-07 14:34:30,033][213771] Updated weights for policy 0, policy_version 21880 (0.0005) [2023-03-07 14:34:30,804][213771] Updated weights for policy 0, policy_version 21890 (0.0006) [2023-03-07 14:34:31,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13242.6). Total num frames: 22418432. Throughput: 0: 13254.8. Samples: 22389052. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:34:31,106][213445] Avg episode reward: [(0, '4456.247')] [2023-03-07 14:34:31,581][213771] Updated weights for policy 0, policy_version 21900 (0.0005) [2023-03-07 14:34:32,361][213771] Updated weights for policy 0, policy_version 21910 (0.0007) [2023-03-07 14:34:33,142][213771] Updated weights for policy 0, policy_version 21920 (0.0006) [2023-03-07 14:34:33,905][213771] Updated weights for policy 0, policy_version 21930 (0.0006) [2023-03-07 14:34:34,700][213771] Updated weights for policy 0, policy_version 21940 (0.0006) [2023-03-07 14:34:35,473][213771] Updated weights for policy 0, policy_version 21950 (0.0006) [2023-03-07 14:34:36,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13260.8, 300 sec: 13242.6). Total num frames: 22484992. Throughput: 0: 13249.6. Samples: 22468418. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:34:36,106][213445] Avg episode reward: [(0, '4309.199')] [2023-03-07 14:34:36,257][213771] Updated weights for policy 0, policy_version 21960 (0.0006) [2023-03-07 14:34:37,014][213771] Updated weights for policy 0, policy_version 21970 (0.0005) [2023-03-07 14:34:37,809][213771] Updated weights for policy 0, policy_version 21980 (0.0006) [2023-03-07 14:34:38,568][213771] Updated weights for policy 0, policy_version 21990 (0.0006) [2023-03-07 14:34:39,353][213771] Updated weights for policy 0, policy_version 22000 (0.0006) [2023-03-07 14:34:40,126][213771] Updated weights for policy 0, policy_version 22010 (0.0007) [2023-03-07 14:34:40,893][213771] Updated weights for policy 0, policy_version 22020 (0.0006) [2023-03-07 14:34:41,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 22550528. Throughput: 0: 13236.8. Samples: 22547659. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:34:41,106][213445] Avg episode reward: [(0, '4375.178')] [2023-03-07 14:34:41,662][213771] Updated weights for policy 0, policy_version 22030 (0.0006) [2023-03-07 14:34:42,439][213771] Updated weights for policy 0, policy_version 22040 (0.0007) [2023-03-07 14:34:43,201][213771] Updated weights for policy 0, policy_version 22050 (0.0006) [2023-03-07 14:34:43,980][213771] Updated weights for policy 0, policy_version 22060 (0.0008) [2023-03-07 14:34:44,767][213771] Updated weights for policy 0, policy_version 22070 (0.0008) [2023-03-07 14:34:45,529][213771] Updated weights for policy 0, policy_version 22080 (0.0006) [2023-03-07 14:34:46,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 22617088. Throughput: 0: 13237.6. Samples: 22587484. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:34:46,106][213445] Avg episode reward: [(0, '4436.911')] [2023-03-07 14:34:46,311][213771] Updated weights for policy 0, policy_version 22090 (0.0006) [2023-03-07 14:34:47,100][213771] Updated weights for policy 0, policy_version 22100 (0.0007) [2023-03-07 14:34:47,873][213771] Updated weights for policy 0, policy_version 22110 (0.0006) [2023-03-07 14:34:48,642][213771] Updated weights for policy 0, policy_version 22120 (0.0006) [2023-03-07 14:34:49,410][213771] Updated weights for policy 0, policy_version 22130 (0.0006) [2023-03-07 14:34:50,179][213771] Updated weights for policy 0, policy_version 22140 (0.0006) [2023-03-07 14:34:50,948][213771] Updated weights for policy 0, policy_version 22150 (0.0006) [2023-03-07 14:34:51,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 22683648. Throughput: 0: 13239.6. Samples: 22666782. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:34:51,106][213445] Avg episode reward: [(0, '4373.970')] [2023-03-07 14:34:51,708][213771] Updated weights for policy 0, policy_version 22160 (0.0006) [2023-03-07 14:34:52,493][213771] Updated weights for policy 0, policy_version 22170 (0.0006) [2023-03-07 14:34:53,267][213771] Updated weights for policy 0, policy_version 22180 (0.0005) [2023-03-07 14:34:54,040][213771] Updated weights for policy 0, policy_version 22190 (0.0006) [2023-03-07 14:34:54,811][213771] Updated weights for policy 0, policy_version 22200 (0.0006) [2023-03-07 14:34:55,578][213771] Updated weights for policy 0, policy_version 22210 (0.0006) [2023-03-07 14:34:56,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13226.6, 300 sec: 13242.6). Total num frames: 22749184. Throughput: 0: 13241.8. Samples: 22746235. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:34:56,106][213445] Avg episode reward: [(0, '4376.719')] [2023-03-07 14:34:56,365][213771] Updated weights for policy 0, policy_version 22220 (0.0006) [2023-03-07 14:34:57,134][213771] Updated weights for policy 0, policy_version 22230 (0.0007) [2023-03-07 14:34:57,885][213771] Updated weights for policy 0, policy_version 22240 (0.0006) [2023-03-07 14:34:58,689][213771] Updated weights for policy 0, policy_version 22250 (0.0006) [2023-03-07 14:34:59,456][213771] Updated weights for policy 0, policy_version 22260 (0.0007) [2023-03-07 14:35:00,253][213771] Updated weights for policy 0, policy_version 22270 (0.0007) [2023-03-07 14:35:01,029][213771] Updated weights for policy 0, policy_version 22280 (0.0006) [2023-03-07 14:35:01,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 22815744. Throughput: 0: 13245.8. Samples: 22785860. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:35:01,106][213445] Avg episode reward: [(0, '4293.239')] [2023-03-07 14:35:01,790][213771] Updated weights for policy 0, policy_version 22290 (0.0008) [2023-03-07 14:35:02,584][213771] Updated weights for policy 0, policy_version 22300 (0.0006) [2023-03-07 14:35:03,366][213771] Updated weights for policy 0, policy_version 22310 (0.0006) [2023-03-07 14:35:04,141][213771] Updated weights for policy 0, policy_version 22320 (0.0006) [2023-03-07 14:35:04,914][213771] Updated weights for policy 0, policy_version 22330 (0.0006) [2023-03-07 14:35:05,694][213771] Updated weights for policy 0, policy_version 22340 (0.0006) [2023-03-07 14:35:06,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.6, 300 sec: 13242.6). Total num frames: 22881280. Throughput: 0: 13233.9. Samples: 22865024. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:35:06,106][213445] Avg episode reward: [(0, '4449.936')] [2023-03-07 14:35:06,479][213771] Updated weights for policy 0, policy_version 22350 (0.0006) [2023-03-07 14:35:07,245][213771] Updated weights for policy 0, policy_version 22360 (0.0006) [2023-03-07 14:35:08,037][213771] Updated weights for policy 0, policy_version 22370 (0.0006) [2023-03-07 14:35:08,800][213771] Updated weights for policy 0, policy_version 22380 (0.0006) [2023-03-07 14:35:09,591][213771] Updated weights for policy 0, policy_version 22390 (0.0007) [2023-03-07 14:35:10,355][213771] Updated weights for policy 0, policy_version 22400 (0.0006) [2023-03-07 14:35:11,105][213445] Fps is (10 sec: 13107.1, 60 sec: 13226.6, 300 sec: 13239.1). Total num frames: 22946816. Throughput: 0: 13220.0. Samples: 22944101. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:35:11,106][213445] Avg episode reward: [(0, '4408.012')] [2023-03-07 14:35:11,126][213771] Updated weights for policy 0, policy_version 22410 (0.0006) [2023-03-07 14:35:11,891][213771] Updated weights for policy 0, policy_version 22420 (0.0006) [2023-03-07 14:35:12,670][213771] Updated weights for policy 0, policy_version 22430 (0.0007) [2023-03-07 14:35:13,437][213771] Updated weights for policy 0, policy_version 22440 (0.0006) [2023-03-07 14:35:14,201][213771] Updated weights for policy 0, policy_version 22450 (0.0005) [2023-03-07 14:35:14,987][213771] Updated weights for policy 0, policy_version 22460 (0.0006) [2023-03-07 14:35:15,756][213771] Updated weights for policy 0, policy_version 22470 (0.0006) [2023-03-07 14:35:16,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13226.7, 300 sec: 13242.6). Total num frames: 23013376. Throughput: 0: 13218.0. Samples: 22983861. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:35:16,106][213445] Avg episode reward: [(0, '4406.633')] [2023-03-07 14:35:16,531][213771] Updated weights for policy 0, policy_version 22480 (0.0006) [2023-03-07 14:35:17,303][213771] Updated weights for policy 0, policy_version 22490 (0.0005) [2023-03-07 14:35:18,080][213771] Updated weights for policy 0, policy_version 22500 (0.0006) [2023-03-07 14:35:18,858][213771] Updated weights for policy 0, policy_version 22510 (0.0006) [2023-03-07 14:35:19,630][213771] Updated weights for policy 0, policy_version 22520 (0.0006) [2023-03-07 14:35:20,382][213771] Updated weights for policy 0, policy_version 22530 (0.0006) [2023-03-07 14:35:21,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13226.7, 300 sec: 13242.6). Total num frames: 23079936. Throughput: 0: 13219.9. Samples: 23063314. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:35:21,106][213445] Avg episode reward: [(0, '4362.877')] [2023-03-07 14:35:21,160][213771] Updated weights for policy 0, policy_version 22540 (0.0007) [2023-03-07 14:35:21,920][213771] Updated weights for policy 0, policy_version 22550 (0.0007) [2023-03-07 14:35:22,686][213771] Updated weights for policy 0, policy_version 22560 (0.0005) [2023-03-07 14:35:23,449][213771] Updated weights for policy 0, policy_version 22570 (0.0005) [2023-03-07 14:35:24,211][213771] Updated weights for policy 0, policy_version 22580 (0.0006) [2023-03-07 14:35:24,991][213771] Updated weights for policy 0, policy_version 22590 (0.0006) [2023-03-07 14:35:25,756][213771] Updated weights for policy 0, policy_version 22600 (0.0006) [2023-03-07 14:35:26,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 23146496. Throughput: 0: 13239.3. Samples: 23143430. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:35:26,106][213445] Avg episode reward: [(0, '4383.931')] [2023-03-07 14:35:26,541][213771] Updated weights for policy 0, policy_version 22610 (0.0006) [2023-03-07 14:35:27,307][213771] Updated weights for policy 0, policy_version 22620 (0.0006) [2023-03-07 14:35:28,072][213771] Updated weights for policy 0, policy_version 22630 (0.0007) [2023-03-07 14:35:28,837][213771] Updated weights for policy 0, policy_version 22640 (0.0006) [2023-03-07 14:35:29,596][213771] Updated weights for policy 0, policy_version 22650 (0.0006) [2023-03-07 14:35:30,385][213771] Updated weights for policy 0, policy_version 22660 (0.0005) [2023-03-07 14:35:31,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 23213056. Throughput: 0: 13240.0. Samples: 23183285. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:35:31,106][213445] Avg episode reward: [(0, '4238.482')] [2023-03-07 14:35:31,150][213771] Updated weights for policy 0, policy_version 22670 (0.0006) [2023-03-07 14:35:31,925][213771] Updated weights for policy 0, policy_version 22680 (0.0006) [2023-03-07 14:35:32,699][213771] Updated weights for policy 0, policy_version 22690 (0.0006) [2023-03-07 14:35:33,467][213771] Updated weights for policy 0, policy_version 22700 (0.0006) [2023-03-07 14:35:34,234][213771] Updated weights for policy 0, policy_version 22710 (0.0007) [2023-03-07 14:35:35,030][213771] Updated weights for policy 0, policy_version 22720 (0.0006) [2023-03-07 14:35:35,803][213771] Updated weights for policy 0, policy_version 22730 (0.0006) [2023-03-07 14:35:36,105][213445] Fps is (10 sec: 13209.9, 60 sec: 13226.7, 300 sec: 13242.6). Total num frames: 23278592. Throughput: 0: 13246.8. Samples: 23262887. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:35:36,105][213445] Avg episode reward: [(0, '4297.581')] [2023-03-07 14:35:36,573][213771] Updated weights for policy 0, policy_version 22740 (0.0006) [2023-03-07 14:35:37,332][213771] Updated weights for policy 0, policy_version 22750 (0.0006) [2023-03-07 14:35:38,091][213771] Updated weights for policy 0, policy_version 22760 (0.0006) [2023-03-07 14:35:38,881][213771] Updated weights for policy 0, policy_version 22770 (0.0006) [2023-03-07 14:35:39,654][213771] Updated weights for policy 0, policy_version 22780 (0.0006) [2023-03-07 14:35:40,426][213771] Updated weights for policy 0, policy_version 22790 (0.0007) [2023-03-07 14:35:41,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13246.0). Total num frames: 23346176. Throughput: 0: 13251.2. Samples: 23342537. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:35:41,106][213445] Avg episode reward: [(0, '4442.280')] [2023-03-07 14:35:41,181][213771] Updated weights for policy 0, policy_version 22800 (0.0006) [2023-03-07 14:35:41,974][213771] Updated weights for policy 0, policy_version 22810 (0.0006) [2023-03-07 14:35:42,749][213771] Updated weights for policy 0, policy_version 22820 (0.0006) [2023-03-07 14:35:43,531][213771] Updated weights for policy 0, policy_version 22830 (0.0007) [2023-03-07 14:35:44,300][213771] Updated weights for policy 0, policy_version 22840 (0.0007) [2023-03-07 14:35:45,076][213771] Updated weights for policy 0, policy_version 22850 (0.0007) [2023-03-07 14:35:45,853][213771] Updated weights for policy 0, policy_version 22860 (0.0006) [2023-03-07 14:35:46,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 23411712. Throughput: 0: 13247.8. Samples: 23382012. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:35:46,106][213445] Avg episode reward: [(0, '4372.910')] [2023-03-07 14:35:46,626][213771] Updated weights for policy 0, policy_version 22870 (0.0007) [2023-03-07 14:35:47,412][213771] Updated weights for policy 0, policy_version 22880 (0.0006) [2023-03-07 14:35:48,182][213771] Updated weights for policy 0, policy_version 22890 (0.0006) [2023-03-07 14:35:48,938][213771] Updated weights for policy 0, policy_version 22900 (0.0006) [2023-03-07 14:35:49,714][213771] Updated weights for policy 0, policy_version 22910 (0.0006) [2023-03-07 14:35:50,493][213771] Updated weights for policy 0, policy_version 22920 (0.0006) [2023-03-07 14:35:51,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.8, 300 sec: 13246.1). Total num frames: 23478272. Throughput: 0: 13255.6. Samples: 23461521. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:35:51,106][213445] Avg episode reward: [(0, '4342.815')] [2023-03-07 14:35:51,257][213771] Updated weights for policy 0, policy_version 22930 (0.0006) [2023-03-07 14:35:52,044][213771] Updated weights for policy 0, policy_version 22940 (0.0007) [2023-03-07 14:35:52,793][213771] Updated weights for policy 0, policy_version 22950 (0.0006) [2023-03-07 14:35:53,577][213771] Updated weights for policy 0, policy_version 22960 (0.0006) [2023-03-07 14:35:54,362][213771] Updated weights for policy 0, policy_version 22970 (0.0006) [2023-03-07 14:35:55,135][213771] Updated weights for policy 0, policy_version 22980 (0.0006) [2023-03-07 14:35:55,907][213771] Updated weights for policy 0, policy_version 22990 (0.0006) [2023-03-07 14:35:56,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 23543808. Throughput: 0: 13257.7. Samples: 23540699. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:35:56,106][213445] Avg episode reward: [(0, '4339.703')] [2023-03-07 14:35:56,685][213771] Updated weights for policy 0, policy_version 23000 (0.0006) [2023-03-07 14:35:57,442][213771] Updated weights for policy 0, policy_version 23010 (0.0006) [2023-03-07 14:35:58,244][213771] Updated weights for policy 0, policy_version 23020 (0.0006) [2023-03-07 14:35:59,024][213771] Updated weights for policy 0, policy_version 23030 (0.0008) [2023-03-07 14:35:59,794][213771] Updated weights for policy 0, policy_version 23040 (0.0006) [2023-03-07 14:36:00,558][213771] Updated weights for policy 0, policy_version 23050 (0.0006) [2023-03-07 14:36:01,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 23610368. Throughput: 0: 13252.4. Samples: 23580221. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:36:01,106][213445] Avg episode reward: [(0, '4235.691')] [2023-03-07 14:36:01,326][213771] Updated weights for policy 0, policy_version 23060 (0.0006) [2023-03-07 14:36:02,114][213771] Updated weights for policy 0, policy_version 23070 (0.0006) [2023-03-07 14:36:02,880][213771] Updated weights for policy 0, policy_version 23080 (0.0006) [2023-03-07 14:36:03,644][213771] Updated weights for policy 0, policy_version 23090 (0.0006) [2023-03-07 14:36:04,436][213771] Updated weights for policy 0, policy_version 23100 (0.0006) [2023-03-07 14:36:05,199][213771] Updated weights for policy 0, policy_version 23110 (0.0006) [2023-03-07 14:36:05,978][213771] Updated weights for policy 0, policy_version 23120 (0.0006) [2023-03-07 14:36:06,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13243.8, 300 sec: 13239.1). Total num frames: 23675904. Throughput: 0: 13255.1. Samples: 23659793. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:36:06,106][213445] Avg episode reward: [(0, '4335.511')] [2023-03-07 14:36:06,123][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000023122_23676928.pth... [2023-03-07 14:36:06,153][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000020018_20498432.pth [2023-03-07 14:36:06,730][213771] Updated weights for policy 0, policy_version 23130 (0.0005) [2023-03-07 14:36:07,510][213771] Updated weights for policy 0, policy_version 23140 (0.0007) [2023-03-07 14:36:08,281][213771] Updated weights for policy 0, policy_version 23150 (0.0006) [2023-03-07 14:36:09,044][213771] Updated weights for policy 0, policy_version 23160 (0.0006) [2023-03-07 14:36:09,814][213771] Updated weights for policy 0, policy_version 23170 (0.0006) [2023-03-07 14:36:10,584][213771] Updated weights for policy 0, policy_version 23180 (0.0006) [2023-03-07 14:36:11,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13242.6). Total num frames: 23742464. Throughput: 0: 13252.8. Samples: 23739806. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:36:11,106][213445] Avg episode reward: [(0, '4400.159')] [2023-03-07 14:36:11,361][213771] Updated weights for policy 0, policy_version 23190 (0.0005) [2023-03-07 14:36:12,150][213771] Updated weights for policy 0, policy_version 23200 (0.0007) [2023-03-07 14:36:12,907][213771] Updated weights for policy 0, policy_version 23210 (0.0005) [2023-03-07 14:36:13,688][213771] Updated weights for policy 0, policy_version 23220 (0.0007) [2023-03-07 14:36:14,462][213771] Updated weights for policy 0, policy_version 23230 (0.0006) [2023-03-07 14:36:15,233][213771] Updated weights for policy 0, policy_version 23240 (0.0007) [2023-03-07 14:36:16,010][213771] Updated weights for policy 0, policy_version 23250 (0.0005) [2023-03-07 14:36:16,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13260.8, 300 sec: 13246.0). Total num frames: 23809024. Throughput: 0: 13246.2. Samples: 23779367. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:36:16,106][213445] Avg episode reward: [(0, '4405.637')] [2023-03-07 14:36:16,787][213771] Updated weights for policy 0, policy_version 23260 (0.0007) [2023-03-07 14:36:17,564][213771] Updated weights for policy 0, policy_version 23270 (0.0006) [2023-03-07 14:36:18,332][213771] Updated weights for policy 0, policy_version 23280 (0.0007) [2023-03-07 14:36:19,109][213771] Updated weights for policy 0, policy_version 23290 (0.0007) [2023-03-07 14:36:19,875][213771] Updated weights for policy 0, policy_version 23300 (0.0006) [2023-03-07 14:36:20,657][213771] Updated weights for policy 0, policy_version 23310 (0.0006) [2023-03-07 14:36:21,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 23874560. Throughput: 0: 13236.7. Samples: 23858539. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:36:21,106][213445] Avg episode reward: [(0, '4394.104')] [2023-03-07 14:36:21,439][213771] Updated weights for policy 0, policy_version 23320 (0.0006) [2023-03-07 14:36:22,194][213771] Updated weights for policy 0, policy_version 23330 (0.0007) [2023-03-07 14:36:22,983][213771] Updated weights for policy 0, policy_version 23340 (0.0006) [2023-03-07 14:36:23,756][213771] Updated weights for policy 0, policy_version 23350 (0.0006) [2023-03-07 14:36:24,511][213771] Updated weights for policy 0, policy_version 23360 (0.0005) [2023-03-07 14:36:25,275][213771] Updated weights for policy 0, policy_version 23370 (0.0006) [2023-03-07 14:36:26,057][213771] Updated weights for policy 0, policy_version 23380 (0.0006) [2023-03-07 14:36:26,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 23941120. Throughput: 0: 13234.2. Samples: 23938079. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:36:26,106][213445] Avg episode reward: [(0, '4343.217')] [2023-03-07 14:36:26,831][213771] Updated weights for policy 0, policy_version 23390 (0.0005) [2023-03-07 14:36:27,602][213771] Updated weights for policy 0, policy_version 23400 (0.0006) [2023-03-07 14:36:28,383][213771] Updated weights for policy 0, policy_version 23410 (0.0006) [2023-03-07 14:36:29,140][213771] Updated weights for policy 0, policy_version 23420 (0.0006) [2023-03-07 14:36:29,907][213771] Updated weights for policy 0, policy_version 23430 (0.0006) [2023-03-07 14:36:30,670][213771] Updated weights for policy 0, policy_version 23440 (0.0006) [2023-03-07 14:36:31,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 24007680. Throughput: 0: 13241.5. Samples: 23977880. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:36:31,106][213445] Avg episode reward: [(0, '4294.604')] [2023-03-07 14:36:31,436][213771] Updated weights for policy 0, policy_version 23450 (0.0006) [2023-03-07 14:36:32,212][213771] Updated weights for policy 0, policy_version 23460 (0.0007) [2023-03-07 14:36:33,001][213771] Updated weights for policy 0, policy_version 23470 (0.0007) [2023-03-07 14:36:33,756][213771] Updated weights for policy 0, policy_version 23480 (0.0007) [2023-03-07 14:36:34,522][213771] Updated weights for policy 0, policy_version 23490 (0.0006) [2023-03-07 14:36:35,333][213771] Updated weights for policy 0, policy_version 23500 (0.0006) [2023-03-07 14:36:36,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13246.0). Total num frames: 24074240. Throughput: 0: 13247.1. Samples: 24057641. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:36:36,106][213445] Avg episode reward: [(0, '4353.599')] [2023-03-07 14:36:36,107][213771] Updated weights for policy 0, policy_version 23510 (0.0006) [2023-03-07 14:36:36,873][213771] Updated weights for policy 0, policy_version 23520 (0.0007) [2023-03-07 14:36:37,627][213771] Updated weights for policy 0, policy_version 23530 (0.0006) [2023-03-07 14:36:38,408][213771] Updated weights for policy 0, policy_version 23540 (0.0006) [2023-03-07 14:36:39,188][213771] Updated weights for policy 0, policy_version 23550 (0.0007) [2023-03-07 14:36:39,956][213771] Updated weights for policy 0, policy_version 23560 (0.0007) [2023-03-07 14:36:40,730][213771] Updated weights for policy 0, policy_version 23570 (0.0006) [2023-03-07 14:36:41,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13242.6). Total num frames: 24139776. Throughput: 0: 13253.8. Samples: 24137118. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:36:41,106][213445] Avg episode reward: [(0, '4313.040')] [2023-03-07 14:36:41,481][213771] Updated weights for policy 0, policy_version 23580 (0.0006) [2023-03-07 14:36:42,264][213771] Updated weights for policy 0, policy_version 23590 (0.0006) [2023-03-07 14:36:43,033][213771] Updated weights for policy 0, policy_version 23600 (0.0006) [2023-03-07 14:36:43,795][213771] Updated weights for policy 0, policy_version 23610 (0.0006) [2023-03-07 14:36:44,579][213771] Updated weights for policy 0, policy_version 23620 (0.0006) [2023-03-07 14:36:45,343][213771] Updated weights for policy 0, policy_version 23630 (0.0006) [2023-03-07 14:36:46,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 24206336. Throughput: 0: 13264.4. Samples: 24177120. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:36:46,106][213445] Avg episode reward: [(0, '4184.088')] [2023-03-07 14:36:46,116][213771] Updated weights for policy 0, policy_version 23640 (0.0006) [2023-03-07 14:36:46,892][213771] Updated weights for policy 0, policy_version 23650 (0.0006) [2023-03-07 14:36:47,680][213771] Updated weights for policy 0, policy_version 23660 (0.0006) [2023-03-07 14:36:48,453][213771] Updated weights for policy 0, policy_version 23670 (0.0005) [2023-03-07 14:36:49,228][213771] Updated weights for policy 0, policy_version 23680 (0.0007) [2023-03-07 14:36:50,001][213771] Updated weights for policy 0, policy_version 23690 (0.0006) [2023-03-07 14:36:50,766][213771] Updated weights for policy 0, policy_version 23700 (0.0006) [2023-03-07 14:36:51,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 24272896. Throughput: 0: 13257.8. Samples: 24256394. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:36:51,106][213445] Avg episode reward: [(0, '4197.859')] [2023-03-07 14:36:51,522][213771] Updated weights for policy 0, policy_version 23710 (0.0006) [2023-03-07 14:36:52,312][213771] Updated weights for policy 0, policy_version 23720 (0.0006) [2023-03-07 14:36:53,077][213771] Updated weights for policy 0, policy_version 23730 (0.0006) [2023-03-07 14:36:53,838][213771] Updated weights for policy 0, policy_version 23740 (0.0007) [2023-03-07 14:36:54,616][213771] Updated weights for policy 0, policy_version 23750 (0.0006) [2023-03-07 14:36:55,394][213771] Updated weights for policy 0, policy_version 23760 (0.0006) [2023-03-07 14:36:56,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 24339456. Throughput: 0: 13252.4. Samples: 24336165. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:36:56,106][213445] Avg episode reward: [(0, '4304.061')] [2023-03-07 14:36:56,164][213771] Updated weights for policy 0, policy_version 23770 (0.0006) [2023-03-07 14:36:56,932][213771] Updated weights for policy 0, policy_version 23780 (0.0006) [2023-03-07 14:36:57,703][213771] Updated weights for policy 0, policy_version 23790 (0.0006) [2023-03-07 14:36:58,474][213771] Updated weights for policy 0, policy_version 23800 (0.0005) [2023-03-07 14:36:59,242][213771] Updated weights for policy 0, policy_version 23810 (0.0006) [2023-03-07 14:37:00,010][213771] Updated weights for policy 0, policy_version 23820 (0.0006) [2023-03-07 14:37:00,801][213771] Updated weights for policy 0, policy_version 23830 (0.0006) [2023-03-07 14:37:01,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 24406016. Throughput: 0: 13263.3. Samples: 24376215. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:37:01,106][213445] Avg episode reward: [(0, '4250.996')] [2023-03-07 14:37:01,565][213771] Updated weights for policy 0, policy_version 23840 (0.0006) [2023-03-07 14:37:02,333][213771] Updated weights for policy 0, policy_version 23850 (0.0006) [2023-03-07 14:37:03,115][213771] Updated weights for policy 0, policy_version 23860 (0.0006) [2023-03-07 14:37:03,882][213771] Updated weights for policy 0, policy_version 23870 (0.0007) [2023-03-07 14:37:04,664][213771] Updated weights for policy 0, policy_version 23880 (0.0006) [2023-03-07 14:37:05,432][213771] Updated weights for policy 0, policy_version 23890 (0.0006) [2023-03-07 14:37:06,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13260.8, 300 sec: 13246.0). Total num frames: 24471552. Throughput: 0: 13264.5. Samples: 24455442. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:37:06,106][213445] Avg episode reward: [(0, '4145.779')] [2023-03-07 14:37:06,197][213771] Updated weights for policy 0, policy_version 23900 (0.0005) [2023-03-07 14:37:06,986][213771] Updated weights for policy 0, policy_version 23910 (0.0007) [2023-03-07 14:37:07,735][213771] Updated weights for policy 0, policy_version 23920 (0.0008) [2023-03-07 14:37:08,509][213771] Updated weights for policy 0, policy_version 23930 (0.0006) [2023-03-07 14:37:09,280][213771] Updated weights for policy 0, policy_version 23940 (0.0006) [2023-03-07 14:37:10,063][213771] Updated weights for policy 0, policy_version 23950 (0.0006) [2023-03-07 14:37:10,821][213771] Updated weights for policy 0, policy_version 23960 (0.0006) [2023-03-07 14:37:11,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13246.1). Total num frames: 24538112. Throughput: 0: 13268.0. Samples: 24535138. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:37:11,106][213445] Avg episode reward: [(0, '4212.361')] [2023-03-07 14:37:11,590][213771] Updated weights for policy 0, policy_version 23970 (0.0006) [2023-03-07 14:37:12,374][213771] Updated weights for policy 0, policy_version 23980 (0.0006) [2023-03-07 14:37:13,150][213771] Updated weights for policy 0, policy_version 23990 (0.0007) [2023-03-07 14:37:13,933][213771] Updated weights for policy 0, policy_version 24000 (0.0006) [2023-03-07 14:37:14,707][213771] Updated weights for policy 0, policy_version 24010 (0.0007) [2023-03-07 14:37:15,481][213771] Updated weights for policy 0, policy_version 24020 (0.0006) [2023-03-07 14:37:16,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.8, 300 sec: 13242.6). Total num frames: 24603648. Throughput: 0: 13263.4. Samples: 24574733. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:37:16,106][213445] Avg episode reward: [(0, '4287.776')] [2023-03-07 14:37:16,270][213771] Updated weights for policy 0, policy_version 24030 (0.0006) [2023-03-07 14:37:17,028][213771] Updated weights for policy 0, policy_version 24040 (0.0006) [2023-03-07 14:37:17,799][213771] Updated weights for policy 0, policy_version 24050 (0.0006) [2023-03-07 14:37:18,581][213771] Updated weights for policy 0, policy_version 24060 (0.0006) [2023-03-07 14:37:19,331][213771] Updated weights for policy 0, policy_version 24070 (0.0006) [2023-03-07 14:37:20,100][213771] Updated weights for policy 0, policy_version 24080 (0.0008) [2023-03-07 14:37:20,854][213771] Updated weights for policy 0, policy_version 24090 (0.0006) [2023-03-07 14:37:21,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13277.9, 300 sec: 13249.5). Total num frames: 24671232. Throughput: 0: 13259.7. Samples: 24654326. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:37:21,106][213445] Avg episode reward: [(0, '4244.267')] [2023-03-07 14:37:21,626][213771] Updated weights for policy 0, policy_version 24100 (0.0007) [2023-03-07 14:37:22,406][213771] Updated weights for policy 0, policy_version 24110 (0.0006) [2023-03-07 14:37:23,171][213771] Updated weights for policy 0, policy_version 24120 (0.0006) [2023-03-07 14:37:23,953][213771] Updated weights for policy 0, policy_version 24130 (0.0007) [2023-03-07 14:37:24,714][213771] Updated weights for policy 0, policy_version 24140 (0.0007) [2023-03-07 14:37:25,494][213771] Updated weights for policy 0, policy_version 24150 (0.0006) [2023-03-07 14:37:26,105][213445] Fps is (10 sec: 13414.4, 60 sec: 13277.9, 300 sec: 13249.5). Total num frames: 24737792. Throughput: 0: 13268.1. Samples: 24734181. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:37:26,106][213445] Avg episode reward: [(0, '4126.869')] [2023-03-07 14:37:26,260][213771] Updated weights for policy 0, policy_version 24160 (0.0005) [2023-03-07 14:37:27,035][213771] Updated weights for policy 0, policy_version 24170 (0.0006) [2023-03-07 14:37:27,789][213771] Updated weights for policy 0, policy_version 24180 (0.0005) [2023-03-07 14:37:28,565][213771] Updated weights for policy 0, policy_version 24190 (0.0006) [2023-03-07 14:37:29,337][213771] Updated weights for policy 0, policy_version 24200 (0.0007) [2023-03-07 14:37:30,119][213771] Updated weights for policy 0, policy_version 24210 (0.0005) [2023-03-07 14:37:30,882][213771] Updated weights for policy 0, policy_version 24220 (0.0006) [2023-03-07 14:37:31,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 24803328. Throughput: 0: 13265.8. Samples: 24774080. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:37:31,106][213445] Avg episode reward: [(0, '4192.150')] [2023-03-07 14:37:31,656][213771] Updated weights for policy 0, policy_version 24230 (0.0006) [2023-03-07 14:37:32,448][213771] Updated weights for policy 0, policy_version 24240 (0.0007) [2023-03-07 14:37:33,205][213771] Updated weights for policy 0, policy_version 24250 (0.0006) [2023-03-07 14:37:33,985][213771] Updated weights for policy 0, policy_version 24260 (0.0006) [2023-03-07 14:37:34,753][213771] Updated weights for policy 0, policy_version 24270 (0.0006) [2023-03-07 14:37:35,529][213771] Updated weights for policy 0, policy_version 24280 (0.0006) [2023-03-07 14:37:36,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 24869888. Throughput: 0: 13271.2. Samples: 24853599. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:37:36,106][213445] Avg episode reward: [(0, '4369.755')] [2023-03-07 14:37:36,293][213771] Updated weights for policy 0, policy_version 24290 (0.0006) [2023-03-07 14:37:37,052][213771] Updated weights for policy 0, policy_version 24300 (0.0006) [2023-03-07 14:37:37,827][213771] Updated weights for policy 0, policy_version 24310 (0.0006) [2023-03-07 14:37:38,593][213771] Updated weights for policy 0, policy_version 24320 (0.0006) [2023-03-07 14:37:39,362][213771] Updated weights for policy 0, policy_version 24330 (0.0006) [2023-03-07 14:37:40,134][213771] Updated weights for policy 0, policy_version 24340 (0.0006) [2023-03-07 14:37:40,906][213771] Updated weights for policy 0, policy_version 24350 (0.0006) [2023-03-07 14:37:41,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13277.9, 300 sec: 13249.5). Total num frames: 24936448. Throughput: 0: 13271.6. Samples: 24933387. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:37:41,106][213445] Avg episode reward: [(0, '4323.695')] [2023-03-07 14:37:41,682][213771] Updated weights for policy 0, policy_version 24360 (0.0006) [2023-03-07 14:37:42,448][213771] Updated weights for policy 0, policy_version 24370 (0.0005) [2023-03-07 14:37:43,240][213771] Updated weights for policy 0, policy_version 24380 (0.0006) [2023-03-07 14:37:44,022][213771] Updated weights for policy 0, policy_version 24390 (0.0005) [2023-03-07 14:37:44,785][213771] Updated weights for policy 0, policy_version 24400 (0.0007) [2023-03-07 14:37:45,547][213771] Updated weights for policy 0, policy_version 24410 (0.0006) [2023-03-07 14:37:46,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13277.9, 300 sec: 13249.5). Total num frames: 25003008. Throughput: 0: 13262.7. Samples: 24973037. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:37:46,106][213445] Avg episode reward: [(0, '4346.506')] [2023-03-07 14:37:46,317][213771] Updated weights for policy 0, policy_version 24420 (0.0006) [2023-03-07 14:37:47,089][213771] Updated weights for policy 0, policy_version 24430 (0.0006) [2023-03-07 14:37:47,850][213771] Updated weights for policy 0, policy_version 24440 (0.0006) [2023-03-07 14:37:48,625][213771] Updated weights for policy 0, policy_version 24450 (0.0006) [2023-03-07 14:37:49,406][213771] Updated weights for policy 0, policy_version 24460 (0.0006) [2023-03-07 14:37:50,174][213771] Updated weights for policy 0, policy_version 24470 (0.0006) [2023-03-07 14:37:50,941][213771] Updated weights for policy 0, policy_version 24480 (0.0006) [2023-03-07 14:37:51,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13277.9, 300 sec: 13249.5). Total num frames: 25069568. Throughput: 0: 13273.7. Samples: 25052761. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:37:51,106][213445] Avg episode reward: [(0, '4397.411')] [2023-03-07 14:37:51,685][213771] Updated weights for policy 0, policy_version 24490 (0.0006) [2023-03-07 14:37:52,476][213771] Updated weights for policy 0, policy_version 24500 (0.0006) [2023-03-07 14:37:53,230][213771] Updated weights for policy 0, policy_version 24510 (0.0006) [2023-03-07 14:37:54,017][213771] Updated weights for policy 0, policy_version 24520 (0.0005) [2023-03-07 14:37:54,785][213771] Updated weights for policy 0, policy_version 24530 (0.0006) [2023-03-07 14:37:55,570][213771] Updated weights for policy 0, policy_version 24540 (0.0007) [2023-03-07 14:37:56,105][213445] Fps is (10 sec: 13312.3, 60 sec: 13277.9, 300 sec: 13253.0). Total num frames: 25136128. Throughput: 0: 13275.4. Samples: 25132529. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:37:56,105][213445] Avg episode reward: [(0, '4415.665')] [2023-03-07 14:37:56,329][213771] Updated weights for policy 0, policy_version 24550 (0.0007) [2023-03-07 14:37:57,096][213771] Updated weights for policy 0, policy_version 24560 (0.0005) [2023-03-07 14:37:57,876][213771] Updated weights for policy 0, policy_version 24570 (0.0007) [2023-03-07 14:37:58,643][213771] Updated weights for policy 0, policy_version 24580 (0.0007) [2023-03-07 14:37:59,411][213771] Updated weights for policy 0, policy_version 24590 (0.0006) [2023-03-07 14:38:00,197][213771] Updated weights for policy 0, policy_version 24600 (0.0006) [2023-03-07 14:38:00,973][213771] Updated weights for policy 0, policy_version 24610 (0.0006) [2023-03-07 14:38:01,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 25201664. Throughput: 0: 13284.9. Samples: 25172553. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:38:01,106][213445] Avg episode reward: [(0, '4415.437')] [2023-03-07 14:38:01,751][213771] Updated weights for policy 0, policy_version 24620 (0.0006) [2023-03-07 14:38:02,527][213771] Updated weights for policy 0, policy_version 24630 (0.0005) [2023-03-07 14:38:03,302][213771] Updated weights for policy 0, policy_version 24640 (0.0006) [2023-03-07 14:38:04,081][213771] Updated weights for policy 0, policy_version 24650 (0.0005) [2023-03-07 14:38:04,854][213771] Updated weights for policy 0, policy_version 24660 (0.0006) [2023-03-07 14:38:05,627][213771] Updated weights for policy 0, policy_version 24670 (0.0006) [2023-03-07 14:38:06,105][213445] Fps is (10 sec: 13209.3, 60 sec: 13277.8, 300 sec: 13253.0). Total num frames: 25268224. Throughput: 0: 13273.2. Samples: 25251621. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:38:06,106][213445] Avg episode reward: [(0, '4472.541')] [2023-03-07 14:38:06,111][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000024676_25268224.pth... [2023-03-07 14:38:06,141][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000021570_22087680.pth [2023-03-07 14:38:06,409][213771] Updated weights for policy 0, policy_version 24680 (0.0006) [2023-03-07 14:38:07,177][213771] Updated weights for policy 0, policy_version 24690 (0.0006) [2023-03-07 14:38:07,964][213771] Updated weights for policy 0, policy_version 24700 (0.0006) [2023-03-07 14:38:08,745][213771] Updated weights for policy 0, policy_version 24710 (0.0006) [2023-03-07 14:38:09,506][213771] Updated weights for policy 0, policy_version 24720 (0.0006) [2023-03-07 14:38:10,289][213771] Updated weights for policy 0, policy_version 24730 (0.0006) [2023-03-07 14:38:11,056][213771] Updated weights for policy 0, policy_version 24740 (0.0006) [2023-03-07 14:38:11,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 25333760. Throughput: 0: 13260.0. Samples: 25330884. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:38:11,106][213445] Avg episode reward: [(0, '4480.746')] [2023-03-07 14:38:11,829][213771] Updated weights for policy 0, policy_version 24750 (0.0006) [2023-03-07 14:38:12,616][213771] Updated weights for policy 0, policy_version 24760 (0.0007) [2023-03-07 14:38:13,374][213771] Updated weights for policy 0, policy_version 24770 (0.0007) [2023-03-07 14:38:14,148][213771] Updated weights for policy 0, policy_version 24780 (0.0007) [2023-03-07 14:38:14,933][213771] Updated weights for policy 0, policy_version 24790 (0.0006) [2023-03-07 14:38:15,700][213771] Updated weights for policy 0, policy_version 24800 (0.0006) [2023-03-07 14:38:16,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13277.9, 300 sec: 13253.0). Total num frames: 25400320. Throughput: 0: 13255.2. Samples: 25370561. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:38:16,106][213445] Avg episode reward: [(0, '4476.270')] [2023-03-07 14:38:16,481][213771] Updated weights for policy 0, policy_version 24810 (0.0006) [2023-03-07 14:38:17,249][213771] Updated weights for policy 0, policy_version 24820 (0.0006) [2023-03-07 14:38:18,022][213771] Updated weights for policy 0, policy_version 24830 (0.0007) [2023-03-07 14:38:18,781][213771] Updated weights for policy 0, policy_version 24840 (0.0006) [2023-03-07 14:38:19,566][213771] Updated weights for policy 0, policy_version 24850 (0.0006) [2023-03-07 14:38:20,346][213771] Updated weights for policy 0, policy_version 24860 (0.0006) [2023-03-07 14:38:21,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 25465856. Throughput: 0: 13246.0. Samples: 25449669. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:38:21,106][213445] Avg episode reward: [(0, '4480.680')] [2023-03-07 14:38:21,120][213771] Updated weights for policy 0, policy_version 24870 (0.0007) [2023-03-07 14:38:21,907][213771] Updated weights for policy 0, policy_version 24880 (0.0005) [2023-03-07 14:38:22,672][213771] Updated weights for policy 0, policy_version 24890 (0.0006) [2023-03-07 14:38:23,460][213771] Updated weights for policy 0, policy_version 24900 (0.0006) [2023-03-07 14:38:24,237][213771] Updated weights for policy 0, policy_version 24910 (0.0007) [2023-03-07 14:38:25,001][213771] Updated weights for policy 0, policy_version 24920 (0.0005) [2023-03-07 14:38:25,775][213771] Updated weights for policy 0, policy_version 24930 (0.0006) [2023-03-07 14:38:26,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13243.7, 300 sec: 13253.0). Total num frames: 25532416. Throughput: 0: 13236.3. Samples: 25529023. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:38:26,106][213445] Avg episode reward: [(0, '4457.024')] [2023-03-07 14:38:26,555][213771] Updated weights for policy 0, policy_version 24940 (0.0007) [2023-03-07 14:38:27,321][213771] Updated weights for policy 0, policy_version 24950 (0.0006) [2023-03-07 14:38:28,082][213771] Updated weights for policy 0, policy_version 24960 (0.0006) [2023-03-07 14:38:28,855][213771] Updated weights for policy 0, policy_version 24970 (0.0006) [2023-03-07 14:38:29,619][213771] Updated weights for policy 0, policy_version 24980 (0.0007) [2023-03-07 14:38:30,401][213771] Updated weights for policy 0, policy_version 24990 (0.0006) [2023-03-07 14:38:31,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 25598976. Throughput: 0: 13242.5. Samples: 25568945. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:38:31,106][213445] Avg episode reward: [(0, '4327.746')] [2023-03-07 14:38:31,170][213771] Updated weights for policy 0, policy_version 25000 (0.0006) [2023-03-07 14:38:31,949][213771] Updated weights for policy 0, policy_version 25010 (0.0006) [2023-03-07 14:38:32,725][213771] Updated weights for policy 0, policy_version 25020 (0.0006) [2023-03-07 14:38:33,505][213771] Updated weights for policy 0, policy_version 25030 (0.0006) [2023-03-07 14:38:34,278][213771] Updated weights for policy 0, policy_version 25040 (0.0006) [2023-03-07 14:38:35,047][213771] Updated weights for policy 0, policy_version 25050 (0.0006) [2023-03-07 14:38:35,822][213771] Updated weights for policy 0, policy_version 25060 (0.0006) [2023-03-07 14:38:36,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 25664512. Throughput: 0: 13237.5. Samples: 25648450. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 14:38:36,106][213445] Avg episode reward: [(0, '4423.916')] [2023-03-07 14:38:36,604][213771] Updated weights for policy 0, policy_version 25070 (0.0007) [2023-03-07 14:38:37,374][213771] Updated weights for policy 0, policy_version 25080 (0.0006) [2023-03-07 14:38:38,139][213771] Updated weights for policy 0, policy_version 25090 (0.0007) [2023-03-07 14:38:38,918][213771] Updated weights for policy 0, policy_version 25100 (0.0006) [2023-03-07 14:38:39,684][213771] Updated weights for policy 0, policy_version 25110 (0.0006) [2023-03-07 14:38:40,462][213771] Updated weights for policy 0, policy_version 25120 (0.0005) [2023-03-07 14:38:41,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 25731072. Throughput: 0: 13229.3. Samples: 25727852. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 14:38:41,106][213445] Avg episode reward: [(0, '4448.375')] [2023-03-07 14:38:41,230][213771] Updated weights for policy 0, policy_version 25130 (0.0006) [2023-03-07 14:38:42,013][213771] Updated weights for policy 0, policy_version 25140 (0.0006) [2023-03-07 14:38:42,793][213771] Updated weights for policy 0, policy_version 25150 (0.0005) [2023-03-07 14:38:43,558][213771] Updated weights for policy 0, policy_version 25160 (0.0007) [2023-03-07 14:38:44,329][213771] Updated weights for policy 0, policy_version 25170 (0.0006) [2023-03-07 14:38:45,098][213771] Updated weights for policy 0, policy_version 25180 (0.0006) [2023-03-07 14:38:45,859][213771] Updated weights for policy 0, policy_version 25190 (0.0006) [2023-03-07 14:38:46,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 25797632. Throughput: 0: 13221.5. Samples: 25767524. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 14:38:46,106][213445] Avg episode reward: [(0, '4475.665')] [2023-03-07 14:38:46,637][213771] Updated weights for policy 0, policy_version 25200 (0.0006) [2023-03-07 14:38:47,419][213771] Updated weights for policy 0, policy_version 25210 (0.0006) [2023-03-07 14:38:48,187][213771] Updated weights for policy 0, policy_version 25220 (0.0005) [2023-03-07 14:38:48,945][213771] Updated weights for policy 0, policy_version 25230 (0.0006) [2023-03-07 14:38:49,722][213771] Updated weights for policy 0, policy_version 25240 (0.0007) [2023-03-07 14:38:50,495][213771] Updated weights for policy 0, policy_version 25250 (0.0006) [2023-03-07 14:38:51,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13246.0). Total num frames: 25863168. Throughput: 0: 13240.2. Samples: 25847428. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:38:51,106][213445] Avg episode reward: [(0, '4439.423')] [2023-03-07 14:38:51,298][213771] Updated weights for policy 0, policy_version 25260 (0.0006) [2023-03-07 14:38:52,054][213771] Updated weights for policy 0, policy_version 25270 (0.0007) [2023-03-07 14:38:52,809][213771] Updated weights for policy 0, policy_version 25280 (0.0006) [2023-03-07 14:38:53,582][213771] Updated weights for policy 0, policy_version 25290 (0.0006) [2023-03-07 14:38:54,349][213771] Updated weights for policy 0, policy_version 25300 (0.0006) [2023-03-07 14:38:55,134][213771] Updated weights for policy 0, policy_version 25310 (0.0006) [2023-03-07 14:38:55,924][213771] Updated weights for policy 0, policy_version 25320 (0.0005) [2023-03-07 14:38:56,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13226.6, 300 sec: 13249.5). Total num frames: 25929728. Throughput: 0: 13237.6. Samples: 25926576. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:38:56,106][213445] Avg episode reward: [(0, '4460.941')] [2023-03-07 14:38:56,692][213771] Updated weights for policy 0, policy_version 25330 (0.0006) [2023-03-07 14:38:57,464][213771] Updated weights for policy 0, policy_version 25340 (0.0005) [2023-03-07 14:38:58,244][213771] Updated weights for policy 0, policy_version 25350 (0.0006) [2023-03-07 14:38:59,002][213771] Updated weights for policy 0, policy_version 25360 (0.0005) [2023-03-07 14:38:59,780][213771] Updated weights for policy 0, policy_version 25370 (0.0006) [2023-03-07 14:39:00,538][213771] Updated weights for policy 0, policy_version 25380 (0.0007) [2023-03-07 14:39:01,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 25996288. Throughput: 0: 13242.6. Samples: 25966479. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:39:01,106][213445] Avg episode reward: [(0, '4438.912')] [2023-03-07 14:39:01,298][213771] Updated weights for policy 0, policy_version 25390 (0.0006) [2023-03-07 14:39:02,069][213771] Updated weights for policy 0, policy_version 25400 (0.0005) [2023-03-07 14:39:02,861][213771] Updated weights for policy 0, policy_version 25410 (0.0006) [2023-03-07 14:39:03,634][213771] Updated weights for policy 0, policy_version 25420 (0.0006) [2023-03-07 14:39:04,399][213771] Updated weights for policy 0, policy_version 25430 (0.0006) [2023-03-07 14:39:05,173][213771] Updated weights for policy 0, policy_version 25440 (0.0005) [2023-03-07 14:39:05,957][213771] Updated weights for policy 0, policy_version 25450 (0.0007) [2023-03-07 14:39:06,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13249.5). Total num frames: 26061824. Throughput: 0: 13252.9. Samples: 26046052. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:39:06,106][213445] Avg episode reward: [(0, '4450.566')] [2023-03-07 14:39:06,745][213771] Updated weights for policy 0, policy_version 25460 (0.0006) [2023-03-07 14:39:07,511][213771] Updated weights for policy 0, policy_version 25470 (0.0007) [2023-03-07 14:39:08,296][213771] Updated weights for policy 0, policy_version 25480 (0.0006) [2023-03-07 14:39:09,077][213771] Updated weights for policy 0, policy_version 25490 (0.0006) [2023-03-07 14:39:09,845][213771] Updated weights for policy 0, policy_version 25500 (0.0007) [2023-03-07 14:39:10,609][213771] Updated weights for policy 0, policy_version 25510 (0.0006) [2023-03-07 14:39:11,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 26128384. Throughput: 0: 13250.1. Samples: 26125275. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:39:11,106][213445] Avg episode reward: [(0, '4411.300')] [2023-03-07 14:39:11,394][213771] Updated weights for policy 0, policy_version 25520 (0.0006) [2023-03-07 14:39:12,145][213771] Updated weights for policy 0, policy_version 25530 (0.0006) [2023-03-07 14:39:12,916][213771] Updated weights for policy 0, policy_version 25540 (0.0006) [2023-03-07 14:39:13,688][213771] Updated weights for policy 0, policy_version 25550 (0.0007) [2023-03-07 14:39:14,456][213771] Updated weights for policy 0, policy_version 25560 (0.0007) [2023-03-07 14:39:15,209][213771] Updated weights for policy 0, policy_version 25570 (0.0005) [2023-03-07 14:39:15,989][213771] Updated weights for policy 0, policy_version 25580 (0.0006) [2023-03-07 14:39:16,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 26194944. Throughput: 0: 13250.2. Samples: 26165206. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:39:16,105][213445] Avg episode reward: [(0, '4350.999')] [2023-03-07 14:39:16,756][213771] Updated weights for policy 0, policy_version 25590 (0.0006) [2023-03-07 14:39:17,523][213771] Updated weights for policy 0, policy_version 25600 (0.0006) [2023-03-07 14:39:18,309][213771] Updated weights for policy 0, policy_version 25610 (0.0006) [2023-03-07 14:39:19,076][213771] Updated weights for policy 0, policy_version 25620 (0.0006) [2023-03-07 14:39:19,824][213771] Updated weights for policy 0, policy_version 25630 (0.0007) [2023-03-07 14:39:20,609][213771] Updated weights for policy 0, policy_version 25640 (0.0007) [2023-03-07 14:39:21,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 26261504. Throughput: 0: 13259.7. Samples: 26245133. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:39:21,106][213445] Avg episode reward: [(0, '4388.864')] [2023-03-07 14:39:21,375][213771] Updated weights for policy 0, policy_version 25650 (0.0006) [2023-03-07 14:39:22,159][213771] Updated weights for policy 0, policy_version 25660 (0.0006) [2023-03-07 14:39:22,950][213771] Updated weights for policy 0, policy_version 25670 (0.0006) [2023-03-07 14:39:23,713][213771] Updated weights for policy 0, policy_version 25680 (0.0007) [2023-03-07 14:39:24,487][213771] Updated weights for policy 0, policy_version 25690 (0.0007) [2023-03-07 14:39:25,265][213771] Updated weights for policy 0, policy_version 25700 (0.0006) [2023-03-07 14:39:26,033][213771] Updated weights for policy 0, policy_version 25710 (0.0006) [2023-03-07 14:39:26,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 26327040. Throughput: 0: 13255.9. Samples: 26324368. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 14:39:26,106][213445] Avg episode reward: [(0, '4440.851')] [2023-03-07 14:39:26,818][213771] Updated weights for policy 0, policy_version 25720 (0.0006) [2023-03-07 14:39:27,581][213771] Updated weights for policy 0, policy_version 25730 (0.0006) [2023-03-07 14:39:28,349][213771] Updated weights for policy 0, policy_version 25740 (0.0006) [2023-03-07 14:39:29,124][213771] Updated weights for policy 0, policy_version 25750 (0.0006) [2023-03-07 14:39:29,901][213771] Updated weights for policy 0, policy_version 25760 (0.0007) [2023-03-07 14:39:30,693][213771] Updated weights for policy 0, policy_version 25770 (0.0006) [2023-03-07 14:39:31,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 26393600. Throughput: 0: 13258.9. Samples: 26364171. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 14:39:31,106][213445] Avg episode reward: [(0, '4480.031')] [2023-03-07 14:39:31,463][213771] Updated weights for policy 0, policy_version 25780 (0.0006) [2023-03-07 14:39:32,246][213771] Updated weights for policy 0, policy_version 25790 (0.0005) [2023-03-07 14:39:33,016][213771] Updated weights for policy 0, policy_version 25800 (0.0006) [2023-03-07 14:39:33,804][213771] Updated weights for policy 0, policy_version 25810 (0.0007) [2023-03-07 14:39:34,576][213771] Updated weights for policy 0, policy_version 25820 (0.0007) [2023-03-07 14:39:35,339][213771] Updated weights for policy 0, policy_version 25830 (0.0006) [2023-03-07 14:39:36,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.8, 300 sec: 13249.5). Total num frames: 26459136. Throughput: 0: 13239.4. Samples: 26443203. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 14:39:36,106][213445] Avg episode reward: [(0, '4464.066')] [2023-03-07 14:39:36,124][213771] Updated weights for policy 0, policy_version 25840 (0.0006) [2023-03-07 14:39:36,879][213771] Updated weights for policy 0, policy_version 25850 (0.0006) [2023-03-07 14:39:37,654][213771] Updated weights for policy 0, policy_version 25860 (0.0006) [2023-03-07 14:39:38,420][213771] Updated weights for policy 0, policy_version 25870 (0.0007) [2023-03-07 14:39:39,178][213771] Updated weights for policy 0, policy_version 25880 (0.0006) [2023-03-07 14:39:39,957][213771] Updated weights for policy 0, policy_version 25890 (0.0007) [2023-03-07 14:39:40,746][213771] Updated weights for policy 0, policy_version 25900 (0.0006) [2023-03-07 14:39:41,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 26525696. Throughput: 0: 13250.1. Samples: 26522831. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 14:39:41,106][213445] Avg episode reward: [(0, '4393.846')] [2023-03-07 14:39:41,508][213771] Updated weights for policy 0, policy_version 25910 (0.0007) [2023-03-07 14:39:42,281][213771] Updated weights for policy 0, policy_version 25920 (0.0006) [2023-03-07 14:39:43,061][213771] Updated weights for policy 0, policy_version 25930 (0.0006) [2023-03-07 14:39:43,830][213771] Updated weights for policy 0, policy_version 25940 (0.0005) [2023-03-07 14:39:44,618][213771] Updated weights for policy 0, policy_version 25950 (0.0006) [2023-03-07 14:39:45,388][213771] Updated weights for policy 0, policy_version 25960 (0.0006) [2023-03-07 14:39:46,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13243.8, 300 sec: 13249.5). Total num frames: 26592256. Throughput: 0: 13246.5. Samples: 26562572. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:39:46,106][213445] Avg episode reward: [(0, '4394.224')] [2023-03-07 14:39:46,161][213771] Updated weights for policy 0, policy_version 25970 (0.0006) [2023-03-07 14:39:46,921][213771] Updated weights for policy 0, policy_version 25980 (0.0006) [2023-03-07 14:39:47,698][213771] Updated weights for policy 0, policy_version 25990 (0.0006) [2023-03-07 14:39:48,469][213771] Updated weights for policy 0, policy_version 26000 (0.0007) [2023-03-07 14:39:49,218][213771] Updated weights for policy 0, policy_version 26010 (0.0006) [2023-03-07 14:39:49,983][213771] Updated weights for policy 0, policy_version 26020 (0.0005) [2023-03-07 14:39:50,769][213771] Updated weights for policy 0, policy_version 26030 (0.0006) [2023-03-07 14:39:51,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 26658816. Throughput: 0: 13249.0. Samples: 26642258. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:39:51,106][213445] Avg episode reward: [(0, '4422.020')] [2023-03-07 14:39:51,528][213771] Updated weights for policy 0, policy_version 26040 (0.0006) [2023-03-07 14:39:52,295][213771] Updated weights for policy 0, policy_version 26050 (0.0006) [2023-03-07 14:39:53,071][213771] Updated weights for policy 0, policy_version 26060 (0.0006) [2023-03-07 14:39:53,838][213771] Updated weights for policy 0, policy_version 26070 (0.0006) [2023-03-07 14:39:54,604][213771] Updated weights for policy 0, policy_version 26080 (0.0007) [2023-03-07 14:39:55,374][213771] Updated weights for policy 0, policy_version 26090 (0.0006) [2023-03-07 14:39:56,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 26725376. Throughput: 0: 13266.9. Samples: 26722285. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:39:56,106][213445] Avg episode reward: [(0, '4448.669')] [2023-03-07 14:39:56,145][213771] Updated weights for policy 0, policy_version 26100 (0.0005) [2023-03-07 14:39:56,926][213771] Updated weights for policy 0, policy_version 26110 (0.0006) [2023-03-07 14:39:57,691][213771] Updated weights for policy 0, policy_version 26120 (0.0006) [2023-03-07 14:39:58,486][213771] Updated weights for policy 0, policy_version 26130 (0.0006) [2023-03-07 14:39:59,246][213771] Updated weights for policy 0, policy_version 26140 (0.0006) [2023-03-07 14:40:00,017][213771] Updated weights for policy 0, policy_version 26150 (0.0007) [2023-03-07 14:40:00,785][213771] Updated weights for policy 0, policy_version 26160 (0.0005) [2023-03-07 14:40:01,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13256.5). Total num frames: 26791936. Throughput: 0: 13260.7. Samples: 26761940. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:40:01,106][213445] Avg episode reward: [(0, '4361.540')] [2023-03-07 14:40:01,566][213771] Updated weights for policy 0, policy_version 26170 (0.0007) [2023-03-07 14:40:02,346][213771] Updated weights for policy 0, policy_version 26180 (0.0005) [2023-03-07 14:40:03,113][213771] Updated weights for policy 0, policy_version 26190 (0.0007) [2023-03-07 14:40:03,866][213771] Updated weights for policy 0, policy_version 26200 (0.0005) [2023-03-07 14:40:04,625][213771] Updated weights for policy 0, policy_version 26210 (0.0005) [2023-03-07 14:40:05,400][213771] Updated weights for policy 0, policy_version 26220 (0.0006) [2023-03-07 14:40:06,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13277.9, 300 sec: 13259.9). Total num frames: 26858496. Throughput: 0: 13259.0. Samples: 26841789. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:40:06,106][213445] Avg episode reward: [(0, '4384.268')] [2023-03-07 14:40:06,110][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000026229_26858496.pth... [2023-03-07 14:40:06,139][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000023122_23676928.pth [2023-03-07 14:40:06,178][213771] Updated weights for policy 0, policy_version 26230 (0.0006) [2023-03-07 14:40:06,930][213771] Updated weights for policy 0, policy_version 26240 (0.0007) [2023-03-07 14:40:07,715][213771] Updated weights for policy 0, policy_version 26250 (0.0005) [2023-03-07 14:40:08,493][213771] Updated weights for policy 0, policy_version 26260 (0.0005) [2023-03-07 14:40:09,270][213771] Updated weights for policy 0, policy_version 26270 (0.0006) [2023-03-07 14:40:10,047][213771] Updated weights for policy 0, policy_version 26280 (0.0006) [2023-03-07 14:40:10,807][213771] Updated weights for policy 0, policy_version 26290 (0.0007) [2023-03-07 14:40:11,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13256.5). Total num frames: 26924032. Throughput: 0: 13262.2. Samples: 26921165. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:40:11,106][213445] Avg episode reward: [(0, '4438.801')] [2023-03-07 14:40:11,602][213771] Updated weights for policy 0, policy_version 26300 (0.0006) [2023-03-07 14:40:12,343][213771] Updated weights for policy 0, policy_version 26310 (0.0006) [2023-03-07 14:40:13,132][213771] Updated weights for policy 0, policy_version 26320 (0.0006) [2023-03-07 14:40:13,900][213771] Updated weights for policy 0, policy_version 26330 (0.0006) [2023-03-07 14:40:14,669][213771] Updated weights for policy 0, policy_version 26340 (0.0005) [2023-03-07 14:40:15,445][213771] Updated weights for policy 0, policy_version 26350 (0.0006) [2023-03-07 14:40:16,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13256.5). Total num frames: 26990592. Throughput: 0: 13261.2. Samples: 26960925. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:40:16,106][213445] Avg episode reward: [(0, '4471.173')] [2023-03-07 14:40:16,216][213771] Updated weights for policy 0, policy_version 26360 (0.0006) [2023-03-07 14:40:16,996][213771] Updated weights for policy 0, policy_version 26370 (0.0007) [2023-03-07 14:40:17,770][213771] Updated weights for policy 0, policy_version 26380 (0.0006) [2023-03-07 14:40:18,537][213771] Updated weights for policy 0, policy_version 26390 (0.0005) [2023-03-07 14:40:19,293][213771] Updated weights for policy 0, policy_version 26400 (0.0006) [2023-03-07 14:40:20,044][213771] Updated weights for policy 0, policy_version 26410 (0.0006) [2023-03-07 14:40:20,815][213771] Updated weights for policy 0, policy_version 26420 (0.0006) [2023-03-07 14:40:21,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13256.5). Total num frames: 27057152. Throughput: 0: 13281.5. Samples: 27040872. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:40:21,106][213445] Avg episode reward: [(0, '4361.317')] [2023-03-07 14:40:21,585][213771] Updated weights for policy 0, policy_version 26430 (0.0006) [2023-03-07 14:40:22,365][213771] Updated weights for policy 0, policy_version 26440 (0.0006) [2023-03-07 14:40:23,145][213771] Updated weights for policy 0, policy_version 26450 (0.0006) [2023-03-07 14:40:23,924][213771] Updated weights for policy 0, policy_version 26460 (0.0006) [2023-03-07 14:40:24,681][213771] Updated weights for policy 0, policy_version 26470 (0.0005) [2023-03-07 14:40:25,443][213771] Updated weights for policy 0, policy_version 26480 (0.0005) [2023-03-07 14:40:26,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13277.9, 300 sec: 13256.5). Total num frames: 27123712. Throughput: 0: 13284.8. Samples: 27120644. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:40:26,106][213445] Avg episode reward: [(0, '4418.100')] [2023-03-07 14:40:26,209][213771] Updated weights for policy 0, policy_version 26490 (0.0006) [2023-03-07 14:40:26,987][213771] Updated weights for policy 0, policy_version 26500 (0.0005) [2023-03-07 14:40:27,762][213771] Updated weights for policy 0, policy_version 26510 (0.0005) [2023-03-07 14:40:28,526][213771] Updated weights for policy 0, policy_version 26520 (0.0006) [2023-03-07 14:40:29,294][213771] Updated weights for policy 0, policy_version 26530 (0.0006) [2023-03-07 14:40:30,074][213771] Updated weights for policy 0, policy_version 26540 (0.0006) [2023-03-07 14:40:30,828][213771] Updated weights for policy 0, policy_version 26550 (0.0007) [2023-03-07 14:40:31,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13277.9, 300 sec: 13259.9). Total num frames: 27190272. Throughput: 0: 13290.2. Samples: 27160630. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:40:31,106][213445] Avg episode reward: [(0, '4367.521')] [2023-03-07 14:40:31,610][213771] Updated weights for policy 0, policy_version 26560 (0.0006) [2023-03-07 14:40:32,385][213771] Updated weights for policy 0, policy_version 26570 (0.0006) [2023-03-07 14:40:33,140][213771] Updated weights for policy 0, policy_version 26580 (0.0006) [2023-03-07 14:40:33,925][213771] Updated weights for policy 0, policy_version 26590 (0.0006) [2023-03-07 14:40:34,683][213771] Updated weights for policy 0, policy_version 26600 (0.0006) [2023-03-07 14:40:35,437][213771] Updated weights for policy 0, policy_version 26610 (0.0007) [2023-03-07 14:40:36,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13294.9, 300 sec: 13256.5). Total num frames: 27256832. Throughput: 0: 13294.1. Samples: 27240493. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:40:36,105][213445] Avg episode reward: [(0, '4295.137')] [2023-03-07 14:40:36,214][213771] Updated weights for policy 0, policy_version 26620 (0.0007) [2023-03-07 14:40:36,961][213771] Updated weights for policy 0, policy_version 26630 (0.0006) [2023-03-07 14:40:37,730][213771] Updated weights for policy 0, policy_version 26640 (0.0006) [2023-03-07 14:40:38,517][213771] Updated weights for policy 0, policy_version 26650 (0.0007) [2023-03-07 14:40:39,285][213771] Updated weights for policy 0, policy_version 26660 (0.0006) [2023-03-07 14:40:40,067][213771] Updated weights for policy 0, policy_version 26670 (0.0006) [2023-03-07 14:40:40,836][213771] Updated weights for policy 0, policy_version 26680 (0.0005) [2023-03-07 14:40:41,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13294.9, 300 sec: 13259.9). Total num frames: 27323392. Throughput: 0: 13290.5. Samples: 27320358. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:40:41,106][213445] Avg episode reward: [(0, '4285.048')] [2023-03-07 14:40:41,598][213771] Updated weights for policy 0, policy_version 26690 (0.0006) [2023-03-07 14:40:42,364][213771] Updated weights for policy 0, policy_version 26700 (0.0006) [2023-03-07 14:40:43,122][213771] Updated weights for policy 0, policy_version 26710 (0.0007) [2023-03-07 14:40:43,880][213771] Updated weights for policy 0, policy_version 26720 (0.0007) [2023-03-07 14:40:44,669][213771] Updated weights for policy 0, policy_version 26730 (0.0006) [2023-03-07 14:40:45,455][213771] Updated weights for policy 0, policy_version 26740 (0.0006) [2023-03-07 14:40:46,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13294.9, 300 sec: 13259.9). Total num frames: 27389952. Throughput: 0: 13301.2. Samples: 27360492. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:40:46,106][213445] Avg episode reward: [(0, '4232.937')] [2023-03-07 14:40:46,226][213771] Updated weights for policy 0, policy_version 26750 (0.0006) [2023-03-07 14:40:46,986][213771] Updated weights for policy 0, policy_version 26760 (0.0007) [2023-03-07 14:40:47,758][213771] Updated weights for policy 0, policy_version 26770 (0.0006) [2023-03-07 14:40:48,525][213771] Updated weights for policy 0, policy_version 26780 (0.0006) [2023-03-07 14:40:49,301][213771] Updated weights for policy 0, policy_version 26790 (0.0006) [2023-03-07 14:40:50,074][213771] Updated weights for policy 0, policy_version 26800 (0.0006) [2023-03-07 14:40:50,843][213771] Updated weights for policy 0, policy_version 26810 (0.0006) [2023-03-07 14:40:51,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13294.9, 300 sec: 13263.4). Total num frames: 27456512. Throughput: 0: 13291.1. Samples: 27439887. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:40:51,106][213445] Avg episode reward: [(0, '4289.968')] [2023-03-07 14:40:51,628][213771] Updated weights for policy 0, policy_version 26820 (0.0006) [2023-03-07 14:40:52,402][213771] Updated weights for policy 0, policy_version 26830 (0.0006) [2023-03-07 14:40:53,154][213771] Updated weights for policy 0, policy_version 26840 (0.0005) [2023-03-07 14:40:53,944][213771] Updated weights for policy 0, policy_version 26850 (0.0007) [2023-03-07 14:40:54,711][213771] Updated weights for policy 0, policy_version 26860 (0.0005) [2023-03-07 14:40:55,488][213771] Updated weights for policy 0, policy_version 26870 (0.0006) [2023-03-07 14:40:56,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13294.9, 300 sec: 13263.4). Total num frames: 27523072. Throughput: 0: 13295.8. Samples: 27519475. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:40:56,106][213445] Avg episode reward: [(0, '4269.216')] [2023-03-07 14:40:56,248][213771] Updated weights for policy 0, policy_version 26880 (0.0006) [2023-03-07 14:40:57,011][213771] Updated weights for policy 0, policy_version 26890 (0.0006) [2023-03-07 14:40:57,793][213771] Updated weights for policy 0, policy_version 26900 (0.0008) [2023-03-07 14:40:58,566][213771] Updated weights for policy 0, policy_version 26910 (0.0006) [2023-03-07 14:40:59,339][213771] Updated weights for policy 0, policy_version 26920 (0.0006) [2023-03-07 14:41:00,118][213771] Updated weights for policy 0, policy_version 26930 (0.0006) [2023-03-07 14:41:00,875][213771] Updated weights for policy 0, policy_version 26940 (0.0006) [2023-03-07 14:41:01,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13277.9, 300 sec: 13263.4). Total num frames: 27588608. Throughput: 0: 13299.6. Samples: 27559405. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:41:01,105][213445] Avg episode reward: [(0, '4275.874')] [2023-03-07 14:41:01,629][213771] Updated weights for policy 0, policy_version 26950 (0.0006) [2023-03-07 14:41:02,424][213771] Updated weights for policy 0, policy_version 26960 (0.0006) [2023-03-07 14:41:03,178][213771] Updated weights for policy 0, policy_version 26970 (0.0006) [2023-03-07 14:41:03,967][213771] Updated weights for policy 0, policy_version 26980 (0.0006) [2023-03-07 14:41:04,735][213771] Updated weights for policy 0, policy_version 26990 (0.0006) [2023-03-07 14:41:05,506][213771] Updated weights for policy 0, policy_version 27000 (0.0006) [2023-03-07 14:41:06,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13277.9, 300 sec: 13263.4). Total num frames: 27655168. Throughput: 0: 13290.6. Samples: 27638949. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:41:06,106][213445] Avg episode reward: [(0, '4209.491')] [2023-03-07 14:41:06,273][213771] Updated weights for policy 0, policy_version 27010 (0.0006) [2023-03-07 14:41:07,050][213771] Updated weights for policy 0, policy_version 27020 (0.0005) [2023-03-07 14:41:07,823][213771] Updated weights for policy 0, policy_version 27030 (0.0005) [2023-03-07 14:41:08,589][213771] Updated weights for policy 0, policy_version 27040 (0.0006) [2023-03-07 14:41:09,339][213771] Updated weights for policy 0, policy_version 27050 (0.0005) [2023-03-07 14:41:10,131][213771] Updated weights for policy 0, policy_version 27060 (0.0006) [2023-03-07 14:41:10,895][213771] Updated weights for policy 0, policy_version 27070 (0.0006) [2023-03-07 14:41:11,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13294.9, 300 sec: 13263.4). Total num frames: 27721728. Throughput: 0: 13292.1. Samples: 27718790. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:41:11,106][213445] Avg episode reward: [(0, '4291.737')] [2023-03-07 14:41:11,674][213771] Updated weights for policy 0, policy_version 27080 (0.0006) [2023-03-07 14:41:12,453][213771] Updated weights for policy 0, policy_version 27090 (0.0006) [2023-03-07 14:41:13,233][213771] Updated weights for policy 0, policy_version 27100 (0.0006) [2023-03-07 14:41:14,006][213771] Updated weights for policy 0, policy_version 27110 (0.0006) [2023-03-07 14:41:14,774][213771] Updated weights for policy 0, policy_version 27120 (0.0006) [2023-03-07 14:41:15,543][213771] Updated weights for policy 0, policy_version 27130 (0.0006) [2023-03-07 14:41:16,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13295.0, 300 sec: 13266.9). Total num frames: 27788288. Throughput: 0: 13283.9. Samples: 27758408. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:41:16,106][213445] Avg episode reward: [(0, '4291.577')] [2023-03-07 14:41:16,338][213771] Updated weights for policy 0, policy_version 27140 (0.0006) [2023-03-07 14:41:17,113][213771] Updated weights for policy 0, policy_version 27150 (0.0006) [2023-03-07 14:41:17,884][213771] Updated weights for policy 0, policy_version 27160 (0.0007) [2023-03-07 14:41:18,643][213771] Updated weights for policy 0, policy_version 27170 (0.0006) [2023-03-07 14:41:19,416][213771] Updated weights for policy 0, policy_version 27180 (0.0005) [2023-03-07 14:41:20,191][213771] Updated weights for policy 0, policy_version 27190 (0.0006) [2023-03-07 14:41:20,962][213771] Updated weights for policy 0, policy_version 27200 (0.0005) [2023-03-07 14:41:21,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13277.9, 300 sec: 13263.4). Total num frames: 27853824. Throughput: 0: 13272.8. Samples: 27837770. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:41:21,106][213445] Avg episode reward: [(0, '4359.551')] [2023-03-07 14:41:21,733][213771] Updated weights for policy 0, policy_version 27210 (0.0005) [2023-03-07 14:41:22,494][213771] Updated weights for policy 0, policy_version 27220 (0.0006) [2023-03-07 14:41:23,290][213771] Updated weights for policy 0, policy_version 27230 (0.0006) [2023-03-07 14:41:24,067][213771] Updated weights for policy 0, policy_version 27240 (0.0006) [2023-03-07 14:41:24,837][213771] Updated weights for policy 0, policy_version 27250 (0.0006) [2023-03-07 14:41:25,615][213771] Updated weights for policy 0, policy_version 27260 (0.0006) [2023-03-07 14:41:26,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13277.9, 300 sec: 13263.4). Total num frames: 27920384. Throughput: 0: 13265.6. Samples: 27917310. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:41:26,106][213445] Avg episode reward: [(0, '4333.282')] [2023-03-07 14:41:26,369][213771] Updated weights for policy 0, policy_version 27270 (0.0006) [2023-03-07 14:41:27,151][213771] Updated weights for policy 0, policy_version 27280 (0.0006) [2023-03-07 14:41:27,914][213771] Updated weights for policy 0, policy_version 27290 (0.0006) [2023-03-07 14:41:28,682][213771] Updated weights for policy 0, policy_version 27300 (0.0006) [2023-03-07 14:41:29,459][213771] Updated weights for policy 0, policy_version 27310 (0.0006) [2023-03-07 14:41:30,226][213771] Updated weights for policy 0, policy_version 27320 (0.0006) [2023-03-07 14:41:30,994][213771] Updated weights for policy 0, policy_version 27330 (0.0006) [2023-03-07 14:41:31,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13277.8, 300 sec: 13263.4). Total num frames: 27986944. Throughput: 0: 13260.5. Samples: 27957218. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:41:31,106][213445] Avg episode reward: [(0, '4374.650')] [2023-03-07 14:41:31,760][213771] Updated weights for policy 0, policy_version 27340 (0.0006) [2023-03-07 14:41:32,545][213771] Updated weights for policy 0, policy_version 27350 (0.0007) [2023-03-07 14:41:33,305][213771] Updated weights for policy 0, policy_version 27360 (0.0006) [2023-03-07 14:41:34,051][213771] Updated weights for policy 0, policy_version 27370 (0.0007) [2023-03-07 14:41:34,845][213771] Updated weights for policy 0, policy_version 27380 (0.0005) [2023-03-07 14:41:35,612][213771] Updated weights for policy 0, policy_version 27390 (0.0006) [2023-03-07 14:41:36,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13277.8, 300 sec: 13266.9). Total num frames: 28053504. Throughput: 0: 13271.0. Samples: 28037085. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:41:36,106][213445] Avg episode reward: [(0, '4361.457')] [2023-03-07 14:41:36,401][213771] Updated weights for policy 0, policy_version 27400 (0.0005) [2023-03-07 14:41:37,153][213771] Updated weights for policy 0, policy_version 27410 (0.0006) [2023-03-07 14:41:37,918][213771] Updated weights for policy 0, policy_version 27420 (0.0006) [2023-03-07 14:41:38,698][213771] Updated weights for policy 0, policy_version 27430 (0.0006) [2023-03-07 14:41:39,475][213771] Updated weights for policy 0, policy_version 27440 (0.0006) [2023-03-07 14:41:40,228][213771] Updated weights for policy 0, policy_version 27450 (0.0006) [2023-03-07 14:41:41,006][213771] Updated weights for policy 0, policy_version 27460 (0.0006) [2023-03-07 14:41:41,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13277.8, 300 sec: 13266.9). Total num frames: 28120064. Throughput: 0: 13275.7. Samples: 28116884. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:41:41,106][213445] Avg episode reward: [(0, '4362.875')] [2023-03-07 14:41:41,760][213771] Updated weights for policy 0, policy_version 27470 (0.0007) [2023-03-07 14:41:42,546][213771] Updated weights for policy 0, policy_version 27480 (0.0006) [2023-03-07 14:41:43,313][213771] Updated weights for policy 0, policy_version 27490 (0.0006) [2023-03-07 14:41:44,080][213771] Updated weights for policy 0, policy_version 27500 (0.0006) [2023-03-07 14:41:44,855][213771] Updated weights for policy 0, policy_version 27510 (0.0005) [2023-03-07 14:41:45,632][213771] Updated weights for policy 0, policy_version 27520 (0.0006) [2023-03-07 14:41:46,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13277.9, 300 sec: 13266.9). Total num frames: 28186624. Throughput: 0: 13270.3. Samples: 28156571. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:41:46,105][213445] Avg episode reward: [(0, '4377.322')] [2023-03-07 14:41:46,394][213771] Updated weights for policy 0, policy_version 27530 (0.0006) [2023-03-07 14:41:47,165][213771] Updated weights for policy 0, policy_version 27540 (0.0006) [2023-03-07 14:41:47,963][213771] Updated weights for policy 0, policy_version 27550 (0.0007) [2023-03-07 14:41:48,732][213771] Updated weights for policy 0, policy_version 27560 (0.0006) [2023-03-07 14:41:49,506][213771] Updated weights for policy 0, policy_version 27570 (0.0006) [2023-03-07 14:41:50,274][213771] Updated weights for policy 0, policy_version 27580 (0.0006) [2023-03-07 14:41:51,029][213771] Updated weights for policy 0, policy_version 27590 (0.0006) [2023-03-07 14:41:51,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13260.8, 300 sec: 13263.4). Total num frames: 28252160. Throughput: 0: 13270.3. Samples: 28236111. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:41:51,106][213445] Avg episode reward: [(0, '4396.814')] [2023-03-07 14:41:51,823][213771] Updated weights for policy 0, policy_version 27600 (0.0007) [2023-03-07 14:41:52,600][213771] Updated weights for policy 0, policy_version 27610 (0.0006) [2023-03-07 14:41:53,370][213771] Updated weights for policy 0, policy_version 27620 (0.0006) [2023-03-07 14:41:54,137][213771] Updated weights for policy 0, policy_version 27630 (0.0005) [2023-03-07 14:41:54,906][213771] Updated weights for policy 0, policy_version 27640 (0.0006) [2023-03-07 14:41:55,670][213771] Updated weights for policy 0, policy_version 27650 (0.0007) [2023-03-07 14:41:56,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13263.4). Total num frames: 28318720. Throughput: 0: 13265.6. Samples: 28315743. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:41:56,106][213445] Avg episode reward: [(0, '4240.198')] [2023-03-07 14:41:56,456][213771] Updated weights for policy 0, policy_version 27660 (0.0007) [2023-03-07 14:41:57,237][213771] Updated weights for policy 0, policy_version 27670 (0.0006) [2023-03-07 14:41:58,000][213771] Updated weights for policy 0, policy_version 27680 (0.0006) [2023-03-07 14:41:58,785][213771] Updated weights for policy 0, policy_version 27690 (0.0006) [2023-03-07 14:41:59,552][213771] Updated weights for policy 0, policy_version 27700 (0.0005) [2023-03-07 14:42:00,316][213771] Updated weights for policy 0, policy_version 27710 (0.0005) [2023-03-07 14:42:01,067][213771] Updated weights for policy 0, policy_version 27720 (0.0006) [2023-03-07 14:42:01,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13277.9, 300 sec: 13266.9). Total num frames: 28385280. Throughput: 0: 13264.1. Samples: 28355293. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:42:01,106][213445] Avg episode reward: [(0, '4308.431')] [2023-03-07 14:42:01,855][213771] Updated weights for policy 0, policy_version 27730 (0.0006) [2023-03-07 14:42:02,621][213771] Updated weights for policy 0, policy_version 27740 (0.0006) [2023-03-07 14:42:03,396][213771] Updated weights for policy 0, policy_version 27750 (0.0007) [2023-03-07 14:42:04,158][213771] Updated weights for policy 0, policy_version 27760 (0.0007) [2023-03-07 14:42:04,942][213771] Updated weights for policy 0, policy_version 27770 (0.0006) [2023-03-07 14:42:05,717][213771] Updated weights for policy 0, policy_version 27780 (0.0006) [2023-03-07 14:42:06,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13277.9, 300 sec: 13266.9). Total num frames: 28451840. Throughput: 0: 13274.5. Samples: 28435120. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:42:06,105][213445] Avg episode reward: [(0, '4346.075')] [2023-03-07 14:42:06,109][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000027785_28451840.pth... [2023-03-07 14:42:06,138][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000024676_25268224.pth [2023-03-07 14:42:06,480][213771] Updated weights for policy 0, policy_version 27790 (0.0006) [2023-03-07 14:42:07,265][213771] Updated weights for policy 0, policy_version 27800 (0.0006) [2023-03-07 14:42:08,032][213771] Updated weights for policy 0, policy_version 27810 (0.0006) [2023-03-07 14:42:08,810][213771] Updated weights for policy 0, policy_version 27820 (0.0006) [2023-03-07 14:42:09,574][213771] Updated weights for policy 0, policy_version 27830 (0.0006) [2023-03-07 14:42:10,380][213771] Updated weights for policy 0, policy_version 27840 (0.0006) [2023-03-07 14:42:11,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13266.9). Total num frames: 28517376. Throughput: 0: 13269.4. Samples: 28514433. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:42:11,105][213445] Avg episode reward: [(0, '4361.958')] [2023-03-07 14:42:11,156][213771] Updated weights for policy 0, policy_version 27850 (0.0006) [2023-03-07 14:42:11,914][213771] Updated weights for policy 0, policy_version 27860 (0.0006) [2023-03-07 14:42:12,688][213771] Updated weights for policy 0, policy_version 27870 (0.0007) [2023-03-07 14:42:13,470][213771] Updated weights for policy 0, policy_version 27880 (0.0006) [2023-03-07 14:42:14,245][213771] Updated weights for policy 0, policy_version 27890 (0.0006) [2023-03-07 14:42:15,006][213771] Updated weights for policy 0, policy_version 27900 (0.0006) [2023-03-07 14:42:15,785][213771] Updated weights for policy 0, policy_version 27910 (0.0006) [2023-03-07 14:42:16,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13263.4). Total num frames: 28583936. Throughput: 0: 13262.9. Samples: 28554047. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:42:16,106][213445] Avg episode reward: [(0, '4437.395')] [2023-03-07 14:42:16,573][213771] Updated weights for policy 0, policy_version 27920 (0.0006) [2023-03-07 14:42:17,346][213771] Updated weights for policy 0, policy_version 27930 (0.0006) [2023-03-07 14:42:18,131][213771] Updated weights for policy 0, policy_version 27940 (0.0006) [2023-03-07 14:42:18,877][213771] Updated weights for policy 0, policy_version 27950 (0.0005) [2023-03-07 14:42:19,645][213771] Updated weights for policy 0, policy_version 27960 (0.0006) [2023-03-07 14:42:20,409][213771] Updated weights for policy 0, policy_version 27970 (0.0006) [2023-03-07 14:42:21,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13260.8, 300 sec: 13259.9). Total num frames: 28649472. Throughput: 0: 13253.0. Samples: 28633469. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:42:21,116][213445] Avg episode reward: [(0, '4360.645')] [2023-03-07 14:42:21,188][213771] Updated weights for policy 0, policy_version 27980 (0.0006) [2023-03-07 14:42:21,978][213771] Updated weights for policy 0, policy_version 27990 (0.0006) [2023-03-07 14:42:22,743][213771] Updated weights for policy 0, policy_version 28000 (0.0006) [2023-03-07 14:42:23,505][213771] Updated weights for policy 0, policy_version 28010 (0.0006) [2023-03-07 14:42:24,285][213771] Updated weights for policy 0, policy_version 28020 (0.0006) [2023-03-07 14:42:25,061][213771] Updated weights for policy 0, policy_version 28030 (0.0006) [2023-03-07 14:42:25,842][213771] Updated weights for policy 0, policy_version 28040 (0.0007) [2023-03-07 14:42:26,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13263.4). Total num frames: 28716032. Throughput: 0: 13249.0. Samples: 28713087. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:42:26,116][213445] Avg episode reward: [(0, '4363.348')] [2023-03-07 14:42:26,607][213771] Updated weights for policy 0, policy_version 28050 (0.0006) [2023-03-07 14:42:27,367][213771] Updated weights for policy 0, policy_version 28060 (0.0006) [2023-03-07 14:42:28,137][213771] Updated weights for policy 0, policy_version 28070 (0.0006) [2023-03-07 14:42:28,914][213771] Updated weights for policy 0, policy_version 28080 (0.0006) [2023-03-07 14:42:29,673][213771] Updated weights for policy 0, policy_version 28090 (0.0006) [2023-03-07 14:42:30,453][213771] Updated weights for policy 0, policy_version 28100 (0.0006) [2023-03-07 14:42:31,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13263.4). Total num frames: 28782592. Throughput: 0: 13252.5. Samples: 28752937. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:42:31,116][213445] Avg episode reward: [(0, '4392.528')] [2023-03-07 14:42:31,238][213771] Updated weights for policy 0, policy_version 28110 (0.0006) [2023-03-07 14:42:31,997][213771] Updated weights for policy 0, policy_version 28120 (0.0006) [2023-03-07 14:42:32,766][213771] Updated weights for policy 0, policy_version 28130 (0.0006) [2023-03-07 14:42:33,543][213771] Updated weights for policy 0, policy_version 28140 (0.0006) [2023-03-07 14:42:34,301][213771] Updated weights for policy 0, policy_version 28150 (0.0005) [2023-03-07 14:42:35,085][213771] Updated weights for policy 0, policy_version 28160 (0.0006) [2023-03-07 14:42:35,873][213771] Updated weights for policy 0, policy_version 28170 (0.0006) [2023-03-07 14:42:36,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13260.8, 300 sec: 13263.4). Total num frames: 28849152. Throughput: 0: 13259.1. Samples: 28832772. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:42:36,116][213445] Avg episode reward: [(0, '4383.772')] [2023-03-07 14:42:36,641][213771] Updated weights for policy 0, policy_version 28180 (0.0006) [2023-03-07 14:42:37,401][213771] Updated weights for policy 0, policy_version 28190 (0.0006) [2023-03-07 14:42:38,176][213771] Updated weights for policy 0, policy_version 28200 (0.0007) [2023-03-07 14:42:38,953][213771] Updated weights for policy 0, policy_version 28210 (0.0006) [2023-03-07 14:42:39,721][213771] Updated weights for policy 0, policy_version 28220 (0.0007) [2023-03-07 14:42:40,475][213771] Updated weights for policy 0, policy_version 28230 (0.0005) [2023-03-07 14:42:41,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13260.8, 300 sec: 13263.4). Total num frames: 28915712. Throughput: 0: 13257.1. Samples: 28912309. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:42:41,116][213445] Avg episode reward: [(0, '4323.249')] [2023-03-07 14:42:41,251][213771] Updated weights for policy 0, policy_version 28240 (0.0006) [2023-03-07 14:42:42,005][213771] Updated weights for policy 0, policy_version 28250 (0.0006) [2023-03-07 14:42:42,781][213771] Updated weights for policy 0, policy_version 28260 (0.0006) [2023-03-07 14:42:43,559][213771] Updated weights for policy 0, policy_version 28270 (0.0006) [2023-03-07 14:42:44,325][213771] Updated weights for policy 0, policy_version 28280 (0.0006) [2023-03-07 14:42:45,098][213771] Updated weights for policy 0, policy_version 28290 (0.0006) [2023-03-07 14:42:45,885][213771] Updated weights for policy 0, policy_version 28300 (0.0006) [2023-03-07 14:42:46,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13260.8, 300 sec: 13263.4). Total num frames: 28982272. Throughput: 0: 13266.1. Samples: 28952271. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:42:46,117][213445] Avg episode reward: [(0, '4377.446')] [2023-03-07 14:42:46,643][213771] Updated weights for policy 0, policy_version 28310 (0.0006) [2023-03-07 14:42:47,403][213771] Updated weights for policy 0, policy_version 28320 (0.0006) [2023-03-07 14:42:48,168][213771] Updated weights for policy 0, policy_version 28330 (0.0006) [2023-03-07 14:42:48,948][213771] Updated weights for policy 0, policy_version 28340 (0.0006) [2023-03-07 14:42:49,712][213771] Updated weights for policy 0, policy_version 28350 (0.0006) [2023-03-07 14:42:50,480][213771] Updated weights for policy 0, policy_version 28360 (0.0006) [2023-03-07 14:42:51,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13277.9, 300 sec: 13263.4). Total num frames: 29048832. Throughput: 0: 13267.4. Samples: 29032152. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:42:51,117][213445] Avg episode reward: [(0, '4452.564')] [2023-03-07 14:42:51,250][213771] Updated weights for policy 0, policy_version 28370 (0.0006) [2023-03-07 14:42:52,017][213771] Updated weights for policy 0, policy_version 28380 (0.0006) [2023-03-07 14:42:52,794][213771] Updated weights for policy 0, policy_version 28390 (0.0006) [2023-03-07 14:42:53,555][213771] Updated weights for policy 0, policy_version 28400 (0.0006) [2023-03-07 14:42:54,333][213771] Updated weights for policy 0, policy_version 28410 (0.0006) [2023-03-07 14:42:55,123][213771] Updated weights for policy 0, policy_version 28420 (0.0006) [2023-03-07 14:42:55,878][213771] Updated weights for policy 0, policy_version 28430 (0.0006) [2023-03-07 14:42:56,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13277.9, 300 sec: 13266.9). Total num frames: 29115392. Throughput: 0: 13273.9. Samples: 29111761. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:42:56,116][213445] Avg episode reward: [(0, '4463.623')] [2023-03-07 14:42:56,639][213771] Updated weights for policy 0, policy_version 28440 (0.0006) [2023-03-07 14:42:57,427][213771] Updated weights for policy 0, policy_version 28450 (0.0006) [2023-03-07 14:42:58,179][213771] Updated weights for policy 0, policy_version 28460 (0.0006) [2023-03-07 14:42:58,955][213771] Updated weights for policy 0, policy_version 28470 (0.0006) [2023-03-07 14:42:59,721][213771] Updated weights for policy 0, policy_version 28480 (0.0006) [2023-03-07 14:43:00,464][213771] Updated weights for policy 0, policy_version 28490 (0.0006) [2023-03-07 14:43:01,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13277.8, 300 sec: 13266.9). Total num frames: 29181952. Throughput: 0: 13281.7. Samples: 29151724. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 14:43:01,116][213445] Avg episode reward: [(0, '4451.202')] [2023-03-07 14:43:01,242][213771] Updated weights for policy 0, policy_version 28500 (0.0005) [2023-03-07 14:43:02,013][213771] Updated weights for policy 0, policy_version 28510 (0.0006) [2023-03-07 14:43:02,791][213771] Updated weights for policy 0, policy_version 28520 (0.0006) [2023-03-07 14:43:03,550][213771] Updated weights for policy 0, policy_version 28530 (0.0006) [2023-03-07 14:43:04,341][213771] Updated weights for policy 0, policy_version 28540 (0.0006) [2023-03-07 14:43:05,107][213771] Updated weights for policy 0, policy_version 28550 (0.0006) [2023-03-07 14:43:05,876][213771] Updated weights for policy 0, policy_version 28560 (0.0006) [2023-03-07 14:43:06,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13266.9). Total num frames: 29247488. Throughput: 0: 13291.1. Samples: 29231569. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 14:43:06,116][213445] Avg episode reward: [(0, '4428.155')] [2023-03-07 14:43:06,656][213771] Updated weights for policy 0, policy_version 28570 (0.0006) [2023-03-07 14:43:07,418][213771] Updated weights for policy 0, policy_version 28580 (0.0006) [2023-03-07 14:43:08,177][213771] Updated weights for policy 0, policy_version 28590 (0.0006) [2023-03-07 14:43:08,974][213771] Updated weights for policy 0, policy_version 28600 (0.0007) [2023-03-07 14:43:09,740][213771] Updated weights for policy 0, policy_version 28610 (0.0006) [2023-03-07 14:43:10,521][213771] Updated weights for policy 0, policy_version 28620 (0.0007) [2023-03-07 14:43:11,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13277.8, 300 sec: 13266.9). Total num frames: 29314048. Throughput: 0: 13291.2. Samples: 29311192. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 14:43:11,106][213445] Avg episode reward: [(0, '4290.517')] [2023-03-07 14:43:11,294][213771] Updated weights for policy 0, policy_version 28630 (0.0007) [2023-03-07 14:43:12,062][213771] Updated weights for policy 0, policy_version 28640 (0.0005) [2023-03-07 14:43:12,830][213771] Updated weights for policy 0, policy_version 28650 (0.0005) [2023-03-07 14:43:13,591][213771] Updated weights for policy 0, policy_version 28660 (0.0006) [2023-03-07 14:43:14,374][213771] Updated weights for policy 0, policy_version 28670 (0.0006) [2023-03-07 14:43:15,145][213771] Updated weights for policy 0, policy_version 28680 (0.0008) [2023-03-07 14:43:15,930][213771] Updated weights for policy 0, policy_version 28690 (0.0005) [2023-03-07 14:43:16,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13277.9, 300 sec: 13270.3). Total num frames: 29380608. Throughput: 0: 13289.0. Samples: 29350941. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 14:43:16,106][213445] Avg episode reward: [(0, '4457.426')] [2023-03-07 14:43:16,687][213771] Updated weights for policy 0, policy_version 28700 (0.0006) [2023-03-07 14:43:17,456][213771] Updated weights for policy 0, policy_version 28710 (0.0005) [2023-03-07 14:43:18,228][213771] Updated weights for policy 0, policy_version 28720 (0.0006) [2023-03-07 14:43:18,990][213771] Updated weights for policy 0, policy_version 28730 (0.0007) [2023-03-07 14:43:19,737][213771] Updated weights for policy 0, policy_version 28740 (0.0006) [2023-03-07 14:43:20,513][213771] Updated weights for policy 0, policy_version 28750 (0.0006) [2023-03-07 14:43:21,105][213445] Fps is (10 sec: 13312.3, 60 sec: 13295.0, 300 sec: 13270.4). Total num frames: 29447168. Throughput: 0: 13291.2. Samples: 29430873. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:43:21,106][213445] Avg episode reward: [(0, '4485.054')] [2023-03-07 14:43:21,292][213771] Updated weights for policy 0, policy_version 28760 (0.0007) [2023-03-07 14:43:22,065][213771] Updated weights for policy 0, policy_version 28770 (0.0006) [2023-03-07 14:43:22,834][213771] Updated weights for policy 0, policy_version 28780 (0.0006) [2023-03-07 14:43:23,628][213771] Updated weights for policy 0, policy_version 28790 (0.0006) [2023-03-07 14:43:24,402][213771] Updated weights for policy 0, policy_version 28800 (0.0005) [2023-03-07 14:43:25,169][213771] Updated weights for policy 0, policy_version 28810 (0.0006) [2023-03-07 14:43:25,932][213771] Updated weights for policy 0, policy_version 28820 (0.0006) [2023-03-07 14:43:26,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13294.9, 300 sec: 13270.3). Total num frames: 29513728. Throughput: 0: 13288.7. Samples: 29510300. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:43:26,106][213445] Avg episode reward: [(0, '4377.439')] [2023-03-07 14:43:26,727][213771] Updated weights for policy 0, policy_version 28830 (0.0005) [2023-03-07 14:43:27,476][213771] Updated weights for policy 0, policy_version 28840 (0.0006) [2023-03-07 14:43:28,233][213771] Updated weights for policy 0, policy_version 28850 (0.0006) [2023-03-07 14:43:28,999][213771] Updated weights for policy 0, policy_version 28860 (0.0006) [2023-03-07 14:43:29,779][213771] Updated weights for policy 0, policy_version 28870 (0.0006) [2023-03-07 14:43:30,558][213771] Updated weights for policy 0, policy_version 28880 (0.0006) [2023-03-07 14:43:31,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13277.9, 300 sec: 13270.4). Total num frames: 29579264. Throughput: 0: 13287.2. Samples: 29550193. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:43:31,106][213445] Avg episode reward: [(0, '4470.025')] [2023-03-07 14:43:31,345][213771] Updated weights for policy 0, policy_version 28890 (0.0006) [2023-03-07 14:43:32,108][213771] Updated weights for policy 0, policy_version 28900 (0.0006) [2023-03-07 14:43:32,881][213771] Updated weights for policy 0, policy_version 28910 (0.0006) [2023-03-07 14:43:33,635][213771] Updated weights for policy 0, policy_version 28920 (0.0006) [2023-03-07 14:43:34,409][213771] Updated weights for policy 0, policy_version 28930 (0.0005) [2023-03-07 14:43:35,183][213771] Updated weights for policy 0, policy_version 28940 (0.0005) [2023-03-07 14:43:35,967][213771] Updated weights for policy 0, policy_version 28950 (0.0006) [2023-03-07 14:43:36,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13277.9, 300 sec: 13270.4). Total num frames: 29645824. Throughput: 0: 13281.9. Samples: 29629836. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:43:36,105][213445] Avg episode reward: [(0, '4455.656')] [2023-03-07 14:43:36,749][213771] Updated weights for policy 0, policy_version 28960 (0.0005) [2023-03-07 14:43:37,530][213771] Updated weights for policy 0, policy_version 28970 (0.0007) [2023-03-07 14:43:38,312][213771] Updated weights for policy 0, policy_version 28980 (0.0007) [2023-03-07 14:43:39,074][213771] Updated weights for policy 0, policy_version 28990 (0.0006) [2023-03-07 14:43:39,847][213771] Updated weights for policy 0, policy_version 29000 (0.0006) [2023-03-07 14:43:40,637][213771] Updated weights for policy 0, policy_version 29010 (0.0006) [2023-03-07 14:43:41,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13277.9, 300 sec: 13270.4). Total num frames: 29712384. Throughput: 0: 13273.2. Samples: 29709056. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:43:41,105][213445] Avg episode reward: [(0, '4451.489')] [2023-03-07 14:43:41,397][213771] Updated weights for policy 0, policy_version 29020 (0.0006) [2023-03-07 14:43:42,168][213771] Updated weights for policy 0, policy_version 29030 (0.0007) [2023-03-07 14:43:42,943][213771] Updated weights for policy 0, policy_version 29040 (0.0005) [2023-03-07 14:43:43,700][213771] Updated weights for policy 0, policy_version 29050 (0.0007) [2023-03-07 14:43:44,479][213771] Updated weights for policy 0, policy_version 29060 (0.0006) [2023-03-07 14:43:45,253][213771] Updated weights for policy 0, policy_version 29070 (0.0006) [2023-03-07 14:43:46,025][213771] Updated weights for policy 0, policy_version 29080 (0.0007) [2023-03-07 14:43:46,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13270.3). Total num frames: 29777920. Throughput: 0: 13269.8. Samples: 29748866. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:43:46,106][213445] Avg episode reward: [(0, '4435.263')] [2023-03-07 14:43:46,781][213771] Updated weights for policy 0, policy_version 29090 (0.0006) [2023-03-07 14:43:47,561][213771] Updated weights for policy 0, policy_version 29100 (0.0006) [2023-03-07 14:43:48,321][213771] Updated weights for policy 0, policy_version 29110 (0.0007) [2023-03-07 14:43:49,089][213771] Updated weights for policy 0, policy_version 29120 (0.0006) [2023-03-07 14:43:49,869][213771] Updated weights for policy 0, policy_version 29130 (0.0006) [2023-03-07 14:43:50,646][213771] Updated weights for policy 0, policy_version 29140 (0.0006) [2023-03-07 14:43:51,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13277.9, 300 sec: 13273.8). Total num frames: 29845504. Throughput: 0: 13269.2. Samples: 29828680. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:43:51,105][213445] Avg episode reward: [(0, '4432.508')] [2023-03-07 14:43:51,401][213771] Updated weights for policy 0, policy_version 29150 (0.0006) [2023-03-07 14:43:52,182][213771] Updated weights for policy 0, policy_version 29160 (0.0006) [2023-03-07 14:43:52,945][213771] Updated weights for policy 0, policy_version 29170 (0.0006) [2023-03-07 14:43:53,723][213771] Updated weights for policy 0, policy_version 29180 (0.0006) [2023-03-07 14:43:54,478][213771] Updated weights for policy 0, policy_version 29190 (0.0007) [2023-03-07 14:43:55,254][213771] Updated weights for policy 0, policy_version 29200 (0.0007) [2023-03-07 14:43:56,019][213771] Updated weights for policy 0, policy_version 29210 (0.0006) [2023-03-07 14:43:56,105][213445] Fps is (10 sec: 13414.5, 60 sec: 13277.9, 300 sec: 13273.8). Total num frames: 29912064. Throughput: 0: 13272.1. Samples: 29908436. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:43:56,106][213445] Avg episode reward: [(0, '4358.325')] [2023-03-07 14:43:56,787][213771] Updated weights for policy 0, policy_version 29220 (0.0006) [2023-03-07 14:43:57,565][213771] Updated weights for policy 0, policy_version 29230 (0.0006) [2023-03-07 14:43:58,337][213771] Updated weights for policy 0, policy_version 29240 (0.0005) [2023-03-07 14:43:59,105][213771] Updated weights for policy 0, policy_version 29250 (0.0007) [2023-03-07 14:43:59,882][213771] Updated weights for policy 0, policy_version 29260 (0.0007) [2023-03-07 14:44:00,650][213771] Updated weights for policy 0, policy_version 29270 (0.0006) [2023-03-07 14:44:01,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13260.8, 300 sec: 13273.8). Total num frames: 29977600. Throughput: 0: 13275.1. Samples: 29948320. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:44:01,106][213445] Avg episode reward: [(0, '4349.063')] [2023-03-07 14:44:01,433][213771] Updated weights for policy 0, policy_version 29280 (0.0007) [2023-03-07 14:44:02,211][213771] Updated weights for policy 0, policy_version 29290 (0.0006) [2023-03-07 14:44:02,990][213771] Updated weights for policy 0, policy_version 29300 (0.0006) [2023-03-07 14:44:03,758][213771] Updated weights for policy 0, policy_version 29310 (0.0006) [2023-03-07 14:44:04,562][213771] Updated weights for policy 0, policy_version 29320 (0.0006) [2023-03-07 14:44:05,324][213771] Updated weights for policy 0, policy_version 29330 (0.0006) [2023-03-07 14:44:06,087][213771] Updated weights for policy 0, policy_version 29340 (0.0005) [2023-03-07 14:44:06,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13277.9, 300 sec: 13273.8). Total num frames: 30044160. Throughput: 0: 13257.6. Samples: 30027466. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:44:06,106][213445] Avg episode reward: [(0, '4386.523')] [2023-03-07 14:44:06,111][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000029340_30044160.pth... [2023-03-07 14:44:06,141][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000026229_26858496.pth [2023-03-07 14:44:06,879][213771] Updated weights for policy 0, policy_version 29350 (0.0006) [2023-03-07 14:44:07,673][213771] Updated weights for policy 0, policy_version 29360 (0.0006) [2023-03-07 14:44:08,438][213771] Updated weights for policy 0, policy_version 29370 (0.0006) [2023-03-07 14:44:09,214][213771] Updated weights for policy 0, policy_version 29380 (0.0007) [2023-03-07 14:44:09,990][213771] Updated weights for policy 0, policy_version 29390 (0.0007) [2023-03-07 14:44:10,761][213771] Updated weights for policy 0, policy_version 29400 (0.0006) [2023-03-07 14:44:11,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13260.8, 300 sec: 13270.3). Total num frames: 30109696. Throughput: 0: 13252.4. Samples: 30106660. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:44:11,106][213445] Avg episode reward: [(0, '4480.311')] [2023-03-07 14:44:11,533][213771] Updated weights for policy 0, policy_version 29410 (0.0006) [2023-03-07 14:44:12,318][213771] Updated weights for policy 0, policy_version 29420 (0.0006) [2023-03-07 14:44:13,087][213771] Updated weights for policy 0, policy_version 29430 (0.0006) [2023-03-07 14:44:13,856][213771] Updated weights for policy 0, policy_version 29440 (0.0006) [2023-03-07 14:44:14,637][213771] Updated weights for policy 0, policy_version 29450 (0.0006) [2023-03-07 14:44:15,425][213771] Updated weights for policy 0, policy_version 29460 (0.0006) [2023-03-07 14:44:16,105][213445] Fps is (10 sec: 13107.1, 60 sec: 13243.7, 300 sec: 13266.9). Total num frames: 30175232. Throughput: 0: 13245.2. Samples: 30146228. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:44:16,106][213445] Avg episode reward: [(0, '4452.011')] [2023-03-07 14:44:16,196][213771] Updated weights for policy 0, policy_version 29470 (0.0006) [2023-03-07 14:44:16,969][213771] Updated weights for policy 0, policy_version 29480 (0.0006) [2023-03-07 14:44:17,746][213771] Updated weights for policy 0, policy_version 29490 (0.0006) [2023-03-07 14:44:18,520][213771] Updated weights for policy 0, policy_version 29500 (0.0005) [2023-03-07 14:44:19,278][213771] Updated weights for policy 0, policy_version 29510 (0.0006) [2023-03-07 14:44:20,060][213771] Updated weights for policy 0, policy_version 29520 (0.0006) [2023-03-07 14:44:20,819][213771] Updated weights for policy 0, policy_version 29530 (0.0005) [2023-03-07 14:44:21,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13270.4). Total num frames: 30241792. Throughput: 0: 13238.9. Samples: 30225588. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:44:21,106][213445] Avg episode reward: [(0, '4465.193')] [2023-03-07 14:44:21,588][213771] Updated weights for policy 0, policy_version 29540 (0.0007) [2023-03-07 14:44:22,360][213771] Updated weights for policy 0, policy_version 29550 (0.0007) [2023-03-07 14:44:23,133][213771] Updated weights for policy 0, policy_version 29560 (0.0006) [2023-03-07 14:44:23,923][213771] Updated weights for policy 0, policy_version 29570 (0.0006) [2023-03-07 14:44:24,699][213771] Updated weights for policy 0, policy_version 29580 (0.0006) [2023-03-07 14:44:25,475][213771] Updated weights for policy 0, policy_version 29590 (0.0006) [2023-03-07 14:44:26,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13243.7, 300 sec: 13270.3). Total num frames: 30308352. Throughput: 0: 13244.3. Samples: 30305051. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:44:26,106][213445] Avg episode reward: [(0, '4479.790')] [2023-03-07 14:44:26,245][213771] Updated weights for policy 0, policy_version 29600 (0.0007) [2023-03-07 14:44:27,001][213771] Updated weights for policy 0, policy_version 29610 (0.0006) [2023-03-07 14:44:27,786][213771] Updated weights for policy 0, policy_version 29620 (0.0006) [2023-03-07 14:44:28,543][213771] Updated weights for policy 0, policy_version 29630 (0.0006) [2023-03-07 14:44:29,331][213771] Updated weights for policy 0, policy_version 29640 (0.0006) [2023-03-07 14:44:30,097][213771] Updated weights for policy 0, policy_version 29650 (0.0006) [2023-03-07 14:44:30,876][213771] Updated weights for policy 0, policy_version 29660 (0.0007) [2023-03-07 14:44:31,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13260.8, 300 sec: 13273.8). Total num frames: 30374912. Throughput: 0: 13246.7. Samples: 30344965. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:44:31,106][213445] Avg episode reward: [(0, '4485.116')] [2023-03-07 14:44:31,638][213771] Updated weights for policy 0, policy_version 29670 (0.0006) [2023-03-07 14:44:32,393][213771] Updated weights for policy 0, policy_version 29680 (0.0006) [2023-03-07 14:44:33,170][213771] Updated weights for policy 0, policy_version 29690 (0.0006) [2023-03-07 14:44:33,921][213771] Updated weights for policy 0, policy_version 29700 (0.0006) [2023-03-07 14:44:34,706][213771] Updated weights for policy 0, policy_version 29710 (0.0006) [2023-03-07 14:44:35,479][213771] Updated weights for policy 0, policy_version 29720 (0.0006) [2023-03-07 14:44:36,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13273.8). Total num frames: 30441472. Throughput: 0: 13244.0. Samples: 30424662. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:44:36,106][213445] Avg episode reward: [(0, '4412.385')] [2023-03-07 14:44:36,248][213771] Updated weights for policy 0, policy_version 29730 (0.0006) [2023-03-07 14:44:37,030][213771] Updated weights for policy 0, policy_version 29740 (0.0006) [2023-03-07 14:44:37,794][213771] Updated weights for policy 0, policy_version 29750 (0.0006) [2023-03-07 14:44:38,576][213771] Updated weights for policy 0, policy_version 29760 (0.0007) [2023-03-07 14:44:39,354][213771] Updated weights for policy 0, policy_version 29770 (0.0007) [2023-03-07 14:44:40,116][213771] Updated weights for policy 0, policy_version 29780 (0.0006) [2023-03-07 14:44:40,897][213771] Updated weights for policy 0, policy_version 29790 (0.0006) [2023-03-07 14:44:41,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13270.3). Total num frames: 30507008. Throughput: 0: 13238.2. Samples: 30504154. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:44:41,106][213445] Avg episode reward: [(0, '4427.162')] [2023-03-07 14:44:41,659][213771] Updated weights for policy 0, policy_version 29800 (0.0006) [2023-03-07 14:44:42,446][213771] Updated weights for policy 0, policy_version 29810 (0.0006) [2023-03-07 14:44:43,222][213771] Updated weights for policy 0, policy_version 29820 (0.0005) [2023-03-07 14:44:43,997][213771] Updated weights for policy 0, policy_version 29830 (0.0006) [2023-03-07 14:44:44,771][213771] Updated weights for policy 0, policy_version 29840 (0.0006) [2023-03-07 14:44:45,549][213771] Updated weights for policy 0, policy_version 29850 (0.0005) [2023-03-07 14:44:46,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13260.8, 300 sec: 13270.4). Total num frames: 30573568. Throughput: 0: 13231.5. Samples: 30543734. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:44:46,105][213445] Avg episode reward: [(0, '4428.485')] [2023-03-07 14:44:46,336][213771] Updated weights for policy 0, policy_version 29860 (0.0006) [2023-03-07 14:44:47,098][213771] Updated weights for policy 0, policy_version 29870 (0.0007) [2023-03-07 14:44:47,851][213771] Updated weights for policy 0, policy_version 29880 (0.0006) [2023-03-07 14:44:48,617][213771] Updated weights for policy 0, policy_version 29890 (0.0006) [2023-03-07 14:44:49,383][213771] Updated weights for policy 0, policy_version 29900 (0.0006) [2023-03-07 14:44:50,168][213771] Updated weights for policy 0, policy_version 29910 (0.0006) [2023-03-07 14:44:50,933][213771] Updated weights for policy 0, policy_version 29920 (0.0006) [2023-03-07 14:44:51,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13243.7, 300 sec: 13270.3). Total num frames: 30640128. Throughput: 0: 13241.1. Samples: 30623313. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:44:51,105][213445] Avg episode reward: [(0, '4533.336')] [2023-03-07 14:44:51,106][213720] Saving new best policy, reward=4533.336! [2023-03-07 14:44:51,736][213771] Updated weights for policy 0, policy_version 29930 (0.0006) [2023-03-07 14:44:52,506][213771] Updated weights for policy 0, policy_version 29940 (0.0006) [2023-03-07 14:44:53,286][213771] Updated weights for policy 0, policy_version 29950 (0.0006) [2023-03-07 14:44:54,058][213771] Updated weights for policy 0, policy_version 29960 (0.0006) [2023-03-07 14:44:54,825][213771] Updated weights for policy 0, policy_version 29970 (0.0006) [2023-03-07 14:44:55,597][213771] Updated weights for policy 0, policy_version 29980 (0.0006) [2023-03-07 14:44:56,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13226.6, 300 sec: 13266.9). Total num frames: 30705664. Throughput: 0: 13245.1. Samples: 30702688. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:44:56,116][213445] Avg episode reward: [(0, '4515.958')] [2023-03-07 14:44:56,373][213771] Updated weights for policy 0, policy_version 29990 (0.0008) [2023-03-07 14:44:57,137][213771] Updated weights for policy 0, policy_version 30000 (0.0006) [2023-03-07 14:44:57,925][213771] Updated weights for policy 0, policy_version 30010 (0.0007) [2023-03-07 14:44:58,679][213771] Updated weights for policy 0, policy_version 30020 (0.0006) [2023-03-07 14:44:59,455][213771] Updated weights for policy 0, policy_version 30030 (0.0007) [2023-03-07 14:45:00,222][213771] Updated weights for policy 0, policy_version 30040 (0.0006) [2023-03-07 14:45:00,989][213771] Updated weights for policy 0, policy_version 30050 (0.0007) [2023-03-07 14:45:01,105][213445] Fps is (10 sec: 13209.3, 60 sec: 13243.7, 300 sec: 13266.9). Total num frames: 30772224. Throughput: 0: 13247.8. Samples: 30742380. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:45:01,116][213445] Avg episode reward: [(0, '4499.027')] [2023-03-07 14:45:01,763][213771] Updated weights for policy 0, policy_version 30060 (0.0006) [2023-03-07 14:45:02,542][213771] Updated weights for policy 0, policy_version 30070 (0.0006) [2023-03-07 14:45:03,312][213771] Updated weights for policy 0, policy_version 30080 (0.0006) [2023-03-07 14:45:04,089][213771] Updated weights for policy 0, policy_version 30090 (0.0006) [2023-03-07 14:45:04,859][213771] Updated weights for policy 0, policy_version 30100 (0.0005) [2023-03-07 14:45:05,628][213771] Updated weights for policy 0, policy_version 30110 (0.0006) [2023-03-07 14:45:06,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13243.8, 300 sec: 13270.3). Total num frames: 30838784. Throughput: 0: 13253.9. Samples: 30822014. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:45:06,116][213445] Avg episode reward: [(0, '4504.351')] [2023-03-07 14:45:06,387][213771] Updated weights for policy 0, policy_version 30120 (0.0006) [2023-03-07 14:45:07,154][213771] Updated weights for policy 0, policy_version 30130 (0.0005) [2023-03-07 14:45:07,928][213771] Updated weights for policy 0, policy_version 30140 (0.0006) [2023-03-07 14:45:08,695][213771] Updated weights for policy 0, policy_version 30150 (0.0007) [2023-03-07 14:45:09,469][213771] Updated weights for policy 0, policy_version 30160 (0.0006) [2023-03-07 14:45:10,249][213771] Updated weights for policy 0, policy_version 30170 (0.0007) [2023-03-07 14:45:11,022][213771] Updated weights for policy 0, policy_version 30180 (0.0006) [2023-03-07 14:45:11,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13243.7, 300 sec: 13266.9). Total num frames: 30904320. Throughput: 0: 13255.9. Samples: 30901563. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:45:11,116][213445] Avg episode reward: [(0, '4442.716')] [2023-03-07 14:45:11,806][213771] Updated weights for policy 0, policy_version 30190 (0.0006) [2023-03-07 14:45:12,572][213771] Updated weights for policy 0, policy_version 30200 (0.0007) [2023-03-07 14:45:13,353][213771] Updated weights for policy 0, policy_version 30210 (0.0006) [2023-03-07 14:45:14,126][213771] Updated weights for policy 0, policy_version 30220 (0.0006) [2023-03-07 14:45:14,894][213771] Updated weights for policy 0, policy_version 30230 (0.0006) [2023-03-07 14:45:15,668][213771] Updated weights for policy 0, policy_version 30240 (0.0006) [2023-03-07 14:45:16,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13260.8, 300 sec: 13266.9). Total num frames: 30970880. Throughput: 0: 13253.3. Samples: 30941366. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:45:16,116][213445] Avg episode reward: [(0, '4448.741')] [2023-03-07 14:45:16,464][213771] Updated weights for policy 0, policy_version 30250 (0.0006) [2023-03-07 14:45:17,225][213771] Updated weights for policy 0, policy_version 30260 (0.0006) [2023-03-07 14:45:18,014][213771] Updated weights for policy 0, policy_version 30270 (0.0006) [2023-03-07 14:45:18,790][213771] Updated weights for policy 0, policy_version 30280 (0.0006) [2023-03-07 14:45:19,573][213771] Updated weights for policy 0, policy_version 30290 (0.0005) [2023-03-07 14:45:20,329][213771] Updated weights for policy 0, policy_version 30300 (0.0006) [2023-03-07 14:45:21,104][213771] Updated weights for policy 0, policy_version 30310 (0.0006) [2023-03-07 14:45:21,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13266.9). Total num frames: 31037440. Throughput: 0: 13243.8. Samples: 31020631. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:45:21,115][213445] Avg episode reward: [(0, '4459.699')] [2023-03-07 14:45:21,878][213771] Updated weights for policy 0, policy_version 30320 (0.0006) [2023-03-07 14:45:22,660][213771] Updated weights for policy 0, policy_version 30330 (0.0006) [2023-03-07 14:45:23,426][213771] Updated weights for policy 0, policy_version 30340 (0.0007) [2023-03-07 14:45:24,188][213771] Updated weights for policy 0, policy_version 30350 (0.0006) [2023-03-07 14:45:24,968][213771] Updated weights for policy 0, policy_version 30360 (0.0007) [2023-03-07 14:45:25,729][213771] Updated weights for policy 0, policy_version 30370 (0.0006) [2023-03-07 14:45:26,105][213445] Fps is (10 sec: 13209.9, 60 sec: 13243.8, 300 sec: 13263.4). Total num frames: 31102976. Throughput: 0: 13249.8. Samples: 31100393. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:45:26,116][213445] Avg episode reward: [(0, '4517.480')] [2023-03-07 14:45:26,501][213771] Updated weights for policy 0, policy_version 30380 (0.0006) [2023-03-07 14:45:27,265][213771] Updated weights for policy 0, policy_version 30390 (0.0007) [2023-03-07 14:45:28,044][213771] Updated weights for policy 0, policy_version 30400 (0.0006) [2023-03-07 14:45:28,817][213771] Updated weights for policy 0, policy_version 30410 (0.0006) [2023-03-07 14:45:29,581][213771] Updated weights for policy 0, policy_version 30420 (0.0006) [2023-03-07 14:45:30,358][213771] Updated weights for policy 0, policy_version 30430 (0.0007) [2023-03-07 14:45:31,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13263.4). Total num frames: 31169536. Throughput: 0: 13251.1. Samples: 31140035. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:45:31,116][213445] Avg episode reward: [(0, '4487.400')] [2023-03-07 14:45:31,146][213771] Updated weights for policy 0, policy_version 30440 (0.0006) [2023-03-07 14:45:31,915][213771] Updated weights for policy 0, policy_version 30450 (0.0006) [2023-03-07 14:45:32,702][213771] Updated weights for policy 0, policy_version 30460 (0.0006) [2023-03-07 14:45:33,457][213771] Updated weights for policy 0, policy_version 30470 (0.0006) [2023-03-07 14:45:34,230][213771] Updated weights for policy 0, policy_version 30480 (0.0008) [2023-03-07 14:45:35,000][213771] Updated weights for policy 0, policy_version 30490 (0.0006) [2023-03-07 14:45:35,765][213771] Updated weights for policy 0, policy_version 30500 (0.0006) [2023-03-07 14:45:36,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13243.7, 300 sec: 13263.4). Total num frames: 31236096. Throughput: 0: 13247.7. Samples: 31219462. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:45:36,116][213445] Avg episode reward: [(0, '4467.386')] [2023-03-07 14:45:36,537][213771] Updated weights for policy 0, policy_version 30510 (0.0006) [2023-03-07 14:45:37,324][213771] Updated weights for policy 0, policy_version 30520 (0.0006) [2023-03-07 14:45:38,081][213771] Updated weights for policy 0, policy_version 30530 (0.0005) [2023-03-07 14:45:38,854][213771] Updated weights for policy 0, policy_version 30540 (0.0007) [2023-03-07 14:45:39,628][213771] Updated weights for policy 0, policy_version 30550 (0.0006) [2023-03-07 14:45:40,395][213771] Updated weights for policy 0, policy_version 30560 (0.0006) [2023-03-07 14:45:41,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13263.4). Total num frames: 31302656. Throughput: 0: 13255.3. Samples: 31299176. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:45:41,117][213445] Avg episode reward: [(0, '4451.675')] [2023-03-07 14:45:41,157][213771] Updated weights for policy 0, policy_version 30570 (0.0006) [2023-03-07 14:45:41,925][213771] Updated weights for policy 0, policy_version 30580 (0.0006) [2023-03-07 14:45:42,695][213771] Updated weights for policy 0, policy_version 30590 (0.0006) [2023-03-07 14:45:43,468][213771] Updated weights for policy 0, policy_version 30600 (0.0007) [2023-03-07 14:45:44,253][213771] Updated weights for policy 0, policy_version 30610 (0.0006) [2023-03-07 14:45:45,008][213771] Updated weights for policy 0, policy_version 30620 (0.0006) [2023-03-07 14:45:45,769][213771] Updated weights for policy 0, policy_version 30630 (0.0006) [2023-03-07 14:45:46,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13260.8, 300 sec: 13263.4). Total num frames: 31369216. Throughput: 0: 13264.8. Samples: 31339294. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:45:46,116][213445] Avg episode reward: [(0, '4534.912')] [2023-03-07 14:45:46,120][213720] Saving new best policy, reward=4534.912! [2023-03-07 14:45:46,561][213771] Updated weights for policy 0, policy_version 30640 (0.0006) [2023-03-07 14:45:47,337][213771] Updated weights for policy 0, policy_version 30650 (0.0006) [2023-03-07 14:45:48,107][213771] Updated weights for policy 0, policy_version 30660 (0.0006) [2023-03-07 14:45:48,878][213771] Updated weights for policy 0, policy_version 30670 (0.0005) [2023-03-07 14:45:49,639][213771] Updated weights for policy 0, policy_version 30680 (0.0006) [2023-03-07 14:45:50,418][213771] Updated weights for policy 0, policy_version 30690 (0.0006) [2023-03-07 14:45:51,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13259.9). Total num frames: 31434752. Throughput: 0: 13265.3. Samples: 31418953. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:45:51,106][213445] Avg episode reward: [(0, '4491.654')] [2023-03-07 14:45:51,208][213771] Updated weights for policy 0, policy_version 30700 (0.0006) [2023-03-07 14:45:51,967][213771] Updated weights for policy 0, policy_version 30710 (0.0006) [2023-03-07 14:45:52,720][213771] Updated weights for policy 0, policy_version 30720 (0.0006) [2023-03-07 14:45:53,522][213771] Updated weights for policy 0, policy_version 30730 (0.0007) [2023-03-07 14:45:54,301][213771] Updated weights for policy 0, policy_version 30740 (0.0006) [2023-03-07 14:45:55,083][213771] Updated weights for policy 0, policy_version 30750 (0.0006) [2023-03-07 14:45:55,869][213771] Updated weights for policy 0, policy_version 30760 (0.0006) [2023-03-07 14:45:56,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13260.8, 300 sec: 13263.4). Total num frames: 31501312. Throughput: 0: 13252.9. Samples: 31497945. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:45:56,105][213445] Avg episode reward: [(0, '4530.780')] [2023-03-07 14:45:56,637][213771] Updated weights for policy 0, policy_version 30770 (0.0006) [2023-03-07 14:45:57,405][213771] Updated weights for policy 0, policy_version 30780 (0.0007) [2023-03-07 14:45:58,175][213771] Updated weights for policy 0, policy_version 30790 (0.0006) [2023-03-07 14:45:58,974][213771] Updated weights for policy 0, policy_version 30800 (0.0006) [2023-03-07 14:45:59,733][213771] Updated weights for policy 0, policy_version 30810 (0.0006) [2023-03-07 14:46:00,501][213771] Updated weights for policy 0, policy_version 30820 (0.0006) [2023-03-07 14:46:01,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.8, 300 sec: 13259.9). Total num frames: 31566848. Throughput: 0: 13250.7. Samples: 31537645. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:46:01,106][213445] Avg episode reward: [(0, '4471.627')] [2023-03-07 14:46:01,273][213771] Updated weights for policy 0, policy_version 30830 (0.0007) [2023-03-07 14:46:02,048][213771] Updated weights for policy 0, policy_version 30840 (0.0006) [2023-03-07 14:46:02,830][213771] Updated weights for policy 0, policy_version 30850 (0.0008) [2023-03-07 14:46:03,613][213771] Updated weights for policy 0, policy_version 30860 (0.0006) [2023-03-07 14:46:04,384][213771] Updated weights for policy 0, policy_version 30870 (0.0007) [2023-03-07 14:46:05,162][213771] Updated weights for policy 0, policy_version 30880 (0.0005) [2023-03-07 14:46:05,917][213771] Updated weights for policy 0, policy_version 30890 (0.0006) [2023-03-07 14:46:06,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13243.7, 300 sec: 13259.9). Total num frames: 31633408. Throughput: 0: 13250.0. Samples: 31616882. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:46:06,106][213445] Avg episode reward: [(0, '4461.038')] [2023-03-07 14:46:06,111][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000030892_31633408.pth... [2023-03-07 14:46:06,143][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000027785_28451840.pth [2023-03-07 14:46:06,694][213771] Updated weights for policy 0, policy_version 30900 (0.0006) [2023-03-07 14:46:07,463][213771] Updated weights for policy 0, policy_version 30910 (0.0006) [2023-03-07 14:46:08,226][213771] Updated weights for policy 0, policy_version 30920 (0.0006) [2023-03-07 14:46:08,977][213771] Updated weights for policy 0, policy_version 30930 (0.0006) [2023-03-07 14:46:09,745][213771] Updated weights for policy 0, policy_version 30940 (0.0006) [2023-03-07 14:46:10,507][213771] Updated weights for policy 0, policy_version 30950 (0.0006) [2023-03-07 14:46:11,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13260.8, 300 sec: 13259.9). Total num frames: 31699968. Throughput: 0: 13257.4. Samples: 31696980. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:46:11,106][213445] Avg episode reward: [(0, '4502.427')] [2023-03-07 14:46:11,280][213771] Updated weights for policy 0, policy_version 30960 (0.0006) [2023-03-07 14:46:12,052][213771] Updated weights for policy 0, policy_version 30970 (0.0006) [2023-03-07 14:46:12,841][213771] Updated weights for policy 0, policy_version 30980 (0.0005) [2023-03-07 14:46:13,613][213771] Updated weights for policy 0, policy_version 30990 (0.0006) [2023-03-07 14:46:14,371][213771] Updated weights for policy 0, policy_version 31000 (0.0005) [2023-03-07 14:46:15,141][213771] Updated weights for policy 0, policy_version 31010 (0.0005) [2023-03-07 14:46:15,925][213771] Updated weights for policy 0, policy_version 31020 (0.0006) [2023-03-07 14:46:16,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13260.8, 300 sec: 13263.4). Total num frames: 31766528. Throughput: 0: 13261.2. Samples: 31736791. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:46:16,106][213445] Avg episode reward: [(0, '4556.633')] [2023-03-07 14:46:16,113][213720] Saving new best policy, reward=4556.633! [2023-03-07 14:46:16,688][213771] Updated weights for policy 0, policy_version 31030 (0.0006) [2023-03-07 14:46:17,475][213771] Updated weights for policy 0, policy_version 31040 (0.0006) [2023-03-07 14:46:18,236][213771] Updated weights for policy 0, policy_version 31050 (0.0007) [2023-03-07 14:46:19,009][213771] Updated weights for policy 0, policy_version 31060 (0.0006) [2023-03-07 14:46:19,785][213771] Updated weights for policy 0, policy_version 31070 (0.0006) [2023-03-07 14:46:20,550][213771] Updated weights for policy 0, policy_version 31080 (0.0006) [2023-03-07 14:46:21,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13260.8, 300 sec: 13263.4). Total num frames: 31833088. Throughput: 0: 13267.2. Samples: 31816486. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:46:21,105][213445] Avg episode reward: [(0, '4533.873')] [2023-03-07 14:46:21,323][213771] Updated weights for policy 0, policy_version 31090 (0.0006) [2023-03-07 14:46:22,089][213771] Updated weights for policy 0, policy_version 31100 (0.0005) [2023-03-07 14:46:22,865][213771] Updated weights for policy 0, policy_version 31110 (0.0006) [2023-03-07 14:46:23,633][213771] Updated weights for policy 0, policy_version 31120 (0.0005) [2023-03-07 14:46:24,415][213771] Updated weights for policy 0, policy_version 31130 (0.0006) [2023-03-07 14:46:25,190][213771] Updated weights for policy 0, policy_version 31140 (0.0006) [2023-03-07 14:46:25,973][213771] Updated weights for policy 0, policy_version 31150 (0.0008) [2023-03-07 14:46:26,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13260.7, 300 sec: 13259.9). Total num frames: 31898624. Throughput: 0: 13262.2. Samples: 31895978. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:46:26,106][213445] Avg episode reward: [(0, '4525.570')] [2023-03-07 14:46:26,754][213771] Updated weights for policy 0, policy_version 31160 (0.0006) [2023-03-07 14:46:27,528][213771] Updated weights for policy 0, policy_version 31170 (0.0006) [2023-03-07 14:46:28,274][213771] Updated weights for policy 0, policy_version 31180 (0.0007) [2023-03-07 14:46:29,062][213771] Updated weights for policy 0, policy_version 31190 (0.0006) [2023-03-07 14:46:29,840][213771] Updated weights for policy 0, policy_version 31200 (0.0006) [2023-03-07 14:46:30,603][213771] Updated weights for policy 0, policy_version 31210 (0.0006) [2023-03-07 14:46:31,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13259.9). Total num frames: 31965184. Throughput: 0: 13252.3. Samples: 31935646. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:46:31,106][213445] Avg episode reward: [(0, '4496.316')] [2023-03-07 14:46:31,388][213771] Updated weights for policy 0, policy_version 31220 (0.0006) [2023-03-07 14:46:32,158][213771] Updated weights for policy 0, policy_version 31230 (0.0006) [2023-03-07 14:46:32,927][213771] Updated weights for policy 0, policy_version 31240 (0.0006) [2023-03-07 14:46:33,717][213771] Updated weights for policy 0, policy_version 31250 (0.0006) [2023-03-07 14:46:34,486][213771] Updated weights for policy 0, policy_version 31260 (0.0005) [2023-03-07 14:46:35,260][213771] Updated weights for policy 0, policy_version 31270 (0.0006) [2023-03-07 14:46:36,019][213771] Updated weights for policy 0, policy_version 31280 (0.0007) [2023-03-07 14:46:36,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13259.9). Total num frames: 32031744. Throughput: 0: 13243.2. Samples: 32014897. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:46:36,106][213445] Avg episode reward: [(0, '4519.290')] [2023-03-07 14:46:36,797][213771] Updated weights for policy 0, policy_version 31290 (0.0006) [2023-03-07 14:46:37,549][213771] Updated weights for policy 0, policy_version 31300 (0.0007) [2023-03-07 14:46:38,312][213771] Updated weights for policy 0, policy_version 31310 (0.0006) [2023-03-07 14:46:39,078][213771] Updated weights for policy 0, policy_version 31320 (0.0006) [2023-03-07 14:46:39,845][213771] Updated weights for policy 0, policy_version 31330 (0.0006) [2023-03-07 14:46:40,629][213771] Updated weights for policy 0, policy_version 31340 (0.0006) [2023-03-07 14:46:41,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13259.9). Total num frames: 32098304. Throughput: 0: 13263.2. Samples: 32094787. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:46:41,106][213445] Avg episode reward: [(0, '4548.107')] [2023-03-07 14:46:41,419][213771] Updated weights for policy 0, policy_version 31350 (0.0006) [2023-03-07 14:46:42,189][213771] Updated weights for policy 0, policy_version 31360 (0.0006) [2023-03-07 14:46:42,949][213771] Updated weights for policy 0, policy_version 31370 (0.0006) [2023-03-07 14:46:43,718][213771] Updated weights for policy 0, policy_version 31380 (0.0006) [2023-03-07 14:46:44,485][213771] Updated weights for policy 0, policy_version 31390 (0.0006) [2023-03-07 14:46:45,259][213771] Updated weights for policy 0, policy_version 31400 (0.0005) [2023-03-07 14:46:46,042][213771] Updated weights for policy 0, policy_version 31410 (0.0006) [2023-03-07 14:46:46,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13263.4). Total num frames: 32164864. Throughput: 0: 13265.3. Samples: 32134582. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:46:46,106][213445] Avg episode reward: [(0, '4543.581')] [2023-03-07 14:46:46,818][213771] Updated weights for policy 0, policy_version 31420 (0.0006) [2023-03-07 14:46:47,583][213771] Updated weights for policy 0, policy_version 31430 (0.0006) [2023-03-07 14:46:48,352][213771] Updated weights for policy 0, policy_version 31440 (0.0007) [2023-03-07 14:46:49,133][213771] Updated weights for policy 0, policy_version 31450 (0.0007) [2023-03-07 14:46:49,894][213771] Updated weights for policy 0, policy_version 31460 (0.0007) [2023-03-07 14:46:50,669][213771] Updated weights for policy 0, policy_version 31470 (0.0006) [2023-03-07 14:46:51,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13260.8, 300 sec: 13259.9). Total num frames: 32230400. Throughput: 0: 13275.2. Samples: 32214268. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:46:51,106][213445] Avg episode reward: [(0, '4381.410')] [2023-03-07 14:46:51,442][213771] Updated weights for policy 0, policy_version 31480 (0.0005) [2023-03-07 14:46:52,221][213771] Updated weights for policy 0, policy_version 31490 (0.0006) [2023-03-07 14:46:52,976][213771] Updated weights for policy 0, policy_version 31500 (0.0006) [2023-03-07 14:46:53,751][213771] Updated weights for policy 0, policy_version 31510 (0.0006) [2023-03-07 14:46:54,518][213771] Updated weights for policy 0, policy_version 31520 (0.0006) [2023-03-07 14:46:55,287][213771] Updated weights for policy 0, policy_version 31530 (0.0005) [2023-03-07 14:46:56,051][213771] Updated weights for policy 0, policy_version 31540 (0.0006) [2023-03-07 14:46:56,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13259.9). Total num frames: 32296960. Throughput: 0: 13267.8. Samples: 32294029. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:46:56,106][213445] Avg episode reward: [(0, '4512.414')] [2023-03-07 14:46:56,824][213771] Updated weights for policy 0, policy_version 31550 (0.0007) [2023-03-07 14:46:57,616][213771] Updated weights for policy 0, policy_version 31560 (0.0006) [2023-03-07 14:46:58,376][213771] Updated weights for policy 0, policy_version 31570 (0.0006) [2023-03-07 14:46:59,156][213771] Updated weights for policy 0, policy_version 31580 (0.0007) [2023-03-07 14:46:59,929][213771] Updated weights for policy 0, policy_version 31590 (0.0006) [2023-03-07 14:47:00,688][213771] Updated weights for policy 0, policy_version 31600 (0.0006) [2023-03-07 14:47:01,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13277.9, 300 sec: 13259.9). Total num frames: 32363520. Throughput: 0: 13264.3. Samples: 32333680. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:47:01,106][213445] Avg episode reward: [(0, '4573.184')] [2023-03-07 14:47:01,106][213720] Saving new best policy, reward=4573.184! [2023-03-07 14:47:01,463][213771] Updated weights for policy 0, policy_version 31610 (0.0007) [2023-03-07 14:47:02,237][213771] Updated weights for policy 0, policy_version 31620 (0.0007) [2023-03-07 14:47:02,999][213771] Updated weights for policy 0, policy_version 31630 (0.0006) [2023-03-07 14:47:03,785][213771] Updated weights for policy 0, policy_version 31640 (0.0006) [2023-03-07 14:47:04,545][213771] Updated weights for policy 0, policy_version 31650 (0.0006) [2023-03-07 14:47:05,326][213771] Updated weights for policy 0, policy_version 31660 (0.0006) [2023-03-07 14:47:06,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13259.9). Total num frames: 32429056. Throughput: 0: 13260.5. Samples: 32413211. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:47:06,106][213445] Avg episode reward: [(0, '4543.591')] [2023-03-07 14:47:06,126][213771] Updated weights for policy 0, policy_version 31670 (0.0006) [2023-03-07 14:47:06,901][213771] Updated weights for policy 0, policy_version 31680 (0.0006) [2023-03-07 14:47:07,645][213771] Updated weights for policy 0, policy_version 31690 (0.0007) [2023-03-07 14:47:08,426][213771] Updated weights for policy 0, policy_version 31700 (0.0006) [2023-03-07 14:47:09,201][213771] Updated weights for policy 0, policy_version 31710 (0.0006) [2023-03-07 14:47:09,973][213771] Updated weights for policy 0, policy_version 31720 (0.0006) [2023-03-07 14:47:10,757][213771] Updated weights for policy 0, policy_version 31730 (0.0006) [2023-03-07 14:47:11,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13259.9). Total num frames: 32495616. Throughput: 0: 13261.7. Samples: 32492752. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:47:11,106][213445] Avg episode reward: [(0, '4559.412')] [2023-03-07 14:47:11,521][213771] Updated weights for policy 0, policy_version 31740 (0.0007) [2023-03-07 14:47:12,303][213771] Updated weights for policy 0, policy_version 31750 (0.0006) [2023-03-07 14:47:13,050][213771] Updated weights for policy 0, policy_version 31760 (0.0006) [2023-03-07 14:47:13,821][213771] Updated weights for policy 0, policy_version 31770 (0.0006) [2023-03-07 14:47:14,596][213771] Updated weights for policy 0, policy_version 31780 (0.0006) [2023-03-07 14:47:15,349][213771] Updated weights for policy 0, policy_version 31790 (0.0006) [2023-03-07 14:47:16,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13263.4). Total num frames: 32562176. Throughput: 0: 13265.3. Samples: 32532586. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:47:16,106][213445] Avg episode reward: [(0, '4570.134')] [2023-03-07 14:47:16,122][213771] Updated weights for policy 0, policy_version 31800 (0.0006) [2023-03-07 14:47:16,904][213771] Updated weights for policy 0, policy_version 31810 (0.0006) [2023-03-07 14:47:17,654][213771] Updated weights for policy 0, policy_version 31820 (0.0007) [2023-03-07 14:47:18,433][213771] Updated weights for policy 0, policy_version 31830 (0.0006) [2023-03-07 14:47:19,213][213771] Updated weights for policy 0, policy_version 31840 (0.0006) [2023-03-07 14:47:19,966][213771] Updated weights for policy 0, policy_version 31850 (0.0006) [2023-03-07 14:47:20,746][213771] Updated weights for policy 0, policy_version 31860 (0.0005) [2023-03-07 14:47:21,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13263.4). Total num frames: 32628736. Throughput: 0: 13280.3. Samples: 32612508. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:47:21,106][213445] Avg episode reward: [(0, '4519.804')] [2023-03-07 14:47:21,518][213771] Updated weights for policy 0, policy_version 31870 (0.0006) [2023-03-07 14:47:22,282][213771] Updated weights for policy 0, policy_version 31880 (0.0006) [2023-03-07 14:47:23,062][213771] Updated weights for policy 0, policy_version 31890 (0.0006) [2023-03-07 14:47:23,836][213771] Updated weights for policy 0, policy_version 31900 (0.0006) [2023-03-07 14:47:24,598][213771] Updated weights for policy 0, policy_version 31910 (0.0006) [2023-03-07 14:47:25,374][213771] Updated weights for policy 0, policy_version 31920 (0.0006) [2023-03-07 14:47:26,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13277.9, 300 sec: 13263.4). Total num frames: 32695296. Throughput: 0: 13273.8. Samples: 32692109. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:47:26,106][213445] Avg episode reward: [(0, '4484.819')] [2023-03-07 14:47:26,163][213771] Updated weights for policy 0, policy_version 31930 (0.0007) [2023-03-07 14:47:26,921][213771] Updated weights for policy 0, policy_version 31940 (0.0006) [2023-03-07 14:47:27,673][213771] Updated weights for policy 0, policy_version 31950 (0.0006) [2023-03-07 14:47:28,432][213771] Updated weights for policy 0, policy_version 31960 (0.0006) [2023-03-07 14:47:29,220][213771] Updated weights for policy 0, policy_version 31970 (0.0005) [2023-03-07 14:47:29,997][213771] Updated weights for policy 0, policy_version 31980 (0.0006) [2023-03-07 14:47:30,792][213771] Updated weights for policy 0, policy_version 31990 (0.0005) [2023-03-07 14:47:31,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13277.8, 300 sec: 13263.4). Total num frames: 32761856. Throughput: 0: 13281.1. Samples: 32732233. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:47:31,106][213445] Avg episode reward: [(0, '4451.615')] [2023-03-07 14:47:31,566][213771] Updated weights for policy 0, policy_version 32000 (0.0006) [2023-03-07 14:47:32,324][213771] Updated weights for policy 0, policy_version 32010 (0.0008) [2023-03-07 14:47:33,093][213771] Updated weights for policy 0, policy_version 32020 (0.0006) [2023-03-07 14:47:33,872][213771] Updated weights for policy 0, policy_version 32030 (0.0005) [2023-03-07 14:47:34,644][213771] Updated weights for policy 0, policy_version 32040 (0.0006) [2023-03-07 14:47:35,427][213771] Updated weights for policy 0, policy_version 32050 (0.0006) [2023-03-07 14:47:36,105][213445] Fps is (10 sec: 13209.3, 60 sec: 13260.8, 300 sec: 13259.9). Total num frames: 32827392. Throughput: 0: 13268.5. Samples: 32811354. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:47:36,106][213445] Avg episode reward: [(0, '4400.739')] [2023-03-07 14:47:36,200][213771] Updated weights for policy 0, policy_version 32060 (0.0006) [2023-03-07 14:47:36,977][213771] Updated weights for policy 0, policy_version 32070 (0.0006) [2023-03-07 14:47:37,749][213771] Updated weights for policy 0, policy_version 32080 (0.0007) [2023-03-07 14:47:38,530][213771] Updated weights for policy 0, policy_version 32090 (0.0006) [2023-03-07 14:47:39,309][213771] Updated weights for policy 0, policy_version 32100 (0.0006) [2023-03-07 14:47:40,066][213771] Updated weights for policy 0, policy_version 32110 (0.0005) [2023-03-07 14:47:40,866][213771] Updated weights for policy 0, policy_version 32120 (0.0006) [2023-03-07 14:47:41,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13260.8, 300 sec: 13259.9). Total num frames: 32893952. Throughput: 0: 13256.6. Samples: 32890574. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:47:41,106][213445] Avg episode reward: [(0, '4477.168')] [2023-03-07 14:47:41,620][213771] Updated weights for policy 0, policy_version 32130 (0.0006) [2023-03-07 14:47:42,391][213771] Updated weights for policy 0, policy_version 32140 (0.0006) [2023-03-07 14:47:43,160][213771] Updated weights for policy 0, policy_version 32150 (0.0006) [2023-03-07 14:47:43,957][213771] Updated weights for policy 0, policy_version 32160 (0.0006) [2023-03-07 14:47:44,726][213771] Updated weights for policy 0, policy_version 32170 (0.0006) [2023-03-07 14:47:45,491][213771] Updated weights for policy 0, policy_version 32180 (0.0006) [2023-03-07 14:47:46,105][213445] Fps is (10 sec: 13312.4, 60 sec: 13260.8, 300 sec: 13259.9). Total num frames: 32960512. Throughput: 0: 13261.2. Samples: 32930433. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:47:46,106][213445] Avg episode reward: [(0, '4383.514')] [2023-03-07 14:47:46,254][213771] Updated weights for policy 0, policy_version 32190 (0.0006) [2023-03-07 14:47:47,032][213771] Updated weights for policy 0, policy_version 32200 (0.0007) [2023-03-07 14:47:47,827][213771] Updated weights for policy 0, policy_version 32210 (0.0006) [2023-03-07 14:47:48,580][213771] Updated weights for policy 0, policy_version 32220 (0.0008) [2023-03-07 14:47:49,364][213771] Updated weights for policy 0, policy_version 32230 (0.0006) [2023-03-07 14:47:50,153][213771] Updated weights for policy 0, policy_version 32240 (0.0006) [2023-03-07 14:47:50,918][213771] Updated weights for policy 0, policy_version 32250 (0.0006) [2023-03-07 14:47:51,105][213445] Fps is (10 sec: 13209.3, 60 sec: 13260.8, 300 sec: 13256.5). Total num frames: 33026048. Throughput: 0: 13255.1. Samples: 33009689. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:47:51,106][213445] Avg episode reward: [(0, '4462.005')] [2023-03-07 14:47:51,693][213771] Updated weights for policy 0, policy_version 32260 (0.0006) [2023-03-07 14:47:52,482][213771] Updated weights for policy 0, policy_version 32270 (0.0005) [2023-03-07 14:47:53,249][213771] Updated weights for policy 0, policy_version 32280 (0.0006) [2023-03-07 14:47:54,020][213771] Updated weights for policy 0, policy_version 32290 (0.0006) [2023-03-07 14:47:54,785][213771] Updated weights for policy 0, policy_version 32300 (0.0006) [2023-03-07 14:47:55,550][213771] Updated weights for policy 0, policy_version 32310 (0.0006) [2023-03-07 14:47:56,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13256.5). Total num frames: 33092608. Throughput: 0: 13255.8. Samples: 33089264. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:47:56,106][213445] Avg episode reward: [(0, '4516.482')] [2023-03-07 14:47:56,325][213771] Updated weights for policy 0, policy_version 32320 (0.0006) [2023-03-07 14:47:57,094][213771] Updated weights for policy 0, policy_version 32330 (0.0006) [2023-03-07 14:47:57,853][213771] Updated weights for policy 0, policy_version 32340 (0.0006) [2023-03-07 14:47:58,634][213771] Updated weights for policy 0, policy_version 32350 (0.0006) [2023-03-07 14:47:59,409][213771] Updated weights for policy 0, policy_version 32360 (0.0007) [2023-03-07 14:48:00,177][213771] Updated weights for policy 0, policy_version 32370 (0.0006) [2023-03-07 14:48:00,926][213771] Updated weights for policy 0, policy_version 32380 (0.0007) [2023-03-07 14:48:01,105][213445] Fps is (10 sec: 13312.3, 60 sec: 13260.8, 300 sec: 13259.9). Total num frames: 33159168. Throughput: 0: 13257.3. Samples: 33129162. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:48:01,106][213445] Avg episode reward: [(0, '4474.614')] [2023-03-07 14:48:01,693][213771] Updated weights for policy 0, policy_version 32390 (0.0006) [2023-03-07 14:48:02,476][213771] Updated weights for policy 0, policy_version 32400 (0.0005) [2023-03-07 14:48:03,251][213771] Updated weights for policy 0, policy_version 32410 (0.0005) [2023-03-07 14:48:04,020][213771] Updated weights for policy 0, policy_version 32420 (0.0007) [2023-03-07 14:48:04,823][213771] Updated weights for policy 0, policy_version 32430 (0.0006) [2023-03-07 14:48:05,589][213771] Updated weights for policy 0, policy_version 32440 (0.0006) [2023-03-07 14:48:06,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13260.8, 300 sec: 13256.5). Total num frames: 33224704. Throughput: 0: 13246.1. Samples: 33208581. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:48:06,105][213445] Avg episode reward: [(0, '4508.855')] [2023-03-07 14:48:06,110][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000032446_33224704.pth... [2023-03-07 14:48:06,138][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000029340_30044160.pth [2023-03-07 14:48:06,361][213771] Updated weights for policy 0, policy_version 32450 (0.0006) [2023-03-07 14:48:07,136][213771] Updated weights for policy 0, policy_version 32460 (0.0006) [2023-03-07 14:48:07,921][213771] Updated weights for policy 0, policy_version 32470 (0.0006) [2023-03-07 14:48:08,716][213771] Updated weights for policy 0, policy_version 32480 (0.0006) [2023-03-07 14:48:09,469][213771] Updated weights for policy 0, policy_version 32490 (0.0006) [2023-03-07 14:48:10,228][213771] Updated weights for policy 0, policy_version 32500 (0.0006) [2023-03-07 14:48:10,994][213771] Updated weights for policy 0, policy_version 32510 (0.0006) [2023-03-07 14:48:11,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13260.8, 300 sec: 13256.5). Total num frames: 33291264. Throughput: 0: 13244.1. Samples: 33288094. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:48:11,106][213445] Avg episode reward: [(0, '4423.526')] [2023-03-07 14:48:11,772][213771] Updated weights for policy 0, policy_version 32520 (0.0006) [2023-03-07 14:48:12,541][213771] Updated weights for policy 0, policy_version 32530 (0.0006) [2023-03-07 14:48:13,322][213771] Updated weights for policy 0, policy_version 32540 (0.0006) [2023-03-07 14:48:14,105][213771] Updated weights for policy 0, policy_version 32550 (0.0006) [2023-03-07 14:48:14,873][213771] Updated weights for policy 0, policy_version 32560 (0.0006) [2023-03-07 14:48:15,630][213771] Updated weights for policy 0, policy_version 32570 (0.0006) [2023-03-07 14:48:16,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13256.5). Total num frames: 33357824. Throughput: 0: 13233.5. Samples: 33327737. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:48:16,105][213445] Avg episode reward: [(0, '4428.888')] [2023-03-07 14:48:16,425][213771] Updated weights for policy 0, policy_version 32580 (0.0006) [2023-03-07 14:48:17,191][213771] Updated weights for policy 0, policy_version 32590 (0.0006) [2023-03-07 14:48:17,961][213771] Updated weights for policy 0, policy_version 32600 (0.0006) [2023-03-07 14:48:18,737][213771] Updated weights for policy 0, policy_version 32610 (0.0006) [2023-03-07 14:48:19,511][213771] Updated weights for policy 0, policy_version 32620 (0.0005) [2023-03-07 14:48:20,270][213771] Updated weights for policy 0, policy_version 32630 (0.0005) [2023-03-07 14:48:21,048][213771] Updated weights for policy 0, policy_version 32640 (0.0007) [2023-03-07 14:48:21,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13253.0). Total num frames: 33423360. Throughput: 0: 13240.7. Samples: 33407184. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:48:21,106][213445] Avg episode reward: [(0, '4475.232')] [2023-03-07 14:48:21,844][213771] Updated weights for policy 0, policy_version 32650 (0.0006) [2023-03-07 14:48:22,584][213771] Updated weights for policy 0, policy_version 32660 (0.0006) [2023-03-07 14:48:23,357][213771] Updated weights for policy 0, policy_version 32670 (0.0007) [2023-03-07 14:48:24,122][213771] Updated weights for policy 0, policy_version 32680 (0.0006) [2023-03-07 14:48:24,891][213771] Updated weights for policy 0, policy_version 32690 (0.0005) [2023-03-07 14:48:25,644][213771] Updated weights for policy 0, policy_version 32700 (0.0007) [2023-03-07 14:48:26,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13256.5). Total num frames: 33489920. Throughput: 0: 13258.6. Samples: 33487214. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:48:26,106][213445] Avg episode reward: [(0, '4376.523')] [2023-03-07 14:48:26,426][213771] Updated weights for policy 0, policy_version 32710 (0.0007) [2023-03-07 14:48:27,204][213771] Updated weights for policy 0, policy_version 32720 (0.0006) [2023-03-07 14:48:27,964][213771] Updated weights for policy 0, policy_version 32730 (0.0006) [2023-03-07 14:48:28,742][213771] Updated weights for policy 0, policy_version 32740 (0.0005) [2023-03-07 14:48:29,512][213771] Updated weights for policy 0, policy_version 32750 (0.0007) [2023-03-07 14:48:30,253][213771] Updated weights for policy 0, policy_version 32760 (0.0006) [2023-03-07 14:48:31,037][213771] Updated weights for policy 0, policy_version 32770 (0.0007) [2023-03-07 14:48:31,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13243.7, 300 sec: 13256.5). Total num frames: 33556480. Throughput: 0: 13259.2. Samples: 33527098. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:48:31,106][213445] Avg episode reward: [(0, '4396.242')] [2023-03-07 14:48:31,791][213771] Updated weights for policy 0, policy_version 32780 (0.0006) [2023-03-07 14:48:32,566][213771] Updated weights for policy 0, policy_version 32790 (0.0005) [2023-03-07 14:48:33,347][213771] Updated weights for policy 0, policy_version 32800 (0.0006) [2023-03-07 14:48:34,115][213771] Updated weights for policy 0, policy_version 32810 (0.0006) [2023-03-07 14:48:34,874][213771] Updated weights for policy 0, policy_version 32820 (0.0006) [2023-03-07 14:48:35,651][213771] Updated weights for policy 0, policy_version 32830 (0.0007) [2023-03-07 14:48:36,105][213445] Fps is (10 sec: 13414.3, 60 sec: 13277.9, 300 sec: 13259.9). Total num frames: 33624064. Throughput: 0: 13278.6. Samples: 33607225. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:48:36,106][213445] Avg episode reward: [(0, '4428.713')] [2023-03-07 14:48:36,416][213771] Updated weights for policy 0, policy_version 32840 (0.0007) [2023-03-07 14:48:37,170][213771] Updated weights for policy 0, policy_version 32850 (0.0005) [2023-03-07 14:48:37,950][213771] Updated weights for policy 0, policy_version 32860 (0.0006) [2023-03-07 14:48:38,714][213771] Updated weights for policy 0, policy_version 32870 (0.0006) [2023-03-07 14:48:39,487][213771] Updated weights for policy 0, policy_version 32880 (0.0006) [2023-03-07 14:48:40,255][213771] Updated weights for policy 0, policy_version 32890 (0.0006) [2023-03-07 14:48:41,028][213771] Updated weights for policy 0, policy_version 32900 (0.0007) [2023-03-07 14:48:41,105][213445] Fps is (10 sec: 13414.6, 60 sec: 13277.8, 300 sec: 13263.4). Total num frames: 33690624. Throughput: 0: 13282.3. Samples: 33686965. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:48:41,106][213445] Avg episode reward: [(0, '4392.660')] [2023-03-07 14:48:41,798][213771] Updated weights for policy 0, policy_version 32910 (0.0006) [2023-03-07 14:48:42,557][213771] Updated weights for policy 0, policy_version 32920 (0.0005) [2023-03-07 14:48:43,334][213771] Updated weights for policy 0, policy_version 32930 (0.0006) [2023-03-07 14:48:44,099][213771] Updated weights for policy 0, policy_version 32940 (0.0007) [2023-03-07 14:48:44,880][213771] Updated weights for policy 0, policy_version 32950 (0.0007) [2023-03-07 14:48:45,636][213771] Updated weights for policy 0, policy_version 32960 (0.0006) [2023-03-07 14:48:46,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13256.5). Total num frames: 33756160. Throughput: 0: 13285.7. Samples: 33727022. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:48:46,106][213445] Avg episode reward: [(0, '4469.217')] [2023-03-07 14:48:46,422][213771] Updated weights for policy 0, policy_version 32970 (0.0008) [2023-03-07 14:48:47,180][213771] Updated weights for policy 0, policy_version 32980 (0.0006) [2023-03-07 14:48:47,967][213771] Updated weights for policy 0, policy_version 32990 (0.0006) [2023-03-07 14:48:48,721][213771] Updated weights for policy 0, policy_version 33000 (0.0006) [2023-03-07 14:48:49,513][213771] Updated weights for policy 0, policy_version 33010 (0.0007) [2023-03-07 14:48:50,277][213771] Updated weights for policy 0, policy_version 33020 (0.0006) [2023-03-07 14:48:51,038][213771] Updated weights for policy 0, policy_version 33030 (0.0007) [2023-03-07 14:48:51,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13294.9, 300 sec: 13259.9). Total num frames: 33823744. Throughput: 0: 13294.0. Samples: 33806811. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:48:51,106][213445] Avg episode reward: [(0, '4444.337')] [2023-03-07 14:48:51,817][213771] Updated weights for policy 0, policy_version 33040 (0.0005) [2023-03-07 14:48:52,583][213771] Updated weights for policy 0, policy_version 33050 (0.0006) [2023-03-07 14:48:53,361][213771] Updated weights for policy 0, policy_version 33060 (0.0006) [2023-03-07 14:48:54,104][213771] Updated weights for policy 0, policy_version 33070 (0.0007) [2023-03-07 14:48:54,915][213771] Updated weights for policy 0, policy_version 33080 (0.0006) [2023-03-07 14:48:55,670][213771] Updated weights for policy 0, policy_version 33090 (0.0005) [2023-03-07 14:48:56,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13277.8, 300 sec: 13259.9). Total num frames: 33889280. Throughput: 0: 13295.0. Samples: 33886371. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:48:56,106][213445] Avg episode reward: [(0, '4373.524')] [2023-03-07 14:48:56,441][213771] Updated weights for policy 0, policy_version 33100 (0.0006) [2023-03-07 14:48:57,202][213771] Updated weights for policy 0, policy_version 33110 (0.0006) [2023-03-07 14:48:57,973][213771] Updated weights for policy 0, policy_version 33120 (0.0006) [2023-03-07 14:48:58,745][213771] Updated weights for policy 0, policy_version 33130 (0.0006) [2023-03-07 14:48:59,506][213771] Updated weights for policy 0, policy_version 33140 (0.0007) [2023-03-07 14:49:00,293][213771] Updated weights for policy 0, policy_version 33150 (0.0007) [2023-03-07 14:49:01,053][213771] Updated weights for policy 0, policy_version 33160 (0.0006) [2023-03-07 14:49:01,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13277.8, 300 sec: 13259.9). Total num frames: 33955840. Throughput: 0: 13303.5. Samples: 33926398. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:49:01,106][213445] Avg episode reward: [(0, '4403.205')] [2023-03-07 14:49:01,832][213771] Updated weights for policy 0, policy_version 33170 (0.0006) [2023-03-07 14:49:02,615][213771] Updated weights for policy 0, policy_version 33180 (0.0005) [2023-03-07 14:49:03,364][213771] Updated weights for policy 0, policy_version 33190 (0.0006) [2023-03-07 14:49:04,138][213771] Updated weights for policy 0, policy_version 33200 (0.0007) [2023-03-07 14:49:04,907][213771] Updated weights for policy 0, policy_version 33210 (0.0006) [2023-03-07 14:49:05,689][213771] Updated weights for policy 0, policy_version 33220 (0.0006) [2023-03-07 14:49:06,105][213445] Fps is (10 sec: 13312.3, 60 sec: 13294.9, 300 sec: 13263.4). Total num frames: 34022400. Throughput: 0: 13306.2. Samples: 34005962. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:49:06,106][213445] Avg episode reward: [(0, '4461.970')] [2023-03-07 14:49:06,445][213771] Updated weights for policy 0, policy_version 33230 (0.0006) [2023-03-07 14:49:07,219][213771] Updated weights for policy 0, policy_version 33240 (0.0006) [2023-03-07 14:49:07,993][213771] Updated weights for policy 0, policy_version 33250 (0.0005) [2023-03-07 14:49:08,764][213771] Updated weights for policy 0, policy_version 33260 (0.0006) [2023-03-07 14:49:09,542][213771] Updated weights for policy 0, policy_version 33270 (0.0006) [2023-03-07 14:49:10,306][213771] Updated weights for policy 0, policy_version 33280 (0.0006) [2023-03-07 14:49:11,088][213771] Updated weights for policy 0, policy_version 33290 (0.0006) [2023-03-07 14:49:11,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13294.9, 300 sec: 13266.9). Total num frames: 34088960. Throughput: 0: 13296.4. Samples: 34085554. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:49:11,106][213445] Avg episode reward: [(0, '4389.449')] [2023-03-07 14:49:11,869][213771] Updated weights for policy 0, policy_version 33300 (0.0006) [2023-03-07 14:49:12,640][213771] Updated weights for policy 0, policy_version 33310 (0.0006) [2023-03-07 14:49:13,405][213771] Updated weights for policy 0, policy_version 33320 (0.0006) [2023-03-07 14:49:14,193][213771] Updated weights for policy 0, policy_version 33330 (0.0006) [2023-03-07 14:49:14,970][213771] Updated weights for policy 0, policy_version 33340 (0.0006) [2023-03-07 14:49:15,725][213771] Updated weights for policy 0, policy_version 33350 (0.0006) [2023-03-07 14:49:16,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13277.8, 300 sec: 13263.4). Total num frames: 34154496. Throughput: 0: 13291.4. Samples: 34125210. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:49:16,106][213445] Avg episode reward: [(0, '4436.830')] [2023-03-07 14:49:16,513][213771] Updated weights for policy 0, policy_version 33360 (0.0006) [2023-03-07 14:49:17,283][213771] Updated weights for policy 0, policy_version 33370 (0.0006) [2023-03-07 14:49:18,045][213771] Updated weights for policy 0, policy_version 33380 (0.0007) [2023-03-07 14:49:18,830][213771] Updated weights for policy 0, policy_version 33390 (0.0007) [2023-03-07 14:49:19,620][213771] Updated weights for policy 0, policy_version 33400 (0.0006) [2023-03-07 14:49:20,394][213771] Updated weights for policy 0, policy_version 33410 (0.0006) [2023-03-07 14:49:21,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13295.0, 300 sec: 13263.4). Total num frames: 34221056. Throughput: 0: 13273.9. Samples: 34204549. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:49:21,105][213445] Avg episode reward: [(0, '4437.829')] [2023-03-07 14:49:21,146][213771] Updated weights for policy 0, policy_version 33420 (0.0006) [2023-03-07 14:49:21,925][213771] Updated weights for policy 0, policy_version 33430 (0.0006) [2023-03-07 14:49:22,692][213771] Updated weights for policy 0, policy_version 33440 (0.0008) [2023-03-07 14:49:23,458][213771] Updated weights for policy 0, policy_version 33450 (0.0005) [2023-03-07 14:49:24,234][213771] Updated weights for policy 0, policy_version 33460 (0.0006) [2023-03-07 14:49:25,000][213771] Updated weights for policy 0, policy_version 33470 (0.0006) [2023-03-07 14:49:25,774][213771] Updated weights for policy 0, policy_version 33480 (0.0007) [2023-03-07 14:49:26,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13294.9, 300 sec: 13263.4). Total num frames: 34287616. Throughput: 0: 13272.9. Samples: 34284244. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:49:26,106][213445] Avg episode reward: [(0, '4375.797')] [2023-03-07 14:49:26,552][213771] Updated weights for policy 0, policy_version 33490 (0.0007) [2023-03-07 14:49:27,327][213771] Updated weights for policy 0, policy_version 33500 (0.0005) [2023-03-07 14:49:28,093][213771] Updated weights for policy 0, policy_version 33510 (0.0005) [2023-03-07 14:49:28,863][213771] Updated weights for policy 0, policy_version 33520 (0.0006) [2023-03-07 14:49:29,645][213771] Updated weights for policy 0, policy_version 33530 (0.0006) [2023-03-07 14:49:30,416][213771] Updated weights for policy 0, policy_version 33540 (0.0007) [2023-03-07 14:49:31,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13295.0, 300 sec: 13263.4). Total num frames: 34354176. Throughput: 0: 13269.4. Samples: 34324145. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:49:31,106][213445] Avg episode reward: [(0, '4469.390')] [2023-03-07 14:49:31,178][213771] Updated weights for policy 0, policy_version 33550 (0.0006) [2023-03-07 14:49:31,967][213771] Updated weights for policy 0, policy_version 33560 (0.0006) [2023-03-07 14:49:32,712][213771] Updated weights for policy 0, policy_version 33570 (0.0006) [2023-03-07 14:49:33,477][213771] Updated weights for policy 0, policy_version 33580 (0.0006) [2023-03-07 14:49:34,263][213771] Updated weights for policy 0, policy_version 33590 (0.0007) [2023-03-07 14:49:35,025][213771] Updated weights for policy 0, policy_version 33600 (0.0007) [2023-03-07 14:49:35,797][213771] Updated weights for policy 0, policy_version 33610 (0.0006) [2023-03-07 14:49:36,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13277.9, 300 sec: 13266.9). Total num frames: 34420736. Throughput: 0: 13268.8. Samples: 34403906. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:49:36,105][213445] Avg episode reward: [(0, '4392.404')] [2023-03-07 14:49:36,569][213771] Updated weights for policy 0, policy_version 33620 (0.0006) [2023-03-07 14:49:37,339][213771] Updated weights for policy 0, policy_version 33630 (0.0005) [2023-03-07 14:49:38,138][213771] Updated weights for policy 0, policy_version 33640 (0.0006) [2023-03-07 14:49:38,901][213771] Updated weights for policy 0, policy_version 33650 (0.0006) [2023-03-07 14:49:39,656][213771] Updated weights for policy 0, policy_version 33660 (0.0007) [2023-03-07 14:49:40,418][213771] Updated weights for policy 0, policy_version 33670 (0.0007) [2023-03-07 14:49:41,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13263.4). Total num frames: 34486272. Throughput: 0: 13268.7. Samples: 34483463. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:49:41,106][213445] Avg episode reward: [(0, '4375.732')] [2023-03-07 14:49:41,203][213771] Updated weights for policy 0, policy_version 33680 (0.0007) [2023-03-07 14:49:41,950][213771] Updated weights for policy 0, policy_version 33690 (0.0005) [2023-03-07 14:49:42,726][213771] Updated weights for policy 0, policy_version 33700 (0.0005) [2023-03-07 14:49:43,523][213771] Updated weights for policy 0, policy_version 33710 (0.0006) [2023-03-07 14:49:44,287][213771] Updated weights for policy 0, policy_version 33720 (0.0006) [2023-03-07 14:49:45,058][213771] Updated weights for policy 0, policy_version 33730 (0.0006) [2023-03-07 14:49:45,823][213771] Updated weights for policy 0, policy_version 33740 (0.0005) [2023-03-07 14:49:46,105][213445] Fps is (10 sec: 13209.3, 60 sec: 13277.8, 300 sec: 13263.4). Total num frames: 34552832. Throughput: 0: 13262.5. Samples: 34523210. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:49:46,106][213445] Avg episode reward: [(0, '4428.413')] [2023-03-07 14:49:46,593][213771] Updated weights for policy 0, policy_version 33750 (0.0007) [2023-03-07 14:49:47,358][213771] Updated weights for policy 0, policy_version 33760 (0.0006) [2023-03-07 14:49:48,141][213771] Updated weights for policy 0, policy_version 33770 (0.0006) [2023-03-07 14:49:48,913][213771] Updated weights for policy 0, policy_version 33780 (0.0006) [2023-03-07 14:49:49,680][213771] Updated weights for policy 0, policy_version 33790 (0.0007) [2023-03-07 14:49:50,448][213771] Updated weights for policy 0, policy_version 33800 (0.0005) [2023-03-07 14:49:51,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13260.8, 300 sec: 13266.9). Total num frames: 34619392. Throughput: 0: 13266.2. Samples: 34602941. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:49:51,105][213445] Avg episode reward: [(0, '4372.642')] [2023-03-07 14:49:51,241][213771] Updated weights for policy 0, policy_version 33810 (0.0006) [2023-03-07 14:49:51,994][213771] Updated weights for policy 0, policy_version 33820 (0.0006) [2023-03-07 14:49:52,767][213771] Updated weights for policy 0, policy_version 33830 (0.0006) [2023-03-07 14:49:53,560][213771] Updated weights for policy 0, policy_version 33840 (0.0006) [2023-03-07 14:49:54,307][213771] Updated weights for policy 0, policy_version 33850 (0.0007) [2023-03-07 14:49:55,086][213771] Updated weights for policy 0, policy_version 33860 (0.0006) [2023-03-07 14:49:55,859][213771] Updated weights for policy 0, policy_version 33870 (0.0006) [2023-03-07 14:49:56,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13277.9, 300 sec: 13266.9). Total num frames: 34685952. Throughput: 0: 13267.7. Samples: 34682600. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:49:56,106][213445] Avg episode reward: [(0, '4373.927')] [2023-03-07 14:49:56,629][213771] Updated weights for policy 0, policy_version 33880 (0.0006) [2023-03-07 14:49:57,400][213771] Updated weights for policy 0, policy_version 33890 (0.0006) [2023-03-07 14:49:58,185][213771] Updated weights for policy 0, policy_version 33900 (0.0006) [2023-03-07 14:49:58,950][213771] Updated weights for policy 0, policy_version 33910 (0.0006) [2023-03-07 14:49:59,740][213771] Updated weights for policy 0, policy_version 33920 (0.0006) [2023-03-07 14:50:00,535][213771] Updated weights for policy 0, policy_version 33930 (0.0006) [2023-03-07 14:50:01,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13263.4). Total num frames: 34751488. Throughput: 0: 13268.2. Samples: 34722278. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:50:01,106][213445] Avg episode reward: [(0, '4422.854')] [2023-03-07 14:50:01,313][213771] Updated weights for policy 0, policy_version 33940 (0.0007) [2023-03-07 14:50:02,057][213771] Updated weights for policy 0, policy_version 33950 (0.0006) [2023-03-07 14:50:02,837][213771] Updated weights for policy 0, policy_version 33960 (0.0006) [2023-03-07 14:50:03,640][213771] Updated weights for policy 0, policy_version 33970 (0.0006) [2023-03-07 14:50:04,381][213771] Updated weights for policy 0, policy_version 33980 (0.0006) [2023-03-07 14:50:05,160][213771] Updated weights for policy 0, policy_version 33990 (0.0006) [2023-03-07 14:50:05,925][213771] Updated weights for policy 0, policy_version 34000 (0.0005) [2023-03-07 14:50:06,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13260.8, 300 sec: 13266.9). Total num frames: 34818048. Throughput: 0: 13264.5. Samples: 34801453. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:50:06,106][213445] Avg episode reward: [(0, '4420.827')] [2023-03-07 14:50:06,111][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000034002_34818048.pth... [2023-03-07 14:50:06,141][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000030892_31633408.pth [2023-03-07 14:50:06,709][213771] Updated weights for policy 0, policy_version 34010 (0.0006) [2023-03-07 14:50:07,477][213771] Updated weights for policy 0, policy_version 34020 (0.0007) [2023-03-07 14:50:08,265][213771] Updated weights for policy 0, policy_version 34030 (0.0006) [2023-03-07 14:50:09,020][213771] Updated weights for policy 0, policy_version 34040 (0.0008) [2023-03-07 14:50:09,806][213771] Updated weights for policy 0, policy_version 34050 (0.0006) [2023-03-07 14:50:10,569][213771] Updated weights for policy 0, policy_version 34060 (0.0006) [2023-03-07 14:50:11,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13266.9). Total num frames: 34884608. Throughput: 0: 13262.9. Samples: 34881076. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:50:11,106][213445] Avg episode reward: [(0, '4402.683')] [2023-03-07 14:50:11,329][213771] Updated weights for policy 0, policy_version 34070 (0.0006) [2023-03-07 14:50:12,105][213771] Updated weights for policy 0, policy_version 34080 (0.0007) [2023-03-07 14:50:12,864][213771] Updated weights for policy 0, policy_version 34090 (0.0006) [2023-03-07 14:50:13,646][213771] Updated weights for policy 0, policy_version 34100 (0.0006) [2023-03-07 14:50:14,423][213771] Updated weights for policy 0, policy_version 34110 (0.0006) [2023-03-07 14:50:15,214][213771] Updated weights for policy 0, policy_version 34120 (0.0007) [2023-03-07 14:50:15,980][213771] Updated weights for policy 0, policy_version 34130 (0.0006) [2023-03-07 14:50:16,105][213445] Fps is (10 sec: 13209.9, 60 sec: 13260.8, 300 sec: 13263.4). Total num frames: 34950144. Throughput: 0: 13262.9. Samples: 34920973. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:50:16,106][213445] Avg episode reward: [(0, '4490.872')] [2023-03-07 14:50:16,734][213771] Updated weights for policy 0, policy_version 34140 (0.0007) [2023-03-07 14:50:17,513][213771] Updated weights for policy 0, policy_version 34150 (0.0006) [2023-03-07 14:50:18,290][213771] Updated weights for policy 0, policy_version 34160 (0.0006) [2023-03-07 14:50:19,051][213771] Updated weights for policy 0, policy_version 34170 (0.0006) [2023-03-07 14:50:19,818][213771] Updated weights for policy 0, policy_version 34180 (0.0006) [2023-03-07 14:50:20,595][213771] Updated weights for policy 0, policy_version 34190 (0.0006) [2023-03-07 14:50:21,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13260.8, 300 sec: 13266.9). Total num frames: 35016704. Throughput: 0: 13255.4. Samples: 35000398. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:50:21,106][213445] Avg episode reward: [(0, '4510.899')] [2023-03-07 14:50:21,363][213771] Updated weights for policy 0, policy_version 34200 (0.0006) [2023-03-07 14:50:22,119][213771] Updated weights for policy 0, policy_version 34210 (0.0006) [2023-03-07 14:50:22,902][213771] Updated weights for policy 0, policy_version 34220 (0.0007) [2023-03-07 14:50:23,669][213771] Updated weights for policy 0, policy_version 34230 (0.0006) [2023-03-07 14:50:24,449][213771] Updated weights for policy 0, policy_version 34240 (0.0006) [2023-03-07 14:50:25,211][213771] Updated weights for policy 0, policy_version 34250 (0.0005) [2023-03-07 14:50:25,995][213771] Updated weights for policy 0, policy_version 34260 (0.0008) [2023-03-07 14:50:26,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13266.9). Total num frames: 35083264. Throughput: 0: 13261.6. Samples: 35080234. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:50:26,106][213445] Avg episode reward: [(0, '4506.246')] [2023-03-07 14:50:26,774][213771] Updated weights for policy 0, policy_version 34270 (0.0005) [2023-03-07 14:50:27,547][213771] Updated weights for policy 0, policy_version 34280 (0.0006) [2023-03-07 14:50:28,303][213771] Updated weights for policy 0, policy_version 34290 (0.0006) [2023-03-07 14:50:29,075][213771] Updated weights for policy 0, policy_version 34300 (0.0006) [2023-03-07 14:50:29,861][213771] Updated weights for policy 0, policy_version 34310 (0.0007) [2023-03-07 14:50:30,625][213771] Updated weights for policy 0, policy_version 34320 (0.0006) [2023-03-07 14:50:31,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13266.9). Total num frames: 35149824. Throughput: 0: 13263.1. Samples: 35120047. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:50:31,105][213445] Avg episode reward: [(0, '4519.024')] [2023-03-07 14:50:31,387][213771] Updated weights for policy 0, policy_version 34330 (0.0006) [2023-03-07 14:50:32,154][213771] Updated weights for policy 0, policy_version 34340 (0.0006) [2023-03-07 14:50:32,917][213771] Updated weights for policy 0, policy_version 34350 (0.0006) [2023-03-07 14:50:33,683][213771] Updated weights for policy 0, policy_version 34360 (0.0006) [2023-03-07 14:50:34,461][213771] Updated weights for policy 0, policy_version 34370 (0.0006) [2023-03-07 14:50:35,240][213771] Updated weights for policy 0, policy_version 34380 (0.0005) [2023-03-07 14:50:36,013][213771] Updated weights for policy 0, policy_version 34390 (0.0006) [2023-03-07 14:50:36,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13266.9). Total num frames: 35216384. Throughput: 0: 13260.8. Samples: 35199674. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:50:36,105][213445] Avg episode reward: [(0, '4503.599')] [2023-03-07 14:50:36,775][213771] Updated weights for policy 0, policy_version 34400 (0.0007) [2023-03-07 14:50:37,549][213771] Updated weights for policy 0, policy_version 34410 (0.0005) [2023-03-07 14:50:38,318][213771] Updated weights for policy 0, policy_version 34420 (0.0006) [2023-03-07 14:50:39,088][213771] Updated weights for policy 0, policy_version 34430 (0.0007) [2023-03-07 14:50:39,870][213771] Updated weights for policy 0, policy_version 34440 (0.0006) [2023-03-07 14:50:40,629][213771] Updated weights for policy 0, policy_version 34450 (0.0007) [2023-03-07 14:50:41,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13277.9, 300 sec: 13266.9). Total num frames: 35282944. Throughput: 0: 13261.4. Samples: 35279360. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:50:41,106][213445] Avg episode reward: [(0, '4515.837')] [2023-03-07 14:50:41,396][213771] Updated weights for policy 0, policy_version 34460 (0.0007) [2023-03-07 14:50:42,168][213771] Updated weights for policy 0, policy_version 34470 (0.0005) [2023-03-07 14:50:42,942][213771] Updated weights for policy 0, policy_version 34480 (0.0006) [2023-03-07 14:50:43,717][213771] Updated weights for policy 0, policy_version 34490 (0.0006) [2023-03-07 14:50:44,483][213771] Updated weights for policy 0, policy_version 34500 (0.0006) [2023-03-07 14:50:45,265][213771] Updated weights for policy 0, policy_version 34510 (0.0007) [2023-03-07 14:50:46,045][213771] Updated weights for policy 0, policy_version 34520 (0.0006) [2023-03-07 14:50:46,105][213445] Fps is (10 sec: 13209.2, 60 sec: 13260.8, 300 sec: 13266.9). Total num frames: 35348480. Throughput: 0: 13262.3. Samples: 35319083. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:50:46,106][213445] Avg episode reward: [(0, '4489.300')] [2023-03-07 14:50:46,833][213771] Updated weights for policy 0, policy_version 34530 (0.0006) [2023-03-07 14:50:47,600][213771] Updated weights for policy 0, policy_version 34540 (0.0007) [2023-03-07 14:50:48,382][213771] Updated weights for policy 0, policy_version 34550 (0.0006) [2023-03-07 14:50:49,148][213771] Updated weights for policy 0, policy_version 34560 (0.0006) [2023-03-07 14:50:49,922][213771] Updated weights for policy 0, policy_version 34570 (0.0006) [2023-03-07 14:50:50,693][213771] Updated weights for policy 0, policy_version 34580 (0.0007) [2023-03-07 14:50:51,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13260.8, 300 sec: 13266.9). Total num frames: 35415040. Throughput: 0: 13268.8. Samples: 35398548. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:50:51,106][213445] Avg episode reward: [(0, '4312.787')] [2023-03-07 14:50:51,451][213771] Updated weights for policy 0, policy_version 34590 (0.0007) [2023-03-07 14:50:52,241][213771] Updated weights for policy 0, policy_version 34600 (0.0006) [2023-03-07 14:50:53,013][213771] Updated weights for policy 0, policy_version 34610 (0.0006) [2023-03-07 14:50:53,778][213771] Updated weights for policy 0, policy_version 34620 (0.0006) [2023-03-07 14:50:54,550][213771] Updated weights for policy 0, policy_version 34630 (0.0006) [2023-03-07 14:50:55,346][213771] Updated weights for policy 0, policy_version 34640 (0.0006) [2023-03-07 14:50:56,098][213771] Updated weights for policy 0, policy_version 34650 (0.0007) [2023-03-07 14:50:56,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13270.3). Total num frames: 35481600. Throughput: 0: 13263.8. Samples: 35477947. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:50:56,106][213445] Avg episode reward: [(0, '4397.882')] [2023-03-07 14:50:56,870][213771] Updated weights for policy 0, policy_version 34660 (0.0006) [2023-03-07 14:50:57,656][213771] Updated weights for policy 0, policy_version 34670 (0.0006) [2023-03-07 14:50:58,426][213771] Updated weights for policy 0, policy_version 34680 (0.0006) [2023-03-07 14:50:59,204][213771] Updated weights for policy 0, policy_version 34690 (0.0006) [2023-03-07 14:50:59,973][213771] Updated weights for policy 0, policy_version 34700 (0.0006) [2023-03-07 14:51:00,753][213771] Updated weights for policy 0, policy_version 34710 (0.0007) [2023-03-07 14:51:01,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13260.8, 300 sec: 13266.9). Total num frames: 35547136. Throughput: 0: 13255.1. Samples: 35517452. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:51:01,106][213445] Avg episode reward: [(0, '4450.744')] [2023-03-07 14:51:01,519][213771] Updated weights for policy 0, policy_version 34720 (0.0006) [2023-03-07 14:51:02,297][213771] Updated weights for policy 0, policy_version 34730 (0.0006) [2023-03-07 14:51:03,074][213771] Updated weights for policy 0, policy_version 34740 (0.0006) [2023-03-07 14:51:03,840][213771] Updated weights for policy 0, policy_version 34750 (0.0006) [2023-03-07 14:51:04,631][213771] Updated weights for policy 0, policy_version 34760 (0.0007) [2023-03-07 14:51:05,416][213771] Updated weights for policy 0, policy_version 34770 (0.0006) [2023-03-07 14:51:06,105][213445] Fps is (10 sec: 13107.5, 60 sec: 13243.8, 300 sec: 13263.4). Total num frames: 35612672. Throughput: 0: 13254.8. Samples: 35596863. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:51:06,105][213445] Avg episode reward: [(0, '4507.178')] [2023-03-07 14:51:06,185][213771] Updated weights for policy 0, policy_version 34780 (0.0006) [2023-03-07 14:51:06,954][213771] Updated weights for policy 0, policy_version 34790 (0.0006) [2023-03-07 14:51:07,747][213771] Updated weights for policy 0, policy_version 34800 (0.0007) [2023-03-07 14:51:08,521][213771] Updated weights for policy 0, policy_version 34810 (0.0006) [2023-03-07 14:51:09,302][213771] Updated weights for policy 0, policy_version 34820 (0.0006) [2023-03-07 14:51:10,089][213771] Updated weights for policy 0, policy_version 34830 (0.0007) [2023-03-07 14:51:10,865][213771] Updated weights for policy 0, policy_version 34840 (0.0006) [2023-03-07 14:51:11,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13263.4). Total num frames: 35679232. Throughput: 0: 13237.2. Samples: 35675907. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:51:11,106][213445] Avg episode reward: [(0, '4530.340')] [2023-03-07 14:51:11,630][213771] Updated weights for policy 0, policy_version 34850 (0.0006) [2023-03-07 14:51:12,417][213771] Updated weights for policy 0, policy_version 34860 (0.0005) [2023-03-07 14:51:13,169][213771] Updated weights for policy 0, policy_version 34870 (0.0006) [2023-03-07 14:51:13,950][213771] Updated weights for policy 0, policy_version 34880 (0.0006) [2023-03-07 14:51:14,722][213771] Updated weights for policy 0, policy_version 34890 (0.0006) [2023-03-07 14:51:15,494][213771] Updated weights for policy 0, policy_version 34900 (0.0007) [2023-03-07 14:51:16,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13260.8, 300 sec: 13263.4). Total num frames: 35745792. Throughput: 0: 13237.1. Samples: 35715720. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:51:16,106][213445] Avg episode reward: [(0, '4534.967')] [2023-03-07 14:51:16,247][213771] Updated weights for policy 0, policy_version 34910 (0.0006) [2023-03-07 14:51:17,033][213771] Updated weights for policy 0, policy_version 34920 (0.0006) [2023-03-07 14:51:17,802][213771] Updated weights for policy 0, policy_version 34930 (0.0006) [2023-03-07 14:51:18,552][213771] Updated weights for policy 0, policy_version 34940 (0.0005) [2023-03-07 14:51:19,352][213771] Updated weights for policy 0, policy_version 34950 (0.0007) [2023-03-07 14:51:20,122][213771] Updated weights for policy 0, policy_version 34960 (0.0006) [2023-03-07 14:51:20,879][213771] Updated weights for policy 0, policy_version 34970 (0.0006) [2023-03-07 14:51:21,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13260.8, 300 sec: 13266.9). Total num frames: 35812352. Throughput: 0: 13237.0. Samples: 35795342. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:51:21,106][213445] Avg episode reward: [(0, '4466.620')] [2023-03-07 14:51:21,652][213771] Updated weights for policy 0, policy_version 34980 (0.0007) [2023-03-07 14:51:22,429][213771] Updated weights for policy 0, policy_version 34990 (0.0006) [2023-03-07 14:51:23,187][213771] Updated weights for policy 0, policy_version 35000 (0.0006) [2023-03-07 14:51:23,929][213771] Updated weights for policy 0, policy_version 35010 (0.0006) [2023-03-07 14:51:24,726][213771] Updated weights for policy 0, policy_version 35020 (0.0006) [2023-03-07 14:51:25,481][213771] Updated weights for policy 0, policy_version 35030 (0.0006) [2023-03-07 14:51:26,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13266.9). Total num frames: 35878912. Throughput: 0: 13247.0. Samples: 35875477. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:51:26,105][213445] Avg episode reward: [(0, '4512.536')] [2023-03-07 14:51:26,242][213771] Updated weights for policy 0, policy_version 35040 (0.0006) [2023-03-07 14:51:27,033][213771] Updated weights for policy 0, policy_version 35050 (0.0006) [2023-03-07 14:51:27,784][213771] Updated weights for policy 0, policy_version 35060 (0.0006) [2023-03-07 14:51:28,557][213771] Updated weights for policy 0, policy_version 35070 (0.0006) [2023-03-07 14:51:29,329][213771] Updated weights for policy 0, policy_version 35080 (0.0007) [2023-03-07 14:51:30,099][213771] Updated weights for policy 0, policy_version 35090 (0.0005) [2023-03-07 14:51:30,878][213771] Updated weights for policy 0, policy_version 35100 (0.0006) [2023-03-07 14:51:31,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13263.4). Total num frames: 35944448. Throughput: 0: 13247.7. Samples: 35915225. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:51:31,106][213445] Avg episode reward: [(0, '4516.598')] [2023-03-07 14:51:31,647][213771] Updated weights for policy 0, policy_version 35110 (0.0006) [2023-03-07 14:51:32,421][213771] Updated weights for policy 0, policy_version 35120 (0.0006) [2023-03-07 14:51:33,173][213771] Updated weights for policy 0, policy_version 35130 (0.0006) [2023-03-07 14:51:33,941][213771] Updated weights for policy 0, policy_version 35140 (0.0005) [2023-03-07 14:51:34,721][213771] Updated weights for policy 0, policy_version 35150 (0.0006) [2023-03-07 14:51:35,485][213771] Updated weights for policy 0, policy_version 35160 (0.0006) [2023-03-07 14:51:36,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13263.4). Total num frames: 36011008. Throughput: 0: 13255.4. Samples: 35995040. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:51:36,106][213445] Avg episode reward: [(0, '4525.008')] [2023-03-07 14:51:36,266][213771] Updated weights for policy 0, policy_version 35170 (0.0005) [2023-03-07 14:51:37,068][213771] Updated weights for policy 0, policy_version 35180 (0.0006) [2023-03-07 14:51:37,815][213771] Updated weights for policy 0, policy_version 35190 (0.0006) [2023-03-07 14:51:38,603][213771] Updated weights for policy 0, policy_version 35200 (0.0007) [2023-03-07 14:51:39,377][213771] Updated weights for policy 0, policy_version 35210 (0.0006) [2023-03-07 14:51:40,141][213771] Updated weights for policy 0, policy_version 35220 (0.0006) [2023-03-07 14:51:40,927][213771] Updated weights for policy 0, policy_version 35230 (0.0006) [2023-03-07 14:51:41,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13243.7, 300 sec: 13263.4). Total num frames: 36077568. Throughput: 0: 13253.9. Samples: 36074371. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:51:41,116][213445] Avg episode reward: [(0, '4529.121')] [2023-03-07 14:51:41,698][213771] Updated weights for policy 0, policy_version 35240 (0.0006) [2023-03-07 14:51:42,463][213771] Updated weights for policy 0, policy_version 35250 (0.0005) [2023-03-07 14:51:43,242][213771] Updated weights for policy 0, policy_version 35260 (0.0006) [2023-03-07 14:51:44,022][213771] Updated weights for policy 0, policy_version 35270 (0.0006) [2023-03-07 14:51:44,771][213771] Updated weights for policy 0, policy_version 35280 (0.0006) [2023-03-07 14:51:45,567][213771] Updated weights for policy 0, policy_version 35290 (0.0006) [2023-03-07 14:51:46,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.8, 300 sec: 13263.4). Total num frames: 36143104. Throughput: 0: 13259.4. Samples: 36114125. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:51:46,116][213445] Avg episode reward: [(0, '4526.889')] [2023-03-07 14:51:46,346][213771] Updated weights for policy 0, policy_version 35300 (0.0006) [2023-03-07 14:51:47,110][213771] Updated weights for policy 0, policy_version 35310 (0.0006) [2023-03-07 14:51:47,886][213771] Updated weights for policy 0, policy_version 35320 (0.0006) [2023-03-07 14:51:48,666][213771] Updated weights for policy 0, policy_version 35330 (0.0006) [2023-03-07 14:51:49,445][213771] Updated weights for policy 0, policy_version 35340 (0.0006) [2023-03-07 14:51:50,240][213771] Updated weights for policy 0, policy_version 35350 (0.0006) [2023-03-07 14:51:51,025][213771] Updated weights for policy 0, policy_version 35360 (0.0006) [2023-03-07 14:51:51,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13243.8, 300 sec: 13263.4). Total num frames: 36209664. Throughput: 0: 13255.7. Samples: 36193370. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:51:51,116][213445] Avg episode reward: [(0, '4493.368')] [2023-03-07 14:51:51,783][213771] Updated weights for policy 0, policy_version 35370 (0.0007) [2023-03-07 14:51:52,561][213771] Updated weights for policy 0, policy_version 35380 (0.0006) [2023-03-07 14:51:53,343][213771] Updated weights for policy 0, policy_version 35390 (0.0006) [2023-03-07 14:51:54,116][213771] Updated weights for policy 0, policy_version 35400 (0.0007) [2023-03-07 14:51:54,897][213771] Updated weights for policy 0, policy_version 35410 (0.0006) [2023-03-07 14:51:55,678][213771] Updated weights for policy 0, policy_version 35420 (0.0006) [2023-03-07 14:51:56,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13259.9). Total num frames: 36275200. Throughput: 0: 13251.1. Samples: 36272208. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:51:56,116][213445] Avg episode reward: [(0, '4539.978')] [2023-03-07 14:51:56,438][213771] Updated weights for policy 0, policy_version 35430 (0.0005) [2023-03-07 14:51:57,230][213771] Updated weights for policy 0, policy_version 35440 (0.0006) [2023-03-07 14:51:57,997][213771] Updated weights for policy 0, policy_version 35450 (0.0007) [2023-03-07 14:51:58,779][213771] Updated weights for policy 0, policy_version 35460 (0.0007) [2023-03-07 14:51:59,546][213771] Updated weights for policy 0, policy_version 35470 (0.0006) [2023-03-07 14:52:00,319][213771] Updated weights for policy 0, policy_version 35480 (0.0006) [2023-03-07 14:52:01,086][213771] Updated weights for policy 0, policy_version 35490 (0.0005) [2023-03-07 14:52:01,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13263.4). Total num frames: 36341760. Throughput: 0: 13249.1. Samples: 36311930. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:52:01,116][213445] Avg episode reward: [(0, '4541.261')] [2023-03-07 14:52:01,864][213771] Updated weights for policy 0, policy_version 35500 (0.0005) [2023-03-07 14:52:02,626][213771] Updated weights for policy 0, policy_version 35510 (0.0007) [2023-03-07 14:52:03,393][213771] Updated weights for policy 0, policy_version 35520 (0.0007) [2023-03-07 14:52:04,174][213771] Updated weights for policy 0, policy_version 35530 (0.0006) [2023-03-07 14:52:04,959][213771] Updated weights for policy 0, policy_version 35540 (0.0005) [2023-03-07 14:52:05,743][213771] Updated weights for policy 0, policy_version 35550 (0.0006) [2023-03-07 14:52:06,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13259.9). Total num frames: 36407296. Throughput: 0: 13242.1. Samples: 36391238. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:52:06,117][213445] Avg episode reward: [(0, '4512.943')] [2023-03-07 14:52:06,122][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000035554_36407296.pth... [2023-03-07 14:52:06,152][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000032446_33224704.pth [2023-03-07 14:52:06,510][213771] Updated weights for policy 0, policy_version 35560 (0.0006) [2023-03-07 14:52:07,297][213771] Updated weights for policy 0, policy_version 35570 (0.0006) [2023-03-07 14:52:08,067][213771] Updated weights for policy 0, policy_version 35580 (0.0006) [2023-03-07 14:52:08,831][213771] Updated weights for policy 0, policy_version 35590 (0.0005) [2023-03-07 14:52:09,593][213771] Updated weights for policy 0, policy_version 35600 (0.0006) [2023-03-07 14:52:10,393][213771] Updated weights for policy 0, policy_version 35610 (0.0007) [2023-03-07 14:52:11,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.8, 300 sec: 13259.9). Total num frames: 36473856. Throughput: 0: 13223.3. Samples: 36470526. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:52:11,116][213445] Avg episode reward: [(0, '4486.159')] [2023-03-07 14:52:11,162][213771] Updated weights for policy 0, policy_version 35620 (0.0005) [2023-03-07 14:52:11,929][213771] Updated weights for policy 0, policy_version 35630 (0.0005) [2023-03-07 14:52:12,725][213771] Updated weights for policy 0, policy_version 35640 (0.0007) [2023-03-07 14:52:13,482][213771] Updated weights for policy 0, policy_version 35650 (0.0008) [2023-03-07 14:52:14,264][213771] Updated weights for policy 0, policy_version 35660 (0.0006) [2023-03-07 14:52:15,046][213771] Updated weights for policy 0, policy_version 35670 (0.0006) [2023-03-07 14:52:15,800][213771] Updated weights for policy 0, policy_version 35680 (0.0006) [2023-03-07 14:52:16,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.6, 300 sec: 13256.5). Total num frames: 36539392. Throughput: 0: 13227.1. Samples: 36510445. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:52:16,116][213445] Avg episode reward: [(0, '4451.472')] [2023-03-07 14:52:16,585][213771] Updated weights for policy 0, policy_version 35690 (0.0006) [2023-03-07 14:52:17,363][213771] Updated weights for policy 0, policy_version 35700 (0.0006) [2023-03-07 14:52:18,152][213771] Updated weights for policy 0, policy_version 35710 (0.0006) [2023-03-07 14:52:18,914][213771] Updated weights for policy 0, policy_version 35720 (0.0005) [2023-03-07 14:52:19,699][213771] Updated weights for policy 0, policy_version 35730 (0.0007) [2023-03-07 14:52:20,466][213771] Updated weights for policy 0, policy_version 35740 (0.0006) [2023-03-07 14:52:21,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13256.5). Total num frames: 36605952. Throughput: 0: 13209.0. Samples: 36589446. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:52:21,116][213445] Avg episode reward: [(0, '4471.735')] [2023-03-07 14:52:21,235][213771] Updated weights for policy 0, policy_version 35750 (0.0006) [2023-03-07 14:52:22,017][213771] Updated weights for policy 0, policy_version 35760 (0.0007) [2023-03-07 14:52:22,796][213771] Updated weights for policy 0, policy_version 35770 (0.0007) [2023-03-07 14:52:23,566][213771] Updated weights for policy 0, policy_version 35780 (0.0007) [2023-03-07 14:52:24,323][213771] Updated weights for policy 0, policy_version 35790 (0.0006) [2023-03-07 14:52:25,087][213771] Updated weights for policy 0, policy_version 35800 (0.0006) [2023-03-07 14:52:25,870][213771] Updated weights for policy 0, policy_version 35810 (0.0007) [2023-03-07 14:52:26,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13226.7, 300 sec: 13256.5). Total num frames: 36672512. Throughput: 0: 13212.8. Samples: 36668948. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:52:26,106][213445] Avg episode reward: [(0, '4412.848')] [2023-03-07 14:52:26,654][213771] Updated weights for policy 0, policy_version 35820 (0.0006) [2023-03-07 14:52:27,417][213771] Updated weights for policy 0, policy_version 35830 (0.0006) [2023-03-07 14:52:28,187][213771] Updated weights for policy 0, policy_version 35840 (0.0006) [2023-03-07 14:52:28,948][213771] Updated weights for policy 0, policy_version 35850 (0.0006) [2023-03-07 14:52:29,718][213771] Updated weights for policy 0, policy_version 35860 (0.0007) [2023-03-07 14:52:30,498][213771] Updated weights for policy 0, policy_version 35870 (0.0007) [2023-03-07 14:52:31,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13226.7, 300 sec: 13256.5). Total num frames: 36738048. Throughput: 0: 13213.0. Samples: 36708710. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:52:31,106][213445] Avg episode reward: [(0, '4417.269')] [2023-03-07 14:52:31,267][213771] Updated weights for policy 0, policy_version 35880 (0.0006) [2023-03-07 14:52:32,040][213771] Updated weights for policy 0, policy_version 35890 (0.0007) [2023-03-07 14:52:32,825][213771] Updated weights for policy 0, policy_version 35900 (0.0007) [2023-03-07 14:52:33,592][213771] Updated weights for policy 0, policy_version 35910 (0.0006) [2023-03-07 14:52:34,366][213771] Updated weights for policy 0, policy_version 35920 (0.0006) [2023-03-07 14:52:35,153][213771] Updated weights for policy 0, policy_version 35930 (0.0006) [2023-03-07 14:52:35,926][213771] Updated weights for policy 0, policy_version 35940 (0.0005) [2023-03-07 14:52:36,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13226.7, 300 sec: 13256.5). Total num frames: 36804608. Throughput: 0: 13217.4. Samples: 36788154. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:52:36,106][213445] Avg episode reward: [(0, '4479.169')] [2023-03-07 14:52:36,687][213771] Updated weights for policy 0, policy_version 35950 (0.0006) [2023-03-07 14:52:37,464][213771] Updated weights for policy 0, policy_version 35960 (0.0005) [2023-03-07 14:52:38,234][213771] Updated weights for policy 0, policy_version 35970 (0.0005) [2023-03-07 14:52:39,008][213771] Updated weights for policy 0, policy_version 35980 (0.0006) [2023-03-07 14:52:39,789][213771] Updated weights for policy 0, policy_version 35990 (0.0006) [2023-03-07 14:52:40,558][213771] Updated weights for policy 0, policy_version 36000 (0.0006) [2023-03-07 14:52:41,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13209.6, 300 sec: 13253.0). Total num frames: 36870144. Throughput: 0: 13232.1. Samples: 36867653. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:52:41,106][213445] Avg episode reward: [(0, '4507.469')] [2023-03-07 14:52:41,316][213771] Updated weights for policy 0, policy_version 36010 (0.0006) [2023-03-07 14:52:42,113][213771] Updated weights for policy 0, policy_version 36020 (0.0007) [2023-03-07 14:52:42,890][213771] Updated weights for policy 0, policy_version 36030 (0.0006) [2023-03-07 14:52:43,670][213771] Updated weights for policy 0, policy_version 36040 (0.0006) [2023-03-07 14:52:44,458][213771] Updated weights for policy 0, policy_version 36050 (0.0006) [2023-03-07 14:52:45,229][213771] Updated weights for policy 0, policy_version 36060 (0.0006) [2023-03-07 14:52:46,010][213771] Updated weights for policy 0, policy_version 36070 (0.0007) [2023-03-07 14:52:46,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13256.5). Total num frames: 36936704. Throughput: 0: 13228.5. Samples: 36907213. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:52:46,105][213445] Avg episode reward: [(0, '4490.494')] [2023-03-07 14:52:46,781][213771] Updated weights for policy 0, policy_version 36080 (0.0007) [2023-03-07 14:52:47,550][213771] Updated weights for policy 0, policy_version 36090 (0.0006) [2023-03-07 14:52:48,328][213771] Updated weights for policy 0, policy_version 36100 (0.0006) [2023-03-07 14:52:49,099][213771] Updated weights for policy 0, policy_version 36110 (0.0006) [2023-03-07 14:52:49,866][213771] Updated weights for policy 0, policy_version 36120 (0.0006) [2023-03-07 14:52:50,636][213771] Updated weights for policy 0, policy_version 36130 (0.0005) [2023-03-07 14:52:51,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13226.7, 300 sec: 13256.5). Total num frames: 37003264. Throughput: 0: 13226.9. Samples: 36986444. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:52:51,105][213445] Avg episode reward: [(0, '4531.084')] [2023-03-07 14:52:51,393][213771] Updated weights for policy 0, policy_version 36140 (0.0007) [2023-03-07 14:52:52,191][213771] Updated weights for policy 0, policy_version 36150 (0.0008) [2023-03-07 14:52:52,950][213771] Updated weights for policy 0, policy_version 36160 (0.0005) [2023-03-07 14:52:53,733][213771] Updated weights for policy 0, policy_version 36170 (0.0005) [2023-03-07 14:52:54,507][213771] Updated weights for policy 0, policy_version 36180 (0.0006) [2023-03-07 14:52:55,294][213771] Updated weights for policy 0, policy_version 36190 (0.0005) [2023-03-07 14:52:56,060][213771] Updated weights for policy 0, policy_version 36200 (0.0006) [2023-03-07 14:52:56,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13226.7, 300 sec: 13253.0). Total num frames: 37068800. Throughput: 0: 13226.6. Samples: 37065725. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:52:56,106][213445] Avg episode reward: [(0, '4517.397')] [2023-03-07 14:52:56,846][213771] Updated weights for policy 0, policy_version 36210 (0.0005) [2023-03-07 14:52:57,622][213771] Updated weights for policy 0, policy_version 36220 (0.0007) [2023-03-07 14:52:58,389][213771] Updated weights for policy 0, policy_version 36230 (0.0006) [2023-03-07 14:52:59,149][213771] Updated weights for policy 0, policy_version 36240 (0.0006) [2023-03-07 14:52:59,936][213771] Updated weights for policy 0, policy_version 36250 (0.0006) [2023-03-07 14:53:00,725][213771] Updated weights for policy 0, policy_version 36260 (0.0006) [2023-03-07 14:53:01,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13256.5). Total num frames: 37135360. Throughput: 0: 13222.8. Samples: 37105467. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:53:01,106][213445] Avg episode reward: [(0, '4478.060')] [2023-03-07 14:53:01,491][213771] Updated weights for policy 0, policy_version 36270 (0.0006) [2023-03-07 14:53:02,263][213771] Updated weights for policy 0, policy_version 36280 (0.0005) [2023-03-07 14:53:03,045][213771] Updated weights for policy 0, policy_version 36290 (0.0006) [2023-03-07 14:53:03,821][213771] Updated weights for policy 0, policy_version 36300 (0.0005) [2023-03-07 14:53:04,592][213771] Updated weights for policy 0, policy_version 36310 (0.0006) [2023-03-07 14:53:05,383][213771] Updated weights for policy 0, policy_version 36320 (0.0006) [2023-03-07 14:53:06,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13253.0). Total num frames: 37200896. Throughput: 0: 13228.9. Samples: 37184747. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:53:06,105][213445] Avg episode reward: [(0, '4565.955')] [2023-03-07 14:53:06,139][213771] Updated weights for policy 0, policy_version 36330 (0.0006) [2023-03-07 14:53:06,936][213771] Updated weights for policy 0, policy_version 36340 (0.0007) [2023-03-07 14:53:07,692][213771] Updated weights for policy 0, policy_version 36350 (0.0007) [2023-03-07 14:53:08,463][213771] Updated weights for policy 0, policy_version 36360 (0.0006) [2023-03-07 14:53:09,237][213771] Updated weights for policy 0, policy_version 36370 (0.0006) [2023-03-07 14:53:09,980][213771] Updated weights for policy 0, policy_version 36380 (0.0007) [2023-03-07 14:53:10,745][213771] Updated weights for policy 0, policy_version 36390 (0.0006) [2023-03-07 14:53:11,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13226.6, 300 sec: 13253.0). Total num frames: 37267456. Throughput: 0: 13233.0. Samples: 37264433. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:53:11,106][213445] Avg episode reward: [(0, '4510.139')] [2023-03-07 14:53:11,521][213771] Updated weights for policy 0, policy_version 36400 (0.0006) [2023-03-07 14:53:12,287][213771] Updated weights for policy 0, policy_version 36410 (0.0006) [2023-03-07 14:53:13,073][213771] Updated weights for policy 0, policy_version 36420 (0.0006) [2023-03-07 14:53:13,852][213771] Updated weights for policy 0, policy_version 36430 (0.0007) [2023-03-07 14:53:14,630][213771] Updated weights for policy 0, policy_version 36440 (0.0006) [2023-03-07 14:53:15,397][213771] Updated weights for policy 0, policy_version 36450 (0.0006) [2023-03-07 14:53:16,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13243.8, 300 sec: 13256.5). Total num frames: 37334016. Throughput: 0: 13228.3. Samples: 37303982. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:53:16,106][213445] Avg episode reward: [(0, '4461.844')] [2023-03-07 14:53:16,157][213771] Updated weights for policy 0, policy_version 36460 (0.0005) [2023-03-07 14:53:16,934][213771] Updated weights for policy 0, policy_version 36470 (0.0007) [2023-03-07 14:53:17,703][213771] Updated weights for policy 0, policy_version 36480 (0.0006) [2023-03-07 14:53:18,485][213771] Updated weights for policy 0, policy_version 36490 (0.0005) [2023-03-07 14:53:19,253][213771] Updated weights for policy 0, policy_version 36500 (0.0006) [2023-03-07 14:53:20,028][213771] Updated weights for policy 0, policy_version 36510 (0.0006) [2023-03-07 14:53:20,810][213771] Updated weights for policy 0, policy_version 36520 (0.0007) [2023-03-07 14:53:21,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.6, 300 sec: 13253.0). Total num frames: 37399552. Throughput: 0: 13229.2. Samples: 37383468. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:53:21,106][213445] Avg episode reward: [(0, '4528.357')] [2023-03-07 14:53:21,593][213771] Updated weights for policy 0, policy_version 36530 (0.0006) [2023-03-07 14:53:22,369][213771] Updated weights for policy 0, policy_version 36540 (0.0006) [2023-03-07 14:53:23,136][213771] Updated weights for policy 0, policy_version 36550 (0.0006) [2023-03-07 14:53:23,911][213771] Updated weights for policy 0, policy_version 36560 (0.0006) [2023-03-07 14:53:24,694][213771] Updated weights for policy 0, policy_version 36570 (0.0006) [2023-03-07 14:53:25,454][213771] Updated weights for policy 0, policy_version 36580 (0.0006) [2023-03-07 14:53:26,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13226.6, 300 sec: 13253.0). Total num frames: 37466112. Throughput: 0: 13225.9. Samples: 37462820. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:53:26,106][213445] Avg episode reward: [(0, '4531.703')] [2023-03-07 14:53:26,220][213771] Updated weights for policy 0, policy_version 36590 (0.0006) [2023-03-07 14:53:26,998][213771] Updated weights for policy 0, policy_version 36600 (0.0006) [2023-03-07 14:53:27,772][213771] Updated weights for policy 0, policy_version 36610 (0.0006) [2023-03-07 14:53:28,568][213771] Updated weights for policy 0, policy_version 36620 (0.0006) [2023-03-07 14:53:29,357][213771] Updated weights for policy 0, policy_version 36630 (0.0005) [2023-03-07 14:53:30,124][213771] Updated weights for policy 0, policy_version 36640 (0.0006) [2023-03-07 14:53:30,898][213771] Updated weights for policy 0, policy_version 36650 (0.0007) [2023-03-07 14:53:31,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13246.1). Total num frames: 37531648. Throughput: 0: 13225.8. Samples: 37502375. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:53:31,106][213445] Avg episode reward: [(0, '4537.668')] [2023-03-07 14:53:31,681][213771] Updated weights for policy 0, policy_version 36660 (0.0006) [2023-03-07 14:53:32,459][213771] Updated weights for policy 0, policy_version 36670 (0.0005) [2023-03-07 14:53:33,233][213771] Updated weights for policy 0, policy_version 36680 (0.0006) [2023-03-07 14:53:34,019][213771] Updated weights for policy 0, policy_version 36690 (0.0006) [2023-03-07 14:53:34,788][213771] Updated weights for policy 0, policy_version 36700 (0.0006) [2023-03-07 14:53:35,551][213771] Updated weights for policy 0, policy_version 36710 (0.0006) [2023-03-07 14:53:36,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13246.0). Total num frames: 37598208. Throughput: 0: 13224.4. Samples: 37581544. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:53:36,106][213445] Avg episode reward: [(0, '4560.843')] [2023-03-07 14:53:36,305][213771] Updated weights for policy 0, policy_version 36720 (0.0006) [2023-03-07 14:53:37,078][213771] Updated weights for policy 0, policy_version 36730 (0.0006) [2023-03-07 14:53:37,851][213771] Updated weights for policy 0, policy_version 36740 (0.0007) [2023-03-07 14:53:38,625][213771] Updated weights for policy 0, policy_version 36750 (0.0007) [2023-03-07 14:53:39,407][213771] Updated weights for policy 0, policy_version 36760 (0.0006) [2023-03-07 14:53:40,165][213771] Updated weights for policy 0, policy_version 36770 (0.0005) [2023-03-07 14:53:40,939][213771] Updated weights for policy 0, policy_version 36780 (0.0006) [2023-03-07 14:53:41,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 37664768. Throughput: 0: 13235.2. Samples: 37661309. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:53:41,106][213445] Avg episode reward: [(0, '4451.908')] [2023-03-07 14:53:41,735][213771] Updated weights for policy 0, policy_version 36790 (0.0007) [2023-03-07 14:53:42,486][213771] Updated weights for policy 0, policy_version 36800 (0.0006) [2023-03-07 14:53:43,254][213771] Updated weights for policy 0, policy_version 36810 (0.0007) [2023-03-07 14:53:44,024][213771] Updated weights for policy 0, policy_version 36820 (0.0006) [2023-03-07 14:53:44,810][213771] Updated weights for policy 0, policy_version 36830 (0.0006) [2023-03-07 14:53:45,579][213771] Updated weights for policy 0, policy_version 36840 (0.0006) [2023-03-07 14:53:46,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13242.6). Total num frames: 37730304. Throughput: 0: 13239.1. Samples: 37701226. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:53:46,106][213445] Avg episode reward: [(0, '4510.966')] [2023-03-07 14:53:46,368][213771] Updated weights for policy 0, policy_version 36850 (0.0007) [2023-03-07 14:53:47,133][213771] Updated weights for policy 0, policy_version 36860 (0.0006) [2023-03-07 14:53:47,905][213771] Updated weights for policy 0, policy_version 36870 (0.0006) [2023-03-07 14:53:48,699][213771] Updated weights for policy 0, policy_version 36880 (0.0007) [2023-03-07 14:53:49,474][213771] Updated weights for policy 0, policy_version 36890 (0.0005) [2023-03-07 14:53:50,249][213771] Updated weights for policy 0, policy_version 36900 (0.0007) [2023-03-07 14:53:51,013][213771] Updated weights for policy 0, policy_version 36910 (0.0006) [2023-03-07 14:53:51,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.6, 300 sec: 13246.1). Total num frames: 37796864. Throughput: 0: 13234.5. Samples: 37780302. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:53:51,106][213445] Avg episode reward: [(0, '4536.037')] [2023-03-07 14:53:51,784][213771] Updated weights for policy 0, policy_version 36920 (0.0006) [2023-03-07 14:53:52,552][213771] Updated weights for policy 0, policy_version 36930 (0.0007) [2023-03-07 14:53:53,326][213771] Updated weights for policy 0, policy_version 36940 (0.0006) [2023-03-07 14:53:54,089][213771] Updated weights for policy 0, policy_version 36950 (0.0005) [2023-03-07 14:53:54,865][213771] Updated weights for policy 0, policy_version 36960 (0.0007) [2023-03-07 14:53:55,630][213771] Updated weights for policy 0, policy_version 36970 (0.0006) [2023-03-07 14:53:56,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13226.7, 300 sec: 13242.6). Total num frames: 37862400. Throughput: 0: 13233.2. Samples: 37859928. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:53:56,106][213445] Avg episode reward: [(0, '4548.716')] [2023-03-07 14:53:56,415][213771] Updated weights for policy 0, policy_version 36980 (0.0006) [2023-03-07 14:53:57,189][213771] Updated weights for policy 0, policy_version 36990 (0.0006) [2023-03-07 14:53:57,956][213771] Updated weights for policy 0, policy_version 37000 (0.0007) [2023-03-07 14:53:58,720][213771] Updated weights for policy 0, policy_version 37010 (0.0007) [2023-03-07 14:53:59,498][213771] Updated weights for policy 0, policy_version 37020 (0.0006) [2023-03-07 14:54:00,274][213771] Updated weights for policy 0, policy_version 37030 (0.0006) [2023-03-07 14:54:01,043][213771] Updated weights for policy 0, policy_version 37040 (0.0007) [2023-03-07 14:54:01,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.6, 300 sec: 13242.6). Total num frames: 37928960. Throughput: 0: 13238.5. Samples: 37899716. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:54:01,106][213445] Avg episode reward: [(0, '4552.374')] [2023-03-07 14:54:01,821][213771] Updated weights for policy 0, policy_version 37050 (0.0007) [2023-03-07 14:54:02,618][213771] Updated weights for policy 0, policy_version 37060 (0.0007) [2023-03-07 14:54:03,380][213771] Updated weights for policy 0, policy_version 37070 (0.0006) [2023-03-07 14:54:04,149][213771] Updated weights for policy 0, policy_version 37080 (0.0006) [2023-03-07 14:54:04,905][213771] Updated weights for policy 0, policy_version 37090 (0.0006) [2023-03-07 14:54:05,699][213771] Updated weights for policy 0, policy_version 37100 (0.0005) [2023-03-07 14:54:06,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 37995520. Throughput: 0: 13236.0. Samples: 37979088. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:54:06,106][213445] Avg episode reward: [(0, '4459.530')] [2023-03-07 14:54:06,111][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000037105_37995520.pth... [2023-03-07 14:54:06,143][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000034002_34818048.pth [2023-03-07 14:54:06,469][213771] Updated weights for policy 0, policy_version 37110 (0.0006) [2023-03-07 14:54:07,260][213771] Updated weights for policy 0, policy_version 37120 (0.0007) [2023-03-07 14:54:08,022][213771] Updated weights for policy 0, policy_version 37130 (0.0006) [2023-03-07 14:54:08,791][213771] Updated weights for policy 0, policy_version 37140 (0.0005) [2023-03-07 14:54:09,561][213771] Updated weights for policy 0, policy_version 37150 (0.0006) [2023-03-07 14:54:10,338][213771] Updated weights for policy 0, policy_version 37160 (0.0005) [2023-03-07 14:54:11,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13243.8, 300 sec: 13246.1). Total num frames: 38062080. Throughput: 0: 13236.5. Samples: 38058459. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:54:11,105][213445] Avg episode reward: [(0, '4417.193')] [2023-03-07 14:54:11,107][213771] Updated weights for policy 0, policy_version 37170 (0.0006) [2023-03-07 14:54:11,870][213771] Updated weights for policy 0, policy_version 37180 (0.0006) [2023-03-07 14:54:12,648][213771] Updated weights for policy 0, policy_version 37190 (0.0006) [2023-03-07 14:54:13,422][213771] Updated weights for policy 0, policy_version 37200 (0.0006) [2023-03-07 14:54:14,197][213771] Updated weights for policy 0, policy_version 37210 (0.0005) [2023-03-07 14:54:14,964][213771] Updated weights for policy 0, policy_version 37220 (0.0006) [2023-03-07 14:54:15,732][213771] Updated weights for policy 0, policy_version 37230 (0.0007) [2023-03-07 14:54:16,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13226.7, 300 sec: 13242.6). Total num frames: 38127616. Throughput: 0: 13241.5. Samples: 38098241. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:54:16,106][213445] Avg episode reward: [(0, '4450.744')] [2023-03-07 14:54:16,506][213771] Updated weights for policy 0, policy_version 37240 (0.0006) [2023-03-07 14:54:17,283][213771] Updated weights for policy 0, policy_version 37250 (0.0006) [2023-03-07 14:54:18,051][213771] Updated weights for policy 0, policy_version 37260 (0.0006) [2023-03-07 14:54:18,817][213771] Updated weights for policy 0, policy_version 37270 (0.0006) [2023-03-07 14:54:19,591][213771] Updated weights for policy 0, policy_version 37280 (0.0008) [2023-03-07 14:54:20,370][213771] Updated weights for policy 0, policy_version 37290 (0.0007) [2023-03-07 14:54:21,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 38194176. Throughput: 0: 13256.0. Samples: 38178063. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:54:21,106][213445] Avg episode reward: [(0, '4419.136')] [2023-03-07 14:54:21,129][213771] Updated weights for policy 0, policy_version 37300 (0.0006) [2023-03-07 14:54:21,890][213771] Updated weights for policy 0, policy_version 37310 (0.0006) [2023-03-07 14:54:22,679][213771] Updated weights for policy 0, policy_version 37320 (0.0006) [2023-03-07 14:54:23,468][213771] Updated weights for policy 0, policy_version 37330 (0.0006) [2023-03-07 14:54:24,236][213771] Updated weights for policy 0, policy_version 37340 (0.0007) [2023-03-07 14:54:25,006][213771] Updated weights for policy 0, policy_version 37350 (0.0006) [2023-03-07 14:54:25,779][213771] Updated weights for policy 0, policy_version 37360 (0.0006) [2023-03-07 14:54:26,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13243.8, 300 sec: 13242.6). Total num frames: 38260736. Throughput: 0: 13244.1. Samples: 38257290. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:54:26,106][213445] Avg episode reward: [(0, '4384.315')] [2023-03-07 14:54:26,555][213771] Updated weights for policy 0, policy_version 37370 (0.0005) [2023-03-07 14:54:27,321][213771] Updated weights for policy 0, policy_version 37380 (0.0005) [2023-03-07 14:54:28,091][213771] Updated weights for policy 0, policy_version 37390 (0.0006) [2023-03-07 14:54:28,873][213771] Updated weights for policy 0, policy_version 37400 (0.0006) [2023-03-07 14:54:29,645][213771] Updated weights for policy 0, policy_version 37410 (0.0006) [2023-03-07 14:54:30,411][213771] Updated weights for policy 0, policy_version 37420 (0.0006) [2023-03-07 14:54:31,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13239.1). Total num frames: 38326272. Throughput: 0: 13238.3. Samples: 38296952. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:54:31,106][213445] Avg episode reward: [(0, '4362.758')] [2023-03-07 14:54:31,189][213771] Updated weights for policy 0, policy_version 37430 (0.0006) [2023-03-07 14:54:31,965][213771] Updated weights for policy 0, policy_version 37440 (0.0006) [2023-03-07 14:54:32,749][213771] Updated weights for policy 0, policy_version 37450 (0.0006) [2023-03-07 14:54:33,520][213771] Updated weights for policy 0, policy_version 37460 (0.0008) [2023-03-07 14:54:34,310][213771] Updated weights for policy 0, policy_version 37470 (0.0006) [2023-03-07 14:54:35,071][213771] Updated weights for policy 0, policy_version 37480 (0.0007) [2023-03-07 14:54:35,842][213771] Updated weights for policy 0, policy_version 37490 (0.0005) [2023-03-07 14:54:36,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 38392832. Throughput: 0: 13246.2. Samples: 38376379. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:54:36,106][213445] Avg episode reward: [(0, '4387.577')] [2023-03-07 14:54:36,620][213771] Updated weights for policy 0, policy_version 37500 (0.0006) [2023-03-07 14:54:37,401][213771] Updated weights for policy 0, policy_version 37510 (0.0006) [2023-03-07 14:54:38,165][213771] Updated weights for policy 0, policy_version 37520 (0.0007) [2023-03-07 14:54:38,942][213771] Updated weights for policy 0, policy_version 37530 (0.0006) [2023-03-07 14:54:39,685][213771] Updated weights for policy 0, policy_version 37540 (0.0006) [2023-03-07 14:54:40,474][213771] Updated weights for policy 0, policy_version 37550 (0.0007) [2023-03-07 14:54:41,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13243.8, 300 sec: 13242.6). Total num frames: 38459392. Throughput: 0: 13248.3. Samples: 38456099. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:54:41,106][213445] Avg episode reward: [(0, '4252.030')] [2023-03-07 14:54:41,239][213771] Updated weights for policy 0, policy_version 37560 (0.0006) [2023-03-07 14:54:42,005][213771] Updated weights for policy 0, policy_version 37570 (0.0006) [2023-03-07 14:54:42,769][213771] Updated weights for policy 0, policy_version 37580 (0.0006) [2023-03-07 14:54:43,554][213771] Updated weights for policy 0, policy_version 37590 (0.0006) [2023-03-07 14:54:44,321][213771] Updated weights for policy 0, policy_version 37600 (0.0006) [2023-03-07 14:54:45,098][213771] Updated weights for policy 0, policy_version 37610 (0.0007) [2023-03-07 14:54:45,885][213771] Updated weights for policy 0, policy_version 37620 (0.0006) [2023-03-07 14:54:46,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13242.6). Total num frames: 38525952. Throughput: 0: 13249.3. Samples: 38495934. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:54:46,106][213445] Avg episode reward: [(0, '4378.441')] [2023-03-07 14:54:46,655][213771] Updated weights for policy 0, policy_version 37630 (0.0006) [2023-03-07 14:54:47,413][213771] Updated weights for policy 0, policy_version 37640 (0.0006) [2023-03-07 14:54:48,197][213771] Updated weights for policy 0, policy_version 37650 (0.0006) [2023-03-07 14:54:48,979][213771] Updated weights for policy 0, policy_version 37660 (0.0006) [2023-03-07 14:54:49,743][213771] Updated weights for policy 0, policy_version 37670 (0.0006) [2023-03-07 14:54:50,517][213771] Updated weights for policy 0, policy_version 37680 (0.0006) [2023-03-07 14:54:51,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13239.1). Total num frames: 38591488. Throughput: 0: 13252.0. Samples: 38575426. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:54:51,106][213445] Avg episode reward: [(0, '4437.001')] [2023-03-07 14:54:51,281][213771] Updated weights for policy 0, policy_version 37690 (0.0006) [2023-03-07 14:54:52,046][213771] Updated weights for policy 0, policy_version 37700 (0.0005) [2023-03-07 14:54:52,823][213771] Updated weights for policy 0, policy_version 37710 (0.0007) [2023-03-07 14:54:53,609][213771] Updated weights for policy 0, policy_version 37720 (0.0006) [2023-03-07 14:54:54,385][213771] Updated weights for policy 0, policy_version 37730 (0.0007) [2023-03-07 14:54:55,163][213771] Updated weights for policy 0, policy_version 37740 (0.0006) [2023-03-07 14:54:55,940][213771] Updated weights for policy 0, policy_version 37750 (0.0006) [2023-03-07 14:54:56,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13242.6). Total num frames: 38658048. Throughput: 0: 13252.6. Samples: 38654829. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:54:56,106][213445] Avg episode reward: [(0, '4415.379')] [2023-03-07 14:54:56,722][213771] Updated weights for policy 0, policy_version 37760 (0.0006) [2023-03-07 14:54:57,477][213771] Updated weights for policy 0, policy_version 37770 (0.0006) [2023-03-07 14:54:58,237][213771] Updated weights for policy 0, policy_version 37780 (0.0007) [2023-03-07 14:54:59,003][213771] Updated weights for policy 0, policy_version 37790 (0.0006) [2023-03-07 14:54:59,762][213771] Updated weights for policy 0, policy_version 37800 (0.0007) [2023-03-07 14:55:00,550][213771] Updated weights for policy 0, policy_version 37810 (0.0006) [2023-03-07 14:55:01,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13242.6). Total num frames: 38724608. Throughput: 0: 13251.9. Samples: 38694577. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:55:01,106][213445] Avg episode reward: [(0, '4456.445')] [2023-03-07 14:55:01,310][213771] Updated weights for policy 0, policy_version 37820 (0.0006) [2023-03-07 14:55:02,089][213771] Updated weights for policy 0, policy_version 37830 (0.0007) [2023-03-07 14:55:02,862][213771] Updated weights for policy 0, policy_version 37840 (0.0006) [2023-03-07 14:55:03,638][213771] Updated weights for policy 0, policy_version 37850 (0.0006) [2023-03-07 14:55:04,413][213771] Updated weights for policy 0, policy_version 37860 (0.0006) [2023-03-07 14:55:05,179][213771] Updated weights for policy 0, policy_version 37870 (0.0005) [2023-03-07 14:55:05,947][213771] Updated weights for policy 0, policy_version 37880 (0.0007) [2023-03-07 14:55:06,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.8, 300 sec: 13239.1). Total num frames: 38790144. Throughput: 0: 13247.6. Samples: 38774207. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:55:06,106][213445] Avg episode reward: [(0, '4386.684')] [2023-03-07 14:55:06,730][213771] Updated weights for policy 0, policy_version 37890 (0.0006) [2023-03-07 14:55:07,519][213771] Updated weights for policy 0, policy_version 37900 (0.0006) [2023-03-07 14:55:08,290][213771] Updated weights for policy 0, policy_version 37910 (0.0005) [2023-03-07 14:55:09,057][213771] Updated weights for policy 0, policy_version 37920 (0.0006) [2023-03-07 14:55:09,838][213771] Updated weights for policy 0, policy_version 37930 (0.0005) [2023-03-07 14:55:10,596][213771] Updated weights for policy 0, policy_version 37940 (0.0006) [2023-03-07 14:55:11,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 38856704. Throughput: 0: 13253.4. Samples: 38853695. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:55:11,106][213445] Avg episode reward: [(0, '4368.621')] [2023-03-07 14:55:11,373][213771] Updated weights for policy 0, policy_version 37950 (0.0007) [2023-03-07 14:55:12,129][213771] Updated weights for policy 0, policy_version 37960 (0.0006) [2023-03-07 14:55:12,920][213771] Updated weights for policy 0, policy_version 37970 (0.0005) [2023-03-07 14:55:13,681][213771] Updated weights for policy 0, policy_version 37980 (0.0006) [2023-03-07 14:55:14,449][213771] Updated weights for policy 0, policy_version 37990 (0.0006) [2023-03-07 14:55:15,212][213771] Updated weights for policy 0, policy_version 38000 (0.0006) [2023-03-07 14:55:15,989][213771] Updated weights for policy 0, policy_version 38010 (0.0005) [2023-03-07 14:55:16,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13260.8, 300 sec: 13242.6). Total num frames: 38923264. Throughput: 0: 13258.1. Samples: 38893565. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:55:16,106][213445] Avg episode reward: [(0, '4368.094')] [2023-03-07 14:55:16,754][213771] Updated weights for policy 0, policy_version 38020 (0.0006) [2023-03-07 14:55:17,539][213771] Updated weights for policy 0, policy_version 38030 (0.0006) [2023-03-07 14:55:18,312][213771] Updated weights for policy 0, policy_version 38040 (0.0006) [2023-03-07 14:55:19,085][213771] Updated weights for policy 0, policy_version 38050 (0.0006) [2023-03-07 14:55:19,863][213771] Updated weights for policy 0, policy_version 38060 (0.0006) [2023-03-07 14:55:20,659][213771] Updated weights for policy 0, policy_version 38070 (0.0005) [2023-03-07 14:55:21,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13239.1). Total num frames: 38988800. Throughput: 0: 13258.2. Samples: 38972996. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:55:21,106][213445] Avg episode reward: [(0, '4380.902')] [2023-03-07 14:55:21,425][213771] Updated weights for policy 0, policy_version 38080 (0.0006) [2023-03-07 14:55:22,192][213771] Updated weights for policy 0, policy_version 38090 (0.0006) [2023-03-07 14:55:22,981][213771] Updated weights for policy 0, policy_version 38100 (0.0006) [2023-03-07 14:55:23,762][213771] Updated weights for policy 0, policy_version 38110 (0.0006) [2023-03-07 14:55:24,526][213771] Updated weights for policy 0, policy_version 38120 (0.0005) [2023-03-07 14:55:25,314][213771] Updated weights for policy 0, policy_version 38130 (0.0006) [2023-03-07 14:55:26,077][213771] Updated weights for policy 0, policy_version 38140 (0.0006) [2023-03-07 14:55:26,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13239.1). Total num frames: 39055360. Throughput: 0: 13249.7. Samples: 39052337. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:55:26,106][213445] Avg episode reward: [(0, '4381.545')] [2023-03-07 14:55:26,845][213771] Updated weights for policy 0, policy_version 38150 (0.0006) [2023-03-07 14:55:27,639][213771] Updated weights for policy 0, policy_version 38160 (0.0007) [2023-03-07 14:55:28,389][213771] Updated weights for policy 0, policy_version 38170 (0.0005) [2023-03-07 14:55:29,150][213771] Updated weights for policy 0, policy_version 38180 (0.0006) [2023-03-07 14:55:29,947][213771] Updated weights for policy 0, policy_version 38190 (0.0006) [2023-03-07 14:55:30,710][213771] Updated weights for policy 0, policy_version 38200 (0.0006) [2023-03-07 14:55:31,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13239.1). Total num frames: 39121920. Throughput: 0: 13245.7. Samples: 39091989. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:55:31,116][213445] Avg episode reward: [(0, '4388.622')] [2023-03-07 14:55:31,485][213771] Updated weights for policy 0, policy_version 38210 (0.0006) [2023-03-07 14:55:32,248][213771] Updated weights for policy 0, policy_version 38220 (0.0006) [2023-03-07 14:55:33,017][213771] Updated weights for policy 0, policy_version 38230 (0.0006) [2023-03-07 14:55:33,780][213771] Updated weights for policy 0, policy_version 38240 (0.0006) [2023-03-07 14:55:34,561][213771] Updated weights for policy 0, policy_version 38250 (0.0006) [2023-03-07 14:55:35,341][213771] Updated weights for policy 0, policy_version 38260 (0.0006) [2023-03-07 14:55:36,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13235.6). Total num frames: 39187456. Throughput: 0: 13250.7. Samples: 39171710. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:55:36,115][213771] Updated weights for policy 0, policy_version 38270 (0.0007) [2023-03-07 14:55:36,116][213445] Avg episode reward: [(0, '4433.935')] [2023-03-07 14:55:36,879][213771] Updated weights for policy 0, policy_version 38280 (0.0007) [2023-03-07 14:55:37,655][213771] Updated weights for policy 0, policy_version 38290 (0.0006) [2023-03-07 14:55:38,424][213771] Updated weights for policy 0, policy_version 38300 (0.0006) [2023-03-07 14:55:39,201][213771] Updated weights for policy 0, policy_version 38310 (0.0006) [2023-03-07 14:55:39,977][213771] Updated weights for policy 0, policy_version 38320 (0.0006) [2023-03-07 14:55:40,738][213771] Updated weights for policy 0, policy_version 38330 (0.0006) [2023-03-07 14:55:41,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13239.1). Total num frames: 39254016. Throughput: 0: 13252.1. Samples: 39251172. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:55:41,106][213445] Avg episode reward: [(0, '4402.552')] [2023-03-07 14:55:41,527][213771] Updated weights for policy 0, policy_version 38340 (0.0006) [2023-03-07 14:55:42,299][213771] Updated weights for policy 0, policy_version 38350 (0.0006) [2023-03-07 14:55:43,057][213771] Updated weights for policy 0, policy_version 38360 (0.0006) [2023-03-07 14:55:43,828][213771] Updated weights for policy 0, policy_version 38370 (0.0006) [2023-03-07 14:55:44,610][213771] Updated weights for policy 0, policy_version 38380 (0.0006) [2023-03-07 14:55:45,377][213771] Updated weights for policy 0, policy_version 38390 (0.0006) [2023-03-07 14:55:46,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13243.7, 300 sec: 13239.1). Total num frames: 39320576. Throughput: 0: 13250.3. Samples: 39290841. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:55:46,106][213445] Avg episode reward: [(0, '4387.818')] [2023-03-07 14:55:46,155][213771] Updated weights for policy 0, policy_version 38400 (0.0006) [2023-03-07 14:55:46,923][213771] Updated weights for policy 0, policy_version 38410 (0.0006) [2023-03-07 14:55:47,704][213771] Updated weights for policy 0, policy_version 38420 (0.0006) [2023-03-07 14:55:48,463][213771] Updated weights for policy 0, policy_version 38430 (0.0006) [2023-03-07 14:55:49,231][213771] Updated weights for policy 0, policy_version 38440 (0.0006) [2023-03-07 14:55:50,012][213771] Updated weights for policy 0, policy_version 38450 (0.0006) [2023-03-07 14:55:50,783][213771] Updated weights for policy 0, policy_version 38460 (0.0006) [2023-03-07 14:55:51,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13239.1). Total num frames: 39387136. Throughput: 0: 13248.2. Samples: 39370376. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:55:51,106][213445] Avg episode reward: [(0, '4302.324')] [2023-03-07 14:55:51,546][213771] Updated weights for policy 0, policy_version 38470 (0.0006) [2023-03-07 14:55:52,317][213771] Updated weights for policy 0, policy_version 38480 (0.0006) [2023-03-07 14:55:53,086][213771] Updated weights for policy 0, policy_version 38490 (0.0006) [2023-03-07 14:55:53,861][213771] Updated weights for policy 0, policy_version 38500 (0.0007) [2023-03-07 14:55:54,621][213771] Updated weights for policy 0, policy_version 38510 (0.0006) [2023-03-07 14:55:55,384][213771] Updated weights for policy 0, policy_version 38520 (0.0006) [2023-03-07 14:55:56,105][213445] Fps is (10 sec: 13311.7, 60 sec: 13260.8, 300 sec: 13242.6). Total num frames: 39453696. Throughput: 0: 13259.3. Samples: 39450364. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:55:56,106][213445] Avg episode reward: [(0, '4395.452')] [2023-03-07 14:55:56,160][213771] Updated weights for policy 0, policy_version 38530 (0.0006) [2023-03-07 14:55:56,922][213771] Updated weights for policy 0, policy_version 38540 (0.0006) [2023-03-07 14:55:57,685][213771] Updated weights for policy 0, policy_version 38550 (0.0006) [2023-03-07 14:55:58,477][213771] Updated weights for policy 0, policy_version 38560 (0.0005) [2023-03-07 14:55:59,233][213771] Updated weights for policy 0, policy_version 38570 (0.0006) [2023-03-07 14:56:00,004][213771] Updated weights for policy 0, policy_version 38580 (0.0006) [2023-03-07 14:56:00,783][213771] Updated weights for policy 0, policy_version 38590 (0.0007) [2023-03-07 14:56:01,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13246.0). Total num frames: 39520256. Throughput: 0: 13261.5. Samples: 39490333. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:56:01,106][213445] Avg episode reward: [(0, '4446.696')] [2023-03-07 14:56:01,560][213771] Updated weights for policy 0, policy_version 38600 (0.0006) [2023-03-07 14:56:02,332][213771] Updated weights for policy 0, policy_version 38610 (0.0006) [2023-03-07 14:56:03,106][213771] Updated weights for policy 0, policy_version 38620 (0.0005) [2023-03-07 14:56:03,885][213771] Updated weights for policy 0, policy_version 38630 (0.0006) [2023-03-07 14:56:04,677][213771] Updated weights for policy 0, policy_version 38640 (0.0006) [2023-03-07 14:56:05,456][213771] Updated weights for policy 0, policy_version 38650 (0.0006) [2023-03-07 14:56:06,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13260.8, 300 sec: 13242.6). Total num frames: 39585792. Throughput: 0: 13257.0. Samples: 39569560. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:56:06,106][213445] Avg episode reward: [(0, '4515.826')] [2023-03-07 14:56:06,110][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000038658_39585792.pth... [2023-03-07 14:56:06,143][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000035554_36407296.pth [2023-03-07 14:56:06,225][213771] Updated weights for policy 0, policy_version 38660 (0.0006) [2023-03-07 14:56:06,993][213771] Updated weights for policy 0, policy_version 38670 (0.0005) [2023-03-07 14:56:07,765][213771] Updated weights for policy 0, policy_version 38680 (0.0006) [2023-03-07 14:56:08,547][213771] Updated weights for policy 0, policy_version 38690 (0.0006) [2023-03-07 14:56:09,324][213771] Updated weights for policy 0, policy_version 38700 (0.0005) [2023-03-07 14:56:10,097][213771] Updated weights for policy 0, policy_version 38710 (0.0006) [2023-03-07 14:56:10,866][213771] Updated weights for policy 0, policy_version 38720 (0.0005) [2023-03-07 14:56:11,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13242.6). Total num frames: 39652352. Throughput: 0: 13255.3. Samples: 39648826. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:56:11,106][213445] Avg episode reward: [(0, '4488.760')] [2023-03-07 14:56:11,659][213771] Updated weights for policy 0, policy_version 38730 (0.0006) [2023-03-07 14:56:12,432][213771] Updated weights for policy 0, policy_version 38740 (0.0006) [2023-03-07 14:56:13,203][213771] Updated weights for policy 0, policy_version 38750 (0.0006) [2023-03-07 14:56:13,979][213771] Updated weights for policy 0, policy_version 38760 (0.0006) [2023-03-07 14:56:14,752][213771] Updated weights for policy 0, policy_version 38770 (0.0006) [2023-03-07 14:56:15,506][213771] Updated weights for policy 0, policy_version 38780 (0.0006) [2023-03-07 14:56:16,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13239.1). Total num frames: 39717888. Throughput: 0: 13251.9. Samples: 39688328. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:56:16,106][213445] Avg episode reward: [(0, '4479.173')] [2023-03-07 14:56:16,280][213771] Updated weights for policy 0, policy_version 38790 (0.0006) [2023-03-07 14:56:17,054][213771] Updated weights for policy 0, policy_version 38800 (0.0006) [2023-03-07 14:56:17,843][213771] Updated weights for policy 0, policy_version 38810 (0.0006) [2023-03-07 14:56:18,621][213771] Updated weights for policy 0, policy_version 38820 (0.0006) [2023-03-07 14:56:19,406][213771] Updated weights for policy 0, policy_version 38830 (0.0006) [2023-03-07 14:56:20,193][213771] Updated weights for policy 0, policy_version 38840 (0.0006) [2023-03-07 14:56:20,955][213771] Updated weights for policy 0, policy_version 38850 (0.0008) [2023-03-07 14:56:21,105][213445] Fps is (10 sec: 13107.3, 60 sec: 13243.7, 300 sec: 13235.6). Total num frames: 39783424. Throughput: 0: 13238.9. Samples: 39767459. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:56:21,106][213445] Avg episode reward: [(0, '4466.428')] [2023-03-07 14:56:21,724][213771] Updated weights for policy 0, policy_version 38860 (0.0006) [2023-03-07 14:56:22,490][213771] Updated weights for policy 0, policy_version 38870 (0.0006) [2023-03-07 14:56:23,282][213771] Updated weights for policy 0, policy_version 38880 (0.0006) [2023-03-07 14:56:24,040][213771] Updated weights for policy 0, policy_version 38890 (0.0006) [2023-03-07 14:56:24,826][213771] Updated weights for policy 0, policy_version 38900 (0.0006) [2023-03-07 14:56:25,606][213771] Updated weights for policy 0, policy_version 38910 (0.0006) [2023-03-07 14:56:26,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.8, 300 sec: 13239.1). Total num frames: 39849984. Throughput: 0: 13241.8. Samples: 39847052. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:56:26,106][213445] Avg episode reward: [(0, '4510.808')] [2023-03-07 14:56:26,359][213771] Updated weights for policy 0, policy_version 38920 (0.0005) [2023-03-07 14:56:27,126][213771] Updated weights for policy 0, policy_version 38930 (0.0006) [2023-03-07 14:56:27,897][213771] Updated weights for policy 0, policy_version 38940 (0.0005) [2023-03-07 14:56:28,663][213771] Updated weights for policy 0, policy_version 38950 (0.0006) [2023-03-07 14:56:29,427][213771] Updated weights for policy 0, policy_version 38960 (0.0006) [2023-03-07 14:56:30,206][213771] Updated weights for policy 0, policy_version 38970 (0.0006) [2023-03-07 14:56:30,987][213771] Updated weights for policy 0, policy_version 38980 (0.0006) [2023-03-07 14:56:31,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13243.7, 300 sec: 13239.1). Total num frames: 39916544. Throughput: 0: 13250.0. Samples: 39887093. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:56:31,106][213445] Avg episode reward: [(0, '4517.687')] [2023-03-07 14:56:31,771][213771] Updated weights for policy 0, policy_version 38990 (0.0006) [2023-03-07 14:56:32,552][213771] Updated weights for policy 0, policy_version 39000 (0.0005) [2023-03-07 14:56:33,332][213771] Updated weights for policy 0, policy_version 39010 (0.0007) [2023-03-07 14:56:34,087][213771] Updated weights for policy 0, policy_version 39020 (0.0006) [2023-03-07 14:56:34,876][213771] Updated weights for policy 0, policy_version 39030 (0.0007) [2023-03-07 14:56:35,644][213771] Updated weights for policy 0, policy_version 39040 (0.0006) [2023-03-07 14:56:36,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13260.8, 300 sec: 13239.1). Total num frames: 39983104. Throughput: 0: 13239.3. Samples: 39966143. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:56:36,106][213445] Avg episode reward: [(0, '4443.938')] [2023-03-07 14:56:36,405][213771] Updated weights for policy 0, policy_version 39050 (0.0005) [2023-03-07 14:56:37,191][213771] Updated weights for policy 0, policy_version 39060 (0.0006) [2023-03-07 14:56:37,978][213771] Updated weights for policy 0, policy_version 39070 (0.0007) [2023-03-07 14:56:38,742][213771] Updated weights for policy 0, policy_version 39080 (0.0007) [2023-03-07 14:56:39,517][213771] Updated weights for policy 0, policy_version 39090 (0.0006) [2023-03-07 14:56:40,311][213771] Updated weights for policy 0, policy_version 39100 (0.0006) [2023-03-07 14:56:41,080][213771] Updated weights for policy 0, policy_version 39110 (0.0006) [2023-03-07 14:56:41,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13243.7, 300 sec: 13239.1). Total num frames: 40048640. Throughput: 0: 13226.4. Samples: 40045553. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:56:41,106][213445] Avg episode reward: [(0, '4405.206')] [2023-03-07 14:56:41,845][213771] Updated weights for policy 0, policy_version 39120 (0.0005) [2023-03-07 14:56:42,608][213771] Updated weights for policy 0, policy_version 39130 (0.0007) [2023-03-07 14:56:43,379][213771] Updated weights for policy 0, policy_version 39140 (0.0007) [2023-03-07 14:56:44,148][213771] Updated weights for policy 0, policy_version 39150 (0.0006) [2023-03-07 14:56:44,925][213771] Updated weights for policy 0, policy_version 39160 (0.0007) [2023-03-07 14:56:45,690][213771] Updated weights for policy 0, policy_version 39170 (0.0006) [2023-03-07 14:56:46,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13243.7, 300 sec: 13239.1). Total num frames: 40115200. Throughput: 0: 13223.6. Samples: 40085392. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:56:46,105][213445] Avg episode reward: [(0, '4447.415')] [2023-03-07 14:56:46,466][213771] Updated weights for policy 0, policy_version 39180 (0.0006) [2023-03-07 14:56:47,234][213771] Updated weights for policy 0, policy_version 39190 (0.0006) [2023-03-07 14:56:48,012][213771] Updated weights for policy 0, policy_version 39200 (0.0006) [2023-03-07 14:56:48,794][213771] Updated weights for policy 0, policy_version 39210 (0.0006) [2023-03-07 14:56:49,568][213771] Updated weights for policy 0, policy_version 39220 (0.0006) [2023-03-07 14:56:50,345][213771] Updated weights for policy 0, policy_version 39230 (0.0006) [2023-03-07 14:56:51,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13226.7, 300 sec: 13239.1). Total num frames: 40180736. Throughput: 0: 13226.1. Samples: 40164735. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:56:51,106][213445] Avg episode reward: [(0, '4303.782')] [2023-03-07 14:56:51,140][213771] Updated weights for policy 0, policy_version 39240 (0.0006) [2023-03-07 14:56:51,895][213771] Updated weights for policy 0, policy_version 39250 (0.0006) [2023-03-07 14:56:52,666][213771] Updated weights for policy 0, policy_version 39260 (0.0005) [2023-03-07 14:56:53,445][213771] Updated weights for policy 0, policy_version 39270 (0.0006) [2023-03-07 14:56:54,212][213771] Updated weights for policy 0, policy_version 39280 (0.0006) [2023-03-07 14:56:54,971][213771] Updated weights for policy 0, policy_version 39290 (0.0006) [2023-03-07 14:56:55,745][213771] Updated weights for policy 0, policy_version 39300 (0.0007) [2023-03-07 14:56:56,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13226.7, 300 sec: 13239.1). Total num frames: 40247296. Throughput: 0: 13234.2. Samples: 40244366. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:56:56,106][213445] Avg episode reward: [(0, '4431.601')] [2023-03-07 14:56:56,517][213771] Updated weights for policy 0, policy_version 39310 (0.0005) [2023-03-07 14:56:57,285][213771] Updated weights for policy 0, policy_version 39320 (0.0007) [2023-03-07 14:56:58,053][213771] Updated weights for policy 0, policy_version 39330 (0.0007) [2023-03-07 14:56:58,823][213771] Updated weights for policy 0, policy_version 39340 (0.0006) [2023-03-07 14:56:59,580][213771] Updated weights for policy 0, policy_version 39350 (0.0007) [2023-03-07 14:57:00,355][213771] Updated weights for policy 0, policy_version 39360 (0.0006) [2023-03-07 14:57:01,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13226.7, 300 sec: 13242.6). Total num frames: 40313856. Throughput: 0: 13244.2. Samples: 40284316. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:57:01,106][213445] Avg episode reward: [(0, '4438.400')] [2023-03-07 14:57:01,128][213771] Updated weights for policy 0, policy_version 39370 (0.0006) [2023-03-07 14:57:01,882][213771] Updated weights for policy 0, policy_version 39380 (0.0006) [2023-03-07 14:57:02,652][213771] Updated weights for policy 0, policy_version 39390 (0.0006) [2023-03-07 14:57:03,438][213771] Updated weights for policy 0, policy_version 39400 (0.0006) [2023-03-07 14:57:04,206][213771] Updated weights for policy 0, policy_version 39410 (0.0006) [2023-03-07 14:57:04,982][213771] Updated weights for policy 0, policy_version 39420 (0.0005) [2023-03-07 14:57:05,737][213771] Updated weights for policy 0, policy_version 39430 (0.0005) [2023-03-07 14:57:06,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 40380416. Throughput: 0: 13265.1. Samples: 40364388. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:57:06,106][213445] Avg episode reward: [(0, '4473.508')] [2023-03-07 14:57:06,515][213771] Updated weights for policy 0, policy_version 39440 (0.0006) [2023-03-07 14:57:07,292][213771] Updated weights for policy 0, policy_version 39450 (0.0006) [2023-03-07 14:57:08,071][213771] Updated weights for policy 0, policy_version 39460 (0.0006) [2023-03-07 14:57:08,842][213771] Updated weights for policy 0, policy_version 39470 (0.0007) [2023-03-07 14:57:09,611][213771] Updated weights for policy 0, policy_version 39480 (0.0006) [2023-03-07 14:57:10,379][213771] Updated weights for policy 0, policy_version 39490 (0.0006) [2023-03-07 14:57:11,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13243.8, 300 sec: 13246.1). Total num frames: 40446976. Throughput: 0: 13258.0. Samples: 40443664. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:57:11,106][213445] Avg episode reward: [(0, '4412.751')] [2023-03-07 14:57:11,155][213771] Updated weights for policy 0, policy_version 39500 (0.0006) [2023-03-07 14:57:11,925][213771] Updated weights for policy 0, policy_version 39510 (0.0006) [2023-03-07 14:57:12,710][213771] Updated weights for policy 0, policy_version 39520 (0.0006) [2023-03-07 14:57:13,474][213771] Updated weights for policy 0, policy_version 39530 (0.0006) [2023-03-07 14:57:14,265][213771] Updated weights for policy 0, policy_version 39540 (0.0006) [2023-03-07 14:57:15,044][213771] Updated weights for policy 0, policy_version 39550 (0.0007) [2023-03-07 14:57:15,822][213771] Updated weights for policy 0, policy_version 39560 (0.0006) [2023-03-07 14:57:16,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 40512512. Throughput: 0: 13250.0. Samples: 40483344. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:57:16,106][213445] Avg episode reward: [(0, '4494.151')] [2023-03-07 14:57:16,609][213771] Updated weights for policy 0, policy_version 39570 (0.0006) [2023-03-07 14:57:17,380][213771] Updated weights for policy 0, policy_version 39580 (0.0005) [2023-03-07 14:57:18,157][213771] Updated weights for policy 0, policy_version 39590 (0.0006) [2023-03-07 14:57:18,924][213771] Updated weights for policy 0, policy_version 39600 (0.0006) [2023-03-07 14:57:19,698][213771] Updated weights for policy 0, policy_version 39610 (0.0006) [2023-03-07 14:57:20,481][213771] Updated weights for policy 0, policy_version 39620 (0.0006) [2023-03-07 14:57:21,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13242.6). Total num frames: 40579072. Throughput: 0: 13254.1. Samples: 40562576. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:57:21,106][213445] Avg episode reward: [(0, '4454.802')] [2023-03-07 14:57:21,241][213771] Updated weights for policy 0, policy_version 39630 (0.0006) [2023-03-07 14:57:22,008][213771] Updated weights for policy 0, policy_version 39640 (0.0007) [2023-03-07 14:57:22,793][213771] Updated weights for policy 0, policy_version 39650 (0.0007) [2023-03-07 14:57:23,561][213771] Updated weights for policy 0, policy_version 39660 (0.0006) [2023-03-07 14:57:24,313][213771] Updated weights for policy 0, policy_version 39670 (0.0006) [2023-03-07 14:57:25,110][213771] Updated weights for policy 0, policy_version 39680 (0.0006) [2023-03-07 14:57:25,874][213771] Updated weights for policy 0, policy_version 39690 (0.0006) [2023-03-07 14:57:26,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 40644608. Throughput: 0: 13255.4. Samples: 40642046. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 14:57:26,105][213445] Avg episode reward: [(0, '4483.336')] [2023-03-07 14:57:26,637][213771] Updated weights for policy 0, policy_version 39700 (0.0006) [2023-03-07 14:57:27,424][213771] Updated weights for policy 0, policy_version 39710 (0.0005) [2023-03-07 14:57:28,193][213771] Updated weights for policy 0, policy_version 39720 (0.0006) [2023-03-07 14:57:28,973][213771] Updated weights for policy 0, policy_version 39730 (0.0007) [2023-03-07 14:57:29,753][213771] Updated weights for policy 0, policy_version 39740 (0.0006) [2023-03-07 14:57:30,531][213771] Updated weights for policy 0, policy_version 39750 (0.0006) [2023-03-07 14:57:31,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 40711168. Throughput: 0: 13251.6. Samples: 40681715. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:57:31,106][213445] Avg episode reward: [(0, '4505.591')] [2023-03-07 14:57:31,309][213771] Updated weights for policy 0, policy_version 39760 (0.0006) [2023-03-07 14:57:32,081][213771] Updated weights for policy 0, policy_version 39770 (0.0005) [2023-03-07 14:57:32,857][213771] Updated weights for policy 0, policy_version 39780 (0.0006) [2023-03-07 14:57:33,617][213771] Updated weights for policy 0, policy_version 39790 (0.0005) [2023-03-07 14:57:34,381][213771] Updated weights for policy 0, policy_version 39800 (0.0006) [2023-03-07 14:57:35,182][213771] Updated weights for policy 0, policy_version 39810 (0.0006) [2023-03-07 14:57:35,934][213771] Updated weights for policy 0, policy_version 39820 (0.0006) [2023-03-07 14:57:36,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 40777728. Throughput: 0: 13248.6. Samples: 40760925. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:57:36,106][213445] Avg episode reward: [(0, '4472.125')] [2023-03-07 14:57:36,726][213771] Updated weights for policy 0, policy_version 39830 (0.0006) [2023-03-07 14:57:37,491][213771] Updated weights for policy 0, policy_version 39840 (0.0006) [2023-03-07 14:57:38,265][213771] Updated weights for policy 0, policy_version 39850 (0.0006) [2023-03-07 14:57:39,033][213771] Updated weights for policy 0, policy_version 39860 (0.0007) [2023-03-07 14:57:39,806][213771] Updated weights for policy 0, policy_version 39870 (0.0006) [2023-03-07 14:57:40,583][213771] Updated weights for policy 0, policy_version 39880 (0.0006) [2023-03-07 14:57:41,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13243.8, 300 sec: 13242.6). Total num frames: 40843264. Throughput: 0: 13243.5. Samples: 40840325. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:57:41,106][213445] Avg episode reward: [(0, '4520.903')] [2023-03-07 14:57:41,341][213771] Updated weights for policy 0, policy_version 39890 (0.0006) [2023-03-07 14:57:42,149][213771] Updated weights for policy 0, policy_version 39900 (0.0006) [2023-03-07 14:57:42,920][213771] Updated weights for policy 0, policy_version 39910 (0.0006) [2023-03-07 14:57:43,690][213771] Updated weights for policy 0, policy_version 39920 (0.0006) [2023-03-07 14:57:44,476][213771] Updated weights for policy 0, policy_version 39930 (0.0006) [2023-03-07 14:57:45,239][213771] Updated weights for policy 0, policy_version 39940 (0.0006) [2023-03-07 14:57:46,017][213771] Updated weights for policy 0, policy_version 39950 (0.0006) [2023-03-07 14:57:46,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 40909824. Throughput: 0: 13239.0. Samples: 40880072. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:57:46,106][213445] Avg episode reward: [(0, '4446.540')] [2023-03-07 14:57:46,796][213771] Updated weights for policy 0, policy_version 39960 (0.0007) [2023-03-07 14:57:47,556][213771] Updated weights for policy 0, policy_version 39970 (0.0006) [2023-03-07 14:57:48,333][213771] Updated weights for policy 0, policy_version 39980 (0.0006) [2023-03-07 14:57:49,090][213771] Updated weights for policy 0, policy_version 39990 (0.0006) [2023-03-07 14:57:49,862][213771] Updated weights for policy 0, policy_version 40000 (0.0006) [2023-03-07 14:57:50,641][213771] Updated weights for policy 0, policy_version 40010 (0.0006) [2023-03-07 14:57:51,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 40975360. Throughput: 0: 13227.7. Samples: 40959632. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:57:51,106][213445] Avg episode reward: [(0, '4445.518')] [2023-03-07 14:57:51,423][213771] Updated weights for policy 0, policy_version 40020 (0.0006) [2023-03-07 14:57:52,222][213771] Updated weights for policy 0, policy_version 40030 (0.0006) [2023-03-07 14:57:52,990][213771] Updated weights for policy 0, policy_version 40040 (0.0007) [2023-03-07 14:57:53,760][213771] Updated weights for policy 0, policy_version 40050 (0.0006) [2023-03-07 14:57:54,543][213771] Updated weights for policy 0, policy_version 40060 (0.0006) [2023-03-07 14:57:55,289][213771] Updated weights for policy 0, policy_version 40070 (0.0006) [2023-03-07 14:57:56,065][213771] Updated weights for policy 0, policy_version 40080 (0.0006) [2023-03-07 14:57:56,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 41041920. Throughput: 0: 13230.5. Samples: 41039038. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:57:56,106][213445] Avg episode reward: [(0, '4458.082')] [2023-03-07 14:57:56,836][213771] Updated weights for policy 0, policy_version 40090 (0.0006) [2023-03-07 14:57:57,602][213771] Updated weights for policy 0, policy_version 40100 (0.0007) [2023-03-07 14:57:58,383][213771] Updated weights for policy 0, policy_version 40110 (0.0006) [2023-03-07 14:57:59,137][213771] Updated weights for policy 0, policy_version 40120 (0.0006) [2023-03-07 14:57:59,918][213771] Updated weights for policy 0, policy_version 40130 (0.0006) [2023-03-07 14:58:00,680][213771] Updated weights for policy 0, policy_version 40140 (0.0007) [2023-03-07 14:58:01,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 41108480. Throughput: 0: 13233.0. Samples: 41078829. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:58:01,106][213445] Avg episode reward: [(0, '4481.761')] [2023-03-07 14:58:01,465][213771] Updated weights for policy 0, policy_version 40150 (0.0006) [2023-03-07 14:58:02,223][213771] Updated weights for policy 0, policy_version 40160 (0.0005) [2023-03-07 14:58:02,991][213771] Updated weights for policy 0, policy_version 40170 (0.0006) [2023-03-07 14:58:03,778][213771] Updated weights for policy 0, policy_version 40180 (0.0007) [2023-03-07 14:58:04,552][213771] Updated weights for policy 0, policy_version 40190 (0.0006) [2023-03-07 14:58:05,314][213771] Updated weights for policy 0, policy_version 40200 (0.0006) [2023-03-07 14:58:06,082][213771] Updated weights for policy 0, policy_version 40210 (0.0005) [2023-03-07 14:58:06,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13243.7, 300 sec: 13246.1). Total num frames: 41175040. Throughput: 0: 13241.3. Samples: 41158434. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:58:06,106][213445] Avg episode reward: [(0, '4478.636')] [2023-03-07 14:58:06,113][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000040210_41175040.pth... [2023-03-07 14:58:06,144][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000037105_37995520.pth [2023-03-07 14:58:06,865][213771] Updated weights for policy 0, policy_version 40220 (0.0006) [2023-03-07 14:58:07,627][213771] Updated weights for policy 0, policy_version 40230 (0.0006) [2023-03-07 14:58:08,408][213771] Updated weights for policy 0, policy_version 40240 (0.0006) [2023-03-07 14:58:09,169][213771] Updated weights for policy 0, policy_version 40250 (0.0006) [2023-03-07 14:58:09,943][213771] Updated weights for policy 0, policy_version 40260 (0.0006) [2023-03-07 14:58:10,732][213771] Updated weights for policy 0, policy_version 40270 (0.0006) [2023-03-07 14:58:11,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13242.6). Total num frames: 41240576. Throughput: 0: 13239.2. Samples: 41237808. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:58:11,106][213445] Avg episode reward: [(0, '4452.220')] [2023-03-07 14:58:11,501][213771] Updated weights for policy 0, policy_version 40280 (0.0005) [2023-03-07 14:58:12,265][213771] Updated weights for policy 0, policy_version 40290 (0.0006) [2023-03-07 14:58:13,042][213771] Updated weights for policy 0, policy_version 40300 (0.0006) [2023-03-07 14:58:13,812][213771] Updated weights for policy 0, policy_version 40310 (0.0006) [2023-03-07 14:58:14,582][213771] Updated weights for policy 0, policy_version 40320 (0.0007) [2023-03-07 14:58:15,352][213771] Updated weights for policy 0, policy_version 40330 (0.0006) [2023-03-07 14:58:16,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13246.1). Total num frames: 41307136. Throughput: 0: 13245.4. Samples: 41277758. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 14:58:16,106][213445] Avg episode reward: [(0, '4447.898')] [2023-03-07 14:58:16,121][213771] Updated weights for policy 0, policy_version 40340 (0.0006) [2023-03-07 14:58:16,898][213771] Updated weights for policy 0, policy_version 40350 (0.0006) [2023-03-07 14:58:17,674][213771] Updated weights for policy 0, policy_version 40360 (0.0007) [2023-03-07 14:58:18,460][213771] Updated weights for policy 0, policy_version 40370 (0.0008) [2023-03-07 14:58:19,243][213771] Updated weights for policy 0, policy_version 40380 (0.0006) [2023-03-07 14:58:19,997][213771] Updated weights for policy 0, policy_version 40390 (0.0006) [2023-03-07 14:58:20,777][213771] Updated weights for policy 0, policy_version 40400 (0.0007) [2023-03-07 14:58:21,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13243.8, 300 sec: 13246.1). Total num frames: 41373696. Throughput: 0: 13245.8. Samples: 41356984. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:58:21,105][213445] Avg episode reward: [(0, '4498.774')] [2023-03-07 14:58:21,545][213771] Updated weights for policy 0, policy_version 40410 (0.0006) [2023-03-07 14:58:22,302][213771] Updated weights for policy 0, policy_version 40420 (0.0006) [2023-03-07 14:58:23,093][213771] Updated weights for policy 0, policy_version 40430 (0.0006) [2023-03-07 14:58:23,852][213771] Updated weights for policy 0, policy_version 40440 (0.0006) [2023-03-07 14:58:24,620][213771] Updated weights for policy 0, policy_version 40450 (0.0006) [2023-03-07 14:58:25,390][213771] Updated weights for policy 0, policy_version 40460 (0.0006) [2023-03-07 14:58:26,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 41440256. Throughput: 0: 13257.2. Samples: 41436900. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:58:26,105][213445] Avg episode reward: [(0, '4475.433')] [2023-03-07 14:58:26,152][213771] Updated weights for policy 0, policy_version 40470 (0.0006) [2023-03-07 14:58:26,940][213771] Updated weights for policy 0, policy_version 40480 (0.0007) [2023-03-07 14:58:27,704][213771] Updated weights for policy 0, policy_version 40490 (0.0005) [2023-03-07 14:58:28,491][213771] Updated weights for policy 0, policy_version 40500 (0.0006) [2023-03-07 14:58:29,258][213771] Updated weights for policy 0, policy_version 40510 (0.0005) [2023-03-07 14:58:30,021][213771] Updated weights for policy 0, policy_version 40520 (0.0006) [2023-03-07 14:58:30,804][213771] Updated weights for policy 0, policy_version 40530 (0.0006) [2023-03-07 14:58:31,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.8, 300 sec: 13246.0). Total num frames: 41505792. Throughput: 0: 13256.4. Samples: 41476611. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:58:31,116][213445] Avg episode reward: [(0, '4392.218')] [2023-03-07 14:58:31,578][213771] Updated weights for policy 0, policy_version 40540 (0.0006) [2023-03-07 14:58:32,370][213771] Updated weights for policy 0, policy_version 40550 (0.0005) [2023-03-07 14:58:33,149][213771] Updated weights for policy 0, policy_version 40560 (0.0007) [2023-03-07 14:58:33,930][213771] Updated weights for policy 0, policy_version 40570 (0.0006) [2023-03-07 14:58:34,697][213771] Updated weights for policy 0, policy_version 40580 (0.0006) [2023-03-07 14:58:35,484][213771] Updated weights for policy 0, policy_version 40590 (0.0005) [2023-03-07 14:58:36,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.8, 300 sec: 13246.1). Total num frames: 41572352. Throughput: 0: 13241.1. Samples: 41555482. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:58:36,116][213445] Avg episode reward: [(0, '4466.404')] [2023-03-07 14:58:36,247][213771] Updated weights for policy 0, policy_version 40600 (0.0007) [2023-03-07 14:58:37,034][213771] Updated weights for policy 0, policy_version 40610 (0.0005) [2023-03-07 14:58:37,801][213771] Updated weights for policy 0, policy_version 40620 (0.0006) [2023-03-07 14:58:38,575][213771] Updated weights for policy 0, policy_version 40630 (0.0006) [2023-03-07 14:58:39,353][213771] Updated weights for policy 0, policy_version 40640 (0.0006) [2023-03-07 14:58:40,134][213771] Updated weights for policy 0, policy_version 40650 (0.0006) [2023-03-07 14:58:40,888][213771] Updated weights for policy 0, policy_version 40660 (0.0006) [2023-03-07 14:58:41,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 41637888. Throughput: 0: 13248.3. Samples: 41635214. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:58:41,116][213445] Avg episode reward: [(0, '4212.719')] [2023-03-07 14:58:41,672][213771] Updated weights for policy 0, policy_version 40670 (0.0006) [2023-03-07 14:58:42,433][213771] Updated weights for policy 0, policy_version 40680 (0.0006) [2023-03-07 14:58:43,209][213771] Updated weights for policy 0, policy_version 40690 (0.0006) [2023-03-07 14:58:43,983][213771] Updated weights for policy 0, policy_version 40700 (0.0006) [2023-03-07 14:58:44,753][213771] Updated weights for policy 0, policy_version 40710 (0.0006) [2023-03-07 14:58:45,526][213771] Updated weights for policy 0, policy_version 40720 (0.0006) [2023-03-07 14:58:46,105][213445] Fps is (10 sec: 13209.3, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 41704448. Throughput: 0: 13250.2. Samples: 41675091. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:58:46,116][213445] Avg episode reward: [(0, '4418.231')] [2023-03-07 14:58:46,300][213771] Updated weights for policy 0, policy_version 40730 (0.0006) [2023-03-07 14:58:47,078][213771] Updated weights for policy 0, policy_version 40740 (0.0006) [2023-03-07 14:58:47,864][213771] Updated weights for policy 0, policy_version 40750 (0.0006) [2023-03-07 14:58:48,637][213771] Updated weights for policy 0, policy_version 40760 (0.0006) [2023-03-07 14:58:49,408][213771] Updated weights for policy 0, policy_version 40770 (0.0006) [2023-03-07 14:58:50,193][213771] Updated weights for policy 0, policy_version 40780 (0.0006) [2023-03-07 14:58:50,945][213771] Updated weights for policy 0, policy_version 40790 (0.0006) [2023-03-07 14:58:51,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 41771008. Throughput: 0: 13236.0. Samples: 41754054. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:58:51,106][213445] Avg episode reward: [(0, '4459.718')] [2023-03-07 14:58:51,718][213771] Updated weights for policy 0, policy_version 40800 (0.0006) [2023-03-07 14:58:52,499][213771] Updated weights for policy 0, policy_version 40810 (0.0006) [2023-03-07 14:58:53,262][213771] Updated weights for policy 0, policy_version 40820 (0.0006) [2023-03-07 14:58:54,043][213771] Updated weights for policy 0, policy_version 40830 (0.0007) [2023-03-07 14:58:54,816][213771] Updated weights for policy 0, policy_version 40840 (0.0006) [2023-03-07 14:58:55,588][213771] Updated weights for policy 0, policy_version 40850 (0.0006) [2023-03-07 14:58:56,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 41836544. Throughput: 0: 13240.6. Samples: 41833637. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:58:56,106][213445] Avg episode reward: [(0, '4428.100')] [2023-03-07 14:58:56,374][213771] Updated weights for policy 0, policy_version 40860 (0.0006) [2023-03-07 14:58:57,151][213771] Updated weights for policy 0, policy_version 40870 (0.0006) [2023-03-07 14:58:57,916][213771] Updated weights for policy 0, policy_version 40880 (0.0006) [2023-03-07 14:58:58,690][213771] Updated weights for policy 0, policy_version 40890 (0.0005) [2023-03-07 14:58:59,473][213771] Updated weights for policy 0, policy_version 40900 (0.0006) [2023-03-07 14:59:00,238][213771] Updated weights for policy 0, policy_version 40910 (0.0007) [2023-03-07 14:59:01,015][213771] Updated weights for policy 0, policy_version 40920 (0.0005) [2023-03-07 14:59:01,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13246.1). Total num frames: 41903104. Throughput: 0: 13236.9. Samples: 41873418. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:59:01,105][213445] Avg episode reward: [(0, '4484.746')] [2023-03-07 14:59:01,798][213771] Updated weights for policy 0, policy_version 40930 (0.0006) [2023-03-07 14:59:02,562][213771] Updated weights for policy 0, policy_version 40940 (0.0006) [2023-03-07 14:59:03,327][213771] Updated weights for policy 0, policy_version 40950 (0.0006) [2023-03-07 14:59:04,119][213771] Updated weights for policy 0, policy_version 40960 (0.0006) [2023-03-07 14:59:04,902][213771] Updated weights for policy 0, policy_version 40970 (0.0006) [2023-03-07 14:59:05,649][213771] Updated weights for policy 0, policy_version 40980 (0.0006) [2023-03-07 14:59:06,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13242.6). Total num frames: 41968640. Throughput: 0: 13234.5. Samples: 41952536. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:59:06,105][213445] Avg episode reward: [(0, '4447.224')] [2023-03-07 14:59:06,448][213771] Updated weights for policy 0, policy_version 40990 (0.0007) [2023-03-07 14:59:07,226][213771] Updated weights for policy 0, policy_version 41000 (0.0006) [2023-03-07 14:59:07,985][213771] Updated weights for policy 0, policy_version 41010 (0.0006) [2023-03-07 14:59:08,754][213771] Updated weights for policy 0, policy_version 41020 (0.0005) [2023-03-07 14:59:09,526][213771] Updated weights for policy 0, policy_version 41030 (0.0006) [2023-03-07 14:59:10,294][213771] Updated weights for policy 0, policy_version 41040 (0.0007) [2023-03-07 14:59:11,059][213771] Updated weights for policy 0, policy_version 41050 (0.0005) [2023-03-07 14:59:11,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 42035200. Throughput: 0: 13229.4. Samples: 42032225. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:59:11,105][213445] Avg episode reward: [(0, '4500.565')] [2023-03-07 14:59:11,829][213771] Updated weights for policy 0, policy_version 41060 (0.0007) [2023-03-07 14:59:12,612][213771] Updated weights for policy 0, policy_version 41070 (0.0006) [2023-03-07 14:59:13,375][213771] Updated weights for policy 0, policy_version 41080 (0.0006) [2023-03-07 14:59:14,141][213771] Updated weights for policy 0, policy_version 41090 (0.0006) [2023-03-07 14:59:14,929][213771] Updated weights for policy 0, policy_version 41100 (0.0005) [2023-03-07 14:59:15,694][213771] Updated weights for policy 0, policy_version 41110 (0.0006) [2023-03-07 14:59:16,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 42101760. Throughput: 0: 13230.4. Samples: 42071980. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:59:16,106][213445] Avg episode reward: [(0, '4474.375')] [2023-03-07 14:59:16,459][213771] Updated weights for policy 0, policy_version 41120 (0.0006) [2023-03-07 14:59:17,227][213771] Updated weights for policy 0, policy_version 41130 (0.0006) [2023-03-07 14:59:18,002][213771] Updated weights for policy 0, policy_version 41140 (0.0006) [2023-03-07 14:59:18,777][213771] Updated weights for policy 0, policy_version 41150 (0.0006) [2023-03-07 14:59:19,527][213771] Updated weights for policy 0, policy_version 41160 (0.0006) [2023-03-07 14:59:20,322][213771] Updated weights for policy 0, policy_version 41170 (0.0006) [2023-03-07 14:59:21,091][213771] Updated weights for policy 0, policy_version 41180 (0.0006) [2023-03-07 14:59:21,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 42168320. Throughput: 0: 13254.1. Samples: 42151919. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:59:21,106][213445] Avg episode reward: [(0, '4449.450')] [2023-03-07 14:59:21,859][213771] Updated weights for policy 0, policy_version 41190 (0.0006) [2023-03-07 14:59:22,617][213771] Updated weights for policy 0, policy_version 41200 (0.0006) [2023-03-07 14:59:23,386][213771] Updated weights for policy 0, policy_version 41210 (0.0007) [2023-03-07 14:59:24,140][213771] Updated weights for policy 0, policy_version 41220 (0.0005) [2023-03-07 14:59:24,929][213771] Updated weights for policy 0, policy_version 41230 (0.0006) [2023-03-07 14:59:25,698][213771] Updated weights for policy 0, policy_version 41240 (0.0006) [2023-03-07 14:59:26,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 42234880. Throughput: 0: 13251.2. Samples: 42231517. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:59:26,106][213445] Avg episode reward: [(0, '4548.086')] [2023-03-07 14:59:26,478][213771] Updated weights for policy 0, policy_version 41250 (0.0005) [2023-03-07 14:59:27,241][213771] Updated weights for policy 0, policy_version 41260 (0.0007) [2023-03-07 14:59:28,021][213771] Updated weights for policy 0, policy_version 41270 (0.0006) [2023-03-07 14:59:28,777][213771] Updated weights for policy 0, policy_version 41280 (0.0006) [2023-03-07 14:59:29,563][213771] Updated weights for policy 0, policy_version 41290 (0.0006) [2023-03-07 14:59:30,334][213771] Updated weights for policy 0, policy_version 41300 (0.0006) [2023-03-07 14:59:31,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 42300416. Throughput: 0: 13252.2. Samples: 42271439. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:59:31,106][213445] Avg episode reward: [(0, '4475.927')] [2023-03-07 14:59:31,124][213771] Updated weights for policy 0, policy_version 41310 (0.0007) [2023-03-07 14:59:31,885][213771] Updated weights for policy 0, policy_version 41320 (0.0005) [2023-03-07 14:59:32,672][213771] Updated weights for policy 0, policy_version 41330 (0.0005) [2023-03-07 14:59:33,441][213771] Updated weights for policy 0, policy_version 41340 (0.0006) [2023-03-07 14:59:34,194][213771] Updated weights for policy 0, policy_version 41350 (0.0007) [2023-03-07 14:59:34,982][213771] Updated weights for policy 0, policy_version 41360 (0.0006) [2023-03-07 14:59:35,751][213771] Updated weights for policy 0, policy_version 41370 (0.0005) [2023-03-07 14:59:36,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 42366976. Throughput: 0: 13260.0. Samples: 42350757. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:59:36,106][213445] Avg episode reward: [(0, '4282.112')] [2023-03-07 14:59:36,531][213771] Updated weights for policy 0, policy_version 41380 (0.0006) [2023-03-07 14:59:37,301][213771] Updated weights for policy 0, policy_version 41390 (0.0005) [2023-03-07 14:59:38,073][213771] Updated weights for policy 0, policy_version 41400 (0.0007) [2023-03-07 14:59:38,825][213771] Updated weights for policy 0, policy_version 41410 (0.0006) [2023-03-07 14:59:39,606][213771] Updated weights for policy 0, policy_version 41420 (0.0006) [2023-03-07 14:59:40,369][213771] Updated weights for policy 0, policy_version 41430 (0.0007) [2023-03-07 14:59:41,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13260.8, 300 sec: 13246.1). Total num frames: 42433536. Throughput: 0: 13263.1. Samples: 42430477. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:59:41,105][213445] Avg episode reward: [(0, '4369.054')] [2023-03-07 14:59:41,135][213771] Updated weights for policy 0, policy_version 41440 (0.0006) [2023-03-07 14:59:41,915][213771] Updated weights for policy 0, policy_version 41450 (0.0006) [2023-03-07 14:59:42,674][213771] Updated weights for policy 0, policy_version 41460 (0.0005) [2023-03-07 14:59:43,440][213771] Updated weights for policy 0, policy_version 41470 (0.0006) [2023-03-07 14:59:44,229][213771] Updated weights for policy 0, policy_version 41480 (0.0006) [2023-03-07 14:59:45,013][213771] Updated weights for policy 0, policy_version 41490 (0.0006) [2023-03-07 14:59:45,769][213771] Updated weights for policy 0, policy_version 41500 (0.0006) [2023-03-07 14:59:46,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 42500096. Throughput: 0: 13266.2. Samples: 42470396. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:59:46,106][213445] Avg episode reward: [(0, '4412.704')] [2023-03-07 14:59:46,556][213771] Updated weights for policy 0, policy_version 41510 (0.0006) [2023-03-07 14:59:47,321][213771] Updated weights for policy 0, policy_version 41520 (0.0006) [2023-03-07 14:59:48,098][213771] Updated weights for policy 0, policy_version 41530 (0.0006) [2023-03-07 14:59:48,854][213771] Updated weights for policy 0, policy_version 41540 (0.0006) [2023-03-07 14:59:49,638][213771] Updated weights for policy 0, policy_version 41550 (0.0006) [2023-03-07 14:59:50,416][213771] Updated weights for policy 0, policy_version 41560 (0.0005) [2023-03-07 14:59:51,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 42565632. Throughput: 0: 13272.6. Samples: 42549804. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:59:51,106][213445] Avg episode reward: [(0, '4475.612')] [2023-03-07 14:59:51,191][213771] Updated weights for policy 0, policy_version 41570 (0.0006) [2023-03-07 14:59:51,959][213771] Updated weights for policy 0, policy_version 41580 (0.0006) [2023-03-07 14:59:52,721][213771] Updated weights for policy 0, policy_version 41590 (0.0007) [2023-03-07 14:59:53,506][213771] Updated weights for policy 0, policy_version 41600 (0.0006) [2023-03-07 14:59:54,260][213771] Updated weights for policy 0, policy_version 41610 (0.0007) [2023-03-07 14:59:55,030][213771] Updated weights for policy 0, policy_version 41620 (0.0006) [2023-03-07 14:59:55,798][213771] Updated weights for policy 0, policy_version 41630 (0.0006) [2023-03-07 14:59:56,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13277.9, 300 sec: 13249.5). Total num frames: 42633216. Throughput: 0: 13272.2. Samples: 42629474. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 14:59:56,106][213445] Avg episode reward: [(0, '4463.897')] [2023-03-07 14:59:56,586][213771] Updated weights for policy 0, policy_version 41640 (0.0006) [2023-03-07 14:59:57,359][213771] Updated weights for policy 0, policy_version 41650 (0.0006) [2023-03-07 14:59:58,142][213771] Updated weights for policy 0, policy_version 41660 (0.0006) [2023-03-07 14:59:58,921][213771] Updated weights for policy 0, policy_version 41670 (0.0007) [2023-03-07 14:59:59,692][213771] Updated weights for policy 0, policy_version 41680 (0.0006) [2023-03-07 15:00:00,476][213771] Updated weights for policy 0, policy_version 41690 (0.0007) [2023-03-07 15:00:01,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 42698752. Throughput: 0: 13267.3. Samples: 42669009. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:00:01,106][213445] Avg episode reward: [(0, '4479.168')] [2023-03-07 15:00:01,237][213771] Updated weights for policy 0, policy_version 41700 (0.0006) [2023-03-07 15:00:02,011][213771] Updated weights for policy 0, policy_version 41710 (0.0006) [2023-03-07 15:00:02,776][213771] Updated weights for policy 0, policy_version 41720 (0.0006) [2023-03-07 15:00:03,558][213771] Updated weights for policy 0, policy_version 41730 (0.0008) [2023-03-07 15:00:04,334][213771] Updated weights for policy 0, policy_version 41740 (0.0006) [2023-03-07 15:00:05,119][213771] Updated weights for policy 0, policy_version 41750 (0.0007) [2023-03-07 15:00:05,884][213771] Updated weights for policy 0, policy_version 41760 (0.0007) [2023-03-07 15:00:06,105][213445] Fps is (10 sec: 13107.1, 60 sec: 13260.8, 300 sec: 13246.0). Total num frames: 42764288. Throughput: 0: 13254.4. Samples: 42748368. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:00:06,106][213445] Avg episode reward: [(0, '4521.578')] [2023-03-07 15:00:06,122][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000041763_42765312.pth... [2023-03-07 15:00:06,152][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000038658_39585792.pth [2023-03-07 15:00:06,670][213771] Updated weights for policy 0, policy_version 41770 (0.0007) [2023-03-07 15:00:07,454][213771] Updated weights for policy 0, policy_version 41780 (0.0006) [2023-03-07 15:00:08,209][213771] Updated weights for policy 0, policy_version 41790 (0.0005) [2023-03-07 15:00:08,997][213771] Updated weights for policy 0, policy_version 41800 (0.0006) [2023-03-07 15:00:09,760][213771] Updated weights for policy 0, policy_version 41810 (0.0007) [2023-03-07 15:00:10,541][213771] Updated weights for policy 0, policy_version 41820 (0.0006) [2023-03-07 15:00:11,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13246.1). Total num frames: 42830848. Throughput: 0: 13245.6. Samples: 42827567. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:00:11,106][213445] Avg episode reward: [(0, '4438.522')] [2023-03-07 15:00:11,326][213771] Updated weights for policy 0, policy_version 41830 (0.0007) [2023-03-07 15:00:12,092][213771] Updated weights for policy 0, policy_version 41840 (0.0006) [2023-03-07 15:00:12,863][213771] Updated weights for policy 0, policy_version 41850 (0.0006) [2023-03-07 15:00:13,634][213771] Updated weights for policy 0, policy_version 41860 (0.0006) [2023-03-07 15:00:14,415][213771] Updated weights for policy 0, policy_version 41870 (0.0006) [2023-03-07 15:00:15,183][213771] Updated weights for policy 0, policy_version 41880 (0.0005) [2023-03-07 15:00:15,961][213771] Updated weights for policy 0, policy_version 41890 (0.0006) [2023-03-07 15:00:16,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 42897408. Throughput: 0: 13239.9. Samples: 42867234. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:00:16,106][213445] Avg episode reward: [(0, '4468.285')] [2023-03-07 15:00:16,738][213771] Updated weights for policy 0, policy_version 41900 (0.0006) [2023-03-07 15:00:17,519][213771] Updated weights for policy 0, policy_version 41910 (0.0006) [2023-03-07 15:00:18,281][213771] Updated weights for policy 0, policy_version 41920 (0.0008) [2023-03-07 15:00:19,051][213771] Updated weights for policy 0, policy_version 41930 (0.0006) [2023-03-07 15:00:19,827][213771] Updated weights for policy 0, policy_version 41940 (0.0006) [2023-03-07 15:00:20,602][213771] Updated weights for policy 0, policy_version 41950 (0.0006) [2023-03-07 15:00:21,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13246.1). Total num frames: 42962944. Throughput: 0: 13245.1. Samples: 42946783. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:00:21,106][213445] Avg episode reward: [(0, '4456.208')] [2023-03-07 15:00:21,357][213771] Updated weights for policy 0, policy_version 41960 (0.0006) [2023-03-07 15:00:22,151][213771] Updated weights for policy 0, policy_version 41970 (0.0006) [2023-03-07 15:00:22,911][213771] Updated weights for policy 0, policy_version 41980 (0.0006) [2023-03-07 15:00:23,674][213771] Updated weights for policy 0, policy_version 41990 (0.0006) [2023-03-07 15:00:24,435][213771] Updated weights for policy 0, policy_version 42000 (0.0005) [2023-03-07 15:00:25,210][213771] Updated weights for policy 0, policy_version 42010 (0.0005) [2023-03-07 15:00:25,982][213771] Updated weights for policy 0, policy_version 42020 (0.0006) [2023-03-07 15:00:26,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 43029504. Throughput: 0: 13243.5. Samples: 43026434. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:00:26,106][213445] Avg episode reward: [(0, '4367.261')] [2023-03-07 15:00:26,755][213771] Updated weights for policy 0, policy_version 42030 (0.0006) [2023-03-07 15:00:27,541][213771] Updated weights for policy 0, policy_version 42040 (0.0006) [2023-03-07 15:00:28,313][213771] Updated weights for policy 0, policy_version 42050 (0.0006) [2023-03-07 15:00:29,094][213771] Updated weights for policy 0, policy_version 42060 (0.0005) [2023-03-07 15:00:29,873][213771] Updated weights for policy 0, policy_version 42070 (0.0007) [2023-03-07 15:00:30,644][213771] Updated weights for policy 0, policy_version 42080 (0.0006) [2023-03-07 15:00:31,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 43096064. Throughput: 0: 13232.3. Samples: 43065849. Policy #0 lag: (min: 0.0, avg: 1.3, max: 4.0) [2023-03-07 15:00:31,106][213445] Avg episode reward: [(0, '4429.866')] [2023-03-07 15:00:31,400][213771] Updated weights for policy 0, policy_version 42090 (0.0006) [2023-03-07 15:00:32,196][213771] Updated weights for policy 0, policy_version 42100 (0.0006) [2023-03-07 15:00:32,963][213771] Updated weights for policy 0, policy_version 42110 (0.0006) [2023-03-07 15:00:33,755][213771] Updated weights for policy 0, policy_version 42120 (0.0007) [2023-03-07 15:00:34,522][213771] Updated weights for policy 0, policy_version 42130 (0.0006) [2023-03-07 15:00:35,282][213771] Updated weights for policy 0, policy_version 42140 (0.0005) [2023-03-07 15:00:36,058][213771] Updated weights for policy 0, policy_version 42150 (0.0006) [2023-03-07 15:00:36,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 43161600. Throughput: 0: 13233.4. Samples: 43145308. Policy #0 lag: (min: 0.0, avg: 1.3, max: 4.0) [2023-03-07 15:00:36,106][213445] Avg episode reward: [(0, '4417.598')] [2023-03-07 15:00:36,830][213771] Updated weights for policy 0, policy_version 42160 (0.0005) [2023-03-07 15:00:37,595][213771] Updated weights for policy 0, policy_version 42170 (0.0006) [2023-03-07 15:00:38,371][213771] Updated weights for policy 0, policy_version 42180 (0.0006) [2023-03-07 15:00:39,149][213771] Updated weights for policy 0, policy_version 42190 (0.0005) [2023-03-07 15:00:39,911][213771] Updated weights for policy 0, policy_version 42200 (0.0006) [2023-03-07 15:00:40,680][213771] Updated weights for policy 0, policy_version 42210 (0.0005) [2023-03-07 15:00:41,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 43228160. Throughput: 0: 13234.5. Samples: 43225025. Policy #0 lag: (min: 0.0, avg: 1.3, max: 4.0) [2023-03-07 15:00:41,106][213445] Avg episode reward: [(0, '4437.883')] [2023-03-07 15:00:41,456][213771] Updated weights for policy 0, policy_version 42220 (0.0006) [2023-03-07 15:00:42,225][213771] Updated weights for policy 0, policy_version 42230 (0.0006) [2023-03-07 15:00:43,010][213771] Updated weights for policy 0, policy_version 42240 (0.0006) [2023-03-07 15:00:43,782][213771] Updated weights for policy 0, policy_version 42250 (0.0006) [2023-03-07 15:00:44,555][213771] Updated weights for policy 0, policy_version 42260 (0.0006) [2023-03-07 15:00:45,336][213771] Updated weights for policy 0, policy_version 42270 (0.0005) [2023-03-07 15:00:46,092][213771] Updated weights for policy 0, policy_version 42280 (0.0006) [2023-03-07 15:00:46,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 43294720. Throughput: 0: 13240.8. Samples: 43264843. Policy #0 lag: (min: 0.0, avg: 1.3, max: 4.0) [2023-03-07 15:00:46,106][213445] Avg episode reward: [(0, '4468.372')] [2023-03-07 15:00:46,873][213771] Updated weights for policy 0, policy_version 42290 (0.0006) [2023-03-07 15:00:47,666][213771] Updated weights for policy 0, policy_version 42300 (0.0007) [2023-03-07 15:00:48,441][213771] Updated weights for policy 0, policy_version 42310 (0.0006) [2023-03-07 15:00:49,207][213771] Updated weights for policy 0, policy_version 42320 (0.0005) [2023-03-07 15:00:49,997][213771] Updated weights for policy 0, policy_version 42330 (0.0006) [2023-03-07 15:00:50,764][213771] Updated weights for policy 0, policy_version 42340 (0.0006) [2023-03-07 15:00:51,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 43360256. Throughput: 0: 13233.6. Samples: 43343882. Policy #0 lag: (min: 0.0, avg: 1.3, max: 4.0) [2023-03-07 15:00:51,106][213445] Avg episode reward: [(0, '4399.735')] [2023-03-07 15:00:51,536][213771] Updated weights for policy 0, policy_version 42350 (0.0006) [2023-03-07 15:00:52,314][213771] Updated weights for policy 0, policy_version 42360 (0.0006) [2023-03-07 15:00:53,097][213771] Updated weights for policy 0, policy_version 42370 (0.0005) [2023-03-07 15:00:53,861][213771] Updated weights for policy 0, policy_version 42380 (0.0006) [2023-03-07 15:00:54,633][213771] Updated weights for policy 0, policy_version 42390 (0.0006) [2023-03-07 15:00:55,401][213771] Updated weights for policy 0, policy_version 42400 (0.0006) [2023-03-07 15:00:56,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13226.7, 300 sec: 13242.6). Total num frames: 43426816. Throughput: 0: 13238.5. Samples: 43423300. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:00:56,106][213445] Avg episode reward: [(0, '4428.882')] [2023-03-07 15:00:56,167][213771] Updated weights for policy 0, policy_version 42410 (0.0005) [2023-03-07 15:00:56,937][213771] Updated weights for policy 0, policy_version 42420 (0.0006) [2023-03-07 15:00:57,714][213771] Updated weights for policy 0, policy_version 42430 (0.0006) [2023-03-07 15:00:58,498][213771] Updated weights for policy 0, policy_version 42440 (0.0006) [2023-03-07 15:00:59,274][213771] Updated weights for policy 0, policy_version 42450 (0.0006) [2023-03-07 15:01:00,054][213771] Updated weights for policy 0, policy_version 42460 (0.0006) [2023-03-07 15:01:00,837][213771] Updated weights for policy 0, policy_version 42470 (0.0006) [2023-03-07 15:01:01,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13242.6). Total num frames: 43492352. Throughput: 0: 13237.1. Samples: 43462903. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:01:01,106][213445] Avg episode reward: [(0, '4423.562')] [2023-03-07 15:01:01,582][213771] Updated weights for policy 0, policy_version 42480 (0.0006) [2023-03-07 15:01:02,355][213771] Updated weights for policy 0, policy_version 42490 (0.0006) [2023-03-07 15:01:03,163][213771] Updated weights for policy 0, policy_version 42500 (0.0007) [2023-03-07 15:01:03,923][213771] Updated weights for policy 0, policy_version 42510 (0.0007) [2023-03-07 15:01:04,702][213771] Updated weights for policy 0, policy_version 42520 (0.0006) [2023-03-07 15:01:05,475][213771] Updated weights for policy 0, policy_version 42530 (0.0006) [2023-03-07 15:01:06,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 43558912. Throughput: 0: 13233.6. Samples: 43542294. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:01:06,106][213445] Avg episode reward: [(0, '4343.860')] [2023-03-07 15:01:06,242][213771] Updated weights for policy 0, policy_version 42540 (0.0006) [2023-03-07 15:01:07,005][213771] Updated weights for policy 0, policy_version 42550 (0.0006) [2023-03-07 15:01:07,785][213771] Updated weights for policy 0, policy_version 42560 (0.0006) [2023-03-07 15:01:08,554][213771] Updated weights for policy 0, policy_version 42570 (0.0006) [2023-03-07 15:01:09,342][213771] Updated weights for policy 0, policy_version 42580 (0.0006) [2023-03-07 15:01:10,113][213771] Updated weights for policy 0, policy_version 42590 (0.0006) [2023-03-07 15:01:10,889][213771] Updated weights for policy 0, policy_version 42600 (0.0006) [2023-03-07 15:01:11,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13242.6). Total num frames: 43624448. Throughput: 0: 13227.3. Samples: 43621661. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:01:11,105][213445] Avg episode reward: [(0, '4403.673')] [2023-03-07 15:01:11,656][213771] Updated weights for policy 0, policy_version 42610 (0.0008) [2023-03-07 15:01:12,446][213771] Updated weights for policy 0, policy_version 42620 (0.0006) [2023-03-07 15:01:13,216][213771] Updated weights for policy 0, policy_version 42630 (0.0006) [2023-03-07 15:01:13,977][213771] Updated weights for policy 0, policy_version 42640 (0.0006) [2023-03-07 15:01:14,752][213771] Updated weights for policy 0, policy_version 42650 (0.0006) [2023-03-07 15:01:15,526][213771] Updated weights for policy 0, policy_version 42660 (0.0006) [2023-03-07 15:01:16,105][213445] Fps is (10 sec: 13107.2, 60 sec: 13209.6, 300 sec: 13242.6). Total num frames: 43689984. Throughput: 0: 13231.8. Samples: 43661282. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:01:16,106][213445] Avg episode reward: [(0, '4368.520')] [2023-03-07 15:01:16,329][213771] Updated weights for policy 0, policy_version 42670 (0.0006) [2023-03-07 15:01:17,122][213771] Updated weights for policy 0, policy_version 42680 (0.0006) [2023-03-07 15:01:17,874][213771] Updated weights for policy 0, policy_version 42690 (0.0006) [2023-03-07 15:01:18,647][213771] Updated weights for policy 0, policy_version 42700 (0.0006) [2023-03-07 15:01:19,436][213771] Updated weights for policy 0, policy_version 42710 (0.0006) [2023-03-07 15:01:20,196][213771] Updated weights for policy 0, policy_version 42720 (0.0006) [2023-03-07 15:01:20,955][213771] Updated weights for policy 0, policy_version 42730 (0.0006) [2023-03-07 15:01:21,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13226.6, 300 sec: 13242.6). Total num frames: 43756544. Throughput: 0: 13224.3. Samples: 43740400. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:01:21,106][213445] Avg episode reward: [(0, '4380.116')] [2023-03-07 15:01:21,757][213771] Updated weights for policy 0, policy_version 42740 (0.0005) [2023-03-07 15:01:22,522][213771] Updated weights for policy 0, policy_version 42750 (0.0006) [2023-03-07 15:01:23,287][213771] Updated weights for policy 0, policy_version 42760 (0.0006) [2023-03-07 15:01:24,064][213771] Updated weights for policy 0, policy_version 42770 (0.0006) [2023-03-07 15:01:24,833][213771] Updated weights for policy 0, policy_version 42780 (0.0006) [2023-03-07 15:01:25,595][213771] Updated weights for policy 0, policy_version 42790 (0.0006) [2023-03-07 15:01:26,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13226.7, 300 sec: 13242.6). Total num frames: 43823104. Throughput: 0: 13222.4. Samples: 43820031. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:01:26,106][213445] Avg episode reward: [(0, '4361.095')] [2023-03-07 15:01:26,398][213771] Updated weights for policy 0, policy_version 42800 (0.0007) [2023-03-07 15:01:27,163][213771] Updated weights for policy 0, policy_version 42810 (0.0005) [2023-03-07 15:01:27,929][213771] Updated weights for policy 0, policy_version 42820 (0.0006) [2023-03-07 15:01:28,704][213771] Updated weights for policy 0, policy_version 42830 (0.0006) [2023-03-07 15:01:29,478][213771] Updated weights for policy 0, policy_version 42840 (0.0006) [2023-03-07 15:01:30,268][213771] Updated weights for policy 0, policy_version 42850 (0.0007) [2023-03-07 15:01:31,040][213771] Updated weights for policy 0, policy_version 42860 (0.0006) [2023-03-07 15:01:31,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13226.7, 300 sec: 13242.6). Total num frames: 43889664. Throughput: 0: 13220.2. Samples: 43859754. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:01:31,106][213445] Avg episode reward: [(0, '4423.454')] [2023-03-07 15:01:31,811][213771] Updated weights for policy 0, policy_version 42870 (0.0007) [2023-03-07 15:01:32,582][213771] Updated weights for policy 0, policy_version 42880 (0.0006) [2023-03-07 15:01:33,356][213771] Updated weights for policy 0, policy_version 42890 (0.0007) [2023-03-07 15:01:34,142][213771] Updated weights for policy 0, policy_version 42900 (0.0006) [2023-03-07 15:01:34,899][213771] Updated weights for policy 0, policy_version 42910 (0.0006) [2023-03-07 15:01:35,671][213771] Updated weights for policy 0, policy_version 42920 (0.0007) [2023-03-07 15:01:36,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13226.7, 300 sec: 13242.6). Total num frames: 43955200. Throughput: 0: 13228.7. Samples: 43939175. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:01:36,106][213445] Avg episode reward: [(0, '4427.388')] [2023-03-07 15:01:36,446][213771] Updated weights for policy 0, policy_version 42930 (0.0005) [2023-03-07 15:01:37,230][213771] Updated weights for policy 0, policy_version 42940 (0.0006) [2023-03-07 15:01:38,001][213771] Updated weights for policy 0, policy_version 42950 (0.0006) [2023-03-07 15:01:38,764][213771] Updated weights for policy 0, policy_version 42960 (0.0007) [2023-03-07 15:01:39,533][213771] Updated weights for policy 0, policy_version 42970 (0.0006) [2023-03-07 15:01:40,317][213771] Updated weights for policy 0, policy_version 42980 (0.0005) [2023-03-07 15:01:41,094][213771] Updated weights for policy 0, policy_version 42990 (0.0006) [2023-03-07 15:01:41,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13242.6). Total num frames: 44021760. Throughput: 0: 13219.9. Samples: 44018193. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:01:41,106][213445] Avg episode reward: [(0, '4413.984')] [2023-03-07 15:01:41,885][213771] Updated weights for policy 0, policy_version 43000 (0.0006) [2023-03-07 15:01:42,646][213771] Updated weights for policy 0, policy_version 43010 (0.0007) [2023-03-07 15:01:43,437][213771] Updated weights for policy 0, policy_version 43020 (0.0007) [2023-03-07 15:01:44,208][213771] Updated weights for policy 0, policy_version 43030 (0.0006) [2023-03-07 15:01:44,975][213771] Updated weights for policy 0, policy_version 43040 (0.0006) [2023-03-07 15:01:45,752][213771] Updated weights for policy 0, policy_version 43050 (0.0006) [2023-03-07 15:01:46,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13209.6, 300 sec: 13242.6). Total num frames: 44087296. Throughput: 0: 13221.8. Samples: 44057886. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:01:46,106][213445] Avg episode reward: [(0, '4431.693')] [2023-03-07 15:01:46,514][213771] Updated weights for policy 0, policy_version 43060 (0.0006) [2023-03-07 15:01:47,272][213771] Updated weights for policy 0, policy_version 43070 (0.0006) [2023-03-07 15:01:48,079][213771] Updated weights for policy 0, policy_version 43080 (0.0006) [2023-03-07 15:01:48,868][213771] Updated weights for policy 0, policy_version 43090 (0.0005) [2023-03-07 15:01:49,646][213771] Updated weights for policy 0, policy_version 43100 (0.0006) [2023-03-07 15:01:50,425][213771] Updated weights for policy 0, policy_version 43110 (0.0007) [2023-03-07 15:01:51,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13242.6). Total num frames: 44153856. Throughput: 0: 13217.1. Samples: 44137062. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:01:51,105][213445] Avg episode reward: [(0, '4422.672')] [2023-03-07 15:01:51,187][213771] Updated weights for policy 0, policy_version 43120 (0.0005) [2023-03-07 15:01:51,939][213771] Updated weights for policy 0, policy_version 43130 (0.0006) [2023-03-07 15:01:52,708][213771] Updated weights for policy 0, policy_version 43140 (0.0005) [2023-03-07 15:01:53,483][213771] Updated weights for policy 0, policy_version 43150 (0.0007) [2023-03-07 15:01:54,240][213771] Updated weights for policy 0, policy_version 43160 (0.0007) [2023-03-07 15:01:55,027][213771] Updated weights for policy 0, policy_version 43170 (0.0007) [2023-03-07 15:01:55,822][213771] Updated weights for policy 0, policy_version 43180 (0.0005) [2023-03-07 15:01:56,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13209.6, 300 sec: 13239.1). Total num frames: 44219392. Throughput: 0: 13221.0. Samples: 44216610. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:01:56,117][213445] Avg episode reward: [(0, '4407.211')] [2023-03-07 15:01:56,600][213771] Updated weights for policy 0, policy_version 43190 (0.0006) [2023-03-07 15:01:57,350][213771] Updated weights for policy 0, policy_version 43200 (0.0006) [2023-03-07 15:01:58,139][213771] Updated weights for policy 0, policy_version 43210 (0.0007) [2023-03-07 15:01:58,920][213771] Updated weights for policy 0, policy_version 43220 (0.0006) [2023-03-07 15:01:59,689][213771] Updated weights for policy 0, policy_version 43230 (0.0006) [2023-03-07 15:02:00,467][213771] Updated weights for policy 0, policy_version 43240 (0.0007) [2023-03-07 15:02:01,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13226.7, 300 sec: 13239.1). Total num frames: 44285952. Throughput: 0: 13217.5. Samples: 44256070. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:02:01,116][213445] Avg episode reward: [(0, '4349.403')] [2023-03-07 15:02:01,242][213771] Updated weights for policy 0, policy_version 43250 (0.0006) [2023-03-07 15:02:02,026][213771] Updated weights for policy 0, policy_version 43260 (0.0007) [2023-03-07 15:02:02,800][213771] Updated weights for policy 0, policy_version 43270 (0.0007) [2023-03-07 15:02:03,574][213771] Updated weights for policy 0, policy_version 43280 (0.0006) [2023-03-07 15:02:04,346][213771] Updated weights for policy 0, policy_version 43290 (0.0006) [2023-03-07 15:02:05,120][213771] Updated weights for policy 0, policy_version 43300 (0.0005) [2023-03-07 15:02:05,886][213771] Updated weights for policy 0, policy_version 43310 (0.0006) [2023-03-07 15:02:06,105][213445] Fps is (10 sec: 13312.3, 60 sec: 13226.7, 300 sec: 13239.1). Total num frames: 44352512. Throughput: 0: 13225.0. Samples: 44335526. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:02:06,116][213445] Avg episode reward: [(0, '4379.060')] [2023-03-07 15:02:06,122][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000043313_44352512.pth... [2023-03-07 15:02:06,155][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000040210_41175040.pth [2023-03-07 15:02:06,645][213771] Updated weights for policy 0, policy_version 43320 (0.0006) [2023-03-07 15:02:07,420][213771] Updated weights for policy 0, policy_version 43330 (0.0006) [2023-03-07 15:02:08,201][213771] Updated weights for policy 0, policy_version 43340 (0.0006) [2023-03-07 15:02:08,962][213771] Updated weights for policy 0, policy_version 43350 (0.0005) [2023-03-07 15:02:09,734][213771] Updated weights for policy 0, policy_version 43360 (0.0007) [2023-03-07 15:02:10,500][213771] Updated weights for policy 0, policy_version 43370 (0.0006) [2023-03-07 15:02:11,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13226.6, 300 sec: 13239.1). Total num frames: 44418048. Throughput: 0: 13222.7. Samples: 44415052. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:02:11,106][213445] Avg episode reward: [(0, '4437.337')] [2023-03-07 15:02:11,274][213771] Updated weights for policy 0, policy_version 43380 (0.0006) [2023-03-07 15:02:12,059][213771] Updated weights for policy 0, policy_version 43390 (0.0008) [2023-03-07 15:02:12,814][213771] Updated weights for policy 0, policy_version 43400 (0.0006) [2023-03-07 15:02:13,617][213771] Updated weights for policy 0, policy_version 43410 (0.0006) [2023-03-07 15:02:14,375][213771] Updated weights for policy 0, policy_version 43420 (0.0006) [2023-03-07 15:02:15,138][213771] Updated weights for policy 0, policy_version 43430 (0.0006) [2023-03-07 15:02:15,918][213771] Updated weights for policy 0, policy_version 43440 (0.0006) [2023-03-07 15:02:16,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13239.1). Total num frames: 44484608. Throughput: 0: 13226.6. Samples: 44454950. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:02:16,106][213445] Avg episode reward: [(0, '4401.942')] [2023-03-07 15:02:16,697][213771] Updated weights for policy 0, policy_version 43450 (0.0006) [2023-03-07 15:02:17,467][213771] Updated weights for policy 0, policy_version 43460 (0.0007) [2023-03-07 15:02:18,230][213771] Updated weights for policy 0, policy_version 43470 (0.0006) [2023-03-07 15:02:18,997][213771] Updated weights for policy 0, policy_version 43480 (0.0006) [2023-03-07 15:02:19,749][213771] Updated weights for policy 0, policy_version 43490 (0.0006) [2023-03-07 15:02:20,538][213771] Updated weights for policy 0, policy_version 43500 (0.0007) [2023-03-07 15:02:21,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 44551168. Throughput: 0: 13234.9. Samples: 44534743. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:02:21,106][213445] Avg episode reward: [(0, '4460.215')] [2023-03-07 15:02:21,295][213771] Updated weights for policy 0, policy_version 43510 (0.0006) [2023-03-07 15:02:22,070][213771] Updated weights for policy 0, policy_version 43520 (0.0006) [2023-03-07 15:02:22,837][213771] Updated weights for policy 0, policy_version 43530 (0.0006) [2023-03-07 15:02:23,597][213771] Updated weights for policy 0, policy_version 43540 (0.0006) [2023-03-07 15:02:24,385][213771] Updated weights for policy 0, policy_version 43550 (0.0006) [2023-03-07 15:02:25,133][213771] Updated weights for policy 0, policy_version 43560 (0.0007) [2023-03-07 15:02:25,911][213771] Updated weights for policy 0, policy_version 43570 (0.0005) [2023-03-07 15:02:26,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 44617728. Throughput: 0: 13251.9. Samples: 44614528. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:02:26,106][213445] Avg episode reward: [(0, '4438.412')] [2023-03-07 15:02:26,700][213771] Updated weights for policy 0, policy_version 43580 (0.0006) [2023-03-07 15:02:27,482][213771] Updated weights for policy 0, policy_version 43590 (0.0006) [2023-03-07 15:02:28,243][213771] Updated weights for policy 0, policy_version 43600 (0.0006) [2023-03-07 15:02:29,030][213771] Updated weights for policy 0, policy_version 43610 (0.0007) [2023-03-07 15:02:29,807][213771] Updated weights for policy 0, policy_version 43620 (0.0006) [2023-03-07 15:02:30,571][213771] Updated weights for policy 0, policy_version 43630 (0.0006) [2023-03-07 15:02:31,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13239.1). Total num frames: 44683264. Throughput: 0: 13247.7. Samples: 44654033. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:02:31,106][213445] Avg episode reward: [(0, '4462.438')] [2023-03-07 15:02:31,358][213771] Updated weights for policy 0, policy_version 43640 (0.0006) [2023-03-07 15:02:32,132][213771] Updated weights for policy 0, policy_version 43650 (0.0008) [2023-03-07 15:02:32,903][213771] Updated weights for policy 0, policy_version 43660 (0.0006) [2023-03-07 15:02:33,684][213771] Updated weights for policy 0, policy_version 43670 (0.0006) [2023-03-07 15:02:34,433][213771] Updated weights for policy 0, policy_version 43680 (0.0007) [2023-03-07 15:02:35,219][213771] Updated weights for policy 0, policy_version 43690 (0.0006) [2023-03-07 15:02:35,989][213771] Updated weights for policy 0, policy_version 43700 (0.0005) [2023-03-07 15:02:36,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.8, 300 sec: 13242.6). Total num frames: 44749824. Throughput: 0: 13254.8. Samples: 44733531. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:02:36,106][213445] Avg episode reward: [(0, '4437.316')] [2023-03-07 15:02:36,770][213771] Updated weights for policy 0, policy_version 43710 (0.0006) [2023-03-07 15:02:37,534][213771] Updated weights for policy 0, policy_version 43720 (0.0006) [2023-03-07 15:02:38,305][213771] Updated weights for policy 0, policy_version 43730 (0.0007) [2023-03-07 15:02:39,093][213771] Updated weights for policy 0, policy_version 43740 (0.0006) [2023-03-07 15:02:39,859][213771] Updated weights for policy 0, policy_version 43750 (0.0007) [2023-03-07 15:02:40,648][213771] Updated weights for policy 0, policy_version 43760 (0.0006) [2023-03-07 15:02:41,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13239.1). Total num frames: 44815360. Throughput: 0: 13250.9. Samples: 44812899. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:02:41,106][213445] Avg episode reward: [(0, '4398.472')] [2023-03-07 15:02:41,427][213771] Updated weights for policy 0, policy_version 43770 (0.0006) [2023-03-07 15:02:42,186][213771] Updated weights for policy 0, policy_version 43780 (0.0006) [2023-03-07 15:02:42,966][213771] Updated weights for policy 0, policy_version 43790 (0.0006) [2023-03-07 15:02:43,740][213771] Updated weights for policy 0, policy_version 43800 (0.0006) [2023-03-07 15:02:44,496][213771] Updated weights for policy 0, policy_version 43810 (0.0006) [2023-03-07 15:02:45,273][213771] Updated weights for policy 0, policy_version 43820 (0.0007) [2023-03-07 15:02:46,066][213771] Updated weights for policy 0, policy_version 43830 (0.0006) [2023-03-07 15:02:46,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 44881920. Throughput: 0: 13253.3. Samples: 44852468. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:02:46,106][213445] Avg episode reward: [(0, '4387.220')] [2023-03-07 15:02:46,844][213771] Updated weights for policy 0, policy_version 43840 (0.0007) [2023-03-07 15:02:47,611][213771] Updated weights for policy 0, policy_version 43850 (0.0006) [2023-03-07 15:02:48,389][213771] Updated weights for policy 0, policy_version 43860 (0.0006) [2023-03-07 15:02:49,145][213771] Updated weights for policy 0, policy_version 43870 (0.0007) [2023-03-07 15:02:49,912][213771] Updated weights for policy 0, policy_version 43880 (0.0006) [2023-03-07 15:02:50,678][213771] Updated weights for policy 0, policy_version 43890 (0.0006) [2023-03-07 15:02:51,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 44948480. Throughput: 0: 13256.3. Samples: 44932060. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:02:51,106][213445] Avg episode reward: [(0, '4427.066')] [2023-03-07 15:02:51,462][213771] Updated weights for policy 0, policy_version 43900 (0.0006) [2023-03-07 15:02:52,226][213771] Updated weights for policy 0, policy_version 43910 (0.0007) [2023-03-07 15:02:53,018][213771] Updated weights for policy 0, policy_version 43920 (0.0006) [2023-03-07 15:02:53,774][213771] Updated weights for policy 0, policy_version 43930 (0.0006) [2023-03-07 15:02:54,561][213771] Updated weights for policy 0, policy_version 43940 (0.0006) [2023-03-07 15:02:55,329][213771] Updated weights for policy 0, policy_version 43950 (0.0006) [2023-03-07 15:02:56,094][213771] Updated weights for policy 0, policy_version 43960 (0.0007) [2023-03-07 15:02:56,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13242.6). Total num frames: 45015040. Throughput: 0: 13256.7. Samples: 45011606. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:02:56,106][213445] Avg episode reward: [(0, '4388.728')] [2023-03-07 15:02:56,869][213771] Updated weights for policy 0, policy_version 43970 (0.0006) [2023-03-07 15:02:57,643][213771] Updated weights for policy 0, policy_version 43980 (0.0006) [2023-03-07 15:02:58,418][213771] Updated weights for policy 0, policy_version 43990 (0.0005) [2023-03-07 15:02:59,186][213771] Updated weights for policy 0, policy_version 44000 (0.0007) [2023-03-07 15:02:59,956][213771] Updated weights for policy 0, policy_version 44010 (0.0005) [2023-03-07 15:03:00,729][213771] Updated weights for policy 0, policy_version 44020 (0.0006) [2023-03-07 15:03:01,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13239.1). Total num frames: 45080576. Throughput: 0: 13251.3. Samples: 45051257. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:03:01,106][213445] Avg episode reward: [(0, '4389.565')] [2023-03-07 15:03:01,501][213771] Updated weights for policy 0, policy_version 44030 (0.0006) [2023-03-07 15:03:02,274][213771] Updated weights for policy 0, policy_version 44040 (0.0005) [2023-03-07 15:03:03,075][213771] Updated weights for policy 0, policy_version 44050 (0.0006) [2023-03-07 15:03:03,848][213771] Updated weights for policy 0, policy_version 44060 (0.0007) [2023-03-07 15:03:04,617][213771] Updated weights for policy 0, policy_version 44070 (0.0006) [2023-03-07 15:03:05,382][213771] Updated weights for policy 0, policy_version 44080 (0.0006) [2023-03-07 15:03:06,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 45147136. Throughput: 0: 13239.1. Samples: 45130502. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:03:06,106][213445] Avg episode reward: [(0, '4360.400')] [2023-03-07 15:03:06,153][213771] Updated weights for policy 0, policy_version 44090 (0.0005) [2023-03-07 15:03:06,922][213771] Updated weights for policy 0, policy_version 44100 (0.0006) [2023-03-07 15:03:07,684][213771] Updated weights for policy 0, policy_version 44110 (0.0007) [2023-03-07 15:03:08,454][213771] Updated weights for policy 0, policy_version 44120 (0.0006) [2023-03-07 15:03:09,239][213771] Updated weights for policy 0, policy_version 44130 (0.0006) [2023-03-07 15:03:10,027][213771] Updated weights for policy 0, policy_version 44140 (0.0007) [2023-03-07 15:03:10,796][213771] Updated weights for policy 0, policy_version 44150 (0.0007) [2023-03-07 15:03:11,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13260.8, 300 sec: 13242.6). Total num frames: 45213696. Throughput: 0: 13240.5. Samples: 45210349. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:03:11,106][213445] Avg episode reward: [(0, '4419.687')] [2023-03-07 15:03:11,565][213771] Updated weights for policy 0, policy_version 44160 (0.0006) [2023-03-07 15:03:12,310][213771] Updated weights for policy 0, policy_version 44170 (0.0006) [2023-03-07 15:03:13,079][213771] Updated weights for policy 0, policy_version 44180 (0.0006) [2023-03-07 15:03:13,833][213771] Updated weights for policy 0, policy_version 44190 (0.0007) [2023-03-07 15:03:14,620][213771] Updated weights for policy 0, policy_version 44200 (0.0005) [2023-03-07 15:03:15,374][213771] Updated weights for policy 0, policy_version 44210 (0.0006) [2023-03-07 15:03:16,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13260.8, 300 sec: 13242.6). Total num frames: 45280256. Throughput: 0: 13256.5. Samples: 45250577. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:03:16,106][213445] Avg episode reward: [(0, '4335.172')] [2023-03-07 15:03:16,137][213771] Updated weights for policy 0, policy_version 44220 (0.0005) [2023-03-07 15:03:16,906][213771] Updated weights for policy 0, policy_version 44230 (0.0007) [2023-03-07 15:03:17,691][213771] Updated weights for policy 0, policy_version 44240 (0.0006) [2023-03-07 15:03:18,468][213771] Updated weights for policy 0, policy_version 44250 (0.0006) [2023-03-07 15:03:19,238][213771] Updated weights for policy 0, policy_version 44260 (0.0007) [2023-03-07 15:03:20,004][213771] Updated weights for policy 0, policy_version 44270 (0.0007) [2023-03-07 15:03:20,788][213771] Updated weights for policy 0, policy_version 44280 (0.0005) [2023-03-07 15:03:21,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13260.8, 300 sec: 13242.6). Total num frames: 45346816. Throughput: 0: 13257.8. Samples: 45330133. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:03:21,106][213445] Avg episode reward: [(0, '4408.919')] [2023-03-07 15:03:21,560][213771] Updated weights for policy 0, policy_version 44290 (0.0007) [2023-03-07 15:03:22,349][213771] Updated weights for policy 0, policy_version 44300 (0.0006) [2023-03-07 15:03:23,102][213771] Updated weights for policy 0, policy_version 44310 (0.0006) [2023-03-07 15:03:23,880][213771] Updated weights for policy 0, policy_version 44320 (0.0006) [2023-03-07 15:03:24,660][213771] Updated weights for policy 0, policy_version 44330 (0.0006) [2023-03-07 15:03:25,441][213771] Updated weights for policy 0, policy_version 44340 (0.0006) [2023-03-07 15:03:26,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 45412352. Throughput: 0: 13259.3. Samples: 45409567. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:03:26,106][213445] Avg episode reward: [(0, '4416.197')] [2023-03-07 15:03:26,208][213771] Updated weights for policy 0, policy_version 44350 (0.0005) [2023-03-07 15:03:26,986][213771] Updated weights for policy 0, policy_version 44360 (0.0006) [2023-03-07 15:03:27,742][213771] Updated weights for policy 0, policy_version 44370 (0.0007) [2023-03-07 15:03:28,519][213771] Updated weights for policy 0, policy_version 44380 (0.0005) [2023-03-07 15:03:29,290][213771] Updated weights for policy 0, policy_version 44390 (0.0006) [2023-03-07 15:03:30,065][213771] Updated weights for policy 0, policy_version 44400 (0.0007) [2023-03-07 15:03:30,849][213771] Updated weights for policy 0, policy_version 44410 (0.0005) [2023-03-07 15:03:31,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13242.6). Total num frames: 45478912. Throughput: 0: 13266.6. Samples: 45449467. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:03:31,106][213445] Avg episode reward: [(0, '4423.979')] [2023-03-07 15:03:31,604][213771] Updated weights for policy 0, policy_version 44420 (0.0005) [2023-03-07 15:03:32,381][213771] Updated weights for policy 0, policy_version 44430 (0.0006) [2023-03-07 15:03:33,146][213771] Updated weights for policy 0, policy_version 44440 (0.0006) [2023-03-07 15:03:33,908][213771] Updated weights for policy 0, policy_version 44450 (0.0006) [2023-03-07 15:03:34,690][213771] Updated weights for policy 0, policy_version 44460 (0.0006) [2023-03-07 15:03:35,478][213771] Updated weights for policy 0, policy_version 44470 (0.0006) [2023-03-07 15:03:36,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13260.8, 300 sec: 13246.1). Total num frames: 45545472. Throughput: 0: 13264.3. Samples: 45528951. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:03:36,116][213445] Avg episode reward: [(0, '4460.407')] [2023-03-07 15:03:36,241][213771] Updated weights for policy 0, policy_version 44480 (0.0005) [2023-03-07 15:03:37,013][213771] Updated weights for policy 0, policy_version 44490 (0.0006) [2023-03-07 15:03:37,790][213771] Updated weights for policy 0, policy_version 44500 (0.0006) [2023-03-07 15:03:38,557][213771] Updated weights for policy 0, policy_version 44510 (0.0007) [2023-03-07 15:03:39,326][213771] Updated weights for policy 0, policy_version 44520 (0.0005) [2023-03-07 15:03:40,101][213771] Updated weights for policy 0, policy_version 44530 (0.0006) [2023-03-07 15:03:40,860][213771] Updated weights for policy 0, policy_version 44540 (0.0006) [2023-03-07 15:03:41,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13277.9, 300 sec: 13246.1). Total num frames: 45612032. Throughput: 0: 13265.4. Samples: 45608546. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:03:41,116][213445] Avg episode reward: [(0, '4487.297')] [2023-03-07 15:03:41,640][213771] Updated weights for policy 0, policy_version 44550 (0.0005) [2023-03-07 15:03:42,397][213771] Updated weights for policy 0, policy_version 44560 (0.0006) [2023-03-07 15:03:43,186][213771] Updated weights for policy 0, policy_version 44570 (0.0006) [2023-03-07 15:03:43,952][213771] Updated weights for policy 0, policy_version 44580 (0.0006) [2023-03-07 15:03:44,735][213771] Updated weights for policy 0, policy_version 44590 (0.0006) [2023-03-07 15:03:45,500][213771] Updated weights for policy 0, policy_version 44600 (0.0007) [2023-03-07 15:03:46,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13242.6). Total num frames: 45677568. Throughput: 0: 13266.7. Samples: 45648259. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:03:46,116][213445] Avg episode reward: [(0, '4465.515')] [2023-03-07 15:03:46,271][213771] Updated weights for policy 0, policy_version 44610 (0.0006) [2023-03-07 15:03:47,026][213771] Updated weights for policy 0, policy_version 44620 (0.0006) [2023-03-07 15:03:47,807][213771] Updated weights for policy 0, policy_version 44630 (0.0006) [2023-03-07 15:03:48,570][213771] Updated weights for policy 0, policy_version 44640 (0.0005) [2023-03-07 15:03:49,343][213771] Updated weights for policy 0, policy_version 44650 (0.0005) [2023-03-07 15:03:50,126][213771] Updated weights for policy 0, policy_version 44660 (0.0006) [2023-03-07 15:03:50,885][213771] Updated weights for policy 0, policy_version 44670 (0.0006) [2023-03-07 15:03:51,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13246.1). Total num frames: 45744128. Throughput: 0: 13281.4. Samples: 45728162. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:03:51,106][213445] Avg episode reward: [(0, '4491.044')] [2023-03-07 15:03:51,665][213771] Updated weights for policy 0, policy_version 44680 (0.0006) [2023-03-07 15:03:52,422][213771] Updated weights for policy 0, policy_version 44690 (0.0006) [2023-03-07 15:03:53,202][213771] Updated weights for policy 0, policy_version 44700 (0.0006) [2023-03-07 15:03:53,990][213771] Updated weights for policy 0, policy_version 44710 (0.0006) [2023-03-07 15:03:54,764][213771] Updated weights for policy 0, policy_version 44720 (0.0008) [2023-03-07 15:03:55,538][213771] Updated weights for policy 0, policy_version 44730 (0.0007) [2023-03-07 15:03:56,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13260.8, 300 sec: 13246.0). Total num frames: 45810688. Throughput: 0: 13268.8. Samples: 45807446. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:03:56,106][213445] Avg episode reward: [(0, '4438.757')] [2023-03-07 15:03:56,314][213771] Updated weights for policy 0, policy_version 44740 (0.0007) [2023-03-07 15:03:57,092][213771] Updated weights for policy 0, policy_version 44750 (0.0005) [2023-03-07 15:03:57,850][213771] Updated weights for policy 0, policy_version 44760 (0.0006) [2023-03-07 15:03:58,617][213771] Updated weights for policy 0, policy_version 44770 (0.0006) [2023-03-07 15:03:59,398][213771] Updated weights for policy 0, policy_version 44780 (0.0006) [2023-03-07 15:04:00,162][213771] Updated weights for policy 0, policy_version 44790 (0.0005) [2023-03-07 15:04:00,937][213771] Updated weights for policy 0, policy_version 44800 (0.0006) [2023-03-07 15:04:01,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13277.9, 300 sec: 13249.5). Total num frames: 45877248. Throughput: 0: 13257.8. Samples: 45847176. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:04:01,106][213445] Avg episode reward: [(0, '4407.949')] [2023-03-07 15:04:01,706][213771] Updated weights for policy 0, policy_version 44810 (0.0005) [2023-03-07 15:04:02,471][213771] Updated weights for policy 0, policy_version 44820 (0.0006) [2023-03-07 15:04:03,241][213771] Updated weights for policy 0, policy_version 44830 (0.0006) [2023-03-07 15:04:04,002][213771] Updated weights for policy 0, policy_version 44840 (0.0006) [2023-03-07 15:04:04,774][213771] Updated weights for policy 0, policy_version 44850 (0.0006) [2023-03-07 15:04:05,554][213771] Updated weights for policy 0, policy_version 44860 (0.0006) [2023-03-07 15:04:06,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13277.9, 300 sec: 13249.5). Total num frames: 45943808. Throughput: 0: 13266.1. Samples: 45927107. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:04:06,106][213445] Avg episode reward: [(0, '4261.858')] [2023-03-07 15:04:06,112][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000044867_45943808.pth... [2023-03-07 15:04:06,141][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000041763_42765312.pth [2023-03-07 15:04:06,308][213771] Updated weights for policy 0, policy_version 44870 (0.0006) [2023-03-07 15:04:07,101][213771] Updated weights for policy 0, policy_version 44880 (0.0006) [2023-03-07 15:04:07,878][213771] Updated weights for policy 0, policy_version 44890 (0.0006) [2023-03-07 15:04:08,645][213771] Updated weights for policy 0, policy_version 44900 (0.0005) [2023-03-07 15:04:09,417][213771] Updated weights for policy 0, policy_version 44910 (0.0005) [2023-03-07 15:04:10,182][213771] Updated weights for policy 0, policy_version 44920 (0.0006) [2023-03-07 15:04:10,959][213771] Updated weights for policy 0, policy_version 44930 (0.0006) [2023-03-07 15:04:11,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13246.0). Total num frames: 46009344. Throughput: 0: 13267.8. Samples: 46006619. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:04:11,106][213445] Avg episode reward: [(0, '4484.714')] [2023-03-07 15:04:11,729][213771] Updated weights for policy 0, policy_version 44940 (0.0006) [2023-03-07 15:04:12,503][213771] Updated weights for policy 0, policy_version 44950 (0.0007) [2023-03-07 15:04:13,281][213771] Updated weights for policy 0, policy_version 44960 (0.0006) [2023-03-07 15:04:14,072][213771] Updated weights for policy 0, policy_version 44970 (0.0006) [2023-03-07 15:04:14,845][213771] Updated weights for policy 0, policy_version 44980 (0.0007) [2023-03-07 15:04:15,626][213771] Updated weights for policy 0, policy_version 44990 (0.0006) [2023-03-07 15:04:16,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13260.8, 300 sec: 13246.1). Total num frames: 46075904. Throughput: 0: 13265.6. Samples: 46046418. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:04:16,106][213445] Avg episode reward: [(0, '4455.184')] [2023-03-07 15:04:16,401][213771] Updated weights for policy 0, policy_version 45000 (0.0005) [2023-03-07 15:04:17,166][213771] Updated weights for policy 0, policy_version 45010 (0.0005) [2023-03-07 15:04:17,960][213771] Updated weights for policy 0, policy_version 45020 (0.0006) [2023-03-07 15:04:18,729][213771] Updated weights for policy 0, policy_version 45030 (0.0007) [2023-03-07 15:04:19,502][213771] Updated weights for policy 0, policy_version 45040 (0.0006) [2023-03-07 15:04:20,267][213771] Updated weights for policy 0, policy_version 45050 (0.0006) [2023-03-07 15:04:21,058][213771] Updated weights for policy 0, policy_version 45060 (0.0006) [2023-03-07 15:04:21,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.8, 300 sec: 13242.6). Total num frames: 46141440. Throughput: 0: 13255.4. Samples: 46125444. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:04:21,105][213445] Avg episode reward: [(0, '4419.555')] [2023-03-07 15:04:21,831][213771] Updated weights for policy 0, policy_version 45070 (0.0007) [2023-03-07 15:04:22,598][213771] Updated weights for policy 0, policy_version 45080 (0.0006) [2023-03-07 15:04:23,393][213771] Updated weights for policy 0, policy_version 45090 (0.0006) [2023-03-07 15:04:24,164][213771] Updated weights for policy 0, policy_version 45100 (0.0006) [2023-03-07 15:04:24,950][213771] Updated weights for policy 0, policy_version 45110 (0.0005) [2023-03-07 15:04:25,710][213771] Updated weights for policy 0, policy_version 45120 (0.0005) [2023-03-07 15:04:26,105][213445] Fps is (10 sec: 13107.2, 60 sec: 13243.8, 300 sec: 13242.6). Total num frames: 46206976. Throughput: 0: 13240.8. Samples: 46204383. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:04:26,106][213445] Avg episode reward: [(0, '4368.974')] [2023-03-07 15:04:26,493][213771] Updated weights for policy 0, policy_version 45130 (0.0006) [2023-03-07 15:04:27,269][213771] Updated weights for policy 0, policy_version 45140 (0.0006) [2023-03-07 15:04:28,039][213771] Updated weights for policy 0, policy_version 45150 (0.0006) [2023-03-07 15:04:28,805][213771] Updated weights for policy 0, policy_version 45160 (0.0006) [2023-03-07 15:04:29,590][213771] Updated weights for policy 0, policy_version 45170 (0.0006) [2023-03-07 15:04:30,350][213771] Updated weights for policy 0, policy_version 45180 (0.0005) [2023-03-07 15:04:31,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 46273536. Throughput: 0: 13242.5. Samples: 46244172. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:04:31,106][213445] Avg episode reward: [(0, '4426.437')] [2023-03-07 15:04:31,123][213771] Updated weights for policy 0, policy_version 45190 (0.0006) [2023-03-07 15:04:31,890][213771] Updated weights for policy 0, policy_version 45200 (0.0006) [2023-03-07 15:04:32,671][213771] Updated weights for policy 0, policy_version 45210 (0.0006) [2023-03-07 15:04:33,457][213771] Updated weights for policy 0, policy_version 45220 (0.0007) [2023-03-07 15:04:34,222][213771] Updated weights for policy 0, policy_version 45230 (0.0006) [2023-03-07 15:04:34,991][213771] Updated weights for policy 0, policy_version 45240 (0.0007) [2023-03-07 15:04:35,777][213771] Updated weights for policy 0, policy_version 45250 (0.0005) [2023-03-07 15:04:36,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 46340096. Throughput: 0: 13235.3. Samples: 46323751. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:04:36,106][213445] Avg episode reward: [(0, '4379.101')] [2023-03-07 15:04:36,570][213771] Updated weights for policy 0, policy_version 45260 (0.0006) [2023-03-07 15:04:37,336][213771] Updated weights for policy 0, policy_version 45270 (0.0006) [2023-03-07 15:04:38,093][213771] Updated weights for policy 0, policy_version 45280 (0.0006) [2023-03-07 15:04:38,873][213771] Updated weights for policy 0, policy_version 45290 (0.0006) [2023-03-07 15:04:39,646][213771] Updated weights for policy 0, policy_version 45300 (0.0006) [2023-03-07 15:04:40,425][213771] Updated weights for policy 0, policy_version 45310 (0.0006) [2023-03-07 15:04:41,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 46406656. Throughput: 0: 13234.0. Samples: 46402973. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:04:41,106][213445] Avg episode reward: [(0, '4425.170')] [2023-03-07 15:04:41,198][213771] Updated weights for policy 0, policy_version 45320 (0.0006) [2023-03-07 15:04:41,964][213771] Updated weights for policy 0, policy_version 45330 (0.0006) [2023-03-07 15:04:42,750][213771] Updated weights for policy 0, policy_version 45340 (0.0006) [2023-03-07 15:04:43,512][213771] Updated weights for policy 0, policy_version 45350 (0.0006) [2023-03-07 15:04:44,289][213771] Updated weights for policy 0, policy_version 45360 (0.0005) [2023-03-07 15:04:45,061][213771] Updated weights for policy 0, policy_version 45370 (0.0006) [2023-03-07 15:04:45,827][213771] Updated weights for policy 0, policy_version 45380 (0.0006) [2023-03-07 15:04:46,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 46472192. Throughput: 0: 13233.6. Samples: 46442689. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:04:46,106][213445] Avg episode reward: [(0, '4439.477')] [2023-03-07 15:04:46,605][213771] Updated weights for policy 0, policy_version 45390 (0.0007) [2023-03-07 15:04:47,393][213771] Updated weights for policy 0, policy_version 45400 (0.0006) [2023-03-07 15:04:48,183][213771] Updated weights for policy 0, policy_version 45410 (0.0006) [2023-03-07 15:04:48,942][213771] Updated weights for policy 0, policy_version 45420 (0.0006) [2023-03-07 15:04:49,726][213771] Updated weights for policy 0, policy_version 45430 (0.0007) [2023-03-07 15:04:50,520][213771] Updated weights for policy 0, policy_version 45440 (0.0006) [2023-03-07 15:04:51,105][213445] Fps is (10 sec: 13107.2, 60 sec: 13226.7, 300 sec: 13235.6). Total num frames: 46537728. Throughput: 0: 13211.2. Samples: 46521610. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:04:51,105][213445] Avg episode reward: [(0, '4423.781')] [2023-03-07 15:04:51,302][213771] Updated weights for policy 0, policy_version 45450 (0.0006) [2023-03-07 15:04:52,083][213771] Updated weights for policy 0, policy_version 45460 (0.0006) [2023-03-07 15:04:52,842][213771] Updated weights for policy 0, policy_version 45470 (0.0006) [2023-03-07 15:04:53,629][213771] Updated weights for policy 0, policy_version 45480 (0.0006) [2023-03-07 15:04:54,387][213771] Updated weights for policy 0, policy_version 45490 (0.0006) [2023-03-07 15:04:55,175][213771] Updated weights for policy 0, policy_version 45500 (0.0006) [2023-03-07 15:04:55,965][213771] Updated weights for policy 0, policy_version 45510 (0.0005) [2023-03-07 15:04:56,105][213445] Fps is (10 sec: 13107.4, 60 sec: 13209.6, 300 sec: 13235.6). Total num frames: 46603264. Throughput: 0: 13199.7. Samples: 46600607. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:04:56,106][213445] Avg episode reward: [(0, '4443.003')] [2023-03-07 15:04:56,737][213771] Updated weights for policy 0, policy_version 45520 (0.0006) [2023-03-07 15:04:57,509][213771] Updated weights for policy 0, policy_version 45530 (0.0006) [2023-03-07 15:04:58,295][213771] Updated weights for policy 0, policy_version 45540 (0.0006) [2023-03-07 15:04:59,057][213771] Updated weights for policy 0, policy_version 45550 (0.0006) [2023-03-07 15:04:59,845][213771] Updated weights for policy 0, policy_version 45560 (0.0005) [2023-03-07 15:05:00,620][213771] Updated weights for policy 0, policy_version 45570 (0.0006) [2023-03-07 15:05:01,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13209.6, 300 sec: 13239.1). Total num frames: 46669824. Throughput: 0: 13195.6. Samples: 46640222. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:05:01,106][213445] Avg episode reward: [(0, '4415.623')] [2023-03-07 15:05:01,386][213771] Updated weights for policy 0, policy_version 45580 (0.0006) [2023-03-07 15:05:02,159][213771] Updated weights for policy 0, policy_version 45590 (0.0006) [2023-03-07 15:05:02,917][213771] Updated weights for policy 0, policy_version 45600 (0.0006) [2023-03-07 15:05:03,694][213771] Updated weights for policy 0, policy_version 45610 (0.0007) [2023-03-07 15:05:04,477][213771] Updated weights for policy 0, policy_version 45620 (0.0006) [2023-03-07 15:05:05,247][213771] Updated weights for policy 0, policy_version 45630 (0.0007) [2023-03-07 15:05:06,029][213771] Updated weights for policy 0, policy_version 45640 (0.0006) [2023-03-07 15:05:06,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13209.6, 300 sec: 13239.1). Total num frames: 46736384. Throughput: 0: 13204.3. Samples: 46719640. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:05:06,106][213445] Avg episode reward: [(0, '4494.359')] [2023-03-07 15:05:06,793][213771] Updated weights for policy 0, policy_version 45650 (0.0006) [2023-03-07 15:05:07,565][213771] Updated weights for policy 0, policy_version 45660 (0.0006) [2023-03-07 15:05:08,326][213771] Updated weights for policy 0, policy_version 45670 (0.0006) [2023-03-07 15:05:09,119][213771] Updated weights for policy 0, policy_version 45680 (0.0005) [2023-03-07 15:05:09,870][213771] Updated weights for policy 0, policy_version 45690 (0.0006) [2023-03-07 15:05:10,657][213771] Updated weights for policy 0, policy_version 45700 (0.0005) [2023-03-07 15:05:11,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13209.6, 300 sec: 13235.6). Total num frames: 46801920. Throughput: 0: 13218.3. Samples: 46799208. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:05:11,106][213445] Avg episode reward: [(0, '4449.086')] [2023-03-07 15:05:11,420][213771] Updated weights for policy 0, policy_version 45710 (0.0006) [2023-03-07 15:05:12,186][213771] Updated weights for policy 0, policy_version 45720 (0.0006) [2023-03-07 15:05:12,958][213771] Updated weights for policy 0, policy_version 45730 (0.0007) [2023-03-07 15:05:13,746][213771] Updated weights for policy 0, policy_version 45740 (0.0006) [2023-03-07 15:05:14,519][213771] Updated weights for policy 0, policy_version 45750 (0.0006) [2023-03-07 15:05:15,278][213771] Updated weights for policy 0, policy_version 45760 (0.0006) [2023-03-07 15:05:16,045][213771] Updated weights for policy 0, policy_version 45770 (0.0006) [2023-03-07 15:05:16,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13209.6, 300 sec: 13239.1). Total num frames: 46868480. Throughput: 0: 13220.7. Samples: 46839102. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:05:16,106][213445] Avg episode reward: [(0, '4428.700')] [2023-03-07 15:05:16,825][213771] Updated weights for policy 0, policy_version 45780 (0.0006) [2023-03-07 15:05:17,594][213771] Updated weights for policy 0, policy_version 45790 (0.0007) [2023-03-07 15:05:18,372][213771] Updated weights for policy 0, policy_version 45800 (0.0006) [2023-03-07 15:05:19,147][213771] Updated weights for policy 0, policy_version 45810 (0.0006) [2023-03-07 15:05:19,898][213771] Updated weights for policy 0, policy_version 45820 (0.0006) [2023-03-07 15:05:20,686][213771] Updated weights for policy 0, policy_version 45830 (0.0006) [2023-03-07 15:05:21,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13226.7, 300 sec: 13239.1). Total num frames: 46935040. Throughput: 0: 13219.0. Samples: 46918607. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:05:21,106][213445] Avg episode reward: [(0, '4398.219')] [2023-03-07 15:05:21,468][213771] Updated weights for policy 0, policy_version 45840 (0.0007) [2023-03-07 15:05:22,222][213771] Updated weights for policy 0, policy_version 45850 (0.0006) [2023-03-07 15:05:23,005][213771] Updated weights for policy 0, policy_version 45860 (0.0007) [2023-03-07 15:05:23,786][213771] Updated weights for policy 0, policy_version 45870 (0.0007) [2023-03-07 15:05:24,566][213771] Updated weights for policy 0, policy_version 45880 (0.0006) [2023-03-07 15:05:25,326][213771] Updated weights for policy 0, policy_version 45890 (0.0005) [2023-03-07 15:05:26,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13226.6, 300 sec: 13235.6). Total num frames: 47000576. Throughput: 0: 13220.6. Samples: 46997902. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:05:26,116][213445] Avg episode reward: [(0, '4433.864')] [2023-03-07 15:05:26,119][213771] Updated weights for policy 0, policy_version 45900 (0.0006) [2023-03-07 15:05:26,893][213771] Updated weights for policy 0, policy_version 45910 (0.0008) [2023-03-07 15:05:27,656][213771] Updated weights for policy 0, policy_version 45920 (0.0006) [2023-03-07 15:05:28,416][213771] Updated weights for policy 0, policy_version 45930 (0.0006) [2023-03-07 15:05:29,204][213771] Updated weights for policy 0, policy_version 45940 (0.0006) [2023-03-07 15:05:29,964][213771] Updated weights for policy 0, policy_version 45950 (0.0006) [2023-03-07 15:05:30,748][213771] Updated weights for policy 0, policy_version 45960 (0.0006) [2023-03-07 15:05:31,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13239.1). Total num frames: 47067136. Throughput: 0: 13224.2. Samples: 47037774. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:05:31,116][213445] Avg episode reward: [(0, '4403.811')] [2023-03-07 15:05:31,516][213771] Updated weights for policy 0, policy_version 45970 (0.0006) [2023-03-07 15:05:32,294][213771] Updated weights for policy 0, policy_version 45980 (0.0007) [2023-03-07 15:05:33,053][213771] Updated weights for policy 0, policy_version 45990 (0.0006) [2023-03-07 15:05:33,826][213771] Updated weights for policy 0, policy_version 46000 (0.0006) [2023-03-07 15:05:34,606][213771] Updated weights for policy 0, policy_version 46010 (0.0006) [2023-03-07 15:05:35,378][213771] Updated weights for policy 0, policy_version 46020 (0.0006) [2023-03-07 15:05:36,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13226.6, 300 sec: 13239.1). Total num frames: 47133696. Throughput: 0: 13238.1. Samples: 47117327. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:05:36,116][213445] Avg episode reward: [(0, '4425.434')] [2023-03-07 15:05:36,152][213771] Updated weights for policy 0, policy_version 46030 (0.0007) [2023-03-07 15:05:36,936][213771] Updated weights for policy 0, policy_version 46040 (0.0006) [2023-03-07 15:05:37,703][213771] Updated weights for policy 0, policy_version 46050 (0.0005) [2023-03-07 15:05:38,485][213771] Updated weights for policy 0, policy_version 46060 (0.0007) [2023-03-07 15:05:39,256][213771] Updated weights for policy 0, policy_version 46070 (0.0005) [2023-03-07 15:05:40,021][213771] Updated weights for policy 0, policy_version 46080 (0.0006) [2023-03-07 15:05:40,788][213771] Updated weights for policy 0, policy_version 46090 (0.0008) [2023-03-07 15:05:41,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13209.6, 300 sec: 13235.6). Total num frames: 47199232. Throughput: 0: 13245.7. Samples: 47196665. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:05:41,106][213445] Avg episode reward: [(0, '4448.016')] [2023-03-07 15:05:41,586][213771] Updated weights for policy 0, policy_version 46100 (0.0006) [2023-03-07 15:05:42,346][213771] Updated weights for policy 0, policy_version 46110 (0.0005) [2023-03-07 15:05:43,127][213771] Updated weights for policy 0, policy_version 46120 (0.0006) [2023-03-07 15:05:43,886][213771] Updated weights for policy 0, policy_version 46130 (0.0006) [2023-03-07 15:05:44,657][213771] Updated weights for policy 0, policy_version 46140 (0.0005) [2023-03-07 15:05:45,430][213771] Updated weights for policy 0, policy_version 46150 (0.0006) [2023-03-07 15:05:46,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13239.1). Total num frames: 47265792. Throughput: 0: 13245.2. Samples: 47236256. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:05:46,106][213445] Avg episode reward: [(0, '4430.212')] [2023-03-07 15:05:46,205][213771] Updated weights for policy 0, policy_version 46160 (0.0006) [2023-03-07 15:05:47,001][213771] Updated weights for policy 0, policy_version 46170 (0.0006) [2023-03-07 15:05:47,769][213771] Updated weights for policy 0, policy_version 46180 (0.0006) [2023-03-07 15:05:48,557][213771] Updated weights for policy 0, policy_version 46190 (0.0006) [2023-03-07 15:05:49,341][213771] Updated weights for policy 0, policy_version 46200 (0.0006) [2023-03-07 15:05:50,104][213771] Updated weights for policy 0, policy_version 46210 (0.0005) [2023-03-07 15:05:50,877][213771] Updated weights for policy 0, policy_version 46220 (0.0007) [2023-03-07 15:05:51,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13243.7, 300 sec: 13239.1). Total num frames: 47332352. Throughput: 0: 13240.5. Samples: 47315461. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:05:51,106][213445] Avg episode reward: [(0, '4425.472')] [2023-03-07 15:05:51,655][213771] Updated weights for policy 0, policy_version 46230 (0.0005) [2023-03-07 15:05:52,424][213771] Updated weights for policy 0, policy_version 46240 (0.0006) [2023-03-07 15:05:53,193][213771] Updated weights for policy 0, policy_version 46250 (0.0006) [2023-03-07 15:05:53,972][213771] Updated weights for policy 0, policy_version 46260 (0.0006) [2023-03-07 15:05:54,775][213771] Updated weights for policy 0, policy_version 46270 (0.0005) [2023-03-07 15:05:55,554][213771] Updated weights for policy 0, policy_version 46280 (0.0008) [2023-03-07 15:05:56,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13239.1). Total num frames: 47397888. Throughput: 0: 13231.1. Samples: 47394605. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:05:56,106][213445] Avg episode reward: [(0, '4373.312')] [2023-03-07 15:05:56,314][213771] Updated weights for policy 0, policy_version 46290 (0.0006) [2023-03-07 15:05:57,109][213771] Updated weights for policy 0, policy_version 46300 (0.0006) [2023-03-07 15:05:57,863][213771] Updated weights for policy 0, policy_version 46310 (0.0006) [2023-03-07 15:05:58,625][213771] Updated weights for policy 0, policy_version 46320 (0.0006) [2023-03-07 15:05:59,415][213771] Updated weights for policy 0, policy_version 46330 (0.0006) [2023-03-07 15:06:00,187][213771] Updated weights for policy 0, policy_version 46340 (0.0006) [2023-03-07 15:06:00,960][213771] Updated weights for policy 0, policy_version 46350 (0.0006) [2023-03-07 15:06:01,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.8, 300 sec: 13239.1). Total num frames: 47464448. Throughput: 0: 13226.5. Samples: 47434295. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:06:01,106][213445] Avg episode reward: [(0, '4339.593')] [2023-03-07 15:06:01,725][213771] Updated weights for policy 0, policy_version 46360 (0.0006) [2023-03-07 15:06:02,510][213771] Updated weights for policy 0, policy_version 46370 (0.0006) [2023-03-07 15:06:03,269][213771] Updated weights for policy 0, policy_version 46380 (0.0006) [2023-03-07 15:06:04,053][213771] Updated weights for policy 0, policy_version 46390 (0.0008) [2023-03-07 15:06:04,829][213771] Updated weights for policy 0, policy_version 46400 (0.0006) [2023-03-07 15:06:05,609][213771] Updated weights for policy 0, policy_version 46410 (0.0005) [2023-03-07 15:06:06,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13226.7, 300 sec: 13239.1). Total num frames: 47529984. Throughput: 0: 13225.1. Samples: 47513739. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:06:06,106][213445] Avg episode reward: [(0, '4341.355')] [2023-03-07 15:06:06,109][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000046416_47529984.pth... [2023-03-07 15:06:06,141][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000043313_44352512.pth [2023-03-07 15:06:06,382][213771] Updated weights for policy 0, policy_version 46420 (0.0005) [2023-03-07 15:06:07,155][213771] Updated weights for policy 0, policy_version 46430 (0.0006) [2023-03-07 15:06:07,937][213771] Updated weights for policy 0, policy_version 46440 (0.0007) [2023-03-07 15:06:08,719][213771] Updated weights for policy 0, policy_version 46450 (0.0006) [2023-03-07 15:06:09,476][213771] Updated weights for policy 0, policy_version 46460 (0.0006) [2023-03-07 15:06:10,225][213771] Updated weights for policy 0, policy_version 46470 (0.0007) [2023-03-07 15:06:11,015][213771] Updated weights for policy 0, policy_version 46480 (0.0006) [2023-03-07 15:06:11,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 47596544. Throughput: 0: 13226.7. Samples: 47593104. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:06:11,106][213445] Avg episode reward: [(0, '4362.386')] [2023-03-07 15:06:11,783][213771] Updated weights for policy 0, policy_version 46490 (0.0005) [2023-03-07 15:06:12,550][213771] Updated weights for policy 0, policy_version 46500 (0.0006) [2023-03-07 15:06:13,321][213771] Updated weights for policy 0, policy_version 46510 (0.0006) [2023-03-07 15:06:14,100][213771] Updated weights for policy 0, policy_version 46520 (0.0006) [2023-03-07 15:06:14,889][213771] Updated weights for policy 0, policy_version 46530 (0.0006) [2023-03-07 15:06:15,658][213771] Updated weights for policy 0, policy_version 46540 (0.0006) [2023-03-07 15:06:16,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13226.6, 300 sec: 13239.1). Total num frames: 47662080. Throughput: 0: 13226.6. Samples: 47632972. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:06:16,106][213445] Avg episode reward: [(0, '4413.521')] [2023-03-07 15:06:16,439][213771] Updated weights for policy 0, policy_version 46550 (0.0006) [2023-03-07 15:06:17,216][213771] Updated weights for policy 0, policy_version 46560 (0.0006) [2023-03-07 15:06:17,990][213771] Updated weights for policy 0, policy_version 46570 (0.0006) [2023-03-07 15:06:18,778][213771] Updated weights for policy 0, policy_version 46580 (0.0007) [2023-03-07 15:06:19,540][213771] Updated weights for policy 0, policy_version 46590 (0.0006) [2023-03-07 15:06:20,322][213771] Updated weights for policy 0, policy_version 46600 (0.0007) [2023-03-07 15:06:21,089][213771] Updated weights for policy 0, policy_version 46610 (0.0005) [2023-03-07 15:06:21,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13226.6, 300 sec: 13239.1). Total num frames: 47728640. Throughput: 0: 13215.2. Samples: 47712009. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:06:21,106][213445] Avg episode reward: [(0, '4385.331')] [2023-03-07 15:06:21,861][213771] Updated weights for policy 0, policy_version 46620 (0.0006) [2023-03-07 15:06:22,620][213771] Updated weights for policy 0, policy_version 46630 (0.0005) [2023-03-07 15:06:23,399][213771] Updated weights for policy 0, policy_version 46640 (0.0006) [2023-03-07 15:06:24,164][213771] Updated weights for policy 0, policy_version 46650 (0.0007) [2023-03-07 15:06:24,932][213771] Updated weights for policy 0, policy_version 46660 (0.0006) [2023-03-07 15:06:25,702][213771] Updated weights for policy 0, policy_version 46670 (0.0006) [2023-03-07 15:06:26,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13226.7, 300 sec: 13235.6). Total num frames: 47794176. Throughput: 0: 13221.9. Samples: 47791652. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:06:26,106][213445] Avg episode reward: [(0, '4459.391')] [2023-03-07 15:06:26,491][213771] Updated weights for policy 0, policy_version 46680 (0.0007) [2023-03-07 15:06:27,256][213771] Updated weights for policy 0, policy_version 46690 (0.0007) [2023-03-07 15:06:28,030][213771] Updated weights for policy 0, policy_version 46700 (0.0007) [2023-03-07 15:06:28,790][213771] Updated weights for policy 0, policy_version 46710 (0.0006) [2023-03-07 15:06:29,571][213771] Updated weights for policy 0, policy_version 46720 (0.0007) [2023-03-07 15:06:30,334][213771] Updated weights for policy 0, policy_version 46730 (0.0007) [2023-03-07 15:06:31,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13239.1). Total num frames: 47860736. Throughput: 0: 13227.7. Samples: 47831501. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:06:31,106][213445] Avg episode reward: [(0, '4422.400')] [2023-03-07 15:06:31,127][213771] Updated weights for policy 0, policy_version 46740 (0.0005) [2023-03-07 15:06:31,901][213771] Updated weights for policy 0, policy_version 46750 (0.0006) [2023-03-07 15:06:32,686][213771] Updated weights for policy 0, policy_version 46760 (0.0007) [2023-03-07 15:06:33,449][213771] Updated weights for policy 0, policy_version 46770 (0.0006) [2023-03-07 15:06:34,217][213771] Updated weights for policy 0, policy_version 46780 (0.0006) [2023-03-07 15:06:34,989][213771] Updated weights for policy 0, policy_version 46790 (0.0006) [2023-03-07 15:06:35,764][213771] Updated weights for policy 0, policy_version 46800 (0.0007) [2023-03-07 15:06:36,105][213445] Fps is (10 sec: 13312.3, 60 sec: 13226.7, 300 sec: 13239.1). Total num frames: 47927296. Throughput: 0: 13232.9. Samples: 47910940. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:06:36,106][213445] Avg episode reward: [(0, '4512.960')] [2023-03-07 15:06:36,556][213771] Updated weights for policy 0, policy_version 46810 (0.0006) [2023-03-07 15:06:37,337][213771] Updated weights for policy 0, policy_version 46820 (0.0006) [2023-03-07 15:06:38,105][213771] Updated weights for policy 0, policy_version 46830 (0.0007) [2023-03-07 15:06:38,867][213771] Updated weights for policy 0, policy_version 46840 (0.0005) [2023-03-07 15:06:39,647][213771] Updated weights for policy 0, policy_version 46850 (0.0006) [2023-03-07 15:06:40,417][213771] Updated weights for policy 0, policy_version 46860 (0.0005) [2023-03-07 15:06:41,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13226.6, 300 sec: 13239.1). Total num frames: 47992832. Throughput: 0: 13234.4. Samples: 47990153. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:06:41,106][213445] Avg episode reward: [(0, '4454.199')] [2023-03-07 15:06:41,199][213771] Updated weights for policy 0, policy_version 46870 (0.0006) [2023-03-07 15:06:41,958][213771] Updated weights for policy 0, policy_version 46880 (0.0005) [2023-03-07 15:06:42,737][213771] Updated weights for policy 0, policy_version 46890 (0.0006) [2023-03-07 15:06:43,518][213771] Updated weights for policy 0, policy_version 46900 (0.0006) [2023-03-07 15:06:44,279][213771] Updated weights for policy 0, policy_version 46910 (0.0006) [2023-03-07 15:06:45,078][213771] Updated weights for policy 0, policy_version 46920 (0.0006) [2023-03-07 15:06:45,847][213771] Updated weights for policy 0, policy_version 46930 (0.0006) [2023-03-07 15:06:46,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13226.7, 300 sec: 13239.1). Total num frames: 48059392. Throughput: 0: 13234.5. Samples: 48029848. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:06:46,106][213445] Avg episode reward: [(0, '4443.644')] [2023-03-07 15:06:46,609][213771] Updated weights for policy 0, policy_version 46940 (0.0006) [2023-03-07 15:06:47,378][213771] Updated weights for policy 0, policy_version 46950 (0.0006) [2023-03-07 15:06:48,160][213771] Updated weights for policy 0, policy_version 46960 (0.0007) [2023-03-07 15:06:48,929][213771] Updated weights for policy 0, policy_version 46970 (0.0005) [2023-03-07 15:06:49,688][213771] Updated weights for policy 0, policy_version 46980 (0.0005) [2023-03-07 15:06:50,468][213771] Updated weights for policy 0, policy_version 46990 (0.0006) [2023-03-07 15:06:51,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13226.7, 300 sec: 13242.6). Total num frames: 48125952. Throughput: 0: 13236.5. Samples: 48109382. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:06:51,106][213445] Avg episode reward: [(0, '4432.758')] [2023-03-07 15:06:51,236][213771] Updated weights for policy 0, policy_version 47000 (0.0005) [2023-03-07 15:06:52,025][213771] Updated weights for policy 0, policy_version 47010 (0.0006) [2023-03-07 15:06:52,780][213771] Updated weights for policy 0, policy_version 47020 (0.0006) [2023-03-07 15:06:53,543][213771] Updated weights for policy 0, policy_version 47030 (0.0005) [2023-03-07 15:06:54,321][213771] Updated weights for policy 0, policy_version 47040 (0.0006) [2023-03-07 15:06:55,115][213771] Updated weights for policy 0, policy_version 47050 (0.0006) [2023-03-07 15:06:55,863][213771] Updated weights for policy 0, policy_version 47060 (0.0005) [2023-03-07 15:06:56,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13239.1). Total num frames: 48191488. Throughput: 0: 13242.3. Samples: 48189010. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:06:56,106][213445] Avg episode reward: [(0, '4383.670')] [2023-03-07 15:06:56,650][213771] Updated weights for policy 0, policy_version 47070 (0.0007) [2023-03-07 15:06:57,445][213771] Updated weights for policy 0, policy_version 47080 (0.0006) [2023-03-07 15:06:58,195][213771] Updated weights for policy 0, policy_version 47090 (0.0007) [2023-03-07 15:06:58,954][213771] Updated weights for policy 0, policy_version 47100 (0.0008) [2023-03-07 15:06:59,742][213771] Updated weights for policy 0, policy_version 47110 (0.0006) [2023-03-07 15:07:00,517][213771] Updated weights for policy 0, policy_version 47120 (0.0006) [2023-03-07 15:07:01,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13239.1). Total num frames: 48258048. Throughput: 0: 13240.2. Samples: 48228778. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:07:01,116][213445] Avg episode reward: [(0, '4370.729')] [2023-03-07 15:07:01,278][213771] Updated weights for policy 0, policy_version 47130 (0.0005) [2023-03-07 15:07:02,061][213771] Updated weights for policy 0, policy_version 47140 (0.0006) [2023-03-07 15:07:02,824][213771] Updated weights for policy 0, policy_version 47150 (0.0005) [2023-03-07 15:07:03,584][213771] Updated weights for policy 0, policy_version 47160 (0.0007) [2023-03-07 15:07:04,378][213771] Updated weights for policy 0, policy_version 47170 (0.0006) [2023-03-07 15:07:05,144][213771] Updated weights for policy 0, policy_version 47180 (0.0006) [2023-03-07 15:07:05,903][213771] Updated weights for policy 0, policy_version 47190 (0.0006) [2023-03-07 15:07:06,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13243.8, 300 sec: 13242.6). Total num frames: 48324608. Throughput: 0: 13248.9. Samples: 48308208. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:07:06,116][213445] Avg episode reward: [(0, '4375.062')] [2023-03-07 15:07:06,685][213771] Updated weights for policy 0, policy_version 47200 (0.0006) [2023-03-07 15:07:07,458][213771] Updated weights for policy 0, policy_version 47210 (0.0006) [2023-03-07 15:07:08,230][213771] Updated weights for policy 0, policy_version 47220 (0.0006) [2023-03-07 15:07:09,007][213771] Updated weights for policy 0, policy_version 47230 (0.0006) [2023-03-07 15:07:09,789][213771] Updated weights for policy 0, policy_version 47240 (0.0005) [2023-03-07 15:07:10,566][213771] Updated weights for policy 0, policy_version 47250 (0.0006) [2023-03-07 15:07:11,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13243.8, 300 sec: 13242.6). Total num frames: 48391168. Throughput: 0: 13240.7. Samples: 48387478. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:07:11,106][213445] Avg episode reward: [(0, '4334.215')] [2023-03-07 15:07:11,346][213771] Updated weights for policy 0, policy_version 47260 (0.0006) [2023-03-07 15:07:12,135][213771] Updated weights for policy 0, policy_version 47270 (0.0006) [2023-03-07 15:07:12,902][213771] Updated weights for policy 0, policy_version 47280 (0.0005) [2023-03-07 15:07:13,678][213771] Updated weights for policy 0, policy_version 47290 (0.0006) [2023-03-07 15:07:14,484][213771] Updated weights for policy 0, policy_version 47300 (0.0006) [2023-03-07 15:07:15,243][213771] Updated weights for policy 0, policy_version 47310 (0.0006) [2023-03-07 15:07:15,999][213771] Updated weights for policy 0, policy_version 47320 (0.0006) [2023-03-07 15:07:16,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.8, 300 sec: 13239.1). Total num frames: 48456704. Throughput: 0: 13234.0. Samples: 48427029. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:07:16,106][213445] Avg episode reward: [(0, '4350.381')] [2023-03-07 15:07:16,791][213771] Updated weights for policy 0, policy_version 47330 (0.0006) [2023-03-07 15:07:17,564][213771] Updated weights for policy 0, policy_version 47340 (0.0006) [2023-03-07 15:07:18,321][213771] Updated weights for policy 0, policy_version 47350 (0.0006) [2023-03-07 15:07:19,103][213771] Updated weights for policy 0, policy_version 47360 (0.0006) [2023-03-07 15:07:19,874][213771] Updated weights for policy 0, policy_version 47370 (0.0006) [2023-03-07 15:07:20,648][213771] Updated weights for policy 0, policy_version 47380 (0.0006) [2023-03-07 15:07:21,105][213445] Fps is (10 sec: 13107.2, 60 sec: 13226.7, 300 sec: 13235.6). Total num frames: 48522240. Throughput: 0: 13229.7. Samples: 48506274. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:07:21,105][213445] Avg episode reward: [(0, '4379.250')] [2023-03-07 15:07:21,421][213771] Updated weights for policy 0, policy_version 47390 (0.0006) [2023-03-07 15:07:22,198][213771] Updated weights for policy 0, policy_version 47400 (0.0006) [2023-03-07 15:07:22,985][213771] Updated weights for policy 0, policy_version 47410 (0.0006) [2023-03-07 15:07:23,754][213771] Updated weights for policy 0, policy_version 47420 (0.0006) [2023-03-07 15:07:24,536][213771] Updated weights for policy 0, policy_version 47430 (0.0005) [2023-03-07 15:07:25,323][213771] Updated weights for policy 0, policy_version 47440 (0.0006) [2023-03-07 15:07:26,087][213771] Updated weights for policy 0, policy_version 47450 (0.0006) [2023-03-07 15:07:26,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.8, 300 sec: 13239.1). Total num frames: 48588800. Throughput: 0: 13227.1. Samples: 48585372. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:07:26,106][213445] Avg episode reward: [(0, '4259.644')] [2023-03-07 15:07:26,857][213771] Updated weights for policy 0, policy_version 47460 (0.0006) [2023-03-07 15:07:27,642][213771] Updated weights for policy 0, policy_version 47470 (0.0005) [2023-03-07 15:07:28,415][213771] Updated weights for policy 0, policy_version 47480 (0.0006) [2023-03-07 15:07:29,183][213771] Updated weights for policy 0, policy_version 47490 (0.0005) [2023-03-07 15:07:29,965][213771] Updated weights for policy 0, policy_version 47500 (0.0006) [2023-03-07 15:07:30,737][213771] Updated weights for policy 0, policy_version 47510 (0.0006) [2023-03-07 15:07:31,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13226.7, 300 sec: 13235.6). Total num frames: 48654336. Throughput: 0: 13225.7. Samples: 48625006. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:07:31,106][213445] Avg episode reward: [(0, '4244.073')] [2023-03-07 15:07:31,500][213771] Updated weights for policy 0, policy_version 47520 (0.0006) [2023-03-07 15:07:32,274][213771] Updated weights for policy 0, policy_version 47530 (0.0006) [2023-03-07 15:07:33,054][213771] Updated weights for policy 0, policy_version 47540 (0.0006) [2023-03-07 15:07:33,811][213771] Updated weights for policy 0, policy_version 47550 (0.0007) [2023-03-07 15:07:34,599][213771] Updated weights for policy 0, policy_version 47560 (0.0007) [2023-03-07 15:07:35,375][213771] Updated weights for policy 0, policy_version 47570 (0.0006) [2023-03-07 15:07:36,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13239.1). Total num frames: 48720896. Throughput: 0: 13227.4. Samples: 48704613. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:07:36,106][213445] Avg episode reward: [(0, '4153.601')] [2023-03-07 15:07:36,140][213771] Updated weights for policy 0, policy_version 47580 (0.0007) [2023-03-07 15:07:36,933][213771] Updated weights for policy 0, policy_version 47590 (0.0007) [2023-03-07 15:07:37,690][213771] Updated weights for policy 0, policy_version 47600 (0.0006) [2023-03-07 15:07:38,469][213771] Updated weights for policy 0, policy_version 47610 (0.0006) [2023-03-07 15:07:39,240][213771] Updated weights for policy 0, policy_version 47620 (0.0006) [2023-03-07 15:07:40,026][213771] Updated weights for policy 0, policy_version 47630 (0.0006) [2023-03-07 15:07:40,790][213771] Updated weights for policy 0, policy_version 47640 (0.0005) [2023-03-07 15:07:41,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13235.6). Total num frames: 48786432. Throughput: 0: 13221.2. Samples: 48783963. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:07:41,106][213445] Avg episode reward: [(0, '4258.870')] [2023-03-07 15:07:41,569][213771] Updated weights for policy 0, policy_version 47650 (0.0006) [2023-03-07 15:07:42,341][213771] Updated weights for policy 0, policy_version 47660 (0.0006) [2023-03-07 15:07:43,097][213771] Updated weights for policy 0, policy_version 47670 (0.0005) [2023-03-07 15:07:43,878][213771] Updated weights for policy 0, policy_version 47680 (0.0006) [2023-03-07 15:07:44,663][213771] Updated weights for policy 0, policy_version 47690 (0.0006) [2023-03-07 15:07:45,428][213771] Updated weights for policy 0, policy_version 47700 (0.0006) [2023-03-07 15:07:46,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13235.6). Total num frames: 48852992. Throughput: 0: 13219.1. Samples: 48823637. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:07:46,106][213445] Avg episode reward: [(0, '4214.414')] [2023-03-07 15:07:46,201][213771] Updated weights for policy 0, policy_version 47710 (0.0007) [2023-03-07 15:07:46,977][213771] Updated weights for policy 0, policy_version 47720 (0.0006) [2023-03-07 15:07:47,741][213771] Updated weights for policy 0, policy_version 47730 (0.0006) [2023-03-07 15:07:48,509][213771] Updated weights for policy 0, policy_version 47740 (0.0006) [2023-03-07 15:07:49,274][213771] Updated weights for policy 0, policy_version 47750 (0.0006) [2023-03-07 15:07:50,048][213771] Updated weights for policy 0, policy_version 47760 (0.0006) [2023-03-07 15:07:50,829][213771] Updated weights for policy 0, policy_version 47770 (0.0006) [2023-03-07 15:07:51,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13226.7, 300 sec: 13235.6). Total num frames: 48919552. Throughput: 0: 13223.6. Samples: 48903269. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:07:51,106][213445] Avg episode reward: [(0, '4261.834')] [2023-03-07 15:07:51,606][213771] Updated weights for policy 0, policy_version 47780 (0.0006) [2023-03-07 15:07:52,381][213771] Updated weights for policy 0, policy_version 47790 (0.0007) [2023-03-07 15:07:53,137][213771] Updated weights for policy 0, policy_version 47800 (0.0006) [2023-03-07 15:07:53,921][213771] Updated weights for policy 0, policy_version 47810 (0.0006) [2023-03-07 15:07:54,684][213771] Updated weights for policy 0, policy_version 47820 (0.0006) [2023-03-07 15:07:55,475][213771] Updated weights for policy 0, policy_version 47830 (0.0006) [2023-03-07 15:07:56,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13243.7, 300 sec: 13239.1). Total num frames: 48986112. Throughput: 0: 13226.8. Samples: 48982687. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:07:56,105][213445] Avg episode reward: [(0, '4341.247')] [2023-03-07 15:07:56,258][213771] Updated weights for policy 0, policy_version 47840 (0.0006) [2023-03-07 15:07:57,013][213771] Updated weights for policy 0, policy_version 47850 (0.0005) [2023-03-07 15:07:57,768][213771] Updated weights for policy 0, policy_version 47860 (0.0006) [2023-03-07 15:07:58,556][213771] Updated weights for policy 0, policy_version 47870 (0.0006) [2023-03-07 15:07:59,330][213771] Updated weights for policy 0, policy_version 47880 (0.0006) [2023-03-07 15:08:00,094][213771] Updated weights for policy 0, policy_version 47890 (0.0006) [2023-03-07 15:08:00,859][213771] Updated weights for policy 0, policy_version 47900 (0.0006) [2023-03-07 15:08:01,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13243.7, 300 sec: 13239.1). Total num frames: 49052672. Throughput: 0: 13234.1. Samples: 49022561. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:08:01,105][213445] Avg episode reward: [(0, '4317.080')] [2023-03-07 15:08:01,638][213771] Updated weights for policy 0, policy_version 47910 (0.0006) [2023-03-07 15:08:02,418][213771] Updated weights for policy 0, policy_version 47920 (0.0006) [2023-03-07 15:08:03,189][213771] Updated weights for policy 0, policy_version 47930 (0.0006) [2023-03-07 15:08:03,962][213771] Updated weights for policy 0, policy_version 47940 (0.0006) [2023-03-07 15:08:04,722][213771] Updated weights for policy 0, policy_version 47950 (0.0006) [2023-03-07 15:08:05,519][213771] Updated weights for policy 0, policy_version 47960 (0.0006) [2023-03-07 15:08:06,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13235.6). Total num frames: 49118208. Throughput: 0: 13243.0. Samples: 49102209. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:08:06,109][213445] Avg episode reward: [(0, '4244.080')] [2023-03-07 15:08:06,114][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000047967_49118208.pth... [2023-03-07 15:08:06,143][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000044867_45943808.pth [2023-03-07 15:08:06,308][213771] Updated weights for policy 0, policy_version 47970 (0.0007) [2023-03-07 15:08:07,080][213771] Updated weights for policy 0, policy_version 47980 (0.0007) [2023-03-07 15:08:07,864][213771] Updated weights for policy 0, policy_version 47990 (0.0005) [2023-03-07 15:08:08,633][213771] Updated weights for policy 0, policy_version 48000 (0.0007) [2023-03-07 15:08:09,414][213771] Updated weights for policy 0, policy_version 48010 (0.0006) [2023-03-07 15:08:10,180][213771] Updated weights for policy 0, policy_version 48020 (0.0005) [2023-03-07 15:08:10,965][213771] Updated weights for policy 0, policy_version 48030 (0.0006) [2023-03-07 15:08:11,105][213445] Fps is (10 sec: 13107.1, 60 sec: 13209.6, 300 sec: 13232.2). Total num frames: 49183744. Throughput: 0: 13237.9. Samples: 49181077. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:08:11,116][213445] Avg episode reward: [(0, '4261.344')] [2023-03-07 15:08:11,730][213771] Updated weights for policy 0, policy_version 48040 (0.0007) [2023-03-07 15:08:12,501][213771] Updated weights for policy 0, policy_version 48050 (0.0006) [2023-03-07 15:08:13,275][213771] Updated weights for policy 0, policy_version 48060 (0.0006) [2023-03-07 15:08:14,046][213771] Updated weights for policy 0, policy_version 48070 (0.0006) [2023-03-07 15:08:14,812][213771] Updated weights for policy 0, policy_version 48080 (0.0006) [2023-03-07 15:08:15,588][213771] Updated weights for policy 0, policy_version 48090 (0.0006) [2023-03-07 15:08:16,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13232.2). Total num frames: 49250304. Throughput: 0: 13243.1. Samples: 49220943. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:08:16,116][213445] Avg episode reward: [(0, '4251.550')] [2023-03-07 15:08:16,368][213771] Updated weights for policy 0, policy_version 48100 (0.0006) [2023-03-07 15:08:17,142][213771] Updated weights for policy 0, policy_version 48110 (0.0006) [2023-03-07 15:08:17,911][213771] Updated weights for policy 0, policy_version 48120 (0.0006) [2023-03-07 15:08:18,696][213771] Updated weights for policy 0, policy_version 48130 (0.0006) [2023-03-07 15:08:19,471][213771] Updated weights for policy 0, policy_version 48140 (0.0006) [2023-03-07 15:08:20,244][213771] Updated weights for policy 0, policy_version 48150 (0.0006) [2023-03-07 15:08:21,012][213771] Updated weights for policy 0, policy_version 48160 (0.0006) [2023-03-07 15:08:21,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13243.7, 300 sec: 13235.6). Total num frames: 49316864. Throughput: 0: 13238.1. Samples: 49300326. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:08:21,116][213445] Avg episode reward: [(0, '4324.626')] [2023-03-07 15:08:21,797][213771] Updated weights for policy 0, policy_version 48170 (0.0006) [2023-03-07 15:08:22,576][213771] Updated weights for policy 0, policy_version 48180 (0.0006) [2023-03-07 15:08:23,342][213771] Updated weights for policy 0, policy_version 48190 (0.0006) [2023-03-07 15:08:24,122][213771] Updated weights for policy 0, policy_version 48200 (0.0006) [2023-03-07 15:08:24,909][213771] Updated weights for policy 0, policy_version 48210 (0.0006) [2023-03-07 15:08:25,685][213771] Updated weights for policy 0, policy_version 48220 (0.0006) [2023-03-07 15:08:26,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13226.7, 300 sec: 13232.2). Total num frames: 49382400. Throughput: 0: 13232.2. Samples: 49379412. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:08:26,116][213445] Avg episode reward: [(0, '4293.167')] [2023-03-07 15:08:26,442][213771] Updated weights for policy 0, policy_version 48230 (0.0006) [2023-03-07 15:08:27,212][213771] Updated weights for policy 0, policy_version 48240 (0.0006) [2023-03-07 15:08:27,963][213771] Updated weights for policy 0, policy_version 48250 (0.0005) [2023-03-07 15:08:28,730][213771] Updated weights for policy 0, policy_version 48260 (0.0007) [2023-03-07 15:08:29,509][213771] Updated weights for policy 0, policy_version 48270 (0.0006) [2023-03-07 15:08:30,278][213771] Updated weights for policy 0, policy_version 48280 (0.0006) [2023-03-07 15:08:31,059][213771] Updated weights for policy 0, policy_version 48290 (0.0007) [2023-03-07 15:08:31,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13232.2). Total num frames: 49448960. Throughput: 0: 13243.4. Samples: 49419591. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:08:31,116][213445] Avg episode reward: [(0, '4121.943')] [2023-03-07 15:08:31,826][213771] Updated weights for policy 0, policy_version 48300 (0.0006) [2023-03-07 15:08:32,607][213771] Updated weights for policy 0, policy_version 48310 (0.0006) [2023-03-07 15:08:33,378][213771] Updated weights for policy 0, policy_version 48320 (0.0006) [2023-03-07 15:08:34,153][213771] Updated weights for policy 0, policy_version 48330 (0.0006) [2023-03-07 15:08:34,933][213771] Updated weights for policy 0, policy_version 48340 (0.0006) [2023-03-07 15:08:35,701][213771] Updated weights for policy 0, policy_version 48350 (0.0006) [2023-03-07 15:08:36,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13243.7, 300 sec: 13232.2). Total num frames: 49515520. Throughput: 0: 13237.9. Samples: 49498973. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:08:36,116][213445] Avg episode reward: [(0, '4154.908')] [2023-03-07 15:08:36,485][213771] Updated weights for policy 0, policy_version 48360 (0.0007) [2023-03-07 15:08:37,266][213771] Updated weights for policy 0, policy_version 48370 (0.0006) [2023-03-07 15:08:38,040][213771] Updated weights for policy 0, policy_version 48380 (0.0006) [2023-03-07 15:08:38,834][213771] Updated weights for policy 0, policy_version 48390 (0.0006) [2023-03-07 15:08:39,591][213771] Updated weights for policy 0, policy_version 48400 (0.0005) [2023-03-07 15:08:40,350][213771] Updated weights for policy 0, policy_version 48410 (0.0006) [2023-03-07 15:08:41,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13232.2). Total num frames: 49581056. Throughput: 0: 13230.9. Samples: 49578078. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:08:41,116][213445] Avg episode reward: [(0, '4159.139')] [2023-03-07 15:08:41,134][213771] Updated weights for policy 0, policy_version 48420 (0.0006) [2023-03-07 15:08:41,918][213771] Updated weights for policy 0, policy_version 48430 (0.0006) [2023-03-07 15:08:42,676][213771] Updated weights for policy 0, policy_version 48440 (0.0006) [2023-03-07 15:08:43,446][213771] Updated weights for policy 0, policy_version 48450 (0.0005) [2023-03-07 15:08:44,211][213771] Updated weights for policy 0, policy_version 48460 (0.0006) [2023-03-07 15:08:44,989][213771] Updated weights for policy 0, policy_version 48470 (0.0006) [2023-03-07 15:08:45,760][213771] Updated weights for policy 0, policy_version 48480 (0.0007) [2023-03-07 15:08:46,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13232.2). Total num frames: 49647616. Throughput: 0: 13230.2. Samples: 49617922. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:08:46,116][213445] Avg episode reward: [(0, '4251.838')] [2023-03-07 15:08:46,533][213771] Updated weights for policy 0, policy_version 48490 (0.0005) [2023-03-07 15:08:47,301][213771] Updated weights for policy 0, policy_version 48500 (0.0006) [2023-03-07 15:08:48,078][213771] Updated weights for policy 0, policy_version 48510 (0.0006) [2023-03-07 15:08:48,853][213771] Updated weights for policy 0, policy_version 48520 (0.0006) [2023-03-07 15:08:49,610][213771] Updated weights for policy 0, policy_version 48530 (0.0007) [2023-03-07 15:08:50,382][213771] Updated weights for policy 0, policy_version 48540 (0.0006) [2023-03-07 15:08:51,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13243.7, 300 sec: 13232.2). Total num frames: 49714176. Throughput: 0: 13228.9. Samples: 49697512. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:08:51,116][213445] Avg episode reward: [(0, '4246.351')] [2023-03-07 15:08:51,153][213771] Updated weights for policy 0, policy_version 48550 (0.0006) [2023-03-07 15:08:51,926][213771] Updated weights for policy 0, policy_version 48560 (0.0005) [2023-03-07 15:08:52,706][213771] Updated weights for policy 0, policy_version 48570 (0.0007) [2023-03-07 15:08:53,461][213771] Updated weights for policy 0, policy_version 48580 (0.0005) [2023-03-07 15:08:54,225][213771] Updated weights for policy 0, policy_version 48590 (0.0005) [2023-03-07 15:08:55,017][213771] Updated weights for policy 0, policy_version 48600 (0.0006) [2023-03-07 15:08:55,791][213771] Updated weights for policy 0, policy_version 48610 (0.0006) [2023-03-07 15:08:56,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13243.7, 300 sec: 13232.2). Total num frames: 49780736. Throughput: 0: 13249.3. Samples: 49777295. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:08:56,116][213445] Avg episode reward: [(0, '4272.005')] [2023-03-07 15:08:56,563][213771] Updated weights for policy 0, policy_version 48620 (0.0006) [2023-03-07 15:08:57,342][213771] Updated weights for policy 0, policy_version 48630 (0.0006) [2023-03-07 15:08:58,122][213771] Updated weights for policy 0, policy_version 48640 (0.0006) [2023-03-07 15:08:58,880][213771] Updated weights for policy 0, policy_version 48650 (0.0006) [2023-03-07 15:08:59,650][213771] Updated weights for policy 0, policy_version 48660 (0.0006) [2023-03-07 15:09:00,426][213771] Updated weights for policy 0, policy_version 48670 (0.0007) [2023-03-07 15:09:01,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13226.6, 300 sec: 13228.7). Total num frames: 49846272. Throughput: 0: 13243.7. Samples: 49816911. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:09:01,106][213445] Avg episode reward: [(0, '4350.293')] [2023-03-07 15:09:01,204][213771] Updated weights for policy 0, policy_version 48680 (0.0006) [2023-03-07 15:09:01,981][213771] Updated weights for policy 0, policy_version 48690 (0.0006) [2023-03-07 15:09:02,756][213771] Updated weights for policy 0, policy_version 48700 (0.0006) [2023-03-07 15:09:03,530][213771] Updated weights for policy 0, policy_version 48710 (0.0006) [2023-03-07 15:09:04,297][213771] Updated weights for policy 0, policy_version 48720 (0.0006) [2023-03-07 15:09:05,076][213771] Updated weights for policy 0, policy_version 48730 (0.0006) [2023-03-07 15:09:05,838][213771] Updated weights for policy 0, policy_version 48740 (0.0005) [2023-03-07 15:09:06,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13232.2). Total num frames: 49912832. Throughput: 0: 13242.1. Samples: 49896221. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:09:06,106][213445] Avg episode reward: [(0, '4337.933')] [2023-03-07 15:09:06,622][213771] Updated weights for policy 0, policy_version 48750 (0.0006) [2023-03-07 15:09:07,401][213771] Updated weights for policy 0, policy_version 48760 (0.0006) [2023-03-07 15:09:08,182][213771] Updated weights for policy 0, policy_version 48770 (0.0007) [2023-03-07 15:09:08,955][213771] Updated weights for policy 0, policy_version 48780 (0.0006) [2023-03-07 15:09:09,734][213771] Updated weights for policy 0, policy_version 48790 (0.0006) [2023-03-07 15:09:10,506][213771] Updated weights for policy 0, policy_version 48800 (0.0007) [2023-03-07 15:09:11,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13228.7). Total num frames: 49978368. Throughput: 0: 13248.3. Samples: 49975584. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:09:11,106][213445] Avg episode reward: [(0, '4267.332')] [2023-03-07 15:09:11,272][213771] Updated weights for policy 0, policy_version 48810 (0.0006) [2023-03-07 15:09:12,058][213771] Updated weights for policy 0, policy_version 48820 (0.0006) [2023-03-07 15:09:12,830][213771] Updated weights for policy 0, policy_version 48830 (0.0006) [2023-03-07 15:09:13,602][213771] Updated weights for policy 0, policy_version 48840 (0.0005) [2023-03-07 15:09:14,374][213771] Updated weights for policy 0, policy_version 48850 (0.0006) [2023-03-07 15:09:15,140][213771] Updated weights for policy 0, policy_version 48860 (0.0006) [2023-03-07 15:09:15,904][213771] Updated weights for policy 0, policy_version 48870 (0.0006) [2023-03-07 15:09:16,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13232.2). Total num frames: 50044928. Throughput: 0: 13236.0. Samples: 50015209. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:09:16,106][213445] Avg episode reward: [(0, '4294.472')] [2023-03-07 15:09:16,686][213771] Updated weights for policy 0, policy_version 48880 (0.0005) [2023-03-07 15:09:17,470][213771] Updated weights for policy 0, policy_version 48890 (0.0007) [2023-03-07 15:09:18,237][213771] Updated weights for policy 0, policy_version 48900 (0.0007) [2023-03-07 15:09:19,014][213771] Updated weights for policy 0, policy_version 48910 (0.0006) [2023-03-07 15:09:19,800][213771] Updated weights for policy 0, policy_version 48920 (0.0007) [2023-03-07 15:09:20,575][213771] Updated weights for policy 0, policy_version 48930 (0.0007) [2023-03-07 15:09:21,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13232.2). Total num frames: 50110464. Throughput: 0: 13235.1. Samples: 50094551. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:09:21,106][213445] Avg episode reward: [(0, '4197.140')] [2023-03-07 15:09:21,345][213771] Updated weights for policy 0, policy_version 48940 (0.0006) [2023-03-07 15:09:22,133][213771] Updated weights for policy 0, policy_version 48950 (0.0008) [2023-03-07 15:09:22,900][213771] Updated weights for policy 0, policy_version 48960 (0.0006) [2023-03-07 15:09:23,706][213771] Updated weights for policy 0, policy_version 48970 (0.0006) [2023-03-07 15:09:24,457][213771] Updated weights for policy 0, policy_version 48980 (0.0007) [2023-03-07 15:09:25,239][213771] Updated weights for policy 0, policy_version 48990 (0.0006) [2023-03-07 15:09:26,008][213771] Updated weights for policy 0, policy_version 49000 (0.0006) [2023-03-07 15:09:26,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13232.2). Total num frames: 50177024. Throughput: 0: 13235.4. Samples: 50173670. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:09:26,106][213445] Avg episode reward: [(0, '4273.274')] [2023-03-07 15:09:26,788][213771] Updated weights for policy 0, policy_version 49010 (0.0007) [2023-03-07 15:09:27,541][213771] Updated weights for policy 0, policy_version 49020 (0.0007) [2023-03-07 15:09:28,314][213771] Updated weights for policy 0, policy_version 49030 (0.0005) [2023-03-07 15:09:29,094][213771] Updated weights for policy 0, policy_version 49040 (0.0006) [2023-03-07 15:09:29,875][213771] Updated weights for policy 0, policy_version 49050 (0.0005) [2023-03-07 15:09:30,638][213771] Updated weights for policy 0, policy_version 49060 (0.0006) [2023-03-07 15:09:31,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13228.7). Total num frames: 50242560. Throughput: 0: 13233.5. Samples: 50213430. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:09:31,106][213445] Avg episode reward: [(0, '4270.345')] [2023-03-07 15:09:31,417][213771] Updated weights for policy 0, policy_version 49070 (0.0006) [2023-03-07 15:09:32,195][213771] Updated weights for policy 0, policy_version 49080 (0.0006) [2023-03-07 15:09:32,975][213771] Updated weights for policy 0, policy_version 49090 (0.0006) [2023-03-07 15:09:33,744][213771] Updated weights for policy 0, policy_version 49100 (0.0006) [2023-03-07 15:09:34,531][213771] Updated weights for policy 0, policy_version 49110 (0.0006) [2023-03-07 15:09:35,290][213771] Updated weights for policy 0, policy_version 49120 (0.0006) [2023-03-07 15:09:36,087][213771] Updated weights for policy 0, policy_version 49130 (0.0006) [2023-03-07 15:09:36,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13226.7, 300 sec: 13228.7). Total num frames: 50309120. Throughput: 0: 13227.5. Samples: 50292749. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:09:36,106][213445] Avg episode reward: [(0, '4282.804')] [2023-03-07 15:09:36,865][213771] Updated weights for policy 0, policy_version 49140 (0.0006) [2023-03-07 15:09:37,626][213771] Updated weights for policy 0, policy_version 49150 (0.0006) [2023-03-07 15:09:38,430][213771] Updated weights for policy 0, policy_version 49160 (0.0006) [2023-03-07 15:09:39,192][213771] Updated weights for policy 0, policy_version 49170 (0.0006) [2023-03-07 15:09:39,958][213771] Updated weights for policy 0, policy_version 49180 (0.0006) [2023-03-07 15:09:40,745][213771] Updated weights for policy 0, policy_version 49190 (0.0006) [2023-03-07 15:09:41,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13228.7). Total num frames: 50374656. Throughput: 0: 13214.6. Samples: 50371951. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:09:41,116][213445] Avg episode reward: [(0, '4330.086')] [2023-03-07 15:09:41,517][213771] Updated weights for policy 0, policy_version 49200 (0.0006) [2023-03-07 15:09:42,291][213771] Updated weights for policy 0, policy_version 49210 (0.0006) [2023-03-07 15:09:43,072][213771] Updated weights for policy 0, policy_version 49220 (0.0007) [2023-03-07 15:09:43,848][213771] Updated weights for policy 0, policy_version 49230 (0.0006) [2023-03-07 15:09:44,631][213771] Updated weights for policy 0, policy_version 49240 (0.0006) [2023-03-07 15:09:45,397][213771] Updated weights for policy 0, policy_version 49250 (0.0006) [2023-03-07 15:09:46,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13232.2). Total num frames: 50441216. Throughput: 0: 13211.1. Samples: 50411411. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:09:46,116][213445] Avg episode reward: [(0, '4411.720')] [2023-03-07 15:09:46,167][213771] Updated weights for policy 0, policy_version 49260 (0.0006) [2023-03-07 15:09:46,919][213771] Updated weights for policy 0, policy_version 49270 (0.0006) [2023-03-07 15:09:47,702][213771] Updated weights for policy 0, policy_version 49280 (0.0006) [2023-03-07 15:09:48,502][213771] Updated weights for policy 0, policy_version 49290 (0.0006) [2023-03-07 15:09:49,256][213771] Updated weights for policy 0, policy_version 49300 (0.0006) [2023-03-07 15:09:50,028][213771] Updated weights for policy 0, policy_version 49310 (0.0006) [2023-03-07 15:09:50,805][213771] Updated weights for policy 0, policy_version 49320 (0.0007) [2023-03-07 15:09:51,105][213445] Fps is (10 sec: 13209.3, 60 sec: 13209.6, 300 sec: 13232.2). Total num frames: 50506752. Throughput: 0: 13213.3. Samples: 50490821. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:09:51,116][213445] Avg episode reward: [(0, '4461.891')] [2023-03-07 15:09:51,594][213771] Updated weights for policy 0, policy_version 49330 (0.0006) [2023-03-07 15:09:52,356][213771] Updated weights for policy 0, policy_version 49340 (0.0006) [2023-03-07 15:09:53,138][213771] Updated weights for policy 0, policy_version 49350 (0.0007) [2023-03-07 15:09:53,889][213771] Updated weights for policy 0, policy_version 49360 (0.0007) [2023-03-07 15:09:54,657][213771] Updated weights for policy 0, policy_version 49370 (0.0007) [2023-03-07 15:09:55,437][213771] Updated weights for policy 0, policy_version 49380 (0.0005) [2023-03-07 15:09:56,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13209.6, 300 sec: 13232.2). Total num frames: 50573312. Throughput: 0: 13215.7. Samples: 50570288. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:09:56,116][213445] Avg episode reward: [(0, '4395.874')] [2023-03-07 15:09:56,241][213771] Updated weights for policy 0, policy_version 49390 (0.0006) [2023-03-07 15:09:56,998][213771] Updated weights for policy 0, policy_version 49400 (0.0006) [2023-03-07 15:09:57,770][213771] Updated weights for policy 0, policy_version 49410 (0.0006) [2023-03-07 15:09:58,543][213771] Updated weights for policy 0, policy_version 49420 (0.0007) [2023-03-07 15:09:59,322][213771] Updated weights for policy 0, policy_version 49430 (0.0006) [2023-03-07 15:10:00,088][213771] Updated weights for policy 0, policy_version 49440 (0.0006) [2023-03-07 15:10:00,874][213771] Updated weights for policy 0, policy_version 49450 (0.0006) [2023-03-07 15:10:01,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13226.7, 300 sec: 13232.2). Total num frames: 50639872. Throughput: 0: 13217.1. Samples: 50609980. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:10:01,116][213445] Avg episode reward: [(0, '4415.374')] [2023-03-07 15:10:01,648][213771] Updated weights for policy 0, policy_version 49460 (0.0006) [2023-03-07 15:10:02,417][213771] Updated weights for policy 0, policy_version 49470 (0.0006) [2023-03-07 15:10:03,192][213771] Updated weights for policy 0, policy_version 49480 (0.0007) [2023-03-07 15:10:03,949][213771] Updated weights for policy 0, policy_version 49490 (0.0006) [2023-03-07 15:10:04,709][213771] Updated weights for policy 0, policy_version 49500 (0.0006) [2023-03-07 15:10:05,499][213771] Updated weights for policy 0, policy_version 49510 (0.0006) [2023-03-07 15:10:06,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13209.6, 300 sec: 13232.2). Total num frames: 50705408. Throughput: 0: 13222.5. Samples: 50689565. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:10:06,116][213445] Avg episode reward: [(0, '4405.963')] [2023-03-07 15:10:06,122][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000049518_50706432.pth... [2023-03-07 15:10:06,152][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000046416_47529984.pth [2023-03-07 15:10:06,263][213771] Updated weights for policy 0, policy_version 49520 (0.0006) [2023-03-07 15:10:07,045][213771] Updated weights for policy 0, policy_version 49530 (0.0006) [2023-03-07 15:10:07,814][213771] Updated weights for policy 0, policy_version 49540 (0.0006) [2023-03-07 15:10:08,578][213771] Updated weights for policy 0, policy_version 49550 (0.0006) [2023-03-07 15:10:09,358][213771] Updated weights for policy 0, policy_version 49560 (0.0005) [2023-03-07 15:10:10,147][213771] Updated weights for policy 0, policy_version 49570 (0.0006) [2023-03-07 15:10:10,914][213771] Updated weights for policy 0, policy_version 49580 (0.0005) [2023-03-07 15:10:11,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13226.7, 300 sec: 13232.2). Total num frames: 50771968. Throughput: 0: 13228.4. Samples: 50768949. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:10:11,116][213445] Avg episode reward: [(0, '4383.273')] [2023-03-07 15:10:11,695][213771] Updated weights for policy 0, policy_version 49590 (0.0006) [2023-03-07 15:10:12,446][213771] Updated weights for policy 0, policy_version 49600 (0.0006) [2023-03-07 15:10:13,223][213771] Updated weights for policy 0, policy_version 49610 (0.0006) [2023-03-07 15:10:13,986][213771] Updated weights for policy 0, policy_version 49620 (0.0005) [2023-03-07 15:10:14,737][213771] Updated weights for policy 0, policy_version 49630 (0.0007) [2023-03-07 15:10:15,534][213771] Updated weights for policy 0, policy_version 49640 (0.0006) [2023-03-07 15:10:16,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13226.6, 300 sec: 13232.2). Total num frames: 50838528. Throughput: 0: 13231.9. Samples: 50808867. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:10:16,116][213445] Avg episode reward: [(0, '4439.751')] [2023-03-07 15:10:16,304][213771] Updated weights for policy 0, policy_version 49650 (0.0006) [2023-03-07 15:10:17,079][213771] Updated weights for policy 0, policy_version 49660 (0.0008) [2023-03-07 15:10:17,857][213771] Updated weights for policy 0, policy_version 49670 (0.0007) [2023-03-07 15:10:18,639][213771] Updated weights for policy 0, policy_version 49680 (0.0006) [2023-03-07 15:10:19,413][213771] Updated weights for policy 0, policy_version 49690 (0.0006) [2023-03-07 15:10:20,196][213771] Updated weights for policy 0, policy_version 49700 (0.0007) [2023-03-07 15:10:20,972][213771] Updated weights for policy 0, policy_version 49710 (0.0007) [2023-03-07 15:10:21,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13232.2). Total num frames: 50904064. Throughput: 0: 13230.0. Samples: 50888099. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:10:21,116][213445] Avg episode reward: [(0, '4451.488')] [2023-03-07 15:10:21,746][213771] Updated weights for policy 0, policy_version 49720 (0.0006) [2023-03-07 15:10:22,532][213771] Updated weights for policy 0, policy_version 49730 (0.0007) [2023-03-07 15:10:23,306][213771] Updated weights for policy 0, policy_version 49740 (0.0006) [2023-03-07 15:10:24,100][213771] Updated weights for policy 0, policy_version 49750 (0.0007) [2023-03-07 15:10:24,857][213771] Updated weights for policy 0, policy_version 49760 (0.0006) [2023-03-07 15:10:25,634][213771] Updated weights for policy 0, policy_version 49770 (0.0006) [2023-03-07 15:10:26,105][213445] Fps is (10 sec: 13209.9, 60 sec: 13226.7, 300 sec: 13232.2). Total num frames: 50970624. Throughput: 0: 13230.0. Samples: 50967300. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:10:26,116][213445] Avg episode reward: [(0, '4412.134')] [2023-03-07 15:10:26,388][213771] Updated weights for policy 0, policy_version 49780 (0.0006) [2023-03-07 15:10:27,174][213771] Updated weights for policy 0, policy_version 49790 (0.0005) [2023-03-07 15:10:27,952][213771] Updated weights for policy 0, policy_version 49800 (0.0005) [2023-03-07 15:10:28,729][213771] Updated weights for policy 0, policy_version 49810 (0.0007) [2023-03-07 15:10:29,490][213771] Updated weights for policy 0, policy_version 49820 (0.0006) [2023-03-07 15:10:30,265][213771] Updated weights for policy 0, policy_version 49830 (0.0006) [2023-03-07 15:10:31,045][213771] Updated weights for policy 0, policy_version 49840 (0.0006) [2023-03-07 15:10:31,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13228.7). Total num frames: 51036160. Throughput: 0: 13231.2. Samples: 51006816. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:10:31,116][213445] Avg episode reward: [(0, '4300.017')] [2023-03-07 15:10:31,828][213771] Updated weights for policy 0, policy_version 49850 (0.0007) [2023-03-07 15:10:32,609][213771] Updated weights for policy 0, policy_version 49860 (0.0006) [2023-03-07 15:10:33,383][213771] Updated weights for policy 0, policy_version 49870 (0.0006) [2023-03-07 15:10:34,153][213771] Updated weights for policy 0, policy_version 49880 (0.0005) [2023-03-07 15:10:34,941][213771] Updated weights for policy 0, policy_version 49890 (0.0006) [2023-03-07 15:10:35,697][213771] Updated weights for policy 0, policy_version 49900 (0.0006) [2023-03-07 15:10:36,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13232.2). Total num frames: 51102720. Throughput: 0: 13227.6. Samples: 51086062. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 15:10:36,116][213445] Avg episode reward: [(0, '4422.534')] [2023-03-07 15:10:36,472][213771] Updated weights for policy 0, policy_version 49910 (0.0007) [2023-03-07 15:10:37,245][213771] Updated weights for policy 0, policy_version 49920 (0.0006) [2023-03-07 15:10:38,010][213771] Updated weights for policy 0, policy_version 49930 (0.0006) [2023-03-07 15:10:38,778][213771] Updated weights for policy 0, policy_version 49940 (0.0006) [2023-03-07 15:10:39,555][213771] Updated weights for policy 0, policy_version 49950 (0.0006) [2023-03-07 15:10:40,358][213771] Updated weights for policy 0, policy_version 49960 (0.0006) [2023-03-07 15:10:41,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13228.7). Total num frames: 51168256. Throughput: 0: 13226.9. Samples: 51165497. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 15:10:41,106][213445] Avg episode reward: [(0, '4416.529')] [2023-03-07 15:10:41,117][213771] Updated weights for policy 0, policy_version 49970 (0.0006) [2023-03-07 15:10:41,884][213771] Updated weights for policy 0, policy_version 49980 (0.0006) [2023-03-07 15:10:42,655][213771] Updated weights for policy 0, policy_version 49990 (0.0007) [2023-03-07 15:10:43,432][213771] Updated weights for policy 0, policy_version 50000 (0.0007) [2023-03-07 15:10:44,196][213771] Updated weights for policy 0, policy_version 50010 (0.0009) [2023-03-07 15:10:44,966][213771] Updated weights for policy 0, policy_version 50020 (0.0006) [2023-03-07 15:10:45,740][213771] Updated weights for policy 0, policy_version 50030 (0.0006) [2023-03-07 15:10:46,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13226.6, 300 sec: 13228.7). Total num frames: 51234816. Throughput: 0: 13230.1. Samples: 51205335. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 15:10:46,106][213445] Avg episode reward: [(0, '4359.938')] [2023-03-07 15:10:46,518][213771] Updated weights for policy 0, policy_version 50040 (0.0006) [2023-03-07 15:10:47,289][213771] Updated weights for policy 0, policy_version 50050 (0.0007) [2023-03-07 15:10:48,053][213771] Updated weights for policy 0, policy_version 50060 (0.0006) [2023-03-07 15:10:48,842][213771] Updated weights for policy 0, policy_version 50070 (0.0006) [2023-03-07 15:10:49,611][213771] Updated weights for policy 0, policy_version 50080 (0.0006) [2023-03-07 15:10:50,387][213771] Updated weights for policy 0, policy_version 50090 (0.0005) [2023-03-07 15:10:51,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13243.7, 300 sec: 13232.2). Total num frames: 51301376. Throughput: 0: 13228.0. Samples: 51284826. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 15:10:51,106][213445] Avg episode reward: [(0, '4332.794')] [2023-03-07 15:10:51,156][213771] Updated weights for policy 0, policy_version 50100 (0.0007) [2023-03-07 15:10:51,930][213771] Updated weights for policy 0, policy_version 50110 (0.0007) [2023-03-07 15:10:52,696][213771] Updated weights for policy 0, policy_version 50120 (0.0007) [2023-03-07 15:10:53,458][213771] Updated weights for policy 0, policy_version 50130 (0.0006) [2023-03-07 15:10:54,238][213771] Updated weights for policy 0, policy_version 50140 (0.0006) [2023-03-07 15:10:55,015][213771] Updated weights for policy 0, policy_version 50150 (0.0007) [2023-03-07 15:10:55,796][213771] Updated weights for policy 0, policy_version 50160 (0.0006) [2023-03-07 15:10:56,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13226.6, 300 sec: 13228.7). Total num frames: 51366912. Throughput: 0: 13228.5. Samples: 51364234. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 15:10:56,106][213445] Avg episode reward: [(0, '4366.666')] [2023-03-07 15:10:56,581][213771] Updated weights for policy 0, policy_version 50170 (0.0006) [2023-03-07 15:10:57,348][213771] Updated weights for policy 0, policy_version 50180 (0.0006) [2023-03-07 15:10:58,114][213771] Updated weights for policy 0, policy_version 50190 (0.0006) [2023-03-07 15:10:58,894][213771] Updated weights for policy 0, policy_version 50200 (0.0005) [2023-03-07 15:10:59,657][213771] Updated weights for policy 0, policy_version 50210 (0.0005) [2023-03-07 15:11:00,443][213771] Updated weights for policy 0, policy_version 50220 (0.0006) [2023-03-07 15:11:01,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13226.7, 300 sec: 13232.2). Total num frames: 51433472. Throughput: 0: 13222.6. Samples: 51403882. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 15:11:01,105][213445] Avg episode reward: [(0, '4434.378')] [2023-03-07 15:11:01,220][213771] Updated weights for policy 0, policy_version 50230 (0.0006) [2023-03-07 15:11:01,981][213771] Updated weights for policy 0, policy_version 50240 (0.0006) [2023-03-07 15:11:02,747][213771] Updated weights for policy 0, policy_version 50250 (0.0006) [2023-03-07 15:11:03,520][213771] Updated weights for policy 0, policy_version 50260 (0.0006) [2023-03-07 15:11:04,308][213771] Updated weights for policy 0, policy_version 50270 (0.0006) [2023-03-07 15:11:05,076][213771] Updated weights for policy 0, policy_version 50280 (0.0006) [2023-03-07 15:11:05,849][213771] Updated weights for policy 0, policy_version 50290 (0.0007) [2023-03-07 15:11:06,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13243.7, 300 sec: 13232.2). Total num frames: 51500032. Throughput: 0: 13229.1. Samples: 51483410. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:11:06,106][213445] Avg episode reward: [(0, '4435.828')] [2023-03-07 15:11:06,642][213771] Updated weights for policy 0, policy_version 50300 (0.0007) [2023-03-07 15:11:07,432][213771] Updated weights for policy 0, policy_version 50310 (0.0008) [2023-03-07 15:11:08,198][213771] Updated weights for policy 0, policy_version 50320 (0.0007) [2023-03-07 15:11:08,969][213771] Updated weights for policy 0, policy_version 50330 (0.0006) [2023-03-07 15:11:09,750][213771] Updated weights for policy 0, policy_version 50340 (0.0007) [2023-03-07 15:11:10,516][213771] Updated weights for policy 0, policy_version 50350 (0.0006) [2023-03-07 15:11:11,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13226.6, 300 sec: 13232.2). Total num frames: 51565568. Throughput: 0: 13221.0. Samples: 51562246. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:11:11,106][213445] Avg episode reward: [(0, '4479.284')] [2023-03-07 15:11:11,300][213771] Updated weights for policy 0, policy_version 50360 (0.0006) [2023-03-07 15:11:12,069][213771] Updated weights for policy 0, policy_version 50370 (0.0005) [2023-03-07 15:11:12,848][213771] Updated weights for policy 0, policy_version 50380 (0.0005) [2023-03-07 15:11:13,617][213771] Updated weights for policy 0, policy_version 50390 (0.0006) [2023-03-07 15:11:14,396][213771] Updated weights for policy 0, policy_version 50400 (0.0006) [2023-03-07 15:11:15,182][213771] Updated weights for policy 0, policy_version 50410 (0.0006) [2023-03-07 15:11:15,955][213771] Updated weights for policy 0, policy_version 50420 (0.0006) [2023-03-07 15:11:16,105][213445] Fps is (10 sec: 13107.1, 60 sec: 13209.6, 300 sec: 13228.7). Total num frames: 51631104. Throughput: 0: 13231.8. Samples: 51602246. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:11:16,106][213445] Avg episode reward: [(0, '4483.189')] [2023-03-07 15:11:16,737][213771] Updated weights for policy 0, policy_version 50430 (0.0006) [2023-03-07 15:11:17,505][213771] Updated weights for policy 0, policy_version 50440 (0.0006) [2023-03-07 15:11:18,281][213771] Updated weights for policy 0, policy_version 50450 (0.0006) [2023-03-07 15:11:19,055][213771] Updated weights for policy 0, policy_version 50460 (0.0005) [2023-03-07 15:11:19,830][213771] Updated weights for policy 0, policy_version 50470 (0.0006) [2023-03-07 15:11:20,605][213771] Updated weights for policy 0, policy_version 50480 (0.0005) [2023-03-07 15:11:21,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.6, 300 sec: 13232.2). Total num frames: 51697664. Throughput: 0: 13229.7. Samples: 51681401. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:11:21,106][213445] Avg episode reward: [(0, '4465.145')] [2023-03-07 15:11:21,382][213771] Updated weights for policy 0, policy_version 50490 (0.0006) [2023-03-07 15:11:22,155][213771] Updated weights for policy 0, policy_version 50500 (0.0006) [2023-03-07 15:11:22,933][213771] Updated weights for policy 0, policy_version 50510 (0.0006) [2023-03-07 15:11:23,734][213771] Updated weights for policy 0, policy_version 50520 (0.0006) [2023-03-07 15:11:24,489][213771] Updated weights for policy 0, policy_version 50530 (0.0005) [2023-03-07 15:11:25,257][213771] Updated weights for policy 0, policy_version 50540 (0.0007) [2023-03-07 15:11:26,034][213771] Updated weights for policy 0, policy_version 50550 (0.0006) [2023-03-07 15:11:26,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13209.6, 300 sec: 13228.7). Total num frames: 51763200. Throughput: 0: 13220.4. Samples: 51760416. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:11:26,106][213445] Avg episode reward: [(0, '4462.300')] [2023-03-07 15:11:26,815][213771] Updated weights for policy 0, policy_version 50560 (0.0006) [2023-03-07 15:11:27,574][213771] Updated weights for policy 0, policy_version 50570 (0.0005) [2023-03-07 15:11:28,344][213771] Updated weights for policy 0, policy_version 50580 (0.0005) [2023-03-07 15:11:29,112][213771] Updated weights for policy 0, policy_version 50590 (0.0006) [2023-03-07 15:11:29,890][213771] Updated weights for policy 0, policy_version 50600 (0.0006) [2023-03-07 15:11:30,668][213771] Updated weights for policy 0, policy_version 50610 (0.0006) [2023-03-07 15:11:31,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13228.7). Total num frames: 51829760. Throughput: 0: 13226.9. Samples: 51800543. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:11:31,106][213445] Avg episode reward: [(0, '4447.065')] [2023-03-07 15:11:31,440][213771] Updated weights for policy 0, policy_version 50620 (0.0006) [2023-03-07 15:11:32,194][213771] Updated weights for policy 0, policy_version 50630 (0.0006) [2023-03-07 15:11:32,972][213771] Updated weights for policy 0, policy_version 50640 (0.0008) [2023-03-07 15:11:33,750][213771] Updated weights for policy 0, policy_version 50650 (0.0006) [2023-03-07 15:11:34,551][213771] Updated weights for policy 0, policy_version 50660 (0.0006) [2023-03-07 15:11:35,316][213771] Updated weights for policy 0, policy_version 50670 (0.0006) [2023-03-07 15:11:36,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13209.6, 300 sec: 13228.7). Total num frames: 51895296. Throughput: 0: 13218.2. Samples: 51879646. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:11:36,106][213445] Avg episode reward: [(0, '4439.718')] [2023-03-07 15:11:36,116][213771] Updated weights for policy 0, policy_version 50680 (0.0006) [2023-03-07 15:11:36,875][213771] Updated weights for policy 0, policy_version 50690 (0.0007) [2023-03-07 15:11:37,649][213771] Updated weights for policy 0, policy_version 50700 (0.0006) [2023-03-07 15:11:38,418][213771] Updated weights for policy 0, policy_version 50710 (0.0006) [2023-03-07 15:11:39,193][213771] Updated weights for policy 0, policy_version 50720 (0.0006) [2023-03-07 15:11:39,958][213771] Updated weights for policy 0, policy_version 50730 (0.0007) [2023-03-07 15:11:40,710][213771] Updated weights for policy 0, policy_version 50740 (0.0007) [2023-03-07 15:11:41,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13226.6, 300 sec: 13228.7). Total num frames: 51961856. Throughput: 0: 13222.2. Samples: 51959231. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:11:41,116][213445] Avg episode reward: [(0, '4392.576')] [2023-03-07 15:11:41,485][213771] Updated weights for policy 0, policy_version 50750 (0.0007) [2023-03-07 15:11:42,249][213771] Updated weights for policy 0, policy_version 50760 (0.0006) [2023-03-07 15:11:43,025][213771] Updated weights for policy 0, policy_version 50770 (0.0006) [2023-03-07 15:11:43,779][213771] Updated weights for policy 0, policy_version 50780 (0.0008) [2023-03-07 15:11:44,571][213771] Updated weights for policy 0, policy_version 50790 (0.0006) [2023-03-07 15:11:45,344][213771] Updated weights for policy 0, policy_version 50800 (0.0006) [2023-03-07 15:11:46,101][213771] Updated weights for policy 0, policy_version 50810 (0.0006) [2023-03-07 15:11:46,105][213445] Fps is (10 sec: 13414.6, 60 sec: 13243.8, 300 sec: 13232.2). Total num frames: 52029440. Throughput: 0: 13229.3. Samples: 51999199. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:11:46,116][213445] Avg episode reward: [(0, '4449.480')] [2023-03-07 15:11:46,879][213771] Updated weights for policy 0, policy_version 50820 (0.0006) [2023-03-07 15:11:47,643][213771] Updated weights for policy 0, policy_version 50830 (0.0006) [2023-03-07 15:11:48,424][213771] Updated weights for policy 0, policy_version 50840 (0.0006) [2023-03-07 15:11:49,193][213771] Updated weights for policy 0, policy_version 50850 (0.0005) [2023-03-07 15:11:49,969][213771] Updated weights for policy 0, policy_version 50860 (0.0006) [2023-03-07 15:11:50,738][213771] Updated weights for policy 0, policy_version 50870 (0.0006) [2023-03-07 15:11:51,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13226.7, 300 sec: 13232.2). Total num frames: 52094976. Throughput: 0: 13230.2. Samples: 52078768. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:11:51,116][213445] Avg episode reward: [(0, '4499.656')] [2023-03-07 15:11:51,506][213771] Updated weights for policy 0, policy_version 50880 (0.0006) [2023-03-07 15:11:52,289][213771] Updated weights for policy 0, policy_version 50890 (0.0005) [2023-03-07 15:11:53,064][213771] Updated weights for policy 0, policy_version 50900 (0.0006) [2023-03-07 15:11:53,826][213771] Updated weights for policy 0, policy_version 50910 (0.0006) [2023-03-07 15:11:54,605][213771] Updated weights for policy 0, policy_version 50920 (0.0006) [2023-03-07 15:11:55,381][213771] Updated weights for policy 0, policy_version 50930 (0.0006) [2023-03-07 15:11:56,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.8, 300 sec: 13232.2). Total num frames: 52161536. Throughput: 0: 13250.0. Samples: 52158492. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:11:56,116][213445] Avg episode reward: [(0, '4486.840')] [2023-03-07 15:11:56,140][213771] Updated weights for policy 0, policy_version 50940 (0.0005) [2023-03-07 15:11:56,905][213771] Updated weights for policy 0, policy_version 50950 (0.0007) [2023-03-07 15:11:57,693][213771] Updated weights for policy 0, policy_version 50960 (0.0006) [2023-03-07 15:11:58,475][213771] Updated weights for policy 0, policy_version 50970 (0.0006) [2023-03-07 15:11:59,230][213771] Updated weights for policy 0, policy_version 50980 (0.0006) [2023-03-07 15:12:00,002][213771] Updated weights for policy 0, policy_version 50990 (0.0006) [2023-03-07 15:12:00,793][213771] Updated weights for policy 0, policy_version 51000 (0.0006) [2023-03-07 15:12:01,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13243.7, 300 sec: 13232.2). Total num frames: 52228096. Throughput: 0: 13244.0. Samples: 52198226. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:12:01,116][213445] Avg episode reward: [(0, '4483.317')] [2023-03-07 15:12:01,548][213771] Updated weights for policy 0, policy_version 51010 (0.0006) [2023-03-07 15:12:02,319][213771] Updated weights for policy 0, policy_version 51020 (0.0006) [2023-03-07 15:12:03,106][213771] Updated weights for policy 0, policy_version 51030 (0.0006) [2023-03-07 15:12:03,888][213771] Updated weights for policy 0, policy_version 51040 (0.0006) [2023-03-07 15:12:04,651][213771] Updated weights for policy 0, policy_version 51050 (0.0006) [2023-03-07 15:12:05,421][213771] Updated weights for policy 0, policy_version 51060 (0.0007) [2023-03-07 15:12:06,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13226.7, 300 sec: 13228.7). Total num frames: 52293632. Throughput: 0: 13252.7. Samples: 52277770. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:12:06,116][213445] Avg episode reward: [(0, '4434.469')] [2023-03-07 15:12:06,122][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000051069_52294656.pth... [2023-03-07 15:12:06,151][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000047967_49118208.pth [2023-03-07 15:12:06,204][213771] Updated weights for policy 0, policy_version 51070 (0.0006) [2023-03-07 15:12:06,994][213771] Updated weights for policy 0, policy_version 51080 (0.0006) [2023-03-07 15:12:07,758][213771] Updated weights for policy 0, policy_version 51090 (0.0006) [2023-03-07 15:12:08,518][213771] Updated weights for policy 0, policy_version 51100 (0.0006) [2023-03-07 15:12:09,308][213771] Updated weights for policy 0, policy_version 51110 (0.0006) [2023-03-07 15:12:10,065][213771] Updated weights for policy 0, policy_version 51120 (0.0007) [2023-03-07 15:12:10,846][213771] Updated weights for policy 0, policy_version 51130 (0.0006) [2023-03-07 15:12:11,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.8, 300 sec: 13232.2). Total num frames: 52360192. Throughput: 0: 13255.7. Samples: 52356922. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:12:11,116][213445] Avg episode reward: [(0, '4399.574')] [2023-03-07 15:12:11,617][213771] Updated weights for policy 0, policy_version 51140 (0.0006) [2023-03-07 15:12:12,394][213771] Updated weights for policy 0, policy_version 51150 (0.0006) [2023-03-07 15:12:13,165][213771] Updated weights for policy 0, policy_version 51160 (0.0006) [2023-03-07 15:12:13,914][213771] Updated weights for policy 0, policy_version 51170 (0.0005) [2023-03-07 15:12:14,697][213771] Updated weights for policy 0, policy_version 51180 (0.0006) [2023-03-07 15:12:15,466][213771] Updated weights for policy 0, policy_version 51190 (0.0006) [2023-03-07 15:12:16,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13235.6). Total num frames: 52426752. Throughput: 0: 13251.0. Samples: 52396837. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:12:16,116][213445] Avg episode reward: [(0, '4475.789')] [2023-03-07 15:12:16,248][213771] Updated weights for policy 0, policy_version 51200 (0.0006) [2023-03-07 15:12:17,016][213771] Updated weights for policy 0, policy_version 51210 (0.0006) [2023-03-07 15:12:17,792][213771] Updated weights for policy 0, policy_version 51220 (0.0006) [2023-03-07 15:12:18,571][213771] Updated weights for policy 0, policy_version 51230 (0.0006) [2023-03-07 15:12:19,339][213771] Updated weights for policy 0, policy_version 51240 (0.0006) [2023-03-07 15:12:20,098][213771] Updated weights for policy 0, policy_version 51250 (0.0006) [2023-03-07 15:12:20,873][213771] Updated weights for policy 0, policy_version 51260 (0.0006) [2023-03-07 15:12:21,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13232.2). Total num frames: 52492288. Throughput: 0: 13258.7. Samples: 52476289. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:12:21,106][213445] Avg episode reward: [(0, '4476.043')] [2023-03-07 15:12:21,646][213771] Updated weights for policy 0, policy_version 51270 (0.0005) [2023-03-07 15:12:22,413][213771] Updated weights for policy 0, policy_version 51280 (0.0006) [2023-03-07 15:12:23,212][213771] Updated weights for policy 0, policy_version 51290 (0.0006) [2023-03-07 15:12:23,985][213771] Updated weights for policy 0, policy_version 51300 (0.0006) [2023-03-07 15:12:24,758][213771] Updated weights for policy 0, policy_version 51310 (0.0007) [2023-03-07 15:12:25,522][213771] Updated weights for policy 0, policy_version 51320 (0.0008) [2023-03-07 15:12:26,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13235.6). Total num frames: 52558848. Throughput: 0: 13257.9. Samples: 52555837. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:12:26,106][213445] Avg episode reward: [(0, '4431.706')] [2023-03-07 15:12:26,306][213771] Updated weights for policy 0, policy_version 51330 (0.0006) [2023-03-07 15:12:27,069][213771] Updated weights for policy 0, policy_version 51340 (0.0006) [2023-03-07 15:12:27,835][213771] Updated weights for policy 0, policy_version 51350 (0.0006) [2023-03-07 15:12:28,603][213771] Updated weights for policy 0, policy_version 51360 (0.0006) [2023-03-07 15:12:29,384][213771] Updated weights for policy 0, policy_version 51370 (0.0006) [2023-03-07 15:12:30,149][213771] Updated weights for policy 0, policy_version 51380 (0.0006) [2023-03-07 15:12:30,929][213771] Updated weights for policy 0, policy_version 51390 (0.0006) [2023-03-07 15:12:31,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13235.6). Total num frames: 52625408. Throughput: 0: 13255.5. Samples: 52595698. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:12:31,106][213445] Avg episode reward: [(0, '4520.291')] [2023-03-07 15:12:31,686][213771] Updated weights for policy 0, policy_version 51400 (0.0006) [2023-03-07 15:12:32,462][213771] Updated weights for policy 0, policy_version 51410 (0.0007) [2023-03-07 15:12:33,242][213771] Updated weights for policy 0, policy_version 51420 (0.0006) [2023-03-07 15:12:34,000][213771] Updated weights for policy 0, policy_version 51430 (0.0006) [2023-03-07 15:12:34,763][213771] Updated weights for policy 0, policy_version 51440 (0.0006) [2023-03-07 15:12:35,533][213771] Updated weights for policy 0, policy_version 51450 (0.0006) [2023-03-07 15:12:36,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13277.9, 300 sec: 13239.1). Total num frames: 52691968. Throughput: 0: 13264.6. Samples: 52675675. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:12:36,106][213445] Avg episode reward: [(0, '4483.177')] [2023-03-07 15:12:36,309][213771] Updated weights for policy 0, policy_version 51460 (0.0006) [2023-03-07 15:12:37,093][213771] Updated weights for policy 0, policy_version 51470 (0.0007) [2023-03-07 15:12:37,849][213771] Updated weights for policy 0, policy_version 51480 (0.0006) [2023-03-07 15:12:38,622][213771] Updated weights for policy 0, policy_version 51490 (0.0007) [2023-03-07 15:12:39,397][213771] Updated weights for policy 0, policy_version 51500 (0.0005) [2023-03-07 15:12:40,160][213771] Updated weights for policy 0, policy_version 51510 (0.0005) [2023-03-07 15:12:40,934][213771] Updated weights for policy 0, policy_version 51520 (0.0007) [2023-03-07 15:12:41,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13277.9, 300 sec: 13239.1). Total num frames: 52758528. Throughput: 0: 13258.4. Samples: 52755120. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:12:41,105][213445] Avg episode reward: [(0, '4403.191')] [2023-03-07 15:12:41,702][213771] Updated weights for policy 0, policy_version 51530 (0.0006) [2023-03-07 15:12:42,457][213771] Updated weights for policy 0, policy_version 51540 (0.0006) [2023-03-07 15:12:43,238][213771] Updated weights for policy 0, policy_version 51550 (0.0006) [2023-03-07 15:12:44,010][213771] Updated weights for policy 0, policy_version 51560 (0.0006) [2023-03-07 15:12:44,782][213771] Updated weights for policy 0, policy_version 51570 (0.0006) [2023-03-07 15:12:45,555][213771] Updated weights for policy 0, policy_version 51580 (0.0007) [2023-03-07 15:12:46,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13239.1). Total num frames: 52825088. Throughput: 0: 13264.2. Samples: 52795112. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:12:46,105][213445] Avg episode reward: [(0, '4404.582')] [2023-03-07 15:12:46,328][213771] Updated weights for policy 0, policy_version 51590 (0.0006) [2023-03-07 15:12:47,093][213771] Updated weights for policy 0, policy_version 51600 (0.0005) [2023-03-07 15:12:47,843][213771] Updated weights for policy 0, policy_version 51610 (0.0006) [2023-03-07 15:12:48,641][213771] Updated weights for policy 0, policy_version 51620 (0.0005) [2023-03-07 15:12:49,418][213771] Updated weights for policy 0, policy_version 51630 (0.0006) [2023-03-07 15:12:50,193][213771] Updated weights for policy 0, policy_version 51640 (0.0006) [2023-03-07 15:12:50,993][213771] Updated weights for policy 0, policy_version 51650 (0.0007) [2023-03-07 15:12:51,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13235.6). Total num frames: 52890624. Throughput: 0: 13264.0. Samples: 52874650. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:12:51,106][213445] Avg episode reward: [(0, '4444.220')] [2023-03-07 15:12:51,769][213771] Updated weights for policy 0, policy_version 51660 (0.0006) [2023-03-07 15:12:52,530][213771] Updated weights for policy 0, policy_version 51670 (0.0005) [2023-03-07 15:12:53,304][213771] Updated weights for policy 0, policy_version 51680 (0.0006) [2023-03-07 15:12:54,095][213771] Updated weights for policy 0, policy_version 51690 (0.0005) [2023-03-07 15:12:54,858][213771] Updated weights for policy 0, policy_version 51700 (0.0006) [2023-03-07 15:12:55,634][213771] Updated weights for policy 0, policy_version 51710 (0.0006) [2023-03-07 15:12:56,105][213445] Fps is (10 sec: 13106.8, 60 sec: 13243.7, 300 sec: 13232.2). Total num frames: 52956160. Throughput: 0: 13260.9. Samples: 52953664. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:12:56,106][213445] Avg episode reward: [(0, '4451.333')] [2023-03-07 15:12:56,411][213771] Updated weights for policy 0, policy_version 51720 (0.0006) [2023-03-07 15:12:57,178][213771] Updated weights for policy 0, policy_version 51730 (0.0007) [2023-03-07 15:12:57,958][213771] Updated weights for policy 0, policy_version 51740 (0.0006) [2023-03-07 15:12:58,742][213771] Updated weights for policy 0, policy_version 51750 (0.0006) [2023-03-07 15:12:59,505][213771] Updated weights for policy 0, policy_version 51760 (0.0006) [2023-03-07 15:13:00,289][213771] Updated weights for policy 0, policy_version 51770 (0.0006) [2023-03-07 15:13:01,065][213771] Updated weights for policy 0, policy_version 51780 (0.0005) [2023-03-07 15:13:01,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13243.7, 300 sec: 13235.6). Total num frames: 53022720. Throughput: 0: 13256.2. Samples: 52993368. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:13:01,106][213445] Avg episode reward: [(0, '4464.111')] [2023-03-07 15:13:01,855][213771] Updated weights for policy 0, policy_version 51790 (0.0006) [2023-03-07 15:13:02,624][213771] Updated weights for policy 0, policy_version 51800 (0.0007) [2023-03-07 15:13:03,408][213771] Updated weights for policy 0, policy_version 51810 (0.0007) [2023-03-07 15:13:04,190][213771] Updated weights for policy 0, policy_version 51820 (0.0007) [2023-03-07 15:13:04,961][213771] Updated weights for policy 0, policy_version 51830 (0.0006) [2023-03-07 15:13:05,710][213771] Updated weights for policy 0, policy_version 51840 (0.0006) [2023-03-07 15:13:06,105][213445] Fps is (10 sec: 13209.9, 60 sec: 13243.7, 300 sec: 13235.6). Total num frames: 53088256. Throughput: 0: 13245.4. Samples: 53072332. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:13:06,106][213445] Avg episode reward: [(0, '4391.487')] [2023-03-07 15:13:06,510][213771] Updated weights for policy 0, policy_version 51850 (0.0006) [2023-03-07 15:13:07,278][213771] Updated weights for policy 0, policy_version 51860 (0.0006) [2023-03-07 15:13:08,057][213771] Updated weights for policy 0, policy_version 51870 (0.0007) [2023-03-07 15:13:08,831][213771] Updated weights for policy 0, policy_version 51880 (0.0005) [2023-03-07 15:13:09,614][213771] Updated weights for policy 0, policy_version 51890 (0.0005) [2023-03-07 15:13:10,390][213771] Updated weights for policy 0, policy_version 51900 (0.0006) [2023-03-07 15:13:11,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13235.6). Total num frames: 53154816. Throughput: 0: 13236.8. Samples: 53151493. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:13:11,106][213445] Avg episode reward: [(0, '4414.478')] [2023-03-07 15:13:11,170][213771] Updated weights for policy 0, policy_version 51910 (0.0006) [2023-03-07 15:13:11,940][213771] Updated weights for policy 0, policy_version 51920 (0.0006) [2023-03-07 15:13:12,712][213771] Updated weights for policy 0, policy_version 51930 (0.0006) [2023-03-07 15:13:13,501][213771] Updated weights for policy 0, policy_version 51940 (0.0007) [2023-03-07 15:13:14,265][213771] Updated weights for policy 0, policy_version 51950 (0.0005) [2023-03-07 15:13:15,033][213771] Updated weights for policy 0, policy_version 51960 (0.0007) [2023-03-07 15:13:15,814][213771] Updated weights for policy 0, policy_version 51970 (0.0007) [2023-03-07 15:13:16,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13226.6, 300 sec: 13232.2). Total num frames: 53220352. Throughput: 0: 13231.9. Samples: 53191135. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:13:16,106][213445] Avg episode reward: [(0, '4470.808')] [2023-03-07 15:13:16,581][213771] Updated weights for policy 0, policy_version 51980 (0.0005) [2023-03-07 15:13:17,349][213771] Updated weights for policy 0, policy_version 51990 (0.0007) [2023-03-07 15:13:18,121][213771] Updated weights for policy 0, policy_version 52000 (0.0006) [2023-03-07 15:13:18,886][213771] Updated weights for policy 0, policy_version 52010 (0.0006) [2023-03-07 15:13:19,661][213771] Updated weights for policy 0, policy_version 52020 (0.0006) [2023-03-07 15:13:20,427][213771] Updated weights for policy 0, policy_version 52030 (0.0006) [2023-03-07 15:13:21,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.8, 300 sec: 13235.6). Total num frames: 53286912. Throughput: 0: 13225.7. Samples: 53270830. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:13:21,106][213445] Avg episode reward: [(0, '4479.139')] [2023-03-07 15:13:21,214][213771] Updated weights for policy 0, policy_version 52040 (0.0006) [2023-03-07 15:13:21,977][213771] Updated weights for policy 0, policy_version 52050 (0.0006) [2023-03-07 15:13:22,756][213771] Updated weights for policy 0, policy_version 52060 (0.0006) [2023-03-07 15:13:23,543][213771] Updated weights for policy 0, policy_version 52070 (0.0006) [2023-03-07 15:13:24,313][213771] Updated weights for policy 0, policy_version 52080 (0.0006) [2023-03-07 15:13:25,085][213771] Updated weights for policy 0, policy_version 52090 (0.0005) [2023-03-07 15:13:25,841][213771] Updated weights for policy 0, policy_version 52100 (0.0006) [2023-03-07 15:13:26,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13232.2). Total num frames: 53352448. Throughput: 0: 13220.2. Samples: 53350029. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:13:26,106][213445] Avg episode reward: [(0, '4450.226')] [2023-03-07 15:13:26,641][213771] Updated weights for policy 0, policy_version 52110 (0.0006) [2023-03-07 15:13:27,417][213771] Updated weights for policy 0, policy_version 52120 (0.0005) [2023-03-07 15:13:28,189][213771] Updated weights for policy 0, policy_version 52130 (0.0007) [2023-03-07 15:13:28,990][213771] Updated weights for policy 0, policy_version 52140 (0.0005) [2023-03-07 15:13:29,754][213771] Updated weights for policy 0, policy_version 52150 (0.0007) [2023-03-07 15:13:30,520][213771] Updated weights for policy 0, policy_version 52160 (0.0006) [2023-03-07 15:13:31,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13232.2). Total num frames: 53419008. Throughput: 0: 13206.7. Samples: 53389414. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:13:31,106][213445] Avg episode reward: [(0, '4391.106')] [2023-03-07 15:13:31,307][213771] Updated weights for policy 0, policy_version 52170 (0.0007) [2023-03-07 15:13:32,082][213771] Updated weights for policy 0, policy_version 52180 (0.0006) [2023-03-07 15:13:32,840][213771] Updated weights for policy 0, policy_version 52190 (0.0006) [2023-03-07 15:13:33,618][213771] Updated weights for policy 0, policy_version 52200 (0.0006) [2023-03-07 15:13:34,380][213771] Updated weights for policy 0, policy_version 52210 (0.0007) [2023-03-07 15:13:35,171][213771] Updated weights for policy 0, policy_version 52220 (0.0006) [2023-03-07 15:13:35,934][213771] Updated weights for policy 0, policy_version 52230 (0.0006) [2023-03-07 15:13:36,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13226.7, 300 sec: 13235.6). Total num frames: 53485568. Throughput: 0: 13207.1. Samples: 53468970. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:13:36,106][213445] Avg episode reward: [(0, '4331.433')] [2023-03-07 15:13:36,702][213771] Updated weights for policy 0, policy_version 52240 (0.0006) [2023-03-07 15:13:37,486][213771] Updated weights for policy 0, policy_version 52250 (0.0007) [2023-03-07 15:13:38,264][213771] Updated weights for policy 0, policy_version 52260 (0.0006) [2023-03-07 15:13:39,016][213771] Updated weights for policy 0, policy_version 52270 (0.0006) [2023-03-07 15:13:39,813][213771] Updated weights for policy 0, policy_version 52280 (0.0006) [2023-03-07 15:13:40,585][213771] Updated weights for policy 0, policy_version 52290 (0.0007) [2023-03-07 15:13:41,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13209.6, 300 sec: 13232.2). Total num frames: 53551104. Throughput: 0: 13209.6. Samples: 53548094. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:13:41,106][213445] Avg episode reward: [(0, '4277.857')] [2023-03-07 15:13:41,366][213771] Updated weights for policy 0, policy_version 52300 (0.0006) [2023-03-07 15:13:42,142][213771] Updated weights for policy 0, policy_version 52310 (0.0006) [2023-03-07 15:13:42,921][213771] Updated weights for policy 0, policy_version 52320 (0.0006) [2023-03-07 15:13:43,689][213771] Updated weights for policy 0, policy_version 52330 (0.0006) [2023-03-07 15:13:44,466][213771] Updated weights for policy 0, policy_version 52340 (0.0006) [2023-03-07 15:13:45,235][213771] Updated weights for policy 0, policy_version 52350 (0.0006) [2023-03-07 15:13:46,003][213771] Updated weights for policy 0, policy_version 52360 (0.0006) [2023-03-07 15:13:46,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13209.6, 300 sec: 13232.2). Total num frames: 53617664. Throughput: 0: 13210.1. Samples: 53587824. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:13:46,106][213445] Avg episode reward: [(0, '4107.397')] [2023-03-07 15:13:46,780][213771] Updated weights for policy 0, policy_version 52370 (0.0006) [2023-03-07 15:13:47,545][213771] Updated weights for policy 0, policy_version 52380 (0.0005) [2023-03-07 15:13:48,324][213771] Updated weights for policy 0, policy_version 52390 (0.0006) [2023-03-07 15:13:49,101][213771] Updated weights for policy 0, policy_version 52400 (0.0006) [2023-03-07 15:13:49,858][213771] Updated weights for policy 0, policy_version 52410 (0.0006) [2023-03-07 15:13:50,633][213771] Updated weights for policy 0, policy_version 52420 (0.0008) [2023-03-07 15:13:51,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13209.6, 300 sec: 13228.7). Total num frames: 53683200. Throughput: 0: 13221.9. Samples: 53667316. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:13:51,106][213445] Avg episode reward: [(0, '4130.597')] [2023-03-07 15:13:51,416][213771] Updated weights for policy 0, policy_version 52430 (0.0006) [2023-03-07 15:13:52,185][213771] Updated weights for policy 0, policy_version 52440 (0.0006) [2023-03-07 15:13:52,957][213771] Updated weights for policy 0, policy_version 52450 (0.0005) [2023-03-07 15:13:53,749][213771] Updated weights for policy 0, policy_version 52460 (0.0007) [2023-03-07 15:13:54,518][213771] Updated weights for policy 0, policy_version 52470 (0.0006) [2023-03-07 15:13:55,298][213771] Updated weights for policy 0, policy_version 52480 (0.0006) [2023-03-07 15:13:56,078][213771] Updated weights for policy 0, policy_version 52490 (0.0005) [2023-03-07 15:13:56,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13232.2). Total num frames: 53749760. Throughput: 0: 13225.9. Samples: 53746659. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:13:56,116][213445] Avg episode reward: [(0, '4245.632')] [2023-03-07 15:13:56,842][213771] Updated weights for policy 0, policy_version 52500 (0.0006) [2023-03-07 15:13:57,618][213771] Updated weights for policy 0, policy_version 52510 (0.0005) [2023-03-07 15:13:58,388][213771] Updated weights for policy 0, policy_version 52520 (0.0006) [2023-03-07 15:13:59,158][213771] Updated weights for policy 0, policy_version 52530 (0.0006) [2023-03-07 15:13:59,922][213771] Updated weights for policy 0, policy_version 52540 (0.0006) [2023-03-07 15:14:00,698][213771] Updated weights for policy 0, policy_version 52550 (0.0006) [2023-03-07 15:14:01,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13226.7, 300 sec: 13232.2). Total num frames: 53816320. Throughput: 0: 13229.6. Samples: 53786468. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:14:01,116][213445] Avg episode reward: [(0, '4345.486')] [2023-03-07 15:14:01,465][213771] Updated weights for policy 0, policy_version 52560 (0.0006) [2023-03-07 15:14:02,251][213771] Updated weights for policy 0, policy_version 52570 (0.0006) [2023-03-07 15:14:03,011][213771] Updated weights for policy 0, policy_version 52580 (0.0006) [2023-03-07 15:14:03,784][213771] Updated weights for policy 0, policy_version 52590 (0.0006) [2023-03-07 15:14:04,553][213771] Updated weights for policy 0, policy_version 52600 (0.0006) [2023-03-07 15:14:05,323][213771] Updated weights for policy 0, policy_version 52610 (0.0006) [2023-03-07 15:14:06,089][213771] Updated weights for policy 0, policy_version 52620 (0.0006) [2023-03-07 15:14:06,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13243.7, 300 sec: 13235.6). Total num frames: 53882880. Throughput: 0: 13228.3. Samples: 53866107. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:14:06,106][213445] Avg episode reward: [(0, '4158.975')] [2023-03-07 15:14:06,112][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000052620_53882880.pth... [2023-03-07 15:14:06,143][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000049518_50706432.pth [2023-03-07 15:14:06,854][213771] Updated weights for policy 0, policy_version 52630 (0.0006) [2023-03-07 15:14:07,645][213771] Updated weights for policy 0, policy_version 52640 (0.0006) [2023-03-07 15:14:08,420][213771] Updated weights for policy 0, policy_version 52650 (0.0006) [2023-03-07 15:14:09,190][213771] Updated weights for policy 0, policy_version 52660 (0.0006) [2023-03-07 15:14:09,964][213771] Updated weights for policy 0, policy_version 52670 (0.0006) [2023-03-07 15:14:10,718][213771] Updated weights for policy 0, policy_version 52680 (0.0006) [2023-03-07 15:14:11,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13232.2). Total num frames: 53948416. Throughput: 0: 13237.4. Samples: 53945712. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:14:11,106][213445] Avg episode reward: [(0, '4217.954')] [2023-03-07 15:14:11,485][213771] Updated weights for policy 0, policy_version 52690 (0.0005) [2023-03-07 15:14:12,273][213771] Updated weights for policy 0, policy_version 52700 (0.0006) [2023-03-07 15:14:13,041][213771] Updated weights for policy 0, policy_version 52710 (0.0005) [2023-03-07 15:14:13,800][213771] Updated weights for policy 0, policy_version 52720 (0.0006) [2023-03-07 15:14:14,576][213771] Updated weights for policy 0, policy_version 52730 (0.0007) [2023-03-07 15:14:15,349][213771] Updated weights for policy 0, policy_version 52740 (0.0006) [2023-03-07 15:14:16,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13235.6). Total num frames: 54014976. Throughput: 0: 13248.3. Samples: 53985586. Policy #0 lag: (min: 0.0, avg: 1.5, max: 3.0) [2023-03-07 15:14:16,106][213445] Avg episode reward: [(0, '4228.648')] [2023-03-07 15:14:16,133][213771] Updated weights for policy 0, policy_version 52750 (0.0006) [2023-03-07 15:14:16,914][213771] Updated weights for policy 0, policy_version 52760 (0.0006) [2023-03-07 15:14:17,680][213771] Updated weights for policy 0, policy_version 52770 (0.0006) [2023-03-07 15:14:18,470][213771] Updated weights for policy 0, policy_version 52780 (0.0006) [2023-03-07 15:14:19,242][213771] Updated weights for policy 0, policy_version 52790 (0.0006) [2023-03-07 15:14:20,025][213771] Updated weights for policy 0, policy_version 52800 (0.0005) [2023-03-07 15:14:20,791][213771] Updated weights for policy 0, policy_version 52810 (0.0006) [2023-03-07 15:14:21,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13243.7, 300 sec: 13235.6). Total num frames: 54081536. Throughput: 0: 13239.3. Samples: 54064739. Policy #0 lag: (min: 0.0, avg: 1.5, max: 3.0) [2023-03-07 15:14:21,106][213445] Avg episode reward: [(0, '4265.241')] [2023-03-07 15:14:21,556][213771] Updated weights for policy 0, policy_version 52820 (0.0007) [2023-03-07 15:14:22,340][213771] Updated weights for policy 0, policy_version 52830 (0.0006) [2023-03-07 15:14:23,095][213771] Updated weights for policy 0, policy_version 52840 (0.0006) [2023-03-07 15:14:23,861][213771] Updated weights for policy 0, policy_version 52850 (0.0006) [2023-03-07 15:14:24,639][213771] Updated weights for policy 0, policy_version 52860 (0.0006) [2023-03-07 15:14:25,417][213771] Updated weights for policy 0, policy_version 52870 (0.0006) [2023-03-07 15:14:26,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13235.6). Total num frames: 54147072. Throughput: 0: 13250.7. Samples: 54144377. Policy #0 lag: (min: 0.0, avg: 1.5, max: 3.0) [2023-03-07 15:14:26,106][213445] Avg episode reward: [(0, '4207.418')] [2023-03-07 15:14:26,173][213771] Updated weights for policy 0, policy_version 52880 (0.0007) [2023-03-07 15:14:26,954][213771] Updated weights for policy 0, policy_version 52890 (0.0006) [2023-03-07 15:14:27,737][213771] Updated weights for policy 0, policy_version 52900 (0.0006) [2023-03-07 15:14:28,516][213771] Updated weights for policy 0, policy_version 52910 (0.0006) [2023-03-07 15:14:29,295][213771] Updated weights for policy 0, policy_version 52920 (0.0007) [2023-03-07 15:14:30,096][213771] Updated weights for policy 0, policy_version 52930 (0.0006) [2023-03-07 15:14:30,850][213771] Updated weights for policy 0, policy_version 52940 (0.0006) [2023-03-07 15:14:31,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13243.8, 300 sec: 13235.6). Total num frames: 54213632. Throughput: 0: 13248.7. Samples: 54184012. Policy #0 lag: (min: 0.0, avg: 1.5, max: 3.0) [2023-03-07 15:14:31,105][213445] Avg episode reward: [(0, '4094.228')] [2023-03-07 15:14:31,620][213771] Updated weights for policy 0, policy_version 52950 (0.0006) [2023-03-07 15:14:32,408][213771] Updated weights for policy 0, policy_version 52960 (0.0007) [2023-03-07 15:14:33,177][213771] Updated weights for policy 0, policy_version 52970 (0.0006) [2023-03-07 15:14:33,973][213771] Updated weights for policy 0, policy_version 52980 (0.0006) [2023-03-07 15:14:34,734][213771] Updated weights for policy 0, policy_version 52990 (0.0006) [2023-03-07 15:14:35,506][213771] Updated weights for policy 0, policy_version 53000 (0.0006) [2023-03-07 15:14:36,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13235.6). Total num frames: 54279168. Throughput: 0: 13235.3. Samples: 54262906. Policy #0 lag: (min: 0.0, avg: 1.5, max: 3.0) [2023-03-07 15:14:36,105][213445] Avg episode reward: [(0, '4237.856')] [2023-03-07 15:14:36,274][213771] Updated weights for policy 0, policy_version 53010 (0.0006) [2023-03-07 15:14:37,075][213771] Updated weights for policy 0, policy_version 53020 (0.0005) [2023-03-07 15:14:37,839][213771] Updated weights for policy 0, policy_version 53030 (0.0005) [2023-03-07 15:14:38,594][213771] Updated weights for policy 0, policy_version 53040 (0.0006) [2023-03-07 15:14:39,386][213771] Updated weights for policy 0, policy_version 53050 (0.0006) [2023-03-07 15:14:40,146][213771] Updated weights for policy 0, policy_version 53060 (0.0006) [2023-03-07 15:14:40,934][213771] Updated weights for policy 0, policy_version 53070 (0.0006) [2023-03-07 15:14:41,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13235.6). Total num frames: 54345728. Throughput: 0: 13241.2. Samples: 54342511. Policy #0 lag: (min: 0.0, avg: 1.5, max: 3.0) [2023-03-07 15:14:41,106][213445] Avg episode reward: [(0, '4252.699')] [2023-03-07 15:14:41,710][213771] Updated weights for policy 0, policy_version 53080 (0.0006) [2023-03-07 15:14:42,492][213771] Updated weights for policy 0, policy_version 53090 (0.0006) [2023-03-07 15:14:43,254][213771] Updated weights for policy 0, policy_version 53100 (0.0006) [2023-03-07 15:14:44,044][213771] Updated weights for policy 0, policy_version 53110 (0.0008) [2023-03-07 15:14:44,807][213771] Updated weights for policy 0, policy_version 53120 (0.0006) [2023-03-07 15:14:45,572][213771] Updated weights for policy 0, policy_version 53130 (0.0006) [2023-03-07 15:14:46,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13226.7, 300 sec: 13235.6). Total num frames: 54411264. Throughput: 0: 13234.0. Samples: 54381997. Policy #0 lag: (min: 0.0, avg: 1.5, max: 3.0) [2023-03-07 15:14:46,106][213445] Avg episode reward: [(0, '4297.754')] [2023-03-07 15:14:46,353][213771] Updated weights for policy 0, policy_version 53140 (0.0006) [2023-03-07 15:14:47,122][213771] Updated weights for policy 0, policy_version 53150 (0.0006) [2023-03-07 15:14:47,889][213771] Updated weights for policy 0, policy_version 53160 (0.0005) [2023-03-07 15:14:48,674][213771] Updated weights for policy 0, policy_version 53170 (0.0006) [2023-03-07 15:14:49,469][213771] Updated weights for policy 0, policy_version 53180 (0.0007) [2023-03-07 15:14:50,221][213771] Updated weights for policy 0, policy_version 53190 (0.0006) [2023-03-07 15:14:50,994][213771] Updated weights for policy 0, policy_version 53200 (0.0006) [2023-03-07 15:14:51,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13235.6). Total num frames: 54477824. Throughput: 0: 13228.8. Samples: 54461404. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:14:51,106][213445] Avg episode reward: [(0, '4328.543')] [2023-03-07 15:14:51,774][213771] Updated weights for policy 0, policy_version 53210 (0.0007) [2023-03-07 15:14:52,548][213771] Updated weights for policy 0, policy_version 53220 (0.0007) [2023-03-07 15:14:53,331][213771] Updated weights for policy 0, policy_version 53230 (0.0006) [2023-03-07 15:14:54,111][213771] Updated weights for policy 0, policy_version 53240 (0.0007) [2023-03-07 15:14:54,881][213771] Updated weights for policy 0, policy_version 53250 (0.0006) [2023-03-07 15:14:55,682][213771] Updated weights for policy 0, policy_version 53260 (0.0007) [2023-03-07 15:14:56,105][213445] Fps is (10 sec: 13209.9, 60 sec: 13226.7, 300 sec: 13232.2). Total num frames: 54543360. Throughput: 0: 13218.1. Samples: 54540525. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:14:56,105][213445] Avg episode reward: [(0, '4145.076')] [2023-03-07 15:14:56,458][213771] Updated weights for policy 0, policy_version 53270 (0.0006) [2023-03-07 15:14:57,223][213771] Updated weights for policy 0, policy_version 53280 (0.0007) [2023-03-07 15:14:58,000][213771] Updated weights for policy 0, policy_version 53290 (0.0006) [2023-03-07 15:14:58,784][213771] Updated weights for policy 0, policy_version 53300 (0.0006) [2023-03-07 15:14:59,551][213771] Updated weights for policy 0, policy_version 53310 (0.0005) [2023-03-07 15:15:00,329][213771] Updated weights for policy 0, policy_version 53320 (0.0007) [2023-03-07 15:15:01,096][213771] Updated weights for policy 0, policy_version 53330 (0.0007) [2023-03-07 15:15:01,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13235.6). Total num frames: 54609920. Throughput: 0: 13209.4. Samples: 54580011. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:15:01,106][213445] Avg episode reward: [(0, '4291.864')] [2023-03-07 15:15:01,872][213771] Updated weights for policy 0, policy_version 53340 (0.0006) [2023-03-07 15:15:02,613][213771] Updated weights for policy 0, policy_version 53350 (0.0006) [2023-03-07 15:15:03,398][213771] Updated weights for policy 0, policy_version 53360 (0.0005) [2023-03-07 15:15:04,171][213771] Updated weights for policy 0, policy_version 53370 (0.0006) [2023-03-07 15:15:04,943][213771] Updated weights for policy 0, policy_version 53380 (0.0005) [2023-03-07 15:15:05,737][213771] Updated weights for policy 0, policy_version 53390 (0.0007) [2023-03-07 15:15:06,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13209.6, 300 sec: 13232.2). Total num frames: 54675456. Throughput: 0: 13221.6. Samples: 54659708. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:15:06,106][213445] Avg episode reward: [(0, '4272.180')] [2023-03-07 15:15:06,512][213771] Updated weights for policy 0, policy_version 53400 (0.0007) [2023-03-07 15:15:07,283][213771] Updated weights for policy 0, policy_version 53410 (0.0006) [2023-03-07 15:15:08,063][213771] Updated weights for policy 0, policy_version 53420 (0.0008) [2023-03-07 15:15:08,828][213771] Updated weights for policy 0, policy_version 53430 (0.0005) [2023-03-07 15:15:09,616][213771] Updated weights for policy 0, policy_version 53440 (0.0006) [2023-03-07 15:15:10,393][213771] Updated weights for policy 0, policy_version 53450 (0.0006) [2023-03-07 15:15:11,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13232.2). Total num frames: 54742016. Throughput: 0: 13206.3. Samples: 54738659. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:15:11,106][213445] Avg episode reward: [(0, '4315.719')] [2023-03-07 15:15:11,163][213771] Updated weights for policy 0, policy_version 53460 (0.0006) [2023-03-07 15:15:11,953][213771] Updated weights for policy 0, policy_version 53470 (0.0006) [2023-03-07 15:15:12,712][213771] Updated weights for policy 0, policy_version 53480 (0.0006) [2023-03-07 15:15:13,494][213771] Updated weights for policy 0, policy_version 53490 (0.0007) [2023-03-07 15:15:14,269][213771] Updated weights for policy 0, policy_version 53500 (0.0007) [2023-03-07 15:15:15,043][213771] Updated weights for policy 0, policy_version 53510 (0.0006) [2023-03-07 15:15:15,818][213771] Updated weights for policy 0, policy_version 53520 (0.0006) [2023-03-07 15:15:16,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13209.6, 300 sec: 13232.2). Total num frames: 54807552. Throughput: 0: 13205.8. Samples: 54778275. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:15:16,106][213445] Avg episode reward: [(0, '4322.767')] [2023-03-07 15:15:16,592][213771] Updated weights for policy 0, policy_version 53530 (0.0006) [2023-03-07 15:15:17,354][213771] Updated weights for policy 0, policy_version 53540 (0.0006) [2023-03-07 15:15:18,145][213771] Updated weights for policy 0, policy_version 53550 (0.0006) [2023-03-07 15:15:18,914][213771] Updated weights for policy 0, policy_version 53560 (0.0006) [2023-03-07 15:15:19,686][213771] Updated weights for policy 0, policy_version 53570 (0.0006) [2023-03-07 15:15:20,468][213771] Updated weights for policy 0, policy_version 53580 (0.0007) [2023-03-07 15:15:21,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13209.6, 300 sec: 13232.2). Total num frames: 54874112. Throughput: 0: 13215.4. Samples: 54857598. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:15:21,106][213445] Avg episode reward: [(0, '4264.272')] [2023-03-07 15:15:21,248][213771] Updated weights for policy 0, policy_version 53590 (0.0006) [2023-03-07 15:15:22,027][213771] Updated weights for policy 0, policy_version 53600 (0.0006) [2023-03-07 15:15:22,795][213771] Updated weights for policy 0, policy_version 53610 (0.0006) [2023-03-07 15:15:23,564][213771] Updated weights for policy 0, policy_version 53620 (0.0005) [2023-03-07 15:15:24,330][213771] Updated weights for policy 0, policy_version 53630 (0.0007) [2023-03-07 15:15:25,100][213771] Updated weights for policy 0, policy_version 53640 (0.0007) [2023-03-07 15:15:25,869][213771] Updated weights for policy 0, policy_version 53650 (0.0006) [2023-03-07 15:15:26,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13226.7, 300 sec: 13235.6). Total num frames: 54940672. Throughput: 0: 13213.3. Samples: 54937110. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:15:26,105][213445] Avg episode reward: [(0, '4231.541')] [2023-03-07 15:15:26,653][213771] Updated weights for policy 0, policy_version 53660 (0.0007) [2023-03-07 15:15:27,425][213771] Updated weights for policy 0, policy_version 53670 (0.0007) [2023-03-07 15:15:28,195][213771] Updated weights for policy 0, policy_version 53680 (0.0006) [2023-03-07 15:15:28,950][213771] Updated weights for policy 0, policy_version 53690 (0.0006) [2023-03-07 15:15:29,740][213771] Updated weights for policy 0, policy_version 53700 (0.0005) [2023-03-07 15:15:30,525][213771] Updated weights for policy 0, policy_version 53710 (0.0005) [2023-03-07 15:15:31,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13209.6, 300 sec: 13232.2). Total num frames: 55006208. Throughput: 0: 13217.6. Samples: 54976788. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:15:31,116][213445] Avg episode reward: [(0, '4264.124')] [2023-03-07 15:15:31,284][213771] Updated weights for policy 0, policy_version 53720 (0.0007) [2023-03-07 15:15:32,049][213771] Updated weights for policy 0, policy_version 53730 (0.0006) [2023-03-07 15:15:32,829][213771] Updated weights for policy 0, policy_version 53740 (0.0005) [2023-03-07 15:15:33,596][213771] Updated weights for policy 0, policy_version 53750 (0.0006) [2023-03-07 15:15:34,371][213771] Updated weights for policy 0, policy_version 53760 (0.0006) [2023-03-07 15:15:35,158][213771] Updated weights for policy 0, policy_version 53770 (0.0006) [2023-03-07 15:15:35,927][213771] Updated weights for policy 0, policy_version 53780 (0.0006) [2023-03-07 15:15:36,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13226.6, 300 sec: 13235.6). Total num frames: 55072768. Throughput: 0: 13218.9. Samples: 55056252. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:15:36,116][213445] Avg episode reward: [(0, '4169.629')] [2023-03-07 15:15:36,710][213771] Updated weights for policy 0, policy_version 53790 (0.0006) [2023-03-07 15:15:37,477][213771] Updated weights for policy 0, policy_version 53800 (0.0006) [2023-03-07 15:15:38,249][213771] Updated weights for policy 0, policy_version 53810 (0.0006) [2023-03-07 15:15:39,017][213771] Updated weights for policy 0, policy_version 53820 (0.0006) [2023-03-07 15:15:39,813][213771] Updated weights for policy 0, policy_version 53830 (0.0006) [2023-03-07 15:15:40,568][213771] Updated weights for policy 0, policy_version 53840 (0.0005) [2023-03-07 15:15:41,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13226.7, 300 sec: 13235.6). Total num frames: 55139328. Throughput: 0: 13225.7. Samples: 55135685. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:15:41,116][213445] Avg episode reward: [(0, '4137.193')] [2023-03-07 15:15:41,346][213771] Updated weights for policy 0, policy_version 53850 (0.0006) [2023-03-07 15:15:42,124][213771] Updated weights for policy 0, policy_version 53860 (0.0006) [2023-03-07 15:15:42,894][213771] Updated weights for policy 0, policy_version 53870 (0.0006) [2023-03-07 15:15:43,676][213771] Updated weights for policy 0, policy_version 53880 (0.0006) [2023-03-07 15:15:44,458][213771] Updated weights for policy 0, policy_version 53890 (0.0006) [2023-03-07 15:15:45,236][213771] Updated weights for policy 0, policy_version 53900 (0.0007) [2023-03-07 15:15:46,005][213771] Updated weights for policy 0, policy_version 53910 (0.0006) [2023-03-07 15:15:46,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13232.2). Total num frames: 55204864. Throughput: 0: 13228.4. Samples: 55175290. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:15:46,116][213445] Avg episode reward: [(0, '4263.073')] [2023-03-07 15:15:46,784][213771] Updated weights for policy 0, policy_version 53920 (0.0006) [2023-03-07 15:15:47,563][213771] Updated weights for policy 0, policy_version 53930 (0.0006) [2023-03-07 15:15:48,327][213771] Updated weights for policy 0, policy_version 53940 (0.0006) [2023-03-07 15:15:49,113][213771] Updated weights for policy 0, policy_version 53950 (0.0006) [2023-03-07 15:15:49,877][213771] Updated weights for policy 0, policy_version 53960 (0.0007) [2023-03-07 15:15:50,657][213771] Updated weights for policy 0, policy_version 53970 (0.0006) [2023-03-07 15:15:51,105][213445] Fps is (10 sec: 13107.2, 60 sec: 13209.6, 300 sec: 13232.2). Total num frames: 55270400. Throughput: 0: 13217.0. Samples: 55254472. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:15:51,106][213445] Avg episode reward: [(0, '4294.559')] [2023-03-07 15:15:51,443][213771] Updated weights for policy 0, policy_version 53980 (0.0006) [2023-03-07 15:15:52,227][213771] Updated weights for policy 0, policy_version 53990 (0.0006) [2023-03-07 15:15:53,003][213771] Updated weights for policy 0, policy_version 54000 (0.0007) [2023-03-07 15:15:53,774][213771] Updated weights for policy 0, policy_version 54010 (0.0006) [2023-03-07 15:15:54,548][213771] Updated weights for policy 0, policy_version 54020 (0.0006) [2023-03-07 15:15:55,326][213771] Updated weights for policy 0, policy_version 54030 (0.0006) [2023-03-07 15:15:56,092][213771] Updated weights for policy 0, policy_version 54040 (0.0007) [2023-03-07 15:15:56,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13226.6, 300 sec: 13232.2). Total num frames: 55336960. Throughput: 0: 13217.6. Samples: 55333452. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:15:56,106][213445] Avg episode reward: [(0, '4306.067')] [2023-03-07 15:15:56,874][213771] Updated weights for policy 0, policy_version 54050 (0.0005) [2023-03-07 15:15:57,669][213771] Updated weights for policy 0, policy_version 54060 (0.0006) [2023-03-07 15:15:58,429][213771] Updated weights for policy 0, policy_version 54070 (0.0006) [2023-03-07 15:15:59,199][213771] Updated weights for policy 0, policy_version 54080 (0.0007) [2023-03-07 15:15:59,986][213771] Updated weights for policy 0, policy_version 54090 (0.0007) [2023-03-07 15:16:00,747][213771] Updated weights for policy 0, policy_version 54100 (0.0006) [2023-03-07 15:16:01,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13209.6, 300 sec: 13228.7). Total num frames: 55402496. Throughput: 0: 13217.3. Samples: 55373052. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:16:01,106][213445] Avg episode reward: [(0, '4348.027')] [2023-03-07 15:16:01,523][213771] Updated weights for policy 0, policy_version 54110 (0.0006) [2023-03-07 15:16:02,285][213771] Updated weights for policy 0, policy_version 54120 (0.0006) [2023-03-07 15:16:03,065][213771] Updated weights for policy 0, policy_version 54130 (0.0006) [2023-03-07 15:16:03,834][213771] Updated weights for policy 0, policy_version 54140 (0.0007) [2023-03-07 15:16:04,608][213771] Updated weights for policy 0, policy_version 54150 (0.0007) [2023-03-07 15:16:05,375][213771] Updated weights for policy 0, policy_version 54160 (0.0006) [2023-03-07 15:16:06,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13226.7, 300 sec: 13232.2). Total num frames: 55469056. Throughput: 0: 13221.1. Samples: 55452547. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:16:06,106][213445] Avg episode reward: [(0, '4336.476')] [2023-03-07 15:16:06,109][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000054169_55469056.pth... [2023-03-07 15:16:06,140][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000051069_52294656.pth [2023-03-07 15:16:06,163][213771] Updated weights for policy 0, policy_version 54170 (0.0006) [2023-03-07 15:16:06,937][213771] Updated weights for policy 0, policy_version 54180 (0.0006) [2023-03-07 15:16:07,717][213771] Updated weights for policy 0, policy_version 54190 (0.0007) [2023-03-07 15:16:08,489][213771] Updated weights for policy 0, policy_version 54200 (0.0006) [2023-03-07 15:16:09,249][213771] Updated weights for policy 0, policy_version 54210 (0.0006) [2023-03-07 15:16:10,009][213771] Updated weights for policy 0, policy_version 54220 (0.0007) [2023-03-07 15:16:10,797][213771] Updated weights for policy 0, policy_version 54230 (0.0007) [2023-03-07 15:16:11,105][213445] Fps is (10 sec: 13312.3, 60 sec: 13226.7, 300 sec: 13235.6). Total num frames: 55535616. Throughput: 0: 13219.9. Samples: 55532003. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:16:11,106][213445] Avg episode reward: [(0, '4366.099')] [2023-03-07 15:16:11,562][213771] Updated weights for policy 0, policy_version 54240 (0.0006) [2023-03-07 15:16:12,332][213771] Updated weights for policy 0, policy_version 54250 (0.0006) [2023-03-07 15:16:13,107][213771] Updated weights for policy 0, policy_version 54260 (0.0006) [2023-03-07 15:16:13,880][213771] Updated weights for policy 0, policy_version 54270 (0.0007) [2023-03-07 15:16:14,654][213771] Updated weights for policy 0, policy_version 54280 (0.0006) [2023-03-07 15:16:15,434][213771] Updated weights for policy 0, policy_version 54290 (0.0006) [2023-03-07 15:16:16,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13226.7, 300 sec: 13232.2). Total num frames: 55601152. Throughput: 0: 13224.8. Samples: 55571905. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:16:16,106][213445] Avg episode reward: [(0, '4258.548')] [2023-03-07 15:16:16,208][213771] Updated weights for policy 0, policy_version 54300 (0.0006) [2023-03-07 15:16:16,979][213771] Updated weights for policy 0, policy_version 54310 (0.0006) [2023-03-07 15:16:17,759][213771] Updated weights for policy 0, policy_version 54320 (0.0006) [2023-03-07 15:16:18,526][213771] Updated weights for policy 0, policy_version 54330 (0.0006) [2023-03-07 15:16:19,287][213771] Updated weights for policy 0, policy_version 54340 (0.0005) [2023-03-07 15:16:20,078][213771] Updated weights for policy 0, policy_version 54350 (0.0005) [2023-03-07 15:16:20,842][213771] Updated weights for policy 0, policy_version 54360 (0.0006) [2023-03-07 15:16:21,105][213445] Fps is (10 sec: 13209.3, 60 sec: 13226.7, 300 sec: 13235.6). Total num frames: 55667712. Throughput: 0: 13224.4. Samples: 55651352. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:16:21,106][213445] Avg episode reward: [(0, '4323.719')] [2023-03-07 15:16:21,622][213771] Updated weights for policy 0, policy_version 54370 (0.0006) [2023-03-07 15:16:22,387][213771] Updated weights for policy 0, policy_version 54380 (0.0006) [2023-03-07 15:16:23,160][213771] Updated weights for policy 0, policy_version 54390 (0.0005) [2023-03-07 15:16:23,925][213771] Updated weights for policy 0, policy_version 54400 (0.0006) [2023-03-07 15:16:24,689][213771] Updated weights for policy 0, policy_version 54410 (0.0005) [2023-03-07 15:16:25,458][213771] Updated weights for policy 0, policy_version 54420 (0.0005) [2023-03-07 15:16:26,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13226.6, 300 sec: 13235.6). Total num frames: 55734272. Throughput: 0: 13230.9. Samples: 55731079. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:16:26,106][213445] Avg episode reward: [(0, '4356.161')] [2023-03-07 15:16:26,239][213771] Updated weights for policy 0, policy_version 54430 (0.0006) [2023-03-07 15:16:27,008][213771] Updated weights for policy 0, policy_version 54440 (0.0006) [2023-03-07 15:16:27,783][213771] Updated weights for policy 0, policy_version 54450 (0.0006) [2023-03-07 15:16:28,559][213771] Updated weights for policy 0, policy_version 54460 (0.0006) [2023-03-07 15:16:29,333][213771] Updated weights for policy 0, policy_version 54470 (0.0006) [2023-03-07 15:16:30,104][213771] Updated weights for policy 0, policy_version 54480 (0.0007) [2023-03-07 15:16:30,878][213771] Updated weights for policy 0, policy_version 54490 (0.0006) [2023-03-07 15:16:31,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13235.6). Total num frames: 55799808. Throughput: 0: 13230.3. Samples: 55770653. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 15:16:31,105][213445] Avg episode reward: [(0, '4391.440')] [2023-03-07 15:16:31,640][213771] Updated weights for policy 0, policy_version 54500 (0.0007) [2023-03-07 15:16:32,408][213771] Updated weights for policy 0, policy_version 54510 (0.0006) [2023-03-07 15:16:33,187][213771] Updated weights for policy 0, policy_version 54520 (0.0006) [2023-03-07 15:16:33,975][213771] Updated weights for policy 0, policy_version 54530 (0.0006) [2023-03-07 15:16:34,760][213771] Updated weights for policy 0, policy_version 54540 (0.0006) [2023-03-07 15:16:35,525][213771] Updated weights for policy 0, policy_version 54550 (0.0005) [2023-03-07 15:16:36,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13226.7, 300 sec: 13235.6). Total num frames: 55866368. Throughput: 0: 13236.9. Samples: 55850131. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 15:16:36,106][213445] Avg episode reward: [(0, '4347.456')] [2023-03-07 15:16:36,305][213771] Updated weights for policy 0, policy_version 54560 (0.0005) [2023-03-07 15:16:37,094][213771] Updated weights for policy 0, policy_version 54570 (0.0006) [2023-03-07 15:16:37,855][213771] Updated weights for policy 0, policy_version 54580 (0.0006) [2023-03-07 15:16:38,638][213771] Updated weights for policy 0, policy_version 54590 (0.0007) [2023-03-07 15:16:39,429][213771] Updated weights for policy 0, policy_version 54600 (0.0006) [2023-03-07 15:16:40,209][213771] Updated weights for policy 0, policy_version 54610 (0.0006) [2023-03-07 15:16:40,980][213771] Updated weights for policy 0, policy_version 54620 (0.0006) [2023-03-07 15:16:41,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13209.6, 300 sec: 13228.7). Total num frames: 55931904. Throughput: 0: 13234.5. Samples: 55929006. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 15:16:41,107][213445] Avg episode reward: [(0, '4374.615')] [2023-03-07 15:16:41,747][213771] Updated weights for policy 0, policy_version 54630 (0.0006) [2023-03-07 15:16:42,517][213771] Updated weights for policy 0, policy_version 54640 (0.0006) [2023-03-07 15:16:43,307][213771] Updated weights for policy 0, policy_version 54650 (0.0005) [2023-03-07 15:16:44,082][213771] Updated weights for policy 0, policy_version 54660 (0.0006) [2023-03-07 15:16:44,857][213771] Updated weights for policy 0, policy_version 54670 (0.0006) [2023-03-07 15:16:45,625][213771] Updated weights for policy 0, policy_version 54680 (0.0007) [2023-03-07 15:16:46,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13226.6, 300 sec: 13232.2). Total num frames: 55998464. Throughput: 0: 13237.2. Samples: 55968728. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 15:16:46,106][213445] Avg episode reward: [(0, '4354.272')] [2023-03-07 15:16:46,401][213771] Updated weights for policy 0, policy_version 54690 (0.0006) [2023-03-07 15:16:47,173][213771] Updated weights for policy 0, policy_version 54700 (0.0006) [2023-03-07 15:16:47,946][213771] Updated weights for policy 0, policy_version 54710 (0.0006) [2023-03-07 15:16:48,719][213771] Updated weights for policy 0, policy_version 54720 (0.0006) [2023-03-07 15:16:49,497][213771] Updated weights for policy 0, policy_version 54730 (0.0006) [2023-03-07 15:16:50,275][213771] Updated weights for policy 0, policy_version 54740 (0.0006) [2023-03-07 15:16:51,052][213771] Updated weights for policy 0, policy_version 54750 (0.0006) [2023-03-07 15:16:51,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13228.7). Total num frames: 56064000. Throughput: 0: 13232.5. Samples: 56048011. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 15:16:51,106][213445] Avg episode reward: [(0, '4348.537')] [2023-03-07 15:16:51,813][213771] Updated weights for policy 0, policy_version 54760 (0.0006) [2023-03-07 15:16:52,592][213771] Updated weights for policy 0, policy_version 54770 (0.0006) [2023-03-07 15:16:53,366][213771] Updated weights for policy 0, policy_version 54780 (0.0006) [2023-03-07 15:16:54,138][213771] Updated weights for policy 0, policy_version 54790 (0.0006) [2023-03-07 15:16:54,919][213771] Updated weights for policy 0, policy_version 54800 (0.0006) [2023-03-07 15:16:55,697][213771] Updated weights for policy 0, policy_version 54810 (0.0006) [2023-03-07 15:16:56,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13228.7). Total num frames: 56130560. Throughput: 0: 13234.1. Samples: 56127539. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 15:16:56,106][213445] Avg episode reward: [(0, '4363.250')] [2023-03-07 15:16:56,452][213771] Updated weights for policy 0, policy_version 54820 (0.0005) [2023-03-07 15:16:57,219][213771] Updated weights for policy 0, policy_version 54830 (0.0005) [2023-03-07 15:16:57,995][213771] Updated weights for policy 0, policy_version 54840 (0.0007) [2023-03-07 15:16:58,766][213771] Updated weights for policy 0, policy_version 54850 (0.0007) [2023-03-07 15:16:59,554][213771] Updated weights for policy 0, policy_version 54860 (0.0006) [2023-03-07 15:17:00,313][213771] Updated weights for policy 0, policy_version 54870 (0.0006) [2023-03-07 15:17:01,089][213771] Updated weights for policy 0, policy_version 54880 (0.0006) [2023-03-07 15:17:01,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13243.7, 300 sec: 13232.2). Total num frames: 56197120. Throughput: 0: 13234.1. Samples: 56167438. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 15:17:01,106][213445] Avg episode reward: [(0, '4323.127')] [2023-03-07 15:17:01,853][213771] Updated weights for policy 0, policy_version 54890 (0.0006) [2023-03-07 15:17:02,620][213771] Updated weights for policy 0, policy_version 54900 (0.0006) [2023-03-07 15:17:03,390][213771] Updated weights for policy 0, policy_version 54910 (0.0006) [2023-03-07 15:17:04,160][213771] Updated weights for policy 0, policy_version 54920 (0.0007) [2023-03-07 15:17:04,933][213771] Updated weights for policy 0, policy_version 54930 (0.0007) [2023-03-07 15:17:05,697][213771] Updated weights for policy 0, policy_version 54940 (0.0006) [2023-03-07 15:17:06,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13243.7, 300 sec: 13232.2). Total num frames: 56263680. Throughput: 0: 13238.8. Samples: 56247098. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 15:17:06,106][213445] Avg episode reward: [(0, '4202.421')] [2023-03-07 15:17:06,474][213771] Updated weights for policy 0, policy_version 54950 (0.0007) [2023-03-07 15:17:07,237][213771] Updated weights for policy 0, policy_version 54960 (0.0006) [2023-03-07 15:17:08,022][213771] Updated weights for policy 0, policy_version 54970 (0.0006) [2023-03-07 15:17:08,791][213771] Updated weights for policy 0, policy_version 54980 (0.0007) [2023-03-07 15:17:09,570][213771] Updated weights for policy 0, policy_version 54990 (0.0006) [2023-03-07 15:17:10,335][213771] Updated weights for policy 0, policy_version 55000 (0.0005) [2023-03-07 15:17:11,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.6, 300 sec: 13228.7). Total num frames: 56329216. Throughput: 0: 13230.9. Samples: 56326469. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 15:17:11,106][213445] Avg episode reward: [(0, '4323.422')] [2023-03-07 15:17:11,117][213771] Updated weights for policy 0, policy_version 55010 (0.0006) [2023-03-07 15:17:11,879][213771] Updated weights for policy 0, policy_version 55020 (0.0007) [2023-03-07 15:17:12,673][213771] Updated weights for policy 0, policy_version 55030 (0.0005) [2023-03-07 15:17:13,445][213771] Updated weights for policy 0, policy_version 55040 (0.0006) [2023-03-07 15:17:14,218][213771] Updated weights for policy 0, policy_version 55050 (0.0006) [2023-03-07 15:17:14,988][213771] Updated weights for policy 0, policy_version 55060 (0.0007) [2023-03-07 15:17:15,763][213771] Updated weights for policy 0, policy_version 55070 (0.0006) [2023-03-07 15:17:16,105][213445] Fps is (10 sec: 13209.3, 60 sec: 13243.7, 300 sec: 13232.2). Total num frames: 56395776. Throughput: 0: 13229.1. Samples: 56365965. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 15:17:16,106][213445] Avg episode reward: [(0, '4282.160')] [2023-03-07 15:17:16,536][213771] Updated weights for policy 0, policy_version 55080 (0.0006) [2023-03-07 15:17:17,318][213771] Updated weights for policy 0, policy_version 55090 (0.0006) [2023-03-07 15:17:18,081][213771] Updated weights for policy 0, policy_version 55100 (0.0006) [2023-03-07 15:17:18,861][213771] Updated weights for policy 0, policy_version 55110 (0.0006) [2023-03-07 15:17:19,644][213771] Updated weights for policy 0, policy_version 55120 (0.0006) [2023-03-07 15:17:20,401][213771] Updated weights for policy 0, policy_version 55130 (0.0006) [2023-03-07 15:17:21,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13243.8, 300 sec: 13232.2). Total num frames: 56462336. Throughput: 0: 13231.5. Samples: 56445547. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 15:17:21,106][213445] Avg episode reward: [(0, '4292.050')] [2023-03-07 15:17:21,166][213771] Updated weights for policy 0, policy_version 55140 (0.0006) [2023-03-07 15:17:21,950][213771] Updated weights for policy 0, policy_version 55150 (0.0006) [2023-03-07 15:17:22,714][213771] Updated weights for policy 0, policy_version 55160 (0.0006) [2023-03-07 15:17:23,482][213771] Updated weights for policy 0, policy_version 55170 (0.0006) [2023-03-07 15:17:24,262][213771] Updated weights for policy 0, policy_version 55180 (0.0006) [2023-03-07 15:17:25,025][213771] Updated weights for policy 0, policy_version 55190 (0.0006) [2023-03-07 15:17:25,809][213771] Updated weights for policy 0, policy_version 55200 (0.0006) [2023-03-07 15:17:26,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13226.7, 300 sec: 13228.7). Total num frames: 56527872. Throughput: 0: 13246.4. Samples: 56525093. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 15:17:26,106][213445] Avg episode reward: [(0, '4388.707')] [2023-03-07 15:17:26,586][213771] Updated weights for policy 0, policy_version 55210 (0.0006) [2023-03-07 15:17:27,341][213771] Updated weights for policy 0, policy_version 55220 (0.0006) [2023-03-07 15:17:28,111][213771] Updated weights for policy 0, policy_version 55230 (0.0006) [2023-03-07 15:17:28,860][213771] Updated weights for policy 0, policy_version 55240 (0.0006) [2023-03-07 15:17:29,640][213771] Updated weights for policy 0, policy_version 55250 (0.0006) [2023-03-07 15:17:30,401][213771] Updated weights for policy 0, policy_version 55260 (0.0006) [2023-03-07 15:17:31,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13228.7). Total num frames: 56594432. Throughput: 0: 13255.9. Samples: 56565240. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 15:17:31,106][213445] Avg episode reward: [(0, '4366.805')] [2023-03-07 15:17:31,190][213771] Updated weights for policy 0, policy_version 55270 (0.0006) [2023-03-07 15:17:31,953][213771] Updated weights for policy 0, policy_version 55280 (0.0007) [2023-03-07 15:17:32,734][213771] Updated weights for policy 0, policy_version 55290 (0.0007) [2023-03-07 15:17:33,501][213771] Updated weights for policy 0, policy_version 55300 (0.0006) [2023-03-07 15:17:34,267][213771] Updated weights for policy 0, policy_version 55310 (0.0006) [2023-03-07 15:17:35,065][213771] Updated weights for policy 0, policy_version 55320 (0.0006) [2023-03-07 15:17:35,837][213771] Updated weights for policy 0, policy_version 55330 (0.0006) [2023-03-07 15:17:36,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13243.7, 300 sec: 13228.7). Total num frames: 56660992. Throughput: 0: 13261.0. Samples: 56644756. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 15:17:36,105][213445] Avg episode reward: [(0, '4322.959')] [2023-03-07 15:17:36,585][213771] Updated weights for policy 0, policy_version 55340 (0.0006) [2023-03-07 15:17:37,385][213771] Updated weights for policy 0, policy_version 55350 (0.0006) [2023-03-07 15:17:38,134][213771] Updated weights for policy 0, policy_version 55360 (0.0006) [2023-03-07 15:17:38,921][213771] Updated weights for policy 0, policy_version 55370 (0.0007) [2023-03-07 15:17:39,700][213771] Updated weights for policy 0, policy_version 55380 (0.0006) [2023-03-07 15:17:40,462][213771] Updated weights for policy 0, policy_version 55390 (0.0007) [2023-03-07 15:17:41,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13228.7). Total num frames: 56727552. Throughput: 0: 13261.1. Samples: 56724288. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:17:41,105][213445] Avg episode reward: [(0, '4394.418')] [2023-03-07 15:17:41,257][213771] Updated weights for policy 0, policy_version 55400 (0.0005) [2023-03-07 15:17:42,034][213771] Updated weights for policy 0, policy_version 55410 (0.0006) [2023-03-07 15:17:42,791][213771] Updated weights for policy 0, policy_version 55420 (0.0006) [2023-03-07 15:17:43,570][213771] Updated weights for policy 0, policy_version 55430 (0.0006) [2023-03-07 15:17:44,341][213771] Updated weights for policy 0, policy_version 55440 (0.0006) [2023-03-07 15:17:45,110][213771] Updated weights for policy 0, policy_version 55450 (0.0006) [2023-03-07 15:17:45,869][213771] Updated weights for policy 0, policy_version 55460 (0.0006) [2023-03-07 15:17:46,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13243.7, 300 sec: 13228.7). Total num frames: 56793088. Throughput: 0: 13252.8. Samples: 56763815. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:17:46,106][213445] Avg episode reward: [(0, '4390.188')] [2023-03-07 15:17:46,645][213771] Updated weights for policy 0, policy_version 55470 (0.0007) [2023-03-07 15:17:47,426][213771] Updated weights for policy 0, policy_version 55480 (0.0005) [2023-03-07 15:17:48,197][213771] Updated weights for policy 0, policy_version 55490 (0.0006) [2023-03-07 15:17:48,968][213771] Updated weights for policy 0, policy_version 55500 (0.0007) [2023-03-07 15:17:49,743][213771] Updated weights for policy 0, policy_version 55510 (0.0006) [2023-03-07 15:17:50,511][213771] Updated weights for policy 0, policy_version 55520 (0.0006) [2023-03-07 15:17:51,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13232.2). Total num frames: 56859648. Throughput: 0: 13252.6. Samples: 56843468. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:17:51,106][213445] Avg episode reward: [(0, '4351.572')] [2023-03-07 15:17:51,300][213771] Updated weights for policy 0, policy_version 55530 (0.0007) [2023-03-07 15:17:52,078][213771] Updated weights for policy 0, policy_version 55540 (0.0005) [2023-03-07 15:17:52,852][213771] Updated weights for policy 0, policy_version 55550 (0.0006) [2023-03-07 15:17:53,616][213771] Updated weights for policy 0, policy_version 55560 (0.0006) [2023-03-07 15:17:54,392][213771] Updated weights for policy 0, policy_version 55570 (0.0006) [2023-03-07 15:17:55,155][213771] Updated weights for policy 0, policy_version 55580 (0.0006) [2023-03-07 15:17:55,922][213771] Updated weights for policy 0, policy_version 55590 (0.0007) [2023-03-07 15:17:56,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13232.2). Total num frames: 56926208. Throughput: 0: 13254.5. Samples: 56922924. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:17:56,106][213445] Avg episode reward: [(0, '4348.390')] [2023-03-07 15:17:56,711][213771] Updated weights for policy 0, policy_version 55600 (0.0006) [2023-03-07 15:17:57,501][213771] Updated weights for policy 0, policy_version 55610 (0.0006) [2023-03-07 15:17:58,260][213771] Updated weights for policy 0, policy_version 55620 (0.0005) [2023-03-07 15:17:59,037][213771] Updated weights for policy 0, policy_version 55630 (0.0006) [2023-03-07 15:17:59,792][213771] Updated weights for policy 0, policy_version 55640 (0.0006) [2023-03-07 15:18:00,573][213771] Updated weights for policy 0, policy_version 55650 (0.0006) [2023-03-07 15:18:01,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13235.6). Total num frames: 56992768. Throughput: 0: 13255.0. Samples: 56962436. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:18:01,106][213445] Avg episode reward: [(0, '4305.527')] [2023-03-07 15:18:01,342][213771] Updated weights for policy 0, policy_version 55660 (0.0005) [2023-03-07 15:18:02,109][213771] Updated weights for policy 0, policy_version 55670 (0.0006) [2023-03-07 15:18:02,888][213771] Updated weights for policy 0, policy_version 55680 (0.0006) [2023-03-07 15:18:03,657][213771] Updated weights for policy 0, policy_version 55690 (0.0005) [2023-03-07 15:18:04,422][213771] Updated weights for policy 0, policy_version 55700 (0.0006) [2023-03-07 15:18:05,180][213771] Updated weights for policy 0, policy_version 55710 (0.0007) [2023-03-07 15:18:05,975][213771] Updated weights for policy 0, policy_version 55720 (0.0006) [2023-03-07 15:18:06,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13232.2). Total num frames: 57058304. Throughput: 0: 13260.4. Samples: 57042266. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:18:06,106][213445] Avg episode reward: [(0, '4324.925')] [2023-03-07 15:18:06,124][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000055722_57059328.pth... [2023-03-07 15:18:06,153][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000052620_53882880.pth [2023-03-07 15:18:06,742][213771] Updated weights for policy 0, policy_version 55730 (0.0005) [2023-03-07 15:18:07,520][213771] Updated weights for policy 0, policy_version 55740 (0.0008) [2023-03-07 15:18:08,310][213771] Updated weights for policy 0, policy_version 55750 (0.0005) [2023-03-07 15:18:09,077][213771] Updated weights for policy 0, policy_version 55760 (0.0006) [2023-03-07 15:18:09,849][213771] Updated weights for policy 0, policy_version 55770 (0.0006) [2023-03-07 15:18:10,605][213771] Updated weights for policy 0, policy_version 55780 (0.0006) [2023-03-07 15:18:11,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13260.8, 300 sec: 13235.6). Total num frames: 57124864. Throughput: 0: 13256.7. Samples: 57121646. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:18:11,106][213445] Avg episode reward: [(0, '4288.916')] [2023-03-07 15:18:11,378][213771] Updated weights for policy 0, policy_version 55790 (0.0006) [2023-03-07 15:18:12,157][213771] Updated weights for policy 0, policy_version 55800 (0.0006) [2023-03-07 15:18:12,922][213771] Updated weights for policy 0, policy_version 55810 (0.0006) [2023-03-07 15:18:13,706][213771] Updated weights for policy 0, policy_version 55820 (0.0007) [2023-03-07 15:18:14,472][213771] Updated weights for policy 0, policy_version 55830 (0.0006) [2023-03-07 15:18:15,249][213771] Updated weights for policy 0, policy_version 55840 (0.0006) [2023-03-07 15:18:16,023][213771] Updated weights for policy 0, policy_version 55850 (0.0007) [2023-03-07 15:18:16,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13232.2). Total num frames: 57190400. Throughput: 0: 13249.2. Samples: 57161456. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:18:16,106][213445] Avg episode reward: [(0, '4297.558')] [2023-03-07 15:18:16,813][213771] Updated weights for policy 0, policy_version 55860 (0.0006) [2023-03-07 15:18:17,595][213771] Updated weights for policy 0, policy_version 55870 (0.0006) [2023-03-07 15:18:18,353][213771] Updated weights for policy 0, policy_version 55880 (0.0006) [2023-03-07 15:18:19,126][213771] Updated weights for policy 0, policy_version 55890 (0.0006) [2023-03-07 15:18:19,910][213771] Updated weights for policy 0, policy_version 55900 (0.0006) [2023-03-07 15:18:20,682][213771] Updated weights for policy 0, policy_version 55910 (0.0006) [2023-03-07 15:18:21,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13235.6). Total num frames: 57256960. Throughput: 0: 13241.1. Samples: 57240607. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:18:21,106][213445] Avg episode reward: [(0, '4244.964')] [2023-03-07 15:18:21,458][213771] Updated weights for policy 0, policy_version 55920 (0.0006) [2023-03-07 15:18:22,233][213771] Updated weights for policy 0, policy_version 55930 (0.0006) [2023-03-07 15:18:22,997][213771] Updated weights for policy 0, policy_version 55940 (0.0008) [2023-03-07 15:18:23,769][213771] Updated weights for policy 0, policy_version 55950 (0.0005) [2023-03-07 15:18:24,550][213771] Updated weights for policy 0, policy_version 55960 (0.0006) [2023-03-07 15:18:25,313][213771] Updated weights for policy 0, policy_version 55970 (0.0006) [2023-03-07 15:18:26,088][213771] Updated weights for policy 0, policy_version 55980 (0.0007) [2023-03-07 15:18:26,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13235.6). Total num frames: 57323520. Throughput: 0: 13239.2. Samples: 57320053. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:18:26,106][213445] Avg episode reward: [(0, '4257.817')] [2023-03-07 15:18:26,858][213771] Updated weights for policy 0, policy_version 55990 (0.0006) [2023-03-07 15:18:27,644][213771] Updated weights for policy 0, policy_version 56000 (0.0007) [2023-03-07 15:18:28,404][213771] Updated weights for policy 0, policy_version 56010 (0.0006) [2023-03-07 15:18:29,161][213771] Updated weights for policy 0, policy_version 56020 (0.0006) [2023-03-07 15:18:29,937][213771] Updated weights for policy 0, policy_version 56030 (0.0006) [2023-03-07 15:18:30,701][213771] Updated weights for policy 0, policy_version 56040 (0.0006) [2023-03-07 15:18:31,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13235.6). Total num frames: 57390080. Throughput: 0: 13245.7. Samples: 57359869. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:18:31,106][213445] Avg episode reward: [(0, '4348.031')] [2023-03-07 15:18:31,479][213771] Updated weights for policy 0, policy_version 56050 (0.0006) [2023-03-07 15:18:32,258][213771] Updated weights for policy 0, policy_version 56060 (0.0006) [2023-03-07 15:18:33,037][213771] Updated weights for policy 0, policy_version 56070 (0.0006) [2023-03-07 15:18:33,794][213771] Updated weights for policy 0, policy_version 56080 (0.0006) [2023-03-07 15:18:34,557][213771] Updated weights for policy 0, policy_version 56090 (0.0006) [2023-03-07 15:18:35,341][213771] Updated weights for policy 0, policy_version 56100 (0.0005) [2023-03-07 15:18:36,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13235.6). Total num frames: 57455616. Throughput: 0: 13250.3. Samples: 57439734. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:18:36,106][213445] Avg episode reward: [(0, '4368.540')] [2023-03-07 15:18:36,109][213771] Updated weights for policy 0, policy_version 56110 (0.0005) [2023-03-07 15:18:36,870][213771] Updated weights for policy 0, policy_version 56120 (0.0006) [2023-03-07 15:18:37,644][213771] Updated weights for policy 0, policy_version 56130 (0.0006) [2023-03-07 15:18:38,421][213771] Updated weights for policy 0, policy_version 56140 (0.0008) [2023-03-07 15:18:39,183][213771] Updated weights for policy 0, policy_version 56150 (0.0006) [2023-03-07 15:18:39,940][213771] Updated weights for policy 0, policy_version 56160 (0.0006) [2023-03-07 15:18:40,740][213771] Updated weights for policy 0, policy_version 56170 (0.0005) [2023-03-07 15:18:41,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13235.6). Total num frames: 57522176. Throughput: 0: 13255.4. Samples: 57519416. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:18:41,106][213445] Avg episode reward: [(0, '4329.336')] [2023-03-07 15:18:41,497][213771] Updated weights for policy 0, policy_version 56180 (0.0006) [2023-03-07 15:18:42,273][213771] Updated weights for policy 0, policy_version 56190 (0.0005) [2023-03-07 15:18:43,043][213771] Updated weights for policy 0, policy_version 56200 (0.0006) [2023-03-07 15:18:43,811][213771] Updated weights for policy 0, policy_version 56210 (0.0006) [2023-03-07 15:18:44,590][213771] Updated weights for policy 0, policy_version 56220 (0.0006) [2023-03-07 15:18:45,353][213771] Updated weights for policy 0, policy_version 56230 (0.0006) [2023-03-07 15:18:46,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13239.1). Total num frames: 57588736. Throughput: 0: 13264.8. Samples: 57559354. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:18:46,106][213445] Avg episode reward: [(0, '4329.120')] [2023-03-07 15:18:46,120][213771] Updated weights for policy 0, policy_version 56240 (0.0006) [2023-03-07 15:18:46,898][213771] Updated weights for policy 0, policy_version 56250 (0.0006) [2023-03-07 15:18:47,679][213771] Updated weights for policy 0, policy_version 56260 (0.0006) [2023-03-07 15:18:48,458][213771] Updated weights for policy 0, policy_version 56270 (0.0006) [2023-03-07 15:18:49,224][213771] Updated weights for policy 0, policy_version 56280 (0.0006) [2023-03-07 15:18:50,000][213771] Updated weights for policy 0, policy_version 56290 (0.0006) [2023-03-07 15:18:50,777][213771] Updated weights for policy 0, policy_version 56300 (0.0006) [2023-03-07 15:18:51,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13260.8, 300 sec: 13239.1). Total num frames: 57655296. Throughput: 0: 13253.2. Samples: 57638661. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:18:51,106][213445] Avg episode reward: [(0, '4345.729')] [2023-03-07 15:18:51,535][213771] Updated weights for policy 0, policy_version 56310 (0.0006) [2023-03-07 15:18:52,332][213771] Updated weights for policy 0, policy_version 56320 (0.0006) [2023-03-07 15:18:53,082][213771] Updated weights for policy 0, policy_version 56330 (0.0005) [2023-03-07 15:18:53,875][213771] Updated weights for policy 0, policy_version 56340 (0.0007) [2023-03-07 15:18:54,637][213771] Updated weights for policy 0, policy_version 56350 (0.0006) [2023-03-07 15:18:55,402][213771] Updated weights for policy 0, policy_version 56360 (0.0006) [2023-03-07 15:18:56,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13239.1). Total num frames: 57721856. Throughput: 0: 13258.7. Samples: 57718285. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:18:56,106][213445] Avg episode reward: [(0, '4394.440')] [2023-03-07 15:18:56,181][213771] Updated weights for policy 0, policy_version 56370 (0.0006) [2023-03-07 15:18:56,945][213771] Updated weights for policy 0, policy_version 56380 (0.0006) [2023-03-07 15:18:57,718][213771] Updated weights for policy 0, policy_version 56390 (0.0006) [2023-03-07 15:18:58,490][213771] Updated weights for policy 0, policy_version 56400 (0.0005) [2023-03-07 15:18:59,267][213771] Updated weights for policy 0, policy_version 56410 (0.0006) [2023-03-07 15:19:00,068][213771] Updated weights for policy 0, policy_version 56420 (0.0007) [2023-03-07 15:19:00,830][213771] Updated weights for policy 0, policy_version 56430 (0.0006) [2023-03-07 15:19:01,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13243.7, 300 sec: 13235.6). Total num frames: 57787392. Throughput: 0: 13261.4. Samples: 57758219. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:19:01,106][213445] Avg episode reward: [(0, '4382.588')] [2023-03-07 15:19:01,575][213771] Updated weights for policy 0, policy_version 56440 (0.0005) [2023-03-07 15:19:02,361][213771] Updated weights for policy 0, policy_version 56450 (0.0006) [2023-03-07 15:19:03,127][213771] Updated weights for policy 0, policy_version 56460 (0.0006) [2023-03-07 15:19:03,908][213771] Updated weights for policy 0, policy_version 56470 (0.0006) [2023-03-07 15:19:04,681][213771] Updated weights for policy 0, policy_version 56480 (0.0006) [2023-03-07 15:19:05,437][213771] Updated weights for policy 0, policy_version 56490 (0.0006) [2023-03-07 15:19:06,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13239.1). Total num frames: 57853952. Throughput: 0: 13265.8. Samples: 57837566. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:19:06,106][213445] Avg episode reward: [(0, '4367.399')] [2023-03-07 15:19:06,211][213771] Updated weights for policy 0, policy_version 56500 (0.0006) [2023-03-07 15:19:06,971][213771] Updated weights for policy 0, policy_version 56510 (0.0007) [2023-03-07 15:19:07,746][213771] Updated weights for policy 0, policy_version 56520 (0.0006) [2023-03-07 15:19:08,545][213771] Updated weights for policy 0, policy_version 56530 (0.0006) [2023-03-07 15:19:09,309][213771] Updated weights for policy 0, policy_version 56540 (0.0007) [2023-03-07 15:19:10,081][213771] Updated weights for policy 0, policy_version 56550 (0.0006) [2023-03-07 15:19:10,858][213771] Updated weights for policy 0, policy_version 56560 (0.0006) [2023-03-07 15:19:11,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13260.8, 300 sec: 13239.1). Total num frames: 57920512. Throughput: 0: 13273.1. Samples: 57917343. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:19:11,106][213445] Avg episode reward: [(0, '4324.610')] [2023-03-07 15:19:11,619][213771] Updated weights for policy 0, policy_version 56570 (0.0006) [2023-03-07 15:19:12,401][213771] Updated weights for policy 0, policy_version 56580 (0.0006) [2023-03-07 15:19:13,159][213771] Updated weights for policy 0, policy_version 56590 (0.0006) [2023-03-07 15:19:13,929][213771] Updated weights for policy 0, policy_version 56600 (0.0006) [2023-03-07 15:19:14,702][213771] Updated weights for policy 0, policy_version 56610 (0.0007) [2023-03-07 15:19:15,484][213771] Updated weights for policy 0, policy_version 56620 (0.0006) [2023-03-07 15:19:16,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13277.9, 300 sec: 13239.1). Total num frames: 57987072. Throughput: 0: 13271.4. Samples: 57957081. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:19:16,106][213445] Avg episode reward: [(0, '4370.694')] [2023-03-07 15:19:16,252][213771] Updated weights for policy 0, policy_version 56630 (0.0007) [2023-03-07 15:19:17,026][213771] Updated weights for policy 0, policy_version 56640 (0.0008) [2023-03-07 15:19:17,797][213771] Updated weights for policy 0, policy_version 56650 (0.0007) [2023-03-07 15:19:18,578][213771] Updated weights for policy 0, policy_version 56660 (0.0005) [2023-03-07 15:19:19,349][213771] Updated weights for policy 0, policy_version 56670 (0.0006) [2023-03-07 15:19:20,131][213771] Updated weights for policy 0, policy_version 56680 (0.0006) [2023-03-07 15:19:20,906][213771] Updated weights for policy 0, policy_version 56690 (0.0006) [2023-03-07 15:19:21,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13239.1). Total num frames: 58052608. Throughput: 0: 13258.7. Samples: 58036376. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:19:21,106][213445] Avg episode reward: [(0, '4318.276')] [2023-03-07 15:19:21,680][213771] Updated weights for policy 0, policy_version 56700 (0.0007) [2023-03-07 15:19:22,441][213771] Updated weights for policy 0, policy_version 56710 (0.0005) [2023-03-07 15:19:23,228][213771] Updated weights for policy 0, policy_version 56720 (0.0005) [2023-03-07 15:19:23,994][213771] Updated weights for policy 0, policy_version 56730 (0.0007) [2023-03-07 15:19:24,758][213771] Updated weights for policy 0, policy_version 56740 (0.0006) [2023-03-07 15:19:25,536][213771] Updated weights for policy 0, policy_version 56750 (0.0006) [2023-03-07 15:19:26,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13260.8, 300 sec: 13239.1). Total num frames: 58119168. Throughput: 0: 13255.8. Samples: 58115928. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:19:26,105][213445] Avg episode reward: [(0, '4293.135')] [2023-03-07 15:19:26,295][213771] Updated weights for policy 0, policy_version 56760 (0.0006) [2023-03-07 15:19:27,080][213771] Updated weights for policy 0, policy_version 56770 (0.0006) [2023-03-07 15:19:27,838][213771] Updated weights for policy 0, policy_version 56780 (0.0007) [2023-03-07 15:19:28,619][213771] Updated weights for policy 0, policy_version 56790 (0.0005) [2023-03-07 15:19:29,396][213771] Updated weights for policy 0, policy_version 56800 (0.0005) [2023-03-07 15:19:30,173][213771] Updated weights for policy 0, policy_version 56810 (0.0006) [2023-03-07 15:19:30,935][213771] Updated weights for policy 0, policy_version 56820 (0.0006) [2023-03-07 15:19:31,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13260.8, 300 sec: 13242.6). Total num frames: 58185728. Throughput: 0: 13253.5. Samples: 58155761. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:19:31,106][213445] Avg episode reward: [(0, '4320.820')] [2023-03-07 15:19:31,702][213771] Updated weights for policy 0, policy_version 56830 (0.0006) [2023-03-07 15:19:32,489][213771] Updated weights for policy 0, policy_version 56840 (0.0007) [2023-03-07 15:19:33,242][213771] Updated weights for policy 0, policy_version 56850 (0.0006) [2023-03-07 15:19:34,021][213771] Updated weights for policy 0, policy_version 56860 (0.0006) [2023-03-07 15:19:34,783][213771] Updated weights for policy 0, policy_version 56870 (0.0006) [2023-03-07 15:19:35,569][213771] Updated weights for policy 0, policy_version 56880 (0.0006) [2023-03-07 15:19:36,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13260.8, 300 sec: 13239.1). Total num frames: 58251264. Throughput: 0: 13263.1. Samples: 58235502. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:19:36,106][213445] Avg episode reward: [(0, '4270.972')] [2023-03-07 15:19:36,346][213771] Updated weights for policy 0, policy_version 56890 (0.0006) [2023-03-07 15:19:37,106][213771] Updated weights for policy 0, policy_version 56900 (0.0006) [2023-03-07 15:19:37,917][213771] Updated weights for policy 0, policy_version 56910 (0.0005) [2023-03-07 15:19:38,689][213771] Updated weights for policy 0, policy_version 56920 (0.0006) [2023-03-07 15:19:39,454][213771] Updated weights for policy 0, policy_version 56930 (0.0006) [2023-03-07 15:19:40,229][213771] Updated weights for policy 0, policy_version 56940 (0.0007) [2023-03-07 15:19:41,014][213771] Updated weights for policy 0, policy_version 56950 (0.0006) [2023-03-07 15:19:41,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13242.6). Total num frames: 58317824. Throughput: 0: 13248.3. Samples: 58314460. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:19:41,106][213445] Avg episode reward: [(0, '4239.103')] [2023-03-07 15:19:41,779][213771] Updated weights for policy 0, policy_version 56960 (0.0006) [2023-03-07 15:19:42,554][213771] Updated weights for policy 0, policy_version 56970 (0.0006) [2023-03-07 15:19:43,346][213771] Updated weights for policy 0, policy_version 56980 (0.0006) [2023-03-07 15:19:44,105][213771] Updated weights for policy 0, policy_version 56990 (0.0006) [2023-03-07 15:19:44,868][213771] Updated weights for policy 0, policy_version 57000 (0.0006) [2023-03-07 15:19:45,655][213771] Updated weights for policy 0, policy_version 57010 (0.0007) [2023-03-07 15:19:46,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.8, 300 sec: 13239.1). Total num frames: 58383360. Throughput: 0: 13245.2. Samples: 58354254. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:19:46,106][213445] Avg episode reward: [(0, '4250.474')] [2023-03-07 15:19:46,409][213771] Updated weights for policy 0, policy_version 57020 (0.0006) [2023-03-07 15:19:47,186][213771] Updated weights for policy 0, policy_version 57030 (0.0006) [2023-03-07 15:19:47,961][213771] Updated weights for policy 0, policy_version 57040 (0.0006) [2023-03-07 15:19:48,726][213771] Updated weights for policy 0, policy_version 57050 (0.0007) [2023-03-07 15:19:49,500][213771] Updated weights for policy 0, policy_version 57060 (0.0006) [2023-03-07 15:19:50,257][213771] Updated weights for policy 0, policy_version 57070 (0.0006) [2023-03-07 15:19:51,028][213771] Updated weights for policy 0, policy_version 57080 (0.0006) [2023-03-07 15:19:51,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 58449920. Throughput: 0: 13251.2. Samples: 58433871. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:19:51,106][213445] Avg episode reward: [(0, '4230.268')] [2023-03-07 15:19:51,819][213771] Updated weights for policy 0, policy_version 57090 (0.0006) [2023-03-07 15:19:52,582][213771] Updated weights for policy 0, policy_version 57100 (0.0006) [2023-03-07 15:19:53,320][213771] Updated weights for policy 0, policy_version 57110 (0.0006) [2023-03-07 15:19:54,111][213771] Updated weights for policy 0, policy_version 57120 (0.0006) [2023-03-07 15:19:54,880][213771] Updated weights for policy 0, policy_version 57130 (0.0006) [2023-03-07 15:19:55,629][213771] Updated weights for policy 0, policy_version 57140 (0.0006) [2023-03-07 15:19:56,105][213445] Fps is (10 sec: 13414.3, 60 sec: 13260.8, 300 sec: 13246.0). Total num frames: 58517504. Throughput: 0: 13256.0. Samples: 58513864. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:19:56,106][213445] Avg episode reward: [(0, '4307.679')] [2023-03-07 15:19:56,414][213771] Updated weights for policy 0, policy_version 57150 (0.0006) [2023-03-07 15:19:57,192][213771] Updated weights for policy 0, policy_version 57160 (0.0006) [2023-03-07 15:19:57,960][213771] Updated weights for policy 0, policy_version 57170 (0.0006) [2023-03-07 15:19:58,723][213771] Updated weights for policy 0, policy_version 57180 (0.0007) [2023-03-07 15:19:59,517][213771] Updated weights for policy 0, policy_version 57190 (0.0006) [2023-03-07 15:20:00,268][213771] Updated weights for policy 0, policy_version 57200 (0.0006) [2023-03-07 15:20:01,041][213771] Updated weights for policy 0, policy_version 57210 (0.0007) [2023-03-07 15:20:01,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13260.8, 300 sec: 13246.0). Total num frames: 58583040. Throughput: 0: 13256.0. Samples: 58553601. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:20:01,106][213445] Avg episode reward: [(0, '4205.191')] [2023-03-07 15:20:01,807][213771] Updated weights for policy 0, policy_version 57220 (0.0006) [2023-03-07 15:20:02,580][213771] Updated weights for policy 0, policy_version 57230 (0.0006) [2023-03-07 15:20:03,351][213771] Updated weights for policy 0, policy_version 57240 (0.0007) [2023-03-07 15:20:04,130][213771] Updated weights for policy 0, policy_version 57250 (0.0006) [2023-03-07 15:20:04,909][213771] Updated weights for policy 0, policy_version 57260 (0.0006) [2023-03-07 15:20:05,665][213771] Updated weights for policy 0, policy_version 57270 (0.0006) [2023-03-07 15:20:06,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13260.8, 300 sec: 13246.0). Total num frames: 58649600. Throughput: 0: 13266.7. Samples: 58633376. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:20:06,106][213445] Avg episode reward: [(0, '4223.434')] [2023-03-07 15:20:06,109][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000057275_58649600.pth... [2023-03-07 15:20:06,141][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000054169_55469056.pth [2023-03-07 15:20:06,443][213771] Updated weights for policy 0, policy_version 57280 (0.0006) [2023-03-07 15:20:07,225][213771] Updated weights for policy 0, policy_version 57290 (0.0006) [2023-03-07 15:20:07,993][213771] Updated weights for policy 0, policy_version 57300 (0.0007) [2023-03-07 15:20:08,767][213771] Updated weights for policy 0, policy_version 57310 (0.0006) [2023-03-07 15:20:09,558][213771] Updated weights for policy 0, policy_version 57320 (0.0005) [2023-03-07 15:20:10,324][213771] Updated weights for policy 0, policy_version 57330 (0.0007) [2023-03-07 15:20:11,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 58716160. Throughput: 0: 13258.1. Samples: 58712545. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:20:11,106][213445] Avg episode reward: [(0, '4299.363')] [2023-03-07 15:20:11,106][213771] Updated weights for policy 0, policy_version 57340 (0.0005) [2023-03-07 15:20:11,871][213771] Updated weights for policy 0, policy_version 57350 (0.0006) [2023-03-07 15:20:12,645][213771] Updated weights for policy 0, policy_version 57360 (0.0006) [2023-03-07 15:20:13,422][213771] Updated weights for policy 0, policy_version 57370 (0.0006) [2023-03-07 15:20:14,200][213771] Updated weights for policy 0, policy_version 57380 (0.0006) [2023-03-07 15:20:14,967][213771] Updated weights for policy 0, policy_version 57390 (0.0005) [2023-03-07 15:20:15,754][213771] Updated weights for policy 0, policy_version 57400 (0.0006) [2023-03-07 15:20:16,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 58781696. Throughput: 0: 13254.3. Samples: 58752206. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:20:16,106][213445] Avg episode reward: [(0, '4284.318')] [2023-03-07 15:20:16,520][213771] Updated weights for policy 0, policy_version 57410 (0.0006) [2023-03-07 15:20:17,315][213771] Updated weights for policy 0, policy_version 57420 (0.0006) [2023-03-07 15:20:18,073][213771] Updated weights for policy 0, policy_version 57430 (0.0006) [2023-03-07 15:20:18,850][213771] Updated weights for policy 0, policy_version 57440 (0.0006) [2023-03-07 15:20:19,625][213771] Updated weights for policy 0, policy_version 57450 (0.0006) [2023-03-07 15:20:20,410][213771] Updated weights for policy 0, policy_version 57460 (0.0006) [2023-03-07 15:20:21,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13260.8, 300 sec: 13246.0). Total num frames: 58848256. Throughput: 0: 13247.7. Samples: 58831645. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:20:21,106][213445] Avg episode reward: [(0, '4311.327')] [2023-03-07 15:20:21,185][213771] Updated weights for policy 0, policy_version 57470 (0.0006) [2023-03-07 15:20:21,966][213771] Updated weights for policy 0, policy_version 57480 (0.0006) [2023-03-07 15:20:22,755][213771] Updated weights for policy 0, policy_version 57490 (0.0006) [2023-03-07 15:20:23,523][213771] Updated weights for policy 0, policy_version 57500 (0.0007) [2023-03-07 15:20:24,281][213771] Updated weights for policy 0, policy_version 57510 (0.0006) [2023-03-07 15:20:25,058][213771] Updated weights for policy 0, policy_version 57520 (0.0006) [2023-03-07 15:20:25,830][213771] Updated weights for policy 0, policy_version 57530 (0.0006) [2023-03-07 15:20:26,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 58913792. Throughput: 0: 13251.1. Samples: 58910761. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:20:26,106][213445] Avg episode reward: [(0, '4383.037')] [2023-03-07 15:20:26,613][213771] Updated weights for policy 0, policy_version 57540 (0.0007) [2023-03-07 15:20:27,386][213771] Updated weights for policy 0, policy_version 57550 (0.0006) [2023-03-07 15:20:28,173][213771] Updated weights for policy 0, policy_version 57560 (0.0006) [2023-03-07 15:20:28,933][213771] Updated weights for policy 0, policy_version 57570 (0.0006) [2023-03-07 15:20:29,701][213771] Updated weights for policy 0, policy_version 57580 (0.0007) [2023-03-07 15:20:30,488][213771] Updated weights for policy 0, policy_version 57590 (0.0006) [2023-03-07 15:20:31,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13246.1). Total num frames: 58980352. Throughput: 0: 13246.8. Samples: 58950359. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:20:31,105][213445] Avg episode reward: [(0, '4382.688')] [2023-03-07 15:20:31,259][213771] Updated weights for policy 0, policy_version 57600 (0.0006) [2023-03-07 15:20:32,017][213771] Updated weights for policy 0, policy_version 57610 (0.0006) [2023-03-07 15:20:32,788][213771] Updated weights for policy 0, policy_version 57620 (0.0006) [2023-03-07 15:20:33,553][213771] Updated weights for policy 0, policy_version 57630 (0.0006) [2023-03-07 15:20:34,337][213771] Updated weights for policy 0, policy_version 57640 (0.0006) [2023-03-07 15:20:35,119][213771] Updated weights for policy 0, policy_version 57650 (0.0006) [2023-03-07 15:20:35,887][213771] Updated weights for policy 0, policy_version 57660 (0.0006) [2023-03-07 15:20:36,105][213445] Fps is (10 sec: 13209.9, 60 sec: 13243.8, 300 sec: 13242.6). Total num frames: 59045888. Throughput: 0: 13243.3. Samples: 59029819. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:20:36,105][213445] Avg episode reward: [(0, '4366.087')] [2023-03-07 15:20:36,659][213771] Updated weights for policy 0, policy_version 57670 (0.0006) [2023-03-07 15:20:37,465][213771] Updated weights for policy 0, policy_version 57680 (0.0005) [2023-03-07 15:20:38,242][213771] Updated weights for policy 0, policy_version 57690 (0.0006) [2023-03-07 15:20:39,005][213771] Updated weights for policy 0, policy_version 57700 (0.0006) [2023-03-07 15:20:39,781][213771] Updated weights for policy 0, policy_version 57710 (0.0005) [2023-03-07 15:20:40,578][213771] Updated weights for policy 0, policy_version 57720 (0.0005) [2023-03-07 15:20:41,105][213445] Fps is (10 sec: 13107.1, 60 sec: 13226.7, 300 sec: 13242.6). Total num frames: 59111424. Throughput: 0: 13222.8. Samples: 59108887. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:20:41,106][213445] Avg episode reward: [(0, '4220.020')] [2023-03-07 15:20:41,355][213771] Updated weights for policy 0, policy_version 57730 (0.0006) [2023-03-07 15:20:42,116][213771] Updated weights for policy 0, policy_version 57740 (0.0006) [2023-03-07 15:20:42,869][213771] Updated weights for policy 0, policy_version 57750 (0.0005) [2023-03-07 15:20:43,653][213771] Updated weights for policy 0, policy_version 57760 (0.0006) [2023-03-07 15:20:44,406][213771] Updated weights for policy 0, policy_version 57770 (0.0006) [2023-03-07 15:20:45,178][213771] Updated weights for policy 0, policy_version 57780 (0.0006) [2023-03-07 15:20:45,963][213771] Updated weights for policy 0, policy_version 57790 (0.0006) [2023-03-07 15:20:46,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 59177984. Throughput: 0: 13226.9. Samples: 59148812. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:20:46,106][213445] Avg episode reward: [(0, '4321.964')] [2023-03-07 15:20:46,733][213771] Updated weights for policy 0, policy_version 57800 (0.0006) [2023-03-07 15:20:47,504][213771] Updated weights for policy 0, policy_version 57810 (0.0006) [2023-03-07 15:20:48,297][213771] Updated weights for policy 0, policy_version 57820 (0.0006) [2023-03-07 15:20:49,050][213771] Updated weights for policy 0, policy_version 57830 (0.0006) [2023-03-07 15:20:49,825][213771] Updated weights for policy 0, policy_version 57840 (0.0006) [2023-03-07 15:20:50,596][213771] Updated weights for policy 0, policy_version 57850 (0.0006) [2023-03-07 15:20:51,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13243.7, 300 sec: 13246.1). Total num frames: 59244544. Throughput: 0: 13220.4. Samples: 59228295. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:20:51,106][213445] Avg episode reward: [(0, '4311.336')] [2023-03-07 15:20:51,372][213771] Updated weights for policy 0, policy_version 57860 (0.0007) [2023-03-07 15:20:52,150][213771] Updated weights for policy 0, policy_version 57870 (0.0006) [2023-03-07 15:20:52,917][213771] Updated weights for policy 0, policy_version 57880 (0.0006) [2023-03-07 15:20:53,689][213771] Updated weights for policy 0, policy_version 57890 (0.0006) [2023-03-07 15:20:54,458][213771] Updated weights for policy 0, policy_version 57900 (0.0006) [2023-03-07 15:20:55,226][213771] Updated weights for policy 0, policy_version 57910 (0.0006) [2023-03-07 15:20:56,008][213771] Updated weights for policy 0, policy_version 57920 (0.0006) [2023-03-07 15:20:56,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13226.7, 300 sec: 13249.5). Total num frames: 59311104. Throughput: 0: 13231.5. Samples: 59307961. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:20:56,105][213445] Avg episode reward: [(0, '4184.436')] [2023-03-07 15:20:56,780][213771] Updated weights for policy 0, policy_version 57930 (0.0006) [2023-03-07 15:20:57,549][213771] Updated weights for policy 0, policy_version 57940 (0.0005) [2023-03-07 15:20:58,315][213771] Updated weights for policy 0, policy_version 57950 (0.0006) [2023-03-07 15:20:59,093][213771] Updated weights for policy 0, policy_version 57960 (0.0006) [2023-03-07 15:20:59,878][213771] Updated weights for policy 0, policy_version 57970 (0.0006) [2023-03-07 15:21:00,637][213771] Updated weights for policy 0, policy_version 57980 (0.0006) [2023-03-07 15:21:01,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 59377664. Throughput: 0: 13232.5. Samples: 59347667. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:21:01,106][213445] Avg episode reward: [(0, '4243.220')] [2023-03-07 15:21:01,403][213771] Updated weights for policy 0, policy_version 57990 (0.0006) [2023-03-07 15:21:02,170][213771] Updated weights for policy 0, policy_version 58000 (0.0006) [2023-03-07 15:21:02,956][213771] Updated weights for policy 0, policy_version 58010 (0.0006) [2023-03-07 15:21:03,723][213771] Updated weights for policy 0, policy_version 58020 (0.0007) [2023-03-07 15:21:04,498][213771] Updated weights for policy 0, policy_version 58030 (0.0006) [2023-03-07 15:21:05,284][213771] Updated weights for policy 0, policy_version 58040 (0.0007) [2023-03-07 15:21:06,050][213771] Updated weights for policy 0, policy_version 58050 (0.0006) [2023-03-07 15:21:06,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13226.7, 300 sec: 13246.0). Total num frames: 59443200. Throughput: 0: 13232.8. Samples: 59427123. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:21:06,106][213445] Avg episode reward: [(0, '4212.891')] [2023-03-07 15:21:06,813][213771] Updated weights for policy 0, policy_version 58060 (0.0006) [2023-03-07 15:21:07,602][213771] Updated weights for policy 0, policy_version 58070 (0.0006) [2023-03-07 15:21:08,359][213771] Updated weights for policy 0, policy_version 58080 (0.0006) [2023-03-07 15:21:09,133][213771] Updated weights for policy 0, policy_version 58090 (0.0006) [2023-03-07 15:21:09,907][213771] Updated weights for policy 0, policy_version 58100 (0.0007) [2023-03-07 15:21:10,682][213771] Updated weights for policy 0, policy_version 58110 (0.0006) [2023-03-07 15:21:11,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13226.7, 300 sec: 13249.5). Total num frames: 59509760. Throughput: 0: 13243.6. Samples: 59506722. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:21:11,106][213445] Avg episode reward: [(0, '4268.911')] [2023-03-07 15:21:11,465][213771] Updated weights for policy 0, policy_version 58120 (0.0006) [2023-03-07 15:21:12,242][213771] Updated weights for policy 0, policy_version 58130 (0.0006) [2023-03-07 15:21:13,021][213771] Updated weights for policy 0, policy_version 58140 (0.0006) [2023-03-07 15:21:13,789][213771] Updated weights for policy 0, policy_version 58150 (0.0006) [2023-03-07 15:21:14,572][213771] Updated weights for policy 0, policy_version 58160 (0.0006) [2023-03-07 15:21:15,354][213771] Updated weights for policy 0, policy_version 58170 (0.0006) [2023-03-07 15:21:16,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13246.1). Total num frames: 59575296. Throughput: 0: 13242.3. Samples: 59546263. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:21:16,106][213445] Avg episode reward: [(0, '4256.408')] [2023-03-07 15:21:16,126][213771] Updated weights for policy 0, policy_version 58180 (0.0005) [2023-03-07 15:21:16,902][213771] Updated weights for policy 0, policy_version 58190 (0.0006) [2023-03-07 15:21:17,673][213771] Updated weights for policy 0, policy_version 58200 (0.0006) [2023-03-07 15:21:18,450][213771] Updated weights for policy 0, policy_version 58210 (0.0006) [2023-03-07 15:21:19,233][213771] Updated weights for policy 0, policy_version 58220 (0.0005) [2023-03-07 15:21:19,989][213771] Updated weights for policy 0, policy_version 58230 (0.0007) [2023-03-07 15:21:20,757][213771] Updated weights for policy 0, policy_version 58240 (0.0006) [2023-03-07 15:21:21,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.6, 300 sec: 13246.1). Total num frames: 59641856. Throughput: 0: 13234.7. Samples: 59625382. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:21:21,106][213445] Avg episode reward: [(0, '4255.003')] [2023-03-07 15:21:21,540][213771] Updated weights for policy 0, policy_version 58250 (0.0006) [2023-03-07 15:21:22,310][213771] Updated weights for policy 0, policy_version 58260 (0.0006) [2023-03-07 15:21:23,108][213771] Updated weights for policy 0, policy_version 58270 (0.0005) [2023-03-07 15:21:23,863][213771] Updated weights for policy 0, policy_version 58280 (0.0007) [2023-03-07 15:21:24,658][213771] Updated weights for policy 0, policy_version 58290 (0.0006) [2023-03-07 15:21:25,428][213771] Updated weights for policy 0, policy_version 58300 (0.0006) [2023-03-07 15:21:26,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13226.7, 300 sec: 13246.0). Total num frames: 59707392. Throughput: 0: 13239.8. Samples: 59704679. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:21:26,106][213445] Avg episode reward: [(0, '4208.351')] [2023-03-07 15:21:26,210][213771] Updated weights for policy 0, policy_version 58310 (0.0006) [2023-03-07 15:21:26,968][213771] Updated weights for policy 0, policy_version 58320 (0.0006) [2023-03-07 15:21:27,749][213771] Updated weights for policy 0, policy_version 58330 (0.0006) [2023-03-07 15:21:28,514][213771] Updated weights for policy 0, policy_version 58340 (0.0007) [2023-03-07 15:21:29,285][213771] Updated weights for policy 0, policy_version 58350 (0.0007) [2023-03-07 15:21:30,037][213771] Updated weights for policy 0, policy_version 58360 (0.0006) [2023-03-07 15:21:30,794][213771] Updated weights for policy 0, policy_version 58370 (0.0007) [2023-03-07 15:21:31,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13246.0). Total num frames: 59773952. Throughput: 0: 13236.4. Samples: 59744450. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:21:31,106][213445] Avg episode reward: [(0, '4281.950')] [2023-03-07 15:21:31,586][213771] Updated weights for policy 0, policy_version 58380 (0.0007) [2023-03-07 15:21:32,357][213771] Updated weights for policy 0, policy_version 58390 (0.0006) [2023-03-07 15:21:33,121][213771] Updated weights for policy 0, policy_version 58400 (0.0006) [2023-03-07 15:21:33,896][213771] Updated weights for policy 0, policy_version 58410 (0.0006) [2023-03-07 15:21:34,678][213771] Updated weights for policy 0, policy_version 58420 (0.0006) [2023-03-07 15:21:35,440][213771] Updated weights for policy 0, policy_version 58430 (0.0005) [2023-03-07 15:21:36,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 59840512. Throughput: 0: 13243.8. Samples: 59824267. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:21:36,106][213445] Avg episode reward: [(0, '4332.844')] [2023-03-07 15:21:36,232][213771] Updated weights for policy 0, policy_version 58440 (0.0007) [2023-03-07 15:21:37,006][213771] Updated weights for policy 0, policy_version 58450 (0.0006) [2023-03-07 15:21:37,764][213771] Updated weights for policy 0, policy_version 58460 (0.0006) [2023-03-07 15:21:38,542][213771] Updated weights for policy 0, policy_version 58470 (0.0006) [2023-03-07 15:21:39,326][213771] Updated weights for policy 0, policy_version 58480 (0.0006) [2023-03-07 15:21:40,104][213771] Updated weights for policy 0, policy_version 58490 (0.0006) [2023-03-07 15:21:40,874][213771] Updated weights for policy 0, policy_version 58500 (0.0006) [2023-03-07 15:21:41,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13246.1). Total num frames: 59906048. Throughput: 0: 13232.2. Samples: 59903410. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:21:41,105][213445] Avg episode reward: [(0, '4275.108')] [2023-03-07 15:21:41,650][213771] Updated weights for policy 0, policy_version 58510 (0.0006) [2023-03-07 15:21:42,417][213771] Updated weights for policy 0, policy_version 58520 (0.0007) [2023-03-07 15:21:43,196][213771] Updated weights for policy 0, policy_version 58530 (0.0006) [2023-03-07 15:21:43,960][213771] Updated weights for policy 0, policy_version 58540 (0.0006) [2023-03-07 15:21:44,738][213771] Updated weights for policy 0, policy_version 58550 (0.0006) [2023-03-07 15:21:45,517][213771] Updated weights for policy 0, policy_version 58560 (0.0006) [2023-03-07 15:21:46,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 59972608. Throughput: 0: 13235.2. Samples: 59943250. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:21:46,106][213445] Avg episode reward: [(0, '4293.838')] [2023-03-07 15:21:46,274][213771] Updated weights for policy 0, policy_version 58570 (0.0005) [2023-03-07 15:21:47,048][213771] Updated weights for policy 0, policy_version 58580 (0.0006) [2023-03-07 15:21:47,837][213771] Updated weights for policy 0, policy_version 58590 (0.0007) [2023-03-07 15:21:48,594][213771] Updated weights for policy 0, policy_version 58600 (0.0006) [2023-03-07 15:21:49,377][213771] Updated weights for policy 0, policy_version 58610 (0.0007) [2023-03-07 15:21:50,150][213771] Updated weights for policy 0, policy_version 58620 (0.0006) [2023-03-07 15:21:50,940][213771] Updated weights for policy 0, policy_version 58630 (0.0006) [2023-03-07 15:21:51,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 60039168. Throughput: 0: 13236.6. Samples: 60022770. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:21:51,106][213445] Avg episode reward: [(0, '4280.432')] [2023-03-07 15:21:51,697][213771] Updated weights for policy 0, policy_version 58640 (0.0005) [2023-03-07 15:21:52,477][213771] Updated weights for policy 0, policy_version 58650 (0.0006) [2023-03-07 15:21:53,242][213771] Updated weights for policy 0, policy_version 58660 (0.0007) [2023-03-07 15:21:54,032][213771] Updated weights for policy 0, policy_version 58670 (0.0006) [2023-03-07 15:21:54,829][213771] Updated weights for policy 0, policy_version 58680 (0.0006) [2023-03-07 15:21:55,607][213771] Updated weights for policy 0, policy_version 58690 (0.0006) [2023-03-07 15:21:56,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13246.0). Total num frames: 60104704. Throughput: 0: 13221.3. Samples: 60101680. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:21:56,106][213445] Avg episode reward: [(0, '4265.472')] [2023-03-07 15:21:56,363][213771] Updated weights for policy 0, policy_version 58700 (0.0006) [2023-03-07 15:21:57,137][213771] Updated weights for policy 0, policy_version 58710 (0.0006) [2023-03-07 15:21:57,903][213771] Updated weights for policy 0, policy_version 58720 (0.0005) [2023-03-07 15:21:58,679][213771] Updated weights for policy 0, policy_version 58730 (0.0006) [2023-03-07 15:21:59,474][213771] Updated weights for policy 0, policy_version 58740 (0.0006) [2023-03-07 15:22:00,238][213771] Updated weights for policy 0, policy_version 58750 (0.0006) [2023-03-07 15:22:01,003][213771] Updated weights for policy 0, policy_version 58760 (0.0006) [2023-03-07 15:22:01,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13226.7, 300 sec: 13246.0). Total num frames: 60171264. Throughput: 0: 13226.0. Samples: 60141433. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:22:01,106][213445] Avg episode reward: [(0, '4284.220')] [2023-03-07 15:22:01,785][213771] Updated weights for policy 0, policy_version 58770 (0.0007) [2023-03-07 15:22:02,578][213771] Updated weights for policy 0, policy_version 58780 (0.0006) [2023-03-07 15:22:03,344][213771] Updated weights for policy 0, policy_version 58790 (0.0006) [2023-03-07 15:22:04,117][213771] Updated weights for policy 0, policy_version 58800 (0.0006) [2023-03-07 15:22:04,903][213771] Updated weights for policy 0, policy_version 58810 (0.0006) [2023-03-07 15:22:05,664][213771] Updated weights for policy 0, policy_version 58820 (0.0005) [2023-03-07 15:22:06,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13226.7, 300 sec: 13246.0). Total num frames: 60236800. Throughput: 0: 13228.7. Samples: 60220675. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:22:06,106][213445] Avg episode reward: [(0, '4327.464')] [2023-03-07 15:22:06,111][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000058825_60236800.pth... [2023-03-07 15:22:06,141][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000055722_57059328.pth [2023-03-07 15:22:06,431][213771] Updated weights for policy 0, policy_version 58830 (0.0006) [2023-03-07 15:22:07,213][213771] Updated weights for policy 0, policy_version 58840 (0.0006) [2023-03-07 15:22:07,971][213771] Updated weights for policy 0, policy_version 58850 (0.0006) [2023-03-07 15:22:08,740][213771] Updated weights for policy 0, policy_version 58860 (0.0007) [2023-03-07 15:22:09,497][213771] Updated weights for policy 0, policy_version 58870 (0.0007) [2023-03-07 15:22:10,261][213771] Updated weights for policy 0, policy_version 58880 (0.0006) [2023-03-07 15:22:11,045][213771] Updated weights for policy 0, policy_version 58890 (0.0006) [2023-03-07 15:22:11,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13246.1). Total num frames: 60303360. Throughput: 0: 13240.6. Samples: 60300504. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:22:11,106][213445] Avg episode reward: [(0, '4312.240')] [2023-03-07 15:22:11,822][213771] Updated weights for policy 0, policy_version 58900 (0.0006) [2023-03-07 15:22:12,565][213771] Updated weights for policy 0, policy_version 58910 (0.0006) [2023-03-07 15:22:13,351][213771] Updated weights for policy 0, policy_version 58920 (0.0006) [2023-03-07 15:22:14,135][213771] Updated weights for policy 0, policy_version 58930 (0.0005) [2023-03-07 15:22:14,906][213771] Updated weights for policy 0, policy_version 58940 (0.0006) [2023-03-07 15:22:15,676][213771] Updated weights for policy 0, policy_version 58950 (0.0006) [2023-03-07 15:22:16,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 60369920. Throughput: 0: 13245.4. Samples: 60340491. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:22:16,106][213445] Avg episode reward: [(0, '4252.380')] [2023-03-07 15:22:16,429][213771] Updated weights for policy 0, policy_version 58960 (0.0006) [2023-03-07 15:22:17,205][213771] Updated weights for policy 0, policy_version 58970 (0.0006) [2023-03-07 15:22:17,964][213771] Updated weights for policy 0, policy_version 58980 (0.0007) [2023-03-07 15:22:18,723][213771] Updated weights for policy 0, policy_version 58990 (0.0006) [2023-03-07 15:22:19,506][213771] Updated weights for policy 0, policy_version 59000 (0.0006) [2023-03-07 15:22:20,289][213771] Updated weights for policy 0, policy_version 59010 (0.0006) [2023-03-07 15:22:21,066][213771] Updated weights for policy 0, policy_version 59020 (0.0006) [2023-03-07 15:22:21,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 60436480. Throughput: 0: 13242.7. Samples: 60420191. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:22:21,106][213445] Avg episode reward: [(0, '4286.757')] [2023-03-07 15:22:21,840][213771] Updated weights for policy 0, policy_version 59030 (0.0006) [2023-03-07 15:22:22,616][213771] Updated weights for policy 0, policy_version 59040 (0.0005) [2023-03-07 15:22:23,393][213771] Updated weights for policy 0, policy_version 59050 (0.0006) [2023-03-07 15:22:24,170][213771] Updated weights for policy 0, policy_version 59060 (0.0006) [2023-03-07 15:22:24,918][213771] Updated weights for policy 0, policy_version 59070 (0.0006) [2023-03-07 15:22:25,705][213771] Updated weights for policy 0, policy_version 59080 (0.0006) [2023-03-07 15:22:26,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 60503040. Throughput: 0: 13251.1. Samples: 60499711. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:22:26,106][213445] Avg episode reward: [(0, '4266.458')] [2023-03-07 15:22:26,476][213771] Updated weights for policy 0, policy_version 59090 (0.0007) [2023-03-07 15:22:27,246][213771] Updated weights for policy 0, policy_version 59100 (0.0006) [2023-03-07 15:22:28,031][213771] Updated weights for policy 0, policy_version 59110 (0.0006) [2023-03-07 15:22:28,800][213771] Updated weights for policy 0, policy_version 59120 (0.0006) [2023-03-07 15:22:29,577][213771] Updated weights for policy 0, policy_version 59130 (0.0005) [2023-03-07 15:22:30,349][213771] Updated weights for policy 0, policy_version 59140 (0.0007) [2023-03-07 15:22:31,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 60568576. Throughput: 0: 13248.2. Samples: 60539418. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:22:31,106][213445] Avg episode reward: [(0, '4325.927')] [2023-03-07 15:22:31,121][213771] Updated weights for policy 0, policy_version 59150 (0.0005) [2023-03-07 15:22:31,889][213771] Updated weights for policy 0, policy_version 59160 (0.0006) [2023-03-07 15:22:32,661][213771] Updated weights for policy 0, policy_version 59170 (0.0006) [2023-03-07 15:22:33,439][213771] Updated weights for policy 0, policy_version 59180 (0.0006) [2023-03-07 15:22:34,209][213771] Updated weights for policy 0, policy_version 59190 (0.0006) [2023-03-07 15:22:34,958][213771] Updated weights for policy 0, policy_version 59200 (0.0006) [2023-03-07 15:22:35,737][213771] Updated weights for policy 0, policy_version 59210 (0.0007) [2023-03-07 15:22:36,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 60635136. Throughput: 0: 13248.9. Samples: 60618967. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:22:36,106][213445] Avg episode reward: [(0, '4360.722')] [2023-03-07 15:22:36,528][213771] Updated weights for policy 0, policy_version 59220 (0.0006) [2023-03-07 15:22:37,293][213771] Updated weights for policy 0, policy_version 59230 (0.0006) [2023-03-07 15:22:38,064][213771] Updated weights for policy 0, policy_version 59240 (0.0006) [2023-03-07 15:22:38,838][213771] Updated weights for policy 0, policy_version 59250 (0.0006) [2023-03-07 15:22:39,594][213771] Updated weights for policy 0, policy_version 59260 (0.0005) [2023-03-07 15:22:40,370][213771] Updated weights for policy 0, policy_version 59270 (0.0007) [2023-03-07 15:22:41,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 60701696. Throughput: 0: 13265.3. Samples: 60698617. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:22:41,106][213445] Avg episode reward: [(0, '4322.181')] [2023-03-07 15:22:41,139][213771] Updated weights for policy 0, policy_version 59280 (0.0007) [2023-03-07 15:22:41,905][213771] Updated weights for policy 0, policy_version 59290 (0.0006) [2023-03-07 15:22:42,674][213771] Updated weights for policy 0, policy_version 59300 (0.0005) [2023-03-07 15:22:43,442][213771] Updated weights for policy 0, policy_version 59310 (0.0005) [2023-03-07 15:22:44,224][213771] Updated weights for policy 0, policy_version 59320 (0.0006) [2023-03-07 15:22:44,994][213771] Updated weights for policy 0, policy_version 59330 (0.0006) [2023-03-07 15:22:45,755][213771] Updated weights for policy 0, policy_version 59340 (0.0005) [2023-03-07 15:22:46,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 60768256. Throughput: 0: 13267.9. Samples: 60738488. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:22:46,106][213445] Avg episode reward: [(0, '4322.558')] [2023-03-07 15:22:46,550][213771] Updated weights for policy 0, policy_version 59350 (0.0007) [2023-03-07 15:22:47,324][213771] Updated weights for policy 0, policy_version 59360 (0.0006) [2023-03-07 15:22:48,085][213771] Updated weights for policy 0, policy_version 59370 (0.0007) [2023-03-07 15:22:48,846][213771] Updated weights for policy 0, policy_version 59380 (0.0006) [2023-03-07 15:22:49,609][213771] Updated weights for policy 0, policy_version 59390 (0.0005) [2023-03-07 15:22:50,379][213771] Updated weights for policy 0, policy_version 59400 (0.0006) [2023-03-07 15:22:51,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 60834816. Throughput: 0: 13278.0. Samples: 60818184. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:22:51,106][213445] Avg episode reward: [(0, '4345.586')] [2023-03-07 15:22:51,153][213771] Updated weights for policy 0, policy_version 59410 (0.0007) [2023-03-07 15:22:51,917][213771] Updated weights for policy 0, policy_version 59420 (0.0006) [2023-03-07 15:22:52,690][213771] Updated weights for policy 0, policy_version 59430 (0.0006) [2023-03-07 15:22:53,485][213771] Updated weights for policy 0, policy_version 59440 (0.0006) [2023-03-07 15:22:54,227][213771] Updated weights for policy 0, policy_version 59450 (0.0006) [2023-03-07 15:22:55,013][213771] Updated weights for policy 0, policy_version 59460 (0.0006) [2023-03-07 15:22:55,799][213771] Updated weights for policy 0, policy_version 59470 (0.0006) [2023-03-07 15:22:56,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13246.0). Total num frames: 60900352. Throughput: 0: 13272.5. Samples: 60897769. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:22:56,106][213445] Avg episode reward: [(0, '4359.791')] [2023-03-07 15:22:56,578][213771] Updated weights for policy 0, policy_version 59480 (0.0006) [2023-03-07 15:22:57,361][213771] Updated weights for policy 0, policy_version 59490 (0.0006) [2023-03-07 15:22:58,131][213771] Updated weights for policy 0, policy_version 59500 (0.0006) [2023-03-07 15:22:58,900][213771] Updated weights for policy 0, policy_version 59510 (0.0006) [2023-03-07 15:22:59,698][213771] Updated weights for policy 0, policy_version 59520 (0.0008) [2023-03-07 15:23:00,466][213771] Updated weights for policy 0, policy_version 59530 (0.0007) [2023-03-07 15:23:01,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 60966912. Throughput: 0: 13267.0. Samples: 60937504. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:23:01,106][213445] Avg episode reward: [(0, '4352.269')] [2023-03-07 15:23:01,240][213771] Updated weights for policy 0, policy_version 59540 (0.0006) [2023-03-07 15:23:02,013][213771] Updated weights for policy 0, policy_version 59550 (0.0006) [2023-03-07 15:23:02,779][213771] Updated weights for policy 0, policy_version 59560 (0.0006) [2023-03-07 15:23:03,538][213771] Updated weights for policy 0, policy_version 59570 (0.0006) [2023-03-07 15:23:04,331][213771] Updated weights for policy 0, policy_version 59580 (0.0006) [2023-03-07 15:23:05,105][213771] Updated weights for policy 0, policy_version 59590 (0.0006) [2023-03-07 15:23:05,872][213771] Updated weights for policy 0, policy_version 59600 (0.0007) [2023-03-07 15:23:06,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13277.9, 300 sec: 13249.5). Total num frames: 61033472. Throughput: 0: 13258.5. Samples: 61016822. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:23:06,105][213445] Avg episode reward: [(0, '4341.812')] [2023-03-07 15:23:06,651][213771] Updated weights for policy 0, policy_version 59610 (0.0007) [2023-03-07 15:23:07,424][213771] Updated weights for policy 0, policy_version 59620 (0.0007) [2023-03-07 15:23:08,217][213771] Updated weights for policy 0, policy_version 59630 (0.0006) [2023-03-07 15:23:08,974][213771] Updated weights for policy 0, policy_version 59640 (0.0006) [2023-03-07 15:23:09,766][213771] Updated weights for policy 0, policy_version 59650 (0.0006) [2023-03-07 15:23:10,557][213771] Updated weights for policy 0, policy_version 59660 (0.0007) [2023-03-07 15:23:11,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 61099008. Throughput: 0: 13245.9. Samples: 61095774. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:23:11,105][213445] Avg episode reward: [(0, '4294.876')] [2023-03-07 15:23:11,312][213771] Updated weights for policy 0, policy_version 59670 (0.0006) [2023-03-07 15:23:12,069][213771] Updated weights for policy 0, policy_version 59680 (0.0006) [2023-03-07 15:23:12,866][213771] Updated weights for policy 0, policy_version 59690 (0.0006) [2023-03-07 15:23:13,612][213771] Updated weights for policy 0, policy_version 59700 (0.0006) [2023-03-07 15:23:14,368][213771] Updated weights for policy 0, policy_version 59710 (0.0006) [2023-03-07 15:23:15,146][213771] Updated weights for policy 0, policy_version 59720 (0.0006) [2023-03-07 15:23:15,913][213771] Updated weights for policy 0, policy_version 59730 (0.0006) [2023-03-07 15:23:16,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 61165568. Throughput: 0: 13251.2. Samples: 61135719. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:23:16,106][213445] Avg episode reward: [(0, '4305.924')] [2023-03-07 15:23:16,697][213771] Updated weights for policy 0, policy_version 59740 (0.0005) [2023-03-07 15:23:17,478][213771] Updated weights for policy 0, policy_version 59750 (0.0007) [2023-03-07 15:23:18,236][213771] Updated weights for policy 0, policy_version 59760 (0.0006) [2023-03-07 15:23:19,017][213771] Updated weights for policy 0, policy_version 59770 (0.0006) [2023-03-07 15:23:19,779][213771] Updated weights for policy 0, policy_version 59780 (0.0007) [2023-03-07 15:23:20,572][213771] Updated weights for policy 0, policy_version 59790 (0.0006) [2023-03-07 15:23:21,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 61231104. Throughput: 0: 13253.3. Samples: 61215365. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:23:21,106][213445] Avg episode reward: [(0, '4369.261')] [2023-03-07 15:23:21,357][213771] Updated weights for policy 0, policy_version 59800 (0.0006) [2023-03-07 15:23:22,112][213771] Updated weights for policy 0, policy_version 59810 (0.0006) [2023-03-07 15:23:22,885][213771] Updated weights for policy 0, policy_version 59820 (0.0006) [2023-03-07 15:23:23,678][213771] Updated weights for policy 0, policy_version 59830 (0.0005) [2023-03-07 15:23:24,438][213771] Updated weights for policy 0, policy_version 59840 (0.0006) [2023-03-07 15:23:25,195][213771] Updated weights for policy 0, policy_version 59850 (0.0005) [2023-03-07 15:23:25,976][213771] Updated weights for policy 0, policy_version 59860 (0.0007) [2023-03-07 15:23:26,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.8, 300 sec: 13246.0). Total num frames: 61297664. Throughput: 0: 13248.0. Samples: 61294779. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:23:26,106][213445] Avg episode reward: [(0, '4355.160')] [2023-03-07 15:23:26,734][213771] Updated weights for policy 0, policy_version 59870 (0.0006) [2023-03-07 15:23:27,521][213771] Updated weights for policy 0, policy_version 59880 (0.0007) [2023-03-07 15:23:28,297][213771] Updated weights for policy 0, policy_version 59890 (0.0006) [2023-03-07 15:23:29,067][213771] Updated weights for policy 0, policy_version 59900 (0.0006) [2023-03-07 15:23:29,846][213771] Updated weights for policy 0, policy_version 59910 (0.0006) [2023-03-07 15:23:30,621][213771] Updated weights for policy 0, policy_version 59920 (0.0007) [2023-03-07 15:23:31,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 61364224. Throughput: 0: 13247.1. Samples: 61334608. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:23:31,105][213445] Avg episode reward: [(0, '4385.055')] [2023-03-07 15:23:31,383][213771] Updated weights for policy 0, policy_version 59930 (0.0006) [2023-03-07 15:23:32,161][213771] Updated weights for policy 0, policy_version 59940 (0.0006) [2023-03-07 15:23:32,935][213771] Updated weights for policy 0, policy_version 59950 (0.0006) [2023-03-07 15:23:33,695][213771] Updated weights for policy 0, policy_version 59960 (0.0007) [2023-03-07 15:23:34,470][213771] Updated weights for policy 0, policy_version 59970 (0.0007) [2023-03-07 15:23:35,241][213771] Updated weights for policy 0, policy_version 59980 (0.0006) [2023-03-07 15:23:36,019][213771] Updated weights for policy 0, policy_version 59990 (0.0005) [2023-03-07 15:23:36,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 61430784. Throughput: 0: 13243.7. Samples: 61414152. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 15:23:36,106][213445] Avg episode reward: [(0, '4315.602')] [2023-03-07 15:23:36,781][213771] Updated weights for policy 0, policy_version 60000 (0.0006) [2023-03-07 15:23:37,565][213771] Updated weights for policy 0, policy_version 60010 (0.0005) [2023-03-07 15:23:38,346][213771] Updated weights for policy 0, policy_version 60020 (0.0006) [2023-03-07 15:23:39,110][213771] Updated weights for policy 0, policy_version 60030 (0.0007) [2023-03-07 15:23:39,891][213771] Updated weights for policy 0, policy_version 60040 (0.0006) [2023-03-07 15:23:40,660][213771] Updated weights for policy 0, policy_version 60050 (0.0006) [2023-03-07 15:23:41,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 61496320. Throughput: 0: 13239.6. Samples: 61493551. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 15:23:41,106][213445] Avg episode reward: [(0, '4346.228')] [2023-03-07 15:23:41,441][213771] Updated weights for policy 0, policy_version 60060 (0.0007) [2023-03-07 15:23:42,220][213771] Updated weights for policy 0, policy_version 60070 (0.0007) [2023-03-07 15:23:43,006][213771] Updated weights for policy 0, policy_version 60080 (0.0006) [2023-03-07 15:23:43,785][213771] Updated weights for policy 0, policy_version 60090 (0.0005) [2023-03-07 15:23:44,565][213771] Updated weights for policy 0, policy_version 60100 (0.0007) [2023-03-07 15:23:45,337][213771] Updated weights for policy 0, policy_version 60110 (0.0006) [2023-03-07 15:23:46,089][213771] Updated weights for policy 0, policy_version 60120 (0.0006) [2023-03-07 15:23:46,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13246.1). Total num frames: 61562880. Throughput: 0: 13231.8. Samples: 61532937. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 15:23:46,106][213445] Avg episode reward: [(0, '4316.287')] [2023-03-07 15:23:46,850][213771] Updated weights for policy 0, policy_version 60130 (0.0006) [2023-03-07 15:23:47,653][213771] Updated weights for policy 0, policy_version 60140 (0.0006) [2023-03-07 15:23:48,412][213771] Updated weights for policy 0, policy_version 60150 (0.0006) [2023-03-07 15:23:49,189][213771] Updated weights for policy 0, policy_version 60160 (0.0006) [2023-03-07 15:23:49,959][213771] Updated weights for policy 0, policy_version 60170 (0.0006) [2023-03-07 15:23:50,740][213771] Updated weights for policy 0, policy_version 60180 (0.0006) [2023-03-07 15:23:51,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13226.6, 300 sec: 13242.6). Total num frames: 61628416. Throughput: 0: 13232.1. Samples: 61612269. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 15:23:51,106][213445] Avg episode reward: [(0, '4211.380')] [2023-03-07 15:23:51,512][213771] Updated weights for policy 0, policy_version 60190 (0.0005) [2023-03-07 15:23:52,284][213771] Updated weights for policy 0, policy_version 60200 (0.0006) [2023-03-07 15:23:53,054][213771] Updated weights for policy 0, policy_version 60210 (0.0006) [2023-03-07 15:23:53,834][213771] Updated weights for policy 0, policy_version 60220 (0.0006) [2023-03-07 15:23:54,600][213771] Updated weights for policy 0, policy_version 60230 (0.0007) [2023-03-07 15:23:55,388][213771] Updated weights for policy 0, policy_version 60240 (0.0007) [2023-03-07 15:23:56,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 61694976. Throughput: 0: 13242.5. Samples: 61691688. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 15:23:56,106][213445] Avg episode reward: [(0, '4261.089')] [2023-03-07 15:23:56,158][213771] Updated weights for policy 0, policy_version 60250 (0.0006) [2023-03-07 15:23:56,941][213771] Updated weights for policy 0, policy_version 60260 (0.0006) [2023-03-07 15:23:57,705][213771] Updated weights for policy 0, policy_version 60270 (0.0006) [2023-03-07 15:23:58,473][213771] Updated weights for policy 0, policy_version 60280 (0.0006) [2023-03-07 15:23:59,230][213771] Updated weights for policy 0, policy_version 60290 (0.0006) [2023-03-07 15:24:00,028][213771] Updated weights for policy 0, policy_version 60300 (0.0006) [2023-03-07 15:24:00,765][213771] Updated weights for policy 0, policy_version 60310 (0.0006) [2023-03-07 15:24:01,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13243.7, 300 sec: 13246.1). Total num frames: 61761536. Throughput: 0: 13240.7. Samples: 61731553. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 15:24:01,106][213445] Avg episode reward: [(0, '4286.392')] [2023-03-07 15:24:01,550][213771] Updated weights for policy 0, policy_version 60320 (0.0006) [2023-03-07 15:24:02,327][213771] Updated weights for policy 0, policy_version 60330 (0.0005) [2023-03-07 15:24:03,071][213771] Updated weights for policy 0, policy_version 60340 (0.0006) [2023-03-07 15:24:03,857][213771] Updated weights for policy 0, policy_version 60350 (0.0006) [2023-03-07 15:24:04,646][213771] Updated weights for policy 0, policy_version 60360 (0.0006) [2023-03-07 15:24:05,413][213771] Updated weights for policy 0, policy_version 60370 (0.0006) [2023-03-07 15:24:06,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13226.6, 300 sec: 13242.6). Total num frames: 61827072. Throughput: 0: 13239.9. Samples: 61811163. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 15:24:06,106][213445] Avg episode reward: [(0, '4309.627')] [2023-03-07 15:24:06,119][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000060379_61828096.pth... [2023-03-07 15:24:06,148][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000057275_58649600.pth [2023-03-07 15:24:06,187][213771] Updated weights for policy 0, policy_version 60380 (0.0006) [2023-03-07 15:24:06,936][213771] Updated weights for policy 0, policy_version 60390 (0.0006) [2023-03-07 15:24:07,717][213771] Updated weights for policy 0, policy_version 60400 (0.0006) [2023-03-07 15:24:08,496][213771] Updated weights for policy 0, policy_version 60410 (0.0007) [2023-03-07 15:24:09,266][213771] Updated weights for policy 0, policy_version 60420 (0.0006) [2023-03-07 15:24:10,040][213771] Updated weights for policy 0, policy_version 60430 (0.0006) [2023-03-07 15:24:10,809][213771] Updated weights for policy 0, policy_version 60440 (0.0006) [2023-03-07 15:24:11,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13246.0). Total num frames: 61894656. Throughput: 0: 13246.3. Samples: 61890863. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:24:11,106][213445] Avg episode reward: [(0, '4296.354')] [2023-03-07 15:24:11,585][213771] Updated weights for policy 0, policy_version 60450 (0.0005) [2023-03-07 15:24:12,337][213771] Updated weights for policy 0, policy_version 60460 (0.0007) [2023-03-07 15:24:13,117][213771] Updated weights for policy 0, policy_version 60470 (0.0006) [2023-03-07 15:24:13,880][213771] Updated weights for policy 0, policy_version 60480 (0.0005) [2023-03-07 15:24:14,653][213771] Updated weights for policy 0, policy_version 60490 (0.0006) [2023-03-07 15:24:15,421][213771] Updated weights for policy 0, policy_version 60500 (0.0006) [2023-03-07 15:24:16,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13243.7, 300 sec: 13246.1). Total num frames: 61960192. Throughput: 0: 13249.0. Samples: 61930815. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:24:16,106][213445] Avg episode reward: [(0, '4384.628')] [2023-03-07 15:24:16,187][213771] Updated weights for policy 0, policy_version 60510 (0.0006) [2023-03-07 15:24:16,966][213771] Updated weights for policy 0, policy_version 60520 (0.0006) [2023-03-07 15:24:17,736][213771] Updated weights for policy 0, policy_version 60530 (0.0008) [2023-03-07 15:24:18,515][213771] Updated weights for policy 0, policy_version 60540 (0.0006) [2023-03-07 15:24:19,291][213771] Updated weights for policy 0, policy_version 60550 (0.0005) [2023-03-07 15:24:20,059][213771] Updated weights for policy 0, policy_version 60560 (0.0005) [2023-03-07 15:24:20,829][213771] Updated weights for policy 0, policy_version 60570 (0.0006) [2023-03-07 15:24:21,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13260.8, 300 sec: 13246.0). Total num frames: 62026752. Throughput: 0: 13249.2. Samples: 62010368. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:24:21,106][213445] Avg episode reward: [(0, '4330.198')] [2023-03-07 15:24:21,609][213771] Updated weights for policy 0, policy_version 60580 (0.0006) [2023-03-07 15:24:22,377][213771] Updated weights for policy 0, policy_version 60590 (0.0006) [2023-03-07 15:24:23,159][213771] Updated weights for policy 0, policy_version 60600 (0.0006) [2023-03-07 15:24:23,933][213771] Updated weights for policy 0, policy_version 60610 (0.0005) [2023-03-07 15:24:24,711][213771] Updated weights for policy 0, policy_version 60620 (0.0005) [2023-03-07 15:24:25,485][213771] Updated weights for policy 0, policy_version 60630 (0.0006) [2023-03-07 15:24:26,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13260.8, 300 sec: 13246.0). Total num frames: 62093312. Throughput: 0: 13252.7. Samples: 62089922. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:24:26,106][213445] Avg episode reward: [(0, '4316.528')] [2023-03-07 15:24:26,238][213771] Updated weights for policy 0, policy_version 60640 (0.0006) [2023-03-07 15:24:27,006][213771] Updated weights for policy 0, policy_version 60650 (0.0006) [2023-03-07 15:24:27,776][213771] Updated weights for policy 0, policy_version 60660 (0.0005) [2023-03-07 15:24:28,562][213771] Updated weights for policy 0, policy_version 60670 (0.0006) [2023-03-07 15:24:29,325][213771] Updated weights for policy 0, policy_version 60680 (0.0006) [2023-03-07 15:24:30,094][213771] Updated weights for policy 0, policy_version 60690 (0.0006) [2023-03-07 15:24:30,868][213771] Updated weights for policy 0, policy_version 60700 (0.0006) [2023-03-07 15:24:31,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13243.7, 300 sec: 13246.1). Total num frames: 62158848. Throughput: 0: 13262.7. Samples: 62129757. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:24:31,106][213445] Avg episode reward: [(0, '4303.159')] [2023-03-07 15:24:31,636][213771] Updated weights for policy 0, policy_version 60710 (0.0006) [2023-03-07 15:24:32,418][213771] Updated weights for policy 0, policy_version 60720 (0.0005) [2023-03-07 15:24:33,187][213771] Updated weights for policy 0, policy_version 60730 (0.0006) [2023-03-07 15:24:33,965][213771] Updated weights for policy 0, policy_version 60740 (0.0007) [2023-03-07 15:24:34,734][213771] Updated weights for policy 0, policy_version 60750 (0.0006) [2023-03-07 15:24:35,489][213771] Updated weights for policy 0, policy_version 60760 (0.0006) [2023-03-07 15:24:36,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 62225408. Throughput: 0: 13267.1. Samples: 62209288. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:24:36,106][213445] Avg episode reward: [(0, '4379.608')] [2023-03-07 15:24:36,293][213771] Updated weights for policy 0, policy_version 60770 (0.0006) [2023-03-07 15:24:37,050][213771] Updated weights for policy 0, policy_version 60780 (0.0005) [2023-03-07 15:24:37,809][213771] Updated weights for policy 0, policy_version 60790 (0.0006) [2023-03-07 15:24:38,584][213771] Updated weights for policy 0, policy_version 60800 (0.0005) [2023-03-07 15:24:39,373][213771] Updated weights for policy 0, policy_version 60810 (0.0006) [2023-03-07 15:24:40,148][213771] Updated weights for policy 0, policy_version 60820 (0.0006) [2023-03-07 15:24:40,923][213771] Updated weights for policy 0, policy_version 60830 (0.0006) [2023-03-07 15:24:41,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 62291968. Throughput: 0: 13265.5. Samples: 62288634. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:24:41,105][213445] Avg episode reward: [(0, '4358.942')] [2023-03-07 15:24:41,702][213771] Updated weights for policy 0, policy_version 60840 (0.0006) [2023-03-07 15:24:42,479][213771] Updated weights for policy 0, policy_version 60850 (0.0006) [2023-03-07 15:24:43,249][213771] Updated weights for policy 0, policy_version 60860 (0.0006) [2023-03-07 15:24:44,022][213771] Updated weights for policy 0, policy_version 60870 (0.0006) [2023-03-07 15:24:44,805][213771] Updated weights for policy 0, policy_version 60880 (0.0006) [2023-03-07 15:24:45,570][213771] Updated weights for policy 0, policy_version 60890 (0.0007) [2023-03-07 15:24:46,105][213445] Fps is (10 sec: 13209.9, 60 sec: 13243.8, 300 sec: 13246.1). Total num frames: 62357504. Throughput: 0: 13261.0. Samples: 62328295. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:24:46,105][213445] Avg episode reward: [(0, '4326.689')] [2023-03-07 15:24:46,351][213771] Updated weights for policy 0, policy_version 60900 (0.0006) [2023-03-07 15:24:47,121][213771] Updated weights for policy 0, policy_version 60910 (0.0006) [2023-03-07 15:24:47,906][213771] Updated weights for policy 0, policy_version 60920 (0.0006) [2023-03-07 15:24:48,673][213771] Updated weights for policy 0, policy_version 60930 (0.0006) [2023-03-07 15:24:49,448][213771] Updated weights for policy 0, policy_version 60940 (0.0007) [2023-03-07 15:24:50,222][213771] Updated weights for policy 0, policy_version 60950 (0.0006) [2023-03-07 15:24:50,993][213771] Updated weights for policy 0, policy_version 60960 (0.0006) [2023-03-07 15:24:51,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13260.8, 300 sec: 13242.6). Total num frames: 62424064. Throughput: 0: 13256.5. Samples: 62407706. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:24:51,106][213445] Avg episode reward: [(0, '4391.927')] [2023-03-07 15:24:51,760][213771] Updated weights for policy 0, policy_version 60970 (0.0006) [2023-03-07 15:24:52,524][213771] Updated weights for policy 0, policy_version 60980 (0.0006) [2023-03-07 15:24:53,286][213771] Updated weights for policy 0, policy_version 60990 (0.0006) [2023-03-07 15:24:54,073][213771] Updated weights for policy 0, policy_version 61000 (0.0007) [2023-03-07 15:24:54,825][213771] Updated weights for policy 0, policy_version 61010 (0.0006) [2023-03-07 15:24:55,588][213771] Updated weights for policy 0, policy_version 61020 (0.0006) [2023-03-07 15:24:56,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13260.8, 300 sec: 13246.0). Total num frames: 62490624. Throughput: 0: 13262.5. Samples: 62487676. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:24:56,106][213445] Avg episode reward: [(0, '4287.152')] [2023-03-07 15:24:56,358][213771] Updated weights for policy 0, policy_version 61030 (0.0006) [2023-03-07 15:24:57,133][213771] Updated weights for policy 0, policy_version 61040 (0.0007) [2023-03-07 15:24:57,886][213771] Updated weights for policy 0, policy_version 61050 (0.0005) [2023-03-07 15:24:58,668][213771] Updated weights for policy 0, policy_version 61060 (0.0006) [2023-03-07 15:24:59,434][213771] Updated weights for policy 0, policy_version 61070 (0.0005) [2023-03-07 15:25:00,198][213771] Updated weights for policy 0, policy_version 61080 (0.0006) [2023-03-07 15:25:00,975][213771] Updated weights for policy 0, policy_version 61090 (0.0006) [2023-03-07 15:25:01,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13260.8, 300 sec: 13246.1). Total num frames: 62557184. Throughput: 0: 13264.8. Samples: 62527727. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:25:01,105][213445] Avg episode reward: [(0, '4103.533')] [2023-03-07 15:25:01,750][213771] Updated weights for policy 0, policy_version 61100 (0.0006) [2023-03-07 15:25:02,546][213771] Updated weights for policy 0, policy_version 61110 (0.0006) [2023-03-07 15:25:03,307][213771] Updated weights for policy 0, policy_version 61120 (0.0005) [2023-03-07 15:25:04,076][213771] Updated weights for policy 0, policy_version 61130 (0.0006) [2023-03-07 15:25:04,858][213771] Updated weights for policy 0, policy_version 61140 (0.0006) [2023-03-07 15:25:05,634][213771] Updated weights for policy 0, policy_version 61150 (0.0007) [2023-03-07 15:25:06,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13277.9, 300 sec: 13246.0). Total num frames: 62623744. Throughput: 0: 13262.4. Samples: 62607177. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:25:06,106][213445] Avg episode reward: [(0, '4265.074')] [2023-03-07 15:25:06,401][213771] Updated weights for policy 0, policy_version 61160 (0.0007) [2023-03-07 15:25:07,162][213771] Updated weights for policy 0, policy_version 61170 (0.0006) [2023-03-07 15:25:07,928][213771] Updated weights for policy 0, policy_version 61180 (0.0006) [2023-03-07 15:25:08,700][213771] Updated weights for policy 0, policy_version 61190 (0.0006) [2023-03-07 15:25:09,473][213771] Updated weights for policy 0, policy_version 61200 (0.0006) [2023-03-07 15:25:10,250][213771] Updated weights for policy 0, policy_version 61210 (0.0006) [2023-03-07 15:25:11,011][213771] Updated weights for policy 0, policy_version 61220 (0.0007) [2023-03-07 15:25:11,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 62690304. Throughput: 0: 13265.6. Samples: 62686872. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:25:11,105][213445] Avg episode reward: [(0, '4275.923')] [2023-03-07 15:25:11,789][213771] Updated weights for policy 0, policy_version 61230 (0.0007) [2023-03-07 15:25:12,569][213771] Updated weights for policy 0, policy_version 61240 (0.0006) [2023-03-07 15:25:13,345][213771] Updated weights for policy 0, policy_version 61250 (0.0006) [2023-03-07 15:25:14,108][213771] Updated weights for policy 0, policy_version 61260 (0.0006) [2023-03-07 15:25:14,860][213771] Updated weights for policy 0, policy_version 61270 (0.0006) [2023-03-07 15:25:15,632][213771] Updated weights for policy 0, policy_version 61280 (0.0006) [2023-03-07 15:25:16,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13277.9, 300 sec: 13249.5). Total num frames: 62756864. Throughput: 0: 13262.1. Samples: 62726553. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:25:16,106][213445] Avg episode reward: [(0, '4361.396')] [2023-03-07 15:25:16,427][213771] Updated weights for policy 0, policy_version 61290 (0.0007) [2023-03-07 15:25:17,196][213771] Updated weights for policy 0, policy_version 61300 (0.0006) [2023-03-07 15:25:17,963][213771] Updated weights for policy 0, policy_version 61310 (0.0007) [2023-03-07 15:25:18,737][213771] Updated weights for policy 0, policy_version 61320 (0.0006) [2023-03-07 15:25:19,501][213771] Updated weights for policy 0, policy_version 61330 (0.0005) [2023-03-07 15:25:20,264][213771] Updated weights for policy 0, policy_version 61340 (0.0006) [2023-03-07 15:25:21,036][213771] Updated weights for policy 0, policy_version 61350 (0.0006) [2023-03-07 15:25:21,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13277.9, 300 sec: 13253.0). Total num frames: 62823424. Throughput: 0: 13271.2. Samples: 62806489. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:25:21,106][213445] Avg episode reward: [(0, '4275.861')] [2023-03-07 15:25:21,805][213771] Updated weights for policy 0, policy_version 61360 (0.0006) [2023-03-07 15:25:22,570][213771] Updated weights for policy 0, policy_version 61370 (0.0007) [2023-03-07 15:25:23,366][213771] Updated weights for policy 0, policy_version 61380 (0.0006) [2023-03-07 15:25:24,117][213771] Updated weights for policy 0, policy_version 61390 (0.0006) [2023-03-07 15:25:24,902][213771] Updated weights for policy 0, policy_version 61400 (0.0006) [2023-03-07 15:25:25,678][213771] Updated weights for policy 0, policy_version 61410 (0.0006) [2023-03-07 15:25:26,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 62888960. Throughput: 0: 13276.1. Samples: 62886061. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:25:26,106][213445] Avg episode reward: [(0, '4290.944')] [2023-03-07 15:25:26,427][213771] Updated weights for policy 0, policy_version 61420 (0.0006) [2023-03-07 15:25:27,207][213771] Updated weights for policy 0, policy_version 61430 (0.0006) [2023-03-07 15:25:27,978][213771] Updated weights for policy 0, policy_version 61440 (0.0006) [2023-03-07 15:25:28,732][213771] Updated weights for policy 0, policy_version 61450 (0.0006) [2023-03-07 15:25:29,514][213771] Updated weights for policy 0, policy_version 61460 (0.0007) [2023-03-07 15:25:30,291][213771] Updated weights for policy 0, policy_version 61470 (0.0006) [2023-03-07 15:25:31,049][213771] Updated weights for policy 0, policy_version 61480 (0.0006) [2023-03-07 15:25:31,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13277.9, 300 sec: 13253.0). Total num frames: 62955520. Throughput: 0: 13281.5. Samples: 62925964. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:25:31,106][213445] Avg episode reward: [(0, '4387.240')] [2023-03-07 15:25:31,829][213771] Updated weights for policy 0, policy_version 61490 (0.0006) [2023-03-07 15:25:32,594][213771] Updated weights for policy 0, policy_version 61500 (0.0007) [2023-03-07 15:25:33,358][213771] Updated weights for policy 0, policy_version 61510 (0.0006) [2023-03-07 15:25:34,124][213771] Updated weights for policy 0, policy_version 61520 (0.0006) [2023-03-07 15:25:34,910][213771] Updated weights for policy 0, policy_version 61530 (0.0005) [2023-03-07 15:25:35,688][213771] Updated weights for policy 0, policy_version 61540 (0.0006) [2023-03-07 15:25:36,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13277.9, 300 sec: 13256.5). Total num frames: 63022080. Throughput: 0: 13289.8. Samples: 63005744. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:25:36,105][213445] Avg episode reward: [(0, '4355.769')] [2023-03-07 15:25:36,465][213771] Updated weights for policy 0, policy_version 61550 (0.0007) [2023-03-07 15:25:37,245][213771] Updated weights for policy 0, policy_version 61560 (0.0007) [2023-03-07 15:25:38,017][213771] Updated weights for policy 0, policy_version 61570 (0.0006) [2023-03-07 15:25:38,801][213771] Updated weights for policy 0, policy_version 61580 (0.0005) [2023-03-07 15:25:39,582][213771] Updated weights for policy 0, policy_version 61590 (0.0006) [2023-03-07 15:25:40,349][213771] Updated weights for policy 0, policy_version 61600 (0.0006) [2023-03-07 15:25:41,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 63087616. Throughput: 0: 13269.5. Samples: 63084802. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:25:41,106][213445] Avg episode reward: [(0, '4399.397')] [2023-03-07 15:25:41,126][213771] Updated weights for policy 0, policy_version 61610 (0.0006) [2023-03-07 15:25:41,907][213771] Updated weights for policy 0, policy_version 61620 (0.0006) [2023-03-07 15:25:42,697][213771] Updated weights for policy 0, policy_version 61630 (0.0006) [2023-03-07 15:25:43,459][213771] Updated weights for policy 0, policy_version 61640 (0.0006) [2023-03-07 15:25:44,233][213771] Updated weights for policy 0, policy_version 61650 (0.0006) [2023-03-07 15:25:45,001][213771] Updated weights for policy 0, policy_version 61660 (0.0006) [2023-03-07 15:25:45,778][213771] Updated weights for policy 0, policy_version 61670 (0.0006) [2023-03-07 15:25:46,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13277.8, 300 sec: 13253.0). Total num frames: 63154176. Throughput: 0: 13261.5. Samples: 63124496. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:25:46,106][213445] Avg episode reward: [(0, '4390.733')] [2023-03-07 15:25:46,557][213771] Updated weights for policy 0, policy_version 61680 (0.0006) [2023-03-07 15:25:47,335][213771] Updated weights for policy 0, policy_version 61690 (0.0006) [2023-03-07 15:25:48,117][213771] Updated weights for policy 0, policy_version 61700 (0.0006) [2023-03-07 15:25:48,885][213771] Updated weights for policy 0, policy_version 61710 (0.0007) [2023-03-07 15:25:49,671][213771] Updated weights for policy 0, policy_version 61720 (0.0007) [2023-03-07 15:25:50,442][213771] Updated weights for policy 0, policy_version 61730 (0.0006) [2023-03-07 15:25:51,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 63219712. Throughput: 0: 13249.0. Samples: 63203381. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:25:51,106][213445] Avg episode reward: [(0, '4433.321')] [2023-03-07 15:25:51,211][213771] Updated weights for policy 0, policy_version 61740 (0.0005) [2023-03-07 15:25:52,005][213771] Updated weights for policy 0, policy_version 61750 (0.0007) [2023-03-07 15:25:52,791][213771] Updated weights for policy 0, policy_version 61760 (0.0007) [2023-03-07 15:25:53,553][213771] Updated weights for policy 0, policy_version 61770 (0.0005) [2023-03-07 15:25:54,332][213771] Updated weights for policy 0, policy_version 61780 (0.0005) [2023-03-07 15:25:55,111][213771] Updated weights for policy 0, policy_version 61790 (0.0006) [2023-03-07 15:25:55,873][213771] Updated weights for policy 0, policy_version 61800 (0.0006) [2023-03-07 15:25:56,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 63286272. Throughput: 0: 13240.7. Samples: 63282703. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:25:56,106][213445] Avg episode reward: [(0, '4439.474')] [2023-03-07 15:25:56,663][213771] Updated weights for policy 0, policy_version 61810 (0.0006) [2023-03-07 15:25:57,457][213771] Updated weights for policy 0, policy_version 61820 (0.0006) [2023-03-07 15:25:58,212][213771] Updated weights for policy 0, policy_version 61830 (0.0006) [2023-03-07 15:25:59,001][213771] Updated weights for policy 0, policy_version 61840 (0.0006) [2023-03-07 15:25:59,776][213771] Updated weights for policy 0, policy_version 61850 (0.0006) [2023-03-07 15:26:00,537][213771] Updated weights for policy 0, policy_version 61860 (0.0006) [2023-03-07 15:26:01,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 63351808. Throughput: 0: 13235.3. Samples: 63322140. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:26:01,106][213445] Avg episode reward: [(0, '4374.201')] [2023-03-07 15:26:01,314][213771] Updated weights for policy 0, policy_version 61870 (0.0006) [2023-03-07 15:26:02,085][213771] Updated weights for policy 0, policy_version 61880 (0.0006) [2023-03-07 15:26:02,866][213771] Updated weights for policy 0, policy_version 61890 (0.0006) [2023-03-07 15:26:03,637][213771] Updated weights for policy 0, policy_version 61900 (0.0006) [2023-03-07 15:26:04,397][213771] Updated weights for policy 0, policy_version 61910 (0.0006) [2023-03-07 15:26:05,172][213771] Updated weights for policy 0, policy_version 61920 (0.0007) [2023-03-07 15:26:05,944][213771] Updated weights for policy 0, policy_version 61930 (0.0007) [2023-03-07 15:26:06,105][213445] Fps is (10 sec: 13209.3, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 63418368. Throughput: 0: 13220.1. Samples: 63401398. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:26:06,106][213445] Avg episode reward: [(0, '4378.351')] [2023-03-07 15:26:06,111][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000061932_63418368.pth... [2023-03-07 15:26:06,140][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000058825_60236800.pth [2023-03-07 15:26:06,722][213771] Updated weights for policy 0, policy_version 61940 (0.0007) [2023-03-07 15:26:07,497][213771] Updated weights for policy 0, policy_version 61950 (0.0007) [2023-03-07 15:26:08,270][213771] Updated weights for policy 0, policy_version 61960 (0.0006) [2023-03-07 15:26:09,031][213771] Updated weights for policy 0, policy_version 61970 (0.0005) [2023-03-07 15:26:09,821][213771] Updated weights for policy 0, policy_version 61980 (0.0006) [2023-03-07 15:26:10,590][213771] Updated weights for policy 0, policy_version 61990 (0.0007) [2023-03-07 15:26:11,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13249.5). Total num frames: 63483904. Throughput: 0: 13215.5. Samples: 63480756. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:26:11,106][213445] Avg episode reward: [(0, '4351.114')] [2023-03-07 15:26:11,374][213771] Updated weights for policy 0, policy_version 62000 (0.0006) [2023-03-07 15:26:12,138][213771] Updated weights for policy 0, policy_version 62010 (0.0006) [2023-03-07 15:26:12,912][213771] Updated weights for policy 0, policy_version 62020 (0.0006) [2023-03-07 15:26:13,675][213771] Updated weights for policy 0, policy_version 62030 (0.0007) [2023-03-07 15:26:14,440][213771] Updated weights for policy 0, policy_version 62040 (0.0006) [2023-03-07 15:26:15,201][213771] Updated weights for policy 0, policy_version 62050 (0.0006) [2023-03-07 15:26:15,986][213771] Updated weights for policy 0, policy_version 62060 (0.0006) [2023-03-07 15:26:16,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13226.7, 300 sec: 13249.5). Total num frames: 63550464. Throughput: 0: 13219.7. Samples: 63520848. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:26:16,106][213445] Avg episode reward: [(0, '4350.529')] [2023-03-07 15:26:16,759][213771] Updated weights for policy 0, policy_version 62070 (0.0006) [2023-03-07 15:26:17,513][213771] Updated weights for policy 0, policy_version 62080 (0.0006) [2023-03-07 15:26:18,269][213771] Updated weights for policy 0, policy_version 62090 (0.0006) [2023-03-07 15:26:19,040][213771] Updated weights for policy 0, policy_version 62100 (0.0006) [2023-03-07 15:26:19,817][213771] Updated weights for policy 0, policy_version 62110 (0.0006) [2023-03-07 15:26:20,602][213771] Updated weights for policy 0, policy_version 62120 (0.0006) [2023-03-07 15:26:21,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13226.7, 300 sec: 13253.0). Total num frames: 63617024. Throughput: 0: 13222.0. Samples: 63600733. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:26:21,105][213445] Avg episode reward: [(0, '4396.597')] [2023-03-07 15:26:21,364][213771] Updated weights for policy 0, policy_version 62130 (0.0006) [2023-03-07 15:26:22,141][213771] Updated weights for policy 0, policy_version 62140 (0.0006) [2023-03-07 15:26:22,917][213771] Updated weights for policy 0, policy_version 62150 (0.0006) [2023-03-07 15:26:23,674][213771] Updated weights for policy 0, policy_version 62160 (0.0005) [2023-03-07 15:26:24,454][213771] Updated weights for policy 0, policy_version 62170 (0.0007) [2023-03-07 15:26:25,234][213771] Updated weights for policy 0, policy_version 62180 (0.0006) [2023-03-07 15:26:26,001][213771] Updated weights for policy 0, policy_version 62190 (0.0006) [2023-03-07 15:26:26,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13243.7, 300 sec: 13253.0). Total num frames: 63683584. Throughput: 0: 13234.8. Samples: 63680367. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:26:26,106][213445] Avg episode reward: [(0, '4396.358')] [2023-03-07 15:26:26,765][213771] Updated weights for policy 0, policy_version 62200 (0.0005) [2023-03-07 15:26:27,526][213771] Updated weights for policy 0, policy_version 62210 (0.0006) [2023-03-07 15:26:28,306][213771] Updated weights for policy 0, policy_version 62220 (0.0006) [2023-03-07 15:26:29,066][213771] Updated weights for policy 0, policy_version 62230 (0.0006) [2023-03-07 15:26:29,834][213771] Updated weights for policy 0, policy_version 62240 (0.0006) [2023-03-07 15:26:30,614][213771] Updated weights for policy 0, policy_version 62250 (0.0006) [2023-03-07 15:26:31,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13243.7, 300 sec: 13253.0). Total num frames: 63750144. Throughput: 0: 13242.1. Samples: 63720394. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:26:31,106][213445] Avg episode reward: [(0, '4386.336')] [2023-03-07 15:26:31,393][213771] Updated weights for policy 0, policy_version 62260 (0.0006) [2023-03-07 15:26:32,174][213771] Updated weights for policy 0, policy_version 62270 (0.0005) [2023-03-07 15:26:32,973][213771] Updated weights for policy 0, policy_version 62280 (0.0006) [2023-03-07 15:26:33,725][213771] Updated weights for policy 0, policy_version 62290 (0.0006) [2023-03-07 15:26:34,488][213771] Updated weights for policy 0, policy_version 62300 (0.0006) [2023-03-07 15:26:35,263][213771] Updated weights for policy 0, policy_version 62310 (0.0006) [2023-03-07 15:26:36,037][213771] Updated weights for policy 0, policy_version 62320 (0.0005) [2023-03-07 15:26:36,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13226.6, 300 sec: 13253.0). Total num frames: 63815680. Throughput: 0: 13252.5. Samples: 63799742. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:26:36,106][213445] Avg episode reward: [(0, '4366.098')] [2023-03-07 15:26:36,811][213771] Updated weights for policy 0, policy_version 62330 (0.0005) [2023-03-07 15:26:37,590][213771] Updated weights for policy 0, policy_version 62340 (0.0006) [2023-03-07 15:26:38,351][213771] Updated weights for policy 0, policy_version 62350 (0.0006) [2023-03-07 15:26:39,121][213771] Updated weights for policy 0, policy_version 62360 (0.0006) [2023-03-07 15:26:39,889][213771] Updated weights for policy 0, policy_version 62370 (0.0005) [2023-03-07 15:26:40,653][213771] Updated weights for policy 0, policy_version 62380 (0.0006) [2023-03-07 15:26:41,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13260.8, 300 sec: 13256.5). Total num frames: 63883264. Throughput: 0: 13261.1. Samples: 63879452. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:26:41,106][213445] Avg episode reward: [(0, '4332.358')] [2023-03-07 15:26:41,424][213771] Updated weights for policy 0, policy_version 62390 (0.0007) [2023-03-07 15:26:42,199][213771] Updated weights for policy 0, policy_version 62400 (0.0005) [2023-03-07 15:26:42,982][213771] Updated weights for policy 0, policy_version 62410 (0.0006) [2023-03-07 15:26:43,754][213771] Updated weights for policy 0, policy_version 62420 (0.0007) [2023-03-07 15:26:44,517][213771] Updated weights for policy 0, policy_version 62430 (0.0006) [2023-03-07 15:26:45,282][213771] Updated weights for policy 0, policy_version 62440 (0.0007) [2023-03-07 15:26:46,042][213771] Updated weights for policy 0, policy_version 62450 (0.0006) [2023-03-07 15:26:46,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13243.7, 300 sec: 13253.0). Total num frames: 63948800. Throughput: 0: 13270.7. Samples: 63919323. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:26:46,106][213445] Avg episode reward: [(0, '4411.766')] [2023-03-07 15:26:46,818][213771] Updated weights for policy 0, policy_version 62460 (0.0007) [2023-03-07 15:26:47,595][213771] Updated weights for policy 0, policy_version 62470 (0.0006) [2023-03-07 15:26:48,364][213771] Updated weights for policy 0, policy_version 62480 (0.0006) [2023-03-07 15:26:49,134][213771] Updated weights for policy 0, policy_version 62490 (0.0006) [2023-03-07 15:26:49,884][213771] Updated weights for policy 0, policy_version 62500 (0.0006) [2023-03-07 15:26:50,667][213771] Updated weights for policy 0, policy_version 62510 (0.0006) [2023-03-07 15:26:51,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13256.5). Total num frames: 64015360. Throughput: 0: 13282.7. Samples: 63999119. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:26:51,106][213445] Avg episode reward: [(0, '4420.153')] [2023-03-07 15:26:51,438][213771] Updated weights for policy 0, policy_version 62520 (0.0006) [2023-03-07 15:26:52,207][213771] Updated weights for policy 0, policy_version 62530 (0.0006) [2023-03-07 15:26:52,970][213771] Updated weights for policy 0, policy_version 62540 (0.0006) [2023-03-07 15:26:53,743][213771] Updated weights for policy 0, policy_version 62550 (0.0006) [2023-03-07 15:26:54,513][213771] Updated weights for policy 0, policy_version 62560 (0.0006) [2023-03-07 15:26:55,304][213771] Updated weights for policy 0, policy_version 62570 (0.0007) [2023-03-07 15:26:56,074][213771] Updated weights for policy 0, policy_version 62580 (0.0006) [2023-03-07 15:26:56,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13260.8, 300 sec: 13256.5). Total num frames: 64081920. Throughput: 0: 13289.2. Samples: 64078768. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:26:56,105][213445] Avg episode reward: [(0, '4420.389')] [2023-03-07 15:26:56,865][213771] Updated weights for policy 0, policy_version 62590 (0.0006) [2023-03-07 15:26:57,632][213771] Updated weights for policy 0, policy_version 62600 (0.0005) [2023-03-07 15:26:58,402][213771] Updated weights for policy 0, policy_version 62610 (0.0006) [2023-03-07 15:26:59,177][213771] Updated weights for policy 0, policy_version 62620 (0.0007) [2023-03-07 15:26:59,954][213771] Updated weights for policy 0, policy_version 62630 (0.0006) [2023-03-07 15:27:00,730][213771] Updated weights for policy 0, policy_version 62640 (0.0006) [2023-03-07 15:27:01,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13260.8, 300 sec: 13256.5). Total num frames: 64147456. Throughput: 0: 13276.0. Samples: 64118267. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:27:01,106][213445] Avg episode reward: [(0, '4449.429')] [2023-03-07 15:27:01,515][213771] Updated weights for policy 0, policy_version 62650 (0.0007) [2023-03-07 15:27:02,270][213771] Updated weights for policy 0, policy_version 62660 (0.0005) [2023-03-07 15:27:03,037][213771] Updated weights for policy 0, policy_version 62670 (0.0005) [2023-03-07 15:27:03,809][213771] Updated weights for policy 0, policy_version 62680 (0.0005) [2023-03-07 15:27:04,582][213771] Updated weights for policy 0, policy_version 62690 (0.0007) [2023-03-07 15:27:05,362][213771] Updated weights for policy 0, policy_version 62700 (0.0006) [2023-03-07 15:27:06,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13256.5). Total num frames: 64214016. Throughput: 0: 13266.4. Samples: 64197721. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:27:06,106][213445] Avg episode reward: [(0, '4411.942')] [2023-03-07 15:27:06,137][213771] Updated weights for policy 0, policy_version 62710 (0.0007) [2023-03-07 15:27:06,910][213771] Updated weights for policy 0, policy_version 62720 (0.0006) [2023-03-07 15:27:07,691][213771] Updated weights for policy 0, policy_version 62730 (0.0006) [2023-03-07 15:27:08,454][213771] Updated weights for policy 0, policy_version 62740 (0.0008) [2023-03-07 15:27:09,241][213771] Updated weights for policy 0, policy_version 62750 (0.0006) [2023-03-07 15:27:10,022][213771] Updated weights for policy 0, policy_version 62760 (0.0007) [2023-03-07 15:27:10,805][213771] Updated weights for policy 0, policy_version 62770 (0.0006) [2023-03-07 15:27:11,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 64279552. Throughput: 0: 13257.1. Samples: 64276935. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:27:11,105][213445] Avg episode reward: [(0, '4417.379')] [2023-03-07 15:27:11,565][213771] Updated weights for policy 0, policy_version 62780 (0.0005) [2023-03-07 15:27:12,338][213771] Updated weights for policy 0, policy_version 62790 (0.0007) [2023-03-07 15:27:13,125][213771] Updated weights for policy 0, policy_version 62800 (0.0005) [2023-03-07 15:27:13,892][213771] Updated weights for policy 0, policy_version 62810 (0.0007) [2023-03-07 15:27:14,667][213771] Updated weights for policy 0, policy_version 62820 (0.0006) [2023-03-07 15:27:15,447][213771] Updated weights for policy 0, policy_version 62830 (0.0005) [2023-03-07 15:27:16,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 64346112. Throughput: 0: 13252.1. Samples: 64316736. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:27:16,106][213445] Avg episode reward: [(0, '4379.809')] [2023-03-07 15:27:16,221][213771] Updated weights for policy 0, policy_version 62840 (0.0006) [2023-03-07 15:27:16,993][213771] Updated weights for policy 0, policy_version 62850 (0.0006) [2023-03-07 15:27:17,770][213771] Updated weights for policy 0, policy_version 62860 (0.0006) [2023-03-07 15:27:18,526][213771] Updated weights for policy 0, policy_version 62870 (0.0006) [2023-03-07 15:27:19,291][213771] Updated weights for policy 0, policy_version 62880 (0.0006) [2023-03-07 15:27:20,089][213771] Updated weights for policy 0, policy_version 62890 (0.0006) [2023-03-07 15:27:20,861][213771] Updated weights for policy 0, policy_version 62900 (0.0005) [2023-03-07 15:27:21,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 64412672. Throughput: 0: 13253.7. Samples: 64396160. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 15:27:21,106][213445] Avg episode reward: [(0, '4399.628')] [2023-03-07 15:27:21,633][213771] Updated weights for policy 0, policy_version 62910 (0.0005) [2023-03-07 15:27:22,398][213771] Updated weights for policy 0, policy_version 62920 (0.0006) [2023-03-07 15:27:23,189][213771] Updated weights for policy 0, policy_version 62930 (0.0006) [2023-03-07 15:27:23,961][213771] Updated weights for policy 0, policy_version 62940 (0.0006) [2023-03-07 15:27:24,733][213771] Updated weights for policy 0, policy_version 62950 (0.0006) [2023-03-07 15:27:25,504][213771] Updated weights for policy 0, policy_version 62960 (0.0006) [2023-03-07 15:27:26,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13253.0). Total num frames: 64478208. Throughput: 0: 13243.4. Samples: 64475408. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 15:27:26,106][213445] Avg episode reward: [(0, '4409.374')] [2023-03-07 15:27:26,294][213771] Updated weights for policy 0, policy_version 62970 (0.0006) [2023-03-07 15:27:27,049][213771] Updated weights for policy 0, policy_version 62980 (0.0006) [2023-03-07 15:27:27,831][213771] Updated weights for policy 0, policy_version 62990 (0.0006) [2023-03-07 15:27:28,588][213771] Updated weights for policy 0, policy_version 63000 (0.0007) [2023-03-07 15:27:29,340][213771] Updated weights for policy 0, policy_version 63010 (0.0006) [2023-03-07 15:27:30,122][213771] Updated weights for policy 0, policy_version 63020 (0.0006) [2023-03-07 15:27:30,903][213771] Updated weights for policy 0, policy_version 63030 (0.0006) [2023-03-07 15:27:31,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13243.8, 300 sec: 13253.0). Total num frames: 64544768. Throughput: 0: 13243.6. Samples: 64515283. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 15:27:31,106][213445] Avg episode reward: [(0, '4353.834')] [2023-03-07 15:27:31,687][213771] Updated weights for policy 0, policy_version 63040 (0.0006) [2023-03-07 15:27:32,468][213771] Updated weights for policy 0, policy_version 63050 (0.0006) [2023-03-07 15:27:33,244][213771] Updated weights for policy 0, policy_version 63060 (0.0007) [2023-03-07 15:27:34,025][213771] Updated weights for policy 0, policy_version 63070 (0.0006) [2023-03-07 15:27:34,794][213771] Updated weights for policy 0, policy_version 63080 (0.0007) [2023-03-07 15:27:35,570][213771] Updated weights for policy 0, policy_version 63090 (0.0006) [2023-03-07 15:27:36,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 64611328. Throughput: 0: 13230.3. Samples: 64594480. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 15:27:36,106][213445] Avg episode reward: [(0, '4361.899')] [2023-03-07 15:27:36,333][213771] Updated weights for policy 0, policy_version 63100 (0.0005) [2023-03-07 15:27:37,110][213771] Updated weights for policy 0, policy_version 63110 (0.0006) [2023-03-07 15:27:37,877][213771] Updated weights for policy 0, policy_version 63120 (0.0005) [2023-03-07 15:27:38,652][213771] Updated weights for policy 0, policy_version 63130 (0.0006) [2023-03-07 15:27:39,425][213771] Updated weights for policy 0, policy_version 63140 (0.0006) [2023-03-07 15:27:40,192][213771] Updated weights for policy 0, policy_version 63150 (0.0006) [2023-03-07 15:27:40,965][213771] Updated weights for policy 0, policy_version 63160 (0.0006) [2023-03-07 15:27:41,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13249.5). Total num frames: 64676864. Throughput: 0: 13229.3. Samples: 64674088. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 15:27:41,106][213445] Avg episode reward: [(0, '4314.420')] [2023-03-07 15:27:41,749][213771] Updated weights for policy 0, policy_version 63170 (0.0006) [2023-03-07 15:27:42,505][213771] Updated weights for policy 0, policy_version 63180 (0.0006) [2023-03-07 15:27:43,281][213771] Updated weights for policy 0, policy_version 63190 (0.0006) [2023-03-07 15:27:44,046][213771] Updated weights for policy 0, policy_version 63200 (0.0005) [2023-03-07 15:27:44,837][213771] Updated weights for policy 0, policy_version 63210 (0.0006) [2023-03-07 15:27:45,624][213771] Updated weights for policy 0, policy_version 63220 (0.0006) [2023-03-07 15:27:46,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.8, 300 sec: 13249.5). Total num frames: 64743424. Throughput: 0: 13238.3. Samples: 64713989. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 15:27:46,106][213445] Avg episode reward: [(0, '4346.038')] [2023-03-07 15:27:46,391][213771] Updated weights for policy 0, policy_version 63230 (0.0006) [2023-03-07 15:27:47,167][213771] Updated weights for policy 0, policy_version 63240 (0.0006) [2023-03-07 15:27:47,940][213771] Updated weights for policy 0, policy_version 63250 (0.0006) [2023-03-07 15:27:48,711][213771] Updated weights for policy 0, policy_version 63260 (0.0006) [2023-03-07 15:27:49,491][213771] Updated weights for policy 0, policy_version 63270 (0.0006) [2023-03-07 15:27:50,273][213771] Updated weights for policy 0, policy_version 63280 (0.0006) [2023-03-07 15:27:51,038][213771] Updated weights for policy 0, policy_version 63290 (0.0006) [2023-03-07 15:27:51,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13226.7, 300 sec: 13249.5). Total num frames: 64808960. Throughput: 0: 13227.0. Samples: 64792938. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 15:27:51,106][213445] Avg episode reward: [(0, '4355.393')] [2023-03-07 15:27:51,810][213771] Updated weights for policy 0, policy_version 63300 (0.0006) [2023-03-07 15:27:52,586][213771] Updated weights for policy 0, policy_version 63310 (0.0006) [2023-03-07 15:27:53,363][213771] Updated weights for policy 0, policy_version 63320 (0.0006) [2023-03-07 15:27:54,144][213771] Updated weights for policy 0, policy_version 63330 (0.0005) [2023-03-07 15:27:54,909][213771] Updated weights for policy 0, policy_version 63340 (0.0006) [2023-03-07 15:27:55,675][213771] Updated weights for policy 0, policy_version 63350 (0.0006) [2023-03-07 15:27:56,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13226.7, 300 sec: 13249.5). Total num frames: 64875520. Throughput: 0: 13235.3. Samples: 64872525. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 15:27:56,106][213445] Avg episode reward: [(0, '4417.430')] [2023-03-07 15:27:56,451][213771] Updated weights for policy 0, policy_version 63360 (0.0006) [2023-03-07 15:27:57,241][213771] Updated weights for policy 0, policy_version 63370 (0.0006) [2023-03-07 15:27:58,012][213771] Updated weights for policy 0, policy_version 63380 (0.0006) [2023-03-07 15:27:58,777][213771] Updated weights for policy 0, policy_version 63390 (0.0006) [2023-03-07 15:27:59,557][213771] Updated weights for policy 0, policy_version 63400 (0.0005) [2023-03-07 15:28:00,322][213771] Updated weights for policy 0, policy_version 63410 (0.0008) [2023-03-07 15:28:01,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13246.0). Total num frames: 64941056. Throughput: 0: 13230.8. Samples: 64912124. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:28:01,106][213445] Avg episode reward: [(0, '4408.067')] [2023-03-07 15:28:01,108][213771] Updated weights for policy 0, policy_version 63420 (0.0006) [2023-03-07 15:28:01,878][213771] Updated weights for policy 0, policy_version 63430 (0.0006) [2023-03-07 15:28:02,662][213771] Updated weights for policy 0, policy_version 63440 (0.0006) [2023-03-07 15:28:03,429][213771] Updated weights for policy 0, policy_version 63450 (0.0006) [2023-03-07 15:28:04,202][213771] Updated weights for policy 0, policy_version 63460 (0.0006) [2023-03-07 15:28:04,989][213771] Updated weights for policy 0, policy_version 63470 (0.0006) [2023-03-07 15:28:05,763][213771] Updated weights for policy 0, policy_version 63480 (0.0006) [2023-03-07 15:28:06,105][213445] Fps is (10 sec: 13209.3, 60 sec: 13226.6, 300 sec: 13249.5). Total num frames: 65007616. Throughput: 0: 13225.5. Samples: 64991309. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:28:06,117][213445] Avg episode reward: [(0, '4465.995')] [2023-03-07 15:28:06,123][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000063484_65007616.pth... [2023-03-07 15:28:06,155][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000060379_61828096.pth [2023-03-07 15:28:06,532][213771] Updated weights for policy 0, policy_version 63490 (0.0006) [2023-03-07 15:28:07,294][213771] Updated weights for policy 0, policy_version 63500 (0.0006) [2023-03-07 15:28:08,059][213771] Updated weights for policy 0, policy_version 63510 (0.0006) [2023-03-07 15:28:08,835][213771] Updated weights for policy 0, policy_version 63520 (0.0006) [2023-03-07 15:28:09,615][213771] Updated weights for policy 0, policy_version 63530 (0.0006) [2023-03-07 15:28:10,370][213771] Updated weights for policy 0, policy_version 63540 (0.0006) [2023-03-07 15:28:11,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 65074176. Throughput: 0: 13236.0. Samples: 65071027. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:28:11,106][213445] Avg episode reward: [(0, '4437.046')] [2023-03-07 15:28:11,166][213771] Updated weights for policy 0, policy_version 63550 (0.0006) [2023-03-07 15:28:11,939][213771] Updated weights for policy 0, policy_version 63560 (0.0005) [2023-03-07 15:28:12,710][213771] Updated weights for policy 0, policy_version 63570 (0.0007) [2023-03-07 15:28:13,499][213771] Updated weights for policy 0, policy_version 63580 (0.0006) [2023-03-07 15:28:14,274][213771] Updated weights for policy 0, policy_version 63590 (0.0006) [2023-03-07 15:28:15,034][213771] Updated weights for policy 0, policy_version 63600 (0.0005) [2023-03-07 15:28:15,809][213771] Updated weights for policy 0, policy_version 63610 (0.0007) [2023-03-07 15:28:16,105][213445] Fps is (10 sec: 13209.9, 60 sec: 13226.7, 300 sec: 13249.5). Total num frames: 65139712. Throughput: 0: 13225.4. Samples: 65110426. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:28:16,106][213445] Avg episode reward: [(0, '4419.126')] [2023-03-07 15:28:16,589][213771] Updated weights for policy 0, policy_version 63620 (0.0006) [2023-03-07 15:28:17,360][213771] Updated weights for policy 0, policy_version 63630 (0.0006) [2023-03-07 15:28:18,133][213771] Updated weights for policy 0, policy_version 63640 (0.0006) [2023-03-07 15:28:18,905][213771] Updated weights for policy 0, policy_version 63650 (0.0006) [2023-03-07 15:28:19,672][213771] Updated weights for policy 0, policy_version 63660 (0.0006) [2023-03-07 15:28:20,445][213771] Updated weights for policy 0, policy_version 63670 (0.0007) [2023-03-07 15:28:21,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13226.7, 300 sec: 13249.5). Total num frames: 65206272. Throughput: 0: 13232.4. Samples: 65189941. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:28:21,106][213445] Avg episode reward: [(0, '4397.530')] [2023-03-07 15:28:21,229][213771] Updated weights for policy 0, policy_version 63680 (0.0006) [2023-03-07 15:28:21,999][213771] Updated weights for policy 0, policy_version 63690 (0.0005) [2023-03-07 15:28:22,770][213771] Updated weights for policy 0, policy_version 63700 (0.0006) [2023-03-07 15:28:23,558][213771] Updated weights for policy 0, policy_version 63710 (0.0007) [2023-03-07 15:28:24,319][213771] Updated weights for policy 0, policy_version 63720 (0.0007) [2023-03-07 15:28:25,083][213771] Updated weights for policy 0, policy_version 63730 (0.0006) [2023-03-07 15:28:25,854][213771] Updated weights for policy 0, policy_version 63740 (0.0007) [2023-03-07 15:28:26,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13243.8, 300 sec: 13249.5). Total num frames: 65272832. Throughput: 0: 13229.8. Samples: 65269431. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:28:26,106][213445] Avg episode reward: [(0, '4445.730')] [2023-03-07 15:28:26,637][213771] Updated weights for policy 0, policy_version 63750 (0.0006) [2023-03-07 15:28:27,414][213771] Updated weights for policy 0, policy_version 63760 (0.0006) [2023-03-07 15:28:28,193][213771] Updated weights for policy 0, policy_version 63770 (0.0005) [2023-03-07 15:28:28,958][213771] Updated weights for policy 0, policy_version 63780 (0.0005) [2023-03-07 15:28:29,742][213771] Updated weights for policy 0, policy_version 63790 (0.0006) [2023-03-07 15:28:30,495][213771] Updated weights for policy 0, policy_version 63800 (0.0006) [2023-03-07 15:28:31,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13246.0). Total num frames: 65338368. Throughput: 0: 13221.2. Samples: 65308943. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:28:31,106][213445] Avg episode reward: [(0, '4451.705')] [2023-03-07 15:28:31,269][213771] Updated weights for policy 0, policy_version 63810 (0.0006) [2023-03-07 15:28:32,049][213771] Updated weights for policy 0, policy_version 63820 (0.0006) [2023-03-07 15:28:32,821][213771] Updated weights for policy 0, policy_version 63830 (0.0006) [2023-03-07 15:28:33,582][213771] Updated weights for policy 0, policy_version 63840 (0.0005) [2023-03-07 15:28:34,372][213771] Updated weights for policy 0, policy_version 63850 (0.0006) [2023-03-07 15:28:35,135][213771] Updated weights for policy 0, policy_version 63860 (0.0006) [2023-03-07 15:28:35,892][213771] Updated weights for policy 0, policy_version 63870 (0.0005) [2023-03-07 15:28:36,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13226.7, 300 sec: 13249.5). Total num frames: 65404928. Throughput: 0: 13240.2. Samples: 65388746. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:28:36,106][213445] Avg episode reward: [(0, '4387.156')] [2023-03-07 15:28:36,684][213771] Updated weights for policy 0, policy_version 63880 (0.0006) [2023-03-07 15:28:37,469][213771] Updated weights for policy 0, policy_version 63890 (0.0006) [2023-03-07 15:28:38,225][213771] Updated weights for policy 0, policy_version 63900 (0.0005) [2023-03-07 15:28:39,004][213771] Updated weights for policy 0, policy_version 63910 (0.0007) [2023-03-07 15:28:39,768][213771] Updated weights for policy 0, policy_version 63920 (0.0006) [2023-03-07 15:28:40,550][213771] Updated weights for policy 0, policy_version 63930 (0.0007) [2023-03-07 15:28:41,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 65471488. Throughput: 0: 13236.7. Samples: 65468177. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:28:41,106][213445] Avg episode reward: [(0, '4424.071')] [2023-03-07 15:28:41,320][213771] Updated weights for policy 0, policy_version 63940 (0.0007) [2023-03-07 15:28:42,085][213771] Updated weights for policy 0, policy_version 63950 (0.0006) [2023-03-07 15:28:42,877][213771] Updated weights for policy 0, policy_version 63960 (0.0006) [2023-03-07 15:28:43,620][213771] Updated weights for policy 0, policy_version 63970 (0.0006) [2023-03-07 15:28:44,391][213771] Updated weights for policy 0, policy_version 63980 (0.0006) [2023-03-07 15:28:45,165][213771] Updated weights for policy 0, policy_version 63990 (0.0006) [2023-03-07 15:28:45,922][213771] Updated weights for policy 0, policy_version 64000 (0.0006) [2023-03-07 15:28:46,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13243.7, 300 sec: 13253.0). Total num frames: 65538048. Throughput: 0: 13245.2. Samples: 65508157. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:28:46,106][213445] Avg episode reward: [(0, '4435.700')] [2023-03-07 15:28:46,718][213771] Updated weights for policy 0, policy_version 64010 (0.0006) [2023-03-07 15:28:47,498][213771] Updated weights for policy 0, policy_version 64020 (0.0005) [2023-03-07 15:28:48,266][213771] Updated weights for policy 0, policy_version 64030 (0.0006) [2023-03-07 15:28:49,030][213771] Updated weights for policy 0, policy_version 64040 (0.0007) [2023-03-07 15:28:49,816][213771] Updated weights for policy 0, policy_version 64050 (0.0006) [2023-03-07 15:28:50,566][213771] Updated weights for policy 0, policy_version 64060 (0.0006) [2023-03-07 15:28:51,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.8, 300 sec: 13249.5). Total num frames: 65603584. Throughput: 0: 13249.8. Samples: 65587549. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:28:51,106][213445] Avg episode reward: [(0, '4457.701')] [2023-03-07 15:28:51,350][213771] Updated weights for policy 0, policy_version 64070 (0.0005) [2023-03-07 15:28:52,122][213771] Updated weights for policy 0, policy_version 64080 (0.0006) [2023-03-07 15:28:52,916][213771] Updated weights for policy 0, policy_version 64090 (0.0007) [2023-03-07 15:28:53,692][213771] Updated weights for policy 0, policy_version 64100 (0.0006) [2023-03-07 15:28:54,454][213771] Updated weights for policy 0, policy_version 64110 (0.0006) [2023-03-07 15:28:55,233][213771] Updated weights for policy 0, policy_version 64120 (0.0006) [2023-03-07 15:28:55,993][213771] Updated weights for policy 0, policy_version 64130 (0.0006) [2023-03-07 15:28:56,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 65670144. Throughput: 0: 13244.2. Samples: 65667017. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:28:56,106][213445] Avg episode reward: [(0, '4408.766')] [2023-03-07 15:28:56,781][213771] Updated weights for policy 0, policy_version 64140 (0.0008) [2023-03-07 15:28:57,569][213771] Updated weights for policy 0, policy_version 64150 (0.0006) [2023-03-07 15:28:58,330][213771] Updated weights for policy 0, policy_version 64160 (0.0006) [2023-03-07 15:28:59,101][213771] Updated weights for policy 0, policy_version 64170 (0.0006) [2023-03-07 15:28:59,881][213771] Updated weights for policy 0, policy_version 64180 (0.0006) [2023-03-07 15:29:00,644][213771] Updated weights for policy 0, policy_version 64190 (0.0007) [2023-03-07 15:29:01,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 65735680. Throughput: 0: 13249.2. Samples: 65706640. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:29:01,106][213445] Avg episode reward: [(0, '4437.161')] [2023-03-07 15:29:01,410][213771] Updated weights for policy 0, policy_version 64200 (0.0006) [2023-03-07 15:29:02,199][213771] Updated weights for policy 0, policy_version 64210 (0.0006) [2023-03-07 15:29:02,987][213771] Updated weights for policy 0, policy_version 64220 (0.0005) [2023-03-07 15:29:03,744][213771] Updated weights for policy 0, policy_version 64230 (0.0006) [2023-03-07 15:29:04,527][213771] Updated weights for policy 0, policy_version 64240 (0.0007) [2023-03-07 15:29:05,322][213771] Updated weights for policy 0, policy_version 64250 (0.0008) [2023-03-07 15:29:06,091][213771] Updated weights for policy 0, policy_version 64260 (0.0007) [2023-03-07 15:29:06,105][213445] Fps is (10 sec: 13209.9, 60 sec: 13243.8, 300 sec: 13246.1). Total num frames: 65802240. Throughput: 0: 13239.8. Samples: 65785731. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:29:06,105][213445] Avg episode reward: [(0, '4390.836')] [2023-03-07 15:29:06,865][213771] Updated weights for policy 0, policy_version 64270 (0.0006) [2023-03-07 15:29:07,644][213771] Updated weights for policy 0, policy_version 64280 (0.0007) [2023-03-07 15:29:08,406][213771] Updated weights for policy 0, policy_version 64290 (0.0006) [2023-03-07 15:29:09,189][213771] Updated weights for policy 0, policy_version 64300 (0.0006) [2023-03-07 15:29:09,941][213771] Updated weights for policy 0, policy_version 64310 (0.0007) [2023-03-07 15:29:10,711][213771] Updated weights for policy 0, policy_version 64320 (0.0006) [2023-03-07 15:29:11,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13226.7, 300 sec: 13246.0). Total num frames: 65867776. Throughput: 0: 13239.6. Samples: 65865214. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:29:11,106][213445] Avg episode reward: [(0, '4442.497')] [2023-03-07 15:29:11,489][213771] Updated weights for policy 0, policy_version 64330 (0.0006) [2023-03-07 15:29:12,249][213771] Updated weights for policy 0, policy_version 64340 (0.0007) [2023-03-07 15:29:13,031][213771] Updated weights for policy 0, policy_version 64350 (0.0006) [2023-03-07 15:29:13,799][213771] Updated weights for policy 0, policy_version 64360 (0.0007) [2023-03-07 15:29:14,576][213771] Updated weights for policy 0, policy_version 64370 (0.0006) [2023-03-07 15:29:15,338][213771] Updated weights for policy 0, policy_version 64380 (0.0006) [2023-03-07 15:29:16,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 65934336. Throughput: 0: 13247.6. Samples: 65905087. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:29:16,106][213445] Avg episode reward: [(0, '4385.578')] [2023-03-07 15:29:16,126][213771] Updated weights for policy 0, policy_version 64390 (0.0006) [2023-03-07 15:29:16,889][213771] Updated weights for policy 0, policy_version 64400 (0.0007) [2023-03-07 15:29:17,655][213771] Updated weights for policy 0, policy_version 64410 (0.0006) [2023-03-07 15:29:18,436][213771] Updated weights for policy 0, policy_version 64420 (0.0006) [2023-03-07 15:29:19,202][213771] Updated weights for policy 0, policy_version 64430 (0.0006) [2023-03-07 15:29:19,961][213771] Updated weights for policy 0, policy_version 64440 (0.0006) [2023-03-07 15:29:20,737][213771] Updated weights for policy 0, policy_version 64450 (0.0006) [2023-03-07 15:29:21,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13243.8, 300 sec: 13246.1). Total num frames: 66000896. Throughput: 0: 13244.6. Samples: 65984754. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:29:21,106][213445] Avg episode reward: [(0, '4420.261')] [2023-03-07 15:29:21,518][213771] Updated weights for policy 0, policy_version 64460 (0.0007) [2023-03-07 15:29:22,282][213771] Updated weights for policy 0, policy_version 64470 (0.0007) [2023-03-07 15:29:23,046][213771] Updated weights for policy 0, policy_version 64480 (0.0006) [2023-03-07 15:29:23,830][213771] Updated weights for policy 0, policy_version 64490 (0.0006) [2023-03-07 15:29:24,582][213771] Updated weights for policy 0, policy_version 64500 (0.0006) [2023-03-07 15:29:25,368][213771] Updated weights for policy 0, policy_version 64510 (0.0006) [2023-03-07 15:29:26,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 66067456. Throughput: 0: 13250.5. Samples: 66064448. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:29:26,116][213445] Avg episode reward: [(0, '4463.091')] [2023-03-07 15:29:26,135][213771] Updated weights for policy 0, policy_version 64520 (0.0006) [2023-03-07 15:29:26,903][213771] Updated weights for policy 0, policy_version 64530 (0.0006) [2023-03-07 15:29:27,679][213771] Updated weights for policy 0, policy_version 64540 (0.0006) [2023-03-07 15:29:28,455][213771] Updated weights for policy 0, policy_version 64550 (0.0006) [2023-03-07 15:29:29,226][213771] Updated weights for policy 0, policy_version 64560 (0.0006) [2023-03-07 15:29:29,998][213771] Updated weights for policy 0, policy_version 64570 (0.0006) [2023-03-07 15:29:30,772][213771] Updated weights for policy 0, policy_version 64580 (0.0005) [2023-03-07 15:29:31,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 66134016. Throughput: 0: 13246.8. Samples: 66104262. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:29:31,116][213445] Avg episode reward: [(0, '4422.881')] [2023-03-07 15:29:31,546][213771] Updated weights for policy 0, policy_version 64590 (0.0006) [2023-03-07 15:29:32,321][213771] Updated weights for policy 0, policy_version 64600 (0.0006) [2023-03-07 15:29:33,083][213771] Updated weights for policy 0, policy_version 64610 (0.0005) [2023-03-07 15:29:33,872][213771] Updated weights for policy 0, policy_version 64620 (0.0007) [2023-03-07 15:29:34,646][213771] Updated weights for policy 0, policy_version 64630 (0.0007) [2023-03-07 15:29:35,401][213771] Updated weights for policy 0, policy_version 64640 (0.0006) [2023-03-07 15:29:36,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 66199552. Throughput: 0: 13246.1. Samples: 66183622. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:29:36,116][213445] Avg episode reward: [(0, '4417.586')] [2023-03-07 15:29:36,181][213771] Updated weights for policy 0, policy_version 64650 (0.0006) [2023-03-07 15:29:36,941][213771] Updated weights for policy 0, policy_version 64660 (0.0006) [2023-03-07 15:29:37,712][213771] Updated weights for policy 0, policy_version 64670 (0.0006) [2023-03-07 15:29:38,495][213771] Updated weights for policy 0, policy_version 64680 (0.0006) [2023-03-07 15:29:39,269][213771] Updated weights for policy 0, policy_version 64690 (0.0006) [2023-03-07 15:29:40,045][213771] Updated weights for policy 0, policy_version 64700 (0.0006) [2023-03-07 15:29:40,811][213771] Updated weights for policy 0, policy_version 64710 (0.0008) [2023-03-07 15:29:41,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 66266112. Throughput: 0: 13250.2. Samples: 66263272. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:29:41,116][213445] Avg episode reward: [(0, '4444.438')] [2023-03-07 15:29:41,586][213771] Updated weights for policy 0, policy_version 64720 (0.0006) [2023-03-07 15:29:42,366][213771] Updated weights for policy 0, policy_version 64730 (0.0006) [2023-03-07 15:29:43,131][213771] Updated weights for policy 0, policy_version 64740 (0.0006) [2023-03-07 15:29:43,910][213771] Updated weights for policy 0, policy_version 64750 (0.0006) [2023-03-07 15:29:44,693][213771] Updated weights for policy 0, policy_version 64760 (0.0005) [2023-03-07 15:29:45,473][213771] Updated weights for policy 0, policy_version 64770 (0.0006) [2023-03-07 15:29:46,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13243.8, 300 sec: 13249.5). Total num frames: 66332672. Throughput: 0: 13255.2. Samples: 66303123. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:29:46,116][213445] Avg episode reward: [(0, '4425.585')] [2023-03-07 15:29:46,214][213771] Updated weights for policy 0, policy_version 64780 (0.0005) [2023-03-07 15:29:46,986][213771] Updated weights for policy 0, policy_version 64790 (0.0006) [2023-03-07 15:29:47,757][213771] Updated weights for policy 0, policy_version 64800 (0.0006) [2023-03-07 15:29:48,536][213771] Updated weights for policy 0, policy_version 64810 (0.0006) [2023-03-07 15:29:49,311][213771] Updated weights for policy 0, policy_version 64820 (0.0007) [2023-03-07 15:29:50,090][213771] Updated weights for policy 0, policy_version 64830 (0.0006) [2023-03-07 15:29:50,844][213771] Updated weights for policy 0, policy_version 64840 (0.0007) [2023-03-07 15:29:51,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 66399232. Throughput: 0: 13266.7. Samples: 66382735. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:29:51,117][213445] Avg episode reward: [(0, '4435.821')] [2023-03-07 15:29:51,609][213771] Updated weights for policy 0, policy_version 64850 (0.0006) [2023-03-07 15:29:52,384][213771] Updated weights for policy 0, policy_version 64860 (0.0007) [2023-03-07 15:29:53,125][213771] Updated weights for policy 0, policy_version 64870 (0.0005) [2023-03-07 15:29:53,893][213771] Updated weights for policy 0, policy_version 64880 (0.0006) [2023-03-07 15:29:54,675][213771] Updated weights for policy 0, policy_version 64890 (0.0006) [2023-03-07 15:29:55,429][213771] Updated weights for policy 0, policy_version 64900 (0.0007) [2023-03-07 15:29:56,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 66465792. Throughput: 0: 13280.1. Samples: 66462820. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:29:56,106][213445] Avg episode reward: [(0, '4356.814')] [2023-03-07 15:29:56,212][213771] Updated weights for policy 0, policy_version 64910 (0.0006) [2023-03-07 15:29:56,983][213771] Updated weights for policy 0, policy_version 64920 (0.0006) [2023-03-07 15:29:57,740][213771] Updated weights for policy 0, policy_version 64930 (0.0006) [2023-03-07 15:29:58,526][213771] Updated weights for policy 0, policy_version 64940 (0.0006) [2023-03-07 15:29:59,300][213771] Updated weights for policy 0, policy_version 64950 (0.0006) [2023-03-07 15:30:00,079][213771] Updated weights for policy 0, policy_version 64960 (0.0005) [2023-03-07 15:30:00,861][213771] Updated weights for policy 0, policy_version 64970 (0.0006) [2023-03-07 15:30:01,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13277.8, 300 sec: 13249.5). Total num frames: 66532352. Throughput: 0: 13275.8. Samples: 66502499. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:30:01,106][213445] Avg episode reward: [(0, '4382.222')] [2023-03-07 15:30:01,631][213771] Updated weights for policy 0, policy_version 64980 (0.0007) [2023-03-07 15:30:02,405][213771] Updated weights for policy 0, policy_version 64990 (0.0006) [2023-03-07 15:30:03,176][213771] Updated weights for policy 0, policy_version 65000 (0.0007) [2023-03-07 15:30:03,959][213771] Updated weights for policy 0, policy_version 65010 (0.0006) [2023-03-07 15:30:04,729][213771] Updated weights for policy 0, policy_version 65020 (0.0006) [2023-03-07 15:30:05,494][213771] Updated weights for policy 0, policy_version 65030 (0.0006) [2023-03-07 15:30:06,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13260.8, 300 sec: 13246.0). Total num frames: 66597888. Throughput: 0: 13269.0. Samples: 66581860. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:30:06,106][213445] Avg episode reward: [(0, '4416.440')] [2023-03-07 15:30:06,111][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000065037_66597888.pth... [2023-03-07 15:30:06,139][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000061932_63418368.pth [2023-03-07 15:30:06,282][213771] Updated weights for policy 0, policy_version 65040 (0.0006) [2023-03-07 15:30:07,054][213771] Updated weights for policy 0, policy_version 65050 (0.0006) [2023-03-07 15:30:07,812][213771] Updated weights for policy 0, policy_version 65060 (0.0006) [2023-03-07 15:30:08,580][213771] Updated weights for policy 0, policy_version 65070 (0.0006) [2023-03-07 15:30:09,359][213771] Updated weights for policy 0, policy_version 65080 (0.0006) [2023-03-07 15:30:10,122][213771] Updated weights for policy 0, policy_version 65090 (0.0006) [2023-03-07 15:30:10,905][213771] Updated weights for policy 0, policy_version 65100 (0.0007) [2023-03-07 15:30:11,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13277.9, 300 sec: 13246.0). Total num frames: 66664448. Throughput: 0: 13268.9. Samples: 66661551. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:30:11,106][213445] Avg episode reward: [(0, '4363.011')] [2023-03-07 15:30:11,670][213771] Updated weights for policy 0, policy_version 65110 (0.0005) [2023-03-07 15:30:12,457][213771] Updated weights for policy 0, policy_version 65120 (0.0006) [2023-03-07 15:30:13,230][213771] Updated weights for policy 0, policy_version 65130 (0.0006) [2023-03-07 15:30:14,007][213771] Updated weights for policy 0, policy_version 65140 (0.0006) [2023-03-07 15:30:14,776][213771] Updated weights for policy 0, policy_version 65150 (0.0006) [2023-03-07 15:30:15,555][213771] Updated weights for policy 0, policy_version 65160 (0.0006) [2023-03-07 15:30:16,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13277.9, 300 sec: 13246.0). Total num frames: 66731008. Throughput: 0: 13263.5. Samples: 66701120. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:30:16,106][213445] Avg episode reward: [(0, '4373.965')] [2023-03-07 15:30:16,318][213771] Updated weights for policy 0, policy_version 65170 (0.0006) [2023-03-07 15:30:17,101][213771] Updated weights for policy 0, policy_version 65180 (0.0006) [2023-03-07 15:30:17,873][213771] Updated weights for policy 0, policy_version 65190 (0.0007) [2023-03-07 15:30:18,644][213771] Updated weights for policy 0, policy_version 65200 (0.0007) [2023-03-07 15:30:19,417][213771] Updated weights for policy 0, policy_version 65210 (0.0007) [2023-03-07 15:30:20,194][213771] Updated weights for policy 0, policy_version 65220 (0.0006) [2023-03-07 15:30:20,956][213771] Updated weights for policy 0, policy_version 65230 (0.0007) [2023-03-07 15:30:21,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13246.0). Total num frames: 66796544. Throughput: 0: 13262.8. Samples: 66780449. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:30:21,106][213445] Avg episode reward: [(0, '4427.017')] [2023-03-07 15:30:21,736][213771] Updated weights for policy 0, policy_version 65240 (0.0007) [2023-03-07 15:30:22,490][213771] Updated weights for policy 0, policy_version 65250 (0.0006) [2023-03-07 15:30:23,271][213771] Updated weights for policy 0, policy_version 65260 (0.0006) [2023-03-07 15:30:24,027][213771] Updated weights for policy 0, policy_version 65270 (0.0005) [2023-03-07 15:30:24,813][213771] Updated weights for policy 0, policy_version 65280 (0.0006) [2023-03-07 15:30:25,598][213771] Updated weights for policy 0, policy_version 65290 (0.0006) [2023-03-07 15:30:26,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13260.8, 300 sec: 13246.0). Total num frames: 66863104. Throughput: 0: 13264.4. Samples: 66860174. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:30:26,106][213445] Avg episode reward: [(0, '4474.613')] [2023-03-07 15:30:26,374][213771] Updated weights for policy 0, policy_version 65300 (0.0007) [2023-03-07 15:30:27,162][213771] Updated weights for policy 0, policy_version 65310 (0.0007) [2023-03-07 15:30:27,917][213771] Updated weights for policy 0, policy_version 65320 (0.0006) [2023-03-07 15:30:28,688][213771] Updated weights for policy 0, policy_version 65330 (0.0007) [2023-03-07 15:30:29,464][213771] Updated weights for policy 0, policy_version 65340 (0.0005) [2023-03-07 15:30:30,229][213771] Updated weights for policy 0, policy_version 65350 (0.0007) [2023-03-07 15:30:31,013][213771] Updated weights for policy 0, policy_version 65360 (0.0006) [2023-03-07 15:30:31,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13260.8, 300 sec: 13246.0). Total num frames: 66929664. Throughput: 0: 13262.1. Samples: 66899916. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:30:31,106][213445] Avg episode reward: [(0, '4388.671')] [2023-03-07 15:30:31,775][213771] Updated weights for policy 0, policy_version 65370 (0.0006) [2023-03-07 15:30:32,558][213771] Updated weights for policy 0, policy_version 65380 (0.0005) [2023-03-07 15:30:33,321][213771] Updated weights for policy 0, policy_version 65390 (0.0006) [2023-03-07 15:30:34,105][213771] Updated weights for policy 0, policy_version 65400 (0.0006) [2023-03-07 15:30:34,888][213771] Updated weights for policy 0, policy_version 65410 (0.0006) [2023-03-07 15:30:35,654][213771] Updated weights for policy 0, policy_version 65420 (0.0008) [2023-03-07 15:30:36,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13260.8, 300 sec: 13246.1). Total num frames: 66995200. Throughput: 0: 13256.9. Samples: 66979294. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:30:36,106][213445] Avg episode reward: [(0, '4351.526')] [2023-03-07 15:30:36,412][213771] Updated weights for policy 0, policy_version 65430 (0.0006) [2023-03-07 15:30:37,189][213771] Updated weights for policy 0, policy_version 65440 (0.0006) [2023-03-07 15:30:37,966][213771] Updated weights for policy 0, policy_version 65450 (0.0006) [2023-03-07 15:30:38,737][213771] Updated weights for policy 0, policy_version 65460 (0.0006) [2023-03-07 15:30:39,517][213771] Updated weights for policy 0, policy_version 65470 (0.0007) [2023-03-07 15:30:40,302][213771] Updated weights for policy 0, policy_version 65480 (0.0006) [2023-03-07 15:30:41,063][213771] Updated weights for policy 0, policy_version 65490 (0.0006) [2023-03-07 15:30:41,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13246.0). Total num frames: 67061760. Throughput: 0: 13245.7. Samples: 67058878. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:30:41,106][213445] Avg episode reward: [(0, '4408.876')] [2023-03-07 15:30:41,834][213771] Updated weights for policy 0, policy_version 65500 (0.0006) [2023-03-07 15:30:42,605][213771] Updated weights for policy 0, policy_version 65510 (0.0007) [2023-03-07 15:30:43,357][213771] Updated weights for policy 0, policy_version 65520 (0.0006) [2023-03-07 15:30:44,129][213771] Updated weights for policy 0, policy_version 65530 (0.0006) [2023-03-07 15:30:44,918][213771] Updated weights for policy 0, policy_version 65540 (0.0006) [2023-03-07 15:30:45,681][213771] Updated weights for policy 0, policy_version 65550 (0.0006) [2023-03-07 15:30:46,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 67128320. Throughput: 0: 13248.8. Samples: 67098694. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:30:46,106][213445] Avg episode reward: [(0, '4343.781')] [2023-03-07 15:30:46,442][213771] Updated weights for policy 0, policy_version 65560 (0.0006) [2023-03-07 15:30:47,223][213771] Updated weights for policy 0, policy_version 65570 (0.0006) [2023-03-07 15:30:47,990][213771] Updated weights for policy 0, policy_version 65580 (0.0006) [2023-03-07 15:30:48,773][213771] Updated weights for policy 0, policy_version 65590 (0.0006) [2023-03-07 15:30:49,548][213771] Updated weights for policy 0, policy_version 65600 (0.0006) [2023-03-07 15:30:50,304][213771] Updated weights for policy 0, policy_version 65610 (0.0007) [2023-03-07 15:30:51,086][213771] Updated weights for policy 0, policy_version 65620 (0.0006) [2023-03-07 15:30:51,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 67194880. Throughput: 0: 13256.9. Samples: 67178421. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:30:51,106][213445] Avg episode reward: [(0, '4345.970')] [2023-03-07 15:30:51,864][213771] Updated weights for policy 0, policy_version 65630 (0.0006) [2023-03-07 15:30:52,644][213771] Updated weights for policy 0, policy_version 65640 (0.0006) [2023-03-07 15:30:53,403][213771] Updated weights for policy 0, policy_version 65650 (0.0006) [2023-03-07 15:30:54,171][213771] Updated weights for policy 0, policy_version 65660 (0.0006) [2023-03-07 15:30:54,959][213771] Updated weights for policy 0, policy_version 65670 (0.0006) [2023-03-07 15:30:55,714][213771] Updated weights for policy 0, policy_version 65680 (0.0006) [2023-03-07 15:30:56,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 67261440. Throughput: 0: 13251.4. Samples: 67257862. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:30:56,106][213445] Avg episode reward: [(0, '4315.363')] [2023-03-07 15:30:56,493][213771] Updated weights for policy 0, policy_version 65690 (0.0007) [2023-03-07 15:30:57,270][213771] Updated weights for policy 0, policy_version 65700 (0.0006) [2023-03-07 15:30:58,046][213771] Updated weights for policy 0, policy_version 65710 (0.0006) [2023-03-07 15:30:58,822][213771] Updated weights for policy 0, policy_version 65720 (0.0006) [2023-03-07 15:30:59,612][213771] Updated weights for policy 0, policy_version 65730 (0.0006) [2023-03-07 15:31:00,363][213771] Updated weights for policy 0, policy_version 65740 (0.0007) [2023-03-07 15:31:01,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13243.8, 300 sec: 13249.5). Total num frames: 67326976. Throughput: 0: 13252.0. Samples: 67297459. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:31:01,106][213445] Avg episode reward: [(0, '4325.663')] [2023-03-07 15:31:01,133][213771] Updated weights for policy 0, policy_version 65750 (0.0006) [2023-03-07 15:31:01,907][213771] Updated weights for policy 0, policy_version 65760 (0.0006) [2023-03-07 15:31:02,652][213771] Updated weights for policy 0, policy_version 65770 (0.0005) [2023-03-07 15:31:03,442][213771] Updated weights for policy 0, policy_version 65780 (0.0007) [2023-03-07 15:31:04,228][213771] Updated weights for policy 0, policy_version 65790 (0.0006) [2023-03-07 15:31:04,995][213771] Updated weights for policy 0, policy_version 65800 (0.0006) [2023-03-07 15:31:05,758][213771] Updated weights for policy 0, policy_version 65810 (0.0006) [2023-03-07 15:31:06,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 67393536. Throughput: 0: 13257.0. Samples: 67377015. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:31:06,106][213445] Avg episode reward: [(0, '4408.464')] [2023-03-07 15:31:06,550][213771] Updated weights for policy 0, policy_version 65820 (0.0006) [2023-03-07 15:31:07,309][213771] Updated weights for policy 0, policy_version 65830 (0.0006) [2023-03-07 15:31:08,091][213771] Updated weights for policy 0, policy_version 65840 (0.0005) [2023-03-07 15:31:08,862][213771] Updated weights for policy 0, policy_version 65850 (0.0006) [2023-03-07 15:31:09,629][213771] Updated weights for policy 0, policy_version 65860 (0.0006) [2023-03-07 15:31:10,397][213771] Updated weights for policy 0, policy_version 65870 (0.0006) [2023-03-07 15:31:11,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 67460096. Throughput: 0: 13253.2. Samples: 67456567. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:31:11,106][213445] Avg episode reward: [(0, '4338.931')] [2023-03-07 15:31:11,161][213771] Updated weights for policy 0, policy_version 65880 (0.0006) [2023-03-07 15:31:11,942][213771] Updated weights for policy 0, policy_version 65890 (0.0006) [2023-03-07 15:31:12,733][213771] Updated weights for policy 0, policy_version 65900 (0.0006) [2023-03-07 15:31:13,526][213771] Updated weights for policy 0, policy_version 65910 (0.0006) [2023-03-07 15:31:14,310][213771] Updated weights for policy 0, policy_version 65920 (0.0006) [2023-03-07 15:31:15,083][213771] Updated weights for policy 0, policy_version 65930 (0.0006) [2023-03-07 15:31:15,877][213771] Updated weights for policy 0, policy_version 65940 (0.0006) [2023-03-07 15:31:16,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 67525632. Throughput: 0: 13242.8. Samples: 67495843. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:31:16,106][213445] Avg episode reward: [(0, '4433.431')] [2023-03-07 15:31:16,638][213771] Updated weights for policy 0, policy_version 65950 (0.0006) [2023-03-07 15:31:17,416][213771] Updated weights for policy 0, policy_version 65960 (0.0006) [2023-03-07 15:31:18,180][213771] Updated weights for policy 0, policy_version 65970 (0.0006) [2023-03-07 15:31:18,944][213771] Updated weights for policy 0, policy_version 65980 (0.0006) [2023-03-07 15:31:19,726][213771] Updated weights for policy 0, policy_version 65990 (0.0006) [2023-03-07 15:31:20,487][213771] Updated weights for policy 0, policy_version 66000 (0.0006) [2023-03-07 15:31:21,105][213445] Fps is (10 sec: 13107.2, 60 sec: 13243.8, 300 sec: 13246.1). Total num frames: 67591168. Throughput: 0: 13246.4. Samples: 67575384. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:31:21,106][213445] Avg episode reward: [(0, '4428.753')] [2023-03-07 15:31:21,272][213771] Updated weights for policy 0, policy_version 66010 (0.0006) [2023-03-07 15:31:22,033][213771] Updated weights for policy 0, policy_version 66020 (0.0006) [2023-03-07 15:31:22,829][213771] Updated weights for policy 0, policy_version 66030 (0.0007) [2023-03-07 15:31:23,592][213771] Updated weights for policy 0, policy_version 66040 (0.0007) [2023-03-07 15:31:24,345][213771] Updated weights for policy 0, policy_version 66050 (0.0006) [2023-03-07 15:31:25,136][213771] Updated weights for policy 0, policy_version 66060 (0.0007) [2023-03-07 15:31:25,903][213771] Updated weights for policy 0, policy_version 66070 (0.0006) [2023-03-07 15:31:26,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.8, 300 sec: 13246.1). Total num frames: 67657728. Throughput: 0: 13241.4. Samples: 67654741. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:31:26,105][213445] Avg episode reward: [(0, '4387.654')] [2023-03-07 15:31:26,650][213771] Updated weights for policy 0, policy_version 66080 (0.0006) [2023-03-07 15:31:27,445][213771] Updated weights for policy 0, policy_version 66090 (0.0006) [2023-03-07 15:31:28,219][213771] Updated weights for policy 0, policy_version 66100 (0.0006) [2023-03-07 15:31:28,993][213771] Updated weights for policy 0, policy_version 66110 (0.0006) [2023-03-07 15:31:29,762][213771] Updated weights for policy 0, policy_version 66120 (0.0006) [2023-03-07 15:31:30,549][213771] Updated weights for policy 0, policy_version 66130 (0.0006) [2023-03-07 15:31:31,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 67724288. Throughput: 0: 13241.0. Samples: 67694537. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:31:31,106][213445] Avg episode reward: [(0, '4349.453')] [2023-03-07 15:31:31,329][213771] Updated weights for policy 0, policy_version 66140 (0.0006) [2023-03-07 15:31:32,078][213771] Updated weights for policy 0, policy_version 66150 (0.0006) [2023-03-07 15:31:32,859][213771] Updated weights for policy 0, policy_version 66160 (0.0006) [2023-03-07 15:31:33,625][213771] Updated weights for policy 0, policy_version 66170 (0.0006) [2023-03-07 15:31:34,393][213771] Updated weights for policy 0, policy_version 66180 (0.0006) [2023-03-07 15:31:35,178][213771] Updated weights for policy 0, policy_version 66190 (0.0007) [2023-03-07 15:31:35,944][213771] Updated weights for policy 0, policy_version 66200 (0.0006) [2023-03-07 15:31:36,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13246.0). Total num frames: 67790848. Throughput: 0: 13242.2. Samples: 67774317. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:31:36,105][213445] Avg episode reward: [(0, '4358.709')] [2023-03-07 15:31:36,705][213771] Updated weights for policy 0, policy_version 66210 (0.0006) [2023-03-07 15:31:37,485][213771] Updated weights for policy 0, policy_version 66220 (0.0007) [2023-03-07 15:31:38,270][213771] Updated weights for policy 0, policy_version 66230 (0.0006) [2023-03-07 15:31:39,034][213771] Updated weights for policy 0, policy_version 66240 (0.0007) [2023-03-07 15:31:39,809][213771] Updated weights for policy 0, policy_version 66250 (0.0006) [2023-03-07 15:31:40,596][213771] Updated weights for policy 0, policy_version 66260 (0.0007) [2023-03-07 15:31:41,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13246.1). Total num frames: 67856384. Throughput: 0: 13235.9. Samples: 67853476. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:31:41,106][213445] Avg episode reward: [(0, '4384.815')] [2023-03-07 15:31:41,369][213771] Updated weights for policy 0, policy_version 66270 (0.0006) [2023-03-07 15:31:42,122][213771] Updated weights for policy 0, policy_version 66280 (0.0007) [2023-03-07 15:31:42,910][213771] Updated weights for policy 0, policy_version 66290 (0.0006) [2023-03-07 15:31:43,685][213771] Updated weights for policy 0, policy_version 66300 (0.0006) [2023-03-07 15:31:44,472][213771] Updated weights for policy 0, policy_version 66310 (0.0006) [2023-03-07 15:31:45,229][213771] Updated weights for policy 0, policy_version 66320 (0.0006) [2023-03-07 15:31:46,007][213771] Updated weights for policy 0, policy_version 66330 (0.0006) [2023-03-07 15:31:46,105][213445] Fps is (10 sec: 13209.3, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 67922944. Throughput: 0: 13239.9. Samples: 67893257. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:31:46,106][213445] Avg episode reward: [(0, '4347.484')] [2023-03-07 15:31:46,773][213771] Updated weights for policy 0, policy_version 66340 (0.0006) [2023-03-07 15:31:47,553][213771] Updated weights for policy 0, policy_version 66350 (0.0006) [2023-03-07 15:31:48,303][213771] Updated weights for policy 0, policy_version 66360 (0.0006) [2023-03-07 15:31:49,098][213771] Updated weights for policy 0, policy_version 66370 (0.0006) [2023-03-07 15:31:49,874][213771] Updated weights for policy 0, policy_version 66380 (0.0006) [2023-03-07 15:31:50,639][213771] Updated weights for policy 0, policy_version 66390 (0.0006) [2023-03-07 15:31:51,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 67989504. Throughput: 0: 13236.0. Samples: 67972635. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:31:51,106][213445] Avg episode reward: [(0, '4420.649')] [2023-03-07 15:31:51,402][213771] Updated weights for policy 0, policy_version 66400 (0.0005) [2023-03-07 15:31:52,153][213771] Updated weights for policy 0, policy_version 66410 (0.0006) [2023-03-07 15:31:52,921][213771] Updated weights for policy 0, policy_version 66420 (0.0006) [2023-03-07 15:31:53,689][213771] Updated weights for policy 0, policy_version 66430 (0.0006) [2023-03-07 15:31:54,464][213771] Updated weights for policy 0, policy_version 66440 (0.0005) [2023-03-07 15:31:55,238][213771] Updated weights for policy 0, policy_version 66450 (0.0006) [2023-03-07 15:31:56,010][213771] Updated weights for policy 0, policy_version 66460 (0.0006) [2023-03-07 15:31:56,105][213445] Fps is (10 sec: 13312.3, 60 sec: 13243.8, 300 sec: 13249.5). Total num frames: 68056064. Throughput: 0: 13248.5. Samples: 68052747. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:31:56,105][213445] Avg episode reward: [(0, '4402.580')] [2023-03-07 15:31:56,772][213771] Updated weights for policy 0, policy_version 66470 (0.0006) [2023-03-07 15:31:57,553][213771] Updated weights for policy 0, policy_version 66480 (0.0006) [2023-03-07 15:31:58,321][213771] Updated weights for policy 0, policy_version 66490 (0.0006) [2023-03-07 15:31:59,097][213771] Updated weights for policy 0, policy_version 66500 (0.0007) [2023-03-07 15:31:59,871][213771] Updated weights for policy 0, policy_version 66510 (0.0006) [2023-03-07 15:32:00,639][213771] Updated weights for policy 0, policy_version 66520 (0.0006) [2023-03-07 15:32:01,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 68122624. Throughput: 0: 13256.9. Samples: 68092402. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:32:01,106][213445] Avg episode reward: [(0, '4462.186')] [2023-03-07 15:32:01,402][213771] Updated weights for policy 0, policy_version 66530 (0.0006) [2023-03-07 15:32:02,163][213771] Updated weights for policy 0, policy_version 66540 (0.0006) [2023-03-07 15:32:02,932][213771] Updated weights for policy 0, policy_version 66550 (0.0006) [2023-03-07 15:32:03,709][213771] Updated weights for policy 0, policy_version 66560 (0.0006) [2023-03-07 15:32:04,496][213771] Updated weights for policy 0, policy_version 66570 (0.0006) [2023-03-07 15:32:05,260][213771] Updated weights for policy 0, policy_version 66580 (0.0005) [2023-03-07 15:32:06,037][213771] Updated weights for policy 0, policy_version 66590 (0.0006) [2023-03-07 15:32:06,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 68188160. Throughput: 0: 13260.9. Samples: 68172123. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:32:06,106][213445] Avg episode reward: [(0, '4458.130')] [2023-03-07 15:32:06,110][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000066591_68189184.pth... [2023-03-07 15:32:06,142][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000063484_65007616.pth [2023-03-07 15:32:06,814][213771] Updated weights for policy 0, policy_version 66600 (0.0006) [2023-03-07 15:32:07,589][213771] Updated weights for policy 0, policy_version 66610 (0.0006) [2023-03-07 15:32:08,384][213771] Updated weights for policy 0, policy_version 66620 (0.0006) [2023-03-07 15:32:09,146][213771] Updated weights for policy 0, policy_version 66630 (0.0006) [2023-03-07 15:32:09,921][213771] Updated weights for policy 0, policy_version 66640 (0.0008) [2023-03-07 15:32:10,709][213771] Updated weights for policy 0, policy_version 66650 (0.0006) [2023-03-07 15:32:11,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 68254720. Throughput: 0: 13262.1. Samples: 68251537. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:32:11,106][213445] Avg episode reward: [(0, '4376.464')] [2023-03-07 15:32:11,476][213771] Updated weights for policy 0, policy_version 66660 (0.0006) [2023-03-07 15:32:12,253][213771] Updated weights for policy 0, policy_version 66670 (0.0006) [2023-03-07 15:32:13,007][213771] Updated weights for policy 0, policy_version 66680 (0.0005) [2023-03-07 15:32:13,777][213771] Updated weights for policy 0, policy_version 66690 (0.0006) [2023-03-07 15:32:14,570][213771] Updated weights for policy 0, policy_version 66700 (0.0006) [2023-03-07 15:32:15,334][213771] Updated weights for policy 0, policy_version 66710 (0.0006) [2023-03-07 15:32:16,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13246.1). Total num frames: 68320256. Throughput: 0: 13259.0. Samples: 68291193. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:32:16,106][213445] Avg episode reward: [(0, '4253.309')] [2023-03-07 15:32:16,109][213771] Updated weights for policy 0, policy_version 66720 (0.0006) [2023-03-07 15:32:16,895][213771] Updated weights for policy 0, policy_version 66730 (0.0006) [2023-03-07 15:32:17,662][213771] Updated weights for policy 0, policy_version 66740 (0.0007) [2023-03-07 15:32:18,431][213771] Updated weights for policy 0, policy_version 66750 (0.0006) [2023-03-07 15:32:19,219][213771] Updated weights for policy 0, policy_version 66760 (0.0006) [2023-03-07 15:32:19,993][213771] Updated weights for policy 0, policy_version 66770 (0.0006) [2023-03-07 15:32:20,774][213771] Updated weights for policy 0, policy_version 66780 (0.0006) [2023-03-07 15:32:21,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 68386816. Throughput: 0: 13246.7. Samples: 68370421. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:32:21,106][213445] Avg episode reward: [(0, '4328.633')] [2023-03-07 15:32:21,538][213771] Updated weights for policy 0, policy_version 66790 (0.0007) [2023-03-07 15:32:22,311][213771] Updated weights for policy 0, policy_version 66800 (0.0007) [2023-03-07 15:32:23,090][213771] Updated weights for policy 0, policy_version 66810 (0.0006) [2023-03-07 15:32:23,877][213771] Updated weights for policy 0, policy_version 66820 (0.0005) [2023-03-07 15:32:24,647][213771] Updated weights for policy 0, policy_version 66830 (0.0006) [2023-03-07 15:32:25,431][213771] Updated weights for policy 0, policy_version 66840 (0.0006) [2023-03-07 15:32:26,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 68452352. Throughput: 0: 13247.3. Samples: 68449603. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:32:26,106][213445] Avg episode reward: [(0, '4369.580')] [2023-03-07 15:32:26,197][213771] Updated weights for policy 0, policy_version 66850 (0.0006) [2023-03-07 15:32:26,961][213771] Updated weights for policy 0, policy_version 66860 (0.0006) [2023-03-07 15:32:27,731][213771] Updated weights for policy 0, policy_version 66870 (0.0006) [2023-03-07 15:32:28,490][213771] Updated weights for policy 0, policy_version 66880 (0.0006) [2023-03-07 15:32:29,274][213771] Updated weights for policy 0, policy_version 66890 (0.0006) [2023-03-07 15:32:30,038][213771] Updated weights for policy 0, policy_version 66900 (0.0006) [2023-03-07 15:32:30,834][213771] Updated weights for policy 0, policy_version 66910 (0.0006) [2023-03-07 15:32:31,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 68518912. Throughput: 0: 13251.2. Samples: 68489556. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:32:31,106][213445] Avg episode reward: [(0, '4419.950')] [2023-03-07 15:32:31,595][213771] Updated weights for policy 0, policy_version 66920 (0.0006) [2023-03-07 15:32:32,386][213771] Updated weights for policy 0, policy_version 66930 (0.0006) [2023-03-07 15:32:33,160][213771] Updated weights for policy 0, policy_version 66940 (0.0006) [2023-03-07 15:32:33,914][213771] Updated weights for policy 0, policy_version 66950 (0.0006) [2023-03-07 15:32:34,682][213771] Updated weights for policy 0, policy_version 66960 (0.0006) [2023-03-07 15:32:35,470][213771] Updated weights for policy 0, policy_version 66970 (0.0006) [2023-03-07 15:32:36,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 68585472. Throughput: 0: 13251.6. Samples: 68568957. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:32:36,106][213445] Avg episode reward: [(0, '4483.395')] [2023-03-07 15:32:36,239][213771] Updated weights for policy 0, policy_version 66980 (0.0006) [2023-03-07 15:32:37,008][213771] Updated weights for policy 0, policy_version 66990 (0.0006) [2023-03-07 15:32:37,785][213771] Updated weights for policy 0, policy_version 67000 (0.0005) [2023-03-07 15:32:38,551][213771] Updated weights for policy 0, policy_version 67010 (0.0006) [2023-03-07 15:32:39,322][213771] Updated weights for policy 0, policy_version 67020 (0.0006) [2023-03-07 15:32:40,099][213771] Updated weights for policy 0, policy_version 67030 (0.0006) [2023-03-07 15:32:40,882][213771] Updated weights for policy 0, policy_version 67040 (0.0007) [2023-03-07 15:32:41,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 68651008. Throughput: 0: 13234.4. Samples: 68648299. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:32:41,106][213445] Avg episode reward: [(0, '4463.584')] [2023-03-07 15:32:41,646][213771] Updated weights for policy 0, policy_version 67050 (0.0006) [2023-03-07 15:32:42,432][213771] Updated weights for policy 0, policy_version 67060 (0.0006) [2023-03-07 15:32:43,212][213771] Updated weights for policy 0, policy_version 67070 (0.0006) [2023-03-07 15:32:43,985][213771] Updated weights for policy 0, policy_version 67080 (0.0006) [2023-03-07 15:32:44,747][213771] Updated weights for policy 0, policy_version 67090 (0.0006) [2023-03-07 15:32:45,512][213771] Updated weights for policy 0, policy_version 67100 (0.0006) [2023-03-07 15:32:46,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.8, 300 sec: 13249.5). Total num frames: 68717568. Throughput: 0: 13233.2. Samples: 68687895. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:32:46,106][213445] Avg episode reward: [(0, '4449.972')] [2023-03-07 15:32:46,282][213771] Updated weights for policy 0, policy_version 67110 (0.0006) [2023-03-07 15:32:47,061][213771] Updated weights for policy 0, policy_version 67120 (0.0006) [2023-03-07 15:32:47,852][213771] Updated weights for policy 0, policy_version 67130 (0.0006) [2023-03-07 15:32:48,620][213771] Updated weights for policy 0, policy_version 67140 (0.0005) [2023-03-07 15:32:49,392][213771] Updated weights for policy 0, policy_version 67150 (0.0006) [2023-03-07 15:32:50,157][213771] Updated weights for policy 0, policy_version 67160 (0.0006) [2023-03-07 15:32:50,942][213771] Updated weights for policy 0, policy_version 67170 (0.0006) [2023-03-07 15:32:51,105][213445] Fps is (10 sec: 13312.3, 60 sec: 13243.8, 300 sec: 13249.5). Total num frames: 68784128. Throughput: 0: 13227.5. Samples: 68767357. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:32:51,106][213445] Avg episode reward: [(0, '4499.355')] [2023-03-07 15:32:51,707][213771] Updated weights for policy 0, policy_version 67180 (0.0007) [2023-03-07 15:32:52,477][213771] Updated weights for policy 0, policy_version 67190 (0.0006) [2023-03-07 15:32:53,269][213771] Updated weights for policy 0, policy_version 67200 (0.0007) [2023-03-07 15:32:54,020][213771] Updated weights for policy 0, policy_version 67210 (0.0006) [2023-03-07 15:32:54,778][213771] Updated weights for policy 0, policy_version 67220 (0.0006) [2023-03-07 15:32:55,566][213771] Updated weights for policy 0, policy_version 67230 (0.0006) [2023-03-07 15:32:56,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13243.7, 300 sec: 13253.0). Total num frames: 68850688. Throughput: 0: 13236.6. Samples: 68847185. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:32:56,105][213445] Avg episode reward: [(0, '4481.417')] [2023-03-07 15:32:56,323][213771] Updated weights for policy 0, policy_version 67240 (0.0006) [2023-03-07 15:32:57,098][213771] Updated weights for policy 0, policy_version 67250 (0.0005) [2023-03-07 15:32:57,863][213771] Updated weights for policy 0, policy_version 67260 (0.0006) [2023-03-07 15:32:58,646][213771] Updated weights for policy 0, policy_version 67270 (0.0006) [2023-03-07 15:32:59,410][213771] Updated weights for policy 0, policy_version 67280 (0.0006) [2023-03-07 15:33:00,185][213771] Updated weights for policy 0, policy_version 67290 (0.0006) [2023-03-07 15:33:00,949][213771] Updated weights for policy 0, policy_version 67300 (0.0005) [2023-03-07 15:33:01,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13243.7, 300 sec: 13253.0). Total num frames: 68917248. Throughput: 0: 13239.6. Samples: 68886976. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:33:01,106][213445] Avg episode reward: [(0, '4407.945')] [2023-03-07 15:33:01,727][213771] Updated weights for policy 0, policy_version 67310 (0.0006) [2023-03-07 15:33:02,511][213771] Updated weights for policy 0, policy_version 67320 (0.0007) [2023-03-07 15:33:03,267][213771] Updated weights for policy 0, policy_version 67330 (0.0006) [2023-03-07 15:33:04,048][213771] Updated weights for policy 0, policy_version 67340 (0.0007) [2023-03-07 15:33:04,835][213771] Updated weights for policy 0, policy_version 67350 (0.0006) [2023-03-07 15:33:05,599][213771] Updated weights for policy 0, policy_version 67360 (0.0006) [2023-03-07 15:33:06,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 68982784. Throughput: 0: 13248.3. Samples: 68966592. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:33:06,116][213445] Avg episode reward: [(0, '4405.483')] [2023-03-07 15:33:06,376][213771] Updated weights for policy 0, policy_version 67370 (0.0006) [2023-03-07 15:33:07,153][213771] Updated weights for policy 0, policy_version 67380 (0.0006) [2023-03-07 15:33:07,941][213771] Updated weights for policy 0, policy_version 67390 (0.0006) [2023-03-07 15:33:08,715][213771] Updated weights for policy 0, policy_version 67400 (0.0006) [2023-03-07 15:33:09,491][213771] Updated weights for policy 0, policy_version 67410 (0.0007) [2023-03-07 15:33:10,258][213771] Updated weights for policy 0, policy_version 67420 (0.0006) [2023-03-07 15:33:11,036][213771] Updated weights for policy 0, policy_version 67430 (0.0005) [2023-03-07 15:33:11,105][213445] Fps is (10 sec: 13107.1, 60 sec: 13226.7, 300 sec: 13249.5). Total num frames: 69048320. Throughput: 0: 13247.2. Samples: 69045727. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:33:11,117][213445] Avg episode reward: [(0, '4401.368')] [2023-03-07 15:33:11,812][213771] Updated weights for policy 0, policy_version 67440 (0.0006) [2023-03-07 15:33:12,568][213771] Updated weights for policy 0, policy_version 67450 (0.0006) [2023-03-07 15:33:13,338][213771] Updated weights for policy 0, policy_version 67460 (0.0005) [2023-03-07 15:33:14,116][213771] Updated weights for policy 0, policy_version 67470 (0.0007) [2023-03-07 15:33:14,908][213771] Updated weights for policy 0, policy_version 67480 (0.0006) [2023-03-07 15:33:15,687][213771] Updated weights for policy 0, policy_version 67490 (0.0005) [2023-03-07 15:33:16,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 69114880. Throughput: 0: 13244.4. Samples: 69085556. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:33:16,116][213445] Avg episode reward: [(0, '4374.522')] [2023-03-07 15:33:16,442][213771] Updated weights for policy 0, policy_version 67500 (0.0006) [2023-03-07 15:33:17,221][213771] Updated weights for policy 0, policy_version 67510 (0.0006) [2023-03-07 15:33:17,985][213771] Updated weights for policy 0, policy_version 67520 (0.0007) [2023-03-07 15:33:18,794][213771] Updated weights for policy 0, policy_version 67530 (0.0006) [2023-03-07 15:33:19,573][213771] Updated weights for policy 0, policy_version 67540 (0.0005) [2023-03-07 15:33:20,330][213771] Updated weights for policy 0, policy_version 67550 (0.0005) [2023-03-07 15:33:21,090][213771] Updated weights for policy 0, policy_version 67560 (0.0005) [2023-03-07 15:33:21,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 69181440. Throughput: 0: 13234.2. Samples: 69164495. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:33:21,116][213445] Avg episode reward: [(0, '4404.879')] [2023-03-07 15:33:21,865][213771] Updated weights for policy 0, policy_version 67570 (0.0006) [2023-03-07 15:33:22,608][213771] Updated weights for policy 0, policy_version 67580 (0.0006) [2023-03-07 15:33:23,401][213771] Updated weights for policy 0, policy_version 67590 (0.0007) [2023-03-07 15:33:24,173][213771] Updated weights for policy 0, policy_version 67600 (0.0006) [2023-03-07 15:33:24,946][213771] Updated weights for policy 0, policy_version 67610 (0.0006) [2023-03-07 15:33:25,707][213771] Updated weights for policy 0, policy_version 67620 (0.0006) [2023-03-07 15:33:26,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 69248000. Throughput: 0: 13248.0. Samples: 69244459. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:33:26,116][213445] Avg episode reward: [(0, '4348.935')] [2023-03-07 15:33:26,471][213771] Updated weights for policy 0, policy_version 67630 (0.0006) [2023-03-07 15:33:27,242][213771] Updated weights for policy 0, policy_version 67640 (0.0006) [2023-03-07 15:33:28,015][213771] Updated weights for policy 0, policy_version 67650 (0.0006) [2023-03-07 15:33:28,793][213771] Updated weights for policy 0, policy_version 67660 (0.0006) [2023-03-07 15:33:29,567][213771] Updated weights for policy 0, policy_version 67670 (0.0006) [2023-03-07 15:33:30,341][213771] Updated weights for policy 0, policy_version 67680 (0.0006) [2023-03-07 15:33:31,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 69313536. Throughput: 0: 13256.5. Samples: 69284439. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:33:31,106][213445] Avg episode reward: [(0, '4400.635')] [2023-03-07 15:33:31,126][213771] Updated weights for policy 0, policy_version 67690 (0.0007) [2023-03-07 15:33:31,894][213771] Updated weights for policy 0, policy_version 67700 (0.0005) [2023-03-07 15:33:32,665][213771] Updated weights for policy 0, policy_version 67710 (0.0006) [2023-03-07 15:33:33,441][213771] Updated weights for policy 0, policy_version 67720 (0.0006) [2023-03-07 15:33:34,218][213771] Updated weights for policy 0, policy_version 67730 (0.0007) [2023-03-07 15:33:34,989][213771] Updated weights for policy 0, policy_version 67740 (0.0006) [2023-03-07 15:33:35,750][213771] Updated weights for policy 0, policy_version 67750 (0.0006) [2023-03-07 15:33:36,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 69380096. Throughput: 0: 13256.1. Samples: 69363885. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:33:36,106][213445] Avg episode reward: [(0, '4401.385')] [2023-03-07 15:33:36,518][213771] Updated weights for policy 0, policy_version 67760 (0.0007) [2023-03-07 15:33:37,305][213771] Updated weights for policy 0, policy_version 67770 (0.0006) [2023-03-07 15:33:38,077][213771] Updated weights for policy 0, policy_version 67780 (0.0005) [2023-03-07 15:33:38,845][213771] Updated weights for policy 0, policy_version 67790 (0.0006) [2023-03-07 15:33:39,608][213771] Updated weights for policy 0, policy_version 67800 (0.0006) [2023-03-07 15:33:40,384][213771] Updated weights for policy 0, policy_version 67810 (0.0006) [2023-03-07 15:33:41,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 69446656. Throughput: 0: 13255.0. Samples: 69443662. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:33:41,106][213445] Avg episode reward: [(0, '4382.918')] [2023-03-07 15:33:41,143][213771] Updated weights for policy 0, policy_version 67820 (0.0006) [2023-03-07 15:33:41,909][213771] Updated weights for policy 0, policy_version 67830 (0.0006) [2023-03-07 15:33:42,679][213771] Updated weights for policy 0, policy_version 67840 (0.0006) [2023-03-07 15:33:43,473][213771] Updated weights for policy 0, policy_version 67850 (0.0007) [2023-03-07 15:33:44,226][213771] Updated weights for policy 0, policy_version 67860 (0.0006) [2023-03-07 15:33:45,005][213771] Updated weights for policy 0, policy_version 67870 (0.0006) [2023-03-07 15:33:45,763][213771] Updated weights for policy 0, policy_version 67880 (0.0006) [2023-03-07 15:33:46,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 69513216. Throughput: 0: 13252.7. Samples: 69483348. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:33:46,106][213445] Avg episode reward: [(0, '4373.484')] [2023-03-07 15:33:46,539][213771] Updated weights for policy 0, policy_version 67890 (0.0006) [2023-03-07 15:33:47,325][213771] Updated weights for policy 0, policy_version 67900 (0.0007) [2023-03-07 15:33:48,090][213771] Updated weights for policy 0, policy_version 67910 (0.0006) [2023-03-07 15:33:48,863][213771] Updated weights for policy 0, policy_version 67920 (0.0006) [2023-03-07 15:33:49,641][213771] Updated weights for policy 0, policy_version 67930 (0.0006) [2023-03-07 15:33:50,413][213771] Updated weights for policy 0, policy_version 67940 (0.0005) [2023-03-07 15:33:51,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 69579776. Throughput: 0: 13253.8. Samples: 69563014. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:33:51,106][213445] Avg episode reward: [(0, '4336.949')] [2023-03-07 15:33:51,180][213771] Updated weights for policy 0, policy_version 67950 (0.0006) [2023-03-07 15:33:51,949][213771] Updated weights for policy 0, policy_version 67960 (0.0006) [2023-03-07 15:33:52,734][213771] Updated weights for policy 0, policy_version 67970 (0.0006) [2023-03-07 15:33:53,516][213771] Updated weights for policy 0, policy_version 67980 (0.0006) [2023-03-07 15:33:54,290][213771] Updated weights for policy 0, policy_version 67990 (0.0006) [2023-03-07 15:33:55,064][213771] Updated weights for policy 0, policy_version 68000 (0.0006) [2023-03-07 15:33:55,842][213771] Updated weights for policy 0, policy_version 68010 (0.0006) [2023-03-07 15:33:56,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13253.0). Total num frames: 69645312. Throughput: 0: 13250.3. Samples: 69641990. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:33:56,106][213445] Avg episode reward: [(0, '4364.476')] [2023-03-07 15:33:56,622][213771] Updated weights for policy 0, policy_version 68020 (0.0006) [2023-03-07 15:33:57,402][213771] Updated weights for policy 0, policy_version 68030 (0.0005) [2023-03-07 15:33:58,174][213771] Updated weights for policy 0, policy_version 68040 (0.0006) [2023-03-07 15:33:58,933][213771] Updated weights for policy 0, policy_version 68050 (0.0006) [2023-03-07 15:33:59,705][213771] Updated weights for policy 0, policy_version 68060 (0.0006) [2023-03-07 15:34:00,489][213771] Updated weights for policy 0, policy_version 68070 (0.0007) [2023-03-07 15:34:01,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13253.0). Total num frames: 69711872. Throughput: 0: 13250.9. Samples: 69681846. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:34:01,106][213445] Avg episode reward: [(0, '4354.079')] [2023-03-07 15:34:01,253][213771] Updated weights for policy 0, policy_version 68080 (0.0006) [2023-03-07 15:34:02,027][213771] Updated weights for policy 0, policy_version 68090 (0.0005) [2023-03-07 15:34:02,797][213771] Updated weights for policy 0, policy_version 68100 (0.0007) [2023-03-07 15:34:03,575][213771] Updated weights for policy 0, policy_version 68110 (0.0006) [2023-03-07 15:34:04,342][213771] Updated weights for policy 0, policy_version 68120 (0.0007) [2023-03-07 15:34:05,132][213771] Updated weights for policy 0, policy_version 68130 (0.0006) [2023-03-07 15:34:05,907][213771] Updated weights for policy 0, policy_version 68140 (0.0006) [2023-03-07 15:34:06,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.8, 300 sec: 13253.0). Total num frames: 69777408. Throughput: 0: 13261.0. Samples: 69761241. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:34:06,106][213445] Avg episode reward: [(0, '4279.054')] [2023-03-07 15:34:06,110][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000068142_69777408.pth... [2023-03-07 15:34:06,140][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000065037_66597888.pth [2023-03-07 15:34:06,676][213771] Updated weights for policy 0, policy_version 68150 (0.0006) [2023-03-07 15:34:07,458][213771] Updated weights for policy 0, policy_version 68160 (0.0006) [2023-03-07 15:34:08,229][213771] Updated weights for policy 0, policy_version 68170 (0.0006) [2023-03-07 15:34:08,996][213771] Updated weights for policy 0, policy_version 68180 (0.0006) [2023-03-07 15:34:09,764][213771] Updated weights for policy 0, policy_version 68190 (0.0007) [2023-03-07 15:34:10,545][213771] Updated weights for policy 0, policy_version 68200 (0.0006) [2023-03-07 15:34:11,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 69843968. Throughput: 0: 13254.1. Samples: 69840890. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:34:11,106][213445] Avg episode reward: [(0, '4275.140')] [2023-03-07 15:34:11,318][213771] Updated weights for policy 0, policy_version 68210 (0.0006) [2023-03-07 15:34:12,082][213771] Updated weights for policy 0, policy_version 68220 (0.0006) [2023-03-07 15:34:12,856][213771] Updated weights for policy 0, policy_version 68230 (0.0007) [2023-03-07 15:34:13,633][213771] Updated weights for policy 0, policy_version 68240 (0.0006) [2023-03-07 15:34:14,401][213771] Updated weights for policy 0, policy_version 68250 (0.0006) [2023-03-07 15:34:15,189][213771] Updated weights for policy 0, policy_version 68260 (0.0007) [2023-03-07 15:34:15,943][213771] Updated weights for policy 0, policy_version 68270 (0.0006) [2023-03-07 15:34:16,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 69910528. Throughput: 0: 13246.1. Samples: 69880512. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:34:16,106][213445] Avg episode reward: [(0, '4292.975')] [2023-03-07 15:34:16,717][213771] Updated weights for policy 0, policy_version 68280 (0.0006) [2023-03-07 15:34:17,489][213771] Updated weights for policy 0, policy_version 68290 (0.0007) [2023-03-07 15:34:18,245][213771] Updated weights for policy 0, policy_version 68300 (0.0006) [2023-03-07 15:34:19,010][213771] Updated weights for policy 0, policy_version 68310 (0.0007) [2023-03-07 15:34:19,796][213771] Updated weights for policy 0, policy_version 68320 (0.0006) [2023-03-07 15:34:20,576][213771] Updated weights for policy 0, policy_version 68330 (0.0007) [2023-03-07 15:34:21,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 69976064. Throughput: 0: 13250.3. Samples: 69960147. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:34:21,106][213445] Avg episode reward: [(0, '4302.708')] [2023-03-07 15:34:21,342][213771] Updated weights for policy 0, policy_version 68340 (0.0006) [2023-03-07 15:34:22,110][213771] Updated weights for policy 0, policy_version 68350 (0.0005) [2023-03-07 15:34:22,883][213771] Updated weights for policy 0, policy_version 68360 (0.0006) [2023-03-07 15:34:23,641][213771] Updated weights for policy 0, policy_version 68370 (0.0006) [2023-03-07 15:34:24,428][213771] Updated weights for policy 0, policy_version 68380 (0.0006) [2023-03-07 15:34:25,198][213771] Updated weights for policy 0, policy_version 68390 (0.0005) [2023-03-07 15:34:25,968][213771] Updated weights for policy 0, policy_version 68400 (0.0006) [2023-03-07 15:34:26,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 70042624. Throughput: 0: 13249.0. Samples: 70039867. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:34:26,106][213445] Avg episode reward: [(0, '4271.518')] [2023-03-07 15:34:26,756][213771] Updated weights for policy 0, policy_version 68410 (0.0007) [2023-03-07 15:34:27,516][213771] Updated weights for policy 0, policy_version 68420 (0.0006) [2023-03-07 15:34:28,287][213771] Updated weights for policy 0, policy_version 68430 (0.0007) [2023-03-07 15:34:29,065][213771] Updated weights for policy 0, policy_version 68440 (0.0007) [2023-03-07 15:34:29,834][213771] Updated weights for policy 0, policy_version 68450 (0.0005) [2023-03-07 15:34:30,597][213771] Updated weights for policy 0, policy_version 68460 (0.0006) [2023-03-07 15:34:31,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 70109184. Throughput: 0: 13250.4. Samples: 70079616. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:34:31,106][213445] Avg episode reward: [(0, '4255.906')] [2023-03-07 15:34:31,361][213771] Updated weights for policy 0, policy_version 68470 (0.0005) [2023-03-07 15:34:32,137][213771] Updated weights for policy 0, policy_version 68480 (0.0007) [2023-03-07 15:34:32,917][213771] Updated weights for policy 0, policy_version 68490 (0.0006) [2023-03-07 15:34:33,668][213771] Updated weights for policy 0, policy_version 68500 (0.0007) [2023-03-07 15:34:34,441][213771] Updated weights for policy 0, policy_version 68510 (0.0006) [2023-03-07 15:34:35,205][213771] Updated weights for policy 0, policy_version 68520 (0.0006) [2023-03-07 15:34:35,990][213771] Updated weights for policy 0, policy_version 68530 (0.0006) [2023-03-07 15:34:36,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 70175744. Throughput: 0: 13258.6. Samples: 70159648. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:34:36,106][213445] Avg episode reward: [(0, '4270.428')] [2023-03-07 15:34:36,761][213771] Updated weights for policy 0, policy_version 68540 (0.0006) [2023-03-07 15:34:37,530][213771] Updated weights for policy 0, policy_version 68550 (0.0008) [2023-03-07 15:34:38,307][213771] Updated weights for policy 0, policy_version 68560 (0.0007) [2023-03-07 15:34:39,094][213771] Updated weights for policy 0, policy_version 68570 (0.0006) [2023-03-07 15:34:39,858][213771] Updated weights for policy 0, policy_version 68580 (0.0005) [2023-03-07 15:34:40,633][213771] Updated weights for policy 0, policy_version 68590 (0.0006) [2023-03-07 15:34:41,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 70242304. Throughput: 0: 13265.1. Samples: 70238919. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:34:41,106][213445] Avg episode reward: [(0, '4302.899')] [2023-03-07 15:34:41,403][213771] Updated weights for policy 0, policy_version 68600 (0.0007) [2023-03-07 15:34:42,184][213771] Updated weights for policy 0, policy_version 68610 (0.0008) [2023-03-07 15:34:42,942][213771] Updated weights for policy 0, policy_version 68620 (0.0006) [2023-03-07 15:34:43,724][213771] Updated weights for policy 0, policy_version 68630 (0.0006) [2023-03-07 15:34:44,490][213771] Updated weights for policy 0, policy_version 68640 (0.0006) [2023-03-07 15:34:45,267][213771] Updated weights for policy 0, policy_version 68650 (0.0007) [2023-03-07 15:34:46,045][213771] Updated weights for policy 0, policy_version 68660 (0.0005) [2023-03-07 15:34:46,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 70307840. Throughput: 0: 13260.1. Samples: 70278551. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:34:46,106][213445] Avg episode reward: [(0, '4182.687')] [2023-03-07 15:34:46,829][213771] Updated weights for policy 0, policy_version 68670 (0.0006) [2023-03-07 15:34:47,608][213771] Updated weights for policy 0, policy_version 68680 (0.0007) [2023-03-07 15:34:48,383][213771] Updated weights for policy 0, policy_version 68690 (0.0006) [2023-03-07 15:34:49,165][213771] Updated weights for policy 0, policy_version 68700 (0.0007) [2023-03-07 15:34:49,929][213771] Updated weights for policy 0, policy_version 68710 (0.0005) [2023-03-07 15:34:50,691][213771] Updated weights for policy 0, policy_version 68720 (0.0006) [2023-03-07 15:34:51,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.8, 300 sec: 13249.5). Total num frames: 70374400. Throughput: 0: 13258.1. Samples: 70357855. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:34:51,106][213445] Avg episode reward: [(0, '4312.199')] [2023-03-07 15:34:51,492][213771] Updated weights for policy 0, policy_version 68730 (0.0007) [2023-03-07 15:34:52,246][213771] Updated weights for policy 0, policy_version 68740 (0.0007) [2023-03-07 15:34:53,010][213771] Updated weights for policy 0, policy_version 68750 (0.0007) [2023-03-07 15:34:53,809][213771] Updated weights for policy 0, policy_version 68760 (0.0006) [2023-03-07 15:34:54,568][213771] Updated weights for policy 0, policy_version 68770 (0.0006) [2023-03-07 15:34:55,345][213771] Updated weights for policy 0, policy_version 68780 (0.0005) [2023-03-07 15:34:56,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 70439936. Throughput: 0: 13248.9. Samples: 70437094. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:34:56,106][213445] Avg episode reward: [(0, '4337.128')] [2023-03-07 15:34:56,124][213771] Updated weights for policy 0, policy_version 68790 (0.0006) [2023-03-07 15:34:56,890][213771] Updated weights for policy 0, policy_version 68800 (0.0007) [2023-03-07 15:34:57,665][213771] Updated weights for policy 0, policy_version 68810 (0.0007) [2023-03-07 15:34:58,425][213771] Updated weights for policy 0, policy_version 68820 (0.0006) [2023-03-07 15:34:59,215][213771] Updated weights for policy 0, policy_version 68830 (0.0007) [2023-03-07 15:34:59,992][213771] Updated weights for policy 0, policy_version 68840 (0.0006) [2023-03-07 15:35:00,737][213771] Updated weights for policy 0, policy_version 68850 (0.0006) [2023-03-07 15:35:01,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 70506496. Throughput: 0: 13253.5. Samples: 70476923. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:35:01,106][213445] Avg episode reward: [(0, '4326.771')] [2023-03-07 15:35:01,516][213771] Updated weights for policy 0, policy_version 68860 (0.0006) [2023-03-07 15:35:02,294][213771] Updated weights for policy 0, policy_version 68870 (0.0006) [2023-03-07 15:35:03,062][213771] Updated weights for policy 0, policy_version 68880 (0.0006) [2023-03-07 15:35:03,825][213771] Updated weights for policy 0, policy_version 68890 (0.0006) [2023-03-07 15:35:04,610][213771] Updated weights for policy 0, policy_version 68900 (0.0006) [2023-03-07 15:35:05,380][213771] Updated weights for policy 0, policy_version 68910 (0.0006) [2023-03-07 15:35:06,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 70573056. Throughput: 0: 13256.5. Samples: 70556692. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:35:06,106][213445] Avg episode reward: [(0, '4302.031')] [2023-03-07 15:35:06,163][213771] Updated weights for policy 0, policy_version 68920 (0.0007) [2023-03-07 15:35:06,918][213771] Updated weights for policy 0, policy_version 68930 (0.0006) [2023-03-07 15:35:07,682][213771] Updated weights for policy 0, policy_version 68940 (0.0006) [2023-03-07 15:35:08,446][213771] Updated weights for policy 0, policy_version 68950 (0.0006) [2023-03-07 15:35:09,210][213771] Updated weights for policy 0, policy_version 68960 (0.0007) [2023-03-07 15:35:09,976][213771] Updated weights for policy 0, policy_version 68970 (0.0005) [2023-03-07 15:35:10,759][213771] Updated weights for policy 0, policy_version 68980 (0.0006) [2023-03-07 15:35:11,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 70639616. Throughput: 0: 13259.9. Samples: 70636561. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:35:11,106][213445] Avg episode reward: [(0, '4320.497')] [2023-03-07 15:35:11,526][213771] Updated weights for policy 0, policy_version 68990 (0.0006) [2023-03-07 15:35:12,297][213771] Updated weights for policy 0, policy_version 69000 (0.0006) [2023-03-07 15:35:13,073][213771] Updated weights for policy 0, policy_version 69010 (0.0006) [2023-03-07 15:35:13,834][213771] Updated weights for policy 0, policy_version 69020 (0.0006) [2023-03-07 15:35:14,610][213771] Updated weights for policy 0, policy_version 69030 (0.0007) [2023-03-07 15:35:15,361][213771] Updated weights for policy 0, policy_version 69040 (0.0006) [2023-03-07 15:35:16,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 70706176. Throughput: 0: 13264.3. Samples: 70676509. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:35:16,106][213445] Avg episode reward: [(0, '4304.022')] [2023-03-07 15:35:16,126][213771] Updated weights for policy 0, policy_version 69050 (0.0006) [2023-03-07 15:35:16,900][213771] Updated weights for policy 0, policy_version 69060 (0.0007) [2023-03-07 15:35:17,678][213771] Updated weights for policy 0, policy_version 69070 (0.0006) [2023-03-07 15:35:18,453][213771] Updated weights for policy 0, policy_version 69080 (0.0006) [2023-03-07 15:35:19,223][213771] Updated weights for policy 0, policy_version 69090 (0.0006) [2023-03-07 15:35:19,994][213771] Updated weights for policy 0, policy_version 69100 (0.0006) [2023-03-07 15:35:20,782][213771] Updated weights for policy 0, policy_version 69110 (0.0007) [2023-03-07 15:35:21,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13277.8, 300 sec: 13253.0). Total num frames: 70772736. Throughput: 0: 13257.4. Samples: 70756233. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:35:21,106][213445] Avg episode reward: [(0, '4202.935')] [2023-03-07 15:35:21,557][213771] Updated weights for policy 0, policy_version 69120 (0.0006) [2023-03-07 15:35:22,332][213771] Updated weights for policy 0, policy_version 69130 (0.0006) [2023-03-07 15:35:23,106][213771] Updated weights for policy 0, policy_version 69140 (0.0006) [2023-03-07 15:35:23,880][213771] Updated weights for policy 0, policy_version 69150 (0.0006) [2023-03-07 15:35:24,652][213771] Updated weights for policy 0, policy_version 69160 (0.0007) [2023-03-07 15:35:25,420][213771] Updated weights for policy 0, policy_version 69170 (0.0006) [2023-03-07 15:35:26,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13277.9, 300 sec: 13253.0). Total num frames: 70839296. Throughput: 0: 13260.3. Samples: 70835633. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:35:26,106][213445] Avg episode reward: [(0, '4047.433')] [2023-03-07 15:35:26,194][213771] Updated weights for policy 0, policy_version 69180 (0.0006) [2023-03-07 15:35:26,949][213771] Updated weights for policy 0, policy_version 69190 (0.0006) [2023-03-07 15:35:27,725][213771] Updated weights for policy 0, policy_version 69200 (0.0006) [2023-03-07 15:35:28,518][213771] Updated weights for policy 0, policy_version 69210 (0.0006) [2023-03-07 15:35:29,296][213771] Updated weights for policy 0, policy_version 69220 (0.0006) [2023-03-07 15:35:30,072][213771] Updated weights for policy 0, policy_version 69230 (0.0006) [2023-03-07 15:35:30,843][213771] Updated weights for policy 0, policy_version 69240 (0.0006) [2023-03-07 15:35:31,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 70904832. Throughput: 0: 13262.6. Samples: 70875365. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:35:31,105][213445] Avg episode reward: [(0, '4135.418')] [2023-03-07 15:35:31,599][213771] Updated weights for policy 0, policy_version 69250 (0.0006) [2023-03-07 15:35:32,377][213771] Updated weights for policy 0, policy_version 69260 (0.0006) [2023-03-07 15:35:33,141][213771] Updated weights for policy 0, policy_version 69270 (0.0006) [2023-03-07 15:35:33,931][213771] Updated weights for policy 0, policy_version 69280 (0.0006) [2023-03-07 15:35:34,706][213771] Updated weights for policy 0, policy_version 69290 (0.0006) [2023-03-07 15:35:35,476][213771] Updated weights for policy 0, policy_version 69300 (0.0007) [2023-03-07 15:35:36,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 70971392. Throughput: 0: 13264.5. Samples: 70954758. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:35:36,106][213445] Avg episode reward: [(0, '4287.690')] [2023-03-07 15:35:36,229][213771] Updated weights for policy 0, policy_version 69310 (0.0007) [2023-03-07 15:35:37,025][213771] Updated weights for policy 0, policy_version 69320 (0.0006) [2023-03-07 15:35:37,805][213771] Updated weights for policy 0, policy_version 69330 (0.0006) [2023-03-07 15:35:38,594][213771] Updated weights for policy 0, policy_version 69340 (0.0006) [2023-03-07 15:35:39,367][213771] Updated weights for policy 0, policy_version 69350 (0.0005) [2023-03-07 15:35:40,140][213771] Updated weights for policy 0, policy_version 69360 (0.0007) [2023-03-07 15:35:40,913][213771] Updated weights for policy 0, policy_version 69370 (0.0006) [2023-03-07 15:35:41,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 71036928. Throughput: 0: 13263.3. Samples: 71033941. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:35:41,106][213445] Avg episode reward: [(0, '4274.692')] [2023-03-07 15:35:41,682][213771] Updated weights for policy 0, policy_version 69380 (0.0006) [2023-03-07 15:35:42,461][213771] Updated weights for policy 0, policy_version 69390 (0.0006) [2023-03-07 15:35:43,235][213771] Updated weights for policy 0, policy_version 69400 (0.0006) [2023-03-07 15:35:43,998][213771] Updated weights for policy 0, policy_version 69410 (0.0007) [2023-03-07 15:35:44,780][213771] Updated weights for policy 0, policy_version 69420 (0.0006) [2023-03-07 15:35:45,561][213771] Updated weights for policy 0, policy_version 69430 (0.0006) [2023-03-07 15:35:46,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 71103488. Throughput: 0: 13258.5. Samples: 71073554. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:35:46,106][213445] Avg episode reward: [(0, '4311.503')] [2023-03-07 15:35:46,345][213771] Updated weights for policy 0, policy_version 69440 (0.0006) [2023-03-07 15:35:47,136][213771] Updated weights for policy 0, policy_version 69450 (0.0006) [2023-03-07 15:35:47,895][213771] Updated weights for policy 0, policy_version 69460 (0.0005) [2023-03-07 15:35:48,684][213771] Updated weights for policy 0, policy_version 69470 (0.0006) [2023-03-07 15:35:49,469][213771] Updated weights for policy 0, policy_version 69480 (0.0005) [2023-03-07 15:35:50,230][213771] Updated weights for policy 0, policy_version 69490 (0.0006) [2023-03-07 15:35:51,005][213771] Updated weights for policy 0, policy_version 69500 (0.0007) [2023-03-07 15:35:51,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.8, 300 sec: 13246.1). Total num frames: 71169024. Throughput: 0: 13239.8. Samples: 71152480. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:35:51,105][213445] Avg episode reward: [(0, '4313.277')] [2023-03-07 15:35:51,774][213771] Updated weights for policy 0, policy_version 69510 (0.0006) [2023-03-07 15:35:52,541][213771] Updated weights for policy 0, policy_version 69520 (0.0006) [2023-03-07 15:35:53,321][213771] Updated weights for policy 0, policy_version 69530 (0.0006) [2023-03-07 15:35:54,098][213771] Updated weights for policy 0, policy_version 69540 (0.0006) [2023-03-07 15:35:54,862][213771] Updated weights for policy 0, policy_version 69550 (0.0007) [2023-03-07 15:35:55,631][213771] Updated weights for policy 0, policy_version 69560 (0.0006) [2023-03-07 15:35:56,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 71235584. Throughput: 0: 13233.2. Samples: 71232055. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:35:56,106][213445] Avg episode reward: [(0, '4354.420')] [2023-03-07 15:35:56,411][213771] Updated weights for policy 0, policy_version 69570 (0.0006) [2023-03-07 15:35:57,183][213771] Updated weights for policy 0, policy_version 69580 (0.0006) [2023-03-07 15:35:57,935][213771] Updated weights for policy 0, policy_version 69590 (0.0005) [2023-03-07 15:35:58,704][213771] Updated weights for policy 0, policy_version 69600 (0.0007) [2023-03-07 15:35:59,486][213771] Updated weights for policy 0, policy_version 69610 (0.0006) [2023-03-07 15:36:00,251][213771] Updated weights for policy 0, policy_version 69620 (0.0006) [2023-03-07 15:36:01,012][213771] Updated weights for policy 0, policy_version 69630 (0.0006) [2023-03-07 15:36:01,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.8, 300 sec: 13246.1). Total num frames: 71301120. Throughput: 0: 13234.6. Samples: 71272066. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:36:01,106][213445] Avg episode reward: [(0, '4325.143')] [2023-03-07 15:36:01,809][213771] Updated weights for policy 0, policy_version 69640 (0.0007) [2023-03-07 15:36:02,555][213771] Updated weights for policy 0, policy_version 69650 (0.0006) [2023-03-07 15:36:03,340][213771] Updated weights for policy 0, policy_version 69660 (0.0007) [2023-03-07 15:36:04,095][213771] Updated weights for policy 0, policy_version 69670 (0.0006) [2023-03-07 15:36:04,871][213771] Updated weights for policy 0, policy_version 69680 (0.0006) [2023-03-07 15:36:05,677][213771] Updated weights for policy 0, policy_version 69690 (0.0006) [2023-03-07 15:36:06,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 71367680. Throughput: 0: 13229.9. Samples: 71351581. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:36:06,106][213445] Avg episode reward: [(0, '4350.561')] [2023-03-07 15:36:06,111][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000069695_71367680.pth... [2023-03-07 15:36:06,141][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000066591_68189184.pth [2023-03-07 15:36:06,435][213771] Updated weights for policy 0, policy_version 69700 (0.0006) [2023-03-07 15:36:07,201][213771] Updated weights for policy 0, policy_version 69710 (0.0007) [2023-03-07 15:36:07,969][213771] Updated weights for policy 0, policy_version 69720 (0.0006) [2023-03-07 15:36:08,733][213771] Updated weights for policy 0, policy_version 69730 (0.0006) [2023-03-07 15:36:09,513][213771] Updated weights for policy 0, policy_version 69740 (0.0006) [2023-03-07 15:36:10,292][213771] Updated weights for policy 0, policy_version 69750 (0.0006) [2023-03-07 15:36:11,058][213771] Updated weights for policy 0, policy_version 69760 (0.0006) [2023-03-07 15:36:11,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13243.8, 300 sec: 13249.5). Total num frames: 71434240. Throughput: 0: 13232.8. Samples: 71431110. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:36:11,106][213445] Avg episode reward: [(0, '4341.912')] [2023-03-07 15:36:11,838][213771] Updated weights for policy 0, policy_version 69770 (0.0006) [2023-03-07 15:36:12,599][213771] Updated weights for policy 0, policy_version 69780 (0.0007) [2023-03-07 15:36:13,377][213771] Updated weights for policy 0, policy_version 69790 (0.0006) [2023-03-07 15:36:14,146][213771] Updated weights for policy 0, policy_version 69800 (0.0006) [2023-03-07 15:36:14,925][213771] Updated weights for policy 0, policy_version 69810 (0.0005) [2023-03-07 15:36:15,699][213771] Updated weights for policy 0, policy_version 69820 (0.0006) [2023-03-07 15:36:16,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13243.7, 300 sec: 13253.0). Total num frames: 71500800. Throughput: 0: 13235.5. Samples: 71470966. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:36:16,106][213445] Avg episode reward: [(0, '4397.191')] [2023-03-07 15:36:16,473][213771] Updated weights for policy 0, policy_version 69830 (0.0006) [2023-03-07 15:36:17,242][213771] Updated weights for policy 0, policy_version 69840 (0.0005) [2023-03-07 15:36:18,033][213771] Updated weights for policy 0, policy_version 69850 (0.0006) [2023-03-07 15:36:18,807][213771] Updated weights for policy 0, policy_version 69860 (0.0007) [2023-03-07 15:36:19,588][213771] Updated weights for policy 0, policy_version 69870 (0.0007) [2023-03-07 15:36:20,349][213771] Updated weights for policy 0, policy_version 69880 (0.0006) [2023-03-07 15:36:21,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13249.5). Total num frames: 71566336. Throughput: 0: 13233.4. Samples: 71550261. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:36:21,106][213445] Avg episode reward: [(0, '4405.838')] [2023-03-07 15:36:21,149][213771] Updated weights for policy 0, policy_version 69890 (0.0006) [2023-03-07 15:36:21,905][213771] Updated weights for policy 0, policy_version 69900 (0.0007) [2023-03-07 15:36:22,682][213771] Updated weights for policy 0, policy_version 69910 (0.0006) [2023-03-07 15:36:23,462][213771] Updated weights for policy 0, policy_version 69920 (0.0006) [2023-03-07 15:36:24,230][213771] Updated weights for policy 0, policy_version 69930 (0.0006) [2023-03-07 15:36:25,006][213771] Updated weights for policy 0, policy_version 69940 (0.0006) [2023-03-07 15:36:25,789][213771] Updated weights for policy 0, policy_version 69950 (0.0005) [2023-03-07 15:36:26,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13226.6, 300 sec: 13249.5). Total num frames: 71632896. Throughput: 0: 13233.5. Samples: 71629451. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:36:26,106][213445] Avg episode reward: [(0, '4414.071')] [2023-03-07 15:36:26,550][213771] Updated weights for policy 0, policy_version 69960 (0.0006) [2023-03-07 15:36:27,325][213771] Updated weights for policy 0, policy_version 69970 (0.0006) [2023-03-07 15:36:28,122][213771] Updated weights for policy 0, policy_version 69980 (0.0006) [2023-03-07 15:36:28,870][213771] Updated weights for policy 0, policy_version 69990 (0.0007) [2023-03-07 15:36:29,643][213771] Updated weights for policy 0, policy_version 70000 (0.0006) [2023-03-07 15:36:30,419][213771] Updated weights for policy 0, policy_version 70010 (0.0005) [2023-03-07 15:36:31,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13226.6, 300 sec: 13246.0). Total num frames: 71698432. Throughput: 0: 13236.1. Samples: 71669176. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:36:31,106][213445] Avg episode reward: [(0, '4418.850')] [2023-03-07 15:36:31,186][213771] Updated weights for policy 0, policy_version 70020 (0.0005) [2023-03-07 15:36:31,954][213771] Updated weights for policy 0, policy_version 70030 (0.0006) [2023-03-07 15:36:32,737][213771] Updated weights for policy 0, policy_version 70040 (0.0006) [2023-03-07 15:36:33,502][213771] Updated weights for policy 0, policy_version 70050 (0.0006) [2023-03-07 15:36:34,274][213771] Updated weights for policy 0, policy_version 70060 (0.0006) [2023-03-07 15:36:35,053][213771] Updated weights for policy 0, policy_version 70070 (0.0006) [2023-03-07 15:36:35,819][213771] Updated weights for policy 0, policy_version 70080 (0.0006) [2023-03-07 15:36:36,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13249.5). Total num frames: 71764992. Throughput: 0: 13254.6. Samples: 71748939. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:36:36,106][213445] Avg episode reward: [(0, '4407.376')] [2023-03-07 15:36:36,595][213771] Updated weights for policy 0, policy_version 70090 (0.0005) [2023-03-07 15:36:37,363][213771] Updated weights for policy 0, policy_version 70100 (0.0005) [2023-03-07 15:36:38,143][213771] Updated weights for policy 0, policy_version 70110 (0.0006) [2023-03-07 15:36:38,920][213771] Updated weights for policy 0, policy_version 70120 (0.0006) [2023-03-07 15:36:39,706][213771] Updated weights for policy 0, policy_version 70130 (0.0007) [2023-03-07 15:36:40,487][213771] Updated weights for policy 0, policy_version 70140 (0.0006) [2023-03-07 15:36:41,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 71831552. Throughput: 0: 13244.5. Samples: 71828058. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:36:41,106][213445] Avg episode reward: [(0, '4413.933')] [2023-03-07 15:36:41,259][213771] Updated weights for policy 0, policy_version 70150 (0.0006) [2023-03-07 15:36:42,031][213771] Updated weights for policy 0, policy_version 70160 (0.0006) [2023-03-07 15:36:42,806][213771] Updated weights for policy 0, policy_version 70170 (0.0006) [2023-03-07 15:36:43,582][213771] Updated weights for policy 0, policy_version 70180 (0.0006) [2023-03-07 15:36:44,344][213771] Updated weights for policy 0, policy_version 70190 (0.0006) [2023-03-07 15:36:45,123][213771] Updated weights for policy 0, policy_version 70200 (0.0006) [2023-03-07 15:36:45,904][213771] Updated weights for policy 0, policy_version 70210 (0.0006) [2023-03-07 15:36:46,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13246.0). Total num frames: 71897088. Throughput: 0: 13238.4. Samples: 71867793. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:36:46,106][213445] Avg episode reward: [(0, '4321.552')] [2023-03-07 15:36:46,677][213771] Updated weights for policy 0, policy_version 70220 (0.0005) [2023-03-07 15:36:47,444][213771] Updated weights for policy 0, policy_version 70230 (0.0006) [2023-03-07 15:36:48,214][213771] Updated weights for policy 0, policy_version 70240 (0.0007) [2023-03-07 15:36:48,984][213771] Updated weights for policy 0, policy_version 70250 (0.0006) [2023-03-07 15:36:49,752][213771] Updated weights for policy 0, policy_version 70260 (0.0006) [2023-03-07 15:36:50,508][213771] Updated weights for policy 0, policy_version 70270 (0.0007) [2023-03-07 15:36:51,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 71963648. Throughput: 0: 13243.0. Samples: 71947514. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:36:51,106][213445] Avg episode reward: [(0, '4378.764')] [2023-03-07 15:36:51,281][213771] Updated weights for policy 0, policy_version 70280 (0.0006) [2023-03-07 15:36:52,059][213771] Updated weights for policy 0, policy_version 70290 (0.0006) [2023-03-07 15:36:52,830][213771] Updated weights for policy 0, policy_version 70300 (0.0007) [2023-03-07 15:36:53,608][213771] Updated weights for policy 0, policy_version 70310 (0.0006) [2023-03-07 15:36:54,373][213771] Updated weights for policy 0, policy_version 70320 (0.0007) [2023-03-07 15:36:55,149][213771] Updated weights for policy 0, policy_version 70330 (0.0005) [2023-03-07 15:36:55,907][213771] Updated weights for policy 0, policy_version 70340 (0.0006) [2023-03-07 15:36:56,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 72030208. Throughput: 0: 13244.1. Samples: 72027096. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:36:56,106][213445] Avg episode reward: [(0, '4400.680')] [2023-03-07 15:36:56,693][213771] Updated weights for policy 0, policy_version 70350 (0.0006) [2023-03-07 15:36:57,469][213771] Updated weights for policy 0, policy_version 70360 (0.0006) [2023-03-07 15:36:58,250][213771] Updated weights for policy 0, policy_version 70370 (0.0006) [2023-03-07 15:36:59,016][213771] Updated weights for policy 0, policy_version 70380 (0.0005) [2023-03-07 15:36:59,782][213771] Updated weights for policy 0, policy_version 70390 (0.0006) [2023-03-07 15:37:00,564][213771] Updated weights for policy 0, policy_version 70400 (0.0006) [2023-03-07 15:37:01,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 72096768. Throughput: 0: 13239.6. Samples: 72066748. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:37:01,106][213445] Avg episode reward: [(0, '4422.206')] [2023-03-07 15:37:01,335][213771] Updated weights for policy 0, policy_version 70410 (0.0007) [2023-03-07 15:37:02,114][213771] Updated weights for policy 0, policy_version 70420 (0.0006) [2023-03-07 15:37:02,890][213771] Updated weights for policy 0, policy_version 70430 (0.0006) [2023-03-07 15:37:03,660][213771] Updated weights for policy 0, policy_version 70440 (0.0006) [2023-03-07 15:37:04,437][213771] Updated weights for policy 0, policy_version 70450 (0.0007) [2023-03-07 15:37:05,200][213771] Updated weights for policy 0, policy_version 70460 (0.0007) [2023-03-07 15:37:05,970][213771] Updated weights for policy 0, policy_version 70470 (0.0007) [2023-03-07 15:37:06,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.8, 300 sec: 13246.0). Total num frames: 72162304. Throughput: 0: 13243.7. Samples: 72146227. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:37:06,106][213445] Avg episode reward: [(0, '4377.655')] [2023-03-07 15:37:06,742][213771] Updated weights for policy 0, policy_version 70480 (0.0006) [2023-03-07 15:37:07,504][213771] Updated weights for policy 0, policy_version 70490 (0.0006) [2023-03-07 15:37:08,289][213771] Updated weights for policy 0, policy_version 70500 (0.0006) [2023-03-07 15:37:09,045][213771] Updated weights for policy 0, policy_version 70510 (0.0005) [2023-03-07 15:37:09,829][213771] Updated weights for policy 0, policy_version 70520 (0.0007) [2023-03-07 15:37:10,611][213771] Updated weights for policy 0, policy_version 70530 (0.0006) [2023-03-07 15:37:11,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 72228864. Throughput: 0: 13253.3. Samples: 72225849. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:37:11,106][213445] Avg episode reward: [(0, '4449.988')] [2023-03-07 15:37:11,392][213771] Updated weights for policy 0, policy_version 70540 (0.0005) [2023-03-07 15:37:12,151][213771] Updated weights for policy 0, policy_version 70550 (0.0006) [2023-03-07 15:37:12,925][213771] Updated weights for policy 0, policy_version 70560 (0.0006) [2023-03-07 15:37:13,697][213771] Updated weights for policy 0, policy_version 70570 (0.0006) [2023-03-07 15:37:14,477][213771] Updated weights for policy 0, policy_version 70580 (0.0006) [2023-03-07 15:37:15,252][213771] Updated weights for policy 0, policy_version 70590 (0.0006) [2023-03-07 15:37:16,040][213771] Updated weights for policy 0, policy_version 70600 (0.0006) [2023-03-07 15:37:16,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 72295424. Throughput: 0: 13253.1. Samples: 72265564. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:37:16,106][213445] Avg episode reward: [(0, '4402.864')] [2023-03-07 15:37:16,797][213771] Updated weights for policy 0, policy_version 70610 (0.0005) [2023-03-07 15:37:17,566][213771] Updated weights for policy 0, policy_version 70620 (0.0006) [2023-03-07 15:37:18,346][213771] Updated weights for policy 0, policy_version 70630 (0.0006) [2023-03-07 15:37:19,129][213771] Updated weights for policy 0, policy_version 70640 (0.0007) [2023-03-07 15:37:19,898][213771] Updated weights for policy 0, policy_version 70650 (0.0006) [2023-03-07 15:37:20,677][213771] Updated weights for policy 0, policy_version 70660 (0.0006) [2023-03-07 15:37:21,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 72360960. Throughput: 0: 13242.0. Samples: 72344829. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:37:21,106][213445] Avg episode reward: [(0, '4377.575')] [2023-03-07 15:37:21,450][213771] Updated weights for policy 0, policy_version 70670 (0.0006) [2023-03-07 15:37:22,221][213771] Updated weights for policy 0, policy_version 70680 (0.0005) [2023-03-07 15:37:23,013][213771] Updated weights for policy 0, policy_version 70690 (0.0006) [2023-03-07 15:37:23,794][213771] Updated weights for policy 0, policy_version 70700 (0.0007) [2023-03-07 15:37:24,546][213771] Updated weights for policy 0, policy_version 70710 (0.0006) [2023-03-07 15:37:25,320][213771] Updated weights for policy 0, policy_version 70720 (0.0007) [2023-03-07 15:37:26,082][213771] Updated weights for policy 0, policy_version 70730 (0.0005) [2023-03-07 15:37:26,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 72427520. Throughput: 0: 13245.4. Samples: 72424099. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:37:26,106][213445] Avg episode reward: [(0, '4330.166')] [2023-03-07 15:37:26,855][213771] Updated weights for policy 0, policy_version 70740 (0.0006) [2023-03-07 15:37:27,647][213771] Updated weights for policy 0, policy_version 70750 (0.0006) [2023-03-07 15:37:28,409][213771] Updated weights for policy 0, policy_version 70760 (0.0006) [2023-03-07 15:37:29,185][213771] Updated weights for policy 0, policy_version 70770 (0.0006) [2023-03-07 15:37:29,975][213771] Updated weights for policy 0, policy_version 70780 (0.0006) [2023-03-07 15:37:30,745][213771] Updated weights for policy 0, policy_version 70790 (0.0006) [2023-03-07 15:37:31,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 72493056. Throughput: 0: 13246.7. Samples: 72463893. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:37:31,106][213445] Avg episode reward: [(0, '4389.576')] [2023-03-07 15:37:31,513][213771] Updated weights for policy 0, policy_version 70800 (0.0006) [2023-03-07 15:37:32,280][213771] Updated weights for policy 0, policy_version 70810 (0.0005) [2023-03-07 15:37:33,030][213771] Updated weights for policy 0, policy_version 70820 (0.0006) [2023-03-07 15:37:33,825][213771] Updated weights for policy 0, policy_version 70830 (0.0006) [2023-03-07 15:37:34,596][213771] Updated weights for policy 0, policy_version 70840 (0.0006) [2023-03-07 15:37:35,374][213771] Updated weights for policy 0, policy_version 70850 (0.0005) [2023-03-07 15:37:36,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 72559616. Throughput: 0: 13240.0. Samples: 72543315. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:37:36,106][213445] Avg episode reward: [(0, '4397.296')] [2023-03-07 15:37:36,150][213771] Updated weights for policy 0, policy_version 70860 (0.0006) [2023-03-07 15:37:36,921][213771] Updated weights for policy 0, policy_version 70870 (0.0005) [2023-03-07 15:37:37,693][213771] Updated weights for policy 0, policy_version 70880 (0.0006) [2023-03-07 15:37:38,461][213771] Updated weights for policy 0, policy_version 70890 (0.0006) [2023-03-07 15:37:39,242][213771] Updated weights for policy 0, policy_version 70900 (0.0007) [2023-03-07 15:37:40,013][213771] Updated weights for policy 0, policy_version 70910 (0.0006) [2023-03-07 15:37:40,801][213771] Updated weights for policy 0, policy_version 70920 (0.0006) [2023-03-07 15:37:41,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13226.7, 300 sec: 13246.0). Total num frames: 72625152. Throughput: 0: 13235.4. Samples: 72622688. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:37:41,106][213445] Avg episode reward: [(0, '4366.397')] [2023-03-07 15:37:41,579][213771] Updated weights for policy 0, policy_version 70930 (0.0007) [2023-03-07 15:37:42,332][213771] Updated weights for policy 0, policy_version 70940 (0.0006) [2023-03-07 15:37:43,101][213771] Updated weights for policy 0, policy_version 70950 (0.0006) [2023-03-07 15:37:43,872][213771] Updated weights for policy 0, policy_version 70960 (0.0005) [2023-03-07 15:37:44,661][213771] Updated weights for policy 0, policy_version 70970 (0.0006) [2023-03-07 15:37:45,425][213771] Updated weights for policy 0, policy_version 70980 (0.0006) [2023-03-07 15:37:46,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 72692736. Throughput: 0: 13238.9. Samples: 72662499. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:37:46,106][213445] Avg episode reward: [(0, '4423.826')] [2023-03-07 15:37:46,194][213771] Updated weights for policy 0, policy_version 70990 (0.0006) [2023-03-07 15:37:46,954][213771] Updated weights for policy 0, policy_version 71000 (0.0006) [2023-03-07 15:37:47,741][213771] Updated weights for policy 0, policy_version 71010 (0.0006) [2023-03-07 15:37:48,515][213771] Updated weights for policy 0, policy_version 71020 (0.0006) [2023-03-07 15:37:49,290][213771] Updated weights for policy 0, policy_version 71030 (0.0007) [2023-03-07 15:37:50,085][213771] Updated weights for policy 0, policy_version 71040 (0.0006) [2023-03-07 15:37:50,868][213771] Updated weights for policy 0, policy_version 71050 (0.0006) [2023-03-07 15:37:51,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 72758272. Throughput: 0: 13235.9. Samples: 72741840. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:37:51,106][213445] Avg episode reward: [(0, '4392.031')] [2023-03-07 15:37:51,644][213771] Updated weights for policy 0, policy_version 71060 (0.0006) [2023-03-07 15:37:52,412][213771] Updated weights for policy 0, policy_version 71070 (0.0005) [2023-03-07 15:37:53,197][213771] Updated weights for policy 0, policy_version 71080 (0.0008) [2023-03-07 15:37:53,953][213771] Updated weights for policy 0, policy_version 71090 (0.0006) [2023-03-07 15:37:54,712][213771] Updated weights for policy 0, policy_version 71100 (0.0006) [2023-03-07 15:37:55,481][213771] Updated weights for policy 0, policy_version 71110 (0.0006) [2023-03-07 15:37:56,105][213445] Fps is (10 sec: 13209.9, 60 sec: 13243.8, 300 sec: 13246.1). Total num frames: 72824832. Throughput: 0: 13232.8. Samples: 72821326. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:37:56,105][213445] Avg episode reward: [(0, '4427.363')] [2023-03-07 15:37:56,261][213771] Updated weights for policy 0, policy_version 71120 (0.0005) [2023-03-07 15:37:57,040][213771] Updated weights for policy 0, policy_version 71130 (0.0006) [2023-03-07 15:37:57,812][213771] Updated weights for policy 0, policy_version 71140 (0.0006) [2023-03-07 15:37:58,595][213771] Updated weights for policy 0, policy_version 71150 (0.0006) [2023-03-07 15:37:59,342][213771] Updated weights for policy 0, policy_version 71160 (0.0006) [2023-03-07 15:38:00,136][213771] Updated weights for policy 0, policy_version 71170 (0.0006) [2023-03-07 15:38:00,919][213771] Updated weights for policy 0, policy_version 71180 (0.0006) [2023-03-07 15:38:01,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13226.7, 300 sec: 13246.0). Total num frames: 72890368. Throughput: 0: 13227.9. Samples: 72860820. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:38:01,106][213445] Avg episode reward: [(0, '4382.833')] [2023-03-07 15:38:01,700][213771] Updated weights for policy 0, policy_version 71190 (0.0006) [2023-03-07 15:38:02,465][213771] Updated weights for policy 0, policy_version 71200 (0.0006) [2023-03-07 15:38:03,240][213771] Updated weights for policy 0, policy_version 71210 (0.0007) [2023-03-07 15:38:04,012][213771] Updated weights for policy 0, policy_version 71220 (0.0006) [2023-03-07 15:38:04,776][213771] Updated weights for policy 0, policy_version 71230 (0.0006) [2023-03-07 15:38:05,542][213771] Updated weights for policy 0, policy_version 71240 (0.0006) [2023-03-07 15:38:06,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 72956928. Throughput: 0: 13235.0. Samples: 72940404. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:38:06,106][213445] Avg episode reward: [(0, '4307.865')] [2023-03-07 15:38:06,110][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000071247_72956928.pth... [2023-03-07 15:38:06,141][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000068142_69777408.pth [2023-03-07 15:38:06,322][213771] Updated weights for policy 0, policy_version 71250 (0.0007) [2023-03-07 15:38:07,093][213771] Updated weights for policy 0, policy_version 71260 (0.0007) [2023-03-07 15:38:07,865][213771] Updated weights for policy 0, policy_version 71270 (0.0006) [2023-03-07 15:38:08,641][213771] Updated weights for policy 0, policy_version 71280 (0.0008) [2023-03-07 15:38:09,419][213771] Updated weights for policy 0, policy_version 71290 (0.0006) [2023-03-07 15:38:10,192][213771] Updated weights for policy 0, policy_version 71300 (0.0008) [2023-03-07 15:38:10,961][213771] Updated weights for policy 0, policy_version 71310 (0.0006) [2023-03-07 15:38:11,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13246.1). Total num frames: 73022464. Throughput: 0: 13236.2. Samples: 73019726. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:38:11,106][213445] Avg episode reward: [(0, '4318.071')] [2023-03-07 15:38:11,743][213771] Updated weights for policy 0, policy_version 71320 (0.0006) [2023-03-07 15:38:12,508][213771] Updated weights for policy 0, policy_version 71330 (0.0007) [2023-03-07 15:38:13,289][213771] Updated weights for policy 0, policy_version 71340 (0.0007) [2023-03-07 15:38:14,063][213771] Updated weights for policy 0, policy_version 71350 (0.0006) [2023-03-07 15:38:14,817][213771] Updated weights for policy 0, policy_version 71360 (0.0006) [2023-03-07 15:38:15,593][213771] Updated weights for policy 0, policy_version 71370 (0.0005) [2023-03-07 15:38:16,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13226.7, 300 sec: 13246.0). Total num frames: 73089024. Throughput: 0: 13231.1. Samples: 73059294. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:38:16,106][213445] Avg episode reward: [(0, '4301.970')] [2023-03-07 15:38:16,365][213771] Updated weights for policy 0, policy_version 71380 (0.0006) [2023-03-07 15:38:17,138][213771] Updated weights for policy 0, policy_version 71390 (0.0006) [2023-03-07 15:38:17,916][213771] Updated weights for policy 0, policy_version 71400 (0.0006) [2023-03-07 15:38:18,682][213771] Updated weights for policy 0, policy_version 71410 (0.0006) [2023-03-07 15:38:19,444][213771] Updated weights for policy 0, policy_version 71420 (0.0006) [2023-03-07 15:38:20,207][213771] Updated weights for policy 0, policy_version 71430 (0.0005) [2023-03-07 15:38:20,975][213771] Updated weights for policy 0, policy_version 71440 (0.0006) [2023-03-07 15:38:21,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13243.7, 300 sec: 13246.1). Total num frames: 73155584. Throughput: 0: 13244.8. Samples: 73139328. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:38:21,106][213445] Avg episode reward: [(0, '4334.860')] [2023-03-07 15:38:21,744][213771] Updated weights for policy 0, policy_version 71450 (0.0006) [2023-03-07 15:38:22,507][213771] Updated weights for policy 0, policy_version 71460 (0.0006) [2023-03-07 15:38:23,292][213771] Updated weights for policy 0, policy_version 71470 (0.0006) [2023-03-07 15:38:24,085][213771] Updated weights for policy 0, policy_version 71480 (0.0005) [2023-03-07 15:38:24,868][213771] Updated weights for policy 0, policy_version 71490 (0.0007) [2023-03-07 15:38:25,618][213771] Updated weights for policy 0, policy_version 71500 (0.0006) [2023-03-07 15:38:26,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13243.8, 300 sec: 13249.5). Total num frames: 73222144. Throughput: 0: 13244.8. Samples: 73218704. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:38:26,106][213445] Avg episode reward: [(0, '4315.317')] [2023-03-07 15:38:26,407][213771] Updated weights for policy 0, policy_version 71510 (0.0007) [2023-03-07 15:38:27,182][213771] Updated weights for policy 0, policy_version 71520 (0.0006) [2023-03-07 15:38:27,949][213771] Updated weights for policy 0, policy_version 71530 (0.0007) [2023-03-07 15:38:28,709][213771] Updated weights for policy 0, policy_version 71540 (0.0005) [2023-03-07 15:38:29,457][213771] Updated weights for policy 0, policy_version 71550 (0.0006) [2023-03-07 15:38:30,222][213771] Updated weights for policy 0, policy_version 71560 (0.0007) [2023-03-07 15:38:31,013][213771] Updated weights for policy 0, policy_version 71570 (0.0008) [2023-03-07 15:38:31,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.8, 300 sec: 13246.1). Total num frames: 73287680. Throughput: 0: 13247.0. Samples: 73258611. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:38:31,105][213445] Avg episode reward: [(0, '4251.978')] [2023-03-07 15:38:31,801][213771] Updated weights for policy 0, policy_version 71580 (0.0006) [2023-03-07 15:38:32,569][213771] Updated weights for policy 0, policy_version 71590 (0.0006) [2023-03-07 15:38:33,332][213771] Updated weights for policy 0, policy_version 71600 (0.0005) [2023-03-07 15:38:34,116][213771] Updated weights for policy 0, policy_version 71610 (0.0005) [2023-03-07 15:38:34,889][213771] Updated weights for policy 0, policy_version 71620 (0.0006) [2023-03-07 15:38:35,655][213771] Updated weights for policy 0, policy_version 71630 (0.0006) [2023-03-07 15:38:36,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13246.1). Total num frames: 73354240. Throughput: 0: 13249.4. Samples: 73338064. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:38:36,106][213445] Avg episode reward: [(0, '4162.595')] [2023-03-07 15:38:36,442][213771] Updated weights for policy 0, policy_version 71640 (0.0006) [2023-03-07 15:38:37,217][213771] Updated weights for policy 0, policy_version 71650 (0.0006) [2023-03-07 15:38:37,963][213771] Updated weights for policy 0, policy_version 71660 (0.0006) [2023-03-07 15:38:38,749][213771] Updated weights for policy 0, policy_version 71670 (0.0005) [2023-03-07 15:38:39,502][213771] Updated weights for policy 0, policy_version 71680 (0.0006) [2023-03-07 15:38:40,281][213771] Updated weights for policy 0, policy_version 71690 (0.0006) [2023-03-07 15:38:41,058][213771] Updated weights for policy 0, policy_version 71700 (0.0006) [2023-03-07 15:38:41,105][213445] Fps is (10 sec: 13311.7, 60 sec: 13260.8, 300 sec: 13246.0). Total num frames: 73420800. Throughput: 0: 13256.9. Samples: 73417891. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:38:41,106][213445] Avg episode reward: [(0, '4193.999')] [2023-03-07 15:38:41,818][213771] Updated weights for policy 0, policy_version 71710 (0.0006) [2023-03-07 15:38:42,588][213771] Updated weights for policy 0, policy_version 71720 (0.0006) [2023-03-07 15:38:43,371][213771] Updated weights for policy 0, policy_version 71730 (0.0006) [2023-03-07 15:38:44,129][213771] Updated weights for policy 0, policy_version 71740 (0.0006) [2023-03-07 15:38:44,913][213771] Updated weights for policy 0, policy_version 71750 (0.0006) [2023-03-07 15:38:45,684][213771] Updated weights for policy 0, policy_version 71760 (0.0006) [2023-03-07 15:38:46,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13243.8, 300 sec: 13246.1). Total num frames: 73487360. Throughput: 0: 13268.0. Samples: 73457880. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:38:46,106][213445] Avg episode reward: [(0, '4260.884')] [2023-03-07 15:38:46,437][213771] Updated weights for policy 0, policy_version 71770 (0.0006) [2023-03-07 15:38:47,210][213771] Updated weights for policy 0, policy_version 71780 (0.0006) [2023-03-07 15:38:48,002][213771] Updated weights for policy 0, policy_version 71790 (0.0006) [2023-03-07 15:38:48,764][213771] Updated weights for policy 0, policy_version 71800 (0.0006) [2023-03-07 15:38:49,534][213771] Updated weights for policy 0, policy_version 71810 (0.0006) [2023-03-07 15:38:50,314][213771] Updated weights for policy 0, policy_version 71820 (0.0006) [2023-03-07 15:38:51,080][213771] Updated weights for policy 0, policy_version 71830 (0.0006) [2023-03-07 15:38:51,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 73553920. Throughput: 0: 13267.0. Samples: 73537421. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:38:51,117][213445] Avg episode reward: [(0, '4251.376')] [2023-03-07 15:38:51,855][213771] Updated weights for policy 0, policy_version 71840 (0.0006) [2023-03-07 15:38:52,641][213771] Updated weights for policy 0, policy_version 71850 (0.0007) [2023-03-07 15:38:53,417][213771] Updated weights for policy 0, policy_version 71860 (0.0005) [2023-03-07 15:38:54,179][213771] Updated weights for policy 0, policy_version 71870 (0.0006) [2023-03-07 15:38:54,939][213771] Updated weights for policy 0, policy_version 71880 (0.0006) [2023-03-07 15:38:55,702][213771] Updated weights for policy 0, policy_version 71890 (0.0006) [2023-03-07 15:38:56,105][213445] Fps is (10 sec: 13311.7, 60 sec: 13260.7, 300 sec: 13249.5). Total num frames: 73620480. Throughput: 0: 13277.0. Samples: 73617194. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:38:56,106][213445] Avg episode reward: [(0, '4260.385')] [2023-03-07 15:38:56,474][213771] Updated weights for policy 0, policy_version 71900 (0.0007) [2023-03-07 15:38:57,249][213771] Updated weights for policy 0, policy_version 71910 (0.0006) [2023-03-07 15:38:58,017][213771] Updated weights for policy 0, policy_version 71920 (0.0006) [2023-03-07 15:38:58,772][213771] Updated weights for policy 0, policy_version 71930 (0.0007) [2023-03-07 15:38:59,548][213771] Updated weights for policy 0, policy_version 71940 (0.0007) [2023-03-07 15:39:00,336][213771] Updated weights for policy 0, policy_version 71950 (0.0006) [2023-03-07 15:39:01,102][213771] Updated weights for policy 0, policy_version 71960 (0.0006) [2023-03-07 15:39:01,105][213445] Fps is (10 sec: 13312.3, 60 sec: 13277.9, 300 sec: 13253.0). Total num frames: 73687040. Throughput: 0: 13285.2. Samples: 73657125. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:39:01,105][213445] Avg episode reward: [(0, '4213.081')] [2023-03-07 15:39:01,861][213771] Updated weights for policy 0, policy_version 71970 (0.0006) [2023-03-07 15:39:02,636][213771] Updated weights for policy 0, policy_version 71980 (0.0006) [2023-03-07 15:39:03,398][213771] Updated weights for policy 0, policy_version 71990 (0.0006) [2023-03-07 15:39:04,156][213771] Updated weights for policy 0, policy_version 72000 (0.0007) [2023-03-07 15:39:04,953][213771] Updated weights for policy 0, policy_version 72010 (0.0006) [2023-03-07 15:39:05,711][213771] Updated weights for policy 0, policy_version 72020 (0.0007) [2023-03-07 15:39:06,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 73752576. Throughput: 0: 13275.2. Samples: 73736714. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:39:06,106][213445] Avg episode reward: [(0, '4226.708')] [2023-03-07 15:39:06,464][213771] Updated weights for policy 0, policy_version 72030 (0.0006) [2023-03-07 15:39:07,263][213771] Updated weights for policy 0, policy_version 72040 (0.0006) [2023-03-07 15:39:08,011][213771] Updated weights for policy 0, policy_version 72050 (0.0006) [2023-03-07 15:39:08,802][213771] Updated weights for policy 0, policy_version 72060 (0.0006) [2023-03-07 15:39:09,568][213771] Updated weights for policy 0, policy_version 72070 (0.0005) [2023-03-07 15:39:10,342][213771] Updated weights for policy 0, policy_version 72080 (0.0006) [2023-03-07 15:39:11,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13277.9, 300 sec: 13249.5). Total num frames: 73819136. Throughput: 0: 13286.1. Samples: 73816580. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:39:11,106][213445] Avg episode reward: [(0, '4299.421')] [2023-03-07 15:39:11,110][213771] Updated weights for policy 0, policy_version 72090 (0.0006) [2023-03-07 15:39:11,880][213771] Updated weights for policy 0, policy_version 72100 (0.0006) [2023-03-07 15:39:12,652][213771] Updated weights for policy 0, policy_version 72110 (0.0006) [2023-03-07 15:39:13,427][213771] Updated weights for policy 0, policy_version 72120 (0.0007) [2023-03-07 15:39:14,210][213771] Updated weights for policy 0, policy_version 72130 (0.0006) [2023-03-07 15:39:14,960][213771] Updated weights for policy 0, policy_version 72140 (0.0007) [2023-03-07 15:39:15,746][213771] Updated weights for policy 0, policy_version 72150 (0.0006) [2023-03-07 15:39:16,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13277.9, 300 sec: 13253.0). Total num frames: 73885696. Throughput: 0: 13285.9. Samples: 73856478. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:39:16,106][213445] Avg episode reward: [(0, '4276.821')] [2023-03-07 15:39:16,529][213771] Updated weights for policy 0, policy_version 72160 (0.0006) [2023-03-07 15:39:17,282][213771] Updated weights for policy 0, policy_version 72170 (0.0006) [2023-03-07 15:39:18,042][213771] Updated weights for policy 0, policy_version 72180 (0.0006) [2023-03-07 15:39:18,834][213771] Updated weights for policy 0, policy_version 72190 (0.0006) [2023-03-07 15:39:19,599][213771] Updated weights for policy 0, policy_version 72200 (0.0007) [2023-03-07 15:39:20,394][213771] Updated weights for policy 0, policy_version 72210 (0.0007) [2023-03-07 15:39:21,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13277.9, 300 sec: 13253.0). Total num frames: 73952256. Throughput: 0: 13281.7. Samples: 73935740. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:39:21,106][213445] Avg episode reward: [(0, '4267.260')] [2023-03-07 15:39:21,158][213771] Updated weights for policy 0, policy_version 72220 (0.0006) [2023-03-07 15:39:21,939][213771] Updated weights for policy 0, policy_version 72230 (0.0005) [2023-03-07 15:39:22,716][213771] Updated weights for policy 0, policy_version 72240 (0.0007) [2023-03-07 15:39:23,485][213771] Updated weights for policy 0, policy_version 72250 (0.0006) [2023-03-07 15:39:24,265][213771] Updated weights for policy 0, policy_version 72260 (0.0006) [2023-03-07 15:39:25,037][213771] Updated weights for policy 0, policy_version 72270 (0.0006) [2023-03-07 15:39:25,791][213771] Updated weights for policy 0, policy_version 72280 (0.0005) [2023-03-07 15:39:26,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13277.9, 300 sec: 13253.0). Total num frames: 74018816. Throughput: 0: 13276.4. Samples: 74015327. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:39:26,106][213445] Avg episode reward: [(0, '4334.515')] [2023-03-07 15:39:26,562][213771] Updated weights for policy 0, policy_version 72290 (0.0006) [2023-03-07 15:39:27,350][213771] Updated weights for policy 0, policy_version 72300 (0.0006) [2023-03-07 15:39:28,114][213771] Updated weights for policy 0, policy_version 72310 (0.0006) [2023-03-07 15:39:28,890][213771] Updated weights for policy 0, policy_version 72320 (0.0007) [2023-03-07 15:39:29,656][213771] Updated weights for policy 0, policy_version 72330 (0.0006) [2023-03-07 15:39:30,439][213771] Updated weights for policy 0, policy_version 72340 (0.0006) [2023-03-07 15:39:31,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13277.8, 300 sec: 13249.5). Total num frames: 74084352. Throughput: 0: 13268.4. Samples: 74054958. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:39:31,106][213445] Avg episode reward: [(0, '4313.382')] [2023-03-07 15:39:31,208][213771] Updated weights for policy 0, policy_version 72350 (0.0005) [2023-03-07 15:39:31,974][213771] Updated weights for policy 0, policy_version 72360 (0.0006) [2023-03-07 15:39:32,754][213771] Updated weights for policy 0, policy_version 72370 (0.0006) [2023-03-07 15:39:33,514][213771] Updated weights for policy 0, policy_version 72380 (0.0006) [2023-03-07 15:39:34,285][213771] Updated weights for policy 0, policy_version 72390 (0.0007) [2023-03-07 15:39:35,067][213771] Updated weights for policy 0, policy_version 72400 (0.0006) [2023-03-07 15:39:35,831][213771] Updated weights for policy 0, policy_version 72410 (0.0006) [2023-03-07 15:39:36,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13277.9, 300 sec: 13249.5). Total num frames: 74150912. Throughput: 0: 13273.7. Samples: 74134738. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:39:36,106][213445] Avg episode reward: [(0, '4351.470')] [2023-03-07 15:39:36,598][213771] Updated weights for policy 0, policy_version 72420 (0.0006) [2023-03-07 15:39:37,375][213771] Updated weights for policy 0, policy_version 72430 (0.0006) [2023-03-07 15:39:38,130][213771] Updated weights for policy 0, policy_version 72440 (0.0006) [2023-03-07 15:39:38,910][213771] Updated weights for policy 0, policy_version 72450 (0.0006) [2023-03-07 15:39:39,672][213771] Updated weights for policy 0, policy_version 72460 (0.0006) [2023-03-07 15:39:40,441][213771] Updated weights for policy 0, policy_version 72470 (0.0006) [2023-03-07 15:39:41,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13277.9, 300 sec: 13253.0). Total num frames: 74217472. Throughput: 0: 13275.0. Samples: 74214567. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:39:41,106][213445] Avg episode reward: [(0, '4367.476')] [2023-03-07 15:39:41,209][213771] Updated weights for policy 0, policy_version 72480 (0.0006) [2023-03-07 15:39:41,982][213771] Updated weights for policy 0, policy_version 72490 (0.0006) [2023-03-07 15:39:42,747][213771] Updated weights for policy 0, policy_version 72500 (0.0007) [2023-03-07 15:39:43,521][213771] Updated weights for policy 0, policy_version 72510 (0.0007) [2023-03-07 15:39:44,284][213771] Updated weights for policy 0, policy_version 72520 (0.0006) [2023-03-07 15:39:45,055][213771] Updated weights for policy 0, policy_version 72530 (0.0006) [2023-03-07 15:39:45,826][213771] Updated weights for policy 0, policy_version 72540 (0.0006) [2023-03-07 15:39:46,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13277.9, 300 sec: 13253.0). Total num frames: 74284032. Throughput: 0: 13272.5. Samples: 74254390. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:39:46,106][213445] Avg episode reward: [(0, '4370.029')] [2023-03-07 15:39:46,613][213771] Updated weights for policy 0, policy_version 72550 (0.0007) [2023-03-07 15:39:47,381][213771] Updated weights for policy 0, policy_version 72560 (0.0006) [2023-03-07 15:39:48,146][213771] Updated weights for policy 0, policy_version 72570 (0.0006) [2023-03-07 15:39:48,925][213771] Updated weights for policy 0, policy_version 72580 (0.0006) [2023-03-07 15:39:49,688][213771] Updated weights for policy 0, policy_version 72590 (0.0006) [2023-03-07 15:39:50,464][213771] Updated weights for policy 0, policy_version 72600 (0.0006) [2023-03-07 15:39:51,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13277.9, 300 sec: 13256.5). Total num frames: 74350592. Throughput: 0: 13276.1. Samples: 74334135. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:39:51,106][213445] Avg episode reward: [(0, '4366.198')] [2023-03-07 15:39:51,210][213771] Updated weights for policy 0, policy_version 72610 (0.0006) [2023-03-07 15:39:51,997][213771] Updated weights for policy 0, policy_version 72620 (0.0006) [2023-03-07 15:39:52,761][213771] Updated weights for policy 0, policy_version 72630 (0.0006) [2023-03-07 15:39:53,540][213771] Updated weights for policy 0, policy_version 72640 (0.0006) [2023-03-07 15:39:54,322][213771] Updated weights for policy 0, policy_version 72650 (0.0007) [2023-03-07 15:39:55,088][213771] Updated weights for policy 0, policy_version 72660 (0.0006) [2023-03-07 15:39:55,877][213771] Updated weights for policy 0, policy_version 72670 (0.0006) [2023-03-07 15:39:56,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 74416128. Throughput: 0: 13266.1. Samples: 74413556. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:39:56,106][213445] Avg episode reward: [(0, '4379.101')] [2023-03-07 15:39:56,636][213771] Updated weights for policy 0, policy_version 72680 (0.0007) [2023-03-07 15:39:57,399][213771] Updated weights for policy 0, policy_version 72690 (0.0007) [2023-03-07 15:39:58,185][213771] Updated weights for policy 0, policy_version 72700 (0.0006) [2023-03-07 15:39:58,954][213771] Updated weights for policy 0, policy_version 72710 (0.0006) [2023-03-07 15:39:59,745][213771] Updated weights for policy 0, policy_version 72720 (0.0005) [2023-03-07 15:40:00,517][213771] Updated weights for policy 0, policy_version 72730 (0.0006) [2023-03-07 15:40:01,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 74482688. Throughput: 0: 13260.8. Samples: 74453213. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:40:01,106][213445] Avg episode reward: [(0, '4424.232')] [2023-03-07 15:40:01,285][213771] Updated weights for policy 0, policy_version 72740 (0.0006) [2023-03-07 15:40:02,070][213771] Updated weights for policy 0, policy_version 72750 (0.0006) [2023-03-07 15:40:02,841][213771] Updated weights for policy 0, policy_version 72760 (0.0007) [2023-03-07 15:40:03,614][213771] Updated weights for policy 0, policy_version 72770 (0.0005) [2023-03-07 15:40:04,381][213771] Updated weights for policy 0, policy_version 72780 (0.0005) [2023-03-07 15:40:05,137][213771] Updated weights for policy 0, policy_version 72790 (0.0006) [2023-03-07 15:40:05,913][213771] Updated weights for policy 0, policy_version 72800 (0.0005) [2023-03-07 15:40:06,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13277.9, 300 sec: 13253.0). Total num frames: 74549248. Throughput: 0: 13267.2. Samples: 74532763. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:40:06,106][213445] Avg episode reward: [(0, '4401.436')] [2023-03-07 15:40:06,111][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000072802_74549248.pth... [2023-03-07 15:40:06,142][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000069695_71367680.pth [2023-03-07 15:40:06,692][213771] Updated weights for policy 0, policy_version 72810 (0.0006) [2023-03-07 15:40:07,472][213771] Updated weights for policy 0, policy_version 72820 (0.0006) [2023-03-07 15:40:08,248][213771] Updated weights for policy 0, policy_version 72830 (0.0007) [2023-03-07 15:40:09,008][213771] Updated weights for policy 0, policy_version 72840 (0.0006) [2023-03-07 15:40:09,786][213771] Updated weights for policy 0, policy_version 72850 (0.0006) [2023-03-07 15:40:10,552][213771] Updated weights for policy 0, policy_version 72860 (0.0006) [2023-03-07 15:40:11,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 74614784. Throughput: 0: 13264.6. Samples: 74612232. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:40:11,105][213445] Avg episode reward: [(0, '4411.415')] [2023-03-07 15:40:11,314][213771] Updated weights for policy 0, policy_version 72870 (0.0006) [2023-03-07 15:40:12,102][213771] Updated weights for policy 0, policy_version 72880 (0.0006) [2023-03-07 15:40:12,872][213771] Updated weights for policy 0, policy_version 72890 (0.0007) [2023-03-07 15:40:13,656][213771] Updated weights for policy 0, policy_version 72900 (0.0005) [2023-03-07 15:40:14,444][213771] Updated weights for policy 0, policy_version 72910 (0.0005) [2023-03-07 15:40:15,201][213771] Updated weights for policy 0, policy_version 72920 (0.0006) [2023-03-07 15:40:15,962][213771] Updated weights for policy 0, policy_version 72930 (0.0007) [2023-03-07 15:40:16,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 74681344. Throughput: 0: 13265.3. Samples: 74651898. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:40:16,106][213445] Avg episode reward: [(0, '4426.860')] [2023-03-07 15:40:16,750][213771] Updated weights for policy 0, policy_version 72940 (0.0006) [2023-03-07 15:40:17,516][213771] Updated weights for policy 0, policy_version 72950 (0.0006) [2023-03-07 15:40:18,303][213771] Updated weights for policy 0, policy_version 72960 (0.0006) [2023-03-07 15:40:19,076][213771] Updated weights for policy 0, policy_version 72970 (0.0006) [2023-03-07 15:40:19,844][213771] Updated weights for policy 0, policy_version 72980 (0.0006) [2023-03-07 15:40:20,625][213771] Updated weights for policy 0, policy_version 72990 (0.0006) [2023-03-07 15:40:21,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 74747904. Throughput: 0: 13255.4. Samples: 74731230. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:40:21,106][213445] Avg episode reward: [(0, '4390.481')] [2023-03-07 15:40:21,404][213771] Updated weights for policy 0, policy_version 73000 (0.0005) [2023-03-07 15:40:22,157][213771] Updated weights for policy 0, policy_version 73010 (0.0005) [2023-03-07 15:40:22,939][213771] Updated weights for policy 0, policy_version 73020 (0.0007) [2023-03-07 15:40:23,705][213771] Updated weights for policy 0, policy_version 73030 (0.0006) [2023-03-07 15:40:24,466][213771] Updated weights for policy 0, policy_version 73040 (0.0006) [2023-03-07 15:40:25,242][213771] Updated weights for policy 0, policy_version 73050 (0.0006) [2023-03-07 15:40:26,026][213771] Updated weights for policy 0, policy_version 73060 (0.0006) [2023-03-07 15:40:26,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 74813440. Throughput: 0: 13254.6. Samples: 74811023. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:40:26,106][213445] Avg episode reward: [(0, '4392.647')] [2023-03-07 15:40:26,788][213771] Updated weights for policy 0, policy_version 73070 (0.0006) [2023-03-07 15:40:27,566][213771] Updated weights for policy 0, policy_version 73080 (0.0007) [2023-03-07 15:40:28,341][213771] Updated weights for policy 0, policy_version 73090 (0.0006) [2023-03-07 15:40:29,117][213771] Updated weights for policy 0, policy_version 73100 (0.0006) [2023-03-07 15:40:29,907][213771] Updated weights for policy 0, policy_version 73110 (0.0007) [2023-03-07 15:40:30,695][213771] Updated weights for policy 0, policy_version 73120 (0.0006) [2023-03-07 15:40:31,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 74880000. Throughput: 0: 13250.0. Samples: 74850640. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:40:31,116][213445] Avg episode reward: [(0, '4407.236')] [2023-03-07 15:40:31,460][213771] Updated weights for policy 0, policy_version 73130 (0.0007) [2023-03-07 15:40:32,245][213771] Updated weights for policy 0, policy_version 73140 (0.0008) [2023-03-07 15:40:33,014][213771] Updated weights for policy 0, policy_version 73150 (0.0006) [2023-03-07 15:40:33,797][213771] Updated weights for policy 0, policy_version 73160 (0.0006) [2023-03-07 15:40:34,572][213771] Updated weights for policy 0, policy_version 73170 (0.0006) [2023-03-07 15:40:35,334][213771] Updated weights for policy 0, policy_version 73180 (0.0005) [2023-03-07 15:40:36,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13243.8, 300 sec: 13249.5). Total num frames: 74945536. Throughput: 0: 13234.7. Samples: 74929694. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:40:36,113][213771] Updated weights for policy 0, policy_version 73190 (0.0006) [2023-03-07 15:40:36,116][213445] Avg episode reward: [(0, '4382.286')] [2023-03-07 15:40:36,887][213771] Updated weights for policy 0, policy_version 73200 (0.0006) [2023-03-07 15:40:37,654][213771] Updated weights for policy 0, policy_version 73210 (0.0006) [2023-03-07 15:40:38,422][213771] Updated weights for policy 0, policy_version 73220 (0.0005) [2023-03-07 15:40:39,215][213771] Updated weights for policy 0, policy_version 73230 (0.0006) [2023-03-07 15:40:39,982][213771] Updated weights for policy 0, policy_version 73240 (0.0006) [2023-03-07 15:40:40,757][213771] Updated weights for policy 0, policy_version 73250 (0.0006) [2023-03-07 15:40:41,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 75012096. Throughput: 0: 13233.1. Samples: 75009046. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:40:41,106][213445] Avg episode reward: [(0, '4406.586')] [2023-03-07 15:40:41,528][213771] Updated weights for policy 0, policy_version 73260 (0.0006) [2023-03-07 15:40:42,312][213771] Updated weights for policy 0, policy_version 73270 (0.0006) [2023-03-07 15:40:43,075][213771] Updated weights for policy 0, policy_version 73280 (0.0006) [2023-03-07 15:40:43,845][213771] Updated weights for policy 0, policy_version 73290 (0.0006) [2023-03-07 15:40:44,625][213771] Updated weights for policy 0, policy_version 73300 (0.0006) [2023-03-07 15:40:45,400][213771] Updated weights for policy 0, policy_version 73310 (0.0007) [2023-03-07 15:40:46,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13243.7, 300 sec: 13253.0). Total num frames: 75078656. Throughput: 0: 13236.4. Samples: 75048850. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:40:46,106][213445] Avg episode reward: [(0, '4400.145')] [2023-03-07 15:40:46,178][213771] Updated weights for policy 0, policy_version 73320 (0.0006) [2023-03-07 15:40:46,941][213771] Updated weights for policy 0, policy_version 73330 (0.0006) [2023-03-07 15:40:47,717][213771] Updated weights for policy 0, policy_version 73340 (0.0006) [2023-03-07 15:40:48,494][213771] Updated weights for policy 0, policy_version 73350 (0.0006) [2023-03-07 15:40:49,280][213771] Updated weights for policy 0, policy_version 73360 (0.0007) [2023-03-07 15:40:50,041][213771] Updated weights for policy 0, policy_version 73370 (0.0006) [2023-03-07 15:40:50,805][213771] Updated weights for policy 0, policy_version 73380 (0.0006) [2023-03-07 15:40:51,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13249.5). Total num frames: 75144192. Throughput: 0: 13228.5. Samples: 75128045. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:40:51,106][213445] Avg episode reward: [(0, '4411.505')] [2023-03-07 15:40:51,584][213771] Updated weights for policy 0, policy_version 73390 (0.0006) [2023-03-07 15:40:52,347][213771] Updated weights for policy 0, policy_version 73400 (0.0006) [2023-03-07 15:40:53,104][213771] Updated weights for policy 0, policy_version 73410 (0.0005) [2023-03-07 15:40:53,901][213771] Updated weights for policy 0, policy_version 73420 (0.0005) [2023-03-07 15:40:54,669][213771] Updated weights for policy 0, policy_version 73430 (0.0005) [2023-03-07 15:40:55,438][213771] Updated weights for policy 0, policy_version 73440 (0.0007) [2023-03-07 15:40:56,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13253.0). Total num frames: 75210752. Throughput: 0: 13233.6. Samples: 75207745. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:40:56,106][213445] Avg episode reward: [(0, '4399.662')] [2023-03-07 15:40:56,195][213771] Updated weights for policy 0, policy_version 73450 (0.0006) [2023-03-07 15:40:56,984][213771] Updated weights for policy 0, policy_version 73460 (0.0006) [2023-03-07 15:40:57,756][213771] Updated weights for policy 0, policy_version 73470 (0.0006) [2023-03-07 15:40:58,534][213771] Updated weights for policy 0, policy_version 73480 (0.0006) [2023-03-07 15:40:59,308][213771] Updated weights for policy 0, policy_version 73490 (0.0006) [2023-03-07 15:41:00,069][213771] Updated weights for policy 0, policy_version 73500 (0.0006) [2023-03-07 15:41:00,840][213771] Updated weights for policy 0, policy_version 73510 (0.0005) [2023-03-07 15:41:01,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13243.8, 300 sec: 13253.0). Total num frames: 75277312. Throughput: 0: 13233.2. Samples: 75247392. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:41:01,106][213445] Avg episode reward: [(0, '4349.077')] [2023-03-07 15:41:01,613][213771] Updated weights for policy 0, policy_version 73520 (0.0006) [2023-03-07 15:41:02,379][213771] Updated weights for policy 0, policy_version 73530 (0.0006) [2023-03-07 15:41:03,161][213771] Updated weights for policy 0, policy_version 73540 (0.0006) [2023-03-07 15:41:03,917][213771] Updated weights for policy 0, policy_version 73550 (0.0006) [2023-03-07 15:41:04,681][213771] Updated weights for policy 0, policy_version 73560 (0.0007) [2023-03-07 15:41:05,462][213771] Updated weights for policy 0, policy_version 73570 (0.0006) [2023-03-07 15:41:06,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13243.7, 300 sec: 13253.0). Total num frames: 75343872. Throughput: 0: 13247.0. Samples: 75327347. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:41:06,106][213445] Avg episode reward: [(0, '4394.664')] [2023-03-07 15:41:06,246][213771] Updated weights for policy 0, policy_version 73580 (0.0005) [2023-03-07 15:41:07,017][213771] Updated weights for policy 0, policy_version 73590 (0.0007) [2023-03-07 15:41:07,782][213771] Updated weights for policy 0, policy_version 73600 (0.0006) [2023-03-07 15:41:08,573][213771] Updated weights for policy 0, policy_version 73610 (0.0006) [2023-03-07 15:41:09,343][213771] Updated weights for policy 0, policy_version 73620 (0.0007) [2023-03-07 15:41:10,102][213771] Updated weights for policy 0, policy_version 73630 (0.0005) [2023-03-07 15:41:10,882][213771] Updated weights for policy 0, policy_version 73640 (0.0006) [2023-03-07 15:41:11,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 75410432. Throughput: 0: 13238.0. Samples: 75406729. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:41:11,105][213445] Avg episode reward: [(0, '4377.414')] [2023-03-07 15:41:11,645][213771] Updated weights for policy 0, policy_version 73650 (0.0007) [2023-03-07 15:41:12,410][213771] Updated weights for policy 0, policy_version 73660 (0.0006) [2023-03-07 15:41:13,173][213771] Updated weights for policy 0, policy_version 73670 (0.0006) [2023-03-07 15:41:13,939][213771] Updated weights for policy 0, policy_version 73680 (0.0007) [2023-03-07 15:41:14,705][213771] Updated weights for policy 0, policy_version 73690 (0.0006) [2023-03-07 15:41:15,482][213771] Updated weights for policy 0, policy_version 73700 (0.0006) [2023-03-07 15:41:16,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13253.0). Total num frames: 75475968. Throughput: 0: 13250.0. Samples: 75446892. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:41:16,106][213445] Avg episode reward: [(0, '4400.183')] [2023-03-07 15:41:16,265][213771] Updated weights for policy 0, policy_version 73710 (0.0006) [2023-03-07 15:41:17,021][213771] Updated weights for policy 0, policy_version 73720 (0.0007) [2023-03-07 15:41:17,816][213771] Updated weights for policy 0, policy_version 73730 (0.0006) [2023-03-07 15:41:18,594][213771] Updated weights for policy 0, policy_version 73740 (0.0005) [2023-03-07 15:41:19,370][213771] Updated weights for policy 0, policy_version 73750 (0.0006) [2023-03-07 15:41:20,154][213771] Updated weights for policy 0, policy_version 73760 (0.0005) [2023-03-07 15:41:20,925][213771] Updated weights for policy 0, policy_version 73770 (0.0007) [2023-03-07 15:41:21,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13253.0). Total num frames: 75542528. Throughput: 0: 13257.1. Samples: 75526265. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:41:21,106][213445] Avg episode reward: [(0, '4411.482')] [2023-03-07 15:41:21,678][213771] Updated weights for policy 0, policy_version 73780 (0.0006) [2023-03-07 15:41:22,461][213771] Updated weights for policy 0, policy_version 73790 (0.0006) [2023-03-07 15:41:23,220][213771] Updated weights for policy 0, policy_version 73800 (0.0006) [2023-03-07 15:41:23,997][213771] Updated weights for policy 0, policy_version 73810 (0.0005) [2023-03-07 15:41:24,754][213771] Updated weights for policy 0, policy_version 73820 (0.0005) [2023-03-07 15:41:25,517][213771] Updated weights for policy 0, policy_version 73830 (0.0007) [2023-03-07 15:41:26,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13256.5). Total num frames: 75609088. Throughput: 0: 13266.1. Samples: 75606021. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:41:26,106][213445] Avg episode reward: [(0, '4409.449')] [2023-03-07 15:41:26,289][213771] Updated weights for policy 0, policy_version 73840 (0.0007) [2023-03-07 15:41:27,053][213771] Updated weights for policy 0, policy_version 73850 (0.0006) [2023-03-07 15:41:27,834][213771] Updated weights for policy 0, policy_version 73860 (0.0006) [2023-03-07 15:41:28,602][213771] Updated weights for policy 0, policy_version 73870 (0.0007) [2023-03-07 15:41:29,380][213771] Updated weights for policy 0, policy_version 73880 (0.0006) [2023-03-07 15:41:30,156][213771] Updated weights for policy 0, policy_version 73890 (0.0006) [2023-03-07 15:41:30,929][213771] Updated weights for policy 0, policy_version 73900 (0.0006) [2023-03-07 15:41:31,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13256.5). Total num frames: 75675648. Throughput: 0: 13268.1. Samples: 75645915. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:41:31,106][213445] Avg episode reward: [(0, '4389.360')] [2023-03-07 15:41:31,699][213771] Updated weights for policy 0, policy_version 73910 (0.0006) [2023-03-07 15:41:32,497][213771] Updated weights for policy 0, policy_version 73920 (0.0006) [2023-03-07 15:41:33,253][213771] Updated weights for policy 0, policy_version 73930 (0.0007) [2023-03-07 15:41:34,047][213771] Updated weights for policy 0, policy_version 73940 (0.0006) [2023-03-07 15:41:34,814][213771] Updated weights for policy 0, policy_version 73950 (0.0006) [2023-03-07 15:41:35,598][213771] Updated weights for policy 0, policy_version 73960 (0.0006) [2023-03-07 15:41:36,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 75741184. Throughput: 0: 13266.9. Samples: 75725055. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:41:36,106][213445] Avg episode reward: [(0, '4407.246')] [2023-03-07 15:41:36,365][213771] Updated weights for policy 0, policy_version 73970 (0.0006) [2023-03-07 15:41:37,127][213771] Updated weights for policy 0, policy_version 73980 (0.0006) [2023-03-07 15:41:37,900][213771] Updated weights for policy 0, policy_version 73990 (0.0006) [2023-03-07 15:41:38,667][213771] Updated weights for policy 0, policy_version 74000 (0.0006) [2023-03-07 15:41:39,452][213771] Updated weights for policy 0, policy_version 74010 (0.0006) [2023-03-07 15:41:40,206][213771] Updated weights for policy 0, policy_version 74020 (0.0006) [2023-03-07 15:41:40,986][213771] Updated weights for policy 0, policy_version 74030 (0.0007) [2023-03-07 15:41:41,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13256.5). Total num frames: 75807744. Throughput: 0: 13266.8. Samples: 75804753. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:41:41,106][213445] Avg episode reward: [(0, '4378.101')] [2023-03-07 15:41:41,767][213771] Updated weights for policy 0, policy_version 74040 (0.0006) [2023-03-07 15:41:42,506][213771] Updated weights for policy 0, policy_version 74050 (0.0005) [2023-03-07 15:41:43,294][213771] Updated weights for policy 0, policy_version 74060 (0.0006) [2023-03-07 15:41:44,058][213771] Updated weights for policy 0, policy_version 74070 (0.0006) [2023-03-07 15:41:44,842][213771] Updated weights for policy 0, policy_version 74080 (0.0006) [2023-03-07 15:41:45,604][213771] Updated weights for policy 0, policy_version 74090 (0.0006) [2023-03-07 15:41:46,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13256.5). Total num frames: 75874304. Throughput: 0: 13273.0. Samples: 75844677. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:41:46,116][213445] Avg episode reward: [(0, '4356.554')] [2023-03-07 15:41:46,374][213771] Updated weights for policy 0, policy_version 74100 (0.0007) [2023-03-07 15:41:47,154][213771] Updated weights for policy 0, policy_version 74110 (0.0006) [2023-03-07 15:41:47,925][213771] Updated weights for policy 0, policy_version 74120 (0.0006) [2023-03-07 15:41:48,706][213771] Updated weights for policy 0, policy_version 74130 (0.0006) [2023-03-07 15:41:49,474][213771] Updated weights for policy 0, policy_version 74140 (0.0006) [2023-03-07 15:41:50,233][213771] Updated weights for policy 0, policy_version 74150 (0.0006) [2023-03-07 15:41:51,007][213771] Updated weights for policy 0, policy_version 74160 (0.0006) [2023-03-07 15:41:51,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13277.9, 300 sec: 13256.5). Total num frames: 75940864. Throughput: 0: 13265.9. Samples: 75924311. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:41:51,116][213445] Avg episode reward: [(0, '4330.111')] [2023-03-07 15:41:51,769][213771] Updated weights for policy 0, policy_version 74170 (0.0006) [2023-03-07 15:41:52,546][213771] Updated weights for policy 0, policy_version 74180 (0.0006) [2023-03-07 15:41:53,325][213771] Updated weights for policy 0, policy_version 74190 (0.0007) [2023-03-07 15:41:54,092][213771] Updated weights for policy 0, policy_version 74200 (0.0006) [2023-03-07 15:41:54,860][213771] Updated weights for policy 0, policy_version 74210 (0.0006) [2023-03-07 15:41:55,636][213771] Updated weights for policy 0, policy_version 74220 (0.0007) [2023-03-07 15:41:56,105][213445] Fps is (10 sec: 13311.7, 60 sec: 13277.8, 300 sec: 13256.5). Total num frames: 76007424. Throughput: 0: 13274.5. Samples: 76004086. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:41:56,116][213445] Avg episode reward: [(0, '4260.646')] [2023-03-07 15:41:56,386][213771] Updated weights for policy 0, policy_version 74230 (0.0006) [2023-03-07 15:41:57,188][213771] Updated weights for policy 0, policy_version 74240 (0.0006) [2023-03-07 15:41:57,951][213771] Updated weights for policy 0, policy_version 74250 (0.0006) [2023-03-07 15:41:58,712][213771] Updated weights for policy 0, policy_version 74260 (0.0008) [2023-03-07 15:41:59,489][213771] Updated weights for policy 0, policy_version 74270 (0.0006) [2023-03-07 15:42:00,263][213771] Updated weights for policy 0, policy_version 74280 (0.0006) [2023-03-07 15:42:01,035][213771] Updated weights for policy 0, policy_version 74290 (0.0007) [2023-03-07 15:42:01,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13277.9, 300 sec: 13259.9). Total num frames: 76073984. Throughput: 0: 13267.6. Samples: 76043932. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:42:01,116][213445] Avg episode reward: [(0, '4305.546')] [2023-03-07 15:42:01,793][213771] Updated weights for policy 0, policy_version 74300 (0.0006) [2023-03-07 15:42:02,567][213771] Updated weights for policy 0, policy_version 74310 (0.0005) [2023-03-07 15:42:03,355][213771] Updated weights for policy 0, policy_version 74320 (0.0007) [2023-03-07 15:42:04,121][213771] Updated weights for policy 0, policy_version 74330 (0.0006) [2023-03-07 15:42:04,884][213771] Updated weights for policy 0, policy_version 74340 (0.0005) [2023-03-07 15:42:05,654][213771] Updated weights for policy 0, policy_version 74350 (0.0006) [2023-03-07 15:42:06,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13256.4). Total num frames: 76139520. Throughput: 0: 13268.4. Samples: 76123345. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:42:06,117][213445] Avg episode reward: [(0, '4375.922')] [2023-03-07 15:42:06,122][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000074356_76140544.pth... [2023-03-07 15:42:06,152][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000071247_72956928.pth [2023-03-07 15:42:06,430][213771] Updated weights for policy 0, policy_version 74360 (0.0006) [2023-03-07 15:42:07,197][213771] Updated weights for policy 0, policy_version 74370 (0.0007) [2023-03-07 15:42:07,955][213771] Updated weights for policy 0, policy_version 74380 (0.0008) [2023-03-07 15:42:08,730][213771] Updated weights for policy 0, policy_version 74390 (0.0006) [2023-03-07 15:42:09,492][213771] Updated weights for policy 0, policy_version 74400 (0.0006) [2023-03-07 15:42:10,282][213771] Updated weights for policy 0, policy_version 74410 (0.0005) [2023-03-07 15:42:11,050][213771] Updated weights for policy 0, policy_version 74420 (0.0006) [2023-03-07 15:42:11,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13256.5). Total num frames: 76206080. Throughput: 0: 13273.1. Samples: 76203309. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:42:11,116][213445] Avg episode reward: [(0, '4314.987')] [2023-03-07 15:42:11,833][213771] Updated weights for policy 0, policy_version 74430 (0.0006) [2023-03-07 15:42:12,597][213771] Updated weights for policy 0, policy_version 74440 (0.0006) [2023-03-07 15:42:13,370][213771] Updated weights for policy 0, policy_version 74450 (0.0007) [2023-03-07 15:42:14,146][213771] Updated weights for policy 0, policy_version 74460 (0.0006) [2023-03-07 15:42:14,915][213771] Updated weights for policy 0, policy_version 74470 (0.0006) [2023-03-07 15:42:15,691][213771] Updated weights for policy 0, policy_version 74480 (0.0006) [2023-03-07 15:42:16,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13277.8, 300 sec: 13259.9). Total num frames: 76272640. Throughput: 0: 13267.0. Samples: 76242930. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:42:16,115][213445] Avg episode reward: [(0, '4353.123')] [2023-03-07 15:42:16,456][213771] Updated weights for policy 0, policy_version 74490 (0.0008) [2023-03-07 15:42:17,242][213771] Updated weights for policy 0, policy_version 74500 (0.0005) [2023-03-07 15:42:18,026][213771] Updated weights for policy 0, policy_version 74510 (0.0006) [2023-03-07 15:42:18,796][213771] Updated weights for policy 0, policy_version 74520 (0.0007) [2023-03-07 15:42:19,545][213771] Updated weights for policy 0, policy_version 74530 (0.0007) [2023-03-07 15:42:20,313][213771] Updated weights for policy 0, policy_version 74540 (0.0006) [2023-03-07 15:42:21,090][213771] Updated weights for policy 0, policy_version 74550 (0.0006) [2023-03-07 15:42:21,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13277.8, 300 sec: 13259.9). Total num frames: 76339200. Throughput: 0: 13276.1. Samples: 76322481. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:42:21,116][213445] Avg episode reward: [(0, '4374.213')] [2023-03-07 15:42:21,868][213771] Updated weights for policy 0, policy_version 74560 (0.0006) [2023-03-07 15:42:22,631][213771] Updated weights for policy 0, policy_version 74570 (0.0006) [2023-03-07 15:42:23,410][213771] Updated weights for policy 0, policy_version 74580 (0.0006) [2023-03-07 15:42:24,192][213771] Updated weights for policy 0, policy_version 74590 (0.0006) [2023-03-07 15:42:24,972][213771] Updated weights for policy 0, policy_version 74600 (0.0006) [2023-03-07 15:42:25,742][213771] Updated weights for policy 0, policy_version 74610 (0.0007) [2023-03-07 15:42:26,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13260.8, 300 sec: 13259.9). Total num frames: 76404736. Throughput: 0: 13270.2. Samples: 76401910. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:42:26,116][213445] Avg episode reward: [(0, '4407.761')] [2023-03-07 15:42:26,507][213771] Updated weights for policy 0, policy_version 74620 (0.0006) [2023-03-07 15:42:27,287][213771] Updated weights for policy 0, policy_version 74630 (0.0006) [2023-03-07 15:42:28,058][213771] Updated weights for policy 0, policy_version 74640 (0.0005) [2023-03-07 15:42:28,835][213771] Updated weights for policy 0, policy_version 74650 (0.0005) [2023-03-07 15:42:29,613][213771] Updated weights for policy 0, policy_version 74660 (0.0006) [2023-03-07 15:42:30,378][213771] Updated weights for policy 0, policy_version 74670 (0.0006) [2023-03-07 15:42:31,105][213445] Fps is (10 sec: 13209.9, 60 sec: 13260.8, 300 sec: 13259.9). Total num frames: 76471296. Throughput: 0: 13265.6. Samples: 76441629. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:42:31,105][213445] Avg episode reward: [(0, '4362.079')] [2023-03-07 15:42:31,160][213771] Updated weights for policy 0, policy_version 74680 (0.0006) [2023-03-07 15:42:31,944][213771] Updated weights for policy 0, policy_version 74690 (0.0006) [2023-03-07 15:42:32,710][213771] Updated weights for policy 0, policy_version 74700 (0.0007) [2023-03-07 15:42:33,496][213771] Updated weights for policy 0, policy_version 74710 (0.0007) [2023-03-07 15:42:34,260][213771] Updated weights for policy 0, policy_version 74720 (0.0007) [2023-03-07 15:42:35,041][213771] Updated weights for policy 0, policy_version 74730 (0.0006) [2023-03-07 15:42:35,798][213771] Updated weights for policy 0, policy_version 74740 (0.0006) [2023-03-07 15:42:36,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13259.9). Total num frames: 76536832. Throughput: 0: 13255.7. Samples: 76520818. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:42:36,106][213445] Avg episode reward: [(0, '4410.997')] [2023-03-07 15:42:36,583][213771] Updated weights for policy 0, policy_version 74750 (0.0006) [2023-03-07 15:42:37,348][213771] Updated weights for policy 0, policy_version 74760 (0.0006) [2023-03-07 15:42:38,130][213771] Updated weights for policy 0, policy_version 74770 (0.0006) [2023-03-07 15:42:38,903][213771] Updated weights for policy 0, policy_version 74780 (0.0008) [2023-03-07 15:42:39,689][213771] Updated weights for policy 0, policy_version 74790 (0.0006) [2023-03-07 15:42:40,459][213771] Updated weights for policy 0, policy_version 74800 (0.0007) [2023-03-07 15:42:41,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13256.5). Total num frames: 76603392. Throughput: 0: 13245.2. Samples: 76600119. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:42:41,106][213445] Avg episode reward: [(0, '4318.907')] [2023-03-07 15:42:41,231][213771] Updated weights for policy 0, policy_version 74810 (0.0006) [2023-03-07 15:42:42,021][213771] Updated weights for policy 0, policy_version 74820 (0.0006) [2023-03-07 15:42:42,801][213771] Updated weights for policy 0, policy_version 74830 (0.0005) [2023-03-07 15:42:43,566][213771] Updated weights for policy 0, policy_version 74840 (0.0006) [2023-03-07 15:42:44,335][213771] Updated weights for policy 0, policy_version 74850 (0.0006) [2023-03-07 15:42:45,116][213771] Updated weights for policy 0, policy_version 74860 (0.0007) [2023-03-07 15:42:45,873][213771] Updated weights for policy 0, policy_version 74870 (0.0006) [2023-03-07 15:42:46,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13256.5). Total num frames: 76668928. Throughput: 0: 13242.5. Samples: 76639845. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:42:46,106][213445] Avg episode reward: [(0, '4377.200')] [2023-03-07 15:42:46,631][213771] Updated weights for policy 0, policy_version 74880 (0.0006) [2023-03-07 15:42:47,420][213771] Updated weights for policy 0, policy_version 74890 (0.0006) [2023-03-07 15:42:48,176][213771] Updated weights for policy 0, policy_version 74900 (0.0006) [2023-03-07 15:42:48,949][213771] Updated weights for policy 0, policy_version 74910 (0.0006) [2023-03-07 15:42:49,733][213771] Updated weights for policy 0, policy_version 74920 (0.0006) [2023-03-07 15:42:50,508][213771] Updated weights for policy 0, policy_version 74930 (0.0006) [2023-03-07 15:42:51,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13256.5). Total num frames: 76735488. Throughput: 0: 13250.3. Samples: 76719604. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:42:51,105][213445] Avg episode reward: [(0, '4298.942')] [2023-03-07 15:42:51,282][213771] Updated weights for policy 0, policy_version 74940 (0.0005) [2023-03-07 15:42:52,060][213771] Updated weights for policy 0, policy_version 74950 (0.0005) [2023-03-07 15:42:52,826][213771] Updated weights for policy 0, policy_version 74960 (0.0006) [2023-03-07 15:42:53,595][213771] Updated weights for policy 0, policy_version 74970 (0.0006) [2023-03-07 15:42:54,375][213771] Updated weights for policy 0, policy_version 74980 (0.0006) [2023-03-07 15:42:55,150][213771] Updated weights for policy 0, policy_version 74990 (0.0006) [2023-03-07 15:42:55,933][213771] Updated weights for policy 0, policy_version 75000 (0.0006) [2023-03-07 15:42:56,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13243.7, 300 sec: 13259.9). Total num frames: 76802048. Throughput: 0: 13234.2. Samples: 76798851. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:42:56,106][213445] Avg episode reward: [(0, '4301.001')] [2023-03-07 15:42:56,723][213771] Updated weights for policy 0, policy_version 75010 (0.0005) [2023-03-07 15:42:57,488][213771] Updated weights for policy 0, policy_version 75020 (0.0005) [2023-03-07 15:42:58,270][213771] Updated weights for policy 0, policy_version 75030 (0.0006) [2023-03-07 15:42:59,049][213771] Updated weights for policy 0, policy_version 75040 (0.0005) [2023-03-07 15:42:59,825][213771] Updated weights for policy 0, policy_version 75050 (0.0006) [2023-03-07 15:43:00,598][213771] Updated weights for policy 0, policy_version 75060 (0.0006) [2023-03-07 15:43:01,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13226.6, 300 sec: 13256.5). Total num frames: 76867584. Throughput: 0: 13229.8. Samples: 76838269. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:43:01,106][213445] Avg episode reward: [(0, '4292.919')] [2023-03-07 15:43:01,354][213771] Updated weights for policy 0, policy_version 75070 (0.0006) [2023-03-07 15:43:02,146][213771] Updated weights for policy 0, policy_version 75080 (0.0006) [2023-03-07 15:43:02,904][213771] Updated weights for policy 0, policy_version 75090 (0.0006) [2023-03-07 15:43:03,683][213771] Updated weights for policy 0, policy_version 75100 (0.0006) [2023-03-07 15:43:04,461][213771] Updated weights for policy 0, policy_version 75110 (0.0006) [2023-03-07 15:43:05,241][213771] Updated weights for policy 0, policy_version 75120 (0.0006) [2023-03-07 15:43:05,998][213771] Updated weights for policy 0, policy_version 75130 (0.0006) [2023-03-07 15:43:06,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.8, 300 sec: 13259.9). Total num frames: 76934144. Throughput: 0: 13230.1. Samples: 76917835. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:43:06,106][213445] Avg episode reward: [(0, '4325.344')] [2023-03-07 15:43:06,772][213771] Updated weights for policy 0, policy_version 75140 (0.0006) [2023-03-07 15:43:07,526][213771] Updated weights for policy 0, policy_version 75150 (0.0006) [2023-03-07 15:43:08,309][213771] Updated weights for policy 0, policy_version 75160 (0.0005) [2023-03-07 15:43:09,087][213771] Updated weights for policy 0, policy_version 75170 (0.0006) [2023-03-07 15:43:09,853][213771] Updated weights for policy 0, policy_version 75180 (0.0006) [2023-03-07 15:43:10,628][213771] Updated weights for policy 0, policy_version 75190 (0.0006) [2023-03-07 15:43:11,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13243.7, 300 sec: 13259.9). Total num frames: 77000704. Throughput: 0: 13229.8. Samples: 76997254. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:43:11,106][213445] Avg episode reward: [(0, '4294.886')] [2023-03-07 15:43:11,392][213771] Updated weights for policy 0, policy_version 75200 (0.0006) [2023-03-07 15:43:12,175][213771] Updated weights for policy 0, policy_version 75210 (0.0007) [2023-03-07 15:43:12,935][213771] Updated weights for policy 0, policy_version 75220 (0.0005) [2023-03-07 15:43:13,718][213771] Updated weights for policy 0, policy_version 75230 (0.0005) [2023-03-07 15:43:14,472][213771] Updated weights for policy 0, policy_version 75240 (0.0005) [2023-03-07 15:43:15,256][213771] Updated weights for policy 0, policy_version 75250 (0.0006) [2023-03-07 15:43:16,026][213771] Updated weights for policy 0, policy_version 75260 (0.0006) [2023-03-07 15:43:16,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13226.7, 300 sec: 13256.5). Total num frames: 77066240. Throughput: 0: 13233.6. Samples: 77037140. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:43:16,106][213445] Avg episode reward: [(0, '4316.688')] [2023-03-07 15:43:16,809][213771] Updated weights for policy 0, policy_version 75270 (0.0005) [2023-03-07 15:43:17,574][213771] Updated weights for policy 0, policy_version 75280 (0.0007) [2023-03-07 15:43:18,341][213771] Updated weights for policy 0, policy_version 75290 (0.0006) [2023-03-07 15:43:19,118][213771] Updated weights for policy 0, policy_version 75300 (0.0006) [2023-03-07 15:43:19,897][213771] Updated weights for policy 0, policy_version 75310 (0.0005) [2023-03-07 15:43:20,659][213771] Updated weights for policy 0, policy_version 75320 (0.0007) [2023-03-07 15:43:21,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13226.7, 300 sec: 13256.5). Total num frames: 77132800. Throughput: 0: 13238.4. Samples: 77116543. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:43:21,116][213445] Avg episode reward: [(0, '4159.987')] [2023-03-07 15:43:21,447][213771] Updated weights for policy 0, policy_version 75330 (0.0006) [2023-03-07 15:43:22,226][213771] Updated weights for policy 0, policy_version 75340 (0.0006) [2023-03-07 15:43:22,991][213771] Updated weights for policy 0, policy_version 75350 (0.0007) [2023-03-07 15:43:23,769][213771] Updated weights for policy 0, policy_version 75360 (0.0007) [2023-03-07 15:43:24,542][213771] Updated weights for policy 0, policy_version 75370 (0.0006) [2023-03-07 15:43:25,284][213771] Updated weights for policy 0, policy_version 75380 (0.0006) [2023-03-07 15:43:26,053][213771] Updated weights for policy 0, policy_version 75390 (0.0006) [2023-03-07 15:43:26,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13243.8, 300 sec: 13259.9). Total num frames: 77199360. Throughput: 0: 13250.3. Samples: 77196380. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:43:26,116][213445] Avg episode reward: [(0, '4225.440')] [2023-03-07 15:43:26,838][213771] Updated weights for policy 0, policy_version 75400 (0.0006) [2023-03-07 15:43:27,598][213771] Updated weights for policy 0, policy_version 75410 (0.0005) [2023-03-07 15:43:28,366][213771] Updated weights for policy 0, policy_version 75420 (0.0007) [2023-03-07 15:43:29,153][213771] Updated weights for policy 0, policy_version 75430 (0.0006) [2023-03-07 15:43:29,930][213771] Updated weights for policy 0, policy_version 75440 (0.0006) [2023-03-07 15:43:30,698][213771] Updated weights for policy 0, policy_version 75450 (0.0006) [2023-03-07 15:43:31,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13243.7, 300 sec: 13259.9). Total num frames: 77265920. Throughput: 0: 13252.3. Samples: 77236199. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 15:43:31,116][213445] Avg episode reward: [(0, '4230.984')] [2023-03-07 15:43:31,478][213771] Updated weights for policy 0, policy_version 75460 (0.0006) [2023-03-07 15:43:32,231][213771] Updated weights for policy 0, policy_version 75470 (0.0006) [2023-03-07 15:43:32,992][213771] Updated weights for policy 0, policy_version 75480 (0.0006) [2023-03-07 15:43:33,780][213771] Updated weights for policy 0, policy_version 75490 (0.0006) [2023-03-07 15:43:34,552][213771] Updated weights for policy 0, policy_version 75500 (0.0006) [2023-03-07 15:43:35,308][213771] Updated weights for policy 0, policy_version 75510 (0.0006) [2023-03-07 15:43:36,077][213771] Updated weights for policy 0, policy_version 75520 (0.0006) [2023-03-07 15:43:36,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13259.9). Total num frames: 77332480. Throughput: 0: 13251.8. Samples: 77315934. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 15:43:36,116][213445] Avg episode reward: [(0, '4297.789')] [2023-03-07 15:43:36,859][213771] Updated weights for policy 0, policy_version 75530 (0.0006) [2023-03-07 15:43:37,645][213771] Updated weights for policy 0, policy_version 75540 (0.0006) [2023-03-07 15:43:38,426][213771] Updated weights for policy 0, policy_version 75550 (0.0006) [2023-03-07 15:43:39,202][213771] Updated weights for policy 0, policy_version 75560 (0.0006) [2023-03-07 15:43:39,958][213771] Updated weights for policy 0, policy_version 75570 (0.0006) [2023-03-07 15:43:40,740][213771] Updated weights for policy 0, policy_version 75580 (0.0006) [2023-03-07 15:43:41,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13256.5). Total num frames: 77398016. Throughput: 0: 13249.8. Samples: 77395089. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 15:43:41,116][213445] Avg episode reward: [(0, '4334.354')] [2023-03-07 15:43:41,517][213771] Updated weights for policy 0, policy_version 75590 (0.0006) [2023-03-07 15:43:42,292][213771] Updated weights for policy 0, policy_version 75600 (0.0007) [2023-03-07 15:43:43,081][213771] Updated weights for policy 0, policy_version 75610 (0.0005) [2023-03-07 15:43:43,858][213771] Updated weights for policy 0, policy_version 75620 (0.0006) [2023-03-07 15:43:44,634][213771] Updated weights for policy 0, policy_version 75630 (0.0007) [2023-03-07 15:43:45,409][213771] Updated weights for policy 0, policy_version 75640 (0.0005) [2023-03-07 15:43:46,105][213445] Fps is (10 sec: 13107.1, 60 sec: 13243.7, 300 sec: 13253.0). Total num frames: 77463552. Throughput: 0: 13251.4. Samples: 77434582. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 15:43:46,116][213445] Avg episode reward: [(0, '4330.973')] [2023-03-07 15:43:46,194][213771] Updated weights for policy 0, policy_version 75650 (0.0007) [2023-03-07 15:43:46,964][213771] Updated weights for policy 0, policy_version 75660 (0.0006) [2023-03-07 15:43:47,742][213771] Updated weights for policy 0, policy_version 75670 (0.0005) [2023-03-07 15:43:48,501][213771] Updated weights for policy 0, policy_version 75680 (0.0006) [2023-03-07 15:43:49,299][213771] Updated weights for policy 0, policy_version 75690 (0.0006) [2023-03-07 15:43:50,077][213771] Updated weights for policy 0, policy_version 75700 (0.0006) [2023-03-07 15:43:50,855][213771] Updated weights for policy 0, policy_version 75710 (0.0005) [2023-03-07 15:43:51,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13253.0). Total num frames: 77530112. Throughput: 0: 13238.2. Samples: 77513551. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 15:43:51,116][213445] Avg episode reward: [(0, '4306.422')] [2023-03-07 15:43:51,637][213771] Updated weights for policy 0, policy_version 75720 (0.0007) [2023-03-07 15:43:52,393][213771] Updated weights for policy 0, policy_version 75730 (0.0006) [2023-03-07 15:43:53,174][213771] Updated weights for policy 0, policy_version 75740 (0.0006) [2023-03-07 15:43:53,955][213771] Updated weights for policy 0, policy_version 75750 (0.0007) [2023-03-07 15:43:54,721][213771] Updated weights for policy 0, policy_version 75760 (0.0007) [2023-03-07 15:43:55,493][213771] Updated weights for policy 0, policy_version 75770 (0.0006) [2023-03-07 15:43:56,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13243.8, 300 sec: 13253.0). Total num frames: 77596672. Throughput: 0: 13240.7. Samples: 77593083. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 15:43:56,116][213445] Avg episode reward: [(0, '4406.743')] [2023-03-07 15:43:56,265][213771] Updated weights for policy 0, policy_version 75780 (0.0005) [2023-03-07 15:43:57,024][213771] Updated weights for policy 0, policy_version 75790 (0.0006) [2023-03-07 15:43:57,806][213771] Updated weights for policy 0, policy_version 75800 (0.0006) [2023-03-07 15:43:58,586][213771] Updated weights for policy 0, policy_version 75810 (0.0006) [2023-03-07 15:43:59,344][213771] Updated weights for policy 0, policy_version 75820 (0.0006) [2023-03-07 15:44:00,126][213771] Updated weights for policy 0, policy_version 75830 (0.0006) [2023-03-07 15:44:00,904][213771] Updated weights for policy 0, policy_version 75840 (0.0006) [2023-03-07 15:44:01,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.8, 300 sec: 13253.0). Total num frames: 77662208. Throughput: 0: 13237.4. Samples: 77632824. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 15:44:01,116][213445] Avg episode reward: [(0, '4310.600')] [2023-03-07 15:44:01,673][213771] Updated weights for policy 0, policy_version 75850 (0.0006) [2023-03-07 15:44:02,448][213771] Updated weights for policy 0, policy_version 75860 (0.0005) [2023-03-07 15:44:03,210][213771] Updated weights for policy 0, policy_version 75870 (0.0006) [2023-03-07 15:44:03,964][213771] Updated weights for policy 0, policy_version 75880 (0.0005) [2023-03-07 15:44:04,735][213771] Updated weights for policy 0, policy_version 75890 (0.0006) [2023-03-07 15:44:05,512][213771] Updated weights for policy 0, policy_version 75900 (0.0006) [2023-03-07 15:44:06,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.8, 300 sec: 13253.0). Total num frames: 77728768. Throughput: 0: 13241.3. Samples: 77712403. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 15:44:06,106][213445] Avg episode reward: [(0, '4293.791')] [2023-03-07 15:44:06,122][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000075908_77729792.pth... [2023-03-07 15:44:06,153][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000072802_74549248.pth [2023-03-07 15:44:06,273][213771] Updated weights for policy 0, policy_version 75910 (0.0006) [2023-03-07 15:44:07,061][213771] Updated weights for policy 0, policy_version 75920 (0.0005) [2023-03-07 15:44:07,850][213771] Updated weights for policy 0, policy_version 75930 (0.0007) [2023-03-07 15:44:08,625][213771] Updated weights for policy 0, policy_version 75940 (0.0006) [2023-03-07 15:44:09,381][213771] Updated weights for policy 0, policy_version 75950 (0.0006) [2023-03-07 15:44:10,164][213771] Updated weights for policy 0, policy_version 75960 (0.0007) [2023-03-07 15:44:10,942][213771] Updated weights for policy 0, policy_version 75970 (0.0007) [2023-03-07 15:44:11,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13243.8, 300 sec: 13253.0). Total num frames: 77795328. Throughput: 0: 13233.0. Samples: 77791863. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 15:44:11,106][213445] Avg episode reward: [(0, '4082.129')] [2023-03-07 15:44:11,714][213771] Updated weights for policy 0, policy_version 75980 (0.0006) [2023-03-07 15:44:12,485][213771] Updated weights for policy 0, policy_version 75990 (0.0006) [2023-03-07 15:44:13,270][213771] Updated weights for policy 0, policy_version 76000 (0.0006) [2023-03-07 15:44:14,038][213771] Updated weights for policy 0, policy_version 76010 (0.0006) [2023-03-07 15:44:14,805][213771] Updated weights for policy 0, policy_version 76020 (0.0006) [2023-03-07 15:44:15,590][213771] Updated weights for policy 0, policy_version 76030 (0.0006) [2023-03-07 15:44:16,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 77860864. Throughput: 0: 13229.8. Samples: 77831538. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 15:44:16,105][213445] Avg episode reward: [(0, '4328.678')] [2023-03-07 15:44:16,352][213771] Updated weights for policy 0, policy_version 76040 (0.0008) [2023-03-07 15:44:17,133][213771] Updated weights for policy 0, policy_version 76050 (0.0005) [2023-03-07 15:44:17,904][213771] Updated weights for policy 0, policy_version 76060 (0.0007) [2023-03-07 15:44:18,686][213771] Updated weights for policy 0, policy_version 76070 (0.0007) [2023-03-07 15:44:19,462][213771] Updated weights for policy 0, policy_version 76080 (0.0006) [2023-03-07 15:44:20,231][213771] Updated weights for policy 0, policy_version 76090 (0.0006) [2023-03-07 15:44:21,021][213771] Updated weights for policy 0, policy_version 76100 (0.0007) [2023-03-07 15:44:21,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 77927424. Throughput: 0: 13224.2. Samples: 77911025. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:44:21,106][213445] Avg episode reward: [(0, '4326.249')] [2023-03-07 15:44:21,781][213771] Updated weights for policy 0, policy_version 76110 (0.0006) [2023-03-07 15:44:22,557][213771] Updated weights for policy 0, policy_version 76120 (0.0006) [2023-03-07 15:44:23,329][213771] Updated weights for policy 0, policy_version 76130 (0.0006) [2023-03-07 15:44:24,082][213771] Updated weights for policy 0, policy_version 76140 (0.0006) [2023-03-07 15:44:24,861][213771] Updated weights for policy 0, policy_version 76150 (0.0006) [2023-03-07 15:44:25,630][213771] Updated weights for policy 0, policy_version 76160 (0.0006) [2023-03-07 15:44:26,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13243.7, 300 sec: 13253.0). Total num frames: 77993984. Throughput: 0: 13235.1. Samples: 77990671. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:44:26,106][213445] Avg episode reward: [(0, '4296.420')] [2023-03-07 15:44:26,390][213771] Updated weights for policy 0, policy_version 76170 (0.0006) [2023-03-07 15:44:27,169][213771] Updated weights for policy 0, policy_version 76180 (0.0006) [2023-03-07 15:44:27,923][213771] Updated weights for policy 0, policy_version 76190 (0.0006) [2023-03-07 15:44:28,696][213771] Updated weights for policy 0, policy_version 76200 (0.0006) [2023-03-07 15:44:29,489][213771] Updated weights for policy 0, policy_version 76210 (0.0006) [2023-03-07 15:44:30,248][213771] Updated weights for policy 0, policy_version 76220 (0.0005) [2023-03-07 15:44:31,020][213771] Updated weights for policy 0, policy_version 76230 (0.0006) [2023-03-07 15:44:31,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13243.7, 300 sec: 13253.0). Total num frames: 78060544. Throughput: 0: 13244.3. Samples: 78030574. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:44:31,106][213445] Avg episode reward: [(0, '4240.224')] [2023-03-07 15:44:31,798][213771] Updated weights for policy 0, policy_version 76240 (0.0006) [2023-03-07 15:44:32,549][213771] Updated weights for policy 0, policy_version 76250 (0.0007) [2023-03-07 15:44:33,317][213771] Updated weights for policy 0, policy_version 76260 (0.0006) [2023-03-07 15:44:34,081][213771] Updated weights for policy 0, policy_version 76270 (0.0006) [2023-03-07 15:44:34,872][213771] Updated weights for policy 0, policy_version 76280 (0.0006) [2023-03-07 15:44:35,631][213771] Updated weights for policy 0, policy_version 76290 (0.0006) [2023-03-07 15:44:36,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13243.7, 300 sec: 13253.0). Total num frames: 78127104. Throughput: 0: 13260.3. Samples: 78110263. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:44:36,106][213445] Avg episode reward: [(0, '4235.791')] [2023-03-07 15:44:36,417][213771] Updated weights for policy 0, policy_version 76300 (0.0005) [2023-03-07 15:44:37,188][213771] Updated weights for policy 0, policy_version 76310 (0.0006) [2023-03-07 15:44:37,949][213771] Updated weights for policy 0, policy_version 76320 (0.0007) [2023-03-07 15:44:38,737][213771] Updated weights for policy 0, policy_version 76330 (0.0005) [2023-03-07 15:44:39,499][213771] Updated weights for policy 0, policy_version 76340 (0.0006) [2023-03-07 15:44:40,266][213771] Updated weights for policy 0, policy_version 76350 (0.0006) [2023-03-07 15:44:41,058][213771] Updated weights for policy 0, policy_version 76360 (0.0006) [2023-03-07 15:44:41,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 78192640. Throughput: 0: 13256.9. Samples: 78189645. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:44:41,106][213445] Avg episode reward: [(0, '4273.502')] [2023-03-07 15:44:41,835][213771] Updated weights for policy 0, policy_version 76370 (0.0006) [2023-03-07 15:44:42,604][213771] Updated weights for policy 0, policy_version 76380 (0.0006) [2023-03-07 15:44:43,377][213771] Updated weights for policy 0, policy_version 76390 (0.0006) [2023-03-07 15:44:44,160][213771] Updated weights for policy 0, policy_version 76400 (0.0006) [2023-03-07 15:44:44,925][213771] Updated weights for policy 0, policy_version 76410 (0.0006) [2023-03-07 15:44:45,713][213771] Updated weights for policy 0, policy_version 76420 (0.0006) [2023-03-07 15:44:46,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 78259200. Throughput: 0: 13255.3. Samples: 78229317. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:44:46,106][213445] Avg episode reward: [(0, '4286.626')] [2023-03-07 15:44:46,492][213771] Updated weights for policy 0, policy_version 76430 (0.0007) [2023-03-07 15:44:47,259][213771] Updated weights for policy 0, policy_version 76440 (0.0006) [2023-03-07 15:44:48,038][213771] Updated weights for policy 0, policy_version 76450 (0.0006) [2023-03-07 15:44:48,813][213771] Updated weights for policy 0, policy_version 76460 (0.0006) [2023-03-07 15:44:49,573][213771] Updated weights for policy 0, policy_version 76470 (0.0006) [2023-03-07 15:44:50,365][213771] Updated weights for policy 0, policy_version 76480 (0.0006) [2023-03-07 15:44:51,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 78324736. Throughput: 0: 13250.4. Samples: 78308670. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:44:51,106][213445] Avg episode reward: [(0, '4300.527')] [2023-03-07 15:44:51,139][213771] Updated weights for policy 0, policy_version 76490 (0.0007) [2023-03-07 15:44:51,915][213771] Updated weights for policy 0, policy_version 76500 (0.0006) [2023-03-07 15:44:52,696][213771] Updated weights for policy 0, policy_version 76510 (0.0006) [2023-03-07 15:44:53,451][213771] Updated weights for policy 0, policy_version 76520 (0.0006) [2023-03-07 15:44:54,237][213771] Updated weights for policy 0, policy_version 76530 (0.0006) [2023-03-07 15:44:54,997][213771] Updated weights for policy 0, policy_version 76540 (0.0007) [2023-03-07 15:44:55,763][213771] Updated weights for policy 0, policy_version 76550 (0.0006) [2023-03-07 15:44:56,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 78391296. Throughput: 0: 13247.0. Samples: 78387980. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:44:56,106][213445] Avg episode reward: [(0, '4242.014')] [2023-03-07 15:44:56,558][213771] Updated weights for policy 0, policy_version 76560 (0.0006) [2023-03-07 15:44:57,325][213771] Updated weights for policy 0, policy_version 76570 (0.0006) [2023-03-07 15:44:58,097][213771] Updated weights for policy 0, policy_version 76580 (0.0005) [2023-03-07 15:44:58,862][213771] Updated weights for policy 0, policy_version 76590 (0.0007) [2023-03-07 15:44:59,632][213771] Updated weights for policy 0, policy_version 76600 (0.0006) [2023-03-07 15:45:00,423][213771] Updated weights for policy 0, policy_version 76610 (0.0006) [2023-03-07 15:45:01,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 78456832. Throughput: 0: 13247.9. Samples: 78427696. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:45:01,106][213445] Avg episode reward: [(0, '4279.363')] [2023-03-07 15:45:01,191][213771] Updated weights for policy 0, policy_version 76620 (0.0006) [2023-03-07 15:45:01,950][213771] Updated weights for policy 0, policy_version 76630 (0.0006) [2023-03-07 15:45:02,743][213771] Updated weights for policy 0, policy_version 76640 (0.0006) [2023-03-07 15:45:03,518][213771] Updated weights for policy 0, policy_version 76650 (0.0006) [2023-03-07 15:45:04,275][213771] Updated weights for policy 0, policy_version 76660 (0.0005) [2023-03-07 15:45:05,046][213771] Updated weights for policy 0, policy_version 76670 (0.0006) [2023-03-07 15:45:05,817][213771] Updated weights for policy 0, policy_version 76680 (0.0006) [2023-03-07 15:45:06,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 78523392. Throughput: 0: 13249.3. Samples: 78507242. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:45:06,106][213445] Avg episode reward: [(0, '4215.346')] [2023-03-07 15:45:06,592][213771] Updated weights for policy 0, policy_version 76690 (0.0006) [2023-03-07 15:45:07,355][213771] Updated weights for policy 0, policy_version 76700 (0.0006) [2023-03-07 15:45:08,120][213771] Updated weights for policy 0, policy_version 76710 (0.0007) [2023-03-07 15:45:08,893][213771] Updated weights for policy 0, policy_version 76720 (0.0006) [2023-03-07 15:45:09,670][213771] Updated weights for policy 0, policy_version 76730 (0.0006) [2023-03-07 15:45:10,427][213771] Updated weights for policy 0, policy_version 76740 (0.0007) [2023-03-07 15:45:11,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 78589952. Throughput: 0: 13252.5. Samples: 78587032. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:45:11,106][213445] Avg episode reward: [(0, '4275.395')] [2023-03-07 15:45:11,213][213771] Updated weights for policy 0, policy_version 76750 (0.0006) [2023-03-07 15:45:11,976][213771] Updated weights for policy 0, policy_version 76760 (0.0007) [2023-03-07 15:45:12,759][213771] Updated weights for policy 0, policy_version 76770 (0.0006) [2023-03-07 15:45:13,518][213771] Updated weights for policy 0, policy_version 76780 (0.0006) [2023-03-07 15:45:14,290][213771] Updated weights for policy 0, policy_version 76790 (0.0006) [2023-03-07 15:45:15,056][213771] Updated weights for policy 0, policy_version 76800 (0.0006) [2023-03-07 15:45:15,832][213771] Updated weights for policy 0, policy_version 76810 (0.0005) [2023-03-07 15:45:16,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 78656512. Throughput: 0: 13252.0. Samples: 78626917. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:45:16,106][213445] Avg episode reward: [(0, '4301.251')] [2023-03-07 15:45:16,602][213771] Updated weights for policy 0, policy_version 76820 (0.0005) [2023-03-07 15:45:17,382][213771] Updated weights for policy 0, policy_version 76830 (0.0006) [2023-03-07 15:45:18,163][213771] Updated weights for policy 0, policy_version 76840 (0.0007) [2023-03-07 15:45:18,933][213771] Updated weights for policy 0, policy_version 76850 (0.0005) [2023-03-07 15:45:19,701][213771] Updated weights for policy 0, policy_version 76860 (0.0006) [2023-03-07 15:45:20,477][213771] Updated weights for policy 0, policy_version 76870 (0.0006) [2023-03-07 15:45:21,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 78723072. Throughput: 0: 13247.7. Samples: 78706409. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:45:21,106][213445] Avg episode reward: [(0, '4267.202')] [2023-03-07 15:45:21,238][213771] Updated weights for policy 0, policy_version 76880 (0.0006) [2023-03-07 15:45:22,013][213771] Updated weights for policy 0, policy_version 76890 (0.0006) [2023-03-07 15:45:22,797][213771] Updated weights for policy 0, policy_version 76900 (0.0008) [2023-03-07 15:45:23,559][213771] Updated weights for policy 0, policy_version 76910 (0.0006) [2023-03-07 15:45:24,309][213771] Updated weights for policy 0, policy_version 76920 (0.0005) [2023-03-07 15:45:25,081][213771] Updated weights for policy 0, policy_version 76930 (0.0006) [2023-03-07 15:45:25,838][213771] Updated weights for policy 0, policy_version 76940 (0.0006) [2023-03-07 15:45:26,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 78789632. Throughput: 0: 13259.3. Samples: 78786317. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:45:26,106][213445] Avg episode reward: [(0, '4305.461')] [2023-03-07 15:45:26,618][213771] Updated weights for policy 0, policy_version 76950 (0.0006) [2023-03-07 15:45:27,404][213771] Updated weights for policy 0, policy_version 76960 (0.0007) [2023-03-07 15:45:28,178][213771] Updated weights for policy 0, policy_version 76970 (0.0007) [2023-03-07 15:45:28,957][213771] Updated weights for policy 0, policy_version 76980 (0.0006) [2023-03-07 15:45:29,737][213771] Updated weights for policy 0, policy_version 76990 (0.0006) [2023-03-07 15:45:30,487][213771] Updated weights for policy 0, policy_version 77000 (0.0006) [2023-03-07 15:45:31,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13253.0). Total num frames: 78855168. Throughput: 0: 13259.2. Samples: 78825979. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:45:31,106][213445] Avg episode reward: [(0, '4257.450')] [2023-03-07 15:45:31,273][213771] Updated weights for policy 0, policy_version 77010 (0.0006) [2023-03-07 15:45:32,034][213771] Updated weights for policy 0, policy_version 77020 (0.0005) [2023-03-07 15:45:32,797][213771] Updated weights for policy 0, policy_version 77030 (0.0006) [2023-03-07 15:45:33,568][213771] Updated weights for policy 0, policy_version 77040 (0.0008) [2023-03-07 15:45:34,342][213771] Updated weights for policy 0, policy_version 77050 (0.0006) [2023-03-07 15:45:35,098][213771] Updated weights for policy 0, policy_version 77060 (0.0006) [2023-03-07 15:45:35,875][213771] Updated weights for policy 0, policy_version 77070 (0.0006) [2023-03-07 15:45:36,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13243.7, 300 sec: 13253.0). Total num frames: 78921728. Throughput: 0: 13269.2. Samples: 78905784. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:45:36,106][213445] Avg episode reward: [(0, '4192.285')] [2023-03-07 15:45:36,645][213771] Updated weights for policy 0, policy_version 77080 (0.0006) [2023-03-07 15:45:37,409][213771] Updated weights for policy 0, policy_version 77090 (0.0006) [2023-03-07 15:45:38,192][213771] Updated weights for policy 0, policy_version 77100 (0.0006) [2023-03-07 15:45:38,965][213771] Updated weights for policy 0, policy_version 77110 (0.0006) [2023-03-07 15:45:39,733][213771] Updated weights for policy 0, policy_version 77120 (0.0005) [2023-03-07 15:45:40,488][213771] Updated weights for policy 0, policy_version 77130 (0.0007) [2023-03-07 15:45:41,105][213445] Fps is (10 sec: 13414.7, 60 sec: 13277.9, 300 sec: 13256.5). Total num frames: 78989312. Throughput: 0: 13281.5. Samples: 78985647. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:45:41,105][213445] Avg episode reward: [(0, '4225.134')] [2023-03-07 15:45:41,248][213771] Updated weights for policy 0, policy_version 77140 (0.0007) [2023-03-07 15:45:42,034][213771] Updated weights for policy 0, policy_version 77150 (0.0006) [2023-03-07 15:45:42,788][213771] Updated weights for policy 0, policy_version 77160 (0.0007) [2023-03-07 15:45:43,565][213771] Updated weights for policy 0, policy_version 77170 (0.0006) [2023-03-07 15:45:44,342][213771] Updated weights for policy 0, policy_version 77180 (0.0006) [2023-03-07 15:45:45,113][213771] Updated weights for policy 0, policy_version 77190 (0.0007) [2023-03-07 15:45:45,889][213771] Updated weights for policy 0, policy_version 77200 (0.0006) [2023-03-07 15:45:46,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13256.5). Total num frames: 79054848. Throughput: 0: 13286.7. Samples: 79025598. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:45:46,106][213445] Avg episode reward: [(0, '4322.453')] [2023-03-07 15:45:46,658][213771] Updated weights for policy 0, policy_version 77210 (0.0007) [2023-03-07 15:45:47,443][213771] Updated weights for policy 0, policy_version 77220 (0.0006) [2023-03-07 15:45:48,220][213771] Updated weights for policy 0, policy_version 77230 (0.0007) [2023-03-07 15:45:48,990][213771] Updated weights for policy 0, policy_version 77240 (0.0006) [2023-03-07 15:45:49,754][213771] Updated weights for policy 0, policy_version 77250 (0.0006) [2023-03-07 15:45:50,517][213771] Updated weights for policy 0, policy_version 77260 (0.0005) [2023-03-07 15:45:51,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13277.9, 300 sec: 13256.5). Total num frames: 79121408. Throughput: 0: 13285.8. Samples: 79105104. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:45:51,106][213445] Avg episode reward: [(0, '4233.146')] [2023-03-07 15:45:51,306][213771] Updated weights for policy 0, policy_version 77270 (0.0005) [2023-03-07 15:45:52,096][213771] Updated weights for policy 0, policy_version 77280 (0.0006) [2023-03-07 15:45:52,845][213771] Updated weights for policy 0, policy_version 77290 (0.0007) [2023-03-07 15:45:53,625][213771] Updated weights for policy 0, policy_version 77300 (0.0006) [2023-03-07 15:45:54,394][213771] Updated weights for policy 0, policy_version 77310 (0.0006) [2023-03-07 15:45:55,150][213771] Updated weights for policy 0, policy_version 77320 (0.0006) [2023-03-07 15:45:55,931][213771] Updated weights for policy 0, policy_version 77330 (0.0006) [2023-03-07 15:45:56,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13277.9, 300 sec: 13256.5). Total num frames: 79187968. Throughput: 0: 13278.9. Samples: 79184583. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:45:56,106][213445] Avg episode reward: [(0, '4200.153')] [2023-03-07 15:45:56,706][213771] Updated weights for policy 0, policy_version 77340 (0.0006) [2023-03-07 15:45:57,469][213771] Updated weights for policy 0, policy_version 77350 (0.0007) [2023-03-07 15:45:58,251][213771] Updated weights for policy 0, policy_version 77360 (0.0006) [2023-03-07 15:45:59,018][213771] Updated weights for policy 0, policy_version 77370 (0.0006) [2023-03-07 15:45:59,782][213771] Updated weights for policy 0, policy_version 77380 (0.0006) [2023-03-07 15:46:00,558][213771] Updated weights for policy 0, policy_version 77390 (0.0006) [2023-03-07 15:46:01,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13295.0, 300 sec: 13256.5). Total num frames: 79254528. Throughput: 0: 13276.5. Samples: 79224358. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:46:01,106][213445] Avg episode reward: [(0, '4166.582')] [2023-03-07 15:46:01,338][213771] Updated weights for policy 0, policy_version 77400 (0.0006) [2023-03-07 15:46:02,103][213771] Updated weights for policy 0, policy_version 77410 (0.0005) [2023-03-07 15:46:02,876][213771] Updated weights for policy 0, policy_version 77420 (0.0007) [2023-03-07 15:46:03,650][213771] Updated weights for policy 0, policy_version 77430 (0.0005) [2023-03-07 15:46:04,425][213771] Updated weights for policy 0, policy_version 77440 (0.0007) [2023-03-07 15:46:05,182][213771] Updated weights for policy 0, policy_version 77450 (0.0007) [2023-03-07 15:46:05,961][213771] Updated weights for policy 0, policy_version 77460 (0.0006) [2023-03-07 15:46:06,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13277.9, 300 sec: 13253.0). Total num frames: 79320064. Throughput: 0: 13281.6. Samples: 79304080. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:46:06,106][213445] Avg episode reward: [(0, '4097.521')] [2023-03-07 15:46:06,112][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000077462_79321088.pth... [2023-03-07 15:46:06,141][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000074356_76140544.pth [2023-03-07 15:46:06,729][213771] Updated weights for policy 0, policy_version 77470 (0.0006) [2023-03-07 15:46:07,484][213771] Updated weights for policy 0, policy_version 77480 (0.0006) [2023-03-07 15:46:08,264][213771] Updated weights for policy 0, policy_version 77490 (0.0007) [2023-03-07 15:46:09,029][213771] Updated weights for policy 0, policy_version 77500 (0.0007) [2023-03-07 15:46:09,808][213771] Updated weights for policy 0, policy_version 77510 (0.0006) [2023-03-07 15:46:10,567][213771] Updated weights for policy 0, policy_version 77520 (0.0007) [2023-03-07 15:46:11,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13277.9, 300 sec: 13256.5). Total num frames: 79386624. Throughput: 0: 13280.0. Samples: 79383917. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:46:11,106][213445] Avg episode reward: [(0, '4143.513')] [2023-03-07 15:46:11,362][213771] Updated weights for policy 0, policy_version 77530 (0.0007) [2023-03-07 15:46:12,129][213771] Updated weights for policy 0, policy_version 77540 (0.0006) [2023-03-07 15:46:12,907][213771] Updated weights for policy 0, policy_version 77550 (0.0006) [2023-03-07 15:46:13,665][213771] Updated weights for policy 0, policy_version 77560 (0.0005) [2023-03-07 15:46:14,452][213771] Updated weights for policy 0, policy_version 77570 (0.0007) [2023-03-07 15:46:15,216][213771] Updated weights for policy 0, policy_version 77580 (0.0006) [2023-03-07 15:46:16,009][213771] Updated weights for policy 0, policy_version 77590 (0.0007) [2023-03-07 15:46:16,105][213445] Fps is (10 sec: 13311.7, 60 sec: 13277.9, 300 sec: 13256.5). Total num frames: 79453184. Throughput: 0: 13283.5. Samples: 79423736. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:46:16,106][213445] Avg episode reward: [(0, '4196.114')] [2023-03-07 15:46:16,776][213771] Updated weights for policy 0, policy_version 77600 (0.0007) [2023-03-07 15:46:17,542][213771] Updated weights for policy 0, policy_version 77610 (0.0006) [2023-03-07 15:46:18,313][213771] Updated weights for policy 0, policy_version 77620 (0.0007) [2023-03-07 15:46:19,078][213771] Updated weights for policy 0, policy_version 77630 (0.0006) [2023-03-07 15:46:19,852][213771] Updated weights for policy 0, policy_version 77640 (0.0006) [2023-03-07 15:46:20,632][213771] Updated weights for policy 0, policy_version 77650 (0.0006) [2023-03-07 15:46:21,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 79518720. Throughput: 0: 13270.9. Samples: 79502975. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:46:21,105][213445] Avg episode reward: [(0, '4223.538')] [2023-03-07 15:46:21,418][213771] Updated weights for policy 0, policy_version 77660 (0.0007) [2023-03-07 15:46:22,207][213771] Updated weights for policy 0, policy_version 77670 (0.0006) [2023-03-07 15:46:22,982][213771] Updated weights for policy 0, policy_version 77680 (0.0006) [2023-03-07 15:46:23,775][213771] Updated weights for policy 0, policy_version 77690 (0.0006) [2023-03-07 15:46:24,543][213771] Updated weights for policy 0, policy_version 77700 (0.0007) [2023-03-07 15:46:25,324][213771] Updated weights for policy 0, policy_version 77710 (0.0006) [2023-03-07 15:46:26,098][213771] Updated weights for policy 0, policy_version 77720 (0.0006) [2023-03-07 15:46:26,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 79585280. Throughput: 0: 13247.9. Samples: 79581805. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:46:26,106][213445] Avg episode reward: [(0, '4299.430')] [2023-03-07 15:46:26,872][213771] Updated weights for policy 0, policy_version 77730 (0.0006) [2023-03-07 15:46:27,646][213771] Updated weights for policy 0, policy_version 77740 (0.0005) [2023-03-07 15:46:28,415][213771] Updated weights for policy 0, policy_version 77750 (0.0006) [2023-03-07 15:46:29,195][213771] Updated weights for policy 0, policy_version 77760 (0.0006) [2023-03-07 15:46:29,978][213771] Updated weights for policy 0, policy_version 77770 (0.0008) [2023-03-07 15:46:30,748][213771] Updated weights for policy 0, policy_version 77780 (0.0007) [2023-03-07 15:46:31,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 79650816. Throughput: 0: 13239.3. Samples: 79621367. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:46:31,106][213445] Avg episode reward: [(0, '4283.544')] [2023-03-07 15:46:31,504][213771] Updated weights for policy 0, policy_version 77790 (0.0005) [2023-03-07 15:46:32,297][213771] Updated weights for policy 0, policy_version 77800 (0.0006) [2023-03-07 15:46:33,079][213771] Updated weights for policy 0, policy_version 77810 (0.0006) [2023-03-07 15:46:33,848][213771] Updated weights for policy 0, policy_version 77820 (0.0006) [2023-03-07 15:46:34,596][213771] Updated weights for policy 0, policy_version 77830 (0.0006) [2023-03-07 15:46:35,368][213771] Updated weights for policy 0, policy_version 77840 (0.0007) [2023-03-07 15:46:36,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 79717376. Throughput: 0: 13244.8. Samples: 79701121. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:46:36,106][213445] Avg episode reward: [(0, '4234.813')] [2023-03-07 15:46:36,142][213771] Updated weights for policy 0, policy_version 77850 (0.0006) [2023-03-07 15:46:36,933][213771] Updated weights for policy 0, policy_version 77860 (0.0006) [2023-03-07 15:46:37,699][213771] Updated weights for policy 0, policy_version 77870 (0.0007) [2023-03-07 15:46:38,479][213771] Updated weights for policy 0, policy_version 77880 (0.0006) [2023-03-07 15:46:39,253][213771] Updated weights for policy 0, policy_version 77890 (0.0007) [2023-03-07 15:46:40,012][213771] Updated weights for policy 0, policy_version 77900 (0.0006) [2023-03-07 15:46:40,781][213771] Updated weights for policy 0, policy_version 77910 (0.0006) [2023-03-07 15:46:41,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13243.7, 300 sec: 13253.0). Total num frames: 79783936. Throughput: 0: 13246.3. Samples: 79780667. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:46:41,106][213445] Avg episode reward: [(0, '4083.455')] [2023-03-07 15:46:41,557][213771] Updated weights for policy 0, policy_version 77920 (0.0007) [2023-03-07 15:46:42,324][213771] Updated weights for policy 0, policy_version 77930 (0.0006) [2023-03-07 15:46:43,101][213771] Updated weights for policy 0, policy_version 77940 (0.0006) [2023-03-07 15:46:43,859][213771] Updated weights for policy 0, policy_version 77950 (0.0006) [2023-03-07 15:46:44,639][213771] Updated weights for policy 0, policy_version 77960 (0.0006) [2023-03-07 15:46:45,421][213771] Updated weights for policy 0, policy_version 77970 (0.0005) [2023-03-07 15:46:46,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 79850496. Throughput: 0: 13247.6. Samples: 79820503. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:46:46,106][213445] Avg episode reward: [(0, '4132.895')] [2023-03-07 15:46:46,182][213771] Updated weights for policy 0, policy_version 77980 (0.0007) [2023-03-07 15:46:46,949][213771] Updated weights for policy 0, policy_version 77990 (0.0006) [2023-03-07 15:46:47,721][213771] Updated weights for policy 0, policy_version 78000 (0.0007) [2023-03-07 15:46:48,490][213771] Updated weights for policy 0, policy_version 78010 (0.0006) [2023-03-07 15:46:49,259][213771] Updated weights for policy 0, policy_version 78020 (0.0006) [2023-03-07 15:46:50,028][213771] Updated weights for policy 0, policy_version 78030 (0.0007) [2023-03-07 15:46:50,787][213771] Updated weights for policy 0, policy_version 78040 (0.0006) [2023-03-07 15:46:51,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 79917056. Throughput: 0: 13244.5. Samples: 79900085. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:46:51,106][213445] Avg episode reward: [(0, '4202.455')] [2023-03-07 15:46:51,553][213771] Updated weights for policy 0, policy_version 78050 (0.0006) [2023-03-07 15:46:52,305][213771] Updated weights for policy 0, policy_version 78060 (0.0006) [2023-03-07 15:46:53,093][213771] Updated weights for policy 0, policy_version 78070 (0.0006) [2023-03-07 15:46:53,861][213771] Updated weights for policy 0, policy_version 78080 (0.0006) [2023-03-07 15:46:54,627][213771] Updated weights for policy 0, policy_version 78090 (0.0007) [2023-03-07 15:46:55,410][213771] Updated weights for policy 0, policy_version 78100 (0.0006) [2023-03-07 15:46:56,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 79983616. Throughput: 0: 13249.5. Samples: 79980143. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:46:56,106][213445] Avg episode reward: [(0, '4196.115')] [2023-03-07 15:46:56,173][213771] Updated weights for policy 0, policy_version 78110 (0.0006) [2023-03-07 15:46:56,953][213771] Updated weights for policy 0, policy_version 78120 (0.0006) [2023-03-07 15:46:57,717][213771] Updated weights for policy 0, policy_version 78130 (0.0006) [2023-03-07 15:46:58,485][213771] Updated weights for policy 0, policy_version 78140 (0.0005) [2023-03-07 15:46:59,270][213771] Updated weights for policy 0, policy_version 78150 (0.0006) [2023-03-07 15:47:00,039][213771] Updated weights for policy 0, policy_version 78160 (0.0007) [2023-03-07 15:47:00,814][213771] Updated weights for policy 0, policy_version 78170 (0.0006) [2023-03-07 15:47:01,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13253.0). Total num frames: 80049152. Throughput: 0: 13247.0. Samples: 80019850. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:47:01,106][213445] Avg episode reward: [(0, '4246.507')] [2023-03-07 15:47:01,594][213771] Updated weights for policy 0, policy_version 78180 (0.0005) [2023-03-07 15:47:02,364][213771] Updated weights for policy 0, policy_version 78190 (0.0006) [2023-03-07 15:47:03,120][213771] Updated weights for policy 0, policy_version 78200 (0.0006) [2023-03-07 15:47:03,897][213771] Updated weights for policy 0, policy_version 78210 (0.0006) [2023-03-07 15:47:04,689][213771] Updated weights for policy 0, policy_version 78220 (0.0006) [2023-03-07 15:47:05,455][213771] Updated weights for policy 0, policy_version 78230 (0.0007) [2023-03-07 15:47:06,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 80115712. Throughput: 0: 13252.7. Samples: 80099347. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:47:06,106][213445] Avg episode reward: [(0, '4267.922')] [2023-03-07 15:47:06,230][213771] Updated weights for policy 0, policy_version 78240 (0.0006) [2023-03-07 15:47:07,009][213771] Updated weights for policy 0, policy_version 78250 (0.0006) [2023-03-07 15:47:07,768][213771] Updated weights for policy 0, policy_version 78260 (0.0006) [2023-03-07 15:47:08,552][213771] Updated weights for policy 0, policy_version 78270 (0.0005) [2023-03-07 15:47:09,317][213771] Updated weights for policy 0, policy_version 78280 (0.0006) [2023-03-07 15:47:10,090][213771] Updated weights for policy 0, policy_version 78290 (0.0006) [2023-03-07 15:47:10,861][213771] Updated weights for policy 0, policy_version 78300 (0.0006) [2023-03-07 15:47:11,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 80182272. Throughput: 0: 13268.5. Samples: 80178889. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:47:11,106][213445] Avg episode reward: [(0, '4308.611')] [2023-03-07 15:47:11,612][213771] Updated weights for policy 0, policy_version 78310 (0.0006) [2023-03-07 15:47:12,394][213771] Updated weights for policy 0, policy_version 78320 (0.0006) [2023-03-07 15:47:13,168][213771] Updated weights for policy 0, policy_version 78330 (0.0006) [2023-03-07 15:47:13,920][213771] Updated weights for policy 0, policy_version 78340 (0.0005) [2023-03-07 15:47:14,717][213771] Updated weights for policy 0, policy_version 78350 (0.0007) [2023-03-07 15:47:15,482][213771] Updated weights for policy 0, policy_version 78360 (0.0006) [2023-03-07 15:47:16,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 80248832. Throughput: 0: 13279.0. Samples: 80218921. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:47:16,106][213445] Avg episode reward: [(0, '4404.050')] [2023-03-07 15:47:16,253][213771] Updated weights for policy 0, policy_version 78370 (0.0006) [2023-03-07 15:47:17,025][213771] Updated weights for policy 0, policy_version 78380 (0.0006) [2023-03-07 15:47:17,790][213771] Updated weights for policy 0, policy_version 78390 (0.0007) [2023-03-07 15:47:18,563][213771] Updated weights for policy 0, policy_version 78400 (0.0006) [2023-03-07 15:47:19,366][213771] Updated weights for policy 0, policy_version 78410 (0.0006) [2023-03-07 15:47:20,122][213771] Updated weights for policy 0, policy_version 78420 (0.0005) [2023-03-07 15:47:20,916][213771] Updated weights for policy 0, policy_version 78430 (0.0006) [2023-03-07 15:47:21,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 80314368. Throughput: 0: 13266.7. Samples: 80298123. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:47:21,106][213445] Avg episode reward: [(0, '4297.746')] [2023-03-07 15:47:21,680][213771] Updated weights for policy 0, policy_version 78440 (0.0006) [2023-03-07 15:47:22,444][213771] Updated weights for policy 0, policy_version 78450 (0.0007) [2023-03-07 15:47:23,242][213771] Updated weights for policy 0, policy_version 78460 (0.0006) [2023-03-07 15:47:24,013][213771] Updated weights for policy 0, policy_version 78470 (0.0006) [2023-03-07 15:47:24,786][213771] Updated weights for policy 0, policy_version 78480 (0.0006) [2023-03-07 15:47:25,561][213771] Updated weights for policy 0, policy_version 78490 (0.0006) [2023-03-07 15:47:26,105][213445] Fps is (10 sec: 13107.3, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 80379904. Throughput: 0: 13258.2. Samples: 80377286. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:47:26,106][213445] Avg episode reward: [(0, '4280.400')] [2023-03-07 15:47:26,341][213771] Updated weights for policy 0, policy_version 78500 (0.0006) [2023-03-07 15:47:27,103][213771] Updated weights for policy 0, policy_version 78510 (0.0007) [2023-03-07 15:47:27,879][213771] Updated weights for policy 0, policy_version 78520 (0.0006) [2023-03-07 15:47:28,669][213771] Updated weights for policy 0, policy_version 78530 (0.0006) [2023-03-07 15:47:29,442][213771] Updated weights for policy 0, policy_version 78540 (0.0006) [2023-03-07 15:47:30,216][213771] Updated weights for policy 0, policy_version 78550 (0.0007) [2023-03-07 15:47:30,993][213771] Updated weights for policy 0, policy_version 78560 (0.0007) [2023-03-07 15:47:31,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 80446464. Throughput: 0: 13253.6. Samples: 80416917. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:47:31,106][213445] Avg episode reward: [(0, '4226.596')] [2023-03-07 15:47:31,777][213771] Updated weights for policy 0, policy_version 78570 (0.0006) [2023-03-07 15:47:32,542][213771] Updated weights for policy 0, policy_version 78580 (0.0006) [2023-03-07 15:47:33,315][213771] Updated weights for policy 0, policy_version 78590 (0.0006) [2023-03-07 15:47:34,073][213771] Updated weights for policy 0, policy_version 78600 (0.0006) [2023-03-07 15:47:34,840][213771] Updated weights for policy 0, policy_version 78610 (0.0006) [2023-03-07 15:47:35,621][213771] Updated weights for policy 0, policy_version 78620 (0.0006) [2023-03-07 15:47:36,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 80513024. Throughput: 0: 13251.9. Samples: 80496419. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:47:36,116][213445] Avg episode reward: [(0, '4291.823')] [2023-03-07 15:47:36,399][213771] Updated weights for policy 0, policy_version 78630 (0.0006) [2023-03-07 15:47:37,139][213771] Updated weights for policy 0, policy_version 78640 (0.0006) [2023-03-07 15:47:37,923][213771] Updated weights for policy 0, policy_version 78650 (0.0006) [2023-03-07 15:47:38,706][213771] Updated weights for policy 0, policy_version 78660 (0.0006) [2023-03-07 15:47:39,493][213771] Updated weights for policy 0, policy_version 78670 (0.0006) [2023-03-07 15:47:40,238][213771] Updated weights for policy 0, policy_version 78680 (0.0006) [2023-03-07 15:47:41,029][213771] Updated weights for policy 0, policy_version 78690 (0.0006) [2023-03-07 15:47:41,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13260.8, 300 sec: 13256.5). Total num frames: 80579584. Throughput: 0: 13244.8. Samples: 80576159. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:47:41,116][213445] Avg episode reward: [(0, '4290.062')] [2023-03-07 15:47:41,799][213771] Updated weights for policy 0, policy_version 78700 (0.0008) [2023-03-07 15:47:42,555][213771] Updated weights for policy 0, policy_version 78710 (0.0006) [2023-03-07 15:47:43,321][213771] Updated weights for policy 0, policy_version 78720 (0.0006) [2023-03-07 15:47:44,088][213771] Updated weights for policy 0, policy_version 78730 (0.0006) [2023-03-07 15:47:44,847][213771] Updated weights for policy 0, policy_version 78740 (0.0006) [2023-03-07 15:47:45,626][213771] Updated weights for policy 0, policy_version 78750 (0.0006) [2023-03-07 15:47:46,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13260.8, 300 sec: 13256.5). Total num frames: 80646144. Throughput: 0: 13247.8. Samples: 80616000. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:47:46,123][213445] Avg episode reward: [(0, '4245.054')] [2023-03-07 15:47:46,401][213771] Updated weights for policy 0, policy_version 78760 (0.0006) [2023-03-07 15:47:47,172][213771] Updated weights for policy 0, policy_version 78770 (0.0006) [2023-03-07 15:47:47,961][213771] Updated weights for policy 0, policy_version 78780 (0.0005) [2023-03-07 15:47:48,720][213771] Updated weights for policy 0, policy_version 78790 (0.0006) [2023-03-07 15:47:49,497][213771] Updated weights for policy 0, policy_version 78800 (0.0006) [2023-03-07 15:47:50,278][213771] Updated weights for policy 0, policy_version 78810 (0.0006) [2023-03-07 15:47:51,038][213771] Updated weights for policy 0, policy_version 78820 (0.0008) [2023-03-07 15:47:51,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13256.5). Total num frames: 80712704. Throughput: 0: 13251.6. Samples: 80695669. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:47:51,116][213445] Avg episode reward: [(0, '4282.481')] [2023-03-07 15:47:51,808][213771] Updated weights for policy 0, policy_version 78830 (0.0007) [2023-03-07 15:47:52,589][213771] Updated weights for policy 0, policy_version 78840 (0.0006) [2023-03-07 15:47:53,354][213771] Updated weights for policy 0, policy_version 78850 (0.0007) [2023-03-07 15:47:54,123][213771] Updated weights for policy 0, policy_version 78860 (0.0006) [2023-03-07 15:47:54,910][213771] Updated weights for policy 0, policy_version 78870 (0.0007) [2023-03-07 15:47:55,672][213771] Updated weights for policy 0, policy_version 78880 (0.0006) [2023-03-07 15:47:56,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13256.5). Total num frames: 80778240. Throughput: 0: 13253.0. Samples: 80775274. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:47:56,116][213445] Avg episode reward: [(0, '4269.509')] [2023-03-07 15:47:56,453][213771] Updated weights for policy 0, policy_version 78890 (0.0006) [2023-03-07 15:47:57,226][213771] Updated weights for policy 0, policy_version 78900 (0.0006) [2023-03-07 15:47:57,992][213771] Updated weights for policy 0, policy_version 78910 (0.0006) [2023-03-07 15:47:58,770][213771] Updated weights for policy 0, policy_version 78920 (0.0005) [2023-03-07 15:47:59,547][213771] Updated weights for policy 0, policy_version 78930 (0.0006) [2023-03-07 15:48:00,321][213771] Updated weights for policy 0, policy_version 78940 (0.0006) [2023-03-07 15:48:01,092][213771] Updated weights for policy 0, policy_version 78950 (0.0006) [2023-03-07 15:48:01,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13260.8, 300 sec: 13256.5). Total num frames: 80844800. Throughput: 0: 13250.3. Samples: 80815186. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:48:01,117][213445] Avg episode reward: [(0, '4264.820')] [2023-03-07 15:48:01,866][213771] Updated weights for policy 0, policy_version 78960 (0.0007) [2023-03-07 15:48:02,655][213771] Updated weights for policy 0, policy_version 78970 (0.0006) [2023-03-07 15:48:03,422][213771] Updated weights for policy 0, policy_version 78980 (0.0007) [2023-03-07 15:48:04,185][213771] Updated weights for policy 0, policy_version 78990 (0.0007) [2023-03-07 15:48:04,953][213771] Updated weights for policy 0, policy_version 79000 (0.0006) [2023-03-07 15:48:05,715][213771] Updated weights for policy 0, policy_version 79010 (0.0005) [2023-03-07 15:48:06,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13256.5). Total num frames: 80911360. Throughput: 0: 13250.9. Samples: 80894412. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:48:06,106][213445] Avg episode reward: [(0, '4259.388')] [2023-03-07 15:48:06,113][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000079015_80911360.pth... [2023-03-07 15:48:06,145][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000075908_77729792.pth [2023-03-07 15:48:06,490][213771] Updated weights for policy 0, policy_version 79020 (0.0005) [2023-03-07 15:48:07,258][213771] Updated weights for policy 0, policy_version 79030 (0.0006) [2023-03-07 15:48:08,018][213771] Updated weights for policy 0, policy_version 79040 (0.0006) [2023-03-07 15:48:08,791][213771] Updated weights for policy 0, policy_version 79050 (0.0007) [2023-03-07 15:48:09,573][213771] Updated weights for policy 0, policy_version 79060 (0.0006) [2023-03-07 15:48:10,334][213771] Updated weights for policy 0, policy_version 79070 (0.0006) [2023-03-07 15:48:11,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13256.5). Total num frames: 80976896. Throughput: 0: 13264.1. Samples: 80974170. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:48:11,106][213445] Avg episode reward: [(0, '4251.069')] [2023-03-07 15:48:11,119][213771] Updated weights for policy 0, policy_version 79080 (0.0005) [2023-03-07 15:48:11,899][213771] Updated weights for policy 0, policy_version 79090 (0.0005) [2023-03-07 15:48:12,656][213771] Updated weights for policy 0, policy_version 79100 (0.0006) [2023-03-07 15:48:13,442][213771] Updated weights for policy 0, policy_version 79110 (0.0007) [2023-03-07 15:48:14,227][213771] Updated weights for policy 0, policy_version 79120 (0.0006) [2023-03-07 15:48:15,004][213771] Updated weights for policy 0, policy_version 79130 (0.0005) [2023-03-07 15:48:15,776][213771] Updated weights for policy 0, policy_version 79140 (0.0006) [2023-03-07 15:48:16,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13243.7, 300 sec: 13256.5). Total num frames: 81043456. Throughput: 0: 13263.7. Samples: 81013782. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:48:16,106][213445] Avg episode reward: [(0, '4177.274')] [2023-03-07 15:48:16,537][213771] Updated weights for policy 0, policy_version 79150 (0.0006) [2023-03-07 15:48:17,305][213771] Updated weights for policy 0, policy_version 79160 (0.0006) [2023-03-07 15:48:18,087][213771] Updated weights for policy 0, policy_version 79170 (0.0006) [2023-03-07 15:48:18,873][213771] Updated weights for policy 0, policy_version 79180 (0.0006) [2023-03-07 15:48:19,657][213771] Updated weights for policy 0, policy_version 79190 (0.0006) [2023-03-07 15:48:20,412][213771] Updated weights for policy 0, policy_version 79200 (0.0006) [2023-03-07 15:48:21,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13256.5). Total num frames: 81110016. Throughput: 0: 13259.3. Samples: 81093089. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:48:21,106][213445] Avg episode reward: [(0, '4215.372')] [2023-03-07 15:48:21,193][213771] Updated weights for policy 0, policy_version 79210 (0.0006) [2023-03-07 15:48:21,966][213771] Updated weights for policy 0, policy_version 79220 (0.0006) [2023-03-07 15:48:22,729][213771] Updated weights for policy 0, policy_version 79230 (0.0006) [2023-03-07 15:48:23,485][213771] Updated weights for policy 0, policy_version 79240 (0.0006) [2023-03-07 15:48:24,249][213771] Updated weights for policy 0, policy_version 79250 (0.0005) [2023-03-07 15:48:25,033][213771] Updated weights for policy 0, policy_version 79260 (0.0006) [2023-03-07 15:48:25,813][213771] Updated weights for policy 0, policy_version 79270 (0.0006) [2023-03-07 15:48:26,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 81175552. Throughput: 0: 13260.5. Samples: 81172884. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:48:26,106][213445] Avg episode reward: [(0, '4210.945')] [2023-03-07 15:48:26,573][213771] Updated weights for policy 0, policy_version 79280 (0.0008) [2023-03-07 15:48:27,341][213771] Updated weights for policy 0, policy_version 79290 (0.0007) [2023-03-07 15:48:28,124][213771] Updated weights for policy 0, policy_version 79300 (0.0007) [2023-03-07 15:48:28,900][213771] Updated weights for policy 0, policy_version 79310 (0.0006) [2023-03-07 15:48:29,657][213771] Updated weights for policy 0, policy_version 79320 (0.0005) [2023-03-07 15:48:30,417][213771] Updated weights for policy 0, policy_version 79330 (0.0006) [2023-03-07 15:48:31,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 81242112. Throughput: 0: 13255.8. Samples: 81212509. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:48:31,106][213445] Avg episode reward: [(0, '4112.296')] [2023-03-07 15:48:31,201][213771] Updated weights for policy 0, policy_version 79340 (0.0007) [2023-03-07 15:48:31,966][213771] Updated weights for policy 0, policy_version 79350 (0.0006) [2023-03-07 15:48:32,755][213771] Updated weights for policy 0, policy_version 79360 (0.0007) [2023-03-07 15:48:33,510][213771] Updated weights for policy 0, policy_version 79370 (0.0006) [2023-03-07 15:48:34,301][213771] Updated weights for policy 0, policy_version 79380 (0.0007) [2023-03-07 15:48:35,070][213771] Updated weights for policy 0, policy_version 79390 (0.0005) [2023-03-07 15:48:35,825][213771] Updated weights for policy 0, policy_version 79400 (0.0005) [2023-03-07 15:48:36,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13256.5). Total num frames: 81308672. Throughput: 0: 13258.5. Samples: 81292304. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:48:36,106][213445] Avg episode reward: [(0, '4196.680')] [2023-03-07 15:48:36,597][213771] Updated weights for policy 0, policy_version 79410 (0.0006) [2023-03-07 15:48:37,371][213771] Updated weights for policy 0, policy_version 79420 (0.0007) [2023-03-07 15:48:38,164][213771] Updated weights for policy 0, policy_version 79430 (0.0006) [2023-03-07 15:48:38,904][213771] Updated weights for policy 0, policy_version 79440 (0.0006) [2023-03-07 15:48:39,682][213771] Updated weights for policy 0, policy_version 79450 (0.0006) [2023-03-07 15:48:40,462][213771] Updated weights for policy 0, policy_version 79460 (0.0007) [2023-03-07 15:48:41,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13259.9). Total num frames: 81375232. Throughput: 0: 13256.8. Samples: 81371829. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:48:41,106][213445] Avg episode reward: [(0, '4165.779')] [2023-03-07 15:48:41,237][213771] Updated weights for policy 0, policy_version 79470 (0.0006) [2023-03-07 15:48:42,009][213771] Updated weights for policy 0, policy_version 79480 (0.0006) [2023-03-07 15:48:42,805][213771] Updated weights for policy 0, policy_version 79490 (0.0007) [2023-03-07 15:48:43,565][213771] Updated weights for policy 0, policy_version 79500 (0.0006) [2023-03-07 15:48:44,353][213771] Updated weights for policy 0, policy_version 79510 (0.0007) [2023-03-07 15:48:45,112][213771] Updated weights for policy 0, policy_version 79520 (0.0006) [2023-03-07 15:48:45,882][213771] Updated weights for policy 0, policy_version 79530 (0.0006) [2023-03-07 15:48:46,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13259.9). Total num frames: 81441792. Throughput: 0: 13249.1. Samples: 81411396. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:48:46,106][213445] Avg episode reward: [(0, '4222.420')] [2023-03-07 15:48:46,639][213771] Updated weights for policy 0, policy_version 79540 (0.0005) [2023-03-07 15:48:47,431][213771] Updated weights for policy 0, policy_version 79550 (0.0006) [2023-03-07 15:48:48,202][213771] Updated weights for policy 0, policy_version 79560 (0.0007) [2023-03-07 15:48:48,963][213771] Updated weights for policy 0, policy_version 79570 (0.0006) [2023-03-07 15:48:49,717][213771] Updated weights for policy 0, policy_version 79580 (0.0006) [2023-03-07 15:48:50,502][213771] Updated weights for policy 0, policy_version 79590 (0.0006) [2023-03-07 15:48:51,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13256.5). Total num frames: 81507328. Throughput: 0: 13270.9. Samples: 81491601. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:48:51,106][213445] Avg episode reward: [(0, '4190.420')] [2023-03-07 15:48:51,285][213771] Updated weights for policy 0, policy_version 79600 (0.0006) [2023-03-07 15:48:52,074][213771] Updated weights for policy 0, policy_version 79610 (0.0007) [2023-03-07 15:48:52,838][213771] Updated weights for policy 0, policy_version 79620 (0.0006) [2023-03-07 15:48:53,603][213771] Updated weights for policy 0, policy_version 79630 (0.0006) [2023-03-07 15:48:54,381][213771] Updated weights for policy 0, policy_version 79640 (0.0006) [2023-03-07 15:48:55,136][213771] Updated weights for policy 0, policy_version 79650 (0.0005) [2023-03-07 15:48:55,897][213771] Updated weights for policy 0, policy_version 79660 (0.0005) [2023-03-07 15:48:56,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13259.9). Total num frames: 81573888. Throughput: 0: 13262.3. Samples: 81570973. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:48:56,106][213445] Avg episode reward: [(0, '4154.001')] [2023-03-07 15:48:56,687][213771] Updated weights for policy 0, policy_version 79670 (0.0007) [2023-03-07 15:48:57,453][213771] Updated weights for policy 0, policy_version 79680 (0.0008) [2023-03-07 15:48:58,215][213771] Updated weights for policy 0, policy_version 79690 (0.0005) [2023-03-07 15:48:58,991][213771] Updated weights for policy 0, policy_version 79700 (0.0006) [2023-03-07 15:48:59,777][213771] Updated weights for policy 0, policy_version 79710 (0.0006) [2023-03-07 15:49:00,551][213771] Updated weights for policy 0, policy_version 79720 (0.0006) [2023-03-07 15:49:01,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13259.9). Total num frames: 81640448. Throughput: 0: 13266.4. Samples: 81610767. Policy #0 lag: (min: 0.0, avg: 1.4, max: 4.0) [2023-03-07 15:49:01,106][213445] Avg episode reward: [(0, '4102.313')] [2023-03-07 15:49:01,317][213771] Updated weights for policy 0, policy_version 79730 (0.0006) [2023-03-07 15:49:02,086][213771] Updated weights for policy 0, policy_version 79740 (0.0006) [2023-03-07 15:49:02,870][213771] Updated weights for policy 0, policy_version 79750 (0.0006) [2023-03-07 15:49:03,634][213771] Updated weights for policy 0, policy_version 79760 (0.0007) [2023-03-07 15:49:04,409][213771] Updated weights for policy 0, policy_version 79770 (0.0006) [2023-03-07 15:49:05,182][213771] Updated weights for policy 0, policy_version 79780 (0.0006) [2023-03-07 15:49:05,958][213771] Updated weights for policy 0, policy_version 79790 (0.0006) [2023-03-07 15:49:06,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13243.7, 300 sec: 13256.5). Total num frames: 81705984. Throughput: 0: 13265.9. Samples: 81690052. Policy #0 lag: (min: 0.0, avg: 1.4, max: 4.0) [2023-03-07 15:49:06,106][213445] Avg episode reward: [(0, '4097.759')] [2023-03-07 15:49:06,732][213771] Updated weights for policy 0, policy_version 79800 (0.0006) [2023-03-07 15:49:07,489][213771] Updated weights for policy 0, policy_version 79810 (0.0005) [2023-03-07 15:49:08,269][213771] Updated weights for policy 0, policy_version 79820 (0.0006) [2023-03-07 15:49:09,033][213771] Updated weights for policy 0, policy_version 79830 (0.0006) [2023-03-07 15:49:09,810][213771] Updated weights for policy 0, policy_version 79840 (0.0006) [2023-03-07 15:49:10,586][213771] Updated weights for policy 0, policy_version 79850 (0.0005) [2023-03-07 15:49:11,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13260.8, 300 sec: 13259.9). Total num frames: 81772544. Throughput: 0: 13262.1. Samples: 81769677. Policy #0 lag: (min: 0.0, avg: 1.4, max: 4.0) [2023-03-07 15:49:11,106][213445] Avg episode reward: [(0, '4108.888')] [2023-03-07 15:49:11,353][213771] Updated weights for policy 0, policy_version 79860 (0.0006) [2023-03-07 15:49:12,115][213771] Updated weights for policy 0, policy_version 79870 (0.0007) [2023-03-07 15:49:12,884][213771] Updated weights for policy 0, policy_version 79880 (0.0005) [2023-03-07 15:49:13,639][213771] Updated weights for policy 0, policy_version 79890 (0.0006) [2023-03-07 15:49:14,408][213771] Updated weights for policy 0, policy_version 79900 (0.0006) [2023-03-07 15:49:15,186][213771] Updated weights for policy 0, policy_version 79910 (0.0006) [2023-03-07 15:49:15,945][213771] Updated weights for policy 0, policy_version 79920 (0.0006) [2023-03-07 15:49:16,105][213445] Fps is (10 sec: 13414.4, 60 sec: 13277.9, 300 sec: 13263.4). Total num frames: 81840128. Throughput: 0: 13275.0. Samples: 81809884. Policy #0 lag: (min: 0.0, avg: 1.4, max: 4.0) [2023-03-07 15:49:16,106][213445] Avg episode reward: [(0, '3973.131')] [2023-03-07 15:49:16,719][213771] Updated weights for policy 0, policy_version 79930 (0.0007) [2023-03-07 15:49:17,496][213771] Updated weights for policy 0, policy_version 79940 (0.0006) [2023-03-07 15:49:18,249][213771] Updated weights for policy 0, policy_version 79950 (0.0006) [2023-03-07 15:49:19,017][213771] Updated weights for policy 0, policy_version 79960 (0.0006) [2023-03-07 15:49:19,794][213771] Updated weights for policy 0, policy_version 79970 (0.0006) [2023-03-07 15:49:20,562][213771] Updated weights for policy 0, policy_version 79980 (0.0006) [2023-03-07 15:49:21,105][213445] Fps is (10 sec: 13414.3, 60 sec: 13277.9, 300 sec: 13263.4). Total num frames: 81906688. Throughput: 0: 13277.4. Samples: 81889784. Policy #0 lag: (min: 0.0, avg: 1.4, max: 4.0) [2023-03-07 15:49:21,106][213445] Avg episode reward: [(0, '3920.891')] [2023-03-07 15:49:21,346][213771] Updated weights for policy 0, policy_version 79990 (0.0006) [2023-03-07 15:49:22,105][213771] Updated weights for policy 0, policy_version 80000 (0.0005) [2023-03-07 15:49:22,879][213771] Updated weights for policy 0, policy_version 80010 (0.0006) [2023-03-07 15:49:23,663][213771] Updated weights for policy 0, policy_version 80020 (0.0006) [2023-03-07 15:49:24,433][213771] Updated weights for policy 0, policy_version 80030 (0.0006) [2023-03-07 15:49:25,197][213771] Updated weights for policy 0, policy_version 80040 (0.0006) [2023-03-07 15:49:25,986][213771] Updated weights for policy 0, policy_version 80050 (0.0006) [2023-03-07 15:49:26,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13277.9, 300 sec: 13259.9). Total num frames: 81972224. Throughput: 0: 13280.9. Samples: 81969469. Policy #0 lag: (min: 0.0, avg: 1.4, max: 4.0) [2023-03-07 15:49:26,106][213445] Avg episode reward: [(0, '4068.933')] [2023-03-07 15:49:26,743][213771] Updated weights for policy 0, policy_version 80060 (0.0006) [2023-03-07 15:49:27,494][213771] Updated weights for policy 0, policy_version 80070 (0.0006) [2023-03-07 15:49:28,286][213771] Updated weights for policy 0, policy_version 80080 (0.0006) [2023-03-07 15:49:29,071][213771] Updated weights for policy 0, policy_version 80090 (0.0005) [2023-03-07 15:49:29,849][213771] Updated weights for policy 0, policy_version 80100 (0.0006) [2023-03-07 15:49:30,610][213771] Updated weights for policy 0, policy_version 80110 (0.0006) [2023-03-07 15:49:31,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13277.8, 300 sec: 13259.9). Total num frames: 82038784. Throughput: 0: 13284.6. Samples: 82009204. Policy #0 lag: (min: 0.0, avg: 1.4, max: 4.0) [2023-03-07 15:49:31,106][213445] Avg episode reward: [(0, '4137.587')] [2023-03-07 15:49:31,391][213771] Updated weights for policy 0, policy_version 80120 (0.0006) [2023-03-07 15:49:32,161][213771] Updated weights for policy 0, policy_version 80130 (0.0006) [2023-03-07 15:49:32,940][213771] Updated weights for policy 0, policy_version 80140 (0.0007) [2023-03-07 15:49:33,713][213771] Updated weights for policy 0, policy_version 80150 (0.0006) [2023-03-07 15:49:34,477][213771] Updated weights for policy 0, policy_version 80160 (0.0007) [2023-03-07 15:49:35,246][213771] Updated weights for policy 0, policy_version 80170 (0.0007) [2023-03-07 15:49:36,026][213771] Updated weights for policy 0, policy_version 80180 (0.0006) [2023-03-07 15:49:36,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13277.9, 300 sec: 13263.4). Total num frames: 82105344. Throughput: 0: 13265.6. Samples: 82088555. Policy #0 lag: (min: 0.0, avg: 1.4, max: 4.0) [2023-03-07 15:49:36,106][213445] Avg episode reward: [(0, '4177.339')] [2023-03-07 15:49:36,798][213771] Updated weights for policy 0, policy_version 80190 (0.0006) [2023-03-07 15:49:37,568][213771] Updated weights for policy 0, policy_version 80200 (0.0006) [2023-03-07 15:49:38,349][213771] Updated weights for policy 0, policy_version 80210 (0.0006) [2023-03-07 15:49:39,118][213771] Updated weights for policy 0, policy_version 80220 (0.0006) [2023-03-07 15:49:39,888][213771] Updated weights for policy 0, policy_version 80230 (0.0006) [2023-03-07 15:49:40,684][213771] Updated weights for policy 0, policy_version 80240 (0.0006) [2023-03-07 15:49:41,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13260.8, 300 sec: 13259.9). Total num frames: 82170880. Throughput: 0: 13264.9. Samples: 82167891. Policy #0 lag: (min: 0.0, avg: 1.4, max: 4.0) [2023-03-07 15:49:41,106][213445] Avg episode reward: [(0, '4179.917')] [2023-03-07 15:49:41,429][213771] Updated weights for policy 0, policy_version 80250 (0.0006) [2023-03-07 15:49:42,194][213771] Updated weights for policy 0, policy_version 80260 (0.0005) [2023-03-07 15:49:42,961][213771] Updated weights for policy 0, policy_version 80270 (0.0006) [2023-03-07 15:49:43,731][213771] Updated weights for policy 0, policy_version 80280 (0.0006) [2023-03-07 15:49:44,500][213771] Updated weights for policy 0, policy_version 80290 (0.0006) [2023-03-07 15:49:45,283][213771] Updated weights for policy 0, policy_version 80300 (0.0005) [2023-03-07 15:49:46,049][213771] Updated weights for policy 0, policy_version 80310 (0.0006) [2023-03-07 15:49:46,105][213445] Fps is (10 sec: 13209.3, 60 sec: 13260.8, 300 sec: 13263.4). Total num frames: 82237440. Throughput: 0: 13272.8. Samples: 82208044. Policy #0 lag: (min: 0.0, avg: 1.4, max: 4.0) [2023-03-07 15:49:46,106][213445] Avg episode reward: [(0, '4093.813')] [2023-03-07 15:49:46,817][213771] Updated weights for policy 0, policy_version 80320 (0.0005) [2023-03-07 15:49:47,591][213771] Updated weights for policy 0, policy_version 80330 (0.0007) [2023-03-07 15:49:48,366][213771] Updated weights for policy 0, policy_version 80340 (0.0007) [2023-03-07 15:49:49,141][213771] Updated weights for policy 0, policy_version 80350 (0.0006) [2023-03-07 15:49:49,900][213771] Updated weights for policy 0, policy_version 80360 (0.0006) [2023-03-07 15:49:50,660][213771] Updated weights for policy 0, policy_version 80370 (0.0006) [2023-03-07 15:49:51,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13277.9, 300 sec: 13263.4). Total num frames: 82304000. Throughput: 0: 13282.8. Samples: 82287777. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:49:51,117][213445] Avg episode reward: [(0, '4155.075')] [2023-03-07 15:49:51,442][213771] Updated weights for policy 0, policy_version 80380 (0.0006) [2023-03-07 15:49:52,199][213771] Updated weights for policy 0, policy_version 80390 (0.0006) [2023-03-07 15:49:52,994][213771] Updated weights for policy 0, policy_version 80400 (0.0007) [2023-03-07 15:49:53,774][213771] Updated weights for policy 0, policy_version 80410 (0.0006) [2023-03-07 15:49:54,531][213771] Updated weights for policy 0, policy_version 80420 (0.0006) [2023-03-07 15:49:55,290][213771] Updated weights for policy 0, policy_version 80430 (0.0006) [2023-03-07 15:49:56,070][213771] Updated weights for policy 0, policy_version 80440 (0.0006) [2023-03-07 15:49:56,105][213445] Fps is (10 sec: 13312.3, 60 sec: 13277.9, 300 sec: 13266.9). Total num frames: 82370560. Throughput: 0: 13284.4. Samples: 82367476. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:49:56,105][213445] Avg episode reward: [(0, '4167.114')] [2023-03-07 15:49:56,837][213771] Updated weights for policy 0, policy_version 80450 (0.0006) [2023-03-07 15:49:57,607][213771] Updated weights for policy 0, policy_version 80460 (0.0005) [2023-03-07 15:49:58,397][213771] Updated weights for policy 0, policy_version 80470 (0.0006) [2023-03-07 15:49:59,157][213771] Updated weights for policy 0, policy_version 80480 (0.0006) [2023-03-07 15:49:59,928][213771] Updated weights for policy 0, policy_version 80490 (0.0006) [2023-03-07 15:50:00,713][213771] Updated weights for policy 0, policy_version 80500 (0.0006) [2023-03-07 15:50:01,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13277.9, 300 sec: 13266.9). Total num frames: 82437120. Throughput: 0: 13275.1. Samples: 82407265. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:50:01,106][213445] Avg episode reward: [(0, '4194.161')] [2023-03-07 15:50:01,461][213771] Updated weights for policy 0, policy_version 80510 (0.0006) [2023-03-07 15:50:02,237][213771] Updated weights for policy 0, policy_version 80520 (0.0006) [2023-03-07 15:50:03,003][213771] Updated weights for policy 0, policy_version 80530 (0.0006) [2023-03-07 15:50:03,780][213771] Updated weights for policy 0, policy_version 80540 (0.0006) [2023-03-07 15:50:04,539][213771] Updated weights for policy 0, policy_version 80550 (0.0006) [2023-03-07 15:50:05,300][213771] Updated weights for policy 0, policy_version 80560 (0.0006) [2023-03-07 15:50:06,058][213771] Updated weights for policy 0, policy_version 80570 (0.0006) [2023-03-07 15:50:06,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13294.9, 300 sec: 13266.9). Total num frames: 82503680. Throughput: 0: 13275.9. Samples: 82487202. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:50:06,106][213445] Avg episode reward: [(0, '4237.013')] [2023-03-07 15:50:06,113][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000080571_82504704.pth... [2023-03-07 15:50:06,144][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000077462_79321088.pth [2023-03-07 15:50:06,835][213771] Updated weights for policy 0, policy_version 80580 (0.0007) [2023-03-07 15:50:07,587][213771] Updated weights for policy 0, policy_version 80590 (0.0006) [2023-03-07 15:50:08,379][213771] Updated weights for policy 0, policy_version 80600 (0.0006) [2023-03-07 15:50:09,129][213771] Updated weights for policy 0, policy_version 80610 (0.0006) [2023-03-07 15:50:09,904][213771] Updated weights for policy 0, policy_version 80620 (0.0006) [2023-03-07 15:50:10,685][213771] Updated weights for policy 0, policy_version 80630 (0.0005) [2023-03-07 15:50:11,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13294.9, 300 sec: 13266.9). Total num frames: 82570240. Throughput: 0: 13282.4. Samples: 82567178. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:50:11,106][213445] Avg episode reward: [(0, '4262.194')] [2023-03-07 15:50:11,440][213771] Updated weights for policy 0, policy_version 80640 (0.0007) [2023-03-07 15:50:12,238][213771] Updated weights for policy 0, policy_version 80650 (0.0005) [2023-03-07 15:50:12,986][213771] Updated weights for policy 0, policy_version 80660 (0.0006) [2023-03-07 15:50:13,757][213771] Updated weights for policy 0, policy_version 80670 (0.0007) [2023-03-07 15:50:14,548][213771] Updated weights for policy 0, policy_version 80680 (0.0006) [2023-03-07 15:50:15,301][213771] Updated weights for policy 0, policy_version 80690 (0.0006) [2023-03-07 15:50:16,090][213771] Updated weights for policy 0, policy_version 80700 (0.0006) [2023-03-07 15:50:16,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13277.8, 300 sec: 13266.9). Total num frames: 82636800. Throughput: 0: 13284.3. Samples: 82607000. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:50:16,106][213445] Avg episode reward: [(0, '4163.253')] [2023-03-07 15:50:16,846][213771] Updated weights for policy 0, policy_version 80710 (0.0006) [2023-03-07 15:50:17,612][213771] Updated weights for policy 0, policy_version 80720 (0.0007) [2023-03-07 15:50:18,401][213771] Updated weights for policy 0, policy_version 80730 (0.0007) [2023-03-07 15:50:19,162][213771] Updated weights for policy 0, policy_version 80740 (0.0007) [2023-03-07 15:50:19,948][213771] Updated weights for policy 0, policy_version 80750 (0.0006) [2023-03-07 15:50:20,708][213771] Updated weights for policy 0, policy_version 80760 (0.0006) [2023-03-07 15:50:21,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13277.9, 300 sec: 13266.9). Total num frames: 82703360. Throughput: 0: 13290.0. Samples: 82686605. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:50:21,106][213445] Avg episode reward: [(0, '4269.439')] [2023-03-07 15:50:21,467][213771] Updated weights for policy 0, policy_version 80770 (0.0005) [2023-03-07 15:50:22,253][213771] Updated weights for policy 0, policy_version 80780 (0.0005) [2023-03-07 15:50:23,022][213771] Updated weights for policy 0, policy_version 80790 (0.0007) [2023-03-07 15:50:23,806][213771] Updated weights for policy 0, policy_version 80800 (0.0006) [2023-03-07 15:50:24,578][213771] Updated weights for policy 0, policy_version 80810 (0.0006) [2023-03-07 15:50:25,335][213771] Updated weights for policy 0, policy_version 80820 (0.0006) [2023-03-07 15:50:26,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13277.8, 300 sec: 13266.9). Total num frames: 82768896. Throughput: 0: 13292.7. Samples: 82766063. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:50:26,106][213445] Avg episode reward: [(0, '4148.510')] [2023-03-07 15:50:26,135][213771] Updated weights for policy 0, policy_version 80830 (0.0006) [2023-03-07 15:50:26,904][213771] Updated weights for policy 0, policy_version 80840 (0.0006) [2023-03-07 15:50:27,673][213771] Updated weights for policy 0, policy_version 80850 (0.0005) [2023-03-07 15:50:28,454][213771] Updated weights for policy 0, policy_version 80860 (0.0007) [2023-03-07 15:50:29,238][213771] Updated weights for policy 0, policy_version 80870 (0.0006) [2023-03-07 15:50:30,002][213771] Updated weights for policy 0, policy_version 80880 (0.0006) [2023-03-07 15:50:30,760][213771] Updated weights for policy 0, policy_version 80890 (0.0007) [2023-03-07 15:50:31,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13277.9, 300 sec: 13266.9). Total num frames: 82835456. Throughput: 0: 13282.6. Samples: 82805757. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:50:31,106][213445] Avg episode reward: [(0, '4239.875')] [2023-03-07 15:50:31,555][213771] Updated weights for policy 0, policy_version 80900 (0.0008) [2023-03-07 15:50:32,313][213771] Updated weights for policy 0, policy_version 80910 (0.0007) [2023-03-07 15:50:33,099][213771] Updated weights for policy 0, policy_version 80920 (0.0007) [2023-03-07 15:50:33,874][213771] Updated weights for policy 0, policy_version 80930 (0.0006) [2023-03-07 15:50:34,668][213771] Updated weights for policy 0, policy_version 80940 (0.0005) [2023-03-07 15:50:35,425][213771] Updated weights for policy 0, policy_version 80950 (0.0008) [2023-03-07 15:50:36,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13259.9). Total num frames: 82900992. Throughput: 0: 13268.4. Samples: 82884855. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:50:36,106][213445] Avg episode reward: [(0, '4210.451')] [2023-03-07 15:50:36,206][213771] Updated weights for policy 0, policy_version 80960 (0.0005) [2023-03-07 15:50:36,994][213771] Updated weights for policy 0, policy_version 80970 (0.0006) [2023-03-07 15:50:37,757][213771] Updated weights for policy 0, policy_version 80980 (0.0006) [2023-03-07 15:50:38,523][213771] Updated weights for policy 0, policy_version 80990 (0.0006) [2023-03-07 15:50:39,300][213771] Updated weights for policy 0, policy_version 81000 (0.0006) [2023-03-07 15:50:40,063][213771] Updated weights for policy 0, policy_version 81010 (0.0006) [2023-03-07 15:50:40,833][213771] Updated weights for policy 0, policy_version 81020 (0.0006) [2023-03-07 15:50:41,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13277.9, 300 sec: 13263.4). Total num frames: 82967552. Throughput: 0: 13264.6. Samples: 82964383. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:50:41,106][213445] Avg episode reward: [(0, '4067.206')] [2023-03-07 15:50:41,609][213771] Updated weights for policy 0, policy_version 81030 (0.0006) [2023-03-07 15:50:42,381][213771] Updated weights for policy 0, policy_version 81040 (0.0005) [2023-03-07 15:50:43,157][213771] Updated weights for policy 0, policy_version 81050 (0.0006) [2023-03-07 15:50:43,930][213771] Updated weights for policy 0, policy_version 81060 (0.0006) [2023-03-07 15:50:44,703][213771] Updated weights for policy 0, policy_version 81070 (0.0006) [2023-03-07 15:50:45,469][213771] Updated weights for policy 0, policy_version 81080 (0.0006) [2023-03-07 15:50:46,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13277.9, 300 sec: 13263.4). Total num frames: 83034112. Throughput: 0: 13262.9. Samples: 83004094. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:50:46,106][213445] Avg episode reward: [(0, '3991.471')] [2023-03-07 15:50:46,247][213771] Updated weights for policy 0, policy_version 81090 (0.0006) [2023-03-07 15:50:47,026][213771] Updated weights for policy 0, policy_version 81100 (0.0006) [2023-03-07 15:50:47,800][213771] Updated weights for policy 0, policy_version 81110 (0.0006) [2023-03-07 15:50:48,569][213771] Updated weights for policy 0, policy_version 81120 (0.0006) [2023-03-07 15:50:49,333][213771] Updated weights for policy 0, policy_version 81130 (0.0007) [2023-03-07 15:50:50,112][213771] Updated weights for policy 0, policy_version 81140 (0.0006) [2023-03-07 15:50:50,894][213771] Updated weights for policy 0, policy_version 81150 (0.0006) [2023-03-07 15:50:51,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13259.9). Total num frames: 83099648. Throughput: 0: 13253.7. Samples: 83083621. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:50:51,106][213445] Avg episode reward: [(0, '3843.543')] [2023-03-07 15:50:51,675][213771] Updated weights for policy 0, policy_version 81160 (0.0006) [2023-03-07 15:50:52,434][213771] Updated weights for policy 0, policy_version 81170 (0.0006) [2023-03-07 15:50:53,215][213771] Updated weights for policy 0, policy_version 81180 (0.0007) [2023-03-07 15:50:53,981][213771] Updated weights for policy 0, policy_version 81190 (0.0005) [2023-03-07 15:50:54,769][213771] Updated weights for policy 0, policy_version 81200 (0.0007) [2023-03-07 15:50:55,539][213771] Updated weights for policy 0, policy_version 81210 (0.0006) [2023-03-07 15:50:56,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13259.9). Total num frames: 83166208. Throughput: 0: 13240.8. Samples: 83163013. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:50:56,106][213445] Avg episode reward: [(0, '4028.509')] [2023-03-07 15:50:56,303][213771] Updated weights for policy 0, policy_version 81220 (0.0006) [2023-03-07 15:50:57,061][213771] Updated weights for policy 0, policy_version 81230 (0.0006) [2023-03-07 15:50:57,843][213771] Updated weights for policy 0, policy_version 81240 (0.0006) [2023-03-07 15:50:58,602][213771] Updated weights for policy 0, policy_version 81250 (0.0006) [2023-03-07 15:50:59,373][213771] Updated weights for policy 0, policy_version 81260 (0.0007) [2023-03-07 15:51:00,135][213771] Updated weights for policy 0, policy_version 81270 (0.0006) [2023-03-07 15:51:00,925][213771] Updated weights for policy 0, policy_version 81280 (0.0005) [2023-03-07 15:51:01,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13260.8, 300 sec: 13263.4). Total num frames: 83232768. Throughput: 0: 13244.8. Samples: 83203015. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:51:01,106][213445] Avg episode reward: [(0, '4128.601')] [2023-03-07 15:51:01,707][213771] Updated weights for policy 0, policy_version 81290 (0.0007) [2023-03-07 15:51:02,476][213771] Updated weights for policy 0, policy_version 81300 (0.0007) [2023-03-07 15:51:03,239][213771] Updated weights for policy 0, policy_version 81310 (0.0007) [2023-03-07 15:51:04,011][213771] Updated weights for policy 0, policy_version 81320 (0.0007) [2023-03-07 15:51:04,774][213771] Updated weights for policy 0, policy_version 81330 (0.0006) [2023-03-07 15:51:05,542][213771] Updated weights for policy 0, policy_version 81340 (0.0006) [2023-03-07 15:51:06,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13263.4). Total num frames: 83299328. Throughput: 0: 13243.2. Samples: 83282548. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:51:06,106][213445] Avg episode reward: [(0, '3613.809')] [2023-03-07 15:51:06,327][213771] Updated weights for policy 0, policy_version 81350 (0.0006) [2023-03-07 15:51:07,104][213771] Updated weights for policy 0, policy_version 81360 (0.0005) [2023-03-07 15:51:07,883][213771] Updated weights for policy 0, policy_version 81370 (0.0006) [2023-03-07 15:51:08,653][213771] Updated weights for policy 0, policy_version 81380 (0.0006) [2023-03-07 15:51:09,427][213771] Updated weights for policy 0, policy_version 81390 (0.0006) [2023-03-07 15:51:10,225][213771] Updated weights for policy 0, policy_version 81400 (0.0006) [2023-03-07 15:51:10,985][213771] Updated weights for policy 0, policy_version 81410 (0.0006) [2023-03-07 15:51:11,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13259.9). Total num frames: 83364864. Throughput: 0: 13238.8. Samples: 83361807. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:51:11,106][213445] Avg episode reward: [(0, '3914.452')] [2023-03-07 15:51:11,753][213771] Updated weights for policy 0, policy_version 81420 (0.0006) [2023-03-07 15:51:12,534][213771] Updated weights for policy 0, policy_version 81430 (0.0005) [2023-03-07 15:51:13,307][213771] Updated weights for policy 0, policy_version 81440 (0.0006) [2023-03-07 15:51:14,073][213771] Updated weights for policy 0, policy_version 81450 (0.0007) [2023-03-07 15:51:14,854][213771] Updated weights for policy 0, policy_version 81460 (0.0005) [2023-03-07 15:51:15,630][213771] Updated weights for policy 0, policy_version 81470 (0.0006) [2023-03-07 15:51:16,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.8, 300 sec: 13263.4). Total num frames: 83431424. Throughput: 0: 13243.3. Samples: 83401707. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:51:16,106][213445] Avg episode reward: [(0, '3948.633')] [2023-03-07 15:51:16,397][213771] Updated weights for policy 0, policy_version 81480 (0.0006) [2023-03-07 15:51:17,186][213771] Updated weights for policy 0, policy_version 81490 (0.0006) [2023-03-07 15:51:17,953][213771] Updated weights for policy 0, policy_version 81500 (0.0006) [2023-03-07 15:51:18,734][213771] Updated weights for policy 0, policy_version 81510 (0.0006) [2023-03-07 15:51:19,511][213771] Updated weights for policy 0, policy_version 81520 (0.0006) [2023-03-07 15:51:20,276][213771] Updated weights for policy 0, policy_version 81530 (0.0006) [2023-03-07 15:51:21,057][213771] Updated weights for policy 0, policy_version 81540 (0.0006) [2023-03-07 15:51:21,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13226.7, 300 sec: 13259.9). Total num frames: 83496960. Throughput: 0: 13244.5. Samples: 83480856. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:51:21,106][213445] Avg episode reward: [(0, '3919.867')] [2023-03-07 15:51:21,821][213771] Updated weights for policy 0, policy_version 81550 (0.0006) [2023-03-07 15:51:22,592][213771] Updated weights for policy 0, policy_version 81560 (0.0006) [2023-03-07 15:51:23,357][213771] Updated weights for policy 0, policy_version 81570 (0.0005) [2023-03-07 15:51:24,132][213771] Updated weights for policy 0, policy_version 81580 (0.0006) [2023-03-07 15:51:24,902][213771] Updated weights for policy 0, policy_version 81590 (0.0006) [2023-03-07 15:51:25,673][213771] Updated weights for policy 0, policy_version 81600 (0.0006) [2023-03-07 15:51:26,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.8, 300 sec: 13263.4). Total num frames: 83563520. Throughput: 0: 13248.0. Samples: 83560543. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:51:26,105][213445] Avg episode reward: [(0, '3947.353')] [2023-03-07 15:51:26,453][213771] Updated weights for policy 0, policy_version 81610 (0.0005) [2023-03-07 15:51:27,209][213771] Updated weights for policy 0, policy_version 81620 (0.0006) [2023-03-07 15:51:28,010][213771] Updated weights for policy 0, policy_version 81630 (0.0006) [2023-03-07 15:51:28,785][213771] Updated weights for policy 0, policy_version 81640 (0.0006) [2023-03-07 15:51:29,549][213771] Updated weights for policy 0, policy_version 81650 (0.0006) [2023-03-07 15:51:30,329][213771] Updated weights for policy 0, policy_version 81660 (0.0006) [2023-03-07 15:51:31,101][213771] Updated weights for policy 0, policy_version 81670 (0.0006) [2023-03-07 15:51:31,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13243.7, 300 sec: 13263.4). Total num frames: 83630080. Throughput: 0: 13244.8. Samples: 83600112. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:51:31,106][213445] Avg episode reward: [(0, '3967.960')] [2023-03-07 15:51:31,892][213771] Updated weights for policy 0, policy_version 81680 (0.0006) [2023-03-07 15:51:32,657][213771] Updated weights for policy 0, policy_version 81690 (0.0006) [2023-03-07 15:51:33,422][213771] Updated weights for policy 0, policy_version 81700 (0.0006) [2023-03-07 15:51:34,199][213771] Updated weights for policy 0, policy_version 81710 (0.0006) [2023-03-07 15:51:34,966][213771] Updated weights for policy 0, policy_version 81720 (0.0006) [2023-03-07 15:51:35,729][213771] Updated weights for policy 0, policy_version 81730 (0.0006) [2023-03-07 15:51:36,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13259.9). Total num frames: 83695616. Throughput: 0: 13242.5. Samples: 83679532. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:51:36,106][213445] Avg episode reward: [(0, '3856.760')] [2023-03-07 15:51:36,503][213771] Updated weights for policy 0, policy_version 81740 (0.0006) [2023-03-07 15:51:37,286][213771] Updated weights for policy 0, policy_version 81750 (0.0006) [2023-03-07 15:51:38,053][213771] Updated weights for policy 0, policy_version 81760 (0.0007) [2023-03-07 15:51:38,815][213771] Updated weights for policy 0, policy_version 81770 (0.0006) [2023-03-07 15:51:39,601][213771] Updated weights for policy 0, policy_version 81780 (0.0007) [2023-03-07 15:51:40,375][213771] Updated weights for policy 0, policy_version 81790 (0.0006) [2023-03-07 15:51:41,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13259.9). Total num frames: 83762176. Throughput: 0: 13245.7. Samples: 83759071. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:51:41,106][213445] Avg episode reward: [(0, '3797.748')] [2023-03-07 15:51:41,146][213771] Updated weights for policy 0, policy_version 81800 (0.0005) [2023-03-07 15:51:41,910][213771] Updated weights for policy 0, policy_version 81810 (0.0006) [2023-03-07 15:51:42,681][213771] Updated weights for policy 0, policy_version 81820 (0.0006) [2023-03-07 15:51:43,441][213771] Updated weights for policy 0, policy_version 81830 (0.0006) [2023-03-07 15:51:44,215][213771] Updated weights for policy 0, policy_version 81840 (0.0007) [2023-03-07 15:51:44,990][213771] Updated weights for policy 0, policy_version 81850 (0.0006) [2023-03-07 15:51:45,778][213771] Updated weights for policy 0, policy_version 81860 (0.0006) [2023-03-07 15:51:46,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13243.7, 300 sec: 13259.9). Total num frames: 83828736. Throughput: 0: 13245.5. Samples: 83799063. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:51:46,106][213445] Avg episode reward: [(0, '3781.957')] [2023-03-07 15:51:46,550][213771] Updated weights for policy 0, policy_version 81870 (0.0007) [2023-03-07 15:51:47,334][213771] Updated weights for policy 0, policy_version 81880 (0.0006) [2023-03-07 15:51:48,118][213771] Updated weights for policy 0, policy_version 81890 (0.0007) [2023-03-07 15:51:48,888][213771] Updated weights for policy 0, policy_version 81900 (0.0007) [2023-03-07 15:51:49,654][213771] Updated weights for policy 0, policy_version 81910 (0.0006) [2023-03-07 15:51:50,432][213771] Updated weights for policy 0, policy_version 81920 (0.0006) [2023-03-07 15:51:51,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13243.8, 300 sec: 13256.5). Total num frames: 83894272. Throughput: 0: 13237.7. Samples: 83878243. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:51:51,105][213445] Avg episode reward: [(0, '3909.140')] [2023-03-07 15:51:51,215][213771] Updated weights for policy 0, policy_version 81930 (0.0006) [2023-03-07 15:51:51,993][213771] Updated weights for policy 0, policy_version 81940 (0.0007) [2023-03-07 15:51:52,766][213771] Updated weights for policy 0, policy_version 81950 (0.0007) [2023-03-07 15:51:53,552][213771] Updated weights for policy 0, policy_version 81960 (0.0006) [2023-03-07 15:51:54,330][213771] Updated weights for policy 0, policy_version 81970 (0.0006) [2023-03-07 15:51:55,090][213771] Updated weights for policy 0, policy_version 81980 (0.0007) [2023-03-07 15:51:55,882][213771] Updated weights for policy 0, policy_version 81990 (0.0006) [2023-03-07 15:51:56,105][213445] Fps is (10 sec: 13107.1, 60 sec: 13226.7, 300 sec: 13256.5). Total num frames: 83959808. Throughput: 0: 13227.5. Samples: 83957044. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:51:56,106][213445] Avg episode reward: [(0, '3833.645')] [2023-03-07 15:51:56,660][213771] Updated weights for policy 0, policy_version 82000 (0.0006) [2023-03-07 15:51:57,439][213771] Updated weights for policy 0, policy_version 82010 (0.0006) [2023-03-07 15:51:58,220][213771] Updated weights for policy 0, policy_version 82020 (0.0006) [2023-03-07 15:51:58,998][213771] Updated weights for policy 0, policy_version 82030 (0.0006) [2023-03-07 15:51:59,769][213771] Updated weights for policy 0, policy_version 82040 (0.0008) [2023-03-07 15:52:00,554][213771] Updated weights for policy 0, policy_version 82050 (0.0005) [2023-03-07 15:52:01,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13226.6, 300 sec: 13256.5). Total num frames: 84026368. Throughput: 0: 13218.9. Samples: 83996558. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:52:01,106][213445] Avg episode reward: [(0, '3867.206')] [2023-03-07 15:52:01,327][213771] Updated weights for policy 0, policy_version 82060 (0.0005) [2023-03-07 15:52:02,109][213771] Updated weights for policy 0, policy_version 82070 (0.0007) [2023-03-07 15:52:02,897][213771] Updated weights for policy 0, policy_version 82080 (0.0007) [2023-03-07 15:52:03,665][213771] Updated weights for policy 0, policy_version 82090 (0.0006) [2023-03-07 15:52:04,420][213771] Updated weights for policy 0, policy_version 82100 (0.0006) [2023-03-07 15:52:05,191][213771] Updated weights for policy 0, policy_version 82110 (0.0006) [2023-03-07 15:52:05,968][213771] Updated weights for policy 0, policy_version 82120 (0.0007) [2023-03-07 15:52:06,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13209.6, 300 sec: 13253.0). Total num frames: 84091904. Throughput: 0: 13224.2. Samples: 84075943. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:52:06,106][213445] Avg episode reward: [(0, '3990.897')] [2023-03-07 15:52:06,121][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000082122_84092928.pth... [2023-03-07 15:52:06,152][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000079015_80911360.pth [2023-03-07 15:52:06,736][213771] Updated weights for policy 0, policy_version 82130 (0.0006) [2023-03-07 15:52:07,501][213771] Updated weights for policy 0, policy_version 82140 (0.0006) [2023-03-07 15:52:08,295][213771] Updated weights for policy 0, policy_version 82150 (0.0006) [2023-03-07 15:52:09,058][213771] Updated weights for policy 0, policy_version 82160 (0.0007) [2023-03-07 15:52:09,849][213771] Updated weights for policy 0, policy_version 82170 (0.0006) [2023-03-07 15:52:10,621][213771] Updated weights for policy 0, policy_version 82180 (0.0006) [2023-03-07 15:52:11,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13226.7, 300 sec: 13253.0). Total num frames: 84158464. Throughput: 0: 13210.1. Samples: 84154998. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:52:11,106][213445] Avg episode reward: [(0, '4029.685')] [2023-03-07 15:52:11,421][213771] Updated weights for policy 0, policy_version 82190 (0.0006) [2023-03-07 15:52:12,184][213771] Updated weights for policy 0, policy_version 82200 (0.0005) [2023-03-07 15:52:12,968][213771] Updated weights for policy 0, policy_version 82210 (0.0006) [2023-03-07 15:52:13,739][213771] Updated weights for policy 0, policy_version 82220 (0.0006) [2023-03-07 15:52:14,516][213771] Updated weights for policy 0, policy_version 82230 (0.0006) [2023-03-07 15:52:15,281][213771] Updated weights for policy 0, policy_version 82240 (0.0006) [2023-03-07 15:52:16,059][213771] Updated weights for policy 0, policy_version 82250 (0.0006) [2023-03-07 15:52:16,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13209.6, 300 sec: 13253.0). Total num frames: 84224000. Throughput: 0: 13209.3. Samples: 84194532. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:52:16,106][213445] Avg episode reward: [(0, '4039.631')] [2023-03-07 15:52:16,843][213771] Updated weights for policy 0, policy_version 82260 (0.0006) [2023-03-07 15:52:17,615][213771] Updated weights for policy 0, policy_version 82270 (0.0006) [2023-03-07 15:52:18,395][213771] Updated weights for policy 0, policy_version 82280 (0.0007) [2023-03-07 15:52:19,181][213771] Updated weights for policy 0, policy_version 82290 (0.0006) [2023-03-07 15:52:19,945][213771] Updated weights for policy 0, policy_version 82300 (0.0006) [2023-03-07 15:52:20,720][213771] Updated weights for policy 0, policy_version 82310 (0.0006) [2023-03-07 15:52:21,105][213445] Fps is (10 sec: 13107.1, 60 sec: 13209.6, 300 sec: 13253.0). Total num frames: 84289536. Throughput: 0: 13205.7. Samples: 84273791. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:52:21,106][213445] Avg episode reward: [(0, '3970.377')] [2023-03-07 15:52:21,486][213771] Updated weights for policy 0, policy_version 82320 (0.0005) [2023-03-07 15:52:22,262][213771] Updated weights for policy 0, policy_version 82330 (0.0006) [2023-03-07 15:52:23,037][213771] Updated weights for policy 0, policy_version 82340 (0.0006) [2023-03-07 15:52:23,819][213771] Updated weights for policy 0, policy_version 82350 (0.0006) [2023-03-07 15:52:24,596][213771] Updated weights for policy 0, policy_version 82360 (0.0006) [2023-03-07 15:52:25,366][213771] Updated weights for policy 0, policy_version 82370 (0.0007) [2023-03-07 15:52:26,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13209.6, 300 sec: 13253.0). Total num frames: 84356096. Throughput: 0: 13197.9. Samples: 84352974. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:52:26,106][213445] Avg episode reward: [(0, '4086.547')] [2023-03-07 15:52:26,145][213771] Updated weights for policy 0, policy_version 82380 (0.0006) [2023-03-07 15:52:26,910][213771] Updated weights for policy 0, policy_version 82390 (0.0006) [2023-03-07 15:52:27,718][213771] Updated weights for policy 0, policy_version 82400 (0.0006) [2023-03-07 15:52:28,478][213771] Updated weights for policy 0, policy_version 82410 (0.0007) [2023-03-07 15:52:29,248][213771] Updated weights for policy 0, policy_version 82420 (0.0006) [2023-03-07 15:52:30,025][213771] Updated weights for policy 0, policy_version 82430 (0.0007) [2023-03-07 15:52:30,784][213771] Updated weights for policy 0, policy_version 82440 (0.0005) [2023-03-07 15:52:31,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13209.6, 300 sec: 13253.0). Total num frames: 84422656. Throughput: 0: 13191.5. Samples: 84392680. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:52:31,106][213445] Avg episode reward: [(0, '3953.765')] [2023-03-07 15:52:31,566][213771] Updated weights for policy 0, policy_version 82450 (0.0006) [2023-03-07 15:52:32,341][213771] Updated weights for policy 0, policy_version 82460 (0.0006) [2023-03-07 15:52:33,112][213771] Updated weights for policy 0, policy_version 82470 (0.0006) [2023-03-07 15:52:33,895][213771] Updated weights for policy 0, policy_version 82480 (0.0006) [2023-03-07 15:52:34,669][213771] Updated weights for policy 0, policy_version 82490 (0.0006) [2023-03-07 15:52:35,430][213771] Updated weights for policy 0, policy_version 82500 (0.0005) [2023-03-07 15:52:36,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13209.6, 300 sec: 13249.5). Total num frames: 84488192. Throughput: 0: 13194.8. Samples: 84472009. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:52:36,106][213445] Avg episode reward: [(0, '4055.423')] [2023-03-07 15:52:36,218][213771] Updated weights for policy 0, policy_version 82510 (0.0005) [2023-03-07 15:52:36,970][213771] Updated weights for policy 0, policy_version 82520 (0.0006) [2023-03-07 15:52:37,756][213771] Updated weights for policy 0, policy_version 82530 (0.0007) [2023-03-07 15:52:38,545][213771] Updated weights for policy 0, policy_version 82540 (0.0007) [2023-03-07 15:52:39,318][213771] Updated weights for policy 0, policy_version 82550 (0.0007) [2023-03-07 15:52:40,099][213771] Updated weights for policy 0, policy_version 82560 (0.0006) [2023-03-07 15:52:40,858][213771] Updated weights for policy 0, policy_version 82570 (0.0006) [2023-03-07 15:52:41,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13209.6, 300 sec: 13249.5). Total num frames: 84554752. Throughput: 0: 13206.4. Samples: 84551331. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:52:41,106][213445] Avg episode reward: [(0, '4087.378')] [2023-03-07 15:52:41,643][213771] Updated weights for policy 0, policy_version 82580 (0.0006) [2023-03-07 15:52:42,415][213771] Updated weights for policy 0, policy_version 82590 (0.0006) [2023-03-07 15:52:43,193][213771] Updated weights for policy 0, policy_version 82600 (0.0006) [2023-03-07 15:52:43,941][213771] Updated weights for policy 0, policy_version 82610 (0.0006) [2023-03-07 15:52:44,729][213771] Updated weights for policy 0, policy_version 82620 (0.0006) [2023-03-07 15:52:45,492][213771] Updated weights for policy 0, policy_version 82630 (0.0005) [2023-03-07 15:52:46,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13246.0). Total num frames: 84620288. Throughput: 0: 13213.2. Samples: 84591149. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:52:46,106][213445] Avg episode reward: [(0, '4069.001')] [2023-03-07 15:52:46,270][213771] Updated weights for policy 0, policy_version 82640 (0.0007) [2023-03-07 15:52:47,054][213771] Updated weights for policy 0, policy_version 82650 (0.0006) [2023-03-07 15:52:47,835][213771] Updated weights for policy 0, policy_version 82660 (0.0006) [2023-03-07 15:52:48,601][213771] Updated weights for policy 0, policy_version 82670 (0.0006) [2023-03-07 15:52:49,389][213771] Updated weights for policy 0, policy_version 82680 (0.0007) [2023-03-07 15:52:50,167][213771] Updated weights for policy 0, policy_version 82690 (0.0006) [2023-03-07 15:52:50,925][213771] Updated weights for policy 0, policy_version 82700 (0.0006) [2023-03-07 15:52:51,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13209.6, 300 sec: 13249.5). Total num frames: 84686848. Throughput: 0: 13204.0. Samples: 84670124. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:52:51,106][213445] Avg episode reward: [(0, '3971.395')] [2023-03-07 15:52:51,711][213771] Updated weights for policy 0, policy_version 82710 (0.0006) [2023-03-07 15:52:52,496][213771] Updated weights for policy 0, policy_version 82720 (0.0006) [2023-03-07 15:52:53,266][213771] Updated weights for policy 0, policy_version 82730 (0.0005) [2023-03-07 15:52:54,047][213771] Updated weights for policy 0, policy_version 82740 (0.0006) [2023-03-07 15:52:54,806][213771] Updated weights for policy 0, policy_version 82750 (0.0006) [2023-03-07 15:52:55,599][213771] Updated weights for policy 0, policy_version 82760 (0.0006) [2023-03-07 15:52:56,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13209.6, 300 sec: 13246.1). Total num frames: 84752384. Throughput: 0: 13210.7. Samples: 84749482. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:52:56,116][213445] Avg episode reward: [(0, '4039.730')] [2023-03-07 15:52:56,362][213771] Updated weights for policy 0, policy_version 82770 (0.0006) [2023-03-07 15:52:57,136][213771] Updated weights for policy 0, policy_version 82780 (0.0006) [2023-03-07 15:52:57,911][213771] Updated weights for policy 0, policy_version 82790 (0.0005) [2023-03-07 15:52:58,662][213771] Updated weights for policy 0, policy_version 82800 (0.0007) [2023-03-07 15:52:59,438][213771] Updated weights for policy 0, policy_version 82810 (0.0006) [2023-03-07 15:53:00,213][213771] Updated weights for policy 0, policy_version 82820 (0.0007) [2023-03-07 15:53:00,969][213771] Updated weights for policy 0, policy_version 82830 (0.0005) [2023-03-07 15:53:01,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13209.6, 300 sec: 13246.0). Total num frames: 84818944. Throughput: 0: 13220.4. Samples: 84789450. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:53:01,116][213445] Avg episode reward: [(0, '4100.918')] [2023-03-07 15:53:01,740][213771] Updated weights for policy 0, policy_version 82840 (0.0006) [2023-03-07 15:53:02,511][213771] Updated weights for policy 0, policy_version 82850 (0.0006) [2023-03-07 15:53:03,272][213771] Updated weights for policy 0, policy_version 82860 (0.0006) [2023-03-07 15:53:04,039][213771] Updated weights for policy 0, policy_version 82870 (0.0006) [2023-03-07 15:53:04,822][213771] Updated weights for policy 0, policy_version 82880 (0.0006) [2023-03-07 15:53:05,593][213771] Updated weights for policy 0, policy_version 82890 (0.0006) [2023-03-07 15:53:06,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13226.7, 300 sec: 13249.5). Total num frames: 84885504. Throughput: 0: 13234.6. Samples: 84869347. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:53:06,116][213445] Avg episode reward: [(0, '4051.372')] [2023-03-07 15:53:06,366][213771] Updated weights for policy 0, policy_version 82900 (0.0006) [2023-03-07 15:53:07,125][213771] Updated weights for policy 0, policy_version 82910 (0.0007) [2023-03-07 15:53:07,887][213771] Updated weights for policy 0, policy_version 82920 (0.0006) [2023-03-07 15:53:08,651][213771] Updated weights for policy 0, policy_version 82930 (0.0006) [2023-03-07 15:53:09,432][213771] Updated weights for policy 0, policy_version 82940 (0.0007) [2023-03-07 15:53:10,202][213771] Updated weights for policy 0, policy_version 82950 (0.0006) [2023-03-07 15:53:10,977][213771] Updated weights for policy 0, policy_version 82960 (0.0007) [2023-03-07 15:53:11,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13226.6, 300 sec: 13249.5). Total num frames: 84952064. Throughput: 0: 13250.3. Samples: 84949237. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:53:11,117][213445] Avg episode reward: [(0, '4021.423')] [2023-03-07 15:53:11,744][213771] Updated weights for policy 0, policy_version 82970 (0.0005) [2023-03-07 15:53:12,510][213771] Updated weights for policy 0, policy_version 82980 (0.0006) [2023-03-07 15:53:13,282][213771] Updated weights for policy 0, policy_version 82990 (0.0006) [2023-03-07 15:53:14,059][213771] Updated weights for policy 0, policy_version 83000 (0.0007) [2023-03-07 15:53:14,833][213771] Updated weights for policy 0, policy_version 83010 (0.0006) [2023-03-07 15:53:15,608][213771] Updated weights for policy 0, policy_version 83020 (0.0006) [2023-03-07 15:53:16,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13243.8, 300 sec: 13249.5). Total num frames: 85018624. Throughput: 0: 13253.0. Samples: 84989064. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:53:16,116][213445] Avg episode reward: [(0, '4091.982')] [2023-03-07 15:53:16,380][213771] Updated weights for policy 0, policy_version 83030 (0.0006) [2023-03-07 15:53:17,139][213771] Updated weights for policy 0, policy_version 83040 (0.0005) [2023-03-07 15:53:17,920][213771] Updated weights for policy 0, policy_version 83050 (0.0007) [2023-03-07 15:53:18,707][213771] Updated weights for policy 0, policy_version 83060 (0.0006) [2023-03-07 15:53:19,472][213771] Updated weights for policy 0, policy_version 83070 (0.0007) [2023-03-07 15:53:20,250][213771] Updated weights for policy 0, policy_version 83080 (0.0006) [2023-03-07 15:53:21,025][213771] Updated weights for policy 0, policy_version 83090 (0.0006) [2023-03-07 15:53:21,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 85085184. Throughput: 0: 13256.8. Samples: 85068567. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:53:21,116][213445] Avg episode reward: [(0, '4080.769')] [2023-03-07 15:53:21,806][213771] Updated weights for policy 0, policy_version 83100 (0.0006) [2023-03-07 15:53:22,577][213771] Updated weights for policy 0, policy_version 83110 (0.0007) [2023-03-07 15:53:23,352][213771] Updated weights for policy 0, policy_version 83120 (0.0006) [2023-03-07 15:53:24,118][213771] Updated weights for policy 0, policy_version 83130 (0.0006) [2023-03-07 15:53:24,901][213771] Updated weights for policy 0, policy_version 83140 (0.0006) [2023-03-07 15:53:25,690][213771] Updated weights for policy 0, policy_version 83150 (0.0006) [2023-03-07 15:53:26,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 85150720. Throughput: 0: 13249.5. Samples: 85147559. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:53:26,106][213445] Avg episode reward: [(0, '4031.369')] [2023-03-07 15:53:26,481][213771] Updated weights for policy 0, policy_version 83160 (0.0006) [2023-03-07 15:53:27,261][213771] Updated weights for policy 0, policy_version 83170 (0.0006) [2023-03-07 15:53:28,031][213771] Updated weights for policy 0, policy_version 83180 (0.0006) [2023-03-07 15:53:28,799][213771] Updated weights for policy 0, policy_version 83190 (0.0006) [2023-03-07 15:53:29,584][213771] Updated weights for policy 0, policy_version 83200 (0.0005) [2023-03-07 15:53:30,352][213771] Updated weights for policy 0, policy_version 83210 (0.0006) [2023-03-07 15:53:31,097][213771] Updated weights for policy 0, policy_version 83220 (0.0006) [2023-03-07 15:53:31,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 85217280. Throughput: 0: 13241.1. Samples: 85186998. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:53:31,106][213445] Avg episode reward: [(0, '4002.245')] [2023-03-07 15:53:31,875][213771] Updated weights for policy 0, policy_version 83230 (0.0007) [2023-03-07 15:53:32,643][213771] Updated weights for policy 0, policy_version 83240 (0.0008) [2023-03-07 15:53:33,398][213771] Updated weights for policy 0, policy_version 83250 (0.0005) [2023-03-07 15:53:34,184][213771] Updated weights for policy 0, policy_version 83260 (0.0007) [2023-03-07 15:53:34,947][213771] Updated weights for policy 0, policy_version 83270 (0.0006) [2023-03-07 15:53:35,714][213771] Updated weights for policy 0, policy_version 83280 (0.0005) [2023-03-07 15:53:36,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 85283840. Throughput: 0: 13263.1. Samples: 85266964. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:53:36,106][213445] Avg episode reward: [(0, '4094.129')] [2023-03-07 15:53:36,493][213771] Updated weights for policy 0, policy_version 83290 (0.0005) [2023-03-07 15:53:37,249][213771] Updated weights for policy 0, policy_version 83300 (0.0006) [2023-03-07 15:53:38,025][213771] Updated weights for policy 0, policy_version 83310 (0.0005) [2023-03-07 15:53:38,804][213771] Updated weights for policy 0, policy_version 83320 (0.0006) [2023-03-07 15:53:39,570][213771] Updated weights for policy 0, policy_version 83330 (0.0005) [2023-03-07 15:53:40,345][213771] Updated weights for policy 0, policy_version 83340 (0.0006) [2023-03-07 15:53:41,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 85349376. Throughput: 0: 13270.8. Samples: 85346670. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:53:41,106][213445] Avg episode reward: [(0, '4071.442')] [2023-03-07 15:53:41,127][213771] Updated weights for policy 0, policy_version 83350 (0.0006) [2023-03-07 15:53:41,902][213771] Updated weights for policy 0, policy_version 83360 (0.0007) [2023-03-07 15:53:42,688][213771] Updated weights for policy 0, policy_version 83370 (0.0006) [2023-03-07 15:53:43,459][213771] Updated weights for policy 0, policy_version 83380 (0.0007) [2023-03-07 15:53:44,214][213771] Updated weights for policy 0, policy_version 83390 (0.0006) [2023-03-07 15:53:44,998][213771] Updated weights for policy 0, policy_version 83400 (0.0006) [2023-03-07 15:53:45,761][213771] Updated weights for policy 0, policy_version 83410 (0.0006) [2023-03-07 15:53:46,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 85415936. Throughput: 0: 13263.4. Samples: 85386301. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:53:46,105][213445] Avg episode reward: [(0, '4077.215')] [2023-03-07 15:53:46,534][213771] Updated weights for policy 0, policy_version 83420 (0.0005) [2023-03-07 15:53:47,314][213771] Updated weights for policy 0, policy_version 83430 (0.0006) [2023-03-07 15:53:48,091][213771] Updated weights for policy 0, policy_version 83440 (0.0006) [2023-03-07 15:53:48,861][213771] Updated weights for policy 0, policy_version 83450 (0.0006) [2023-03-07 15:53:49,632][213771] Updated weights for policy 0, policy_version 83460 (0.0006) [2023-03-07 15:53:50,411][213771] Updated weights for policy 0, policy_version 83470 (0.0006) [2023-03-07 15:53:51,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13243.8, 300 sec: 13246.1). Total num frames: 85481472. Throughput: 0: 13254.5. Samples: 85465801. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:53:51,105][213445] Avg episode reward: [(0, '4165.691')] [2023-03-07 15:53:51,193][213771] Updated weights for policy 0, policy_version 83480 (0.0006) [2023-03-07 15:53:51,954][213771] Updated weights for policy 0, policy_version 83490 (0.0005) [2023-03-07 15:53:52,736][213771] Updated weights for policy 0, policy_version 83500 (0.0008) [2023-03-07 15:53:53,514][213771] Updated weights for policy 0, policy_version 83510 (0.0007) [2023-03-07 15:53:54,296][213771] Updated weights for policy 0, policy_version 83520 (0.0006) [2023-03-07 15:53:55,075][213771] Updated weights for policy 0, policy_version 83530 (0.0007) [2023-03-07 15:53:55,849][213771] Updated weights for policy 0, policy_version 83540 (0.0006) [2023-03-07 15:53:56,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13260.8, 300 sec: 13246.0). Total num frames: 85548032. Throughput: 0: 13236.3. Samples: 85544872. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:53:56,106][213445] Avg episode reward: [(0, '4126.916')] [2023-03-07 15:53:56,618][213771] Updated weights for policy 0, policy_version 83550 (0.0005) [2023-03-07 15:53:57,402][213771] Updated weights for policy 0, policy_version 83560 (0.0007) [2023-03-07 15:53:58,162][213771] Updated weights for policy 0, policy_version 83570 (0.0005) [2023-03-07 15:53:58,917][213771] Updated weights for policy 0, policy_version 83580 (0.0006) [2023-03-07 15:53:59,694][213771] Updated weights for policy 0, policy_version 83590 (0.0007) [2023-03-07 15:54:00,485][213771] Updated weights for policy 0, policy_version 83600 (0.0006) [2023-03-07 15:54:01,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 85614592. Throughput: 0: 13237.8. Samples: 85584766. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:54:01,106][213445] Avg episode reward: [(0, '4143.384')] [2023-03-07 15:54:01,241][213771] Updated weights for policy 0, policy_version 83610 (0.0006) [2023-03-07 15:54:02,014][213771] Updated weights for policy 0, policy_version 83620 (0.0006) [2023-03-07 15:54:02,783][213771] Updated weights for policy 0, policy_version 83630 (0.0006) [2023-03-07 15:54:03,560][213771] Updated weights for policy 0, policy_version 83640 (0.0006) [2023-03-07 15:54:04,346][213771] Updated weights for policy 0, policy_version 83650 (0.0005) [2023-03-07 15:54:05,121][213771] Updated weights for policy 0, policy_version 83660 (0.0006) [2023-03-07 15:54:05,898][213771] Updated weights for policy 0, policy_version 83670 (0.0006) [2023-03-07 15:54:06,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 85680128. Throughput: 0: 13233.4. Samples: 85664070. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:54:06,106][213445] Avg episode reward: [(0, '4102.753')] [2023-03-07 15:54:06,111][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000083672_85680128.pth... [2023-03-07 15:54:06,143][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000080571_82504704.pth [2023-03-07 15:54:06,677][213771] Updated weights for policy 0, policy_version 83680 (0.0006) [2023-03-07 15:54:07,447][213771] Updated weights for policy 0, policy_version 83690 (0.0006) [2023-03-07 15:54:08,224][213771] Updated weights for policy 0, policy_version 83700 (0.0006) [2023-03-07 15:54:09,003][213771] Updated weights for policy 0, policy_version 83710 (0.0007) [2023-03-07 15:54:09,781][213771] Updated weights for policy 0, policy_version 83720 (0.0006) [2023-03-07 15:54:10,561][213771] Updated weights for policy 0, policy_version 83730 (0.0006) [2023-03-07 15:54:11,105][213445] Fps is (10 sec: 13107.1, 60 sec: 13226.7, 300 sec: 13239.1). Total num frames: 85745664. Throughput: 0: 13232.0. Samples: 85742999. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:54:11,106][213445] Avg episode reward: [(0, '4149.394')] [2023-03-07 15:54:11,343][213771] Updated weights for policy 0, policy_version 83740 (0.0006) [2023-03-07 15:54:12,117][213771] Updated weights for policy 0, policy_version 83750 (0.0006) [2023-03-07 15:54:12,880][213771] Updated weights for policy 0, policy_version 83760 (0.0005) [2023-03-07 15:54:13,658][213771] Updated weights for policy 0, policy_version 83770 (0.0006) [2023-03-07 15:54:14,445][213771] Updated weights for policy 0, policy_version 83780 (0.0006) [2023-03-07 15:54:15,219][213771] Updated weights for policy 0, policy_version 83790 (0.0006) [2023-03-07 15:54:15,987][213771] Updated weights for policy 0, policy_version 83800 (0.0006) [2023-03-07 15:54:16,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13239.1). Total num frames: 85812224. Throughput: 0: 13240.8. Samples: 85782833. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:54:16,106][213445] Avg episode reward: [(0, '4063.086')] [2023-03-07 15:54:16,768][213771] Updated weights for policy 0, policy_version 83810 (0.0006) [2023-03-07 15:54:17,558][213771] Updated weights for policy 0, policy_version 83820 (0.0006) [2023-03-07 15:54:18,337][213771] Updated weights for policy 0, policy_version 83830 (0.0006) [2023-03-07 15:54:19,116][213771] Updated weights for policy 0, policy_version 83840 (0.0006) [2023-03-07 15:54:19,892][213771] Updated weights for policy 0, policy_version 83850 (0.0007) [2023-03-07 15:54:20,669][213771] Updated weights for policy 0, policy_version 83860 (0.0006) [2023-03-07 15:54:21,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13209.6, 300 sec: 13239.1). Total num frames: 85877760. Throughput: 0: 13216.1. Samples: 85861689. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:54:21,106][213445] Avg episode reward: [(0, '4088.528')] [2023-03-07 15:54:21,442][213771] Updated weights for policy 0, policy_version 83870 (0.0006) [2023-03-07 15:54:22,201][213771] Updated weights for policy 0, policy_version 83880 (0.0006) [2023-03-07 15:54:22,995][213771] Updated weights for policy 0, policy_version 83890 (0.0007) [2023-03-07 15:54:23,768][213771] Updated weights for policy 0, policy_version 83900 (0.0006) [2023-03-07 15:54:24,534][213771] Updated weights for policy 0, policy_version 83910 (0.0006) [2023-03-07 15:54:25,317][213771] Updated weights for policy 0, policy_version 83920 (0.0006) [2023-03-07 15:54:26,087][213771] Updated weights for policy 0, policy_version 83930 (0.0006) [2023-03-07 15:54:26,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13239.1). Total num frames: 85944320. Throughput: 0: 13209.0. Samples: 85941072. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:54:26,106][213445] Avg episode reward: [(0, '4092.150')] [2023-03-07 15:54:26,881][213771] Updated weights for policy 0, policy_version 83940 (0.0006) [2023-03-07 15:54:27,654][213771] Updated weights for policy 0, policy_version 83950 (0.0006) [2023-03-07 15:54:28,410][213771] Updated weights for policy 0, policy_version 83960 (0.0006) [2023-03-07 15:54:29,176][213771] Updated weights for policy 0, policy_version 83970 (0.0006) [2023-03-07 15:54:29,951][213771] Updated weights for policy 0, policy_version 83980 (0.0006) [2023-03-07 15:54:30,705][213771] Updated weights for policy 0, policy_version 83990 (0.0006) [2023-03-07 15:54:31,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13209.6, 300 sec: 13235.6). Total num frames: 86009856. Throughput: 0: 13206.6. Samples: 85980597. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:54:31,106][213445] Avg episode reward: [(0, '4061.699')] [2023-03-07 15:54:31,490][213771] Updated weights for policy 0, policy_version 84000 (0.0007) [2023-03-07 15:54:32,271][213771] Updated weights for policy 0, policy_version 84010 (0.0007) [2023-03-07 15:54:33,049][213771] Updated weights for policy 0, policy_version 84020 (0.0006) [2023-03-07 15:54:33,824][213771] Updated weights for policy 0, policy_version 84030 (0.0006) [2023-03-07 15:54:34,600][213771] Updated weights for policy 0, policy_version 84040 (0.0006) [2023-03-07 15:54:35,397][213771] Updated weights for policy 0, policy_version 84050 (0.0006) [2023-03-07 15:54:36,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13209.6, 300 sec: 13239.1). Total num frames: 86076416. Throughput: 0: 13201.2. Samples: 86059857. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:54:36,106][213445] Avg episode reward: [(0, '4232.094')] [2023-03-07 15:54:36,170][213771] Updated weights for policy 0, policy_version 84060 (0.0006) [2023-03-07 15:54:36,941][213771] Updated weights for policy 0, policy_version 84070 (0.0007) [2023-03-07 15:54:37,712][213771] Updated weights for policy 0, policy_version 84080 (0.0006) [2023-03-07 15:54:38,468][213771] Updated weights for policy 0, policy_version 84090 (0.0006) [2023-03-07 15:54:39,258][213771] Updated weights for policy 0, policy_version 84100 (0.0006) [2023-03-07 15:54:40,011][213771] Updated weights for policy 0, policy_version 84110 (0.0006) [2023-03-07 15:54:40,806][213771] Updated weights for policy 0, policy_version 84120 (0.0005) [2023-03-07 15:54:41,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13209.6, 300 sec: 13235.6). Total num frames: 86141952. Throughput: 0: 13208.9. Samples: 86139269. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:54:41,105][213445] Avg episode reward: [(0, '4125.430')] [2023-03-07 15:54:41,568][213771] Updated weights for policy 0, policy_version 84130 (0.0006) [2023-03-07 15:54:42,335][213771] Updated weights for policy 0, policy_version 84140 (0.0006) [2023-03-07 15:54:43,143][213771] Updated weights for policy 0, policy_version 84150 (0.0007) [2023-03-07 15:54:43,895][213771] Updated weights for policy 0, policy_version 84160 (0.0006) [2023-03-07 15:54:44,667][213771] Updated weights for policy 0, policy_version 84170 (0.0006) [2023-03-07 15:54:45,438][213771] Updated weights for policy 0, policy_version 84180 (0.0006) [2023-03-07 15:54:46,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13209.6, 300 sec: 13235.6). Total num frames: 86208512. Throughput: 0: 13204.9. Samples: 86178988. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:54:46,106][213445] Avg episode reward: [(0, '4197.198')] [2023-03-07 15:54:46,222][213771] Updated weights for policy 0, policy_version 84190 (0.0005) [2023-03-07 15:54:46,970][213771] Updated weights for policy 0, policy_version 84200 (0.0006) [2023-03-07 15:54:47,737][213771] Updated weights for policy 0, policy_version 84210 (0.0006) [2023-03-07 15:54:48,523][213771] Updated weights for policy 0, policy_version 84220 (0.0006) [2023-03-07 15:54:49,287][213771] Updated weights for policy 0, policy_version 84230 (0.0006) [2023-03-07 15:54:50,049][213771] Updated weights for policy 0, policy_version 84240 (0.0005) [2023-03-07 15:54:50,831][213771] Updated weights for policy 0, policy_version 84250 (0.0005) [2023-03-07 15:54:51,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13226.6, 300 sec: 13235.6). Total num frames: 86275072. Throughput: 0: 13218.4. Samples: 86258899. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:54:51,106][213445] Avg episode reward: [(0, '4204.481')] [2023-03-07 15:54:51,606][213771] Updated weights for policy 0, policy_version 84260 (0.0005) [2023-03-07 15:54:52,375][213771] Updated weights for policy 0, policy_version 84270 (0.0006) [2023-03-07 15:54:53,140][213771] Updated weights for policy 0, policy_version 84280 (0.0006) [2023-03-07 15:54:53,911][213771] Updated weights for policy 0, policy_version 84290 (0.0006) [2023-03-07 15:54:54,668][213771] Updated weights for policy 0, policy_version 84300 (0.0006) [2023-03-07 15:54:55,456][213771] Updated weights for policy 0, policy_version 84310 (0.0006) [2023-03-07 15:54:56,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13226.7, 300 sec: 13235.6). Total num frames: 86341632. Throughput: 0: 13232.9. Samples: 86338479. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:54:56,106][213445] Avg episode reward: [(0, '4317.877')] [2023-03-07 15:54:56,234][213771] Updated weights for policy 0, policy_version 84320 (0.0007) [2023-03-07 15:54:56,999][213771] Updated weights for policy 0, policy_version 84330 (0.0006) [2023-03-07 15:54:57,765][213771] Updated weights for policy 0, policy_version 84340 (0.0006) [2023-03-07 15:54:58,533][213771] Updated weights for policy 0, policy_version 84350 (0.0007) [2023-03-07 15:54:59,290][213771] Updated weights for policy 0, policy_version 84360 (0.0006) [2023-03-07 15:55:00,059][213771] Updated weights for policy 0, policy_version 84370 (0.0006) [2023-03-07 15:55:00,860][213771] Updated weights for policy 0, policy_version 84380 (0.0006) [2023-03-07 15:55:01,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13226.7, 300 sec: 13235.6). Total num frames: 86408192. Throughput: 0: 13235.0. Samples: 86378406. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:55:01,116][213445] Avg episode reward: [(0, '4285.702')] [2023-03-07 15:55:01,630][213771] Updated weights for policy 0, policy_version 84390 (0.0006) [2023-03-07 15:55:02,415][213771] Updated weights for policy 0, policy_version 84400 (0.0006) [2023-03-07 15:55:03,198][213771] Updated weights for policy 0, policy_version 84410 (0.0006) [2023-03-07 15:55:03,977][213771] Updated weights for policy 0, policy_version 84420 (0.0007) [2023-03-07 15:55:04,751][213771] Updated weights for policy 0, policy_version 84430 (0.0007) [2023-03-07 15:55:05,539][213771] Updated weights for policy 0, policy_version 84440 (0.0005) [2023-03-07 15:55:06,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13232.2). Total num frames: 86473728. Throughput: 0: 13239.1. Samples: 86457447. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:55:06,116][213445] Avg episode reward: [(0, '4291.358')] [2023-03-07 15:55:06,296][213771] Updated weights for policy 0, policy_version 84450 (0.0006) [2023-03-07 15:55:07,079][213771] Updated weights for policy 0, policy_version 84460 (0.0006) [2023-03-07 15:55:07,842][213771] Updated weights for policy 0, policy_version 84470 (0.0007) [2023-03-07 15:55:08,630][213771] Updated weights for policy 0, policy_version 84480 (0.0006) [2023-03-07 15:55:09,399][213771] Updated weights for policy 0, policy_version 84490 (0.0006) [2023-03-07 15:55:10,165][213771] Updated weights for policy 0, policy_version 84500 (0.0005) [2023-03-07 15:55:10,936][213771] Updated weights for policy 0, policy_version 84510 (0.0006) [2023-03-07 15:55:11,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13243.7, 300 sec: 13232.2). Total num frames: 86540288. Throughput: 0: 13236.4. Samples: 86536710. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:55:11,116][213445] Avg episode reward: [(0, '4303.456')] [2023-03-07 15:55:11,721][213771] Updated weights for policy 0, policy_version 84520 (0.0007) [2023-03-07 15:55:12,492][213771] Updated weights for policy 0, policy_version 84530 (0.0006) [2023-03-07 15:55:13,284][213771] Updated weights for policy 0, policy_version 84540 (0.0007) [2023-03-07 15:55:14,049][213771] Updated weights for policy 0, policy_version 84550 (0.0008) [2023-03-07 15:55:14,815][213771] Updated weights for policy 0, policy_version 84560 (0.0006) [2023-03-07 15:55:15,585][213771] Updated weights for policy 0, policy_version 84570 (0.0007) [2023-03-07 15:55:16,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13226.7, 300 sec: 13228.7). Total num frames: 86605824. Throughput: 0: 13241.8. Samples: 86576482. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:55:16,116][213445] Avg episode reward: [(0, '4244.995')] [2023-03-07 15:55:16,355][213771] Updated weights for policy 0, policy_version 84580 (0.0007) [2023-03-07 15:55:17,121][213771] Updated weights for policy 0, policy_version 84590 (0.0006) [2023-03-07 15:55:17,917][213771] Updated weights for policy 0, policy_version 84600 (0.0006) [2023-03-07 15:55:18,680][213771] Updated weights for policy 0, policy_version 84610 (0.0007) [2023-03-07 15:55:19,446][213771] Updated weights for policy 0, policy_version 84620 (0.0006) [2023-03-07 15:55:20,246][213771] Updated weights for policy 0, policy_version 84630 (0.0007) [2023-03-07 15:55:21,017][213771] Updated weights for policy 0, policy_version 84640 (0.0006) [2023-03-07 15:55:21,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13232.2). Total num frames: 86672384. Throughput: 0: 13247.0. Samples: 86655972. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:55:21,116][213445] Avg episode reward: [(0, '4257.470')] [2023-03-07 15:55:21,771][213771] Updated weights for policy 0, policy_version 84650 (0.0006) [2023-03-07 15:55:22,545][213771] Updated weights for policy 0, policy_version 84660 (0.0006) [2023-03-07 15:55:23,324][213771] Updated weights for policy 0, policy_version 84670 (0.0006) [2023-03-07 15:55:24,096][213771] Updated weights for policy 0, policy_version 84680 (0.0006) [2023-03-07 15:55:24,854][213771] Updated weights for policy 0, policy_version 84690 (0.0006) [2023-03-07 15:55:25,635][213771] Updated weights for policy 0, policy_version 84700 (0.0005) [2023-03-07 15:55:26,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13243.7, 300 sec: 13232.2). Total num frames: 86738944. Throughput: 0: 13251.2. Samples: 86735572. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:55:26,106][213445] Avg episode reward: [(0, '4190.179')] [2023-03-07 15:55:26,397][213771] Updated weights for policy 0, policy_version 84710 (0.0007) [2023-03-07 15:55:27,169][213771] Updated weights for policy 0, policy_version 84720 (0.0007) [2023-03-07 15:55:27,936][213771] Updated weights for policy 0, policy_version 84730 (0.0006) [2023-03-07 15:55:28,709][213771] Updated weights for policy 0, policy_version 84740 (0.0006) [2023-03-07 15:55:29,482][213771] Updated weights for policy 0, policy_version 84750 (0.0006) [2023-03-07 15:55:30,249][213771] Updated weights for policy 0, policy_version 84760 (0.0006) [2023-03-07 15:55:31,021][213771] Updated weights for policy 0, policy_version 84770 (0.0006) [2023-03-07 15:55:31,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13235.6). Total num frames: 86805504. Throughput: 0: 13254.8. Samples: 86775454. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:55:31,106][213445] Avg episode reward: [(0, '4157.859')] [2023-03-07 15:55:31,802][213771] Updated weights for policy 0, policy_version 84780 (0.0006) [2023-03-07 15:55:32,595][213771] Updated weights for policy 0, policy_version 84790 (0.0007) [2023-03-07 15:55:33,357][213771] Updated weights for policy 0, policy_version 84800 (0.0006) [2023-03-07 15:55:34,127][213771] Updated weights for policy 0, policy_version 84810 (0.0007) [2023-03-07 15:55:34,886][213771] Updated weights for policy 0, policy_version 84820 (0.0006) [2023-03-07 15:55:35,665][213771] Updated weights for policy 0, policy_version 84830 (0.0007) [2023-03-07 15:55:36,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13232.2). Total num frames: 86871040. Throughput: 0: 13245.7. Samples: 86854954. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 15:55:36,106][213445] Avg episode reward: [(0, '4225.537')] [2023-03-07 15:55:36,420][213771] Updated weights for policy 0, policy_version 84840 (0.0006) [2023-03-07 15:55:37,205][213771] Updated weights for policy 0, policy_version 84850 (0.0005) [2023-03-07 15:55:37,977][213771] Updated weights for policy 0, policy_version 84860 (0.0006) [2023-03-07 15:55:38,740][213771] Updated weights for policy 0, policy_version 84870 (0.0006) [2023-03-07 15:55:39,521][213771] Updated weights for policy 0, policy_version 84880 (0.0006) [2023-03-07 15:55:40,293][213771] Updated weights for policy 0, policy_version 84890 (0.0007) [2023-03-07 15:55:41,063][213771] Updated weights for policy 0, policy_version 84900 (0.0006) [2023-03-07 15:55:41,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13232.2). Total num frames: 86937600. Throughput: 0: 13248.2. Samples: 86934646. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:55:41,106][213445] Avg episode reward: [(0, '4251.700')] [2023-03-07 15:55:41,845][213771] Updated weights for policy 0, policy_version 84910 (0.0006) [2023-03-07 15:55:42,614][213771] Updated weights for policy 0, policy_version 84920 (0.0006) [2023-03-07 15:55:43,393][213771] Updated weights for policy 0, policy_version 84930 (0.0006) [2023-03-07 15:55:44,186][213771] Updated weights for policy 0, policy_version 84940 (0.0007) [2023-03-07 15:55:44,957][213771] Updated weights for policy 0, policy_version 84950 (0.0007) [2023-03-07 15:55:45,720][213771] Updated weights for policy 0, policy_version 84960 (0.0007) [2023-03-07 15:55:46,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13232.2). Total num frames: 87003136. Throughput: 0: 13238.8. Samples: 86974154. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:55:46,106][213445] Avg episode reward: [(0, '4108.353')] [2023-03-07 15:55:46,490][213771] Updated weights for policy 0, policy_version 84970 (0.0005) [2023-03-07 15:55:47,273][213771] Updated weights for policy 0, policy_version 84980 (0.0006) [2023-03-07 15:55:48,032][213771] Updated weights for policy 0, policy_version 84990 (0.0005) [2023-03-07 15:55:48,796][213771] Updated weights for policy 0, policy_version 85000 (0.0006) [2023-03-07 15:55:49,569][213771] Updated weights for policy 0, policy_version 85010 (0.0006) [2023-03-07 15:55:50,353][213771] Updated weights for policy 0, policy_version 85020 (0.0006) [2023-03-07 15:55:51,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13232.2). Total num frames: 87069696. Throughput: 0: 13251.4. Samples: 87053760. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:55:51,106][213445] Avg episode reward: [(0, '4216.451')] [2023-03-07 15:55:51,116][213771] Updated weights for policy 0, policy_version 85030 (0.0005) [2023-03-07 15:55:51,897][213771] Updated weights for policy 0, policy_version 85040 (0.0006) [2023-03-07 15:55:52,667][213771] Updated weights for policy 0, policy_version 85050 (0.0006) [2023-03-07 15:55:53,441][213771] Updated weights for policy 0, policy_version 85060 (0.0007) [2023-03-07 15:55:54,202][213771] Updated weights for policy 0, policy_version 85070 (0.0006) [2023-03-07 15:55:54,973][213771] Updated weights for policy 0, policy_version 85080 (0.0005) [2023-03-07 15:55:55,749][213771] Updated weights for policy 0, policy_version 85090 (0.0006) [2023-03-07 15:55:56,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13243.8, 300 sec: 13232.2). Total num frames: 87136256. Throughput: 0: 13258.2. Samples: 87133325. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:55:56,106][213445] Avg episode reward: [(0, '4247.370')] [2023-03-07 15:55:56,526][213771] Updated weights for policy 0, policy_version 85100 (0.0006) [2023-03-07 15:55:57,296][213771] Updated weights for policy 0, policy_version 85110 (0.0007) [2023-03-07 15:55:58,080][213771] Updated weights for policy 0, policy_version 85120 (0.0005) [2023-03-07 15:55:58,849][213771] Updated weights for policy 0, policy_version 85130 (0.0005) [2023-03-07 15:55:59,610][213771] Updated weights for policy 0, policy_version 85140 (0.0006) [2023-03-07 15:56:00,391][213771] Updated weights for policy 0, policy_version 85150 (0.0007) [2023-03-07 15:56:01,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13243.7, 300 sec: 13232.2). Total num frames: 87202816. Throughput: 0: 13255.2. Samples: 87172966. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:56:01,106][213445] Avg episode reward: [(0, '4229.809')] [2023-03-07 15:56:01,163][213771] Updated weights for policy 0, policy_version 85160 (0.0007) [2023-03-07 15:56:01,937][213771] Updated weights for policy 0, policy_version 85170 (0.0006) [2023-03-07 15:56:02,699][213771] Updated weights for policy 0, policy_version 85180 (0.0006) [2023-03-07 15:56:03,483][213771] Updated weights for policy 0, policy_version 85190 (0.0007) [2023-03-07 15:56:04,246][213771] Updated weights for policy 0, policy_version 85200 (0.0006) [2023-03-07 15:56:05,029][213771] Updated weights for policy 0, policy_version 85210 (0.0006) [2023-03-07 15:56:05,809][213771] Updated weights for policy 0, policy_version 85220 (0.0006) [2023-03-07 15:56:06,105][213445] Fps is (10 sec: 13209.3, 60 sec: 13243.7, 300 sec: 13232.2). Total num frames: 87268352. Throughput: 0: 13256.5. Samples: 87252514. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:56:06,106][213445] Avg episode reward: [(0, '4021.307')] [2023-03-07 15:56:06,117][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000085224_87269376.pth... [2023-03-07 15:56:06,149][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000082122_84092928.pth [2023-03-07 15:56:06,569][213771] Updated weights for policy 0, policy_version 85230 (0.0006) [2023-03-07 15:56:07,368][213771] Updated weights for policy 0, policy_version 85240 (0.0006) [2023-03-07 15:56:08,147][213771] Updated weights for policy 0, policy_version 85250 (0.0006) [2023-03-07 15:56:08,906][213771] Updated weights for policy 0, policy_version 85260 (0.0006) [2023-03-07 15:56:09,673][213771] Updated weights for policy 0, policy_version 85270 (0.0006) [2023-03-07 15:56:10,453][213771] Updated weights for policy 0, policy_version 85280 (0.0006) [2023-03-07 15:56:11,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13232.2). Total num frames: 87334912. Throughput: 0: 13249.5. Samples: 87331799. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:56:11,106][213445] Avg episode reward: [(0, '4089.637')] [2023-03-07 15:56:11,234][213771] Updated weights for policy 0, policy_version 85290 (0.0006) [2023-03-07 15:56:11,981][213771] Updated weights for policy 0, policy_version 85300 (0.0006) [2023-03-07 15:56:12,763][213771] Updated weights for policy 0, policy_version 85310 (0.0006) [2023-03-07 15:56:13,538][213771] Updated weights for policy 0, policy_version 85320 (0.0006) [2023-03-07 15:56:14,313][213771] Updated weights for policy 0, policy_version 85330 (0.0007) [2023-03-07 15:56:15,081][213771] Updated weights for policy 0, policy_version 85340 (0.0005) [2023-03-07 15:56:15,856][213771] Updated weights for policy 0, policy_version 85350 (0.0005) [2023-03-07 15:56:16,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13260.8, 300 sec: 13235.6). Total num frames: 87401472. Throughput: 0: 13247.9. Samples: 87371609. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:56:16,106][213445] Avg episode reward: [(0, '4160.730')] [2023-03-07 15:56:16,635][213771] Updated weights for policy 0, policy_version 85360 (0.0007) [2023-03-07 15:56:17,414][213771] Updated weights for policy 0, policy_version 85370 (0.0006) [2023-03-07 15:56:18,201][213771] Updated weights for policy 0, policy_version 85380 (0.0006) [2023-03-07 15:56:18,958][213771] Updated weights for policy 0, policy_version 85390 (0.0006) [2023-03-07 15:56:19,738][213771] Updated weights for policy 0, policy_version 85400 (0.0006) [2023-03-07 15:56:20,498][213771] Updated weights for policy 0, policy_version 85410 (0.0006) [2023-03-07 15:56:21,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13235.6). Total num frames: 87468032. Throughput: 0: 13242.8. Samples: 87450877. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:56:21,106][213445] Avg episode reward: [(0, '4157.510')] [2023-03-07 15:56:21,249][213771] Updated weights for policy 0, policy_version 85420 (0.0006) [2023-03-07 15:56:22,023][213771] Updated weights for policy 0, policy_version 85430 (0.0006) [2023-03-07 15:56:22,797][213771] Updated weights for policy 0, policy_version 85440 (0.0006) [2023-03-07 15:56:23,565][213771] Updated weights for policy 0, policy_version 85450 (0.0006) [2023-03-07 15:56:24,336][213771] Updated weights for policy 0, policy_version 85460 (0.0006) [2023-03-07 15:56:25,111][213771] Updated weights for policy 0, policy_version 85470 (0.0006) [2023-03-07 15:56:25,881][213771] Updated weights for policy 0, policy_version 85480 (0.0006) [2023-03-07 15:56:26,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13232.2). Total num frames: 87533568. Throughput: 0: 13247.7. Samples: 87530793. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:56:26,106][213445] Avg episode reward: [(0, '4196.861')] [2023-03-07 15:56:26,642][213771] Updated weights for policy 0, policy_version 85490 (0.0005) [2023-03-07 15:56:27,393][213771] Updated weights for policy 0, policy_version 85500 (0.0005) [2023-03-07 15:56:28,195][213771] Updated weights for policy 0, policy_version 85510 (0.0005) [2023-03-07 15:56:28,964][213771] Updated weights for policy 0, policy_version 85520 (0.0006) [2023-03-07 15:56:29,737][213771] Updated weights for policy 0, policy_version 85530 (0.0006) [2023-03-07 15:56:30,512][213771] Updated weights for policy 0, policy_version 85540 (0.0006) [2023-03-07 15:56:31,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13235.6). Total num frames: 87600128. Throughput: 0: 13259.1. Samples: 87570813. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:56:31,106][213445] Avg episode reward: [(0, '4136.274')] [2023-03-07 15:56:31,277][213771] Updated weights for policy 0, policy_version 85550 (0.0006) [2023-03-07 15:56:32,063][213771] Updated weights for policy 0, policy_version 85560 (0.0006) [2023-03-07 15:56:32,827][213771] Updated weights for policy 0, policy_version 85570 (0.0006) [2023-03-07 15:56:33,612][213771] Updated weights for policy 0, policy_version 85580 (0.0006) [2023-03-07 15:56:34,372][213771] Updated weights for policy 0, policy_version 85590 (0.0006) [2023-03-07 15:56:35,145][213771] Updated weights for policy 0, policy_version 85600 (0.0006) [2023-03-07 15:56:35,896][213771] Updated weights for policy 0, policy_version 85610 (0.0006) [2023-03-07 15:56:36,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13235.6). Total num frames: 87666688. Throughput: 0: 13255.2. Samples: 87650245. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:56:36,106][213445] Avg episode reward: [(0, '4175.919')] [2023-03-07 15:56:36,665][213771] Updated weights for policy 0, policy_version 85620 (0.0006) [2023-03-07 15:56:37,444][213771] Updated weights for policy 0, policy_version 85630 (0.0006) [2023-03-07 15:56:38,209][213771] Updated weights for policy 0, policy_version 85640 (0.0006) [2023-03-07 15:56:38,981][213771] Updated weights for policy 0, policy_version 85650 (0.0006) [2023-03-07 15:56:39,756][213771] Updated weights for policy 0, policy_version 85660 (0.0006) [2023-03-07 15:56:40,528][213771] Updated weights for policy 0, policy_version 85670 (0.0006) [2023-03-07 15:56:41,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13235.6). Total num frames: 87733248. Throughput: 0: 13263.3. Samples: 87730176. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:56:41,106][213445] Avg episode reward: [(0, '4257.525')] [2023-03-07 15:56:41,303][213771] Updated weights for policy 0, policy_version 85680 (0.0007) [2023-03-07 15:56:42,078][213771] Updated weights for policy 0, policy_version 85690 (0.0005) [2023-03-07 15:56:42,847][213771] Updated weights for policy 0, policy_version 85700 (0.0005) [2023-03-07 15:56:43,611][213771] Updated weights for policy 0, policy_version 85710 (0.0006) [2023-03-07 15:56:44,387][213771] Updated weights for policy 0, policy_version 85720 (0.0006) [2023-03-07 15:56:45,158][213771] Updated weights for policy 0, policy_version 85730 (0.0005) [2023-03-07 15:56:45,918][213771] Updated weights for policy 0, policy_version 85740 (0.0006) [2023-03-07 15:56:46,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13277.9, 300 sec: 13239.1). Total num frames: 87799808. Throughput: 0: 13267.2. Samples: 87769989. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:56:46,106][213445] Avg episode reward: [(0, '4119.813')] [2023-03-07 15:56:46,692][213771] Updated weights for policy 0, policy_version 85750 (0.0006) [2023-03-07 15:56:47,462][213771] Updated weights for policy 0, policy_version 85760 (0.0008) [2023-03-07 15:56:48,235][213771] Updated weights for policy 0, policy_version 85770 (0.0005) [2023-03-07 15:56:49,003][213771] Updated weights for policy 0, policy_version 85780 (0.0007) [2023-03-07 15:56:49,778][213771] Updated weights for policy 0, policy_version 85790 (0.0006) [2023-03-07 15:56:50,557][213771] Updated weights for policy 0, policy_version 85800 (0.0006) [2023-03-07 15:56:51,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13239.1). Total num frames: 87865344. Throughput: 0: 13265.6. Samples: 87849463. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:56:51,106][213445] Avg episode reward: [(0, '4212.862')] [2023-03-07 15:56:51,333][213771] Updated weights for policy 0, policy_version 85810 (0.0006) [2023-03-07 15:56:52,113][213771] Updated weights for policy 0, policy_version 85820 (0.0006) [2023-03-07 15:56:52,895][213771] Updated weights for policy 0, policy_version 85830 (0.0006) [2023-03-07 15:56:53,682][213771] Updated weights for policy 0, policy_version 85840 (0.0005) [2023-03-07 15:56:54,451][213771] Updated weights for policy 0, policy_version 85850 (0.0006) [2023-03-07 15:56:55,209][213771] Updated weights for policy 0, policy_version 85860 (0.0007) [2023-03-07 15:56:55,986][213771] Updated weights for policy 0, policy_version 85870 (0.0006) [2023-03-07 15:56:56,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13239.1). Total num frames: 87931904. Throughput: 0: 13266.8. Samples: 87928805. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:56:56,106][213445] Avg episode reward: [(0, '4220.474')] [2023-03-07 15:56:56,763][213771] Updated weights for policy 0, policy_version 85880 (0.0005) [2023-03-07 15:56:57,520][213771] Updated weights for policy 0, policy_version 85890 (0.0006) [2023-03-07 15:56:58,299][213771] Updated weights for policy 0, policy_version 85900 (0.0006) [2023-03-07 15:56:59,084][213771] Updated weights for policy 0, policy_version 85910 (0.0005) [2023-03-07 15:56:59,839][213771] Updated weights for policy 0, policy_version 85920 (0.0006) [2023-03-07 15:57:00,618][213771] Updated weights for policy 0, policy_version 85930 (0.0006) [2023-03-07 15:57:01,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13242.6). Total num frames: 87998464. Throughput: 0: 13269.1. Samples: 87968718. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:57:01,106][213445] Avg episode reward: [(0, '4228.263')] [2023-03-07 15:57:01,398][213771] Updated weights for policy 0, policy_version 85940 (0.0006) [2023-03-07 15:57:02,169][213771] Updated weights for policy 0, policy_version 85950 (0.0005) [2023-03-07 15:57:02,945][213771] Updated weights for policy 0, policy_version 85960 (0.0005) [2023-03-07 15:57:03,725][213771] Updated weights for policy 0, policy_version 85970 (0.0006) [2023-03-07 15:57:04,471][213771] Updated weights for policy 0, policy_version 85980 (0.0006) [2023-03-07 15:57:05,235][213771] Updated weights for policy 0, policy_version 85990 (0.0006) [2023-03-07 15:57:06,021][213771] Updated weights for policy 0, policy_version 86000 (0.0006) [2023-03-07 15:57:06,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13277.9, 300 sec: 13242.6). Total num frames: 88065024. Throughput: 0: 13276.5. Samples: 88048321. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:57:06,116][213445] Avg episode reward: [(0, '4223.079')] [2023-03-07 15:57:06,789][213771] Updated weights for policy 0, policy_version 86010 (0.0006) [2023-03-07 15:57:07,559][213771] Updated weights for policy 0, policy_version 86020 (0.0006) [2023-03-07 15:57:08,343][213771] Updated weights for policy 0, policy_version 86030 (0.0006) [2023-03-07 15:57:09,113][213771] Updated weights for policy 0, policy_version 86040 (0.0006) [2023-03-07 15:57:09,884][213771] Updated weights for policy 0, policy_version 86050 (0.0006) [2023-03-07 15:57:10,663][213771] Updated weights for policy 0, policy_version 86060 (0.0005) [2023-03-07 15:57:11,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13260.8, 300 sec: 13242.6). Total num frames: 88130560. Throughput: 0: 13266.4. Samples: 88127784. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:57:11,116][213445] Avg episode reward: [(0, '4185.129')] [2023-03-07 15:57:11,424][213771] Updated weights for policy 0, policy_version 86070 (0.0006) [2023-03-07 15:57:12,188][213771] Updated weights for policy 0, policy_version 86080 (0.0006) [2023-03-07 15:57:12,985][213771] Updated weights for policy 0, policy_version 86090 (0.0005) [2023-03-07 15:57:13,745][213771] Updated weights for policy 0, policy_version 86100 (0.0006) [2023-03-07 15:57:14,489][213771] Updated weights for policy 0, policy_version 86110 (0.0006) [2023-03-07 15:57:15,275][213771] Updated weights for policy 0, policy_version 86120 (0.0006) [2023-03-07 15:57:16,041][213771] Updated weights for policy 0, policy_version 86130 (0.0006) [2023-03-07 15:57:16,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13260.8, 300 sec: 13246.1). Total num frames: 88197120. Throughput: 0: 13264.3. Samples: 88167706. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:57:16,106][213445] Avg episode reward: [(0, '4210.522')] [2023-03-07 15:57:16,808][213771] Updated weights for policy 0, policy_version 86140 (0.0005) [2023-03-07 15:57:17,577][213771] Updated weights for policy 0, policy_version 86150 (0.0005) [2023-03-07 15:57:18,366][213771] Updated weights for policy 0, policy_version 86160 (0.0006) [2023-03-07 15:57:19,138][213771] Updated weights for policy 0, policy_version 86170 (0.0007) [2023-03-07 15:57:19,906][213771] Updated weights for policy 0, policy_version 86180 (0.0006) [2023-03-07 15:57:20,686][213771] Updated weights for policy 0, policy_version 86190 (0.0006) [2023-03-07 15:57:21,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13260.8, 300 sec: 13246.0). Total num frames: 88263680. Throughput: 0: 13272.2. Samples: 88247495. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:57:21,105][213445] Avg episode reward: [(0, '4211.867')] [2023-03-07 15:57:21,449][213771] Updated weights for policy 0, policy_version 86200 (0.0006) [2023-03-07 15:57:22,223][213771] Updated weights for policy 0, policy_version 86210 (0.0006) [2023-03-07 15:57:22,992][213771] Updated weights for policy 0, policy_version 86220 (0.0006) [2023-03-07 15:57:23,776][213771] Updated weights for policy 0, policy_version 86230 (0.0007) [2023-03-07 15:57:24,544][213771] Updated weights for policy 0, policy_version 86240 (0.0006) [2023-03-07 15:57:25,323][213771] Updated weights for policy 0, policy_version 86250 (0.0007) [2023-03-07 15:57:26,084][213771] Updated weights for policy 0, policy_version 86260 (0.0005) [2023-03-07 15:57:26,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13277.9, 300 sec: 13246.0). Total num frames: 88330240. Throughput: 0: 13260.1. Samples: 88326880. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:57:26,106][213445] Avg episode reward: [(0, '4217.784')] [2023-03-07 15:57:26,852][213771] Updated weights for policy 0, policy_version 86270 (0.0006) [2023-03-07 15:57:27,619][213771] Updated weights for policy 0, policy_version 86280 (0.0006) [2023-03-07 15:57:28,376][213771] Updated weights for policy 0, policy_version 86290 (0.0005) [2023-03-07 15:57:29,130][213771] Updated weights for policy 0, policy_version 86300 (0.0006) [2023-03-07 15:57:29,927][213771] Updated weights for policy 0, policy_version 86310 (0.0005) [2023-03-07 15:57:30,695][213771] Updated weights for policy 0, policy_version 86320 (0.0006) [2023-03-07 15:57:31,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13277.9, 300 sec: 13249.5). Total num frames: 88396800. Throughput: 0: 13266.9. Samples: 88367000. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:57:31,106][213445] Avg episode reward: [(0, '4262.116')] [2023-03-07 15:57:31,478][213771] Updated weights for policy 0, policy_version 86330 (0.0005) [2023-03-07 15:57:32,244][213771] Updated weights for policy 0, policy_version 86340 (0.0006) [2023-03-07 15:57:33,016][213771] Updated weights for policy 0, policy_version 86350 (0.0006) [2023-03-07 15:57:33,780][213771] Updated weights for policy 0, policy_version 86360 (0.0006) [2023-03-07 15:57:34,573][213771] Updated weights for policy 0, policy_version 86370 (0.0006) [2023-03-07 15:57:35,362][213771] Updated weights for policy 0, policy_version 86380 (0.0006) [2023-03-07 15:57:36,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13246.0). Total num frames: 88462336. Throughput: 0: 13260.3. Samples: 88446178. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:57:36,106][213445] Avg episode reward: [(0, '4234.492')] [2023-03-07 15:57:36,140][213771] Updated weights for policy 0, policy_version 86390 (0.0007) [2023-03-07 15:57:36,925][213771] Updated weights for policy 0, policy_version 86400 (0.0006) [2023-03-07 15:57:37,697][213771] Updated weights for policy 0, policy_version 86410 (0.0006) [2023-03-07 15:57:38,473][213771] Updated weights for policy 0, policy_version 86420 (0.0007) [2023-03-07 15:57:39,242][213771] Updated weights for policy 0, policy_version 86430 (0.0005) [2023-03-07 15:57:40,016][213771] Updated weights for policy 0, policy_version 86440 (0.0006) [2023-03-07 15:57:40,769][213771] Updated weights for policy 0, policy_version 86450 (0.0006) [2023-03-07 15:57:41,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 88528896. Throughput: 0: 13261.0. Samples: 88525552. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:57:41,106][213445] Avg episode reward: [(0, '4246.331')] [2023-03-07 15:57:41,562][213771] Updated weights for policy 0, policy_version 86460 (0.0006) [2023-03-07 15:57:42,329][213771] Updated weights for policy 0, policy_version 86470 (0.0007) [2023-03-07 15:57:43,104][213771] Updated weights for policy 0, policy_version 86480 (0.0006) [2023-03-07 15:57:43,885][213771] Updated weights for policy 0, policy_version 86490 (0.0007) [2023-03-07 15:57:44,633][213771] Updated weights for policy 0, policy_version 86500 (0.0006) [2023-03-07 15:57:45,404][213771] Updated weights for policy 0, policy_version 86510 (0.0007) [2023-03-07 15:57:46,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13246.1). Total num frames: 88594432. Throughput: 0: 13255.5. Samples: 88565216. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:57:46,106][213445] Avg episode reward: [(0, '4253.874')] [2023-03-07 15:57:46,184][213771] Updated weights for policy 0, policy_version 86520 (0.0006) [2023-03-07 15:57:46,957][213771] Updated weights for policy 0, policy_version 86530 (0.0006) [2023-03-07 15:57:47,728][213771] Updated weights for policy 0, policy_version 86540 (0.0006) [2023-03-07 15:57:48,493][213771] Updated weights for policy 0, policy_version 86550 (0.0005) [2023-03-07 15:57:49,251][213771] Updated weights for policy 0, policy_version 86560 (0.0007) [2023-03-07 15:57:50,029][213771] Updated weights for policy 0, policy_version 86570 (0.0006) [2023-03-07 15:57:50,791][213771] Updated weights for policy 0, policy_version 86580 (0.0007) [2023-03-07 15:57:51,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13277.9, 300 sec: 13253.0). Total num frames: 88662016. Throughput: 0: 13260.0. Samples: 88645023. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:57:51,106][213445] Avg episode reward: [(0, '4159.058')] [2023-03-07 15:57:51,555][213771] Updated weights for policy 0, policy_version 86590 (0.0006) [2023-03-07 15:57:52,340][213771] Updated weights for policy 0, policy_version 86600 (0.0006) [2023-03-07 15:57:53,122][213771] Updated weights for policy 0, policy_version 86610 (0.0005) [2023-03-07 15:57:53,894][213771] Updated weights for policy 0, policy_version 86620 (0.0006) [2023-03-07 15:57:54,657][213771] Updated weights for policy 0, policy_version 86630 (0.0006) [2023-03-07 15:57:55,437][213771] Updated weights for policy 0, policy_version 86640 (0.0006) [2023-03-07 15:57:56,105][213445] Fps is (10 sec: 13414.2, 60 sec: 13277.8, 300 sec: 13253.0). Total num frames: 88728576. Throughput: 0: 13268.2. Samples: 88724852. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:57:56,106][213445] Avg episode reward: [(0, '4207.121')] [2023-03-07 15:57:56,203][213771] Updated weights for policy 0, policy_version 86650 (0.0006) [2023-03-07 15:57:56,966][213771] Updated weights for policy 0, policy_version 86660 (0.0006) [2023-03-07 15:57:57,745][213771] Updated weights for policy 0, policy_version 86670 (0.0006) [2023-03-07 15:57:58,521][213771] Updated weights for policy 0, policy_version 86680 (0.0006) [2023-03-07 15:57:59,313][213771] Updated weights for policy 0, policy_version 86690 (0.0006) [2023-03-07 15:58:00,065][213771] Updated weights for policy 0, policy_version 86700 (0.0005) [2023-03-07 15:58:00,854][213771] Updated weights for policy 0, policy_version 86710 (0.0005) [2023-03-07 15:58:01,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 88794112. Throughput: 0: 13262.4. Samples: 88764515. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:58:01,106][213445] Avg episode reward: [(0, '4180.842')] [2023-03-07 15:58:01,621][213771] Updated weights for policy 0, policy_version 86720 (0.0007) [2023-03-07 15:58:02,387][213771] Updated weights for policy 0, policy_version 86730 (0.0006) [2023-03-07 15:58:03,149][213771] Updated weights for policy 0, policy_version 86740 (0.0005) [2023-03-07 15:58:03,911][213771] Updated weights for policy 0, policy_version 86750 (0.0007) [2023-03-07 15:58:04,673][213771] Updated weights for policy 0, policy_version 86760 (0.0006) [2023-03-07 15:58:05,451][213771] Updated weights for policy 0, policy_version 86770 (0.0005) [2023-03-07 15:58:06,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 88860672. Throughput: 0: 13261.7. Samples: 88844274. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:58:06,106][213445] Avg episode reward: [(0, '4078.926')] [2023-03-07 15:58:06,109][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000086778_88860672.pth... [2023-03-07 15:58:06,140][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000083672_85680128.pth [2023-03-07 15:58:06,226][213771] Updated weights for policy 0, policy_version 86780 (0.0005) [2023-03-07 15:58:07,008][213771] Updated weights for policy 0, policy_version 86790 (0.0006) [2023-03-07 15:58:07,776][213771] Updated weights for policy 0, policy_version 86800 (0.0006) [2023-03-07 15:58:08,552][213771] Updated weights for policy 0, policy_version 86810 (0.0008) [2023-03-07 15:58:09,320][213771] Updated weights for policy 0, policy_version 86820 (0.0006) [2023-03-07 15:58:10,078][213771] Updated weights for policy 0, policy_version 86830 (0.0006) [2023-03-07 15:58:10,866][213771] Updated weights for policy 0, policy_version 86840 (0.0006) [2023-03-07 15:58:11,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13277.9, 300 sec: 13249.5). Total num frames: 88927232. Throughput: 0: 13265.2. Samples: 88923814. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:58:11,106][213445] Avg episode reward: [(0, '4134.820')] [2023-03-07 15:58:11,643][213771] Updated weights for policy 0, policy_version 86850 (0.0005) [2023-03-07 15:58:12,404][213771] Updated weights for policy 0, policy_version 86860 (0.0006) [2023-03-07 15:58:13,197][213771] Updated weights for policy 0, policy_version 86870 (0.0006) [2023-03-07 15:58:13,956][213771] Updated weights for policy 0, policy_version 86880 (0.0005) [2023-03-07 15:58:14,709][213771] Updated weights for policy 0, policy_version 86890 (0.0006) [2023-03-07 15:58:15,481][213771] Updated weights for policy 0, policy_version 86900 (0.0006) [2023-03-07 15:58:16,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13277.9, 300 sec: 13249.5). Total num frames: 88993792. Throughput: 0: 13258.4. Samples: 88963628. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 15:58:16,106][213445] Avg episode reward: [(0, '4166.442')] [2023-03-07 15:58:16,245][213771] Updated weights for policy 0, policy_version 86910 (0.0006) [2023-03-07 15:58:17,027][213771] Updated weights for policy 0, policy_version 86920 (0.0006) [2023-03-07 15:58:17,789][213771] Updated weights for policy 0, policy_version 86930 (0.0007) [2023-03-07 15:58:18,561][213771] Updated weights for policy 0, policy_version 86940 (0.0006) [2023-03-07 15:58:19,328][213771] Updated weights for policy 0, policy_version 86950 (0.0006) [2023-03-07 15:58:20,117][213771] Updated weights for policy 0, policy_version 86960 (0.0006) [2023-03-07 15:58:20,881][213771] Updated weights for policy 0, policy_version 86970 (0.0006) [2023-03-07 15:58:21,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 89059328. Throughput: 0: 13267.5. Samples: 89043214. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:58:21,106][213445] Avg episode reward: [(0, '4201.376')] [2023-03-07 15:58:21,665][213771] Updated weights for policy 0, policy_version 86980 (0.0006) [2023-03-07 15:58:22,442][213771] Updated weights for policy 0, policy_version 86990 (0.0007) [2023-03-07 15:58:23,204][213771] Updated weights for policy 0, policy_version 87000 (0.0006) [2023-03-07 15:58:23,966][213771] Updated weights for policy 0, policy_version 87010 (0.0006) [2023-03-07 15:58:24,745][213771] Updated weights for policy 0, policy_version 87020 (0.0006) [2023-03-07 15:58:25,523][213771] Updated weights for policy 0, policy_version 87030 (0.0006) [2023-03-07 15:58:26,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 89125888. Throughput: 0: 13273.6. Samples: 89122865. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:58:26,106][213445] Avg episode reward: [(0, '4218.573')] [2023-03-07 15:58:26,297][213771] Updated weights for policy 0, policy_version 87040 (0.0006) [2023-03-07 15:58:27,073][213771] Updated weights for policy 0, policy_version 87050 (0.0007) [2023-03-07 15:58:27,849][213771] Updated weights for policy 0, policy_version 87060 (0.0006) [2023-03-07 15:58:28,608][213771] Updated weights for policy 0, policy_version 87070 (0.0006) [2023-03-07 15:58:29,369][213771] Updated weights for policy 0, policy_version 87080 (0.0006) [2023-03-07 15:58:30,148][213771] Updated weights for policy 0, policy_version 87090 (0.0006) [2023-03-07 15:58:30,911][213771] Updated weights for policy 0, policy_version 87100 (0.0006) [2023-03-07 15:58:31,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 89192448. Throughput: 0: 13278.4. Samples: 89162742. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:58:31,106][213445] Avg episode reward: [(0, '4190.316')] [2023-03-07 15:58:31,693][213771] Updated weights for policy 0, policy_version 87110 (0.0007) [2023-03-07 15:58:32,474][213771] Updated weights for policy 0, policy_version 87120 (0.0006) [2023-03-07 15:58:33,246][213771] Updated weights for policy 0, policy_version 87130 (0.0007) [2023-03-07 15:58:34,023][213771] Updated weights for policy 0, policy_version 87140 (0.0006) [2023-03-07 15:58:34,796][213771] Updated weights for policy 0, policy_version 87150 (0.0006) [2023-03-07 15:58:35,577][213771] Updated weights for policy 0, policy_version 87160 (0.0006) [2023-03-07 15:58:36,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 89257984. Throughput: 0: 13269.2. Samples: 89242137. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:58:36,106][213445] Avg episode reward: [(0, '4129.880')] [2023-03-07 15:58:36,348][213771] Updated weights for policy 0, policy_version 87170 (0.0006) [2023-03-07 15:58:37,118][213771] Updated weights for policy 0, policy_version 87180 (0.0006) [2023-03-07 15:58:37,901][213771] Updated weights for policy 0, policy_version 87190 (0.0007) [2023-03-07 15:58:38,671][213771] Updated weights for policy 0, policy_version 87200 (0.0006) [2023-03-07 15:58:39,437][213771] Updated weights for policy 0, policy_version 87210 (0.0006) [2023-03-07 15:58:40,204][213771] Updated weights for policy 0, policy_version 87220 (0.0006) [2023-03-07 15:58:40,990][213771] Updated weights for policy 0, policy_version 87230 (0.0006) [2023-03-07 15:58:41,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 89324544. Throughput: 0: 13261.0. Samples: 89321599. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:58:41,106][213445] Avg episode reward: [(0, '4174.694')] [2023-03-07 15:58:41,763][213771] Updated weights for policy 0, policy_version 87240 (0.0005) [2023-03-07 15:58:42,549][213771] Updated weights for policy 0, policy_version 87250 (0.0007) [2023-03-07 15:58:43,315][213771] Updated weights for policy 0, policy_version 87260 (0.0005) [2023-03-07 15:58:44,078][213771] Updated weights for policy 0, policy_version 87270 (0.0006) [2023-03-07 15:58:44,890][213771] Updated weights for policy 0, policy_version 87280 (0.0006) [2023-03-07 15:58:45,667][213771] Updated weights for policy 0, policy_version 87290 (0.0006) [2023-03-07 15:58:46,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 89390080. Throughput: 0: 13259.7. Samples: 89361204. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:58:46,106][213445] Avg episode reward: [(0, '4190.854')] [2023-03-07 15:58:46,436][213771] Updated weights for policy 0, policy_version 87300 (0.0006) [2023-03-07 15:58:47,208][213771] Updated weights for policy 0, policy_version 87310 (0.0006) [2023-03-07 15:58:47,982][213771] Updated weights for policy 0, policy_version 87320 (0.0007) [2023-03-07 15:58:48,742][213771] Updated weights for policy 0, policy_version 87330 (0.0006) [2023-03-07 15:58:49,518][213771] Updated weights for policy 0, policy_version 87340 (0.0007) [2023-03-07 15:58:50,283][213771] Updated weights for policy 0, policy_version 87350 (0.0006) [2023-03-07 15:58:51,033][213771] Updated weights for policy 0, policy_version 87360 (0.0007) [2023-03-07 15:58:51,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 89456640. Throughput: 0: 13246.7. Samples: 89440375. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:58:51,106][213445] Avg episode reward: [(0, '4147.453')] [2023-03-07 15:58:51,811][213771] Updated weights for policy 0, policy_version 87370 (0.0006) [2023-03-07 15:58:52,584][213771] Updated weights for policy 0, policy_version 87380 (0.0006) [2023-03-07 15:58:53,346][213771] Updated weights for policy 0, policy_version 87390 (0.0006) [2023-03-07 15:58:54,114][213771] Updated weights for policy 0, policy_version 87400 (0.0006) [2023-03-07 15:58:54,905][213771] Updated weights for policy 0, policy_version 87410 (0.0006) [2023-03-07 15:58:55,696][213771] Updated weights for policy 0, policy_version 87420 (0.0007) [2023-03-07 15:58:56,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13243.8, 300 sec: 13249.5). Total num frames: 89523200. Throughput: 0: 13251.1. Samples: 89520113. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:58:56,106][213445] Avg episode reward: [(0, '4132.742')] [2023-03-07 15:58:56,463][213771] Updated weights for policy 0, policy_version 87430 (0.0006) [2023-03-07 15:58:57,233][213771] Updated weights for policy 0, policy_version 87440 (0.0005) [2023-03-07 15:58:58,001][213771] Updated weights for policy 0, policy_version 87450 (0.0006) [2023-03-07 15:58:58,774][213771] Updated weights for policy 0, policy_version 87460 (0.0005) [2023-03-07 15:58:59,566][213771] Updated weights for policy 0, policy_version 87470 (0.0006) [2023-03-07 15:59:00,340][213771] Updated weights for policy 0, policy_version 87480 (0.0006) [2023-03-07 15:59:01,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 89588736. Throughput: 0: 13249.6. Samples: 89559861. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:59:01,106][213445] Avg episode reward: [(0, '4181.253')] [2023-03-07 15:59:01,108][213771] Updated weights for policy 0, policy_version 87490 (0.0006) [2023-03-07 15:59:01,905][213771] Updated weights for policy 0, policy_version 87500 (0.0006) [2023-03-07 15:59:02,677][213771] Updated weights for policy 0, policy_version 87510 (0.0006) [2023-03-07 15:59:03,443][213771] Updated weights for policy 0, policy_version 87520 (0.0006) [2023-03-07 15:59:04,233][213771] Updated weights for policy 0, policy_version 87530 (0.0007) [2023-03-07 15:59:05,010][213771] Updated weights for policy 0, policy_version 87540 (0.0006) [2023-03-07 15:59:05,778][213771] Updated weights for policy 0, policy_version 87550 (0.0006) [2023-03-07 15:59:06,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13253.0). Total num frames: 89655296. Throughput: 0: 13231.8. Samples: 89638645. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:59:06,106][213445] Avg episode reward: [(0, '4134.283')] [2023-03-07 15:59:06,567][213771] Updated weights for policy 0, policy_version 87560 (0.0006) [2023-03-07 15:59:07,336][213771] Updated weights for policy 0, policy_version 87570 (0.0006) [2023-03-07 15:59:08,111][213771] Updated weights for policy 0, policy_version 87580 (0.0008) [2023-03-07 15:59:08,889][213771] Updated weights for policy 0, policy_version 87590 (0.0006) [2023-03-07 15:59:09,658][213771] Updated weights for policy 0, policy_version 87600 (0.0006) [2023-03-07 15:59:10,428][213771] Updated weights for policy 0, policy_version 87610 (0.0006) [2023-03-07 15:59:11,105][213445] Fps is (10 sec: 13209.1, 60 sec: 13226.6, 300 sec: 13249.5). Total num frames: 89720832. Throughput: 0: 13227.0. Samples: 89718083. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:59:11,106][213445] Avg episode reward: [(0, '4154.021')] [2023-03-07 15:59:11,197][213771] Updated weights for policy 0, policy_version 87620 (0.0006) [2023-03-07 15:59:11,953][213771] Updated weights for policy 0, policy_version 87630 (0.0006) [2023-03-07 15:59:12,710][213771] Updated weights for policy 0, policy_version 87640 (0.0005) [2023-03-07 15:59:13,503][213771] Updated weights for policy 0, policy_version 87650 (0.0006) [2023-03-07 15:59:14,257][213771] Updated weights for policy 0, policy_version 87660 (0.0006) [2023-03-07 15:59:15,017][213771] Updated weights for policy 0, policy_version 87670 (0.0006) [2023-03-07 15:59:15,794][213771] Updated weights for policy 0, policy_version 87680 (0.0006) [2023-03-07 15:59:16,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.6, 300 sec: 13253.0). Total num frames: 89787392. Throughput: 0: 13230.7. Samples: 89758122. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:59:16,106][213445] Avg episode reward: [(0, '4151.522')] [2023-03-07 15:59:16,567][213771] Updated weights for policy 0, policy_version 87690 (0.0007) [2023-03-07 15:59:17,335][213771] Updated weights for policy 0, policy_version 87700 (0.0006) [2023-03-07 15:59:18,126][213771] Updated weights for policy 0, policy_version 87710 (0.0005) [2023-03-07 15:59:18,890][213771] Updated weights for policy 0, policy_version 87720 (0.0006) [2023-03-07 15:59:19,644][213771] Updated weights for policy 0, policy_version 87730 (0.0005) [2023-03-07 15:59:20,452][213771] Updated weights for policy 0, policy_version 87740 (0.0006) [2023-03-07 15:59:21,105][213445] Fps is (10 sec: 13312.4, 60 sec: 13243.7, 300 sec: 13253.0). Total num frames: 89853952. Throughput: 0: 13237.3. Samples: 89837818. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:59:21,106][213445] Avg episode reward: [(0, '4071.743')] [2023-03-07 15:59:21,212][213771] Updated weights for policy 0, policy_version 87750 (0.0006) [2023-03-07 15:59:21,992][213771] Updated weights for policy 0, policy_version 87760 (0.0006) [2023-03-07 15:59:22,781][213771] Updated weights for policy 0, policy_version 87770 (0.0006) [2023-03-07 15:59:23,533][213771] Updated weights for policy 0, policy_version 87780 (0.0006) [2023-03-07 15:59:24,326][213771] Updated weights for policy 0, policy_version 87790 (0.0006) [2023-03-07 15:59:25,083][213771] Updated weights for policy 0, policy_version 87800 (0.0006) [2023-03-07 15:59:25,876][213771] Updated weights for policy 0, policy_version 87810 (0.0006) [2023-03-07 15:59:26,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13243.8, 300 sec: 13256.5). Total num frames: 89920512. Throughput: 0: 13230.6. Samples: 89916975. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:59:26,105][213445] Avg episode reward: [(0, '4109.486')] [2023-03-07 15:59:26,637][213771] Updated weights for policy 0, policy_version 87820 (0.0005) [2023-03-07 15:59:27,400][213771] Updated weights for policy 0, policy_version 87830 (0.0006) [2023-03-07 15:59:28,176][213771] Updated weights for policy 0, policy_version 87840 (0.0007) [2023-03-07 15:59:28,942][213771] Updated weights for policy 0, policy_version 87850 (0.0006) [2023-03-07 15:59:29,724][213771] Updated weights for policy 0, policy_version 87860 (0.0006) [2023-03-07 15:59:30,507][213771] Updated weights for policy 0, policy_version 87870 (0.0006) [2023-03-07 15:59:31,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13253.0). Total num frames: 89986048. Throughput: 0: 13236.4. Samples: 89956842. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:59:31,106][213445] Avg episode reward: [(0, '4149.032')] [2023-03-07 15:59:31,285][213771] Updated weights for policy 0, policy_version 87880 (0.0006) [2023-03-07 15:59:32,031][213771] Updated weights for policy 0, policy_version 87890 (0.0006) [2023-03-07 15:59:32,818][213771] Updated weights for policy 0, policy_version 87900 (0.0006) [2023-03-07 15:59:33,585][213771] Updated weights for policy 0, policy_version 87910 (0.0006) [2023-03-07 15:59:34,351][213771] Updated weights for policy 0, policy_version 87920 (0.0006) [2023-03-07 15:59:35,140][213771] Updated weights for policy 0, policy_version 87930 (0.0006) [2023-03-07 15:59:35,905][213771] Updated weights for policy 0, policy_version 87940 (0.0005) [2023-03-07 15:59:36,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13243.7, 300 sec: 13256.5). Total num frames: 90052608. Throughput: 0: 13240.6. Samples: 90036203. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:59:36,106][213445] Avg episode reward: [(0, '4144.667')] [2023-03-07 15:59:36,686][213771] Updated weights for policy 0, policy_version 87950 (0.0005) [2023-03-07 15:59:37,462][213771] Updated weights for policy 0, policy_version 87960 (0.0006) [2023-03-07 15:59:38,243][213771] Updated weights for policy 0, policy_version 87970 (0.0007) [2023-03-07 15:59:39,004][213771] Updated weights for policy 0, policy_version 87980 (0.0006) [2023-03-07 15:59:39,795][213771] Updated weights for policy 0, policy_version 87990 (0.0006) [2023-03-07 15:59:40,558][213771] Updated weights for policy 0, policy_version 88000 (0.0006) [2023-03-07 15:59:41,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13243.8, 300 sec: 13256.5). Total num frames: 90119168. Throughput: 0: 13233.1. Samples: 90115600. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:59:41,105][213445] Avg episode reward: [(0, '4139.000')] [2023-03-07 15:59:41,342][213771] Updated weights for policy 0, policy_version 88010 (0.0007) [2023-03-07 15:59:42,109][213771] Updated weights for policy 0, policy_version 88020 (0.0007) [2023-03-07 15:59:42,882][213771] Updated weights for policy 0, policy_version 88030 (0.0006) [2023-03-07 15:59:43,650][213771] Updated weights for policy 0, policy_version 88040 (0.0006) [2023-03-07 15:59:44,424][213771] Updated weights for policy 0, policy_version 88050 (0.0006) [2023-03-07 15:59:45,219][213771] Updated weights for policy 0, policy_version 88060 (0.0006) [2023-03-07 15:59:45,978][213771] Updated weights for policy 0, policy_version 88070 (0.0006) [2023-03-07 15:59:46,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.8, 300 sec: 13253.0). Total num frames: 90184704. Throughput: 0: 13232.3. Samples: 90155314. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:59:46,106][213445] Avg episode reward: [(0, '4049.251')] [2023-03-07 15:59:46,752][213771] Updated weights for policy 0, policy_version 88080 (0.0006) [2023-03-07 15:59:47,530][213771] Updated weights for policy 0, policy_version 88090 (0.0006) [2023-03-07 15:59:48,312][213771] Updated weights for policy 0, policy_version 88100 (0.0006) [2023-03-07 15:59:49,101][213771] Updated weights for policy 0, policy_version 88110 (0.0006) [2023-03-07 15:59:49,866][213771] Updated weights for policy 0, policy_version 88120 (0.0005) [2023-03-07 15:59:50,639][213771] Updated weights for policy 0, policy_version 88130 (0.0005) [2023-03-07 15:59:51,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.8, 300 sec: 13253.0). Total num frames: 90251264. Throughput: 0: 13240.9. Samples: 90234485. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:59:51,105][213445] Avg episode reward: [(0, '4096.863')] [2023-03-07 15:59:51,419][213771] Updated weights for policy 0, policy_version 88140 (0.0006) [2023-03-07 15:59:52,190][213771] Updated weights for policy 0, policy_version 88150 (0.0006) [2023-03-07 15:59:52,958][213771] Updated weights for policy 0, policy_version 88160 (0.0007) [2023-03-07 15:59:53,735][213771] Updated weights for policy 0, policy_version 88170 (0.0006) [2023-03-07 15:59:54,511][213771] Updated weights for policy 0, policy_version 88180 (0.0007) [2023-03-07 15:59:55,265][213771] Updated weights for policy 0, policy_version 88190 (0.0007) [2023-03-07 15:59:56,029][213771] Updated weights for policy 0, policy_version 88200 (0.0006) [2023-03-07 15:59:56,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13243.7, 300 sec: 13253.0). Total num frames: 90317824. Throughput: 0: 13245.3. Samples: 90314115. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 15:59:56,106][213445] Avg episode reward: [(0, '4098.040')] [2023-03-07 15:59:56,793][213771] Updated weights for policy 0, policy_version 88210 (0.0007) [2023-03-07 15:59:57,571][213771] Updated weights for policy 0, policy_version 88220 (0.0006) [2023-03-07 15:59:58,338][213771] Updated weights for policy 0, policy_version 88230 (0.0006) [2023-03-07 15:59:59,123][213771] Updated weights for policy 0, policy_version 88240 (0.0006) [2023-03-07 15:59:59,892][213771] Updated weights for policy 0, policy_version 88250 (0.0005) [2023-03-07 16:00:00,643][213771] Updated weights for policy 0, policy_version 88260 (0.0007) [2023-03-07 16:00:01,105][213445] Fps is (10 sec: 13209.3, 60 sec: 13243.7, 300 sec: 13253.0). Total num frames: 90383360. Throughput: 0: 13244.0. Samples: 90354103. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:00:01,106][213445] Avg episode reward: [(0, '3910.578')] [2023-03-07 16:00:01,433][213771] Updated weights for policy 0, policy_version 88270 (0.0007) [2023-03-07 16:00:02,217][213771] Updated weights for policy 0, policy_version 88280 (0.0006) [2023-03-07 16:00:02,992][213771] Updated weights for policy 0, policy_version 88290 (0.0006) [2023-03-07 16:00:03,758][213771] Updated weights for policy 0, policy_version 88300 (0.0006) [2023-03-07 16:00:04,537][213771] Updated weights for policy 0, policy_version 88310 (0.0006) [2023-03-07 16:00:05,311][213771] Updated weights for policy 0, policy_version 88320 (0.0006) [2023-03-07 16:00:06,077][213771] Updated weights for policy 0, policy_version 88330 (0.0006) [2023-03-07 16:00:06,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13253.0). Total num frames: 90449920. Throughput: 0: 13233.3. Samples: 90433318. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:00:06,106][213445] Avg episode reward: [(0, '4053.969')] [2023-03-07 16:00:06,110][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000088330_90449920.pth... [2023-03-07 16:00:06,140][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000085224_87269376.pth [2023-03-07 16:00:06,871][213771] Updated weights for policy 0, policy_version 88340 (0.0006) [2023-03-07 16:00:07,638][213771] Updated weights for policy 0, policy_version 88350 (0.0006) [2023-03-07 16:00:08,416][213771] Updated weights for policy 0, policy_version 88360 (0.0006) [2023-03-07 16:00:09,199][213771] Updated weights for policy 0, policy_version 88370 (0.0006) [2023-03-07 16:00:09,970][213771] Updated weights for policy 0, policy_version 88380 (0.0007) [2023-03-07 16:00:10,749][213771] Updated weights for policy 0, policy_version 88390 (0.0006) [2023-03-07 16:00:11,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13243.8, 300 sec: 13253.0). Total num frames: 90515456. Throughput: 0: 13235.4. Samples: 90512567. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:00:11,106][213445] Avg episode reward: [(0, '4121.291')] [2023-03-07 16:00:11,510][213771] Updated weights for policy 0, policy_version 88400 (0.0007) [2023-03-07 16:00:12,284][213771] Updated weights for policy 0, policy_version 88410 (0.0006) [2023-03-07 16:00:13,037][213771] Updated weights for policy 0, policy_version 88420 (0.0006) [2023-03-07 16:00:13,811][213771] Updated weights for policy 0, policy_version 88430 (0.0005) [2023-03-07 16:00:14,604][213771] Updated weights for policy 0, policy_version 88440 (0.0005) [2023-03-07 16:00:15,346][213771] Updated weights for policy 0, policy_version 88450 (0.0006) [2023-03-07 16:00:16,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13253.0). Total num frames: 90582016. Throughput: 0: 13239.5. Samples: 90552620. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:00:16,106][213445] Avg episode reward: [(0, '4103.127')] [2023-03-07 16:00:16,122][213771] Updated weights for policy 0, policy_version 88460 (0.0006) [2023-03-07 16:00:16,887][213771] Updated weights for policy 0, policy_version 88470 (0.0006) [2023-03-07 16:00:17,665][213771] Updated weights for policy 0, policy_version 88480 (0.0006) [2023-03-07 16:00:18,433][213771] Updated weights for policy 0, policy_version 88490 (0.0006) [2023-03-07 16:00:19,203][213771] Updated weights for policy 0, policy_version 88500 (0.0006) [2023-03-07 16:00:19,974][213771] Updated weights for policy 0, policy_version 88510 (0.0007) [2023-03-07 16:00:20,757][213771] Updated weights for policy 0, policy_version 88520 (0.0006) [2023-03-07 16:00:21,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13243.7, 300 sec: 13253.0). Total num frames: 90648576. Throughput: 0: 13246.1. Samples: 90632275. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:00:21,106][213445] Avg episode reward: [(0, '4014.866')] [2023-03-07 16:00:21,524][213771] Updated weights for policy 0, policy_version 88530 (0.0006) [2023-03-07 16:00:22,286][213771] Updated weights for policy 0, policy_version 88540 (0.0006) [2023-03-07 16:00:23,067][213771] Updated weights for policy 0, policy_version 88550 (0.0006) [2023-03-07 16:00:23,828][213771] Updated weights for policy 0, policy_version 88560 (0.0006) [2023-03-07 16:00:24,617][213771] Updated weights for policy 0, policy_version 88570 (0.0006) [2023-03-07 16:00:25,394][213771] Updated weights for policy 0, policy_version 88580 (0.0006) [2023-03-07 16:00:26,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13243.7, 300 sec: 13253.0). Total num frames: 90715136. Throughput: 0: 13249.0. Samples: 90711808. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:00:26,106][213445] Avg episode reward: [(0, '4028.525')] [2023-03-07 16:00:26,164][213771] Updated weights for policy 0, policy_version 88590 (0.0006) [2023-03-07 16:00:26,924][213771] Updated weights for policy 0, policy_version 88600 (0.0006) [2023-03-07 16:00:27,702][213771] Updated weights for policy 0, policy_version 88610 (0.0007) [2023-03-07 16:00:28,465][213771] Updated weights for policy 0, policy_version 88620 (0.0007) [2023-03-07 16:00:29,237][213771] Updated weights for policy 0, policy_version 88630 (0.0006) [2023-03-07 16:00:30,020][213771] Updated weights for policy 0, policy_version 88640 (0.0006) [2023-03-07 16:00:30,785][213771] Updated weights for policy 0, policy_version 88650 (0.0006) [2023-03-07 16:00:31,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13256.5). Total num frames: 90781696. Throughput: 0: 13253.3. Samples: 90751714. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:00:31,106][213445] Avg episode reward: [(0, '4007.999')] [2023-03-07 16:00:31,553][213771] Updated weights for policy 0, policy_version 88660 (0.0006) [2023-03-07 16:00:32,338][213771] Updated weights for policy 0, policy_version 88670 (0.0008) [2023-03-07 16:00:33,105][213771] Updated weights for policy 0, policy_version 88680 (0.0006) [2023-03-07 16:00:33,881][213771] Updated weights for policy 0, policy_version 88690 (0.0006) [2023-03-07 16:00:34,655][213771] Updated weights for policy 0, policy_version 88700 (0.0006) [2023-03-07 16:00:35,440][213771] Updated weights for policy 0, policy_version 88710 (0.0005) [2023-03-07 16:00:36,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13253.0). Total num frames: 90847232. Throughput: 0: 13259.3. Samples: 90831153. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:00:36,106][213445] Avg episode reward: [(0, '4127.745')] [2023-03-07 16:00:36,209][213771] Updated weights for policy 0, policy_version 88720 (0.0006) [2023-03-07 16:00:36,987][213771] Updated weights for policy 0, policy_version 88730 (0.0006) [2023-03-07 16:00:37,754][213771] Updated weights for policy 0, policy_version 88740 (0.0006) [2023-03-07 16:00:38,529][213771] Updated weights for policy 0, policy_version 88750 (0.0006) [2023-03-07 16:00:39,290][213771] Updated weights for policy 0, policy_version 88760 (0.0006) [2023-03-07 16:00:40,067][213771] Updated weights for policy 0, policy_version 88770 (0.0005) [2023-03-07 16:00:40,833][213771] Updated weights for policy 0, policy_version 88780 (0.0006) [2023-03-07 16:00:41,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13256.5). Total num frames: 90913792. Throughput: 0: 13258.5. Samples: 90910745. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:00:41,105][213445] Avg episode reward: [(0, '4097.651')] [2023-03-07 16:00:41,599][213771] Updated weights for policy 0, policy_version 88790 (0.0006) [2023-03-07 16:00:42,366][213771] Updated weights for policy 0, policy_version 88800 (0.0006) [2023-03-07 16:00:43,142][213771] Updated weights for policy 0, policy_version 88810 (0.0006) [2023-03-07 16:00:43,906][213771] Updated weights for policy 0, policy_version 88820 (0.0005) [2023-03-07 16:00:44,691][213771] Updated weights for policy 0, policy_version 88830 (0.0006) [2023-03-07 16:00:45,446][213771] Updated weights for policy 0, policy_version 88840 (0.0006) [2023-03-07 16:00:46,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13256.5). Total num frames: 90980352. Throughput: 0: 13258.0. Samples: 90950709. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:00:46,105][213445] Avg episode reward: [(0, '4135.912')] [2023-03-07 16:00:46,230][213771] Updated weights for policy 0, policy_version 88850 (0.0006) [2023-03-07 16:00:47,007][213771] Updated weights for policy 0, policy_version 88860 (0.0007) [2023-03-07 16:00:47,780][213771] Updated weights for policy 0, policy_version 88870 (0.0006) [2023-03-07 16:00:48,555][213771] Updated weights for policy 0, policy_version 88880 (0.0006) [2023-03-07 16:00:49,313][213771] Updated weights for policy 0, policy_version 88890 (0.0005) [2023-03-07 16:00:50,072][213771] Updated weights for policy 0, policy_version 88900 (0.0006) [2023-03-07 16:00:50,838][213771] Updated weights for policy 0, policy_version 88910 (0.0005) [2023-03-07 16:00:51,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13260.8, 300 sec: 13256.5). Total num frames: 91046912. Throughput: 0: 13268.5. Samples: 91030399. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:00:51,106][213445] Avg episode reward: [(0, '4161.398')] [2023-03-07 16:00:51,605][213771] Updated weights for policy 0, policy_version 88920 (0.0005) [2023-03-07 16:00:52,377][213771] Updated weights for policy 0, policy_version 88930 (0.0007) [2023-03-07 16:00:53,142][213771] Updated weights for policy 0, policy_version 88940 (0.0006) [2023-03-07 16:00:53,937][213771] Updated weights for policy 0, policy_version 88950 (0.0006) [2023-03-07 16:00:54,701][213771] Updated weights for policy 0, policy_version 88960 (0.0005) [2023-03-07 16:00:55,476][213771] Updated weights for policy 0, policy_version 88970 (0.0006) [2023-03-07 16:00:56,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13260.8, 300 sec: 13256.5). Total num frames: 91113472. Throughput: 0: 13274.8. Samples: 91109937. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:00:56,106][213445] Avg episode reward: [(0, '4135.783')] [2023-03-07 16:00:56,253][213771] Updated weights for policy 0, policy_version 88980 (0.0006) [2023-03-07 16:00:57,032][213771] Updated weights for policy 0, policy_version 88990 (0.0006) [2023-03-07 16:00:57,828][213771] Updated weights for policy 0, policy_version 89000 (0.0008) [2023-03-07 16:00:58,594][213771] Updated weights for policy 0, policy_version 89010 (0.0006) [2023-03-07 16:00:59,366][213771] Updated weights for policy 0, policy_version 89020 (0.0006) [2023-03-07 16:01:00,152][213771] Updated weights for policy 0, policy_version 89030 (0.0007) [2023-03-07 16:01:00,928][213771] Updated weights for policy 0, policy_version 89040 (0.0006) [2023-03-07 16:01:01,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13260.8, 300 sec: 13256.5). Total num frames: 91179008. Throughput: 0: 13262.2. Samples: 91149415. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:01:01,105][213445] Avg episode reward: [(0, '4031.367')] [2023-03-07 16:01:01,693][213771] Updated weights for policy 0, policy_version 89050 (0.0006) [2023-03-07 16:01:02,482][213771] Updated weights for policy 0, policy_version 89060 (0.0006) [2023-03-07 16:01:03,239][213771] Updated weights for policy 0, policy_version 89070 (0.0008) [2023-03-07 16:01:04,025][213771] Updated weights for policy 0, policy_version 89080 (0.0007) [2023-03-07 16:01:04,798][213771] Updated weights for policy 0, policy_version 89090 (0.0007) [2023-03-07 16:01:05,570][213771] Updated weights for policy 0, policy_version 89100 (0.0006) [2023-03-07 16:01:06,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13260.8, 300 sec: 13256.5). Total num frames: 91245568. Throughput: 0: 13250.8. Samples: 91228561. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:01:06,106][213445] Avg episode reward: [(0, '4040.314')] [2023-03-07 16:01:06,331][213771] Updated weights for policy 0, policy_version 89110 (0.0006) [2023-03-07 16:01:07,101][213771] Updated weights for policy 0, policy_version 89120 (0.0006) [2023-03-07 16:01:07,881][213771] Updated weights for policy 0, policy_version 89130 (0.0006) [2023-03-07 16:01:08,659][213771] Updated weights for policy 0, policy_version 89140 (0.0007) [2023-03-07 16:01:09,432][213771] Updated weights for policy 0, policy_version 89150 (0.0006) [2023-03-07 16:01:10,219][213771] Updated weights for policy 0, policy_version 89160 (0.0006) [2023-03-07 16:01:11,004][213771] Updated weights for policy 0, policy_version 89170 (0.0007) [2023-03-07 16:01:11,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 91311104. Throughput: 0: 13248.5. Samples: 91307992. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:01:11,106][213445] Avg episode reward: [(0, '4193.626')] [2023-03-07 16:01:11,778][213771] Updated weights for policy 0, policy_version 89180 (0.0006) [2023-03-07 16:01:12,553][213771] Updated weights for policy 0, policy_version 89190 (0.0006) [2023-03-07 16:01:13,340][213771] Updated weights for policy 0, policy_version 89200 (0.0007) [2023-03-07 16:01:14,107][213771] Updated weights for policy 0, policy_version 89210 (0.0006) [2023-03-07 16:01:14,892][213771] Updated weights for policy 0, policy_version 89220 (0.0007) [2023-03-07 16:01:15,671][213771] Updated weights for policy 0, policy_version 89230 (0.0006) [2023-03-07 16:01:16,105][213445] Fps is (10 sec: 13107.0, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 91376640. Throughput: 0: 13240.9. Samples: 91347555. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:01:16,106][213445] Avg episode reward: [(0, '4159.137')] [2023-03-07 16:01:16,430][213771] Updated weights for policy 0, policy_version 89240 (0.0006) [2023-03-07 16:01:17,189][213771] Updated weights for policy 0, policy_version 89250 (0.0007) [2023-03-07 16:01:17,965][213771] Updated weights for policy 0, policy_version 89260 (0.0006) [2023-03-07 16:01:18,736][213771] Updated weights for policy 0, policy_version 89270 (0.0007) [2023-03-07 16:01:19,522][213771] Updated weights for policy 0, policy_version 89280 (0.0006) [2023-03-07 16:01:20,299][213771] Updated weights for policy 0, policy_version 89290 (0.0006) [2023-03-07 16:01:21,069][213771] Updated weights for policy 0, policy_version 89300 (0.0006) [2023-03-07 16:01:21,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13253.0). Total num frames: 91443200. Throughput: 0: 13238.5. Samples: 91426887. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:01:21,106][213445] Avg episode reward: [(0, '4149.710')] [2023-03-07 16:01:21,851][213771] Updated weights for policy 0, policy_version 89310 (0.0008) [2023-03-07 16:01:22,632][213771] Updated weights for policy 0, policy_version 89320 (0.0006) [2023-03-07 16:01:23,405][213771] Updated weights for policy 0, policy_version 89330 (0.0006) [2023-03-07 16:01:24,182][213771] Updated weights for policy 0, policy_version 89340 (0.0007) [2023-03-07 16:01:24,945][213771] Updated weights for policy 0, policy_version 89350 (0.0006) [2023-03-07 16:01:25,720][213771] Updated weights for policy 0, policy_version 89360 (0.0006) [2023-03-07 16:01:26,105][213445] Fps is (10 sec: 13312.3, 60 sec: 13243.7, 300 sec: 13253.0). Total num frames: 91509760. Throughput: 0: 13235.1. Samples: 91506324. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:01:26,106][213445] Avg episode reward: [(0, '4159.091')] [2023-03-07 16:01:26,481][213771] Updated weights for policy 0, policy_version 89370 (0.0006) [2023-03-07 16:01:27,261][213771] Updated weights for policy 0, policy_version 89380 (0.0006) [2023-03-07 16:01:28,012][213771] Updated weights for policy 0, policy_version 89390 (0.0005) [2023-03-07 16:01:28,808][213771] Updated weights for policy 0, policy_version 89400 (0.0007) [2023-03-07 16:01:29,572][213771] Updated weights for policy 0, policy_version 89410 (0.0006) [2023-03-07 16:01:30,336][213771] Updated weights for policy 0, policy_version 89420 (0.0006) [2023-03-07 16:01:31,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13249.5). Total num frames: 91575296. Throughput: 0: 13232.0. Samples: 91546148. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:01:31,106][213445] Avg episode reward: [(0, '4186.548')] [2023-03-07 16:01:31,114][213771] Updated weights for policy 0, policy_version 89430 (0.0006) [2023-03-07 16:01:31,879][213771] Updated weights for policy 0, policy_version 89440 (0.0005) [2023-03-07 16:01:32,647][213771] Updated weights for policy 0, policy_version 89450 (0.0006) [2023-03-07 16:01:33,421][213771] Updated weights for policy 0, policy_version 89460 (0.0006) [2023-03-07 16:01:34,184][213771] Updated weights for policy 0, policy_version 89470 (0.0005) [2023-03-07 16:01:34,968][213771] Updated weights for policy 0, policy_version 89480 (0.0006) [2023-03-07 16:01:35,741][213771] Updated weights for policy 0, policy_version 89490 (0.0005) [2023-03-07 16:01:36,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 91641856. Throughput: 0: 13230.4. Samples: 91625766. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:01:36,106][213445] Avg episode reward: [(0, '4109.397')] [2023-03-07 16:01:36,506][213771] Updated weights for policy 0, policy_version 89500 (0.0006) [2023-03-07 16:01:37,286][213771] Updated weights for policy 0, policy_version 89510 (0.0006) [2023-03-07 16:01:38,056][213771] Updated weights for policy 0, policy_version 89520 (0.0006) [2023-03-07 16:01:38,829][213771] Updated weights for policy 0, policy_version 89530 (0.0007) [2023-03-07 16:01:39,610][213771] Updated weights for policy 0, policy_version 89540 (0.0006) [2023-03-07 16:01:40,376][213771] Updated weights for policy 0, policy_version 89550 (0.0006) [2023-03-07 16:01:41,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 91708416. Throughput: 0: 13230.7. Samples: 91705319. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:01:41,106][213445] Avg episode reward: [(0, '4214.484')] [2023-03-07 16:01:41,138][213771] Updated weights for policy 0, policy_version 89560 (0.0005) [2023-03-07 16:01:41,926][213771] Updated weights for policy 0, policy_version 89570 (0.0006) [2023-03-07 16:01:42,683][213771] Updated weights for policy 0, policy_version 89580 (0.0006) [2023-03-07 16:01:43,441][213771] Updated weights for policy 0, policy_version 89590 (0.0007) [2023-03-07 16:01:44,228][213771] Updated weights for policy 0, policy_version 89600 (0.0006) [2023-03-07 16:01:45,005][213771] Updated weights for policy 0, policy_version 89610 (0.0005) [2023-03-07 16:01:45,772][213771] Updated weights for policy 0, policy_version 89620 (0.0006) [2023-03-07 16:01:46,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13243.7, 300 sec: 13253.0). Total num frames: 91774976. Throughput: 0: 13240.3. Samples: 91745230. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:01:46,106][213445] Avg episode reward: [(0, '4166.943')] [2023-03-07 16:01:46,547][213771] Updated weights for policy 0, policy_version 89630 (0.0006) [2023-03-07 16:01:47,343][213771] Updated weights for policy 0, policy_version 89640 (0.0007) [2023-03-07 16:01:48,102][213771] Updated weights for policy 0, policy_version 89650 (0.0006) [2023-03-07 16:01:48,893][213771] Updated weights for policy 0, policy_version 89660 (0.0005) [2023-03-07 16:01:49,658][213771] Updated weights for policy 0, policy_version 89670 (0.0006) [2023-03-07 16:01:50,439][213771] Updated weights for policy 0, policy_version 89680 (0.0006) [2023-03-07 16:01:51,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13249.5). Total num frames: 91840512. Throughput: 0: 13240.1. Samples: 91824366. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 16:01:51,106][213445] Avg episode reward: [(0, '4180.078')] [2023-03-07 16:01:51,205][213771] Updated weights for policy 0, policy_version 89690 (0.0006) [2023-03-07 16:01:51,990][213771] Updated weights for policy 0, policy_version 89700 (0.0008) [2023-03-07 16:01:52,748][213771] Updated weights for policy 0, policy_version 89710 (0.0006) [2023-03-07 16:01:53,526][213771] Updated weights for policy 0, policy_version 89720 (0.0006) [2023-03-07 16:01:54,302][213771] Updated weights for policy 0, policy_version 89730 (0.0006) [2023-03-07 16:01:55,088][213771] Updated weights for policy 0, policy_version 89740 (0.0007) [2023-03-07 16:01:55,854][213771] Updated weights for policy 0, policy_version 89750 (0.0006) [2023-03-07 16:01:56,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13249.5). Total num frames: 91907072. Throughput: 0: 13242.8. Samples: 91903918. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 16:01:56,106][213445] Avg episode reward: [(0, '4264.240')] [2023-03-07 16:01:56,614][213771] Updated weights for policy 0, policy_version 89760 (0.0006) [2023-03-07 16:01:57,397][213771] Updated weights for policy 0, policy_version 89770 (0.0006) [2023-03-07 16:01:58,172][213771] Updated weights for policy 0, policy_version 89780 (0.0006) [2023-03-07 16:01:58,944][213771] Updated weights for policy 0, policy_version 89790 (0.0006) [2023-03-07 16:01:59,713][213771] Updated weights for policy 0, policy_version 89800 (0.0006) [2023-03-07 16:02:00,513][213771] Updated weights for policy 0, policy_version 89810 (0.0007) [2023-03-07 16:02:01,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13226.6, 300 sec: 13246.0). Total num frames: 91972608. Throughput: 0: 13245.7. Samples: 91943611. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 16:02:01,106][213445] Avg episode reward: [(0, '4277.660')] [2023-03-07 16:02:01,277][213771] Updated weights for policy 0, policy_version 89820 (0.0006) [2023-03-07 16:02:02,041][213771] Updated weights for policy 0, policy_version 89830 (0.0005) [2023-03-07 16:02:02,826][213771] Updated weights for policy 0, policy_version 89840 (0.0005) [2023-03-07 16:02:03,606][213771] Updated weights for policy 0, policy_version 89850 (0.0007) [2023-03-07 16:02:04,392][213771] Updated weights for policy 0, policy_version 89860 (0.0006) [2023-03-07 16:02:05,173][213771] Updated weights for policy 0, policy_version 89870 (0.0006) [2023-03-07 16:02:05,946][213771] Updated weights for policy 0, policy_version 89880 (0.0006) [2023-03-07 16:02:06,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13249.5). Total num frames: 92039168. Throughput: 0: 13234.8. Samples: 92022454. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 16:02:06,106][213445] Avg episode reward: [(0, '4236.800')] [2023-03-07 16:02:06,109][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000089882_92039168.pth... [2023-03-07 16:02:06,139][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000086778_88860672.pth [2023-03-07 16:02:06,717][213771] Updated weights for policy 0, policy_version 89890 (0.0007) [2023-03-07 16:02:07,490][213771] Updated weights for policy 0, policy_version 89900 (0.0006) [2023-03-07 16:02:08,255][213771] Updated weights for policy 0, policy_version 89910 (0.0006) [2023-03-07 16:02:09,038][213771] Updated weights for policy 0, policy_version 89920 (0.0007) [2023-03-07 16:02:09,818][213771] Updated weights for policy 0, policy_version 89930 (0.0005) [2023-03-07 16:02:10,585][213771] Updated weights for policy 0, policy_version 89940 (0.0006) [2023-03-07 16:02:11,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13246.0). Total num frames: 92104704. Throughput: 0: 13238.2. Samples: 92102044. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 16:02:11,106][213445] Avg episode reward: [(0, '4280.138')] [2023-03-07 16:02:11,332][213771] Updated weights for policy 0, policy_version 89950 (0.0006) [2023-03-07 16:02:12,138][213771] Updated weights for policy 0, policy_version 89960 (0.0006) [2023-03-07 16:02:12,908][213771] Updated weights for policy 0, policy_version 89970 (0.0006) [2023-03-07 16:02:13,685][213771] Updated weights for policy 0, policy_version 89980 (0.0006) [2023-03-07 16:02:14,473][213771] Updated weights for policy 0, policy_version 89990 (0.0006) [2023-03-07 16:02:15,234][213771] Updated weights for policy 0, policy_version 90000 (0.0006) [2023-03-07 16:02:15,994][213771] Updated weights for policy 0, policy_version 90010 (0.0005) [2023-03-07 16:02:16,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.8, 300 sec: 13246.0). Total num frames: 92171264. Throughput: 0: 13231.2. Samples: 92141554. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 16:02:16,106][213445] Avg episode reward: [(0, '4240.935')] [2023-03-07 16:02:16,763][213771] Updated weights for policy 0, policy_version 90020 (0.0006) [2023-03-07 16:02:17,520][213771] Updated weights for policy 0, policy_version 90030 (0.0006) [2023-03-07 16:02:18,284][213771] Updated weights for policy 0, policy_version 90040 (0.0007) [2023-03-07 16:02:19,071][213771] Updated weights for policy 0, policy_version 90050 (0.0006) [2023-03-07 16:02:19,844][213771] Updated weights for policy 0, policy_version 90060 (0.0006) [2023-03-07 16:02:20,591][213771] Updated weights for policy 0, policy_version 90070 (0.0006) [2023-03-07 16:02:21,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 92237824. Throughput: 0: 13236.5. Samples: 92221408. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 16:02:21,106][213445] Avg episode reward: [(0, '4239.777')] [2023-03-07 16:02:21,378][213771] Updated weights for policy 0, policy_version 90080 (0.0006) [2023-03-07 16:02:22,155][213771] Updated weights for policy 0, policy_version 90090 (0.0008) [2023-03-07 16:02:22,905][213771] Updated weights for policy 0, policy_version 90100 (0.0006) [2023-03-07 16:02:23,681][213771] Updated weights for policy 0, policy_version 90110 (0.0006) [2023-03-07 16:02:24,450][213771] Updated weights for policy 0, policy_version 90120 (0.0005) [2023-03-07 16:02:25,233][213771] Updated weights for policy 0, policy_version 90130 (0.0006) [2023-03-07 16:02:25,998][213771] Updated weights for policy 0, policy_version 90140 (0.0005) [2023-03-07 16:02:26,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13243.7, 300 sec: 13246.1). Total num frames: 92304384. Throughput: 0: 13240.7. Samples: 92301148. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 16:02:26,106][213445] Avg episode reward: [(0, '4292.261')] [2023-03-07 16:02:26,778][213771] Updated weights for policy 0, policy_version 90150 (0.0006) [2023-03-07 16:02:27,550][213771] Updated weights for policy 0, policy_version 90160 (0.0006) [2023-03-07 16:02:28,323][213771] Updated weights for policy 0, policy_version 90170 (0.0006) [2023-03-07 16:02:29,074][213771] Updated weights for policy 0, policy_version 90180 (0.0005) [2023-03-07 16:02:29,845][213771] Updated weights for policy 0, policy_version 90190 (0.0006) [2023-03-07 16:02:30,625][213771] Updated weights for policy 0, policy_version 90200 (0.0007) [2023-03-07 16:02:31,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 92370944. Throughput: 0: 13239.6. Samples: 92341011. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 16:02:31,106][213445] Avg episode reward: [(0, '4194.910')] [2023-03-07 16:02:31,390][213771] Updated weights for policy 0, policy_version 90210 (0.0007) [2023-03-07 16:02:32,162][213771] Updated weights for policy 0, policy_version 90220 (0.0006) [2023-03-07 16:02:32,927][213771] Updated weights for policy 0, policy_version 90230 (0.0006) [2023-03-07 16:02:33,712][213771] Updated weights for policy 0, policy_version 90240 (0.0007) [2023-03-07 16:02:34,494][213771] Updated weights for policy 0, policy_version 90250 (0.0006) [2023-03-07 16:02:35,257][213771] Updated weights for policy 0, policy_version 90260 (0.0005) [2023-03-07 16:02:36,036][213771] Updated weights for policy 0, policy_version 90270 (0.0006) [2023-03-07 16:02:36,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 92436480. Throughput: 0: 13247.8. Samples: 92420518. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 16:02:36,106][213445] Avg episode reward: [(0, '4270.813')] [2023-03-07 16:02:36,801][213771] Updated weights for policy 0, policy_version 90280 (0.0005) [2023-03-07 16:02:37,564][213771] Updated weights for policy 0, policy_version 90290 (0.0006) [2023-03-07 16:02:38,349][213771] Updated weights for policy 0, policy_version 90300 (0.0007) [2023-03-07 16:02:39,111][213771] Updated weights for policy 0, policy_version 90310 (0.0006) [2023-03-07 16:02:39,884][213771] Updated weights for policy 0, policy_version 90320 (0.0006) [2023-03-07 16:02:40,654][213771] Updated weights for policy 0, policy_version 90330 (0.0007) [2023-03-07 16:02:41,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 92503040. Throughput: 0: 13250.6. Samples: 92500196. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 16:02:41,106][213445] Avg episode reward: [(0, '4271.122')] [2023-03-07 16:02:41,417][213771] Updated weights for policy 0, policy_version 90340 (0.0006) [2023-03-07 16:02:42,199][213771] Updated weights for policy 0, policy_version 90350 (0.0006) [2023-03-07 16:02:42,969][213771] Updated weights for policy 0, policy_version 90360 (0.0006) [2023-03-07 16:02:43,737][213771] Updated weights for policy 0, policy_version 90370 (0.0007) [2023-03-07 16:02:44,518][213771] Updated weights for policy 0, policy_version 90380 (0.0006) [2023-03-07 16:02:45,272][213771] Updated weights for policy 0, policy_version 90390 (0.0007) [2023-03-07 16:02:46,052][213771] Updated weights for policy 0, policy_version 90400 (0.0006) [2023-03-07 16:02:46,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 92569600. Throughput: 0: 13256.6. Samples: 92540158. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:02:46,106][213445] Avg episode reward: [(0, '4305.445')] [2023-03-07 16:02:46,842][213771] Updated weights for policy 0, policy_version 90410 (0.0006) [2023-03-07 16:02:47,631][213771] Updated weights for policy 0, policy_version 90420 (0.0006) [2023-03-07 16:02:48,397][213771] Updated weights for policy 0, policy_version 90430 (0.0006) [2023-03-07 16:02:49,183][213771] Updated weights for policy 0, policy_version 90440 (0.0006) [2023-03-07 16:02:49,945][213771] Updated weights for policy 0, policy_version 90450 (0.0006) [2023-03-07 16:02:50,712][213771] Updated weights for policy 0, policy_version 90460 (0.0008) [2023-03-07 16:02:51,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13260.8, 300 sec: 13246.0). Total num frames: 92636160. Throughput: 0: 13264.8. Samples: 92619370. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:02:51,106][213445] Avg episode reward: [(0, '4273.980')] [2023-03-07 16:02:51,488][213771] Updated weights for policy 0, policy_version 90470 (0.0006) [2023-03-07 16:02:52,239][213771] Updated weights for policy 0, policy_version 90480 (0.0006) [2023-03-07 16:02:53,016][213771] Updated weights for policy 0, policy_version 90490 (0.0006) [2023-03-07 16:02:53,788][213771] Updated weights for policy 0, policy_version 90500 (0.0006) [2023-03-07 16:02:54,553][213771] Updated weights for policy 0, policy_version 90510 (0.0007) [2023-03-07 16:02:55,327][213771] Updated weights for policy 0, policy_version 90520 (0.0006) [2023-03-07 16:02:56,091][213771] Updated weights for policy 0, policy_version 90530 (0.0005) [2023-03-07 16:02:56,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 92702720. Throughput: 0: 13270.1. Samples: 92699201. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:02:56,106][213445] Avg episode reward: [(0, '4274.105')] [2023-03-07 16:02:56,861][213771] Updated weights for policy 0, policy_version 90540 (0.0006) [2023-03-07 16:02:57,631][213771] Updated weights for policy 0, policy_version 90550 (0.0006) [2023-03-07 16:02:58,395][213771] Updated weights for policy 0, policy_version 90560 (0.0006) [2023-03-07 16:02:59,175][213771] Updated weights for policy 0, policy_version 90570 (0.0006) [2023-03-07 16:02:59,970][213771] Updated weights for policy 0, policy_version 90580 (0.0005) [2023-03-07 16:03:00,751][213771] Updated weights for policy 0, policy_version 90590 (0.0007) [2023-03-07 16:03:01,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13260.8, 300 sec: 13246.0). Total num frames: 92768256. Throughput: 0: 13277.8. Samples: 92739053. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:03:01,106][213445] Avg episode reward: [(0, '4260.233')] [2023-03-07 16:03:01,518][213771] Updated weights for policy 0, policy_version 90600 (0.0007) [2023-03-07 16:03:02,287][213771] Updated weights for policy 0, policy_version 90610 (0.0007) [2023-03-07 16:03:03,068][213771] Updated weights for policy 0, policy_version 90620 (0.0007) [2023-03-07 16:03:03,842][213771] Updated weights for policy 0, policy_version 90630 (0.0007) [2023-03-07 16:03:04,601][213771] Updated weights for policy 0, policy_version 90640 (0.0006) [2023-03-07 16:03:05,364][213771] Updated weights for policy 0, policy_version 90650 (0.0006) [2023-03-07 16:03:06,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13260.8, 300 sec: 13246.0). Total num frames: 92834816. Throughput: 0: 13265.0. Samples: 92818335. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:03:06,106][213445] Avg episode reward: [(0, '4257.030')] [2023-03-07 16:03:06,147][213771] Updated weights for policy 0, policy_version 90660 (0.0006) [2023-03-07 16:03:06,913][213771] Updated weights for policy 0, policy_version 90670 (0.0007) [2023-03-07 16:03:07,680][213771] Updated weights for policy 0, policy_version 90680 (0.0005) [2023-03-07 16:03:08,450][213771] Updated weights for policy 0, policy_version 90690 (0.0006) [2023-03-07 16:03:09,224][213771] Updated weights for policy 0, policy_version 90700 (0.0007) [2023-03-07 16:03:09,993][213771] Updated weights for policy 0, policy_version 90710 (0.0006) [2023-03-07 16:03:10,775][213771] Updated weights for policy 0, policy_version 90720 (0.0006) [2023-03-07 16:03:11,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13277.9, 300 sec: 13246.0). Total num frames: 92901376. Throughput: 0: 13267.4. Samples: 92898182. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:03:11,106][213445] Avg episode reward: [(0, '4314.647')] [2023-03-07 16:03:11,551][213771] Updated weights for policy 0, policy_version 90730 (0.0006) [2023-03-07 16:03:12,321][213771] Updated weights for policy 0, policy_version 90740 (0.0006) [2023-03-07 16:03:13,112][213771] Updated weights for policy 0, policy_version 90750 (0.0006) [2023-03-07 16:03:13,864][213771] Updated weights for policy 0, policy_version 90760 (0.0006) [2023-03-07 16:03:14,639][213771] Updated weights for policy 0, policy_version 90770 (0.0007) [2023-03-07 16:03:15,421][213771] Updated weights for policy 0, policy_version 90780 (0.0007) [2023-03-07 16:03:16,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13277.9, 300 sec: 13249.5). Total num frames: 92967936. Throughput: 0: 13264.0. Samples: 92937891. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:03:16,105][213445] Avg episode reward: [(0, '4295.638')] [2023-03-07 16:03:16,183][213771] Updated weights for policy 0, policy_version 90790 (0.0006) [2023-03-07 16:03:16,958][213771] Updated weights for policy 0, policy_version 90800 (0.0006) [2023-03-07 16:03:17,737][213771] Updated weights for policy 0, policy_version 90810 (0.0006) [2023-03-07 16:03:18,515][213771] Updated weights for policy 0, policy_version 90820 (0.0006) [2023-03-07 16:03:19,286][213771] Updated weights for policy 0, policy_version 90830 (0.0006) [2023-03-07 16:03:20,061][213771] Updated weights for policy 0, policy_version 90840 (0.0006) [2023-03-07 16:03:20,838][213771] Updated weights for policy 0, policy_version 90850 (0.0007) [2023-03-07 16:03:21,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13260.8, 300 sec: 13246.1). Total num frames: 93033472. Throughput: 0: 13257.4. Samples: 93017100. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:03:21,105][213445] Avg episode reward: [(0, '4321.479')] [2023-03-07 16:03:21,615][213771] Updated weights for policy 0, policy_version 90860 (0.0007) [2023-03-07 16:03:22,397][213771] Updated weights for policy 0, policy_version 90870 (0.0006) [2023-03-07 16:03:23,171][213771] Updated weights for policy 0, policy_version 90880 (0.0006) [2023-03-07 16:03:23,950][213771] Updated weights for policy 0, policy_version 90890 (0.0006) [2023-03-07 16:03:24,712][213771] Updated weights for policy 0, policy_version 90900 (0.0006) [2023-03-07 16:03:25,490][213771] Updated weights for policy 0, policy_version 90910 (0.0006) [2023-03-07 16:03:26,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13246.0). Total num frames: 93100032. Throughput: 0: 13251.0. Samples: 93096491. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:03:26,105][213445] Avg episode reward: [(0, '4294.437')] [2023-03-07 16:03:26,257][213771] Updated weights for policy 0, policy_version 90920 (0.0006) [2023-03-07 16:03:27,021][213771] Updated weights for policy 0, policy_version 90930 (0.0006) [2023-03-07 16:03:27,810][213771] Updated weights for policy 0, policy_version 90940 (0.0006) [2023-03-07 16:03:28,570][213771] Updated weights for policy 0, policy_version 90950 (0.0006) [2023-03-07 16:03:29,352][213771] Updated weights for policy 0, policy_version 90960 (0.0007) [2023-03-07 16:03:30,122][213771] Updated weights for policy 0, policy_version 90970 (0.0006) [2023-03-07 16:03:30,886][213771] Updated weights for policy 0, policy_version 90980 (0.0005) [2023-03-07 16:03:31,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 93165568. Throughput: 0: 13245.2. Samples: 93136192. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:03:31,106][213445] Avg episode reward: [(0, '4289.001')] [2023-03-07 16:03:31,658][213771] Updated weights for policy 0, policy_version 90990 (0.0007) [2023-03-07 16:03:32,434][213771] Updated weights for policy 0, policy_version 91000 (0.0006) [2023-03-07 16:03:33,196][213771] Updated weights for policy 0, policy_version 91010 (0.0006) [2023-03-07 16:03:33,968][213771] Updated weights for policy 0, policy_version 91020 (0.0006) [2023-03-07 16:03:34,726][213771] Updated weights for policy 0, policy_version 91030 (0.0006) [2023-03-07 16:03:35,492][213771] Updated weights for policy 0, policy_version 91040 (0.0006) [2023-03-07 16:03:36,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13277.9, 300 sec: 13249.5). Total num frames: 93233152. Throughput: 0: 13258.5. Samples: 93216001. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:03:36,105][213445] Avg episode reward: [(0, '4328.705')] [2023-03-07 16:03:36,274][213771] Updated weights for policy 0, policy_version 91050 (0.0007) [2023-03-07 16:03:37,053][213771] Updated weights for policy 0, policy_version 91060 (0.0006) [2023-03-07 16:03:37,816][213771] Updated weights for policy 0, policy_version 91070 (0.0006) [2023-03-07 16:03:38,580][213771] Updated weights for policy 0, policy_version 91080 (0.0006) [2023-03-07 16:03:39,341][213771] Updated weights for policy 0, policy_version 91090 (0.0006) [2023-03-07 16:03:40,100][213771] Updated weights for policy 0, policy_version 91100 (0.0005) [2023-03-07 16:03:40,854][213771] Updated weights for policy 0, policy_version 91110 (0.0006) [2023-03-07 16:03:41,105][213445] Fps is (10 sec: 13414.3, 60 sec: 13277.8, 300 sec: 13253.0). Total num frames: 93299712. Throughput: 0: 13269.6. Samples: 93296334. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:03:41,106][213445] Avg episode reward: [(0, '4320.837')] [2023-03-07 16:03:41,629][213771] Updated weights for policy 0, policy_version 91120 (0.0006) [2023-03-07 16:03:42,414][213771] Updated weights for policy 0, policy_version 91130 (0.0007) [2023-03-07 16:03:43,179][213771] Updated weights for policy 0, policy_version 91140 (0.0006) [2023-03-07 16:03:43,942][213771] Updated weights for policy 0, policy_version 91150 (0.0006) [2023-03-07 16:03:44,711][213771] Updated weights for policy 0, policy_version 91160 (0.0006) [2023-03-07 16:03:45,490][213771] Updated weights for policy 0, policy_version 91170 (0.0006) [2023-03-07 16:03:46,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13277.9, 300 sec: 13253.0). Total num frames: 93366272. Throughput: 0: 13268.5. Samples: 93336134. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:03:46,106][213445] Avg episode reward: [(0, '4340.773')] [2023-03-07 16:03:46,255][213771] Updated weights for policy 0, policy_version 91180 (0.0007) [2023-03-07 16:03:47,020][213771] Updated weights for policy 0, policy_version 91190 (0.0006) [2023-03-07 16:03:47,806][213771] Updated weights for policy 0, policy_version 91200 (0.0006) [2023-03-07 16:03:48,576][213771] Updated weights for policy 0, policy_version 91210 (0.0006) [2023-03-07 16:03:49,358][213771] Updated weights for policy 0, policy_version 91220 (0.0007) [2023-03-07 16:03:50,117][213771] Updated weights for policy 0, policy_version 91230 (0.0006) [2023-03-07 16:03:50,900][213771] Updated weights for policy 0, policy_version 91240 (0.0007) [2023-03-07 16:03:51,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 93431808. Throughput: 0: 13274.3. Samples: 93415681. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:03:51,106][213445] Avg episode reward: [(0, '4324.813')] [2023-03-07 16:03:51,665][213771] Updated weights for policy 0, policy_version 91250 (0.0006) [2023-03-07 16:03:52,430][213771] Updated weights for policy 0, policy_version 91260 (0.0007) [2023-03-07 16:03:53,221][213771] Updated weights for policy 0, policy_version 91270 (0.0006) [2023-03-07 16:03:54,009][213771] Updated weights for policy 0, policy_version 91280 (0.0005) [2023-03-07 16:03:54,777][213771] Updated weights for policy 0, policy_version 91290 (0.0006) [2023-03-07 16:03:55,554][213771] Updated weights for policy 0, policy_version 91300 (0.0006) [2023-03-07 16:03:56,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 93498368. Throughput: 0: 13262.6. Samples: 93494997. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:03:56,106][213445] Avg episode reward: [(0, '4272.073')] [2023-03-07 16:03:56,313][213771] Updated weights for policy 0, policy_version 91310 (0.0006) [2023-03-07 16:03:57,097][213771] Updated weights for policy 0, policy_version 91320 (0.0006) [2023-03-07 16:03:57,866][213771] Updated weights for policy 0, policy_version 91330 (0.0006) [2023-03-07 16:03:58,646][213771] Updated weights for policy 0, policy_version 91340 (0.0006) [2023-03-07 16:03:59,418][213771] Updated weights for policy 0, policy_version 91350 (0.0006) [2023-03-07 16:04:00,194][213771] Updated weights for policy 0, policy_version 91360 (0.0006) [2023-03-07 16:04:00,983][213771] Updated weights for policy 0, policy_version 91370 (0.0006) [2023-03-07 16:04:01,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 93563904. Throughput: 0: 13262.7. Samples: 93534715. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:04:01,106][213445] Avg episode reward: [(0, '4261.113')] [2023-03-07 16:04:01,747][213771] Updated weights for policy 0, policy_version 91380 (0.0006) [2023-03-07 16:04:02,517][213771] Updated weights for policy 0, policy_version 91390 (0.0006) [2023-03-07 16:04:03,293][213771] Updated weights for policy 0, policy_version 91400 (0.0006) [2023-03-07 16:04:04,070][213771] Updated weights for policy 0, policy_version 91410 (0.0006) [2023-03-07 16:04:04,845][213771] Updated weights for policy 0, policy_version 91420 (0.0005) [2023-03-07 16:04:05,626][213771] Updated weights for policy 0, policy_version 91430 (0.0006) [2023-03-07 16:04:06,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 93630464. Throughput: 0: 13262.7. Samples: 93613924. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:04:06,106][213445] Avg episode reward: [(0, '4273.717')] [2023-03-07 16:04:06,112][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000091436_93630464.pth... [2023-03-07 16:04:06,142][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000088330_90449920.pth [2023-03-07 16:04:06,400][213771] Updated weights for policy 0, policy_version 91440 (0.0005) [2023-03-07 16:04:07,163][213771] Updated weights for policy 0, policy_version 91450 (0.0006) [2023-03-07 16:04:07,941][213771] Updated weights for policy 0, policy_version 91460 (0.0006) [2023-03-07 16:04:08,709][213771] Updated weights for policy 0, policy_version 91470 (0.0006) [2023-03-07 16:04:09,499][213771] Updated weights for policy 0, policy_version 91480 (0.0006) [2023-03-07 16:04:10,270][213771] Updated weights for policy 0, policy_version 91490 (0.0006) [2023-03-07 16:04:11,046][213771] Updated weights for policy 0, policy_version 91500 (0.0006) [2023-03-07 16:04:11,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 93696000. Throughput: 0: 13261.3. Samples: 93693249. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:04:11,106][213445] Avg episode reward: [(0, '4272.032')] [2023-03-07 16:04:11,831][213771] Updated weights for policy 0, policy_version 91510 (0.0006) [2023-03-07 16:04:12,587][213771] Updated weights for policy 0, policy_version 91520 (0.0006) [2023-03-07 16:04:13,365][213771] Updated weights for policy 0, policy_version 91530 (0.0006) [2023-03-07 16:04:14,123][213771] Updated weights for policy 0, policy_version 91540 (0.0005) [2023-03-07 16:04:14,891][213771] Updated weights for policy 0, policy_version 91550 (0.0006) [2023-03-07 16:04:15,653][213771] Updated weights for policy 0, policy_version 91560 (0.0005) [2023-03-07 16:04:16,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 93762560. Throughput: 0: 13265.1. Samples: 93733122. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:04:16,116][213445] Avg episode reward: [(0, '4261.090')] [2023-03-07 16:04:16,429][213771] Updated weights for policy 0, policy_version 91570 (0.0006) [2023-03-07 16:04:17,211][213771] Updated weights for policy 0, policy_version 91580 (0.0006) [2023-03-07 16:04:17,979][213771] Updated weights for policy 0, policy_version 91590 (0.0006) [2023-03-07 16:04:18,759][213771] Updated weights for policy 0, policy_version 91600 (0.0006) [2023-03-07 16:04:19,534][213771] Updated weights for policy 0, policy_version 91610 (0.0006) [2023-03-07 16:04:20,305][213771] Updated weights for policy 0, policy_version 91620 (0.0006) [2023-03-07 16:04:21,085][213771] Updated weights for policy 0, policy_version 91630 (0.0006) [2023-03-07 16:04:21,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 93829120. Throughput: 0: 13262.7. Samples: 93812822. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:04:21,116][213445] Avg episode reward: [(0, '4267.843')] [2023-03-07 16:04:21,855][213771] Updated weights for policy 0, policy_version 91640 (0.0006) [2023-03-07 16:04:22,629][213771] Updated weights for policy 0, policy_version 91650 (0.0006) [2023-03-07 16:04:23,407][213771] Updated weights for policy 0, policy_version 91660 (0.0006) [2023-03-07 16:04:24,173][213771] Updated weights for policy 0, policy_version 91670 (0.0005) [2023-03-07 16:04:24,932][213771] Updated weights for policy 0, policy_version 91680 (0.0005) [2023-03-07 16:04:25,727][213771] Updated weights for policy 0, policy_version 91690 (0.0006) [2023-03-07 16:04:26,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 93895680. Throughput: 0: 13239.8. Samples: 93892124. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:04:26,116][213445] Avg episode reward: [(0, '4275.762')] [2023-03-07 16:04:26,494][213771] Updated weights for policy 0, policy_version 91700 (0.0007) [2023-03-07 16:04:27,268][213771] Updated weights for policy 0, policy_version 91710 (0.0006) [2023-03-07 16:04:28,054][213771] Updated weights for policy 0, policy_version 91720 (0.0006) [2023-03-07 16:04:28,818][213771] Updated weights for policy 0, policy_version 91730 (0.0006) [2023-03-07 16:04:29,618][213771] Updated weights for policy 0, policy_version 91740 (0.0006) [2023-03-07 16:04:30,397][213771] Updated weights for policy 0, policy_version 91750 (0.0006) [2023-03-07 16:04:31,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 93961216. Throughput: 0: 13234.4. Samples: 93931682. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:04:31,116][213445] Avg episode reward: [(0, '4250.142')] [2023-03-07 16:04:31,163][213771] Updated weights for policy 0, policy_version 91760 (0.0006) [2023-03-07 16:04:31,946][213771] Updated weights for policy 0, policy_version 91770 (0.0007) [2023-03-07 16:04:32,718][213771] Updated weights for policy 0, policy_version 91780 (0.0005) [2023-03-07 16:04:33,495][213771] Updated weights for policy 0, policy_version 91790 (0.0006) [2023-03-07 16:04:34,273][213771] Updated weights for policy 0, policy_version 91800 (0.0006) [2023-03-07 16:04:35,044][213771] Updated weights for policy 0, policy_version 91810 (0.0007) [2023-03-07 16:04:35,818][213771] Updated weights for policy 0, policy_version 91820 (0.0007) [2023-03-07 16:04:36,105][213445] Fps is (10 sec: 13107.3, 60 sec: 13226.7, 300 sec: 13246.0). Total num frames: 94026752. Throughput: 0: 13219.4. Samples: 94010551. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:04:36,116][213445] Avg episode reward: [(0, '4223.476')] [2023-03-07 16:04:36,595][213771] Updated weights for policy 0, policy_version 91830 (0.0006) [2023-03-07 16:04:37,352][213771] Updated weights for policy 0, policy_version 91840 (0.0006) [2023-03-07 16:04:38,139][213771] Updated weights for policy 0, policy_version 91850 (0.0006) [2023-03-07 16:04:38,925][213771] Updated weights for policy 0, policy_version 91860 (0.0006) [2023-03-07 16:04:39,700][213771] Updated weights for policy 0, policy_version 91870 (0.0006) [2023-03-07 16:04:40,474][213771] Updated weights for policy 0, policy_version 91880 (0.0006) [2023-03-07 16:04:41,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13249.5). Total num frames: 94093312. Throughput: 0: 13219.0. Samples: 94089853. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:04:41,106][213445] Avg episode reward: [(0, '4273.047')] [2023-03-07 16:04:41,244][213771] Updated weights for policy 0, policy_version 91890 (0.0006) [2023-03-07 16:04:42,012][213771] Updated weights for policy 0, policy_version 91900 (0.0007) [2023-03-07 16:04:42,774][213771] Updated weights for policy 0, policy_version 91910 (0.0006) [2023-03-07 16:04:43,570][213771] Updated weights for policy 0, policy_version 91920 (0.0005) [2023-03-07 16:04:44,357][213771] Updated weights for policy 0, policy_version 91930 (0.0006) [2023-03-07 16:04:45,126][213771] Updated weights for policy 0, policy_version 91940 (0.0006) [2023-03-07 16:04:45,889][213771] Updated weights for policy 0, policy_version 91950 (0.0006) [2023-03-07 16:04:46,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13209.6, 300 sec: 13246.0). Total num frames: 94158848. Throughput: 0: 13216.8. Samples: 94129471. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:04:46,106][213445] Avg episode reward: [(0, '4280.269')] [2023-03-07 16:04:46,666][213771] Updated weights for policy 0, policy_version 91960 (0.0006) [2023-03-07 16:04:47,434][213771] Updated weights for policy 0, policy_version 91970 (0.0006) [2023-03-07 16:04:48,205][213771] Updated weights for policy 0, policy_version 91980 (0.0006) [2023-03-07 16:04:48,977][213771] Updated weights for policy 0, policy_version 91990 (0.0005) [2023-03-07 16:04:49,750][213771] Updated weights for policy 0, policy_version 92000 (0.0006) [2023-03-07 16:04:50,525][213771] Updated weights for policy 0, policy_version 92010 (0.0007) [2023-03-07 16:04:51,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13246.0). Total num frames: 94225408. Throughput: 0: 13226.7. Samples: 94209126. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:04:51,106][213445] Avg episode reward: [(0, '4293.533')] [2023-03-07 16:04:51,300][213771] Updated weights for policy 0, policy_version 92020 (0.0006) [2023-03-07 16:04:52,058][213771] Updated weights for policy 0, policy_version 92030 (0.0006) [2023-03-07 16:04:52,845][213771] Updated weights for policy 0, policy_version 92040 (0.0007) [2023-03-07 16:04:53,626][213771] Updated weights for policy 0, policy_version 92050 (0.0006) [2023-03-07 16:04:54,401][213771] Updated weights for policy 0, policy_version 92060 (0.0006) [2023-03-07 16:04:55,167][213771] Updated weights for policy 0, policy_version 92070 (0.0006) [2023-03-07 16:04:55,950][213771] Updated weights for policy 0, policy_version 92080 (0.0006) [2023-03-07 16:04:56,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13226.6, 300 sec: 13249.5). Total num frames: 94291968. Throughput: 0: 13223.7. Samples: 94288317. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:04:56,106][213445] Avg episode reward: [(0, '4195.072')] [2023-03-07 16:04:56,698][213771] Updated weights for policy 0, policy_version 92090 (0.0006) [2023-03-07 16:04:57,485][213771] Updated weights for policy 0, policy_version 92100 (0.0006) [2023-03-07 16:04:58,263][213771] Updated weights for policy 0, policy_version 92110 (0.0006) [2023-03-07 16:04:59,033][213771] Updated weights for policy 0, policy_version 92120 (0.0006) [2023-03-07 16:04:59,798][213771] Updated weights for policy 0, policy_version 92130 (0.0005) [2023-03-07 16:05:00,570][213771] Updated weights for policy 0, policy_version 92140 (0.0007) [2023-03-07 16:05:01,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13226.7, 300 sec: 13246.1). Total num frames: 94357504. Throughput: 0: 13223.4. Samples: 94328175. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:05:01,105][213445] Avg episode reward: [(0, '4264.935')] [2023-03-07 16:05:01,351][213771] Updated weights for policy 0, policy_version 92150 (0.0007) [2023-03-07 16:05:02,114][213771] Updated weights for policy 0, policy_version 92160 (0.0006) [2023-03-07 16:05:02,885][213771] Updated weights for policy 0, policy_version 92170 (0.0005) [2023-03-07 16:05:03,664][213771] Updated weights for policy 0, policy_version 92180 (0.0006) [2023-03-07 16:05:04,427][213771] Updated weights for policy 0, policy_version 92190 (0.0006) [2023-03-07 16:05:05,223][213771] Updated weights for policy 0, policy_version 92200 (0.0006) [2023-03-07 16:05:06,005][213771] Updated weights for policy 0, policy_version 92210 (0.0007) [2023-03-07 16:05:06,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13226.7, 300 sec: 13249.5). Total num frames: 94424064. Throughput: 0: 13221.6. Samples: 94407795. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:05:06,105][213445] Avg episode reward: [(0, '4257.391')] [2023-03-07 16:05:06,766][213771] Updated weights for policy 0, policy_version 92220 (0.0006) [2023-03-07 16:05:07,536][213771] Updated weights for policy 0, policy_version 92230 (0.0007) [2023-03-07 16:05:08,306][213771] Updated weights for policy 0, policy_version 92240 (0.0006) [2023-03-07 16:05:09,081][213771] Updated weights for policy 0, policy_version 92250 (0.0007) [2023-03-07 16:05:09,833][213771] Updated weights for policy 0, policy_version 92260 (0.0006) [2023-03-07 16:05:10,614][213771] Updated weights for policy 0, policy_version 92270 (0.0006) [2023-03-07 16:05:11,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 94490624. Throughput: 0: 13229.2. Samples: 94487436. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:05:11,106][213445] Avg episode reward: [(0, '4283.586')] [2023-03-07 16:05:11,383][213771] Updated weights for policy 0, policy_version 92280 (0.0006) [2023-03-07 16:05:12,134][213771] Updated weights for policy 0, policy_version 92290 (0.0007) [2023-03-07 16:05:12,929][213771] Updated weights for policy 0, policy_version 92300 (0.0006) [2023-03-07 16:05:13,699][213771] Updated weights for policy 0, policy_version 92310 (0.0006) [2023-03-07 16:05:14,466][213771] Updated weights for policy 0, policy_version 92320 (0.0007) [2023-03-07 16:05:15,240][213771] Updated weights for policy 0, policy_version 92330 (0.0006) [2023-03-07 16:05:15,994][213771] Updated weights for policy 0, policy_version 92340 (0.0007) [2023-03-07 16:05:16,105][213445] Fps is (10 sec: 13311.7, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 94557184. Throughput: 0: 13230.4. Samples: 94527052. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:05:16,106][213445] Avg episode reward: [(0, '4273.917')] [2023-03-07 16:05:16,789][213771] Updated weights for policy 0, policy_version 92350 (0.0005) [2023-03-07 16:05:17,556][213771] Updated weights for policy 0, policy_version 92360 (0.0006) [2023-03-07 16:05:18,328][213771] Updated weights for policy 0, policy_version 92370 (0.0006) [2023-03-07 16:05:19,090][213771] Updated weights for policy 0, policy_version 92380 (0.0007) [2023-03-07 16:05:19,893][213771] Updated weights for policy 0, policy_version 92390 (0.0005) [2023-03-07 16:05:20,654][213771] Updated weights for policy 0, policy_version 92400 (0.0006) [2023-03-07 16:05:21,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13226.7, 300 sec: 13246.0). Total num frames: 94622720. Throughput: 0: 13245.5. Samples: 94606601. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:05:21,106][213445] Avg episode reward: [(0, '4326.951')] [2023-03-07 16:05:21,417][213771] Updated weights for policy 0, policy_version 92410 (0.0006) [2023-03-07 16:05:22,196][213771] Updated weights for policy 0, policy_version 92420 (0.0006) [2023-03-07 16:05:22,948][213771] Updated weights for policy 0, policy_version 92430 (0.0006) [2023-03-07 16:05:23,726][213771] Updated weights for policy 0, policy_version 92440 (0.0006) [2023-03-07 16:05:24,495][213771] Updated weights for policy 0, policy_version 92450 (0.0006) [2023-03-07 16:05:25,262][213771] Updated weights for policy 0, policy_version 92460 (0.0006) [2023-03-07 16:05:26,053][213771] Updated weights for policy 0, policy_version 92470 (0.0007) [2023-03-07 16:05:26,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.6, 300 sec: 13246.0). Total num frames: 94689280. Throughput: 0: 13256.5. Samples: 94686398. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:05:26,106][213445] Avg episode reward: [(0, '4260.124')] [2023-03-07 16:05:26,827][213771] Updated weights for policy 0, policy_version 92480 (0.0006) [2023-03-07 16:05:27,603][213771] Updated weights for policy 0, policy_version 92490 (0.0006) [2023-03-07 16:05:28,384][213771] Updated weights for policy 0, policy_version 92500 (0.0006) [2023-03-07 16:05:29,172][213771] Updated weights for policy 0, policy_version 92510 (0.0006) [2023-03-07 16:05:29,934][213771] Updated weights for policy 0, policy_version 92520 (0.0008) [2023-03-07 16:05:30,733][213771] Updated weights for policy 0, policy_version 92530 (0.0006) [2023-03-07 16:05:31,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 94755840. Throughput: 0: 13255.2. Samples: 94725953. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:05:31,106][213445] Avg episode reward: [(0, '4325.228')] [2023-03-07 16:05:31,498][213771] Updated weights for policy 0, policy_version 92540 (0.0006) [2023-03-07 16:05:32,278][213771] Updated weights for policy 0, policy_version 92550 (0.0006) [2023-03-07 16:05:33,045][213771] Updated weights for policy 0, policy_version 92560 (0.0006) [2023-03-07 16:05:33,821][213771] Updated weights for policy 0, policy_version 92570 (0.0006) [2023-03-07 16:05:34,578][213771] Updated weights for policy 0, policy_version 92580 (0.0006) [2023-03-07 16:05:35,355][213771] Updated weights for policy 0, policy_version 92590 (0.0007) [2023-03-07 16:05:36,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 94821376. Throughput: 0: 13246.4. Samples: 94805215. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:05:36,106][213445] Avg episode reward: [(0, '4308.831')] [2023-03-07 16:05:36,137][213771] Updated weights for policy 0, policy_version 92600 (0.0006) [2023-03-07 16:05:36,888][213771] Updated weights for policy 0, policy_version 92610 (0.0006) [2023-03-07 16:05:37,664][213771] Updated weights for policy 0, policy_version 92620 (0.0006) [2023-03-07 16:05:38,442][213771] Updated weights for policy 0, policy_version 92630 (0.0005) [2023-03-07 16:05:39,228][213771] Updated weights for policy 0, policy_version 92640 (0.0008) [2023-03-07 16:05:39,989][213771] Updated weights for policy 0, policy_version 92650 (0.0006) [2023-03-07 16:05:40,758][213771] Updated weights for policy 0, policy_version 92660 (0.0006) [2023-03-07 16:05:41,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 94887936. Throughput: 0: 13260.1. Samples: 94885020. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:05:41,106][213445] Avg episode reward: [(0, '4263.011')] [2023-03-07 16:05:41,512][213771] Updated weights for policy 0, policy_version 92670 (0.0006) [2023-03-07 16:05:42,297][213771] Updated weights for policy 0, policy_version 92680 (0.0005) [2023-03-07 16:05:43,049][213771] Updated weights for policy 0, policy_version 92690 (0.0005) [2023-03-07 16:05:43,858][213771] Updated weights for policy 0, policy_version 92700 (0.0006) [2023-03-07 16:05:44,630][213771] Updated weights for policy 0, policy_version 92710 (0.0005) [2023-03-07 16:05:45,392][213771] Updated weights for policy 0, policy_version 92720 (0.0005) [2023-03-07 16:05:46,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13246.1). Total num frames: 94954496. Throughput: 0: 13257.6. Samples: 94924767. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:05:46,106][213445] Avg episode reward: [(0, '4283.442')] [2023-03-07 16:05:46,170][213771] Updated weights for policy 0, policy_version 92730 (0.0007) [2023-03-07 16:05:46,956][213771] Updated weights for policy 0, policy_version 92740 (0.0007) [2023-03-07 16:05:47,701][213771] Updated weights for policy 0, policy_version 92750 (0.0006) [2023-03-07 16:05:48,467][213771] Updated weights for policy 0, policy_version 92760 (0.0006) [2023-03-07 16:05:49,237][213771] Updated weights for policy 0, policy_version 92770 (0.0006) [2023-03-07 16:05:50,011][213771] Updated weights for policy 0, policy_version 92780 (0.0006) [2023-03-07 16:05:50,801][213771] Updated weights for policy 0, policy_version 92790 (0.0006) [2023-03-07 16:05:51,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 95020032. Throughput: 0: 13256.8. Samples: 95004355. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:05:51,106][213445] Avg episode reward: [(0, '4329.148')] [2023-03-07 16:05:51,584][213771] Updated weights for policy 0, policy_version 92800 (0.0006) [2023-03-07 16:05:52,356][213771] Updated weights for policy 0, policy_version 92810 (0.0006) [2023-03-07 16:05:53,112][213771] Updated weights for policy 0, policy_version 92820 (0.0006) [2023-03-07 16:05:53,910][213771] Updated weights for policy 0, policy_version 92830 (0.0006) [2023-03-07 16:05:54,669][213771] Updated weights for policy 0, policy_version 92840 (0.0007) [2023-03-07 16:05:55,446][213771] Updated weights for policy 0, policy_version 92850 (0.0006) [2023-03-07 16:05:56,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.8, 300 sec: 13246.0). Total num frames: 95086592. Throughput: 0: 13248.8. Samples: 95083634. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:05:56,106][213445] Avg episode reward: [(0, '4299.857')] [2023-03-07 16:05:56,219][213771] Updated weights for policy 0, policy_version 92860 (0.0006) [2023-03-07 16:05:56,985][213771] Updated weights for policy 0, policy_version 92870 (0.0006) [2023-03-07 16:05:57,760][213771] Updated weights for policy 0, policy_version 92880 (0.0006) [2023-03-07 16:05:58,550][213771] Updated weights for policy 0, policy_version 92890 (0.0006) [2023-03-07 16:05:59,317][213771] Updated weights for policy 0, policy_version 92900 (0.0005) [2023-03-07 16:06:00,084][213771] Updated weights for policy 0, policy_version 92910 (0.0006) [2023-03-07 16:06:00,866][213771] Updated weights for policy 0, policy_version 92920 (0.0005) [2023-03-07 16:06:01,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13246.0). Total num frames: 95153152. Throughput: 0: 13250.3. Samples: 95123314. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:06:01,106][213445] Avg episode reward: [(0, '4315.907')] [2023-03-07 16:06:01,645][213771] Updated weights for policy 0, policy_version 92930 (0.0006) [2023-03-07 16:06:02,421][213771] Updated weights for policy 0, policy_version 92940 (0.0006) [2023-03-07 16:06:03,195][213771] Updated weights for policy 0, policy_version 92950 (0.0008) [2023-03-07 16:06:03,949][213771] Updated weights for policy 0, policy_version 92960 (0.0006) [2023-03-07 16:06:04,727][213771] Updated weights for policy 0, policy_version 92970 (0.0006) [2023-03-07 16:06:05,496][213771] Updated weights for policy 0, policy_version 92980 (0.0006) [2023-03-07 16:06:06,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 95218688. Throughput: 0: 13246.5. Samples: 95202693. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:06:06,106][213445] Avg episode reward: [(0, '4252.045')] [2023-03-07 16:06:06,112][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000092988_95219712.pth... [2023-03-07 16:06:06,140][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000089882_92039168.pth [2023-03-07 16:06:06,264][213771] Updated weights for policy 0, policy_version 92990 (0.0006) [2023-03-07 16:06:07,045][213771] Updated weights for policy 0, policy_version 93000 (0.0006) [2023-03-07 16:06:07,816][213771] Updated weights for policy 0, policy_version 93010 (0.0006) [2023-03-07 16:06:08,593][213771] Updated weights for policy 0, policy_version 93020 (0.0006) [2023-03-07 16:06:09,352][213771] Updated weights for policy 0, policy_version 93030 (0.0006) [2023-03-07 16:06:10,126][213771] Updated weights for policy 0, policy_version 93040 (0.0006) [2023-03-07 16:06:10,896][213771] Updated weights for policy 0, policy_version 93050 (0.0007) [2023-03-07 16:06:11,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 95285248. Throughput: 0: 13244.1. Samples: 95282379. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:06:11,106][213445] Avg episode reward: [(0, '4225.820')] [2023-03-07 16:06:11,666][213771] Updated weights for policy 0, policy_version 93060 (0.0006) [2023-03-07 16:06:12,428][213771] Updated weights for policy 0, policy_version 93070 (0.0006) [2023-03-07 16:06:13,208][213771] Updated weights for policy 0, policy_version 93080 (0.0005) [2023-03-07 16:06:13,984][213771] Updated weights for policy 0, policy_version 93090 (0.0006) [2023-03-07 16:06:14,742][213771] Updated weights for policy 0, policy_version 93100 (0.0006) [2023-03-07 16:06:15,524][213771] Updated weights for policy 0, policy_version 93110 (0.0006) [2023-03-07 16:06:16,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 95351808. Throughput: 0: 13251.4. Samples: 95322268. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:06:16,106][213445] Avg episode reward: [(0, '4230.314')] [2023-03-07 16:06:16,299][213771] Updated weights for policy 0, policy_version 93120 (0.0005) [2023-03-07 16:06:17,066][213771] Updated weights for policy 0, policy_version 93130 (0.0006) [2023-03-07 16:06:17,850][213771] Updated weights for policy 0, policy_version 93140 (0.0007) [2023-03-07 16:06:18,617][213771] Updated weights for policy 0, policy_version 93150 (0.0006) [2023-03-07 16:06:19,397][213771] Updated weights for policy 0, policy_version 93160 (0.0006) [2023-03-07 16:06:20,170][213771] Updated weights for policy 0, policy_version 93170 (0.0006) [2023-03-07 16:06:20,924][213771] Updated weights for policy 0, policy_version 93180 (0.0006) [2023-03-07 16:06:21,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 95418368. Throughput: 0: 13255.4. Samples: 95401710. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:06:21,106][213445] Avg episode reward: [(0, '4200.483')] [2023-03-07 16:06:21,698][213771] Updated weights for policy 0, policy_version 93190 (0.0007) [2023-03-07 16:06:22,480][213771] Updated weights for policy 0, policy_version 93200 (0.0006) [2023-03-07 16:06:23,247][213771] Updated weights for policy 0, policy_version 93210 (0.0006) [2023-03-07 16:06:24,022][213771] Updated weights for policy 0, policy_version 93220 (0.0006) [2023-03-07 16:06:24,795][213771] Updated weights for policy 0, policy_version 93230 (0.0008) [2023-03-07 16:06:25,578][213771] Updated weights for policy 0, policy_version 93240 (0.0006) [2023-03-07 16:06:26,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 95484928. Throughput: 0: 13251.5. Samples: 95481336. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:06:26,106][213445] Avg episode reward: [(0, '4224.175')] [2023-03-07 16:06:26,332][213771] Updated weights for policy 0, policy_version 93250 (0.0006) [2023-03-07 16:06:27,097][213771] Updated weights for policy 0, policy_version 93260 (0.0006) [2023-03-07 16:06:27,877][213771] Updated weights for policy 0, policy_version 93270 (0.0006) [2023-03-07 16:06:28,652][213771] Updated weights for policy 0, policy_version 93280 (0.0007) [2023-03-07 16:06:29,407][213771] Updated weights for policy 0, policy_version 93290 (0.0006) [2023-03-07 16:06:30,174][213771] Updated weights for policy 0, policy_version 93300 (0.0006) [2023-03-07 16:06:30,941][213771] Updated weights for policy 0, policy_version 93310 (0.0006) [2023-03-07 16:06:31,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 95551488. Throughput: 0: 13256.3. Samples: 95521303. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:06:31,106][213445] Avg episode reward: [(0, '4195.785')] [2023-03-07 16:06:31,712][213771] Updated weights for policy 0, policy_version 93320 (0.0006) [2023-03-07 16:06:32,478][213771] Updated weights for policy 0, policy_version 93330 (0.0005) [2023-03-07 16:06:33,233][213771] Updated weights for policy 0, policy_version 93340 (0.0005) [2023-03-07 16:06:34,031][213771] Updated weights for policy 0, policy_version 93350 (0.0005) [2023-03-07 16:06:34,787][213771] Updated weights for policy 0, policy_version 93360 (0.0007) [2023-03-07 16:06:35,550][213771] Updated weights for policy 0, policy_version 93370 (0.0006) [2023-03-07 16:06:36,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 95617024. Throughput: 0: 13266.1. Samples: 95601327. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:06:36,116][213445] Avg episode reward: [(0, '4212.370')] [2023-03-07 16:06:36,326][213771] Updated weights for policy 0, policy_version 93380 (0.0005) [2023-03-07 16:06:37,097][213771] Updated weights for policy 0, policy_version 93390 (0.0006) [2023-03-07 16:06:37,896][213771] Updated weights for policy 0, policy_version 93400 (0.0006) [2023-03-07 16:06:38,658][213771] Updated weights for policy 0, policy_version 93410 (0.0007) [2023-03-07 16:06:39,424][213771] Updated weights for policy 0, policy_version 93420 (0.0007) [2023-03-07 16:06:40,204][213771] Updated weights for policy 0, policy_version 93430 (0.0006) [2023-03-07 16:06:40,964][213771] Updated weights for policy 0, policy_version 93440 (0.0006) [2023-03-07 16:06:41,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 95683584. Throughput: 0: 13270.4. Samples: 95680802. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:06:41,106][213445] Avg episode reward: [(0, '4235.719')] [2023-03-07 16:06:41,745][213771] Updated weights for policy 0, policy_version 93450 (0.0006) [2023-03-07 16:06:42,526][213771] Updated weights for policy 0, policy_version 93460 (0.0006) [2023-03-07 16:06:43,299][213771] Updated weights for policy 0, policy_version 93470 (0.0006) [2023-03-07 16:06:44,069][213771] Updated weights for policy 0, policy_version 93480 (0.0007) [2023-03-07 16:06:44,846][213771] Updated weights for policy 0, policy_version 93490 (0.0006) [2023-03-07 16:06:45,623][213771] Updated weights for policy 0, policy_version 93500 (0.0007) [2023-03-07 16:06:46,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 95750144. Throughput: 0: 13268.6. Samples: 95720402. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:06:46,106][213445] Avg episode reward: [(0, '4206.879')] [2023-03-07 16:06:46,398][213771] Updated weights for policy 0, policy_version 93510 (0.0006) [2023-03-07 16:06:47,180][213771] Updated weights for policy 0, policy_version 93520 (0.0005) [2023-03-07 16:06:47,950][213771] Updated weights for policy 0, policy_version 93530 (0.0006) [2023-03-07 16:06:48,724][213771] Updated weights for policy 0, policy_version 93540 (0.0006) [2023-03-07 16:06:49,507][213771] Updated weights for policy 0, policy_version 93550 (0.0006) [2023-03-07 16:06:50,277][213771] Updated weights for policy 0, policy_version 93560 (0.0006) [2023-03-07 16:06:51,058][213771] Updated weights for policy 0, policy_version 93570 (0.0006) [2023-03-07 16:06:51,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 95815680. Throughput: 0: 13263.8. Samples: 95799562. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:06:51,106][213445] Avg episode reward: [(0, '4186.370')] [2023-03-07 16:06:51,821][213771] Updated weights for policy 0, policy_version 93580 (0.0006) [2023-03-07 16:06:52,580][213771] Updated weights for policy 0, policy_version 93590 (0.0005) [2023-03-07 16:06:53,372][213771] Updated weights for policy 0, policy_version 93600 (0.0006) [2023-03-07 16:06:54,125][213771] Updated weights for policy 0, policy_version 93610 (0.0006) [2023-03-07 16:06:54,900][213771] Updated weights for policy 0, policy_version 93620 (0.0006) [2023-03-07 16:06:55,671][213771] Updated weights for policy 0, policy_version 93630 (0.0005) [2023-03-07 16:06:56,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 95882240. Throughput: 0: 13264.6. Samples: 95879287. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:06:56,106][213445] Avg episode reward: [(0, '4179.676')] [2023-03-07 16:06:56,452][213771] Updated weights for policy 0, policy_version 93640 (0.0006) [2023-03-07 16:06:57,217][213771] Updated weights for policy 0, policy_version 93650 (0.0006) [2023-03-07 16:06:57,974][213771] Updated weights for policy 0, policy_version 93660 (0.0006) [2023-03-07 16:06:58,743][213771] Updated weights for policy 0, policy_version 93670 (0.0007) [2023-03-07 16:06:59,511][213771] Updated weights for policy 0, policy_version 93680 (0.0006) [2023-03-07 16:07:00,290][213771] Updated weights for policy 0, policy_version 93690 (0.0006) [2023-03-07 16:07:01,054][213771] Updated weights for policy 0, policy_version 93700 (0.0006) [2023-03-07 16:07:01,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 95948800. Throughput: 0: 13270.6. Samples: 95919442. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:07:01,106][213445] Avg episode reward: [(0, '4151.441')] [2023-03-07 16:07:01,825][213771] Updated weights for policy 0, policy_version 93710 (0.0006) [2023-03-07 16:07:02,592][213771] Updated weights for policy 0, policy_version 93720 (0.0007) [2023-03-07 16:07:03,342][213771] Updated weights for policy 0, policy_version 93730 (0.0006) [2023-03-07 16:07:04,111][213771] Updated weights for policy 0, policy_version 93740 (0.0005) [2023-03-07 16:07:04,902][213771] Updated weights for policy 0, policy_version 93750 (0.0007) [2023-03-07 16:07:05,649][213771] Updated weights for policy 0, policy_version 93760 (0.0006) [2023-03-07 16:07:06,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13277.9, 300 sec: 13256.5). Total num frames: 96015360. Throughput: 0: 13276.5. Samples: 95999151. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:07:06,106][213445] Avg episode reward: [(0, '4220.973')] [2023-03-07 16:07:06,441][213771] Updated weights for policy 0, policy_version 93770 (0.0006) [2023-03-07 16:07:07,205][213771] Updated weights for policy 0, policy_version 93780 (0.0006) [2023-03-07 16:07:07,976][213771] Updated weights for policy 0, policy_version 93790 (0.0006) [2023-03-07 16:07:08,749][213771] Updated weights for policy 0, policy_version 93800 (0.0006) [2023-03-07 16:07:09,524][213771] Updated weights for policy 0, policy_version 93810 (0.0006) [2023-03-07 16:07:10,293][213771] Updated weights for policy 0, policy_version 93820 (0.0007) [2023-03-07 16:07:11,084][213771] Updated weights for policy 0, policy_version 93830 (0.0006) [2023-03-07 16:07:11,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13277.9, 300 sec: 13256.5). Total num frames: 96081920. Throughput: 0: 13275.6. Samples: 96078736. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:07:11,105][213445] Avg episode reward: [(0, '4220.723')] [2023-03-07 16:07:11,844][213771] Updated weights for policy 0, policy_version 93840 (0.0006) [2023-03-07 16:07:12,616][213771] Updated weights for policy 0, policy_version 93850 (0.0006) [2023-03-07 16:07:13,387][213771] Updated weights for policy 0, policy_version 93860 (0.0006) [2023-03-07 16:07:14,145][213771] Updated weights for policy 0, policy_version 93870 (0.0006) [2023-03-07 16:07:14,934][213771] Updated weights for policy 0, policy_version 93880 (0.0006) [2023-03-07 16:07:15,720][213771] Updated weights for policy 0, policy_version 93890 (0.0005) [2023-03-07 16:07:16,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13277.9, 300 sec: 13256.5). Total num frames: 96148480. Throughput: 0: 13272.4. Samples: 96118559. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:07:16,106][213445] Avg episode reward: [(0, '4070.462')] [2023-03-07 16:07:16,487][213771] Updated weights for policy 0, policy_version 93900 (0.0006) [2023-03-07 16:07:17,260][213771] Updated weights for policy 0, policy_version 93910 (0.0006) [2023-03-07 16:07:18,049][213771] Updated weights for policy 0, policy_version 93920 (0.0005) [2023-03-07 16:07:18,801][213771] Updated weights for policy 0, policy_version 93930 (0.0006) [2023-03-07 16:07:19,581][213771] Updated weights for policy 0, policy_version 93940 (0.0006) [2023-03-07 16:07:20,362][213771] Updated weights for policy 0, policy_version 93950 (0.0006) [2023-03-07 16:07:21,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 96214016. Throughput: 0: 13256.9. Samples: 96197886. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:07:21,106][213445] Avg episode reward: [(0, '4174.771')] [2023-03-07 16:07:21,134][213771] Updated weights for policy 0, policy_version 93960 (0.0006) [2023-03-07 16:07:21,887][213771] Updated weights for policy 0, policy_version 93970 (0.0006) [2023-03-07 16:07:22,657][213771] Updated weights for policy 0, policy_version 93980 (0.0006) [2023-03-07 16:07:23,438][213771] Updated weights for policy 0, policy_version 93990 (0.0006) [2023-03-07 16:07:24,214][213771] Updated weights for policy 0, policy_version 94000 (0.0006) [2023-03-07 16:07:24,994][213771] Updated weights for policy 0, policy_version 94010 (0.0006) [2023-03-07 16:07:25,766][213771] Updated weights for policy 0, policy_version 94020 (0.0006) [2023-03-07 16:07:26,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 96280576. Throughput: 0: 13258.9. Samples: 96277451. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:07:26,105][213445] Avg episode reward: [(0, '4258.991')] [2023-03-07 16:07:26,533][213771] Updated weights for policy 0, policy_version 94030 (0.0007) [2023-03-07 16:07:27,304][213771] Updated weights for policy 0, policy_version 94040 (0.0006) [2023-03-07 16:07:28,076][213771] Updated weights for policy 0, policy_version 94050 (0.0005) [2023-03-07 16:07:28,855][213771] Updated weights for policy 0, policy_version 94060 (0.0006) [2023-03-07 16:07:29,638][213771] Updated weights for policy 0, policy_version 94070 (0.0005) [2023-03-07 16:07:30,400][213771] Updated weights for policy 0, policy_version 94080 (0.0006) [2023-03-07 16:07:31,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13256.5). Total num frames: 96347136. Throughput: 0: 13263.1. Samples: 96317241. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:07:31,106][213445] Avg episode reward: [(0, '4208.300')] [2023-03-07 16:07:31,177][213771] Updated weights for policy 0, policy_version 94090 (0.0007) [2023-03-07 16:07:31,958][213771] Updated weights for policy 0, policy_version 94100 (0.0006) [2023-03-07 16:07:32,725][213771] Updated weights for policy 0, policy_version 94110 (0.0007) [2023-03-07 16:07:33,490][213771] Updated weights for policy 0, policy_version 94120 (0.0006) [2023-03-07 16:07:34,263][213771] Updated weights for policy 0, policy_version 94130 (0.0006) [2023-03-07 16:07:35,037][213771] Updated weights for policy 0, policy_version 94140 (0.0006) [2023-03-07 16:07:35,829][213771] Updated weights for policy 0, policy_version 94150 (0.0007) [2023-03-07 16:07:36,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 96412672. Throughput: 0: 13265.3. Samples: 96396501. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:07:36,106][213445] Avg episode reward: [(0, '4213.740')] [2023-03-07 16:07:36,592][213771] Updated weights for policy 0, policy_version 94160 (0.0007) [2023-03-07 16:07:37,389][213771] Updated weights for policy 0, policy_version 94170 (0.0007) [2023-03-07 16:07:38,169][213771] Updated weights for policy 0, policy_version 94180 (0.0007) [2023-03-07 16:07:38,932][213771] Updated weights for policy 0, policy_version 94190 (0.0006) [2023-03-07 16:07:39,690][213771] Updated weights for policy 0, policy_version 94200 (0.0007) [2023-03-07 16:07:40,471][213771] Updated weights for policy 0, policy_version 94210 (0.0005) [2023-03-07 16:07:41,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 96479232. Throughput: 0: 13259.5. Samples: 96475963. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:07:41,106][213445] Avg episode reward: [(0, '4108.613')] [2023-03-07 16:07:41,236][213771] Updated weights for policy 0, policy_version 94220 (0.0006) [2023-03-07 16:07:41,989][213771] Updated weights for policy 0, policy_version 94230 (0.0006) [2023-03-07 16:07:42,768][213771] Updated weights for policy 0, policy_version 94240 (0.0006) [2023-03-07 16:07:43,554][213771] Updated weights for policy 0, policy_version 94250 (0.0006) [2023-03-07 16:07:44,307][213771] Updated weights for policy 0, policy_version 94260 (0.0006) [2023-03-07 16:07:45,083][213771] Updated weights for policy 0, policy_version 94270 (0.0006) [2023-03-07 16:07:45,850][213771] Updated weights for policy 0, policy_version 94280 (0.0006) [2023-03-07 16:07:46,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 96545792. Throughput: 0: 13252.8. Samples: 96515818. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:07:46,105][213445] Avg episode reward: [(0, '4157.778')] [2023-03-07 16:07:46,618][213771] Updated weights for policy 0, policy_version 94290 (0.0006) [2023-03-07 16:07:47,391][213771] Updated weights for policy 0, policy_version 94300 (0.0006) [2023-03-07 16:07:48,160][213771] Updated weights for policy 0, policy_version 94310 (0.0006) [2023-03-07 16:07:48,932][213771] Updated weights for policy 0, policy_version 94320 (0.0006) [2023-03-07 16:07:49,694][213771] Updated weights for policy 0, policy_version 94330 (0.0006) [2023-03-07 16:07:50,481][213771] Updated weights for policy 0, policy_version 94340 (0.0006) [2023-03-07 16:07:51,105][213445] Fps is (10 sec: 13312.3, 60 sec: 13277.9, 300 sec: 13253.0). Total num frames: 96612352. Throughput: 0: 13255.2. Samples: 96595634. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:07:51,105][213445] Avg episode reward: [(0, '4049.896')] [2023-03-07 16:07:51,251][213771] Updated weights for policy 0, policy_version 94350 (0.0006) [2023-03-07 16:07:52,011][213771] Updated weights for policy 0, policy_version 94360 (0.0007) [2023-03-07 16:07:52,788][213771] Updated weights for policy 0, policy_version 94370 (0.0007) [2023-03-07 16:07:53,575][213771] Updated weights for policy 0, policy_version 94380 (0.0006) [2023-03-07 16:07:54,336][213771] Updated weights for policy 0, policy_version 94390 (0.0006) [2023-03-07 16:07:55,108][213771] Updated weights for policy 0, policy_version 94400 (0.0006) [2023-03-07 16:07:55,874][213771] Updated weights for policy 0, policy_version 94410 (0.0006) [2023-03-07 16:07:56,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 96677888. Throughput: 0: 13256.9. Samples: 96675297. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:07:56,106][213445] Avg episode reward: [(0, '4090.861')] [2023-03-07 16:07:56,662][213771] Updated weights for policy 0, policy_version 94420 (0.0006) [2023-03-07 16:07:57,433][213771] Updated weights for policy 0, policy_version 94430 (0.0006) [2023-03-07 16:07:58,221][213771] Updated weights for policy 0, policy_version 94440 (0.0006) [2023-03-07 16:07:58,996][213771] Updated weights for policy 0, policy_version 94450 (0.0006) [2023-03-07 16:07:59,771][213771] Updated weights for policy 0, policy_version 94460 (0.0005) [2023-03-07 16:08:00,536][213771] Updated weights for policy 0, policy_version 94470 (0.0006) [2023-03-07 16:08:01,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 96744448. Throughput: 0: 13248.8. Samples: 96714754. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:08:01,106][213445] Avg episode reward: [(0, '4105.220')] [2023-03-07 16:08:01,307][213771] Updated weights for policy 0, policy_version 94480 (0.0006) [2023-03-07 16:08:02,087][213771] Updated weights for policy 0, policy_version 94490 (0.0006) [2023-03-07 16:08:02,854][213771] Updated weights for policy 0, policy_version 94500 (0.0005) [2023-03-07 16:08:03,613][213771] Updated weights for policy 0, policy_version 94510 (0.0005) [2023-03-07 16:08:04,379][213771] Updated weights for policy 0, policy_version 94520 (0.0006) [2023-03-07 16:08:05,158][213771] Updated weights for policy 0, policy_version 94530 (0.0006) [2023-03-07 16:08:05,929][213771] Updated weights for policy 0, policy_version 94540 (0.0006) [2023-03-07 16:08:06,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 96811008. Throughput: 0: 13256.5. Samples: 96794430. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:08:06,106][213445] Avg episode reward: [(0, '4207.804')] [2023-03-07 16:08:06,111][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000094542_96811008.pth... [2023-03-07 16:08:06,143][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000091436_93630464.pth [2023-03-07 16:08:06,709][213771] Updated weights for policy 0, policy_version 94550 (0.0006) [2023-03-07 16:08:07,476][213771] Updated weights for policy 0, policy_version 94560 (0.0005) [2023-03-07 16:08:08,234][213771] Updated weights for policy 0, policy_version 94570 (0.0006) [2023-03-07 16:08:09,018][213771] Updated weights for policy 0, policy_version 94580 (0.0005) [2023-03-07 16:08:09,767][213771] Updated weights for policy 0, policy_version 94590 (0.0006) [2023-03-07 16:08:10,546][213771] Updated weights for policy 0, policy_version 94600 (0.0007) [2023-03-07 16:08:11,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 96877568. Throughput: 0: 13259.6. Samples: 96874134. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:08:11,105][213445] Avg episode reward: [(0, '4158.773')] [2023-03-07 16:08:11,323][213771] Updated weights for policy 0, policy_version 94610 (0.0007) [2023-03-07 16:08:12,104][213771] Updated weights for policy 0, policy_version 94620 (0.0007) [2023-03-07 16:08:12,882][213771] Updated weights for policy 0, policy_version 94630 (0.0006) [2023-03-07 16:08:13,670][213771] Updated weights for policy 0, policy_version 94640 (0.0006) [2023-03-07 16:08:14,429][213771] Updated weights for policy 0, policy_version 94650 (0.0006) [2023-03-07 16:08:15,198][213771] Updated weights for policy 0, policy_version 94660 (0.0005) [2023-03-07 16:08:15,972][213771] Updated weights for policy 0, policy_version 94670 (0.0006) [2023-03-07 16:08:16,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13243.7, 300 sec: 13253.0). Total num frames: 96943104. Throughput: 0: 13250.8. Samples: 96913528. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:08:16,106][213445] Avg episode reward: [(0, '4172.116')] [2023-03-07 16:08:16,753][213771] Updated weights for policy 0, policy_version 94680 (0.0006) [2023-03-07 16:08:17,527][213771] Updated weights for policy 0, policy_version 94690 (0.0008) [2023-03-07 16:08:18,291][213771] Updated weights for policy 0, policy_version 94700 (0.0006) [2023-03-07 16:08:19,057][213771] Updated weights for policy 0, policy_version 94710 (0.0006) [2023-03-07 16:08:19,832][213771] Updated weights for policy 0, policy_version 94720 (0.0007) [2023-03-07 16:08:20,606][213771] Updated weights for policy 0, policy_version 94730 (0.0007) [2023-03-07 16:08:21,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 97009664. Throughput: 0: 13257.6. Samples: 96993092. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:08:21,106][213445] Avg episode reward: [(0, '4223.744')] [2023-03-07 16:08:21,402][213771] Updated weights for policy 0, policy_version 94740 (0.0006) [2023-03-07 16:08:22,179][213771] Updated weights for policy 0, policy_version 94750 (0.0006) [2023-03-07 16:08:22,958][213771] Updated weights for policy 0, policy_version 94760 (0.0006) [2023-03-07 16:08:23,718][213771] Updated weights for policy 0, policy_version 94770 (0.0007) [2023-03-07 16:08:24,499][213771] Updated weights for policy 0, policy_version 94780 (0.0006) [2023-03-07 16:08:25,263][213771] Updated weights for policy 0, policy_version 94790 (0.0006) [2023-03-07 16:08:26,064][213771] Updated weights for policy 0, policy_version 94800 (0.0005) [2023-03-07 16:08:26,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13253.0). Total num frames: 97075200. Throughput: 0: 13254.8. Samples: 97072430. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:08:26,106][213445] Avg episode reward: [(0, '4155.815')] [2023-03-07 16:08:26,829][213771] Updated weights for policy 0, policy_version 94810 (0.0006) [2023-03-07 16:08:27,611][213771] Updated weights for policy 0, policy_version 94820 (0.0006) [2023-03-07 16:08:28,379][213771] Updated weights for policy 0, policy_version 94830 (0.0006) [2023-03-07 16:08:29,152][213771] Updated weights for policy 0, policy_version 94840 (0.0006) [2023-03-07 16:08:29,909][213771] Updated weights for policy 0, policy_version 94850 (0.0006) [2023-03-07 16:08:30,681][213771] Updated weights for policy 0, policy_version 94860 (0.0007) [2023-03-07 16:08:31,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 97141760. Throughput: 0: 13252.6. Samples: 97112185. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:08:31,106][213445] Avg episode reward: [(0, '4150.080')] [2023-03-07 16:08:31,449][213771] Updated weights for policy 0, policy_version 94870 (0.0006) [2023-03-07 16:08:32,244][213771] Updated weights for policy 0, policy_version 94880 (0.0006) [2023-03-07 16:08:32,994][213771] Updated weights for policy 0, policy_version 94890 (0.0006) [2023-03-07 16:08:33,769][213771] Updated weights for policy 0, policy_version 94900 (0.0006) [2023-03-07 16:08:34,537][213771] Updated weights for policy 0, policy_version 94910 (0.0007) [2023-03-07 16:08:35,322][213771] Updated weights for policy 0, policy_version 94920 (0.0006) [2023-03-07 16:08:36,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 97207296. Throughput: 0: 13243.0. Samples: 97191570. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:08:36,106][213445] Avg episode reward: [(0, '4058.172')] [2023-03-07 16:08:36,113][213771] Updated weights for policy 0, policy_version 94930 (0.0006) [2023-03-07 16:08:36,902][213771] Updated weights for policy 0, policy_version 94940 (0.0006) [2023-03-07 16:08:37,670][213771] Updated weights for policy 0, policy_version 94950 (0.0006) [2023-03-07 16:08:38,426][213771] Updated weights for policy 0, policy_version 94960 (0.0006) [2023-03-07 16:08:39,214][213771] Updated weights for policy 0, policy_version 94970 (0.0006) [2023-03-07 16:08:39,978][213771] Updated weights for policy 0, policy_version 94980 (0.0006) [2023-03-07 16:08:40,753][213771] Updated weights for policy 0, policy_version 94990 (0.0006) [2023-03-07 16:08:41,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 97273856. Throughput: 0: 13232.8. Samples: 97270773. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:08:41,106][213445] Avg episode reward: [(0, '4108.116')] [2023-03-07 16:08:41,537][213771] Updated weights for policy 0, policy_version 95000 (0.0006) [2023-03-07 16:08:42,321][213771] Updated weights for policy 0, policy_version 95010 (0.0006) [2023-03-07 16:08:43,092][213771] Updated weights for policy 0, policy_version 95020 (0.0006) [2023-03-07 16:08:43,865][213771] Updated weights for policy 0, policy_version 95030 (0.0007) [2023-03-07 16:08:44,642][213771] Updated weights for policy 0, policy_version 95040 (0.0006) [2023-03-07 16:08:45,388][213771] Updated weights for policy 0, policy_version 95050 (0.0006) [2023-03-07 16:08:46,105][213445] Fps is (10 sec: 13311.8, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 97340416. Throughput: 0: 13234.2. Samples: 97310294. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:08:46,106][213445] Avg episode reward: [(0, '4218.503')] [2023-03-07 16:08:46,166][213771] Updated weights for policy 0, policy_version 95060 (0.0006) [2023-03-07 16:08:46,948][213771] Updated weights for policy 0, policy_version 95070 (0.0006) [2023-03-07 16:08:47,722][213771] Updated weights for policy 0, policy_version 95080 (0.0006) [2023-03-07 16:08:48,481][213771] Updated weights for policy 0, policy_version 95090 (0.0007) [2023-03-07 16:08:49,271][213771] Updated weights for policy 0, policy_version 95100 (0.0006) [2023-03-07 16:08:50,042][213771] Updated weights for policy 0, policy_version 95110 (0.0007) [2023-03-07 16:08:50,800][213771] Updated weights for policy 0, policy_version 95120 (0.0006) [2023-03-07 16:08:51,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13246.0). Total num frames: 97405952. Throughput: 0: 13231.8. Samples: 97389860. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:08:51,106][213445] Avg episode reward: [(0, '4178.739')] [2023-03-07 16:08:51,594][213771] Updated weights for policy 0, policy_version 95130 (0.0006) [2023-03-07 16:08:52,361][213771] Updated weights for policy 0, policy_version 95140 (0.0006) [2023-03-07 16:08:53,124][213771] Updated weights for policy 0, policy_version 95150 (0.0007) [2023-03-07 16:08:53,907][213771] Updated weights for policy 0, policy_version 95160 (0.0007) [2023-03-07 16:08:54,688][213771] Updated weights for policy 0, policy_version 95170 (0.0007) [2023-03-07 16:08:55,449][213771] Updated weights for policy 0, policy_version 95180 (0.0006) [2023-03-07 16:08:56,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 97472512. Throughput: 0: 13226.0. Samples: 97469306. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:08:56,106][213445] Avg episode reward: [(0, '4222.422')] [2023-03-07 16:08:56,247][213771] Updated weights for policy 0, policy_version 95190 (0.0006) [2023-03-07 16:08:57,006][213771] Updated weights for policy 0, policy_version 95200 (0.0006) [2023-03-07 16:08:57,792][213771] Updated weights for policy 0, policy_version 95210 (0.0007) [2023-03-07 16:08:58,565][213771] Updated weights for policy 0, policy_version 95220 (0.0007) [2023-03-07 16:08:59,335][213771] Updated weights for policy 0, policy_version 95230 (0.0007) [2023-03-07 16:09:00,112][213771] Updated weights for policy 0, policy_version 95240 (0.0006) [2023-03-07 16:09:00,901][213771] Updated weights for policy 0, policy_version 95250 (0.0006) [2023-03-07 16:09:01,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13246.0). Total num frames: 97538048. Throughput: 0: 13229.8. Samples: 97508866. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:09:01,106][213445] Avg episode reward: [(0, '4222.360')] [2023-03-07 16:09:01,658][213771] Updated weights for policy 0, policy_version 95260 (0.0006) [2023-03-07 16:09:02,424][213771] Updated weights for policy 0, policy_version 95270 (0.0007) [2023-03-07 16:09:03,192][213771] Updated weights for policy 0, policy_version 95280 (0.0006) [2023-03-07 16:09:03,966][213771] Updated weights for policy 0, policy_version 95290 (0.0006) [2023-03-07 16:09:04,757][213771] Updated weights for policy 0, policy_version 95300 (0.0006) [2023-03-07 16:09:05,513][213771] Updated weights for policy 0, policy_version 95310 (0.0007) [2023-03-07 16:09:06,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13226.7, 300 sec: 13249.5). Total num frames: 97604608. Throughput: 0: 13229.6. Samples: 97588422. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:09:06,106][213445] Avg episode reward: [(0, '4238.103')] [2023-03-07 16:09:06,295][213771] Updated weights for policy 0, policy_version 95320 (0.0007) [2023-03-07 16:09:07,069][213771] Updated weights for policy 0, policy_version 95330 (0.0006) [2023-03-07 16:09:07,856][213771] Updated weights for policy 0, policy_version 95340 (0.0006) [2023-03-07 16:09:08,636][213771] Updated weights for policy 0, policy_version 95350 (0.0006) [2023-03-07 16:09:09,401][213771] Updated weights for policy 0, policy_version 95360 (0.0006) [2023-03-07 16:09:10,173][213771] Updated weights for policy 0, policy_version 95370 (0.0005) [2023-03-07 16:09:10,959][213771] Updated weights for policy 0, policy_version 95380 (0.0005) [2023-03-07 16:09:11,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13209.6, 300 sec: 13246.0). Total num frames: 97670144. Throughput: 0: 13223.9. Samples: 97667503. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:09:11,106][213445] Avg episode reward: [(0, '4237.211')] [2023-03-07 16:09:11,731][213771] Updated weights for policy 0, policy_version 95390 (0.0006) [2023-03-07 16:09:12,493][213771] Updated weights for policy 0, policy_version 95400 (0.0007) [2023-03-07 16:09:13,278][213771] Updated weights for policy 0, policy_version 95410 (0.0006) [2023-03-07 16:09:14,042][213771] Updated weights for policy 0, policy_version 95420 (0.0005) [2023-03-07 16:09:14,820][213771] Updated weights for policy 0, policy_version 95430 (0.0007) [2023-03-07 16:09:15,606][213771] Updated weights for policy 0, policy_version 95440 (0.0006) [2023-03-07 16:09:16,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13226.7, 300 sec: 13246.0). Total num frames: 97736704. Throughput: 0: 13223.0. Samples: 97707219. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:09:16,106][213445] Avg episode reward: [(0, '4201.235')] [2023-03-07 16:09:16,393][213771] Updated weights for policy 0, policy_version 95450 (0.0007) [2023-03-07 16:09:17,165][213771] Updated weights for policy 0, policy_version 95460 (0.0006) [2023-03-07 16:09:17,941][213771] Updated weights for policy 0, policy_version 95470 (0.0006) [2023-03-07 16:09:18,696][213771] Updated weights for policy 0, policy_version 95480 (0.0006) [2023-03-07 16:09:19,500][213771] Updated weights for policy 0, policy_version 95490 (0.0006) [2023-03-07 16:09:20,279][213771] Updated weights for policy 0, policy_version 95500 (0.0006) [2023-03-07 16:09:21,043][213771] Updated weights for policy 0, policy_version 95510 (0.0006) [2023-03-07 16:09:21,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13226.7, 300 sec: 13246.1). Total num frames: 97803264. Throughput: 0: 13217.6. Samples: 97786361. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 16:09:21,105][213445] Avg episode reward: [(0, '4153.799')] [2023-03-07 16:09:21,806][213771] Updated weights for policy 0, policy_version 95520 (0.0006) [2023-03-07 16:09:22,575][213771] Updated weights for policy 0, policy_version 95530 (0.0005) [2023-03-07 16:09:23,354][213771] Updated weights for policy 0, policy_version 95540 (0.0006) [2023-03-07 16:09:24,137][213771] Updated weights for policy 0, policy_version 95550 (0.0006) [2023-03-07 16:09:24,894][213771] Updated weights for policy 0, policy_version 95560 (0.0006) [2023-03-07 16:09:25,681][213771] Updated weights for policy 0, policy_version 95570 (0.0005) [2023-03-07 16:09:26,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13246.1). Total num frames: 97868800. Throughput: 0: 13223.2. Samples: 97865814. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 16:09:26,106][213445] Avg episode reward: [(0, '4137.600')] [2023-03-07 16:09:26,432][213771] Updated weights for policy 0, policy_version 95580 (0.0006) [2023-03-07 16:09:27,231][213771] Updated weights for policy 0, policy_version 95590 (0.0006) [2023-03-07 16:09:27,994][213771] Updated weights for policy 0, policy_version 95600 (0.0006) [2023-03-07 16:09:28,760][213771] Updated weights for policy 0, policy_version 95610 (0.0006) [2023-03-07 16:09:29,529][213771] Updated weights for policy 0, policy_version 95620 (0.0007) [2023-03-07 16:09:30,292][213771] Updated weights for policy 0, policy_version 95630 (0.0006) [2023-03-07 16:09:31,063][213771] Updated weights for policy 0, policy_version 95640 (0.0005) [2023-03-07 16:09:31,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13226.6, 300 sec: 13249.5). Total num frames: 97935360. Throughput: 0: 13234.5. Samples: 97905847. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 16:09:31,106][213445] Avg episode reward: [(0, '4009.571')] [2023-03-07 16:09:31,836][213771] Updated weights for policy 0, policy_version 95650 (0.0006) [2023-03-07 16:09:32,606][213771] Updated weights for policy 0, policy_version 95660 (0.0006) [2023-03-07 16:09:33,372][213771] Updated weights for policy 0, policy_version 95670 (0.0006) [2023-03-07 16:09:34,151][213771] Updated weights for policy 0, policy_version 95680 (0.0006) [2023-03-07 16:09:34,914][213771] Updated weights for policy 0, policy_version 95690 (0.0006) [2023-03-07 16:09:35,700][213771] Updated weights for policy 0, policy_version 95700 (0.0006) [2023-03-07 16:09:36,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13243.8, 300 sec: 13249.5). Total num frames: 98001920. Throughput: 0: 13233.9. Samples: 97985386. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 16:09:36,106][213445] Avg episode reward: [(0, '4083.389')] [2023-03-07 16:09:36,480][213771] Updated weights for policy 0, policy_version 95710 (0.0007) [2023-03-07 16:09:37,247][213771] Updated weights for policy 0, policy_version 95720 (0.0006) [2023-03-07 16:09:38,035][213771] Updated weights for policy 0, policy_version 95730 (0.0006) [2023-03-07 16:09:38,795][213771] Updated weights for policy 0, policy_version 95740 (0.0006) [2023-03-07 16:09:39,569][213771] Updated weights for policy 0, policy_version 95750 (0.0007) [2023-03-07 16:09:40,349][213771] Updated weights for policy 0, policy_version 95760 (0.0006) [2023-03-07 16:09:41,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13249.5). Total num frames: 98067456. Throughput: 0: 13229.4. Samples: 98064626. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 16:09:41,106][213445] Avg episode reward: [(0, '4151.698')] [2023-03-07 16:09:41,137][213771] Updated weights for policy 0, policy_version 95770 (0.0006) [2023-03-07 16:09:41,910][213771] Updated weights for policy 0, policy_version 95780 (0.0006) [2023-03-07 16:09:42,687][213771] Updated weights for policy 0, policy_version 95790 (0.0006) [2023-03-07 16:09:43,457][213771] Updated weights for policy 0, policy_version 95800 (0.0006) [2023-03-07 16:09:44,215][213771] Updated weights for policy 0, policy_version 95810 (0.0006) [2023-03-07 16:09:44,988][213771] Updated weights for policy 0, policy_version 95820 (0.0006) [2023-03-07 16:09:45,762][213771] Updated weights for policy 0, policy_version 95830 (0.0006) [2023-03-07 16:09:46,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13226.7, 300 sec: 13249.5). Total num frames: 98134016. Throughput: 0: 13232.3. Samples: 98104319. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 16:09:46,106][213445] Avg episode reward: [(0, '4089.885')] [2023-03-07 16:09:46,540][213771] Updated weights for policy 0, policy_version 95840 (0.0005) [2023-03-07 16:09:47,315][213771] Updated weights for policy 0, policy_version 95850 (0.0006) [2023-03-07 16:09:48,086][213771] Updated weights for policy 0, policy_version 95860 (0.0007) [2023-03-07 16:09:48,856][213771] Updated weights for policy 0, policy_version 95870 (0.0005) [2023-03-07 16:09:49,621][213771] Updated weights for policy 0, policy_version 95880 (0.0006) [2023-03-07 16:09:50,401][213771] Updated weights for policy 0, policy_version 95890 (0.0006) [2023-03-07 16:09:51,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13246.1). Total num frames: 98199552. Throughput: 0: 13230.4. Samples: 98183788. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 16:09:51,105][213445] Avg episode reward: [(0, '4158.981')] [2023-03-07 16:09:51,189][213771] Updated weights for policy 0, policy_version 95900 (0.0006) [2023-03-07 16:09:51,962][213771] Updated weights for policy 0, policy_version 95910 (0.0006) [2023-03-07 16:09:52,730][213771] Updated weights for policy 0, policy_version 95920 (0.0006) [2023-03-07 16:09:53,506][213771] Updated weights for policy 0, policy_version 95930 (0.0006) [2023-03-07 16:09:54,289][213771] Updated weights for policy 0, policy_version 95940 (0.0006) [2023-03-07 16:09:55,058][213771] Updated weights for policy 0, policy_version 95950 (0.0005) [2023-03-07 16:09:55,817][213771] Updated weights for policy 0, policy_version 95960 (0.0006) [2023-03-07 16:09:56,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13249.5). Total num frames: 98266112. Throughput: 0: 13238.6. Samples: 98263241. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 16:09:56,106][213445] Avg episode reward: [(0, '4162.237')] [2023-03-07 16:09:56,586][213771] Updated weights for policy 0, policy_version 95970 (0.0005) [2023-03-07 16:09:57,361][213771] Updated weights for policy 0, policy_version 95980 (0.0006) [2023-03-07 16:09:58,117][213771] Updated weights for policy 0, policy_version 95990 (0.0006) [2023-03-07 16:09:58,899][213771] Updated weights for policy 0, policy_version 96000 (0.0007) [2023-03-07 16:09:59,685][213771] Updated weights for policy 0, policy_version 96010 (0.0006) [2023-03-07 16:10:00,452][213771] Updated weights for policy 0, policy_version 96020 (0.0006) [2023-03-07 16:10:01,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 98332672. Throughput: 0: 13242.5. Samples: 98303131. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 16:10:01,106][213445] Avg episode reward: [(0, '4167.280')] [2023-03-07 16:10:01,226][213771] Updated weights for policy 0, policy_version 96030 (0.0006) [2023-03-07 16:10:02,010][213771] Updated weights for policy 0, policy_version 96040 (0.0006) [2023-03-07 16:10:02,794][213771] Updated weights for policy 0, policy_version 96050 (0.0006) [2023-03-07 16:10:03,554][213771] Updated weights for policy 0, policy_version 96060 (0.0007) [2023-03-07 16:10:04,343][213771] Updated weights for policy 0, policy_version 96070 (0.0006) [2023-03-07 16:10:05,112][213771] Updated weights for policy 0, policy_version 96080 (0.0006) [2023-03-07 16:10:05,881][213771] Updated weights for policy 0, policy_version 96090 (0.0006) [2023-03-07 16:10:06,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.6, 300 sec: 13246.0). Total num frames: 98398208. Throughput: 0: 13239.9. Samples: 98382157. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 16:10:06,106][213445] Avg episode reward: [(0, '4112.315')] [2023-03-07 16:10:06,122][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000096093_98399232.pth... [2023-03-07 16:10:06,151][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000092988_95219712.pth [2023-03-07 16:10:06,662][213771] Updated weights for policy 0, policy_version 96100 (0.0006) [2023-03-07 16:10:07,441][213771] Updated weights for policy 0, policy_version 96110 (0.0007) [2023-03-07 16:10:08,206][213771] Updated weights for policy 0, policy_version 96120 (0.0005) [2023-03-07 16:10:08,997][213771] Updated weights for policy 0, policy_version 96130 (0.0005) [2023-03-07 16:10:09,770][213771] Updated weights for policy 0, policy_version 96140 (0.0007) [2023-03-07 16:10:10,538][213771] Updated weights for policy 0, policy_version 96150 (0.0006) [2023-03-07 16:10:11,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13246.1). Total num frames: 98464768. Throughput: 0: 13237.8. Samples: 98461517. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 16:10:11,106][213445] Avg episode reward: [(0, '4089.299')] [2023-03-07 16:10:11,306][213771] Updated weights for policy 0, policy_version 96160 (0.0006) [2023-03-07 16:10:12,082][213771] Updated weights for policy 0, policy_version 96170 (0.0006) [2023-03-07 16:10:12,848][213771] Updated weights for policy 0, policy_version 96180 (0.0005) [2023-03-07 16:10:13,627][213771] Updated weights for policy 0, policy_version 96190 (0.0006) [2023-03-07 16:10:14,389][213771] Updated weights for policy 0, policy_version 96200 (0.0006) [2023-03-07 16:10:15,158][213771] Updated weights for policy 0, policy_version 96210 (0.0005) [2023-03-07 16:10:15,922][213771] Updated weights for policy 0, policy_version 96220 (0.0006) [2023-03-07 16:10:16,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 98531328. Throughput: 0: 13236.4. Samples: 98501485. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:10:16,106][213445] Avg episode reward: [(0, '4108.276')] [2023-03-07 16:10:16,701][213771] Updated weights for policy 0, policy_version 96230 (0.0006) [2023-03-07 16:10:17,486][213771] Updated weights for policy 0, policy_version 96240 (0.0007) [2023-03-07 16:10:18,243][213771] Updated weights for policy 0, policy_version 96250 (0.0006) [2023-03-07 16:10:19,013][213771] Updated weights for policy 0, policy_version 96260 (0.0006) [2023-03-07 16:10:19,766][213771] Updated weights for policy 0, policy_version 96270 (0.0007) [2023-03-07 16:10:20,571][213771] Updated weights for policy 0, policy_version 96280 (0.0007) [2023-03-07 16:10:21,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 98597888. Throughput: 0: 13242.6. Samples: 98581303. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:10:21,106][213445] Avg episode reward: [(0, '4104.208')] [2023-03-07 16:10:21,338][213771] Updated weights for policy 0, policy_version 96290 (0.0006) [2023-03-07 16:10:22,101][213771] Updated weights for policy 0, policy_version 96300 (0.0006) [2023-03-07 16:10:22,890][213771] Updated weights for policy 0, policy_version 96310 (0.0006) [2023-03-07 16:10:23,650][213771] Updated weights for policy 0, policy_version 96320 (0.0006) [2023-03-07 16:10:24,417][213771] Updated weights for policy 0, policy_version 96330 (0.0006) [2023-03-07 16:10:25,190][213771] Updated weights for policy 0, policy_version 96340 (0.0006) [2023-03-07 16:10:25,959][213771] Updated weights for policy 0, policy_version 96350 (0.0006) [2023-03-07 16:10:26,105][213445] Fps is (10 sec: 13209.9, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 98663424. Throughput: 0: 13247.8. Samples: 98660775. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:10:26,105][213445] Avg episode reward: [(0, '4141.376')] [2023-03-07 16:10:26,731][213771] Updated weights for policy 0, policy_version 96360 (0.0006) [2023-03-07 16:10:27,513][213771] Updated weights for policy 0, policy_version 96370 (0.0006) [2023-03-07 16:10:28,259][213771] Updated weights for policy 0, policy_version 96380 (0.0005) [2023-03-07 16:10:29,042][213771] Updated weights for policy 0, policy_version 96390 (0.0006) [2023-03-07 16:10:29,814][213771] Updated weights for policy 0, policy_version 96400 (0.0006) [2023-03-07 16:10:30,591][213771] Updated weights for policy 0, policy_version 96410 (0.0006) [2023-03-07 16:10:31,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 98729984. Throughput: 0: 13248.4. Samples: 98700496. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:10:31,106][213445] Avg episode reward: [(0, '4025.449')] [2023-03-07 16:10:31,374][213771] Updated weights for policy 0, policy_version 96420 (0.0007) [2023-03-07 16:10:32,134][213771] Updated weights for policy 0, policy_version 96430 (0.0006) [2023-03-07 16:10:32,913][213771] Updated weights for policy 0, policy_version 96440 (0.0006) [2023-03-07 16:10:33,676][213771] Updated weights for policy 0, policy_version 96450 (0.0006) [2023-03-07 16:10:34,448][213771] Updated weights for policy 0, policy_version 96460 (0.0007) [2023-03-07 16:10:35,220][213771] Updated weights for policy 0, policy_version 96470 (0.0007) [2023-03-07 16:10:35,992][213771] Updated weights for policy 0, policy_version 96480 (0.0007) [2023-03-07 16:10:36,105][213445] Fps is (10 sec: 13311.9, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 98796544. Throughput: 0: 13254.4. Samples: 98780239. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:10:36,106][213445] Avg episode reward: [(0, '4068.917')] [2023-03-07 16:10:36,745][213771] Updated weights for policy 0, policy_version 96490 (0.0005) [2023-03-07 16:10:37,522][213771] Updated weights for policy 0, policy_version 96500 (0.0006) [2023-03-07 16:10:38,287][213771] Updated weights for policy 0, policy_version 96510 (0.0006) [2023-03-07 16:10:39,065][213771] Updated weights for policy 0, policy_version 96520 (0.0006) [2023-03-07 16:10:39,851][213771] Updated weights for policy 0, policy_version 96530 (0.0006) [2023-03-07 16:10:40,635][213771] Updated weights for policy 0, policy_version 96540 (0.0006) [2023-03-07 16:10:41,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 98863104. Throughput: 0: 13253.9. Samples: 98859667. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:10:41,105][213445] Avg episode reward: [(0, '4029.766')] [2023-03-07 16:10:41,398][213771] Updated weights for policy 0, policy_version 96550 (0.0006) [2023-03-07 16:10:42,178][213771] Updated weights for policy 0, policy_version 96560 (0.0006) [2023-03-07 16:10:42,942][213771] Updated weights for policy 0, policy_version 96570 (0.0006) [2023-03-07 16:10:43,725][213771] Updated weights for policy 0, policy_version 96580 (0.0006) [2023-03-07 16:10:44,512][213771] Updated weights for policy 0, policy_version 96590 (0.0006) [2023-03-07 16:10:45,285][213771] Updated weights for policy 0, policy_version 96600 (0.0006) [2023-03-07 16:10:46,068][213771] Updated weights for policy 0, policy_version 96610 (0.0006) [2023-03-07 16:10:46,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 98928640. Throughput: 0: 13251.6. Samples: 98899454. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:10:46,106][213445] Avg episode reward: [(0, '3967.453')] [2023-03-07 16:10:46,844][213771] Updated weights for policy 0, policy_version 96620 (0.0006) [2023-03-07 16:10:47,621][213771] Updated weights for policy 0, policy_version 96630 (0.0007) [2023-03-07 16:10:48,396][213771] Updated weights for policy 0, policy_version 96640 (0.0006) [2023-03-07 16:10:49,165][213771] Updated weights for policy 0, policy_version 96650 (0.0006) [2023-03-07 16:10:49,946][213771] Updated weights for policy 0, policy_version 96660 (0.0006) [2023-03-07 16:10:50,706][213771] Updated weights for policy 0, policy_version 96670 (0.0006) [2023-03-07 16:10:51,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 98995200. Throughput: 0: 13254.5. Samples: 98978607. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:10:51,106][213445] Avg episode reward: [(0, '4029.482')] [2023-03-07 16:10:51,497][213771] Updated weights for policy 0, policy_version 96680 (0.0006) [2023-03-07 16:10:52,255][213771] Updated weights for policy 0, policy_version 96690 (0.0006) [2023-03-07 16:10:53,028][213771] Updated weights for policy 0, policy_version 96700 (0.0006) [2023-03-07 16:10:53,783][213771] Updated weights for policy 0, policy_version 96710 (0.0005) [2023-03-07 16:10:54,562][213771] Updated weights for policy 0, policy_version 96720 (0.0006) [2023-03-07 16:10:55,339][213771] Updated weights for policy 0, policy_version 96730 (0.0007) [2023-03-07 16:10:56,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 99060736. Throughput: 0: 13255.2. Samples: 99058003. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:10:56,106][213445] Avg episode reward: [(0, '4038.159')] [2023-03-07 16:10:56,109][213771] Updated weights for policy 0, policy_version 96740 (0.0006) [2023-03-07 16:10:56,890][213771] Updated weights for policy 0, policy_version 96750 (0.0006) [2023-03-07 16:10:57,657][213771] Updated weights for policy 0, policy_version 96760 (0.0006) [2023-03-07 16:10:58,463][213771] Updated weights for policy 0, policy_version 96770 (0.0007) [2023-03-07 16:10:59,221][213771] Updated weights for policy 0, policy_version 96780 (0.0006) [2023-03-07 16:11:00,006][213771] Updated weights for policy 0, policy_version 96790 (0.0006) [2023-03-07 16:11:00,767][213771] Updated weights for policy 0, policy_version 96800 (0.0005) [2023-03-07 16:11:01,105][213445] Fps is (10 sec: 13209.8, 60 sec: 13243.8, 300 sec: 13249.5). Total num frames: 99127296. Throughput: 0: 13247.3. Samples: 99097611. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:11:01,106][213445] Avg episode reward: [(0, '4149.080')] [2023-03-07 16:11:01,534][213771] Updated weights for policy 0, policy_version 96810 (0.0007) [2023-03-07 16:11:02,309][213771] Updated weights for policy 0, policy_version 96820 (0.0006) [2023-03-07 16:11:03,082][213771] Updated weights for policy 0, policy_version 96830 (0.0006) [2023-03-07 16:11:03,852][213771] Updated weights for policy 0, policy_version 96840 (0.0006) [2023-03-07 16:11:04,623][213771] Updated weights for policy 0, policy_version 96850 (0.0005) [2023-03-07 16:11:05,373][213771] Updated weights for policy 0, policy_version 96860 (0.0005) [2023-03-07 16:11:06,105][213445] Fps is (10 sec: 13312.2, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 99193856. Throughput: 0: 13240.3. Samples: 99177113. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:11:06,106][213445] Avg episode reward: [(0, '4119.953')] [2023-03-07 16:11:06,159][213771] Updated weights for policy 0, policy_version 96870 (0.0006) [2023-03-07 16:11:06,938][213771] Updated weights for policy 0, policy_version 96880 (0.0006) [2023-03-07 16:11:07,722][213771] Updated weights for policy 0, policy_version 96890 (0.0007) [2023-03-07 16:11:08,489][213771] Updated weights for policy 0, policy_version 96900 (0.0006) [2023-03-07 16:11:09,259][213771] Updated weights for policy 0, policy_version 96910 (0.0005) [2023-03-07 16:11:10,026][213771] Updated weights for policy 0, policy_version 96920 (0.0006) [2023-03-07 16:11:10,802][213771] Updated weights for policy 0, policy_version 96930 (0.0006) [2023-03-07 16:11:11,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13249.5). Total num frames: 99260416. Throughput: 0: 13241.0. Samples: 99256619. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:11:11,106][213445] Avg episode reward: [(0, '4136.703')] [2023-03-07 16:11:11,582][213771] Updated weights for policy 0, policy_version 96940 (0.0006) [2023-03-07 16:11:12,355][213771] Updated weights for policy 0, policy_version 96950 (0.0006) [2023-03-07 16:11:13,140][213771] Updated weights for policy 0, policy_version 96960 (0.0007) [2023-03-07 16:11:13,915][213771] Updated weights for policy 0, policy_version 96970 (0.0006) [2023-03-07 16:11:14,686][213771] Updated weights for policy 0, policy_version 96980 (0.0006) [2023-03-07 16:11:15,469][213771] Updated weights for policy 0, policy_version 96990 (0.0006) [2023-03-07 16:11:16,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13243.8, 300 sec: 13246.1). Total num frames: 99325952. Throughput: 0: 13238.1. Samples: 99296208. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:11:16,106][213445] Avg episode reward: [(0, '4158.635')] [2023-03-07 16:11:16,232][213771] Updated weights for policy 0, policy_version 97000 (0.0006) [2023-03-07 16:11:17,016][213771] Updated weights for policy 0, policy_version 97010 (0.0006) [2023-03-07 16:11:17,785][213771] Updated weights for policy 0, policy_version 97020 (0.0007) [2023-03-07 16:11:18,568][213771] Updated weights for policy 0, policy_version 97030 (0.0006) [2023-03-07 16:11:19,340][213771] Updated weights for policy 0, policy_version 97040 (0.0006) [2023-03-07 16:11:20,109][213771] Updated weights for policy 0, policy_version 97050 (0.0006) [2023-03-07 16:11:20,895][213771] Updated weights for policy 0, policy_version 97060 (0.0007) [2023-03-07 16:11:21,105][213445] Fps is (10 sec: 13107.1, 60 sec: 13226.7, 300 sec: 13242.6). Total num frames: 99391488. Throughput: 0: 13228.8. Samples: 99375533. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:11:21,106][213445] Avg episode reward: [(0, '4165.164')] [2023-03-07 16:11:21,685][213771] Updated weights for policy 0, policy_version 97070 (0.0006) [2023-03-07 16:11:22,449][213771] Updated weights for policy 0, policy_version 97080 (0.0006) [2023-03-07 16:11:23,224][213771] Updated weights for policy 0, policy_version 97090 (0.0006) [2023-03-07 16:11:23,996][213771] Updated weights for policy 0, policy_version 97100 (0.0005) [2023-03-07 16:11:24,770][213771] Updated weights for policy 0, policy_version 97110 (0.0006) [2023-03-07 16:11:25,526][213771] Updated weights for policy 0, policy_version 97120 (0.0006) [2023-03-07 16:11:26,105][213445] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13242.6). Total num frames: 99458048. Throughput: 0: 13230.9. Samples: 99455060. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:11:26,106][213445] Avg episode reward: [(0, '4180.574')] [2023-03-07 16:11:26,305][213771] Updated weights for policy 0, policy_version 97130 (0.0006) [2023-03-07 16:11:27,057][213771] Updated weights for policy 0, policy_version 97140 (0.0006) [2023-03-07 16:11:27,829][213771] Updated weights for policy 0, policy_version 97150 (0.0006) [2023-03-07 16:11:28,594][213771] Updated weights for policy 0, policy_version 97160 (0.0005) [2023-03-07 16:11:29,377][213771] Updated weights for policy 0, policy_version 97170 (0.0006) [2023-03-07 16:11:30,161][213771] Updated weights for policy 0, policy_version 97180 (0.0006) [2023-03-07 16:11:30,929][213771] Updated weights for policy 0, policy_version 97190 (0.0006) [2023-03-07 16:11:31,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 99524608. Throughput: 0: 13231.4. Samples: 99494869. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:11:31,106][213445] Avg episode reward: [(0, '4191.714')] [2023-03-07 16:11:31,726][213771] Updated weights for policy 0, policy_version 97200 (0.0006) [2023-03-07 16:11:32,481][213771] Updated weights for policy 0, policy_version 97210 (0.0007) [2023-03-07 16:11:33,289][213771] Updated weights for policy 0, policy_version 97220 (0.0007) [2023-03-07 16:11:34,050][213771] Updated weights for policy 0, policy_version 97230 (0.0006) [2023-03-07 16:11:34,841][213771] Updated weights for policy 0, policy_version 97240 (0.0007) [2023-03-07 16:11:35,604][213771] Updated weights for policy 0, policy_version 97250 (0.0006) [2023-03-07 16:11:36,105][213445] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13242.6). Total num frames: 99590144. Throughput: 0: 13227.7. Samples: 99573852. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:11:36,106][213445] Avg episode reward: [(0, '4120.605')] [2023-03-07 16:11:36,366][213771] Updated weights for policy 0, policy_version 97260 (0.0008) [2023-03-07 16:11:37,147][213771] Updated weights for policy 0, policy_version 97270 (0.0006) [2023-03-07 16:11:37,921][213771] Updated weights for policy 0, policy_version 97280 (0.0006) [2023-03-07 16:11:38,705][213771] Updated weights for policy 0, policy_version 97290 (0.0008) [2023-03-07 16:11:39,470][213771] Updated weights for policy 0, policy_version 97300 (0.0007) [2023-03-07 16:11:40,254][213771] Updated weights for policy 0, policy_version 97310 (0.0007) [2023-03-07 16:11:41,030][213771] Updated weights for policy 0, policy_version 97320 (0.0007) [2023-03-07 16:11:41,105][213445] Fps is (10 sec: 13107.2, 60 sec: 13209.6, 300 sec: 13239.1). Total num frames: 99655680. Throughput: 0: 13225.2. Samples: 99653136. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:11:41,116][213445] Avg episode reward: [(0, '4144.480')] [2023-03-07 16:11:41,806][213771] Updated weights for policy 0, policy_version 97330 (0.0006) [2023-03-07 16:11:42,570][213771] Updated weights for policy 0, policy_version 97340 (0.0005) [2023-03-07 16:11:43,338][213771] Updated weights for policy 0, policy_version 97350 (0.0006) [2023-03-07 16:11:44,118][213771] Updated weights for policy 0, policy_version 97360 (0.0006) [2023-03-07 16:11:44,883][213771] Updated weights for policy 0, policy_version 97370 (0.0006) [2023-03-07 16:11:45,651][213771] Updated weights for policy 0, policy_version 97380 (0.0006) [2023-03-07 16:11:46,105][213445] Fps is (10 sec: 13312.0, 60 sec: 13243.7, 300 sec: 13246.0). Total num frames: 99723264. Throughput: 0: 13228.9. Samples: 99692912. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:11:46,116][213445] Avg episode reward: [(0, '4186.370')] [2023-03-07 16:11:46,422][213771] Updated weights for policy 0, policy_version 97390 (0.0005) [2023-03-07 16:11:47,189][213771] Updated weights for policy 0, policy_version 97400 (0.0007) [2023-03-07 16:11:47,967][213771] Updated weights for policy 0, policy_version 97410 (0.0005) [2023-03-07 16:11:48,733][213771] Updated weights for policy 0, policy_version 97420 (0.0005) [2023-03-07 16:11:49,534][213771] Updated weights for policy 0, policy_version 97430 (0.0006) [2023-03-07 16:11:50,309][213771] Updated weights for policy 0, policy_version 97440 (0.0005) [2023-03-07 16:11:51,066][213771] Updated weights for policy 0, policy_version 97450 (0.0006) [2023-03-07 16:11:51,105][213445] Fps is (10 sec: 13312.1, 60 sec: 13226.7, 300 sec: 13242.6). Total num frames: 99788800. Throughput: 0: 13227.9. Samples: 99772370. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:11:51,116][213445] Avg episode reward: [(0, '4132.195')] [2023-03-07 16:11:51,837][213771] Updated weights for policy 0, policy_version 97460 (0.0007) [2023-03-07 16:11:52,611][213771] Updated weights for policy 0, policy_version 97470 (0.0006) [2023-03-07 16:11:53,378][213771] Updated weights for policy 0, policy_version 97480 (0.0006) [2023-03-07 16:11:54,145][213771] Updated weights for policy 0, policy_version 97490 (0.0006) [2023-03-07 16:11:54,926][213771] Updated weights for policy 0, policy_version 97500 (0.0007) [2023-03-07 16:11:55,695][213771] Updated weights for policy 0, policy_version 97510 (0.0006) [2023-03-07 16:11:56,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13243.8, 300 sec: 13242.6). Total num frames: 99855360. Throughput: 0: 13229.3. Samples: 99851938. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:11:56,116][213445] Avg episode reward: [(0, '4095.343')] [2023-03-07 16:11:56,476][213771] Updated weights for policy 0, policy_version 97520 (0.0006) [2023-03-07 16:11:57,250][213771] Updated weights for policy 0, policy_version 97530 (0.0006) [2023-03-07 16:11:58,030][213771] Updated weights for policy 0, policy_version 97540 (0.0007) [2023-03-07 16:11:58,794][213771] Updated weights for policy 0, policy_version 97550 (0.0007) [2023-03-07 16:11:59,579][213771] Updated weights for policy 0, policy_version 97560 (0.0006) [2023-03-07 16:12:00,337][213771] Updated weights for policy 0, policy_version 97570 (0.0005) [2023-03-07 16:12:01,105][213445] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13239.1). Total num frames: 99920896. Throughput: 0: 13231.0. Samples: 99891603. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:12:01,106][213771] Updated weights for policy 0, policy_version 97580 (0.0006) [2023-03-07 16:12:01,116][213445] Avg episode reward: [(0, '4043.203')] [2023-03-07 16:12:01,893][213771] Updated weights for policy 0, policy_version 97590 (0.0005) [2023-03-07 16:12:02,670][213771] Updated weights for policy 0, policy_version 97600 (0.0005) [2023-03-07 16:12:03,431][213771] Updated weights for policy 0, policy_version 97610 (0.0005) [2023-03-07 16:12:04,206][213771] Updated weights for policy 0, policy_version 97620 (0.0007) [2023-03-07 16:12:04,978][213771] Updated weights for policy 0, policy_version 97630 (0.0006) [2023-03-07 16:12:05,750][213771] Updated weights for policy 0, policy_version 97640 (0.0006) [2023-03-07 16:12:06,105][213445] Fps is (10 sec: 13209.4, 60 sec: 13226.6, 300 sec: 13239.1). Total num frames: 99987456. Throughput: 0: 13235.5. Samples: 99971130. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:12:06,116][213445] Avg episode reward: [(0, '4127.119')] [2023-03-07 16:12:06,122][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000097644_99987456.pth... [2023-03-07 16:12:06,153][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000094542_96811008.pth [2023-03-07 16:12:06,541][213771] Updated weights for policy 0, policy_version 97650 (0.0006) [2023-03-07 16:12:07,155][214069] Stopping RolloutWorker_w17... [2023-03-07 16:12:07,155][213933] Stopping RolloutWorker_w11... [2023-03-07 16:12:07,155][213839] Stopping RolloutWorker_w5... [2023-03-07 16:12:07,155][213973] Stopping RolloutWorker_w10... [2023-03-07 16:12:07,155][214204] Stopping RolloutWorker_w22... [2023-03-07 16:12:07,155][214198] Stopping RolloutWorker_w27... [2023-03-07 16:12:07,155][214036] Stopping RolloutWorker_w8... [2023-03-07 16:12:07,155][213972] Stopping RolloutWorker_w14... [2023-03-07 16:12:07,155][214073] Stopping RolloutWorker_w15... [2023-03-07 16:12:07,155][214069] Loop rollout_proc17_evt_loop terminating... [2023-03-07 16:12:07,155][213720] Stopping Batcher_0... [2023-03-07 16:12:07,155][213933] Loop rollout_proc11_evt_loop terminating... [2023-03-07 16:12:07,155][214239] Stopping RolloutWorker_w30... [2023-03-07 16:12:07,155][214036] Loop rollout_proc8_evt_loop terminating... [2023-03-07 16:12:07,155][214139] Stopping RolloutWorker_w24... [2023-03-07 16:12:07,155][214197] Stopping RolloutWorker_w26... [2023-03-07 16:12:07,156][214239] Loop rollout_proc30_evt_loop terminating... [2023-03-07 16:12:07,155][213839] Loop rollout_proc5_evt_loop terminating... [2023-03-07 16:12:07,155][213937] Stopping RolloutWorker_w21... [2023-03-07 16:12:07,155][214170] Stopping RolloutWorker_w25... [2023-03-07 16:12:07,155][213973] Loop rollout_proc10_evt_loop terminating... [2023-03-07 16:12:07,155][213972] Loop rollout_proc14_evt_loop terminating... [2023-03-07 16:12:07,155][214073] Loop rollout_proc15_evt_loop terminating... [2023-03-07 16:12:07,155][214204] Loop rollout_proc22_evt_loop terminating... [2023-03-07 16:12:07,155][214206] Stopping RolloutWorker_w29... [2023-03-07 16:12:07,155][214198] Loop rollout_proc27_evt_loop terminating... [2023-03-07 16:12:07,155][214254] Stopping RolloutWorker_w31... [2023-03-07 16:12:07,155][213807] Stopping RolloutWorker_w4... [2023-03-07 16:12:07,155][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000097658_100001792.pth... [2023-03-07 16:12:07,155][213935] Stopping RolloutWorker_w18... [2023-03-07 16:12:07,156][214139] Loop rollout_proc24_evt_loop terminating... [2023-03-07 16:12:07,155][213773] Stopping RolloutWorker_w2... [2023-03-07 16:12:07,156][213970] Stopping RolloutWorker_w16... [2023-03-07 16:12:07,156][214074] Stopping RolloutWorker_w23... [2023-03-07 16:12:07,156][214170] Loop rollout_proc25_evt_loop terminating... [2023-03-07 16:12:07,156][213936] Stopping RolloutWorker_w7... [2023-03-07 16:12:07,156][214206] Loop rollout_proc29_evt_loop terminating... [2023-03-07 16:12:07,156][214072] Stopping RolloutWorker_w20... [2023-03-07 16:12:07,156][214197] Loop rollout_proc26_evt_loop terminating... [2023-03-07 16:12:07,156][214254] Loop rollout_proc31_evt_loop terminating... [2023-03-07 16:12:07,156][213937] Loop rollout_proc21_evt_loop terminating... [2023-03-07 16:12:07,156][213775] Stopping RolloutWorker_w3... [2023-03-07 16:12:07,155][213445] Component RolloutWorker_w11 stopped! [2023-03-07 16:12:07,156][213774] Stopping RolloutWorker_w0... [2023-03-07 16:12:07,156][213807] Loop rollout_proc4_evt_loop terminating... [2023-03-07 16:12:07,156][214205] Stopping RolloutWorker_w28... [2023-03-07 16:12:07,156][214071] Stopping RolloutWorker_w12... [2023-03-07 16:12:07,156][213970] Loop rollout_proc16_evt_loop terminating... [2023-03-07 16:12:07,156][213773] Loop rollout_proc2_evt_loop terminating... [2023-03-07 16:12:07,156][213935] Loop rollout_proc18_evt_loop terminating... [2023-03-07 16:12:07,156][214074] Loop rollout_proc23_evt_loop terminating... [2023-03-07 16:12:07,156][214072] Loop rollout_proc20_evt_loop terminating... [2023-03-07 16:12:07,156][213936] Loop rollout_proc7_evt_loop terminating... [2023-03-07 16:12:07,156][213775] Loop rollout_proc3_evt_loop terminating... [2023-03-07 16:12:07,156][213774] Loop rollout_proc0_evt_loop terminating... [2023-03-07 16:12:07,156][214205] Loop rollout_proc28_evt_loop terminating... [2023-03-07 16:12:07,156][214071] Loop rollout_proc12_evt_loop terminating... [2023-03-07 16:12:07,156][213445] Component RolloutWorker_w5 stopped! [2023-03-07 16:12:07,156][213969] Stopping RolloutWorker_w9... [2023-03-07 16:12:07,157][213969] Loop rollout_proc9_evt_loop terminating... [2023-03-07 16:12:07,157][213445] Component RolloutWorker_w17 stopped! [2023-03-07 16:12:07,157][213772] Stopping RolloutWorker_w1... [2023-03-07 16:12:07,157][213445] Component RolloutWorker_w10 stopped! [2023-03-07 16:12:07,158][213772] Loop rollout_proc1_evt_loop terminating... [2023-03-07 16:12:07,158][213445] Component RolloutWorker_w22 stopped! [2023-03-07 16:12:07,158][213445] Component RolloutWorker_w27 stopped! [2023-03-07 16:12:07,159][213445] Component Batcher_0 stopped! [2023-03-07 16:12:07,159][213445] Component RolloutWorker_w14 stopped! [2023-03-07 16:12:07,160][213445] Component RolloutWorker_w8 stopped! [2023-03-07 16:12:07,160][213445] Component RolloutWorker_w15 stopped! [2023-03-07 16:12:07,161][213445] Component RolloutWorker_w30 stopped! [2023-03-07 16:12:07,162][213445] Component RolloutWorker_w24 stopped! [2023-03-07 16:12:07,162][213445] Component RolloutWorker_w29 stopped! [2023-03-07 16:12:07,163][213445] Component RolloutWorker_w31 stopped! [2023-03-07 16:12:07,163][214070] Stopping RolloutWorker_w19... [2023-03-07 16:12:07,163][213445] Component RolloutWorker_w26 stopped! [2023-03-07 16:12:07,164][214070] Loop rollout_proc19_evt_loop terminating... [2023-03-07 16:12:07,164][213445] Component RolloutWorker_w21 stopped! [2023-03-07 16:12:07,164][213445] Component RolloutWorker_w25 stopped! [2023-03-07 16:12:07,165][213445] Component RolloutWorker_w4 stopped! [2023-03-07 16:12:07,165][213445] Component RolloutWorker_w2 stopped! [2023-03-07 16:12:07,165][213445] Component RolloutWorker_w18 stopped! [2023-03-07 16:12:07,166][213445] Component RolloutWorker_w16 stopped! [2023-03-07 16:12:07,166][213445] Component RolloutWorker_w23 stopped! [2023-03-07 16:12:07,167][213445] Component RolloutWorker_w7 stopped! [2023-03-07 16:12:07,167][213445] Component RolloutWorker_w20 stopped! [2023-03-07 16:12:07,168][213445] Component RolloutWorker_w3 stopped! [2023-03-07 16:12:07,168][213445] Component RolloutWorker_w0 stopped! [2023-03-07 16:12:07,173][213934] Stopping RolloutWorker_w6... [2023-03-07 16:12:07,174][213934] Loop rollout_proc6_evt_loop terminating... [2023-03-07 16:12:07,169][213445] Component RolloutWorker_w28 stopped! [2023-03-07 16:12:07,176][213445] Component RolloutWorker_w12 stopped! [2023-03-07 16:12:07,177][213445] Component RolloutWorker_w9 stopped! [2023-03-07 16:12:07,177][213445] Component RolloutWorker_w1 stopped! [2023-03-07 16:12:07,178][213445] Component RolloutWorker_w19 stopped! [2023-03-07 16:12:07,178][213445] Component RolloutWorker_w13 stopped! [2023-03-07 16:12:07,178][213445] Component RolloutWorker_w6 stopped! [2023-03-07 16:12:07,156][213720] Loop batcher_evt_loop terminating... [2023-03-07 16:12:07,163][213971] Stopping RolloutWorker_w13... [2023-03-07 16:12:07,189][213971] Loop rollout_proc13_evt_loop terminating... [2023-03-07 16:12:07,227][213771] Weights refcount: 2 0 [2023-03-07 16:12:07,230][213771] Stopping InferenceWorker_p0-w0... [2023-03-07 16:12:07,231][213771] Loop inference_proc0-0_evt_loop terminating... [2023-03-07 16:12:07,231][213445] Component InferenceWorker_p0-w0 stopped! [2023-03-07 16:12:07,263][213720] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000096093_98399232.pth [2023-03-07 16:12:07,272][213720] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/dial-turn-v2/checkpoint_p0/checkpoint_000097658_100001792.pth... [2023-03-07 16:12:07,355][213720] Stopping LearnerWorker_p0... [2023-03-07 16:12:07,356][213720] Loop learner_proc0_evt_loop terminating... [2023-03-07 16:12:07,356][213445] Component LearnerWorker_p0 stopped! [2023-03-07 16:12:07,357][213445] Waiting for process learner_proc0 to stop... [2023-03-07 16:12:08,560][213445] Waiting for process inference_proc0-0 to join... [2023-03-07 16:12:08,561][213445] Waiting for process rollout_proc0 to join... [2023-03-07 16:12:08,561][213445] Waiting for process rollout_proc1 to join... [2023-03-07 16:12:08,561][213445] Waiting for process rollout_proc2 to join... [2023-03-07 16:12:08,562][213445] Waiting for process rollout_proc3 to join... [2023-03-07 16:12:08,562][213445] Waiting for process rollout_proc4 to join... [2023-03-07 16:12:08,562][213445] Waiting for process rollout_proc5 to join... [2023-03-07 16:12:08,562][213445] Waiting for process rollout_proc6 to join... [2023-03-07 16:12:08,562][213445] Waiting for process rollout_proc7 to join... [2023-03-07 16:12:08,563][213445] Waiting for process rollout_proc8 to join... [2023-03-07 16:12:08,563][213445] Waiting for process rollout_proc9 to join... [2023-03-07 16:12:08,563][213445] Waiting for process rollout_proc10 to join... [2023-03-07 16:12:08,563][213445] Waiting for process rollout_proc11 to join... [2023-03-07 16:12:08,564][213445] Waiting for process rollout_proc12 to join... [2023-03-07 16:12:08,564][213445] Waiting for process rollout_proc13 to join... [2023-03-07 16:12:08,564][213445] Waiting for process rollout_proc14 to join... [2023-03-07 16:12:08,564][213445] Waiting for process rollout_proc15 to join... [2023-03-07 16:12:08,564][213445] Waiting for process rollout_proc16 to join... [2023-03-07 16:12:08,565][213445] Waiting for process rollout_proc17 to join... [2023-03-07 16:12:08,565][213445] Waiting for process rollout_proc18 to join... [2023-03-07 16:12:08,565][213445] Waiting for process rollout_proc19 to join... [2023-03-07 16:12:08,565][213445] Waiting for process rollout_proc20 to join... [2023-03-07 16:12:08,566][213445] Waiting for process rollout_proc21 to join... [2023-03-07 16:12:08,566][213445] Waiting for process rollout_proc22 to join... [2023-03-07 16:12:08,566][213445] Waiting for process rollout_proc23 to join... [2023-03-07 16:12:08,566][213445] Waiting for process rollout_proc24 to join... [2023-03-07 16:12:08,567][213445] Waiting for process rollout_proc25 to join... [2023-03-07 16:12:08,567][213445] Waiting for process rollout_proc26 to join... [2023-03-07 16:12:08,567][213445] Waiting for process rollout_proc27 to join... [2023-03-07 16:12:08,567][213445] Waiting for process rollout_proc28 to join... [2023-03-07 16:12:08,567][213445] Waiting for process rollout_proc29 to join... [2023-03-07 16:12:08,568][213445] Waiting for process rollout_proc30 to join... [2023-03-07 16:12:08,568][213445] Waiting for process rollout_proc31 to join... [2023-03-07 16:12:08,568][213445] Batcher 0 profile tree view: batching: 888.1139, releasing_batches: 1.5390 [2023-03-07 16:12:08,568][213445] InferenceWorker_p0-w0 profile tree view: wait_policy: 0.0001 wait_policy_total: 234.6937 update_model: 135.4663 weight_update: 0.0007 one_step: 0.0115 handle_policy_step: 6804.1570 deserialize: 214.4534, stack: 37.7163, obs_to_device_normalize: 1238.2897, forward: 2951.4912, send_messages: 1405.6163 prepare_outputs: 692.8265 to_cpu: 349.8472 [2023-03-07 16:12:08,568][213445] Learner 0 profile tree view: misc: 0.6065, prepare_batch: 449.2238 train: 953.2285 epoch_init: 0.3941, minibatch_init: 0.4203, losses_postprocess: 28.0509, kl_divergence: 36.1181, after_optimizer: 77.1502 calculate_losses: 320.5402 losses_init: 0.2366, forward_head: 18.2265, bptt_initial: 115.9098, tail: 63.0264, advantages_returns: 7.8514, losses: 29.6915 bptt: 75.9614 bptt_forward_core: 73.3300 update: 466.9696 clip: 58.5481 [2023-03-07 16:12:08,569][213445] RolloutWorker_w0 profile tree view: wait_for_trajectories: 4.4316, enqueue_policy_requests: 223.7074, env_step: 2898.3658, overhead: 209.5131, complete_rollouts: 11.6927 save_policy_outputs: 276.3541 split_output_tensors: 134.7446 [2023-03-07 16:12:08,569][213445] RolloutWorker_w31 profile tree view: wait_for_trajectories: 4.6346, enqueue_policy_requests: 226.1753, env_step: 2981.6888, overhead: 216.9098, complete_rollouts: 12.0426 save_policy_outputs: 284.4959 split_output_tensors: 139.7739 [2023-03-07 16:12:08,569][213445] Loop Runner_EvtLoop terminating... [2023-03-07 16:12:08,569][213445] Runner profile tree view: main_loop: 7559.5412 [2023-03-07 16:12:08,570][213445] Collected {0: 100001792}, FPS: 13228.6