coffee-pull-v2 / sf_log.txt
qgallouedec HF staff
Upload folder using huggingface_hub
e8c7635
[2023-07-16 20:56:12,575][233173] Saving configuration to /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/config.json...
[2023-07-16 20:56:12,592][233173] Rollout worker 0 uses device cpu
[2023-07-16 20:56:12,592][233173] Rollout worker 1 uses device cpu
[2023-07-16 20:56:12,592][233173] Rollout worker 2 uses device cpu
[2023-07-16 20:56:12,592][233173] Rollout worker 3 uses device cpu
[2023-07-16 20:56:12,592][233173] Rollout worker 4 uses device cpu
[2023-07-16 20:56:12,592][233173] Rollout worker 5 uses device cpu
[2023-07-16 20:56:12,592][233173] Rollout worker 6 uses device cpu
[2023-07-16 20:56:12,593][233173] Rollout worker 7 uses device cpu
[2023-07-16 20:56:12,593][233173] In synchronous mode, we only accumulate one batch. Setting num_batches_to_accumulate to 1
[2023-07-16 20:56:12,603][233173] InferenceWorker_p0-w0: min num requests: 2
[2023-07-16 20:56:12,620][233173] Starting all processes...
[2023-07-16 20:56:12,621][233173] Starting process learner_proc0
[2023-07-16 20:56:12,670][233173] Starting all processes...
[2023-07-16 20:56:12,715][233173] Starting process inference_proc0-0
[2023-07-16 20:56:12,715][233173] Starting process rollout_proc0
[2023-07-16 20:56:12,715][233173] Starting process rollout_proc1
[2023-07-16 20:56:12,715][233173] Starting process rollout_proc2
[2023-07-16 20:56:12,715][233173] Starting process rollout_proc3
[2023-07-16 20:56:12,715][233173] Starting process rollout_proc4
[2023-07-16 20:56:12,715][233173] Starting process rollout_proc5
[2023-07-16 20:56:12,717][233173] Starting process rollout_proc6
[2023-07-16 20:56:12,717][233173] Starting process rollout_proc7
[2023-07-16 20:56:14,470][233414] Starting seed is not provided
[2023-07-16 20:56:14,470][233414] Initializing actor-critic model on device cpu
[2023-07-16 20:56:14,470][233414] RunningMeanStd input shape: (39,)
[2023-07-16 20:56:14,471][233414] RunningMeanStd input shape: (1,)
[2023-07-16 20:56:14,528][233414] Created Actor Critic model with architecture:
[2023-07-16 20:56:14,528][233414] ActorCriticSharedWeights(
  (obs_normalizer): ObservationNormalizer(
    (running_mean_std): RunningMeanStdDictInPlace(
      (running_mean_std): ModuleDict(
        (obs): RunningMeanStdInPlace()
      )
    )
  )
  (returns_normalizer): RecursiveScriptModule(original_name=RunningMeanStdInPlace)
  (encoder): MultiInputEncoder(
    (encoders): ModuleDict(
      (obs): MlpEncoder(
        (mlp_head): RecursiveScriptModule(
          original_name=Sequential
          (0): RecursiveScriptModule(original_name=Linear)
          (1): RecursiveScriptModule(original_name=Tanh)
          (2): RecursiveScriptModule(original_name=Linear)
          (3): RecursiveScriptModule(original_name=Tanh)
        )
      )
    )
  )
  (core): ModelCoreIdentity()
  (decoder): MlpDecoder(
    (mlp): Identity()
  )
  (critic_linear): Linear(in_features=64, out_features=1, bias=True)
  (action_parameterization): ActionParameterizationContinuousNonAdaptiveStddev(
    (distribution_linear): Linear(in_features=64, out_features=4, bias=True)
  )
)
[2023-07-16 20:56:14,551][233464] Worker 5 uses CPU cores [20, 21, 22, 23]
[2023-07-16 20:56:14,677][233459] Worker 0 uses CPU cores [0, 1, 2, 3]
[2023-07-16 20:56:14,705][233497] Worker 7 uses CPU cores [28, 29, 30, 31]
[2023-07-16 20:56:14,838][233414] Using optimizer <class 'torch.optim.adam.Adam'>
[2023-07-16 20:56:14,839][233414] No checkpoints found
[2023-07-16 20:56:14,839][233414] Did not load from checkpoint, starting from scratch!
[2023-07-16 20:56:14,839][233414] Initialized policy 0 weights for model version 0
[2023-07-16 20:56:14,840][233414] LearnerWorker_p0 finished initialization!
[2023-07-16 20:56:14,841][233458] RunningMeanStd input shape: (39,)
[2023-07-16 20:56:14,842][233458] RunningMeanStd input shape: (1,)
[2023-07-16 20:56:14,926][233173] Inference worker 0-0 is ready!
[2023-07-16 20:56:14,927][233173] All inference workers are ready! Signal rollout workers to start!
[2023-07-16 20:56:14,937][233463] Worker 4 uses CPU cores [16, 17, 18, 19]
[2023-07-16 20:56:14,988][233461] Worker 2 uses CPU cores [8, 9, 10, 11]
[2023-07-16 20:56:15,031][233460] Worker 1 uses CPU cores [4, 5, 6, 7]
[2023-07-16 20:56:15,121][233462] Worker 3 uses CPU cores [12, 13, 14, 15]
[2023-07-16 20:56:15,209][233496] Worker 6 uses CPU cores [24, 25, 26, 27]
[2023-07-16 20:56:15,579][233173] Fps is (10 sec: nan, 60 sec: nan, 300 sec: nan). Total num frames: 0. Throughput: 0: nan. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2023-07-16 20:56:16,168][233497] Decorrelating experience for 0 frames...
[2023-07-16 20:56:16,173][233459] Decorrelating experience for 0 frames...
[2023-07-16 20:56:16,176][233497] Decorrelating experience for 64 frames...
[2023-07-16 20:56:16,181][233459] Decorrelating experience for 64 frames...
[2023-07-16 20:56:16,183][233464] Decorrelating experience for 0 frames...
[2023-07-16 20:56:16,191][233464] Decorrelating experience for 64 frames...
[2023-07-16 20:56:16,207][233463] Decorrelating experience for 0 frames...
[2023-07-16 20:56:16,212][233497] Decorrelating experience for 128 frames...
[2023-07-16 20:56:16,215][233463] Decorrelating experience for 64 frames...
[2023-07-16 20:56:16,217][233459] Decorrelating experience for 128 frames...
[2023-07-16 20:56:16,226][233464] Decorrelating experience for 128 frames...
[2023-07-16 20:56:16,251][233463] Decorrelating experience for 128 frames...
[2023-07-16 20:56:16,282][233497] Decorrelating experience for 192 frames...
[2023-07-16 20:56:16,288][233459] Decorrelating experience for 192 frames...
[2023-07-16 20:56:16,296][233464] Decorrelating experience for 192 frames...
[2023-07-16 20:56:16,321][233463] Decorrelating experience for 192 frames...
[2023-07-16 20:56:16,327][233460] Decorrelating experience for 0 frames...
[2023-07-16 20:56:16,331][233461] Decorrelating experience for 0 frames...
[2023-07-16 20:56:16,335][233460] Decorrelating experience for 64 frames...
[2023-07-16 20:56:16,339][233461] Decorrelating experience for 64 frames...
[2023-07-16 20:56:16,370][233460] Decorrelating experience for 128 frames...
[2023-07-16 20:56:16,375][233461] Decorrelating experience for 128 frames...
[2023-07-16 20:56:16,392][233462] Decorrelating experience for 0 frames...
[2023-07-16 20:56:16,400][233462] Decorrelating experience for 64 frames...
[2023-07-16 20:56:16,436][233462] Decorrelating experience for 128 frames...
[2023-07-16 20:56:16,440][233460] Decorrelating experience for 192 frames...
[2023-07-16 20:56:16,445][233461] Decorrelating experience for 192 frames...
[2023-07-16 20:56:16,506][233462] Decorrelating experience for 192 frames...
[2023-07-16 20:56:16,513][233496] Decorrelating experience for 0 frames...
[2023-07-16 20:56:16,521][233496] Decorrelating experience for 64 frames...
[2023-07-16 20:56:16,557][233496] Decorrelating experience for 128 frames...
[2023-07-16 20:56:16,628][233496] Decorrelating experience for 192 frames...
[2023-07-16 20:56:17,528][233497] Decorrelating experience for 256 frames...
[2023-07-16 20:56:17,532][233459] Decorrelating experience for 256 frames...
[2023-07-16 20:56:17,533][233464] Decorrelating experience for 256 frames...
[2023-07-16 20:56:17,554][233463] Decorrelating experience for 256 frames...
[2023-07-16 20:56:17,659][233497] Decorrelating experience for 320 frames...
[2023-07-16 20:56:17,664][233464] Decorrelating experience for 320 frames...
[2023-07-16 20:56:17,665][233459] Decorrelating experience for 320 frames...
[2023-07-16 20:56:17,683][233460] Decorrelating experience for 256 frames...
[2023-07-16 20:56:17,686][233463] Decorrelating experience for 320 frames...
[2023-07-16 20:56:17,688][233461] Decorrelating experience for 256 frames...
[2023-07-16 20:56:17,744][233462] Decorrelating experience for 256 frames...
[2023-07-16 20:56:17,813][233460] Decorrelating experience for 320 frames...
[2023-07-16 20:56:17,819][233461] Decorrelating experience for 320 frames...
[2023-07-16 20:56:17,825][233497] Decorrelating experience for 384 frames...
[2023-07-16 20:56:17,827][233464] Decorrelating experience for 384 frames...
[2023-07-16 20:56:17,832][233459] Decorrelating experience for 384 frames...
[2023-07-16 20:56:17,852][233463] Decorrelating experience for 384 frames...
[2023-07-16 20:56:17,871][233496] Decorrelating experience for 256 frames...
[2023-07-16 20:56:17,874][233462] Decorrelating experience for 320 frames...
[2023-07-16 20:56:17,978][233460] Decorrelating experience for 384 frames...
[2023-07-16 20:56:17,984][233461] Decorrelating experience for 384 frames...
[2023-07-16 20:56:18,003][233496] Decorrelating experience for 320 frames...
[2023-07-16 20:56:18,017][233497] Decorrelating experience for 448 frames...
[2023-07-16 20:56:18,018][233464] Decorrelating experience for 448 frames...
[2023-07-16 20:56:18,026][233459] Decorrelating experience for 448 frames...
[2023-07-16 20:56:18,039][233462] Decorrelating experience for 384 frames...
[2023-07-16 20:56:18,045][233463] Decorrelating experience for 448 frames...
[2023-07-16 20:56:18,169][233460] Decorrelating experience for 448 frames...
[2023-07-16 20:56:18,170][233496] Decorrelating experience for 384 frames...
[2023-07-16 20:56:18,174][233461] Decorrelating experience for 448 frames...
[2023-07-16 20:56:18,228][233462] Decorrelating experience for 448 frames...
[2023-07-16 20:56:18,363][233496] Decorrelating experience for 448 frames...
[2023-07-16 20:56:20,579][233173] Fps is (10 sec: 3276.8, 60 sec: 3276.8, 300 sec: 3276.8). Total num frames: 16384. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 20:56:20,579][233173] Avg episode reward: [(0, '3.785')]
[2023-07-16 20:56:22,477][233458] Updated weights for policy 0, policy_version 80 (0.0005)
[2023-07-16 20:56:25,579][233173] Fps is (10 sec: 7372.7, 60 sec: 7372.7, 300 sec: 7372.7). Total num frames: 73728. Throughput: 0: 5740.7. Samples: 57408. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 20:56:25,580][233173] Avg episode reward: [(0, '9.304')]
[2023-07-16 20:56:25,583][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000000144_73728.pth...
[2023-07-16 20:56:26,241][233458] Updated weights for policy 0, policy_version 160 (0.0006)
[2023-07-16 20:56:29,908][233458] Updated weights for policy 0, policy_version 240 (0.0005)
[2023-07-16 20:56:30,579][233173] Fps is (10 sec: 11059.2, 60 sec: 8465.1, 300 sec: 8465.1). Total num frames: 126976. Throughput: 0: 8213.9. Samples: 123208. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0)
[2023-07-16 20:56:30,581][233173] Avg episode reward: [(0, '14.479')]
[2023-07-16 20:56:30,582][233414] Saving new best policy, reward=14.479!
[2023-07-16 20:56:32,599][233173] Heartbeat connected on Batcher_0
[2023-07-16 20:56:32,601][233173] Heartbeat connected on LearnerWorker_p0
[2023-07-16 20:56:32,604][233173] Heartbeat connected on InferenceWorker_p0-w0
[2023-07-16 20:56:32,609][233173] Heartbeat connected on RolloutWorker_w0
[2023-07-16 20:56:32,612][233173] Heartbeat connected on RolloutWorker_w1
[2023-07-16 20:56:32,612][233173] Heartbeat connected on RolloutWorker_w2
[2023-07-16 20:56:32,614][233173] Heartbeat connected on RolloutWorker_w3
[2023-07-16 20:56:32,617][233173] Heartbeat connected on RolloutWorker_w4
[2023-07-16 20:56:32,626][233173] Heartbeat connected on RolloutWorker_w5
[2023-07-16 20:56:32,627][233173] Heartbeat connected on RolloutWorker_w6
[2023-07-16 20:56:32,630][233173] Heartbeat connected on RolloutWorker_w7
[2023-07-16 20:56:33,672][233458] Updated weights for policy 0, policy_version 320 (0.0005)
[2023-07-16 20:56:35,579][233173] Fps is (10 sec: 10649.8, 60 sec: 9011.2, 300 sec: 9011.2). Total num frames: 180224. Throughput: 0: 7832.0. Samples: 156640. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 20:56:35,579][233173] Avg episode reward: [(0, '14.272')]
[2023-07-16 20:56:37,585][233458] Updated weights for policy 0, policy_version 400 (0.0006)
[2023-07-16 20:56:40,579][233173] Fps is (10 sec: 11059.1, 60 sec: 9502.7, 300 sec: 9502.7). Total num frames: 237568. Throughput: 0: 8798.7. Samples: 219968. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0)
[2023-07-16 20:56:40,580][233173] Avg episode reward: [(0, '34.643')]
[2023-07-16 20:56:40,583][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000000464_237568.pth...
[2023-07-16 20:56:40,585][233414] Saving new best policy, reward=34.643!
[2023-07-16 20:56:41,323][233458] Updated weights for policy 0, policy_version 480 (0.0005)
[2023-07-16 20:56:45,051][233458] Updated weights for policy 0, policy_version 560 (0.0005)
[2023-07-16 20:56:45,579][233173] Fps is (10 sec: 11059.2, 60 sec: 9693.9, 300 sec: 9693.9). Total num frames: 290816. Throughput: 0: 9536.4. Samples: 286092. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 20:56:45,580][233173] Avg episode reward: [(0, '56.074')]
[2023-07-16 20:56:45,580][233414] Saving new best policy, reward=56.074!
[2023-07-16 20:56:48,751][233458] Updated weights for policy 0, policy_version 640 (0.0005)
[2023-07-16 20:56:50,579][233173] Fps is (10 sec: 10649.7, 60 sec: 9830.4, 300 sec: 9830.4). Total num frames: 344064. Throughput: 0: 9109.7. Samples: 318840. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 20:56:50,580][233173] Avg episode reward: [(0, '69.593')]
[2023-07-16 20:56:50,580][233414] Saving new best policy, reward=69.593!
[2023-07-16 20:56:52,496][233458] Updated weights for policy 0, policy_version 720 (0.0005)
[2023-07-16 20:56:55,579][233173] Fps is (10 sec: 11059.1, 60 sec: 10035.2, 300 sec: 10035.2). Total num frames: 401408. Throughput: 0: 9627.4. Samples: 385096. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0)
[2023-07-16 20:56:55,580][233173] Avg episode reward: [(0, '82.166')]
[2023-07-16 20:56:55,583][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000000784_401408.pth...
[2023-07-16 20:56:55,586][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000000144_73728.pth
[2023-07-16 20:56:55,587][233414] Saving new best policy, reward=82.166!
[2023-07-16 20:56:56,245][233458] Updated weights for policy 0, policy_version 800 (0.0005)
[2023-07-16 20:57:00,159][233458] Updated weights for policy 0, policy_version 880 (0.0006)
[2023-07-16 20:57:00,579][233173] Fps is (10 sec: 11059.2, 60 sec: 10103.5, 300 sec: 10103.5). Total num frames: 454656. Throughput: 0: 9956.7. Samples: 448052. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0)
[2023-07-16 20:57:00,580][233173] Avg episode reward: [(0, '86.589')]
[2023-07-16 20:57:00,580][233414] Saving new best policy, reward=86.589!
[2023-07-16 20:57:03,737][233458] Updated weights for policy 0, policy_version 960 (0.0004)
[2023-07-16 20:57:05,579][233173] Fps is (10 sec: 11059.3, 60 sec: 10240.0, 300 sec: 10240.0). Total num frames: 512000. Throughput: 0: 10730.4. Samples: 482868. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0)
[2023-07-16 20:57:05,579][233173] Avg episode reward: [(0, '83.769')]
[2023-07-16 20:57:07,388][233458] Updated weights for policy 0, policy_version 1040 (0.0005)
[2023-07-16 20:57:10,579][233173] Fps is (10 sec: 11059.0, 60 sec: 10277.2, 300 sec: 10277.2). Total num frames: 565248. Throughput: 0: 10953.4. Samples: 550312. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 20:57:10,580][233173] Avg episode reward: [(0, '94.776')]
[2023-07-16 20:57:10,583][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000001104_565248.pth...
[2023-07-16 20:57:10,586][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000000464_237568.pth
[2023-07-16 20:57:10,586][233414] Saving new best policy, reward=94.776!
[2023-07-16 20:57:11,026][233458] Updated weights for policy 0, policy_version 1120 (0.0005)
[2023-07-16 20:57:14,623][233458] Updated weights for policy 0, policy_version 1200 (0.0005)
[2023-07-16 20:57:15,579][233173] Fps is (10 sec: 11059.2, 60 sec: 10376.5, 300 sec: 10376.5). Total num frames: 622592. Throughput: 0: 11006.0. Samples: 618480. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 20:57:15,580][233173] Avg episode reward: [(0, '93.291')]
[2023-07-16 20:57:18,273][233458] Updated weights for policy 0, policy_version 1280 (0.0005)
[2023-07-16 20:57:20,579][233173] Fps is (10 sec: 11469.0, 60 sec: 11059.2, 300 sec: 10460.6). Total num frames: 679936. Throughput: 0: 10993.4. Samples: 651344. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 20:57:20,580][233173] Avg episode reward: [(0, '92.621')]
[2023-07-16 20:57:21,915][233458] Updated weights for policy 0, policy_version 1360 (0.0005)
[2023-07-16 20:57:25,579][233173] Fps is (10 sec: 11059.1, 60 sec: 10991.0, 300 sec: 10474.1). Total num frames: 733184. Throughput: 0: 11103.3. Samples: 719616. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 20:57:25,580][233173] Avg episode reward: [(0, '91.748')]
[2023-07-16 20:57:25,592][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000001440_737280.pth...
[2023-07-16 20:57:25,592][233458] Updated weights for policy 0, policy_version 1440 (0.0005)
[2023-07-16 20:57:25,594][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000000784_401408.pth
[2023-07-16 20:57:29,504][233458] Updated weights for policy 0, policy_version 1520 (0.0005)
[2023-07-16 20:57:30,579][233173] Fps is (10 sec: 10649.6, 60 sec: 10990.9, 300 sec: 10485.8). Total num frames: 786432. Throughput: 0: 11043.8. Samples: 783064. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 20:57:30,580][233173] Avg episode reward: [(0, '86.924')]
[2023-07-16 20:57:33,450][233458] Updated weights for policy 0, policy_version 1600 (0.0006)
[2023-07-16 20:57:35,579][233173] Fps is (10 sec: 10649.6, 60 sec: 10990.9, 300 sec: 10496.0). Total num frames: 839680. Throughput: 0: 11012.5. Samples: 814404. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 20:57:35,580][233173] Avg episode reward: [(0, '95.575')]
[2023-07-16 20:57:35,581][233414] Saving new best policy, reward=95.575!
[2023-07-16 20:57:37,122][233458] Updated weights for policy 0, policy_version 1680 (0.0005)
[2023-07-16 20:57:40,579][233173] Fps is (10 sec: 11059.1, 60 sec: 10990.9, 300 sec: 10553.2). Total num frames: 897024. Throughput: 0: 11013.7. Samples: 880712. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0)
[2023-07-16 20:57:40,579][233173] Avg episode reward: [(0, '92.284')]
[2023-07-16 20:57:40,582][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000001752_897024.pth...
[2023-07-16 20:57:40,585][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000001104_565248.pth
[2023-07-16 20:57:40,925][233458] Updated weights for policy 0, policy_version 1760 (0.0005)
[2023-07-16 20:57:44,723][233458] Updated weights for policy 0, policy_version 1840 (0.0006)
[2023-07-16 20:57:45,579][233173] Fps is (10 sec: 11059.2, 60 sec: 10990.9, 300 sec: 10558.6). Total num frames: 950272. Throughput: 0: 11040.4. Samples: 944872. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0)
[2023-07-16 20:57:45,579][233173] Avg episode reward: [(0, '98.425')]
[2023-07-16 20:57:45,580][233414] Saving new best policy, reward=98.425!
[2023-07-16 20:57:48,494][233458] Updated weights for policy 0, policy_version 1920 (0.0005)
[2023-07-16 20:57:50,579][233173] Fps is (10 sec: 10649.7, 60 sec: 10990.9, 300 sec: 10563.4). Total num frames: 1003520. Throughput: 0: 11002.0. Samples: 977956. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 20:57:50,579][233173] Avg episode reward: [(0, '93.092')]
[2023-07-16 20:57:52,351][233458] Updated weights for policy 0, policy_version 2000 (0.0006)
[2023-07-16 20:57:55,579][233173] Fps is (10 sec: 10649.6, 60 sec: 10922.7, 300 sec: 10567.7). Total num frames: 1056768. Throughput: 0: 10902.9. Samples: 1040940. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0)
[2023-07-16 20:57:55,579][233173] Avg episode reward: [(0, '95.074')]
[2023-07-16 20:57:55,582][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000002064_1056768.pth...
[2023-07-16 20:57:55,585][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000001440_737280.pth
[2023-07-16 20:57:56,211][233458] Updated weights for policy 0, policy_version 2080 (0.0006)
[2023-07-16 20:58:00,050][233458] Updated weights for policy 0, policy_version 2160 (0.0005)
[2023-07-16 20:58:00,579][233173] Fps is (10 sec: 10649.6, 60 sec: 10922.7, 300 sec: 10571.6). Total num frames: 1110016. Throughput: 0: 10809.9. Samples: 1104924. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 20:58:00,579][233173] Avg episode reward: [(0, '93.880')]
[2023-07-16 20:58:03,735][233458] Updated weights for policy 0, policy_version 2240 (0.0004)
[2023-07-16 20:58:05,579][233173] Fps is (10 sec: 10649.6, 60 sec: 10854.4, 300 sec: 10575.1). Total num frames: 1163264. Throughput: 0: 10829.6. Samples: 1138676. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 20:58:05,579][233173] Avg episode reward: [(0, '98.634')]
[2023-07-16 20:58:05,580][233414] Saving new best policy, reward=98.634!
[2023-07-16 20:58:07,669][233458] Updated weights for policy 0, policy_version 2320 (0.0006)
[2023-07-16 20:58:10,579][233173] Fps is (10 sec: 10649.5, 60 sec: 10854.4, 300 sec: 10578.4). Total num frames: 1216512. Throughput: 0: 10697.2. Samples: 1200992. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 20:58:10,580][233173] Avg episode reward: [(0, '95.043')]
[2023-07-16 20:58:10,583][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000002376_1216512.pth...
[2023-07-16 20:58:10,586][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000001752_897024.pth
[2023-07-16 20:58:11,572][233458] Updated weights for policy 0, policy_version 2400 (0.0006)
[2023-07-16 20:58:15,425][233458] Updated weights for policy 0, policy_version 2480 (0.0006)
[2023-07-16 20:58:15,579][233173] Fps is (10 sec: 10649.6, 60 sec: 10786.1, 300 sec: 10581.3). Total num frames: 1269760. Throughput: 0: 10707.6. Samples: 1264908. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0)
[2023-07-16 20:58:15,579][233173] Avg episode reward: [(0, '99.133')]
[2023-07-16 20:58:15,580][233414] Saving new best policy, reward=99.133!
[2023-07-16 20:58:19,227][233458] Updated weights for policy 0, policy_version 2560 (0.0006)
[2023-07-16 20:58:20,579][233173] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10584.1). Total num frames: 1323008. Throughput: 0: 10739.9. Samples: 1297700. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 20:58:20,579][233173] Avg episode reward: [(0, '97.400')]
[2023-07-16 20:58:22,997][233458] Updated weights for policy 0, policy_version 2640 (0.0005)
[2023-07-16 20:58:25,579][233173] Fps is (10 sec: 11059.2, 60 sec: 10786.1, 300 sec: 10618.1). Total num frames: 1380352. Throughput: 0: 10702.8. Samples: 1362336. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 20:58:25,580][233173] Avg episode reward: [(0, '95.703')]
[2023-07-16 20:58:25,582][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000002696_1380352.pth...
[2023-07-16 20:58:25,585][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000002064_1056768.pth
[2023-07-16 20:58:26,660][233458] Updated weights for policy 0, policy_version 2720 (0.0005)
[2023-07-16 20:58:30,544][233458] Updated weights for policy 0, policy_version 2800 (0.0005)
[2023-07-16 20:58:30,579][233173] Fps is (10 sec: 11059.2, 60 sec: 10786.1, 300 sec: 10619.3). Total num frames: 1433600. Throughput: 0: 10722.6. Samples: 1427388. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 20:58:30,579][233173] Avg episode reward: [(0, '99.745')]
[2023-07-16 20:58:30,580][233414] Saving new best policy, reward=99.745!
[2023-07-16 20:58:34,385][233458] Updated weights for policy 0, policy_version 2880 (0.0005)
[2023-07-16 20:58:35,579][233173] Fps is (10 sec: 10649.7, 60 sec: 10786.1, 300 sec: 10620.3). Total num frames: 1486848. Throughput: 0: 10687.5. Samples: 1458892. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 20:58:35,579][233173] Avg episode reward: [(0, '98.342')]
[2023-07-16 20:58:38,194][233458] Updated weights for policy 0, policy_version 2960 (0.0005)
[2023-07-16 20:58:40,579][233173] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10621.3). Total num frames: 1540096. Throughput: 0: 10729.7. Samples: 1523776. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 20:58:40,579][233173] Avg episode reward: [(0, '93.654')]
[2023-07-16 20:58:40,582][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000003008_1540096.pth...
[2023-07-16 20:58:40,585][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000002376_1216512.pth
[2023-07-16 20:58:42,005][233458] Updated weights for policy 0, policy_version 3040 (0.0005)
[2023-07-16 20:58:45,579][233173] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10622.3). Total num frames: 1593344. Throughput: 0: 10739.9. Samples: 1588220. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 20:58:45,579][233173] Avg episode reward: [(0, '102.542')]
[2023-07-16 20:58:45,580][233414] Saving new best policy, reward=102.542!
[2023-07-16 20:58:45,803][233458] Updated weights for policy 0, policy_version 3120 (0.0005)
[2023-07-16 20:58:49,439][233458] Updated weights for policy 0, policy_version 3200 (0.0004)
[2023-07-16 20:58:50,579][233173] Fps is (10 sec: 11059.3, 60 sec: 10786.1, 300 sec: 10649.6). Total num frames: 1650688. Throughput: 0: 10723.3. Samples: 1621224. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0)
[2023-07-16 20:58:50,579][233173] Avg episode reward: [(0, '100.332')]
[2023-07-16 20:58:53,248][233458] Updated weights for policy 0, policy_version 3280 (0.0005)
[2023-07-16 20:58:55,579][233173] Fps is (10 sec: 11059.2, 60 sec: 10786.1, 300 sec: 10649.6). Total num frames: 1703936. Throughput: 0: 10803.8. Samples: 1687160. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 20:58:55,579][233173] Avg episode reward: [(0, '101.109')]
[2023-07-16 20:58:55,583][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000003328_1703936.pth...
[2023-07-16 20:58:55,585][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000002696_1380352.pth
[2023-07-16 20:58:57,101][233458] Updated weights for policy 0, policy_version 3360 (0.0005)
[2023-07-16 20:59:00,579][233173] Fps is (10 sec: 10649.6, 60 sec: 10786.1, 300 sec: 10649.6). Total num frames: 1757184. Throughput: 0: 10802.8. Samples: 1751032. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0)
[2023-07-16 20:59:00,579][233173] Avg episode reward: [(0, '102.451')]
[2023-07-16 20:59:00,916][233458] Updated weights for policy 0, policy_version 3440 (0.0005)
[2023-07-16 20:59:04,703][233458] Updated weights for policy 0, policy_version 3520 (0.0005)
[2023-07-16 20:59:05,579][233173] Fps is (10 sec: 10649.6, 60 sec: 10786.1, 300 sec: 10649.6). Total num frames: 1810432. Throughput: 0: 10796.5. Samples: 1783540. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0)
[2023-07-16 20:59:05,579][233173] Avg episode reward: [(0, '104.801')]
[2023-07-16 20:59:05,580][233414] Saving new best policy, reward=104.801!
[2023-07-16 20:59:08,345][233458] Updated weights for policy 0, policy_version 3600 (0.0005)
[2023-07-16 20:59:10,579][233173] Fps is (10 sec: 11059.1, 60 sec: 10854.4, 300 sec: 10673.0). Total num frames: 1867776. Throughput: 0: 10840.0. Samples: 1850136. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 20:59:10,579][233173] Avg episode reward: [(0, '105.189')]
[2023-07-16 20:59:10,583][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000003648_1867776.pth...
[2023-07-16 20:59:10,586][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000003008_1540096.pth
[2023-07-16 20:59:10,586][233414] Saving new best policy, reward=105.189!
[2023-07-16 20:59:12,132][233458] Updated weights for policy 0, policy_version 3680 (0.0005)
[2023-07-16 20:59:15,579][233173] Fps is (10 sec: 10649.6, 60 sec: 10786.1, 300 sec: 10649.6). Total num frames: 1916928. Throughput: 0: 10804.1. Samples: 1913572. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 20:59:15,579][233173] Avg episode reward: [(0, '106.363')]
[2023-07-16 20:59:15,580][233414] Saving new best policy, reward=106.363!
[2023-07-16 20:59:16,037][233458] Updated weights for policy 0, policy_version 3760 (0.0006)
[2023-07-16 20:59:19,757][233458] Updated weights for policy 0, policy_version 3840 (0.0005)
[2023-07-16 20:59:20,579][233173] Fps is (10 sec: 10649.7, 60 sec: 10854.4, 300 sec: 10671.7). Total num frames: 1974272. Throughput: 0: 10817.2. Samples: 1945664. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0)
[2023-07-16 20:59:20,579][233173] Avg episode reward: [(0, '105.300')]
[2023-07-16 20:59:23,655][233458] Updated weights for policy 0, policy_version 3920 (0.0006)
[2023-07-16 20:59:25,579][233173] Fps is (10 sec: 11059.1, 60 sec: 10786.1, 300 sec: 10671.2). Total num frames: 2027520. Throughput: 0: 10820.0. Samples: 2010676. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0)
[2023-07-16 20:59:25,579][233173] Avg episode reward: [(0, '105.847')]
[2023-07-16 20:59:25,583][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000003960_2027520.pth...
[2023-07-16 20:59:25,586][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000003328_1703936.pth
[2023-07-16 20:59:27,304][233458] Updated weights for policy 0, policy_version 4000 (0.0005)
[2023-07-16 20:59:30,579][233173] Fps is (10 sec: 11059.1, 60 sec: 10854.4, 300 sec: 10691.6). Total num frames: 2084864. Throughput: 0: 10888.4. Samples: 2078200. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 20:59:30,579][233173] Avg episode reward: [(0, '105.973')]
[2023-07-16 20:59:30,937][233458] Updated weights for policy 0, policy_version 4080 (0.0005)
[2023-07-16 20:59:34,503][233458] Updated weights for policy 0, policy_version 4160 (0.0004)
[2023-07-16 20:59:35,579][233173] Fps is (10 sec: 11468.9, 60 sec: 10922.7, 300 sec: 10711.0). Total num frames: 2142208. Throughput: 0: 10915.0. Samples: 2112400. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0)
[2023-07-16 20:59:35,579][233173] Avg episode reward: [(0, '104.700')]
[2023-07-16 20:59:38,051][233458] Updated weights for policy 0, policy_version 4240 (0.0004)
[2023-07-16 20:59:40,579][233173] Fps is (10 sec: 11468.8, 60 sec: 10990.9, 300 sec: 10729.5). Total num frames: 2199552. Throughput: 0: 10979.5. Samples: 2181236. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 20:59:40,579][233173] Avg episode reward: [(0, '106.739')]
[2023-07-16 20:59:40,583][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000004296_2199552.pth...
[2023-07-16 20:59:40,586][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000003648_1867776.pth
[2023-07-16 20:59:40,586][233414] Saving new best policy, reward=106.739!
[2023-07-16 20:59:41,647][233458] Updated weights for policy 0, policy_version 4320 (0.0005)
[2023-07-16 20:59:45,216][233458] Updated weights for policy 0, policy_version 4400 (0.0004)
[2023-07-16 20:59:45,579][233173] Fps is (10 sec: 11059.2, 60 sec: 10990.9, 300 sec: 10727.6). Total num frames: 2252800. Throughput: 0: 11076.5. Samples: 2249476. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 20:59:45,579][233173] Avg episode reward: [(0, '106.752')]
[2023-07-16 20:59:45,582][233414] Saving new best policy, reward=106.752!
[2023-07-16 20:59:48,979][233458] Updated weights for policy 0, policy_version 4480 (0.0005)
[2023-07-16 20:59:50,579][233173] Fps is (10 sec: 11059.2, 60 sec: 10990.9, 300 sec: 10744.9). Total num frames: 2310144. Throughput: 0: 11102.6. Samples: 2283156. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 20:59:50,580][233173] Avg episode reward: [(0, '108.524')]
[2023-07-16 20:59:50,580][233414] Saving new best policy, reward=108.524!
[2023-07-16 20:59:52,719][233458] Updated weights for policy 0, policy_version 4560 (0.0005)
[2023-07-16 20:59:55,579][233173] Fps is (10 sec: 11468.7, 60 sec: 11059.2, 300 sec: 10761.3). Total num frames: 2367488. Throughput: 0: 11088.1. Samples: 2349100. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0)
[2023-07-16 20:59:55,579][233173] Avg episode reward: [(0, '109.163')]
[2023-07-16 20:59:55,583][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000004624_2367488.pth...
[2023-07-16 20:59:55,585][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000003960_2027520.pth
[2023-07-16 20:59:55,585][233414] Saving new best policy, reward=109.163!
[2023-07-16 20:59:56,318][233458] Updated weights for policy 0, policy_version 4640 (0.0005)
[2023-07-16 21:00:00,109][233458] Updated weights for policy 0, policy_version 4720 (0.0005)
[2023-07-16 21:00:00,579][233173] Fps is (10 sec: 11059.2, 60 sec: 11059.2, 300 sec: 10758.8). Total num frames: 2420736. Throughput: 0: 11147.4. Samples: 2415204. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:00:00,580][233173] Avg episode reward: [(0, '110.142')]
[2023-07-16 21:00:00,580][233414] Saving new best policy, reward=110.142!
[2023-07-16 21:00:03,697][233458] Updated weights for policy 0, policy_version 4800 (0.0005)
[2023-07-16 21:00:05,579][233173] Fps is (10 sec: 11059.3, 60 sec: 11127.5, 300 sec: 10774.3). Total num frames: 2478080. Throughput: 0: 11194.5. Samples: 2449416. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:00:05,580][233173] Avg episode reward: [(0, '111.022')]
[2023-07-16 21:00:05,580][233414] Saving new best policy, reward=111.022!
[2023-07-16 21:00:07,317][233458] Updated weights for policy 0, policy_version 4880 (0.0005)
[2023-07-16 21:00:10,579][233173] Fps is (10 sec: 11059.1, 60 sec: 11059.2, 300 sec: 10771.6). Total num frames: 2531328. Throughput: 0: 11234.3. Samples: 2516220. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0)
[2023-07-16 21:00:10,579][233173] Avg episode reward: [(0, '109.604')]
[2023-07-16 21:00:10,583][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000004944_2531328.pth...
[2023-07-16 21:00:10,586][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000004296_2199552.pth
[2023-07-16 21:00:11,080][233458] Updated weights for policy 0, policy_version 4960 (0.0005)
[2023-07-16 21:00:14,807][233458] Updated weights for policy 0, policy_version 5040 (0.0005)
[2023-07-16 21:00:15,579][233173] Fps is (10 sec: 11059.2, 60 sec: 11195.7, 300 sec: 10786.1). Total num frames: 2588672. Throughput: 0: 11198.0. Samples: 2582108. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0)
[2023-07-16 21:00:15,580][233173] Avg episode reward: [(0, '108.958')]
[2023-07-16 21:00:18,567][233458] Updated weights for policy 0, policy_version 5120 (0.0005)
[2023-07-16 21:00:20,579][233173] Fps is (10 sec: 11059.2, 60 sec: 11127.5, 300 sec: 10783.3). Total num frames: 2641920. Throughput: 0: 11152.4. Samples: 2614260. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0)
[2023-07-16 21:00:20,580][233173] Avg episode reward: [(0, '110.690')]
[2023-07-16 21:00:22,285][233458] Updated weights for policy 0, policy_version 5200 (0.0005)
[2023-07-16 21:00:25,579][233173] Fps is (10 sec: 10649.5, 60 sec: 11127.5, 300 sec: 10780.7). Total num frames: 2695168. Throughput: 0: 11099.5. Samples: 2680716. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0)
[2023-07-16 21:00:25,580][233173] Avg episode reward: [(0, '111.728')]
[2023-07-16 21:00:25,583][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000005264_2695168.pth...
[2023-07-16 21:00:25,586][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000004624_2367488.pth
[2023-07-16 21:00:25,587][233414] Saving new best policy, reward=111.728!
[2023-07-16 21:00:26,087][233458] Updated weights for policy 0, policy_version 5280 (0.0006)
[2023-07-16 21:00:29,814][233458] Updated weights for policy 0, policy_version 5360 (0.0005)
[2023-07-16 21:00:30,579][233173] Fps is (10 sec: 11059.2, 60 sec: 11127.5, 300 sec: 10794.2). Total num frames: 2752512. Throughput: 0: 11032.8. Samples: 2745952. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:00:30,580][233173] Avg episode reward: [(0, '112.253')]
[2023-07-16 21:00:30,581][233414] Saving new best policy, reward=112.253!
[2023-07-16 21:00:33,545][233458] Updated weights for policy 0, policy_version 5440 (0.0006)
[2023-07-16 21:00:35,579][233173] Fps is (10 sec: 11059.2, 60 sec: 11059.2, 300 sec: 10791.4). Total num frames: 2805760. Throughput: 0: 11011.6. Samples: 2778676. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:00:35,580][233173] Avg episode reward: [(0, '112.404')]
[2023-07-16 21:00:35,581][233414] Saving new best policy, reward=112.404!
[2023-07-16 21:00:37,308][233458] Updated weights for policy 0, policy_version 5520 (0.0005)
[2023-07-16 21:00:40,579][233173] Fps is (10 sec: 10649.5, 60 sec: 10990.9, 300 sec: 10788.7). Total num frames: 2859008. Throughput: 0: 10995.6. Samples: 2843904. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0)
[2023-07-16 21:00:40,580][233173] Avg episode reward: [(0, '113.197')]
[2023-07-16 21:00:40,584][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000005584_2859008.pth...
[2023-07-16 21:00:40,587][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000004944_2531328.pth
[2023-07-16 21:00:40,587][233414] Saving new best policy, reward=113.197!
[2023-07-16 21:00:41,091][233458] Updated weights for policy 0, policy_version 5600 (0.0006)
[2023-07-16 21:00:44,652][233458] Updated weights for policy 0, policy_version 5680 (0.0004)
[2023-07-16 21:00:45,579][233173] Fps is (10 sec: 11059.3, 60 sec: 11059.2, 300 sec: 10801.3). Total num frames: 2916352. Throughput: 0: 11039.5. Samples: 2911980. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:00:45,580][233173] Avg episode reward: [(0, '114.691')]
[2023-07-16 21:00:45,580][233414] Saving new best policy, reward=114.691!
[2023-07-16 21:00:48,408][233458] Updated weights for policy 0, policy_version 5760 (0.0006)
[2023-07-16 21:00:50,579][233173] Fps is (10 sec: 11059.3, 60 sec: 10990.9, 300 sec: 10798.5). Total num frames: 2969600. Throughput: 0: 11007.8. Samples: 2944768. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:00:50,580][233173] Avg episode reward: [(0, '119.601')]
[2023-07-16 21:00:50,580][233414] Saving new best policy, reward=119.601!
[2023-07-16 21:00:52,053][233458] Updated weights for policy 0, policy_version 5840 (0.0005)
[2023-07-16 21:00:55,579][233173] Fps is (10 sec: 11059.1, 60 sec: 10990.9, 300 sec: 10810.5). Total num frames: 3026944. Throughput: 0: 11002.1. Samples: 3011316. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:00:55,579][233173] Avg episode reward: [(0, '119.934')]
[2023-07-16 21:00:55,583][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000005912_3026944.pth...
[2023-07-16 21:00:55,586][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000005264_2695168.pth
[2023-07-16 21:00:55,586][233414] Saving new best policy, reward=119.934!
[2023-07-16 21:00:55,810][233458] Updated weights for policy 0, policy_version 5920 (0.0005)
[2023-07-16 21:00:59,580][233458] Updated weights for policy 0, policy_version 6000 (0.0006)
[2023-07-16 21:01:00,579][233173] Fps is (10 sec: 11059.2, 60 sec: 10990.9, 300 sec: 10807.7). Total num frames: 3080192. Throughput: 0: 10977.7. Samples: 3076104. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:01:00,579][233173] Avg episode reward: [(0, '114.889')]
[2023-07-16 21:01:03,383][233458] Updated weights for policy 0, policy_version 6080 (0.0006)
[2023-07-16 21:01:05,579][233173] Fps is (10 sec: 11059.3, 60 sec: 10990.9, 300 sec: 10819.1). Total num frames: 3137536. Throughput: 0: 10987.9. Samples: 3108716. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:01:05,579][233173] Avg episode reward: [(0, '115.073')]
[2023-07-16 21:01:06,947][233458] Updated weights for policy 0, policy_version 6160 (0.0005)
[2023-07-16 21:01:10,502][233458] Updated weights for policy 0, policy_version 6240 (0.0005)
[2023-07-16 21:01:10,579][233173] Fps is (10 sec: 11468.7, 60 sec: 11059.2, 300 sec: 10830.1). Total num frames: 3194880. Throughput: 0: 11034.9. Samples: 3177284. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:01:10,579][233173] Avg episode reward: [(0, '111.224')]
[2023-07-16 21:01:10,583][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000006240_3194880.pth...
[2023-07-16 21:01:10,585][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000005584_2859008.pth
[2023-07-16 21:01:14,084][233458] Updated weights for policy 0, policy_version 6320 (0.0005)
[2023-07-16 21:01:15,579][233173] Fps is (10 sec: 11468.8, 60 sec: 11059.2, 300 sec: 10968.9). Total num frames: 3252224. Throughput: 0: 11109.5. Samples: 3245880. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:01:15,580][233173] Avg episode reward: [(0, '119.969')]
[2023-07-16 21:01:15,580][233414] Saving new best policy, reward=119.969!
[2023-07-16 21:01:17,664][233458] Updated weights for policy 0, policy_version 6400 (0.0005)
[2023-07-16 21:01:20,579][233173] Fps is (10 sec: 11468.8, 60 sec: 11127.5, 300 sec: 10969.0). Total num frames: 3309568. Throughput: 0: 11147.3. Samples: 3280304. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:01:20,579][233173] Avg episode reward: [(0, '123.697')]
[2023-07-16 21:01:20,580][233414] Saving new best policy, reward=123.697!
[2023-07-16 21:01:21,294][233458] Updated weights for policy 0, policy_version 6480 (0.0005)
[2023-07-16 21:01:24,830][233458] Updated weights for policy 0, policy_version 6560 (0.0004)
[2023-07-16 21:01:25,579][233173] Fps is (10 sec: 11468.8, 60 sec: 11195.7, 300 sec: 10982.8). Total num frames: 3366912. Throughput: 0: 11211.3. Samples: 3348412. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:01:25,579][233173] Avg episode reward: [(0, '152.760')]
[2023-07-16 21:01:25,583][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000006576_3366912.pth...
[2023-07-16 21:01:25,585][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000005912_3026944.pth
[2023-07-16 21:01:25,586][233414] Saving new best policy, reward=152.760!
[2023-07-16 21:01:28,376][233458] Updated weights for policy 0, policy_version 6640 (0.0004)
[2023-07-16 21:01:30,579][233173] Fps is (10 sec: 11468.9, 60 sec: 11195.7, 300 sec: 10996.7). Total num frames: 3424256. Throughput: 0: 11220.9. Samples: 3416920. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:01:30,579][233173] Avg episode reward: [(0, '169.987')]
[2023-07-16 21:01:30,580][233414] Saving new best policy, reward=169.987!
[2023-07-16 21:01:32,114][233458] Updated weights for policy 0, policy_version 6720 (0.0004)
[2023-07-16 21:01:35,579][233173] Fps is (10 sec: 11059.2, 60 sec: 11195.7, 300 sec: 10982.8). Total num frames: 3477504. Throughput: 0: 11219.3. Samples: 3449636. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0)
[2023-07-16 21:01:35,580][233173] Avg episode reward: [(0, '193.125')]
[2023-07-16 21:01:35,580][233414] Saving new best policy, reward=193.125!
[2023-07-16 21:01:35,889][233458] Updated weights for policy 0, policy_version 6800 (0.0005)
[2023-07-16 21:01:39,635][233458] Updated weights for policy 0, policy_version 6880 (0.0005)
[2023-07-16 21:01:40,579][233173] Fps is (10 sec: 10649.5, 60 sec: 11195.7, 300 sec: 10982.8). Total num frames: 3530752. Throughput: 0: 11192.8. Samples: 3514992. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:01:40,580][233173] Avg episode reward: [(0, '197.293')]
[2023-07-16 21:01:40,583][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000006896_3530752.pth...
[2023-07-16 21:01:40,586][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000006240_3194880.pth
[2023-07-16 21:01:40,586][233414] Saving new best policy, reward=197.293!
[2023-07-16 21:01:43,444][233458] Updated weights for policy 0, policy_version 6960 (0.0005)
[2023-07-16 21:01:45,579][233173] Fps is (10 sec: 10649.7, 60 sec: 11127.5, 300 sec: 10982.8). Total num frames: 3584000. Throughput: 0: 11197.2. Samples: 3579976. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0)
[2023-07-16 21:01:45,580][233173] Avg episode reward: [(0, '214.501')]
[2023-07-16 21:01:45,580][233414] Saving new best policy, reward=214.501!
[2023-07-16 21:01:47,197][233458] Updated weights for policy 0, policy_version 7040 (0.0005)
[2023-07-16 21:01:50,579][233173] Fps is (10 sec: 11059.2, 60 sec: 11195.7, 300 sec: 10982.8). Total num frames: 3641344. Throughput: 0: 11200.6. Samples: 3612744. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0)
[2023-07-16 21:01:50,580][233173] Avg episode reward: [(0, '215.870')]
[2023-07-16 21:01:50,581][233414] Saving new best policy, reward=215.870!
[2023-07-16 21:01:50,924][233458] Updated weights for policy 0, policy_version 7120 (0.0005)
[2023-07-16 21:01:54,694][233458] Updated weights for policy 0, policy_version 7200 (0.0005)
[2023-07-16 21:01:55,579][233173] Fps is (10 sec: 11059.2, 60 sec: 11127.5, 300 sec: 10982.8). Total num frames: 3694592. Throughput: 0: 11133.1. Samples: 3678272. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:01:55,580][233173] Avg episode reward: [(0, '204.688')]
[2023-07-16 21:01:55,583][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000007216_3694592.pth...
[2023-07-16 21:01:55,586][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000006576_3366912.pth
[2023-07-16 21:01:58,495][233458] Updated weights for policy 0, policy_version 7280 (0.0005)
[2023-07-16 21:02:00,579][233173] Fps is (10 sec: 10649.7, 60 sec: 11127.5, 300 sec: 10968.9). Total num frames: 3747840. Throughput: 0: 11055.3. Samples: 3743368. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0)
[2023-07-16 21:02:00,580][233173] Avg episode reward: [(0, '223.902')]
[2023-07-16 21:02:00,580][233414] Saving new best policy, reward=223.902!
[2023-07-16 21:02:02,298][233458] Updated weights for policy 0, policy_version 7360 (0.0005)
[2023-07-16 21:02:05,579][233173] Fps is (10 sec: 11059.0, 60 sec: 11127.4, 300 sec: 10982.8). Total num frames: 3805184. Throughput: 0: 11010.3. Samples: 3775768. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:02:05,580][233173] Avg episode reward: [(0, '226.431')]
[2023-07-16 21:02:05,581][233414] Saving new best policy, reward=226.431!
[2023-07-16 21:02:05,838][233458] Updated weights for policy 0, policy_version 7440 (0.0004)
[2023-07-16 21:02:09,365][233458] Updated weights for policy 0, policy_version 7520 (0.0004)
[2023-07-16 21:02:10,579][233173] Fps is (10 sec: 11468.7, 60 sec: 11127.5, 300 sec: 10982.8). Total num frames: 3862528. Throughput: 0: 11051.8. Samples: 3845744. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:02:10,580][233173] Avg episode reward: [(0, '226.518')]
[2023-07-16 21:02:10,584][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000007544_3862528.pth...
[2023-07-16 21:02:10,587][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000006896_3530752.pth
[2023-07-16 21:02:10,587][233414] Saving new best policy, reward=226.518!
[2023-07-16 21:02:12,854][233458] Updated weights for policy 0, policy_version 7600 (0.0004)
[2023-07-16 21:02:15,579][233173] Fps is (10 sec: 11469.0, 60 sec: 11127.5, 300 sec: 10982.8). Total num frames: 3919872. Throughput: 0: 11087.1. Samples: 3915840. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:02:15,580][233173] Avg episode reward: [(0, '219.495')]
[2023-07-16 21:02:16,376][233458] Updated weights for policy 0, policy_version 7680 (0.0004)
[2023-07-16 21:02:19,899][233458] Updated weights for policy 0, policy_version 7760 (0.0004)
[2023-07-16 21:02:20,579][233173] Fps is (10 sec: 11468.9, 60 sec: 11127.5, 300 sec: 10996.7). Total num frames: 3977216. Throughput: 0: 11131.7. Samples: 3950564. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0)
[2023-07-16 21:02:20,585][233173] Avg episode reward: [(0, '224.007')]
[2023-07-16 21:02:23,414][233458] Updated weights for policy 0, policy_version 7840 (0.0004)
[2023-07-16 21:02:25,579][233173] Fps is (10 sec: 11878.3, 60 sec: 11195.7, 300 sec: 11024.5). Total num frames: 4038656. Throughput: 0: 11234.8. Samples: 4020560. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0)
[2023-07-16 21:02:25,580][233173] Avg episode reward: [(0, '237.930')]
[2023-07-16 21:02:25,583][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000007888_4038656.pth...
[2023-07-16 21:02:25,585][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000007216_3694592.pth
[2023-07-16 21:02:25,586][233414] Saving new best policy, reward=237.930!
[2023-07-16 21:02:26,884][233458] Updated weights for policy 0, policy_version 7920 (0.0004)
[2023-07-16 21:02:30,393][233458] Updated weights for policy 0, policy_version 8000 (0.0004)
[2023-07-16 21:02:30,579][233173] Fps is (10 sec: 11878.4, 60 sec: 11195.7, 300 sec: 11038.4). Total num frames: 4096000. Throughput: 0: 11359.8. Samples: 4091168. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0)
[2023-07-16 21:02:30,579][233173] Avg episode reward: [(0, '227.551')]
[2023-07-16 21:02:33,935][233458] Updated weights for policy 0, policy_version 8080 (0.0004)
[2023-07-16 21:02:35,579][233173] Fps is (10 sec: 11468.8, 60 sec: 11264.0, 300 sec: 11038.4). Total num frames: 4153344. Throughput: 0: 11393.8. Samples: 4125464. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:02:35,579][233173] Avg episode reward: [(0, '247.405')]
[2023-07-16 21:02:35,580][233414] Saving new best policy, reward=247.405!
[2023-07-16 21:02:37,498][233458] Updated weights for policy 0, policy_version 8160 (0.0004)
[2023-07-16 21:02:40,579][233173] Fps is (10 sec: 11468.7, 60 sec: 11332.3, 300 sec: 11052.3). Total num frames: 4210688. Throughput: 0: 11481.9. Samples: 4194960. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:02:40,579][233173] Avg episode reward: [(0, '241.311')]
[2023-07-16 21:02:40,583][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000008224_4210688.pth...
[2023-07-16 21:02:40,585][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000007544_3862528.pth
[2023-07-16 21:02:41,023][233458] Updated weights for policy 0, policy_version 8240 (0.0004)
[2023-07-16 21:02:44,553][233458] Updated weights for policy 0, policy_version 8320 (0.0004)
[2023-07-16 21:02:45,579][233173] Fps is (10 sec: 11468.8, 60 sec: 11400.5, 300 sec: 11066.1). Total num frames: 4268032. Throughput: 0: 11579.5. Samples: 4264448. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:02:45,580][233173] Avg episode reward: [(0, '250.507')]
[2023-07-16 21:02:45,580][233414] Saving new best policy, reward=250.507!
[2023-07-16 21:02:48,162][233458] Updated weights for policy 0, policy_version 8400 (0.0005)
[2023-07-16 21:02:50,579][233173] Fps is (10 sec: 11468.9, 60 sec: 11400.6, 300 sec: 11080.0). Total num frames: 4325376. Throughput: 0: 11622.8. Samples: 4298792. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:02:50,579][233173] Avg episode reward: [(0, '220.265')]
[2023-07-16 21:02:52,004][233458] Updated weights for policy 0, policy_version 8480 (0.0005)
[2023-07-16 21:02:55,579][233173] Fps is (10 sec: 11059.2, 60 sec: 11400.5, 300 sec: 11080.0). Total num frames: 4378624. Throughput: 0: 11481.3. Samples: 4362404. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:02:55,579][233173] Avg episode reward: [(0, '248.841')]
[2023-07-16 21:02:55,582][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000008552_4378624.pth...
[2023-07-16 21:02:55,585][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000007888_4038656.pth
[2023-07-16 21:02:55,811][233458] Updated weights for policy 0, policy_version 8560 (0.0005)
[2023-07-16 21:02:59,613][233458] Updated weights for policy 0, policy_version 8640 (0.0005)
[2023-07-16 21:03:00,579][233173] Fps is (10 sec: 10649.6, 60 sec: 11400.5, 300 sec: 11080.0). Total num frames: 4431872. Throughput: 0: 11376.5. Samples: 4427784. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:03:00,579][233173] Avg episode reward: [(0, '252.878')]
[2023-07-16 21:03:00,580][233414] Saving new best policy, reward=252.878!
[2023-07-16 21:03:03,270][233458] Updated weights for policy 0, policy_version 8720 (0.0005)
[2023-07-16 21:03:05,579][233173] Fps is (10 sec: 11059.3, 60 sec: 11400.6, 300 sec: 11093.9). Total num frames: 4489216. Throughput: 0: 11340.4. Samples: 4460880. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:03:05,579][233173] Avg episode reward: [(0, '240.351')]
[2023-07-16 21:03:07,046][233458] Updated weights for policy 0, policy_version 8800 (0.0006)
[2023-07-16 21:03:10,579][233173] Fps is (10 sec: 11059.2, 60 sec: 11332.3, 300 sec: 11093.9). Total num frames: 4542464. Throughput: 0: 11238.4. Samples: 4526288. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:03:10,579][233173] Avg episode reward: [(0, '240.922')]
[2023-07-16 21:03:10,582][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000008872_4542464.pth...
[2023-07-16 21:03:10,585][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000008224_4210688.pth
[2023-07-16 21:03:10,770][233458] Updated weights for policy 0, policy_version 8880 (0.0006)
[2023-07-16 21:03:14,601][233458] Updated weights for policy 0, policy_version 8960 (0.0006)
[2023-07-16 21:03:15,579][233173] Fps is (10 sec: 10649.6, 60 sec: 11264.0, 300 sec: 11093.9). Total num frames: 4595712. Throughput: 0: 11122.5. Samples: 4591680. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0)
[2023-07-16 21:03:15,579][233173] Avg episode reward: [(0, '227.352')]
[2023-07-16 21:03:18,414][233458] Updated weights for policy 0, policy_version 9040 (0.0006)
[2023-07-16 21:03:20,579][233173] Fps is (10 sec: 10649.7, 60 sec: 11195.7, 300 sec: 11080.0). Total num frames: 4648960. Throughput: 0: 11084.6. Samples: 4624268. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0)
[2023-07-16 21:03:20,579][233173] Avg episode reward: [(0, '240.783')]
[2023-07-16 21:03:22,186][233458] Updated weights for policy 0, policy_version 9120 (0.0006)
[2023-07-16 21:03:25,579][233173] Fps is (10 sec: 11059.1, 60 sec: 11127.5, 300 sec: 11093.9). Total num frames: 4706304. Throughput: 0: 10983.5. Samples: 4689216. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:03:25,579][233173] Avg episode reward: [(0, '239.592')]
[2023-07-16 21:03:25,582][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000009192_4706304.pth...
[2023-07-16 21:03:25,585][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000008552_4378624.pth
[2023-07-16 21:03:25,905][233458] Updated weights for policy 0, policy_version 9200 (0.0006)
[2023-07-16 21:03:29,687][233458] Updated weights for policy 0, policy_version 9280 (0.0006)
[2023-07-16 21:03:30,579][233173] Fps is (10 sec: 11059.1, 60 sec: 11059.2, 300 sec: 11093.9). Total num frames: 4759552. Throughput: 0: 10893.4. Samples: 4754652. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:03:30,580][233173] Avg episode reward: [(0, '237.566')]
[2023-07-16 21:03:33,469][233458] Updated weights for policy 0, policy_version 9360 (0.0005)
[2023-07-16 21:03:35,579][233173] Fps is (10 sec: 10649.7, 60 sec: 10990.9, 300 sec: 11093.9). Total num frames: 4812800. Throughput: 0: 10859.6. Samples: 4787476. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:03:35,580][233173] Avg episode reward: [(0, '240.924')]
[2023-07-16 21:03:37,195][233458] Updated weights for policy 0, policy_version 9440 (0.0005)
[2023-07-16 21:03:40,579][233173] Fps is (10 sec: 10649.5, 60 sec: 10922.7, 300 sec: 11093.9). Total num frames: 4866048. Throughput: 0: 10886.9. Samples: 4852316. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:03:40,580][233173] Avg episode reward: [(0, '236.372')]
[2023-07-16 21:03:40,582][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000009504_4866048.pth...
[2023-07-16 21:03:40,585][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000008872_4542464.pth
[2023-07-16 21:03:41,026][233458] Updated weights for policy 0, policy_version 9520 (0.0006)
[2023-07-16 21:03:44,807][233458] Updated weights for policy 0, policy_version 9600 (0.0006)
[2023-07-16 21:03:45,579][233173] Fps is (10 sec: 11059.1, 60 sec: 10922.7, 300 sec: 11093.9). Total num frames: 4923392. Throughput: 0: 10868.6. Samples: 4916872. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0)
[2023-07-16 21:03:45,580][233173] Avg episode reward: [(0, '245.066')]
[2023-07-16 21:03:48,581][233458] Updated weights for policy 0, policy_version 9680 (0.0006)
[2023-07-16 21:03:50,579][233173] Fps is (10 sec: 11059.3, 60 sec: 10854.4, 300 sec: 11093.9). Total num frames: 4976640. Throughput: 0: 10847.8. Samples: 4949032. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0)
[2023-07-16 21:03:50,580][233173] Avg episode reward: [(0, '234.719')]
[2023-07-16 21:03:52,288][233458] Updated weights for policy 0, policy_version 9760 (0.0006)
[2023-07-16 21:03:55,579][233173] Fps is (10 sec: 10649.6, 60 sec: 10854.4, 300 sec: 11093.9). Total num frames: 5029888. Throughput: 0: 10859.2. Samples: 5014952. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0)
[2023-07-16 21:03:55,580][233173] Avg episode reward: [(0, '247.706')]
[2023-07-16 21:03:55,583][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000009824_5029888.pth...
[2023-07-16 21:03:55,586][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000009192_4706304.pth
[2023-07-16 21:03:56,099][233458] Updated weights for policy 0, policy_version 9840 (0.0006)
[2023-07-16 21:03:59,901][233458] Updated weights for policy 0, policy_version 9920 (0.0006)
[2023-07-16 21:04:00,579][233173] Fps is (10 sec: 10649.6, 60 sec: 10854.4, 300 sec: 11093.9). Total num frames: 5083136. Throughput: 0: 10838.9. Samples: 5079432. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0)
[2023-07-16 21:04:00,579][233173] Avg episode reward: [(0, '238.391')]
[2023-07-16 21:04:03,615][233458] Updated weights for policy 0, policy_version 10000 (0.0006)
[2023-07-16 21:04:05,579][233173] Fps is (10 sec: 11059.3, 60 sec: 10854.4, 300 sec: 11093.9). Total num frames: 5140480. Throughput: 0: 10846.8. Samples: 5112376. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0)
[2023-07-16 21:04:05,579][233173] Avg episode reward: [(0, '251.765')]
[2023-07-16 21:04:07,372][233458] Updated weights for policy 0, policy_version 10080 (0.0006)
[2023-07-16 21:04:10,579][233173] Fps is (10 sec: 11059.1, 60 sec: 10854.4, 300 sec: 11107.8). Total num frames: 5193728. Throughput: 0: 10868.1. Samples: 5178280. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0)
[2023-07-16 21:04:10,579][233173] Avg episode reward: [(0, '242.580')]
[2023-07-16 21:04:10,582][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000010144_5193728.pth...
[2023-07-16 21:04:10,585][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000009504_4866048.pth
[2023-07-16 21:04:11,117][233458] Updated weights for policy 0, policy_version 10160 (0.0006)
[2023-07-16 21:04:14,837][233458] Updated weights for policy 0, policy_version 10240 (0.0006)
[2023-07-16 21:04:15,579][233173] Fps is (10 sec: 10649.7, 60 sec: 10854.4, 300 sec: 11093.9). Total num frames: 5246976. Throughput: 0: 10878.9. Samples: 5244200. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0)
[2023-07-16 21:04:15,579][233173] Avg episode reward: [(0, '250.963')]
[2023-07-16 21:04:18,568][233458] Updated weights for policy 0, policy_version 10320 (0.0005)
[2023-07-16 21:04:20,579][233173] Fps is (10 sec: 11059.3, 60 sec: 10922.7, 300 sec: 11107.8). Total num frames: 5304320. Throughput: 0: 10877.0. Samples: 5276940. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0)
[2023-07-16 21:04:20,580][233173] Avg episode reward: [(0, '235.283')]
[2023-07-16 21:04:22,229][233458] Updated weights for policy 0, policy_version 10400 (0.0005)
[2023-07-16 21:04:25,579][233173] Fps is (10 sec: 11468.6, 60 sec: 10922.7, 300 sec: 11107.8). Total num frames: 5361664. Throughput: 0: 10927.6. Samples: 5344056. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0)
[2023-07-16 21:04:25,580][233173] Avg episode reward: [(0, '241.176')]
[2023-07-16 21:04:25,583][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000010472_5361664.pth...
[2023-07-16 21:04:25,586][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000009824_5029888.pth
[2023-07-16 21:04:25,855][233458] Updated weights for policy 0, policy_version 10480 (0.0005)
[2023-07-16 21:04:29,343][233458] Updated weights for policy 0, policy_version 10560 (0.0004)
[2023-07-16 21:04:30,579][233173] Fps is (10 sec: 11468.8, 60 sec: 10990.9, 300 sec: 11107.8). Total num frames: 5419008. Throughput: 0: 11046.6. Samples: 5413968. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:04:30,580][233173] Avg episode reward: [(0, '243.350')]
[2023-07-16 21:04:32,833][233458] Updated weights for policy 0, policy_version 10640 (0.0004)
[2023-07-16 21:04:35,579][233173] Fps is (10 sec: 11468.9, 60 sec: 11059.2, 300 sec: 11107.8). Total num frames: 5476352. Throughput: 0: 11111.6. Samples: 5449052. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:04:35,580][233173] Avg episode reward: [(0, '231.381')]
[2023-07-16 21:04:36,360][233458] Updated weights for policy 0, policy_version 10720 (0.0005)
[2023-07-16 21:04:39,830][233458] Updated weights for policy 0, policy_version 10800 (0.0005)
[2023-07-16 21:04:40,579][233173] Fps is (10 sec: 11878.4, 60 sec: 11195.7, 300 sec: 11135.6). Total num frames: 5537792. Throughput: 0: 11203.8. Samples: 5519124. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:04:40,580][233173] Avg episode reward: [(0, '241.939')]
[2023-07-16 21:04:40,583][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000010816_5537792.pth...
[2023-07-16 21:04:40,585][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000010144_5193728.pth
[2023-07-16 21:04:43,280][233458] Updated weights for policy 0, policy_version 10880 (0.0004)
[2023-07-16 21:04:45,579][233173] Fps is (10 sec: 11878.4, 60 sec: 11195.7, 300 sec: 11135.6). Total num frames: 5595136. Throughput: 0: 11352.3. Samples: 5590288. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:04:45,580][233173] Avg episode reward: [(0, '258.636')]
[2023-07-16 21:04:45,580][233414] Saving new best policy, reward=258.636!
[2023-07-16 21:04:46,766][233458] Updated weights for policy 0, policy_version 10960 (0.0004)
[2023-07-16 21:04:50,209][233458] Updated weights for policy 0, policy_version 11040 (0.0004)
[2023-07-16 21:04:50,579][233173] Fps is (10 sec: 11878.4, 60 sec: 11332.3, 300 sec: 11149.5). Total num frames: 5656576. Throughput: 0: 11405.4. Samples: 5625620. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:04:50,579][233173] Avg episode reward: [(0, '252.285')]
[2023-07-16 21:04:53,806][233458] Updated weights for policy 0, policy_version 11120 (0.0005)
[2023-07-16 21:04:55,579][233173] Fps is (10 sec: 11468.7, 60 sec: 11332.3, 300 sec: 11149.4). Total num frames: 5709824. Throughput: 0: 11485.0. Samples: 5695104. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0)
[2023-07-16 21:04:55,580][233173] Avg episode reward: [(0, '233.111')]
[2023-07-16 21:04:55,584][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000011152_5709824.pth...
[2023-07-16 21:04:55,587][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000010472_5361664.pth
[2023-07-16 21:04:57,534][233458] Updated weights for policy 0, policy_version 11200 (0.0006)
[2023-07-16 21:05:00,579][233173] Fps is (10 sec: 11059.2, 60 sec: 11400.5, 300 sec: 11149.5). Total num frames: 5767168. Throughput: 0: 11485.5. Samples: 5761048. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0)
[2023-07-16 21:05:00,580][233173] Avg episode reward: [(0, '244.755')]
[2023-07-16 21:05:01,288][233458] Updated weights for policy 0, policy_version 11280 (0.0006)
[2023-07-16 21:05:05,148][233458] Updated weights for policy 0, policy_version 11360 (0.0006)
[2023-07-16 21:05:05,579][233173] Fps is (10 sec: 11059.3, 60 sec: 11332.3, 300 sec: 11149.5). Total num frames: 5820416. Throughput: 0: 11469.6. Samples: 5793072. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0)
[2023-07-16 21:05:05,580][233173] Avg episode reward: [(0, '240.928')]
[2023-07-16 21:05:08,884][233458] Updated weights for policy 0, policy_version 11440 (0.0006)
[2023-07-16 21:05:10,579][233173] Fps is (10 sec: 10649.5, 60 sec: 11332.3, 300 sec: 11135.6). Total num frames: 5873664. Throughput: 0: 11420.3. Samples: 5857968. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:05:10,580][233173] Avg episode reward: [(0, '252.400')]
[2023-07-16 21:05:10,584][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000011472_5873664.pth...
[2023-07-16 21:05:10,587][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000010816_5537792.pth
[2023-07-16 21:05:12,547][233458] Updated weights for policy 0, policy_version 11520 (0.0005)
[2023-07-16 21:05:15,579][233173] Fps is (10 sec: 11059.2, 60 sec: 11400.5, 300 sec: 11149.5). Total num frames: 5931008. Throughput: 0: 11361.6. Samples: 5925240. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:05:15,580][233173] Avg episode reward: [(0, '242.673')]
[2023-07-16 21:05:16,170][233458] Updated weights for policy 0, policy_version 11600 (0.0005)
[2023-07-16 21:05:19,967][233458] Updated weights for policy 0, policy_version 11680 (0.0006)
[2023-07-16 21:05:20,579][233173] Fps is (10 sec: 11059.3, 60 sec: 11332.3, 300 sec: 11149.5). Total num frames: 5984256. Throughput: 0: 11319.5. Samples: 5958428. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0)
[2023-07-16 21:05:20,580][233173] Avg episode reward: [(0, '242.504')]
[2023-07-16 21:05:23,554][233458] Updated weights for policy 0, policy_version 11760 (0.0005)
[2023-07-16 21:05:25,579][233173] Fps is (10 sec: 11059.1, 60 sec: 11332.3, 300 sec: 11149.4). Total num frames: 6041600. Throughput: 0: 11250.7. Samples: 6025408. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0)
[2023-07-16 21:05:25,580][233173] Avg episode reward: [(0, '237.536')]
[2023-07-16 21:05:25,583][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000011800_6041600.pth...
[2023-07-16 21:05:25,586][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000011152_5709824.pth
[2023-07-16 21:05:27,109][233458] Updated weights for policy 0, policy_version 11840 (0.0005)
[2023-07-16 21:05:30,579][233173] Fps is (10 sec: 11468.8, 60 sec: 11332.3, 300 sec: 11163.3). Total num frames: 6098944. Throughput: 0: 11214.0. Samples: 6094920. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0)
[2023-07-16 21:05:30,580][233173] Avg episode reward: [(0, '251.202')]
[2023-07-16 21:05:30,642][233458] Updated weights for policy 0, policy_version 11920 (0.0005)
[2023-07-16 21:05:34,208][233458] Updated weights for policy 0, policy_version 12000 (0.0005)
[2023-07-16 21:05:35,579][233173] Fps is (10 sec: 11468.9, 60 sec: 11332.3, 300 sec: 11177.2). Total num frames: 6156288. Throughput: 0: 11197.7. Samples: 6129516. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0)
[2023-07-16 21:05:35,580][233173] Avg episode reward: [(0, '230.443')]
[2023-07-16 21:05:37,781][233458] Updated weights for policy 0, policy_version 12080 (0.0004)
[2023-07-16 21:05:40,579][233173] Fps is (10 sec: 11468.7, 60 sec: 11264.0, 300 sec: 11177.2). Total num frames: 6213632. Throughput: 0: 11181.6. Samples: 6198276. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:05:40,580][233173] Avg episode reward: [(0, '230.914')]
[2023-07-16 21:05:40,583][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000012144_6217728.pth...
[2023-07-16 21:05:40,585][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000011472_5873664.pth
[2023-07-16 21:05:41,309][233458] Updated weights for policy 0, policy_version 12160 (0.0004)
[2023-07-16 21:05:44,848][233458] Updated weights for policy 0, policy_version 12240 (0.0004)
[2023-07-16 21:05:45,579][233173] Fps is (10 sec: 11878.4, 60 sec: 11332.3, 300 sec: 11205.0). Total num frames: 6275072. Throughput: 0: 11265.5. Samples: 6267996. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0)
[2023-07-16 21:05:45,579][233173] Avg episode reward: [(0, '244.432')]
[2023-07-16 21:05:48,415][233458] Updated weights for policy 0, policy_version 12320 (0.0005)
[2023-07-16 21:05:50,579][233173] Fps is (10 sec: 11878.4, 60 sec: 11264.0, 300 sec: 11205.0). Total num frames: 6332416. Throughput: 0: 11329.2. Samples: 6302888. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0)
[2023-07-16 21:05:50,580][233173] Avg episode reward: [(0, '248.237')]
[2023-07-16 21:05:51,827][233458] Updated weights for policy 0, policy_version 12400 (0.0004)
[2023-07-16 21:05:55,318][233458] Updated weights for policy 0, policy_version 12480 (0.0004)
[2023-07-16 21:05:55,579][233173] Fps is (10 sec: 11468.7, 60 sec: 11332.3, 300 sec: 11218.9). Total num frames: 6389760. Throughput: 0: 11464.4. Samples: 6373868. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:05:55,580][233173] Avg episode reward: [(0, '259.238')]
[2023-07-16 21:05:55,583][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000012480_6389760.pth...
[2023-07-16 21:05:55,586][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000011800_6041600.pth
[2023-07-16 21:05:55,586][233414] Saving new best policy, reward=259.238!
[2023-07-16 21:05:58,816][233458] Updated weights for policy 0, policy_version 12560 (0.0004)
[2023-07-16 21:06:00,579][233173] Fps is (10 sec: 11878.4, 60 sec: 11400.5, 300 sec: 11232.8). Total num frames: 6451200. Throughput: 0: 11525.0. Samples: 6443864. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:06:00,580][233173] Avg episode reward: [(0, '248.101')]
[2023-07-16 21:06:02,271][233458] Updated weights for policy 0, policy_version 12640 (0.0004)
[2023-07-16 21:06:05,579][233173] Fps is (10 sec: 11878.5, 60 sec: 11468.8, 300 sec: 11232.8). Total num frames: 6508544. Throughput: 0: 11585.9. Samples: 6479792. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:06:05,579][233173] Avg episode reward: [(0, '238.714')]
[2023-07-16 21:06:05,790][233458] Updated weights for policy 0, policy_version 12720 (0.0004)
[2023-07-16 21:06:09,302][233458] Updated weights for policy 0, policy_version 12800 (0.0005)
[2023-07-16 21:06:10,579][233173] Fps is (10 sec: 11468.8, 60 sec: 11537.1, 300 sec: 11232.8). Total num frames: 6565888. Throughput: 0: 11648.0. Samples: 6549568. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:06:10,579][233173] Avg episode reward: [(0, '239.454')]
[2023-07-16 21:06:10,583][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000012824_6565888.pth...
[2023-07-16 21:06:10,585][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000012144_6217728.pth
[2023-07-16 21:06:12,870][233458] Updated weights for policy 0, policy_version 12880 (0.0005)
[2023-07-16 21:06:15,579][233173] Fps is (10 sec: 11468.9, 60 sec: 11537.1, 300 sec: 11232.8). Total num frames: 6623232. Throughput: 0: 11650.8. Samples: 6619208. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:06:15,579][233173] Avg episode reward: [(0, '251.982')]
[2023-07-16 21:06:16,359][233458] Updated weights for policy 0, policy_version 12960 (0.0004)
[2023-07-16 21:06:19,831][233458] Updated weights for policy 0, policy_version 13040 (0.0004)
[2023-07-16 21:06:20,579][233173] Fps is (10 sec: 11878.4, 60 sec: 11673.6, 300 sec: 11246.6). Total num frames: 6684672. Throughput: 0: 11673.5. Samples: 6654824. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:06:20,579][233173] Avg episode reward: [(0, '255.756')]
[2023-07-16 21:06:23,359][233458] Updated weights for policy 0, policy_version 13120 (0.0005)
[2023-07-16 21:06:25,579][233173] Fps is (10 sec: 11878.3, 60 sec: 11673.6, 300 sec: 11246.6). Total num frames: 6742016. Throughput: 0: 11697.2. Samples: 6724648. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:06:25,580][233173] Avg episode reward: [(0, '248.211')]
[2023-07-16 21:06:25,583][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000013168_6742016.pth...
[2023-07-16 21:06:25,585][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000012480_6389760.pth
[2023-07-16 21:06:26,835][233458] Updated weights for policy 0, policy_version 13200 (0.0004)
[2023-07-16 21:06:30,397][233458] Updated weights for policy 0, policy_version 13280 (0.0005)
[2023-07-16 21:06:30,579][233173] Fps is (10 sec: 11468.8, 60 sec: 11673.6, 300 sec: 11260.5). Total num frames: 6799360. Throughput: 0: 11696.2. Samples: 6794324. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:06:30,579][233173] Avg episode reward: [(0, '241.638')]
[2023-07-16 21:06:33,923][233458] Updated weights for policy 0, policy_version 13360 (0.0004)
[2023-07-16 21:06:35,579][233173] Fps is (10 sec: 11468.8, 60 sec: 11673.6, 300 sec: 11274.4). Total num frames: 6856704. Throughput: 0: 11687.4. Samples: 6828820. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0)
[2023-07-16 21:06:35,580][233173] Avg episode reward: [(0, '242.056')]
[2023-07-16 21:06:37,407][233458] Updated weights for policy 0, policy_version 13440 (0.0004)
[2023-07-16 21:06:40,579][233173] Fps is (10 sec: 11468.7, 60 sec: 11673.6, 300 sec: 11288.3). Total num frames: 6914048. Throughput: 0: 11663.1. Samples: 6898708. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0)
[2023-07-16 21:06:40,580][233173] Avg episode reward: [(0, '259.936')]
[2023-07-16 21:06:40,583][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000013504_6914048.pth...
[2023-07-16 21:06:40,586][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000012824_6565888.pth
[2023-07-16 21:06:40,586][233414] Saving new best policy, reward=259.936!
[2023-07-16 21:06:41,140][233458] Updated weights for policy 0, policy_version 13520 (0.0006)
[2023-07-16 21:06:44,857][233458] Updated weights for policy 0, policy_version 13600 (0.0006)
[2023-07-16 21:06:45,579][233173] Fps is (10 sec: 11059.3, 60 sec: 11537.1, 300 sec: 11274.4). Total num frames: 6967296. Throughput: 0: 11562.9. Samples: 6964192. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0)
[2023-07-16 21:06:45,579][233173] Avg episode reward: [(0, '260.463')]
[2023-07-16 21:06:45,611][233414] Saving new best policy, reward=260.463!
[2023-07-16 21:06:48,627][233458] Updated weights for policy 0, policy_version 13680 (0.0006)
[2023-07-16 21:06:50,579][233173] Fps is (10 sec: 11059.3, 60 sec: 11537.1, 300 sec: 11288.3). Total num frames: 7024640. Throughput: 0: 11478.8. Samples: 6996336. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0)
[2023-07-16 21:06:50,580][233173] Avg episode reward: [(0, '247.895')]
[2023-07-16 21:06:52,395][233458] Updated weights for policy 0, policy_version 13760 (0.0006)
[2023-07-16 21:06:55,579][233173] Fps is (10 sec: 11059.1, 60 sec: 11468.8, 300 sec: 11288.3). Total num frames: 7077888. Throughput: 0: 11394.4. Samples: 7062316. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:06:55,580][233173] Avg episode reward: [(0, '237.198')]
[2023-07-16 21:06:55,583][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000013824_7077888.pth...
[2023-07-16 21:06:55,586][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000013168_6742016.pth
[2023-07-16 21:06:56,203][233458] Updated weights for policy 0, policy_version 13840 (0.0005)
[2023-07-16 21:06:59,717][233458] Updated weights for policy 0, policy_version 13920 (0.0004)
[2023-07-16 21:07:00,579][233173] Fps is (10 sec: 11059.2, 60 sec: 11400.5, 300 sec: 11288.3). Total num frames: 7135232. Throughput: 0: 11349.9. Samples: 7129952. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:07:00,579][233173] Avg episode reward: [(0, '265.948')]
[2023-07-16 21:07:00,580][233414] Saving new best policy, reward=265.948!
[2023-07-16 21:07:03,358][233458] Updated weights for policy 0, policy_version 14000 (0.0005)
[2023-07-16 21:07:05,579][233173] Fps is (10 sec: 11468.9, 60 sec: 11400.5, 300 sec: 11288.3). Total num frames: 7192576. Throughput: 0: 11310.1. Samples: 7163780. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0)
[2023-07-16 21:07:05,579][233173] Avg episode reward: [(0, '252.173')]
[2023-07-16 21:07:06,841][233458] Updated weights for policy 0, policy_version 14080 (0.0004)
[2023-07-16 21:07:10,380][233458] Updated weights for policy 0, policy_version 14160 (0.0004)
[2023-07-16 21:07:10,579][233173] Fps is (10 sec: 11468.7, 60 sec: 11400.5, 300 sec: 11288.3). Total num frames: 7249920. Throughput: 0: 11315.0. Samples: 7233824. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0)
[2023-07-16 21:07:10,579][233173] Avg episode reward: [(0, '251.743')]
[2023-07-16 21:07:10,583][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000014160_7249920.pth...
[2023-07-16 21:07:10,585][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000013504_6914048.pth
[2023-07-16 21:07:13,955][233458] Updated weights for policy 0, policy_version 14240 (0.0004)
[2023-07-16 21:07:15,579][233173] Fps is (10 sec: 11468.7, 60 sec: 11400.5, 300 sec: 11288.3). Total num frames: 7307264. Throughput: 0: 11275.5. Samples: 7301724. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:07:15,580][233173] Avg episode reward: [(0, '254.450')]
[2023-07-16 21:07:17,803][233458] Updated weights for policy 0, policy_version 14320 (0.0006)
[2023-07-16 21:07:20,579][233173] Fps is (10 sec: 11059.3, 60 sec: 11264.0, 300 sec: 11260.5). Total num frames: 7360512. Throughput: 0: 11216.7. Samples: 7333572. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:07:20,579][233173] Avg episode reward: [(0, '263.053')]
[2023-07-16 21:07:21,537][233458] Updated weights for policy 0, policy_version 14400 (0.0005)
[2023-07-16 21:07:25,297][233458] Updated weights for policy 0, policy_version 14480 (0.0005)
[2023-07-16 21:07:25,579][233173] Fps is (10 sec: 10649.6, 60 sec: 11195.7, 300 sec: 11246.6). Total num frames: 7413760. Throughput: 0: 11129.9. Samples: 7399556. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:07:25,580][233173] Avg episode reward: [(0, '242.342')]
[2023-07-16 21:07:25,583][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000014480_7413760.pth...
[2023-07-16 21:07:25,586][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000013824_7077888.pth
[2023-07-16 21:07:29,056][233458] Updated weights for policy 0, policy_version 14560 (0.0005)
[2023-07-16 21:07:30,579][233173] Fps is (10 sec: 11059.2, 60 sec: 11195.7, 300 sec: 11246.6). Total num frames: 7471104. Throughput: 0: 11115.0. Samples: 7464368. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:07:30,580][233173] Avg episode reward: [(0, '264.440')]
[2023-07-16 21:07:32,657][233458] Updated weights for policy 0, policy_version 14640 (0.0005)
[2023-07-16 21:07:35,579][233173] Fps is (10 sec: 11468.9, 60 sec: 11195.7, 300 sec: 11246.6). Total num frames: 7528448. Throughput: 0: 11179.8. Samples: 7499428. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:07:35,579][233173] Avg episode reward: [(0, '258.466')]
[2023-07-16 21:07:36,148][233458] Updated weights for policy 0, policy_version 14720 (0.0004)
[2023-07-16 21:07:39,841][233458] Updated weights for policy 0, policy_version 14800 (0.0005)
[2023-07-16 21:07:40,579][233173] Fps is (10 sec: 11059.2, 60 sec: 11127.5, 300 sec: 11232.8). Total num frames: 7581696. Throughput: 0: 11246.4. Samples: 7568404. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0)
[2023-07-16 21:07:40,579][233173] Avg episode reward: [(0, '251.050')]
[2023-07-16 21:07:40,593][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000014816_7585792.pth...
[2023-07-16 21:07:40,595][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000014160_7249920.pth
[2023-07-16 21:07:43,430][233458] Updated weights for policy 0, policy_version 14880 (0.0004)
[2023-07-16 21:07:45,579][233173] Fps is (10 sec: 11468.8, 60 sec: 11264.0, 300 sec: 11246.6). Total num frames: 7643136. Throughput: 0: 11243.5. Samples: 7635912. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0)
[2023-07-16 21:07:45,580][233173] Avg episode reward: [(0, '257.649')]
[2023-07-16 21:07:46,968][233458] Updated weights for policy 0, policy_version 14960 (0.0005)
[2023-07-16 21:07:50,415][233458] Updated weights for policy 0, policy_version 15040 (0.0004)
[2023-07-16 21:07:50,579][233173] Fps is (10 sec: 11878.4, 60 sec: 11264.0, 300 sec: 11260.5). Total num frames: 7700480. Throughput: 0: 11285.8. Samples: 7671640. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0)
[2023-07-16 21:07:50,579][233173] Avg episode reward: [(0, '259.584')]
[2023-07-16 21:07:53,954][233458] Updated weights for policy 0, policy_version 15120 (0.0004)
[2023-07-16 21:07:55,579][233173] Fps is (10 sec: 11468.8, 60 sec: 11332.3, 300 sec: 11274.4). Total num frames: 7757824. Throughput: 0: 11281.8. Samples: 7741504. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0)
[2023-07-16 21:07:55,579][233173] Avg episode reward: [(0, '254.896')]
[2023-07-16 21:07:55,582][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000015152_7757824.pth...
[2023-07-16 21:07:55,585][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000014480_7413760.pth
[2023-07-16 21:07:57,450][233458] Updated weights for policy 0, policy_version 15200 (0.0004)
[2023-07-16 21:08:00,579][233173] Fps is (10 sec: 11468.8, 60 sec: 11332.3, 300 sec: 11274.4). Total num frames: 7815168. Throughput: 0: 11324.8. Samples: 7811340. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0)
[2023-07-16 21:08:00,579][233173] Avg episode reward: [(0, '242.059')]
[2023-07-16 21:08:00,972][233458] Updated weights for policy 0, policy_version 15280 (0.0004)
[2023-07-16 21:08:04,490][233458] Updated weights for policy 0, policy_version 15360 (0.0004)
[2023-07-16 21:08:05,579][233173] Fps is (10 sec: 11878.4, 60 sec: 11400.5, 300 sec: 11302.2). Total num frames: 7876608. Throughput: 0: 11402.2. Samples: 7846672. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0)
[2023-07-16 21:08:05,580][233173] Avg episode reward: [(0, '248.020')]
[2023-07-16 21:08:07,996][233458] Updated weights for policy 0, policy_version 15440 (0.0004)
[2023-07-16 21:08:10,579][233173] Fps is (10 sec: 11878.3, 60 sec: 11400.5, 300 sec: 11316.1). Total num frames: 7933952. Throughput: 0: 11493.9. Samples: 7916780. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0)
[2023-07-16 21:08:10,579][233173] Avg episode reward: [(0, '260.911')]
[2023-07-16 21:08:10,583][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000015496_7933952.pth...
[2023-07-16 21:08:10,585][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000014816_7585792.pth
[2023-07-16 21:08:11,499][233458] Updated weights for policy 0, policy_version 15520 (0.0004)
[2023-07-16 21:08:14,990][233458] Updated weights for policy 0, policy_version 15600 (0.0004)
[2023-07-16 21:08:15,579][233173] Fps is (10 sec: 11468.8, 60 sec: 11400.5, 300 sec: 11329.9). Total num frames: 7991296. Throughput: 0: 11618.0. Samples: 7987180. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0)
[2023-07-16 21:08:15,579][233173] Avg episode reward: [(0, '245.380')]
[2023-07-16 21:08:18,442][233458] Updated weights for policy 0, policy_version 15680 (0.0004)
[2023-07-16 21:08:20,579][233173] Fps is (10 sec: 11878.5, 60 sec: 11537.1, 300 sec: 11343.8). Total num frames: 8052736. Throughput: 0: 11615.6. Samples: 8022128. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0)
[2023-07-16 21:08:20,579][233173] Avg episode reward: [(0, '256.791')]
[2023-07-16 21:08:21,906][233458] Updated weights for policy 0, policy_version 15760 (0.0004)
[2023-07-16 21:08:25,401][233458] Updated weights for policy 0, policy_version 15840 (0.0004)
[2023-07-16 21:08:25,579][233173] Fps is (10 sec: 11878.3, 60 sec: 11605.3, 300 sec: 11357.7). Total num frames: 8110080. Throughput: 0: 11674.6. Samples: 8093760. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0)
[2023-07-16 21:08:25,579][233173] Avg episode reward: [(0, '257.080')]
[2023-07-16 21:08:25,583][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000015840_8110080.pth...
[2023-07-16 21:08:25,586][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000015152_7757824.pth
[2023-07-16 21:08:28,931][233458] Updated weights for policy 0, policy_version 15920 (0.0004)
[2023-07-16 21:08:30,579][233173] Fps is (10 sec: 11468.8, 60 sec: 11605.3, 300 sec: 11371.6). Total num frames: 8167424. Throughput: 0: 11717.8. Samples: 8163212. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0)
[2023-07-16 21:08:30,579][233173] Avg episode reward: [(0, '261.754')]
[2023-07-16 21:08:32,436][233458] Updated weights for policy 0, policy_version 16000 (0.0005)
[2023-07-16 21:08:35,579][233173] Fps is (10 sec: 11878.3, 60 sec: 11673.6, 300 sec: 11399.4). Total num frames: 8228864. Throughput: 0: 11695.3. Samples: 8197928. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:08:35,580][233173] Avg episode reward: [(0, '239.413')]
[2023-07-16 21:08:35,939][233458] Updated weights for policy 0, policy_version 16080 (0.0004)
[2023-07-16 21:08:39,608][233458] Updated weights for policy 0, policy_version 16160 (0.0005)
[2023-07-16 21:08:40,579][233173] Fps is (10 sec: 11468.8, 60 sec: 11673.6, 300 sec: 11385.5). Total num frames: 8282112. Throughput: 0: 11676.4. Samples: 8266944. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:08:40,580][233173] Avg episode reward: [(0, '252.493')]
[2023-07-16 21:08:40,583][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000016176_8282112.pth...
[2023-07-16 21:08:40,586][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000015496_7933952.pth
[2023-07-16 21:08:43,311][233458] Updated weights for policy 0, policy_version 16240 (0.0006)
[2023-07-16 21:08:45,579][233173] Fps is (10 sec: 11059.3, 60 sec: 11605.3, 300 sec: 11399.4). Total num frames: 8339456. Throughput: 0: 11588.4. Samples: 8332816. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0)
[2023-07-16 21:08:45,579][233173] Avg episode reward: [(0, '254.165')]
[2023-07-16 21:08:47,060][233458] Updated weights for policy 0, policy_version 16320 (0.0006)
[2023-07-16 21:08:50,579][233173] Fps is (10 sec: 11059.2, 60 sec: 11537.1, 300 sec: 11399.4). Total num frames: 8392704. Throughput: 0: 11531.4. Samples: 8365584. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0)
[2023-07-16 21:08:50,580][233173] Avg episode reward: [(0, '244.095')]
[2023-07-16 21:08:50,717][233458] Updated weights for policy 0, policy_version 16400 (0.0005)
[2023-07-16 21:08:54,297][233458] Updated weights for policy 0, policy_version 16480 (0.0005)
[2023-07-16 21:08:55,579][233173] Fps is (10 sec: 11059.1, 60 sec: 11537.1, 300 sec: 11413.3). Total num frames: 8450048. Throughput: 0: 11487.9. Samples: 8433736. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0)
[2023-07-16 21:08:55,579][233173] Avg episode reward: [(0, '243.795')]
[2023-07-16 21:08:55,583][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000016504_8450048.pth...
[2023-07-16 21:08:55,586][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000015840_8110080.pth
[2023-07-16 21:08:57,994][233458] Updated weights for policy 0, policy_version 16560 (0.0005)
[2023-07-16 21:09:00,579][233173] Fps is (10 sec: 11468.8, 60 sec: 11537.1, 300 sec: 11413.3). Total num frames: 8507392. Throughput: 0: 11416.8. Samples: 8500936. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:09:00,580][233173] Avg episode reward: [(0, '268.202')]
[2023-07-16 21:09:00,580][233414] Saving new best policy, reward=268.202!
[2023-07-16 21:09:01,524][233458] Updated weights for policy 0, policy_version 16640 (0.0004)
[2023-07-16 21:09:05,261][233458] Updated weights for policy 0, policy_version 16720 (0.0005)
[2023-07-16 21:09:05,579][233173] Fps is (10 sec: 11059.3, 60 sec: 11400.5, 300 sec: 11413.3). Total num frames: 8560640. Throughput: 0: 11421.1. Samples: 8536076. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:09:05,580][233173] Avg episode reward: [(0, '250.291')]
[2023-07-16 21:09:09,054][233458] Updated weights for policy 0, policy_version 16800 (0.0006)
[2023-07-16 21:09:10,579][233173] Fps is (10 sec: 11059.2, 60 sec: 11400.5, 300 sec: 11427.1). Total num frames: 8617984. Throughput: 0: 11273.0. Samples: 8601044. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0)
[2023-07-16 21:09:10,580][233173] Avg episode reward: [(0, '259.330')]
[2023-07-16 21:09:10,583][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000016832_8617984.pth...
[2023-07-16 21:09:10,586][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000016176_8282112.pth
[2023-07-16 21:09:12,791][233458] Updated weights for policy 0, policy_version 16880 (0.0006)
[2023-07-16 21:09:15,579][233173] Fps is (10 sec: 11059.2, 60 sec: 11332.3, 300 sec: 11413.3). Total num frames: 8671232. Throughput: 0: 11176.1. Samples: 8666136. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0)
[2023-07-16 21:09:15,580][233173] Avg episode reward: [(0, '250.446')]
[2023-07-16 21:09:16,597][233458] Updated weights for policy 0, policy_version 16960 (0.0006)
[2023-07-16 21:09:20,317][233458] Updated weights for policy 0, policy_version 17040 (0.0006)
[2023-07-16 21:09:20,579][233173] Fps is (10 sec: 10649.7, 60 sec: 11195.7, 300 sec: 11399.4). Total num frames: 8724480. Throughput: 0: 11130.8. Samples: 8698812. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:09:20,580][233173] Avg episode reward: [(0, '252.739')]
[2023-07-16 21:09:23,888][233458] Updated weights for policy 0, policy_version 17120 (0.0005)
[2023-07-16 21:09:25,579][233173] Fps is (10 sec: 11059.2, 60 sec: 11195.8, 300 sec: 11399.4). Total num frames: 8781824. Throughput: 0: 11091.4. Samples: 8766056. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:09:25,580][233173] Avg episode reward: [(0, '260.518')]
[2023-07-16 21:09:25,603][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000017160_8785920.pth...
[2023-07-16 21:09:25,604][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000016504_8450048.pth
[2023-07-16 21:09:27,316][233458] Updated weights for policy 0, policy_version 17200 (0.0004)
[2023-07-16 21:09:30,579][233173] Fps is (10 sec: 11878.4, 60 sec: 11264.0, 300 sec: 11413.3). Total num frames: 8843264. Throughput: 0: 11183.6. Samples: 8836080. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:09:30,579][233173] Avg episode reward: [(0, '266.308')]
[2023-07-16 21:09:30,893][233458] Updated weights for policy 0, policy_version 17280 (0.0005)
[2023-07-16 21:09:34,447][233458] Updated weights for policy 0, policy_version 17360 (0.0005)
[2023-07-16 21:09:35,579][233173] Fps is (10 sec: 11878.3, 60 sec: 11195.8, 300 sec: 11399.4). Total num frames: 8900608. Throughput: 0: 11230.0. Samples: 8870932. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:09:35,579][233173] Avg episode reward: [(0, '266.495')]
[2023-07-16 21:09:37,890][233458] Updated weights for policy 0, policy_version 17440 (0.0004)
[2023-07-16 21:09:40,579][233173] Fps is (10 sec: 11468.7, 60 sec: 11264.0, 300 sec: 11399.4). Total num frames: 8957952. Throughput: 0: 11286.6. Samples: 8941632. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:09:40,579][233173] Avg episode reward: [(0, '249.994')]
[2023-07-16 21:09:40,583][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000017496_8957952.pth...
[2023-07-16 21:09:40,586][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000016832_8617984.pth
[2023-07-16 21:09:41,391][233458] Updated weights for policy 0, policy_version 17520 (0.0004)
[2023-07-16 21:09:44,861][233458] Updated weights for policy 0, policy_version 17600 (0.0004)
[2023-07-16 21:09:45,579][233173] Fps is (10 sec: 11878.4, 60 sec: 11332.3, 300 sec: 11399.4). Total num frames: 9019392. Throughput: 0: 11359.7. Samples: 9012120. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0)
[2023-07-16 21:09:45,579][233173] Avg episode reward: [(0, '268.194')]
[2023-07-16 21:09:48,390][233458] Updated weights for policy 0, policy_version 17680 (0.0004)
[2023-07-16 21:09:50,579][233173] Fps is (10 sec: 11468.9, 60 sec: 11332.3, 300 sec: 11399.4). Total num frames: 9072640. Throughput: 0: 11374.2. Samples: 9047916. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0)
[2023-07-16 21:09:50,579][233173] Avg episode reward: [(0, '245.481')]
[2023-07-16 21:09:52,183][233458] Updated weights for policy 0, policy_version 17760 (0.0006)
[2023-07-16 21:09:55,579][233173] Fps is (10 sec: 11059.0, 60 sec: 11332.2, 300 sec: 11399.4). Total num frames: 9129984. Throughput: 0: 11382.6. Samples: 9113260. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0)
[2023-07-16 21:09:55,580][233173] Avg episode reward: [(0, '255.933')]
[2023-07-16 21:09:55,583][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000017832_9129984.pth...
[2023-07-16 21:09:55,586][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000017160_8785920.pth
[2023-07-16 21:09:55,892][233458] Updated weights for policy 0, policy_version 17840 (0.0006)
[2023-07-16 21:09:59,645][233458] Updated weights for policy 0, policy_version 17920 (0.0006)
[2023-07-16 21:10:00,579][233173] Fps is (10 sec: 11059.1, 60 sec: 11264.0, 300 sec: 11399.4). Total num frames: 9183232. Throughput: 0: 11393.6. Samples: 9178848. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:10:00,579][233173] Avg episode reward: [(0, '257.570')]
[2023-07-16 21:10:03,309][233458] Updated weights for policy 0, policy_version 18000 (0.0005)
[2023-07-16 21:10:05,579][233173] Fps is (10 sec: 11059.4, 60 sec: 11332.3, 300 sec: 11413.3). Total num frames: 9240576. Throughput: 0: 11403.5. Samples: 9211968. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:10:05,579][233173] Avg episode reward: [(0, '253.401')]
[2023-07-16 21:10:06,729][233458] Updated weights for policy 0, policy_version 18080 (0.0004)
[2023-07-16 21:10:10,208][233458] Updated weights for policy 0, policy_version 18160 (0.0004)
[2023-07-16 21:10:10,579][233173] Fps is (10 sec: 11878.4, 60 sec: 11400.5, 300 sec: 11427.1). Total num frames: 9302016. Throughput: 0: 11484.9. Samples: 9282880. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:10:10,579][233173] Avg episode reward: [(0, '249.736')]
[2023-07-16 21:10:10,583][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000018168_9302016.pth...
[2023-07-16 21:10:10,585][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000017496_8957952.pth
[2023-07-16 21:10:13,671][233458] Updated weights for policy 0, policy_version 18240 (0.0004)
[2023-07-16 21:10:15,579][233173] Fps is (10 sec: 11878.4, 60 sec: 11468.8, 300 sec: 11441.0). Total num frames: 9359360. Throughput: 0: 11518.6. Samples: 9354416. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:10:15,579][233173] Avg episode reward: [(0, '252.513')]
[2023-07-16 21:10:17,181][233458] Updated weights for policy 0, policy_version 18320 (0.0004)
[2023-07-16 21:10:20,579][233173] Fps is (10 sec: 11468.9, 60 sec: 11537.1, 300 sec: 11441.0). Total num frames: 9416704. Throughput: 0: 11508.0. Samples: 9388792. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:10:20,579][233173] Avg episode reward: [(0, '248.556')]
[2023-07-16 21:10:20,707][233458] Updated weights for policy 0, policy_version 18400 (0.0005)
[2023-07-16 21:10:24,248][233458] Updated weights for policy 0, policy_version 18480 (0.0005)
[2023-07-16 21:10:25,579][233173] Fps is (10 sec: 11468.7, 60 sec: 11537.0, 300 sec: 11441.0). Total num frames: 9474048. Throughput: 0: 11475.0. Samples: 9458008. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0)
[2023-07-16 21:10:25,580][233173] Avg episode reward: [(0, '256.695')]
[2023-07-16 21:10:25,582][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000018504_9474048.pth...
[2023-07-16 21:10:25,585][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000017832_9129984.pth
[2023-07-16 21:10:27,706][233458] Updated weights for policy 0, policy_version 18560 (0.0004)
[2023-07-16 21:10:30,579][233173] Fps is (10 sec: 11878.3, 60 sec: 11537.1, 300 sec: 11454.9). Total num frames: 9535488. Throughput: 0: 11493.8. Samples: 9529340. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0)
[2023-07-16 21:10:30,580][233173] Avg episode reward: [(0, '262.615')]
[2023-07-16 21:10:31,187][233458] Updated weights for policy 0, policy_version 18640 (0.0004)
[2023-07-16 21:10:34,673][233458] Updated weights for policy 0, policy_version 18720 (0.0004)
[2023-07-16 21:10:35,579][233173] Fps is (10 sec: 11878.5, 60 sec: 11537.1, 300 sec: 11454.9). Total num frames: 9592832. Throughput: 0: 11478.7. Samples: 9564460. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0)
[2023-07-16 21:10:35,579][233173] Avg episode reward: [(0, '266.768')]
[2023-07-16 21:10:38,379][233458] Updated weights for policy 0, policy_version 18800 (0.0005)
[2023-07-16 21:10:40,579][233173] Fps is (10 sec: 11468.8, 60 sec: 11537.1, 300 sec: 11441.0). Total num frames: 9650176. Throughput: 0: 11539.8. Samples: 9632552. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0)
[2023-07-16 21:10:40,580][233173] Avg episode reward: [(0, '263.259')]
[2023-07-16 21:10:40,582][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000018848_9650176.pth...
[2023-07-16 21:10:40,584][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000018168_9302016.pth
[2023-07-16 21:10:41,941][233458] Updated weights for policy 0, policy_version 18880 (0.0005)
[2023-07-16 21:10:45,446][233458] Updated weights for policy 0, policy_version 18960 (0.0004)
[2023-07-16 21:10:45,579][233173] Fps is (10 sec: 11468.8, 60 sec: 11468.8, 300 sec: 11441.0). Total num frames: 9707520. Throughput: 0: 11633.2. Samples: 9702340. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0)
[2023-07-16 21:10:45,579][233173] Avg episode reward: [(0, '237.297')]
[2023-07-16 21:10:48,975][233458] Updated weights for policy 0, policy_version 19040 (0.0005)
[2023-07-16 21:10:50,579][233173] Fps is (10 sec: 11468.9, 60 sec: 11537.1, 300 sec: 11441.0). Total num frames: 9764864. Throughput: 0: 11661.8. Samples: 9736748. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0)
[2023-07-16 21:10:50,579][233173] Avg episode reward: [(0, '253.225')]
[2023-07-16 21:10:52,508][233458] Updated weights for policy 0, policy_version 19120 (0.0005)
[2023-07-16 21:10:55,579][233173] Fps is (10 sec: 11468.7, 60 sec: 11537.1, 300 sec: 11427.1). Total num frames: 9822208. Throughput: 0: 11625.6. Samples: 9806032. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0)
[2023-07-16 21:10:55,579][233173] Avg episode reward: [(0, '270.107')]
[2023-07-16 21:10:55,583][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000019184_9822208.pth...
[2023-07-16 21:10:55,585][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000018504_9474048.pth
[2023-07-16 21:10:55,586][233414] Saving new best policy, reward=270.107!
[2023-07-16 21:10:56,003][233458] Updated weights for policy 0, policy_version 19200 (0.0004)
[2023-07-16 21:10:59,464][233458] Updated weights for policy 0, policy_version 19280 (0.0004)
[2023-07-16 21:11:00,579][233173] Fps is (10 sec: 11878.4, 60 sec: 11673.6, 300 sec: 11441.0). Total num frames: 9883648. Throughput: 0: 11607.2. Samples: 9876740. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0)
[2023-07-16 21:11:00,579][233173] Avg episode reward: [(0, '263.989')]
[2023-07-16 21:11:03,054][233458] Updated weights for policy 0, policy_version 19360 (0.0005)
[2023-07-16 21:11:05,579][233173] Fps is (10 sec: 11878.5, 60 sec: 11673.6, 300 sec: 11441.0). Total num frames: 9940992. Throughput: 0: 11617.2. Samples: 9911568. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:11:05,579][233173] Avg episode reward: [(0, '253.133')]
[2023-07-16 21:11:06,526][233458] Updated weights for policy 0, policy_version 19440 (0.0004)
[2023-07-16 21:11:10,064][233458] Updated weights for policy 0, policy_version 19520 (0.0005)
[2023-07-16 21:11:10,579][233173] Fps is (10 sec: 11468.7, 60 sec: 11605.3, 300 sec: 11441.0). Total num frames: 9998336. Throughput: 0: 11636.4. Samples: 9981648. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-07-16 21:11:10,580][233173] Avg episode reward: [(0, '246.763')]
[2023-07-16 21:11:10,583][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000019528_9998336.pth...
[2023-07-16 21:11:10,586][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000018848_9650176.pth
[2023-07-16 21:11:11,106][233414] Early stopping after 2 epochs (8 sgd steps), loss delta 0.0000000
[2023-07-16 21:11:11,107][233460] Stopping RolloutWorker_w1...
[2023-07-16 21:11:11,107][233461] Stopping RolloutWorker_w2...
[2023-07-16 21:11:11,107][233463] Stopping RolloutWorker_w4...
[2023-07-16 21:11:11,107][233464] Stopping RolloutWorker_w5...
[2023-07-16 21:11:11,107][233460] Loop rollout_proc1_evt_loop terminating...
[2023-07-16 21:11:11,107][233497] Stopping RolloutWorker_w7...
[2023-07-16 21:11:11,107][233461] Loop rollout_proc2_evt_loop terminating...
[2023-07-16 21:11:11,107][233463] Loop rollout_proc4_evt_loop terminating...
[2023-07-16 21:11:11,107][233462] Stopping RolloutWorker_w3...
[2023-07-16 21:11:11,107][233464] Loop rollout_proc5_evt_loop terminating...
[2023-07-16 21:11:11,107][233496] Stopping RolloutWorker_w6...
[2023-07-16 21:11:11,107][233459] Stopping RolloutWorker_w0...
[2023-07-16 21:11:11,107][233497] Loop rollout_proc7_evt_loop terminating...
[2023-07-16 21:11:11,107][233462] Loop rollout_proc3_evt_loop terminating...
[2023-07-16 21:11:11,107][233496] Loop rollout_proc6_evt_loop terminating...
[2023-07-16 21:11:11,107][233459] Loop rollout_proc0_evt_loop terminating...
[2023-07-16 21:11:11,107][233173] Component RolloutWorker_w1 stopped!
[2023-07-16 21:11:11,107][233414] Stopping Batcher_0...
[2023-07-16 21:11:11,108][233173] Component RolloutWorker_w2 stopped!
[2023-07-16 21:11:11,108][233414] Loop batcher_evt_loop terminating...
[2023-07-16 21:11:11,108][233173] Component RolloutWorker_w4 stopped!
[2023-07-16 21:11:11,108][233173] Component RolloutWorker_w5 stopped!
[2023-07-16 21:11:11,108][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000019544_10006528.pth...
[2023-07-16 21:11:11,108][233173] Component RolloutWorker_w7 stopped!
[2023-07-16 21:11:11,109][233173] Component RolloutWorker_w3 stopped!
[2023-07-16 21:11:11,109][233173] Component RolloutWorker_w6 stopped!
[2023-07-16 21:11:11,109][233173] Component RolloutWorker_w0 stopped!
[2023-07-16 21:11:11,109][233173] Component Batcher_0 stopped!
[2023-07-16 21:11:11,111][233414] Removing /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000019184_9822208.pth
[2023-07-16 21:11:11,111][233414] Saving /home/qgallouedec/data/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000019544_10006528.pth...
[2023-07-16 21:11:11,114][233414] Stopping LearnerWorker_p0...
[2023-07-16 21:11:11,114][233414] Loop learner_proc0_evt_loop terminating...
[2023-07-16 21:11:11,114][233173] Component LearnerWorker_p0 stopped!
[2023-07-16 21:11:11,174][233458] Weights refcount: 2 0
[2023-07-16 21:11:11,174][233458] Stopping InferenceWorker_p0-w0...
[2023-07-16 21:11:11,175][233458] Loop inference_proc0-0_evt_loop terminating...
[2023-07-16 21:11:11,175][233173] Component InferenceWorker_p0-w0 stopped!
[2023-07-16 21:11:11,175][233173] Waiting for process learner_proc0 to stop...
[2023-07-16 21:11:11,709][233173] Waiting for process inference_proc0-0 to join...
[2023-07-16 21:11:11,718][233173] Waiting for process rollout_proc0 to join...
[2023-07-16 21:11:11,718][233173] Waiting for process rollout_proc1 to join...
[2023-07-16 21:11:11,719][233173] Waiting for process rollout_proc2 to join...
[2023-07-16 21:11:11,719][233173] Waiting for process rollout_proc3 to join...
[2023-07-16 21:11:11,719][233173] Waiting for process rollout_proc4 to join...
[2023-07-16 21:11:11,719][233173] Waiting for process rollout_proc5 to join...
[2023-07-16 21:11:11,719][233173] Waiting for process rollout_proc6 to join...
[2023-07-16 21:11:11,719][233173] Waiting for process rollout_proc7 to join...
[2023-07-16 21:11:11,720][233173] Batcher 0 profile tree view:
batching: 1.7244, releasing_batches: 1.5408
[2023-07-16 21:11:11,720][233173] InferenceWorker_p0-w0 profile tree view:
wait_policy: 0.0051
  wait_policy_total: 314.2288
update_model: 11.5858
  weight_update: 0.0005
one_step: 0.0005
  handle_policy_step: 509.6172
    deserialize: 21.4223, stack: 5.5738, obs_to_device_normalize: 91.8932, forward: 251.5496, send_messages: 36.9858
    prepare_outputs: 58.6589
      to_cpu: 8.9351
[2023-07-16 21:11:11,720][233173] Learner 0 profile tree view:
misc: 0.0094, prepare_batch: 8.8592
train: 92.3850
  epoch_init: 0.0347, minibatch_init: 1.2664, losses_postprocess: 1.2281, kl_divergence: 0.4286, after_optimizer: 0.5853
  calculate_losses: 39.3723
    losses_init: 0.0301, forward_head: 15.4418, bptt_initial: 0.1400, bptt: 0.1227, tail: 11.0726, advantages_returns: 0.8504, losses: 10.3300
  update: 47.9133
    clip: 5.6934
[2023-07-16 21:11:11,720][233173] RolloutWorker_w0 profile tree view:
wait_for_trajectories: 0.2795, enqueue_policy_requests: 12.6913, env_step: 644.2048, overhead: 19.6522, complete_rollouts: 0.3250
save_policy_outputs: 38.6580
  split_output_tensors: 13.1978
[2023-07-16 21:11:11,720][233173] RolloutWorker_w7 profile tree view:
wait_for_trajectories: 0.2947, enqueue_policy_requests: 13.1313, env_step: 643.4403, overhead: 19.6936, complete_rollouts: 0.3227
save_policy_outputs: 38.8751
  split_output_tensors: 13.3601
[2023-07-16 21:11:11,720][233173] Loop Runner_EvtLoop terminating...
[2023-07-16 21:11:11,721][233173] Runner profile tree view:
main_loop: 899.1011
[2023-07-16 21:11:11,721][233173] Collected {0: 10006528}, FPS: 11129.5