[2023-03-09 05:39:47,423][635489] Saving configuration to /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/config.json... [2023-03-09 05:39:47,439][635489] Rollout worker 0 uses device cpu [2023-03-09 05:39:47,440][635489] Rollout worker 1 uses device cpu [2023-03-09 05:39:47,440][635489] Rollout worker 2 uses device cpu [2023-03-09 05:39:47,440][635489] Rollout worker 3 uses device cpu [2023-03-09 05:39:47,440][635489] Rollout worker 4 uses device cpu [2023-03-09 05:39:47,440][635489] Rollout worker 5 uses device cpu [2023-03-09 05:39:47,440][635489] Rollout worker 6 uses device cpu [2023-03-09 05:39:47,440][635489] Rollout worker 7 uses device cpu [2023-03-09 05:39:47,441][635489] In synchronous mode, we only accumulate one batch. Setting num_batches_to_accumulate to 1 [2023-03-09 05:39:47,453][635489] InferenceWorker_p0-w0: min num requests: 2 [2023-03-09 05:39:47,472][635489] Starting all processes... [2023-03-09 05:39:47,472][635489] Starting process learner_proc0 [2023-03-09 05:39:47,522][635489] Starting all processes... [2023-03-09 05:39:47,535][635489] Starting process inference_proc0-0 [2023-03-09 05:39:47,545][635489] Starting process rollout_proc0 [2023-03-09 05:39:47,550][635489] Starting process rollout_proc2 [2023-03-09 05:39:47,550][635489] Starting process rollout_proc3 [2023-03-09 05:39:47,550][635489] Starting process rollout_proc4 [2023-03-09 05:39:47,550][635489] Starting process rollout_proc5 [2023-03-09 05:39:47,550][635489] Starting process rollout_proc6 [2023-03-09 05:39:47,550][635489] Starting process rollout_proc7 [2023-03-09 05:39:47,550][635489] Starting process rollout_proc1 [2023-03-09 05:39:49,120][635777] Worker 2 uses CPU cores [8, 9, 10, 11] [2023-03-09 05:39:49,130][635732] Starting seed is not provided [2023-03-09 05:39:49,131][635732] Initializing actor-critic model on device cpu [2023-03-09 05:39:49,131][635732] RunningMeanStd input shape: (39,) [2023-03-09 05:39:49,131][635732] RunningMeanStd input shape: (1,) [2023-03-09 05:39:49,190][635732] Created Actor Critic model with architecture: [2023-03-09 05:39:49,190][635732] ActorCriticSharedWeights( (obs_normalizer): ObservationNormalizer( (running_mean_std): RunningMeanStdDictInPlace( (running_mean_std): ModuleDict( (obs): RunningMeanStdInPlace() ) ) ) (returns_normalizer): RecursiveScriptModule(original_name=RunningMeanStdInPlace) (encoder): MultiInputEncoder( (encoders): ModuleDict( (obs): MlpEncoder( (mlp_head): RecursiveScriptModule( original_name=Sequential (0): RecursiveScriptModule(original_name=Linear) (1): RecursiveScriptModule(original_name=Tanh) (2): RecursiveScriptModule(original_name=Linear) (3): RecursiveScriptModule(original_name=Tanh) ) ) ) ) (core): ModelCoreIdentity() (decoder): MlpDecoder( (mlp): Identity() ) (critic_linear): Linear(in_features=64, out_features=1, bias=True) (action_parameterization): ActionParameterizationContinuousNonAdaptiveStddev( (distribution_linear): Linear(in_features=64, out_features=4, bias=True) ) ) [2023-03-09 05:39:49,249][635813] Worker 7 uses CPU cores [28, 29, 30, 31] [2023-03-09 05:39:49,341][635846] Worker 1 uses CPU cores [4, 5, 6, 7] [2023-03-09 05:39:49,384][635779] Worker 0 uses CPU cores [0, 1, 2, 3] [2023-03-09 05:39:49,526][635732] Using optimizer [2023-03-09 05:39:49,527][635732] No checkpoints found [2023-03-09 05:39:49,527][635732] Did not load from checkpoint, starting from scratch! [2023-03-09 05:39:49,528][635732] Initialized policy 0 weights for model version 0 [2023-03-09 05:39:49,529][635732] LearnerWorker_p0 finished initialization! [2023-03-09 05:39:49,539][635778] Worker 3 uses CPU cores [12, 13, 14, 15] [2023-03-09 05:39:49,682][635814] Worker 6 uses CPU cores [24, 25, 26, 27] [2023-03-09 05:39:49,715][635811] Worker 4 uses CPU cores [16, 17, 18, 19] [2023-03-09 05:39:49,788][635776] RunningMeanStd input shape: (39,) [2023-03-09 05:39:49,789][635776] RunningMeanStd input shape: (1,) [2023-03-09 05:39:49,846][635489] Inference worker 0-0 is ready! [2023-03-09 05:39:49,846][635489] All inference workers are ready! Signal rollout workers to start! [2023-03-09 05:39:49,858][635812] Worker 5 uses CPU cores [20, 21, 22, 23] [2023-03-09 05:39:50,122][635489] Fps is (10 sec: nan, 60 sec: nan, 300 sec: nan). Total num frames: 0. Throughput: 0: nan. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0) [2023-03-09 05:39:53,690][635779] Decorrelating experience for 0 frames... [2023-03-09 05:39:53,693][635811] Decorrelating experience for 0 frames... [2023-03-09 05:39:53,703][635779] Decorrelating experience for 64 frames... [2023-03-09 05:39:53,706][635811] Decorrelating experience for 64 frames... [2023-03-09 05:39:53,712][635814] Decorrelating experience for 0 frames... [2023-03-09 05:39:53,725][635814] Decorrelating experience for 64 frames... [2023-03-09 05:39:53,735][635777] Decorrelating experience for 0 frames... [2023-03-09 05:39:53,739][635779] Decorrelating experience for 128 frames... [2023-03-09 05:39:53,742][635811] Decorrelating experience for 128 frames... [2023-03-09 05:39:53,745][635812] Decorrelating experience for 0 frames... [2023-03-09 05:39:53,746][635846] Decorrelating experience for 0 frames... [2023-03-09 05:39:53,748][635777] Decorrelating experience for 64 frames... [2023-03-09 05:39:53,756][635778] Decorrelating experience for 0 frames... [2023-03-09 05:39:53,758][635812] Decorrelating experience for 64 frames... [2023-03-09 05:39:53,759][635846] Decorrelating experience for 64 frames... [2023-03-09 05:39:53,761][635814] Decorrelating experience for 128 frames... [2023-03-09 05:39:53,769][635778] Decorrelating experience for 64 frames... [2023-03-09 05:39:53,783][635777] Decorrelating experience for 128 frames... [2023-03-09 05:39:53,794][635812] Decorrelating experience for 128 frames... [2023-03-09 05:39:53,796][635846] Decorrelating experience for 128 frames... [2023-03-09 05:39:53,797][635779] Decorrelating experience for 192 frames... [2023-03-09 05:39:53,800][635811] Decorrelating experience for 192 frames... [2023-03-09 05:39:53,805][635778] Decorrelating experience for 128 frames... [2023-03-09 05:39:53,820][635814] Decorrelating experience for 192 frames... [2023-03-09 05:39:53,843][635777] Decorrelating experience for 192 frames... [2023-03-09 05:39:53,854][635812] Decorrelating experience for 192 frames... [2023-03-09 05:39:53,855][635846] Decorrelating experience for 192 frames... [2023-03-09 05:39:53,864][635778] Decorrelating experience for 192 frames... [2023-03-09 05:39:53,940][635813] Decorrelating experience for 0 frames... [2023-03-09 05:39:53,953][635813] Decorrelating experience for 64 frames... [2023-03-09 05:39:53,989][635813] Decorrelating experience for 128 frames... [2023-03-09 05:39:54,048][635813] Decorrelating experience for 192 frames... [2023-03-09 05:39:55,122][635489] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0) [2023-03-09 05:39:57,635][635811] Decorrelating experience for 256 frames... [2023-03-09 05:39:57,644][635779] Decorrelating experience for 256 frames... [2023-03-09 05:39:57,711][635814] Decorrelating experience for 256 frames... [2023-03-09 05:39:57,712][635778] Decorrelating experience for 256 frames... [2023-03-09 05:39:57,721][635846] Decorrelating experience for 256 frames... [2023-03-09 05:39:57,722][635812] Decorrelating experience for 256 frames... [2023-03-09 05:39:57,739][635811] Decorrelating experience for 320 frames... [2023-03-09 05:39:57,760][635779] Decorrelating experience for 320 frames... [2023-03-09 05:39:57,761][635777] Decorrelating experience for 256 frames... [2023-03-09 05:39:57,816][635814] Decorrelating experience for 320 frames... [2023-03-09 05:39:57,817][635778] Decorrelating experience for 320 frames... [2023-03-09 05:39:57,827][635812] Decorrelating experience for 320 frames... [2023-03-09 05:39:57,833][635846] Decorrelating experience for 320 frames... [2023-03-09 05:39:57,851][635813] Decorrelating experience for 256 frames... [2023-03-09 05:39:57,865][635777] Decorrelating experience for 320 frames... [2023-03-09 05:39:57,866][635811] Decorrelating experience for 384 frames... [2023-03-09 05:39:57,897][635779] Decorrelating experience for 384 frames... [2023-03-09 05:39:57,945][635814] Decorrelating experience for 384 frames... [2023-03-09 05:39:57,946][635778] Decorrelating experience for 384 frames... [2023-03-09 05:39:57,955][635812] Decorrelating experience for 384 frames... [2023-03-09 05:39:57,956][635813] Decorrelating experience for 320 frames... [2023-03-09 05:39:57,961][635846] Decorrelating experience for 384 frames... [2023-03-09 05:39:57,991][635777] Decorrelating experience for 384 frames... [2023-03-09 05:39:58,018][635811] Decorrelating experience for 448 frames... [2023-03-09 05:39:58,061][635779] Decorrelating experience for 448 frames... [2023-03-09 05:39:58,082][635813] Decorrelating experience for 384 frames... [2023-03-09 05:39:58,095][635814] Decorrelating experience for 448 frames... [2023-03-09 05:39:58,097][635778] Decorrelating experience for 448 frames... [2023-03-09 05:39:58,108][635812] Decorrelating experience for 448 frames... [2023-03-09 05:39:58,110][635846] Decorrelating experience for 448 frames... [2023-03-09 05:39:58,140][635777] Decorrelating experience for 448 frames... [2023-03-09 05:39:58,230][635813] Decorrelating experience for 448 frames... [2023-03-09 05:40:00,122][635489] Fps is (10 sec: 1228.8, 60 sec: 1228.8, 300 sec: 1228.8). Total num frames: 12288. Throughput: 0: 409.6. Samples: 4096. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 05:40:00,122][635489] Avg episode reward: [(0, '39.799')] [2023-03-09 05:40:00,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000000024_12288.pth... [2023-03-09 05:40:02,885][635776] Updated weights for policy 0, policy_version 80 (0.0006) [2023-03-09 05:40:05,121][635489] Fps is (10 sec: 5734.4, 60 sec: 3823.0, 300 sec: 3823.0). Total num frames: 57344. Throughput: 0: 3993.4. Samples: 59900. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 05:40:05,122][635489] Avg episode reward: [(0, '173.478')] [2023-03-09 05:40:07,418][635776] Updated weights for policy 0, policy_version 160 (0.0005) [2023-03-09 05:40:07,448][635489] Heartbeat connected on Batcher_0 [2023-03-09 05:40:07,450][635489] Heartbeat connected on LearnerWorker_p0 [2023-03-09 05:40:07,454][635489] Heartbeat connected on InferenceWorker_p0-w0 [2023-03-09 05:40:07,459][635489] Heartbeat connected on RolloutWorker_w0 [2023-03-09 05:40:07,462][635489] Heartbeat connected on RolloutWorker_w1 [2023-03-09 05:40:07,464][635489] Heartbeat connected on RolloutWorker_w2 [2023-03-09 05:40:07,466][635489] Heartbeat connected on RolloutWorker_w3 [2023-03-09 05:40:07,468][635489] Heartbeat connected on RolloutWorker_w4 [2023-03-09 05:40:07,472][635489] Heartbeat connected on RolloutWorker_w6 [2023-03-09 05:40:07,472][635489] Heartbeat connected on RolloutWorker_w5 [2023-03-09 05:40:07,476][635489] Heartbeat connected on RolloutWorker_w7 [2023-03-09 05:40:10,122][635489] Fps is (10 sec: 9420.9, 60 sec: 5324.8, 300 sec: 5324.8). Total num frames: 106496. Throughput: 0: 4304.8. Samples: 86096. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:40:10,122][635489] Avg episode reward: [(0, '154.434')] [2023-03-09 05:40:10,123][635732] Saving new best policy, reward=154.434! [2023-03-09 05:40:11,650][635776] Updated weights for policy 0, policy_version 240 (0.0005) [2023-03-09 05:40:15,122][635489] Fps is (10 sec: 10239.9, 60 sec: 6389.8, 300 sec: 6389.8). Total num frames: 159744. Throughput: 0: 5870.2. Samples: 146756. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 05:40:15,122][635489] Avg episode reward: [(0, '217.692')] [2023-03-09 05:40:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000000312_159744.pth... [2023-03-09 05:40:15,128][635732] Saving new best policy, reward=217.692! [2023-03-09 05:40:15,542][635776] Updated weights for policy 0, policy_version 320 (0.0005) [2023-03-09 05:40:19,416][635776] Updated weights for policy 0, policy_version 400 (0.0005) [2023-03-09 05:40:20,121][635489] Fps is (10 sec: 10240.1, 60 sec: 6963.2, 300 sec: 6963.2). Total num frames: 208896. Throughput: 0: 6983.8. Samples: 209512. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:40:20,122][635489] Avg episode reward: [(0, '420.621')] [2023-03-09 05:40:20,122][635732] Saving new best policy, reward=420.621! [2023-03-09 05:40:23,122][635776] Updated weights for policy 0, policy_version 480 (0.0005) [2023-03-09 05:40:25,122][635489] Fps is (10 sec: 10649.6, 60 sec: 7606.9, 300 sec: 7606.9). Total num frames: 266240. Throughput: 0: 6952.0. Samples: 243320. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 05:40:25,122][635489] Avg episode reward: [(0, '559.733')] [2023-03-09 05:40:25,123][635732] Saving new best policy, reward=559.733! [2023-03-09 05:40:26,969][635776] Updated weights for policy 0, policy_version 560 (0.0005) [2023-03-09 05:40:30,121][635489] Fps is (10 sec: 11059.2, 60 sec: 7987.2, 300 sec: 7987.2). Total num frames: 319488. Throughput: 0: 7686.3. Samples: 307452. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 05:40:30,122][635489] Avg episode reward: [(0, '585.418')] [2023-03-09 05:40:30,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000000624_319488.pth... [2023-03-09 05:40:30,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000000024_12288.pth [2023-03-09 05:40:30,126][635732] Saving new best policy, reward=585.418! [2023-03-09 05:40:30,723][635776] Updated weights for policy 0, policy_version 640 (0.0005) [2023-03-09 05:40:34,896][635776] Updated weights for policy 0, policy_version 720 (0.0005) [2023-03-09 05:40:35,122][635489] Fps is (10 sec: 10240.0, 60 sec: 8192.0, 300 sec: 8192.0). Total num frames: 368640. Throughput: 0: 8192.2. Samples: 368648. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 05:40:35,122][635489] Avg episode reward: [(0, '568.676')] [2023-03-09 05:40:38,687][635776] Updated weights for policy 0, policy_version 800 (0.0005) [2023-03-09 05:40:40,122][635489] Fps is (10 sec: 10240.0, 60 sec: 8437.8, 300 sec: 8437.8). Total num frames: 421888. Throughput: 0: 8918.4. Samples: 401328. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:40:40,122][635489] Avg episode reward: [(0, '556.000')] [2023-03-09 05:40:42,283][635776] Updated weights for policy 0, policy_version 880 (0.0005) [2023-03-09 05:40:45,121][635489] Fps is (10 sec: 11059.3, 60 sec: 8713.3, 300 sec: 8713.3). Total num frames: 479232. Throughput: 0: 10350.1. Samples: 469848. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:40:45,122][635489] Avg episode reward: [(0, '565.682')] [2023-03-09 05:40:45,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000000936_479232.pth... [2023-03-09 05:40:45,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000000312_159744.pth [2023-03-09 05:40:45,978][635776] Updated weights for policy 0, policy_version 960 (0.0005) [2023-03-09 05:40:49,511][635776] Updated weights for policy 0, policy_version 1040 (0.0005) [2023-03-09 05:40:50,122][635489] Fps is (10 sec: 11468.8, 60 sec: 8942.9, 300 sec: 8942.9). Total num frames: 536576. Throughput: 0: 10592.3. Samples: 536552. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 05:40:50,122][635489] Avg episode reward: [(0, '585.047')] [2023-03-09 05:40:53,264][635776] Updated weights for policy 0, policy_version 1120 (0.0005) [2023-03-09 05:40:55,122][635489] Fps is (10 sec: 11059.1, 60 sec: 9830.4, 300 sec: 9074.2). Total num frames: 589824. Throughput: 0: 10740.3. Samples: 569408. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:40:55,122][635489] Avg episode reward: [(0, '589.083')] [2023-03-09 05:40:55,151][635732] Saving new best policy, reward=589.083! [2023-03-09 05:40:57,094][635776] Updated weights for policy 0, policy_version 1200 (0.0005) [2023-03-09 05:41:00,122][635489] Fps is (10 sec: 11059.2, 60 sec: 10581.3, 300 sec: 9245.3). Total num frames: 647168. Throughput: 0: 10847.5. Samples: 634892. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 05:41:00,122][635489] Avg episode reward: [(0, '577.774')] [2023-03-09 05:41:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000001264_647168.pth... [2023-03-09 05:41:00,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000000624_319488.pth [2023-03-09 05:41:00,777][635776] Updated weights for policy 0, policy_version 1280 (0.0005) [2023-03-09 05:41:04,576][635776] Updated weights for policy 0, policy_version 1360 (0.0005) [2023-03-09 05:41:05,121][635489] Fps is (10 sec: 11059.3, 60 sec: 10717.9, 300 sec: 9338.9). Total num frames: 700416. Throughput: 0: 10910.6. Samples: 700488. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 05:41:05,122][635489] Avg episode reward: [(0, '589.833')] [2023-03-09 05:41:05,122][635732] Saving new best policy, reward=589.833! [2023-03-09 05:41:08,336][635776] Updated weights for policy 0, policy_version 1440 (0.0005) [2023-03-09 05:41:10,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10786.1, 300 sec: 9420.8). Total num frames: 753664. Throughput: 0: 10883.6. Samples: 733084. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 05:41:10,122][635489] Avg episode reward: [(0, '595.634')] [2023-03-09 05:41:10,138][635732] Saving new best policy, reward=595.634! [2023-03-09 05:41:12,081][635776] Updated weights for policy 0, policy_version 1520 (0.0005) [2023-03-09 05:41:15,121][635489] Fps is (10 sec: 11059.1, 60 sec: 10854.4, 300 sec: 9541.3). Total num frames: 811008. Throughput: 0: 10932.6. Samples: 799420. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:41:15,122][635489] Avg episode reward: [(0, '594.064')] [2023-03-09 05:41:15,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000001584_811008.pth... [2023-03-09 05:41:15,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000000936_479232.pth [2023-03-09 05:41:15,709][635776] Updated weights for policy 0, policy_version 1600 (0.0005) [2023-03-09 05:41:19,344][635776] Updated weights for policy 0, policy_version 1680 (0.0005) [2023-03-09 05:41:20,121][635489] Fps is (10 sec: 11059.3, 60 sec: 10922.7, 300 sec: 9602.9). Total num frames: 864256. Throughput: 0: 11048.0. Samples: 865808. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 05:41:20,122][635489] Avg episode reward: [(0, '602.871')] [2023-03-09 05:41:20,153][635732] Saving new best policy, reward=602.871! [2023-03-09 05:41:23,019][635776] Updated weights for policy 0, policy_version 1760 (0.0005) [2023-03-09 05:41:25,122][635489] Fps is (10 sec: 11059.2, 60 sec: 10922.7, 300 sec: 9701.1). Total num frames: 921600. Throughput: 0: 11061.9. Samples: 899112. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:41:25,122][635489] Avg episode reward: [(0, '608.839')] [2023-03-09 05:41:25,123][635732] Saving new best policy, reward=608.839! [2023-03-09 05:41:26,711][635776] Updated weights for policy 0, policy_version 1840 (0.0005) [2023-03-09 05:41:30,122][635489] Fps is (10 sec: 11468.7, 60 sec: 10990.9, 300 sec: 9789.4). Total num frames: 978944. Throughput: 0: 11041.8. Samples: 966728. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 05:41:30,122][635489] Avg episode reward: [(0, '604.126')] [2023-03-09 05:41:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000001912_978944.pth... [2023-03-09 05:41:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000001264_647168.pth [2023-03-09 05:41:30,274][635776] Updated weights for policy 0, policy_version 1920 (0.0006) [2023-03-09 05:41:33,915][635776] Updated weights for policy 0, policy_version 2000 (0.0005) [2023-03-09 05:41:35,121][635489] Fps is (10 sec: 11059.3, 60 sec: 11059.2, 300 sec: 9830.4). Total num frames: 1032192. Throughput: 0: 11030.3. Samples: 1032916. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:41:35,122][635489] Avg episode reward: [(0, '610.388')] [2023-03-09 05:41:35,122][635732] Saving new best policy, reward=610.388! [2023-03-09 05:41:37,777][635776] Updated weights for policy 0, policy_version 2080 (0.0005) [2023-03-09 05:41:40,122][635489] Fps is (10 sec: 11059.2, 60 sec: 11127.5, 300 sec: 9904.9). Total num frames: 1089536. Throughput: 0: 11039.5. Samples: 1066184. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 05:41:40,122][635489] Avg episode reward: [(0, '607.747')] [2023-03-09 05:41:41,343][635776] Updated weights for policy 0, policy_version 2160 (0.0005) [2023-03-09 05:41:45,121][635489] Fps is (10 sec: 11059.2, 60 sec: 11059.2, 300 sec: 9937.3). Total num frames: 1142784. Throughput: 0: 11104.7. Samples: 1134604. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 05:41:45,122][635489] Avg episode reward: [(0, '611.159')] [2023-03-09 05:41:45,156][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000002240_1146880.pth... [2023-03-09 05:41:45,156][635776] Updated weights for policy 0, policy_version 2240 (0.0005) [2023-03-09 05:41:45,157][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000001584_811008.pth [2023-03-09 05:41:45,158][635732] Saving new best policy, reward=611.159! [2023-03-09 05:41:48,910][635776] Updated weights for policy 0, policy_version 2320 (0.0005) [2023-03-09 05:41:50,122][635489] Fps is (10 sec: 11059.2, 60 sec: 11059.2, 300 sec: 10001.1). Total num frames: 1200128. Throughput: 0: 11073.5. Samples: 1198796. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:41:50,122][635489] Avg episode reward: [(0, '611.409')] [2023-03-09 05:41:50,123][635732] Saving new best policy, reward=611.409! [2023-03-09 05:41:52,427][635776] Updated weights for policy 0, policy_version 2400 (0.0005) [2023-03-09 05:41:55,122][635489] Fps is (10 sec: 11468.7, 60 sec: 11127.5, 300 sec: 10059.8). Total num frames: 1257472. Throughput: 0: 11149.8. Samples: 1234824. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:41:55,122][635489] Avg episode reward: [(0, '606.466')] [2023-03-09 05:41:56,101][635776] Updated weights for policy 0, policy_version 2480 (0.0005) [2023-03-09 05:41:59,779][635776] Updated weights for policy 0, policy_version 2560 (0.0005) [2023-03-09 05:42:00,122][635489] Fps is (10 sec: 11059.2, 60 sec: 11059.2, 300 sec: 10082.5). Total num frames: 1310720. Throughput: 0: 11159.5. Samples: 1301600. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:42:00,122][635489] Avg episode reward: [(0, '604.502')] [2023-03-09 05:42:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000002560_1310720.pth... [2023-03-09 05:42:00,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000001912_978944.pth [2023-03-09 05:42:03,636][635776] Updated weights for policy 0, policy_version 2640 (0.0005) [2023-03-09 05:42:05,122][635489] Fps is (10 sec: 11059.2, 60 sec: 11127.4, 300 sec: 10133.8). Total num frames: 1368064. Throughput: 0: 11092.0. Samples: 1364948. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 05:42:05,133][635489] Avg episode reward: [(0, '609.806')] [2023-03-09 05:42:07,381][635776] Updated weights for policy 0, policy_version 2720 (0.0005) [2023-03-09 05:42:10,122][635489] Fps is (10 sec: 11059.2, 60 sec: 11127.5, 300 sec: 10152.2). Total num frames: 1421312. Throughput: 0: 11115.6. Samples: 1399316. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 05:42:10,122][635489] Avg episode reward: [(0, '604.473')] [2023-03-09 05:42:11,040][635776] Updated weights for policy 0, policy_version 2800 (0.0004) [2023-03-09 05:42:14,842][635776] Updated weights for policy 0, policy_version 2880 (0.0004) [2023-03-09 05:42:15,122][635489] Fps is (10 sec: 10649.6, 60 sec: 11059.2, 300 sec: 10169.4). Total num frames: 1474560. Throughput: 0: 11097.1. Samples: 1466100. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:42:15,132][635489] Avg episode reward: [(0, '607.545')] [2023-03-09 05:42:15,135][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000002880_1474560.pth... [2023-03-09 05:42:15,137][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000002240_1146880.pth [2023-03-09 05:42:18,668][635776] Updated weights for policy 0, policy_version 2960 (0.0006) [2023-03-09 05:42:20,121][635489] Fps is (10 sec: 10649.7, 60 sec: 11059.2, 300 sec: 10185.4). Total num frames: 1527808. Throughput: 0: 11020.7. Samples: 1528848. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 05:42:20,132][635489] Avg episode reward: [(0, '608.943')] [2023-03-09 05:42:22,457][635776] Updated weights for policy 0, policy_version 3040 (0.0005) [2023-03-09 05:42:25,122][635489] Fps is (10 sec: 11059.2, 60 sec: 11059.2, 300 sec: 10226.8). Total num frames: 1585152. Throughput: 0: 11028.1. Samples: 1562448. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:42:25,133][635489] Avg episode reward: [(0, '608.949')] [2023-03-09 05:42:26,113][635776] Updated weights for policy 0, policy_version 3120 (0.0005) [2023-03-09 05:42:29,901][635776] Updated weights for policy 0, policy_version 3200 (0.0005) [2023-03-09 05:42:30,122][635489] Fps is (10 sec: 11059.0, 60 sec: 10990.9, 300 sec: 10240.0). Total num frames: 1638400. Throughput: 0: 10922.5. Samples: 1626120. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:42:30,133][635489] Avg episode reward: [(0, '602.911')] [2023-03-09 05:42:30,135][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000003200_1638400.pth... [2023-03-09 05:42:30,137][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000002560_1310720.pth [2023-03-09 05:42:33,613][635776] Updated weights for policy 0, policy_version 3280 (0.0005) [2023-03-09 05:42:35,121][635489] Fps is (10 sec: 10649.7, 60 sec: 10990.9, 300 sec: 10252.4). Total num frames: 1691648. Throughput: 0: 10959.0. Samples: 1691952. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:42:35,122][635489] Avg episode reward: [(0, '600.073')] [2023-03-09 05:42:37,146][635776] Updated weights for policy 0, policy_version 3360 (0.0006) [2023-03-09 05:42:40,121][635489] Fps is (10 sec: 11469.0, 60 sec: 11059.2, 300 sec: 10312.3). Total num frames: 1753088. Throughput: 0: 10973.3. Samples: 1728620. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 05:42:40,122][635489] Avg episode reward: [(0, '613.444')] [2023-03-09 05:42:40,122][635732] Saving new best policy, reward=613.444! [2023-03-09 05:42:40,924][635776] Updated weights for policy 0, policy_version 3440 (0.0005) [2023-03-09 05:42:44,719][635776] Updated weights for policy 0, policy_version 3520 (0.0005) [2023-03-09 05:42:45,122][635489] Fps is (10 sec: 11468.6, 60 sec: 11059.2, 300 sec: 10321.9). Total num frames: 1806336. Throughput: 0: 10938.1. Samples: 1793816. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:42:45,122][635489] Avg episode reward: [(0, '613.208')] [2023-03-09 05:42:45,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000003528_1806336.pth... [2023-03-09 05:42:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000002880_1474560.pth [2023-03-09 05:42:48,556][635776] Updated weights for policy 0, policy_version 3600 (0.0005) [2023-03-09 05:42:50,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10990.9, 300 sec: 10331.0). Total num frames: 1859584. Throughput: 0: 10967.9. Samples: 1858504. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 05:42:50,122][635489] Avg episode reward: [(0, '607.904')] [2023-03-09 05:42:52,206][635776] Updated weights for policy 0, policy_version 3680 (0.0005) [2023-03-09 05:42:55,121][635489] Fps is (10 sec: 10649.8, 60 sec: 10922.7, 300 sec: 10339.6). Total num frames: 1912832. Throughput: 0: 10950.9. Samples: 1892104. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:42:55,122][635489] Avg episode reward: [(0, '609.909')] [2023-03-09 05:42:55,993][635776] Updated weights for policy 0, policy_version 3760 (0.0005) [2023-03-09 05:42:59,613][635776] Updated weights for policy 0, policy_version 3840 (0.0005) [2023-03-09 05:43:00,122][635489] Fps is (10 sec: 11059.1, 60 sec: 10990.9, 300 sec: 10369.3). Total num frames: 1970176. Throughput: 0: 10930.0. Samples: 1957952. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 05:43:00,122][635489] Avg episode reward: [(0, '611.894')] [2023-03-09 05:43:00,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000003848_1970176.pth... [2023-03-09 05:43:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000003200_1638400.pth [2023-03-09 05:43:03,268][635776] Updated weights for policy 0, policy_version 3920 (0.0004) [2023-03-09 05:43:05,121][635489] Fps is (10 sec: 11059.2, 60 sec: 10922.7, 300 sec: 10376.5). Total num frames: 2023424. Throughput: 0: 11028.2. Samples: 2025116. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:43:05,122][635489] Avg episode reward: [(0, '609.679')] [2023-03-09 05:43:07,032][635776] Updated weights for policy 0, policy_version 4000 (0.0005) [2023-03-09 05:43:10,122][635489] Fps is (10 sec: 10649.7, 60 sec: 10922.7, 300 sec: 10383.4). Total num frames: 2076672. Throughput: 0: 10973.7. Samples: 2056264. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:43:10,122][635489] Avg episode reward: [(0, '603.730')] [2023-03-09 05:43:10,924][635776] Updated weights for policy 0, policy_version 4080 (0.0005) [2023-03-09 05:43:14,525][635776] Updated weights for policy 0, policy_version 4160 (0.0004) [2023-03-09 05:43:15,122][635489] Fps is (10 sec: 11059.0, 60 sec: 10990.9, 300 sec: 10409.8). Total num frames: 2134016. Throughput: 0: 11035.9. Samples: 2122736. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:43:15,122][635489] Avg episode reward: [(0, '603.228')] [2023-03-09 05:43:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000004168_2134016.pth... [2023-03-09 05:43:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000003528_1806336.pth [2023-03-09 05:43:18,322][635776] Updated weights for policy 0, policy_version 4240 (0.0005) [2023-03-09 05:43:20,122][635489] Fps is (10 sec: 11468.8, 60 sec: 11059.2, 300 sec: 10435.0). Total num frames: 2191360. Throughput: 0: 11034.8. Samples: 2188520. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 05:43:20,122][635489] Avg episode reward: [(0, '599.396')] [2023-03-09 05:43:22,003][635776] Updated weights for policy 0, policy_version 4320 (0.0005) [2023-03-09 05:43:25,121][635489] Fps is (10 sec: 11059.3, 60 sec: 10990.9, 300 sec: 10440.0). Total num frames: 2244608. Throughput: 0: 10959.4. Samples: 2221792. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:43:25,122][635489] Avg episode reward: [(0, '607.713')] [2023-03-09 05:43:25,586][635776] Updated weights for policy 0, policy_version 4400 (0.0005) [2023-03-09 05:43:29,361][635776] Updated weights for policy 0, policy_version 4480 (0.0005) [2023-03-09 05:43:30,122][635489] Fps is (10 sec: 11059.1, 60 sec: 11059.2, 300 sec: 10463.4). Total num frames: 2301952. Throughput: 0: 11018.2. Samples: 2289636. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 05:43:30,122][635489] Avg episode reward: [(0, '608.545')] [2023-03-09 05:43:30,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000004496_2301952.pth... [2023-03-09 05:43:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000003848_1970176.pth [2023-03-09 05:43:33,106][635776] Updated weights for policy 0, policy_version 4560 (0.0005) [2023-03-09 05:43:35,122][635489] Fps is (10 sec: 11059.1, 60 sec: 11059.2, 300 sec: 10467.6). Total num frames: 2355200. Throughput: 0: 10984.6. Samples: 2352812. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:43:35,122][635489] Avg episode reward: [(0, '599.795')] [2023-03-09 05:43:37,057][635776] Updated weights for policy 0, policy_version 4640 (0.0004) [2023-03-09 05:43:40,121][635489] Fps is (10 sec: 10649.8, 60 sec: 10922.7, 300 sec: 10471.5). Total num frames: 2408448. Throughput: 0: 10946.5. Samples: 2384696. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:43:40,122][635489] Avg episode reward: [(0, '607.633')] [2023-03-09 05:43:40,621][635776] Updated weights for policy 0, policy_version 4720 (0.0004) [2023-03-09 05:43:44,359][635776] Updated weights for policy 0, policy_version 4800 (0.0005) [2023-03-09 05:43:45,122][635489] Fps is (10 sec: 11059.2, 60 sec: 10990.9, 300 sec: 10492.7). Total num frames: 2465792. Throughput: 0: 10992.4. Samples: 2452612. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 05:43:45,122][635489] Avg episode reward: [(0, '606.668')] [2023-03-09 05:43:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000004816_2465792.pth... [2023-03-09 05:43:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000004168_2134016.pth [2023-03-09 05:43:48,025][635776] Updated weights for policy 0, policy_version 4880 (0.0005) [2023-03-09 05:43:50,122][635489] Fps is (10 sec: 11059.1, 60 sec: 10990.9, 300 sec: 10496.0). Total num frames: 2519040. Throughput: 0: 10972.5. Samples: 2518880. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 05:43:50,122][635489] Avg episode reward: [(0, '608.611')] [2023-03-09 05:43:51,914][635776] Updated weights for policy 0, policy_version 4960 (0.0005) [2023-03-09 05:43:55,122][635489] Fps is (10 sec: 10649.7, 60 sec: 10990.9, 300 sec: 10499.1). Total num frames: 2572288. Throughput: 0: 10976.7. Samples: 2550216. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 05:43:55,122][635489] Avg episode reward: [(0, '604.155')] [2023-03-09 05:43:55,625][635776] Updated weights for policy 0, policy_version 5040 (0.0005) [2023-03-09 05:43:59,429][635776] Updated weights for policy 0, policy_version 5120 (0.0005) [2023-03-09 05:44:00,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10922.7, 300 sec: 10502.1). Total num frames: 2625536. Throughput: 0: 10960.1. Samples: 2615940. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 05:44:00,122][635489] Avg episode reward: [(0, '603.824')] [2023-03-09 05:44:00,164][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000005136_2629632.pth... [2023-03-09 05:44:00,166][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000004496_2301952.pth [2023-03-09 05:44:03,014][635776] Updated weights for policy 0, policy_version 5200 (0.0006) [2023-03-09 05:44:05,122][635489] Fps is (10 sec: 11468.8, 60 sec: 11059.2, 300 sec: 10537.2). Total num frames: 2686976. Throughput: 0: 11023.7. Samples: 2684588. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:44:05,122][635489] Avg episode reward: [(0, '601.675')] [2023-03-09 05:44:06,461][635776] Updated weights for policy 0, policy_version 5280 (0.0005) [2023-03-09 05:44:10,121][635489] Fps is (10 sec: 11468.9, 60 sec: 11059.2, 300 sec: 10539.3). Total num frames: 2740224. Throughput: 0: 11058.5. Samples: 2719424. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:44:10,122][635489] Avg episode reward: [(0, '607.360')] [2023-03-09 05:44:10,234][635776] Updated weights for policy 0, policy_version 5360 (0.0005) [2023-03-09 05:44:13,981][635776] Updated weights for policy 0, policy_version 5440 (0.0005) [2023-03-09 05:44:15,122][635489] Fps is (10 sec: 11059.1, 60 sec: 11059.2, 300 sec: 10556.9). Total num frames: 2797568. Throughput: 0: 11012.4. Samples: 2785196. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:44:15,122][635489] Avg episode reward: [(0, '598.919')] [2023-03-09 05:44:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000005464_2797568.pth... [2023-03-09 05:44:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000004816_2465792.pth [2023-03-09 05:44:17,667][635776] Updated weights for policy 0, policy_version 5520 (0.0006) [2023-03-09 05:44:20,122][635489] Fps is (10 sec: 11059.1, 60 sec: 10990.9, 300 sec: 10558.6). Total num frames: 2850816. Throughput: 0: 11037.3. Samples: 2849492. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:44:20,122][635489] Avg episode reward: [(0, '602.416')] [2023-03-09 05:44:21,597][635776] Updated weights for policy 0, policy_version 5600 (0.0005) [2023-03-09 05:44:25,108][635776] Updated weights for policy 0, policy_version 5680 (0.0005) [2023-03-09 05:44:25,122][635489] Fps is (10 sec: 11059.2, 60 sec: 11059.2, 300 sec: 10575.1). Total num frames: 2908160. Throughput: 0: 11053.7. Samples: 2882116. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 05:44:25,122][635489] Avg episode reward: [(0, '608.489')] [2023-03-09 05:44:28,830][635776] Updated weights for policy 0, policy_version 5760 (0.0005) [2023-03-09 05:44:30,122][635489] Fps is (10 sec: 11059.2, 60 sec: 10990.9, 300 sec: 10576.5). Total num frames: 2961408. Throughput: 0: 11040.1. Samples: 2949416. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 05:44:30,122][635489] Avg episode reward: [(0, '604.848')] [2023-03-09 05:44:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000005784_2961408.pth... [2023-03-09 05:44:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000005136_2629632.pth [2023-03-09 05:44:32,635][635776] Updated weights for policy 0, policy_version 5840 (0.0005) [2023-03-09 05:44:35,122][635489] Fps is (10 sec: 11059.2, 60 sec: 11059.2, 300 sec: 10592.1). Total num frames: 3018752. Throughput: 0: 11050.0. Samples: 3016128. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:44:35,122][635489] Avg episode reward: [(0, '604.438')] [2023-03-09 05:44:36,216][635776] Updated weights for policy 0, policy_version 5920 (0.0005) [2023-03-09 05:44:39,923][635776] Updated weights for policy 0, policy_version 6000 (0.0005) [2023-03-09 05:44:40,122][635489] Fps is (10 sec: 11059.2, 60 sec: 11059.2, 300 sec: 10593.1). Total num frames: 3072000. Throughput: 0: 11080.7. Samples: 3048848. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:44:40,122][635489] Avg episode reward: [(0, '603.936')] [2023-03-09 05:44:43,781][635776] Updated weights for policy 0, policy_version 6080 (0.0004) [2023-03-09 05:44:45,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10990.9, 300 sec: 10594.1). Total num frames: 3125248. Throughput: 0: 11071.7. Samples: 3114168. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:44:45,122][635489] Avg episode reward: [(0, '614.938')] [2023-03-09 05:44:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000006104_3125248.pth... [2023-03-09 05:44:45,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000005464_2797568.pth [2023-03-09 05:44:45,127][635732] Saving new best policy, reward=614.938! [2023-03-09 05:44:47,562][635776] Updated weights for policy 0, policy_version 6160 (0.0005) [2023-03-09 05:44:50,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10990.9, 300 sec: 10774.6). Total num frames: 3178496. Throughput: 0: 11005.4. Samples: 3179832. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:44:50,122][635489] Avg episode reward: [(0, '612.959')] [2023-03-09 05:44:51,262][635776] Updated weights for policy 0, policy_version 6240 (0.0005) [2023-03-09 05:44:54,885][635776] Updated weights for policy 0, policy_version 6320 (0.0005) [2023-03-09 05:44:55,122][635489] Fps is (10 sec: 11059.1, 60 sec: 11059.2, 300 sec: 10927.3). Total num frames: 3235840. Throughput: 0: 10963.4. Samples: 3212780. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:44:55,122][635489] Avg episode reward: [(0, '607.233')] [2023-03-09 05:44:58,639][635776] Updated weights for policy 0, policy_version 6400 (0.0004) [2023-03-09 05:45:00,122][635489] Fps is (10 sec: 11468.6, 60 sec: 11127.4, 300 sec: 10968.9). Total num frames: 3293184. Throughput: 0: 10993.2. Samples: 3279892. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:45:00,122][635489] Avg episode reward: [(0, '618.519')] [2023-03-09 05:45:00,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000006432_3293184.pth... [2023-03-09 05:45:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000005784_2961408.pth [2023-03-09 05:45:00,128][635732] Saving new best policy, reward=618.519! [2023-03-09 05:45:02,260][635776] Updated weights for policy 0, policy_version 6480 (0.0005) [2023-03-09 05:45:05,121][635489] Fps is (10 sec: 11059.3, 60 sec: 10990.9, 300 sec: 10982.8). Total num frames: 3346432. Throughput: 0: 11044.7. Samples: 3346504. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:45:05,122][635489] Avg episode reward: [(0, '612.291')] [2023-03-09 05:45:05,946][635776] Updated weights for policy 0, policy_version 6560 (0.0005) [2023-03-09 05:45:09,460][635776] Updated weights for policy 0, policy_version 6640 (0.0005) [2023-03-09 05:45:10,122][635489] Fps is (10 sec: 11059.3, 60 sec: 11059.2, 300 sec: 10996.7). Total num frames: 3403776. Throughput: 0: 11077.3. Samples: 3380596. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 05:45:10,122][635489] Avg episode reward: [(0, '612.618')] [2023-03-09 05:45:13,167][635776] Updated weights for policy 0, policy_version 6720 (0.0005) [2023-03-09 05:45:15,122][635489] Fps is (10 sec: 11468.7, 60 sec: 11059.2, 300 sec: 11024.5). Total num frames: 3461120. Throughput: 0: 11098.1. Samples: 3448832. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 05:45:15,122][635489] Avg episode reward: [(0, '609.944')] [2023-03-09 05:45:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000006760_3461120.pth... [2023-03-09 05:45:15,129][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000006104_3125248.pth [2023-03-09 05:45:16,925][635776] Updated weights for policy 0, policy_version 6800 (0.0005) [2023-03-09 05:45:20,122][635489] Fps is (10 sec: 11059.2, 60 sec: 11059.2, 300 sec: 11010.6). Total num frames: 3514368. Throughput: 0: 11074.6. Samples: 3514484. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:45:20,122][635489] Avg episode reward: [(0, '615.046')] [2023-03-09 05:45:20,623][635776] Updated weights for policy 0, policy_version 6880 (0.0005) [2023-03-09 05:45:24,449][635776] Updated weights for policy 0, policy_version 6960 (0.0005) [2023-03-09 05:45:25,121][635489] Fps is (10 sec: 10649.7, 60 sec: 10991.0, 300 sec: 11010.6). Total num frames: 3567616. Throughput: 0: 11073.3. Samples: 3547144. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:45:25,122][635489] Avg episode reward: [(0, '615.762')] [2023-03-09 05:45:28,246][635776] Updated weights for policy 0, policy_version 7040 (0.0005) [2023-03-09 05:45:30,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10990.9, 300 sec: 11024.5). Total num frames: 3620864. Throughput: 0: 11069.6. Samples: 3612300. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 05:45:30,122][635489] Avg episode reward: [(0, '613.545')] [2023-03-09 05:45:30,132][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000007080_3624960.pth... [2023-03-09 05:45:30,133][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000006432_3293184.pth [2023-03-09 05:45:32,057][635776] Updated weights for policy 0, policy_version 7120 (0.0005) [2023-03-09 05:45:35,122][635489] Fps is (10 sec: 11059.1, 60 sec: 10990.9, 300 sec: 11038.4). Total num frames: 3678208. Throughput: 0: 11057.0. Samples: 3677396. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 05:45:35,122][635489] Avg episode reward: [(0, '617.341')] [2023-03-09 05:45:35,655][635776] Updated weights for policy 0, policy_version 7200 (0.0004) [2023-03-09 05:45:39,286][635776] Updated weights for policy 0, policy_version 7280 (0.0005) [2023-03-09 05:45:40,122][635489] Fps is (10 sec: 11468.8, 60 sec: 11059.2, 300 sec: 11038.4). Total num frames: 3735552. Throughput: 0: 11088.2. Samples: 3711748. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:45:40,122][635489] Avg episode reward: [(0, '610.721')] [2023-03-09 05:45:43,208][635776] Updated weights for policy 0, policy_version 7360 (0.0005) [2023-03-09 05:45:45,122][635489] Fps is (10 sec: 11059.1, 60 sec: 11059.2, 300 sec: 11024.5). Total num frames: 3788800. Throughput: 0: 11037.4. Samples: 3776576. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:45:45,122][635489] Avg episode reward: [(0, '621.796')] [2023-03-09 05:45:45,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000007400_3788800.pth... [2023-03-09 05:45:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000006760_3461120.pth [2023-03-09 05:45:45,129][635732] Saving new best policy, reward=621.796! [2023-03-09 05:45:46,893][635776] Updated weights for policy 0, policy_version 7440 (0.0005) [2023-03-09 05:45:50,122][635489] Fps is (10 sec: 10649.6, 60 sec: 11059.2, 300 sec: 11024.5). Total num frames: 3842048. Throughput: 0: 10985.0. Samples: 3840828. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:45:50,122][635489] Avg episode reward: [(0, '613.698')] [2023-03-09 05:45:50,866][635776] Updated weights for policy 0, policy_version 7520 (0.0005) [2023-03-09 05:45:54,631][635776] Updated weights for policy 0, policy_version 7600 (0.0005) [2023-03-09 05:45:55,122][635489] Fps is (10 sec: 10649.7, 60 sec: 10990.9, 300 sec: 11010.6). Total num frames: 3895296. Throughput: 0: 10897.3. Samples: 3870976. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:45:55,122][635489] Avg episode reward: [(0, '607.492')] [2023-03-09 05:45:58,267][635776] Updated weights for policy 0, policy_version 7680 (0.0005) [2023-03-09 05:46:00,121][635489] Fps is (10 sec: 10649.7, 60 sec: 10922.7, 300 sec: 11010.6). Total num frames: 3948544. Throughput: 0: 10907.5. Samples: 3939668. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:46:00,122][635489] Avg episode reward: [(0, '618.250')] [2023-03-09 05:46:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000007712_3948544.pth... [2023-03-09 05:46:00,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000007080_3624960.pth [2023-03-09 05:46:02,138][635776] Updated weights for policy 0, policy_version 7760 (0.0005) [2023-03-09 05:46:05,122][635489] Fps is (10 sec: 11059.2, 60 sec: 10990.9, 300 sec: 11024.5). Total num frames: 4005888. Throughput: 0: 10921.5. Samples: 4005952. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 05:46:05,122][635489] Avg episode reward: [(0, '617.402')] [2023-03-09 05:46:05,554][635776] Updated weights for policy 0, policy_version 7840 (0.0005) [2023-03-09 05:46:09,261][635776] Updated weights for policy 0, policy_version 7920 (0.0005) [2023-03-09 05:46:10,122][635489] Fps is (10 sec: 11468.7, 60 sec: 10990.9, 300 sec: 11024.5). Total num frames: 4063232. Throughput: 0: 10963.2. Samples: 4040488. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 05:46:10,122][635489] Avg episode reward: [(0, '617.480')] [2023-03-09 05:46:12,955][635776] Updated weights for policy 0, policy_version 8000 (0.0004) [2023-03-09 05:46:15,121][635489] Fps is (10 sec: 11059.2, 60 sec: 10922.7, 300 sec: 11024.5). Total num frames: 4116480. Throughput: 0: 10988.5. Samples: 4106780. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:46:15,122][635489] Avg episode reward: [(0, '619.220')] [2023-03-09 05:46:15,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000008040_4116480.pth... [2023-03-09 05:46:15,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000007400_3788800.pth [2023-03-09 05:46:16,656][635776] Updated weights for policy 0, policy_version 8080 (0.0005) [2023-03-09 05:46:20,122][635489] Fps is (10 sec: 11059.2, 60 sec: 10990.9, 300 sec: 11024.5). Total num frames: 4173824. Throughput: 0: 10992.4. Samples: 4172056. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 05:46:20,122][635489] Avg episode reward: [(0, '617.180')] [2023-03-09 05:46:20,422][635776] Updated weights for policy 0, policy_version 8160 (0.0004) [2023-03-09 05:46:24,207][635776] Updated weights for policy 0, policy_version 8240 (0.0005) [2023-03-09 05:46:25,122][635489] Fps is (10 sec: 11059.0, 60 sec: 10990.9, 300 sec: 11010.6). Total num frames: 4227072. Throughput: 0: 10928.7. Samples: 4203540. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 05:46:25,122][635489] Avg episode reward: [(0, '611.516')] [2023-03-09 05:46:28,117][635776] Updated weights for policy 0, policy_version 8320 (0.0005) [2023-03-09 05:46:30,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10990.9, 300 sec: 11010.6). Total num frames: 4280320. Throughput: 0: 10923.4. Samples: 4268128. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 05:46:30,122][635489] Avg episode reward: [(0, '615.998')] [2023-03-09 05:46:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000008360_4280320.pth... [2023-03-09 05:46:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000007712_3948544.pth [2023-03-09 05:46:31,650][635776] Updated weights for policy 0, policy_version 8400 (0.0005) [2023-03-09 05:46:35,122][635489] Fps is (10 sec: 11059.3, 60 sec: 10990.9, 300 sec: 11010.6). Total num frames: 4337664. Throughput: 0: 11028.5. Samples: 4337112. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 05:46:35,122][635489] Avg episode reward: [(0, '616.188')] [2023-03-09 05:46:35,330][635776] Updated weights for policy 0, policy_version 8480 (0.0005) [2023-03-09 05:46:39,059][635776] Updated weights for policy 0, policy_version 8560 (0.0006) [2023-03-09 05:46:40,122][635489] Fps is (10 sec: 11059.2, 60 sec: 10922.7, 300 sec: 11010.6). Total num frames: 4390912. Throughput: 0: 11099.2. Samples: 4370440. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 05:46:40,122][635489] Avg episode reward: [(0, '611.178')] [2023-03-09 05:46:42,840][635776] Updated weights for policy 0, policy_version 8640 (0.0005) [2023-03-09 05:46:45,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10922.7, 300 sec: 10996.7). Total num frames: 4444160. Throughput: 0: 11018.8. Samples: 4435516. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 05:46:45,122][635489] Avg episode reward: [(0, '617.850')] [2023-03-09 05:46:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000008680_4444160.pth... [2023-03-09 05:46:45,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000008040_4116480.pth [2023-03-09 05:46:46,558][635776] Updated weights for policy 0, policy_version 8720 (0.0005) [2023-03-09 05:46:50,121][635489] Fps is (10 sec: 11059.3, 60 sec: 10990.9, 300 sec: 10996.7). Total num frames: 4501504. Throughput: 0: 11013.9. Samples: 4501576. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 05:46:50,122][635489] Avg episode reward: [(0, '617.575')] [2023-03-09 05:46:50,242][635776] Updated weights for policy 0, policy_version 8800 (0.0004) [2023-03-09 05:46:53,948][635776] Updated weights for policy 0, policy_version 8880 (0.0005) [2023-03-09 05:46:55,122][635489] Fps is (10 sec: 11468.8, 60 sec: 11059.2, 300 sec: 11010.6). Total num frames: 4558848. Throughput: 0: 10974.4. Samples: 4534336. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 05:46:55,122][635489] Avg episode reward: [(0, '616.924')] [2023-03-09 05:46:57,608][635776] Updated weights for policy 0, policy_version 8960 (0.0006) [2023-03-09 05:47:00,121][635489] Fps is (10 sec: 11059.2, 60 sec: 11059.2, 300 sec: 10996.7). Total num frames: 4612096. Throughput: 0: 11000.2. Samples: 4601788. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 05:47:00,122][635489] Avg episode reward: [(0, '605.078')] [2023-03-09 05:47:00,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000009008_4612096.pth... [2023-03-09 05:47:00,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000008360_4280320.pth [2023-03-09 05:47:01,363][635776] Updated weights for policy 0, policy_version 9040 (0.0005) [2023-03-09 05:47:05,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10990.9, 300 sec: 10996.7). Total num frames: 4665344. Throughput: 0: 10963.4. Samples: 4665408. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 05:47:05,122][635489] Avg episode reward: [(0, '612.582')] [2023-03-09 05:47:05,262][635776] Updated weights for policy 0, policy_version 9120 (0.0005) [2023-03-09 05:47:08,942][635776] Updated weights for policy 0, policy_version 9200 (0.0005) [2023-03-09 05:47:10,121][635489] Fps is (10 sec: 10649.6, 60 sec: 10922.7, 300 sec: 10996.7). Total num frames: 4718592. Throughput: 0: 10992.1. Samples: 4698184. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:47:10,122][635489] Avg episode reward: [(0, '583.601')] [2023-03-09 05:47:12,888][635776] Updated weights for policy 0, policy_version 9280 (0.0005) [2023-03-09 05:47:15,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10922.6, 300 sec: 10996.7). Total num frames: 4771840. Throughput: 0: 10960.3. Samples: 4761340. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:47:15,122][635489] Avg episode reward: [(0, '599.947')] [2023-03-09 05:47:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000009320_4771840.pth... [2023-03-09 05:47:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000008680_4444160.pth [2023-03-09 05:47:17,003][635776] Updated weights for policy 0, policy_version 9360 (0.0005) [2023-03-09 05:47:20,121][635489] Fps is (10 sec: 10649.6, 60 sec: 10854.4, 300 sec: 10982.8). Total num frames: 4825088. Throughput: 0: 10841.4. Samples: 4824976. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 05:47:20,122][635489] Avg episode reward: [(0, '586.674')] [2023-03-09 05:47:20,752][635776] Updated weights for policy 0, policy_version 9440 (0.0005) [2023-03-09 05:47:24,754][635776] Updated weights for policy 0, policy_version 9520 (0.0005) [2023-03-09 05:47:25,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10786.2, 300 sec: 10969.0). Total num frames: 4874240. Throughput: 0: 10787.8. Samples: 4855892. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 05:47:25,122][635489] Avg episode reward: [(0, '588.785')] [2023-03-09 05:47:28,622][635776] Updated weights for policy 0, policy_version 9600 (0.0004) [2023-03-09 05:47:30,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10786.1, 300 sec: 10968.9). Total num frames: 4927488. Throughput: 0: 10736.1. Samples: 4918640. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 05:47:30,122][635489] Avg episode reward: [(0, '601.420')] [2023-03-09 05:47:30,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000009624_4927488.pth... [2023-03-09 05:47:30,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000009008_4612096.pth [2023-03-09 05:47:32,504][635776] Updated weights for policy 0, policy_version 9680 (0.0005) [2023-03-09 05:47:35,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10941.2). Total num frames: 4980736. Throughput: 0: 10673.1. Samples: 4981868. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 05:47:35,122][635489] Avg episode reward: [(0, '602.171')] [2023-03-09 05:47:36,321][635776] Updated weights for policy 0, policy_version 9760 (0.0006) [2023-03-09 05:47:39,903][635776] Updated weights for policy 0, policy_version 9840 (0.0005) [2023-03-09 05:47:40,121][635489] Fps is (10 sec: 11059.3, 60 sec: 10786.2, 300 sec: 10955.1). Total num frames: 5038080. Throughput: 0: 10675.9. Samples: 5014752. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 05:47:40,122][635489] Avg episode reward: [(0, '583.909')] [2023-03-09 05:47:43,582][635776] Updated weights for policy 0, policy_version 9920 (0.0006) [2023-03-09 05:47:45,122][635489] Fps is (10 sec: 11059.2, 60 sec: 10786.1, 300 sec: 10955.1). Total num frames: 5091328. Throughput: 0: 10694.9. Samples: 5083060. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:47:45,122][635489] Avg episode reward: [(0, '587.573')] [2023-03-09 05:47:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000009952_5095424.pth... [2023-03-09 05:47:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000009320_4771840.pth [2023-03-09 05:47:47,281][635776] Updated weights for policy 0, policy_version 10000 (0.0005) [2023-03-09 05:47:50,122][635489] Fps is (10 sec: 11059.1, 60 sec: 10786.1, 300 sec: 10968.9). Total num frames: 5148672. Throughput: 0: 10709.2. Samples: 5147324. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 05:47:50,122][635489] Avg episode reward: [(0, '591.475')] [2023-03-09 05:47:50,960][635776] Updated weights for policy 0, policy_version 10080 (0.0005) [2023-03-09 05:47:54,638][635776] Updated weights for policy 0, policy_version 10160 (0.0006) [2023-03-09 05:47:55,122][635489] Fps is (10 sec: 11468.8, 60 sec: 10786.1, 300 sec: 10969.0). Total num frames: 5206016. Throughput: 0: 10742.0. Samples: 5181576. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 05:47:55,122][635489] Avg episode reward: [(0, '595.264')] [2023-03-09 05:47:58,342][635776] Updated weights for policy 0, policy_version 10240 (0.0004) [2023-03-09 05:48:00,122][635489] Fps is (10 sec: 11059.1, 60 sec: 10786.1, 300 sec: 10968.9). Total num frames: 5259264. Throughput: 0: 10808.7. Samples: 5247732. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 05:48:00,122][635489] Avg episode reward: [(0, '597.481')] [2023-03-09 05:48:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000010272_5259264.pth... [2023-03-09 05:48:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000009624_4927488.pth [2023-03-09 05:48:02,048][635776] Updated weights for policy 0, policy_version 10320 (0.0005) [2023-03-09 05:48:05,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10786.1, 300 sec: 10969.0). Total num frames: 5312512. Throughput: 0: 10845.9. Samples: 5313044. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 05:48:05,122][635489] Avg episode reward: [(0, '606.419')] [2023-03-09 05:48:06,021][635776] Updated weights for policy 0, policy_version 10400 (0.0004) [2023-03-09 05:48:09,769][635776] Updated weights for policy 0, policy_version 10480 (0.0004) [2023-03-09 05:48:10,122][635489] Fps is (10 sec: 10649.7, 60 sec: 10786.1, 300 sec: 10955.1). Total num frames: 5365760. Throughput: 0: 10875.5. Samples: 5345288. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 05:48:10,122][635489] Avg episode reward: [(0, '593.813')] [2023-03-09 05:48:13,386][635776] Updated weights for policy 0, policy_version 10560 (0.0005) [2023-03-09 05:48:15,121][635489] Fps is (10 sec: 11059.2, 60 sec: 10854.4, 300 sec: 10955.1). Total num frames: 5423104. Throughput: 0: 10959.4. Samples: 5411812. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 05:48:15,122][635489] Avg episode reward: [(0, '609.363')] [2023-03-09 05:48:15,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000010592_5423104.pth... [2023-03-09 05:48:15,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000009952_5095424.pth [2023-03-09 05:48:17,363][635776] Updated weights for policy 0, policy_version 10640 (0.0005) [2023-03-09 05:48:20,122][635489] Fps is (10 sec: 11059.2, 60 sec: 10854.4, 300 sec: 10955.1). Total num frames: 5476352. Throughput: 0: 10989.9. Samples: 5476416. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 05:48:20,122][635489] Avg episode reward: [(0, '608.017')] [2023-03-09 05:48:21,003][635776] Updated weights for policy 0, policy_version 10720 (0.0005) [2023-03-09 05:48:24,966][635776] Updated weights for policy 0, policy_version 10800 (0.0004) [2023-03-09 05:48:25,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10922.7, 300 sec: 10941.2). Total num frames: 5529600. Throughput: 0: 10980.0. Samples: 5508852. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:48:25,122][635489] Avg episode reward: [(0, '611.135')] [2023-03-09 05:48:28,613][635776] Updated weights for policy 0, policy_version 10880 (0.0004) [2023-03-09 05:48:30,122][635489] Fps is (10 sec: 11059.2, 60 sec: 10990.9, 300 sec: 10955.1). Total num frames: 5586944. Throughput: 0: 10914.4. Samples: 5574208. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 05:48:30,122][635489] Avg episode reward: [(0, '594.522')] [2023-03-09 05:48:30,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000010912_5586944.pth... [2023-03-09 05:48:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000010272_5259264.pth [2023-03-09 05:48:32,258][635776] Updated weights for policy 0, policy_version 10960 (0.0005) [2023-03-09 05:48:35,122][635489] Fps is (10 sec: 11468.8, 60 sec: 11059.2, 300 sec: 10968.9). Total num frames: 5644288. Throughput: 0: 11042.0. Samples: 5644216. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 05:48:35,122][635489] Avg episode reward: [(0, '606.807')] [2023-03-09 05:48:35,665][635776] Updated weights for policy 0, policy_version 11040 (0.0005) [2023-03-09 05:48:39,606][635776] Updated weights for policy 0, policy_version 11120 (0.0005) [2023-03-09 05:48:40,122][635489] Fps is (10 sec: 11059.2, 60 sec: 10990.9, 300 sec: 10955.1). Total num frames: 5697536. Throughput: 0: 11005.1. Samples: 5676804. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:48:40,122][635489] Avg episode reward: [(0, '611.559')] [2023-03-09 05:48:43,178][635776] Updated weights for policy 0, policy_version 11200 (0.0005) [2023-03-09 05:48:45,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10990.9, 300 sec: 10955.1). Total num frames: 5750784. Throughput: 0: 10990.4. Samples: 5742300. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:48:45,122][635489] Avg episode reward: [(0, '595.156')] [2023-03-09 05:48:45,169][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000011240_5754880.pth... [2023-03-09 05:48:45,171][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000010592_5423104.pth [2023-03-09 05:48:46,996][635776] Updated weights for policy 0, policy_version 11280 (0.0004) [2023-03-09 05:48:50,122][635489] Fps is (10 sec: 11059.2, 60 sec: 10990.9, 300 sec: 10968.9). Total num frames: 5808128. Throughput: 0: 10995.5. Samples: 5807844. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:48:50,122][635489] Avg episode reward: [(0, '588.365')] [2023-03-09 05:48:50,670][635776] Updated weights for policy 0, policy_version 11360 (0.0005) [2023-03-09 05:48:54,570][635776] Updated weights for policy 0, policy_version 11440 (0.0005) [2023-03-09 05:48:55,122][635489] Fps is (10 sec: 11059.2, 60 sec: 10922.7, 300 sec: 10968.9). Total num frames: 5861376. Throughput: 0: 11013.7. Samples: 5840904. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:48:55,122][635489] Avg episode reward: [(0, '587.664')] [2023-03-09 05:48:58,405][635776] Updated weights for policy 0, policy_version 11520 (0.0005) [2023-03-09 05:49:00,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10922.7, 300 sec: 10941.2). Total num frames: 5914624. Throughput: 0: 10912.9. Samples: 5902892. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:49:00,122][635489] Avg episode reward: [(0, '610.208')] [2023-03-09 05:49:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000011552_5914624.pth... [2023-03-09 05:49:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000010912_5586944.pth [2023-03-09 05:49:02,136][635776] Updated weights for policy 0, policy_version 11600 (0.0005) [2023-03-09 05:49:05,121][635489] Fps is (10 sec: 10649.7, 60 sec: 10922.7, 300 sec: 10941.2). Total num frames: 5967872. Throughput: 0: 10927.0. Samples: 5968128. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:49:05,122][635489] Avg episode reward: [(0, '541.688')] [2023-03-09 05:49:05,916][635776] Updated weights for policy 0, policy_version 11680 (0.0005) [2023-03-09 05:49:09,714][635776] Updated weights for policy 0, policy_version 11760 (0.0005) [2023-03-09 05:49:10,122][635489] Fps is (10 sec: 11059.3, 60 sec: 10990.9, 300 sec: 10941.2). Total num frames: 6025216. Throughput: 0: 10932.4. Samples: 6000812. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:49:10,122][635489] Avg episode reward: [(0, '579.258')] [2023-03-09 05:49:13,441][635776] Updated weights for policy 0, policy_version 11840 (0.0004) [2023-03-09 05:49:15,122][635489] Fps is (10 sec: 11058.9, 60 sec: 10922.6, 300 sec: 10941.2). Total num frames: 6078464. Throughput: 0: 10940.2. Samples: 6066516. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 05:49:15,122][635489] Avg episode reward: [(0, '588.674')] [2023-03-09 05:49:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000011872_6078464.pth... [2023-03-09 05:49:15,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000011240_5754880.pth [2023-03-09 05:49:17,332][635776] Updated weights for policy 0, policy_version 11920 (0.0004) [2023-03-09 05:49:20,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10854.4, 300 sec: 10913.4). Total num frames: 6127616. Throughput: 0: 10788.1. Samples: 6129680. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 05:49:20,122][635489] Avg episode reward: [(0, '567.861')] [2023-03-09 05:49:21,251][635776] Updated weights for policy 0, policy_version 12000 (0.0004) [2023-03-09 05:49:25,122][635489] Fps is (10 sec: 10240.2, 60 sec: 10854.4, 300 sec: 10913.4). Total num frames: 6180864. Throughput: 0: 10747.8. Samples: 6160456. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 05:49:25,122][635489] Avg episode reward: [(0, '577.885')] [2023-03-09 05:49:25,339][635776] Updated weights for policy 0, policy_version 12080 (0.0005) [2023-03-09 05:49:29,221][635776] Updated weights for policy 0, policy_version 12160 (0.0005) [2023-03-09 05:49:30,122][635489] Fps is (10 sec: 10649.4, 60 sec: 10786.1, 300 sec: 10899.5). Total num frames: 6234112. Throughput: 0: 10669.4. Samples: 6222424. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 05:49:30,122][635489] Avg episode reward: [(0, '588.112')] [2023-03-09 05:49:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000012176_6234112.pth... [2023-03-09 05:49:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000011552_5914624.pth [2023-03-09 05:49:33,180][635776] Updated weights for policy 0, policy_version 12240 (0.0005) [2023-03-09 05:49:35,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10649.6, 300 sec: 10885.6). Total num frames: 6283264. Throughput: 0: 10568.6. Samples: 6283432. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 05:49:35,122][635489] Avg episode reward: [(0, '604.468')] [2023-03-09 05:49:37,046][635776] Updated weights for policy 0, policy_version 12320 (0.0005) [2023-03-09 05:49:40,122][635489] Fps is (10 sec: 10240.1, 60 sec: 10649.6, 300 sec: 10885.6). Total num frames: 6336512. Throughput: 0: 10562.3. Samples: 6316208. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 05:49:40,122][635489] Avg episode reward: [(0, '617.274')] [2023-03-09 05:49:40,818][635776] Updated weights for policy 0, policy_version 12400 (0.0004) [2023-03-09 05:49:44,633][635776] Updated weights for policy 0, policy_version 12480 (0.0005) [2023-03-09 05:49:45,122][635489] Fps is (10 sec: 11059.2, 60 sec: 10717.9, 300 sec: 10899.5). Total num frames: 6393856. Throughput: 0: 10572.1. Samples: 6378636. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:49:45,122][635489] Avg episode reward: [(0, '620.846')] [2023-03-09 05:49:45,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000012488_6393856.pth... [2023-03-09 05:49:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000011872_6078464.pth [2023-03-09 05:49:48,351][635776] Updated weights for policy 0, policy_version 12560 (0.0005) [2023-03-09 05:49:50,122][635489] Fps is (10 sec: 11059.2, 60 sec: 10649.6, 300 sec: 10885.6). Total num frames: 6447104. Throughput: 0: 10646.9. Samples: 6447240. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:49:50,122][635489] Avg episode reward: [(0, '622.353')] [2023-03-09 05:49:50,122][635732] Saving new best policy, reward=622.353! [2023-03-09 05:49:52,119][635776] Updated weights for policy 0, policy_version 12640 (0.0004) [2023-03-09 05:49:55,121][635489] Fps is (10 sec: 10649.7, 60 sec: 10649.6, 300 sec: 10871.8). Total num frames: 6500352. Throughput: 0: 10642.2. Samples: 6479708. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:49:55,122][635489] Avg episode reward: [(0, '618.076')] [2023-03-09 05:49:55,810][635776] Updated weights for policy 0, policy_version 12720 (0.0005) [2023-03-09 05:49:59,555][635776] Updated weights for policy 0, policy_version 12800 (0.0005) [2023-03-09 05:50:00,122][635489] Fps is (10 sec: 11059.2, 60 sec: 10717.9, 300 sec: 10885.6). Total num frames: 6557696. Throughput: 0: 10667.0. Samples: 6546532. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:50:00,122][635489] Avg episode reward: [(0, '625.905')] [2023-03-09 05:50:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000012808_6557696.pth... [2023-03-09 05:50:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000012176_6234112.pth [2023-03-09 05:50:00,128][635732] Saving new best policy, reward=625.905! [2023-03-09 05:50:03,458][635776] Updated weights for policy 0, policy_version 12880 (0.0004) [2023-03-09 05:50:05,121][635489] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10857.9). Total num frames: 6606848. Throughput: 0: 10626.4. Samples: 6607868. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 05:50:05,122][635489] Avg episode reward: [(0, '614.031')] [2023-03-09 05:50:07,556][635776] Updated weights for policy 0, policy_version 12960 (0.0004) [2023-03-09 05:50:10,122][635489] Fps is (10 sec: 10240.1, 60 sec: 10581.3, 300 sec: 10844.0). Total num frames: 6660096. Throughput: 0: 10647.8. Samples: 6639608. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 05:50:10,122][635489] Avg episode reward: [(0, '615.900')] [2023-03-09 05:50:11,355][635776] Updated weights for policy 0, policy_version 13040 (0.0005) [2023-03-09 05:50:15,121][635489] Fps is (10 sec: 10649.6, 60 sec: 10581.4, 300 sec: 10844.0). Total num frames: 6713344. Throughput: 0: 10686.6. Samples: 6703320. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:50:15,122][635489] Avg episode reward: [(0, '622.284')] [2023-03-09 05:50:15,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000013112_6713344.pth... [2023-03-09 05:50:15,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000012488_6393856.pth [2023-03-09 05:50:15,212][635776] Updated weights for policy 0, policy_version 13120 (0.0005) [2023-03-09 05:50:18,999][635776] Updated weights for policy 0, policy_version 13200 (0.0005) [2023-03-09 05:50:20,121][635489] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10844.0). Total num frames: 6766592. Throughput: 0: 10744.5. Samples: 6766932. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:50:20,122][635489] Avg episode reward: [(0, '609.962')] [2023-03-09 05:50:22,833][635776] Updated weights for policy 0, policy_version 13280 (0.0005) [2023-03-09 05:50:25,121][635489] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10844.0). Total num frames: 6819840. Throughput: 0: 10741.8. Samples: 6799588. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 05:50:25,122][635489] Avg episode reward: [(0, '625.197')] [2023-03-09 05:50:26,908][635776] Updated weights for policy 0, policy_version 13360 (0.0004) [2023-03-09 05:50:30,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10830.1). Total num frames: 6873088. Throughput: 0: 10768.5. Samples: 6863220. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 05:50:30,122][635489] Avg episode reward: [(0, '648.254')] [2023-03-09 05:50:30,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000013424_6873088.pth... [2023-03-09 05:50:30,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000012808_6557696.pth [2023-03-09 05:50:30,126][635732] Saving new best policy, reward=648.254! [2023-03-09 05:50:30,618][635776] Updated weights for policy 0, policy_version 13440 (0.0005) [2023-03-09 05:50:34,371][635776] Updated weights for policy 0, policy_version 13520 (0.0005) [2023-03-09 05:50:35,122][635489] Fps is (10 sec: 11059.0, 60 sec: 10786.1, 300 sec: 10830.1). Total num frames: 6930432. Throughput: 0: 10698.6. Samples: 6928680. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 05:50:35,122][635489] Avg episode reward: [(0, '639.765')] [2023-03-09 05:50:38,156][635776] Updated weights for policy 0, policy_version 13600 (0.0005) [2023-03-09 05:50:40,122][635489] Fps is (10 sec: 11059.2, 60 sec: 10786.1, 300 sec: 10830.1). Total num frames: 6983680. Throughput: 0: 10664.1. Samples: 6959592. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 05:50:40,122][635489] Avg episode reward: [(0, '653.278')] [2023-03-09 05:50:40,123][635732] Saving new best policy, reward=653.278! [2023-03-09 05:50:41,965][635776] Updated weights for policy 0, policy_version 13680 (0.0005) [2023-03-09 05:50:45,122][635489] Fps is (10 sec: 10649.7, 60 sec: 10717.9, 300 sec: 10830.1). Total num frames: 7036928. Throughput: 0: 10626.2. Samples: 7024712. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:50:45,122][635489] Avg episode reward: [(0, '638.528')] [2023-03-09 05:50:45,127][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000013744_7036928.pth... [2023-03-09 05:50:45,130][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000013112_6713344.pth [2023-03-09 05:50:45,925][635776] Updated weights for policy 0, policy_version 13760 (0.0004) [2023-03-09 05:50:49,610][635776] Updated weights for policy 0, policy_version 13840 (0.0005) [2023-03-09 05:50:50,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10830.1). Total num frames: 7090176. Throughput: 0: 10708.0. Samples: 7089728. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:50:50,122][635489] Avg episode reward: [(0, '689.610')] [2023-03-09 05:50:50,123][635732] Saving new best policy, reward=689.610! [2023-03-09 05:50:53,646][635776] Updated weights for policy 0, policy_version 13920 (0.0005) [2023-03-09 05:50:55,122][635489] Fps is (10 sec: 10649.7, 60 sec: 10717.9, 300 sec: 10830.1). Total num frames: 7143424. Throughput: 0: 10649.1. Samples: 7118816. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:50:55,122][635489] Avg episode reward: [(0, '730.322')] [2023-03-09 05:50:55,122][635732] Saving new best policy, reward=730.322! [2023-03-09 05:50:57,263][635776] Updated weights for policy 0, policy_version 14000 (0.0005) [2023-03-09 05:51:00,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10649.6, 300 sec: 10816.2). Total num frames: 7196672. Throughput: 0: 10734.1. Samples: 7186356. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:51:00,122][635489] Avg episode reward: [(0, '759.895')] [2023-03-09 05:51:00,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000014056_7196672.pth... [2023-03-09 05:51:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000013424_6873088.pth [2023-03-09 05:51:00,128][635732] Saving new best policy, reward=759.895! [2023-03-09 05:51:01,055][635776] Updated weights for policy 0, policy_version 14080 (0.0005) [2023-03-09 05:51:04,924][635776] Updated weights for policy 0, policy_version 14160 (0.0005) [2023-03-09 05:51:05,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10717.8, 300 sec: 10802.3). Total num frames: 7249920. Throughput: 0: 10730.2. Samples: 7249792. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:51:05,122][635489] Avg episode reward: [(0, '753.177')] [2023-03-09 05:51:08,976][635776] Updated weights for policy 0, policy_version 14240 (0.0005) [2023-03-09 05:51:10,121][635489] Fps is (10 sec: 10240.2, 60 sec: 10649.6, 300 sec: 10788.4). Total num frames: 7299072. Throughput: 0: 10688.3. Samples: 7280560. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:51:10,122][635489] Avg episode reward: [(0, '699.020')] [2023-03-09 05:51:13,147][635776] Updated weights for policy 0, policy_version 14320 (0.0005) [2023-03-09 05:51:15,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10581.3, 300 sec: 10760.7). Total num frames: 7348224. Throughput: 0: 10594.4. Samples: 7339968. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:51:15,122][635489] Avg episode reward: [(0, '711.535')] [2023-03-09 05:51:15,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000014352_7348224.pth... [2023-03-09 05:51:15,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000013744_7036928.pth [2023-03-09 05:51:17,202][635776] Updated weights for policy 0, policy_version 14400 (0.0005) [2023-03-09 05:51:20,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10581.3, 300 sec: 10760.7). Total num frames: 7401472. Throughput: 0: 10479.2. Samples: 7400244. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:51:20,122][635489] Avg episode reward: [(0, '725.680')] [2023-03-09 05:51:21,388][635776] Updated weights for policy 0, policy_version 14480 (0.0005) [2023-03-09 05:51:25,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10513.1, 300 sec: 10746.8). Total num frames: 7450624. Throughput: 0: 10454.0. Samples: 7430024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:51:25,122][635489] Avg episode reward: [(0, '653.144')] [2023-03-09 05:51:25,262][635776] Updated weights for policy 0, policy_version 14560 (0.0005) [2023-03-09 05:51:29,315][635776] Updated weights for policy 0, policy_version 14640 (0.0005) [2023-03-09 05:51:30,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10732.9). Total num frames: 7503872. Throughput: 0: 10376.4. Samples: 7491648. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:51:30,122][635489] Avg episode reward: [(0, '702.416')] [2023-03-09 05:51:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000014656_7503872.pth... [2023-03-09 05:51:30,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000014056_7196672.pth [2023-03-09 05:51:33,175][635776] Updated weights for policy 0, policy_version 14720 (0.0005) [2023-03-09 05:51:35,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10732.9). Total num frames: 7557120. Throughput: 0: 10386.7. Samples: 7557128. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:51:35,122][635489] Avg episode reward: [(0, '710.505')] [2023-03-09 05:51:36,835][635776] Updated weights for policy 0, policy_version 14800 (0.0005) [2023-03-09 05:51:40,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10732.9). Total num frames: 7610368. Throughput: 0: 10461.7. Samples: 7589592. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:51:40,122][635489] Avg episode reward: [(0, '711.286')] [2023-03-09 05:51:40,637][635776] Updated weights for policy 0, policy_version 14880 (0.0005) [2023-03-09 05:51:44,934][635776] Updated weights for policy 0, policy_version 14960 (0.0005) [2023-03-09 05:51:45,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10705.1). Total num frames: 7659520. Throughput: 0: 10301.6. Samples: 7649928. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:51:45,122][635489] Avg episode reward: [(0, '617.945')] [2023-03-09 05:51:45,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000014960_7659520.pth... [2023-03-09 05:51:45,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000014352_7348224.pth [2023-03-09 05:51:48,672][635776] Updated weights for policy 0, policy_version 15040 (0.0005) [2023-03-09 05:51:50,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10691.3). Total num frames: 7712768. Throughput: 0: 10304.6. Samples: 7713496. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 05:51:50,122][635489] Avg episode reward: [(0, '650.982')] [2023-03-09 05:51:52,496][635776] Updated weights for policy 0, policy_version 15120 (0.0005) [2023-03-09 05:51:55,121][635489] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10691.3). Total num frames: 7766016. Throughput: 0: 10332.8. Samples: 7745536. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 05:51:55,122][635489] Avg episode reward: [(0, '644.198')] [2023-03-09 05:51:56,468][635776] Updated weights for policy 0, policy_version 15200 (0.0004) [2023-03-09 05:52:00,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10376.5, 300 sec: 10691.3). Total num frames: 7819264. Throughput: 0: 10382.3. Samples: 7807172. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:52:00,122][635489] Avg episode reward: [(0, '652.174')] [2023-03-09 05:52:00,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000015272_7819264.pth... [2023-03-09 05:52:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000014656_7503872.pth [2023-03-09 05:52:00,435][635776] Updated weights for policy 0, policy_version 15280 (0.0005) [2023-03-09 05:52:04,235][635776] Updated weights for policy 0, policy_version 15360 (0.0004) [2023-03-09 05:52:05,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10376.5, 300 sec: 10691.3). Total num frames: 7872512. Throughput: 0: 10486.4. Samples: 7872132. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:52:05,122][635489] Avg episode reward: [(0, '801.224')] [2023-03-09 05:52:05,123][635732] Saving new best policy, reward=801.224! [2023-03-09 05:52:08,157][635776] Updated weights for policy 0, policy_version 15440 (0.0005) [2023-03-09 05:52:10,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10691.3). Total num frames: 7925760. Throughput: 0: 10503.2. Samples: 7902668. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:52:10,122][635489] Avg episode reward: [(0, '838.439')] [2023-03-09 05:52:10,122][635732] Saving new best policy, reward=838.439! [2023-03-09 05:52:11,945][635776] Updated weights for policy 0, policy_version 15520 (0.0005) [2023-03-09 05:52:15,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10677.4). Total num frames: 7974912. Throughput: 0: 10556.3. Samples: 7966680. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:52:15,122][635489] Avg episode reward: [(0, '827.038')] [2023-03-09 05:52:15,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000015576_7974912.pth... [2023-03-09 05:52:15,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000014960_7659520.pth [2023-03-09 05:52:15,858][635776] Updated weights for policy 0, policy_version 15600 (0.0005) [2023-03-09 05:52:19,715][635776] Updated weights for policy 0, policy_version 15680 (0.0005) [2023-03-09 05:52:20,121][635489] Fps is (10 sec: 10649.7, 60 sec: 10513.1, 300 sec: 10705.1). Total num frames: 8032256. Throughput: 0: 10515.9. Samples: 8030344. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 05:52:20,132][635489] Avg episode reward: [(0, '836.403')] [2023-03-09 05:52:23,438][635776] Updated weights for policy 0, policy_version 15760 (0.0005) [2023-03-09 05:52:25,121][635489] Fps is (10 sec: 11059.2, 60 sec: 10581.3, 300 sec: 10705.1). Total num frames: 8085504. Throughput: 0: 10545.4. Samples: 8064132. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 05:52:25,122][635489] Avg episode reward: [(0, '705.007')] [2023-03-09 05:52:27,239][635776] Updated weights for policy 0, policy_version 15840 (0.0005) [2023-03-09 05:52:30,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 10705.1). Total num frames: 8138752. Throughput: 0: 10593.5. Samples: 8126636. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 05:52:30,133][635489] Avg episode reward: [(0, '723.276')] [2023-03-09 05:52:30,136][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000015896_8138752.pth... [2023-03-09 05:52:30,139][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000015272_7819264.pth [2023-03-09 05:52:31,163][635776] Updated weights for policy 0, policy_version 15920 (0.0005) [2023-03-09 05:52:35,067][635776] Updated weights for policy 0, policy_version 16000 (0.0005) [2023-03-09 05:52:35,121][635489] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10691.3). Total num frames: 8192000. Throughput: 0: 10607.1. Samples: 8190816. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:52:35,122][635489] Avg episode reward: [(0, '800.094')] [2023-03-09 05:52:39,218][635776] Updated weights for policy 0, policy_version 16080 (0.0004) [2023-03-09 05:52:40,121][635489] Fps is (10 sec: 10240.2, 60 sec: 10513.1, 300 sec: 10677.4). Total num frames: 8241152. Throughput: 0: 10560.0. Samples: 8220736. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:52:40,132][635489] Avg episode reward: [(0, '739.841')] [2023-03-09 05:52:43,071][635776] Updated weights for policy 0, policy_version 16160 (0.0005) [2023-03-09 05:52:45,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10663.5). Total num frames: 8294400. Throughput: 0: 10556.9. Samples: 8282232. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:52:45,133][635489] Avg episode reward: [(0, '969.291')] [2023-03-09 05:52:45,137][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000016200_8294400.pth... [2023-03-09 05:52:45,139][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000015576_7974912.pth [2023-03-09 05:52:45,140][635732] Saving new best policy, reward=969.291! [2023-03-09 05:52:47,110][635776] Updated weights for policy 0, policy_version 16240 (0.0005) [2023-03-09 05:52:50,121][635489] Fps is (10 sec: 10239.9, 60 sec: 10513.1, 300 sec: 10635.7). Total num frames: 8343552. Throughput: 0: 10470.2. Samples: 8343292. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:52:50,133][635489] Avg episode reward: [(0, '941.611')] [2023-03-09 05:52:51,256][635776] Updated weights for policy 0, policy_version 16320 (0.0004) [2023-03-09 05:52:55,082][635776] Updated weights for policy 0, policy_version 16400 (0.0005) [2023-03-09 05:52:55,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10635.7). Total num frames: 8396800. Throughput: 0: 10460.6. Samples: 8373396. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 05:52:55,133][635489] Avg episode reward: [(0, '872.978')] [2023-03-09 05:52:58,960][635776] Updated weights for policy 0, policy_version 16480 (0.0004) [2023-03-09 05:53:00,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10621.8). Total num frames: 8445952. Throughput: 0: 10464.9. Samples: 8437600. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 05:53:00,132][635489] Avg episode reward: [(0, '810.904')] [2023-03-09 05:53:00,135][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000016496_8445952.pth... [2023-03-09 05:53:00,137][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000015896_8138752.pth [2023-03-09 05:53:02,880][635776] Updated weights for policy 0, policy_version 16560 (0.0005) [2023-03-09 05:53:05,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10621.8). Total num frames: 8499200. Throughput: 0: 10420.6. Samples: 8499272. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 05:53:05,132][635489] Avg episode reward: [(0, '1045.311')] [2023-03-09 05:53:05,133][635732] Saving new best policy, reward=1045.311! [2023-03-09 05:53:06,808][635776] Updated weights for policy 0, policy_version 16640 (0.0005) [2023-03-09 05:53:10,121][635489] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10607.9). Total num frames: 8552448. Throughput: 0: 10386.7. Samples: 8531532. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:53:10,122][635489] Avg episode reward: [(0, '988.800')] [2023-03-09 05:53:10,757][635776] Updated weights for policy 0, policy_version 16720 (0.0005) [2023-03-09 05:53:14,889][635776] Updated weights for policy 0, policy_version 16800 (0.0004) [2023-03-09 05:53:15,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10594.1). Total num frames: 8601600. Throughput: 0: 10364.8. Samples: 8593052. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:53:15,122][635489] Avg episode reward: [(0, '999.919')] [2023-03-09 05:53:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000016800_8601600.pth... [2023-03-09 05:53:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000016200_8294400.pth [2023-03-09 05:53:18,742][635776] Updated weights for policy 0, policy_version 16880 (0.0005) [2023-03-09 05:53:20,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10594.1). Total num frames: 8654848. Throughput: 0: 10311.3. Samples: 8654824. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 05:53:20,122][635489] Avg episode reward: [(0, '1004.897')] [2023-03-09 05:53:22,751][635776] Updated weights for policy 0, policy_version 16960 (0.0005) [2023-03-09 05:53:25,121][635489] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10580.2). Total num frames: 8708096. Throughput: 0: 10319.5. Samples: 8685116. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 05:53:25,122][635489] Avg episode reward: [(0, '764.709')] [2023-03-09 05:53:26,623][635776] Updated weights for policy 0, policy_version 17040 (0.0006) [2023-03-09 05:53:30,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10552.4). Total num frames: 8757248. Throughput: 0: 10284.3. Samples: 8745024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:53:30,122][635489] Avg episode reward: [(0, '964.634')] [2023-03-09 05:53:30,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000017104_8757248.pth... [2023-03-09 05:53:30,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000016496_8445952.pth [2023-03-09 05:53:30,731][635776] Updated weights for policy 0, policy_version 17120 (0.0004) [2023-03-09 05:53:34,767][635776] Updated weights for policy 0, policy_version 17200 (0.0005) [2023-03-09 05:53:35,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 10538.5). Total num frames: 8806400. Throughput: 0: 10314.2. Samples: 8807432. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:53:35,122][635489] Avg episode reward: [(0, '961.357')] [2023-03-09 05:53:38,732][635776] Updated weights for policy 0, policy_version 17280 (0.0005) [2023-03-09 05:53:40,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10538.5). Total num frames: 8859648. Throughput: 0: 10350.1. Samples: 8839152. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:53:40,122][635489] Avg episode reward: [(0, '845.366')] [2023-03-09 05:53:42,819][635776] Updated weights for policy 0, policy_version 17360 (0.0005) [2023-03-09 05:53:45,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10308.3, 300 sec: 10524.6). Total num frames: 8912896. Throughput: 0: 10286.3. Samples: 8900484. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:53:45,122][635489] Avg episode reward: [(0, '685.059')] [2023-03-09 05:53:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000017408_8912896.pth... [2023-03-09 05:53:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000016800_8601600.pth [2023-03-09 05:53:46,725][635776] Updated weights for policy 0, policy_version 17440 (0.0004) [2023-03-09 05:53:50,121][635489] Fps is (10 sec: 10649.5, 60 sec: 10376.5, 300 sec: 10524.6). Total num frames: 8966144. Throughput: 0: 10314.5. Samples: 8963424. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:53:50,122][635489] Avg episode reward: [(0, '943.250')] [2023-03-09 05:53:50,496][635776] Updated weights for policy 0, policy_version 17520 (0.0005) [2023-03-09 05:53:54,525][635776] Updated weights for policy 0, policy_version 17600 (0.0005) [2023-03-09 05:53:55,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10510.8). Total num frames: 9015296. Throughput: 0: 10294.0. Samples: 8994764. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:53:55,122][635489] Avg episode reward: [(0, '1125.715')] [2023-03-09 05:53:55,122][635732] Saving new best policy, reward=1125.715! [2023-03-09 05:53:58,406][635776] Updated weights for policy 0, policy_version 17680 (0.0005) [2023-03-09 05:54:00,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10510.7). Total num frames: 9068544. Throughput: 0: 10301.3. Samples: 9056612. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:54:00,122][635489] Avg episode reward: [(0, '1311.214')] [2023-03-09 05:54:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000017712_9068544.pth... [2023-03-09 05:54:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000017104_8757248.pth [2023-03-09 05:54:00,128][635732] Saving new best policy, reward=1311.214! [2023-03-09 05:54:02,435][635776] Updated weights for policy 0, policy_version 17760 (0.0004) [2023-03-09 05:54:05,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10483.0). Total num frames: 9117696. Throughput: 0: 10261.2. Samples: 9116580. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:54:05,122][635489] Avg episode reward: [(0, '1113.772')] [2023-03-09 05:54:06,554][635776] Updated weights for policy 0, policy_version 17840 (0.0005) [2023-03-09 05:54:10,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 10469.1). Total num frames: 9166848. Throughput: 0: 10251.5. Samples: 9146432. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 05:54:10,122][635489] Avg episode reward: [(0, '1526.288')] [2023-03-09 05:54:10,123][635732] Saving new best policy, reward=1526.288! [2023-03-09 05:54:10,811][635776] Updated weights for policy 0, policy_version 17920 (0.0004) [2023-03-09 05:54:14,786][635776] Updated weights for policy 0, policy_version 18000 (0.0005) [2023-03-09 05:54:15,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 10469.1). Total num frames: 9216000. Throughput: 0: 10282.3. Samples: 9207728. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 05:54:15,122][635489] Avg episode reward: [(0, '1583.573')] [2023-03-09 05:54:15,142][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000018008_9220096.pth... [2023-03-09 05:54:15,144][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000017408_8912896.pth [2023-03-09 05:54:15,144][635732] Saving new best policy, reward=1583.573! [2023-03-09 05:54:18,843][635776] Updated weights for policy 0, policy_version 18080 (0.0005) [2023-03-09 05:54:20,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10469.1). Total num frames: 9269248. Throughput: 0: 10220.4. Samples: 9267352. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 05:54:20,122][635489] Avg episode reward: [(0, '2112.717')] [2023-03-09 05:54:20,123][635732] Saving new best policy, reward=2112.717! [2023-03-09 05:54:23,042][635776] Updated weights for policy 0, policy_version 18160 (0.0005) [2023-03-09 05:54:25,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10171.7, 300 sec: 10455.2). Total num frames: 9318400. Throughput: 0: 10176.4. Samples: 9297092. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 05:54:25,122][635489] Avg episode reward: [(0, '1621.437')] [2023-03-09 05:54:27,055][635776] Updated weights for policy 0, policy_version 18240 (0.0005) [2023-03-09 05:54:30,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10455.2). Total num frames: 9367552. Throughput: 0: 10168.1. Samples: 9358048. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 05:54:30,122][635489] Avg episode reward: [(0, '2067.449')] [2023-03-09 05:54:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000018296_9367552.pth... [2023-03-09 05:54:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000017712_9068544.pth [2023-03-09 05:54:31,394][635776] Updated weights for policy 0, policy_version 18320 (0.0005) [2023-03-09 05:54:35,121][635489] Fps is (10 sec: 9420.8, 60 sec: 10103.5, 300 sec: 10427.4). Total num frames: 9412608. Throughput: 0: 10009.9. Samples: 9413868. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:54:35,122][635489] Avg episode reward: [(0, '1997.311')] [2023-03-09 05:54:35,566][635776] Updated weights for policy 0, policy_version 18400 (0.0004) [2023-03-09 05:54:39,496][635776] Updated weights for policy 0, policy_version 18480 (0.0004) [2023-03-09 05:54:40,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10413.6). Total num frames: 9465856. Throughput: 0: 10001.2. Samples: 9444816. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:54:40,122][635489] Avg episode reward: [(0, '1261.853')] [2023-03-09 05:54:43,656][635776] Updated weights for policy 0, policy_version 18560 (0.0005) [2023-03-09 05:54:45,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10035.2, 300 sec: 10399.7). Total num frames: 9515008. Throughput: 0: 9966.1. Samples: 9505088. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:54:45,122][635489] Avg episode reward: [(0, '1419.924')] [2023-03-09 05:54:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000018584_9515008.pth... [2023-03-09 05:54:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000018008_9220096.pth [2023-03-09 05:54:47,962][635776] Updated weights for policy 0, policy_version 18640 (0.0004) [2023-03-09 05:54:50,121][635489] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 10385.8). Total num frames: 9564160. Throughput: 0: 9936.4. Samples: 9563716. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 05:54:50,122][635489] Avg episode reward: [(0, '1586.768')] [2023-03-09 05:54:52,144][635776] Updated weights for policy 0, policy_version 18720 (0.0005) [2023-03-09 05:54:55,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 10358.0). Total num frames: 9613312. Throughput: 0: 9918.6. Samples: 9592768. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 05:54:55,122][635489] Avg episode reward: [(0, '1280.497')] [2023-03-09 05:54:56,261][635776] Updated weights for policy 0, policy_version 18800 (0.0005) [2023-03-09 05:55:00,121][635489] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10358.0). Total num frames: 9662464. Throughput: 0: 9861.2. Samples: 9651480. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:55:00,122][635489] Avg episode reward: [(0, '978.298')] [2023-03-09 05:55:00,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000018872_9662464.pth... [2023-03-09 05:55:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000018296_9367552.pth [2023-03-09 05:55:00,331][635776] Updated weights for policy 0, policy_version 18880 (0.0005) [2023-03-09 05:55:04,326][635776] Updated weights for policy 0, policy_version 18960 (0.0004) [2023-03-09 05:55:05,121][635489] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10344.1). Total num frames: 9711616. Throughput: 0: 9893.9. Samples: 9712576. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:55:05,122][635489] Avg episode reward: [(0, '1288.265')] [2023-03-09 05:55:08,655][635776] Updated weights for policy 0, policy_version 19040 (0.0004) [2023-03-09 05:55:10,121][635489] Fps is (10 sec: 9830.5, 60 sec: 9898.7, 300 sec: 10330.3). Total num frames: 9760768. Throughput: 0: 9850.8. Samples: 9740376. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:55:10,122][635489] Avg episode reward: [(0, '1359.751')] [2023-03-09 05:55:12,836][635776] Updated weights for policy 0, policy_version 19120 (0.0004) [2023-03-09 05:55:15,121][635489] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10316.4). Total num frames: 9809920. Throughput: 0: 9791.4. Samples: 9798660. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 05:55:15,122][635489] Avg episode reward: [(0, '1566.587')] [2023-03-09 05:55:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000019160_9809920.pth... [2023-03-09 05:55:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000018584_9515008.pth [2023-03-09 05:55:17,200][635776] Updated weights for policy 0, policy_version 19200 (0.0005) [2023-03-09 05:55:20,121][635489] Fps is (10 sec: 9420.7, 60 sec: 9762.1, 300 sec: 10288.6). Total num frames: 9854976. Throughput: 0: 9819.6. Samples: 9855752. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 05:55:20,122][635489] Avg episode reward: [(0, '1686.822')] [2023-03-09 05:55:21,291][635776] Updated weights for policy 0, policy_version 19280 (0.0005) [2023-03-09 05:55:25,121][635489] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10288.6). Total num frames: 9908224. Throughput: 0: 9844.3. Samples: 9887808. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 05:55:25,122][635489] Avg episode reward: [(0, '2310.121')] [2023-03-09 05:55:25,122][635732] Saving new best policy, reward=2310.121! [2023-03-09 05:55:25,216][635776] Updated weights for policy 0, policy_version 19360 (0.0005) [2023-03-09 05:55:29,089][635776] Updated weights for policy 0, policy_version 19440 (0.0005) [2023-03-09 05:55:30,122][635489] Fps is (10 sec: 10649.6, 60 sec: 9898.7, 300 sec: 10274.7). Total num frames: 9961472. Throughput: 0: 9927.7. Samples: 9951836. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:55:30,122][635489] Avg episode reward: [(0, '2145.036')] [2023-03-09 05:55:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000019456_9961472.pth... [2023-03-09 05:55:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000018872_9662464.pth [2023-03-09 05:55:33,193][635776] Updated weights for policy 0, policy_version 19520 (0.0004) [2023-03-09 05:55:35,122][635489] Fps is (10 sec: 10239.9, 60 sec: 9966.9, 300 sec: 10260.8). Total num frames: 10010624. Throughput: 0: 9930.7. Samples: 10010596. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:55:35,122][635489] Avg episode reward: [(0, '2826.733')] [2023-03-09 05:55:35,123][635732] Saving new best policy, reward=2826.733! [2023-03-09 05:55:37,440][635776] Updated weights for policy 0, policy_version 19600 (0.0004) [2023-03-09 05:55:40,121][635489] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10246.9). Total num frames: 10059776. Throughput: 0: 9924.3. Samples: 10039360. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 05:55:40,122][635489] Avg episode reward: [(0, '1552.984')] [2023-03-09 05:55:41,675][635776] Updated weights for policy 0, policy_version 19680 (0.0005) [2023-03-09 05:55:45,121][635489] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10233.1). Total num frames: 10108928. Throughput: 0: 9941.1. Samples: 10098828. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 05:55:45,122][635489] Avg episode reward: [(0, '2031.382')] [2023-03-09 05:55:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000019744_10108928.pth... [2023-03-09 05:55:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000019160_9809920.pth [2023-03-09 05:55:45,567][635776] Updated weights for policy 0, policy_version 19760 (0.0005) [2023-03-09 05:55:49,755][635776] Updated weights for policy 0, policy_version 19840 (0.0005) [2023-03-09 05:55:50,121][635489] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10219.2). Total num frames: 10158080. Throughput: 0: 9930.3. Samples: 10159440. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 05:55:50,122][635489] Avg episode reward: [(0, '2603.699')] [2023-03-09 05:55:53,858][635776] Updated weights for policy 0, policy_version 19920 (0.0005) [2023-03-09 05:55:55,121][635489] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10205.3). Total num frames: 10207232. Throughput: 0: 10004.3. Samples: 10190572. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 05:55:55,122][635489] Avg episode reward: [(0, '3281.652')] [2023-03-09 05:55:55,129][635732] Saving new best policy, reward=3281.652! [2023-03-09 05:55:58,021][635776] Updated weights for policy 0, policy_version 20000 (0.0005) [2023-03-09 05:56:00,122][635489] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 10205.3). Total num frames: 10260480. Throughput: 0: 9991.0. Samples: 10248256. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 05:56:00,122][635489] Avg episode reward: [(0, '3193.420')] [2023-03-09 05:56:00,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000020040_10260480.pth... [2023-03-09 05:56:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000019456_9961472.pth [2023-03-09 05:56:02,230][635776] Updated weights for policy 0, policy_version 20080 (0.0005) [2023-03-09 05:56:05,121][635489] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10191.4). Total num frames: 10305536. Throughput: 0: 10013.8. Samples: 10306372. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 05:56:05,122][635489] Avg episode reward: [(0, '2604.124')] [2023-03-09 05:56:06,454][635776] Updated weights for policy 0, policy_version 20160 (0.0004) [2023-03-09 05:56:10,121][635489] Fps is (10 sec: 9420.9, 60 sec: 9898.7, 300 sec: 10191.4). Total num frames: 10354688. Throughput: 0: 9946.1. Samples: 10335384. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:56:10,122][635489] Avg episode reward: [(0, '3198.631')] [2023-03-09 05:56:10,649][635776] Updated weights for policy 0, policy_version 20240 (0.0004) [2023-03-09 05:56:14,925][635776] Updated weights for policy 0, policy_version 20320 (0.0005) [2023-03-09 05:56:15,122][635489] Fps is (10 sec: 9830.3, 60 sec: 9898.6, 300 sec: 10177.5). Total num frames: 10403840. Throughput: 0: 9814.2. Samples: 10393476. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:56:15,122][635489] Avg episode reward: [(0, '2971.520')] [2023-03-09 05:56:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000020320_10403840.pth... [2023-03-09 05:56:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000019744_10108928.pth [2023-03-09 05:56:19,121][635776] Updated weights for policy 0, policy_version 20400 (0.0005) [2023-03-09 05:56:20,121][635489] Fps is (10 sec: 9830.3, 60 sec: 9966.9, 300 sec: 10177.5). Total num frames: 10452992. Throughput: 0: 9830.4. Samples: 10452964. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:56:20,133][635489] Avg episode reward: [(0, '1813.978')] [2023-03-09 05:56:23,262][635776] Updated weights for policy 0, policy_version 20480 (0.0005) [2023-03-09 05:56:25,121][635489] Fps is (10 sec: 9830.5, 60 sec: 9898.7, 300 sec: 10163.6). Total num frames: 10502144. Throughput: 0: 9833.7. Samples: 10481876. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:56:25,122][635489] Avg episode reward: [(0, '2460.652')] [2023-03-09 05:56:27,365][635776] Updated weights for policy 0, policy_version 20560 (0.0005) [2023-03-09 05:56:30,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10149.8). Total num frames: 10551296. Throughput: 0: 9872.8. Samples: 10543104. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:56:30,122][635489] Avg episode reward: [(0, '3318.381')] [2023-03-09 05:56:30,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000020616_10555392.pth... [2023-03-09 05:56:30,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000020040_10260480.pth [2023-03-09 05:56:30,127][635732] Saving new best policy, reward=3318.381! [2023-03-09 05:56:31,407][635776] Updated weights for policy 0, policy_version 20640 (0.0005) [2023-03-09 05:56:35,121][635489] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 10149.8). Total num frames: 10604544. Throughput: 0: 9873.3. Samples: 10603740. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:56:35,122][635489] Avg episode reward: [(0, '3816.322')] [2023-03-09 05:56:35,123][635732] Saving new best policy, reward=3816.322! [2023-03-09 05:56:35,391][635776] Updated weights for policy 0, policy_version 20720 (0.0005) [2023-03-09 05:56:39,715][635776] Updated weights for policy 0, policy_version 20800 (0.0005) [2023-03-09 05:56:40,121][635489] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 10149.7). Total num frames: 10653696. Throughput: 0: 9821.0. Samples: 10632516. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:56:40,122][635489] Avg episode reward: [(0, '3835.417')] [2023-03-09 05:56:40,123][635732] Saving new best policy, reward=3835.417! [2023-03-09 05:56:43,795][635776] Updated weights for policy 0, policy_version 20880 (0.0006) [2023-03-09 05:56:45,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10135.9). Total num frames: 10702848. Throughput: 0: 9844.4. Samples: 10691256. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:56:45,122][635489] Avg episode reward: [(0, '3122.642')] [2023-03-09 05:56:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000020904_10702848.pth... [2023-03-09 05:56:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000020320_10403840.pth [2023-03-09 05:56:48,011][635776] Updated weights for policy 0, policy_version 20960 (0.0005) [2023-03-09 05:56:50,121][635489] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10122.0). Total num frames: 10752000. Throughput: 0: 9859.7. Samples: 10750060. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:56:50,122][635489] Avg episode reward: [(0, '3145.534')] [2023-03-09 05:56:52,199][635776] Updated weights for policy 0, policy_version 21040 (0.0005) [2023-03-09 05:56:55,121][635489] Fps is (10 sec: 9420.8, 60 sec: 9830.4, 300 sec: 10094.2). Total num frames: 10797056. Throughput: 0: 9869.1. Samples: 10779496. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:56:55,122][635489] Avg episode reward: [(0, '3363.439')] [2023-03-09 05:56:56,548][635776] Updated weights for policy 0, policy_version 21120 (0.0005) [2023-03-09 05:57:00,122][635489] Fps is (10 sec: 9420.7, 60 sec: 9762.1, 300 sec: 10080.3). Total num frames: 10846208. Throughput: 0: 9833.6. Samples: 10835988. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:57:00,122][635489] Avg episode reward: [(0, '3374.690')] [2023-03-09 05:57:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000021184_10846208.pth... [2023-03-09 05:57:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000020616_10555392.pth [2023-03-09 05:57:00,878][635776] Updated weights for policy 0, policy_version 21200 (0.0005) [2023-03-09 05:57:05,006][635776] Updated weights for policy 0, policy_version 21280 (0.0005) [2023-03-09 05:57:05,121][635489] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10066.4). Total num frames: 10895360. Throughput: 0: 9825.3. Samples: 10895104. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 05:57:05,122][635489] Avg episode reward: [(0, '3351.468')] [2023-03-09 05:57:09,344][635776] Updated weights for policy 0, policy_version 21360 (0.0005) [2023-03-09 05:57:10,121][635489] Fps is (10 sec: 9420.9, 60 sec: 9762.1, 300 sec: 10052.6). Total num frames: 10940416. Throughput: 0: 9788.5. Samples: 10922360. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 05:57:10,122][635489] Avg episode reward: [(0, '1751.879')] [2023-03-09 05:57:13,321][635776] Updated weights for policy 0, policy_version 21440 (0.0005) [2023-03-09 05:57:15,122][635489] Fps is (10 sec: 9830.3, 60 sec: 9830.4, 300 sec: 10038.7). Total num frames: 10993664. Throughput: 0: 9757.2. Samples: 10982180. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 05:57:15,122][635489] Avg episode reward: [(0, '2635.483')] [2023-03-09 05:57:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000021472_10993664.pth... [2023-03-09 05:57:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000020904_10702848.pth [2023-03-09 05:57:17,473][635776] Updated weights for policy 0, policy_version 21520 (0.0005) [2023-03-09 05:57:20,121][635489] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 10024.8). Total num frames: 11042816. Throughput: 0: 9758.7. Samples: 11042880. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 05:57:20,122][635489] Avg episode reward: [(0, '2963.925')] [2023-03-09 05:57:21,560][635776] Updated weights for policy 0, policy_version 21600 (0.0005) [2023-03-09 05:57:25,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10010.9). Total num frames: 11091968. Throughput: 0: 9781.5. Samples: 11072684. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 05:57:25,122][635489] Avg episode reward: [(0, '3056.608')] [2023-03-09 05:57:25,939][635776] Updated weights for policy 0, policy_version 21680 (0.0005) [2023-03-09 05:57:30,122][635489] Fps is (10 sec: 9420.7, 60 sec: 9762.1, 300 sec: 9983.1). Total num frames: 11137024. Throughput: 0: 9709.9. Samples: 11128200. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 05:57:30,122][635489] Avg episode reward: [(0, '3019.863')] [2023-03-09 05:57:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000021752_11137024.pth... [2023-03-09 05:57:30,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000021184_10846208.pth [2023-03-09 05:57:30,326][635776] Updated weights for policy 0, policy_version 21760 (0.0005) [2023-03-09 05:57:34,274][635776] Updated weights for policy 0, policy_version 21840 (0.0005) [2023-03-09 05:57:35,121][635489] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9997.0). Total num frames: 11190272. Throughput: 0: 9767.7. Samples: 11189608. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:57:35,122][635489] Avg episode reward: [(0, '3348.991')] [2023-03-09 05:57:38,264][635776] Updated weights for policy 0, policy_version 21920 (0.0005) [2023-03-09 05:57:40,121][635489] Fps is (10 sec: 10240.1, 60 sec: 9762.1, 300 sec: 9983.1). Total num frames: 11239424. Throughput: 0: 9772.3. Samples: 11219248. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:57:40,122][635489] Avg episode reward: [(0, '3914.760')] [2023-03-09 05:57:40,122][635732] Saving new best policy, reward=3914.760! [2023-03-09 05:57:42,502][635776] Updated weights for policy 0, policy_version 22000 (0.0005) [2023-03-09 05:57:45,121][635489] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9983.1). Total num frames: 11288576. Throughput: 0: 9831.2. Samples: 11278392. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:57:45,122][635489] Avg episode reward: [(0, '4034.340')] [2023-03-09 05:57:45,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000022048_11288576.pth... [2023-03-09 05:57:45,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000021472_10993664.pth [2023-03-09 05:57:45,127][635732] Saving new best policy, reward=4034.340! [2023-03-09 05:57:46,691][635776] Updated weights for policy 0, policy_version 22080 (0.0005) [2023-03-09 05:57:50,122][635489] Fps is (10 sec: 9420.7, 60 sec: 9693.8, 300 sec: 9955.4). Total num frames: 11333632. Throughput: 0: 9780.7. Samples: 11335236. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:57:50,122][635489] Avg episode reward: [(0, '4064.498')] [2023-03-09 05:57:50,171][635732] Saving new best policy, reward=4064.498! [2023-03-09 05:57:51,071][635776] Updated weights for policy 0, policy_version 22160 (0.0005) [2023-03-09 05:57:55,122][635489] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9955.4). Total num frames: 11382784. Throughput: 0: 9815.8. Samples: 11364072. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:57:55,122][635489] Avg episode reward: [(0, '3776.998')] [2023-03-09 05:57:55,293][635776] Updated weights for policy 0, policy_version 22240 (0.0005) [2023-03-09 05:57:59,419][635776] Updated weights for policy 0, policy_version 22320 (0.0006) [2023-03-09 05:58:00,122][635489] Fps is (10 sec: 9830.5, 60 sec: 9762.1, 300 sec: 9941.5). Total num frames: 11431936. Throughput: 0: 9810.5. Samples: 11423652. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:58:00,122][635489] Avg episode reward: [(0, '3742.043')] [2023-03-09 05:58:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000022328_11431936.pth... [2023-03-09 05:58:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000021752_11137024.pth [2023-03-09 05:58:03,179][635776] Updated weights for policy 0, policy_version 22400 (0.0005) [2023-03-09 05:58:05,122][635489] Fps is (10 sec: 10649.6, 60 sec: 9898.7, 300 sec: 9955.4). Total num frames: 11489280. Throughput: 0: 9916.9. Samples: 11489140. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:58:05,122][635489] Avg episode reward: [(0, '4125.282')] [2023-03-09 05:58:05,123][635732] Saving new best policy, reward=4125.282! [2023-03-09 05:58:07,140][635776] Updated weights for policy 0, policy_version 22480 (0.0005) [2023-03-09 05:58:10,122][635489] Fps is (10 sec: 10649.6, 60 sec: 9966.9, 300 sec: 9955.4). Total num frames: 11538432. Throughput: 0: 9889.3. Samples: 11517704. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:58:10,122][635489] Avg episode reward: [(0, '4330.514')] [2023-03-09 05:58:10,123][635732] Saving new best policy, reward=4330.514! [2023-03-09 05:58:11,270][635776] Updated weights for policy 0, policy_version 22560 (0.0005) [2023-03-09 05:58:15,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9941.5). Total num frames: 11587584. Throughput: 0: 9982.7. Samples: 11577420. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:58:15,122][635489] Avg episode reward: [(0, '4396.888')] [2023-03-09 05:58:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000022632_11587584.pth... [2023-03-09 05:58:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000022048_11288576.pth [2023-03-09 05:58:15,128][635732] Saving new best policy, reward=4396.888! [2023-03-09 05:58:15,373][635776] Updated weights for policy 0, policy_version 22640 (0.0005) [2023-03-09 05:58:19,485][635776] Updated weights for policy 0, policy_version 22720 (0.0005) [2023-03-09 05:58:20,121][635489] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9927.6). Total num frames: 11636736. Throughput: 0: 9937.6. Samples: 11636800. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:58:20,122][635489] Avg episode reward: [(0, '4402.825')] [2023-03-09 05:58:20,123][635732] Saving new best policy, reward=4402.825! [2023-03-09 05:58:23,558][635776] Updated weights for policy 0, policy_version 22800 (0.0004) [2023-03-09 05:58:25,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9927.6). Total num frames: 11685888. Throughput: 0: 9943.7. Samples: 11666716. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:58:25,122][635489] Avg episode reward: [(0, '4300.115')] [2023-03-09 05:58:27,823][635776] Updated weights for policy 0, policy_version 22880 (0.0006) [2023-03-09 05:58:30,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9927.6). Total num frames: 11735040. Throughput: 0: 9957.0. Samples: 11726456. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:58:30,122][635489] Avg episode reward: [(0, '4323.008')] [2023-03-09 05:58:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000022920_11735040.pth... [2023-03-09 05:58:30,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000022328_11431936.pth [2023-03-09 05:58:31,917][635776] Updated weights for policy 0, policy_version 22960 (0.0005) [2023-03-09 05:58:35,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9913.7). Total num frames: 11784192. Throughput: 0: 9975.8. Samples: 11784148. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:58:35,122][635489] Avg episode reward: [(0, '4448.371')] [2023-03-09 05:58:35,123][635732] Saving new best policy, reward=4448.371! [2023-03-09 05:58:36,250][635776] Updated weights for policy 0, policy_version 23040 (0.0005) [2023-03-09 05:58:40,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9899.8). Total num frames: 11833344. Throughput: 0: 9964.6. Samples: 11812480. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:58:40,122][635489] Avg episode reward: [(0, '4250.961')] [2023-03-09 05:58:40,543][635776] Updated weights for policy 0, policy_version 23120 (0.0004) [2023-03-09 05:58:44,923][635776] Updated weights for policy 0, policy_version 23200 (0.0005) [2023-03-09 05:58:45,122][635489] Fps is (10 sec: 9420.8, 60 sec: 9830.4, 300 sec: 9872.1). Total num frames: 11878400. Throughput: 0: 9900.0. Samples: 11869152. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:58:45,122][635489] Avg episode reward: [(0, '3927.714')] [2023-03-09 05:58:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000023200_11878400.pth... [2023-03-09 05:58:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000022632_11587584.pth [2023-03-09 05:58:48,902][635776] Updated weights for policy 0, policy_version 23280 (0.0005) [2023-03-09 05:58:50,122][635489] Fps is (10 sec: 9420.9, 60 sec: 9898.7, 300 sec: 9872.1). Total num frames: 11927552. Throughput: 0: 9770.0. Samples: 11928788. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:58:50,122][635489] Avg episode reward: [(0, '4064.114')] [2023-03-09 05:58:53,162][635776] Updated weights for policy 0, policy_version 23360 (0.0004) [2023-03-09 05:58:55,121][635489] Fps is (10 sec: 9830.5, 60 sec: 9898.7, 300 sec: 9858.2). Total num frames: 11976704. Throughput: 0: 9785.0. Samples: 11958028. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 05:58:55,122][635489] Avg episode reward: [(0, '4170.943')] [2023-03-09 05:58:57,235][635776] Updated weights for policy 0, policy_version 23440 (0.0005) [2023-03-09 05:59:00,122][635489] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 9872.1). Total num frames: 12029952. Throughput: 0: 9800.0. Samples: 12018420. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 05:59:00,122][635489] Avg episode reward: [(0, '4307.143')] [2023-03-09 05:59:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000023496_12029952.pth... [2023-03-09 05:59:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000022920_11735040.pth [2023-03-09 05:59:01,282][635776] Updated weights for policy 0, policy_version 23520 (0.0004) [2023-03-09 05:59:05,121][635489] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 9872.1). Total num frames: 12079104. Throughput: 0: 9768.5. Samples: 12076384. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 05:59:05,122][635489] Avg episode reward: [(0, '4193.752')] [2023-03-09 05:59:05,537][635776] Updated weights for policy 0, policy_version 23600 (0.0005) [2023-03-09 05:59:09,581][635776] Updated weights for policy 0, policy_version 23680 (0.0004) [2023-03-09 05:59:10,121][635489] Fps is (10 sec: 9830.5, 60 sec: 9830.4, 300 sec: 9872.1). Total num frames: 12128256. Throughput: 0: 9802.8. Samples: 12107840. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 05:59:10,122][635489] Avg episode reward: [(0, '4370.439')] [2023-03-09 05:59:13,625][635776] Updated weights for policy 0, policy_version 23760 (0.0005) [2023-03-09 05:59:15,122][635489] Fps is (10 sec: 9830.3, 60 sec: 9830.4, 300 sec: 9858.2). Total num frames: 12177408. Throughput: 0: 9817.2. Samples: 12168228. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 05:59:15,122][635489] Avg episode reward: [(0, '4368.036')] [2023-03-09 05:59:15,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000023784_12177408.pth... [2023-03-09 05:59:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000023200_11878400.pth [2023-03-09 05:59:17,766][635776] Updated weights for policy 0, policy_version 23840 (0.0005) [2023-03-09 05:59:20,122][635489] Fps is (10 sec: 10239.9, 60 sec: 9898.7, 300 sec: 9872.1). Total num frames: 12230656. Throughput: 0: 9876.4. Samples: 12228588. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 05:59:20,122][635489] Avg episode reward: [(0, '4477.565')] [2023-03-09 05:59:20,123][635732] Saving new best policy, reward=4477.565! [2023-03-09 05:59:21,778][635776] Updated weights for policy 0, policy_version 23920 (0.0005) [2023-03-09 05:59:25,122][635489] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9872.1). Total num frames: 12279808. Throughput: 0: 9930.2. Samples: 12259340. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:59:25,122][635489] Avg episode reward: [(0, '4463.521')] [2023-03-09 05:59:25,695][635776] Updated weights for policy 0, policy_version 24000 (0.0005) [2023-03-09 05:59:29,711][635776] Updated weights for policy 0, policy_version 24080 (0.0005) [2023-03-09 05:59:30,121][635489] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9885.9). Total num frames: 12328960. Throughput: 0: 10035.9. Samples: 12320768. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:59:30,122][635489] Avg episode reward: [(0, '4296.351')] [2023-03-09 05:59:30,161][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000024088_12333056.pth... [2023-03-09 05:59:30,163][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000023496_12029952.pth [2023-03-09 05:59:34,011][635776] Updated weights for policy 0, policy_version 24160 (0.0004) [2023-03-09 05:59:35,121][635489] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9872.1). Total num frames: 12378112. Throughput: 0: 9985.0. Samples: 12378112. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:59:35,122][635489] Avg episode reward: [(0, '3980.477')] [2023-03-09 05:59:38,272][635776] Updated weights for policy 0, policy_version 24240 (0.0005) [2023-03-09 05:59:40,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9872.1). Total num frames: 12427264. Throughput: 0: 9973.9. Samples: 12406856. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:59:40,122][635489] Avg episode reward: [(0, '4083.309')] [2023-03-09 05:59:42,382][635776] Updated weights for policy 0, policy_version 24320 (0.0005) [2023-03-09 05:59:45,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10035.2, 300 sec: 9885.9). Total num frames: 12480512. Throughput: 0: 9997.1. Samples: 12468288. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:59:45,122][635489] Avg episode reward: [(0, '4437.907')] [2023-03-09 05:59:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000024376_12480512.pth... [2023-03-09 05:59:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000023784_12177408.pth [2023-03-09 05:59:46,289][635776] Updated weights for policy 0, policy_version 24400 (0.0005) [2023-03-09 05:59:50,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9885.9). Total num frames: 12529664. Throughput: 0: 10074.3. Samples: 12529728. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:59:50,122][635489] Avg episode reward: [(0, '4218.201')] [2023-03-09 05:59:50,330][635776] Updated weights for policy 0, policy_version 24480 (0.0005) [2023-03-09 05:59:54,370][635776] Updated weights for policy 0, policy_version 24560 (0.0004) [2023-03-09 05:59:55,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10035.2, 300 sec: 9885.9). Total num frames: 12578816. Throughput: 0: 10023.6. Samples: 12558900. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 05:59:55,122][635489] Avg episode reward: [(0, '4058.999')] [2023-03-09 05:59:58,741][635776] Updated weights for policy 0, policy_version 24640 (0.0005) [2023-03-09 06:00:00,121][635489] Fps is (10 sec: 9830.5, 60 sec: 9967.0, 300 sec: 9885.9). Total num frames: 12627968. Throughput: 0: 9975.6. Samples: 12617128. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:00:00,122][635489] Avg episode reward: [(0, '4038.439')] [2023-03-09 06:00:00,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000024664_12627968.pth... [2023-03-09 06:00:00,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000024088_12333056.pth [2023-03-09 06:00:02,996][635776] Updated weights for policy 0, policy_version 24720 (0.0004) [2023-03-09 06:00:05,121][635489] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9885.9). Total num frames: 12677120. Throughput: 0: 9963.9. Samples: 12676964. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:00:05,122][635489] Avg episode reward: [(0, '3871.760')] [2023-03-09 06:00:07,008][635776] Updated weights for policy 0, policy_version 24800 (0.0004) [2023-03-09 06:00:10,121][635489] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9885.9). Total num frames: 12726272. Throughput: 0: 9923.1. Samples: 12705880. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:00:10,122][635489] Avg episode reward: [(0, '3772.449')] [2023-03-09 06:00:11,116][635776] Updated weights for policy 0, policy_version 24880 (0.0005) [2023-03-09 06:00:15,122][635489] Fps is (10 sec: 9830.3, 60 sec: 9966.9, 300 sec: 9899.8). Total num frames: 12775424. Throughput: 0: 9918.0. Samples: 12767080. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:00:15,122][635489] Avg episode reward: [(0, '4466.491')] [2023-03-09 06:00:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000024952_12775424.pth... [2023-03-09 06:00:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000024376_12480512.pth [2023-03-09 06:00:15,245][635776] Updated weights for policy 0, policy_version 24960 (0.0005) [2023-03-09 06:00:19,379][635776] Updated weights for policy 0, policy_version 25040 (0.0005) [2023-03-09 06:00:20,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9885.9). Total num frames: 12824576. Throughput: 0: 9930.9. Samples: 12825004. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:00:20,122][635489] Avg episode reward: [(0, '4251.893')] [2023-03-09 06:00:23,543][635776] Updated weights for policy 0, policy_version 25120 (0.0005) [2023-03-09 06:00:25,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9872.1). Total num frames: 12873728. Throughput: 0: 9973.2. Samples: 12855652. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:00:25,122][635489] Avg episode reward: [(0, '4312.261')] [2023-03-09 06:00:27,432][635776] Updated weights for policy 0, policy_version 25200 (0.0005) [2023-03-09 06:00:30,122][635489] Fps is (10 sec: 10239.9, 60 sec: 9966.9, 300 sec: 9885.9). Total num frames: 12926976. Throughput: 0: 9950.0. Samples: 12916040. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 06:00:30,122][635489] Avg episode reward: [(0, '4439.201')] [2023-03-09 06:00:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000025248_12926976.pth... [2023-03-09 06:00:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000024664_12627968.pth [2023-03-09 06:00:31,517][635776] Updated weights for policy 0, policy_version 25280 (0.0005) [2023-03-09 06:00:35,122][635489] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 9885.9). Total num frames: 12976128. Throughput: 0: 9920.3. Samples: 12976140. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 06:00:35,122][635489] Avg episode reward: [(0, '4278.412')] [2023-03-09 06:00:35,739][635776] Updated weights for policy 0, policy_version 25360 (0.0005) [2023-03-09 06:00:39,872][635776] Updated weights for policy 0, policy_version 25440 (0.0005) [2023-03-09 06:00:40,122][635489] Fps is (10 sec: 9830.5, 60 sec: 9966.9, 300 sec: 9885.9). Total num frames: 13025280. Throughput: 0: 9922.4. Samples: 13005408. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 06:00:40,122][635489] Avg episode reward: [(0, '4406.880')] [2023-03-09 06:00:44,278][635776] Updated weights for policy 0, policy_version 25520 (0.0005) [2023-03-09 06:00:45,122][635489] Fps is (10 sec: 9830.3, 60 sec: 9898.7, 300 sec: 9885.9). Total num frames: 13074432. Throughput: 0: 9892.0. Samples: 13062268. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 06:00:45,122][635489] Avg episode reward: [(0, '4417.938')] [2023-03-09 06:00:45,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000025536_13074432.pth... [2023-03-09 06:00:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000024952_12775424.pth [2023-03-09 06:00:48,426][635776] Updated weights for policy 0, policy_version 25600 (0.0005) [2023-03-09 06:00:50,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9885.9). Total num frames: 13123584. Throughput: 0: 9907.0. Samples: 13122780. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:00:50,122][635489] Avg episode reward: [(0, '4410.227')] [2023-03-09 06:00:52,653][635776] Updated weights for policy 0, policy_version 25680 (0.0004) [2023-03-09 06:00:55,122][635489] Fps is (10 sec: 9830.5, 60 sec: 9898.7, 300 sec: 9872.1). Total num frames: 13172736. Throughput: 0: 9896.3. Samples: 13151216. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:00:55,122][635489] Avg episode reward: [(0, '4389.388')] [2023-03-09 06:00:56,747][635776] Updated weights for policy 0, policy_version 25760 (0.0004) [2023-03-09 06:01:00,121][635489] Fps is (10 sec: 9830.5, 60 sec: 9898.7, 300 sec: 9885.9). Total num frames: 13221888. Throughput: 0: 9849.7. Samples: 13210316. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:01:00,122][635489] Avg episode reward: [(0, '4411.615')] [2023-03-09 06:01:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000025824_13221888.pth... [2023-03-09 06:01:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000025248_12926976.pth [2023-03-09 06:01:00,799][635776] Updated weights for policy 0, policy_version 25840 (0.0005) [2023-03-09 06:01:04,630][635776] Updated weights for policy 0, policy_version 25920 (0.0004) [2023-03-09 06:01:05,121][635489] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 9899.8). Total num frames: 13275136. Throughput: 0: 9981.7. Samples: 13274180. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 06:01:05,122][635489] Avg episode reward: [(0, '4167.092')] [2023-03-09 06:01:08,523][635776] Updated weights for policy 0, policy_version 26000 (0.0005) [2023-03-09 06:01:10,122][635489] Fps is (10 sec: 10239.9, 60 sec: 9966.9, 300 sec: 9899.8). Total num frames: 13324288. Throughput: 0: 9966.8. Samples: 13304160. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 06:01:10,122][635489] Avg episode reward: [(0, '4273.667')] [2023-03-09 06:01:12,553][635776] Updated weights for policy 0, policy_version 26080 (0.0004) [2023-03-09 06:01:15,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10035.2, 300 sec: 9913.7). Total num frames: 13377536. Throughput: 0: 10072.4. Samples: 13369296. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 06:01:15,122][635489] Avg episode reward: [(0, '4340.409')] [2023-03-09 06:01:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000026128_13377536.pth... [2023-03-09 06:01:15,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000025536_13074432.pth [2023-03-09 06:01:16,424][635776] Updated weights for policy 0, policy_version 26160 (0.0005) [2023-03-09 06:01:20,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10035.2, 300 sec: 9913.7). Total num frames: 13426688. Throughput: 0: 10032.0. Samples: 13427580. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 06:01:20,122][635489] Avg episode reward: [(0, '3935.548')] [2023-03-09 06:01:20,598][635776] Updated weights for policy 0, policy_version 26240 (0.0005) [2023-03-09 06:01:24,662][635776] Updated weights for policy 0, policy_version 26320 (0.0005) [2023-03-09 06:01:25,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10035.2, 300 sec: 9913.7). Total num frames: 13475840. Throughput: 0: 10084.6. Samples: 13459216. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:01:25,122][635489] Avg episode reward: [(0, '3757.812')] [2023-03-09 06:01:28,805][635776] Updated weights for policy 0, policy_version 26400 (0.0005) [2023-03-09 06:01:30,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10035.2, 300 sec: 9913.7). Total num frames: 13529088. Throughput: 0: 10112.4. Samples: 13517324. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:01:30,122][635489] Avg episode reward: [(0, '4211.157')] [2023-03-09 06:01:30,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000026424_13529088.pth... [2023-03-09 06:01:30,129][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000025824_13221888.pth [2023-03-09 06:01:33,125][635776] Updated weights for policy 0, policy_version 26480 (0.0004) [2023-03-09 06:01:35,121][635489] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9899.8). Total num frames: 13574144. Throughput: 0: 10053.5. Samples: 13575188. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:01:35,122][635489] Avg episode reward: [(0, '4301.431')] [2023-03-09 06:01:37,002][635776] Updated weights for policy 0, policy_version 26560 (0.0005) [2023-03-09 06:01:40,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 9913.7). Total num frames: 13627392. Throughput: 0: 10128.2. Samples: 13606984. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:01:40,122][635489] Avg episode reward: [(0, '4345.749')] [2023-03-09 06:01:41,201][635776] Updated weights for policy 0, policy_version 26640 (0.0004) [2023-03-09 06:01:45,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9913.7). Total num frames: 13676544. Throughput: 0: 10152.1. Samples: 13667160. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:01:45,122][635489] Avg episode reward: [(0, '4246.708')] [2023-03-09 06:01:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000026712_13676544.pth... [2023-03-09 06:01:45,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000026128_13377536.pth [2023-03-09 06:01:45,233][635776] Updated weights for policy 0, policy_version 26720 (0.0005) [2023-03-09 06:01:49,263][635776] Updated weights for policy 0, policy_version 26800 (0.0005) [2023-03-09 06:01:50,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 9941.5). Total num frames: 13729792. Throughput: 0: 10087.2. Samples: 13728104. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:01:50,122][635489] Avg episode reward: [(0, '4384.026')] [2023-03-09 06:01:53,178][635776] Updated weights for policy 0, policy_version 26880 (0.0005) [2023-03-09 06:01:55,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 9941.5). Total num frames: 13778944. Throughput: 0: 10097.2. Samples: 13758536. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:01:55,122][635489] Avg episode reward: [(0, '4333.909')] [2023-03-09 06:01:57,147][635776] Updated weights for policy 0, policy_version 26960 (0.0005) [2023-03-09 06:02:00,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 9955.4). Total num frames: 13832192. Throughput: 0: 10034.9. Samples: 13820868. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 06:02:00,122][635489] Avg episode reward: [(0, '4493.763')] [2023-03-09 06:02:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000027016_13832192.pth... [2023-03-09 06:02:00,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000026424_13529088.pth [2023-03-09 06:02:00,127][635732] Saving new best policy, reward=4493.763! [2023-03-09 06:02:01,010][635776] Updated weights for policy 0, policy_version 27040 (0.0005) [2023-03-09 06:02:05,002][635776] Updated weights for policy 0, policy_version 27120 (0.0005) [2023-03-09 06:02:05,121][635489] Fps is (10 sec: 10649.6, 60 sec: 10171.7, 300 sec: 9983.1). Total num frames: 13885440. Throughput: 0: 10168.3. Samples: 13885152. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 06:02:05,122][635489] Avg episode reward: [(0, '4427.315')] [2023-03-09 06:02:09,191][635776] Updated weights for policy 0, policy_version 27200 (0.0005) [2023-03-09 06:02:10,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 9969.2). Total num frames: 13934592. Throughput: 0: 10108.0. Samples: 13914076. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 06:02:10,122][635489] Avg episode reward: [(0, '4457.674')] [2023-03-09 06:02:13,246][635776] Updated weights for policy 0, policy_version 27280 (0.0005) [2023-03-09 06:02:15,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 9969.2). Total num frames: 13983744. Throughput: 0: 10153.4. Samples: 13974228. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 06:02:15,122][635489] Avg episode reward: [(0, '4124.050')] [2023-03-09 06:02:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000027312_13983744.pth... [2023-03-09 06:02:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000026712_13676544.pth [2023-03-09 06:02:17,213][635776] Updated weights for policy 0, policy_version 27360 (0.0005) [2023-03-09 06:02:20,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10103.5, 300 sec: 9969.2). Total num frames: 14032896. Throughput: 0: 10213.2. Samples: 14034784. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 06:02:20,122][635489] Avg episode reward: [(0, '4049.889')] [2023-03-09 06:02:21,383][635776] Updated weights for policy 0, policy_version 27440 (0.0005) [2023-03-09 06:02:25,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 9983.1). Total num frames: 14082048. Throughput: 0: 10173.2. Samples: 14064780. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 06:02:25,122][635489] Avg episode reward: [(0, '3881.857')] [2023-03-09 06:02:25,711][635776] Updated weights for policy 0, policy_version 27520 (0.0005) [2023-03-09 06:02:29,978][635776] Updated weights for policy 0, policy_version 27600 (0.0005) [2023-03-09 06:02:30,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10035.2, 300 sec: 9969.2). Total num frames: 14131200. Throughput: 0: 10083.0. Samples: 14120896. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 06:02:30,122][635489] Avg episode reward: [(0, '3635.337')] [2023-03-09 06:02:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000027600_14131200.pth... [2023-03-09 06:02:30,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000027016_13832192.pth [2023-03-09 06:02:34,299][635776] Updated weights for policy 0, policy_version 27680 (0.0005) [2023-03-09 06:02:35,121][635489] Fps is (10 sec: 9420.9, 60 sec: 10035.2, 300 sec: 9955.4). Total num frames: 14176256. Throughput: 0: 10010.8. Samples: 14178588. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:02:35,122][635489] Avg episode reward: [(0, '4055.125')] [2023-03-09 06:02:38,309][635776] Updated weights for policy 0, policy_version 27760 (0.0005) [2023-03-09 06:02:40,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 9969.2). Total num frames: 14229504. Throughput: 0: 10012.3. Samples: 14209088. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:02:40,122][635489] Avg episode reward: [(0, '4217.178')] [2023-03-09 06:02:42,555][635776] Updated weights for policy 0, policy_version 27840 (0.0005) [2023-03-09 06:02:45,121][635489] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9969.3). Total num frames: 14274560. Throughput: 0: 9900.0. Samples: 14266368. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:02:45,122][635489] Avg episode reward: [(0, '3919.255')] [2023-03-09 06:02:45,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000027880_14274560.pth... [2023-03-09 06:02:45,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000027312_13983744.pth [2023-03-09 06:02:46,895][635776] Updated weights for policy 0, policy_version 27920 (0.0004) [2023-03-09 06:02:50,122][635489] Fps is (10 sec: 9420.8, 60 sec: 9898.7, 300 sec: 9969.2). Total num frames: 14323712. Throughput: 0: 9739.0. Samples: 14323408. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:02:50,122][635489] Avg episode reward: [(0, '3997.395')] [2023-03-09 06:02:51,232][635776] Updated weights for policy 0, policy_version 28000 (0.0004) [2023-03-09 06:02:55,122][635489] Fps is (10 sec: 9830.3, 60 sec: 9898.7, 300 sec: 9969.2). Total num frames: 14372864. Throughput: 0: 9738.9. Samples: 14352328. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:02:55,122][635489] Avg episode reward: [(0, '4386.831')] [2023-03-09 06:02:55,271][635776] Updated weights for policy 0, policy_version 28080 (0.0005) [2023-03-09 06:02:59,249][635776] Updated weights for policy 0, policy_version 28160 (0.0005) [2023-03-09 06:03:00,122][635489] Fps is (10 sec: 10239.9, 60 sec: 9898.7, 300 sec: 9955.4). Total num frames: 14426112. Throughput: 0: 9766.3. Samples: 14413712. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:03:00,122][635489] Avg episode reward: [(0, '4304.215')] [2023-03-09 06:03:00,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000028176_14426112.pth... [2023-03-09 06:03:00,129][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000027600_14131200.pth [2023-03-09 06:03:03,040][635776] Updated weights for policy 0, policy_version 28240 (0.0005) [2023-03-09 06:03:05,122][635489] Fps is (10 sec: 10649.6, 60 sec: 9898.7, 300 sec: 9969.2). Total num frames: 14479360. Throughput: 0: 9879.6. Samples: 14479368. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:03:05,122][635489] Avg episode reward: [(0, '3991.246')] [2023-03-09 06:03:06,690][635776] Updated weights for policy 0, policy_version 28320 (0.0005) [2023-03-09 06:03:10,121][635489] Fps is (10 sec: 10649.8, 60 sec: 9966.9, 300 sec: 9983.1). Total num frames: 14532608. Throughput: 0: 9928.6. Samples: 14511568. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:03:10,122][635489] Avg episode reward: [(0, '4290.629')] [2023-03-09 06:03:10,882][635776] Updated weights for policy 0, policy_version 28400 (0.0005) [2023-03-09 06:03:14,942][635776] Updated weights for policy 0, policy_version 28480 (0.0005) [2023-03-09 06:03:15,122][635489] Fps is (10 sec: 10239.9, 60 sec: 9966.9, 300 sec: 9983.1). Total num frames: 14581760. Throughput: 0: 10018.8. Samples: 14571744. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 06:03:15,122][635489] Avg episode reward: [(0, '3934.658')] [2023-03-09 06:03:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000028480_14581760.pth... [2023-03-09 06:03:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000027880_14274560.pth [2023-03-09 06:03:19,129][635776] Updated weights for policy 0, policy_version 28560 (0.0005) [2023-03-09 06:03:20,121][635489] Fps is (10 sec: 9830.3, 60 sec: 9966.9, 300 sec: 9983.1). Total num frames: 14630912. Throughput: 0: 10040.3. Samples: 14630404. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 06:03:20,122][635489] Avg episode reward: [(0, '3265.940')] [2023-03-09 06:03:23,337][635776] Updated weights for policy 0, policy_version 28640 (0.0005) [2023-03-09 06:03:25,121][635489] Fps is (10 sec: 9420.9, 60 sec: 9898.7, 300 sec: 9969.2). Total num frames: 14675968. Throughput: 0: 10004.2. Samples: 14659276. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 06:03:25,122][635489] Avg episode reward: [(0, '3973.065')] [2023-03-09 06:03:27,793][635776] Updated weights for policy 0, policy_version 28720 (0.0004) [2023-03-09 06:03:30,122][635489] Fps is (10 sec: 9420.8, 60 sec: 9898.7, 300 sec: 9969.2). Total num frames: 14725120. Throughput: 0: 10012.6. Samples: 14716936. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:03:30,122][635489] Avg episode reward: [(0, '4276.895')] [2023-03-09 06:03:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000028760_14725120.pth... [2023-03-09 06:03:30,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000028176_14426112.pth [2023-03-09 06:03:31,747][635776] Updated weights for policy 0, policy_version 28800 (0.0004) [2023-03-09 06:03:35,121][635489] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9969.3). Total num frames: 14774272. Throughput: 0: 10038.8. Samples: 14775152. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:03:35,122][635489] Avg episode reward: [(0, '4265.149')] [2023-03-09 06:03:36,059][635776] Updated weights for policy 0, policy_version 28880 (0.0005) [2023-03-09 06:03:40,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9983.1). Total num frames: 14823424. Throughput: 0: 10029.7. Samples: 14803664. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:03:40,122][635489] Avg episode reward: [(0, '4218.730')] [2023-03-09 06:03:40,331][635776] Updated weights for policy 0, policy_version 28960 (0.0005) [2023-03-09 06:03:44,418][635776] Updated weights for policy 0, policy_version 29040 (0.0005) [2023-03-09 06:03:45,122][635489] Fps is (10 sec: 9830.3, 60 sec: 9966.9, 300 sec: 9983.1). Total num frames: 14872576. Throughput: 0: 10011.7. Samples: 14864240. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:03:45,122][635489] Avg episode reward: [(0, '3914.078')] [2023-03-09 06:03:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000029048_14872576.pth... [2023-03-09 06:03:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000028480_14581760.pth [2023-03-09 06:03:48,519][635776] Updated weights for policy 0, policy_version 29120 (0.0005) [2023-03-09 06:03:50,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9983.1). Total num frames: 14921728. Throughput: 0: 9864.4. Samples: 14923268. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:03:50,122][635489] Avg episode reward: [(0, '4273.729')] [2023-03-09 06:03:52,492][635776] Updated weights for policy 0, policy_version 29200 (0.0005) [2023-03-09 06:03:55,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10035.2, 300 sec: 9983.1). Total num frames: 14974976. Throughput: 0: 9842.5. Samples: 14954480. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:03:55,122][635489] Avg episode reward: [(0, '4381.020')] [2023-03-09 06:03:56,582][635776] Updated weights for policy 0, policy_version 29280 (0.0005) [2023-03-09 06:04:00,122][635489] Fps is (10 sec: 10239.9, 60 sec: 9966.9, 300 sec: 9983.1). Total num frames: 15024128. Throughput: 0: 9811.5. Samples: 15013260. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:04:00,122][635489] Avg episode reward: [(0, '4496.067')] [2023-03-09 06:04:00,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000029344_15024128.pth... [2023-03-09 06:04:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000028760_14725120.pth [2023-03-09 06:04:00,129][635732] Saving new best policy, reward=4496.067! [2023-03-09 06:04:00,705][635776] Updated weights for policy 0, policy_version 29360 (0.0005) [2023-03-09 06:04:04,752][635776] Updated weights for policy 0, policy_version 29440 (0.0004) [2023-03-09 06:04:05,121][635489] Fps is (10 sec: 9830.5, 60 sec: 9898.7, 300 sec: 9983.1). Total num frames: 15073280. Throughput: 0: 9871.9. Samples: 15074640. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:04:05,122][635489] Avg episode reward: [(0, '4571.203')] [2023-03-09 06:04:05,122][635732] Saving new best policy, reward=4571.203! [2023-03-09 06:04:08,660][635776] Updated weights for policy 0, policy_version 29520 (0.0005) [2023-03-09 06:04:10,122][635489] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9997.0). Total num frames: 15126528. Throughput: 0: 9929.7. Samples: 15106112. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:04:10,122][635489] Avg episode reward: [(0, '4573.192')] [2023-03-09 06:04:10,123][635732] Saving new best policy, reward=4573.192! [2023-03-09 06:04:12,479][635776] Updated weights for policy 0, policy_version 29600 (0.0005) [2023-03-09 06:04:15,122][635489] Fps is (10 sec: 10649.5, 60 sec: 9966.9, 300 sec: 9997.0). Total num frames: 15179776. Throughput: 0: 10053.5. Samples: 15169344. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:04:15,122][635489] Avg episode reward: [(0, '4486.471')] [2023-03-09 06:04:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000029648_15179776.pth... [2023-03-09 06:04:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000029048_14872576.pth [2023-03-09 06:04:16,451][635776] Updated weights for policy 0, policy_version 29680 (0.0005) [2023-03-09 06:04:20,121][635489] Fps is (10 sec: 10649.7, 60 sec: 10035.2, 300 sec: 10010.9). Total num frames: 15233024. Throughput: 0: 10170.0. Samples: 15232800. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:04:20,122][635489] Avg episode reward: [(0, '4528.508')] [2023-03-09 06:04:20,356][635776] Updated weights for policy 0, policy_version 29760 (0.0005) [2023-03-09 06:04:24,439][635776] Updated weights for policy 0, policy_version 29840 (0.0006) [2023-03-09 06:04:25,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10010.9). Total num frames: 15282176. Throughput: 0: 10196.5. Samples: 15262508. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:04:25,122][635489] Avg episode reward: [(0, '4455.929')] [2023-03-09 06:04:28,621][635776] Updated weights for policy 0, policy_version 29920 (0.0006) [2023-03-09 06:04:30,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10103.5, 300 sec: 10010.9). Total num frames: 15331328. Throughput: 0: 10179.6. Samples: 15322324. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:04:30,122][635489] Avg episode reward: [(0, '4433.231')] [2023-03-09 06:04:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000029944_15331328.pth... [2023-03-09 06:04:30,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000029344_15024128.pth [2023-03-09 06:04:32,563][635776] Updated weights for policy 0, policy_version 30000 (0.0006) [2023-03-09 06:04:35,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10024.8). Total num frames: 15384576. Throughput: 0: 10230.7. Samples: 15383648. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:04:35,122][635489] Avg episode reward: [(0, '4377.742')] [2023-03-09 06:04:36,764][635776] Updated weights for policy 0, policy_version 30080 (0.0006) [2023-03-09 06:04:40,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10010.9). Total num frames: 15433728. Throughput: 0: 10191.2. Samples: 15413084. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:04:40,122][635489] Avg episode reward: [(0, '4302.339')] [2023-03-09 06:04:40,854][635776] Updated weights for policy 0, policy_version 30160 (0.0005) [2023-03-09 06:04:45,028][635776] Updated weights for policy 0, policy_version 30240 (0.0005) [2023-03-09 06:04:45,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10010.9). Total num frames: 15482880. Throughput: 0: 10205.9. Samples: 15472524. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:04:45,122][635489] Avg episode reward: [(0, '4405.219')] [2023-03-09 06:04:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000030240_15482880.pth... [2023-03-09 06:04:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000029648_15179776.pth [2023-03-09 06:04:48,765][635776] Updated weights for policy 0, policy_version 30320 (0.0005) [2023-03-09 06:04:50,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10024.8). Total num frames: 15536128. Throughput: 0: 10240.4. Samples: 15535460. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:04:50,122][635489] Avg episode reward: [(0, '4500.685')] [2023-03-09 06:04:52,973][635776] Updated weights for policy 0, policy_version 30400 (0.0005) [2023-03-09 06:04:55,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10010.9). Total num frames: 15581184. Throughput: 0: 10187.4. Samples: 15564544. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:04:55,122][635489] Avg episode reward: [(0, '4206.769')] [2023-03-09 06:04:57,246][635776] Updated weights for policy 0, policy_version 30480 (0.0005) [2023-03-09 06:05:00,121][635489] Fps is (10 sec: 9420.9, 60 sec: 10103.5, 300 sec: 10010.9). Total num frames: 15630336. Throughput: 0: 10057.8. Samples: 15621944. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:05:00,122][635489] Avg episode reward: [(0, '3973.549')] [2023-03-09 06:05:00,152][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000030536_15634432.pth... [2023-03-09 06:05:00,153][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000029944_15331328.pth [2023-03-09 06:05:01,435][635776] Updated weights for policy 0, policy_version 30560 (0.0005) [2023-03-09 06:05:05,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10024.8). Total num frames: 15683584. Throughput: 0: 9969.8. Samples: 15681444. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:05:05,122][635489] Avg episode reward: [(0, '4306.004')] [2023-03-09 06:05:05,502][635776] Updated weights for policy 0, policy_version 30640 (0.0006) [2023-03-09 06:05:09,524][635776] Updated weights for policy 0, policy_version 30720 (0.0005) [2023-03-09 06:05:10,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 10024.8). Total num frames: 15732736. Throughput: 0: 9992.4. Samples: 15712164. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 06:05:10,122][635489] Avg episode reward: [(0, '4402.052')] [2023-03-09 06:05:13,414][635776] Updated weights for policy 0, policy_version 30800 (0.0005) [2023-03-09 06:05:15,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10103.5, 300 sec: 10038.7). Total num frames: 15785984. Throughput: 0: 10037.0. Samples: 15773988. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 06:05:15,122][635489] Avg episode reward: [(0, '4343.852')] [2023-03-09 06:05:15,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000030832_15785984.pth... [2023-03-09 06:05:15,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000030240_15482880.pth [2023-03-09 06:05:17,254][635776] Updated weights for policy 0, policy_version 30880 (0.0005) [2023-03-09 06:05:20,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10103.4, 300 sec: 10052.6). Total num frames: 15839232. Throughput: 0: 10108.9. Samples: 15838548. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 06:05:20,122][635489] Avg episode reward: [(0, '4357.725')] [2023-03-09 06:05:21,209][635776] Updated weights for policy 0, policy_version 30960 (0.0005) [2023-03-09 06:05:25,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10038.7). Total num frames: 15888384. Throughput: 0: 10116.3. Samples: 15868316. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 06:05:25,133][635489] Avg episode reward: [(0, '4522.990')] [2023-03-09 06:05:25,313][635776] Updated weights for policy 0, policy_version 31040 (0.0005) [2023-03-09 06:05:29,395][635776] Updated weights for policy 0, policy_version 31120 (0.0005) [2023-03-09 06:05:30,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10038.7). Total num frames: 15937536. Throughput: 0: 10149.7. Samples: 15929260. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 06:05:30,122][635489] Avg episode reward: [(0, '4555.202')] [2023-03-09 06:05:30,190][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000031136_15941632.pth... [2023-03-09 06:05:30,200][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000030536_15634432.pth [2023-03-09 06:05:33,482][635776] Updated weights for policy 0, policy_version 31200 (0.0005) [2023-03-09 06:05:35,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10052.6). Total num frames: 15990784. Throughput: 0: 10095.0. Samples: 15989736. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 06:05:35,122][635489] Avg episode reward: [(0, '4317.984')] [2023-03-09 06:05:37,428][635776] Updated weights for policy 0, policy_version 31280 (0.0005) [2023-03-09 06:05:40,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10052.6). Total num frames: 16039936. Throughput: 0: 10110.7. Samples: 16019528. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 06:05:40,122][635489] Avg episode reward: [(0, '4524.853')] [2023-03-09 06:05:41,484][635776] Updated weights for policy 0, policy_version 31360 (0.0005) [2023-03-09 06:05:45,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10103.5, 300 sec: 10052.6). Total num frames: 16089088. Throughput: 0: 10197.5. Samples: 16080832. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 06:05:45,122][635489] Avg episode reward: [(0, '4368.557')] [2023-03-09 06:05:45,154][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000031432_16093184.pth... [2023-03-09 06:05:45,155][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000030832_15785984.pth [2023-03-09 06:05:45,607][635776] Updated weights for policy 0, policy_version 31440 (0.0005) [2023-03-09 06:05:49,337][635776] Updated weights for policy 0, policy_version 31520 (0.0004) [2023-03-09 06:05:50,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10171.7, 300 sec: 10080.3). Total num frames: 16146432. Throughput: 0: 10293.2. Samples: 16144636. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:05:50,122][635489] Avg episode reward: [(0, '4534.467')] [2023-03-09 06:05:53,167][635776] Updated weights for policy 0, policy_version 31600 (0.0005) [2023-03-09 06:05:55,121][635489] Fps is (10 sec: 10649.7, 60 sec: 10240.0, 300 sec: 10080.3). Total num frames: 16195584. Throughput: 0: 10311.0. Samples: 16176160. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:05:55,122][635489] Avg episode reward: [(0, '4462.702')] [2023-03-09 06:05:57,224][635776] Updated weights for policy 0, policy_version 31680 (0.0005) [2023-03-09 06:06:00,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10308.2, 300 sec: 10080.3). Total num frames: 16248832. Throughput: 0: 10286.2. Samples: 16236868. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:06:00,122][635489] Avg episode reward: [(0, '4600.935')] [2023-03-09 06:06:00,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000031736_16248832.pth... [2023-03-09 06:06:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000031136_15941632.pth [2023-03-09 06:06:00,129][635732] Saving new best policy, reward=4600.935! [2023-03-09 06:06:01,405][635776] Updated weights for policy 0, policy_version 31760 (0.0005) [2023-03-09 06:06:05,121][635489] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10080.3). Total num frames: 16297984. Throughput: 0: 10187.7. Samples: 16296992. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:06:05,122][635489] Avg episode reward: [(0, '4585.269')] [2023-03-09 06:06:05,444][635776] Updated weights for policy 0, policy_version 31840 (0.0004) [2023-03-09 06:06:09,600][635776] Updated weights for policy 0, policy_version 31920 (0.0005) [2023-03-09 06:06:10,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10240.0, 300 sec: 10066.4). Total num frames: 16347136. Throughput: 0: 10186.8. Samples: 16326720. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 06:06:10,122][635489] Avg episode reward: [(0, '4522.017')] [2023-03-09 06:06:13,710][635776] Updated weights for policy 0, policy_version 32000 (0.0004) [2023-03-09 06:06:15,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10066.4). Total num frames: 16396288. Throughput: 0: 10154.8. Samples: 16386224. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 06:06:15,122][635489] Avg episode reward: [(0, '4509.632')] [2023-03-09 06:06:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000032024_16396288.pth... [2023-03-09 06:06:15,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000031432_16093184.pth [2023-03-09 06:06:17,659][635776] Updated weights for policy 0, policy_version 32080 (0.0005) [2023-03-09 06:06:20,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 10080.3). Total num frames: 16449536. Throughput: 0: 10165.3. Samples: 16447176. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 06:06:20,122][635489] Avg episode reward: [(0, '4439.069')] [2023-03-09 06:06:21,641][635776] Updated weights for policy 0, policy_version 32160 (0.0004) [2023-03-09 06:06:25,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10171.7, 300 sec: 10066.4). Total num frames: 16498688. Throughput: 0: 10217.5. Samples: 16479316. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 06:06:25,122][635489] Avg episode reward: [(0, '4506.162')] [2023-03-09 06:06:25,646][635776] Updated weights for policy 0, policy_version 32240 (0.0005) [2023-03-09 06:06:29,814][635776] Updated weights for policy 0, policy_version 32320 (0.0004) [2023-03-09 06:06:30,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10171.7, 300 sec: 10080.3). Total num frames: 16547840. Throughput: 0: 10190.6. Samples: 16539408. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 06:06:30,122][635489] Avg episode reward: [(0, '4591.546')] [2023-03-09 06:06:30,161][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000032328_16551936.pth... [2023-03-09 06:06:30,163][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000031736_16248832.pth [2023-03-09 06:06:33,684][635776] Updated weights for policy 0, policy_version 32400 (0.0004) [2023-03-09 06:06:35,121][635489] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 10080.3). Total num frames: 16601088. Throughput: 0: 10143.1. Samples: 16601076. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 06:06:35,122][635489] Avg episode reward: [(0, '4457.028')] [2023-03-09 06:06:37,747][635776] Updated weights for policy 0, policy_version 32480 (0.0004) [2023-03-09 06:06:40,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10080.3). Total num frames: 16650240. Throughput: 0: 10110.8. Samples: 16631144. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 06:06:40,122][635489] Avg episode reward: [(0, '4538.218')] [2023-03-09 06:06:41,752][635776] Updated weights for policy 0, policy_version 32560 (0.0005) [2023-03-09 06:06:45,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10080.3). Total num frames: 16703488. Throughput: 0: 10131.2. Samples: 16692772. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 06:06:45,122][635489] Avg episode reward: [(0, '4547.179')] [2023-03-09 06:06:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000032624_16703488.pth... [2023-03-09 06:06:45,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000032024_16396288.pth [2023-03-09 06:06:45,791][635776] Updated weights for policy 0, policy_version 32640 (0.0004) [2023-03-09 06:06:50,014][635776] Updated weights for policy 0, policy_version 32720 (0.0005) [2023-03-09 06:06:50,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 10080.3). Total num frames: 16752640. Throughput: 0: 10123.3. Samples: 16752540. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 06:06:50,122][635489] Avg episode reward: [(0, '4464.320')] [2023-03-09 06:06:54,181][635776] Updated weights for policy 0, policy_version 32800 (0.0005) [2023-03-09 06:06:55,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10066.4). Total num frames: 16801792. Throughput: 0: 10103.6. Samples: 16781384. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 06:06:55,122][635489] Avg episode reward: [(0, '4597.409')] [2023-03-09 06:06:58,220][635776] Updated weights for policy 0, policy_version 32880 (0.0005) [2023-03-09 06:07:00,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10052.6). Total num frames: 16850944. Throughput: 0: 10138.4. Samples: 16842452. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 06:07:00,122][635489] Avg episode reward: [(0, '4414.865')] [2023-03-09 06:07:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000032912_16850944.pth... [2023-03-09 06:07:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000032328_16551936.pth [2023-03-09 06:07:02,389][635776] Updated weights for policy 0, policy_version 32960 (0.0005) [2023-03-09 06:07:05,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10103.5, 300 sec: 10066.4). Total num frames: 16904192. Throughput: 0: 10170.2. Samples: 16904832. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 06:07:05,122][635489] Avg episode reward: [(0, '4541.308')] [2023-03-09 06:07:06,121][635776] Updated weights for policy 0, policy_version 33040 (0.0005) [2023-03-09 06:07:10,107][635776] Updated weights for policy 0, policy_version 33120 (0.0005) [2023-03-09 06:07:10,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10171.7, 300 sec: 10080.3). Total num frames: 16957440. Throughput: 0: 10122.3. Samples: 16934820. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 06:07:10,122][635489] Avg episode reward: [(0, '4572.149')] [2023-03-09 06:07:14,288][635776] Updated weights for policy 0, policy_version 33200 (0.0005) [2023-03-09 06:07:15,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 10080.3). Total num frames: 17006592. Throughput: 0: 10109.1. Samples: 16994320. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 06:07:15,122][635489] Avg episode reward: [(0, '4556.542')] [2023-03-09 06:07:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000033216_17006592.pth... [2023-03-09 06:07:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000032624_16703488.pth [2023-03-09 06:07:18,325][635776] Updated weights for policy 0, policy_version 33280 (0.0004) [2023-03-09 06:07:20,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10103.5, 300 sec: 10080.3). Total num frames: 17055744. Throughput: 0: 10066.5. Samples: 17054068. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 06:07:20,122][635489] Avg episode reward: [(0, '4611.624')] [2023-03-09 06:07:20,123][635732] Saving new best policy, reward=4611.624! [2023-03-09 06:07:22,666][635776] Updated weights for policy 0, policy_version 33360 (0.0005) [2023-03-09 06:07:25,121][635489] Fps is (10 sec: 9420.9, 60 sec: 10035.2, 300 sec: 10066.4). Total num frames: 17100800. Throughput: 0: 10040.1. Samples: 17082948. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 06:07:25,122][635489] Avg episode reward: [(0, '4526.040')] [2023-03-09 06:07:26,587][635776] Updated weights for policy 0, policy_version 33440 (0.0005) [2023-03-09 06:07:30,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10103.4, 300 sec: 10094.2). Total num frames: 17154048. Throughput: 0: 10026.7. Samples: 17143976. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 06:07:30,122][635489] Avg episode reward: [(0, '4504.821')] [2023-03-09 06:07:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000033504_17154048.pth... [2023-03-09 06:07:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000032912_16850944.pth [2023-03-09 06:07:30,687][635776] Updated weights for policy 0, policy_version 33520 (0.0005) [2023-03-09 06:07:34,339][635776] Updated weights for policy 0, policy_version 33600 (0.0004) [2023-03-09 06:07:35,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10103.5, 300 sec: 10094.2). Total num frames: 17207296. Throughput: 0: 10135.3. Samples: 17208628. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:07:35,122][635489] Avg episode reward: [(0, '4542.928')] [2023-03-09 06:07:38,329][635776] Updated weights for policy 0, policy_version 33680 (0.0005) [2023-03-09 06:07:40,122][635489] Fps is (10 sec: 10649.7, 60 sec: 10171.7, 300 sec: 10122.0). Total num frames: 17260544. Throughput: 0: 10194.3. Samples: 17240128. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:07:40,122][635489] Avg episode reward: [(0, '4536.551')] [2023-03-09 06:07:42,405][635776] Updated weights for policy 0, policy_version 33760 (0.0005) [2023-03-09 06:07:45,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10171.7, 300 sec: 10135.9). Total num frames: 17313792. Throughput: 0: 10177.0. Samples: 17300416. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:07:45,122][635489] Avg episode reward: [(0, '4539.081')] [2023-03-09 06:07:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000033816_17313792.pth... [2023-03-09 06:07:45,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000033216_17006592.pth [2023-03-09 06:07:46,276][635776] Updated weights for policy 0, policy_version 33840 (0.0005) [2023-03-09 06:07:50,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10103.5, 300 sec: 10122.0). Total num frames: 17358848. Throughput: 0: 10098.5. Samples: 17359264. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:07:50,122][635489] Avg episode reward: [(0, '4554.319')] [2023-03-09 06:07:50,584][635776] Updated weights for policy 0, policy_version 33920 (0.0004) [2023-03-09 06:07:54,495][635776] Updated weights for policy 0, policy_version 34000 (0.0005) [2023-03-09 06:07:55,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10171.7, 300 sec: 10122.0). Total num frames: 17412096. Throughput: 0: 10152.4. Samples: 17391680. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:07:55,122][635489] Avg episode reward: [(0, '4366.687')] [2023-03-09 06:07:58,506][635776] Updated weights for policy 0, policy_version 34080 (0.0004) [2023-03-09 06:08:00,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10108.1). Total num frames: 17461248. Throughput: 0: 10194.2. Samples: 17453056. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:08:00,122][635489] Avg episode reward: [(0, '4330.001')] [2023-03-09 06:08:00,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000034104_17461248.pth... [2023-03-09 06:08:00,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000033504_17154048.pth [2023-03-09 06:08:02,753][635776] Updated weights for policy 0, policy_version 34160 (0.0005) [2023-03-09 06:08:05,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10094.2). Total num frames: 17510400. Throughput: 0: 10143.3. Samples: 17510516. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:08:05,122][635489] Avg episode reward: [(0, '4505.270')] [2023-03-09 06:08:06,746][635776] Updated weights for policy 0, policy_version 34240 (0.0005) [2023-03-09 06:08:10,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10108.1). Total num frames: 17563648. Throughput: 0: 10215.5. Samples: 17542644. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:08:10,122][635489] Avg episode reward: [(0, '4474.796')] [2023-03-09 06:08:10,767][635776] Updated weights for policy 0, policy_version 34320 (0.0004) [2023-03-09 06:08:14,858][635776] Updated weights for policy 0, policy_version 34400 (0.0005) [2023-03-09 06:08:15,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 10108.1). Total num frames: 17612800. Throughput: 0: 10226.7. Samples: 17604176. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:08:15,122][635489] Avg episode reward: [(0, '4550.927')] [2023-03-09 06:08:15,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000034400_17612800.pth... [2023-03-09 06:08:15,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000033816_17313792.pth [2023-03-09 06:08:19,201][635776] Updated weights for policy 0, policy_version 34480 (0.0005) [2023-03-09 06:08:20,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10122.0). Total num frames: 17661952. Throughput: 0: 10064.8. Samples: 17661544. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 06:08:20,122][635489] Avg episode reward: [(0, '4538.957')] [2023-03-09 06:08:23,210][635776] Updated weights for policy 0, policy_version 34560 (0.0005) [2023-03-09 06:08:25,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10122.0). Total num frames: 17711104. Throughput: 0: 10032.3. Samples: 17691580. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 06:08:25,122][635489] Avg episode reward: [(0, '4488.932')] [2023-03-09 06:08:27,279][635776] Updated weights for policy 0, policy_version 34640 (0.0004) [2023-03-09 06:08:30,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 10135.9). Total num frames: 17764352. Throughput: 0: 10036.6. Samples: 17752064. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 06:08:30,122][635489] Avg episode reward: [(0, '4318.032')] [2023-03-09 06:08:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000034696_17764352.pth... [2023-03-09 06:08:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000034104_17461248.pth [2023-03-09 06:08:31,226][635776] Updated weights for policy 0, policy_version 34720 (0.0004) [2023-03-09 06:08:35,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10135.9). Total num frames: 17813504. Throughput: 0: 10086.7. Samples: 17813164. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 06:08:35,122][635489] Avg episode reward: [(0, '4182.581')] [2023-03-09 06:08:35,398][635776] Updated weights for policy 0, policy_version 34800 (0.0005) [2023-03-09 06:08:39,475][635776] Updated weights for policy 0, policy_version 34880 (0.0004) [2023-03-09 06:08:40,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10135.9). Total num frames: 17862656. Throughput: 0: 10013.5. Samples: 17842288. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 06:08:40,122][635489] Avg episode reward: [(0, '4469.633')] [2023-03-09 06:08:43,277][635776] Updated weights for policy 0, policy_version 34960 (0.0005) [2023-03-09 06:08:45,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10035.2, 300 sec: 10149.8). Total num frames: 17915904. Throughput: 0: 10083.8. Samples: 17906828. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:08:45,122][635489] Avg episode reward: [(0, '4456.422')] [2023-03-09 06:08:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000034992_17915904.pth... [2023-03-09 06:08:45,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000034400_17612800.pth [2023-03-09 06:08:47,137][635776] Updated weights for policy 0, policy_version 35040 (0.0005) [2023-03-09 06:08:50,121][635489] Fps is (10 sec: 10649.6, 60 sec: 10171.7, 300 sec: 10149.7). Total num frames: 17969152. Throughput: 0: 10192.2. Samples: 17969164. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:08:50,122][635489] Avg episode reward: [(0, '4543.557')] [2023-03-09 06:08:51,165][635776] Updated weights for policy 0, policy_version 35120 (0.0005) [2023-03-09 06:08:55,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 10149.8). Total num frames: 18018304. Throughput: 0: 10120.3. Samples: 17998056. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:08:55,122][635489] Avg episode reward: [(0, '4201.973')] [2023-03-09 06:08:55,459][635776] Updated weights for policy 0, policy_version 35200 (0.0005) [2023-03-09 06:08:59,452][635776] Updated weights for policy 0, policy_version 35280 (0.0005) [2023-03-09 06:09:00,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10103.5, 300 sec: 10149.7). Total num frames: 18067456. Throughput: 0: 10110.7. Samples: 18059160. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:09:00,122][635489] Avg episode reward: [(0, '3656.387')] [2023-03-09 06:09:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000035288_18067456.pth... [2023-03-09 06:09:00,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000034696_17764352.pth [2023-03-09 06:09:03,427][635776] Updated weights for policy 0, policy_version 35360 (0.0005) [2023-03-09 06:09:05,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10135.9). Total num frames: 18116608. Throughput: 0: 10143.0. Samples: 18117980. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:09:05,122][635489] Avg episode reward: [(0, '3900.940')] [2023-03-09 06:09:07,800][635776] Updated weights for policy 0, policy_version 35440 (0.0005) [2023-03-09 06:09:10,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10035.2, 300 sec: 10122.0). Total num frames: 18165760. Throughput: 0: 10096.2. Samples: 18145908. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:09:10,122][635489] Avg episode reward: [(0, '3778.204')] [2023-03-09 06:09:11,952][635776] Updated weights for policy 0, policy_version 35520 (0.0005) [2023-03-09 06:09:15,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10108.1). Total num frames: 18214912. Throughput: 0: 10067.0. Samples: 18205080. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:09:15,122][635489] Avg episode reward: [(0, '3862.993')] [2023-03-09 06:09:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000035576_18214912.pth... [2023-03-09 06:09:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000034992_17915904.pth [2023-03-09 06:09:16,235][635776] Updated weights for policy 0, policy_version 35600 (0.0004) [2023-03-09 06:09:20,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10108.1). Total num frames: 18264064. Throughput: 0: 10019.2. Samples: 18264028. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:09:20,122][635489] Avg episode reward: [(0, '4015.108')] [2023-03-09 06:09:20,380][635776] Updated weights for policy 0, policy_version 35680 (0.0005) [2023-03-09 06:09:24,544][635776] Updated weights for policy 0, policy_version 35760 (0.0005) [2023-03-09 06:09:25,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10108.1). Total num frames: 18313216. Throughput: 0: 10010.1. Samples: 18292744. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:09:25,122][635489] Avg episode reward: [(0, '3845.927')] [2023-03-09 06:09:28,446][635776] Updated weights for policy 0, policy_version 35840 (0.0005) [2023-03-09 06:09:30,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10108.1). Total num frames: 18366464. Throughput: 0: 9942.5. Samples: 18354240. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 06:09:30,122][635489] Avg episode reward: [(0, '3773.823')] [2023-03-09 06:09:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000035872_18366464.pth... [2023-03-09 06:09:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000035288_18067456.pth [2023-03-09 06:09:32,516][635776] Updated weights for policy 0, policy_version 35920 (0.0005) [2023-03-09 06:09:35,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10108.1). Total num frames: 18415616. Throughput: 0: 9921.5. Samples: 18415632. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 06:09:35,122][635489] Avg episode reward: [(0, '3343.475')] [2023-03-09 06:09:36,566][635776] Updated weights for policy 0, policy_version 36000 (0.0005) [2023-03-09 06:09:40,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10108.1). Total num frames: 18464768. Throughput: 0: 9963.9. Samples: 18446432. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 06:09:40,122][635489] Avg episode reward: [(0, '3024.357')] [2023-03-09 06:09:40,886][635776] Updated weights for policy 0, policy_version 36080 (0.0004) [2023-03-09 06:09:45,121][635489] Fps is (10 sec: 9420.8, 60 sec: 9898.7, 300 sec: 10080.3). Total num frames: 18509824. Throughput: 0: 9826.9. Samples: 18501368. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 06:09:45,122][635489] Avg episode reward: [(0, '3229.217')] [2023-03-09 06:09:45,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000036152_18509824.pth... [2023-03-09 06:09:45,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000035576_18214912.pth [2023-03-09 06:09:45,238][635776] Updated weights for policy 0, policy_version 36160 (0.0005) [2023-03-09 06:09:48,950][635776] Updated weights for policy 0, policy_version 36240 (0.0004) [2023-03-09 06:09:50,121][635489] Fps is (10 sec: 9830.5, 60 sec: 9898.7, 300 sec: 10108.1). Total num frames: 18563072. Throughput: 0: 9896.4. Samples: 18563316. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 06:09:50,122][635489] Avg episode reward: [(0, '3205.725')] [2023-03-09 06:09:53,126][635776] Updated weights for policy 0, policy_version 36320 (0.0004) [2023-03-09 06:09:55,122][635489] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 10108.1). Total num frames: 18612224. Throughput: 0: 9944.2. Samples: 18593396. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:09:55,122][635489] Avg episode reward: [(0, '3034.392')] [2023-03-09 06:09:57,247][635776] Updated weights for policy 0, policy_version 36400 (0.0005) [2023-03-09 06:10:00,122][635489] Fps is (10 sec: 10239.9, 60 sec: 9966.9, 300 sec: 10108.1). Total num frames: 18665472. Throughput: 0: 9967.5. Samples: 18653616. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:10:00,122][635489] Avg episode reward: [(0, '3833.899')] [2023-03-09 06:10:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000036456_18665472.pth... [2023-03-09 06:10:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000035872_18366464.pth [2023-03-09 06:10:01,388][635776] Updated weights for policy 0, policy_version 36480 (0.0004) [2023-03-09 06:10:05,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10094.2). Total num frames: 18710528. Throughput: 0: 9951.1. Samples: 18711828. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:10:05,122][635489] Avg episode reward: [(0, '4043.748')] [2023-03-09 06:10:05,518][635776] Updated weights for policy 0, policy_version 36560 (0.0004) [2023-03-09 06:10:09,731][635776] Updated weights for policy 0, policy_version 36640 (0.0005) [2023-03-09 06:10:10,121][635489] Fps is (10 sec: 9420.9, 60 sec: 9898.7, 300 sec: 10080.3). Total num frames: 18759680. Throughput: 0: 9995.1. Samples: 18742524. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:10:10,122][635489] Avg episode reward: [(0, '3683.699')] [2023-03-09 06:10:13,850][635776] Updated weights for policy 0, policy_version 36720 (0.0005) [2023-03-09 06:10:15,121][635489] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10066.4). Total num frames: 18808832. Throughput: 0: 9921.6. Samples: 18800712. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:10:15,122][635489] Avg episode reward: [(0, '4162.052')] [2023-03-09 06:10:15,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000036736_18808832.pth... [2023-03-09 06:10:15,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000036152_18509824.pth [2023-03-09 06:10:18,180][635776] Updated weights for policy 0, policy_version 36800 (0.0004) [2023-03-09 06:10:20,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10066.4). Total num frames: 18857984. Throughput: 0: 9831.5. Samples: 18858048. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:10:20,122][635489] Avg episode reward: [(0, '3876.782')] [2023-03-09 06:10:22,253][635776] Updated weights for policy 0, policy_version 36880 (0.0005) [2023-03-09 06:10:25,122][635489] Fps is (10 sec: 10239.9, 60 sec: 9966.9, 300 sec: 10080.3). Total num frames: 18911232. Throughput: 0: 9843.0. Samples: 18889368. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:10:25,122][635489] Avg episode reward: [(0, '3525.717')] [2023-03-09 06:10:25,848][635776] Updated weights for policy 0, policy_version 36960 (0.0005) [2023-03-09 06:10:29,847][635776] Updated weights for policy 0, policy_version 37040 (0.0005) [2023-03-09 06:10:30,122][635489] Fps is (10 sec: 10649.5, 60 sec: 9966.9, 300 sec: 10080.3). Total num frames: 18964480. Throughput: 0: 10085.2. Samples: 18955204. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:10:30,122][635489] Avg episode reward: [(0, '3810.720')] [2023-03-09 06:10:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000037040_18964480.pth... [2023-03-09 06:10:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000036456_18665472.pth [2023-03-09 06:10:34,067][635776] Updated weights for policy 0, policy_version 37120 (0.0004) [2023-03-09 06:10:35,122][635489] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 10080.3). Total num frames: 19013632. Throughput: 0: 10004.9. Samples: 19013536. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:10:35,122][635489] Avg episode reward: [(0, '3790.224')] [2023-03-09 06:10:38,200][635776] Updated weights for policy 0, policy_version 37200 (0.0005) [2023-03-09 06:10:40,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 10080.3). Total num frames: 19062784. Throughput: 0: 10002.0. Samples: 19043488. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 06:10:40,122][635489] Avg episode reward: [(0, '3911.979')] [2023-03-09 06:10:42,459][635776] Updated weights for policy 0, policy_version 37280 (0.0005) [2023-03-09 06:10:45,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10052.6). Total num frames: 19111936. Throughput: 0: 9950.5. Samples: 19101388. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 06:10:45,133][635489] Avg episode reward: [(0, '3317.702')] [2023-03-09 06:10:45,135][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000037328_19111936.pth... [2023-03-09 06:10:45,138][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000036736_18808832.pth [2023-03-09 06:10:46,857][635776] Updated weights for policy 0, policy_version 37360 (0.0005) [2023-03-09 06:10:50,122][635489] Fps is (10 sec: 9420.8, 60 sec: 9898.7, 300 sec: 10038.7). Total num frames: 19156992. Throughput: 0: 9887.6. Samples: 19156772. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 06:10:50,122][635489] Avg episode reward: [(0, '3418.245')] [2023-03-09 06:10:51,274][635776] Updated weights for policy 0, policy_version 37440 (0.0005) [2023-03-09 06:10:55,122][635489] Fps is (10 sec: 9420.9, 60 sec: 9898.7, 300 sec: 10024.8). Total num frames: 19206144. Throughput: 0: 9830.6. Samples: 19184900. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 06:10:55,122][635489] Avg episode reward: [(0, '4091.195')] [2023-03-09 06:10:55,497][635776] Updated weights for policy 0, policy_version 37520 (0.0005) [2023-03-09 06:10:59,506][635776] Updated weights for policy 0, policy_version 37600 (0.0005) [2023-03-09 06:11:00,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10024.8). Total num frames: 19255296. Throughput: 0: 9874.8. Samples: 19245080. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 06:11:00,122][635489] Avg episode reward: [(0, '4147.756')] [2023-03-09 06:11:00,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000037608_19255296.pth... [2023-03-09 06:11:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000037040_18964480.pth [2023-03-09 06:11:03,557][635776] Updated weights for policy 0, policy_version 37680 (0.0005) [2023-03-09 06:11:05,121][635489] Fps is (10 sec: 9830.5, 60 sec: 9898.7, 300 sec: 10024.8). Total num frames: 19304448. Throughput: 0: 9939.0. Samples: 19305304. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:11:05,128][635489] Avg episode reward: [(0, '4092.116')] [2023-03-09 06:11:07,820][635776] Updated weights for policy 0, policy_version 37760 (0.0005) [2023-03-09 06:11:10,121][635489] Fps is (10 sec: 9830.5, 60 sec: 9898.7, 300 sec: 10024.8). Total num frames: 19353600. Throughput: 0: 9870.1. Samples: 19333524. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:11:10,132][635489] Avg episode reward: [(0, '4304.928')] [2023-03-09 06:11:11,725][635776] Updated weights for policy 0, policy_version 37840 (0.0005) [2023-03-09 06:11:15,121][635489] Fps is (10 sec: 10239.9, 60 sec: 9966.9, 300 sec: 10024.8). Total num frames: 19406848. Throughput: 0: 9805.6. Samples: 19396456. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:11:15,122][635489] Avg episode reward: [(0, '3521.105')] [2023-03-09 06:11:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000037904_19406848.pth... [2023-03-09 06:11:15,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000037328_19111936.pth [2023-03-09 06:11:15,743][635776] Updated weights for policy 0, policy_version 37920 (0.0004) [2023-03-09 06:11:19,698][635776] Updated weights for policy 0, policy_version 38000 (0.0005) [2023-03-09 06:11:20,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10035.2, 300 sec: 10038.7). Total num frames: 19460096. Throughput: 0: 9882.7. Samples: 19458256. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:11:20,133][635489] Avg episode reward: [(0, '3213.377')] [2023-03-09 06:11:23,860][635776] Updated weights for policy 0, policy_version 38080 (0.0005) [2023-03-09 06:11:25,121][635489] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10024.8). Total num frames: 19505152. Throughput: 0: 9871.5. Samples: 19487704. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:11:25,132][635489] Avg episode reward: [(0, '3637.332')] [2023-03-09 06:11:28,195][635776] Updated weights for policy 0, policy_version 38160 (0.0005) [2023-03-09 06:11:30,121][635489] Fps is (10 sec: 9420.9, 60 sec: 9830.4, 300 sec: 10010.9). Total num frames: 19554304. Throughput: 0: 9847.1. Samples: 19544504. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 06:11:30,132][635489] Avg episode reward: [(0, '3900.664')] [2023-03-09 06:11:30,135][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000038192_19554304.pth... [2023-03-09 06:11:30,137][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000037608_19255296.pth [2023-03-09 06:11:32,480][635776] Updated weights for policy 0, policy_version 38240 (0.0005) [2023-03-09 06:11:35,121][635489] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10010.9). Total num frames: 19603456. Throughput: 0: 9878.1. Samples: 19601284. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 06:11:35,132][635489] Avg episode reward: [(0, '3603.560')] [2023-03-09 06:11:36,916][635776] Updated weights for policy 0, policy_version 38320 (0.0004) [2023-03-09 06:11:40,121][635489] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9983.1). Total num frames: 19648512. Throughput: 0: 9856.7. Samples: 19628452. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 06:11:40,133][635489] Avg episode reward: [(0, '3654.524')] [2023-03-09 06:11:41,186][635776] Updated weights for policy 0, policy_version 38400 (0.0005) [2023-03-09 06:11:45,122][635489] Fps is (10 sec: 9420.7, 60 sec: 9762.1, 300 sec: 9983.1). Total num frames: 19697664. Throughput: 0: 9808.9. Samples: 19686480. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 06:11:45,133][635489] Avg episode reward: [(0, '3529.389')] [2023-03-09 06:11:45,136][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000038472_19697664.pth... [2023-03-09 06:11:45,139][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000037904_19406848.pth [2023-03-09 06:11:45,412][635776] Updated weights for policy 0, policy_version 38480 (0.0005) [2023-03-09 06:11:49,847][635776] Updated weights for policy 0, policy_version 38560 (0.0005) [2023-03-09 06:11:50,121][635489] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9969.2). Total num frames: 19742720. Throughput: 0: 9721.9. Samples: 19742792. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 06:11:50,133][635489] Avg episode reward: [(0, '3532.012')] [2023-03-09 06:11:54,222][635776] Updated weights for policy 0, policy_version 38640 (0.0005) [2023-03-09 06:11:55,122][635489] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9969.2). Total num frames: 19791872. Throughput: 0: 9729.9. Samples: 19771372. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:11:55,133][635489] Avg episode reward: [(0, '4054.596')] [2023-03-09 06:11:58,399][635776] Updated weights for policy 0, policy_version 38720 (0.0005) [2023-03-09 06:12:00,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9955.4). Total num frames: 19841024. Throughput: 0: 9607.8. Samples: 19828808. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:12:00,133][635489] Avg episode reward: [(0, '4121.966')] [2023-03-09 06:12:00,137][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000038752_19841024.pth... [2023-03-09 06:12:00,139][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000038192_19554304.pth [2023-03-09 06:12:02,493][635776] Updated weights for policy 0, policy_version 38800 (0.0005) [2023-03-09 06:12:05,122][635489] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9927.6). Total num frames: 19886080. Throughput: 0: 9558.8. Samples: 19888400. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:12:05,122][635489] Avg episode reward: [(0, '3839.948')] [2023-03-09 06:12:06,743][635776] Updated weights for policy 0, policy_version 38880 (0.0005) [2023-03-09 06:12:10,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9941.5). Total num frames: 19939328. Throughput: 0: 9578.4. Samples: 19918732. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:12:10,122][635489] Avg episode reward: [(0, '3631.901')] [2023-03-09 06:12:10,932][635776] Updated weights for policy 0, policy_version 38960 (0.0005) [2023-03-09 06:12:14,795][635776] Updated weights for policy 0, policy_version 39040 (0.0005) [2023-03-09 06:12:15,121][635489] Fps is (10 sec: 10240.1, 60 sec: 9693.9, 300 sec: 9941.5). Total num frames: 19988480. Throughput: 0: 9618.6. Samples: 19977340. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:12:15,122][635489] Avg episode reward: [(0, '3496.034')] [2023-03-09 06:12:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000039040_19988480.pth... [2023-03-09 06:12:15,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000038472_19697664.pth [2023-03-09 06:12:18,988][635776] Updated weights for policy 0, policy_version 39120 (0.0005) [2023-03-09 06:12:20,121][635489] Fps is (10 sec: 10240.0, 60 sec: 9693.9, 300 sec: 9969.2). Total num frames: 20041728. Throughput: 0: 9726.1. Samples: 20038960. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:12:20,122][635489] Avg episode reward: [(0, '3508.338')] [2023-03-09 06:12:22,826][635776] Updated weights for policy 0, policy_version 39200 (0.0005) [2023-03-09 06:12:25,121][635489] Fps is (10 sec: 10240.0, 60 sec: 9762.1, 300 sec: 9955.4). Total num frames: 20090880. Throughput: 0: 9827.5. Samples: 20070688. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:12:25,122][635489] Avg episode reward: [(0, '3728.958')] [2023-03-09 06:12:26,888][635776] Updated weights for policy 0, policy_version 39280 (0.0004) [2023-03-09 06:12:30,121][635489] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9941.5). Total num frames: 20140032. Throughput: 0: 9894.0. Samples: 20131708. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:12:30,122][635489] Avg episode reward: [(0, '4212.797')] [2023-03-09 06:12:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000039336_20140032.pth... [2023-03-09 06:12:30,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000038752_19841024.pth [2023-03-09 06:12:31,037][635776] Updated weights for policy 0, policy_version 39360 (0.0005) [2023-03-09 06:12:35,117][635776] Updated weights for policy 0, policy_version 39440 (0.0004) [2023-03-09 06:12:35,121][635489] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 9941.5). Total num frames: 20193280. Throughput: 0: 9963.4. Samples: 20191144. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:12:35,122][635489] Avg episode reward: [(0, '4328.923')] [2023-03-09 06:12:39,090][635776] Updated weights for policy 0, policy_version 39520 (0.0005) [2023-03-09 06:12:40,122][635489] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9927.6). Total num frames: 20242432. Throughput: 0: 10013.2. Samples: 20221968. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:12:40,122][635489] Avg episode reward: [(0, '4549.071')] [2023-03-09 06:12:42,937][635776] Updated weights for policy 0, policy_version 39600 (0.0005) [2023-03-09 06:12:45,121][635489] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 9955.4). Total num frames: 20295680. Throughput: 0: 10181.4. Samples: 20286972. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 06:12:45,122][635489] Avg episode reward: [(0, '4446.954')] [2023-03-09 06:12:45,139][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000039648_20299776.pth... [2023-03-09 06:12:45,141][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000039040_19988480.pth [2023-03-09 06:12:46,772][635776] Updated weights for policy 0, policy_version 39680 (0.0004) [2023-03-09 06:12:50,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10103.5, 300 sec: 9955.4). Total num frames: 20348928. Throughput: 0: 10223.3. Samples: 20348448. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 06:12:50,122][635489] Avg episode reward: [(0, '4386.965')] [2023-03-09 06:12:50,883][635776] Updated weights for policy 0, policy_version 39760 (0.0005) [2023-03-09 06:12:54,776][635776] Updated weights for policy 0, policy_version 39840 (0.0004) [2023-03-09 06:12:55,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10171.7, 300 sec: 9969.2). Total num frames: 20402176. Throughput: 0: 10221.9. Samples: 20378716. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 06:12:55,122][635489] Avg episode reward: [(0, '4294.113')] [2023-03-09 06:12:59,064][635776] Updated weights for policy 0, policy_version 39920 (0.0005) [2023-03-09 06:13:00,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 9955.4). Total num frames: 20447232. Throughput: 0: 10235.1. Samples: 20437920. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 06:13:00,122][635489] Avg episode reward: [(0, '4342.233')] [2023-03-09 06:13:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000039936_20447232.pth... [2023-03-09 06:13:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000039336_20140032.pth [2023-03-09 06:13:03,225][635776] Updated weights for policy 0, policy_version 40000 (0.0004) [2023-03-09 06:13:05,121][635489] Fps is (10 sec: 9420.8, 60 sec: 10171.7, 300 sec: 9941.5). Total num frames: 20496384. Throughput: 0: 10166.4. Samples: 20496448. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 06:13:05,122][635489] Avg episode reward: [(0, '4177.487')] [2023-03-09 06:13:07,363][635776] Updated weights for policy 0, policy_version 40080 (0.0004) [2023-03-09 06:13:10,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 9941.5). Total num frames: 20545536. Throughput: 0: 10109.1. Samples: 20525596. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 06:13:10,122][635489] Avg episode reward: [(0, '4330.981')] [2023-03-09 06:13:11,731][635776] Updated weights for policy 0, policy_version 40160 (0.0004) [2023-03-09 06:13:15,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10103.5, 300 sec: 9941.5). Total num frames: 20594688. Throughput: 0: 10017.9. Samples: 20582516. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 06:13:15,122][635489] Avg episode reward: [(0, '4356.043')] [2023-03-09 06:13:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000040224_20594688.pth... [2023-03-09 06:13:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000039648_20299776.pth [2023-03-09 06:13:15,829][635776] Updated weights for policy 0, policy_version 40240 (0.0004) [2023-03-09 06:13:20,121][635489] Fps is (10 sec: 9420.9, 60 sec: 9966.9, 300 sec: 9927.6). Total num frames: 20639744. Throughput: 0: 9971.2. Samples: 20639848. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 06:13:20,122][635489] Avg episode reward: [(0, '4472.863')] [2023-03-09 06:13:20,286][635776] Updated weights for policy 0, policy_version 40320 (0.0005) [2023-03-09 06:13:24,522][635776] Updated weights for policy 0, policy_version 40400 (0.0005) [2023-03-09 06:13:25,122][635489] Fps is (10 sec: 9420.8, 60 sec: 9966.9, 300 sec: 9913.7). Total num frames: 20688896. Throughput: 0: 9927.6. Samples: 20668712. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 06:13:25,122][635489] Avg episode reward: [(0, '4360.829')] [2023-03-09 06:13:28,596][635776] Updated weights for policy 0, policy_version 40480 (0.0005) [2023-03-09 06:13:30,122][635489] Fps is (10 sec: 9830.3, 60 sec: 9966.9, 300 sec: 9913.7). Total num frames: 20738048. Throughput: 0: 9829.9. Samples: 20729320. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 06:13:30,122][635489] Avg episode reward: [(0, '4444.171')] [2023-03-09 06:13:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000040504_20738048.pth... [2023-03-09 06:13:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000039936_20447232.pth [2023-03-09 06:13:32,830][635776] Updated weights for policy 0, policy_version 40560 (0.0005) [2023-03-09 06:13:35,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9913.7). Total num frames: 20787200. Throughput: 0: 9750.3. Samples: 20787212. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:13:35,122][635489] Avg episode reward: [(0, '4472.861')] [2023-03-09 06:13:36,839][635776] Updated weights for policy 0, policy_version 40640 (0.0005) [2023-03-09 06:13:40,122][635489] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 9913.7). Total num frames: 20840448. Throughput: 0: 9752.7. Samples: 20817588. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:13:40,122][635489] Avg episode reward: [(0, '4166.404')] [2023-03-09 06:13:40,880][635776] Updated weights for policy 0, policy_version 40720 (0.0004) [2023-03-09 06:13:44,916][635776] Updated weights for policy 0, policy_version 40800 (0.0004) [2023-03-09 06:13:45,122][635489] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9899.8). Total num frames: 20889600. Throughput: 0: 9817.8. Samples: 20879720. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:13:45,122][635489] Avg episode reward: [(0, '4561.173')] [2023-03-09 06:13:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000040800_20889600.pth... [2023-03-09 06:13:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000040224_20594688.pth [2023-03-09 06:13:48,650][635776] Updated weights for policy 0, policy_version 40880 (0.0005) [2023-03-09 06:13:50,121][635489] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9913.7). Total num frames: 20942848. Throughput: 0: 9920.2. Samples: 20942856. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:13:50,122][635489] Avg episode reward: [(0, '4551.384')] [2023-03-09 06:13:52,599][635776] Updated weights for policy 0, policy_version 40960 (0.0005) [2023-03-09 06:13:55,122][635489] Fps is (10 sec: 10649.6, 60 sec: 9898.7, 300 sec: 9927.6). Total num frames: 20996096. Throughput: 0: 9990.7. Samples: 20975180. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:13:55,122][635489] Avg episode reward: [(0, '4443.203')] [2023-03-09 06:13:56,608][635776] Updated weights for policy 0, policy_version 41040 (0.0005) [2023-03-09 06:14:00,122][635489] Fps is (10 sec: 10239.9, 60 sec: 9966.9, 300 sec: 9927.6). Total num frames: 21045248. Throughput: 0: 10053.0. Samples: 21034900. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:14:00,122][635489] Avg episode reward: [(0, '4420.552')] [2023-03-09 06:14:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000041104_21045248.pth... [2023-03-09 06:14:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000040504_20738048.pth [2023-03-09 06:14:00,824][635776] Updated weights for policy 0, policy_version 41120 (0.0005) [2023-03-09 06:14:04,953][635776] Updated weights for policy 0, policy_version 41200 (0.0005) [2023-03-09 06:14:05,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9927.6). Total num frames: 21094400. Throughput: 0: 10098.1. Samples: 21094264. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:14:05,122][635489] Avg episode reward: [(0, '4184.712')] [2023-03-09 06:14:09,164][635776] Updated weights for policy 0, policy_version 41280 (0.0005) [2023-03-09 06:14:10,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9927.6). Total num frames: 21143552. Throughput: 0: 10094.4. Samples: 21122960. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:14:10,122][635489] Avg episode reward: [(0, '3782.732')] [2023-03-09 06:14:13,269][635776] Updated weights for policy 0, policy_version 41360 (0.0005) [2023-03-09 06:14:15,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9927.6). Total num frames: 21192704. Throughput: 0: 10070.9. Samples: 21182512. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:14:15,122][635489] Avg episode reward: [(0, '4149.587')] [2023-03-09 06:14:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000041392_21192704.pth... [2023-03-09 06:14:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000040800_20889600.pth [2023-03-09 06:14:17,352][635776] Updated weights for policy 0, policy_version 41440 (0.0004) [2023-03-09 06:14:20,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 9927.6). Total num frames: 21241856. Throughput: 0: 10103.0. Samples: 21241848. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:14:20,122][635489] Avg episode reward: [(0, '4389.692')] [2023-03-09 06:14:21,534][635776] Updated weights for policy 0, policy_version 41520 (0.0005) [2023-03-09 06:14:25,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10035.2, 300 sec: 9913.7). Total num frames: 21291008. Throughput: 0: 10082.3. Samples: 21271292. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 06:14:25,122][635489] Avg episode reward: [(0, '4480.786')] [2023-03-09 06:14:25,620][635776] Updated weights for policy 0, policy_version 41600 (0.0005) [2023-03-09 06:14:29,655][635776] Updated weights for policy 0, policy_version 41680 (0.0005) [2023-03-09 06:14:30,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 9927.6). Total num frames: 21344256. Throughput: 0: 10051.6. Samples: 21332040. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 06:14:30,122][635489] Avg episode reward: [(0, '4379.752')] [2023-03-09 06:14:30,127][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000041688_21344256.pth... [2023-03-09 06:14:30,129][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000041104_21045248.pth [2023-03-09 06:14:33,812][635776] Updated weights for policy 0, policy_version 41760 (0.0005) [2023-03-09 06:14:35,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 9913.7). Total num frames: 21389312. Throughput: 0: 9971.7. Samples: 21391580. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 06:14:35,122][635489] Avg episode reward: [(0, '4463.366')] [2023-03-09 06:14:38,035][635776] Updated weights for policy 0, policy_version 41840 (0.0004) [2023-03-09 06:14:40,121][635489] Fps is (10 sec: 9420.9, 60 sec: 9966.9, 300 sec: 9927.6). Total num frames: 21438464. Throughput: 0: 9905.4. Samples: 21420924. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 06:14:40,122][635489] Avg episode reward: [(0, '4527.887')] [2023-03-09 06:14:42,234][635776] Updated weights for policy 0, policy_version 41920 (0.0004) [2023-03-09 06:14:45,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10035.2, 300 sec: 9927.6). Total num frames: 21491712. Throughput: 0: 9874.4. Samples: 21479248. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 06:14:45,122][635489] Avg episode reward: [(0, '4498.666')] [2023-03-09 06:14:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000041976_21491712.pth... [2023-03-09 06:14:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000041392_21192704.pth [2023-03-09 06:14:46,252][635776] Updated weights for policy 0, policy_version 42000 (0.0005) [2023-03-09 06:14:50,075][635776] Updated weights for policy 0, policy_version 42080 (0.0004) [2023-03-09 06:14:50,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10035.2, 300 sec: 9941.5). Total num frames: 21544960. Throughput: 0: 9992.8. Samples: 21543940. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 06:14:50,122][635489] Avg episode reward: [(0, '4107.423')] [2023-03-09 06:14:54,183][635776] Updated weights for policy 0, policy_version 42160 (0.0005) [2023-03-09 06:14:55,121][635489] Fps is (10 sec: 10240.1, 60 sec: 9966.9, 300 sec: 9927.6). Total num frames: 21594112. Throughput: 0: 10014.5. Samples: 21573612. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 06:14:55,122][635489] Avg episode reward: [(0, '3995.931')] [2023-03-09 06:14:58,113][635776] Updated weights for policy 0, policy_version 42240 (0.0005) [2023-03-09 06:15:00,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9955.4). Total num frames: 21647360. Throughput: 0: 10056.4. Samples: 21635052. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 06:15:00,122][635489] Avg episode reward: [(0, '3563.836')] [2023-03-09 06:15:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000042280_21647360.pth... [2023-03-09 06:15:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000041688_21344256.pth [2023-03-09 06:15:02,078][635776] Updated weights for policy 0, policy_version 42320 (0.0005) [2023-03-09 06:15:05,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10035.2, 300 sec: 9955.4). Total num frames: 21696512. Throughput: 0: 10102.7. Samples: 21696472. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 06:15:05,122][635489] Avg episode reward: [(0, '3313.817')] [2023-03-09 06:15:06,198][635776] Updated weights for policy 0, policy_version 42400 (0.0005) [2023-03-09 06:15:09,985][635776] Updated weights for policy 0, policy_version 42480 (0.0005) [2023-03-09 06:15:10,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 9969.2). Total num frames: 21749760. Throughput: 0: 10145.0. Samples: 21727820. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 06:15:10,122][635489] Avg episode reward: [(0, '3014.821')] [2023-03-09 06:15:13,942][635776] Updated weights for policy 0, policy_version 42560 (0.0005) [2023-03-09 06:15:15,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10103.5, 300 sec: 9969.2). Total num frames: 21798912. Throughput: 0: 10189.9. Samples: 21790584. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 06:15:15,122][635489] Avg episode reward: [(0, '2969.334')] [2023-03-09 06:15:15,145][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000042584_21803008.pth... [2023-03-09 06:15:15,147][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000041976_21491712.pth [2023-03-09 06:15:18,062][635776] Updated weights for policy 0, policy_version 42640 (0.0005) [2023-03-09 06:15:20,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 9955.4). Total num frames: 21848064. Throughput: 0: 10156.3. Samples: 21848616. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 06:15:20,122][635489] Avg episode reward: [(0, '3145.806')] [2023-03-09 06:15:22,270][635776] Updated weights for policy 0, policy_version 42720 (0.0005) [2023-03-09 06:15:25,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 9955.4). Total num frames: 21901312. Throughput: 0: 10190.2. Samples: 21879484. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 06:15:25,122][635489] Avg episode reward: [(0, '3410.604')] [2023-03-09 06:15:26,315][635776] Updated weights for policy 0, policy_version 42800 (0.0005) [2023-03-09 06:15:30,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 9955.4). Total num frames: 21950464. Throughput: 0: 10230.5. Samples: 21939620. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 06:15:30,122][635489] Avg episode reward: [(0, '3641.047')] [2023-03-09 06:15:30,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000042872_21950464.pth... [2023-03-09 06:15:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000042280_21647360.pth [2023-03-09 06:15:30,315][635776] Updated weights for policy 0, policy_version 42880 (0.0005) [2023-03-09 06:15:34,414][635776] Updated weights for policy 0, policy_version 42960 (0.0005) [2023-03-09 06:15:35,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 9955.4). Total num frames: 21999616. Throughput: 0: 10127.6. Samples: 21999680. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 06:15:35,122][635489] Avg episode reward: [(0, '3452.571')] [2023-03-09 06:15:38,397][635776] Updated weights for policy 0, policy_version 43040 (0.0005) [2023-03-09 06:15:40,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 9969.2). Total num frames: 22052864. Throughput: 0: 10167.3. Samples: 22031140. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 06:15:40,122][635489] Avg episode reward: [(0, '3379.791')] [2023-03-09 06:15:42,610][635776] Updated weights for policy 0, policy_version 43120 (0.0005) [2023-03-09 06:15:45,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 9969.2). Total num frames: 22097920. Throughput: 0: 10107.4. Samples: 22089884. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 06:15:45,122][635489] Avg episode reward: [(0, '2262.182')] [2023-03-09 06:15:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000043168_22102016.pth... [2023-03-09 06:15:45,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000042584_21803008.pth [2023-03-09 06:15:46,774][635776] Updated weights for policy 0, policy_version 43200 (0.0005) [2023-03-09 06:15:50,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10103.5, 300 sec: 9983.1). Total num frames: 22151168. Throughput: 0: 10103.0. Samples: 22151104. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:15:50,122][635489] Avg episode reward: [(0, '1925.102')] [2023-03-09 06:15:50,716][635776] Updated weights for policy 0, policy_version 43280 (0.0005) [2023-03-09 06:15:54,812][635776] Updated weights for policy 0, policy_version 43360 (0.0005) [2023-03-09 06:15:55,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 9983.1). Total num frames: 22200320. Throughput: 0: 10055.4. Samples: 22180312. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:15:55,122][635489] Avg episode reward: [(0, '2613.188')] [2023-03-09 06:15:58,751][635776] Updated weights for policy 0, policy_version 43440 (0.0005) [2023-03-09 06:16:00,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 9997.0). Total num frames: 22253568. Throughput: 0: 10049.1. Samples: 22242796. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:16:00,122][635489] Avg episode reward: [(0, '2020.815')] [2023-03-09 06:16:00,127][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000043464_22253568.pth... [2023-03-09 06:16:00,132][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000042872_21950464.pth [2023-03-09 06:16:02,846][635776] Updated weights for policy 0, policy_version 43520 (0.0005) [2023-03-09 06:16:05,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 9997.0). Total num frames: 22302720. Throughput: 0: 10087.1. Samples: 22302536. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:16:05,122][635489] Avg episode reward: [(0, '1854.110')] [2023-03-09 06:16:07,008][635776] Updated weights for policy 0, policy_version 43600 (0.0005) [2023-03-09 06:16:10,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 9983.1). Total num frames: 22351872. Throughput: 0: 10044.0. Samples: 22331464. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:16:10,122][635489] Avg episode reward: [(0, '2495.560')] [2023-03-09 06:16:11,058][635776] Updated weights for policy 0, policy_version 43680 (0.0005) [2023-03-09 06:16:14,930][635776] Updated weights for policy 0, policy_version 43760 (0.0004) [2023-03-09 06:16:15,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10103.4, 300 sec: 9983.1). Total num frames: 22405120. Throughput: 0: 10083.0. Samples: 22393356. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:16:15,122][635489] Avg episode reward: [(0, '2540.204')] [2023-03-09 06:16:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000043760_22405120.pth... [2023-03-09 06:16:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000043168_22102016.pth [2023-03-09 06:16:18,873][635776] Updated weights for policy 0, policy_version 43840 (0.0005) [2023-03-09 06:16:20,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10171.7, 300 sec: 10010.9). Total num frames: 22458368. Throughput: 0: 10159.2. Samples: 22456844. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:16:20,122][635489] Avg episode reward: [(0, '2983.908')] [2023-03-09 06:16:22,786][635776] Updated weights for policy 0, policy_version 43920 (0.0005) [2023-03-09 06:16:25,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10103.5, 300 sec: 10010.9). Total num frames: 22507520. Throughput: 0: 10148.1. Samples: 22487804. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:16:25,122][635489] Avg episode reward: [(0, '2708.920')] [2023-03-09 06:16:26,869][635776] Updated weights for policy 0, policy_version 44000 (0.0005) [2023-03-09 06:16:30,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10024.8). Total num frames: 22560768. Throughput: 0: 10192.4. Samples: 22548544. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:16:30,122][635489] Avg episode reward: [(0, '3178.948')] [2023-03-09 06:16:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000044064_22560768.pth... [2023-03-09 06:16:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000043464_22253568.pth [2023-03-09 06:16:30,947][635776] Updated weights for policy 0, policy_version 44080 (0.0004) [2023-03-09 06:16:35,024][635776] Updated weights for policy 0, policy_version 44160 (0.0004) [2023-03-09 06:16:35,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 10038.7). Total num frames: 22609920. Throughput: 0: 10190.7. Samples: 22609684. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:16:35,122][635489] Avg episode reward: [(0, '3061.535')] [2023-03-09 06:16:39,010][635776] Updated weights for policy 0, policy_version 44240 (0.0004) [2023-03-09 06:16:40,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10038.7). Total num frames: 22659072. Throughput: 0: 10211.6. Samples: 22639832. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:16:40,129][635489] Avg episode reward: [(0, '2847.010')] [2023-03-09 06:16:43,085][635776] Updated weights for policy 0, policy_version 44320 (0.0005) [2023-03-09 06:16:45,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10066.4). Total num frames: 22712320. Throughput: 0: 10170.6. Samples: 22700472. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:16:45,128][635489] Avg episode reward: [(0, '2571.993')] [2023-03-09 06:16:45,131][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000044360_22712320.pth... [2023-03-09 06:16:45,134][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000043760_22405120.pth [2023-03-09 06:16:46,815][635776] Updated weights for policy 0, policy_version 44400 (0.0004) [2023-03-09 06:16:50,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10171.7, 300 sec: 10066.4). Total num frames: 22761472. Throughput: 0: 10249.1. Samples: 22763744. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:16:50,132][635489] Avg episode reward: [(0, '2969.736')] [2023-03-09 06:16:51,039][635776] Updated weights for policy 0, policy_version 44480 (0.0005) [2023-03-09 06:16:55,045][635776] Updated weights for policy 0, policy_version 44560 (0.0005) [2023-03-09 06:16:55,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10080.3). Total num frames: 22814720. Throughput: 0: 10248.6. Samples: 22792652. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:16:55,124][635489] Avg episode reward: [(0, '2341.290')] [2023-03-09 06:16:58,995][635776] Updated weights for policy 0, policy_version 44640 (0.0005) [2023-03-09 06:17:00,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.8, 300 sec: 10094.2). Total num frames: 22863872. Throughput: 0: 10262.3. Samples: 22855156. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:17:00,132][635489] Avg episode reward: [(0, '2807.434')] [2023-03-09 06:17:00,135][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000044656_22863872.pth... [2023-03-09 06:17:00,138][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000044064_22560768.pth [2023-03-09 06:17:02,922][635776] Updated weights for policy 0, policy_version 44720 (0.0005) [2023-03-09 06:17:05,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10094.2). Total num frames: 22917120. Throughput: 0: 10219.6. Samples: 22916724. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:17:05,133][635489] Avg episode reward: [(0, '2442.483')] [2023-03-09 06:17:07,077][635776] Updated weights for policy 0, policy_version 44800 (0.0004) [2023-03-09 06:17:10,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10080.3). Total num frames: 22962176. Throughput: 0: 10169.0. Samples: 22945408. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:17:10,122][635489] Avg episode reward: [(0, '2261.720')] [2023-03-09 06:17:11,312][635776] Updated weights for policy 0, policy_version 44880 (0.0004) [2023-03-09 06:17:15,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10080.3). Total num frames: 23015424. Throughput: 0: 10124.5. Samples: 23004148. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:17:15,133][635489] Avg episode reward: [(0, '2998.888')] [2023-03-09 06:17:15,136][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000044952_23015424.pth... [2023-03-09 06:17:15,139][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000044360_22712320.pth [2023-03-09 06:17:15,417][635776] Updated weights for policy 0, policy_version 44960 (0.0005) [2023-03-09 06:17:19,775][635776] Updated weights for policy 0, policy_version 45040 (0.0005) [2023-03-09 06:17:20,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10066.4). Total num frames: 23060480. Throughput: 0: 10041.6. Samples: 23061556. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:17:20,132][635489] Avg episode reward: [(0, '2798.112')] [2023-03-09 06:17:24,073][635776] Updated weights for policy 0, policy_version 45120 (0.0005) [2023-03-09 06:17:25,122][635489] Fps is (10 sec: 9420.8, 60 sec: 10035.2, 300 sec: 10066.4). Total num frames: 23109632. Throughput: 0: 10006.7. Samples: 23090132. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:17:25,122][635489] Avg episode reward: [(0, '2894.390')] [2023-03-09 06:17:28,065][635776] Updated weights for policy 0, policy_version 45200 (0.0005) [2023-03-09 06:17:30,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10035.2, 300 sec: 10066.4). Total num frames: 23162880. Throughput: 0: 10042.0. Samples: 23152360. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:17:30,133][635489] Avg episode reward: [(0, '3192.320')] [2023-03-09 06:17:30,136][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000045240_23162880.pth... [2023-03-09 06:17:30,138][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000044656_22863872.pth [2023-03-09 06:17:32,037][635776] Updated weights for policy 0, policy_version 45280 (0.0004) [2023-03-09 06:17:35,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10066.4). Total num frames: 23212032. Throughput: 0: 9959.8. Samples: 23211936. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:17:35,133][635489] Avg episode reward: [(0, '2683.309')] [2023-03-09 06:17:36,074][635776] Updated weights for policy 0, policy_version 45360 (0.0004) [2023-03-09 06:17:39,628][635776] Updated weights for policy 0, policy_version 45440 (0.0005) [2023-03-09 06:17:40,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10171.7, 300 sec: 10080.3). Total num frames: 23269376. Throughput: 0: 10058.1. Samples: 23245268. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 06:17:40,122][635489] Avg episode reward: [(0, '2669.438')] [2023-03-09 06:17:43,577][635776] Updated weights for policy 0, policy_version 45520 (0.0005) [2023-03-09 06:17:45,121][635489] Fps is (10 sec: 11059.3, 60 sec: 10171.8, 300 sec: 10080.3). Total num frames: 23322624. Throughput: 0: 10115.1. Samples: 23310336. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 06:17:45,122][635489] Avg episode reward: [(0, '2825.309')] [2023-03-09 06:17:45,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000045552_23322624.pth... [2023-03-09 06:17:45,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000044952_23015424.pth [2023-03-09 06:17:47,022][635776] Updated weights for policy 0, policy_version 45600 (0.0005) [2023-03-09 06:17:50,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10240.0, 300 sec: 10080.3). Total num frames: 23375872. Throughput: 0: 10210.3. Samples: 23376188. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 06:17:50,122][635489] Avg episode reward: [(0, '2674.438')] [2023-03-09 06:17:51,021][635776] Updated weights for policy 0, policy_version 45680 (0.0006) [2023-03-09 06:17:55,069][635776] Updated weights for policy 0, policy_version 45760 (0.0005) [2023-03-09 06:17:55,121][635489] Fps is (10 sec: 10649.6, 60 sec: 10240.0, 300 sec: 10108.1). Total num frames: 23429120. Throughput: 0: 10278.7. Samples: 23407948. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 06:17:55,122][635489] Avg episode reward: [(0, '3003.988')] [2023-03-09 06:17:59,063][635776] Updated weights for policy 0, policy_version 45840 (0.0005) [2023-03-09 06:18:00,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10108.1). Total num frames: 23478272. Throughput: 0: 10304.9. Samples: 23467868. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 06:18:00,138][635489] Avg episode reward: [(0, '2540.854')] [2023-03-09 06:18:00,141][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000045856_23478272.pth... [2023-03-09 06:18:00,144][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000045240_23162880.pth [2023-03-09 06:18:03,053][635776] Updated weights for policy 0, policy_version 45920 (0.0005) [2023-03-09 06:18:05,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10122.0). Total num frames: 23531520. Throughput: 0: 10459.8. Samples: 23532248. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 06:18:05,132][635489] Avg episode reward: [(0, '2434.865')] [2023-03-09 06:18:06,935][635776] Updated weights for policy 0, policy_version 46000 (0.0005) [2023-03-09 06:18:10,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 10122.0). Total num frames: 23580672. Throughput: 0: 10455.4. Samples: 23560624. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 06:18:10,132][635489] Avg episode reward: [(0, '2080.369')] [2023-03-09 06:18:11,086][635776] Updated weights for policy 0, policy_version 46080 (0.0005) [2023-03-09 06:18:15,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 10135.9). Total num frames: 23629824. Throughput: 0: 10426.6. Samples: 23621556. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 06:18:15,122][635489] Avg episode reward: [(0, '2486.053')] [2023-03-09 06:18:15,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000046152_23629824.pth... [2023-03-09 06:18:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000045552_23322624.pth [2023-03-09 06:18:15,224][635776] Updated weights for policy 0, policy_version 46160 (0.0005) [2023-03-09 06:18:19,378][635776] Updated weights for policy 0, policy_version 46240 (0.0004) [2023-03-09 06:18:20,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10308.3, 300 sec: 10135.9). Total num frames: 23678976. Throughput: 0: 10423.0. Samples: 23680968. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 06:18:20,122][635489] Avg episode reward: [(0, '2973.780')] [2023-03-09 06:18:23,575][635776] Updated weights for policy 0, policy_version 46320 (0.0005) [2023-03-09 06:18:25,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10308.3, 300 sec: 10135.9). Total num frames: 23728128. Throughput: 0: 10324.2. Samples: 23709856. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 06:18:25,122][635489] Avg episode reward: [(0, '2725.154')] [2023-03-09 06:18:27,862][635776] Updated weights for policy 0, policy_version 46400 (0.0004) [2023-03-09 06:18:30,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10240.0, 300 sec: 10135.9). Total num frames: 23777280. Throughput: 0: 10138.6. Samples: 23766576. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 06:18:30,122][635489] Avg episode reward: [(0, '2896.658')] [2023-03-09 06:18:30,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000046440_23777280.pth... [2023-03-09 06:18:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000045856_23478272.pth [2023-03-09 06:18:32,213][635776] Updated weights for policy 0, policy_version 46480 (0.0005) [2023-03-09 06:18:35,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 10122.0). Total num frames: 23826432. Throughput: 0: 9986.0. Samples: 23825560. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 06:18:35,122][635489] Avg episode reward: [(0, '2595.273')] [2023-03-09 06:18:36,354][635776] Updated weights for policy 0, policy_version 46560 (0.0005) [2023-03-09 06:18:40,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10122.0). Total num frames: 23875584. Throughput: 0: 9936.2. Samples: 23855076. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 06:18:40,122][635489] Avg episode reward: [(0, '2791.190')] [2023-03-09 06:18:40,336][635776] Updated weights for policy 0, policy_version 46640 (0.0005) [2023-03-09 06:18:44,423][635776] Updated weights for policy 0, policy_version 46720 (0.0005) [2023-03-09 06:18:45,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10108.1). Total num frames: 23924736. Throughput: 0: 9940.3. Samples: 23915180. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 06:18:45,122][635489] Avg episode reward: [(0, '2396.718')] [2023-03-09 06:18:45,161][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000046736_23928832.pth... [2023-03-09 06:18:45,163][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000046152_23629824.pth [2023-03-09 06:18:48,445][635776] Updated weights for policy 0, policy_version 46800 (0.0004) [2023-03-09 06:18:50,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10035.2, 300 sec: 10108.1). Total num frames: 23977984. Throughput: 0: 9882.8. Samples: 23976972. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 06:18:50,122][635489] Avg episode reward: [(0, '2063.158')] [2023-03-09 06:18:52,323][635776] Updated weights for policy 0, policy_version 46880 (0.0005) [2023-03-09 06:18:55,121][635489] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 10108.1). Total num frames: 24027136. Throughput: 0: 9935.4. Samples: 24007716. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 06:18:55,122][635489] Avg episode reward: [(0, '2052.644')] [2023-03-09 06:18:56,629][635776] Updated weights for policy 0, policy_version 46960 (0.0005) [2023-03-09 06:19:00,122][635489] Fps is (10 sec: 9830.3, 60 sec: 9966.9, 300 sec: 10108.1). Total num frames: 24076288. Throughput: 0: 9833.5. Samples: 24064064. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 06:19:00,122][635489] Avg episode reward: [(0, '2033.375')] [2023-03-09 06:19:00,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000047024_24076288.pth... [2023-03-09 06:19:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000046440_23777280.pth [2023-03-09 06:19:00,923][635776] Updated weights for policy 0, policy_version 47040 (0.0004) [2023-03-09 06:19:04,904][635776] Updated weights for policy 0, policy_version 47120 (0.0005) [2023-03-09 06:19:05,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9898.6, 300 sec: 10108.1). Total num frames: 24125440. Throughput: 0: 9875.3. Samples: 24125356. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 06:19:05,122][635489] Avg episode reward: [(0, '2396.431')] [2023-03-09 06:19:09,003][635776] Updated weights for policy 0, policy_version 47200 (0.0005) [2023-03-09 06:19:10,121][635489] Fps is (10 sec: 9830.5, 60 sec: 9898.7, 300 sec: 10108.1). Total num frames: 24174592. Throughput: 0: 9874.0. Samples: 24154184. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 06:19:10,122][635489] Avg episode reward: [(0, '2191.543')] [2023-03-09 06:19:13,016][635776] Updated weights for policy 0, policy_version 47280 (0.0005) [2023-03-09 06:19:15,121][635489] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 10122.0). Total num frames: 24227840. Throughput: 0: 9987.8. Samples: 24216028. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 06:19:15,122][635489] Avg episode reward: [(0, '2900.182')] [2023-03-09 06:19:15,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000047320_24227840.pth... [2023-03-09 06:19:15,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000046736_23928832.pth [2023-03-09 06:19:17,069][635776] Updated weights for policy 0, policy_version 47360 (0.0005) [2023-03-09 06:19:20,122][635489] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 10122.0). Total num frames: 24276992. Throughput: 0: 10029.3. Samples: 24276880. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 06:19:20,122][635489] Avg episode reward: [(0, '2688.313')] [2023-03-09 06:19:21,182][635776] Updated weights for policy 0, policy_version 47440 (0.0005) [2023-03-09 06:19:25,121][635489] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 10108.1). Total num frames: 24326144. Throughput: 0: 10012.3. Samples: 24305628. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 06:19:25,122][635489] Avg episode reward: [(0, '2364.717')] [2023-03-09 06:19:25,335][635776] Updated weights for policy 0, policy_version 47520 (0.0005) [2023-03-09 06:19:29,237][635776] Updated weights for policy 0, policy_version 47600 (0.0005) [2023-03-09 06:19:30,121][635489] Fps is (10 sec: 9830.5, 60 sec: 9967.0, 300 sec: 10122.0). Total num frames: 24375296. Throughput: 0: 10045.4. Samples: 24367220. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 06:19:30,122][635489] Avg episode reward: [(0, '3353.399')] [2023-03-09 06:19:30,130][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000047616_24379392.pth... [2023-03-09 06:19:30,132][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000047024_24076288.pth [2023-03-09 06:19:33,551][635776] Updated weights for policy 0, policy_version 47680 (0.0005) [2023-03-09 06:19:35,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10035.2, 300 sec: 10135.9). Total num frames: 24428544. Throughput: 0: 9988.8. Samples: 24426468. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 06:19:35,122][635489] Avg episode reward: [(0, '2719.675')] [2023-03-09 06:19:37,461][635776] Updated weights for policy 0, policy_version 47760 (0.0005) [2023-03-09 06:19:40,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10035.2, 300 sec: 10122.0). Total num frames: 24477696. Throughput: 0: 9990.5. Samples: 24457288. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:19:40,122][635489] Avg episode reward: [(0, '3379.117')] [2023-03-09 06:19:41,361][635776] Updated weights for policy 0, policy_version 47840 (0.0006) [2023-03-09 06:19:45,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10122.0). Total num frames: 24530944. Throughput: 0: 10123.3. Samples: 24519612. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:19:45,122][635489] Avg episode reward: [(0, '3033.691')] [2023-03-09 06:19:45,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000047912_24530944.pth... [2023-03-09 06:19:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000047320_24227840.pth [2023-03-09 06:19:45,429][635776] Updated weights for policy 0, policy_version 47920 (0.0005) [2023-03-09 06:19:49,225][635776] Updated weights for policy 0, policy_version 48000 (0.0005) [2023-03-09 06:19:50,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10103.4, 300 sec: 10135.9). Total num frames: 24584192. Throughput: 0: 10192.8. Samples: 24584032. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:19:50,122][635489] Avg episode reward: [(0, '2912.877')] [2023-03-09 06:19:53,188][635776] Updated weights for policy 0, policy_version 48080 (0.0004) [2023-03-09 06:19:55,122][635489] Fps is (10 sec: 10240.1, 60 sec: 10103.5, 300 sec: 10122.0). Total num frames: 24633344. Throughput: 0: 10194.3. Samples: 24612928. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:19:55,122][635489] Avg episode reward: [(0, '2897.116')] [2023-03-09 06:19:57,077][635776] Updated weights for policy 0, policy_version 48160 (0.0005) [2023-03-09 06:20:00,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10171.8, 300 sec: 10135.9). Total num frames: 24686592. Throughput: 0: 10255.5. Samples: 24677524. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:20:00,122][635489] Avg episode reward: [(0, '2450.424')] [2023-03-09 06:20:00,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000048216_24686592.pth... [2023-03-09 06:20:00,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000047616_24379392.pth [2023-03-09 06:20:01,117][635776] Updated weights for policy 0, policy_version 48240 (0.0004) [2023-03-09 06:20:05,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10122.0). Total num frames: 24735744. Throughput: 0: 10252.6. Samples: 24738248. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:20:05,122][635489] Avg episode reward: [(0, '2484.397')] [2023-03-09 06:20:05,146][635776] Updated weights for policy 0, policy_version 48320 (0.0005) [2023-03-09 06:20:09,153][635776] Updated weights for policy 0, policy_version 48400 (0.0004) [2023-03-09 06:20:10,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10135.9). Total num frames: 24788992. Throughput: 0: 10286.6. Samples: 24768524. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:20:10,122][635489] Avg episode reward: [(0, '2428.726')] [2023-03-09 06:20:12,999][635776] Updated weights for policy 0, policy_version 48480 (0.0005) [2023-03-09 06:20:15,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10240.0, 300 sec: 10149.7). Total num frames: 24842240. Throughput: 0: 10313.4. Samples: 24831324. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:20:15,133][635489] Avg episode reward: [(0, '3005.625')] [2023-03-09 06:20:15,136][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000048520_24842240.pth... [2023-03-09 06:20:15,139][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000047912_24530944.pth [2023-03-09 06:20:17,058][635776] Updated weights for policy 0, policy_version 48560 (0.0004) [2023-03-09 06:20:20,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10135.9). Total num frames: 24891392. Throughput: 0: 10331.9. Samples: 24891404. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:20:20,122][635489] Avg episode reward: [(0, '2750.591')] [2023-03-09 06:20:21,128][635776] Updated weights for policy 0, policy_version 48640 (0.0005) [2023-03-09 06:20:25,111][635776] Updated weights for policy 0, policy_version 48720 (0.0005) [2023-03-09 06:20:25,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10308.2, 300 sec: 10149.7). Total num frames: 24944640. Throughput: 0: 10305.9. Samples: 24921056. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:20:25,122][635489] Avg episode reward: [(0, '2744.271')] [2023-03-09 06:20:29,091][635776] Updated weights for policy 0, policy_version 48800 (0.0005) [2023-03-09 06:20:30,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10308.2, 300 sec: 10149.7). Total num frames: 24993792. Throughput: 0: 10317.7. Samples: 24983908. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:20:30,122][635489] Avg episode reward: [(0, '2177.171')] [2023-03-09 06:20:30,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000048816_24993792.pth... [2023-03-09 06:20:30,131][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000048216_24686592.pth [2023-03-09 06:20:33,157][635776] Updated weights for policy 0, policy_version 48880 (0.0005) [2023-03-09 06:20:35,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10240.0, 300 sec: 10135.9). Total num frames: 25042944. Throughput: 0: 10204.4. Samples: 25043228. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:20:35,122][635489] Avg episode reward: [(0, '1922.291')] [2023-03-09 06:20:37,058][635776] Updated weights for policy 0, policy_version 48960 (0.0005) [2023-03-09 06:20:40,122][635489] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 10163.6). Total num frames: 25096192. Throughput: 0: 10285.5. Samples: 25075776. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:20:40,122][635489] Avg episode reward: [(0, '2023.308')] [2023-03-09 06:20:41,149][635776] Updated weights for policy 0, policy_version 49040 (0.0005) [2023-03-09 06:20:45,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10149.7). Total num frames: 25145344. Throughput: 0: 10189.8. Samples: 25136064. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:20:45,122][635489] Avg episode reward: [(0, '2103.406')] [2023-03-09 06:20:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000049112_25145344.pth... [2023-03-09 06:20:45,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000048520_24842240.pth [2023-03-09 06:20:45,229][635776] Updated weights for policy 0, policy_version 49120 (0.0005) [2023-03-09 06:20:49,324][635776] Updated weights for policy 0, policy_version 49200 (0.0005) [2023-03-09 06:20:50,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10149.7). Total num frames: 25194496. Throughput: 0: 10171.2. Samples: 25195952. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:20:50,122][635489] Avg episode reward: [(0, '2557.107')] [2023-03-09 06:20:53,131][635776] Updated weights for policy 0, policy_version 49280 (0.0005) [2023-03-09 06:20:55,121][635489] Fps is (10 sec: 10649.7, 60 sec: 10308.3, 300 sec: 10163.6). Total num frames: 25251840. Throughput: 0: 10216.8. Samples: 25228280. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:20:55,122][635489] Avg episode reward: [(0, '2536.111')] [2023-03-09 06:20:56,965][635776] Updated weights for policy 0, policy_version 49360 (0.0004) [2023-03-09 06:21:00,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10240.0, 300 sec: 10163.6). Total num frames: 25300992. Throughput: 0: 10197.8. Samples: 25290224. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:21:00,122][635489] Avg episode reward: [(0, '2776.260')] [2023-03-09 06:21:00,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000049416_25300992.pth... [2023-03-09 06:21:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000048816_24993792.pth [2023-03-09 06:21:01,117][635776] Updated weights for policy 0, policy_version 49440 (0.0005) [2023-03-09 06:21:05,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 10163.6). Total num frames: 25350144. Throughput: 0: 10208.5. Samples: 25350784. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:21:05,122][635489] Avg episode reward: [(0, '2418.468')] [2023-03-09 06:21:05,214][635776] Updated weights for policy 0, policy_version 49520 (0.0005) [2023-03-09 06:21:09,206][635776] Updated weights for policy 0, policy_version 49600 (0.0005) [2023-03-09 06:21:10,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10163.6). Total num frames: 25403392. Throughput: 0: 10243.7. Samples: 25382020. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:21:10,122][635489] Avg episode reward: [(0, '2279.202')] [2023-03-09 06:21:13,046][635776] Updated weights for policy 0, policy_version 49680 (0.0004) [2023-03-09 06:21:15,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 10149.7). Total num frames: 25452544. Throughput: 0: 10235.0. Samples: 25444484. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:21:15,122][635489] Avg episode reward: [(0, '2159.653')] [2023-03-09 06:21:15,178][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000049720_25456640.pth... [2023-03-09 06:21:15,180][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000049112_25145344.pth [2023-03-09 06:21:17,161][635776] Updated weights for policy 0, policy_version 49760 (0.0005) [2023-03-09 06:21:20,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10163.6). Total num frames: 25505792. Throughput: 0: 10278.6. Samples: 25505764. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:21:20,122][635489] Avg episode reward: [(0, '2728.247')] [2023-03-09 06:21:21,182][635776] Updated weights for policy 0, policy_version 49840 (0.0005) [2023-03-09 06:21:24,979][635776] Updated weights for policy 0, policy_version 49920 (0.0005) [2023-03-09 06:21:25,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10240.0, 300 sec: 10163.6). Total num frames: 25559040. Throughput: 0: 10248.4. Samples: 25536956. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:21:25,122][635489] Avg episode reward: [(0, '1652.514')] [2023-03-09 06:21:29,234][635776] Updated weights for policy 0, policy_version 50000 (0.0005) [2023-03-09 06:21:30,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10163.6). Total num frames: 25608192. Throughput: 0: 10232.3. Samples: 25596520. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:21:30,122][635489] Avg episode reward: [(0, '1895.781')] [2023-03-09 06:21:30,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000050016_25608192.pth... [2023-03-09 06:21:30,129][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000049416_25300992.pth [2023-03-09 06:21:33,395][635776] Updated weights for policy 0, policy_version 50080 (0.0005) [2023-03-09 06:21:35,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 10163.6). Total num frames: 25657344. Throughput: 0: 10220.4. Samples: 25655872. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:21:35,122][635489] Avg episode reward: [(0, '2207.936')] [2023-03-09 06:21:37,483][635776] Updated weights for policy 0, policy_version 50160 (0.0005) [2023-03-09 06:21:40,122][635489] Fps is (10 sec: 9830.5, 60 sec: 10171.7, 300 sec: 10149.8). Total num frames: 25706496. Throughput: 0: 10173.5. Samples: 25686088. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:21:40,122][635489] Avg episode reward: [(0, '2539.713')] [2023-03-09 06:21:41,527][635776] Updated weights for policy 0, policy_version 50240 (0.0005) [2023-03-09 06:21:45,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10171.7, 300 sec: 10149.7). Total num frames: 25755648. Throughput: 0: 10153.0. Samples: 25747108. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:21:45,122][635489] Avg episode reward: [(0, '2592.166')] [2023-03-09 06:21:45,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000050304_25755648.pth... [2023-03-09 06:21:45,129][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000049720_25456640.pth [2023-03-09 06:21:45,718][635776] Updated weights for policy 0, policy_version 50320 (0.0006) [2023-03-09 06:21:49,763][635776] Updated weights for policy 0, policy_version 50400 (0.0005) [2023-03-09 06:21:50,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10135.9). Total num frames: 25804800. Throughput: 0: 10118.2. Samples: 25806104. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:21:50,122][635489] Avg episode reward: [(0, '2579.123')] [2023-03-09 06:21:53,714][635776] Updated weights for policy 0, policy_version 50480 (0.0005) [2023-03-09 06:21:55,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10103.5, 300 sec: 10149.7). Total num frames: 25858048. Throughput: 0: 10108.3. Samples: 25836892. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:21:55,122][635489] Avg episode reward: [(0, '2324.127')] [2023-03-09 06:21:57,525][635776] Updated weights for policy 0, policy_version 50560 (0.0005) [2023-03-09 06:22:00,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10171.7, 300 sec: 10149.7). Total num frames: 25911296. Throughput: 0: 10106.4. Samples: 25899272. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:22:00,122][635489] Avg episode reward: [(0, '2248.878')] [2023-03-09 06:22:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000050608_25911296.pth... [2023-03-09 06:22:00,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000050016_25608192.pth [2023-03-09 06:22:01,674][635776] Updated weights for policy 0, policy_version 50640 (0.0004) [2023-03-09 06:22:05,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10163.6). Total num frames: 25960448. Throughput: 0: 10090.4. Samples: 25959832. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:22:05,122][635489] Avg episode reward: [(0, '2146.628')] [2023-03-09 06:22:05,795][635776] Updated weights for policy 0, policy_version 50720 (0.0005) [2023-03-09 06:22:09,582][635776] Updated weights for policy 0, policy_version 50800 (0.0004) [2023-03-09 06:22:10,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10163.6). Total num frames: 26013696. Throughput: 0: 10125.4. Samples: 25992600. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:22:10,122][635489] Avg episode reward: [(0, '2686.596')] [2023-03-09 06:22:13,471][635776] Updated weights for policy 0, policy_version 50880 (0.0004) [2023-03-09 06:22:15,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10177.5). Total num frames: 26062848. Throughput: 0: 10182.3. Samples: 26054720. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:22:15,122][635489] Avg episode reward: [(0, '2579.687')] [2023-03-09 06:22:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000050904_26062848.pth... [2023-03-09 06:22:15,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000050304_25755648.pth [2023-03-09 06:22:17,691][635776] Updated weights for policy 0, policy_version 50960 (0.0004) [2023-03-09 06:22:20,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10191.4). Total num frames: 26116096. Throughput: 0: 10198.5. Samples: 26114804. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:22:20,122][635489] Avg episode reward: [(0, '1701.224')] [2023-03-09 06:22:21,767][635776] Updated weights for policy 0, policy_version 51040 (0.0004) [2023-03-09 06:22:25,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10035.2, 300 sec: 10163.6). Total num frames: 26161152. Throughput: 0: 10149.2. Samples: 26142800. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:22:25,122][635489] Avg episode reward: [(0, '2008.418')] [2023-03-09 06:22:26,070][635776] Updated weights for policy 0, policy_version 51120 (0.0004) [2023-03-09 06:22:30,122][635489] Fps is (10 sec: 9420.8, 60 sec: 10035.2, 300 sec: 10163.6). Total num frames: 26210304. Throughput: 0: 10110.5. Samples: 26202080. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:22:30,122][635489] Avg episode reward: [(0, '2134.791')] [2023-03-09 06:22:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000051192_26210304.pth... [2023-03-09 06:22:30,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000050608_25911296.pth [2023-03-09 06:22:30,249][635776] Updated weights for policy 0, policy_version 51200 (0.0005) [2023-03-09 06:22:34,352][635776] Updated weights for policy 0, policy_version 51280 (0.0005) [2023-03-09 06:22:35,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10135.9). Total num frames: 26259456. Throughput: 0: 10095.2. Samples: 26260388. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:22:35,122][635489] Avg episode reward: [(0, '2041.204')] [2023-03-09 06:22:38,719][635776] Updated weights for policy 0, policy_version 51360 (0.0005) [2023-03-09 06:22:40,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10122.0). Total num frames: 26308608. Throughput: 0: 10026.6. Samples: 26288088. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:22:40,122][635489] Avg episode reward: [(0, '1981.515')] [2023-03-09 06:22:42,827][635776] Updated weights for policy 0, policy_version 51440 (0.0005) [2023-03-09 06:22:45,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10108.1). Total num frames: 26357760. Throughput: 0: 10002.0. Samples: 26349360. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 06:22:45,122][635489] Avg episode reward: [(0, '2546.733')] [2023-03-09 06:22:45,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000051480_26357760.pth... [2023-03-09 06:22:45,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000050904_26062848.pth [2023-03-09 06:22:46,795][635776] Updated weights for policy 0, policy_version 51520 (0.0004) [2023-03-09 06:22:50,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10108.1). Total num frames: 26411008. Throughput: 0: 9967.2. Samples: 26408356. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 06:22:50,122][635489] Avg episode reward: [(0, '2537.545')] [2023-03-09 06:22:50,922][635776] Updated weights for policy 0, policy_version 51600 (0.0004) [2023-03-09 06:22:54,904][635776] Updated weights for policy 0, policy_version 51680 (0.0005) [2023-03-09 06:22:55,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10108.1). Total num frames: 26460160. Throughput: 0: 9946.8. Samples: 26440204. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 06:22:55,122][635489] Avg episode reward: [(0, '2450.425')] [2023-03-09 06:22:58,710][635776] Updated weights for policy 0, policy_version 51760 (0.0005) [2023-03-09 06:23:00,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10035.2, 300 sec: 10108.1). Total num frames: 26513408. Throughput: 0: 9972.2. Samples: 26503468. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 06:23:00,122][635489] Avg episode reward: [(0, '2692.873')] [2023-03-09 06:23:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000051784_26513408.pth... [2023-03-09 06:23:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000051192_26210304.pth [2023-03-09 06:23:02,640][635776] Updated weights for policy 0, policy_version 51840 (0.0004) [2023-03-09 06:23:05,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10035.2, 300 sec: 10108.1). Total num frames: 26562560. Throughput: 0: 9980.9. Samples: 26563944. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 06:23:05,122][635489] Avg episode reward: [(0, '2387.833')] [2023-03-09 06:23:06,940][635776] Updated weights for policy 0, policy_version 51920 (0.0004) [2023-03-09 06:23:10,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10035.2, 300 sec: 10122.0). Total num frames: 26615808. Throughput: 0: 10045.4. Samples: 26594844. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 06:23:10,122][635489] Avg episode reward: [(0, '2379.420')] [2023-03-09 06:23:10,518][635776] Updated weights for policy 0, policy_version 52000 (0.0005) [2023-03-09 06:23:14,242][635776] Updated weights for policy 0, policy_version 52080 (0.0004) [2023-03-09 06:23:15,122][635489] Fps is (10 sec: 11059.1, 60 sec: 10171.7, 300 sec: 10149.7). Total num frames: 26673152. Throughput: 0: 10204.3. Samples: 26661276. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 06:23:15,122][635489] Avg episode reward: [(0, '2527.971')] [2023-03-09 06:23:15,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000052096_26673152.pth... [2023-03-09 06:23:15,129][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000051480_26357760.pth [2023-03-09 06:23:18,152][635776] Updated weights for policy 0, policy_version 52160 (0.0005) [2023-03-09 06:23:20,121][635489] Fps is (10 sec: 10649.7, 60 sec: 10103.5, 300 sec: 10149.8). Total num frames: 26722304. Throughput: 0: 10295.5. Samples: 26723684. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 06:23:20,122][635489] Avg episode reward: [(0, '2425.579')] [2023-03-09 06:23:22,096][635776] Updated weights for policy 0, policy_version 52240 (0.0005) [2023-03-09 06:23:25,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10163.6). Total num frames: 26775552. Throughput: 0: 10377.7. Samples: 26755084. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 06:23:25,122][635489] Avg episode reward: [(0, '2501.766')] [2023-03-09 06:23:26,067][635776] Updated weights for policy 0, policy_version 52320 (0.0005) [2023-03-09 06:23:29,903][635776] Updated weights for policy 0, policy_version 52400 (0.0005) [2023-03-09 06:23:30,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10308.3, 300 sec: 10177.5). Total num frames: 26828800. Throughput: 0: 10422.7. Samples: 26818384. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 06:23:30,122][635489] Avg episode reward: [(0, '2448.632')] [2023-03-09 06:23:30,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000052400_26828800.pth... [2023-03-09 06:23:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000051784_26513408.pth [2023-03-09 06:23:33,769][635776] Updated weights for policy 0, policy_version 52480 (0.0004) [2023-03-09 06:23:35,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10376.5, 300 sec: 10191.4). Total num frames: 26882048. Throughput: 0: 10521.0. Samples: 26881800. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 06:23:35,122][635489] Avg episode reward: [(0, '2443.666')] [2023-03-09 06:23:37,606][635776] Updated weights for policy 0, policy_version 52560 (0.0005) [2023-03-09 06:23:40,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10205.3). Total num frames: 26935296. Throughput: 0: 10530.9. Samples: 26914096. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 06:23:40,122][635489] Avg episode reward: [(0, '2846.498')] [2023-03-09 06:23:41,426][635776] Updated weights for policy 0, policy_version 52640 (0.0005) [2023-03-09 06:23:45,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10191.4). Total num frames: 26984448. Throughput: 0: 10507.8. Samples: 26976320. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 06:23:45,122][635489] Avg episode reward: [(0, '3030.696')] [2023-03-09 06:23:45,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000052704_26984448.pth... [2023-03-09 06:23:45,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000052096_26673152.pth [2023-03-09 06:23:45,627][635776] Updated weights for policy 0, policy_version 52720 (0.0004) [2023-03-09 06:23:49,889][635776] Updated weights for policy 0, policy_version 52800 (0.0005) [2023-03-09 06:23:50,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10376.5, 300 sec: 10191.4). Total num frames: 27033600. Throughput: 0: 10437.0. Samples: 27033612. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:23:50,122][635489] Avg episode reward: [(0, '2965.486')] [2023-03-09 06:23:53,933][635776] Updated weights for policy 0, policy_version 52880 (0.0005) [2023-03-09 06:23:55,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10205.3). Total num frames: 27086848. Throughput: 0: 10431.6. Samples: 27064264. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:23:55,122][635489] Avg episode reward: [(0, '3383.914')] [2023-03-09 06:23:57,914][635776] Updated weights for policy 0, policy_version 52960 (0.0005) [2023-03-09 06:24:00,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10205.3). Total num frames: 27136000. Throughput: 0: 10314.9. Samples: 27125448. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:24:00,122][635489] Avg episode reward: [(0, '3541.296')] [2023-03-09 06:24:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000053000_27136000.pth... [2023-03-09 06:24:00,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000052400_26828800.pth [2023-03-09 06:24:02,066][635776] Updated weights for policy 0, policy_version 53040 (0.0004) [2023-03-09 06:24:05,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10376.5, 300 sec: 10205.3). Total num frames: 27185152. Throughput: 0: 10227.7. Samples: 27183932. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:24:05,122][635489] Avg episode reward: [(0, '3367.826')] [2023-03-09 06:24:06,280][635776] Updated weights for policy 0, policy_version 53120 (0.0005) [2023-03-09 06:24:10,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10308.3, 300 sec: 10191.4). Total num frames: 27234304. Throughput: 0: 10195.6. Samples: 27213888. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:24:10,122][635489] Avg episode reward: [(0, '3405.591')] [2023-03-09 06:24:10,204][635776] Updated weights for policy 0, policy_version 53200 (0.0005) [2023-03-09 06:24:14,163][635776] Updated weights for policy 0, policy_version 53280 (0.0005) [2023-03-09 06:24:15,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10205.3). Total num frames: 27287552. Throughput: 0: 10161.5. Samples: 27275652. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:24:15,122][635489] Avg episode reward: [(0, '3472.735')] [2023-03-09 06:24:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000053296_27287552.pth... [2023-03-09 06:24:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000052704_26984448.pth [2023-03-09 06:24:18,449][635776] Updated weights for policy 0, policy_version 53360 (0.0005) [2023-03-09 06:24:20,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10205.3). Total num frames: 27336704. Throughput: 0: 10073.3. Samples: 27335100. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:24:20,122][635489] Avg episode reward: [(0, '3508.365')] [2023-03-09 06:24:22,393][635776] Updated weights for policy 0, policy_version 53440 (0.0005) [2023-03-09 06:24:25,122][635489] Fps is (10 sec: 9830.5, 60 sec: 10171.7, 300 sec: 10205.3). Total num frames: 27385856. Throughput: 0: 10032.1. Samples: 27365540. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:24:25,122][635489] Avg episode reward: [(0, '3426.721')] [2023-03-09 06:24:26,605][635776] Updated weights for policy 0, policy_version 53520 (0.0005) [2023-03-09 06:24:30,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10205.3). Total num frames: 27439104. Throughput: 0: 10008.9. Samples: 27426720. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:24:30,122][635489] Avg episode reward: [(0, '3875.588')] [2023-03-09 06:24:30,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000053592_27439104.pth... [2023-03-09 06:24:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000053000_27136000.pth [2023-03-09 06:24:30,502][635776] Updated weights for policy 0, policy_version 53600 (0.0005) [2023-03-09 06:24:34,593][635776] Updated weights for policy 0, policy_version 53680 (0.0005) [2023-03-09 06:24:35,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10205.3). Total num frames: 27488256. Throughput: 0: 10098.6. Samples: 27488048. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:24:35,122][635489] Avg episode reward: [(0, '3570.387')] [2023-03-09 06:24:38,473][635776] Updated weights for policy 0, policy_version 53760 (0.0004) [2023-03-09 06:24:40,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10035.2, 300 sec: 10191.4). Total num frames: 27537408. Throughput: 0: 10087.1. Samples: 27518184. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:24:40,122][635489] Avg episode reward: [(0, '3320.488')] [2023-03-09 06:24:42,655][635776] Updated weights for policy 0, policy_version 53840 (0.0004) [2023-03-09 06:24:45,122][635489] Fps is (10 sec: 10239.8, 60 sec: 10103.4, 300 sec: 10191.4). Total num frames: 27590656. Throughput: 0: 10062.7. Samples: 27578272. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:24:45,122][635489] Avg episode reward: [(0, '3047.871')] [2023-03-09 06:24:45,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000053888_27590656.pth... [2023-03-09 06:24:45,129][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000053296_27287552.pth [2023-03-09 06:24:46,792][635776] Updated weights for policy 0, policy_version 53920 (0.0005) [2023-03-09 06:24:50,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 10191.4). Total num frames: 27639808. Throughput: 0: 10085.9. Samples: 27637796. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:24:50,122][635489] Avg episode reward: [(0, '3653.812')] [2023-03-09 06:24:50,743][635776] Updated weights for policy 0, policy_version 54000 (0.0005) [2023-03-09 06:24:54,609][635776] Updated weights for policy 0, policy_version 54080 (0.0005) [2023-03-09 06:24:55,122][635489] Fps is (10 sec: 10240.1, 60 sec: 10103.5, 300 sec: 10191.4). Total num frames: 27693056. Throughput: 0: 10179.6. Samples: 27671972. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:24:55,122][635489] Avg episode reward: [(0, '3629.392')] [2023-03-09 06:24:58,765][635776] Updated weights for policy 0, policy_version 54160 (0.0005) [2023-03-09 06:25:00,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10191.4). Total num frames: 27742208. Throughput: 0: 10120.7. Samples: 27731084. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:25:00,122][635489] Avg episode reward: [(0, '3979.507')] [2023-03-09 06:25:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000054184_27742208.pth... [2023-03-09 06:25:00,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000053592_27439104.pth [2023-03-09 06:25:02,795][635776] Updated weights for policy 0, policy_version 54240 (0.0005) [2023-03-09 06:25:05,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10191.4). Total num frames: 27795456. Throughput: 0: 10187.6. Samples: 27793544. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:25:05,122][635489] Avg episode reward: [(0, '3667.229')] [2023-03-09 06:25:06,670][635776] Updated weights for policy 0, policy_version 54320 (0.0005) [2023-03-09 06:25:10,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10177.5). Total num frames: 27844608. Throughput: 0: 10190.8. Samples: 27824128. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:25:10,122][635489] Avg episode reward: [(0, '3282.900')] [2023-03-09 06:25:10,563][635776] Updated weights for policy 0, policy_version 54400 (0.0005) [2023-03-09 06:25:14,712][635776] Updated weights for policy 0, policy_version 54480 (0.0004) [2023-03-09 06:25:15,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10191.4). Total num frames: 27897856. Throughput: 0: 10196.6. Samples: 27885568. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:25:15,122][635489] Avg episode reward: [(0, '3373.220')] [2023-03-09 06:25:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000054488_27897856.pth... [2023-03-09 06:25:15,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000053888_27590656.pth [2023-03-09 06:25:18,565][635776] Updated weights for policy 0, policy_version 54560 (0.0005) [2023-03-09 06:25:20,121][635489] Fps is (10 sec: 10649.6, 60 sec: 10240.0, 300 sec: 10191.4). Total num frames: 27951104. Throughput: 0: 10249.2. Samples: 27949260. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:25:20,122][635489] Avg episode reward: [(0, '3372.439')] [2023-03-09 06:25:22,610][635776] Updated weights for policy 0, policy_version 54640 (0.0005) [2023-03-09 06:25:25,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10191.4). Total num frames: 28000256. Throughput: 0: 10245.2. Samples: 27979220. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:25:25,122][635489] Avg episode reward: [(0, '3176.393')] [2023-03-09 06:25:26,791][635776] Updated weights for policy 0, policy_version 54720 (0.0005) [2023-03-09 06:25:30,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10171.7, 300 sec: 10191.4). Total num frames: 28049408. Throughput: 0: 10216.7. Samples: 28038024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:25:30,122][635489] Avg episode reward: [(0, '3209.391')] [2023-03-09 06:25:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000054784_28049408.pth... [2023-03-09 06:25:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000054184_27742208.pth [2023-03-09 06:25:30,824][635776] Updated weights for policy 0, policy_version 54800 (0.0005) [2023-03-09 06:25:34,809][635776] Updated weights for policy 0, policy_version 54880 (0.0005) [2023-03-09 06:25:35,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10171.7, 300 sec: 10177.5). Total num frames: 28098560. Throughput: 0: 10250.3. Samples: 28099060. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:25:35,122][635489] Avg episode reward: [(0, '3112.159')] [2023-03-09 06:25:38,834][635776] Updated weights for policy 0, policy_version 54960 (0.0005) [2023-03-09 06:25:40,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10191.4). Total num frames: 28151808. Throughput: 0: 10197.3. Samples: 28130852. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:25:40,122][635489] Avg episode reward: [(0, '3251.915')] [2023-03-09 06:25:42,948][635776] Updated weights for policy 0, policy_version 55040 (0.0005) [2023-03-09 06:25:45,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10171.8, 300 sec: 10191.4). Total num frames: 28200960. Throughput: 0: 10195.3. Samples: 28189872. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:25:45,122][635489] Avg episode reward: [(0, '3211.423')] [2023-03-09 06:25:45,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000055080_28200960.pth... [2023-03-09 06:25:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000054488_27897856.pth [2023-03-09 06:25:47,107][635776] Updated weights for policy 0, policy_version 55120 (0.0005) [2023-03-09 06:25:50,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10163.6). Total num frames: 28250112. Throughput: 0: 10145.3. Samples: 28250084. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:25:50,122][635489] Avg episode reward: [(0, '3277.762')] [2023-03-09 06:25:51,117][635776] Updated weights for policy 0, policy_version 55200 (0.0004) [2023-03-09 06:25:54,991][635776] Updated weights for policy 0, policy_version 55280 (0.0005) [2023-03-09 06:25:55,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10177.5). Total num frames: 28303360. Throughput: 0: 10181.1. Samples: 28282280. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:25:55,122][635489] Avg episode reward: [(0, '3328.106')] [2023-03-09 06:25:59,006][635776] Updated weights for policy 0, policy_version 55360 (0.0005) [2023-03-09 06:26:00,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 10177.5). Total num frames: 28352512. Throughput: 0: 10177.9. Samples: 28343572. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:26:00,122][635489] Avg episode reward: [(0, '3240.328')] [2023-03-09 06:26:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000055376_28352512.pth... [2023-03-09 06:26:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000054784_28049408.pth [2023-03-09 06:26:02,771][635776] Updated weights for policy 0, policy_version 55440 (0.0004) [2023-03-09 06:26:05,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10240.0, 300 sec: 10191.4). Total num frames: 28409856. Throughput: 0: 10192.2. Samples: 28407908. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:26:05,122][635489] Avg episode reward: [(0, '3547.169')] [2023-03-09 06:26:06,546][635776] Updated weights for policy 0, policy_version 55520 (0.0005) [2023-03-09 06:26:10,121][635489] Fps is (10 sec: 10649.7, 60 sec: 10240.0, 300 sec: 10191.4). Total num frames: 28459008. Throughput: 0: 10251.0. Samples: 28440516. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:26:10,122][635489] Avg episode reward: [(0, '3734.914')] [2023-03-09 06:26:10,528][635776] Updated weights for policy 0, policy_version 55600 (0.0005) [2023-03-09 06:26:14,406][635776] Updated weights for policy 0, policy_version 55680 (0.0005) [2023-03-09 06:26:15,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10191.4). Total num frames: 28512256. Throughput: 0: 10345.3. Samples: 28503560. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:26:15,122][635489] Avg episode reward: [(0, '3417.005')] [2023-03-09 06:26:15,129][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000055696_28516352.pth... [2023-03-09 06:26:15,131][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000055080_28200960.pth [2023-03-09 06:26:18,360][635776] Updated weights for policy 0, policy_version 55760 (0.0005) [2023-03-09 06:26:20,121][635489] Fps is (10 sec: 10649.6, 60 sec: 10240.0, 300 sec: 10191.4). Total num frames: 28565504. Throughput: 0: 10342.7. Samples: 28564480. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:26:20,122][635489] Avg episode reward: [(0, '3655.404')] [2023-03-09 06:26:22,383][635776] Updated weights for policy 0, policy_version 55840 (0.0005) [2023-03-09 06:26:25,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10191.4). Total num frames: 28614656. Throughput: 0: 10302.4. Samples: 28594460. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:26:25,122][635489] Avg episode reward: [(0, '3849.961')] [2023-03-09 06:26:26,440][635776] Updated weights for policy 0, policy_version 55920 (0.0005) [2023-03-09 06:26:30,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10240.0, 300 sec: 10191.4). Total num frames: 28663808. Throughput: 0: 10313.4. Samples: 28653976. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:26:30,122][635489] Avg episode reward: [(0, '3893.223')] [2023-03-09 06:26:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000055984_28663808.pth... [2023-03-09 06:26:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000055376_28352512.pth [2023-03-09 06:26:30,835][635776] Updated weights for policy 0, policy_version 56000 (0.0005) [2023-03-09 06:26:34,947][635776] Updated weights for policy 0, policy_version 56080 (0.0005) [2023-03-09 06:26:35,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10240.0, 300 sec: 10191.4). Total num frames: 28712960. Throughput: 0: 10276.4. Samples: 28712520. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:26:35,122][635489] Avg episode reward: [(0, '3927.507')] [2023-03-09 06:26:39,299][635776] Updated weights for policy 0, policy_version 56160 (0.0005) [2023-03-09 06:26:40,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10171.7, 300 sec: 10191.4). Total num frames: 28762112. Throughput: 0: 10206.7. Samples: 28741580. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:26:40,122][635489] Avg episode reward: [(0, '4258.072')] [2023-03-09 06:26:43,281][635776] Updated weights for policy 0, policy_version 56240 (0.0005) [2023-03-09 06:26:45,122][635489] Fps is (10 sec: 9830.2, 60 sec: 10171.7, 300 sec: 10191.4). Total num frames: 28811264. Throughput: 0: 10168.4. Samples: 28801152. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:26:45,122][635489] Avg episode reward: [(0, '3776.904')] [2023-03-09 06:26:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000056272_28811264.pth... [2023-03-09 06:26:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000055696_28516352.pth [2023-03-09 06:26:47,226][635776] Updated weights for policy 0, policy_version 56320 (0.0005) [2023-03-09 06:26:50,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10177.5). Total num frames: 28860416. Throughput: 0: 10055.7. Samples: 28860416. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:26:50,122][635489] Avg episode reward: [(0, '3183.346')] [2023-03-09 06:26:51,558][635776] Updated weights for policy 0, policy_version 56400 (0.0004) [2023-03-09 06:26:55,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10163.6). Total num frames: 28909568. Throughput: 0: 9968.4. Samples: 28889096. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:26:55,122][635489] Avg episode reward: [(0, '3431.343')] [2023-03-09 06:26:55,962][635776] Updated weights for policy 0, policy_version 56480 (0.0005) [2023-03-09 06:27:00,121][635489] Fps is (10 sec: 9420.9, 60 sec: 10035.2, 300 sec: 10149.7). Total num frames: 28954624. Throughput: 0: 9831.0. Samples: 28945956. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:27:00,122][635489] Avg episode reward: [(0, '3695.854')] [2023-03-09 06:27:00,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000056552_28954624.pth... [2023-03-09 06:27:00,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000055984_28663808.pth [2023-03-09 06:27:00,320][635776] Updated weights for policy 0, policy_version 56560 (0.0005) [2023-03-09 06:27:04,647][635776] Updated weights for policy 0, policy_version 56640 (0.0005) [2023-03-09 06:27:05,122][635489] Fps is (10 sec: 9420.8, 60 sec: 9898.7, 300 sec: 10135.9). Total num frames: 29003776. Throughput: 0: 9740.5. Samples: 29002804. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:27:05,122][635489] Avg episode reward: [(0, '4051.300')] [2023-03-09 06:27:08,847][635776] Updated weights for policy 0, policy_version 56720 (0.0005) [2023-03-09 06:27:10,122][635489] Fps is (10 sec: 9830.3, 60 sec: 9898.7, 300 sec: 10135.9). Total num frames: 29052928. Throughput: 0: 9722.5. Samples: 29031972. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:27:10,122][635489] Avg episode reward: [(0, '3989.419')] [2023-03-09 06:27:13,317][635776] Updated weights for policy 0, policy_version 56800 (0.0005) [2023-03-09 06:27:15,121][635489] Fps is (10 sec: 9011.2, 60 sec: 9693.9, 300 sec: 10094.2). Total num frames: 29093888. Throughput: 0: 9613.8. Samples: 29086596. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:27:15,122][635489] Avg episode reward: [(0, '3357.158')] [2023-03-09 06:27:15,155][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000056832_29097984.pth... [2023-03-09 06:27:15,156][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000056272_28811264.pth [2023-03-09 06:27:17,655][635776] Updated weights for policy 0, policy_version 56880 (0.0005) [2023-03-09 06:27:20,122][635489] Fps is (10 sec: 9011.2, 60 sec: 9625.6, 300 sec: 10108.1). Total num frames: 29143040. Throughput: 0: 9566.9. Samples: 29143032. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:27:20,122][635489] Avg episode reward: [(0, '3514.043')] [2023-03-09 06:27:21,909][635776] Updated weights for policy 0, policy_version 56960 (0.0005) [2023-03-09 06:27:25,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 10108.1). Total num frames: 29192192. Throughput: 0: 9601.9. Samples: 29173668. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:27:25,122][635489] Avg episode reward: [(0, '3885.572')] [2023-03-09 06:27:26,209][635776] Updated weights for policy 0, policy_version 57040 (0.0005) [2023-03-09 06:27:30,121][635489] Fps is (10 sec: 9830.5, 60 sec: 9625.6, 300 sec: 10108.1). Total num frames: 29241344. Throughput: 0: 9576.1. Samples: 29232076. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:27:30,122][635489] Avg episode reward: [(0, '3760.810')] [2023-03-09 06:27:30,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000057112_29241344.pth... [2023-03-09 06:27:30,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000056552_28954624.pth [2023-03-09 06:27:30,314][635776] Updated weights for policy 0, policy_version 57120 (0.0005) [2023-03-09 06:27:34,378][635776] Updated weights for policy 0, policy_version 57200 (0.0005) [2023-03-09 06:27:35,121][635489] Fps is (10 sec: 9830.5, 60 sec: 9625.6, 300 sec: 10108.1). Total num frames: 29290496. Throughput: 0: 9573.4. Samples: 29291216. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:27:35,122][635489] Avg episode reward: [(0, '3891.214')] [2023-03-09 06:27:38,608][635776] Updated weights for policy 0, policy_version 57280 (0.0005) [2023-03-09 06:27:40,121][635489] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 10108.1). Total num frames: 29339648. Throughput: 0: 9585.3. Samples: 29320432. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:27:40,122][635489] Avg episode reward: [(0, '3693.227')] [2023-03-09 06:27:42,775][635776] Updated weights for policy 0, policy_version 57360 (0.0005) [2023-03-09 06:27:45,122][635489] Fps is (10 sec: 9830.3, 60 sec: 9625.6, 300 sec: 10094.2). Total num frames: 29388800. Throughput: 0: 9655.4. Samples: 29380452. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:27:45,122][635489] Avg episode reward: [(0, '3752.842')] [2023-03-09 06:27:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000057400_29388800.pth... [2023-03-09 06:27:45,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000056832_29097984.pth [2023-03-09 06:27:46,854][635776] Updated weights for policy 0, policy_version 57440 (0.0004) [2023-03-09 06:27:50,122][635489] Fps is (10 sec: 10240.0, 60 sec: 9693.9, 300 sec: 10108.1). Total num frames: 29442048. Throughput: 0: 9755.0. Samples: 29441780. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 06:27:50,122][635489] Avg episode reward: [(0, '3828.810')] [2023-03-09 06:27:50,782][635776] Updated weights for policy 0, policy_version 57520 (0.0005) [2023-03-09 06:27:54,731][635776] Updated weights for policy 0, policy_version 57600 (0.0005) [2023-03-09 06:27:55,122][635489] Fps is (10 sec: 10240.1, 60 sec: 9693.9, 300 sec: 10094.2). Total num frames: 29491200. Throughput: 0: 9772.4. Samples: 29471732. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 06:27:55,122][635489] Avg episode reward: [(0, '3920.011')] [2023-03-09 06:27:58,863][635776] Updated weights for policy 0, policy_version 57680 (0.0005) [2023-03-09 06:28:00,122][635489] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 10108.1). Total num frames: 29544448. Throughput: 0: 9902.8. Samples: 29532224. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 06:28:00,122][635489] Avg episode reward: [(0, '3862.828')] [2023-03-09 06:28:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000057704_29544448.pth... [2023-03-09 06:28:00,129][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000057112_29241344.pth [2023-03-09 06:28:02,920][635776] Updated weights for policy 0, policy_version 57760 (0.0005) [2023-03-09 06:28:05,122][635489] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 10094.2). Total num frames: 29593600. Throughput: 0: 10005.7. Samples: 29593288. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 06:28:05,122][635489] Avg episode reward: [(0, '3922.473')] [2023-03-09 06:28:06,849][635776] Updated weights for policy 0, policy_version 57840 (0.0005) [2023-03-09 06:28:10,121][635489] Fps is (10 sec: 9830.5, 60 sec: 9830.4, 300 sec: 10066.4). Total num frames: 29642752. Throughput: 0: 9999.6. Samples: 29623648. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 06:28:10,122][635489] Avg episode reward: [(0, '3691.821')] [2023-03-09 06:28:11,044][635776] Updated weights for policy 0, policy_version 57920 (0.0005) [2023-03-09 06:28:15,121][635489] Fps is (10 sec: 9830.5, 60 sec: 9966.9, 300 sec: 10066.4). Total num frames: 29691904. Throughput: 0: 10036.2. Samples: 29683704. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 06:28:15,122][635489] Avg episode reward: [(0, '3947.549')] [2023-03-09 06:28:15,163][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000058000_29696000.pth... [2023-03-09 06:28:15,164][635776] Updated weights for policy 0, policy_version 58000 (0.0005) [2023-03-09 06:28:15,165][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000057400_29388800.pth [2023-03-09 06:28:19,492][635776] Updated weights for policy 0, policy_version 58080 (0.0005) [2023-03-09 06:28:20,122][635489] Fps is (10 sec: 9830.3, 60 sec: 9966.9, 300 sec: 10052.6). Total num frames: 29741056. Throughput: 0: 9996.6. Samples: 29741064. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 06:28:20,122][635489] Avg episode reward: [(0, '4097.583')] [2023-03-09 06:28:23,510][635776] Updated weights for policy 0, policy_version 58160 (0.0005) [2023-03-09 06:28:25,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10035.2, 300 sec: 10052.6). Total num frames: 29794304. Throughput: 0: 10034.0. Samples: 29771964. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:28:25,122][635489] Avg episode reward: [(0, '4102.119')] [2023-03-09 06:28:27,374][635776] Updated weights for policy 0, policy_version 58240 (0.0005) [2023-03-09 06:28:30,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10103.4, 300 sec: 10052.6). Total num frames: 29847552. Throughput: 0: 10106.8. Samples: 29835260. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:28:30,122][635489] Avg episode reward: [(0, '3840.377')] [2023-03-09 06:28:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000058296_29847552.pth... [2023-03-09 06:28:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000057704_29544448.pth [2023-03-09 06:28:31,371][635776] Updated weights for policy 0, policy_version 58320 (0.0004) [2023-03-09 06:28:35,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10038.7). Total num frames: 29896704. Throughput: 0: 10083.8. Samples: 29895552. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:28:35,122][635489] Avg episode reward: [(0, '3682.211')] [2023-03-09 06:28:35,405][635776] Updated weights for policy 0, policy_version 58400 (0.0005) [2023-03-09 06:28:39,390][635776] Updated weights for policy 0, policy_version 58480 (0.0004) [2023-03-09 06:28:40,122][635489] Fps is (10 sec: 9830.5, 60 sec: 10103.5, 300 sec: 10038.7). Total num frames: 29945856. Throughput: 0: 10081.2. Samples: 29925384. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:28:40,122][635489] Avg episode reward: [(0, '3860.748')] [2023-03-09 06:28:43,300][635776] Updated weights for policy 0, policy_version 58560 (0.0005) [2023-03-09 06:28:45,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10052.6). Total num frames: 29999104. Throughput: 0: 10152.8. Samples: 29989100. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:28:45,122][635489] Avg episode reward: [(0, '4046.882')] [2023-03-09 06:28:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000058592_29999104.pth... [2023-03-09 06:28:45,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000058000_29696000.pth [2023-03-09 06:28:47,401][635776] Updated weights for policy 0, policy_version 58640 (0.0005) [2023-03-09 06:28:50,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10038.7). Total num frames: 30048256. Throughput: 0: 10138.7. Samples: 30049528. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:28:50,122][635489] Avg episode reward: [(0, '3962.548')] [2023-03-09 06:28:51,388][635776] Updated weights for policy 0, policy_version 58720 (0.0005) [2023-03-09 06:28:55,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10052.6). Total num frames: 30101504. Throughput: 0: 10163.9. Samples: 30081024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:28:55,122][635489] Avg episode reward: [(0, '4098.167')] [2023-03-09 06:28:55,294][635776] Updated weights for policy 0, policy_version 58800 (0.0005) [2023-03-09 06:28:59,407][635776] Updated weights for policy 0, policy_version 58880 (0.0005) [2023-03-09 06:29:00,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10052.6). Total num frames: 30150656. Throughput: 0: 10193.5. Samples: 30142412. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:29:00,122][635489] Avg episode reward: [(0, '4250.570')] [2023-03-09 06:29:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000058888_30150656.pth... [2023-03-09 06:29:00,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000058296_29847552.pth [2023-03-09 06:29:03,252][635776] Updated weights for policy 0, policy_version 58960 (0.0005) [2023-03-09 06:29:05,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10066.4). Total num frames: 30203904. Throughput: 0: 10313.8. Samples: 30205184. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 06:29:05,122][635489] Avg episode reward: [(0, '4176.689')] [2023-03-09 06:29:07,162][635776] Updated weights for policy 0, policy_version 59040 (0.0004) [2023-03-09 06:29:10,122][635489] Fps is (10 sec: 11059.2, 60 sec: 10308.3, 300 sec: 10080.3). Total num frames: 30261248. Throughput: 0: 10327.0. Samples: 30236680. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 06:29:10,122][635489] Avg episode reward: [(0, '4218.780')] [2023-03-09 06:29:10,920][635776] Updated weights for policy 0, policy_version 59120 (0.0005) [2023-03-09 06:29:14,784][635776] Updated weights for policy 0, policy_version 59200 (0.0005) [2023-03-09 06:29:15,122][635489] Fps is (10 sec: 11059.1, 60 sec: 10376.5, 300 sec: 10094.2). Total num frames: 30314496. Throughput: 0: 10324.0. Samples: 30299840. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 06:29:15,122][635489] Avg episode reward: [(0, '4370.035')] [2023-03-09 06:29:15,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000059208_30314496.pth... [2023-03-09 06:29:15,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000058592_29999104.pth [2023-03-09 06:29:18,474][635776] Updated weights for policy 0, policy_version 59280 (0.0005) [2023-03-09 06:29:20,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10108.1). Total num frames: 30367744. Throughput: 0: 10446.8. Samples: 30365660. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 06:29:20,122][635489] Avg episode reward: [(0, '4062.141')] [2023-03-09 06:29:22,437][635776] Updated weights for policy 0, policy_version 59360 (0.0005) [2023-03-09 06:29:25,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10376.6, 300 sec: 10094.2). Total num frames: 30416896. Throughput: 0: 10490.2. Samples: 30397440. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 06:29:25,122][635489] Avg episode reward: [(0, '4355.316')] [2023-03-09 06:29:26,381][635776] Updated weights for policy 0, policy_version 59440 (0.0005) [2023-03-09 06:29:30,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 10108.1). Total num frames: 30470144. Throughput: 0: 10417.0. Samples: 30457864. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 06:29:30,122][635489] Avg episode reward: [(0, '4288.213')] [2023-03-09 06:29:30,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000059512_30470144.pth... [2023-03-09 06:29:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000058888_30150656.pth [2023-03-09 06:29:30,478][635776] Updated weights for policy 0, policy_version 59520 (0.0005) [2023-03-09 06:29:34,409][635776] Updated weights for policy 0, policy_version 59600 (0.0005) [2023-03-09 06:29:35,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 10108.1). Total num frames: 30519296. Throughput: 0: 10440.7. Samples: 30519360. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 06:29:35,122][635489] Avg episode reward: [(0, '4160.094')] [2023-03-09 06:29:38,461][635776] Updated weights for policy 0, policy_version 59680 (0.0006) [2023-03-09 06:29:40,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10376.5, 300 sec: 10094.2). Total num frames: 30568448. Throughput: 0: 10413.7. Samples: 30549640. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:29:40,122][635489] Avg episode reward: [(0, '4419.583')] [2023-03-09 06:29:42,661][635776] Updated weights for policy 0, policy_version 59760 (0.0005) [2023-03-09 06:29:45,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10308.3, 300 sec: 10094.2). Total num frames: 30617600. Throughput: 0: 10378.0. Samples: 30609420. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:29:45,122][635489] Avg episode reward: [(0, '4352.847')] [2023-03-09 06:29:45,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000059800_30617600.pth... [2023-03-09 06:29:45,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000059208_30314496.pth [2023-03-09 06:29:46,892][635776] Updated weights for policy 0, policy_version 59840 (0.0005) [2023-03-09 06:29:50,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10094.2). Total num frames: 30670848. Throughput: 0: 10316.5. Samples: 30669428. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:29:50,122][635489] Avg episode reward: [(0, '4350.945')] [2023-03-09 06:29:50,919][635776] Updated weights for policy 0, policy_version 59920 (0.0005) [2023-03-09 06:29:54,716][635776] Updated weights for policy 0, policy_version 60000 (0.0005) [2023-03-09 06:29:55,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10094.2). Total num frames: 30720000. Throughput: 0: 10286.8. Samples: 30699584. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:29:55,122][635489] Avg episode reward: [(0, '4306.459')] [2023-03-09 06:29:58,652][635776] Updated weights for policy 0, policy_version 60080 (0.0005) [2023-03-09 06:30:00,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 10094.2). Total num frames: 30773248. Throughput: 0: 10305.2. Samples: 30763576. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:30:00,122][635489] Avg episode reward: [(0, '4413.131')] [2023-03-09 06:30:00,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000060104_30773248.pth... [2023-03-09 06:30:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000059512_30470144.pth [2023-03-09 06:30:02,819][635776] Updated weights for policy 0, policy_version 60160 (0.0004) [2023-03-09 06:30:05,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10308.3, 300 sec: 10094.2). Total num frames: 30822400. Throughput: 0: 10146.8. Samples: 30822268. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:30:05,122][635489] Avg episode reward: [(0, '4252.006')] [2023-03-09 06:30:07,051][635776] Updated weights for policy 0, policy_version 60240 (0.0005) [2023-03-09 06:30:10,122][635489] Fps is (10 sec: 9830.5, 60 sec: 10171.7, 300 sec: 10080.3). Total num frames: 30871552. Throughput: 0: 10082.1. Samples: 30851136. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:30:10,122][635489] Avg episode reward: [(0, '4391.196')] [2023-03-09 06:30:11,215][635776] Updated weights for policy 0, policy_version 60320 (0.0005) [2023-03-09 06:30:15,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10103.5, 300 sec: 10066.4). Total num frames: 30920704. Throughput: 0: 10075.9. Samples: 30911276. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:30:15,122][635489] Avg episode reward: [(0, '4540.171')] [2023-03-09 06:30:15,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000060392_30920704.pth... [2023-03-09 06:30:15,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000059800_30617600.pth [2023-03-09 06:30:15,216][635776] Updated weights for policy 0, policy_version 60400 (0.0005) [2023-03-09 06:30:19,394][635776] Updated weights for policy 0, policy_version 60480 (0.0004) [2023-03-09 06:30:20,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10066.4). Total num frames: 30969856. Throughput: 0: 10012.6. Samples: 30969928. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:30:20,122][635489] Avg episode reward: [(0, '4500.268')] [2023-03-09 06:30:23,643][635776] Updated weights for policy 0, policy_version 60560 (0.0004) [2023-03-09 06:30:25,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10066.4). Total num frames: 31019008. Throughput: 0: 9983.2. Samples: 30998884. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:30:25,122][635489] Avg episode reward: [(0, '4565.211')] [2023-03-09 06:30:27,738][635776] Updated weights for policy 0, policy_version 60640 (0.0005) [2023-03-09 06:30:30,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10035.2, 300 sec: 10080.3). Total num frames: 31072256. Throughput: 0: 10013.6. Samples: 31060032. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:30:30,122][635489] Avg episode reward: [(0, '4548.228')] [2023-03-09 06:30:30,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000060688_31072256.pth... [2023-03-09 06:30:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000060104_30773248.pth [2023-03-09 06:30:31,756][635776] Updated weights for policy 0, policy_version 60720 (0.0005) [2023-03-09 06:30:35,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10035.2, 300 sec: 10066.4). Total num frames: 31121408. Throughput: 0: 10041.4. Samples: 31121292. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:30:35,122][635489] Avg episode reward: [(0, '4418.214')] [2023-03-09 06:30:35,657][635776] Updated weights for policy 0, policy_version 60800 (0.0004) [2023-03-09 06:30:39,536][635776] Updated weights for policy 0, policy_version 60880 (0.0004) [2023-03-09 06:30:40,122][635489] Fps is (10 sec: 10240.1, 60 sec: 10103.5, 300 sec: 10080.3). Total num frames: 31174656. Throughput: 0: 10102.0. Samples: 31154176. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:30:40,122][635489] Avg episode reward: [(0, '4270.221')] [2023-03-09 06:30:43,585][635776] Updated weights for policy 0, policy_version 60960 (0.0005) [2023-03-09 06:30:45,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10103.5, 300 sec: 10080.3). Total num frames: 31223808. Throughput: 0: 10043.9. Samples: 31215552. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:30:45,122][635489] Avg episode reward: [(0, '4390.209')] [2023-03-09 06:30:45,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000060984_31223808.pth... [2023-03-09 06:30:45,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000060392_30920704.pth [2023-03-09 06:30:47,776][635776] Updated weights for policy 0, policy_version 61040 (0.0005) [2023-03-09 06:30:50,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10080.3). Total num frames: 31277056. Throughput: 0: 10055.5. Samples: 31274764. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:30:50,122][635489] Avg episode reward: [(0, '4447.261')] [2023-03-09 06:30:51,730][635776] Updated weights for policy 0, policy_version 61120 (0.0004) [2023-03-09 06:30:55,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 10080.3). Total num frames: 31326208. Throughput: 0: 10083.7. Samples: 31304904. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 06:30:55,122][635489] Avg episode reward: [(0, '4424.744')] [2023-03-09 06:30:55,840][635776] Updated weights for policy 0, policy_version 61200 (0.0005) [2023-03-09 06:30:59,653][635776] Updated weights for policy 0, policy_version 61280 (0.0005) [2023-03-09 06:31:00,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10052.6). Total num frames: 31375360. Throughput: 0: 10132.3. Samples: 31367232. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 06:31:00,122][635489] Avg episode reward: [(0, '4421.971')] [2023-03-09 06:31:00,128][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000061288_31379456.pth... [2023-03-09 06:31:00,130][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000060688_31072256.pth [2023-03-09 06:31:03,687][635776] Updated weights for policy 0, policy_version 61360 (0.0005) [2023-03-09 06:31:05,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10066.4). Total num frames: 31428608. Throughput: 0: 10192.9. Samples: 31428608. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 06:31:05,122][635489] Avg episode reward: [(0, '4360.609')] [2023-03-09 06:31:07,770][635776] Updated weights for policy 0, policy_version 61440 (0.0005) [2023-03-09 06:31:10,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10052.6). Total num frames: 31477760. Throughput: 0: 10214.4. Samples: 31458532. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 06:31:10,122][635489] Avg episode reward: [(0, '4467.114')] [2023-03-09 06:31:11,755][635776] Updated weights for policy 0, policy_version 61520 (0.0004) [2023-03-09 06:31:15,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 10052.6). Total num frames: 31531008. Throughput: 0: 10282.9. Samples: 31522760. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 06:31:15,122][635489] Avg episode reward: [(0, '4491.736')] [2023-03-09 06:31:15,129][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000061592_31535104.pth... [2023-03-09 06:31:15,132][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000060984_31223808.pth [2023-03-09 06:31:15,532][635776] Updated weights for policy 0, policy_version 61600 (0.0005) [2023-03-09 06:31:19,219][635776] Updated weights for policy 0, policy_version 61680 (0.0004) [2023-03-09 06:31:20,121][635489] Fps is (10 sec: 11059.2, 60 sec: 10308.3, 300 sec: 10080.3). Total num frames: 31588352. Throughput: 0: 10371.4. Samples: 31588004. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 06:31:20,122][635489] Avg episode reward: [(0, '4227.852')] [2023-03-09 06:31:23,130][635776] Updated weights for policy 0, policy_version 61760 (0.0005) [2023-03-09 06:31:25,122][635489] Fps is (10 sec: 11059.2, 60 sec: 10376.5, 300 sec: 10094.2). Total num frames: 31641600. Throughput: 0: 10296.2. Samples: 31617504. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 06:31:25,122][635489] Avg episode reward: [(0, '4150.916')] [2023-03-09 06:31:27,107][635776] Updated weights for policy 0, policy_version 61840 (0.0005) [2023-03-09 06:31:30,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10308.3, 300 sec: 10094.2). Total num frames: 31690752. Throughput: 0: 10328.8. Samples: 31680348. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 06:31:30,122][635489] Avg episode reward: [(0, '4356.739')] [2023-03-09 06:31:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000061896_31690752.pth... [2023-03-09 06:31:30,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000061288_31379456.pth [2023-03-09 06:31:31,044][635776] Updated weights for policy 0, policy_version 61920 (0.0005) [2023-03-09 06:31:35,099][635776] Updated weights for policy 0, policy_version 62000 (0.0005) [2023-03-09 06:31:35,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10108.1). Total num frames: 31744000. Throughput: 0: 10388.7. Samples: 31742256. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 06:31:35,122][635489] Avg episode reward: [(0, '4325.794')] [2023-03-09 06:31:39,366][635776] Updated weights for policy 0, policy_version 62080 (0.0004) [2023-03-09 06:31:40,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10240.0, 300 sec: 10094.2). Total num frames: 31789056. Throughput: 0: 10347.0. Samples: 31770520. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 06:31:40,122][635489] Avg episode reward: [(0, '4509.570')] [2023-03-09 06:31:43,517][635776] Updated weights for policy 0, policy_version 62160 (0.0005) [2023-03-09 06:31:45,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10308.2, 300 sec: 10108.1). Total num frames: 31842304. Throughput: 0: 10279.4. Samples: 31829804. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 06:31:45,122][635489] Avg episode reward: [(0, '4531.262')] [2023-03-09 06:31:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000062192_31842304.pth... [2023-03-09 06:31:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000061592_31535104.pth [2023-03-09 06:31:47,652][635776] Updated weights for policy 0, policy_version 62240 (0.0005) [2023-03-09 06:31:50,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10108.1). Total num frames: 31891456. Throughput: 0: 10256.3. Samples: 31890140. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 06:31:50,132][635489] Avg episode reward: [(0, '4508.609')] [2023-03-09 06:31:51,442][635776] Updated weights for policy 0, policy_version 62320 (0.0005) [2023-03-09 06:31:55,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 10122.0). Total num frames: 31940608. Throughput: 0: 10334.2. Samples: 31923572. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 06:31:55,133][635489] Avg episode reward: [(0, '4482.283')] [2023-03-09 06:31:55,553][635776] Updated weights for policy 0, policy_version 62400 (0.0005) [2023-03-09 06:31:59,521][635776] Updated weights for policy 0, policy_version 62480 (0.0005) [2023-03-09 06:32:00,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10135.9). Total num frames: 31993856. Throughput: 0: 10241.9. Samples: 31983644. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 06:32:00,132][635489] Avg episode reward: [(0, '4474.384')] [2023-03-09 06:32:00,135][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000062488_31993856.pth... [2023-03-09 06:32:00,138][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000061896_31690752.pth [2023-03-09 06:32:03,849][635776] Updated weights for policy 0, policy_version 62560 (0.0005) [2023-03-09 06:32:05,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10171.7, 300 sec: 10122.0). Total num frames: 32038912. Throughput: 0: 10045.0. Samples: 32040028. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 06:32:05,132][635489] Avg episode reward: [(0, '4371.299')] [2023-03-09 06:32:08,025][635776] Updated weights for policy 0, policy_version 62640 (0.0005) [2023-03-09 06:32:10,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 10163.6). Total num frames: 32092160. Throughput: 0: 10056.2. Samples: 32070032. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 06:32:10,133][635489] Avg episode reward: [(0, '4372.357')] [2023-03-09 06:32:12,090][635776] Updated weights for policy 0, policy_version 62720 (0.0005) [2023-03-09 06:32:15,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10163.6). Total num frames: 32141312. Throughput: 0: 10016.3. Samples: 32131080. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 06:32:15,132][635489] Avg episode reward: [(0, '3981.966')] [2023-03-09 06:32:15,135][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000062776_32141312.pth... [2023-03-09 06:32:15,136][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000062192_31842304.pth [2023-03-09 06:32:16,026][635776] Updated weights for policy 0, policy_version 62800 (0.0005) [2023-03-09 06:32:20,051][635776] Updated weights for policy 0, policy_version 62880 (0.0005) [2023-03-09 06:32:20,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10177.5). Total num frames: 32194560. Throughput: 0: 10024.9. Samples: 32193376. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 06:32:20,133][635489] Avg episode reward: [(0, '3855.890')] [2023-03-09 06:32:24,185][635776] Updated weights for policy 0, policy_version 62960 (0.0005) [2023-03-09 06:32:25,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10035.2, 300 sec: 10177.5). Total num frames: 32243712. Throughput: 0: 10061.7. Samples: 32223296. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 06:32:25,133][635489] Avg episode reward: [(0, '4520.469')] [2023-03-09 06:32:28,279][635776] Updated weights for policy 0, policy_version 63040 (0.0004) [2023-03-09 06:32:30,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10035.2, 300 sec: 10177.5). Total num frames: 32292864. Throughput: 0: 10081.9. Samples: 32283492. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 06:32:30,133][635489] Avg episode reward: [(0, '4414.350')] [2023-03-09 06:32:30,137][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000063072_32292864.pth... [2023-03-09 06:32:30,139][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000062488_31993856.pth [2023-03-09 06:32:32,437][635776] Updated weights for policy 0, policy_version 63120 (0.0005) [2023-03-09 06:32:35,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 10177.5). Total num frames: 32342016. Throughput: 0: 10032.6. Samples: 32341608. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 06:32:35,133][635489] Avg episode reward: [(0, '3844.094')] [2023-03-09 06:32:36,766][635776] Updated weights for policy 0, policy_version 63200 (0.0005) [2023-03-09 06:32:40,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10177.5). Total num frames: 32391168. Throughput: 0: 9934.8. Samples: 32370640. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 06:32:40,122][635489] Avg episode reward: [(0, '3402.147')] [2023-03-09 06:32:40,962][635776] Updated weights for policy 0, policy_version 63280 (0.0005) [2023-03-09 06:32:45,121][635489] Fps is (10 sec: 9420.9, 60 sec: 9898.7, 300 sec: 10149.8). Total num frames: 32436224. Throughput: 0: 9876.7. Samples: 32428096. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 06:32:45,122][635489] Avg episode reward: [(0, '3514.034')] [2023-03-09 06:32:45,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000063352_32436224.pth... [2023-03-09 06:32:45,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000062776_32141312.pth [2023-03-09 06:32:45,213][635776] Updated weights for policy 0, policy_version 63360 (0.0005) [2023-03-09 06:32:49,509][635776] Updated weights for policy 0, policy_version 63440 (0.0005) [2023-03-09 06:32:50,122][635489] Fps is (10 sec: 9420.8, 60 sec: 9898.7, 300 sec: 10149.7). Total num frames: 32485376. Throughput: 0: 9895.1. Samples: 32485308. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:32:50,122][635489] Avg episode reward: [(0, '3675.248')] [2023-03-09 06:32:53,592][635776] Updated weights for policy 0, policy_version 63520 (0.0006) [2023-03-09 06:32:55,122][635489] Fps is (10 sec: 9830.3, 60 sec: 9898.7, 300 sec: 10135.9). Total num frames: 32534528. Throughput: 0: 9890.5. Samples: 32515104. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:32:55,122][635489] Avg episode reward: [(0, '3840.342')] [2023-03-09 06:32:57,668][635776] Updated weights for policy 0, policy_version 63600 (0.0006) [2023-03-09 06:33:00,121][635489] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10135.9). Total num frames: 32583680. Throughput: 0: 9875.7. Samples: 32575488. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:33:00,122][635489] Avg episode reward: [(0, '3952.481')] [2023-03-09 06:33:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000063640_32583680.pth... [2023-03-09 06:33:00,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000063072_32292864.pth [2023-03-09 06:33:01,896][635776] Updated weights for policy 0, policy_version 63680 (0.0005) [2023-03-09 06:33:05,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10135.9). Total num frames: 32632832. Throughput: 0: 9772.6. Samples: 32633144. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:33:05,122][635489] Avg episode reward: [(0, '3520.260')] [2023-03-09 06:33:06,074][635776] Updated weights for policy 0, policy_version 63760 (0.0005) [2023-03-09 06:33:10,121][635489] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10135.9). Total num frames: 32681984. Throughput: 0: 9739.4. Samples: 32661568. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:33:10,122][635489] Avg episode reward: [(0, '4051.765')] [2023-03-09 06:33:10,392][635776] Updated weights for policy 0, policy_version 63840 (0.0005) [2023-03-09 06:33:14,669][635776] Updated weights for policy 0, policy_version 63920 (0.0004) [2023-03-09 06:33:15,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10135.9). Total num frames: 32731136. Throughput: 0: 9685.8. Samples: 32719352. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:33:15,122][635489] Avg episode reward: [(0, '4163.736')] [2023-03-09 06:33:15,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000063928_32731136.pth... [2023-03-09 06:33:15,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000063352_32436224.pth [2023-03-09 06:33:18,943][635776] Updated weights for policy 0, policy_version 64000 (0.0005) [2023-03-09 06:33:20,121][635489] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 10108.1). Total num frames: 32776192. Throughput: 0: 9680.2. Samples: 32777216. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:33:20,122][635489] Avg episode reward: [(0, '3565.003')] [2023-03-09 06:33:23,171][635776] Updated weights for policy 0, policy_version 64080 (0.0004) [2023-03-09 06:33:25,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 10108.1). Total num frames: 32829440. Throughput: 0: 9664.6. Samples: 32805548. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:33:25,122][635489] Avg episode reward: [(0, '3479.320')] [2023-03-09 06:33:27,172][635776] Updated weights for policy 0, policy_version 64160 (0.0005) [2023-03-09 06:33:30,122][635489] Fps is (10 sec: 10239.9, 60 sec: 9762.1, 300 sec: 10108.1). Total num frames: 32878592. Throughput: 0: 9762.0. Samples: 32867388. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:33:30,122][635489] Avg episode reward: [(0, '4258.433')] [2023-03-09 06:33:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000064216_32878592.pth... [2023-03-09 06:33:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000063640_32583680.pth [2023-03-09 06:33:31,294][635776] Updated weights for policy 0, policy_version 64240 (0.0006) [2023-03-09 06:33:34,985][635776] Updated weights for policy 0, policy_version 64320 (0.0005) [2023-03-09 06:33:35,122][635489] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 10122.0). Total num frames: 32931840. Throughput: 0: 9911.9. Samples: 32931344. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:33:35,122][635489] Avg episode reward: [(0, '4328.704')] [2023-03-09 06:33:39,063][635776] Updated weights for policy 0, policy_version 64400 (0.0005) [2023-03-09 06:33:40,121][635489] Fps is (10 sec: 10240.1, 60 sec: 9830.4, 300 sec: 10108.1). Total num frames: 32980992. Throughput: 0: 9924.1. Samples: 32961688. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:33:40,122][635489] Avg episode reward: [(0, '4116.219')] [2023-03-09 06:33:43,093][635776] Updated weights for policy 0, policy_version 64480 (0.0005) [2023-03-09 06:33:45,121][635489] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10108.1). Total num frames: 33030144. Throughput: 0: 9934.0. Samples: 33022516. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:33:45,122][635489] Avg episode reward: [(0, '4212.236')] [2023-03-09 06:33:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000064512_33030144.pth... [2023-03-09 06:33:45,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000063928_32731136.pth [2023-03-09 06:33:47,405][635776] Updated weights for policy 0, policy_version 64560 (0.0006) [2023-03-09 06:33:50,122][635489] Fps is (10 sec: 9830.3, 60 sec: 9898.7, 300 sec: 10094.2). Total num frames: 33079296. Throughput: 0: 9878.3. Samples: 33077668. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:33:50,122][635489] Avg episode reward: [(0, '4327.275')] [2023-03-09 06:33:51,890][635776] Updated weights for policy 0, policy_version 64640 (0.0004) [2023-03-09 06:33:55,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10094.2). Total num frames: 33128448. Throughput: 0: 9901.6. Samples: 33107140. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:33:55,122][635489] Avg episode reward: [(0, '4267.688')] [2023-03-09 06:33:55,671][635776] Updated weights for policy 0, policy_version 64720 (0.0005) [2023-03-09 06:33:59,959][635776] Updated weights for policy 0, policy_version 64800 (0.0005) [2023-03-09 06:34:00,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10080.3). Total num frames: 33177600. Throughput: 0: 9965.9. Samples: 33167816. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:34:00,122][635489] Avg episode reward: [(0, '4426.755')] [2023-03-09 06:34:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000064800_33177600.pth... [2023-03-09 06:34:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000064216_32878592.pth [2023-03-09 06:34:03,783][635776] Updated weights for policy 0, policy_version 64880 (0.0005) [2023-03-09 06:34:05,121][635489] Fps is (10 sec: 9830.5, 60 sec: 9898.7, 300 sec: 10052.6). Total num frames: 33226752. Throughput: 0: 10036.5. Samples: 33228860. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:34:05,122][635489] Avg episode reward: [(0, '4366.195')] [2023-03-09 06:34:08,208][635776] Updated weights for policy 0, policy_version 64960 (0.0005) [2023-03-09 06:34:10,121][635489] Fps is (10 sec: 9830.5, 60 sec: 9898.7, 300 sec: 10038.7). Total num frames: 33275904. Throughput: 0: 10027.7. Samples: 33256792. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 06:34:10,122][635489] Avg episode reward: [(0, '4410.436')] [2023-03-09 06:34:12,109][635776] Updated weights for policy 0, policy_version 65040 (0.0004) [2023-03-09 06:34:15,122][635489] Fps is (10 sec: 10239.9, 60 sec: 9966.9, 300 sec: 10038.7). Total num frames: 33329152. Throughput: 0: 9997.9. Samples: 33317292. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 06:34:15,122][635489] Avg episode reward: [(0, '4388.534')] [2023-03-09 06:34:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000065096_33329152.pth... [2023-03-09 06:34:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000064512_33030144.pth [2023-03-09 06:34:16,171][635776] Updated weights for policy 0, policy_version 65120 (0.0005) [2023-03-09 06:34:20,121][635489] Fps is (10 sec: 10239.9, 60 sec: 10035.2, 300 sec: 10038.7). Total num frames: 33378304. Throughput: 0: 9904.3. Samples: 33377036. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 06:34:20,122][635489] Avg episode reward: [(0, '4418.184')] [2023-03-09 06:34:20,354][635776] Updated weights for policy 0, policy_version 65200 (0.0004) [2023-03-09 06:34:24,118][635776] Updated weights for policy 0, policy_version 65280 (0.0005) [2023-03-09 06:34:25,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10038.7). Total num frames: 33431552. Throughput: 0: 9938.6. Samples: 33408928. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 06:34:25,122][635489] Avg episode reward: [(0, '4551.862')] [2023-03-09 06:34:28,213][635776] Updated weights for policy 0, policy_version 65360 (0.0004) [2023-03-09 06:34:30,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10035.2, 300 sec: 10038.7). Total num frames: 33480704. Throughput: 0: 9981.8. Samples: 33471700. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 06:34:30,122][635489] Avg episode reward: [(0, '4265.187')] [2023-03-09 06:34:30,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000065392_33480704.pth... [2023-03-09 06:34:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000064800_33177600.pth [2023-03-09 06:34:32,343][635776] Updated weights for policy 0, policy_version 65440 (0.0005) [2023-03-09 06:34:35,121][635489] Fps is (10 sec: 9830.5, 60 sec: 9966.9, 300 sec: 10038.7). Total num frames: 33529856. Throughput: 0: 10054.0. Samples: 33530096. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 06:34:35,122][635489] Avg episode reward: [(0, '4254.704')] [2023-03-09 06:34:36,529][635776] Updated weights for policy 0, policy_version 65520 (0.0005) [2023-03-09 06:34:40,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10052.6). Total num frames: 33583104. Throughput: 0: 10032.3. Samples: 33558592. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 06:34:40,122][635489] Avg episode reward: [(0, '4301.276')] [2023-03-09 06:34:40,353][635776] Updated weights for policy 0, policy_version 65600 (0.0005) [2023-03-09 06:34:44,341][635776] Updated weights for policy 0, policy_version 65680 (0.0004) [2023-03-09 06:34:45,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10103.5, 300 sec: 10052.6). Total num frames: 33636352. Throughput: 0: 10130.3. Samples: 33623680. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 06:34:45,122][635489] Avg episode reward: [(0, '4306.686')] [2023-03-09 06:34:45,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000065696_33636352.pth... [2023-03-09 06:34:45,129][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000065096_33329152.pth [2023-03-09 06:34:48,426][635776] Updated weights for policy 0, policy_version 65760 (0.0004) [2023-03-09 06:34:50,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10038.7). Total num frames: 33681408. Throughput: 0: 10093.1. Samples: 33683052. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:34:50,122][635489] Avg episode reward: [(0, '4455.163')] [2023-03-09 06:34:52,465][635776] Updated weights for policy 0, policy_version 65840 (0.0005) [2023-03-09 06:34:55,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10038.7). Total num frames: 33734656. Throughput: 0: 10165.5. Samples: 33714240. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:34:55,122][635489] Avg episode reward: [(0, '4442.398')] [2023-03-09 06:34:56,538][635776] Updated weights for policy 0, policy_version 65920 (0.0005) [2023-03-09 06:35:00,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 10038.7). Total num frames: 33783808. Throughput: 0: 10176.3. Samples: 33775228. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:35:00,122][635489] Avg episode reward: [(0, '4523.366')] [2023-03-09 06:35:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000065984_33783808.pth... [2023-03-09 06:35:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000065392_33480704.pth [2023-03-09 06:35:00,652][635776] Updated weights for policy 0, policy_version 66000 (0.0005) [2023-03-09 06:35:04,758][635776] Updated weights for policy 0, policy_version 66080 (0.0005) [2023-03-09 06:35:05,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10052.6). Total num frames: 33837056. Throughput: 0: 10161.0. Samples: 33834280. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:35:05,122][635489] Avg episode reward: [(0, '4278.018')] [2023-03-09 06:35:08,921][635776] Updated weights for policy 0, policy_version 66160 (0.0005) [2023-03-09 06:35:10,122][635489] Fps is (10 sec: 9830.5, 60 sec: 10103.5, 300 sec: 10038.7). Total num frames: 33882112. Throughput: 0: 10135.4. Samples: 33865020. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:35:10,122][635489] Avg episode reward: [(0, '4441.091')] [2023-03-09 06:35:13,198][635776] Updated weights for policy 0, policy_version 66240 (0.0005) [2023-03-09 06:35:15,122][635489] Fps is (10 sec: 9420.7, 60 sec: 10035.2, 300 sec: 10038.7). Total num frames: 33931264. Throughput: 0: 10012.6. Samples: 33922268. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:35:15,122][635489] Avg episode reward: [(0, '4163.132')] [2023-03-09 06:35:15,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000066272_33931264.pth... [2023-03-09 06:35:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000065696_33636352.pth [2023-03-09 06:35:17,497][635776] Updated weights for policy 0, policy_version 66320 (0.0004) [2023-03-09 06:35:20,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10038.7). Total num frames: 33980416. Throughput: 0: 10005.6. Samples: 33980348. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:35:20,122][635489] Avg episode reward: [(0, '4423.312')] [2023-03-09 06:35:21,570][635776] Updated weights for policy 0, policy_version 66400 (0.0005) [2023-03-09 06:35:25,121][635489] Fps is (10 sec: 10240.2, 60 sec: 10035.2, 300 sec: 10038.7). Total num frames: 34033664. Throughput: 0: 10101.2. Samples: 34013144. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:35:25,122][635489] Avg episode reward: [(0, '4430.340')] [2023-03-09 06:35:25,291][635776] Updated weights for policy 0, policy_version 66480 (0.0004) [2023-03-09 06:35:29,320][635776] Updated weights for policy 0, policy_version 66560 (0.0004) [2023-03-09 06:35:30,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10035.2, 300 sec: 10038.7). Total num frames: 34082816. Throughput: 0: 10021.3. Samples: 34074636. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:35:30,122][635489] Avg episode reward: [(0, '4360.058')] [2023-03-09 06:35:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000066568_34082816.pth... [2023-03-09 06:35:30,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000065984_33783808.pth [2023-03-09 06:35:33,362][635776] Updated weights for policy 0, policy_version 66640 (0.0005) [2023-03-09 06:35:35,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 10038.7). Total num frames: 34136064. Throughput: 0: 10063.1. Samples: 34135892. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:35:35,122][635489] Avg episode reward: [(0, '4404.677')] [2023-03-09 06:35:37,551][635776] Updated weights for policy 0, policy_version 66720 (0.0005) [2023-03-09 06:35:40,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10038.7). Total num frames: 34185216. Throughput: 0: 10005.9. Samples: 34164504. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:35:40,122][635489] Avg episode reward: [(0, '4144.460')] [2023-03-09 06:35:41,868][635776] Updated weights for policy 0, policy_version 66800 (0.0005) [2023-03-09 06:35:45,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 10024.8). Total num frames: 34234368. Throughput: 0: 9972.4. Samples: 34223988. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:35:45,122][635489] Avg episode reward: [(0, '3936.941')] [2023-03-09 06:35:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000066864_34234368.pth... [2023-03-09 06:35:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000066272_33931264.pth [2023-03-09 06:35:45,882][635776] Updated weights for policy 0, policy_version 66880 (0.0004) [2023-03-09 06:35:49,968][635776] Updated weights for policy 0, policy_version 66960 (0.0006) [2023-03-09 06:35:50,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10024.8). Total num frames: 34283520. Throughput: 0: 9976.7. Samples: 34283232. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:35:50,122][635489] Avg episode reward: [(0, '4041.251')] [2023-03-09 06:35:53,845][635776] Updated weights for policy 0, policy_version 67040 (0.0004) [2023-03-09 06:35:55,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10038.7). Total num frames: 34336768. Throughput: 0: 9985.2. Samples: 34314356. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:35:55,122][635489] Avg episode reward: [(0, '3526.065')] [2023-03-09 06:35:57,886][635776] Updated weights for policy 0, policy_version 67120 (0.0005) [2023-03-09 06:36:00,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10035.2, 300 sec: 10024.8). Total num frames: 34385920. Throughput: 0: 10061.2. Samples: 34375024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:36:00,122][635489] Avg episode reward: [(0, '3963.124')] [2023-03-09 06:36:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000067160_34385920.pth... [2023-03-09 06:36:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000066568_34082816.pth [2023-03-09 06:36:02,298][635776] Updated weights for policy 0, policy_version 67200 (0.0005) [2023-03-09 06:36:05,121][635489] Fps is (10 sec: 9420.9, 60 sec: 9898.7, 300 sec: 10010.9). Total num frames: 34430976. Throughput: 0: 10030.2. Samples: 34431708. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:36:05,122][635489] Avg episode reward: [(0, '4017.245')] [2023-03-09 06:36:06,463][635776] Updated weights for policy 0, policy_version 67280 (0.0005) [2023-03-09 06:36:10,122][635489] Fps is (10 sec: 9830.5, 60 sec: 10035.2, 300 sec: 10010.9). Total num frames: 34484224. Throughput: 0: 9978.4. Samples: 34462172. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:36:10,122][635489] Avg episode reward: [(0, '3713.133')] [2023-03-09 06:36:10,333][635776] Updated weights for policy 0, policy_version 67360 (0.0005) [2023-03-09 06:36:14,574][635776] Updated weights for policy 0, policy_version 67440 (0.0005) [2023-03-09 06:36:15,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9983.1). Total num frames: 34533376. Throughput: 0: 9942.4. Samples: 34522044. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:36:15,122][635489] Avg episode reward: [(0, '4201.611')] [2023-03-09 06:36:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000067448_34533376.pth... [2023-03-09 06:36:15,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000066864_34234368.pth [2023-03-09 06:36:18,647][635776] Updated weights for policy 0, policy_version 67520 (0.0005) [2023-03-09 06:36:20,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 9969.2). Total num frames: 34582528. Throughput: 0: 9923.1. Samples: 34582432. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:36:20,122][635489] Avg episode reward: [(0, '4309.555')] [2023-03-09 06:36:22,969][635776] Updated weights for policy 0, policy_version 67600 (0.0005) [2023-03-09 06:36:25,122][635489] Fps is (10 sec: 9830.3, 60 sec: 9966.9, 300 sec: 9969.2). Total num frames: 34631680. Throughput: 0: 9923.1. Samples: 34611044. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:36:25,122][635489] Avg episode reward: [(0, '4407.284')] [2023-03-09 06:36:26,991][635776] Updated weights for policy 0, policy_version 67680 (0.0005) [2023-03-09 06:36:30,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9955.4). Total num frames: 34680832. Throughput: 0: 9902.0. Samples: 34669580. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:36:30,122][635489] Avg episode reward: [(0, '4405.388')] [2023-03-09 06:36:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000067736_34680832.pth... [2023-03-09 06:36:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000067160_34385920.pth [2023-03-09 06:36:31,110][635776] Updated weights for policy 0, policy_version 67760 (0.0006) [2023-03-09 06:36:35,102][635776] Updated weights for policy 0, policy_version 67840 (0.0004) [2023-03-09 06:36:35,122][635489] Fps is (10 sec: 10240.1, 60 sec: 9966.9, 300 sec: 9983.1). Total num frames: 34734080. Throughput: 0: 9948.6. Samples: 34730920. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:36:35,122][635489] Avg episode reward: [(0, '4322.002')] [2023-03-09 06:36:39,302][635776] Updated weights for policy 0, policy_version 67920 (0.0004) [2023-03-09 06:36:40,121][635489] Fps is (10 sec: 9830.5, 60 sec: 9898.7, 300 sec: 9955.4). Total num frames: 34779136. Throughput: 0: 9915.2. Samples: 34760540. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:36:40,122][635489] Avg episode reward: [(0, '4414.745')] [2023-03-09 06:36:43,316][635776] Updated weights for policy 0, policy_version 68000 (0.0004) [2023-03-09 06:36:45,122][635489] Fps is (10 sec: 9830.3, 60 sec: 9966.9, 300 sec: 9969.2). Total num frames: 34832384. Throughput: 0: 9922.8. Samples: 34821548. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:36:45,122][635489] Avg episode reward: [(0, '4102.874')] [2023-03-09 06:36:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000068032_34832384.pth... [2023-03-09 06:36:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000067448_34533376.pth [2023-03-09 06:36:47,518][635776] Updated weights for policy 0, policy_version 68080 (0.0005) [2023-03-09 06:36:50,122][635489] Fps is (10 sec: 10239.9, 60 sec: 9966.9, 300 sec: 9969.2). Total num frames: 34881536. Throughput: 0: 9993.9. Samples: 34881436. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:36:50,122][635489] Avg episode reward: [(0, '4086.902')] [2023-03-09 06:36:51,603][635776] Updated weights for policy 0, policy_version 68160 (0.0005) [2023-03-09 06:36:55,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9955.4). Total num frames: 34930688. Throughput: 0: 10030.8. Samples: 34913556. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 06:36:55,122][635489] Avg episode reward: [(0, '3992.556')] [2023-03-09 06:36:55,621][635776] Updated weights for policy 0, policy_version 68240 (0.0005) [2023-03-09 06:37:00,054][635776] Updated weights for policy 0, policy_version 68320 (0.0005) [2023-03-09 06:37:00,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9969.2). Total num frames: 34979840. Throughput: 0: 9945.5. Samples: 34969592. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 06:37:00,122][635489] Avg episode reward: [(0, '4120.980')] [2023-03-09 06:37:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000068320_34979840.pth... [2023-03-09 06:37:00,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000067736_34680832.pth [2023-03-09 06:37:03,992][635776] Updated weights for policy 0, policy_version 68400 (0.0005) [2023-03-09 06:37:05,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9955.4). Total num frames: 35028992. Throughput: 0: 9925.0. Samples: 35029056. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 06:37:05,122][635489] Avg episode reward: [(0, '4186.134')] [2023-03-09 06:37:08,476][635776] Updated weights for policy 0, policy_version 68480 (0.0005) [2023-03-09 06:37:10,121][635489] Fps is (10 sec: 9420.9, 60 sec: 9830.4, 300 sec: 9941.5). Total num frames: 35074048. Throughput: 0: 9915.1. Samples: 35057224. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 06:37:10,122][635489] Avg episode reward: [(0, '4380.479')] [2023-03-09 06:37:12,757][635776] Updated weights for policy 0, policy_version 68560 (0.0005) [2023-03-09 06:37:15,122][635489] Fps is (10 sec: 9420.8, 60 sec: 9830.4, 300 sec: 9927.6). Total num frames: 35123200. Throughput: 0: 9887.2. Samples: 35114504. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 06:37:15,122][635489] Avg episode reward: [(0, '4458.784')] [2023-03-09 06:37:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000068600_35123200.pth... [2023-03-09 06:37:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000068032_34832384.pth [2023-03-09 06:37:16,778][635776] Updated weights for policy 0, policy_version 68640 (0.0005) [2023-03-09 06:37:20,121][635489] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9927.6). Total num frames: 35172352. Throughput: 0: 9849.8. Samples: 35174160. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 06:37:20,122][635489] Avg episode reward: [(0, '4452.065')] [2023-03-09 06:37:20,973][635776] Updated weights for policy 0, policy_version 68720 (0.0005) [2023-03-09 06:37:25,094][635776] Updated weights for policy 0, policy_version 68800 (0.0005) [2023-03-09 06:37:25,122][635489] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9941.5). Total num frames: 35225600. Throughput: 0: 9863.3. Samples: 35204388. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 06:37:25,122][635489] Avg episode reward: [(0, '4414.146')] [2023-03-09 06:37:29,190][635776] Updated weights for policy 0, policy_version 68880 (0.0004) [2023-03-09 06:37:30,122][635489] Fps is (10 sec: 10239.9, 60 sec: 9898.7, 300 sec: 9941.5). Total num frames: 35274752. Throughput: 0: 9824.9. Samples: 35263668. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 06:37:30,122][635489] Avg episode reward: [(0, '4381.431')] [2023-03-09 06:37:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000068896_35274752.pth... [2023-03-09 06:37:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000068320_34979840.pth [2023-03-09 06:37:33,323][635776] Updated weights for policy 0, policy_version 68960 (0.0004) [2023-03-09 06:37:35,121][635489] Fps is (10 sec: 9830.5, 60 sec: 9830.4, 300 sec: 9941.5). Total num frames: 35323904. Throughput: 0: 9834.6. Samples: 35323992. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 06:37:35,122][635489] Avg episode reward: [(0, '3582.870')] [2023-03-09 06:37:37,362][635776] Updated weights for policy 0, policy_version 69040 (0.0004) [2023-03-09 06:37:40,122][635489] Fps is (10 sec: 9830.5, 60 sec: 9898.7, 300 sec: 9955.4). Total num frames: 35373056. Throughput: 0: 9777.0. Samples: 35353520. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 06:37:40,122][635489] Avg episode reward: [(0, '3468.669')] [2023-03-09 06:37:41,363][635776] Updated weights for policy 0, policy_version 69120 (0.0005) [2023-03-09 06:37:45,122][635489] Fps is (10 sec: 10239.8, 60 sec: 9898.7, 300 sec: 9969.2). Total num frames: 35426304. Throughput: 0: 9880.7. Samples: 35414224. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 06:37:45,122][635489] Avg episode reward: [(0, '3869.463')] [2023-03-09 06:37:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000069192_35426304.pth... [2023-03-09 06:37:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000068600_35123200.pth [2023-03-09 06:37:45,444][635776] Updated weights for policy 0, policy_version 69200 (0.0005) [2023-03-09 06:37:49,602][635776] Updated weights for policy 0, policy_version 69280 (0.0005) [2023-03-09 06:37:50,122][635489] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9969.2). Total num frames: 35475456. Throughput: 0: 9913.1. Samples: 35475144. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 06:37:50,122][635489] Avg episode reward: [(0, '3855.078')] [2023-03-09 06:37:53,877][635776] Updated weights for policy 0, policy_version 69360 (0.0006) [2023-03-09 06:37:55,122][635489] Fps is (10 sec: 9420.9, 60 sec: 9830.4, 300 sec: 9955.4). Total num frames: 35520512. Throughput: 0: 9922.9. Samples: 35503756. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 06:37:55,122][635489] Avg episode reward: [(0, '4053.493')] [2023-03-09 06:37:58,005][635776] Updated weights for policy 0, policy_version 69440 (0.0005) [2023-03-09 06:38:00,122][635489] Fps is (10 sec: 9420.7, 60 sec: 9830.4, 300 sec: 9955.4). Total num frames: 35569664. Throughput: 0: 9934.2. Samples: 35561544. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 06:38:00,122][635489] Avg episode reward: [(0, '4434.744')] [2023-03-09 06:38:00,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000069472_35569664.pth... [2023-03-09 06:38:00,131][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000068896_35274752.pth [2023-03-09 06:38:02,356][635776] Updated weights for policy 0, policy_version 69520 (0.0004) [2023-03-09 06:38:05,121][635489] Fps is (10 sec: 9830.5, 60 sec: 9830.4, 300 sec: 9955.4). Total num frames: 35618816. Throughput: 0: 9913.8. Samples: 35620280. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 06:38:05,122][635489] Avg episode reward: [(0, '4559.437')] [2023-03-09 06:38:06,328][635776] Updated weights for policy 0, policy_version 69600 (0.0005) [2023-03-09 06:38:10,122][635489] Fps is (10 sec: 10240.1, 60 sec: 9966.9, 300 sec: 9969.2). Total num frames: 35672064. Throughput: 0: 9953.6. Samples: 35652300. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 06:38:10,122][635489] Avg episode reward: [(0, '4580.236')] [2023-03-09 06:38:10,302][635776] Updated weights for policy 0, policy_version 69680 (0.0005) [2023-03-09 06:38:14,719][635776] Updated weights for policy 0, policy_version 69760 (0.0005) [2023-03-09 06:38:15,122][635489] Fps is (10 sec: 9830.3, 60 sec: 9898.7, 300 sec: 9969.2). Total num frames: 35717120. Throughput: 0: 9897.6. Samples: 35709060. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 06:38:15,122][635489] Avg episode reward: [(0, '4446.814')] [2023-03-09 06:38:15,129][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000069768_35721216.pth... [2023-03-09 06:38:15,130][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000069192_35426304.pth [2023-03-09 06:38:19,102][635776] Updated weights for policy 0, policy_version 69840 (0.0005) [2023-03-09 06:38:20,122][635489] Fps is (10 sec: 9420.8, 60 sec: 9898.7, 300 sec: 9955.4). Total num frames: 35766272. Throughput: 0: 9827.8. Samples: 35766244. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:38:20,122][635489] Avg episode reward: [(0, '4466.515')] [2023-03-09 06:38:23,277][635776] Updated weights for policy 0, policy_version 69920 (0.0005) [2023-03-09 06:38:25,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9955.4). Total num frames: 35815424. Throughput: 0: 9817.2. Samples: 35795296. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:38:25,122][635489] Avg episode reward: [(0, '4227.011')] [2023-03-09 06:38:27,501][635776] Updated weights for policy 0, policy_version 70000 (0.0005) [2023-03-09 06:38:30,122][635489] Fps is (10 sec: 9830.3, 60 sec: 9830.4, 300 sec: 9941.5). Total num frames: 35864576. Throughput: 0: 9762.6. Samples: 35853540. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:38:30,122][635489] Avg episode reward: [(0, '4259.998')] [2023-03-09 06:38:30,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000070048_35864576.pth... [2023-03-09 06:38:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000069472_35569664.pth [2023-03-09 06:38:31,767][635776] Updated weights for policy 0, policy_version 70080 (0.0005) [2023-03-09 06:38:35,121][635489] Fps is (10 sec: 9420.9, 60 sec: 9762.1, 300 sec: 9927.6). Total num frames: 35909632. Throughput: 0: 9655.6. Samples: 35909644. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:38:35,122][635489] Avg episode reward: [(0, '4453.485')] [2023-03-09 06:38:36,165][635776] Updated weights for policy 0, policy_version 70160 (0.0004) [2023-03-09 06:38:40,122][635489] Fps is (10 sec: 9420.9, 60 sec: 9762.1, 300 sec: 9927.6). Total num frames: 35958784. Throughput: 0: 9672.0. Samples: 35938996. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:38:40,122][635489] Avg episode reward: [(0, '4524.750')] [2023-03-09 06:38:40,301][635776] Updated weights for policy 0, policy_version 70240 (0.0005) [2023-03-09 06:38:44,454][635776] Updated weights for policy 0, policy_version 70320 (0.0004) [2023-03-09 06:38:45,122][635489] Fps is (10 sec: 9830.3, 60 sec: 9693.9, 300 sec: 9927.6). Total num frames: 36007936. Throughput: 0: 9729.9. Samples: 35999388. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:38:45,122][635489] Avg episode reward: [(0, '4520.083')] [2023-03-09 06:38:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000070328_36007936.pth... [2023-03-09 06:38:45,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000069768_35721216.pth [2023-03-09 06:38:48,801][635776] Updated weights for policy 0, policy_version 70400 (0.0005) [2023-03-09 06:38:50,122][635489] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9913.7). Total num frames: 36052992. Throughput: 0: 9660.3. Samples: 36054992. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:38:50,122][635489] Avg episode reward: [(0, '4387.730')] [2023-03-09 06:38:53,086][635776] Updated weights for policy 0, policy_version 70480 (0.0005) [2023-03-09 06:38:55,122][635489] Fps is (10 sec: 9830.3, 60 sec: 9762.1, 300 sec: 9927.6). Total num frames: 36106240. Throughput: 0: 9611.7. Samples: 36084828. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:38:55,122][635489] Avg episode reward: [(0, '4334.263')] [2023-03-09 06:38:57,230][635776] Updated weights for policy 0, policy_version 70560 (0.0005) [2023-03-09 06:39:00,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9913.7). Total num frames: 36151296. Throughput: 0: 9632.7. Samples: 36142532. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 06:39:00,122][635489] Avg episode reward: [(0, '4249.820')] [2023-03-09 06:39:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000070608_36151296.pth... [2023-03-09 06:39:00,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000070048_35864576.pth [2023-03-09 06:39:01,515][635776] Updated weights for policy 0, policy_version 70640 (0.0005) [2023-03-09 06:39:05,121][635489] Fps is (10 sec: 9421.0, 60 sec: 9693.9, 300 sec: 9913.7). Total num frames: 36200448. Throughput: 0: 9650.4. Samples: 36200512. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 06:39:05,122][635489] Avg episode reward: [(0, '4120.399')] [2023-03-09 06:39:05,706][635776] Updated weights for policy 0, policy_version 70720 (0.0005) [2023-03-09 06:39:09,919][635776] Updated weights for policy 0, policy_version 70800 (0.0005) [2023-03-09 06:39:10,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9899.8). Total num frames: 36249600. Throughput: 0: 9660.9. Samples: 36230036. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 06:39:10,122][635489] Avg episode reward: [(0, '4378.644')] [2023-03-09 06:39:14,374][635776] Updated weights for policy 0, policy_version 70880 (0.0004) [2023-03-09 06:39:15,121][635489] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9885.9). Total num frames: 36294656. Throughput: 0: 9620.2. Samples: 36286448. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 06:39:15,122][635489] Avg episode reward: [(0, '4333.539')] [2023-03-09 06:39:15,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000070888_36294656.pth... [2023-03-09 06:39:15,125][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000070328_36007936.pth [2023-03-09 06:39:18,861][635776] Updated weights for policy 0, policy_version 70960 (0.0005) [2023-03-09 06:39:20,121][635489] Fps is (10 sec: 9420.9, 60 sec: 9625.6, 300 sec: 9872.1). Total num frames: 36343808. Throughput: 0: 9612.2. Samples: 36342192. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 06:39:20,122][635489] Avg episode reward: [(0, '4519.419')] [2023-03-09 06:39:23,035][635776] Updated weights for policy 0, policy_version 71040 (0.0005) [2023-03-09 06:39:25,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9872.1). Total num frames: 36392960. Throughput: 0: 9630.0. Samples: 36372348. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 06:39:25,122][635489] Avg episode reward: [(0, '4580.297')] [2023-03-09 06:39:26,984][635776] Updated weights for policy 0, policy_version 71120 (0.0005) [2023-03-09 06:39:30,122][635489] Fps is (10 sec: 9830.3, 60 sec: 9625.6, 300 sec: 9872.1). Total num frames: 36442112. Throughput: 0: 9656.0. Samples: 36433908. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 06:39:30,122][635489] Avg episode reward: [(0, '4528.166')] [2023-03-09 06:39:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000071176_36442112.pth... [2023-03-09 06:39:30,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000070608_36151296.pth [2023-03-09 06:39:31,164][635776] Updated weights for policy 0, policy_version 71200 (0.0005) [2023-03-09 06:39:35,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9858.2). Total num frames: 36491264. Throughput: 0: 9697.3. Samples: 36491372. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 06:39:35,122][635489] Avg episode reward: [(0, '4499.321')] [2023-03-09 06:39:35,271][635776] Updated weights for policy 0, policy_version 71280 (0.0005) [2023-03-09 06:39:39,347][635776] Updated weights for policy 0, policy_version 71360 (0.0005) [2023-03-09 06:39:40,122][635489] Fps is (10 sec: 9830.5, 60 sec: 9693.9, 300 sec: 9844.3). Total num frames: 36540416. Throughput: 0: 9710.2. Samples: 36521784. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 06:39:40,122][635489] Avg episode reward: [(0, '4265.560')] [2023-03-09 06:39:43,337][635776] Updated weights for policy 0, policy_version 71440 (0.0005) [2023-03-09 06:39:45,122][635489] Fps is (10 sec: 10239.9, 60 sec: 9762.1, 300 sec: 9872.1). Total num frames: 36593664. Throughput: 0: 9782.7. Samples: 36582752. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 06:39:45,122][635489] Avg episode reward: [(0, '4393.128')] [2023-03-09 06:39:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000071472_36593664.pth... [2023-03-09 06:39:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000070888_36294656.pth [2023-03-09 06:39:47,302][635776] Updated weights for policy 0, policy_version 71520 (0.0005) [2023-03-09 06:39:50,122][635489] Fps is (10 sec: 10649.6, 60 sec: 9898.7, 300 sec: 9872.1). Total num frames: 36646912. Throughput: 0: 9882.5. Samples: 36645224. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 06:39:50,122][635489] Avg episode reward: [(0, '4612.976')] [2023-03-09 06:39:50,123][635732] Saving new best policy, reward=4612.976! [2023-03-09 06:39:51,380][635776] Updated weights for policy 0, policy_version 71600 (0.0005) [2023-03-09 06:39:55,122][635489] Fps is (10 sec: 9830.5, 60 sec: 9762.2, 300 sec: 9858.2). Total num frames: 36691968. Throughput: 0: 9880.1. Samples: 36674640. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 06:39:55,122][635489] Avg episode reward: [(0, '4463.390')] [2023-03-09 06:39:55,539][635776] Updated weights for policy 0, policy_version 71680 (0.0004) [2023-03-09 06:39:59,609][635776] Updated weights for policy 0, policy_version 71760 (0.0004) [2023-03-09 06:40:00,122][635489] Fps is (10 sec: 9830.3, 60 sec: 9898.7, 300 sec: 9858.2). Total num frames: 36745216. Throughput: 0: 9950.3. Samples: 36734212. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 06:40:00,122][635489] Avg episode reward: [(0, '4380.659')] [2023-03-09 06:40:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000071768_36745216.pth... [2023-03-09 06:40:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000071176_36442112.pth [2023-03-09 06:40:03,635][635776] Updated weights for policy 0, policy_version 71840 (0.0005) [2023-03-09 06:40:05,121][635489] Fps is (10 sec: 10240.1, 60 sec: 9898.7, 300 sec: 9872.1). Total num frames: 36794368. Throughput: 0: 10048.5. Samples: 36794376. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 06:40:05,122][635489] Avg episode reward: [(0, '4324.455')] [2023-03-09 06:40:07,652][635776] Updated weights for policy 0, policy_version 71920 (0.0005) [2023-03-09 06:40:10,122][635489] Fps is (10 sec: 10240.1, 60 sec: 9966.9, 300 sec: 9885.9). Total num frames: 36847616. Throughput: 0: 10079.7. Samples: 36825936. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 06:40:10,122][635489] Avg episode reward: [(0, '4474.049')] [2023-03-09 06:40:11,457][635776] Updated weights for policy 0, policy_version 72000 (0.0005) [2023-03-09 06:40:15,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9885.9). Total num frames: 36896768. Throughput: 0: 10103.5. Samples: 36888564. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 06:40:15,122][635489] Avg episode reward: [(0, '4590.909')] [2023-03-09 06:40:15,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000072064_36896768.pth... [2023-03-09 06:40:15,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000071472_36593664.pth [2023-03-09 06:40:15,693][635776] Updated weights for policy 0, policy_version 72080 (0.0005) [2023-03-09 06:40:19,996][635776] Updated weights for policy 0, policy_version 72160 (0.0004) [2023-03-09 06:40:20,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 9872.1). Total num frames: 36945920. Throughput: 0: 10098.9. Samples: 36945820. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 06:40:20,122][635489] Avg episode reward: [(0, '4563.346')] [2023-03-09 06:40:24,090][635776] Updated weights for policy 0, policy_version 72240 (0.0005) [2023-03-09 06:40:25,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 9872.1). Total num frames: 36995072. Throughput: 0: 10064.8. Samples: 36974700. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 06:40:25,122][635489] Avg episode reward: [(0, '4611.705')] [2023-03-09 06:40:28,081][635776] Updated weights for policy 0, policy_version 72320 (0.0005) [2023-03-09 06:40:30,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 9858.2). Total num frames: 37044224. Throughput: 0: 10074.3. Samples: 37036096. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:40:30,132][635489] Avg episode reward: [(0, '4589.711')] [2023-03-09 06:40:30,160][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000072360_37048320.pth... [2023-03-09 06:40:30,162][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000071768_36745216.pth [2023-03-09 06:40:32,290][635776] Updated weights for policy 0, policy_version 72400 (0.0005) [2023-03-09 06:40:35,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10103.5, 300 sec: 9872.1). Total num frames: 37097472. Throughput: 0: 10022.0. Samples: 37096212. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:40:35,122][635489] Avg episode reward: [(0, '4585.170')] [2023-03-09 06:40:36,301][635776] Updated weights for policy 0, policy_version 72480 (0.0005) [2023-03-09 06:40:40,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 9872.1). Total num frames: 37146624. Throughput: 0: 10019.2. Samples: 37125504. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:40:40,122][635489] Avg episode reward: [(0, '4530.068')] [2023-03-09 06:40:40,532][635776] Updated weights for policy 0, policy_version 72560 (0.0004) [2023-03-09 06:40:44,794][635776] Updated weights for policy 0, policy_version 72640 (0.0005) [2023-03-09 06:40:45,121][635489] Fps is (10 sec: 9420.8, 60 sec: 9967.0, 300 sec: 9858.2). Total num frames: 37191680. Throughput: 0: 9984.2. Samples: 37183500. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:40:45,122][635489] Avg episode reward: [(0, '4506.504')] [2023-03-09 06:40:45,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000072640_37191680.pth... [2023-03-09 06:40:45,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000072064_36896768.pth [2023-03-09 06:40:48,974][635776] Updated weights for policy 0, policy_version 72720 (0.0005) [2023-03-09 06:40:50,121][635489] Fps is (10 sec: 9420.9, 60 sec: 9898.7, 300 sec: 9844.3). Total num frames: 37240832. Throughput: 0: 9949.8. Samples: 37242116. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:40:50,122][635489] Avg episode reward: [(0, '4559.802')] [2023-03-09 06:40:53,159][635776] Updated weights for policy 0, policy_version 72800 (0.0004) [2023-03-09 06:40:55,121][635489] Fps is (10 sec: 9830.4, 60 sec: 9967.0, 300 sec: 9844.3). Total num frames: 37289984. Throughput: 0: 9870.5. Samples: 37270108. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:40:55,122][635489] Avg episode reward: [(0, '4552.387')] [2023-03-09 06:40:57,469][635776] Updated weights for policy 0, policy_version 72880 (0.0005) [2023-03-09 06:41:00,122][635489] Fps is (10 sec: 9830.3, 60 sec: 9898.7, 300 sec: 9858.2). Total num frames: 37339136. Throughput: 0: 9762.9. Samples: 37327896. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:41:00,122][635489] Avg episode reward: [(0, '4587.781')] [2023-03-09 06:41:00,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000072928_37339136.pth... [2023-03-09 06:41:00,130][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000072360_37048320.pth [2023-03-09 06:41:01,872][635776] Updated weights for policy 0, policy_version 72960 (0.0005) [2023-03-09 06:41:05,121][635489] Fps is (10 sec: 9420.8, 60 sec: 9830.4, 300 sec: 9830.4). Total num frames: 37384192. Throughput: 0: 9753.0. Samples: 37384704. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:41:05,122][635489] Avg episode reward: [(0, '4599.890')] [2023-03-09 06:41:06,021][635776] Updated weights for policy 0, policy_version 73040 (0.0005) [2023-03-09 06:41:10,122][635489] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9830.4). Total num frames: 37433344. Throughput: 0: 9764.4. Samples: 37414096. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:41:10,122][635489] Avg episode reward: [(0, '4477.093')] [2023-03-09 06:41:10,379][635776] Updated weights for policy 0, policy_version 73120 (0.0005) [2023-03-09 06:41:14,533][635776] Updated weights for policy 0, policy_version 73200 (0.0005) [2023-03-09 06:41:15,122][635489] Fps is (10 sec: 9830.2, 60 sec: 9762.1, 300 sec: 9830.4). Total num frames: 37482496. Throughput: 0: 9707.3. Samples: 37472928. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 06:41:15,122][635489] Avg episode reward: [(0, '4492.446')] [2023-03-09 06:41:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000073208_37482496.pth... [2023-03-09 06:41:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000072640_37191680.pth [2023-03-09 06:41:18,853][635776] Updated weights for policy 0, policy_version 73280 (0.0005) [2023-03-09 06:41:20,121][635489] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9816.5). Total num frames: 37527552. Throughput: 0: 9607.7. Samples: 37528560. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 06:41:20,122][635489] Avg episode reward: [(0, '4440.482')] [2023-03-09 06:41:23,077][635776] Updated weights for policy 0, policy_version 73360 (0.0004) [2023-03-09 06:41:25,121][635489] Fps is (10 sec: 9420.9, 60 sec: 9693.9, 300 sec: 9816.5). Total num frames: 37576704. Throughput: 0: 9604.6. Samples: 37557708. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 06:41:25,122][635489] Avg episode reward: [(0, '4314.636')] [2023-03-09 06:41:27,186][635776] Updated weights for policy 0, policy_version 73440 (0.0005) [2023-03-09 06:41:30,122][635489] Fps is (10 sec: 10239.9, 60 sec: 9762.1, 300 sec: 9816.5). Total num frames: 37629952. Throughput: 0: 9684.2. Samples: 37619288. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 06:41:30,122][635489] Avg episode reward: [(0, '4291.858')] [2023-03-09 06:41:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000073496_37629952.pth... [2023-03-09 06:41:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000072928_37339136.pth [2023-03-09 06:41:31,091][635776] Updated weights for policy 0, policy_version 73520 (0.0005) [2023-03-09 06:41:35,122][635489] Fps is (10 sec: 10239.9, 60 sec: 9693.9, 300 sec: 9830.4). Total num frames: 37679104. Throughput: 0: 9704.9. Samples: 37678836. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 06:41:35,122][635489] Avg episode reward: [(0, '4481.961')] [2023-03-09 06:41:35,409][635776] Updated weights for policy 0, policy_version 73600 (0.0005) [2023-03-09 06:41:39,447][635776] Updated weights for policy 0, policy_version 73680 (0.0004) [2023-03-09 06:41:40,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9816.5). Total num frames: 37728256. Throughput: 0: 9782.6. Samples: 37710324. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 06:41:40,122][635489] Avg episode reward: [(0, '4463.838')] [2023-03-09 06:41:43,267][635776] Updated weights for policy 0, policy_version 73760 (0.0004) [2023-03-09 06:41:45,122][635489] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 9830.4). Total num frames: 37781504. Throughput: 0: 9879.4. Samples: 37772468. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 06:41:45,122][635489] Avg episode reward: [(0, '4447.378')] [2023-03-09 06:41:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000073792_37781504.pth... [2023-03-09 06:41:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000073208_37482496.pth [2023-03-09 06:41:47,184][635776] Updated weights for policy 0, policy_version 73840 (0.0005) [2023-03-09 06:41:50,122][635489] Fps is (10 sec: 10649.6, 60 sec: 9898.7, 300 sec: 9844.3). Total num frames: 37834752. Throughput: 0: 10001.2. Samples: 37834760. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 06:41:50,122][635489] Avg episode reward: [(0, '4570.220')] [2023-03-09 06:41:51,276][635776] Updated weights for policy 0, policy_version 73920 (0.0005) [2023-03-09 06:41:55,121][635489] Fps is (10 sec: 9830.5, 60 sec: 9830.4, 300 sec: 9830.4). Total num frames: 37879808. Throughput: 0: 9966.9. Samples: 37862608. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 06:41:55,122][635489] Avg episode reward: [(0, '4588.079')] [2023-03-09 06:41:55,585][635776] Updated weights for policy 0, policy_version 74000 (0.0005) [2023-03-09 06:41:59,652][635776] Updated weights for policy 0, policy_version 74080 (0.0005) [2023-03-09 06:42:00,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9844.3). Total num frames: 37933056. Throughput: 0: 9970.8. Samples: 37921612. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:42:00,122][635489] Avg episode reward: [(0, '4580.674')] [2023-03-09 06:42:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000074088_37933056.pth... [2023-03-09 06:42:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000073496_37629952.pth [2023-03-09 06:42:03,724][635776] Updated weights for policy 0, policy_version 74160 (0.0004) [2023-03-09 06:42:05,122][635489] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 9858.2). Total num frames: 37982208. Throughput: 0: 10094.4. Samples: 37982808. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:42:05,122][635489] Avg episode reward: [(0, '4427.494')] [2023-03-09 06:42:07,522][635776] Updated weights for policy 0, policy_version 74240 (0.0004) [2023-03-09 06:42:10,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9872.1). Total num frames: 38035456. Throughput: 0: 10157.7. Samples: 38014804. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:42:10,122][635489] Avg episode reward: [(0, '4063.206')] [2023-03-09 06:42:11,605][635776] Updated weights for policy 0, policy_version 74320 (0.0004) [2023-03-09 06:42:15,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10103.5, 300 sec: 9885.9). Total num frames: 38088704. Throughput: 0: 10148.9. Samples: 38075988. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:42:15,122][635489] Avg episode reward: [(0, '3929.804')] [2023-03-09 06:42:15,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000074392_38088704.pth... [2023-03-09 06:42:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000073792_37781504.pth [2023-03-09 06:42:15,460][635776] Updated weights for policy 0, policy_version 74400 (0.0005) [2023-03-09 06:42:19,679][635776] Updated weights for policy 0, policy_version 74480 (0.0005) [2023-03-09 06:42:20,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 9858.2). Total num frames: 38133760. Throughput: 0: 10158.8. Samples: 38135980. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:42:20,122][635489] Avg episode reward: [(0, '3669.351')] [2023-03-09 06:42:23,811][635732] Early stopping after 2 epochs (8 sgd steps), loss delta 0.0000007 [2023-03-09 06:42:23,813][635776] Updated weights for policy 0, policy_version 74560 (0.0004) [2023-03-09 06:42:25,121][635489] Fps is (10 sec: 9420.9, 60 sec: 10103.5, 300 sec: 9858.2). Total num frames: 38182912. Throughput: 0: 10137.9. Samples: 38166528. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:42:25,122][635489] Avg episode reward: [(0, '4057.871')] [2023-03-09 06:42:28,426][635776] Updated weights for policy 0, policy_version 74640 (0.0005) [2023-03-09 06:42:30,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 9858.2). Total num frames: 38232064. Throughput: 0: 9941.6. Samples: 38219840. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:42:30,122][635489] Avg episode reward: [(0, '4123.156')] [2023-03-09 06:42:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000074672_38232064.pth... [2023-03-09 06:42:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000074088_37933056.pth [2023-03-09 06:42:32,745][635776] Updated weights for policy 0, policy_version 74720 (0.0005) [2023-03-09 06:42:35,121][635489] Fps is (10 sec: 9420.8, 60 sec: 9966.9, 300 sec: 9844.3). Total num frames: 38277120. Throughput: 0: 9833.1. Samples: 38277248. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:42:35,122][635489] Avg episode reward: [(0, '4225.460')] [2023-03-09 06:42:37,047][635776] Updated weights for policy 0, policy_version 74800 (0.0005) [2023-03-09 06:42:40,122][635489] Fps is (10 sec: 9420.8, 60 sec: 9966.9, 300 sec: 9830.4). Total num frames: 38326272. Throughput: 0: 9847.7. Samples: 38305756. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:42:40,122][635489] Avg episode reward: [(0, '4150.771')] [2023-03-09 06:42:41,388][635776] Updated weights for policy 0, policy_version 74880 (0.0005) [2023-03-09 06:42:45,121][635489] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9830.4). Total num frames: 38375424. Throughput: 0: 9857.7. Samples: 38365208. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 06:42:45,122][635489] Avg episode reward: [(0, '4149.866')] [2023-03-09 06:42:45,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000074952_38375424.pth... [2023-03-09 06:42:45,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000074392_38088704.pth [2023-03-09 06:42:45,320][635776] Updated weights for policy 0, policy_version 74960 (0.0005) [2023-03-09 06:42:49,532][635776] Updated weights for policy 0, policy_version 75040 (0.0004) [2023-03-09 06:42:50,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9844.3). Total num frames: 38424576. Throughput: 0: 9817.1. Samples: 38424576. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 06:42:50,122][635489] Avg episode reward: [(0, '4194.646')] [2023-03-09 06:42:53,633][635776] Updated weights for policy 0, policy_version 75120 (0.0005) [2023-03-09 06:42:55,122][635489] Fps is (10 sec: 9830.3, 60 sec: 9898.7, 300 sec: 9844.3). Total num frames: 38473728. Throughput: 0: 9747.9. Samples: 38453460. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 06:42:55,122][635489] Avg episode reward: [(0, '2970.023')] [2023-03-09 06:42:57,612][635776] Updated weights for policy 0, policy_version 75200 (0.0005) [2023-03-09 06:43:00,121][635489] Fps is (10 sec: 9830.5, 60 sec: 9830.4, 300 sec: 9844.3). Total num frames: 38522880. Throughput: 0: 9750.5. Samples: 38514760. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 06:43:00,122][635489] Avg episode reward: [(0, '3443.382')] [2023-03-09 06:43:00,139][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000075248_38526976.pth... [2023-03-09 06:43:00,141][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000074672_38232064.pth [2023-03-09 06:43:01,732][635776] Updated weights for policy 0, policy_version 75280 (0.0005) [2023-03-09 06:43:05,121][635489] Fps is (10 sec: 10240.1, 60 sec: 9898.7, 300 sec: 9844.3). Total num frames: 38576128. Throughput: 0: 9749.9. Samples: 38574724. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 06:43:05,122][635489] Avg episode reward: [(0, '3970.433')] [2023-03-09 06:43:05,880][635776] Updated weights for policy 0, policy_version 75360 (0.0005) [2023-03-09 06:43:10,091][635776] Updated weights for policy 0, policy_version 75440 (0.0005) [2023-03-09 06:43:10,121][635489] Fps is (10 sec: 10240.1, 60 sec: 9830.4, 300 sec: 9858.2). Total num frames: 38625280. Throughput: 0: 9709.6. Samples: 38603460. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 06:43:10,122][635489] Avg episode reward: [(0, '3937.449')] [2023-03-09 06:43:14,121][635776] Updated weights for policy 0, policy_version 75520 (0.0005) [2023-03-09 06:43:15,122][635489] Fps is (10 sec: 9830.3, 60 sec: 9762.1, 300 sec: 9858.2). Total num frames: 38674432. Throughput: 0: 9888.3. Samples: 38664812. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 06:43:15,122][635489] Avg episode reward: [(0, '4216.892')] [2023-03-09 06:43:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000075536_38674432.pth... [2023-03-09 06:43:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000074952_38375424.pth [2023-03-09 06:43:18,358][635776] Updated weights for policy 0, policy_version 75600 (0.0005) [2023-03-09 06:43:20,121][635489] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9858.2). Total num frames: 38723584. Throughput: 0: 9902.0. Samples: 38722836. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 06:43:20,122][635489] Avg episode reward: [(0, '4291.310')] [2023-03-09 06:43:22,604][635776] Updated weights for policy 0, policy_version 75680 (0.0005) [2023-03-09 06:43:25,121][635489] Fps is (10 sec: 9830.5, 60 sec: 9830.4, 300 sec: 9858.2). Total num frames: 38772736. Throughput: 0: 9909.5. Samples: 38751684. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 06:43:25,122][635489] Avg episode reward: [(0, '4226.695')] [2023-03-09 06:43:26,628][635776] Updated weights for policy 0, policy_version 75760 (0.0005) [2023-03-09 06:43:30,122][635489] Fps is (10 sec: 10239.9, 60 sec: 9898.7, 300 sec: 9885.9). Total num frames: 38825984. Throughput: 0: 9966.4. Samples: 38813696. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 06:43:30,122][635489] Avg episode reward: [(0, '4491.335')] [2023-03-09 06:43:30,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000075832_38825984.pth... [2023-03-09 06:43:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000075248_38526976.pth [2023-03-09 06:43:30,361][635776] Updated weights for policy 0, policy_version 75840 (0.0005) [2023-03-09 06:43:34,287][635776] Updated weights for policy 0, policy_version 75920 (0.0005) [2023-03-09 06:43:35,122][635489] Fps is (10 sec: 10239.9, 60 sec: 9966.9, 300 sec: 9885.9). Total num frames: 38875136. Throughput: 0: 10040.6. Samples: 38876404. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 06:43:35,122][635489] Avg episode reward: [(0, '4308.230')] [2023-03-09 06:43:38,493][635776] Updated weights for policy 0, policy_version 76000 (0.0005) [2023-03-09 06:43:40,122][635489] Fps is (10 sec: 10240.1, 60 sec: 10035.2, 300 sec: 9899.8). Total num frames: 38928384. Throughput: 0: 10046.5. Samples: 38905552. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 06:43:40,122][635489] Avg episode reward: [(0, '4314.810')] [2023-03-09 06:43:42,767][635776] Updated weights for policy 0, policy_version 76080 (0.0005) [2023-03-09 06:43:45,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9899.8). Total num frames: 38973440. Throughput: 0: 10004.6. Samples: 38964968. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 06:43:45,122][635489] Avg episode reward: [(0, '4174.833')] [2023-03-09 06:43:45,133][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000076128_38977536.pth... [2023-03-09 06:43:45,135][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000075536_38674432.pth [2023-03-09 06:43:46,815][635776] Updated weights for policy 0, policy_version 76160 (0.0005) [2023-03-09 06:43:50,122][635489] Fps is (10 sec: 9420.8, 60 sec: 9966.9, 300 sec: 9885.9). Total num frames: 39022592. Throughput: 0: 9999.6. Samples: 39024708. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 06:43:50,122][635489] Avg episode reward: [(0, '4394.678')] [2023-03-09 06:43:51,001][635776] Updated weights for policy 0, policy_version 76240 (0.0005) [2023-03-09 06:43:55,121][635489] Fps is (10 sec: 9830.5, 60 sec: 9966.9, 300 sec: 9899.8). Total num frames: 39071744. Throughput: 0: 10005.4. Samples: 39053704. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 06:43:55,122][635489] Avg episode reward: [(0, '4068.475')] [2023-03-09 06:43:55,186][635776] Updated weights for policy 0, policy_version 76320 (0.0005) [2023-03-09 06:43:59,483][635776] Updated weights for policy 0, policy_version 76400 (0.0005) [2023-03-09 06:44:00,122][635489] Fps is (10 sec: 9830.3, 60 sec: 9966.9, 300 sec: 9899.8). Total num frames: 39120896. Throughput: 0: 9931.4. Samples: 39111724. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 06:44:00,122][635489] Avg episode reward: [(0, '3978.433')] [2023-03-09 06:44:00,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000076408_39120896.pth... [2023-03-09 06:44:00,129][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000075832_38825984.pth [2023-03-09 06:44:03,563][635776] Updated weights for policy 0, policy_version 76480 (0.0005) [2023-03-09 06:44:05,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9899.8). Total num frames: 39170048. Throughput: 0: 9942.0. Samples: 39170228. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 06:44:05,122][635489] Avg episode reward: [(0, '4441.913')] [2023-03-09 06:44:07,881][635776] Updated weights for policy 0, policy_version 76560 (0.0005) [2023-03-09 06:44:10,122][635489] Fps is (10 sec: 9830.5, 60 sec: 9898.6, 300 sec: 9913.7). Total num frames: 39219200. Throughput: 0: 9935.5. Samples: 39198784. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 06:44:10,122][635489] Avg episode reward: [(0, '4345.919')] [2023-03-09 06:44:12,179][635776] Updated weights for policy 0, policy_version 76640 (0.0004) [2023-03-09 06:44:15,122][635489] Fps is (10 sec: 9830.3, 60 sec: 9898.7, 300 sec: 9913.7). Total num frames: 39268352. Throughput: 0: 9842.2. Samples: 39256596. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 06:44:15,122][635489] Avg episode reward: [(0, '4288.977')] [2023-03-09 06:44:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000076696_39268352.pth... [2023-03-09 06:44:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000076128_38977536.pth [2023-03-09 06:44:16,248][635776] Updated weights for policy 0, policy_version 76720 (0.0005) [2023-03-09 06:44:20,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9913.7). Total num frames: 39317504. Throughput: 0: 9782.1. Samples: 39316600. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:44:20,122][635489] Avg episode reward: [(0, '4008.528')] [2023-03-09 06:44:20,492][635776] Updated weights for policy 0, policy_version 76800 (0.0004) [2023-03-09 06:44:24,793][635776] Updated weights for policy 0, policy_version 76880 (0.0005) [2023-03-09 06:44:25,122][635489] Fps is (10 sec: 9420.9, 60 sec: 9830.4, 300 sec: 9899.8). Total num frames: 39362560. Throughput: 0: 9785.7. Samples: 39345908. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:44:25,122][635489] Avg episode reward: [(0, '3665.301')] [2023-03-09 06:44:28,764][635776] Updated weights for policy 0, policy_version 76960 (0.0005) [2023-03-09 06:44:30,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9913.7). Total num frames: 39415808. Throughput: 0: 9769.7. Samples: 39404604. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:44:30,122][635489] Avg episode reward: [(0, '3578.538')] [2023-03-09 06:44:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000076984_39415808.pth... [2023-03-09 06:44:30,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000076408_39120896.pth [2023-03-09 06:44:32,878][635776] Updated weights for policy 0, policy_version 77040 (0.0005) [2023-03-09 06:44:35,121][635489] Fps is (10 sec: 10240.1, 60 sec: 9830.4, 300 sec: 9913.7). Total num frames: 39464960. Throughput: 0: 9787.1. Samples: 39465128. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:44:35,122][635489] Avg episode reward: [(0, '3154.682')] [2023-03-09 06:44:36,899][635776] Updated weights for policy 0, policy_version 77120 (0.0005) [2023-03-09 06:44:40,121][635489] Fps is (10 sec: 9830.5, 60 sec: 9762.1, 300 sec: 9899.8). Total num frames: 39514112. Throughput: 0: 9834.5. Samples: 39496256. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:44:40,122][635489] Avg episode reward: [(0, '3431.910')] [2023-03-09 06:44:41,111][635776] Updated weights for policy 0, policy_version 77200 (0.0005) [2023-03-09 06:44:45,122][635489] Fps is (10 sec: 9830.3, 60 sec: 9830.4, 300 sec: 9885.9). Total num frames: 39563264. Throughput: 0: 9775.1. Samples: 39551604. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:44:45,122][635489] Avg episode reward: [(0, '4109.701')] [2023-03-09 06:44:45,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000077272_39563264.pth... [2023-03-09 06:44:45,129][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000076696_39268352.pth [2023-03-09 06:44:45,474][635776] Updated weights for policy 0, policy_version 77280 (0.0005) [2023-03-09 06:44:49,811][635776] Updated weights for policy 0, policy_version 77360 (0.0005) [2023-03-09 06:44:50,122][635489] Fps is (10 sec: 9420.7, 60 sec: 9762.1, 300 sec: 9885.9). Total num frames: 39608320. Throughput: 0: 9745.3. Samples: 39608768. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:44:50,122][635489] Avg episode reward: [(0, '4414.419')] [2023-03-09 06:44:53,853][635776] Updated weights for policy 0, policy_version 77440 (0.0005) [2023-03-09 06:44:55,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9885.9). Total num frames: 39661568. Throughput: 0: 9815.3. Samples: 39640472. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:44:55,122][635489] Avg episode reward: [(0, '4510.526')] [2023-03-09 06:44:57,968][635776] Updated weights for policy 0, policy_version 77520 (0.0005) [2023-03-09 06:45:00,122][635489] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 9885.9). Total num frames: 39710720. Throughput: 0: 9824.3. Samples: 39698688. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:45:00,122][635489] Avg episode reward: [(0, '4468.050')] [2023-03-09 06:45:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000077560_39710720.pth... [2023-03-09 06:45:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000076984_39415808.pth [2023-03-09 06:45:02,058][635776] Updated weights for policy 0, policy_version 77600 (0.0005) [2023-03-09 06:45:05,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9872.1). Total num frames: 39759872. Throughput: 0: 9843.3. Samples: 39759548. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:45:05,122][635489] Avg episode reward: [(0, '4511.993')] [2023-03-09 06:45:06,307][635776] Updated weights for policy 0, policy_version 77680 (0.0005) [2023-03-09 06:45:10,122][635489] Fps is (10 sec: 9830.5, 60 sec: 9830.4, 300 sec: 9872.1). Total num frames: 39809024. Throughput: 0: 9835.3. Samples: 39788496. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:45:10,132][635489] Avg episode reward: [(0, '4384.566')] [2023-03-09 06:45:10,294][635776] Updated weights for policy 0, policy_version 77760 (0.0005) [2023-03-09 06:45:14,538][635776] Updated weights for policy 0, policy_version 77840 (0.0006) [2023-03-09 06:45:15,121][635489] Fps is (10 sec: 9830.5, 60 sec: 9830.4, 300 sec: 9872.1). Total num frames: 39858176. Throughput: 0: 9851.9. Samples: 39847940. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:45:15,132][635489] Avg episode reward: [(0, '4418.592')] [2023-03-09 06:45:15,136][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000077848_39858176.pth... [2023-03-09 06:45:15,138][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000077272_39563264.pth [2023-03-09 06:45:18,471][635776] Updated weights for policy 0, policy_version 77920 (0.0005) [2023-03-09 06:45:20,122][635489] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9885.9). Total num frames: 39911424. Throughput: 0: 9864.3. Samples: 39909024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:45:20,133][635489] Avg episode reward: [(0, '4532.263')] [2023-03-09 06:45:22,611][635776] Updated weights for policy 0, policy_version 78000 (0.0005) [2023-03-09 06:45:25,122][635489] Fps is (10 sec: 10239.9, 60 sec: 9966.9, 300 sec: 9885.9). Total num frames: 39960576. Throughput: 0: 9851.6. Samples: 39939580. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:45:25,133][635489] Avg episode reward: [(0, '4541.956')] [2023-03-09 06:45:26,715][635776] Updated weights for policy 0, policy_version 78080 (0.0005) [2023-03-09 06:45:30,121][635489] Fps is (10 sec: 9420.8, 60 sec: 9830.4, 300 sec: 9858.2). Total num frames: 40005632. Throughput: 0: 9901.5. Samples: 39997172. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:45:30,132][635489] Avg episode reward: [(0, '4576.066')] [2023-03-09 06:45:30,135][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000078136_40005632.pth... [2023-03-09 06:45:30,138][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000077560_39710720.pth [2023-03-09 06:45:31,248][635776] Updated weights for policy 0, policy_version 78160 (0.0006) [2023-03-09 06:45:35,122][635489] Fps is (10 sec: 9420.8, 60 sec: 9830.4, 300 sec: 9858.2). Total num frames: 40054784. Throughput: 0: 9911.0. Samples: 40054764. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:45:35,122][635489] Avg episode reward: [(0, '4562.636')] [2023-03-09 06:45:35,365][635776] Updated weights for policy 0, policy_version 78240 (0.0005) [2023-03-09 06:45:39,133][635776] Updated weights for policy 0, policy_version 78320 (0.0005) [2023-03-09 06:45:40,122][635489] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9885.9). Total num frames: 40108032. Throughput: 0: 9928.9. Samples: 40087272. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:45:40,122][635489] Avg episode reward: [(0, '4580.212')] [2023-03-09 06:45:43,216][635776] Updated weights for policy 0, policy_version 78400 (0.0004) [2023-03-09 06:45:45,122][635489] Fps is (10 sec: 10239.9, 60 sec: 9898.7, 300 sec: 9885.9). Total num frames: 40157184. Throughput: 0: 9978.7. Samples: 40147732. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:45:45,122][635489] Avg episode reward: [(0, '4587.846')] [2023-03-09 06:45:45,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000078432_40157184.pth... [2023-03-09 06:45:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000077848_39858176.pth [2023-03-09 06:45:47,374][635776] Updated weights for policy 0, policy_version 78480 (0.0005) [2023-03-09 06:45:50,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9885.9). Total num frames: 40206336. Throughput: 0: 9928.8. Samples: 40206344. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:45:50,122][635489] Avg episode reward: [(0, '4538.505')] [2023-03-09 06:45:51,419][635776] Updated weights for policy 0, policy_version 78560 (0.0004) [2023-03-09 06:45:55,121][635489] Fps is (10 sec: 10240.1, 60 sec: 9966.9, 300 sec: 9899.8). Total num frames: 40259584. Throughput: 0: 10013.7. Samples: 40239112. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 06:45:55,122][635489] Avg episode reward: [(0, '3989.728')] [2023-03-09 06:45:55,461][635776] Updated weights for policy 0, policy_version 78640 (0.0005) [2023-03-09 06:45:59,536][635776] Updated weights for policy 0, policy_version 78720 (0.0005) [2023-03-09 06:46:00,122][635489] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 9913.7). Total num frames: 40308736. Throughput: 0: 10017.3. Samples: 40298720. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 06:46:00,122][635489] Avg episode reward: [(0, '4235.889')] [2023-03-09 06:46:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000078728_40308736.pth... [2023-03-09 06:46:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000078136_40005632.pth [2023-03-09 06:46:03,791][635776] Updated weights for policy 0, policy_version 78800 (0.0005) [2023-03-09 06:46:05,122][635489] Fps is (10 sec: 9830.3, 60 sec: 9966.9, 300 sec: 9913.7). Total num frames: 40357888. Throughput: 0: 9942.8. Samples: 40356448. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 06:46:05,133][635489] Avg episode reward: [(0, '4467.327')] [2023-03-09 06:46:07,918][635776] Updated weights for policy 0, policy_version 78880 (0.0005) [2023-03-09 06:46:10,122][635489] Fps is (10 sec: 9830.5, 60 sec: 9966.9, 300 sec: 9913.7). Total num frames: 40407040. Throughput: 0: 9932.9. Samples: 40386560. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 06:46:10,133][635489] Avg episode reward: [(0, '4580.351')] [2023-03-09 06:46:12,027][635776] Updated weights for policy 0, policy_version 78960 (0.0005) [2023-03-09 06:46:15,121][635489] Fps is (10 sec: 9830.5, 60 sec: 9966.9, 300 sec: 9927.6). Total num frames: 40456192. Throughput: 0: 10024.5. Samples: 40448272. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 06:46:15,132][635489] Avg episode reward: [(0, '4601.626')] [2023-03-09 06:46:15,169][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000079024_40460288.pth... [2023-03-09 06:46:15,170][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000078432_40157184.pth [2023-03-09 06:46:15,864][635776] Updated weights for policy 0, policy_version 79040 (0.0005) [2023-03-09 06:46:19,996][635776] Updated weights for policy 0, policy_version 79120 (0.0005) [2023-03-09 06:46:20,122][635489] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 9941.5). Total num frames: 40509440. Throughput: 0: 10101.5. Samples: 40509332. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 06:46:20,122][635489] Avg episode reward: [(0, '4589.467')] [2023-03-09 06:46:23,890][635776] Updated weights for policy 0, policy_version 79200 (0.0005) [2023-03-09 06:46:25,122][635489] Fps is (10 sec: 10649.4, 60 sec: 10035.2, 300 sec: 9941.5). Total num frames: 40562688. Throughput: 0: 10058.6. Samples: 40539908. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 06:46:25,122][635489] Avg episode reward: [(0, '4544.541')] [2023-03-09 06:46:27,858][635776] Updated weights for policy 0, policy_version 79280 (0.0005) [2023-03-09 06:46:30,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 9941.5). Total num frames: 40611840. Throughput: 0: 10125.3. Samples: 40603372. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 06:46:30,133][635489] Avg episode reward: [(0, '4521.499')] [2023-03-09 06:46:30,136][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000079320_40611840.pth... [2023-03-09 06:46:30,139][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000078728_40308736.pth [2023-03-09 06:46:32,065][635776] Updated weights for policy 0, policy_version 79360 (0.0005) [2023-03-09 06:46:35,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10103.5, 300 sec: 9941.5). Total num frames: 40660992. Throughput: 0: 10100.1. Samples: 40660848. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 06:46:35,133][635489] Avg episode reward: [(0, '4516.467')] [2023-03-09 06:46:36,234][635776] Updated weights for policy 0, policy_version 79440 (0.0005) [2023-03-09 06:46:40,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10035.2, 300 sec: 9927.6). Total num frames: 40710144. Throughput: 0: 10006.2. Samples: 40689392. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 06:46:40,132][635489] Avg episode reward: [(0, '4573.965')] [2023-03-09 06:46:40,517][635776] Updated weights for policy 0, policy_version 79520 (0.0005) [2023-03-09 06:46:44,648][635776] Updated weights for policy 0, policy_version 79600 (0.0004) [2023-03-09 06:46:45,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10035.2, 300 sec: 9913.7). Total num frames: 40759296. Throughput: 0: 9986.9. Samples: 40748132. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 06:46:45,133][635489] Avg episode reward: [(0, '4444.947')] [2023-03-09 06:46:45,136][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000079608_40759296.pth... [2023-03-09 06:46:45,139][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000079024_40460288.pth [2023-03-09 06:46:48,871][635776] Updated weights for policy 0, policy_version 79680 (0.0004) [2023-03-09 06:46:50,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10035.2, 300 sec: 9927.6). Total num frames: 40808448. Throughput: 0: 9990.3. Samples: 40806012. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 06:46:50,122][635489] Avg episode reward: [(0, '4135.417')] [2023-03-09 06:46:53,144][635776] Updated weights for policy 0, policy_version 79760 (0.0005) [2023-03-09 06:46:55,121][635489] Fps is (10 sec: 9420.9, 60 sec: 9898.7, 300 sec: 9899.8). Total num frames: 40853504. Throughput: 0: 9973.3. Samples: 40835360. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 06:46:55,122][635489] Avg episode reward: [(0, '4423.181')] [2023-03-09 06:46:57,187][635776] Updated weights for policy 0, policy_version 79840 (0.0005) [2023-03-09 06:47:00,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9913.7). Total num frames: 40906752. Throughput: 0: 9925.0. Samples: 40894900. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 06:47:00,122][635489] Avg episode reward: [(0, '4195.424')] [2023-03-09 06:47:00,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000079896_40906752.pth... [2023-03-09 06:47:00,129][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000079320_40611840.pth [2023-03-09 06:47:01,176][635776] Updated weights for policy 0, policy_version 79920 (0.0005) [2023-03-09 06:47:05,120][635776] Updated weights for policy 0, policy_version 80000 (0.0005) [2023-03-09 06:47:05,121][635489] Fps is (10 sec: 10649.7, 60 sec: 10035.2, 300 sec: 9913.7). Total num frames: 40960000. Throughput: 0: 9942.9. Samples: 40956760. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 06:47:05,122][635489] Avg episode reward: [(0, '4285.700')] [2023-03-09 06:47:09,006][635776] Updated weights for policy 0, policy_version 80080 (0.0004) [2023-03-09 06:47:10,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10035.2, 300 sec: 9899.8). Total num frames: 41009152. Throughput: 0: 9974.0. Samples: 40988736. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 06:47:10,122][635489] Avg episode reward: [(0, '4545.049')] [2023-03-09 06:47:13,129][635776] Updated weights for policy 0, policy_version 80160 (0.0005) [2023-03-09 06:47:15,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10035.2, 300 sec: 9913.7). Total num frames: 41058304. Throughput: 0: 9925.2. Samples: 41050008. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 06:47:15,122][635489] Avg episode reward: [(0, '4494.424')] [2023-03-09 06:47:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000080200_41062400.pth... [2023-03-09 06:47:15,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000079608_40759296.pth [2023-03-09 06:47:17,089][635776] Updated weights for policy 0, policy_version 80240 (0.0005) [2023-03-09 06:47:20,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10035.2, 300 sec: 9927.6). Total num frames: 41111552. Throughput: 0: 10009.1. Samples: 41111260. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 06:47:20,122][635489] Avg episode reward: [(0, '4383.536')] [2023-03-09 06:47:21,214][635776] Updated weights for policy 0, policy_version 80320 (0.0005) [2023-03-09 06:47:25,122][635489] Fps is (10 sec: 10240.0, 60 sec: 9967.0, 300 sec: 9927.6). Total num frames: 41160704. Throughput: 0: 10058.8. Samples: 41142040. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 06:47:25,122][635489] Avg episode reward: [(0, '4500.427')] [2023-03-09 06:47:25,163][635776] Updated weights for policy 0, policy_version 80400 (0.0006) [2023-03-09 06:47:29,343][635776] Updated weights for policy 0, policy_version 80480 (0.0005) [2023-03-09 06:47:30,121][635489] Fps is (10 sec: 9830.5, 60 sec: 9967.0, 300 sec: 9941.5). Total num frames: 41209856. Throughput: 0: 10066.5. Samples: 41201124. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:47:30,122][635489] Avg episode reward: [(0, '4400.929')] [2023-03-09 06:47:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000080496_41213952.pth... [2023-03-09 06:47:30,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000079896_40906752.pth [2023-03-09 06:47:33,385][635776] Updated weights for policy 0, policy_version 80560 (0.0005) [2023-03-09 06:47:35,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9955.4). Total num frames: 41263104. Throughput: 0: 10155.7. Samples: 41263016. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:47:35,122][635489] Avg episode reward: [(0, '4370.227')] [2023-03-09 06:47:37,540][635776] Updated weights for policy 0, policy_version 80640 (0.0005) [2023-03-09 06:47:40,121][635489] Fps is (10 sec: 9830.3, 60 sec: 9966.9, 300 sec: 9941.5). Total num frames: 41308160. Throughput: 0: 10140.1. Samples: 41291664. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:47:40,122][635489] Avg episode reward: [(0, '4322.107')] [2023-03-09 06:47:42,106][635776] Updated weights for policy 0, policy_version 80720 (0.0006) [2023-03-09 06:47:45,122][635489] Fps is (10 sec: 9420.7, 60 sec: 9966.9, 300 sec: 9941.5). Total num frames: 41357312. Throughput: 0: 10006.7. Samples: 41345200. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:47:45,122][635489] Avg episode reward: [(0, '3789.146')] [2023-03-09 06:47:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000080776_41357312.pth... [2023-03-09 06:47:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000080200_41062400.pth [2023-03-09 06:47:46,288][635776] Updated weights for policy 0, policy_version 80800 (0.0006) [2023-03-09 06:47:50,121][635489] Fps is (10 sec: 9830.5, 60 sec: 9967.0, 300 sec: 9941.5). Total num frames: 41406464. Throughput: 0: 10020.6. Samples: 41407688. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:47:50,122][635489] Avg episode reward: [(0, '4152.894')] [2023-03-09 06:47:50,177][635776] Updated weights for policy 0, policy_version 80880 (0.0005) [2023-03-09 06:47:54,159][635776] Updated weights for policy 0, policy_version 80960 (0.0005) [2023-03-09 06:47:55,122][635489] Fps is (10 sec: 10240.1, 60 sec: 10103.5, 300 sec: 9955.4). Total num frames: 41459712. Throughput: 0: 9984.6. Samples: 41438044. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:47:55,122][635489] Avg episode reward: [(0, '3980.123')] [2023-03-09 06:47:58,012][635776] Updated weights for policy 0, policy_version 81040 (0.0006) [2023-03-09 06:48:00,121][635489] Fps is (10 sec: 10239.9, 60 sec: 10035.2, 300 sec: 9941.5). Total num frames: 41508864. Throughput: 0: 10018.2. Samples: 41500828. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:48:00,122][635489] Avg episode reward: [(0, '4297.031')] [2023-03-09 06:48:00,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000081072_41508864.pth... [2023-03-09 06:48:00,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000080496_41213952.pth [2023-03-09 06:48:02,209][635776] Updated weights for policy 0, policy_version 81120 (0.0006) [2023-03-09 06:48:05,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10035.2, 300 sec: 9955.4). Total num frames: 41562112. Throughput: 0: 10017.9. Samples: 41562064. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:48:05,122][635489] Avg episode reward: [(0, '4281.155')] [2023-03-09 06:48:06,060][635776] Updated weights for policy 0, policy_version 81200 (0.0005) [2023-03-09 06:48:10,081][635776] Updated weights for policy 0, policy_version 81280 (0.0005) [2023-03-09 06:48:10,121][635489] Fps is (10 sec: 10649.7, 60 sec: 10103.5, 300 sec: 9969.3). Total num frames: 41615360. Throughput: 0: 10041.8. Samples: 41593920. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:48:10,122][635489] Avg episode reward: [(0, '4446.217')] [2023-03-09 06:48:14,336][635776] Updated weights for policy 0, policy_version 81360 (0.0005) [2023-03-09 06:48:15,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 9969.2). Total num frames: 41664512. Throughput: 0: 10025.8. Samples: 41652288. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:48:15,122][635489] Avg episode reward: [(0, '4314.007')] [2023-03-09 06:48:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000081376_41664512.pth... [2023-03-09 06:48:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000080776_41357312.pth [2023-03-09 06:48:18,297][635776] Updated weights for policy 0, policy_version 81440 (0.0005) [2023-03-09 06:48:20,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10035.2, 300 sec: 9969.2). Total num frames: 41713664. Throughput: 0: 10012.4. Samples: 41713572. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 06:48:20,122][635489] Avg episode reward: [(0, '4425.848')] [2023-03-09 06:48:22,537][635776] Updated weights for policy 0, policy_version 81520 (0.0005) [2023-03-09 06:48:25,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10035.2, 300 sec: 9955.4). Total num frames: 41762816. Throughput: 0: 10013.3. Samples: 41742264. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 06:48:25,122][635489] Avg episode reward: [(0, '4490.065')] [2023-03-09 06:48:26,627][635776] Updated weights for policy 0, policy_version 81600 (0.0005) [2023-03-09 06:48:30,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10035.2, 300 sec: 9955.4). Total num frames: 41811968. Throughput: 0: 10126.1. Samples: 41800872. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 06:48:30,122][635489] Avg episode reward: [(0, '4578.256')] [2023-03-09 06:48:30,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000081664_41811968.pth... [2023-03-09 06:48:30,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000081072_41508864.pth [2023-03-09 06:48:30,757][635776] Updated weights for policy 0, policy_version 81680 (0.0005) [2023-03-09 06:48:35,072][635776] Updated weights for policy 0, policy_version 81760 (0.0005) [2023-03-09 06:48:35,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9941.5). Total num frames: 41861120. Throughput: 0: 10045.0. Samples: 41859712. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 06:48:35,122][635489] Avg episode reward: [(0, '4448.180')] [2023-03-09 06:48:39,179][635776] Updated weights for policy 0, policy_version 81840 (0.0005) [2023-03-09 06:48:40,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 9955.4). Total num frames: 41910272. Throughput: 0: 10038.9. Samples: 41889792. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 06:48:40,122][635489] Avg episode reward: [(0, '4444.858')] [2023-03-09 06:48:43,230][635776] Updated weights for policy 0, policy_version 81920 (0.0005) [2023-03-09 06:48:45,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 9955.4). Total num frames: 41959424. Throughput: 0: 9997.8. Samples: 41950728. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 06:48:45,122][635489] Avg episode reward: [(0, '4526.966')] [2023-03-09 06:48:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000081952_41959424.pth... [2023-03-09 06:48:45,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000081376_41664512.pth [2023-03-09 06:48:47,379][635776] Updated weights for policy 0, policy_version 82000 (0.0005) [2023-03-09 06:48:50,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 9955.4). Total num frames: 42008576. Throughput: 0: 9948.8. Samples: 42009760. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 06:48:50,122][635489] Avg episode reward: [(0, '4537.969')] [2023-03-09 06:48:51,298][635776] Updated weights for policy 0, policy_version 82080 (0.0005) [2023-03-09 06:48:55,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10035.2, 300 sec: 9969.3). Total num frames: 42061824. Throughput: 0: 9944.2. Samples: 42041408. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 06:48:55,122][635489] Avg episode reward: [(0, '4525.771')] [2023-03-09 06:48:55,161][635776] Updated weights for policy 0, policy_version 82160 (0.0005) [2023-03-09 06:48:59,424][635776] Updated weights for policy 0, policy_version 82240 (0.0005) [2023-03-09 06:49:00,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9969.2). Total num frames: 42110976. Throughput: 0: 10006.1. Samples: 42102560. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 06:49:00,122][635489] Avg episode reward: [(0, '4508.551')] [2023-03-09 06:49:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000082248_42110976.pth... [2023-03-09 06:49:00,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000081664_41811968.pth [2023-03-09 06:49:03,704][635776] Updated weights for policy 0, policy_version 82320 (0.0005) [2023-03-09 06:49:05,122][635489] Fps is (10 sec: 9830.3, 60 sec: 9966.9, 300 sec: 9969.2). Total num frames: 42160128. Throughput: 0: 9911.2. Samples: 42159576. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 06:49:05,122][635489] Avg episode reward: [(0, '4587.282')] [2023-03-09 06:49:07,935][635776] Updated weights for policy 0, policy_version 82400 (0.0004) [2023-03-09 06:49:10,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9969.3). Total num frames: 42209280. Throughput: 0: 9923.0. Samples: 42188800. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:49:10,122][635489] Avg episode reward: [(0, '4617.876')] [2023-03-09 06:49:10,123][635732] Saving new best policy, reward=4617.876! [2023-03-09 06:49:11,807][635776] Updated weights for policy 0, policy_version 82480 (0.0004) [2023-03-09 06:49:15,121][635489] Fps is (10 sec: 9830.5, 60 sec: 9898.7, 300 sec: 9969.2). Total num frames: 42258432. Throughput: 0: 9986.1. Samples: 42250248. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:49:15,122][635489] Avg episode reward: [(0, '4439.213')] [2023-03-09 06:49:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000082536_42258432.pth... [2023-03-09 06:49:15,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000081952_41959424.pth [2023-03-09 06:49:16,089][635776] Updated weights for policy 0, policy_version 82560 (0.0005) [2023-03-09 06:49:20,121][635489] Fps is (10 sec: 9830.5, 60 sec: 9898.7, 300 sec: 9983.1). Total num frames: 42307584. Throughput: 0: 9979.5. Samples: 42308788. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:49:20,122][635489] Avg episode reward: [(0, '4526.648')] [2023-03-09 06:49:20,207][635776] Updated weights for policy 0, policy_version 82640 (0.0005) [2023-03-09 06:49:24,258][635776] Updated weights for policy 0, policy_version 82720 (0.0005) [2023-03-09 06:49:25,122][635489] Fps is (10 sec: 9830.3, 60 sec: 9898.7, 300 sec: 9969.2). Total num frames: 42356736. Throughput: 0: 10007.9. Samples: 42340148. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:49:25,122][635489] Avg episode reward: [(0, '4469.354')] [2023-03-09 06:49:28,384][635776] Updated weights for policy 0, policy_version 82800 (0.0004) [2023-03-09 06:49:30,122][635489] Fps is (10 sec: 10239.9, 60 sec: 9966.9, 300 sec: 9983.1). Total num frames: 42409984. Throughput: 0: 9969.3. Samples: 42399348. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:49:30,122][635489] Avg episode reward: [(0, '4189.667')] [2023-03-09 06:49:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000082832_42409984.pth... [2023-03-09 06:49:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000082248_42110976.pth [2023-03-09 06:49:32,331][635776] Updated weights for policy 0, policy_version 82880 (0.0005) [2023-03-09 06:49:35,122][635489] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 9983.1). Total num frames: 42459136. Throughput: 0: 9999.5. Samples: 42459736. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:49:35,122][635489] Avg episode reward: [(0, '4281.815')] [2023-03-09 06:49:36,424][635776] Updated weights for policy 0, policy_version 82960 (0.0004) [2023-03-09 06:49:40,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10035.2, 300 sec: 9997.0). Total num frames: 42512384. Throughput: 0: 10008.8. Samples: 42491804. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:49:40,122][635489] Avg episode reward: [(0, '4034.459')] [2023-03-09 06:49:40,521][635776] Updated weights for policy 0, policy_version 83040 (0.0005) [2023-03-09 06:49:44,495][635776] Updated weights for policy 0, policy_version 83120 (0.0005) [2023-03-09 06:49:45,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10010.9). Total num frames: 42561536. Throughput: 0: 9997.7. Samples: 42552456. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:49:45,122][635489] Avg episode reward: [(0, '4093.900')] [2023-03-09 06:49:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000083128_42561536.pth... [2023-03-09 06:49:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000082536_42258432.pth [2023-03-09 06:49:48,448][635776] Updated weights for policy 0, policy_version 83200 (0.0005) [2023-03-09 06:49:50,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 10010.9). Total num frames: 42614784. Throughput: 0: 10092.5. Samples: 42613740. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:49:50,122][635489] Avg episode reward: [(0, '4135.202')] [2023-03-09 06:49:52,364][635776] Updated weights for policy 0, policy_version 83280 (0.0005) [2023-03-09 06:49:55,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10035.2, 300 sec: 10010.9). Total num frames: 42663936. Throughput: 0: 10141.2. Samples: 42645152. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:49:55,122][635489] Avg episode reward: [(0, '4208.429')] [2023-03-09 06:49:56,389][635776] Updated weights for policy 0, policy_version 83360 (0.0005) [2023-03-09 06:50:00,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 10024.8). Total num frames: 42717184. Throughput: 0: 10159.0. Samples: 42707404. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:50:00,122][635489] Avg episode reward: [(0, '4440.740')] [2023-03-09 06:50:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000083432_42717184.pth... [2023-03-09 06:50:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000082832_42409984.pth [2023-03-09 06:50:00,344][635776] Updated weights for policy 0, policy_version 83440 (0.0005) [2023-03-09 06:50:04,096][635776] Updated weights for policy 0, policy_version 83520 (0.0005) [2023-03-09 06:50:05,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10171.7, 300 sec: 10038.7). Total num frames: 42770432. Throughput: 0: 10260.2. Samples: 42770496. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:50:05,122][635489] Avg episode reward: [(0, '4136.375')] [2023-03-09 06:50:08,073][635776] Updated weights for policy 0, policy_version 83600 (0.0005) [2023-03-09 06:50:10,121][635489] Fps is (10 sec: 10649.7, 60 sec: 10240.0, 300 sec: 10052.6). Total num frames: 42823680. Throughput: 0: 10236.8. Samples: 42800804. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:50:10,122][635489] Avg episode reward: [(0, '4002.116')] [2023-03-09 06:50:11,848][635776] Updated weights for policy 0, policy_version 83680 (0.0005) [2023-03-09 06:50:15,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10038.7). Total num frames: 42872832. Throughput: 0: 10339.8. Samples: 42864640. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:50:15,122][635489] Avg episode reward: [(0, '4208.982')] [2023-03-09 06:50:15,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000083736_42872832.pth... [2023-03-09 06:50:15,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000083128_42561536.pth [2023-03-09 06:50:16,047][635776] Updated weights for policy 0, policy_version 83760 (0.0004) [2023-03-09 06:50:20,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 10038.7). Total num frames: 42921984. Throughput: 0: 10276.1. Samples: 42922160. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:50:20,122][635489] Avg episode reward: [(0, '3968.887')] [2023-03-09 06:50:20,308][635776] Updated weights for policy 0, policy_version 83840 (0.0004) [2023-03-09 06:50:24,617][635776] Updated weights for policy 0, policy_version 83920 (0.0005) [2023-03-09 06:50:25,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 10052.6). Total num frames: 42971136. Throughput: 0: 10198.3. Samples: 42950728. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:50:25,122][635489] Avg episode reward: [(0, '3983.879')] [2023-03-09 06:50:28,625][635776] Updated weights for policy 0, policy_version 84000 (0.0005) [2023-03-09 06:50:30,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10171.7, 300 sec: 10052.6). Total num frames: 43020288. Throughput: 0: 10198.9. Samples: 43011404. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:50:30,122][635489] Avg episode reward: [(0, '3819.160')] [2023-03-09 06:50:30,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000084024_43020288.pth... [2023-03-09 06:50:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000083432_42717184.pth [2023-03-09 06:50:32,622][635776] Updated weights for policy 0, policy_version 84080 (0.0005) [2023-03-09 06:50:35,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10052.6). Total num frames: 43073536. Throughput: 0: 10175.9. Samples: 43071656. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:50:35,122][635489] Avg episode reward: [(0, '4399.830')] [2023-03-09 06:50:36,704][635776] Updated weights for policy 0, policy_version 84160 (0.0005) [2023-03-09 06:50:40,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10052.6). Total num frames: 43122688. Throughput: 0: 10151.5. Samples: 43101968. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:50:40,122][635489] Avg episode reward: [(0, '4596.426')] [2023-03-09 06:50:40,719][635776] Updated weights for policy 0, policy_version 84240 (0.0005) [2023-03-09 06:50:44,679][635776] Updated weights for policy 0, policy_version 84320 (0.0004) [2023-03-09 06:50:45,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10066.4). Total num frames: 43175936. Throughput: 0: 10149.9. Samples: 43164148. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:50:45,122][635489] Avg episode reward: [(0, '4542.504')] [2023-03-09 06:50:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000084328_43175936.pth... [2023-03-09 06:50:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000083736_42872832.pth [2023-03-09 06:50:48,739][635776] Updated weights for policy 0, policy_version 84400 (0.0005) [2023-03-09 06:50:50,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 10052.6). Total num frames: 43225088. Throughput: 0: 10103.5. Samples: 43225152. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:50:50,122][635489] Avg episode reward: [(0, '4467.188')] [2023-03-09 06:50:52,688][635776] Updated weights for policy 0, policy_version 84480 (0.0005) [2023-03-09 06:50:55,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10066.4). Total num frames: 43278336. Throughput: 0: 10118.0. Samples: 43256116. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:50:55,122][635489] Avg episode reward: [(0, '4548.617')] [2023-03-09 06:50:56,768][635776] Updated weights for policy 0, policy_version 84560 (0.0005) [2023-03-09 06:51:00,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10052.6). Total num frames: 43323392. Throughput: 0: 10007.9. Samples: 43314996. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:51:00,122][635489] Avg episode reward: [(0, '4455.600')] [2023-03-09 06:51:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000084616_43323392.pth... [2023-03-09 06:51:00,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000084024_43020288.pth [2023-03-09 06:51:01,181][635776] Updated weights for policy 0, policy_version 84640 (0.0004) [2023-03-09 06:51:05,121][635489] Fps is (10 sec: 9420.8, 60 sec: 10035.2, 300 sec: 10052.6). Total num frames: 43372544. Throughput: 0: 10027.7. Samples: 43373404. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:51:05,122][635489] Avg episode reward: [(0, '3531.876')] [2023-03-09 06:51:05,170][635776] Updated weights for policy 0, policy_version 84720 (0.0005) [2023-03-09 06:51:09,405][635776] Updated weights for policy 0, policy_version 84800 (0.0005) [2023-03-09 06:51:10,121][635489] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 10052.6). Total num frames: 43421696. Throughput: 0: 10058.3. Samples: 43403352. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:51:10,122][635489] Avg episode reward: [(0, '4268.051')] [2023-03-09 06:51:13,466][635776] Updated weights for policy 0, policy_version 84880 (0.0004) [2023-03-09 06:51:15,121][635489] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 10038.7). Total num frames: 43470848. Throughput: 0: 10028.0. Samples: 43462664. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:51:15,122][635489] Avg episode reward: [(0, '4466.857')] [2023-03-09 06:51:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000084904_43470848.pth... [2023-03-09 06:51:15,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000084328_43175936.pth [2023-03-09 06:51:17,786][635776] Updated weights for policy 0, policy_version 84960 (0.0005) [2023-03-09 06:51:20,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10035.2, 300 sec: 10038.7). Total num frames: 43524096. Throughput: 0: 10028.2. Samples: 43522924. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:51:20,122][635489] Avg episode reward: [(0, '4467.876')] [2023-03-09 06:51:21,437][635776] Updated weights for policy 0, policy_version 85040 (0.0005) [2023-03-09 06:51:25,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10038.7). Total num frames: 43573248. Throughput: 0: 10062.2. Samples: 43554768. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:51:25,122][635489] Avg episode reward: [(0, '4517.220')] [2023-03-09 06:51:25,522][635776] Updated weights for policy 0, policy_version 85120 (0.0004) [2023-03-09 06:51:29,337][635776] Updated weights for policy 0, policy_version 85200 (0.0005) [2023-03-09 06:51:30,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10171.7, 300 sec: 10066.4). Total num frames: 43630592. Throughput: 0: 10072.6. Samples: 43617416. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:51:30,122][635489] Avg episode reward: [(0, '4491.132')] [2023-03-09 06:51:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000085216_43630592.pth... [2023-03-09 06:51:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000084616_43323392.pth [2023-03-09 06:51:33,165][635776] Updated weights for policy 0, policy_version 85280 (0.0005) [2023-03-09 06:51:35,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10103.5, 300 sec: 10066.4). Total num frames: 43679744. Throughput: 0: 10099.9. Samples: 43679648. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:51:35,122][635489] Avg episode reward: [(0, '4573.777')] [2023-03-09 06:51:37,457][635776] Updated weights for policy 0, policy_version 85360 (0.0005) [2023-03-09 06:51:40,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10103.4, 300 sec: 10066.4). Total num frames: 43728896. Throughput: 0: 10051.1. Samples: 43708416. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:51:40,122][635489] Avg episode reward: [(0, '4532.978')] [2023-03-09 06:51:41,606][635776] Updated weights for policy 0, policy_version 85440 (0.0004) [2023-03-09 06:51:45,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 10080.3). Total num frames: 43782144. Throughput: 0: 10101.9. Samples: 43769580. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:51:45,122][635489] Avg episode reward: [(0, '4579.480')] [2023-03-09 06:51:45,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000085512_43782144.pth... [2023-03-09 06:51:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000084904_43470848.pth [2023-03-09 06:51:45,455][635776] Updated weights for policy 0, policy_version 85520 (0.0005) [2023-03-09 06:51:49,224][635776] Updated weights for policy 0, policy_version 85600 (0.0005) [2023-03-09 06:51:50,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10171.7, 300 sec: 10108.1). Total num frames: 43835392. Throughput: 0: 10225.0. Samples: 43833532. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:51:50,122][635489] Avg episode reward: [(0, '4012.263')] [2023-03-09 06:51:53,027][635776] Updated weights for policy 0, policy_version 85680 (0.0005) [2023-03-09 06:51:55,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10103.5, 300 sec: 10094.2). Total num frames: 43884544. Throughput: 0: 10326.0. Samples: 43868020. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:51:55,122][635489] Avg episode reward: [(0, '3852.704')] [2023-03-09 06:51:57,036][635776] Updated weights for policy 0, policy_version 85760 (0.0005) [2023-03-09 06:52:00,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10094.2). Total num frames: 43937792. Throughput: 0: 10370.3. Samples: 43929328. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:52:00,122][635489] Avg episode reward: [(0, '3844.791')] [2023-03-09 06:52:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000085816_43937792.pth... [2023-03-09 06:52:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000085216_43630592.pth [2023-03-09 06:52:01,047][635776] Updated weights for policy 0, policy_version 85840 (0.0005) [2023-03-09 06:52:04,811][635776] Updated weights for policy 0, policy_version 85920 (0.0006) [2023-03-09 06:52:05,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10308.3, 300 sec: 10108.1). Total num frames: 43991040. Throughput: 0: 10413.6. Samples: 43991536. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:52:05,122][635489] Avg episode reward: [(0, '3286.841')] [2023-03-09 06:52:08,775][635776] Updated weights for policy 0, policy_version 86000 (0.0005) [2023-03-09 06:52:10,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10122.0). Total num frames: 44044288. Throughput: 0: 10408.3. Samples: 44023140. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:52:10,122][635489] Avg episode reward: [(0, '3527.606')] [2023-03-09 06:52:12,834][635776] Updated weights for policy 0, policy_version 86080 (0.0005) [2023-03-09 06:52:15,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10108.1). Total num frames: 44093440. Throughput: 0: 10345.0. Samples: 44082940. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:52:15,122][635489] Avg episode reward: [(0, '3786.022')] [2023-03-09 06:52:15,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000086120_44093440.pth... [2023-03-09 06:52:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000085512_43782144.pth [2023-03-09 06:52:17,071][635776] Updated weights for policy 0, policy_version 86160 (0.0005) [2023-03-09 06:52:20,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10308.3, 300 sec: 10108.1). Total num frames: 44142592. Throughput: 0: 10289.3. Samples: 44142664. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:52:20,122][635489] Avg episode reward: [(0, '3282.846')] [2023-03-09 06:52:21,103][635776] Updated weights for policy 0, policy_version 86240 (0.0004) [2023-03-09 06:52:25,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10308.2, 300 sec: 10108.1). Total num frames: 44191744. Throughput: 0: 10322.8. Samples: 44172940. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:52:25,122][635489] Avg episode reward: [(0, '3156.242')] [2023-03-09 06:52:25,385][635776] Updated weights for policy 0, policy_version 86320 (0.0005) [2023-03-09 06:52:29,603][635776] Updated weights for policy 0, policy_version 86400 (0.0004) [2023-03-09 06:52:30,122][635489] Fps is (10 sec: 9830.2, 60 sec: 10171.7, 300 sec: 10094.2). Total num frames: 44240896. Throughput: 0: 10254.9. Samples: 44231052. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:52:30,122][635489] Avg episode reward: [(0, '3590.966')] [2023-03-09 06:52:30,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000086408_44240896.pth... [2023-03-09 06:52:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000085816_43937792.pth [2023-03-09 06:52:33,580][635776] Updated weights for policy 0, policy_version 86480 (0.0005) [2023-03-09 06:52:35,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10171.7, 300 sec: 10108.1). Total num frames: 44290048. Throughput: 0: 10151.8. Samples: 44290360. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:52:35,122][635489] Avg episode reward: [(0, '4233.069')] [2023-03-09 06:52:37,708][635776] Updated weights for policy 0, policy_version 86560 (0.0005) [2023-03-09 06:52:40,122][635489] Fps is (10 sec: 9830.5, 60 sec: 10171.7, 300 sec: 10108.1). Total num frames: 44339200. Throughput: 0: 10058.3. Samples: 44320644. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:52:40,122][635489] Avg episode reward: [(0, '4172.012')] [2023-03-09 06:52:41,925][635776] Updated weights for policy 0, policy_version 86640 (0.0005) [2023-03-09 06:52:45,122][635489] Fps is (10 sec: 9830.2, 60 sec: 10103.5, 300 sec: 10108.1). Total num frames: 44388352. Throughput: 0: 9990.9. Samples: 44378920. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:52:45,122][635489] Avg episode reward: [(0, '4034.413')] [2023-03-09 06:52:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000086696_44388352.pth... [2023-03-09 06:52:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000086120_44093440.pth [2023-03-09 06:52:46,089][635776] Updated weights for policy 0, policy_version 86720 (0.0005) [2023-03-09 06:52:50,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10094.2). Total num frames: 44437504. Throughput: 0: 9934.2. Samples: 44438576. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:52:50,122][635489] Avg episode reward: [(0, '4326.837')] [2023-03-09 06:52:50,227][635776] Updated weights for policy 0, policy_version 86800 (0.0004) [2023-03-09 06:52:54,445][635776] Updated weights for policy 0, policy_version 86880 (0.0005) [2023-03-09 06:52:55,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10094.2). Total num frames: 44486656. Throughput: 0: 9877.9. Samples: 44467644. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:52:55,122][635489] Avg episode reward: [(0, '4216.459')] [2023-03-09 06:52:58,373][635776] Updated weights for policy 0, policy_version 86960 (0.0004) [2023-03-09 06:53:00,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10094.2). Total num frames: 44539904. Throughput: 0: 9941.9. Samples: 44530324. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:53:00,122][635489] Avg episode reward: [(0, '4379.756')] [2023-03-09 06:53:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000086992_44539904.pth... [2023-03-09 06:53:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000086408_44240896.pth [2023-03-09 06:53:02,375][635776] Updated weights for policy 0, policy_version 87040 (0.0004) [2023-03-09 06:53:05,122][635489] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 10080.3). Total num frames: 44589056. Throughput: 0: 9921.4. Samples: 44589128. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:53:05,122][635489] Avg episode reward: [(0, '4368.295')] [2023-03-09 06:53:06,465][635776] Updated weights for policy 0, policy_version 87120 (0.0004) [2023-03-09 06:53:10,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10080.3). Total num frames: 44638208. Throughput: 0: 9944.5. Samples: 44620440. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:53:10,122][635489] Avg episode reward: [(0, '4566.852')] [2023-03-09 06:53:10,677][635776] Updated weights for policy 0, policy_version 87200 (0.0005) [2023-03-09 06:53:14,642][635776] Updated weights for policy 0, policy_version 87280 (0.0005) [2023-03-09 06:53:15,122][635489] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 10094.2). Total num frames: 44691456. Throughput: 0: 9959.7. Samples: 44679240. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:53:15,122][635489] Avg episode reward: [(0, '4494.600')] [2023-03-09 06:53:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000087288_44691456.pth... [2023-03-09 06:53:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000086696_44388352.pth [2023-03-09 06:53:18,560][635776] Updated weights for policy 0, policy_version 87360 (0.0005) [2023-03-09 06:53:20,122][635489] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 10094.2). Total num frames: 44740608. Throughput: 0: 10006.9. Samples: 44740672. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:53:20,122][635489] Avg episode reward: [(0, '4545.117')] [2023-03-09 06:53:22,560][635776] Updated weights for policy 0, policy_version 87440 (0.0004) [2023-03-09 06:53:25,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10108.1). Total num frames: 44793856. Throughput: 0: 10054.4. Samples: 44773092. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:53:25,122][635489] Avg episode reward: [(0, '4363.091')] [2023-03-09 06:53:26,765][635776] Updated weights for policy 0, policy_version 87520 (0.0004) [2023-03-09 06:53:30,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10035.2, 300 sec: 10108.1). Total num frames: 44843008. Throughput: 0: 10044.1. Samples: 44830904. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 06:53:30,122][635489] Avg episode reward: [(0, '4371.266')] [2023-03-09 06:53:30,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000087584_44843008.pth... [2023-03-09 06:53:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000086992_44539904.pth [2023-03-09 06:53:30,909][635776] Updated weights for policy 0, policy_version 87600 (0.0004) [2023-03-09 06:53:35,028][635776] Updated weights for policy 0, policy_version 87680 (0.0005) [2023-03-09 06:53:35,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10108.1). Total num frames: 44892160. Throughput: 0: 10054.3. Samples: 44891020. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 06:53:35,122][635489] Avg episode reward: [(0, '4554.253')] [2023-03-09 06:53:39,211][635776] Updated weights for policy 0, policy_version 87760 (0.0004) [2023-03-09 06:53:40,122][635489] Fps is (10 sec: 9830.5, 60 sec: 10035.2, 300 sec: 10108.1). Total num frames: 44941312. Throughput: 0: 10071.2. Samples: 44920848. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 06:53:40,122][635489] Avg episode reward: [(0, '4578.021')] [2023-03-09 06:53:43,169][635776] Updated weights for policy 0, policy_version 87840 (0.0005) [2023-03-09 06:53:45,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10108.1). Total num frames: 44990464. Throughput: 0: 10041.2. Samples: 44982176. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 06:53:45,122][635489] Avg episode reward: [(0, '4529.771')] [2023-03-09 06:53:45,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000087872_44990464.pth... [2023-03-09 06:53:45,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000087288_44691456.pth [2023-03-09 06:53:47,158][635776] Updated weights for policy 0, policy_version 87920 (0.0005) [2023-03-09 06:53:50,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10103.5, 300 sec: 10108.1). Total num frames: 45043712. Throughput: 0: 10113.4. Samples: 45044228. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 06:53:50,122][635489] Avg episode reward: [(0, '4176.608')] [2023-03-09 06:53:50,920][635776] Updated weights for policy 0, policy_version 88000 (0.0005) [2023-03-09 06:53:54,970][635776] Updated weights for policy 0, policy_version 88080 (0.0004) [2023-03-09 06:53:55,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10171.7, 300 sec: 10122.0). Total num frames: 45096960. Throughput: 0: 10103.5. Samples: 45075100. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 06:53:55,122][635489] Avg episode reward: [(0, '4576.024')] [2023-03-09 06:53:59,372][635776] Updated weights for policy 0, policy_version 88160 (0.0005) [2023-03-09 06:54:00,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10035.2, 300 sec: 10108.1). Total num frames: 45142016. Throughput: 0: 10102.1. Samples: 45133832. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 06:54:00,122][635489] Avg episode reward: [(0, '4464.276')] [2023-03-09 06:54:00,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000088168_45142016.pth... [2023-03-09 06:54:00,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000087584_44843008.pth [2023-03-09 06:54:03,412][635776] Updated weights for policy 0, policy_version 88240 (0.0005) [2023-03-09 06:54:05,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10122.0). Total num frames: 45195264. Throughput: 0: 10097.3. Samples: 45195052. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 06:54:05,122][635489] Avg episode reward: [(0, '4029.064')] [2023-03-09 06:54:07,478][635776] Updated weights for policy 0, policy_version 88320 (0.0005) [2023-03-09 06:54:10,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10103.5, 300 sec: 10122.0). Total num frames: 45244416. Throughput: 0: 10020.2. Samples: 45224000. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 06:54:10,122][635489] Avg episode reward: [(0, '4147.877')] [2023-03-09 06:54:11,579][635776] Updated weights for policy 0, policy_version 88400 (0.0005) [2023-03-09 06:54:15,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10035.2, 300 sec: 10122.0). Total num frames: 45293568. Throughput: 0: 10094.5. Samples: 45285156. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 06:54:15,122][635489] Avg episode reward: [(0, '4518.160')] [2023-03-09 06:54:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000088464_45293568.pth... [2023-03-09 06:54:15,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000087872_44990464.pth [2023-03-09 06:54:15,605][635776] Updated weights for policy 0, policy_version 88480 (0.0005) [2023-03-09 06:54:19,259][635776] Updated weights for policy 0, policy_version 88560 (0.0005) [2023-03-09 06:54:20,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10171.7, 300 sec: 10149.7). Total num frames: 45350912. Throughput: 0: 10205.8. Samples: 45350280. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 06:54:20,122][635489] Avg episode reward: [(0, '4416.525')] [2023-03-09 06:54:23,037][635776] Updated weights for policy 0, policy_version 88640 (0.0005) [2023-03-09 06:54:25,122][635489] Fps is (10 sec: 11059.1, 60 sec: 10171.7, 300 sec: 10149.7). Total num frames: 45404160. Throughput: 0: 10274.3. Samples: 45383192. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:54:25,133][635489] Avg episode reward: [(0, '4509.877')] [2023-03-09 06:54:27,124][635776] Updated weights for policy 0, policy_version 88720 (0.0005) [2023-03-09 06:54:30,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10171.8, 300 sec: 10149.8). Total num frames: 45453312. Throughput: 0: 10258.2. Samples: 45443796. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:54:30,132][635489] Avg episode reward: [(0, '4483.623')] [2023-03-09 06:54:30,135][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000088776_45453312.pth... [2023-03-09 06:54:30,136][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000088168_45142016.pth [2023-03-09 06:54:31,157][635776] Updated weights for policy 0, policy_version 88800 (0.0005) [2023-03-09 06:54:35,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10171.7, 300 sec: 10135.9). Total num frames: 45502464. Throughput: 0: 10212.0. Samples: 45503768. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:54:35,122][635489] Avg episode reward: [(0, '4463.520')] [2023-03-09 06:54:35,199][635776] Updated weights for policy 0, policy_version 88880 (0.0005) [2023-03-09 06:54:39,240][635776] Updated weights for policy 0, policy_version 88960 (0.0004) [2023-03-09 06:54:40,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10149.8). Total num frames: 45555712. Throughput: 0: 10214.6. Samples: 45534756. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:54:40,122][635489] Avg episode reward: [(0, '4496.396')] [2023-03-09 06:54:43,176][635776] Updated weights for policy 0, policy_version 89040 (0.0005) [2023-03-09 06:54:45,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10308.2, 300 sec: 10149.7). Total num frames: 45608960. Throughput: 0: 10284.6. Samples: 45596640. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:54:45,122][635489] Avg episode reward: [(0, '4543.802')] [2023-03-09 06:54:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000089080_45608960.pth... [2023-03-09 06:54:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000088464_45293568.pth [2023-03-09 06:54:47,085][635776] Updated weights for policy 0, policy_version 89120 (0.0005) [2023-03-09 06:54:50,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10135.9). Total num frames: 45654016. Throughput: 0: 10221.9. Samples: 45655036. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:54:50,122][635489] Avg episode reward: [(0, '4495.481')] [2023-03-09 06:54:51,393][635776] Updated weights for policy 0, policy_version 89200 (0.0004) [2023-03-09 06:54:55,121][635489] Fps is (10 sec: 9420.9, 60 sec: 10103.5, 300 sec: 10122.0). Total num frames: 45703168. Throughput: 0: 10233.3. Samples: 45684500. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:54:55,122][635489] Avg episode reward: [(0, '4410.190')] [2023-03-09 06:54:55,652][635776] Updated weights for policy 0, policy_version 89280 (0.0005) [2023-03-09 06:54:59,850][635776] Updated weights for policy 0, policy_version 89360 (0.0005) [2023-03-09 06:55:00,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10108.1). Total num frames: 45752320. Throughput: 0: 10197.6. Samples: 45744048. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:55:00,122][635489] Avg episode reward: [(0, '4230.424')] [2023-03-09 06:55:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000089360_45752320.pth... [2023-03-09 06:55:00,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000088776_45453312.pth [2023-03-09 06:55:03,938][635776] Updated weights for policy 0, policy_version 89440 (0.0005) [2023-03-09 06:55:05,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10094.2). Total num frames: 45801472. Throughput: 0: 10037.2. Samples: 45801952. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:55:05,122][635489] Avg episode reward: [(0, '4506.077')] [2023-03-09 06:55:07,906][635776] Updated weights for policy 0, policy_version 89520 (0.0005) [2023-03-09 06:55:10,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10108.1). Total num frames: 45854720. Throughput: 0: 10023.3. Samples: 45834240. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:55:10,122][635489] Avg episode reward: [(0, '4541.791')] [2023-03-09 06:55:11,961][635776] Updated weights for policy 0, policy_version 89600 (0.0005) [2023-03-09 06:55:15,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10108.1). Total num frames: 45903872. Throughput: 0: 10001.9. Samples: 45893880. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:55:15,122][635489] Avg episode reward: [(0, '4525.736')] [2023-03-09 06:55:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000089656_45903872.pth... [2023-03-09 06:55:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000089080_45608960.pth [2023-03-09 06:55:16,312][635776] Updated weights for policy 0, policy_version 89680 (0.0004) [2023-03-09 06:55:20,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10108.1). Total num frames: 45953024. Throughput: 0: 10027.1. Samples: 45954988. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:55:20,122][635489] Avg episode reward: [(0, '4205.465')] [2023-03-09 06:55:20,150][635776] Updated weights for policy 0, policy_version 89760 (0.0005) [2023-03-09 06:55:24,463][635776] Updated weights for policy 0, policy_version 89840 (0.0004) [2023-03-09 06:55:25,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 10108.1). Total num frames: 46002176. Throughput: 0: 9960.0. Samples: 45982956. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:55:25,122][635489] Avg episode reward: [(0, '4338.290')] [2023-03-09 06:55:28,497][635776] Updated weights for policy 0, policy_version 89920 (0.0005) [2023-03-09 06:55:30,122][635489] Fps is (10 sec: 10239.8, 60 sec: 10035.2, 300 sec: 10108.1). Total num frames: 46055424. Throughput: 0: 9921.6. Samples: 46043112. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:55:30,122][635489] Avg episode reward: [(0, '4305.416')] [2023-03-09 06:55:30,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000089952_46055424.pth... [2023-03-09 06:55:30,129][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000089360_45752320.pth [2023-03-09 06:55:32,638][635776] Updated weights for policy 0, policy_version 90000 (0.0005) [2023-03-09 06:55:35,121][635489] Fps is (10 sec: 9830.5, 60 sec: 9966.9, 300 sec: 10094.2). Total num frames: 46100480. Throughput: 0: 9934.9. Samples: 46102108. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:55:35,122][635489] Avg episode reward: [(0, '4056.943')] [2023-03-09 06:55:36,750][635776] Updated weights for policy 0, policy_version 90080 (0.0004) [2023-03-09 06:55:40,121][635489] Fps is (10 sec: 9830.6, 60 sec: 9966.9, 300 sec: 10094.2). Total num frames: 46153728. Throughput: 0: 9973.6. Samples: 46133312. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:55:40,122][635489] Avg episode reward: [(0, '4295.263')] [2023-03-09 06:55:40,579][635776] Updated weights for policy 0, policy_version 90160 (0.0004) [2023-03-09 06:55:44,502][635776] Updated weights for policy 0, policy_version 90240 (0.0005) [2023-03-09 06:55:45,122][635489] Fps is (10 sec: 10649.5, 60 sec: 9966.9, 300 sec: 10108.1). Total num frames: 46206976. Throughput: 0: 10061.3. Samples: 46196808. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:55:45,122][635489] Avg episode reward: [(0, '4161.329')] [2023-03-09 06:55:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000090248_46206976.pth... [2023-03-09 06:55:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000089656_45903872.pth [2023-03-09 06:55:48,517][635776] Updated weights for policy 0, policy_version 90320 (0.0005) [2023-03-09 06:55:50,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10103.5, 300 sec: 10108.1). Total num frames: 46260224. Throughput: 0: 10130.1. Samples: 46257808. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:55:50,122][635489] Avg episode reward: [(0, '4304.373')] [2023-03-09 06:55:52,333][635776] Updated weights for policy 0, policy_version 90400 (0.0005) [2023-03-09 06:55:55,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10103.5, 300 sec: 10122.0). Total num frames: 46309376. Throughput: 0: 10150.8. Samples: 46291028. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:55:55,122][635489] Avg episode reward: [(0, '4172.707')] [2023-03-09 06:55:56,489][635776] Updated weights for policy 0, policy_version 90480 (0.0004) [2023-03-09 06:56:00,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10135.9). Total num frames: 46362624. Throughput: 0: 10144.9. Samples: 46350400. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:56:00,122][635489] Avg episode reward: [(0, '4321.329')] [2023-03-09 06:56:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000090552_46362624.pth... [2023-03-09 06:56:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000089952_46055424.pth [2023-03-09 06:56:00,474][635776] Updated weights for policy 0, policy_version 90560 (0.0005) [2023-03-09 06:56:04,669][635776] Updated weights for policy 0, policy_version 90640 (0.0005) [2023-03-09 06:56:05,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 10135.9). Total num frames: 46411776. Throughput: 0: 10118.3. Samples: 46410312. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:56:05,122][635489] Avg episode reward: [(0, '4303.506')] [2023-03-09 06:56:08,608][635776] Updated weights for policy 0, policy_version 90720 (0.0005) [2023-03-09 06:56:10,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10149.7). Total num frames: 46465024. Throughput: 0: 10167.9. Samples: 46440512. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:56:10,122][635489] Avg episode reward: [(0, '4353.664')] [2023-03-09 06:56:12,277][635776] Updated weights for policy 0, policy_version 90800 (0.0005) [2023-03-09 06:56:15,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10240.0, 300 sec: 10149.7). Total num frames: 46518272. Throughput: 0: 10309.6. Samples: 46507044. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:56:15,122][635489] Avg episode reward: [(0, '4382.746')] [2023-03-09 06:56:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000090856_46518272.pth... [2023-03-09 06:56:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000090248_46206976.pth [2023-03-09 06:56:16,340][635776] Updated weights for policy 0, policy_version 90880 (0.0005) [2023-03-09 06:56:20,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10149.7). Total num frames: 46567424. Throughput: 0: 10310.2. Samples: 46566068. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:56:20,133][635489] Avg episode reward: [(0, '4281.493')] [2023-03-09 06:56:20,436][635776] Updated weights for policy 0, policy_version 90960 (0.0005) [2023-03-09 06:56:24,498][635776] Updated weights for policy 0, policy_version 91040 (0.0005) [2023-03-09 06:56:25,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 10122.0). Total num frames: 46616576. Throughput: 0: 10284.1. Samples: 46596096. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:56:25,122][635489] Avg episode reward: [(0, '4176.053')] [2023-03-09 06:56:28,612][635776] Updated weights for policy 0, policy_version 91120 (0.0005) [2023-03-09 06:56:30,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10171.8, 300 sec: 10122.0). Total num frames: 46665728. Throughput: 0: 10226.2. Samples: 46656988. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:56:30,122][635489] Avg episode reward: [(0, '4439.732')] [2023-03-09 06:56:30,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000091144_46665728.pth... [2023-03-09 06:56:30,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000090552_46362624.pth [2023-03-09 06:56:32,863][635776] Updated weights for policy 0, policy_version 91200 (0.0004) [2023-03-09 06:56:35,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10240.0, 300 sec: 10122.0). Total num frames: 46714880. Throughput: 0: 10158.6. Samples: 46714944. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:56:35,122][635489] Avg episode reward: [(0, '3584.734')] [2023-03-09 06:56:36,811][635776] Updated weights for policy 0, policy_version 91280 (0.0005) [2023-03-09 06:56:40,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10122.0). Total num frames: 46768128. Throughput: 0: 10140.4. Samples: 46747348. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:56:40,133][635489] Avg episode reward: [(0, '4428.827')] [2023-03-09 06:56:40,937][635776] Updated weights for policy 0, policy_version 91360 (0.0005) [2023-03-09 06:56:45,028][635776] Updated weights for policy 0, policy_version 91440 (0.0005) [2023-03-09 06:56:45,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 10108.1). Total num frames: 46817280. Throughput: 0: 10121.0. Samples: 46805844. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:56:45,133][635489] Avg episode reward: [(0, '4442.043')] [2023-03-09 06:56:45,136][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000091440_46817280.pth... [2023-03-09 06:56:45,138][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000090856_46518272.pth [2023-03-09 06:56:49,243][635776] Updated weights for policy 0, policy_version 91520 (0.0004) [2023-03-09 06:56:50,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10108.1). Total num frames: 46866432. Throughput: 0: 10105.4. Samples: 46865056. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:56:50,133][635489] Avg episode reward: [(0, '4556.964')] [2023-03-09 06:56:53,169][635776] Updated weights for policy 0, policy_version 91600 (0.0005) [2023-03-09 06:56:55,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10103.5, 300 sec: 10094.2). Total num frames: 46915584. Throughput: 0: 10108.9. Samples: 46895412. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:56:55,132][635489] Avg episode reward: [(0, '4165.709')] [2023-03-09 06:56:57,069][635776] Updated weights for policy 0, policy_version 91680 (0.0005) [2023-03-09 06:57:00,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10094.2). Total num frames: 46968832. Throughput: 0: 10046.8. Samples: 46959148. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:57:00,132][635489] Avg episode reward: [(0, '4313.777')] [2023-03-09 06:57:00,148][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000091744_46972928.pth... [2023-03-09 06:57:00,149][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000091144_46665728.pth [2023-03-09 06:57:00,902][635776] Updated weights for policy 0, policy_version 91760 (0.0005) [2023-03-09 06:57:04,598][635776] Updated weights for policy 0, policy_version 91840 (0.0005) [2023-03-09 06:57:05,122][635489] Fps is (10 sec: 11059.1, 60 sec: 10240.0, 300 sec: 10108.1). Total num frames: 47026176. Throughput: 0: 10224.8. Samples: 47026184. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:57:05,133][635489] Avg episode reward: [(0, '4559.069')] [2023-03-09 06:57:08,164][635776] Updated weights for policy 0, policy_version 91920 (0.0005) [2023-03-09 06:57:10,121][635489] Fps is (10 sec: 11059.3, 60 sec: 10240.0, 300 sec: 10122.0). Total num frames: 47079424. Throughput: 0: 10287.1. Samples: 47059016. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:57:10,132][635489] Avg episode reward: [(0, '4589.044')] [2023-03-09 06:57:12,121][635776] Updated weights for policy 0, policy_version 92000 (0.0005) [2023-03-09 06:57:15,121][635489] Fps is (10 sec: 10649.6, 60 sec: 10240.0, 300 sec: 10135.9). Total num frames: 47132672. Throughput: 0: 10365.0. Samples: 47123412. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:57:15,132][635489] Avg episode reward: [(0, '4517.475')] [2023-03-09 06:57:15,136][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000092056_47132672.pth... [2023-03-09 06:57:15,139][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000091440_46817280.pth [2023-03-09 06:57:16,069][635776] Updated weights for policy 0, policy_version 92080 (0.0005) [2023-03-09 06:57:20,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10135.9). Total num frames: 47181824. Throughput: 0: 10382.8. Samples: 47182172. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:57:20,132][635489] Avg episode reward: [(0, '4186.417')] [2023-03-09 06:57:20,264][635776] Updated weights for policy 0, policy_version 92160 (0.0005) [2023-03-09 06:57:24,412][635776] Updated weights for policy 0, policy_version 92240 (0.0005) [2023-03-09 06:57:25,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10240.0, 300 sec: 10135.9). Total num frames: 47230976. Throughput: 0: 10302.1. Samples: 47210940. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:57:25,132][635489] Avg episode reward: [(0, '4481.775')] [2023-03-09 06:57:28,443][635776] Updated weights for policy 0, policy_version 92320 (0.0004) [2023-03-09 06:57:30,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10308.2, 300 sec: 10149.7). Total num frames: 47284224. Throughput: 0: 10359.8. Samples: 47272036. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:57:30,133][635489] Avg episode reward: [(0, '4475.579')] [2023-03-09 06:57:30,136][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000092352_47284224.pth... [2023-03-09 06:57:30,139][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000091744_46972928.pth [2023-03-09 06:57:32,415][635776] Updated weights for policy 0, policy_version 92400 (0.0005) [2023-03-09 06:57:35,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10308.3, 300 sec: 10149.7). Total num frames: 47333376. Throughput: 0: 10449.2. Samples: 47335268. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:57:35,133][635489] Avg episode reward: [(0, '4541.816')] [2023-03-09 06:57:36,358][635776] Updated weights for policy 0, policy_version 92480 (0.0005) [2023-03-09 06:57:40,019][635776] Updated weights for policy 0, policy_version 92560 (0.0005) [2023-03-09 06:57:40,122][635489] Fps is (10 sec: 10649.7, 60 sec: 10376.5, 300 sec: 10177.5). Total num frames: 47390720. Throughput: 0: 10462.3. Samples: 47366216. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:57:40,122][635489] Avg episode reward: [(0, '4585.536')] [2023-03-09 06:57:43,810][635776] Updated weights for policy 0, policy_version 92640 (0.0005) [2023-03-09 06:57:45,122][635489] Fps is (10 sec: 11059.1, 60 sec: 10444.8, 300 sec: 10191.4). Total num frames: 47443968. Throughput: 0: 10513.0. Samples: 47432236. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:57:45,133][635489] Avg episode reward: [(0, '4571.777')] [2023-03-09 06:57:45,136][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000092664_47443968.pth... [2023-03-09 06:57:45,139][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000092056_47132672.pth [2023-03-09 06:57:47,683][635776] Updated weights for policy 0, policy_version 92720 (0.0005) [2023-03-09 06:57:50,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10205.3). Total num frames: 47497216. Throughput: 0: 10420.1. Samples: 47495088. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:57:50,133][635489] Avg episode reward: [(0, '4526.255')] [2023-03-09 06:57:51,734][635776] Updated weights for policy 0, policy_version 92800 (0.0005) [2023-03-09 06:57:55,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10513.1, 300 sec: 10191.4). Total num frames: 47546368. Throughput: 0: 10378.3. Samples: 47526040. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:57:55,122][635489] Avg episode reward: [(0, '4395.371')] [2023-03-09 06:57:55,522][635776] Updated weights for policy 0, policy_version 92880 (0.0005) [2023-03-09 06:57:59,620][635776] Updated weights for policy 0, policy_version 92960 (0.0005) [2023-03-09 06:58:00,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10513.0, 300 sec: 10205.3). Total num frames: 47599616. Throughput: 0: 10319.3. Samples: 47587780. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:58:00,133][635489] Avg episode reward: [(0, '4417.609')] [2023-03-09 06:58:00,136][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000092968_47599616.pth... [2023-03-09 06:58:00,139][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000092352_47284224.pth [2023-03-09 06:58:03,649][635776] Updated weights for policy 0, policy_version 93040 (0.0005) [2023-03-09 06:58:05,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10205.3). Total num frames: 47648768. Throughput: 0: 10381.0. Samples: 47649316. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:58:05,132][635489] Avg episode reward: [(0, '4290.747')] [2023-03-09 06:58:07,732][635776] Updated weights for policy 0, policy_version 93120 (0.0005) [2023-03-09 06:58:10,122][635489] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10205.3). Total num frames: 47702016. Throughput: 0: 10404.0. Samples: 47679120. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:58:10,123][635489] Avg episode reward: [(0, '4090.354')] [2023-03-09 06:58:11,645][635776] Updated weights for policy 0, policy_version 93200 (0.0005) [2023-03-09 06:58:15,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10205.3). Total num frames: 47751168. Throughput: 0: 10420.9. Samples: 47740976. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:58:15,132][635489] Avg episode reward: [(0, '4130.497')] [2023-03-09 06:58:15,135][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000093264_47751168.pth... [2023-03-09 06:58:15,138][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000092664_47443968.pth [2023-03-09 06:58:15,675][635776] Updated weights for policy 0, policy_version 93280 (0.0005) [2023-03-09 06:58:19,340][635776] Updated weights for policy 0, policy_version 93360 (0.0005) [2023-03-09 06:58:20,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10219.2). Total num frames: 47808512. Throughput: 0: 10463.6. Samples: 47806132. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:58:20,133][635489] Avg episode reward: [(0, '4313.494')] [2023-03-09 06:58:23,485][635776] Updated weights for policy 0, policy_version 93440 (0.0005) [2023-03-09 06:58:25,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10205.3). Total num frames: 47853568. Throughput: 0: 10454.4. Samples: 47836664. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:58:25,122][635489] Avg episode reward: [(0, '4176.937')] [2023-03-09 06:58:27,637][635776] Updated weights for policy 0, policy_version 93520 (0.0004) [2023-03-09 06:58:30,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10376.5, 300 sec: 10219.2). Total num frames: 47906816. Throughput: 0: 10272.1. Samples: 47894480. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:58:30,122][635489] Avg episode reward: [(0, '4115.291')] [2023-03-09 06:58:30,127][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000093568_47906816.pth... [2023-03-09 06:58:30,130][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000092968_47599616.pth [2023-03-09 06:58:31,611][635776] Updated weights for policy 0, policy_version 93600 (0.0004) [2023-03-09 06:58:35,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10233.1). Total num frames: 47960064. Throughput: 0: 10330.6. Samples: 47959964. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:58:35,122][635489] Avg episode reward: [(0, '4405.652')] [2023-03-09 06:58:35,334][635776] Updated weights for policy 0, policy_version 93680 (0.0005) [2023-03-09 06:58:39,391][635776] Updated weights for policy 0, policy_version 93760 (0.0005) [2023-03-09 06:58:40,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 10233.1). Total num frames: 48009216. Throughput: 0: 10337.4. Samples: 47991224. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:58:40,122][635489] Avg episode reward: [(0, '4088.647')] [2023-03-09 06:58:43,394][635776] Updated weights for policy 0, policy_version 93840 (0.0005) [2023-03-09 06:58:45,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10233.1). Total num frames: 48062464. Throughput: 0: 10298.6. Samples: 48051216. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:58:45,122][635489] Avg episode reward: [(0, '4301.942')] [2023-03-09 06:58:45,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000093872_48062464.pth... [2023-03-09 06:58:45,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000093264_47751168.pth [2023-03-09 06:58:47,313][635776] Updated weights for policy 0, policy_version 93920 (0.0005) [2023-03-09 06:58:50,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10219.2). Total num frames: 48111616. Throughput: 0: 10273.3. Samples: 48111616. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:58:50,122][635489] Avg episode reward: [(0, '4109.284')] [2023-03-09 06:58:51,469][635776] Updated weights for policy 0, policy_version 94000 (0.0004) [2023-03-09 06:58:55,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10246.9). Total num frames: 48164864. Throughput: 0: 10334.5. Samples: 48144172. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:58:55,122][635489] Avg episode reward: [(0, '4334.782')] [2023-03-09 06:58:55,401][635776] Updated weights for policy 0, policy_version 94080 (0.0004) [2023-03-09 06:58:59,773][635776] Updated weights for policy 0, policy_version 94160 (0.0004) [2023-03-09 06:59:00,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10171.8, 300 sec: 10219.2). Total num frames: 48209920. Throughput: 0: 10240.4. Samples: 48201792. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:59:00,122][635489] Avg episode reward: [(0, '4414.197')] [2023-03-09 06:59:00,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000094160_48209920.pth... [2023-03-09 06:59:00,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000093568_47906816.pth [2023-03-09 06:59:03,839][635776] Updated weights for policy 0, policy_version 94240 (0.0005) [2023-03-09 06:59:05,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 10233.1). Total num frames: 48263168. Throughput: 0: 10115.8. Samples: 48261344. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:59:05,122][635489] Avg episode reward: [(0, '4402.933')] [2023-03-09 06:59:07,901][635776] Updated weights for policy 0, policy_version 94320 (0.0005) [2023-03-09 06:59:10,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 10233.1). Total num frames: 48312320. Throughput: 0: 10115.0. Samples: 48291840. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:59:10,122][635489] Avg episode reward: [(0, '4260.680')] [2023-03-09 06:59:11,960][635776] Updated weights for policy 0, policy_version 94400 (0.0005) [2023-03-09 06:59:15,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10205.3). Total num frames: 48361472. Throughput: 0: 10144.4. Samples: 48350980. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:59:15,122][635489] Avg episode reward: [(0, '4463.558')] [2023-03-09 06:59:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000094456_48361472.pth... [2023-03-09 06:59:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000093872_48062464.pth [2023-03-09 06:59:16,214][635776] Updated weights for policy 0, policy_version 94480 (0.0005) [2023-03-09 06:59:20,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10035.2, 300 sec: 10191.4). Total num frames: 48410624. Throughput: 0: 10041.8. Samples: 48411844. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:59:20,122][635489] Avg episode reward: [(0, '4486.230')] [2023-03-09 06:59:20,123][635776] Updated weights for policy 0, policy_version 94560 (0.0005) [2023-03-09 06:59:24,103][635776] Updated weights for policy 0, policy_version 94640 (0.0005) [2023-03-09 06:59:25,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10171.7, 300 sec: 10205.3). Total num frames: 48463872. Throughput: 0: 10057.3. Samples: 48443804. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:59:25,122][635489] Avg episode reward: [(0, '4533.568')] [2023-03-09 06:59:27,943][635776] Updated weights for policy 0, policy_version 94720 (0.0005) [2023-03-09 06:59:30,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10171.7, 300 sec: 10219.2). Total num frames: 48517120. Throughput: 0: 10118.3. Samples: 48506540. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:59:30,122][635489] Avg episode reward: [(0, '4509.107')] [2023-03-09 06:59:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000094760_48517120.pth... [2023-03-09 06:59:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000094160_48209920.pth [2023-03-09 06:59:32,200][635776] Updated weights for policy 0, policy_version 94800 (0.0004) [2023-03-09 06:59:35,122][635489] Fps is (10 sec: 10239.8, 60 sec: 10103.5, 300 sec: 10205.3). Total num frames: 48566272. Throughput: 0: 10103.8. Samples: 48566288. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:59:35,122][635489] Avg episode reward: [(0, '4524.468')] [2023-03-09 06:59:36,118][635776] Updated weights for policy 0, policy_version 94880 (0.0005) [2023-03-09 06:59:40,096][635776] Updated weights for policy 0, policy_version 94960 (0.0005) [2023-03-09 06:59:40,121][635489] Fps is (10 sec: 10240.2, 60 sec: 10171.7, 300 sec: 10205.3). Total num frames: 48619520. Throughput: 0: 10034.8. Samples: 48595736. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:59:40,122][635489] Avg episode reward: [(0, '4560.072')] [2023-03-09 06:59:44,274][635776] Updated weights for policy 0, policy_version 95040 (0.0004) [2023-03-09 06:59:45,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10219.2). Total num frames: 48668672. Throughput: 0: 10102.2. Samples: 48656392. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:59:45,122][635489] Avg episode reward: [(0, '4480.879')] [2023-03-09 06:59:45,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000095056_48668672.pth... [2023-03-09 06:59:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000094456_48361472.pth [2023-03-09 06:59:48,347][635776] Updated weights for policy 0, policy_version 95120 (0.0005) [2023-03-09 06:59:50,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10103.5, 300 sec: 10219.2). Total num frames: 48717824. Throughput: 0: 10141.4. Samples: 48717708. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:59:50,122][635489] Avg episode reward: [(0, '4464.509')] [2023-03-09 06:59:52,344][635776] Updated weights for policy 0, policy_version 95200 (0.0005) [2023-03-09 06:59:55,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10233.1). Total num frames: 48771072. Throughput: 0: 10162.2. Samples: 48749140. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 06:59:55,122][635489] Avg episode reward: [(0, '4406.952')] [2023-03-09 06:59:56,190][635776] Updated weights for policy 0, policy_version 95280 (0.0004) [2023-03-09 07:00:00,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10233.1). Total num frames: 48820224. Throughput: 0: 10210.4. Samples: 48810448. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 07:00:00,122][635489] Avg episode reward: [(0, '4452.464')] [2023-03-09 07:00:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000095352_48820224.pth... [2023-03-09 07:00:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000094760_48517120.pth [2023-03-09 07:00:00,324][635776] Updated weights for policy 0, policy_version 95360 (0.0005) [2023-03-09 07:00:04,336][635776] Updated weights for policy 0, policy_version 95440 (0.0006) [2023-03-09 07:00:05,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10233.1). Total num frames: 48873472. Throughput: 0: 10208.5. Samples: 48871228. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 07:00:05,122][635489] Avg episode reward: [(0, '4502.433')] [2023-03-09 07:00:08,261][635776] Updated weights for policy 0, policy_version 95520 (0.0005) [2023-03-09 07:00:10,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10233.1). Total num frames: 48922624. Throughput: 0: 10191.6. Samples: 48902428. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 07:00:10,122][635489] Avg episode reward: [(0, '4433.595')] [2023-03-09 07:00:12,081][635776] Updated weights for policy 0, policy_version 95600 (0.0006) [2023-03-09 07:00:15,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10246.9). Total num frames: 48975872. Throughput: 0: 10172.5. Samples: 48964304. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 07:00:15,122][635489] Avg episode reward: [(0, '4416.784')] [2023-03-09 07:00:15,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000095656_48975872.pth... [2023-03-09 07:00:15,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000095056_48668672.pth [2023-03-09 07:00:16,075][635776] Updated weights for policy 0, policy_version 95680 (0.0004) [2023-03-09 07:00:19,835][635776] Updated weights for policy 0, policy_version 95760 (0.0005) [2023-03-09 07:00:20,121][635489] Fps is (10 sec: 10649.7, 60 sec: 10308.3, 300 sec: 10260.8). Total num frames: 49029120. Throughput: 0: 10289.1. Samples: 49029296. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 07:00:20,122][635489] Avg episode reward: [(0, '4279.375')] [2023-03-09 07:00:23,296][635776] Updated weights for policy 0, policy_version 95840 (0.0005) [2023-03-09 07:00:25,122][635489] Fps is (10 sec: 11059.2, 60 sec: 10376.5, 300 sec: 10274.7). Total num frames: 49086464. Throughput: 0: 10434.5. Samples: 49065288. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 07:00:25,122][635489] Avg episode reward: [(0, '4432.735')] [2023-03-09 07:00:27,338][635776] Updated weights for policy 0, policy_version 95920 (0.0005) [2023-03-09 07:00:30,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10308.3, 300 sec: 10288.6). Total num frames: 49135616. Throughput: 0: 10422.1. Samples: 49125388. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 07:00:30,122][635489] Avg episode reward: [(0, '4241.569')] [2023-03-09 07:00:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000095968_49135616.pth... [2023-03-09 07:00:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000095352_48820224.pth [2023-03-09 07:00:31,681][635776] Updated weights for policy 0, policy_version 96000 (0.0004) [2023-03-09 07:00:35,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10308.3, 300 sec: 10274.7). Total num frames: 49184768. Throughput: 0: 10349.9. Samples: 49183456. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 07:00:35,122][635489] Avg episode reward: [(0, '4541.017')] [2023-03-09 07:00:35,989][635776] Updated weights for policy 0, policy_version 96080 (0.0004) [2023-03-09 07:00:39,946][635776] Updated weights for policy 0, policy_version 96160 (0.0005) [2023-03-09 07:00:40,122][635489] Fps is (10 sec: 9830.5, 60 sec: 10240.0, 300 sec: 10260.8). Total num frames: 49233920. Throughput: 0: 10282.7. Samples: 49211860. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 07:00:40,122][635489] Avg episode reward: [(0, '4503.878')] [2023-03-09 07:00:43,754][635776] Updated weights for policy 0, policy_version 96240 (0.0005) [2023-03-09 07:00:45,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 10260.8). Total num frames: 49287168. Throughput: 0: 10344.9. Samples: 49275968. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 07:00:45,122][635489] Avg episode reward: [(0, '4564.455')] [2023-03-09 07:00:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000096264_49287168.pth... [2023-03-09 07:00:45,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000095656_48975872.pth [2023-03-09 07:00:47,799][635776] Updated weights for policy 0, policy_version 96320 (0.0004) [2023-03-09 07:00:50,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10376.5, 300 sec: 10274.7). Total num frames: 49340416. Throughput: 0: 10408.8. Samples: 49339624. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 07:00:50,122][635489] Avg episode reward: [(0, '4560.798')] [2023-03-09 07:00:51,766][635776] Updated weights for policy 0, policy_version 96400 (0.0005) [2023-03-09 07:00:55,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 10246.9). Total num frames: 49385472. Throughput: 0: 10368.7. Samples: 49369020. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 07:00:55,122][635489] Avg episode reward: [(0, '4438.064')] [2023-03-09 07:00:56,074][635776] Updated weights for policy 0, policy_version 96480 (0.0005) [2023-03-09 07:00:59,887][635776] Updated weights for policy 0, policy_version 96560 (0.0005) [2023-03-09 07:01:00,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10308.3, 300 sec: 10260.8). Total num frames: 49438720. Throughput: 0: 10292.1. Samples: 49427448. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 07:01:00,122][635489] Avg episode reward: [(0, '4386.044')] [2023-03-09 07:01:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000096560_49438720.pth... [2023-03-09 07:01:00,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000095968_49135616.pth [2023-03-09 07:01:03,911][635776] Updated weights for policy 0, policy_version 96640 (0.0005) [2023-03-09 07:01:05,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10246.9). Total num frames: 49487872. Throughput: 0: 10230.0. Samples: 49489648. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 07:01:05,122][635489] Avg episode reward: [(0, '4533.911')] [2023-03-09 07:01:07,927][635776] Updated weights for policy 0, policy_version 96720 (0.0005) [2023-03-09 07:01:10,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10376.5, 300 sec: 10260.8). Total num frames: 49545216. Throughput: 0: 10118.0. Samples: 49520600. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 07:01:10,122][635489] Avg episode reward: [(0, '4197.599')] [2023-03-09 07:01:11,603][635776] Updated weights for policy 0, policy_version 96800 (0.0005) [2023-03-09 07:01:15,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10308.3, 300 sec: 10260.8). Total num frames: 49594368. Throughput: 0: 10189.4. Samples: 49583912. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 07:01:15,122][635489] Avg episode reward: [(0, '4152.782')] [2023-03-09 07:01:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000096864_49594368.pth... [2023-03-09 07:01:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000096264_49287168.pth [2023-03-09 07:01:15,649][635776] Updated weights for policy 0, policy_version 96880 (0.0005) [2023-03-09 07:01:19,798][635776] Updated weights for policy 0, policy_version 96960 (0.0005) [2023-03-09 07:01:20,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 10260.8). Total num frames: 49643520. Throughput: 0: 10238.5. Samples: 49644188. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 07:01:20,122][635489] Avg episode reward: [(0, '4090.561')] [2023-03-09 07:01:23,711][635776] Updated weights for policy 0, policy_version 97040 (0.0005) [2023-03-09 07:01:25,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10274.7). Total num frames: 49696768. Throughput: 0: 10320.4. Samples: 49676280. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 07:01:25,122][635489] Avg episode reward: [(0, '4465.779')] [2023-03-09 07:01:27,821][635776] Updated weights for policy 0, policy_version 97120 (0.0005) [2023-03-09 07:01:30,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10274.7). Total num frames: 49745920. Throughput: 0: 10227.9. Samples: 49736224. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 07:01:30,122][635489] Avg episode reward: [(0, '4384.165')] [2023-03-09 07:01:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000097160_49745920.pth... [2023-03-09 07:01:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000096560_49438720.pth [2023-03-09 07:01:31,746][635776] Updated weights for policy 0, policy_version 97200 (0.0005) [2023-03-09 07:01:35,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10274.7). Total num frames: 49799168. Throughput: 0: 10208.5. Samples: 49799008. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 07:01:35,122][635489] Avg episode reward: [(0, '4291.556')] [2023-03-09 07:01:35,755][635776] Updated weights for policy 0, policy_version 97280 (0.0006) [2023-03-09 07:01:39,417][635776] Updated weights for policy 0, policy_version 97360 (0.0005) [2023-03-09 07:01:40,122][635489] Fps is (10 sec: 11059.2, 60 sec: 10376.5, 300 sec: 10302.5). Total num frames: 49856512. Throughput: 0: 10255.4. Samples: 49830512. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 07:01:40,122][635489] Avg episode reward: [(0, '4247.487')] [2023-03-09 07:01:43,172][635776] Updated weights for policy 0, policy_version 97440 (0.0005) [2023-03-09 07:01:45,121][635489] Fps is (10 sec: 10649.7, 60 sec: 10308.3, 300 sec: 10302.5). Total num frames: 49905664. Throughput: 0: 10421.6. Samples: 49896420. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 07:01:45,122][635489] Avg episode reward: [(0, '4450.151')] [2023-03-09 07:01:45,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000097472_49905664.pth... [2023-03-09 07:01:45,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000096864_49594368.pth [2023-03-09 07:01:47,143][635776] Updated weights for policy 0, policy_version 97520 (0.0005) [2023-03-09 07:01:50,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10316.4). Total num frames: 49958912. Throughput: 0: 10410.3. Samples: 49958112. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 07:01:50,122][635489] Avg episode reward: [(0, '4401.316')] [2023-03-09 07:01:51,200][635776] Updated weights for policy 0, policy_version 97600 (0.0006) [2023-03-09 07:01:55,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10302.5). Total num frames: 50008064. Throughput: 0: 10379.0. Samples: 49987656. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:01:55,122][635489] Avg episode reward: [(0, '4456.182')] [2023-03-09 07:01:55,325][635776] Updated weights for policy 0, policy_version 97680 (0.0006) [2023-03-09 07:01:59,152][635776] Updated weights for policy 0, policy_version 97760 (0.0005) [2023-03-09 07:02:00,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 10288.6). Total num frames: 50061312. Throughput: 0: 10348.6. Samples: 50049600. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:02:00,122][635489] Avg episode reward: [(0, '4474.038')] [2023-03-09 07:02:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000097776_50061312.pth... [2023-03-09 07:02:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000097160_49745920.pth [2023-03-09 07:02:03,330][635776] Updated weights for policy 0, policy_version 97840 (0.0005) [2023-03-09 07:02:05,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10274.7). Total num frames: 50110464. Throughput: 0: 10358.9. Samples: 50110340. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:02:05,122][635489] Avg episode reward: [(0, '4432.474')] [2023-03-09 07:02:07,255][635776] Updated weights for policy 0, policy_version 97920 (0.0005) [2023-03-09 07:02:10,122][635489] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 10274.7). Total num frames: 50163712. Throughput: 0: 10337.7. Samples: 50141476. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:02:10,122][635489] Avg episode reward: [(0, '4567.096')] [2023-03-09 07:02:11,137][635776] Updated weights for policy 0, policy_version 98000 (0.0005) [2023-03-09 07:02:15,004][635776] Updated weights for policy 0, policy_version 98080 (0.0005) [2023-03-09 07:02:15,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10376.5, 300 sec: 10288.6). Total num frames: 50216960. Throughput: 0: 10405.1. Samples: 50204452. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:02:15,122][635489] Avg episode reward: [(0, '4553.312')] [2023-03-09 07:02:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000098080_50216960.pth... [2023-03-09 07:02:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000097472_49905664.pth [2023-03-09 07:02:18,922][635776] Updated weights for policy 0, policy_version 98160 (0.0005) [2023-03-09 07:02:20,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10288.6). Total num frames: 50266112. Throughput: 0: 10407.1. Samples: 50267328. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:02:20,122][635489] Avg episode reward: [(0, '4477.081')] [2023-03-09 07:02:23,163][635776] Updated weights for policy 0, policy_version 98240 (0.0006) [2023-03-09 07:02:25,122][635489] Fps is (10 sec: 9830.5, 60 sec: 10308.3, 300 sec: 10274.7). Total num frames: 50315264. Throughput: 0: 10346.1. Samples: 50296088. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:02:25,122][635489] Avg episode reward: [(0, '4531.992')] [2023-03-09 07:02:27,300][635776] Updated weights for policy 0, policy_version 98320 (0.0005) [2023-03-09 07:02:30,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 10288.6). Total num frames: 50368512. Throughput: 0: 10219.3. Samples: 50356288. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:02:30,122][635489] Avg episode reward: [(0, '4565.585')] [2023-03-09 07:02:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000098376_50368512.pth... [2023-03-09 07:02:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000097776_50061312.pth [2023-03-09 07:02:31,432][635776] Updated weights for policy 0, policy_version 98400 (0.0005) [2023-03-09 07:02:35,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10308.3, 300 sec: 10260.8). Total num frames: 50417664. Throughput: 0: 10200.4. Samples: 50417132. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:02:35,122][635489] Avg episode reward: [(0, '4333.052')] [2023-03-09 07:02:35,478][635776] Updated weights for policy 0, policy_version 98480 (0.0005) [2023-03-09 07:02:39,392][635776] Updated weights for policy 0, policy_version 98560 (0.0005) [2023-03-09 07:02:40,122][635489] Fps is (10 sec: 9830.5, 60 sec: 10171.7, 300 sec: 10246.9). Total num frames: 50466816. Throughput: 0: 10223.1. Samples: 50447696. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:02:40,122][635489] Avg episode reward: [(0, '4005.368')] [2023-03-09 07:02:43,546][635776] Updated weights for policy 0, policy_version 98640 (0.0005) [2023-03-09 07:02:45,122][635489] Fps is (10 sec: 9830.5, 60 sec: 10171.7, 300 sec: 10233.1). Total num frames: 50515968. Throughput: 0: 10177.0. Samples: 50507564. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:02:45,122][635489] Avg episode reward: [(0, '4489.579')] [2023-03-09 07:02:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000098664_50515968.pth... [2023-03-09 07:02:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000098080_50216960.pth [2023-03-09 07:02:47,647][635776] Updated weights for policy 0, policy_version 98720 (0.0004) [2023-03-09 07:02:50,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10171.7, 300 sec: 10246.9). Total num frames: 50569216. Throughput: 0: 10208.3. Samples: 50569712. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:02:50,122][635489] Avg episode reward: [(0, '4402.702')] [2023-03-09 07:02:51,522][635776] Updated weights for policy 0, policy_version 98800 (0.0005) [2023-03-09 07:02:55,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10171.7, 300 sec: 10233.1). Total num frames: 50618368. Throughput: 0: 10183.2. Samples: 50599720. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:02:55,122][635489] Avg episode reward: [(0, '4142.030')] [2023-03-09 07:02:55,595][635776] Updated weights for policy 0, policy_version 98880 (0.0005) [2023-03-09 07:02:59,803][635776] Updated weights for policy 0, policy_version 98960 (0.0005) [2023-03-09 07:03:00,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10233.1). Total num frames: 50667520. Throughput: 0: 10108.1. Samples: 50659316. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:03:00,122][635489] Avg episode reward: [(0, '4181.712')] [2023-03-09 07:03:00,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000098960_50667520.pth... [2023-03-09 07:03:00,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000098376_50368512.pth [2023-03-09 07:03:03,762][635776] Updated weights for policy 0, policy_version 99040 (0.0006) [2023-03-09 07:03:05,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10233.1). Total num frames: 50720768. Throughput: 0: 10075.6. Samples: 50720732. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:03:05,122][635489] Avg episode reward: [(0, '3919.776')] [2023-03-09 07:03:07,969][635776] Updated weights for policy 0, policy_version 99120 (0.0004) [2023-03-09 07:03:10,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 10233.1). Total num frames: 50769920. Throughput: 0: 10070.6. Samples: 50749264. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:03:10,122][635489] Avg episode reward: [(0, '4050.057')] [2023-03-09 07:03:11,969][635776] Updated weights for policy 0, policy_version 99200 (0.0004) [2023-03-09 07:03:15,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10035.2, 300 sec: 10205.3). Total num frames: 50819072. Throughput: 0: 10051.7. Samples: 50808616. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:03:15,122][635489] Avg episode reward: [(0, '4215.607')] [2023-03-09 07:03:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000099256_50819072.pth... [2023-03-09 07:03:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000098664_50515968.pth [2023-03-09 07:03:16,193][635776] Updated weights for policy 0, policy_version 99280 (0.0004) [2023-03-09 07:03:20,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10219.2). Total num frames: 50868224. Throughput: 0: 10008.0. Samples: 50867492. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:03:20,122][635489] Avg episode reward: [(0, '4024.986')] [2023-03-09 07:03:20,443][635776] Updated weights for policy 0, policy_version 99360 (0.0004) [2023-03-09 07:03:24,518][635776] Updated weights for policy 0, policy_version 99440 (0.0005) [2023-03-09 07:03:25,122][635489] Fps is (10 sec: 9830.5, 60 sec: 10035.2, 300 sec: 10205.3). Total num frames: 50917376. Throughput: 0: 9982.1. Samples: 50896892. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:03:25,122][635489] Avg episode reward: [(0, '4056.871')] [2023-03-09 07:03:28,453][635776] Updated weights for policy 0, policy_version 99520 (0.0004) [2023-03-09 07:03:30,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10205.3). Total num frames: 50970624. Throughput: 0: 10018.7. Samples: 50958408. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:03:30,122][635489] Avg episode reward: [(0, '4105.006')] [2023-03-09 07:03:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000099552_50970624.pth... [2023-03-09 07:03:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000098960_50667520.pth [2023-03-09 07:03:32,274][635776] Updated weights for policy 0, policy_version 99600 (0.0005) [2023-03-09 07:03:35,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10035.2, 300 sec: 10205.3). Total num frames: 51019776. Throughput: 0: 10064.4. Samples: 51022612. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:03:35,122][635489] Avg episode reward: [(0, '4010.524')] [2023-03-09 07:03:36,259][635776] Updated weights for policy 0, policy_version 99680 (0.0005) [2023-03-09 07:03:39,992][635776] Updated weights for policy 0, policy_version 99760 (0.0005) [2023-03-09 07:03:40,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10171.7, 300 sec: 10219.2). Total num frames: 51077120. Throughput: 0: 10134.8. Samples: 51055788. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:03:40,122][635489] Avg episode reward: [(0, '4050.365')] [2023-03-09 07:03:44,065][635776] Updated weights for policy 0, policy_version 99840 (0.0005) [2023-03-09 07:03:45,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10171.7, 300 sec: 10219.2). Total num frames: 51126272. Throughput: 0: 10178.8. Samples: 51117364. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:03:45,122][635489] Avg episode reward: [(0, '3977.135')] [2023-03-09 07:03:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000099856_51126272.pth... [2023-03-09 07:03:45,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000099256_50819072.pth [2023-03-09 07:03:48,086][635776] Updated weights for policy 0, policy_version 99920 (0.0005) [2023-03-09 07:03:50,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10103.5, 300 sec: 10205.3). Total num frames: 51175424. Throughput: 0: 10135.5. Samples: 51176828. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:03:50,122][635489] Avg episode reward: [(0, '4375.980')] [2023-03-09 07:03:52,014][635776] Updated weights for policy 0, policy_version 100000 (0.0005) [2023-03-09 07:03:55,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10171.7, 300 sec: 10233.1). Total num frames: 51228672. Throughput: 0: 10201.8. Samples: 51208344. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:03:55,122][635489] Avg episode reward: [(0, '4107.553')] [2023-03-09 07:03:56,068][635776] Updated weights for policy 0, policy_version 100080 (0.0005) [2023-03-09 07:04:00,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 10219.2). Total num frames: 51277824. Throughput: 0: 10234.1. Samples: 51269148. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:04:00,122][635489] Avg episode reward: [(0, '4145.727')] [2023-03-09 07:04:00,159][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000100160_51281920.pth... [2023-03-09 07:04:00,160][635776] Updated weights for policy 0, policy_version 100160 (0.0005) [2023-03-09 07:04:00,161][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000099552_50970624.pth [2023-03-09 07:04:04,153][635776] Updated weights for policy 0, policy_version 100240 (0.0005) [2023-03-09 07:04:05,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10233.1). Total num frames: 51331072. Throughput: 0: 10283.0. Samples: 51330228. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:04:05,122][635489] Avg episode reward: [(0, '3958.703')] [2023-03-09 07:04:08,274][635776] Updated weights for policy 0, policy_version 100320 (0.0005) [2023-03-09 07:04:10,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10233.1). Total num frames: 51380224. Throughput: 0: 10287.0. Samples: 51359808. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:04:10,122][635489] Avg episode reward: [(0, '4215.121')] [2023-03-09 07:04:12,512][635776] Updated weights for policy 0, policy_version 100400 (0.0006) [2023-03-09 07:04:15,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10233.1). Total num frames: 51429376. Throughput: 0: 10224.4. Samples: 51418508. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:04:15,122][635489] Avg episode reward: [(0, '4157.568')] [2023-03-09 07:04:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000100448_51429376.pth... [2023-03-09 07:04:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000099856_51126272.pth [2023-03-09 07:04:16,636][635776] Updated weights for policy 0, policy_version 100480 (0.0005) [2023-03-09 07:04:20,122][635489] Fps is (10 sec: 9420.8, 60 sec: 10103.5, 300 sec: 10205.3). Total num frames: 51474432. Throughput: 0: 10050.0. Samples: 51474864. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:04:20,122][635489] Avg episode reward: [(0, '4036.081')] [2023-03-09 07:04:21,138][635776] Updated weights for policy 0, policy_version 100560 (0.0004) [2023-03-09 07:04:25,121][635489] Fps is (10 sec: 9420.8, 60 sec: 10103.5, 300 sec: 10191.4). Total num frames: 51523584. Throughput: 0: 9981.0. Samples: 51504932. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:04:25,122][635489] Avg episode reward: [(0, '4225.576')] [2023-03-09 07:04:25,144][635776] Updated weights for policy 0, policy_version 100640 (0.0005) [2023-03-09 07:04:29,022][635776] Updated weights for policy 0, policy_version 100720 (0.0005) [2023-03-09 07:04:30,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10171.7, 300 sec: 10219.2). Total num frames: 51580928. Throughput: 0: 9998.0. Samples: 51567276. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:04:30,122][635489] Avg episode reward: [(0, '4286.354')] [2023-03-09 07:04:30,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000100744_51580928.pth... [2023-03-09 07:04:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000100160_51281920.pth [2023-03-09 07:04:32,764][635776] Updated weights for policy 0, policy_version 100800 (0.0005) [2023-03-09 07:04:35,121][635489] Fps is (10 sec: 10649.6, 60 sec: 10171.7, 300 sec: 10205.3). Total num frames: 51630080. Throughput: 0: 10089.4. Samples: 51630852. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:04:35,122][635489] Avg episode reward: [(0, '4271.514')] [2023-03-09 07:04:36,644][635776] Updated weights for policy 0, policy_version 100880 (0.0005) [2023-03-09 07:04:40,121][635489] Fps is (10 sec: 10240.2, 60 sec: 10103.5, 300 sec: 10219.2). Total num frames: 51683328. Throughput: 0: 10105.8. Samples: 51663104. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:04:40,122][635489] Avg episode reward: [(0, '3958.259')] [2023-03-09 07:04:40,981][635776] Updated weights for policy 0, policy_version 100960 (0.0005) [2023-03-09 07:04:44,989][635776] Updated weights for policy 0, policy_version 101040 (0.0005) [2023-03-09 07:04:45,122][635489] Fps is (10 sec: 10239.8, 60 sec: 10103.4, 300 sec: 10219.2). Total num frames: 51732480. Throughput: 0: 10045.1. Samples: 51721180. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:04:45,122][635489] Avg episode reward: [(0, '3714.838')] [2023-03-09 07:04:45,127][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000101040_51732480.pth... [2023-03-09 07:04:45,130][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000100448_51429376.pth [2023-03-09 07:04:49,054][635776] Updated weights for policy 0, policy_version 101120 (0.0004) [2023-03-09 07:04:50,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10103.5, 300 sec: 10205.3). Total num frames: 51781632. Throughput: 0: 10032.6. Samples: 51781696. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:04:50,122][635489] Avg episode reward: [(0, '3741.988')] [2023-03-09 07:04:53,178][635776] Updated weights for policy 0, policy_version 101200 (0.0005) [2023-03-09 07:04:55,122][635489] Fps is (10 sec: 9830.6, 60 sec: 10035.2, 300 sec: 10205.3). Total num frames: 51830784. Throughput: 0: 10039.4. Samples: 51811580. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:04:55,122][635489] Avg episode reward: [(0, '4188.080')] [2023-03-09 07:04:57,190][635776] Updated weights for policy 0, policy_version 101280 (0.0005) [2023-03-09 07:05:00,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 10205.3). Total num frames: 51884032. Throughput: 0: 10141.2. Samples: 51874864. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:05:00,122][635489] Avg episode reward: [(0, '4498.648')] [2023-03-09 07:05:00,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000101336_51884032.pth... [2023-03-09 07:05:00,129][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000100744_51580928.pth [2023-03-09 07:05:01,114][635776] Updated weights for policy 0, policy_version 101360 (0.0005) [2023-03-09 07:05:05,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10035.2, 300 sec: 10205.3). Total num frames: 51933184. Throughput: 0: 10198.2. Samples: 51933784. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:05:05,122][635489] Avg episode reward: [(0, '4193.202')] [2023-03-09 07:05:05,207][635776] Updated weights for policy 0, policy_version 101440 (0.0005) [2023-03-09 07:05:09,165][635776] Updated weights for policy 0, policy_version 101520 (0.0005) [2023-03-09 07:05:10,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10103.5, 300 sec: 10205.3). Total num frames: 51986432. Throughput: 0: 10228.1. Samples: 51965196. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:05:10,122][635489] Avg episode reward: [(0, '4256.971')] [2023-03-09 07:05:13,263][635776] Updated weights for policy 0, policy_version 101600 (0.0005) [2023-03-09 07:05:15,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 10191.4). Total num frames: 52035584. Throughput: 0: 10218.3. Samples: 52027100. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:05:15,122][635489] Avg episode reward: [(0, '4268.549')] [2023-03-09 07:05:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000101632_52035584.pth... [2023-03-09 07:05:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000101040_51732480.pth [2023-03-09 07:05:17,474][635776] Updated weights for policy 0, policy_version 101680 (0.0005) [2023-03-09 07:05:20,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10171.7, 300 sec: 10163.6). Total num frames: 52084736. Throughput: 0: 10083.9. Samples: 52084628. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:05:20,122][635489] Avg episode reward: [(0, '4240.499')] [2023-03-09 07:05:21,681][635776] Updated weights for policy 0, policy_version 101760 (0.0005) [2023-03-09 07:05:25,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10163.6). Total num frames: 52133888. Throughput: 0: 9993.5. Samples: 52112812. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:05:25,122][635489] Avg episode reward: [(0, '4046.613')] [2023-03-09 07:05:25,924][635776] Updated weights for policy 0, policy_version 101840 (0.0005) [2023-03-09 07:05:30,122][635489] Fps is (10 sec: 9420.8, 60 sec: 9966.9, 300 sec: 10149.7). Total num frames: 52178944. Throughput: 0: 9985.9. Samples: 52170544. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:05:30,122][635489] Avg episode reward: [(0, '3906.978')] [2023-03-09 07:05:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000101912_52178944.pth... [2023-03-09 07:05:30,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000101336_51884032.pth [2023-03-09 07:05:30,186][635776] Updated weights for policy 0, policy_version 101920 (0.0005) [2023-03-09 07:05:34,228][635776] Updated weights for policy 0, policy_version 102000 (0.0005) [2023-03-09 07:05:35,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10163.6). Total num frames: 52232192. Throughput: 0: 9982.2. Samples: 52230896. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:05:35,122][635489] Avg episode reward: [(0, '4158.896')] [2023-03-09 07:05:38,457][635776] Updated weights for policy 0, policy_version 102080 (0.0005) [2023-03-09 07:05:40,122][635489] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 10149.7). Total num frames: 52281344. Throughput: 0: 9963.0. Samples: 52259916. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:05:40,122][635489] Avg episode reward: [(0, '4240.546')] [2023-03-09 07:05:42,439][635776] Updated weights for policy 0, policy_version 102160 (0.0005) [2023-03-09 07:05:45,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9967.0, 300 sec: 10135.9). Total num frames: 52330496. Throughput: 0: 9913.1. Samples: 52320952. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:05:45,122][635489] Avg episode reward: [(0, '4034.693')] [2023-03-09 07:05:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000102208_52330496.pth... [2023-03-09 07:05:45,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000101632_52035584.pth [2023-03-09 07:05:46,538][635776] Updated weights for policy 0, policy_version 102240 (0.0005) [2023-03-09 07:05:50,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 10149.7). Total num frames: 52379648. Throughput: 0: 9905.8. Samples: 52379544. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:05:50,122][635489] Avg episode reward: [(0, '4016.663')] [2023-03-09 07:05:50,790][635776] Updated weights for policy 0, policy_version 102320 (0.0005) [2023-03-09 07:05:54,938][635776] Updated weights for policy 0, policy_version 102400 (0.0004) [2023-03-09 07:05:55,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 10135.9). Total num frames: 52428800. Throughput: 0: 9891.2. Samples: 52410300. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 07:05:55,122][635489] Avg episode reward: [(0, '4125.696')] [2023-03-09 07:05:58,948][635776] Updated weights for policy 0, policy_version 102480 (0.0005) [2023-03-09 07:06:00,121][635489] Fps is (10 sec: 9830.5, 60 sec: 9898.7, 300 sec: 10135.9). Total num frames: 52477952. Throughput: 0: 9832.3. Samples: 52469552. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 07:06:00,122][635489] Avg episode reward: [(0, '4266.882')] [2023-03-09 07:06:00,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000102496_52477952.pth... [2023-03-09 07:06:00,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000101912_52178944.pth [2023-03-09 07:06:03,014][635776] Updated weights for policy 0, policy_version 102560 (0.0005) [2023-03-09 07:06:05,121][635489] Fps is (10 sec: 10240.1, 60 sec: 9966.9, 300 sec: 10122.0). Total num frames: 52531200. Throughput: 0: 9885.4. Samples: 52529472. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 07:06:05,122][635489] Avg episode reward: [(0, '3964.913')] [2023-03-09 07:06:07,054][635776] Updated weights for policy 0, policy_version 102640 (0.0005) [2023-03-09 07:06:10,121][635489] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 10122.0). Total num frames: 52580352. Throughput: 0: 9936.1. Samples: 52559936. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 07:06:10,122][635489] Avg episode reward: [(0, '4263.394')] [2023-03-09 07:06:10,905][635776] Updated weights for policy 0, policy_version 102720 (0.0005) [2023-03-09 07:06:15,121][635489] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10122.0). Total num frames: 52629504. Throughput: 0: 10018.7. Samples: 52621384. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 07:06:15,122][635489] Avg episode reward: [(0, '4403.132')] [2023-03-09 07:06:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000102792_52629504.pth... [2023-03-09 07:06:15,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000102208_52330496.pth [2023-03-09 07:06:15,179][635776] Updated weights for policy 0, policy_version 102800 (0.0005) [2023-03-09 07:06:19,599][635776] Updated weights for policy 0, policy_version 102880 (0.0004) [2023-03-09 07:06:20,121][635489] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10108.1). Total num frames: 52678656. Throughput: 0: 9942.0. Samples: 52678284. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 07:06:20,122][635489] Avg episode reward: [(0, '4213.057')] [2023-03-09 07:06:23,645][635776] Updated weights for policy 0, policy_version 102960 (0.0005) [2023-03-09 07:06:25,122][635489] Fps is (10 sec: 9830.3, 60 sec: 9898.7, 300 sec: 10108.1). Total num frames: 52727808. Throughput: 0: 9942.7. Samples: 52707336. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 07:06:25,122][635489] Avg episode reward: [(0, '4464.044')] [2023-03-09 07:06:27,754][635776] Updated weights for policy 0, policy_version 103040 (0.0005) [2023-03-09 07:06:30,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 10094.2). Total num frames: 52776960. Throughput: 0: 9950.7. Samples: 52768732. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 07:06:30,122][635489] Avg episode reward: [(0, '4427.124')] [2023-03-09 07:06:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000103080_52776960.pth... [2023-03-09 07:06:30,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000102496_52477952.pth [2023-03-09 07:06:31,821][635776] Updated weights for policy 0, policy_version 103120 (0.0005) [2023-03-09 07:06:35,122][635489] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 10080.3). Total num frames: 52830208. Throughput: 0: 9965.2. Samples: 52827976. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 07:06:35,122][635489] Avg episode reward: [(0, '4266.637')] [2023-03-09 07:06:35,883][635776] Updated weights for policy 0, policy_version 103200 (0.0005) [2023-03-09 07:06:39,972][635776] Updated weights for policy 0, policy_version 103280 (0.0005) [2023-03-09 07:06:40,122][635489] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 10080.3). Total num frames: 52879360. Throughput: 0: 9966.7. Samples: 52858800. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 07:06:40,122][635489] Avg episode reward: [(0, '4369.734')] [2023-03-09 07:06:43,944][635776] Updated weights for policy 0, policy_version 103360 (0.0005) [2023-03-09 07:06:45,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 10066.4). Total num frames: 52928512. Throughput: 0: 10016.2. Samples: 52920280. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 07:06:45,122][635489] Avg episode reward: [(0, '4231.554')] [2023-03-09 07:06:45,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000103384_52932608.pth... [2023-03-09 07:06:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000102792_52629504.pth [2023-03-09 07:06:47,867][635776] Updated weights for policy 0, policy_version 103440 (0.0005) [2023-03-09 07:06:50,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10080.3). Total num frames: 52981760. Throughput: 0: 10032.9. Samples: 52980952. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 07:06:50,122][635489] Avg episode reward: [(0, '4212.555')] [2023-03-09 07:06:51,919][635776] Updated weights for policy 0, policy_version 103520 (0.0004) [2023-03-09 07:06:55,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10035.2, 300 sec: 10066.4). Total num frames: 53030912. Throughput: 0: 10023.5. Samples: 53010992. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 07:06:55,122][635489] Avg episode reward: [(0, '4081.370')] [2023-03-09 07:06:56,025][635776] Updated weights for policy 0, policy_version 103600 (0.0005) [2023-03-09 07:06:59,882][635776] Updated weights for policy 0, policy_version 103680 (0.0005) [2023-03-09 07:07:00,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10080.3). Total num frames: 53084160. Throughput: 0: 10061.0. Samples: 53074128. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 07:07:00,122][635489] Avg episode reward: [(0, '4006.316')] [2023-03-09 07:07:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000103680_53084160.pth... [2023-03-09 07:07:00,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000103080_52776960.pth [2023-03-09 07:07:03,947][635776] Updated weights for policy 0, policy_version 103760 (0.0004) [2023-03-09 07:07:05,121][635489] Fps is (10 sec: 10649.6, 60 sec: 10103.5, 300 sec: 10080.3). Total num frames: 53137408. Throughput: 0: 10142.0. Samples: 53134672. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 07:07:05,122][635489] Avg episode reward: [(0, '4193.764')] [2023-03-09 07:07:07,843][635776] Updated weights for policy 0, policy_version 103840 (0.0005) [2023-03-09 07:07:10,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10066.4). Total num frames: 53186560. Throughput: 0: 10196.3. Samples: 53166168. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 07:07:10,122][635489] Avg episode reward: [(0, '4308.373')] [2023-03-09 07:07:11,769][635776] Updated weights for policy 0, policy_version 103920 (0.0005) [2023-03-09 07:07:15,122][635489] Fps is (10 sec: 10239.8, 60 sec: 10171.7, 300 sec: 10080.3). Total num frames: 53239808. Throughput: 0: 10211.0. Samples: 53228228. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 07:07:15,122][635489] Avg episode reward: [(0, '3964.504')] [2023-03-09 07:07:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000103984_53239808.pth... [2023-03-09 07:07:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000103384_52932608.pth [2023-03-09 07:07:15,694][635776] Updated weights for policy 0, policy_version 104000 (0.0005) [2023-03-09 07:07:19,406][635776] Updated weights for policy 0, policy_version 104080 (0.0005) [2023-03-09 07:07:20,121][635489] Fps is (10 sec: 10649.7, 60 sec: 10240.0, 300 sec: 10094.2). Total num frames: 53293056. Throughput: 0: 10337.6. Samples: 53293168. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 07:07:20,122][635489] Avg episode reward: [(0, '4143.312')] [2023-03-09 07:07:23,270][635776] Updated weights for policy 0, policy_version 104160 (0.0005) [2023-03-09 07:07:25,122][635489] Fps is (10 sec: 10649.7, 60 sec: 10308.3, 300 sec: 10094.2). Total num frames: 53346304. Throughput: 0: 10376.4. Samples: 53325736. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 07:07:25,122][635489] Avg episode reward: [(0, '4374.357')] [2023-03-09 07:07:27,676][635776] Updated weights for policy 0, policy_version 104240 (0.0005) [2023-03-09 07:07:30,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10240.0, 300 sec: 10080.3). Total num frames: 53391360. Throughput: 0: 10283.7. Samples: 53383044. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 07:07:30,122][635489] Avg episode reward: [(0, '4349.333')] [2023-03-09 07:07:30,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000104280_53391360.pth... [2023-03-09 07:07:30,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000103680_53084160.pth [2023-03-09 07:07:31,832][635776] Updated weights for policy 0, policy_version 104320 (0.0005) [2023-03-09 07:07:35,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10240.0, 300 sec: 10094.2). Total num frames: 53444608. Throughput: 0: 10272.8. Samples: 53443228. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 07:07:35,122][635489] Avg episode reward: [(0, '4483.971')] [2023-03-09 07:07:35,841][635776] Updated weights for policy 0, policy_version 104400 (0.0005) [2023-03-09 07:07:39,810][635776] Updated weights for policy 0, policy_version 104480 (0.0005) [2023-03-09 07:07:40,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10094.2). Total num frames: 53493760. Throughput: 0: 10280.8. Samples: 53473628. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 07:07:40,122][635489] Avg episode reward: [(0, '4456.567')] [2023-03-09 07:07:43,743][635776] Updated weights for policy 0, policy_version 104560 (0.0005) [2023-03-09 07:07:45,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10308.3, 300 sec: 10094.2). Total num frames: 53547008. Throughput: 0: 10264.5. Samples: 53536032. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 07:07:45,122][635489] Avg episode reward: [(0, '4458.973')] [2023-03-09 07:07:45,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000104584_53547008.pth... [2023-03-09 07:07:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000103984_53239808.pth [2023-03-09 07:07:47,844][635776] Updated weights for policy 0, policy_version 104640 (0.0005) [2023-03-09 07:07:50,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10094.2). Total num frames: 53596160. Throughput: 0: 10255.8. Samples: 53596184. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 07:07:50,122][635489] Avg episode reward: [(0, '4480.167')] [2023-03-09 07:07:51,870][635776] Updated weights for policy 0, policy_version 104720 (0.0005) [2023-03-09 07:07:55,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10240.0, 300 sec: 10094.2). Total num frames: 53645312. Throughput: 0: 10208.0. Samples: 53625528. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:07:55,122][635489] Avg episode reward: [(0, '4441.986')] [2023-03-09 07:07:56,090][635776] Updated weights for policy 0, policy_version 104800 (0.0005) [2023-03-09 07:08:00,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10080.3). Total num frames: 53694464. Throughput: 0: 10135.6. Samples: 53684328. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:08:00,122][635489] Avg episode reward: [(0, '4385.585')] [2023-03-09 07:08:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000104872_53694464.pth... [2023-03-09 07:08:00,129][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000104280_53391360.pth [2023-03-09 07:08:00,330][635776] Updated weights for policy 0, policy_version 104880 (0.0005) [2023-03-09 07:08:04,245][635776] Updated weights for policy 0, policy_version 104960 (0.0005) [2023-03-09 07:08:05,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10094.2). Total num frames: 53747712. Throughput: 0: 10063.7. Samples: 53746032. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:08:05,122][635489] Avg episode reward: [(0, '4395.955')] [2023-03-09 07:08:08,219][635776] Updated weights for policy 0, policy_version 105040 (0.0005) [2023-03-09 07:08:10,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10094.2). Total num frames: 53796864. Throughput: 0: 10027.8. Samples: 53776988. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:08:10,122][635489] Avg episode reward: [(0, '4387.298')] [2023-03-09 07:08:12,141][635776] Updated weights for policy 0, policy_version 105120 (0.0005) [2023-03-09 07:08:15,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 10108.1). Total num frames: 53850112. Throughput: 0: 10146.4. Samples: 53839632. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:08:15,122][635489] Avg episode reward: [(0, '4352.232')] [2023-03-09 07:08:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000105176_53850112.pth... [2023-03-09 07:08:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000104584_53547008.pth [2023-03-09 07:08:16,032][635776] Updated weights for policy 0, policy_version 105200 (0.0005) [2023-03-09 07:08:20,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10103.5, 300 sec: 10108.1). Total num frames: 53899264. Throughput: 0: 10184.3. Samples: 53901524. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:08:20,122][635489] Avg episode reward: [(0, '4016.236')] [2023-03-09 07:08:20,160][635776] Updated weights for policy 0, policy_version 105280 (0.0005) [2023-03-09 07:08:24,412][635776] Updated weights for policy 0, policy_version 105360 (0.0005) [2023-03-09 07:08:25,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10035.2, 300 sec: 10094.2). Total num frames: 53948416. Throughput: 0: 10139.9. Samples: 53929924. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:08:25,122][635489] Avg episode reward: [(0, '4360.481')] [2023-03-09 07:08:28,705][635776] Updated weights for policy 0, policy_version 105440 (0.0006) [2023-03-09 07:08:30,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10103.5, 300 sec: 10094.2). Total num frames: 53997568. Throughput: 0: 10024.7. Samples: 53987144. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:08:30,122][635489] Avg episode reward: [(0, '4492.450')] [2023-03-09 07:08:30,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000105464_53997568.pth... [2023-03-09 07:08:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000104872_53694464.pth [2023-03-09 07:08:32,770][635776] Updated weights for policy 0, policy_version 105520 (0.0005) [2023-03-09 07:08:35,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10066.4). Total num frames: 54046720. Throughput: 0: 10039.8. Samples: 54047972. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:08:35,122][635489] Avg episode reward: [(0, '4396.030')] [2023-03-09 07:08:36,743][635776] Updated weights for policy 0, policy_version 105600 (0.0005) [2023-03-09 07:08:40,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.4, 300 sec: 10080.3). Total num frames: 54099968. Throughput: 0: 10083.9. Samples: 54079304. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:08:40,122][635489] Avg episode reward: [(0, '4374.086')] [2023-03-09 07:08:40,848][635776] Updated weights for policy 0, policy_version 105680 (0.0005) [2023-03-09 07:08:45,025][635776] Updated weights for policy 0, policy_version 105760 (0.0005) [2023-03-09 07:08:45,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10035.2, 300 sec: 10080.3). Total num frames: 54149120. Throughput: 0: 10068.1. Samples: 54137392. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:08:45,122][635489] Avg episode reward: [(0, '4308.736')] [2023-03-09 07:08:45,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000105760_54149120.pth... [2023-03-09 07:08:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000105176_53850112.pth [2023-03-09 07:08:49,080][635776] Updated weights for policy 0, policy_version 105840 (0.0004) [2023-03-09 07:08:50,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10035.2, 300 sec: 10066.4). Total num frames: 54198272. Throughput: 0: 10051.4. Samples: 54198344. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:08:50,122][635489] Avg episode reward: [(0, '4158.467')] [2023-03-09 07:08:53,406][635776] Updated weights for policy 0, policy_version 105920 (0.0005) [2023-03-09 07:08:55,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10066.4). Total num frames: 54247424. Throughput: 0: 9996.4. Samples: 54226828. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:08:55,122][635489] Avg episode reward: [(0, '4224.101')] [2023-03-09 07:08:57,759][635776] Updated weights for policy 0, policy_version 106000 (0.0005) [2023-03-09 07:09:00,122][635489] Fps is (10 sec: 9420.7, 60 sec: 9966.9, 300 sec: 10038.7). Total num frames: 54292480. Throughput: 0: 9878.7. Samples: 54284172. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:09:00,122][635489] Avg episode reward: [(0, '4326.481')] [2023-03-09 07:09:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000106040_54292480.pth... [2023-03-09 07:09:00,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000105464_53997568.pth [2023-03-09 07:09:01,873][635776] Updated weights for policy 0, policy_version 106080 (0.0005) [2023-03-09 07:09:05,122][635489] Fps is (10 sec: 9420.8, 60 sec: 9898.7, 300 sec: 10038.7). Total num frames: 54341632. Throughput: 0: 9808.4. Samples: 54342904. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:09:05,122][635489] Avg episode reward: [(0, '4359.429')] [2023-03-09 07:09:05,884][635776] Updated weights for policy 0, policy_version 106160 (0.0004) [2023-03-09 07:09:10,121][635489] Fps is (10 sec: 9830.5, 60 sec: 9898.7, 300 sec: 10038.7). Total num frames: 54390784. Throughput: 0: 9852.9. Samples: 54373304. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:09:10,122][635489] Avg episode reward: [(0, '4288.438')] [2023-03-09 07:09:10,201][635776] Updated weights for policy 0, policy_version 106240 (0.0004) [2023-03-09 07:09:14,353][635776] Updated weights for policy 0, policy_version 106320 (0.0004) [2023-03-09 07:09:15,121][635489] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10052.6). Total num frames: 54439936. Throughput: 0: 9877.6. Samples: 54431636. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:09:15,122][635489] Avg episode reward: [(0, '4108.840')] [2023-03-09 07:09:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000106328_54439936.pth... [2023-03-09 07:09:15,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000105760_54149120.pth [2023-03-09 07:09:18,603][635776] Updated weights for policy 0, policy_version 106400 (0.0005) [2023-03-09 07:09:20,121][635489] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10052.6). Total num frames: 54489088. Throughput: 0: 9809.1. Samples: 54489380. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:09:20,122][635489] Avg episode reward: [(0, '4278.202')] [2023-03-09 07:09:22,853][635776] Updated weights for policy 0, policy_version 106480 (0.0005) [2023-03-09 07:09:25,122][635489] Fps is (10 sec: 9830.3, 60 sec: 9830.4, 300 sec: 10024.8). Total num frames: 54538240. Throughput: 0: 9745.1. Samples: 54517832. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:09:25,122][635489] Avg episode reward: [(0, '4292.176')] [2023-03-09 07:09:26,949][635776] Updated weights for policy 0, policy_version 106560 (0.0005) [2023-03-09 07:09:30,122][635489] Fps is (10 sec: 10239.9, 60 sec: 9898.7, 300 sec: 10038.7). Total num frames: 54591488. Throughput: 0: 9819.4. Samples: 54579264. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:09:30,122][635489] Avg episode reward: [(0, '4337.755')] [2023-03-09 07:09:30,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000106624_54591488.pth... [2023-03-09 07:09:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000106040_54292480.pth [2023-03-09 07:09:30,876][635776] Updated weights for policy 0, policy_version 106640 (0.0005) [2023-03-09 07:09:35,012][635776] Updated weights for policy 0, policy_version 106720 (0.0004) [2023-03-09 07:09:35,122][635489] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 10024.8). Total num frames: 54640640. Throughput: 0: 9822.0. Samples: 54640336. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:09:35,122][635489] Avg episode reward: [(0, '4295.488')] [2023-03-09 07:09:39,082][635776] Updated weights for policy 0, policy_version 106800 (0.0005) [2023-03-09 07:09:40,122][635489] Fps is (10 sec: 9830.3, 60 sec: 9830.4, 300 sec: 10024.8). Total num frames: 54689792. Throughput: 0: 9824.5. Samples: 54668932. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:09:40,122][635489] Avg episode reward: [(0, '4406.146')] [2023-03-09 07:09:43,238][635776] Updated weights for policy 0, policy_version 106880 (0.0005) [2023-03-09 07:09:45,121][635489] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10024.8). Total num frames: 54738944. Throughput: 0: 9918.6. Samples: 54730508. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:09:45,122][635489] Avg episode reward: [(0, '4424.381')] [2023-03-09 07:09:45,154][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000106920_54743040.pth... [2023-03-09 07:09:45,157][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000106328_54439936.pth [2023-03-09 07:09:46,947][635776] Updated weights for policy 0, policy_version 106960 (0.0005) [2023-03-09 07:09:50,121][635489] Fps is (10 sec: 10240.1, 60 sec: 9898.7, 300 sec: 10038.7). Total num frames: 54792192. Throughput: 0: 10004.1. Samples: 54793088. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:09:50,122][635489] Avg episode reward: [(0, '4398.089')] [2023-03-09 07:09:51,056][635776] Updated weights for policy 0, policy_version 107040 (0.0004) [2023-03-09 07:09:55,011][635776] Updated weights for policy 0, policy_version 107120 (0.0005) [2023-03-09 07:09:55,122][635489] Fps is (10 sec: 10649.5, 60 sec: 9966.9, 300 sec: 10038.7). Total num frames: 54845440. Throughput: 0: 10013.4. Samples: 54823908. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 07:09:55,122][635489] Avg episode reward: [(0, '4466.531')] [2023-03-09 07:09:59,203][635776] Updated weights for policy 0, policy_version 107200 (0.0005) [2023-03-09 07:10:00,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10038.7). Total num frames: 54894592. Throughput: 0: 10019.2. Samples: 54882500. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 07:10:00,122][635489] Avg episode reward: [(0, '4511.595')] [2023-03-09 07:10:00,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000107216_54894592.pth... [2023-03-09 07:10:00,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000106624_54591488.pth [2023-03-09 07:10:03,473][635776] Updated weights for policy 0, policy_version 107280 (0.0005) [2023-03-09 07:10:05,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10035.2, 300 sec: 10024.8). Total num frames: 54943744. Throughput: 0: 10096.3. Samples: 54943712. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 07:10:05,122][635489] Avg episode reward: [(0, '4486.686')] [2023-03-09 07:10:07,238][635776] Updated weights for policy 0, policy_version 107360 (0.0006) [2023-03-09 07:10:10,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 10038.7). Total num frames: 54996992. Throughput: 0: 10193.1. Samples: 54976520. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 07:10:10,122][635489] Avg episode reward: [(0, '4435.249')] [2023-03-09 07:10:11,333][635776] Updated weights for policy 0, policy_version 107440 (0.0005) [2023-03-09 07:10:15,122][635489] Fps is (10 sec: 10239.8, 60 sec: 10103.4, 300 sec: 10038.7). Total num frames: 55046144. Throughput: 0: 10161.6. Samples: 55036536. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 07:10:15,122][635489] Avg episode reward: [(0, '4538.881')] [2023-03-09 07:10:15,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000107512_55046144.pth... [2023-03-09 07:10:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000106920_54743040.pth [2023-03-09 07:10:15,255][635776] Updated weights for policy 0, policy_version 107520 (0.0005) [2023-03-09 07:10:19,478][635776] Updated weights for policy 0, policy_version 107600 (0.0005) [2023-03-09 07:10:20,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10103.5, 300 sec: 10038.7). Total num frames: 55095296. Throughput: 0: 10111.7. Samples: 55095360. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 07:10:20,122][635489] Avg episode reward: [(0, '4465.991')] [2023-03-09 07:10:23,391][635776] Updated weights for policy 0, policy_version 107680 (0.0005) [2023-03-09 07:10:25,122][635489] Fps is (10 sec: 10240.1, 60 sec: 10171.7, 300 sec: 10066.4). Total num frames: 55148544. Throughput: 0: 10193.7. Samples: 55127648. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 07:10:25,122][635489] Avg episode reward: [(0, '4442.960')] [2023-03-09 07:10:27,615][635776] Updated weights for policy 0, policy_version 107760 (0.0006) [2023-03-09 07:10:30,121][635489] Fps is (10 sec: 9830.3, 60 sec: 10035.2, 300 sec: 10038.7). Total num frames: 55193600. Throughput: 0: 10108.2. Samples: 55185376. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 07:10:30,122][635489] Avg episode reward: [(0, '4510.265')] [2023-03-09 07:10:30,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000107800_55193600.pth... [2023-03-09 07:10:30,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000107216_54894592.pth [2023-03-09 07:10:31,947][635776] Updated weights for policy 0, policy_version 107840 (0.0005) [2023-03-09 07:10:35,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10103.5, 300 sec: 10052.6). Total num frames: 55246848. Throughput: 0: 10075.4. Samples: 55246484. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 07:10:35,122][635489] Avg episode reward: [(0, '4493.369')] [2023-03-09 07:10:35,855][635776] Updated weights for policy 0, policy_version 107920 (0.0005) [2023-03-09 07:10:39,898][635776] Updated weights for policy 0, policy_version 108000 (0.0005) [2023-03-09 07:10:40,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10052.6). Total num frames: 55296000. Throughput: 0: 10035.8. Samples: 55275520. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 07:10:40,122][635489] Avg episode reward: [(0, '4522.678')] [2023-03-09 07:10:43,855][635776] Updated weights for policy 0, policy_version 108080 (0.0005) [2023-03-09 07:10:45,121][635489] Fps is (10 sec: 10240.2, 60 sec: 10171.7, 300 sec: 10066.4). Total num frames: 55349248. Throughput: 0: 10100.5. Samples: 55337024. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 07:10:45,122][635489] Avg episode reward: [(0, '4515.425')] [2023-03-09 07:10:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000108104_55349248.pth... [2023-03-09 07:10:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000107512_55046144.pth [2023-03-09 07:10:47,888][635776] Updated weights for policy 0, policy_version 108160 (0.0005) [2023-03-09 07:10:50,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10066.4). Total num frames: 55398400. Throughput: 0: 10084.2. Samples: 55397500. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 07:10:50,122][635489] Avg episode reward: [(0, '4551.984')] [2023-03-09 07:10:52,088][635776] Updated weights for policy 0, policy_version 108240 (0.0004) [2023-03-09 07:10:55,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10035.2, 300 sec: 10066.4). Total num frames: 55447552. Throughput: 0: 10008.3. Samples: 55426892. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 07:10:55,122][635489] Avg episode reward: [(0, '4545.064')] [2023-03-09 07:10:56,203][635776] Updated weights for policy 0, policy_version 108320 (0.0005) [2023-03-09 07:11:00,005][635776] Updated weights for policy 0, policy_version 108400 (0.0005) [2023-03-09 07:11:00,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10103.4, 300 sec: 10066.4). Total num frames: 55500800. Throughput: 0: 10050.8. Samples: 55488820. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:11:00,122][635489] Avg episode reward: [(0, '4470.802')] [2023-03-09 07:11:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000108400_55500800.pth... [2023-03-09 07:11:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000107800_55193600.pth [2023-03-09 07:11:03,994][635776] Updated weights for policy 0, policy_version 108480 (0.0005) [2023-03-09 07:11:05,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10066.4). Total num frames: 55549952. Throughput: 0: 10135.6. Samples: 55551464. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:11:05,122][635489] Avg episode reward: [(0, '4524.175')] [2023-03-09 07:11:07,760][635776] Updated weights for policy 0, policy_version 108560 (0.0005) [2023-03-09 07:11:10,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10171.7, 300 sec: 10094.2). Total num frames: 55607296. Throughput: 0: 10140.4. Samples: 55583968. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:11:10,122][635489] Avg episode reward: [(0, '4470.225')] [2023-03-09 07:11:11,405][635776] Updated weights for policy 0, policy_version 108640 (0.0005) [2023-03-09 07:11:15,122][635489] Fps is (10 sec: 11059.2, 60 sec: 10240.0, 300 sec: 10108.1). Total num frames: 55660544. Throughput: 0: 10287.8. Samples: 55648328. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:11:15,122][635489] Avg episode reward: [(0, '4446.919')] [2023-03-09 07:11:15,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000108712_55660544.pth... [2023-03-09 07:11:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000108104_55349248.pth [2023-03-09 07:11:15,414][635776] Updated weights for policy 0, policy_version 108720 (0.0005) [2023-03-09 07:11:19,254][635776] Updated weights for policy 0, policy_version 108800 (0.0005) [2023-03-09 07:11:20,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10308.2, 300 sec: 10122.0). Total num frames: 55713792. Throughput: 0: 10344.5. Samples: 55711988. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:11:20,122][635489] Avg episode reward: [(0, '4466.840')] [2023-03-09 07:11:23,294][635776] Updated weights for policy 0, policy_version 108880 (0.0005) [2023-03-09 07:11:25,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10122.0). Total num frames: 55762944. Throughput: 0: 10378.1. Samples: 55742536. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:11:25,122][635489] Avg episode reward: [(0, '4381.305')] [2023-03-09 07:11:27,477][635776] Updated weights for policy 0, policy_version 108960 (0.0005) [2023-03-09 07:11:30,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10308.2, 300 sec: 10108.1). Total num frames: 55812096. Throughput: 0: 10344.4. Samples: 55802524. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:11:30,122][635489] Avg episode reward: [(0, '4350.655')] [2023-03-09 07:11:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000109008_55812096.pth... [2023-03-09 07:11:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000108400_55500800.pth [2023-03-09 07:11:31,547][635776] Updated weights for policy 0, policy_version 109040 (0.0005) [2023-03-09 07:11:35,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 10108.1). Total num frames: 55861248. Throughput: 0: 10286.3. Samples: 55860384. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:11:35,122][635489] Avg episode reward: [(0, '4486.875')] [2023-03-09 07:11:35,880][635776] Updated weights for policy 0, policy_version 109120 (0.0004) [2023-03-09 07:11:39,813][635776] Updated weights for policy 0, policy_version 109200 (0.0005) [2023-03-09 07:11:40,122][635489] Fps is (10 sec: 9830.5, 60 sec: 10240.0, 300 sec: 10108.1). Total num frames: 55910400. Throughput: 0: 10290.9. Samples: 55889984. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:11:40,122][635489] Avg episode reward: [(0, '4374.479')] [2023-03-09 07:11:43,659][635776] Updated weights for policy 0, policy_version 109280 (0.0005) [2023-03-09 07:11:45,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10108.1). Total num frames: 55963648. Throughput: 0: 10339.6. Samples: 55954100. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:11:45,122][635489] Avg episode reward: [(0, '4278.706')] [2023-03-09 07:11:45,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000109304_55963648.pth... [2023-03-09 07:11:45,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000108712_55660544.pth [2023-03-09 07:11:47,615][635776] Updated weights for policy 0, policy_version 109360 (0.0005) [2023-03-09 07:11:50,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10108.1). Total num frames: 56012800. Throughput: 0: 10262.8. Samples: 56013288. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:11:50,122][635489] Avg episode reward: [(0, '4200.294')] [2023-03-09 07:11:51,934][635776] Updated weights for policy 0, policy_version 109440 (0.0005) [2023-03-09 07:11:55,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10308.3, 300 sec: 10108.1). Total num frames: 56066048. Throughput: 0: 10218.1. Samples: 56043784. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:11:55,122][635489] Avg episode reward: [(0, '4430.820')] [2023-03-09 07:11:55,854][635776] Updated weights for policy 0, policy_version 109520 (0.0005) [2023-03-09 07:11:59,914][635776] Updated weights for policy 0, policy_version 109600 (0.0005) [2023-03-09 07:12:00,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10094.2). Total num frames: 56115200. Throughput: 0: 10158.6. Samples: 56105464. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:12:00,122][635489] Avg episode reward: [(0, '4464.274')] [2023-03-09 07:12:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000109600_56115200.pth... [2023-03-09 07:12:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000109008_55812096.pth [2023-03-09 07:12:04,075][635776] Updated weights for policy 0, policy_version 109680 (0.0005) [2023-03-09 07:12:05,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10240.0, 300 sec: 10094.2). Total num frames: 56164352. Throughput: 0: 10058.2. Samples: 56164604. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:12:05,122][635489] Avg episode reward: [(0, '4440.001')] [2023-03-09 07:12:08,067][635776] Updated weights for policy 0, policy_version 109760 (0.0005) [2023-03-09 07:12:10,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10094.2). Total num frames: 56217600. Throughput: 0: 10055.3. Samples: 56195024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:12:10,122][635489] Avg episode reward: [(0, '4491.991')] [2023-03-09 07:12:12,013][635776] Updated weights for policy 0, policy_version 109840 (0.0005) [2023-03-09 07:12:15,121][635489] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 10080.3). Total num frames: 56266752. Throughput: 0: 10126.8. Samples: 56258228. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:12:15,122][635489] Avg episode reward: [(0, '4516.382')] [2023-03-09 07:12:15,136][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000109904_56270848.pth... [2023-03-09 07:12:15,138][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000109304_55963648.pth [2023-03-09 07:12:15,864][635776] Updated weights for policy 0, policy_version 109920 (0.0005) [2023-03-09 07:12:19,767][635776] Updated weights for policy 0, policy_version 110000 (0.0005) [2023-03-09 07:12:20,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10171.7, 300 sec: 10094.2). Total num frames: 56324096. Throughput: 0: 10239.4. Samples: 56321156. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:12:20,122][635489] Avg episode reward: [(0, '4467.243')] [2023-03-09 07:12:23,798][635776] Updated weights for policy 0, policy_version 110080 (0.0005) [2023-03-09 07:12:25,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10171.7, 300 sec: 10108.1). Total num frames: 56373248. Throughput: 0: 10282.0. Samples: 56352672. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:12:25,122][635489] Avg episode reward: [(0, '4484.049')] [2023-03-09 07:12:27,494][635776] Updated weights for policy 0, policy_version 110160 (0.0005) [2023-03-09 07:12:30,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10108.1). Total num frames: 56426496. Throughput: 0: 10313.9. Samples: 56418224. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:12:30,122][635489] Avg episode reward: [(0, '4237.638')] [2023-03-09 07:12:30,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000110208_56426496.pth... [2023-03-09 07:12:30,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000109600_56115200.pth [2023-03-09 07:12:31,435][635776] Updated weights for policy 0, policy_version 110240 (0.0004) [2023-03-09 07:12:35,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 10122.0). Total num frames: 56479744. Throughput: 0: 10365.0. Samples: 56479712. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:12:35,122][635489] Avg episode reward: [(0, '4269.502')] [2023-03-09 07:12:35,358][635776] Updated weights for policy 0, policy_version 110320 (0.0005) [2023-03-09 07:12:39,268][635776] Updated weights for policy 0, policy_version 110400 (0.0005) [2023-03-09 07:12:40,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10376.5, 300 sec: 10122.0). Total num frames: 56532992. Throughput: 0: 10394.3. Samples: 56511528. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:12:40,122][635489] Avg episode reward: [(0, '4316.812')] [2023-03-09 07:12:43,069][635776] Updated weights for policy 0, policy_version 110480 (0.0005) [2023-03-09 07:12:45,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10308.2, 300 sec: 10122.0). Total num frames: 56582144. Throughput: 0: 10409.7. Samples: 56573900. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:12:45,122][635489] Avg episode reward: [(0, '4334.721')] [2023-03-09 07:12:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000110512_56582144.pth... [2023-03-09 07:12:45,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000109904_56270848.pth [2023-03-09 07:12:47,336][635776] Updated weights for policy 0, policy_version 110560 (0.0005) [2023-03-09 07:12:50,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10308.3, 300 sec: 10122.0). Total num frames: 56631296. Throughput: 0: 10372.3. Samples: 56631360. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:12:50,122][635489] Avg episode reward: [(0, '4327.461')] [2023-03-09 07:12:51,549][635776] Updated weights for policy 0, policy_version 110640 (0.0005) [2023-03-09 07:12:55,122][635489] Fps is (10 sec: 9830.5, 60 sec: 10240.0, 300 sec: 10122.0). Total num frames: 56680448. Throughput: 0: 10363.2. Samples: 56661368. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:12:55,122][635489] Avg episode reward: [(0, '4189.152')] [2023-03-09 07:12:55,612][635776] Updated weights for policy 0, policy_version 110720 (0.0004) [2023-03-09 07:12:59,811][635776] Updated weights for policy 0, policy_version 110800 (0.0004) [2023-03-09 07:13:00,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 10108.1). Total num frames: 56729600. Throughput: 0: 10262.8. Samples: 56720056. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:13:00,122][635489] Avg episode reward: [(0, '3990.253')] [2023-03-09 07:13:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000110800_56729600.pth... [2023-03-09 07:13:00,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000110208_56426496.pth [2023-03-09 07:13:03,794][635776] Updated weights for policy 0, policy_version 110880 (0.0005) [2023-03-09 07:13:05,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10308.2, 300 sec: 10122.0). Total num frames: 56782848. Throughput: 0: 10257.6. Samples: 56782748. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:13:05,122][635489] Avg episode reward: [(0, '4133.279')] [2023-03-09 07:13:07,709][635776] Updated weights for policy 0, policy_version 110960 (0.0005) [2023-03-09 07:13:10,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 10122.0). Total num frames: 56836096. Throughput: 0: 10244.6. Samples: 56813680. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:13:10,122][635489] Avg episode reward: [(0, '4057.940')] [2023-03-09 07:13:11,626][635776] Updated weights for policy 0, policy_version 111040 (0.0005) [2023-03-09 07:13:15,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 10122.0). Total num frames: 56885248. Throughput: 0: 10197.9. Samples: 56877128. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:13:15,122][635489] Avg episode reward: [(0, '4239.809')] [2023-03-09 07:13:15,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000111104_56885248.pth... [2023-03-09 07:13:15,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000110512_56582144.pth [2023-03-09 07:13:15,674][635776] Updated weights for policy 0, policy_version 111120 (0.0004) [2023-03-09 07:13:19,634][635776] Updated weights for policy 0, policy_version 111200 (0.0005) [2023-03-09 07:13:20,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10135.9). Total num frames: 56938496. Throughput: 0: 10173.6. Samples: 56937524. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:13:20,122][635489] Avg episode reward: [(0, '4331.940')] [2023-03-09 07:13:23,539][635776] Updated weights for policy 0, policy_version 111280 (0.0005) [2023-03-09 07:13:25,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10135.9). Total num frames: 56987648. Throughput: 0: 10160.0. Samples: 56968728. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:13:25,122][635489] Avg episode reward: [(0, '4275.239')] [2023-03-09 07:13:27,400][635776] Updated weights for policy 0, policy_version 111360 (0.0005) [2023-03-09 07:13:30,121][635489] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 10163.6). Total num frames: 57044992. Throughput: 0: 10188.6. Samples: 57032384. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:13:30,122][635489] Avg episode reward: [(0, '4079.155')] [2023-03-09 07:13:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000111416_57044992.pth... [2023-03-09 07:13:30,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000110800_56729600.pth [2023-03-09 07:13:31,257][635776] Updated weights for policy 0, policy_version 111440 (0.0005) [2023-03-09 07:13:35,121][635489] Fps is (10 sec: 10649.7, 60 sec: 10240.0, 300 sec: 10149.8). Total num frames: 57094144. Throughput: 0: 10300.3. Samples: 57094872. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:13:35,122][635489] Avg episode reward: [(0, '3719.221')] [2023-03-09 07:13:35,253][635776] Updated weights for policy 0, policy_version 111520 (0.0005) [2023-03-09 07:13:39,391][635776] Updated weights for policy 0, policy_version 111600 (0.0005) [2023-03-09 07:13:40,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10171.7, 300 sec: 10149.7). Total num frames: 57143296. Throughput: 0: 10297.6. Samples: 57124760. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:13:40,122][635489] Avg episode reward: [(0, '4070.520')] [2023-03-09 07:13:43,303][635776] Updated weights for policy 0, policy_version 111680 (0.0004) [2023-03-09 07:13:45,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10163.6). Total num frames: 57196544. Throughput: 0: 10346.5. Samples: 57185648. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:13:45,122][635489] Avg episode reward: [(0, '4096.215')] [2023-03-09 07:13:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000111712_57196544.pth... [2023-03-09 07:13:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000111104_56885248.pth [2023-03-09 07:13:47,639][635776] Updated weights for policy 0, policy_version 111760 (0.0004) [2023-03-09 07:13:50,122][635489] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10163.6). Total num frames: 57245696. Throughput: 0: 10283.5. Samples: 57245504. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:13:50,122][635489] Avg episode reward: [(0, '4146.429')] [2023-03-09 07:13:51,491][635776] Updated weights for policy 0, policy_version 111840 (0.0005) [2023-03-09 07:13:55,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10240.0, 300 sec: 10177.5). Total num frames: 57294848. Throughput: 0: 10247.2. Samples: 57274804. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:13:55,122][635489] Avg episode reward: [(0, '4142.589')] [2023-03-09 07:13:55,737][635776] Updated weights for policy 0, policy_version 111920 (0.0005) [2023-03-09 07:13:59,582][635776] Updated weights for policy 0, policy_version 112000 (0.0005) [2023-03-09 07:14:00,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10308.2, 300 sec: 10191.4). Total num frames: 57348096. Throughput: 0: 10194.3. Samples: 57335872. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 07:14:00,122][635489] Avg episode reward: [(0, '4064.251')] [2023-03-09 07:14:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000112008_57348096.pth... [2023-03-09 07:14:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000111416_57044992.pth [2023-03-09 07:14:03,519][635776] Updated weights for policy 0, policy_version 112080 (0.0005) [2023-03-09 07:14:05,121][635489] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 10205.3). Total num frames: 57401344. Throughput: 0: 10261.0. Samples: 57399268. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 07:14:05,122][635489] Avg episode reward: [(0, '3834.617')] [2023-03-09 07:14:07,551][635776] Updated weights for policy 0, policy_version 112160 (0.0005) [2023-03-09 07:14:10,122][635489] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10205.3). Total num frames: 57450496. Throughput: 0: 10245.6. Samples: 57429780. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 07:14:10,122][635489] Avg episode reward: [(0, '3767.437')] [2023-03-09 07:14:11,545][635776] Updated weights for policy 0, policy_version 112240 (0.0005) [2023-03-09 07:14:15,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10308.2, 300 sec: 10219.2). Total num frames: 57503744. Throughput: 0: 10199.8. Samples: 57491376. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 07:14:15,122][635489] Avg episode reward: [(0, '4054.494')] [2023-03-09 07:14:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000112312_57503744.pth... [2023-03-09 07:14:15,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000111712_57196544.pth [2023-03-09 07:14:15,450][635776] Updated weights for policy 0, policy_version 112320 (0.0004) [2023-03-09 07:14:19,708][635776] Updated weights for policy 0, policy_version 112400 (0.0004) [2023-03-09 07:14:20,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10171.7, 300 sec: 10205.3). Total num frames: 57548800. Throughput: 0: 10128.4. Samples: 57550648. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 07:14:20,122][635489] Avg episode reward: [(0, '4245.913')] [2023-03-09 07:14:23,759][635776] Updated weights for policy 0, policy_version 112480 (0.0004) [2023-03-09 07:14:25,122][635489] Fps is (10 sec: 9830.5, 60 sec: 10240.0, 300 sec: 10205.3). Total num frames: 57602048. Throughput: 0: 10136.5. Samples: 57580900. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 07:14:25,122][635489] Avg episode reward: [(0, '4162.857')] [2023-03-09 07:14:27,786][635776] Updated weights for policy 0, policy_version 112560 (0.0005) [2023-03-09 07:14:30,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10205.3). Total num frames: 57651200. Throughput: 0: 10145.9. Samples: 57642212. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 07:14:30,122][635489] Avg episode reward: [(0, '4272.387')] [2023-03-09 07:14:30,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000112600_57651200.pth... [2023-03-09 07:14:30,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000112008_57348096.pth [2023-03-09 07:14:31,802][635776] Updated weights for policy 0, policy_version 112640 (0.0005) [2023-03-09 07:14:35,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10219.2). Total num frames: 57704448. Throughput: 0: 10183.9. Samples: 57703780. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 07:14:35,122][635489] Avg episode reward: [(0, '4357.012')] [2023-03-09 07:14:35,759][635776] Updated weights for policy 0, policy_version 112720 (0.0005) [2023-03-09 07:14:39,998][635776] Updated weights for policy 0, policy_version 112800 (0.0004) [2023-03-09 07:14:40,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 10219.2). Total num frames: 57753600. Throughput: 0: 10196.7. Samples: 57733656. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 07:14:40,122][635489] Avg episode reward: [(0, '4260.070')] [2023-03-09 07:14:43,928][635776] Updated weights for policy 0, policy_version 112880 (0.0004) [2023-03-09 07:14:45,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10171.8, 300 sec: 10219.2). Total num frames: 57806848. Throughput: 0: 10189.5. Samples: 57794400. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 07:14:45,122][635489] Avg episode reward: [(0, '4019.785')] [2023-03-09 07:14:45,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000112904_57806848.pth... [2023-03-09 07:14:45,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000112312_57503744.pth [2023-03-09 07:14:47,815][635776] Updated weights for policy 0, policy_version 112960 (0.0005) [2023-03-09 07:14:50,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10171.8, 300 sec: 10205.3). Total num frames: 57856000. Throughput: 0: 10151.0. Samples: 57856064. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 07:14:50,122][635489] Avg episode reward: [(0, '3424.598')] [2023-03-09 07:14:52,038][635776] Updated weights for policy 0, policy_version 113040 (0.0005) [2023-03-09 07:14:55,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10205.3). Total num frames: 57905152. Throughput: 0: 10121.5. Samples: 57885248. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 07:14:55,122][635489] Avg episode reward: [(0, '3109.864')] [2023-03-09 07:14:55,861][635776] Updated weights for policy 0, policy_version 113120 (0.0005) [2023-03-09 07:15:00,037][635776] Updated weights for policy 0, policy_version 113200 (0.0006) [2023-03-09 07:15:00,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 10219.2). Total num frames: 57958400. Throughput: 0: 10140.6. Samples: 57947704. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:15:00,122][635489] Avg episode reward: [(0, '3560.076')] [2023-03-09 07:15:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000113200_57958400.pth... [2023-03-09 07:15:00,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000112600_57651200.pth [2023-03-09 07:15:04,159][635776] Updated weights for policy 0, policy_version 113280 (0.0005) [2023-03-09 07:15:05,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 10205.3). Total num frames: 58007552. Throughput: 0: 10153.4. Samples: 58007552. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:15:05,122][635489] Avg episode reward: [(0, '3944.221')] [2023-03-09 07:15:07,973][635776] Updated weights for policy 0, policy_version 113360 (0.0005) [2023-03-09 07:15:10,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10219.2). Total num frames: 58060800. Throughput: 0: 10200.8. Samples: 58039936. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:15:10,122][635489] Avg episode reward: [(0, '4169.595')] [2023-03-09 07:15:11,968][635776] Updated weights for policy 0, policy_version 113440 (0.0005) [2023-03-09 07:15:15,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10103.5, 300 sec: 10219.2). Total num frames: 58109952. Throughput: 0: 10190.0. Samples: 58100764. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:15:15,122][635489] Avg episode reward: [(0, '3796.290')] [2023-03-09 07:15:15,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000113496_58109952.pth... [2023-03-09 07:15:15,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000112904_57806848.pth [2023-03-09 07:15:15,937][635776] Updated weights for policy 0, policy_version 113520 (0.0005) [2023-03-09 07:15:19,766][635776] Updated weights for policy 0, policy_version 113600 (0.0005) [2023-03-09 07:15:20,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10219.2). Total num frames: 58163200. Throughput: 0: 10229.2. Samples: 58164096. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:15:20,122][635489] Avg episode reward: [(0, '3887.582')] [2023-03-09 07:15:23,662][635776] Updated weights for policy 0, policy_version 113680 (0.0005) [2023-03-09 07:15:25,121][635489] Fps is (10 sec: 10649.6, 60 sec: 10240.0, 300 sec: 10246.9). Total num frames: 58216448. Throughput: 0: 10275.2. Samples: 58196040. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:15:25,122][635489] Avg episode reward: [(0, '3860.986')] [2023-03-09 07:15:27,697][635776] Updated weights for policy 0, policy_version 113760 (0.0005) [2023-03-09 07:15:30,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10233.1). Total num frames: 58265600. Throughput: 0: 10287.6. Samples: 58257344. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:15:30,122][635489] Avg episode reward: [(0, '3541.029')] [2023-03-09 07:15:30,169][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000113808_58269696.pth... [2023-03-09 07:15:30,171][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000113200_57958400.pth [2023-03-09 07:15:31,692][635776] Updated weights for policy 0, policy_version 113840 (0.0004) [2023-03-09 07:15:35,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10171.8, 300 sec: 10233.1). Total num frames: 58314752. Throughput: 0: 10233.1. Samples: 58316552. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:15:35,122][635489] Avg episode reward: [(0, '3844.730')] [2023-03-09 07:15:35,956][635776] Updated weights for policy 0, policy_version 113920 (0.0005) [2023-03-09 07:15:39,718][635776] Updated weights for policy 0, policy_version 114000 (0.0004) [2023-03-09 07:15:40,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10233.1). Total num frames: 58368000. Throughput: 0: 10304.7. Samples: 58348960. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:15:40,122][635489] Avg episode reward: [(0, '4095.221')] [2023-03-09 07:15:44,132][635776] Updated weights for policy 0, policy_version 114080 (0.0005) [2023-03-09 07:15:45,122][635489] Fps is (10 sec: 10239.8, 60 sec: 10171.7, 300 sec: 10233.1). Total num frames: 58417152. Throughput: 0: 10213.4. Samples: 58407308. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:15:45,122][635489] Avg episode reward: [(0, '4301.778')] [2023-03-09 07:15:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000114096_58417152.pth... [2023-03-09 07:15:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000113496_58109952.pth [2023-03-09 07:15:48,353][635776] Updated weights for policy 0, policy_version 114160 (0.0005) [2023-03-09 07:15:50,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10171.7, 300 sec: 10233.1). Total num frames: 58466304. Throughput: 0: 10174.7. Samples: 58465412. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:15:50,122][635489] Avg episode reward: [(0, '4244.546')] [2023-03-09 07:15:52,237][635776] Updated weights for policy 0, policy_version 114240 (0.0004) [2023-03-09 07:15:55,122][635489] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10233.1). Total num frames: 58519552. Throughput: 0: 10188.1. Samples: 58498400. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:15:55,122][635489] Avg episode reward: [(0, '3873.658')] [2023-03-09 07:15:56,273][635776] Updated weights for policy 0, policy_version 114320 (0.0005) [2023-03-09 07:16:00,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10171.8, 300 sec: 10233.1). Total num frames: 58568704. Throughput: 0: 10213.5. Samples: 58560372. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:16:00,122][635489] Avg episode reward: [(0, '3767.550')] [2023-03-09 07:16:00,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000114392_58568704.pth... [2023-03-09 07:16:00,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000113808_58269696.pth [2023-03-09 07:16:00,180][635776] Updated weights for policy 0, policy_version 114400 (0.0004) [2023-03-09 07:16:03,931][635776] Updated weights for policy 0, policy_version 114480 (0.0005) [2023-03-09 07:16:05,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10219.2). Total num frames: 58621952. Throughput: 0: 10210.7. Samples: 58623576. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:16:05,122][635489] Avg episode reward: [(0, '4134.890')] [2023-03-09 07:16:07,899][635776] Updated weights for policy 0, policy_version 114560 (0.0004) [2023-03-09 07:16:10,121][635489] Fps is (10 sec: 10649.6, 60 sec: 10240.0, 300 sec: 10219.2). Total num frames: 58675200. Throughput: 0: 10193.3. Samples: 58654740. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:16:10,127][635489] Avg episode reward: [(0, '3884.309')] [2023-03-09 07:16:11,908][635776] Updated weights for policy 0, policy_version 114640 (0.0004) [2023-03-09 07:16:15,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10308.2, 300 sec: 10219.2). Total num frames: 58728448. Throughput: 0: 10191.6. Samples: 58715968. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:16:15,133][635489] Avg episode reward: [(0, '4126.431')] [2023-03-09 07:16:15,136][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000114704_58728448.pth... [2023-03-09 07:16:15,139][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000114096_58417152.pth [2023-03-09 07:16:15,899][635776] Updated weights for policy 0, policy_version 114720 (0.0005) [2023-03-09 07:16:19,992][635776] Updated weights for policy 0, policy_version 114800 (0.0005) [2023-03-09 07:16:20,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10219.2). Total num frames: 58777600. Throughput: 0: 10225.7. Samples: 58776708. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:16:20,122][635489] Avg episode reward: [(0, '4080.839')] [2023-03-09 07:16:24,078][635776] Updated weights for policy 0, policy_version 114880 (0.0005) [2023-03-09 07:16:25,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10171.7, 300 sec: 10219.2). Total num frames: 58826752. Throughput: 0: 10166.1. Samples: 58806436. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:16:25,122][635489] Avg episode reward: [(0, '4114.179')] [2023-03-09 07:16:28,265][635776] Updated weights for policy 0, policy_version 114960 (0.0005) [2023-03-09 07:16:30,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10171.7, 300 sec: 10219.2). Total num frames: 58875904. Throughput: 0: 10210.0. Samples: 58866756. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:16:30,122][635489] Avg episode reward: [(0, '4352.629')] [2023-03-09 07:16:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000114992_58875904.pth... [2023-03-09 07:16:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000114392_58568704.pth [2023-03-09 07:16:32,284][635776] Updated weights for policy 0, policy_version 115040 (0.0004) [2023-03-09 07:16:35,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10233.1). Total num frames: 58929152. Throughput: 0: 10267.6. Samples: 58927456. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:16:35,122][635489] Avg episode reward: [(0, '4342.605')] [2023-03-09 07:16:36,417][635776] Updated weights for policy 0, policy_version 115120 (0.0005) [2023-03-09 07:16:40,122][635489] Fps is (10 sec: 10240.1, 60 sec: 10171.7, 300 sec: 10219.2). Total num frames: 58978304. Throughput: 0: 10182.5. Samples: 58956612. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:16:40,122][635489] Avg episode reward: [(0, '4325.503')] [2023-03-09 07:16:40,505][635776] Updated weights for policy 0, policy_version 115200 (0.0005) [2023-03-09 07:16:44,339][635776] Updated weights for policy 0, policy_version 115280 (0.0005) [2023-03-09 07:16:45,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10171.8, 300 sec: 10219.2). Total num frames: 59027456. Throughput: 0: 10197.6. Samples: 59019264. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:16:45,122][635489] Avg episode reward: [(0, '4391.511')] [2023-03-09 07:16:45,142][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000115296_59031552.pth... [2023-03-09 07:16:45,143][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000114704_58728448.pth [2023-03-09 07:16:48,568][635776] Updated weights for policy 0, policy_version 115360 (0.0005) [2023-03-09 07:16:50,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10205.3). Total num frames: 59076608. Throughput: 0: 10084.5. Samples: 59077380. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:16:50,122][635489] Avg episode reward: [(0, '4455.000')] [2023-03-09 07:16:52,706][635776] Updated weights for policy 0, policy_version 115440 (0.0005) [2023-03-09 07:16:55,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10205.3). Total num frames: 59125760. Throughput: 0: 10053.3. Samples: 59107140. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:16:55,122][635489] Avg episode reward: [(0, '4322.654')] [2023-03-09 07:16:57,052][635776] Updated weights for policy 0, policy_version 115520 (0.0004) [2023-03-09 07:17:00,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10205.3). Total num frames: 59174912. Throughput: 0: 9938.5. Samples: 59163200. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:17:00,122][635489] Avg episode reward: [(0, '4360.254')] [2023-03-09 07:17:00,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000115576_59174912.pth... [2023-03-09 07:17:00,129][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000114992_58875904.pth [2023-03-09 07:17:01,161][635776] Updated weights for policy 0, policy_version 115600 (0.0005) [2023-03-09 07:17:05,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10191.4). Total num frames: 59224064. Throughput: 0: 9942.8. Samples: 59224136. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:17:05,122][635489] Avg episode reward: [(0, '4302.143')] [2023-03-09 07:17:05,322][635776] Updated weights for policy 0, policy_version 115680 (0.0005) [2023-03-09 07:17:09,688][635776] Updated weights for policy 0, policy_version 115760 (0.0004) [2023-03-09 07:17:10,121][635489] Fps is (10 sec: 9420.8, 60 sec: 9898.7, 300 sec: 10177.5). Total num frames: 59269120. Throughput: 0: 9915.5. Samples: 59252632. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:17:10,122][635489] Avg episode reward: [(0, '4384.491')] [2023-03-09 07:17:13,995][635776] Updated weights for policy 0, policy_version 115840 (0.0005) [2023-03-09 07:17:15,122][635489] Fps is (10 sec: 9420.7, 60 sec: 9830.4, 300 sec: 10149.7). Total num frames: 59318272. Throughput: 0: 9849.2. Samples: 59309972. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:17:15,122][635489] Avg episode reward: [(0, '4232.883')] [2023-03-09 07:17:15,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000115856_59318272.pth... [2023-03-09 07:17:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000115296_59031552.pth [2023-03-09 07:17:17,902][635776] Updated weights for policy 0, policy_version 115920 (0.0005) [2023-03-09 07:17:20,121][635489] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 10163.6). Total num frames: 59371520. Throughput: 0: 9858.8. Samples: 59371100. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:17:20,122][635489] Avg episode reward: [(0, '3440.129')] [2023-03-09 07:17:21,897][635776] Updated weights for policy 0, policy_version 116000 (0.0005) [2023-03-09 07:17:25,121][635489] Fps is (10 sec: 10240.2, 60 sec: 9898.7, 300 sec: 10149.8). Total num frames: 59420672. Throughput: 0: 9893.9. Samples: 59401836. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:17:25,122][635489] Avg episode reward: [(0, '3809.498')] [2023-03-09 07:17:25,910][635776] Updated weights for policy 0, policy_version 116080 (0.0005) [2023-03-09 07:17:30,121][635489] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10135.9). Total num frames: 59469824. Throughput: 0: 9834.8. Samples: 59461832. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:17:30,122][635489] Avg episode reward: [(0, '4423.197')] [2023-03-09 07:17:30,151][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000116160_59473920.pth... [2023-03-09 07:17:30,152][635776] Updated weights for policy 0, policy_version 116160 (0.0005) [2023-03-09 07:17:30,153][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000115576_59174912.pth [2023-03-09 07:17:34,295][635776] Updated weights for policy 0, policy_version 116240 (0.0005) [2023-03-09 07:17:35,122][635489] Fps is (10 sec: 10239.9, 60 sec: 9898.7, 300 sec: 10135.9). Total num frames: 59523072. Throughput: 0: 9864.7. Samples: 59521292. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:17:35,122][635489] Avg episode reward: [(0, '4523.253')] [2023-03-09 07:17:38,522][635776] Updated weights for policy 0, policy_version 116320 (0.0004) [2023-03-09 07:17:40,121][635489] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10122.0). Total num frames: 59568128. Throughput: 0: 9853.0. Samples: 59550524. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:17:40,122][635489] Avg episode reward: [(0, '4552.101')] [2023-03-09 07:17:42,761][635776] Updated weights for policy 0, policy_version 116400 (0.0004) [2023-03-09 07:17:45,122][635489] Fps is (10 sec: 9420.8, 60 sec: 9830.4, 300 sec: 10122.0). Total num frames: 59617280. Throughput: 0: 9901.2. Samples: 59608756. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:17:45,122][635489] Avg episode reward: [(0, '4340.381')] [2023-03-09 07:17:45,163][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000116448_59621376.pth... [2023-03-09 07:17:45,165][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000115856_59318272.pth [2023-03-09 07:17:46,850][635776] Updated weights for policy 0, policy_version 116480 (0.0005) [2023-03-09 07:17:50,122][635489] Fps is (10 sec: 10239.9, 60 sec: 9898.7, 300 sec: 10135.9). Total num frames: 59670528. Throughput: 0: 9905.7. Samples: 59669892. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:17:50,122][635489] Avg episode reward: [(0, '4483.109')] [2023-03-09 07:17:50,921][635776] Updated weights for policy 0, policy_version 116560 (0.0004) [2023-03-09 07:17:55,121][635489] Fps is (10 sec: 9830.5, 60 sec: 9830.4, 300 sec: 10122.0). Total num frames: 59715584. Throughput: 0: 9901.8. Samples: 59698212. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:17:55,122][635489] Avg episode reward: [(0, '4558.119')] [2023-03-09 07:17:55,153][635776] Updated weights for policy 0, policy_version 116640 (0.0005) [2023-03-09 07:17:59,217][635776] Updated weights for policy 0, policy_version 116720 (0.0005) [2023-03-09 07:18:00,122][635489] Fps is (10 sec: 9830.3, 60 sec: 9898.7, 300 sec: 10122.0). Total num frames: 59768832. Throughput: 0: 9925.6. Samples: 59756624. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:18:00,122][635489] Avg episode reward: [(0, '4568.543')] [2023-03-09 07:18:00,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000116736_59768832.pth... [2023-03-09 07:18:00,129][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000116160_59473920.pth [2023-03-09 07:18:03,478][635776] Updated weights for policy 0, policy_version 116800 (0.0005) [2023-03-09 07:18:05,122][635489] Fps is (10 sec: 9830.3, 60 sec: 9830.4, 300 sec: 10094.2). Total num frames: 59813888. Throughput: 0: 9873.0. Samples: 59815384. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:18:05,122][635489] Avg episode reward: [(0, '4571.386')] [2023-03-09 07:18:07,495][635776] Updated weights for policy 0, policy_version 116880 (0.0005) [2023-03-09 07:18:10,121][635489] Fps is (10 sec: 9830.5, 60 sec: 9966.9, 300 sec: 10108.1). Total num frames: 59867136. Throughput: 0: 9885.1. Samples: 59846664. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:18:10,122][635489] Avg episode reward: [(0, '4515.037')] [2023-03-09 07:18:11,471][635776] Updated weights for policy 0, policy_version 116960 (0.0005) [2023-03-09 07:18:15,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10035.2, 300 sec: 10108.1). Total num frames: 59920384. Throughput: 0: 9918.4. Samples: 59908160. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:18:15,122][635489] Avg episode reward: [(0, '4265.190')] [2023-03-09 07:18:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000117032_59920384.pth... [2023-03-09 07:18:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000116448_59621376.pth [2023-03-09 07:18:15,487][635776] Updated weights for policy 0, policy_version 117040 (0.0005) [2023-03-09 07:18:19,657][635776] Updated weights for policy 0, policy_version 117120 (0.0005) [2023-03-09 07:18:20,122][635489] Fps is (10 sec: 10239.8, 60 sec: 9966.9, 300 sec: 10108.1). Total num frames: 59969536. Throughput: 0: 9931.3. Samples: 59968200. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:18:20,122][635489] Avg episode reward: [(0, '4457.431')] [2023-03-09 07:18:23,720][635776] Updated weights for policy 0, policy_version 117200 (0.0005) [2023-03-09 07:18:25,122][635489] Fps is (10 sec: 9830.5, 60 sec: 9966.9, 300 sec: 10080.3). Total num frames: 60018688. Throughput: 0: 9949.9. Samples: 59998272. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:18:25,122][635489] Avg episode reward: [(0, '4576.449')] [2023-03-09 07:18:27,775][635776] Updated weights for policy 0, policy_version 117280 (0.0005) [2023-03-09 07:18:30,122][635489] Fps is (10 sec: 9830.5, 60 sec: 9966.9, 300 sec: 10080.3). Total num frames: 60067840. Throughput: 0: 10014.7. Samples: 60059416. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:18:30,122][635489] Avg episode reward: [(0, '4573.798')] [2023-03-09 07:18:30,133][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000117328_60071936.pth... [2023-03-09 07:18:30,136][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000116736_59768832.pth [2023-03-09 07:18:31,655][635776] Updated weights for policy 0, policy_version 117360 (0.0005) [2023-03-09 07:18:35,122][635489] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 10094.2). Total num frames: 60121088. Throughput: 0: 10006.2. Samples: 60120172. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:18:35,122][635489] Avg episode reward: [(0, '4519.114')] [2023-03-09 07:18:35,884][635776] Updated weights for policy 0, policy_version 117440 (0.0004) [2023-03-09 07:18:40,018][635776] Updated weights for policy 0, policy_version 117520 (0.0005) [2023-03-09 07:18:40,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10035.2, 300 sec: 10080.3). Total num frames: 60170240. Throughput: 0: 10020.0. Samples: 60149116. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:18:40,122][635489] Avg episode reward: [(0, '4363.778')] [2023-03-09 07:18:43,914][635776] Updated weights for policy 0, policy_version 117600 (0.0005) [2023-03-09 07:18:45,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10103.4, 300 sec: 10094.2). Total num frames: 60223488. Throughput: 0: 10101.7. Samples: 60211200. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:18:45,122][635489] Avg episode reward: [(0, '4326.320')] [2023-03-09 07:18:45,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000117624_60223488.pth... [2023-03-09 07:18:45,130][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000117032_59920384.pth [2023-03-09 07:18:48,061][635776] Updated weights for policy 0, policy_version 117680 (0.0005) [2023-03-09 07:18:50,122][635489] Fps is (10 sec: 10240.1, 60 sec: 10035.2, 300 sec: 10094.2). Total num frames: 60272640. Throughput: 0: 10118.8. Samples: 60270732. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:18:50,122][635489] Avg episode reward: [(0, '4222.242')] [2023-03-09 07:18:52,142][635776] Updated weights for policy 0, policy_version 117760 (0.0005) [2023-03-09 07:18:55,121][635489] Fps is (10 sec: 9421.0, 60 sec: 10035.2, 300 sec: 10066.4). Total num frames: 60317696. Throughput: 0: 10089.6. Samples: 60300696. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:18:55,122][635489] Avg episode reward: [(0, '4237.790')] [2023-03-09 07:18:56,349][635776] Updated weights for policy 0, policy_version 117840 (0.0005) [2023-03-09 07:19:00,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10066.4). Total num frames: 60370944. Throughput: 0: 10019.0. Samples: 60359016. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:19:00,122][635489] Avg episode reward: [(0, '4542.631')] [2023-03-09 07:19:00,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000117912_60370944.pth... [2023-03-09 07:19:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000117328_60071936.pth [2023-03-09 07:19:00,599][635776] Updated weights for policy 0, policy_version 117920 (0.0004) [2023-03-09 07:19:04,542][635776] Updated weights for policy 0, policy_version 118000 (0.0005) [2023-03-09 07:19:05,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 10066.4). Total num frames: 60420096. Throughput: 0: 10042.2. Samples: 60420096. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:19:05,122][635489] Avg episode reward: [(0, '4390.518')] [2023-03-09 07:19:08,510][635776] Updated weights for policy 0, policy_version 118080 (0.0005) [2023-03-09 07:19:10,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10103.5, 300 sec: 10066.4). Total num frames: 60473344. Throughput: 0: 10028.6. Samples: 60449556. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:19:10,122][635489] Avg episode reward: [(0, '4453.108')] [2023-03-09 07:19:12,090][635776] Updated weights for policy 0, policy_version 118160 (0.0005) [2023-03-09 07:19:15,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10103.5, 300 sec: 10094.2). Total num frames: 60526592. Throughput: 0: 10149.5. Samples: 60516144. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:19:15,122][635489] Avg episode reward: [(0, '4158.473')] [2023-03-09 07:19:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000118216_60526592.pth... [2023-03-09 07:19:15,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000117624_60223488.pth [2023-03-09 07:19:16,017][635776] Updated weights for policy 0, policy_version 118240 (0.0005) [2023-03-09 07:19:20,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 10080.3). Total num frames: 60575744. Throughput: 0: 10129.8. Samples: 60576012. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:19:20,122][635489] Avg episode reward: [(0, '3855.593')] [2023-03-09 07:19:20,265][635776] Updated weights for policy 0, policy_version 118320 (0.0005) [2023-03-09 07:19:24,021][635776] Updated weights for policy 0, policy_version 118400 (0.0005) [2023-03-09 07:19:25,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10171.8, 300 sec: 10094.2). Total num frames: 60628992. Throughput: 0: 10210.3. Samples: 60608576. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:19:25,122][635489] Avg episode reward: [(0, '4294.458')] [2023-03-09 07:19:27,884][635776] Updated weights for policy 0, policy_version 118480 (0.0005) [2023-03-09 07:19:30,026][635732] Early stopping after 2 epochs (8 sgd steps), loss delta 0.0000007 [2023-03-09 07:19:30,122][635489] Fps is (10 sec: 11059.2, 60 sec: 10308.3, 300 sec: 10108.1). Total num frames: 60686336. Throughput: 0: 10238.8. Samples: 60671944. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:19:30,122][635489] Avg episode reward: [(0, '4200.108')] [2023-03-09 07:19:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000118528_60686336.pth... [2023-03-09 07:19:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000117912_60370944.pth [2023-03-09 07:19:31,716][635776] Updated weights for policy 0, policy_version 118560 (0.0005) [2023-03-09 07:19:35,121][635489] Fps is (10 sec: 11059.1, 60 sec: 10308.3, 300 sec: 10122.0). Total num frames: 60739584. Throughput: 0: 10404.4. Samples: 60738928. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:19:35,122][635489] Avg episode reward: [(0, '4233.881')] [2023-03-09 07:19:35,309][635776] Updated weights for policy 0, policy_version 118640 (0.0005) [2023-03-09 07:19:39,293][635776] Updated weights for policy 0, policy_version 118720 (0.0005) [2023-03-09 07:19:40,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10122.0). Total num frames: 60792832. Throughput: 0: 10465.0. Samples: 60771624. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:19:40,122][635489] Avg episode reward: [(0, '4414.062')] [2023-03-09 07:19:43,323][635776] Updated weights for policy 0, policy_version 118800 (0.0005) [2023-03-09 07:19:45,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10122.0). Total num frames: 60841984. Throughput: 0: 10527.3. Samples: 60832744. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:19:45,122][635489] Avg episode reward: [(0, '4428.517')] [2023-03-09 07:19:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000118832_60841984.pth... [2023-03-09 07:19:45,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000118216_60526592.pth [2023-03-09 07:19:46,908][635776] Updated weights for policy 0, policy_version 118880 (0.0005) [2023-03-09 07:19:50,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10376.6, 300 sec: 10135.9). Total num frames: 60895232. Throughput: 0: 10568.4. Samples: 60895672. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:19:50,122][635489] Avg episode reward: [(0, '4387.729')] [2023-03-09 07:19:51,004][635776] Updated weights for policy 0, policy_version 118960 (0.0004) [2023-03-09 07:19:55,108][635776] Updated weights for policy 0, policy_version 119040 (0.0005) [2023-03-09 07:19:55,121][635489] Fps is (10 sec: 10649.7, 60 sec: 10513.1, 300 sec: 10135.9). Total num frames: 60948480. Throughput: 0: 10630.1. Samples: 60927912. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:19:55,122][635489] Avg episode reward: [(0, '4500.226')] [2023-03-09 07:19:59,181][635776] Updated weights for policy 0, policy_version 119120 (0.0004) [2023-03-09 07:20:00,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10444.8, 300 sec: 10135.9). Total num frames: 60997632. Throughput: 0: 10444.5. Samples: 60986148. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:20:00,122][635489] Avg episode reward: [(0, '4322.110')] [2023-03-09 07:20:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000119136_60997632.pth... [2023-03-09 07:20:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000118528_60686336.pth [2023-03-09 07:20:03,302][635776] Updated weights for policy 0, policy_version 119200 (0.0005) [2023-03-09 07:20:05,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10444.8, 300 sec: 10122.0). Total num frames: 61046784. Throughput: 0: 10459.1. Samples: 61046672. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:20:05,122][635489] Avg episode reward: [(0, '4360.626')] [2023-03-09 07:20:07,238][635776] Updated weights for policy 0, policy_version 119280 (0.0005) [2023-03-09 07:20:10,121][635489] Fps is (10 sec: 9830.6, 60 sec: 10376.5, 300 sec: 10122.0). Total num frames: 61095936. Throughput: 0: 10447.7. Samples: 61078724. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:20:10,122][635489] Avg episode reward: [(0, '4491.828')] [2023-03-09 07:20:11,362][635776] Updated weights for policy 0, policy_version 119360 (0.0005) [2023-03-09 07:20:15,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10122.0). Total num frames: 61149184. Throughput: 0: 10342.0. Samples: 61137336. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:20:15,122][635489] Avg episode reward: [(0, '4459.722')] [2023-03-09 07:20:15,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000119432_61149184.pth... [2023-03-09 07:20:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000118832_60841984.pth [2023-03-09 07:20:15,486][635776] Updated weights for policy 0, policy_version 119440 (0.0005) [2023-03-09 07:20:19,702][635776] Updated weights for policy 0, policy_version 119520 (0.0004) [2023-03-09 07:20:20,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 10108.1). Total num frames: 61198336. Throughput: 0: 10163.0. Samples: 61196264. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:20:20,122][635489] Avg episode reward: [(0, '4503.798')] [2023-03-09 07:20:23,577][635776] Updated weights for policy 0, policy_version 119600 (0.0005) [2023-03-09 07:20:25,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10308.3, 300 sec: 10108.1). Total num frames: 61247488. Throughput: 0: 10132.4. Samples: 61227580. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:20:25,122][635489] Avg episode reward: [(0, '4545.598')] [2023-03-09 07:20:27,698][635776] Updated weights for policy 0, policy_version 119680 (0.0004) [2023-03-09 07:20:30,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10122.0). Total num frames: 61300736. Throughput: 0: 10126.6. Samples: 61288444. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:20:30,122][635489] Avg episode reward: [(0, '4559.153')] [2023-03-09 07:20:30,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000119728_61300736.pth... [2023-03-09 07:20:30,129][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000119136_60997632.pth [2023-03-09 07:20:31,639][635776] Updated weights for policy 0, policy_version 119760 (0.0005) [2023-03-09 07:20:35,121][635489] Fps is (10 sec: 10649.6, 60 sec: 10240.0, 300 sec: 10122.0). Total num frames: 61353984. Throughput: 0: 10184.7. Samples: 61353984. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:20:35,122][635489] Avg episode reward: [(0, '4557.268')] [2023-03-09 07:20:35,326][635776] Updated weights for policy 0, policy_version 119840 (0.0005) [2023-03-09 07:20:39,399][635776] Updated weights for policy 0, policy_version 119920 (0.0004) [2023-03-09 07:20:40,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10171.7, 300 sec: 10122.0). Total num frames: 61403136. Throughput: 0: 10163.5. Samples: 61385268. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:20:40,122][635489] Avg episode reward: [(0, '4568.342')] [2023-03-09 07:20:43,369][635776] Updated weights for policy 0, policy_version 120000 (0.0005) [2023-03-09 07:20:45,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10135.9). Total num frames: 61456384. Throughput: 0: 10219.1. Samples: 61446008. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:20:45,122][635489] Avg episode reward: [(0, '4586.448')] [2023-03-09 07:20:45,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000120032_61456384.pth... [2023-03-09 07:20:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000119432_61149184.pth [2023-03-09 07:20:47,527][635776] Updated weights for policy 0, policy_version 120080 (0.0005) [2023-03-09 07:20:50,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10171.7, 300 sec: 10122.0). Total num frames: 61505536. Throughput: 0: 10163.8. Samples: 61504044. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:20:50,122][635489] Avg episode reward: [(0, '4518.946')] [2023-03-09 07:20:51,615][635776] Updated weights for policy 0, policy_version 120160 (0.0005) [2023-03-09 07:20:55,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10103.5, 300 sec: 10122.0). Total num frames: 61554688. Throughput: 0: 10138.0. Samples: 61534936. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:20:55,122][635489] Avg episode reward: [(0, '4519.882')] [2023-03-09 07:20:55,550][635776] Updated weights for policy 0, policy_version 120240 (0.0004) [2023-03-09 07:20:59,617][635776] Updated weights for policy 0, policy_version 120320 (0.0005) [2023-03-09 07:21:00,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.8, 300 sec: 10122.0). Total num frames: 61607936. Throughput: 0: 10186.3. Samples: 61595720. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:21:00,122][635489] Avg episode reward: [(0, '4434.255')] [2023-03-09 07:21:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000120328_61607936.pth... [2023-03-09 07:21:00,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000119728_61300736.pth [2023-03-09 07:21:03,678][635776] Updated weights for policy 0, policy_version 120400 (0.0005) [2023-03-09 07:21:05,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10094.2). Total num frames: 61652992. Throughput: 0: 10209.9. Samples: 61655708. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:21:05,122][635489] Avg episode reward: [(0, '4425.166')] [2023-03-09 07:21:08,221][635776] Updated weights for policy 0, policy_version 120480 (0.0004) [2023-03-09 07:21:10,122][635489] Fps is (10 sec: 9420.7, 60 sec: 10103.4, 300 sec: 10080.3). Total num frames: 61702144. Throughput: 0: 10114.6. Samples: 61682736. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 07:21:10,122][635489] Avg episode reward: [(0, '4235.412')] [2023-03-09 07:21:12,155][635776] Updated weights for policy 0, policy_version 120560 (0.0005) [2023-03-09 07:21:15,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 10094.2). Total num frames: 61755392. Throughput: 0: 10143.9. Samples: 61744920. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 07:21:15,122][635489] Avg episode reward: [(0, '4332.935')] [2023-03-09 07:21:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000120616_61755392.pth... [2023-03-09 07:21:15,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000120032_61456384.pth [2023-03-09 07:21:15,983][635776] Updated weights for policy 0, policy_version 120640 (0.0005) [2023-03-09 07:21:19,656][635776] Updated weights for policy 0, policy_version 120720 (0.0005) [2023-03-09 07:21:20,122][635489] Fps is (10 sec: 11059.2, 60 sec: 10240.0, 300 sec: 10122.0). Total num frames: 61812736. Throughput: 0: 10172.0. Samples: 61811724. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 07:21:20,122][635489] Avg episode reward: [(0, '4504.939')] [2023-03-09 07:21:23,613][635776] Updated weights for policy 0, policy_version 120800 (0.0005) [2023-03-09 07:21:25,121][635489] Fps is (10 sec: 11059.4, 60 sec: 10308.3, 300 sec: 10135.9). Total num frames: 61865984. Throughput: 0: 10142.9. Samples: 61841700. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 07:21:25,122][635489] Avg episode reward: [(0, '4556.944')] [2023-03-09 07:21:27,332][635776] Updated weights for policy 0, policy_version 120880 (0.0005) [2023-03-09 07:21:30,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10122.0). Total num frames: 61915136. Throughput: 0: 10236.7. Samples: 61906660. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 07:21:30,122][635489] Avg episode reward: [(0, '4447.638')] [2023-03-09 07:21:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000120928_61915136.pth... [2023-03-09 07:21:30,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000120328_61607936.pth [2023-03-09 07:21:31,573][635776] Updated weights for policy 0, policy_version 120960 (0.0005) [2023-03-09 07:21:35,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10135.9). Total num frames: 61968384. Throughput: 0: 10314.0. Samples: 61968176. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 07:21:35,122][635489] Avg episode reward: [(0, '4039.970')] [2023-03-09 07:21:35,354][635776] Updated weights for policy 0, policy_version 121040 (0.0005) [2023-03-09 07:21:39,364][635776] Updated weights for policy 0, policy_version 121120 (0.0005) [2023-03-09 07:21:40,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10135.9). Total num frames: 62017536. Throughput: 0: 10296.0. Samples: 61998256. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 07:21:40,122][635489] Avg episode reward: [(0, '3848.109')] [2023-03-09 07:21:43,221][635776] Updated weights for policy 0, policy_version 121200 (0.0005) [2023-03-09 07:21:45,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10308.2, 300 sec: 10163.6). Total num frames: 62074880. Throughput: 0: 10366.7. Samples: 62062224. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 07:21:45,122][635489] Avg episode reward: [(0, '4127.075')] [2023-03-09 07:21:45,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000121240_62074880.pth... [2023-03-09 07:21:45,129][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000120616_61755392.pth [2023-03-09 07:21:47,113][635776] Updated weights for policy 0, policy_version 121280 (0.0006) [2023-03-09 07:21:50,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10308.3, 300 sec: 10163.6). Total num frames: 62124032. Throughput: 0: 10408.6. Samples: 62124096. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 07:21:50,122][635489] Avg episode reward: [(0, '4368.221')] [2023-03-09 07:21:51,175][635776] Updated weights for policy 0, policy_version 121360 (0.0005) [2023-03-09 07:21:55,098][635776] Updated weights for policy 0, policy_version 121440 (0.0005) [2023-03-09 07:21:55,122][635489] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10177.5). Total num frames: 62177280. Throughput: 0: 10469.1. Samples: 62153844. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 07:21:55,122][635489] Avg episode reward: [(0, '4404.327')] [2023-03-09 07:21:59,239][635776] Updated weights for policy 0, policy_version 121520 (0.0005) [2023-03-09 07:22:00,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 10163.6). Total num frames: 62222336. Throughput: 0: 10429.4. Samples: 62214244. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 07:22:00,122][635489] Avg episode reward: [(0, '4401.263')] [2023-03-09 07:22:00,132][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000121536_62226432.pth... [2023-03-09 07:22:00,133][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000120928_61915136.pth [2023-03-09 07:22:03,281][635776] Updated weights for policy 0, policy_version 121600 (0.0005) [2023-03-09 07:22:05,122][635489] Fps is (10 sec: 9830.5, 60 sec: 10376.5, 300 sec: 10191.4). Total num frames: 62275584. Throughput: 0: 10308.0. Samples: 62275584. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 07:22:05,122][635489] Avg episode reward: [(0, '4594.313')] [2023-03-09 07:22:07,323][635776] Updated weights for policy 0, policy_version 121680 (0.0005) [2023-03-09 07:22:10,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10191.4). Total num frames: 62324736. Throughput: 0: 10345.7. Samples: 62307256. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:22:10,122][635489] Avg episode reward: [(0, '4599.256')] [2023-03-09 07:22:11,289][635776] Updated weights for policy 0, policy_version 121760 (0.0004) [2023-03-09 07:22:15,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 10191.4). Total num frames: 62377984. Throughput: 0: 10242.7. Samples: 62367580. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:22:15,122][635489] Avg episode reward: [(0, '4538.648')] [2023-03-09 07:22:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000121832_62377984.pth... [2023-03-09 07:22:15,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000121240_62074880.pth [2023-03-09 07:22:15,202][635776] Updated weights for policy 0, policy_version 121840 (0.0005) [2023-03-09 07:22:19,132][635776] Updated weights for policy 0, policy_version 121920 (0.0005) [2023-03-09 07:22:20,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 10205.3). Total num frames: 62431232. Throughput: 0: 10290.3. Samples: 62431240. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:22:20,122][635489] Avg episode reward: [(0, '4479.103')] [2023-03-09 07:22:23,176][635776] Updated weights for policy 0, policy_version 122000 (0.0005) [2023-03-09 07:22:25,121][635489] Fps is (10 sec: 10240.2, 60 sec: 10240.0, 300 sec: 10205.3). Total num frames: 62480384. Throughput: 0: 10264.8. Samples: 62460172. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:22:25,122][635489] Avg episode reward: [(0, '4594.759')] [2023-03-09 07:22:27,340][635776] Updated weights for policy 0, policy_version 122080 (0.0005) [2023-03-09 07:22:30,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10205.3). Total num frames: 62533632. Throughput: 0: 10229.6. Samples: 62522556. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:22:30,122][635489] Avg episode reward: [(0, '4587.835')] [2023-03-09 07:22:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000122136_62533632.pth... [2023-03-09 07:22:30,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000121536_62226432.pth [2023-03-09 07:22:31,132][635776] Updated weights for policy 0, policy_version 122160 (0.0005) [2023-03-09 07:22:35,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10219.2). Total num frames: 62582784. Throughput: 0: 10167.2. Samples: 62581620. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:22:35,122][635489] Avg episode reward: [(0, '4601.575')] [2023-03-09 07:22:35,584][635776] Updated weights for policy 0, policy_version 122240 (0.0004) [2023-03-09 07:22:39,761][635776] Updated weights for policy 0, policy_version 122320 (0.0004) [2023-03-09 07:22:40,121][635489] Fps is (10 sec: 9420.8, 60 sec: 10171.7, 300 sec: 10205.3). Total num frames: 62627840. Throughput: 0: 10128.1. Samples: 62609608. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:22:40,122][635489] Avg episode reward: [(0, '4581.062')] [2023-03-09 07:22:43,556][635776] Updated weights for policy 0, policy_version 122400 (0.0005) [2023-03-09 07:22:45,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10103.5, 300 sec: 10205.3). Total num frames: 62681088. Throughput: 0: 10189.4. Samples: 62672768. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:22:45,122][635489] Avg episode reward: [(0, '4521.964')] [2023-03-09 07:22:45,175][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000122432_62685184.pth... [2023-03-09 07:22:45,177][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000121832_62377984.pth [2023-03-09 07:22:47,598][635776] Updated weights for policy 0, policy_version 122480 (0.0005) [2023-03-09 07:22:50,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10171.7, 300 sec: 10233.1). Total num frames: 62734336. Throughput: 0: 10161.3. Samples: 62732844. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:22:50,122][635489] Avg episode reward: [(0, '4562.241')] [2023-03-09 07:22:51,528][635776] Updated weights for policy 0, policy_version 122560 (0.0005) [2023-03-09 07:22:55,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10171.7, 300 sec: 10233.1). Total num frames: 62787584. Throughput: 0: 10166.7. Samples: 62764756. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:22:55,122][635489] Avg episode reward: [(0, '4552.004')] [2023-03-09 07:22:55,344][635776] Updated weights for policy 0, policy_version 122640 (0.0006) [2023-03-09 07:22:59,374][635776] Updated weights for policy 0, policy_version 122720 (0.0005) [2023-03-09 07:23:00,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10246.9). Total num frames: 62836736. Throughput: 0: 10239.1. Samples: 62828340. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:23:00,122][635489] Avg episode reward: [(0, '4541.293')] [2023-03-09 07:23:00,150][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000122736_62840832.pth... [2023-03-09 07:23:00,152][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000122136_62533632.pth [2023-03-09 07:23:03,436][635776] Updated weights for policy 0, policy_version 122800 (0.0005) [2023-03-09 07:23:05,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10246.9). Total num frames: 62889984. Throughput: 0: 10167.3. Samples: 62888768. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:23:05,122][635489] Avg episode reward: [(0, '4533.164')] [2023-03-09 07:23:07,374][635776] Updated weights for policy 0, policy_version 122880 (0.0004) [2023-03-09 07:23:10,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10233.1). Total num frames: 62939136. Throughput: 0: 10203.7. Samples: 62919340. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:23:10,122][635489] Avg episode reward: [(0, '4476.371')] [2023-03-09 07:23:11,451][635776] Updated weights for policy 0, policy_version 122960 (0.0005) [2023-03-09 07:23:15,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10246.9). Total num frames: 62992384. Throughput: 0: 10217.1. Samples: 62982328. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:23:15,122][635489] Avg episode reward: [(0, '4448.963')] [2023-03-09 07:23:15,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000123032_62992384.pth... [2023-03-09 07:23:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000122432_62685184.pth [2023-03-09 07:23:15,193][635776] Updated weights for policy 0, policy_version 123040 (0.0004) [2023-03-09 07:23:19,066][635776] Updated weights for policy 0, policy_version 123120 (0.0005) [2023-03-09 07:23:20,122][635489] Fps is (10 sec: 11059.1, 60 sec: 10308.3, 300 sec: 10274.7). Total num frames: 63049728. Throughput: 0: 10361.0. Samples: 63047868. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:23:20,122][635489] Avg episode reward: [(0, '4452.580')] [2023-03-09 07:23:22,605][635776] Updated weights for policy 0, policy_version 123200 (0.0005) [2023-03-09 07:23:25,121][635489] Fps is (10 sec: 11059.3, 60 sec: 10376.5, 300 sec: 10288.6). Total num frames: 63102976. Throughput: 0: 10495.7. Samples: 63081916. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:23:25,122][635489] Avg episode reward: [(0, '4331.142')] [2023-03-09 07:23:26,732][635776] Updated weights for policy 0, policy_version 123280 (0.0005) [2023-03-09 07:23:30,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10308.2, 300 sec: 10274.7). Total num frames: 63152128. Throughput: 0: 10423.1. Samples: 63141808. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:23:30,122][635489] Avg episode reward: [(0, '4335.173')] [2023-03-09 07:23:30,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000123344_63152128.pth... [2023-03-09 07:23:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000122736_62840832.pth [2023-03-09 07:23:30,840][635776] Updated weights for policy 0, policy_version 123360 (0.0005) [2023-03-09 07:23:34,848][635776] Updated weights for policy 0, policy_version 123440 (0.0005) [2023-03-09 07:23:35,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10308.3, 300 sec: 10274.7). Total num frames: 63201280. Throughput: 0: 10411.3. Samples: 63201352. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:23:35,122][635489] Avg episode reward: [(0, '4239.258')] [2023-03-09 07:23:38,742][635776] Updated weights for policy 0, policy_version 123520 (0.0005) [2023-03-09 07:23:40,122][635489] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10274.7). Total num frames: 63254528. Throughput: 0: 10424.3. Samples: 63233852. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:23:40,127][635489] Avg episode reward: [(0, '4177.945')] [2023-03-09 07:23:42,546][635776] Updated weights for policy 0, policy_version 123600 (0.0005) [2023-03-09 07:23:45,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10274.7). Total num frames: 63303680. Throughput: 0: 10381.3. Samples: 63295500. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:23:45,132][635489] Avg episode reward: [(0, '4197.525')] [2023-03-09 07:23:45,136][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000123648_63307776.pth... [2023-03-09 07:23:45,137][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000123032_62992384.pth [2023-03-09 07:23:46,923][635776] Updated weights for policy 0, policy_version 123680 (0.0005) [2023-03-09 07:23:50,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10308.3, 300 sec: 10288.6). Total num frames: 63352832. Throughput: 0: 10303.0. Samples: 63352404. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:23:50,132][635489] Avg episode reward: [(0, '3665.380')] [2023-03-09 07:23:51,203][635776] Updated weights for policy 0, policy_version 123760 (0.0005) [2023-03-09 07:23:55,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10240.0, 300 sec: 10274.7). Total num frames: 63401984. Throughput: 0: 10275.4. Samples: 63381732. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:23:55,132][635489] Avg episode reward: [(0, '3881.239')] [2023-03-09 07:23:55,379][635776] Updated weights for policy 0, policy_version 123840 (0.0005) [2023-03-09 07:23:59,279][635776] Updated weights for policy 0, policy_version 123920 (0.0005) [2023-03-09 07:24:00,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10308.2, 300 sec: 10288.6). Total num frames: 63455232. Throughput: 0: 10233.8. Samples: 63442848. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:24:00,133][635489] Avg episode reward: [(0, '3771.517')] [2023-03-09 07:24:00,137][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000123936_63455232.pth... [2023-03-09 07:24:00,139][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000123344_63152128.pth [2023-03-09 07:24:03,416][635776] Updated weights for policy 0, policy_version 124000 (0.0005) [2023-03-09 07:24:05,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10274.7). Total num frames: 63504384. Throughput: 0: 10119.6. Samples: 63503248. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:24:05,132][635489] Avg episode reward: [(0, '3953.136')] [2023-03-09 07:24:07,611][635776] Updated weights for policy 0, policy_version 124080 (0.0004) [2023-03-09 07:24:10,122][635489] Fps is (10 sec: 9420.9, 60 sec: 10171.7, 300 sec: 10246.9). Total num frames: 63549440. Throughput: 0: 10014.4. Samples: 63532564. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:24:10,132][635489] Avg episode reward: [(0, '4139.562')] [2023-03-09 07:24:11,728][635776] Updated weights for policy 0, policy_version 124160 (0.0005) [2023-03-09 07:24:15,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10171.7, 300 sec: 10260.8). Total num frames: 63602688. Throughput: 0: 9982.9. Samples: 63591040. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:24:15,133][635489] Avg episode reward: [(0, '3872.986')] [2023-03-09 07:24:15,136][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000124224_63602688.pth... [2023-03-09 07:24:15,138][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000123648_63307776.pth [2023-03-09 07:24:15,690][635776] Updated weights for policy 0, policy_version 124240 (0.0005) [2023-03-09 07:24:19,599][635776] Updated weights for policy 0, policy_version 124320 (0.0005) [2023-03-09 07:24:20,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10103.5, 300 sec: 10260.8). Total num frames: 63655936. Throughput: 0: 10097.1. Samples: 63655724. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:24:20,133][635489] Avg episode reward: [(0, '3771.365')] [2023-03-09 07:24:23,538][635776] Updated weights for policy 0, policy_version 124400 (0.0005) [2023-03-09 07:24:25,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10035.2, 300 sec: 10233.1). Total num frames: 63705088. Throughput: 0: 10048.9. Samples: 63686052. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:24:25,132][635489] Avg episode reward: [(0, '3511.915')] [2023-03-09 07:24:27,510][635776] Updated weights for policy 0, policy_version 124480 (0.0005) [2023-03-09 07:24:30,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10103.5, 300 sec: 10233.1). Total num frames: 63758336. Throughput: 0: 10060.9. Samples: 63748240. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:24:30,132][635489] Avg episode reward: [(0, '3541.357')] [2023-03-09 07:24:30,134][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000124528_63758336.pth... [2023-03-09 07:24:30,137][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000123936_63455232.pth [2023-03-09 07:24:31,553][635776] Updated weights for policy 0, policy_version 124560 (0.0005) [2023-03-09 07:24:35,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10103.4, 300 sec: 10219.2). Total num frames: 63807488. Throughput: 0: 10113.2. Samples: 63807500. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:24:35,133][635489] Avg episode reward: [(0, '3278.927')] [2023-03-09 07:24:35,788][635776] Updated weights for policy 0, policy_version 124640 (0.0005) [2023-03-09 07:24:39,956][635776] Updated weights for policy 0, policy_version 124720 (0.0005) [2023-03-09 07:24:40,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10035.2, 300 sec: 10219.2). Total num frames: 63856640. Throughput: 0: 10099.8. Samples: 63836224. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:24:40,133][635489] Avg episode reward: [(0, '3527.677')] [2023-03-09 07:24:43,974][635776] Updated weights for policy 0, policy_version 124800 (0.0005) [2023-03-09 07:24:45,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10035.2, 300 sec: 10205.3). Total num frames: 63905792. Throughput: 0: 10101.6. Samples: 63897420. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:24:45,132][635489] Avg episode reward: [(0, '3778.263')] [2023-03-09 07:24:45,135][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000124816_63905792.pth... [2023-03-09 07:24:45,136][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000124224_63602688.pth [2023-03-09 07:24:47,912][635776] Updated weights for policy 0, policy_version 124880 (0.0005) [2023-03-09 07:24:50,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10205.3). Total num frames: 63959040. Throughput: 0: 10130.1. Samples: 63959104. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:24:50,132][635489] Avg episode reward: [(0, '3558.999')] [2023-03-09 07:24:51,736][635776] Updated weights for policy 0, policy_version 124960 (0.0005) [2023-03-09 07:24:55,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10205.3). Total num frames: 64008192. Throughput: 0: 10197.3. Samples: 63991444. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:24:55,122][635489] Avg episode reward: [(0, '3409.035')] [2023-03-09 07:24:56,001][635776] Updated weights for policy 0, policy_version 125040 (0.0005) [2023-03-09 07:25:00,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10205.3). Total num frames: 64057344. Throughput: 0: 10180.3. Samples: 64049152. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:25:00,132][635489] Avg episode reward: [(0, '3364.272')] [2023-03-09 07:25:00,135][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000125112_64057344.pth... [2023-03-09 07:25:00,137][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000124528_63758336.pth [2023-03-09 07:25:00,209][635776] Updated weights for policy 0, policy_version 125120 (0.0005) [2023-03-09 07:25:04,470][635776] Updated weights for policy 0, policy_version 125200 (0.0005) [2023-03-09 07:25:05,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10035.2, 300 sec: 10205.3). Total num frames: 64106496. Throughput: 0: 10017.2. Samples: 64106496. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:25:05,132][635489] Avg episode reward: [(0, '3846.171')] [2023-03-09 07:25:08,795][635776] Updated weights for policy 0, policy_version 125280 (0.0005) [2023-03-09 07:25:10,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10191.4). Total num frames: 64155648. Throughput: 0: 9973.8. Samples: 64134876. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:25:10,122][635489] Avg episode reward: [(0, '3531.294')] [2023-03-09 07:25:12,755][635776] Updated weights for policy 0, policy_version 125360 (0.0005) [2023-03-09 07:25:15,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10191.4). Total num frames: 64204800. Throughput: 0: 9954.3. Samples: 64196184. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:25:15,133][635489] Avg episode reward: [(0, '3896.046')] [2023-03-09 07:25:15,136][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000125400_64204800.pth... [2023-03-09 07:25:15,138][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000124816_63905792.pth [2023-03-09 07:25:16,964][635776] Updated weights for policy 0, policy_version 125440 (0.0005) [2023-03-09 07:25:20,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10205.3). Total num frames: 64258048. Throughput: 0: 9967.9. Samples: 64256056. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:25:20,132][635489] Avg episode reward: [(0, '3667.842')] [2023-03-09 07:25:20,792][635776] Updated weights for policy 0, policy_version 125520 (0.0005) [2023-03-09 07:25:24,798][635776] Updated weights for policy 0, policy_version 125600 (0.0005) [2023-03-09 07:25:25,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10035.2, 300 sec: 10191.4). Total num frames: 64307200. Throughput: 0: 10012.6. Samples: 64286792. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:25:25,132][635489] Avg episode reward: [(0, '3680.925')] [2023-03-09 07:25:29,260][635776] Updated weights for policy 0, policy_version 125680 (0.0004) [2023-03-09 07:25:30,121][635489] Fps is (10 sec: 9420.8, 60 sec: 9898.7, 300 sec: 10163.6). Total num frames: 64352256. Throughput: 0: 9947.9. Samples: 64345076. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:25:30,132][635489] Avg episode reward: [(0, '3878.475')] [2023-03-09 07:25:30,141][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000125696_64356352.pth... [2023-03-09 07:25:30,142][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000125112_64057344.pth [2023-03-09 07:25:33,189][635776] Updated weights for policy 0, policy_version 125760 (0.0004) [2023-03-09 07:25:35,122][635489] Fps is (10 sec: 9830.3, 60 sec: 9966.9, 300 sec: 10177.5). Total num frames: 64405504. Throughput: 0: 9920.4. Samples: 64405524. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:25:35,133][635489] Avg episode reward: [(0, '3812.825')] [2023-03-09 07:25:37,492][635776] Updated weights for policy 0, policy_version 125840 (0.0004) [2023-03-09 07:25:40,121][635489] Fps is (10 sec: 10240.1, 60 sec: 9967.0, 300 sec: 10163.6). Total num frames: 64454656. Throughput: 0: 9839.9. Samples: 64434240. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:25:40,132][635489] Avg episode reward: [(0, '3729.530')] [2023-03-09 07:25:41,513][635776] Updated weights for policy 0, policy_version 125920 (0.0005) [2023-03-09 07:25:45,121][635489] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 10163.6). Total num frames: 64503808. Throughput: 0: 9921.1. Samples: 64495600. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:25:45,122][635489] Avg episode reward: [(0, '3918.261')] [2023-03-09 07:25:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000125984_64503808.pth... [2023-03-09 07:25:45,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000125400_64204800.pth [2023-03-09 07:25:45,589][635776] Updated weights for policy 0, policy_version 126000 (0.0005) [2023-03-09 07:25:49,476][635776] Updated weights for policy 0, policy_version 126080 (0.0005) [2023-03-09 07:25:50,122][635489] Fps is (10 sec: 10239.9, 60 sec: 9966.9, 300 sec: 10177.5). Total num frames: 64557056. Throughput: 0: 10013.9. Samples: 64557120. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:25:50,122][635489] Avg episode reward: [(0, '4091.333')] [2023-03-09 07:25:53,546][635776] Updated weights for policy 0, policy_version 126160 (0.0006) [2023-03-09 07:25:55,121][635489] Fps is (10 sec: 10649.7, 60 sec: 10035.2, 300 sec: 10177.5). Total num frames: 64610304. Throughput: 0: 10049.7. Samples: 64587112. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:25:55,122][635489] Avg episode reward: [(0, '3659.989')] [2023-03-09 07:25:57,359][635776] Updated weights for policy 0, policy_version 126240 (0.0005) [2023-03-09 07:26:00,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10191.4). Total num frames: 64659456. Throughput: 0: 10072.3. Samples: 64649436. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:26:00,122][635489] Avg episode reward: [(0, '4098.748')] [2023-03-09 07:26:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000126288_64659456.pth... [2023-03-09 07:26:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000125696_64356352.pth [2023-03-09 07:26:01,440][635776] Updated weights for policy 0, policy_version 126320 (0.0005) [2023-03-09 07:26:05,024][635776] Updated weights for policy 0, policy_version 126400 (0.0004) [2023-03-09 07:26:05,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10171.7, 300 sec: 10219.2). Total num frames: 64716800. Throughput: 0: 10227.1. Samples: 64716276. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:26:05,122][635489] Avg episode reward: [(0, '4174.726')] [2023-03-09 07:26:09,054][635776] Updated weights for policy 0, policy_version 126480 (0.0005) [2023-03-09 07:26:10,121][635489] Fps is (10 sec: 10649.7, 60 sec: 10171.7, 300 sec: 10205.3). Total num frames: 64765952. Throughput: 0: 10223.4. Samples: 64746844. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:26:10,122][635489] Avg episode reward: [(0, '4027.860')] [2023-03-09 07:26:13,122][635776] Updated weights for policy 0, policy_version 126560 (0.0005) [2023-03-09 07:26:15,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10191.4). Total num frames: 64819200. Throughput: 0: 10273.1. Samples: 64807368. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:26:15,122][635489] Avg episode reward: [(0, '3858.192')] [2023-03-09 07:26:15,127][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000126600_64819200.pth... [2023-03-09 07:26:15,129][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000125984_64503808.pth [2023-03-09 07:26:17,192][635776] Updated weights for policy 0, policy_version 126640 (0.0005) [2023-03-09 07:26:20,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 10177.5). Total num frames: 64868352. Throughput: 0: 10273.3. Samples: 64867824. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:26:20,133][635489] Avg episode reward: [(0, '3046.260')] [2023-03-09 07:26:21,161][635776] Updated weights for policy 0, policy_version 126720 (0.0005) [2023-03-09 07:26:25,122][635489] Fps is (10 sec: 9830.5, 60 sec: 10171.7, 300 sec: 10177.5). Total num frames: 64917504. Throughput: 0: 10335.5. Samples: 64899340. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:26:25,128][635776] Updated weights for policy 0, policy_version 126800 (0.0006) [2023-03-09 07:26:25,133][635489] Avg episode reward: [(0, '3794.783')] [2023-03-09 07:26:29,420][635776] Updated weights for policy 0, policy_version 126880 (0.0004) [2023-03-09 07:26:30,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 10163.6). Total num frames: 64966656. Throughput: 0: 10284.3. Samples: 64958396. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:26:30,133][635489] Avg episode reward: [(0, '4091.961')] [2023-03-09 07:26:30,172][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000126896_64970752.pth... [2023-03-09 07:26:30,174][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000126288_64659456.pth [2023-03-09 07:26:33,301][635776] Updated weights for policy 0, policy_version 126960 (0.0005) [2023-03-09 07:26:35,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10177.5). Total num frames: 65019904. Throughput: 0: 10284.3. Samples: 65019912. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:26:35,132][635489] Avg episode reward: [(0, '3968.868')] [2023-03-09 07:26:37,186][635776] Updated weights for policy 0, policy_version 127040 (0.0005) [2023-03-09 07:26:40,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10308.2, 300 sec: 10163.6). Total num frames: 65073152. Throughput: 0: 10342.8. Samples: 65052540. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:26:40,133][635489] Avg episode reward: [(0, '4128.850')] [2023-03-09 07:26:41,315][635776] Updated weights for policy 0, policy_version 127120 (0.0006) [2023-03-09 07:26:45,112][635776] Updated weights for policy 0, policy_version 127200 (0.0006) [2023-03-09 07:26:45,122][635489] Fps is (10 sec: 10649.4, 60 sec: 10376.5, 300 sec: 10177.5). Total num frames: 65126400. Throughput: 0: 10319.5. Samples: 65113816. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:26:45,133][635489] Avg episode reward: [(0, '4169.862')] [2023-03-09 07:26:45,137][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000127200_65126400.pth... [2023-03-09 07:26:45,141][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000126600_64819200.pth [2023-03-09 07:26:48,936][635776] Updated weights for policy 0, policy_version 127280 (0.0005) [2023-03-09 07:26:50,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 10163.6). Total num frames: 65175552. Throughput: 0: 10207.8. Samples: 65175624. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:26:50,132][635489] Avg episode reward: [(0, '4238.265')] [2023-03-09 07:26:53,157][635776] Updated weights for policy 0, policy_version 127360 (0.0005) [2023-03-09 07:26:55,121][635489] Fps is (10 sec: 9830.6, 60 sec: 10240.0, 300 sec: 10177.5). Total num frames: 65224704. Throughput: 0: 10208.5. Samples: 65206228. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:26:55,122][635489] Avg episode reward: [(0, '4060.093')] [2023-03-09 07:26:57,317][635776] Updated weights for policy 0, policy_version 127440 (0.0005) [2023-03-09 07:27:00,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10308.3, 300 sec: 10177.5). Total num frames: 65277952. Throughput: 0: 10184.4. Samples: 65265664. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:27:00,122][635489] Avg episode reward: [(0, '4091.631')] [2023-03-09 07:27:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000127496_65277952.pth... [2023-03-09 07:27:00,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000126896_64970752.pth [2023-03-09 07:27:01,103][635776] Updated weights for policy 0, policy_version 127520 (0.0005) [2023-03-09 07:27:05,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 10177.5). Total num frames: 65327104. Throughput: 0: 10206.4. Samples: 65327112. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:27:05,122][635489] Avg episode reward: [(0, '3917.139')] [2023-03-09 07:27:05,289][635776] Updated weights for policy 0, policy_version 127600 (0.0005) [2023-03-09 07:27:09,447][635776] Updated weights for policy 0, policy_version 127680 (0.0005) [2023-03-09 07:27:10,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10163.6). Total num frames: 65376256. Throughput: 0: 10145.3. Samples: 65355880. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:27:10,122][635489] Avg episode reward: [(0, '3952.897')] [2023-03-09 07:27:13,460][635776] Updated weights for policy 0, policy_version 127760 (0.0005) [2023-03-09 07:27:15,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10149.7). Total num frames: 65425408. Throughput: 0: 10205.2. Samples: 65417628. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:27:15,122][635489] Avg episode reward: [(0, '4137.642')] [2023-03-09 07:27:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000127784_65425408.pth... [2023-03-09 07:27:15,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000127200_65126400.pth [2023-03-09 07:27:17,536][635776] Updated weights for policy 0, policy_version 127840 (0.0005) [2023-03-09 07:27:20,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10163.6). Total num frames: 65478656. Throughput: 0: 10171.3. Samples: 65477620. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:27:20,122][635489] Avg episode reward: [(0, '3818.455')] [2023-03-09 07:27:21,703][635776] Updated weights for policy 0, policy_version 127920 (0.0004) [2023-03-09 07:27:25,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10149.7). Total num frames: 65527808. Throughput: 0: 10095.7. Samples: 65506848. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:27:25,122][635489] Avg episode reward: [(0, '3566.045')] [2023-03-09 07:27:25,806][635776] Updated weights for policy 0, policy_version 128000 (0.0004) [2023-03-09 07:27:29,993][635776] Updated weights for policy 0, policy_version 128080 (0.0004) [2023-03-09 07:27:30,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10171.7, 300 sec: 10149.7). Total num frames: 65576960. Throughput: 0: 10069.9. Samples: 65566960. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:27:30,122][635489] Avg episode reward: [(0, '3633.499')] [2023-03-09 07:27:30,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000128080_65576960.pth... [2023-03-09 07:27:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000127496_65277952.pth [2023-03-09 07:27:33,921][635776] Updated weights for policy 0, policy_version 128160 (0.0005) [2023-03-09 07:27:35,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10163.6). Total num frames: 65626112. Throughput: 0: 10034.6. Samples: 65627180. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:27:35,122][635489] Avg episode reward: [(0, '3369.059')] [2023-03-09 07:27:38,096][635776] Updated weights for policy 0, policy_version 128240 (0.0005) [2023-03-09 07:27:40,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10035.2, 300 sec: 10149.8). Total num frames: 65675264. Throughput: 0: 10021.7. Samples: 65657204. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:27:40,122][635489] Avg episode reward: [(0, '3771.929')] [2023-03-09 07:27:42,213][635776] Updated weights for policy 0, policy_version 128320 (0.0005) [2023-03-09 07:27:45,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10149.8). Total num frames: 65728512. Throughput: 0: 10019.3. Samples: 65716532. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:27:45,122][635489] Avg episode reward: [(0, '3809.973')] [2023-03-09 07:27:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000128376_65728512.pth... [2023-03-09 07:27:45,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000127784_65425408.pth [2023-03-09 07:27:46,249][635776] Updated weights for policy 0, policy_version 128400 (0.0005) [2023-03-09 07:27:50,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10135.9). Total num frames: 65777664. Throughput: 0: 10041.2. Samples: 65778964. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:27:50,122][635489] Avg episode reward: [(0, '3622.772')] [2023-03-09 07:27:50,149][635776] Updated weights for policy 0, policy_version 128480 (0.0005) [2023-03-09 07:27:54,069][635776] Updated weights for policy 0, policy_version 128560 (0.0005) [2023-03-09 07:27:55,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10149.7). Total num frames: 65830912. Throughput: 0: 10101.4. Samples: 65810444. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:27:55,122][635489] Avg episode reward: [(0, '3827.092')] [2023-03-09 07:27:58,209][635776] Updated weights for policy 0, policy_version 128640 (0.0004) [2023-03-09 07:28:00,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10135.9). Total num frames: 65880064. Throughput: 0: 10070.1. Samples: 65870784. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:28:00,122][635489] Avg episode reward: [(0, '3718.030')] [2023-03-09 07:28:00,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000128672_65880064.pth... [2023-03-09 07:28:00,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000128080_65576960.pth [2023-03-09 07:28:02,438][635776] Updated weights for policy 0, policy_version 128720 (0.0005) [2023-03-09 07:28:05,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10135.9). Total num frames: 65929216. Throughput: 0: 10043.9. Samples: 65929596. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:28:05,122][635489] Avg episode reward: [(0, '3845.768')] [2023-03-09 07:28:06,533][635776] Updated weights for policy 0, policy_version 128800 (0.0004) [2023-03-09 07:28:10,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10122.0). Total num frames: 65978368. Throughput: 0: 10054.4. Samples: 65959296. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:28:10,122][635489] Avg episode reward: [(0, '3798.041')] [2023-03-09 07:28:10,746][635776] Updated weights for policy 0, policy_version 128880 (0.0004) [2023-03-09 07:28:14,901][635776] Updated weights for policy 0, policy_version 128960 (0.0004) [2023-03-09 07:28:15,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10035.2, 300 sec: 10094.2). Total num frames: 66027520. Throughput: 0: 10042.9. Samples: 66018888. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:28:15,122][635489] Avg episode reward: [(0, '3420.382')] [2023-03-09 07:28:15,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000128960_66027520.pth... [2023-03-09 07:28:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000128376_65728512.pth [2023-03-09 07:28:19,164][635776] Updated weights for policy 0, policy_version 129040 (0.0005) [2023-03-09 07:28:20,122][635489] Fps is (10 sec: 9830.3, 60 sec: 9966.9, 300 sec: 10080.3). Total num frames: 66076672. Throughput: 0: 9990.1. Samples: 66076736. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:28:20,122][635489] Avg episode reward: [(0, '3270.069')] [2023-03-09 07:28:23,312][635776] Updated weights for policy 0, policy_version 129120 (0.0005) [2023-03-09 07:28:25,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 10080.3). Total num frames: 66125824. Throughput: 0: 9955.3. Samples: 66105192. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 07:28:25,122][635489] Avg episode reward: [(0, '3491.199')] [2023-03-09 07:28:27,459][635776] Updated weights for policy 0, policy_version 129200 (0.0005) [2023-03-09 07:28:30,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 10080.3). Total num frames: 66174976. Throughput: 0: 9966.1. Samples: 66165008. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 07:28:30,122][635489] Avg episode reward: [(0, '3624.761')] [2023-03-09 07:28:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000129248_66174976.pth... [2023-03-09 07:28:30,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000128672_65880064.pth [2023-03-09 07:28:31,475][635776] Updated weights for policy 0, policy_version 129280 (0.0005) [2023-03-09 07:28:35,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10080.3). Total num frames: 66228224. Throughput: 0: 9994.0. Samples: 66228696. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 07:28:35,122][635489] Avg episode reward: [(0, '4089.123')] [2023-03-09 07:28:35,236][635776] Updated weights for policy 0, policy_version 129360 (0.0005) [2023-03-09 07:28:39,312][635776] Updated weights for policy 0, policy_version 129440 (0.0005) [2023-03-09 07:28:40,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10080.3). Total num frames: 66277376. Throughput: 0: 9959.9. Samples: 66258640. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 07:28:40,122][635489] Avg episode reward: [(0, '4241.952')] [2023-03-09 07:28:43,369][635776] Updated weights for policy 0, policy_version 129520 (0.0005) [2023-03-09 07:28:45,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10094.2). Total num frames: 66330624. Throughput: 0: 9986.7. Samples: 66320184. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 07:28:45,122][635489] Avg episode reward: [(0, '4386.890')] [2023-03-09 07:28:45,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000129552_66330624.pth... [2023-03-09 07:28:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000128960_66027520.pth [2023-03-09 07:28:47,342][635776] Updated weights for policy 0, policy_version 129600 (0.0004) [2023-03-09 07:28:50,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10094.2). Total num frames: 66379776. Throughput: 0: 10029.4. Samples: 66380920. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 07:28:50,122][635489] Avg episode reward: [(0, '4246.432')] [2023-03-09 07:28:51,229][635776] Updated weights for policy 0, policy_version 129680 (0.0005) [2023-03-09 07:28:55,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10035.2, 300 sec: 10094.2). Total num frames: 66433024. Throughput: 0: 10106.9. Samples: 66414104. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 07:28:55,122][635489] Avg episode reward: [(0, '4138.817')] [2023-03-09 07:28:55,124][635776] Updated weights for policy 0, policy_version 129760 (0.0005) [2023-03-09 07:28:59,095][635776] Updated weights for policy 0, policy_version 129840 (0.0005) [2023-03-09 07:29:00,122][635489] Fps is (10 sec: 10649.4, 60 sec: 10103.4, 300 sec: 10108.1). Total num frames: 66486272. Throughput: 0: 10163.3. Samples: 66476236. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 07:29:00,122][635489] Avg episode reward: [(0, '4347.105')] [2023-03-09 07:29:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000129856_66486272.pth... [2023-03-09 07:29:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000129248_66174976.pth [2023-03-09 07:29:03,250][635776] Updated weights for policy 0, policy_version 129920 (0.0005) [2023-03-09 07:29:05,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 10122.0). Total num frames: 66535424. Throughput: 0: 10190.2. Samples: 66535296. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 07:29:05,122][635489] Avg episode reward: [(0, '4288.791')] [2023-03-09 07:29:07,380][635776] Updated weights for policy 0, policy_version 130000 (0.0005) [2023-03-09 07:29:10,122][635489] Fps is (10 sec: 9830.5, 60 sec: 10103.5, 300 sec: 10108.1). Total num frames: 66584576. Throughput: 0: 10239.7. Samples: 66565976. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 07:29:10,122][635489] Avg episode reward: [(0, '4428.569')] [2023-03-09 07:29:11,410][635776] Updated weights for policy 0, policy_version 130080 (0.0005) [2023-03-09 07:29:15,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10108.1). Total num frames: 66637824. Throughput: 0: 10269.4. Samples: 66627132. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 07:29:15,122][635489] Avg episode reward: [(0, '4227.213')] [2023-03-09 07:29:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000130152_66637824.pth... [2023-03-09 07:29:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000129552_66330624.pth [2023-03-09 07:29:15,455][635776] Updated weights for policy 0, policy_version 130160 (0.0005) [2023-03-09 07:29:19,498][635776] Updated weights for policy 0, policy_version 130240 (0.0005) [2023-03-09 07:29:20,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10108.1). Total num frames: 66686976. Throughput: 0: 10184.0. Samples: 66686976. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 07:29:20,122][635489] Avg episode reward: [(0, '4217.048')] [2023-03-09 07:29:23,535][635776] Updated weights for policy 0, policy_version 130320 (0.0005) [2023-03-09 07:29:25,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10108.1). Total num frames: 66740224. Throughput: 0: 10217.1. Samples: 66718412. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 07:29:25,122][635489] Avg episode reward: [(0, '4350.175')] [2023-03-09 07:29:27,355][635776] Updated weights for policy 0, policy_version 130400 (0.0005) [2023-03-09 07:29:30,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10108.1). Total num frames: 66789376. Throughput: 0: 10208.4. Samples: 66779564. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 07:29:30,122][635489] Avg episode reward: [(0, '4467.602')] [2023-03-09 07:29:30,155][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000130456_66793472.pth... [2023-03-09 07:29:30,157][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000129856_66486272.pth [2023-03-09 07:29:31,430][635776] Updated weights for policy 0, policy_version 130480 (0.0004) [2023-03-09 07:29:35,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10108.1). Total num frames: 66838528. Throughput: 0: 10198.1. Samples: 66839836. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 07:29:35,122][635489] Avg episode reward: [(0, '4470.122')] [2023-03-09 07:29:35,535][635776] Updated weights for policy 0, policy_version 130560 (0.0004) [2023-03-09 07:29:39,518][635776] Updated weights for policy 0, policy_version 130640 (0.0005) [2023-03-09 07:29:40,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10122.0). Total num frames: 66891776. Throughput: 0: 10160.0. Samples: 66871304. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 07:29:40,122][635489] Avg episode reward: [(0, '4261.451')] [2023-03-09 07:29:43,385][635776] Updated weights for policy 0, policy_version 130720 (0.0005) [2023-03-09 07:29:45,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10240.0, 300 sec: 10122.0). Total num frames: 66945024. Throughput: 0: 10174.4. Samples: 66934084. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 07:29:45,122][635489] Avg episode reward: [(0, '4321.214')] [2023-03-09 07:29:45,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000130752_66945024.pth... [2023-03-09 07:29:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000130152_66637824.pth [2023-03-09 07:29:47,198][635776] Updated weights for policy 0, policy_version 130800 (0.0004) [2023-03-09 07:29:50,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10122.0). Total num frames: 66994176. Throughput: 0: 10214.1. Samples: 66994928. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 07:29:50,122][635489] Avg episode reward: [(0, '4280.730')] [2023-03-09 07:29:51,430][635776] Updated weights for policy 0, policy_version 130880 (0.0005) [2023-03-09 07:29:55,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10171.7, 300 sec: 10122.0). Total num frames: 67043328. Throughput: 0: 10160.7. Samples: 67023208. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 07:29:55,122][635489] Avg episode reward: [(0, '4240.169')] [2023-03-09 07:29:55,654][635776] Updated weights for policy 0, policy_version 130960 (0.0005) [2023-03-09 07:29:59,697][635776] Updated weights for policy 0, policy_version 131040 (0.0005) [2023-03-09 07:30:00,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.8, 300 sec: 10135.9). Total num frames: 67096576. Throughput: 0: 10165.3. Samples: 67084568. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 07:30:00,122][635489] Avg episode reward: [(0, '4216.322')] [2023-03-09 07:30:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000131048_67096576.pth... [2023-03-09 07:30:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000130456_66793472.pth [2023-03-09 07:30:03,513][635776] Updated weights for policy 0, policy_version 131120 (0.0004) [2023-03-09 07:30:05,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10240.0, 300 sec: 10149.7). Total num frames: 67149824. Throughput: 0: 10282.0. Samples: 67149668. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 07:30:05,122][635489] Avg episode reward: [(0, '4310.749')] [2023-03-09 07:30:07,499][635776] Updated weights for policy 0, policy_version 131200 (0.0005) [2023-03-09 07:30:10,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10149.8). Total num frames: 67198976. Throughput: 0: 10225.5. Samples: 67178560. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 07:30:10,122][635489] Avg episode reward: [(0, '4295.698')] [2023-03-09 07:30:11,365][635776] Updated weights for policy 0, policy_version 131280 (0.0004) [2023-03-09 07:30:15,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10149.7). Total num frames: 67252224. Throughput: 0: 10226.3. Samples: 67239748. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 07:30:15,122][635489] Avg episode reward: [(0, '4377.643')] [2023-03-09 07:30:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000131352_67252224.pth... [2023-03-09 07:30:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000130752_66945024.pth [2023-03-09 07:30:15,310][635776] Updated weights for policy 0, policy_version 131360 (0.0004) [2023-03-09 07:30:19,192][635776] Updated weights for policy 0, policy_version 131440 (0.0005) [2023-03-09 07:30:20,121][635489] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 10163.6). Total num frames: 67305472. Throughput: 0: 10340.9. Samples: 67305176. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 07:30:20,122][635489] Avg episode reward: [(0, '4395.737')] [2023-03-09 07:30:23,453][635776] Updated weights for policy 0, policy_version 131520 (0.0005) [2023-03-09 07:30:25,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10177.5). Total num frames: 67354624. Throughput: 0: 10269.4. Samples: 67333428. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 07:30:25,122][635489] Avg episode reward: [(0, '4060.039')] [2023-03-09 07:30:27,522][635776] Updated weights for policy 0, policy_version 131600 (0.0005) [2023-03-09 07:30:30,122][635489] Fps is (10 sec: 9420.7, 60 sec: 10171.7, 300 sec: 10149.7). Total num frames: 67399680. Throughput: 0: 10166.1. Samples: 67391560. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:30:30,122][635489] Avg episode reward: [(0, '4164.912')] [2023-03-09 07:30:30,177][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000131648_67403776.pth... [2023-03-09 07:30:30,179][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000131048_67096576.pth [2023-03-09 07:30:31,906][635776] Updated weights for policy 0, policy_version 131680 (0.0006) [2023-03-09 07:30:35,121][635489] Fps is (10 sec: 9420.9, 60 sec: 10171.7, 300 sec: 10149.7). Total num frames: 67448832. Throughput: 0: 10088.4. Samples: 67448904. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:30:35,122][635489] Avg episode reward: [(0, '4154.064')] [2023-03-09 07:30:35,974][635776] Updated weights for policy 0, policy_version 131760 (0.0004) [2023-03-09 07:30:40,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10103.5, 300 sec: 10149.8). Total num frames: 67497984. Throughput: 0: 10138.2. Samples: 67479428. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:30:40,122][635489] Avg episode reward: [(0, '3848.509')] [2023-03-09 07:30:40,211][635776] Updated weights for policy 0, policy_version 131840 (0.0005) [2023-03-09 07:30:44,537][635776] Updated weights for policy 0, policy_version 131920 (0.0005) [2023-03-09 07:30:45,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10035.2, 300 sec: 10135.9). Total num frames: 67547136. Throughput: 0: 10066.3. Samples: 67537552. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:30:45,122][635489] Avg episode reward: [(0, '3460.642')] [2023-03-09 07:30:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000131928_67547136.pth... [2023-03-09 07:30:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000131352_67252224.pth [2023-03-09 07:30:48,703][635776] Updated weights for policy 0, policy_version 132000 (0.0005) [2023-03-09 07:30:50,121][635489] Fps is (10 sec: 9830.3, 60 sec: 10035.2, 300 sec: 10122.0). Total num frames: 67596288. Throughput: 0: 9909.7. Samples: 67595604. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:30:50,122][635489] Avg episode reward: [(0, '3981.098')] [2023-03-09 07:30:52,984][635776] Updated weights for policy 0, policy_version 132080 (0.0005) [2023-03-09 07:30:55,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10035.2, 300 sec: 10122.0). Total num frames: 67645440. Throughput: 0: 9917.9. Samples: 67624864. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:30:55,122][635489] Avg episode reward: [(0, '4231.844')] [2023-03-09 07:30:57,062][635776] Updated weights for policy 0, policy_version 132160 (0.0005) [2023-03-09 07:31:00,122][635489] Fps is (10 sec: 9830.3, 60 sec: 9966.9, 300 sec: 10094.2). Total num frames: 67694592. Throughput: 0: 9918.7. Samples: 67686088. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:31:00,122][635489] Avg episode reward: [(0, '4018.904')] [2023-03-09 07:31:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000132216_67694592.pth... [2023-03-09 07:31:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000131648_67403776.pth [2023-03-09 07:31:00,973][635776] Updated weights for policy 0, policy_version 132240 (0.0005) [2023-03-09 07:31:05,121][635489] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10094.2). Total num frames: 67743744. Throughput: 0: 9746.1. Samples: 67743752. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:31:05,122][635489] Avg episode reward: [(0, '4288.689')] [2023-03-09 07:31:05,211][635776] Updated weights for policy 0, policy_version 132320 (0.0005) [2023-03-09 07:31:09,230][635776] Updated weights for policy 0, policy_version 132400 (0.0005) [2023-03-09 07:31:10,122][635489] Fps is (10 sec: 10240.1, 60 sec: 9966.9, 300 sec: 10094.2). Total num frames: 67796992. Throughput: 0: 9825.9. Samples: 67775592. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:31:10,122][635489] Avg episode reward: [(0, '4176.923')] [2023-03-09 07:31:13,009][635776] Updated weights for policy 0, policy_version 132480 (0.0005) [2023-03-09 07:31:15,122][635489] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 10094.2). Total num frames: 67846144. Throughput: 0: 9941.9. Samples: 67838944. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:31:15,122][635489] Avg episode reward: [(0, '4193.660')] [2023-03-09 07:31:15,132][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000132520_67850240.pth... [2023-03-09 07:31:15,133][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000131928_67547136.pth [2023-03-09 07:31:17,202][635776] Updated weights for policy 0, policy_version 132560 (0.0004) [2023-03-09 07:31:20,121][635489] Fps is (10 sec: 10240.1, 60 sec: 9898.7, 300 sec: 10108.1). Total num frames: 67899392. Throughput: 0: 9986.0. Samples: 67898272. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:31:20,122][635489] Avg episode reward: [(0, '4338.566')] [2023-03-09 07:31:21,273][635776] Updated weights for policy 0, policy_version 132640 (0.0005) [2023-03-09 07:31:25,079][635776] Updated weights for policy 0, policy_version 132720 (0.0005) [2023-03-09 07:31:25,122][635489] Fps is (10 sec: 10649.6, 60 sec: 9966.9, 300 sec: 10122.0). Total num frames: 67952640. Throughput: 0: 9972.4. Samples: 67928188. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:31:25,122][635489] Avg episode reward: [(0, '4477.456')] [2023-03-09 07:31:28,888][635776] Updated weights for policy 0, policy_version 132800 (0.0005) [2023-03-09 07:31:30,122][635489] Fps is (10 sec: 10649.4, 60 sec: 10103.5, 300 sec: 10122.0). Total num frames: 68005888. Throughput: 0: 10135.8. Samples: 67993664. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:31:30,122][635489] Avg episode reward: [(0, '4178.481')] [2023-03-09 07:31:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000132824_68005888.pth... [2023-03-09 07:31:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000132216_67694592.pth [2023-03-09 07:31:32,741][635776] Updated weights for policy 0, policy_version 132880 (0.0005) [2023-03-09 07:31:35,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10103.5, 300 sec: 10108.1). Total num frames: 68055040. Throughput: 0: 10216.2. Samples: 68055332. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:31:35,122][635489] Avg episode reward: [(0, '4324.818')] [2023-03-09 07:31:36,909][635776] Updated weights for policy 0, policy_version 132960 (0.0005) [2023-03-09 07:31:40,122][635489] Fps is (10 sec: 9830.5, 60 sec: 10103.5, 300 sec: 10094.2). Total num frames: 68104192. Throughput: 0: 10224.8. Samples: 68084980. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:31:40,122][635489] Avg episode reward: [(0, '4410.402')] [2023-03-09 07:31:41,082][635776] Updated weights for policy 0, policy_version 133040 (0.0006) [2023-03-09 07:31:45,103][635776] Updated weights for policy 0, policy_version 133120 (0.0005) [2023-03-09 07:31:45,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 10108.1). Total num frames: 68157440. Throughput: 0: 10202.8. Samples: 68145216. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:31:45,122][635489] Avg episode reward: [(0, '4468.613')] [2023-03-09 07:31:45,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000133120_68157440.pth... [2023-03-09 07:31:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000132520_67850240.pth [2023-03-09 07:31:49,159][635776] Updated weights for policy 0, policy_version 133200 (0.0005) [2023-03-09 07:31:50,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10108.1). Total num frames: 68206592. Throughput: 0: 10282.7. Samples: 68206476. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:31:50,122][635489] Avg episode reward: [(0, '4443.307')] [2023-03-09 07:31:53,198][635776] Updated weights for policy 0, policy_version 133280 (0.0005) [2023-03-09 07:31:55,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10108.1). Total num frames: 68259840. Throughput: 0: 10223.0. Samples: 68235628. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:31:55,122][635489] Avg episode reward: [(0, '4505.710')] [2023-03-09 07:31:57,106][635776] Updated weights for policy 0, policy_version 133360 (0.0005) [2023-03-09 07:32:00,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10108.1). Total num frames: 68308992. Throughput: 0: 10239.9. Samples: 68299740. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:32:00,122][635489] Avg episode reward: [(0, '4420.915')] [2023-03-09 07:32:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000133416_68308992.pth... [2023-03-09 07:32:00,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000132824_68005888.pth [2023-03-09 07:32:00,976][635776] Updated weights for policy 0, policy_version 133440 (0.0005) [2023-03-09 07:32:04,725][635776] Updated weights for policy 0, policy_version 133520 (0.0005) [2023-03-09 07:32:05,121][635489] Fps is (10 sec: 10240.2, 60 sec: 10308.3, 300 sec: 10122.0). Total num frames: 68362240. Throughput: 0: 10346.8. Samples: 68363880. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:32:05,122][635489] Avg episode reward: [(0, '4492.272')] [2023-03-09 07:32:08,741][635776] Updated weights for policy 0, policy_version 133600 (0.0005) [2023-03-09 07:32:10,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10308.3, 300 sec: 10135.9). Total num frames: 68415488. Throughput: 0: 10358.6. Samples: 68394324. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:32:10,122][635489] Avg episode reward: [(0, '4505.215')] [2023-03-09 07:32:12,471][635776] Updated weights for policy 0, policy_version 133680 (0.0005) [2023-03-09 07:32:15,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10376.5, 300 sec: 10135.9). Total num frames: 68468736. Throughput: 0: 10370.7. Samples: 68460344. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:32:15,122][635489] Avg episode reward: [(0, '4478.850')] [2023-03-09 07:32:15,138][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000133736_68472832.pth... [2023-03-09 07:32:15,139][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000133120_68157440.pth [2023-03-09 07:32:16,413][635776] Updated weights for policy 0, policy_version 133760 (0.0005) [2023-03-09 07:32:20,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10149.7). Total num frames: 68521984. Throughput: 0: 10369.1. Samples: 68521944. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:32:20,122][635489] Avg episode reward: [(0, '4482.472')] [2023-03-09 07:32:20,331][635776] Updated weights for policy 0, policy_version 133840 (0.0005) [2023-03-09 07:32:24,555][635776] Updated weights for policy 0, policy_version 133920 (0.0005) [2023-03-09 07:32:25,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10149.8). Total num frames: 68571136. Throughput: 0: 10348.4. Samples: 68550656. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:32:25,122][635489] Avg episode reward: [(0, '4231.319')] [2023-03-09 07:32:28,699][635776] Updated weights for policy 0, policy_version 134000 (0.0005) [2023-03-09 07:32:30,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 10149.7). Total num frames: 68620288. Throughput: 0: 10332.3. Samples: 68610168. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:32:30,122][635489] Avg episode reward: [(0, '4007.557')] [2023-03-09 07:32:30,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000134024_68620288.pth... [2023-03-09 07:32:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000133416_68308992.pth [2023-03-09 07:32:32,761][635776] Updated weights for policy 0, policy_version 134080 (0.0005) [2023-03-09 07:32:35,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 10149.7). Total num frames: 68669440. Throughput: 0: 10288.1. Samples: 68669440. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 07:32:35,122][635489] Avg episode reward: [(0, '4260.814')] [2023-03-09 07:32:37,062][635776] Updated weights for policy 0, policy_version 134160 (0.0005) [2023-03-09 07:32:40,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 10135.9). Total num frames: 68718592. Throughput: 0: 10285.2. Samples: 68698460. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 07:32:40,122][635489] Avg episode reward: [(0, '3776.667')] [2023-03-09 07:32:41,152][635776] Updated weights for policy 0, policy_version 134240 (0.0005) [2023-03-09 07:32:45,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10171.8, 300 sec: 10135.9). Total num frames: 68767744. Throughput: 0: 10158.9. Samples: 68756892. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 07:32:45,122][635489] Avg episode reward: [(0, '4191.717')] [2023-03-09 07:32:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000134312_68767744.pth... [2023-03-09 07:32:45,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000133736_68472832.pth [2023-03-09 07:32:45,423][635776] Updated weights for policy 0, policy_version 134320 (0.0005) [2023-03-09 07:32:49,566][635776] Updated weights for policy 0, policy_version 134400 (0.0005) [2023-03-09 07:32:50,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10122.0). Total num frames: 68816896. Throughput: 0: 10060.2. Samples: 68816588. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 07:32:50,122][635489] Avg episode reward: [(0, '4395.402')] [2023-03-09 07:32:53,467][635776] Updated weights for policy 0, policy_version 134480 (0.0005) [2023-03-09 07:32:55,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 10135.9). Total num frames: 68870144. Throughput: 0: 10062.3. Samples: 68847128. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 07:32:55,122][635489] Avg episode reward: [(0, '4275.563')] [2023-03-09 07:32:57,304][635776] Updated weights for policy 0, policy_version 134560 (0.0005) [2023-03-09 07:33:00,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 10135.9). Total num frames: 68919296. Throughput: 0: 10015.3. Samples: 68911032. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 07:33:00,122][635489] Avg episode reward: [(0, '4397.710')] [2023-03-09 07:33:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000134616_68923392.pth... [2023-03-09 07:33:00,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000134024_68620288.pth [2023-03-09 07:33:01,293][635776] Updated weights for policy 0, policy_version 134640 (0.0005) [2023-03-09 07:33:05,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10149.7). Total num frames: 68972544. Throughput: 0: 10014.8. Samples: 68972608. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 07:33:05,122][635489] Avg episode reward: [(0, '4380.786')] [2023-03-09 07:33:05,195][635776] Updated weights for policy 0, policy_version 134720 (0.0005) [2023-03-09 07:33:09,405][635776] Updated weights for policy 0, policy_version 134800 (0.0004) [2023-03-09 07:33:10,122][635489] Fps is (10 sec: 10240.1, 60 sec: 10103.5, 300 sec: 10149.8). Total num frames: 69021696. Throughput: 0: 10096.3. Samples: 69004988. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 07:33:10,122][635489] Avg episode reward: [(0, '4363.607')] [2023-03-09 07:33:13,554][635776] Updated weights for policy 0, policy_version 134880 (0.0005) [2023-03-09 07:33:15,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10149.7). Total num frames: 69070848. Throughput: 0: 10048.5. Samples: 69062352. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 07:33:15,122][635489] Avg episode reward: [(0, '4472.214')] [2023-03-09 07:33:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000134904_69070848.pth... [2023-03-09 07:33:15,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000134312_68767744.pth [2023-03-09 07:33:17,820][635776] Updated weights for policy 0, policy_version 134960 (0.0004) [2023-03-09 07:33:20,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 10149.7). Total num frames: 69120000. Throughput: 0: 10013.8. Samples: 69120064. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 07:33:20,122][635489] Avg episode reward: [(0, '4376.725')] [2023-03-09 07:33:21,816][635776] Updated weights for policy 0, policy_version 135040 (0.0004) [2023-03-09 07:33:25,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10035.2, 300 sec: 10163.6). Total num frames: 69173248. Throughput: 0: 10095.8. Samples: 69152768. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 07:33:25,122][635489] Avg episode reward: [(0, '4335.526')] [2023-03-09 07:33:25,705][635776] Updated weights for policy 0, policy_version 135120 (0.0004) [2023-03-09 07:33:29,611][635776] Updated weights for policy 0, policy_version 135200 (0.0005) [2023-03-09 07:33:30,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10103.5, 300 sec: 10163.6). Total num frames: 69226496. Throughput: 0: 10164.0. Samples: 69214272. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 07:33:30,122][635489] Avg episode reward: [(0, '4137.454')] [2023-03-09 07:33:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000135208_69226496.pth... [2023-03-09 07:33:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000134616_68923392.pth [2023-03-09 07:33:33,563][635776] Updated weights for policy 0, policy_version 135280 (0.0005) [2023-03-09 07:33:35,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10163.6). Total num frames: 69275648. Throughput: 0: 10206.3. Samples: 69275872. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 07:33:35,122][635489] Avg episode reward: [(0, '4021.263')] [2023-03-09 07:33:37,785][635776] Updated weights for policy 0, policy_version 135360 (0.0005) [2023-03-09 07:33:40,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10171.7, 300 sec: 10163.6). Total num frames: 69328896. Throughput: 0: 10180.5. Samples: 69305252. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 07:33:40,122][635489] Avg episode reward: [(0, '4138.309')] [2023-03-09 07:33:41,275][635776] Updated weights for policy 0, policy_version 135440 (0.0004) [2023-03-09 07:33:45,122][635489] Fps is (10 sec: 10649.4, 60 sec: 10240.0, 300 sec: 10177.5). Total num frames: 69382144. Throughput: 0: 10218.0. Samples: 69370844. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 07:33:45,122][635489] Avg episode reward: [(0, '4387.431')] [2023-03-09 07:33:45,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000135512_69382144.pth... [2023-03-09 07:33:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000134904_69070848.pth [2023-03-09 07:33:45,495][635776] Updated weights for policy 0, policy_version 135520 (0.0005) [2023-03-09 07:33:49,811][635776] Updated weights for policy 0, policy_version 135600 (0.0004) [2023-03-09 07:33:50,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10149.7). Total num frames: 69427200. Throughput: 0: 10111.2. Samples: 69427612. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 07:33:50,122][635489] Avg episode reward: [(0, '4217.185')] [2023-03-09 07:33:53,706][635776] Updated weights for policy 0, policy_version 135680 (0.0005) [2023-03-09 07:33:55,122][635489] Fps is (10 sec: 9830.5, 60 sec: 10171.7, 300 sec: 10149.8). Total num frames: 69480448. Throughput: 0: 10120.5. Samples: 69460412. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 07:33:55,122][635489] Avg episode reward: [(0, '4380.709')] [2023-03-09 07:33:57,687][635776] Updated weights for policy 0, policy_version 135760 (0.0005) [2023-03-09 07:34:00,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10240.0, 300 sec: 10163.6). Total num frames: 69533696. Throughput: 0: 10201.5. Samples: 69521420. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 07:34:00,122][635489] Avg episode reward: [(0, '4322.353')] [2023-03-09 07:34:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000135808_69533696.pth... [2023-03-09 07:34:00,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000135208_69226496.pth [2023-03-09 07:34:01,641][635776] Updated weights for policy 0, policy_version 135840 (0.0005) [2023-03-09 07:34:05,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10240.0, 300 sec: 10177.5). Total num frames: 69586944. Throughput: 0: 10333.5. Samples: 69585072. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 07:34:05,122][635489] Avg episode reward: [(0, '4443.308')] [2023-03-09 07:34:05,433][635776] Updated weights for policy 0, policy_version 135920 (0.0005) [2023-03-09 07:34:09,339][635776] Updated weights for policy 0, policy_version 136000 (0.0005) [2023-03-09 07:34:10,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10163.6). Total num frames: 69636096. Throughput: 0: 10283.5. Samples: 69615528. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 07:34:10,122][635489] Avg episode reward: [(0, '4433.763')] [2023-03-09 07:34:13,445][635776] Updated weights for policy 0, policy_version 136080 (0.0005) [2023-03-09 07:34:15,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 10163.6). Total num frames: 69685248. Throughput: 0: 10284.4. Samples: 69677068. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 07:34:15,122][635489] Avg episode reward: [(0, '4334.473')] [2023-03-09 07:34:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000136104_69685248.pth... [2023-03-09 07:34:15,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000135512_69382144.pth [2023-03-09 07:34:17,599][635776] Updated weights for policy 0, policy_version 136160 (0.0005) [2023-03-09 07:34:20,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10163.6). Total num frames: 69738496. Throughput: 0: 10279.3. Samples: 69738440. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 07:34:20,122][635489] Avg episode reward: [(0, '4364.718')] [2023-03-09 07:34:21,602][635776] Updated weights for policy 0, policy_version 136240 (0.0005) [2023-03-09 07:34:25,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10163.6). Total num frames: 69787648. Throughput: 0: 10274.2. Samples: 69767592. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 07:34:25,122][635489] Avg episode reward: [(0, '4417.700')] [2023-03-09 07:34:25,770][635776] Updated weights for policy 0, policy_version 136320 (0.0006) [2023-03-09 07:34:29,952][635776] Updated weights for policy 0, policy_version 136400 (0.0005) [2023-03-09 07:34:30,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10163.6). Total num frames: 69836800. Throughput: 0: 10107.1. Samples: 69825664. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 07:34:30,122][635489] Avg episode reward: [(0, '4384.396')] [2023-03-09 07:34:30,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000136400_69836800.pth... [2023-03-09 07:34:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000135808_69533696.pth [2023-03-09 07:34:33,898][635776] Updated weights for policy 0, policy_version 136480 (0.0004) [2023-03-09 07:34:35,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10149.8). Total num frames: 69885952. Throughput: 0: 10191.2. Samples: 69886216. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 07:34:35,122][635489] Avg episode reward: [(0, '4416.360')] [2023-03-09 07:34:38,127][635776] Updated weights for policy 0, policy_version 136560 (0.0005) [2023-03-09 07:34:40,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10149.7). Total num frames: 69939200. Throughput: 0: 10125.0. Samples: 69916036. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:34:40,122][635489] Avg episode reward: [(0, '4334.001')] [2023-03-09 07:34:41,776][635776] Updated weights for policy 0, policy_version 136640 (0.0005) [2023-03-09 07:34:45,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10171.7, 300 sec: 10163.6). Total num frames: 69992448. Throughput: 0: 10237.2. Samples: 69982096. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:34:45,122][635489] Avg episode reward: [(0, '4521.739')] [2023-03-09 07:34:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000136704_69992448.pth... [2023-03-09 07:34:45,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000136104_69685248.pth [2023-03-09 07:34:45,682][635776] Updated weights for policy 0, policy_version 136720 (0.0005) [2023-03-09 07:34:49,389][635776] Updated weights for policy 0, policy_version 136800 (0.0005) [2023-03-09 07:34:50,122][635489] Fps is (10 sec: 11059.2, 60 sec: 10376.5, 300 sec: 10191.4). Total num frames: 70049792. Throughput: 0: 10282.1. Samples: 70047768. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:34:50,122][635489] Avg episode reward: [(0, '4433.033')] [2023-03-09 07:34:53,129][635776] Updated weights for policy 0, policy_version 136880 (0.0005) [2023-03-09 07:34:55,122][635489] Fps is (10 sec: 11059.3, 60 sec: 10376.5, 300 sec: 10191.4). Total num frames: 70103040. Throughput: 0: 10325.2. Samples: 70080164. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:34:55,122][635489] Avg episode reward: [(0, '4412.899')] [2023-03-09 07:34:57,111][635776] Updated weights for policy 0, policy_version 136960 (0.0005) [2023-03-09 07:35:00,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 10177.5). Total num frames: 70152192. Throughput: 0: 10322.1. Samples: 70141560. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:35:00,122][635489] Avg episode reward: [(0, '4492.466')] [2023-03-09 07:35:00,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000137016_70152192.pth... [2023-03-09 07:35:00,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000136400_69836800.pth [2023-03-09 07:35:00,983][635776] Updated weights for policy 0, policy_version 137040 (0.0005) [2023-03-09 07:35:05,037][635776] Updated weights for policy 0, policy_version 137120 (0.0005) [2023-03-09 07:35:05,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10191.4). Total num frames: 70205440. Throughput: 0: 10354.1. Samples: 70204372. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:35:05,122][635489] Avg episode reward: [(0, '4486.770')] [2023-03-09 07:35:09,110][635776] Updated weights for policy 0, policy_version 137200 (0.0005) [2023-03-09 07:35:10,122][635489] Fps is (10 sec: 10239.8, 60 sec: 10308.3, 300 sec: 10177.5). Total num frames: 70254592. Throughput: 0: 10368.5. Samples: 70234176. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:35:10,122][635489] Avg episode reward: [(0, '4514.804')] [2023-03-09 07:35:13,256][635776] Updated weights for policy 0, policy_version 137280 (0.0005) [2023-03-09 07:35:15,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10308.3, 300 sec: 10163.6). Total num frames: 70303744. Throughput: 0: 10396.0. Samples: 70293484. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:35:15,122][635489] Avg episode reward: [(0, '4152.522')] [2023-03-09 07:35:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000137312_70303744.pth... [2023-03-09 07:35:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000136704_69992448.pth [2023-03-09 07:35:17,433][635776] Updated weights for policy 0, policy_version 137360 (0.0005) [2023-03-09 07:35:20,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10240.0, 300 sec: 10163.6). Total num frames: 70352896. Throughput: 0: 10372.6. Samples: 70352984. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:35:20,122][635489] Avg episode reward: [(0, '3540.782')] [2023-03-09 07:35:21,403][635776] Updated weights for policy 0, policy_version 137440 (0.0005) [2023-03-09 07:35:25,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10191.4). Total num frames: 70406144. Throughput: 0: 10417.9. Samples: 70384840. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:35:25,122][635489] Avg episode reward: [(0, '3450.438')] [2023-03-09 07:35:25,406][635776] Updated weights for policy 0, policy_version 137520 (0.0005) [2023-03-09 07:35:29,391][635776] Updated weights for policy 0, policy_version 137600 (0.0005) [2023-03-09 07:35:30,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10191.4). Total num frames: 70455296. Throughput: 0: 10305.5. Samples: 70445844. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:35:30,122][635489] Avg episode reward: [(0, '3584.597')] [2023-03-09 07:35:30,160][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000137616_70459392.pth... [2023-03-09 07:35:30,162][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000137016_70152192.pth [2023-03-09 07:35:33,456][635776] Updated weights for policy 0, policy_version 137680 (0.0005) [2023-03-09 07:35:35,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10205.3). Total num frames: 70508544. Throughput: 0: 10199.2. Samples: 70506732. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:35:35,122][635489] Avg episode reward: [(0, '3472.740')] [2023-03-09 07:35:37,693][635776] Updated weights for policy 0, policy_version 137760 (0.0005) [2023-03-09 07:35:40,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 10191.4). Total num frames: 70553600. Throughput: 0: 10115.6. Samples: 70535364. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:35:40,122][635489] Avg episode reward: [(0, '3258.050')] [2023-03-09 07:35:41,856][635776] Updated weights for policy 0, policy_version 137840 (0.0005) [2023-03-09 07:35:45,122][635489] Fps is (10 sec: 9830.2, 60 sec: 10240.0, 300 sec: 10205.3). Total num frames: 70606848. Throughput: 0: 10064.8. Samples: 70594476. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:35:45,122][635489] Avg episode reward: [(0, '3449.937')] [2023-03-09 07:35:45,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000137904_70606848.pth... [2023-03-09 07:35:45,129][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000137312_70303744.pth [2023-03-09 07:35:45,949][635776] Updated weights for policy 0, policy_version 137920 (0.0005) [2023-03-09 07:35:50,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10035.2, 300 sec: 10191.4). Total num frames: 70651904. Throughput: 0: 9936.8. Samples: 70651528. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:35:50,122][635489] Avg episode reward: [(0, '3479.026')] [2023-03-09 07:35:50,437][635776] Updated weights for policy 0, policy_version 138000 (0.0005) [2023-03-09 07:35:54,689][635776] Updated weights for policy 0, policy_version 138080 (0.0005) [2023-03-09 07:35:55,121][635489] Fps is (10 sec: 9011.3, 60 sec: 9898.7, 300 sec: 10177.5). Total num frames: 70696960. Throughput: 0: 9906.8. Samples: 70679980. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:35:55,122][635489] Avg episode reward: [(0, '3526.158')] [2023-03-09 07:35:58,892][635776] Updated weights for policy 0, policy_version 138160 (0.0005) [2023-03-09 07:36:00,121][635489] Fps is (10 sec: 9420.9, 60 sec: 9898.7, 300 sec: 10177.5). Total num frames: 70746112. Throughput: 0: 9876.6. Samples: 70737928. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:36:00,122][635489] Avg episode reward: [(0, '3533.340')] [2023-03-09 07:36:00,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000138176_70746112.pth... [2023-03-09 07:36:00,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000137616_70459392.pth [2023-03-09 07:36:03,127][635776] Updated weights for policy 0, policy_version 138240 (0.0005) [2023-03-09 07:36:05,122][635489] Fps is (10 sec: 10239.9, 60 sec: 9898.7, 300 sec: 10177.5). Total num frames: 70799360. Throughput: 0: 9914.1. Samples: 70799120. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:36:05,122][635489] Avg episode reward: [(0, '3756.476')] [2023-03-09 07:36:07,182][635776] Updated weights for policy 0, policy_version 138320 (0.0004) [2023-03-09 07:36:10,122][635489] Fps is (10 sec: 10239.9, 60 sec: 9898.7, 300 sec: 10177.5). Total num frames: 70848512. Throughput: 0: 9839.9. Samples: 70827636. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:36:10,122][635489] Avg episode reward: [(0, '3865.599')] [2023-03-09 07:36:11,299][635776] Updated weights for policy 0, policy_version 138400 (0.0005) [2023-03-09 07:36:15,122][635489] Fps is (10 sec: 9830.3, 60 sec: 9898.7, 300 sec: 10163.6). Total num frames: 70897664. Throughput: 0: 9780.0. Samples: 70885944. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:36:15,122][635489] Avg episode reward: [(0, '3622.314')] [2023-03-09 07:36:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000138472_70897664.pth... [2023-03-09 07:36:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000137904_70606848.pth [2023-03-09 07:36:15,429][635776] Updated weights for policy 0, policy_version 138480 (0.0005) [2023-03-09 07:36:19,444][635776] Updated weights for policy 0, policy_version 138560 (0.0004) [2023-03-09 07:36:20,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10149.7). Total num frames: 70946816. Throughput: 0: 9779.8. Samples: 70946824. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:36:20,122][635489] Avg episode reward: [(0, '3908.412')] [2023-03-09 07:36:23,543][635776] Updated weights for policy 0, policy_version 138640 (0.0004) [2023-03-09 07:36:25,122][635489] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 10149.8). Total num frames: 71000064. Throughput: 0: 9791.9. Samples: 70976000. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:36:25,122][635489] Avg episode reward: [(0, '4359.199')] [2023-03-09 07:36:27,486][635776] Updated weights for policy 0, policy_version 138720 (0.0004) [2023-03-09 07:36:30,121][635489] Fps is (10 sec: 10240.1, 60 sec: 9898.7, 300 sec: 10149.7). Total num frames: 71049216. Throughput: 0: 9877.8. Samples: 71038976. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:36:30,132][635489] Avg episode reward: [(0, '4388.328')] [2023-03-09 07:36:30,135][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000138768_71049216.pth... [2023-03-09 07:36:30,138][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000138176_70746112.pth [2023-03-09 07:36:31,512][635776] Updated weights for policy 0, policy_version 138800 (0.0005) [2023-03-09 07:36:35,121][635489] Fps is (10 sec: 9830.5, 60 sec: 9830.4, 300 sec: 10149.8). Total num frames: 71098368. Throughput: 0: 9938.5. Samples: 71098760. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:36:35,132][635489] Avg episode reward: [(0, '4049.093')] [2023-03-09 07:36:35,767][635776] Updated weights for policy 0, policy_version 138880 (0.0005) [2023-03-09 07:36:39,978][635776] Updated weights for policy 0, policy_version 138960 (0.0004) [2023-03-09 07:36:40,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10135.9). Total num frames: 71147520. Throughput: 0: 9933.8. Samples: 71127004. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:36:40,133][635489] Avg episode reward: [(0, '4307.252')] [2023-03-09 07:36:44,209][635776] Updated weights for policy 0, policy_version 139040 (0.0004) [2023-03-09 07:36:45,122][635489] Fps is (10 sec: 9830.3, 60 sec: 9830.4, 300 sec: 10135.9). Total num frames: 71196672. Throughput: 0: 9946.1. Samples: 71185504. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:36:45,122][635489] Avg episode reward: [(0, '4393.191')] [2023-03-09 07:36:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000139056_71196672.pth... [2023-03-09 07:36:45,129][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000138472_70897664.pth [2023-03-09 07:36:48,560][635776] Updated weights for policy 0, policy_version 139120 (0.0005) [2023-03-09 07:36:50,121][635489] Fps is (10 sec: 9420.9, 60 sec: 9830.4, 300 sec: 10108.1). Total num frames: 71241728. Throughput: 0: 9849.8. Samples: 71242360. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:36:50,122][635489] Avg episode reward: [(0, '4433.804')] [2023-03-09 07:36:52,639][635776] Updated weights for policy 0, policy_version 139200 (0.0005) [2023-03-09 07:36:55,122][635489] Fps is (10 sec: 9420.9, 60 sec: 9898.7, 300 sec: 10108.1). Total num frames: 71290880. Throughput: 0: 9906.4. Samples: 71273424. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:36:55,122][635489] Avg episode reward: [(0, '4432.385')] [2023-03-09 07:36:56,776][635776] Updated weights for policy 0, policy_version 139280 (0.0005) [2023-03-09 07:37:00,121][635489] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 10108.1). Total num frames: 71344128. Throughput: 0: 9974.4. Samples: 71334792. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:37:00,122][635489] Avg episode reward: [(0, '4401.676')] [2023-03-09 07:37:00,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000139344_71344128.pth... [2023-03-09 07:37:00,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000138768_71049216.pth [2023-03-09 07:37:00,640][635776] Updated weights for policy 0, policy_version 139360 (0.0005) [2023-03-09 07:37:04,526][635776] Updated weights for policy 0, policy_version 139440 (0.0005) [2023-03-09 07:37:05,122][635489] Fps is (10 sec: 10649.6, 60 sec: 9966.9, 300 sec: 10108.1). Total num frames: 71397376. Throughput: 0: 10010.1. Samples: 71397276. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:37:05,122][635489] Avg episode reward: [(0, '4514.812')] [2023-03-09 07:37:08,735][635776] Updated weights for policy 0, policy_version 139520 (0.0005) [2023-03-09 07:37:10,122][635489] Fps is (10 sec: 10239.9, 60 sec: 9966.9, 300 sec: 10094.2). Total num frames: 71446528. Throughput: 0: 10001.1. Samples: 71426048. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:37:10,122][635489] Avg episode reward: [(0, '4474.983')] [2023-03-09 07:37:12,597][635776] Updated weights for policy 0, policy_version 139600 (0.0006) [2023-03-09 07:37:15,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10035.2, 300 sec: 10094.2). Total num frames: 71499776. Throughput: 0: 9996.9. Samples: 71488836. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:37:15,122][635489] Avg episode reward: [(0, '4387.369')] [2023-03-09 07:37:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000139648_71499776.pth... [2023-03-09 07:37:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000139056_71196672.pth [2023-03-09 07:37:16,653][635776] Updated weights for policy 0, policy_version 139680 (0.0005) [2023-03-09 07:37:20,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10094.2). Total num frames: 71548928. Throughput: 0: 10003.6. Samples: 71548924. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:37:20,122][635489] Avg episode reward: [(0, '4122.963')] [2023-03-09 07:37:20,730][635776] Updated weights for policy 0, policy_version 139760 (0.0005) [2023-03-09 07:37:24,895][635776] Updated weights for policy 0, policy_version 139840 (0.0004) [2023-03-09 07:37:25,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 10094.2). Total num frames: 71598080. Throughput: 0: 10020.9. Samples: 71577944. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:37:25,122][635489] Avg episode reward: [(0, '4392.847')] [2023-03-09 07:37:28,737][635776] Updated weights for policy 0, policy_version 139920 (0.0005) [2023-03-09 07:37:30,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10108.1). Total num frames: 71651328. Throughput: 0: 10109.4. Samples: 71640428. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:37:30,122][635489] Avg episode reward: [(0, '4409.446')] [2023-03-09 07:37:30,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000139944_71651328.pth... [2023-03-09 07:37:30,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000139344_71344128.pth [2023-03-09 07:37:32,620][635776] Updated weights for policy 0, policy_version 140000 (0.0004) [2023-03-09 07:37:35,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10103.4, 300 sec: 10122.0). Total num frames: 71704576. Throughput: 0: 10208.0. Samples: 71701720. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:37:35,122][635489] Avg episode reward: [(0, '4273.255')] [2023-03-09 07:37:36,815][635776] Updated weights for policy 0, policy_version 140080 (0.0005) [2023-03-09 07:37:40,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10108.1). Total num frames: 71749632. Throughput: 0: 10174.4. Samples: 71731272. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:37:40,122][635489] Avg episode reward: [(0, '4275.795')] [2023-03-09 07:37:41,188][635776] Updated weights for policy 0, policy_version 140160 (0.0005) [2023-03-09 07:37:45,122][635489] Fps is (10 sec: 9420.8, 60 sec: 10035.2, 300 sec: 10108.1). Total num frames: 71798784. Throughput: 0: 10091.0. Samples: 71788888. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:37:45,122][635489] Avg episode reward: [(0, '3899.394')] [2023-03-09 07:37:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000140232_71798784.pth... [2023-03-09 07:37:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000139648_71499776.pth [2023-03-09 07:37:45,311][635776] Updated weights for policy 0, policy_version 140240 (0.0005) [2023-03-09 07:37:49,473][635776] Updated weights for policy 0, policy_version 140320 (0.0005) [2023-03-09 07:37:50,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10103.5, 300 sec: 10094.2). Total num frames: 71847936. Throughput: 0: 10014.9. Samples: 71847948. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:37:50,122][635489] Avg episode reward: [(0, '3986.597')] [2023-03-09 07:37:53,433][635776] Updated weights for policy 0, policy_version 140400 (0.0005) [2023-03-09 07:37:55,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10108.1). Total num frames: 71901184. Throughput: 0: 10089.2. Samples: 71880064. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:37:55,122][635489] Avg episode reward: [(0, '4223.826')] [2023-03-09 07:37:57,354][635776] Updated weights for policy 0, policy_version 140480 (0.0004) [2023-03-09 07:38:00,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10103.5, 300 sec: 10094.2). Total num frames: 71950336. Throughput: 0: 10073.5. Samples: 71942144. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:38:00,122][635489] Avg episode reward: [(0, '4168.499')] [2023-03-09 07:38:00,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000140536_71954432.pth... [2023-03-09 07:38:00,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000139944_71651328.pth [2023-03-09 07:38:01,166][635776] Updated weights for policy 0, policy_version 140560 (0.0005) [2023-03-09 07:38:05,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10103.5, 300 sec: 10108.1). Total num frames: 72003584. Throughput: 0: 10124.6. Samples: 72004532. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:38:05,122][635489] Avg episode reward: [(0, '4242.666')] [2023-03-09 07:38:05,155][635776] Updated weights for policy 0, policy_version 140640 (0.0005) [2023-03-09 07:38:08,931][635776] Updated weights for policy 0, policy_version 140720 (0.0005) [2023-03-09 07:38:10,121][635489] Fps is (10 sec: 10649.6, 60 sec: 10171.8, 300 sec: 10122.0). Total num frames: 72056832. Throughput: 0: 10196.0. Samples: 72036764. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:38:10,122][635489] Avg episode reward: [(0, '4258.792')] [2023-03-09 07:38:12,935][635776] Updated weights for policy 0, policy_version 140800 (0.0004) [2023-03-09 07:38:15,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10171.7, 300 sec: 10135.9). Total num frames: 72110080. Throughput: 0: 10180.8. Samples: 72098564. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:38:15,122][635489] Avg episode reward: [(0, '4261.151')] [2023-03-09 07:38:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000140840_72110080.pth... [2023-03-09 07:38:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000140232_71798784.pth [2023-03-09 07:38:16,986][635776] Updated weights for policy 0, policy_version 140880 (0.0005) [2023-03-09 07:38:20,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10122.0). Total num frames: 72159232. Throughput: 0: 10172.9. Samples: 72159500. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:38:20,122][635489] Avg episode reward: [(0, '3984.750')] [2023-03-09 07:38:21,189][635776] Updated weights for policy 0, policy_version 140960 (0.0005) [2023-03-09 07:38:25,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10171.7, 300 sec: 10108.1). Total num frames: 72208384. Throughput: 0: 10158.8. Samples: 72188420. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:38:25,122][635489] Avg episode reward: [(0, '4059.345')] [2023-03-09 07:38:25,237][635776] Updated weights for policy 0, policy_version 141040 (0.0005) [2023-03-09 07:38:29,168][635776] Updated weights for policy 0, policy_version 141120 (0.0005) [2023-03-09 07:38:30,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 10122.0). Total num frames: 72261632. Throughput: 0: 10257.0. Samples: 72250452. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:38:30,122][635489] Avg episode reward: [(0, '3918.264')] [2023-03-09 07:38:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000141136_72261632.pth... [2023-03-09 07:38:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000140536_71954432.pth [2023-03-09 07:38:33,113][635776] Updated weights for policy 0, policy_version 141200 (0.0005) [2023-03-09 07:38:35,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10108.1). Total num frames: 72310784. Throughput: 0: 10305.7. Samples: 72311704. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:38:35,122][635489] Avg episode reward: [(0, '3968.035')] [2023-03-09 07:38:37,294][635776] Updated weights for policy 0, policy_version 141280 (0.0005) [2023-03-09 07:38:40,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10108.1). Total num frames: 72364032. Throughput: 0: 10262.7. Samples: 72341884. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:38:40,122][635489] Avg episode reward: [(0, '4074.671')] [2023-03-09 07:38:41,258][635776] Updated weights for policy 0, policy_version 141360 (0.0004) [2023-03-09 07:38:45,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10122.0). Total num frames: 72413184. Throughput: 0: 10259.3. Samples: 72403812. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:38:45,122][635489] Avg episode reward: [(0, '3836.362')] [2023-03-09 07:38:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000141432_72413184.pth... [2023-03-09 07:38:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000140840_72110080.pth [2023-03-09 07:38:45,323][635776] Updated weights for policy 0, policy_version 141440 (0.0004) [2023-03-09 07:38:49,629][635776] Updated weights for policy 0, policy_version 141520 (0.0004) [2023-03-09 07:38:50,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 10108.1). Total num frames: 72462336. Throughput: 0: 10154.1. Samples: 72461468. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:38:50,122][635489] Avg episode reward: [(0, '3936.103')] [2023-03-09 07:38:53,704][635776] Updated weights for policy 0, policy_version 141600 (0.0005) [2023-03-09 07:38:55,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10171.8, 300 sec: 10094.2). Total num frames: 72511488. Throughput: 0: 10073.6. Samples: 72490076. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:38:55,122][635489] Avg episode reward: [(0, '3887.050')] [2023-03-09 07:38:57,507][635776] Updated weights for policy 0, policy_version 141680 (0.0004) [2023-03-09 07:39:00,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10094.2). Total num frames: 72564736. Throughput: 0: 10102.9. Samples: 72553196. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:39:00,122][635489] Avg episode reward: [(0, '3867.610')] [2023-03-09 07:39:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000141728_72564736.pth... [2023-03-09 07:39:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000141136_72261632.pth [2023-03-09 07:39:01,500][635776] Updated weights for policy 0, policy_version 141760 (0.0005) [2023-03-09 07:39:05,122][635489] Fps is (10 sec: 10649.4, 60 sec: 10240.0, 300 sec: 10108.1). Total num frames: 72617984. Throughput: 0: 10155.4. Samples: 72616496. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:39:05,122][635489] Avg episode reward: [(0, '3929.180')] [2023-03-09 07:39:05,501][635776] Updated weights for policy 0, policy_version 141840 (0.0005) [2023-03-09 07:39:09,282][635776] Updated weights for policy 0, policy_version 141920 (0.0005) [2023-03-09 07:39:10,122][635489] Fps is (10 sec: 10649.7, 60 sec: 10240.0, 300 sec: 10122.0). Total num frames: 72671232. Throughput: 0: 10185.9. Samples: 72646784. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:39:10,122][635489] Avg episode reward: [(0, '4234.543')] [2023-03-09 07:39:13,409][635776] Updated weights for policy 0, policy_version 142000 (0.0004) [2023-03-09 07:39:15,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10108.1). Total num frames: 72720384. Throughput: 0: 10192.1. Samples: 72709096. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:39:15,122][635489] Avg episode reward: [(0, '4334.454')] [2023-03-09 07:39:15,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000142032_72720384.pth... [2023-03-09 07:39:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000141432_72413184.pth [2023-03-09 07:39:16,951][635776] Updated weights for policy 0, policy_version 142080 (0.0005) [2023-03-09 07:39:20,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10122.0). Total num frames: 72773632. Throughput: 0: 10260.5. Samples: 72773428. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:39:20,122][635489] Avg episode reward: [(0, '4215.754')] [2023-03-09 07:39:21,257][635776] Updated weights for policy 0, policy_version 142160 (0.0004) [2023-03-09 07:39:25,122][635489] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10122.0). Total num frames: 72822784. Throughput: 0: 10233.0. Samples: 72802368. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:39:25,122][635489] Avg episode reward: [(0, '4203.348')] [2023-03-09 07:39:25,403][635776] Updated weights for policy 0, policy_version 142240 (0.0005) [2023-03-09 07:39:29,436][635776] Updated weights for policy 0, policy_version 142320 (0.0004) [2023-03-09 07:39:30,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10122.0). Total num frames: 72871936. Throughput: 0: 10197.3. Samples: 72862692. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:39:30,122][635489] Avg episode reward: [(0, '4448.514')] [2023-03-09 07:39:30,131][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000142336_72876032.pth... [2023-03-09 07:39:30,132][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000141728_72564736.pth [2023-03-09 07:39:33,511][635776] Updated weights for policy 0, policy_version 142400 (0.0005) [2023-03-09 07:39:35,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10108.1). Total num frames: 72921088. Throughput: 0: 10248.5. Samples: 72922652. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:39:35,122][635489] Avg episode reward: [(0, '4432.428')] [2023-03-09 07:39:37,644][635776] Updated weights for policy 0, policy_version 142480 (0.0005) [2023-03-09 07:39:40,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 10108.1). Total num frames: 72974336. Throughput: 0: 10278.2. Samples: 72952596. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:39:40,133][635489] Avg episode reward: [(0, '4383.610')] [2023-03-09 07:39:41,471][635776] Updated weights for policy 0, policy_version 142560 (0.0005) [2023-03-09 07:39:45,122][635489] Fps is (10 sec: 10649.4, 60 sec: 10240.0, 300 sec: 10094.2). Total num frames: 73027584. Throughput: 0: 10281.5. Samples: 73015864. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:39:45,133][635489] Avg episode reward: [(0, '4425.257')] [2023-03-09 07:39:45,136][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000142632_73027584.pth... [2023-03-09 07:39:45,139][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000142032_72720384.pth [2023-03-09 07:39:45,446][635776] Updated weights for policy 0, policy_version 142640 (0.0005) [2023-03-09 07:39:49,445][635776] Updated weights for policy 0, policy_version 142720 (0.0005) [2023-03-09 07:39:50,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10080.3). Total num frames: 73076736. Throughput: 0: 10234.0. Samples: 73077024. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:39:50,132][635489] Avg episode reward: [(0, '4387.726')] [2023-03-09 07:39:53,278][635776] Updated weights for policy 0, policy_version 142800 (0.0005) [2023-03-09 07:39:55,121][635489] Fps is (10 sec: 10240.2, 60 sec: 10308.3, 300 sec: 10094.2). Total num frames: 73129984. Throughput: 0: 10281.3. Samples: 73109440. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 07:39:55,132][635489] Avg episode reward: [(0, '4437.103')] [2023-03-09 07:39:57,330][635776] Updated weights for policy 0, policy_version 142880 (0.0005) [2023-03-09 07:40:00,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10080.3). Total num frames: 73179136. Throughput: 0: 10263.5. Samples: 73170952. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 07:40:00,133][635489] Avg episode reward: [(0, '4500.861')] [2023-03-09 07:40:00,136][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000142936_73183232.pth... [2023-03-09 07:40:00,138][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000142336_72876032.pth [2023-03-09 07:40:01,298][635776] Updated weights for policy 0, policy_version 142960 (0.0005) [2023-03-09 07:40:05,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10094.2). Total num frames: 73232384. Throughput: 0: 10251.4. Samples: 73234740. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 07:40:05,132][635489] Avg episode reward: [(0, '4468.695')] [2023-03-09 07:40:05,145][635776] Updated weights for policy 0, policy_version 143040 (0.0005) [2023-03-09 07:40:08,685][635776] Updated weights for policy 0, policy_version 143120 (0.0005) [2023-03-09 07:40:10,122][635489] Fps is (10 sec: 11059.2, 60 sec: 10308.3, 300 sec: 10122.0). Total num frames: 73289728. Throughput: 0: 10350.0. Samples: 73268120. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 07:40:10,133][635489] Avg episode reward: [(0, '4467.298')] [2023-03-09 07:40:12,771][635776] Updated weights for policy 0, policy_version 143200 (0.0004) [2023-03-09 07:40:15,122][635489] Fps is (10 sec: 10649.4, 60 sec: 10308.3, 300 sec: 10122.0). Total num frames: 73338880. Throughput: 0: 10371.9. Samples: 73329428. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 07:40:15,133][635489] Avg episode reward: [(0, '4493.202')] [2023-03-09 07:40:15,142][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000143248_73342976.pth... [2023-03-09 07:40:15,144][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000142632_73027584.pth [2023-03-09 07:40:16,641][635776] Updated weights for policy 0, policy_version 143280 (0.0005) [2023-03-09 07:40:20,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10135.9). Total num frames: 73396224. Throughput: 0: 10498.8. Samples: 73395100. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 07:40:20,122][635489] Avg episode reward: [(0, '4462.342')] [2023-03-09 07:40:20,420][635776] Updated weights for policy 0, policy_version 143360 (0.0005) [2023-03-09 07:40:24,478][635776] Updated weights for policy 0, policy_version 143440 (0.0005) [2023-03-09 07:40:25,121][635489] Fps is (10 sec: 10649.9, 60 sec: 10376.6, 300 sec: 10135.9). Total num frames: 73445376. Throughput: 0: 10486.1. Samples: 73424468. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 07:40:25,122][635489] Avg episode reward: [(0, '4263.295')] [2023-03-09 07:40:28,471][635776] Updated weights for policy 0, policy_version 143520 (0.0006) [2023-03-09 07:40:30,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10135.9). Total num frames: 73498624. Throughput: 0: 10459.9. Samples: 73486560. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 07:40:30,122][635489] Avg episode reward: [(0, '4303.128')] [2023-03-09 07:40:30,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000143552_73498624.pth... [2023-03-09 07:40:30,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000142936_73183232.pth [2023-03-09 07:40:32,159][635776] Updated weights for policy 0, policy_version 143600 (0.0005) [2023-03-09 07:40:35,122][635489] Fps is (10 sec: 10649.4, 60 sec: 10513.1, 300 sec: 10163.6). Total num frames: 73551872. Throughput: 0: 10552.1. Samples: 73551868. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 07:40:35,122][635489] Avg episode reward: [(0, '4419.722')] [2023-03-09 07:40:36,081][635776] Updated weights for policy 0, policy_version 143680 (0.0006) [2023-03-09 07:40:39,809][635776] Updated weights for policy 0, policy_version 143760 (0.0005) [2023-03-09 07:40:40,121][635489] Fps is (10 sec: 10649.7, 60 sec: 10513.1, 300 sec: 10163.6). Total num frames: 73605120. Throughput: 0: 10555.9. Samples: 73584456. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 07:40:40,122][635489] Avg episode reward: [(0, '4389.956')] [2023-03-09 07:40:43,790][635776] Updated weights for policy 0, policy_version 143840 (0.0005) [2023-03-09 07:40:45,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10191.4). Total num frames: 73658368. Throughput: 0: 10575.9. Samples: 73646868. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 07:40:45,122][635489] Avg episode reward: [(0, '4170.743')] [2023-03-09 07:40:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000143864_73658368.pth... [2023-03-09 07:40:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000143248_73342976.pth [2023-03-09 07:40:47,809][635776] Updated weights for policy 0, policy_version 143920 (0.0005) [2023-03-09 07:40:50,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10513.0, 300 sec: 10205.3). Total num frames: 73707520. Throughput: 0: 10506.7. Samples: 73707544. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 07:40:50,122][635489] Avg episode reward: [(0, '4317.295')] [2023-03-09 07:40:52,038][635776] Updated weights for policy 0, policy_version 144000 (0.0005) [2023-03-09 07:40:55,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10444.8, 300 sec: 10205.3). Total num frames: 73756672. Throughput: 0: 10403.0. Samples: 73736256. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:40:55,122][635489] Avg episode reward: [(0, '4209.394')] [2023-03-09 07:40:55,960][635776] Updated weights for policy 0, policy_version 144080 (0.0006) [2023-03-09 07:40:59,831][635776] Updated weights for policy 0, policy_version 144160 (0.0005) [2023-03-09 07:41:00,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10205.3). Total num frames: 73809920. Throughput: 0: 10448.9. Samples: 73799628. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:41:00,122][635489] Avg episode reward: [(0, '4077.117')] [2023-03-09 07:41:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000144160_73809920.pth... [2023-03-09 07:41:00,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000143552_73498624.pth [2023-03-09 07:41:03,778][635776] Updated weights for policy 0, policy_version 144240 (0.0005) [2023-03-09 07:41:05,122][635489] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10205.3). Total num frames: 73859072. Throughput: 0: 10340.3. Samples: 73860412. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:41:05,122][635489] Avg episode reward: [(0, '4002.366')] [2023-03-09 07:41:07,833][635776] Updated weights for policy 0, policy_version 144320 (0.0005) [2023-03-09 07:41:10,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10376.6, 300 sec: 10219.2). Total num frames: 73912320. Throughput: 0: 10391.0. Samples: 73892064. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:41:10,122][635489] Avg episode reward: [(0, '4194.386')] [2023-03-09 07:41:11,810][635776] Updated weights for policy 0, policy_version 144400 (0.0005) [2023-03-09 07:41:15,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10233.1). Total num frames: 73965568. Throughput: 0: 10391.4. Samples: 73954172. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:41:15,122][635489] Avg episode reward: [(0, '4201.852')] [2023-03-09 07:41:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000144464_73965568.pth... [2023-03-09 07:41:15,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000143864_73658368.pth [2023-03-09 07:41:15,778][635776] Updated weights for policy 0, policy_version 144480 (0.0005) [2023-03-09 07:41:19,788][635776] Updated weights for policy 0, policy_version 144560 (0.0006) [2023-03-09 07:41:20,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10219.2). Total num frames: 74014720. Throughput: 0: 10306.4. Samples: 74015656. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:41:20,122][635489] Avg episode reward: [(0, '4278.597')] [2023-03-09 07:41:23,720][635776] Updated weights for policy 0, policy_version 144640 (0.0005) [2023-03-09 07:41:25,122][635489] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10233.1). Total num frames: 74067968. Throughput: 0: 10291.2. Samples: 74047560. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:41:25,122][635489] Avg episode reward: [(0, '4414.615')] [2023-03-09 07:41:27,384][635776] Updated weights for policy 0, policy_version 144720 (0.0005) [2023-03-09 07:41:30,121][635489] Fps is (10 sec: 11059.2, 60 sec: 10444.8, 300 sec: 10260.8). Total num frames: 74125312. Throughput: 0: 10402.1. Samples: 74114960. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:41:30,122][635489] Avg episode reward: [(0, '4457.906')] [2023-03-09 07:41:30,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000144776_74125312.pth... [2023-03-09 07:41:30,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000144160_73809920.pth [2023-03-09 07:41:31,063][635776] Updated weights for policy 0, policy_version 144800 (0.0005) [2023-03-09 07:41:35,110][635776] Updated weights for policy 0, policy_version 144880 (0.0005) [2023-03-09 07:41:35,122][635489] Fps is (10 sec: 11059.1, 60 sec: 10444.8, 300 sec: 10274.7). Total num frames: 74178560. Throughput: 0: 10432.3. Samples: 74177000. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:41:35,122][635489] Avg episode reward: [(0, '4386.861')] [2023-03-09 07:41:39,290][635776] Updated weights for policy 0, policy_version 144960 (0.0005) [2023-03-09 07:41:40,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 10274.7). Total num frames: 74227712. Throughput: 0: 10423.4. Samples: 74205308. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:41:40,122][635489] Avg episode reward: [(0, '4435.428')] [2023-03-09 07:41:43,113][635776] Updated weights for policy 0, policy_version 145040 (0.0004) [2023-03-09 07:41:45,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10302.5). Total num frames: 74280960. Throughput: 0: 10421.4. Samples: 74268592. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:41:45,122][635489] Avg episode reward: [(0, '4490.487')] [2023-03-09 07:41:45,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000145080_74280960.pth... [2023-03-09 07:41:45,129][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000144464_73965568.pth [2023-03-09 07:41:47,095][635776] Updated weights for policy 0, policy_version 145120 (0.0005) [2023-03-09 07:41:50,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10316.4). Total num frames: 74334208. Throughput: 0: 10475.7. Samples: 74331820. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:41:50,122][635489] Avg episode reward: [(0, '4376.456')] [2023-03-09 07:41:50,905][635776] Updated weights for policy 0, policy_version 145200 (0.0004) [2023-03-09 07:41:55,025][635776] Updated weights for policy 0, policy_version 145280 (0.0006) [2023-03-09 07:41:55,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10302.5). Total num frames: 74383360. Throughput: 0: 10460.5. Samples: 74362788. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:41:55,122][635489] Avg episode reward: [(0, '4331.101')] [2023-03-09 07:41:59,068][635776] Updated weights for policy 0, policy_version 145360 (0.0005) [2023-03-09 07:42:00,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10376.5, 300 sec: 10288.6). Total num frames: 74432512. Throughput: 0: 10422.9. Samples: 74423200. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 07:42:00,122][635489] Avg episode reward: [(0, '4274.704')] [2023-03-09 07:42:00,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000145376_74432512.pth... [2023-03-09 07:42:00,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000144776_74125312.pth [2023-03-09 07:42:02,841][635776] Updated weights for policy 0, policy_version 145440 (0.0005) [2023-03-09 07:42:05,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10302.5). Total num frames: 74485760. Throughput: 0: 10448.2. Samples: 74485824. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 07:42:05,122][635489] Avg episode reward: [(0, '4322.020')] [2023-03-09 07:42:06,686][635776] Updated weights for policy 0, policy_version 145520 (0.0004) [2023-03-09 07:42:10,121][635489] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10302.5). Total num frames: 74539008. Throughput: 0: 10465.1. Samples: 74518488. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 07:42:10,122][635489] Avg episode reward: [(0, '4374.222')] [2023-03-09 07:42:10,825][635776] Updated weights for policy 0, policy_version 145600 (0.0005) [2023-03-09 07:42:14,808][635776] Updated weights for policy 0, policy_version 145680 (0.0005) [2023-03-09 07:42:15,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 10302.5). Total num frames: 74588160. Throughput: 0: 10295.7. Samples: 74578268. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 07:42:15,122][635489] Avg episode reward: [(0, '4503.703')] [2023-03-09 07:42:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000145680_74588160.pth... [2023-03-09 07:42:15,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000145080_74280960.pth [2023-03-09 07:42:18,768][635776] Updated weights for policy 0, policy_version 145760 (0.0005) [2023-03-09 07:42:20,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10316.4). Total num frames: 74641408. Throughput: 0: 10323.9. Samples: 74641572. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 07:42:20,122][635489] Avg episode reward: [(0, '4435.364')] [2023-03-09 07:42:22,597][635776] Updated weights for policy 0, policy_version 145840 (0.0005) [2023-03-09 07:42:25,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10316.4). Total num frames: 74694656. Throughput: 0: 10414.6. Samples: 74673964. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 07:42:25,122][635489] Avg episode reward: [(0, '4225.715')] [2023-03-09 07:42:26,647][635776] Updated weights for policy 0, policy_version 145920 (0.0005) [2023-03-09 07:42:30,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10308.3, 300 sec: 10302.5). Total num frames: 74743808. Throughput: 0: 10287.5. Samples: 74731528. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 07:42:30,122][635489] Avg episode reward: [(0, '4277.635')] [2023-03-09 07:42:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000145984_74743808.pth... [2023-03-09 07:42:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000145376_74432512.pth [2023-03-09 07:42:30,805][635776] Updated weights for policy 0, policy_version 146000 (0.0005) [2023-03-09 07:42:34,593][635776] Updated weights for policy 0, policy_version 146080 (0.0005) [2023-03-09 07:42:35,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 10330.3). Total num frames: 74797056. Throughput: 0: 10335.8. Samples: 74796928. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 07:42:35,122][635489] Avg episode reward: [(0, '4276.391')] [2023-03-09 07:42:38,464][635776] Updated weights for policy 0, policy_version 146160 (0.0005) [2023-03-09 07:42:40,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 10330.3). Total num frames: 74846208. Throughput: 0: 10325.3. Samples: 74827424. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 07:42:40,122][635489] Avg episode reward: [(0, '4432.810')] [2023-03-09 07:42:42,334][635776] Updated weights for policy 0, policy_version 146240 (0.0006) [2023-03-09 07:42:45,122][635489] Fps is (10 sec: 10649.4, 60 sec: 10376.5, 300 sec: 10358.0). Total num frames: 74903552. Throughput: 0: 10425.7. Samples: 74892356. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 07:42:45,122][635489] Avg episode reward: [(0, '4354.429')] [2023-03-09 07:42:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000146296_74903552.pth... [2023-03-09 07:42:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000145680_74588160.pth [2023-03-09 07:42:46,179][635776] Updated weights for policy 0, policy_version 146320 (0.0005) [2023-03-09 07:42:50,009][635776] Updated weights for policy 0, policy_version 146400 (0.0005) [2023-03-09 07:42:50,121][635489] Fps is (10 sec: 11059.2, 60 sec: 10376.6, 300 sec: 10358.0). Total num frames: 74956800. Throughput: 0: 10463.3. Samples: 74956672. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 07:42:50,122][635489] Avg episode reward: [(0, '4404.049')] [2023-03-09 07:42:53,901][635776] Updated weights for policy 0, policy_version 146480 (0.0005) [2023-03-09 07:42:55,122][635489] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10358.0). Total num frames: 75005952. Throughput: 0: 10382.6. Samples: 74985704. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 07:42:55,122][635489] Avg episode reward: [(0, '4488.749')] [2023-03-09 07:42:57,766][635776] Updated weights for policy 0, policy_version 146560 (0.0005) [2023-03-09 07:43:00,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10371.9). Total num frames: 75063296. Throughput: 0: 10504.5. Samples: 75050972. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:43:00,122][635489] Avg episode reward: [(0, '4510.438')] [2023-03-09 07:43:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000146608_75063296.pth... [2023-03-09 07:43:00,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000145984_74743808.pth [2023-03-09 07:43:01,696][635776] Updated weights for policy 0, policy_version 146640 (0.0005) [2023-03-09 07:43:05,121][635489] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10358.0). Total num frames: 75112448. Throughput: 0: 10470.7. Samples: 75112756. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:43:05,122][635489] Avg episode reward: [(0, '4532.876')] [2023-03-09 07:43:05,603][635776] Updated weights for policy 0, policy_version 146720 (0.0005) [2023-03-09 07:43:09,473][635776] Updated weights for policy 0, policy_version 146800 (0.0005) [2023-03-09 07:43:10,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10358.0). Total num frames: 75165696. Throughput: 0: 10473.7. Samples: 75145280. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:43:10,122][635489] Avg episode reward: [(0, '4385.210')] [2023-03-09 07:43:13,303][635776] Updated weights for policy 0, policy_version 146880 (0.0004) [2023-03-09 07:43:15,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10371.9). Total num frames: 75218944. Throughput: 0: 10589.4. Samples: 75208052. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:43:15,122][635489] Avg episode reward: [(0, '4491.401')] [2023-03-09 07:43:15,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000146912_75218944.pth... [2023-03-09 07:43:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000146296_74903552.pth [2023-03-09 07:43:17,597][635776] Updated weights for policy 0, policy_version 146960 (0.0004) [2023-03-09 07:43:20,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10371.9). Total num frames: 75268096. Throughput: 0: 10472.0. Samples: 75268168. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:43:20,122][635489] Avg episode reward: [(0, '4492.142')] [2023-03-09 07:43:21,501][635776] Updated weights for policy 0, policy_version 147040 (0.0005) [2023-03-09 07:43:25,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10371.9). Total num frames: 75321344. Throughput: 0: 10450.1. Samples: 75297680. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:43:25,122][635489] Avg episode reward: [(0, '4416.755')] [2023-03-09 07:43:25,521][635776] Updated weights for policy 0, policy_version 147120 (0.0004) [2023-03-09 07:43:29,289][635776] Updated weights for policy 0, policy_version 147200 (0.0004) [2023-03-09 07:43:30,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10385.8). Total num frames: 75374592. Throughput: 0: 10442.8. Samples: 75362280. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:43:30,122][635489] Avg episode reward: [(0, '4488.139')] [2023-03-09 07:43:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000147216_75374592.pth... [2023-03-09 07:43:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000146608_75063296.pth [2023-03-09 07:43:33,060][635776] Updated weights for policy 0, policy_version 147280 (0.0005) [2023-03-09 07:43:35,122][635489] Fps is (10 sec: 10649.4, 60 sec: 10513.0, 300 sec: 10385.8). Total num frames: 75427840. Throughput: 0: 10447.4. Samples: 75426808. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:43:35,122][635489] Avg episode reward: [(0, '4350.590')] [2023-03-09 07:43:37,021][635776] Updated weights for policy 0, policy_version 147360 (0.0005) [2023-03-09 07:43:40,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10399.7). Total num frames: 75481088. Throughput: 0: 10469.6. Samples: 75456836. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:43:40,122][635489] Avg episode reward: [(0, '4437.405')] [2023-03-09 07:43:40,953][635776] Updated weights for policy 0, policy_version 147440 (0.0005) [2023-03-09 07:43:44,982][635776] Updated weights for policy 0, policy_version 147520 (0.0005) [2023-03-09 07:43:45,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10399.7). Total num frames: 75530240. Throughput: 0: 10397.3. Samples: 75518848. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:43:45,122][635489] Avg episode reward: [(0, '4436.177')] [2023-03-09 07:43:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000147520_75530240.pth... [2023-03-09 07:43:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000146912_75218944.pth [2023-03-09 07:43:48,994][635776] Updated weights for policy 0, policy_version 147600 (0.0005) [2023-03-09 07:43:50,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10413.6). Total num frames: 75583488. Throughput: 0: 10413.3. Samples: 75581356. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:43:50,122][635489] Avg episode reward: [(0, '4460.191')] [2023-03-09 07:43:52,769][635776] Updated weights for policy 0, policy_version 147680 (0.0005) [2023-03-09 07:43:55,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10399.7). Total num frames: 75632640. Throughput: 0: 10401.0. Samples: 75613324. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:43:55,122][635489] Avg episode reward: [(0, '4529.902')] [2023-03-09 07:43:56,645][635776] Updated weights for policy 0, policy_version 147760 (0.0005) [2023-03-09 07:44:00,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 10399.7). Total num frames: 75685888. Throughput: 0: 10392.8. Samples: 75675728. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:44:00,122][635489] Avg episode reward: [(0, '4523.372')] [2023-03-09 07:44:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000147824_75685888.pth... [2023-03-09 07:44:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000147216_75374592.pth [2023-03-09 07:44:00,838][635776] Updated weights for policy 0, policy_version 147840 (0.0005) [2023-03-09 07:44:04,942][635776] Updated weights for policy 0, policy_version 147920 (0.0005) [2023-03-09 07:44:05,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 10385.8). Total num frames: 75735040. Throughput: 0: 10370.1. Samples: 75734824. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:44:05,122][635489] Avg episode reward: [(0, '4534.745')] [2023-03-09 07:44:09,033][635776] Updated weights for policy 0, policy_version 148000 (0.0005) [2023-03-09 07:44:10,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10308.3, 300 sec: 10385.8). Total num frames: 75784192. Throughput: 0: 10357.8. Samples: 75763784. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:44:10,122][635489] Avg episode reward: [(0, '4472.206')] [2023-03-09 07:44:10,296][635732] Early stopping after 2 epochs (8 sgd steps), loss delta 0.0000008 [2023-03-09 07:44:13,175][635776] Updated weights for policy 0, policy_version 148080 (0.0005) [2023-03-09 07:44:15,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10240.0, 300 sec: 10371.9). Total num frames: 75833344. Throughput: 0: 10285.3. Samples: 75825116. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:44:15,122][635489] Avg episode reward: [(0, '4528.141')] [2023-03-09 07:44:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000148112_75833344.pth... [2023-03-09 07:44:15,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000147520_75530240.pth [2023-03-09 07:44:17,269][635776] Updated weights for policy 0, policy_version 148160 (0.0005) [2023-03-09 07:44:20,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 10371.9). Total num frames: 75882496. Throughput: 0: 10149.1. Samples: 75883516. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:44:20,122][635489] Avg episode reward: [(0, '4534.560')] [2023-03-09 07:44:21,411][635776] Updated weights for policy 0, policy_version 148240 (0.0005) [2023-03-09 07:44:25,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10171.7, 300 sec: 10371.9). Total num frames: 75931648. Throughput: 0: 10148.2. Samples: 75913504. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:44:25,122][635489] Avg episode reward: [(0, '4497.476')] [2023-03-09 07:44:25,661][635776] Updated weights for policy 0, policy_version 148320 (0.0005) [2023-03-09 07:44:29,671][635776] Updated weights for policy 0, policy_version 148400 (0.0005) [2023-03-09 07:44:30,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 10385.8). Total num frames: 75984896. Throughput: 0: 10085.1. Samples: 75972680. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:44:30,122][635489] Avg episode reward: [(0, '4573.430')] [2023-03-09 07:44:30,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000148408_75984896.pth... [2023-03-09 07:44:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000147824_75685888.pth [2023-03-09 07:44:33,618][635776] Updated weights for policy 0, policy_version 148480 (0.0005) [2023-03-09 07:44:35,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10371.9). Total num frames: 76034048. Throughput: 0: 10071.3. Samples: 76034564. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:44:35,122][635489] Avg episode reward: [(0, '4575.479')] [2023-03-09 07:44:37,455][635776] Updated weights for policy 0, policy_version 148560 (0.0004) [2023-03-09 07:44:40,122][635489] Fps is (10 sec: 10240.1, 60 sec: 10103.5, 300 sec: 10371.9). Total num frames: 76087296. Throughput: 0: 10079.0. Samples: 76066880. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:44:40,122][635489] Avg episode reward: [(0, '4567.147')] [2023-03-09 07:44:41,334][635776] Updated weights for policy 0, policy_version 148640 (0.0005) [2023-03-09 07:44:45,121][635489] Fps is (10 sec: 10649.6, 60 sec: 10171.7, 300 sec: 10385.8). Total num frames: 76140544. Throughput: 0: 10086.2. Samples: 76129608. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:44:45,122][635489] Avg episode reward: [(0, '4566.510')] [2023-03-09 07:44:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000148712_76140544.pth... [2023-03-09 07:44:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000148112_75833344.pth [2023-03-09 07:44:45,267][635776] Updated weights for policy 0, policy_version 148720 (0.0005) [2023-03-09 07:44:49,263][635776] Updated weights for policy 0, policy_version 148800 (0.0006) [2023-03-09 07:44:50,121][635489] Fps is (10 sec: 10649.7, 60 sec: 10171.7, 300 sec: 10385.8). Total num frames: 76193792. Throughput: 0: 10156.7. Samples: 76191876. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:44:50,122][635489] Avg episode reward: [(0, '4554.256')] [2023-03-09 07:44:53,358][635776] Updated weights for policy 0, policy_version 148880 (0.0005) [2023-03-09 07:44:55,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10240.0, 300 sec: 10399.7). Total num frames: 76247040. Throughput: 0: 10191.9. Samples: 76222420. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:44:55,122][635489] Avg episode reward: [(0, '4556.037')] [2023-03-09 07:44:57,061][635776] Updated weights for policy 0, policy_version 148960 (0.0005) [2023-03-09 07:45:00,122][635489] Fps is (10 sec: 10649.3, 60 sec: 10240.0, 300 sec: 10399.7). Total num frames: 76300288. Throughput: 0: 10257.2. Samples: 76286692. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:45:00,122][635489] Avg episode reward: [(0, '4578.093')] [2023-03-09 07:45:00,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000149024_76300288.pth... [2023-03-09 07:45:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000148408_75984896.pth [2023-03-09 07:45:00,877][635776] Updated weights for policy 0, policy_version 149040 (0.0005) [2023-03-09 07:45:04,743][635776] Updated weights for policy 0, policy_version 149120 (0.0005) [2023-03-09 07:45:05,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10371.9). Total num frames: 76349440. Throughput: 0: 10389.3. Samples: 76351036. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:45:05,122][635489] Avg episode reward: [(0, '4558.744')] [2023-03-09 07:45:08,670][635776] Updated weights for policy 0, policy_version 149200 (0.0005) [2023-03-09 07:45:10,122][635489] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 10385.8). Total num frames: 76402688. Throughput: 0: 10417.0. Samples: 76382272. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:45:10,122][635489] Avg episode reward: [(0, '4506.859')] [2023-03-09 07:45:12,418][635776] Updated weights for policy 0, policy_version 149280 (0.0005) [2023-03-09 07:45:15,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10371.9). Total num frames: 76455936. Throughput: 0: 10555.7. Samples: 76447684. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:45:15,122][635489] Avg episode reward: [(0, '4495.521')] [2023-03-09 07:45:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000149328_76455936.pth... [2023-03-09 07:45:15,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000148712_76140544.pth [2023-03-09 07:45:16,278][635776] Updated weights for policy 0, policy_version 149360 (0.0005) [2023-03-09 07:45:20,121][635489] Fps is (10 sec: 10649.7, 60 sec: 10444.8, 300 sec: 10385.8). Total num frames: 76509184. Throughput: 0: 10551.9. Samples: 76509400. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:45:20,122][635489] Avg episode reward: [(0, '4530.393')] [2023-03-09 07:45:20,273][635776] Updated weights for policy 0, policy_version 149440 (0.0005) [2023-03-09 07:45:24,522][635776] Updated weights for policy 0, policy_version 149520 (0.0004) [2023-03-09 07:45:25,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10371.9). Total num frames: 76558336. Throughput: 0: 10483.7. Samples: 76538648. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:45:25,122][635489] Avg episode reward: [(0, '4486.585')] [2023-03-09 07:45:28,679][635776] Updated weights for policy 0, policy_version 149600 (0.0004) [2023-03-09 07:45:30,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10376.5, 300 sec: 10358.0). Total num frames: 76607488. Throughput: 0: 10395.1. Samples: 76597388. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:45:30,122][635489] Avg episode reward: [(0, '4491.654')] [2023-03-09 07:45:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000149624_76607488.pth... [2023-03-09 07:45:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000149024_76300288.pth [2023-03-09 07:45:32,742][635776] Updated weights for policy 0, policy_version 149680 (0.0005) [2023-03-09 07:45:35,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10376.5, 300 sec: 10344.1). Total num frames: 76656640. Throughput: 0: 10339.6. Samples: 76657156. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:45:35,122][635489] Avg episode reward: [(0, '4538.188')] [2023-03-09 07:45:37,013][635776] Updated weights for policy 0, policy_version 149760 (0.0005) [2023-03-09 07:45:40,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10344.1). Total num frames: 76709888. Throughput: 0: 10286.7. Samples: 76685320. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:45:40,122][635489] Avg episode reward: [(0, '4441.739')] [2023-03-09 07:45:40,906][635776] Updated weights for policy 0, policy_version 149840 (0.0004) [2023-03-09 07:45:44,694][635776] Updated weights for policy 0, policy_version 149920 (0.0005) [2023-03-09 07:45:45,122][635489] Fps is (10 sec: 10649.4, 60 sec: 10376.5, 300 sec: 10358.0). Total num frames: 76763136. Throughput: 0: 10314.8. Samples: 76750856. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:45:45,122][635489] Avg episode reward: [(0, '4511.912')] [2023-03-09 07:45:45,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000149928_76763136.pth... [2023-03-09 07:45:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000149328_76455936.pth [2023-03-09 07:45:48,775][635776] Updated weights for policy 0, policy_version 150000 (0.0005) [2023-03-09 07:45:50,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10308.3, 300 sec: 10358.0). Total num frames: 76812288. Throughput: 0: 10233.2. Samples: 76811532. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:45:50,122][635489] Avg episode reward: [(0, '4501.819')] [2023-03-09 07:45:52,991][635776] Updated weights for policy 0, policy_version 150080 (0.0005) [2023-03-09 07:45:55,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 10344.1). Total num frames: 76861440. Throughput: 0: 10191.6. Samples: 76840892. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:45:55,122][635489] Avg episode reward: [(0, '4539.479')] [2023-03-09 07:45:57,146][635776] Updated weights for policy 0, policy_version 150160 (0.0005) [2023-03-09 07:46:00,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10171.7, 300 sec: 10344.1). Total num frames: 76910592. Throughput: 0: 10015.7. Samples: 76898392. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:46:00,122][635489] Avg episode reward: [(0, '4586.121')] [2023-03-09 07:46:00,127][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000150216_76910592.pth... [2023-03-09 07:46:00,131][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000149624_76607488.pth [2023-03-09 07:46:01,186][635776] Updated weights for policy 0, policy_version 150240 (0.0005) [2023-03-09 07:46:04,944][635776] Updated weights for policy 0, policy_version 150320 (0.0006) [2023-03-09 07:46:05,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10344.1). Total num frames: 76963840. Throughput: 0: 10097.0. Samples: 76963768. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:46:05,122][635489] Avg episode reward: [(0, '4582.590')] [2023-03-09 07:46:08,950][635776] Updated weights for policy 0, policy_version 150400 (0.0004) [2023-03-09 07:46:10,122][635489] Fps is (10 sec: 10649.7, 60 sec: 10240.0, 300 sec: 10344.1). Total num frames: 77017088. Throughput: 0: 10142.0. Samples: 76995036. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:46:10,122][635489] Avg episode reward: [(0, '4583.106')] [2023-03-09 07:46:12,891][635776] Updated weights for policy 0, policy_version 150480 (0.0005) [2023-03-09 07:46:15,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10344.1). Total num frames: 77066240. Throughput: 0: 10212.4. Samples: 77056948. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:46:15,122][635489] Avg episode reward: [(0, '4554.893')] [2023-03-09 07:46:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000150520_77066240.pth... [2023-03-09 07:46:15,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000149928_76763136.pth [2023-03-09 07:46:16,823][635776] Updated weights for policy 0, policy_version 150560 (0.0005) [2023-03-09 07:46:20,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10344.1). Total num frames: 77119488. Throughput: 0: 10272.6. Samples: 77119424. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:46:20,122][635489] Avg episode reward: [(0, '4538.108')] [2023-03-09 07:46:20,766][635776] Updated weights for policy 0, policy_version 150640 (0.0005) [2023-03-09 07:46:24,614][635776] Updated weights for policy 0, policy_version 150720 (0.0005) [2023-03-09 07:46:25,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10240.0, 300 sec: 10330.2). Total num frames: 77172736. Throughput: 0: 10332.4. Samples: 77150280. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:46:25,122][635489] Avg episode reward: [(0, '4587.964')] [2023-03-09 07:46:28,621][635776] Updated weights for policy 0, policy_version 150800 (0.0005) [2023-03-09 07:46:30,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10316.4). Total num frames: 77221888. Throughput: 0: 10272.3. Samples: 77213108. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:46:30,122][635489] Avg episode reward: [(0, '4587.070')] [2023-03-09 07:46:30,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000150824_77221888.pth... [2023-03-09 07:46:30,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000150216_76910592.pth [2023-03-09 07:46:32,358][635776] Updated weights for policy 0, policy_version 150880 (0.0005) [2023-03-09 07:46:35,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10344.1). Total num frames: 77279232. Throughput: 0: 10383.4. Samples: 77278784. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:46:35,122][635489] Avg episode reward: [(0, '4590.746')] [2023-03-09 07:46:36,164][635776] Updated weights for policy 0, policy_version 150960 (0.0005) [2023-03-09 07:46:39,974][635776] Updated weights for policy 0, policy_version 151040 (0.0004) [2023-03-09 07:46:40,122][635489] Fps is (10 sec: 11059.1, 60 sec: 10376.5, 300 sec: 10344.1). Total num frames: 77332480. Throughput: 0: 10415.9. Samples: 77309608. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:46:40,122][635489] Avg episode reward: [(0, '4543.320')] [2023-03-09 07:46:44,117][635776] Updated weights for policy 0, policy_version 151120 (0.0005) [2023-03-09 07:46:45,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 10330.3). Total num frames: 77381632. Throughput: 0: 10515.5. Samples: 77371588. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:46:45,122][635489] Avg episode reward: [(0, '4597.120')] [2023-03-09 07:46:45,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000151136_77381632.pth... [2023-03-09 07:46:45,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000150520_77066240.pth [2023-03-09 07:46:48,195][635776] Updated weights for policy 0, policy_version 151200 (0.0005) [2023-03-09 07:46:50,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10308.3, 300 sec: 10330.3). Total num frames: 77430784. Throughput: 0: 10402.7. Samples: 77431888. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:46:50,122][635489] Avg episode reward: [(0, '4592.040')] [2023-03-09 07:46:52,104][635776] Updated weights for policy 0, policy_version 151280 (0.0005) [2023-03-09 07:46:55,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 10344.1). Total num frames: 77484032. Throughput: 0: 10405.6. Samples: 77463288. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:46:55,122][635489] Avg episode reward: [(0, '4559.781')] [2023-03-09 07:46:56,088][635776] Updated weights for policy 0, policy_version 151360 (0.0005) [2023-03-09 07:47:00,080][635776] Updated weights for policy 0, policy_version 151440 (0.0005) [2023-03-09 07:47:00,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10344.1). Total num frames: 77537280. Throughput: 0: 10422.4. Samples: 77525956. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:47:00,122][635489] Avg episode reward: [(0, '4594.994')] [2023-03-09 07:47:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000151440_77537280.pth... [2023-03-09 07:47:00,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000150824_77221888.pth [2023-03-09 07:47:04,185][635776] Updated weights for policy 0, policy_version 151520 (0.0005) [2023-03-09 07:47:05,122][635489] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10330.3). Total num frames: 77586432. Throughput: 0: 10373.0. Samples: 77586208. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:47:05,122][635489] Avg episode reward: [(0, '4536.499')] [2023-03-09 07:47:07,983][635776] Updated weights for policy 0, policy_version 151600 (0.0005) [2023-03-09 07:47:10,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10344.1). Total num frames: 77639680. Throughput: 0: 10387.9. Samples: 77617736. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:47:10,122][635489] Avg episode reward: [(0, '4446.779')] [2023-03-09 07:47:11,862][635776] Updated weights for policy 0, policy_version 151680 (0.0005) [2023-03-09 07:47:15,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 10330.2). Total num frames: 77688832. Throughput: 0: 10374.7. Samples: 77679972. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 07:47:15,122][635489] Avg episode reward: [(0, '4494.309')] [2023-03-09 07:47:15,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000151736_77688832.pth... [2023-03-09 07:47:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000151136_77381632.pth [2023-03-09 07:47:16,021][635776] Updated weights for policy 0, policy_version 151760 (0.0005) [2023-03-09 07:47:20,026][635776] Updated weights for policy 0, policy_version 151840 (0.0005) [2023-03-09 07:47:20,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10376.6, 300 sec: 10330.3). Total num frames: 77742080. Throughput: 0: 10285.2. Samples: 77741616. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 07:47:20,122][635489] Avg episode reward: [(0, '4605.373')] [2023-03-09 07:47:24,067][635776] Updated weights for policy 0, policy_version 151920 (0.0005) [2023-03-09 07:47:25,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10330.2). Total num frames: 77791232. Throughput: 0: 10277.3. Samples: 77772088. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 07:47:25,122][635489] Avg episode reward: [(0, '4446.056')] [2023-03-09 07:47:27,868][635776] Updated weights for policy 0, policy_version 152000 (0.0005) [2023-03-09 07:47:30,122][635489] Fps is (10 sec: 10239.8, 60 sec: 10376.5, 300 sec: 10330.2). Total num frames: 77844480. Throughput: 0: 10284.3. Samples: 77834384. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 07:47:30,122][635489] Avg episode reward: [(0, '4433.911')] [2023-03-09 07:47:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000152040_77844480.pth... [2023-03-09 07:47:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000151440_77537280.pth [2023-03-09 07:47:31,979][635776] Updated weights for policy 0, policy_version 152080 (0.0005) [2023-03-09 07:47:35,121][635489] Fps is (10 sec: 10649.8, 60 sec: 10308.3, 300 sec: 10344.1). Total num frames: 77897728. Throughput: 0: 10335.3. Samples: 77896976. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 07:47:35,122][635489] Avg episode reward: [(0, '4604.477')] [2023-03-09 07:47:35,713][635776] Updated weights for policy 0, policy_version 152160 (0.0005) [2023-03-09 07:47:39,668][635776] Updated weights for policy 0, policy_version 152240 (0.0005) [2023-03-09 07:47:40,122][635489] Fps is (10 sec: 10649.7, 60 sec: 10308.3, 300 sec: 10330.3). Total num frames: 77950976. Throughput: 0: 10306.0. Samples: 77927056. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 07:47:40,122][635489] Avg episode reward: [(0, '4596.511')] [2023-03-09 07:47:43,606][635776] Updated weights for policy 0, policy_version 152320 (0.0005) [2023-03-09 07:47:45,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10316.4). Total num frames: 78000128. Throughput: 0: 10349.3. Samples: 77991672. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 07:47:45,122][635489] Avg episode reward: [(0, '4601.869')] [2023-03-09 07:47:45,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000152344_78000128.pth... [2023-03-09 07:47:45,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000151736_77688832.pth [2023-03-09 07:47:47,625][635776] Updated weights for policy 0, policy_version 152400 (0.0005) [2023-03-09 07:47:50,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10330.3). Total num frames: 78053376. Throughput: 0: 10382.9. Samples: 78053440. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 07:47:50,122][635489] Avg episode reward: [(0, '4599.994')] [2023-03-09 07:47:51,502][635776] Updated weights for policy 0, policy_version 152480 (0.0005) [2023-03-09 07:47:55,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10308.3, 300 sec: 10302.5). Total num frames: 78102528. Throughput: 0: 10346.7. Samples: 78083340. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 07:47:55,122][635489] Avg episode reward: [(0, '4594.288')] [2023-03-09 07:47:55,711][635776] Updated weights for policy 0, policy_version 152560 (0.0005) [2023-03-09 07:47:59,921][635776] Updated weights for policy 0, policy_version 152640 (0.0005) [2023-03-09 07:48:00,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10240.0, 300 sec: 10302.5). Total num frames: 78151680. Throughput: 0: 10247.6. Samples: 78141112. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 07:48:00,122][635489] Avg episode reward: [(0, '4598.456')] [2023-03-09 07:48:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000152640_78151680.pth... [2023-03-09 07:48:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000152040_77844480.pth [2023-03-09 07:48:04,055][635776] Updated weights for policy 0, policy_version 152720 (0.0005) [2023-03-09 07:48:05,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10240.0, 300 sec: 10288.6). Total num frames: 78200832. Throughput: 0: 10204.1. Samples: 78200800. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 07:48:05,122][635489] Avg episode reward: [(0, '4590.441')] [2023-03-09 07:48:08,038][635776] Updated weights for policy 0, policy_version 152800 (0.0005) [2023-03-09 07:48:10,122][635489] Fps is (10 sec: 9830.5, 60 sec: 10171.7, 300 sec: 10274.7). Total num frames: 78249984. Throughput: 0: 10236.8. Samples: 78232744. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 07:48:10,122][635489] Avg episode reward: [(0, '4525.951')] [2023-03-09 07:48:12,263][635776] Updated weights for policy 0, policy_version 152880 (0.0005) [2023-03-09 07:48:15,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10288.6). Total num frames: 78303232. Throughput: 0: 10160.4. Samples: 78291600. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 07:48:15,122][635489] Avg episode reward: [(0, '4443.586')] [2023-03-09 07:48:15,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000152936_78303232.pth... [2023-03-09 07:48:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000152344_78000128.pth [2023-03-09 07:48:16,352][635776] Updated weights for policy 0, policy_version 152960 (0.0005) [2023-03-09 07:48:20,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10274.7). Total num frames: 78352384. Throughput: 0: 10113.5. Samples: 78352084. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:48:20,122][635489] Avg episode reward: [(0, '4523.951')] [2023-03-09 07:48:20,376][635776] Updated weights for policy 0, policy_version 153040 (0.0005) [2023-03-09 07:48:24,595][635776] Updated weights for policy 0, policy_version 153120 (0.0006) [2023-03-09 07:48:25,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10260.8). Total num frames: 78401536. Throughput: 0: 10091.1. Samples: 78381156. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:48:25,122][635489] Avg episode reward: [(0, '4451.978')] [2023-03-09 07:48:28,483][635776] Updated weights for policy 0, policy_version 153200 (0.0005) [2023-03-09 07:48:30,122][635489] Fps is (10 sec: 10239.8, 60 sec: 10171.7, 300 sec: 10260.8). Total num frames: 78454784. Throughput: 0: 10018.4. Samples: 78442504. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:48:30,122][635489] Avg episode reward: [(0, '4582.905')] [2023-03-09 07:48:30,127][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000153232_78454784.pth... [2023-03-09 07:48:30,130][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000152640_78151680.pth [2023-03-09 07:48:32,593][635776] Updated weights for policy 0, policy_version 153280 (0.0005) [2023-03-09 07:48:35,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10233.1). Total num frames: 78499840. Throughput: 0: 9958.7. Samples: 78501584. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:48:35,122][635489] Avg episode reward: [(0, '4541.502')] [2023-03-09 07:48:36,860][635776] Updated weights for policy 0, policy_version 153360 (0.0004) [2023-03-09 07:48:40,122][635489] Fps is (10 sec: 9420.9, 60 sec: 9966.9, 300 sec: 10233.1). Total num frames: 78548992. Throughput: 0: 9917.2. Samples: 78529616. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:48:40,122][635489] Avg episode reward: [(0, '4438.563')] [2023-03-09 07:48:41,113][635776] Updated weights for policy 0, policy_version 153440 (0.0004) [2023-03-09 07:48:45,122][635489] Fps is (10 sec: 9830.3, 60 sec: 9966.9, 300 sec: 10219.2). Total num frames: 78598144. Throughput: 0: 9944.6. Samples: 78588620. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:48:45,122][635489] Avg episode reward: [(0, '4407.803')] [2023-03-09 07:48:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000153512_78598144.pth... [2023-03-09 07:48:45,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000152936_78303232.pth [2023-03-09 07:48:45,258][635776] Updated weights for policy 0, policy_version 153520 (0.0004) [2023-03-09 07:48:49,489][635776] Updated weights for policy 0, policy_version 153600 (0.0005) [2023-03-09 07:48:50,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10219.2). Total num frames: 78647296. Throughput: 0: 9922.1. Samples: 78647296. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:48:50,122][635489] Avg episode reward: [(0, '4488.138')] [2023-03-09 07:48:53,766][635776] Updated weights for policy 0, policy_version 153680 (0.0005) [2023-03-09 07:48:55,122][635489] Fps is (10 sec: 9830.5, 60 sec: 9898.7, 300 sec: 10205.3). Total num frames: 78696448. Throughput: 0: 9847.6. Samples: 78675888. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:48:55,122][635489] Avg episode reward: [(0, '4587.905')] [2023-03-09 07:48:57,778][635776] Updated weights for policy 0, policy_version 153760 (0.0005) [2023-03-09 07:49:00,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10205.3). Total num frames: 78745600. Throughput: 0: 9896.9. Samples: 78736960. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:49:00,122][635489] Avg episode reward: [(0, '4567.138')] [2023-03-09 07:49:00,157][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000153808_78749696.pth... [2023-03-09 07:49:00,158][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000153232_78454784.pth [2023-03-09 07:49:01,788][635776] Updated weights for policy 0, policy_version 153840 (0.0005) [2023-03-09 07:49:05,122][635489] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 10219.2). Total num frames: 78798848. Throughput: 0: 9928.3. Samples: 78798856. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:49:05,122][635489] Avg episode reward: [(0, '4560.017')] [2023-03-09 07:49:05,641][635776] Updated weights for policy 0, policy_version 153920 (0.0004) [2023-03-09 07:49:09,648][635776] Updated weights for policy 0, policy_version 154000 (0.0005) [2023-03-09 07:49:10,121][635489] Fps is (10 sec: 10649.7, 60 sec: 10035.2, 300 sec: 10233.1). Total num frames: 78852096. Throughput: 0: 9994.0. Samples: 78830884. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:49:10,122][635489] Avg episode reward: [(0, '4424.777')] [2023-03-09 07:49:13,453][635776] Updated weights for policy 0, policy_version 154080 (0.0005) [2023-03-09 07:49:15,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10035.2, 300 sec: 10246.9). Total num frames: 78905344. Throughput: 0: 10014.8. Samples: 78893168. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:49:15,122][635489] Avg episode reward: [(0, '4525.810')] [2023-03-09 07:49:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000154112_78905344.pth... [2023-03-09 07:49:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000153512_78598144.pth [2023-03-09 07:49:17,452][635776] Updated weights for policy 0, policy_version 154160 (0.0005) [2023-03-09 07:49:20,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10035.2, 300 sec: 10246.9). Total num frames: 78954496. Throughput: 0: 10061.8. Samples: 78954364. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:49:20,122][635489] Avg episode reward: [(0, '4523.938')] [2023-03-09 07:49:21,570][635776] Updated weights for policy 0, policy_version 154240 (0.0005) [2023-03-09 07:49:25,122][635489] Fps is (10 sec: 10240.1, 60 sec: 10103.5, 300 sec: 10246.9). Total num frames: 79007744. Throughput: 0: 10080.5. Samples: 78983240. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:49:25,122][635489] Avg episode reward: [(0, '4530.197')] [2023-03-09 07:49:25,267][635776] Updated weights for policy 0, policy_version 154320 (0.0005) [2023-03-09 07:49:29,330][635776] Updated weights for policy 0, policy_version 154400 (0.0005) [2023-03-09 07:49:30,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10103.5, 300 sec: 10260.8). Total num frames: 79060992. Throughput: 0: 10219.9. Samples: 79048516. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:49:30,122][635489] Avg episode reward: [(0, '4535.393')] [2023-03-09 07:49:30,127][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000154416_79060992.pth... [2023-03-09 07:49:30,130][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000153808_78749696.pth [2023-03-09 07:49:33,278][635776] Updated weights for policy 0, policy_version 154480 (0.0004) [2023-03-09 07:49:35,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10246.9). Total num frames: 79110144. Throughput: 0: 10286.9. Samples: 79110208. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:49:35,122][635489] Avg episode reward: [(0, '4429.805')] [2023-03-09 07:49:37,186][635776] Updated weights for policy 0, policy_version 154560 (0.0004) [2023-03-09 07:49:40,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10246.9). Total num frames: 79163392. Throughput: 0: 10373.5. Samples: 79142696. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:49:40,122][635489] Avg episode reward: [(0, '4589.089')] [2023-03-09 07:49:41,246][635776] Updated weights for policy 0, policy_version 154640 (0.0005) [2023-03-09 07:49:45,093][635776] Updated weights for policy 0, policy_version 154720 (0.0005) [2023-03-09 07:49:45,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10308.3, 300 sec: 10246.9). Total num frames: 79216640. Throughput: 0: 10405.1. Samples: 79205192. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:49:45,122][635489] Avg episode reward: [(0, '4515.863')] [2023-03-09 07:49:45,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000154720_79216640.pth... [2023-03-09 07:49:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000154112_78905344.pth [2023-03-09 07:49:49,019][635776] Updated weights for policy 0, policy_version 154800 (0.0005) [2023-03-09 07:49:50,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10233.1). Total num frames: 79265792. Throughput: 0: 10378.6. Samples: 79265892. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:49:50,122][635489] Avg episode reward: [(0, '4509.877')] [2023-03-09 07:49:53,016][635776] Updated weights for policy 0, policy_version 154880 (0.0004) [2023-03-09 07:49:55,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10233.1). Total num frames: 79319040. Throughput: 0: 10378.9. Samples: 79297936. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:49:55,122][635489] Avg episode reward: [(0, '4481.077')] [2023-03-09 07:49:57,145][635776] Updated weights for policy 0, policy_version 154960 (0.0005) [2023-03-09 07:50:00,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10233.1). Total num frames: 79368192. Throughput: 0: 10322.3. Samples: 79357672. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:50:00,122][635489] Avg episode reward: [(0, '4493.931')] [2023-03-09 07:50:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000155016_79368192.pth... [2023-03-09 07:50:00,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000154416_79060992.pth [2023-03-09 07:50:01,217][635776] Updated weights for policy 0, policy_version 155040 (0.0004) [2023-03-09 07:50:05,122][635489] Fps is (10 sec: 9830.6, 60 sec: 10308.3, 300 sec: 10219.2). Total num frames: 79417344. Throughput: 0: 10298.1. Samples: 79417776. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:50:05,122][635489] Avg episode reward: [(0, '4448.113')] [2023-03-09 07:50:05,251][635776] Updated weights for policy 0, policy_version 155120 (0.0005) [2023-03-09 07:50:09,306][635776] Updated weights for policy 0, policy_version 155200 (0.0005) [2023-03-09 07:50:10,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10308.2, 300 sec: 10219.2). Total num frames: 79470592. Throughput: 0: 10328.5. Samples: 79448024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:50:10,122][635489] Avg episode reward: [(0, '4174.527')] [2023-03-09 07:50:13,148][635776] Updated weights for policy 0, policy_version 155280 (0.0005) [2023-03-09 07:50:15,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10205.3). Total num frames: 79519744. Throughput: 0: 10278.0. Samples: 79511024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:50:15,122][635489] Avg episode reward: [(0, '4425.447')] [2023-03-09 07:50:15,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000155312_79519744.pth... [2023-03-09 07:50:15,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000154720_79216640.pth [2023-03-09 07:50:17,017][635776] Updated weights for policy 0, policy_version 155360 (0.0005) [2023-03-09 07:50:20,122][635489] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 10219.2). Total num frames: 79572992. Throughput: 0: 10260.4. Samples: 79571924. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:50:20,122][635489] Avg episode reward: [(0, '4577.743')] [2023-03-09 07:50:21,247][635776] Updated weights for policy 0, policy_version 155440 (0.0004) [2023-03-09 07:50:25,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10219.2). Total num frames: 79622144. Throughput: 0: 10194.7. Samples: 79601456. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:50:25,122][635489] Avg episode reward: [(0, '4574.106')] [2023-03-09 07:50:25,447][635776] Updated weights for policy 0, policy_version 155520 (0.0005) [2023-03-09 07:50:29,434][635776] Updated weights for policy 0, policy_version 155600 (0.0005) [2023-03-09 07:50:30,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10171.7, 300 sec: 10219.2). Total num frames: 79671296. Throughput: 0: 10174.3. Samples: 79663036. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:50:30,122][635489] Avg episode reward: [(0, '4487.229')] [2023-03-09 07:50:30,165][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000155616_79675392.pth... [2023-03-09 07:50:30,167][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000155016_79368192.pth [2023-03-09 07:50:33,304][635776] Updated weights for policy 0, policy_version 155680 (0.0005) [2023-03-09 07:50:35,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10219.2). Total num frames: 79724544. Throughput: 0: 10195.3. Samples: 79724680. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:50:35,122][635489] Avg episode reward: [(0, '4484.458')] [2023-03-09 07:50:37,317][635776] Updated weights for policy 0, policy_version 155760 (0.0004) [2023-03-09 07:50:40,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10240.0, 300 sec: 10219.2). Total num frames: 79777792. Throughput: 0: 10155.0. Samples: 79754912. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:50:40,122][635489] Avg episode reward: [(0, '4575.548')] [2023-03-09 07:50:41,322][635776] Updated weights for policy 0, policy_version 155840 (0.0005) [2023-03-09 07:50:45,053][635776] Updated weights for policy 0, policy_version 155920 (0.0005) [2023-03-09 07:50:45,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10240.0, 300 sec: 10233.1). Total num frames: 79831040. Throughput: 0: 10198.1. Samples: 79816588. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:50:45,122][635489] Avg episode reward: [(0, '4453.201')] [2023-03-09 07:50:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000155920_79831040.pth... [2023-03-09 07:50:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000155312_79519744.pth [2023-03-09 07:50:49,096][635776] Updated weights for policy 0, policy_version 156000 (0.0005) [2023-03-09 07:50:50,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10233.1). Total num frames: 79880192. Throughput: 0: 10270.6. Samples: 79879952. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:50:50,122][635489] Avg episode reward: [(0, '4524.515')] [2023-03-09 07:50:53,125][635776] Updated weights for policy 0, policy_version 156080 (0.0005) [2023-03-09 07:50:55,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10246.9). Total num frames: 79933440. Throughput: 0: 10290.9. Samples: 79911116. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:50:55,122][635489] Avg episode reward: [(0, '4572.446')] [2023-03-09 07:50:56,911][635776] Updated weights for policy 0, policy_version 156160 (0.0005) [2023-03-09 07:51:00,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 10246.9). Total num frames: 79986688. Throughput: 0: 10298.7. Samples: 79974464. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:51:00,122][635489] Avg episode reward: [(0, '4562.895')] [2023-03-09 07:51:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000156224_79986688.pth... [2023-03-09 07:51:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000155616_79675392.pth [2023-03-09 07:51:00,879][635776] Updated weights for policy 0, policy_version 156240 (0.0005) [2023-03-09 07:51:04,908][635776] Updated weights for policy 0, policy_version 156320 (0.0005) [2023-03-09 07:51:05,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10233.1). Total num frames: 80035840. Throughput: 0: 10307.6. Samples: 80035768. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:51:05,122][635489] Avg episode reward: [(0, '4615.713')] [2023-03-09 07:51:08,935][635776] Updated weights for policy 0, policy_version 156400 (0.0005) [2023-03-09 07:51:10,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10240.0, 300 sec: 10233.1). Total num frames: 80084992. Throughput: 0: 10322.6. Samples: 80065972. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:51:10,122][635489] Avg episode reward: [(0, '4583.554')] [2023-03-09 07:51:13,121][635776] Updated weights for policy 0, policy_version 156480 (0.0005) [2023-03-09 07:51:15,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10240.0, 300 sec: 10219.2). Total num frames: 80134144. Throughput: 0: 10287.0. Samples: 80125952. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:51:15,122][635489] Avg episode reward: [(0, '4446.763')] [2023-03-09 07:51:15,149][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000156520_80138240.pth... [2023-03-09 07:51:15,151][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000155920_79831040.pth [2023-03-09 07:51:17,031][635776] Updated weights for policy 0, policy_version 156560 (0.0005) [2023-03-09 07:51:20,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10219.2). Total num frames: 80187392. Throughput: 0: 10283.9. Samples: 80187456. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:51:20,122][635489] Avg episode reward: [(0, '4554.471')] [2023-03-09 07:51:21,141][635776] Updated weights for policy 0, policy_version 156640 (0.0006) [2023-03-09 07:51:25,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10219.2). Total num frames: 80236544. Throughput: 0: 10252.9. Samples: 80216292. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:51:25,122][635489] Avg episode reward: [(0, '4469.340')] [2023-03-09 07:51:25,258][635776] Updated weights for policy 0, policy_version 156720 (0.0005) [2023-03-09 07:51:29,369][635776] Updated weights for policy 0, policy_version 156800 (0.0005) [2023-03-09 07:51:30,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 10191.4). Total num frames: 80285696. Throughput: 0: 10242.6. Samples: 80277504. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 07:51:30,122][635489] Avg episode reward: [(0, '4505.723')] [2023-03-09 07:51:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000156808_80285696.pth... [2023-03-09 07:51:30,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000156224_79986688.pth [2023-03-09 07:51:33,409][635776] Updated weights for policy 0, policy_version 156880 (0.0005) [2023-03-09 07:51:35,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10191.4). Total num frames: 80338944. Throughput: 0: 10178.8. Samples: 80338000. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 07:51:35,122][635489] Avg episode reward: [(0, '4552.078')] [2023-03-09 07:51:37,530][635776] Updated weights for policy 0, policy_version 156960 (0.0005) [2023-03-09 07:51:40,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10240.0, 300 sec: 10205.3). Total num frames: 80392192. Throughput: 0: 10144.6. Samples: 80367624. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 07:51:40,122][635489] Avg episode reward: [(0, '4369.030')] [2023-03-09 07:51:41,084][635776] Updated weights for policy 0, policy_version 157040 (0.0005) [2023-03-09 07:51:45,030][635776] Updated weights for policy 0, policy_version 157120 (0.0005) [2023-03-09 07:51:45,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10240.0, 300 sec: 10219.2). Total num frames: 80445440. Throughput: 0: 10194.7. Samples: 80433224. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 07:51:45,122][635489] Avg episode reward: [(0, '4471.684')] [2023-03-09 07:51:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000157120_80445440.pth... [2023-03-09 07:51:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000156520_80138240.pth [2023-03-09 07:51:49,128][635776] Updated weights for policy 0, policy_version 157200 (0.0005) [2023-03-09 07:51:50,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10205.3). Total num frames: 80494592. Throughput: 0: 10195.0. Samples: 80494544. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 07:51:50,122][635489] Avg episode reward: [(0, '4515.734')] [2023-03-09 07:51:52,863][635776] Updated weights for policy 0, policy_version 157280 (0.0005) [2023-03-09 07:51:55,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10205.3). Total num frames: 80547840. Throughput: 0: 10254.7. Samples: 80527432. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 07:51:55,122][635489] Avg episode reward: [(0, '4544.064')] [2023-03-09 07:51:57,086][635776] Updated weights for policy 0, policy_version 157360 (0.0005) [2023-03-09 07:52:00,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10205.3). Total num frames: 80596992. Throughput: 0: 10194.7. Samples: 80584712. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 07:52:00,122][635489] Avg episode reward: [(0, '4323.662')] [2023-03-09 07:52:00,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000157416_80596992.pth... [2023-03-09 07:52:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000156808_80285696.pth [2023-03-09 07:52:01,439][635776] Updated weights for policy 0, policy_version 157440 (0.0005) [2023-03-09 07:52:05,121][635489] Fps is (10 sec: 9830.6, 60 sec: 10171.8, 300 sec: 10191.4). Total num frames: 80646144. Throughput: 0: 10146.4. Samples: 80644044. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 07:52:05,122][635489] Avg episode reward: [(0, '4430.586')] [2023-03-09 07:52:05,534][635776] Updated weights for policy 0, policy_version 157520 (0.0005) [2023-03-09 07:52:09,719][635776] Updated weights for policy 0, policy_version 157600 (0.0004) [2023-03-09 07:52:10,122][635489] Fps is (10 sec: 9420.8, 60 sec: 10103.5, 300 sec: 10177.5). Total num frames: 80691200. Throughput: 0: 10157.7. Samples: 80673388. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 07:52:10,122][635489] Avg episode reward: [(0, '4410.069')] [2023-03-09 07:52:13,816][635776] Updated weights for policy 0, policy_version 157680 (0.0004) [2023-03-09 07:52:15,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10171.7, 300 sec: 10177.5). Total num frames: 80744448. Throughput: 0: 10114.1. Samples: 80732640. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 07:52:15,122][635489] Avg episode reward: [(0, '4405.534')] [2023-03-09 07:52:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000157704_80744448.pth... [2023-03-09 07:52:15,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000157120_80445440.pth [2023-03-09 07:52:17,914][635776] Updated weights for policy 0, policy_version 157760 (0.0004) [2023-03-09 07:52:20,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10177.5). Total num frames: 80793600. Throughput: 0: 10124.3. Samples: 80793592. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 07:52:20,122][635489] Avg episode reward: [(0, '4581.565')] [2023-03-09 07:52:21,886][635776] Updated weights for policy 0, policy_version 157840 (0.0004) [2023-03-09 07:52:25,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10163.6). Total num frames: 80842752. Throughput: 0: 10152.8. Samples: 80824500. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 07:52:25,122][635489] Avg episode reward: [(0, '4414.084')] [2023-03-09 07:52:25,995][635776] Updated weights for policy 0, policy_version 157920 (0.0005) [2023-03-09 07:52:29,823][635776] Updated weights for policy 0, policy_version 158000 (0.0005) [2023-03-09 07:52:30,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10163.6). Total num frames: 80896000. Throughput: 0: 10087.2. Samples: 80887148. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 07:52:30,122][635489] Avg episode reward: [(0, '4460.887')] [2023-03-09 07:52:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000158000_80896000.pth... [2023-03-09 07:52:30,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000157416_80596992.pth [2023-03-09 07:52:33,920][635776] Updated weights for policy 0, policy_version 158080 (0.0005) [2023-03-09 07:52:35,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10149.8). Total num frames: 80945152. Throughput: 0: 10025.3. Samples: 80945684. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 07:52:35,122][635489] Avg episode reward: [(0, '4366.804')] [2023-03-09 07:52:38,124][635776] Updated weights for policy 0, policy_version 158160 (0.0004) [2023-03-09 07:52:40,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10035.2, 300 sec: 10149.7). Total num frames: 80994304. Throughput: 0: 9983.5. Samples: 80976688. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 07:52:40,122][635489] Avg episode reward: [(0, '4543.929')] [2023-03-09 07:52:42,305][635776] Updated weights for policy 0, policy_version 158240 (0.0004) [2023-03-09 07:52:45,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10149.7). Total num frames: 81047552. Throughput: 0: 10011.0. Samples: 81035204. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 07:52:45,122][635489] Avg episode reward: [(0, '4459.556')] [2023-03-09 07:52:45,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000158296_81047552.pth... [2023-03-09 07:52:45,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000157704_80744448.pth [2023-03-09 07:52:46,245][635776] Updated weights for policy 0, policy_version 158320 (0.0004) [2023-03-09 07:52:50,090][635776] Updated weights for policy 0, policy_version 158400 (0.0005) [2023-03-09 07:52:50,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10103.5, 300 sec: 10163.6). Total num frames: 81100800. Throughput: 0: 10120.4. Samples: 81099464. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 07:52:50,122][635489] Avg episode reward: [(0, '4413.205')] [2023-03-09 07:52:53,948][635776] Updated weights for policy 0, policy_version 158480 (0.0005) [2023-03-09 07:52:55,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10163.6). Total num frames: 81149952. Throughput: 0: 10135.4. Samples: 81129480. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 07:52:55,122][635489] Avg episode reward: [(0, '4371.693')] [2023-03-09 07:52:58,023][635776] Updated weights for policy 0, policy_version 158560 (0.0004) [2023-03-09 07:53:00,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 10177.5). Total num frames: 81203200. Throughput: 0: 10187.8. Samples: 81191092. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 07:53:00,122][635489] Avg episode reward: [(0, '4540.519')] [2023-03-09 07:53:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000158600_81203200.pth... [2023-03-09 07:53:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000158000_80896000.pth [2023-03-09 07:53:02,113][635776] Updated weights for policy 0, policy_version 158640 (0.0005) [2023-03-09 07:53:05,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10103.4, 300 sec: 10177.5). Total num frames: 81252352. Throughput: 0: 10194.7. Samples: 81252352. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 07:53:05,122][635489] Avg episode reward: [(0, '4362.674')] [2023-03-09 07:53:06,108][635776] Updated weights for policy 0, policy_version 158720 (0.0005) [2023-03-09 07:53:10,097][635776] Updated weights for policy 0, policy_version 158800 (0.0005) [2023-03-09 07:53:10,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10177.5). Total num frames: 81305600. Throughput: 0: 10186.8. Samples: 81282908. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 07:53:10,122][635489] Avg episode reward: [(0, '4406.728')] [2023-03-09 07:53:14,151][635776] Updated weights for policy 0, policy_version 158880 (0.0005) [2023-03-09 07:53:15,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10171.7, 300 sec: 10177.5). Total num frames: 81354752. Throughput: 0: 10162.7. Samples: 81344468. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 07:53:15,122][635489] Avg episode reward: [(0, '4446.145')] [2023-03-09 07:53:15,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000158896_81354752.pth... [2023-03-09 07:53:15,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000158296_81047552.pth [2023-03-09 07:53:18,299][635776] Updated weights for policy 0, policy_version 158960 (0.0005) [2023-03-09 07:53:20,122][635489] Fps is (10 sec: 9830.5, 60 sec: 10171.7, 300 sec: 10177.5). Total num frames: 81403904. Throughput: 0: 10171.4. Samples: 81403400. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 07:53:20,122][635489] Avg episode reward: [(0, '4368.167')] [2023-03-09 07:53:22,477][635776] Updated weights for policy 0, policy_version 159040 (0.0005) [2023-03-09 07:53:25,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10163.6). Total num frames: 81453056. Throughput: 0: 10130.8. Samples: 81432576. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 07:53:25,122][635489] Avg episode reward: [(0, '4523.300')] [2023-03-09 07:53:26,611][635776] Updated weights for policy 0, policy_version 159120 (0.0004) [2023-03-09 07:53:30,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10177.5). Total num frames: 81502208. Throughput: 0: 10137.8. Samples: 81491408. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 07:53:30,122][635489] Avg episode reward: [(0, '4398.355')] [2023-03-09 07:53:30,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000159184_81502208.pth... [2023-03-09 07:53:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000158600_81203200.pth [2023-03-09 07:53:30,838][635776] Updated weights for policy 0, policy_version 159200 (0.0004) [2023-03-09 07:53:34,927][635776] Updated weights for policy 0, policy_version 159280 (0.0005) [2023-03-09 07:53:35,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10177.5). Total num frames: 81551360. Throughput: 0: 10038.9. Samples: 81551216. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 07:53:35,122][635489] Avg episode reward: [(0, '4207.357')] [2023-03-09 07:53:38,799][635776] Updated weights for policy 0, policy_version 159360 (0.0005) [2023-03-09 07:53:40,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10191.4). Total num frames: 81604608. Throughput: 0: 10094.5. Samples: 81583736. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 07:53:40,122][635489] Avg episode reward: [(0, '4280.113')] [2023-03-09 07:53:43,113][635776] Updated weights for policy 0, policy_version 159440 (0.0005) [2023-03-09 07:53:45,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10177.5). Total num frames: 81649664. Throughput: 0: 10007.7. Samples: 81641436. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 07:53:45,122][635489] Avg episode reward: [(0, '4356.923')] [2023-03-09 07:53:45,142][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000159480_81653760.pth... [2023-03-09 07:53:45,144][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000158896_81354752.pth [2023-03-09 07:53:47,118][635776] Updated weights for policy 0, policy_version 159520 (0.0005) [2023-03-09 07:53:50,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10191.4). Total num frames: 81702912. Throughput: 0: 9987.1. Samples: 81701772. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 07:53:50,122][635489] Avg episode reward: [(0, '4444.902')] [2023-03-09 07:53:51,365][635776] Updated weights for policy 0, policy_version 159600 (0.0005) [2023-03-09 07:53:55,121][635489] Fps is (10 sec: 10240.2, 60 sec: 10035.2, 300 sec: 10191.4). Total num frames: 81752064. Throughput: 0: 9953.6. Samples: 81730816. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 07:53:55,122][635489] Avg episode reward: [(0, '4402.066')] [2023-03-09 07:53:55,316][635776] Updated weights for policy 0, policy_version 159680 (0.0005) [2023-03-09 07:53:59,446][635776] Updated weights for policy 0, policy_version 159760 (0.0005) [2023-03-09 07:54:00,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 10177.5). Total num frames: 81801216. Throughput: 0: 9959.9. Samples: 81792664. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 07:54:00,122][635489] Avg episode reward: [(0, '4168.954')] [2023-03-09 07:54:00,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000159768_81801216.pth... [2023-03-09 07:54:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000159184_81502208.pth [2023-03-09 07:54:03,321][635776] Updated weights for policy 0, policy_version 159840 (0.0006) [2023-03-09 07:54:05,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10177.5). Total num frames: 81854464. Throughput: 0: 10032.6. Samples: 81854868. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 07:54:05,122][635489] Avg episode reward: [(0, '4071.752')] [2023-03-09 07:54:07,065][635776] Updated weights for policy 0, policy_version 159920 (0.0005) [2023-03-09 07:54:10,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10035.2, 300 sec: 10177.5). Total num frames: 81907712. Throughput: 0: 10107.7. Samples: 81887424. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 07:54:10,122][635489] Avg episode reward: [(0, '4241.858')] [2023-03-09 07:54:10,812][635776] Updated weights for policy 0, policy_version 160000 (0.0005) [2023-03-09 07:54:14,900][635776] Updated weights for policy 0, policy_version 160080 (0.0005) [2023-03-09 07:54:15,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10103.5, 300 sec: 10191.4). Total num frames: 81960960. Throughput: 0: 10176.8. Samples: 81949364. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 07:54:15,122][635489] Avg episode reward: [(0, '4474.251')] [2023-03-09 07:54:15,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000160080_81960960.pth... [2023-03-09 07:54:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000159480_81653760.pth [2023-03-09 07:54:18,795][635776] Updated weights for policy 0, policy_version 160160 (0.0005) [2023-03-09 07:54:20,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10171.7, 300 sec: 10191.4). Total num frames: 82014208. Throughput: 0: 10288.3. Samples: 82014188. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 07:54:20,122][635489] Avg episode reward: [(0, '4455.013')] [2023-03-09 07:54:22,827][635776] Updated weights for policy 0, policy_version 160240 (0.0004) [2023-03-09 07:54:25,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10177.5). Total num frames: 82063360. Throughput: 0: 10208.1. Samples: 82043100. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 07:54:25,122][635489] Avg episode reward: [(0, '4506.520')] [2023-03-09 07:54:26,977][635776] Updated weights for policy 0, policy_version 160320 (0.0005) [2023-03-09 07:54:30,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10191.4). Total num frames: 82116608. Throughput: 0: 10287.9. Samples: 82104392. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 07:54:30,122][635489] Avg episode reward: [(0, '4522.647')] [2023-03-09 07:54:30,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000160384_82116608.pth... [2023-03-09 07:54:30,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000159768_81801216.pth [2023-03-09 07:54:30,681][635776] Updated weights for policy 0, policy_version 160400 (0.0005) [2023-03-09 07:54:34,693][635776] Updated weights for policy 0, policy_version 160480 (0.0005) [2023-03-09 07:54:35,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 10191.4). Total num frames: 82169856. Throughput: 0: 10359.0. Samples: 82167928. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 07:54:35,122][635489] Avg episode reward: [(0, '4545.599')] [2023-03-09 07:54:38,749][635776] Updated weights for policy 0, policy_version 160560 (0.0005) [2023-03-09 07:54:40,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10177.5). Total num frames: 82219008. Throughput: 0: 10391.3. Samples: 82198428. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 07:54:40,122][635489] Avg episode reward: [(0, '4525.823')] [2023-03-09 07:54:42,746][635776] Updated weights for policy 0, policy_version 160640 (0.0005) [2023-03-09 07:54:45,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10308.3, 300 sec: 10177.5). Total num frames: 82268160. Throughput: 0: 10372.5. Samples: 82259428. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:54:45,122][635489] Avg episode reward: [(0, '4534.290')] [2023-03-09 07:54:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000160680_82268160.pth... [2023-03-09 07:54:45,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000160080_81960960.pth [2023-03-09 07:54:46,940][635776] Updated weights for policy 0, policy_version 160720 (0.0005) [2023-03-09 07:54:50,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10240.0, 300 sec: 10163.6). Total num frames: 82317312. Throughput: 0: 10278.1. Samples: 82317384. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:54:50,122][635489] Avg episode reward: [(0, '4532.626')] [2023-03-09 07:54:51,108][635776] Updated weights for policy 0, policy_version 160800 (0.0005) [2023-03-09 07:54:55,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10240.0, 300 sec: 10163.6). Total num frames: 82366464. Throughput: 0: 10222.7. Samples: 82347444. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:54:55,122][635489] Avg episode reward: [(0, '4504.357')] [2023-03-09 07:54:55,182][635776] Updated weights for policy 0, policy_version 160880 (0.0005) [2023-03-09 07:54:59,144][635776] Updated weights for policy 0, policy_version 160960 (0.0005) [2023-03-09 07:55:00,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10308.3, 300 sec: 10177.5). Total num frames: 82419712. Throughput: 0: 10220.3. Samples: 82409276. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:55:00,122][635489] Avg episode reward: [(0, '4485.505')] [2023-03-09 07:55:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000160976_82419712.pth... [2023-03-09 07:55:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000160384_82116608.pth [2023-03-09 07:55:02,980][635776] Updated weights for policy 0, policy_version 161040 (0.0005) [2023-03-09 07:55:05,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10308.2, 300 sec: 10177.5). Total num frames: 82472960. Throughput: 0: 10194.9. Samples: 82472960. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:55:05,122][635489] Avg episode reward: [(0, '4529.726')] [2023-03-09 07:55:06,896][635776] Updated weights for policy 0, policy_version 161120 (0.0005) [2023-03-09 07:55:10,121][635489] Fps is (10 sec: 10649.7, 60 sec: 10308.3, 300 sec: 10191.4). Total num frames: 82526208. Throughput: 0: 10242.1. Samples: 82503992. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:55:10,122][635489] Avg episode reward: [(0, '4440.707')] [2023-03-09 07:55:10,716][635776] Updated weights for policy 0, policy_version 161200 (0.0005) [2023-03-09 07:55:14,574][635776] Updated weights for policy 0, policy_version 161280 (0.0005) [2023-03-09 07:55:15,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 10191.4). Total num frames: 82579456. Throughput: 0: 10294.8. Samples: 82567660. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:55:15,122][635489] Avg episode reward: [(0, '4517.619')] [2023-03-09 07:55:15,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000161288_82579456.pth... [2023-03-09 07:55:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000160680_82268160.pth [2023-03-09 07:55:18,636][635776] Updated weights for policy 0, policy_version 161360 (0.0005) [2023-03-09 07:55:20,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10191.4). Total num frames: 82628608. Throughput: 0: 10259.8. Samples: 82629616. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:55:20,122][635489] Avg episode reward: [(0, '4544.042')] [2023-03-09 07:55:22,871][635776] Updated weights for policy 0, policy_version 161440 (0.0004) [2023-03-09 07:55:25,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 10191.4). Total num frames: 82677760. Throughput: 0: 10198.1. Samples: 82657344. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:55:25,122][635489] Avg episode reward: [(0, '4486.610')] [2023-03-09 07:55:26,974][635776] Updated weights for policy 0, policy_version 161520 (0.0005) [2023-03-09 07:55:30,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10191.4). Total num frames: 82731008. Throughput: 0: 10208.1. Samples: 82718792. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:55:30,122][635489] Avg episode reward: [(0, '4460.893')] [2023-03-09 07:55:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000161584_82731008.pth... [2023-03-09 07:55:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000160976_82419712.pth [2023-03-09 07:55:30,845][635776] Updated weights for policy 0, policy_version 161600 (0.0005) [2023-03-09 07:55:34,807][635776] Updated weights for policy 0, policy_version 161680 (0.0005) [2023-03-09 07:55:35,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10171.7, 300 sec: 10177.5). Total num frames: 82780160. Throughput: 0: 10297.4. Samples: 82780768. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:55:35,122][635489] Avg episode reward: [(0, '4546.376')] [2023-03-09 07:55:38,928][635776] Updated weights for policy 0, policy_version 161760 (0.0005) [2023-03-09 07:55:40,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10171.8, 300 sec: 10163.6). Total num frames: 82829312. Throughput: 0: 10254.7. Samples: 82808904. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:55:40,122][635489] Avg episode reward: [(0, '4532.036')] [2023-03-09 07:55:42,867][635776] Updated weights for policy 0, policy_version 161840 (0.0005) [2023-03-09 07:55:45,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10177.5). Total num frames: 82882560. Throughput: 0: 10279.5. Samples: 82871856. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:55:45,122][635489] Avg episode reward: [(0, '4519.356')] [2023-03-09 07:55:45,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000161880_82882560.pth... [2023-03-09 07:55:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000161288_82579456.pth [2023-03-09 07:55:46,941][635776] Updated weights for policy 0, policy_version 161920 (0.0005) [2023-03-09 07:55:50,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10163.6). Total num frames: 82931712. Throughput: 0: 10196.0. Samples: 82931776. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:55:50,122][635489] Avg episode reward: [(0, '4516.978')] [2023-03-09 07:55:51,056][635776] Updated weights for policy 0, policy_version 162000 (0.0004) [2023-03-09 07:55:55,075][635776] Updated weights for policy 0, policy_version 162080 (0.0005) [2023-03-09 07:55:55,122][635489] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 10163.6). Total num frames: 82984960. Throughput: 0: 10193.9. Samples: 82962720. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:55:55,122][635489] Avg episode reward: [(0, '4452.833')] [2023-03-09 07:55:59,165][635776] Updated weights for policy 0, policy_version 162160 (0.0004) [2023-03-09 07:56:00,121][635489] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10163.6). Total num frames: 83034112. Throughput: 0: 10136.0. Samples: 83023780. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:56:00,122][635489] Avg episode reward: [(0, '4520.659')] [2023-03-09 07:56:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000162176_83034112.pth... [2023-03-09 07:56:00,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000161584_82731008.pth [2023-03-09 07:56:03,316][635776] Updated weights for policy 0, policy_version 162240 (0.0005) [2023-03-09 07:56:05,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10163.6). Total num frames: 83083264. Throughput: 0: 10062.7. Samples: 83082440. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:56:05,122][635489] Avg episode reward: [(0, '4488.797')] [2023-03-09 07:56:07,368][635776] Updated weights for policy 0, policy_version 162320 (0.0005) [2023-03-09 07:56:10,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 10177.5). Total num frames: 83136512. Throughput: 0: 10139.8. Samples: 83113636. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:56:10,122][635489] Avg episode reward: [(0, '4445.786')] [2023-03-09 07:56:11,327][635776] Updated weights for policy 0, policy_version 162400 (0.0005) [2023-03-09 07:56:15,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10163.6). Total num frames: 83185664. Throughput: 0: 10118.7. Samples: 83174132. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:56:15,122][635489] Avg episode reward: [(0, '4521.874')] [2023-03-09 07:56:15,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000162472_83185664.pth... [2023-03-09 07:56:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000161880_82882560.pth [2023-03-09 07:56:15,302][635776] Updated weights for policy 0, policy_version 162480 (0.0005) [2023-03-09 07:56:19,069][635776] Updated weights for policy 0, policy_version 162560 (0.0005) [2023-03-09 07:56:20,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10177.5). Total num frames: 83238912. Throughput: 0: 10180.2. Samples: 83238876. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:56:20,122][635489] Avg episode reward: [(0, '4503.060')] [2023-03-09 07:56:23,112][635776] Updated weights for policy 0, policy_version 162640 (0.0005) [2023-03-09 07:56:25,121][635489] Fps is (10 sec: 10649.8, 60 sec: 10240.0, 300 sec: 10191.4). Total num frames: 83292160. Throughput: 0: 10232.6. Samples: 83269372. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:56:25,122][635489] Avg episode reward: [(0, '4533.722')] [2023-03-09 07:56:27,234][635776] Updated weights for policy 0, policy_version 162720 (0.0005) [2023-03-09 07:56:30,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10163.6). Total num frames: 83337216. Throughput: 0: 10158.3. Samples: 83328980. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:56:30,122][635489] Avg episode reward: [(0, '4548.930')] [2023-03-09 07:56:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000162768_83337216.pth... [2023-03-09 07:56:30,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000162176_83034112.pth [2023-03-09 07:56:31,553][635776] Updated weights for policy 0, policy_version 162800 (0.0005) [2023-03-09 07:56:35,122][635489] Fps is (10 sec: 9420.7, 60 sec: 10103.5, 300 sec: 10149.7). Total num frames: 83386368. Throughput: 0: 10065.2. Samples: 83384712. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:56:35,122][635489] Avg episode reward: [(0, '4572.389')] [2023-03-09 07:56:35,941][635776] Updated weights for policy 0, policy_version 162880 (0.0005) [2023-03-09 07:56:39,970][635776] Updated weights for policy 0, policy_version 162960 (0.0005) [2023-03-09 07:56:40,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10103.4, 300 sec: 10135.9). Total num frames: 83435520. Throughput: 0: 10031.6. Samples: 83414140. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:56:40,122][635489] Avg episode reward: [(0, '4565.074')] [2023-03-09 07:56:43,997][635776] Updated weights for policy 0, policy_version 163040 (0.0004) [2023-03-09 07:56:45,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10035.2, 300 sec: 10135.9). Total num frames: 83484672. Throughput: 0: 10056.9. Samples: 83476340. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:56:45,122][635489] Avg episode reward: [(0, '4584.117')] [2023-03-09 07:56:45,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000163056_83484672.pth... [2023-03-09 07:56:45,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000162472_83185664.pth [2023-03-09 07:56:47,928][635776] Updated weights for policy 0, policy_version 163120 (0.0005) [2023-03-09 07:56:50,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10103.5, 300 sec: 10135.9). Total num frames: 83537920. Throughput: 0: 10122.0. Samples: 83537928. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 07:56:50,122][635489] Avg episode reward: [(0, '4591.114')] [2023-03-09 07:56:51,840][635776] Updated weights for policy 0, policy_version 163200 (0.0005) [2023-03-09 07:56:55,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10103.5, 300 sec: 10149.7). Total num frames: 83591168. Throughput: 0: 10138.1. Samples: 83569852. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:56:55,122][635489] Avg episode reward: [(0, '4582.331')] [2023-03-09 07:56:55,758][635776] Updated weights for policy 0, policy_version 163280 (0.0005) [2023-03-09 07:56:59,596][635776] Updated weights for policy 0, policy_version 163360 (0.0005) [2023-03-09 07:57:00,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10171.7, 300 sec: 10163.6). Total num frames: 83644416. Throughput: 0: 10188.5. Samples: 83632616. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:57:00,132][635489] Avg episode reward: [(0, '4574.913')] [2023-03-09 07:57:00,136][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000163368_83644416.pth... [2023-03-09 07:57:00,138][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000162768_83337216.pth [2023-03-09 07:57:03,726][635776] Updated weights for policy 0, policy_version 163440 (0.0005) [2023-03-09 07:57:05,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10171.8, 300 sec: 10177.5). Total num frames: 83693568. Throughput: 0: 10104.3. Samples: 83693568. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:57:05,122][635489] Avg episode reward: [(0, '4555.597')] [2023-03-09 07:57:07,971][635776] Updated weights for policy 0, policy_version 163520 (0.0005) [2023-03-09 07:57:10,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10103.5, 300 sec: 10163.6). Total num frames: 83742720. Throughput: 0: 10062.8. Samples: 83722196. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:57:10,122][635489] Avg episode reward: [(0, '4520.352')] [2023-03-09 07:57:11,906][635776] Updated weights for policy 0, policy_version 163600 (0.0005) [2023-03-09 07:57:15,122][635489] Fps is (10 sec: 9830.2, 60 sec: 10103.5, 300 sec: 10163.6). Total num frames: 83791872. Throughput: 0: 10094.1. Samples: 83783216. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:57:15,122][635489] Avg episode reward: [(0, '4572.061')] [2023-03-09 07:57:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000163656_83791872.pth... [2023-03-09 07:57:15,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000163056_83484672.pth [2023-03-09 07:57:16,154][635776] Updated weights for policy 0, policy_version 163680 (0.0004) [2023-03-09 07:57:20,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10035.2, 300 sec: 10163.6). Total num frames: 83841024. Throughput: 0: 10128.1. Samples: 83840476. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:57:20,122][635489] Avg episode reward: [(0, '4371.612')] [2023-03-09 07:57:20,372][635776] Updated weights for policy 0, policy_version 163760 (0.0005) [2023-03-09 07:57:24,575][635776] Updated weights for policy 0, policy_version 163840 (0.0005) [2023-03-09 07:57:25,122][635489] Fps is (10 sec: 9830.5, 60 sec: 9966.9, 300 sec: 10149.7). Total num frames: 83890176. Throughput: 0: 10121.9. Samples: 83869624. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:57:25,122][635489] Avg episode reward: [(0, '4291.477')] [2023-03-09 07:57:28,684][635776] Updated weights for policy 0, policy_version 163920 (0.0005) [2023-03-09 07:57:30,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10149.7). Total num frames: 83939328. Throughput: 0: 10066.1. Samples: 83929316. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:57:30,122][635489] Avg episode reward: [(0, '4499.575')] [2023-03-09 07:57:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000163944_83939328.pth... [2023-03-09 07:57:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000163368_83644416.pth [2023-03-09 07:57:32,797][635776] Updated weights for policy 0, policy_version 164000 (0.0005) [2023-03-09 07:57:35,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10035.2, 300 sec: 10149.8). Total num frames: 83988480. Throughput: 0: 10042.7. Samples: 83989848. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:57:35,122][635489] Avg episode reward: [(0, '4573.334')] [2023-03-09 07:57:36,776][635776] Updated weights for policy 0, policy_version 164080 (0.0005) [2023-03-09 07:57:40,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10149.7). Total num frames: 84041728. Throughput: 0: 10030.3. Samples: 84021216. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:57:40,122][635489] Avg episode reward: [(0, '4571.047')] [2023-03-09 07:57:40,725][635776] Updated weights for policy 0, policy_version 164160 (0.0005) [2023-03-09 07:57:44,300][635776] Updated weights for policy 0, policy_version 164240 (0.0004) [2023-03-09 07:57:45,122][635489] Fps is (10 sec: 11059.1, 60 sec: 10240.0, 300 sec: 10163.6). Total num frames: 84099072. Throughput: 0: 10078.5. Samples: 84086148. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:57:45,122][635489] Avg episode reward: [(0, '4560.304')] [2023-03-09 07:57:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000164256_84099072.pth... [2023-03-09 07:57:45,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000163656_83791872.pth [2023-03-09 07:57:48,271][635776] Updated weights for policy 0, policy_version 164320 (0.0005) [2023-03-09 07:57:50,121][635489] Fps is (10 sec: 10649.7, 60 sec: 10171.7, 300 sec: 10163.6). Total num frames: 84148224. Throughput: 0: 10109.9. Samples: 84148512. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:57:50,122][635489] Avg episode reward: [(0, '4565.733')] [2023-03-09 07:57:52,198][635776] Updated weights for policy 0, policy_version 164400 (0.0004) [2023-03-09 07:57:55,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10163.6). Total num frames: 84201472. Throughput: 0: 10188.8. Samples: 84180692. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:57:55,122][635489] Avg episode reward: [(0, '4573.617')] [2023-03-09 07:57:56,368][635776] Updated weights for policy 0, policy_version 164480 (0.0005) [2023-03-09 07:58:00,122][635489] Fps is (10 sec: 10239.8, 60 sec: 10103.5, 300 sec: 10163.6). Total num frames: 84250624. Throughput: 0: 10115.2. Samples: 84238400. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:58:00,122][635489] Avg episode reward: [(0, '4577.902')] [2023-03-09 07:58:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000164552_84250624.pth... [2023-03-09 07:58:00,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000163944_83939328.pth [2023-03-09 07:58:00,496][635776] Updated weights for policy 0, policy_version 164560 (0.0005) [2023-03-09 07:58:04,881][635776] Updated weights for policy 0, policy_version 164640 (0.0004) [2023-03-09 07:58:05,122][635489] Fps is (10 sec: 9420.8, 60 sec: 10035.2, 300 sec: 10135.9). Total num frames: 84295680. Throughput: 0: 10115.8. Samples: 84295688. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:58:05,122][635489] Avg episode reward: [(0, '4582.542')] [2023-03-09 07:58:09,161][635776] Updated weights for policy 0, policy_version 164720 (0.0004) [2023-03-09 07:58:10,122][635489] Fps is (10 sec: 9420.8, 60 sec: 10035.2, 300 sec: 10135.9). Total num frames: 84344832. Throughput: 0: 10106.5. Samples: 84324416. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:58:10,122][635489] Avg episode reward: [(0, '4595.795')] [2023-03-09 07:58:13,506][635776] Updated weights for policy 0, policy_version 164800 (0.0004) [2023-03-09 07:58:15,122][635489] Fps is (10 sec: 9420.8, 60 sec: 9966.9, 300 sec: 10122.0). Total num frames: 84389888. Throughput: 0: 10052.8. Samples: 84381692. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:58:15,122][635489] Avg episode reward: [(0, '4566.802')] [2023-03-09 07:58:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000164824_84389888.pth... [2023-03-09 07:58:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000164256_84099072.pth [2023-03-09 07:58:17,617][635776] Updated weights for policy 0, policy_version 164880 (0.0005) [2023-03-09 07:58:20,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10035.2, 300 sec: 10135.9). Total num frames: 84443136. Throughput: 0: 10074.5. Samples: 84443200. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:58:20,122][635489] Avg episode reward: [(0, '4544.407')] [2023-03-09 07:58:21,652][635776] Updated weights for policy 0, policy_version 164960 (0.0005) [2023-03-09 07:58:25,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10135.9). Total num frames: 84492288. Throughput: 0: 10017.8. Samples: 84472016. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:58:25,122][635489] Avg episode reward: [(0, '4570.941')] [2023-03-09 07:58:25,634][635776] Updated weights for policy 0, policy_version 165040 (0.0005) [2023-03-09 07:58:29,823][635776] Updated weights for policy 0, policy_version 165120 (0.0005) [2023-03-09 07:58:30,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10135.9). Total num frames: 84541440. Throughput: 0: 9934.0. Samples: 84533176. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:58:30,122][635489] Avg episode reward: [(0, '4569.724')] [2023-03-09 07:58:30,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000165120_84541440.pth... [2023-03-09 07:58:30,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000164552_84250624.pth [2023-03-09 07:58:33,951][635776] Updated weights for policy 0, policy_version 165200 (0.0005) [2023-03-09 07:58:35,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10135.9). Total num frames: 84594688. Throughput: 0: 9911.2. Samples: 84594516. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:58:35,122][635489] Avg episode reward: [(0, '4578.173')] [2023-03-09 07:58:37,773][635776] Updated weights for policy 0, policy_version 165280 (0.0005) [2023-03-09 07:58:40,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10103.5, 300 sec: 10163.6). Total num frames: 84647936. Throughput: 0: 9855.9. Samples: 84624208. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:58:40,122][635489] Avg episode reward: [(0, '4519.129')] [2023-03-09 07:58:41,395][635776] Updated weights for policy 0, policy_version 165360 (0.0005) [2023-03-09 07:58:45,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10035.2, 300 sec: 10163.6). Total num frames: 84701184. Throughput: 0: 10011.2. Samples: 84688904. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:58:45,122][635489] Avg episode reward: [(0, '4549.641')] [2023-03-09 07:58:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000165432_84701184.pth... [2023-03-09 07:58:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000164824_84389888.pth [2023-03-09 07:58:45,513][635776] Updated weights for policy 0, policy_version 165440 (0.0005) [2023-03-09 07:58:49,602][635776] Updated weights for policy 0, policy_version 165520 (0.0005) [2023-03-09 07:58:50,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10035.2, 300 sec: 10163.6). Total num frames: 84750336. Throughput: 0: 10094.4. Samples: 84749936. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:58:50,122][635489] Avg episode reward: [(0, '4564.280')] [2023-03-09 07:58:53,785][635776] Updated weights for policy 0, policy_version 165600 (0.0005) [2023-03-09 07:58:55,122][635489] Fps is (10 sec: 9830.5, 60 sec: 9966.9, 300 sec: 10163.6). Total num frames: 84799488. Throughput: 0: 10100.0. Samples: 84778916. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:58:55,122][635489] Avg episode reward: [(0, '4569.946')] [2023-03-09 07:58:57,563][635776] Updated weights for policy 0, policy_version 165680 (0.0005) [2023-03-09 07:59:00,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10035.2, 300 sec: 10163.6). Total num frames: 84852736. Throughput: 0: 10284.5. Samples: 84844496. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:59:00,122][635489] Avg episode reward: [(0, '4555.388')] [2023-03-09 07:59:00,146][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000165736_84856832.pth... [2023-03-09 07:59:00,147][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000165120_84541440.pth [2023-03-09 07:59:01,171][635776] Updated weights for policy 0, policy_version 165760 (0.0005) [2023-03-09 07:59:05,122][635489] Fps is (10 sec: 10649.7, 60 sec: 10171.7, 300 sec: 10163.6). Total num frames: 84905984. Throughput: 0: 10285.5. Samples: 84906048. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:59:05,122][635489] Avg episode reward: [(0, '4566.657')] [2023-03-09 07:59:05,268][635776] Updated weights for policy 0, policy_version 165840 (0.0005) [2023-03-09 07:59:09,370][635776] Updated weights for policy 0, policy_version 165920 (0.0004) [2023-03-09 07:59:10,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10149.8). Total num frames: 84955136. Throughput: 0: 10303.1. Samples: 84935656. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:59:10,122][635489] Avg episode reward: [(0, '4594.629')] [2023-03-09 07:59:13,322][635776] Updated weights for policy 0, policy_version 166000 (0.0005) [2023-03-09 07:59:15,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10308.3, 300 sec: 10149.7). Total num frames: 85008384. Throughput: 0: 10322.4. Samples: 84997684. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:59:15,122][635489] Avg episode reward: [(0, '4550.577')] [2023-03-09 07:59:15,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000166032_85008384.pth... [2023-03-09 07:59:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000165432_84701184.pth [2023-03-09 07:59:17,553][635776] Updated weights for policy 0, policy_version 166080 (0.0004) [2023-03-09 07:59:20,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10149.7). Total num frames: 85057536. Throughput: 0: 10283.9. Samples: 85057292. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:59:20,122][635489] Avg episode reward: [(0, '4571.127')] [2023-03-09 07:59:21,702][635776] Updated weights for policy 0, policy_version 166160 (0.0004) [2023-03-09 07:59:25,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 10135.9). Total num frames: 85106688. Throughput: 0: 10268.1. Samples: 85086272. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:59:25,122][635489] Avg episode reward: [(0, '4585.998')] [2023-03-09 07:59:25,784][635776] Updated weights for policy 0, policy_version 166240 (0.0004) [2023-03-09 07:59:29,741][635776] Updated weights for policy 0, policy_version 166320 (0.0005) [2023-03-09 07:59:30,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10240.0, 300 sec: 10122.0). Total num frames: 85155840. Throughput: 0: 10194.8. Samples: 85147668. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:59:30,122][635489] Avg episode reward: [(0, '4580.617')] [2023-03-09 07:59:30,145][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000166328_85159936.pth... [2023-03-09 07:59:30,146][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000165736_84856832.pth [2023-03-09 07:59:33,910][635776] Updated weights for policy 0, policy_version 166400 (0.0005) [2023-03-09 07:59:35,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10171.7, 300 sec: 10122.0). Total num frames: 85204992. Throughput: 0: 10144.8. Samples: 85206452. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:59:35,122][635489] Avg episode reward: [(0, '4554.874')] [2023-03-09 07:59:37,954][635776] Updated weights for policy 0, policy_version 166480 (0.0005) [2023-03-09 07:59:40,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10135.9). Total num frames: 85258240. Throughput: 0: 10190.8. Samples: 85237500. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:59:40,122][635489] Avg episode reward: [(0, '4591.805')] [2023-03-09 07:59:41,985][635776] Updated weights for policy 0, policy_version 166560 (0.0004) [2023-03-09 07:59:45,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10171.7, 300 sec: 10149.7). Total num frames: 85311488. Throughput: 0: 10111.6. Samples: 85299520. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:59:45,122][635489] Avg episode reward: [(0, '4599.332')] [2023-03-09 07:59:45,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000166624_85311488.pth... [2023-03-09 07:59:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000166032_85008384.pth [2023-03-09 07:59:45,793][635776] Updated weights for policy 0, policy_version 166640 (0.0005) [2023-03-09 07:59:49,695][635776] Updated weights for policy 0, policy_version 166720 (0.0004) [2023-03-09 07:59:50,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10240.0, 300 sec: 10163.6). Total num frames: 85364736. Throughput: 0: 10154.9. Samples: 85363020. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:59:50,122][635489] Avg episode reward: [(0, '4603.426')] [2023-03-09 07:59:53,675][635776] Updated weights for policy 0, policy_version 166800 (0.0005) [2023-03-09 07:59:55,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10149.8). Total num frames: 85413888. Throughput: 0: 10170.9. Samples: 85393348. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 07:59:55,122][635489] Avg episode reward: [(0, '4528.185')] [2023-03-09 07:59:57,699][635776] Updated weights for policy 0, policy_version 166880 (0.0004) [2023-03-09 08:00:00,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10149.8). Total num frames: 85467136. Throughput: 0: 10160.6. Samples: 85454912. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:00:00,122][635489] Avg episode reward: [(0, '4558.794')] [2023-03-09 08:00:00,148][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000166936_85471232.pth... [2023-03-09 08:00:00,149][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000166328_85159936.pth [2023-03-09 08:00:01,128][635776] Updated weights for policy 0, policy_version 166960 (0.0004) [2023-03-09 08:00:05,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10240.0, 300 sec: 10149.7). Total num frames: 85520384. Throughput: 0: 10362.1. Samples: 85523588. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:00:05,122][635489] Avg episode reward: [(0, '4557.290')] [2023-03-09 08:00:05,129][635776] Updated weights for policy 0, policy_version 167040 (0.0005) [2023-03-09 08:00:09,332][635776] Updated weights for policy 0, policy_version 167120 (0.0004) [2023-03-09 08:00:10,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10308.3, 300 sec: 10149.8). Total num frames: 85573632. Throughput: 0: 10357.7. Samples: 85552368. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 08:00:10,122][635489] Avg episode reward: [(0, '4511.803')] [2023-03-09 08:00:13,343][635776] Updated weights for policy 0, policy_version 167200 (0.0005) [2023-03-09 08:00:15,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10149.7). Total num frames: 85622784. Throughput: 0: 10323.5. Samples: 85612228. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 08:00:15,122][635489] Avg episode reward: [(0, '4420.115')] [2023-03-09 08:00:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000167232_85622784.pth... [2023-03-09 08:00:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000166624_85311488.pth [2023-03-09 08:00:17,271][635776] Updated weights for policy 0, policy_version 167280 (0.0005) [2023-03-09 08:00:20,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10240.0, 300 sec: 10149.8). Total num frames: 85671936. Throughput: 0: 10365.2. Samples: 85672888. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 08:00:20,122][635489] Avg episode reward: [(0, '4491.766')] [2023-03-09 08:00:21,251][635776] Updated weights for policy 0, policy_version 167360 (0.0004) [2023-03-09 08:00:25,121][635489] Fps is (10 sec: 10240.2, 60 sec: 10308.3, 300 sec: 10149.8). Total num frames: 85725184. Throughput: 0: 10380.2. Samples: 85704608. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 08:00:25,122][635489] Avg episode reward: [(0, '4350.144')] [2023-03-09 08:00:25,542][635776] Updated weights for policy 0, policy_version 167440 (0.0006) [2023-03-09 08:00:29,742][635776] Updated weights for policy 0, policy_version 167520 (0.0005) [2023-03-09 08:00:30,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10240.0, 300 sec: 10135.9). Total num frames: 85770240. Throughput: 0: 10279.8. Samples: 85762112. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 08:00:30,122][635489] Avg episode reward: [(0, '4502.558')] [2023-03-09 08:00:30,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000167520_85770240.pth... [2023-03-09 08:00:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000166936_85471232.pth [2023-03-09 08:00:33,958][635776] Updated weights for policy 0, policy_version 167600 (0.0004) [2023-03-09 08:00:35,121][635489] Fps is (10 sec: 9420.8, 60 sec: 10240.0, 300 sec: 10135.9). Total num frames: 85819392. Throughput: 0: 10162.8. Samples: 85820344. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 08:00:35,122][635489] Avg episode reward: [(0, '4533.116')] [2023-03-09 08:00:38,183][635776] Updated weights for policy 0, policy_version 167680 (0.0005) [2023-03-09 08:00:40,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10135.9). Total num frames: 85872640. Throughput: 0: 10127.5. Samples: 85849084. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 08:00:40,122][635489] Avg episode reward: [(0, '4492.083')] [2023-03-09 08:00:42,034][635776] Updated weights for policy 0, policy_version 167760 (0.0006) [2023-03-09 08:00:45,122][635489] Fps is (10 sec: 10239.8, 60 sec: 10171.7, 300 sec: 10135.9). Total num frames: 85921792. Throughput: 0: 10161.8. Samples: 85912196. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 08:00:45,122][635489] Avg episode reward: [(0, '4551.422')] [2023-03-09 08:00:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000167816_85921792.pth... [2023-03-09 08:00:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000167232_85622784.pth [2023-03-09 08:00:46,092][635776] Updated weights for policy 0, policy_version 167840 (0.0005) [2023-03-09 08:00:50,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10103.5, 300 sec: 10122.0). Total num frames: 85970944. Throughput: 0: 9964.8. Samples: 85972004. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 08:00:50,122][635489] Avg episode reward: [(0, '4548.553')] [2023-03-09 08:00:50,213][635776] Updated weights for policy 0, policy_version 167920 (0.0006) [2023-03-09 08:00:54,175][635776] Updated weights for policy 0, policy_version 168000 (0.0005) [2023-03-09 08:00:55,122][635489] Fps is (10 sec: 10240.1, 60 sec: 10171.7, 300 sec: 10135.9). Total num frames: 86024192. Throughput: 0: 10023.0. Samples: 86003404. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 08:00:55,122][635489] Avg episode reward: [(0, '4538.405')] [2023-03-09 08:00:58,180][635776] Updated weights for policy 0, policy_version 168080 (0.0006) [2023-03-09 08:01:00,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10103.4, 300 sec: 10135.9). Total num frames: 86073344. Throughput: 0: 10062.0. Samples: 86065020. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 08:01:00,122][635489] Avg episode reward: [(0, '4539.130')] [2023-03-09 08:01:00,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000168112_86073344.pth... [2023-03-09 08:01:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000167520_85770240.pth [2023-03-09 08:01:02,480][635776] Updated weights for policy 0, policy_version 168160 (0.0006) [2023-03-09 08:01:05,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10035.2, 300 sec: 10122.0). Total num frames: 86122496. Throughput: 0: 9984.6. Samples: 86122196. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-09 08:01:05,122][635489] Avg episode reward: [(0, '4539.386')] [2023-03-09 08:01:06,703][635776] Updated weights for policy 0, policy_version 168240 (0.0006) [2023-03-09 08:01:10,122][635489] Fps is (10 sec: 9830.5, 60 sec: 9966.9, 300 sec: 10122.0). Total num frames: 86171648. Throughput: 0: 9925.0. Samples: 86151232. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:01:10,122][635489] Avg episode reward: [(0, '4538.720')] [2023-03-09 08:01:10,692][635776] Updated weights for policy 0, policy_version 168320 (0.0005) [2023-03-09 08:01:14,776][635776] Updated weights for policy 0, policy_version 168400 (0.0004) [2023-03-09 08:01:15,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 10108.1). Total num frames: 86220800. Throughput: 0: 10012.5. Samples: 86212672. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:01:15,122][635489] Avg episode reward: [(0, '4507.762')] [2023-03-09 08:01:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000168400_86220800.pth... [2023-03-09 08:01:15,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000167816_85921792.pth [2023-03-09 08:01:18,831][635776] Updated weights for policy 0, policy_version 168480 (0.0005) [2023-03-09 08:01:20,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10108.1). Total num frames: 86274048. Throughput: 0: 10044.2. Samples: 86272332. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:01:20,122][635489] Avg episode reward: [(0, '4490.637')] [2023-03-09 08:01:22,940][635776] Updated weights for policy 0, policy_version 168560 (0.0005) [2023-03-09 08:01:25,121][635489] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 10122.0). Total num frames: 86323200. Throughput: 0: 10079.9. Samples: 86302680. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:01:25,122][635489] Avg episode reward: [(0, '4546.729')] [2023-03-09 08:01:26,952][635776] Updated weights for policy 0, policy_version 168640 (0.0005) [2023-03-09 08:01:30,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10035.2, 300 sec: 10122.0). Total num frames: 86372352. Throughput: 0: 10020.0. Samples: 86363096. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:01:30,122][635489] Avg episode reward: [(0, '4541.479')] [2023-03-09 08:01:30,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000168696_86372352.pth... [2023-03-09 08:01:30,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000168112_86073344.pth [2023-03-09 08:01:31,167][635776] Updated weights for policy 0, policy_version 168720 (0.0005) [2023-03-09 08:01:34,843][635776] Updated weights for policy 0, policy_version 168800 (0.0005) [2023-03-09 08:01:35,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 10135.9). Total num frames: 86425600. Throughput: 0: 10081.7. Samples: 86425680. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:01:35,122][635489] Avg episode reward: [(0, '4540.820')] [2023-03-09 08:01:38,780][635776] Updated weights for policy 0, policy_version 168880 (0.0005) [2023-03-09 08:01:40,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10103.5, 300 sec: 10149.7). Total num frames: 86478848. Throughput: 0: 10107.6. Samples: 86458244. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:01:40,122][635489] Avg episode reward: [(0, '4545.493')] [2023-03-09 08:01:42,728][635776] Updated weights for policy 0, policy_version 168960 (0.0005) [2023-03-09 08:01:45,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10171.7, 300 sec: 10149.7). Total num frames: 86532096. Throughput: 0: 10105.3. Samples: 86519756. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:01:45,122][635489] Avg episode reward: [(0, '4501.707')] [2023-03-09 08:01:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000169008_86532096.pth... [2023-03-09 08:01:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000168400_86220800.pth [2023-03-09 08:01:46,541][635776] Updated weights for policy 0, policy_version 169040 (0.0005) [2023-03-09 08:01:50,121][635489] Fps is (10 sec: 10649.7, 60 sec: 10240.0, 300 sec: 10149.8). Total num frames: 86585344. Throughput: 0: 10260.0. Samples: 86583896. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:01:50,122][635489] Avg episode reward: [(0, '4445.640')] [2023-03-09 08:01:50,536][635776] Updated weights for policy 0, policy_version 169120 (0.0005) [2023-03-09 08:01:54,578][635776] Updated weights for policy 0, policy_version 169200 (0.0005) [2023-03-09 08:01:55,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10135.9). Total num frames: 86634496. Throughput: 0: 10288.9. Samples: 86614232. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:01:55,133][635489] Avg episode reward: [(0, '4510.974')] [2023-03-09 08:01:58,594][635776] Updated weights for policy 0, policy_version 169280 (0.0006) [2023-03-09 08:02:00,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10171.7, 300 sec: 10135.9). Total num frames: 86683648. Throughput: 0: 10273.7. Samples: 86674988. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:02:00,132][635489] Avg episode reward: [(0, '4517.444')] [2023-03-09 08:02:00,135][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000169304_86683648.pth... [2023-03-09 08:02:00,137][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000168696_86372352.pth [2023-03-09 08:02:02,469][635776] Updated weights for policy 0, policy_version 169360 (0.0005) [2023-03-09 08:02:05,121][635489] Fps is (10 sec: 10649.7, 60 sec: 10308.3, 300 sec: 10163.6). Total num frames: 86740992. Throughput: 0: 10376.0. Samples: 86739252. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:02:05,132][635489] Avg episode reward: [(0, '4545.302')] [2023-03-09 08:02:06,085][635776] Updated weights for policy 0, policy_version 169440 (0.0004) [2023-03-09 08:02:10,132][635776] Updated weights for policy 0, policy_version 169520 (0.0004) [2023-03-09 08:02:10,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 10163.6). Total num frames: 86790144. Throughput: 0: 10449.9. Samples: 86772928. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:02:10,133][635489] Avg episode reward: [(0, '4544.467')] [2023-03-09 08:02:14,241][635776] Updated weights for policy 0, policy_version 169600 (0.0004) [2023-03-09 08:02:15,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 10177.5). Total num frames: 86843392. Throughput: 0: 10401.6. Samples: 86831168. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 08:02:15,133][635489] Avg episode reward: [(0, '4322.198')] [2023-03-09 08:02:15,136][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000169616_86843392.pth... [2023-03-09 08:02:15,139][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000169008_86532096.pth [2023-03-09 08:02:18,224][635776] Updated weights for policy 0, policy_version 169680 (0.0005) [2023-03-09 08:02:20,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10177.5). Total num frames: 86892544. Throughput: 0: 10376.2. Samples: 86892608. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 08:02:20,122][635489] Avg episode reward: [(0, '4427.951')] [2023-03-09 08:02:22,350][635776] Updated weights for policy 0, policy_version 169760 (0.0004) [2023-03-09 08:02:25,121][635489] Fps is (10 sec: 10240.2, 60 sec: 10376.5, 300 sec: 10191.4). Total num frames: 86945792. Throughput: 0: 10319.8. Samples: 86922636. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 08:02:25,122][635489] Avg episode reward: [(0, '4227.904')] [2023-03-09 08:02:26,257][635776] Updated weights for policy 0, policy_version 169840 (0.0005) [2023-03-09 08:02:30,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10191.4). Total num frames: 86994944. Throughput: 0: 10351.1. Samples: 86985556. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 08:02:30,122][635489] Avg episode reward: [(0, '4451.290')] [2023-03-09 08:02:30,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000169912_86994944.pth... [2023-03-09 08:02:30,129][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000169304_86683648.pth [2023-03-09 08:02:30,299][635776] Updated weights for policy 0, policy_version 169920 (0.0005) [2023-03-09 08:02:34,120][635776] Updated weights for policy 0, policy_version 170000 (0.0005) [2023-03-09 08:02:35,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 10191.4). Total num frames: 87048192. Throughput: 0: 10319.1. Samples: 87048256. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 08:02:35,122][635489] Avg episode reward: [(0, '4373.200')] [2023-03-09 08:02:37,922][635776] Updated weights for policy 0, policy_version 170080 (0.0005) [2023-03-09 08:02:40,121][635489] Fps is (10 sec: 10649.7, 60 sec: 10376.5, 300 sec: 10177.5). Total num frames: 87101440. Throughput: 0: 10372.2. Samples: 87080980. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 08:02:40,122][635489] Avg episode reward: [(0, '4484.350')] [2023-03-09 08:02:41,716][635776] Updated weights for policy 0, policy_version 170160 (0.0005) [2023-03-09 08:02:45,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10191.4). Total num frames: 87154688. Throughput: 0: 10477.9. Samples: 87146492. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 08:02:45,122][635489] Avg episode reward: [(0, '4351.434')] [2023-03-09 08:02:45,139][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000170232_87158784.pth... [2023-03-09 08:02:45,140][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000169616_86843392.pth [2023-03-09 08:02:45,491][635776] Updated weights for policy 0, policy_version 170240 (0.0005) [2023-03-09 08:02:49,629][635776] Updated weights for policy 0, policy_version 170320 (0.0005) [2023-03-09 08:02:50,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10191.4). Total num frames: 87207936. Throughput: 0: 10394.4. Samples: 87207000. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 08:02:50,122][635489] Avg episode reward: [(0, '4305.601')] [2023-03-09 08:02:53,385][635776] Updated weights for policy 0, policy_version 170400 (0.0005) [2023-03-09 08:02:55,121][635489] Fps is (10 sec: 10649.7, 60 sec: 10444.8, 300 sec: 10205.3). Total num frames: 87261184. Throughput: 0: 10346.7. Samples: 87238528. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 08:02:55,122][635489] Avg episode reward: [(0, '4127.959')] [2023-03-09 08:02:57,182][635776] Updated weights for policy 0, policy_version 170480 (0.0005) [2023-03-09 08:03:00,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10233.1). Total num frames: 87314432. Throughput: 0: 10493.3. Samples: 87303364. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 08:03:00,122][635489] Avg episode reward: [(0, '4093.616')] [2023-03-09 08:03:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000170536_87314432.pth... [2023-03-09 08:03:00,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000169912_86994944.pth [2023-03-09 08:03:01,101][635776] Updated weights for policy 0, policy_version 170560 (0.0005) [2023-03-09 08:03:05,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10233.1). Total num frames: 87363584. Throughput: 0: 10467.6. Samples: 87363648. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 08:03:05,122][635489] Avg episode reward: [(0, '4382.891')] [2023-03-09 08:03:05,232][635776] Updated weights for policy 0, policy_version 170640 (0.0005) [2023-03-09 08:03:09,063][635776] Updated weights for policy 0, policy_version 170720 (0.0005) [2023-03-09 08:03:10,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10260.8). Total num frames: 87416832. Throughput: 0: 10527.0. Samples: 87396352. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 08:03:10,122][635489] Avg episode reward: [(0, '4347.453')] [2023-03-09 08:03:13,029][635776] Updated weights for policy 0, policy_version 170800 (0.0005) [2023-03-09 08:03:15,122][635489] Fps is (10 sec: 10649.4, 60 sec: 10444.8, 300 sec: 10260.8). Total num frames: 87470080. Throughput: 0: 10496.0. Samples: 87457876. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 08:03:15,122][635489] Avg episode reward: [(0, '4128.310')] [2023-03-09 08:03:15,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000170840_87470080.pth... [2023-03-09 08:03:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000170232_87158784.pth [2023-03-09 08:03:17,178][635776] Updated weights for policy 0, policy_version 170880 (0.0005) [2023-03-09 08:03:20,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10260.8). Total num frames: 87519232. Throughput: 0: 10463.9. Samples: 87519132. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:03:20,122][635489] Avg episode reward: [(0, '4085.092')] [2023-03-09 08:03:21,225][635776] Updated weights for policy 0, policy_version 170960 (0.0005) [2023-03-09 08:03:25,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10376.5, 300 sec: 10260.8). Total num frames: 87568384. Throughput: 0: 10377.5. Samples: 87547968. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:03:25,122][635489] Avg episode reward: [(0, '4286.922')] [2023-03-09 08:03:25,257][635776] Updated weights for policy 0, policy_version 171040 (0.0005) [2023-03-09 08:03:29,171][635776] Updated weights for policy 0, policy_version 171120 (0.0005) [2023-03-09 08:03:30,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10260.8). Total num frames: 87621632. Throughput: 0: 10308.0. Samples: 87610352. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:03:30,122][635489] Avg episode reward: [(0, '4305.020')] [2023-03-09 08:03:30,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000171136_87621632.pth... [2023-03-09 08:03:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000170536_87314432.pth [2023-03-09 08:03:33,325][635776] Updated weights for policy 0, policy_version 171200 (0.0005) [2023-03-09 08:03:35,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 10246.9). Total num frames: 87670784. Throughput: 0: 10298.7. Samples: 87670444. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:03:35,122][635489] Avg episode reward: [(0, '4329.572')] [2023-03-09 08:03:37,374][635776] Updated weights for policy 0, policy_version 171280 (0.0005) [2023-03-09 08:03:40,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10308.3, 300 sec: 10233.1). Total num frames: 87719936. Throughput: 0: 10257.6. Samples: 87700120. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:03:40,122][635489] Avg episode reward: [(0, '4303.162')] [2023-03-09 08:03:41,593][635776] Updated weights for policy 0, policy_version 171360 (0.0004) [2023-03-09 08:03:45,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10240.0, 300 sec: 10233.1). Total num frames: 87769088. Throughput: 0: 10151.7. Samples: 87760188. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:03:45,122][635489] Avg episode reward: [(0, '4351.662')] [2023-03-09 08:03:45,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000171424_87769088.pth... [2023-03-09 08:03:45,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000170840_87470080.pth [2023-03-09 08:03:45,682][635776] Updated weights for policy 0, policy_version 171440 (0.0005) [2023-03-09 08:03:49,665][635776] Updated weights for policy 0, policy_version 171520 (0.0005) [2023-03-09 08:03:50,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10246.9). Total num frames: 87822336. Throughput: 0: 10155.3. Samples: 87820636. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:03:50,122][635489] Avg episode reward: [(0, '4384.763')] [2023-03-09 08:03:53,936][635776] Updated weights for policy 0, policy_version 171600 (0.0004) [2023-03-09 08:03:55,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10219.2). Total num frames: 87867392. Throughput: 0: 10078.9. Samples: 87849904. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:03:55,122][635489] Avg episode reward: [(0, '4371.764')] [2023-03-09 08:03:57,960][635776] Updated weights for policy 0, policy_version 171680 (0.0005) [2023-03-09 08:04:00,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10219.2). Total num frames: 87920640. Throughput: 0: 10089.4. Samples: 87911896. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:04:00,122][635489] Avg episode reward: [(0, '4257.564')] [2023-03-09 08:04:00,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000171720_87920640.pth... [2023-03-09 08:04:00,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000171136_87621632.pth [2023-03-09 08:04:02,006][635776] Updated weights for policy 0, policy_version 171760 (0.0005) [2023-03-09 08:04:05,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10219.2). Total num frames: 87969792. Throughput: 0: 10023.4. Samples: 87970184. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:04:05,122][635489] Avg episode reward: [(0, '4383.272')] [2023-03-09 08:04:06,021][635776] Updated weights for policy 0, policy_version 171840 (0.0005) [2023-03-09 08:04:09,896][635776] Updated weights for policy 0, policy_version 171920 (0.0005) [2023-03-09 08:04:10,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 10219.2). Total num frames: 88023040. Throughput: 0: 10098.6. Samples: 88002404. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:04:10,122][635489] Avg episode reward: [(0, '4333.311')] [2023-03-09 08:04:13,851][635776] Updated weights for policy 0, policy_version 172000 (0.0005) [2023-03-09 08:04:15,121][635489] Fps is (10 sec: 10649.7, 60 sec: 10103.5, 300 sec: 10233.1). Total num frames: 88076288. Throughput: 0: 10082.7. Samples: 88064072. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:04:15,122][635489] Avg episode reward: [(0, '4164.272')] [2023-03-09 08:04:15,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000172024_88076288.pth... [2023-03-09 08:04:15,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000171424_87769088.pth [2023-03-09 08:04:17,975][635776] Updated weights for policy 0, policy_version 172080 (0.0004) [2023-03-09 08:04:20,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10233.1). Total num frames: 88125440. Throughput: 0: 10112.5. Samples: 88125504. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:04:20,122][635489] Avg episode reward: [(0, '4476.076')] [2023-03-09 08:04:21,965][635776] Updated weights for policy 0, policy_version 172160 (0.0005) [2023-03-09 08:04:25,122][635489] Fps is (10 sec: 9420.7, 60 sec: 10035.2, 300 sec: 10219.2). Total num frames: 88170496. Throughput: 0: 10090.1. Samples: 88154176. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:04:25,122][635489] Avg episode reward: [(0, '4422.262')] [2023-03-09 08:04:26,411][635776] Updated weights for policy 0, policy_version 172240 (0.0005) [2023-03-09 08:04:30,121][635489] Fps is (10 sec: 9420.8, 60 sec: 9966.9, 300 sec: 10219.2). Total num frames: 88219648. Throughput: 0: 10028.4. Samples: 88211464. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:04:30,122][635489] Avg episode reward: [(0, '4157.634')] [2023-03-09 08:04:30,139][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000172312_88223744.pth... [2023-03-09 08:04:30,140][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000171720_87920640.pth [2023-03-09 08:04:30,523][635776] Updated weights for policy 0, policy_version 172320 (0.0005) [2023-03-09 08:04:34,554][635776] Updated weights for policy 0, policy_version 172400 (0.0005) [2023-03-09 08:04:35,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10035.2, 300 sec: 10219.2). Total num frames: 88272896. Throughput: 0: 10047.4. Samples: 88272768. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:04:35,122][635489] Avg episode reward: [(0, '4023.929')] [2023-03-09 08:04:38,579][635776] Updated weights for policy 0, policy_version 172480 (0.0005) [2023-03-09 08:04:40,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10205.3). Total num frames: 88322048. Throughput: 0: 10040.7. Samples: 88301736. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:04:40,122][635489] Avg episode reward: [(0, '4219.953')] [2023-03-09 08:04:42,722][635776] Updated weights for policy 0, policy_version 172560 (0.0005) [2023-03-09 08:04:45,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10191.4). Total num frames: 88371200. Throughput: 0: 10018.4. Samples: 88362724. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:04:45,122][635489] Avg episode reward: [(0, '4140.375')] [2023-03-09 08:04:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000172600_88371200.pth... [2023-03-09 08:04:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000172024_88076288.pth [2023-03-09 08:04:46,942][635776] Updated weights for policy 0, policy_version 172640 (0.0004) [2023-03-09 08:04:50,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10205.3). Total num frames: 88424448. Throughput: 0: 10079.1. Samples: 88423744. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:04:50,122][635489] Avg episode reward: [(0, '4267.427')] [2023-03-09 08:04:50,836][635776] Updated weights for policy 0, policy_version 172720 (0.0005) [2023-03-09 08:04:55,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10035.2, 300 sec: 10177.5). Total num frames: 88469504. Throughput: 0: 10015.3. Samples: 88453092. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:04:55,122][635489] Avg episode reward: [(0, '4268.585')] [2023-03-09 08:04:55,174][635776] Updated weights for policy 0, policy_version 172800 (0.0005) [2023-03-09 08:04:58,912][635776] Updated weights for policy 0, policy_version 172880 (0.0005) [2023-03-09 08:05:00,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10177.5). Total num frames: 88522752. Throughput: 0: 10010.8. Samples: 88514560. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:05:00,122][635489] Avg episode reward: [(0, '4265.910')] [2023-03-09 08:05:00,166][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000172904_88526848.pth... [2023-03-09 08:05:00,168][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000172312_88223744.pth [2023-03-09 08:05:03,047][635776] Updated weights for policy 0, policy_version 172960 (0.0005) [2023-03-09 08:05:05,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10103.5, 300 sec: 10177.5). Total num frames: 88576000. Throughput: 0: 9977.6. Samples: 88574496. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:05:05,122][635489] Avg episode reward: [(0, '4377.826')] [2023-03-09 08:05:06,955][635776] Updated weights for policy 0, policy_version 173040 (0.0005) [2023-03-09 08:05:10,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10103.5, 300 sec: 10191.4). Total num frames: 88629248. Throughput: 0: 10057.4. Samples: 88606760. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:05:10,122][635489] Avg episode reward: [(0, '4329.595')] [2023-03-09 08:05:10,860][635776] Updated weights for policy 0, policy_version 173120 (0.0005) [2023-03-09 08:05:14,780][635776] Updated weights for policy 0, policy_version 173200 (0.0005) [2023-03-09 08:05:15,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10035.2, 300 sec: 10191.4). Total num frames: 88678400. Throughput: 0: 10164.1. Samples: 88668848. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:05:15,122][635489] Avg episode reward: [(0, '4293.937')] [2023-03-09 08:05:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000173200_88678400.pth... [2023-03-09 08:05:15,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000172600_88371200.pth [2023-03-09 08:05:18,812][635776] Updated weights for policy 0, policy_version 173280 (0.0005) [2023-03-09 08:05:20,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10191.4). Total num frames: 88731648. Throughput: 0: 10190.2. Samples: 88731328. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:05:20,122][635489] Avg episode reward: [(0, '4523.538')] [2023-03-09 08:05:22,811][635776] Updated weights for policy 0, policy_version 173360 (0.0005) [2023-03-09 08:05:25,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 10205.3). Total num frames: 88780800. Throughput: 0: 10203.2. Samples: 88760880. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:05:25,122][635489] Avg episode reward: [(0, '4468.806')] [2023-03-09 08:05:26,894][635776] Updated weights for policy 0, policy_version 173440 (0.0005) [2023-03-09 08:05:30,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10219.2). Total num frames: 88834048. Throughput: 0: 10200.4. Samples: 88821740. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:05:30,122][635489] Avg episode reward: [(0, '4399.529')] [2023-03-09 08:05:30,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000173504_88834048.pth... [2023-03-09 08:05:30,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000172904_88526848.pth [2023-03-09 08:05:30,867][635776] Updated weights for policy 0, policy_version 173520 (0.0005) [2023-03-09 08:05:34,673][635776] Updated weights for policy 0, policy_version 173600 (0.0005) [2023-03-09 08:05:35,121][635489] Fps is (10 sec: 10649.6, 60 sec: 10240.0, 300 sec: 10219.2). Total num frames: 88887296. Throughput: 0: 10262.2. Samples: 88885544. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:05:35,122][635489] Avg episode reward: [(0, '4304.542')] [2023-03-09 08:05:38,633][635776] Updated weights for policy 0, policy_version 173680 (0.0005) [2023-03-09 08:05:40,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10219.2). Total num frames: 88936448. Throughput: 0: 10291.1. Samples: 88916192. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:05:40,122][635489] Avg episode reward: [(0, '4520.022')] [2023-03-09 08:05:42,583][635776] Updated weights for policy 0, policy_version 173760 (0.0005) [2023-03-09 08:05:45,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10233.1). Total num frames: 88989696. Throughput: 0: 10306.9. Samples: 88978368. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:05:45,122][635489] Avg episode reward: [(0, '4386.650')] [2023-03-09 08:05:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000173808_88989696.pth... [2023-03-09 08:05:45,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000173200_88678400.pth [2023-03-09 08:05:46,681][635776] Updated weights for policy 0, policy_version 173840 (0.0005) [2023-03-09 08:05:50,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10219.2). Total num frames: 89038848. Throughput: 0: 10318.1. Samples: 89038808. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:05:50,122][635489] Avg episode reward: [(0, '4331.903')] [2023-03-09 08:05:50,642][635776] Updated weights for policy 0, policy_version 173920 (0.0005) [2023-03-09 08:05:54,340][635776] Updated weights for policy 0, policy_version 174000 (0.0005) [2023-03-09 08:05:55,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10233.1). Total num frames: 89092096. Throughput: 0: 10329.1. Samples: 89071568. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:05:55,122][635489] Avg episode reward: [(0, '4411.396')] [2023-03-09 08:05:58,508][635776] Updated weights for policy 0, policy_version 174080 (0.0004) [2023-03-09 08:06:00,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10376.5, 300 sec: 10246.9). Total num frames: 89145344. Throughput: 0: 10314.3. Samples: 89132992. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:06:00,122][635489] Avg episode reward: [(0, '4431.656')] [2023-03-09 08:06:00,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000174112_89145344.pth... [2023-03-09 08:06:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000173504_88834048.pth [2023-03-09 08:06:02,638][635776] Updated weights for policy 0, policy_version 174160 (0.0005) [2023-03-09 08:06:05,122][635489] Fps is (10 sec: 10239.8, 60 sec: 10308.3, 300 sec: 10246.9). Total num frames: 89194496. Throughput: 0: 10290.3. Samples: 89194392. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:06:05,133][635489] Avg episode reward: [(0, '4359.194')] [2023-03-09 08:06:06,581][635776] Updated weights for policy 0, policy_version 174240 (0.0005) [2023-03-09 08:06:10,121][635489] Fps is (10 sec: 9830.6, 60 sec: 10240.0, 300 sec: 10246.9). Total num frames: 89243648. Throughput: 0: 10280.6. Samples: 89223504. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:06:10,122][635489] Avg episode reward: [(0, '4347.962')] [2023-03-09 08:06:10,668][635776] Updated weights for policy 0, policy_version 174320 (0.0005) [2023-03-09 08:06:14,725][635776] Updated weights for policy 0, policy_version 174400 (0.0005) [2023-03-09 08:06:15,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10308.2, 300 sec: 10246.9). Total num frames: 89296896. Throughput: 0: 10282.3. Samples: 89284444. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:06:15,133][635489] Avg episode reward: [(0, '4430.180')] [2023-03-09 08:06:15,136][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000174408_89296896.pth... [2023-03-09 08:06:15,140][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000173808_88989696.pth [2023-03-09 08:06:18,675][635776] Updated weights for policy 0, policy_version 174480 (0.0005) [2023-03-09 08:06:20,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10246.9). Total num frames: 89346048. Throughput: 0: 10243.1. Samples: 89346484. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:06:20,132][635489] Avg episode reward: [(0, '4486.991')] [2023-03-09 08:06:22,646][635776] Updated weights for policy 0, policy_version 174560 (0.0005) [2023-03-09 08:06:25,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 10260.8). Total num frames: 89399296. Throughput: 0: 10251.6. Samples: 89377512. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:06:25,131][635489] Avg episode reward: [(0, '4465.339')] [2023-03-09 08:06:26,800][635776] Updated weights for policy 0, policy_version 174640 (0.0005) [2023-03-09 08:06:30,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10246.9). Total num frames: 89448448. Throughput: 0: 10211.2. Samples: 89437872. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:06:30,130][635489] Avg episode reward: [(0, '4522.469')] [2023-03-09 08:06:30,136][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000174704_89448448.pth... [2023-03-09 08:06:30,139][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000174112_89145344.pth [2023-03-09 08:06:30,640][635776] Updated weights for policy 0, policy_version 174720 (0.0005) [2023-03-09 08:06:34,784][635776] Updated weights for policy 0, policy_version 174800 (0.0005) [2023-03-09 08:06:35,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10233.1). Total num frames: 89497600. Throughput: 0: 10214.7. Samples: 89498468. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:06:35,132][635489] Avg episode reward: [(0, '4242.551')] [2023-03-09 08:06:38,814][635776] Updated weights for policy 0, policy_version 174880 (0.0005) [2023-03-09 08:06:40,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10233.1). Total num frames: 89550848. Throughput: 0: 10188.1. Samples: 89530032. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:06:40,122][635489] Avg episode reward: [(0, '4354.841')] [2023-03-09 08:06:42,726][635776] Updated weights for policy 0, policy_version 174960 (0.0006) [2023-03-09 08:06:45,121][635489] Fps is (10 sec: 10649.6, 60 sec: 10240.0, 300 sec: 10233.1). Total num frames: 89604096. Throughput: 0: 10196.0. Samples: 89591812. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:06:45,132][635489] Avg episode reward: [(0, '4426.913')] [2023-03-09 08:06:45,135][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000175008_89604096.pth... [2023-03-09 08:06:45,138][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000174408_89296896.pth [2023-03-09 08:06:46,569][635776] Updated weights for policy 0, policy_version 175040 (0.0005) [2023-03-09 08:06:50,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10308.3, 300 sec: 10246.9). Total num frames: 89657344. Throughput: 0: 10269.3. Samples: 89656512. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:06:50,133][635489] Avg episode reward: [(0, '4478.595')] [2023-03-09 08:06:50,473][635776] Updated weights for policy 0, policy_version 175120 (0.0005) [2023-03-09 08:06:54,428][635776] Updated weights for policy 0, policy_version 175200 (0.0005) [2023-03-09 08:06:55,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10246.9). Total num frames: 89706496. Throughput: 0: 10282.0. Samples: 89686196. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:06:55,133][635489] Avg episode reward: [(0, '4483.391')] [2023-03-09 08:06:58,233][635776] Updated weights for policy 0, policy_version 175280 (0.0004) [2023-03-09 08:07:00,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 10246.9). Total num frames: 89763840. Throughput: 0: 10352.1. Samples: 89750288. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:07:00,133][635489] Avg episode reward: [(0, '4431.141')] [2023-03-09 08:07:00,136][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000175320_89763840.pth... [2023-03-09 08:07:00,138][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000174704_89448448.pth [2023-03-09 08:07:02,120][635776] Updated weights for policy 0, policy_version 175360 (0.0005) [2023-03-09 08:07:05,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10308.3, 300 sec: 10246.9). Total num frames: 89812992. Throughput: 0: 10339.5. Samples: 89811764. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:07:05,133][635489] Avg episode reward: [(0, '4349.352')] [2023-03-09 08:07:06,221][635776] Updated weights for policy 0, policy_version 175440 (0.0004) [2023-03-09 08:07:10,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10308.3, 300 sec: 10233.1). Total num frames: 89862144. Throughput: 0: 10342.8. Samples: 89842936. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:07:10,122][635489] Avg episode reward: [(0, '4438.892')] [2023-03-09 08:07:10,151][635776] Updated weights for policy 0, policy_version 175520 (0.0005) [2023-03-09 08:07:14,103][635776] Updated weights for policy 0, policy_version 175600 (0.0005) [2023-03-09 08:07:15,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10246.9). Total num frames: 89915392. Throughput: 0: 10355.7. Samples: 89903880. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:07:15,122][635489] Avg episode reward: [(0, '4466.836')] [2023-03-09 08:07:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000175616_89915392.pth... [2023-03-09 08:07:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000175008_89604096.pth [2023-03-09 08:07:18,086][635776] Updated weights for policy 0, policy_version 175680 (0.0005) [2023-03-09 08:07:20,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10376.5, 300 sec: 10246.9). Total num frames: 89968640. Throughput: 0: 10448.6. Samples: 89968656. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:07:20,122][635489] Avg episode reward: [(0, '4502.059')] [2023-03-09 08:07:21,754][635776] Updated weights for policy 0, policy_version 175760 (0.0005) [2023-03-09 08:07:25,122][635489] Fps is (10 sec: 10649.7, 60 sec: 10376.5, 300 sec: 10260.8). Total num frames: 90021888. Throughput: 0: 10475.0. Samples: 90001408. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:07:25,122][635489] Avg episode reward: [(0, '4374.573')] [2023-03-09 08:07:25,771][635776] Updated weights for policy 0, policy_version 175840 (0.0005) [2023-03-09 08:07:29,681][635776] Updated weights for policy 0, policy_version 175920 (0.0005) [2023-03-09 08:07:30,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10260.8). Total num frames: 90075136. Throughput: 0: 10467.0. Samples: 90062828. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:07:30,122][635489] Avg episode reward: [(0, '4331.748')] [2023-03-09 08:07:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000175928_90075136.pth... [2023-03-09 08:07:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000175320_89763840.pth [2023-03-09 08:07:33,878][635776] Updated weights for policy 0, policy_version 176000 (0.0005) [2023-03-09 08:07:35,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10376.5, 300 sec: 10233.1). Total num frames: 90120192. Throughput: 0: 10353.3. Samples: 90122408. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:07:35,122][635489] Avg episode reward: [(0, '4335.545')] [2023-03-09 08:07:37,927][635776] Updated weights for policy 0, policy_version 176080 (0.0005) [2023-03-09 08:07:40,122][635489] Fps is (10 sec: 9830.5, 60 sec: 10376.5, 300 sec: 10233.1). Total num frames: 90173440. Throughput: 0: 10372.5. Samples: 90152960. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 08:07:40,122][635489] Avg episode reward: [(0, '3862.439')] [2023-03-09 08:07:42,147][635776] Updated weights for policy 0, policy_version 176160 (0.0005) [2023-03-09 08:07:45,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10376.5, 300 sec: 10233.1). Total num frames: 90226688. Throughput: 0: 10292.9. Samples: 90213468. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 08:07:45,122][635489] Avg episode reward: [(0, '4029.082')] [2023-03-09 08:07:45,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000176224_90226688.pth... [2023-03-09 08:07:45,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000175616_89915392.pth [2023-03-09 08:07:45,852][635776] Updated weights for policy 0, policy_version 176240 (0.0005) [2023-03-09 08:07:49,922][635776] Updated weights for policy 0, policy_version 176320 (0.0005) [2023-03-09 08:07:50,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10219.2). Total num frames: 90275840. Throughput: 0: 10312.8. Samples: 90275840. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 08:07:50,122][635489] Avg episode reward: [(0, '3956.443')] [2023-03-09 08:07:53,766][635776] Updated weights for policy 0, policy_version 176400 (0.0005) [2023-03-09 08:07:55,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10219.2). Total num frames: 90329088. Throughput: 0: 10320.3. Samples: 90307352. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 08:07:55,122][635489] Avg episode reward: [(0, '4253.168')] [2023-03-09 08:07:57,517][635776] Updated weights for policy 0, policy_version 176480 (0.0005) [2023-03-09 08:08:00,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10308.3, 300 sec: 10233.1). Total num frames: 90382336. Throughput: 0: 10385.3. Samples: 90371220. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 08:08:00,122][635489] Avg episode reward: [(0, '4156.003')] [2023-03-09 08:08:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000176528_90382336.pth... [2023-03-09 08:08:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000175928_90075136.pth [2023-03-09 08:08:01,562][635776] Updated weights for policy 0, policy_version 176560 (0.0005) [2023-03-09 08:08:05,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 10219.2). Total num frames: 90431488. Throughput: 0: 10331.0. Samples: 90433552. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 08:08:05,122][635489] Avg episode reward: [(0, '4141.387')] [2023-03-09 08:08:05,553][635776] Updated weights for policy 0, policy_version 176640 (0.0004) [2023-03-09 08:08:09,611][635776] Updated weights for policy 0, policy_version 176720 (0.0004) [2023-03-09 08:08:10,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10219.2). Total num frames: 90484736. Throughput: 0: 10269.2. Samples: 90463524. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 08:08:10,122][635489] Avg episode reward: [(0, '3985.986')] [2023-03-09 08:08:13,396][635776] Updated weights for policy 0, policy_version 176800 (0.0005) [2023-03-09 08:08:15,122][635489] Fps is (10 sec: 10649.4, 60 sec: 10376.5, 300 sec: 10233.1). Total num frames: 90537984. Throughput: 0: 10289.8. Samples: 90525868. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 08:08:15,122][635489] Avg episode reward: [(0, '4444.248')] [2023-03-09 08:08:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000176832_90537984.pth... [2023-03-09 08:08:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000176224_90226688.pth [2023-03-09 08:08:17,529][635776] Updated weights for policy 0, policy_version 176880 (0.0004) [2023-03-09 08:08:20,122][635489] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 10233.1). Total num frames: 90587136. Throughput: 0: 10326.5. Samples: 90587104. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 08:08:20,122][635489] Avg episode reward: [(0, '4306.719')] [2023-03-09 08:08:21,593][635776] Updated weights for policy 0, policy_version 176960 (0.0005) [2023-03-09 08:08:25,122][635489] Fps is (10 sec: 9830.5, 60 sec: 10240.0, 300 sec: 10219.2). Total num frames: 90636288. Throughput: 0: 10286.9. Samples: 90615872. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 08:08:25,122][635489] Avg episode reward: [(0, '3700.726')] [2023-03-09 08:08:25,563][635776] Updated weights for policy 0, policy_version 177040 (0.0004) [2023-03-09 08:08:29,603][635776] Updated weights for policy 0, policy_version 177120 (0.0005) [2023-03-09 08:08:30,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10233.1). Total num frames: 90689536. Throughput: 0: 10309.1. Samples: 90677380. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 08:08:30,122][635489] Avg episode reward: [(0, '3953.314')] [2023-03-09 08:08:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000177128_90689536.pth... [2023-03-09 08:08:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000176528_90382336.pth [2023-03-09 08:08:33,445][635776] Updated weights for policy 0, policy_version 177200 (0.0005) [2023-03-09 08:08:35,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10376.5, 300 sec: 10246.9). Total num frames: 90742784. Throughput: 0: 10368.9. Samples: 90742440. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 08:08:35,122][635489] Avg episode reward: [(0, '3900.056')] [2023-03-09 08:08:37,457][635776] Updated weights for policy 0, policy_version 177280 (0.0005) [2023-03-09 08:08:40,121][635489] Fps is (10 sec: 10240.2, 60 sec: 10308.3, 300 sec: 10246.9). Total num frames: 90791936. Throughput: 0: 10316.9. Samples: 90771612. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 08:08:40,122][635489] Avg episode reward: [(0, '4189.558')] [2023-03-09 08:08:41,384][635776] Updated weights for policy 0, policy_version 177360 (0.0005) [2023-03-09 08:08:45,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 10233.1). Total num frames: 90841088. Throughput: 0: 10258.1. Samples: 90832832. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-09 08:08:45,122][635489] Avg episode reward: [(0, '3910.036')] [2023-03-09 08:08:45,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000177424_90841088.pth... [2023-03-09 08:08:45,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000176832_90537984.pth [2023-03-09 08:08:45,607][635776] Updated weights for policy 0, policy_version 177440 (0.0005) [2023-03-09 08:08:49,639][635776] Updated weights for policy 0, policy_version 177520 (0.0005) [2023-03-09 08:08:50,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10308.3, 300 sec: 10260.8). Total num frames: 90894336. Throughput: 0: 10221.6. Samples: 90893524. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 08:08:50,122][635489] Avg episode reward: [(0, '4029.725')] [2023-03-09 08:08:53,497][635776] Updated weights for policy 0, policy_version 177600 (0.0005) [2023-03-09 08:08:55,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 10260.8). Total num frames: 90947584. Throughput: 0: 10266.7. Samples: 90925524. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 08:08:55,122][635489] Avg episode reward: [(0, '4302.316')] [2023-03-09 08:08:57,536][635776] Updated weights for policy 0, policy_version 177680 (0.0005) [2023-03-09 08:09:00,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10260.8). Total num frames: 90996736. Throughput: 0: 10243.8. Samples: 90986836. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 08:09:00,122][635489] Avg episode reward: [(0, '4327.733')] [2023-03-09 08:09:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000177728_90996736.pth... [2023-03-09 08:09:00,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000177128_90689536.pth [2023-03-09 08:09:01,602][635776] Updated weights for policy 0, policy_version 177760 (0.0004) [2023-03-09 08:09:05,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10308.2, 300 sec: 10260.8). Total num frames: 91049984. Throughput: 0: 10268.3. Samples: 91049176. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 08:09:05,122][635489] Avg episode reward: [(0, '4307.430')] [2023-03-09 08:09:05,474][635776] Updated weights for policy 0, policy_version 177840 (0.0005) [2023-03-09 08:09:09,344][635776] Updated weights for policy 0, policy_version 177920 (0.0005) [2023-03-09 08:09:10,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10246.9). Total num frames: 91099136. Throughput: 0: 10285.5. Samples: 91078720. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 08:09:10,122][635489] Avg episode reward: [(0, '4306.899')] [2023-03-09 08:09:13,278][635776] Updated weights for policy 0, policy_version 178000 (0.0005) [2023-03-09 08:09:15,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10260.8). Total num frames: 91152384. Throughput: 0: 10342.2. Samples: 91142780. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 08:09:15,122][635489] Avg episode reward: [(0, '4253.333')] [2023-03-09 08:09:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000178032_91152384.pth... [2023-03-09 08:09:15,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000177424_90841088.pth [2023-03-09 08:09:17,231][635776] Updated weights for policy 0, policy_version 178080 (0.0004) [2023-03-09 08:09:20,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10308.3, 300 sec: 10288.6). Total num frames: 91205632. Throughput: 0: 10263.1. Samples: 91204280. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 08:09:20,122][635489] Avg episode reward: [(0, '4397.872')] [2023-03-09 08:09:21,329][635776] Updated weights for policy 0, policy_version 178160 (0.0005) [2023-03-09 08:09:25,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10308.3, 300 sec: 10288.6). Total num frames: 91254784. Throughput: 0: 10283.6. Samples: 91234376. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 08:09:25,122][635489] Avg episode reward: [(0, '4150.607')] [2023-03-09 08:09:25,279][635776] Updated weights for policy 0, policy_version 178240 (0.0004) [2023-03-09 08:09:29,351][635776] Updated weights for policy 0, policy_version 178320 (0.0005) [2023-03-09 08:09:30,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10288.6). Total num frames: 91308032. Throughput: 0: 10253.9. Samples: 91294260. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 08:09:30,122][635489] Avg episode reward: [(0, '4477.103')] [2023-03-09 08:09:30,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000178336_91308032.pth... [2023-03-09 08:09:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000177728_90996736.pth [2023-03-09 08:09:33,268][635776] Updated weights for policy 0, policy_version 178400 (0.0005) [2023-03-09 08:09:35,122][635489] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10288.6). Total num frames: 91357184. Throughput: 0: 10303.6. Samples: 91357184. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 08:09:35,122][635489] Avg episode reward: [(0, '4529.918')] [2023-03-09 08:09:37,355][635776] Updated weights for policy 0, policy_version 178480 (0.0005) [2023-03-09 08:09:40,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 10288.6). Total num frames: 91406336. Throughput: 0: 10254.1. Samples: 91386960. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 08:09:40,122][635489] Avg episode reward: [(0, '4461.952')] [2023-03-09 08:09:41,530][635776] Updated weights for policy 0, policy_version 178560 (0.0005) [2023-03-09 08:09:45,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10240.0, 300 sec: 10274.7). Total num frames: 91455488. Throughput: 0: 10226.5. Samples: 91447028. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 08:09:45,122][635489] Avg episode reward: [(0, '4492.207')] [2023-03-09 08:09:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000178624_91455488.pth... [2023-03-09 08:09:45,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000178032_91152384.pth [2023-03-09 08:09:45,691][635776] Updated weights for policy 0, policy_version 178640 (0.0005) [2023-03-09 08:09:49,595][635776] Updated weights for policy 0, policy_version 178720 (0.0004) [2023-03-09 08:09:50,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10302.5). Total num frames: 91508736. Throughput: 0: 10212.4. Samples: 91508736. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 08:09:50,122][635489] Avg episode reward: [(0, '4415.900')] [2023-03-09 08:09:53,341][635776] Updated weights for policy 0, policy_version 178800 (0.0005) [2023-03-09 08:09:55,122][635489] Fps is (10 sec: 10649.7, 60 sec: 10240.0, 300 sec: 10302.5). Total num frames: 91561984. Throughput: 0: 10266.1. Samples: 91540696. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 08:09:55,122][635489] Avg episode reward: [(0, '4494.629')] [2023-03-09 08:09:57,096][635776] Updated weights for policy 0, policy_version 178880 (0.0005) [2023-03-09 08:10:00,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 10302.5). Total num frames: 91615232. Throughput: 0: 10247.7. Samples: 91603928. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 08:10:00,122][635489] Avg episode reward: [(0, '4548.652')] [2023-03-09 08:10:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000178936_91615232.pth... [2023-03-09 08:10:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000178336_91308032.pth [2023-03-09 08:10:01,214][635776] Updated weights for policy 0, policy_version 178960 (0.0005) [2023-03-09 08:10:05,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10288.6). Total num frames: 91664384. Throughput: 0: 10250.6. Samples: 91665556. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 08:10:05,122][635489] Avg episode reward: [(0, '4515.061')] [2023-03-09 08:10:05,202][635776] Updated weights for policy 0, policy_version 179040 (0.0006) [2023-03-09 08:10:09,488][635776] Updated weights for policy 0, policy_version 179120 (0.0005) [2023-03-09 08:10:10,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10240.0, 300 sec: 10288.6). Total num frames: 91713536. Throughput: 0: 10199.8. Samples: 91693368. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 08:10:10,122][635489] Avg episode reward: [(0, '4385.647')] [2023-03-09 08:10:13,144][635776] Updated weights for policy 0, policy_version 179200 (0.0005) [2023-03-09 08:10:15,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10308.2, 300 sec: 10302.5). Total num frames: 91770880. Throughput: 0: 10312.9. Samples: 91758340. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 08:10:15,122][635489] Avg episode reward: [(0, '4425.334')] [2023-03-09 08:10:15,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000179240_91770880.pth... [2023-03-09 08:10:15,129][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000178624_91455488.pth [2023-03-09 08:10:17,192][635776] Updated weights for policy 0, policy_version 179280 (0.0004) [2023-03-09 08:10:20,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10240.0, 300 sec: 10302.5). Total num frames: 91820032. Throughput: 0: 10283.5. Samples: 91819940. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 08:10:20,122][635489] Avg episode reward: [(0, '4514.367')] [2023-03-09 08:10:21,241][635776] Updated weights for policy 0, policy_version 179360 (0.0005) [2023-03-09 08:10:24,968][635776] Updated weights for policy 0, policy_version 179440 (0.0006) [2023-03-09 08:10:25,122][635489] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 10302.5). Total num frames: 91873280. Throughput: 0: 10293.6. Samples: 91850172. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 08:10:25,122][635489] Avg episode reward: [(0, '4511.483')] [2023-03-09 08:10:28,917][635776] Updated weights for policy 0, policy_version 179520 (0.0005) [2023-03-09 08:10:30,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10288.6). Total num frames: 91922432. Throughput: 0: 10380.2. Samples: 91914136. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 08:10:30,122][635489] Avg episode reward: [(0, '4217.053')] [2023-03-09 08:10:30,140][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000179544_91926528.pth... [2023-03-09 08:10:30,142][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000178936_91615232.pth [2023-03-09 08:10:33,011][635776] Updated weights for policy 0, policy_version 179600 (0.0005) [2023-03-09 08:10:35,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 10302.5). Total num frames: 91975680. Throughput: 0: 10348.7. Samples: 91974424. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 08:10:35,122][635489] Avg episode reward: [(0, '4436.572')] [2023-03-09 08:10:37,093][635776] Updated weights for policy 0, policy_version 179680 (0.0005) [2023-03-09 08:10:40,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10308.3, 300 sec: 10288.6). Total num frames: 92024832. Throughput: 0: 10313.6. Samples: 92004808. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 08:10:40,122][635489] Avg episode reward: [(0, '4560.588')] [2023-03-09 08:10:40,984][635776] Updated weights for policy 0, policy_version 179760 (0.0004) [2023-03-09 08:10:45,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10308.3, 300 sec: 10288.6). Total num frames: 92073984. Throughput: 0: 10262.8. Samples: 92065752. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 08:10:45,122][635489] Avg episode reward: [(0, '4480.029')] [2023-03-09 08:10:45,142][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000179840_92078080.pth... [2023-03-09 08:10:45,142][635776] Updated weights for policy 0, policy_version 179840 (0.0004) [2023-03-09 08:10:45,144][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000179240_91770880.pth [2023-03-09 08:10:49,178][635776] Updated weights for policy 0, policy_version 179920 (0.0005) [2023-03-09 08:10:50,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 10288.6). Total num frames: 92127232. Throughput: 0: 10256.7. Samples: 92127108. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 08:10:50,122][635489] Avg episode reward: [(0, '4404.130')] [2023-03-09 08:10:53,251][635776] Updated weights for policy 0, policy_version 180000 (0.0005) [2023-03-09 08:10:55,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10274.7). Total num frames: 92176384. Throughput: 0: 10280.0. Samples: 92155968. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 08:10:55,122][635489] Avg episode reward: [(0, '4501.722')] [2023-03-09 08:10:57,211][635776] Updated weights for policy 0, policy_version 180080 (0.0005) [2023-03-09 08:11:00,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10288.6). Total num frames: 92229632. Throughput: 0: 10202.3. Samples: 92217440. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:11:00,122][635489] Avg episode reward: [(0, '4196.495')] [2023-03-09 08:11:00,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000180136_92229632.pth... [2023-03-09 08:11:00,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000179544_91926528.pth [2023-03-09 08:11:01,250][635776] Updated weights for policy 0, policy_version 180160 (0.0005) [2023-03-09 08:11:04,985][635776] Updated weights for policy 0, policy_version 180240 (0.0005) [2023-03-09 08:11:05,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10308.3, 300 sec: 10302.5). Total num frames: 92282880. Throughput: 0: 10281.1. Samples: 92282588. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:11:05,122][635489] Avg episode reward: [(0, '4571.259')] [2023-03-09 08:11:09,021][635776] Updated weights for policy 0, policy_version 180320 (0.0005) [2023-03-09 08:11:10,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10288.6). Total num frames: 92332032. Throughput: 0: 10279.0. Samples: 92312728. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:11:10,122][635489] Avg episode reward: [(0, '4423.195')] [2023-03-09 08:11:13,205][635776] Updated weights for policy 0, policy_version 180400 (0.0005) [2023-03-09 08:11:15,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10288.6). Total num frames: 92381184. Throughput: 0: 10163.0. Samples: 92371472. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:11:15,122][635489] Avg episode reward: [(0, '4400.906')] [2023-03-09 08:11:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000180432_92381184.pth... [2023-03-09 08:11:15,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000179840_92078080.pth [2023-03-09 08:11:17,421][635776] Updated weights for policy 0, policy_version 180480 (0.0005) [2023-03-09 08:11:20,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10274.7). Total num frames: 92430336. Throughput: 0: 10160.0. Samples: 92431624. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:11:20,122][635489] Avg episode reward: [(0, '4486.294')] [2023-03-09 08:11:21,368][635776] Updated weights for policy 0, policy_version 180560 (0.0005) [2023-03-09 08:11:25,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10288.6). Total num frames: 92483584. Throughput: 0: 10184.5. Samples: 92463112. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:11:25,122][635489] Avg episode reward: [(0, '4532.774')] [2023-03-09 08:11:25,358][635776] Updated weights for policy 0, policy_version 180640 (0.0005) [2023-03-09 08:11:29,564][635776] Updated weights for policy 0, policy_version 180720 (0.0005) [2023-03-09 08:11:30,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 10288.6). Total num frames: 92532736. Throughput: 0: 10124.9. Samples: 92521372. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:11:30,122][635489] Avg episode reward: [(0, '4498.642')] [2023-03-09 08:11:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000180728_92532736.pth... [2023-03-09 08:11:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000180136_92229632.pth [2023-03-09 08:11:33,536][635776] Updated weights for policy 0, policy_version 180800 (0.0005) [2023-03-09 08:11:35,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10171.7, 300 sec: 10288.6). Total num frames: 92585984. Throughput: 0: 10154.5. Samples: 92584060. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:11:35,122][635489] Avg episode reward: [(0, '4364.367')] [2023-03-09 08:11:37,394][635776] Updated weights for policy 0, policy_version 180880 (0.0005) [2023-03-09 08:11:40,121][635489] Fps is (10 sec: 10649.7, 60 sec: 10240.0, 300 sec: 10288.6). Total num frames: 92639232. Throughput: 0: 10230.0. Samples: 92616316. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:11:40,122][635489] Avg episode reward: [(0, '4334.682')] [2023-03-09 08:11:40,895][635776] Updated weights for policy 0, policy_version 180960 (0.0005) [2023-03-09 08:11:44,895][635776] Updated weights for policy 0, policy_version 181040 (0.0005) [2023-03-09 08:11:45,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 10288.6). Total num frames: 92692480. Throughput: 0: 10310.9. Samples: 92681432. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:11:45,122][635489] Avg episode reward: [(0, '4408.772')] [2023-03-09 08:11:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000181040_92692480.pth... [2023-03-09 08:11:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000180432_92381184.pth [2023-03-09 08:11:49,020][635776] Updated weights for policy 0, policy_version 181120 (0.0005) [2023-03-09 08:11:50,122][635489] Fps is (10 sec: 10239.8, 60 sec: 10240.0, 300 sec: 10288.6). Total num frames: 92741632. Throughput: 0: 10207.5. Samples: 92741924. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:11:50,122][635489] Avg episode reward: [(0, '4499.756')] [2023-03-09 08:11:52,975][635776] Updated weights for policy 0, policy_version 181200 (0.0005) [2023-03-09 08:11:55,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10274.7). Total num frames: 92794880. Throughput: 0: 10253.8. Samples: 92774152. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:11:55,122][635489] Avg episode reward: [(0, '4452.240')] [2023-03-09 08:11:57,248][635776] Updated weights for policy 0, policy_version 181280 (0.0004) [2023-03-09 08:12:00,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10274.7). Total num frames: 92844032. Throughput: 0: 10235.0. Samples: 92832048. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:12:00,122][635489] Avg episode reward: [(0, '4495.159')] [2023-03-09 08:12:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000181336_92844032.pth... [2023-03-09 08:12:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000180728_92532736.pth [2023-03-09 08:12:01,044][635776] Updated weights for policy 0, policy_version 181360 (0.0005) [2023-03-09 08:12:05,039][635776] Updated weights for policy 0, policy_version 181440 (0.0005) [2023-03-09 08:12:05,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10288.6). Total num frames: 92897280. Throughput: 0: 10325.3. Samples: 92896264. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 08:12:05,122][635489] Avg episode reward: [(0, '4534.948')] [2023-03-09 08:12:09,151][635776] Updated weights for policy 0, policy_version 181520 (0.0005) [2023-03-09 08:12:10,122][635489] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10274.7). Total num frames: 92946432. Throughput: 0: 10285.3. Samples: 92925952. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 08:12:10,122][635489] Avg episode reward: [(0, '4574.927')] [2023-03-09 08:12:13,089][635776] Updated weights for policy 0, policy_version 181600 (0.0005) [2023-03-09 08:12:15,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10274.7). Total num frames: 92999680. Throughput: 0: 10357.6. Samples: 92987464. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 08:12:15,122][635489] Avg episode reward: [(0, '4578.958')] [2023-03-09 08:12:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000181640_92999680.pth... [2023-03-09 08:12:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000181040_92692480.pth [2023-03-09 08:12:16,856][635776] Updated weights for policy 0, policy_version 181680 (0.0005) [2023-03-09 08:12:20,122][635489] Fps is (10 sec: 10649.7, 60 sec: 10376.5, 300 sec: 10274.7). Total num frames: 93052928. Throughput: 0: 10392.8. Samples: 93051736. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 08:12:20,122][635489] Avg episode reward: [(0, '4536.553')] [2023-03-09 08:12:20,907][635776] Updated weights for policy 0, policy_version 181760 (0.0005) [2023-03-09 08:12:24,978][635776] Updated weights for policy 0, policy_version 181840 (0.0005) [2023-03-09 08:12:25,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10260.8). Total num frames: 93102080. Throughput: 0: 10338.3. Samples: 93081540. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 08:12:25,122][635489] Avg episode reward: [(0, '4569.127')] [2023-03-09 08:12:28,659][635776] Updated weights for policy 0, policy_version 181920 (0.0005) [2023-03-09 08:12:30,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10288.6). Total num frames: 93155328. Throughput: 0: 10323.1. Samples: 93145972. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 08:12:30,122][635489] Avg episode reward: [(0, '4457.305')] [2023-03-09 08:12:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000181944_93155328.pth... [2023-03-09 08:12:30,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000181336_92844032.pth [2023-03-09 08:12:32,577][635776] Updated weights for policy 0, policy_version 182000 (0.0005) [2023-03-09 08:12:35,122][635489] Fps is (10 sec: 11059.2, 60 sec: 10444.8, 300 sec: 10302.5). Total num frames: 93212672. Throughput: 0: 10394.1. Samples: 93209660. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 08:12:35,122][635489] Avg episode reward: [(0, '4579.950')] [2023-03-09 08:12:36,164][635776] Updated weights for policy 0, policy_version 182080 (0.0004) [2023-03-09 08:12:40,121][635489] Fps is (10 sec: 10649.7, 60 sec: 10376.5, 300 sec: 10288.6). Total num frames: 93261824. Throughput: 0: 10420.1. Samples: 93243056. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 08:12:40,122][635489] Avg episode reward: [(0, '4541.321')] [2023-03-09 08:12:40,235][635776] Updated weights for policy 0, policy_version 182160 (0.0004) [2023-03-09 08:12:43,899][635776] Updated weights for policy 0, policy_version 182240 (0.0004) [2023-03-09 08:12:45,122][635489] Fps is (10 sec: 10239.8, 60 sec: 10376.5, 300 sec: 10302.5). Total num frames: 93315072. Throughput: 0: 10551.8. Samples: 93306880. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 08:12:45,123][635489] Avg episode reward: [(0, '4583.868')] [2023-03-09 08:12:45,152][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000182264_93319168.pth... [2023-03-09 08:12:45,154][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000181640_92999680.pth [2023-03-09 08:12:48,050][635776] Updated weights for policy 0, policy_version 182320 (0.0005) [2023-03-09 08:12:50,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10302.5). Total num frames: 93368320. Throughput: 0: 10467.6. Samples: 93367304. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 08:12:50,122][635489] Avg episode reward: [(0, '4555.367')] [2023-03-09 08:12:52,084][635776] Updated weights for policy 0, policy_version 182400 (0.0004) [2023-03-09 08:12:55,122][635489] Fps is (10 sec: 10240.2, 60 sec: 10376.5, 300 sec: 10288.6). Total num frames: 93417472. Throughput: 0: 10467.6. Samples: 93396992. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 08:12:55,122][635489] Avg episode reward: [(0, '4557.244')] [2023-03-09 08:12:56,078][635776] Updated weights for policy 0, policy_version 182480 (0.0005) [2023-03-09 08:13:00,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10376.5, 300 sec: 10288.6). Total num frames: 93466624. Throughput: 0: 10433.3. Samples: 93456964. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 08:13:00,122][635489] Avg episode reward: [(0, '4557.928')] [2023-03-09 08:13:00,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000182552_93466624.pth... [2023-03-09 08:13:00,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000181944_93155328.pth [2023-03-09 08:13:00,253][635776] Updated weights for policy 0, policy_version 182560 (0.0005) [2023-03-09 08:13:04,208][635776] Updated weights for policy 0, policy_version 182640 (0.0004) [2023-03-09 08:13:05,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10288.6). Total num frames: 93519872. Throughput: 0: 10388.4. Samples: 93519216. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-09 08:13:05,122][635489] Avg episode reward: [(0, '4558.528')] [2023-03-09 08:13:08,465][635776] Updated weights for policy 0, policy_version 182720 (0.0005) [2023-03-09 08:13:10,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10274.7). Total num frames: 93569024. Throughput: 0: 10373.0. Samples: 93548324. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:13:10,122][635489] Avg episode reward: [(0, '4595.051')] [2023-03-09 08:13:12,398][635776] Updated weights for policy 0, policy_version 182800 (0.0005) [2023-03-09 08:13:15,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10308.3, 300 sec: 10274.7). Total num frames: 93618176. Throughput: 0: 10293.1. Samples: 93609164. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:13:15,122][635489] Avg episode reward: [(0, '4568.516')] [2023-03-09 08:13:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000182848_93618176.pth... [2023-03-09 08:13:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000182264_93319168.pth [2023-03-09 08:13:16,549][635776] Updated weights for policy 0, policy_version 182880 (0.0005) [2023-03-09 08:13:20,122][635489] Fps is (10 sec: 10239.8, 60 sec: 10308.2, 300 sec: 10288.6). Total num frames: 93671424. Throughput: 0: 10245.4. Samples: 93670704. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:13:20,122][635489] Avg episode reward: [(0, '4607.723')] [2023-03-09 08:13:20,396][635776] Updated weights for policy 0, policy_version 182960 (0.0005) [2023-03-09 08:13:24,322][635776] Updated weights for policy 0, policy_version 183040 (0.0005) [2023-03-09 08:13:25,121][635489] Fps is (10 sec: 10649.8, 60 sec: 10376.6, 300 sec: 10288.6). Total num frames: 93724672. Throughput: 0: 10192.5. Samples: 93701720. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:13:25,122][635489] Avg episode reward: [(0, '4606.928')] [2023-03-09 08:13:28,204][635776] Updated weights for policy 0, policy_version 183120 (0.0005) [2023-03-09 08:13:30,122][635489] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 10274.7). Total num frames: 93773824. Throughput: 0: 10189.7. Samples: 93765416. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:13:30,122][635489] Avg episode reward: [(0, '4607.461')] [2023-03-09 08:13:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000183152_93773824.pth... [2023-03-09 08:13:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000182552_93466624.pth [2023-03-09 08:13:31,899][635776] Updated weights for policy 0, policy_version 183200 (0.0005) [2023-03-09 08:13:35,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10308.3, 300 sec: 10302.5). Total num frames: 93831168. Throughput: 0: 10273.6. Samples: 93829616. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:13:35,122][635489] Avg episode reward: [(0, '4614.336')] [2023-03-09 08:13:35,837][635776] Updated weights for policy 0, policy_version 183280 (0.0004) [2023-03-09 08:13:39,917][635776] Updated weights for policy 0, policy_version 183360 (0.0004) [2023-03-09 08:13:40,121][635489] Fps is (10 sec: 10649.7, 60 sec: 10308.3, 300 sec: 10302.5). Total num frames: 93880320. Throughput: 0: 10287.1. Samples: 93859912. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:13:40,122][635489] Avg episode reward: [(0, '4621.162')] [2023-03-09 08:13:40,122][635732] Saving new best policy, reward=4621.162! [2023-03-09 08:13:43,855][635776] Updated weights for policy 0, policy_version 183440 (0.0005) [2023-03-09 08:13:45,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10308.3, 300 sec: 10302.5). Total num frames: 93933568. Throughput: 0: 10319.7. Samples: 93921352. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:13:45,122][635489] Avg episode reward: [(0, '4583.217')] [2023-03-09 08:13:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000183464_93933568.pth... [2023-03-09 08:13:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000182848_93618176.pth [2023-03-09 08:13:47,634][635776] Updated weights for policy 0, policy_version 183520 (0.0005) [2023-03-09 08:13:50,121][635489] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 10302.5). Total num frames: 93986816. Throughput: 0: 10391.1. Samples: 93986816. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:13:50,122][635489] Avg episode reward: [(0, '4623.841')] [2023-03-09 08:13:50,122][635732] Saving new best policy, reward=4623.841! [2023-03-09 08:13:51,435][635776] Updated weights for policy 0, policy_version 183600 (0.0005) [2023-03-09 08:13:55,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10316.4). Total num frames: 94040064. Throughput: 0: 10476.3. Samples: 94019760. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:13:55,123][635489] Avg episode reward: [(0, '4580.121')] [2023-03-09 08:13:55,330][635776] Updated weights for policy 0, policy_version 183680 (0.0005) [2023-03-09 08:13:59,319][635776] Updated weights for policy 0, policy_version 183760 (0.0005) [2023-03-09 08:14:00,122][635489] Fps is (10 sec: 10649.4, 60 sec: 10444.8, 300 sec: 10316.4). Total num frames: 94093312. Throughput: 0: 10485.8. Samples: 94081024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:14:00,122][635489] Avg episode reward: [(0, '4539.278')] [2023-03-09 08:14:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000183776_94093312.pth... [2023-03-09 08:14:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000183152_93773824.pth [2023-03-09 08:14:03,277][635776] Updated weights for policy 0, policy_version 183840 (0.0005) [2023-03-09 08:14:05,122][635489] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10316.4). Total num frames: 94142464. Throughput: 0: 10483.8. Samples: 94142472. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:14:05,122][635489] Avg episode reward: [(0, '4583.814')] [2023-03-09 08:14:07,356][635776] Updated weights for policy 0, policy_version 183920 (0.0005) [2023-03-09 08:14:10,121][635489] Fps is (10 sec: 9830.6, 60 sec: 10376.5, 300 sec: 10302.5). Total num frames: 94191616. Throughput: 0: 10441.8. Samples: 94171600. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:14:10,122][635489] Avg episode reward: [(0, '4318.226')] [2023-03-09 08:14:11,457][635776] Updated weights for policy 0, policy_version 184000 (0.0005) [2023-03-09 08:14:15,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10376.5, 300 sec: 10288.6). Total num frames: 94240768. Throughput: 0: 10371.4. Samples: 94232128. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 08:14:15,122][635489] Avg episode reward: [(0, '4618.292')] [2023-03-09 08:14:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000184064_94240768.pth... [2023-03-09 08:14:15,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000183464_93933568.pth [2023-03-09 08:14:15,612][635776] Updated weights for policy 0, policy_version 184080 (0.0005) [2023-03-09 08:14:19,642][635776] Updated weights for policy 0, policy_version 184160 (0.0005) [2023-03-09 08:14:20,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10376.6, 300 sec: 10302.5). Total num frames: 94294016. Throughput: 0: 10295.1. Samples: 94292896. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 08:14:20,122][635489] Avg episode reward: [(0, '4511.777')] [2023-03-09 08:14:23,390][635776] Updated weights for policy 0, policy_version 184240 (0.0005) [2023-03-09 08:14:25,122][635489] Fps is (10 sec: 10649.7, 60 sec: 10376.5, 300 sec: 10302.5). Total num frames: 94347264. Throughput: 0: 10353.6. Samples: 94325824. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 08:14:25,122][635489] Avg episode reward: [(0, '4616.004')] [2023-03-09 08:14:27,551][635776] Updated weights for policy 0, policy_version 184320 (0.0005) [2023-03-09 08:14:30,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 10302.5). Total num frames: 94396416. Throughput: 0: 10289.5. Samples: 94384380. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 08:14:30,122][635489] Avg episode reward: [(0, '4521.841')] [2023-03-09 08:14:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000184368_94396416.pth... [2023-03-09 08:14:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000183776_94093312.pth [2023-03-09 08:14:31,683][635776] Updated weights for policy 0, policy_version 184400 (0.0005) [2023-03-09 08:14:35,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 10302.5). Total num frames: 94445568. Throughput: 0: 10165.8. Samples: 94444276. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 08:14:35,122][635489] Avg episode reward: [(0, '4550.254')] [2023-03-09 08:14:35,854][635776] Updated weights for policy 0, policy_version 184480 (0.0004) [2023-03-09 08:14:39,964][635776] Updated weights for policy 0, policy_version 184560 (0.0005) [2023-03-09 08:14:40,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10240.0, 300 sec: 10302.5). Total num frames: 94494720. Throughput: 0: 10094.8. Samples: 94474024. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 08:14:40,122][635489] Avg episode reward: [(0, '4617.018')] [2023-03-09 08:14:43,953][635776] Updated weights for policy 0, policy_version 184640 (0.0005) [2023-03-09 08:14:45,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10288.6). Total num frames: 94543872. Throughput: 0: 10101.3. Samples: 94535580. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 08:14:45,122][635489] Avg episode reward: [(0, '4604.032')] [2023-03-09 08:14:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000184664_94547968.pth... [2023-03-09 08:14:45,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000184064_94240768.pth [2023-03-09 08:14:47,836][635776] Updated weights for policy 0, policy_version 184720 (0.0005) [2023-03-09 08:14:50,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 10288.6). Total num frames: 94597120. Throughput: 0: 10099.6. Samples: 94596956. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 08:14:50,122][635489] Avg episode reward: [(0, '4608.764')] [2023-03-09 08:14:51,921][635776] Updated weights for policy 0, policy_version 184800 (0.0005) [2023-03-09 08:14:55,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10171.7, 300 sec: 10288.6). Total num frames: 94650368. Throughput: 0: 10111.4. Samples: 94626612. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 08:14:55,122][635489] Avg episode reward: [(0, '4605.806')] [2023-03-09 08:14:55,900][635776] Updated weights for policy 0, policy_version 184880 (0.0005) [2023-03-09 08:14:59,921][635776] Updated weights for policy 0, policy_version 184960 (0.0005) [2023-03-09 08:15:00,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10288.6). Total num frames: 94699520. Throughput: 0: 10197.3. Samples: 94691004. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 08:15:00,122][635489] Avg episode reward: [(0, '4609.060')] [2023-03-09 08:15:00,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000184960_94699520.pth... [2023-03-09 08:15:00,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000184368_94396416.pth [2023-03-09 08:15:03,894][635776] Updated weights for policy 0, policy_version 185040 (0.0005) [2023-03-09 08:15:05,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10302.5). Total num frames: 94752768. Throughput: 0: 10189.0. Samples: 94751400. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 08:15:05,122][635489] Avg episode reward: [(0, '4606.622')] [2023-03-09 08:15:07,903][635776] Updated weights for policy 0, policy_version 185120 (0.0004) [2023-03-09 08:15:10,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10274.7). Total num frames: 94801920. Throughput: 0: 10124.4. Samples: 94781424. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 08:15:10,122][635489] Avg episode reward: [(0, '4541.633')] [2023-03-09 08:15:11,747][635776] Updated weights for policy 0, policy_version 185200 (0.0005) [2023-03-09 08:15:15,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10274.7). Total num frames: 94851072. Throughput: 0: 10185.7. Samples: 94842736. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-09 08:15:15,122][635489] Avg episode reward: [(0, '4533.083')] [2023-03-09 08:15:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000185256_94851072.pth... [2023-03-09 08:15:15,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000184664_94547968.pth [2023-03-09 08:15:15,974][635776] Updated weights for policy 0, policy_version 185280 (0.0005) [2023-03-09 08:15:19,785][635776] Updated weights for policy 0, policy_version 185360 (0.0005) [2023-03-09 08:15:20,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10274.7). Total num frames: 94904320. Throughput: 0: 10242.1. Samples: 94905172. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:15:20,122][635489] Avg episode reward: [(0, '4526.909')] [2023-03-09 08:15:23,660][635776] Updated weights for policy 0, policy_version 185440 (0.0005) [2023-03-09 08:15:25,121][635489] Fps is (10 sec: 10649.8, 60 sec: 10171.7, 300 sec: 10288.6). Total num frames: 94957568. Throughput: 0: 10309.7. Samples: 94937960. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:15:25,122][635489] Avg episode reward: [(0, '4614.061')] [2023-03-09 08:15:27,346][635776] Updated weights for policy 0, policy_version 185520 (0.0005) [2023-03-09 08:15:30,122][635489] Fps is (10 sec: 11059.1, 60 sec: 10308.3, 300 sec: 10302.5). Total num frames: 95014912. Throughput: 0: 10378.9. Samples: 95002632. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:15:30,122][635489] Avg episode reward: [(0, '4490.123')] [2023-03-09 08:15:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000185576_95014912.pth... [2023-03-09 08:15:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000184960_94699520.pth [2023-03-09 08:15:31,153][635776] Updated weights for policy 0, policy_version 185600 (0.0005) [2023-03-09 08:15:35,121][635489] Fps is (10 sec: 10649.5, 60 sec: 10308.3, 300 sec: 10302.5). Total num frames: 95064064. Throughput: 0: 10416.5. Samples: 95065700. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:15:35,122][635489] Avg episode reward: [(0, '4613.314')] [2023-03-09 08:15:35,132][635776] Updated weights for policy 0, policy_version 185680 (0.0005) [2023-03-09 08:15:39,106][635776] Updated weights for policy 0, policy_version 185760 (0.0005) [2023-03-09 08:15:40,121][635489] Fps is (10 sec: 10240.2, 60 sec: 10376.5, 300 sec: 10316.4). Total num frames: 95117312. Throughput: 0: 10451.1. Samples: 95096912. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:15:40,122][635489] Avg episode reward: [(0, '4569.884')] [2023-03-09 08:15:42,716][635776] Updated weights for policy 0, policy_version 185840 (0.0005) [2023-03-09 08:15:45,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10316.4). Total num frames: 95170560. Throughput: 0: 10460.2. Samples: 95161712. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:15:45,122][635489] Avg episode reward: [(0, '4514.839')] [2023-03-09 08:15:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000185880_95170560.pth... [2023-03-09 08:15:45,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000185256_94851072.pth [2023-03-09 08:15:46,808][635776] Updated weights for policy 0, policy_version 185920 (0.0005) [2023-03-09 08:15:50,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10330.2). Total num frames: 95223808. Throughput: 0: 10483.0. Samples: 95223136. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:15:50,122][635489] Avg episode reward: [(0, '4569.412')] [2023-03-09 08:15:50,901][635776] Updated weights for policy 0, policy_version 186000 (0.0005) [2023-03-09 08:15:54,845][635776] Updated weights for policy 0, policy_version 186080 (0.0005) [2023-03-09 08:15:55,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10316.4). Total num frames: 95272960. Throughput: 0: 10468.1. Samples: 95252488. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:15:55,122][635489] Avg episode reward: [(0, '4598.510')] [2023-03-09 08:15:58,630][635776] Updated weights for policy 0, policy_version 186160 (0.0005) [2023-03-09 08:16:00,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10444.8, 300 sec: 10316.4). Total num frames: 95326208. Throughput: 0: 10540.7. Samples: 95317068. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:16:00,122][635489] Avg episode reward: [(0, '4613.067')] [2023-03-09 08:16:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000186184_95326208.pth... [2023-03-09 08:16:00,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000185576_95014912.pth [2023-03-09 08:16:02,543][635776] Updated weights for policy 0, policy_version 186240 (0.0005) [2023-03-09 08:16:05,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10330.2). Total num frames: 95379456. Throughput: 0: 10535.7. Samples: 95379280. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:16:05,122][635489] Avg episode reward: [(0, '4608.682')] [2023-03-09 08:16:06,640][635776] Updated weights for policy 0, policy_version 186320 (0.0006) [2023-03-09 08:16:10,122][635489] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10330.2). Total num frames: 95428608. Throughput: 0: 10449.6. Samples: 95408192. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:16:10,122][635489] Avg episode reward: [(0, '4518.510')] [2023-03-09 08:16:10,898][635776] Updated weights for policy 0, policy_version 186400 (0.0005) [2023-03-09 08:16:15,043][635776] Updated weights for policy 0, policy_version 186480 (0.0005) [2023-03-09 08:16:15,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10444.8, 300 sec: 10330.2). Total num frames: 95477760. Throughput: 0: 10289.1. Samples: 95465640. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:16:15,122][635489] Avg episode reward: [(0, '4613.119')] [2023-03-09 08:16:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000186480_95477760.pth... [2023-03-09 08:16:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000185880_95170560.pth [2023-03-09 08:16:18,907][635776] Updated weights for policy 0, policy_version 186560 (0.0005) [2023-03-09 08:16:20,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10376.5, 300 sec: 10316.4). Total num frames: 95526912. Throughput: 0: 10276.6. Samples: 95528148. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:16:20,122][635489] Avg episode reward: [(0, '4583.410')] [2023-03-09 08:16:23,059][635776] Updated weights for policy 0, policy_version 186640 (0.0005) [2023-03-09 08:16:25,122][635489] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10330.3). Total num frames: 95580160. Throughput: 0: 10273.2. Samples: 95559208. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:16:25,122][635489] Avg episode reward: [(0, '4611.767')] [2023-03-09 08:16:27,138][635776] Updated weights for policy 0, policy_version 186720 (0.0005) [2023-03-09 08:16:30,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10316.4). Total num frames: 95629312. Throughput: 0: 10119.5. Samples: 95617088. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:16:30,122][635489] Avg episode reward: [(0, '4629.677')] [2023-03-09 08:16:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000186776_95629312.pth... [2023-03-09 08:16:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000186184_95326208.pth [2023-03-09 08:16:30,128][635732] Saving new best policy, reward=4629.677! [2023-03-09 08:16:31,343][635776] Updated weights for policy 0, policy_version 186800 (0.0004) [2023-03-09 08:16:35,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 10302.5). Total num frames: 95678464. Throughput: 0: 10118.4. Samples: 95678464. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:16:35,122][635489] Avg episode reward: [(0, '4629.421')] [2023-03-09 08:16:35,331][635776] Updated weights for policy 0, policy_version 186880 (0.0006) [2023-03-09 08:16:39,586][635776] Updated weights for policy 0, policy_version 186960 (0.0004) [2023-03-09 08:16:40,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10288.6). Total num frames: 95727616. Throughput: 0: 10103.5. Samples: 95707148. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:16:40,122][635489] Avg episode reward: [(0, '4622.469')] [2023-03-09 08:16:43,917][635776] Updated weights for policy 0, policy_version 187040 (0.0004) [2023-03-09 08:16:45,122][635489] Fps is (10 sec: 9420.7, 60 sec: 10035.2, 300 sec: 10274.7). Total num frames: 95772672. Throughput: 0: 9941.1. Samples: 95764416. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:16:45,122][635489] Avg episode reward: [(0, '4595.112')] [2023-03-09 08:16:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000187056_95772672.pth... [2023-03-09 08:16:45,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000186480_95477760.pth [2023-03-09 08:16:48,057][635776] Updated weights for policy 0, policy_version 187120 (0.0005) [2023-03-09 08:16:50,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10274.7). Total num frames: 95825920. Throughput: 0: 9881.0. Samples: 95823924. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:16:50,122][635489] Avg episode reward: [(0, '4618.987')] [2023-03-09 08:16:51,873][635776] Updated weights for policy 0, policy_version 187200 (0.0005) [2023-03-09 08:16:55,122][635489] Fps is (10 sec: 10240.1, 60 sec: 10035.2, 300 sec: 10274.7). Total num frames: 95875072. Throughput: 0: 9966.9. Samples: 95856704. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:16:55,122][635489] Avg episode reward: [(0, '4611.606')] [2023-03-09 08:16:55,957][635776] Updated weights for policy 0, policy_version 187280 (0.0005) [2023-03-09 08:16:59,955][635776] Updated weights for policy 0, policy_version 187360 (0.0004) [2023-03-09 08:17:00,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10274.7). Total num frames: 95928320. Throughput: 0: 10038.5. Samples: 95917372. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:17:00,122][635489] Avg episode reward: [(0, '4580.044')] [2023-03-09 08:17:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000187360_95928320.pth... [2023-03-09 08:17:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000186776_95629312.pth [2023-03-09 08:17:04,220][635776] Updated weights for policy 0, policy_version 187440 (0.0004) [2023-03-09 08:17:05,122][635489] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 10274.7). Total num frames: 95977472. Throughput: 0: 9950.8. Samples: 95975936. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:17:05,122][635489] Avg episode reward: [(0, '4428.080')] [2023-03-09 08:17:08,470][635776] Updated weights for policy 0, policy_version 187520 (0.0006) [2023-03-09 08:17:10,122][635489] Fps is (10 sec: 9420.8, 60 sec: 9898.7, 300 sec: 10246.9). Total num frames: 96022528. Throughput: 0: 9916.3. Samples: 96005444. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:17:10,122][635489] Avg episode reward: [(0, '4587.254')] [2023-03-09 08:17:12,617][635776] Updated weights for policy 0, policy_version 187600 (0.0005) [2023-03-09 08:17:15,122][635489] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 10246.9). Total num frames: 96075776. Throughput: 0: 9974.5. Samples: 96065940. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:17:15,122][635489] Avg episode reward: [(0, '4610.629')] [2023-03-09 08:17:15,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000187648_96075776.pth... [2023-03-09 08:17:15,129][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000187056_95772672.pth [2023-03-09 08:17:16,268][635776] Updated weights for policy 0, policy_version 187680 (0.0005) [2023-03-09 08:17:20,121][635489] Fps is (10 sec: 10649.8, 60 sec: 10035.2, 300 sec: 10260.8). Total num frames: 96129024. Throughput: 0: 10013.9. Samples: 96129088. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:17:20,122][635489] Avg episode reward: [(0, '4564.908')] [2023-03-09 08:17:20,259][635776] Updated weights for policy 0, policy_version 187760 (0.0005) [2023-03-09 08:17:23,917][635776] Updated weights for policy 0, policy_version 187840 (0.0005) [2023-03-09 08:17:25,121][635489] Fps is (10 sec: 10649.7, 60 sec: 10035.2, 300 sec: 10260.8). Total num frames: 96182272. Throughput: 0: 10156.3. Samples: 96164180. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:17:25,122][635489] Avg episode reward: [(0, '4549.964')] [2023-03-09 08:17:27,986][635776] Updated weights for policy 0, policy_version 187920 (0.0005) [2023-03-09 08:17:30,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10103.5, 300 sec: 10246.9). Total num frames: 96235520. Throughput: 0: 10196.2. Samples: 96223244. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:17:30,122][635489] Avg episode reward: [(0, '4599.115')] [2023-03-09 08:17:30,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000187960_96235520.pth... [2023-03-09 08:17:30,129][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000187360_95928320.pth [2023-03-09 08:17:32,011][635776] Updated weights for policy 0, policy_version 188000 (0.0005) [2023-03-09 08:17:35,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10171.7, 300 sec: 10260.8). Total num frames: 96288768. Throughput: 0: 10321.5. Samples: 96288392. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 08:17:35,122][635489] Avg episode reward: [(0, '4617.020')] [2023-03-09 08:17:35,837][635776] Updated weights for policy 0, policy_version 188080 (0.0005) [2023-03-09 08:17:39,831][635776] Updated weights for policy 0, policy_version 188160 (0.0005) [2023-03-09 08:17:40,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10171.8, 300 sec: 10247.0). Total num frames: 96337920. Throughput: 0: 10268.6. Samples: 96318792. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 08:17:40,122][635489] Avg episode reward: [(0, '4466.574')] [2023-03-09 08:17:43,837][635776] Updated weights for policy 0, policy_version 188240 (0.0005) [2023-03-09 08:17:45,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10246.9). Total num frames: 96391168. Throughput: 0: 10259.8. Samples: 96379064. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 08:17:45,122][635489] Avg episode reward: [(0, '4625.165')] [2023-03-09 08:17:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000188264_96391168.pth... [2023-03-09 08:17:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000187648_96075776.pth [2023-03-09 08:17:47,977][635776] Updated weights for policy 0, policy_version 188320 (0.0005) [2023-03-09 08:17:50,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10233.1). Total num frames: 96436224. Throughput: 0: 10263.3. Samples: 96437784. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 08:17:50,122][635489] Avg episode reward: [(0, '4588.459')] [2023-03-09 08:17:52,323][635776] Updated weights for policy 0, policy_version 188400 (0.0005) [2023-03-09 08:17:55,121][635489] Fps is (10 sec: 9420.9, 60 sec: 10171.7, 300 sec: 10233.1). Total num frames: 96485376. Throughput: 0: 10234.9. Samples: 96466012. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 08:17:55,122][635489] Avg episode reward: [(0, '4504.448')] [2023-03-09 08:17:56,400][635776] Updated weights for policy 0, policy_version 188480 (0.0005) [2023-03-09 08:18:00,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10219.2). Total num frames: 96534528. Throughput: 0: 10231.2. Samples: 96526344. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 08:18:00,122][635489] Avg episode reward: [(0, '4381.364')] [2023-03-09 08:18:00,160][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000188552_96538624.pth... [2023-03-09 08:18:00,161][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000187960_96235520.pth [2023-03-09 08:18:00,571][635776] Updated weights for policy 0, policy_version 188560 (0.0006) [2023-03-09 08:18:04,534][635776] Updated weights for policy 0, policy_version 188640 (0.0006) [2023-03-09 08:18:05,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10233.1). Total num frames: 96587776. Throughput: 0: 10194.6. Samples: 96587848. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 08:18:05,122][635489] Avg episode reward: [(0, '4534.117')] [2023-03-09 08:18:08,645][635776] Updated weights for policy 0, policy_version 188720 (0.0005) [2023-03-09 08:18:10,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10233.1). Total num frames: 96636928. Throughput: 0: 10080.3. Samples: 96617796. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 08:18:10,122][635489] Avg episode reward: [(0, '4609.319')] [2023-03-09 08:18:12,765][635776] Updated weights for policy 0, policy_version 188800 (0.0006) [2023-03-09 08:18:15,121][635489] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10219.2). Total num frames: 96686080. Throughput: 0: 10087.6. Samples: 96677184. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 08:18:15,122][635489] Avg episode reward: [(0, '4616.511')] [2023-03-09 08:18:15,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000188840_96686080.pth... [2023-03-09 08:18:15,125][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000188264_96391168.pth [2023-03-09 08:18:16,742][635776] Updated weights for policy 0, policy_version 188880 (0.0005) [2023-03-09 08:18:20,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10219.2). Total num frames: 96739328. Throughput: 0: 10000.6. Samples: 96738420. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 08:18:20,122][635489] Avg episode reward: [(0, '4624.662')] [2023-03-09 08:18:20,766][635776] Updated weights for policy 0, policy_version 188960 (0.0004) [2023-03-09 08:18:24,683][635776] Updated weights for policy 0, policy_version 189040 (0.0005) [2023-03-09 08:18:25,121][635489] Fps is (10 sec: 10649.6, 60 sec: 10171.7, 300 sec: 10233.1). Total num frames: 96792576. Throughput: 0: 10049.4. Samples: 96771016. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 08:18:25,122][635489] Avg episode reward: [(0, '4584.684')] [2023-03-09 08:18:28,919][635776] Updated weights for policy 0, policy_version 189120 (0.0005) [2023-03-09 08:18:30,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10035.2, 300 sec: 10191.4). Total num frames: 96837632. Throughput: 0: 10008.7. Samples: 96829456. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 08:18:30,122][635489] Avg episode reward: [(0, '4579.782')] [2023-03-09 08:18:30,129][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000189144_96841728.pth... [2023-03-09 08:18:30,131][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000188552_96538624.pth [2023-03-09 08:18:33,047][635776] Updated weights for policy 0, policy_version 189200 (0.0005) [2023-03-09 08:18:35,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10205.3). Total num frames: 96890880. Throughput: 0: 10044.3. Samples: 96889780. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 08:18:35,122][635489] Avg episode reward: [(0, '4448.342')] [2023-03-09 08:18:36,884][635776] Updated weights for policy 0, policy_version 189280 (0.0005) [2023-03-09 08:18:40,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10103.4, 300 sec: 10205.3). Total num frames: 96944128. Throughput: 0: 10101.1. Samples: 96920564. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 08:18:40,122][635489] Avg episode reward: [(0, '4620.808')] [2023-03-09 08:18:40,896][635776] Updated weights for policy 0, policy_version 189360 (0.0005) [2023-03-09 08:18:44,685][635776] Updated weights for policy 0, policy_version 189440 (0.0004) [2023-03-09 08:18:45,122][635489] Fps is (10 sec: 10649.4, 60 sec: 10103.4, 300 sec: 10205.3). Total num frames: 96997376. Throughput: 0: 10193.4. Samples: 96985048. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 08:18:45,122][635489] Avg episode reward: [(0, '4619.549')] [2023-03-09 08:18:45,127][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000189448_96997376.pth... [2023-03-09 08:18:45,130][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000188840_96686080.pth [2023-03-09 08:18:48,758][635776] Updated weights for policy 0, policy_version 189520 (0.0004) [2023-03-09 08:18:50,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10171.7, 300 sec: 10191.4). Total num frames: 97046528. Throughput: 0: 10161.0. Samples: 97045092. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 08:18:50,122][635489] Avg episode reward: [(0, '4627.248')] [2023-03-09 08:18:52,998][635776] Updated weights for policy 0, policy_version 189600 (0.0005) [2023-03-09 08:18:55,121][635489] Fps is (10 sec: 9830.6, 60 sec: 10171.7, 300 sec: 10177.5). Total num frames: 97095680. Throughput: 0: 10155.0. Samples: 97074772. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 08:18:55,122][635489] Avg episode reward: [(0, '4627.945')] [2023-03-09 08:18:57,167][635776] Updated weights for policy 0, policy_version 189680 (0.0005) [2023-03-09 08:19:00,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10171.7, 300 sec: 10177.5). Total num frames: 97144832. Throughput: 0: 10135.4. Samples: 97133276. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 08:19:00,122][635489] Avg episode reward: [(0, '4630.521')] [2023-03-09 08:19:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000189736_97144832.pth... [2023-03-09 08:19:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000189144_96841728.pth [2023-03-09 08:19:00,128][635732] Saving new best policy, reward=4630.521! [2023-03-09 08:19:00,977][635776] Updated weights for policy 0, policy_version 189760 (0.0005) [2023-03-09 08:19:04,902][635776] Updated weights for policy 0, policy_version 189840 (0.0005) [2023-03-09 08:19:05,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 10191.4). Total num frames: 97198080. Throughput: 0: 10214.8. Samples: 97198088. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 08:19:05,122][635489] Avg episode reward: [(0, '4620.502')] [2023-03-09 08:19:08,994][635776] Updated weights for policy 0, policy_version 189920 (0.0005) [2023-03-09 08:19:10,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10240.0, 300 sec: 10205.3). Total num frames: 97251328. Throughput: 0: 10164.1. Samples: 97228400. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 08:19:10,122][635489] Avg episode reward: [(0, '4627.731')] [2023-03-09 08:19:12,902][635776] Updated weights for policy 0, policy_version 190000 (0.0004) [2023-03-09 08:19:15,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10191.4). Total num frames: 97300480. Throughput: 0: 10254.5. Samples: 97290908. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 08:19:15,122][635489] Avg episode reward: [(0, '4629.520')] [2023-03-09 08:19:15,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000190040_97300480.pth... [2023-03-09 08:19:15,125][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000189448_96997376.pth [2023-03-09 08:19:16,796][635776] Updated weights for policy 0, policy_version 190080 (0.0005) [2023-03-09 08:19:20,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10191.4). Total num frames: 97353728. Throughput: 0: 10257.2. Samples: 97351356. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 08:19:20,122][635489] Avg episode reward: [(0, '4632.656')] [2023-03-09 08:19:20,123][635732] Saving new best policy, reward=4632.656! [2023-03-09 08:19:20,917][635776] Updated weights for policy 0, policy_version 190160 (0.0005) [2023-03-09 08:19:25,018][635776] Updated weights for policy 0, policy_version 190240 (0.0005) [2023-03-09 08:19:25,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 10191.4). Total num frames: 97402880. Throughput: 0: 10255.5. Samples: 97382060. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 08:19:25,122][635489] Avg episode reward: [(0, '4620.768')] [2023-03-09 08:19:28,955][635776] Updated weights for policy 0, policy_version 190320 (0.0005) [2023-03-09 08:19:30,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10308.2, 300 sec: 10205.3). Total num frames: 97456128. Throughput: 0: 10185.7. Samples: 97443404. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 08:19:30,123][635489] Avg episode reward: [(0, '4630.444')] [2023-03-09 08:19:30,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000190344_97456128.pth... [2023-03-09 08:19:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000189736_97144832.pth [2023-03-09 08:19:32,597][635776] Updated weights for policy 0, policy_version 190400 (0.0005) [2023-03-09 08:19:35,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10205.3). Total num frames: 97505280. Throughput: 0: 10269.4. Samples: 97507216. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 08:19:35,122][635489] Avg episode reward: [(0, '4614.867')] [2023-03-09 08:19:36,769][635776] Updated weights for policy 0, policy_version 190480 (0.0005) [2023-03-09 08:19:40,121][635489] Fps is (10 sec: 10240.2, 60 sec: 10240.0, 300 sec: 10219.2). Total num frames: 97558528. Throughput: 0: 10292.6. Samples: 97537940. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-09 08:19:40,132][635489] Avg episode reward: [(0, '4628.089')] [2023-03-09 08:19:40,712][635776] Updated weights for policy 0, policy_version 190560 (0.0005) [2023-03-09 08:19:44,559][635776] Updated weights for policy 0, policy_version 190640 (0.0004) [2023-03-09 08:19:45,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10240.0, 300 sec: 10219.2). Total num frames: 97611776. Throughput: 0: 10385.6. Samples: 97600628. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 08:19:45,133][635489] Avg episode reward: [(0, '4630.077')] [2023-03-09 08:19:45,138][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000190648_97611776.pth... [2023-03-09 08:19:45,141][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000190040_97300480.pth [2023-03-09 08:19:48,355][635776] Updated weights for policy 0, policy_version 190720 (0.0005) [2023-03-09 08:19:50,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10308.2, 300 sec: 10219.2). Total num frames: 97665024. Throughput: 0: 10382.8. Samples: 97665312. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 08:19:50,132][635489] Avg episode reward: [(0, '4588.430')] [2023-03-09 08:19:52,427][635776] Updated weights for policy 0, policy_version 190800 (0.0004) [2023-03-09 08:19:55,122][635489] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 10219.2). Total num frames: 97714176. Throughput: 0: 10342.4. Samples: 97693808. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 08:19:55,132][635489] Avg episode reward: [(0, '4590.082')] [2023-03-09 08:19:56,528][635776] Updated weights for policy 0, policy_version 190880 (0.0005) [2023-03-09 08:20:00,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10308.3, 300 sec: 10205.3). Total num frames: 97763328. Throughput: 0: 10311.5. Samples: 97754924. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 08:20:00,132][635489] Avg episode reward: [(0, '4636.195')] [2023-03-09 08:20:00,176][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000190952_97767424.pth... [2023-03-09 08:20:00,179][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000190344_97456128.pth [2023-03-09 08:20:00,179][635732] Saving new best policy, reward=4636.195! [2023-03-09 08:20:00,578][635776] Updated weights for policy 0, policy_version 190960 (0.0004) [2023-03-09 08:20:04,482][635776] Updated weights for policy 0, policy_version 191040 (0.0005) [2023-03-09 08:20:05,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10219.2). Total num frames: 97816576. Throughput: 0: 10339.6. Samples: 97816640. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 08:20:05,133][635489] Avg episode reward: [(0, '4619.335')] [2023-03-09 08:20:08,793][635776] Updated weights for policy 0, policy_version 191120 (0.0005) [2023-03-09 08:20:10,122][635489] Fps is (10 sec: 10239.8, 60 sec: 10240.0, 300 sec: 10219.2). Total num frames: 97865728. Throughput: 0: 10294.5. Samples: 97845312. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 08:20:10,122][635489] Avg episode reward: [(0, '4577.785')] [2023-03-09 08:20:13,171][635776] Updated weights for policy 0, policy_version 191200 (0.0006) [2023-03-09 08:20:15,122][635489] Fps is (10 sec: 9420.8, 60 sec: 10171.7, 300 sec: 10191.4). Total num frames: 97910784. Throughput: 0: 10196.3. Samples: 97902236. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 08:20:15,122][635489] Avg episode reward: [(0, '4625.076')] [2023-03-09 08:20:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000191232_97910784.pth... [2023-03-09 08:20:15,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000190648_97611776.pth [2023-03-09 08:20:17,130][635776] Updated weights for policy 0, policy_version 191280 (0.0005) [2023-03-09 08:20:20,121][635489] Fps is (10 sec: 9830.6, 60 sec: 10171.8, 300 sec: 10191.4). Total num frames: 97964032. Throughput: 0: 10156.5. Samples: 97964260. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 08:20:20,122][635489] Avg episode reward: [(0, '4630.368')] [2023-03-09 08:20:21,148][635776] Updated weights for policy 0, policy_version 191360 (0.0005) [2023-03-09 08:20:25,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10171.7, 300 sec: 10163.6). Total num frames: 98013184. Throughput: 0: 10187.0. Samples: 97996356. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 08:20:25,122][635489] Avg episode reward: [(0, '4542.175')] [2023-03-09 08:20:25,147][635776] Updated weights for policy 0, policy_version 191440 (0.0004) [2023-03-09 08:20:29,200][635776] Updated weights for policy 0, policy_version 191520 (0.0004) [2023-03-09 08:20:30,121][635489] Fps is (10 sec: 10240.0, 60 sec: 10171.8, 300 sec: 10177.5). Total num frames: 98066432. Throughput: 0: 10078.2. Samples: 98054144. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 08:20:30,122][635489] Avg episode reward: [(0, '4546.111')] [2023-03-09 08:20:30,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000191536_98066432.pth... [2023-03-09 08:20:30,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000190952_97767424.pth [2023-03-09 08:20:32,953][635776] Updated weights for policy 0, policy_version 191600 (0.0005) [2023-03-09 08:20:35,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10240.0, 300 sec: 10177.5). Total num frames: 98119680. Throughput: 0: 10085.4. Samples: 98119156. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 08:20:35,122][635489] Avg episode reward: [(0, '4623.699')] [2023-03-09 08:20:36,977][635776] Updated weights for policy 0, policy_version 191680 (0.0005) [2023-03-09 08:20:40,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 10163.6). Total num frames: 98168832. Throughput: 0: 10106.0. Samples: 98148580. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 08:20:40,122][635489] Avg episode reward: [(0, '4603.510')] [2023-03-09 08:20:41,099][635776] Updated weights for policy 0, policy_version 191760 (0.0006) [2023-03-09 08:20:45,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10149.7). Total num frames: 98217984. Throughput: 0: 10101.1. Samples: 98209476. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 08:20:45,122][635489] Avg episode reward: [(0, '4620.126')] [2023-03-09 08:20:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000191832_98217984.pth... [2023-03-09 08:20:45,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000191232_97910784.pth [2023-03-09 08:20:45,225][635776] Updated weights for policy 0, policy_version 191840 (0.0005) [2023-03-09 08:20:49,179][635776] Updated weights for policy 0, policy_version 191920 (0.0005) [2023-03-09 08:20:50,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10163.6). Total num frames: 98271232. Throughput: 0: 10082.6. Samples: 98270356. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-09 08:20:50,122][635489] Avg episode reward: [(0, '4615.697')] [2023-03-09 08:20:53,297][635776] Updated weights for policy 0, policy_version 192000 (0.0005) [2023-03-09 08:20:55,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10149.8). Total num frames: 98320384. Throughput: 0: 10102.2. Samples: 98299912. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:20:55,122][635489] Avg episode reward: [(0, '4526.103')] [2023-03-09 08:20:57,425][635776] Updated weights for policy 0, policy_version 192080 (0.0006) [2023-03-09 08:21:00,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10103.4, 300 sec: 10135.9). Total num frames: 98369536. Throughput: 0: 10202.1. Samples: 98361332. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:21:00,122][635489] Avg episode reward: [(0, '4563.602')] [2023-03-09 08:21:00,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000192128_98369536.pth... [2023-03-09 08:21:00,126][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000191536_98066432.pth [2023-03-09 08:21:01,248][635776] Updated weights for policy 0, policy_version 192160 (0.0005) [2023-03-09 08:21:05,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10149.7). Total num frames: 98422784. Throughput: 0: 10164.9. Samples: 98421684. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:21:05,122][635489] Avg episode reward: [(0, '4618.545')] [2023-03-09 08:21:05,530][635776] Updated weights for policy 0, policy_version 192240 (0.0004) [2023-03-09 08:21:09,550][635776] Updated weights for policy 0, policy_version 192320 (0.0005) [2023-03-09 08:21:10,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10149.8). Total num frames: 98471936. Throughput: 0: 10113.1. Samples: 98451448. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:21:10,122][635489] Avg episode reward: [(0, '4577.939')] [2023-03-09 08:21:13,709][635776] Updated weights for policy 0, policy_version 192400 (0.0005) [2023-03-09 08:21:15,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10149.7). Total num frames: 98521088. Throughput: 0: 10151.1. Samples: 98510944. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:21:15,122][635489] Avg episode reward: [(0, '4565.812')] [2023-03-09 08:21:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000192424_98521088.pth... [2023-03-09 08:21:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000191832_98217984.pth [2023-03-09 08:21:17,865][635776] Updated weights for policy 0, policy_version 192480 (0.0005) [2023-03-09 08:21:20,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10103.4, 300 sec: 10135.9). Total num frames: 98570240. Throughput: 0: 10024.3. Samples: 98570248. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:21:20,122][635489] Avg episode reward: [(0, '4570.895')] [2023-03-09 08:21:21,921][635776] Updated weights for policy 0, policy_version 192560 (0.0005) [2023-03-09 08:21:25,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10103.4, 300 sec: 10135.9). Total num frames: 98619392. Throughput: 0: 10021.7. Samples: 98599556. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:21:25,122][635489] Avg episode reward: [(0, '4568.971')] [2023-03-09 08:21:25,980][635776] Updated weights for policy 0, policy_version 192640 (0.0005) [2023-03-09 08:21:29,951][635776] Updated weights for policy 0, policy_version 192720 (0.0005) [2023-03-09 08:21:30,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.4, 300 sec: 10149.7). Total num frames: 98672640. Throughput: 0: 10048.0. Samples: 98661636. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:21:30,122][635489] Avg episode reward: [(0, '4618.282')] [2023-03-09 08:21:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000192720_98672640.pth... [2023-03-09 08:21:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000192128_98369536.pth [2023-03-09 08:21:33,672][635776] Updated weights for policy 0, policy_version 192800 (0.0005) [2023-03-09 08:21:35,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10103.5, 300 sec: 10163.6). Total num frames: 98725888. Throughput: 0: 10119.1. Samples: 98725716. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:21:35,122][635489] Avg episode reward: [(0, '4605.584')] [2023-03-09 08:21:37,879][635776] Updated weights for policy 0, policy_version 192880 (0.0004) [2023-03-09 08:21:40,121][635489] Fps is (10 sec: 10240.1, 60 sec: 10103.5, 300 sec: 10177.5). Total num frames: 98775040. Throughput: 0: 10104.7. Samples: 98754624. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:21:40,122][635489] Avg episode reward: [(0, '4598.652')] [2023-03-09 08:21:41,918][635776] Updated weights for policy 0, policy_version 192960 (0.0005) [2023-03-09 08:21:45,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 10177.5). Total num frames: 98828288. Throughput: 0: 10103.9. Samples: 98816008. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:21:45,122][635489] Avg episode reward: [(0, '4596.055')] [2023-03-09 08:21:45,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000193024_98828288.pth... [2023-03-09 08:21:45,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000192424_98521088.pth [2023-03-09 08:21:45,858][635776] Updated weights for policy 0, policy_version 193040 (0.0005) [2023-03-09 08:21:49,459][635776] Updated weights for policy 0, policy_version 193120 (0.0005) [2023-03-09 08:21:50,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10171.7, 300 sec: 10191.4). Total num frames: 98881536. Throughput: 0: 10220.5. Samples: 98881608. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:21:50,122][635489] Avg episode reward: [(0, '4609.132')] [2023-03-09 08:21:53,369][635776] Updated weights for policy 0, policy_version 193200 (0.0005) [2023-03-09 08:21:55,122][635489] Fps is (10 sec: 10649.6, 60 sec: 10240.0, 300 sec: 10191.4). Total num frames: 98934784. Throughput: 0: 10284.8. Samples: 98914264. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:21:55,122][635489] Avg episode reward: [(0, '4599.659')] [2023-03-09 08:21:57,320][635776] Updated weights for policy 0, policy_version 193280 (0.0005) [2023-03-09 08:22:00,121][635489] Fps is (10 sec: 10649.7, 60 sec: 10308.3, 300 sec: 10205.3). Total num frames: 98988032. Throughput: 0: 10329.2. Samples: 98975756. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 08:22:00,122][635489] Avg episode reward: [(0, '4598.626')] [2023-03-09 08:22:00,124][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000193336_98988032.pth... [2023-03-09 08:22:00,125][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000192720_98672640.pth [2023-03-09 08:22:01,216][635776] Updated weights for policy 0, policy_version 193360 (0.0005) [2023-03-09 08:22:05,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10219.2). Total num frames: 99037184. Throughput: 0: 10359.3. Samples: 99036416. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 08:22:05,122][635489] Avg episode reward: [(0, '4620.588')] [2023-03-09 08:22:05,470][635776] Updated weights for policy 0, policy_version 193440 (0.0005) [2023-03-09 08:22:09,627][635776] Updated weights for policy 0, policy_version 193520 (0.0005) [2023-03-09 08:22:10,122][635489] Fps is (10 sec: 9830.3, 60 sec: 10240.0, 300 sec: 10205.3). Total num frames: 99086336. Throughput: 0: 10362.4. Samples: 99065864. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 08:22:10,122][635489] Avg episode reward: [(0, '4611.555')] [2023-03-09 08:22:13,427][635776] Updated weights for policy 0, policy_version 193600 (0.0005) [2023-03-09 08:22:15,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10308.3, 300 sec: 10205.3). Total num frames: 99139584. Throughput: 0: 10377.1. Samples: 99128608. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 08:22:15,122][635489] Avg episode reward: [(0, '4623.849')] [2023-03-09 08:22:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000193632_99139584.pth... [2023-03-09 08:22:15,127][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000193024_98828288.pth [2023-03-09 08:22:17,568][635776] Updated weights for policy 0, policy_version 193680 (0.0004) [2023-03-09 08:22:20,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10191.4). Total num frames: 99188736. Throughput: 0: 10286.8. Samples: 99188620. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 08:22:20,122][635489] Avg episode reward: [(0, '4598.522')] [2023-03-09 08:22:21,517][635776] Updated weights for policy 0, policy_version 193760 (0.0005) [2023-03-09 08:22:25,121][635489] Fps is (10 sec: 9830.5, 60 sec: 10308.3, 300 sec: 10177.5). Total num frames: 99237888. Throughput: 0: 10316.2. Samples: 99218852. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 08:22:25,122][635489] Avg episode reward: [(0, '4597.080')] [2023-03-09 08:22:25,535][635776] Updated weights for policy 0, policy_version 193840 (0.0005) [2023-03-09 08:22:29,537][635776] Updated weights for policy 0, policy_version 193920 (0.0004) [2023-03-09 08:22:30,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10308.3, 300 sec: 10177.5). Total num frames: 99291136. Throughput: 0: 10296.1. Samples: 99279332. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 08:22:30,122][635489] Avg episode reward: [(0, '4619.585')] [2023-03-09 08:22:30,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000193928_99291136.pth... [2023-03-09 08:22:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000193336_98988032.pth [2023-03-09 08:22:33,314][635776] Updated weights for policy 0, policy_version 194000 (0.0004) [2023-03-09 08:22:35,122][635489] Fps is (10 sec: 10649.5, 60 sec: 10308.3, 300 sec: 10191.4). Total num frames: 99344384. Throughput: 0: 10283.9. Samples: 99344384. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 08:22:35,122][635489] Avg episode reward: [(0, '4615.927')] [2023-03-09 08:22:37,419][635776] Updated weights for policy 0, policy_version 194080 (0.0004) [2023-03-09 08:22:40,122][635489] Fps is (10 sec: 10649.7, 60 sec: 10376.5, 300 sec: 10191.4). Total num frames: 99397632. Throughput: 0: 10221.4. Samples: 99374228. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 08:22:40,122][635489] Avg episode reward: [(0, '4613.677')] [2023-03-09 08:22:41,332][635776] Updated weights for policy 0, policy_version 194160 (0.0005) [2023-03-09 08:22:45,122][635489] Fps is (10 sec: 10239.9, 60 sec: 10308.3, 300 sec: 10205.3). Total num frames: 99446784. Throughput: 0: 10194.2. Samples: 99434496. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 08:22:45,122][635489] Avg episode reward: [(0, '4612.487')] [2023-03-09 08:22:45,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000194232_99446784.pth... [2023-03-09 08:22:45,129][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000193632_99139584.pth [2023-03-09 08:22:45,421][635776] Updated weights for policy 0, policy_version 194240 (0.0004) [2023-03-09 08:22:49,533][635776] Updated weights for policy 0, policy_version 194320 (0.0004) [2023-03-09 08:22:50,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 10205.3). Total num frames: 99495936. Throughput: 0: 10207.8. Samples: 99495768. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 08:22:50,122][635489] Avg episode reward: [(0, '4609.346')] [2023-03-09 08:22:53,652][635776] Updated weights for policy 0, policy_version 194400 (0.0005) [2023-03-09 08:22:55,122][635489] Fps is (10 sec: 9830.5, 60 sec: 10171.7, 300 sec: 10205.3). Total num frames: 99545088. Throughput: 0: 10209.7. Samples: 99525300. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 08:22:55,122][635489] Avg episode reward: [(0, '4582.396')] [2023-03-09 08:22:58,008][635776] Updated weights for policy 0, policy_version 194480 (0.0005) [2023-03-09 08:23:00,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10103.4, 300 sec: 10191.4). Total num frames: 99594240. Throughput: 0: 10074.6. Samples: 99581964. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-09 08:23:00,122][635489] Avg episode reward: [(0, '4597.146')] [2023-03-09 08:23:00,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000194520_99594240.pth... [2023-03-09 08:23:00,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000193928_99291136.pth [2023-03-09 08:23:02,120][635776] Updated weights for policy 0, policy_version 194560 (0.0005) [2023-03-09 08:23:05,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10191.4). Total num frames: 99643392. Throughput: 0: 10102.5. Samples: 99643232. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:23:05,122][635489] Avg episode reward: [(0, '4619.104')] [2023-03-09 08:23:06,106][635776] Updated weights for policy 0, policy_version 194640 (0.0005) [2023-03-09 08:23:10,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10191.4). Total num frames: 99692544. Throughput: 0: 10081.1. Samples: 99672504. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:23:10,122][635489] Avg episode reward: [(0, '4613.815')] [2023-03-09 08:23:10,318][635776] Updated weights for policy 0, policy_version 194720 (0.0005) [2023-03-09 08:23:14,616][635776] Updated weights for policy 0, policy_version 194800 (0.0005) [2023-03-09 08:23:15,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10177.5). Total num frames: 99741696. Throughput: 0: 10030.9. Samples: 99730724. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:23:15,122][635489] Avg episode reward: [(0, '4598.155')] [2023-03-09 08:23:15,125][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000194808_99741696.pth... [2023-03-09 08:23:15,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000194232_99446784.pth [2023-03-09 08:23:18,427][635776] Updated weights for policy 0, policy_version 194880 (0.0005) [2023-03-09 08:23:20,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10177.5). Total num frames: 99794944. Throughput: 0: 9983.5. Samples: 99793640. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:23:20,122][635489] Avg episode reward: [(0, '4604.533')] [2023-03-09 08:23:22,568][635776] Updated weights for policy 0, policy_version 194960 (0.0004) [2023-03-09 08:23:25,122][635489] Fps is (10 sec: 10649.7, 60 sec: 10171.7, 300 sec: 10205.3). Total num frames: 99848192. Throughput: 0: 9980.9. Samples: 99823368. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:23:25,122][635489] Avg episode reward: [(0, '4625.908')] [2023-03-09 08:23:26,357][635776] Updated weights for policy 0, policy_version 195040 (0.0005) [2023-03-09 08:23:30,122][635489] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10191.4). Total num frames: 99897344. Throughput: 0: 10013.9. Samples: 99885120. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:23:30,122][635489] Avg episode reward: [(0, '4610.266')] [2023-03-09 08:23:30,126][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000195112_99897344.pth... [2023-03-09 08:23:30,128][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000194520_99594240.pth [2023-03-09 08:23:30,467][635776] Updated weights for policy 0, policy_version 195120 (0.0005) [2023-03-09 08:23:34,566][635776] Updated weights for policy 0, policy_version 195200 (0.0005) [2023-03-09 08:23:35,122][635489] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10177.5). Total num frames: 99946496. Throughput: 0: 10014.2. Samples: 99946408. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:23:35,122][635489] Avg episode reward: [(0, '4613.245')] [2023-03-09 08:23:38,709][635776] Updated weights for policy 0, policy_version 195280 (0.0004) [2023-03-09 08:23:40,121][635489] Fps is (10 sec: 9830.6, 60 sec: 9966.9, 300 sec: 10163.6). Total num frames: 99995648. Throughput: 0: 9998.5. Samples: 99975232. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-09 08:23:40,122][635489] Avg episode reward: [(0, '4606.660')] [2023-03-09 08:23:40,665][635732] Early stopping after 2 epochs (8 sgd steps), loss delta 0.0000000 [2023-03-09 08:23:41,117][635732] Early stopping after 2 epochs (8 sgd steps), loss delta 0.0000000 [2023-03-09 08:23:41,118][635812] Stopping RolloutWorker_w5... [2023-03-09 08:23:41,118][635777] Stopping RolloutWorker_w2... [2023-03-09 08:23:41,118][635814] Stopping RolloutWorker_w6... [2023-03-09 08:23:41,118][635812] Loop rollout_proc5_evt_loop terminating... [2023-03-09 08:23:41,118][635778] Stopping RolloutWorker_w3... [2023-03-09 08:23:41,118][635813] Stopping RolloutWorker_w7... [2023-03-09 08:23:41,118][635811] Stopping RolloutWorker_w4... [2023-03-09 08:23:41,118][635779] Stopping RolloutWorker_w0... [2023-03-09 08:23:41,118][635777] Loop rollout_proc2_evt_loop terminating... [2023-03-09 08:23:41,118][635732] Stopping Batcher_0... [2023-03-09 08:23:41,118][635814] Loop rollout_proc6_evt_loop terminating... [2023-03-09 08:23:41,118][635813] Loop rollout_proc7_evt_loop terminating... [2023-03-09 08:23:41,118][635778] Loop rollout_proc3_evt_loop terminating... [2023-03-09 08:23:41,118][635846] Stopping RolloutWorker_w1... [2023-03-09 08:23:41,118][635779] Loop rollout_proc0_evt_loop terminating... [2023-03-09 08:23:41,118][635811] Loop rollout_proc4_evt_loop terminating... [2023-03-09 08:23:41,118][635732] Loop batcher_evt_loop terminating... [2023-03-09 08:23:41,118][635846] Loop rollout_proc1_evt_loop terminating... [2023-03-09 08:23:41,118][635489] Component RolloutWorker_w5 stopped! [2023-03-09 08:23:41,119][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000195328_100007936.pth... [2023-03-09 08:23:41,119][635489] Component RolloutWorker_w2 stopped! [2023-03-09 08:23:41,119][635489] Component RolloutWorker_w6 stopped! [2023-03-09 08:23:41,120][635489] Component RolloutWorker_w3 stopped! [2023-03-09 08:23:41,120][635489] Component RolloutWorker_w4 stopped! [2023-03-09 08:23:41,121][635489] Component RolloutWorker_w0 stopped! [2023-03-09 08:23:41,121][635732] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000194808_99741696.pth [2023-03-09 08:23:41,121][635489] Component Batcher_0 stopped! [2023-03-09 08:23:41,121][635489] Component RolloutWorker_w7 stopped! [2023-03-09 08:23:41,122][635732] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/lever-pull-v2/checkpoint_p0/checkpoint_000195328_100007936.pth... [2023-03-09 08:23:41,122][635489] Component RolloutWorker_w1 stopped! [2023-03-09 08:23:41,124][635732] Stopping LearnerWorker_p0... [2023-03-09 08:23:41,124][635732] Loop learner_proc0_evt_loop terminating... [2023-03-09 08:23:41,124][635489] Component LearnerWorker_p0 stopped! [2023-03-09 08:23:41,168][635776] Weights refcount: 2 0 [2023-03-09 08:23:41,169][635776] Stopping InferenceWorker_p0-w0... [2023-03-09 08:23:41,170][635776] Loop inference_proc0-0_evt_loop terminating... [2023-03-09 08:23:41,170][635489] Component InferenceWorker_p0-w0 stopped! [2023-03-09 08:23:41,171][635489] Waiting for process learner_proc0 to stop... [2023-03-09 08:23:41,645][635489] Waiting for process inference_proc0-0 to join... [2023-03-09 08:23:41,646][635489] Waiting for process rollout_proc0 to join... [2023-03-09 08:23:41,660][635489] Waiting for process rollout_proc1 to join... [2023-03-09 08:23:41,661][635489] Waiting for process rollout_proc2 to join... [2023-03-09 08:23:41,661][635489] Waiting for process rollout_proc3 to join... [2023-03-09 08:23:41,661][635489] Waiting for process rollout_proc4 to join... [2023-03-09 08:23:41,661][635489] Waiting for process rollout_proc5 to join... [2023-03-09 08:23:41,662][635489] Waiting for process rollout_proc6 to join... [2023-03-09 08:23:41,662][635489] Waiting for process rollout_proc7 to join... [2023-03-09 08:23:41,662][635489] Batcher 0 profile tree view: batching: 17.2459, releasing_batches: 14.9651 [2023-03-09 08:23:41,662][635489] InferenceWorker_p0-w0 profile tree view: wait_policy: 0.0051 wait_policy_total: 3494.4281 update_model: 106.2137 weight_update: 0.0005 one_step: 0.0020 handle_policy_step: 5559.0443 deserialize: 235.4188, stack: 57.4409, obs_to_device_normalize: 977.2036, forward: 2765.3351, send_messages: 435.1189 prepare_outputs: 607.1697 to_cpu: 94.5583 [2023-03-09 08:23:41,662][635489] Learner 0 profile tree view: misc: 0.0960, prepare_batch: 79.1793 train: 1015.8026 epoch_init: 0.3322, minibatch_init: 10.7159, losses_postprocess: 12.0983, kl_divergence: 3.9162, after_optimizer: 5.5754 calculate_losses: 414.1972 losses_init: 0.2775, forward_head: 200.2069, bptt_initial: 1.1395, bptt: 1.1444, tail: 102.0237, advantages_returns: 8.2264, losses: 88.7835 update: 554.7724 clip: 50.4974 [2023-03-09 08:23:41,663][635489] RolloutWorker_w0 profile tree view: wait_for_trajectories: 4.3404, enqueue_policy_requests: 156.5424, env_step: 5480.9096, overhead: 329.9148, complete_rollouts: 3.6008 save_policy_outputs: 367.3087 split_output_tensors: 180.2528 [2023-03-09 08:23:41,663][635489] RolloutWorker_w7 profile tree view: wait_for_trajectories: 4.0716, enqueue_policy_requests: 160.1942, env_step: 5493.6536, overhead: 338.6053, complete_rollouts: 3.6737 save_policy_outputs: 367.9042 split_output_tensors: 179.7499 [2023-03-09 08:23:41,663][635489] Loop Runner_EvtLoop terminating... [2023-03-09 08:23:41,663][635489] Runner profile tree view: main_loop: 9834.1915 [2023-03-09 08:23:41,664][635489] Collected {0: 100007936}, FPS: 10169.4