[2023-09-12 15:15:28,046][62109] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-09-12 15:15:28,046][62109] Set environment var CUDA_VISIBLE_DEVICES to '0' (GPU indices [0]) for learning process 0 [2023-09-12 15:15:28,091][62109] Num visible devices: 1 [2023-09-12 15:15:28,133][62109] Starting seed is not provided [2023-09-12 15:15:28,133][62109] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-09-12 15:15:28,133][62109] Initializing actor-critic model on device cuda:0 [2023-09-12 15:15:28,134][62109] RunningMeanStd input shape: (23,) [2023-09-12 15:15:28,134][62109] RunningMeanStd input shape: (3, 72, 128) [2023-09-12 15:15:28,135][62109] RunningMeanStd input shape: (1,) [2023-09-12 15:15:28,155][62109] ConvEncoder: input_channels=3 [2023-09-12 15:15:28,438][62109] Conv encoder output size: 512 [2023-09-12 15:15:28,440][62109] Policy head output size: 640 [2023-09-12 15:15:28,469][62109] Created Actor Critic model with architecture: [2023-09-12 15:15:28,469][62109] ActorCriticSharedWeights( (obs_normalizer): ObservationNormalizer( (running_mean_std): RunningMeanStdDictInPlace( (running_mean_std): ModuleDict( (measurements): RunningMeanStdInPlace() (obs): RunningMeanStdInPlace() ) ) ) (returns_normalizer): RecursiveScriptModule(original_name=RunningMeanStdInPlace) (encoder): VizdoomEncoder( (basic_encoder): ConvEncoder( (enc): RecursiveScriptModule( original_name=ConvEncoderImpl (conv_head): RecursiveScriptModule( original_name=Sequential (0): RecursiveScriptModule(original_name=Conv2d) (1): RecursiveScriptModule(original_name=ELU) (2): RecursiveScriptModule(original_name=Conv2d) (3): RecursiveScriptModule(original_name=ELU) (4): RecursiveScriptModule(original_name=Conv2d) (5): RecursiveScriptModule(original_name=ELU) ) (mlp_layers): RecursiveScriptModule( original_name=Sequential (0): RecursiveScriptModule(original_name=Linear) (1): RecursiveScriptModule(original_name=ELU) ) ) ) (measurements_head): Sequential( (0): Linear(in_features=23, out_features=128, bias=True) (1): ELU(alpha=1.0) (2): Linear(in_features=128, out_features=128, bias=True) (3): ELU(alpha=1.0) ) ) (core): ModelCoreRNN( (core): GRU(640, 512) ) (decoder): MlpDecoder( (mlp): Identity() ) (critic_linear): Linear(in_features=512, out_features=1, bias=True) (action_parameterization): ActionParameterizationDefault( (distribution_linear): Linear(in_features=512, out_features=15, bias=True) ) ) [2023-09-12 15:15:29,811][62109] Using optimizer [2023-09-12 15:15:29,811][62109] No checkpoints found [2023-09-12 15:15:29,812][62109] Did not load from checkpoint, starting from scratch! [2023-09-12 15:15:29,812][62109] Initialized policy 0 weights for model version 0 [2023-09-12 15:15:29,814][62109] LearnerWorker_p0 finished initialization! [2023-09-12 15:15:29,815][62109] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-09-12 15:15:29,990][62305] Worker 3 uses CPU cores [12, 13, 14, 15] [2023-09-12 15:15:29,990][62262] Worker 0 uses CPU cores [0, 1, 2, 3] [2023-09-12 15:15:29,995][62307] Worker 6 uses CPU cores [24, 25, 26, 27] [2023-09-12 15:15:30,002][62265] Worker 2 uses CPU cores [8, 9, 10, 11] [2023-09-12 15:15:30,008][62266] Worker 1 uses CPU cores [4, 5, 6, 7] [2023-09-12 15:15:30,164][62309] Worker 7 uses CPU cores [28, 29, 30, 31] [2023-09-12 15:15:30,227][62267] Worker 4 uses CPU cores [16, 17, 18, 19] [2023-09-12 15:15:30,408][62259] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-09-12 15:15:30,408][62259] Set environment var CUDA_VISIBLE_DEVICES to '0' (GPU indices [0]) for inference process 0 [2023-09-12 15:15:30,408][62308] Worker 5 uses CPU cores [20, 21, 22, 23] [2023-09-12 15:15:30,426][62259] Num visible devices: 1 [2023-09-12 15:15:31,129][62259] RunningMeanStd input shape: (23,) [2023-09-12 15:15:31,130][62259] RunningMeanStd input shape: (3, 72, 128) [2023-09-12 15:15:31,130][62259] RunningMeanStd input shape: (1,) [2023-09-12 15:15:31,142][62259] ConvEncoder: input_channels=3 [2023-09-12 15:15:31,249][62259] Conv encoder output size: 512 [2023-09-12 15:15:31,250][62259] Policy head output size: 640 [2023-09-12 15:15:31,611][62265] Multi agent env, num agents: 8 [2023-09-12 15:15:31,611][62262] Multi agent env, num agents: 8 [2023-09-12 15:15:31,613][62267] Multi agent env, num agents: 8 [2023-09-12 15:15:31,613][62308] Multi agent env, num agents: 8 [2023-09-12 15:15:31,614][62266] Multi agent env, num agents: 8 [2023-09-12 15:15:31,615][62307] Multi agent env, num agents: 8 [2023-09-12 15:15:31,615][62309] Multi agent env, num agents: 8 [2023-09-12 15:15:31,615][62305] Multi agent env, num agents: 8 [2023-09-12 15:15:31,645][62267] Doom resolution: 160x120, resize resolution: (128, 72) [2023-09-12 15:15:31,646][62308] Doom resolution: 160x120, resize resolution: (128, 72) [2023-09-12 15:15:31,646][62266] Doom resolution: 160x120, resize resolution: (128, 72) [2023-09-12 15:15:31,647][62307] Doom resolution: 160x120, resize resolution: (128, 72) [2023-09-12 15:15:31,648][62267] Multi agent env, num agents: 8 [2023-09-12 15:15:31,649][62308] Multi agent env, num agents: 8 [2023-09-12 15:15:31,650][62266] Multi agent env, num agents: 8 [2023-09-12 15:15:31,650][62307] Multi agent env, num agents: 8 [2023-09-12 15:15:31,658][62265] Doom resolution: 160x120, resize resolution: (128, 72) [2023-09-12 15:15:31,658][62262] Doom resolution: 160x120, resize resolution: (128, 72) [2023-09-12 15:15:31,664][62262] Multi agent env, num agents: 8 [2023-09-12 15:15:31,664][62265] Multi agent env, num agents: 8 [2023-09-12 15:15:31,665][62305] Doom resolution: 160x120, resize resolution: (128, 72) [2023-09-12 15:15:31,665][62309] Doom resolution: 160x120, resize resolution: (128, 72) [2023-09-12 15:15:31,671][62305] Multi agent env, num agents: 8 [2023-09-12 15:15:31,671][62309] Multi agent env, num agents: 8 [2023-09-12 15:15:31,680][62308] Port 40800 is available [2023-09-12 15:15:31,680][62308] Using port 40800 [2023-09-12 15:15:31,680][62266] Port 40400 is available [2023-09-12 15:15:31,680][62266] Using port 40400 [2023-09-12 15:15:31,681][62308] Initializing env for player 0, init_info: {'port': 40800}... [2023-09-12 15:15:31,681][62266] Initializing env for player 0, init_info: {'port': 40400}... [2023-09-12 15:15:31,690][62307] UDP port 40900 cannot be used [Errno 98] Address already in use [2023-09-12 15:15:31,691][62307] Port 41900 is available [2023-09-12 15:15:31,691][62307] Using port 41900 [2023-09-12 15:15:31,692][62307] Initializing env for player 0, init_info: {'port': 41900}... [2023-09-12 15:15:31,693][62267] UDP port 40700 cannot be used [Errno 98] Address already in use [2023-09-12 15:15:31,694][62267] Port 41700 is available [2023-09-12 15:15:31,694][62267] Using port 41700 [2023-09-12 15:15:31,695][62267] Initializing env for player 0, init_info: {'port': 41700}... [2023-09-12 15:15:31,711][62265] Port 40500 is available [2023-09-12 15:15:31,711][62265] Using port 40500 [2023-09-12 15:15:31,712][62265] Initializing env for player 0, init_info: {'port': 40500}... [2023-09-12 15:15:31,731][62308] Initializing env for player 1, init_info: {'port': 40800}... [2023-09-12 15:15:31,732][62266] Using port 40400 on host... [2023-09-12 15:15:31,733][62308] Using port 40800 on host... [2023-09-12 15:15:31,733][62266] Initializing env for player 1, init_info: {'port': 40400}... [2023-09-12 15:15:31,734][62267] Using port 41700 on host... [2023-09-12 15:15:31,739][62307] Using port 41900 on host... [2023-09-12 15:15:31,742][62307] Initializing env for player 1, init_info: {'port': 41900}... [2023-09-12 15:15:31,745][62267] Initializing env for player 1, init_info: {'port': 41700}... [2023-09-12 15:15:31,757][62265] Using port 40500 on host... [2023-09-12 15:15:31,762][62265] Initializing env for player 1, init_info: {'port': 40500}... [2023-09-12 15:15:31,782][62308] Initializing env for player 2, init_info: {'port': 40800}... [2023-09-12 15:15:31,783][62266] Initializing env for player 2, init_info: {'port': 40400}... [2023-09-12 15:15:31,788][62305] Port 40600 is available [2023-09-12 15:15:31,789][62305] Using port 40600 [2023-09-12 15:15:31,792][62309] Port 41000 is available [2023-09-12 15:15:31,792][62309] Using port 41000 [2023-09-12 15:15:31,794][62309] Initializing env for player 0, init_info: {'port': 41000}... [2023-09-12 15:15:31,794][62307] Initializing env for player 2, init_info: {'port': 41900}... [2023-09-12 15:15:31,799][62262] Port 40300 is available [2023-09-12 15:15:31,800][62262] Using port 40300 [2023-09-12 15:15:31,800][62267] Initializing env for player 2, init_info: {'port': 41700}... [2023-09-12 15:15:31,801][62262] Initializing env for player 0, init_info: {'port': 40300}... [2023-09-12 15:15:31,830][62309] Using port 41000 on host... [2023-09-12 15:15:31,831][62265] Initializing env for player 2, init_info: {'port': 40500}... [2023-09-12 15:15:31,832][62308] Initializing env for player 3, init_info: {'port': 40800}... [2023-09-12 15:15:31,835][62266] Initializing env for player 3, init_info: {'port': 40400}... [2023-09-12 15:15:31,845][62309] Initializing env for player 1, init_info: {'port': 41000}... [2023-09-12 15:15:31,846][62262] Using port 40300 on host... [2023-09-12 15:15:31,851][62267] Initializing env for player 3, init_info: {'port': 41700}... [2023-09-12 15:15:31,852][62262] Initializing env for player 1, init_info: {'port': 40300}... [2023-09-12 15:15:31,855][62307] Initializing env for player 3, init_info: {'port': 41900}... [2023-09-12 15:15:31,883][62308] Initializing env for player 4, init_info: {'port': 40800}... [2023-09-12 15:15:31,886][62265] Initializing env for player 3, init_info: {'port': 40500}... [2023-09-12 15:15:31,891][62266] Initializing env for player 4, init_info: {'port': 40400}... [2023-09-12 15:15:31,897][62309] Initializing env for player 2, init_info: {'port': 41000}... [2023-09-12 15:15:31,901][62262] Initializing env for player 2, init_info: {'port': 40300}... [2023-09-12 15:15:31,903][62267] Initializing env for player 4, init_info: {'port': 41700}... [2023-09-12 15:15:31,907][62307] Initializing env for player 4, init_info: {'port': 41900}... [2023-09-12 15:15:31,933][62308] Initializing env for player 5, init_info: {'port': 40800}... [2023-09-12 15:15:31,943][62266] Initializing env for player 5, init_info: {'port': 40400}... [2023-09-12 15:15:31,943][62265] Initializing env for player 4, init_info: {'port': 40500}... [2023-09-12 15:15:31,948][62309] Initializing env for player 3, init_info: {'port': 41000}... [2023-09-12 15:15:31,952][62262] Initializing env for player 3, init_info: {'port': 40300}... [2023-09-12 15:15:31,955][62267] Initializing env for player 5, init_info: {'port': 41700}... [2023-09-12 15:15:31,963][62307] Initializing env for player 5, init_info: {'port': 41900}... [2023-09-12 15:15:31,983][62308] Initializing env for player 6, init_info: {'port': 40800}... [2023-09-12 15:15:31,991][62265] Initializing env for player 5, init_info: {'port': 40500}... [2023-09-12 15:15:31,991][62266] Initializing env for player 6, init_info: {'port': 40400}... [2023-09-12 15:15:31,999][62309] Initializing env for player 4, init_info: {'port': 41000}... [2023-09-12 15:15:32,011][62267] Initializing env for player 6, init_info: {'port': 41700}... [2023-09-12 15:15:32,012][62262] Initializing env for player 4, init_info: {'port': 40300}... [2023-09-12 15:15:32,015][62307] Initializing env for player 6, init_info: {'port': 41900}... [2023-09-12 15:15:32,047][62265] Initializing env for player 6, init_info: {'port': 40500}... [2023-09-12 15:15:32,055][62308] Initializing env for player 7, init_info: {'port': 40800}... [2023-09-12 15:15:32,063][62307] Initializing env for player 7, init_info: {'port': 41900}... [2023-09-12 15:15:32,051][62266] Initializing env for player 7, init_info: {'port': 40400}... [2023-09-12 15:15:32,063][62309] Initializing env for player 5, init_info: {'port': 41000}... [2023-09-12 15:15:32,063][62267] Initializing env for player 7, init_info: {'port': 41700}... [2023-09-12 15:15:32,103][62309] Initializing env for player 6, init_info: {'port': 41000}... [2023-09-12 15:15:32,104][62265] Initializing env for player 7, init_info: {'port': 40500}... [2023-09-12 15:15:32,114][62262] Initializing env for player 5, init_info: {'port': 40300}... [2023-09-12 15:15:32,151][62309] Initializing env for player 7, init_info: {'port': 41000}... [2023-09-12 15:15:32,163][62262] Initializing env for player 6, init_info: {'port': 40300}... [2023-09-12 15:15:32,215][62262] Initializing env for player 7, init_info: {'port': 40300}... [2023-09-12 15:15:33,544][62266] Initialized w:1 v:0 player:0 [2023-09-12 15:15:33,545][62266] Initialized w:1 v:0 player:7 [2023-09-12 15:15:33,545][62266] Initialized w:1 v:0 player:1 [2023-09-12 15:15:33,545][62266] Initialized w:1 v:0 player:3 [2023-09-12 15:15:33,546][62266] Initialized w:1 v:0 player:2 [2023-09-12 15:15:33,546][62266] Initialized w:1 v:0 player:4 [2023-09-12 15:15:33,546][62266] Initialized w:1 v:0 player:5 [2023-09-12 15:15:33,547][62266] Initialized w:1 v:0 player:6 [2023-09-12 15:15:33,549][62266] 8 agent workers initialized for env 1! [2023-09-12 15:15:33,569][62305] Initializing env for player 0, init_info: {'port': 40600}... [2023-09-12 15:15:33,596][62266] Decorrelating experience for 0 frames... [2023-09-12 15:15:33,598][62266] Port 40401 is available [2023-09-12 15:15:33,599][62266] Using port 40401 [2023-09-12 15:15:33,615][62305] Using port 40600 on host... [2023-09-12 15:15:33,616][62309] Initialized w:7 v:0 player:1 [2023-09-12 15:15:33,618][62309] Initialized w:7 v:0 player:5 [2023-09-12 15:15:33,618][62309] Initialized w:7 v:0 player:4 [2023-09-12 15:15:33,618][62309] Initialized w:7 v:0 player:6 [2023-09-12 15:15:33,618][62309] Initialized w:7 v:0 player:3 [2023-09-12 15:15:33,618][62309] Initialized w:7 v:0 player:0 [2023-09-12 15:15:33,618][62309] Initialized w:7 v:0 player:2 [2023-09-12 15:15:33,618][62309] Initialized w:7 v:0 player:7 [2023-09-12 15:15:33,621][62309] 8 agent workers initialized for env 7! [2023-09-12 15:15:33,622][62305] Initializing env for player 1, init_info: {'port': 40600}... [2023-09-12 15:15:33,658][62308] Initialized w:5 v:0 player:5 [2023-09-12 15:15:33,659][62308] Initialized w:5 v:0 player:7 [2023-09-12 15:15:33,660][62308] Initialized w:5 v:0 player:0 [2023-09-12 15:15:33,661][62308] Initialized w:5 v:0 player:6 [2023-09-12 15:15:33,662][62308] Initialized w:5 v:0 player:4 [2023-09-12 15:15:33,662][62308] Initialized w:5 v:0 player:3 [2023-09-12 15:15:33,662][62308] Initialized w:5 v:0 player:2 [2023-09-12 15:15:33,663][62308] Initialized w:5 v:0 player:1 [2023-09-12 15:15:33,664][62308] 8 agent workers initialized for env 5! [2023-09-12 15:15:33,665][62309] Decorrelating experience for 0 frames... [2023-09-12 15:15:33,666][62309] Port 41001 is available [2023-09-12 15:15:33,667][62309] Using port 41001 [2023-09-12 15:15:33,672][62305] Initializing env for player 2, init_info: {'port': 40600}... [2023-09-12 15:15:33,723][62305] Initializing env for player 3, init_info: {'port': 40600}... [2023-09-12 15:15:33,748][62308] Decorrelating experience for 0 frames... [2023-09-12 15:15:33,749][62308] Port 40801 is available [2023-09-12 15:15:33,750][62308] Using port 40801 [2023-09-12 15:15:33,750][62308] Initializing env for player 0, init_info: {'port': 40801}... [2023-09-12 15:15:33,781][62308] Using port 40801 on host... [2023-09-12 15:15:33,783][62267] Initialized w:4 v:0 player:0 [2023-09-12 15:15:33,785][62267] Initialized w:4 v:0 player:4 [2023-09-12 15:15:33,785][62305] Initializing env for player 4, init_info: {'port': 40600}... [2023-09-12 15:15:33,785][62267] Initialized w:4 v:0 player:5 [2023-09-12 15:15:33,785][62267] Initialized w:4 v:0 player:6 [2023-09-12 15:15:33,785][62267] Initialized w:4 v:0 player:1 [2023-09-12 15:15:33,785][62267] Initialized w:4 v:0 player:7 [2023-09-12 15:15:33,786][62267] Initialized w:4 v:0 player:3 [2023-09-12 15:15:33,787][62267] Initialized w:4 v:0 player:2 [2023-09-12 15:15:33,788][62267] 8 agent workers initialized for env 4! [2023-09-12 15:15:33,791][62265] Initialized w:2 v:0 player:4 [2023-09-12 15:15:33,792][62265] Initialized w:2 v:0 player:0 [2023-09-12 15:15:33,792][62265] Initialized w:2 v:0 player:7 [2023-09-12 15:15:33,793][62265] Initialized w:2 v:0 player:2 [2023-09-12 15:15:33,793][62265] Initialized w:2 v:0 player:3 [2023-09-12 15:15:33,794][62265] Initialized w:2 v:0 player:1 [2023-09-12 15:15:33,794][62265] Initialized w:2 v:0 player:5 [2023-09-12 15:15:33,794][62265] Initialized w:2 v:0 player:6 [2023-09-12 15:15:33,796][62265] 8 agent workers initialized for env 2! [2023-09-12 15:15:33,800][62308] Initializing env for player 1, init_info: {'port': 40801}... [2023-09-12 15:15:33,801][62266] Initializing env for player 0, init_info: {'port': 40401}... [2023-09-12 15:15:33,842][62266] Using port 40401 on host... [2023-09-12 15:15:33,844][62265] Decorrelating experience for 0 frames... [2023-09-12 15:15:33,846][62265] Port 40501 is available [2023-09-12 15:15:33,846][62265] Using port 40501 [2023-09-12 15:15:33,847][62265] Initializing env for player 0, init_info: {'port': 40501}... [2023-09-12 15:15:33,847][62305] Initializing env for player 5, init_info: {'port': 40600}... [2023-09-12 15:15:33,851][62308] Initializing env for player 2, init_info: {'port': 40801}... [2023-09-12 15:15:33,851][62266] Initializing env for player 1, init_info: {'port': 40401}... [2023-09-12 15:15:33,868][62267] Decorrelating experience for 0 frames... [2023-09-12 15:15:33,870][62267] Port 40701 is available [2023-09-12 15:15:33,870][62267] Using port 40701 [2023-09-12 15:15:33,871][62267] Initializing env for player 0, init_info: {'port': 40701}... [2023-09-12 15:15:33,892][62265] Using port 40501 on host... [2023-09-12 15:15:33,895][62305] Initializing env for player 6, init_info: {'port': 40600}... [2023-09-12 15:15:33,901][62265] Initializing env for player 1, init_info: {'port': 40501}... [2023-09-12 15:15:33,906][62308] Initializing env for player 3, init_info: {'port': 40801}... [2023-09-12 15:15:33,906][62266] Initializing env for player 2, init_info: {'port': 40401}... [2023-09-12 15:15:33,917][62267] Using port 40701 on host... [2023-09-12 15:15:33,921][62267] Initializing env for player 1, init_info: {'port': 40701}... [2023-09-12 15:15:33,944][62307] Initialized w:6 v:0 player:5 [2023-09-12 15:15:33,945][62307] Initialized w:6 v:0 player:7 [2023-09-12 15:15:33,947][62305] Initializing env for player 7, init_info: {'port': 40600}... [2023-09-12 15:15:33,947][62307] Initialized w:6 v:0 player:2 [2023-09-12 15:15:33,948][62307] Initialized w:6 v:0 player:4 [2023-09-12 15:15:33,949][62307] Initialized w:6 v:0 player:0 [2023-09-12 15:15:33,950][62307] Initialized w:6 v:0 player:6 [2023-09-12 15:15:33,951][62307] Initialized w:6 v:0 player:3 [2023-09-12 15:15:33,956][62262] Initialized w:0 v:0 player:6 [2023-09-12 15:15:33,957][62265] Initializing env for player 2, init_info: {'port': 40501}... [2023-09-12 15:15:33,958][62262] Initialized w:0 v:0 player:7 [2023-09-12 15:15:33,959][62262] Initialized w:0 v:0 player:5 [2023-09-12 15:15:33,959][62262] Initialized w:0 v:0 player:1 [2023-09-12 15:15:33,960][62262] Initialized w:0 v:0 player:0 [2023-09-12 15:15:33,960][62262] Initialized w:0 v:0 player:4 [2023-09-12 15:15:33,960][62262] Initialized w:0 v:0 player:2 [2023-09-12 15:15:33,960][62262] Initialized w:0 v:0 player:3 [2023-09-12 15:15:33,963][62262] 8 agent workers initialized for env 0! [2023-09-12 15:15:33,964][62266] Initializing env for player 3, init_info: {'port': 40401}... [2023-09-12 15:15:33,967][62308] Initializing env for player 4, init_info: {'port': 40801}... [2023-09-12 15:15:33,970][62309] Initializing env for player 0, init_info: {'port': 41001}... [2023-09-12 15:15:33,972][62267] Initializing env for player 2, init_info: {'port': 40701}... [2023-09-12 15:15:34,006][62262] Decorrelating experience for 0 frames... [2023-09-12 15:15:34,007][62265] Initializing env for player 3, init_info: {'port': 40501}... [2023-09-12 15:15:34,008][62262] Port 40301 is available [2023-09-12 15:15:34,008][62262] Using port 40301 [2023-09-12 15:15:34,009][62262] Initializing env for player 0, init_info: {'port': 40301}... [2023-09-12 15:15:34,011][62308] Initializing env for player 5, init_info: {'port': 40801}... [2023-09-12 15:15:34,016][62309] Using port 41001 on host... [2023-09-12 15:15:34,020][62266] Initializing env for player 4, init_info: {'port': 40401}... [2023-09-12 15:15:34,025][62309] Initializing env for player 1, init_info: {'port': 41001}... [2023-09-12 15:15:34,034][62267] Initializing env for player 3, init_info: {'port': 40701}... [2023-09-12 15:15:34,056][62262] Using port 40301 on host... [2023-09-12 15:15:34,061][62308] Initializing env for player 6, init_info: {'port': 40801}... [2023-09-12 15:15:34,062][62265] Initializing env for player 4, init_info: {'port': 40501}... [2023-09-12 15:15:34,065][62262] Initializing env for player 1, init_info: {'port': 40301}... [2023-09-12 15:15:34,071][62266] Initializing env for player 5, init_info: {'port': 40401}... [2023-09-12 15:15:34,085][62309] Initializing env for player 2, init_info: {'port': 41001}... [2023-09-12 15:15:34,091][62267] Initializing env for player 4, init_info: {'port': 40701}... [2023-09-12 15:15:34,119][62265] Initializing env for player 5, init_info: {'port': 40501}... [2023-09-12 15:15:34,120][62308] Initializing env for player 7, init_info: {'port': 40801}... [2023-09-12 15:15:34,121][62266] Initializing env for player 6, init_info: {'port': 40401}... [2023-09-12 15:15:34,123][62262] Initializing env for player 2, init_info: {'port': 40301}... [2023-09-12 15:15:34,135][62309] Initializing env for player 3, init_info: {'port': 41001}... [2023-09-12 15:15:34,143][62267] Initializing env for player 5, init_info: {'port': 40701}... [2023-09-12 15:15:34,173][62262] Initializing env for player 3, init_info: {'port': 40301}... [2023-09-12 15:15:34,175][62266] Initializing env for player 7, init_info: {'port': 40401}... [2023-09-12 15:15:34,178][62265] Initializing env for player 6, init_info: {'port': 40501}... [2023-09-12 15:15:34,201][62267] Initializing env for player 6, init_info: {'port': 40701}... [2023-09-12 15:15:34,204][62309] Initializing env for player 4, init_info: {'port': 41001}... [2023-09-12 15:15:34,224][62262] Initializing env for player 4, init_info: {'port': 40301}... [2023-09-12 15:15:34,230][62265] Initializing env for player 7, init_info: {'port': 40501}... [2023-09-12 15:15:34,255][62309] Initializing env for player 5, init_info: {'port': 41001}... [2023-09-12 15:15:34,255][62267] Initializing env for player 7, init_info: {'port': 40701}... [2023-09-12 15:15:34,275][62262] Initializing env for player 5, init_info: {'port': 40301}... [2023-09-12 15:15:34,305][62309] Initializing env for player 6, init_info: {'port': 41001}... [2023-09-12 15:15:34,356][62309] Initializing env for player 7, init_info: {'port': 41001}... [2023-09-12 15:15:34,358][62262] Initializing env for player 6, init_info: {'port': 40301}... [2023-09-12 15:15:34,410][62262] Initializing env for player 7, init_info: {'port': 40301}... [2023-09-12 15:15:34,943][62307] Initialized w:6 v:0 player:1 [2023-09-12 15:15:34,945][62307] 8 agent workers initialized for env 6! [2023-09-12 15:15:35,039][62307] Decorrelating experience for 0 frames... [2023-09-12 15:15:35,111][62307] UDP port 40901 cannot be used [Errno 98] Address already in use [2023-09-12 15:15:35,111][62307] Port 41901 is available [2023-09-12 15:15:35,111][62307] Using port 41901 [2023-09-12 15:15:35,115][62307] Initializing env for player 0, init_info: {'port': 41901}... [2023-09-12 15:15:35,163][62307] Initializing env for player 1, init_info: {'port': 41901}... [2023-09-12 15:15:35,191][62307] Using port 41901 on host... [2023-09-12 15:15:35,215][62307] Initializing env for player 2, init_info: {'port': 41901}... [2023-09-12 15:15:35,271][62307] Initializing env for player 3, init_info: {'port': 41901}... [2023-09-12 15:15:35,319][62307] Initializing env for player 4, init_info: {'port': 41901}... [2023-09-12 15:15:35,395][62307] Initializing env for player 5, init_info: {'port': 41901}... [2023-09-12 15:15:35,455][62307] Initializing env for player 6, init_info: {'port': 41901}... [2023-09-12 15:15:35,499][62307] Initializing env for player 7, init_info: {'port': 41901}... [2023-09-12 15:15:35,555][62305] Initialized w:3 v:0 player:4 [2023-09-12 15:15:35,559][62305] Initialized w:3 v:0 player:6 [2023-09-12 15:15:35,559][62305] Initialized w:3 v:0 player:2 [2023-09-12 15:15:35,559][62305] Initialized w:3 v:0 player:5 [2023-09-12 15:15:35,559][62305] Initialized w:3 v:0 player:0 [2023-09-12 15:15:35,560][62305] Initialized w:3 v:0 player:1 [2023-09-12 15:15:35,560][62305] Initialized w:3 v:0 player:7 [2023-09-12 15:15:35,560][62305] Initialized w:3 v:0 player:3 [2023-09-12 15:15:35,564][62305] 8 agent workers initialized for env 3! [2023-09-12 15:15:35,625][62305] Decorrelating experience for 0 frames... [2023-09-12 15:15:35,626][62305] Port 40601 is available [2023-09-12 15:15:35,626][62305] Using port 40601 [2023-09-12 15:15:35,627][62305] Initializing env for player 0, init_info: {'port': 40601}... [2023-09-12 15:15:35,665][62305] Using port 40601 on host... [2023-09-12 15:15:35,677][62305] Initializing env for player 1, init_info: {'port': 40601}... [2023-09-12 15:15:35,728][62305] Initializing env for player 2, init_info: {'port': 40601}... [2023-09-12 15:15:35,778][62305] Initializing env for player 3, init_info: {'port': 40601}... [2023-09-12 15:15:35,839][62305] Initializing env for player 4, init_info: {'port': 40601}... [2023-09-12 15:15:35,906][62305] Initializing env for player 5, init_info: {'port': 40601}... [2023-09-12 15:15:35,911][62266] Initialized w:1 v:1 player:7 [2023-09-12 15:15:35,912][62266] Initialized w:1 v:1 player:3 [2023-09-12 15:15:35,914][62266] Initialized w:1 v:1 player:6 [2023-09-12 15:15:35,914][62266] Initialized w:1 v:1 player:0 [2023-09-12 15:15:35,915][62308] Initialized w:5 v:1 player:5 [2023-09-12 15:15:35,914][62266] Initialized w:1 v:1 player:5 [2023-09-12 15:15:35,916][62308] Initialized w:5 v:1 player:4 [2023-09-12 15:15:35,915][62266] Initialized w:1 v:1 player:1 [2023-09-12 15:15:35,915][62266] Initialized w:1 v:1 player:4 [2023-09-12 15:15:35,916][62266] Initialized w:1 v:1 player:2 [2023-09-12 15:15:35,916][62308] Initialized w:5 v:1 player:1 [2023-09-12 15:15:35,917][62308] Initialized w:5 v:1 player:3 [2023-09-12 15:15:35,917][62308] Initialized w:5 v:1 player:0 [2023-09-12 15:15:35,918][62308] Initialized w:5 v:1 player:6 [2023-09-12 15:15:35,918][62266] 8 agent workers initialized for env 1! [2023-09-12 15:15:35,917][62308] Initialized w:5 v:1 player:7 [2023-09-12 15:15:35,920][62308] Initialized w:5 v:1 player:2 [2023-09-12 15:15:35,920][62308] 8 agent workers initialized for env 5! [2023-09-12 15:15:35,952][62305] Initializing env for player 6, init_info: {'port': 40601}... [2023-09-12 15:15:35,967][62266] Decorrelating experience for 32 frames... [2023-09-12 15:15:35,969][62308] Decorrelating experience for 32 frames... [2023-09-12 15:15:35,991][62265] Initialized w:2 v:1 player:1 [2023-09-12 15:15:35,991][62265] Initialized w:2 v:1 player:7 [2023-09-12 15:15:35,993][62265] Initialized w:2 v:1 player:6 [2023-09-12 15:15:35,993][62265] Initialized w:2 v:1 player:4 [2023-09-12 15:15:35,993][62265] Initialized w:2 v:1 player:5 [2023-09-12 15:15:35,995][62265] Initialized w:2 v:1 player:2 [2023-09-12 15:15:35,995][62265] Initialized w:2 v:1 player:0 [2023-09-12 15:15:35,996][62265] Initialized w:2 v:1 player:3 [2023-09-12 15:15:35,997][62265] 8 agent workers initialized for env 2! [2023-09-12 15:15:36,011][62305] Initializing env for player 7, init_info: {'port': 40601}... [2023-09-12 15:15:36,043][62265] Decorrelating experience for 32 frames... [2023-09-12 15:15:36,099][62309] Initialized w:7 v:1 player:3 [2023-09-12 15:15:36,101][62309] Initialized w:7 v:1 player:4 [2023-09-12 15:15:36,101][62309] Initialized w:7 v:1 player:5 [2023-09-12 15:15:36,102][62309] Initialized w:7 v:1 player:1 [2023-09-12 15:15:36,103][62309] Initialized w:7 v:1 player:7 [2023-09-12 15:15:36,104][62309] Initialized w:7 v:1 player:2 [2023-09-12 15:15:36,105][62309] Initialized w:7 v:1 player:0 [2023-09-12 15:15:36,105][62309] Initialized w:7 v:1 player:6 [2023-09-12 15:15:36,106][62309] 8 agent workers initialized for env 7! [2023-09-12 15:15:36,164][62309] Decorrelating experience for 32 frames... [2023-09-12 15:15:36,282][62262] Initialized w:0 v:1 player:7 [2023-09-12 15:15:36,285][62262] Initialized w:0 v:1 player:1 [2023-09-12 15:15:36,286][62262] Initialized w:0 v:1 player:2 [2023-09-12 15:15:36,286][62262] Initialized w:0 v:1 player:6 [2023-09-12 15:15:36,286][62262] Initialized w:0 v:1 player:4 [2023-09-12 15:15:36,285][62262] Initialized w:0 v:1 player:0 [2023-09-12 15:15:36,286][62262] Initialized w:0 v:1 player:5 [2023-09-12 15:15:36,287][62262] Initialized w:0 v:1 player:3 [2023-09-12 15:15:36,290][62262] 8 agent workers initialized for env 0! [2023-09-12 15:15:36,310][62267] Initialized w:4 v:1 player:6 [2023-09-12 15:15:36,311][62267] Initialized w:4 v:1 player:1 [2023-09-12 15:15:36,311][62267] Initialized w:4 v:1 player:4 [2023-09-12 15:15:36,316][62267] Initialized w:4 v:1 player:5 [2023-09-12 15:15:36,319][62267] Initialized w:4 v:1 player:3 [2023-09-12 15:15:36,329][62267] Initialized w:4 v:1 player:2 [2023-09-12 15:15:36,320][62267] Initialized w:4 v:1 player:7 [2023-09-12 15:15:36,331][62267] Initialized w:4 v:1 player:0 [2023-09-12 15:15:36,332][62267] 8 agent workers initialized for env 4! [2023-09-12 15:15:36,365][62262] Decorrelating experience for 32 frames... [2023-09-12 15:15:36,412][62267] Decorrelating experience for 32 frames... [2023-09-12 15:15:36,531][62308] Multi agent env, num agents: 8 [2023-09-12 15:15:36,534][62266] Multi agent env, num agents: 8 [2023-09-12 15:15:36,580][62308] Multi agent env, num agents: 8 [2023-09-12 15:15:36,584][62266] Multi agent env, num agents: 8 [2023-09-12 15:15:36,589][62265] Multi agent env, num agents: 8 [2023-09-12 15:15:36,632][62308] Port 40802 is available [2023-09-12 15:15:36,632][62308] Using port 40802 [2023-09-12 15:15:36,635][62265] Multi agent env, num agents: 8 [2023-09-12 15:15:36,641][62266] Port 40402 is available [2023-09-12 15:15:36,641][62266] Using port 40402 [2023-09-12 15:15:36,642][62266] Initializing env for player 0, init_info: {'port': 40402}... [2023-09-12 15:15:36,675][62266] Using port 40402 on host... [2023-09-12 15:15:36,685][62265] Port 40502 is available [2023-09-12 15:15:36,685][62265] Using port 40502 [2023-09-12 15:15:36,686][62265] Initializing env for player 0, init_info: {'port': 40502}... [2023-09-12 15:15:36,695][62266] Initializing env for player 1, init_info: {'port': 40402}... [2023-09-12 15:15:36,711][62309] Multi agent env, num agents: 8 [2023-09-12 15:15:36,730][62265] Using port 40502 on host... [2023-09-12 15:15:36,740][62265] Initializing env for player 1, init_info: {'port': 40502}... [2023-09-12 15:15:36,747][62266] Initializing env for player 2, init_info: {'port': 40402}... [2023-09-12 15:15:36,752][62309] Multi agent env, num agents: 8 [2023-09-12 15:15:36,796][62309] Port 41002 is available [2023-09-12 15:15:36,796][62309] Using port 41002 [2023-09-12 15:15:36,797][62309] Initializing env for player 0, init_info: {'port': 41002}... [2023-09-12 15:15:36,802][62266] Initializing env for player 3, init_info: {'port': 40402}... [2023-09-12 15:15:36,804][62265] Initializing env for player 2, init_info: {'port': 40502}... [2023-09-12 15:15:36,832][62309] Using port 41002 on host... [2023-09-12 15:15:36,847][62309] Initializing env for player 1, init_info: {'port': 41002}... [2023-09-12 15:15:36,855][62266] Initializing env for player 4, init_info: {'port': 40402}... [2023-09-12 15:15:36,863][62265] Initializing env for player 3, init_info: {'port': 40502}... [2023-09-12 15:15:36,892][62262] Multi agent env, num agents: 8 [2023-09-12 15:15:36,901][62309] Initializing env for player 2, init_info: {'port': 41002}... [2023-09-12 15:15:36,904][62267] Multi agent env, num agents: 8 [2023-09-12 15:15:36,907][62266] Initializing env for player 5, init_info: {'port': 40402}... [2023-09-12 15:15:36,913][62265] Initializing env for player 4, init_info: {'port': 40502}... [2023-09-12 15:15:36,928][62262] Multi agent env, num agents: 8 [2023-09-12 15:15:36,934][62267] Multi agent env, num agents: 8 [2023-09-12 15:15:36,955][62309] Initializing env for player 3, init_info: {'port': 41002}... [2023-09-12 15:15:36,967][62265] Initializing env for player 5, init_info: {'port': 40502}... [2023-09-12 15:15:37,011][62309] Initializing env for player 4, init_info: {'port': 41002}... [2023-09-12 15:15:36,975][62266] Initializing env for player 6, init_info: {'port': 40402}... [2023-09-12 15:15:37,017][62267] Port 40702 is available [2023-09-12 15:15:37,017][62267] Using port 40702 [2023-09-12 15:15:37,017][62267] Initializing env for player 0, init_info: {'port': 40702}... [2023-09-12 15:15:37,019][62265] Initializing env for player 6, init_info: {'port': 40502}... [2023-09-12 15:15:37,043][62266] Initializing env for player 7, init_info: {'port': 40402}... [2023-09-12 15:15:37,047][62267] Using port 40702 on host... [2023-09-12 15:15:37,071][62262] Port 40302 is available [2023-09-12 15:15:37,071][62262] Using port 40302 [2023-09-12 15:15:37,071][62267] Initializing env for player 1, init_info: {'port': 40702}... [2023-09-12 15:15:37,072][62262] Initializing env for player 0, init_info: {'port': 40302}... [2023-09-12 15:15:37,076][62309] Initializing env for player 5, init_info: {'port': 41002}... [2023-09-12 15:15:37,105][62262] Using port 40302 on host... [2023-09-12 15:15:37,111][62307] Initialized w:6 v:1 player:5 [2023-09-12 15:15:37,113][62307] Initialized w:6 v:1 player:2 [2023-09-12 15:15:37,114][62307] Initialized w:6 v:1 player:1 [2023-09-12 15:15:37,115][62307] Initialized w:6 v:1 player:4 [2023-09-12 15:15:37,115][62307] Initialized w:6 v:1 player:3 [2023-09-12 15:15:37,116][62307] Initialized w:6 v:1 player:7 [2023-09-12 15:15:37,119][62307] Initialized w:6 v:1 player:6 [2023-09-12 15:15:37,119][62307] Initialized w:6 v:1 player:0 [2023-09-12 15:15:37,121][62307] 8 agent workers initialized for env 6! [2023-09-12 15:15:37,123][62267] Initializing env for player 2, init_info: {'port': 40702}... [2023-09-12 15:15:37,122][62262] Initializing env for player 1, init_info: {'port': 40302}... [2023-09-12 15:15:37,127][62309] Initializing env for player 6, init_info: {'port': 41002}... [2023-09-12 15:15:37,151][62265] Initializing env for player 7, init_info: {'port': 40502}... [2023-09-12 15:15:37,172][62262] Initializing env for player 2, init_info: {'port': 40302}... [2023-09-12 15:15:37,178][62309] Initializing env for player 7, init_info: {'port': 41002}... [2023-09-12 15:15:37,191][62267] Initializing env for player 3, init_info: {'port': 40702}... [2023-09-12 15:15:37,205][62307] Decorrelating experience for 32 frames... [2023-09-12 15:15:37,227][62262] Initializing env for player 3, init_info: {'port': 40302}... [2023-09-12 15:15:37,259][62267] Initializing env for player 4, init_info: {'port': 40702}... [2023-09-12 15:15:37,279][62262] Initializing env for player 4, init_info: {'port': 40302}... [2023-09-12 15:15:37,307][62267] Initializing env for player 5, init_info: {'port': 40702}... [2023-09-12 15:15:37,329][62262] Initializing env for player 5, init_info: {'port': 40302}... [2023-09-12 15:15:37,363][62267] Initializing env for player 6, init_info: {'port': 40702}... [2023-09-12 15:15:37,383][62262] Initializing env for player 6, init_info: {'port': 40302}... [2023-09-12 15:15:37,397][62305] Initialized w:3 v:1 player:7 [2023-09-12 15:15:37,400][62305] Initialized w:3 v:1 player:6 [2023-09-12 15:15:37,400][62305] Initialized w:3 v:1 player:4 [2023-09-12 15:15:37,402][62305] Initialized w:3 v:1 player:2 [2023-09-12 15:15:37,406][62305] Initialized w:3 v:1 player:1 [2023-09-12 15:15:37,408][62305] Initialized w:3 v:1 player:0 [2023-09-12 15:15:37,410][62305] Initialized w:3 v:1 player:3 [2023-09-12 15:15:37,415][62267] Initializing env for player 7, init_info: {'port': 40702}... [2023-09-12 15:15:37,424][62305] Initialized w:3 v:1 player:5 [2023-09-12 15:15:37,426][62305] 8 agent workers initialized for env 3! [2023-09-12 15:15:37,439][62262] Initializing env for player 7, init_info: {'port': 40302}... [2023-09-12 15:15:37,453][62308] Initializing env for player 0, init_info: {'port': 40802}... [2023-09-12 15:15:37,504][62308] Initializing env for player 1, init_info: {'port': 40802}... [2023-09-12 15:15:37,526][62305] Decorrelating experience for 32 frames... [2023-09-12 15:15:37,555][62308] Using port 40802 on host... [2023-09-12 15:15:37,556][62308] Initializing env for player 2, init_info: {'port': 40802}... [2023-09-12 15:15:37,611][62308] Initializing env for player 3, init_info: {'port': 40802}... [2023-09-12 15:15:37,664][62308] Initializing env for player 4, init_info: {'port': 40802}... [2023-09-12 15:15:37,715][62308] Initializing env for player 5, init_info: {'port': 40802}... [2023-09-12 15:15:37,819][62308] Initializing env for player 6, init_info: {'port': 40802}... [2023-09-12 15:15:37,851][62308] Initializing env for player 7, init_info: {'port': 40802}... [2023-09-12 15:15:38,207][62305] Multi agent env, num agents: 8 [2023-09-12 15:15:38,254][62305] Multi agent env, num agents: 8 [2023-09-12 15:15:38,300][62307] Multi agent env, num agents: 8 [2023-09-12 15:15:38,323][62305] Port 40602 is available [2023-09-12 15:15:38,323][62305] Using port 40602 [2023-09-12 15:15:38,324][62305] Initializing env for player 0, init_info: {'port': 40602}... [2023-09-12 15:15:38,355][62307] Multi agent env, num agents: 8 [2023-09-12 15:15:38,368][62305] Using port 40602 on host... [2023-09-12 15:15:38,380][62305] Initializing env for player 1, init_info: {'port': 40602}... [2023-09-12 15:15:38,400][62307] Port 40902 is available [2023-09-12 15:15:38,400][62307] Using port 40902 [2023-09-12 15:15:38,435][62305] Initializing env for player 2, init_info: {'port': 40602}... [2023-09-12 15:15:38,495][62305] Initializing env for player 3, init_info: {'port': 40602}... [2023-09-12 15:15:38,552][62305] Initializing env for player 4, init_info: {'port': 40602}... [2023-09-12 15:15:38,603][62305] Initializing env for player 5, init_info: {'port': 40602}... [2023-09-12 15:15:38,607][62266] Initialized w:1 v:2 player:2 [2023-09-12 15:15:38,610][62266] Initialized w:1 v:2 player:6 [2023-09-12 15:15:38,611][62266] Initialized w:1 v:2 player:0 [2023-09-12 15:15:38,611][62266] Initialized w:1 v:2 player:4 [2023-09-12 15:15:38,611][62266] Initialized w:1 v:2 player:3 [2023-09-12 15:15:38,612][62266] Initialized w:1 v:2 player:1 [2023-09-12 15:15:38,613][62266] Initialized w:1 v:2 player:5 [2023-09-12 15:15:38,613][62266] Initialized w:1 v:2 player:7 [2023-09-12 15:15:38,616][62266] 8 agent workers initialized for env 1! [2023-09-12 15:15:38,659][62305] Initializing env for player 6, init_info: {'port': 40602}... [2023-09-12 15:15:38,678][62266] Decorrelating experience for 64 frames... [2023-09-12 15:15:38,711][62305] Initializing env for player 7, init_info: {'port': 40602}... [2023-09-12 15:15:38,722][62309] Initialized w:7 v:2 player:4 [2023-09-12 15:15:38,726][62309] Initialized w:7 v:2 player:7 [2023-09-12 15:15:38,726][62309] Initialized w:7 v:2 player:1 [2023-09-12 15:15:38,726][62309] Initialized w:7 v:2 player:6 [2023-09-12 15:15:38,726][62309] Initialized w:7 v:2 player:3 [2023-09-12 15:15:38,726][62309] Initialized w:7 v:2 player:2 [2023-09-12 15:15:38,727][62309] Initialized w:7 v:2 player:0 [2023-09-12 15:15:38,727][62309] Initialized w:7 v:2 player:5 [2023-09-12 15:15:38,729][62309] 8 agent workers initialized for env 7! [2023-09-12 15:15:38,779][62309] Decorrelating experience for 64 frames... [2023-09-12 15:15:38,926][62265] Initialized w:2 v:2 player:2 [2023-09-12 15:15:38,928][62265] Initialized w:2 v:2 player:7 [2023-09-12 15:15:38,928][62265] Initialized w:2 v:2 player:3 [2023-09-12 15:15:38,929][62265] Initialized w:2 v:2 player:1 [2023-09-12 15:15:38,930][62265] Initialized w:2 v:2 player:6 [2023-09-12 15:15:38,930][62265] Initialized w:2 v:2 player:4 [2023-09-12 15:15:38,931][62265] Initialized w:2 v:2 player:0 [2023-09-12 15:15:38,931][62265] Initialized w:2 v:2 player:5 [2023-09-12 15:15:38,933][62265] 8 agent workers initialized for env 2! [2023-09-12 15:15:38,972][62307] Initializing env for player 0, init_info: {'port': 40902}... [2023-09-12 15:15:39,007][62265] Decorrelating experience for 64 frames... [2023-09-12 15:15:39,012][62307] Using port 40902 on host... [2023-09-12 15:15:39,022][62307] Initializing env for player 1, init_info: {'port': 40902}... [2023-09-12 15:15:39,087][62307] Initializing env for player 2, init_info: {'port': 40902}... [2023-09-12 15:15:39,139][62307] Initializing env for player 3, init_info: {'port': 40902}... [2023-09-12 15:15:39,187][62307] Initializing env for player 4, init_info: {'port': 40902}... [2023-09-12 15:15:39,229][62267] Initialized w:4 v:2 player:7 [2023-09-12 15:15:39,231][62267] Initialized w:4 v:2 player:3 [2023-09-12 15:15:39,232][62267] Initialized w:4 v:2 player:4 [2023-09-12 15:15:39,233][62267] Initialized w:4 v:2 player:2 [2023-09-12 15:15:39,233][62267] Initialized w:4 v:2 player:5 [2023-09-12 15:15:39,237][62267] Initialized w:4 v:2 player:6 [2023-09-12 15:15:39,239][62267] Initialized w:4 v:2 player:1 [2023-09-12 15:15:39,243][62267] Initialized w:4 v:2 player:0 [2023-09-12 15:15:39,244][62267] 8 agent workers initialized for env 4! [2023-09-12 15:15:39,243][62307] Initializing env for player 5, init_info: {'port': 40902}... [2023-09-12 15:15:39,299][62307] Initializing env for player 6, init_info: {'port': 40902}... [2023-09-12 15:15:39,375][62307] Initializing env for player 7, init_info: {'port': 40902}... [2023-09-12 15:15:39,392][62308] Initialized w:5 v:2 player:2 [2023-09-12 15:15:39,392][62308] Initialized w:5 v:2 player:3 [2023-09-12 15:15:39,395][62267] Decorrelating experience for 64 frames... [2023-09-12 15:15:39,395][62308] Initialized w:5 v:2 player:6 [2023-09-12 15:15:39,395][62308] Initialized w:5 v:2 player:0 [2023-09-12 15:15:39,396][62308] Initialized w:5 v:2 player:7 [2023-09-12 15:15:39,396][62308] Initialized w:5 v:2 player:5 [2023-09-12 15:15:39,397][62308] Initialized w:5 v:2 player:1 [2023-09-12 15:15:39,397][62308] Initialized w:5 v:2 player:4 [2023-09-12 15:15:39,400][62308] 8 agent workers initialized for env 5! [2023-09-12 15:15:39,434][62262] Initialized w:0 v:2 player:2 [2023-09-12 15:15:39,435][62262] Initialized w:0 v:2 player:1 [2023-09-12 15:15:39,436][62262] Initialized w:0 v:2 player:6 [2023-09-12 15:15:39,436][62262] Initialized w:0 v:2 player:3 [2023-09-12 15:15:39,437][62262] Initialized w:0 v:2 player:0 [2023-09-12 15:15:39,445][62262] Initialized w:0 v:2 player:4 [2023-09-12 15:15:39,445][62262] Initialized w:0 v:2 player:5 [2023-09-12 15:15:39,465][62308] Decorrelating experience for 64 frames... [2023-09-12 15:15:39,732][62309] Port 41003 is available [2023-09-12 15:15:39,732][62309] Using port 41003 [2023-09-12 15:15:39,791][62266] Port 40403 is available [2023-09-12 15:15:39,791][62266] Using port 40403 [2023-09-12 15:15:39,792][62266] Initializing env for player 0, init_info: {'port': 40403}... [2023-09-12 15:15:39,826][62266] Using port 40403 on host... [2023-09-12 15:15:39,843][62266] Initializing env for player 1, init_info: {'port': 40403}... [2023-09-12 15:15:39,896][62266] Initializing env for player 2, init_info: {'port': 40403}... [2023-09-12 15:15:39,946][62266] Initializing env for player 3, init_info: {'port': 40403}... [2023-09-12 15:15:40,002][62266] Initializing env for player 4, init_info: {'port': 40403}... [2023-09-12 15:15:40,023][62265] Port 40503 is available [2023-09-12 15:15:40,024][62265] Using port 40503 [2023-09-12 15:15:40,059][62266] Initializing env for player 5, init_info: {'port': 40403}... [2023-09-12 15:15:40,115][62266] Initializing env for player 6, init_info: {'port': 40403}... [2023-09-12 15:15:40,175][62266] Initializing env for player 7, init_info: {'port': 40403}... [2023-09-12 15:15:40,208][62305] Initialized w:3 v:2 player:7 [2023-09-12 15:15:40,209][62305] Initialized w:3 v:2 player:1 [2023-09-12 15:15:40,209][62305] Initialized w:3 v:2 player:6 [2023-09-12 15:15:40,210][62305] Initialized w:3 v:2 player:4 [2023-09-12 15:15:40,210][62305] Initialized w:3 v:2 player:0 [2023-09-12 15:15:40,210][62305] Initialized w:3 v:2 player:5 [2023-09-12 15:15:40,213][62305] Initialized w:3 v:2 player:2 [2023-09-12 15:15:40,424][62308] Port 40803 is available [2023-09-12 15:15:40,424][62308] Using port 40803 [2023-09-12 15:15:40,434][62262] Initialized w:0 v:2 player:7 [2023-09-12 15:15:40,435][62262] 8 agent workers initialized for env 0! [2023-09-12 15:15:40,475][62308] Initializing env for player 0, init_info: {'port': 40803}... [2023-09-12 15:15:40,505][62262] Decorrelating experience for 64 frames... [2023-09-12 15:15:40,520][62308] Using port 40803 on host... [2023-09-12 15:15:40,532][62308] Initializing env for player 1, init_info: {'port': 40803}... [2023-09-12 15:15:40,591][62308] Initializing env for player 2, init_info: {'port': 40803}... [2023-09-12 15:15:40,647][62308] Initializing env for player 3, init_info: {'port': 40803}... [2023-09-12 15:15:40,659][62267] UDP port 40703 cannot be used [Errno 98] Address already in use [2023-09-12 15:15:40,659][62267] Port 41703 is available [2023-09-12 15:15:40,660][62267] Using port 41703 [2023-09-12 15:15:40,660][62267] Initializing env for player 0, init_info: {'port': 41703}... [2023-09-12 15:15:40,696][62267] Using port 41703 on host... [2023-09-12 15:15:40,699][62308] Initializing env for player 4, init_info: {'port': 40803}... [2023-09-12 15:15:40,711][62267] Initializing env for player 1, init_info: {'port': 41703}... [2023-09-12 15:15:40,761][62267] Initializing env for player 2, init_info: {'port': 41703}... [2023-09-12 15:15:40,767][62308] Initializing env for player 5, init_info: {'port': 40803}... [2023-09-12 15:15:40,817][62267] Initializing env for player 3, init_info: {'port': 41703}... [2023-09-12 15:15:40,819][62308] Initializing env for player 6, init_info: {'port': 40803}... [2023-09-12 15:15:40,867][62308] Initializing env for player 7, init_info: {'port': 40803}... [2023-09-12 15:15:40,871][62267] Initializing env for player 4, init_info: {'port': 41703}... [2023-09-12 15:15:40,931][62267] Initializing env for player 5, init_info: {'port': 41703}... [2023-09-12 15:15:40,995][62267] Initializing env for player 6, init_info: {'port': 41703}... [2023-09-12 15:15:41,031][62267] Initializing env for player 7, init_info: {'port': 41703}... [2023-09-12 15:15:41,100][62307] Initialized w:6 v:2 player:7 [2023-09-12 15:15:41,101][62307] Initialized w:6 v:2 player:2 [2023-09-12 15:15:41,102][62307] Initialized w:6 v:2 player:4 [2023-09-12 15:15:41,103][62307] Initialized w:6 v:2 player:6 [2023-09-12 15:15:41,104][62307] Initialized w:6 v:2 player:0 [2023-09-12 15:15:41,104][62307] Initialized w:6 v:2 player:5 [2023-09-12 15:15:41,106][62307] Initialized w:6 v:2 player:3 [2023-09-12 15:15:41,206][62305] Initialized w:3 v:2 player:3 [2023-09-12 15:15:41,208][62305] 8 agent workers initialized for env 3! [2023-09-12 15:15:41,255][62309] Initializing env for player 0, init_info: {'port': 41003}... [2023-09-12 15:15:41,299][62309] Using port 41003 on host... [2023-09-12 15:15:41,310][62309] Initializing env for player 1, init_info: {'port': 41003}... [2023-09-12 15:15:41,313][62305] Decorrelating experience for 64 frames... [2023-09-12 15:15:41,369][62309] Initializing env for player 2, init_info: {'port': 41003}... [2023-09-12 15:15:41,425][62309] Initializing env for player 3, init_info: {'port': 41003}... [2023-09-12 15:15:41,428][62262] Port 40303 is available [2023-09-12 15:15:41,429][62262] Using port 40303 [2023-09-12 15:15:41,429][62262] Initializing env for player 0, init_info: {'port': 40303}... [2023-09-12 15:15:41,474][62262] Using port 40303 on host... [2023-09-12 15:15:41,486][62262] Initializing env for player 1, init_info: {'port': 40303}... [2023-09-12 15:15:41,487][62309] Initializing env for player 4, init_info: {'port': 41003}... [2023-09-12 15:15:41,535][62309] Initializing env for player 5, init_info: {'port': 41003}... [2023-09-12 15:15:41,544][62262] Initializing env for player 2, init_info: {'port': 40303}... [2023-09-12 15:15:41,587][62309] Initializing env for player 6, init_info: {'port': 41003}... [2023-09-12 15:15:41,604][62262] Initializing env for player 3, init_info: {'port': 40303}... [2023-09-12 15:15:41,638][62309] Initializing env for player 7, init_info: {'port': 41003}... [2023-09-12 15:15:41,642][62266] Initialized w:1 v:3 player:1 [2023-09-12 15:15:41,643][62266] Initialized w:1 v:3 player:0 [2023-09-12 15:15:41,643][62266] Initialized w:1 v:3 player:2 [2023-09-12 15:15:41,646][62266] Initialized w:1 v:3 player:3 [2023-09-12 15:15:41,646][62266] Initialized w:1 v:3 player:7 [2023-09-12 15:15:41,646][62266] Initialized w:1 v:3 player:4 [2023-09-12 15:15:41,647][62266] Initialized w:1 v:3 player:6 [2023-09-12 15:15:41,648][62266] Initialized w:1 v:3 player:5 [2023-09-12 15:15:41,648][62266] 8 agent workers initialized for env 1! [2023-09-12 15:15:41,654][62262] Initializing env for player 4, init_info: {'port': 40303}... [2023-09-12 15:15:41,702][62266] Decorrelating experience for 96 frames... [2023-09-12 15:15:41,706][62262] Initializing env for player 5, init_info: {'port': 40303}... [2023-09-12 15:15:41,756][62262] Initializing env for player 6, init_info: {'port': 40303}... [2023-09-12 15:15:41,827][62262] Initializing env for player 7, init_info: {'port': 40303}... [2023-09-12 15:15:42,093][62307] Initialized w:6 v:2 player:1 [2023-09-12 15:15:42,095][62307] 8 agent workers initialized for env 6! [2023-09-12 15:15:42,107][62265] Initializing env for player 0, init_info: {'port': 40503}... [2023-09-12 15:15:42,142][62308] Initialized w:5 v:3 player:5 [2023-09-12 15:15:42,147][62308] Initialized w:5 v:3 player:3 [2023-09-12 15:15:42,149][62308] Initialized w:5 v:3 player:6 [2023-09-12 15:15:42,149][62308] Initialized w:5 v:3 player:4 [2023-09-12 15:15:42,150][62308] Initialized w:5 v:3 player:7 [2023-09-12 15:15:42,151][62308] Initialized w:5 v:3 player:1 [2023-09-12 15:15:42,152][62308] Initialized w:5 v:3 player:0 [2023-09-12 15:15:42,154][62308] Initialized w:5 v:3 player:2 [2023-09-12 15:15:42,154][62308] 8 agent workers initialized for env 5! [2023-09-12 15:15:42,162][62265] Initializing env for player 1, init_info: {'port': 40503}... [2023-09-12 15:15:42,203][62307] Decorrelating experience for 64 frames... [2023-09-12 15:15:42,207][62265] Initializing env for player 2, init_info: {'port': 40503}... [2023-09-12 15:15:42,233][62265] Using port 40503 on host... [2023-09-12 15:15:42,236][62308] Decorrelating experience for 96 frames... [2023-09-12 15:15:42,263][62265] Initializing env for player 3, init_info: {'port': 40503}... [2023-09-12 15:15:42,277][62305] Port 40603 is available [2023-09-12 15:15:42,277][62305] Using port 40603 [2023-09-12 15:15:42,278][62305] Initializing env for player 0, init_info: {'port': 40603}... [2023-09-12 15:15:42,315][62305] Using port 40603 on host... [2023-09-12 15:15:42,329][62305] Initializing env for player 1, init_info: {'port': 40603}... [2023-09-12 15:15:42,323][62265] Initializing env for player 4, init_info: {'port': 40503}... [2023-09-12 15:15:42,379][62305] Initializing env for player 2, init_info: {'port': 40603}... [2023-09-12 15:15:42,384][62265] Initializing env for player 5, init_info: {'port': 40503}... [2023-09-12 15:15:42,423][62265] Initializing env for player 6, init_info: {'port': 40503}... [2023-09-12 15:15:42,431][62305] Initializing env for player 3, init_info: {'port': 40603}... [2023-09-12 15:15:42,491][62305] Initializing env for player 4, init_info: {'port': 40603}... [2023-09-12 15:15:42,511][62265] Initializing env for player 7, init_info: {'port': 40503}... [2023-09-12 15:15:42,540][62305] Initializing env for player 5, init_info: {'port': 40603}... [2023-09-12 15:15:42,591][62305] Initializing env for player 6, init_info: {'port': 40603}... [2023-09-12 15:15:42,643][62305] Initializing env for player 7, init_info: {'port': 40603}... [2023-09-12 15:15:42,767][62267] Initialized w:4 v:3 player:0 [2023-09-12 15:15:42,768][62267] Initialized w:4 v:3 player:1 [2023-09-12 15:15:42,768][62267] Initialized w:4 v:3 player:2 [2023-09-12 15:15:42,768][62267] Initialized w:4 v:3 player:7 [2023-09-12 15:15:42,768][62267] Initialized w:4 v:3 player:5 [2023-09-12 15:15:42,768][62267] Initialized w:4 v:3 player:6 [2023-09-12 15:15:42,767][62267] Initialized w:4 v:3 player:4 [2023-09-12 15:15:42,769][62267] Initialized w:4 v:3 player:3 [2023-09-12 15:15:42,771][62267] 8 agent workers initialized for env 4! [2023-09-12 15:15:42,827][62267] Decorrelating experience for 96 frames... [2023-09-12 15:15:43,051][62309] Initialized w:7 v:3 player:5 [2023-09-12 15:15:43,052][62309] Initialized w:7 v:3 player:1 [2023-09-12 15:15:43,053][62309] Initialized w:7 v:3 player:7 [2023-09-12 15:15:43,057][62309] Initialized w:7 v:3 player:0 [2023-09-12 15:15:43,057][62309] Initialized w:7 v:3 player:2 [2023-09-12 15:15:43,059][62309] Initialized w:7 v:3 player:3 [2023-09-12 15:15:43,060][62309] Initialized w:7 v:3 player:6 [2023-09-12 15:15:43,061][62309] Initialized w:7 v:3 player:4 [2023-09-12 15:15:43,062][62309] 8 agent workers initialized for env 7! [2023-09-12 15:15:43,150][62309] Decorrelating experience for 96 frames... [2023-09-12 15:15:43,315][62262] Initialized w:0 v:3 player:4 [2023-09-12 15:15:43,317][62262] Initialized w:0 v:3 player:6 [2023-09-12 15:15:43,317][62262] Initialized w:0 v:3 player:3 [2023-09-12 15:15:43,317][62262] Initialized w:0 v:3 player:1 [2023-09-12 15:15:43,318][62262] Initialized w:0 v:3 player:5 [2023-09-12 15:15:43,318][62262] Initialized w:0 v:3 player:7 [2023-09-12 15:15:43,319][62262] Initialized w:0 v:3 player:0 [2023-09-12 15:15:43,321][62262] Initialized w:0 v:3 player:2 [2023-09-12 15:15:43,321][62262] 8 agent workers initialized for env 0! [2023-09-12 15:15:43,384][62262] Decorrelating experience for 96 frames... [2023-09-12 15:15:43,790][62307] Port 40903 is available [2023-09-12 15:15:43,790][62307] Using port 40903 [2023-09-12 15:15:43,791][62307] Initializing env for player 0, init_info: {'port': 40903}... [2023-09-12 15:15:43,823][62307] Using port 40903 on host... [2023-09-12 15:15:43,843][62307] Initializing env for player 1, init_info: {'port': 40903}... [2023-09-12 15:15:43,892][62307] Initializing env for player 2, init_info: {'port': 40903}... [2023-09-12 15:15:43,951][62307] Initializing env for player 3, init_info: {'port': 40903}... [2023-09-12 15:15:43,999][62307] Initializing env for player 4, init_info: {'port': 40903}... [2023-09-12 15:15:44,048][62265] Initialized w:2 v:3 player:3 [2023-09-12 15:15:44,052][62265] Initialized w:2 v:3 player:1 [2023-09-12 15:15:44,053][62265] Initialized w:2 v:3 player:5 [2023-09-12 15:15:44,053][62265] Initialized w:2 v:3 player:4 [2023-09-12 15:15:44,053][62265] Initialized w:2 v:3 player:2 [2023-09-12 15:15:44,053][62265] Initialized w:2 v:3 player:6 [2023-09-12 15:15:44,054][62265] Initialized w:2 v:3 player:0 [2023-09-12 15:15:44,054][62265] Initialized w:2 v:3 player:7 [2023-09-12 15:15:44,057][62265] 8 agent workers initialized for env 2! [2023-09-12 15:15:44,062][62305] Initialized w:3 v:3 player:0 [2023-09-12 15:15:44,065][62305] Initialized w:3 v:3 player:5 [2023-09-12 15:15:44,066][62305] Initialized w:3 v:3 player:4 [2023-09-12 15:15:44,067][62305] Initialized w:3 v:3 player:3 [2023-09-12 15:15:44,068][62305] Initialized w:3 v:3 player:7 [2023-09-12 15:15:44,068][62305] Initialized w:3 v:3 player:2 [2023-09-12 15:15:44,070][62305] Initialized w:3 v:3 player:6 [2023-09-12 15:15:44,070][62305] Initialized w:3 v:3 player:1 [2023-09-12 15:15:44,071][62305] 8 agent workers initialized for env 3! [2023-09-12 15:15:44,071][62307] Initializing env for player 5, init_info: {'port': 40903}... [2023-09-12 15:15:44,123][62307] Initializing env for player 6, init_info: {'port': 40903}... [2023-09-12 15:15:44,140][62265] Decorrelating experience for 96 frames... [2023-09-12 15:15:44,148][62305] Decorrelating experience for 96 frames... [2023-09-12 15:15:44,191][62307] Initializing env for player 7, init_info: {'port': 40903}... [2023-09-12 15:15:46,090][62307] Initialized w:6 v:3 player:7 [2023-09-12 15:15:46,091][62307] Initialized w:6 v:3 player:0 [2023-09-12 15:15:46,092][62307] Initialized w:6 v:3 player:5 [2023-09-12 15:15:46,093][62307] Initialized w:6 v:3 player:4 [2023-09-12 15:15:46,095][62307] Initialized w:6 v:3 player:6 [2023-09-12 15:15:46,096][62307] Initialized w:6 v:3 player:3 [2023-09-12 15:15:46,097][62307] Initialized w:6 v:3 player:2 [2023-09-12 15:15:46,099][62307] Initialized w:6 v:3 player:1 [2023-09-12 15:15:46,100][62307] 8 agent workers initialized for env 6! [2023-09-12 15:15:46,175][62307] Decorrelating experience for 96 frames... [2023-09-12 15:15:46,381][62109] Signal inference workers to stop experience collection... [2023-09-12 15:15:46,403][62259] InferenceWorker_p0-w0: stopping experience collection [2023-09-12 15:15:47,468][62109] EvtLoop [learner_proc0_evt_loop, process=learner_proc0] unhandled exception in slot='on_new_training_batch' connected to emitter=Emitter(object_id='Batcher_0', signal_name='training_batches_available'), args=(0,) Traceback (most recent call last): File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/signal_slot/signal_slot.py", line 355, in _process_signal slot_callable(*args) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sample_factory/algo/learning/learner_worker.py", line 150, in on_new_training_batch stats = self.learner.train(self.batcher.training_batches[batch_idx]) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sample_factory/algo/learning/learner.py", line 1046, in train train_stats = self._train(buff, self.cfg.batch_size, experience_size, num_invalids) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sample_factory/algo/learning/learner.py", line 731, in _train ) = self._calculate_losses(mb, num_invalids) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sample_factory/algo/learning/learner.py", line 649, in _calculate_losses exploration_loss = self.exploration_loss_func(action_distribution, valids, num_invalids) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sample_factory/algo/learning/learner.py", line 477, in _symmetric_kl_exploration_loss kl_prior = action_distribution.symmetric_kl_with_uniform_prior() File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sample_factory/algo/utils/action_distributions.py", line 247, in symmetric_kl_with_uniform_prior sym_kls = [d.symmetric_kl_with_uniform_prior().unsqueeze(dim=1) for d in self.distributions] File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sample_factory/algo/utils/action_distributions.py", line 247, in sym_kls = [d.symmetric_kl_with_uniform_prior().unsqueeze(dim=1) for d in self.distributions] AttributeError: 'ContinuousActionDistribution' object has no attribute 'symmetric_kl_with_uniform_prior' [2023-09-12 15:15:47,468][62109] Unhandled exception 'ContinuousActionDistribution' object has no attribute 'symmetric_kl_with_uniform_prior' in evt loop learner_proc0_evt_loop [2023-09-12 15:16:41,871][62109] Stopping Batcher_0... [2023-09-12 15:16:41,871][62109] Loop batcher_evt_loop terminating... [2023-09-12 15:16:41,886][62259] Weights refcount: 2 0 [2023-09-12 15:16:41,887][62259] Stopping InferenceWorker_p0-w0... [2023-09-12 15:16:41,887][62259] Loop inference_proc0-0_evt_loop terminating... [2023-09-12 15:16:45,092][62262] Stopping RolloutWorker_w0... [2023-09-12 15:16:45,093][62262] Loop rollout_proc0_evt_loop terminating... [2023-09-12 15:16:45,095][62309] Stopping RolloutWorker_w7... [2023-09-12 15:16:45,095][62309] Loop rollout_proc7_evt_loop terminating... [2023-09-12 15:16:45,095][62305] Stopping RolloutWorker_w3... [2023-09-12 15:16:45,096][62308] Stopping RolloutWorker_w5... [2023-09-12 15:16:45,096][62266] Stopping RolloutWorker_w1... [2023-09-12 15:16:45,096][62305] Loop rollout_proc3_evt_loop terminating... [2023-09-12 15:16:45,096][62308] Loop rollout_proc5_evt_loop terminating... [2023-09-12 15:16:45,096][62266] Loop rollout_proc1_evt_loop terminating... [2023-09-12 15:16:45,099][62267] Stopping RolloutWorker_w4... [2023-09-12 15:16:45,099][62265] Stopping RolloutWorker_w2... [2023-09-12 15:16:45,099][62267] Loop rollout_proc4_evt_loop terminating... [2023-09-12 15:16:45,099][62265] Loop rollout_proc2_evt_loop terminating... [2023-09-12 15:16:45,100][62307] Stopping RolloutWorker_w6... [2023-09-12 15:16:45,101][62307] Loop rollout_proc6_evt_loop terminating... [2023-09-12 15:16:53,596][70316] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-09-12 15:16:53,597][70316] Set environment var CUDA_VISIBLE_DEVICES to '0' (GPU indices [0]) for learning process 0 [2023-09-12 15:16:53,634][70316] Num visible devices: 1 [2023-09-12 15:16:53,676][70316] Starting seed is not provided [2023-09-12 15:16:53,676][70316] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-09-12 15:16:53,676][70316] Initializing actor-critic model on device cuda:0 [2023-09-12 15:16:53,677][70316] RunningMeanStd input shape: (3, 72, 128) [2023-09-12 15:16:53,678][70316] RunningMeanStd input shape: (1,) [2023-09-12 15:16:53,697][70316] ConvEncoder: input_channels=3 [2023-09-12 15:16:53,957][70316] Conv encoder output size: 512 [2023-09-12 15:16:53,957][70316] Policy head output size: 512 [2023-09-12 15:16:53,980][70316] Created Actor Critic model with architecture: [2023-09-12 15:16:53,981][70316] ActorCriticSharedWeights( (obs_normalizer): ObservationNormalizer( (running_mean_std): RunningMeanStdDictInPlace( (running_mean_std): ModuleDict( (obs): RunningMeanStdInPlace() ) ) ) (returns_normalizer): RecursiveScriptModule(original_name=RunningMeanStdInPlace) (encoder): VizdoomEncoder( (basic_encoder): ConvEncoder( (enc): RecursiveScriptModule( original_name=ConvEncoderImpl (conv_head): RecursiveScriptModule( original_name=Sequential (0): RecursiveScriptModule(original_name=Conv2d) (1): RecursiveScriptModule(original_name=ELU) (2): RecursiveScriptModule(original_name=Conv2d) (3): RecursiveScriptModule(original_name=ELU) (4): RecursiveScriptModule(original_name=Conv2d) (5): RecursiveScriptModule(original_name=ELU) ) (mlp_layers): RecursiveScriptModule( original_name=Sequential (0): RecursiveScriptModule(original_name=Linear) (1): RecursiveScriptModule(original_name=ELU) ) ) ) ) (core): ModelCoreRNN( (core): GRU(512, 512) ) (decoder): MlpDecoder( (mlp): Identity() ) (critic_linear): Linear(in_features=512, out_features=1, bias=True) (action_parameterization): ActionParameterizationDefault( (distribution_linear): Linear(in_features=512, out_features=6, bias=True) ) ) [2023-09-12 15:16:55,332][70316] Using optimizer [2023-09-12 15:16:55,333][70316] No checkpoints found [2023-09-12 15:16:55,333][70316] Did not load from checkpoint, starting from scratch! [2023-09-12 15:16:55,333][70316] Initialized policy 0 weights for model version 0 [2023-09-12 15:16:55,335][70316] LearnerWorker_p0 finished initialization! [2023-09-12 15:16:55,336][70316] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-09-12 15:16:55,641][70392] Worker 2 uses CPU cores [8, 9, 10, 11] [2023-09-12 15:16:55,677][70433] Worker 7 uses CPU cores [28, 29, 30, 31] [2023-09-12 15:16:55,747][70390] Worker 0 uses CPU cores [0, 1, 2, 3] [2023-09-12 15:16:55,890][70429] Worker 4 uses CPU cores [16, 17, 18, 19] [2023-09-12 15:16:55,902][70388] Worker 1 uses CPU cores [4, 5, 6, 7] [2023-09-12 15:16:55,993][70389] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-09-12 15:16:55,993][70389] Set environment var CUDA_VISIBLE_DEVICES to '0' (GPU indices [0]) for inference process 0 [2023-09-12 15:16:56,011][70389] Num visible devices: 1 [2023-09-12 15:16:56,012][70391] Worker 3 uses CPU cores [12, 13, 14, 15] [2023-09-12 15:16:56,013][70434] Worker 6 uses CPU cores [24, 25, 26, 27] [2023-09-12 15:16:56,043][70393] Worker 5 uses CPU cores [20, 21, 22, 23] [2023-09-12 15:16:56,646][70389] RunningMeanStd input shape: (3, 72, 128) [2023-09-12 15:16:56,646][70389] RunningMeanStd input shape: (1,) [2023-09-12 15:16:56,658][70389] ConvEncoder: input_channels=3 [2023-09-12 15:16:56,759][70389] Conv encoder output size: 512 [2023-09-12 15:16:56,759][70389] Policy head output size: 512 [2023-09-12 15:16:57,100][70429] Doom resolution: 160x120, resize resolution: (128, 72) [2023-09-12 15:16:57,100][70433] Doom resolution: 160x120, resize resolution: (128, 72) [2023-09-12 15:16:57,100][70393] Doom resolution: 160x120, resize resolution: (128, 72) [2023-09-12 15:16:57,100][70391] Doom resolution: 160x120, resize resolution: (128, 72) [2023-09-12 15:16:57,100][70434] Doom resolution: 160x120, resize resolution: (128, 72) [2023-09-12 15:16:57,113][70390] Doom resolution: 160x120, resize resolution: (128, 72) [2023-09-12 15:16:57,113][70392] Doom resolution: 160x120, resize resolution: (128, 72) [2023-09-12 15:16:57,113][70388] Doom resolution: 160x120, resize resolution: (128, 72) [2023-09-12 15:16:57,452][70390] Decorrelating experience for 0 frames... [2023-09-12 15:16:57,453][70429] Decorrelating experience for 0 frames... [2023-09-12 15:16:57,453][70391] Decorrelating experience for 0 frames... [2023-09-12 15:16:57,456][70434] Decorrelating experience for 0 frames... [2023-09-12 15:16:57,457][70433] Decorrelating experience for 0 frames... [2023-09-12 15:16:57,551][70388] Decorrelating experience for 0 frames... [2023-09-12 15:16:57,569][70392] Decorrelating experience for 0 frames... [2023-09-12 15:16:57,725][70434] Decorrelating experience for 32 frames... [2023-09-12 15:16:57,733][70429] Decorrelating experience for 32 frames... [2023-09-12 15:16:57,804][70391] Decorrelating experience for 32 frames... [2023-09-12 15:16:57,816][70433] Decorrelating experience for 32 frames... [2023-09-12 15:16:57,885][70393] Decorrelating experience for 0 frames... [2023-09-12 15:16:57,921][70390] Decorrelating experience for 32 frames... [2023-09-12 15:16:57,943][70388] Decorrelating experience for 32 frames... [2023-09-12 15:16:58,147][70429] Decorrelating experience for 64 frames... [2023-09-12 15:16:58,154][70434] Decorrelating experience for 64 frames... [2023-09-12 15:16:58,186][70392] Decorrelating experience for 32 frames... [2023-09-12 15:16:58,256][70391] Decorrelating experience for 64 frames... [2023-09-12 15:16:58,272][70393] Decorrelating experience for 32 frames... [2023-09-12 15:16:58,492][70433] Decorrelating experience for 64 frames... [2023-09-12 15:16:58,496][70390] Decorrelating experience for 64 frames... [2023-09-12 15:16:58,505][70434] Decorrelating experience for 96 frames... [2023-09-12 15:16:58,514][70429] Decorrelating experience for 96 frames... [2023-09-12 15:16:58,562][70388] Decorrelating experience for 64 frames... [2023-09-12 15:16:58,633][70393] Decorrelating experience for 64 frames... [2023-09-12 15:16:58,642][70392] Decorrelating experience for 64 frames... [2023-09-12 15:16:58,778][70391] Decorrelating experience for 96 frames... [2023-09-12 15:16:58,883][70390] Decorrelating experience for 96 frames... [2023-09-12 15:16:58,930][70433] Decorrelating experience for 96 frames... [2023-09-12 15:16:58,944][70388] Decorrelating experience for 96 frames... [2023-09-12 15:16:59,248][70393] Decorrelating experience for 96 frames... [2023-09-12 15:16:59,257][70392] Decorrelating experience for 96 frames... [2023-09-12 15:16:59,588][70316] Signal inference workers to stop experience collection... [2023-09-12 15:16:59,595][70389] InferenceWorker_p0-w0: stopping experience collection [2023-09-12 15:17:03,152][70316] Signal inference workers to resume experience collection... [2023-09-12 15:17:03,153][70389] InferenceWorker_p0-w0: resuming experience collection [2023-09-12 15:17:06,236][70389] Updated weights for policy 0, policy_version 10 (0.0372) [2023-09-12 15:17:09,721][70389] Updated weights for policy 0, policy_version 20 (0.0010) [2023-09-12 15:17:13,010][70389] Updated weights for policy 0, policy_version 30 (0.0008) [2023-09-12 15:17:13,837][70316] Saving new best policy, reward=-0.093! [2023-09-12 15:17:16,293][70389] Updated weights for policy 0, policy_version 40 (0.0008) [2023-09-12 15:17:19,631][70389] Updated weights for policy 0, policy_version 50 (0.0010) [2023-09-12 15:17:22,990][70389] Updated weights for policy 0, policy_version 60 (0.0008) [2023-09-12 15:17:26,391][70389] Updated weights for policy 0, policy_version 70 (0.0009) [2023-09-12 15:17:29,824][70389] Updated weights for policy 0, policy_version 80 (0.0009) [2023-09-12 15:17:33,210][70389] Updated weights for policy 0, policy_version 90 (0.0010) [2023-09-12 15:17:36,615][70389] Updated weights for policy 0, policy_version 100 (0.0009) [2023-09-12 15:17:39,951][70389] Updated weights for policy 0, policy_version 110 (0.0008) [2023-09-12 15:17:43,320][70389] Updated weights for policy 0, policy_version 120 (0.0008) [2023-09-12 15:17:46,635][70389] Updated weights for policy 0, policy_version 130 (0.0008) [2023-09-12 15:17:49,928][70389] Updated weights for policy 0, policy_version 140 (0.0009) [2023-09-12 15:17:53,332][70389] Updated weights for policy 0, policy_version 150 (0.0009) [2023-09-12 15:17:56,723][70389] Updated weights for policy 0, policy_version 160 (0.0009) [2023-09-12 15:18:00,055][70389] Updated weights for policy 0, policy_version 170 (0.0009) [2023-09-12 15:18:03,491][70389] Updated weights for policy 0, policy_version 180 (0.0009) [2023-09-12 15:18:06,829][70389] Updated weights for policy 0, policy_version 190 (0.0009) [2023-09-12 15:18:10,117][70389] Updated weights for policy 0, policy_version 200 (0.0009) [2023-09-12 15:18:13,417][70389] Updated weights for policy 0, policy_version 210 (0.0009) [2023-09-12 15:18:16,810][70389] Updated weights for policy 0, policy_version 220 (0.0009) [2023-09-12 15:18:20,056][70389] Updated weights for policy 0, policy_version 230 (0.0009) [2023-09-12 15:18:23,368][70389] Updated weights for policy 0, policy_version 240 (0.0008) [2023-09-12 15:18:26,799][70389] Updated weights for policy 0, policy_version 250 (0.0009) [2023-09-12 15:18:30,175][70389] Updated weights for policy 0, policy_version 260 (0.0009) [2023-09-12 15:18:33,623][70389] Updated weights for policy 0, policy_version 270 (0.0009) [2023-09-12 15:18:36,997][70389] Updated weights for policy 0, policy_version 280 (0.0008) [2023-09-12 15:18:40,229][70389] Updated weights for policy 0, policy_version 290 (0.0009) [2023-09-12 15:18:43,508][70389] Updated weights for policy 0, policy_version 300 (0.0009) [2023-09-12 15:18:46,848][70389] Updated weights for policy 0, policy_version 310 (0.0008) [2023-09-12 15:18:48,845][70316] Saving /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000000316_1294336.pth... [2023-09-12 15:18:50,168][70389] Updated weights for policy 0, policy_version 320 (0.0009) [2023-09-12 15:18:53,571][70389] Updated weights for policy 0, policy_version 330 (0.0009) [2023-09-12 15:18:56,786][70389] Updated weights for policy 0, policy_version 340 (0.0008) [2023-09-12 15:19:00,215][70389] Updated weights for policy 0, policy_version 350 (0.0009) [2023-09-12 15:19:03,591][70389] Updated weights for policy 0, policy_version 360 (0.0008) [2023-09-12 15:19:06,873][70389] Updated weights for policy 0, policy_version 370 (0.0008) [2023-09-12 15:19:10,252][70389] Updated weights for policy 0, policy_version 380 (0.0008) [2023-09-12 15:19:13,694][70389] Updated weights for policy 0, policy_version 390 (0.0021) [2023-09-12 15:19:17,031][70389] Updated weights for policy 0, policy_version 400 (0.0009) [2023-09-12 15:19:20,559][70389] Updated weights for policy 0, policy_version 410 (0.0010) [2023-09-12 15:19:23,948][70389] Updated weights for policy 0, policy_version 420 (0.0008) [2023-09-12 15:19:27,204][70389] Updated weights for policy 0, policy_version 430 (0.0009) [2023-09-12 15:19:30,664][70389] Updated weights for policy 0, policy_version 440 (0.0009) [2023-09-12 15:19:33,991][70389] Updated weights for policy 0, policy_version 450 (0.0009) [2023-09-12 15:19:37,379][70389] Updated weights for policy 0, policy_version 460 (0.0011) [2023-09-12 15:19:40,683][70389] Updated weights for policy 0, policy_version 470 (0.0008) [2023-09-12 15:19:43,994][70389] Updated weights for policy 0, policy_version 480 (0.0008) [2023-09-12 15:19:47,395][70389] Updated weights for policy 0, policy_version 490 (0.0009) [2023-09-12 15:19:50,641][70389] Updated weights for policy 0, policy_version 500 (0.0008) [2023-09-12 15:19:53,837][70316] Saving new best policy, reward=-0.087! [2023-09-12 15:19:54,040][70389] Updated weights for policy 0, policy_version 510 (0.0009) [2023-09-12 15:19:56,561][70389] Updated weights for policy 0, policy_version 520 (0.0008) [2023-09-12 15:19:58,841][70316] Saving new best policy, reward=-0.063! [2023-09-12 15:19:58,994][70389] Updated weights for policy 0, policy_version 530 (0.0008) [2023-09-12 15:20:01,390][70389] Updated weights for policy 0, policy_version 540 (0.0009) [2023-09-12 15:20:03,746][70389] Updated weights for policy 0, policy_version 550 (0.0009) [2023-09-12 15:20:06,201][70389] Updated weights for policy 0, policy_version 560 (0.0009) [2023-09-12 15:20:08,666][70389] Updated weights for policy 0, policy_version 570 (0.0009) [2023-09-12 15:20:11,130][70389] Updated weights for policy 0, policy_version 580 (0.0010) [2023-09-12 15:20:13,584][70389] Updated weights for policy 0, policy_version 590 (0.0008) [2023-09-12 15:20:15,974][70389] Updated weights for policy 0, policy_version 600 (0.0008) [2023-09-12 15:20:18,408][70389] Updated weights for policy 0, policy_version 610 (0.0008) [2023-09-12 15:20:20,863][70389] Updated weights for policy 0, policy_version 620 (0.0009) [2023-09-12 15:20:23,283][70389] Updated weights for policy 0, policy_version 630 (0.0008) [2023-09-12 15:20:25,669][70389] Updated weights for policy 0, policy_version 640 (0.0009) [2023-09-12 15:20:28,067][70389] Updated weights for policy 0, policy_version 650 (0.0008) [2023-09-12 15:20:30,404][70389] Updated weights for policy 0, policy_version 660 (0.0008) [2023-09-12 15:20:32,828][70389] Updated weights for policy 0, policy_version 670 (0.0009) [2023-09-12 15:20:35,654][70389] Updated weights for policy 0, policy_version 680 (0.0008) [2023-09-12 15:20:38,954][70389] Updated weights for policy 0, policy_version 690 (0.0009) [2023-09-12 15:20:42,406][70389] Updated weights for policy 0, policy_version 700 (0.0009) [2023-09-12 15:20:45,662][70389] Updated weights for policy 0, policy_version 710 (0.0008) [2023-09-12 15:20:48,844][70316] Saving /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000000719_2945024.pth... [2023-09-12 15:20:49,079][70389] Updated weights for policy 0, policy_version 720 (0.0008) [2023-09-12 15:20:52,455][70389] Updated weights for policy 0, policy_version 730 (0.0008) [2023-09-12 15:20:55,763][70389] Updated weights for policy 0, policy_version 740 (0.0008) [2023-09-12 15:20:59,177][70389] Updated weights for policy 0, policy_version 750 (0.0008) [2023-09-12 15:21:02,571][70389] Updated weights for policy 0, policy_version 760 (0.0009) [2023-09-12 15:21:05,984][70389] Updated weights for policy 0, policy_version 770 (0.0008) [2023-09-12 15:21:09,233][70389] Updated weights for policy 0, policy_version 780 (0.0009) [2023-09-12 15:21:12,542][70389] Updated weights for policy 0, policy_version 790 (0.0009) [2023-09-12 15:21:15,889][70389] Updated weights for policy 0, policy_version 800 (0.0009) [2023-09-12 15:21:19,161][70389] Updated weights for policy 0, policy_version 810 (0.0009) [2023-09-12 15:21:22,516][70389] Updated weights for policy 0, policy_version 820 (0.0008) [2023-09-12 15:21:25,878][70389] Updated weights for policy 0, policy_version 830 (0.0008) [2023-09-12 15:21:29,134][70389] Updated weights for policy 0, policy_version 840 (0.0009) [2023-09-12 15:21:32,423][70389] Updated weights for policy 0, policy_version 850 (0.0010) [2023-09-12 15:21:35,896][70389] Updated weights for policy 0, policy_version 860 (0.0009) [2023-09-12 15:21:39,149][70389] Updated weights for policy 0, policy_version 870 (0.0008) [2023-09-12 15:21:42,494][70389] Updated weights for policy 0, policy_version 880 (0.0009) [2023-09-12 15:21:45,841][70389] Updated weights for policy 0, policy_version 890 (0.0008) [2023-09-12 15:21:49,057][70389] Updated weights for policy 0, policy_version 900 (0.0009) [2023-09-12 15:21:52,263][70389] Updated weights for policy 0, policy_version 910 (0.0008) [2023-09-12 15:21:55,606][70389] Updated weights for policy 0, policy_version 920 (0.0009) [2023-09-12 15:21:58,945][70389] Updated weights for policy 0, policy_version 930 (0.0008) [2023-09-12 15:22:02,335][70389] Updated weights for policy 0, policy_version 940 (0.0009) [2023-09-12 15:22:05,727][70389] Updated weights for policy 0, policy_version 950 (0.0008) [2023-09-12 15:22:08,953][70389] Updated weights for policy 0, policy_version 960 (0.0009) [2023-09-12 15:22:12,312][70389] Updated weights for policy 0, policy_version 970 (0.0008) [2023-09-12 15:22:15,662][70389] Updated weights for policy 0, policy_version 980 (0.0008) [2023-09-12 15:22:18,972][70389] Updated weights for policy 0, policy_version 990 (0.0009) [2023-09-12 15:22:22,344][70389] Updated weights for policy 0, policy_version 1000 (0.0008) [2023-09-12 15:22:25,658][70389] Updated weights for policy 0, policy_version 1010 (0.0009) [2023-09-12 15:22:28,987][70389] Updated weights for policy 0, policy_version 1020 (0.0008) [2023-09-12 15:22:32,343][70389] Updated weights for policy 0, policy_version 1030 (0.0008) [2023-09-12 15:22:35,691][70389] Updated weights for policy 0, policy_version 1040 (0.0008) [2023-09-12 15:22:39,076][70389] Updated weights for policy 0, policy_version 1050 (0.0009) [2023-09-12 15:22:42,412][70389] Updated weights for policy 0, policy_version 1060 (0.0009) [2023-09-12 15:22:45,739][70389] Updated weights for policy 0, policy_version 1070 (0.0009) [2023-09-12 15:22:48,842][70316] Saving /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000001079_4419584.pth... [2023-09-12 15:22:48,893][70316] Removing /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000000316_1294336.pth [2023-09-12 15:22:49,048][70389] Updated weights for policy 0, policy_version 1080 (0.0009) [2023-09-12 15:22:52,419][70389] Updated weights for policy 0, policy_version 1090 (0.0009) [2023-09-12 15:22:55,791][70389] Updated weights for policy 0, policy_version 1100 (0.0009) [2023-09-12 15:22:59,112][70389] Updated weights for policy 0, policy_version 1110 (0.0008) [2023-09-12 15:23:02,458][70389] Updated weights for policy 0, policy_version 1120 (0.0008) [2023-09-12 15:23:05,787][70389] Updated weights for policy 0, policy_version 1130 (0.0009) [2023-09-12 15:23:09,145][70389] Updated weights for policy 0, policy_version 1140 (0.0008) [2023-09-12 15:23:12,425][70389] Updated weights for policy 0, policy_version 1150 (0.0009) [2023-09-12 15:23:15,828][70389] Updated weights for policy 0, policy_version 1160 (0.0009) [2023-09-12 15:23:19,213][70389] Updated weights for policy 0, policy_version 1170 (0.0008) [2023-09-12 15:23:22,633][70389] Updated weights for policy 0, policy_version 1180 (0.0009) [2023-09-12 15:23:25,512][70389] Updated weights for policy 0, policy_version 1190 (0.0009) [2023-09-12 15:23:27,926][70389] Updated weights for policy 0, policy_version 1200 (0.0008) [2023-09-12 15:23:30,389][70389] Updated weights for policy 0, policy_version 1210 (0.0009) [2023-09-12 15:23:32,819][70389] Updated weights for policy 0, policy_version 1220 (0.0009) [2023-09-12 15:23:35,236][70389] Updated weights for policy 0, policy_version 1230 (0.0008) [2023-09-12 15:23:37,587][70389] Updated weights for policy 0, policy_version 1240 (0.0008) [2023-09-12 15:23:39,993][70389] Updated weights for policy 0, policy_version 1250 (0.0009) [2023-09-12 15:23:42,419][70389] Updated weights for policy 0, policy_version 1260 (0.0008) [2023-09-12 15:23:44,776][70389] Updated weights for policy 0, policy_version 1270 (0.0008) [2023-09-12 15:23:47,241][70389] Updated weights for policy 0, policy_version 1280 (0.0009) [2023-09-12 15:23:49,695][70389] Updated weights for policy 0, policy_version 1290 (0.0008) [2023-09-12 15:23:52,094][70389] Updated weights for policy 0, policy_version 1300 (0.0009) [2023-09-12 15:23:54,465][70389] Updated weights for policy 0, policy_version 1310 (0.0008) [2023-09-12 15:23:56,875][70389] Updated weights for policy 0, policy_version 1320 (0.0008) [2023-09-12 15:23:58,843][70316] Saving new best policy, reward=-0.050! [2023-09-12 15:23:59,310][70389] Updated weights for policy 0, policy_version 1330 (0.0009) [2023-09-12 15:24:01,732][70389] Updated weights for policy 0, policy_version 1340 (0.0009) [2023-09-12 15:24:03,869][70316] Saving new best policy, reward=-0.037! [2023-09-12 15:24:04,122][70389] Updated weights for policy 0, policy_version 1350 (0.0009) [2023-09-12 15:24:07,167][70389] Updated weights for policy 0, policy_version 1360 (0.0009) [2023-09-12 15:24:10,461][70389] Updated weights for policy 0, policy_version 1370 (0.0008) [2023-09-12 15:24:13,808][70389] Updated weights for policy 0, policy_version 1380 (0.0008) [2023-09-12 15:24:17,150][70389] Updated weights for policy 0, policy_version 1390 (0.0008) [2023-09-12 15:24:20,508][70389] Updated weights for policy 0, policy_version 1400 (0.0009) [2023-09-12 15:24:23,777][70389] Updated weights for policy 0, policy_version 1410 (0.0009) [2023-09-12 15:24:27,139][70389] Updated weights for policy 0, policy_version 1420 (0.0009) [2023-09-12 15:24:30,474][70389] Updated weights for policy 0, policy_version 1430 (0.0008) [2023-09-12 15:24:33,830][70389] Updated weights for policy 0, policy_version 1440 (0.0009) [2023-09-12 15:24:37,574][70389] Updated weights for policy 0, policy_version 1450 (0.0009) [2023-09-12 15:24:41,285][70389] Updated weights for policy 0, policy_version 1460 (0.0011) [2023-09-12 15:24:44,836][70389] Updated weights for policy 0, policy_version 1470 (0.0009) [2023-09-12 15:24:48,233][70389] Updated weights for policy 0, policy_version 1480 (0.0009) [2023-09-12 15:24:48,881][70316] Saving /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000001482_6070272.pth... [2023-09-12 15:24:48,930][70316] Removing /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000000719_2945024.pth [2023-09-12 15:24:51,554][70389] Updated weights for policy 0, policy_version 1490 (0.0008) [2023-09-12 15:24:53,869][70316] Saving new best policy, reward=-0.030! [2023-09-12 15:24:54,885][70389] Updated weights for policy 0, policy_version 1500 (0.0008) [2023-09-12 15:24:58,187][70389] Updated weights for policy 0, policy_version 1510 (0.0009) [2023-09-12 15:25:01,449][70389] Updated weights for policy 0, policy_version 1520 (0.0009) [2023-09-12 15:25:04,823][70389] Updated weights for policy 0, policy_version 1530 (0.0008) [2023-09-12 15:25:08,086][70389] Updated weights for policy 0, policy_version 1540 (0.0008) [2023-09-12 15:25:11,381][70389] Updated weights for policy 0, policy_version 1550 (0.0008) [2023-09-12 15:25:14,626][70389] Updated weights for policy 0, policy_version 1560 (0.0008) [2023-09-12 15:25:17,993][70389] Updated weights for policy 0, policy_version 1570 (0.0008) [2023-09-12 15:25:21,374][70389] Updated weights for policy 0, policy_version 1580 (0.0008) [2023-09-12 15:25:24,609][70389] Updated weights for policy 0, policy_version 1590 (0.0009) [2023-09-12 15:25:28,048][70389] Updated weights for policy 0, policy_version 1600 (0.0009) [2023-09-12 15:25:28,840][70316] Saving new best policy, reward=-0.009! [2023-09-12 15:25:31,325][70389] Updated weights for policy 0, policy_version 1610 (0.0009) [2023-09-12 15:25:33,837][70316] Saving new best policy, reward=0.060! [2023-09-12 15:25:34,594][70389] Updated weights for policy 0, policy_version 1620 (0.0008) [2023-09-12 15:25:37,945][70389] Updated weights for policy 0, policy_version 1630 (0.0009) [2023-09-12 15:25:41,211][70389] Updated weights for policy 0, policy_version 1640 (0.0009) [2023-09-12 15:25:44,593][70389] Updated weights for policy 0, policy_version 1650 (0.0008) [2023-09-12 15:25:47,883][70389] Updated weights for policy 0, policy_version 1660 (0.0008) [2023-09-12 15:25:51,286][70389] Updated weights for policy 0, policy_version 1670 (0.0009) [2023-09-12 15:25:54,594][70389] Updated weights for policy 0, policy_version 1680 (0.0008) [2023-09-12 15:25:57,878][70389] Updated weights for policy 0, policy_version 1690 (0.0008) [2023-09-12 15:26:01,156][70389] Updated weights for policy 0, policy_version 1700 (0.0009) [2023-09-12 15:26:04,482][70389] Updated weights for policy 0, policy_version 1710 (0.0008) [2023-09-12 15:26:07,813][70389] Updated weights for policy 0, policy_version 1720 (0.0008) [2023-09-12 15:26:11,064][70389] Updated weights for policy 0, policy_version 1730 (0.0008) [2023-09-12 15:26:14,442][70389] Updated weights for policy 0, policy_version 1740 (0.0008) [2023-09-12 15:26:17,840][70389] Updated weights for policy 0, policy_version 1750 (0.0008) [2023-09-12 15:26:21,240][70389] Updated weights for policy 0, policy_version 1760 (0.0009) [2023-09-12 15:26:24,568][70389] Updated weights for policy 0, policy_version 1770 (0.0008) [2023-09-12 15:26:27,864][70389] Updated weights for policy 0, policy_version 1780 (0.0008) [2023-09-12 15:26:31,196][70389] Updated weights for policy 0, policy_version 1790 (0.0009) [2023-09-12 15:26:34,625][70389] Updated weights for policy 0, policy_version 1800 (0.0009) [2023-09-12 15:26:38,001][70389] Updated weights for policy 0, policy_version 1810 (0.0008) [2023-09-12 15:26:41,345][70389] Updated weights for policy 0, policy_version 1820 (0.0009) [2023-09-12 15:26:44,666][70389] Updated weights for policy 0, policy_version 1830 (0.0009) [2023-09-12 15:26:48,049][70389] Updated weights for policy 0, policy_version 1840 (0.0010) [2023-09-12 15:26:48,843][70316] Saving /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000001842_7544832.pth... [2023-09-12 15:26:48,919][70316] Removing /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000001079_4419584.pth [2023-09-12 15:26:51,373][70389] Updated weights for policy 0, policy_version 1850 (0.0008) [2023-09-12 15:26:54,767][70389] Updated weights for policy 0, policy_version 1860 (0.0008) [2023-09-12 15:26:57,167][70389] Updated weights for policy 0, policy_version 1870 (0.0009) [2023-09-12 15:26:59,500][70389] Updated weights for policy 0, policy_version 1880 (0.0008) [2023-09-12 15:27:01,962][70389] Updated weights for policy 0, policy_version 1890 (0.0010) [2023-09-12 15:27:04,306][70389] Updated weights for policy 0, policy_version 1900 (0.0008) [2023-09-12 15:27:06,703][70389] Updated weights for policy 0, policy_version 1910 (0.0009) [2023-09-12 15:27:09,083][70389] Updated weights for policy 0, policy_version 1920 (0.0008) [2023-09-12 15:27:11,421][70389] Updated weights for policy 0, policy_version 1930 (0.0008) [2023-09-12 15:27:13,790][70389] Updated weights for policy 0, policy_version 1940 (0.0008) [2023-09-12 15:27:16,157][70389] Updated weights for policy 0, policy_version 1950 (0.0009) [2023-09-12 15:27:18,509][70389] Updated weights for policy 0, policy_version 1960 (0.0009) [2023-09-12 15:27:20,880][70389] Updated weights for policy 0, policy_version 1970 (0.0009) [2023-09-12 15:27:23,244][70389] Updated weights for policy 0, policy_version 1980 (0.0008) [2023-09-12 15:27:25,567][70389] Updated weights for policy 0, policy_version 1990 (0.0008) [2023-09-12 15:27:27,944][70389] Updated weights for policy 0, policy_version 2000 (0.0009) [2023-09-12 15:27:30,376][70389] Updated weights for policy 0, policy_version 2010 (0.0009) [2023-09-12 15:27:33,215][70389] Updated weights for policy 0, policy_version 2020 (0.0009) [2023-09-12 15:27:36,568][70389] Updated weights for policy 0, policy_version 2030 (0.0009) [2023-09-12 15:27:39,816][70389] Updated weights for policy 0, policy_version 2040 (0.0009) [2023-09-12 15:27:43,201][70389] Updated weights for policy 0, policy_version 2050 (0.0009) [2023-09-12 15:27:46,459][70389] Updated weights for policy 0, policy_version 2060 (0.0008) [2023-09-12 15:27:49,627][70389] Updated weights for policy 0, policy_version 2070 (0.0008) [2023-09-12 15:27:53,066][70389] Updated weights for policy 0, policy_version 2080 (0.0009) [2023-09-12 15:27:56,462][70389] Updated weights for policy 0, policy_version 2090 (0.0010) [2023-09-12 15:27:59,789][70389] Updated weights for policy 0, policy_version 2100 (0.0008) [2023-09-12 15:28:02,995][70389] Updated weights for policy 0, policy_version 2110 (0.0009) [2023-09-12 15:28:06,334][70389] Updated weights for policy 0, policy_version 2120 (0.0008) [2023-09-12 15:28:09,596][70389] Updated weights for policy 0, policy_version 2130 (0.0009) [2023-09-12 15:28:12,773][70389] Updated weights for policy 0, policy_version 2140 (0.0008) [2023-09-12 15:28:16,088][70389] Updated weights for policy 0, policy_version 2150 (0.0008) [2023-09-12 15:28:19,346][70389] Updated weights for policy 0, policy_version 2160 (0.0009) [2023-09-12 15:28:22,720][70389] Updated weights for policy 0, policy_version 2170 (0.0008) [2023-09-12 15:28:26,007][70389] Updated weights for policy 0, policy_version 2180 (0.0009) [2023-09-12 15:28:29,302][70389] Updated weights for policy 0, policy_version 2190 (0.0009) [2023-09-12 15:28:32,714][70389] Updated weights for policy 0, policy_version 2200 (0.0009) [2023-09-12 15:28:35,971][70389] Updated weights for policy 0, policy_version 2210 (0.0008) [2023-09-12 15:28:39,256][70389] Updated weights for policy 0, policy_version 2220 (0.0009) [2023-09-12 15:28:42,632][70389] Updated weights for policy 0, policy_version 2230 (0.0009) [2023-09-12 15:28:45,925][70389] Updated weights for policy 0, policy_version 2240 (0.0008) [2023-09-12 15:28:48,847][70316] Saving /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000002249_9211904.pth... [2023-09-12 15:28:48,913][70316] Removing /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000001482_6070272.pth [2023-09-12 15:28:49,166][70389] Updated weights for policy 0, policy_version 2250 (0.0009) [2023-09-12 15:28:52,494][70389] Updated weights for policy 0, policy_version 2260 (0.0008) [2023-09-12 15:28:55,878][70389] Updated weights for policy 0, policy_version 2270 (0.0009) [2023-09-12 15:28:59,173][70389] Updated weights for policy 0, policy_version 2280 (0.0009) [2023-09-12 15:29:02,515][70389] Updated weights for policy 0, policy_version 2290 (0.0009) [2023-09-12 15:29:05,855][70389] Updated weights for policy 0, policy_version 2300 (0.0009) [2023-09-12 15:29:09,166][70389] Updated weights for policy 0, policy_version 2310 (0.0008) [2023-09-12 15:29:12,474][70389] Updated weights for policy 0, policy_version 2320 (0.0008) [2023-09-12 15:29:15,804][70389] Updated weights for policy 0, policy_version 2330 (0.0008) [2023-09-12 15:29:19,124][70389] Updated weights for policy 0, policy_version 2340 (0.0008) [2023-09-12 15:29:22,406][70389] Updated weights for policy 0, policy_version 2350 (0.0009) [2023-09-12 15:29:25,730][70389] Updated weights for policy 0, policy_version 2360 (0.0009) [2023-09-12 15:29:29,067][70389] Updated weights for policy 0, policy_version 2370 (0.0009) [2023-09-12 15:29:32,351][70389] Updated weights for policy 0, policy_version 2380 (0.0009) [2023-09-12 15:29:35,641][70389] Updated weights for policy 0, policy_version 2390 (0.0009) [2023-09-12 15:29:38,969][70389] Updated weights for policy 0, policy_version 2400 (0.0008) [2023-09-12 15:29:42,326][70389] Updated weights for policy 0, policy_version 2410 (0.0008) [2023-09-12 15:29:45,686][70389] Updated weights for policy 0, policy_version 2420 (0.0008) [2023-09-12 15:29:48,991][70389] Updated weights for policy 0, policy_version 2430 (0.0009) [2023-09-12 15:29:52,308][70389] Updated weights for policy 0, policy_version 2440 (0.0009) [2023-09-12 15:29:55,647][70389] Updated weights for policy 0, policy_version 2450 (0.0008) [2023-09-12 15:29:59,003][70389] Updated weights for policy 0, policy_version 2460 (0.0011) [2023-09-12 15:30:02,292][70389] Updated weights for policy 0, policy_version 2470 (0.0008) [2023-09-12 15:30:05,620][70389] Updated weights for policy 0, policy_version 2480 (0.0008) [2023-09-12 15:30:08,867][70389] Updated weights for policy 0, policy_version 2490 (0.0008) [2023-09-12 15:30:12,166][70389] Updated weights for policy 0, policy_version 2500 (0.0009) [2023-09-12 15:30:15,515][70389] Updated weights for policy 0, policy_version 2510 (0.0009) [2023-09-12 15:30:18,905][70389] Updated weights for policy 0, policy_version 2520 (0.0008) [2023-09-12 15:30:21,765][70389] Updated weights for policy 0, policy_version 2530 (0.0008) [2023-09-12 15:30:24,161][70389] Updated weights for policy 0, policy_version 2540 (0.0009) [2023-09-12 15:30:26,561][70389] Updated weights for policy 0, policy_version 2550 (0.0008) [2023-09-12 15:30:28,951][70389] Updated weights for policy 0, policy_version 2560 (0.0009) [2023-09-12 15:30:31,332][70389] Updated weights for policy 0, policy_version 2570 (0.0008) [2023-09-12 15:30:33,650][70389] Updated weights for policy 0, policy_version 2580 (0.0008) [2023-09-12 15:30:36,105][70389] Updated weights for policy 0, policy_version 2590 (0.0009) [2023-09-12 15:30:38,547][70389] Updated weights for policy 0, policy_version 2600 (0.0009) [2023-09-12 15:30:40,964][70389] Updated weights for policy 0, policy_version 2610 (0.0009) [2023-09-12 15:30:43,433][70389] Updated weights for policy 0, policy_version 2620 (0.0009) [2023-09-12 15:30:45,844][70389] Updated weights for policy 0, policy_version 2630 (0.0009) [2023-09-12 15:30:48,180][70389] Updated weights for policy 0, policy_version 2640 (0.0008) [2023-09-12 15:30:48,887][70316] Saving /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000002643_10825728.pth... [2023-09-12 15:30:48,940][70316] Removing /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000001842_7544832.pth [2023-09-12 15:30:50,585][70389] Updated weights for policy 0, policy_version 2650 (0.0008) [2023-09-12 15:30:52,985][70389] Updated weights for policy 0, policy_version 2660 (0.0009) [2023-09-12 15:30:55,374][70389] Updated weights for policy 0, policy_version 2670 (0.0008) [2023-09-12 15:30:57,713][70389] Updated weights for policy 0, policy_version 2680 (0.0008) [2023-09-12 15:31:00,036][70389] Updated weights for policy 0, policy_version 2690 (0.0008) [2023-09-12 15:31:02,403][70389] Updated weights for policy 0, policy_version 2700 (0.0009) [2023-09-12 15:31:04,732][70389] Updated weights for policy 0, policy_version 2710 (0.0009) [2023-09-12 15:31:07,452][70389] Updated weights for policy 0, policy_version 2720 (0.0008) [2023-09-12 15:31:10,819][70389] Updated weights for policy 0, policy_version 2730 (0.0008) [2023-09-12 15:31:14,152][70389] Updated weights for policy 0, policy_version 2740 (0.0009) [2023-09-12 15:31:17,431][70389] Updated weights for policy 0, policy_version 2750 (0.0009) [2023-09-12 15:31:21,028][70389] Updated weights for policy 0, policy_version 2760 (0.0010) [2023-09-12 15:31:24,725][70389] Updated weights for policy 0, policy_version 2770 (0.0010) [2023-09-12 15:31:28,147][70389] Updated weights for policy 0, policy_version 2780 (0.0009) [2023-09-12 15:31:31,540][70389] Updated weights for policy 0, policy_version 2790 (0.0009) [2023-09-12 15:31:34,861][70389] Updated weights for policy 0, policy_version 2800 (0.0008) [2023-09-12 15:31:38,223][70389] Updated weights for policy 0, policy_version 2810 (0.0009) [2023-09-12 15:31:41,501][70389] Updated weights for policy 0, policy_version 2820 (0.0009) [2023-09-12 15:31:44,821][70389] Updated weights for policy 0, policy_version 2830 (0.0008) [2023-09-12 15:31:48,140][70389] Updated weights for policy 0, policy_version 2840 (0.0008) [2023-09-12 15:31:51,409][70389] Updated weights for policy 0, policy_version 2850 (0.0008) [2023-09-12 15:31:54,816][70389] Updated weights for policy 0, policy_version 2860 (0.0008) [2023-09-12 15:31:58,150][70389] Updated weights for policy 0, policy_version 2870 (0.0008) [2023-09-12 15:32:01,460][70389] Updated weights for policy 0, policy_version 2880 (0.0008) [2023-09-12 15:32:04,770][70389] Updated weights for policy 0, policy_version 2890 (0.0009) [2023-09-12 15:32:08,138][70389] Updated weights for policy 0, policy_version 2900 (0.0009) [2023-09-12 15:32:11,449][70389] Updated weights for policy 0, policy_version 2910 (0.0008) [2023-09-12 15:32:14,752][70389] Updated weights for policy 0, policy_version 2920 (0.0008) [2023-09-12 15:32:18,084][70389] Updated weights for policy 0, policy_version 2930 (0.0009) [2023-09-12 15:32:21,557][70389] Updated weights for policy 0, policy_version 2940 (0.0009) [2023-09-12 15:32:24,833][70389] Updated weights for policy 0, policy_version 2950 (0.0009) [2023-09-12 15:32:28,177][70389] Updated weights for policy 0, policy_version 2960 (0.0009) [2023-09-12 15:32:31,492][70389] Updated weights for policy 0, policy_version 2970 (0.0009) [2023-09-12 15:32:34,894][70389] Updated weights for policy 0, policy_version 2980 (0.0008) [2023-09-12 15:32:38,239][70389] Updated weights for policy 0, policy_version 2990 (0.0008) [2023-09-12 15:32:41,485][70389] Updated weights for policy 0, policy_version 3000 (0.0009) [2023-09-12 15:32:44,801][70389] Updated weights for policy 0, policy_version 3010 (0.0008) [2023-09-12 15:32:48,120][70389] Updated weights for policy 0, policy_version 3020 (0.0008) [2023-09-12 15:32:48,841][70316] Saving /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000003022_12378112.pth... [2023-09-12 15:32:48,891][70316] Removing /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000002249_9211904.pth [2023-09-12 15:32:51,571][70389] Updated weights for policy 0, policy_version 3030 (0.0008) [2023-09-12 15:32:54,862][70389] Updated weights for policy 0, policy_version 3040 (0.0008) [2023-09-12 15:32:58,218][70389] Updated weights for policy 0, policy_version 3050 (0.0008) [2023-09-12 15:33:01,425][70389] Updated weights for policy 0, policy_version 3060 (0.0009) [2023-09-12 15:33:04,809][70389] Updated weights for policy 0, policy_version 3070 (0.0008) [2023-09-12 15:33:08,145][70389] Updated weights for policy 0, policy_version 3080 (0.0008) [2023-09-12 15:33:11,448][70389] Updated weights for policy 0, policy_version 3090 (0.0008) [2023-09-12 15:33:14,810][70389] Updated weights for policy 0, policy_version 3100 (0.0008) [2023-09-12 15:33:18,074][70389] Updated weights for policy 0, policy_version 3110 (0.0008) [2023-09-12 15:33:21,380][70389] Updated weights for policy 0, policy_version 3120 (0.0008) [2023-09-12 15:33:24,794][70389] Updated weights for policy 0, policy_version 3130 (0.0008) [2023-09-12 15:33:28,118][70389] Updated weights for policy 0, policy_version 3140 (0.0009) [2023-09-12 15:33:31,380][70389] Updated weights for policy 0, policy_version 3150 (0.0009) [2023-09-12 15:33:34,751][70389] Updated weights for policy 0, policy_version 3160 (0.0008) [2023-09-12 15:33:38,023][70389] Updated weights for policy 0, policy_version 3170 (0.0009) [2023-09-12 15:33:41,328][70389] Updated weights for policy 0, policy_version 3180 (0.0009) [2023-09-12 15:33:44,630][70389] Updated weights for policy 0, policy_version 3190 (0.0008) [2023-09-12 15:33:47,939][70389] Updated weights for policy 0, policy_version 3200 (0.0008) [2023-09-12 15:33:51,257][70389] Updated weights for policy 0, policy_version 3210 (0.0008) [2023-09-12 15:33:54,572][70389] Updated weights for policy 0, policy_version 3220 (0.0009) [2023-09-12 15:33:56,912][70389] Updated weights for policy 0, policy_version 3230 (0.0008) [2023-09-12 15:33:59,240][70389] Updated weights for policy 0, policy_version 3240 (0.0009) [2023-09-12 15:34:01,624][70389] Updated weights for policy 0, policy_version 3250 (0.0008) [2023-09-12 15:34:03,971][70389] Updated weights for policy 0, policy_version 3260 (0.0009) [2023-09-12 15:34:06,340][70389] Updated weights for policy 0, policy_version 3270 (0.0009) [2023-09-12 15:34:08,721][70389] Updated weights for policy 0, policy_version 3280 (0.0009) [2023-09-12 15:34:11,053][70389] Updated weights for policy 0, policy_version 3290 (0.0009) [2023-09-12 15:34:13,431][70389] Updated weights for policy 0, policy_version 3300 (0.0008) [2023-09-12 15:34:15,797][70389] Updated weights for policy 0, policy_version 3310 (0.0008) [2023-09-12 15:34:18,149][70389] Updated weights for policy 0, policy_version 3320 (0.0008) [2023-09-12 15:34:20,506][70389] Updated weights for policy 0, policy_version 3330 (0.0008) [2023-09-12 15:34:22,889][70389] Updated weights for policy 0, policy_version 3340 (0.0008) [2023-09-12 15:34:25,237][70389] Updated weights for policy 0, policy_version 3350 (0.0008) [2023-09-12 15:34:27,611][70389] Updated weights for policy 0, policy_version 3360 (0.0008) [2023-09-12 15:34:29,941][70389] Updated weights for policy 0, policy_version 3370 (0.0009) [2023-09-12 15:34:32,362][70389] Updated weights for policy 0, policy_version 3380 (0.0008) [2023-09-12 15:34:34,797][70389] Updated weights for policy 0, policy_version 3390 (0.0009) [2023-09-12 15:34:37,949][70389] Updated weights for policy 0, policy_version 3400 (0.0009) [2023-09-12 15:34:41,200][70389] Updated weights for policy 0, policy_version 3410 (0.0008) [2023-09-12 15:34:44,540][70389] Updated weights for policy 0, policy_version 3420 (0.0008) [2023-09-12 15:34:47,769][70389] Updated weights for policy 0, policy_version 3430 (0.0008) [2023-09-12 15:34:48,840][70316] Saving /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000003433_14061568.pth... [2023-09-12 15:34:48,890][70316] Removing /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000002643_10825728.pth [2023-09-12 15:34:51,060][70389] Updated weights for policy 0, policy_version 3440 (0.0008) [2023-09-12 15:34:54,336][70389] Updated weights for policy 0, policy_version 3450 (0.0008) [2023-09-12 15:34:57,607][70389] Updated weights for policy 0, policy_version 3460 (0.0009) [2023-09-12 15:35:00,890][70389] Updated weights for policy 0, policy_version 3470 (0.0008) [2023-09-12 15:35:04,198][70389] Updated weights for policy 0, policy_version 3480 (0.0008) [2023-09-12 15:35:07,539][70389] Updated weights for policy 0, policy_version 3490 (0.0008) [2023-09-12 15:35:10,768][70389] Updated weights for policy 0, policy_version 3500 (0.0008) [2023-09-12 15:35:14,029][70389] Updated weights for policy 0, policy_version 3510 (0.0008) [2023-09-12 15:35:17,380][70389] Updated weights for policy 0, policy_version 3520 (0.0008) [2023-09-12 15:35:20,626][70389] Updated weights for policy 0, policy_version 3530 (0.0008) [2023-09-12 15:35:23,945][70389] Updated weights for policy 0, policy_version 3540 (0.0008) [2023-09-12 15:35:27,292][70389] Updated weights for policy 0, policy_version 3550 (0.0009) [2023-09-12 15:35:30,558][70389] Updated weights for policy 0, policy_version 3560 (0.0008) [2023-09-12 15:35:33,950][70389] Updated weights for policy 0, policy_version 3570 (0.0008) [2023-09-12 15:35:37,296][70389] Updated weights for policy 0, policy_version 3580 (0.0009) [2023-09-12 15:35:40,632][70389] Updated weights for policy 0, policy_version 3590 (0.0008) [2023-09-12 15:35:43,900][70389] Updated weights for policy 0, policy_version 3600 (0.0008) [2023-09-12 15:35:47,230][70389] Updated weights for policy 0, policy_version 3610 (0.0008) [2023-09-12 15:35:50,602][70389] Updated weights for policy 0, policy_version 3620 (0.0008) [2023-09-12 15:35:53,898][70389] Updated weights for policy 0, policy_version 3630 (0.0008) [2023-09-12 15:35:57,265][70389] Updated weights for policy 0, policy_version 3640 (0.0008) [2023-09-12 15:35:58,852][70316] Saving new best policy, reward=0.063! [2023-09-12 15:36:00,486][70389] Updated weights for policy 0, policy_version 3650 (0.0008) [2023-09-12 15:36:03,878][70389] Updated weights for policy 0, policy_version 3660 (0.0009) [2023-09-12 15:36:07,152][70389] Updated weights for policy 0, policy_version 3670 (0.0008) [2023-09-12 15:36:10,443][70389] Updated weights for policy 0, policy_version 3680 (0.0009) [2023-09-12 15:36:13,715][70389] Updated weights for policy 0, policy_version 3690 (0.0008) [2023-09-12 15:36:16,912][70389] Updated weights for policy 0, policy_version 3700 (0.0008) [2023-09-12 15:36:20,225][70389] Updated weights for policy 0, policy_version 3710 (0.0008) [2023-09-12 15:36:23,615][70389] Updated weights for policy 0, policy_version 3720 (0.0009) [2023-09-12 15:36:26,865][70389] Updated weights for policy 0, policy_version 3730 (0.0008) [2023-09-12 15:36:30,189][70389] Updated weights for policy 0, policy_version 3740 (0.0008) [2023-09-12 15:36:33,532][70389] Updated weights for policy 0, policy_version 3750 (0.0008) [2023-09-12 15:36:36,776][70389] Updated weights for policy 0, policy_version 3760 (0.0008) [2023-09-12 15:36:40,059][70389] Updated weights for policy 0, policy_version 3770 (0.0008) [2023-09-12 15:36:43,462][70389] Updated weights for policy 0, policy_version 3780 (0.0009) [2023-09-12 15:36:46,763][70389] Updated weights for policy 0, policy_version 3790 (0.0009) [2023-09-12 15:36:48,842][70316] Saving /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000003796_15548416.pth... [2023-09-12 15:36:48,891][70316] Removing /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000003022_12378112.pth [2023-09-12 15:36:50,036][70389] Updated weights for policy 0, policy_version 3800 (0.0008) [2023-09-12 15:36:53,457][70389] Updated weights for policy 0, policy_version 3810 (0.0009) [2023-09-12 15:36:56,771][70389] Updated weights for policy 0, policy_version 3820 (0.0008) [2023-09-12 15:37:00,122][70389] Updated weights for policy 0, policy_version 3830 (0.0008) [2023-09-12 15:37:03,428][70389] Updated weights for policy 0, policy_version 3840 (0.0008) [2023-09-12 15:37:06,767][70389] Updated weights for policy 0, policy_version 3850 (0.0008) [2023-09-12 15:37:09,975][70389] Updated weights for policy 0, policy_version 3860 (0.0008) [2023-09-12 15:37:13,319][70389] Updated weights for policy 0, policy_version 3870 (0.0009) [2023-09-12 15:37:16,576][70389] Updated weights for policy 0, policy_version 3880 (0.0008) [2023-09-12 15:37:19,838][70389] Updated weights for policy 0, policy_version 3890 (0.0009) [2023-09-12 15:37:23,195][70389] Updated weights for policy 0, policy_version 3900 (0.0008) [2023-09-12 15:37:26,008][70389] Updated weights for policy 0, policy_version 3910 (0.0008) [2023-09-12 15:37:28,409][70389] Updated weights for policy 0, policy_version 3920 (0.0008) [2023-09-12 15:37:30,775][70389] Updated weights for policy 0, policy_version 3930 (0.0008) [2023-09-12 15:37:33,126][70389] Updated weights for policy 0, policy_version 3940 (0.0009) [2023-09-12 15:37:35,538][70389] Updated weights for policy 0, policy_version 3950 (0.0009) [2023-09-12 15:37:37,853][70389] Updated weights for policy 0, policy_version 3960 (0.0008) [2023-09-12 15:37:40,209][70389] Updated weights for policy 0, policy_version 3970 (0.0008) [2023-09-12 15:37:42,581][70389] Updated weights for policy 0, policy_version 3980 (0.0008) [2023-09-12 15:37:44,981][70389] Updated weights for policy 0, policy_version 3990 (0.0008) [2023-09-12 15:37:47,397][70389] Updated weights for policy 0, policy_version 4000 (0.0008) [2023-09-12 15:37:49,712][70389] Updated weights for policy 0, policy_version 4010 (0.0009) [2023-09-12 15:37:52,131][70389] Updated weights for policy 0, policy_version 4020 (0.0009) [2023-09-12 15:37:53,840][70316] Saving new best policy, reward=0.070! [2023-09-12 15:37:54,564][70389] Updated weights for policy 0, policy_version 4030 (0.0009) [2023-09-12 15:37:56,983][70389] Updated weights for policy 0, policy_version 4040 (0.0009) [2023-09-12 15:37:58,893][70316] Saving new best policy, reward=0.167! [2023-09-12 15:37:59,375][70389] Updated weights for policy 0, policy_version 4050 (0.0009) [2023-09-12 15:38:01,730][70389] Updated weights for policy 0, policy_version 4060 (0.0008) [2023-09-12 15:38:04,674][70389] Updated weights for policy 0, policy_version 4070 (0.0008) [2023-09-12 15:38:07,919][70389] Updated weights for policy 0, policy_version 4080 (0.0009) [2023-09-12 15:38:11,241][70389] Updated weights for policy 0, policy_version 4090 (0.0009) [2023-09-12 15:38:14,586][70389] Updated weights for policy 0, policy_version 4100 (0.0009) [2023-09-12 15:38:17,877][70389] Updated weights for policy 0, policy_version 4110 (0.0008) [2023-09-12 15:38:21,163][70389] Updated weights for policy 0, policy_version 4120 (0.0008) [2023-09-12 15:38:24,554][70389] Updated weights for policy 0, policy_version 4130 (0.0008) [2023-09-12 15:38:27,930][70389] Updated weights for policy 0, policy_version 4140 (0.0008) [2023-09-12 15:38:31,168][70389] Updated weights for policy 0, policy_version 4150 (0.0008) [2023-09-12 15:38:34,615][70389] Updated weights for policy 0, policy_version 4160 (0.0009) [2023-09-12 15:38:37,936][70389] Updated weights for policy 0, policy_version 4170 (0.0010) [2023-09-12 15:38:41,632][70389] Updated weights for policy 0, policy_version 4180 (0.0010) [2023-09-12 15:38:43,837][70316] Saving new best policy, reward=0.181! [2023-09-12 15:38:45,037][70389] Updated weights for policy 0, policy_version 4190 (0.0010) [2023-09-12 15:38:48,556][70389] Updated weights for policy 0, policy_version 4200 (0.0009) [2023-09-12 15:38:48,900][70316] Saving /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000004201_17207296.pth... [2023-09-12 15:38:48,948][70316] Removing /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000003433_14061568.pth [2023-09-12 15:38:51,953][70389] Updated weights for policy 0, policy_version 4210 (0.0010) [2023-09-12 15:38:55,576][70389] Updated weights for policy 0, policy_version 4220 (0.0008) [2023-09-12 15:38:58,842][70316] Saving new best policy, reward=0.189! [2023-09-12 15:38:59,068][70389] Updated weights for policy 0, policy_version 4230 (0.0009) [2023-09-12 15:39:02,527][70389] Updated weights for policy 0, policy_version 4240 (0.0010) [2023-09-12 15:39:03,886][70316] Saving new best policy, reward=0.291! [2023-09-12 15:39:05,859][70389] Updated weights for policy 0, policy_version 4250 (0.0009) [2023-09-12 15:39:08,843][70316] Saving new best policy, reward=0.381! [2023-09-12 15:39:09,384][70389] Updated weights for policy 0, policy_version 4260 (0.0009) [2023-09-12 15:39:12,941][70389] Updated weights for policy 0, policy_version 4270 (0.0010) [2023-09-12 15:39:13,836][70316] Saving new best policy, reward=0.422! [2023-09-12 15:39:16,368][70389] Updated weights for policy 0, policy_version 4280 (0.0009) [2023-09-12 15:39:18,841][70316] Saving new best policy, reward=0.423! [2023-09-12 15:39:19,744][70389] Updated weights for policy 0, policy_version 4290 (0.0010) [2023-09-12 15:39:23,295][70389] Updated weights for policy 0, policy_version 4300 (0.0010) [2023-09-12 15:39:26,752][70389] Updated weights for policy 0, policy_version 4310 (0.0008) [2023-09-12 15:39:30,100][70389] Updated weights for policy 0, policy_version 4320 (0.0008) [2023-09-12 15:39:33,779][70389] Updated weights for policy 0, policy_version 4330 (0.0009) [2023-09-12 15:39:37,261][70389] Updated weights for policy 0, policy_version 4340 (0.0009) [2023-09-12 15:39:40,778][70389] Updated weights for policy 0, policy_version 4350 (0.0009) [2023-09-12 15:39:44,170][70389] Updated weights for policy 0, policy_version 4360 (0.0008) [2023-09-12 15:39:47,720][70389] Updated weights for policy 0, policy_version 4370 (0.0009) [2023-09-12 15:39:51,102][70389] Updated weights for policy 0, policy_version 4380 (0.0008) [2023-09-12 15:39:53,839][70316] Saving new best policy, reward=0.575! [2023-09-12 15:39:54,489][70389] Updated weights for policy 0, policy_version 4390 (0.0008) [2023-09-12 15:39:57,774][70389] Updated weights for policy 0, policy_version 4400 (0.0009) [2023-09-12 15:40:01,161][70389] Updated weights for policy 0, policy_version 4410 (0.0009) [2023-09-12 15:40:04,454][70389] Updated weights for policy 0, policy_version 4420 (0.0008) [2023-09-12 15:40:07,768][70389] Updated weights for policy 0, policy_version 4430 (0.0008) [2023-09-12 15:40:11,102][70389] Updated weights for policy 0, policy_version 4440 (0.0009) [2023-09-12 15:40:14,413][70389] Updated weights for policy 0, policy_version 4450 (0.0009) [2023-09-12 15:40:17,742][70389] Updated weights for policy 0, policy_version 4460 (0.0009) [2023-09-12 15:40:21,080][70389] Updated weights for policy 0, policy_version 4470 (0.0009) [2023-09-12 15:40:24,461][70389] Updated weights for policy 0, policy_version 4480 (0.0009) [2023-09-12 15:40:27,962][70389] Updated weights for policy 0, policy_version 4490 (0.0009) [2023-09-12 15:40:31,312][70389] Updated weights for policy 0, policy_version 4500 (0.0009) [2023-09-12 15:40:34,771][70389] Updated weights for policy 0, policy_version 4510 (0.0009) [2023-09-12 15:40:35,756][70388] EvtLoop [rollout_proc1_evt_loop, process=rollout_proc1] unhandled exception in slot='advance_rollouts' connected to emitter=Emitter(object_id='InferenceWorker_p0-w0', signal_name='advance1'), args=(0, 0) Traceback (most recent call last): File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/signal_slot/signal_slot.py", line 355, in _process_signal slot_callable(*args) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sample_factory/algo/sampling/rollout_worker.py", line 241, in advance_rollouts complete_rollouts, episodic_stats = runner.advance_rollouts(policy_id, self.timing) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sample_factory/algo/sampling/non_batched_sampling.py", line 634, in advance_rollouts new_obs, rewards, terminated, truncated, infos = e.step(actions) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/gymnasium/core.py", line 461, in step return self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sample_factory/algo/utils/make_env.py", line 129, in step obs, rew, terminated, truncated, info = self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sample_factory/algo/utils/make_env.py", line 115, in step obs, rew, terminated, truncated, info = self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/gymnasium/core.py", line 522, in step observation, reward, terminated, truncated, info = self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sample_factory/envs/env_wrappers.py", line 86, in step obs, reward, terminated, truncated, info = self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/gymnasium/core.py", line 461, in step return self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sf_examples/vizdoom/doom/wrappers/multiplayer_stats.py", line 54, in step obs, reward, terminated, truncated, info = self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sf_examples/vizdoom/doom/doom_gym.py", line 452, in step reward = self.game.make_action(actions_flattened, self.skip_frames) vizdoom.vizdoom.SignalException: Signal SIGINT received. ViZDoom instance has been closed. [2023-09-12 15:40:35,757][70388] Unhandled exception Signal SIGINT received. ViZDoom instance has been closed. in evt loop rollout_proc1_evt_loop [2023-09-12 15:40:35,756][70433] EvtLoop [rollout_proc7_evt_loop, process=rollout_proc7] unhandled exception in slot='advance_rollouts' connected to emitter=Emitter(object_id='InferenceWorker_p0-w0', signal_name='advance7'), args=(0, 0) Traceback (most recent call last): File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/signal_slot/signal_slot.py", line 355, in _process_signal slot_callable(*args) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sample_factory/algo/sampling/rollout_worker.py", line 241, in advance_rollouts complete_rollouts, episodic_stats = runner.advance_rollouts(policy_id, self.timing) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sample_factory/algo/sampling/non_batched_sampling.py", line 634, in advance_rollouts new_obs, rewards, terminated, truncated, infos = e.step(actions) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/gymnasium/core.py", line 461, in step return self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sample_factory/algo/utils/make_env.py", line 129, in step obs, rew, terminated, truncated, info = self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sample_factory/algo/utils/make_env.py", line 115, in step obs, rew, terminated, truncated, info = self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/gymnasium/core.py", line 522, in step observation, reward, terminated, truncated, info = self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sample_factory/envs/env_wrappers.py", line 86, in step obs, reward, terminated, truncated, info = self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/gymnasium/core.py", line 461, in step return self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sf_examples/vizdoom/doom/wrappers/multiplayer_stats.py", line 54, in step obs, reward, terminated, truncated, info = self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sf_examples/vizdoom/doom/doom_gym.py", line 452, in step reward = self.game.make_action(actions_flattened, self.skip_frames) vizdoom.vizdoom.SignalException: Signal SIGINT received. ViZDoom instance has been closed. [2023-09-12 15:40:35,758][70433] Unhandled exception Signal SIGINT received. ViZDoom instance has been closed. in evt loop rollout_proc7_evt_loop [2023-09-12 15:40:35,758][70429] EvtLoop [rollout_proc4_evt_loop, process=rollout_proc4] unhandled exception in slot='advance_rollouts' connected to emitter=Emitter(object_id='InferenceWorker_p0-w0', signal_name='advance4'), args=(0, 0) Traceback (most recent call last): File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/signal_slot/signal_slot.py", line 355, in _process_signal slot_callable(*args) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sample_factory/algo/sampling/rollout_worker.py", line 241, in advance_rollouts complete_rollouts, episodic_stats = runner.advance_rollouts(policy_id, self.timing) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sample_factory/algo/sampling/non_batched_sampling.py", line 634, in advance_rollouts new_obs, rewards, terminated, truncated, infos = e.step(actions) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/gymnasium/core.py", line 461, in step return self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sample_factory/algo/utils/make_env.py", line 129, in step obs, rew, terminated, truncated, info = self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sample_factory/algo/utils/make_env.py", line 115, in step obs, rew, terminated, truncated, info = self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/gymnasium/core.py", line 522, in step observation, reward, terminated, truncated, info = self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sample_factory/envs/env_wrappers.py", line 86, in step obs, reward, terminated, truncated, info = self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/gymnasium/core.py", line 461, in step return self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sf_examples/vizdoom/doom/wrappers/multiplayer_stats.py", line 54, in step obs, reward, terminated, truncated, info = self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sf_examples/vizdoom/doom/doom_gym.py", line 452, in step reward = self.game.make_action(actions_flattened, self.skip_frames) vizdoom.vizdoom.SignalException: Signal SIGINT received. ViZDoom instance has been closed. [2023-09-12 15:40:35,759][70429] Unhandled exception Signal SIGINT received. ViZDoom instance has been closed. in evt loop rollout_proc4_evt_loop [2023-09-12 15:40:35,759][70392] EvtLoop [rollout_proc2_evt_loop, process=rollout_proc2] unhandled exception in slot='advance_rollouts' connected to emitter=Emitter(object_id='InferenceWorker_p0-w0', signal_name='advance2'), args=(1, 0) Traceback (most recent call last): File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/signal_slot/signal_slot.py", line 355, in _process_signal slot_callable(*args) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sample_factory/algo/sampling/rollout_worker.py", line 241, in advance_rollouts complete_rollouts, episodic_stats = runner.advance_rollouts(policy_id, self.timing) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sample_factory/algo/sampling/non_batched_sampling.py", line 634, in advance_rollouts new_obs, rewards, terminated, truncated, infos = e.step(actions) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/gymnasium/core.py", line 461, in step return self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sample_factory/algo/utils/make_env.py", line 129, in step obs, rew, terminated, truncated, info = self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sample_factory/algo/utils/make_env.py", line 115, in step obs, rew, terminated, truncated, info = self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/gymnasium/core.py", line 522, in step observation, reward, terminated, truncated, info = self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sample_factory/envs/env_wrappers.py", line 86, in step obs, reward, terminated, truncated, info = self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/gymnasium/core.py", line 461, in step return self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sf_examples/vizdoom/doom/wrappers/multiplayer_stats.py", line 54, in step obs, reward, terminated, truncated, info = self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sf_examples/vizdoom/doom/doom_gym.py", line 452, in step reward = self.game.make_action(actions_flattened, self.skip_frames) vizdoom.vizdoom.SignalException: Signal SIGINT received. ViZDoom instance has been closed. [2023-09-12 15:40:35,760][70392] Unhandled exception Signal SIGINT received. ViZDoom instance has been closed. in evt loop rollout_proc2_evt_loop [2023-09-12 15:40:35,759][70391] EvtLoop [rollout_proc3_evt_loop, process=rollout_proc3] unhandled exception in slot='advance_rollouts' connected to emitter=Emitter(object_id='InferenceWorker_p0-w0', signal_name='advance3'), args=(0, 0) Traceback (most recent call last): File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/signal_slot/signal_slot.py", line 355, in _process_signal slot_callable(*args) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sample_factory/algo/sampling/rollout_worker.py", line 241, in advance_rollouts complete_rollouts, episodic_stats = runner.advance_rollouts(policy_id, self.timing) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sample_factory/algo/sampling/non_batched_sampling.py", line 634, in advance_rollouts new_obs, rewards, terminated, truncated, infos = e.step(actions) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/gymnasium/core.py", line 461, in step return self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sample_factory/algo/utils/make_env.py", line 129, in step obs, rew, terminated, truncated, info = self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sample_factory/algo/utils/make_env.py", line 115, in step obs, rew, terminated, truncated, info = self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/gymnasium/core.py", line 522, in step observation, reward, terminated, truncated, info = self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sample_factory/envs/env_wrappers.py", line 86, in step obs, reward, terminated, truncated, info = self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/gymnasium/core.py", line 461, in step return self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sf_examples/vizdoom/doom/wrappers/multiplayer_stats.py", line 54, in step obs, reward, terminated, truncated, info = self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sf_examples/vizdoom/doom/doom_gym.py", line 452, in step reward = self.game.make_action(actions_flattened, self.skip_frames) vizdoom.vizdoom.SignalException: Signal SIGINT received. ViZDoom instance has been closed. [2023-09-12 15:40:35,761][70391] Unhandled exception Signal SIGINT received. ViZDoom instance has been closed. in evt loop rollout_proc3_evt_loop [2023-09-12 15:40:35,760][70434] EvtLoop [rollout_proc6_evt_loop, process=rollout_proc6] unhandled exception in slot='advance_rollouts' connected to emitter=Emitter(object_id='InferenceWorker_p0-w0', signal_name='advance6'), args=(0, 0) Traceback (most recent call last): File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/signal_slot/signal_slot.py", line 355, in _process_signal slot_callable(*args) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sample_factory/algo/sampling/rollout_worker.py", line 241, in advance_rollouts complete_rollouts, episodic_stats = runner.advance_rollouts(policy_id, self.timing) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sample_factory/algo/sampling/non_batched_sampling.py", line 634, in advance_rollouts new_obs, rewards, terminated, truncated, infos = e.step(actions) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/gymnasium/core.py", line 461, in step return self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sample_factory/algo/utils/make_env.py", line 129, in step obs, rew, terminated, truncated, info = self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sample_factory/algo/utils/make_env.py", line 115, in step obs, rew, terminated, truncated, info = self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/gymnasium/core.py", line 522, in step observation, reward, terminated, truncated, info = self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sample_factory/envs/env_wrappers.py", line 86, in step obs, reward, terminated, truncated, info = self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/gymnasium/core.py", line 461, in step return self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sf_examples/vizdoom/doom/wrappers/multiplayer_stats.py", line 54, in step obs, reward, terminated, truncated, info = self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sf_examples/vizdoom/doom/doom_gym.py", line 452, in step reward = self.game.make_action(actions_flattened, self.skip_frames) vizdoom.vizdoom.SignalException: Signal SIGINT received. ViZDoom instance has been closed. [2023-09-12 15:40:35,761][70434] Unhandled exception Signal SIGINT received. ViZDoom instance has been closed. in evt loop rollout_proc6_evt_loop [2023-09-12 15:40:35,761][70390] EvtLoop [rollout_proc0_evt_loop, process=rollout_proc0] unhandled exception in slot='advance_rollouts' connected to emitter=Emitter(object_id='InferenceWorker_p0-w0', signal_name='advance0'), args=(1, 0) Traceback (most recent call last): File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/signal_slot/signal_slot.py", line 355, in _process_signal slot_callable(*args) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sample_factory/algo/sampling/rollout_worker.py", line 241, in advance_rollouts complete_rollouts, episodic_stats = runner.advance_rollouts(policy_id, self.timing) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sample_factory/algo/sampling/non_batched_sampling.py", line 634, in advance_rollouts new_obs, rewards, terminated, truncated, infos = e.step(actions) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/gymnasium/core.py", line 461, in step return self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sample_factory/algo/utils/make_env.py", line 129, in step obs, rew, terminated, truncated, info = self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sample_factory/algo/utils/make_env.py", line 115, in step obs, rew, terminated, truncated, info = self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/gymnasium/core.py", line 522, in step observation, reward, terminated, truncated, info = self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sample_factory/envs/env_wrappers.py", line 86, in step obs, reward, terminated, truncated, info = self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/gymnasium/core.py", line 461, in step return self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sf_examples/vizdoom/doom/wrappers/multiplayer_stats.py", line 54, in step obs, reward, terminated, truncated, info = self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sf_examples/vizdoom/doom/doom_gym.py", line 452, in step reward = self.game.make_action(actions_flattened, self.skip_frames) vizdoom.vizdoom.SignalException: Signal SIGINT received. ViZDoom instance has been closed. [2023-09-12 15:40:35,763][70390] Unhandled exception Signal SIGINT received. ViZDoom instance has been closed. in evt loop rollout_proc0_evt_loop [2023-09-12 15:40:35,757][70393] EvtLoop [rollout_proc5_evt_loop, process=rollout_proc5] unhandled exception in slot='advance_rollouts' connected to emitter=Emitter(object_id='InferenceWorker_p0-w0', signal_name='advance5'), args=(1, 0) Traceback (most recent call last): File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/signal_slot/signal_slot.py", line 355, in _process_signal slot_callable(*args) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sample_factory/algo/sampling/rollout_worker.py", line 241, in advance_rollouts complete_rollouts, episodic_stats = runner.advance_rollouts(policy_id, self.timing) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sample_factory/algo/sampling/non_batched_sampling.py", line 634, in advance_rollouts new_obs, rewards, terminated, truncated, infos = e.step(actions) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/gymnasium/core.py", line 461, in step return self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sample_factory/algo/utils/make_env.py", line 129, in step obs, rew, terminated, truncated, info = self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sample_factory/algo/utils/make_env.py", line 115, in step obs, rew, terminated, truncated, info = self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/gymnasium/core.py", line 522, in step observation, reward, terminated, truncated, info = self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sample_factory/envs/env_wrappers.py", line 86, in step obs, reward, terminated, truncated, info = self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/gymnasium/core.py", line 461, in step return self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sf_examples/vizdoom/doom/wrappers/multiplayer_stats.py", line 54, in step obs, reward, terminated, truncated, info = self.env.step(action) File "/home/cogstack/.local/share/virtualenvs/sample_factory--NQNquiM/lib/python3.10/site-packages/sf_examples/vizdoom/doom/doom_gym.py", line 452, in step reward = self.game.make_action(actions_flattened, self.skip_frames) vizdoom.vizdoom.SignalException: Signal SIGINT received. ViZDoom instance has been closed. [2023-09-12 15:40:35,766][70393] Unhandled exception Signal SIGINT received. ViZDoom instance has been closed. in evt loop rollout_proc5_evt_loop [2023-09-12 15:40:35,800][70316] Stopping Batcher_0... [2023-09-12 15:40:35,800][70316] Loop batcher_evt_loop terminating... [2023-09-12 15:40:35,801][70316] Saving /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000004513_18485248.pth... [2023-09-12 15:40:35,852][70389] Weights refcount: 2 0 [2023-09-12 15:40:35,853][70389] Stopping InferenceWorker_p0-w0... [2023-09-12 15:40:35,853][70389] Loop inference_proc0-0_evt_loop terminating... [2023-09-12 15:40:35,870][70316] Removing /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000003796_15548416.pth [2023-09-12 15:40:35,880][70316] Stopping LearnerWorker_p0... [2023-09-12 15:40:35,881][70316] Loop learner_proc0_evt_loop terminating... [2023-09-12 15:51:21,821][89073] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-09-12 15:51:21,821][89073] Set environment var CUDA_VISIBLE_DEVICES to '0' (GPU indices [0]) for learning process 0 [2023-09-12 15:51:21,840][89073] Num visible devices: 1 [2023-09-12 15:51:21,859][89073] Starting seed is not provided [2023-09-12 15:51:21,860][89073] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-09-12 15:51:21,860][89073] Initializing actor-critic model on device cuda:0 [2023-09-12 15:51:21,860][89073] RunningMeanStd input shape: (3, 72, 128) [2023-09-12 15:51:21,861][89073] RunningMeanStd input shape: (1,) [2023-09-12 15:51:21,874][89073] ConvEncoder: input_channels=3 [2023-09-12 15:51:22,016][89073] Conv encoder output size: 512 [2023-09-12 15:51:22,017][89073] Policy head output size: 512 [2023-09-12 15:51:22,040][89073] Created Actor Critic model with architecture: [2023-09-12 15:51:22,041][89073] ActorCriticSharedWeights( (obs_normalizer): ObservationNormalizer( (running_mean_std): RunningMeanStdDictInPlace( (running_mean_std): ModuleDict( (obs): RunningMeanStdInPlace() ) ) ) (returns_normalizer): RecursiveScriptModule(original_name=RunningMeanStdInPlace) (encoder): VizdoomEncoder( (basic_encoder): ConvEncoder( (enc): RecursiveScriptModule( original_name=ConvEncoderImpl (conv_head): RecursiveScriptModule( original_name=Sequential (0): RecursiveScriptModule(original_name=Conv2d) (1): RecursiveScriptModule(original_name=ELU) (2): RecursiveScriptModule(original_name=Conv2d) (3): RecursiveScriptModule(original_name=ELU) (4): RecursiveScriptModule(original_name=Conv2d) (5): RecursiveScriptModule(original_name=ELU) ) (mlp_layers): RecursiveScriptModule( original_name=Sequential (0): RecursiveScriptModule(original_name=Linear) (1): RecursiveScriptModule(original_name=ELU) ) ) ) ) (core): ModelCoreRNN( (core): GRU(512, 512) ) (decoder): MlpDecoder( (mlp): Identity() ) (critic_linear): Linear(in_features=512, out_features=1, bias=True) (action_parameterization): ActionParameterizationDefault( (distribution_linear): Linear(in_features=512, out_features=6, bias=True) ) ) [2023-09-12 15:51:23,253][89073] Using optimizer [2023-09-12 15:51:23,253][89073] Loading state from checkpoint /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000004513_18485248.pth... [2023-09-12 15:51:23,288][89073] Loading model from checkpoint [2023-09-12 15:51:23,293][89073] Loaded experiment state at self.train_step=4513, self.env_steps=18485248 [2023-09-12 15:51:23,294][89073] Initialized policy 0 weights for model version 4513 [2023-09-12 15:51:23,296][89073] LearnerWorker_p0 finished initialization! [2023-09-12 15:51:23,296][89073] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-09-12 15:51:23,885][89377] Worker 2 uses CPU cores [8, 9, 10, 11] [2023-09-12 15:51:23,911][89379] Worker 3 uses CPU cores [12, 13, 14, 15] [2023-09-12 15:51:23,919][89380] Worker 4 uses CPU cores [16, 17, 18, 19] [2023-09-12 15:51:23,984][89382] Worker 6 uses CPU cores [24, 25, 26, 27] [2023-09-12 15:51:24,023][89378] Worker 1 uses CPU cores [4, 5, 6, 7] [2023-09-12 15:51:24,088][89375] Worker 0 uses CPU cores [0, 1, 2, 3] [2023-09-12 15:51:24,144][89374] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-09-12 15:51:24,144][89374] Set environment var CUDA_VISIBLE_DEVICES to '0' (GPU indices [0]) for inference process 0 [2023-09-12 15:51:24,186][89374] Num visible devices: 1 [2023-09-12 15:51:24,252][89381] Worker 5 uses CPU cores [20, 21, 22, 23] [2023-09-12 15:51:24,435][89383] Worker 7 uses CPU cores [28, 29, 30, 31] [2023-09-12 15:51:24,863][89374] RunningMeanStd input shape: (3, 72, 128) [2023-09-12 15:51:24,864][89374] RunningMeanStd input shape: (1,) [2023-09-12 15:51:24,875][89374] ConvEncoder: input_channels=3 [2023-09-12 15:51:24,975][89374] Conv encoder output size: 512 [2023-09-12 15:51:24,976][89374] Policy head output size: 512 [2023-09-12 15:51:25,300][89379] Doom resolution: 160x120, resize resolution: (128, 72) [2023-09-12 15:51:25,300][89383] Doom resolution: 160x120, resize resolution: (128, 72) [2023-09-12 15:51:25,300][89380] Doom resolution: 160x120, resize resolution: (128, 72) [2023-09-12 15:51:25,310][89375] Doom resolution: 160x120, resize resolution: (128, 72) [2023-09-12 15:51:25,319][89382] Doom resolution: 160x120, resize resolution: (128, 72) [2023-09-12 15:51:25,319][89377] Doom resolution: 160x120, resize resolution: (128, 72) [2023-09-12 15:51:25,319][89381] Doom resolution: 160x120, resize resolution: (128, 72) [2023-09-12 15:51:25,319][89378] Doom resolution: 160x120, resize resolution: (128, 72) [2023-09-12 15:51:25,605][89383] Decorrelating experience for 0 frames... [2023-09-12 15:51:25,693][89380] Decorrelating experience for 0 frames... [2023-09-12 15:51:25,725][89375] Decorrelating experience for 0 frames... [2023-09-12 15:51:25,782][89381] Decorrelating experience for 0 frames... [2023-09-12 15:51:25,874][89383] Decorrelating experience for 32 frames... [2023-09-12 15:51:25,962][89380] Decorrelating experience for 32 frames... [2023-09-12 15:51:26,002][89375] Decorrelating experience for 32 frames... [2023-09-12 15:51:26,016][89377] Decorrelating experience for 0 frames... [2023-09-12 15:51:26,055][89381] Decorrelating experience for 32 frames... [2023-09-12 15:51:26,100][89378] Decorrelating experience for 0 frames... [2023-09-12 15:51:26,344][89377] Decorrelating experience for 32 frames... [2023-09-12 15:51:26,345][89382] Decorrelating experience for 0 frames... [2023-09-12 15:51:26,351][89375] Decorrelating experience for 64 frames... [2023-09-12 15:51:26,373][89380] Decorrelating experience for 64 frames... [2023-09-12 15:51:26,612][89382] Decorrelating experience for 32 frames... [2023-09-12 15:51:26,622][89379] Decorrelating experience for 0 frames... [2023-09-12 15:51:26,625][89378] Decorrelating experience for 32 frames... [2023-09-12 15:51:26,656][89381] Decorrelating experience for 64 frames... [2023-09-12 15:51:26,679][89380] Decorrelating experience for 96 frames... [2023-09-12 15:51:26,888][89379] Decorrelating experience for 32 frames... [2023-09-12 15:51:26,969][89382] Decorrelating experience for 64 frames... [2023-09-12 15:51:27,032][89383] Decorrelating experience for 64 frames... [2023-09-12 15:51:27,033][89378] Decorrelating experience for 64 frames... [2023-09-12 15:51:27,279][89377] Decorrelating experience for 64 frames... [2023-09-12 15:51:27,287][89382] Decorrelating experience for 96 frames... [2023-09-12 15:51:27,293][89379] Decorrelating experience for 64 frames... [2023-09-12 15:51:27,310][89381] Decorrelating experience for 96 frames... [2023-09-12 15:51:27,478][89383] Decorrelating experience for 96 frames... [2023-09-12 15:51:27,605][89379] Decorrelating experience for 96 frames... [2023-09-12 15:51:27,650][89377] Decorrelating experience for 96 frames... [2023-09-12 15:51:27,652][89375] Decorrelating experience for 96 frames... [2023-09-12 15:51:27,925][89378] Decorrelating experience for 96 frames... [2023-09-12 15:51:28,229][89073] Signal inference workers to stop experience collection... [2023-09-12 15:51:28,236][89374] InferenceWorker_p0-w0: stopping experience collection [2023-09-12 15:51:31,722][89073] Signal inference workers to resume experience collection... [2023-09-12 15:51:31,723][89374] InferenceWorker_p0-w0: resuming experience collection [2023-09-12 15:51:32,641][89073] Saving new best policy, reward=0.981! [2023-09-12 15:51:34,671][89374] Updated weights for policy 0, policy_version 4523 (0.0376) [2023-09-12 15:51:37,207][89374] Updated weights for policy 0, policy_version 4533 (0.0009) [2023-09-12 15:51:39,664][89374] Updated weights for policy 0, policy_version 4543 (0.0009) [2023-09-12 15:51:42,780][89374] Updated weights for policy 0, policy_version 4553 (0.0009) [2023-09-12 15:51:46,248][89374] Updated weights for policy 0, policy_version 4563 (0.0008) [2023-09-12 15:51:49,484][89374] Updated weights for policy 0, policy_version 4573 (0.0009) [2023-09-12 15:51:52,986][89374] Updated weights for policy 0, policy_version 4583 (0.0009) [2023-09-12 15:51:56,495][89374] Updated weights for policy 0, policy_version 4593 (0.0008) [2023-09-12 15:51:59,811][89374] Updated weights for policy 0, policy_version 4603 (0.0008) [2023-09-12 15:52:03,258][89374] Updated weights for policy 0, policy_version 4613 (0.0009) [2023-09-12 15:52:06,683][89374] Updated weights for policy 0, policy_version 4623 (0.0009) [2023-09-12 15:52:10,130][89374] Updated weights for policy 0, policy_version 4633 (0.0009) [2023-09-12 15:52:13,617][89374] Updated weights for policy 0, policy_version 4643 (0.0010) [2023-09-12 15:52:17,108][89374] Updated weights for policy 0, policy_version 4653 (0.0009) [2023-09-12 15:52:20,646][89374] Updated weights for policy 0, policy_version 4663 (0.0010) [2023-09-12 15:52:23,950][89374] Updated weights for policy 0, policy_version 4673 (0.0009) [2023-09-12 15:52:27,408][89374] Updated weights for policy 0, policy_version 4683 (0.0010) [2023-09-12 15:52:30,828][89374] Updated weights for policy 0, policy_version 4693 (0.0009) [2023-09-12 15:52:34,240][89374] Updated weights for policy 0, policy_version 4703 (0.0009) [2023-09-12 15:52:37,683][89374] Updated weights for policy 0, policy_version 4713 (0.0010) [2023-09-12 15:52:41,020][89374] Updated weights for policy 0, policy_version 4723 (0.0010) [2023-09-12 15:52:44,482][89374] Updated weights for policy 0, policy_version 4733 (0.0009) [2023-09-12 15:52:47,975][89374] Updated weights for policy 0, policy_version 4743 (0.0009) [2023-09-12 15:52:51,430][89374] Updated weights for policy 0, policy_version 4753 (0.0009) [2023-09-12 15:52:54,914][89374] Updated weights for policy 0, policy_version 4763 (0.0008) [2023-09-12 15:52:58,332][89374] Updated weights for policy 0, policy_version 4773 (0.0008) [2023-09-12 15:53:01,784][89374] Updated weights for policy 0, policy_version 4783 (0.0009) [2023-09-12 15:53:05,071][89374] Updated weights for policy 0, policy_version 4793 (0.0009) [2023-09-12 15:53:08,460][89374] Updated weights for policy 0, policy_version 4803 (0.0010) [2023-09-12 15:53:11,931][89374] Updated weights for policy 0, policy_version 4813 (0.0009) [2023-09-12 15:53:15,310][89374] Updated weights for policy 0, policy_version 4823 (0.0009) [2023-09-12 15:53:16,781][89073] Saving /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000004827_19771392.pth... [2023-09-12 15:53:16,834][89073] Removing /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000004201_17207296.pth [2023-09-12 15:53:18,828][89374] Updated weights for policy 0, policy_version 4833 (0.0013) [2023-09-12 15:53:22,325][89374] Updated weights for policy 0, policy_version 4843 (0.0010) [2023-09-12 15:53:25,856][89374] Updated weights for policy 0, policy_version 4853 (0.0009) [2023-09-12 15:53:29,275][89374] Updated weights for policy 0, policy_version 4863 (0.0010) [2023-09-12 15:53:32,661][89374] Updated weights for policy 0, policy_version 4873 (0.0009) [2023-09-12 15:53:36,195][89374] Updated weights for policy 0, policy_version 4883 (0.0009) [2023-09-12 15:53:39,635][89374] Updated weights for policy 0, policy_version 4893 (0.0008) [2023-09-12 15:53:43,076][89374] Updated weights for policy 0, policy_version 4903 (0.0009) [2023-09-12 15:53:46,572][89374] Updated weights for policy 0, policy_version 4913 (0.0009) [2023-09-12 15:53:50,010][89374] Updated weights for policy 0, policy_version 4923 (0.0009) [2023-09-12 15:53:53,633][89374] Updated weights for policy 0, policy_version 4933 (0.0010) [2023-09-12 15:53:56,975][89374] Updated weights for policy 0, policy_version 4943 (0.0009) [2023-09-12 15:54:00,374][89374] Updated weights for policy 0, policy_version 4953 (0.0008) [2023-09-12 15:54:03,894][89374] Updated weights for policy 0, policy_version 4963 (0.0009) [2023-09-12 15:54:07,397][89374] Updated weights for policy 0, policy_version 4973 (0.0009) [2023-09-12 15:54:10,714][89374] Updated weights for policy 0, policy_version 4983 (0.0008) [2023-09-12 15:54:14,089][89374] Updated weights for policy 0, policy_version 4993 (0.0009) [2023-09-12 15:54:17,511][89374] Updated weights for policy 0, policy_version 5003 (0.0009) [2023-09-12 15:54:20,893][89374] Updated weights for policy 0, policy_version 5013 (0.0008) [2023-09-12 15:54:24,215][89374] Updated weights for policy 0, policy_version 5023 (0.0009) [2023-09-12 15:54:27,673][89374] Updated weights for policy 0, policy_version 5033 (0.0009) [2023-09-12 15:54:30,907][89374] Updated weights for policy 0, policy_version 5043 (0.0009) [2023-09-12 15:54:33,433][89374] Updated weights for policy 0, policy_version 5053 (0.0009) [2023-09-12 15:54:35,985][89374] Updated weights for policy 0, policy_version 5063 (0.0009) [2023-09-12 15:54:38,487][89374] Updated weights for policy 0, policy_version 5073 (0.0009) [2023-09-12 15:54:41,000][89374] Updated weights for policy 0, policy_version 5083 (0.0009) [2023-09-12 15:54:43,477][89374] Updated weights for policy 0, policy_version 5093 (0.0008) [2023-09-12 15:54:46,006][89374] Updated weights for policy 0, policy_version 5103 (0.0009) [2023-09-12 15:54:48,518][89374] Updated weights for policy 0, policy_version 5113 (0.0009) [2023-09-12 15:54:50,960][89374] Updated weights for policy 0, policy_version 5123 (0.0008) [2023-09-12 15:54:53,455][89374] Updated weights for policy 0, policy_version 5133 (0.0009) [2023-09-12 15:54:55,892][89374] Updated weights for policy 0, policy_version 5143 (0.0009) [2023-09-12 15:54:58,404][89374] Updated weights for policy 0, policy_version 5153 (0.0009) [2023-09-12 15:55:00,933][89374] Updated weights for policy 0, policy_version 5163 (0.0009) [2023-09-12 15:55:03,407][89374] Updated weights for policy 0, policy_version 5173 (0.0009) [2023-09-12 15:55:05,881][89374] Updated weights for policy 0, policy_version 5183 (0.0008) [2023-09-12 15:55:08,996][89374] Updated weights for policy 0, policy_version 5193 (0.0008) [2023-09-12 15:55:12,375][89374] Updated weights for policy 0, policy_version 5203 (0.0009) [2023-09-12 15:55:15,722][89374] Updated weights for policy 0, policy_version 5213 (0.0009) [2023-09-12 15:55:16,785][89073] Saving /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000005216_21364736.pth... [2023-09-12 15:55:16,837][89073] Removing /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000004513_18485248.pth [2023-09-12 15:55:19,233][89374] Updated weights for policy 0, policy_version 5223 (0.0009) [2023-09-12 15:55:22,598][89374] Updated weights for policy 0, policy_version 5233 (0.0008) [2023-09-12 15:55:26,016][89374] Updated weights for policy 0, policy_version 5243 (0.0010) [2023-09-12 15:55:29,449][89374] Updated weights for policy 0, policy_version 5253 (0.0008) [2023-09-12 15:55:32,845][89374] Updated weights for policy 0, policy_version 5263 (0.0010) [2023-09-12 15:55:36,248][89374] Updated weights for policy 0, policy_version 5273 (0.0009) [2023-09-12 15:55:39,659][89374] Updated weights for policy 0, policy_version 5283 (0.0009) [2023-09-12 15:55:43,167][89374] Updated weights for policy 0, policy_version 5293 (0.0008) [2023-09-12 15:55:46,666][89374] Updated weights for policy 0, policy_version 5303 (0.0009) [2023-09-12 15:55:50,009][89374] Updated weights for policy 0, policy_version 5313 (0.0010) [2023-09-12 15:55:53,487][89374] Updated weights for policy 0, policy_version 5323 (0.0009) [2023-09-12 15:55:56,954][89374] Updated weights for policy 0, policy_version 5333 (0.0008) [2023-09-12 15:56:00,393][89374] Updated weights for policy 0, policy_version 5343 (0.0011) [2023-09-12 15:56:03,792][89374] Updated weights for policy 0, policy_version 5353 (0.0009) [2023-09-12 15:56:07,284][89374] Updated weights for policy 0, policy_version 5363 (0.0008) [2023-09-12 15:56:10,631][89374] Updated weights for policy 0, policy_version 5373 (0.0008) [2023-09-12 15:56:14,072][89374] Updated weights for policy 0, policy_version 5383 (0.0010) [2023-09-12 15:56:17,388][89374] Updated weights for policy 0, policy_version 5393 (0.0009) [2023-09-12 15:56:20,941][89374] Updated weights for policy 0, policy_version 5403 (0.0009) [2023-09-12 15:56:24,925][89374] Updated weights for policy 0, policy_version 5413 (0.0013) [2023-09-12 15:56:28,649][89374] Updated weights for policy 0, policy_version 5423 (0.0010) [2023-09-12 15:56:32,123][89374] Updated weights for policy 0, policy_version 5433 (0.0009) [2023-09-12 15:56:35,425][89374] Updated weights for policy 0, policy_version 5443 (0.0009) [2023-09-12 15:56:38,796][89374] Updated weights for policy 0, policy_version 5453 (0.0009) [2023-09-12 15:56:42,205][89374] Updated weights for policy 0, policy_version 5463 (0.0010) [2023-09-12 15:56:45,477][89374] Updated weights for policy 0, policy_version 5473 (0.0009) [2023-09-12 15:56:48,932][89374] Updated weights for policy 0, policy_version 5483 (0.0009) [2023-09-12 15:56:52,294][89374] Updated weights for policy 0, policy_version 5493 (0.0008) [2023-09-12 15:56:55,643][89374] Updated weights for policy 0, policy_version 5503 (0.0009) [2023-09-12 15:56:58,979][89374] Updated weights for policy 0, policy_version 5513 (0.0008) [2023-09-12 15:57:02,269][89374] Updated weights for policy 0, policy_version 5523 (0.0008) [2023-09-12 15:57:05,617][89374] Updated weights for policy 0, policy_version 5533 (0.0009) [2023-09-12 15:57:08,968][89374] Updated weights for policy 0, policy_version 5543 (0.0008) [2023-09-12 15:57:12,211][89374] Updated weights for policy 0, policy_version 5553 (0.0008) [2023-09-12 15:57:15,611][89374] Updated weights for policy 0, policy_version 5563 (0.0009) [2023-09-12 15:57:16,784][89073] Saving /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000005566_22798336.pth... [2023-09-12 15:57:16,837][89073] Removing /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000004827_19771392.pth [2023-09-12 15:57:18,902][89374] Updated weights for policy 0, policy_version 5573 (0.0008) [2023-09-12 15:57:22,175][89374] Updated weights for policy 0, policy_version 5583 (0.0009) [2023-09-12 15:57:25,511][89374] Updated weights for policy 0, policy_version 5593 (0.0008) [2023-09-12 15:57:28,944][89374] Updated weights for policy 0, policy_version 5603 (0.0008) [2023-09-12 15:57:32,238][89374] Updated weights for policy 0, policy_version 5613 (0.0009) [2023-09-12 15:57:35,637][89374] Updated weights for policy 0, policy_version 5623 (0.0009) [2023-09-12 15:57:39,035][89374] Updated weights for policy 0, policy_version 5633 (0.0009) [2023-09-12 15:57:42,333][89374] Updated weights for policy 0, policy_version 5643 (0.0008) [2023-09-12 15:57:45,637][89374] Updated weights for policy 0, policy_version 5653 (0.0008) [2023-09-12 15:57:48,982][89374] Updated weights for policy 0, policy_version 5663 (0.0009) [2023-09-12 15:57:52,344][89374] Updated weights for policy 0, policy_version 5673 (0.0009) [2023-09-12 15:57:55,730][89374] Updated weights for policy 0, policy_version 5683 (0.0008) [2023-09-12 15:57:58,577][89374] Updated weights for policy 0, policy_version 5693 (0.0009) [2023-09-12 15:58:00,993][89374] Updated weights for policy 0, policy_version 5703 (0.0009) [2023-09-12 15:58:03,398][89374] Updated weights for policy 0, policy_version 5713 (0.0008) [2023-09-12 15:58:05,815][89374] Updated weights for policy 0, policy_version 5723 (0.0008) [2023-09-12 15:58:08,222][89374] Updated weights for policy 0, policy_version 5733 (0.0009) [2023-09-12 15:58:10,561][89374] Updated weights for policy 0, policy_version 5743 (0.0008) [2023-09-12 15:58:12,951][89374] Updated weights for policy 0, policy_version 5753 (0.0009) [2023-09-12 15:58:15,367][89374] Updated weights for policy 0, policy_version 5763 (0.0008) [2023-09-12 15:58:17,752][89374] Updated weights for policy 0, policy_version 5773 (0.0009) [2023-09-12 15:58:20,180][89374] Updated weights for policy 0, policy_version 5783 (0.0008) [2023-09-12 15:58:22,579][89374] Updated weights for policy 0, policy_version 5793 (0.0009) [2023-09-12 15:58:24,996][89374] Updated weights for policy 0, policy_version 5803 (0.0008) [2023-09-12 15:58:27,412][89374] Updated weights for policy 0, policy_version 5813 (0.0008) [2023-09-12 15:58:29,859][89374] Updated weights for policy 0, policy_version 5823 (0.0009) [2023-09-12 15:58:32,240][89374] Updated weights for policy 0, policy_version 5833 (0.0009) [2023-09-12 15:58:35,301][89374] Updated weights for policy 0, policy_version 5843 (0.0008) [2023-09-12 15:58:38,679][89374] Updated weights for policy 0, policy_version 5853 (0.0009) [2023-09-12 15:58:41,964][89374] Updated weights for policy 0, policy_version 5863 (0.0009) [2023-09-12 15:58:45,266][89374] Updated weights for policy 0, policy_version 5873 (0.0008) [2023-09-12 15:58:48,612][89374] Updated weights for policy 0, policy_version 5883 (0.0008) [2023-09-12 15:58:51,954][89374] Updated weights for policy 0, policy_version 5893 (0.0008) [2023-09-12 15:58:55,287][89374] Updated weights for policy 0, policy_version 5903 (0.0008) [2023-09-12 15:58:58,705][89374] Updated weights for policy 0, policy_version 5913 (0.0009) [2023-09-12 15:59:02,011][89374] Updated weights for policy 0, policy_version 5923 (0.0008) [2023-09-12 15:59:05,347][89374] Updated weights for policy 0, policy_version 5933 (0.0008) [2023-09-12 15:59:08,653][89374] Updated weights for policy 0, policy_version 5943 (0.0008) [2023-09-12 15:59:11,909][89374] Updated weights for policy 0, policy_version 5953 (0.0008) [2023-09-12 15:59:15,302][89374] Updated weights for policy 0, policy_version 5963 (0.0008) [2023-09-12 15:59:16,782][89073] Saving /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000005967_24440832.pth... [2023-09-12 15:59:16,835][89073] Removing /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000005216_21364736.pth [2023-09-12 15:59:18,605][89374] Updated weights for policy 0, policy_version 5973 (0.0008) [2023-09-12 15:59:22,004][89374] Updated weights for policy 0, policy_version 5983 (0.0009) [2023-09-12 15:59:25,438][89374] Updated weights for policy 0, policy_version 5993 (0.0008) [2023-09-12 15:59:28,741][89374] Updated weights for policy 0, policy_version 6003 (0.0009) [2023-09-12 15:59:32,135][89374] Updated weights for policy 0, policy_version 6013 (0.0009) [2023-09-12 15:59:35,536][89374] Updated weights for policy 0, policy_version 6023 (0.0009) [2023-09-12 15:59:38,922][89374] Updated weights for policy 0, policy_version 6033 (0.0009) [2023-09-12 15:59:42,275][89374] Updated weights for policy 0, policy_version 6043 (0.0009) [2023-09-12 15:59:45,649][89374] Updated weights for policy 0, policy_version 6053 (0.0009) [2023-09-12 15:59:49,011][89374] Updated weights for policy 0, policy_version 6063 (0.0008) [2023-09-12 15:59:52,339][89374] Updated weights for policy 0, policy_version 6073 (0.0009) [2023-09-12 15:59:55,656][89374] Updated weights for policy 0, policy_version 6083 (0.0008) [2023-09-12 15:59:59,029][89374] Updated weights for policy 0, policy_version 6093 (0.0008) [2023-09-12 16:00:02,365][89374] Updated weights for policy 0, policy_version 6103 (0.0009) [2023-09-12 16:00:05,807][89374] Updated weights for policy 0, policy_version 6113 (0.0008) [2023-09-12 16:00:09,222][89374] Updated weights for policy 0, policy_version 6123 (0.0010) [2023-09-12 16:00:12,579][89374] Updated weights for policy 0, policy_version 6133 (0.0008) [2023-09-12 16:00:15,949][89374] Updated weights for policy 0, policy_version 6143 (0.0008) [2023-09-12 16:00:19,318][89374] Updated weights for policy 0, policy_version 6153 (0.0009) [2023-09-12 16:00:22,665][89374] Updated weights for policy 0, policy_version 6163 (0.0009) [2023-09-12 16:00:26,060][89374] Updated weights for policy 0, policy_version 6173 (0.0009) [2023-09-12 16:00:29,396][89374] Updated weights for policy 0, policy_version 6183 (0.0009) [2023-09-12 16:00:32,737][89374] Updated weights for policy 0, policy_version 6193 (0.0008) [2023-09-12 16:00:36,024][89374] Updated weights for policy 0, policy_version 6203 (0.0009) [2023-09-12 16:00:39,361][89374] Updated weights for policy 0, policy_version 6213 (0.0009) [2023-09-12 16:00:42,694][89374] Updated weights for policy 0, policy_version 6223 (0.0008) [2023-09-12 16:00:46,015][89374] Updated weights for policy 0, policy_version 6233 (0.0008) [2023-09-12 16:00:49,434][89374] Updated weights for policy 0, policy_version 6243 (0.0009) [2023-09-12 16:00:52,807][89374] Updated weights for policy 0, policy_version 6253 (0.0009) [2023-09-12 16:00:56,168][89374] Updated weights for policy 0, policy_version 6263 (0.0009) [2023-09-12 16:00:59,539][89374] Updated weights for policy 0, policy_version 6273 (0.0008) [2023-09-12 16:01:02,877][89374] Updated weights for policy 0, policy_version 6283 (0.0009) [2023-09-12 16:01:06,220][89374] Updated weights for policy 0, policy_version 6293 (0.0008) [2023-09-12 16:01:09,537][89374] Updated weights for policy 0, policy_version 6303 (0.0008) [2023-09-12 16:01:12,942][89374] Updated weights for policy 0, policy_version 6313 (0.0009) [2023-09-12 16:01:16,437][89374] Updated weights for policy 0, policy_version 6323 (0.0008) [2023-09-12 16:01:16,784][89073] Saving /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000006324_25903104.pth... [2023-09-12 16:01:16,835][89073] Removing /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000005566_22798336.pth [2023-09-12 16:01:19,733][89374] Updated weights for policy 0, policy_version 6333 (0.0009) [2023-09-12 16:01:22,717][89374] Updated weights for policy 0, policy_version 6343 (0.0009) [2023-09-12 16:01:25,132][89374] Updated weights for policy 0, policy_version 6353 (0.0008) [2023-09-12 16:01:27,601][89374] Updated weights for policy 0, policy_version 6363 (0.0008) [2023-09-12 16:01:29,982][89374] Updated weights for policy 0, policy_version 6373 (0.0009) [2023-09-12 16:01:32,493][89374] Updated weights for policy 0, policy_version 6383 (0.0009) [2023-09-12 16:01:34,961][89374] Updated weights for policy 0, policy_version 6393 (0.0009) [2023-09-12 16:01:37,367][89374] Updated weights for policy 0, policy_version 6403 (0.0009) [2023-09-12 16:01:39,831][89374] Updated weights for policy 0, policy_version 6413 (0.0009) [2023-09-12 16:01:42,200][89374] Updated weights for policy 0, policy_version 6423 (0.0009) [2023-09-12 16:01:44,660][89374] Updated weights for policy 0, policy_version 6433 (0.0008) [2023-09-12 16:01:47,094][89374] Updated weights for policy 0, policy_version 6443 (0.0008) [2023-09-12 16:01:49,491][89374] Updated weights for policy 0, policy_version 6453 (0.0008) [2023-09-12 16:01:51,942][89374] Updated weights for policy 0, policy_version 6463 (0.0009) [2023-09-12 16:01:54,365][89374] Updated weights for policy 0, policy_version 6473 (0.0008) [2023-09-12 16:01:56,798][89374] Updated weights for policy 0, policy_version 6483 (0.0009) [2023-09-12 16:01:59,209][89374] Updated weights for policy 0, policy_version 6493 (0.0009) [2023-09-12 16:02:02,521][89374] Updated weights for policy 0, policy_version 6503 (0.0009) [2023-09-12 16:02:05,899][89374] Updated weights for policy 0, policy_version 6513 (0.0008) [2023-09-12 16:02:09,391][89374] Updated weights for policy 0, policy_version 6523 (0.0009) [2023-09-12 16:02:12,829][89374] Updated weights for policy 0, policy_version 6533 (0.0009) [2023-09-12 16:02:16,290][89374] Updated weights for policy 0, policy_version 6543 (0.0009) [2023-09-12 16:02:19,624][89374] Updated weights for policy 0, policy_version 6553 (0.0008) [2023-09-12 16:02:23,063][89374] Updated weights for policy 0, policy_version 6563 (0.0010) [2023-09-12 16:02:26,564][89374] Updated weights for policy 0, policy_version 6573 (0.0009) [2023-09-12 16:02:29,868][89374] Updated weights for policy 0, policy_version 6583 (0.0008) [2023-09-12 16:02:33,236][89374] Updated weights for policy 0, policy_version 6593 (0.0009) [2023-09-12 16:02:36,608][89374] Updated weights for policy 0, policy_version 6603 (0.0008) [2023-09-12 16:02:39,870][89374] Updated weights for policy 0, policy_version 6613 (0.0009) [2023-09-12 16:02:43,280][89374] Updated weights for policy 0, policy_version 6623 (0.0009) [2023-09-12 16:02:46,640][89374] Updated weights for policy 0, policy_version 6633 (0.0009) [2023-09-12 16:02:50,024][89374] Updated weights for policy 0, policy_version 6643 (0.0008) [2023-09-12 16:02:53,416][89374] Updated weights for policy 0, policy_version 6653 (0.0009) [2023-09-12 16:02:56,686][89374] Updated weights for policy 0, policy_version 6663 (0.0009) [2023-09-12 16:03:00,017][89374] Updated weights for policy 0, policy_version 6673 (0.0009) [2023-09-12 16:03:03,454][89374] Updated weights for policy 0, policy_version 6683 (0.0010) [2023-09-12 16:03:06,797][89374] Updated weights for policy 0, policy_version 6693 (0.0008) [2023-09-12 16:03:10,062][89374] Updated weights for policy 0, policy_version 6703 (0.0009) [2023-09-12 16:03:13,347][89374] Updated weights for policy 0, policy_version 6713 (0.0008) [2023-09-12 16:03:16,766][89374] Updated weights for policy 0, policy_version 6723 (0.0008) [2023-09-12 16:03:16,785][89073] Saving /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000006723_27537408.pth... [2023-09-12 16:03:16,840][89073] Removing /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000005967_24440832.pth [2023-09-12 16:03:20,029][89374] Updated weights for policy 0, policy_version 6733 (0.0008) [2023-09-12 16:03:23,432][89374] Updated weights for policy 0, policy_version 6743 (0.0008) [2023-09-12 16:03:26,890][89374] Updated weights for policy 0, policy_version 6753 (0.0009) [2023-09-12 16:03:30,265][89374] Updated weights for policy 0, policy_version 6763 (0.0009) [2023-09-12 16:03:33,627][89374] Updated weights for policy 0, policy_version 6773 (0.0008) [2023-09-12 16:03:36,970][89374] Updated weights for policy 0, policy_version 6783 (0.0009) [2023-09-12 16:03:40,327][89374] Updated weights for policy 0, policy_version 6793 (0.0009) [2023-09-12 16:03:43,694][89374] Updated weights for policy 0, policy_version 6803 (0.0008) [2023-09-12 16:03:47,123][89374] Updated weights for policy 0, policy_version 6813 (0.0009) [2023-09-12 16:03:50,497][89374] Updated weights for policy 0, policy_version 6823 (0.0008) [2023-09-12 16:03:53,923][89374] Updated weights for policy 0, policy_version 6833 (0.0009) [2023-09-12 16:03:57,246][89374] Updated weights for policy 0, policy_version 6843 (0.0009) [2023-09-12 16:04:00,687][89374] Updated weights for policy 0, policy_version 6853 (0.0009) [2023-09-12 16:04:04,090][89374] Updated weights for policy 0, policy_version 6863 (0.0008) [2023-09-12 16:04:07,524][89374] Updated weights for policy 0, policy_version 6873 (0.0009) [2023-09-12 16:04:11,000][89374] Updated weights for policy 0, policy_version 6883 (0.0009) [2023-09-12 16:04:14,298][89374] Updated weights for policy 0, policy_version 6893 (0.0008) [2023-09-12 16:04:17,801][89374] Updated weights for policy 0, policy_version 6903 (0.0009) [2023-09-12 16:04:21,186][89374] Updated weights for policy 0, policy_version 6913 (0.0009) [2023-09-12 16:04:24,858][89374] Updated weights for policy 0, policy_version 6923 (0.0009) [2023-09-12 16:04:28,667][89374] Updated weights for policy 0, policy_version 6933 (0.0011) [2023-09-12 16:04:32,122][89374] Updated weights for policy 0, policy_version 6943 (0.0010) [2023-09-12 16:04:35,633][89374] Updated weights for policy 0, policy_version 6953 (0.0009) [2023-09-12 16:04:39,090][89374] Updated weights for policy 0, policy_version 6963 (0.0010) [2023-09-12 16:04:42,551][89374] Updated weights for policy 0, policy_version 6973 (0.0009) [2023-09-12 16:04:45,989][89374] Updated weights for policy 0, policy_version 6983 (0.0009) [2023-09-12 16:04:49,327][89374] Updated weights for policy 0, policy_version 6993 (0.0009) [2023-09-12 16:04:51,813][89374] Updated weights for policy 0, policy_version 7003 (0.0009) [2023-09-12 16:04:54,320][89374] Updated weights for policy 0, policy_version 7013 (0.0009) [2023-09-12 16:04:56,878][89374] Updated weights for policy 0, policy_version 7023 (0.0010) [2023-09-12 16:04:59,406][89374] Updated weights for policy 0, policy_version 7033 (0.0008) [2023-09-12 16:05:01,910][89374] Updated weights for policy 0, policy_version 7043 (0.0009) [2023-09-12 16:05:04,392][89374] Updated weights for policy 0, policy_version 7053 (0.0009) [2023-09-12 16:05:06,908][89374] Updated weights for policy 0, policy_version 7063 (0.0009) [2023-09-12 16:05:09,418][89374] Updated weights for policy 0, policy_version 7073 (0.0009) [2023-09-12 16:05:11,991][89374] Updated weights for policy 0, policy_version 7083 (0.0009) [2023-09-12 16:05:14,445][89374] Updated weights for policy 0, policy_version 7093 (0.0008) [2023-09-12 16:05:16,782][89073] Saving /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000007102_29089792.pth... [2023-09-12 16:05:16,843][89073] Removing /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000006324_25903104.pth [2023-09-12 16:05:16,930][89374] Updated weights for policy 0, policy_version 7103 (0.0008) [2023-09-12 16:05:19,448][89374] Updated weights for policy 0, policy_version 7113 (0.0009) [2023-09-12 16:05:21,970][89374] Updated weights for policy 0, policy_version 7123 (0.0008) [2023-09-12 16:05:24,499][89374] Updated weights for policy 0, policy_version 7133 (0.0008) [2023-09-12 16:05:27,664][89374] Updated weights for policy 0, policy_version 7143 (0.0008) [2023-09-12 16:05:31,181][89374] Updated weights for policy 0, policy_version 7153 (0.0009) [2023-09-12 16:05:34,653][89374] Updated weights for policy 0, policy_version 7163 (0.0008) [2023-09-12 16:05:38,006][89374] Updated weights for policy 0, policy_version 7173 (0.0009) [2023-09-12 16:05:41,540][89374] Updated weights for policy 0, policy_version 7183 (0.0009) [2023-09-12 16:05:44,949][89374] Updated weights for policy 0, policy_version 7193 (0.0010) [2023-09-12 16:05:48,443][89374] Updated weights for policy 0, policy_version 7203 (0.0010) [2023-09-12 16:05:51,910][89374] Updated weights for policy 0, policy_version 7213 (0.0010) [2023-09-12 16:05:55,352][89374] Updated weights for policy 0, policy_version 7223 (0.0009) [2023-09-12 16:05:58,774][89374] Updated weights for policy 0, policy_version 7233 (0.0009) [2023-09-12 16:06:02,224][89374] Updated weights for policy 0, policy_version 7243 (0.0010) [2023-09-12 16:06:05,681][89374] Updated weights for policy 0, policy_version 7253 (0.0010) [2023-09-12 16:06:09,195][89374] Updated weights for policy 0, policy_version 7263 (0.0009) [2023-09-12 16:06:12,624][89374] Updated weights for policy 0, policy_version 7273 (0.0009) [2023-09-12 16:06:16,049][89374] Updated weights for policy 0, policy_version 7283 (0.0009) [2023-09-12 16:06:19,531][89374] Updated weights for policy 0, policy_version 7293 (0.0009) [2023-09-12 16:06:23,051][89374] Updated weights for policy 0, policy_version 7303 (0.0009) [2023-09-12 16:06:26,503][89374] Updated weights for policy 0, policy_version 7313 (0.0009) [2023-09-12 16:06:29,982][89374] Updated weights for policy 0, policy_version 7323 (0.0009) [2023-09-12 16:06:33,468][89374] Updated weights for policy 0, policy_version 7333 (0.0008) [2023-09-12 16:06:36,965][89374] Updated weights for policy 0, policy_version 7343 (0.0009) [2023-09-12 16:06:40,472][89374] Updated weights for policy 0, policy_version 7353 (0.0010) [2023-09-12 16:06:44,026][89374] Updated weights for policy 0, policy_version 7363 (0.0009) [2023-09-12 16:06:47,486][89374] Updated weights for policy 0, policy_version 7373 (0.0009) [2023-09-12 16:06:50,952][89374] Updated weights for policy 0, policy_version 7383 (0.0009) [2023-09-12 16:06:54,423][89374] Updated weights for policy 0, policy_version 7393 (0.0009) [2023-09-12 16:06:57,900][89374] Updated weights for policy 0, policy_version 7403 (0.0010) [2023-09-12 16:07:01,492][89374] Updated weights for policy 0, policy_version 7413 (0.0011) [2023-09-12 16:07:04,907][89374] Updated weights for policy 0, policy_version 7423 (0.0009) [2023-09-12 16:07:08,383][89374] Updated weights for policy 0, policy_version 7433 (0.0009) [2023-09-12 16:07:11,788][89374] Updated weights for policy 0, policy_version 7443 (0.0009) [2023-09-12 16:07:15,200][89374] Updated weights for policy 0, policy_version 7453 (0.0009) [2023-09-12 16:07:16,783][89073] Saving /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000007457_30543872.pth... [2023-09-12 16:07:16,838][89073] Removing /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000006723_27537408.pth [2023-09-12 16:07:18,688][89374] Updated weights for policy 0, policy_version 7463 (0.0010) [2023-09-12 16:07:22,202][89374] Updated weights for policy 0, policy_version 7473 (0.0009) [2023-09-12 16:07:25,685][89374] Updated weights for policy 0, policy_version 7483 (0.0009) [2023-09-12 16:07:29,144][89374] Updated weights for policy 0, policy_version 7493 (0.0008) [2023-09-12 16:07:32,521][89374] Updated weights for policy 0, policy_version 7503 (0.0010) [2023-09-12 16:07:36,011][89374] Updated weights for policy 0, policy_version 7513 (0.0010) [2023-09-12 16:07:39,397][89374] Updated weights for policy 0, policy_version 7523 (0.0009) [2023-09-12 16:07:43,012][89374] Updated weights for policy 0, policy_version 7533 (0.0009) [2023-09-12 16:07:46,479][89374] Updated weights for policy 0, policy_version 7543 (0.0009) [2023-09-12 16:07:49,962][89374] Updated weights for policy 0, policy_version 7553 (0.0009) [2023-09-12 16:07:53,409][89374] Updated weights for policy 0, policy_version 7563 (0.0009) [2023-09-12 16:07:56,894][89374] Updated weights for policy 0, policy_version 7573 (0.0009) [2023-09-12 16:08:00,345][89374] Updated weights for policy 0, policy_version 7583 (0.0010) [2023-09-12 16:08:03,862][89374] Updated weights for policy 0, policy_version 7593 (0.0010) [2023-09-12 16:08:07,269][89374] Updated weights for policy 0, policy_version 7603 (0.0009) [2023-09-12 16:08:10,636][89374] Updated weights for policy 0, policy_version 7613 (0.0009) [2023-09-12 16:08:14,120][89374] Updated weights for policy 0, policy_version 7623 (0.0008) [2023-09-12 16:08:17,102][89374] Updated weights for policy 0, policy_version 7633 (0.0009) [2023-09-12 16:08:19,653][89374] Updated weights for policy 0, policy_version 7643 (0.0009) [2023-09-12 16:08:22,171][89374] Updated weights for policy 0, policy_version 7653 (0.0009) [2023-09-12 16:08:24,705][89374] Updated weights for policy 0, policy_version 7663 (0.0009) [2023-09-12 16:08:27,152][89374] Updated weights for policy 0, policy_version 7673 (0.0008) [2023-09-12 16:08:29,656][89374] Updated weights for policy 0, policy_version 7683 (0.0009) [2023-09-12 16:08:32,199][89374] Updated weights for policy 0, policy_version 7693 (0.0009) [2023-09-12 16:08:34,741][89374] Updated weights for policy 0, policy_version 7703 (0.0009) [2023-09-12 16:08:37,235][89374] Updated weights for policy 0, policy_version 7713 (0.0009) [2023-09-12 16:08:39,930][89374] Updated weights for policy 0, policy_version 7723 (0.0017) [2023-09-12 16:08:42,517][89374] Updated weights for policy 0, policy_version 7733 (0.0009) [2023-09-12 16:08:45,025][89374] Updated weights for policy 0, policy_version 7743 (0.0009) [2023-09-12 16:08:47,524][89374] Updated weights for policy 0, policy_version 7753 (0.0009) [2023-09-12 16:08:50,959][89374] Updated weights for policy 0, policy_version 7763 (0.0009) [2023-09-12 16:08:54,347][89374] Updated weights for policy 0, policy_version 7773 (0.0009) [2023-09-12 16:08:57,760][89374] Updated weights for policy 0, policy_version 7783 (0.0009) [2023-09-12 16:09:01,287][89374] Updated weights for policy 0, policy_version 7793 (0.0010) [2023-09-12 16:09:04,770][89374] Updated weights for policy 0, policy_version 7803 (0.0009) [2023-09-12 16:09:08,297][89374] Updated weights for policy 0, policy_version 7813 (0.0010) [2023-09-12 16:09:11,765][89374] Updated weights for policy 0, policy_version 7823 (0.0009) [2023-09-12 16:09:15,179][89374] Updated weights for policy 0, policy_version 7833 (0.0009) [2023-09-12 16:09:16,781][89073] Saving /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000007837_32100352.pth... [2023-09-12 16:09:16,833][89073] Removing /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000007102_29089792.pth [2023-09-12 16:09:18,649][89374] Updated weights for policy 0, policy_version 7843 (0.0009) [2023-09-12 16:09:22,108][89374] Updated weights for policy 0, policy_version 7853 (0.0009) [2023-09-12 16:09:25,563][89374] Updated weights for policy 0, policy_version 7863 (0.0009) [2023-09-12 16:09:29,063][89374] Updated weights for policy 0, policy_version 7873 (0.0010) [2023-09-12 16:09:32,439][89374] Updated weights for policy 0, policy_version 7883 (0.0009) [2023-09-12 16:09:35,888][89374] Updated weights for policy 0, policy_version 7893 (0.0009) [2023-09-12 16:09:39,403][89374] Updated weights for policy 0, policy_version 7903 (0.0009) [2023-09-12 16:09:43,334][89374] Updated weights for policy 0, policy_version 7913 (0.0012) [2023-09-12 16:09:47,131][89374] Updated weights for policy 0, policy_version 7923 (0.0010) [2023-09-12 16:09:50,576][89374] Updated weights for policy 0, policy_version 7933 (0.0010) [2023-09-12 16:09:53,981][89374] Updated weights for policy 0, policy_version 7943 (0.0009) [2023-09-12 16:09:57,414][89374] Updated weights for policy 0, policy_version 7953 (0.0009) [2023-09-12 16:10:00,845][89374] Updated weights for policy 0, policy_version 7963 (0.0011) [2023-09-12 16:10:04,300][89374] Updated weights for policy 0, policy_version 7973 (0.0009) [2023-09-12 16:10:07,695][89374] Updated weights for policy 0, policy_version 7983 (0.0009) [2023-09-12 16:10:11,045][89374] Updated weights for policy 0, policy_version 7993 (0.0009) [2023-09-12 16:10:14,438][89374] Updated weights for policy 0, policy_version 8003 (0.0008) [2023-09-12 16:10:17,732][89374] Updated weights for policy 0, policy_version 8013 (0.0010) [2023-09-12 16:10:21,096][89374] Updated weights for policy 0, policy_version 8023 (0.0008) [2023-09-12 16:10:24,471][89374] Updated weights for policy 0, policy_version 8033 (0.0008) [2023-09-12 16:10:27,879][89374] Updated weights for policy 0, policy_version 8043 (0.0010) [2023-09-12 16:10:31,179][89374] Updated weights for policy 0, policy_version 8053 (0.0008) [2023-09-12 16:10:34,602][89374] Updated weights for policy 0, policy_version 8063 (0.0008) [2023-09-12 16:10:37,998][89374] Updated weights for policy 0, policy_version 8073 (0.0009) [2023-09-12 16:10:41,361][89374] Updated weights for policy 0, policy_version 8083 (0.0008) [2023-09-12 16:10:44,738][89374] Updated weights for policy 0, policy_version 8093 (0.0008) [2023-09-12 16:10:48,178][89374] Updated weights for policy 0, policy_version 8103 (0.0009) [2023-09-12 16:10:51,579][89374] Updated weights for policy 0, policy_version 8113 (0.0008) [2023-09-12 16:10:54,988][89374] Updated weights for policy 0, policy_version 8123 (0.0008) [2023-09-12 16:10:58,378][89374] Updated weights for policy 0, policy_version 8133 (0.0008) [2023-09-12 16:11:01,721][89374] Updated weights for policy 0, policy_version 8143 (0.0008) [2023-09-12 16:11:05,157][89374] Updated weights for policy 0, policy_version 8153 (0.0008) [2023-09-12 16:11:08,607][89374] Updated weights for policy 0, policy_version 8163 (0.0008) [2023-09-12 16:11:12,026][89374] Updated weights for policy 0, policy_version 8173 (0.0009) [2023-09-12 16:11:15,496][89374] Updated weights for policy 0, policy_version 8183 (0.0009) [2023-09-12 16:11:16,845][89073] Saving /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000008187_33533952.pth... [2023-09-12 16:11:16,901][89073] Removing /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000007457_30543872.pth [2023-09-12 16:11:18,885][89374] Updated weights for policy 0, policy_version 8193 (0.0008) [2023-09-12 16:11:22,245][89374] Updated weights for policy 0, policy_version 8203 (0.0009) [2023-09-12 16:11:25,655][89374] Updated weights for policy 0, policy_version 8213 (0.0009) [2023-09-12 16:11:29,012][89374] Updated weights for policy 0, policy_version 8223 (0.0008) [2023-09-12 16:11:32,394][89374] Updated weights for policy 0, policy_version 8233 (0.0009) [2023-09-12 16:11:35,829][89374] Updated weights for policy 0, policy_version 8243 (0.0008) [2023-09-12 16:11:38,636][89374] Updated weights for policy 0, policy_version 8253 (0.0008) [2023-09-12 16:11:41,070][89374] Updated weights for policy 0, policy_version 8263 (0.0008) [2023-09-12 16:11:43,545][89374] Updated weights for policy 0, policy_version 8273 (0.0008) [2023-09-12 16:11:46,027][89374] Updated weights for policy 0, policy_version 8283 (0.0009) [2023-09-12 16:11:48,492][89374] Updated weights for policy 0, policy_version 8293 (0.0008) [2023-09-12 16:11:50,963][89374] Updated weights for policy 0, policy_version 8303 (0.0008) [2023-09-12 16:11:53,428][89374] Updated weights for policy 0, policy_version 8313 (0.0009) [2023-09-12 16:11:55,832][89374] Updated weights for policy 0, policy_version 8323 (0.0009) [2023-09-12 16:11:58,294][89374] Updated weights for policy 0, policy_version 8333 (0.0009) [2023-09-12 16:12:00,671][89374] Updated weights for policy 0, policy_version 8343 (0.0008) [2023-09-12 16:12:03,126][89374] Updated weights for policy 0, policy_version 8353 (0.0008) [2023-09-12 16:12:05,520][89374] Updated weights for policy 0, policy_version 8363 (0.0009) [2023-09-12 16:12:07,996][89374] Updated weights for policy 0, policy_version 8373 (0.0009) [2023-09-12 16:12:10,372][89374] Updated weights for policy 0, policy_version 8383 (0.0008) [2023-09-12 16:12:12,843][89374] Updated weights for policy 0, policy_version 8393 (0.0008) [2023-09-12 16:12:15,334][89374] Updated weights for policy 0, policy_version 8403 (0.0009) [2023-09-12 16:12:17,790][89374] Updated weights for policy 0, policy_version 8413 (0.0009) [2023-09-12 16:12:20,315][89374] Updated weights for policy 0, policy_version 8423 (0.0009) [2023-09-12 16:12:22,833][89374] Updated weights for policy 0, policy_version 8433 (0.0009) [2023-09-12 16:12:25,244][89374] Updated weights for policy 0, policy_version 8443 (0.0008) [2023-09-12 16:12:27,686][89374] Updated weights for policy 0, policy_version 8453 (0.0009) [2023-09-12 16:12:31,053][89374] Updated weights for policy 0, policy_version 8463 (0.0009) [2023-09-12 16:12:34,470][89374] Updated weights for policy 0, policy_version 8473 (0.0009) [2023-09-12 16:12:37,867][89374] Updated weights for policy 0, policy_version 8483 (0.0009) [2023-09-12 16:12:41,270][89374] Updated weights for policy 0, policy_version 8493 (0.0011) [2023-09-12 16:12:44,691][89374] Updated weights for policy 0, policy_version 8503 (0.0009) [2023-09-12 16:12:47,932][89374] Updated weights for policy 0, policy_version 8513 (0.0009) [2023-09-12 16:12:51,306][89374] Updated weights for policy 0, policy_version 8523 (0.0008) [2023-09-12 16:12:54,615][89374] Updated weights for policy 0, policy_version 8533 (0.0009) [2023-09-12 16:12:57,966][89374] Updated weights for policy 0, policy_version 8543 (0.0008) [2023-09-12 16:13:01,273][89374] Updated weights for policy 0, policy_version 8553 (0.0008) [2023-09-12 16:13:04,723][89374] Updated weights for policy 0, policy_version 8563 (0.0009) [2023-09-12 16:13:08,004][89374] Updated weights for policy 0, policy_version 8573 (0.0008) [2023-09-12 16:13:11,376][89374] Updated weights for policy 0, policy_version 8583 (0.0009) [2023-09-12 16:13:14,778][89374] Updated weights for policy 0, policy_version 8593 (0.0009) [2023-09-12 16:13:16,816][89073] Saving /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000008599_35221504.pth... [2023-09-12 16:13:16,866][89073] Removing /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000007837_32100352.pth [2023-09-12 16:13:18,140][89374] Updated weights for policy 0, policy_version 8603 (0.0008) [2023-09-12 16:13:21,577][89374] Updated weights for policy 0, policy_version 8613 (0.0009) [2023-09-12 16:13:24,963][89374] Updated weights for policy 0, policy_version 8623 (0.0009) [2023-09-12 16:13:28,265][89374] Updated weights for policy 0, policy_version 8633 (0.0008) [2023-09-12 16:13:31,641][89374] Updated weights for policy 0, policy_version 8643 (0.0009) [2023-09-12 16:13:35,046][89374] Updated weights for policy 0, policy_version 8653 (0.0008) [2023-09-12 16:13:38,427][89374] Updated weights for policy 0, policy_version 8663 (0.0010) [2023-09-12 16:13:41,796][89374] Updated weights for policy 0, policy_version 8673 (0.0008) [2023-09-12 16:13:45,168][89374] Updated weights for policy 0, policy_version 8683 (0.0009) [2023-09-12 16:13:48,512][89374] Updated weights for policy 0, policy_version 8693 (0.0008) [2023-09-12 16:13:51,854][89374] Updated weights for policy 0, policy_version 8703 (0.0009) [2023-09-12 16:13:55,255][89374] Updated weights for policy 0, policy_version 8713 (0.0009) [2023-09-12 16:13:58,670][89374] Updated weights for policy 0, policy_version 8723 (0.0009) [2023-09-12 16:14:02,050][89374] Updated weights for policy 0, policy_version 8733 (0.0008) [2023-09-12 16:14:05,457][89374] Updated weights for policy 0, policy_version 8743 (0.0008) [2023-09-12 16:14:08,742][89374] Updated weights for policy 0, policy_version 8753 (0.0008) [2023-09-12 16:14:12,094][89374] Updated weights for policy 0, policy_version 8763 (0.0008) [2023-09-12 16:14:15,507][89374] Updated weights for policy 0, policy_version 8773 (0.0008) [2023-09-12 16:14:18,921][89374] Updated weights for policy 0, policy_version 8783 (0.0008) [2023-09-12 16:14:22,308][89374] Updated weights for policy 0, policy_version 8793 (0.0009) [2023-09-12 16:14:25,695][89374] Updated weights for policy 0, policy_version 8803 (0.0009) [2023-09-12 16:14:29,095][89374] Updated weights for policy 0, policy_version 8813 (0.0009) [2023-09-12 16:14:32,602][89374] Updated weights for policy 0, policy_version 8823 (0.0009) [2023-09-12 16:14:36,035][89374] Updated weights for policy 0, policy_version 8833 (0.0010) [2023-09-12 16:14:39,473][89374] Updated weights for policy 0, policy_version 8843 (0.0009) [2023-09-12 16:14:42,848][89374] Updated weights for policy 0, policy_version 8853 (0.0009) [2023-09-12 16:14:46,189][89374] Updated weights for policy 0, policy_version 8863 (0.0008) [2023-09-12 16:14:49,605][89374] Updated weights for policy 0, policy_version 8873 (0.0008) [2023-09-12 16:14:52,969][89374] Updated weights for policy 0, policy_version 8883 (0.0009) [2023-09-12 16:14:56,348][89374] Updated weights for policy 0, policy_version 8893 (0.0008) [2023-09-12 16:14:59,788][89374] Updated weights for policy 0, policy_version 8903 (0.0008) [2023-09-12 16:15:03,235][89374] Updated weights for policy 0, policy_version 8913 (0.0008) [2023-09-12 16:15:06,436][89374] Updated weights for policy 0, policy_version 8923 (0.0008) [2023-09-12 16:15:09,807][89374] Updated weights for policy 0, policy_version 8933 (0.0008) [2023-09-12 16:15:13,166][89374] Updated weights for policy 0, policy_version 8943 (0.0009) [2023-09-12 16:15:16,607][89374] Updated weights for policy 0, policy_version 8953 (0.0009) [2023-09-12 16:15:16,781][89073] Saving /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000008953_36671488.pth... [2023-09-12 16:15:16,833][89073] Removing /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000008187_33533952.pth [2023-09-12 16:15:19,116][89374] Updated weights for policy 0, policy_version 8963 (0.0008) [2023-09-12 16:15:21,472][89374] Updated weights for policy 0, policy_version 8973 (0.0009) [2023-09-12 16:15:23,919][89374] Updated weights for policy 0, policy_version 8983 (0.0008) [2023-09-12 16:15:26,359][89374] Updated weights for policy 0, policy_version 8993 (0.0009) [2023-09-12 16:15:28,819][89374] Updated weights for policy 0, policy_version 9003 (0.0009) [2023-09-12 16:15:31,284][89374] Updated weights for policy 0, policy_version 9013 (0.0009) [2023-09-12 16:15:33,693][89374] Updated weights for policy 0, policy_version 9023 (0.0010) [2023-09-12 16:15:36,219][89374] Updated weights for policy 0, policy_version 9033 (0.0009) [2023-09-12 16:15:38,720][89374] Updated weights for policy 0, policy_version 9043 (0.0009) [2023-09-12 16:15:41,229][89374] Updated weights for policy 0, policy_version 9053 (0.0009) [2023-09-12 16:15:43,778][89374] Updated weights for policy 0, policy_version 9063 (0.0009) [2023-09-12 16:15:46,218][89374] Updated weights for policy 0, policy_version 9073 (0.0009) [2023-09-12 16:15:48,627][89374] Updated weights for policy 0, policy_version 9083 (0.0009) [2023-09-12 16:15:51,057][89374] Updated weights for policy 0, policy_version 9093 (0.0009) [2023-09-12 16:15:53,513][89374] Updated weights for policy 0, policy_version 9103 (0.0009) [2023-09-12 16:15:55,882][89374] Updated weights for policy 0, policy_version 9113 (0.0009) [2023-09-12 16:15:58,419][89374] Updated weights for policy 0, policy_version 9123 (0.0009) [2023-09-12 16:16:01,766][89374] Updated weights for policy 0, policy_version 9133 (0.0009) [2023-09-12 16:16:05,159][89374] Updated weights for policy 0, policy_version 9143 (0.0010) [2023-09-12 16:16:08,481][89374] Updated weights for policy 0, policy_version 9153 (0.0009) [2023-09-12 16:16:11,882][89374] Updated weights for policy 0, policy_version 9163 (0.0009) [2023-09-12 16:16:15,193][89374] Updated weights for policy 0, policy_version 9173 (0.0008) [2023-09-12 16:16:18,503][89374] Updated weights for policy 0, policy_version 9183 (0.0009) [2023-09-12 16:16:21,912][89374] Updated weights for policy 0, policy_version 9193 (0.0008) [2023-09-12 16:16:25,353][89374] Updated weights for policy 0, policy_version 9203 (0.0009) [2023-09-12 16:16:28,739][89374] Updated weights for policy 0, policy_version 9213 (0.0009) [2023-09-12 16:16:32,094][89374] Updated weights for policy 0, policy_version 9223 (0.0009) [2023-09-12 16:16:35,482][89374] Updated weights for policy 0, policy_version 9233 (0.0009) [2023-09-12 16:16:38,837][89374] Updated weights for policy 0, policy_version 9243 (0.0008) [2023-09-12 16:16:42,228][89374] Updated weights for policy 0, policy_version 9253 (0.0009) [2023-09-12 16:16:45,660][89374] Updated weights for policy 0, policy_version 9263 (0.0009) [2023-09-12 16:16:49,263][89374] Updated weights for policy 0, policy_version 9273 (0.0009) [2023-09-12 16:16:52,663][89374] Updated weights for policy 0, policy_version 9283 (0.0009) [2023-09-12 16:16:56,044][89374] Updated weights for policy 0, policy_version 9293 (0.0009) [2023-09-12 16:16:59,444][89374] Updated weights for policy 0, policy_version 9303 (0.0008) [2023-09-12 16:17:02,809][89374] Updated weights for policy 0, policy_version 9313 (0.0008) [2023-09-12 16:17:06,218][89374] Updated weights for policy 0, policy_version 9323 (0.0008) [2023-09-12 16:17:09,585][89374] Updated weights for policy 0, policy_version 9333 (0.0008) [2023-09-12 16:17:12,917][89374] Updated weights for policy 0, policy_version 9343 (0.0008) [2023-09-12 16:17:16,331][89374] Updated weights for policy 0, policy_version 9353 (0.0008) [2023-09-12 16:17:16,781][89073] Saving /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000009354_38313984.pth... [2023-09-12 16:17:16,833][89073] Removing /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000008599_35221504.pth [2023-09-12 16:17:19,748][89374] Updated weights for policy 0, policy_version 9363 (0.0009) [2023-09-12 16:17:23,143][89374] Updated weights for policy 0, policy_version 9373 (0.0008) [2023-09-12 16:17:26,432][89374] Updated weights for policy 0, policy_version 9383 (0.0008) [2023-09-12 16:17:29,691][89374] Updated weights for policy 0, policy_version 9393 (0.0008) [2023-09-12 16:17:33,096][89374] Updated weights for policy 0, policy_version 9403 (0.0009) [2023-09-12 16:17:36,518][89374] Updated weights for policy 0, policy_version 9413 (0.0008) [2023-09-12 16:17:39,874][89374] Updated weights for policy 0, policy_version 9423 (0.0009) [2023-09-12 16:17:43,275][89374] Updated weights for policy 0, policy_version 9433 (0.0008) [2023-09-12 16:17:46,662][89374] Updated weights for policy 0, policy_version 9443 (0.0008) [2023-09-12 16:17:50,037][89374] Updated weights for policy 0, policy_version 9453 (0.0009) [2023-09-12 16:17:53,510][89374] Updated weights for policy 0, policy_version 9463 (0.0008) [2023-09-12 16:17:56,928][89374] Updated weights for policy 0, policy_version 9473 (0.0008) [2023-09-12 16:18:00,270][89374] Updated weights for policy 0, policy_version 9483 (0.0008) [2023-09-12 16:18:03,748][89374] Updated weights for policy 0, policy_version 9493 (0.0008) [2023-09-12 16:18:07,111][89374] Updated weights for policy 0, policy_version 9503 (0.0008) [2023-09-12 16:18:10,495][89374] Updated weights for policy 0, policy_version 9513 (0.0009) [2023-09-12 16:18:13,936][89374] Updated weights for policy 0, policy_version 9523 (0.0009) [2023-09-12 16:18:17,339][89374] Updated weights for policy 0, policy_version 9533 (0.0009) [2023-09-12 16:18:20,667][89374] Updated weights for policy 0, policy_version 9543 (0.0008) [2023-09-12 16:18:24,055][89374] Updated weights for policy 0, policy_version 9553 (0.0009) [2023-09-12 16:18:27,403][89374] Updated weights for policy 0, policy_version 9563 (0.0009) [2023-09-12 16:18:30,766][89374] Updated weights for policy 0, policy_version 9573 (0.0008) [2023-09-12 16:18:34,072][89374] Updated weights for policy 0, policy_version 9583 (0.0008) [2023-09-12 16:18:37,526][89374] Updated weights for policy 0, policy_version 9593 (0.0009) [2023-09-12 16:18:40,869][89374] Updated weights for policy 0, policy_version 9603 (0.0009) [2023-09-12 16:18:44,339][89374] Updated weights for policy 0, policy_version 9613 (0.0008) [2023-09-12 16:18:47,703][89374] Updated weights for policy 0, policy_version 9623 (0.0008) [2023-09-12 16:18:50,183][89374] Updated weights for policy 0, policy_version 9633 (0.0008) [2023-09-12 16:18:52,674][89374] Updated weights for policy 0, policy_version 9643 (0.0009) [2023-09-12 16:18:55,207][89374] Updated weights for policy 0, policy_version 9653 (0.0009) [2023-09-12 16:18:57,628][89374] Updated weights for policy 0, policy_version 9663 (0.0008) [2023-09-12 16:19:00,139][89374] Updated weights for policy 0, policy_version 9673 (0.0009) [2023-09-12 16:19:02,634][89374] Updated weights for policy 0, policy_version 9683 (0.0009) [2023-09-12 16:19:05,062][89374] Updated weights for policy 0, policy_version 9693 (0.0008) [2023-09-12 16:19:07,550][89374] Updated weights for policy 0, policy_version 9703 (0.0008) [2023-09-12 16:19:10,073][89374] Updated weights for policy 0, policy_version 9713 (0.0009) [2023-09-12 16:19:12,502][89374] Updated weights for policy 0, policy_version 9723 (0.0009) [2023-09-12 16:19:14,919][89374] Updated weights for policy 0, policy_version 9733 (0.0008) [2023-09-12 16:19:16,781][89073] Saving /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000009740_39895040.pth... [2023-09-12 16:19:16,835][89073] Removing /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000008953_36671488.pth [2023-09-12 16:19:17,363][89374] Updated weights for policy 0, policy_version 9743 (0.0009) [2023-09-12 16:19:19,829][89374] Updated weights for policy 0, policy_version 9753 (0.0008) [2023-09-12 16:19:23,053][89374] Updated weights for policy 0, policy_version 9763 (0.0008) [2023-09-12 16:19:26,385][89374] Updated weights for policy 0, policy_version 9773 (0.0009) [2023-09-12 16:19:29,781][89374] Updated weights for policy 0, policy_version 9783 (0.0009) [2023-09-12 16:19:33,151][89374] Updated weights for policy 0, policy_version 9793 (0.0009) [2023-09-12 16:19:36,480][89374] Updated weights for policy 0, policy_version 9803 (0.0008) [2023-09-12 16:19:39,762][89374] Updated weights for policy 0, policy_version 9813 (0.0008) [2023-09-12 16:19:43,200][89374] Updated weights for policy 0, policy_version 9823 (0.0008) [2023-09-12 16:19:46,666][89374] Updated weights for policy 0, policy_version 9833 (0.0008) [2023-09-12 16:19:50,021][89374] Updated weights for policy 0, policy_version 9843 (0.0008) [2023-09-12 16:19:53,419][89374] Updated weights for policy 0, policy_version 9853 (0.0008) [2023-09-12 16:19:56,843][89374] Updated weights for policy 0, policy_version 9863 (0.0008) [2023-09-12 16:20:00,270][89374] Updated weights for policy 0, policy_version 9873 (0.0009) [2023-09-12 16:20:03,620][89374] Updated weights for policy 0, policy_version 9883 (0.0010) [2023-09-12 16:20:07,098][89374] Updated weights for policy 0, policy_version 9893 (0.0009) [2023-09-12 16:20:10,499][89374] Updated weights for policy 0, policy_version 9903 (0.0009) [2023-09-12 16:20:13,899][89374] Updated weights for policy 0, policy_version 9913 (0.0008) [2023-09-12 16:20:17,374][89374] Updated weights for policy 0, policy_version 9923 (0.0009) [2023-09-12 16:20:20,774][89374] Updated weights for policy 0, policy_version 9933 (0.0008) [2023-09-12 16:20:24,174][89374] Updated weights for policy 0, policy_version 9943 (0.0009) [2023-09-12 16:20:27,524][89374] Updated weights for policy 0, policy_version 9953 (0.0008) [2023-09-12 16:20:30,845][89374] Updated weights for policy 0, policy_version 9963 (0.0008) [2023-09-12 16:20:34,208][89374] Updated weights for policy 0, policy_version 9973 (0.0008) [2023-09-12 16:20:37,482][89374] Updated weights for policy 0, policy_version 9983 (0.0009) [2023-09-12 16:20:40,940][89374] Updated weights for policy 0, policy_version 9993 (0.0009) [2023-09-12 16:20:44,242][89374] Updated weights for policy 0, policy_version 10003 (0.0008) [2023-09-12 16:20:47,743][89374] Updated weights for policy 0, policy_version 10013 (0.0008) [2023-09-12 16:20:51,083][89374] Updated weights for policy 0, policy_version 10023 (0.0009) [2023-09-12 16:20:54,483][89374] Updated weights for policy 0, policy_version 10033 (0.0008) [2023-09-12 16:20:57,823][89374] Updated weights for policy 0, policy_version 10043 (0.0009) [2023-09-12 16:21:01,142][89374] Updated weights for policy 0, policy_version 10053 (0.0008) [2023-09-12 16:21:04,555][89374] Updated weights for policy 0, policy_version 10063 (0.0010) [2023-09-12 16:21:07,898][89374] Updated weights for policy 0, policy_version 10073 (0.0008) [2023-09-12 16:21:11,271][89374] Updated weights for policy 0, policy_version 10083 (0.0009) [2023-09-12 16:21:14,679][89374] Updated weights for policy 0, policy_version 10093 (0.0009) [2023-09-12 16:21:16,783][89073] Saving /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000010099_41365504.pth... [2023-09-12 16:21:16,844][89073] Removing /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000009354_38313984.pth [2023-09-12 16:21:17,998][89374] Updated weights for policy 0, policy_version 10103 (0.0010) [2023-09-12 16:21:21,368][89374] Updated weights for policy 0, policy_version 10113 (0.0009) [2023-09-12 16:21:24,738][89374] Updated weights for policy 0, policy_version 10123 (0.0009) [2023-09-12 16:21:28,107][89374] Updated weights for policy 0, policy_version 10133 (0.0009) [2023-09-12 16:21:31,480][89374] Updated weights for policy 0, policy_version 10143 (0.0009) [2023-09-12 16:21:34,979][89374] Updated weights for policy 0, policy_version 10153 (0.0009) [2023-09-12 16:21:38,298][89374] Updated weights for policy 0, policy_version 10163 (0.0008) [2023-09-12 16:21:41,725][89374] Updated weights for policy 0, policy_version 10173 (0.0009) [2023-09-12 16:21:45,122][89374] Updated weights for policy 0, policy_version 10183 (0.0009) [2023-09-12 16:21:48,472][89374] Updated weights for policy 0, policy_version 10193 (0.0009) [2023-09-12 16:21:51,868][89374] Updated weights for policy 0, policy_version 10203 (0.0008) [2023-09-12 16:21:55,199][89374] Updated weights for policy 0, policy_version 10213 (0.0009) [2023-09-12 16:21:58,584][89374] Updated weights for policy 0, policy_version 10223 (0.0008) [2023-09-12 16:22:01,918][89374] Updated weights for policy 0, policy_version 10233 (0.0009) [2023-09-12 16:22:05,396][89374] Updated weights for policy 0, policy_version 10243 (0.0008) [2023-09-12 16:22:08,758][89374] Updated weights for policy 0, policy_version 10253 (0.0009) [2023-09-12 16:22:11,390][89374] Updated weights for policy 0, policy_version 10263 (0.0009) [2023-09-12 16:22:13,797][89374] Updated weights for policy 0, policy_version 10273 (0.0009) [2023-09-12 16:22:16,255][89374] Updated weights for policy 0, policy_version 10283 (0.0009) [2023-09-12 16:22:18,694][89374] Updated weights for policy 0, policy_version 10293 (0.0008) [2023-09-12 16:22:21,151][89374] Updated weights for policy 0, policy_version 10303 (0.0009) [2023-09-12 16:22:23,623][89374] Updated weights for policy 0, policy_version 10313 (0.0009) [2023-09-12 16:22:26,084][89374] Updated weights for policy 0, policy_version 10323 (0.0008) [2023-09-12 16:22:28,485][89374] Updated weights for policy 0, policy_version 10333 (0.0008) [2023-09-12 16:22:30,891][89374] Updated weights for policy 0, policy_version 10343 (0.0008) [2023-09-12 16:22:33,392][89374] Updated weights for policy 0, policy_version 10353 (0.0008) [2023-09-12 16:22:35,842][89374] Updated weights for policy 0, policy_version 10363 (0.0008) [2023-09-12 16:22:38,260][89374] Updated weights for policy 0, policy_version 10373 (0.0008) [2023-09-12 16:22:40,698][89374] Updated weights for policy 0, policy_version 10383 (0.0008) [2023-09-12 16:22:43,168][89374] Updated weights for policy 0, policy_version 10393 (0.0009) [2023-09-12 16:22:45,638][89374] Updated weights for policy 0, policy_version 10403 (0.0008) [2023-09-12 16:22:48,000][89374] Updated weights for policy 0, policy_version 10413 (0.0008) [2023-09-12 16:22:50,492][89374] Updated weights for policy 0, policy_version 10423 (0.0009) [2023-09-12 16:22:52,918][89374] Updated weights for policy 0, policy_version 10433 (0.0009) [2023-09-12 16:22:55,736][89374] Updated weights for policy 0, policy_version 10443 (0.0009) [2023-09-12 16:22:59,063][89374] Updated weights for policy 0, policy_version 10453 (0.0009) [2023-09-12 16:23:02,542][89374] Updated weights for policy 0, policy_version 10463 (0.0009) [2023-09-12 16:23:05,864][89374] Updated weights for policy 0, policy_version 10473 (0.0008) [2023-09-12 16:23:09,257][89374] Updated weights for policy 0, policy_version 10483 (0.0009) [2023-09-12 16:23:12,577][89374] Updated weights for policy 0, policy_version 10493 (0.0009) [2023-09-12 16:23:16,007][89374] Updated weights for policy 0, policy_version 10503 (0.0008) [2023-09-12 16:23:16,782][89073] Saving /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000010505_43028480.pth... [2023-09-12 16:23:16,834][89073] Removing /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000009740_39895040.pth [2023-09-12 16:23:19,407][89374] Updated weights for policy 0, policy_version 10513 (0.0009) [2023-09-12 16:23:22,851][89374] Updated weights for policy 0, policy_version 10523 (0.0009) [2023-09-12 16:23:26,323][89374] Updated weights for policy 0, policy_version 10533 (0.0010) [2023-09-12 16:23:29,673][89374] Updated weights for policy 0, policy_version 10543 (0.0010) [2023-09-12 16:23:33,046][89374] Updated weights for policy 0, policy_version 10553 (0.0009) [2023-09-12 16:23:36,486][89374] Updated weights for policy 0, policy_version 10563 (0.0009) [2023-09-12 16:23:39,936][89374] Updated weights for policy 0, policy_version 10573 (0.0008) [2023-09-12 16:23:43,199][89374] Updated weights for policy 0, policy_version 10583 (0.0009) [2023-09-12 16:23:46,691][89374] Updated weights for policy 0, policy_version 10593 (0.0009) [2023-09-12 16:23:50,073][89374] Updated weights for policy 0, policy_version 10603 (0.0009) [2023-09-12 16:23:53,477][89374] Updated weights for policy 0, policy_version 10613 (0.0009) [2023-09-12 16:23:56,928][89374] Updated weights for policy 0, policy_version 10623 (0.0008) [2023-09-12 16:24:00,341][89374] Updated weights for policy 0, policy_version 10633 (0.0009) [2023-09-12 16:24:03,712][89374] Updated weights for policy 0, policy_version 10643 (0.0009) [2023-09-12 16:24:07,106][89374] Updated weights for policy 0, policy_version 10653 (0.0009) [2023-09-12 16:24:10,421][89374] Updated weights for policy 0, policy_version 10663 (0.0009) [2023-09-12 16:24:13,925][89374] Updated weights for policy 0, policy_version 10673 (0.0009) [2023-09-12 16:24:17,341][89374] Updated weights for policy 0, policy_version 10683 (0.0009) [2023-09-12 16:24:20,650][89374] Updated weights for policy 0, policy_version 10693 (0.0008) [2023-09-12 16:24:24,036][89374] Updated weights for policy 0, policy_version 10703 (0.0008) [2023-09-12 16:24:27,367][89374] Updated weights for policy 0, policy_version 10713 (0.0009) [2023-09-12 16:24:30,776][89374] Updated weights for policy 0, policy_version 10723 (0.0009) [2023-09-12 16:24:34,279][89374] Updated weights for policy 0, policy_version 10733 (0.0009) [2023-09-12 16:24:37,617][89374] Updated weights for policy 0, policy_version 10743 (0.0009) [2023-09-12 16:24:40,932][89374] Updated weights for policy 0, policy_version 10753 (0.0008) [2023-09-12 16:24:44,309][89374] Updated weights for policy 0, policy_version 10763 (0.0009) [2023-09-12 16:24:47,697][89374] Updated weights for policy 0, policy_version 10773 (0.0009) [2023-09-12 16:24:51,097][89374] Updated weights for policy 0, policy_version 10783 (0.0010) [2023-09-12 16:24:54,407][89374] Updated weights for policy 0, policy_version 10793 (0.0008) [2023-09-12 16:24:57,755][89374] Updated weights for policy 0, policy_version 10803 (0.0009) [2023-09-12 16:25:01,085][89374] Updated weights for policy 0, policy_version 10813 (0.0009) [2023-09-12 16:25:04,493][89374] Updated weights for policy 0, policy_version 10823 (0.0008) [2023-09-12 16:25:07,841][89374] Updated weights for policy 0, policy_version 10833 (0.0008) [2023-09-12 16:25:11,299][89374] Updated weights for policy 0, policy_version 10843 (0.0008) [2023-09-12 16:25:14,692][89374] Updated weights for policy 0, policy_version 10853 (0.0008) [2023-09-12 16:25:16,819][89073] Saving /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000010859_44478464.pth... [2023-09-12 16:25:16,870][89073] Removing /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000010099_41365504.pth [2023-09-12 16:25:18,153][89374] Updated weights for policy 0, policy_version 10863 (0.0008) [2023-09-12 16:25:21,460][89374] Updated weights for policy 0, policy_version 10873 (0.0009) [2023-09-12 16:25:24,860][89374] Updated weights for policy 0, policy_version 10883 (0.0008) [2023-09-12 16:25:28,200][89374] Updated weights for policy 0, policy_version 10893 (0.0009) [2023-09-12 16:25:31,625][89374] Updated weights for policy 0, policy_version 10903 (0.0009) [2023-09-12 16:25:35,019][89374] Updated weights for policy 0, policy_version 10913 (0.0008) [2023-09-12 16:25:38,361][89374] Updated weights for policy 0, policy_version 10923 (0.0009) [2023-09-12 16:25:41,718][89374] Updated weights for policy 0, policy_version 10933 (0.0009) [2023-09-12 16:25:44,682][89374] Updated weights for policy 0, policy_version 10943 (0.0009) [2023-09-12 16:25:47,205][89374] Updated weights for policy 0, policy_version 10953 (0.0009) [2023-09-12 16:25:49,634][89374] Updated weights for policy 0, policy_version 10963 (0.0009) [2023-09-12 16:25:52,110][89374] Updated weights for policy 0, policy_version 10973 (0.0009) [2023-09-12 16:25:54,532][89374] Updated weights for policy 0, policy_version 10983 (0.0008) [2023-09-12 16:25:56,944][89374] Updated weights for policy 0, policy_version 10993 (0.0009) [2023-09-12 16:25:59,468][89374] Updated weights for policy 0, policy_version 11003 (0.0009) [2023-09-12 16:26:01,979][89374] Updated weights for policy 0, policy_version 11013 (0.0008) [2023-09-12 16:26:04,324][89374] Updated weights for policy 0, policy_version 11023 (0.0008) [2023-09-12 16:26:06,771][89374] Updated weights for policy 0, policy_version 11033 (0.0009) [2023-09-12 16:26:09,231][89374] Updated weights for policy 0, policy_version 11043 (0.0009) [2023-09-12 16:26:11,613][89374] Updated weights for policy 0, policy_version 11053 (0.0008) [2023-09-12 16:26:14,011][89374] Updated weights for policy 0, policy_version 11063 (0.0009) [2023-09-12 16:26:16,497][89374] Updated weights for policy 0, policy_version 11073 (0.0010) [2023-09-12 16:26:18,930][89374] Updated weights for policy 0, policy_version 11083 (0.0008) [2023-09-12 16:26:21,392][89374] Updated weights for policy 0, policy_version 11093 (0.0008) [2023-09-12 16:26:23,811][89374] Updated weights for policy 0, policy_version 11103 (0.0009) [2023-09-12 16:26:26,881][89374] Updated weights for policy 0, policy_version 11113 (0.0009) [2023-09-12 16:26:30,278][89374] Updated weights for policy 0, policy_version 11123 (0.0008) [2023-09-12 16:26:33,625][89374] Updated weights for policy 0, policy_version 11133 (0.0008) [2023-09-12 16:26:36,997][89374] Updated weights for policy 0, policy_version 11143 (0.0008) [2023-09-12 16:26:40,416][89374] Updated weights for policy 0, policy_version 11153 (0.0009) [2023-09-12 16:26:43,761][89374] Updated weights for policy 0, policy_version 11163 (0.0008) [2023-09-12 16:26:47,194][89374] Updated weights for policy 0, policy_version 11173 (0.0008) [2023-09-12 16:26:50,530][89374] Updated weights for policy 0, policy_version 11183 (0.0009) [2023-09-12 16:26:53,981][89374] Updated weights for policy 0, policy_version 11193 (0.0009) [2023-09-12 16:26:57,349][89374] Updated weights for policy 0, policy_version 11203 (0.0009) [2023-09-12 16:27:00,784][89374] Updated weights for policy 0, policy_version 11213 (0.0009) [2023-09-12 16:27:04,141][89374] Updated weights for policy 0, policy_version 11223 (0.0009) [2023-09-12 16:27:07,542][89374] Updated weights for policy 0, policy_version 11233 (0.0009) [2023-09-12 16:27:10,865][89374] Updated weights for policy 0, policy_version 11243 (0.0008) [2023-09-12 16:27:14,263][89374] Updated weights for policy 0, policy_version 11253 (0.0008) [2023-09-12 16:27:16,782][89073] Saving /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000011260_46120960.pth... [2023-09-12 16:27:16,838][89073] Removing /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000010505_43028480.pth [2023-09-12 16:27:17,628][89374] Updated weights for policy 0, policy_version 11263 (0.0008) [2023-09-12 16:27:20,992][89374] Updated weights for policy 0, policy_version 11273 (0.0008) [2023-09-12 16:27:24,412][89374] Updated weights for policy 0, policy_version 11283 (0.0008) [2023-09-12 16:27:27,698][89374] Updated weights for policy 0, policy_version 11293 (0.0008) [2023-09-12 16:27:31,029][89374] Updated weights for policy 0, policy_version 11303 (0.0009) [2023-09-12 16:27:34,474][89374] Updated weights for policy 0, policy_version 11313 (0.0008) [2023-09-12 16:27:37,789][89374] Updated weights for policy 0, policy_version 11323 (0.0008) [2023-09-12 16:27:41,095][89374] Updated weights for policy 0, policy_version 11333 (0.0009) [2023-09-12 16:27:44,552][89374] Updated weights for policy 0, policy_version 11343 (0.0009) [2023-09-12 16:27:47,959][89374] Updated weights for policy 0, policy_version 11353 (0.0008) [2023-09-12 16:27:51,184][89374] Updated weights for policy 0, policy_version 11363 (0.0008) [2023-09-12 16:27:54,576][89374] Updated weights for policy 0, policy_version 11373 (0.0009) [2023-09-12 16:27:57,940][89374] Updated weights for policy 0, policy_version 11383 (0.0009) [2023-09-12 16:28:01,245][89374] Updated weights for policy 0, policy_version 11393 (0.0008) [2023-09-12 16:28:04,716][89374] Updated weights for policy 0, policy_version 11403 (0.0008) [2023-09-12 16:28:07,946][89374] Updated weights for policy 0, policy_version 11413 (0.0009) [2023-09-12 16:28:11,285][89374] Updated weights for policy 0, policy_version 11423 (0.0008) [2023-09-12 16:28:14,701][89374] Updated weights for policy 0, policy_version 11433 (0.0009) [2023-09-12 16:28:18,058][89374] Updated weights for policy 0, policy_version 11443 (0.0009) [2023-09-12 16:28:21,304][89374] Updated weights for policy 0, policy_version 11453 (0.0008) [2023-09-12 16:28:24,718][89374] Updated weights for policy 0, policy_version 11463 (0.0008) [2023-09-12 16:28:28,078][89374] Updated weights for policy 0, policy_version 11473 (0.0009) [2023-09-12 16:28:31,494][89374] Updated weights for policy 0, policy_version 11483 (0.0008) [2023-09-12 16:28:34,904][89374] Updated weights for policy 0, policy_version 11493 (0.0009) [2023-09-12 16:28:38,185][89374] Updated weights for policy 0, policy_version 11503 (0.0009) [2023-09-12 16:28:41,571][89374] Updated weights for policy 0, policy_version 11513 (0.0008) [2023-09-12 16:28:44,920][89374] Updated weights for policy 0, policy_version 11523 (0.0009) [2023-09-12 16:28:48,297][89374] Updated weights for policy 0, policy_version 11533 (0.0008) [2023-09-12 16:28:51,617][89374] Updated weights for policy 0, policy_version 11543 (0.0008) [2023-09-12 16:28:55,061][89374] Updated weights for policy 0, policy_version 11553 (0.0009) [2023-09-12 16:28:58,428][89374] Updated weights for policy 0, policy_version 11563 (0.0009) [2023-09-12 16:29:01,770][89374] Updated weights for policy 0, policy_version 11573 (0.0009) [2023-09-12 16:29:01,777][89073] Saving new best policy, reward=0.982! [2023-09-12 16:29:05,201][89374] Updated weights for policy 0, policy_version 11583 (0.0009) [2023-09-12 16:29:08,567][89374] Updated weights for policy 0, policy_version 11593 (0.0009) [2023-09-12 16:29:12,006][89374] Updated weights for policy 0, policy_version 11603 (0.0009) [2023-09-12 16:29:15,152][89374] Updated weights for policy 0, policy_version 11613 (0.0008) [2023-09-12 16:29:16,822][89073] Saving /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000011620_47595520.pth... [2023-09-12 16:29:16,873][89073] Removing /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000010859_44478464.pth [2023-09-12 16:29:17,572][89374] Updated weights for policy 0, policy_version 11623 (0.0009) [2023-09-12 16:29:19,994][89374] Updated weights for policy 0, policy_version 11633 (0.0008) [2023-09-12 16:29:22,457][89374] Updated weights for policy 0, policy_version 11643 (0.0009) [2023-09-12 16:29:24,909][89374] Updated weights for policy 0, policy_version 11653 (0.0008) [2023-09-12 16:29:27,363][89374] Updated weights for policy 0, policy_version 11663 (0.0009) [2023-09-12 16:29:29,814][89374] Updated weights for policy 0, policy_version 11673 (0.0009) [2023-09-12 16:29:32,266][89374] Updated weights for policy 0, policy_version 11683 (0.0009) [2023-09-12 16:29:34,771][89374] Updated weights for policy 0, policy_version 11693 (0.0008) [2023-09-12 16:29:37,212][89374] Updated weights for policy 0, policy_version 11703 (0.0009) [2023-09-12 16:29:39,720][89374] Updated weights for policy 0, policy_version 11713 (0.0009) [2023-09-12 16:29:42,217][89374] Updated weights for policy 0, policy_version 11723 (0.0009) [2023-09-12 16:29:44,791][89374] Updated weights for policy 0, policy_version 11733 (0.0010) [2023-09-12 16:29:48,128][89374] Updated weights for policy 0, policy_version 11743 (0.0009) [2023-09-12 16:29:51,494][89374] Updated weights for policy 0, policy_version 11753 (0.0009) [2023-09-12 16:29:54,800][89374] Updated weights for policy 0, policy_version 11763 (0.0009) [2023-09-12 16:29:58,215][89374] Updated weights for policy 0, policy_version 11773 (0.0008) [2023-09-12 16:30:01,599][89374] Updated weights for policy 0, policy_version 11783 (0.0010) [2023-09-12 16:30:04,933][89374] Updated weights for policy 0, policy_version 11793 (0.0009) [2023-09-12 16:30:08,360][89374] Updated weights for policy 0, policy_version 11803 (0.0008) [2023-09-12 16:30:11,737][89374] Updated weights for policy 0, policy_version 11813 (0.0009) [2023-09-12 16:30:15,079][89374] Updated weights for policy 0, policy_version 11823 (0.0008) [2023-09-12 16:30:18,430][89374] Updated weights for policy 0, policy_version 11833 (0.0009) [2023-09-12 16:30:21,835][89374] Updated weights for policy 0, policy_version 11843 (0.0009) [2023-09-12 16:30:25,286][89374] Updated weights for policy 0, policy_version 11853 (0.0009) [2023-09-12 16:30:28,655][89374] Updated weights for policy 0, policy_version 11863 (0.0010) [2023-09-12 16:30:32,125][89374] Updated weights for policy 0, policy_version 11873 (0.0009) [2023-09-12 16:30:35,836][89374] Updated weights for policy 0, policy_version 11883 (0.0010) [2023-09-12 16:30:39,078][89374] Updated weights for policy 0, policy_version 11893 (0.0009) [2023-09-12 16:30:42,496][89374] Updated weights for policy 0, policy_version 11903 (0.0009) [2023-09-12 16:30:45,838][89374] Updated weights for policy 0, policy_version 11913 (0.0009) [2023-09-12 16:30:49,307][89374] Updated weights for policy 0, policy_version 11923 (0.0009) [2023-09-12 16:30:52,673][89374] Updated weights for policy 0, policy_version 11933 (0.0008) [2023-09-12 16:30:56,035][89374] Updated weights for policy 0, policy_version 11943 (0.0009) [2023-09-12 16:30:59,490][89374] Updated weights for policy 0, policy_version 11953 (0.0009) [2023-09-12 16:31:02,928][89374] Updated weights for policy 0, policy_version 11963 (0.0009) [2023-09-12 16:31:06,220][89374] Updated weights for policy 0, policy_version 11973 (0.0008) [2023-09-12 16:31:09,648][89374] Updated weights for policy 0, policy_version 11983 (0.0009) [2023-09-12 16:31:12,963][89374] Updated weights for policy 0, policy_version 11993 (0.0009) [2023-09-12 16:31:16,384][89374] Updated weights for policy 0, policy_version 12003 (0.0008) [2023-09-12 16:31:16,780][89073] Saving /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000012004_49168384.pth... [2023-09-12 16:31:16,826][89073] Removing /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000011260_46120960.pth [2023-09-12 16:31:19,821][89374] Updated weights for policy 0, policy_version 12013 (0.0009) [2023-09-12 16:31:23,160][89374] Updated weights for policy 0, policy_version 12023 (0.0008) [2023-09-12 16:31:26,627][89374] Updated weights for policy 0, policy_version 12033 (0.0008) [2023-09-12 16:31:29,985][89374] Updated weights for policy 0, policy_version 12043 (0.0008) [2023-09-12 16:31:33,383][89374] Updated weights for policy 0, policy_version 12053 (0.0008) [2023-09-12 16:31:36,761][89374] Updated weights for policy 0, policy_version 12063 (0.0009) [2023-09-12 16:31:40,094][89374] Updated weights for policy 0, policy_version 12073 (0.0008) [2023-09-12 16:31:43,536][89374] Updated weights for policy 0, policy_version 12083 (0.0008) [2023-09-12 16:31:47,038][89374] Updated weights for policy 0, policy_version 12093 (0.0009) [2023-09-12 16:31:50,400][89374] Updated weights for policy 0, policy_version 12103 (0.0009) [2023-09-12 16:31:53,834][89374] Updated weights for policy 0, policy_version 12113 (0.0009) [2023-09-12 16:31:57,222][89374] Updated weights for policy 0, policy_version 12123 (0.0008) [2023-09-12 16:32:00,617][89374] Updated weights for policy 0, policy_version 12133 (0.0009) [2023-09-12 16:32:03,984][89374] Updated weights for policy 0, policy_version 12143 (0.0009) [2023-09-12 16:32:07,426][89374] Updated weights for policy 0, policy_version 12153 (0.0009) [2023-09-12 16:32:10,756][89374] Updated weights for policy 0, policy_version 12163 (0.0010) [2023-09-12 16:32:14,107][89374] Updated weights for policy 0, policy_version 12173 (0.0009) [2023-09-12 16:32:17,568][89374] Updated weights for policy 0, policy_version 12183 (0.0009) [2023-09-12 16:32:20,950][89374] Updated weights for policy 0, policy_version 12193 (0.0008) [2023-09-12 16:32:24,292][89374] Updated weights for policy 0, policy_version 12203 (0.0009) [2023-09-12 16:32:26,349][89073] Stopping Batcher_0... [2023-09-12 16:32:26,349][89073] Saving /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000012209_50008064.pth... [2023-09-12 16:32:26,350][89073] Loop batcher_evt_loop terminating... [2023-09-12 16:32:26,364][89378] Stopping RolloutWorker_w1... [2023-09-12 16:32:26,364][89378] Loop rollout_proc1_evt_loop terminating... [2023-09-12 16:32:26,364][89377] Stopping RolloutWorker_w2... [2023-09-12 16:32:26,365][89377] Loop rollout_proc2_evt_loop terminating... [2023-09-12 16:32:26,365][89379] Stopping RolloutWorker_w3... [2023-09-12 16:32:26,365][89375] Stopping RolloutWorker_w0... [2023-09-12 16:32:26,366][89375] Loop rollout_proc0_evt_loop terminating... [2023-09-12 16:32:26,366][89379] Loop rollout_proc3_evt_loop terminating... [2023-09-12 16:32:26,366][89382] Stopping RolloutWorker_w6... [2023-09-12 16:32:26,366][89383] Stopping RolloutWorker_w7... [2023-09-12 16:32:26,366][89380] Stopping RolloutWorker_w4... [2023-09-12 16:32:26,366][89382] Loop rollout_proc6_evt_loop terminating... [2023-09-12 16:32:26,366][89383] Loop rollout_proc7_evt_loop terminating... [2023-09-12 16:32:26,366][89381] Stopping RolloutWorker_w5... [2023-09-12 16:32:26,367][89380] Loop rollout_proc4_evt_loop terminating... [2023-09-12 16:32:26,367][89381] Loop rollout_proc5_evt_loop terminating... [2023-09-12 16:32:26,376][89374] Weights refcount: 2 0 [2023-09-12 16:32:26,379][89374] Stopping InferenceWorker_p0-w0... [2023-09-12 16:32:26,379][89374] Loop inference_proc0-0_evt_loop terminating... [2023-09-12 16:32:26,406][89073] Removing /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000011620_47595520.pth [2023-09-12 16:32:26,413][89073] Saving /home/cogstack/Documents/optuna/environments/sample_factory/train_dir/default_experiment/checkpoint_p0/checkpoint_000012209_50008064.pth... [2023-09-12 16:32:26,473][89073] Stopping LearnerWorker_p0... [2023-09-12 16:32:26,474][89073] Loop learner_proc0_evt_loop terminating...