WARNING:torch.distributed.run:
*****************************************
Setting OMP_NUM_THREADS environment variable for each process to be 1 in default, to avoid your system being overloaded, please further tune the variable for optimal performance in your application as needed.
*****************************************
[2024-02-08 17:04:03,270] [INFO] [real_accelerator.py:158:get_accelerator] Setting ds_accelerator to cuda (auto detect)
[2024-02-08 17:04:03,270] [INFO] [real_accelerator.py:158:get_accelerator] Setting ds_accelerator to cuda (auto detect)
[2024-02-08 17:04:03,270] [INFO] [real_accelerator.py:158:get_accelerator] Setting ds_accelerator to cuda (auto detect)
[2024-02-08 17:04:03,270] [INFO] [real_accelerator.py:158:get_accelerator] Setting ds_accelerator to cuda (auto detect)
[2024-02-08 17:04:03,270] [INFO] [real_accelerator.py:158:get_accelerator] Setting ds_accelerator to cuda (auto detect)
[2024-02-08 17:04:03,271] [INFO] [real_accelerator.py:158:get_accelerator] Setting ds_accelerator to cuda (auto detect)
[2024-02-08 17:04:03,271] [INFO] [real_accelerator.py:158:get_accelerator] Setting ds_accelerator to cuda (auto detect)
[2024-02-08 17:04:03,272] [INFO] [real_accelerator.py:158:get_accelerator] Setting ds_accelerator to cuda (auto detect)
[2024-02-08 17:04:24,745] [INFO] [comm.py:637:init_distributed] cdb=None
[2024-02-08 17:04:24,750] [INFO] [comm.py:637:init_distributed] cdb=None
[2024-02-08 17:04:24,810] [INFO] [comm.py:637:init_distributed] cdb=None
[2024-02-08 17:04:24,848] [INFO] [comm.py:637:init_distributed] cdb=None
[2024-02-08 17:04:24,901] [INFO] [comm.py:637:init_distributed] cdb=None
[2024-02-08 17:04:24,902] [INFO] [comm.py:637:init_distributed] cdb=None
[2024-02-08 17:04:24,911] [INFO] [comm.py:637:init_distributed] cdb=None
[2024-02-08 17:04:24,911] [INFO] [comm.py:668:init_distributed] Initializing TorchBackend in DeepSpeed with backend nccl
[2024-02-08 17:04:24,955] [INFO] [comm.py:637:init_distributed] cdb=None
02/08/2024 
17:04:25 - WARNING - __main__ - Process rank: 5, device: cuda:5, n_gpu: 1distributed training: True, 16-bits training: False 02/08/2024 17:04:25 - WARNING - __main__ - Process rank: 0, device: cuda:0, n_gpu: 1distributed training: True, 16-bits training: False 02/08/2024 17:04:25 - INFO - __main__ - Training/evaluation parameters TrainingArguments( _n_gpu=1, adafactor=False, adam_beta1=0.9, adam_beta2=0.999, adam_epsilon=1e-08, auto_find_batch_size=False, bf16=True, bf16_full_eval=True, data_seed=None, dataloader_drop_last=False, dataloader_num_workers=8, dataloader_pin_memory=True, ddp_bucket_cap_mb=None, ddp_find_unused_parameters=None, ddp_timeout=72000, debug=[], deepspeed=/apdcephfs/share_733425/vinnylywang/jianhuipang/llama2_sft/train/deepspeed_config_bf16.json, disable_tqdm=False, do_eval=False, do_predict=False, do_train=True, eval_accumulation_steps=None, eval_delay=0, eval_steps=None, evaluation_strategy=no, fp16=False, fp16_backend=auto, fp16_full_eval=False, fp16_opt_level=O1, fsdp=[], fsdp_config={'fsdp_min_num_params': 0, 'xla': False, 'xla_fsdp_grad_ckpt': False}, fsdp_min_num_params=0, fsdp_transformer_layer_cls_to_wrap=None, full_determinism=False, gradient_accumulation_steps=8, gradient_checkpointing=True, greater_is_better=None, group_by_length=False, half_precision_backend=auto, hub_model_id=None, hub_private_repo=False, hub_strategy=every_save, hub_token=, ignore_data_skip=False, include_inputs_for_metrics=False, jit_mode_eval=False, label_names=None, label_smoothing_factor=0.0, learning_rate=2e-05, length_column_name=length, load_best_model_at_end=False, local_rank=0, log_level=passive, log_level_replica=warning, log_on_each_node=True, logging_dir=./checkpoints_ctthensft/fortranslation/ac/allm-addac-alpacanewstest17to20-ac-7b/runs/Feb08_17-04-02_ts-b102359ecb124d359c32da25fe3785b5-launcher, logging_first_step=False, logging_nan_inf_filter=True, logging_steps=1, logging_strategy=steps, lr_scheduler_type=cosine, max_grad_norm=1.0, max_steps=-1, 
metric_for_best_model=None, mp_parameters=, no_cuda=False, num_train_epochs=3.0, optim=adamw_hf, optim_args=None, output_dir=./checkpoints_ctthensft/fortranslation/ac/allm-addac-alpacanewstest17to20-ac-7b, overwrite_output_dir=True, past_index=-1, per_device_eval_batch_size=1, per_device_train_batch_size=8, prediction_loss_only=False, push_to_hub=False, push_to_hub_model_id=None, push_to_hub_organization=None, push_to_hub_token=, ray_scope=last, remove_unused_columns=True, report_to=['tensorboard'], resume_from_checkpoint=None, run_name=./checkpoints_ctthensft/fortranslation/ac/allm-addac-alpacanewstest17to20-ac-7b, save_on_each_node=False, save_steps=500, save_strategy=steps, save_total_limit=1, seed=34, sharded_ddp=[], skip_memory_metrics=True, tf32=None, torch_compile=False, torch_compile_backend=None, torch_compile_mode=None, torchdynamo=None, tpu_metrics_debug=False, tpu_num_cores=None, use_ipex=False, use_legacy_prediction_loop=False, use_mps_device=False, warmup_ratio=0.03, warmup_steps=0, weight_decay=0.0, xpu_backend=None, ) 02/08/2024 17:04:25 - INFO - __main__ - Loading dataset from file: {'train': '/apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/datasets/alpaca-gpt4/train.alpacawithnewstest17to20.addac.alphf.json', 'validation': '/apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/datasets/alpaca-gpt4/eval.addac.json'} 02/08/2024 17:04:25 - WARNING - __main__ - Process rank: 3, device: cuda:3, n_gpu: 1distributed training: True, 16-bits training: False /apdcephfs/share_733425/vinnylywang/jianhuipang/llama2_sft/envs/lib/python3.8/site-packages/datasets/load.py:2089: FutureWarning: 'use_auth_token' was deprecated in favor of 'token' in version 2.14.0 and will be removed in 3.0.0. You can remove this warning by passing 'token=None' instead. 
warnings.warn( /apdcephfs/share_733425/vinnylywang/jianhuipang/llama2_sft/envs/lib/python3.8/site-packages/datasets/load.py:2089: FutureWarning: 'use_auth_token' was deprecated in favor of 'token' in version 2.14.0 and will be removed in 3.0.0. You can remove this warning by passing 'token=None' instead. warnings.warn( /apdcephfs/share_733425/vinnylywang/jianhuipang/llama2_sft/envs/lib/python3.8/site-packages/datasets/load.py:2089: FutureWarning: 'use_auth_token' was deprecated in favor of 'token' in version 2.14.0 and will be removed in 3.0.0. You can remove this warning by passing 'token=None' instead. warnings.warn( 02/08/2024 17:04:25 - WARNING - __main__ - Process rank: 2, device: cuda:2, n_gpu: 1distributed training: True, 16-bits training: False 02/08/2024 17:04:25 - WARNING - __main__ - Process rank: 1, device: cuda:1, n_gpu: 1distributed training: True, 16-bits training: False 02/08/2024 17:04:25 - WARNING - __main__ - Process rank: 4, device: cuda:4, n_gpu: 1distributed training: True, 16-bits training: False 02/08/2024 17:04:25 - WARNING - __main__ - Process rank: 7, device: cuda:7, n_gpu: 1distributed training: True, 16-bits training: False /apdcephfs/share_733425/vinnylywang/jianhuipang/llama2_sft/envs/lib/python3.8/site-packages/datasets/load.py:2089: FutureWarning: 'use_auth_token' was deprecated in favor of 'token' in version 2.14.0 and will be removed in 3.0.0. You can remove this warning by passing 'token=None' instead. warnings.warn( /apdcephfs/share_733425/vinnylywang/jianhuipang/llama2_sft/envs/lib/python3.8/site-packages/datasets/load.py:2089: FutureWarning: 'use_auth_token' was deprecated in favor of 'token' in version 2.14.0 and will be removed in 3.0.0. You can remove this warning by passing 'token=None' instead. 
warnings.warn( /apdcephfs/share_733425/vinnylywang/jianhuipang/llama2_sft/envs/lib/python3.8/site-packages/datasets/load.py:2089: FutureWarning: 'use_auth_token' was deprecated in favor of 'token' in version 2.14.0 and will be removed in 3.0.0. You can remove this warning by passing 'token=None' instead. warnings.warn( /apdcephfs/share_733425/vinnylywang/jianhuipang/llama2_sft/envs/lib/python3.8/site-packages/datasets/load.py:2089: FutureWarning: 'use_auth_token' was deprecated in favor of 'token' in version 2.14.0 and will be removed in 3.0.0. You can remove this warning by passing 'token=None' instead. warnings.warn( 02/08/2024 17:04:25 - WARNING - __main__ - Process rank: 6, device: cuda:6, n_gpu: 1distributed training: True, 16-bits training: False /apdcephfs/share_733425/vinnylywang/jianhuipang/llama2_sft/envs/lib/python3.8/site-packages/datasets/load.py:2089: FutureWarning: 'use_auth_token' was deprecated in favor of 'token' in version 2.14.0 and will be removed in 3.0.0. You can remove this warning by passing 'token=None' instead. 
warnings.warn( Downloading data files: 0%| | 0/2 [00:00> loading configuration file /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/apdcephfs/jianhuipang/gogollm/newmodels/checkpoints_ct/ac/allm-ac-7b/config.json [INFO|configuration_utils.py:720] 2024-02-08 17:04:26,765 >> Model config LlamaConfig { "_name_or_path": "/apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/apdcephfs/jianhuipang/gogollm/newmodels/checkpoints_ct/ac/allm-ac-7b", "architectures": [ "LlamaForCausalLM" ], "bos_token_id": 1, "eos_token_id": 2, "hidden_act": "silu", "hidden_size": 4096, "initializer_range": 0.02, "intermediate_size": 11008, "max_position_embeddings": 4096, "model_type": "llama", "num_attention_heads": 32, "num_hidden_layers": 32, "num_key_value_heads": 32, "pad_token_id": 0, "pretraining_tp": 1, "rms_norm_eps": 1e-05, "rope_scaling": null, "tie_word_embeddings": false, "torch_dtype": "bfloat16", "transformers_version": "4.28.0.dev0", "use_cache": true, "vocab_size": 32002 } 02/08/2024 17:04:26 - INFO - __main__ - Tokenizer_kwargs: {'cache_dir': None, 'use_fast': True, 'revision': 'main', 'use_auth_token': None, 'model_max_length': 4096} [INFO|tokenization_utils_base.py:1801] 2024-02-08 17:04:26,771 >> loading file tokenizer.model [INFO|tokenization_utils_base.py:1801] 2024-02-08 17:04:26,771 >> loading file added_tokens.json [INFO|tokenization_utils_base.py:1801] 2024-02-08 17:04:26,771 >> loading file special_tokens_map.json [INFO|tokenization_utils_base.py:1801] 2024-02-08 17:04:26,771 >> loading file tokenizer_config.json [INFO|tokenization_utils.py:426] 2024-02-08 17:04:26,784 >> Adding [PAD] to the vocabulary [INFO|tokenization_utils.py:426] 2024-02-08 17:04:26,784 >> Adding to the vocabulary 02/08/2024 17:04:26 - INFO - __main__ - Loading checkpoints in dtype: None [INFO|modeling_utils.py:2395] 2024-02-08 17:04:26,789 >> loading weights file 
/apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/apdcephfs/jianhuipang/gogollm/newmodels/checkpoints_ct/ac/allm-ac-7b/pytorch_model.bin [INFO|modeling_utils.py:2487] 2024-02-08 17:05:43,506 >> Detected DeepSpeed ZeRO-3: activating zero.init() for this model [INFO|configuration_utils.py:575] 2024-02-08 17:05:43,511 >> Generate config GenerationConfig { "_from_model_config": true, "bos_token_id": 1, "eos_token_id": 2, "pad_token_id": 0, "transformers_version": "4.28.0.dev0" } ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:81887 [0] NCCL INFO Bootstrap : Using eth1:11.222.158.111<0> ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:81887 [0] NCCL INFO NET/Plugin : No plugin found (libnccl-net.so), using internal implementation ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:81887 [0] NCCL INFO cudaDriverVersion 11070 NCCL version 2.14.3+cuda11.7 ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:81891 [4] NCCL INFO cudaDriverVersion 11070 ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:81892 [5] NCCL INFO cudaDriverVersion 11070 ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:81891 [4] NCCL INFO Bootstrap : Using eth1:11.222.158.111<0> ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:81890 [3] NCCL INFO cudaDriverVersion 11070 ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:81893 [6] NCCL INFO cudaDriverVersion 11070 ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:81892 [5] NCCL INFO Bootstrap : Using eth1:11.222.158.111<0> ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:81888 [1] NCCL INFO cudaDriverVersion 11070 ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:81894 [7] NCCL INFO cudaDriverVersion 11070 ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:81890 [3] NCCL INFO Bootstrap : Using eth1:11.222.158.111<0> ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:81893 [6] NCCL INFO Bootstrap : Using eth1:11.222.158.111<0> ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:81888 [1] NCCL INFO Bootstrap : Using eth1:11.222.158.111<0> 
ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:81894 [7] NCCL INFO Bootstrap : Using eth1:11.222.158.111<0> ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:81891 [4] NCCL INFO NET/Plugin : No plugin found (libnccl-net.so), using internal implementation ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:81892 [5] NCCL INFO NET/Plugin : No plugin found (libnccl-net.so), using internal implementation ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:81890 [3] NCCL INFO NET/Plugin : No plugin found (libnccl-net.so), using internal implementation ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:81893 [6] NCCL INFO NET/Plugin : No plugin found (libnccl-net.so), using internal implementation ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:81888 [1] NCCL INFO NET/Plugin : No plugin found (libnccl-net.so), using internal implementation ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:81894 [7] NCCL INFO NET/Plugin : No plugin found (libnccl-net.so), using internal implementation ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:81889 [2] NCCL INFO cudaDriverVersion 11070 ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:81889 [2] NCCL INFO Bootstrap : Using eth1:11.222.158.111<0> ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO NET/IB : Using [0]mlx5_2:1/RoCE [RO]; OOB eth1:11.222.158.111<0> ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO Using network IB ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:81889 [2] NCCL INFO NET/Plugin : No plugin found (libnccl-net.so), using internal implementation ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO NET/IB : Using [0]mlx5_2:1/RoCE [RO]; OOB eth1:11.222.158.111<0> ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO Using network IB ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO NET/IB : Using [0]mlx5_2:1/RoCE [RO]; OOB eth1:11.222.158.111<0> ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] 
NCCL INFO Using network IB ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO NET/IB : Using [0]mlx5_2:1/RoCE [RO]; OOB eth1:11.222.158.111<0> ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO Using network IB ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO NET/IB : Using [0]mlx5_2:1/RoCE [RO]; OOB eth1:11.222.158.111<0> ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO Using network IB ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO NET/IB : Using [0]mlx5_2:1/RoCE [RO]; OOB eth1:11.222.158.111<0> ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO Using network IB ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO NET/IB : Using [0]mlx5_2:1/RoCE [RO]; OOB eth1:11.222.158.111<0> ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO Using network IB ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO NET/IB : Using [0]mlx5_2:1/RoCE [RO]; OOB eth1:11.222.158.111<0> ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO Using network IB ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO Setting affinity for GPU 0 to ffff,ffffffff,00000000,0000ffff,ffffffff ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO Setting affinity for GPU 5 to ffffffff,ffff0000,00000000,ffffffff,ffff0000,00000000 ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO Setting affinity for GPU 6 to ffffffff,ffff0000,00000000,ffffffff,ffff0000,00000000 ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO Setting affinity for GPU 4 to ffffffff,ffff0000,00000000,ffffffff,ffff0000,00000000 ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO Setting affinity for GPU 7 to ffffffff,ffff0000,00000000,ffffffff,ffff0000,00000000 ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO Setting affinity 
for GPU 2 to ffff,ffffffff,00000000,0000ffff,ffffffff ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO Setting affinity for GPU 3 to ffff,ffffffff,00000000,0000ffff,ffffffff ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO Setting affinity for GPU 1 to ffff,ffffffff,00000000,0000ffff,ffffffff ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO Trees [0] 2/-1/-1->1->0 [1] 2/-1/-1->1->0 [2] 2/-1/-1->1->0 [3] 2/-1/-1->1->0 [4] 2/-1/-1->1->0 [5] 2/-1/-1->1->0 [6] 2/-1/-1->1->0 [7] 2/-1/-1->1->0 [8] 2/-1/-1->1->0 [9] 2/-1/-1->1->0 [10] 2/-1/-1->1->0 [11] 2/-1/-1->1->0 [12] 2/-1/-1->1->0 [13] 2/-1/-1->1->0 [14] 2/-1/-1->1->0 [15] 2/-1/-1->1->0 [16] 2/-1/-1->1->0 [17] 2/-1/-1->1->0 [18] 2/-1/-1->1->0 [19] 2/-1/-1->1->0 [20] 2/-1/-1->1->0 [21] 2/-1/-1->1->0 [22] 2/-1/-1->1->0 [23] 2/-1/-1->1->0 ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO Channel 00/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO Channel 01/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO Trees [0] -1/-1/-1->7->6 [1] -1/-1/-1->7->6 [2] -1/-1/-1->7->6 [3] -1/-1/-1->7->6 [4] -1/-1/-1->7->6 [5] -1/-1/-1->7->6 [6] -1/-1/-1->7->6 [7] -1/-1/-1->7->6 [8] -1/-1/-1->7->6 [9] -1/-1/-1->7->6 [10] -1/-1/-1->7->6 [11] -1/-1/-1->7->6 [12] -1/-1/-1->7->6 [13] -1/-1/-1->7->6 [14] -1/-1/-1->7->6 [15] -1/-1/-1->7->6 [16] -1/-1/-1->7->6 [17] -1/-1/-1->7->6 [18] -1/-1/-1->7->6 [19] -1/-1/-1->7->6 [20] -1/-1/-1->7->6 [21] -1/-1/-1->7->6 [22] -1/-1/-1->7->6 [23] -1/-1/-1->7->6 ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO Channel 02/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO Channel 03/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO Trees [0] 7/-1/-1->6->5 [1] 7/-1/-1->6->5 [2] 7/-1/-1->6->5 [3] 7/-1/-1->6->5 [4] 7/-1/-1->6->5 [5] 
7/-1/-1->6->5 [6] 7/-1/-1->6->5 [7] 7/-1/-1->6->5 [8] 7/-1/-1->6->5 [9] 7/-1/-1->6->5 [10] 7/-1/-1->6->5 [11] 7/-1/-1->6->5 [12] 7/-1/-1->6->5 [13] 7/-1/-1->6->5 [14] 7/-1/-1->6->5 [15] 7/-1/-1->6->5 [16] 7/-1/-1->6->5 [17] 7/-1/-1->6->5 [18] 7/-1/-1->6->5 [19] 7/-1/-1->6->5 [20] 7/-1/-1->6->5 [21] 7/-1/-1->6->5 [22] 7/-1/-1->6->5 [23] 7/-1/-1->6->5 ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO Channel 04/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO Trees [0] 4/-1/-1->3->2 [1] 4/-1/-1->3->2 [2] 4/-1/-1->3->2 [3] 4/-1/-1->3->2 [4] 4/-1/-1->3->2 [5] 4/-1/-1->3->2 [6] 4/-1/-1->3->2 [7] 4/-1/-1->3->2 [8] 4/-1/-1->3->2 [9] 4/-1/-1->3->2 [10] 4/-1/-1->3->2 [11] 4/-1/-1->3->2 [12] 4/-1/-1->3->2 [13] 4/-1/-1->3->2 [14] 4/-1/-1->3->2 [15] 4/-1/-1->3->2 [16] 4/-1/-1->3->2 [17] 4/-1/-1->3->2 [18] 4/-1/-1->3->2 [19] 4/-1/-1->3->2 [20] 4/-1/-1->3->2 [21] 4/-1/-1->3->2 [22] 4/-1/-1->3->2 [23] 4/-1/-1->3->2 ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO Channel 05/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO Trees [0] 6/-1/-1->5->4 [1] 6/-1/-1->5->4 [2] 6/-1/-1->5->4 [3] 6/-1/-1->5->4 [4] 6/-1/-1->5->4 [5] 6/-1/-1->5->4 [6] 6/-1/-1->5->4 [7] 6/-1/-1->5->4 [8] 6/-1/-1->5->4 [9] 6/-1/-1->5->4 [10] 6/-1/-1->5->4 [11] 6/-1/-1->5->4 [12] 6/-1/-1->5->4 [13] 6/-1/-1->5->4 [14] 6/-1/-1->5->4 [15] 6/-1/-1->5->4 [16] 6/-1/-1->5->4 [17] 6/-1/-1->5->4 [18] 6/-1/-1->5->4 [19] 6/-1/-1->5->4 [20] 6/-1/-1->5->4 [21] 6/-1/-1->5->4 [22] 6/-1/-1->5->4 [23] 6/-1/-1->5->4 ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO Channel 06/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO Channel 07/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO Channel 08/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO Channel 09/24 : 
0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO Channel 10/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO Trees [0] 3/-1/-1->2->1 [1] 3/-1/-1->2->1 [2] 3/-1/-1->2->1 [3] 3/-1/-1->2->1 [4] 3/-1/-1->2->1 [5] 3/-1/-1->2->1 [6] 3/-1/-1->2->1 [7] 3/-1/-1->2->1 [8] 3/-1/-1->2->1 [9] 3/-1/-1->2->1 [10] 3/-1/-1->2->1 [11] 3/-1/-1->2->1 [12] 3/-1/-1->2->1 [13] 3/-1/-1->2->1 [14] 3/-1/-1->2->1 [15] 3/-1/-1->2->1 [16] 3/-1/-1->2->1 [17] 3/-1/-1->2->1 [18] 3/-1/-1->2->1 [19] 3/-1/-1->2->1 [20] 3/-1/-1->2->1 [21] 3/-1/-1->2->1 [22] 3/-1/-1->2->1 [23] 3/-1/-1->2->1 ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO Channel 11/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO Channel 12/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO Trees [0] 5/-1/-1->4->3 [1] 5/-1/-1->4->3 [2] 5/-1/-1->4->3 [3] 5/-1/-1->4->3 [4] 5/-1/-1->4->3 [5] 5/-1/-1->4->3 [6] 5/-1/-1->4->3 [7] 5/-1/-1->4->3 [8] 5/-1/-1->4->3 [9] 5/-1/-1->4->3 [10] 5/-1/-1->4->3 [11] 5/-1/-1->4->3 [12] 5/-1/-1->4->3 [13] 5/-1/-1->4->3 [14] 5/-1/-1->4->3 [15] 5/-1/-1->4->3 [16] 5/-1/-1->4->3 [17] 5/-1/-1->4->3 [18] 5/-1/-1->4->3 [19] 5/-1/-1->4->3 [20] 5/-1/-1->4->3 [21] 5/-1/-1->4->3 [22] 5/-1/-1->4->3 [23] 5/-1/-1->4->3 ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO Channel 13/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO Channel 14/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO Channel 15/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO Channel 16/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO Channel 17/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO Channel 18/24 : 0 1 2 3 4 5 6 7 
ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO Channel 19/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO Channel 20/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO Channel 21/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO Channel 22/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO Channel 23/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO Trees [0] 1/-1/-1->0->-1 [1] 1/-1/-1->0->-1 [2] 1/-1/-1->0->-1 [3] 1/-1/-1->0->-1 [4] 1/-1/-1->0->-1 [5] 1/-1/-1->0->-1 [6] 1/-1/-1->0->-1 [7] 1/-1/-1->0->-1 [8] 1/-1/-1->0->-1 [9] 1/-1/-1->0->-1 [10] 1/-1/-1->0->-1 [11] 1/-1/-1->0->-1 [12] 1/-1/-1->0->-1 [13] 1/-1/-1->0->-1 [14] 1/-1/-1->0->-1 [15] 1/-1/-1->0->-1 [16] 1/-1/-1->0->-1 [17] 1/-1/-1->0->-1 [18] 1/-1/-1->0->-1 [19] 1/-1/-1->0->-1 [20] 1/-1/-1->0->-1 [21] 1/-1/-1->0->-1 [22] 1/-1/-1->0->-1 [23] 1/-1/-1->0->-1 ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO Channel 00/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO Channel 00/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO Channel 00/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO Channel 00/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO Channel 00/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO Channel 00/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO Channel 00/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO 
Channel 00/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO Channel 01/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO Channel 01/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO Channel 01/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO Channel 01/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO Channel 01/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO Channel 01/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO Channel 01/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO Channel 01/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO Channel 02/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO Channel 02/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO Channel 02/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO Channel 02/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO Channel 02/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO Channel 02/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO Channel 02/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO Channel 02/0 : 
5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO Channel 03/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO Channel 03/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO Channel 03/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO Channel 03/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO Channel 03/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO Channel 03/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO Channel 03/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO Channel 03/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO Channel 04/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO Channel 04/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO Channel 04/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO Channel 04/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO Channel 04/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO Channel 04/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO Channel 04/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO Channel 04/0 : 5[99000] -> 
6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO Channel 05/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO Channel 05/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO Channel 05/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO Channel 05/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO Channel 05/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO Channel 05/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO Channel 05/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO Channel 05/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO Channel 06/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO Channel 06/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO Channel 06/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO Channel 06/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO Channel 06/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO Channel 06/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO Channel 06/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO Channel 06/0 : 5[99000] -> 6[cb000] via 
P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO Channel 07/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO Channel 07/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO Channel 07/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO Channel 07/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO Channel 07/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO Channel 07/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO Channel 07/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO Channel 07/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO Channel 08/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO Channel 08/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO Channel 08/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO Channel 08/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO Channel 08/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO Channel 08/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO Channel 08/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO Channel 08/0 : 5[99000] -> 6[cb000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO Channel 09/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO Channel 09/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO Channel 09/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO Channel 09/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO Channel 09/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO Channel 09/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO Channel 09/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO Channel 09/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO Channel 10/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO Channel 10/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO Channel 10/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO Channel 10/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO Channel 10/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO Channel 10/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO Channel 10/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO Channel 10/0 : 5[99000] -> 6[cb000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO Channel 11/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO Channel 11/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO Channel 11/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO Channel 11/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO Channel 11/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO Channel 11/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO Channel 11/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO Channel 11/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO Channel 12/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO Channel 12/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO Channel 12/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO Channel 12/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO Channel 12/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO Channel 12/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO Channel 12/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO Channel 12/0 : 5[99000] -> 6[cb000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO Channel 13/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO Channel 13/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO Channel 13/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO Channel 13/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO Channel 13/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO Channel 13/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO Channel 13/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO Channel 13/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO Channel 14/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO Channel 14/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO Channel 14/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO Channel 14/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO Channel 14/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO Channel 14/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO Channel 14/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO Channel 14/0 : 5[99000] -> 6[cb000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO Channel 15/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO Channel 15/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO Channel 15/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO Channel 15/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO Channel 15/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO Channel 15/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO Channel 15/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO Channel 15/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO Channel 16/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO Channel 16/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO Channel 16/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO Channel 16/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO Channel 16/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO Channel 16/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO Channel 16/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO Channel 16/0 : 5[99000] -> 6[cb000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO Channel 17/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO Channel 17/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO Channel 17/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO Channel 17/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO Channel 17/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO Channel 17/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO Channel 17/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO Channel 17/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO Channel 18/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO Channel 18/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO Channel 18/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO Channel 18/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO Channel 18/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO Channel 18/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO Channel 18/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO Channel 18/0 : 5[99000] -> 6[cb000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO Channel 19/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO Channel 19/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO Channel 19/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO Channel 19/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO Channel 19/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO Channel 19/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO Channel 19/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO Channel 19/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO Channel 20/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO Channel 20/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO Channel 20/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO Channel 20/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO Channel 20/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO Channel 20/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO Channel 20/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO Channel 20/0 : 5[99000] -> 6[cb000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO Channel 21/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO Channel 21/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO Channel 21/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO Channel 21/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO Channel 21/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO Channel 21/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO Channel 21/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO Channel 21/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO Channel 22/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO Channel 22/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO Channel 22/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO Channel 22/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO Channel 22/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO Channel 22/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO Channel 22/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO Channel 22/0 : 5[99000] -> 6[cb000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO Channel 23/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO Channel 23/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO Channel 23/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO Channel 23/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO Channel 23/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO Channel 23/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO Channel 23/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO Channel 23/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO Connected all rings ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO Connected all rings ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO Channel 00/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO Connected all rings ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO Connected all rings ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO Connected all rings ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO Connected all rings ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO Connected all rings ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO Connected all rings ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO Channel 01/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO Channel 02/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO Channel 03/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO Channel 04/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO Channel 05/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO Channel 06/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO Channel 07/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO Channel 08/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO Channel 09/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO Channel 10/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO Channel 11/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO Channel 12/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO Channel 13/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO Channel 14/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO Channel 15/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO Channel 16/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO Channel 17/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO Channel 18/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO Channel 19/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO Channel 20/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO Channel 21/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO Channel 22/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO Channel 23/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO Channel 00/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO Channel 00/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO Channel 00/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO Channel 00/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO Channel 00/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO Channel 00/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO Channel 01/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO Channel 01/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO Channel 01/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO Channel 01/0 : 5[99000] -> 4[93000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO Channel 01/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO Channel 01/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO Channel 02/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO Channel 02/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO Channel 02/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO Channel 02/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO Channel 02/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO Channel 02/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO Channel 03/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO Channel 03/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO Channel 03/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO Channel 03/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO Channel 03/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO Channel 03/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO Channel 04/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO Channel 04/0 : 1[13000] -> 0[e000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO Channel 04/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO Channel 04/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO Channel 04/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO Channel 04/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO Channel 05/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO Channel 05/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO Channel 05/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO Channel 05/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO Channel 05/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO Channel 05/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO Channel 06/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO Channel 06/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO Channel 06/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO Channel 06/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO Channel 06/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO Channel 06/0 : 6[cb000] -> 5[99000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO Channel 07/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO Channel 07/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO Channel 07/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO Channel 07/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO Channel 07/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO Channel 07/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO Channel 08/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO Channel 08/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO Channel 08/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO Channel 08/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO Channel 08/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO Channel 08/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO Channel 09/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO Channel 09/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO Channel 09/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO Channel 09/0 : 2[4b000] -> 1[13000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO Channel 09/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO Channel 09/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO Channel 10/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO Channel 10/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO Channel 10/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO Channel 10/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO Channel 10/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO Channel 10/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO Channel 11/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO Channel 11/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO Channel 11/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO Channel 11/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO Channel 11/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO Channel 11/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO Channel 12/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO Channel 12/0 : 1[13000] -> 0[e000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO Channel 12/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO Channel 12/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO Channel 12/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO Channel 12/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO Channel 13/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO Channel 13/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO Channel 13/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO Channel 13/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO Channel 13/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO Channel 13/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO Channel 14/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO Channel 14/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO Channel 14/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO Channel 14/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO Channel 14/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO Channel 14/0 : 6[cb000] -> 5[99000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO Channel 15/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO Channel 15/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO Channel 15/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO Channel 15/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO Channel 15/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO Channel 15/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO Channel 16/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO Channel 16/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO Channel 16/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO Channel 16/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO Channel 16/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO Channel 16/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO Channel 17/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO Channel 17/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO Channel 17/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO Channel 17/0 : 2[4b000] -> 1[13000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO Channel 17/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO Channel 17/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO Channel 18/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO Channel 18/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO Channel 18/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO Channel 18/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO Channel 18/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO Channel 18/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO Channel 19/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO Channel 19/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO Channel 19/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO Channel 19/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO Channel 19/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO Channel 19/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO Channel 20/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO Channel 20/0 : 1[13000] -> 0[e000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO Channel 20/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO Channel 20/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO Channel 20/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO Channel 20/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO Channel 21/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO Channel 21/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO Channel 21/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO Channel 21/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO Channel 21/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO Channel 21/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO Channel 22/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO Channel 22/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO Channel 22/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO Channel 22/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO Channel 22/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO Channel 22/0 : 6[cb000] -> 5[99000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO Channel 23/0 : 4[93000] -> 3[51000] via P2P/IPC/read
ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO Channel 23/0 : 1[13000] -> 0[e000] via P2P/IPC/read
ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO Channel 23/0 : 3[51000] -> 2[4b000] via P2P/IPC/read
ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO Channel 23/0 : 2[4b000] -> 1[13000] via P2P/IPC/read
ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO Channel 23/0 : 5[99000] -> 4[93000] via P2P/IPC/read
ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO Channel 23/0 : 6[cb000] -> 5[99000] via P2P/IPC/read
ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO Connected all trees
ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO threadThresholds 8/8/64 | 64/8/64 | 512 | 512
ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO Connected all trees
ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO 24 coll channels, 32 p2p channels, 32 p2p channels per peer
ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO threadThresholds 8/8/64 | 64/8/64 | 512 | 512
ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO 24 coll channels, 32 p2p channels, 32 p2p channels per peer
ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO Connected all trees
ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO threadThresholds 8/8/64 | 64/8/64 | 512 | 512
ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO 24 coll channels, 32 p2p channels, 32 p2p channels per peer
ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO Connected all trees
ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO threadThresholds 8/8/64 | 64/8/64 | 512 | 512
ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO 24 coll channels, 32 p2p channels, 32 p2p channels per peer
ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO Connected all trees
ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO threadThresholds 8/8/64 | 64/8/64 | 512 | 512
ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO 24 coll channels, 32 p2p channels, 32 p2p channels per peer
ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO Connected all trees
ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO threadThresholds 8/8/64 | 64/8/64 | 512 | 512
ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO 24 coll channels, 32 p2p channels, 32 p2p channels per peer
ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO Connected all trees
ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO threadThresholds 8/8/64 | 64/8/64 | 512 | 512
ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO 24 coll channels, 32 p2p channels, 32 p2p channels per peer
ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO Connected all trees
ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO threadThresholds 8/8/64 | 64/8/64 | 512 | 512
ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO 24 coll channels, 32 p2p channels, 32 p2p channels per peer
ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:82863 [2] NCCL INFO comm 0x4356a150 rank 2 nranks 8 cudaDev 2 busId 4b000 - Init COMPLETE
ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:82861 [7] NCCL INFO comm 0x425dfa40 rank 7 nranks 8 cudaDev 7 busId d0000 - Init COMPLETE
ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:82860 [1] NCCL INFO comm 0x445f1d30 rank 1 nranks 8 cudaDev 1 busId 13000 - Init COMPLETE
ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:82855 [0] NCCL INFO comm 0x43770cb0 rank 0 nranks 8 cudaDev 0 busId e000 - Init COMPLETE
ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:82859 [6] NCCL INFO comm 0x45ebd100 rank 6 nranks 8 cudaDev 6 busId cb000 - Init COMPLETE
ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:82858 [3] NCCL INFO comm 0x449480c0 rank 3 nranks 8 cudaDev 3 busId 51000 - Init COMPLETE
ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:82856 [4] NCCL INFO comm 0x44e1f700 rank 4 nranks 8 cudaDev 4 busId 93000 - Init COMPLETE
ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:82857 [5] NCCL INFO comm 0x44ea3520 rank 5 nranks 8 cudaDev 5 busId 99000 - Init COMPLETE
[2024-02-08 17:05:50,390] [INFO] [partition_parameters.py:347:__exit__] finished initializing model - num_params = 291, num_elems = 6.74B
[INFO|modeling_utils.py:3029] 2024-02-08 17:05:52,477 >> All model checkpoint weights were used when initializing LlamaForCausalLM.
[INFO|modeling_utils.py:3037] 2024-02-08 17:05:52,477 >> All the weights of LlamaForCausalLM were initialized from the model checkpoint at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/apdcephfs/jianhuipang/gogollm/newmodels/checkpoints_ct/ac/allm-ac-7b. If your task is similar to the task the model of the checkpoint was trained on, you can already use LlamaForCausalLM for predictions without further training.
[INFO|configuration_utils.py:535] 2024-02-08 17:05:52,486 >> loading configuration file /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/apdcephfs/jianhuipang/gogollm/newmodels/checkpoints_ct/ac/allm-ac-7b/generation_config.json [INFO|configuration_utils.py:575] 2024-02-08 17:05:52,486 >> Generate config GenerationConfig { "bos_token_id": 1, "do_sample": true, "eos_token_id": 2, "max_length": 4096, "pad_token_id": 0, "temperature": 0.6, "top_p": 0.9, "transformers_version": "4.28.0.dev0" } 02/08/2024 17:05:54 - INFO - __main__ - The old token as an anchor Process #0 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00000_of_00032.arrow 02/08/2024 17:05:54 - INFO - datasets.arrow_dataset - Process #0 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00000_of_00032.arrow Process #1 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00001_of_00032.arrow 02/08/2024 17:05:54 - INFO - datasets.arrow_dataset - Process #1 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00001_of_00032.arrow Process #2 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00002_of_00032.arrow 02/08/2024 17:05:54 - INFO - datasets.arrow_dataset - Process #2 will write at 
/apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00002_of_00032.arrow Process #3 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00003_of_00032.arrow 02/08/2024 17:05:54 - INFO - datasets.arrow_dataset - Process #3 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00003_of_00032.arrow Process #4 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00004_of_00032.arrow 02/08/2024 17:05:54 - INFO - datasets.arrow_dataset - Process #4 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00004_of_00032.arrow Process #5 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00005_of_00032.arrow 02/08/2024 17:05:54 - INFO - datasets.arrow_dataset - Process #5 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00005_of_00032.arrow Process #6 will write at 
/apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00006_of_00032.arrow 02/08/2024 17:05:54 - INFO - datasets.arrow_dataset - Process #6 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00006_of_00032.arrow Process #7 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00007_of_00032.arrow 02/08/2024 17:05:54 - INFO - datasets.arrow_dataset - Process #7 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00007_of_00032.arrow Process #8 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00008_of_00032.arrow 02/08/2024 17:05:54 - INFO - datasets.arrow_dataset - Process #8 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00008_of_00032.arrow Process #9 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00009_of_00032.arrow 02/08/2024 17:05:54 - INFO - datasets.arrow_dataset - Process #9 will write at 
/apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00009_of_00032.arrow Process #10 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00010_of_00032.arrow 02/08/2024 17:05:54 - INFO - datasets.arrow_dataset - Process #10 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00010_of_00032.arrow Process #11 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00011_of_00032.arrow 02/08/2024 17:05:54 - INFO - datasets.arrow_dataset - Process #11 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00011_of_00032.arrow Process #12 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00012_of_00032.arrow 02/08/2024 17:05:54 - INFO - datasets.arrow_dataset - Process #12 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00012_of_00032.arrow Process #13 will write at 
/apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00013_of_00032.arrow 02/08/2024 17:05:54 - INFO - datasets.arrow_dataset - Process #13 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00013_of_00032.arrow Process #14 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00014_of_00032.arrow 02/08/2024 17:05:54 - INFO - datasets.arrow_dataset - Process #14 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00014_of_00032.arrow Process #15 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00015_of_00032.arrow 02/08/2024 17:05:54 - INFO - datasets.arrow_dataset - Process #15 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00015_of_00032.arrow Process #16 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00016_of_00032.arrow 02/08/2024 17:05:54 - INFO - datasets.arrow_dataset - Process #16 will write at 
/apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00016_of_00032.arrow Process #17 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00017_of_00032.arrow 02/08/2024 17:05:54 - INFO - datasets.arrow_dataset - Process #17 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00017_of_00032.arrow Process #18 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00018_of_00032.arrow 02/08/2024 17:05:54 - INFO - datasets.arrow_dataset - Process #18 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00018_of_00032.arrow Process #19 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00019_of_00032.arrow 02/08/2024 17:05:54 - INFO - datasets.arrow_dataset - Process #19 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00019_of_00032.arrow Process #20 will write at 
/apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00020_of_00032.arrow 02/08/2024 17:05:54 - INFO - datasets.arrow_dataset - Process #20 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00020_of_00032.arrow Process #21 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00021_of_00032.arrow 02/08/2024 17:05:54 - INFO - datasets.arrow_dataset - Process #21 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00021_of_00032.arrow Process #22 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00022_of_00032.arrow 02/08/2024 17:05:54 - INFO - datasets.arrow_dataset - Process #22 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00022_of_00032.arrow Process #23 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00023_of_00032.arrow 02/08/2024 17:05:54 - INFO - datasets.arrow_dataset - Process #23 will write at 
/apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00023_of_00032.arrow Process #24 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00024_of_00032.arrow 02/08/2024 17:05:54 - INFO - datasets.arrow_dataset - Process #24 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00024_of_00032.arrow Process #25 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00025_of_00032.arrow 02/08/2024 17:05:54 - INFO - datasets.arrow_dataset - Process #25 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00025_of_00032.arrow Process #26 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00026_of_00032.arrow 02/08/2024 17:05:54 - INFO - datasets.arrow_dataset - Process #26 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00026_of_00032.arrow Process #27 will write at 
/apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00027_of_00032.arrow 02/08/2024 17:05:54 - INFO - datasets.arrow_dataset - Process #27 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00027_of_00032.arrow Process #28 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00028_of_00032.arrow 02/08/2024 17:05:54 - INFO - datasets.arrow_dataset - Process #28 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00028_of_00032.arrow Process #29 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00029_of_00032.arrow 02/08/2024 17:05:54 - INFO - datasets.arrow_dataset - Process #29 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00029_of_00032.arrow Process #30 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00030_of_00032.arrow 02/08/2024 17:05:54 - INFO - datasets.arrow_dataset - Process #30 will write at 
/apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00030_of_00032.arrow Process #31 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00031_of_00032.arrow 02/08/2024 17:05:54 - INFO - datasets.arrow_dataset - Process #31 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00031_of_00032.arrow Spawning 32 processes 02/08/2024 17:05:55 - INFO - datasets.arrow_dataset - Spawning 32 processes Tokenize with padding (num_proc=32): 0%| | 0/64204 [00:00> Using cuda_amp half precision backend [2024-02-08 17:06:05,162] [INFO] [logging.py:96:log_dist] [Rank 0] DeepSpeed info: version=0.11.1, git-hash=unknown, git-branch=unknown [2024-02-08 17:06:05,532] [INFO] [logging.py:96:log_dist] [Rank 0] DeepSpeed Flops Profiler Enabled: False Using /root/.cache/torch_extensions/py38_cu117 as PyTorch extensions root... Using /root/.cache/torch_extensions/py38_cu117 as PyTorch extensions root... Detected CUDA files, patching ldflags Emitting ninja build file /root/.cache/torch_extensions/py38_cu117/cpu_adam/build.ninja... Building extension module cpu_adam... Allowing ninja to set a default number of workers... (overridable by setting the environment variable MAX_JOBS=N) ninja: no work to do. Loading extension module cpu_adam... Time to load cpu_adam op: 1.3096520900726318 seconds Loading extension module cpu_adam... Time to load cpu_adam op: 1.144273281097412 seconds Using /root/.cache/torch_extensions/py38_cu117 as PyTorch extensions root... Using /root/.cache/torch_extensions/py38_cu117 as PyTorch extensions root... 
Using /root/.cache/torch_extensions/py38_cu117 as PyTorch extensions root...
Using /root/.cache/torch_extensions/py38_cu117 as PyTorch extensions root...
Using /root/.cache/torch_extensions/py38_cu117 as PyTorch extensions root...
Using /root/.cache/torch_extensions/py38_cu117 as PyTorch extensions root...
Using /root/.cache/torch_extensions/py38_cu117 as PyTorch extensions root...
Using /root/.cache/torch_extensions/py38_cu117 as PyTorch extensions root...
Detected CUDA files, patching ldflags
Emitting ninja build file /root/.cache/torch_extensions/py38_cu117/cpu_adam/build.ninja...
Building extension module cpu_adam...
Allowing ninja to set a default number of workers... (overridable by setting the environment variable MAX_JOBS=N)
ninja: no work to do.
Loading extension module cpu_adam...
Time to load cpu_adam op: 1.2942054271697998 seconds
Loading extension module cpu_adam...
Loading extension module cpu_adam...
Time to load cpu_adam op: 1.3182263374328613 seconds
Time to load cpu_adam op: 1.2661356925964355 seconds
Loading extension module cpu_adam...
Time to load cpu_adam op: 1.2585327625274658 seconds
Loading extension module cpu_adam...
Time to load cpu_adam op: 1.3013050556182861 seconds
Loading extension module cpu_adam...
Time to load cpu_adam op: 1.2945947647094727 seconds
Adam Optimizer #0 is created with AVX2 arithmetic capability.
Config: alpha=0.000020, betas=(0.900000, 0.999000), weight_decay=0.000000, adam_w=1
[2024-02-08 17:06:11,764] [INFO] [logging.py:96:log_dist] [Rank 0] Using DeepSpeed Optimizer param name adam as basic optimizer
[2024-02-08 17:06:11,764] [INFO] [logging.py:96:log_dist] [Rank 0] Removing param_group that has no 'params' in the basic Optimizer
[2024-02-08 17:06:11,785] [INFO] [logging.py:96:log_dist] [Rank 0] DeepSpeed Basic Optimizer = DeepSpeedCPUAdam
[2024-02-08 17:06:11,785] [INFO] [utils.py:56:is_zero_supported_optimizer] Checking ZeRO support for optimizer=DeepSpeedCPUAdam type=
[2024-02-08 17:06:11,785] [INFO] [logging.py:96:log_dist] [Rank 0] Creating fp16 ZeRO stage 3 optimizer, MiCS is enabled False, Hierarchical params gather False
[2024-02-08 17:06:11,785] [INFO] [logging.py:96:log_dist] [Rank 0] Creating torch.bfloat16 ZeRO stage 3 optimizer
[2024-02-08 17:06:11,898] [INFO] [utils.py:802:see_memory_usage] Stage 3 initialize beginning
[2024-02-08 17:06:11,899] [INFO] [utils.py:803:see_memory_usage] MA 0.03 GB Max_MA 0.79 GB CA 0.8 GB Max_CA 1 GB
[2024-02-08 17:06:11,899] [INFO] [utils.py:810:see_memory_usage] CPU Virtual Memory: used = 84.39 GB, percent = 8.4%
[2024-02-08 17:06:11,902] [INFO] [stage3.py:126:__init__] Reduce bucket size 16777216
[2024-02-08 17:06:11,903] [INFO] [stage3.py:127:__init__] Prefetch bucket size 15099494
[2024-02-08 17:06:12,010] [INFO] [utils.py:802:see_memory_usage] DeepSpeedZeRoOffload initialize [begin]
[2024-02-08 17:06:12,011] [INFO] [utils.py:803:see_memory_usage] MA 0.03 GB Max_MA 0.03 GB CA 0.8 GB Max_CA 1 GB
[2024-02-08 17:06:12,011] [INFO] [utils.py:810:see_memory_usage] CPU Virtual Memory: used = 84.39 GB, percent = 8.4%
Parameter Offload: Total persistent parameters: 266240 in 65 params
[2024-02-08 17:06:12,143] [INFO] [utils.py:802:see_memory_usage] DeepSpeedZeRoOffload initialize [end]
[2024-02-08 17:06:12,144] [INFO] [utils.py:803:see_memory_usage] MA 0.03 GB Max_MA 0.03 GB CA 0.8 GB Max_CA 1 GB
[2024-02-08 17:06:12,144] [INFO] [utils.py:810:see_memory_usage] CPU Virtual Memory: used = 84.4 GB, percent = 8.4%
[2024-02-08 17:06:12,254] [INFO] [utils.py:802:see_memory_usage] Before creating fp16 partitions
[2024-02-08 17:06:12,255] [INFO] [utils.py:803:see_memory_usage] MA 0.03 GB Max_MA 0.03 GB CA 0.8 GB Max_CA 1 GB
[2024-02-08 17:06:12,255] [INFO] [utils.py:810:see_memory_usage] CPU Virtual Memory: used = 84.39 GB, percent = 8.4%
[2024-02-08 17:06:13,758] [INFO] [utils.py:802:see_memory_usage] After creating fp16 partitions: 1
[2024-02-08 17:06:13,759] [INFO] [utils.py:803:see_memory_usage] MA 0.03 GB Max_MA 0.03 GB CA 0.8 GB Max_CA 1 GB
[2024-02-08 17:06:13,759] [INFO] [utils.py:810:see_memory_usage] CPU Virtual Memory: used = 104.08 GB, percent = 10.3%
[2024-02-08 17:06:13,913] [INFO] [utils.py:802:see_memory_usage] Before creating fp32 partitions
[2024-02-08 17:06:13,913] [INFO] [utils.py:803:see_memory_usage] MA 0.03 GB Max_MA 0.03 GB CA 0.8 GB Max_CA 1 GB
[2024-02-08 17:06:13,914] [INFO] [utils.py:810:see_memory_usage] CPU Virtual Memory: used = 107.39 GB, percent = 10.7%
[2024-02-08 17:06:15,600] [INFO] [utils.py:802:see_memory_usage] After creating fp32 partitions
[2024-02-08 17:06:15,600] [INFO] [utils.py:803:see_memory_usage] MA 0.03 GB Max_MA 0.03 GB CA 0.8 GB Max_CA 1 GB
[2024-02-08 17:06:15,601] [INFO] [utils.py:810:see_memory_usage] CPU Virtual Memory: used = 125.87 GB, percent = 12.5%
[2024-02-08 17:06:15,743] [INFO] [utils.py:802:see_memory_usage] Before initializing optimizer states
[2024-02-08 17:06:15,744] [INFO] [utils.py:803:see_memory_usage] MA 0.03 GB Max_MA 0.03 GB CA 0.8 GB Max_CA 1 GB
[2024-02-08 17:06:15,744] [INFO] [utils.py:810:see_memory_usage] CPU Virtual Memory: used = 129.47 GB, percent = 12.9%
[2024-02-08 17:06:22,870] [INFO] [utils.py:802:see_memory_usage] After initializing optimizer states
[2024-02-08 17:06:22,871] [INFO] [utils.py:803:see_memory_usage] MA 0.03 GB Max_MA 0.03 GB CA 0.8 GB Max_CA 1 GB
[2024-02-08 17:06:22,871] [INFO] [utils.py:810:see_memory_usage] CPU Virtual Memory: used = 211.85 GB, percent = 21.0%
[2024-02-08 17:06:23,273] [INFO] [stage3.py:459:_setup_for_real_optimizer] optimizer state initialized
[2024-02-08 17:06:25,352] [INFO] [utils.py:802:see_memory_usage] After initializing ZeRO optimizer
[2024-02-08 17:06:25,353] [INFO] [utils.py:803:see_memory_usage] MA 0.06 GB Max_MA 0.55 GB CA 1.05 GB Max_CA 1 GB
[2024-02-08 17:06:25,354] [INFO] [utils.py:810:see_memory_usage] CPU Virtual Memory: used = 225.16 GB, percent = 22.4%
[2024-02-08 17:06:25,354] [INFO] [logging.py:96:log_dist] [Rank 0] DeepSpeed Final Optimizer = adam
[2024-02-08 17:06:25,354] [INFO] [logging.py:96:log_dist] [Rank 0] DeepSpeed using client callable to create LR scheduler
[2024-02-08 17:06:25,354] [INFO] [logging.py:96:log_dist] [Rank 0] DeepSpeed LR Scheduler =
[2024-02-08 17:06:25,354] [INFO] [logging.py:96:log_dist] [Rank 0] step=0, skipped=0, lr=[0.0], mom=[[0.9, 0.999]]
[2024-02-08 17:06:25,355] [INFO] [config.py:968:print] DeepSpeedEngine configuration:
[2024-02-08 17:06:25,356] [INFO] [config.py:972:print] activation_checkpointing_config { "partition_activations": false, "contiguous_memory_optimization": false, "cpu_checkpointing": false, "number_checkpoints": null, "synchronize_checkpoint_boundary": false, "profile": false }
[2024-02-08 17:06:25,356] [INFO] [config.py:972:print] aio_config ................... {'block_size': 1048576, 'queue_depth': 8, 'thread_count': 1, 'single_submit': False, 'overlap_events': True}
[2024-02-08 17:06:25,356] [INFO] [config.py:972:print] amp_enabled .................. False
[2024-02-08 17:06:25,356] [INFO] [config.py:972:print] amp_params ................... False
[2024-02-08 17:06:25,356] [INFO] [config.py:972:print] autotuning_config ............ { "enabled": false, "start_step": null, "end_step": null, "metric_path": null, "arg_mappings": null, "metric": "throughput", "model_info": null, "results_dir": "autotuning_results", "exps_dir": "autotuning_exps", "overwrite": true, "fast": true, "start_profile_step": 3, "end_profile_step": 5, "tuner_type": "gridsearch", "tuner_early_stopping": 5, "tuner_num_trials": 50, "model_info_path": null, "mp_size": 1, "max_train_batch_size": null, "min_train_batch_size": 1, "max_train_micro_batch_size_per_gpu": 1.024000e+03, "min_train_micro_batch_size_per_gpu": 1, "num_tuning_micro_batch_sizes": 3 }
[2024-02-08 17:06:25,356] [INFO] [config.py:972:print] bfloat16_enabled ............. True
[2024-02-08 17:06:25,356] [INFO] [config.py:972:print] checkpoint_parallel_write_pipeline False
[2024-02-08 17:06:25,356] [INFO] [config.py:972:print] checkpoint_tag_validation_enabled True
[2024-02-08 17:06:25,356] [INFO] [config.py:972:print] checkpoint_tag_validation_fail False
[2024-02-08 17:06:25,356] [INFO] [config.py:972:print] comms_config .................
[2024-02-08 17:06:25,356] [INFO] [config.py:972:print] communication_data_type ...... None
[2024-02-08 17:06:25,356] [INFO] [config.py:972:print] compression_config ........... {'weight_quantization': {'shared_parameters': {'enabled': False, 'quantizer_kernel': False, 'schedule_offset': 0, 'quantize_groups': 1, 'quantize_verbose': False, 'quantization_type': 'symmetric', 'quantize_weight_in_forward': False, 'rounding': 'nearest', 'fp16_mixed_quantize': False, 'quantize_change_ratio': 0.001}, 'different_groups': {}}, 'activation_quantization': {'shared_parameters': {'enabled': False, 'quantization_type': 'symmetric', 'range_calibration': 'dynamic', 'schedule_offset': 1000}, 'different_groups': {}}, 'sparse_pruning': {'shared_parameters': {'enabled': False, 'method': 'l1', 'schedule_offset': 1000}, 'different_groups': {}}, 'row_pruning': {'shared_parameters': {'enabled': False, 'method': 'l1', 'schedule_offset': 1000}, 'different_groups': {}}, 'head_pruning': {'shared_parameters': {'enabled': False, 'method': 'topk', 'schedule_offset': 1000}, 'different_groups': {}}, 'channel_pruning': {'shared_parameters': {'enabled': False, 'method': 'l1', 'schedule_offset': 1000}, 'different_groups': {}}, 'layer_reduction': {'enabled': False}}
[2024-02-08 17:06:25,356] [INFO] [config.py:972:print] curriculum_enabled_legacy .... False
[2024-02-08 17:06:25,357] [INFO] [config.py:972:print] curriculum_params_legacy ..... False
[2024-02-08 17:06:25,357] [INFO] [config.py:972:print] data_efficiency_config ....... {'enabled': False, 'seed': 1234, 'data_sampling': {'enabled': False, 'num_epochs': 1000, 'num_workers': 0, 'curriculum_learning': {'enabled': False}}, 'data_routing': {'enabled': False, 'random_ltd': {'enabled': False, 'layer_token_lr_schedule': {'enabled': False}}}}
[2024-02-08 17:06:25,357] [INFO] [config.py:972:print] data_efficiency_enabled ...... False
[2024-02-08 17:06:25,357] [INFO] [config.py:972:print] dataloader_drop_last ......... False
[2024-02-08 17:06:25,357] [INFO] [config.py:972:print] disable_allgather ............ False
[2024-02-08 17:06:25,357] [INFO] [config.py:972:print] dump_state ................... 
False [2024-02-08 17:06:25,357] [INFO] [config.py:972:print] dynamic_loss_scale_args ...... None [2024-02-08 17:06:25,357] [INFO] [config.py:972:print] eigenvalue_enabled ........... False [2024-02-08 17:06:25,357] [INFO] [config.py:972:print] eigenvalue_gas_boundary_resolution 1 [2024-02-08 17:06:25,357] [INFO] [config.py:972:print] eigenvalue_layer_name ........ bert.encoder.layer [2024-02-08 17:06:25,357] [INFO] [config.py:972:print] eigenvalue_layer_num ......... 0 [2024-02-08 17:06:25,357] [INFO] [config.py:972:print] eigenvalue_max_iter .......... 100 [2024-02-08 17:06:25,357] [INFO] [config.py:972:print] eigenvalue_stability ......... 1e-06 [2024-02-08 17:06:25,357] [INFO] [config.py:972:print] eigenvalue_tol ............... 0.01 [2024-02-08 17:06:25,357] [INFO] [config.py:972:print] eigenvalue_verbose ........... False [2024-02-08 17:06:25,357] [INFO] [config.py:972:print] elasticity_enabled ........... False [2024-02-08 17:06:25,357] [INFO] [config.py:972:print] flops_profiler_config ........ { "enabled": false, "recompute_fwd_factor": 0.0, "profile_step": 1, "module_depth": -1, "top_modules": 1, "detailed": true, "output_file": null } [2024-02-08 17:06:25,357] [INFO] [config.py:972:print] fp16_auto_cast ............... None [2024-02-08 17:06:25,357] [INFO] [config.py:972:print] fp16_enabled ................. False [2024-02-08 17:06:25,357] [INFO] [config.py:972:print] fp16_master_weights_and_gradients False [2024-02-08 17:06:25,357] [INFO] [config.py:972:print] global_rank .................. 0 [2024-02-08 17:06:25,357] [INFO] [config.py:972:print] grad_accum_dtype ............. None [2024-02-08 17:06:25,357] [INFO] [config.py:972:print] gradient_accumulation_steps .. 8 [2024-02-08 17:06:25,357] [INFO] [config.py:972:print] gradient_clipping ............ 1.0 [2024-02-08 17:06:25,357] [INFO] [config.py:972:print] gradient_predivide_factor .... 1.0 [2024-02-08 17:06:25,357] [INFO] [config.py:972:print] hybrid_engine ................ 
enabled=False max_out_tokens=512 inference_tp_size=1 release_inference_cache=False pin_parameters=True tp_gather_partition_size=8 [2024-02-08 17:06:25,357] [INFO] [config.py:972:print] initial_dynamic_scale ........ 1 [2024-02-08 17:06:25,357] [INFO] [config.py:972:print] load_universal_checkpoint .... False [2024-02-08 17:06:25,357] [INFO] [config.py:972:print] loss_scale ................... 1.0 [2024-02-08 17:06:25,357] [INFO] [config.py:972:print] memory_breakdown ............. False [2024-02-08 17:06:25,357] [INFO] [config.py:972:print] mics_hierarchial_params_gather False [2024-02-08 17:06:25,357] [INFO] [config.py:972:print] mics_shard_size .............. -1 [2024-02-08 17:06:25,358] [INFO] [config.py:972:print] monitor_config ............... tensorboard=TensorBoardConfig(enabled=False, output_path='', job_name='DeepSpeedJobName') wandb=WandbConfig(enabled=False, group=None, team=None, project='deepspeed') csv_monitor=CSVConfig(enabled=False, output_path='', job_name='DeepSpeedJobName') enabled=False [2024-02-08 17:06:25,358] [INFO] [config.py:972:print] nebula_config ................ { "enabled": false, "persistent_storage_path": null, "persistent_time_interval": 100, "num_of_version_in_retention": 2, "enable_nebula_load": true, "load_path": null } [2024-02-08 17:06:25,358] [INFO] [config.py:972:print] optimizer_legacy_fusion ...... False [2024-02-08 17:06:25,358] [INFO] [config.py:972:print] optimizer_name ............... adam [2024-02-08 17:06:25,358] [INFO] [config.py:972:print] optimizer_params ............. {'lr': 2e-05, 'betas': [0.9, 0.999], 'eps': 1e-08, 'weight_decay': 0.0} [2024-02-08 17:06:25,358] [INFO] [config.py:972:print] pipeline ..................... {'stages': 'auto', 'partition': 'best', 'seed_layers': False, 'activation_checkpoint_interval': 0} [2024-02-08 17:06:25,358] [INFO] [config.py:972:print] pld_enabled .................. False [2024-02-08 17:06:25,358] [INFO] [config.py:972:print] pld_params ................... 
False [2024-02-08 17:06:25,358] [INFO] [config.py:972:print] prescale_gradients ........... False [2024-02-08 17:06:25,358] [INFO] [config.py:972:print] scheduler_name ............... None [2024-02-08 17:06:25,358] [INFO] [config.py:972:print] scheduler_params ............. None [2024-02-08 17:06:25,358] [INFO] [config.py:972:print] sparse_attention ............. None [2024-02-08 17:06:25,358] [INFO] [config.py:972:print] sparse_gradients_enabled ..... False [2024-02-08 17:06:25,358] [INFO] [config.py:972:print] steps_per_print .............. 1000 [2024-02-08 17:06:25,358] [INFO] [config.py:972:print] train_batch_size ............. 512 [2024-02-08 17:06:25,358] [INFO] [config.py:972:print] train_micro_batch_size_per_gpu 8 [2024-02-08 17:06:25,358] [INFO] [config.py:972:print] use_node_local_storage ....... False [2024-02-08 17:06:25,358] [INFO] [config.py:972:print] wall_clock_breakdown ......... False [2024-02-08 17:06:25,358] [INFO] [config.py:972:print] weight_quantization_config ... None [2024-02-08 17:06:25,358] [INFO] [config.py:972:print] world_size ................... 8 [2024-02-08 17:06:25,358] [INFO] [config.py:972:print] zero_allow_untested_optimizer False [2024-02-08 17:06:25,358] [INFO] [config.py:972:print] zero_config .................. 
stage=3 contiguous_gradients=True reduce_scatter=True reduce_bucket_size=16777216 allgather_partitions=True allgather_bucket_size=500,000,000 overlap_comm=True load_from_fp32_weights=True elastic_checkpoint=False offload_param=DeepSpeedZeroOffloadParamConfig(device='cpu', nvme_path=None, buffer_count=5, buffer_size=100,000,000, max_in_cpu=1,000,000,000, pin_memory=True) offload_optimizer=DeepSpeedZeroOffloadOptimizerConfig(device='cpu', nvme_path=None, buffer_count=4, pin_memory=True, pipeline=False, pipeline_read=False, pipeline_write=False, fast_init=False) sub_group_size=1000000000 cpu_offload_param=None cpu_offload_use_pin_memory=None cpu_offload=None prefetch_bucket_size=15099494 param_persistence_threshold=40960 model_persistence_threshold=sys.maxsize max_live_parameters=1000000000 max_reuse_distance=1000000000 gather_16bit_weights_on_model_save=True stage3_gather_fp16_weights_on_model_save=False ignore_unused_parameters=True legacy_stage1=False round_robin_gradients=False zero_hpz_partition_size=1 zero_quantized_weights=False zero_quantized_nontrainable_weights=False zero_quantized_gradients=False mics_shard_size=-1 mics_hierarchical_params_gather=False memory_efficient_linear=True pipeline_loading_checkpoint=False override_module_apply=True [2024-02-08 17:06:25,358] [INFO] [config.py:972:print] zero_enabled ................. True [2024-02-08 17:06:25,358] [INFO] [config.py:972:print] zero_force_ds_cpu_optimizer .. True [2024-02-08 17:06:25,358] [INFO] [config.py:972:print] zero_optimization_stage ...... 
3 [2024-02-08 17:06:25,359] [INFO] [config.py:958:print_user_config] json = { "optimizer": { "type": "Adam", "params": { "lr": 2e-05, "betas": [0.9, 0.999], "eps": 1e-08, "weight_decay": 0.0 } }, "bf16": { "enabled": true }, "zero_optimization": { "stage": 3, "offload_optimizer": { "device": "cpu", "pin_memory": true }, "offload_param": { "device": "cpu", "pin_memory": true }, "overlap_comm": true, "contiguous_gradients": true, "reduce_bucket_size": 1.677722e+07, "stage3_prefetch_bucket_size": 1.509949e+07, "stage3_param_persistence_threshold": 4.096000e+04, "sub_group_size": 1.000000e+09, "stage3_max_live_parameters": 1.000000e+09, "stage3_max_reuse_distance": 1.000000e+09, "stage3_gather_16bit_weights_on_model_save": true }, "gradient_accumulation_steps": 8, "gradient_clipping": 1.0, "steps_per_print": 1000, "train_batch_size": 512, "train_micro_batch_size_per_gpu": 8, "wall_clock_breakdown": false } [INFO|trainer.py:1755] 2024-02-08 17:06:25,360 >> ***** Running training ***** [INFO|trainer.py:1756] 2024-02-08 17:06:25,360 >> Num examples = 64204 [INFO|trainer.py:1757] 2024-02-08 17:06:25,360 >> Num Epochs = 3 [INFO|trainer.py:1758] 2024-02-08 17:06:25,360 >> Instantaneous batch size per device = 8 [INFO|trainer.py:1759] 2024-02-08 17:06:25,360 >> Total train batch size (w. 
parallel, distributed & accumulation) = 512 [INFO|trainer.py:1760] 2024-02-08 17:06:25,360 >> Gradient Accumulation steps = 8 [INFO|trainer.py:1761] 2024-02-08 17:06:25,360 >> Total optimization steps = 375 [INFO|trainer.py:1762] 2024-02-08 17:06:25,362 >> Number of trainable parameters = 6738432000 0%| | 0/375 [00:002->1 [1] 3/-1/-1->2->1 [2] 3/-1/-1->2->1 [3] 3/-1/-1->2->1 [4] 3/-1/-1->2->1 [5] 3/-1/-1->2->1 [6] 3/-1/-1->2->1 [7] 3/-1/-1->2->1 [8] 3/-1/-1->2->1 [9] 3/-1/-1->2->1 [10] 3/-1/-1->2->1 [11] 3/-1/-1->2->1 [12] 3/-1/-1->2->1 [13] 3/-1/-1->2->1 [14] 3/-1/-1->2->1 [15] 3/-1/-1->2->1 [16] 3/-1/-1->2->1 [17] 3/-1/-1->2->1 [18] 3/-1/-1->2->1 [19] 3/-1/-1->2->1 [20] 3/-1/-1->2->1 [21] 3/-1/-1->2->1 [22] 3/-1/-1->2->1 [23] 3/-1/-1->2->1 ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO Trees [0] 2/-1/-1->1->0 [1] 2/-1/-1->1->0 [2] 2/-1/-1->1->0 [3] 2/-1/-1->1->0 [4] 2/-1/-1->1->0 [5] 2/-1/-1->1->0 [6] 2/-1/-1->1->0 [7] 2/-1/-1->1->0 [8] 2/-1/-1->1->0 [9] 2/-1/-1->1->0 [10] 2/-1/-1->1->0 [11] 2/-1/-1->1->0 [12] 2/-1/-1->1->0 [13] 2/-1/-1->1->0 [14] 2/-1/-1->1->0 [15] 2/-1/-1->1->0 [16] 2/-1/-1->1->0 [17] 2/-1/-1->1->0 [18] 2/-1/-1->1->0 [19] 2/-1/-1->1->0 [20] 2/-1/-1->1->0 [21] 2/-1/-1->1->0 [22] 2/-1/-1->1->0 [23] 2/-1/-1->1->0 ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO Channel 00/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO Trees [0] -1/-1/-1->7->6 [1] -1/-1/-1->7->6 [2] -1/-1/-1->7->6 [3] -1/-1/-1->7->6 [4] -1/-1/-1->7->6 [5] -1/-1/-1->7->6 [6] -1/-1/-1->7->6 [7] -1/-1/-1->7->6 [8] -1/-1/-1->7->6 [9] -1/-1/-1->7->6 [10] -1/-1/-1->7->6 [11] -1/-1/-1->7->6 [12] -1/-1/-1->7->6 [13] -1/-1/-1->7->6 [14] -1/-1/-1->7->6 [15] -1/-1/-1->7->6 [16] -1/-1/-1->7->6 [17] -1/-1/-1->7->6 [18] -1/-1/-1->7->6 [19] -1/-1/-1->7->6 [20] -1/-1/-1->7->6 [21] -1/-1/-1->7->6 [22] -1/-1/-1->7->6 [23] -1/-1/-1->7->6 ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO 
Channel 01/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO Trees [0] 6/-1/-1->5->4 [1] 6/-1/-1->5->4 [2] 6/-1/-1->5->4 [3] 6/-1/-1->5->4 [4] 6/-1/-1->5->4 [5] 6/-1/-1->5->4 [6] 6/-1/-1->5->4 [7] 6/-1/-1->5->4 [8] 6/-1/-1->5->4 [9] 6/-1/-1->5->4 [10] 6/-1/-1->5->4 [11] 6/-1/-1->5->4 [12] 6/-1/-1->5->4 [13] 6/-1/-1->5->4 [14] 6/-1/-1->5->4 [15] 6/-1/-1->5->4 [16] 6/-1/-1->5->4 [17] 6/-1/-1->5->4 [18] 6/-1/-1->5->4 [19] 6/-1/-1->5->4 [20] 6/-1/-1->5->4 [21] 6/-1/-1->5->4 [22] 6/-1/-1->5->4 [23] 6/-1/-1->5->4 ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO Channel 02/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO Channel 03/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO Channel 04/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO Channel 05/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO Trees [0] 7/-1/-1->6->5 [1] 7/-1/-1->6->5 [2] 7/-1/-1->6->5 [3] 7/-1/-1->6->5 [4] 7/-1/-1->6->5 [5] 7/-1/-1->6->5 [6] 7/-1/-1->6->5 [7] 7/-1/-1->6->5 [8] 7/-1/-1->6->5 [9] 7/-1/-1->6->5 [10] 7/-1/-1->6->5 [11] 7/-1/-1->6->5 [12] 7/-1/-1->6->5 [13] 7/-1/-1->6->5 [14] 7/-1/-1->6->5 [15] 7/-1/-1->6->5 [16] 7/-1/-1->6->5 [17] 7/-1/-1->6->5 [18] 7/-1/-1->6->5 [19] 7/-1/-1->6->5 [20] 7/-1/-1->6->5 [21] 7/-1/-1->6->5 [22] 7/-1/-1->6->5 [23] 7/-1/-1->6->5 ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO Trees [0] 5/-1/-1->4->3 [1] 5/-1/-1->4->3 [2] 5/-1/-1->4->3 [3] 5/-1/-1->4->3 [4] 5/-1/-1->4->3 [5] 5/-1/-1->4->3 [6] 5/-1/-1->4->3 [7] 5/-1/-1->4->3 [8] 5/-1/-1->4->3 [9] 5/-1/-1->4->3 [10] 5/-1/-1->4->3 [11] 5/-1/-1->4->3 [12] 5/-1/-1->4->3 [13] 5/-1/-1->4->3 [14] 5/-1/-1->4->3 [15] 5/-1/-1->4->3 [16] 5/-1/-1->4->3 [17] 5/-1/-1->4->3 [18] 5/-1/-1->4->3 [19] 5/-1/-1->4->3 [20] 5/-1/-1->4->3 [21] 5/-1/-1->4->3 [22] 5/-1/-1->4->3 [23] 
5/-1/-1->4->3 ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO Channel 06/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO Trees [0] 4/-1/-1->3->2 [1] 4/-1/-1->3->2 [2] 4/-1/-1->3->2 [3] 4/-1/-1->3->2 [4] 4/-1/-1->3->2 [5] 4/-1/-1->3->2 [6] 4/-1/-1->3->2 [7] 4/-1/-1->3->2 [8] 4/-1/-1->3->2 [9] 4/-1/-1->3->2 [10] 4/-1/-1->3->2 [11] 4/-1/-1->3->2 [12] 4/-1/-1->3->2 [13] 4/-1/-1->3->2 [14] 4/-1/-1->3->2 [15] 4/-1/-1->3->2 [16] 4/-1/-1->3->2 [17] 4/-1/-1->3->2 [18] 4/-1/-1->3->2 [19] 4/-1/-1->3->2 [20] 4/-1/-1->3->2 [21] 4/-1/-1->3->2 [22] 4/-1/-1->3->2 [23] 4/-1/-1->3->2 ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO Channel 07/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO Channel 08/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO Channel 09/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO Channel 10/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO Channel 11/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO Channel 12/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO Channel 13/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO Channel 14/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO Channel 15/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO Channel 16/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO Channel 17/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO Channel 18/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO Channel 19/24 : 0 1 2 3 4 5 6 7 
ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO Channel 20/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO Channel 21/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO Channel 22/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO Channel 23/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO Trees [0] 1/-1/-1->0->-1 [1] 1/-1/-1->0->-1 [2] 1/-1/-1->0->-1 [3] 1/-1/-1->0->-1 [4] 1/-1/-1->0->-1 [5] 1/-1/-1->0->-1 [6] 1/-1/-1->0->-1 [7] 1/-1/-1->0->-1 [8] 1/-1/-1->0->-1 [9] 1/-1/-1->0->-1 [10] 1/-1/-1->0->-1 [11] 1/-1/-1->0->-1 [12] 1/-1/-1->0->-1 [13] 1/-1/-1->0->-1 [14] 1/-1/-1->0->-1 [15] 1/-1/-1->0->-1 [16] 1/-1/-1->0->-1 [17] 1/-1/-1->0->-1 [18] 1/-1/-1->0->-1 [19] 1/-1/-1->0->-1 [20] 1/-1/-1->0->-1 [21] 1/-1/-1->0->-1 [22] 1/-1/-1->0->-1 [23] 1/-1/-1->0->-1 ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO Channel 00/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO Channel 00/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO Channel 00/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:88320 [2] NCCL INFO Channel 00/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO Channel 00/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO Channel 00/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO Channel 00/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO Channel 00/0 : 0[e000] -> 1[13000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO Channel 01/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO Channel 01/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO Channel 01/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:88320 [2] NCCL INFO Channel 01/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO Channel 01/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO Channel 01/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO Channel 01/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO Channel 01/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO Channel 02/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO Channel 02/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO Channel 02/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:88320 [2] NCCL INFO Channel 02/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO Channel 02/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO Channel 02/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO Channel 02/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO Channel 02/0 : 0[e000] -> 1[13000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO Channel 03/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO Channel 03/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO Channel 03/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:88320 [2] NCCL INFO Channel 03/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO Channel 03/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO Channel 03/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO Channel 03/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO Channel 03/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO Channel 04/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO Channel 04/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO Channel 04/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:88320 [2] NCCL INFO Channel 04/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO Channel 04/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO Channel 04/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO Channel 04/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO Channel 04/0 : 0[e000] -> 1[13000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO Channel 05/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO Channel 05/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO Channel 05/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:88320 [2] NCCL INFO Channel 05/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO Channel 05/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO Channel 05/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO Channel 05/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO Channel 05/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO Channel 06/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO Channel 06/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO Channel 06/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:88320 [2] NCCL INFO Channel 06/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO Channel 06/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO Channel 06/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO Channel 06/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO Channel 06/0 : 0[e000] -> 1[13000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO Channel 07/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO Channel 07/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO Channel 07/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:88320 [2] NCCL INFO Channel 07/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO Channel 07/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO Channel 07/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO Channel 07/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO Channel 07/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO Channel 08/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO Channel 08/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO Channel 08/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:88320 [2] NCCL INFO Channel 08/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO Channel 08/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO Channel 08/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO Channel 08/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO Channel 08/0 : 0[e000] -> 1[13000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO Channel 09/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO Channel 09/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO Channel 09/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:88320 [2] NCCL INFO Channel 09/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO Channel 09/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO Channel 09/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO Channel 09/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO Channel 09/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO Channel 10/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO Channel 10/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO Channel 10/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:88320 [2] NCCL INFO Channel 10/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO Channel 10/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO Channel 10/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO Channel 10/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO Channel 10/0 : 0[e000] -> 1[13000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO Channel 11/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO Channel 11/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO Channel 11/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:88320 [2] NCCL INFO Channel 11/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO Channel 11/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO Channel 11/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO Channel 11/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO Channel 11/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO Channel 12/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO Channel 12/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO Channel 12/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:88320 [2] NCCL INFO Channel 12/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO Channel 12/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO Channel 12/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO Channel 12/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO Channel 12/0 : 0[e000] -> 1[13000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO Channel 13/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO Channel 13/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO Channel 13/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:88320 [2] NCCL INFO Channel 13/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO Channel 13/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO Channel 13/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO Channel 13/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO Channel 13/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO Channel 14/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO Channel 14/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO Channel 14/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:88320 [2] NCCL INFO Channel 14/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO Channel 14/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO Channel 14/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO Channel 14/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO Channel 14/0 : 0[e000] -> 1[13000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO Channel 15/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO Channel 15/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO Channel 15/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:88320 [2] NCCL INFO Channel 15/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO Channel 15/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO Channel 15/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO Channel 15/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO Channel 15/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO Channel 16/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO Channel 16/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO Channel 16/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:88320 [2] NCCL INFO Channel 16/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO Channel 16/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO Channel 16/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO Channel 16/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO Channel 16/0 : 0[e000] -> 1[13000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO Channel 17/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO Channel 17/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO Channel 17/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:88320 [2] NCCL INFO Channel 17/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO Channel 17/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO Channel 17/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO Channel 17/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO Channel 17/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO Channel 18/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO Channel 18/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO Channel 18/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:88320 [2] NCCL INFO Channel 18/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO Channel 18/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO Channel 18/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO Channel 18/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO Channel 18/0 : 0[e000] -> 1[13000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO Channel 19/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO Channel 19/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO Channel 19/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:88320 [2] NCCL INFO Channel 19/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO Channel 19/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO Channel 19/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO Channel 19/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO Channel 19/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO Channel 20/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO Channel 20/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO Channel 20/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:88320 [2] NCCL INFO Channel 20/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO Channel 20/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO Channel 20/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO Channel 20/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO Channel 20/0 : 0[e000] -> 1[13000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO Channel 21/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO Channel 21/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO Channel 21/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:88320 [2] NCCL INFO Channel 21/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO Channel 21/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO Channel 21/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO Channel 21/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO Channel 21/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO Channel 22/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO Channel 22/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO Channel 22/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:88320 [2] NCCL INFO Channel 22/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO Channel 22/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO Channel 22/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO Channel 22/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO Channel 22/0 : 0[e000] -> 1[13000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO Channel 23/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO Channel 23/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO Channel 23/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:88320 [2] NCCL INFO Channel 23/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO Channel 23/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO Channel 23/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO Channel 23/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO Channel 23/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO Connected all rings ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:88320 [2] NCCL INFO Connected all rings ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO Connected all rings ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO Connected all rings ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO Connected all rings ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO Connected all rings ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO Channel 00/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO Connected all rings ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO Connected all rings ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO Channel 01/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO Channel 02/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO Channel 03/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO Channel 04/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO Channel 05/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO Channel 06/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO Channel 07/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO Channel 08/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO Channel 09/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO Channel 10/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO Channel 11/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO Channel 12/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO Channel 13/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO Channel 14/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO Channel 15/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO Channel 16/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO Channel 17/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO Channel 18/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO Channel 19/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO Channel 20/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO Channel 21/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO Channel 22/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO Channel 23/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO Channel 00/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:88320 [2] NCCL INFO Channel 00/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO Channel 00/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO Channel 00/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO Channel 00/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO Channel 00/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO Channel 01/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO Channel 01/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:88320 [2] NCCL INFO Channel 01/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO Channel 01/0 : 3[51000] -> 2[4b000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO Channel 01/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO Channel 01/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO Channel 02/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:88320 [2] NCCL INFO Channel 02/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO Channel 02/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO Channel 02/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO Channel 02/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO Channel 02/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO Channel 03/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:88320 [2] NCCL INFO Channel 03/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO Channel 03/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO Channel 03/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO Channel 03/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO Channel 03/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO Channel 04/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO Channel 04/0 : 3[51000] -> 2[4b000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:88320 [2] NCCL INFO Channel 04/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO Channel 04/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO Channel 04/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO Channel 04/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO Channel 05/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO Channel 05/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:88320 [2] NCCL INFO Channel 05/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO Channel 05/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO Channel 05/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO Channel 05/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO Channel 06/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO Channel 06/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:88320 [2] NCCL INFO Channel 06/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO Channel 06/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO Channel 06/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO Channel 06/0 : 6[cb000] -> 5[99000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO Channel 07/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:88320 [2] NCCL INFO Channel 07/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO Channel 07/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO Channel 07/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO Channel 07/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO Channel 07/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO Channel 08/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO Channel 08/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:88320 [2] NCCL INFO Channel 08/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO Channel 08/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO Channel 08/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO Channel 08/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO Channel 09/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO Channel 09/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:88320 [2] NCCL INFO Channel 09/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO Channel 09/0 : 5[99000] -> 4[93000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO Channel 09/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO Channel 09/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO Channel 10/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:88320 [2] NCCL INFO Channel 10/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO Channel 10/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO Channel 10/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO Channel 10/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO Channel 10/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO Channel 11/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO Channel 11/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:88320 [2] NCCL INFO Channel 11/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO Channel 11/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO Channel 11/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO Channel 11/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO Channel 12/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO Channel 12/0 : 1[13000] -> 0[e000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO Channel 12/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:88320 [2] NCCL INFO Channel 12/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO Channel 12/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO Channel 12/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO Channel 13/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO Channel 13/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO Channel 13/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:88320 [2] NCCL INFO Channel 13/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO Channel 13/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO Channel 13/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO Channel 14/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO Channel 14/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO Channel 14/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:88320 [2] NCCL INFO Channel 14/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO Channel 14/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO Channel 14/0 : 5[99000] -> 4[93000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO Channel 15/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO Channel 15/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO Channel 15/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:88320 [2] NCCL INFO Channel 15/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO Channel 15/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO Channel 15/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO Channel 16/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO Channel 16/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO Channel 16/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO Channel 16/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:88320 [2] NCCL INFO Channel 16/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO Channel 16/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO Channel 17/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO Channel 17/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO Channel 17/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO Channel 17/0 : 3[51000] -> 2[4b000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:88320 [2] NCCL INFO Channel 17/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO Channel 17/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO Channel 18/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO Channel 18/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO Channel 18/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO Channel 18/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:88320 [2] NCCL INFO Channel 18/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO Channel 18/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO Channel 19/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO Channel 19/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO Channel 19/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO Channel 19/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:88320 [2] NCCL INFO Channel 19/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO Channel 19/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO Channel 20/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO Channel 20/0 : 1[13000] -> 0[e000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO Channel 20/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO Channel 20/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:88320 [2] NCCL INFO Channel 20/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO Channel 20/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO Channel 21/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO Channel 21/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO Channel 21/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO Channel 21/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO Channel 21/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:88320 [2] NCCL INFO Channel 21/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO Channel 22/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO Channel 22/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO Channel 22/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO Channel 22/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:88320 [2] NCCL INFO Channel 22/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO Channel 22/0 : 5[99000] -> 4[93000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO Channel 23/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO Channel 23/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO Channel 23/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO Channel 23/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:88320 [2] NCCL INFO Channel 23/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO Channel 23/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO Connected all trees ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO threadThresholds 8/8/64 | 64/8/64 | 512 | 512 ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO 24 coll channels, 32 p2p channels, 32 p2p channels per peer ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO Connected all trees ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO threadThresholds 8/8/64 | 64/8/64 | 512 | 512 ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO 24 coll channels, 32 p2p channels, 32 p2p channels per peer ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:88320 [2] NCCL INFO Connected all trees ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:88320 [2] NCCL INFO threadThresholds 8/8/64 | 64/8/64 | 512 | 512 ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:88320 [2] NCCL INFO 24 coll channels, 32 p2p channels, 32 p2p channels per peer ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO Connected all trees ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO threadThresholds 8/8/64 | 64/8/64 | 512 | 512 
ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO 24 coll channels, 32 p2p channels, 32 p2p channels per peer
ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO Connected all trees
ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO threadThresholds 8/8/64 | 64/8/64 | 512 | 512
ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO 24 coll channels, 32 p2p channels, 32 p2p channels per peer
ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO Connected all trees
ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO threadThresholds 8/8/64 | 64/8/64 | 512 | 512
ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO 24 coll channels, 32 p2p channels, 32 p2p channels per peer
ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO Connected all trees
ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO threadThresholds 8/8/64 | 64/8/64 | 512 | 512
ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO 24 coll channels, 32 p2p channels, 32 p2p channels per peer
ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO Connected all trees
ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO threadThresholds 8/8/64 | 64/8/64 | 512 | 512
ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO 24 coll channels, 32 p2p channels, 32 p2p channels per peer
ts-b102359ecb124d359c32da25fe3785b5-launcher:81888:88319 [1] NCCL INFO comm 0x7f1e78024b10 rank 1 nranks 8 cudaDev 1 busId 13000 - Init COMPLETE
ts-b102359ecb124d359c32da25fe3785b5-launcher:81893:88316 [6] NCCL INFO comm 0x7f60f0025620 rank 6 nranks 8 cudaDev 6 busId cb000 - Init COMPLETE
ts-b102359ecb124d359c32da25fe3785b5-launcher:81889:88320 [2] NCCL INFO comm 0x7fea24024ca0 rank 2 nranks 8 cudaDev 2 busId 4b000 - Init COMPLETE
ts-b102359ecb124d359c32da25fe3785b5-launcher:81891:88317 [4] NCCL INFO comm 0x7ef7700249f0 rank 4 nranks 8 cudaDev 4 busId 93000 - Init COMPLETE
ts-b102359ecb124d359c32da25fe3785b5-launcher:81887:88313 [0] NCCL INFO comm 0x7f9748025f60 rank 0 nranks 8 cudaDev 0 busId e000 - Init COMPLETE
ts-b102359ecb124d359c32da25fe3785b5-launcher:81890:88314 [3] NCCL INFO comm 0x7f2b20024ba0 rank 3 nranks 8 cudaDev 3 busId 51000 - Init COMPLETE
ts-b102359ecb124d359c32da25fe3785b5-launcher:81894:88318 [7] NCCL INFO comm 0x7f70f8024c00 rank 7 nranks 8 cudaDev 7 busId d0000 - Init COMPLETE
ts-b102359ecb124d359c32da25fe3785b5-launcher:81892:88315 [5] NCCL INFO comm 0x7f26c8024c90 rank 5 nranks 8 cudaDev 5 busId 99000 - Init COMPLETE
0%| | 1/375 [00:33<3:28:11, 33.40s/it]
{'loss': 1.5044, 'learning_rate': 1.6666666666666667e-06, 'epoch': 0.01}
0%| | 1/375 [00:33<3:28:11, 33.40s/it]
1%| | 2/375 [01:01<3:06:26, 29.99s/it]
{'loss': 1.4514, 'learning_rate': 3.3333333333333333e-06, 'epoch': 0.02}
1%| | 2/375 [01:01<3:06:26, 29.99s/it]
WARNING:torch.distributed.run:
*****************************************
Setting OMP_NUM_THREADS environment variable for each process to be 1 in default, to avoid your system being overloaded, please further tune the variable for optimal performance in your application as needed.
***************************************** [2024-02-08 17:08:41,698] [INFO] [real_accelerator.py:158:get_accelerator] Setting ds_accelerator to cuda (auto detect) [2024-02-08 17:08:41,698] [INFO] [real_accelerator.py:158:get_accelerator] Setting ds_accelerator to cuda (auto detect) [2024-02-08 17:08:41,698] [INFO] [real_accelerator.py:158:get_accelerator] Setting ds_accelerator to cuda (auto detect) [2024-02-08 17:08:41,698] [INFO] [real_accelerator.py:158:get_accelerator] Setting ds_accelerator to cuda (auto detect) [2024-02-08 17:08:41,698] [INFO] [real_accelerator.py:158:get_accelerator] Setting ds_accelerator to cuda (auto detect) [2024-02-08 17:08:41,698] [INFO] [real_accelerator.py:158:get_accelerator] Setting ds_accelerator to cuda (auto detect) [2024-02-08 17:08:41,698] [INFO] [real_accelerator.py:158:get_accelerator] Setting ds_accelerator to cuda (auto detect) [2024-02-08 17:08:41,706] [INFO] [real_accelerator.py:158:get_accelerator] Setting ds_accelerator to cuda (auto detect) [2024-02-08 17:09:02,829] [INFO] [comm.py:637:init_distributed] cdb=None [2024-02-08 17:09:02,830] [INFO] [comm.py:637:init_distributed] cdb=None [2024-02-08 17:09:02,952] [INFO] [comm.py:637:init_distributed] cdb=None [2024-02-08 17:09:02,955] [INFO] [comm.py:637:init_distributed] cdb=None [2024-02-08 17:09:02,974] [INFO] [comm.py:637:init_distributed] cdb=None [2024-02-08 17:09:02,977] [INFO] [comm.py:637:init_distributed] cdb=None [2024-02-08 17:09:02,979] [INFO] [comm.py:637:init_distributed] cdb=None [2024-02-08 17:09:02,980] [INFO] [comm.py:637:init_distributed] cdb=None [2024-02-08 17:09:02,980] [INFO] [comm.py:668:init_distributed] Initializing TorchBackend in DeepSpeed with backend nccl 02/08/2024 17:10:03 - WARNING - __main__ - Process rank: 2, device: cuda:2, n_gpu: 1distributed training: True, 16-bits training: False 02/08/2024 17:10:03 - WARNING - __main__ - Process rank: 5, device: cuda:5, n_gpu: 1distributed training: True, 16-bits training: False 02/08/2024 17:10:03 
- WARNING - __main__ - Process rank: 1, device: cuda:1, n_gpu: 1distributed training: True, 16-bits training: False 02/08/2024 17:10:03 - WARNING - __main__ - Process rank: 6, device: cuda:6, n_gpu: 1distributed training: True, 16-bits training: False 02/08/2024 17:10:03 - WARNING - __main__ - Process rank: 7, device: cuda:7, n_gpu: 1distributed training: True, 16-bits training: False 02/08/2024 17:10:03 - WARNING - __main__ - Process rank: 4, device: cuda:4, n_gpu: 1distributed training: True, 16-bits training: False 02/08/2024 17:10:03 - WARNING - __main__ - Process rank: 0, device: cuda:0, n_gpu: 1distributed training: True, 16-bits training: False 02/08/2024 17:10:03 - INFO - __main__ - Training/evaluation parameters TrainingArguments( _n_gpu=1, adafactor=False, adam_beta1=0.9, adam_beta2=0.999, adam_epsilon=1e-08, auto_find_batch_size=False, bf16=True, bf16_full_eval=True, data_seed=None, dataloader_drop_last=False, dataloader_num_workers=8, dataloader_pin_memory=True, ddp_bucket_cap_mb=None, ddp_find_unused_parameters=None, ddp_timeout=72000, debug=[], deepspeed=/apdcephfs/share_733425/vinnylywang/jianhuipang/llama2_sft/train/deepspeed_config_bf16.json, disable_tqdm=False, do_eval=False, do_predict=False, do_train=True, eval_accumulation_steps=None, eval_delay=0, eval_steps=None, evaluation_strategy=no, fp16=False, fp16_backend=auto, fp16_full_eval=False, fp16_opt_level=O1, fsdp=[], fsdp_config={'fsdp_min_num_params': 0, 'xla': False, 'xla_fsdp_grad_ckpt': False}, fsdp_min_num_params=0, fsdp_transformer_layer_cls_to_wrap=None, full_determinism=False, gradient_accumulation_steps=8, gradient_checkpointing=True, greater_is_better=None, group_by_length=False, half_precision_backend=auto, hub_model_id=None, hub_private_repo=False, hub_strategy=every_save, hub_token=, ignore_data_skip=False, include_inputs_for_metrics=False, jit_mode_eval=False, label_names=None, label_smoothing_factor=0.0, learning_rate=2e-05, length_column_name=length, 
load_best_model_at_end=False, local_rank=0, log_level=passive, log_level_replica=warning, log_on_each_node=True, logging_dir=./checkpoints_ctthensft/fortranslation/ac/allm-addac-alpacanewstest17to20-ac-7b/runs/Feb08_17-08-41_ts-b102359ecb124d359c32da25fe3785b5-launcher, logging_first_step=False, logging_nan_inf_filter=True, logging_steps=1, logging_strategy=steps, lr_scheduler_type=cosine, max_grad_norm=1.0, max_steps=-1, metric_for_best_model=None, mp_parameters=, no_cuda=False, num_train_epochs=3.0, optim=adamw_hf, optim_args=None, output_dir=./checkpoints_ctthensft/fortranslation/ac/allm-addac-alpacanewstest17to20-ac-7b, overwrite_output_dir=True, past_index=-1, per_device_eval_batch_size=1, per_device_train_batch_size=8, prediction_loss_only=False, push_to_hub=False, push_to_hub_model_id=None, push_to_hub_organization=None, push_to_hub_token=, ray_scope=last, remove_unused_columns=True, report_to=['tensorboard'], resume_from_checkpoint=None, run_name=./checkpoints_ctthensft/fortranslation/ac/allm-addac-alpacanewstest17to20-ac-7b, save_on_each_node=False, save_steps=500, save_strategy=steps, save_total_limit=1, seed=34, sharded_ddp=[], skip_memory_metrics=True, tf32=None, torch_compile=False, torch_compile_backend=None, torch_compile_mode=None, torchdynamo=None, tpu_metrics_debug=False, tpu_num_cores=None, use_ipex=False, use_legacy_prediction_loop=False, use_mps_device=False, warmup_ratio=0.03, warmup_steps=0, weight_decay=0.0, xpu_backend=None, ) 02/08/2024 17:10:03 - INFO - __main__ - Loading dataset from file: {'train': '/apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/datasets/alpaca-gpt4/train.alpacawithnewstest17to20.addac.alphf.json', 'validation': '/apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/datasets/alpaca-gpt4/eval.addac.json'} /apdcephfs/share_733425/vinnylywang/jianhuipang/llama2_sft/envs/lib/python3.8/site-packages/datasets/load.py:2089: FutureWarning: 'use_auth_token' was deprecated in favor of 'token' in version 2.14.0 and will 
be removed in 3.0.0. You can remove this warning by passing 'token=None' instead. warnings.warn( /apdcephfs/share_733425/vinnylywang/jianhuipang/llama2_sft/envs/lib/python3.8/site-packages/datasets/load.py:2089: FutureWarning: 'use_auth_token' was deprecated in favor of 'token' in version 2.14.0 and will be removed in 3.0.0. You can remove this warning by passing 'token=None' instead. warnings.warn( /apdcephfs/share_733425/vinnylywang/jianhuipang/llama2_sft/envs/lib/python3.8/site-packages/datasets/load.py:2089: FutureWarning: 'use_auth_token' was deprecated in favor of 'token' in version 2.14.0 and will be removed in 3.0.0. You can remove this warning by passing 'token=None' instead. warnings.warn( /apdcephfs/share_733425/vinnylywang/jianhuipang/llama2_sft/envs/lib/python3.8/site-packages/datasets/load.py:2089: FutureWarning: 'use_auth_token' was deprecated in favor of 'token' in version 2.14.0 and will be removed in 3.0.0. You can remove this warning by passing 'token=None' instead. warnings.warn( /apdcephfs/share_733425/vinnylywang/jianhuipang/llama2_sft/envs/lib/python3.8/site-packages/datasets/load.py:2089: FutureWarning: 'use_auth_token' was deprecated in favor of 'token' in version 2.14.0 and will be removed in 3.0.0. You can remove this warning by passing 'token=None' instead. warnings.warn( /apdcephfs/share_733425/vinnylywang/jianhuipang/llama2_sft/envs/lib/python3.8/site-packages/datasets/load.py:2089: FutureWarning: 'use_auth_token' was deprecated in favor of 'token' in version 2.14.0 and will be removed in 3.0.0. You can remove this warning by passing 'token=None' instead. warnings.warn( /apdcephfs/share_733425/vinnylywang/jianhuipang/llama2_sft/envs/lib/python3.8/site-packages/datasets/load.py:2089: FutureWarning: 'use_auth_token' was deprecated in favor of 'token' in version 2.14.0 and will be removed in 3.0.0. You can remove this warning by passing 'token=None' instead. 
warnings.warn( 02/08/2024 17:10:03 - WARNING - __main__ - Process rank: 3, device: cuda:3, n_gpu: 1distributed training: True, 16-bits training: False /apdcephfs/share_733425/vinnylywang/jianhuipang/llama2_sft/envs/lib/python3.8/site-packages/datasets/load.py:2089: FutureWarning: 'use_auth_token' was deprecated in favor of 'token' in version 2.14.0 and will be removed in 3.0.0. You can remove this warning by passing 'token=None' instead. warnings.warn( Using custom data configuration default-c67d78b39e072232 02/08/2024 17:10:04 - INFO - datasets.builder - Using custom data configuration default-c67d78b39e072232 Loading Dataset Infos from /apdcephfs/share_733425/vinnylywang/jianhuipang/llama2_sft/envs/lib/python3.8/site-packages/datasets/packaged_modules/json 02/08/2024 17:10:04 - INFO - datasets.info - Loading Dataset Infos from /apdcephfs/share_733425/vinnylywang/jianhuipang/llama2_sft/envs/lib/python3.8/site-packages/datasets/packaged_modules/json Overwrite dataset info from restored data version if exists. 02/08/2024 17:10:04 - INFO - datasets.builder - Overwrite dataset info from restored data version if exists. 
Loading Dataset info from /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96 02/08/2024 17:10:04 - INFO - datasets.info - Loading Dataset info from /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96 Found cached dataset json (/apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96) 02/08/2024 17:10:04 - INFO - datasets.builder - Found cached dataset json (/apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96) Loading Dataset info from /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96 02/08/2024 17:10:04 - INFO - datasets.info - Loading Dataset info from /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96 [INFO|configuration_utils.py:666] 2024-02-08 17:10:04,166 >> loading configuration file /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/apdcephfs/jianhuipang/gogollm/newmodels/checkpoints_ct/ac/allm-ac-7b/config.json [INFO|configuration_utils.py:720] 2024-02-08 17:10:04,167 >> Model config LlamaConfig { "_name_or_path": "/apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/apdcephfs/jianhuipang/gogollm/newmodels/checkpoints_ct/ac/allm-ac-7b", "architectures": [ "LlamaForCausalLM" ], "bos_token_id": 1, "eos_token_id": 2, "hidden_act": "silu", "hidden_size": 4096, "initializer_range": 0.02, "intermediate_size": 11008, 
"max_position_embeddings": 4096, "model_type": "llama", "num_attention_heads": 32, "num_hidden_layers": 32, "num_key_value_heads": 32, "pad_token_id": 0, "pretraining_tp": 1, "rms_norm_eps": 1e-05, "rope_scaling": null, "tie_word_embeddings": false, "torch_dtype": "bfloat16", "transformers_version": "4.28.0.dev0", "use_cache": true, "vocab_size": 32002 } 02/08/2024 17:10:04 - INFO - __main__ - Tokenizer_kwargs: {'cache_dir': None, 'use_fast': True, 'revision': 'main', 'use_auth_token': None, 'model_max_length': 4096} [INFO|tokenization_utils_base.py:1801] 2024-02-08 17:10:04,174 >> loading file tokenizer.model [INFO|tokenization_utils_base.py:1801] 2024-02-08 17:10:04,174 >> loading file added_tokens.json [INFO|tokenization_utils_base.py:1801] 2024-02-08 17:10:04,174 >> loading file special_tokens_map.json [INFO|tokenization_utils_base.py:1801] 2024-02-08 17:10:04,174 >> loading file tokenizer_config.json [INFO|tokenization_utils.py:426] 2024-02-08 17:10:04,188 >> Adding [PAD] to the vocabulary [INFO|tokenization_utils.py:426] 2024-02-08 17:10:04,188 >> Adding to the vocabulary 02/08/2024 17:10:04 - INFO - __main__ - Loading checkpoints in dtype: None [INFO|modeling_utils.py:2395] 2024-02-08 17:10:04,194 >> loading weights file /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/apdcephfs/jianhuipang/gogollm/newmodels/checkpoints_ct/ac/allm-ac-7b/pytorch_model.bin [INFO|modeling_utils.py:2487] 2024-02-08 17:10:58,799 >> Detected DeepSpeed ZeRO-3: activating zero.init() for this model [INFO|configuration_utils.py:575] 2024-02-08 17:10:58,803 >> Generate config GenerationConfig { "_from_model_config": true, "bos_token_id": 1, "eos_token_id": 2, "pad_token_id": 0, "transformers_version": "4.28.0.dev0" } ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:88507 [0] NCCL INFO Bootstrap : Using eth1:11.222.158.111<0> ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:88507 [0] NCCL INFO NET/Plugin : No plugin found (libnccl-net.so), using internal implementation 
ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:88507 [0] NCCL INFO cudaDriverVersion 11070 NCCL version 2.14.3+cuda11.7 ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:88510 [3] NCCL INFO cudaDriverVersion 11070 ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:88513 [6] NCCL INFO cudaDriverVersion 11070 ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:88510 [3] NCCL INFO Bootstrap : Using eth1:11.222.158.111<0> ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:88508 [1] NCCL INFO cudaDriverVersion 11070 ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:88513 [6] NCCL INFO Bootstrap : Using eth1:11.222.158.111<0> ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:88508 [1] NCCL INFO Bootstrap : Using eth1:11.222.158.111<0> ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:88514 [7] NCCL INFO cudaDriverVersion 11070 ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:88510 [3] NCCL INFO NET/Plugin : No plugin found (libnccl-net.so), using internal implementation ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:88514 [7] NCCL INFO Bootstrap : Using eth1:11.222.158.111<0> ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:88513 [6] NCCL INFO NET/Plugin : No plugin found (libnccl-net.so), using internal implementation ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:88508 [1] NCCL INFO NET/Plugin : No plugin found (libnccl-net.so), using internal implementation ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:88514 [7] NCCL INFO NET/Plugin : No plugin found (libnccl-net.so), using internal implementation ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO NET/IB : Using [0]mlx5_2:1/RoCE [RO]; OOB eth1:11.222.158.111<0> ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO Using network IB ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:88511 [4] NCCL INFO cudaDriverVersion 11070 ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:88511 [4] NCCL INFO Bootstrap : Using eth1:11.222.158.111<0> 
ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:88511 [4] NCCL INFO NET/Plugin : No plugin found (libnccl-net.so), using internal implementation ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO NET/IB : Using [0]mlx5_2:1/RoCE [RO]; OOB eth1:11.222.158.111<0> ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO Using network IB ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO NET/IB : Using [0]mlx5_2:1/RoCE [RO]; OOB eth1:11.222.158.111<0> ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO Using network IB ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO NET/IB : Using [0]mlx5_2:1/RoCE [RO]; OOB eth1:11.222.158.111<0> ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO Using network IB ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO NET/IB : Using [0]mlx5_2:1/RoCE [RO]; OOB eth1:11.222.158.111<0> ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO Using network IB ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:88512 [5] NCCL INFO cudaDriverVersion 11070 ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:88512 [5] NCCL INFO Bootstrap : Using eth1:11.222.158.111<0> ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:88509 [2] NCCL INFO cudaDriverVersion 11070 ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:88509 [2] NCCL INFO Bootstrap : Using eth1:11.222.158.111<0> ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:88512 [5] NCCL INFO NET/Plugin : No plugin found (libnccl-net.so), using internal implementation ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:88509 [2] NCCL INFO NET/Plugin : No plugin found (libnccl-net.so), using internal implementation ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO NET/IB : Using [0]mlx5_2:1/RoCE [RO]; OOB eth1:11.222.158.111<0> ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO Using network IB 
ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO NET/IB : Using [0]mlx5_2:1/RoCE [RO]; OOB eth1:11.222.158.111<0> ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO Using network IB ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO NET/IB : Using [0]mlx5_2:1/RoCE [RO]; OOB eth1:11.222.158.111<0> ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO Using network IB ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO Setting affinity for GPU 2 to ffff,ffffffff,00000000,0000ffff,ffffffff ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO Setting affinity for GPU 7 to ffffffff,ffff0000,00000000,ffffffff,ffff0000,00000000 ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO Setting affinity for GPU 4 to ffffffff,ffff0000,00000000,ffffffff,ffff0000,00000000 ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO Setting affinity for GPU 3 to ffff,ffffffff,00000000,0000ffff,ffffffff ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO Setting affinity for GPU 6 to ffffffff,ffff0000,00000000,ffffffff,ffff0000,00000000 ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO Setting affinity for GPU 0 to ffff,ffffffff,00000000,0000ffff,ffffffff ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO Setting affinity for GPU 1 to ffff,ffffffff,00000000,0000ffff,ffffffff ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO Setting affinity for GPU 5 to ffffffff,ffff0000,00000000,ffffffff,ffff0000,00000000 ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO Trees [0] 7/-1/-1->6->5 [1] 7/-1/-1->6->5 [2] 7/-1/-1->6->5 [3] 7/-1/-1->6->5 [4] 7/-1/-1->6->5 [5] 7/-1/-1->6->5 [6] 7/-1/-1->6->5 [7] 7/-1/-1->6->5 [8] 7/-1/-1->6->5 [9] 7/-1/-1->6->5 [10] 7/-1/-1->6->5 [11] 7/-1/-1->6->5 [12] 7/-1/-1->6->5 [13] 7/-1/-1->6->5 [14] 7/-1/-1->6->5 [15] 
7/-1/-1->6->5 [16] 7/-1/-1->6->5 [17] 7/-1/-1->6->5 [18] 7/-1/-1->6->5 [19] 7/-1/-1->6->5 [20] 7/-1/-1->6->5 [21] 7/-1/-1->6->5 [22] 7/-1/-1->6->5 [23] 7/-1/-1->6->5 ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO Trees [0] -1/-1/-1->7->6 [1] -1/-1/-1->7->6 [2] -1/-1/-1->7->6 [3] -1/-1/-1->7->6 [4] -1/-1/-1->7->6 [5] -1/-1/-1->7->6 [6] -1/-1/-1->7->6 [7] -1/-1/-1->7->6 [8] -1/-1/-1->7->6 [9] -1/-1/-1->7->6 [10] -1/-1/-1->7->6 [11] -1/-1/-1->7->6 [12] -1/-1/-1->7->6 [13] -1/-1/-1->7->6 [14] -1/-1/-1->7->6 [15] -1/-1/-1->7->6 [16] -1/-1/-1->7->6 [17] -1/-1/-1->7->6 [18] -1/-1/-1->7->6 [19] -1/-1/-1->7->6 [20] -1/-1/-1->7->6 [21] -1/-1/-1->7->6 [22] -1/-1/-1->7->6 [23] -1/-1/-1->7->6 ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO Trees [0] 5/-1/-1->4->3 [1] 5/-1/-1->4->3 [2] 5/-1/-1->4->3 [3] 5/-1/-1->4->3 [4] 5/-1/-1->4->3 [5] 5/-1/-1->4->3 [6] 5/-1/-1->4->3 [7] 5/-1/-1->4->3 [8] 5/-1/-1->4->3 [9] 5/-1/-1->4->3 [10] 5/-1/-1->4->3 [11] 5/-1/-1->4->3 [12] 5/-1/-1->4->3 [13] 5/-1/-1->4->3 [14] 5/-1/-1->4->3 [15] 5/-1/-1->4->3 [16] 5/-1/-1->4->3 [17] 5/-1/-1->4->3 [18] 5/-1/-1->4->3 [19] 5/-1/-1->4->3 [20] 5/-1/-1->4->3 [21] 5/-1/-1->4->3 [22] 5/-1/-1->4->3 [23] 5/-1/-1->4->3 ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO Channel 00/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO Trees [0] 6/-1/-1->5->4 [1] 6/-1/-1->5->4 [2] 6/-1/-1->5->4 [3] 6/-1/-1->5->4 [4] 6/-1/-1->5->4 [5] 6/-1/-1->5->4 [6] 6/-1/-1->5->4 [7] 6/-1/-1->5->4 [8] 6/-1/-1->5->4 [9] 6/-1/-1->5->4 [10] 6/-1/-1->5->4 [11] 6/-1/-1->5->4 [12] 6/-1/-1->5->4 [13] 6/-1/-1->5->4 [14] 6/-1/-1->5->4 [15] 6/-1/-1->5->4 [16] 6/-1/-1->5->4 [17] 6/-1/-1->5->4 [18] 6/-1/-1->5->4 [19] 6/-1/-1->5->4 [20] 6/-1/-1->5->4 [21] 6/-1/-1->5->4 [22] 6/-1/-1->5->4 [23] 6/-1/-1->5->4 ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO Trees [0] 2/-1/-1->1->0 [1] 2/-1/-1->1->0 [2] 2/-1/-1->1->0 [3] 
2/-1/-1->1->0 [4] 2/-1/-1->1->0 [5] 2/-1/-1->1->0 [6] 2/-1/-1->1->0 [7] 2/-1/-1->1->0 [8] 2/-1/-1->1->0 [9] 2/-1/-1->1->0 [10] 2/-1/-1->1->0 [11] 2/-1/-1->1->0 [12] 2/-1/-1->1->0 [13] 2/-1/-1->1->0 [14] 2/-1/-1->1->0 [15] 2/-1/-1->1->0 [16] 2/-1/-1->1->0 [17] 2/-1/-1->1->0 [18] 2/-1/-1->1->0 [19] 2/-1/-1->1->0 [20] 2/-1/-1->1->0 [21] 2/-1/-1->1->0 [22] 2/-1/-1->1->0 [23] 2/-1/-1->1->0 ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO Channel 01/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO Trees [0] 4/-1/-1->3->2 [1] 4/-1/-1->3->2 [2] 4/-1/-1->3->2 [3] 4/-1/-1->3->2 [4] 4/-1/-1->3->2 [5] 4/-1/-1->3->2 [6] 4/-1/-1->3->2 [7] 4/-1/-1->3->2 [8] 4/-1/-1->3->2 [9] 4/-1/-1->3->2 [10] 4/-1/-1->3->2 [11] 4/-1/-1->3->2 [12] 4/-1/-1->3->2 [13] 4/-1/-1->3->2 [14] 4/-1/-1->3->2 [15] 4/-1/-1->3->2 [16] 4/-1/-1->3->2 [17] 4/-1/-1->3->2 [18] 4/-1/-1->3->2 [19] 4/-1/-1->3->2 [20] 4/-1/-1->3->2 [21] 4/-1/-1->3->2 [22] 4/-1/-1->3->2 [23] 4/-1/-1->3->2 ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO Channel 02/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO Channel 03/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO Channel 04/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO Channel 05/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO Trees [0] 3/-1/-1->2->1 [1] 3/-1/-1->2->1 [2] 3/-1/-1->2->1 [3] 3/-1/-1->2->1 [4] 3/-1/-1->2->1 [5] 3/-1/-1->2->1 [6] 3/-1/-1->2->1 [7] 3/-1/-1->2->1 [8] 3/-1/-1->2->1 [9] 3/-1/-1->2->1 [10] 3/-1/-1->2->1 [11] 3/-1/-1->2->1 [12] 3/-1/-1->2->1 [13] 3/-1/-1->2->1 [14] 3/-1/-1->2->1 [15] 3/-1/-1->2->1 [16] 3/-1/-1->2->1 [17] 3/-1/-1->2->1 [18] 3/-1/-1->2->1 [19] 3/-1/-1->2->1 [20] 3/-1/-1->2->1 [21] 3/-1/-1->2->1 [22] 3/-1/-1->2->1 [23] 3/-1/-1->2->1 
ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO Channel 06/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO Channel 07/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO Channel 08/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO Channel 09/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO Channel 10/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO Channel 11/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO Channel 12/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO Channel 13/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO Channel 14/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO Channel 15/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO Channel 16/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO Channel 17/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO Channel 18/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO Channel 19/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO Channel 20/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO Channel 21/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO Channel 22/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO Channel 23/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO Trees [0] 1/-1/-1->0->-1 [1] 1/-1/-1->0->-1 [2] 1/-1/-1->0->-1 [3] 
1/-1/-1->0->-1 [4] 1/-1/-1->0->-1 [5] 1/-1/-1->0->-1 [6] 1/-1/-1->0->-1 [7] 1/-1/-1->0->-1 [8] 1/-1/-1->0->-1 [9] 1/-1/-1->0->-1 [10] 1/-1/-1->0->-1 [11] 1/-1/-1->0->-1 [12] 1/-1/-1->0->-1 [13] 1/-1/-1->0->-1 [14] 1/-1/-1->0->-1 [15] 1/-1/-1->0->-1 [16] 1/-1/-1->0->-1 [17] 1/-1/-1->0->-1 [18] 1/-1/-1->0->-1 [19] 1/-1/-1->0->-1 [20] 1/-1/-1->0->-1 [21] 1/-1/-1->0->-1 [22] 1/-1/-1->0->-1 [23] 1/-1/-1->0->-1 ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO Channel 00/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO Channel 00/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO Channel 00/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO Channel 00/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO Channel 00/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO Channel 00/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO Channel 00/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO Channel 00/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO Channel 01/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO Channel 01/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO Channel 01/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO Channel 01/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO Channel 01/0 : 5[99000] -> 
6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO Channel 01/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO Channel 01/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO Channel 01/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO Channel 02/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO Channel 02/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO Channel 02/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO Channel 02/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO Channel 02/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO Channel 02/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO Channel 02/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO Channel 02/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO Channel 03/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO Channel 03/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO Channel 03/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO Channel 03/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO Channel 03/0 : 5[99000] -> 6[cb000] via 
P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO Channel 03/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO Channel 03/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO Channel 03/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO Channel 04/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO Channel 04/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO Channel 04/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO Channel 04/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO Channel 04/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO Channel 04/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO Channel 04/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO Channel 04/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO Channel 05/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO Channel 05/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO Channel 05/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO Channel 05/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO Channel 05/0 : 5[99000] -> 6[cb000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO Channel 05/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO Channel 05/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO Channel 05/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO Channel 06/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO Channel 06/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO Channel 06/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO Channel 06/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO Channel 06/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO Channel 06/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO Channel 06/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO Channel 06/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO Channel 07/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO Channel 07/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO Channel 07/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO Channel 07/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO Channel 07/0 : 5[99000] -> 6[cb000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO Channel 07/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO Channel 07/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO Channel 07/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO Channel 08/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO Channel 08/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO Channel 08/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO Channel 08/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO Channel 08/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO Channel 08/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO Channel 08/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO Channel 08/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO Channel 09/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO Channel 09/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO Channel 09/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO Channel 09/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO Channel 09/0 : 5[99000] -> 6[cb000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO Channel 09/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO Channel 09/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO Channel 09/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO Channel 10/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO Channel 10/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO Channel 10/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO Channel 10/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO Channel 10/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO Channel 10/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO Channel 10/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO Channel 10/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO Channel 11/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO Channel 11/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO Channel 11/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO Channel 11/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO Channel 11/0 : 5[99000] -> 6[cb000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO Channel 11/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO Channel 11/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO Channel 11/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO Channel 12/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO Channel 12/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO Channel 12/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO Channel 12/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO Channel 12/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO Channel 12/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO Channel 12/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO Channel 12/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO Channel 13/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO Channel 13/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO Channel 13/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO Channel 13/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO Channel 13/0 : 5[99000] -> 6[cb000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO Channel 13/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO Channel 13/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO Channel 13/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO Channel 14/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO Channel 14/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO Channel 14/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO Channel 14/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO Channel 14/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO Channel 14/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO Channel 14/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO Channel 14/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO Channel 15/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO Channel 15/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO Channel 15/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO Channel 15/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO Channel 15/0 : 5[99000] -> 6[cb000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO Channel 15/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO Channel 15/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO Channel 15/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO Channel 16/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO Channel 16/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO Channel 16/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO Channel 16/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO Channel 16/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO Channel 16/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO Channel 16/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO Channel 16/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO Channel 17/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO Channel 17/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO Channel 17/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO Channel 17/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO Channel 17/0 : 5[99000] -> 6[cb000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO Channel 17/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO Channel 17/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO Channel 17/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO Channel 18/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO Channel 18/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO Channel 18/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO Channel 18/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO Channel 18/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO Channel 18/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO Channel 18/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO Channel 18/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO Channel 19/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO Channel 19/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO Channel 19/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO Channel 19/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO Channel 19/0 : 5[99000] -> 6[cb000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO Channel 19/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO Channel 19/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO Channel 19/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO Channel 20/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO Channel 20/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO Channel 20/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO Channel 20/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO Channel 20/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO Channel 20/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO Channel 20/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO Channel 20/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO Channel 21/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO Channel 21/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO Channel 21/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO Channel 21/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO Channel 21/0 : 5[99000] -> 6[cb000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO Channel 21/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO Channel 21/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO Channel 21/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO Channel 22/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO Channel 22/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO Channel 22/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO Channel 22/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO Channel 22/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO Channel 22/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO Channel 22/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO Channel 22/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO Channel 23/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO Channel 23/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO Channel 23/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO Channel 23/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO Channel 23/0 : 5[99000] -> 6[cb000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO Channel 23/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO Channel 23/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO Channel 23/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO Connected all rings ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO Connected all rings ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO Connected all rings ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO Connected all rings ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO Connected all rings ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO Connected all rings ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO Channel 00/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO Connected all rings ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO Connected all rings ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO Channel 01/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO Channel 02/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO Channel 03/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO Channel 04/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO Channel 05/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO Channel 06/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO Channel 07/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO Channel 08/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO Channel 09/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO Channel 10/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO Channel 11/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO Channel 12/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO Channel 13/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO Channel 14/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO Channel 15/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO Channel 16/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO Channel 17/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO Channel 18/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO Channel 19/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO Channel 20/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO Channel 21/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO Channel 22/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO Channel 23/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO Channel 00/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO Channel 00/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO Channel 00/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO Channel 00/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO Channel 00/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO Channel 00/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO Channel 01/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO Channel 01/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO Channel 01/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO Channel 01/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO Channel 01/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO Channel 01/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO Channel 02/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO Channel 02/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO Channel 02/0 : 5[99000] -> 4[93000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO Channel 02/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO Channel 02/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO Channel 02/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO Channel 03/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO Channel 03/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO Channel 03/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO Channel 03/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO Channel 03/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO Channel 03/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO Channel 04/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO Channel 04/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO Channel 04/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO Channel 04/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO Channel 04/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO Channel 04/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO Channel 05/0 : 6[cb000] -> 5[99000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO Channel 05/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO Channel 05/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO Channel 05/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO Channel 05/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO Channel 05/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO Channel 06/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO Channel 06/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO Channel 06/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO Channel 06/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO Channel 06/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO Channel 06/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO Channel 07/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO Channel 07/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO Channel 07/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO Channel 07/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO Channel 07/0 : 3[51000] -> 2[4b000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO Channel 07/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO Channel 08/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO Channel 08/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO Channel 08/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO Channel 08/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO Channel 08/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO Channel 08/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO Channel 09/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO Channel 09/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO Channel 09/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO Channel 09/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO Channel 09/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO Channel 09/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO Channel 10/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO Channel 10/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO Channel 10/0 : 6[cb000] -> 5[99000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO Channel 10/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO Channel 10/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO Channel 10/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO Channel 11/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO Channel 11/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO Channel 11/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO Channel 11/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO Channel 11/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO Channel 11/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO Channel 12/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO Channel 12/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO Channel 12/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO Channel 12/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO Channel 12/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO Channel 12/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO Channel 13/0 : 5[99000] -> 4[93000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO Channel 13/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO Channel 13/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO Channel 13/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO Channel 13/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO Channel 13/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO Channel 14/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO Channel 14/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO Channel 14/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO Channel 14/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO Channel 14/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO Channel 14/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO Channel 15/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO Channel 15/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO Channel 15/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO Channel 15/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO Channel 15/0 : 1[13000] -> 0[e000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO Channel 15/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO Channel 16/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO Channel 16/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO Channel 16/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO Channel 16/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO Channel 16/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO Channel 16/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO Channel 17/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO Channel 17/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO Channel 17/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO Channel 17/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO Channel 17/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO Channel 17/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO Channel 18/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO Channel 18/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO Channel 18/0 : 4[93000] -> 3[51000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO Channel 18/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO Channel 18/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO Channel 18/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO Channel 19/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO Channel 19/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO Channel 19/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO Channel 19/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO Channel 19/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO Channel 19/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO Channel 20/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO Channel 20/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO Channel 20/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO Channel 20/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO Channel 20/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO Channel 20/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO Channel 21/0 : 5[99000] -> 4[93000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO Channel 21/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO Channel 21/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO Channel 21/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO Channel 21/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO Channel 21/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO Channel 22/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO Channel 22/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO Channel 22/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO Channel 22/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO Channel 22/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO Channel 22/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO Channel 23/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO Channel 23/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO Channel 23/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO Channel 23/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO Channel 23/0 : 1[13000] -> 0[e000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO Channel 23/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO Connected all trees ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO threadThresholds 8/8/64 | 64/8/64 | 512 | 512 ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO 24 coll channels, 32 p2p channels, 32 p2p channels per peer ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO Connected all trees ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO threadThresholds 8/8/64 | 64/8/64 | 512 | 512 ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO 24 coll channels, 32 p2p channels, 32 p2p channels per peer ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO Connected all trees ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO threadThresholds 8/8/64 | 64/8/64 | 512 | 512 ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO 24 coll channels, 32 p2p channels, 32 p2p channels per peer ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO Connected all trees ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO threadThresholds 8/8/64 | 64/8/64 | 512 | 512 ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO 24 coll channels, 32 p2p channels, 32 p2p channels per peer ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO Connected all trees ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO threadThresholds 8/8/64 | 64/8/64 | 512 | 512 ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO 24 coll channels, 32 p2p channels, 32 p2p channels per peer ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO Connected all trees ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO threadThresholds 
8/8/64 | 64/8/64 | 512 | 512 ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO 24 coll channels, 32 p2p channels, 32 p2p channels per peer ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO Connected all trees ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO threadThresholds 8/8/64 | 64/8/64 | 512 | 512 ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO 24 coll channels, 32 p2p channels, 32 p2p channels per peer ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO Connected all trees ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO threadThresholds 8/8/64 | 64/8/64 | 512 | 512 ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO 24 coll channels, 32 p2p channels, 32 p2p channels per peer ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:89501 [6] NCCL INFO comm 0x43d40e40 rank 6 nranks 8 cudaDev 6 busId cb000 - Init COMPLETE ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:89503 [7] NCCL INFO comm 0x4569b400 rank 7 nranks 8 cudaDev 7 busId d0000 - Init COMPLETE ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:89510 [5] NCCL INFO comm 0x45b85800 rank 5 nranks 8 cudaDev 5 busId 99000 - Init COMPLETE ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:89507 [4] NCCL INFO comm 0x42d45cf0 rank 4 nranks 8 cudaDev 4 busId 93000 - Init COMPLETE ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:89502 [1] NCCL INFO comm 0x42a010e0 rank 1 nranks 8 cudaDev 1 busId 13000 - Init COMPLETE ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:89512 [2] NCCL INFO comm 0x45e5f3a0 rank 2 nranks 8 cudaDev 2 busId 4b000 - Init COMPLETE ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:89499 [0] NCCL INFO comm 0x45b1c820 rank 0 nranks 8 cudaDev 0 busId e000 - Init COMPLETE ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:89500 [3] NCCL INFO comm 0x46179170 rank 3 nranks 8 cudaDev 3 busId 51000 - Init COMPLETE [2024-02-08 17:11:07,313] [INFO] 
[partition_parameters.py:347:__exit__] finished initializing model - num_params = 291, num_elems = 6.74B [INFO|modeling_utils.py:3029] 2024-02-08 17:11:09,615 >> All model checkpoint weights were used when initializing LlamaForCausalLM. [INFO|modeling_utils.py:3037] 2024-02-08 17:11:09,615 >> All the weights of LlamaForCausalLM were initialized from the model checkpoint at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/apdcephfs/jianhuipang/gogollm/newmodels/checkpoints_ct/ac/allm-ac-7b. If your task is similar to the task the model of the checkpoint was trained on, you can already use LlamaForCausalLM for predictions without further training. [INFO|configuration_utils.py:535] 2024-02-08 17:11:09,624 >> loading configuration file /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/apdcephfs/jianhuipang/gogollm/newmodels/checkpoints_ct/ac/allm-ac-7b/generation_config.json [INFO|configuration_utils.py:575] 2024-02-08 17:11:09,625 >> Generate config GenerationConfig { "bos_token_id": 1, "do_sample": true, "eos_token_id": 2, "max_length": 4096, "pad_token_id": 0, "temperature": 0.6, "top_p": 0.9, "transformers_version": "4.28.0.dev0" } 02/08/2024 17:11:11 - INFO - __main__ - The old token as an anchor Process #0 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00000_of_00032.arrow 02/08/2024 17:11:11 - INFO - datasets.arrow_dataset - Process #0 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00000_of_00032.arrow Process #1 will write at 
/apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00001_of_00032.arrow 02/08/2024 17:11:11 - INFO - datasets.arrow_dataset - Process #1 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00001_of_00032.arrow Process #2 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00002_of_00032.arrow 02/08/2024 17:11:11 - INFO - datasets.arrow_dataset - Process #2 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00002_of_00032.arrow Process #3 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00003_of_00032.arrow 02/08/2024 17:11:11 - INFO - datasets.arrow_dataset - Process #3 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00003_of_00032.arrow Process #4 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00004_of_00032.arrow 02/08/2024 17:11:11 - INFO - datasets.arrow_dataset - Process #4 will write at 
/apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00004_of_00032.arrow Process #5 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00005_of_00032.arrow 02/08/2024 17:11:11 - INFO - datasets.arrow_dataset - Process #5 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00005_of_00032.arrow Process #6 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00006_of_00032.arrow 02/08/2024 17:11:11 - INFO - datasets.arrow_dataset - Process #6 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00006_of_00032.arrow Process #7 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00007_of_00032.arrow 02/08/2024 17:11:11 - INFO - datasets.arrow_dataset - Process #7 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00007_of_00032.arrow Process #8 will write at 
/apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00008_of_00032.arrow 02/08/2024 17:11:11 - INFO - datasets.arrow_dataset - Process #8 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00008_of_00032.arrow Process #9 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00009_of_00032.arrow 02/08/2024 17:11:11 - INFO - datasets.arrow_dataset - Process #9 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00009_of_00032.arrow Process #10 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00010_of_00032.arrow 02/08/2024 17:11:11 - INFO - datasets.arrow_dataset - Process #10 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00010_of_00032.arrow Process #11 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00011_of_00032.arrow 02/08/2024 17:11:11 - INFO - datasets.arrow_dataset - Process #11 will write at 
/apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00011_of_00032.arrow Process #12 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00012_of_00032.arrow 02/08/2024 17:11:11 - INFO - datasets.arrow_dataset - Process #12 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00012_of_00032.arrow Process #13 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00013_of_00032.arrow 02/08/2024 17:11:11 - INFO - datasets.arrow_dataset - Process #13 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00013_of_00032.arrow Process #14 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00014_of_00032.arrow 02/08/2024 17:11:11 - INFO - datasets.arrow_dataset - Process #14 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00014_of_00032.arrow Process #15 will write at 
/apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00015_of_00032.arrow 02/08/2024 17:11:11 - INFO - datasets.arrow_dataset - Process #15 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00015_of_00032.arrow Process #16 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00016_of_00032.arrow 02/08/2024 17:11:11 - INFO - datasets.arrow_dataset - Process #16 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00016_of_00032.arrow Process #17 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00017_of_00032.arrow 02/08/2024 17:11:11 - INFO - datasets.arrow_dataset - Process #17 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00017_of_00032.arrow Process #18 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00018_of_00032.arrow 02/08/2024 17:11:11 - INFO - datasets.arrow_dataset - Process #18 will write at 
/apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00018_of_00032.arrow Process #19 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00019_of_00032.arrow 02/08/2024 17:11:11 - INFO - datasets.arrow_dataset - Process #19 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00019_of_00032.arrow Process #20 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00020_of_00032.arrow 02/08/2024 17:11:11 - INFO - datasets.arrow_dataset - Process #20 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00020_of_00032.arrow Process #21 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00021_of_00032.arrow 02/08/2024 17:11:11 - INFO - datasets.arrow_dataset - Process #21 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00021_of_00032.arrow Process #22 will write at 
/apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00022_of_00032.arrow 02/08/2024 17:11:11 - INFO - datasets.arrow_dataset - Process #22 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00022_of_00032.arrow Process #23 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00023_of_00032.arrow 02/08/2024 17:11:11 - INFO - datasets.arrow_dataset - Process #23 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00023_of_00032.arrow Process #24 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00024_of_00032.arrow 02/08/2024 17:11:11 - INFO - datasets.arrow_dataset - Process #24 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00024_of_00032.arrow Process #25 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00025_of_00032.arrow 02/08/2024 17:11:11 - INFO - datasets.arrow_dataset - Process #25 will write at 
/apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00025_of_00032.arrow Process #26 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00026_of_00032.arrow 02/08/2024 17:11:11 - INFO - datasets.arrow_dataset - Process #26 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00026_of_00032.arrow Process #27 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00027_of_00032.arrow 02/08/2024 17:11:11 - INFO - datasets.arrow_dataset - Process #27 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00027_of_00032.arrow Process #28 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00028_of_00032.arrow 02/08/2024 17:11:11 - INFO - datasets.arrow_dataset - Process #28 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00028_of_00032.arrow Process #29 will write at 
/apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00029_of_00032.arrow 02/08/2024 17:11:11 - INFO - datasets.arrow_dataset - Process #29 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00029_of_00032.arrow Process #30 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00030_of_00032.arrow 02/08/2024 17:11:11 - INFO - datasets.arrow_dataset - Process #30 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00030_of_00032.arrow Process #31 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00031_of_00032.arrow 02/08/2024 17:11:11 - INFO - datasets.arrow_dataset - Process #31 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_00031_of_00032.arrow Loading cached processed dataset at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_*_of_00032.arrow 02/08/2024 17:11:11 - INFO - datasets.arrow_dataset - Loading cached processed dataset at 
/apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-575756e50a2ee72d_*_of_00032.arrow Concatenating 32 shards 02/08/2024 17:11:11 - INFO - datasets.arrow_dataset - Concatenating 32 shards num_proc must be <= 10. Reducing num_proc to 10 for dataset of size 10. 02/08/2024 17:11:11 - WARNING - datasets.arrow_dataset - num_proc must be <= 10. Reducing num_proc to 10 for dataset of size 10. Process #0 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-8afdfe033b943cb9_00000_of_00010.arrow 02/08/2024 17:11:11 - INFO - datasets.arrow_dataset - Process #0 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-8afdfe033b943cb9_00000_of_00010.arrow Process #1 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-8afdfe033b943cb9_00001_of_00010.arrow 02/08/2024 17:11:11 - INFO - datasets.arrow_dataset - Process #1 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-8afdfe033b943cb9_00001_of_00010.arrow Process #2 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-8afdfe033b943cb9_00002_of_00010.arrow 02/08/2024 17:11:11 - INFO - datasets.arrow_dataset - Process #2 will write at 
/apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-8afdfe033b943cb9_00002_of_00010.arrow Process #3 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-8afdfe033b943cb9_00003_of_00010.arrow 02/08/2024 17:11:11 - INFO - datasets.arrow_dataset - Process #3 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-8afdfe033b943cb9_00003_of_00010.arrow Process #4 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-8afdfe033b943cb9_00004_of_00010.arrow 02/08/2024 17:11:11 - INFO - datasets.arrow_dataset - Process #4 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-8afdfe033b943cb9_00004_of_00010.arrow Process #5 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-8afdfe033b943cb9_00005_of_00010.arrow 02/08/2024 17:11:11 - INFO - datasets.arrow_dataset - Process #5 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-8afdfe033b943cb9_00005_of_00010.arrow Process #6 will write at 
/apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-8afdfe033b943cb9_00006_of_00010.arrow 02/08/2024 17:11:11 - INFO - datasets.arrow_dataset - Process #6 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-8afdfe033b943cb9_00006_of_00010.arrow Process #7 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-8afdfe033b943cb9_00007_of_00010.arrow 02/08/2024 17:11:11 - INFO - datasets.arrow_dataset - Process #7 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-8afdfe033b943cb9_00007_of_00010.arrow Process #8 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-8afdfe033b943cb9_00008_of_00010.arrow 02/08/2024 17:11:11 - INFO - datasets.arrow_dataset - Process #8 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-8afdfe033b943cb9_00008_of_00010.arrow Process #9 will write at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-8afdfe033b943cb9_00009_of_00010.arrow 02/08/2024 17:11:11 - INFO - datasets.arrow_dataset - Process #9 will write at 
/apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-8afdfe033b943cb9_00009_of_00010.arrow Loading cached processed dataset at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-8afdfe033b943cb9_*_of_00010.arrow 02/08/2024 17:11:11 - INFO - datasets.arrow_dataset - Loading cached processed dataset at /apdcephfs_qy3/share_733425/vinnylywang/jianhuipang_qy3/hf_cache/datasets/json/default-c67d78b39e072232/0.0.0/8bb11242116d547c741b2e8a1f18598ffdd40a1d4f2a2872c7a28b697434bc96/cache-8afdfe033b943cb9_*_of_00010.arrow Concatenating 10 shards 02/08/2024 17:11:11 - INFO - datasets.arrow_dataset - Concatenating 10 shards 02/08/2024 17:11:11 - INFO - __main__ - xxx: Showcase the tokenized training samples. {'input_ids': [1, 13866, 338, 385, 15278, 393, 16612, 263, 3414, 29889, 14350, 263, 2933, 393, 7128, 2486, 1614, 2167, 278, 2009, 29889, 32001, 835, 2799, 4080, 29901, 13, 29954, 573, 2211, 25562, 363, 7952, 292, 9045, 29891, 29889, 32001, 835, 13291, 29901, 29896, 29889, 32001, 382, 271, 263, 6411, 8362, 322, 18254, 768, 2738, 652, 300, 29901, 8561, 1854, 596, 592, 1338, 526, 20978, 573, 310, 263, 12875, 310, 285, 21211, 322, 18655, 1849, 29892, 20793, 26823, 29892, 3353, 2646, 1144, 29892, 322, 9045, 29891, 285, 1446, 29889, 32001, 910, 6911, 304, 3867, 596, 3573, 411, 278, 18853, 18254, 374, 1237, 304, 740, 472, 967, 1900, 322, 508, 1371, 5557, 17168, 293, 10267, 2129, 29889, 32001, 29871, 29906, 29889, 32001, 2201, 482, 297, 4943, 9128, 6354, 29901, 1222, 6269, 895, 338, 7618, 1455, 363, 7344, 292, 4549, 289, 2873, 29892, 2301, 7799, 29892, 322, 5881, 29875, 586, 6151, 1070, 9045, 29889, 32001, 319, 326, 363, 472, 3203, 29871, 29896, 29945, 29900, 6233, 310, 17768, 403, 14911, 711, 293, 15058, 470, 29871, 29955, 
29945, 6233, 310, 14877, 20657, 15058, 1269, 4723, 29889, 32001, 29871, 29941, 29889, 32001, 3617, 3307, 8709, 29901, 24162, 3307, 11029, 8709, 338, 7618, 1455, 363, 9128, 322, 19119, 1532, 29899, 915, 292, 29889, 32001, 739, 6911, 304, 1072, 5987, 286, 2092, 29892, 11157, 25323, 3321, 740, 29892, 322, 11286, 9045, 29891, 14321, 322, 5198, 1540, 740, 29889, 32001, 319, 326, 363, 29871, 29955, 29899, 29929, 6199, 310, 8709, 1269, 4646, 29889, 2], 'attention_mask': [1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1], 'labels': [-100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, 29896, 29889, 32001, 382, 271, 263, 6411, 8362, 322, 18254, 768, 2738, 652, 300, 29901, 8561, 1854, 596, 592, 1338, 526, 20978, 573, 310, 263, 12875, 310, 285, 21211, 322, 18655, 1849, 29892, 20793, 26823, 29892, 3353, 2646, 1144, 29892, 322, 9045, 29891, 285, 1446, 29889, 32001, 910, 6911, 304, 3867, 596, 3573, 411, 278, 18853, 18254, 374, 1237, 304, 740, 472, 967, 1900, 322, 508, 1371, 5557, 17168, 293, 10267, 2129, 29889, 32001, 29871, 29906, 29889, 32001, 2201, 482, 297, 4943, 9128, 6354, 29901, 1222, 6269, 895, 338, 7618, 1455, 
363, 7344, 292, 4549, 289, 2873, 29892, 2301, 7799, 29892, 322, 5881, 29875, 586, 6151, 1070, 9045, 29889, 32001, 319, 326, 363, 472, 3203, 29871, 29896, 29945, 29900, 6233, 310, 17768, 403, 14911, 711, 293, 15058, 470, 29871, 29955, 29945, 6233, 310, 14877, 20657, 15058, 1269, 4723, 29889, 32001, 29871, 29941, 29889, 32001, 3617, 3307, 8709, 29901, 24162, 3307, 11029, 8709, 338, 7618, 1455, 363, 9128, 322, 19119, 1532, 29899, 915, 292, 29889, 32001, 739, 6911, 304, 1072, 5987, 286, 2092, 29892, 11157, 25323, 3321, 740, 29892, 322, 11286, 9045, 29891, 14321, 322, 5198, 1540, 740, 29889, 32001, 319, 326, 363, 29871, 29955, 29899, 29929, 6199, 310, 8709, 1269, 4646, 29889, 2]} {'input_ids': [1, 13866, 338, 385, 15278, 393, 16612, 263, 3414, 29889, 14350, 263, 2933, 393, 7128, 2486, 1614, 2167, 278, 2009, 29889, 32001, 835, 2799, 4080, 29901, 13, 29954, 573, 2211, 25562, 363, 7952, 292, 9045, 29891, 29889, 32001, 835, 13291, 29901, 29896, 29889, 32001, 382, 271, 263, 6411, 8362, 322, 18254, 768, 2738, 652, 300, 29901, 8561, 1854, 596, 592, 1338, 526, 20978, 573, 310, 263, 12875, 310, 285, 21211, 322, 18655, 1849, 29892, 20793, 26823, 29892, 3353, 2646, 1144, 29892, 322, 9045, 29891, 285, 1446, 29889, 32001, 910, 6911, 304, 3867, 596, 3573, 411, 278, 18853, 18254, 374, 1237, 304, 740, 472, 967, 1900, 322, 508, 1371, 5557, 17168, 293, 10267, 2129, 29889, 32001, 29871, 29906, 29889, 32001, 2201, 482, 297, 4943, 9128, 6354, 29901, 1222, 6269, 895, 338, 7618, 1455, 363, 7344, 292, 4549, 289, 2873, 29892, 2301, 7799, 29892, 322, 5881, 29875, 586, 6151, 1070, 9045, 29889, 32001, 319, 326, 363, 472, 3203, 29871, 29896, 29945, 29900, 6233, 310, 17768, 403, 14911, 711, 293, 15058, 470, 29871, 29955, 29945, 6233, 310, 14877, 20657, 15058, 1269, 4723, 29889, 32001, 29871, 29941, 29889, 32001, 3617, 3307, 8709, 29901, 24162, 3307, 11029, 8709, 338, 7618, 1455, 363, 9128, 322, 19119, 1532, 29899, 915, 292, 29889, 32001, 739, 6911, 304, 1072, 5987, 286, 2092, 29892, 11157, 25323, 
3321, 740, 29892, 322, 11286, 9045, 29891, 14321, 322, 5198, 1540, 740, 29889, 32001, 319, 326, 363, 29871, 29955, 29899, 29929, 6199, 310, 8709, 1269, 4646, 29889, 2], 'attention_mask': [1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1], 'labels': [-100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, 29896, 29889, 32001, 382, 271, 263, 6411, 8362, 322, 18254, 768, 2738, 652, 300, 29901, 8561, 1854, 596, 592, 1338, 526, 20978, 573, 310, 263, 12875, 310, 285, 21211, 322, 18655, 1849, 29892, 20793, 26823, 29892, 3353, 2646, 1144, 29892, 322, 9045, 29891, 285, 1446, 29889, 32001, 910, 6911, 304, 3867, 596, 3573, 411, 278, 18853, 18254, 374, 1237, 304, 740, 472, 967, 1900, 322, 508, 1371, 5557, 17168, 293, 10267, 2129, 29889, 32001, 29871, 29906, 29889, 32001, 2201, 482, 297, 4943, 9128, 6354, 29901, 1222, 6269, 895, 338, 7618, 1455, 363, 7344, 292, 4549, 289, 2873, 29892, 2301, 7799, 29892, 322, 5881, 29875, 586, 6151, 1070, 9045, 29889, 32001, 319, 326, 363, 472, 3203, 29871, 29896, 29945, 29900, 6233, 310, 17768, 403, 14911, 711, 293, 15058, 470, 29871, 29955, 29945, 6233, 310, 14877, 20657, 15058, 1269, 
4723, 29889, 32001, 29871, 29941, 29889, 32001, 3617, 3307, 8709, 29901, 24162, 3307, 11029, 8709, 338, 7618, 1455, 363, 9128, 322, 19119, 1532, 29899, 915, 292, 29889, 32001, 739, 6911, 304, 1072, 5987, 286, 2092, 29892, 11157, 25323, 3321, 740, 29892, 322, 11286, 9045, 29891, 14321, 322, 5198, 1540, 740, 29889, 32001, 319, 326, 363, 29871, 29955, 29899, 29929, 6199, 310, 8709, 1269, 4646, 29889, 2]} {'input_ids': [1, 13866, 338, 385, 15278, 393, 16612, 263, 3414, 29889, 14350, 263, 2933, 393, 7128, 2486, 1614, 2167, 278, 2009, 29889, 32001, 835, 2799, 4080, 29901, 13, 29954, 573, 2211, 25562, 363, 7952, 292, 9045, 29891, 29889, 32001, 835, 13291, 29901, 29896, 29889, 32001, 382, 271, 263, 6411, 8362, 322, 18254, 768, 2738, 652, 300, 29901, 8561, 1854, 596, 592, 1338, 526, 20978, 573, 310, 263, 12875, 310, 285, 21211, 322, 18655, 1849, 29892, 20793, 26823, 29892, 3353, 2646, 1144, 29892, 322, 9045, 29891, 285, 1446, 29889, 32001, 910, 6911, 304, 3867, 596, 3573, 411, 278, 18853, 18254, 374, 1237, 304, 740, 472, 967, 1900, 322, 508, 1371, 5557, 17168, 293, 10267, 2129, 29889, 32001, 29871, 29906, 29889, 32001, 2201, 482, 297, 4943, 9128, 6354, 29901, 1222, 6269, 895, 338, 7618, 1455, 363, 7344, 292, 4549, 289, 2873, 29892, 2301, 7799, 29892, 322, 5881, 29875, 586, 6151, 1070, 9045, 29889, 32001, 319, 326, 363, 472, 3203, 29871, 29896, 29945, 29900, 6233, 310, 17768, 403, 14911, 711, 293, 15058, 470, 29871, 29955, 29945, 6233, 310, 14877, 20657, 15058, 1269, 4723, 29889, 32001, 29871, 29941, 29889, 32001, 3617, 3307, 8709, 29901, 24162, 3307, 11029, 8709, 338, 7618, 1455, 363, 9128, 322, 19119, 1532, 29899, 915, 292, 29889, 32001, 739, 6911, 304, 1072, 5987, 286, 2092, 29892, 11157, 25323, 3321, 740, 29892, 322, 11286, 9045, 29891, 14321, 322, 5198, 1540, 740, 29889, 32001, 319, 326, 363, 29871, 29955, 29899, 29929, 6199, 310, 8709, 1269, 4646, 29889, 2], 'attention_mask': [1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 
1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1], 'labels': [-100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, 29896, 29889, 32001, 382, 271, 263, 6411, 8362, 322, 18254, 768, 2738, 652, 300, 29901, 8561, 1854, 596, 592, 1338, 526, 20978, 573, 310, 263, 12875, 310, 285, 21211, 322, 18655, 1849, 29892, 20793, 26823, 29892, 3353, 2646, 1144, 29892, 322, 9045, 29891, 285, 1446, 29889, 32001, 910, 6911, 304, 3867, 596, 3573, 411, 278, 18853, 18254, 374, 1237, 304, 740, 472, 967, 1900, 322, 508, 1371, 5557, 17168, 293, 10267, 2129, 29889, 32001, 29871, 29906, 29889, 32001, 2201, 482, 297, 4943, 9128, 6354, 29901, 1222, 6269, 895, 338, 7618, 1455, 363, 7344, 292, 4549, 289, 2873, 29892, 2301, 7799, 29892, 322, 5881, 29875, 586, 6151, 1070, 9045, 29889, 32001, 319, 326, 363, 472, 3203, 29871, 29896, 29945, 29900, 6233, 310, 17768, 403, 14911, 711, 293, 15058, 470, 29871, 29955, 29945, 6233, 310, 14877, 20657, 15058, 1269, 4723, 29889, 32001, 29871, 29941, 29889, 32001, 3617, 3307, 8709, 29901, 24162, 3307, 11029, 8709, 338, 7618, 1455, 363, 9128, 322, 19119, 1532, 29899, 915, 292, 29889, 32001, 739, 6911, 304, 1072, 5987, 286, 2092, 29892, 11157, 25323, 3321, 740, 29892, 322, 11286, 9045, 29891, 
14321, 322, 5198, 1540, 740, 29889, 32001, 319, 326, 363, 29871, 29955, 29899, 29929, 6199, 310, 8709, 1269, 4646, 29889, 2]} num_proc must be <= 10. Reducing num_proc to 10 for dataset of size 10. 02/08/2024 17:11:12 - WARNING - datasets.arrow_dataset - num_proc must be <= 10. Reducing num_proc to 10 for dataset of size 10. 
{'input_ids': [1, 13866, 338, 385, 15278, 393, 16612, 263, 3414, 29889, 14350, 263, 2933, 393, 7128, 2486, 1614, 2167, 278, 2009, 29889, 32001, 835, 2799, 4080, 29901, 13, 29954, 573, 2211, 25562, 363, 7952, 292, 9045, 29891, 29889, 32001, 835, 13291, 29901, 29896, 29889, 32001, 382, 271, 263, 6411, 8362, 322, 18254, 768, 2738, 652, 300, 29901, 8561, 1854, 596, 592, 1338, 526, 20978, 573, 310, 263, 12875, 310, 285, 21211, 322, 18655, 1849, 29892, 20793, 26823, 29892, 3353, 2646, 1144, 29892, 322, 9045, 29891, 285, 1446, 29889, 32001, 910, 6911, 304, 3867, 596, 3573, 411, 278, 18853, 18254, 374, 1237, 304, 740, 472, 967, 1900, 322, 508, 1371, 5557, 17168, 293, 10267, 2129, 29889, 32001, 29871, 29906, 29889, 32001, 2201, 482, 297, 4943, 9128, 6354, 29901, 1222, 6269, 895, 338, 7618, 1455, 363, 7344, 292, 4549, 289, 2873, 29892, 2301, 7799, 29892, 322, 5881, 29875, 586, 6151, 1070, 9045, 29889, 32001, 319, 326, 363, 472, 3203, 29871, 29896, 29945, 29900, 6233, 310, 17768, 403, 14911, 711, 293, 15058, 470, 29871, 29955, 29945, 6233, 310, 14877, 20657, 15058, 1269, 4723, 29889, 32001, 29871, 29941, 29889, 32001, 3617, 3307, 8709, 29901, 24162, 3307, 11029, 8709, 338, 7618, 1455, 363, 9128, 322, 19119, 1532, 29899, 915, 292, 29889, 32001, 739, 6911, 304, 1072, 5987, 286, 2092, 29892, 11157, 25323, 3321, 740, 29892, 322, 11286, 9045, 29891, 14321, 322, 5198, 1540, 740, 29889, 32001, 319, 326, 363, 29871, 29955, 29899, 29929, 6199, 310, 8709, 1269, 4646, 29889, 2], 'attention_mask': [1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 
1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1], 'labels': [-100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, 29896, 29889, 32001, 382, 271, 263, 6411, 8362, 322, 18254, 768, 2738, 652, 300, 29901, 8561, 1854, 596, 592, 1338, 526, 20978, 573, 310, 263, 12875, 310, 285, 21211, 322, 18655, 1849, 29892, 20793, 26823, 29892, 3353, 2646, 1144, 29892, 322, 9045, 29891, 285, 1446, 29889, 32001, 910, 6911, 304, 3867, 596, 3573, 411, 278, 18853, 18254, 374, 1237, 304, 740, 472, 967, 1900, 322, 508, 1371, 5557, 17168, 293, 10267, 2129, 29889, 32001, 29871, 29906, 29889, 32001, 2201, 482, 297, 4943, 9128, 6354, 29901, 1222, 6269, 895, 338, 7618, 1455, 363, 7344, 292, 4549, 289, 2873, 29892, 2301, 7799, 29892, 322, 5881, 29875, 586, 6151, 1070, 9045, 29889, 32001, 319, 326, 363, 472, 3203, 29871, 29896, 29945, 29900, 6233, 310, 17768, 403, 14911, 711, 293, 15058, 470, 29871, 29955, 29945, 6233, 310, 14877, 20657, 15058, 1269, 4723, 29889, 32001, 29871, 29941, 29889, 32001, 3617, 3307, 8709, 29901, 24162, 3307, 11029, 8709, 338, 7618, 1455, 363, 9128, 322, 19119, 1532, 29899, 915, 292, 29889, 32001, 739, 6911, 304, 1072, 5987, 286, 2092, 29892, 11157, 25323, 3321, 740, 29892, 322, 11286, 9045, 29891, 14321, 322, 5198, 1540, 740, 29889, 32001, 319, 326, 363, 29871, 29955, 29899, 29929, 6199, 310, 8709, 1269, 4646, 29889, 2]} {'input_ids': [1, 13866, 338, 385, 15278, 393, 16612, 263, 3414, 29889, 14350, 263, 2933, 393, 7128, 2486, 1614, 2167, 278, 2009, 29889, 32001, 835, 2799, 4080, 29901, 13, 29954, 573, 2211, 25562, 363, 7952, 292, 9045, 29891, 29889, 32001, 835, 13291, 29901, 29896, 29889, 32001, 382, 
271, 263, 6411, 8362, 322, 18254, 768, 2738, 652, 300, 29901, 8561, 1854, 596, 592, 1338, 526, 20978, 573, 310, 263, 12875, 310, 285, 21211, 322, 18655, 1849, 29892, 20793, 26823, 29892, 3353, 2646, 1144, 29892, 322, 9045, 29891, 285, 1446, 29889, 32001, 910, 6911, 304, 3867, 596, 3573, 411, 278, 18853, 18254, 374, 1237, 304, 740, 472, 967, 1900, 322, 508, 1371, 5557, 17168, 293, 10267, 2129, 29889, 32001, 29871, 29906, 29889, 32001, 2201, 482, 297, 4943, 9128, 6354, 29901, 1222, 6269, 895, 338, 7618, 1455, 363, 7344, 292, 4549, 289, 2873, 29892, 2301, 7799, 29892, 322, 5881, 29875, 586, 6151, 1070, 9045, 29889, 32001, 319, 326, 363, 472, 3203, 29871, 29896, 29945, 29900, 6233, 310, 17768, 403, 14911, 711, 293, 15058, 470, 29871, 29955, 29945, 6233, 310, 14877, 20657, 15058, 1269, 4723, 29889, 32001, 29871, 29941, 29889, 32001, 3617, 3307, 8709, 29901, 24162, 3307, 11029, 8709, 338, 7618, 1455, 363, 9128, 322, 19119, 1532, 29899, 915, 292, 29889, 32001, 739, 6911, 304, 1072, 5987, 286, 2092, 29892, 11157, 25323, 3321, 740, 29892, 322, 11286, 9045, 29891, 14321, 322, 5198, 1540, 740, 29889, 32001, 319, 326, 363, 29871, 29955, 29899, 29929, 6199, 310, 8709, 1269, 4646, 29889, 2], 'attention_mask': [1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1], 'labels': [-100, -100, -100, -100, -100, -100, 
-100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, 29896, 29889, 32001, 382, 271, 263, 6411, 8362, 322, 18254, 768, 2738, 652, 300, 29901, 8561, 1854, 596, 592, 1338, 526, 20978, 573, 310, 263, 12875, 310, 285, 21211, 322, 18655, 1849, 29892, 20793, 26823, 29892, 3353, 2646, 1144, 29892, 322, 9045, 29891, 285, 1446, 29889, 32001, 910, 6911, 304, 3867, 596, 3573, 411, 278, 18853, 18254, 374, 1237, 304, 740, 472, 967, 1900, 322, 508, 1371, 5557, 17168, 293, 10267, 2129, 29889, 32001, 29871, 29906, 29889, 32001, 2201, 482, 297, 4943, 9128, 6354, 29901, 1222, 6269, 895, 338, 7618, 1455, 363, 7344, 292, 4549, 289, 2873, 29892, 2301, 7799, 29892, 322, 5881, 29875, 586, 6151, 1070, 9045, 29889, 32001, 319, 326, 363, 472, 3203, 29871, 29896, 29945, 29900, 6233, 310, 17768, 403, 14911, 711, 293, 15058, 470, 29871, 29955, 29945, 6233, 310, 14877, 20657, 15058, 1269, 4723, 29889, 32001, 29871, 29941, 29889, 32001, 3617, 3307, 8709, 29901, 24162, 3307, 11029, 8709, 338, 7618, 1455, 363, 9128, 322, 19119, 1532, 29899, 915, 292, 29889, 32001, 739, 6911, 304, 1072, 5987, 286, 2092, 29892, 11157, 25323, 3321, 740, 29892, 322, 11286, 9045, 29891, 14321, 322, 5198, 1540, 740, 29889, 32001, 319, 326, 363, 29871, 29955, 29899, 29929, 6199, 310, 8709, 1269, 4646, 29889, 2]} {'input_ids': [1, 13866, 338, 385, 15278, 393, 16612, 263, 3414, 29889, 14350, 263, 2933, 393, 7128, 2486, 1614, 2167, 278, 2009, 29889, 32001, 835, 2799, 4080, 29901, 13, 29954, 573, 2211, 25562, 363, 7952, 292, 9045, 29891, 29889, 32001, 835, 13291, 29901, 29896, 29889, 32001, 382, 271, 263, 6411, 8362, 322, 18254, 768, 2738, 652, 300, 29901, 8561, 1854, 596, 592, 1338, 526, 20978, 573, 310, 263, 12875, 310, 285, 21211, 322, 18655, 1849, 29892, 20793, 26823, 29892, 3353, 2646, 1144, 29892, 322, 9045, 29891, 285, 1446, 29889, 32001, 910, 6911, 304, 3867, 596, 
3573, 411, 278, 18853, 18254, 374, 1237, 304, 740, 472, 967, 1900, 322, 508, 1371, 5557, 17168, 293, 10267, 2129, 29889, 32001, 29871, 29906, 29889, 32001, 2201, 482, 297, 4943, 9128, 6354, 29901, 1222, 6269, 895, 338, 7618, 1455, 363, 7344, 292, 4549, 289, 2873, 29892, 2301, 7799, 29892, 322, 5881, 29875, 586, 6151, 1070, 9045, 29889, 32001, 319, 326, 363, 472, 3203, 29871, 29896, 29945, 29900, 6233, 310, 17768, 403, 14911, 711, 293, 15058, 470, 29871, 29955, 29945, 6233, 310, 14877, 20657, 15058, 1269, 4723, 29889, 32001, 29871, 29941, 29889, 32001, 3617, 3307, 8709, 29901, 24162, 3307, 11029, 8709, 338, 7618, 1455, 363, 9128, 322, 19119, 1532, 29899, 915, 292, 29889, 32001, 739, 6911, 304, 1072, 5987, 286, 2092, 29892, 11157, 25323, 3321, 740, 29892, 322, 11286, 9045, 29891, 14321, 322, 5198, 1540, 740, 29889, 32001, 319, 326, 363, 29871, 29955, 29899, 29929, 6199, 310, 8709, 1269, 4646, 29889, 2], 'attention_mask': [1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1], 'labels': [-100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, 29896, 29889, 32001, 382, 271, 263, 6411, 8362, 322, 18254, 768, 2738, 652, 
300, 29901, 8561, 1854, 596, 592, 1338, 526, 20978, 573, 310, 263, 12875, 310, 285, 21211, 322, 18655, 1849, 29892, 20793, 26823, 29892, 3353, 2646, 1144, 29892, 322, 9045, 29891, 285, 1446, 29889, 32001, 910, 6911, 304, 3867, 596, 3573, 411, 278, 18853, 18254, 374, 1237, 304, 740, 472, 967, 1900, 322, 508, 1371, 5557, 17168, 293, 10267, 2129, 29889, 32001, 29871, 29906, 29889, 32001, 2201, 482, 297, 4943, 9128, 6354, 29901, 1222, 6269, 895, 338, 7618, 1455, 363, 7344, 292, 4549, 289, 2873, 29892, 2301, 7799, 29892, 322, 5881, 29875, 586, 6151, 1070, 9045, 29889, 32001, 319, 326, 363, 472, 3203, 29871, 29896, 29945, 29900, 6233, 310, 17768, 403, 14911, 711, 293, 15058, 470, 29871, 29955, 29945, 6233, 310, 14877, 20657, 15058, 1269, 4723, 29889, 32001, 29871, 29941, 29889, 32001, 3617, 3307, 8709, 29901, 24162, 3307, 11029, 8709, 338, 7618, 1455, 363, 9128, 322, 19119, 1532, 29899, 915, 292, 29889, 32001, 739, 6911, 304, 1072, 5987, 286, 2092, 29892, 11157, 25323, 3321, 740, 29892, 322, 11286, 9045, 29891, 14321, 322, 5198, 1540, 740, 29889, 32001, 319, 326, 363, 29871, 29955, 29899, 29929, 6199, 310, 8709, 1269, 4646, 29889, 2]} {'input_ids': [1, 13866, 338, 385, 15278, 393, 16612, 263, 3414, 29889, 14350, 263, 2933, 393, 7128, 2486, 1614, 2167, 278, 2009, 29889, 32001, 835, 2799, 4080, 29901, 13, 29954, 573, 2211, 25562, 363, 7952, 292, 9045, 29891, 29889, 32001, 835, 13291, 29901, 29896, 29889, 32001, 382, 271, 263, 6411, 8362, 322, 18254, 768, 2738, 652, 300, 29901, 8561, 1854, 596, 592, 1338, 526, 20978, 573, 310, 263, 12875, 310, 285, 21211, 322, 18655, 1849, 29892, 20793, 26823, 29892, 3353, 2646, 1144, 29892, 322, 9045, 29891, 285, 1446, 29889, 32001, 910, 6911, 304, 3867, 596, 3573, 411, 278, 18853, 18254, 374, 1237, 304, 740, 472, 967, 1900, 322, 508, 1371, 5557, 17168, 293, 10267, 2129, 29889, 32001, 29871, 29906, 29889, 32001, 2201, 482, 297, 4943, 9128, 6354, 29901, 1222, 6269, 895, 338, 7618, 1455, 363, 7344, 292, 4549, 289, 2873, 29892, 2301, 7799, 
29892, 322, 5881, 29875, 586, 6151, 1070, 9045, 29889, 32001, 319, 326, 363, 472, 3203, 29871, 29896, 29945, 29900, 6233, 310, 17768, 403, 14911, 711, 293, 15058, 470, 29871, 29955, 29945, 6233, 310, 14877, 20657, 15058, 1269, 4723, 29889, 32001, 29871, 29941, 29889, 32001, 3617, 3307, 8709, 29901, 24162, 3307, 11029, 8709, 338, 7618, 1455, 363, 9128, 322, 19119, 1532, 29899, 915, 292, 29889, 32001, 739, 6911, 304, 1072, 5987, 286, 2092, 29892, 11157, 25323, 3321, 740, 29892, 322, 11286, 9045, 29891, 14321, 322, 5198, 1540, 740, 29889, 32001, 319, 326, 363, 29871, 29955, 29899, 29929, 6199, 310, 8709, 1269, 4646, 29889, 2], 'attention_mask': [1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1], 'labels': [-100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, 29896, 29889, 32001, 382, 271, 263, 6411, 8362, 322, 18254, 768, 2738, 652, 300, 29901, 8561, 1854, 596, 592, 1338, 526, 20978, 573, 310, 263, 12875, 310, 285, 21211, 322, 18655, 1849, 29892, 20793, 26823, 29892, 3353, 2646, 1144, 29892, 322, 9045, 29891, 285, 1446, 29889, 32001, 910, 6911, 304, 3867, 596, 3573, 411, 278, 18853, 18254, 374, 1237, 304, 740, 
472, 967, 1900, 322, 508, 1371, 5557, 17168, 293, 10267, 2129, 29889, 32001, 29871, 29906, 29889, 32001, 2201, 482, 297, 4943, 9128, 6354, 29901, 1222, 6269, 895, 338, 7618, 1455, 363, 7344, 292, 4549, 289, 2873, 29892, 2301, 7799, 29892, 322, 5881, 29875, 586, 6151, 1070, 9045, 29889, 32001, 319, 326, 363, 472, 3203, 29871, 29896, 29945, 29900, 6233, 310, 17768, 403, 14911, 711, 293, 15058, 470, 29871, 29955, 29945, 6233, 310, 14877, 20657, 15058, 1269, 4723, 29889, 32001, 29871, 29941, 29889, 32001, 3617, 3307, 8709, 29901, 24162, 3307, 11029, 8709, 338, 7618, 1455, 363, 9128, 322, 19119, 1532, 29899, 915, 292, 29889, 32001, 739, 6911, 304, 1072, 5987, 286, 2092, 29892, 11157, 25323, 3321, 740, 29892, 322, 11286, 9045, 29891, 14321, 322, 5198, 1540, 740, 29889, 32001, 319, 326, 363, 29871, 29955, 29899, 29929, 6199, 310, 8709, 1269, 4646, 29889, 2]} {'input_ids': [1, 13866, 338, 385, 15278, 393, 16612, 263, 3414, 29889, 14350, 263, 2933, 393, 7128, 2486, 1614, 2167, 278, 2009, 29889, 32001, 835, 2799, 4080, 29901, 13, 29954, 573, 2211, 25562, 363, 7952, 292, 9045, 29891, 29889, 32001, 835, 13291, 29901, 29896, 29889, 32001, 382, 271, 263, 6411, 8362, 322, 18254, 768, 2738, 652, 300, 29901, 8561, 1854, 596, 592, 1338, 526, 20978, 573, 310, 263, 12875, 310, 285, 21211, 322, 18655, 1849, 29892, 20793, 26823, 29892, 3353, 2646, 1144, 29892, 322, 9045, 29891, 285, 1446, 29889, 32001, 910, 6911, 304, 3867, 596, 3573, 411, 278, 18853, 18254, 374, 1237, 304, 740, 472, 967, 1900, 322, 508, 1371, 5557, 17168, 293, 10267, 2129, 29889, 32001, 29871, 29906, 29889, 32001, 2201, 482, 297, 4943, 9128, 6354, 29901, 1222, 6269, 895, 338, 7618, 1455, 363, 7344, 292, 4549, 289, 2873, 29892, 2301, 7799, 29892, 322, 5881, 29875, 586, 6151, 1070, 9045, 29889, 32001, 319, 326, 363, 472, 3203, 29871, 29896, 29945, 29900, 6233, 310, 17768, 403, 14911, 711, 293, 15058, 470, 29871, 29955, 29945, 6233, 310, 14877, 20657, 15058, 1269, 4723, 29889, 32001, 29871, 29941, 29889, 32001, 3617, 
3307, 8709, 29901, 24162, 3307, 11029, 8709, 338, 7618, 1455, 363, 9128, 322, 19119, 1532, 29899, 915, 292, 29889, 32001, 739, 6911, 304, 1072, 5987, 286, 2092, 29892, 11157, 25323, 3321, 740, 29892, 322, 11286, 9045, 29891, 14321, 322, 5198, 1540, 740, 29889, 32001, 319, 326, 363, 29871, 29955, 29899, 29929, 6199, 310, 8709, 1269, 4646, 29889, 2], 'attention_mask': [1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1], 'labels': [-100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, 29896, 29889, 32001, 382, 271, 263, 6411, 8362, 322, 18254, 768, 2738, 652, 300, 29901, 8561, 1854, 596, 592, 1338, 526, 20978, 573, 310, 263, 12875, 310, 285, 21211, 322, 18655, 1849, 29892, 20793, 26823, 29892, 3353, 2646, 1144, 29892, 322, 9045, 29891, 285, 1446, 29889, 32001, 910, 6911, 304, 3867, 596, 3573, 411, 278, 18853, 18254, 374, 1237, 304, 740, 472, 967, 1900, 322, 508, 1371, 5557, 17168, 293, 10267, 2129, 29889, 32001, 29871, 29906, 29889, 32001, 2201, 482, 297, 4943, 9128, 6354, 29901, 1222, 6269, 895, 338, 7618, 1455, 363, 7344, 292, 4549, 289, 2873, 29892, 2301, 7799, 29892, 322, 5881, 29875, 586, 6151, 1070, 9045, 
29889, 32001, 319, 326, 363, 472, 3203, 29871, 29896, 29945, 29900, 6233, 310, 17768, 403, 14911, 711, 293, 15058, 470, 29871, 29955, 29945, 6233, 310, 14877, 20657, 15058, 1269, 4723, 29889, 32001, 29871, 29941, 29889, 32001, 3617, 3307, 8709, 29901, 24162, 3307, 11029, 8709, 338, 7618, 1455, 363, 9128, 322, 19119, 1532, 29899, 915, 292, 29889, 32001, 739, 6911, 304, 1072, 5987, 286, 2092, 29892, 11157, 25323, 3321, 740, 29892, 322, 11286, 9045, 29891, 14321, 322, 5198, 1540, 740, 29889, 32001, 319, 326, 363, 29871, 29955, 29899, 29929, 6199, 310, 8709, 1269, 4646, 29889, 2]} {'input_ids': [1, 13866, 338, 385, 15278, 393, 16612, 263, 3414, 29889, 14350, 263, 2933, 393, 7128, 2486, 1614, 2167, 278, 2009, 29889, 32001, 835, 2799, 4080, 29901, 13, 29954, 573, 2211, 25562, 363, 7952, 292, 9045, 29891, 29889, 32001, 835, 13291, 29901, 29896, 29889, 32001, 382, 271, 263, 6411, 8362, 322, 18254, 768, 2738, 652, 300, 29901, 8561, 1854, 596, 592, 1338, 526, 20978, 573, 310, 263, 12875, 310, 285, 21211, 322, 18655, 1849, 29892, 20793, 26823, 29892, 3353, 2646, 1144, 29892, 322, 9045, 29891, 285, 1446, 29889, 32001, 910, 6911, 304, 3867, 596, 3573, 411, 278, 18853, 18254, 374, 1237, 304, 740, 472, 967, 1900, 322, 508, 1371, 5557, 17168, 293, 10267, 2129, 29889, 32001, 29871, 29906, 29889, 32001, 2201, 482, 297, 4943, 9128, 6354, 29901, 1222, 6269, 895, 338, 7618, 1455, 363, 7344, 292, 4549, 289, 2873, 29892, 2301, 7799, 29892, 322, 5881, 29875, 586, 6151, 1070, 9045, 29889, 32001, 319, 326, 363, 472, 3203, 29871, 29896, 29945, 29900, 6233, 310, 17768, 403, 14911, 711, 293, 15058, 470, 29871, 29955, 29945, 6233, 310, 14877, 20657, 15058, 1269, 4723, 29889, 32001, 29871, 29941, 29889, 32001, 3617, 3307, 8709, 29901, 24162, 3307, 11029, 8709, 338, 7618, 1455, 363, 9128, 322, 19119, 1532, 29899, 915, 292, 29889, 32001, 739, 6911, 304, 1072, 5987, 286, 2092, 29892, 11157, 25323, 3321, 740, 29892, 322, 11286, 9045, 29891, 14321, 322, 5198, 1540, 740, 29889, 32001, 319, 326, 363, 
29871, 29955, 29899, 29929, 6199, 310, 8709, 1269, 4646, 29889, 2], 'attention_mask': [1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1], 'labels': [-100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, 29896, 29889, 32001, 382, 271, 263, 6411, 8362, 322, 18254, 768, 2738, 652, 300, 29901, 8561, 1854, 596, 592, 1338, 526, 20978, 573, 310, 263, 12875, 310, 285, 21211, 322, 18655, 1849, 29892, 20793, 26823, 29892, 3353, 2646, 1144, 29892, 322, 9045, 29891, 285, 1446, 29889, 32001, 910, 6911, 304, 3867, 596, 3573, 411, 278, 18853, 18254, 374, 1237, 304, 740, 472, 967, 1900, 322, 508, 1371, 5557, 17168, 293, 10267, 2129, 29889, 32001, 29871, 29906, 29889, 32001, 2201, 482, 297, 4943, 9128, 6354, 29901, 1222, 6269, 895, 338, 7618, 1455, 363, 7344, 292, 4549, 289, 2873, 29892, 2301, 7799, 29892, 322, 5881, 29875, 586, 6151, 1070, 9045, 29889, 32001, 319, 326, 363, 472, 3203, 29871, 29896, 29945, 29900, 6233, 310, 17768, 403, 14911, 711, 293, 15058, 470, 29871, 29955, 29945, 6233, 310, 14877, 20657, 15058, 1269, 4723, 29889, 32001, 29871, 29941, 29889, 32001, 3617, 3307, 8709, 29901, 24162, 3307, 11029, 8709, 338, 
7618, 1455, 363, 9128, 322, 19119, 1532, 29899, 915, 292, 29889, 32001, 739, 6911, 304, 1072, 5987, 286, 2092, 29892, 11157, 25323, 3321, 740, 29892, 322, 11286, 9045, 29891, 14321, 322, 5198, 1540, 740, 29889, 32001, 319, 326, 363, 29871, 29955, 29899, 29929, 6199, 310, 8709, 1269, 4646, 29889, 2]} {'input_ids': [1, 13866, 338, 385, 15278, 393, 16612, 263, 3414, 29889, 14350, 263, 2933, 393, 7128, 2486, 1614, 2167, 278, 2009, 29889, 32001, 835, 2799, 4080, 29901, 13, 29954, 573, 2211, 25562, 363, 7952, 292, 9045, 29891, 29889, 32001, 835, 13291, 29901, 29896, 29889, 32001, 382, 271, 263, 6411, 8362, 322, 18254, 768, 2738, 652, 300, 29901, 8561, 1854, 596, 592, 1338, 526, 20978, 573, 310, 263, 12875, 310, 285, 21211, 322, 18655, 1849, 29892, 20793, 26823, 29892, 3353, 2646, 1144, 29892, 322, 9045, 29891, 285, 1446, 29889, 32001, 910, 6911, 304, 3867, 596, 3573, 411, 278, 18853, 18254, 374, 1237, 304, 740, 472, 967, 1900, 322, 508, 1371, 5557, 17168, 293, 10267, 2129, 29889, 32001, 29871, 29906, 29889, 32001, 2201, 482, 297, 4943, 9128, 6354, 29901, 1222, 6269, 895, 338, 7618, 1455, 363, 7344, 292, 4549, 289, 2873, 29892, 2301, 7799, 29892, 322, 5881, 29875, 586, 6151, 1070, 9045, 29889, 32001, 319, 326, 363, 472, 3203, 29871, 29896, 29945, 29900, 6233, 310, 17768, 403, 14911, 711, 293, 15058, 470, 29871, 29955, 29945, 6233, 310, 14877, 20657, 15058, 1269, 4723, 29889, 32001, 29871, 29941, 29889, 32001, 3617, 3307, 8709, 29901, 24162, 3307, 11029, 8709, 338, 7618, 1455, 363, 9128, 322, 19119, 1532, 29899, 915, 292, 29889, 32001, 739, 6911, 304, 1072, 5987, 286, 2092, 29892, 11157, 25323, 3321, 740, 29892, 322, 11286, 9045, 29891, 14321, 322, 5198, 1540, 740, 29889, 32001, 319, 326, 363, 29871, 29955, 29899, 29929, 6199, 310, 8709, 1269, 4646, 29889, 2], 'attention_mask': [1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 
1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1], 'labels': [-100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, 29896, 29889, 32001, 382, 271, 263, 6411, 8362, 322, 18254, 768, 2738, 652, 300, 29901, 8561, 1854, 596, 592, 1338, 526, 20978, 573, 310, 263, 12875, 310, 285, 21211, 322, 18655, 1849, 29892, 20793, 26823, 29892, 3353, 2646, 1144, 29892, 322, 9045, 29891, 285, 1446, 29889, 32001, 910, 6911, 304, 3867, 596, 3573, 411, 278, 18853, 18254, 374, 1237, 304, 740, 472, 967, 1900, 322, 508, 1371, 5557, 17168, 293, 10267, 2129, 29889, 32001, 29871, 29906, 29889, 32001, 2201, 482, 297, 4943, 9128, 6354, 29901, 1222, 6269, 895, 338, 7618, 1455, 363, 7344, 292, 4549, 289, 2873, 29892, 2301, 7799, 29892, 322, 5881, 29875, 586, 6151, 1070, 9045, 29889, 32001, 319, 326, 363, 472, 3203, 29871, 29896, 29945, 29900, 6233, 310, 17768, 403, 14911, 711, 293, 15058, 470, 29871, 29955, 29945, 6233, 310, 14877, 20657, 15058, 1269, 4723, 29889, 32001, 29871, 29941, 29889, 32001, 3617, 3307, 8709, 29901, 24162, 3307, 11029, 8709, 338, 7618, 1455, 363, 9128, 322, 19119, 1532, 29899, 915, 292, 29889, 32001, 739, 6911, 304, 1072, 5987, 286, 2092, 29892, 11157, 25323, 3321, 740, 29892, 322, 11286, 9045, 29891, 14321, 322, 5198, 1540, 740, 29889, 32001, 319, 326, 363, 29871, 29955, 29899, 29929, 6199, 310, 8709, 1269, 
4646, 29889, 2]} {'input_ids': [1, 13866, 338, 385, 15278, 393, 16612, 263, 3414, 29889, 14350, 263, 2933, 393, 7128, 2486, 1614, 2167, 278, 2009, 29889, 32001, 835, 2799, 4080, 29901, 13, 29954, 573, 2211, 25562, 363, 7952, 292, 9045, 29891, 29889, 32001, 835, 13291, 29901, 29896, 29889, 32001, 382, 271, 263, 6411, 8362, 322, 18254, 768, 2738, 652, 300, 29901, 8561, 1854, 596, 592, 1338, 526, 20978, 573, 310, 263, 12875, 310, 285, 21211, 322, 18655, 1849, 29892, 20793, 26823, 29892, 3353, 2646, 1144, 29892, 322, 9045, 29891, 285, 1446, 29889, 32001, 910, 6911, 304, 3867, 596, 3573, 411, 278, 18853, 18254, 374, 1237, 304, 740, 472, 967, 1900, 322, 508, 1371, 5557, 17168, 293, 10267, 2129, 29889, 32001, 29871, 29906, 29889, 32001, 2201, 482, 297, 4943, 9128, 6354, 29901, 1222, 6269, 895, 338, 7618, 1455, 363, 7344, 292, 4549, 289, 2873, 29892, 2301, 7799, 29892, 322, 5881, 29875, 586, 6151, 1070, 9045, 29889, 32001, 319, 326, 363, 472, 3203, 29871, 29896, 29945, 29900, 6233, 310, 17768, 403, 14911, 711, 293, 15058, 470, 29871, 29955, 29945, 6233, 310, 14877, 20657, 15058, 1269, 4723, 29889, 32001, 29871, 29941, 29889, 32001, 3617, 3307, 8709, 29901, 24162, 3307, 11029, 8709, 338, 7618, 1455, 363, 9128, 322, 19119, 1532, 29899, 915, 292, 29889, 32001, 739, 6911, 304, 1072, 5987, 286, 2092, 29892, 11157, 25323, 3321, 740, 29892, 322, 11286, 9045, 29891, 14321, 322, 5198, 1540, 740, 29889, 32001, 319, 326, 363, 29871, 29955, 29899, 29929, 6199, 310, 8709, 1269, 4646, 29889, 2], 'attention_mask': [1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 
1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1], 'labels': [-100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, 29896, 29889, 32001, 382, 271, 263, 6411, 8362, 322, 18254, 768, 2738, 652, 300, 29901, 8561, 1854, 596, 592, 1338, 526, 20978, 573, 310, 263, 12875, 310, 285, 21211, 322, 18655, 1849, 29892, 20793, 26823, 29892, 3353, 2646, 1144, 29892, 322, 9045, 29891, 285, 1446, 29889, 32001, 910, 6911, 304, 3867, 596, 3573, 411, 278, 18853, 18254, 374, 1237, 304, 740, 472, 967, 1900, 322, 508, 1371, 5557, 17168, 293, 10267, 2129, 29889, 32001, 29871, 29906, 29889, 32001, 2201, 482, 297, 4943, 9128, 6354, 29901, 1222, 6269, 895, 338, 7618, 1455, 363, 7344, 292, 4549, 289, 2873, 29892, 2301, 7799, 29892, 322, 5881, 29875, 586, 6151, 1070, 9045, 29889, 32001, 319, 326, 363, 472, 3203, 29871, 29896, 29945, 29900, 6233, 310, 17768, 403, 14911, 711, 293, 15058, 470, 29871, 29955, 29945, 6233, 310, 14877, 20657, 15058, 1269, 4723, 29889, 32001, 29871, 29941, 29889, 32001, 3617, 3307, 8709, 29901, 24162, 3307, 11029, 8709, 338, 7618, 1455, 363, 9128, 322, 19119, 1532, 29899, 915, 292, 29889, 32001, 739, 6911, 304, 1072, 5987, 286, 2092, 29892, 11157, 25323, 3321, 740, 29892, 322, 11286, 9045, 29891, 14321, 322, 5198, 1540, 740, 29889, 32001, 319, 326, 363, 29871, 29955, 29899, 29929, 6199, 310, 8709, 1269, 4646, 29889, 2]} {'input_ids': [1, 13866, 338, 385, 15278, 393, 16612, 263, 3414, 29889, 14350, 263, 2933, 393, 7128, 2486, 1614, 2167, 278, 2009, 29889, 32001, 835, 2799, 4080, 29901, 13, 29954, 573, 2211, 25562, 363, 7952, 292, 9045, 29891, 29889, 32001, 835, 13291, 29901, 29896, 
29889, 32001, 382, 271, 263, 6411, 8362, 322, 18254, 768, 2738, 652, 300, 29901, 8561, 1854, 596, 592, 1338, 526, 20978, 573, 310, 263, 12875, 310, 285, 21211, 322, 18655, 1849, 29892, 20793, 26823, 29892, 3353, 2646, 1144, 29892, 322, 9045, 29891, 285, 1446, 29889, 32001, 910, 6911, 304, 3867, 596, 3573, 411, 278, 18853, 18254, 374, 1237, 304, 740, 472, 967, 1900, 322, 508, 1371, 5557, 17168, 293, 10267, 2129, 29889, 32001, 29871, 29906, 29889, 32001, 2201, 482, 297, 4943, 9128, 6354, 29901, 1222, 6269, 895, 338, 7618, 1455, 363, 7344, 292, 4549, 289, 2873, 29892, 2301, 7799, 29892, 322, 5881, 29875, 586, 6151, 1070, 9045, 29889, 32001, 319, 326, 363, 472, 3203, 29871, 29896, 29945, 29900, 6233, 310, 17768, 403, 14911, 711, 293, 15058, 470, 29871, 29955, 29945, 6233, 310, 14877, 20657, 15058, 1269, 4723, 29889, 32001, 29871, 29941, 29889, 32001, 3617, 3307, 8709, 29901, 24162, 3307, 11029, 8709, 338, 7618, 1455, 363, 9128, 322, 19119, 1532, 29899, 915, 292, 29889, 32001, 739, 6911, 304, 1072, 5987, 286, 2092, 29892, 11157, 25323, 3321, 740, 29892, 322, 11286, 9045, 29891, 14321, 322, 5198, 1540, 740, 29889, 32001, 319, 326, 363, 29871, 29955, 29899, 29929, 6199, 310, 8709, 1269, 4646, 29889, 2], 'attention_mask': [1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1], 'labels': [-100, -100, -100, 
-100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, 29896, 29889, 32001, 382, 271, 263, 6411, 8362, 322, 18254, 768, 2738, 652, 300, 29901, 8561, 1854, 596, 592, 1338, 526, 20978, 573, 310, 263, 12875, 310, 285, 21211, 322, 18655, 1849, 29892, 20793, 26823, 29892, 3353, 2646, 1144, 29892, 322, 9045, 29891, 285, 1446, 29889, 32001, 910, 6911, 304, 3867, 596, 3573, 411, 278, 18853, 18254, 374, 1237, 304, 740, 472, 967, 1900, 322, 508, 1371, 5557, 17168, 293, 10267, 2129, 29889, 32001, 29871, 29906, 29889, 32001, 2201, 482, 297, 4943, 9128, 6354, 29901, 1222, 6269, 895, 338, 7618, 1455, 363, 7344, 292, 4549, 289, 2873, 29892, 2301, 7799, 29892, 322, 5881, 29875, 586, 6151, 1070, 9045, 29889, 32001, 319, 326, 363, 472, 3203, 29871, 29896, 29945, 29900, 6233, 310, 17768, 403, 14911, 711, 293, 15058, 470, 29871, 29955, 29945, 6233, 310, 14877, 20657, 15058, 1269, 4723, 29889, 32001, 29871, 29941, 29889, 32001, 3617, 3307, 8709, 29901, 24162, 3307, 11029, 8709, 338, 7618, 1455, 363, 9128, 322, 19119, 1532, 29899, 915, 292, 29889, 32001, 739, 6911, 304, 1072, 5987, 286, 2092, 29892, 11157, 25323, 3321, 740, 29892, 322, 11286, 9045, 29891, 14321, 322, 5198, 1540, 740, 29889, 32001, 319, 326, 363, 29871, 29955, 29899, 29929, 6199, 310, 8709, 1269, 4646, 29889, 2]} {'input_ids': [1, 13866, 338, 385, 15278, 393, 16612, 263, 3414, 29889, 14350, 263, 2933, 393, 7128, 2486, 1614, 2167, 278, 2009, 29889, 32001, 835, 2799, 4080, 29901, 13, 29954, 573, 2211, 25562, 363, 7952, 292, 9045, 29891, 29889, 32001, 835, 13291, 29901, 29896, 29889, 32001, 382, 271, 263, 6411, 8362, 322, 18254, 768, 2738, 652, 300, 29901, 8561, 1854, 596, 592, 1338, 526, 20978, 573, 310, 263, 12875, 310, 285, 21211, 322, 18655, 1849, 29892, 20793, 26823, 29892, 3353, 2646, 1144, 29892, 322, 9045, 29891, 285, 1446, 29889, 32001, 910, 
6911, 304, 3867, 596, 3573, 411, 278, 18853, 18254, 374, 1237, 304, 740, 472, 967, 1900, 322, 508, 1371, 5557, 17168, 293, 10267, 2129, 29889, 32001, 29871, 29906, 29889, 32001, 2201, 482, 297, 4943, 9128, 6354, 29901, 1222, 6269, 895, 338, 7618, 1455, 363, 7344, 292, 4549, 289, 2873, 29892, 2301, 7799, 29892, 322, 5881, 29875, 586, 6151, 1070, 9045, 29889, 32001, 319, 326, 363, 472, 3203, 29871, 29896, 29945, 29900, 6233, 310, 17768, 403, 14911, 711, 293, 15058, 470, 29871, 29955, 29945, 6233, 310, 14877, 20657, 15058, 1269, 4723, 29889, 32001, 29871, 29941, 29889, 32001, 3617, 3307, 8709, 29901, 24162, 3307, 11029, 8709, 338, 7618, 1455, 363, 9128, 322, 19119, 1532, 29899, 915, 292, 29889, 32001, 739, 6911, 304, 1072, 5987, 286, 2092, 29892, 11157, 25323, 3321, 740, 29892, 322, 11286, 9045, 29891, 14321, 322, 5198, 1540, 740, 29889, 32001, 319, 326, 363, 29871, 29955, 29899, 29929, 6199, 310, 8709, 1269, 4646, 29889, 2], 'attention_mask': [1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1], 'labels': [-100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, 29896, 29889, 32001, 382, 271, 263, 6411, 8362, 322, 
18254, 768, 2738, 652, 300, 29901, 8561, 1854, 596, 592, 1338, 526, 20978, 573, 310, 263, 12875, 310, 285, 21211, 322, 18655, 1849, 29892, 20793, 26823, 29892, 3353, 2646, 1144, 29892, 322, 9045, 29891, 285, 1446, 29889, 32001, 910, 6911, 304, 3867, 596, 3573, 411, 278, 18853, 18254, 374, 1237, 304, 740, 472, 967, 1900, 322, 508, 1371, 5557, 17168, 293, 10267, 2129, 29889, 32001, 29871, 29906, 29889, 32001, 2201, 482, 297, 4943, 9128, 6354, 29901, 1222, 6269, 895, 338, 7618, 1455, 363, 7344, 292, 4549, 289, 2873, 29892, 2301, 7799, 29892, 322, 5881, 29875, 586, 6151, 1070, 9045, 29889, 32001, 319, 326, 363, 472, 3203, 29871, 29896, 29945, 29900, 6233, 310, 17768, 403, 14911, 711, 293, 15058, 470, 29871, 29955, 29945, 6233, 310, 14877, 20657, 15058, 1269, 4723, 29889, 32001, 29871, 29941, 29889, 32001, 3617, 3307, 8709, 29901, 24162, 3307, 11029, 8709, 338, 7618, 1455, 363, 9128, 322, 19119, 1532, 29899, 915, 292, 29889, 32001, 739, 6911, 304, 1072, 5987, 286, 2092, 29892, 11157, 25323, 3321, 740, 29892, 322, 11286, 9045, 29891, 14321, 322, 5198, 1540, 740, 29889, 32001, 319, 326, 363, 29871, 29955, 29899, 29929, 6199, 310, 8709, 1269, 4646, 29889, 2]} {'input_ids': [1, 13866, 338, 385, 15278, 393, 16612, 263, 3414, 29889, 14350, 263, 2933, 393, 7128, 2486, 1614, 2167, 278, 2009, 29889, 32001, 835, 2799, 4080, 29901, 13, 29954, 573, 2211, 25562, 363, 7952, 292, 9045, 29891, 29889, 32001, 835, 13291, 29901, 29896, 29889, 32001, 382, 271, 263, 6411, 8362, 322, 18254, 768, 2738, 652, 300, 29901, 8561, 1854, 596, 592, 1338, 526, 20978, 573, 310, 263, 12875, 310, 285, 21211, 322, 18655, 1849, 29892, 20793, 26823, 29892, 3353, 2646, 1144, 29892, 322, 9045, 29891, 285, 1446, 29889, 32001, 910, 6911, 304, 3867, 596, 3573, 411, 278, 18853, 18254, 374, 1237, 304, 740, 472, 967, 1900, 322, 508, 1371, 5557, 17168, 293, 10267, 2129, 29889, 32001, 29871, 29906, 29889, 32001, 2201, 482, 297, 4943, 9128, 6354, 29901, 1222, 6269, 895, 338, 7618, 1455, 363, 7344, 292, 4549, 289, 
2873, 29892, 2301, 7799, 29892, 322, 5881, 29875, 586, 6151, 1070, 9045, 29889, 32001, 319, 326, 363, 472, 3203, 29871, 29896, 29945, 29900, 6233, 310, 17768, 403, 14911, 711, 293, 15058, 470, 29871, 29955, 29945, 6233, 310, 14877, 20657, 15058, 1269, 4723, 29889, 32001, 29871, 29941, 29889, 32001, 3617, 3307, 8709, 29901, 24162, 3307, 11029, 8709, 338, 7618, 1455, 363, 9128, 322, 19119, 1532, 29899, 915, 292, 29889, 32001, 739, 6911, 304, 1072, 5987, 286, 2092, 29892, 11157, 25323, 3321, 740, 29892, 322, 11286, 9045, 29891, 14321, 322, 5198, 1540, 740, 29889, 32001, 319, 326, 363, 29871, 29955, 29899, 29929, 6199, 310, 8709, 1269, 4646, 29889, 2], 'attention_mask': [1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1], 'labels': [-100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, 29896, 29889, 32001, 382, 271, 263, 6411, 8362, 322, 18254, 768, 2738, 652, 300, 29901, 8561, 1854, 596, 592, 1338, 526, 20978, 573, 310, 263, 12875, 310, 285, 21211, 322, 18655, 1849, 29892, 20793, 26823, 29892, 3353, 2646, 1144, 29892, 322, 9045, 29891, 285, 1446, 29889, 32001, 910, 6911, 304, 3867, 596, 3573, 411, 278, 18853, 
18254, 374, 1237, 304, 740, 472, 967, 1900, 322, 508, 1371, 5557, 17168, 293, 10267, 2129, 29889, 32001, 29871, 29906, 29889, 32001, 2201, 482, 297, 4943, 9128, 6354, 29901, 1222, 6269, 895, 338, 7618, 1455, 363, 7344, 292, 4549, 289, 2873, 29892, 2301, 7799, 29892, 322, 5881, 29875, 586, 6151, 1070, 9045, 29889, 32001, 319, 326, 363, 472, 3203, 29871, 29896, 29945, 29900, 6233, 310, 17768, 403, 14911, 711, 293, 15058, 470, 29871, 29955, 29945, 6233, 310, 14877, 20657, 15058, 1269, 4723, 29889, 32001, 29871, 29941, 29889, 32001, 3617, 3307, 8709, 29901, 24162, 3307, 11029, 8709, 338, 7618, 1455, 363, 9128, 322, 19119, 1532, 29899, 915, 292, 29889, 32001, 739, 6911, 304, 1072, 5987, 286, 2092, 29892, 11157, 25323, 3321, 740, 29892, 322, 11286, 9045, 29891, 14321, 322, 5198, 1540, 740, 29889, 32001, 319, 326, 363, 29871, 29955, 29899, 29929, 6199, 310, 8709, 1269, 4646, 29889, 2]} {'input_ids': [1, 13866, 338, 385, 15278, 393, 16612, 263, 3414, 29889, 14350, 263, 2933, 393, 7128, 2486, 1614, 2167, 278, 2009, 29889, 32001, 835, 2799, 4080, 29901, 13, 29954, 573, 2211, 25562, 363, 7952, 292, 9045, 29891, 29889, 32001, 835, 13291, 29901, 29896, 29889, 32001, 382, 271, 263, 6411, 8362, 322, 18254, 768, 2738, 652, 300, 29901, 8561, 1854, 596, 592, 1338, 526, 20978, 573, 310, 263, 12875, 310, 285, 21211, 322, 18655, 1849, 29892, 20793, 26823, 29892, 3353, 2646, 1144, 29892, 322, 9045, 29891, 285, 1446, 29889, 32001, 910, 6911, 304, 3867, 596, 3573, 411, 278, 18853, 18254, 374, 1237, 304, 740, 472, 967, 1900, 322, 508, 1371, 5557, 17168, 293, 10267, 2129, 29889, 32001, 29871, 29906, 29889, 32001, 2201, 482, 297, 4943, 9128, 6354, 29901, 1222, 6269, 895, 338, 7618, 1455, 363, 7344, 292, 4549, 289, 2873, 29892, 2301, 7799, 29892, 322, 5881, 29875, 586, 6151, 1070, 9045, 29889, 32001, 319, 326, 363, 472, 3203, 29871, 29896, 29945, 29900, 6233, 310, 17768, 403, 14911, 711, 293, 15058, 470, 29871, 29955, 29945, 6233, 310, 14877, 20657, 15058, 1269, 4723, 29889, 32001, 29871, 
29941, 29889, 32001, 3617, 3307, 8709, 29901, 24162, 3307, 11029, 8709, 338, 7618, 1455, 363, 9128, 322, 19119, 1532, 29899, 915, 292, 29889, 32001, 739, 6911, 304, 1072, 5987, 286, 2092, 29892, 11157, 25323, 3321, 740, 29892, 322, 11286, 9045, 29891, 14321, 322, 5198, 1540, 740, 29889, 32001, 319, 326, 363, 29871, 29955, 29899, 29929, 6199, 310, 8709, 1269, 4646, 29889, 2], 'attention_mask': [1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1], 'labels': [-100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, 29896, 29889, 32001, 382, 271, 263, 6411, 8362, 322, 18254, 768, 2738, 652, 300, 29901, 8561, 1854, 596, 592, 1338, 526, 20978, 573, 310, 263, 12875, 310, 285, 21211, 322, 18655, 1849, 29892, 20793, 26823, 29892, 3353, 2646, 1144, 29892, 322, 9045, 29891, 285, 1446, 29889, 32001, 910, 6911, 304, 3867, 596, 3573, 411, 278, 18853, 18254, 374, 1237, 304, 740, 472, 967, 1900, 322, 508, 1371, 5557, 17168, 293, 10267, 2129, 29889, 32001, 29871, 29906, 29889, 32001, 2201, 482, 297, 4943, 9128, 6354, 29901, 1222, 6269, 895, 338, 7618, 1455, 363, 7344, 292, 4549, 289, 2873, 29892, 2301, 7799, 29892, 322, 5881, 
29875, 586, 6151, 1070, 9045, 29889, 32001, 319, 326, 363, 472, 3203, 29871, 29896, 29945, 29900, 6233, 310, 17768, 403, 14911, 711, 293, 15058, 470, 29871, 29955, 29945, 6233, 310, 14877, 20657, 15058, 1269, 4723, 29889, 32001, 29871, 29941, 29889, 32001, 3617, 3307, 8709, 29901, 24162, 3307, 11029, 8709, 338, 7618, 1455, 363, 9128, 322, 19119, 1532, 29899, 915, 292, 29889, 32001, 739, 6911, 304, 1072, 5987, 286, 2092, 29892, 11157, 25323, 3321, 740, 29892, 322, 11286, 9045, 29891, 14321, 322, 5198, 1540, 740, 29889, 32001, 319, 326, 363, 29871, 29955, 29899, 29929, 6199, 310, 8709, 1269, 4646, 29889, 2]} {'input_ids': [1, 13866, 338, 385, 15278, 393, 16612, 263, 3414, 29889, 14350, 263, 2933, 393, 7128, 2486, 1614, 2167, 278, 2009, 29889, 32001, 835, 2799, 4080, 29901, 13, 29954, 573, 2211, 25562, 363, 7952, 292, 9045, 29891, 29889, 32001, 835, 13291, 29901, 29896, 29889, 32001, 382, 271, 263, 6411, 8362, 322, 18254, 768, 2738, 652, 300, 29901, 8561, 1854, 596, 592, 1338, 526, 20978, 573, 310, 263, 12875, 310, 285, 21211, 322, 18655, 1849, 29892, 20793, 26823, 29892, 3353, 2646, 1144, 29892, 322, 9045, 29891, 285, 1446, 29889, 32001, 910, 6911, 304, 3867, 596, 3573, 411, 278, 18853, 18254, 374, 1237, 304, 740, 472, 967, 1900, 322, 508, 1371, 5557, 17168, 293, 10267, 2129, 29889, 32001, 29871, 29906, 29889, 32001, 2201, 482, 297, 4943, 9128, 6354, 29901, 1222, 6269, 895, 338, 7618, 1455, 363, 7344, 292, 4549, 289, 2873, 29892, 2301, 7799, 29892, 322, 5881, 29875, 586, 6151, 1070, 9045, 29889, 32001, 319, 326, 363, 472, 3203, 29871, 29896, 29945, 29900, 6233, 310, 17768, 403, 14911, 711, 293, 15058, 470, 29871, 29955, 29945, 6233, 310, 14877, 20657, 15058, 1269, 4723, 29889, 32001, 29871, 29941, 29889, 32001, 3617, 3307, 8709, 29901, 24162, 3307, 11029, 8709, 338, 7618, 1455, 363, 9128, 322, 19119, 1532, 29899, 915, 292, 29889, 32001, 739, 6911, 304, 1072, 5987, 286, 2092, 29892, 11157, 25323, 3321, 740, 29892, 322, 11286, 9045, 29891, 14321, 322, 5198, 1540, 740, 
29889, 32001, 319, 326, 363, 29871, 29955, 29899, 29929, 6199, 310, 8709, 1269, 4646, 29889, 2], 'attention_mask': [1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1], 'labels': [-100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, 29896, 29889, 32001, 382, 271, 263, 6411, 8362, 322, 18254, 768, 2738, 652, 300, 29901, 8561, 1854, 596, 592, 1338, 526, 20978, 573, 310, 263, 12875, 310, 285, 21211, 322, 18655, 1849, 29892, 20793, 26823, 29892, 3353, 2646, 1144, 29892, 322, 9045, 29891, 285, 1446, 29889, 32001, 910, 6911, 304, 3867, 596, 3573, 411, 278, 18853, 18254, 374, 1237, 304, 740, 472, 967, 1900, 322, 508, 1371, 5557, 17168, 293, 10267, 2129, 29889, 32001, 29871, 29906, 29889, 32001, 2201, 482, 297, 4943, 9128, 6354, 29901, 1222, 6269, 895, 338, 7618, 1455, 363, 7344, 292, 4549, 289, 2873, 29892, 2301, 7799, 29892, 322, 5881, 29875, 586, 6151, 1070, 9045, 29889, 32001, 319, 326, 363, 472, 3203, 29871, 29896, 29945, 29900, 6233, 310, 17768, 403, 14911, 711, 293, 15058, 470, 29871, 29955, 29945, 6233, 310, 14877, 20657, 15058, 1269, 4723, 29889, 32001, 29871, 29941, 29889, 32001, 3617, 3307, 8709, 29901, 
24162, 3307, 11029, 8709, 338, 7618, 1455, 363, 9128, 322, 19119, 1532, 29899, 915, 292, 29889, 32001, 739, 6911, 304, 1072, 5987, 286, 2092, 29892, 11157, 25323, 3321, 740, 29892, 322, 11286, 9045, 29891, 14321, 322, 5198, 1540, 740, 29889, 32001, 319, 326, 363, 29871, 29955, 29899, 29929, 6199, 310, 8709, 1269, 4646, 29889, 2]} {'input_ids': [1, 13866, 338, 385, 15278, 393, 16612, 263, 3414, 29889, 14350, 263, 2933, 393, 7128, 2486, 1614, 2167, 278, 2009, 29889, 32001, 835, 2799, 4080, 29901, 13, 29954, 573, 2211, 25562, 363, 7952, 292, 9045, 29891, 29889, 32001, 835, 13291, 29901, 29896, 29889, 32001, 382, 271, 263, 6411, 8362, 322, 18254, 768, 2738, 652, 300, 29901, 8561, 1854, 596, 592, 1338, 526, 20978, 573, 310, 263, 12875, 310, 285, 21211, 322, 18655, 1849, 29892, 20793, 26823, 29892, 3353, 2646, 1144, 29892, 322, 9045, 29891, 285, 1446, 29889, 32001, 910, 6911, 304, 3867, 596, 3573, 411, 278, 18853, 18254, 374, 1237, 304, 740, 472, 967, 1900, 322, 508, 1371, 5557, 17168, 293, 10267, 2129, 29889, 32001, 29871, 29906, 29889, 32001, 2201, 482, 297, 4943, 9128, 6354, 29901, 1222, 6269, 895, 338, 7618, 1455, 363, 7344, 292, 4549, 289, 2873, 29892, 2301, 7799, 29892, 322, 5881, 29875, 586, 6151, 1070, 9045, 29889, 32001, 319, 326, 363, 472, 3203, 29871, 29896, 29945, 29900, 6233, 310, 17768, 403, 14911, 711, 293, 15058, 470, 29871, 29955, 29945, 6233, 310, 14877, 20657, 15058, 1269, 4723, 29889, 32001, 29871, 29941, 29889, 32001, 3617, 3307, 8709, 29901, 24162, 3307, 11029, 8709, 338, 7618, 1455, 363, 9128, 322, 19119, 1532, 29899, 915, 292, 29889, 32001, 739, 6911, 304, 1072, 5987, 286, 2092, 29892, 11157, 25323, 3321, 740, 29892, 322, 11286, 9045, 29891, 14321, 322, 5198, 1540, 740, 29889, 32001, 319, 326, 363, 29871, 29955, 29899, 29929, 6199, 310, 8709, 1269, 4646, 29889, 2], 'attention_mask': [1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 
1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1], 'labels': [-100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, -100, 29896, 29889, 32001, 382, 271, 263, 6411, 8362, 322, 18254, 768, 2738, 652, 300, 29901, 8561, 1854, 596, 592, 1338, 526, 20978, 573, 310, 263, 12875, 310, 285, 21211, 322, 18655, 1849, 29892, 20793, 26823, 29892, 3353, 2646, 1144, 29892, 322, 9045, 29891, 285, 1446, 29889, 32001, 910, 6911, 304, 3867, 596, 3573, 411, 278, 18853, 18254, 374, 1237, 304, 740, 472, 967, 1900, 322, 508, 1371, 5557, 17168, 293, 10267, 2129, 29889, 32001, 29871, 29906, 29889, 32001, 2201, 482, 297, 4943, 9128, 6354, 29901, 1222, 6269, 895, 338, 7618, 1455, 363, 7344, 292, 4549, 289, 2873, 29892, 2301, 7799, 29892, 322, 5881, 29875, 586, 6151, 1070, 9045, 29889, 32001, 319, 326, 363, 472, 3203, 29871, 29896, 29945, 29900, 6233, 310, 17768, 403, 14911, 711, 293, 15058, 470, 29871, 29955, 29945, 6233, 310, 14877, 20657, 15058, 1269, 4723, 29889, 32001, 29871, 29941, 29889, 32001, 3617, 3307, 8709, 29901, 24162, 3307, 11029, 8709, 338, 7618, 1455, 363, 9128, 322, 19119, 1532, 29899, 915, 292, 29889, 32001, 739, 6911, 304, 1072, 5987, 286, 2092, 29892, 11157, 25323, 3321, 740, 29892, 322, 11286, 9045, 29891, 14321, 322, 5198, 1540, 740, 29889, 32001, 319, 326, 363, 29871, 29955, 
29899, 29929, 6199, 310, 8709, 1269, 4646, 29889, 2]} [INFO|trainer.py:606] 2024-02-08 17:11:12,689 >> Using cuda_amp half precision backend [2024-02-08 17:11:12,704] [INFO] [logging.py:96:log_dist] [Rank 0] DeepSpeed info: version=0.11.1, git-hash=unknown, git-branch=unknown [2024-02-08 17:11:13,045] [INFO] [logging.py:96:log_dist] [Rank 0] DeepSpeed Flops Profiler Enabled: False Using /root/.cache/torch_extensions/py38_cu117 as PyTorch extensions root... Using /root/.cache/torch_extensions/py38_cu117 as PyTorch extensions root... Detected CUDA files, patching ldflags Emitting ninja build file /root/.cache/torch_extensions/py38_cu117/cpu_adam/build.ninja... Building extension module cpu_adam... Allowing ninja to set a default number of workers... (overridable by setting the environment variable MAX_JOBS=N) ninja: no work to do. Loading extension module cpu_adam... Time to load cpu_adam op: 1.2547764778137207 seconds Using /root/.cache/torch_extensions/py38_cu117 as PyTorch extensions root... Using /root/.cache/torch_extensions/py38_cu117 as PyTorch extensions root... Using /root/.cache/torch_extensions/py38_cu117 as PyTorch extensions root... Using /root/.cache/torch_extensions/py38_cu117 as PyTorch extensions root... Using /root/.cache/torch_extensions/py38_cu117 as PyTorch extensions root... Detected CUDA files, patching ldflags Emitting ninja build file /root/.cache/torch_extensions/py38_cu117/cpu_adam/build.ninja... Building extension module cpu_adam... Allowing ninja to set a default number of workers... (overridable by setting the environment variable MAX_JOBS=N) Using /root/.cache/torch_extensions/py38_cu117 as PyTorch extensions root... ninja: no work to do. Loading extension module cpu_adam... Time to load cpu_adam op: 1.2313756942749023 seconds Loading extension module cpu_adam... Time to load cpu_adam op: 0.9743907451629639 seconds Loading extension module cpu_adam... Loading extension module cpu_adam... 
Time to load cpu_adam op: 1.2652218341827393 seconds
Time to load cpu_adam op: 1.274989366531372 seconds
Loading extension module cpu_adam...
Time to load cpu_adam op: 1.6485710144042969 seconds
Loading extension module cpu_adam...
Time to load cpu_adam op: 1.002065896987915 seconds
Loading extension module cpu_adam...
Time to load cpu_adam op: 1.0593931674957275 seconds
Adam Optimizer #0 is created with AVX2 arithmetic capability.
Config: alpha=0.000020, betas=(0.900000, 0.999000), weight_decay=0.000000, adam_w=1
[2024-02-08 17:11:19,280] [INFO] [logging.py:96:log_dist] [Rank 0] Using DeepSpeed Optimizer param name adam as basic optimizer
[2024-02-08 17:11:19,281] [INFO] [logging.py:96:log_dist] [Rank 0] Removing param_group that has no 'params' in the basic Optimizer
[2024-02-08 17:11:19,302] [INFO] [logging.py:96:log_dist] [Rank 0] DeepSpeed Basic Optimizer = DeepSpeedCPUAdam
[2024-02-08 17:11:19,302] [INFO] [utils.py:56:is_zero_supported_optimizer] Checking ZeRO support for optimizer=DeepSpeedCPUAdam type=
[2024-02-08 17:11:19,302] [INFO] [logging.py:96:log_dist] [Rank 0] Creating fp16 ZeRO stage 3 optimizer, MiCS is enabled False, Hierarchical params gather False
[2024-02-08 17:11:19,302] [INFO] [logging.py:96:log_dist] [Rank 0] Creating torch.bfloat16 ZeRO stage 3 optimizer
[2024-02-08 17:11:19,544] [INFO] [utils.py:802:see_memory_usage] Stage 3 initialize beginning
[2024-02-08 17:11:19,545] [INFO] [utils.py:803:see_memory_usage] MA 0.03 GB Max_MA 0.79 GB CA 0.8 GB Max_CA 1 GB
[2024-02-08 17:11:19,545] [INFO] [utils.py:810:see_memory_usage] CPU Virtual Memory: used = 84.38 GB, percent = 8.4%
[2024-02-08 17:11:19,549] [INFO] [stage3.py:126:__init__] Reduce bucket size 16777216
[2024-02-08 17:11:19,549] [INFO] [stage3.py:127:__init__] Prefetch bucket size 15099494
[2024-02-08 17:11:19,774] [INFO] [utils.py:802:see_memory_usage] DeepSpeedZeRoOffload initialize [begin]
[2024-02-08 17:11:19,775] [INFO] [utils.py:803:see_memory_usage] MA 0.03 GB Max_MA 0.03 GB CA 0.8 GB Max_CA 1 GB
[2024-02-08 17:11:19,775] [INFO] [utils.py:810:see_memory_usage] CPU Virtual Memory: used = 84.45 GB, percent = 8.4%
Parameter Offload: Total persistent parameters: 266240 in 65 params
[2024-02-08 17:11:20,054] [INFO] [utils.py:802:see_memory_usage] DeepSpeedZeRoOffload initialize [end]
[2024-02-08 17:11:20,055] [INFO] [utils.py:803:see_memory_usage] MA 0.03 GB Max_MA 0.03 GB CA 0.8 GB Max_CA 1 GB
[2024-02-08 17:11:20,055] [INFO] [utils.py:810:see_memory_usage] CPU Virtual Memory: used = 84.44 GB, percent = 8.4%
[2024-02-08 17:11:20,286] [INFO] [utils.py:802:see_memory_usage] Before creating fp16 partitions
[2024-02-08 17:11:20,287] [INFO] [utils.py:803:see_memory_usage] MA 0.03 GB Max_MA 0.03 GB CA 0.8 GB Max_CA 1 GB
[2024-02-08 17:11:20,287] [INFO] [utils.py:810:see_memory_usage] CPU Virtual Memory: used = 84.39 GB, percent = 8.4%
[2024-02-08 17:11:21,890] [INFO] [utils.py:802:see_memory_usage] After creating fp16 partitions: 1
[2024-02-08 17:11:21,891] [INFO] [utils.py:803:see_memory_usage] MA 0.03 GB Max_MA 0.03 GB CA 0.8 GB Max_CA 1 GB
[2024-02-08 17:11:21,892] [INFO] [utils.py:810:see_memory_usage] CPU Virtual Memory: used = 106.11 GB, percent = 10.5%
[2024-02-08 17:11:22,134] [INFO] [utils.py:802:see_memory_usage] Before creating fp32 partitions
[2024-02-08 17:11:22,135] [INFO] [utils.py:803:see_memory_usage] MA 0.03 GB Max_MA 0.03 GB CA 0.8 GB Max_CA 1 GB
[2024-02-08 17:11:22,135] [INFO] [utils.py:810:see_memory_usage] CPU Virtual Memory: used = 111.26 GB, percent = 11.1%
[2024-02-08 17:11:23,828] [INFO] [utils.py:802:see_memory_usage] After creating fp32 partitions
[2024-02-08 17:11:23,829] [INFO] [utils.py:803:see_memory_usage] MA 0.03 GB Max_MA 0.03 GB CA 0.8 GB Max_CA 1 GB
[2024-02-08 17:11:23,829] [INFO] [utils.py:810:see_memory_usage] CPU Virtual Memory: used = 125.82 GB, percent = 12.5%
[2024-02-08 17:11:24,095] [INFO] [utils.py:802:see_memory_usage] Before initializing optimizer states
[2024-02-08 17:11:24,096] [INFO] [utils.py:803:see_memory_usage] MA 0.03 GB Max_MA 0.03 GB CA 0.8 GB Max_CA 1 GB
[2024-02-08 17:11:24,096] [INFO] [utils.py:810:see_memory_usage] CPU Virtual Memory: used = 132.52 GB, percent = 13.2%
[2024-02-08 17:11:31,134] [INFO] [utils.py:802:see_memory_usage] After initializing optimizer states
[2024-02-08 17:11:31,135] [INFO] [utils.py:803:see_memory_usage] MA 0.03 GB Max_MA 0.03 GB CA 0.8 GB Max_CA 1 GB
[2024-02-08 17:11:31,135] [INFO] [utils.py:810:see_memory_usage] CPU Virtual Memory: used = 211.72 GB, percent = 21.0%
[2024-02-08 17:11:31,227] [INFO] [stage3.py:459:_setup_for_real_optimizer] optimizer state initialized
[2024-02-08 17:11:33,476] [INFO] [utils.py:802:see_memory_usage] After initializing ZeRO optimizer
[2024-02-08 17:11:33,477] [INFO] [utils.py:803:see_memory_usage] MA 0.06 GB Max_MA 0.55 GB CA 1.05 GB Max_CA 1 GB
[2024-02-08 17:11:33,477] [INFO] [utils.py:810:see_memory_usage] CPU Virtual Memory: used = 225.38 GB, percent = 22.4%
[2024-02-08 17:11:33,477] [INFO] [logging.py:96:log_dist] [Rank 0] DeepSpeed Final Optimizer = adam
[2024-02-08 17:11:33,477] [INFO] [logging.py:96:log_dist] [Rank 0] DeepSpeed using client callable to create LR scheduler
[2024-02-08 17:11:33,477] [INFO] [logging.py:96:log_dist] [Rank 0] DeepSpeed LR Scheduler =
[2024-02-08 17:11:33,477] [INFO] [logging.py:96:log_dist] [Rank 0] step=0, skipped=0, lr=[0.0], mom=[[0.9, 0.999]]
[2024-02-08 17:11:33,479] [INFO] [config.py:968:print] DeepSpeedEngine configuration:
[2024-02-08 17:11:33,479] [INFO] [config.py:972:print] activation_checkpointing_config {
    "partition_activations": false,
    "contiguous_memory_optimization": false,
    "cpu_checkpointing": false,
    "number_checkpoints": null,
    "synchronize_checkpoint_boundary": false,
    "profile": false
}
[2024-02-08 17:11:33,479] [INFO] [config.py:972:print] aio_config ................... {'block_size': 1048576, 'queue_depth': 8, 'thread_count': 1, 'single_submit': False, 'overlap_events': True}
[2024-02-08 17:11:33,479] [INFO] [config.py:972:print] amp_enabled .................. False
[2024-02-08 17:11:33,479] [INFO] [config.py:972:print] amp_params ................... False
[2024-02-08 17:11:33,480] [INFO] [config.py:972:print] autotuning_config ............ {
    "enabled": false,
    "start_step": null,
    "end_step": null,
    "metric_path": null,
    "arg_mappings": null,
    "metric": "throughput",
    "model_info": null,
    "results_dir": "autotuning_results",
    "exps_dir": "autotuning_exps",
    "overwrite": true,
    "fast": true,
    "start_profile_step": 3,
    "end_profile_step": 5,
    "tuner_type": "gridsearch",
    "tuner_early_stopping": 5,
    "tuner_num_trials": 50,
    "model_info_path": null,
    "mp_size": 1,
    "max_train_batch_size": null,
    "min_train_batch_size": 1,
    "max_train_micro_batch_size_per_gpu": 1.024000e+03,
    "min_train_micro_batch_size_per_gpu": 1,
    "num_tuning_micro_batch_sizes": 3
}
[2024-02-08 17:11:33,480] [INFO] [config.py:972:print] bfloat16_enabled ............. True
[2024-02-08 17:11:33,480] [INFO] [config.py:972:print] checkpoint_parallel_write_pipeline False
[2024-02-08 17:11:33,480] [INFO] [config.py:972:print] checkpoint_tag_validation_enabled True
[2024-02-08 17:11:33,480] [INFO] [config.py:972:print] checkpoint_tag_validation_fail False
[2024-02-08 17:11:33,480] [INFO] [config.py:972:print] comms_config .................
[2024-02-08 17:11:33,480] [INFO] [config.py:972:print] communication_data_type ...... None
[2024-02-08 17:11:33,480] [INFO] [config.py:972:print] compression_config ........... {'weight_quantization': {'shared_parameters': {'enabled': False, 'quantizer_kernel': False, 'schedule_offset': 0, 'quantize_groups': 1, 'quantize_verbose': False, 'quantization_type': 'symmetric', 'quantize_weight_in_forward': False, 'rounding': 'nearest', 'fp16_mixed_quantize': False, 'quantize_change_ratio': 0.001}, 'different_groups': {}}, 'activation_quantization': {'shared_parameters': {'enabled': False, 'quantization_type': 'symmetric', 'range_calibration': 'dynamic', 'schedule_offset': 1000}, 'different_groups': {}}, 'sparse_pruning': {'shared_parameters': {'enabled': False, 'method': 'l1', 'schedule_offset': 1000}, 'different_groups': {}}, 'row_pruning': {'shared_parameters': {'enabled': False, 'method': 'l1', 'schedule_offset': 1000}, 'different_groups': {}}, 'head_pruning': {'shared_parameters': {'enabled': False, 'method': 'topk', 'schedule_offset': 1000}, 'different_groups': {}}, 'channel_pruning': {'shared_parameters': {'enabled': False, 'method': 'l1', 'schedule_offset': 1000}, 'different_groups': {}}, 'layer_reduction': {'enabled': False}}
[2024-02-08 17:11:33,480] [INFO] [config.py:972:print] curriculum_enabled_legacy .... False
[2024-02-08 17:11:33,480] [INFO] [config.py:972:print] curriculum_params_legacy ..... False
[2024-02-08 17:11:33,480] [INFO] [config.py:972:print] data_efficiency_config ....... {'enabled': False, 'seed': 1234, 'data_sampling': {'enabled': False, 'num_epochs': 1000, 'num_workers': 0, 'curriculum_learning': {'enabled': False}}, 'data_routing': {'enabled': False, 'random_ltd': {'enabled': False, 'layer_token_lr_schedule': {'enabled': False}}}}
[2024-02-08 17:11:33,480] [INFO] [config.py:972:print] data_efficiency_enabled ...... False
[2024-02-08 17:11:33,480] [INFO] [config.py:972:print] dataloader_drop_last ......... False
[2024-02-08 17:11:33,480] [INFO] [config.py:972:print] disable_allgather ............ False
[2024-02-08 17:11:33,480] [INFO] [config.py:972:print] dump_state ................... 
False [2024-02-08 17:11:33,480] [INFO] [config.py:972:print] dynamic_loss_scale_args ...... None [2024-02-08 17:11:33,480] [INFO] [config.py:972:print] eigenvalue_enabled ........... False [2024-02-08 17:11:33,480] [INFO] [config.py:972:print] eigenvalue_gas_boundary_resolution 1 [2024-02-08 17:11:33,480] [INFO] [config.py:972:print] eigenvalue_layer_name ........ bert.encoder.layer [2024-02-08 17:11:33,480] [INFO] [config.py:972:print] eigenvalue_layer_num ......... 0 [2024-02-08 17:11:33,480] [INFO] [config.py:972:print] eigenvalue_max_iter .......... 100 [2024-02-08 17:11:33,480] [INFO] [config.py:972:print] eigenvalue_stability ......... 1e-06 [2024-02-08 17:11:33,480] [INFO] [config.py:972:print] eigenvalue_tol ............... 0.01 [2024-02-08 17:11:33,480] [INFO] [config.py:972:print] eigenvalue_verbose ........... False [2024-02-08 17:11:33,480] [INFO] [config.py:972:print] elasticity_enabled ........... False [2024-02-08 17:11:33,481] [INFO] [config.py:972:print] flops_profiler_config ........ { "enabled": false, "recompute_fwd_factor": 0.0, "profile_step": 1, "module_depth": -1, "top_modules": 1, "detailed": true, "output_file": null } [2024-02-08 17:11:33,481] [INFO] [config.py:972:print] fp16_auto_cast ............... None [2024-02-08 17:11:33,481] [INFO] [config.py:972:print] fp16_enabled ................. False [2024-02-08 17:11:33,481] [INFO] [config.py:972:print] fp16_master_weights_and_gradients False [2024-02-08 17:11:33,481] [INFO] [config.py:972:print] global_rank .................. 0 [2024-02-08 17:11:33,481] [INFO] [config.py:972:print] grad_accum_dtype ............. None [2024-02-08 17:11:33,481] [INFO] [config.py:972:print] gradient_accumulation_steps .. 8 [2024-02-08 17:11:33,481] [INFO] [config.py:972:print] gradient_clipping ............ 1.0 [2024-02-08 17:11:33,481] [INFO] [config.py:972:print] gradient_predivide_factor .... 1.0 [2024-02-08 17:11:33,481] [INFO] [config.py:972:print] hybrid_engine ................ 
enabled=False max_out_tokens=512 inference_tp_size=1 release_inference_cache=False pin_parameters=True tp_gather_partition_size=8 [2024-02-08 17:11:33,481] [INFO] [config.py:972:print] initial_dynamic_scale ........ 1 [2024-02-08 17:11:33,481] [INFO] [config.py:972:print] load_universal_checkpoint .... False [2024-02-08 17:11:33,481] [INFO] [config.py:972:print] loss_scale ................... 1.0 [2024-02-08 17:11:33,481] [INFO] [config.py:972:print] memory_breakdown ............. False [2024-02-08 17:11:33,481] [INFO] [config.py:972:print] mics_hierarchial_params_gather False [2024-02-08 17:11:33,481] [INFO] [config.py:972:print] mics_shard_size .............. -1 [2024-02-08 17:11:33,481] [INFO] [config.py:972:print] monitor_config ............... tensorboard=TensorBoardConfig(enabled=False, output_path='', job_name='DeepSpeedJobName') wandb=WandbConfig(enabled=False, group=None, team=None, project='deepspeed') csv_monitor=CSVConfig(enabled=False, output_path='', job_name='DeepSpeedJobName') enabled=False [2024-02-08 17:11:33,481] [INFO] [config.py:972:print] nebula_config ................ { "enabled": false, "persistent_storage_path": null, "persistent_time_interval": 100, "num_of_version_in_retention": 2, "enable_nebula_load": true, "load_path": null } [2024-02-08 17:11:33,481] [INFO] [config.py:972:print] optimizer_legacy_fusion ...... False [2024-02-08 17:11:33,481] [INFO] [config.py:972:print] optimizer_name ............... adam [2024-02-08 17:11:33,481] [INFO] [config.py:972:print] optimizer_params ............. {'lr': 2e-05, 'betas': [0.9, 0.999], 'eps': 1e-08, 'weight_decay': 0.0} [2024-02-08 17:11:33,481] [INFO] [config.py:972:print] pipeline ..................... {'stages': 'auto', 'partition': 'best', 'seed_layers': False, 'activation_checkpoint_interval': 0} [2024-02-08 17:11:33,481] [INFO] [config.py:972:print] pld_enabled .................. False [2024-02-08 17:11:33,481] [INFO] [config.py:972:print] pld_params ................... 
False [2024-02-08 17:11:33,481] [INFO] [config.py:972:print] prescale_gradients ........... False [2024-02-08 17:11:33,481] [INFO] [config.py:972:print] scheduler_name ............... None [2024-02-08 17:11:33,481] [INFO] [config.py:972:print] scheduler_params ............. None [2024-02-08 17:11:33,481] [INFO] [config.py:972:print] sparse_attention ............. None [2024-02-08 17:11:33,481] [INFO] [config.py:972:print] sparse_gradients_enabled ..... False [2024-02-08 17:11:33,481] [INFO] [config.py:972:print] steps_per_print .............. 1000 [2024-02-08 17:11:33,482] [INFO] [config.py:972:print] train_batch_size ............. 512 [2024-02-08 17:11:33,482] [INFO] [config.py:972:print] train_micro_batch_size_per_gpu 8 [2024-02-08 17:11:33,482] [INFO] [config.py:972:print] use_node_local_storage ....... False [2024-02-08 17:11:33,482] [INFO] [config.py:972:print] wall_clock_breakdown ......... False [2024-02-08 17:11:33,482] [INFO] [config.py:972:print] weight_quantization_config ... None [2024-02-08 17:11:33,482] [INFO] [config.py:972:print] world_size ................... 8 [2024-02-08 17:11:33,482] [INFO] [config.py:972:print] zero_allow_untested_optimizer False [2024-02-08 17:11:33,482] [INFO] [config.py:972:print] zero_config .................. 
stage=3 contiguous_gradients=True reduce_scatter=True reduce_bucket_size=16777216 allgather_partitions=True allgather_bucket_size=500,000,000 overlap_comm=True load_from_fp32_weights=True elastic_checkpoint=False offload_param=DeepSpeedZeroOffloadParamConfig(device='cpu', nvme_path=None, buffer_count=5, buffer_size=100,000,000, max_in_cpu=1,000,000,000, pin_memory=True) offload_optimizer=DeepSpeedZeroOffloadOptimizerConfig(device='cpu', nvme_path=None, buffer_count=4, pin_memory=True, pipeline=False, pipeline_read=False, pipeline_write=False, fast_init=False) sub_group_size=1000000000 cpu_offload_param=None cpu_offload_use_pin_memory=None cpu_offload=None prefetch_bucket_size=15099494 param_persistence_threshold=40960 model_persistence_threshold=sys.maxsize max_live_parameters=1000000000 max_reuse_distance=1000000000 gather_16bit_weights_on_model_save=True stage3_gather_fp16_weights_on_model_save=False ignore_unused_parameters=True legacy_stage1=False round_robin_gradients=False zero_hpz_partition_size=1 zero_quantized_weights=False zero_quantized_nontrainable_weights=False zero_quantized_gradients=False mics_shard_size=-1 mics_hierarchical_params_gather=False memory_efficient_linear=True pipeline_loading_checkpoint=False override_module_apply=True [2024-02-08 17:11:33,482] [INFO] [config.py:972:print] zero_enabled ................. True [2024-02-08 17:11:33,482] [INFO] [config.py:972:print] zero_force_ds_cpu_optimizer .. True [2024-02-08 17:11:33,482] [INFO] [config.py:972:print] zero_optimization_stage ...... 
3 [2024-02-08 17:11:33,482] [INFO] [config.py:958:print_user_config] json = { "optimizer": { "type": "Adam", "params": { "lr": 2e-05, "betas": [0.9, 0.999], "eps": 1e-08, "weight_decay": 0.0 } }, "bf16": { "enabled": true }, "zero_optimization": { "stage": 3, "offload_optimizer": { "device": "cpu", "pin_memory": true }, "offload_param": { "device": "cpu", "pin_memory": true }, "overlap_comm": true, "contiguous_gradients": true, "reduce_bucket_size": 1.677722e+07, "stage3_prefetch_bucket_size": 1.509949e+07, "stage3_param_persistence_threshold": 4.096000e+04, "sub_group_size": 1.000000e+09, "stage3_max_live_parameters": 1.000000e+09, "stage3_max_reuse_distance": 1.000000e+09, "stage3_gather_16bit_weights_on_model_save": true }, "gradient_accumulation_steps": 8, "gradient_clipping": 1.0, "steps_per_print": 1000, "train_batch_size": 512, "train_micro_batch_size_per_gpu": 8, "wall_clock_breakdown": false } [INFO|trainer.py:1755] 2024-02-08 17:11:33,484 >> ***** Running training ***** [INFO|trainer.py:1756] 2024-02-08 17:11:33,484 >> Num examples = 64204 [INFO|trainer.py:1757] 2024-02-08 17:11:33,484 >> Num Epochs = 3 [INFO|trainer.py:1758] 2024-02-08 17:11:33,484 >> Instantaneous batch size per device = 8 [INFO|trainer.py:1759] 2024-02-08 17:11:33,484 >> Total train batch size (w. 
parallel, distributed & accumulation) = 512 [INFO|trainer.py:1760] 2024-02-08 17:11:33,484 >> Gradient Accumulation steps = 8 [INFO|trainer.py:1761] 2024-02-08 17:11:33,484 >> Total optimization steps = 375 [INFO|trainer.py:1762] 2024-02-08 17:11:33,486 >> Number of trainable parameters = 6738432000 0%| | 0/375 [00:00<?, ?it/s] ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO Trees [0] 5/-1/-1->4->3 [1] 5/-1/-1->4->3 [2] 5/-1/-1->4->3 [3] 5/-1/-1->4->3 [4] 5/-1/-1->4->3 [5] 5/-1/-1->4->3 [6] 5/-1/-1->4->3 [7] 5/-1/-1->4->3 [8] 5/-1/-1->4->3 [9] 5/-1/-1->4->3 [10] 5/-1/-1->4->3 [11] 5/-1/-1->4->3 [12] 5/-1/-1->4->3 [13] 5/-1/-1->4->3 [14] 5/-1/-1->4->3 [15] 5/-1/-1->4->3 [16] 5/-1/-1->4->3 [17] 5/-1/-1->4->3 [18] 5/-1/-1->4->3 [19] 5/-1/-1->4->3 [20] 5/-1/-1->4->3 [21] 5/-1/-1->4->3 [22] 5/-1/-1->4->3 [23] 5/-1/-1->4->3 ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO Channel 00/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO Channel 01/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO Channel 02/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO Channel 03/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO Trees [0] -1/-1/-1->7->6 [1] -1/-1/-1->7->6 [2] -1/-1/-1->7->6 [3] -1/-1/-1->7->6 [4] -1/-1/-1->7->6 [5] -1/-1/-1->7->6 [6] -1/-1/-1->7->6 [7] -1/-1/-1->7->6 [8] -1/-1/-1->7->6 [9] -1/-1/-1->7->6 [10] -1/-1/-1->7->6 [11] -1/-1/-1->7->6 [12] -1/-1/-1->7->6 [13] -1/-1/-1->7->6 [14] -1/-1/-1->7->6 [15] -1/-1/-1->7->6 [16] -1/-1/-1->7->6 [17] -1/-1/-1->7->6 [18] -1/-1/-1->7->6 [19] -1/-1/-1->7->6 [20] -1/-1/-1->7->6 [21] -1/-1/-1->7->6 [22] -1/-1/-1->7->6 [23] -1/-1/-1->7->6 ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO Channel 04/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO Trees [0] 7/-1/-1->6->5 [1] 7/-1/-1->6->5 [2] 7/-1/-1->6->5 [3] 7/-1/-1->6->5 [4] 7/-1/-1->6->5 [5] 7/-1/-1->6->5
[6] 7/-1/-1->6->5 [7] 7/-1/-1->6->5 [8] 7/-1/-1->6->5 [9] 7/-1/-1->6->5 [10] 7/-1/-1->6->5 [11] 7/-1/-1->6->5 [12] 7/-1/-1->6->5 [13] 7/-1/-1->6->5 [14] 7/-1/-1->6->5 [15] 7/-1/-1->6->5 [16] 7/-1/-1->6->5 [17] 7/-1/-1->6->5 [18] 7/-1/-1->6->5 [19] 7/-1/-1->6->5 [20] 7/-1/-1->6->5 [21] 7/-1/-1->6->5 [22] 7/-1/-1->6->5 [23] 7/-1/-1->6->5 ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO Trees [0] 6/-1/-1->5->4 [1] 6/-1/-1->5->4 [2] 6/-1/-1->5->4 [3] 6/-1/-1->5->4 [4] 6/-1/-1->5->4 [5] 6/-1/-1->5->4 [6] 6/-1/-1->5->4 [7] 6/-1/-1->5->4 [8] 6/-1/-1->5->4 [9] 6/-1/-1->5->4 [10] 6/-1/-1->5->4 [11] 6/-1/-1->5->4 [12] 6/-1/-1->5->4 [13] 6/-1/-1->5->4 [14] 6/-1/-1->5->4 [15] 6/-1/-1->5->4 [16] 6/-1/-1->5->4 [17] 6/-1/-1->5->4 [18] 6/-1/-1->5->4 [19] 6/-1/-1->5->4 [20] 6/-1/-1->5->4 [21] 6/-1/-1->5->4 [22] 6/-1/-1->5->4 [23] 6/-1/-1->5->4 ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO Trees [0] 2/-1/-1->1->0 [1] 2/-1/-1->1->0 [2] 2/-1/-1->1->0 [3] 2/-1/-1->1->0 [4] 2/-1/-1->1->0 [5] 2/-1/-1->1->0 [6] 2/-1/-1->1->0 [7] 2/-1/-1->1->0 [8] 2/-1/-1->1->0 [9] 2/-1/-1->1->0 [10] 2/-1/-1->1->0 [11] 2/-1/-1->1->0 [12] 2/-1/-1->1->0 [13] 2/-1/-1->1->0 [14] 2/-1/-1->1->0 [15] 2/-1/-1->1->0 [16] 2/-1/-1->1->0 [17] 2/-1/-1->1->0 [18] 2/-1/-1->1->0 [19] 2/-1/-1->1->0 [20] 2/-1/-1->1->0 [21] 2/-1/-1->1->0 [22] 2/-1/-1->1->0 [23] 2/-1/-1->1->0 ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO Channel 05/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO Channel 06/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO Channel 07/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO Channel 08/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO Channel 09/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO Channel 10/24 : 0 1 2 3 4 5 6 
7 ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO Channel 11/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO Trees [0] 3/-1/-1->2->1 [1] 3/-1/-1->2->1 [2] 3/-1/-1->2->1 [3] 3/-1/-1->2->1 [4] 3/-1/-1->2->1 [5] 3/-1/-1->2->1 [6] 3/-1/-1->2->1 [7] 3/-1/-1->2->1 [8] 3/-1/-1->2->1 [9] 3/-1/-1->2->1 [10] 3/-1/-1->2->1 [11] 3/-1/-1->2->1 [12] 3/-1/-1->2->1 [13] 3/-1/-1->2->1 [14] 3/-1/-1->2->1 [15] 3/-1/-1->2->1 [16] 3/-1/-1->2->1 [17] 3/-1/-1->2->1 [18] 3/-1/-1->2->1 [19] 3/-1/-1->2->1 [20] 3/-1/-1->2->1 [21] 3/-1/-1->2->1 [22] 3/-1/-1->2->1 [23] 3/-1/-1->2->1 ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO Trees [0] 4/-1/-1->3->2 [1] 4/-1/-1->3->2 [2] 4/-1/-1->3->2 [3] 4/-1/-1->3->2 [4] 4/-1/-1->3->2 [5] 4/-1/-1->3->2 [6] 4/-1/-1->3->2 [7] 4/-1/-1->3->2 [8] 4/-1/-1->3->2 [9] 4/-1/-1->3->2 [10] 4/-1/-1->3->2 [11] 4/-1/-1->3->2 [12] 4/-1/-1->3->2 [13] 4/-1/-1->3->2 [14] 4/-1/-1->3->2 [15] 4/-1/-1->3->2 [16] 4/-1/-1->3->2 [17] 4/-1/-1->3->2 [18] 4/-1/-1->3->2 [19] 4/-1/-1->3->2 [20] 4/-1/-1->3->2 [21] 4/-1/-1->3->2 [22] 4/-1/-1->3->2 [23] 4/-1/-1->3->2 ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO Channel 12/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO Channel 13/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO Channel 14/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO Channel 15/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO Channel 16/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO Channel 17/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO Channel 18/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO Channel 19/24 : 0 1 2 3 4 5 6 7 
ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO Channel 20/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO Channel 21/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO Channel 22/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO Channel 23/24 : 0 1 2 3 4 5 6 7 ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO Trees [0] 1/-1/-1->0->-1 [1] 1/-1/-1->0->-1 [2] 1/-1/-1->0->-1 [3] 1/-1/-1->0->-1 [4] 1/-1/-1->0->-1 [5] 1/-1/-1->0->-1 [6] 1/-1/-1->0->-1 [7] 1/-1/-1->0->-1 [8] 1/-1/-1->0->-1 [9] 1/-1/-1->0->-1 [10] 1/-1/-1->0->-1 [11] 1/-1/-1->0->-1 [12] 1/-1/-1->0->-1 [13] 1/-1/-1->0->-1 [14] 1/-1/-1->0->-1 [15] 1/-1/-1->0->-1 [16] 1/-1/-1->0->-1 [17] 1/-1/-1->0->-1 [18] 1/-1/-1->0->-1 [19] 1/-1/-1->0->-1 [20] 1/-1/-1->0->-1 [21] 1/-1/-1->0->-1 [22] 1/-1/-1->0->-1 [23] 1/-1/-1->0->-1 ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO Channel 00/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO Channel 00/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO Channel 00/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO Channel 00/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO Channel 00/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO Channel 00/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO Channel 00/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO Channel 00/0 : 3[51000] -> 4[93000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO Channel 01/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO Channel 01/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO Channel 01/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO Channel 01/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO Channel 01/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO Channel 01/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO Channel 01/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO Channel 01/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO Channel 02/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO Channel 02/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO Channel 02/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO Channel 02/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO Channel 02/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO Channel 02/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO Channel 02/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO Channel 02/0 : 3[51000] -> 4[93000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO Channel 03/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO Channel 03/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO Channel 03/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO Channel 03/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO Channel 03/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO Channel 03/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO Channel 03/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO Channel 03/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO Channel 04/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO Channel 04/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO Channel 04/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO Channel 04/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO Channel 04/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO Channel 04/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO Channel 04/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO Channel 04/0 : 3[51000] -> 4[93000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO Channel 05/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO Channel 05/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO Channel 05/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO Channel 05/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO Channel 05/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO Channel 05/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO Channel 05/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO Channel 05/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO Channel 06/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO Channel 06/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO Channel 06/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO Channel 06/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO Channel 06/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO Channel 06/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO Channel 06/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO Channel 06/0 : 3[51000] -> 4[93000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO Channel 07/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO Channel 07/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO Channel 07/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO Channel 07/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO Channel 07/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO Channel 07/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO Channel 07/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO Channel 07/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO Channel 08/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO Channel 08/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO Channel 08/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO Channel 08/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO Channel 08/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO Channel 08/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO Channel 08/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO Channel 08/0 : 3[51000] -> 4[93000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO Channel 09/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO Channel 09/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO Channel 09/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO Channel 09/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO Channel 09/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO Channel 09/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO Channel 09/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO Channel 09/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO Channel 10/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO Channel 10/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO Channel 10/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO Channel 10/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO Channel 10/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO Channel 10/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO Channel 10/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO Channel 10/0 : 3[51000] -> 4[93000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO Channel 11/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO Channel 11/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO Channel 11/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO Channel 11/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO Channel 11/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO Channel 11/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO Channel 11/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO Channel 11/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO Channel 12/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO Channel 12/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO Channel 12/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO Channel 12/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO Channel 12/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO Channel 12/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO Channel 12/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO Channel 12/0 : 3[51000] -> 4[93000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO Channel 13/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO Channel 13/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO Channel 13/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO Channel 13/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO Channel 13/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO Channel 13/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO Channel 13/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO Channel 13/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO Channel 14/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO Channel 14/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO Channel 14/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO Channel 14/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO Channel 14/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO Channel 14/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO Channel 14/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO Channel 14/0 : 3[51000] -> 4[93000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO Channel 15/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO Channel 15/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO Channel 15/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO Channel 15/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO Channel 15/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO Channel 15/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO Channel 15/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO Channel 15/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO Channel 16/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO Channel 16/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO Channel 16/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO Channel 16/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO Channel 16/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO Channel 16/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO Channel 16/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO Channel 16/0 : 3[51000] -> 4[93000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO Channel 17/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO Channel 17/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO Channel 17/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO Channel 17/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO Channel 17/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO Channel 17/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO Channel 17/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO Channel 17/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO Channel 18/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO Channel 18/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO Channel 18/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO Channel 18/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO Channel 18/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO Channel 18/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO Channel 18/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO Channel 18/0 : 5[99000] -> 6[cb000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO Channel 19/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO Channel 19/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO Channel 19/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO Channel 19/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO Channel 19/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO Channel 19/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO Channel 19/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO Channel 19/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO Channel 20/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO Channel 20/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO Channel 20/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO Channel 20/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO Channel 20/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO Channel 20/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO Channel 20/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO Channel 20/0 : 5[99000] -> 6[cb000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO Channel 21/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO Channel 21/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO Channel 21/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO Channel 21/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO Channel 21/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO Channel 21/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO Channel 21/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO Channel 21/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO Channel 22/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO Channel 22/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO Channel 22/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO Channel 22/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO Channel 22/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO Channel 22/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO Channel 22/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO Channel 22/0 : 3[51000] -> 4[93000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO Channel 23/0 : 7[d0000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO Channel 23/0 : 1[13000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO Channel 23/0 : 6[cb000] -> 7[d0000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO Channel 23/0 : 4[93000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO Channel 23/0 : 0[e000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO Channel 23/0 : 2[4b000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO Channel 23/0 : 3[51000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO Channel 23/0 : 5[99000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO Connected all rings ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO Channel 00/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO Connected all rings ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO Connected all rings ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO Connected all rings ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO Connected all rings ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO Connected all rings ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO Connected all rings ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO Connected all rings ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO Channel 01/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO Channel 02/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO Channel 03/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO Channel 04/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO Channel 05/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO Channel 06/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO Channel 07/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO Channel 08/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO Channel 09/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO Channel 10/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO Channel 11/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO Channel 12/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO Channel 13/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO Channel 14/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO Channel 15/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO Channel 16/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO Channel 17/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO Channel 18/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO Channel 19/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO Channel 20/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO Channel 21/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO Channel 22/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO Channel 23/0 : 7[d0000] -> 6[cb000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO Channel 00/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO Channel 00/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO Channel 00/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO Channel 00/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO Channel 00/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO Channel 00/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO Channel 01/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO Channel 01/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO Channel 01/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO Channel 01/0 : 6[cb000] -> 5[99000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO Channel 01/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO Channel 01/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO Channel 02/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO Channel 02/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO Channel 02/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO Channel 02/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO Channel 02/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO Channel 02/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO Channel 03/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO Channel 03/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO Channel 03/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO Channel 03/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO Channel 03/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO Channel 03/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO Channel 04/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO Channel 04/0 : 1[13000] -> 0[e000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO Channel 04/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO Channel 04/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO Channel 04/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO Channel 04/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO Channel 05/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO Channel 05/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO Channel 05/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO Channel 05/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO Channel 05/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO Channel 05/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO Channel 06/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO Channel 06/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO Channel 06/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO Channel 06/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO Channel 06/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO Channel 06/0 : 2[4b000] -> 1[13000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO Channel 07/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO Channel 07/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO Channel 07/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO Channel 07/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO Channel 07/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO Channel 07/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO Channel 08/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO Channel 08/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO Channel 08/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO Channel 08/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO Channel 08/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO Channel 08/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO Channel 09/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO Channel 09/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO Channel 09/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO Channel 09/0 : 6[cb000] -> 5[99000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO Channel 09/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO Channel 09/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO Channel 10/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO Channel 10/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO Channel 10/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO Channel 10/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO Channel 10/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO Channel 10/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO Channel 11/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO Channel 11/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO Channel 11/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO Channel 11/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO Channel 11/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO Channel 11/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO Channel 12/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO Channel 12/0 : 1[13000] -> 0[e000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO Channel 12/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO Channel 12/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO Channel 12/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO Channel 12/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO Channel 13/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO Channel 13/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO Channel 13/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO Channel 13/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO Channel 13/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO Channel 13/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO Channel 14/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO Channel 14/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO Channel 14/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO Channel 14/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO Channel 14/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO Channel 14/0 : 2[4b000] -> 1[13000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO Channel 15/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO Channel 15/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO Channel 15/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO Channel 15/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO Channel 15/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO Channel 15/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO Channel 16/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO Channel 16/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO Channel 16/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO Channel 16/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO Channel 16/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO Channel 16/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO Channel 17/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO Channel 17/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO Channel 17/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO Channel 17/0 : 6[cb000] -> 5[99000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO Channel 17/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO Channel 17/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO Channel 18/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO Channel 18/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO Channel 18/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO Channel 18/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO Channel 18/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO Channel 18/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO Channel 19/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO Channel 19/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO Channel 19/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO Channel 19/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO Channel 19/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO Channel 19/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO Channel 20/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO Channel 20/0 : 1[13000] -> 0[e000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO Channel 20/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO Channel 20/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO Channel 20/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO Channel 20/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO Channel 21/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO Channel 21/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO Channel 21/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO Channel 21/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO Channel 21/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO Channel 21/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO Channel 22/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO Channel 22/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO Channel 22/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO Channel 22/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO Channel 22/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO Channel 22/0 : 2[4b000] -> 1[13000] via P2P/IPC/read 
ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO Channel 23/0 : 4[93000] -> 3[51000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO Channel 23/0 : 1[13000] -> 0[e000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO Channel 23/0 : 5[99000] -> 4[93000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO Channel 23/0 : 6[cb000] -> 5[99000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO Channel 23/0 : 3[51000] -> 2[4b000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO Channel 23/0 : 2[4b000] -> 1[13000] via P2P/IPC/read ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO Connected all trees ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO threadThresholds 8/8/64 | 64/8/64 | 512 | 512 ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO 24 coll channels, 32 p2p channels, 32 p2p channels per peer ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO Connected all trees ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO threadThresholds 8/8/64 | 64/8/64 | 512 | 512 ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO 24 coll channels, 32 p2p channels, 32 p2p channels per peer ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO Connected all trees ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO threadThresholds 8/8/64 | 64/8/64 | 512 | 512 ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO 24 coll channels, 32 p2p channels, 32 p2p channels per peer ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO Connected all trees ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO threadThresholds 8/8/64 | 64/8/64 | 512 | 512 
ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO 24 coll channels, 32 p2p channels, 32 p2p channels per peer ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO Connected all trees ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO threadThresholds 8/8/64 | 64/8/64 | 512 | 512 ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO 24 coll channels, 32 p2p channels, 32 p2p channels per peer ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO Connected all trees ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO threadThresholds 8/8/64 | 64/8/64 | 512 | 512 ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO 24 coll channels, 32 p2p channels, 32 p2p channels per peer ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO Connected all trees ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO threadThresholds 8/8/64 | 64/8/64 | 512 | 512 ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO 24 coll channels, 32 p2p channels, 32 p2p channels per peer ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO Connected all trees ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO threadThresholds 8/8/64 | 64/8/64 | 512 | 512 ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO 24 coll channels, 32 p2p channels, 32 p2p channels per peer ts-b102359ecb124d359c32da25fe3785b5-launcher:88514:94766 [7] NCCL INFO comm 0x7fbe04024c00 rank 7 nranks 8 cudaDev 7 busId d0000 - Init COMPLETE ts-b102359ecb124d359c32da25fe3785b5-launcher:88511:94765 [4] NCCL INFO comm 0x7f70980249a0 rank 4 nranks 8 cudaDev 4 busId 93000 - Init COMPLETE ts-b102359ecb124d359c32da25fe3785b5-launcher:88513:94769 [6] NCCL INFO comm 0x7f7cec025990 rank 6 nranks 8 cudaDev 6 busId cb000 - Init COMPLETE ts-b102359ecb124d359c32da25fe3785b5-launcher:88512:94770 [5] NCCL INFO comm 
0x7ff308024c90 rank 5 nranks 8 cudaDev 5 busId 99000 - Init COMPLETE ts-b102359ecb124d359c32da25fe3785b5-launcher:88509:94771 [2] NCCL INFO comm 0x7f0c6c024c80 rank 2 nranks 8 cudaDev 2 busId 4b000 - Init COMPLETE ts-b102359ecb124d359c32da25fe3785b5-launcher:88508:94767 [1] NCCL INFO comm 0x7f011c024ba0 rank 1 nranks 8 cudaDev 1 busId 13000 - Init COMPLETE ts-b102359ecb124d359c32da25fe3785b5-launcher:88510:94768 [3] NCCL INFO comm 0x7f0b64024ba0 rank 3 nranks 8 cudaDev 3 busId 51000 - Init COMPLETE ts-b102359ecb124d359c32da25fe3785b5-launcher:88507:94764 [0] NCCL INFO comm 0x7faad0025eb0 rank 0 nranks 8 cudaDev 0 busId e000 - Init COMPLETE 0%| | 1/375 [00:33<3:28:09, 33.39s/it] {'loss': 1.5044, 'learning_rate': 1.6666666666666667e-06, 'epoch': 0.01} 0%| | 1/375 [00:33<3:28:09, 33.39s/it] 1%| | 2/375 [01:01<3:09:02, 30.41s/it] {'loss': 1.4514, 'learning_rate': 3.3333333333333333e-06, 'epoch': 0.02} 1%| | 2/375 [01:01<3:09:02, 30.41s/it] 1%| | 3/375 [01:28<2:59:19, 28.92s/it] {'loss': 1.4498, 'learning_rate': 5e-06, 'epoch': 0.02} 1%| | 3/375 [01:29<2:59:19, 28.92s/it] 1%| | 4/375 [01:55<2:53:16, 28.02s/it] {'loss': 1.4186, 'learning_rate': 6.666666666666667e-06, 'epoch': 0.03} 1%| | 4/375 [01:55<2:53:16, 28.02s/it] 1%|▏ | 5/375 [02:22<2:50:40, 27.68s/it] {'loss': 1.3586, 'learning_rate': 8.333333333333334e-06, 'epoch': 0.04} 1%|▏ | 5/375 [02:22<2:50:40, 27.68s/it] 2%|▏ | 6/375 [02:49<2:49:27, 27.55s/it] {'loss': 1.2792, 'learning_rate': 1e-05, 'epoch': 0.05} 2%|▏ | 6/375 [02:49<2:49:27, 27.55s/it] 2%|▏ | 7/375 [03:16<2:47:16, 27.27s/it] {'loss': 1.2723, 'learning_rate': 1.1666666666666668e-05, 'epoch': 0.06} 2%|▏ | 7/375 [03:16<2:47:16, 27.27s/it] 2%|▏ | 8/375 [03:43<2:45:46, 27.10s/it] {'loss': 1.2244, 'learning_rate': 1.3333333333333333e-05, 'epoch': 0.06} 2%|▏ | 8/375 [03:43<2:45:46, 27.10s/it] 2%|▏ | 9/375 [04:11<2:46:38, 27.32s/it] {'loss': 1.1807, 'learning_rate': 1.5000000000000002e-05, 'epoch': 0.07} 2%|▏ | 9/375 [04:11<2:46:38, 27.32s/it] 3%|▎ | 10/375 
[04:39<2:47:43, 27.57s/it] {'loss': 1.1877, 'learning_rate': 1.6666666666666667e-05, 'epoch': 0.08} 3%|▎ | 10/375 [04:39<2:47:43, 27.57s/it] 3%|▎ | 11/375 [05:06<2:47:00, 27.53s/it] {'loss': 1.163, 'learning_rate': 1.8333333333333333e-05, 'epoch': 0.09} 3%|▎ | 11/375 [05:06<2:47:00, 27.53s/it] 3%|▎ | 12/375 [05:33<2:44:22, 27.17s/it] {'loss': 1.1489, 'learning_rate': 2e-05, 'epoch': 0.1} 3%|▎ | 12/375 [05:33<2:44:22, 27.17s/it] 3%|▎ | 13/375 [05:58<2:41:07, 26.70s/it] {'loss': 1.1348, 'learning_rate': 1.9999625498303936e-05, 'epoch': 0.1} 3%|▎ | 13/375 [05:58<2:41:07, 26.70s/it] 4%|▎ | 14/375 [06:25<2:40:55, 26.75s/it] {'loss': 1.1368, 'learning_rate': 1.999850202126604e-05, 'epoch': 0.11} 4%|▎ | 14/375 [06:25<2:40:55, 26.75s/it] 4%|▍ | 15/375 [06:51<2:39:45, 26.63s/it] {'loss': 1.1382, 'learning_rate': 1.9996629653035128e-05, 'epoch': 0.12} 4%|▍ | 15/375 [06:51<2:39:45, 26.63s/it] 4%|▍ | 16/375 [07:18<2:38:54, 26.56s/it] {'loss': 1.1297, 'learning_rate': 1.999400853385221e-05, 'epoch': 0.13} 4%|▍ | 16/375 [07:18<2:38:54, 26.56s/it] 5%|▍ | 17/375 [07:44<2:38:16, 26.53s/it] {'loss': 1.1365, 'learning_rate': 1.9990638860040007e-05, 'epoch': 0.14} 5%|▍ | 17/375 [07:44<2:38:16, 26.53s/it] 5%|▍ | 18/375 [08:10<2:36:51, 26.36s/it] {'loss': 1.1292, 'learning_rate': 1.9986520883988233e-05, 'epoch': 0.14} 5%|▍ | 18/375 [08:10<2:36:51, 26.36s/it] 5%|▌ | 19/375 [08:37<2:37:53, 26.61s/it] {'loss': 1.1278, 'learning_rate': 1.9981654914134684e-05, 'epoch': 0.15} 5%|▌ | 19/375 [08:37<2:37:53, 26.61s/it] 5%|▌ | 20/375 [09:04<2:37:25, 26.61s/it] {'loss': 1.106, 'learning_rate': 1.9976041314942156e-05, 'epoch': 0.16} 5%|▌ | 20/375 [09:04<2:37:25, 26.61s/it] 6%|▌ | 21/375 [09:32<2:40:19, 27.17s/it] {'loss': 1.1053, 'learning_rate': 1.9969680506871138e-05, 'epoch': 0.17} 6%|▌ | 21/375 [09:32<2:40:19, 27.17s/it] 6%|▌ | 22/375 [10:01<2:42:30, 27.62s/it] {'loss': 1.11, 'learning_rate': 1.99625729663483e-05, 'epoch': 0.18} 6%|▌ | 22/375 [10:01<2:42:30, 27.62s/it] 6%|▌ | 23/375 
[10:31<2:45:52, 28.27s/it] {'loss': 1.1139, 'learning_rate': 1.9954719225730847e-05, 'epoch': 0.18} 6%|▌ | 23/375 [10:31<2:45:52, 28.27s/it] 6%|▋ | 24/375 [10:57<2:41:57, 27.69s/it] {'loss': 1.1226, 'learning_rate': 1.9946119873266615e-05, 'epoch': 0.19} 6%|▋ | 24/375 [10:57<2:41:57, 27.69s/it] 7%|▋ | 25/375 [11:23<2:37:41, 27.03s/it] {'loss': 1.046, 'learning_rate': 1.9936775553050017e-05, 'epoch': 0.2} 7%|▋ | 25/375 [11:23<2:37:41, 27.03s/it] 7%|▋ | 26/375 [11:51<2:38:52, 27.31s/it] {'loss': 1.1061, 'learning_rate': 1.9926686964973813e-05, 'epoch': 0.21} 7%|▋ | 26/375 [11:51<2:38:52, 27.31s/it] 7%|▋ | 27/375 [12:18<2:38:21, 27.30s/it] {'loss': 1.0507, 'learning_rate': 1.9915854864676665e-05, 'epoch': 0.22} 7%|▋ | 27/375 [12:18<2:38:21, 27.30s/it] 7%|▋ | 28/375 [12:45<2:36:56, 27.14s/it] {'loss': 1.0609, 'learning_rate': 1.9904280063486563e-05, 'epoch': 0.22} 7%|▋ | 28/375 [12:45<2:36:56, 27.14s/it] 8%|▊ | 29/375 [13:10<2:33:23, 26.60s/it] {'loss': 1.0782, 'learning_rate': 1.9891963428360043e-05, 'epoch': 0.23} 8%|▊ | 29/375 [13:10<2:33:23, 26.60s/it] 8%|▊ | 30/375 [13:35<2:30:15, 26.13s/it] {'loss': 1.1025, 'learning_rate': 1.9878905881817254e-05, 'epoch': 0.24} 8%|▊ | 30/375 [13:35<2:30:15, 26.13s/it] 8%|▊ | 31/375 [14:02<2:30:26, 26.24s/it] {'loss': 1.0876, 'learning_rate': 1.9865108401872856e-05, 'epoch': 0.25} 8%|▊ | 31/375 [14:02<2:30:26, 26.24s/it] 9%|▊ | 32/375 [14:30<2:32:54, 26.75s/it] {'loss': 1.1006, 'learning_rate': 1.9850572021962788e-05, 'epoch': 0.25} 9%|▊ | 32/375 [14:30<2:32:54, 26.75s/it] 9%|▉ | 33/375 [14:57<2:33:06, 26.86s/it] {'loss': 1.0444, 'learning_rate': 1.9835297830866827e-05, 'epoch': 0.26} 9%|▉ | 33/375 [14:57<2:33:06, 26.86s/it] 9%|▉ | 34/375 [15:25<2:34:30, 27.19s/it] {'loss': 1.0563, 'learning_rate': 1.9819286972627066e-05, 'epoch': 0.27} 9%|▉ | 34/375 [15:25<2:34:30, 27.19s/it] 9%|▉ | 35/375 [15:50<2:31:06, 26.67s/it] {'loss': 1.0656, 'learning_rate': 1.980254064646223e-05, 'epoch': 0.28} 9%|▉ | 35/375 [15:50<2:31:06, 26.67s/it] 
10%|▉ | 36/375 [16:15<2:28:09, 26.22s/it] {'loss': 1.0823, 'learning_rate': 1.9785060106677818e-05, 'epoch': 0.29} 10%|▉ | 36/375 [16:15<2:28:09, 26.22s/it] 10%|▉ | 37/375 [16:44<2:31:09, 26.83s/it] {'loss': 1.0634, 'learning_rate': 1.976684666257219e-05, 'epoch': 0.29} 10%|▉ | 37/375 [16:44<2:31:09, 26.83s/it] 10%|█ | 38/375 [17:10<2:29:40, 26.65s/it] {'loss': 1.0581, 'learning_rate': 1.9747901678338496e-05, 'epoch': 0.3} 10%|█ | 38/375 [17:10<2:29:40, 26.65s/it] 10%|█ | 39/375 [17:36<2:28:36, 26.54s/it] {'loss': 1.0588, 'learning_rate': 1.9728226572962474e-05, 'epoch': 0.31} 10%|█ | 39/375 [17:36<2:28:36, 26.54s/it] 11%|█ | 40/375 [18:02<2:27:32, 26.42s/it] {'loss': 1.0704, 'learning_rate': 1.9707822820116193e-05, 'epoch': 0.32} 11%|█ | 40/375 [18:02<2:27:32, 26.42s/it] 11%|█ | 41/375 [18:27<2:24:37, 25.98s/it] {'loss': 1.0822, 'learning_rate': 1.9686691948047665e-05, 'epoch': 0.33} 11%|█ | 41/375 [18:27<2:24:37, 25.98s/it] 11%|█ | 42/375 [18:54<2:26:21, 26.37s/it] {'loss': 1.0621, 'learning_rate': 1.966483553946637e-05, 'epoch': 0.33} 11%|█ | 42/375 [18:54<2:26:21, 26.37s/it] 11%|█▏ | 43/375 [19:22<2:27:10, 26.60s/it] {'loss': 1.0648, 'learning_rate': 1.964225523142473e-05, 'epoch': 0.34} 11%|█▏ | 43/375 [19:22<2:27:10, 26.60s/it] 12%|█▏ | 44/375 [19:49<2:27:20, 26.71s/it] {'loss': 1.0837, 'learning_rate': 1.9618952715195476e-05, 'epoch': 0.35} 12%|█▏ | 44/375 [19:49<2:27:20, 26.71s/it] 12%|█▏ | 45/375 [20:14<2:24:19, 26.24s/it] {'loss': 1.0543, 'learning_rate': 1.9594929736144978e-05, 'epoch': 0.36} 12%|█▏ | 45/375 [20:14<2:24:19, 26.24s/it] 12%|█▏ | 46/375 [20:40<2:23:58, 26.26s/it] {'loss': 1.0535, 'learning_rate': 1.9570188093602512e-05, 'epoch': 0.37} 12%|█▏ | 46/375 [20:40<2:23:58, 26.26s/it] 13%|█▎ | 47/375 [21:07<2:25:15, 26.57s/it] {'loss': 1.0533, 'learning_rate': 1.95447296407255e-05, 'epoch': 0.37} 13%|█▎ | 47/375 [21:07<2:25:15, 26.57s/it] 13%|█▎ | 48/375 [21:34<2:24:37, 26.54s/it] {'loss': 1.0191, 'learning_rate': 1.9518556284360696e-05, 'epoch': 
0.38} 13%|█▎ | 48/375 [21:34<2:24:37, 26.54s/it] 13%|█▎ | 49/375 [22:00<2:23:45, 26.46s/it] {'loss': 1.0395, 'learning_rate': 1.9491669984901377e-05, 'epoch': 0.39} 13%|█▎ | 49/375 [22:00<2:23:45, 26.46s/it] 13%|█▎ | 50/375 [22:27<2:24:16, 26.64s/it] {'loss': 1.0173, 'learning_rate': 1.9464072756140487e-05, 'epoch': 0.4} 13%|█▎ | 50/375 [22:27<2:24:16, 26.64s/it] 14%|█▎ | 51/375 [22:54<2:24:30, 26.76s/it] {'loss': 1.048, 'learning_rate': 1.9435766665119823e-05, 'epoch': 0.41} 14%|█▎ | 51/375 [22:54<2:24:30, 26.76s/it] 14%|█▍ | 52/375 [23:20<2:23:06, 26.58s/it] {'loss': 1.0496, 'learning_rate': 1.9406753831975202e-05, 'epoch': 0.41} 14%|█▍ | 52/375 [23:20<2:23:06, 26.58s/it] 14%|█▍ | 53/375 [23:47<2:22:41, 26.59s/it] {'loss': 1.0616, 'learning_rate': 1.9377036429777673e-05, 'epoch': 0.42} 14%|█▍ | 53/375 [23:47<2:22:41, 26.59s/it] 14%|█▍ | 54/375 [24:13<2:22:08, 26.57s/it] {'loss': 1.0504, 'learning_rate': 1.934661668437073e-05, 'epoch': 0.43} 14%|█▍ | 54/375 [24:13<2:22:08, 26.57s/it] 15%|█▍ | 55/375 [24:39<2:19:57, 26.24s/it] {'loss': 1.0399, 'learning_rate': 1.9315496874203637e-05, 'epoch': 0.44} 15%|█▍ | 55/375 [24:39<2:19:57, 26.24s/it] 15%|█▍ | 56/375 [25:06<2:21:31, 26.62s/it] {'loss': 1.0416, 'learning_rate': 1.9283679330160726e-05, 'epoch': 0.45} 15%|█▍ | 56/375 [25:06<2:21:31, 26.62s/it] 15%|█▌ | 57/375 [25:33<2:20:52, 26.58s/it] {'loss': 1.0175, 'learning_rate': 1.9251166435386837e-05, 'epoch': 0.45} 15%|█▌ | 57/375 [25:33<2:20:52, 26.58s/it] 15%|█▌ | 58/375 [25:59<2:19:53, 26.48s/it] {'loss': 1.0285, 'learning_rate': 1.921796062510882e-05, 'epoch': 0.46} 15%|█▌ | 58/375 [25:59<2:19:53, 26.48s/it] 16%|█▌ | 59/375 [26:26<2:19:37, 26.51s/it] {'loss': 1.0312, 'learning_rate': 1.9184064386453127e-05, 'epoch': 0.47} 16%|█▌ | 59/375 [26:26<2:19:37, 26.51s/it] 16%|█▌ | 60/375 [26:54<2:21:29, 26.95s/it] {'loss': 1.0463, 'learning_rate': 1.9149480258259535e-05, 'epoch': 0.48} 16%|█▌ | 60/375 [26:54<2:21:29, 26.95s/it] 16%|█▋ | 61/375 [27:20<2:20:19, 26.81s/it] 
{'loss': 1.0521, 'learning_rate': 1.911421083089097e-05, 'epoch': 0.49} 16%|█▋ | 61/375 [27:20<2:20:19, 26.81s/it] 17%|█▋ | 62/375 [27:46<2:18:23, 26.53s/it] {'loss': 1.0276, 'learning_rate': 1.907825874603951e-05, 'epoch': 0.49} 17%|█▋ | 62/375 [27:46<2:18:23, 26.53s/it] 17%|█▋ | 63/375 [28:14<2:20:19, 26.99s/it] {'loss': 1.0256, 'learning_rate': 1.9041626696528503e-05, 'epoch': 0.5} 17%|█▋ | 63/375 [28:14<2:20:19, 26.99s/it] 17%|█▋ | 64/375 [28:43<2:22:18, 27.46s/it] {'loss': 1.0403, 'learning_rate': 1.9004317426110888e-05, 'epoch': 0.51} 17%|█▋ | 64/375 [28:43<2:22:18, 27.46s/it] 17%|█▋ | 65/375 [29:08<2:19:18, 26.96s/it] {'loss': 1.0687, 'learning_rate': 1.8966333729263674e-05, 'epoch': 0.52} 17%|█▋ | 65/375 [29:08<2:19:18, 26.96s/it] 18%|█▊ | 66/375 [29:37<2:21:14, 27.43s/it] {'loss': 1.0394, 'learning_rate': 1.892767845097864e-05, 'epoch': 0.53} 18%|█▊ | 66/375 [29:37<2:21:14, 27.43s/it] 18%|█▊ | 67/375 [30:03<2:18:12, 26.92s/it] {'loss': 0.9965, 'learning_rate': 1.8888354486549238e-05, 'epoch': 0.53} 18%|█▊ | 67/375 [30:03<2:18:12, 26.92s/it] 18%|█▊ | 68/375 [30:30<2:17:57, 26.96s/it] {'loss': 1.0397, 'learning_rate': 1.8848364781353744e-05, 'epoch': 0.54} 18%|█▊ | 68/375 [30:30<2:17:57, 26.96s/it] 18%|█▊ | 69/375 [30:55<2:15:02, 26.48s/it] {'loss': 1.027, 'learning_rate': 1.8807712330634645e-05, 'epoch': 0.55} 18%|█▊ | 69/375 [30:55<2:15:02, 26.48s/it] 19%|█▊ | 70/375 [31:24<2:18:28, 27.24s/it] {'loss': 1.0201, 'learning_rate': 1.8766400179274287e-05, 'epoch': 0.56} 19%|█▊ | 70/375 [31:24<2:18:28, 27.24s/it] 19%|█▉ | 71/375 [31:53<2:20:45, 27.78s/it] {'loss': 1.0081, 'learning_rate': 1.8724431421566822e-05, 'epoch': 0.57} 19%|█▉ | 71/375 [31:53<2:20:45, 27.78s/it] 19%|█▉ | 72/375 [32:20<2:18:55, 27.51s/it] {'loss': 0.9956, 'learning_rate': 1.868180920098644e-05, 'epoch': 0.57} 19%|█▉ | 72/375 [32:20<2:18:55, 27.51s/it] 19%|█▉ | 73/375 [32:46<2:16:38, 27.15s/it] {'loss': 1.0127, 'learning_rate': 1.8638536709951916e-05, 'epoch': 0.58} 19%|█▉ | 73/375 
[32:46<2:16:38, 27.15s/it] 20%|█▉ | 74/375 [33:15<2:18:31, 27.61s/it] {'loss': 0.9887, 'learning_rate': 1.8594617189587515e-05, 'epoch': 0.59} 20%|█▉ | 74/375 [33:15<2:18:31, 27.61s/it] 20%|██ | 75/375 [33:41<2:15:38, 27.13s/it] {'loss': 1.0266, 'learning_rate': 1.8550053929480202e-05, 'epoch': 0.6} 20%|██ | 75/375 [33:41<2:15:38, 27.13s/it] 20%|██ | 76/375 [34:08<2:14:44, 27.04s/it] {'loss': 1.0102, 'learning_rate': 1.8504850267433278e-05, 'epoch': 0.61} 20%|██ | 76/375 [34:08<2:14:44, 27.04s/it] 21%|██ | 77/375 [34:34<2:12:26, 26.67s/it] {'loss': 0.9728, 'learning_rate': 1.8459009589216364e-05, 'epoch': 0.61} 21%|██ | 77/375 [34:34<2:12:26, 26.67s/it] 21%|██ | 78/375 [35:01<2:12:49, 26.83s/it] {'loss': 0.9944, 'learning_rate': 1.8412535328311813e-05, 'epoch': 0.62} 21%|██ | 78/375 [35:01<2:12:49, 26.83s/it] 21%|██ | 79/375 [35:28<2:12:22, 26.83s/it] {'loss': 1.0591, 'learning_rate': 1.8365430965657527e-05, 'epoch': 0.63} 21%|██ | 79/375 [35:28<2:12:22, 26.83s/it] 21%|██▏ | 80/375 [35:56<2:14:02, 27.26s/it] {'loss': 1.0032, 'learning_rate': 1.8317700029386245e-05, 'epoch': 0.64} 21%|██▏ | 80/375 [35:56<2:14:02, 27.26s/it] 22%|██▏ | 81/375 [36:23<2:12:46, 27.10s/it] {'loss': 1.0094, 'learning_rate': 1.826934609456129e-05, 'epoch': 0.65} 22%|██▏ | 81/375 [36:23<2:12:46, 27.10s/it] 22%|██▏ | 82/375 [36:50<2:12:50, 27.20s/it] {'loss': 1.0118, 'learning_rate': 1.8220372782908778e-05, 'epoch': 0.65} 22%|██▏ | 82/375 [36:50<2:12:50, 27.20s/it] 22%|██▏ | 83/375 [37:18<2:14:03, 27.55s/it] {'loss': 1.0254, 'learning_rate': 1.8170783762546363e-05, 'epoch': 0.66} 22%|██▏ | 83/375 [37:18<2:14:03, 27.55s/it] 22%|██▏ | 84/375 [37:45<2:12:49, 27.39s/it] {'loss': 1.0108, 'learning_rate': 1.8120582747708503e-05, 'epoch': 0.67} 22%|██▏ | 84/375 [37:46<2:12:49, 27.39s/it] 23%|██▎ | 85/375 [38:12<2:11:16, 27.16s/it] {'loss': 0.9887, 'learning_rate': 1.8069773498468224e-05, 'epoch': 0.68} 23%|██▎ | 85/375 [38:12<2:11:16, 27.16s/it] 23%|██▎ | 86/375 [38:40<2:11:31, 27.31s/it] {'loss': 
1.0241, 'learning_rate': 1.8018359820455535e-05, 'epoch': 0.69} 23%|██▎ | 86/375 [38:40<2:11:31, 27.31s/it] 23%|██▎ | 87/375 [39:10<2:15:52, 28.31s/it] {'loss': 1.0179, 'learning_rate': 1.796634556457236e-05, 'epoch': 0.69} 23%|██▎ | 87/375 [39:10<2:15:52, 28.31s/it] 23%|██▎ | 88/375 [39:38<2:14:04, 28.03s/it] {'loss': 1.0357, 'learning_rate': 1.791373462670411e-05, 'epoch': 0.7} 23%|██▎ | 88/375 [39:38<2:14:04, 28.03s/it] 24%|██▎ | 89/375 [40:04<2:11:18, 27.55s/it] {'loss': 0.9963, 'learning_rate': 1.7860530947427878e-05, 'epoch': 0.71} 24%|██▎ | 89/375 [40:04<2:11:18, 27.55s/it] 24%|██▍ | 90/375 [40:30<2:08:42, 27.10s/it] {'loss': 1.0084, 'learning_rate': 1.780673851171728e-05, 'epoch': 0.72} 24%|██▍ | 90/375 [40:30<2:08:42, 27.10s/it] 24%|██▍ | 91/375 [40:57<2:07:08, 26.86s/it] {'loss': 1.0178, 'learning_rate': 1.7752361348644012e-05, 'epoch': 0.73} 24%|██▍ | 91/375 [40:57<2:07:08, 26.86s/it] 25%|██▍ | 92/375 [41:23<2:06:39, 26.85s/it] {'loss': 0.9936, 'learning_rate': 1.769740353107602e-05, 'epoch': 0.73} 25%|██▍ | 92/375 [41:23<2:06:39, 26.85s/it] 25%|██▍ | 93/375 [41:50<2:05:43, 26.75s/it] {'loss': 0.9838, 'learning_rate': 1.7641869175372493e-05, 'epoch': 0.74} 25%|██▍ | 93/375 [41:50<2:05:43, 26.75s/it] 25%|██▌ | 94/375 [42:17<2:05:26, 26.78s/it] {'loss': 1.0163, 'learning_rate': 1.7585762441075504e-05, 'epoch': 0.75} 25%|██▌ | 94/375 [42:17<2:05:26, 26.78s/it] 25%|██▌ | 95/375 [42:43<2:04:10, 26.61s/it] {'loss': 0.9954, 'learning_rate': 1.752908753059849e-05, 'epoch': 0.76} 25%|██▌ | 95/375 [42:43<2:04:10, 26.61s/it] 26%|██▌ | 96/375 [43:10<2:03:51, 26.64s/it] {'loss': 1.0138, 'learning_rate': 1.7471848688911465e-05, 'epoch': 0.76} 26%|██▌ | 96/375 [43:10<2:03:51, 26.64s/it] 26%|██▌ | 97/375 [43:37<2:04:52, 26.95s/it] {'loss': 0.997, 'learning_rate': 1.7414050203223092e-05, 'epoch': 0.77} 26%|██▌ | 97/375 [43:37<2:04:52, 26.95s/it] 26%|██▌ | 98/375 [44:04<2:04:38, 27.00s/it] {'loss': 1.011, 'learning_rate': 1.735569640265955e-05, 'epoch': 0.78} 26%|██▌ | 
98/375 [44:04<2:04:38, 27.00s/it] 26%|██▋ | 99/375 [44:31<2:03:08, 26.77s/it] {'loss': 0.9625, 'learning_rate': 1.72967916579403e-05, 'epoch': 0.79} 26%|██▋ | 99/375 [44:31<2:03:08, 26.77s/it] 27%|██▋ | 100/375 [44:58<2:02:59, 26.83s/it] {'loss': 1.0054, 'learning_rate': 1.72373403810507e-05, 'epoch': 0.8} 27%|██▋ | 100/375 [44:58<2:02:59, 26.83s/it] 27%|██▋ | 101/375 [45:24<2:01:17, 26.56s/it] {'loss': 1.0169, 'learning_rate': 1.7177347024911562e-05, 'epoch': 0.8} 27%|██▋ | 101/375 [45:24<2:01:17, 26.56s/it] 27%|██▋ | 102/375 [45:51<2:01:31, 26.71s/it] {'loss': 0.9911, 'learning_rate': 1.7116816083045603e-05, 'epoch': 0.81} 27%|██▋ | 102/375 [45:51<2:01:31, 26.71s/it] 27%|██▋ | 103/375 [46:17<2:00:20, 26.55s/it] {'loss': 1.0246, 'learning_rate': 1.7055752089240907e-05, 'epoch': 0.82} 27%|██▋ | 103/375 [46:17<2:00:20, 26.55s/it] 28%|██▊ | 104/375 [46:43<1:59:58, 26.56s/it] {'loss': 1.0024, 'learning_rate': 1.6994159617211318e-05, 'epoch': 0.83} 28%|██▊ | 104/375 [46:43<1:59:58, 26.56s/it] 28%|██▊ | 105/375 [47:09<1:57:59, 26.22s/it] {'loss': 0.9886, 'learning_rate': 1.6932043280253892e-05, 'epoch': 0.84} 28%|██▊ | 105/375 [47:09<1:57:59, 26.22s/it] 28%|██▊ | 106/375 [47:36<1:58:41, 26.47s/it] {'loss': 0.9775, 'learning_rate': 1.686940773090333e-05, 'epoch': 0.84} 28%|██▊ | 106/375 [47:36<1:58:41, 26.47s/it] 29%|██▊ | 107/375 [48:02<1:58:17, 26.48s/it] {'loss': 0.9755, 'learning_rate': 1.6806257660583534e-05, 'epoch': 0.85} 29%|██▊ | 107/375 [48:02<1:58:17, 26.48s/it] 29%|██▉ | 108/375 [48:29<1:57:36, 26.43s/it] {'loss': 1.0068, 'learning_rate': 1.6742597799256182e-05, 'epoch': 0.86} 29%|██▉ | 108/375 [48:29<1:57:36, 26.43s/it] 29%|██▉ | 109/375 [48:54<1:55:35, 26.07s/it] {'loss': 0.978, 'learning_rate': 1.6678432915066488e-05, 'epoch': 0.87} 29%|██▉ | 109/375 [48:54<1:55:35, 26.07s/it] 29%|██▉ | 110/375 [49:22<1:58:13, 26.77s/it] {'loss': 1.002, 'learning_rate': 1.6613767813986045e-05, 'epoch': 0.88} 29%|██▉ | 110/375 [49:22<1:58:13, 26.77s/it] 30%|██▉ | 111/375 
[49:48<1:56:36, 26.50s/it] {'loss': 0.9859, 'learning_rate': 1.6548607339452853e-05, 'epoch': 0.88} 30%|██▉ | 111/375 [49:48<1:56:36, 26.50s/it] 30%|██▉ | 112/375 [50:14<1:55:33, 26.36s/it] {'loss': 0.9753, 'learning_rate': 1.648295637200856e-05, 'epoch': 0.89} 30%|██▉ | 112/375 [50:14<1:55:33, 26.36s/it] 30%|███ | 113/375 [50:41<1:55:28, 26.45s/it] {'loss': 0.9771, 'learning_rate': 1.64168198289329e-05, 'epoch': 0.9} 30%|███ | 113/375 [50:41<1:55:28, 26.45s/it] 30%|███ | 114/375 [51:08<1:55:57, 26.66s/it] {'loss': 0.9843, 'learning_rate': 1.6350202663875385e-05, 'epoch': 0.91} 30%|███ | 114/375 [51:08<1:55:57, 26.66s/it] 31%|███ | 115/375 [51:35<1:55:32, 26.66s/it] {'loss': 0.9507, 'learning_rate': 1.628310986648427e-05, 'epoch': 0.92} 31%|███ | 115/375 [51:35<1:55:32, 26.66s/it] 31%|███ | 116/375 [52:01<1:53:56, 26.40s/it] {'loss': 0.9875, 'learning_rate': 1.621554646203284e-05, 'epoch': 0.92} 31%|███ | 116/375 [52:01<1:53:56, 26.40s/it] 31%|███ | 117/375 [52:29<1:55:39, 26.90s/it] {'loss': 0.9724, 'learning_rate': 1.614751751104301e-05, 'epoch': 0.93} 31%|███ | 117/375 [52:29<1:55:39, 26.90s/it] 31%|███▏ | 118/375 [52:55<1:54:51, 26.81s/it] {'loss': 0.9572, 'learning_rate': 1.607902810890628e-05, 'epoch': 0.94} 31%|███▏ | 118/375 [52:55<1:54:51, 26.81s/it] 32%|███▏ | 119/375 [53:23<1:55:11, 27.00s/it] {'loss': 0.9905, 'learning_rate': 1.601008338550211e-05, 'epoch': 0.95} 32%|███▏ | 119/375 [53:23<1:55:11, 27.00s/it] 32%|███▏ | 120/375 [53:51<1:56:46, 27.48s/it] {'loss': 0.9758, 'learning_rate': 1.5940688504813664e-05, 'epoch': 0.96} 32%|███▏ | 120/375 [53:51<1:56:46, 27.48s/it] 32%|███▏ | 121/375 [54:17<1:53:58, 26.92s/it] {'loss': 0.9927, 'learning_rate': 1.5870848664541046e-05, 'epoch': 0.96} 32%|███▏ | 121/375 [54:17<1:53:58, 26.92s/it] 33%|███▎ | 122/375 [54:44<1:53:54, 27.01s/it] {'loss': 0.9696, 'learning_rate': 1.5800569095711983e-05, 'epoch': 0.97} 33%|███▎ | 122/375 [54:44<1:53:54, 27.01s/it] 33%|███▎ | 123/375 [55:10<1:52:37, 26.81s/it] {'loss': 
0.9775, 'learning_rate': 1.5729855062290024e-05, 'epoch': 0.98} 33%|███▎ | 123/375 [55:10<1:52:37, 26.81s/it] 33%|███▎ | 124/375 [55:38<1:52:44, 26.95s/it] {'loss': 0.9904, 'learning_rate': 1.565871186078025e-05, 'epoch': 0.99} 33%|███▎ | 124/375 [55:38<1:52:44, 26.95s/it] 33%|███▎ | 125/375 [56:05<1:52:53, 27.09s/it] {'loss': 0.9796, 'learning_rate': 1.55871448198326e-05, 'epoch': 1.0} 33%|███▎ | 125/375 [56:05<1:52:53, 27.09s/it] 34%|███▎ | 126/375 [56:38<2:00:05, 28.94s/it] {'loss': 0.8877, 'learning_rate': 1.551515929984271e-05, 'epoch': 1.0} 34%|███▎ | 126/375 [56:38<2:00:05, 28.94s/it] 34%|███▍ | 127/375 [57:04<1:55:58, 28.06s/it] {'loss': 0.7575, 'learning_rate': 1.5442760692550443e-05, 'epoch': 1.01} 34%|███▍ | 127/375 [57:04<1:55:58, 28.06s/it] 34%|███▍ | 128/375 [57:31<1:53:29, 27.57s/it] {'loss': 0.7759, 'learning_rate': 1.5369954420636048e-05, 'epoch': 1.02} 34%|███▍ | 128/375 [57:31<1:53:29, 27.57s/it] 34%|███▍ | 129/375 [57:56<1:49:37, 26.74s/it] {'loss': 0.7747, 'learning_rate': 1.529674593731399e-05, 'epoch': 1.03} 34%|███▍ | 129/375 [57:56<1:49:37, 26.74s/it] 35%|███▍ | 130/375 [58:23<1:49:53, 26.91s/it] {'loss': 0.7596, 'learning_rate': 1.5223140725924494e-05, 'epoch': 1.04} 35%|███▍ | 130/375 [58:23<1:49:53, 26.91s/it] 35%|███▍ | 131/375 [58:49<1:48:48, 26.76s/it] {'loss': 0.7461, 'learning_rate': 1.5149144299522874e-05, 'epoch': 1.04} 35%|███▍ | 131/375 [58:49<1:48:48, 26.76s/it] 35%|███▌ | 132/375 [59:16<1:47:57, 26.65s/it] {'loss': 0.7336, 'learning_rate': 1.5074762200466557e-05, 'epoch': 1.05} 35%|███▌ | 132/375 [59:16<1:47:57, 26.65s/it] 35%|███▌ | 133/375 [59:42<1:46:35, 26.43s/it] {'loss': 0.7054, 'learning_rate': 1.5000000000000002e-05, 'epoch': 1.06} 35%|███▌ | 133/375 [59:42<1:46:35, 26.43s/it] 36%|███▌ | 134/375 [1:00:09<1:46:48, 26.59s/it] {'loss': 0.7358, 'learning_rate': 1.4924863297837378e-05, 'epoch': 1.07} 36%|███▌ | 134/375 [1:00:09<1:46:48, 26.59s/it] 36%|███▌ | 135/375 [1:00:35<1:46:00, 26.50s/it] {'loss': 0.7282, 
'learning_rate': 1.4849357721743169e-05, 'epoch': 1.08} 36%|███▌ | 135/375 [1:00:35<1:46:00, 26.50s/it] 36%|███▋ | 136/375 [1:01:01<1:45:37, 26.52s/it] {'loss': 0.7195, 'learning_rate': 1.4773488927110633e-05, 'epoch': 1.08} 36%|███▋ | 136/375 [1:01:01<1:45:37, 26.52s/it] 37%|███▋ | 137/375 [1:01:30<1:47:41, 27.15s/it] {'loss': 0.7279, 'learning_rate': 1.4697262596538227e-05, 'epoch': 1.09} 37%|███▋ | 137/375 [1:01:30<1:47:41, 27.15s/it] 37%|███▋ | 138/375 [1:01:58<1:48:05, 27.36s/it] {'loss': 0.7313, 'learning_rate': 1.4620684439403962e-05, 'epoch': 1.1} 37%|███▋ | 138/375 [1:01:58<1:48:05, 27.36s/it] 37%|███▋ | 139/375 [1:02:25<1:47:06, 27.23s/it] {'loss': 0.7415, 'learning_rate': 1.454376019143779e-05, 'epoch': 1.11} 37%|███▋ | 139/375 [1:02:25<1:47:06, 27.23s/it] 37%|███▋ | 140/375 [1:02:51<1:44:56, 26.79s/it] {'loss': 0.7305, 'learning_rate': 1.4466495614291977e-05, 'epoch': 1.12} 37%|███▋ | 140/375 [1:02:51<1:44:56, 26.79s/it] 38%|███▊ | 141/375 [1:03:16<1:43:16, 26.48s/it] {'loss': 0.7545, 'learning_rate': 1.438889649510956e-05, 'epoch': 1.12} 38%|███▊ | 141/375 [1:03:16<1:43:16, 26.48s/it] 38%|███▊ | 142/375 [1:03:42<1:41:51, 26.23s/it] {'loss': 0.7307, 'learning_rate': 1.4310968646090884e-05, 'epoch': 1.13} 38%|███▊ | 142/375 [1:03:42<1:41:51, 26.23s/it] 38%|███▊ | 143/375 [1:04:08<1:40:35, 26.02s/it] {'loss': 0.7327, 'learning_rate': 1.423271790405828e-05, 'epoch': 1.14} 38%|███▊ | 143/375 [1:04:08<1:40:35, 26.02s/it] 38%|███▊ | 144/375 [1:04:34<1:40:26, 26.09s/it] {'loss': 0.7417, 'learning_rate': 1.4154150130018867e-05, 'epoch': 1.15} 38%|███▊ | 144/375 [1:04:34<1:40:26, 26.09s/it] 39%|███▊ | 145/375 [1:04:59<1:39:25, 25.94s/it] {'loss': 0.7491, 'learning_rate': 1.4075271208725572e-05, 'epoch': 1.16} 39%|███▊ | 145/375 [1:04:59<1:39:25, 25.94s/it] 39%|███▉ | 146/375 [1:05:27<1:41:20, 26.55s/it] {'loss': 0.7422, 'learning_rate': 1.3996087048236357e-05, 'epoch': 1.16} 39%|███▉ | 146/375 [1:05:27<1:41:20, 26.55s/it] 39%|███▉ | 147/375 [1:05:52<1:38:55, 
26.03s/it] {'loss': 0.7551, 'learning_rate': 1.3916603579471705e-05, 'epoch': 1.17} 39%|███▉ | 147/375 [1:05:52<1:38:55, 26.03s/it] 39%|███▉ | 148/375 [1:06:19<1:38:54, 26.14s/it] {'loss': 0.7356, 'learning_rate': 1.3836826755770386e-05, 'epoch': 1.18} 39%|███▉ | 148/375 [1:06:19<1:38:54, 26.14s/it] 40%|███▉ | 149/375 [1:06:45<1:38:38, 26.19s/it] {'loss': 0.7139, 'learning_rate': 1.3756762552443555e-05, 'epoch': 1.19} 40%|███▉ | 149/375 [1:06:45<1:38:38, 26.19s/it] 40%|████ | 150/375 [1:07:11<1:38:21, 26.23s/it] {'loss': 0.7411, 'learning_rate': 1.3676416966327201e-05, 'epoch': 1.2} 40%|████ | 150/375 [1:07:11<1:38:21, 26.23s/it] 40%|████ | 151/375 [1:07:38<1:38:04, 26.27s/it] {'loss': 0.7511, 'learning_rate': 1.3595796015332986e-05, 'epoch': 1.2} 40%|████ | 151/375 [1:07:38<1:38:04, 26.27s/it] 41%|████ | 152/375 [1:08:07<1:40:36, 27.07s/it] {'loss': 0.747, 'learning_rate': 1.3514905737997474e-05, 'epoch': 1.21} 41%|████ | 152/375 [1:08:07<1:40:36, 27.07s/it] 41%|████ | 153/375 [1:08:32<1:37:54, 26.46s/it] {'loss': 0.7302, 'learning_rate': 1.3433752193029888e-05, 'epoch': 1.22} 41%|████ | 153/375 [1:08:32<1:37:54, 26.46s/it] 41%|████ | 154/375 [1:08:58<1:37:05, 26.36s/it] {'loss': 0.7489, 'learning_rate': 1.3352341458858264e-05, 'epoch': 1.23} 41%|████ | 154/375 [1:08:58<1:37:05, 26.36s/it] 41%|████▏ | 155/375 [1:09:22<1:34:54, 25.88s/it] {'loss': 0.7534, 'learning_rate': 1.3270679633174219e-05, 'epoch': 1.24} 41%|████▏ | 155/375 [1:09:22<1:34:54, 25.88s/it] 42%|████▏ | 156/375 [1:09:48<1:34:30, 25.89s/it] {'loss': 0.7423, 'learning_rate': 1.318877283247619e-05, 'epoch': 1.24} 42%|████▏ | 156/375 [1:09:48<1:34:30, 25.89s/it] 42%|████▏ | 157/375 [1:10:14<1:33:24, 25.71s/it] {'loss': 0.7141, 'learning_rate': 1.3106627191611333e-05, 'epoch': 1.25} 42%|████▏ | 157/375 [1:10:14<1:33:24, 25.71s/it] 42%|████▏ | 158/375 [1:10:39<1:33:04, 25.73s/it] {'loss': 0.7443, 'learning_rate': 1.3024248863316012e-05, 'epoch': 1.26} 42%|████▏ | 158/375 [1:10:39<1:33:04, 25.73s/it] 
42%|████▏ | 159/375 [1:11:04<1:31:36, 25.45s/it] {'loss': 0.7383, 'learning_rate': 1.2941644017754964e-05, 'epoch': 1.27} 42%|████▏ | 159/375 [1:11:04<1:31:36, 25.45s/it] 43%|████▎ | 160/375 [1:11:29<1:30:18, 25.20s/it] {'loss': 0.7444, 'learning_rate': 1.2858818842059145e-05, 'epoch': 1.27} 43%|████▎ | 160/375 [1:11:29<1:30:18, 25.20s/it] 43%|████▎ | 161/375 [1:11:55<1:31:19, 25.60s/it] {'loss': 0.7489, 'learning_rate': 1.2775779539862305e-05, 'epoch': 1.28} 43%|████▎ | 161/375 [1:11:55<1:31:19, 25.60s/it] 43%|████▎ | 162/375 [1:12:24<1:33:46, 26.41s/it] {'loss': 0.7266, 'learning_rate': 1.2692532330836346e-05, 'epoch': 1.29} 43%|████▎ | 162/375 [1:12:24<1:33:46, 26.41s/it] 43%|████▎ | 163/375 [1:12:51<1:33:45, 26.53s/it] {'loss': 0.7159, 'learning_rate': 1.2609083450225468e-05, 'epoch': 1.3} 43%|████▎ | 163/375 [1:12:51<1:33:45, 26.53s/it] 44%|████▎ | 164/375 [1:13:18<1:34:01, 26.74s/it] {'loss': 0.7362, 'learning_rate': 1.2525439148379127e-05, 'epoch': 1.31} 44%|████▎ | 164/375 [1:13:18<1:34:01, 26.74s/it] 44%|████▍ | 165/375 [1:13:45<1:33:59, 26.85s/it] {'loss': 0.7174, 'learning_rate': 1.2441605690283915e-05, 'epoch': 1.31} 44%|████▍ | 165/375 [1:13:45<1:33:59, 26.85s/it] 44%|████▍ | 166/375 [1:14:11<1:32:58, 26.69s/it] {'loss': 0.7137, 'learning_rate': 1.2357589355094275e-05, 'epoch': 1.32} 44%|████▍ | 166/375 [1:14:11<1:32:58, 26.69s/it] 45%|████▍ | 167/375 [1:14:36<1:30:54, 26.22s/it] {'loss': 0.723, 'learning_rate': 1.2273396435662212e-05, 'epoch': 1.33} 45%|████▍ | 167/375 [1:14:36<1:30:54, 26.22s/it] 45%|████▍ | 168/375 [1:15:03<1:31:14, 26.45s/it] {'loss': 0.7185, 'learning_rate': 1.218903323806595e-05, 'epoch': 1.34} 45%|████▍ | 168/375 [1:15:03<1:31:14, 26.45s/it] 45%|████▌ | 169/375 [1:15:29<1:30:05, 26.24s/it] {'loss': 0.7306, 'learning_rate': 1.2104506081137608e-05, 'epoch': 1.35} 45%|████▌ | 169/375 [1:15:29<1:30:05, 26.24s/it] 45%|████▌ | 170/375 [1:15:56<1:30:37, 26.53s/it] {'loss': 0.7213, 'learning_rate': 1.2019821295989913e-05, 'epoch': 1.35} 
45%|████▌ | 170/375 [1:15:56<1:30:37, 26.53s/it] 46%|████▌ | 171/375 [1:16:22<1:29:34, 26.35s/it] {'loss': 0.7159, 'learning_rate': 1.1934985225541998e-05, 'epoch': 1.36} 46%|████▌ | 171/375 [1:16:22<1:29:34, 26.35s/it] 46%|████▌ | 172/375 [1:16:49<1:29:22, 26.42s/it] {'loss': 0.7298, 'learning_rate': 1.1850004224044315e-05, 'epoch': 1.37} 46%|████▌ | 172/375 [1:16:49<1:29:22, 26.42s/it] 46%|████▌ | 173/375 [1:17:15<1:28:31, 26.30s/it] {'loss': 0.7324, 'learning_rate': 1.1764884656602711e-05, 'epoch': 1.38} 46%|████▌ | 173/375 [1:17:15<1:28:31, 26.30s/it] 46%|████▋ | 174/375 [1:17:41<1:27:53, 26.24s/it] {'loss': 0.7354, 'learning_rate': 1.1679632898701649e-05, 'epoch': 1.39} 46%|████▋ | 174/375 [1:17:41<1:27:53, 26.24s/it] 47%|████▋ | 175/375 [1:18:06<1:26:27, 25.94s/it] {'loss': 0.7195, 'learning_rate': 1.1594255335726725e-05, 'epoch': 1.39} 47%|████▋ | 175/375 [1:18:06<1:26:27, 25.94s/it] 47%|████▋ | 176/375 [1:18:32<1:26:01, 25.94s/it] {'loss': 0.7212, 'learning_rate': 1.1508758362486358e-05, 'epoch': 1.4} 47%|████▋ | 176/375 [1:18:32<1:26:01, 25.94s/it] 47%|████▋ | 177/375 [1:18:57<1:25:07, 25.79s/it] {'loss': 0.7272, 'learning_rate': 1.1423148382732854e-05, 'epoch': 1.41} 47%|████▋ | 177/375 [1:18:57<1:25:07, 25.79s/it] 47%|████▋ | 178/375 [1:19:23<1:24:07, 25.62s/it] {'loss': 0.7302, 'learning_rate': 1.133743180868273e-05, 'epoch': 1.42} 47%|████▋ | 178/375 [1:19:23<1:24:07, 25.62s/it] 48%|████▊ | 179/375 [1:19:50<1:25:23, 26.14s/it] {'loss': 0.7399, 'learning_rate': 1.125161506053646e-05, 'epoch': 1.43} 48%|████▊ | 179/375 [1:19:50<1:25:23, 26.14s/it] 48%|████▊ | 180/375 [1:20:17<1:25:35, 26.34s/it] {'loss': 0.7059, 'learning_rate': 1.1165704565997593e-05, 'epoch': 1.43} 48%|████▊ | 180/375 [1:20:17<1:25:35, 26.34s/it] 48%|████▊ | 181/375 [1:20:43<1:24:45, 26.21s/it] {'loss': 0.733, 'learning_rate': 1.1079706759791311e-05, 'epoch': 1.44} 48%|████▊ | 181/375 [1:20:43<1:24:45, 26.21s/it] 49%|████▊ | 182/375 [1:21:08<1:23:29, 25.96s/it] {'loss': 0.7086, 
'learning_rate': 1.0993628083182468e-05, 'epoch': 1.45} 49%|████▊ | 182/375 [1:21:08<1:23:29, 25.96s/it] 49%|████▉ | 183/375 [1:21:34<1:22:32, 25.79s/it] {'loss': 0.7396, 'learning_rate': 1.0907474983493144e-05, 'epoch': 1.46} 49%|████▉ | 183/375 [1:21:34<1:22:32, 25.79s/it] 49%|████▉ | 184/375 [1:22:00<1:23:01, 26.08s/it] {'loss': 0.7405, 'learning_rate': 1.0821253913619727e-05, 'epoch': 1.47} 49%|████▉ | 184/375 [1:22:00<1:23:01, 26.08s/it] 49%|████▉ | 185/375 [1:22:26<1:22:33, 26.07s/it] {'loss': 0.7121, 'learning_rate': 1.0734971331549604e-05, 'epoch': 1.47} 49%|████▉ | 185/375 [1:22:26<1:22:33, 26.07s/it] 50%|████▉ | 186/375 [1:22:52<1:22:03, 26.05s/it] {'loss': 0.7149, 'learning_rate': 1.064863369987743e-05, 'epoch': 1.48} 50%|████▉ | 186/375 [1:22:52<1:22:03, 26.05s/it] 50%|████▉ | 187/375 [1:23:20<1:23:31, 26.66s/it] {'loss': 0.7227, 'learning_rate': 1.0562247485321116e-05, 'epoch': 1.49} 50%|████▉ | 187/375 [1:23:20<1:23:31, 26.66s/it] 50%|█████ | 188/375 [1:23:46<1:21:45, 26.23s/it] {'loss': 0.7337, 'learning_rate': 1.0475819158237426e-05, 'epoch': 1.5} 50%|█████ | 188/375 [1:23:46<1:21:45, 26.23s/it] 50%|█████ | 189/375 [1:24:11<1:20:55, 26.10s/it] {'loss': 0.7205, 'learning_rate': 1.0389355192137379e-05, 'epoch': 1.51} 50%|█████ | 189/375 [1:24:11<1:20:55, 26.10s/it] 51%|█████ | 190/375 [1:24:37<1:20:00, 25.95s/it] {'loss': 0.7189, 'learning_rate': 1.0302862063201367e-05, 'epoch': 1.51} 51%|█████ | 190/375 [1:24:37<1:20:00, 25.95s/it] 51%|█████ | 191/375 [1:25:03<1:19:30, 25.93s/it] {'loss': 0.7051, 'learning_rate': 1.0216346249794087e-05, 'epoch': 1.52} 51%|█████ | 191/375 [1:25:03<1:19:30, 25.93s/it] 51%|█████ | 192/375 [1:25:29<1:19:16, 25.99s/it] {'loss': 0.718, 'learning_rate': 1.012981423197931e-05, 'epoch': 1.53} 51%|█████ | 192/375 [1:25:29<1:19:16, 25.99s/it] 51%|█████▏ | 193/375 [1:25:57<1:20:13, 26.45s/it] {'loss': 0.7197, 'learning_rate': 1.0043272491034523e-05, 'epoch': 1.54} 51%|█████▏ | 193/375 [1:25:57<1:20:13, 26.45s/it] 52%|█████▏ | 
194/375 [1:26:25<1:21:18, 26.95s/it] {'loss': 0.7101, 'learning_rate': 9.956727508965482e-06, 'epoch': 1.55} 52%|█████▏ | 194/375 [1:26:25<1:21:18, 26.95s/it] 52%|█████▏ | 195/375 [1:26:50<1:19:35, 26.53s/it] {'loss': 0.7415, 'learning_rate': 9.870185768020694e-06, 'epoch': 1.55} 52%|█████▏ | 195/375 [1:26:50<1:19:35, 26.53s/it] 52%|█████▏ | 196/375 [1:27:16<1:18:28, 26.31s/it] {'loss': 0.7303, 'learning_rate': 9.783653750205916e-06, 'epoch': 1.56} 52%|█████▏ | 196/375 [1:27:16<1:18:28, 26.31s/it] 53%|█████▎ | 197/375 [1:27:43<1:18:16, 26.39s/it] {'loss': 0.6957, 'learning_rate': 9.697137936798635e-06, 'epoch': 1.57} 53%|█████▎ | 197/375 [1:27:43<1:18:16, 26.39s/it] 53%|█████▎ | 198/375 [1:28:09<1:17:41, 26.34s/it] {'loss': 0.699, 'learning_rate': 9.610644807862625e-06, 'epoch': 1.58} 53%|█████▎ | 198/375 [1:28:09<1:17:41, 26.34s/it] 53%|█████▎ | 199/375 [1:28:34<1:16:32, 26.10s/it] {'loss': 0.7184, 'learning_rate': 9.524180841762577e-06, 'epoch': 1.59} 53%|█████▎ | 199/375 [1:28:34<1:16:32, 26.10s/it] 53%|█████▎ | 200/375 [1:29:01<1:16:13, 26.13s/it] {'loss': 0.7473, 'learning_rate': 9.437752514678888e-06, 'epoch': 1.59} 53%|█████▎ | 200/375 [1:29:01<1:16:13, 26.13s/it] 54%|█████▎ | 201/375 [1:29:27<1:16:02, 26.22s/it] {'loss': 0.7197, 'learning_rate': 9.351366300122569e-06, 'epoch': 1.6} 54%|█████▎ | 201/375 [1:29:27<1:16:02, 26.22s/it] 54%|█████▍ | 202/375 [1:29:54<1:16:38, 26.58s/it] {'loss': 0.7005, 'learning_rate': 9.265028668450403e-06, 'epoch': 1.61} 54%|█████▍ | 202/375 [1:29:54<1:16:38, 26.58s/it] 54%|█████▍ | 203/375 [1:30:20<1:14:56, 26.14s/it] {'loss': 0.7279, 'learning_rate': 9.178746086380274e-06, 'epoch': 1.62} 54%|█████▍ | 203/375 [1:30:20<1:14:56, 26.14s/it] 54%|█████▍ | 204/375 [1:30:47<1:15:26, 26.47s/it] {'loss': 0.7326, 'learning_rate': 9.092525016506858e-06, 'epoch': 1.63} 54%|█████▍ | 204/375 [1:30:47<1:15:26, 26.47s/it] 55%|█████▍ | 205/375 [1:31:14<1:15:15, 26.56s/it] {'loss': 0.7041, 'learning_rate': 9.006371916817533e-06, 'epoch': 1.63} 
55%|█████▍ | 205/375 [1:31:14<1:15:15, 26.56s/it] 55%|█████▍ | 206/375 [1:31:41<1:15:42, 26.88s/it] {'loss': 0.714, 'learning_rate': 8.920293240208694e-06, 'epoch': 1.64} 55%|█████▍ | 206/375 [1:31:41<1:15:42, 26.88s/it] 55%|█████▌ | 207/375 [1:32:09<1:15:42, 27.04s/it] {'loss': 0.7057, 'learning_rate': 8.83429543400241e-06, 'epoch': 1.65} 55%|█████▌ | 207/375 [1:32:09<1:15:42, 27.04s/it] 55%|█████▌ | 208/375 [1:32:38<1:17:15, 27.76s/it] {'loss': 0.7059, 'learning_rate': 8.748384939463543e-06, 'epoch': 1.66} 55%|█████▌ | 208/375 [1:32:38<1:17:15, 27.76s/it] 56%|█████▌ | 209/375 [1:33:05<1:16:33, 27.67s/it] {'loss': 0.7115, 'learning_rate': 8.662568191317273e-06, 'epoch': 1.67} 56%|█████▌ | 209/375 [1:33:06<1:16:33, 27.67s/it] 56%|█████▌ | 210/375 [1:33:34<1:16:32, 27.83s/it] {'loss': 0.707, 'learning_rate': 8.576851617267151e-06, 'epoch': 1.67} 56%|█████▌ | 210/375 [1:33:34<1:16:32, 27.83s/it] 56%|█████▋ | 211/375 [1:34:00<1:14:54, 27.41s/it] {'loss': 0.7443, 'learning_rate': 8.491241637513644e-06, 'epoch': 1.68} 56%|█████▋ | 211/375 [1:34:00<1:14:54, 27.41s/it] 57%|█████▋ | 212/375 [1:34:26<1:13:19, 26.99s/it] {'loss': 0.7306, 'learning_rate': 8.405744664273278e-06, 'epoch': 1.69} 57%|█████▋ | 212/375 [1:34:26<1:13:19, 26.99s/it] 57%|█████▋ | 213/375 [1:34:53<1:12:34, 26.88s/it] {'loss': 0.7078, 'learning_rate': 8.320367101298351e-06, 'epoch': 1.7} 57%|█████▋ | 213/375 [1:34:53<1:12:34, 26.88s/it] 57%|█████▋ | 214/375 [1:35:20<1:12:46, 27.12s/it] {'loss': 0.7113, 'learning_rate': 8.235115343397295e-06, 'epoch': 1.71} 57%|█████▋ | 214/375 [1:35:20<1:12:46, 27.12s/it] 57%|█████▋ | 215/375 [1:35:48<1:12:38, 27.24s/it] {'loss': 0.7032, 'learning_rate': 8.149995775955686e-06, 'epoch': 1.71} 57%|█████▋ | 215/375 [1:35:48<1:12:38, 27.24s/it] 58%|█████▊ | 216/375 [1:36:14<1:10:53, 26.75s/it] {'loss': 0.716, 'learning_rate': 8.065014774458004e-06, 'epoch': 1.72} 58%|█████▊ | 216/375 [1:36:14<1:10:53, 26.75s/it] 58%|█████▊ | 217/375 [1:36:42<1:11:25, 27.12s/it] {'loss': 
0.7612, 'learning_rate': 7.980178704010089e-06, 'epoch': 1.73} 58%|█████▊ | 217/375 [1:36:42<1:11:25, 27.12s/it] 58%|█████▊ | 218/375 [1:37:12<1:13:14, 27.99s/it] {'loss': 0.6952, 'learning_rate': 7.895493918862395e-06, 'epoch': 1.74} 58%|█████▊ | 218/375 [1:37:12<1:13:14, 27.99s/it] 58%|█████▊ | 219/375 [1:37:38<1:11:10, 27.38s/it] {'loss': 0.7036, 'learning_rate': 7.810966761934053e-06, 'epoch': 1.75} 58%|█████▊ | 219/375 [1:37:38<1:11:10, 27.38s/it] 59%|█████▊ | 220/375 [1:38:04<1:09:41, 26.98s/it] {'loss': 0.7083, 'learning_rate': 7.726603564337791e-06, 'epoch': 1.75} 59%|█████▊ | 220/375 [1:38:04<1:09:41, 26.98s/it] 59%|█████▉ | 221/375 [1:38:32<1:10:23, 27.43s/it] {'loss': 0.7207, 'learning_rate': 7.642410644905726e-06, 'epoch': 1.76} 59%|█████▉ | 221/375 [1:38:32<1:10:23, 27.43s/it] 59%|█████▉ | 222/375 [1:39:00<1:09:58, 27.44s/it] {'loss': 0.7083, 'learning_rate': 7.558394309716088e-06, 'epoch': 1.77} 59%|█████▉ | 222/375 [1:39:00<1:09:58, 27.44s/it] 59%|█████▉ | 223/375 [1:39:26<1:08:59, 27.23s/it] {'loss': 0.7454, 'learning_rate': 7.474560851620873e-06, 'epoch': 1.78} 59%|█████▉ | 223/375 [1:39:26<1:08:59, 27.23s/it] 60%|█████▉ | 224/375 [1:39:53<1:08:14, 27.11s/it] {'loss': 0.7308, 'learning_rate': 7.390916549774536e-06, 'epoch': 1.78} 60%|█████▉ | 224/375 [1:39:53<1:08:14, 27.11s/it] 60%|██████ | 225/375 [1:40:19<1:07:11, 26.88s/it] {'loss': 0.7114, 'learning_rate': 7.307467669163655e-06, 'epoch': 1.79} 60%|██████ | 225/375 [1:40:19<1:07:11, 26.88s/it] 60%|██████ | 226/375 [1:40:47<1:07:18, 27.10s/it] {'loss': 0.7021, 'learning_rate': 7.224220460137701e-06, 'epoch': 1.8} 60%|██████ | 226/375 [1:40:47<1:07:18, 27.10s/it] 61%|██████ | 227/375 [1:41:15<1:07:30, 27.37s/it] {'loss': 0.7276, 'learning_rate': 7.141181157940859e-06, 'epoch': 1.81} 61%|██████ | 227/375 [1:41:15<1:07:30, 27.37s/it] 61%|██████ | 228/375 [1:41:46<1:09:19, 28.29s/it] {'loss': 0.711, 'learning_rate': 7.058355982245038e-06, 'epoch': 1.82} 61%|██████ | 228/375 [1:41:46<1:09:19, 
28.29s/it] 61%|██████ | 229/375 [1:42:12<1:07:43, 27.83s/it] {'loss': 0.7256, 'learning_rate': 6.97575113668399e-06, 'epoch': 1.82} 61%|██████ | 229/375 [1:42:12<1:07:43, 27.83s/it] 61%|██████▏ | 230/375 [1:42:40<1:06:54, 27.69s/it] {'loss': 0.7044, 'learning_rate': 6.893372808388674e-06, 'epoch': 1.83} 61%|██████▏ | 230/375 [1:42:40<1:06:54, 27.69s/it] 62%|██████▏ | 231/375 [1:43:06<1:05:45, 27.40s/it] {'loss': 0.6723, 'learning_rate': 6.8112271675238154e-06, 'epoch': 1.84} 62%|██████▏ | 231/375 [1:43:06<1:05:45, 27.40s/it] 62%|██████▏ | 232/375 [1:43:34<1:05:37, 27.54s/it] {'loss': 0.7268, 'learning_rate': 6.729320366825785e-06, 'epoch': 1.85} 62%|██████▏ | 232/375 [1:43:34<1:05:37, 27.54s/it] 62%|██████▏ | 233/375 [1:44:01<1:04:57, 27.45s/it] {'loss': 0.7028, 'learning_rate': 6.647658541141735e-06, 'epoch': 1.86} 62%|██████▏ | 233/375 [1:44:01<1:04:57, 27.45s/it] 62%|██████▏ | 234/375 [1:44:29<1:04:30, 27.45s/it] {'loss': 0.729, 'learning_rate': 6.566247806970119e-06, 'epoch': 1.86} 62%|██████▏ | 234/375 [1:44:29<1:04:30, 27.45s/it] 63%|██████▎ | 235/375 [1:44:57<1:04:22, 27.59s/it] {'loss': 0.6692, 'learning_rate': 6.485094262002529e-06, 'epoch': 1.87} 63%|██████▎ | 235/375 [1:44:57<1:04:22, 27.59s/it] 63%|██████▎ | 236/375 [1:45:26<1:04:46, 27.96s/it] {'loss': 0.6896, 'learning_rate': 6.404203984667019e-06, 'epoch': 1.88} 63%|██████▎ | 236/375 [1:45:26<1:04:46, 27.96s/it] 63%|██████▎ | 237/375 [1:45:55<1:05:08, 28.32s/it] {'loss': 0.715, 'learning_rate': 6.323583033672799e-06, 'epoch': 1.89} 63%|██████▎ | 237/375 [1:45:55<1:05:08, 28.32s/it] 63%|██████▎ | 238/375 [1:46:24<1:05:11, 28.55s/it] {'loss': 0.7148, 'learning_rate': 6.24323744755645e-06, 'epoch': 1.9} 63%|██████▎ | 238/375 [1:46:24<1:05:11, 28.55s/it] 64%|██████▎ | 239/375 [1:46:52<1:04:25, 28.42s/it] {'loss': 0.7357, 'learning_rate': 6.163173244229618e-06, 'epoch': 1.9} 64%|██████▎ | 239/375 [1:46:52<1:04:25, 28.42s/it] 64%|██████▍ | 240/375 [1:47:20<1:03:52, 28.39s/it] {'loss': 0.71, 
'learning_rate': 6.083396420528298e-06, 'epoch': 1.91} 64%|██████▍ | 240/375 [1:47:20<1:03:52, 28.39s/it] 64%|██████▍ | 241/375 [1:47:48<1:03:00, 28.21s/it] {'loss': 0.696, 'learning_rate': 6.003912951763644e-06, 'epoch': 1.92} 64%|██████▍ | 241/375 [1:47:48<1:03:00, 28.21s/it] 65%|██████▍ | 242/375 [1:48:16<1:02:13, 28.07s/it] {'loss': 0.6842, 'learning_rate': 5.924728791274432e-06, 'epoch': 1.93} 65%|██████▍ | 242/375 [1:48:16<1:02:13, 28.07s/it] 65%|██████▍ | 243/375 [1:48:44<1:02:01, 28.20s/it] {'loss': 0.7091, 'learning_rate': 5.845849869981137e-06, 'epoch': 1.94} 65%|██████▍ | 243/375 [1:48:44<1:02:01, 28.20s/it] 65%|██████▌ | 244/375 [1:49:14<1:02:25, 28.59s/it] {'loss': 0.6849, 'learning_rate': 5.767282095941725e-06, 'epoch': 1.94} 65%|██████▌ | 244/375 [1:49:14<1:02:25, 28.59s/it] 65%|██████▌ | 245/375 [1:49:42<1:01:23, 28.33s/it] {'loss': 0.7449, 'learning_rate': 5.68903135390912e-06, 'epoch': 1.95} 65%|██████▌ | 245/375 [1:49:42<1:01:23, 28.33s/it] 66%|██████▌ | 246/375 [1:50:10<1:00:52, 28.32s/it] {'loss': 0.7074, 'learning_rate': 5.611103504890444e-06, 'epoch': 1.96} 66%|██████▌ | 246/375 [1:50:10<1:00:52, 28.32s/it] 66%|██████▌ | 247/375 [1:50:38<1:00:00, 28.13s/it] {'loss': 0.7138, 'learning_rate': 5.533504385708024e-06, 'epoch': 1.97} 66%|██████▌ | 247/375 [1:50:38<1:00:00, 28.13s/it] 66%|██████▌ | 248/375 [1:51:05<58:49, 27.79s/it] {'loss': 0.7145, 'learning_rate': 5.45623980856221e-06, 'epoch': 1.98} 66%|██████▌ | 248/375 [1:51:05<58:49, 27.79s/it] 66%|██████▋ | 249/375 [1:51:32<57:56, 27.59s/it] {'loss': 0.7426, 'learning_rate': 5.379315560596037e-06, 'epoch': 1.98} 66%|██████▋ | 249/375 [1:51:32<57:56, 27.59s/it] 67%|██████▋ | 250/375 [1:51:59<57:27, 27.58s/it] {'loss': 0.6741, 'learning_rate': 5.302737403461778e-06, 'epoch': 1.99} 67%|██████▋ | 250/375 [1:51:59<57:27, 27.58s/it] 67%|██████▋ | 251/375 [1:52:26<56:15, 27.22s/it] {'loss': 0.6832, 'learning_rate': 5.226511072889371e-06, 'epoch': 2.0} 67%|██████▋ | 251/375 [1:52:26<56:15, 27.22s/it] 
67%|██████▋ | 252/375 [1:52:59<59:35, 29.07s/it] {'loss': 0.5329, 'learning_rate': 5.1506422782568345e-06, 'epoch': 2.01} 67%|██████▋ | 252/375 [1:52:59<59:35, 29.07s/it] 67%|██████▋ | 253/375 [1:53:26<57:47, 28.42s/it] {'loss': 0.5358, 'learning_rate': 5.075136702162622e-06, 'epoch': 2.02} 67%|██████▋ | 253/375 [1:53:26<57:47, 28.42s/it] 68%|██████▊ | 254/375 [1:53:50<54:57, 27.25s/it] {'loss': 0.5402, 'learning_rate': 5.000000000000003e-06, 'epoch': 2.02} 68%|██████▊ | 254/375 [1:53:50<54:57, 27.25s/it] 68%|██████▊ | 255/375 [1:54:17<53:56, 26.97s/it] {'loss': 0.5159, 'learning_rate': 4.925237799533445e-06, 'epoch': 2.03} 68%|██████▊ | 255/375 [1:54:17<53:56, 26.97s/it] 68%|██████▊ | 256/375 [1:54:44<53:30, 26.98s/it] {'loss': 0.5301, 'learning_rate': 4.85085570047713e-06, 'epoch': 2.04} 68%|██████▊ | 256/375 [1:54:44<53:30, 26.98s/it] 69%|██████▊ | 257/375 [1:55:11<53:14, 27.07s/it] {'loss': 0.535, 'learning_rate': 4.776859274075506e-06, 'epoch': 2.05} 69%|██████▊ | 257/375 [1:55:11<53:14, 27.07s/it] 69%|██████▉ | 258/375 [1:55:37<52:17, 26.82s/it] {'loss': 0.5051, 'learning_rate': 4.703254062686017e-06, 'epoch': 2.06} 69%|██████▉ | 258/375 [1:55:37<52:17, 26.82s/it] 69%|██████▉ | 259/375 [1:56:03<51:03, 26.41s/it] {'loss': 0.5253, 'learning_rate': 4.6300455793639565e-06, 'epoch': 2.06} 69%|██████▉ | 259/375 [1:56:03<51:03, 26.41s/it] 69%|██████▉ | 260/375 [1:56:28<50:15, 26.22s/it] {'loss': 0.5146, 'learning_rate': 4.557239307449562e-06, 'epoch': 2.07} 69%|██████▉ | 260/375 [1:56:29<50:15, 26.22s/it] 70%|██████▉ | 261/375 [1:56:55<50:10, 26.41s/it] {'loss': 0.5272, 'learning_rate': 4.4848407001572945e-06, 'epoch': 2.08} 70%|██████▉ | 261/375 [1:56:55<50:10, 26.41s/it] 70%|██████▉ | 262/375 [1:57:21<49:32, 26.30s/it] {'loss': 0.531, 'learning_rate': 4.412855180167406e-06, 'epoch': 2.09} 70%|██████▉ | 262/375 [1:57:21<49:32, 26.30s/it] 70%|███████ | 263/375 [1:57:50<50:37, 27.12s/it] {'loss': 0.5101, 'learning_rate': 4.341288139219752e-06, 'epoch': 2.1} 
70%|███████ | 263/375 [1:57:50<50:37, 27.12s/it] 70%|███████ | 264/375 [1:58:17<50:09, 27.11s/it] {'loss': 0.4964, 'learning_rate': 4.270144937709981e-06, 'epoch': 2.1} 70%|███████ | 264/375 [1:58:17<50:09, 27.11s/it] 71%|███████ | 265/375 [1:58:45<50:06, 27.33s/it] {'loss': 0.5403, 'learning_rate': 4.19943090428802e-06, 'epoch': 2.11} 71%|███████ | 265/375 [1:58:45<50:06, 27.33s/it] 71%|███████ | 266/375 [1:59:11<48:48, 26.86s/it] {'loss': 0.5303, 'learning_rate': 4.1291513354589576e-06, 'epoch': 2.12} 71%|███████ | 266/375 [1:59:11<48:48, 26.86s/it] 71%|███████ | 267/375 [1:59:37<47:54, 26.62s/it] {'loss': 0.5164, 'learning_rate': 4.059311495186338e-06, 'epoch': 2.13} 71%|███████ | 267/375 [1:59:37<47:54, 26.62s/it] 71%|███████▏ | 268/375 [2:00:05<47:58, 26.90s/it] {'loss': 0.486, 'learning_rate': 3.989916614497891e-06, 'epoch': 2.14} 71%|███████▏ | 268/375 [2:00:05<47:58, 26.90s/it] 72%|███████▏ | 269/375 [2:00:32<47:44, 27.03s/it] {'loss': 0.4951, 'learning_rate': 3.9209718910937174e-06, 'epoch': 2.14} 72%|███████▏ | 269/375 [2:00:32<47:44, 27.03s/it] 72%|███████▏ | 270/375 [2:01:01<48:09, 27.52s/it] {'loss': 0.5051, 'learning_rate': 3.852482488956992e-06, 'epoch': 2.15} 72%|███████▏ | 270/375 [2:01:01<48:09, 27.52s/it] 72%|███████▏ | 271/375 [2:01:27<47:02, 27.14s/it] {'loss': 0.5312, 'learning_rate': 3.784453537967161e-06, 'epoch': 2.16} 72%|███████▏ | 271/375 [2:01:27<47:02, 27.14s/it] 73%|███████▎ | 272/375 [2:01:53<46:12, 26.92s/it] {'loss': 0.481, 'learning_rate': 3.7168901335157313e-06, 'epoch': 2.17} 73%|███████▎ | 272/375 [2:01:53<46:12, 26.92s/it] 73%|███████▎ | 273/375 [2:02:20<45:24, 26.71s/it] {'loss': 0.5259, 'learning_rate': 3.6497973361246153e-06, 'epoch': 2.18} 73%|███████▎ | 273/375 [2:02:20<45:24, 26.71s/it] 73%|███████▎ | 274/375 [2:02:45<44:29, 26.43s/it] {'loss': 0.5091, 'learning_rate': 3.583180171067101e-06, 'epoch': 2.18} 73%|███████▎ | 274/375 [2:02:45<44:29, 26.43s/it] 73%|███████▎ | 275/375 [2:03:12<44:08, 26.48s/it] {'loss': 0.5071, 
'learning_rate': 3.517043627991441e-06, 'epoch': 2.19} 73%|███████▎ | 275/375 [2:03:12<44:08, 26.48s/it] 74%|███████▎ | 276/375 [2:03:37<43:05, 26.12s/it] {'loss': 0.5263, 'learning_rate': 3.4513926605471504e-06, 'epoch': 2.2} 74%|███████▎ | 276/375 [2:03:37<43:05, 26.12s/it] 74%|███████▍ | 277/375 [2:04:03<42:25, 25.98s/it] {'loss': 0.5056, 'learning_rate': 3.3862321860139578e-06, 'epoch': 2.21} 74%|███████▍ | 277/375 [2:04:03<42:25, 25.98s/it] 74%|███████▍ | 278/375 [2:04:30<42:33, 26.33s/it] {'loss': 0.5187, 'learning_rate': 3.3215670849335156e-06, 'epoch': 2.22} 74%|███████▍ | 278/375 [2:04:30<42:33, 26.33s/it] 74%|███████▍ | 279/375 [2:04:56<42:02, 26.28s/it] {'loss': 0.521, 'learning_rate': 3.257402200743821e-06, 'epoch': 2.22} 74%|███████▍ | 279/375 [2:04:56<42:02, 26.28s/it] 75%|███████▍ | 280/375 [2:05:21<41:03, 25.93s/it] {'loss': 0.5028, 'learning_rate': 3.19374233941647e-06, 'epoch': 2.23} 75%|███████▍ | 280/375 [2:05:21<41:03, 25.93s/it] 75%|███████▍ | 281/375 [2:05:48<41:06, 26.24s/it] {'loss': 0.4968, 'learning_rate': 3.1305922690966705e-06, 'epoch': 2.24} 75%|███████▍ | 281/375 [2:05:48<41:06, 26.24s/it] 75%|███████▌ | 282/375 [2:06:15<40:58, 26.43s/it] {'loss': 0.4963, 'learning_rate': 3.0679567197461135e-06, 'epoch': 2.25} 75%|███████▌ | 282/375 [2:06:15<40:58, 26.43s/it] 75%|███████▌ | 283/375 [2:06:41<40:14, 26.24s/it] {'loss': 0.5082, 'learning_rate': 3.005840382788685e-06, 'epoch': 2.25} 75%|███████▌ | 283/375 [2:06:41<40:14, 26.24s/it] 76%|███████▌ | 284/375 [2:07:06<39:21, 25.95s/it] {'loss': 0.5287, 'learning_rate': 2.944247910759097e-06, 'epoch': 2.26} 76%|███████▌ | 284/375 [2:07:06<39:21, 25.95s/it] 76%|███████▌ | 285/375 [2:07:32<39:00, 26.01s/it] {'loss': 0.5074, 'learning_rate': 2.8831839169543998e-06, 'epoch': 2.27} 76%|███████▌ | 285/375 [2:07:32<39:00, 26.01s/it] 76%|███████▋ | 286/375 [2:07:59<38:57, 26.26s/it] {'loss': 0.5049, 'learning_rate': 2.8226529750884403e-06, 'epoch': 2.28} 76%|███████▋ | 286/375 [2:07:59<38:57, 
26.26s/it] 77%|███████▋ | 287/375 [2:08:26<38:34, 26.30s/it] {'loss': 0.477, 'learning_rate': 2.7626596189492983e-06, 'epoch': 2.29} 77%|███████▋ | 287/375 [2:08:26<38:34, 26.30s/it] 77%|███████▋ | 288/375 [2:08:52<38:06, 26.29s/it] {'loss': 0.5196, 'learning_rate': 2.7032083420597e-06, 'epoch': 2.29} 77%|███████▋ | 288/375 [2:08:52<38:06, 26.29s/it] 77%|███████▋ | 289/375 [2:09:18<37:46, 26.35s/it] {'loss': 0.5186, 'learning_rate': 2.6443035973404497e-06, 'epoch': 2.3} 77%|███████▋ | 289/375 [2:09:18<37:46, 26.35s/it] 77%|███████▋ | 290/375 [2:09:44<36:57, 26.09s/it] {'loss': 0.4904, 'learning_rate': 2.585949796776912e-06, 'epoch': 2.31} 77%|███████▋ | 290/375 [2:09:44<36:57, 26.09s/it] 78%|███████▊ | 291/375 [2:10:09<36:13, 25.87s/it] {'loss': 0.5234, 'learning_rate': 2.528151311088537e-06, 'epoch': 2.32} 78%|███████▊ | 291/375 [2:10:09<36:13, 25.87s/it] 78%|███████▊ | 292/375 [2:10:37<36:26, 26.35s/it] {'loss': 0.4933, 'learning_rate': 2.470912469401512e-06, 'epoch': 2.33} 78%|███████▊ | 292/375 [2:10:37<36:26, 26.35s/it] 78%|███████▊ | 293/375 [2:11:03<36:04, 26.40s/it] {'loss': 0.5143, 'learning_rate': 2.414237558924496e-06, 'epoch': 2.33} 78%|███████▊ | 293/375 [2:11:03<36:04, 26.40s/it] 78%|███████▊ | 294/375 [2:11:28<35:10, 26.06s/it] {'loss': 0.5168, 'learning_rate': 2.3581308246275103e-06, 'epoch': 2.34} 78%|███████▊ | 294/375 [2:11:28<35:10, 26.06s/it] 79%|███████▊ | 295/375 [2:11:55<35:02, 26.29s/it] {'loss': 0.5049, 'learning_rate': 2.302596468923981e-06, 'epoch': 2.35} 79%|███████▊ | 295/375 [2:11:55<35:02, 26.29s/it] 79%|███████▉ | 296/375 [2:12:24<35:26, 26.91s/it] {'loss': 0.517, 'learning_rate': 2.247638651355991e-06, 'epoch': 2.36} 79%|███████▉ | 296/375 [2:12:24<35:26, 26.91s/it] 79%|███████▉ | 297/375 [2:12:49<34:27, 26.51s/it] {'loss': 0.4923, 'learning_rate': 2.1932614882827196e-06, 'epoch': 2.37} 79%|███████▉ | 297/375 [2:12:49<34:27, 26.51s/it] 79%|███████▉ | 298/375 [2:13:16<34:07, 26.59s/it] {'loss': 0.4897, 'learning_rate': 
2.1394690525721275e-06, 'epoch': 2.37} 79%|███████▉ | 298/375 [2:13:16<34:07, 26.59s/it] 80%|███████▉ | 299/375 [2:13:41<33:15, 26.25s/it] {'loss': 0.5118, 'learning_rate': 2.0862653732958914e-06, 'epoch': 2.38} 80%|███████▉ | 299/375 [2:13:41<33:15, 26.25s/it] 80%|████████ | 300/375 [2:14:07<32:30, 26.01s/it] {'loss': 0.508, 'learning_rate': 2.03365443542764e-06, 'epoch': 2.39} 80%|████████ | 300/375 [2:14:07<32:30, 26.01s/it] 80%|████████ | 301/375 [2:14:33<32:01, 25.97s/it] {'loss': 0.4983, 'learning_rate': 1.9816401795444664e-06, 'epoch': 2.4} 80%|████████ | 301/375 [2:14:33<32:01, 25.97s/it] 81%|████████ | 302/375 [2:15:00<32:04, 26.36s/it] {'loss': 0.5127, 'learning_rate': 1.93022650153178e-06, 'epoch': 2.41} 81%|████████ | 302/375 [2:15:00<32:04, 26.36s/it] 81%|████████ | 303/375 [2:15:27<31:48, 26.51s/it] {'loss': 0.5146, 'learning_rate': 1.8794172522915022e-06, 'epoch': 2.41} 81%|████████ | 303/375 [2:15:27<31:48, 26.51s/it] 81%|████████ | 304/375 [2:15:57<32:31, 27.49s/it] {'loss': 0.5289, 'learning_rate': 1.829216237453637e-06, 'epoch': 2.42} 81%|████████ | 304/375 [2:15:57<32:31, 27.49s/it] 81%|████████▏ | 305/375 [2:16:24<32:05, 27.50s/it] {'loss': 0.4871, 'learning_rate': 1.7796272170912255e-06, 'epoch': 2.43} 81%|████████▏ | 305/375 [2:16:24<32:05, 27.50s/it] 82%|████████▏ | 306/375 [2:16:52<31:53, 27.73s/it] {'loss': 0.5142, 'learning_rate': 1.730653905438714e-06, 'epoch': 2.44} 82%|████████▏ | 306/375 [2:16:52<31:53, 27.73s/it] 82%|████████▏ | 307/375 [2:17:18<30:43, 27.12s/it] {'loss': 0.5167, 'learning_rate': 1.6822999706137565e-06, 'epoch': 2.45} 82%|████████▏ | 307/375 [2:17:18<30:43, 27.12s/it] 82%|████████▏ | 308/375 [2:17:43<29:35, 26.49s/it] {'loss': 0.5056, 'learning_rate': 1.6345690343424758e-06, 'epoch': 2.45} 82%|████████▏ | 308/375 [2:17:43<29:35, 26.49s/it] 82%|████████▏ | 309/375 [2:18:11<29:39, 26.97s/it] {'loss': 0.5192, 'learning_rate': 1.587464671688187e-06, 'epoch': 2.46} 82%|████████▏ | 309/375 [2:18:11<29:39, 26.97s/it] 
83%|████████▎ | 310/375 [2:18:40<29:44, 27.45s/it] {'loss': 0.5187, 'learning_rate': 1.540990410783636e-06, 'epoch': 2.47} 83%|████████▎ | 310/375 [2:18:40<29:44, 27.45s/it] 83%|████████▎ | 311/375 [2:19:07<29:10, 27.35s/it] {'loss': 0.5175, 'learning_rate': 1.495149732566723e-06, 'epoch': 2.48} 83%|████████▎ | 311/375 [2:19:07<29:10, 27.35s/it] 83%|████████▎ | 312/375 [2:19:33<28:20, 26.99s/it] {'loss': 0.5055, 'learning_rate': 1.4499460705198e-06, 'epoch': 2.49} 83%|████████▎ | 312/375 [2:19:33<28:20, 26.99s/it] 83%|████████▎ | 313/375 [2:19:59<27:36, 26.72s/it] {'loss': 0.5286, 'learning_rate': 1.4053828104124867e-06, 'epoch': 2.49} 83%|████████▎ | 313/375 [2:19:59<27:36, 26.72s/it] 84%|████████▎ | 314/375 [2:20:25<26:59, 26.55s/it] {'loss': 0.526, 'learning_rate': 1.361463290048085e-06, 'epoch': 2.5} 84%|████████▎ | 314/375 [2:20:25<26:59, 26.55s/it] 84%|████████▍ | 315/375 [2:20:51<26:23, 26.40s/it] {'loss': 0.5038, 'learning_rate': 1.3181907990135624e-06, 'epoch': 2.51} 84%|████████▍ | 315/375 [2:20:51<26:23, 26.40s/it] 84%|████████▍ | 316/375 [2:21:18<26:00, 26.45s/it] {'loss': 0.4852, 'learning_rate': 1.2755685784331784e-06, 'epoch': 2.52} 84%|████████▍ | 316/375 [2:21:18<26:00, 26.45s/it] 85%|████████▍ | 317/375 [2:21:44<25:22, 26.25s/it] {'loss': 0.5, 'learning_rate': 1.2335998207257138e-06, 'epoch': 2.53} 85%|████████▍ | 317/375 [2:21:44<25:22, 26.25s/it] 85%|████████▍ | 318/375 [2:22:11<25:06, 26.44s/it] {'loss': 0.5182, 'learning_rate': 1.1922876693653584e-06, 'epoch': 2.53} 85%|████████▍ | 318/375 [2:22:11<25:06, 26.44s/it] 85%|████████▌ | 319/375 [2:22:38<24:49, 26.59s/it] {'loss': 0.5406, 'learning_rate': 1.1516352186462588e-06, 'epoch': 2.54} 85%|████████▌ | 319/375 [2:22:38<24:49, 26.59s/it] 85%|████████▌ | 320/375 [2:23:03<24:10, 26.38s/it] {'loss': 0.488, 'learning_rate': 1.1116455134507665e-06, 'epoch': 2.55} 85%|████████▌ | 320/375 [2:23:03<24:10, 26.38s/it] 86%|████████▌ | 321/375 [2:23:28<23:16, 25.86s/it] {'loss': 0.5261, 'learning_rate': 
1.0723215490213635e-06, 'epoch': 2.56} 86%|████████▌ | 321/375 [2:23:28<23:16, 25.86s/it] 86%|████████▌ | 322/375 [2:23:55<23:09, 26.22s/it] {'loss': 0.5127, 'learning_rate': 1.0336662707363287e-06, 'epoch': 2.57} 86%|████████▌ | 322/375 [2:23:55<23:09, 26.22s/it] 86%|████████▌ | 323/375 [2:24:22<22:55, 26.45s/it] {'loss': 0.5145, 'learning_rate': 9.95682573889114e-07, 'epoch': 2.57} 86%|████████▌ | 323/375 [2:24:22<22:55, 26.45s/it] 86%|████████▋ | 324/375 [2:24:49<22:39, 26.65s/it] {'loss': 0.4902, 'learning_rate': 9.583733034714982e-07, 'epoch': 2.58} 86%|████████▋ | 324/375 [2:24:49<22:39, 26.65s/it] 87%|████████▋ | 325/375 [2:25:16<22:20, 26.81s/it] {'loss': 0.4974, 'learning_rate': 9.217412539604942e-07, 'epoch': 2.59} 87%|████████▋ | 325/375 [2:25:16<22:20, 26.81s/it] 87%|████████▋ | 326/375 [2:25:43<21:45, 26.63s/it] {'loss': 0.4695, 'learning_rate': 8.857891691090336e-07, 'epoch': 2.6} 87%|████████▋ | 326/375 [2:25:43<21:45, 26.63s/it] 87%|████████▋ | 327/375 [2:26:13<22:05, 27.60s/it] {'loss': 0.4944, 'learning_rate': 8.505197417404687e-07, 'epoch': 2.61} 87%|████████▋ | 327/375 [2:26:13<22:05, 27.60s/it] 87%|████████▋ | 328/375 [2:26:39<21:26, 27.38s/it] {'loss': 0.4966, 'learning_rate': 8.159356135468721e-07, 'epoch': 2.61} 87%|████████▋ | 328/375 [2:26:39<21:26, 27.38s/it] 88%|████████▊ | 329/375 [2:27:07<21:00, 27.40s/it] {'loss': 0.5097, 'learning_rate': 7.820393748911792e-07, 'epoch': 2.62} 88%|████████▊ | 329/375 [2:27:07<21:00, 27.40s/it] 88%|████████▊ | 330/375 [2:27:34<20:23, 27.18s/it] {'loss': 0.525, 'learning_rate': 7.488335646131628e-07, 'epoch': 2.63} 88%|████████▊ | 330/375 [2:27:34<20:23, 27.18s/it] 88%|████████▊ | 331/375 [2:28:01<19:58, 27.23s/it] {'loss': 0.4993, 'learning_rate': 7.163206698392744e-07, 'epoch': 2.64} 88%|████████▊ | 331/375 [2:28:01<19:58, 27.23s/it] 89%|████████▊ | 332/375 [2:28:28<19:34, 27.31s/it] {'loss': 0.489, 'learning_rate': 6.845031257963619e-07, 'epoch': 2.65} 89%|████████▊ | 332/375 [2:28:28<19:34, 
27.31s/it] 89%|████████▉ | 333/375 [2:28:55<19:04, 27.24s/it] {'loss': 0.4964, 'learning_rate': 6.53383315629268e-07, 'epoch': 2.65} 89%|████████▉ | 333/375 [2:28:55<19:04, 27.24s/it] 89%|████████▉ | 334/375 [2:29:24<18:50, 27.56s/it] {'loss': 0.5255, 'learning_rate': 6.229635702223325e-07, 'epoch': 2.66} 89%|████████▉ | 334/375 [2:29:24<18:50, 27.56s/it] 89%|████████▉ | 335/375 [2:29:50<18:12, 27.30s/it] {'loss': 0.5094, 'learning_rate': 5.932461680248014e-07, 'epoch': 2.67} 89%|████████▉ | 335/375 [2:29:50<18:12, 27.30s/it] 90%|████████▉ | 336/375 [2:30:17<17:41, 27.21s/it] {'loss': 0.4925, 'learning_rate': 5.64233334880181e-07, 'epoch': 2.68} 90%|████████▉ | 336/375 [2:30:17<17:41, 27.21s/it] 90%|████████▉ | 337/375 [2:30:43<16:58, 26.81s/it] {'loss': 0.511, 'learning_rate': 5.359272438595153e-07, 'epoch': 2.69} 90%|████████▉ | 337/375 [2:30:43<16:58, 26.81s/it] 90%|█████████ | 338/375 [2:31:11<16:46, 27.22s/it] {'loss': 0.5087, 'learning_rate': 5.083300150986259e-07, 'epoch': 2.69} 90%|█████████ | 338/375 [2:31:11<16:46, 27.22s/it] 90%|█████████ | 339/375 [2:31:37<15:59, 26.66s/it] {'loss': 0.5189, 'learning_rate': 4.814437156393048e-07, 'epoch': 2.7} 90%|█████████ | 339/375 [2:31:37<15:59, 26.66s/it] 91%|█████████ | 340/375 [2:32:02<15:22, 26.36s/it] {'loss': 0.4896, 'learning_rate': 4.5527035927450337e-07, 'epoch': 2.71} 91%|█████████ | 340/375 [2:32:02<15:22, 26.36s/it] 91%|█████████ | 341/375 [2:32:29<14:55, 26.34s/it] {'loss': 0.5004, 'learning_rate': 4.298119063974915e-07, 'epoch': 2.72} 91%|█████████ | 341/375 [2:32:29<14:55, 26.34s/it] 91%|█████████ | 342/375 [2:32:57<14:52, 27.05s/it] {'loss': 0.5096, 'learning_rate': 4.0507026385502747e-07, 'epoch': 2.73} 91%|█████████ | 342/375 [2:32:58<14:52, 27.05s/it] 91%|█████████▏| 343/375 [2:33:24<14:19, 26.86s/it] {'loss': 0.5195, 'learning_rate': 3.810472848045266e-07, 'epoch': 2.73} 91%|█████████▏| 343/375 [2:33:24<14:19, 26.86s/it] 92%|█████████▏| 344/375 [2:33:50<13:41, 26.51s/it] {'loss': 0.5102, 
'learning_rate': 3.5774476857527107e-07, 'epoch': 2.74} 92%|█████████▏| 344/375 [2:33:50<13:41, 26.51s/it] 92%|█████████▏| 345/375 [2:34:17<13:21, 26.72s/it] {'loss': 0.4774, 'learning_rate': 3.3516446053363015e-07, 'epoch': 2.75} 92%|█████████▏| 345/375 [2:34:17<13:21, 26.72s/it] 92%|█████████▏| 346/375 [2:34:43<12:47, 26.46s/it] {'loss': 0.5122, 'learning_rate': 3.1330805195233684e-07, 'epoch': 2.76} 92%|█████████▏| 346/375 [2:34:43<12:47, 26.46s/it] 93%|█████████▎| 347/375 [2:35:09<12:17, 26.34s/it] {'loss': 0.4839, 'learning_rate': 2.921771798838069e-07, 'epoch': 2.76} 93%|█████████▎| 347/375 [2:35:09<12:17, 26.34s/it] 93%|█████████▎| 348/375 [2:35:37<12:04, 26.85s/it] {'loss': 0.4781, 'learning_rate': 2.717734270375272e-07, 'epoch': 2.77} 93%|█████████▎| 348/375 [2:35:37<12:04, 26.85s/it] 93%|█████████▎| 349/375 [2:36:05<11:46, 27.18s/it] {'loss': 0.5338, 'learning_rate': 2.520983216615047e-07, 'epoch': 2.78} 93%|█████████▎| 349/375 [2:36:05<11:46, 27.18s/it] 93%|█████████▎| 350/375 [2:36:32<11:18, 27.14s/it] {'loss': 0.5009, 'learning_rate': 2.3315333742780942e-07, 'epoch': 2.79} 93%|█████████▎| 350/375 [2:36:32<11:18, 27.14s/it] 94%|█████████▎| 351/375 [2:36:59<10:54, 27.26s/it] {'loss': 0.4998, 'learning_rate': 2.1493989332218468e-07, 'epoch': 2.8} 94%|█████████▎| 351/375 [2:36:59<10:54, 27.26s/it] 94%|█████████▍| 352/375 [2:37:28<10:35, 27.63s/it] {'loss': 0.5426, 'learning_rate': 1.9745935353777222e-07, 'epoch': 2.8} 94%|█████████▍| 352/375 [2:37:28<10:35, 27.63s/it] 94%|█████████▍| 353/375 [2:37:56<10:11, 27.79s/it] {'loss': 0.4944, 'learning_rate': 1.8071302737293294e-07, 'epoch': 2.81} 94%|█████████▍| 353/375 [2:37:56<10:11, 27.79s/it] 94%|█████████▍| 354/375 [2:38:23<09:36, 27.48s/it] {'loss': 0.5074, 'learning_rate': 1.6470216913317628e-07, 'epoch': 2.82} 94%|█████████▍| 354/375 [2:38:23<09:36, 27.48s/it] 95%|█████████▍| 355/375 [2:38:51<09:15, 27.77s/it] {'loss': 0.4987, 'learning_rate': 1.4942797803721543e-07, 'epoch': 2.83} 95%|█████████▍| 355/375 
[2:38:51<09:15, 27.77s/it] 95%|█████████▍| 356/375 [2:39:18<08:41, 27.45s/it] {'loss': 0.5302, 'learning_rate': 1.348915981271437e-07, 'epoch': 2.84} 95%|█████████▍| 356/375 [2:39:18<08:41, 27.45s/it] 95%|█████████▌| 357/375 [2:39:44<08:07, 27.10s/it] {'loss': 0.4965, 'learning_rate': 1.2109411818274851e-07, 'epoch': 2.84} 95%|█████████▌| 357/375 [2:39:44<08:07, 27.10s/it] 95%|█████████▌| 358/375 [2:40:12<07:43, 27.26s/it] {'loss': 0.5119, 'learning_rate': 1.0803657163995896e-07, 'epoch': 2.85} 95%|█████████▌| 358/375 [2:40:12<07:43, 27.26s/it] 96%|█████████▌| 359/375 [2:40:39<07:17, 27.34s/it] {'loss': 0.4805, 'learning_rate': 9.571993651343869e-08, 'epoch': 2.86} 96%|█████████▌| 359/375 [2:40:39<07:17, 27.34s/it] 96%|█████████▌| 360/375 [2:41:07<06:52, 27.50s/it] {'loss': 0.5057, 'learning_rate': 8.41451353233369e-08, 'epoch': 2.87} 96%|█████████▌| 360/375 [2:41:07<06:52, 27.50s/it] 96%|█████████▋| 361/375 [2:41:33<06:19, 27.12s/it] {'loss': 0.4891, 'learning_rate': 7.331303502618903e-08, 'epoch': 2.88} 96%|█████████▋| 361/375 [2:41:33<06:19, 27.12s/it] 97%|█████████▋| 362/375 [2:42:00<05:49, 26.86s/it] {'loss': 0.5223, 'learning_rate': 6.32244469499832e-08, 'epoch': 2.88} 97%|█████████▋| 362/375 [2:42:00<05:49, 26.86s/it] 97%|█████████▋| 363/375 [2:42:26<05:21, 26.77s/it] {'loss': 0.5151, 'learning_rate': 5.388012673338661e-08, 'epoch': 2.89} 97%|█████████▋| 363/375 [2:42:26<05:21, 26.77s/it] 97%|█████████▋| 364/375 [2:42:54<04:58, 27.10s/it] {'loss': 0.4966, 'learning_rate': 4.528077426915412e-08, 'epoch': 2.9} 97%|█████████▋| 364/375 [2:42:54<04:58, 27.10s/it] 97%|█████████▋| 365/375 [2:43:20<04:27, 26.79s/it] {'loss': 0.5096, 'learning_rate': 3.7427033651702414e-08, 'epoch': 2.91} 97%|█████████▋| 365/375 [2:43:20<04:27, 26.79s/it] 98%|█████████▊| 366/375 [2:43:47<04:00, 26.75s/it] {'loss': 0.5051, 'learning_rate': 3.03194931288664e-08, 'epoch': 2.92} 98%|█████████▊| 366/375 [2:43:47<04:00, 26.75s/it] 98%|█████████▊| 367/375 [2:44:14<03:34, 26.75s/it] {'loss': 
0.5117, 'learning_rate': 2.3958685057844378e-08, 'epoch': 2.92} 98%|█████████▊| 367/375 [2:44:14<03:34, 26.75s/it] 98%|█████████▊| 368/375 [2:44:41<03:08, 26.88s/it] {'loss': 0.4816, 'learning_rate': 1.83450858653178e-08, 'epoch': 2.93} 98%|█████████▊| 368/375 [2:44:41<03:08, 26.88s/it] 98%|█████████▊| 369/375 [2:45:08<02:41, 26.98s/it] {'loss': 0.4839, 'learning_rate': 1.3479116011769766e-08, 'epoch': 2.94} 98%|█████████▊| 369/375 [2:45:08<02:41, 26.98s/it] 99%|█████████▊| 370/375 [2:45:34<02:13, 26.74s/it] {'loss': 0.5347, 'learning_rate': 9.361139959993549e-09, 'epoch': 2.95} 99%|█████████▊| 370/375 [2:45:34<02:13, 26.74s/it] 99%|█████████▉| 371/375 [2:46:01<01:47, 26.91s/it] {'loss': 0.5263, 'learning_rate': 5.991466147791114e-09, 'epoch': 2.96} 99%|█████████▉| 371/375 [2:46:01<01:47, 26.91s/it] 99%|█████████▉| 372/375 [2:46:29<01:21, 27.26s/it] {'loss': 0.5253, 'learning_rate': 3.3703469648760367e-09, 'epoch': 2.96} 99%|█████████▉| 372/375 [2:46:29<01:21, 27.26s/it] 99%|█████████▉| 373/375 [2:46:58<00:55, 27.57s/it] {'loss': 0.5517, 'learning_rate': 1.497978733961958e-09, 'epoch': 2.97} 99%|█████████▉| 373/375 [2:46:58<00:55, 27.57s/it] 100%|█████████▉| 374/375 [2:47:24<00:27, 27.15s/it] {'loss': 0.5018, 'learning_rate': 3.745016960665648e-10, 'epoch': 2.98} 100%|█████████▉| 374/375 [2:47:24<00:27, 27.15s/it] 100%|██████████| 375/375 [2:47:53<00:00, 27.60s/it] {'loss': 0.514, 'learning_rate': 0.0, 'epoch': 2.99} 100%|██████████| 375/375 [2:47:53<00:00, 27.60s/it][INFO|trainer.py:2025] 2024-02-08 19:59:28,942 >> Training completed. 
Do not forget to share your model on huggingface.co/models =)
{'train_runtime': 10075.457, 'train_samples_per_second': 19.117, 'train_steps_per_second': 0.037, 'train_loss': 0.765017323811849, 'epoch': 2.99}
100%|██████████| 375/375 [2:47:55<00:00, 27.60s/it]
100%|██████████| 375/375 [2:47:55<00:00, 26.87s/it]
[INFO|trainer.py:2830] 2024-02-08 19:59:28,968 >> Saving model checkpoint to ./checkpoints_ctthensft/fortranslation/ac/allm-addac-alpacanewstest17to20-ac-7b
[INFO|configuration_utils.py:457] 2024-02-08 19:59:28,977 >> Configuration saved in ./checkpoints_ctthensft/fortranslation/ac/allm-addac-alpacanewstest17to20-ac-7b/config.json
[INFO|configuration_utils.py:362] 2024-02-08 19:59:28,982 >> Configuration saved in ./checkpoints_ctthensft/fortranslation/ac/allm-addac-alpacanewstest17to20-ac-7b/generation_config.json
[INFO|modeling_utils.py:1759] 2024-02-08 19:59:29,012 >> Model weights saved in ./checkpoints_ctthensft/fortranslation/ac/allm-addac-alpacanewstest17to20-ac-7b/pytorch_model.bin
[INFO|tokenization_utils_base.py:2164] 2024-02-08 19:59:29,014 >> tokenizer config file saved in ./checkpoints_ctthensft/fortranslation/ac/allm-addac-alpacanewstest17to20-ac-7b/tokenizer_config.json
[INFO|tokenization_utils_base.py:2171] 2024-02-08 19:59:29,015 >> Special tokens file saved in ./checkpoints_ctthensft/fortranslation/ac/allm-addac-alpacanewstest17to20-ac-7b/special_tokens_map.json
[INFO|tokenization_utils_base.py:2221] 2024-02-08 19:59:29,017 >> added tokens file saved in ./checkpoints_ctthensft/fortranslation/ac/allm-addac-alpacanewstest17to20-ac-7b/added_tokens.json
[2024-02-08 19:59:34,975] [INFO] [logging.py:96:log_dist] [Rank 0] [Torch] Checkpoint global_step375 is about to be saved!
[2024-02-08 19:59:34,976] [INFO] [engine.py:3492:save_16bit_model] Saving model weights to ./checkpoints_ctthensft/fortranslation/ac/allm-addac-alpacanewstest17to20-ac-7b/pytorch_model.bin, tag: global_step375
[2024-02-08 19:59:34,976] [INFO] [torch_checkpoint_engine.py:21:save] [Torch] Saving ./checkpoints_ctthensft/fortranslation/ac/allm-addac-alpacanewstest17to20-ac-7b/pytorch_model.bin...
[2024-02-08 19:59:50,664] [INFO] [torch_checkpoint_engine.py:23:save] [Torch] Saved ./checkpoints_ctthensft/fortranslation/ac/allm-addac-alpacanewstest17to20-ac-7b/pytorch_model.bin.
[2024-02-08 19:59:50,665] [INFO] [torch_checkpoint_engine.py:33:commit] [Torch] Checkpoint global_step375 is ready now!
***** train metrics *****
  epoch                    =       2.99
  train_loss               =      0.765
  train_runtime            = 2:47:55.45
  train_samples            =      64204
  train_samples_per_second =     19.117
  train_steps_per_second   =      0.037
[INFO|modelcard.py:451] 2024-02-08 19:59:50,683 >> Dropping the following result as it does not have all the necessary fields: {'task': {'name': 'Causal Language Modeling', 'type': 'text-generation'}}