07/22/2022 12:31:56 - WARNING - __main__ - Process rank: -1, device: cuda:0, n_gpu: 1, distributed training: False, 16-bits training: True
07/22/2022 12:31:56 - INFO - __main__ - Training/evaluation parameters TrainingArguments(
_n_gpu=1,
adafactor=False,
adam_beta1=0.9,
adam_beta2=0.999,
adam_epsilon=1e-08,
auto_find_batch_size=False,
bf16=False,
bf16_full_eval=False,
data_seed=None,
dataloader_drop_last=False,
dataloader_num_workers=0,
dataloader_pin_memory=True,
ddp_bucket_cap_mb=None,
ddp_find_unused_parameters=None,
debug=[],
deepspeed=None,
disable_tqdm=False,
do_eval=True,
do_predict=True,
do_train=True,
eval_accumulation_steps=None,
eval_delay=0,
eval_steps=None,
evaluation_strategy=IntervalStrategy.NO,
fp16=True,
fp16_backend=auto,
fp16_full_eval=False,
fp16_opt_level=O1,
fsdp=[],
fsdp_min_num_params=0,
full_determinism=False,
gradient_accumulation_steps=1,
gradient_checkpointing=False,
greater_is_better=None,
group_by_length=False,
half_precision_backend=auto,
hub_model_id=None,
hub_private_repo=False,
hub_strategy=HubStrategy.EVERY_SAVE,
hub_token=<HUB_TOKEN>,
ignore_data_skip=False,
include_inputs_for_metrics=False,
jit_mode_eval=False,
label_names=None,
label_smoothing_factor=0.0,
learning_rate=5e-05,
length_column_name=length,
load_best_model_at_end=False,
local_rank=-1,
log_level=-1,
log_level_replica=-1,
log_on_each_node=True,
logging_dir=runs/ebmnlp_hf/BioLinkBERT-base/runs/Jul22_12-31-56_spartan-gpgpu080.hpc.unimelb.edu.au,
logging_first_step=False,
logging_nan_inf_filter=True,
logging_steps=500,
logging_strategy=IntervalStrategy.STEPS,
lr_scheduler_type=SchedulerType.LINEAR,
max_grad_norm=1.0,
max_steps=-1,
metric_for_best_model=None,
mp_parameters=,
no_cuda=False,
num_train_epochs=1.0,
optim=OptimizerNames.ADAMW_HF,
output_dir=runs/ebmnlp_hf/BioLinkBERT-base,
overwrite_output_dir=True,
past_index=-1,
per_device_eval_batch_size=8,
per_device_train_batch_size=32,
prediction_loss_only=False,
push_to_hub=False,
push_to_hub_model_id=None,
push_to_hub_organization=None,
push_to_hub_token=<PUSH_TO_HUB_TOKEN>,
ray_scope=last,
remove_unused_columns=True,
report_to=['tensorboard'],
resume_from_checkpoint=None,
run_name=runs/ebmnlp_hf/BioLinkBERT-base,
save_on_each_node=False,
save_steps=500,
save_strategy=IntervalStrategy.NO,
save_total_limit=None,
seed=42,
sharded_ddp=[],
skip_memory_metrics=True,
tf32=None,
torchdynamo=None,
tpu_metrics_debug=False,
tpu_num_cores=None,
use_ipex=False,
use_legacy_prediction_loop=False,
warmup_ratio=0.0,
warmup_steps=0,
weight_decay=0.0,
xpu_backend=None,
)
07/22/2022 12:31:57 - WARNING - datasets.builder - Using custom data configuration default-2d9cec4b8a27d237
07/22/2022 12:31:57 - INFO - datasets.builder - Overwrite dataset info from restored data version.
07/22/2022 12:31:57 - INFO - datasets.info - Loading Dataset info from /home/hungthinht/.cache/huggingface/datasets/json/default-2d9cec4b8a27d237/0.0.0/da492aad5680612e4028e7f6ddc04b1dfcec4b64db470ed7cc5f2bb265b9b6b5
07/22/2022 12:31:57 - WARNING - datasets.builder - Reusing dataset json (/home/hungthinht/.cache/huggingface/datasets/json/default-2d9cec4b8a27d237/0.0.0/da492aad5680612e4028e7f6ddc04b1dfcec4b64db470ed7cc5f2bb265b9b6b5)
07/22/2022 12:31:57 - INFO - datasets.info - Loading Dataset info from /home/hungthinht/.cache/huggingface/datasets/json/default-2d9cec4b8a27d237/0.0.0/da492aad5680612e4028e7f6ddc04b1dfcec4b64db470ed7cc5f2bb265b9b6b5

  0%|          | 0/3 [00:00<?, ?it/s]
100%|██████████| 3/3 [00:00<00:00, 491.92it/s]
[INFO|configuration_utils.py:659] 2022-07-22 12:31:59,048 >> loading configuration file https://huggingface.co/michiyasunaga/BioLinkBERT-base/resolve/main/config.json from cache at /home/hungthinht/.cache/huggingface/transformers/ad032c76cac1f75bba037ba006dcccc1c62ab157749b194df023bfa55e5f4fbf.22ae3f7c73ebda8488a8505a67c1b929a707ae7db67a129f60b7c28acfc38436
[INFO|configuration_utils.py:708] 2022-07-22 12:31:59,083 >> Model config BertConfig {
  "_name_or_path": "michiyasunaga/BioLinkBERT-base",
  "architectures": [
    "BertModel"
  ],
  "attention_probs_dropout_prob": 0.1,
  "classifier_dropout": null,
  "finetuning_task": "ner",
  "gradient_checkpointing": false,
  "hidden_act": "gelu",
  "hidden_dropout_prob": 0.1,
  "hidden_size": 768,
  "id2label": {
    "0": "B-INT",
    "1": "B-OUT",
    "2": "B-PAR",
    "3": "O"
  },
  "initializer_range": 0.02,
  "intermediate_size": 3072,
  "label2id": {
    "B-INT": 0,
    "B-OUT": 1,
    "B-PAR": 2,
    "O": 3
  },
  "layer_norm_eps": 1e-12,
  "max_position_embeddings": 512,
  "model_type": "bert",
  "num_attention_heads": 12,
  "num_hidden_layers": 12,
  "pad_token_id": 0,
  "position_embedding_type": "absolute",
  "transformers_version": "4.20.1",
  "type_vocab_size": 2,
  "use_cache": true,
  "vocab_size": 28895
}

[INFO|tokenization_utils_base.py:1781] 2022-07-22 12:32:05,294 >> loading file https://huggingface.co/michiyasunaga/BioLinkBERT-base/resolve/main/vocab.txt from cache at /home/hungthinht/.cache/huggingface/transformers/9eb712b5fcba51331b49cb69f18de1577371a2582055a298e2546c0c97d3b924.73b5c069d3e40205dd2df2379051c9f47d13c3bad0bcb3cee659c69e3a185a86
[INFO|tokenization_utils_base.py:1781] 2022-07-22 12:32:05,294 >> loading file https://huggingface.co/michiyasunaga/BioLinkBERT-base/resolve/main/tokenizer.json from cache at /home/hungthinht/.cache/huggingface/transformers/3c720cf86b025f815b1d833b6b39db05e8e7493b6f6a87788c485a946848b4d8.a25e24b89fd9bfd32e3c8d2dbb39879c62152e7f069ab24c97198c004cad94c9
[INFO|tokenization_utils_base.py:1781] 2022-07-22 12:32:05,294 >> loading file https://huggingface.co/michiyasunaga/BioLinkBERT-base/resolve/main/added_tokens.json from cache at None
[INFO|tokenization_utils_base.py:1781] 2022-07-22 12:32:05,294 >> loading file https://huggingface.co/michiyasunaga/BioLinkBERT-base/resolve/main/special_tokens_map.json from cache at /home/hungthinht/.cache/huggingface/transformers/0598867425495ec6baf3617ab3789f3d8b84ebf869f7b43aa4a2930195a74dbe.dd8bd9bfd3664b530ea4e645105f557769387b3da9f79bdb55ed556bdd80611d
[INFO|tokenization_utils_base.py:1781] 2022-07-22 12:32:05,294 >> loading file https://huggingface.co/michiyasunaga/BioLinkBERT-base/resolve/main/tokenizer_config.json from cache at /home/hungthinht/.cache/huggingface/transformers/30e2841862fd496cf36bc8647c9633a1dc319fbf6cc88a80438ca3f89e28339b.fab032bd2aab224bad4dcfc35e3bd6122976da1fa23e4feeb97d8fa65491aded
[INFO|modeling_utils.py:2107] 2022-07-22 12:32:06,276 >> loading weights file https://huggingface.co/michiyasunaga/BioLinkBERT-base/resolve/main/pytorch_model.bin from cache at /home/hungthinht/.cache/huggingface/transformers/76a88449a3eb7019bbc0d164cc39a6a231c8bbe3b9678b8d40977424f0ad934d.f8b95ad9e1dea734685fba5a5b6142b539678b7fc2311981cc14ae61b19f709d
[INFO|modeling_utils.py:2483] 2022-07-22 12:32:07,350 >> All model checkpoint weights were used when initializing BertForTokenClassification.

[WARNING|modeling_utils.py:2485] 2022-07-22 12:32:07,350 >> Some weights of BertForTokenClassification were not initialized from the model checkpoint at michiyasunaga/BioLinkBERT-base and are newly initialized: ['classifier.weight', 'classifier.bias']
You should probably TRAIN this model on a down-stream task to be able to use it for predictions and inference.
07/22/2022 12:32:07 - WARNING - datasets.fingerprint - Parameter 'function'=<function main.<locals>.tokenize_and_align_labels at 0x2ac6e9964940> of the transform datasets.arrow_dataset.Dataset._map_single couldn't be hashed properly, a random hash was used instead. Make sure your transforms and parameters are serializable with pickle or dill for the dataset fingerprinting and caching to work. If you reuse this transform, the caching mechanism will consider it to be different from the previous calls and recompute everything. This warning is only showed once. Subsequent hashing failures won't be showed.
07/22/2022 12:32:07 - WARNING - datasets.arrow_dataset - Loading cached processed dataset at /home/hungthinht/.cache/huggingface/datasets/json/default-2d9cec4b8a27d237/0.0.0/da492aad5680612e4028e7f6ddc04b1dfcec4b64db470ed7cc5f2bb265b9b6b5/cache-1c80317fa3b1799d.arrow
07/22/2022 12:32:07 - INFO - datasets.fingerprint - Parameter 'function'=<function main.<locals>.tokenize_and_align_labels at 0x2ac6e99b3d30> of the transform datasets.arrow_dataset.Dataset._map_single couldn't be hashed properly, a random hash was used instead.
07/22/2022 12:32:07 - WARNING - datasets.arrow_dataset - Loading cached processed dataset at /home/hungthinht/.cache/huggingface/datasets/json/default-2d9cec4b8a27d237/0.0.0/da492aad5680612e4028e7f6ddc04b1dfcec4b64db470ed7cc5f2bb265b9b6b5/cache-bdd640fb06671ad1.arrow
07/22/2022 12:32:07 - INFO - datasets.fingerprint - Parameter 'function'=<function main.<locals>.tokenize_and_align_labels at 0x2ac6e9964940> of the transform datasets.arrow_dataset.Dataset._map_single couldn't be hashed properly, a random hash was used instead.
07/22/2022 12:32:07 - WARNING - datasets.arrow_dataset - Loading cached processed dataset at /home/hungthinht/.cache/huggingface/datasets/json/default-2d9cec4b8a27d237/0.0.0/da492aad5680612e4028e7f6ddc04b1dfcec4b64db470ed7cc5f2bb265b9b6b5/cache-3eb13b9046685257.arrow
[INFO|trainer.py:533] 2022-07-22 12:32:09,812 >> Using cuda_amp half precision backend
[INFO|trainer.py:661] 2022-07-22 12:32:09,812 >> The following columns in the training set don't have a corresponding argument in `BertForTokenClassification.forward` and have been ignored: id, ner_tags, word_ids, tokens. If id, ner_tags, word_ids, tokens are not expected by `BertForTokenClassification.forward`,  you can safely ignore this message.
/home/hungthinht/miniconda3/lib/python3.9/site-packages/transformers/optimization.py:306: FutureWarning: This implementation of AdamW is deprecated and will be removed in a future version. Use the PyTorch implementation torch.optim.AdamW instead, or set `no_deprecation_warning=True` to disable this warning
  warnings.warn(
[INFO|trainer.py:1516] 2022-07-22 12:32:09,838 >> ***** Running training *****
[INFO|trainer.py:1517] 2022-07-22 12:32:09,838 >>   Num examples = 40935
[INFO|trainer.py:1518] 2022-07-22 12:32:09,838 >>   Num Epochs = 1
[INFO|trainer.py:1519] 2022-07-22 12:32:09,838 >>   Instantaneous batch size per device = 32
[INFO|trainer.py:1520] 2022-07-22 12:32:09,838 >>   Total train batch size (w. parallel, distributed & accumulation) = 32
[INFO|trainer.py:1521] 2022-07-22 12:32:09,838 >>   Gradient Accumulation steps = 1
[INFO|trainer.py:1522] 2022-07-22 12:32:09,838 >>   Total optimization steps = 1280
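
Note (not part of the original log): the step count reported above can be cross-checked from the other logged values. The sketch below assumes one optimizer step per batch and a final partial batch that is kept (`dataloader_drop_last=False` in the arguments dump). The variable names are illustrative, not taken from the training script.

```python
import math

# Values copied from the log above.
num_examples = 40935
per_device_train_batch_size = 32
n_gpu = 1
gradient_accumulation_steps = 1
num_train_epochs = 1

# Effective batch size across devices and accumulation steps.
total_batch_size = per_device_train_batch_size * n_gpu * gradient_accumulation_steps

# With the last partial batch kept, steps per epoch round up.
steps_per_epoch = math.ceil(num_examples / total_batch_size)
total_steps = steps_per_epoch * num_train_epochs

print(total_batch_size, total_steps)
```

Under these assumptions the result agrees with the logged "Total train batch size" and "Total optimization steps".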

  0%|          | 0/1280 [00:00<?, ?it/s]
  0%|          | 1/1280 [00:00<13:42,  1.56it/s]
  0%|          | 3/1280 [00:00<04:50,  4.39it/s]
  0%|          | 5/1280 [00:00<03:11,  6.65it/s]
  1%|          | 7/1280 [00:01<02:31,  8.43it/s]
  1%|          | 9/1280 [00:01<02:06, 10.01it/s]
  1%|          | 11/1280 [00:01<01:59, 10.62it/s]
  1%|          | 13/1280 [00:01<01:49, 11.60it/s]
  1%|          | 15/1280 [00:01<01:42, 12.29it/s]
  1%|▏         | 17/1280 [00:01<01:37, 12.91it/s]
  1%|▏         | 19/1280 [00:01<01:33, 13.42it/s]
  2%|▏         | 21/1280 [00:02<01:34, 13.33it/s]
  2%|▏         | 23/1280 [00:02<01:34, 13.28it/s]
  2%|▏         | 25/1280 [00:02<01:35, 13.19it/s]
  2%|▏         | 27/1280 [00:02<01:32, 13.51it/s]
  2%|▏         | 29/1280 [00:02<01:33, 13.35it/s]
  2%|▏         | 31/1280 [00:02<01:37, 12.82it/s]
  3%|▎         | 33/1280 [00:03<01:36, 12.92it/s]
  3%|▎         | 35/1280 [00:03<01:34, 13.13it/s]
  3%|▎         | 37/1280 [00:03<01:40, 12.40it/s]
  3%|▎         | 39/1280 [00:03<01:35, 13.03it/s]
  3%|▎         | 41/1280 [00:03<01:35, 12.98it/s]
  3%|▎         | 43/1280 [00:03<01:33, 13.28it/s]
  4%|▎         | 45/1280 [00:03<01:33, 13.23it/s]
  4%|▎         | 47/1280 [00:04<01:29, 13.78it/s]
  4%|▍         | 49/1280 [00:04<01:26, 14.19it/s]
  4%|▍         | 51/1280 [00:04<01:31, 13.42it/s]
  4%|▍         | 53/1280 [00:04<01:30, 13.51it/s]
  4%|▍         | 55/1280 [00:04<01:34, 12.92it/s]
  4%|▍         | 57/1280 [00:04<01:35, 12.77it/s]
  5%|▍         | 59/1280 [00:05<01:34, 12.95it/s]
  5%|▍         | 61/1280 [00:05<01:31, 13.35it/s]
  5%|▍         | 63/1280 [00:05<01:30, 13.39it/s]
  5%|▌         | 65/1280 [00:05<01:28, 13.77it/s]
  5%|▌         | 67/1280 [00:05<01:28, 13.66it/s]
  5%|▌         | 69/1280 [00:05<01:28, 13.61it/s]
  6%|▌         | 71/1280 [00:05<01:26, 14.00it/s]
  6%|▌         | 73/1280 [00:06<01:34, 12.73it/s]
  6%|▌         | 75/1280 [00:06<01:32, 12.96it/s]
  6%|▌         | 77/1280 [00:06<01:28, 13.52it/s]
  6%|▌         | 79/1280 [00:06<01:27, 13.80it/s]
  6%|▋         | 81/1280 [00:06<01:26, 13.85it/s]
  6%|▋         | 83/1280 [00:06<01:25, 13.96it/s]
  7%|▋         | 85/1280 [00:06<01:23, 14.27it/s]
  7%|▋         | 87/1280 [00:07<01:23, 14.30it/s]
  7%|▋         | 89/1280 [00:07<01:24, 14.14it/s]
  7%|▋         | 91/1280 [00:07<01:23, 14.22it/s]
  7%|▋         | 93/1280 [00:07<01:22, 14.45it/s]
  7%|▋         | 95/1280 [00:07<01:24, 14.05it/s]
  8%|▊         | 97/1280 [00:07<01:26, 13.74it/s]
  8%|▊         | 99/1280 [00:07<01:28, 13.27it/s]
  8%|▊         | 101/1280 [00:08<01:29, 13.11it/s]
  8%|▊         | 103/1280 [00:08<01:29, 13.17it/s]
  8%|▊         | 105/1280 [00:08<01:28, 13.34it/s]
  8%|▊         | 107/1280 [00:08<01:24, 13.82it/s]
  9%|▊         | 109/1280 [00:08<01:28, 13.30it/s]
  9%|▊         | 111/1280 [00:08<01:25, 13.65it/s]
  9%|▉         | 113/1280 [00:08<01:23, 14.05it/s]
  9%|▉         | 115/1280 [00:09<01:32, 12.63it/s]
  9%|▉         | 117/1280 [00:09<01:27, 13.26it/s]
  9%|▉         | 119/1280 [00:09<01:24, 13.80it/s]
  9%|▉         | 121/1280 [00:09<01:23, 13.93it/s]
 10%|▉         | 123/1280 [00:09<01:25, 13.57it/s]
 10%|▉         | 125/1280 [00:09<01:25, 13.44it/s]
 10%|▉         | 127/1280 [00:09<01:24, 13.63it/s]
 10%|█         | 129/1280 [00:10<01:23, 13.71it/s]
 10%|█         | 131/1280 [00:10<01:21, 14.15it/s]
 10%|█         | 133/1280 [00:10<01:22, 13.91it/s]
 11%|█         | 135/1280 [00:10<01:22, 13.90it/s]
 11%|█         | 137/1280 [00:10<01:23, 13.75it/s]
 11%|█         | 139/1280 [00:10<01:21, 13.96it/s]
 11%|█         | 141/1280 [00:10<01:19, 14.30it/s]
 11%|█         | 143/1280 [00:11<01:18, 14.40it/s]
 11%|█▏        | 145/1280 [00:11<01:20, 14.08it/s]
 11%|█▏        | 147/1280 [00:11<01:20, 14.06it/s]
 12%|█▏        | 149/1280 [00:11<01:18, 14.34it/s]
 12%|█▏        | 151/1280 [00:11<01:18, 14.34it/s]
 12%|█▏        | 153/1280 [00:11<01:22, 13.70it/s]
 12%|█▏        | 155/1280 [00:11<01:22, 13.71it/s]
 12%|█▏        | 157/1280 [00:12<01:20, 14.03it/s]
 12%|█▏        | 159/1280 [00:12<01:20, 13.98it/s]
 13%|█▎        | 161/1280 [00:12<01:23, 13.40it/s]
 13%|█▎        | 163/1280 [00:12<01:24, 13.14it/s]
 13%|█▎        | 165/1280 [00:12<01:29, 12.51it/s]
 13%|█▎        | 167/1280 [00:12<01:24, 13.16it/s]
 13%|█▎        | 169/1280 [00:13<01:24, 13.13it/s]
 13%|█▎        | 171/1280 [00:13<01:20, 13.70it/s]
 14%|█▎        | 173/1280 [00:13<01:17, 14.25it/s]
 14%|█▎        | 175/1280 [00:13<01:17, 14.27it/s]
 14%|█▍        | 177/1280 [00:13<01:23, 13.16it/s]
 14%|█▍        | 179/1280 [00:13<01:20, 13.74it/s]
 14%|█▍        | 181/1280 [00:13<01:22, 13.29it/s]
 14%|█▍        | 183/1280 [00:14<01:19, 13.74it/s]
 14%|█▍        | 185/1280 [00:14<01:17, 14.13it/s]
 15%|█▍        | 187/1280 [00:14<01:14, 14.60it/s]
 15%|█▍        | 189/1280 [00:14<01:20, 13.53it/s]
 15%|█▍        | 191/1280 [00:14<01:20, 13.60it/s]
 15%|█▌        | 193/1280 [00:14<01:18, 13.93it/s]
 15%|█▌        | 195/1280 [00:14<01:17, 14.06it/s]
 15%|█▌        | 197/1280 [00:15<01:16, 14.21it/s]
 16%|█▌        | 199/1280 [00:15<01:20, 13.44it/s]
 16%|█▌        | 201/1280 [00:15<01:18, 13.80it/s]
 16%|█▌        | 203/1280 [00:15<01:16, 14.17it/s]
 16%|█▌        | 205/1280 [00:15<01:16, 14.09it/s]
 16%|█▌        | 207/1280 [00:15<01:15, 14.28it/s]
 16%|█▋        | 209/1280 [00:15<01:17, 13.82it/s]
 16%|█▋        | 211/1280 [00:16<01:18, 13.62it/s]
 17%|█▋        | 213/1280 [00:16<01:26, 12.31it/s]
 17%|█▋        | 215/1280 [00:16<01:26, 12.25it/s]
 17%|█▋        | 217/1280 [00:16<01:23, 12.66it/s]
 17%|█▋        | 219/1280 [00:16<01:21, 13.10it/s]
 17%|█▋        | 221/1280 [00:16<01:18, 13.57it/s]
 17%|█▋        | 223/1280 [00:16<01:15, 13.92it/s]
 18%|█▊        | 225/1280 [00:17<01:16, 13.76it/s]
 18%|█▊        | 227/1280 [00:17<01:17, 13.56it/s]
 18%|█▊        | 229/1280 [00:17<01:16, 13.79it/s]
 18%|█▊        | 231/1280 [00:17<01:14, 14.05it/s]
 18%|█▊        | 233/1280 [00:17<01:12, 14.36it/s]
 18%|█▊        | 235/1280 [00:17<01:11, 14.62it/s]
 19%|█▊        | 237/1280 [00:17<01:10, 14.76it/s]
 19%|█▊        | 239/1280 [00:18<01:11, 14.54it/s]
 19%|█▉        | 241/1280 [00:18<01:17, 13.43it/s]
 19%|█▉        | 243/1280 [00:18<01:14, 13.94it/s]
 19%|█▉        | 245/1280 [00:18<01:12, 14.21it/s]
 19%|█▉        | 247/1280 [00:18<01:11, 14.43it/s]
 19%|█▉        | 249/1280 [00:18<01:17, 13.35it/s]
 20%|█▉        | 251/1280 [00:18<01:15, 13.55it/s]
 20%|█▉        | 253/1280 [00:19<01:13, 13.88it/s]
 20%|█▉        | 255/1280 [00:19<01:15, 13.57it/s]
 20%|██        | 257/1280 [00:19<01:12, 14.04it/s]
 20%|██        | 259/1280 [00:19<01:13, 13.85it/s]
 20%|██        | 261/1280 [00:19<01:19, 12.82it/s]
 21%|██        | 263/1280 [00:19<01:20, 12.68it/s]
 21%|██        | 265/1280 [00:20<01:23, 12.09it/s]
 21%|██        | 267/1280 [00:20<01:19, 12.68it/s]
 21%|██        | 269/1280 [00:20<01:17, 13.06it/s]
 21%|██        | 271/1280 [00:20<01:16, 13.27it/s]
 21%|██▏       | 273/1280 [00:20<01:14, 13.56it/s]
 21%|██▏       | 275/1280 [00:20<01:16, 13.16it/s]
 22%|██▏       | 277/1280 [00:20<01:16, 13.14it/s]
 22%|██▏       | 279/1280 [00:21<01:16, 13.12it/s]
 22%|██▏       | 281/1280 [00:21<01:22, 12.05it/s]
 22%|██▏       | 283/1280 [00:21<01:20, 12.31it/s]
 22%|██▏       | 285/1280 [00:21<01:19, 12.51it/s]
 22%|██▏       | 287/1280 [00:21<01:16, 13.06it/s]
 23%|██▎       | 289/1280 [00:21<01:14, 13.34it/s]
 23%|██▎       | 291/1280 [00:22<01:12, 13.63it/s]
 23%|██▎       | 293/1280 [00:22<01:11, 13.76it/s]
 23%|██▎       | 295/1280 [00:22<01:11, 13.85it/s]
 23%|██▎       | 297/1280 [00:22<01:18, 12.51it/s]
 23%|██▎       | 299/1280 [00:22<01:26, 11.31it/s]
 24%|██▎       | 301/1280 [00:22<01:27, 11.14it/s]
 24%|██▎       | 303/1280 [00:23<01:26, 11.35it/s]
 24%|██▍       | 305/1280 [00:23<01:19, 12.34it/s]
 24%|██▍       | 307/1280 [00:23<01:19, 12.30it/s]
 24%|██▍       | 309/1280 [00:23<01:15, 12.94it/s]
 24%|██▍       | 311/1280 [00:23<01:10, 13.69it/s]
 24%|██▍       | 313/1280 [00:23<01:12, 13.35it/s]
 25%|██▍       | 315/1280 [00:23<01:12, 13.32it/s]
 25%|██▍       | 317/1280 [00:24<01:14, 13.01it/s]
 25%|██▍       | 319/1280 [00:24<01:10, 13.55it/s]
 25%|██▌       | 321/1280 [00:24<01:08, 14.03it/s]
 25%|██▌       | 323/1280 [00:24<01:06, 14.39it/s]
 25%|██▌       | 325/1280 [00:24<01:07, 14.20it/s]
 26%|██▌       | 327/1280 [00:24<01:06, 14.44it/s]
 26%|██▌       | 329/1280 [00:24<01:09, 13.76it/s]
 26%|██▌       | 331/1280 [00:25<01:11, 13.22it/s]
 26%|██▌       | 333/1280 [00:25<01:13, 12.93it/s]
 26%|██▌       | 335/1280 [00:25<01:11, 13.30it/s]
 26%|██▋       | 337/1280 [00:25<01:09, 13.62it/s]
 26%|██▋       | 339/1280 [00:25<01:07, 13.98it/s]
 27%|██▋       | 341/1280 [00:25<01:05, 14.29it/s]
 27%|██▋       | 343/1280 [00:25<01:05, 14.20it/s]
 27%|██▋       | 345/1280 [00:26<01:05, 14.26it/s]
 27%|██▋       | 347/1280 [00:26<01:04, 14.55it/s]
 27%|██▋       | 349/1280 [00:26<01:02, 14.80it/s]
 27%|██▋       | 351/1280 [00:26<01:03, 14.73it/s]
 28%|██▊       | 353/1280 [00:26<01:09, 13.35it/s]
 28%|██▊       | 355/1280 [00:26<01:08, 13.55it/s]
 28%|██▊       | 357/1280 [00:27<01:09, 13.37it/s]
 28%|██▊       | 359/1280 [00:27<01:07, 13.65it/s]
 28%|██▊       | 361/1280 [00:27<01:05, 14.12it/s]
 28%|██▊       | 363/1280 [00:27<01:08, 13.32it/s]
 29%|██▊       | 365/1280 [00:27<01:16, 12.02it/s]
 29%|██▊       | 367/1280 [00:27<01:11, 12.68it/s]
 29%|██▉       | 369/1280 [00:27<01:11, 12.78it/s]
 29%|██▉       | 371/1280 [00:28<01:17, 11.75it/s]
 29%|██▉       | 373/1280 [00:28<01:16, 11.91it/s]
 29%|██▉       | 375/1280 [00:28<01:13, 12.33it/s]
 29%|██▉       | 377/1280 [00:28<01:10, 12.79it/s]
 30%|██▉       | 379/1280 [00:28<01:06, 13.54it/s]
 30%|██▉       | 381/1280 [00:28<01:04, 13.93it/s]
 30%|██▉       | 383/1280 [00:29<01:07, 13.26it/s]
 30%|███       | 385/1280 [00:29<01:05, 13.67it/s]
 30%|███       | 387/1280 [00:29<01:04, 13.75it/s]
 30%|███       | 389/1280 [00:29<01:03, 14.12it/s]
 31%|███       | 391/1280 [00:29<01:07, 13.17it/s]
 31%|███       | 393/1280 [00:29<01:06, 13.34it/s]
 31%|███       | 395/1280 [00:29<01:04, 13.81it/s]
 31%|███       | 397/1280 [00:30<01:04, 13.63it/s]
 31%|███       | 399/1280 [00:30<01:06, 13.24it/s]
 31%|███▏      | 401/1280 [00:30<01:08, 12.91it/s]
 31%|███▏      | 403/1280 [00:30<01:04, 13.59it/s]
 32%|███▏      | 405/1280 [00:30<01:08, 12.82it/s]
 32%|███▏      | 407/1280 [00:30<01:06, 13.16it/s]
 32%|███▏      | 409/1280 [00:30<01:04, 13.42it/s]
 32%|███▏      | 411/1280 [00:31<01:04, 13.50it/s]
 32%|███▏      | 413/1280 [00:31<01:02, 13.85it/s]
 32%|███▏      | 415/1280 [00:31<01:06, 13.03it/s]
 33%|███▎      | 417/1280 [00:31<01:03, 13.49it/s]
 33%|███▎      | 419/1280 [00:31<01:04, 13.29it/s]
 33%|███▎      | 421/1280 [00:31<01:02, 13.84it/s]
 33%|███▎      | 423/1280 [00:31<01:02, 13.62it/s]
 33%|███▎      | 425/1280 [00:32<01:04, 13.20it/s]
 33%|███▎      | 427/1280 [00:32<01:07, 12.59it/s]
 34%|███▎      | 429/1280 [00:32<01:06, 12.88it/s]
 34%|███▎      | 431/1280 [00:32<01:03, 13.38it/s]
 34%|███▍      | 433/1280 [00:32<01:01, 13.77it/s]
 34%|███▍      | 435/1280 [00:32<00:59, 14.27it/s]
 34%|███▍      | 437/1280 [00:33<00:59, 14.20it/s]
 34%|███▍      | 439/1280 [00:33<00:58, 14.27it/s]
 34%|███▍      | 441/1280 [00:33<01:01, 13.53it/s]
 35%|███▍      | 443/1280 [00:33<01:05, 12.86it/s]
 35%|███▍      | 445/1280 [00:33<01:05, 12.72it/s]
 35%|███▍      | 447/1280 [00:33<01:02, 13.22it/s]
 35%|███▌      | 449/1280 [00:34<01:15, 11.03it/s]
 35%|███▌      | 451/1280 [00:34<01:14, 11.17it/s]
 35%|███▌      | 453/1280 [00:34<01:09, 11.91it/s]
 36%|███▌      | 455/1280 [00:34<01:04, 12.80it/s]
 36%|███▌      | 457/1280 [00:34<01:01, 13.31it/s]
 36%|███▌      | 459/1280 [00:34<00:58, 13.92it/s]
 36%|███▌      | 461/1280 [00:34<01:00, 13.61it/s]
 36%|███▌      | 463/1280 [00:35<00:58, 14.07it/s]
 36%|███▋      | 465/1280 [00:35<00:59, 13.59it/s]
 36%|███▋      | 467/1280 [00:35<00:58, 13.91it/s]
 37%|███▋      | 469/1280 [00:35<00:57, 14.06it/s]
 37%|███▋      | 471/1280 [00:35<01:09, 11.62it/s]
 37%|███▋      | 473/1280 [00:35<01:05, 12.30it/s]
 37%|███▋      | 475/1280 [00:35<01:02, 12.96it/s]
 37%|███▋      | 477/1280 [00:36<01:03, 12.58it/s]
 37%|███▋      | 479/1280 [00:36<01:04, 12.32it/s]
 38%|███▊      | 481/1280 [00:36<01:01, 12.94it/s]
 38%|███▊      | 483/1280 [00:36<01:01, 12.91it/s]
 38%|███▊      | 485/1280 [00:36<00:59, 13.40it/s]
 38%|███▊      | 487/1280 [00:36<00:57, 13.83it/s]
 38%|███▊      | 489/1280 [00:37<00:55, 14.33it/s]
 38%|███▊      | 491/1280 [00:37<00:56, 13.88it/s]
 39%|███▊      | 493/1280 [00:37<00:57, 13.67it/s]
 39%|███▊      | 495/1280 [00:37<00:58, 13.33it/s]
 39%|███▉      | 497/1280 [00:37<01:00, 13.00it/s]
 39%|███▉      | 499/1280 [00:37<01:02, 12.58it/s]
                                                  
{'loss': 0.5034, 'learning_rate': 3.0546875e-05, 'epoch': 0.39}
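
Note (not part of the original log): the learning rate logged at step 500 can be compared against a plain linear decay, which is what `lr_scheduler_type=SchedulerType.LINEAR` with `warmup_steps=0` suggests. The function below is an illustrative sketch of such a schedule, not the Trainer's own code. The logged value matches 498 scheduler steps rather than 500. One possible explanation, which this log does not confirm, is that a couple of early optimizer steps were skipped by fp16 loss scaling.

```python
import math

def linear_lr(base_lr, scheduler_steps, total_steps, warmup_steps=0):
    """Linear warmup followed by linear decay to zero (illustrative helper)."""
    if scheduler_steps < warmup_steps:
        return base_lr * scheduler_steps / max(1, warmup_steps)
    remaining = max(0, total_steps - scheduler_steps)
    return base_lr * remaining / max(1, total_steps - warmup_steps)

base_lr = 5e-05      # learning_rate from the arguments dump
total_steps = 1280   # "Total optimization steps" from the log

lr_at_500 = linear_lr(base_lr, 500, total_steps)  # if every step advanced the schedule
lr_at_498 = linear_lr(base_lr, 498, total_steps)  # assumption: two steps were skipped

print(lr_at_500, lr_at_498)
print(math.isclose(lr_at_498, 3.0546875e-05))  # compare with the logged learning_rate
print(round(500 / total_steps, 2))             # compare with the logged 'epoch' value
```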

 39%|███▉      | 500/1280 [00:37<01:02, 12.58it/s]
 39%|███▉      | 501/1280 [00:37<00:59, 13.01it/s]
 39%|███▉      | 503/1280 [00:38<00:59, 13.00it/s]
 39%|███▉      | 505/1280 [00:38<00:57, 13.56it/s]
 40%|███▉      | 507/1280 [00:38<00:56, 13.65it/s]
 40%|███▉      | 509/1280 [00:38<00:55, 14.00it/s]
 40%|███▉      | 511/1280 [00:38<00:54, 14.15it/s]
 40%|████      | 513/1280 [00:38<00:52, 14.51it/s]
 40%|████      | 515/1280 [00:38<00:51, 14.82it/s]
 40%|████      | 517/1280 [00:39<00:51, 14.91it/s]
 41%|████      | 519/1280 [00:39<00:51, 14.73it/s]
 41%|████      | 521/1280 [00:39<00:50, 14.90it/s]
 41%|████      | 523/1280 [00:39<00:51, 14.74it/s]
 41%|████      | 525/1280 [00:39<00:51, 14.75it/s]
 41%|████      | 527/1280 [00:39<00:51, 14.61it/s]
 41%|████▏     | 529/1280 [00:39<00:53, 14.01it/s]
 41%|████▏     | 531/1280 [00:40<00:53, 13.98it/s]
 42%|████▏     | 533/1280 [00:40<00:55, 13.37it/s]
 42%|████▏     | 535/1280 [00:40<00:53, 13.96it/s]
 42%|████▏     | 537/1280 [00:40<00:51, 14.34it/s]
 42%|████▏     | 539/1280 [00:40<00:50, 14.62it/s]
 42%|████▏     | 541/1280 [00:40<00:51, 14.45it/s]
 42%|████▏     | 543/1280 [00:40<00:51, 14.39it/s]
 43%|████▎     | 545/1280 [00:41<00:51, 14.38it/s]
 43%|████▎     | 547/1280 [00:41<00:53, 13.70it/s]
 43%|████▎     | 549/1280 [00:41<00:52, 13.81it/s]
 43%|████▎     | 551/1280 [00:41<00:51, 14.17it/s]
 43%|████▎     | 553/1280 [00:41<00:49, 14.55it/s]
 43%|████▎     | 555/1280 [00:41<00:50, 14.48it/s]
 44%|████▎     | 557/1280 [00:41<00:49, 14.50it/s]
 44%|████▎     | 559/1280 [00:42<00:52, 13.67it/s]
 44%|████▍     | 561/1280 [00:42<00:54, 13.20it/s]
 44%|████▍     | 563/1280 [00:42<00:54, 13.13it/s]
 44%|████▍     | 565/1280 [00:42<00:52, 13.62it/s]
 44%|████▍     | 567/1280 [00:42<00:55, 12.95it/s]
 44%|████▍     | 569/1280 [00:42<00:53, 13.33it/s]
 45%|████▍     | 571/1280 [00:42<00:51, 13.89it/s]
 45%|████▍     | 573/1280 [00:43<00:52, 13.36it/s]
 45%|████▍     | 575/1280 [00:43<00:54, 12.91it/s]
 45%|████▌     | 577/1280 [00:43<00:51, 13.55it/s]
 45%|████▌     | 579/1280 [00:43<00:49, 14.04it/s]
 45%|████▌     | 581/1280 [00:43<00:51, 13.70it/s]
 46%|████▌     | 583/1280 [00:43<00:52, 13.26it/s]
 46%|████▌     | 585/1280 [00:43<00:53, 13.11it/s]
 46%|████▌     | 587/1280 [00:44<00:51, 13.55it/s]
 46%|████▌     | 589/1280 [00:44<00:50, 13.74it/s]
 46%|████▌     | 591/1280 [00:44<00:49, 13.96it/s]
 46%|████▋     | 593/1280 [00:44<00:48, 14.28it/s]
 46%|████▋     | 595/1280 [00:44<00:49, 13.90it/s]
 47%|████▋     | 597/1280 [00:44<00:47, 14.34it/s]
 47%|████▋     | 599/1280 [00:44<00:47, 14.34it/s]
 47%|████▋     | 601/1280 [00:45<00:46, 14.55it/s]
 47%|████▋     | 603/1280 [00:45<00:46, 14.55it/s]
 47%|████▋     | 605/1280 [00:45<00:45, 14.70it/s]
 47%|████▋     | 607/1280 [00:45<00:48, 13.97it/s]
 48%|████▊     | 609/1280 [00:45<00:47, 14.07it/s]
 48%|████▊     | 611/1280 [00:45<00:50, 13.35it/s]
 48%|████▊     | 613/1280 [00:45<00:50, 13.28it/s]
 48%|████▊     | 615/1280 [00:46<00:48, 13.76it/s]
 48%|████▊     | 617/1280 [00:46<00:49, 13.31it/s]
 48%|████▊     | 619/1280 [00:46<00:52, 12.66it/s]
 49%|████▊     | 621/1280 [00:46<00:53, 12.22it/s]
 49%|████▊     | 623/1280 [00:46<00:54, 12.09it/s]
 49%|████▉     | 625/1280 [00:46<00:51, 12.77it/s]
 49%|████▉     | 627/1280 [00:47<00:48, 13.36it/s]
 49%|████▉     | 629/1280 [00:47<00:47, 13.73it/s]
 49%|████▉     | 631/1280 [00:47<00:47, 13.62it/s]
 49%|████▉     | 633/1280 [00:47<00:45, 14.23it/s]
 50%|████▉     | 635/1280 [00:47<00:47, 13.64it/s]
 50%|████▉     | 637/1280 [00:47<00:46, 13.84it/s]
 50%|████▉     | 639/1280 [00:47<00:47, 13.37it/s]
 50%|█████     | 641/1280 [00:48<00:47, 13.57it/s]
 50%|█████     | 643/1280 [00:48<00:45, 13.94it/s]
 50%|█████     | 645/1280 [00:48<00:45, 13.93it/s]
 51%|█████     | 647/1280 [00:48<00:45, 14.05it/s]
 51%|█████     | 649/1280 [00:48<00:45, 13.75it/s]
 51%|█████     | 651/1280 [00:48<00:45, 13.78it/s]
 51%|█████     | 653/1280 [00:48<00:45, 13.75it/s]
 51%|█████     | 655/1280 [00:49<00:48, 12.96it/s]
 51%|█████▏    | 657/1280 [00:49<00:46, 13.39it/s]
 51%|█████▏    | 659/1280 [00:49<00:59, 10.40it/s]
 52%|█████▏    | 661/1280 [00:49<00:54, 11.31it/s]
 52%|█████▏    | 663/1280 [00:49<00:50, 12.15it/s]
 52%|█████▏    | 665/1280 [00:49<00:48, 12.66it/s]
 52%|█████▏    | 667/1280 [00:50<00:47, 12.87it/s]
 52%|█████▏    | 669/1280 [00:50<00:45, 13.51it/s]
 52%|█████▏    | 671/1280 [00:50<00:45, 13.30it/s]
 53%|█████▎    | 673/1280 [00:50<00:44, 13.66it/s]
 53%|█████▎    | 675/1280 [00:50<00:42, 14.17it/s]
 53%|█████▎    | 677/1280 [00:50<00:41, 14.55it/s]
 53%|█████▎    | 679/1280 [00:50<00:42, 13.98it/s]
 53%|█████▎    | 681/1280 [00:51<00:44, 13.57it/s]
 53%|█████▎    | 683/1280 [00:51<00:45, 13.16it/s]
 54%|█████▎    | 685/1280 [00:51<00:43, 13.80it/s]
 54%|█████▎    | 687/1280 [00:51<00:42, 14.09it/s]
 54%|█████▍    | 689/1280 [00:51<00:41, 14.30it/s]
 54%|█████▍    | 691/1280 [00:51<00:43, 13.66it/s]
 54%|█████▍    | 693/1280 [00:51<00:41, 14.12it/s]
 54%|█████▍    | 695/1280 [00:52<00:40, 14.39it/s]
 54%|█████▍    | 697/1280 [00:52<00:41, 14.00it/s]
 55%|█████▍    | 699/1280 [00:52<00:41, 14.01it/s]
 55%|█████▍    | 701/1280 [00:52<00:42, 13.76it/s]
 55%|█████▍    | 703/1280 [00:52<00:41, 13.84it/s]
 55%|█████▌    | 705/1280 [00:52<00:44, 12.89it/s]
 55%|█████▌    | 707/1280 [00:53<00:49, 11.66it/s]
 55%|█████▌    | 709/1280 [00:53<00:49, 11.53it/s]
 56%|█████▌    | 711/1280 [00:53<00:45, 12.41it/s]
 56%|█████▌    | 713/1280 [00:53<00:43, 13.18it/s]
 56%|█████▌    | 715/1280 [00:53<00:43, 13.07it/s]
 56%|█████▌    | 717/1280 [00:53<00:41, 13.52it/s]
 56%|█████▌    | 719/1280 [00:53<00:39, 14.09it/s]
 56%|█████▋    | 721/1280 [00:54<00:39, 14.08it/s]
 56%|█████▋    | 723/1280 [00:54<00:38, 14.30it/s]
 57%|█████▋    | 725/1280 [00:54<00:40, 13.81it/s]
 57%|█████▋    | 727/1280 [00:54<00:43, 12.79it/s]
 57%|█████▋    | 729/1280 [00:54<00:40, 13.52it/s]
 57%|█████▋    | 731/1280 [00:54<00:40, 13.48it/s]
 57%|█████▋    | 733/1280 [00:54<00:39, 13.75it/s]
 57%|█████▋    | 735/1280 [00:55<00:38, 13.98it/s]
 58%|█████▊    | 737/1280 [00:55<00:38, 14.14it/s]
 58%|█████▊    | 739/1280 [00:55<00:38, 14.14it/s]
 58%|█████▊    | 741/1280 [00:55<00:37, 14.21it/s]
 58%|█████▊    | 743/1280 [00:55<00:37, 14.26it/s]
 58%|█████▊    | 745/1280 [00:55<00:37, 14.41it/s]
 58%|█████▊    | 747/1280 [00:55<00:39, 13.50it/s]
 59%|█████▊    | 749/1280 [00:56<00:38, 13.89it/s]
 59%|█████▊    | 751/1280 [00:56<00:38, 13.58it/s]
 59%|█████▉    | 753/1280 [00:56<00:37, 14.00it/s]
 59%|█████▉    | 755/1280 [00:56<00:38, 13.68it/s]
 59%|█████▉    | 757/1280 [00:56<00:39, 13.40it/s]
 59%|█████▉    | 759/1280 [00:56<00:39, 13.28it/s]
 59%|█████▉    | 761/1280 [00:56<00:37, 13.68it/s]
 60%|█████▉    | 763/1280 [00:57<00:41, 12.40it/s]
 60%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰    | 765/1280 [00:57<00:39, 13.07it/s]
 60%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰    | 767/1280 [00:57<00:38, 13.46it/s]
 60%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ    | 769/1280 [00:57<00:37, 13.62it/s]
 60%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ    | 771/1280 [00:57<00:38, 13.30it/s]
 60%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ    | 773/1280 [00:57<00:41, 12.28it/s]
 61%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ    | 775/1280 [00:58<00:38, 13.00it/s]
 61%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ    | 777/1280 [00:58<00:41, 12.15it/s]
 61%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ    | 779/1280 [00:58<00:38, 12.88it/s]
 61%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ    | 781/1280 [00:58<00:39, 12.68it/s]
 61%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ    | 783/1280 [00:58<00:38, 12.98it/s]
 61%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–   | 785/1280 [00:58<00:36, 13.55it/s]
 61%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–   | 787/1280 [00:58<00:35, 13.89it/s]
 62%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–   | 789/1280 [00:59<00:37, 13.19it/s]
 62%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–   | 791/1280 [00:59<00:38, 12.55it/s]
 62%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–   | 793/1280 [00:59<00:37, 12.95it/s]
 62%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–   | 795/1280 [00:59<00:36, 13.22it/s]
 62%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–   | 797/1280 [00:59<00:35, 13.63it/s]
 62%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–   | 799/1280 [00:59<00:35, 13.50it/s]
 63%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž   | 801/1280 [01:00<00:34, 13.72it/s]
 63%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž   | 803/1280 [01:00<00:35, 13.44it/s]
 63%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž   | 805/1280 [01:00<00:34, 13.85it/s]
 63%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž   | 807/1280 [01:00<00:35, 13.50it/s]
 63%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž   | 809/1280 [01:00<00:34, 13.67it/s]
 63%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž   | 811/1280 [01:00<00:33, 14.07it/s]
 64%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž   | 813/1280 [01:00<00:33, 13.86it/s]
 64%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž   | 815/1280 [01:01<00:33, 13.70it/s]
 64%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–   | 817/1280 [01:01<00:32, 14.14it/s]
 64%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–   | 819/1280 [01:01<00:33, 13.64it/s]
 64%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–   | 821/1280 [01:01<00:32, 13.91it/s]
 64%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–   | 823/1280 [01:01<00:32, 14.22it/s]
 64%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–   | 825/1280 [01:01<00:31, 14.44it/s]
 65%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–   | 827/1280 [01:01<00:31, 14.49it/s]
 65%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–   | 829/1280 [01:02<00:31, 14.34it/s]
 65%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–   | 831/1280 [01:02<00:32, 13.83it/s]
 65%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ   | 833/1280 [01:02<00:32, 13.84it/s]
 65%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ   | 835/1280 [01:02<00:38, 11.60it/s]
 65%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ   | 837/1280 [01:02<00:40, 10.99it/s]
 66%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ   | 839/1280 [01:02<00:40, 10.84it/s]
 66%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ   | 841/1280 [01:03<00:39, 11.24it/s]
 66%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ   | 843/1280 [01:03<00:37, 11.60it/s]
 66%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ   | 845/1280 [01:03<00:36, 11.99it/s]
 66%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ   | 847/1280 [01:03<00:34, 12.54it/s]
 66%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹   | 849/1280 [01:03<00:33, 13.00it/s]
 66%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹   | 851/1280 [01:03<00:32, 13.07it/s]
 67%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹   | 853/1280 [01:04<00:34, 12.26it/s]
 67%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹   | 855/1280 [01:04<00:32, 13.04it/s]
 67%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹   | 857/1280 [01:04<00:31, 13.63it/s]
 67%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹   | 859/1280 [01:04<00:31, 13.38it/s]
 67%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹   | 861/1280 [01:04<00:30, 13.87it/s]
 67%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹   | 863/1280 [01:04<00:32, 12.77it/s]
 68%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š   | 865/1280 [01:04<00:30, 13.57it/s]
 68%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š   | 867/1280 [01:05<00:30, 13.38it/s]
 68%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š   | 869/1280 [01:05<00:30, 13.29it/s]
 68%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š   | 871/1280 [01:05<00:30, 13.24it/s]
 68%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š   | 873/1280 [01:05<00:31, 13.09it/s]
 68%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š   | 875/1280 [01:05<00:30, 13.34it/s]
 69%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š   | 877/1280 [01:05<00:29, 13.64it/s]
 69%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š   | 879/1280 [01:05<00:28, 13.98it/s]
 69%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰   | 881/1280 [01:06<00:28, 13.99it/s]
 69%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰   | 883/1280 [01:06<00:29, 13.47it/s]
 69%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰   | 885/1280 [01:06<00:28, 13.91it/s]
 69%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰   | 887/1280 [01:06<00:28, 13.58it/s]
 69%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰   | 889/1280 [01:06<00:29, 13.44it/s]
 70%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰   | 891/1280 [01:06<00:30, 12.95it/s]
 70%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰   | 893/1280 [01:07<00:31, 12.43it/s]
 70%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰   | 895/1280 [01:07<00:31, 12.41it/s]
 70%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ   | 897/1280 [01:07<00:30, 12.37it/s]
 70%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ   | 899/1280 [01:07<00:29, 12.87it/s]
 70%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ   | 901/1280 [01:07<00:29, 12.96it/s]
 71%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ   | 903/1280 [01:07<00:33, 11.24it/s]
 71%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ   | 905/1280 [01:08<00:32, 11.66it/s]
 71%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ   | 907/1280 [01:08<00:30, 12.14it/s]
 71%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ   | 909/1280 [01:08<00:30, 12.32it/s]
 71%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ   | 911/1280 [01:08<00:28, 12.89it/s]
 71%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–  | 913/1280 [01:08<00:30, 12.01it/s]
 71%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–  | 915/1280 [01:08<00:29, 12.55it/s]
 72%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–  | 917/1280 [01:09<00:30, 11.73it/s]
 72%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–  | 919/1280 [01:09<00:28, 12.60it/s]
 72%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–  | 921/1280 [01:09<00:28, 12.52it/s]
 72%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–  | 923/1280 [01:09<00:27, 12.94it/s]
 72%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–  | 925/1280 [01:09<00:27, 12.99it/s]
 72%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–  | 927/1280 [01:09<00:26, 13.19it/s]
 73%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž  | 929/1280 [01:09<00:26, 13.39it/s]
 73%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž  | 931/1280 [01:10<00:26, 13.14it/s]
 73%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž  | 933/1280 [01:10<00:25, 13.45it/s]
 73%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž  | 935/1280 [01:10<00:24, 13.97it/s]
 73%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž  | 937/1280 [01:10<00:24, 13.80it/s]
 73%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž  | 939/1280 [01:10<00:25, 13.60it/s]
 74%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž  | 941/1280 [01:10<00:25, 13.11it/s]
 74%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž  | 943/1280 [01:10<00:25, 13.29it/s]
 74%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–  | 945/1280 [01:11<00:24, 13.57it/s]
 74%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–  | 947/1280 [01:11<00:23, 14.10it/s]
 74%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–  | 949/1280 [01:11<00:23, 13.90it/s]
 74%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–  | 951/1280 [01:11<00:22, 14.36it/s]
 74%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–  | 953/1280 [01:11<00:23, 14.07it/s]
 75%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–  | 955/1280 [01:11<00:23, 14.08it/s]
 75%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–  | 957/1280 [01:11<00:22, 14.17it/s]
 75%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–  | 959/1280 [01:12<00:23, 13.57it/s]
 75%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ  | 961/1280 [01:12<00:23, 13.72it/s]
 75%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ  | 963/1280 [01:12<00:22, 13.81it/s]
 75%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ  | 965/1280 [01:12<00:22, 14.17it/s]
 76%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ  | 967/1280 [01:12<00:22, 13.93it/s]
 76%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ  | 969/1280 [01:12<00:23, 13.51it/s]
 76%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ  | 971/1280 [01:12<00:22, 13.93it/s]
 76%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ  | 973/1280 [01:13<00:24, 12.42it/s]
 76%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ  | 975/1280 [01:13<00:23, 13.06it/s]
 76%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹  | 977/1280 [01:13<00:23, 13.06it/s]
 76%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹  | 979/1280 [01:13<00:23, 12.62it/s]
 77%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹  | 981/1280 [01:13<00:22, 13.14it/s]
 77%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹  | 983/1280 [01:13<00:22, 13.48it/s]
 77%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹  | 985/1280 [01:14<00:21, 13.93it/s]
 77%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹  | 987/1280 [01:14<00:21, 13.69it/s]
 77%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹  | 989/1280 [01:14<00:20, 14.13it/s]
 77%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹  | 991/1280 [01:14<00:20, 13.91it/s]
 78%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š  | 993/1280 [01:14<00:20, 13.70it/s]
 78%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š  | 995/1280 [01:14<00:20, 13.99it/s]
 78%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š  | 997/1280 [01:14<00:20, 14.07it/s]
 78%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š  | 999/1280 [01:15<00:19, 14.42it/s]
                                                  
{'loss': 0.4383, 'learning_rate': 1.1015625e-05, 'epoch': 0.78}

[INFO|trainer.py:1761] 2022-07-22 12:33:45,822 >> 

Training completed. Do not forget to share your model on huggingface.co/models =)

{'train_runtime': 95.9834, 'train_samples_per_second': 426.48, 'train_steps_per_second': 13.336, 'train_loss': 0.4583479344844818, 'epoch': 1.0}

100%|██████████| 1280/1280 [01:35<00:00, 13.34it/s]
[INFO|trainer.py:2503] 2022-07-22 12:33:45,829 >> Saving model checkpoint to runs/ebmnlp_hf/BioLinkBERT-base
[INFO|configuration_utils.py:446] 2022-07-22 12:33:45,831 >> Configuration saved in runs/ebmnlp_hf/BioLinkBERT-base/config.json
[INFO|modeling_utils.py:1660] 2022-07-22 12:33:46,435 >> Model weights saved in runs/ebmnlp_hf/BioLinkBERT-base/pytorch_model.bin
[INFO|tokenization_utils_base.py:2123] 2022-07-22 12:33:46,436 >> tokenizer config file saved in runs/ebmnlp_hf/BioLinkBERT-base/tokenizer_config.json
[INFO|tokenization_utils_base.py:2130] 2022-07-22 12:33:46,436 >> Special tokens file saved in runs/ebmnlp_hf/BioLinkBERT-base/special_tokens_map.json
***** train metrics *****
  epoch                    =        1.0
  train_loss               =     0.4583
  train_runtime            = 0:01:35.98
  train_samples            =      40935
  train_samples_per_second =     426.48
  train_steps_per_second   =     13.336
07/22/2022 12:33:46 - INFO - __main__ - *** Evaluate ***
[INFO|trainer.py:661] 2022-07-22 12:33:46,477 >> The following columns in the evaluation set don't have a corresponding argument in `BertForTokenClassification.forward` and have been ignored: id, ner_tags, word_ids, tokens. If id, ner_tags, word_ids, tokens are not expected by `BertForTokenClassification.forward`,  you can safely ignore this message.
[INFO|trainer.py:2753] 2022-07-22 12:33:46,479 >> ***** Running Evaluation *****
[INFO|trainer.py:2755] 2022-07-22 12:33:46,479 >>   Num examples = 10386
[INFO|trainer.py:2758] 2022-07-22 12:33:46,479 >>   Batch size = 8

100%|█████████▉| 1297/1299 [00:20<00:00, 63.56it/s]
07/22/2022 12:34:13 - INFO - datasets.metric - Removing /home/hungthinht/.cache/huggingface/metrics/seqeval/default/default_experiment-1-0.arrow
type_name INT
type_name OUT
type_name PAR

100%|██████████| 1299/1299 [00:26<00:00, 48.70it/s]
***** eval metrics *****
  epoch                   =        1.0
  eval_loss               =     0.4264
  eval_macro_f1           =     0.7191
  eval_macro_precision    =     0.7641
  eval_macro_recall       =     0.6793
  eval_runtime            = 0:00:26.69
  eval_samples            =      10386
  eval_samples_per_second =    389.133
  eval_steps_per_second   =      48.67
07/22/2022 12:34:13 - INFO - __main__ - *** Predict ***
[INFO|trainer.py:661] 2022-07-22 12:34:13,176 >> The following columns in the test set don't have a corresponding argument in `BertForTokenClassification.forward` and have been ignored: id, ner_tags, word_ids, tokens. If id, ner_tags, word_ids, tokens are not expected by `BertForTokenClassification.forward`,  you can safely ignore this message.
[INFO|trainer.py:2753] 2022-07-22 12:34:13,177 >> ***** Running Prediction *****
[INFO|trainer.py:2755] 2022-07-22 12:34:13,178 >>   Num examples = 2076
[INFO|trainer.py:2758] 2022-07-22 12:34:13,178 >>   Batch size = 8

 98%|█████████▊| 254/260 [00:03<00:00, 63.90it/s]
07/22/2022 12:34:18 - INFO - datasets.metric - Removing /home/hungthinht/.cache/huggingface/metrics/seqeval/default/default_experiment-1-0.arrow
type_name INT
type_name OUT
type_name PAR
***** test metrics *****
  test_loss               =     0.4043
  test_macro_f1           =     0.7406
  test_macro_precision    =     0.7578
  test_macro_recall       =     0.7467
  test_runtime            = 0:00:05.17
  test_samples            =       2076
  test_samples_per_second =    401.101
  test_steps_per_second   =     50.234

100%|██████████| 260/260 [00:07<00:00, 36.10it/s]