Rodrigo1771 committed on
Commit
0f6759c
•
1 Parent(s): 10a928e

End of training

README.md ADDED
@@ -0,0 +1,63 @@
+ ---
+ license: apache-2.0
+ base_model: PlanTL-GOB-ES/bsc-bio-ehr-es
+ tags:
+ - token-classification
+ - generated_from_trainer
+ datasets:
+ - Rodrigo1771/multi-train-drugtemist-dev-ner
+ model-index:
+ - name: output
+   results: []
+ ---
+
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
+ should probably proofread and complete it, then remove this comment. -->
+
+ # output
+
+ This model is a fine-tuned version of [PlanTL-GOB-ES/bsc-bio-ehr-es](https://huggingface.co/PlanTL-GOB-ES/bsc-bio-ehr-es) on the Rodrigo1771/multi-train-drugtemist-dev-ner dataset.
+ It achieves the following results on the evaluation set:
+ - eval_loss: 2.4031
+ - eval_precision: 0.0004
+ - eval_recall: 0.0386
+ - eval_f1: 0.0007
+ - eval_accuracy: 0.0028
+ - eval_runtime: 16.7962
+ - eval_samples_per_second: 405.27
+ - eval_steps_per_second: 50.666
+ - step: 0
+
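These figures are internally consistent: eval_f1 is the harmonic mean of eval_precision and eval_recall, and the near-zero scores fit an evaluation run at step 0 with a freshly initialized classification head (see do_train=False and the "newly initialized: ['classifier.bias', 'classifier.weight']" warning in train.log below). A quick sketch of the F1 check, using the full-precision values from all_results.json:

```python
# Sanity check (not part of the repo): F1 is the harmonic mean of precision and recall.
precision = 0.00035028898841544273
recall = 0.03860294117647059
f1 = 2 * precision * recall / (precision + recall)
print(f1)  # ~0.000694, matching eval_f1
```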
+ ## Model description
+
+ More information needed
+
+ ## Intended uses & limitations
+
+ More information needed
+
+ ## Training and evaluation data
+
+ More information needed
+
+ ## Training procedure
+
+ ### Training hyperparameters
+
+ The following hyperparameters were used during training:
+ - learning_rate: 5e-05
+ - train_batch_size: 4
+ - eval_batch_size: 8
+ - seed: 42
+ - gradient_accumulation_steps: 4
+ - total_train_batch_size: 16
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+ - lr_scheduler_type: linear
+ - num_epochs: 10.0
+
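For anyone reproducing this setup, the list above corresponds roughly to the following `TrainingArguments`. This is a sketch only: the exact launch command is not part of the commit, and `output_dir` is a placeholder.

```python
from transformers import TrainingArguments

# Sketch of the hyperparameters listed above; "output" is a placeholder directory.
args = TrainingArguments(
    output_dir="output",
    learning_rate=5e-05,
    per_device_train_batch_size=4,   # train_batch_size
    per_device_eval_batch_size=8,    # eval_batch_size
    gradient_accumulation_steps=4,   # total_train_batch_size = 4 * 4 = 16
    seed=42,
    lr_scheduler_type="linear",
    num_train_epochs=10.0,
    optim="adamw_torch",             # Adam with betas=(0.9, 0.999) and epsilon=1e-08
)
```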
+ ### Framework versions
+
+ - Transformers 4.40.2
+ - Pytorch 2.2.1+cu121
+ - Datasets 2.19.1
+ - Tokenizers 0.19.1
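Once the files added in this commit are available locally (or the repository is referenced by its Hub id), inference is a standard token-classification pipeline. A minimal sketch, assuming a placeholder `./output` directory and an invented Spanish sentence:

```python
from transformers import pipeline

# "./output" is a placeholder for wherever config.json, model.safetensors and the
# tokenizer files from this commit are stored (or the model's Hub id).
ner = pipeline(
    "token-classification",
    model="./output",
    aggregation_strategy="simple",  # merge B-/I- sub-tokens into whole entities
)

print(ner("El paciente recibió paracetamol por fiebre y cefalea."))
```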
all_results.json ADDED
@@ -0,0 +1,19 @@
+ {
+ "eval_accuracy": 0.002835545241707918,
+ "eval_f1": 0.0006942779922141681,
+ "eval_loss": 2.4030845165252686,
+ "eval_precision": 0.00035028898841544273,
+ "eval_recall": 0.03860294117647059,
+ "eval_runtime": 16.7962,
+ "eval_samples": 6807,
+ "eval_samples_per_second": 405.27,
+ "eval_steps_per_second": 50.666,
+ "predict_accuracy": 0.002835545241707918,
+ "predict_f1": 0.0006942779922141681,
+ "predict_loss": 2.4030845165252686,
+ "predict_precision": 0.00035028898841544273,
+ "predict_recall": 0.03860294117647059,
+ "predict_runtime": 16.0715,
+ "predict_samples_per_second": 423.545,
+ "predict_steps_per_second": 52.951
+ }
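The throughput fields above line up with the 6807 evaluation examples and the batch size of 8 recorded in train.log; a small consistency check:

```python
import math

eval_samples, eval_runtime, batch_size = 6807, 16.7962, 8
print(eval_samples / eval_runtime)                          # ~405.27, i.e. eval_samples_per_second
print(math.ceil(eval_samples / batch_size))                 # 851 batches, the total in the progress bars
print(math.ceil(eval_samples / batch_size) / eval_runtime)  # ~50.67, i.e. eval_steps_per_second
```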
config.json ADDED
@@ -0,0 +1,51 @@
+ {
+ "_name_or_path": "PlanTL-GOB-ES/bsc-bio-ehr-es",
+ "architectures": [
+ "RobertaForTokenClassification"
+ ],
+ "attention_probs_dropout_prob": 0.1,
+ "bos_token_id": 0,
+ "classifier_dropout": null,
+ "eos_token_id": 2,
+ "finetuning_task": "ner",
+ "gradient_checkpointing": false,
+ "hidden_act": "gelu",
+ "hidden_dropout_prob": 0.1,
+ "hidden_size": 768,
+ "id2label": {
+ "0": "O",
+ "1": "B-ENFERMEDAD",
+ "2": "I-ENFERMEDAD",
+ "3": "B-PROCEDIMIENTO",
+ "4": "I-PROCEDIMIENTO",
+ "5": "B-SINTOMA",
+ "6": "I-SINTOMA",
+ "7": "B-FARMACO",
+ "8": "I-FARMACO"
+ },
+ "initializer_range": 0.02,
+ "intermediate_size": 3072,
+ "label2id": {
+ "B-ENFERMEDAD": 1,
+ "B-FARMACO": 7,
+ "B-PROCEDIMIENTO": 3,
+ "B-SINTOMA": 5,
+ "I-ENFERMEDAD": 2,
+ "I-FARMACO": 8,
+ "I-PROCEDIMIENTO": 4,
+ "I-SINTOMA": 6,
+ "O": 0
+ },
+ "layer_norm_eps": 1e-05,
+ "max_position_embeddings": 514,
+ "model_type": "roberta",
+ "num_attention_heads": 12,
+ "num_hidden_layers": 12,
+ "pad_token_id": 1,
+ "position_embedding_type": "absolute",
+ "torch_dtype": "float32",
+ "transformers_version": "4.40.2",
+ "type_vocab_size": 1,
+ "use_cache": true,
+ "vocab_size": 50262
+ }
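The `id2label` map above is what converts the model's 9-way per-token logits into BIO tags for the ENFERMEDAD, PROCEDIMIENTO, SINTOMA and FARMACO entity types. A sketch of that step, again with a placeholder `./output` path and an invented sentence:

```python
import torch
from transformers import AutoTokenizer, AutoModelForTokenClassification

model_dir = "./output"  # placeholder for the files added in this commit
tokenizer = AutoTokenizer.from_pretrained(model_dir)
model = AutoModelForTokenClassification.from_pretrained(model_dir)

enc = tokenizer("Se administró ibuprofeno por la cefalea.", return_tensors="pt")
with torch.no_grad():
    logits = model(**enc).logits              # shape: (1, seq_len, 9)
pred_ids = logits.argmax(dim=-1)[0].tolist()
tags = [model.config.id2label[i] for i in pred_ids]
print(list(zip(tokenizer.convert_ids_to_tokens(enc["input_ids"][0]), tags)))
```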
eval_results.json ADDED
@@ -0,0 +1,11 @@
+ {
+ "eval_accuracy": 0.002835545241707918,
+ "eval_f1": 0.0006942779922141681,
+ "eval_loss": 2.4030845165252686,
+ "eval_precision": 0.00035028898841544273,
+ "eval_recall": 0.03860294117647059,
+ "eval_runtime": 16.7962,
+ "eval_samples": 6807,
+ "eval_samples_per_second": 405.27,
+ "eval_steps_per_second": 50.666
+ }
merges.txt ADDED
The diff for this file is too large to render. See raw diff
 
model.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:6cb8f27d656928c021399304ae37e82d147c80df5386c804b66e664424a60fee
+ size 496262556
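This entry is a Git LFS pointer rather than the weights themselves. The 496,262,556-byte object it points to is a float32 checkpoint (see `torch_dtype` in config.json), which at 4 bytes per parameter works out to roughly 124M parameters, plausible for a RoBERTa-base encoder plus a small token-classification head. A sketch of checking that against a locally downloaded copy:

```python
from safetensors import safe_open

# "model.safetensors" is a placeholder path to the resolved LFS object.
total_params = 0
with safe_open("model.safetensors", framework="pt") as f:
    for name in f.keys():
        total_params += f.get_tensor(name).numel()

print(total_params)      # parameter count
print(total_params * 4)  # float32 bytes; close to the 496,262,556-byte size above
```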
predict_results.json ADDED
@@ -0,0 +1,10 @@
+ {
+ "predict_accuracy": 0.002835545241707918,
+ "predict_f1": 0.0006942779922141681,
+ "predict_loss": 2.4030845165252686,
+ "predict_precision": 0.00035028898841544273,
+ "predict_recall": 0.03860294117647059,
+ "predict_runtime": 16.0715,
+ "predict_samples_per_second": 423.545,
+ "predict_steps_per_second": 52.951
+ }
predictions.txt ADDED
The diff for this file is too large to render. See raw diff
 
special_tokens_map.json ADDED
@@ -0,0 +1,51 @@
+ {
+ "bos_token": {
+ "content": "<s>",
+ "lstrip": false,
+ "normalized": true,
+ "rstrip": false,
+ "single_word": false
+ },
+ "cls_token": {
+ "content": "<s>",
+ "lstrip": false,
+ "normalized": true,
+ "rstrip": false,
+ "single_word": false
+ },
+ "eos_token": {
+ "content": "</s>",
+ "lstrip": false,
+ "normalized": true,
+ "rstrip": false,
+ "single_word": false
+ },
+ "mask_token": {
+ "content": "<mask>",
+ "lstrip": true,
+ "normalized": true,
+ "rstrip": false,
+ "single_word": false
+ },
+ "pad_token": {
+ "content": "<pad>",
+ "lstrip": false,
+ "normalized": true,
+ "rstrip": false,
+ "single_word": false
+ },
+ "sep_token": {
+ "content": "</s>",
+ "lstrip": false,
+ "normalized": true,
+ "rstrip": false,
+ "single_word": false
+ },
+ "unk_token": {
+ "content": "<unk>",
+ "lstrip": false,
+ "normalized": true,
+ "rstrip": false,
+ "single_word": false
+ }
+ }
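These are the standard RoBERTa special tokens; `<s>` doubles as bos/cls and `</s>` as eos/sep. A small sketch (placeholder `./output` path) showing where they end up in a tokenized sequence:

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("./output")  # placeholder path to this commit's files
ids = tokenizer("paracetamol")["input_ids"]
print(tokenizer.convert_ids_to_tokens(ids))  # ['<s>', ..., '</s>'], bos/cls first and eos/sep last
```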
tb/events.out.tfevents.1715608825.c331905616cf.2224.0 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:d6b43bba93fcdae059a879a573e39c04184713f463eaf8c074aa07d771faa37e
+ size 486
tokenizer.json ADDED
The diff for this file is too large to render. See raw diff
 
tokenizer_config.json ADDED
@@ -0,0 +1,58 @@
+ {
+ "add_prefix_space": true,
+ "added_tokens_decoder": {
+ "0": {
+ "content": "<s>",
+ "lstrip": false,
+ "normalized": true,
+ "rstrip": false,
+ "single_word": false,
+ "special": true
+ },
+ "1": {
+ "content": "<pad>",
+ "lstrip": false,
+ "normalized": true,
+ "rstrip": false,
+ "single_word": false,
+ "special": true
+ },
+ "2": {
+ "content": "</s>",
+ "lstrip": false,
+ "normalized": true,
+ "rstrip": false,
+ "single_word": false,
+ "special": true
+ },
+ "3": {
+ "content": "<unk>",
+ "lstrip": false,
+ "normalized": true,
+ "rstrip": false,
+ "single_word": false,
+ "special": true
+ },
+ "50261": {
+ "content": "<mask>",
+ "lstrip": true,
+ "normalized": true,
+ "rstrip": false,
+ "single_word": false,
+ "special": true
+ }
+ },
+ "bos_token": "<s>",
+ "clean_up_tokenization_spaces": true,
+ "cls_token": "<s>",
+ "eos_token": "</s>",
+ "errors": "replace",
+ "mask_token": "<mask>",
+ "max_len": 512,
+ "model_max_length": 512,
+ "pad_token": "<pad>",
+ "sep_token": "</s>",
+ "tokenizer_class": "RobertaTokenizer",
+ "trim_offsets": true,
+ "unk_token": "<unk>"
+ }
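`add_prefix_space: true` lets this byte-level BPE tokenizer accept pre-split words, which is how NER examples are normally fed in, and the fast tokenizer built from tokenizer.json exposes `word_ids()` for aligning sub-tokens with word-level labels. A sketch with a placeholder path and an invented token list:

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("./output")  # placeholder path; reads this config
enc = tokenizer(["Se", "administró", "ibuprofeno", "."], is_split_into_words=True)
print(tokenizer.convert_ids_to_tokens(enc["input_ids"]))
print(enc.word_ids())  # e.g. [None, 0, 1, 1, 2, 2, 3, None]; None marks the special tokens
```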
train.log ADDED
@@ -0,0 +1,357 @@
1
+ 2024-05-13 13:59:44.191151: E external/local_xla/xla/stream_executor/cuda/cuda_dnn.cc:9261] Unable to register cuDNN factory: Attempting to register factory for plugin cuDNN when one has already been registered
2
+ 2024-05-13 13:59:44.191275: E external/local_xla/xla/stream_executor/cuda/cuda_fft.cc:607] Unable to register cuFFT factory: Attempting to register factory for plugin cuFFT when one has already been registered
3
+ 2024-05-13 13:59:44.193141: E external/local_xla/xla/stream_executor/cuda/cuda_blas.cc:1515] Unable to register cuBLAS factory: Attempting to register factory for plugin cuBLAS when one has already been registered
4
+ 2024-05-13 13:59:45.319933: W tensorflow/compiler/tf2tensorrt/utils/py_utils.cc:38] TF-TRT Warning: Could not find TensorRT
5
+ 05/13/2024 13:59:47 - WARNING - __main__ - Process rank: 0, device: cuda:0, n_gpu: 1distributed training: True, 16-bits training: False
6
+ 05/13/2024 13:59:47 - INFO - __main__ - Training/evaluation parameters TrainingArguments(
7
+ _n_gpu=1,
8
+ accelerator_config={'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'gradient_accumulation_kwargs': None},
9
+ adafactor=False,
10
+ adam_beta1=0.9,
11
+ adam_beta2=0.999,
12
+ adam_epsilon=1e-08,
13
+ auto_find_batch_size=False,
14
+ bf16=False,
15
+ bf16_full_eval=False,
16
+ data_seed=None,
17
+ dataloader_drop_last=False,
18
+ dataloader_num_workers=0,
19
+ dataloader_persistent_workers=False,
20
+ dataloader_pin_memory=True,
21
+ dataloader_prefetch_factor=None,
22
+ ddp_backend=None,
23
+ ddp_broadcast_buffers=None,
24
+ ddp_bucket_cap_mb=None,
25
+ ddp_find_unused_parameters=None,
26
+ ddp_timeout=1800,
27
+ debug=[],
28
+ deepspeed=None,
29
+ disable_tqdm=False,
30
+ dispatch_batches=None,
31
+ do_eval=True,
32
+ do_predict=True,
33
+ do_train=False,
34
+ eval_accumulation_steps=None,
35
+ eval_delay=0,
36
+ eval_do_concat_batches=True,
37
+ eval_steps=None,
38
+ evaluation_strategy=epoch,
39
+ fp16=False,
40
+ fp16_backend=auto,
41
+ fp16_full_eval=False,
42
+ fp16_opt_level=O1,
43
+ fsdp=[],
44
+ fsdp_config={'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False},
45
+ fsdp_min_num_params=0,
46
+ fsdp_transformer_layer_cls_to_wrap=None,
47
+ full_determinism=False,
48
+ gradient_accumulation_steps=4,
49
+ gradient_checkpointing=False,
50
+ gradient_checkpointing_kwargs=None,
51
+ greater_is_better=True,
52
+ group_by_length=False,
53
+ half_precision_backend=auto,
54
+ hub_always_push=False,
55
+ hub_model_id=None,
56
+ hub_private_repo=False,
57
+ hub_strategy=every_save,
58
+ hub_token=<HUB_TOKEN>,
59
+ ignore_data_skip=False,
60
+ include_inputs_for_metrics=False,
61
+ include_num_input_tokens_seen=False,
62
+ include_tokens_per_second=False,
63
+ jit_mode_eval=False,
64
+ label_names=None,
65
+ label_smoothing_factor=0.0,
66
+ learning_rate=5e-05,
67
+ length_column_name=length,
68
+ load_best_model_at_end=True,
69
+ local_rank=0,
70
+ log_level=passive,
71
+ log_level_replica=warning,
72
+ log_on_each_node=True,
73
+ logging_dir=/content/dissertation/scripts/ner/output/tb,
74
+ logging_first_step=False,
75
+ logging_nan_inf_filter=True,
76
+ logging_steps=500,
77
+ logging_strategy=steps,
78
+ lr_scheduler_kwargs={},
79
+ lr_scheduler_type=linear,
80
+ max_grad_norm=1.0,
81
+ max_steps=-1,
82
+ metric_for_best_model=f1,
83
+ mp_parameters=,
84
+ neftune_noise_alpha=None,
85
+ no_cuda=False,
86
+ num_train_epochs=10.0,
87
+ optim=adamw_torch,
88
+ optim_args=None,
89
+ optim_target_modules=None,
90
+ output_dir=/content/dissertation/scripts/ner/output,
91
+ overwrite_output_dir=True,
92
+ past_index=-1,
93
+ per_device_eval_batch_size=8,
94
+ per_device_train_batch_size=4,
95
+ prediction_loss_only=False,
96
+ push_to_hub=True,
97
+ push_to_hub_model_id=None,
98
+ push_to_hub_organization=None,
99
+ push_to_hub_token=<PUSH_TO_HUB_TOKEN>,
100
+ ray_scope=last,
101
+ remove_unused_columns=True,
102
+ report_to=['tensorboard'],
103
+ resume_from_checkpoint=None,
104
+ run_name=/content/dissertation/scripts/ner/output,
105
+ save_on_each_node=False,
106
+ save_only_model=False,
107
+ save_safetensors=True,
108
+ save_steps=500,
109
+ save_strategy=epoch,
110
+ save_total_limit=None,
111
+ seed=42,
112
+ skip_memory_metrics=True,
113
+ split_batches=None,
114
+ tf32=None,
115
+ torch_compile=False,
116
+ torch_compile_backend=None,
117
+ torch_compile_mode=None,
118
+ torchdynamo=None,
119
+ tpu_metrics_debug=False,
120
+ tpu_num_cores=None,
121
+ use_cpu=False,
122
+ use_ipex=False,
123
+ use_legacy_prediction_loop=False,
124
+ use_mps_device=False,
125
+ warmup_ratio=0.0,
126
+ warmup_steps=0,
127
+ weight_decay=0.0,
128
+ )
129
+ /usr/local/lib/python3.10/dist-packages/datasets/load.py:1486: FutureWarning: The repository for Rodrigo1771/multi-train-drugtemist-dev-ner contains custom code which must be executed to correctly load the dataset. You can inspect the repository content at https://hf.co/datasets/Rodrigo1771/multi-train-drugtemist-dev-ner
130
+ You can avoid this message in future by passing the argument `trust_remote_code=True`.
131
+ Passing `trust_remote_code=True` will be mandatory to load this dataset from the next major release of `datasets`.
132
+ warnings.warn(
133
+
134
+
135
+
136
+
137
+
138
+
139
+
140
+ /usr/local/lib/python3.10/dist-packages/huggingface_hub/file_download.py:1132: FutureWarning: `resume_download` is deprecated and will be removed in version 1.0.0. Downloads always resume when possible. If you want to force a new download, use `force_download=True`.
141
+ warnings.warn(
142
+ [INFO|configuration_utils.py:726] 2024-05-13 13:59:59,622 >> loading configuration file config.json from cache at /root/.cache/huggingface/hub/models--PlanTL-GOB-ES--bsc-bio-ehr-es/snapshots/1e543adb2d21f19d85a89305eebdbd64ab656b99/config.json
143
+ [INFO|configuration_utils.py:789] 2024-05-13 13:59:59,630 >> Model config RobertaConfig {
144
+ "_name_or_path": "PlanTL-GOB-ES/bsc-bio-ehr-es",
145
+ "architectures": [
146
+ "RobertaForMaskedLM"
147
+ ],
148
+ "attention_probs_dropout_prob": 0.1,
149
+ "bos_token_id": 0,
150
+ "classifier_dropout": null,
151
+ "eos_token_id": 2,
152
+ "finetuning_task": "ner",
153
+ "gradient_checkpointing": false,
154
+ "hidden_act": "gelu",
155
+ "hidden_dropout_prob": 0.1,
156
+ "hidden_size": 768,
157
+ "id2label": {
158
+ "0": "O",
159
+ "1": "B-ENFERMEDAD",
160
+ "2": "I-ENFERMEDAD",
161
+ "3": "B-PROCEDIMIENTO",
162
+ "4": "I-PROCEDIMIENTO",
163
+ "5": "B-SINTOMA",
164
+ "6": "I-SINTOMA",
165
+ "7": "B-FARMACO",
166
+ "8": "I-FARMACO"
167
+ },
168
+ "initializer_range": 0.02,
169
+ "intermediate_size": 3072,
170
+ "label2id": {
171
+ "B-ENFERMEDAD": 1,
172
+ "B-FARMACO": 7,
173
+ "B-PROCEDIMIENTO": 3,
174
+ "B-SINTOMA": 5,
175
+ "I-ENFERMEDAD": 2,
176
+ "I-FARMACO": 8,
177
+ "I-PROCEDIMIENTO": 4,
178
+ "I-SINTOMA": 6,
179
+ "O": 0
180
+ },
181
+ "layer_norm_eps": 1e-05,
182
+ "max_position_embeddings": 514,
183
+ "model_type": "roberta",
184
+ "num_attention_heads": 12,
185
+ "num_hidden_layers": 12,
186
+ "pad_token_id": 1,
187
+ "position_embedding_type": "absolute",
188
+ "transformers_version": "4.40.2",
189
+ "type_vocab_size": 1,
190
+ "use_cache": true,
191
+ "vocab_size": 50262
192
+ }
193
+
194
+ [INFO|configuration_utils.py:726] 2024-05-13 13:59:59,860 >> loading configuration file config.json from cache at /root/.cache/huggingface/hub/models--PlanTL-GOB-ES--bsc-bio-ehr-es/snapshots/1e543adb2d21f19d85a89305eebdbd64ab656b99/config.json
195
+ [INFO|configuration_utils.py:789] 2024-05-13 13:59:59,860 >> Model config RobertaConfig {
196
+ "_name_or_path": "PlanTL-GOB-ES/bsc-bio-ehr-es",
197
+ "architectures": [
198
+ "RobertaForMaskedLM"
199
+ ],
200
+ "attention_probs_dropout_prob": 0.1,
201
+ "bos_token_id": 0,
202
+ "classifier_dropout": null,
203
+ "eos_token_id": 2,
204
+ "gradient_checkpointing": false,
205
+ "hidden_act": "gelu",
206
+ "hidden_dropout_prob": 0.1,
207
+ "hidden_size": 768,
208
+ "initializer_range": 0.02,
209
+ "intermediate_size": 3072,
210
+ "layer_norm_eps": 1e-05,
211
+ "max_position_embeddings": 514,
212
+ "model_type": "roberta",
213
+ "num_attention_heads": 12,
214
+ "num_hidden_layers": 12,
215
+ "pad_token_id": 1,
216
+ "position_embedding_type": "absolute",
217
+ "transformers_version": "4.40.2",
218
+ "type_vocab_size": 1,
219
+ "use_cache": true,
220
+ "vocab_size": 50262
221
+ }
222
+
223
+ [INFO|tokenization_utils_base.py:2087] 2024-05-13 14:00:01,284 >> loading file vocab.json from cache at /root/.cache/huggingface/hub/models--PlanTL-GOB-ES--bsc-bio-ehr-es/snapshots/1e543adb2d21f19d85a89305eebdbd64ab656b99/vocab.json
224
+ [INFO|tokenization_utils_base.py:2087] 2024-05-13 14:00:01,284 >> loading file merges.txt from cache at /root/.cache/huggingface/hub/models--PlanTL-GOB-ES--bsc-bio-ehr-es/snapshots/1e543adb2d21f19d85a89305eebdbd64ab656b99/merges.txt
225
+ [INFO|tokenization_utils_base.py:2087] 2024-05-13 14:00:01,284 >> loading file tokenizer.json from cache at None
226
+ [INFO|tokenization_utils_base.py:2087] 2024-05-13 14:00:01,284 >> loading file added_tokens.json from cache at None
227
+ [INFO|tokenization_utils_base.py:2087] 2024-05-13 14:00:01,284 >> loading file special_tokens_map.json from cache at /root/.cache/huggingface/hub/models--PlanTL-GOB-ES--bsc-bio-ehr-es/snapshots/1e543adb2d21f19d85a89305eebdbd64ab656b99/special_tokens_map.json
228
+ [INFO|tokenization_utils_base.py:2087] 2024-05-13 14:00:01,284 >> loading file tokenizer_config.json from cache at /root/.cache/huggingface/hub/models--PlanTL-GOB-ES--bsc-bio-ehr-es/snapshots/1e543adb2d21f19d85a89305eebdbd64ab656b99/tokenizer_config.json
229
+ [INFO|configuration_utils.py:726] 2024-05-13 14:00:01,284 >> loading configuration file config.json from cache at /root/.cache/huggingface/hub/models--PlanTL-GOB-ES--bsc-bio-ehr-es/snapshots/1e543adb2d21f19d85a89305eebdbd64ab656b99/config.json
230
+ [INFO|configuration_utils.py:789] 2024-05-13 14:00:01,285 >> Model config RobertaConfig {
231
+ "_name_or_path": "PlanTL-GOB-ES/bsc-bio-ehr-es",
232
+ "architectures": [
233
+ "RobertaForMaskedLM"
234
+ ],
235
+ "attention_probs_dropout_prob": 0.1,
236
+ "bos_token_id": 0,
237
+ "classifier_dropout": null,
238
+ "eos_token_id": 2,
239
+ "gradient_checkpointing": false,
240
+ "hidden_act": "gelu",
241
+ "hidden_dropout_prob": 0.1,
242
+ "hidden_size": 768,
243
+ "initializer_range": 0.02,
244
+ "intermediate_size": 3072,
245
+ "layer_norm_eps": 1e-05,
246
+ "max_position_embeddings": 514,
247
+ "model_type": "roberta",
248
+ "num_attention_heads": 12,
249
+ "num_hidden_layers": 12,
250
+ "pad_token_id": 1,
251
+ "position_embedding_type": "absolute",
252
+ "transformers_version": "4.40.2",
253
+ "type_vocab_size": 1,
254
+ "use_cache": true,
255
+ "vocab_size": 50262
256
+ }
257
+
258
+ [INFO|configuration_utils.py:726] 2024-05-13 14:00:01,385 >> loading configuration file config.json from cache at /root/.cache/huggingface/hub/models--PlanTL-GOB-ES--bsc-bio-ehr-es/snapshots/1e543adb2d21f19d85a89305eebdbd64ab656b99/config.json
259
+ [INFO|configuration_utils.py:789] 2024-05-13 14:00:01,386 >> Model config RobertaConfig {
260
+ "_name_or_path": "PlanTL-GOB-ES/bsc-bio-ehr-es",
261
+ "architectures": [
262
+ "RobertaForMaskedLM"
263
+ ],
264
+ "attention_probs_dropout_prob": 0.1,
265
+ "bos_token_id": 0,
266
+ "classifier_dropout": null,
267
+ "eos_token_id": 2,
268
+ "gradient_checkpointing": false,
269
+ "hidden_act": "gelu",
270
+ "hidden_dropout_prob": 0.1,
271
+ "hidden_size": 768,
272
+ "initializer_range": 0.02,
273
+ "intermediate_size": 3072,
274
+ "layer_norm_eps": 1e-05,
275
+ "max_position_embeddings": 514,
276
+ "model_type": "roberta",
277
+ "num_attention_heads": 12,
278
+ "num_hidden_layers": 12,
279
+ "pad_token_id": 1,
280
+ "position_embedding_type": "absolute",
281
+ "transformers_version": "4.40.2",
282
+ "type_vocab_size": 1,
283
+ "use_cache": true,
284
+ "vocab_size": 50262
285
+ }
286
+
287
+ [INFO|modeling_utils.py:3429] 2024-05-13 14:00:06,524 >> loading weights file pytorch_model.bin from cache at /root/.cache/huggingface/hub/models--PlanTL-GOB-ES--bsc-bio-ehr-es/snapshots/1e543adb2d21f19d85a89305eebdbd64ab656b99/pytorch_model.bin
288
+ [INFO|modeling_utils.py:4160] 2024-05-13 14:00:06,768 >> Some weights of the model checkpoint at PlanTL-GOB-ES/bsc-bio-ehr-es were not used when initializing RobertaForTokenClassification: ['lm_head.bias', 'lm_head.decoder.bias', 'lm_head.decoder.weight', 'lm_head.dense.bias', 'lm_head.dense.weight', 'lm_head.layer_norm.bias', 'lm_head.layer_norm.weight']
289
+ - This IS expected if you are initializing RobertaForTokenClassification from the checkpoint of a model trained on another task or with another architecture (e.g. initializing a BertForSequenceClassification model from a BertForPreTraining model).
290
+ - This IS NOT expected if you are initializing RobertaForTokenClassification from the checkpoint of a model that you expect to be exactly identical (initializing a BertForSequenceClassification model from a BertForSequenceClassification model).
291
+ [WARNING|modeling_utils.py:4172] 2024-05-13 14:00:06,768 >> Some weights of RobertaForTokenClassification were not initialized from the model checkpoint at PlanTL-GOB-ES/bsc-bio-ehr-es and are newly initialized: ['classifier.bias', 'classifier.weight']
292
+ You should probably TRAIN this model on a down-stream task to be able to use it for predictions and inference.
293
+
294
+
295
+ /content/dissertation/scripts/ner/run_ner.py:397: FutureWarning: load_metric is deprecated and will be removed in the next major version of datasets. Use 'evaluate.load' instead, from the new library 🤗 Evaluate: https://huggingface.co/docs/evaluate
296
+ metric = load_metric("seqeval")
297
+ /usr/local/lib/python3.10/dist-packages/datasets/load.py:759: FutureWarning: The repository for seqeval contains custom code which must be executed to correctly load the metric. You can inspect the repository content at https://raw.githubusercontent.com/huggingface/datasets/2.19.1/metrics/seqeval/seqeval.py
298
+ You can avoid this message in future by passing the argument `trust_remote_code=True`.
299
+ Passing `trust_remote_code=True` will be mandatory to load this metric from the next major release of `datasets`.
300
+ warnings.warn(
301
+
302
+ 05/13/2024 14:00:09 - INFO - __main__ - *** Evaluate ***
303
+ [INFO|trainer.py:786] 2024-05-13 14:00:09,048 >> The following columns in the evaluation set don't have a corresponding argument in `RobertaForTokenClassification.forward` and have been ignored: tokens, id, ner_tags. If tokens, id, ner_tags are not expected by `RobertaForTokenClassification.forward`, you can safely ignore this message.
304
+ [INFO|trainer.py:3614] 2024-05-13 14:00:09,055 >> ***** Running Evaluation *****
305
+ [INFO|trainer.py:3616] 2024-05-13 14:00:09,055 >> Num examples = 6807
306
+ [INFO|trainer.py:3619] 2024-05-13 14:00:09,055 >> Batch size = 8
307
+
[... per-batch evaluation progress output (0/851 to 844/851 batches, about 12 s at roughly 70 it/s) omitted ...]
/usr/local/lib/python3.10/dist-packages/seqeval/metrics/v1.py:57: UndefinedMetricWarning: Recall and F-score are ill-defined and being set to 0.0 in labels with no true samples. Use `zero_division` parameter to control this behavior.
414
+ _warn_prf(average, modifier, msg_start, len(result))
415
+
416
+ ***** eval metrics *****
417
+ eval_accuracy = 0.0028
418
+ eval_f1 = 0.0007
419
+ eval_loss = 2.4031
420
+ eval_precision = 0.0004
421
+ eval_recall = 0.0386
422
+ eval_runtime = 0:00:16.79
423
+ eval_samples = 6807
424
+ eval_samples_per_second = 405.27
425
+ eval_steps_per_second = 50.666
426
+ 05/13/2024 14:00:25 - INFO - __main__ - *** Predict ***
427
+ [INFO|trainer.py:786] 2024-05-13 14:00:25,855 >> The following columns in the test set don't have a corresponding argument in `RobertaForTokenClassification.forward` and have been ignored: tokens, id, ner_tags. If tokens, id, ner_tags are not expected by `RobertaForTokenClassification.forward`, you can safely ignore this message.
428
+ [INFO|trainer.py:3614] 2024-05-13 14:00:25,857 >> ***** Running Prediction *****
429
+ [INFO|trainer.py:3616] 2024-05-13 14:00:25,857 >> Num examples = 6807
430
+ [INFO|trainer.py:3619] 2024-05-13 14:00:25,857 >> Batch size = 8
431
+
[... per-batch prediction progress output (0/851 to 846/851 batches, about 12 s at roughly 70 it/s) omitted ...]
538
+ [INFO|trainer.py:3305] 2024-05-13 14:00:42,250 >> Saving model checkpoint to /content/dissertation/scripts/ner/output
539
+ [INFO|configuration_utils.py:471] 2024-05-13 14:00:42,251 >> Configuration saved in /content/dissertation/scripts/ner/output/config.json
540
+ [INFO|modeling_utils.py:2590] 2024-05-13 14:00:43,185 >> Model weights saved in /content/dissertation/scripts/ner/output/model.safetensors
541
+ [INFO|tokenization_utils_base.py:2488] 2024-05-13 14:00:43,186 >> tokenizer config file saved in /content/dissertation/scripts/ner/output/tokenizer_config.json
542
+ [INFO|tokenization_utils_base.py:2497] 2024-05-13 14:00:43,186 >> Special tokens file saved in /content/dissertation/scripts/ner/output/special_tokens_map.json
543
+ [INFO|modelcard.py:450] 2024-05-13 14:00:43,335 >> Dropping the following result as it does not have all the necessary fields:
544
+ {'task': {'name': 'Token Classification', 'type': 'token-classification'}, 'dataset': {'name': 'Rodrigo1771/multi-train-drugtemist-dev-ner', 'type': 'Rodrigo1771/multi-train-drugtemist-dev-ner', 'config': 'MultiTrainDrugTEMISTDevNER', 'split': 'validation', 'args': 'MultiTrainDrugTEMISTDevNER'}}
545
+ ***** predict metrics *****
546
+ predict_accuracy = 0.0028
547
+ predict_f1 = 0.0007
548
+ predict_loss = 2.4031
549
+ predict_precision = 0.0004
550
+ predict_recall = 0.0386
551
+ predict_runtime = 0:00:16.07
552
+ predict_samples_per_second = 423.545
553
+ predict_steps_per_second = 52.951
554
+
555
+
556
+
557
+
558
+
559
+
560
+
561
+
562
+
563
+
564
+
565
+
566
+
567
+
568
+
569
+
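The `UndefinedMetricWarning` in the log above comes from seqeval: some labels have no gold spans in this split, so their recall and F-score fall back to 0.0. A toy sketch of the same situation, using the `zero_division` argument the warning points to (assumed available in the installed seqeval release); the tag sequences are invented, not data from this run:

```python
from seqeval.metrics import classification_report

# Toy example: no true FARMACO span exists, so its recall/F1 are ill-defined.
y_true = [["O", "O", "B-SINTOMA", "O", "O"]]
y_pred = [["O", "B-FARMACO", "B-SINTOMA", "O", "O"]]

# zero_division=0 makes the 0.0 fallback explicit and silences the warning.
print(classification_report(y_true, y_pred, zero_division=0))
```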
training_args.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:18cce49dc172023921a8c234d2d643afeaa0b4f8f209fd2e1d76d267cbbe3c95
+ size 5048
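training_args.bin is the pickled `TrainingArguments` object the Trainer saves next to the model; its full contents are dumped in train.log above. A sketch of inspecting a downloaded copy (it is a pickle, so only load files you trust):

```python
import torch

# Path is a placeholder for a locally downloaded copy of training_args.bin.
args = torch.load("training_args.bin")
print(args.learning_rate, args.num_train_epochs, args.per_device_train_batch_size)
```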
vocab.json ADDED
The diff for this file is too large to render. See raw diff