Rodrigo1771 committed cbb688f (1 parent: b08780a)

End of training
README.md ADDED
@@ -0,0 +1,63 @@
+ ---
+ license: apache-2.0
+ base_model: PlanTL-GOB-ES/bsc-bio-ehr-es
+ tags:
+ - token-classification
+ - generated_from_trainer
+ datasets:
+ - Rodrigo1771/drugtemist-ner
+ model-index:
+ - name: output
+   results: []
+ ---
+
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
+ should probably proofread and complete it, then remove this comment. -->
+
+ # output
+
+ This model is a fine-tuned version of [PlanTL-GOB-ES/bsc-bio-ehr-es](https://huggingface.co/PlanTL-GOB-ES/bsc-bio-ehr-es) on the Rodrigo1771/drugtemist-ner dataset.
+ It achieves the following results on the evaluation set:
+ - eval_loss: 1.2930
+ - eval_precision: 0.0028
+ - eval_recall: 0.2693
+ - eval_f1: 0.0056
+ - eval_accuracy: 0.0288
+ - eval_runtime: 16.5731
+ - eval_samples_per_second: 410.727
+ - eval_steps_per_second: 51.348
+ - step: 0
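
As a quick sanity check (an editor's sketch, not part of the auto-generated card), the rounded `eval_f1` above is consistent with the harmonic mean of the full-precision precision and recall reported in `all_results.json`:

```python
# Entity-level F1 (seqeval-style) is the harmonic mean of precision and recall.
# Values below are the full-precision numbers from all_results.json.
precision = 0.0028258395540381536
recall = 0.2693014705882353

f1 = 2 * precision * recall / (precision + recall)
print(round(f1, 4))  # 0.0056, matching eval_f1 above
```

The very low precision together with `step: 0` is consistent with the training log below, where `do_train=False`: these numbers come from an evaluation/prediction-only run, not from a completed training run.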
+
+ ## Model description
+
+ More information needed
+
+ ## Intended uses & limitations
+
+ More information needed
+
+ ## Training and evaluation data
+
+ More information needed
+
+ ## Training procedure
+
+ ### Training hyperparameters
+
+ The following hyperparameters were used during training:
+ - learning_rate: 5e-05
+ - train_batch_size: 4
+ - eval_batch_size: 8
+ - seed: 42
+ - gradient_accumulation_steps: 4
+ - total_train_batch_size: 16
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+ - lr_scheduler_type: linear
+ - num_epochs: 10.0
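
For clarity (editor's sketch): `total_train_batch_size` is derived rather than set directly. It is the per-device batch size times the gradient accumulation steps, times the number of devices (one GPU here, per `n_gpu: 1` in the training log):

```python
# The effective (total) train batch size is derived from the other
# hyperparameters listed above.
per_device_train_batch_size = 4
gradient_accumulation_steps = 4
n_gpu = 1  # "n_gpu: 1" in train.log

total_train_batch_size = (
    per_device_train_batch_size * gradient_accumulation_steps * n_gpu
)
print(total_train_batch_size)  # 16, as listed above
```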
+
+ ### Framework versions
+
+ - Transformers 4.40.2
+ - Pytorch 2.2.1+cu121
+ - Datasets 2.19.1
+ - Tokenizers 0.19.1
all_results.json ADDED
@@ -0,0 +1,19 @@
+ {
+   "eval_accuracy": 0.028794858943639246,
+   "eval_f1": 0.00559299062744574,
+   "eval_loss": 1.2929713726043701,
+   "eval_precision": 0.0028258395540381536,
+   "eval_recall": 0.2693014705882353,
+   "eval_runtime": 16.5731,
+   "eval_samples": 6807,
+   "eval_samples_per_second": 410.727,
+   "eval_steps_per_second": 51.348,
+   "predict_accuracy": 0.028794858943639246,
+   "predict_f1": 0.00559299062744574,
+   "predict_loss": 1.2929713726043701,
+   "predict_precision": 0.0028258395540381536,
+   "predict_recall": 0.2693014705882353,
+   "predict_runtime": 15.7694,
+   "predict_samples_per_second": 431.659,
+   "predict_steps_per_second": 53.965
+ }
config.json ADDED
@@ -0,0 +1,39 @@
+ {
+   "_name_or_path": "PlanTL-GOB-ES/bsc-bio-ehr-es",
+   "architectures": [
+     "RobertaForTokenClassification"
+   ],
+   "attention_probs_dropout_prob": 0.1,
+   "bos_token_id": 0,
+   "classifier_dropout": null,
+   "eos_token_id": 2,
+   "finetuning_task": "ner",
+   "gradient_checkpointing": false,
+   "hidden_act": "gelu",
+   "hidden_dropout_prob": 0.1,
+   "hidden_size": 768,
+   "id2label": {
+     "0": "O",
+     "1": "B-FARMACO",
+     "2": "I-FARMACO"
+   },
+   "initializer_range": 0.02,
+   "intermediate_size": 3072,
+   "label2id": {
+     "B-FARMACO": 1,
+     "I-FARMACO": 2,
+     "O": 0
+   },
+   "layer_norm_eps": 1e-05,
+   "max_position_embeddings": 514,
+   "model_type": "roberta",
+   "num_attention_heads": 12,
+   "num_hidden_layers": 12,
+   "pad_token_id": 1,
+   "position_embedding_type": "absolute",
+   "torch_dtype": "float32",
+   "transformers_version": "4.40.2",
+   "type_vocab_size": 1,
+   "use_cache": true,
+   "vocab_size": 50262
+ }
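
Two properties follow from this config (a hedged editor's sketch, not code from the repo): `id2label` and `label2id` must be mutual inverses, and the `B-FARMACO`/`I-FARMACO` scheme decodes to entity spans under standard BIO rules:

```python
# Label maps copied from config.json above (id2label keys are strings in the
# JSON; ints are used here for simplicity).
id2label = {0: "O", 1: "B-FARMACO", 2: "I-FARMACO"}
label2id = {"B-FARMACO": 1, "I-FARMACO": 2, "O": 0}
assert {v: k for k, v in id2label.items()} == label2id  # mutual inverses

def bio_spans(tags):
    """Decode a BIO tag sequence into (start, end, entity_type) spans."""
    spans, start = [], None
    for i, tag in enumerate(tags + ["O"]):  # trailing sentinel flushes an open span
        if start is not None and not tag.startswith("I-"):
            spans.append((start, i, tags[start][2:]))
            start = None
        if tag.startswith("B-"):
            start = i
    return spans

print(bio_spans(["O", "B-FARMACO", "I-FARMACO", "O"]))  # [(1, 3, 'FARMACO')]
```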
eval_results.json ADDED
@@ -0,0 +1,11 @@
+ {
+   "eval_accuracy": 0.028794858943639246,
+   "eval_f1": 0.00559299062744574,
+   "eval_loss": 1.2929713726043701,
+   "eval_precision": 0.0028258395540381536,
+   "eval_recall": 0.2693014705882353,
+   "eval_runtime": 16.5731,
+   "eval_samples": 6807,
+   "eval_samples_per_second": 410.727,
+   "eval_steps_per_second": 51.348
+ }
merges.txt ADDED
The diff for this file is too large to render. See raw diff
 
model.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:1d6afec433b240e395bd3f2e246116229e0d675eb0ae73c1d584f9ad8e27ea20
+ size 496244100
predict_results.json ADDED
@@ -0,0 +1,10 @@
+ {
+   "predict_accuracy": 0.028794858943639246,
+   "predict_f1": 0.00559299062744574,
+   "predict_loss": 1.2929713726043701,
+   "predict_precision": 0.0028258395540381536,
+   "predict_recall": 0.2693014705882353,
+   "predict_runtime": 15.7694,
+   "predict_samples_per_second": 431.659,
+   "predict_steps_per_second": 53.965
+ }
predictions.txt ADDED
The diff for this file is too large to render. See raw diff
 
special_tokens_map.json ADDED
@@ -0,0 +1,51 @@
+ {
+   "bos_token": {
+     "content": "<s>",
+     "lstrip": false,
+     "normalized": true,
+     "rstrip": false,
+     "single_word": false
+   },
+   "cls_token": {
+     "content": "<s>",
+     "lstrip": false,
+     "normalized": true,
+     "rstrip": false,
+     "single_word": false
+   },
+   "eos_token": {
+     "content": "</s>",
+     "lstrip": false,
+     "normalized": true,
+     "rstrip": false,
+     "single_word": false
+   },
+   "mask_token": {
+     "content": "<mask>",
+     "lstrip": true,
+     "normalized": true,
+     "rstrip": false,
+     "single_word": false
+   },
+   "pad_token": {
+     "content": "<pad>",
+     "lstrip": false,
+     "normalized": true,
+     "rstrip": false,
+     "single_word": false
+   },
+   "sep_token": {
+     "content": "</s>",
+     "lstrip": false,
+     "normalized": true,
+     "rstrip": false,
+     "single_word": false
+   },
+   "unk_token": {
+     "content": "<unk>",
+     "lstrip": false,
+     "normalized": true,
+     "rstrip": false,
+     "single_word": false
+   }
+ }
tb/events.out.tfevents.1715783401.61af03e56d14.4488.0 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:5ab9da2ae3e0d6f1547437ff1ca2dccde86482093f826a321df49f7924fc92ce
+ size 486
tokenizer.json ADDED
The diff for this file is too large to render. See raw diff
 
tokenizer_config.json ADDED
@@ -0,0 +1,58 @@
+ {
+   "add_prefix_space": true,
+   "added_tokens_decoder": {
+     "0": {
+       "content": "<s>",
+       "lstrip": false,
+       "normalized": true,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     },
+     "1": {
+       "content": "<pad>",
+       "lstrip": false,
+       "normalized": true,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     },
+     "2": {
+       "content": "</s>",
+       "lstrip": false,
+       "normalized": true,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     },
+     "3": {
+       "content": "<unk>",
+       "lstrip": false,
+       "normalized": true,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     },
+     "50261": {
+       "content": "<mask>",
+       "lstrip": true,
+       "normalized": true,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     }
+   },
+   "bos_token": "<s>",
+   "clean_up_tokenization_spaces": true,
+   "cls_token": "<s>",
+   "eos_token": "</s>",
+   "errors": "replace",
+   "mask_token": "<mask>",
+   "max_len": 512,
+   "model_max_length": 512,
+   "pad_token": "<pad>",
+   "sep_token": "</s>",
+   "tokenizer_class": "RobertaTokenizer",
+   "trim_offsets": true,
+   "unk_token": "<unk>"
+ }
train.log ADDED
@@ -0,0 +1,343 @@
+ [tqdm progress bars elided: two evaluation passes over 851 batches each, running at roughly 65-80 it/s]
1
+ 2024-05-15 14:29:25.440457: E external/local_xla/xla/stream_executor/cuda/cuda_dnn.cc:9261] Unable to register cuDNN factory: Attempting to register factory for plugin cuDNN when one has already been registered
2
+ 2024-05-15 14:29:25.440508: E external/local_xla/xla/stream_executor/cuda/cuda_fft.cc:607] Unable to register cuFFT factory: Attempting to register factory for plugin cuFFT when one has already been registered
3
+ 2024-05-15 14:29:25.442473: E external/local_xla/xla/stream_executor/cuda/cuda_blas.cc:1515] Unable to register cuBLAS factory: Attempting to register factory for plugin cuBLAS when one has already been registered
4
+ 2024-05-15 14:29:26.567248: W tensorflow/compiler/tf2tensorrt/utils/py_utils.cc:38] TF-TRT Warning: Could not find TensorRT
5
+ 05/15/2024 14:29:28 - WARNING - __main__ - Process rank: 0, device: cuda:0, n_gpu: 1distributed training: True, 16-bits training: False
6
+ 05/15/2024 14:29:28 - INFO - __main__ - Training/evaluation parameters TrainingArguments(
7
+ _n_gpu=1,
8
+ accelerator_config={'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'gradient_accumulation_kwargs': None},
9
+ adafactor=False,
10
+ adam_beta1=0.9,
11
+ adam_beta2=0.999,
12
+ adam_epsilon=1e-08,
13
+ auto_find_batch_size=False,
14
+ bf16=False,
15
+ bf16_full_eval=False,
16
+ data_seed=None,
17
+ dataloader_drop_last=False,
18
+ dataloader_num_workers=0,
19
+ dataloader_persistent_workers=False,
20
+ dataloader_pin_memory=True,
21
+ dataloader_prefetch_factor=None,
22
+ ddp_backend=None,
23
+ ddp_broadcast_buffers=None,
24
+ ddp_bucket_cap_mb=None,
25
+ ddp_find_unused_parameters=None,
26
+ ddp_timeout=1800,
27
+ debug=[],
28
+ deepspeed=None,
29
+ disable_tqdm=False,
30
+ dispatch_batches=None,
31
+ do_eval=True,
32
+ do_predict=True,
33
+ do_train=False,
34
+ eval_accumulation_steps=None,
35
+ eval_delay=0,
36
+ eval_do_concat_batches=True,
37
+ eval_steps=None,
38
+ evaluation_strategy=epoch,
39
+ fp16=False,
40
+ fp16_backend=auto,
41
+ fp16_full_eval=False,
42
+ fp16_opt_level=O1,
43
+ fsdp=[],
44
+ fsdp_config={'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False},
45
+ fsdp_min_num_params=0,
46
+ fsdp_transformer_layer_cls_to_wrap=None,
47
+ full_determinism=False,
48
+ gradient_accumulation_steps=4,
49
+ gradient_checkpointing=False,
50
+ gradient_checkpointing_kwargs=None,
51
+ greater_is_better=True,
52
+ group_by_length=False,
53
+ half_precision_backend=auto,
54
+ hub_always_push=False,
55
+ hub_model_id=None,
56
+ hub_private_repo=False,
57
+ hub_strategy=every_save,
58
+ hub_token=<HUB_TOKEN>,
59
+ ignore_data_skip=False,
60
+ include_inputs_for_metrics=False,
61
+ include_num_input_tokens_seen=False,
62
+ include_tokens_per_second=False,
63
+ jit_mode_eval=False,
64
+ label_names=None,
65
+ label_smoothing_factor=0.0,
66
+ learning_rate=5e-05,
67
+ length_column_name=length,
68
+ load_best_model_at_end=True,
69
+ local_rank=0,
70
+ log_level=passive,
71
+ log_level_replica=warning,
72
+ log_on_each_node=True,
73
+ logging_dir=/content/dissertation/scripts/ner/output/tb,
74
+ logging_first_step=False,
75
+ logging_nan_inf_filter=True,
76
+ logging_steps=500,
77
+ logging_strategy=steps,
78
+ lr_scheduler_kwargs={},
79
+ lr_scheduler_type=linear,
80
+ max_grad_norm=1.0,
81
+ max_steps=-1,
82
+ metric_for_best_model=f1,
83
+ mp_parameters=,
84
+ neftune_noise_alpha=None,
85
+ no_cuda=False,
86
+ num_train_epochs=10.0,
87
+ optim=adamw_torch,
88
+ optim_args=None,
89
+ optim_target_modules=None,
90
+ output_dir=/content/dissertation/scripts/ner/output,
91
+ overwrite_output_dir=True,
92
+ past_index=-1,
93
+ per_device_eval_batch_size=8,
94
+ per_device_train_batch_size=4,
95
+ prediction_loss_only=False,
96
+ push_to_hub=True,
97
+ push_to_hub_model_id=None,
98
+ push_to_hub_organization=None,
99
+ push_to_hub_token=<PUSH_TO_HUB_TOKEN>,
100
+ ray_scope=last,
101
+ remove_unused_columns=True,
102
+ report_to=['tensorboard'],
103
+ resume_from_checkpoint=None,
104
+ run_name=/content/dissertation/scripts/ner/output,
105
+ save_on_each_node=False,
106
+ save_only_model=False,
107
+ save_safetensors=True,
108
+ save_steps=500,
109
+ save_strategy=epoch,
110
+ save_total_limit=None,
111
+ seed=42,
112
+ skip_memory_metrics=True,
113
+ split_batches=None,
114
+ tf32=None,
115
+ torch_compile=False,
116
+ torch_compile_backend=None,
117
+ torch_compile_mode=None,
118
+ torchdynamo=None,
119
+ tpu_metrics_debug=False,
120
+ tpu_num_cores=None,
121
+ use_cpu=False,
122
+ use_ipex=False,
123
+ use_legacy_prediction_loop=False,
124
+ use_mps_device=False,
125
+ warmup_ratio=0.0,
126
+ warmup_steps=0,
127
+ weight_decay=0.0,
128
+ )
129
+ /usr/local/lib/python3.10/dist-packages/datasets/load.py:1486: FutureWarning: The repository for Rodrigo1771/drugtemist-ner contains custom code which must be executed to correctly load the dataset. You can inspect the repository content at https://hf.co/datasets/Rodrigo1771/drugtemist-ner
130
+ You can avoid this message in future by passing the argument `trust_remote_code=True`.
131
+ Passing `trust_remote_code=True` will be mandatory to load this dataset from the next major release of `datasets`.
132
+ warnings.warn(
133
+
134
+
135
+
136
+
137
+
138
+
139
+
140
+ /usr/local/lib/python3.10/dist-packages/huggingface_hub/file_download.py:1132: FutureWarning: `resume_download` is deprecated and will be removed in version 1.0.0. Downloads always resume when possible. If you want to force a new download, use `force_download=True`.
141
+ warnings.warn(
142
+ [INFO|configuration_utils.py:726] 2024-05-15 14:29:40,285 >> loading configuration file config.json from cache at /root/.cache/huggingface/hub/models--PlanTL-GOB-ES--bsc-bio-ehr-es/snapshots/1e543adb2d21f19d85a89305eebdbd64ab656b99/config.json
143
+ [INFO|configuration_utils.py:789] 2024-05-15 14:29:40,289 >> Model config RobertaConfig {
144
+ "_name_or_path": "PlanTL-GOB-ES/bsc-bio-ehr-es",
145
+ "architectures": [
146
+ "RobertaForMaskedLM"
147
+ ],
148
+ "attention_probs_dropout_prob": 0.1,
149
+ "bos_token_id": 0,
150
+ "classifier_dropout": null,
151
+ "eos_token_id": 2,
152
+ "finetuning_task": "ner",
153
+ "gradient_checkpointing": false,
154
+ "hidden_act": "gelu",
155
+ "hidden_dropout_prob": 0.1,
156
+ "hidden_size": 768,
157
+ "id2label": {
158
+ "0": "O",
159
+ "1": "B-FARMACO",
160
+ "2": "I-FARMACO"
161
+ },
162
+ "initializer_range": 0.02,
163
+ "intermediate_size": 3072,
164
+ "label2id": {
165
+ "B-FARMACO": 1,
166
+ "I-FARMACO": 2,
167
+ "O": 0
168
+ },
169
+ "layer_norm_eps": 1e-05,
170
+ "max_position_embeddings": 514,
171
+ "model_type": "roberta",
172
+ "num_attention_heads": 12,
173
+ "num_hidden_layers": 12,
174
+ "pad_token_id": 1,
175
+ "position_embedding_type": "absolute",
176
+ "transformers_version": "4.40.2",
177
+ "type_vocab_size": 1,
178
+ "use_cache": true,
179
+ "vocab_size": 50262
180
+ }
181
+
182
+ [INFO|configuration_utils.py:726] 2024-05-15 14:29:40,381 >> loading configuration file config.json from cache at /root/.cache/huggingface/hub/models--PlanTL-GOB-ES--bsc-bio-ehr-es/snapshots/1e543adb2d21f19d85a89305eebdbd64ab656b99/config.json
183
+ [INFO|configuration_utils.py:789] 2024-05-15 14:29:40,382 >> Model config RobertaConfig {
184
+ "_name_or_path": "PlanTL-GOB-ES/bsc-bio-ehr-es",
185
+ "architectures": [
186
+ "RobertaForMaskedLM"
187
+ ],
188
+ "attention_probs_dropout_prob": 0.1,
189
+ "bos_token_id": 0,
190
+ "classifier_dropout": null,
191
+ "eos_token_id": 2,
192
+ "gradient_checkpointing": false,
193
+ "hidden_act": "gelu",
194
+ "hidden_dropout_prob": 0.1,
195
+ "hidden_size": 768,
196
+ "initializer_range": 0.02,
197
+ "intermediate_size": 3072,
198
+ "layer_norm_eps": 1e-05,
199
+ "max_position_embeddings": 514,
200
+ "model_type": "roberta",
201
+ "num_attention_heads": 12,
202
+ "num_hidden_layers": 12,
203
+ "pad_token_id": 1,
204
+ "position_embedding_type": "absolute",
205
+ "transformers_version": "4.40.2",
206
+ "type_vocab_size": 1,
207
+ "use_cache": true,
208
+ "vocab_size": 50262
209
+ }
210
+
211
+ [INFO|tokenization_utils_base.py:2087] 2024-05-15 14:29:40,392 >> loading file vocab.json from cache at /root/.cache/huggingface/hub/models--PlanTL-GOB-ES--bsc-bio-ehr-es/snapshots/1e543adb2d21f19d85a89305eebdbd64ab656b99/vocab.json
212
+ [INFO|tokenization_utils_base.py:2087] 2024-05-15 14:29:40,392 >> loading file merges.txt from cache at /root/.cache/huggingface/hub/models--PlanTL-GOB-ES--bsc-bio-ehr-es/snapshots/1e543adb2d21f19d85a89305eebdbd64ab656b99/merges.txt
213
+ [INFO|tokenization_utils_base.py:2087] 2024-05-15 14:29:40,392 >> loading file tokenizer.json from cache at None
214
+ [INFO|tokenization_utils_base.py:2087] 2024-05-15 14:29:40,392 >> loading file added_tokens.json from cache at None
+ [INFO|tokenization_utils_base.py:2087] 2024-05-15 14:29:40,392 >> loading file special_tokens_map.json from cache at /root/.cache/huggingface/hub/models--PlanTL-GOB-ES--bsc-bio-ehr-es/snapshots/1e543adb2d21f19d85a89305eebdbd64ab656b99/special_tokens_map.json
+ [INFO|tokenization_utils_base.py:2087] 2024-05-15 14:29:40,392 >> loading file tokenizer_config.json from cache at /root/.cache/huggingface/hub/models--PlanTL-GOB-ES--bsc-bio-ehr-es/snapshots/1e543adb2d21f19d85a89305eebdbd64ab656b99/tokenizer_config.json
+ [INFO|configuration_utils.py:726] 2024-05-15 14:29:40,392 >> loading configuration file config.json from cache at /root/.cache/huggingface/hub/models--PlanTL-GOB-ES--bsc-bio-ehr-es/snapshots/1e543adb2d21f19d85a89305eebdbd64ab656b99/config.json
+ [INFO|configuration_utils.py:789] 2024-05-15 14:29:40,393 >> Model config RobertaConfig {
+ "_name_or_path": "PlanTL-GOB-ES/bsc-bio-ehr-es",
+ "architectures": [
+ "RobertaForMaskedLM"
+ ],
+ "attention_probs_dropout_prob": 0.1,
+ "bos_token_id": 0,
+ "classifier_dropout": null,
+ "eos_token_id": 2,
+ "gradient_checkpointing": false,
+ "hidden_act": "gelu",
+ "hidden_dropout_prob": 0.1,
+ "hidden_size": 768,
+ "initializer_range": 0.02,
+ "intermediate_size": 3072,
+ "layer_norm_eps": 1e-05,
+ "max_position_embeddings": 514,
+ "model_type": "roberta",
+ "num_attention_heads": 12,
+ "num_hidden_layers": 12,
+ "pad_token_id": 1,
+ "position_embedding_type": "absolute",
+ "transformers_version": "4.40.2",
+ "type_vocab_size": 1,
+ "use_cache": true,
+ "vocab_size": 50262
+ }
+
+ [INFO|configuration_utils.py:726] 2024-05-15 14:29:40,477 >> loading configuration file config.json from cache at /root/.cache/huggingface/hub/models--PlanTL-GOB-ES--bsc-bio-ehr-es/snapshots/1e543adb2d21f19d85a89305eebdbd64ab656b99/config.json
+
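The config above follows the standard RoBERTa-base geometry. As a quick sanity check (a sketch using only the values printed in the log, not part of the run), the feed-forward inner width is four times the hidden size and the hidden size splits evenly across the 12 attention heads:

```python
# Values taken from the RobertaConfig dump in the log above.
config = {
    "hidden_size": 768,
    "num_attention_heads": 12,
    "intermediate_size": 3072,
    "num_hidden_layers": 12,
    "max_position_embeddings": 514,  # RoBERTa reserves extra slots beyond the 512 usable positions
}

# Per-head dimensionality: hidden_size split evenly across attention heads.
head_dim = config["hidden_size"] // config["num_attention_heads"]
print(head_dim)  # 64

# Standard transformer ratio: FFN inner width = 4 x hidden size.
assert config["intermediate_size"] == 4 * config["hidden_size"]
```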
+ [INFO|modeling_utils.py:3429] 2024-05-15 14:29:40,721 >> loading weights file pytorch_model.bin from cache at /root/.cache/huggingface/hub/models--PlanTL-GOB-ES--bsc-bio-ehr-es/snapshots/1e543adb2d21f19d85a89305eebdbd64ab656b99/pytorch_model.bin
+ [INFO|modeling_utils.py:4160] 2024-05-15 14:29:40,847 >> Some weights of the model checkpoint at PlanTL-GOB-ES/bsc-bio-ehr-es were not used when initializing RobertaForTokenClassification: ['lm_head.bias', 'lm_head.decoder.bias', 'lm_head.decoder.weight', 'lm_head.dense.bias', 'lm_head.dense.weight', 'lm_head.layer_norm.bias', 'lm_head.layer_norm.weight']
+ - This IS expected if you are initializing RobertaForTokenClassification from the checkpoint of a model trained on another task or with another architecture (e.g. initializing a BertForSequenceClassification model from a BertForPreTraining model).
+ - This IS NOT expected if you are initializing RobertaForTokenClassification from the checkpoint of a model that you expect to be exactly identical (initializing a BertForSequenceClassification model from a BertForSequenceClassification model).
+ [WARNING|modeling_utils.py:4172] 2024-05-15 14:29:40,847 >> Some weights of RobertaForTokenClassification were not initialized from the model checkpoint at PlanTL-GOB-ES/bsc-bio-ehr-es and are newly initialized: ['classifier.bias', 'classifier.weight']
+ You should probably TRAIN this model on a down-stream task to be able to use it for predictions and inference.
+
+ /content/dissertation/scripts/ner/run_ner.py:397: FutureWarning: load_metric is deprecated and will be removed in the next major version of datasets. Use 'evaluate.load' instead, from the new library 🤗 Evaluate: https://huggingface.co/docs/evaluate
+ metric = load_metric("seqeval")
+ /usr/local/lib/python3.10/dist-packages/datasets/load.py:759: FutureWarning: The repository for seqeval contains custom code which must be executed to correctly load the metric. You can inspect the repository content at https://raw.githubusercontent.com/huggingface/datasets/2.19.1/metrics/seqeval/seqeval.py
+ You can avoid this message in future by passing the argument `trust_remote_code=True`.
+ Passing `trust_remote_code=True` will be mandatory to load this metric from the next major release of `datasets`.
+ warnings.warn(
+
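The seqeval metric loaded above scores predictions at the entity level: a predicted span counts as a true positive only if its type and exact boundaries both match a gold span. A minimal sketch of that matching logic (a simplified IOB2 reimplementation for illustration only, not the library's actual code):

```python
def extract_entities(tags):
    """Collect (type, start, end) spans from an IOB2 tag sequence."""
    entities, start, etype = [], None, None
    for i, tag in enumerate(tags + ["O"]):  # sentinel "O" closes a trailing entity
        inside = etype is not None and tag == "I-" + etype
        if etype is not None and not inside:
            entities.append((etype, start, i))  # end is exclusive
            start, etype = None, None
        if tag.startswith("B-"):
            start, etype = i, tag[2:]
    return entities

gold = extract_entities(["O", "B-DRUG", "I-DRUG", "O", "B-DRUG"])
pred = extract_entities(["O", "B-DRUG", "O", "O", "B-DRUG"])

# Strict matching: type and both boundaries must agree exactly.
true_positives = len(set(gold) & set(pred))
precision = true_positives / len(pred)  # 0.5: one of two predicted spans is exact
recall = true_positives / len(gold)     # 0.5: one of two gold spans is recovered
```

This strictness is why entity-level precision can be far below token accuracy: a span with one wrong boundary token scores zero.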
+ 05/15/2024 14:29:44 - INFO - __main__ - *** Evaluate ***
+ [INFO|trainer.py:786] 2024-05-15 14:29:44,602 >> The following columns in the evaluation set don't have a corresponding argument in `RobertaForTokenClassification.forward` and have been ignored: id, ner_tags, tokens. If id, ner_tags, tokens are not expected by `RobertaForTokenClassification.forward`, you can safely ignore this message.
+ [INFO|trainer.py:3614] 2024-05-15 14:29:44,607 >> ***** Running Evaluation *****
+ [INFO|trainer.py:3616] 2024-05-15 14:29:44,608 >> Num examples = 6807
+ [INFO|trainer.py:3619] 2024-05-15 14:29:44,608 >> Batch size = 8
+
  0%|          | 0/851 [00:00<?, ?it/s]
 99%|█████████▉| 845/851 [00:11<00:00, 69.55it/s]
+ ***** eval metrics *****
+ eval_accuracy = 0.0288
+ eval_f1 = 0.0056
+ eval_loss = 1.293
+ eval_precision = 0.0028
+ eval_recall = 0.2693
+ eval_runtime = 0:00:16.57
+ eval_samples = 6807
+ eval_samples_per_second = 410.727
+ eval_steps_per_second = 51.348
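The reported figures are internally consistent. Using only the rounded values printed above (a quick arithmetic check, not part of the run): F1 is the harmonic mean of precision and recall, and the throughput numbers follow from samples and steps divided by the runtime.

```python
# Rounded values copied from the eval log above.
precision, recall = 0.0028, 0.2693
samples, steps, runtime_s = 6807, 851, 16.5731

# F1 = harmonic mean of precision and recall.
f1 = 2 * precision * recall / (precision + recall)
assert abs(f1 - 0.0056) < 2e-4  # matches eval_f1 up to rounding of the inputs

# Throughput figures follow directly from samples / runtime and steps / runtime.
assert abs(samples / runtime_s - 410.727) < 0.01  # eval_samples_per_second
assert abs(steps / runtime_s - 51.348) < 0.01     # eval_steps_per_second
```

Near-zero precision at step 0 is consistent with the earlier warning: the `classifier` head is freshly initialized and the model has not yet been trained on the task.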
+ 05/15/2024 14:30:01 - INFO - __main__ - *** Predict ***
+ [INFO|trainer.py:786] 2024-05-15 14:30:01,182 >> The following columns in the test set don't have a corresponding argument in `RobertaForTokenClassification.forward` and have been ignored: id, ner_tags, tokens. If id, ner_tags, tokens are not expected by `RobertaForTokenClassification.forward`, you can safely ignore this message.
+ [INFO|trainer.py:3614] 2024-05-15 14:30:01,184 >> ***** Running Prediction *****
+ [INFO|trainer.py:3616] 2024-05-15 14:30:01,184 >> Num examples = 6807
+ [INFO|trainer.py:3619] 2024-05-15 14:30:01,184 >> Batch size = 8
+
  0%|          | 0/851 [00:00<?, ?it/s]
 99%|█████████▉| 844/851 [00:11<00:00, 73.07it/s]
+ [INFO|trainer.py:3305] 2024-05-15 14:30:17,255 >> Saving model checkpoint to /content/dissertation/scripts/ner/output
+ [INFO|configuration_utils.py:471] 2024-05-15 14:30:17,256 >> Configuration saved in /content/dissertation/scripts/ner/output/config.json
+ [INFO|modeling_utils.py:2590] 2024-05-15 14:30:18,218 >> Model weights saved in /content/dissertation/scripts/ner/output/model.safetensors
+ [INFO|tokenization_utils_base.py:2488] 2024-05-15 14:30:18,219 >> tokenizer config file saved in /content/dissertation/scripts/ner/output/tokenizer_config.json
+ [INFO|tokenization_utils_base.py:2497] 2024-05-15 14:30:18,219 >> Special tokens file saved in /content/dissertation/scripts/ner/output/special_tokens_map.json
+ [INFO|modelcard.py:450] 2024-05-15 14:30:18,357 >> Dropping the following result as it does not have all the necessary fields:
+ {'task': {'name': 'Token Classification', 'type': 'token-classification'}, 'dataset': {'name': 'Rodrigo1771/drugtemist-ner', 'type': 'Rodrigo1771/drugtemist-ner', 'config': 'DrugTEMIST NER', 'split': 'validation', 'args': 'DrugTEMIST NER'}}
+ ***** predict metrics *****
+ predict_accuracy = 0.0288
+ predict_f1 = 0.0056
+ predict_loss = 1.293
+ predict_precision = 0.0028
+ predict_recall = 0.2693
+ predict_runtime = 0:00:15.76
+ predict_samples_per_second = 431.659
+ predict_steps_per_second = 53.965
+
training_args.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:18cce49dc172023921a8c234d2d643afeaa0b4f8f209fd2e1d76d267cbbe3c95
+ size 5048
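Binary artifacts like `training_args.bin` are stored in the repository as Git LFS pointer files in exactly this three-line `key value` format; the real bytes live in LFS storage, addressed by the SHA-256 digest. A small sketch of parsing one such pointer (the text is copied from the diff above):

```python
# Git LFS pointer text, as shown in the training_args.bin diff.
pointer = """version https://git-lfs.github.com/spec/v1
oid sha256:18cce49dc172023921a8c234d2d643afeaa0b4f8f209fd2e1d76d267cbbe3c95
size 5048
"""

# Each line is "<key> <value>"; the oid value is "<hash-algo>:<hex-digest>".
fields = dict(line.split(" ", 1) for line in pointer.strip().splitlines())
algo, digest = fields["oid"].split(":", 1)
size_bytes = int(fields["size"])  # size of the actual file in LFS storage
```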
vocab.json ADDED
The diff for this file is too large to render. See raw diff