fmmolina commited on
Commit
2212e9f
1 Parent(s): cfb9ce4

Update README.md

Browse files

Added extra information about the model

Files changed (1) hide show
  1. README.md +5 -5
README.md CHANGED
@@ -14,13 +14,11 @@ widget:
14
  - text: "El síndrome de dolor regional complejo es un trastorno de dolor crónico."
15
 
16
  ---
17
-
18
- <!-- This model card has been generated automatically according to the information the Trainer had access to. You
19
- should probably proofread and complete it, then remove this comment. -->
20
-
21
  # bert-base-spanish-wwm-uncased-finetuned-NER-medical
22
 
23
- This model is a fine-tuned version of [dccuchile/bert-base-spanish-wwm-uncased](https://huggingface.co/dccuchile/bert-base-spanish-wwm-uncased) on an adaptation of eHealth-KD Challenge 2020 dataset (filtered only for the task of NER). The dataset annotations for NER are ['Concept', 'Action', 'Predicate', 'Reference'].
 
 
24
 
25
 
26
  It achieves the following results on the evaluation set:
@@ -44,6 +42,8 @@ More information needed
44
 
45
  ## Training procedure
46
 
 
 
47
  ### Training hyperparameters
48
 
49
  The following hyperparameters were used during training:
 
14
  - text: "El síndrome de dolor regional complejo es un trastorno de dolor crónico."
15
 
16
  ---
 
 
 
 
17
  # bert-base-spanish-wwm-uncased-finetuned-NER-medical
18
 
19
+ This model is a fine-tuned version of [dccuchile/bert-base-spanish-wwm-uncased](https://huggingface.co/dccuchile/bert-base-spanish-wwm-uncased) on an adaptation of [eHealth-KD Challenge 2020 dataset](https://knowledge-learning.github.io/ehealthkd-2020/), filtered only for the task of NER. The dataset annotations for NER are ['Concept', 'Action', 'Predicate', 'Reference'].
20
+
21
+ Before the training process, the dataset had processed to label it with the BIO annotation format. Some cleaning and adaptations were needed, for example, to work with overlapped entities.
22
 
23
 
24
  It achieves the following results on the evaluation set:
 
42
 
43
  ## Training procedure
44
 
45
+ The chapter [“Token classification”]( https://huggingface.co/course/chapter7/2) in the Hugging Face online course was used as starting point for the training process. We made some adaptions because our dataset follows a slightly different structure. Moreover, a conversion between string labels and integers labels was needed.
46
+
47
  ### Training hyperparameters
48
 
49
  The following hyperparameters were used during training: