sultan commited on
Commit
7108315
1 Parent(s): 97c0a7b

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -22,7 +22,7 @@ models.
22
 
23
  # Model Description
24
 
25
- This model is fine-tuned on the SQuAD2.0 dataset. Fine-tuning the biomedical language model on the SQuAD dataset helps improve the score on the BioASQ challenge. If you plan to work with BioASQ or biomedical QA tasks, it's better to use this model over BioM-ELECTRA-Large. This model (TensorFlow version ) took the lead in the BioASQ9b-Factoid challenge (Batch 5) under the name of (UDEL-LAB2). To see the full details of BioASQ9B results, please check this link http://participants-area.bioasq.org/results/9b/phaseB/ ( you need to register).
26
 
27
  Huggingface library doesn't implement Layer-Wise decay feature, which affects the performance on SQuAD task. The reported result of BioM-ELECTRA-SQuAD in our paper is 88.3 (F1) since we use ELECTRA open-source code with TF checkpoint, which uses Layer-Wise decay.
28
 
@@ -43,7 +43,7 @@ run_qa.py --model_name_or_path sultan/BioM-ELECTRA-Large-Discriminator \
43
  --doc_stride 128 \
44
  --per_device_train_batch_size 8 \
45
  --gradient_accumulation_steps 6 \
46
- --per_device_eval_batch_size 128 \
47
  --fp16 \
48
  --fp16_opt_level O1 \
49
  --logging_steps 50 \
22
 
23
  # Model Description
24
 
25
+ We fine-tuned BioM-ELECTRA-Large, which was pre-trained on PubMed Abstracts, on the SQuAD2.0 dataset. Fine-tuning the biomedical language model on the SQuAD dataset helps improve the score on the BioASQ challenge. If you plan to work with BioASQ or biomedical QA tasks, it's better to use this model over BioM-ELECTRA-Large. This model (TensorFlow version ) took the lead in the BioASQ9b-Factoid challenge (Batch 5) under the name of (UDEL-LAB2). To see the full details of BioASQ9B results, please check this link http://participants-area.bioasq.org/results/9b/phaseB/ ( you need to register).
26
 
27
  Huggingface library doesn't implement Layer-Wise decay feature, which affects the performance on SQuAD task. The reported result of BioM-ELECTRA-SQuAD in our paper is 88.3 (F1) since we use ELECTRA open-source code with TF checkpoint, which uses Layer-Wise decay.
28
 
43
  --doc_stride 128 \
44
  --per_device_train_batch_size 8 \
45
  --gradient_accumulation_steps 6 \
46
+ --per_device_eval_batch_size 128
47
  --fp16 \
48
  --fp16_opt_level O1 \
49
  --logging_steps 50 \