Sarmila/pubmed-bert-squad-covidqa


Sarmila committed on Sep 19, 2023

Commit 2850262 • 1 Parent(s): 343875a

Update README.md

Files changed (1)

README.md +18 -4

README.md CHANGED

@@ -3,11 +3,16 @@ license: mit
 base_model: Sarmila/pubmed-bert-squad-covidqa
 tags:
 - generated_from_trainer
 datasets:
 - covid_qa_deepset
 model-index:
 - name: pubmed-bert-squad-covidqa
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -15,13 +20,22 @@ should probably proofread and complete it, then remove this comment. -->
 # pubmed-bert-squad-covidqa
-This model is a fine-tuned version of [Sarmila/pubmed-bert-squad-covidqa](https://huggingface.co/Sarmila/pubmed-bert-squad-covidqa) on the covid_qa_deepset dataset.
-It achieves the following results on the evaluation set:
 - Loss: 0.4876
 ## Model description
-More information needed
 ## Intended uses & limitations
@@ -58,4 +72,4 @@ The following hyperparameters were used during training:
 - Transformers 4.33.0
 - Pytorch 2.0.0
 - Datasets 2.1.0
-- Tokenizers 0.13.3

 base_model: Sarmila/pubmed-bert-squad-covidqa
 tags:
 - generated_from_trainer
+- biology
 datasets:
 - covid_qa_deepset
+- squad
 model-index:
 - name: pubmed-bert-squad-covidqa
   results: []
+language:
+- en
+pipeline_tag: question-answering
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 # pubmed-bert-squad-covidqa
+This model is a fine-tuned version of [Sarmila/pubmed-bert-squad-covidqa](https://huggingface.co/Sarmila/pubmed-bert-squad-covidqa), trained first on the SQuAD dataset and then on the covid_qa_deepset dataset.
+It achieves the following results on the evaluation set for squad:
+{'exact_match': 59.0, 'f1': 76.32473929579194}
+It achieves the following results on the evaluation set for covidqa:
 - Loss: 0.4876
 ## Model description
+This model was trained to test the PubMed BERT BioNLP language model in a question-answering pipeline.
+While testing on our custom dataset, we realized that the model performed poorly when used directly for QA. Hence, we decided to train it on covidqa
+to accustom the model to answer extraction. Although the covidqa data is very similar to what we intended to use, it is small, so training on it alone gave little improvement.
+Therefore, we first trained the model on the SQuAD dataset, which is larger, and then trained it on covidqa. SQuAD helped the model learn how to extract answers, and covidqa helped us train the model on a domain similar to ours, i.e. biomedicine.
+Further, we first performed masked language modeling (MLM) on PubMed BERT BioNLP using our dataset and then ran exactly the same pipeline to see the difference; that model is [here]
 ## Intended uses & limitations
 - Transformers 4.33.0
 - Pytorch 2.0.0
 - Datasets 2.1.0
+- Tokenizers 0.13.3
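
The `exact_match` and `f1` figures that the updated card reports for SQuAD look like the output of the standard SQuAD v1.1 scoring scheme, although the commit does not say which script produced them. As a rough sketch under that assumption, the two metrics can be computed as below: answers are normalized (lowercased, punctuation and English articles removed, whitespace collapsed), exact match compares the normalized strings, and F1 is the token-overlap F1 against the best-matching reference. The function names and the example strings are hypothetical and are not taken from the model's evaluation data.

```python
import re
import string
from collections import Counter


def normalize_answer(text: str) -> str:
    """Lowercase, drop punctuation and English articles, collapse whitespace."""
    text = text.lower()
    punctuation = set(string.punctuation)
    text = "".join(ch for ch in text if ch not in punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, reference: str) -> float:
    """1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize_answer(prediction) == normalize_answer(reference))


def token_f1(prediction: str, reference: str) -> float:
    """Harmonic mean of token-level precision and recall after normalization."""
    pred_tokens = normalize_answer(prediction).split()
    ref_tokens = normalize_answer(reference).split()
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


def squad_scores(predictions, references):
    """Average both metrics over a dataset, reported as percentages.

    `references` holds a list of acceptable answers per question; each
    prediction is scored against its best-matching reference.
    """
    em_total = 0.0
    f1_total = 0.0
    for prediction, answers in zip(predictions, references):
        em_total += max(exact_match(prediction, a) for a in answers)
        f1_total += max(token_f1(prediction, a) for a in answers)
    count = len(predictions)
    return {"exact_match": 100.0 * em_total / count, "f1": 100.0 * f1_total / count}


# Hypothetical predictions and gold answers, for illustration only.
example_predictions = ["the spike protein", "ACE2 receptor"]
example_references = [["Spike protein"], ["the ACE2 receptor on host cells"]]
example_result = squad_scores(example_predictions, example_references)
```

In this toy example the first prediction matches exactly after normalization, while the second only partially overlaps its reference, so F1 comes out higher than exact match. The same kind of gap appears in the reported SQuAD numbers, where F1 (76.3) is well above exact match (59.0).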