Update notes on model prep
README.md
@@ -9,6 +9,7 @@ datasets: squad
 # mobilebert-uncased-finetuned-squadv1
 
 This model is a finetuned version of the [mobilebert-uncased](https://huggingface.co/google/mobilebert-uncased/tree/main) model on the SQuADv1 task.
+To make this TPU-trained model stable when used in PyTorch on GPUs, the original model has been additionally pretrained for one epoch on BookCorpus and English Wikipedia with disabled dropout before finetuning on the SQuADv1 task.
 
 It is produced as part of the work on the paper [The Optimal BERT Surgeon: Scalable and Accurate Second-Order Pruning for Large Language Models](https://arxiv.org/abs/2203.07259).
 
@@ -30,4 +31,5 @@ If you find the model useful, please consider citing our work.
 journal={arXiv preprint arXiv:2203.07259},
 year={2022}
 }
-```
+```
+
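For context on the added note, a minimal sketch of that prep step, assuming the Hugging Face `transformers` API; the MLM objective and the specific config fields used to disable dropout are assumptions for illustration, not the authors' published training script:

```python
# Sketch (assumption, not the authors' actual script) of the prep step described
# in the added note: load the TPU-trained MobileBERT checkpoint with dropout
# disabled, then continue masked-language-model pretraining for one epoch on
# BookCorpus and English Wikipedia before finetuning on SQuADv1.
from transformers import AutoConfig, AutoModelForMaskedLM, AutoTokenizer

config = AutoConfig.from_pretrained("google/mobilebert-uncased")
config.hidden_dropout_prob = 0.0            # disable hidden-state dropout
config.attention_probs_dropout_prob = 0.0   # disable attention dropout

tokenizer = AutoTokenizer.from_pretrained("google/mobilebert-uncased")
model = AutoModelForMaskedLM.from_pretrained(
    "google/mobilebert-uncased", config=config
)

# One epoch of MLM pretraining on BookCorpus + English Wikipedia would follow
# here (e.g. with transformers' Trainer and DataCollatorForLanguageModeling),
# and only afterwards the usual SQuADv1 finetuning.
```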