Update README.md
Fix SQuAD reference
README.md
CHANGED
@@ -33,7 +33,7 @@ This is possible because the pruning method lead to structured matrices: to visu
In terms of accuracy, its **F1 is 83.22**, compared with 85.85 for , a **F1 drop of 2.63**.

## Fine-Pruning details

-This model was fine-tuned from the HuggingFace [model](https://huggingface.co/bert-large-uncased-whole-word-masking) uncased checkpoint on [
+This model was fine-tuned from the HuggingFace [model](https://huggingface.co/bert-large-uncased-whole-word-masking) uncased checkpoint on [SQuAD2.0](https://rajpurkar.github.io/SQuAD-explorer), and distilled from the model [madlag/bert-large-uncased-whole-word-masking-finetuned-squadv2](https://huggingface.co/madlag/bert-large-uncased-whole-word-masking-finetuned-squadv2).

This model is case-insensitive: it does not make a difference between english and English.

A side-effect of the block pruning is that some of the attention heads are completely removed: 155 heads were removed on a total of 384 (40.4%).
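The corrected line points the card at the SQuAD2.0 dataset and at the distillation teacher. For context, below is a minimal sketch of how such a SQuAD2.0 question-answering checkpoint is typically loaded with the `transformers` pipeline; the model id used is the teacher named in the diff, serving as a placeholder since the pruned model's own repo id does not appear in this hunk.

```python
# Minimal sketch: load a SQuAD2.0 extractive-QA checkpoint with the
# transformers pipeline. The model id below is the distillation teacher
# mentioned in the diff; substitute the pruned model's own repo id as needed.
from transformers import pipeline

qa = pipeline(
    "question-answering",
    model="madlag/bert-large-uncased-whole-word-masking-finetuned-squadv2",
)

result = qa(
    question="Which dataset was the model fine-tuned on?",
    context="This model was fine-tuned on SQuAD2.0 and distilled from a "
            "BERT-large whole-word-masking teacher.",
)
print(result)  # dict with 'score', 'start', 'end', and 'answer' keys
```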