julien-c HF staff committed on
Commit
92074fc
1 Parent(s): 0466dc3

Migrate model card from transformers-repo

Read announcement at https://discuss.huggingface.co/t/announcement-all-model-cards-will-be-migrated-to-hf-co-model-repos/2755
Original file history: https://github.com/huggingface/transformers/commits/master/model_cards/mrm8488/bert-medium-finetuned-squadv2/README.md

Files changed (1)
  1. README.md +122 -0
README.md ADDED
@@ -0,0 +1,122 @@
+ ---
+ language: en
+ thumbnail:
+ ---
+
+ # BERT-Medium fine-tuned on SQuAD v2
+
+ [BERT-Medium](https://github.com/google-research/bert/) created by [Google Research](https://github.com/google-research) and fine-tuned on [SQuAD 2.0](https://rajpurkar.github.io/SQuAD-explorer/) for the **Q&A** downstream task.
+
+ **Model size** (after training): **157.46 MB**
+
+ ## Details of BERT-Medium and its 'family' (from their documentation)
+
+ Released on March 11th, 2020
+
+ This model is one of 24 smaller BERT models (English only, uncased, trained with WordPiece masking) referenced in [Well-Read Students Learn Better: On the Importance of Pre-training Compact Models](https://arxiv.org/abs/1908.08962).
+
+ The smaller BERT models are intended for environments with restricted computational resources. They can be fine-tuned in the same manner as the original BERT models. However, they are most effective in the context of knowledge distillation, where the fine-tuning labels are produced by a larger and more accurate teacher.
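
To make the distillation idea above concrete, here is a minimal sketch in plain Python. It is not the procedure used for this model card or in the referenced paper; the logits, the temperature value, and the function names are all hypothetical. It only illustrates the general pattern: the teacher's softened probabilities act as the labels, and the student is scored by cross-entropy against them.

```python
import math


def softmax(logits, temperature=1.0):
    """Turn raw scores into probabilities; a higher temperature softens the distribution."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    """Cross-entropy of the student's distribution against the teacher's soft labels."""
    teacher_probs = softmax(teacher_logits, temperature)
    student_probs = softmax(student_logits, temperature)
    return -sum(t * math.log(s) for t, s in zip(teacher_probs, student_probs))


# Hypothetical logits over three candidate answer-start positions.
teacher_logits = [4.0, 1.0, 0.5]
close_student = [3.5, 1.2, 0.4]  # roughly agrees with the teacher
far_student = [0.5, 1.0, 4.0]    # disagrees with the teacher

loss_close = distillation_loss(close_student, teacher_logits)
loss_far = distillation_loss(far_student, teacher_logits)
print(f"close student: {loss_close:.4f}, far student: {loss_far:.4f}")
```

A student whose scores track the teacher's receives the lower loss, which is the signal a distillation setup would minimize during fine-tuning.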
+
+ ## Details of the downstream task (Q&A) - Dataset
+
+ [SQuAD2.0](https://rajpurkar.github.io/SQuAD-explorer/) combines the 100,000 questions in SQuAD1.1 with over 50,000 unanswerable questions written adversarially by crowdworkers to look similar to answerable ones. To do well on SQuAD2.0, systems must not only answer questions when possible, but also determine when no answer is supported by the paragraph and abstain from answering.
+
+ | Dataset | Split | # samples |
+ | -------- | ----- | --------- |
+ | SQuAD2.0 | train | 130k |
+ | SQuAD2.0 | eval | 12.3k |
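
As an illustration of the answerable versus unanswerable distinction, the sketch below builds two made-up question records using the field names of the official SQuAD 2.0 JSON files (`is_impossible`, `answers`, `answer_start`). The context, questions, and ids are hypothetical and are not taken from the dataset; `gold_answer` is a helper named here for illustration only.

```python
# Hypothetical paragraph and questions, laid out like entries in the SQuAD 2.0 JSON files.
context = "The Normans gave their name to Normandy, a region in France."

qas = [
    {
        "id": "example-1",
        "question": "Which region is named after the Normans?",
        "is_impossible": False,
        "answers": [{"text": "Normandy", "answer_start": context.index("Normandy")}],
    },
    {
        "id": "example-2",
        "question": "Which river is named after the Normans?",
        "is_impossible": True,  # the paragraph does not support any answer
        "answers": [],
    },
]


def gold_answer(qa, paragraph):
    """Return the first gold answer span, or None when the system should abstain."""
    if qa["is_impossible"] or not qa["answers"]:
        return None
    first = qa["answers"][0]
    start = first["answer_start"]
    return paragraph[start:start + len(first["text"])]


for qa in qas:
    print(qa["question"], "->", gold_answer(qa, context))
```

A model evaluated on SQuAD2.0 is expected to return a span for the first kind of record and an empty prediction for the second.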
+
+ ## Model training
+
+ The model was trained on a Tesla P100 GPU with 25 GB of RAM.
+ The script for fine-tuning can be found [here](https://github.com/huggingface/transformers/blob/master/examples/question-answering/run_squad.py).
+
+ ## Results:
+
+ | Metric | Value |
+ | ------ | --------- |
+ | **EM** | **65.95** |
+ | **F1** | **70.11** |
+
+ ### Raw metrics from benchmark included in training script:
+
+ ```json
+ {
+   "exact": 65.95637159942727,
+   "f1": 70.11632254245896,
+   "total": 11873,
+   "HasAns_exact": 67.79689608636977,
+   "HasAns_f1": 76.12872765631123,
+   "HasAns_total": 5928,
+   "NoAns_exact": 64.12111017661901,
+   "NoAns_f1": 64.12111017661901,
+   "NoAns_total": 5945,
+   "best_exact": 65.96479407058031,
+   "best_exact_thresh": 0.0,
+   "best_f1": 70.12474501361196,
+   "best_f1_thresh": 0.0
+ }
+ ```
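
The raw metrics above can be cross-checked with a little arithmetic. The sketch below copies the reported values and recomputes the overall `exact` and `f1` as averages of the `HasAns` and `NoAns` subsets, weighted by how many questions each subset contains. The helper name `weighted_average` is introduced here for illustration.

```python
# Values copied from the raw metrics reported above.
metrics = {
    "exact": 65.95637159942727,
    "f1": 70.11632254245896,
    "total": 11873,
    "HasAns_exact": 67.79689608636977,
    "HasAns_f1": 76.12872765631123,
    "HasAns_total": 5928,
    "NoAns_exact": 64.12111017661901,
    "NoAns_f1": 64.12111017661901,
    "NoAns_total": 5945,
}


def weighted_average(name):
    """Combine the HasAns and NoAns scores for a metric, weighted by subset size."""
    has_ans = metrics[f"HasAns_{name}"] * metrics["HasAns_total"]
    no_ans = metrics[f"NoAns_{name}"] * metrics["NoAns_total"]
    return (has_ans + no_ans) / metrics["total"]


for name in ("exact", "f1"):
    print(name, "reported:", metrics[name], "recomputed:", weighted_average(name))
```

The recomputed values agree with the reported ones to within floating-point error. `NoAns_exact` and `NoAns_f1` are identical, which is consistent with an unanswerable question scoring either 0 or 100 on both metrics.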
+
+ ## Comparison:
+
+ | Model | EM | F1 score | SIZE (MB) |
+ | --------------------------------------------------------------------------------------------- | --------- | --------- | --------- |
+ | [bert-tiny-finetuned-squadv2](https://huggingface.co/mrm8488/bert-tiny-finetuned-squadv2) | 48.60 | 49.73 | **16.74** |
+ | [bert-tiny-5-finetuned-squadv2](https://huggingface.co/mrm8488/bert-tiny-5-finetuned-squadv2) | 57.12 | 60.86 | 24.34 |
+ | [bert-mini-finetuned-squadv2](https://huggingface.co/mrm8488/bert-mini-finetuned-squadv2) | 56.31 | 59.65 | 42.63 |
+ | [bert-mini-5-finetuned-squadv2](https://huggingface.co/mrm8488/bert-mini-5-finetuned-squadv2) | 63.51 | 66.78 | 66.76 |
+ | [bert-small-finetuned-squadv2](https://huggingface.co/mrm8488/bert-small-finetuned-squadv2) | 60.49 | 64.21 | 109.74 |
+ | [bert-medium-finetuned-squadv2](https://huggingface.co/mrm8488/bert-medium-finetuned-squadv2) | **65.95** | **70.11** | 157.46 |
+
+ ## Model in action
+
+ Fast usage with **pipelines**:
+
+ ```python
+ from transformers import pipeline
+
+ # Load this card's model and its tokenizer into a question-answering pipeline
+ qa_pipeline = pipeline(
+     "question-answering",
+     model="mrm8488/bert-medium-finetuned-squadv2",
+     tokenizer="mrm8488/bert-medium-finetuned-squadv2"
+ )
+
+ qa_pipeline({
+     'context': "Manuel Romero has been working hard in the repository huggingface/transformers lately",
+     'question': "Who has been working hard for huggingface/transformers lately?"
+ })
+
+ # Output:
+ ```
+
+ ```json
+ {
+   "answer": "Manuel Romero",
+   "end": 13,
+   "score": 0.9939319924374637,
+   "start": 0
+ }
+ ```
+
+ ### Yes! That was easy 🎉 Let's try with another example
+
+ ```python
+ qa_pipeline({
+     'context': "Manuel Romero has been working remotely in the repository huggingface/transformers lately",
+     'question': "How has Manuel Romero been working?"
+ })
+
+ # Output:
+ ```
+
+ ```json
+ { "answer": "remotely", "end": 39, "score": 0.3612058272768017, "start": 31 }
+ ```
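
The `start` and `end` fields in the outputs above read as character offsets into the context, with `end` exclusive. The sketch below checks that reading against the two reported results without loading the model. The contexts are abbreviated to their leading words, which is all the offsets depend on, and `extract_span` is a helper named here for illustration.

```python
def extract_span(context, result):
    """Slice the context with the reported character offsets (end is exclusive)."""
    return context[result["start"]:result["end"]]


# Leading part of each example context, plus the outputs reported above.
context_1 = "Manuel Romero has been working"
result_1 = {"answer": "Manuel Romero", "start": 0, "end": 13}

context_2 = "Manuel Romero has been working remotely"
result_2 = {"answer": "remotely", "start": 31, "end": 39}

for context, result in ((context_1, result_1), (context_2, result_2)):
    span = extract_span(context, result)
    print(repr(span), "matches answer:", span == result["answer"])
```

Slicing this way is handy when the answer needs to be highlighted in the original text rather than just displayed.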
+
+ ### It works!! 🎉 🎉 🎉
+
+ > Created by [Manuel Romero/@mrm8488](https://twitter.com/mrm8488) | [LinkedIn](https://www.linkedin.com/in/manuel-romero-cs/)
+
+ > Made with <span style="color: #e25555;">&hearts;</span> in Spain