madlag commited on
Commit
ab63cfa
1 Parent(s): f9e2938

Adding modes, graphs and metadata.

Browse files
This view is limited to 50 files because it contains too many changes.   See raw diff
Files changed (50) hide show
  1. README.md +115 -0
  2. config.json +264 -0
  3. model_card/density_info.js +174 -0
  4. model_card/images/layer_0_attention_output_dense.png +0 -0
  5. model_card/images/layer_0_attention_self_key.png +0 -0
  6. model_card/images/layer_0_attention_self_query.png +0 -0
  7. model_card/images/layer_0_attention_self_value.png +0 -0
  8. model_card/images/layer_0_intermediate_dense.png +0 -0
  9. model_card/images/layer_0_output_dense.png +0 -0
  10. model_card/images/layer_10_attention_output_dense.png +0 -0
  11. model_card/images/layer_10_attention_self_key.png +0 -0
  12. model_card/images/layer_10_attention_self_query.png +0 -0
  13. model_card/images/layer_10_attention_self_value.png +0 -0
  14. model_card/images/layer_10_intermediate_dense.png +0 -0
  15. model_card/images/layer_10_output_dense.png +0 -0
  16. model_card/images/layer_11_attention_output_dense.png +0 -0
  17. model_card/images/layer_11_attention_self_key.png +0 -0
  18. model_card/images/layer_11_attention_self_query.png +0 -0
  19. model_card/images/layer_11_attention_self_value.png +0 -0
  20. model_card/images/layer_11_intermediate_dense.png +0 -0
  21. model_card/images/layer_11_output_dense.png +0 -0
  22. model_card/images/layer_12_attention_output_dense.png +0 -0
  23. model_card/images/layer_12_attention_self_key.png +0 -0
  24. model_card/images/layer_12_attention_self_query.png +0 -0
  25. model_card/images/layer_12_attention_self_value.png +0 -0
  26. model_card/images/layer_12_intermediate_dense.png +0 -0
  27. model_card/images/layer_12_output_dense.png +0 -0
  28. model_card/images/layer_13_attention_output_dense.png +0 -0
  29. model_card/images/layer_13_attention_self_key.png +0 -0
  30. model_card/images/layer_13_attention_self_query.png +0 -0
  31. model_card/images/layer_13_attention_self_value.png +0 -0
  32. model_card/images/layer_13_intermediate_dense.png +0 -0
  33. model_card/images/layer_13_output_dense.png +0 -0
  34. model_card/images/layer_14_attention_output_dense.png +0 -0
  35. model_card/images/layer_14_attention_self_key.png +0 -0
  36. model_card/images/layer_14_attention_self_query.png +0 -0
  37. model_card/images/layer_14_attention_self_value.png +0 -0
  38. model_card/images/layer_14_intermediate_dense.png +0 -0
  39. model_card/images/layer_14_output_dense.png +0 -0
  40. model_card/images/layer_15_attention_output_dense.png +0 -0
  41. model_card/images/layer_15_attention_self_key.png +0 -0
  42. model_card/images/layer_15_attention_self_query.png +0 -0
  43. model_card/images/layer_15_attention_self_value.png +0 -0
  44. model_card/images/layer_15_intermediate_dense.png +0 -0
  45. model_card/images/layer_15_output_dense.png +0 -0
  46. model_card/images/layer_16_attention_output_dense.png +0 -0
  47. model_card/images/layer_16_attention_self_key.png +0 -0
  48. model_card/images/layer_16_attention_self_query.png +0 -0
  49. model_card/images/layer_16_attention_self_value.png +0 -0
  50. model_card/images/layer_16_intermediate_dense.png +0 -0
README.md ADDED
@@ -0,0 +1,115 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language: en
3
+ thumbnail:
4
+ license: mit
5
+ tags:
6
+ - question-answering
7
+ -
8
+ -
9
+ datasets:
10
+ - squad_v2
11
+ metrics:
12
+ - squad_v2
13
+ widget:
14
+ - text: "Where is the Eiffel Tower located?"
15
+ context: "The Eiffel Tower is a wrought-iron lattice tower on the Champ de Mars in Paris, France. It is named after the engineer Gustave Eiffel, whose company designed and built the tower."
16
+ - text: "Who is Frederic Chopin?"
17
+ context: "Frédéric François Chopin, born Fryderyk Franciszek Chopin (1 March 1810 – 17 October 1849), was a Polish composer and virtuoso pianist of the Romantic era who wrote primarily for solo piano."
18
+ ---
19
+
20
+ ## BERT-base uncased model fine-tuned on SQuAD v1
21
+
22
+ This model was created using the [nn_pruning](https://github.com/huggingface/nn_pruning) python library: the **linear layers contains 16.0%** of the original weights.
23
+
24
+
25
+
26
+ The model contains **24.0%** of the original weights **overall** (the embeddings account for a significant part of the model, and they are not pruned by this method).
27
+
28
+ With a simple resizing of the linear matrices it ran **2.63x as fast as bert-large-uncased-whole-word-masking** on the evaluation.
29
+ This is possible because the pruning method lead to structured matrices: to visualize them, hover below on the plot to see the non-zero/zero parts of each matrix.
30
+
31
+ <div class="graph"><script src="/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/density_info.js" id="cddd6c5c-2e1d-40c7-b172-f7d5422349a6"></script></div>
32
+
33
+ In terms of accuracy, its **F1 is 82.57**, compared with 85.85 for , a **F1 drop of 3.28**.
34
+
35
+ ## Fine-Pruning details
36
+ This model was fine-tuned from the HuggingFace [model](https://huggingface.co/bert-large-uncased-whole-word-masking) uncased checkpoint on [SQuAD1.1](https://rajpurkar.github.io/SQuAD-explorer), and distilled from the model [madlag/bert-large-uncased-whole-word-masking-finetuned-squadv2](https://huggingface.co/madlag/bert-large-uncased-whole-word-masking-finetuned-squadv2).
37
+ This model is case-insensitive: it does not make a difference between english and English.
38
+
39
+ A side-effect of the block pruning is that some of the attention heads are completely removed: 190 heads were removed on a total of 384 (49.5%).
40
+ Here is a detailed view on how the remaining heads are distributed in the network after pruning.
41
+ <div class="graph"><script src="/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/pruning_info.js" id="03ad75cf-8048-44ae-a1d6-db69021cc168"></script></div>
42
+
43
+ ## Details of the SQuAD1.1 dataset
44
+
45
+ | Dataset | Split | # samples |
46
+ | -------- | ----- | --------- |
47
+ | SQuAD 2.0 | train | 130.0K |
48
+ | SQuAD 2.0 | eval | 11.9k |
49
+
50
+ ### Fine-tuning
51
+ - Python: `3.8.5`
52
+
53
+ - Machine specs:
54
+
55
+ ```CPU: Intel(R) Core(TM) i7-6700K CPU
56
+ Memory: 64 GiB
57
+ GPUs: 1 GeForce GTX 3090, with 24GiB memory
58
+ GPU driver: 455.23.05, CUDA: 11.1
59
+ ```
60
+
61
+ ### Results
62
+
63
+ **Pytorch model file size**: `1084MB` (original BERT: `1228.0MB`)
64
+
65
+ | Metric | # Value | # Original ([Table 2](https://www.aclweb.org/anthology/N19-1423.pdf))| Variation |
66
+ | ------ | --------- | --------- | --------- |
67
+ | **EM** | **79.70** | **82.83** | **-4.13**|
68
+ | **F1** | **82.57** | **85.85** | **-3.28**|
69
+
70
+ ```
71
+ {
72
+ "HasAns_exact": 74.8144399460189,
73
+ "HasAns_f1": 80.555306012496,
74
+ "HasAns_total": 5928,
75
+ "NoAns_exact": 84.57527333894029,
76
+ "NoAns_f1": 84.57527333894029,
77
+ "NoAns_total": 5945,
78
+ "best_exact": 79.70184452118251,
79
+ "best_exact_thresh": 0.0,
80
+ "best_f1": 82.56816761071966,
81
+ "best_f1_thresh": 0.0,
82
+ "exact": 79.70184452118251,
83
+ "f1": 82.56816761071981,
84
+ "total": 11873
85
+ }
86
+ ```
87
+
88
+ ## Example Usage
89
+ Install nn_pruning: it contains the optimization script, which just pack the linear layers into smaller ones by removing empty rows/columns.
90
+
91
+ `pip install nn_pruning`
92
+
93
+ Then you can use the `transformers library` almost as usual: you just have to call `optimize_model` when the pipeline has loaded.
94
+
95
+ ```python
96
+ from transformers import pipeline
97
+ from nn_pruning.inference_model_patcher import optimize_model
98
+
99
+ qa_pipeline = pipeline(
100
+ "question-answering",
101
+ model="madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1",
102
+ tokenizer="madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1"
103
+ )
104
+
105
+ print("bert-large-uncased-whole-word-masking parameters: 445.0M")
106
+ print(f"Parameters count (includes only head pruning, not feed forward pruning)={int(qa_pipeline.model.num_parameters() / 1E6)}M")
107
+ qa_pipeline.model = optimize_model(qa_pipeline.model, "dense")
108
+
109
+ print(f"Parameters count after complete optimization={int(qa_pipeline.model.num_parameters() / 1E6)}M")
110
+ predictions = qa_pipeline({
111
+ 'context': "Frédéric François Chopin, born Fryderyk Franciszek Chopin (1 March 1810 – 17 October 1849), was a Polish composer and virtuoso pianist of the Romantic era who wrote primarily for solo piano.",
112
+ 'question': "Who is Frederic Chopin?",
113
+ })
114
+ print("Predictions", predictions)
115
+ ```
config.json ADDED
@@ -0,0 +1,264 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "_name_or_path": "/tmp/tmpitf3rdr5",
3
+ "architectures": [
4
+ "BertForQuestionAnswering"
5
+ ],
6
+ "attention_probs_dropout_prob": 0.1,
7
+ "gradient_checkpointing": false,
8
+ "hidden_act": "gelu",
9
+ "hidden_dropout_prob": 0.1,
10
+ "hidden_size": 1024,
11
+ "initializer_range": 0.02,
12
+ "intermediate_size": 4096,
13
+ "layer_norm_eps": 1e-12,
14
+ "max_position_embeddings": 512,
15
+ "model_type": "bert",
16
+ "num_attention_heads": 16,
17
+ "num_hidden_layers": 24,
18
+ "pad_token_id": 0,
19
+ "position_embedding_type": "absolute",
20
+ "pruned_heads": {
21
+ "0": [
22
+ 0,
23
+ 1,
24
+ 3,
25
+ 4,
26
+ 5,
27
+ 8,
28
+ 9,
29
+ 13,
30
+ 15
31
+ ],
32
+ "1": [
33
+ 0,
34
+ 1,
35
+ 3,
36
+ 5,
37
+ 7,
38
+ 9,
39
+ 10,
40
+ 13,
41
+ 14
42
+ ],
43
+ "2": [
44
+ 0,
45
+ 1,
46
+ 4,
47
+ 6,
48
+ 7,
49
+ 8,
50
+ 9,
51
+ 10,
52
+ 11,
53
+ 12,
54
+ 13,
55
+ 14,
56
+ 15
57
+ ],
58
+ "3": [
59
+ 0,
60
+ 3,
61
+ 5,
62
+ 6,
63
+ 7,
64
+ 8,
65
+ 9,
66
+ 11,
67
+ 13,
68
+ 14,
69
+ 15
70
+ ],
71
+ "4": [
72
+ 0,
73
+ 2,
74
+ 3,
75
+ 4,
76
+ 5,
77
+ 6,
78
+ 7,
79
+ 9,
80
+ 10,
81
+ 11,
82
+ 12,
83
+ 14
84
+ ],
85
+ "5": [
86
+ 1,
87
+ 3,
88
+ 4,
89
+ 5,
90
+ 6,
91
+ 7,
92
+ 8,
93
+ 9,
94
+ 12,
95
+ 13,
96
+ 14,
97
+ 15
98
+ ],
99
+ "6": [
100
+ 0,
101
+ 1,
102
+ 2,
103
+ 3,
104
+ 4,
105
+ 7,
106
+ 8,
107
+ 9,
108
+ 10,
109
+ 11,
110
+ 12,
111
+ 15
112
+ ],
113
+ "7": [
114
+ 0,
115
+ 3,
116
+ 4,
117
+ 5,
118
+ 8,
119
+ 9,
120
+ 10,
121
+ 11,
122
+ 12,
123
+ 13,
124
+ 14
125
+ ],
126
+ "8": [
127
+ 3,
128
+ 4,
129
+ 5,
130
+ 7,
131
+ 8,
132
+ 9,
133
+ 10,
134
+ 11,
135
+ 12
136
+ ],
137
+ "9": [
138
+ 0,
139
+ 1,
140
+ 2,
141
+ 3,
142
+ 5,
143
+ 6,
144
+ 7,
145
+ 9,
146
+ 10,
147
+ 13,
148
+ 14,
149
+ 15
150
+ ],
151
+ "10": [
152
+ 1,
153
+ 2,
154
+ 4,
155
+ 5,
156
+ 6,
157
+ 8,
158
+ 11,
159
+ 13
160
+ ],
161
+ "11": [
162
+ 0,
163
+ 2,
164
+ 5,
165
+ 6,
166
+ 7,
167
+ 8,
168
+ 10,
169
+ 12,
170
+ 15
171
+ ],
172
+ "12": [
173
+ 0,
174
+ 2,
175
+ 6,
176
+ 8,
177
+ 9,
178
+ 11,
179
+ 13
180
+ ],
181
+ "13": [
182
+ 2,
183
+ 6,
184
+ 10,
185
+ 12,
186
+ 15
187
+ ],
188
+ "14": [
189
+ 1,
190
+ 5,
191
+ 6,
192
+ 10,
193
+ 11,
194
+ 15
195
+ ],
196
+ "15": [
197
+ 0,
198
+ 9
199
+ ],
200
+ "16": [
201
+ 5,
202
+ 7
203
+ ],
204
+ "17": [
205
+ 1,
206
+ 4,
207
+ 8,
208
+ 12,
209
+ 14
210
+ ],
211
+ "18": [
212
+ 3,
213
+ 4,
214
+ 11
215
+ ],
216
+ "19": [
217
+ 0,
218
+ 5,
219
+ 12
220
+ ],
221
+ "20": [
222
+ 0,
223
+ 4,
224
+ 10,
225
+ 12
226
+ ],
227
+ "21": [
228
+ 0,
229
+ 2,
230
+ 3,
231
+ 4,
232
+ 8,
233
+ 11,
234
+ 12,
235
+ 15
236
+ ],
237
+ "22": [
238
+ 0,
239
+ 1,
240
+ 3,
241
+ 4,
242
+ 7,
243
+ 9,
244
+ 10,
245
+ 11,
246
+ 13,
247
+ 15
248
+ ],
249
+ "23": [
250
+ 2,
251
+ 4,
252
+ 8,
253
+ 9,
254
+ 10,
255
+ 13,
256
+ 14,
257
+ 15
258
+ ]
259
+ },
260
+ "transformers_version": "4.5.1",
261
+ "type_vocab_size": 2,
262
+ "use_cache": true,
263
+ "vocab_size": 30522
264
+ }
model_card/density_info.js ADDED
@@ -0,0 +1,174 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ (function() {
2
+ var fn = function() {
3
+
4
+ (function(root) {
5
+ function now() {
6
+ return new Date();
7
+ }
8
+
9
+ var force = false;
10
+
11
+ if (typeof root._bokeh_onload_callbacks === "undefined" || force === true) {
12
+ root._bokeh_onload_callbacks = [];
13
+ root._bokeh_is_loading = undefined;
14
+ }
15
+
16
+
17
+
18
+
19
+ var element = document.getElementById("cddd6c5c-2e1d-40c7-b172-f7d5422349a6");
20
+ if (element == null) {
21
+ console.warn("Bokeh: autoload.js configured with elementid 'cddd6c5c-2e1d-40c7-b172-f7d5422349a6' but no matching script tag was found.")
22
+ }
23
+
24
+
25
+ function run_callbacks() {
26
+ try {
27
+ root._bokeh_onload_callbacks.forEach(function(callback) {
28
+ if (callback != null)
29
+ callback();
30
+ });
31
+ } finally {
32
+ delete root._bokeh_onload_callbacks
33
+ }
34
+ console.debug("Bokeh: all callbacks have finished");
35
+ }
36
+
37
+ function load_libs(css_urls, js_urls, callback) {
38
+ if (css_urls == null) css_urls = [];
39
+ if (js_urls == null) js_urls = [];
40
+
41
+ root._bokeh_onload_callbacks.push(callback);
42
+ if (root._bokeh_is_loading > 0) {
43
+ console.debug("Bokeh: BokehJS is being loaded, scheduling callback at", now());
44
+ return null;
45
+ }
46
+ if (js_urls == null || js_urls.length === 0) {
47
+ run_callbacks();
48
+ return null;
49
+ }
50
+ console.debug("Bokeh: BokehJS not loaded, scheduling load and callback at", now());
51
+ root._bokeh_is_loading = css_urls.length + js_urls.length;
52
+
53
+ function on_load() {
54
+ root._bokeh_is_loading--;
55
+ if (root._bokeh_is_loading === 0) {
56
+ console.debug("Bokeh: all BokehJS libraries/stylesheets loaded");
57
+ run_callbacks()
58
+ }
59
+ }
60
+
61
+ function on_error() {
62
+ console.error("failed to load " + url);
63
+ }
64
+
65
+ for (var i = 0; i < css_urls.length; i++) {
66
+ var url = css_urls[i];
67
+ const element = document.createElement("link");
68
+ element.onload = on_load;
69
+ element.onerror = on_error;
70
+ element.rel = "stylesheet";
71
+ element.type = "text/css";
72
+ element.href = url;
73
+ console.debug("Bokeh: injecting link tag for BokehJS stylesheet: ", url);
74
+ document.body.appendChild(element);
75
+ }
76
+
77
+ const hashes = {"https://cdn.bokeh.org/bokeh/release/bokeh-2.2.3.min.js": "T2yuo9Oe71Cz/I4X9Ac5+gpEa5a8PpJCDlqKYO0CfAuEszu1JrXLl8YugMqYe3sM", "https://cdn.bokeh.org/bokeh/release/bokeh-widgets-2.2.3.min.js": "98GDGJ0kOMCUMUePhksaQ/GYgB3+NH9h996V88sh3aOiUNX3N+fLXAtry6xctSZ6", "https://cdn.bokeh.org/bokeh/release/bokeh-tables-2.2.3.min.js": "89bArO+nlbP3sgakeHjCo1JYxYR5wufVgA3IbUvDY+K7w4zyxJqssu7wVnfeKCq8"};
78
+
79
+ for (var i = 0; i < js_urls.length; i++) {
80
+ var url = js_urls[i];
81
+ var element = document.createElement('script');
82
+ element.onload = on_load;
83
+ element.onerror = on_error;
84
+ element.async = false;
85
+ element.src = url;
86
+ if (url in hashes) {
87
+ element.crossOrigin = "anonymous";
88
+ element.integrity = "sha384-" + hashes[url];
89
+ }
90
+ console.debug("Bokeh: injecting script tag for BokehJS library: ", url);
91
+ document.head.appendChild(element);
92
+ }
93
+ };
94
+
95
+ function inject_raw_css(css) {
96
+ const element = document.createElement("style");
97
+ element.appendChild(document.createTextNode(css));
98
+ document.body.appendChild(element);
99
+ }
100
+
101
+
102
+ var js_urls = ["https://cdn.bokeh.org/bokeh/release/bokeh-2.2.3.min.js", "https://cdn.bokeh.org/bokeh/release/bokeh-widgets-2.2.3.min.js", "https://cdn.bokeh.org/bokeh/release/bokeh-tables-2.2.3.min.js"];
103
+ var css_urls = [];
104
+
105
+
106
+ var inline_js = [
107
+ function(Bokeh) {
108
+ Bokeh.set_log_level("info");
109
+ },
110
+
111
+ function(Bokeh) {
112
+ (function() {
113
+ var fn = function() {
114
+ Bokeh.safely(function() {
115
+ (function(root) {
116
+ function embed_document(root) {
117
+
118
+ var docs_json = '{"6f8350c7-6159-4d2b-8e5f-df1c89f733ab":{"roots":{"references":[{"attributes":{},"id":"1152","type":"UnionRenderers"},{"attributes":{"source":{"id":"1128"}},"id":"1133","type":"CDSView"},{"attributes":{},"id":"1153","type":"Selection"},{"attributes":{"data":{"density":["27.2%","7.0%","7.0%","24.2%","7.4%","7.4%","12.7%","10.4%","10.4%","22.9%","12.9%","12.9%","17.0%","13.7%","13.7%","13.4%","15.6%","15.6%","18.9%","17.2%","17.2%","24.6%","17.5%","17.5%","31.2%","17.5%","17.5%","14.8%","18.3%","18.3%","36.1%","15.1%","15.1%","33.6%","16.1%","16.1%","34.7%","14.7%","14.7%","44.4%","17.0%","17.0%","41.4%","13.7%","13.7%","41.5%","12.8%","12.8%","50.1%","13.0%","13.0%","43.1%","12.8%","12.8%","38.4%","9.2%","9.2%","28.5%","5.3%","5.3%","18.5%","3.6%","3.6%","9.5%","1.9%","1.9%","6.1%","1.1%","1.1%","7.3%","2.6%","2.6%"],"height":[0.285696,0.293888,0.293888,0.253952,0.31232,0.31232,0.13312,0.438272,0.438272,0.24064,0.54272,0.54272,0.178176,0.57344,0.57344,0.140288,0.654336,0.654336,0.198656,0.720896,0.720896,0.258048,0.734208,0.734208,0.326656,0.734208,0.734208,0.155648,0.765952,0.765952,0.37888,0.633856,0.633856,0.352256,0.67584,0.67584,0.36352,0.618496,0.618496,0.46592,0.71168,0.71168,0.434176,0.576512,0.576512,0.4352,0.538624,0.538624,0.525312,0.545792,0.545792,0.451584,0.535552,0.535552,0.402432,0.387072,0.387072,0.299008,0.223232,0.223232,0.193536,0.149504,0.149504,0.099328,0.077824,0.077824,0.063488,0.045056,0.045056,0.0768,0.110592,0.110592],"img_height":["128px","512px","128px","128px","512px","128px","128px","512px","128px","128px","512px","128px","128px","512px","128px","128px","512px","128px","128px","512px","128px","128px","512px","128px","128px","512px","128px","128px","512px","128px","128px","512px","128px","128px","512px","128px","128px","512px","128px","128px","512px","128px","128px","512px","128px","128px","512px","128px","128px","512px","128px","128px","512px","128px","128px","512px","128px","128px","512px","128px","128px","512px","128px","128px","512px","128px","128px","512px","128px","128px","512px","128px"],"img_width":["128px","128px","512px","128px","128px","512px","128px","128px","512px","128px","128px","512px","128px","128px","512px","128px","128px","512px","128px","128px","512px","128px","128px","512px","128px","128px","512px","128px","128px","512px","128px","128px","512px","128px","128px","512px","128px","128px","512px","128px","128px","512px","128px","128px","512px","128px","128px","512px","128px","128px","512px","128px","128px","512px","128px","128px","512px","128px","128px","512px","128px","128px","512px","128px","128px","512px","128px","128px","512px","128px","128px","512px"],"name":["0.attention.output","0.intermediate","0.output","1.attention.output","1.intermediate","1.output","2.attention.output","2.intermediate","2.output","3.attention.output","3.intermediate","3.output","4.attention.output","4.intermediate","4.output","5.attention.output","5.intermediate","5.output","6.attention.output","6.intermediate","6.output","7.attention.output","7.intermediate","7.output","8.attention.output","8.intermediate","8.output","9.attention.output","9.intermediate","9.output","10.attention.output","10.intermediate","10.output","11.attention.output","11.intermediate","11.output","12.attention.output","12.intermediate","12.output","13.attention.output","13.intermediate","13.output","14.attention.output","14.intermediate","14.output","15.attention.output","15.intermediate","15.output","16.attention.output","16.intermediate","16.output","17.attention.output","17.intermediate","17.output","18.attention.output","18.intermediate","18.output","19.attention.output","19.intermediate","19.output","20.attention.output","20.intermediate","20.output","21.attention.output","21.intermediate","21.output","22.attention.output","22.intermediate","22.output","23.attention.output","23.intermediate","23.output"],"parameters":["0.29","0.29","0.29","0.25","0.31","0.31","0.13","0.44","0.44","0.24","0.54","0.54","0.18","0.57","0.57","0.14","0.65","0.65","0.20","0.72","0.72","0.26","0.73","0.73","0.33","0.73","0.73","0.16","0.77","0.77","0.38","0.63","0.63","0.35","0.68","0.68","0.36","0.62","0.62","0.47","0.71","0.71","0.43","0.58","0.58","0.44","0.54","0.54","0.53","0.55","0.55","0.45","0.54","0.54","0.40","0.39","0.39","0.30","0.22","0.22","0.19","0.15","0.15","0.10","0.08","0.08","0.06","0.05","0.05","0.08","0.11","0.11"],"url":["/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_0_attention_output_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_0_intermediate_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_0_output_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_1_attention_output_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_1_intermediate_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_1_output_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_2_attention_output_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_2_intermediate_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_2_output_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_3_attention_output_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_3_intermediate_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_3_output_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_4_attention_output_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_4_intermediate_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_4_output_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_5_attention_output_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_5_intermediate_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_5_output_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_6_attention_output_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_6_intermediate_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_6_output_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_7_attention_output_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_7_intermediate_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_7_output_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_8_attention_output_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_8_intermediate_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_8_output_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_9_attention_output_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_9_intermediate_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_9_output_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_10_attention_output_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_10_intermediate_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_10_output_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_11_attention_output_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_11_intermediate_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_11_output_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_12_attention_output_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_12_intermediate_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_12_output_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_13_attention_output_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_13_intermediate_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_13_output_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_14_attention_output_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_14_intermediate_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_14_output_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_15_attention_output_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_15_intermediate_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_15_output_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_16_attention_output_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_16_intermediate_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_16_output_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_17_attention_output_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_17_intermediate_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_17_output_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_18_attention_output_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_18_intermediate_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_18_output_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_19_attention_output_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_19_intermediate_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_19_output_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_20_attention_output_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_20_intermediate_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_20_output_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_21_attention_output_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_21_intermediate_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_21_output_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_22_attention_output_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_22_intermediate_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_22_output_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_23_attention_output_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_23_intermediate_dense.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_23_output_dense.png"],"x":[0.5833333333333334,0.75,0.9166666666666667,1.5833333333333333,1.75,1.9166666666666665,2.5833333333333335,2.75,2.916666666666667,3.5833333333333335,3.75,3.916666666666667,4.583333333333333,4.75,4.916666666666666,5.583333333333333,5.75,5.916666666666666,6.583333333333333,6.75,6.916666666666666,7.583333333333333,7.75,7.916666666666666,8.583333333333334,8.75,8.916666666666668,9.583333333333334,9.75,9.916666666666668,10.583333333333334,10.75,10.916666666666668,11.583333333333334,11.75,11.916666666666668,12.583333333333334,12.75,12.916666666666668,13.583333333333334,13.75,13.916666666666668,14.583333333333334,14.75,14.916666666666668,15.583333333333334,15.75,15.916666666666668,16.583333333333332,16.75,16.916666666666664,17.583333333333332,17.75,17.916666666666664,18.583333333333332,18.75,18.916666666666664,19.583333333333332,19.75,19.916666666666664,20.583333333333332,20.75,20.916666666666664,21.583333333333332,21.75,21.916666666666664,22.583333333333332,22.75,22.916666666666664,23.583333333333332,23.75,23.916666666666664]},"selected":{"id":"1157"},"selection_policy":{"id":"1156"}},"id":"1134","type":"ColumnDataSource"},{"attributes":{"fill_color":{"value":"#aa69f7"},"line_color":{"value":"#aa69f7"},"top":{"field":"height"},"width":{"value":0.125},"x":{"field":"x"}},"id":"1136","type":"VBar"},{"attributes":{"items":[{"id":"1141"},{"id":"1142"},{"id":"1143"},{"id":"1144"}],"location":[10,0],"orientation":"horizontal"},"id":"1140","type":"Legend"},{"attributes":{"fill_alpha":{"value":0.1},"fill_color":{"value":"#aa69f7"},"line_alpha":{"value":0.1},"line_color":{"value":"#aa69f7"},"top":{"field":"height"},"width":{"value":0.125},"x":{"field":"x"}},"id":"1137","type":"VBar"},{"attributes":{"source":{"id":"1134"}},"id":"1139","type":"CDSView"},{"attributes":{},"id":"1146","type":"BasicTickFormatter"},{"attributes":{},"id":"1154","type":"UnionRenderers"},{"attributes":{"label":{"value":"query"},"renderers":[{"id":"1120"}]},"id":"1141","type":"LegendItem"},{"attributes":{},"id":"1155","type":"Selection"},{"attributes":{"label":{"value":"key"},"renderers":[{"id":"1126"}]},"id":"1142","type":"LegendItem"},{"attributes":{},"id":"1148","type":"BasicTickFormatter"},{"attributes":{"data_source":{"id":"1116"},"glyph":{"id":"1118"},"hover_glyph":null,"muted_glyph":null,"name":"query","nonselection_glyph":{"id":"1119"},"selection_glyph":null,"view":{"id":"1121"}},"id":"1120","type":"GlyphRenderer"},{"attributes":{"fill_alpha":{"value":0.1},"fill_color":{"value":"#6573f7"},"line_alpha":{"value":0.1},"line_color":{"value":"#6573f7"},"top":{"field":"height"},"width":{"value":0.125},"x":{"field":"x"}},"id":"1119","type":"VBar"},{"attributes":{"fill_color":{"value":"#ed5642"},"line_color":{"value":"#ed5642"},"top":{"field":"height"},"width":{"value":0.125},"x":{"field":"x"}},"id":"1124","type":"VBar"},{"attributes":{"active_drag":"auto","active_inspect":"auto","active_multi":null,"active_scroll":"auto","active_tap":"auto","tools":[{"id":"1094"}]},"id":"1114","type":"Toolbar"},{"attributes":{"data":{"density":["21.9%","25.6%","14.8%","20.7%","15.9%","10.5%","15.9%","23.8%","33.8%","15.7%","36.9%","39.3%","37.3%","41.9%","45.7%","49.4%","51.6%","42.2%","48.9%","39.3%","32.0%","15.0%","10.2%","7.9%"],"height":[0.229376,0.268288,0.155648,0.217088,0.166912,0.110592,0.166912,0.249856,0.354304,0.164864,0.387072,0.411648,0.391168,0.439296,0.479232,0.518144,0.540672,0.442368,0.513024,0.411648,0.335872,0.157696,0.106496,0.082944],"img_height":["128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px"],"img_width":["128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px"],"name":["0.attention.query","1.attention.query","2.attention.query","3.attention.query","4.attention.query","5.attention.query","6.attention.query","7.attention.query","8.attention.query","9.attention.query","10.attention.query","11.attention.query","12.attention.query","13.attention.query","14.attention.query","15.attention.query","16.attention.query","17.attention.query","18.attention.query","19.attention.query","20.attention.query","21.attention.query","22.attention.query","23.attention.query"],"parameters":["0.23","0.27","0.16","0.22","0.17","0.11","0.17","0.25","0.35","0.16","0.39","0.41","0.39","0.44","0.48","0.52","0.54","0.44","0.51","0.41","0.34","0.16","0.11","0.08"],"url":["/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_0_attention_self_query.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_1_attention_self_query.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_2_attention_self_query.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_3_attention_self_query.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_4_attention_self_query.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_5_attention_self_query.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_6_attention_self_query.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_7_attention_self_query.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_8_attention_self_query.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_9_attention_self_query.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_10_attention_self_query.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_11_attention_self_query.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_12_attention_self_query.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_13_attention_self_query.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_14_attention_self_query.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_15_attention_self_query.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_16_attention_self_query.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_17_attention_self_query.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_18_attention_self_query.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_19_attention_self_query.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_20_attention_self_query.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_21_attention_self_query.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_22_attention_self_query.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_23_attention_self_query.png"],"x":[0.08333333333333333,1.0833333333333333,2.0833333333333335,3.0833333333333335,4.083333333333333,5.083333333333333,6.083333333333333,7.083333333333333,8.083333333333334,9.083333333333334,10.083333333333334,11.083333333333334,12.083333333333334,13.083333333333334,14.083333333333334,15.083333333333334,16.083333333333332,17.083333333333332,18.083333333333332,19.083333333333332,20.083333333333332,21.083333333333332,22.083333333333332,23.083333333333332]},"selected":{"id":"1151"},"selection_policy":{"id":"1150"}},"id":"1116","type":"ColumnDataSource"},{"attributes":{"label":{"value":"value"},"renderers":[{"id":"1132"}]},"id":"1143","type":"LegendItem"},{"attributes":{},"id":"1111","type":"BasicTicker"},{"attributes":{"source":{"id":"1116"}},"id":"1121","type":"CDSView"},{"attributes":{"axis_label":"Parameters (M)","formatter":{"id":"1146"},"minor_tick_line_color":null,"ticker":{"id":"1111"}},"id":"1110","type":"LinearAxis"},{"attributes":{"label":{"value":"fully connected"},"renderers":[{"id":"1138"}]},"id":"1144","type":"LegendItem"},{"attributes":{"above":[{"id":"1140"}],"below":[{"id":"1106"}],"center":[{"id":"1109"},{"id":"1113"}],"left":[{"id":"1110"}],"outline_line_color":null,"plot_height":300,"plot_width":505,"renderers":[{"id":"1120"},{"id":"1126"},{"id":"1132"},{"id":"1138"}],"title":{"id":"1096"},"toolbar":{"id":"1114"},"x_range":{"id":"1098"},"x_scale":{"id":"1102"},"y_range":{"id":"1100"},"y_scale":{"id":"1104"}},"id":"1095","subtype":"Figure","type":"Plot"},{"attributes":{"data":{"density":["20.7%","25.8%","16.2%","22.0%","16.6%","11.7%","17.1%","25.6%","34.1%","16.8%","39.8%","39.5%","37.6%","44.2%","44.6%","52.3%","55.2%","44.3%","52.2%","41.5%","33.5%","14.2%","12.1%","9.4%"],"height":[0.217088,0.270336,0.169984,0.2304,0.17408,0.12288,0.1792,0.268288,0.357376,0.176128,0.417792,0.413696,0.39424,0.463872,0.467968,0.548864,0.57856,0.464896,0.54784,0.4352,0.351232,0.14848,0.126976,0.098304],"img_height":["128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px"],"img_width":["128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px"],"name":["0.attention.key","1.attention.key","2.attention.key","3.attention.key","4.attention.key","5.attention.key","6.attention.key","7.attention.key","8.attention.key","9.attention.key","10.attention.key","11.attention.key","12.attention.key","13.attention.key","14.attention.key","15.attention.key","16.attention.key","17.attention.key","18.attention.key","19.attention.key","20.attention.key","21.attention.key","22.attention.key","23.attention.key"],"parameters":["0.22","0.27","0.17","0.23","0.17","0.12","0.18","0.27","0.36","0.18","0.42","0.41","0.39","0.46","0.47","0.55","0.58","0.46","0.55","0.44","0.35","0.15","0.13","0.10"],"url":["/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_0_attention_self_key.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_1_attention_self_key.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_2_attention_self_key.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_3_attention_self_key.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_4_attention_self_key.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_5_attention_self_key.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_6_attention_self_key.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_7_attention_self_key.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_8_attention_self_key.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_9_attention_self_key.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_10_attention_self_key.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_11_attention_self_key.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_12_attention_self_key.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_13_attention_self_key.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_14_attention_self_key.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_15_attention_self_key.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_16_attention_self_key.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_17_attention_self_key.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_18_attention_self_key.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_19_attention_self_key.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_20_attention_self_key.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_21_attention_self_key.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_22_attention_self_key.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_23_attention_self_key.png"],"x":[0.25,1.25,2.25,3.25,4.25,5.25,6.25,7.25,8.25,9.25,10.25,11.25,12.25,13.25,14.25,15.25,16.25,17.25,18.25,19.25,20.25,21.25,22.25,23.25]},"selected":{"id":"1153"},"selection_policy":{"id":"1152"}},"id":"1122","type":"ColumnDataSource"},{"attributes":{},"id":"1156","type":"UnionRenderers"},{"attributes":{"axis":{"id":"1110"},"dimension":1,"ticker":null},"id":"1113","type":"Grid"},{"attributes":{},"id":"1157","type":"Selection"},{"attributes":{},"id":"1102","type":"LinearScale"},{"attributes":{},"id":"1150","type":"UnionRenderers"},{"attributes":{"callback":null,"tooltips":"\\n &lt;div&gt;\\n &lt;div style=\\"margin-bottom:10px\\"&gt;\\n &lt;span style=\\"font-size: 15px;\\"&gt;&lt;b&gt;@name&lt;/b&gt;&lt;br/&gt;density=@density&lt;/span&gt;\\n &lt;/div&gt;\\n &lt;div&gt; \\n &lt;img\\n src=\\"@url\\" height=\\"@img_height\\" width=\\"@img_width\\" alt=\\"@url\\"\\n style=\\"float: left; margin: 0px 15px 15px 0px;\\"\\n border=\\"0\\"\\n /&gt;\\n &lt;/div&gt;\\n &lt;/div&gt;\\n "},"id":"1094","type":"HoverTool"},{"attributes":{"data_source":{"id":"1122"},"glyph":{"id":"1124"},"hover_glyph":null,"muted_glyph":null,"name":"key","nonselection_glyph":{"id":"1125"},"selection_glyph":null,"view":{"id":"1127"}},"id":"1126","type":"GlyphRenderer"},{"attributes":{"axis":{"id":"1106"},"grid_line_color":null,"ticker":null},"id":"1109","type":"Grid"},{"attributes":{"data_source":{"id":"1134"},"glyph":{"id":"1136"},"hover_glyph":null,"muted_glyph":null,"name":"fully connected","nonselection_glyph":{"id":"1137"},"selection_glyph":null,"view":{"id":"1139"}},"id":"1138","type":"GlyphRenderer"},{"attributes":{},"id":"1151","type":"Selection"},{"attributes":{"fill_alpha":{"value":0.1},"fill_color":{"value":"#ed5642"},"line_alpha":{"value":0.1},"line_color":{"value":"#ed5642"},"top":{"field":"height"},"width":{"value":0.125},"x":{"field":"x"}},"id":"1125","type":"VBar"},{"attributes":{"data":{"density":["25.4%","22.2%","13.1%","21.8%","15.2%","13.0%","16.5%","22.3%","27.5%","12.9%","33.2%","31.5%","35.9%","42.4%","37.5%","38.9%","50.1%","44.3%","40.3%","31.2%","18.9%","10.2%","7.5%","8.1%"],"height":[0.26624,0.232448,0.137216,0.228352,0.159744,0.136192,0.173056,0.233472,0.288768,0.135168,0.34816,0.330752,0.376832,0.444416,0.393216,0.407552,0.525312,0.464896,0.422912,0.326656,0.198656,0.106496,0.078848,0.084992],"img_height":["128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px"],"img_width":["128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px","128px"],"name":["0.attention.value","1.attention.value","2.attention.value","3.attention.value","4.attention.value","5.attention.value","6.attention.value","7.attention.value","8.attention.value","9.attention.value","10.attention.value","11.attention.value","12.attention.value","13.attention.value","14.attention.value","15.attention.value","16.attention.value","17.attention.value","18.attention.value","19.attention.value","20.attention.value","21.attention.value","22.attention.value","23.attention.value"],"parameters":["0.27","0.23","0.14","0.23","0.16","0.14","0.17","0.23","0.29","0.14","0.35","0.33","0.38","0.44","0.39","0.41","0.53","0.46","0.42","0.33","0.20","0.11","0.08","0.08"],"url":["/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_0_attention_self_value.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_1_attention_self_value.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_2_attention_self_value.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_3_attention_self_value.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_4_attention_self_value.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_5_attention_self_value.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_6_attention_self_value.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_7_attention_self_value.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_8_attention_self_value.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_9_attention_self_value.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_10_attention_self_value.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_11_attention_self_value.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_12_attention_self_value.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_13_attention_self_value.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_14_attention_self_value.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_15_attention_self_value.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_16_attention_self_value.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_17_attention_self_value.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_18_attention_self_value.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_19_attention_self_value.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_20_attention_self_value.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_21_attention_self_value.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_22_attention_self_value.png","/madlag/bert-large-uncased-wwm-squadv2-x2.63-f82.6-d16-hybrid-v1/raw/main/model_card/images/layer_23_attention_self_value.png"],"x":[0.41666666666666663,1.4166666666666665,2.416666666666667,3.416666666666667,4.416666666666666,5.416666666666666,6.416666666666666,7.416666666666666,8.416666666666668,9.416666666666668,10.416666666666668,11.416666666666668,12.416666666666668,13.416666666666668,14.416666666666668,15.416666666666668,16.416666666666664,17.416666666666664,18.416666666666664,19.416666666666664,20.416666666666664,21.416666666666664,22.416666666666664,23.416666666666664]},"selected":{"id":"1155"},"selection_policy":{"id":"1154"}},"id":"1128","type":"ColumnDataSource"},{"attributes":{"text":"Transformer Layers"},"id":"1096","type":"Title"},{"attributes":{},"id":"1104","type":"LinearScale"},{"attributes":{"source":{"id":"1122"}},"id":"1127","type":"CDSView"},{"attributes":{"fill_color":{"value":"#6573f7"},"line_color":{"value":"#6573f7"},"top":{"field":"height"},"width":{"value":0.125},"x":{"field":"x"}},"id":"1118","type":"VBar"},{"attributes":{"fill_color":{"value":"#20cb97"},"line_color":{"value":"#20cb97"},"top":{"field":"height"},"width":{"value":0.125},"x":{"field":"x"}},"id":"1130","type":"VBar"},{"attributes":{"start":0},"id":"1100","type":"DataRange1d"},{"attributes":{},"id":"1098","type":"DataRange1d"},{"attributes":{"axis_label":"Layer","formatter":{"id":"1148"},"minor_tick_line_color":null,"ticker":{"id":"1107"}},"id":"1106","type":"LinearAxis"},{"attributes":{"data_source":{"id":"1128"},"glyph":{"id":"1130"},"hover_glyph":null,"muted_glyph":null,"name":"value","nonselection_glyph":{"id":"1131"},"selection_glyph":null,"view":{"id":"1133"}},"id":"1132","type":"GlyphRenderer"},{"attributes":{},"id":"1107","type":"BasicTicker"},{"attributes":{"fill_alpha":{"value":0.1},"fill_color":{"value":"#20cb97"},"line_alpha":{"value":0.1},"line_color":{"value":"#20cb97"},"top":{"field":"height"},"width":{"value":0.125},"x":{"field":"x"}},"id":"1131","type":"VBar"}],"root_ids":["1095"]},"title":"Bokeh Application","version":"2.2.3"}}';
119
+ var render_items = [{"docid":"6f8350c7-6159-4d2b-8e5f-df1c89f733ab","root_ids":["1095"],"roots":{"1095":"cddd6c5c-2e1d-40c7-b172-f7d5422349a6"}}];
120
+ root.Bokeh.embed.embed_items(docs_json, render_items);
121
+
122
+ }
123
+ if (root.Bokeh !== undefined) {
124
+ embed_document(root);
125
+ } else {
126
+ var attempts = 0;
127
+ var timer = setInterval(function(root) {
128
+ if (root.Bokeh !== undefined) {
129
+ clearInterval(timer);
130
+ embed_document(root);
131
+ } else {
132
+ attempts++;
133
+ if (attempts > 100) {
134
+ clearInterval(timer);
135
+ console.log("Bokeh: ERROR: Unable to run BokehJS code because BokehJS library is missing");
136
+ }
137
+ }
138
+ }, 10, root)
139
+ }
140
+ })(window);
141
+ });
142
+ };
143
+ if (document.readyState != "loading") fn();
144
+ else document.addEventListener("DOMContentLoaded", fn);
145
+ })();
146
+ },
147
+ function(Bokeh) {
148
+
149
+
150
+ }
151
+ ];
152
+
153
+ function run_inline_js() {
154
+
155
+ for (var i = 0; i < inline_js.length; i++) {
156
+ inline_js[i].call(root, root.Bokeh);
157
+ }
158
+
159
+ }
160
+
161
+ if (root._bokeh_is_loading === 0) {
162
+ console.debug("Bokeh: BokehJS loaded, going straight to plotting");
163
+ run_inline_js();
164
+ } else {
165
+ load_libs(css_urls, js_urls, function() {
166
+ console.debug("Bokeh: BokehJS plotting callback run at", now());
167
+ run_inline_js();
168
+ });
169
+ }
170
+ }(window));
171
+ };
172
+ if (document.readyState != "loading") fn();
173
+ else document.addEventListener("DOMContentLoaded", fn);
174
+ })();
model_card/images/layer_0_attention_output_dense.png ADDED
model_card/images/layer_0_attention_self_key.png ADDED
model_card/images/layer_0_attention_self_query.png ADDED
model_card/images/layer_0_attention_self_value.png ADDED
model_card/images/layer_0_intermediate_dense.png ADDED
model_card/images/layer_0_output_dense.png ADDED
model_card/images/layer_10_attention_output_dense.png ADDED
model_card/images/layer_10_attention_self_key.png ADDED
model_card/images/layer_10_attention_self_query.png ADDED
model_card/images/layer_10_attention_self_value.png ADDED
model_card/images/layer_10_intermediate_dense.png ADDED
model_card/images/layer_10_output_dense.png ADDED
model_card/images/layer_11_attention_output_dense.png ADDED
model_card/images/layer_11_attention_self_key.png ADDED
model_card/images/layer_11_attention_self_query.png ADDED
model_card/images/layer_11_attention_self_value.png ADDED
model_card/images/layer_11_intermediate_dense.png ADDED
model_card/images/layer_11_output_dense.png ADDED
model_card/images/layer_12_attention_output_dense.png ADDED
model_card/images/layer_12_attention_self_key.png ADDED
model_card/images/layer_12_attention_self_query.png ADDED
model_card/images/layer_12_attention_self_value.png ADDED
model_card/images/layer_12_intermediate_dense.png ADDED
model_card/images/layer_12_output_dense.png ADDED
model_card/images/layer_13_attention_output_dense.png ADDED
model_card/images/layer_13_attention_self_key.png ADDED
model_card/images/layer_13_attention_self_query.png ADDED
model_card/images/layer_13_attention_self_value.png ADDED
model_card/images/layer_13_intermediate_dense.png ADDED
model_card/images/layer_13_output_dense.png ADDED
model_card/images/layer_14_attention_output_dense.png ADDED
model_card/images/layer_14_attention_self_key.png ADDED
model_card/images/layer_14_attention_self_query.png ADDED
model_card/images/layer_14_attention_self_value.png ADDED
model_card/images/layer_14_intermediate_dense.png ADDED
model_card/images/layer_14_output_dense.png ADDED
model_card/images/layer_15_attention_output_dense.png ADDED
model_card/images/layer_15_attention_self_key.png ADDED
model_card/images/layer_15_attention_self_query.png ADDED
model_card/images/layer_15_attention_self_value.png ADDED
model_card/images/layer_15_intermediate_dense.png ADDED
model_card/images/layer_15_output_dense.png ADDED
model_card/images/layer_16_attention_output_dense.png ADDED
model_card/images/layer_16_attention_self_key.png ADDED
model_card/images/layer_16_attention_self_query.png ADDED
model_card/images/layer_16_attention_self_value.png ADDED
model_card/images/layer_16_intermediate_dense.png ADDED