Update README.md
Browse files
README.md
CHANGED
@@ -13,7 +13,7 @@ should probably proofread and complete it, then remove this comment. -->
|
|
13 |
|
14 |
# bert-finetuned-squad
|
15 |
|
16 |
-
This model is a fine-tuned version of [bert-base-cased](https://huggingface.co/bert-base-cased) on
|
17 |
|
18 |
## Model description
|
19 |
|
@@ -29,6 +29,7 @@ More information needed
|
|
29 |
|
30 |
## Training procedure
|
31 |
|
|
|
32 |
### Training hyperparameters
|
33 |
|
34 |
The following hyperparameters were used during training:
|
@@ -42,7 +43,73 @@ The following hyperparameters were used during training:
|
|
42 |
- mixed_precision_training: Native AMP
|
43 |
|
44 |
### Training results
|
45 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
46 |
|
47 |
|
48 |
### Framework versions
|
|
|
13 |
|
14 |
# bert-finetuned-squad
|
15 |
|
16 |
+
This model is a fine-tuned version of [bert-base-cased](https://huggingface.co/bert-base-cased) on the [SQUAD dataset]().
|
17 |
|
18 |
## Model description
|
19 |
|
|
|
29 |
|
30 |
## Training procedure
|
31 |
|
32 |
+
|
33 |
### Training hyperparameters
|
34 |
|
35 |
The following hyperparameters were used during training:
|
|
|
43 |
- mixed_precision_training: Native AMP
|
44 |
|
45 |
### Training results
|
46 |
+
Step Training Loss
|
47 |
+
500 2.635500
|
48 |
+
1000 1.655900
|
49 |
+
1500 1.460800
|
50 |
+
2000 1.378100
|
51 |
+
2500 1.328600
|
52 |
+
3000 1.287900
|
53 |
+
3500 1.236900
|
54 |
+
4000 1.179500
|
55 |
+
4500 1.130300
|
56 |
+
5000 1.163700
|
57 |
+
5500 1.122700
|
58 |
+
6000 1.140600
|
59 |
+
6500 1.141300
|
60 |
+
7000 1.082100
|
61 |
+
7500 1.096400
|
62 |
+
8000 1.108300
|
63 |
+
8500 1.058300
|
64 |
+
9000 1.082500
|
65 |
+
9500 1.026400
|
66 |
+
10000 1.040700
|
67 |
+
10500 1.035200
|
68 |
+
11000 1.010700
|
69 |
+
11500 0.807700
|
70 |
+
12000 0.710500
|
71 |
+
12500 0.784300
|
72 |
+
13000 0.740100
|
73 |
+
13500 0.771600
|
74 |
+
14000 0.777200
|
75 |
+
14500 0.749000
|
76 |
+
15000 0.734800
|
77 |
+
15500 0.749500
|
78 |
+
16000 0.775600
|
79 |
+
16500 0.724300
|
80 |
+
17000 0.768300
|
81 |
+
17500 0.753600
|
82 |
+
18000 0.732900
|
83 |
+
18500 0.734200
|
84 |
+
19000 0.699800
|
85 |
+
19500 0.732600
|
86 |
+
20000 0.764600
|
87 |
+
20500 0.772900
|
88 |
+
21000 0.734000
|
89 |
+
21500 0.734000
|
90 |
+
22000 0.691000
|
91 |
+
22500 0.588700
|
92 |
+
23000 0.514800
|
93 |
+
23500 0.539000
|
94 |
+
24000 0.515900
|
95 |
+
24500 0.490800
|
96 |
+
25000 0.524200
|
97 |
+
25500 0.516200
|
98 |
+
26000 0.486200
|
99 |
+
26500 0.526000
|
100 |
+
27000 0.495300
|
101 |
+
27500 0.527600
|
102 |
+
28000 0.484800
|
103 |
+
28500 0.486300
|
104 |
+
29000 0.522200
|
105 |
+
29500 0.519200
|
106 |
+
30000 0.508800
|
107 |
+
30500 0.516700
|
108 |
+
31000 0.490600
|
109 |
+
31500 0.516100
|
110 |
+
32000 0.499500
|
111 |
+
32500 0.496100
|
112 |
+
33000 0.465300
|
113 |
|
114 |
|
115 |
### Framework versions
|