hellaswag results added
Browse files
README.md
CHANGED
@@ -126,23 +126,22 @@ Base:
|
|
126 |
|hellaswag| 1|none | 0|acc |↑ |0.5492|± |0.0050|
|
127 |
| | |none | 0|acc_norm|↑ |0.7353|± |0.0044|
|
128 |
|
129 |
-
<figcaption>
|
130 |
|
131 |
</figure>
|
132 |
|
133 |
Finetuned:
|
134 |
|
135 |
-
|
136 |
-
|
137 |
-
|
138 |
-
|
139 |
-
|
140 |
-
|
141 |
-
5. FDA
|
142 |
|
143 |
-
|
144 |
|
145 |
-
|
146 |
|
147 |
|
148 |
1.2 Finetuned model
|
|
|
126 |
|hellaswag| 1|none | 0|acc |↑ |0.5492|± |0.0050|
|
127 |
| | |none | 0|acc_norm|↑ |0.7353|± |0.0044|
|
128 |
|
129 |
+
<figcaption>Benchmarks of the base model.</figcaption>
|
130 |
|
131 |
</figure>
|
132 |
|
133 |
Finetuned:
|
134 |
|
135 |
+
<figure>
|
136 |
+
|
137 |
+
| Tasks |Version|Filter|n-shot| Metric | |Value | |Stderr|
|
138 |
+
|---------|------:|------|-----:|--------|---|-----:|---|-----:|
|
139 |
+
|hellaswag| 1|none | 0|acc |↑ |0.5490|± |0.0050|
|
140 |
+
| | |none | 0|acc_norm|↑ |0.7358|± |0.0044|
|
|
|
141 |
|
142 |
+
<figcaption>Benchmark of the finetuned model.</figcaption>
|
143 |
|
144 |
+
</figure>
|
145 |
|
146 |
|
147 |
1.2 Finetuned model
|