Update README.md
README.md
CHANGED
@@ -35,7 +35,7 @@ On german EQ-Bench (v2_de) 51.82 (insignificant over 51.41 for original llamafie
 
 Note: We can improve the correctness of parsing, i.a., by only a few SFT steps, as shown with cas/phi3-mini-4k-llamafied-sft-v3 (170/171 correct but with then only 39.46 score in v2_de, which was also an experiment in changing the prompt template).
 All that was quickly done with bnb and q4 quants only, which might, in theory, affect especially such small dense models significantly.
-But it served the intention for both proof-of-concept-experiments at least. Probably it would easily be possible to further improve results, but that
+But it served the intention for both proof-of-concept-experiments at least. Probably it would easily be possible to further improve results, but that would take some time and compute.
 
 # Training setup
 