cstr
/

phi-3-orpo-v9_16

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

cstr commited on May 1

Commit

eb1c1ec

•

1 Parent(s): 5b94316

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -35,7 +35,7 @@ On german EQ-Bench (v2_de) 51.82 (insignificant over 51.41 for original llamafie
 Note: We can improve the correctness of parsing, i.a., by only a few SFT steps, as shown with cas/phi3-mini-4k-llamafied-sft-v3 (170/171 correct but with then only 39.46 score in v2_de, which was also an experiment in changing the prompt template).
 All that was quickly done with bnb and q4 quants only, which might, in theory, affect especially such small dense models significantly.
-But it served the intention for both proof-of-concept-experiments at least. Probably it would easily be possible to further improve results, but that what take some time and compute.
 # Training setup

 Note: We can improve the correctness of parsing, i.a., by only a few SFT steps, as shown with cas/phi3-mini-4k-llamafied-sft-v3 (170/171 correct but with then only 39.46 score in v2_de, which was also an experiment in changing the prompt template).
 All that was quickly done with bnb and q4 quants only, which might, in theory, affect especially such small dense models significantly.
+But it served the intention for both proof-of-concept-experiments at least. Probably it would easily be possible to further improve results, but that would take some time and compute.
 # Training setup