qihoo360
/

llama3-8B-360Zhinao-360k-Instruct

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

zhs12 commited on May 20

Commit

f013916

•

1 Parent(s): a0cf305

Update README.md

Files changed (1) hide show

README.md +10 -0

README.md CHANGED Viewed

@@ -37,6 +37,16 @@ We found that [the "value retrieval" variant of NIAH](https://github.com/Arize-a
 This model does achieve 100% all-green results on value retrieval but less than satisfactory results on the original version.
 ## Usage

 This model does achieve 100% all-green results on value retrieval but less than satisfactory results on the original version.
+### Reproduce
+[360k/niah](https://github.com/Qihoo360/360zhinao/blob/main/360k/niah/) generates the raw results.
+The score for value retrieval NIAH is calculated on-the-fly when generating the raw results, while the actual score of original and Chinese NIAH is calculated in [360k/plot](https://github.com/Qihoo360/360zhinao/blob/main/360k/plot/).
+For the original version, 100% score is given if the regular expression `sandwich.+?dolores.+?sunny` matches the model output, and edit distance otherwise.
+For the Chinese version, 100% score is given if `刘秀` is present in the model output, and edit distance otherwise. For the English-biased llama3 models this may not be perfect.
 ## Usage