logical-reasoning / results /mgtv-llama3_p1_full_metrics.csv
dh-mc's picture
analysis of internlm/llama3
bc127a6
raw
history blame
412 Bytes
epoch,model,accuracy,precision,recall,f1
0,shenzhi-wang/Llama3-8B-Chinese-Chat,0.7836666666666666,0.7667122897184859,0.7929173693086004,0.7679400621793133
1,shenzhi-wang/Llama3-8B-Chinese-Chat_checkpoint-175,0.5686666666666667,0.8071228551961105,0.5686666666666667,0.625398807088777
2,shenzhi-wang/Llama3-8B-Chinese-Chat_checkpoint-350,0.7043333333333334,0.8108167278539298,0.7043333333333334,0.7421863499027709