OpsEval / data_v2 /zabbix_zh_mc_gen.csv
Junetheriver's picture
update 05-15
32e04fa
raw
history blame
No virus
1.21 kB
name,zero_naive,zero_self_con,zero_cot,zero_cot_self_con,few_naive,few_self_con,few_cot,few_cot_self_con
Baichuan2-13B-Chat,31.0,29.0,47.0,47.0,29.0,27.0,40.0,43.0
Chatglm3-6B,29.0,29.0,36.0,36.0,29.0,29.0,34.0,34.0
Devops-Model-14B-Chat,27.0,28.0,36.0,33.0,46.0,44.0,44.0,46.0
Ernie-Bot-4.0,44.0,44.0,48.0,48.0,47.0,47.0,51.0,51.0
Gpt-3.5-Turbo,36.0,36.0,42.0,42.0,40.0,40.0,48.0,48.0
GPT-4,51.0,51.0,53.0,53.0,60.0,60.0,59.0,59.0
Internlm2-Chat-20B,41.0,41.0,,,44.0,44.0,,
Internlm2-Chat-7B,43.0,43.0,39.0,39.0,45.0,45.0,35.0,35.0
Llama-2-13B,28.0,28.0,45.0,45.0,40.0,40.0,43.0,43.0
Llama-2-70B-Chat,1.0,1.0,47.0,47.0,29.0,29.0,46.0,46.0
Llama-2-7B,18.0,18.0,35.0,35.0,22.0,22.0,28.0,28.0
Mistral-7B,6.0,6.0,42.0,42.0,11.0,11.0,44.0,44.0
Qwen-14B-Chat,36.0,36.0,39.0,41.0,44.0,40.0,47.0,43.0
Qwen-72B-Chat,46.0,46.0,44.0,44.0,45.0,45.0,61.0,61.0
Yi-34B-Chat,40.0,40.0,40.0,40.0,42.0,42.0,42.0,42.0
gemma_2b,25.0,25.0,32.0,32.0,24.0,24.0,30.0,30.0
gemma_7b,22.0,22.0,44.0,44.0,28.0,28.0,40.0,40.0
Qwen1.5-14B-Base,38.0,39.670138888888886,39.0,37.58680555555556,48.0,30.381944444444443,36.0,33.072916666666664
Qwen1.5-14B-Chat,34.0,38.0,45.0,39.0,42.0,48.0,48.0,36.0
Qwen1.5-14B-Chat,,34.0,,43.0,,39.0,,49.0