OpsEval / data /zabbix_zh_mc.csv
Junetheriver's picture
update leaderboard
cb9ec9c
raw
history blame
No virus
862 Bytes
name,few_native,few_self_con,few_cot,few_cot_self_con,zero_native,zero_self_con,zero_cot,zero_cot_self_con
GPT4,60,60,59,59,51,51,53,53
Yi-34B-Chat,42,42,42,42,40,40,40,40
DevOps-Model-14B-Chat,46,44,44,46,27,28,36,33
LLaMA-2-7B,22,22,28,28,18,18,35,35
Qwen-72B-Chat,45,45,61,61,46,46,44,44
GPT-3.5-turbo,40,40,48,48,36,36,42,42
ERNIE-Bot-4.0,47,47,51,51,44,44,48,48
Mistral-7B,11,11,44,44,6,6,42,42
LLaMA-2-13B,40,40,43,43,28,28,45,45
Baichuan2-13B-Chat,29,27,40,43,31,29,47,47
Qwen-14B-Chat,44,40,47,43,36,36,39,41
LLaMA-2-70B-Chat,29,29,46,46,1,1,47,47
ChatGLM3-6B,29,29,34,34,29,29,36,36
InternLM2-Chat-20B,44,44,,,41,41,,
InternLM2-Chat-7B,45,45,35,35,43,43,39,39
gemma_2b,24,24,30,30,25,25,32,32
gemma_7b,28,28,40,40,22,22,44,44
qwen1.5-14b-base,48,48,36,36,38,38,39,39
qwen1.5-14b-chat,42,39,48,49,34,34,45,43
Claude-3-Opus,,,,,65,65,,