OpsEval / data /tencent_zh_qa.csv
Junetheriver's picture
added leaderboards
a6d507f
raw
history blame
397 Bytes
name,gpt4_score
Baichuan2-13B-Chat,8.727272727272727
DevOps-Model-14B-Chat,8.25974026
LLaMA-2-13B,7.636363636363637
LLaMA-2-70B-Chat,7.740259740259741
Mistral-7B,7.8441558441558445
Qwen-14B-Chat,8.642857142857142
Qwen-72B-Chat,8.811688311688311
GPT4,9.019480519480519
Yi-34B-Chat,8.844155844155845
ChatGLM3-6B,8.577922077922079
LLaMA-2-7B,5.318181818181818
GPT-3.5-turbo,8.850649351