name,gpt4_score
Baichuan2-13B-Chat,8.727272727272727
DevOps-Model-14B-Chat,8.25974026
LLaMA-2-13B,7.636363636363637
LLaMA-2-70B-Chat,7.740259740259741
Mistral-7B,7.8441558441558445
Qwen-14B-Chat,8.642857142857142
Qwen-72B-Chat,8.811688311688311
GPT4,9.019480519480519
Yi-34B-Chat,8.844155844155845
ChatGLM3-6B,8.577922077922079
LLaMA-2-7B,5.318181818181818
GPT-3.5-turbo,8.850649351