Upload leaderboard_table_20230717.csv
leaderboard_table_20230717.csv
CHANGED
@@ -1,9 +1,9 @@
 Model,MT-bench (score),Arena Elo rating,MMLU,License,Link
 GPT-4,8.99,1211,0.864,Proprietary,https://openai.com/research/gpt-4
-Claude-2,8.06
+Claude-2,8.06,-,0.785,Proprietary,https://www.anthropic.com/index/claude-2
 GPT-3.5-turbo,7.94,1124,0.700,Proprietary,https://openai.com/blog/chatgpt
-Claude-v1,7.90,1169,0.
-Claude-instant-v1,7.85,1145,0.
+Claude-v1,7.90,1169,0.770,Proprietary,https://www.anthropic.com/index/introducing-claude
+Claude-instant-v1,7.85,1145,0.734,Proprietary,https://www.anthropic.com/index/introducing-claude
 Vicuna-33B,7.12,1096,0.592,Non-commercial,https://huggingface.co/lmsys/vicuna-33b-v1.3
 WizardLM-30B,7.01,-,0.587,Non-commercial,https://huggingface.co/WizardLM/WizardLM-30B-V1.0
 Llama-2-70b-chat,6.86,-,-,Llama 2 Community,https://huggingface.co/meta-llama/Llama-2-70b-hf