update readme
README.md
CHANGED
@@ -22,7 +22,7 @@ Thank you to the OpenCompass MMBench team for updating the [leaderboard](https:/
 | InternVL-Chat-V1.5 | 14/80.7 | 15/79.1 | 9/69.8 | 11/82.3 | 10/80.3 |

-The average score on the MMBench Test (CN) reached 82.1, surpassing the InternVL-Chat-V1-5 model's score of 80.7 by 1.4 points.
+The average score on the MMBench Test (CN) reached 82.1, surpassing the InternVL-Chat-V1-5 model's score of 80.7 by 1.4 points. Although the model ranks 7th, this score matches that of GPT-4o, which ranks 4th. Scores on the other four benchmarks (MMBench v1.1 Test (CN), CCBench dev, MMBench Test, and MMBench v1.1 Test) have also improved by 0.2 to 0.6 points, further closing the gap to GPT-4o.

 We found this result noteworthy. As a result, we are sharing this model publicly.
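The 1.4-point margin quoted in the changed line can be checked against the baseline row shown in the hunk. The following is a minimal sketch, not part of the commit: it assumes each table cell holds a `rank/score` pair and that the first cell corresponds to MMBench Test (CN), which the 80.7 figure in the text suggests but the truncated table header does not confirm. The helper name `parse_row` is hypothetical.

```python
# Hypothetical helper: parse one leaderboard row whose cells look like "rank/score".
def parse_row(row: str):
    # Drop the outer pipes, then split the remaining text into cells.
    cells = [c.strip() for c in row.strip().strip("|").split("|")]
    name, pairs = cells[0], cells[1:]
    parsed = []
    for cell in pairs:
        rank, score = cell.split("/")
        parsed.append((int(rank), float(score)))
    return name, parsed


# Baseline row copied from the diff context.
baseline_row = "| InternVL-Chat-V1.5 | 14/80.7 | 15/79.1 | 9/69.8 | 11/82.3 | 10/80.3 |"
name, results = parse_row(baseline_row)

# Assumption: the first cell is the MMBench Test (CN) column.
baseline_rank, baseline_score = results[0]
new_score = 82.1  # score reported in the README text

# Round to one decimal to avoid floating-point noise in the subtraction.
margin = round(new_score - baseline_score, 1)
print(f"{name}: rank {baseline_rank}, score {baseline_score}; margin = {margin}")
```

Rounding to one decimal place matches the precision of the leaderboard figures; without it, the raw floating-point difference would not compare equal to 1.4.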