DataCanvas
/

MMAlaya2

Model card Files Files and versions Community

bingwork commited on Aug 29

Commit

b1c19b8

•

1 Parent(s): 238e6ac

update readme

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -22,7 +22,7 @@ Thank you to the OpenCompass MMBench team for updating the [leaderboard](https:/
 | InternVL-Chat-V1.5   | 14/80.7        |    15/79.1   |   9/69.8    |   11/82.3    |   10/80.3    |
-The average score on the MMBench Test (CN) reached 82.1, surpassing the InternVL-Chat-V1-5 model's score of 80.7 by 1.4 points. This achievement places it in the top 4, on par with the performance of GPT-4o. Additionally, scores on the other four benchmarks—MMBench v1.1 Test (CN), CCBench dev, MMBench Test, and MMBench v1.1 Test—have also improved by 0.2 to 0.6 points, bringing them closer to GPT-4o's performance.
 We found this result noteworthy. As a result, we are sharing this model publicly.

 | InternVL-Chat-V1.5   | 14/80.7        |    15/79.1   |   9/69.8    |   11/82.3    |   10/80.3    |
+The average score on the MMBench Test (CN) reached 82.1, surpassing the InternVL-Chat-V1-5 model's score of 80.7 by 1.4 points. Although the rank is 7, this score matches GPT-4o's performance, which is ranked 4th, placing the model on par with GPT-4o. Additionally, scores on the other four benchmarks—MMBench v1.1 Test (CN), CCBench dev, MMBench Test, and MMBench v1.1 Test—have also improved by 0.2 to 0.6 points, further closing the gap to GPT-4o's performance.
 We found this result noteworthy. As a result, we are sharing this model publicly.