Independent MMLU-Pro evaluation

by yaronr - opened Sep 26, 2024

Sep 26, 2024

Hi Qwen team,

I'm pleased to share our independent evaluation of the model using our implementation of the MMLU-Pro benchmark.
The results demonstrate impressive performance for the model across multiple categories compared with other models.
I hope you find this useful.

Deathgod7890

Oct 2, 2024

Hi, how are you?

yaronr

Oct 3, 2024

@Deathgod7890 I'm fine, thank you.

Deathgod7890

Oct 27, 2024

What are you doing?

yaronr

Nov 5, 2024

You can find additional analysis, broken down by detailed category, under a new tab, 'Unity Subjects'.
