The performance is lower than the base model?

by Spico - opened Jan 15

Discussion

Spico

Jan 15

It seems the model does not surpass its base model (mlabonne/Marcoro14-7B-slerp).

Am I missing something?

mlabonne

Owner Jan 15

You're right. You can also see it here: https://huggingface.co/spaces/mlabonne/Yet_Another_LLM_Leaderboard

Every expert in Beyonder ranks significantly lower than the base model. In addition, the role-play (RP) and code experts probably lower its score on this benchmark, since it involves no code or storytelling.
