@fdaudens on Hugging Face: "Look at that 👀 Actual benchmarks have become too easy for recent models…"

Hugging Face

Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Back to feed

fdaudens

posted an update Jun 26

Post

1879

Look at that 👀

Actual benchmarks have become too easy for recent models, much like grading high school students on middle school problems makes little sense. So the team worked on a new version of the Open LLM Leaderboard with new benchmarks.

Stellar work from @clefourrier @SaylorTwift and the team!

👉 Read the blog post: open-llm-leaderboard/blog
👉 Explore the leaderboard: open-llm-leaderboard/open_llm_leaderboard

dillfrescott

Jun 27

Can't wait to see deepseek coder v2 on there. I have a feeling it will score high. I love that model

In this post

fdaudens Florent Daudens
dillfrescott Cross