@fdaudens on Hugging Face: "Look at that 👀 Actual benchmarks have become too easy for recent models…"

Hugging Face

Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Back to feed

fdaudens

posted an update 13 days ago

Post

1847

Look at that 👀

Actual benchmarks have become too easy for recent models, much like grading high school students on middle school problems makes little sense. So the team worked on a new version of the Open LLM Leaderboard with new benchmarks.

Stellar work from @clefourrier @SaylorTwift and the team!

👉 Read the blog post: open-llm-leaderboard/blog
👉 Explore the leaderboard: open-llm-leaderboard/open_llm_leaderboard

dillfrescott

12 days ago

Can't wait to see deepseek coder v2 on there. I have a feeling it will score high. I love that model

In this post

fdaudens Florent Daudens
dillfrescott Cross Nastasi