Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
fdaudensΒ 
posted an update 13 days ago
Post
1847
Look at that πŸ‘€

Actual benchmarks have become too easy for recent models, much like grading high school students on middle school problems makes little sense. So the team worked on a new version of the Open LLM Leaderboard with new benchmarks.

Stellar work from @clefourrier @SaylorTwift and the team!

πŸ‘‰ Read the blog post: open-llm-leaderboard/blog
πŸ‘‰ Explore the leaderboard: open-llm-leaderboard/open_llm_leaderboard

Can't wait to see deepseek coder v2 on there. I have a feeling it will score high. I love that model