Open LLM Leaderboard - a OpenEvals Collection

OpenEvals 's Collections

Open LLM Leaderboard

Research collaborations

Leaderboards related tools

Archived Open LLM Leaderboard (2023-2024)

Open LLM Leaderboard

updated about 11 hours ago

This leaderboard has been evaluating LLMs from Jun 2024 on IFEval, MuSR, GPQA, MATH, BBH and MMLU-Pro

Running

113

113

Open-LLM performances are plateauing, let’s make the leaderboard steep again

🏔

Update leaderboard for fair model evaluation

Note Blog on why we made a new version of the Open LLM Leaderboard
Running on CPU Upgrade

12.6k

12.6k

Open LLM Leaderboard

🏆

Track, rank and evaluate open LLMs and chatbots

Note The actual leaderboard! With a stylish new ux :)
open-llm-leaderboard/contents

Viewer • Updated 23 minutes ago • 4.23k • 16k • 14

Note If you want to download the main leaderboard table, you'll find the dataset here!
open-llm-leaderboard/results

Preview • Updated 27 minutes ago • 86.9k • 9

Note To extract more detailed aggregated results for each model, look here!
open-llm-leaderboard/requests

Updated 1 minute ago • 428k • 9

Note All models ever submitted to the leaderboard
Running on CPU Upgrade

87

87

Open LLM Leaderboard Model Comparator

🏆

Compare Open LLM Leaderboard results