Open-LLM-Leaderboard

community

https://vila-lab.github.io/Open-LLM-Leaderboard-Website/

Open-LLM-Leaderboard: Open-Style Question Evaluation

We introduce the Open-LLM-Leaderboard, which tracks the performance of various LLMs on open-style questions, where a model must produce its own answer rather than choose from given options, so that scores better reflect true capability. You can use the OSQ-bench questions and prompts to evaluate your own models automatically with an LLM-based evaluator.
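As a rough illustration of how evaluation with an LLM-based evaluator could be wired up, here is a minimal Python sketch. Everything in it is an assumption made for illustration: the grading template, the function names (`build_grading_prompt`, `parse_verdict`, `evaluate`), and the `question`/`reference` item fields are hypothetical and are not the actual OSQ-bench prompts or data schema. The model and the evaluator are passed in as plain callables so that any backend can be plugged in.

```python
from typing import Callable, Dict, List

# Hypothetical grading prompt. The real OSQ-bench prompts may differ.
GRADING_TEMPLATE = (
    "Question: {question}\n"
    "Reference answer: {reference}\n"
    "Model answer: {answer}\n"
    "Reply with exactly one word: CORRECT or INCORRECT."
)


def build_grading_prompt(question: str, reference: str, answer: str) -> str:
    """Fill the (assumed) grading template for one open-style question."""
    return GRADING_TEMPLATE.format(
        question=question, reference=reference, answer=answer
    )


def parse_verdict(reply: str) -> bool:
    """Return True only if the evaluator's first word is CORRECT."""
    words = reply.strip().upper().split()
    return bool(words) and words[0].strip(".,:;!") == "CORRECT"


def evaluate(
    items: List[Dict[str, str]],
    model: Callable[[str], str],
    evaluator: Callable[[str], str],
) -> float:
    """Return the fraction of items the evaluator judges as correct.

    `items` is assumed to hold dicts with "question" and "reference" keys.
    `model` maps a question to a free-form answer.
    `evaluator` maps a grading prompt to a verdict string.
    """
    if not items:
        return 0.0
    correct = 0
    for item in items:
        answer = model(item["question"])
        prompt = build_grading_prompt(
            item["question"], item["reference"], answer
        )
        if parse_verdict(evaluator(prompt)):
            correct += 1
    return correct / len(items)
```

In practice, `model` would wrap the LLM under test and `evaluator` would wrap a stronger judge LLM. Keeping both as callables means the scoring loop can be tested with simple stand-in functions before any real model is attached.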

spaces 1

OSQ Leaderboard

datasets 1

Open-Style/Open-LLM-Benchmark

Viewer • Updated Jul 31 • 402k • 66