Update README.md
README.md
CHANGED
@@ -7,4 +7,6 @@ sdk: static
 pinned: false
 ---
 
-
+# Open-LLM-Leaderboard: Open-Style Question Evaluation
+We introduce the Open-LLM-Leaderboard to track various LLMs’ performance on open-style questions and reflect their true capability.
+You can use OSQ-bench questions and prompts to evaluate your models automatically with an LLM-based evaluator.