Update app.py
app.py
CHANGED
@@ -37,7 +37,7 @@ def build_leaderboard_tab(elo_results_file):
 md_1 = gr.Markdown(md)
 gr.Markdown(
 f"""## More Statistics\n
-We
+We added some additional figures to show more statistics. The code for generating them is also included in this [notebook]({notebook_url}).
 Please note that you may see different orders from different ranking methods. This is expected for models that perform similarly, as demonstrated by the confidence interval in the bootstrap figure. Going forward, we prefer the classical Elo calculation because of its scalability and interpretability. You can find more discussions in this blog [post](https://lmsys.org/blog/2023-05-03-arena/).
 """
 )
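The added line relies on the surrounding string being an f-string, so that `{notebook_url}` is replaced with the actual link before the text reaches `gr.Markdown`. The following is a minimal sketch of that substitution only, assuming `notebook_url` is a plain string available in scope. The helper name `build_more_statistics_md` and the example URL are hypothetical and do not come from the commit.

```python
def build_more_statistics_md(notebook_url: str) -> str:
    """Return the 'More Statistics' markdown with the notebook link filled in.

    Hypothetical helper for illustration; the commit builds this string
    inline inside build_leaderboard_tab.
    """
    # The f prefix makes Python substitute {notebook_url} when the string
    # is evaluated. Without it, the braces would be rendered literally.
    return f"""## More Statistics\n
We added some additional figures to show more statistics. The code for generating them is also included in this [notebook]({notebook_url}).
"""


# Placeholder URL, used only to show the substitution.
text = build_more_statistics_md("https://example.com/notebook")
print(text)
```

The resulting string can be passed to `gr.Markdown` as in the diff; keeping the URL in a variable means the link is defined in one place.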