Commit History
Add raw results links if exists, and fix minor issues
aa7060a
Add env variable SHOW_INCOMPLETE_EVALS and order evaluation queue by priority
8aaf0e7
Allow old model metrics
6269bd0
Add NPM field
f976f1c
Add new tasks and make leaderboard work without new tasks evals
5639a81
support hf leaderboard format and my format
a69553b
minor fixes
79aba72
Add hidden option
b234783
Feature: FIELD with original HF Leaderboard ranking
71ecfbb
Evaluation time metric and plot
359d8a9
show baseline
1b2e131
Fix model eval links and remove huggingface icon from Leaderboard name
439afd4
Refactor code for adding generic tasks
36e3010
Added check to hide non FINISHED models
d9f882d
Committed by Clémentine
simplified display, added an extra config repo to carry dynamic information
9b2e755
Committed by Clémentine
wip
0c7ef71
Committed by Clémentine
Update src/leaderboard/read_evals.py
3b554b5
Incorrectly tagged merges are now flagged
90fa47e
Committed by Clémentine
Added checkbox for merges
b762711
Committed by Clémentine
adding merge check - super slow but at least info is displayed
20b060e
Committed by Clémentine
simplified some parts of the code + updated requirements
9d22eee
Committed by Clémentine
add model architecture as column
3dfaf22
Committed by Clémentine
Refactor 2 - added plotting back
b1a1395
Committed by Clémentine
Fix requirements for mistral models - to change once transformers gets updated.
002172c
Committed by Clémentine
fix col width
fc1e99b
Committed by Clémentine
refacto style + rate limit
df66f6e
Committed by Clémentine
Fix TruthfulQA NaN scores to 0
bb17be3
Committed by Clémentine
refacto part 1
2a5f9fb
Committed by Clémentine