Spaces:
Running
on
CPU Upgrade
Running
on
CPU Upgrade
Commit History
merge with 'main'
6c3a616
minor updates
5c4aa1e
minor fix
3cf286c
minor fix
2aa9a75
minor updates in publishing and logging results
2b9835a
minor update and extend to support different APIs
150bb15
Update src/display/about.py
8a6bfdc
verified
Update src/display/about.py
02cd86f
verified
Update src/display/about.py
56492c3
verified
Update src/display/about.py
6472dd8
verified
Update src/display/about.py
2a8e044
verified
Update src/display/about.py
b92e0da
verified
Updated bibtex
418a002
verified
Updated bibtex
31b8757
verified
Added bibtex
5ead597
verified
Updated bibtex citation
bac5383
verified
Update src/display/about.py
e2aca33
verified
Update src/display/about.py
3c0cb66
verified
modified about.py
818ee3d
Minseok Bae
commited on
Modified about.py so that it displays (%) in columns.
5bcc476
Minseok Bae
commited on
Fixed the leaderboard filtering functionality. Modified filter_models() function in app.py/
1f26f6c
Minseok Bae
commited on
modified the evaluation pipelines.
2c24f05
Minseok Bae
commited on
Added citations
b46b972
Minseok Bae
commited on
Updated about.py
dbcffd4
Minseok Bae
commited on
Edited README and added reproducibility functionality in main_backend.py
f0b90cf
Minseok Bae
commited on
modified read_evals.py
c3e9147
Minseok Bae
commited on
Refine the code style
156ef43
Minseok Bae
commited on
Implemented litellm pipeline
2864204
Minseok Bae
commited on
Edited README and removed error-rate metric
404587d
Minseok Bae
commited on
modified is_model_on_hub()
3b66490
Minseok Bae
commited on
changed back to TOKEN
0c85a8e
Minseok Bae
commited on
changed to HF_TOKEN
a9a1c18
Minseok Bae
commited on
modified check_validity.py and added sample dataset to test functionality
099e4e2
Minseok Bae
commited on
Integrated backend pipelines - error occurs during model submission. (Debugging needed).
58b9de9
Minseok Bae
commited on
Modified for hallucination evaluation task
d7b7dc6
Minseok Bae
commited on
Update src/display/about.py
0baf5c4
update read
943f952
Clémentine
commited on
fixs
314f91a
Clémentine
commited on
updated leaderboard
efeee6d
Clémentine
commited on
Simplified leaderboard v0
9833cdb
Clémentine
commited on
simplified some parts of the code + updated requirements
9d22eee
Clémentine
commited on
Added check on tokenizer to prevent submissions which won't run
7302987
Clémentine
commited on
Update benchmark count and fix typo (`inetuning->finetuning`) (#395)
7abc6a7
fix order of request file vs request file list, to avoid resubmitting issues
976f398
Clémentine
commited on
cache
4ff9eef
Clémentine
commited on
update for caching
395eff6
Clémentine
commited on
add model architecture as column
3dfaf22
Clémentine
commited on
Simplify About
eaace79
Clémentine
commited on
Refactor 2 - added plotting back
b1a1395
Clémentine
commited on