Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
SUSTech
/
tlem
like
5
Running
App
Files
Files
Community
5
488ef58
tlem
/
tasks.py
Commit History
fix cmmlu prompt
488ef58
facat
commited on
Dec 19, 2023
clean bbh
29eceda
facat
commited on
Dec 9, 2023
output in dataset
84e1d00
facat
commited on
Dec 4, 2023
update index
1395a53
facat
commited on
Dec 3, 2023
upd
c1cde4c
facat
commited on
Nov 30, 2023
update
72dba58
facat
commited on
Nov 30, 2023
update drop
5ca9a91
facat
commited on
Nov 30, 2023
fix logging
132574a
facat
commited on
Nov 30, 2023
update
c6f1343
facat
commited on
Nov 30, 2023
clean
0c75eca
facat
commited on
Nov 29, 2023
fix async
360e3ac
facat
commited on
Nov 28, 2023
fixup! fix task
0f420dd
facat
commited on
Nov 28, 2023
feat: async run
f21585c
facat
commited on
Nov 28, 2023
fix task
232b173
facat
commited on
Nov 28, 2023
update
f2c1a54
facat
commited on
Nov 28, 2023
!ref suite
3a8c0d0
facat
commited on
Nov 28, 2023
refactor
9827786
facat
commited on
Nov 28, 2023
update math
58f14b3
facat
commited on
Nov 27, 2023
fix dataset in task
76eab85
facat
commited on
Nov 27, 2023
add math
d13c0d8
facat
commited on
Nov 27, 2023
FIX: extraction func of C-Eval; logging metrics (
#3
)
25e4875
facat
Cookize
commited on
Nov 25, 2023
Add new benchmark (
#2
)
141ccb9
facat
Cookize
commited on
Nov 25, 2023
update mt_bench
845a45a
facat
commited on
Nov 24, 2023
update
33a6f85
facat
commited on
Nov 12, 2023
fix mmlu
9199665
facat
commited on
Nov 12, 2023
fix fewshot
075ef98
facat
commited on
Nov 12, 2023
verbose mode
a034e31
facat
commited on
Nov 12, 2023
add mmlu and cmmlu
be1543a
facat
commited on
Oct 29, 2023
upd
044ed98
facat
commited on
Sep 6, 2023
refactor
4c7982b
facat
commited on
Sep 6, 2023