Commit History

[ADD] ENV variables for private leaderboard
237c120

tathagataraha commited on

[MODIFY] Submission viewing
46f69ad

tathagataraha commited on

Merge branch 'main' of https://huggingface.co./spaces/m42-health/MEDIC-Benchmark
934a25d

tathagataraha commited on

Update src/about.py
a71f0d3
verified

cchristophe commited on

Update src/about.py
011baf1
verified

cchristophe commited on

Merge branch 'main' of https://huggingface.co./spaces/m42-health/MEDIC-Benchmark
fedd68b

tathagataraha commited on

Update src/about.py
0da091e
verified

cchristophe commited on

Update src/about.py
b78ec05
verified

cchristophe commited on

[ADD] Dataset descriptions for cross-examination framework
5c80286

tathagataraha commited on

[MODIFY] Column descriptions for the cross examination framework
23fd02c

tathagataraha commited on

[ADD] CI intervals for med-safety
ba515db

tathagataraha commited on

[ADD] CSS for logo and MEDIC 5-pillar diagram
85b4142

tathagataraha commited on

[ADD] MEDIC 5-pillar diagram
ef49d36

tathagataraha commited on

Merge branch 'main' of https://huggingface.co./spaces/m42-health/MEDIC-Benchmark
32eaa7c

tathagataraha commited on

Update src/about.py
a111e91
verified

cchristophe commited on

[MODIFY] Med-Safety: Average -> Harmfulness Score
2a7ac72

tathagataraha commited on

[MODIFY] Metrics for medical summarization, aci bench and soap notes
c92b14d

tathagataraha commited on

Merge branch 'main' of https://huggingface.co./spaces/m42-health/MEDIC-Benchmark
7d6aad6

tathagataraha commited on

Update src/about.py
818cb65
verified

cchristophe commited on

Update src/about.py
8c295d9
verified

cchristophe commited on

[MODIFY] Cross-evaluation framework column names
faceee1

tathagataraha commited on

[ADD] Cross-examination framework
553b217

tathagataraha commited on

Merge branch 'main' of https://huggingface.co./spaces/m42-health/MEDIC-Benchmark
8b771ed

tathagataraha commited on

Update src/about.py
9e77e60
verified

cchristophe commited on

[ADD] Open-ended evaluation
0da5ee3

tathagataraha commited on

Merge branch 'main' of https://huggingface.co./spaces/m42-health/MEDIC-Benchmark
b5701cc

tathagataraha commited on

[FIX] handled cases where one of the results are not present
34c150d

tathagataraha commited on

Update src/about.py
a2d8d52
verified

cchristophe commited on

[MODIFY] Added support for other frameworks in submit, evaluation queue and harness results displau
d86ca68

tathagataraha commited on

Update about.py
dfd63f4
verified

cchristophe commited on

[ADD]Model submission guide and citation
27e5b96

tathagataraha commited on

[FIX] Filters and search
d8147b8

tathagataraha commited on

[FIX] Preference tuned model symbol
e1cdc4b

tathagataraha commited on

[ADD] Auto Precision for loading directly from model
671e1a6

tathagataraha commited on

[REMOVED] Average column
c63935d

tathagataraha commited on

[ADD] Support for slurm id
8a76c2c

tathagataraha commited on

[ADD] Submit form, upload requests to requests dataset
b3eff40

tathagataraha commited on

[ADD] Harness tasks, data display
09b313f

tathagataraha commited on