Corey Morris
CoreyMorris
AI & ML interests
AI Safety
Recent Activity
upvoted
a
collection
about 1 month ago
PixMo
upvoted
a
paper
4 months ago
ARES: An Automated Evaluation Framework for Retrieval-Augmented
Generation Systems
liked
a Space
8 months ago
bigcode/bigcode-models-leaderboard
Organizations
None yet
CoreyMorris's activity
Where are the evaluation metrics originated?
1
#4 opened about 1 year ago
by
zhiminy
Is there a way to get the flair / classification ?
3
#2 opened about 1 year ago
by
CoreyMorris
Duplicates in the dataset
#1 opened about 1 year ago
by
CoreyMorris
Remove or modify section related to MMLU's moral scenarios task
1
#3 opened over 1 year ago
by
CoreyMorris
Support for multi column filtering using comma seperated values
2
#2 opened over 1 year ago
by
imdatta0
💬 Discussion thread: Model scores and model performances 💬
71
#265 opened over 1 year ago
by
clefourrier
[FLAG] gaodrew/gaodrew-gorgonzola-13b . Suspected to have MMLU in training data
11
#215 opened over 1 year ago
by
CoreyMorris
Dataset for models confirmed to have training data contaminated with evaluation data
2
#214 opened over 1 year ago
by
CoreyMorris
🚩 Report
3
#1 opened over 1 year ago
by
CoreyMorris
MMLU by task leaderboard
3
#173 opened over 1 year ago
by
CoreyMorris
Disaggregated Data
10
#73 opened over 1 year ago
by
alexpeys
Disaggregated Data
10
#73 opened over 1 year ago
by
alexpeys
Disaggregated Data
10
#73 opened over 1 year ago
by
alexpeys