license: apache-2.0

palmer-004

cubby outperforms every sota model below 600m parameters on the benchmark average, outperforms base 1b models, and is competitive with the best ones.

benchmarks

zero-shot evaluations performed on current sota ~0.5b models and palmer-004.

| Parameters | Model | MMLU | ARC-C | HellaSwag | PIQA | Winogrande | Average |
|---|---|---|---|---|---|---|---|
| 0.5b | qwen2 | 0.4413 | 0.2892 | 0.4905 | 0.6931 | 0.5699 | 0.4968 |
| 0.5b | palmer-004-turbo | 0.2736 | 0.3558 | 0.6179 | 0.7367 | 0.6117 | 0.5191 |
| 1.1b | palmer-004 | 0.2661 | 0.3490 | 0.6173 | 0.7481 | 0.6417 | 0.5244 |
| 0.5b | cubby | 0.2617 | 0.3729 | 0.6288 | 0.7437 | 0.6227 | 0.5260 |
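
the average column appears to be the unweighted mean of the five task scores. a minimal sketch to recompute it, assuming that is how it was derived (the `scores` dict and the `average` helper are illustrative names, not part of any evaluation tool):

```python
# scores copied from the table above, in the order:
# MMLU, ARC-C, HellaSwag, PIQA, Winogrande
scores = {
    "qwen2": [0.4413, 0.2892, 0.4905, 0.6931, 0.5699],
    "palmer-004-turbo": [0.2736, 0.3558, 0.6179, 0.7367, 0.6117],
    "palmer-004": [0.2661, 0.3490, 0.6173, 0.7481, 0.6417],
    "cubby": [0.2617, 0.3729, 0.6288, 0.7437, 0.6227],
}


def average(values):
    """Unweighted mean of a list of task scores (assumed averaging method)."""
    return sum(values) / len(values)


# round to four decimals to match the precision used in the table
averages = {model: round(average(v), 4) for model, v in scores.items()}

# print models from lowest to highest average
for model, avg in sorted(averages.items(), key=lambda kv: kv[1]):
    print(f"{model}: {avg:.4f}")
```

under this assumption the recomputed values match the average column to four decimal places.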

supporters

Buy Me A Coffee