Llama-Guard-3-8B-GGUF / scores /Llama-Guard-3-8B-q3_k_l.arc
eaddario's picture
Generate Perplexity, KLD, ARC, HellaSwag, MMLU, Truthful QA and WinoGrande scores
6de6927 verified
common_init_from_params: setting dry_penalty_last_n to ctx_size = 768
common_init_from_params: warming up the model with an empty run - please wait ... (--no-warmup to disable)
system_info: n_threads = 6 (n_threads_batch = 6) / 12 | Metal : EMBED_LIBRARY = 1 | CPU : NEON = 1 | ARM_FMA = 1 | FP16_VA = 1 | DOTPROD = 1 | LLAMAFILE = 1 | ACCELERATE = 1 | AARCH64_REPACK = 1 |
multiple_choice_score: there are 869 tasks in prompt
multiple_choice_score: selecting 750 random tasks from 869 tasks available
multiple_choice_score: preparing task data...done
multiple_choice_score : calculating TruthfulQA score over 750 tasks.
task acc_norm
1 100.00000000
2 100.00000000
3 66.66666667
4 50.00000000
5 60.00000000
6 66.66666667
7 71.42857143
8 75.00000000
9 66.66666667
10 70.00000000
11 63.63636364
12 66.66666667
13 69.23076923
14 71.42857143
15 73.33333333
16 75.00000000
17 76.47058824
18 72.22222222
19 73.68421053
20 70.00000000
21 71.42857143
22 72.72727273
23 69.56521739
24 70.83333333
25 68.00000000
26 65.38461538
27 66.66666667
28 67.85714286
29 68.96551724
30 70.00000000
31 67.74193548
32 68.75000000
33 66.66666667
34 64.70588235
35 62.85714286
36 61.11111111
37 62.16216216
38 63.15789474
39 64.10256410
40 65.00000000
41 65.85365854
42 64.28571429
43 62.79069767
44 61.36363636
45 62.22222222
46 63.04347826
47 63.82978723
48 62.50000000
49 63.26530612
50 62.00000000
51 60.78431373
52 61.53846154
53 62.26415094
54 61.11111111
55 60.00000000
56 60.71428571
57 61.40350877
58 62.06896552
59 61.01694915
60 61.66666667
61 62.29508197
62 61.29032258
63 61.90476190
64 62.50000000
65 61.53846154
66 62.12121212
67 62.68656716
68 63.23529412
69 63.76811594
70 62.85714286
71 61.97183099
72 61.11111111
73 61.64383562
74 62.16216216
75 62.66666667
76 61.84210526
77 61.03896104
78 61.53846154
79 60.75949367
80 61.25000000
81 60.49382716
82 60.97560976
83 60.24096386
84 60.71428571
85 60.00000000
86 60.46511628
87 59.77011494
88 60.22727273
89 60.67415730
90 61.11111111
91 61.53846154
92 61.95652174
93 61.29032258
94 60.63829787
95 61.05263158
96 61.45833333
97 61.85567010
98 62.24489796
99 62.62626263
100 63.00000000
101 62.37623762
102 61.76470588
103 61.16504854
104 60.57692308
105 60.95238095
106 60.37735849
107 60.74766355
108 60.18518519
109 59.63302752
110 60.00000000
111 60.36036036
112 60.71428571
113 61.06194690
114 61.40350877
115 61.73913043
116 61.20689655
117 61.53846154
118 61.86440678
119 61.34453782
120 60.83333333
121 60.33057851
122 60.65573770
123 60.97560976
124 60.48387097
125 60.80000000
126 61.11111111
127 61.41732283
128 61.71875000
129 61.24031008
130 60.76923077
131 61.06870229
132 61.36363636
133 61.65413534
134 61.19402985
135 60.74074074
136 61.02941176
137 61.31386861
138 61.59420290
139 61.87050360
140 62.14285714
141 62.41134752
142 62.67605634
143 62.93706294
144 63.19444444
145 63.44827586
146 63.01369863
147 62.58503401
148 62.16216216
149 62.41610738
150 62.66666667
151 62.25165563
152 62.50000000
153 62.74509804
154 62.33766234
155 61.93548387
156 62.17948718
157 61.78343949
158 61.39240506
159 61.63522013
160 61.87500000
161 62.11180124
162 62.34567901
163 62.57668712
164 62.19512195
165 61.81818182
166 62.04819277
167 61.67664671
168 61.90476190
169 61.53846154
170 61.76470588
171 61.98830409
172 62.20930233
173 62.42774566
174 62.64367816
175 62.28571429
176 62.50000000
177 62.71186441
178 62.35955056
179 62.56983240
180 62.77777778
181 62.43093923
182 62.63736264
183 62.29508197
184 61.95652174
185 62.16216216
186 62.36559140
187 62.56684492
188 62.76595745
189 62.43386243
190 62.10526316
191 62.30366492
192 62.50000000
193 62.17616580
194 62.37113402
195 62.56410256
196 62.75510204
197 62.94416244
198 63.13131313
199 62.81407035
200 63.00000000
201 63.18407960
202 63.36633663
203 63.54679803
204 63.72549020
205 63.41463415
206 63.10679612
207 62.80193237
208 62.98076923
209 63.15789474
210 63.33333333
211 63.50710900
212 63.20754717
213 63.38028169
214 63.55140187
215 63.72093023
216 63.88888889
217 63.59447005
218 63.76146789
219 63.92694064
220 63.63636364
221 63.34841629
222 63.51351351
223 63.22869955
224 62.94642857
225 62.66666667
226 62.38938053
227 62.55506608
228 62.71929825
229 62.88209607
230 63.04347826
231 62.77056277
232 62.50000000
233 62.66094421
234 62.82051282
235 62.55319149
236 62.71186441
237 62.86919831
238 63.02521008
239 63.17991632
240 63.33333333
241 63.48547718
242 63.63636364
243 63.78600823
244 63.52459016
245 63.67346939
246 63.82113821
247 63.96761134
248 64.11290323
249 63.85542169
250 64.00000000
251 64.14342629
252 64.28571429
253 64.42687747
254 64.17322835
255 64.31372549
256 64.45312500
257 64.20233463
258 64.34108527
259 64.09266409
260 64.23076923
261 64.36781609
262 64.50381679
263 64.63878327
264 64.77272727
265 64.52830189
266 64.28571429
267 64.41947566
268 64.55223881
269 64.68401487
270 64.81481481
271 64.94464945
272 64.70588235
273 64.46886447
274 64.23357664
275 64.00000000
276 64.13043478
277 64.25992780
278 64.02877698
279 64.15770609
280 64.28571429
281 64.41281139
282 64.53900709
283 64.31095406
284 64.43661972
285 64.56140351
286 64.68531469
287 64.80836237
288 64.93055556
289 64.70588235
290 64.82758621
291 64.94845361
292 65.06849315
293 65.18771331
294 64.96598639
295 65.08474576
296 65.20270270
297 65.31986532
298 65.43624161
299 65.55183946
300 65.66666667
301 65.44850498
302 65.56291391
303 65.67656766
304 65.78947368
305 65.57377049
306 65.35947712
307 65.47231270
308 65.58441558
309 65.69579288
310 65.80645161
311 65.91639871
312 66.02564103
313 66.13418530
314 65.92356688
315 65.71428571
316 65.82278481
317 65.93059937
318 65.72327044
319 65.51724138
320 65.31250000
321 65.42056075
322 65.52795031
323 65.63467492
324 65.43209877
325 65.53846154
326 65.64417178
327 65.44342508
328 65.54878049
329 65.65349544
330 65.45454545
331 65.55891239
332 65.66265060
333 65.76576577
334 65.56886228
335 65.37313433
336 65.17857143
337 64.98516320
338 64.79289941
339 64.89675516
340 65.00000000
341 64.80938416
342 64.61988304
343 64.72303207
344 64.82558140
345 64.63768116
346 64.45086705
347 64.26512968
348 64.08045977
349 64.18338109
350 64.28571429
351 64.10256410
352 64.20454545
353 64.30594901
354 64.12429379
355 63.94366197
356 64.04494382
357 64.14565826
358 64.24581006
359 64.06685237
360 64.16666667
361 64.26592798
362 64.36464088
363 64.18732782
364 64.01098901
365 64.10958904
366 63.93442623
367 64.03269755
368 64.13043478
369 64.22764228
370 64.32432432
371 64.42048518
372 64.51612903
373 64.61126005
374 64.70588235
375 64.53333333
376 64.62765957
377 64.45623342
378 64.55026455
379 64.37994723
380 64.47368421
381 64.56692913
382 64.39790576
383 64.22976501
384 64.06250000
385 63.89610390
386 63.98963731
387 63.82428941
388 63.65979381
389 63.49614396
390 63.33333333
391 63.17135550
392 63.26530612
393 63.35877863
394 63.45177665
395 63.54430380
396 63.38383838
397 63.22418136
398 63.06532663
399 63.15789474
400 63.25000000
401 63.09226933
402 62.93532338
403 63.02729529
404 63.11881188
405 63.20987654
406 63.30049261
407 63.39066339
408 63.48039216
409 63.32518337
410 63.41463415
411 63.50364964
412 63.59223301
413 63.68038741
414 63.76811594
415 63.61445783
416 63.70192308
417 63.78896882
418 63.87559809
419 63.72315036
420 63.80952381
421 63.65795724
422 63.50710900
423 63.59338061
424 63.44339623
425 63.52941176
426 63.61502347
427 63.70023419
428 63.55140187
429 63.63636364
430 63.72093023
431 63.80510441
432 63.88888889
433 63.74133949
434 63.82488479
435 63.90804598
436 63.99082569
437 64.07322654
438 64.15525114
439 64.23690205
440 64.09090909
441 64.17233560
442 64.02714932
443 63.88261851
444 63.73873874
445 63.82022472
446 63.90134529
447 63.98210291
448 64.06250000
449 64.14253898
450 64.00000000
451 64.07982262
452 64.15929204
453 64.23841060
454 64.09691630
455 63.95604396
456 64.03508772
457 64.11378556
458 64.19213974
459 64.27015251
460 64.34782609
461 64.42516269
462 64.50216450
463 64.36285097
464 64.22413793
465 64.30107527
466 64.37768240
467 64.45396146
468 64.52991453
469 64.39232409
470 64.46808511
471 64.54352442
472 64.61864407
473 64.48202960
474 64.55696203
475 64.63157895
476 64.49579832
477 64.57023061
478 64.64435146
479 64.71816284
480 64.79166667
481 64.65696466
482 64.73029046
483 64.80331263
484 64.87603306
485 64.94845361
486 65.02057613
487 64.88706366
488 64.95901639
489 65.03067485
490 64.89795918
491 64.76578411
492 64.83739837
493 64.70588235
494 64.77732794
495 64.64646465
496 64.71774194
497 64.78873239
498 64.85943775
499 64.92985972
500 65.00000000
501 64.87025948
502 64.94023904
503 65.00994036
504 65.07936508
505 65.14851485
506 65.01976285
507 64.89151874
508 64.96062992
509 64.83300589
510 64.90196078
511 64.77495108
512 64.64843750
513 64.52241715
514 64.59143969
515 64.46601942
516 64.34108527
517 64.41005803
518 64.47876448
519 64.54720617
520 64.42307692
521 64.49136276
522 64.55938697
523 64.43594646
524 64.50381679
525 64.38095238
526 64.25855513
527 64.13662239
528 64.01515152
529 63.89413989
530 63.96226415
531 64.03013183
532 64.09774436
533 64.16510319
534 64.04494382
535 63.92523364
536 63.99253731
537 64.05959032
538 63.94052045
539 64.00742115
540 63.88888889
541 63.95563771
542 64.02214022
543 64.08839779
544 64.15441176
545 64.22018349
546 64.10256410
547 63.98537477
548 64.05109489
549 63.93442623
550 64.00000000
551 63.88384755
552 63.76811594
553 63.83363472
554 63.89891697
555 63.96396396
556 63.84892086
557 63.91382406
558 63.79928315
559 63.86404293
560 63.92857143
561 63.99286988
562 64.05693950
563 64.12078153
564 64.00709220
565 63.89380531
566 63.95759717
567 63.84479718
568 63.90845070
569 63.79613357
570 63.68421053
571 63.74781086
572 63.63636364
573 63.52530541
574 63.58885017
575 63.47826087
576 63.54166667
577 63.60485269
578 63.66782007
579 63.73056995
580 63.62068966
581 63.68330465
582 63.57388316
583 63.63636364
584 63.69863014
585 63.76068376
586 63.65187713
587 63.71379898
588 63.77551020
589 63.83701188
590 63.89830508
591 63.95939086
592 64.02027027
593 64.08094435
594 64.14141414
595 64.20168067
596 64.09395973
597 63.98659966
598 63.87959866
599 63.93989983
600 63.83333333
601 63.89351082
602 63.78737542
603 63.68159204
604 63.74172185
605 63.80165289
606 63.86138614
607 63.92092257
608 63.98026316
609 63.87520525
610 63.93442623
611 63.82978723
612 63.88888889
613 63.78466558
614 63.84364821
615 63.90243902
616 63.79870130
617 63.85737439
618 63.75404531
619 63.65105008
620 63.70967742
621 63.60708535
622 63.50482315
623 63.56340289
624 63.62179487
625 63.52000000
626 63.41853035
627 63.47687400
628 63.53503185
629 63.43402226
630 63.49206349
631 63.54992076
632 63.44936709
633 63.34913112
634 63.40694006
635 63.46456693
636 63.52201258
637 63.42229199
638 63.47962382
639 63.53677621
640 63.43750000
641 63.49453978
642 63.39563863
643 63.45256610
644 63.50931677
645 63.41085271
646 63.46749226
647 63.52395672
648 63.42592593
649 63.32819723
650 63.38461538
651 63.44086022
652 63.49693252
653 63.39969372
654 63.30275229
655 63.20610687
656 63.10975610
657 63.01369863
658 63.06990881
659 62.97420334
660 63.03030303
661 63.08623298
662 63.14199396
663 63.19758673
664 63.25301205
665 63.30827068
666 63.21321321
667 63.26836582
668 63.17365269
669 63.07922272
670 63.13432836
671 63.04023845
672 63.09523810
673 63.00148588
674 62.90801187
675 62.81481481
676 62.72189349
677 62.77695716
678 62.83185841
679 62.88659794
680 62.94117647
681 62.84875184
682 62.75659824
683 62.66471449
684 62.57309942
685 62.62773723
686 62.53644315
687 62.44541485
688 62.50000000
689 62.40928882
690 62.46376812
691 62.51808973
692 62.57225434
693 62.62626263
694 62.53602305
695 62.44604317
696 62.35632184
697 62.41032999
698 62.32091691
699 62.23175966
700 62.28571429
701 62.33951498
702 62.39316239
703 62.44665718
704 62.50000000
705 62.41134752
706 62.46458924
707 62.51768034
708 62.57062147
709 62.62341326
710 62.67605634
711 62.72855134
712 62.78089888
713 62.83309958
714 62.74509804
715 62.65734266
716 62.70949721
717 62.62203626
718 62.53481894
719 62.44784423
720 62.50000000
721 62.55201110
722 62.60387812
723 62.65560166
724 62.56906077
725 62.62068966
726 62.53443526
727 62.44841816
728 62.50000000
729 62.55144033
730 62.60273973
731 62.65389877
732 62.70491803
733 62.75579809
734 62.80653951
735 62.72108844
736 62.77173913
737 62.68656716
738 62.73712737
739 62.78755074
740 62.83783784
741 62.88798920
742 62.93800539
743 62.98788694
744 62.90322581
745 62.95302013
746 63.00268097
747 62.91834003
748 62.83422460
749 62.75033378
750 62.80000000
Final result: 62.8000 +/- 1.7661
Random chance: 25.0083 +/- 1.5824