DeepSeek-R1-Distill-Qwen-7B-GGUF / scores /deepseek-r1-distill-qwen-7b-q3_k_s.arc
eaddario's picture
Generate Perplexity, KLD, ARC, HellaSwag, MMLU, Truthful QA and WinoGrande scores
94d6d71 verified
raw
history blame
12.6 kB
common_init_from_params: setting dry_penalty_last_n to ctx_size = 768
common_init_from_params: warming up the model with an empty run - please wait ... (--no-warmup to disable)
system_info: n_threads = 6 (n_threads_batch = 6) / 12 | Metal : EMBED_LIBRARY = 1 | CPU : NEON = 1 | ARM_FMA = 1 | FP16_VA = 1 | DOTPROD = 1 | LLAMAFILE = 1 | ACCELERATE = 1 | AARCH64_REPACK = 1 |
multiple_choice_score: there are 869 tasks in prompt
multiple_choice_score: selecting 750 random tasks from 869 tasks available
multiple_choice_score: preparing task data...done
multiple_choice_score : calculating TruthfulQA score over 750 tasks.
task acc_norm
1 100.00000000
2 50.00000000
3 66.66666667
4 50.00000000
5 60.00000000
6 66.66666667
7 57.14285714
8 62.50000000
9 55.55555556
10 50.00000000
11 45.45454545
12 41.66666667
13 38.46153846
14 42.85714286
15 46.66666667
16 43.75000000
17 47.05882353
18 44.44444444
19 47.36842105
20 45.00000000
21 47.61904762
22 50.00000000
23 47.82608696
24 50.00000000
25 48.00000000
26 46.15384615
27 48.14814815
28 46.42857143
29 44.82758621
30 43.33333333
31 41.93548387
32 40.62500000
33 42.42424242
34 41.17647059
35 40.00000000
36 38.88888889
37 37.83783784
38 36.84210526
39 35.89743590
40 37.50000000
41 39.02439024
42 40.47619048
43 39.53488372
44 38.63636364
45 37.77777778
46 39.13043478
47 40.42553191
48 41.66666667
49 40.81632653
50 40.00000000
51 39.21568627
52 38.46153846
53 39.62264151
54 38.88888889
55 38.18181818
56 39.28571429
57 40.35087719
58 41.37931034
59 42.37288136
60 43.33333333
61 44.26229508
62 43.54838710
63 44.44444444
64 45.31250000
65 44.61538462
66 45.45454545
67 46.26865672
68 47.05882353
69 47.82608696
70 47.14285714
71 46.47887324
72 45.83333333
73 46.57534247
74 45.94594595
75 46.66666667
76 46.05263158
77 46.75324675
78 47.43589744
79 46.83544304
80 46.25000000
81 45.67901235
82 46.34146341
83 45.78313253
84 46.42857143
85 47.05882353
86 47.67441860
87 47.12643678
88 47.72727273
89 48.31460674
90 48.88888889
91 49.45054945
92 50.00000000
93 49.46236559
94 48.93617021
95 49.47368421
96 50.00000000
97 49.48453608
98 48.97959184
99 49.49494949
100 50.00000000
101 49.50495050
102 49.01960784
103 48.54368932
104 48.07692308
105 48.57142857
106 48.11320755
107 48.59813084
108 48.14814815
109 47.70642202
110 48.18181818
111 47.74774775
112 48.21428571
113 47.78761062
114 47.36842105
115 47.82608696
116 47.41379310
117 47.86324786
118 48.30508475
119 47.89915966
120 48.33333333
121 47.93388430
122 48.36065574
123 48.78048780
124 48.38709677
125 48.80000000
126 49.20634921
127 48.81889764
128 48.43750000
129 48.06201550
130 47.69230769
131 48.09160305
132 48.48484848
133 48.87218045
134 48.50746269
135 48.14814815
136 48.52941176
137 48.90510949
138 49.27536232
139 48.92086331
140 49.28571429
141 49.64539007
142 50.00000000
143 50.34965035
144 50.00000000
145 50.34482759
146 50.00000000
147 50.34013605
148 50.00000000
149 50.33557047
150 50.66666667
151 50.99337748
152 51.31578947
153 51.63398693
154 51.29870130
155 50.96774194
156 51.28205128
157 51.59235669
158 51.89873418
159 52.20125786
160 51.87500000
161 51.55279503
162 51.85185185
163 52.14723926
164 51.82926829
165 52.12121212
166 52.40963855
167 52.09580838
168 52.38095238
169 52.07100592
170 52.35294118
171 52.04678363
172 51.74418605
173 52.02312139
174 52.29885057
175 52.00000000
176 51.70454545
177 51.97740113
178 51.68539326
179 51.39664804
180 51.11111111
181 50.82872928
182 51.09890110
183 50.81967213
184 50.54347826
185 50.81081081
186 51.07526882
187 50.80213904
188 50.53191489
189 50.26455026
190 50.00000000
191 50.26178010
192 50.00000000
193 50.25906736
194 50.51546392
195 50.76923077
196 51.02040816
197 51.26903553
198 51.51515152
199 51.75879397
200 52.00000000
201 52.23880597
202 52.47524752
203 52.21674877
204 52.45098039
205 52.19512195
206 51.94174757
207 51.69082126
208 51.92307692
209 52.15311005
210 52.38095238
211 52.60663507
212 52.35849057
213 52.11267606
214 52.33644860
215 52.55813953
216 52.31481481
217 52.07373272
218 52.29357798
219 52.51141553
220 52.27272727
221 52.03619910
222 52.25225225
223 52.01793722
224 52.23214286
225 52.00000000
226 51.76991150
227 51.98237885
228 52.19298246
229 52.40174672
230 52.60869565
231 52.38095238
232 52.15517241
233 52.36051502
234 52.13675214
235 51.91489362
236 52.11864407
237 51.89873418
238 52.10084034
239 52.30125523
240 52.50000000
241 52.69709544
242 52.47933884
243 52.26337449
244 52.04918033
245 51.83673469
246 51.62601626
247 51.82186235
248 51.61290323
249 51.40562249
250 51.60000000
251 51.79282869
252 51.98412698
253 51.77865613
254 51.57480315
255 51.76470588
256 51.95312500
257 52.14007782
258 52.32558140
259 52.12355212
260 52.30769231
261 52.49042146
262 52.67175573
263 52.85171103
264 53.03030303
265 52.83018868
266 52.63157895
267 52.80898876
268 52.61194030
269 52.78810409
270 52.96296296
271 52.76752768
272 52.57352941
273 52.74725275
274 52.55474453
275 52.36363636
276 52.53623188
277 52.70758123
278 52.51798561
279 52.32974910
280 52.50000000
281 52.66903915
282 52.83687943
283 52.65017668
284 52.81690141
285 52.98245614
286 53.14685315
287 53.31010453
288 53.47222222
289 53.28719723
290 53.44827586
291 53.60824742
292 53.42465753
293 53.58361775
294 53.40136054
295 53.55932203
296 53.71621622
297 53.87205387
298 53.69127517
299 53.84615385
300 53.66666667
301 53.48837209
302 53.64238411
303 53.79537954
304 53.94736842
305 54.09836066
306 53.92156863
307 54.07166124
308 54.22077922
309 54.36893204
310 54.51612903
311 54.66237942
312 54.48717949
313 54.63258786
314 54.77707006
315 54.60317460
316 54.74683544
317 54.88958991
318 54.71698113
319 54.54545455
320 54.37500000
321 54.20560748
322 54.34782609
323 54.17956656
324 54.01234568
325 53.84615385
326 53.98773006
327 53.82262997
328 53.96341463
329 54.10334347
330 54.24242424
331 54.38066465
332 54.21686747
333 54.35435435
334 54.19161677
335 54.02985075
336 54.16666667
337 54.00593472
338 53.84615385
339 53.98230088
340 54.11764706
341 54.25219941
342 54.09356725
343 54.22740525
344 54.36046512
345 54.20289855
346 54.33526012
347 54.17867435
348 54.02298851
349 54.15472779
350 54.00000000
351 54.13105413
352 53.97727273
353 53.82436261
354 53.67231638
355 53.52112676
356 53.37078652
357 53.22128852
358 53.35195531
359 53.48189415
360 53.61111111
361 53.73961219
362 53.59116022
363 53.71900826
364 53.57142857
365 53.42465753
366 53.27868852
367 53.40599455
368 53.53260870
369 53.65853659
370 53.78378378
371 53.63881402
372 53.76344086
373 53.61930295
374 53.74331551
375 53.60000000
376 53.72340426
377 53.58090186
378 53.70370370
379 53.56200528
380 53.68421053
381 53.54330709
382 53.40314136
383 53.26370757
384 53.12500000
385 53.24675325
386 53.36787565
387 53.22997416
388 53.09278351
389 52.95629820
390 53.07692308
391 52.94117647
392 52.80612245
393 52.92620865
394 53.04568528
395 53.16455696
396 53.03030303
397 53.14861461
398 53.01507538
399 52.88220551
400 53.00000000
401 52.86783042
402 52.73631841
403 52.85359801
404 52.97029703
405 53.08641975
406 53.20197044
407 53.31695332
408 53.43137255
409 53.30073350
410 53.41463415
411 53.52798054
412 53.64077670
413 53.51089588
414 53.62318841
415 53.49397590
416 53.36538462
417 53.23741007
418 53.11004785
419 52.98329356
420 53.09523810
421 52.96912114
422 53.08056872
423 53.19148936
424 53.06603774
425 52.94117647
426 52.81690141
427 52.69320843
428 52.57009346
429 52.68065268
430 52.79069767
431 52.90023202
432 52.77777778
433 52.65588915
434 52.76497696
435 52.87356322
436 52.98165138
437 53.08924485
438 52.96803653
439 53.07517084
440 52.95454545
441 52.83446712
442 52.71493213
443 52.59593679
444 52.47747748
445 52.58426966
446 52.69058296
447 52.79642058
448 52.90178571
449 53.00668151
450 53.11111111
451 53.21507761
452 53.31858407
453 53.20088300
454 53.08370044
455 52.96703297
456 52.85087719
457 52.73522976
458 52.62008734
459 52.72331155
460 52.82608696
461 52.92841649
462 53.03030303
463 52.91576674
464 52.80172414
465 52.90322581
466 53.00429185
467 52.89079229
468 52.77777778
469 52.66524520
470 52.76595745
471 52.86624204
472 52.96610169
473 52.85412262
474 52.74261603
475 52.63157895
476 52.52100840
477 52.62054507
478 52.71966527
479 52.81837161
480 52.70833333
481 52.59875260
482 52.69709544
483 52.79503106
484 52.89256198
485 52.98969072
486 53.08641975
487 53.18275154
488 53.07377049
489 52.96523517
490 52.85714286
491 52.95315682
492 52.84552846
493 52.73833671
494 52.83400810
495 52.72727273
496 52.62096774
497 52.71629779
498 52.81124498
499 52.90581162
500 53.00000000
501 53.09381238
502 53.18725100
503 53.28031809
504 53.37301587
505 53.46534653
506 53.35968379
507 53.25443787
508 53.34645669
509 53.43811395
510 53.52941176
511 53.42465753
512 53.51562500
513 53.41130604
514 53.50194553
515 53.59223301
516 53.48837209
517 53.57833656
518 53.47490347
519 53.37186898
520 53.26923077
521 53.35892514
522 53.44827586
523 53.34608031
524 53.24427481
525 53.14285714
526 53.04182510
527 52.94117647
528 52.84090909
529 52.74102079
530 52.83018868
531 52.91902072
532 53.00751880
533 53.09568480
534 52.99625468
535 52.89719626
536 52.98507463
537 53.07262570
538 52.97397770
539 53.06122449
540 52.96296296
541 52.86506470
542 52.95202952
543 52.85451197
544 52.75735294
545 52.84403670
546 52.74725275
547 52.65082267
548 52.73722628
549 52.64116576
550 52.54545455
551 52.45009074
552 52.35507246
553 52.44122966
554 52.34657040
555 52.43243243
556 52.33812950
557 52.42369838
558 52.32974910
559 52.41502683
560 52.50000000
561 52.58467023
562 52.66903915
563 52.75310835
564 52.65957447
565 52.56637168
566 52.65017668
567 52.73368607
568 52.81690141
569 52.72407733
570 52.63157895
571 52.53940455
572 52.44755245
573 52.53054101
574 52.61324042
575 52.69565217
576 52.77777778
577 52.85961872
578 52.76816609
579 52.84974093
580 52.93103448
581 52.83993115
582 52.74914089
583 52.83018868
584 52.91095890
585 52.99145299
586 53.07167235
587 52.98126065
588 53.06122449
589 53.14091681
590 53.22033898
591 53.29949239
592 53.37837838
593 53.28836425
594 53.36700337
595 53.44537815
596 53.35570470
597 53.26633166
598 53.17725753
599 53.25542571
600 53.16666667
601 53.24459235
602 53.15614618
603 53.06799337
604 52.98013245
605 53.05785124
606 53.13531353
607 53.04777595
608 53.12500000
609 53.03776683
610 53.11475410
611 53.02782324
612 52.94117647
613 52.85481240
614 52.76872964
615 52.84552846
616 52.92207792
617 52.99837925
618 52.91262136
619 52.82714055
620 52.90322581
621 52.81803543
622 52.73311897
623 52.80898876
624 52.72435897
625 52.64000000
626 52.55591054
627 52.63157895
628 52.70700637
629 52.78219396
630 52.69841270
631 52.61489699
632 52.53164557
633 52.44865719
634 52.36593060
635 52.44094488
636 52.51572327
637 52.43328100
638 52.50783699
639 52.42566510
640 52.34375000
641 52.41809672
642 52.33644860
643 52.25505443
644 52.32919255
645 52.24806202
646 52.16718266
647 52.24111283
648 52.16049383
649 52.08012327
650 52.00000000
651 52.07373272
652 51.99386503
653 51.91424196
654 51.98776758
655 51.90839695
656 51.82926829
657 51.75038052
658 51.82370821
659 51.74506829
660 51.81818182
661 51.73978820
662 51.81268882
663 51.88536953
664 51.95783133
665 52.03007519
666 52.10210210
667 52.17391304
668 52.09580838
669 52.01793722
670 52.08955224
671 52.01192250
672 51.93452381
673 51.85735513
674 51.78041543
675 51.70370370
676 51.62721893
677 51.69867061
678 51.76991150
679 51.69366716
680 51.76470588
681 51.68869310
682 51.75953079
683 51.68374817
684 51.75438596
685 51.82481752
686 51.74927114
687 51.67394469
688 51.74418605
689 51.81422351
690 51.88405797
691 51.80897250
692 51.87861272
693 51.80375180
694 51.87319885
695 51.79856115
696 51.72413793
697 51.79340029
698 51.71919771
699 51.64520744
700 51.71428571
701 51.64051355
702 51.70940171
703 51.63584637
704 51.70454545
705 51.63120567
706 51.55807365
707 51.62659123
708 51.69491525
709 51.76304654
710 51.83098592
711 51.89873418
712 51.96629213
713 52.03366059
714 51.96078431
715 51.88811189
716 51.95530726
717 51.88284519
718 51.81058496
719 51.73852573
720 51.80555556
721 51.73370319
722 51.66204986
723 51.72890733
724 51.79558011
725 51.86206897
726 51.79063361
727 51.71939477
728 51.78571429
729 51.85185185
730 51.78082192
731 51.84678523
732 51.91256831
733 51.84174625
734 51.90735695
735 51.83673469
736 51.90217391
737 51.83175034
738 51.76151762
739 51.69147497
740 51.62162162
741 51.68690958
742 51.75202156
743 51.81695828
744 51.74731183
745 51.81208054
746 51.74262735
747 51.67336011
748 51.60427807
749 51.53538051
750 51.60000000
Final result: 51.6000 ±1.8260
Random chance: 25.0083 ±1.5824