File size: 12,600 Bytes
6de6927 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 288 289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 317 318 319 320 321 322 323 324 325 326 327 328 329 330 331 332 333 334 335 336 337 338 339 340 341 342 343 344 345 346 347 348 349 350 351 352 353 354 355 356 357 358 359 360 361 362 363 364 365 366 367 368 369 370 371 372 373 374 375 376 377 378 379 380 381 382 383 384 385 386 387 388 389 390 391 392 393 394 395 396 397 398 399 400 401 402 403 404 405 406 407 408 409 410 411 412 413 414 415 416 417 418 419 420 421 422 423 424 425 426 427 428 429 430 431 432 433 434 435 436 437 438 439 440 441 442 443 444 445 446 447 448 449 450 451 452 453 454 455 456 457 458 459 460 461 462 463 464 465 466 467 468 469 470 471 472 473 474 475 476 477 478 479 480 481 482 483 484 485 486 487 488 489 490 491 492 493 494 495 496 497 498 499 500 501 502 503 504 505 506 507 508 509 510 511 512 513 514 515 516 517 518 519 520 521 522 523 524 525 526 527 528 529 530 531 532 533 534 535 536 537 538 539 540 541 542 543 544 545 546 547 548 549 550 551 552 553 554 555 556 557 558 559 560 561 562 563 564 565 566 567 568 569 570 571 572 573 574 575 576 577 578 579 580 581 582 583 584 585 586 587 588 589 590 591 592 593 594 595 596 597 598 599 600 601 602 603 604 605 606 607 608 609 610 611 612 613 614 615 616 617 618 619 620 621 622 623 624 625 626 627 628 629 630 631 632 633 634 635 636 637 638 639 640 641 642 643 644 645 646 647 648 649 650 651 652 653 654 655 656 657 658 659 660 661 662 663 664 665 666 667 668 669 670 671 672 673 674 675 676 677 678 679 680 681 682 683 684 685 686 687 688 689 690 691 692 693 694 695 696 697 698 699 700 701 702 703 704 705 706 707 708 709 710 711 712 713 714 715 716 717 718 719 720 721 722 723 724 725 726 727 728 729 730 731 732 733 734 735 736 737 738 739 740 741 742 743 744 745 746 747 748 749 750 751 752 753 754 755 756 757 758 759 760 761 762 763 764 |
common_init_from_params: setting dry_penalty_last_n to ctx_size = 768
common_init_from_params: warming up the model with an empty run - please wait ... (--no-warmup to disable)
system_info: n_threads = 6 (n_threads_batch = 6) / 12 | Metal : EMBED_LIBRARY = 1 | CPU : NEON = 1 | ARM_FMA = 1 | FP16_VA = 1 | DOTPROD = 1 | LLAMAFILE = 1 | ACCELERATE = 1 | AARCH64_REPACK = 1 |
multiple_choice_score: there are 1548 tasks in prompt
multiple_choice_score: selecting 750 random tasks from 1548 tasks available
multiple_choice_score: preparing task data...done
multiple_choice_score : calculating TruthfulQA score over 750 tasks.
task acc_norm
1 100.00000000
2 50.00000000
3 33.33333333
4 50.00000000
5 40.00000000
6 33.33333333
7 42.85714286
8 50.00000000
9 44.44444444
10 50.00000000
11 45.45454545
12 41.66666667
13 38.46153846
14 35.71428571
15 40.00000000
16 43.75000000
17 47.05882353
18 44.44444444
19 47.36842105
20 45.00000000
21 47.61904762
22 45.45454545
23 43.47826087
24 45.83333333
25 48.00000000
26 46.15384615
27 44.44444444
28 42.85714286
29 44.82758621
30 46.66666667
31 48.38709677
32 50.00000000
33 48.48484848
34 50.00000000
35 51.42857143
36 52.77777778
37 51.35135135
38 52.63157895
39 51.28205128
40 52.50000000
41 51.21951220
42 50.00000000
43 51.16279070
44 50.00000000
45 51.11111111
46 50.00000000
47 48.93617021
48 50.00000000
49 48.97959184
50 48.00000000
51 47.05882353
52 48.07692308
53 47.16981132
54 46.29629630
55 47.27272727
56 48.21428571
57 49.12280702
58 50.00000000
59 49.15254237
60 48.33333333
61 49.18032787
62 48.38709677
63 47.61904762
64 46.87500000
65 46.15384615
66 45.45454545
67 44.77611940
68 44.11764706
69 43.47826087
70 44.28571429
71 45.07042254
72 44.44444444
73 43.83561644
74 44.59459459
75 45.33333333
76 46.05263158
77 46.75324675
78 47.43589744
79 46.83544304
80 47.50000000
81 48.14814815
82 47.56097561
83 48.19277108
84 47.61904762
85 47.05882353
86 46.51162791
87 47.12643678
88 47.72727273
89 47.19101124
90 46.66666667
91 46.15384615
92 45.65217391
93 45.16129032
94 44.68085106
95 44.21052632
96 44.79166667
97 44.32989691
98 43.87755102
99 43.43434343
100 43.00000000
101 43.56435644
102 43.13725490
103 42.71844660
104 43.26923077
105 43.80952381
106 44.33962264
107 43.92523364
108 43.51851852
109 44.03669725
110 44.54545455
111 44.14414414
112 43.75000000
113 43.36283186
114 43.85964912
115 44.34782609
116 43.96551724
117 44.44444444
118 44.91525424
119 44.53781513
120 44.16666667
121 44.62809917
122 44.26229508
123 43.90243902
124 43.54838710
125 43.20000000
126 42.85714286
127 42.51968504
128 42.18750000
129 41.86046512
130 41.53846154
131 41.98473282
132 41.66666667
133 41.35338346
134 41.04477612
135 40.74074074
136 40.44117647
137 40.14598540
138 39.85507246
139 40.28776978
140 40.71428571
141 40.42553191
142 40.84507042
143 41.25874126
144 40.97222222
145 41.37931034
146 41.78082192
147 41.49659864
148 41.89189189
149 41.61073826
150 41.33333333
151 41.05960265
152 41.44736842
153 41.17647059
154 40.90909091
155 40.64516129
156 40.38461538
157 40.76433121
158 41.13924051
159 40.88050314
160 41.25000000
161 40.99378882
162 40.74074074
163 40.49079755
164 40.24390244
165 40.60606061
166 40.36144578
167 40.71856287
168 40.47619048
169 40.82840237
170 40.58823529
171 40.93567251
172 40.69767442
173 40.46242775
174 40.22988506
175 40.00000000
176 39.77272727
177 39.54802260
178 39.32584270
179 39.66480447
180 40.00000000
181 40.33149171
182 40.10989011
183 40.43715847
184 40.21739130
185 40.54054054
186 40.86021505
187 41.17647059
188 41.48936170
189 41.79894180
190 41.57894737
191 41.88481675
192 41.66666667
193 41.45077720
194 41.75257732
195 42.05128205
196 42.34693878
197 42.13197970
198 42.42424242
199 42.21105528
200 42.00000000
201 41.79104478
202 41.58415842
203 41.37931034
204 41.17647059
205 41.46341463
206 41.26213592
207 41.54589372
208 41.34615385
209 41.14832536
210 40.95238095
211 41.23222749
212 41.03773585
213 40.84507042
214 40.65420561
215 40.46511628
216 40.74074074
217 40.55299539
218 40.82568807
219 41.09589041
220 41.36363636
221 41.17647059
222 41.44144144
223 41.70403587
224 41.51785714
225 41.77777778
226 41.59292035
227 41.40969163
228 41.66666667
229 41.92139738
230 41.73913043
231 41.55844156
232 41.81034483
233 41.63090129
234 41.88034188
235 41.70212766
236 41.52542373
237 41.35021097
238 41.17647059
239 41.00418410
240 41.25000000
241 41.07883817
242 41.32231405
243 41.56378601
244 41.39344262
245 41.63265306
246 41.46341463
247 41.29554656
248 41.53225806
249 41.36546185
250 41.20000000
251 41.03585657
252 40.87301587
253 40.71146245
254 40.55118110
255 40.39215686
256 40.62500000
257 40.85603113
258 40.69767442
259 40.54054054
260 40.38461538
261 40.61302682
262 40.45801527
263 40.30418251
264 40.53030303
265 40.75471698
266 40.97744361
267 40.82397004
268 40.67164179
269 40.89219331
270 40.74074074
271 40.59040590
272 40.44117647
273 40.29304029
274 40.14598540
275 40.00000000
276 40.21739130
277 40.43321300
278 40.28776978
279 40.14336918
280 40.35714286
281 40.21352313
282 40.42553191
283 40.28268551
284 40.14084507
285 40.00000000
286 39.86013986
287 40.06968641
288 39.93055556
289 39.79238754
290 40.00000000
291 40.20618557
292 40.41095890
293 40.27303754
294 40.13605442
295 40.00000000
296 39.86486486
297 39.73063973
298 39.93288591
299 39.79933110
300 40.00000000
301 39.86710963
302 39.73509934
303 39.60396040
304 39.47368421
305 39.67213115
306 39.54248366
307 39.73941368
308 39.61038961
309 39.48220065
310 39.35483871
311 39.22829582
312 39.42307692
313 39.29712460
314 39.17197452
315 39.04761905
316 38.92405063
317 38.80126183
318 38.99371069
319 39.18495298
320 39.06250000
321 38.94080997
322 39.13043478
323 39.00928793
324 38.88888889
325 39.07692308
326 38.95705521
327 39.14373089
328 39.02439024
329 38.90577508
330 38.78787879
331 38.67069486
332 38.55421687
333 38.43843844
334 38.32335329
335 38.50746269
336 38.69047619
337 38.57566766
338 38.46153846
339 38.34808260
340 38.23529412
341 38.41642229
342 38.59649123
343 38.48396501
344 38.66279070
345 38.55072464
346 38.72832370
347 38.90489914
348 38.79310345
349 38.96848138
350 39.14285714
351 39.03133903
352 39.20454545
353 39.37677054
354 39.26553672
355 39.15492958
356 39.04494382
357 39.21568627
358 39.10614525
359 38.99721448
360 39.16666667
361 39.33518006
362 39.50276243
363 39.66942149
364 39.56043956
365 39.45205479
366 39.61748634
367 39.50953678
368 39.40217391
369 39.29539295
370 39.45945946
371 39.35309973
372 39.24731183
373 39.14209115
374 39.03743316
375 39.20000000
376 39.09574468
377 38.99204244
378 39.15343915
379 39.05013193
380 38.94736842
381 38.84514436
382 39.00523560
383 39.16449086
384 39.06250000
385 39.22077922
386 39.11917098
387 39.01808786
388 38.91752577
389 39.07455013
390 39.23076923
391 39.38618926
392 39.54081633
393 39.44020356
394 39.34010152
395 39.49367089
396 39.64646465
397 39.54659950
398 39.44723618
399 39.59899749
400 39.50000000
401 39.65087282
402 39.55223881
403 39.45409429
404 39.35643564
405 39.50617284
406 39.40886700
407 39.31203931
408 39.21568627
409 39.36430318
410 39.26829268
411 39.17274939
412 39.07766990
413 38.98305085
414 39.13043478
415 39.03614458
416 38.94230769
417 38.84892086
418 38.75598086
419 38.90214797
420 38.80952381
421 38.71733967
422 38.62559242
423 38.53427896
424 38.67924528
425 38.58823529
426 38.49765258
427 38.64168618
428 38.55140187
429 38.69463869
430 38.60465116
431 38.74709977
432 38.65740741
433 38.56812933
434 38.47926267
435 38.39080460
436 38.30275229
437 38.21510297
438 38.35616438
439 38.49658314
440 38.40909091
441 38.54875283
442 38.46153846
443 38.37471783
444 38.28828829
445 38.20224719
446 38.11659193
447 38.03131991
448 37.94642857
449 37.86191537
450 37.77777778
451 37.69401330
452 37.61061947
453 37.74834437
454 37.66519824
455 37.58241758
456 37.71929825
457 37.85557987
458 37.77292576
459 37.69063181
460 37.82608696
461 37.74403471
462 37.87878788
463 37.79697624
464 37.93103448
465 37.84946237
466 37.76824034
467 37.90149893
468 38.03418803
469 38.16631130
470 38.29787234
471 38.21656051
472 38.13559322
473 38.26638478
474 38.18565401
475 38.10526316
476 38.23529412
477 38.15513627
478 38.28451883
479 38.41336117
480 38.33333333
481 38.25363825
482 38.17427386
483 38.09523810
484 38.01652893
485 37.93814433
486 38.06584362
487 38.19301848
488 38.11475410
489 38.03680982
490 38.16326531
491 38.08553971
492 38.00813008
493 37.93103448
494 38.05668016
495 38.18181818
496 38.30645161
497 38.22937626
498 38.15261044
499 38.27655311
500 38.40000000
501 38.32335329
502 38.44621514
503 38.56858847
504 38.49206349
505 38.61386139
506 38.73517787
507 38.65877712
508 38.58267717
509 38.70333988
510 38.82352941
511 38.74755382
512 38.67187500
513 38.59649123
514 38.52140078
515 38.64077670
516 38.75968992
517 38.87814313
518 38.99613900
519 38.92100193
520 38.84615385
521 38.77159309
522 38.88888889
523 38.81453155
524 38.93129771
525 38.85714286
526 38.78326996
527 38.89943074
528 39.01515152
529 39.13043478
530 39.24528302
531 39.17137476
532 39.09774436
533 39.02439024
534 39.13857678
535 39.25233645
536 39.17910448
537 39.29236499
538 39.21933086
539 39.14656772
540 39.07407407
541 39.00184843
542 39.11439114
543 39.04235727
544 39.15441176
545 39.08256881
546 39.01098901
547 39.12248629
548 39.23357664
549 39.16211293
550 39.09090909
551 39.20145191
552 39.13043478
553 39.05967450
554 38.98916968
555 38.91891892
556 39.02877698
557 38.95870736
558 39.06810036
559 38.99821109
560 38.92857143
561 38.85918004
562 38.96797153
563 39.07637655
564 39.00709220
565 39.11504425
566 39.22261484
567 39.15343915
568 39.08450704
569 39.01581722
570 38.94736842
571 38.87915937
572 38.81118881
573 38.74345550
574 38.67595819
575 38.60869565
576 38.71527778
577 38.64818024
578 38.58131488
579 38.51468048
580 38.62068966
581 38.55421687
582 38.48797251
583 38.59348199
584 38.69863014
585 38.80341880
586 38.73720137
587 38.67120954
588 38.60544218
589 38.53989813
590 38.47457627
591 38.40947547
592 38.34459459
593 38.44856661
594 38.55218855
595 38.48739496
596 38.59060403
597 38.52596315
598 38.46153846
599 38.56427379
600 38.50000000
601 38.43594010
602 38.37209302
603 38.47429519
604 38.41059603
605 38.51239669
606 38.44884488
607 38.55024712
608 38.48684211
609 38.42364532
610 38.36065574
611 38.46153846
612 38.39869281
613 38.49918434
614 38.59934853
615 38.53658537
616 38.47402597
617 38.41166937
618 38.34951456
619 38.28756058
620 38.22580645
621 38.16425121
622 38.26366559
623 38.20224719
624 38.14102564
625 38.08000000
626 38.17891374
627 38.11802233
628 38.05732484
629 38.15580286
630 38.09523810
631 38.03486529
632 38.13291139
633 38.07266983
634 38.01261830
635 37.95275591
636 37.89308176
637 37.99058085
638 37.93103448
639 37.87167449
640 37.96875000
641 37.90951638
642 38.00623053
643 37.94712286
644 37.88819876
645 37.82945736
646 37.77089783
647 37.71251932
648 37.80864198
649 37.75038521
650 37.69230769
651 37.63440860
652 37.73006135
653 37.67228178
654 37.76758410
655 37.70992366
656 37.80487805
657 37.74733638
658 37.84194529
659 37.78452200
660 37.72727273
661 37.67019667
662 37.61329305
663 37.55656109
664 37.50000000
665 37.44360902
666 37.38738739
667 37.33133433
668 37.42514970
669 37.51868460
670 37.46268657
671 37.40685544
672 37.35119048
673 37.29569094
674 37.38872404
675 37.33333333
676 37.27810651
677 37.22304284
678 37.31563422
679 37.26067747
680 37.20588235
681 37.29809104
682 37.24340176
683 37.18887262
684 37.13450292
685 37.22627737
686 37.17201166
687 37.11790393
688 37.06395349
689 37.01015965
690 36.95652174
691 37.04775687
692 36.99421965
693 36.94083694
694 36.88760807
695 36.83453237
696 36.92528736
697 36.87230990
698 36.96275072
699 37.05293276
700 37.00000000
701 36.94721826
702 37.03703704
703 37.12660028
704 37.21590909
705 37.16312057
706 37.11048159
707 37.05799151
708 37.00564972
709 36.95345557
710 36.90140845
711 36.99015471
712 36.93820225
713 37.02664797
714 37.11484594
715 37.06293706
716 37.15083799
717 37.09902371
718 37.04735376
719 36.99582754
720 36.94444444
721 36.89320388
722 36.84210526
723 36.92946058
724 36.87845304
725 36.96551724
726 36.91460055
727 37.00137552
728 37.08791209
729 37.03703704
730 36.98630137
731 36.93570451
732 36.88524590
733 36.97135061
734 36.92098093
735 36.87074830
736 36.82065217
737 36.90637720
738 36.99186992
739 37.07713126
740 37.02702703
741 36.97705803
742 37.06199461
743 37.01211306
744 37.09677419
745 37.04697987
746 36.99731903
747 36.94779116
748 36.89839572
749 36.84913218
750 36.93333333
Final result: 36.9333 +/- 1.7635
Random chance: 25.0000 +/- 1.5822
|