---
language:
- id
license: mit
base_model: indolem/indobert-base-uncased
tags:
- generated_from_trainer
model-index:
- name: nerui-pt-pl10-2
  results: []
---
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
should probably proofread and complete it, then remove this comment. -->
# nerui-pt-pl10-2
This model is a fine-tuned version of [indolem/indobert-base-uncased](https://huggingface.co/indolem/indobert-base-uncased) on an unknown dataset.
It achieves the following results on the evaluation set:
- Loss: 0.0599
- Location Precision: 0.9175
- Location Recall: 0.9570
- Location F1: 0.9368
- Location Number: 93
- Organization Precision: 0.9162
- Organization Recall: 0.9217
- Organization F1: 0.9189
- Organization Number: 166
- Person Precision: 0.9858
- Person Recall: 0.9789
- Person F1: 0.9823
- Person Number: 142
- Overall Precision: 0.9407
- Overall Recall: 0.9501
- Overall F1: 0.9454
- Overall Accuracy: 0.9882
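
The per-entity F1 values above are consistent with the harmonic mean of the reported precision and recall, and the overall recall is consistent with a micro-average over the three entity types weighted by their support ("Number"). The sketch below re-derives both from the rounded figures in the list. The helper name `f1_score` is illustrative, and micro-averaging is an assumption inferred from the numbers, not something this card states.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall."""
    return 2 * precision * recall / (precision + recall)


# (precision, recall, reported F1, support) copied from the list above.
reported = {
    "Location": (0.9175, 0.9570, 0.9368, 93),
    "Organization": (0.9162, 0.9217, 0.9189, 166),
    "Person": (0.9858, 0.9789, 0.9823, 142),
}

for name, (precision, recall, f1, _support) in reported.items():
    recomputed = f1_score(precision, recall)
    print(f"{name}: recomputed F1 = {recomputed:.4f}, reported F1 = {f1}")

# Assumption: the overall recall is micro-averaged, i.e. the per-type recalls
# weighted by how many gold entities of each type the evaluation set contains.
total_support = sum(support for _, _, _, support in reported.values())
micro_recall = sum(
    recall * support for _, recall, _, support in reported.values()
) / total_support
print(f"Micro-averaged recall = {micro_recall:.4f} (reported overall: 0.9501)")
```

Because the inputs are already rounded to four decimals, recomputed values can differ from the reported ones in the last digit.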
## Model description
More information needed
## Intended uses & limitations
More information needed
## Training and evaluation data
More information needed
## Training procedure
### Training hyperparameters
The following hyperparameters were used during training:
- learning_rate: 5e-05
- train_batch_size: 16
- eval_batch_size: 64
- seed: 42
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- num_epochs: 100.0
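
With `lr_scheduler_type: linear`, the learning rate falls linearly from its peak to zero over the course of training. The following is a minimal sketch of that schedule, assuming no warmup (no warmup setting is listed above) and 9600 total optimizer steps (100 epochs at 96 steps per epoch, as reported in the Step column of the training results table in this card). `linear_lr` is a hypothetical helper written for illustration, not a Transformers function.

```python
PEAK_LR = 5e-05        # learning_rate from the list above
STEPS_PER_EPOCH = 96   # from the Step column of the training results table
NUM_EPOCHS = 100       # num_epochs from the list above
TOTAL_STEPS = STEPS_PER_EPOCH * NUM_EPOCHS


def linear_lr(
    step: int,
    peak_lr: float = PEAK_LR,
    total_steps: int = TOTAL_STEPS,
    warmup_steps: int = 0,  # assumption: no warmup was used
) -> float:
    """Learning rate at a given optimizer step under linear decay."""
    if warmup_steps and step < warmup_steps:
        # Linear ramp-up from 0 to peak_lr during warmup.
        return peak_lr * step / warmup_steps
    # Linear decay from peak_lr to 0 over the remaining steps.
    remaining = max(0, total_steps - step)
    return peak_lr * remaining / max(1, total_steps - warmup_steps)


for epoch in (0, 25, 50, 75, 100):
    step = epoch * STEPS_PER_EPOCH
    print(f"epoch {epoch:3d} (step {step:4d}): lr = {linear_lr(step):.2e}")
```

Under these assumptions the learning rate is half its peak at epoch 50 and reaches zero at the final step.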
### Training results
| Training Loss | Epoch | Step | Validation Loss | Location Precision | Location Recall | Location F1 | Location Number | Organization Precision | Organization Recall | Organization F1 | Organization Number | Person Precision | Person Recall | Person F1 | Person Number | Overall Precision | Overall Recall | Overall F1 | Overall Accuracy |
|:-------------:|:-----:|:----:|:---------------:|:------------------:|:---------------:|:-----------:|:---------------:|:----------------------:|:-------------------:|:---------------:|:-------------------:|:----------------:|:-------------:|:---------:|:-------------:|:-----------------:|:--------------:|:----------:|:----------------:|
| 0.8428 | 1.0 | 96 | 0.3885 | 0.2353 | 0.0430 | 0.0727 | 93 | 0.2318 | 0.3253 | 0.2707 | 166 | 0.2865 | 0.3451 | 0.3131 | 142 | 0.2542 | 0.2668 | 0.2603 | 0.8697 |
| 0.3617 | 2.0 | 192 | 0.2171 | 0.3037 | 0.4409 | 0.3596 | 93 | 0.6190 | 0.5482 | 0.5815 | 166 | 0.4829 | 0.6972 | 0.5706 | 142 | 0.4743 | 0.5761 | 0.5203 | 0.9317 |
| 0.1955 | 3.0 | 288 | 0.1000 | 0.8193 | 0.7312 | 0.7727 | 93 | 0.7030 | 0.8554 | 0.7717 | 166 | 0.9650 | 0.9718 | 0.9684 | 142 | 0.8131 | 0.8678 | 0.8396 | 0.9682 |
| 0.1335 | 4.0 | 384 | 0.0860 | 0.7573 | 0.8387 | 0.7959 | 93 | 0.7437 | 0.8916 | 0.8110 | 166 | 0.9521 | 0.9789 | 0.9653 | 142 | 0.8147 | 0.9102 | 0.8598 | 0.9715 |
| 0.1071 | 5.0 | 480 | 0.0629 | 0.8190 | 0.9247 | 0.8687 | 93 | 0.8706 | 0.8916 | 0.8810 | 166 | 0.9720 | 0.9789 | 0.9754 | 142 | 0.8923 | 0.9302 | 0.9109 | 0.9813 |
| 0.0956 | 6.0 | 576 | 0.0531 | 0.7946 | 0.9570 | 0.8683 | 93 | 0.8721 | 0.9036 | 0.8876 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.8894 | 0.9426 | 0.9153 | 0.9830 |
| 0.083 | 7.0 | 672 | 0.0504 | 0.8617 | 0.8710 | 0.8663 | 93 | 0.8235 | 0.9277 | 0.8725 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.8863 | 0.9327 | 0.9089 | 0.9833 |
| 0.0784 | 8.0 | 768 | 0.0515 | 0.7719 | 0.9462 | 0.8502 | 93 | 0.8834 | 0.8675 | 0.8754 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.8876 | 0.9252 | 0.9060 | 0.9819 |
| 0.072 | 9.0 | 864 | 0.0427 | 0.8687 | 0.9247 | 0.8958 | 93 | 0.8982 | 0.9036 | 0.9009 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9214 | 0.9352 | 0.9282 | 0.9860 |
| 0.0688 | 10.0 | 960 | 0.0463 | 0.8318 | 0.9570 | 0.89 | 93 | 0.9 | 0.9217 | 0.9107 | 166 | 0.9650 | 0.9718 | 0.9684 | 142 | 0.9048 | 0.9476 | 0.9257 | 0.9849 |
| 0.0621 | 11.0 | 1056 | 0.0442 | 0.8585 | 0.9785 | 0.9146 | 93 | 0.9317 | 0.9036 | 0.9174 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9314 | 0.9476 | 0.9394 | 0.9860 |
| 0.0595 | 12.0 | 1152 | 0.0453 | 0.9091 | 0.9677 | 0.9375 | 93 | 0.9379 | 0.9096 | 0.9235 | 166 | 0.9787 | 0.9718 | 0.9753 | 142 | 0.9451 | 0.9451 | 0.9451 | 0.9874 |
| 0.0576 | 13.0 | 1248 | 0.0455 | 0.8505 | 0.9785 | 0.91 | 93 | 0.9383 | 0.9157 | 0.9268 | 166 | 0.9789 | 0.9789 | 0.9789 | 142 | 0.9294 | 0.9526 | 0.9409 | 0.9868 |
| 0.054 | 14.0 | 1344 | 0.0415 | 0.9263 | 0.9462 | 0.9362 | 93 | 0.9268 | 0.9157 | 0.9212 | 166 | 0.9718 | 0.9718 | 0.9718 | 142 | 0.9426 | 0.9426 | 0.9426 | 0.9879 |
| 0.0491 | 15.0 | 1440 | 0.0367 | 0.9158 | 0.9355 | 0.9255 | 93 | 0.9167 | 0.9277 | 0.9222 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9406 | 0.9476 | 0.9441 | 0.9885 |
| 0.0468 | 16.0 | 1536 | 0.0437 | 0.8725 | 0.9570 | 0.9128 | 93 | 0.9217 | 0.9217 | 0.9217 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9315 | 0.9501 | 0.9407 | 0.9866 |
| 0.0466 | 17.0 | 1632 | 0.0427 | 0.8969 | 0.9355 | 0.9158 | 93 | 0.9006 | 0.9277 | 0.9139 | 166 | 0.9787 | 0.9718 | 0.9753 | 142 | 0.9267 | 0.9451 | 0.9358 | 0.9877 |
| 0.0442 | 18.0 | 1728 | 0.0398 | 0.9072 | 0.9462 | 0.9263 | 93 | 0.9207 | 0.9096 | 0.9152 | 166 | 0.9787 | 0.9718 | 0.9753 | 142 | 0.9378 | 0.9401 | 0.9390 | 0.9877 |
| 0.0407 | 19.0 | 1824 | 0.0447 | 0.91 | 0.9785 | 0.9430 | 93 | 0.9437 | 0.9096 | 0.9264 | 166 | 0.9787 | 0.9718 | 0.9753 | 142 | 0.9476 | 0.9476 | 0.9476 | 0.9888 |
| 0.0391 | 20.0 | 1920 | 0.0472 | 0.8491 | 0.9677 | 0.9045 | 93 | 0.9141 | 0.8976 | 0.9058 | 166 | 0.9787 | 0.9718 | 0.9753 | 142 | 0.9195 | 0.9401 | 0.9297 | 0.9857 |
| 0.0387 | 21.0 | 2016 | 0.0460 | 0.8667 | 0.9785 | 0.9192 | 93 | 0.9114 | 0.8675 | 0.8889 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9257 | 0.9327 | 0.9292 | 0.9874 |
| 0.0343 | 22.0 | 2112 | 0.0412 | 0.9278 | 0.9677 | 0.9474 | 93 | 0.9273 | 0.9217 | 0.9245 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9479 | 0.9526 | 0.9502 | 0.9890 |
| 0.0331 | 23.0 | 2208 | 0.0442 | 0.8969 | 0.9355 | 0.9158 | 93 | 0.9317 | 0.9036 | 0.9174 | 166 | 0.9787 | 0.9718 | 0.9753 | 142 | 0.9398 | 0.9352 | 0.9375 | 0.9879 |
| 0.0329 | 24.0 | 2304 | 0.0421 | 0.9278 | 0.9677 | 0.9474 | 93 | 0.9281 | 0.9337 | 0.9309 | 166 | 0.9787 | 0.9718 | 0.9753 | 142 | 0.9457 | 0.9551 | 0.9504 | 0.9882 |
| 0.0336 | 25.0 | 2400 | 0.0476 | 0.9091 | 0.9677 | 0.9375 | 93 | 0.95 | 0.9157 | 0.9325 | 166 | 0.9789 | 0.9789 | 0.9789 | 142 | 0.9501 | 0.9501 | 0.9501 | 0.9890 |
| 0.0313 | 26.0 | 2496 | 0.0408 | 0.9368 | 0.9570 | 0.9468 | 93 | 0.9023 | 0.9458 | 0.9235 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9390 | 0.9601 | 0.9494 | 0.9893 |
| 0.0287 | 27.0 | 2592 | 0.0449 | 0.9278 | 0.9677 | 0.9474 | 93 | 0.9329 | 0.9217 | 0.9273 | 166 | 0.9787 | 0.9718 | 0.9753 | 142 | 0.9478 | 0.9501 | 0.9489 | 0.9898 |
| 0.03 | 28.0 | 2688 | 0.0429 | 0.9072 | 0.9462 | 0.9263 | 93 | 0.9075 | 0.9458 | 0.9263 | 166 | 0.9787 | 0.9718 | 0.9753 | 142 | 0.9319 | 0.9551 | 0.9433 | 0.9888 |
| 0.031 | 29.0 | 2784 | 0.0416 | 0.9462 | 0.9462 | 0.9462 | 93 | 0.9133 | 0.9518 | 0.9322 | 166 | 0.9787 | 0.9718 | 0.9753 | 142 | 0.9435 | 0.9576 | 0.9505 | 0.9898 |
| 0.027 | 30.0 | 2880 | 0.0406 | 0.9565 | 0.9462 | 0.9514 | 93 | 0.9133 | 0.9518 | 0.9322 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9483 | 0.9601 | 0.9542 | 0.9909 |
| 0.0257 | 31.0 | 2976 | 0.0435 | 0.9278 | 0.9677 | 0.9474 | 93 | 0.9202 | 0.9036 | 0.9119 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9451 | 0.9451 | 0.9451 | 0.9901 |
| 0.0277 | 32.0 | 3072 | 0.0484 | 0.9184 | 0.9677 | 0.9424 | 93 | 0.9080 | 0.8916 | 0.8997 | 166 | 0.9787 | 0.9718 | 0.9753 | 142 | 0.9353 | 0.9377 | 0.9365 | 0.9888 |
| 0.024 | 33.0 | 3168 | 0.0450 | 0.8990 | 0.9570 | 0.9271 | 93 | 0.9024 | 0.8916 | 0.8970 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9307 | 0.9377 | 0.9342 | 0.9888 |
| 0.0227 | 34.0 | 3264 | 0.0509 | 0.9278 | 0.9677 | 0.9474 | 93 | 0.9157 | 0.9157 | 0.9157 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9431 | 0.9501 | 0.9466 | 0.9885 |
| 0.0226 | 35.0 | 3360 | 0.0520 | 0.9091 | 0.9677 | 0.9375 | 93 | 0.8902 | 0.9277 | 0.9086 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9274 | 0.9551 | 0.9410 | 0.9871 |
| 0.0239 | 36.0 | 3456 | 0.0613 | 0.8440 | 0.9892 | 0.9109 | 93 | 0.9226 | 0.8614 | 0.8910 | 166 | 0.9718 | 0.9718 | 0.9718 | 142 | 0.9187 | 0.9302 | 0.9244 | 0.9863 |
| 0.0222 | 37.0 | 3552 | 0.0489 | 0.88 | 0.9462 | 0.9119 | 93 | 0.8929 | 0.9036 | 0.8982 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9218 | 0.9401 | 0.9309 | 0.9877 |
| 0.0221 | 38.0 | 3648 | 0.0525 | 0.8762 | 0.9892 | 0.9293 | 93 | 0.9141 | 0.8976 | 0.9058 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9291 | 0.9476 | 0.9383 | 0.9868 |
| 0.0225 | 39.0 | 3744 | 0.0504 | 0.8824 | 0.9677 | 0.9231 | 93 | 0.8976 | 0.8976 | 0.8976 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9242 | 0.9426 | 0.9333 | 0.9860 |
| 0.0194 | 40.0 | 3840 | 0.0470 | 0.8990 | 0.9570 | 0.9271 | 93 | 0.9212 | 0.9157 | 0.9184 | 166 | 0.9787 | 0.9718 | 0.9753 | 142 | 0.9358 | 0.9451 | 0.9404 | 0.9890 |
| 0.0206 | 41.0 | 3936 | 0.0489 | 0.8969 | 0.9355 | 0.9158 | 93 | 0.8793 | 0.9217 | 0.9 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9199 | 0.9451 | 0.9323 | 0.9866 |
| 0.0208 | 42.0 | 4032 | 0.0510 | 0.9 | 0.9677 | 0.9326 | 93 | 0.9048 | 0.9157 | 0.9102 | 166 | 0.9787 | 0.9718 | 0.9753 | 142 | 0.9291 | 0.9476 | 0.9383 | 0.9879 |
| 0.0178 | 43.0 | 4128 | 0.0496 | 0.8911 | 0.9677 | 0.9278 | 93 | 0.9620 | 0.9157 | 0.9383 | 166 | 0.9787 | 0.9718 | 0.9753 | 142 | 0.95 | 0.9476 | 0.9488 | 0.9896 |
| 0.0184 | 44.0 | 4224 | 0.0533 | 0.9010 | 0.9785 | 0.9381 | 93 | 0.9048 | 0.9157 | 0.9102 | 166 | 0.9787 | 0.9718 | 0.9753 | 142 | 0.9293 | 0.9501 | 0.9396 | 0.9882 |
| 0.0172 | 45.0 | 4320 | 0.0525 | 0.9192 | 0.9785 | 0.9479 | 93 | 0.9444 | 0.9217 | 0.9329 | 166 | 0.9787 | 0.9718 | 0.9753 | 142 | 0.9502 | 0.9526 | 0.9514 | 0.9882 |
| 0.0159 | 46.0 | 4416 | 0.0524 | 0.92 | 0.9892 | 0.9534 | 93 | 0.9207 | 0.9096 | 0.9152 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9432 | 0.9526 | 0.9479 | 0.9885 |
| 0.0174 | 47.0 | 4512 | 0.0516 | 0.9271 | 0.9570 | 0.9418 | 93 | 0.8935 | 0.9096 | 0.9015 | 166 | 0.9787 | 0.9718 | 0.9753 | 142 | 0.9310 | 0.9426 | 0.9368 | 0.9879 |
| 0.0176 | 48.0 | 4608 | 0.0463 | 0.9271 | 0.9570 | 0.9418 | 93 | 0.9222 | 0.9277 | 0.9249 | 166 | 0.9787 | 0.9718 | 0.9753 | 142 | 0.9431 | 0.9501 | 0.9466 | 0.9904 |
| 0.0156 | 49.0 | 4704 | 0.0531 | 0.9082 | 0.9570 | 0.9319 | 93 | 0.9321 | 0.9096 | 0.9207 | 166 | 0.9787 | 0.9718 | 0.9753 | 142 | 0.9426 | 0.9426 | 0.9426 | 0.9893 |
| 0.017 | 50.0 | 4800 | 0.0526 | 0.9 | 0.9677 | 0.9326 | 93 | 0.9212 | 0.9157 | 0.9184 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9384 | 0.9501 | 0.9442 | 0.9879 |
| 0.0156 | 51.0 | 4896 | 0.0570 | 0.8824 | 0.9677 | 0.9231 | 93 | 0.9375 | 0.9036 | 0.9202 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9404 | 0.9451 | 0.9428 | 0.9871 |
| 0.0152 | 52.0 | 4992 | 0.0518 | 0.9072 | 0.9462 | 0.9263 | 93 | 0.9329 | 0.9217 | 0.9273 | 166 | 0.9787 | 0.9718 | 0.9753 | 142 | 0.9428 | 0.9451 | 0.9440 | 0.9898 |
| 0.017 | 53.0 | 5088 | 0.0487 | 0.9072 | 0.9462 | 0.9263 | 93 | 0.9281 | 0.9337 | 0.9309 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9432 | 0.9526 | 0.9479 | 0.9896 |
| 0.0143 | 54.0 | 5184 | 0.0542 | 0.88 | 0.9462 | 0.9119 | 93 | 0.9268 | 0.9157 | 0.9212 | 166 | 0.9787 | 0.9718 | 0.9753 | 142 | 0.9333 | 0.9426 | 0.9380 | 0.9877 |
| 0.0152 | 55.0 | 5280 | 0.0521 | 0.8990 | 0.9570 | 0.9271 | 93 | 0.9222 | 0.9277 | 0.9249 | 166 | 0.9786 | 0.9648 | 0.9716 | 142 | 0.9360 | 0.9476 | 0.9418 | 0.9888 |
| 0.0123 | 56.0 | 5376 | 0.0536 | 0.9072 | 0.9462 | 0.9263 | 93 | 0.9329 | 0.9217 | 0.9273 | 166 | 0.9787 | 0.9718 | 0.9753 | 142 | 0.9428 | 0.9451 | 0.9440 | 0.9888 |
| 0.0145 | 57.0 | 5472 | 0.0507 | 0.9167 | 0.9462 | 0.9312 | 93 | 0.9212 | 0.9157 | 0.9184 | 166 | 0.9787 | 0.9718 | 0.9753 | 142 | 0.9403 | 0.9426 | 0.9415 | 0.9898 |
| 0.0138 | 58.0 | 5568 | 0.0550 | 0.8980 | 0.9462 | 0.9215 | 93 | 0.9264 | 0.9096 | 0.9179 | 166 | 0.9716 | 0.9648 | 0.9682 | 142 | 0.9353 | 0.9377 | 0.9365 | 0.9893 |
| 0.0122 | 59.0 | 5664 | 0.0532 | 0.9010 | 0.9785 | 0.9381 | 93 | 0.9379 | 0.9096 | 0.9235 | 166 | 0.9787 | 0.9718 | 0.9753 | 142 | 0.9429 | 0.9476 | 0.9453 | 0.9882 |
| 0.0144 | 60.0 | 5760 | 0.0572 | 0.8835 | 0.9785 | 0.9286 | 93 | 0.9367 | 0.8916 | 0.9136 | 166 | 0.9718 | 0.9718 | 0.9718 | 142 | 0.9355 | 0.9401 | 0.9378 | 0.9882 |
| 0.0126 | 61.0 | 5856 | 0.0488 | 0.9474 | 0.9677 | 0.9574 | 93 | 0.9390 | 0.9277 | 0.9333 | 166 | 0.9718 | 0.9718 | 0.9718 | 142 | 0.9526 | 0.9526 | 0.9526 | 0.9907 |
| 0.0113 | 62.0 | 5952 | 0.0563 | 0.8911 | 0.9677 | 0.9278 | 93 | 0.925 | 0.8916 | 0.9080 | 166 | 0.9718 | 0.9718 | 0.9718 | 142 | 0.9330 | 0.9377 | 0.9353 | 0.9888 |
| 0.0109 | 63.0 | 6048 | 0.0592 | 0.8725 | 0.9570 | 0.9128 | 93 | 0.9062 | 0.8735 | 0.8896 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9256 | 0.9302 | 0.9279 | 0.9866 |
| 0.0124 | 64.0 | 6144 | 0.0568 | 0.9255 | 0.9355 | 0.9305 | 93 | 0.9085 | 0.8976 | 0.9030 | 166 | 0.9718 | 0.9718 | 0.9718 | 142 | 0.935 | 0.9327 | 0.9338 | 0.9877 |
| 0.0126 | 65.0 | 6240 | 0.0559 | 0.8990 | 0.9570 | 0.9271 | 93 | 0.9068 | 0.8795 | 0.8930 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9327 | 0.9327 | 0.9327 | 0.9877 |
| 0.0112 | 66.0 | 6336 | 0.0573 | 0.9263 | 0.9462 | 0.9362 | 93 | 0.9152 | 0.9096 | 0.9124 | 166 | 0.9787 | 0.9718 | 0.9753 | 142 | 0.9401 | 0.9401 | 0.9401 | 0.9885 |
| 0.0112 | 67.0 | 6432 | 0.0617 | 0.8558 | 0.9570 | 0.9036 | 93 | 0.8902 | 0.8795 | 0.8848 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9144 | 0.9327 | 0.9235 | 0.9863 |
| 0.0117 | 68.0 | 6528 | 0.0534 | 0.9082 | 0.9570 | 0.9319 | 93 | 0.9259 | 0.9036 | 0.9146 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9426 | 0.9426 | 0.9426 | 0.9885 |
| 0.0101 | 69.0 | 6624 | 0.0571 | 0.9271 | 0.9570 | 0.9418 | 93 | 0.9157 | 0.9157 | 0.9157 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9429 | 0.9476 | 0.9453 | 0.9890 |
| 0.0092 | 70.0 | 6720 | 0.0559 | 0.9091 | 0.9677 | 0.9375 | 93 | 0.9325 | 0.9157 | 0.9240 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9454 | 0.9501 | 0.9478 | 0.9893 |
| 0.011 | 71.0 | 6816 | 0.0611 | 0.9368 | 0.9570 | 0.9468 | 93 | 0.9329 | 0.9217 | 0.9273 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9525 | 0.9501 | 0.9513 | 0.9879 |
| 0.0097 | 72.0 | 6912 | 0.0558 | 0.9167 | 0.9462 | 0.9312 | 93 | 0.9383 | 0.9157 | 0.9268 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9499 | 0.9451 | 0.9475 | 0.9893 |
| 0.0102 | 73.0 | 7008 | 0.0560 | 0.8980 | 0.9462 | 0.9215 | 93 | 0.9321 | 0.9096 | 0.9207 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9426 | 0.9426 | 0.9426 | 0.9879 |
| 0.0098 | 74.0 | 7104 | 0.0587 | 0.9184 | 0.9677 | 0.9424 | 93 | 0.9212 | 0.9157 | 0.9184 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9431 | 0.9501 | 0.9466 | 0.9882 |
| 0.0105 | 75.0 | 7200 | 0.0585 | 0.8812 | 0.9570 | 0.9175 | 93 | 0.9141 | 0.8976 | 0.9058 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9309 | 0.9401 | 0.9355 | 0.9874 |
| 0.01 | 76.0 | 7296 | 0.0599 | 0.9368 | 0.9570 | 0.9468 | 93 | 0.9273 | 0.9217 | 0.9245 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9501 | 0.9501 | 0.9501 | 0.9893 |
| 0.0084 | 77.0 | 7392 | 0.0602 | 0.9091 | 0.9677 | 0.9375 | 93 | 0.9212 | 0.9157 | 0.9184 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9407 | 0.9501 | 0.9454 | 0.9893 |
| 0.0089 | 78.0 | 7488 | 0.0632 | 0.89 | 0.9570 | 0.9223 | 93 | 0.9207 | 0.9096 | 0.9152 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9358 | 0.9451 | 0.9404 | 0.9879 |
| 0.0086 | 79.0 | 7584 | 0.0588 | 0.9072 | 0.9462 | 0.9263 | 93 | 0.9379 | 0.9096 | 0.9235 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9474 | 0.9426 | 0.9450 | 0.9885 |
| 0.009 | 80.0 | 7680 | 0.0618 | 0.9175 | 0.9570 | 0.9368 | 93 | 0.9273 | 0.9217 | 0.9245 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9454 | 0.9501 | 0.9478 | 0.9885 |
| 0.0084 | 81.0 | 7776 | 0.0612 | 0.9175 | 0.9570 | 0.9368 | 93 | 0.9152 | 0.9096 | 0.9124 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9404 | 0.9451 | 0.9428 | 0.9882 |
| 0.0089 | 82.0 | 7872 | 0.0595 | 0.9082 | 0.9570 | 0.9319 | 93 | 0.9207 | 0.9096 | 0.9152 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9404 | 0.9451 | 0.9428 | 0.9890 |
| 0.0089 | 83.0 | 7968 | 0.0586 | 0.9368 | 0.9570 | 0.9468 | 93 | 0.9222 | 0.9277 | 0.9249 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9479 | 0.9526 | 0.9502 | 0.9890 |
| 0.0084 | 84.0 | 8064 | 0.0567 | 0.9468 | 0.9570 | 0.9519 | 93 | 0.9273 | 0.9217 | 0.9245 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9525 | 0.9501 | 0.9513 | 0.9909 |
| 0.0088 | 85.0 | 8160 | 0.0617 | 0.9082 | 0.9570 | 0.9319 | 93 | 0.9212 | 0.9157 | 0.9184 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9406 | 0.9476 | 0.9441 | 0.9890 |
| 0.0089 | 86.0 | 8256 | 0.0582 | 0.9263 | 0.9462 | 0.9362 | 93 | 0.9268 | 0.9157 | 0.9212 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9475 | 0.9451 | 0.9463 | 0.9896 |
| 0.0066 | 87.0 | 8352 | 0.0611 | 0.9368 | 0.9570 | 0.9468 | 93 | 0.9277 | 0.9277 | 0.9277 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9502 | 0.9526 | 0.9514 | 0.9893 |
| 0.0088 | 88.0 | 8448 | 0.0586 | 0.9175 | 0.9570 | 0.9368 | 93 | 0.9441 | 0.9157 | 0.9297 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9524 | 0.9476 | 0.95 | 0.9893 |
| 0.007 | 89.0 | 8544 | 0.0602 | 0.9271 | 0.9570 | 0.9418 | 93 | 0.9207 | 0.9096 | 0.9152 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9451 | 0.9451 | 0.9451 | 0.9893 |
| 0.0083 | 90.0 | 8640 | 0.0580 | 0.9468 | 0.9570 | 0.9519 | 93 | 0.9333 | 0.9277 | 0.9305 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.955 | 0.9526 | 0.9538 | 0.9901 |
| 0.0077 | 91.0 | 8736 | 0.0591 | 0.9368 | 0.9570 | 0.9468 | 93 | 0.9107 | 0.9217 | 0.9162 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9431 | 0.9501 | 0.9466 | 0.9890 |
| 0.0078 | 92.0 | 8832 | 0.0608 | 0.9271 | 0.9570 | 0.9418 | 93 | 0.9162 | 0.9217 | 0.9189 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9431 | 0.9501 | 0.9466 | 0.9879 |
| 0.0068 | 93.0 | 8928 | 0.0581 | 0.9468 | 0.9570 | 0.9519 | 93 | 0.9048 | 0.9157 | 0.9102 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9429 | 0.9476 | 0.9453 | 0.9890 |
| 0.0079 | 94.0 | 9024 | 0.0606 | 0.9368 | 0.9570 | 0.9468 | 93 | 0.9107 | 0.9217 | 0.9162 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9431 | 0.9501 | 0.9466 | 0.9885 |
| 0.0072 | 95.0 | 9120 | 0.0600 | 0.9271 | 0.9570 | 0.9418 | 93 | 0.9162 | 0.9217 | 0.9189 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9431 | 0.9501 | 0.9466 | 0.9896 |
| 0.007 | 96.0 | 9216 | 0.0599 | 0.9175 | 0.9570 | 0.9368 | 93 | 0.9107 | 0.9217 | 0.9162 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9384 | 0.9501 | 0.9442 | 0.9885 |
| 0.0082 | 97.0 | 9312 | 0.0601 | 0.9175 | 0.9570 | 0.9368 | 93 | 0.9217 | 0.9217 | 0.9217 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9431 | 0.9501 | 0.9466 | 0.9885 |
| 0.0065 | 98.0 | 9408 | 0.0606 | 0.9175 | 0.9570 | 0.9368 | 93 | 0.9217 | 0.9217 | 0.9217 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9431 | 0.9501 | 0.9466 | 0.9885 |
| 0.0075 | 99.0 | 9504 | 0.0600 | 0.9175 | 0.9570 | 0.9368 | 93 | 0.9162 | 0.9217 | 0.9189 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9407 | 0.9501 | 0.9454 | 0.9882 |
| 0.0057 | 100.0 | 9600 | 0.0599 | 0.9175 | 0.9570 | 0.9368 | 93 | 0.9162 | 0.9217 | 0.9189 | 166 | 0.9858 | 0.9789 | 0.9823 | 142 | 0.9407 | 0.9501 | 0.9454 | 0.9882 |
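
In the table, validation loss is lowest at epoch 15 (0.0367) and is higher at the final epoch (0.0599), while training loss keeps falling, so the last checkpoint is not necessarily the strongest one. The sketch below compares a few rows copied from the table by validation loss and by overall F1. It is only an illustration of how such a comparison could be made, and says nothing about which checkpoint was actually published.

```python
# (epoch, validation_loss, overall_f1) for a few rows copied from the table above.
rows = [
    (15, 0.0367, 0.9441),
    (30, 0.0406, 0.9542),
    (61, 0.0488, 0.9526),
    (90, 0.0580, 0.9538),
    (100, 0.0599, 0.9454),
]

best_by_loss = min(rows, key=lambda row: row[1])  # lowest validation loss
best_by_f1 = max(rows, key=lambda row: row[2])    # highest overall F1
final = rows[-1]                                  # last epoch of the run

for label, (epoch, loss, f1) in (
    ("lowest validation loss", best_by_loss),
    ("highest overall F1", best_by_f1),
    ("final epoch", final),
):
    print(f"{label}: epoch {epoch}, validation loss {loss:.4f}, overall F1 {f1:.4f}")
```

In Transformers, the `load_best_model_at_end` and `metric_for_best_model` options of `TrainingArguments` are the usual way to keep the best-scoring checkpoint. Whether they were set for this run is not recorded in the hyperparameters above.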
### Framework versions
- Transformers 4.39.3
- Pytorch 2.3.0+cu121
- Datasets 2.19.1
- Tokenizers 0.15.2