language:
  - id
license: mit
base_model: indolem/indobert-base-uncased
tags:
  - generated_from_trainer
model-index:
  - name: nerui-lora-r8-4
    results: []

nerui-lora-r8-4

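This model is a fine-tuned version of indolem/indobert-base-uncased; the dataset is not identified on this card. It achieves the following results on the evaluation set, which match the final row (epoch 100, step 9600) of the training results table: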

  • Loss: 0.0454
  • Location Precision: 0.8611
  • Location Recall: 0.9029
  • Location F1: 0.8815
  • Location Number: 103
  • Organization Precision: 0.8817
  • Organization Recall: 0.8713
  • Organization F1: 0.8765
  • Organization Number: 171
  • Person Precision: 0.9695
  • Person Recall: 0.9695
  • Person F1: 0.9695
  • Person Number: 131
  • Overall Precision: 0.9044
  • Overall Recall: 0.9111
  • Overall F1: 0.9077
  • Overall Accuracy: 0.9848
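
The relationship between the per-type and overall scores above can be checked with a short sketch. It assumes the overall precision, recall and F1 are micro-averages over entities (pooled counts), which the card does not state explicitly. The raw counts below are not on the card either: they are inferred from the rounded ratios and the reported support numbers, so treat them as an assumption.

```python
# Entity counts inferred from the rounded precision/recall values on this card
# (an assumption: the card reports only ratios and support, not raw counts).
support = {"Location": 103, "Organization": 171, "Person": 131}
true_positives = {"Location": 93, "Organization": 149, "Person": 127}
predicted = {"Location": 108, "Organization": 169, "Person": 131}


def prf(tp, n_pred, n_gold):
    """Return (precision, recall, F1) from raw counts."""
    precision = tp / n_pred
    recall = tp / n_gold
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Per-type scores, rounded to four decimals as on the card.
per_type = {
    name: tuple(
        round(v, 4)
        for v in prf(true_positives[name], predicted[name], support[name])
    )
    for name in support
}

# Micro-average: pool the counts over all entity types, then compute the ratios.
overall = tuple(
    round(v, 4)
    for v in prf(
        sum(true_positives.values()),
        sum(predicted.values()),
        sum(support.values()),
    )
)

for name, scores in per_type.items():
    print(name, scores)
print("Overall", overall)
```

Under these inferred counts, the pooled ratios reproduce the reported overall precision, recall and F1, which is consistent with micro-averaging. Overall Accuracy is a separate figure and is not derived here.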

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 5e-05
  • train_batch_size: 16
  • eval_batch_size: 64
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 100.0
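
As a rough illustration of what these settings imply, the sketch below computes the total number of optimizer steps and the learning rate under a linear schedule. It assumes no warmup steps, a single device and no gradient accumulation, none of which the card states; `linear_lr` is a hypothetical helper written for this example, not a library function. The 96 steps per epoch are taken from the training results table.

```python
base_lr = 5e-05        # learning_rate from the card
batch_size = 16        # train_batch_size from the card
num_epochs = 100       # num_epochs from the card
steps_per_epoch = 96   # step count logged after epoch 1 in the results table
total_steps = steps_per_epoch * num_epochs


def linear_lr(step, warmup_steps=0):
    """Linear warmup followed by linear decay to zero.

    warmup_steps=0 is an assumption; the card lists no warmup setting.
    """
    if step < warmup_steps:
        return base_lr * step / max(1, warmup_steps)
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


# Training-set sizes consistent with 96 steps per epoch at batch size 16,
# assuming one device, no gradient accumulation and a kept final partial batch.
min_examples = (steps_per_epoch - 1) * batch_size + 1
max_examples = steps_per_epoch * batch_size

print("total steps:", total_steps)
print("lr at start, midpoint, end:",
      linear_lr(0), linear_lr(total_steps // 2), linear_lr(total_steps))
print("training examples between", min_examples, "and", max_examples)
```

If the run used warmup or gradient accumulation, the learning-rate curve and the implied dataset size would differ from this sketch.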

Training results

Training Loss | Epoch | Step | Validation Loss | Location Precision | Location Recall | Location F1 | Location Number | Organization Precision | Organization Recall | Organization F1 | Organization Number | Person Precision | Person Recall | Person F1 | Person Number | Overall Precision | Overall Recall | Overall F1 | Overall Accuracy
1.1639 1.0 96 0.6983 0.0 0.0 0.0 103 0.0 0.0 0.0 171 0.0 0.0 0.0 131 0.0 0.0 0.0 0.8373
0.6685 2.0 192 0.5650 0.0 0.0 0.0 103 0.0 0.0 0.0 171 0.0 0.0 0.0 131 0.0 0.0 0.0 0.8376
0.553 3.0 288 0.4425 0.0 0.0 0.0 103 0.3889 0.0409 0.0741 171 0.225 0.0687 0.1053 131 0.2623 0.0395 0.0687 0.8470
0.4403 4.0 384 0.3329 0.2 0.0485 0.0781 103 0.3694 0.2398 0.2908 171 0.3924 0.4733 0.4291 131 0.3673 0.2667 0.3090 0.8835
0.3288 5.0 480 0.2455 0.425 0.3301 0.3716 103 0.5248 0.6199 0.5684 171 0.5449 0.6947 0.6107 131 0.5145 0.5704 0.5410 0.9263
0.2474 6.0 576 0.1893 0.6429 0.6117 0.6269 103 0.6337 0.7485 0.6863 171 0.7763 0.9008 0.8339 131 0.6836 0.7630 0.7211 0.9506
0.1962 7.0 672 0.1534 0.7957 0.7184 0.7551 103 0.6965 0.8187 0.7527 171 0.8803 0.9542 0.9158 131 0.7775 0.8370 0.8062 0.9605
0.1659 8.0 768 0.1277 0.81 0.7864 0.7980 103 0.7581 0.8246 0.7899 171 0.9124 0.9542 0.9328 131 0.8203 0.8568 0.8382 0.9652
0.1495 9.0 864 0.1119 0.8571 0.8155 0.8358 103 0.7656 0.8596 0.8099 171 0.9007 0.9695 0.9338 131 0.8306 0.8840 0.8565 0.9691
0.1342 10.0 960 0.1009 0.85 0.8252 0.8374 103 0.7884 0.8713 0.8278 171 0.9 0.9618 0.9299 131 0.8392 0.8889 0.8633 0.9707
0.1241 11.0 1056 0.0915 0.8286 0.8447 0.8365 103 0.8156 0.8538 0.8343 171 0.9265 0.9618 0.9438 131 0.8548 0.8864 0.8703 0.9727
0.1188 12.0 1152 0.0854 0.8269 0.8350 0.8309 103 0.8343 0.8830 0.8580 171 0.9197 0.9618 0.9403 131 0.8602 0.8963 0.8779 0.9746
0.1102 13.0 1248 0.0798 0.8654 0.8738 0.8696 103 0.8444 0.8889 0.8661 171 0.9333 0.9618 0.9474 131 0.8783 0.9086 0.8932 0.9762
0.1044 14.0 1344 0.0780 0.89 0.8641 0.8768 103 0.8830 0.8830 0.8830 171 0.9403 0.9618 0.9509 131 0.9037 0.9037 0.9037 0.9782
0.1009 15.0 1440 0.0721 0.9091 0.8738 0.8911 103 0.8478 0.9123 0.8789 171 0.9478 0.9695 0.9585 131 0.8945 0.9210 0.9075 0.9782
0.0978 16.0 1536 0.0696 0.8932 0.8932 0.8932 103 0.8462 0.9006 0.8725 171 0.9403 0.9618 0.9509 131 0.8878 0.9185 0.9029 0.9779
0.0962 17.0 1632 0.0680 0.92 0.8932 0.9064 103 0.8793 0.8947 0.8870 171 0.9549 0.9695 0.9621 131 0.9140 0.9185 0.9163 0.9801
0.0909 18.0 1728 0.0644 0.91 0.8835 0.8966 103 0.8611 0.9064 0.8832 171 0.9403 0.9618 0.9509 131 0.8986 0.9185 0.9084 0.9798
0.088 19.0 1824 0.0634 0.9192 0.8835 0.9010 103 0.8757 0.9064 0.8908 171 0.9474 0.9618 0.9545 131 0.9095 0.9185 0.9140 0.9807
0.0836 20.0 1920 0.0636 0.8846 0.8932 0.8889 103 0.8619 0.9123 0.8864 171 0.9403 0.9618 0.9509 131 0.8926 0.9235 0.9078 0.9798
0.0836 21.0 2016 0.0599 0.8846 0.8932 0.8889 103 0.8636 0.8889 0.8761 171 0.9474 0.9618 0.9545 131 0.8959 0.9136 0.9046 0.9807
0.0827 22.0 2112 0.0596 0.8598 0.8932 0.8762 103 0.8470 0.9064 0.8757 171 0.9403 0.9618 0.9509 131 0.8797 0.9210 0.8999 0.9798
0.077 23.0 2208 0.0583 0.8667 0.8835 0.8750 103 0.8693 0.8947 0.8818 171 0.9621 0.9695 0.9658 131 0.8983 0.9160 0.9071 0.9807
0.0774 24.0 2304 0.0580 0.8230 0.9029 0.8611 103 0.8580 0.8480 0.8529 171 0.9474 0.9618 0.9545 131 0.8771 0.8988 0.8878 0.9787
0.0782 25.0 2400 0.0571 0.875 0.8835 0.8792 103 0.8922 0.8713 0.8817 171 0.9474 0.9618 0.9545 131 0.9059 0.9037 0.9048 0.9798
0.0771 26.0 2496 0.0560 0.9020 0.8932 0.8976 103 0.8729 0.9240 0.8977 171 0.9621 0.9695 0.9658 131 0.9084 0.9309 0.9195 0.9820
0.0726 27.0 2592 0.0534 0.92 0.8932 0.9064 103 0.8736 0.9298 0.9008 171 0.9474 0.9618 0.9545 131 0.9084 0.9309 0.9195 0.9826
0.0712 28.0 2688 0.0528 0.8932 0.8932 0.8932 103 0.8764 0.9123 0.8940 171 0.9474 0.9618 0.9545 131 0.9034 0.9235 0.9133 0.9826
0.0703 29.0 2784 0.0505 0.9192 0.8835 0.9010 103 0.8736 0.9298 0.9008 171 0.9621 0.9695 0.9658 131 0.9128 0.9309 0.9218 0.9845
0.0649 30.0 2880 0.0501 0.9286 0.8835 0.9055 103 0.8791 0.9357 0.9065 171 0.9474 0.9618 0.9545 131 0.9128 0.9309 0.9218 0.9837
0.0642 31.0 2976 0.0501 0.8519 0.8932 0.8720 103 0.8686 0.8889 0.8786 171 0.9621 0.9695 0.9658 131 0.8940 0.9160 0.9049 0.9820
0.0664 32.0 3072 0.0506 0.8426 0.8835 0.8626 103 0.9024 0.8655 0.8836 171 0.9474 0.9618 0.9545 131 0.9012 0.9012 0.9012 0.9818
0.0659 33.0 3168 0.0509 0.8349 0.8835 0.8585 103 0.8976 0.8713 0.8843 171 0.9621 0.9695 0.9658 131 0.9017 0.9062 0.9039 0.9820
0.0667 34.0 3264 0.0507 0.8835 0.8835 0.8835 103 0.8757 0.9064 0.8908 171 0.9621 0.9695 0.9658 131 0.9053 0.9210 0.9131 0.9826
0.0639 35.0 3360 0.0510 0.8598 0.8932 0.8762 103 0.9042 0.8830 0.8935 171 0.9621 0.9695 0.9658 131 0.9113 0.9136 0.9125 0.9826
0.0632 36.0 3456 0.0505 0.8762 0.8932 0.8846 103 0.9059 0.9006 0.9032 171 0.9621 0.9695 0.9658 131 0.9165 0.9210 0.9187 0.9837
0.0632 37.0 3552 0.0487 0.9286 0.8835 0.9055 103 0.9029 0.9240 0.9133 171 0.9621 0.9695 0.9658 131 0.9284 0.9284 0.9284 0.9845
0.0605 38.0 3648 0.0502 0.8679 0.8932 0.8804 103 0.9141 0.8713 0.8922 171 0.9621 0.9695 0.9658 131 0.9177 0.9086 0.9132 0.9832
0.0609 39.0 3744 0.0470 0.8679 0.8932 0.8804 103 0.8902 0.9006 0.8953 171 0.9621 0.9695 0.9658 131 0.9075 0.9210 0.9142 0.9843
0.0601 40.0 3840 0.0469 0.8922 0.8835 0.8878 103 0.9059 0.9006 0.9032 171 0.9621 0.9695 0.9658 131 0.9208 0.9185 0.9197 0.9848
0.0588 41.0 3936 0.0464 0.92 0.8932 0.9064 103 0.8983 0.9298 0.9138 171 0.9621 0.9695 0.9658 131 0.9242 0.9333 0.9287 0.9865
0.0576 42.0 4032 0.0452 0.92 0.8932 0.9064 103 0.8933 0.9298 0.9112 171 0.9621 0.9695 0.9658 131 0.9220 0.9333 0.9276 0.9870
0.057 43.0 4128 0.0457 0.8679 0.8932 0.8804 103 0.8994 0.8889 0.8941 171 0.9621 0.9695 0.9658 131 0.9115 0.9160 0.9138 0.9854
0.056 44.0 4224 0.0453 0.8932 0.8932 0.8932 103 0.8953 0.9006 0.8980 171 0.9621 0.9695 0.9658 131 0.9165 0.9210 0.9187 0.9848
0.0557 45.0 4320 0.0455 0.8598 0.8932 0.8762 103 0.8941 0.8889 0.8915 171 0.9621 0.9695 0.9658 131 0.9071 0.9160 0.9115 0.9851
0.0565 46.0 4416 0.0474 0.8304 0.9029 0.8651 103 0.8963 0.8596 0.8776 171 0.9621 0.9695 0.9658 131 0.8995 0.9062 0.9028 0.9829
0.0534 47.0 4512 0.0448 0.8932 0.8932 0.8932 103 0.8908 0.9064 0.8986 171 0.9695 0.9695 0.9695 131 0.9167 0.9235 0.9200 0.9859
0.0523 48.0 4608 0.0452 0.8611 0.9029 0.8815 103 0.9096 0.8830 0.8961 171 0.9621 0.9695 0.9658 131 0.9138 0.9160 0.9149 0.9859
0.0523 49.0 4704 0.0456 0.8611 0.9029 0.8815 103 0.8882 0.8830 0.8856 171 0.9621 0.9695 0.9658 131 0.9049 0.9160 0.9104 0.9848
0.0509 50.0 4800 0.0464 0.8532 0.9029 0.8774 103 0.9018 0.8596 0.8802 171 0.9695 0.9695 0.9695 131 0.9107 0.9062 0.9084 0.9829
0.052 51.0 4896 0.0468 0.8246 0.9126 0.8664 103 0.9068 0.8538 0.8795 171 0.9695 0.9695 0.9695 131 0.9039 0.9062 0.9051 0.9843
0.0511 52.0 4992 0.0443 0.9020 0.8932 0.8976 103 0.8953 0.9006 0.8980 171 0.9695 0.9695 0.9695 131 0.9210 0.9210 0.9210 0.9862
0.051 53.0 5088 0.0457 0.8532 0.9029 0.8774 103 0.8982 0.8772 0.8876 171 0.9621 0.9695 0.9658 131 0.9069 0.9136 0.9102 0.9840
0.0483 54.0 5184 0.0448 0.9029 0.9029 0.9029 103 0.8914 0.9123 0.9017 171 0.9695 0.9695 0.9695 131 0.9193 0.9284 0.9238 0.9856
0.0525 55.0 5280 0.0451 0.8774 0.9029 0.8900 103 0.8902 0.9006 0.8953 171 0.9695 0.9695 0.9695 131 0.9122 0.9235 0.9178 0.9854
0.0479 56.0 5376 0.0445 0.8692 0.9029 0.8857 103 0.8941 0.8889 0.8915 171 0.9695 0.9695 0.9695 131 0.9118 0.9185 0.9151 0.9851
0.0486 57.0 5472 0.0445 0.8692 0.9029 0.8857 103 0.8889 0.8889 0.8889 171 0.9621 0.9695 0.9658 131 0.9073 0.9185 0.9129 0.9848
0.0457 58.0 5568 0.0437 0.8692 0.9029 0.8857 103 0.8947 0.8947 0.8947 171 0.9695 0.9695 0.9695 131 0.9120 0.9210 0.9165 0.9856
0.0478 59.0 5664 0.0441 0.8532 0.9029 0.8774 103 0.8922 0.8713 0.8817 171 0.9695 0.9695 0.9695 131 0.9066 0.9111 0.9089 0.9848
0.0472 60.0 5760 0.0440 0.8611 0.9029 0.8815 103 0.8929 0.8772 0.8850 171 0.9695 0.9695 0.9695 131 0.9091 0.9136 0.9113 0.9851
0.049 61.0 5856 0.0445 0.8857 0.9029 0.8942 103 0.8844 0.8947 0.8895 171 0.9695 0.9695 0.9695 131 0.9120 0.9210 0.9165 0.9854
0.0476 62.0 5952 0.0456 0.8378 0.9029 0.8692 103 0.8862 0.8655 0.8757 171 0.9695 0.9695 0.9695 131 0.8998 0.9086 0.9042 0.9840
0.0457 63.0 6048 0.0440 0.8692 0.9029 0.8857 103 0.8953 0.9006 0.8980 171 0.9695 0.9695 0.9695 131 0.9122 0.9235 0.9178 0.9859
0.0446 64.0 6144 0.0448 0.8692 0.9029 0.8857 103 0.8935 0.8830 0.8882 171 0.9695 0.9695 0.9695 131 0.9115 0.9160 0.9138 0.9845
0.0465 65.0 6240 0.0447 0.8545 0.9126 0.8826 103 0.8988 0.8830 0.8909 171 0.9695 0.9695 0.9695 131 0.9095 0.9185 0.9140 0.9856
0.0456 66.0 6336 0.0451 0.8692 0.9029 0.8857 103 0.8976 0.8713 0.8843 171 0.9695 0.9695 0.9695 131 0.9134 0.9111 0.9122 0.9848
0.043 67.0 6432 0.0466 0.8611 0.9029 0.8815 103 0.8957 0.8538 0.8743 171 0.9695 0.9695 0.9695 131 0.9104 0.9037 0.9071 0.9823
0.0441 68.0 6528 0.0455 0.8704 0.9126 0.8910 103 0.8824 0.8772 0.8798 171 0.9695 0.9695 0.9695 131 0.9071 0.9160 0.9115 0.9845
0.0439 69.0 6624 0.0454 0.8692 0.9029 0.8857 103 0.8824 0.8772 0.8798 171 0.9695 0.9695 0.9695 131 0.9069 0.9136 0.9102 0.9848
0.0439 70.0 6720 0.0450 0.8611 0.9029 0.8815 103 0.8935 0.8830 0.8882 171 0.9695 0.9695 0.9695 131 0.9093 0.9160 0.9127 0.9851
0.0428 71.0 6816 0.0447 0.8785 0.9126 0.8952 103 0.8889 0.8889 0.8889 171 0.9695 0.9695 0.9695 131 0.9120 0.9210 0.9165 0.9859
0.0438 72.0 6912 0.0464 0.8611 0.9029 0.8815 103 0.9068 0.8538 0.8795 171 0.9695 0.9695 0.9695 131 0.915 0.9037 0.9093 0.9829
0.0431 73.0 7008 0.0448 0.8624 0.9126 0.8868 103 0.8876 0.8772 0.8824 171 0.9695 0.9695 0.9695 131 0.9071 0.9160 0.9115 0.9856
0.0415 74.0 7104 0.0458 0.8532 0.9029 0.8774 103 0.8862 0.8655 0.8757 171 0.9695 0.9695 0.9695 131 0.9042 0.9086 0.9064 0.9843
0.0429 75.0 7200 0.0461 0.8611 0.9029 0.8815 103 0.8810 0.8655 0.8732 171 0.9695 0.9695 0.9695 131 0.9042 0.9086 0.9064 0.9840
0.0426 76.0 7296 0.0454 0.8611 0.9029 0.8815 103 0.8810 0.8655 0.8732 171 0.9695 0.9695 0.9695 131 0.9042 0.9086 0.9064 0.9845
0.043 77.0 7392 0.0456 0.8692 0.9029 0.8857 103 0.8862 0.8655 0.8757 171 0.9695 0.9695 0.9695 131 0.9086 0.9086 0.9086 0.9845
0.0397 78.0 7488 0.0450 0.8611 0.9029 0.8815 103 0.8976 0.8713 0.8843 171 0.9695 0.9695 0.9695 131 0.9111 0.9111 0.9111 0.9856
0.0411 79.0 7584 0.0449 0.8611 0.9029 0.8815 103 0.8810 0.8655 0.8732 171 0.9695 0.9695 0.9695 131 0.9042 0.9086 0.9064 0.9848
0.0417 80.0 7680 0.0448 0.8611 0.9029 0.8815 103 0.8876 0.8772 0.8824 171 0.9695 0.9695 0.9695 131 0.9069 0.9136 0.9102 0.9856
0.0423 81.0 7776 0.0446 0.8611 0.9029 0.8815 103 0.8876 0.8772 0.8824 171 0.9695 0.9695 0.9695 131 0.9069 0.9136 0.9102 0.9856
0.0434 82.0 7872 0.0445 0.8611 0.9029 0.8815 103 0.9102 0.8889 0.8994 171 0.9695 0.9695 0.9695 131 0.9163 0.9185 0.9174 0.9862
0.0394 83.0 7968 0.0449 0.8611 0.9029 0.8815 103 0.8810 0.8655 0.8732 171 0.9695 0.9695 0.9695 131 0.9042 0.9086 0.9064 0.9851
0.0413 84.0 8064 0.0454 0.8611 0.9029 0.8815 103 0.8970 0.8655 0.8810 171 0.9695 0.9695 0.9695 131 0.9109 0.9086 0.9098 0.9851
0.0408 85.0 8160 0.0457 0.8611 0.9029 0.8815 103 0.8916 0.8655 0.8783 171 0.9695 0.9695 0.9695 131 0.9086 0.9086 0.9086 0.9843
0.0412 86.0 8256 0.0455 0.8611 0.9029 0.8815 103 0.8810 0.8655 0.8732 171 0.9695 0.9695 0.9695 131 0.9042 0.9086 0.9064 0.9845
0.0425 87.0 8352 0.0454 0.8611 0.9029 0.8815 103 0.8810 0.8655 0.8732 171 0.9695 0.9695 0.9695 131 0.9042 0.9086 0.9064 0.9843
0.0404 88.0 8448 0.0452 0.8611 0.9029 0.8815 103 0.8817 0.8713 0.8765 171 0.9695 0.9695 0.9695 131 0.9044 0.9111 0.9077 0.9851
0.0403 89.0 8544 0.0459 0.8611 0.9029 0.8815 103 0.8862 0.8655 0.8757 171 0.9695 0.9695 0.9695 131 0.9064 0.9086 0.9075 0.9843
0.0395 90.0 8640 0.0451 0.8611 0.9029 0.8815 103 0.8765 0.8713 0.8739 171 0.9695 0.9695 0.9695 131 0.9022 0.9111 0.9066 0.9845
0.0413 91.0 8736 0.0450 0.8611 0.9029 0.8815 103 0.8765 0.8713 0.8739 171 0.9695 0.9695 0.9695 131 0.9022 0.9111 0.9066 0.9848
0.039 92.0 8832 0.0452 0.8611 0.9029 0.8815 103 0.8765 0.8713 0.8739 171 0.9695 0.9695 0.9695 131 0.9022 0.9111 0.9066 0.9848
0.0402 93.0 8928 0.0454 0.8611 0.9029 0.8815 103 0.8817 0.8713 0.8765 171 0.9695 0.9695 0.9695 131 0.9044 0.9111 0.9077 0.9848
0.0397 94.0 9024 0.0453 0.8611 0.9029 0.8815 103 0.8765 0.8713 0.8739 171 0.9695 0.9695 0.9695 131 0.9022 0.9111 0.9066 0.9848
0.0409 95.0 9120 0.0455 0.8611 0.9029 0.8815 103 0.8817 0.8713 0.8765 171 0.9695 0.9695 0.9695 131 0.9044 0.9111 0.9077 0.9848
0.039 96.0 9216 0.0455 0.8611 0.9029 0.8815 103 0.8765 0.8713 0.8739 171 0.9695 0.9695 0.9695 131 0.9022 0.9111 0.9066 0.9845
0.0402 97.0 9312 0.0456 0.8611 0.9029 0.8815 103 0.8765 0.8713 0.8739 171 0.9695 0.9695 0.9695 131 0.9022 0.9111 0.9066 0.9845
0.0382 98.0 9408 0.0455 0.8611 0.9029 0.8815 103 0.8817 0.8713 0.8765 171 0.9695 0.9695 0.9695 131 0.9044 0.9111 0.9077 0.9848
0.0392 99.0 9504 0.0455 0.8611 0.9029 0.8815 103 0.8817 0.8713 0.8765 171 0.9695 0.9695 0.9695 131 0.9044 0.9111 0.9077 0.9848
0.0412 100.0 9600 0.0454 0.8611 0.9029 0.8815 103 0.8817 0.8713 0.8765 171 0.9695 0.9695 0.9695 131 0.9044 0.9111 0.9077 0.9848
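
Each row above holds 20 whitespace-separated values, one per column. The sketch below parses rows in that layout and compares epochs by validation loss and by Overall F1. The three sample rows are copied from the table (epochs 41, 58 and 100); the short column names and the `parse` helper are hypothetical and introduced only for this example.

```python
# Short column names (hypothetical) in the same order as the table header.
COLUMNS = [
    "train_loss", "epoch", "step", "val_loss",
    "loc_p", "loc_r", "loc_f1", "loc_n",
    "org_p", "org_r", "org_f1", "org_n",
    "per_p", "per_r", "per_f1", "per_n",
    "overall_p", "overall_r", "overall_f1", "overall_acc",
]

# Three rows copied verbatim from the training results table.
ROWS = """\
0.0588 41.0 3936 0.0464 0.92 0.8932 0.9064 103 0.8983 0.9298 0.9138 171 0.9621 0.9695 0.9658 131 0.9242 0.9333 0.9287 0.9865
0.0457 58.0 5568 0.0437 0.8692 0.9029 0.8857 103 0.8947 0.8947 0.8947 171 0.9695 0.9695 0.9695 131 0.9120 0.9210 0.9165 0.9856
0.0412 100.0 9600 0.0454 0.8611 0.9029 0.8815 103 0.8817 0.8713 0.8765 171 0.9695 0.9695 0.9695 131 0.9044 0.9111 0.9077 0.9848
"""


def parse(text):
    """Turn whitespace-separated table rows into a list of dicts."""
    records = []
    for line in text.strip().splitlines():
        values = [float(v) for v in line.split()]
        if len(values) != len(COLUMNS):
            raise ValueError(f"expected {len(COLUMNS)} fields, got {len(values)}")
        records.append(dict(zip(COLUMNS, values)))
    return records


records = parse(ROWS)
best_f1 = max(records, key=lambda r: r["overall_f1"])
best_loss = min(records, key=lambda r: r["val_loss"])
final = max(records, key=lambda r: r["epoch"])

print("highest Overall F1:", best_f1["epoch"], best_f1["overall_f1"])
print("lowest validation loss:", best_loss["epoch"], best_loss["val_loss"])
print("final epoch:", final["epoch"], final["overall_f1"])
```

Among these sample rows, the final epoch has neither the lowest validation loss nor the highest Overall F1. The card does not say whether the published weights come from the last step or from an earlier checkpoint, so pasting all 100 rows into `ROWS` is one way to compare them.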

Framework versions

  • Transformers 4.39.3
  • PyTorch 2.3.0+cu121
  • Datasets 2.19.1
  • Tokenizers 0.15.2