nerui-lora-r16-4 / README.md
apwic's picture
End of training
7acf190 verified
|
raw
history blame
36.4 kB
metadata
language:
  - id
license: mit
base_model: indolem/indobert-base-uncased
tags:
  - generated_from_trainer
model-index:
  - name: nerui-lora-r16-4
    results: []

nerui-lora-r16-4

This model is a fine-tuned version of indolem/indobert-base-uncased on an unknown dataset. It achieves the following results on the evaluation set:

  • Loss: 0.0517
  • Location Precision: 0.8727
  • Location Recall: 0.9320
  • Location F1: 0.9014
  • Location Number: 103
  • Organization Precision: 0.875
  • Organization Recall: 0.8596
  • Organization F1: 0.8673
  • Organization Number: 171
  • Person Precision: 0.9695
  • Person Recall: 0.9695
  • Person F1: 0.9695
  • Person Number: 131
  • Overall Precision: 0.9046
  • Overall Recall: 0.9136
  • Overall F1: 0.9091
  • Overall Accuracy: 0.9834

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 5e-05
  • train_batch_size: 16
  • eval_batch_size: 64
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 100.0

Training results

Training Loss Epoch Step Validation Loss Location Precision Location Recall Location F1 Location Number Organization Precision Organization Recall Organization F1 Organization Number Person Precision Person Recall Person F1 Person Number Overall Precision Overall Recall Overall F1 Overall Accuracy
1.0769 1.0 96 0.6692 0.0 0.0 0.0 103 0.0 0.0 0.0 171 0.0 0.0 0.0 131 0.0 0.0 0.0 0.8373
0.6366 2.0 192 0.5158 0.0 0.0 0.0 103 0.0 0.0 0.0 171 0.0 0.0 0.0 131 0.0 0.0 0.0 0.8382
0.4812 3.0 288 0.3594 0.1364 0.0291 0.048 103 0.3654 0.2222 0.2764 171 0.2932 0.2977 0.2955 131 0.3089 0.1975 0.2410 0.8763
0.3408 4.0 384 0.2543 0.4133 0.3010 0.3483 103 0.4952 0.6023 0.5435 171 0.4971 0.6565 0.5658 131 0.4825 0.5432 0.5110 0.9207
0.2457 5.0 480 0.1880 0.6552 0.5534 0.6 103 0.6306 0.8187 0.7125 171 0.8151 0.9084 0.8592 131 0.6945 0.7802 0.7349 0.9519
0.1942 6.0 576 0.1469 0.7426 0.7282 0.7353 103 0.7374 0.8538 0.7913 171 0.8936 0.9618 0.9265 131 0.7886 0.8568 0.8213 0.9624
0.1609 7.0 672 0.1250 0.7925 0.8155 0.8038 103 0.7732 0.8772 0.8219 171 0.8944 0.9695 0.9304 131 0.8167 0.8914 0.8524 0.9660
0.1441 8.0 768 0.1038 0.7706 0.8155 0.7925 103 0.7849 0.8538 0.8179 171 0.9338 0.9695 0.9513 131 0.8283 0.8815 0.8541 0.9693
0.1317 9.0 864 0.0943 0.8627 0.8544 0.8585 103 0.7795 0.8889 0.8306 171 0.9203 0.9695 0.9442 131 0.8437 0.9062 0.8738 0.9710
0.1184 10.0 960 0.0872 0.8224 0.8544 0.8381 103 0.8021 0.9006 0.8485 171 0.9137 0.9695 0.9407 131 0.8425 0.9111 0.8754 0.9713
0.1103 11.0 1056 0.0762 0.89 0.8641 0.8768 103 0.8297 0.8830 0.8555 171 0.9265 0.9618 0.9438 131 0.8756 0.9037 0.8894 0.9749
0.1049 12.0 1152 0.0706 0.875 0.8835 0.8792 103 0.8315 0.8947 0.8620 171 0.9407 0.9695 0.9549 131 0.8771 0.9160 0.8961 0.9779
0.0961 13.0 1248 0.0640 0.8667 0.8835 0.8750 103 0.8580 0.8830 0.8703 171 0.9478 0.9695 0.9585 131 0.8892 0.9111 0.9000 0.9779
0.0909 14.0 1344 0.0653 0.8571 0.8738 0.8654 103 0.9018 0.8596 0.8802 171 0.9695 0.9695 0.9695 131 0.9123 0.8988 0.9055 0.9801
0.0888 15.0 1440 0.0579 0.9020 0.8932 0.8976 103 0.8571 0.9123 0.8839 171 0.9621 0.9695 0.9658 131 0.9014 0.9259 0.9135 0.9809
0.0873 16.0 1536 0.0562 0.8774 0.9029 0.8900 103 0.8495 0.9240 0.8852 171 0.9545 0.9618 0.9582 131 0.8892 0.9309 0.9095 0.9815
0.0827 17.0 1632 0.0557 0.9010 0.8835 0.8922 103 0.8728 0.8830 0.8779 171 0.9621 0.9695 0.9658 131 0.9089 0.9111 0.9100 0.9807
0.0798 18.0 1728 0.0514 0.8857 0.9029 0.8942 103 0.8920 0.9181 0.9049 171 0.9474 0.9618 0.9545 131 0.9082 0.9284 0.9182 0.9845
0.076 19.0 1824 0.0527 0.8952 0.9126 0.9038 103 0.8953 0.9006 0.8980 171 0.9474 0.9618 0.9545 131 0.9122 0.9235 0.9178 0.9834
0.0712 20.0 1920 0.0524 0.8962 0.9223 0.9091 103 0.8729 0.9240 0.8977 171 0.9478 0.9695 0.9585 131 0.9026 0.9383 0.9201 0.9837
0.072 21.0 2016 0.0508 0.8952 0.9126 0.9038 103 0.8941 0.8889 0.8915 171 0.9549 0.9695 0.9621 131 0.9142 0.9210 0.9176 0.9837
0.0717 22.0 2112 0.0481 0.8785 0.9126 0.8952 103 0.8619 0.9123 0.8864 171 0.9695 0.9695 0.9695 131 0.8998 0.9309 0.9150 0.9829
0.0644 23.0 2208 0.0492 0.8692 0.9029 0.8857 103 0.9064 0.9064 0.9064 171 0.9695 0.9695 0.9695 131 0.9169 0.9259 0.9214 0.9843
0.0647 24.0 2304 0.0494 0.8692 0.9029 0.8857 103 0.8935 0.8830 0.8882 171 0.9695 0.9695 0.9695 131 0.9115 0.9160 0.9138 0.9826
0.0652 25.0 2400 0.0511 0.8692 0.9029 0.8857 103 0.9018 0.8596 0.8802 171 0.9621 0.9695 0.9658 131 0.9129 0.9062 0.9095 0.9815
0.0635 26.0 2496 0.0465 0.8942 0.9029 0.8986 103 0.8883 0.9298 0.9086 171 0.9695 0.9695 0.9695 131 0.9155 0.9358 0.9255 0.9845
0.0597 27.0 2592 0.0450 0.8868 0.9126 0.8995 103 0.9075 0.9181 0.9128 171 0.9474 0.9618 0.9545 131 0.9150 0.9309 0.9229 0.9859
0.0596 28.0 2688 0.0456 0.8785 0.9126 0.8952 103 0.8977 0.9240 0.9107 171 0.9846 0.9771 0.9808 131 0.9201 0.9383 0.9291 0.9865
0.0588 29.0 2784 0.0439 0.8846 0.8932 0.8889 103 0.8701 0.9006 0.8851 171 0.9545 0.9618 0.9582 131 0.9007 0.9185 0.9095 0.9845
0.0546 30.0 2880 0.0453 0.8774 0.9029 0.8900 103 0.88 0.9006 0.8902 171 0.9695 0.9695 0.9695 131 0.9078 0.9235 0.9155 0.9848
0.0543 31.0 2976 0.0455 0.8319 0.9126 0.8704 103 0.8976 0.8713 0.8843 171 0.9695 0.9695 0.9695 131 0.9024 0.9136 0.9080 0.9832
0.0536 32.0 3072 0.0458 0.8304 0.9029 0.8651 103 0.9080 0.8655 0.8862 171 0.9695 0.9695 0.9695 131 0.9064 0.9086 0.9075 0.9834
0.0529 33.0 3168 0.0472 0.8190 0.9223 0.8676 103 0.9125 0.8538 0.8822 171 0.9695 0.9695 0.9695 131 0.9042 0.9086 0.9064 0.9832
0.054 34.0 3264 0.0448 0.8868 0.9126 0.8995 103 0.8743 0.8947 0.8844 171 0.9695 0.9695 0.9695 131 0.9078 0.9235 0.9155 0.9851
0.052 35.0 3360 0.0444 0.8532 0.9029 0.8774 103 0.8876 0.8772 0.8824 171 0.9695 0.9695 0.9695 131 0.9046 0.9136 0.9091 0.9845
0.0513 36.0 3456 0.0451 0.8393 0.9126 0.8744 103 0.8721 0.8772 0.8746 171 0.9695 0.9695 0.9695 131 0.8940 0.9160 0.9049 0.9837
0.0501 37.0 3552 0.0453 0.9118 0.9029 0.9073 103 0.8757 0.9064 0.8908 171 0.9695 0.9695 0.9695 131 0.9146 0.9259 0.9202 0.9851
0.048 38.0 3648 0.0485 0.8624 0.9126 0.8868 103 0.9080 0.8655 0.8862 171 0.9695 0.9695 0.9695 131 0.9156 0.9111 0.9134 0.9837
0.0484 39.0 3744 0.0451 0.8774 0.9029 0.8900 103 0.8652 0.9006 0.8825 171 0.9695 0.9695 0.9695 131 0.9012 0.9235 0.9122 0.9845
0.0498 40.0 3840 0.0448 0.9126 0.9126 0.9126 103 0.8736 0.8889 0.8812 171 0.9621 0.9695 0.9658 131 0.9120 0.9210 0.9165 0.9851
0.047 41.0 3936 0.0427 0.8774 0.9029 0.8900 103 0.8757 0.9064 0.8908 171 0.9695 0.9695 0.9695 131 0.9058 0.9259 0.9158 0.9854
0.0465 42.0 4032 0.0416 0.8942 0.9029 0.8986 103 0.8715 0.9123 0.8914 171 0.9695 0.9695 0.9695 131 0.9082 0.9284 0.9182 0.9859
0.0443 43.0 4128 0.0423 0.8785 0.9126 0.8952 103 0.8743 0.8947 0.8844 171 0.9695 0.9695 0.9695 131 0.9056 0.9235 0.9144 0.9854
0.0428 44.0 4224 0.0433 0.8624 0.9126 0.8868 103 0.8882 0.8830 0.8856 171 0.9695 0.9695 0.9695 131 0.9073 0.9185 0.9129 0.9854
0.0437 45.0 4320 0.0440 0.8393 0.9126 0.8744 103 0.8876 0.8772 0.8824 171 0.9695 0.9695 0.9695 131 0.9005 0.9160 0.9082 0.9848
0.0447 46.0 4416 0.0479 0.8462 0.9612 0.9 103 0.9125 0.8538 0.8822 171 0.9695 0.9695 0.9695 131 0.9118 0.9185 0.9151 0.9840
0.0421 47.0 4512 0.0437 0.8796 0.9223 0.9005 103 0.8701 0.9006 0.8851 171 0.9695 0.9695 0.9695 131 0.9038 0.9284 0.9160 0.9851
0.0403 48.0 4608 0.0450 0.8522 0.9515 0.8991 103 0.9141 0.8713 0.8922 171 0.9695 0.9695 0.9695 131 0.9144 0.9235 0.9189 0.9856
0.0423 49.0 4704 0.0475 0.8448 0.9515 0.8950 103 0.9074 0.8596 0.8829 171 0.9695 0.9695 0.9695 131 0.9095 0.9185 0.9140 0.9851
0.039 50.0 4800 0.0503 0.8435 0.9417 0.8899 103 0.9187 0.8596 0.8882 171 0.9695 0.9695 0.9695 131 0.9138 0.9160 0.9149 0.9837
0.0395 51.0 4896 0.0512 0.8448 0.9515 0.8950 103 0.9130 0.8596 0.8855 171 0.9695 0.9695 0.9695 131 0.9118 0.9185 0.9151 0.9840
0.0405 52.0 4992 0.0451 0.8559 0.9223 0.8879 103 0.8713 0.8713 0.8713 171 0.9695 0.9695 0.9695 131 0.8983 0.9160 0.9071 0.9843
0.0383 53.0 5088 0.0474 0.8435 0.9417 0.8899 103 0.9012 0.8538 0.8769 171 0.9695 0.9695 0.9695 131 0.9069 0.9136 0.9102 0.9834
0.0355 54.0 5184 0.0450 0.8972 0.9320 0.9143 103 0.9012 0.9064 0.9038 171 0.9695 0.9695 0.9695 131 0.9220 0.9333 0.9276 0.9865
0.0404 55.0 5280 0.0495 0.8727 0.9320 0.9014 103 0.8896 0.8480 0.8683 171 0.9695 0.9695 0.9695 131 0.9109 0.9086 0.9098 0.9829
0.0367 56.0 5376 0.0473 0.8571 0.9320 0.8930 103 0.8982 0.8772 0.8876 171 0.9695 0.9695 0.9695 131 0.9098 0.9210 0.9153 0.9854
0.0383 57.0 5472 0.0486 0.8496 0.9320 0.8889 103 0.8922 0.8713 0.8817 171 0.9695 0.9695 0.9695 131 0.9051 0.9185 0.9118 0.9848
0.0353 58.0 5568 0.0482 0.8571 0.9320 0.8930 103 0.9030 0.8713 0.8869 171 0.9695 0.9695 0.9695 131 0.9118 0.9185 0.9151 0.9843
0.0362 59.0 5664 0.0470 0.8649 0.9320 0.8972 103 0.8862 0.8655 0.8757 171 0.9695 0.9695 0.9695 131 0.9071 0.9160 0.9115 0.9845
0.0351 60.0 5760 0.0487 0.8727 0.9320 0.9014 103 0.8855 0.8596 0.8724 171 0.9695 0.9695 0.9695 131 0.9091 0.9136 0.9113 0.9843
0.0399 61.0 5856 0.0487 0.8889 0.9320 0.9100 103 0.8869 0.8713 0.8791 171 0.9695 0.9695 0.9695 131 0.9140 0.9185 0.9163 0.9843
0.0364 62.0 5952 0.0500 0.8673 0.9515 0.9074 103 0.9125 0.8538 0.8822 171 0.9695 0.9695 0.9695 131 0.9183 0.9160 0.9172 0.9834
0.0342 63.0 6048 0.0466 0.8649 0.9320 0.8972 103 0.8757 0.8655 0.8706 171 0.9695 0.9695 0.9695 131 0.9027 0.9160 0.9093 0.9843
0.0347 64.0 6144 0.0487 0.8807 0.9320 0.9057 103 0.8916 0.8655 0.8783 171 0.9695 0.9695 0.9695 131 0.9138 0.9160 0.9149 0.9840
0.0357 65.0 6240 0.0469 0.8596 0.9515 0.9032 103 0.9024 0.8655 0.8836 171 0.9695 0.9695 0.9695 131 0.9120 0.9210 0.9165 0.9856
0.035 66.0 6336 0.0515 0.8661 0.9417 0.9023 103 0.9119 0.8480 0.8788 171 0.9695 0.9695 0.9695 131 0.9179 0.9111 0.9145 0.9832
0.0341 67.0 6432 0.0496 0.8496 0.9320 0.8889 103 0.8909 0.8596 0.875 171 0.9695 0.9695 0.9695 131 0.9046 0.9136 0.9091 0.9829
0.0336 68.0 6528 0.0496 0.8559 0.9223 0.8879 103 0.8765 0.8713 0.8739 171 0.9695 0.9695 0.9695 131 0.9005 0.9160 0.9082 0.9840
0.033 69.0 6624 0.0500 0.8661 0.9417 0.9023 103 0.8902 0.8538 0.8716 171 0.9695 0.9695 0.9695 131 0.9091 0.9136 0.9113 0.9837
0.0327 70.0 6720 0.0488 0.8649 0.9320 0.8972 103 0.8795 0.8538 0.8665 171 0.9695 0.9695 0.9695 131 0.9044 0.9111 0.9077 0.9837
0.0331 71.0 6816 0.0478 0.8571 0.9320 0.8930 103 0.8922 0.8713 0.8817 171 0.9695 0.9695 0.9695 131 0.9073 0.9185 0.9129 0.9851
0.0311 72.0 6912 0.0545 0.8596 0.9515 0.9032 103 0.9295 0.8480 0.8869 171 0.9695 0.9695 0.9695 131 0.9227 0.9136 0.9181 0.9832
0.033 73.0 7008 0.0502 0.8684 0.9612 0.9124 103 0.9024 0.8655 0.8836 171 0.9695 0.9695 0.9695 131 0.9144 0.9235 0.9189 0.9859
0.0321 74.0 7104 0.0519 0.8673 0.9515 0.9074 103 0.9074 0.8596 0.8829 171 0.9695 0.9695 0.9695 131 0.9163 0.9185 0.9174 0.9840
0.032 75.0 7200 0.0512 0.8636 0.9223 0.8920 103 0.8848 0.8538 0.8690 171 0.9695 0.9695 0.9695 131 0.9064 0.9086 0.9075 0.9829
0.0316 76.0 7296 0.0496 0.8584 0.9417 0.8981 103 0.8757 0.8655 0.8706 171 0.9695 0.9695 0.9695 131 0.9007 0.9185 0.9095 0.9843
0.0318 77.0 7392 0.0508 0.8739 0.9417 0.9065 103 0.8909 0.8596 0.875 171 0.9695 0.9695 0.9695 131 0.9115 0.9160 0.9138 0.9834
0.0294 78.0 7488 0.0518 0.8739 0.9417 0.9065 103 0.8902 0.8538 0.8716 171 0.9695 0.9695 0.9695 131 0.9113 0.9136 0.9125 0.9829
0.0307 79.0 7584 0.0515 0.8727 0.9320 0.9014 103 0.8848 0.8538 0.8690 171 0.9695 0.9695 0.9695 131 0.9089 0.9111 0.9100 0.9826
0.0314 80.0 7680 0.0506 0.8673 0.9515 0.9074 103 0.8909 0.8596 0.875 171 0.9695 0.9695 0.9695 131 0.9095 0.9185 0.9140 0.9845
0.0323 81.0 7776 0.0517 0.8761 0.9612 0.9167 103 0.9018 0.8596 0.8802 171 0.9695 0.9695 0.9695 131 0.9165 0.9210 0.9187 0.9837
0.0314 82.0 7872 0.0495 0.8807 0.9320 0.9057 103 0.8765 0.8713 0.8739 171 0.9695 0.9695 0.9695 131 0.9073 0.9185 0.9129 0.9843
0.0301 83.0 7968 0.0525 0.8739 0.9417 0.9065 103 0.8963 0.8596 0.8776 171 0.9695 0.9695 0.9695 131 0.9138 0.9160 0.9149 0.9829
0.0309 84.0 8064 0.0526 0.875 0.9515 0.9116 103 0.9018 0.8596 0.8802 171 0.9695 0.9695 0.9695 131 0.9163 0.9185 0.9174 0.9832
0.0305 85.0 8160 0.0519 0.875 0.9515 0.9116 103 0.8855 0.8596 0.8724 171 0.9695 0.9695 0.9695 131 0.9095 0.9185 0.9140 0.9840
0.0295 86.0 8256 0.0519 0.875 0.9515 0.9116 103 0.8909 0.8596 0.875 171 0.9695 0.9695 0.9695 131 0.9118 0.9185 0.9151 0.9837
0.0316 87.0 8352 0.0517 0.8739 0.9417 0.9065 103 0.8963 0.8596 0.8776 171 0.9695 0.9695 0.9695 131 0.9138 0.9160 0.9149 0.9829
0.0298 88.0 8448 0.0518 0.8727 0.9320 0.9014 103 0.8802 0.8596 0.8698 171 0.9695 0.9695 0.9695 131 0.9069 0.9136 0.9102 0.9832
0.0299 89.0 8544 0.0534 0.8649 0.9320 0.8972 103 0.9018 0.8596 0.8802 171 0.9695 0.9695 0.9695 131 0.9136 0.9136 0.9136 0.9829
0.0292 90.0 8640 0.0517 0.8727 0.9320 0.9014 103 0.8802 0.8596 0.8698 171 0.9695 0.9695 0.9695 131 0.9069 0.9136 0.9102 0.9832
0.0301 91.0 8736 0.0511 0.8727 0.9320 0.9014 103 0.875 0.8596 0.8673 171 0.9695 0.9695 0.9695 131 0.9046 0.9136 0.9091 0.9834
0.0294 92.0 8832 0.0518 0.8727 0.9320 0.9014 103 0.8802 0.8596 0.8698 171 0.9695 0.9695 0.9695 131 0.9069 0.9136 0.9102 0.9832
0.0296 93.0 8928 0.0516 0.8727 0.9320 0.9014 103 0.875 0.8596 0.8673 171 0.9695 0.9695 0.9695 131 0.9046 0.9136 0.9091 0.9834
0.0293 94.0 9024 0.0523 0.8661 0.9417 0.9023 103 0.8909 0.8596 0.875 171 0.9695 0.9695 0.9695 131 0.9093 0.9160 0.9127 0.9840
0.0295 95.0 9120 0.0515 0.8727 0.9320 0.9014 103 0.875 0.8596 0.8673 171 0.9695 0.9695 0.9695 131 0.9046 0.9136 0.9091 0.9834
0.0284 96.0 9216 0.0515 0.8727 0.9320 0.9014 103 0.875 0.8596 0.8673 171 0.9695 0.9695 0.9695 131 0.9046 0.9136 0.9091 0.9834
0.0289 97.0 9312 0.0522 0.8636 0.9223 0.8920 103 0.8802 0.8596 0.8698 171 0.9695 0.9695 0.9695 131 0.9044 0.9111 0.9077 0.9834
0.0282 98.0 9408 0.0520 0.8636 0.9223 0.8920 103 0.8802 0.8596 0.8698 171 0.9695 0.9695 0.9695 131 0.9044 0.9111 0.9077 0.9834
0.0287 99.0 9504 0.0519 0.8727 0.9320 0.9014 103 0.875 0.8596 0.8673 171 0.9695 0.9695 0.9695 131 0.9046 0.9136 0.9091 0.9834
0.0301 100.0 9600 0.0517 0.8727 0.9320 0.9014 103 0.875 0.8596 0.8673 171 0.9695 0.9695 0.9695 131 0.9046 0.9136 0.9091 0.9834

Framework versions

  • Transformers 4.39.3
  • Pytorch 2.3.0+cu121
  • Datasets 2.19.1
  • Tokenizers 0.15.2