File size: 36,369 Bytes
13891b1
7acf190
 
13891b1
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
---
language:
- id
license: mit
base_model: indolem/indobert-base-uncased
tags:
- generated_from_trainer
model-index:
- name: nerui-lora-r16-4
  results: []
---

<!-- This model card has been generated automatically according to the information the Trainer had access to. You
should probably proofread and complete it, then remove this comment. -->

# nerui-lora-r16-4

This model is a fine-tuned version of [indolem/indobert-base-uncased](https://huggingface.co./indolem/indobert-base-uncased) on an unknown dataset.
It achieves the following results on the evaluation set:
- Loss: 0.0517
- Location Precision: 0.8727
- Location Recall: 0.9320
- Location F1: 0.9014
- Location Number: 103
- Organization Precision: 0.875
- Organization Recall: 0.8596
- Organization F1: 0.8673
- Organization Number: 171
- Person Precision: 0.9695
- Person Recall: 0.9695
- Person F1: 0.9695
- Person Number: 131
- Overall Precision: 0.9046
- Overall Recall: 0.9136
- Overall F1: 0.9091
- Overall Accuracy: 0.9834

## Model description

More information needed

## Intended uses & limitations

More information needed

## Training and evaluation data

More information needed

## Training procedure

### Training hyperparameters

The following hyperparameters were used during training:
- learning_rate: 5e-05
- train_batch_size: 16
- eval_batch_size: 64
- seed: 42
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- num_epochs: 100.0

### Training results

| Training Loss | Epoch | Step | Validation Loss | Location Precision | Location Recall | Location F1 | Location Number | Organization Precision | Organization Recall | Organization F1 | Organization Number | Person Precision | Person Recall | Person F1 | Person Number | Overall Precision | Overall Recall | Overall F1 | Overall Accuracy |
|:-------------:|:-----:|:----:|:---------------:|:------------------:|:---------------:|:-----------:|:---------------:|:----------------------:|:-------------------:|:---------------:|:-------------------:|:----------------:|:-------------:|:---------:|:-------------:|:-----------------:|:--------------:|:----------:|:----------------:|
| 1.0769        | 1.0   | 96   | 0.6692          | 0.0                | 0.0             | 0.0         | 103             | 0.0                    | 0.0                 | 0.0             | 171                 | 0.0              | 0.0           | 0.0       | 131           | 0.0               | 0.0            | 0.0        | 0.8373           |
| 0.6366        | 2.0   | 192  | 0.5158          | 0.0                | 0.0             | 0.0         | 103             | 0.0                    | 0.0                 | 0.0             | 171                 | 0.0              | 0.0           | 0.0       | 131           | 0.0               | 0.0            | 0.0        | 0.8382           |
| 0.4812        | 3.0   | 288  | 0.3594          | 0.1364             | 0.0291          | 0.048       | 103             | 0.3654                 | 0.2222              | 0.2764          | 171                 | 0.2932           | 0.2977        | 0.2955    | 131           | 0.3089            | 0.1975         | 0.2410     | 0.8763           |
| 0.3408        | 4.0   | 384  | 0.2543          | 0.4133             | 0.3010          | 0.3483      | 103             | 0.4952                 | 0.6023              | 0.5435          | 171                 | 0.4971           | 0.6565        | 0.5658    | 131           | 0.4825            | 0.5432         | 0.5110     | 0.9207           |
| 0.2457        | 5.0   | 480  | 0.1880          | 0.6552             | 0.5534          | 0.6         | 103             | 0.6306                 | 0.8187              | 0.7125          | 171                 | 0.8151           | 0.9084        | 0.8592    | 131           | 0.6945            | 0.7802         | 0.7349     | 0.9519           |
| 0.1942        | 6.0   | 576  | 0.1469          | 0.7426             | 0.7282          | 0.7353      | 103             | 0.7374                 | 0.8538              | 0.7913          | 171                 | 0.8936           | 0.9618        | 0.9265    | 131           | 0.7886            | 0.8568         | 0.8213     | 0.9624           |
| 0.1609        | 7.0   | 672  | 0.1250          | 0.7925             | 0.8155          | 0.8038      | 103             | 0.7732                 | 0.8772              | 0.8219          | 171                 | 0.8944           | 0.9695        | 0.9304    | 131           | 0.8167            | 0.8914         | 0.8524     | 0.9660           |
| 0.1441        | 8.0   | 768  | 0.1038          | 0.7706             | 0.8155          | 0.7925      | 103             | 0.7849                 | 0.8538              | 0.8179          | 171                 | 0.9338           | 0.9695        | 0.9513    | 131           | 0.8283            | 0.8815         | 0.8541     | 0.9693           |
| 0.1317        | 9.0   | 864  | 0.0943          | 0.8627             | 0.8544          | 0.8585      | 103             | 0.7795                 | 0.8889              | 0.8306          | 171                 | 0.9203           | 0.9695        | 0.9442    | 131           | 0.8437            | 0.9062         | 0.8738     | 0.9710           |
| 0.1184        | 10.0  | 960  | 0.0872          | 0.8224             | 0.8544          | 0.8381      | 103             | 0.8021                 | 0.9006              | 0.8485          | 171                 | 0.9137           | 0.9695        | 0.9407    | 131           | 0.8425            | 0.9111         | 0.8754     | 0.9713           |
| 0.1103        | 11.0  | 1056 | 0.0762          | 0.89               | 0.8641          | 0.8768      | 103             | 0.8297                 | 0.8830              | 0.8555          | 171                 | 0.9265           | 0.9618        | 0.9438    | 131           | 0.8756            | 0.9037         | 0.8894     | 0.9749           |
| 0.1049        | 12.0  | 1152 | 0.0706          | 0.875              | 0.8835          | 0.8792      | 103             | 0.8315                 | 0.8947              | 0.8620          | 171                 | 0.9407           | 0.9695        | 0.9549    | 131           | 0.8771            | 0.9160         | 0.8961     | 0.9779           |
| 0.0961        | 13.0  | 1248 | 0.0640          | 0.8667             | 0.8835          | 0.8750      | 103             | 0.8580                 | 0.8830              | 0.8703          | 171                 | 0.9478           | 0.9695        | 0.9585    | 131           | 0.8892            | 0.9111         | 0.9000     | 0.9779           |
| 0.0909        | 14.0  | 1344 | 0.0653          | 0.8571             | 0.8738          | 0.8654      | 103             | 0.9018                 | 0.8596              | 0.8802          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9123            | 0.8988         | 0.9055     | 0.9801           |
| 0.0888        | 15.0  | 1440 | 0.0579          | 0.9020             | 0.8932          | 0.8976      | 103             | 0.8571                 | 0.9123              | 0.8839          | 171                 | 0.9621           | 0.9695        | 0.9658    | 131           | 0.9014            | 0.9259         | 0.9135     | 0.9809           |
| 0.0873        | 16.0  | 1536 | 0.0562          | 0.8774             | 0.9029          | 0.8900      | 103             | 0.8495                 | 0.9240              | 0.8852          | 171                 | 0.9545           | 0.9618        | 0.9582    | 131           | 0.8892            | 0.9309         | 0.9095     | 0.9815           |
| 0.0827        | 17.0  | 1632 | 0.0557          | 0.9010             | 0.8835          | 0.8922      | 103             | 0.8728                 | 0.8830              | 0.8779          | 171                 | 0.9621           | 0.9695        | 0.9658    | 131           | 0.9089            | 0.9111         | 0.9100     | 0.9807           |
| 0.0798        | 18.0  | 1728 | 0.0514          | 0.8857             | 0.9029          | 0.8942      | 103             | 0.8920                 | 0.9181              | 0.9049          | 171                 | 0.9474           | 0.9618        | 0.9545    | 131           | 0.9082            | 0.9284         | 0.9182     | 0.9845           |
| 0.076         | 19.0  | 1824 | 0.0527          | 0.8952             | 0.9126          | 0.9038      | 103             | 0.8953                 | 0.9006              | 0.8980          | 171                 | 0.9474           | 0.9618        | 0.9545    | 131           | 0.9122            | 0.9235         | 0.9178     | 0.9834           |
| 0.0712        | 20.0  | 1920 | 0.0524          | 0.8962             | 0.9223          | 0.9091      | 103             | 0.8729                 | 0.9240              | 0.8977          | 171                 | 0.9478           | 0.9695        | 0.9585    | 131           | 0.9026            | 0.9383         | 0.9201     | 0.9837           |
| 0.072         | 21.0  | 2016 | 0.0508          | 0.8952             | 0.9126          | 0.9038      | 103             | 0.8941                 | 0.8889              | 0.8915          | 171                 | 0.9549           | 0.9695        | 0.9621    | 131           | 0.9142            | 0.9210         | 0.9176     | 0.9837           |
| 0.0717        | 22.0  | 2112 | 0.0481          | 0.8785             | 0.9126          | 0.8952      | 103             | 0.8619                 | 0.9123              | 0.8864          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.8998            | 0.9309         | 0.9150     | 0.9829           |
| 0.0644        | 23.0  | 2208 | 0.0492          | 0.8692             | 0.9029          | 0.8857      | 103             | 0.9064                 | 0.9064              | 0.9064          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9169            | 0.9259         | 0.9214     | 0.9843           |
| 0.0647        | 24.0  | 2304 | 0.0494          | 0.8692             | 0.9029          | 0.8857      | 103             | 0.8935                 | 0.8830              | 0.8882          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9115            | 0.9160         | 0.9138     | 0.9826           |
| 0.0652        | 25.0  | 2400 | 0.0511          | 0.8692             | 0.9029          | 0.8857      | 103             | 0.9018                 | 0.8596              | 0.8802          | 171                 | 0.9621           | 0.9695        | 0.9658    | 131           | 0.9129            | 0.9062         | 0.9095     | 0.9815           |
| 0.0635        | 26.0  | 2496 | 0.0465          | 0.8942             | 0.9029          | 0.8986      | 103             | 0.8883                 | 0.9298              | 0.9086          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9155            | 0.9358         | 0.9255     | 0.9845           |
| 0.0597        | 27.0  | 2592 | 0.0450          | 0.8868             | 0.9126          | 0.8995      | 103             | 0.9075                 | 0.9181              | 0.9128          | 171                 | 0.9474           | 0.9618        | 0.9545    | 131           | 0.9150            | 0.9309         | 0.9229     | 0.9859           |
| 0.0596        | 28.0  | 2688 | 0.0456          | 0.8785             | 0.9126          | 0.8952      | 103             | 0.8977                 | 0.9240              | 0.9107          | 171                 | 0.9846           | 0.9771        | 0.9808    | 131           | 0.9201            | 0.9383         | 0.9291     | 0.9865           |
| 0.0588        | 29.0  | 2784 | 0.0439          | 0.8846             | 0.8932          | 0.8889      | 103             | 0.8701                 | 0.9006              | 0.8851          | 171                 | 0.9545           | 0.9618        | 0.9582    | 131           | 0.9007            | 0.9185         | 0.9095     | 0.9845           |
| 0.0546        | 30.0  | 2880 | 0.0453          | 0.8774             | 0.9029          | 0.8900      | 103             | 0.88                   | 0.9006              | 0.8902          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9078            | 0.9235         | 0.9155     | 0.9848           |
| 0.0543        | 31.0  | 2976 | 0.0455          | 0.8319             | 0.9126          | 0.8704      | 103             | 0.8976                 | 0.8713              | 0.8843          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9024            | 0.9136         | 0.9080     | 0.9832           |
| 0.0536        | 32.0  | 3072 | 0.0458          | 0.8304             | 0.9029          | 0.8651      | 103             | 0.9080                 | 0.8655              | 0.8862          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9064            | 0.9086         | 0.9075     | 0.9834           |
| 0.0529        | 33.0  | 3168 | 0.0472          | 0.8190             | 0.9223          | 0.8676      | 103             | 0.9125                 | 0.8538              | 0.8822          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9042            | 0.9086         | 0.9064     | 0.9832           |
| 0.054         | 34.0  | 3264 | 0.0448          | 0.8868             | 0.9126          | 0.8995      | 103             | 0.8743                 | 0.8947              | 0.8844          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9078            | 0.9235         | 0.9155     | 0.9851           |
| 0.052         | 35.0  | 3360 | 0.0444          | 0.8532             | 0.9029          | 0.8774      | 103             | 0.8876                 | 0.8772              | 0.8824          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9046            | 0.9136         | 0.9091     | 0.9845           |
| 0.0513        | 36.0  | 3456 | 0.0451          | 0.8393             | 0.9126          | 0.8744      | 103             | 0.8721                 | 0.8772              | 0.8746          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.8940            | 0.9160         | 0.9049     | 0.9837           |
| 0.0501        | 37.0  | 3552 | 0.0453          | 0.9118             | 0.9029          | 0.9073      | 103             | 0.8757                 | 0.9064              | 0.8908          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9146            | 0.9259         | 0.9202     | 0.9851           |
| 0.048         | 38.0  | 3648 | 0.0485          | 0.8624             | 0.9126          | 0.8868      | 103             | 0.9080                 | 0.8655              | 0.8862          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9156            | 0.9111         | 0.9134     | 0.9837           |
| 0.0484        | 39.0  | 3744 | 0.0451          | 0.8774             | 0.9029          | 0.8900      | 103             | 0.8652                 | 0.9006              | 0.8825          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9012            | 0.9235         | 0.9122     | 0.9845           |
| 0.0498        | 40.0  | 3840 | 0.0448          | 0.9126             | 0.9126          | 0.9126      | 103             | 0.8736                 | 0.8889              | 0.8812          | 171                 | 0.9621           | 0.9695        | 0.9658    | 131           | 0.9120            | 0.9210         | 0.9165     | 0.9851           |
| 0.047         | 41.0  | 3936 | 0.0427          | 0.8774             | 0.9029          | 0.8900      | 103             | 0.8757                 | 0.9064              | 0.8908          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9058            | 0.9259         | 0.9158     | 0.9854           |
| 0.0465        | 42.0  | 4032 | 0.0416          | 0.8942             | 0.9029          | 0.8986      | 103             | 0.8715                 | 0.9123              | 0.8914          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9082            | 0.9284         | 0.9182     | 0.9859           |
| 0.0443        | 43.0  | 4128 | 0.0423          | 0.8785             | 0.9126          | 0.8952      | 103             | 0.8743                 | 0.8947              | 0.8844          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9056            | 0.9235         | 0.9144     | 0.9854           |
| 0.0428        | 44.0  | 4224 | 0.0433          | 0.8624             | 0.9126          | 0.8868      | 103             | 0.8882                 | 0.8830              | 0.8856          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9073            | 0.9185         | 0.9129     | 0.9854           |
| 0.0437        | 45.0  | 4320 | 0.0440          | 0.8393             | 0.9126          | 0.8744      | 103             | 0.8876                 | 0.8772              | 0.8824          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9005            | 0.9160         | 0.9082     | 0.9848           |
| 0.0447        | 46.0  | 4416 | 0.0479          | 0.8462             | 0.9612          | 0.9         | 103             | 0.9125                 | 0.8538              | 0.8822          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9118            | 0.9185         | 0.9151     | 0.9840           |
| 0.0421        | 47.0  | 4512 | 0.0437          | 0.8796             | 0.9223          | 0.9005      | 103             | 0.8701                 | 0.9006              | 0.8851          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9038            | 0.9284         | 0.9160     | 0.9851           |
| 0.0403        | 48.0  | 4608 | 0.0450          | 0.8522             | 0.9515          | 0.8991      | 103             | 0.9141                 | 0.8713              | 0.8922          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9144            | 0.9235         | 0.9189     | 0.9856           |
| 0.0423        | 49.0  | 4704 | 0.0475          | 0.8448             | 0.9515          | 0.8950      | 103             | 0.9074                 | 0.8596              | 0.8829          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9095            | 0.9185         | 0.9140     | 0.9851           |
| 0.039         | 50.0  | 4800 | 0.0503          | 0.8435             | 0.9417          | 0.8899      | 103             | 0.9187                 | 0.8596              | 0.8882          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9138            | 0.9160         | 0.9149     | 0.9837           |
| 0.0395        | 51.0  | 4896 | 0.0512          | 0.8448             | 0.9515          | 0.8950      | 103             | 0.9130                 | 0.8596              | 0.8855          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9118            | 0.9185         | 0.9151     | 0.9840           |
| 0.0405        | 52.0  | 4992 | 0.0451          | 0.8559             | 0.9223          | 0.8879      | 103             | 0.8713                 | 0.8713              | 0.8713          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.8983            | 0.9160         | 0.9071     | 0.9843           |
| 0.0383        | 53.0  | 5088 | 0.0474          | 0.8435             | 0.9417          | 0.8899      | 103             | 0.9012                 | 0.8538              | 0.8769          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9069            | 0.9136         | 0.9102     | 0.9834           |
| 0.0355        | 54.0  | 5184 | 0.0450          | 0.8972             | 0.9320          | 0.9143      | 103             | 0.9012                 | 0.9064              | 0.9038          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9220            | 0.9333         | 0.9276     | 0.9865           |
| 0.0404        | 55.0  | 5280 | 0.0495          | 0.8727             | 0.9320          | 0.9014      | 103             | 0.8896                 | 0.8480              | 0.8683          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9109            | 0.9086         | 0.9098     | 0.9829           |
| 0.0367        | 56.0  | 5376 | 0.0473          | 0.8571             | 0.9320          | 0.8930      | 103             | 0.8982                 | 0.8772              | 0.8876          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9098            | 0.9210         | 0.9153     | 0.9854           |
| 0.0383        | 57.0  | 5472 | 0.0486          | 0.8496             | 0.9320          | 0.8889      | 103             | 0.8922                 | 0.8713              | 0.8817          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9051            | 0.9185         | 0.9118     | 0.9848           |
| 0.0353        | 58.0  | 5568 | 0.0482          | 0.8571             | 0.9320          | 0.8930      | 103             | 0.9030                 | 0.8713              | 0.8869          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9118            | 0.9185         | 0.9151     | 0.9843           |
| 0.0362        | 59.0  | 5664 | 0.0470          | 0.8649             | 0.9320          | 0.8972      | 103             | 0.8862                 | 0.8655              | 0.8757          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9071            | 0.9160         | 0.9115     | 0.9845           |
| 0.0351        | 60.0  | 5760 | 0.0487          | 0.8727             | 0.9320          | 0.9014      | 103             | 0.8855                 | 0.8596              | 0.8724          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9091            | 0.9136         | 0.9113     | 0.9843           |
| 0.0399        | 61.0  | 5856 | 0.0487          | 0.8889             | 0.9320          | 0.9100      | 103             | 0.8869                 | 0.8713              | 0.8791          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9140            | 0.9185         | 0.9163     | 0.9843           |
| 0.0364        | 62.0  | 5952 | 0.0500          | 0.8673             | 0.9515          | 0.9074      | 103             | 0.9125                 | 0.8538              | 0.8822          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9183            | 0.9160         | 0.9172     | 0.9834           |
| 0.0342        | 63.0  | 6048 | 0.0466          | 0.8649             | 0.9320          | 0.8972      | 103             | 0.8757                 | 0.8655              | 0.8706          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9027            | 0.9160         | 0.9093     | 0.9843           |
| 0.0347        | 64.0  | 6144 | 0.0487          | 0.8807             | 0.9320          | 0.9057      | 103             | 0.8916                 | 0.8655              | 0.8783          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9138            | 0.9160         | 0.9149     | 0.9840           |
| 0.0357        | 65.0  | 6240 | 0.0469          | 0.8596             | 0.9515          | 0.9032      | 103             | 0.9024                 | 0.8655              | 0.8836          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9120            | 0.9210         | 0.9165     | 0.9856           |
| 0.035         | 66.0  | 6336 | 0.0515          | 0.8661             | 0.9417          | 0.9023      | 103             | 0.9119                 | 0.8480              | 0.8788          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9179            | 0.9111         | 0.9145     | 0.9832           |
| 0.0341        | 67.0  | 6432 | 0.0496          | 0.8496             | 0.9320          | 0.8889      | 103             | 0.8909                 | 0.8596              | 0.875           | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9046            | 0.9136         | 0.9091     | 0.9829           |
| 0.0336        | 68.0  | 6528 | 0.0496          | 0.8559             | 0.9223          | 0.8879      | 103             | 0.8765                 | 0.8713              | 0.8739          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9005            | 0.9160         | 0.9082     | 0.9840           |
| 0.033         | 69.0  | 6624 | 0.0500          | 0.8661             | 0.9417          | 0.9023      | 103             | 0.8902                 | 0.8538              | 0.8716          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9091            | 0.9136         | 0.9113     | 0.9837           |
| 0.0327        | 70.0  | 6720 | 0.0488          | 0.8649             | 0.9320          | 0.8972      | 103             | 0.8795                 | 0.8538              | 0.8665          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9044            | 0.9111         | 0.9077     | 0.9837           |
| 0.0331        | 71.0  | 6816 | 0.0478          | 0.8571             | 0.9320          | 0.8930      | 103             | 0.8922                 | 0.8713              | 0.8817          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9073            | 0.9185         | 0.9129     | 0.9851           |
| 0.0311        | 72.0  | 6912 | 0.0545          | 0.8596             | 0.9515          | 0.9032      | 103             | 0.9295                 | 0.8480              | 0.8869          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9227            | 0.9136         | 0.9181     | 0.9832           |
| 0.033         | 73.0  | 7008 | 0.0502          | 0.8684             | 0.9612          | 0.9124      | 103             | 0.9024                 | 0.8655              | 0.8836          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9144            | 0.9235         | 0.9189     | 0.9859           |
| 0.0321        | 74.0  | 7104 | 0.0519          | 0.8673             | 0.9515          | 0.9074      | 103             | 0.9074                 | 0.8596              | 0.8829          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9163            | 0.9185         | 0.9174     | 0.9840           |
| 0.032         | 75.0  | 7200 | 0.0512          | 0.8636             | 0.9223          | 0.8920      | 103             | 0.8848                 | 0.8538              | 0.8690          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9064            | 0.9086         | 0.9075     | 0.9829           |
| 0.0316        | 76.0  | 7296 | 0.0496          | 0.8584             | 0.9417          | 0.8981      | 103             | 0.8757                 | 0.8655              | 0.8706          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9007            | 0.9185         | 0.9095     | 0.9843           |
| 0.0318        | 77.0  | 7392 | 0.0508          | 0.8739             | 0.9417          | 0.9065      | 103             | 0.8909                 | 0.8596              | 0.875           | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9115            | 0.9160         | 0.9138     | 0.9834           |
| 0.0294        | 78.0  | 7488 | 0.0518          | 0.8739             | 0.9417          | 0.9065      | 103             | 0.8902                 | 0.8538              | 0.8716          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9113            | 0.9136         | 0.9125     | 0.9829           |
| 0.0307        | 79.0  | 7584 | 0.0515          | 0.8727             | 0.9320          | 0.9014      | 103             | 0.8848                 | 0.8538              | 0.8690          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9089            | 0.9111         | 0.9100     | 0.9826           |
| 0.0314        | 80.0  | 7680 | 0.0506          | 0.8673             | 0.9515          | 0.9074      | 103             | 0.8909                 | 0.8596              | 0.875           | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9095            | 0.9185         | 0.9140     | 0.9845           |
| 0.0323        | 81.0  | 7776 | 0.0517          | 0.8761             | 0.9612          | 0.9167      | 103             | 0.9018                 | 0.8596              | 0.8802          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9165            | 0.9210         | 0.9187     | 0.9837           |
| 0.0314        | 82.0  | 7872 | 0.0495          | 0.8807             | 0.9320          | 0.9057      | 103             | 0.8765                 | 0.8713              | 0.8739          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9073            | 0.9185         | 0.9129     | 0.9843           |
| 0.0301        | 83.0  | 7968 | 0.0525          | 0.8739             | 0.9417          | 0.9065      | 103             | 0.8963                 | 0.8596              | 0.8776          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9138            | 0.9160         | 0.9149     | 0.9829           |
| 0.0309        | 84.0  | 8064 | 0.0526          | 0.875              | 0.9515          | 0.9116      | 103             | 0.9018                 | 0.8596              | 0.8802          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9163            | 0.9185         | 0.9174     | 0.9832           |
| 0.0305        | 85.0  | 8160 | 0.0519          | 0.875              | 0.9515          | 0.9116      | 103             | 0.8855                 | 0.8596              | 0.8724          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9095            | 0.9185         | 0.9140     | 0.9840           |
| 0.0295        | 86.0  | 8256 | 0.0519          | 0.875              | 0.9515          | 0.9116      | 103             | 0.8909                 | 0.8596              | 0.875           | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9118            | 0.9185         | 0.9151     | 0.9837           |
| 0.0316        | 87.0  | 8352 | 0.0517          | 0.8739             | 0.9417          | 0.9065      | 103             | 0.8963                 | 0.8596              | 0.8776          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9138            | 0.9160         | 0.9149     | 0.9829           |
| 0.0298        | 88.0  | 8448 | 0.0518          | 0.8727             | 0.9320          | 0.9014      | 103             | 0.8802                 | 0.8596              | 0.8698          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9069            | 0.9136         | 0.9102     | 0.9832           |
| 0.0299        | 89.0  | 8544 | 0.0534          | 0.8649             | 0.9320          | 0.8972      | 103             | 0.9018                 | 0.8596              | 0.8802          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9136            | 0.9136         | 0.9136     | 0.9829           |
| 0.0292        | 90.0  | 8640 | 0.0517          | 0.8727             | 0.9320          | 0.9014      | 103             | 0.8802                 | 0.8596              | 0.8698          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9069            | 0.9136         | 0.9102     | 0.9832           |
| 0.0301        | 91.0  | 8736 | 0.0511          | 0.8727             | 0.9320          | 0.9014      | 103             | 0.875                  | 0.8596              | 0.8673          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9046            | 0.9136         | 0.9091     | 0.9834           |
| 0.0294        | 92.0  | 8832 | 0.0518          | 0.8727             | 0.9320          | 0.9014      | 103             | 0.8802                 | 0.8596              | 0.8698          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9069            | 0.9136         | 0.9102     | 0.9832           |
| 0.0296        | 93.0  | 8928 | 0.0516          | 0.8727             | 0.9320          | 0.9014      | 103             | 0.875                  | 0.8596              | 0.8673          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9046            | 0.9136         | 0.9091     | 0.9834           |
| 0.0293        | 94.0  | 9024 | 0.0523          | 0.8661             | 0.9417          | 0.9023      | 103             | 0.8909                 | 0.8596              | 0.875           | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9093            | 0.9160         | 0.9127     | 0.9840           |
| 0.0295        | 95.0  | 9120 | 0.0515          | 0.8727             | 0.9320          | 0.9014      | 103             | 0.875                  | 0.8596              | 0.8673          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9046            | 0.9136         | 0.9091     | 0.9834           |
| 0.0284        | 96.0  | 9216 | 0.0515          | 0.8727             | 0.9320          | 0.9014      | 103             | 0.875                  | 0.8596              | 0.8673          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9046            | 0.9136         | 0.9091     | 0.9834           |
| 0.0289        | 97.0  | 9312 | 0.0522          | 0.8636             | 0.9223          | 0.8920      | 103             | 0.8802                 | 0.8596              | 0.8698          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9044            | 0.9111         | 0.9077     | 0.9834           |
| 0.0282        | 98.0  | 9408 | 0.0520          | 0.8636             | 0.9223          | 0.8920      | 103             | 0.8802                 | 0.8596              | 0.8698          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9044            | 0.9111         | 0.9077     | 0.9834           |
| 0.0287        | 99.0  | 9504 | 0.0519          | 0.8727             | 0.9320          | 0.9014      | 103             | 0.875                  | 0.8596              | 0.8673          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9046            | 0.9136         | 0.9091     | 0.9834           |
| 0.0301        | 100.0 | 9600 | 0.0517          | 0.8727             | 0.9320          | 0.9014      | 103             | 0.875                  | 0.8596              | 0.8673          | 171                 | 0.9695           | 0.9695        | 0.9695    | 131           | 0.9046            | 0.9136         | 0.9091     | 0.9834           |


### Framework versions

- Transformers 4.39.3
- Pytorch 2.3.0+cu121
- Datasets 2.19.1
- Tokenizers 0.15.2