makhataei commited on
Commit
5ec308b
1 Parent(s): e2a32b6

End of training

Browse files
README.md CHANGED
@@ -17,7 +17,7 @@ should probably proofread and complete it, then remove this comment. -->
17
 
18
  This model is a fine-tuned version of [makhataei/qa-persian-mdeberta-v3-base-squad2](https://huggingface.co/makhataei/qa-persian-mdeberta-v3-base-squad2) on the pquad dataset.
19
  It achieves the following results on the evaluation set:
20
- - Loss: 2.0384
21
 
22
  ## Model description
23
 
@@ -36,7 +36,7 @@ More information needed
36
  ### Training hyperparameters
37
 
38
  The following hyperparameters were used during training:
39
- - learning_rate: 0.0001
40
  - train_batch_size: 5
41
  - eval_batch_size: 5
42
  - seed: 42
@@ -48,262 +48,262 @@ The following hyperparameters were used during training:
48
 
49
  | Training Loss | Epoch | Step | Validation Loss |
50
  |:-------------:|:-----:|:------:|:---------------:|
51
- | 1.3205 | 0.04 | 500 | 1.1532 |
52
- | 1.2554 | 0.08 | 1000 | 1.2856 |
53
- | 1.2457 | 0.12 | 1500 | 1.3620 |
54
- | 1.1863 | 0.16 | 2000 | 1.1648 |
55
- | 1.1713 | 0.19 | 2500 | 1.0594 |
56
- | 1.1432 | 0.23 | 3000 | 1.1919 |
57
- | 1.1346 | 0.27 | 3500 | 1.0333 |
58
- | 1.1474 | 0.31 | 4000 | 1.3021 |
59
- | 1.1183 | 0.35 | 4500 | 1.2632 |
60
- | 1.0927 | 0.39 | 5000 | 1.1592 |
61
- | 1.0658 | 0.43 | 5500 | 1.1402 |
62
- | 1.0596 | 0.47 | 6000 | 1.1386 |
63
- | 1.0637 | 0.51 | 6500 | 1.0560 |
64
- | 1.0825 | 0.55 | 7000 | 1.1514 |
65
- | 1.0453 | 0.58 | 7500 | 1.0431 |
66
- | 1.0457 | 0.62 | 8000 | 1.1177 |
67
- | 1.1175 | 0.66 | 8500 | 1.0410 |
68
- | 1.0816 | 0.7 | 9000 | 1.2672 |
69
- | 1.0608 | 0.74 | 9500 | 1.0187 |
70
- | 1.0553 | 0.78 | 10000 | 1.0729 |
71
- | 0.9987 | 0.82 | 10500 | 1.2627 |
72
- | 1.006 | 0.86 | 11000 | 1.0749 |
73
- | 1.0217 | 0.9 | 11500 | 1.1467 |
74
- | 0.9799 | 0.94 | 12000 | 1.1333 |
75
- | 1.0424 | 0.97 | 12500 | 1.0982 |
76
- | 0.9434 | 1.01 | 13000 | 1.1808 |
77
- | 0.909 | 1.05 | 13500 | 1.1468 |
78
- | 0.88 | 1.09 | 14000 | 1.1328 |
79
- | 0.8935 | 1.13 | 14500 | 1.3442 |
80
- | 0.9473 | 1.17 | 15000 | 1.1131 |
81
- | 0.8825 | 1.21 | 15500 | 1.0903 |
82
- | 0.8898 | 1.25 | 16000 | 1.0192 |
83
- | 0.8763 | 1.29 | 16500 | 1.0972 |
84
- | 0.9211 | 1.32 | 17000 | 1.0496 |
85
- | 0.8832 | 1.36 | 17500 | 1.0479 |
86
- | 0.8743 | 1.4 | 18000 | 1.2129 |
87
- | 0.9268 | 1.44 | 18500 | 1.0250 |
88
- | 0.9138 | 1.48 | 19000 | 1.0019 |
89
- | 0.8882 | 1.52 | 19500 | 1.0912 |
90
- | 0.8811 | 1.56 | 20000 | 1.0570 |
91
- | 0.9133 | 1.6 | 20500 | 1.2419 |
92
- | 0.9929 | 1.64 | 21000 | 1.1105 |
93
- | 0.9087 | 1.68 | 21500 | 1.2119 |
94
- | 0.9115 | 1.71 | 22000 | 0.9776 |
95
- | 0.8925 | 1.75 | 22500 | 1.0555 |
96
- | 0.9275 | 1.79 | 23000 | 1.0288 |
97
- | 0.9095 | 1.83 | 23500 | 1.0000 |
98
- | 0.8962 | 1.87 | 24000 | 1.0408 |
99
- | 0.8375 | 1.91 | 24500 | 1.0101 |
100
- | 0.8769 | 1.95 | 25000 | 1.0564 |
101
- | 0.9058 | 1.99 | 25500 | 0.9696 |
102
- | 0.7766 | 2.03 | 26000 | 1.0502 |
103
- | 0.7749 | 2.06 | 26500 | 1.2074 |
104
- | 0.7785 | 2.1 | 27000 | 1.0686 |
105
- | 0.761 | 2.14 | 27500 | 1.0766 |
106
- | 0.7855 | 2.18 | 28000 | 1.1775 |
107
- | 0.8157 | 2.22 | 28500 | 1.0827 |
108
- | 0.8119 | 2.26 | 29000 | 1.1154 |
109
- | 0.8156 | 2.3 | 29500 | 1.0658 |
110
- | 0.7983 | 2.34 | 30000 | 1.0678 |
111
- | 0.7973 | 2.38 | 30500 | 0.9599 |
112
- | 0.7406 | 2.42 | 31000 | 1.1299 |
113
- | 0.7789 | 2.45 | 31500 | 1.0523 |
114
- | 0.801 | 2.49 | 32000 | 1.0122 |
115
- | 0.7792 | 2.53 | 32500 | 1.0692 |
116
- | 0.754 | 2.57 | 33000 | 1.2131 |
117
- | 0.8046 | 2.61 | 33500 | 1.0260 |
118
- | 0.8078 | 2.65 | 34000 | 1.0682 |
119
- | 0.7659 | 2.69 | 34500 | 1.1423 |
120
- | 0.7871 | 2.73 | 35000 | 1.0014 |
121
- | 0.7978 | 2.77 | 35500 | 1.0414 |
122
- | 0.7399 | 2.81 | 36000 | 1.0891 |
123
- | 0.762 | 2.84 | 36500 | 1.0065 |
124
- | 0.7741 | 2.88 | 37000 | 1.0600 |
125
- | 0.8242 | 2.92 | 37500 | 0.9591 |
126
- | 0.757 | 2.96 | 38000 | 1.1498 |
127
- | 0.7642 | 3.0 | 38500 | 1.1127 |
128
- | 0.6326 | 3.04 | 39000 | 1.1282 |
129
- | 0.6553 | 3.08 | 39500 | 1.0911 |
130
- | 0.6331 | 3.12 | 40000 | 1.0935 |
131
- | 0.6521 | 3.16 | 40500 | 1.0161 |
132
- | 0.69 | 3.19 | 41000 | 0.9850 |
133
- | 0.7105 | 3.23 | 41500 | 1.1046 |
134
- | 0.6937 | 3.27 | 42000 | 1.1120 |
135
- | 0.6515 | 3.31 | 42500 | 1.2233 |
136
- | 0.7001 | 3.35 | 43000 | 1.0194 |
137
- | 0.7145 | 3.39 | 43500 | 1.0097 |
138
- | 0.669 | 3.43 | 44000 | 1.1708 |
139
- | 0.7048 | 3.47 | 44500 | 0.9999 |
140
- | 0.7169 | 3.51 | 45000 | 1.0786 |
141
- | 0.708 | 3.55 | 45500 | 0.9872 |
142
- | 0.6649 | 3.58 | 46000 | 1.1070 |
143
- | 0.6865 | 3.62 | 46500 | 1.0603 |
144
- | 0.6942 | 3.66 | 47000 | 1.0716 |
145
- | 0.661 | 3.7 | 47500 | 1.0570 |
146
- | 0.6773 | 3.74 | 48000 | 1.0194 |
147
- | 0.7106 | 3.78 | 48500 | 0.9938 |
148
- | 0.7211 | 3.82 | 49000 | 0.9909 |
149
- | 0.6662 | 3.86 | 49500 | 1.1505 |
150
- | 0.7154 | 3.9 | 50000 | 1.0337 |
151
- | 0.6643 | 3.94 | 50500 | 1.0835 |
152
- | 0.6639 | 3.97 | 51000 | 1.1088 |
153
- | 0.663 | 4.01 | 51500 | 1.1509 |
154
- | 0.5685 | 4.05 | 52000 | 1.2185 |
155
- | 0.583 | 4.09 | 52500 | 1.2467 |
156
- | 0.57 | 4.13 | 53000 | 1.1549 |
157
- | 0.5947 | 4.17 | 53500 | 1.0641 |
158
- | 0.583 | 4.21 | 54000 | 1.0671 |
159
- | 0.5833 | 4.25 | 54500 | 1.1481 |
160
- | 0.5581 | 4.29 | 55000 | 1.1786 |
161
- | 0.5859 | 4.32 | 55500 | 1.1768 |
162
- | 0.5941 | 4.36 | 56000 | 1.0698 |
163
- | 0.5697 | 4.4 | 56500 | 1.1152 |
164
- | 0.5698 | 4.44 | 57000 | 1.1168 |
165
- | 0.5767 | 4.48 | 57500 | 1.1962 |
166
- | 0.6357 | 4.52 | 58000 | 1.0031 |
167
- | 0.5633 | 4.56 | 58500 | 1.1203 |
168
- | 0.6057 | 4.6 | 59000 | 1.0423 |
169
- | 0.5839 | 4.64 | 59500 | 1.1597 |
170
- | 0.5804 | 4.68 | 60000 | 1.0598 |
171
- | 0.6081 | 4.71 | 60500 | 1.0859 |
172
- | 0.5824 | 4.75 | 61000 | 1.1191 |
173
- | 0.5765 | 4.79 | 61500 | 1.1511 |
174
- | 0.6035 | 4.83 | 62000 | 1.0515 |
175
- | 0.6036 | 4.87 | 62500 | 1.0047 |
176
- | 0.6066 | 4.91 | 63000 | 1.0589 |
177
- | 0.594 | 4.95 | 63500 | 1.1539 |
178
- | 0.5805 | 4.99 | 64000 | 1.0718 |
179
- | 0.5042 | 5.03 | 64500 | 1.1119 |
180
- | 0.4812 | 5.07 | 65000 | 1.1154 |
181
- | 0.4276 | 5.1 | 65500 | 1.3720 |
182
- | 0.4627 | 5.14 | 66000 | 1.2931 |
183
- | 0.4179 | 5.18 | 66500 | 1.3074 |
184
- | 0.5126 | 5.22 | 67000 | 1.1193 |
185
- | 0.4557 | 5.26 | 67500 | 1.2478 |
186
- | 0.4861 | 5.3 | 68000 | 1.2499 |
187
- | 0.4898 | 5.34 | 68500 | 1.2249 |
188
- | 0.4735 | 5.38 | 69000 | 1.1762 |
189
- | 0.4856 | 5.42 | 69500 | 1.2609 |
190
- | 0.486 | 5.45 | 70000 | 1.2758 |
191
- | 0.5102 | 5.49 | 70500 | 1.2111 |
192
- | 0.5024 | 5.53 | 71000 | 1.1145 |
193
- | 0.4803 | 5.57 | 71500 | 1.2216 |
194
- | 0.521 | 5.61 | 72000 | 1.1260 |
195
- | 0.4956 | 5.65 | 72500 | 1.1383 |
196
- | 0.4614 | 5.69 | 73000 | 1.2230 |
197
- | 0.5086 | 5.73 | 73500 | 1.1338 |
198
- | 0.4645 | 5.77 | 74000 | 1.2906 |
199
- | 0.5015 | 5.81 | 74500 | 1.1691 |
200
- | 0.497 | 5.84 | 75000 | 1.1654 |
201
- | 0.5003 | 5.88 | 75500 | 1.1713 |
202
- | 0.4673 | 5.92 | 76000 | 1.1805 |
203
- | 0.5054 | 5.96 | 76500 | 1.2260 |
204
- | 0.4951 | 6.0 | 77000 | 1.1944 |
205
- | 0.3814 | 6.04 | 77500 | 1.3640 |
206
- | 0.3503 | 6.08 | 78000 | 1.3561 |
207
- | 0.3756 | 6.12 | 78500 | 1.3969 |
208
- | 0.3662 | 6.16 | 79000 | 1.5735 |
209
- | 0.3925 | 6.19 | 79500 | 1.2852 |
210
- | 0.363 | 6.23 | 80000 | 1.5148 |
211
- | 0.3799 | 6.27 | 80500 | 1.5019 |
212
- | 0.3708 | 6.31 | 81000 | 1.3908 |
213
- | 0.4203 | 6.35 | 81500 | 1.4493 |
214
- | 0.3989 | 6.39 | 82000 | 1.2541 |
215
- | 0.3614 | 6.43 | 82500 | 1.4183 |
216
- | 0.3764 | 6.47 | 83000 | 1.4680 |
217
- | 0.402 | 6.51 | 83500 | 1.3842 |
218
- | 0.3592 | 6.55 | 84000 | 1.4426 |
219
- | 0.3834 | 6.58 | 84500 | 1.4939 |
220
- | 0.4127 | 6.62 | 85000 | 1.3073 |
221
- | 0.3846 | 6.66 | 85500 | 1.3353 |
222
- | 0.3735 | 6.7 | 86000 | 1.3667 |
223
- | 0.4271 | 6.74 | 86500 | 1.2717 |
224
- | 0.3958 | 6.78 | 87000 | 1.2076 |
225
- | 0.4061 | 6.82 | 87500 | 1.3166 |
226
- | 0.398 | 6.86 | 88000 | 1.2572 |
227
- | 0.3937 | 6.9 | 88500 | 1.3595 |
228
- | 0.3777 | 6.94 | 89000 | 1.3206 |
229
- | 0.4019 | 6.97 | 89500 | 1.1889 |
230
- | 0.3589 | 7.01 | 90000 | 1.5264 |
231
- | 0.2803 | 7.05 | 90500 | 1.6564 |
232
- | 0.2846 | 7.09 | 91000 | 1.6783 |
233
- | 0.2864 | 7.13 | 91500 | 1.6254 |
234
- | 0.307 | 7.17 | 92000 | 1.5310 |
235
- | 0.2863 | 7.21 | 92500 | 1.5002 |
236
- | 0.2879 | 7.25 | 93000 | 1.6794 |
237
- | 0.3007 | 7.29 | 93500 | 1.5955 |
238
- | 0.315 | 7.32 | 94000 | 1.6204 |
239
- | 0.2778 | 7.36 | 94500 | 1.6348 |
240
- | 0.3061 | 7.4 | 95000 | 1.5241 |
241
- | 0.2944 | 7.44 | 95500 | 1.5652 |
242
- | 0.319 | 7.48 | 96000 | 1.5054 |
243
- | 0.2485 | 7.52 | 96500 | 1.6965 |
244
- | 0.3092 | 7.56 | 97000 | 1.6713 |
245
- | 0.2814 | 7.6 | 97500 | 1.6686 |
246
- | 0.3146 | 7.64 | 98000 | 1.4131 |
247
- | 0.2984 | 7.68 | 98500 | 1.5230 |
248
- | 0.3042 | 7.71 | 99000 | 1.6290 |
249
- | 0.3017 | 7.75 | 99500 | 1.5662 |
250
- | 0.2887 | 7.79 | 100000 | 1.5735 |
251
- | 0.2875 | 7.83 | 100500 | 1.4787 |
252
- | 0.2977 | 7.87 | 101000 | 1.4646 |
253
- | 0.3023 | 7.91 | 101500 | 1.6137 |
254
- | 0.2822 | 7.95 | 102000 | 1.5982 |
255
- | 0.3319 | 7.99 | 102500 | 1.3800 |
256
- | 0.225 | 8.03 | 103000 | 1.7508 |
257
- | 0.1945 | 8.07 | 103500 | 1.8662 |
258
- | 0.2103 | 8.1 | 104000 | 1.7704 |
259
- | 0.2283 | 8.14 | 104500 | 1.7051 |
260
- | 0.2154 | 8.18 | 105000 | 1.7834 |
261
- | 0.218 | 8.22 | 105500 | 1.7594 |
262
- | 0.2125 | 8.26 | 106000 | 1.6883 |
263
- | 0.2225 | 8.3 | 106500 | 1.7300 |
264
- | 0.1957 | 8.34 | 107000 | 1.8693 |
265
- | 0.2459 | 8.38 | 107500 | 1.7509 |
266
- | 0.2211 | 8.42 | 108000 | 1.6318 |
267
- | 0.2044 | 8.45 | 108500 | 1.7483 |
268
- | 0.225 | 8.49 | 109000 | 1.6655 |
269
- | 0.1776 | 8.53 | 109500 | 1.9201 |
270
- | 0.2069 | 8.57 | 110000 | 1.8501 |
271
- | 0.2006 | 8.61 | 110500 | 1.8808 |
272
- | 0.2268 | 8.65 | 111000 | 1.8335 |
273
- | 0.1864 | 8.69 | 111500 | 1.8904 |
274
- | 0.2075 | 8.73 | 112000 | 1.8735 |
275
- | 0.2086 | 8.77 | 112500 | 1.7576 |
276
- | 0.2408 | 8.81 | 113000 | 1.6434 |
277
- | 0.2033 | 8.84 | 113500 | 1.8157 |
278
- | 0.1833 | 8.88 | 114000 | 1.8573 |
279
- | 0.2023 | 8.92 | 114500 | 1.8238 |
280
- | 0.2288 | 8.96 | 115000 | 1.7943 |
281
- | 0.2146 | 9.0 | 115500 | 1.7764 |
282
- | 0.1705 | 9.04 | 116000 | 1.8408 |
283
- | 0.1237 | 9.08 | 116500 | 2.0233 |
284
- | 0.1507 | 9.12 | 117000 | 2.0297 |
285
- | 0.1584 | 9.16 | 117500 | 1.9834 |
286
- | 0.1429 | 9.2 | 118000 | 2.0743 |
287
- | 0.1431 | 9.23 | 118500 | 2.1105 |
288
- | 0.1498 | 9.27 | 119000 | 2.0739 |
289
- | 0.1364 | 9.31 | 119500 | 2.1171 |
290
- | 0.1253 | 9.35 | 120000 | 2.0861 |
291
- | 0.1572 | 9.39 | 120500 | 2.0564 |
292
- | 0.1559 | 9.43 | 121000 | 1.9621 |
293
- | 0.1408 | 9.47 | 121500 | 1.9998 |
294
- | 0.1265 | 9.51 | 122000 | 2.0538 |
295
- | 0.1226 | 9.55 | 122500 | 2.0356 |
296
- | 0.1623 | 9.58 | 123000 | 2.0362 |
297
- | 0.1453 | 9.62 | 123500 | 2.0281 |
298
- | 0.1709 | 9.66 | 124000 | 2.0056 |
299
- | 0.1359 | 9.7 | 124500 | 2.0291 |
300
- | 0.1503 | 9.74 | 125000 | 2.0291 |
301
- | 0.1218 | 9.78 | 125500 | 2.0553 |
302
- | 0.1512 | 9.82 | 126000 | 2.0581 |
303
- | 0.1475 | 9.86 | 126500 | 2.0328 |
304
- | 0.1374 | 9.9 | 127000 | 2.0297 |
305
- | 0.1203 | 9.94 | 127500 | 2.0356 |
306
- | 0.1444 | 9.97 | 128000 | 2.0384 |
307
 
308
 
309
  ### Framework versions
 
17
 
18
  This model is a fine-tuned version of [makhataei/qa-persian-mdeberta-v3-base-squad2](https://huggingface.co/makhataei/qa-persian-mdeberta-v3-base-squad2) on the pquad dataset.
19
  It achieves the following results on the evaluation set:
20
+ - Loss: 2.4713
21
 
22
  ## Model description
23
 
 
36
  ### Training hyperparameters
37
 
38
  The following hyperparameters were used during training:
39
+ - learning_rate: 5e-05
40
  - train_batch_size: 5
41
  - eval_batch_size: 5
42
  - seed: 42
 
48
 
49
  | Training Loss | Epoch | Step | Validation Loss |
50
  |:-------------:|:-----:|:------:|:---------------:|
51
+ | 0.6303 | 0.04 | 500 | 1.0524 |
52
+ | 0.6072 | 0.08 | 1000 | 1.1925 |
53
+ | 0.6577 | 0.12 | 1500 | 0.9827 |
54
+ | 0.6057 | 0.16 | 2000 | 1.0477 |
55
+ | 0.6033 | 0.19 | 2500 | 1.0451 |
56
+ | 0.6192 | 0.23 | 3000 | 1.0471 |
57
+ | 0.6037 | 0.27 | 3500 | 1.0151 |
58
+ | 0.6043 | 0.31 | 4000 | 1.0837 |
59
+ | 0.5914 | 0.35 | 4500 | 1.1688 |
60
+ | 0.5731 | 0.39 | 5000 | 1.1192 |
61
+ | 0.6051 | 0.43 | 5500 | 1.0116 |
62
+ | 0.5836 | 0.47 | 6000 | 1.0353 |
63
+ | 0.6039 | 0.51 | 6500 | 1.0284 |
64
+ | 0.6132 | 0.55 | 7000 | 1.0560 |
65
+ | 0.5856 | 0.58 | 7500 | 1.1219 |
66
+ | 0.5806 | 0.62 | 8000 | 1.0780 |
67
+ | 0.6125 | 0.66 | 8500 | 0.9812 |
68
+ | 0.6356 | 0.7 | 9000 | 1.0886 |
69
+ | 0.5905 | 0.74 | 9500 | 1.0187 |
70
+ | 0.6037 | 0.78 | 10000 | 1.1142 |
71
+ | 0.5453 | 0.82 | 10500 | 1.1535 |
72
+ | 0.6065 | 0.86 | 11000 | 1.0309 |
73
+ | 0.5638 | 0.9 | 11500 | 1.1029 |
74
+ | 0.5774 | 0.94 | 12000 | 1.1913 |
75
+ | 0.6295 | 0.97 | 12500 | 1.1206 |
76
+ | 0.5427 | 1.01 | 13000 | 1.2944 |
77
+ | 0.468 | 1.05 | 13500 | 1.2117 |
78
+ | 0.4965 | 1.09 | 14000 | 1.1481 |
79
+ | 0.4549 | 1.13 | 14500 | 1.3192 |
80
+ | 0.4811 | 1.17 | 15000 | 1.1514 |
81
+ | 0.4507 | 1.21 | 15500 | 1.2152 |
82
+ | 0.4845 | 1.25 | 16000 | 1.2142 |
83
+ | 0.4529 | 1.29 | 16500 | 1.2136 |
84
+ | 0.5138 | 1.32 | 17000 | 1.2248 |
85
+ | 0.4843 | 1.36 | 17500 | 1.1198 |
86
+ | 0.501 | 1.4 | 18000 | 1.1145 |
87
+ | 0.4898 | 1.44 | 18500 | 1.0842 |
88
+ | 0.505 | 1.48 | 19000 | 1.1711 |
89
+ | 0.4979 | 1.52 | 19500 | 1.1617 |
90
+ | 0.4836 | 1.56 | 20000 | 1.1651 |
91
+ | 0.498 | 1.6 | 20500 | 1.1534 |
92
+ | 0.51 | 1.64 | 21000 | 1.1642 |
93
+ | 0.4648 | 1.68 | 21500 | 1.2294 |
94
+ | 0.4955 | 1.71 | 22000 | 1.1252 |
95
+ | 0.4645 | 1.75 | 22500 | 1.3847 |
96
+ | 0.503 | 1.79 | 23000 | 1.1256 |
97
+ | 0.5167 | 1.83 | 23500 | 1.2102 |
98
+ | 0.5039 | 1.87 | 24000 | 1.2769 |
99
+ | 0.4669 | 1.91 | 24500 | 1.2660 |
100
+ | 0.5273 | 1.95 | 25000 | 1.1858 |
101
+ | 0.5167 | 1.99 | 25500 | 1.0963 |
102
+ | 0.4213 | 2.03 | 26000 | 1.3819 |
103
+ | 0.4117 | 2.06 | 26500 | 1.3915 |
104
+ | 0.4107 | 2.1 | 27000 | 1.2944 |
105
+ | 0.3568 | 2.14 | 27500 | 1.4241 |
106
+ | 0.3969 | 2.18 | 28000 | 1.4366 |
107
+ | 0.4176 | 2.22 | 28500 | 1.4275 |
108
+ | 0.4129 | 2.26 | 29000 | 1.3802 |
109
+ | 0.4394 | 2.3 | 29500 | 1.3639 |
110
+ | 0.3945 | 2.34 | 30000 | 1.4072 |
111
+ | 0.3682 | 2.38 | 30500 | 1.3478 |
112
+ | 0.3564 | 2.42 | 31000 | 1.4649 |
113
+ | 0.3839 | 2.45 | 31500 | 1.6282 |
114
+ | 0.4148 | 2.49 | 32000 | 1.4738 |
115
+ | 0.3998 | 2.53 | 32500 | 1.4206 |
116
+ | 0.3987 | 2.57 | 33000 | 1.3601 |
117
+ | 0.3881 | 2.61 | 33500 | 1.4010 |
118
+ | 0.3852 | 2.65 | 34000 | 1.3936 |
119
+ | 0.3703 | 2.69 | 34500 | 1.4996 |
120
+ | 0.3955 | 2.73 | 35000 | 1.4243 |
121
+ | 0.414 | 2.77 | 35500 | 1.3599 |
122
+ | 0.3723 | 2.81 | 36000 | 1.4481 |
123
+ | 0.3927 | 2.84 | 36500 | 1.4327 |
124
+ | 0.3993 | 2.88 | 37000 | 1.3312 |
125
+ | 0.4074 | 2.92 | 37500 | 1.3248 |
126
+ | 0.4978 | 2.96 | 38000 | 1.2219 |
127
+ | 0.4957 | 3.0 | 38500 | 1.1998 |
128
+ | 0.3192 | 3.04 | 39000 | 1.5531 |
129
+ | 0.385 | 3.08 | 39500 | 1.3462 |
130
+ | 0.351 | 3.12 | 40000 | 1.3456 |
131
+ | 0.3584 | 3.16 | 40500 | 1.4219 |
132
+ | 0.3696 | 3.19 | 41000 | 1.5244 |
133
+ | 0.3872 | 3.23 | 41500 | 1.5260 |
134
+ | 0.3916 | 3.27 | 42000 | 1.3642 |
135
+ | 0.3598 | 3.31 | 42500 | 1.5210 |
136
+ | 0.3749 | 3.35 | 43000 | 1.3730 |
137
+ | 0.3781 | 3.39 | 43500 | 1.3904 |
138
+ | 0.38 | 3.43 | 44000 | 1.3847 |
139
+ | 0.4019 | 3.47 | 44500 | 1.3194 |
140
+ | 0.3876 | 3.51 | 45000 | 1.4494 |
141
+ | 0.3916 | 3.55 | 45500 | 1.5578 |
142
+ | 0.3895 | 3.58 | 46000 | 1.4429 |
143
+ | 0.3647 | 3.62 | 46500 | 1.3499 |
144
+ | 0.3848 | 3.66 | 47000 | 1.4542 |
145
+ | 0.3748 | 3.7 | 47500 | 1.2933 |
146
+ | 0.3892 | 3.74 | 48000 | 1.3987 |
147
+ | 0.3807 | 3.78 | 48500 | 1.4392 |
148
+ | 0.4057 | 3.82 | 49000 | 1.3771 |
149
+ | 0.3922 | 3.86 | 49500 | 1.3830 |
150
+ | 0.3976 | 3.9 | 50000 | 1.2871 |
151
+ | 0.383 | 3.94 | 50500 | 1.4306 |
152
+ | 0.3771 | 3.97 | 51000 | 1.3849 |
153
+ | 0.3793 | 4.01 | 51500 | 1.5489 |
154
+ | 0.2792 | 4.05 | 52000 | 1.5708 |
155
+ | 0.2859 | 4.09 | 52500 | 1.5634 |
156
+ | 0.2839 | 4.13 | 53000 | 1.6146 |
157
+ | 0.3118 | 4.17 | 53500 | 1.5593 |
158
+ | 0.3248 | 4.21 | 54000 | 1.5015 |
159
+ | 0.2981 | 4.25 | 54500 | 1.5262 |
160
+ | 0.2697 | 4.29 | 55000 | 1.6662 |
161
+ | 0.2929 | 4.32 | 55500 | 1.6073 |
162
+ | 0.3233 | 4.36 | 56000 | 1.4935 |
163
+ | 0.2944 | 4.4 | 56500 | 1.5488 |
164
+ | 0.3021 | 4.44 | 57000 | 1.5612 |
165
+ | 0.3162 | 4.48 | 57500 | 1.6165 |
166
+ | 0.337 | 4.52 | 58000 | 1.4389 |
167
+ | 0.3071 | 4.56 | 58500 | 1.6181 |
168
+ | 0.346 | 4.6 | 59000 | 1.5063 |
169
+ | 0.3359 | 4.64 | 59500 | 1.5319 |
170
+ | 0.283 | 4.68 | 60000 | 1.5716 |
171
+ | 0.3184 | 4.71 | 60500 | 1.5787 |
172
+ | 0.2911 | 4.75 | 61000 | 1.6882 |
173
+ | 0.3325 | 4.79 | 61500 | 1.5195 |
174
+ | 0.3223 | 4.83 | 62000 | 1.6573 |
175
+ | 0.3225 | 4.87 | 62500 | 1.4265 |
176
+ | 0.3028 | 4.91 | 63000 | 1.5742 |
177
+ | 0.318 | 4.95 | 63500 | 1.5170 |
178
+ | 0.3047 | 4.99 | 64000 | 1.5051 |
179
+ | 0.2552 | 5.03 | 64500 | 1.7450 |
180
+ | 0.2326 | 5.07 | 65000 | 1.6757 |
181
+ | 0.2174 | 5.1 | 65500 | 1.9674 |
182
+ | 0.2423 | 5.14 | 66000 | 1.8576 |
183
+ | 0.2066 | 5.18 | 66500 | 1.7914 |
184
+ | 0.2717 | 5.22 | 67000 | 1.8060 |
185
+ | 0.2353 | 5.26 | 67500 | 1.7933 |
186
+ | 0.2499 | 5.3 | 68000 | 1.7655 |
187
+ | 0.2415 | 5.34 | 68500 | 1.9094 |
188
+ | 0.2541 | 5.38 | 69000 | 1.7136 |
189
+ | 0.2616 | 5.42 | 69500 | 1.7428 |
190
+ | 0.2402 | 5.45 | 70000 | 1.8088 |
191
+ | 0.2543 | 5.49 | 70500 | 1.6588 |
192
+ | 0.2756 | 5.53 | 71000 | 1.6765 |
193
+ | 0.2447 | 5.57 | 71500 | 1.8263 |
194
+ | 0.2643 | 5.61 | 72000 | 1.6329 |
195
+ | 0.224 | 5.65 | 72500 | 1.7456 |
196
+ | 0.2385 | 5.69 | 73000 | 1.7220 |
197
+ | 0.2488 | 5.73 | 73500 | 1.5867 |
198
+ | 0.2424 | 5.77 | 74000 | 1.7738 |
199
+ | 0.2618 | 5.81 | 74500 | 1.7264 |
200
+ | 0.2498 | 5.84 | 75000 | 1.6741 |
201
+ | 0.264 | 5.88 | 75500 | 1.6714 |
202
+ | 0.2364 | 5.92 | 76000 | 1.6306 |
203
+ | 0.2377 | 5.96 | 76500 | 1.8281 |
204
+ | 0.2475 | 6.0 | 77000 | 1.6020 |
205
+ | 0.1762 | 6.04 | 77500 | 1.9256 |
206
+ | 0.182 | 6.08 | 78000 | 1.9239 |
207
+ | 0.1714 | 6.12 | 78500 | 1.9346 |
208
+ | 0.1702 | 6.16 | 79000 | 2.0071 |
209
+ | 0.1885 | 6.19 | 79500 | 1.8634 |
210
+ | 0.1933 | 6.23 | 80000 | 2.0296 |
211
+ | 0.1973 | 6.27 | 80500 | 1.8691 |
212
+ | 0.1698 | 6.31 | 81000 | 1.9280 |
213
+ | 0.1935 | 6.35 | 81500 | 1.9555 |
214
+ | 0.1892 | 6.39 | 82000 | 1.9595 |
215
+ | 0.1879 | 6.43 | 82500 | 1.9741 |
216
+ | 0.1939 | 6.47 | 83000 | 2.0260 |
217
+ | 0.1928 | 6.51 | 83500 | 2.0924 |
218
+ | 0.1906 | 6.55 | 84000 | 1.9643 |
219
+ | 0.1729 | 6.58 | 84500 | 2.1318 |
220
+ | 0.2198 | 6.62 | 85000 | 1.8794 |
221
+ | 0.1941 | 6.66 | 85500 | 1.9834 |
222
+ | 0.1798 | 6.7 | 86000 | 2.0396 |
223
+ | 0.2141 | 6.74 | 86500 | 1.8159 |
224
+ | 0.1748 | 6.78 | 87000 | 2.0235 |
225
+ | 0.2038 | 6.82 | 87500 | 1.9760 |
226
+ | 0.1948 | 6.86 | 88000 | 1.9607 |
227
+ | 0.209 | 6.9 | 88500 | 1.9526 |
228
+ | 0.1951 | 6.94 | 89000 | 2.0364 |
229
+ | 0.2238 | 6.97 | 89500 | 1.8029 |
230
+ | 0.1913 | 7.01 | 90000 | 2.0869 |
231
+ | 0.153 | 7.05 | 90500 | 2.1914 |
232
+ | 0.1393 | 7.09 | 91000 | 2.2019 |
233
+ | 0.145 | 7.13 | 91500 | 2.1408 |
234
+ | 0.1483 | 7.17 | 92000 | 2.1024 |
235
+ | 0.1396 | 7.21 | 92500 | 2.1224 |
236
+ | 0.1313 | 7.25 | 93000 | 2.1517 |
237
+ | 0.1288 | 7.29 | 93500 | 2.2002 |
238
+ | 0.1569 | 7.32 | 94000 | 2.1955 |
239
+ | 0.1291 | 7.36 | 94500 | 2.3081 |
240
+ | 0.1702 | 7.4 | 95000 | 2.0735 |
241
+ | 0.127 | 7.44 | 95500 | 2.0001 |
242
+ | 0.1503 | 7.48 | 96000 | 2.1695 |
243
+ | 0.1356 | 7.52 | 96500 | 2.1271 |
244
+ | 0.1466 | 7.56 | 97000 | 2.0921 |
245
+ | 0.1408 | 7.6 | 97500 | 2.1379 |
246
+ | 0.1367 | 7.64 | 98000 | 2.0763 |
247
+ | 0.1487 | 7.68 | 98500 | 2.2021 |
248
+ | 0.1657 | 7.71 | 99000 | 2.0800 |
249
+ | 0.1408 | 7.75 | 99500 | 2.1433 |
250
+ | 0.1328 | 7.79 | 100000 | 2.0924 |
251
+ | 0.1485 | 7.83 | 100500 | 2.1479 |
252
+ | 0.1546 | 7.87 | 101000 | 2.0750 |
253
+ | 0.1501 | 7.91 | 101500 | 2.0885 |
254
+ | 0.1391 | 7.95 | 102000 | 2.1003 |
255
+ | 0.173 | 7.99 | 102500 | 1.9603 |
256
+ | 0.096 | 8.03 | 103000 | 2.2128 |
257
+ | 0.0967 | 8.07 | 103500 | 2.2105 |
258
+ | 0.0909 | 8.1 | 104000 | 2.2345 |
259
+ | 0.086 | 8.14 | 104500 | 2.3129 |
260
+ | 0.1052 | 8.18 | 105000 | 2.3452 |
261
+ | 0.0975 | 8.22 | 105500 | 2.3279 |
262
+ | 0.0875 | 8.26 | 106000 | 2.3719 |
263
+ | 0.1167 | 8.3 | 106500 | 2.2740 |
264
+ | 0.0724 | 8.34 | 107000 | 2.3902 |
265
+ | 0.1067 | 8.38 | 107500 | 2.3961 |
266
+ | 0.1017 | 8.42 | 108000 | 2.2360 |
267
+ | 0.1003 | 8.45 | 108500 | 2.2271 |
268
+ | 0.1113 | 8.49 | 109000 | 2.3305 |
269
+ | 0.113 | 8.53 | 109500 | 2.2344 |
270
+ | 0.1047 | 8.57 | 110000 | 2.2780 |
271
+ | 0.0935 | 8.61 | 110500 | 2.3290 |
272
+ | 0.1159 | 8.65 | 111000 | 2.3176 |
273
+ | 0.0936 | 8.69 | 111500 | 2.3421 |
274
+ | 0.0954 | 8.73 | 112000 | 2.2757 |
275
+ | 0.1131 | 8.77 | 112500 | 2.2388 |
276
+ | 0.0939 | 8.81 | 113000 | 2.3273 |
277
+ | 0.1026 | 8.84 | 113500 | 2.2831 |
278
+ | 0.0842 | 8.88 | 114000 | 2.3705 |
279
+ | 0.1031 | 8.92 | 114500 | 2.3365 |
280
+ | 0.1114 | 8.96 | 115000 | 2.2940 |
281
+ | 0.1145 | 9.0 | 115500 | 2.2661 |
282
+ | 0.0796 | 9.04 | 116000 | 2.3672 |
283
+ | 0.0591 | 9.08 | 116500 | 2.5256 |
284
+ | 0.0724 | 9.12 | 117000 | 2.4654 |
285
+ | 0.0733 | 9.16 | 117500 | 2.4303 |
286
+ | 0.0727 | 9.2 | 118000 | 2.5239 |
287
+ | 0.0649 | 9.23 | 118500 | 2.4831 |
288
+ | 0.0874 | 9.27 | 119000 | 2.4823 |
289
+ | 0.0679 | 9.31 | 119500 | 2.5225 |
290
+ | 0.0798 | 9.35 | 120000 | 2.4684 |
291
+ | 0.0774 | 9.39 | 120500 | 2.4247 |
292
+ | 0.0837 | 9.43 | 121000 | 2.3901 |
293
+ | 0.077 | 9.47 | 121500 | 2.4002 |
294
+ | 0.0591 | 9.51 | 122000 | 2.4534 |
295
+ | 0.0598 | 9.55 | 122500 | 2.4878 |
296
+ | 0.0662 | 9.58 | 123000 | 2.5026 |
297
+ | 0.0716 | 9.62 | 123500 | 2.4876 |
298
+ | 0.0744 | 9.66 | 124000 | 2.4856 |
299
+ | 0.0759 | 9.7 | 124500 | 2.4703 |
300
+ | 0.0713 | 9.74 | 125000 | 2.4614 |
301
+ | 0.0687 | 9.78 | 125500 | 2.4629 |
302
+ | 0.0706 | 9.82 | 126000 | 2.4621 |
303
+ | 0.0702 | 9.86 | 126500 | 2.4521 |
304
+ | 0.0609 | 9.9 | 127000 | 2.4698 |
305
+ | 0.0782 | 9.94 | 127500 | 2.4702 |
306
+ | 0.062 | 9.97 | 128000 | 2.4713 |
307
 
308
 
309
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:35716cc6b70cc776a2bc5919d86ade67df620931ffff6737b2e345058cb3e6e9
3
  size 1112905680
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:200eb62b2ddf181030616f3904b90b7f2a09a8e65b4e7bce28b747712fc0e5a1
3
  size 1112905680
runs/Nov27_22-31-52_Software-AI/events.out.tfevents.1701111713.Software-AI.10944.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f7c996d3a97b3eb23cddaa8a6b655b2f24c04c3cbd4fba0b0fc86494cfcdbfc7
3
+ size 116309
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:5b84dbc5b0737ba461bb4aa389d0a8a86b840e415faf2d5fd9911190ca22a546
3
  size 4219
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ff320f9c9d6b606d08c02765fa1f3d1f5688dcbe354925aa77f75fb3f9795ed5
3
  size 4219