End of training

Files changed (5)

README.md +11 -51
model.safetensors +1 -1
runs/Mar22_09-07-04_81a9b4ddc734/events.out.tfevents.1711098425.81a9b4ddc734.13612.2 +3 -0
runs/Mar22_09-07-04_81a9b4ddc734/events.out.tfevents.1711100104.81a9b4ddc734.13612.3 +3 -0
training_args.bin +1 -1

README.md CHANGED

@@ -20,11 +20,11 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [ai-forever/ruBert-base](https://huggingface.co/ai-forever/ruBert-base) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.8209
-- Precision: 0.5
-- Recall: 0.0071
-- F1: 0.0140
-- Accuracy: 0.2141
 ## Model description
@@ -49,57 +49,17 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 45
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1     | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:---------:|:------:|:------:|:--------:|
-| No log        | 1.0   | 1    | 3.6999          | 0.0261    | 0.0260 | 0.0260 | 0.0107   |
-| No log        | 2.0   | 2    | 3.4111          | 0.0382    | 0.0355 | 0.0368 | 0.0043   |
-| No log        | 3.0   | 3    | 3.1202          | 0.0645    | 0.0473 | 0.0546 | 0.0043   |
-| No log        | 4.0   | 4    | 2.8246          | 0.0625    | 0.0307 | 0.0412 | 0.0021   |
-| No log        | 5.0   | 5    | 2.5289          | 0.0426    | 0.0095 | 0.0155 | 0.0      |
-| No log        | 6.0   | 6    | 2.2452          | 0.0       | 0.0    | 0.0    | 0.0      |
-| No log        | 7.0   | 7    | 1.9900          | 0.0       | 0.0    | 0.0    | 0.0      |
-| No log        | 8.0   | 8    | 1.7794          | 0.0       | 0.0    | 0.0    | 0.0      |
-| No log        | 9.0   | 9    | 1.6144          | 0.0       | 0.0    | 0.0    | 0.0      |
-| No log        | 10.0  | 10   | 1.4867          | 0.0       | 0.0    | 0.0    | 0.0      |
-| No log        | 11.0  | 11   | 1.3877          | 0.0       | 0.0    | 0.0    | 0.0      |
-| No log        | 12.0  | 12   | 1.3110          | 0.0       | 0.0    | 0.0    | 0.0      |
-| No log        | 13.0  | 13   | 1.2515          | 0.0       | 0.0    | 0.0    | 0.0      |
-| No log        | 14.0  | 14   | 1.2053          | 0.0       | 0.0    | 0.0    | 0.0      |
-| No log        | 15.0  | 15   | 1.1687          | 0.0       | 0.0    | 0.0    | 0.0      |
-| No log        | 16.0  | 16   | 1.1389          | 0.0       | 0.0    | 0.0    | 0.0      |
-| No log        | 17.0  | 17   | 1.1135          | 0.0       | 0.0    | 0.0    | 0.0      |
-| No log        | 18.0  | 18   | 1.0908          | 0.0       | 0.0    | 0.0    | 0.0      |
-| No log        | 19.0  | 19   | 1.0698          | 0.0       | 0.0    | 0.0    | 0.0      |
-| No log        | 20.0  | 20   | 1.0500          | 0.0       | 0.0    | 0.0    | 0.0      |
-| No log        | 21.0  | 21   | 1.0312          | 0.0       | 0.0    | 0.0    | 0.0      |
-| No log        | 22.0  | 22   | 1.0132          | 0.0       | 0.0    | 0.0    | 0.0      |
-| No log        | 23.0  | 23   | 0.9962          | 0.0       | 0.0    | 0.0    | 0.0      |
-| No log        | 24.0  | 24   | 0.9801          | 0.0       | 0.0    | 0.0    | 0.0      |
-| No log        | 25.0  | 25   | 0.9648          | 0.0       | 0.0    | 0.0    | 0.0      |
-| No log        | 26.0  | 26   | 0.9504          | 0.0       | 0.0    | 0.0    | 0.0      |
-| No log        | 27.0  | 27   | 0.9368          | 0.0       | 0.0    | 0.0    | 0.0      |
-| No log        | 28.0  | 28   | 0.9241          | 0.0       | 0.0    | 0.0    | 0.0043   |
-| No log        | 29.0  | 29   | 0.9121          | 0.0       | 0.0    | 0.0    | 0.0128   |
-| No log        | 30.0  | 30   | 0.9009          | 0.0       | 0.0    | 0.0    | 0.0236   |
-| No log        | 31.0  | 31   | 0.8905          | 0.0       | 0.0    | 0.0    | 0.0385   |
-| No log        | 32.0  | 32   | 0.8809          | 0.0       | 0.0    | 0.0    | 0.0578   |
-| No log        | 33.0  | 33   | 0.8720          | 0.0       | 0.0    | 0.0    | 0.0792   |
-| No log        | 34.0  | 34   | 0.8639          | 0.0       | 0.0    | 0.0    | 0.1006   |
-| No log        | 35.0  | 35   | 0.8566          | 0.0       | 0.0    | 0.0    | 0.1199   |
-| No log        | 36.0  | 36   | 0.8500          | 0.0       | 0.0    | 0.0    | 0.1478   |
-| No log        | 37.0  | 37   | 0.8440          | 0.5       | 0.0024 | 0.0047 | 0.1713   |
-| No log        | 38.0  | 38   | 0.8388          | 0.5       | 0.0024 | 0.0047 | 0.1863   |
-| No log        | 39.0  | 39   | 0.8343          | 0.5       | 0.0024 | 0.0047 | 0.1949   |
-| No log        | 40.0  | 40   | 0.8304          | 0.5       | 0.0047 | 0.0094 | 0.2034   |
-| No log        | 41.0  | 41   | 0.8272          | 0.5       | 0.0047 | 0.0094 | 0.2099   |
-| No log        | 42.0  | 42   | 0.8247          | 0.5       | 0.0071 | 0.0140 | 0.2120   |
-| No log        | 43.0  | 43   | 0.8228          | 0.5       | 0.0071 | 0.0140 | 0.2120   |
-| No log        | 44.0  | 44   | 0.8215          | 0.5       | 0.0071 | 0.0140 | 0.2141   |
-| No log        | 45.0  | 45   | 0.8209          | 0.5       | 0.0071 | 0.0140 | 0.2141   |
 ### Framework versions

 This model is a fine-tuned version of [ai-forever/ruBert-base](https://huggingface.co/ai-forever/ruBert-base) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.1544
+- Precision: 0.8561
+- Recall: 0.8723
+- F1: 0.8642
+- Accuracy: 0.8822
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 5
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1     | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:---------:|:------:|:------:|:--------:|
+| No log        | 1.0   | 311  | 0.1686          | 0.8380    | 0.8440 | 0.8410 | 0.8565   |
+| 0.0464        | 2.0   | 622  | 0.1597          | 0.8462    | 0.8582 | 0.8521 | 0.8715   |
+| 0.0464        | 3.0   | 933  | 0.1544          | 0.8561    | 0.8723 | 0.8642 | 0.8822   |
+| 0.0046        | 4.0   | 1244 | 0.1564          | 0.8469    | 0.8629 | 0.8548 | 0.8737   |
+| 0.0029        | 5.0   | 1555 | 0.1556          | 0.8538    | 0.8700 | 0.8618 | 0.8801   |
 ### Framework versions
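
As a quick sanity check on the updated results table, the F1 column should equal the harmonic mean of precision and recall, and the headline metrics (loss 0.1544, F1 0.8642) should correspond to the epoch with the lowest validation loss. The sketch below uses row values copied from the table above; the helper names `f1_from`, `rows_are_consistent`, and `best_epoch` and the tolerance are assumptions made for illustration, not part of the training code. The tolerance is there because the table reports precision and recall rounded to four decimals.

```python
# Rows copied from the updated training results table:
# (epoch, validation_loss, precision, recall, f1)
ROWS = [
    (1, 0.1686, 0.8380, 0.8440, 0.8410),
    (2, 0.1597, 0.8462, 0.8582, 0.8521),
    (3, 0.1544, 0.8561, 0.8723, 0.8642),
    (4, 0.1564, 0.8469, 0.8629, 0.8548),
    (5, 0.1556, 0.8538, 0.8700, 0.8618),
]


def f1_from(precision, recall):
    """Harmonic mean of precision and recall (0.0 when both are zero)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def rows_are_consistent(rows, tol=5e-4):
    """True if every reported F1 matches the value recomputed from P and R.

    `tol` is a hypothetical tolerance that absorbs four-decimal rounding.
    """
    return all(abs(f1_from(p, r) - f1) < tol for _, _, p, r, f1 in rows)


def best_epoch(rows):
    """Return the row with the lowest validation loss."""
    return min(rows, key=lambda row: row[1])


if __name__ == "__main__":
    print("F1 column consistent:", rows_are_consistent(ROWS))
    print("Lowest validation loss at epoch:", best_epoch(ROWS)[0])
```

Under these assumptions, the lowest-loss row is epoch 3, whose metrics match the evaluation results listed at the top of the README. Whether the run actually restored that checkpoint at the end of training is not stated in the diff.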

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:cc8bef7ffa6304e30b98cbd77ad108e809957d44c5c544a446026ac33b25ab44
 size 711062560

 version https://git-lfs.github.com/spec/v1
+oid sha256:b4d9283275ac3b28d4e35726de5a483b6d235a68299dd6ffdc274781138c1522
 size 711062560
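
The binary files in this commit are tracked as Git LFS pointers, each a three-line text stub with a `version`, an `oid` (hash algorithm and digest), and a `size` in bytes. The sketch below parses the two `model.safetensors` pointers shown above and compares them; `parse_lfs_pointer` is a hypothetical helper written for this illustration, not a function from any Git LFS tooling.

```python
def parse_lfs_pointer(text):
    """Parse a Git LFS pointer stub into a small dict.

    Hypothetical helper: it assumes one "key value" pair per line,
    as in the pointers shown in this diff.
    """
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Pointer contents copied from the model.safetensors diff above.
OLD_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:cc8bef7ffa6304e30b98cbd77ad108e809957d44c5c544a446026ac33b25ab44
size 711062560"""

NEW_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:b4d9283275ac3b28d4e35726de5a483b6d235a68299dd6ffdc274781138c1522
size 711062560"""

if __name__ == "__main__":
    old = parse_lfs_pointer(OLD_POINTER)
    new = parse_lfs_pointer(NEW_POINTER)
    print("same size:", old["size"] == new["size"])
    print("same content hash:", old["digest"] == new["digest"])
```

The size is unchanged while the digest differs, which is what one would expect when the weights are retrained but the architecture and tensor shapes stay the same.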

runs/Mar22_09-07-04_81a9b4ddc734/events.out.tfevents.1711098425.81a9b4ddc734.13612.2 ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:a55635242f86bc3464d779fb288cea35d5e4b9eb156a2551ffa3b30aba28c335
+size 10409

runs/Mar22_09-07-04_81a9b4ddc734/events.out.tfevents.1711100104.81a9b4ddc734.13612.3 ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:2db1214e028668cbd8749b4dea4bfe6727ee411e6657e58a1d6a8b8ab008b82c
+size 560

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:93e843f9bb13cce47a7fee3ecb79652740dc733dc8f7a6a98dc56fe2d8f07a82
 size 4920

 version https://git-lfs.github.com/spec/v1
+oid sha256:649bd3e5422e5c107fa0e02304498cf8371f173894beff1d4048532b4b8512e4
 size 4920