Training complete

Browse files

Files changed (5) hide show

README.md +23 -13
config.json +9 -9
model.safetensors +1 -1
runs/May23_15-45-38_9d80cb969c0a/events.out.tfevents.1716480282.9d80cb969c0a.20961.4 +3 -0
training_args.bin +2 -2

README.md CHANGED Viewed

@@ -20,11 +20,11 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google-bert/bert-base-multilingual-cased](https://huggingface.co/google-bert/bert-base-multilingual-cased) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.1283
-- Precision: 0.8888
-- Recall: 0.9033
-- F1: 0.8960
-- Accuracy: 0.9656
 ## Model description
@@ -44,22 +44,32 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 3e-05
-- train_batch_size: 64
-- eval_batch_size: 64
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: reduce_lr_on_plateau
-- num_epochs: 5
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1     | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:---------:|:------:|:------:|:--------:|
-| No log        | 1.0   | 157  | 0.1525          | 0.8593    | 0.8675 | 0.8634 | 0.9550   |
-| No log        | 2.0   | 314  | 0.1343          | 0.8706    | 0.9106 | 0.8902 | 0.9620   |
-| No log        | 3.0   | 471  | 0.1283          | 0.8888    | 0.9033 | 0.8960 | 0.9656   |
-| 0.1657        | 4.0   | 628  | 0.1483          | 0.8703    | 0.9145 | 0.8918 | 0.9621   |
-| 0.1657        | 5.0   | 785  | 0.1563          | 0.8742    | 0.9141 | 0.8937 | 0.9644   |
 ### Framework versions

 This model is a fine-tuned version of [google-bert/bert-base-multilingual-cased](https://huggingface.co/google-bert/bert-base-multilingual-cased) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.1319
+- Precision: 0.8726
+- Recall: 0.9026
+- F1: 0.8874
+- Accuracy: 0.9619
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 3e-05
+- train_batch_size: 32
+- eval_batch_size: 32
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: reduce_lr_on_plateau
+- num_epochs: 15
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1     | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:---------:|:------:|:------:|:--------:|
+| No log        | 1.0   | 313  | 0.1464          | 0.8797    | 0.8615 | 0.8705 | 0.9587   |
+| 0.2128        | 2.0   | 626  | 0.1319          | 0.8726    | 0.9026 | 0.8874 | 0.9619   |
+| 0.2128        | 3.0   | 939  | 0.1461          | 0.8689    | 0.8924 | 0.8805 | 0.9596   |
+| 0.0783        | 4.0   | 1252 | 0.1529          | 0.8837    | 0.9049 | 0.8942 | 0.9620   |
+| 0.0443        | 5.0   | 1565 | 0.1921          | 0.8657    | 0.9157 | 0.8900 | 0.9615   |
+| 0.0443        | 6.0   | 1878 | 0.1647          | 0.8975    | 0.9224 | 0.9098 | 0.9685   |
+| 0.0201        | 7.0   | 2191 | 0.1725          | 0.8904    | 0.9183 | 0.9041 | 0.9674   |
+| 0.0098        | 8.0   | 2504 | 0.1766          | 0.8917    | 0.9199 | 0.9056 | 0.9682   |
+| 0.0098        | 9.0   | 2817 | 0.1756          | 0.8926    | 0.9202 | 0.9062 | 0.9686   |
+| 0.007         | 10.0  | 3130 | 0.1763          | 0.8916    | 0.9189 | 0.9051 | 0.9684   |
+| 0.007         | 11.0  | 3443 | 0.1772          | 0.8907    | 0.9183 | 0.9043 | 0.9682   |
+| 0.007         | 12.0  | 3756 | 0.1773          | 0.8895    | 0.9173 | 0.9032 | 0.9680   |
+| 0.0067        | 13.0  | 4069 | 0.1775          | 0.8892    | 0.9170 | 0.9029 | 0.9680   |
+| 0.0067        | 14.0  | 4382 | 0.1775          | 0.8897    | 0.9170 | 0.9032 | 0.9679   |
+| 0.0062        | 15.0  | 4695 | 0.1775          | 0.8897    | 0.9170 | 0.9032 | 0.9679   |
 ### Framework versions

config.json CHANGED Viewed

@@ -11,22 +11,22 @@
   "hidden_size": 768,
   "id2label": {
     "0": "O",
-    "1": "B-PER",
-    "2": "I-PER",
     "3": "B-LOC",
     "4": "I-LOC",
-    "5": "B-ORG",
-    "6": "I-ORG"
   },
   "initializer_range": 0.02,
   "intermediate_size": 3072,
   "label2id": {
     "B-LOC": 3,
-    "B-ORG": 5,
-    "B-PER": 1,
     "I-LOC": 4,
-    "I-ORG": 6,
-    "I-PER": 2,
     "O": 0
   },
   "layer_norm_eps": 1e-12,
@@ -42,7 +42,7 @@
   "pooler_type": "first_token_transform",
   "position_embedding_type": "absolute",
   "torch_dtype": "float32",
-  "transformers_version": "4.40.2",
   "type_vocab_size": 2,
   "use_cache": true,
   "vocab_size": 119547

   "hidden_size": 768,
   "id2label": {
     "0": "O",
+    "1": "B-ORG",
+    "2": "I-ORG",
     "3": "B-LOC",
     "4": "I-LOC",
+    "5": "B-PER",
+    "6": "I-PER"
   },
   "initializer_range": 0.02,
   "intermediate_size": 3072,
   "label2id": {
     "B-LOC": 3,
+    "B-ORG": 1,
+    "B-PER": 5,
     "I-LOC": 4,
+    "I-ORG": 2,
+    "I-PER": 6,
     "O": 0
   },
   "layer_norm_eps": 1e-12,
   "pooler_type": "first_token_transform",
   "position_embedding_type": "absolute",
   "torch_dtype": "float32",
+  "transformers_version": "4.41.0",
   "type_vocab_size": 2,
   "use_cache": true,
   "vocab_size": 119547

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:b4e185154dc8f18bc11da79db5384fecba22e9db276e215a7d8c94e9599d11e9
 size 709096284

 version https://git-lfs.github.com/spec/v1
+oid sha256:42150f376fc5de08f0b0ea7b48f3e1c6c0ddf35f5edf41722d97ebc6ba55de9b
 size 709096284

runs/May23_15-45-38_9d80cb969c0a/events.out.tfevents.1716480282.9d80cb969c0a.20961.4 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:dfe19decf16cfb62f8fa8fa27cacafcd5d3bba683220218198a9273dfa5ea788
+size 560

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:7ffe3a8c9513ba194945196091f9c43e359ea408a82468c1cd63dc6aa033b484
-size 5048

 version https://git-lfs.github.com/spec/v1
+oid sha256:ec6bead204e8da5a4f809eeb3956d4188275a564f78ba274faaadeaf7b8730a6
+size 5176