End of training

Browse files

Files changed (5) hide show

README.md +21 -16
config.json +1 -1
pytorch_model.bin +1 -1
tokenizer_config.json +43 -0
training_args.bin +2 -2

README.md CHANGED Viewed

@@ -20,11 +20,11 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google/muril-base-cased](https://huggingface.co/google/muril-base-cased) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.0831
-- Precision: 0.7350
-- Recall: 0.7591
-- F1: 0.7469
-- Accuracy: 0.9843
 ## Model description
@@ -49,22 +49,27 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 5
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1     | Accuracy |
-|:-------------:|:-----:|:----:|:---------------:|:---------:|:------:|:------:|:--------:|
-| 0.3465        | 1.0   | 1613 | 0.2747          | 0.0       | 0.0    | 0.0    | 0.9551   |
-| 0.1642        | 2.0   | 3226 | 0.1273          | 0.6436    | 0.5216 | 0.5762 | 0.9758   |
-| 0.1053        | 3.0   | 4839 | 0.0986          | 0.7257    | 0.7156 | 0.7206 | 0.9824   |
-| 0.0863        | 4.0   | 6452 | 0.0854          | 0.7166    | 0.7620 | 0.7386 | 0.9837   |
-| 0.0705        | 5.0   | 8065 | 0.0831          | 0.7350    | 0.7591 | 0.7469 | 0.9843   |
 ### Framework versions
-- Transformers 4.33.0
-- Pytorch 2.0.0
 - Datasets 2.14.5
-- Tokenizers 0.13.3

 This model is a fine-tuned version of [google/muril-base-cased](https://huggingface.co/google/muril-base-cased) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.0654
+- Precision: 0.7923
+- Recall: 0.8113
+- F1: 0.8017
+- Accuracy: 0.9859
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 10
 ### Training results
+| Training Loss | Epoch | Step  | Validation Loss | Precision | Recall | F1     | Accuracy |
+|:-------------:|:-----:|:-----:|:---------------:|:---------:|:------:|:------:|:--------:|
+| 0.3366        | 1.0   | 1613  | 0.2698          | 0.0       | 0.0    | 0.0    | 0.9551   |
+| 0.1552        | 2.0   | 3226  | 0.1180          | 0.7114    | 0.4972 | 0.5853 | 0.9763   |
+| 0.0959        | 3.0   | 4839  | 0.0904          | 0.7262    | 0.7161 | 0.7211 | 0.9829   |
+| 0.0708        | 4.0   | 6452  | 0.0751          | 0.7679    | 0.7498 | 0.7587 | 0.9840   |
+| 0.0474        | 5.0   | 8065  | 0.0672          | 0.7907    | 0.7731 | 0.7818 | 0.9854   |
+| 0.0367        | 6.0   | 9678  | 0.0627          | 0.7870    | 0.8045 | 0.7957 | 0.9856   |
+| 0.0308        | 7.0   | 11291 | 0.0598          | 0.7942    | 0.7915 | 0.7928 | 0.9859   |
+| 0.0247        | 8.0   | 12904 | 0.0612          | 0.7891    | 0.8123 | 0.8005 | 0.9860   |
+| 0.0202        | 9.0   | 14517 | 0.0666          | 0.8015    | 0.8015 | 0.8015 | 0.9860   |
+| 0.0181        | 10.0  | 16130 | 0.0654          | 0.7923    | 0.8113 | 0.8017 | 0.9859   |
 ### Framework versions
+- Transformers 4.34.0
+- Pytorch 2.0.1+cu118
 - Datasets 2.14.5
+- Tokenizers 0.14.1

config.json CHANGED Viewed

@@ -37,7 +37,7 @@
   "pad_token_id": 0,
   "position_embedding_type": "absolute",
   "torch_dtype": "float32",
-  "transformers_version": "4.33.0",
   "type_vocab_size": 2,
   "use_cache": true,
   "vocab_size": 197285

   "pad_token_id": 0,
   "position_embedding_type": "absolute",
   "torch_dtype": "float32",
+  "transformers_version": "4.34.0",
   "type_vocab_size": 2,
   "use_cache": true,
   "vocab_size": 197285

pytorch_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:18273be8d8b89175cd7eeae82edf4df80907d02ade69646ef8377694568ee7bb
 size 947951785

 version https://git-lfs.github.com/spec/v1
+oid sha256:698155f64beb87f7bed2eeba6663828b96d5470f3a101f61fce4fc1a2ddfe0dd
 size 947951785

tokenizer_config.json CHANGED Viewed

@@ -1,4 +1,47 @@
 {
   "clean_up_tokenization_spaces": true,
   "cls_token": "[CLS]",
   "do_basic_tokenize": true,

 {
+  "added_tokens_decoder": {
+    "0": {
+      "content": "[PAD]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "100": {
+      "content": "[UNK]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "103": {
+      "content": "[MASK]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "104": {
+      "content": "[CLS]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "105": {
+      "content": "[SEP]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    }
+  },
+  "additional_special_tokens": [],
   "clean_up_tokenization_spaces": true,
   "cls_token": "[CLS]",
   "do_basic_tokenize": true,

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:cf0a615c42e8086b40425385deb056f4a20e62426ee49ef7712f3b8c59c36347
-size 4027

 version https://git-lfs.github.com/spec/v1
+oid sha256:2df32b45fad79c2492c92fac9a7463b4ae1807efe81ff8a01fbe3b4addd69f03
+size 4091