clincolnoz committed
Commit 0347211
1 Parent(s): 4fb0ffe

correct weights

README.md CHANGED
@@ -6,32 +6,32 @@ metrics:
  - f1
  - accuracy
  model-index:
- - name: final-lr2e-5-bs16-fullprecision
+ - name: final-lr2e-5-bs16-fp16-2
  results: []
  ---

  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
  should probably proofread and complete it, then remove this comment. -->

- # final-lr2e-5-bs16-fullprecision
+ # final-lr2e-5-bs16-fp16-2

  This model is a fine-tuned version of [bert-base-uncased](https://huggingface.co/bert-base-uncased) on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.4633
- - F1 Macro: 0.8276
- - F1 Weighted: 0.8754
- - F1: 0.7348
- - Accuracy: 0.8775
- - Confusion Matrix: [[2831 199]
- [ 291 679]]
- - Confusion Matrix Norm: [[0.93432343 0.06567657]
- [0.3 0.7 ]]
- - Classification Report: precision recall f1-score support
- 0 0.906791 0.934323 0.920351 3030.0000
- 1 0.773349 0.700000 0.734848 970.0000
- accuracy 0.877500 0.877500 0.877500 0.8775
- macro avg 0.840070 0.817162 0.827600 4000.0000
- weighted avg 0.874431 0.877500 0.875367 4000.0000
+ - Loss: 0.4823
+ - F1 Macro: 0.8301
+ - F1 Weighted: 0.8772
+ - F1: 0.7388
+ - Accuracy: 0.8792
+ - Confusion Matrix: [[2834 196]
+ [ 287 683]]
+ - Confusion Matrix Norm: [[0.93531353 0.06468647]
+ [0.29587629 0.70412371]]
+ - Classification Report: precision recall f1-score support
+ 0 0.908042 0.935314 0.921476 3030.00000
+ 1 0.777019 0.704124 0.738778 970.00000
+ accuracy 0.879250 0.879250 0.879250 0.87925
+ macro avg 0.842531 0.819719 0.830127 4000.00000
+ weighted avg 0.876269 0.879250 0.877172 4000.00000

  ## Model description

@@ -57,35 +57,36 @@ The following hyperparameters were used during training:
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
  - num_epochs: 3.0
+ - mixed_precision_training: Native AMP

  ### Training results

  | Training Loss | Epoch | Step | Validation Loss | F1 Macro | F1 Weighted | F1 | Accuracy | Confusion Matrix | Confusion Matrix Norm | Classification Report |
  |:-------------:|:-----:|:----:|:---------------:|:--------:|:-----------:|:------:|:--------:|:--------------------------:|:--------------------------------------------------:|:-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------:|
- | 0.3362 | 1.0 | 1000 | 0.3034 | 0.8182 | 0.8693 | 0.7191 | 0.8722 | [[2835 195]
- [ 316 654]] | [[0.93564356 0.06435644]
- [0.3257732 0.6742268 ]] | precision recall f1-score support
- 0 0.899714 0.935644 0.917327 3030.00000
- 1 0.770318 0.674227 0.719076 970.00000
- accuracy 0.872250 0.872250 0.872250 0.87225
- macro avg 0.835016 0.804935 0.818202 4000.00000
- weighted avg 0.868336 0.872250 0.869251 4000.00000 |
- | 0.2352 | 2.0 | 2000 | 0.3730 | 0.8270 | 0.8730 | 0.7374 | 0.8732 | [[2781 249]
- [ 258 712]] | [[0.91782178 0.08217822]
- [0.26597938 0.73402062]] | precision recall f1-score support
- 0 0.915104 0.917822 0.916461 3030.00000
- 1 0.740895 0.734021 0.737442 970.00000
- accuracy 0.873250 0.873250 0.873250 0.87325
- macro avg 0.827999 0.825921 0.826951 4000.00000
- weighted avg 0.872858 0.873250 0.873049 4000.00000 |
- | 0.1566 | 3.0 | 3000 | 0.4633 | 0.8276 | 0.8754 | 0.7348 | 0.8775 | [[2831 199]
- [ 291 679]] | [[0.93432343 0.06567657]
- [0.3 0.7 ]] | precision recall f1-score support
- 0 0.906791 0.934323 0.920351 3030.0000
- 1 0.773349 0.700000 0.734848 970.0000
- accuracy 0.877500 0.877500 0.877500 0.8775
- macro avg 0.840070 0.817162 0.827600 4000.0000
- weighted avg 0.874431 0.877500 0.875367 4000.0000 |
+ | 0.3333 | 1.0 | 1000 | 0.3064 | 0.8165 | 0.8672 | 0.7181 | 0.8692 | [[2811 219]
+ [ 304 666]] | [[0.92772277 0.07227723]
+ [0.31340206 0.68659794]] | precision recall f1-score support
+ 0 0.902408 0.927723 0.914890 3030.00000
+ 1 0.752542 0.686598 0.718059 970.00000
+ accuracy 0.869250 0.869250 0.869250 0.86925
+ macro avg 0.827475 0.807160 0.816475 4000.00000
+ weighted avg 0.866065 0.869250 0.867159 4000.00000 |
+ | 0.2271 | 2.0 | 2000 | 0.3905 | 0.8238 | 0.8708 | 0.7326 | 0.871 | [[2777 253]
+ [ 263 707]] | [[0.91650165 0.08349835]
+ [0.27113402 0.72886598]] | precision recall f1-score support
+ 0 0.913487 0.916502 0.914992 3030.000
+ 1 0.736458 0.728866 0.732642 970.000
+ accuracy 0.871000 0.871000 0.871000 0.871
+ macro avg 0.824973 0.822684 0.823817 4000.000
+ weighted avg 0.870557 0.871000 0.870772 4000.000 |
+ | 0.1435 | 3.0 | 3000 | 0.4823 | 0.8301 | 0.8772 | 0.7388 | 0.8792 | [[2834 196]
+ [ 287 683]] | [[0.93531353 0.06468647]
+ [0.29587629 0.70412371]] | precision recall f1-score support
+ 0 0.908042 0.935314 0.921476 3030.00000
+ 1 0.777019 0.704124 0.738778 970.00000
+ accuracy 0.879250 0.879250 0.879250 0.87925
+ macro avg 0.842531 0.819719 0.830127 4000.00000
+ weighted avg 0.876269 0.879250 0.877172 4000.00000 |


  ### Framework versions
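
The card reports F1 (positive class), F1 Macro, F1 Weighted, Accuracy, a raw and a row-normalised confusion matrix, and a full classification report. The commit does not include the `compute_metrics` code that produced these numbers, so the following is only a minimal sketch, assuming standard scikit-learn definitions; the function name and signature are illustrative and not taken from the repository.

```python
# Hypothetical sketch: metrics with the same names and shapes as those reported
# in the model card, computed with scikit-learn. The repository's actual
# compute_metrics function is not part of this commit.
import numpy as np
from sklearn.metrics import (
    accuracy_score,
    classification_report,
    confusion_matrix,
    f1_score,
)

def compute_metrics(labels: np.ndarray, preds: np.ndarray) -> dict:
    cm = confusion_matrix(labels, preds)
    # "Confusion Matrix Norm" appears to be row-normalised: each true class sums to 1.
    cm_norm = cm / cm.sum(axis=1, keepdims=True)
    return {
        "f1": f1_score(labels, preds),                        # positive class only
        "f1_macro": f1_score(labels, preds, average="macro"),
        "f1_weighted": f1_score(labels, preds, average="weighted"),
        "accuracy": accuracy_score(labels, preds),
        "confusion_matrix": str(cm),
        "confusion_matrix_norm": str(cm_norm),
        "classification_report": classification_report(labels, preds, digits=6),
    }
```

As a spot check, row-normalising the new confusion matrix gives 2834 / 3030 ≈ 0.9353 and 683 / 970 ≈ 0.7041, matching the reported Confusion Matrix Norm.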
all_results.json CHANGED
@@ -1,20 +1,20 @@
  {
  "epoch": 3.0,
- "eval_accuracy": 0.8775,
- "eval_classification_report": " precision recall f1-score support\n0 0.906791 0.934323 0.920351 3030.0000\n1 0.773349 0.700000 0.734848 970.0000\naccuracy 0.877500 0.877500 0.877500 0.8775\nmacro avg 0.840070 0.817162 0.827600 4000.0000\nweighted avg 0.874431 0.877500 0.875367 4000.0000",
- "eval_confusion_matrix": "[[2831 199]\n [ 291 679]]",
- "eval_confusion_matrix_norm": "[[0.93432343 0.06567657]\n [0.3 0.7 ]]",
- "eval_f1": 0.7348484848484848,
- "eval_f1_macro": 0.8275997950900422,
- "eval_f1_weighted": 0.8753667198644444,
- "eval_loss": 0.4632544219493866,
- "eval_runtime": 16.6824,
+ "eval_accuracy": 0.87925,
+ "eval_classification_report": " precision recall f1-score support\n0 0.908042 0.935314 0.921476 3030.00000\n1 0.777019 0.704124 0.738778 970.00000\naccuracy 0.879250 0.879250 0.879250 0.87925\nmacro avg 0.842531 0.819719 0.830127 4000.00000\nweighted avg 0.876269 0.879250 0.877172 4000.00000",
+ "eval_confusion_matrix": "[[2834 196]\n [ 287 683]]",
+ "eval_confusion_matrix_norm": "[[0.93531353 0.06468647]\n [0.29587629 0.70412371]]",
+ "eval_f1": 0.7387777176852353,
+ "eval_f1_macro": 0.8301269502098749,
+ "eval_f1_weighted": 0.8771718049600645,
+ "eval_loss": 0.4823172092437744,
+ "eval_runtime": 9.6229,
  "eval_samples": 4000,
- "eval_samples_per_second": 239.773,
- "eval_steps_per_second": 14.986,
- "train_loss": 0.2591003138224284,
- "train_runtime": 651.1299,
+ "eval_samples_per_second": 415.677,
+ "eval_steps_per_second": 25.98,
+ "train_loss": 0.2520793151855469,
+ "train_runtime": 430.4281,
  "train_samples": 16000,
- "train_samples_per_second": 73.718,
- "train_steps_per_second": 4.607
+ "train_samples_per_second": 111.517,
+ "train_steps_per_second": 6.97
  }
eval_results.json CHANGED
@@ -1,15 +1,15 @@
  {
  "epoch": 3.0,
- "eval_accuracy": 0.8775,
- "eval_classification_report": " precision recall f1-score support\n0 0.906791 0.934323 0.920351 3030.0000\n1 0.773349 0.700000 0.734848 970.0000\naccuracy 0.877500 0.877500 0.877500 0.8775\nmacro avg 0.840070 0.817162 0.827600 4000.0000\nweighted avg 0.874431 0.877500 0.875367 4000.0000",
- "eval_confusion_matrix": "[[2831 199]\n [ 291 679]]",
- "eval_confusion_matrix_norm": "[[0.93432343 0.06567657]\n [0.3 0.7 ]]",
- "eval_f1": 0.7348484848484848,
- "eval_f1_macro": 0.8275997950900422,
- "eval_f1_weighted": 0.8753667198644444,
- "eval_loss": 0.4632544219493866,
- "eval_runtime": 16.6824,
+ "eval_accuracy": 0.87925,
+ "eval_classification_report": " precision recall f1-score support\n0 0.908042 0.935314 0.921476 3030.00000\n1 0.777019 0.704124 0.738778 970.00000\naccuracy 0.879250 0.879250 0.879250 0.87925\nmacro avg 0.842531 0.819719 0.830127 4000.00000\nweighted avg 0.876269 0.879250 0.877172 4000.00000",
+ "eval_confusion_matrix": "[[2834 196]\n [ 287 683]]",
+ "eval_confusion_matrix_norm": "[[0.93531353 0.06468647]\n [0.29587629 0.70412371]]",
+ "eval_f1": 0.7387777176852353,
+ "eval_f1_macro": 0.8301269502098749,
+ "eval_f1_weighted": 0.8771718049600645,
+ "eval_loss": 0.4823172092437744,
+ "eval_runtime": 9.6229,
  "eval_samples": 4000,
- "eval_samples_per_second": 239.773,
- "eval_steps_per_second": 14.986
+ "eval_samples_per_second": 415.677,
+ "eval_steps_per_second": 25.98
  }
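
The timing fields in eval_results.json are related in the obvious way; a quick check that the new throughput is consistent with the new runtime:

```python
# Sanity check on the updated eval timing: throughput = samples / runtime.
eval_samples = 4000
eval_runtime = 9.6229
print(eval_samples / eval_runtime)  # ≈ 415.68, consistent with the reported
                                    # eval_samples_per_second of 415.677
                                    # (the stored runtime is rounded)
```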
predict_results_None.txt CHANGED
The diff for this file is too large to render.
 
pytorch_model.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:21b0d3a5550e58b7ef30c0020bf27b3cb40aa1df1936fc53f798b5f45ef0f41b
+ oid sha256:4e9f8d462d8d9118a72bd13f278e2a7f6f042f093ccece0b57016b19572f8c56
  size 438007925
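
pytorch_model.bin keeps its size (438007925 bytes) but points to a new LFS object, so only the weight values changed, in line with the "correct weights" commit message. Below is a minimal sketch of loading the updated checkpoint for sequence classification; the repository id is an assumption pieced together from the committer and model name, and a path to a local clone works just as well.

```python
# Minimal sketch: load the updated fine-tuned checkpoint.
# "clincolnoz/final-lr2e-5-bs16-fp16-2" is an assumed repo id, not confirmed
# by the diff; replace it with the real repository path or a local directory.
from transformers import AutoModelForSequenceClassification, AutoTokenizer

repo_id = "clincolnoz/final-lr2e-5-bs16-fp16-2"  # assumption
tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForSequenceClassification.from_pretrained(repo_id)

inputs = tokenizer("example text to classify", return_tensors="pt")
logits = model(**inputs).logits
print(logits.argmax(dim=-1).item())  # predicted class index
```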
train_results.json CHANGED
@@ -1,8 +1,8 @@
  {
  "epoch": 3.0,
- "train_loss": 0.2591003138224284,
- "train_runtime": 651.1299,
+ "train_loss": 0.2520793151855469,
+ "train_runtime": 430.4281,
  "train_samples": 16000,
- "train_samples_per_second": 73.718,
- "train_steps_per_second": 4.607
+ "train_samples_per_second": 111.517,
+ "train_steps_per_second": 6.97
  }
trainer_state.json CHANGED
@@ -9,93 +9,93 @@
  "log_history": [
  {
  "epoch": 0.5,
- "learning_rate": 1.6666666666666667e-05,
- "loss": 0.4103,
+ "learning_rate": 1.6673333333333335e-05,
+ "loss": 0.4114,
  "step": 500
  },
  {
  "epoch": 1.0,
- "learning_rate": 1.3333333333333333e-05,
- "loss": 0.3362,
+ "learning_rate": 1.3340000000000001e-05,
+ "loss": 0.3333,
  "step": 1000
  },
  {
  "epoch": 1.0,
- "eval_accuracy": 0.87225,
- "eval_classification_report": " precision recall f1-score support\n0 0.899714 0.935644 0.917327 3030.00000\n1 0.770318 0.674227 0.719076 970.00000\naccuracy 0.872250 0.872250 0.872250 0.87225\nmacro avg 0.835016 0.804935 0.818202 4000.00000\nweighted avg 0.868336 0.872250 0.869251 4000.00000",
- "eval_confusion_matrix": "[[2835 195]\n [ 316 654]]",
- "eval_confusion_matrix_norm": "[[0.93564356 0.06435644]\n [0.3257732 0.6742268 ]]",
- "eval_f1": 0.7190764156129743,
- "eval_f1_macro": 0.8182018544656038,
- "eval_f1_weighted": 0.8692514554747078,
- "eval_loss": 0.3033996522426605,
- "eval_runtime": 16.7014,
- "eval_samples_per_second": 239.501,
- "eval_steps_per_second": 14.969,
+ "eval_accuracy": 0.86925,
+ "eval_classification_report": " precision recall f1-score support\n0 0.902408 0.927723 0.914890 3030.00000\n1 0.752542 0.686598 0.718059 970.00000\naccuracy 0.869250 0.869250 0.869250 0.86925\nmacro avg 0.827475 0.807160 0.816475 4000.00000\nweighted avg 0.866065 0.869250 0.867159 4000.00000",
+ "eval_confusion_matrix": "[[2811 219]\n [ 304 666]]",
+ "eval_confusion_matrix_norm": "[[0.92772277 0.07227723]\n [0.31340206 0.68659794]]",
+ "eval_f1": 0.7180592991913748,
+ "eval_f1_macro": 0.8164747268943042,
+ "eval_f1_weighted": 0.8671586721613128,
+ "eval_loss": 0.3063889145851135,
+ "eval_runtime": 9.6316,
+ "eval_samples_per_second": 415.3,
+ "eval_steps_per_second": 25.956,
  "step": 1000
  },
  {
  "epoch": 1.5,
- "learning_rate": 1e-05,
- "loss": 0.2538,
+ "learning_rate": 1.0013333333333335e-05,
+ "loss": 0.2455,
  "step": 1500
  },
  {
  "epoch": 2.0,
- "learning_rate": 6.666666666666667e-06,
- "loss": 0.2352,
+ "learning_rate": 6.6866666666666665e-06,
+ "loss": 0.2271,
  "step": 2000
  },
  {
  "epoch": 2.0,
- "eval_accuracy": 0.87325,
- "eval_classification_report": " precision recall f1-score support\n0 0.915104 0.917822 0.916461 3030.00000\n1 0.740895 0.734021 0.737442 970.00000\naccuracy 0.873250 0.873250 0.873250 0.87325\nmacro avg 0.827999 0.825921 0.826951 4000.00000\nweighted avg 0.872858 0.873250 0.873049 4000.00000",
- "eval_confusion_matrix": "[[2781 249]\n [ 258 712]]",
- "eval_confusion_matrix_norm": "[[0.91782178 0.08217822]\n [0.26597938 0.73402062]]",
- "eval_f1": 0.737441740031072,
- "eval_f1_macro": 0.826951220979451,
- "eval_f1_weighted": 0.8730486036678663,
- "eval_loss": 0.37301740050315857,
- "eval_runtime": 16.7066,
- "eval_samples_per_second": 239.426,
- "eval_steps_per_second": 14.964,
+ "eval_accuracy": 0.871,
+ "eval_classification_report": " precision recall f1-score support\n0 0.913487 0.916502 0.914992 3030.000\n1 0.736458 0.728866 0.732642 970.000\naccuracy 0.871000 0.871000 0.871000 0.871\nmacro avg 0.824973 0.822684 0.823817 4000.000\nweighted avg 0.870557 0.871000 0.870772 4000.000",
+ "eval_confusion_matrix": "[[2777 253]\n [ 263 707]]",
+ "eval_confusion_matrix_norm": "[[0.91650165 0.08349835]\n [0.27113402 0.72886598]]",
+ "eval_f1": 0.732642487046632,
+ "eval_f1_macro": 0.8238171249071711,
+ "eval_f1_weighted": 0.8707720634053486,
+ "eval_loss": 0.3905148506164551,
+ "eval_runtime": 9.6371,
+ "eval_samples_per_second": 415.062,
+ "eval_steps_per_second": 25.941,
  "step": 2000
  },
  {
  "epoch": 2.5,
- "learning_rate": 3.3333333333333333e-06,
- "loss": 0.1625,
+ "learning_rate": 3.3533333333333336e-06,
+ "loss": 0.1517,
  "step": 2500
  },
  {
  "epoch": 3.0,
- "learning_rate": 0.0,
- "loss": 0.1566,
+ "learning_rate": 2e-08,
+ "loss": 0.1435,
  "step": 3000
  },
  {
  "epoch": 3.0,
- "eval_accuracy": 0.8775,
- "eval_classification_report": " precision recall f1-score support\n0 0.906791 0.934323 0.920351 3030.0000\n1 0.773349 0.700000 0.734848 970.0000\naccuracy 0.877500 0.877500 0.877500 0.8775\nmacro avg 0.840070 0.817162 0.827600 4000.0000\nweighted avg 0.874431 0.877500 0.875367 4000.0000",
- "eval_confusion_matrix": "[[2831 199]\n [ 291 679]]",
- "eval_confusion_matrix_norm": "[[0.93432343 0.06567657]\n [0.3 0.7 ]]",
- "eval_f1": 0.7348484848484848,
- "eval_f1_macro": 0.8275997950900422,
- "eval_f1_weighted": 0.8753667198644444,
- "eval_loss": 0.4632544219493866,
- "eval_runtime": 16.6967,
- "eval_samples_per_second": 239.568,
- "eval_steps_per_second": 14.973,
+ "eval_accuracy": 0.87925,
+ "eval_classification_report": " precision recall f1-score support\n0 0.908042 0.935314 0.921476 3030.00000\n1 0.777019 0.704124 0.738778 970.00000\naccuracy 0.879250 0.879250 0.879250 0.87925\nmacro avg 0.842531 0.819719 0.830127 4000.00000\nweighted avg 0.876269 0.879250 0.877172 4000.00000",
+ "eval_confusion_matrix": "[[2834 196]\n [ 287 683]]",
+ "eval_confusion_matrix_norm": "[[0.93531353 0.06468647]\n [0.29587629 0.70412371]]",
+ "eval_f1": 0.7387777176852353,
+ "eval_f1_macro": 0.8301269502098749,
+ "eval_f1_weighted": 0.8771718049600645,
+ "eval_loss": 0.4823172092437744,
+ "eval_runtime": 9.6475,
+ "eval_samples_per_second": 414.617,
+ "eval_steps_per_second": 25.914,
  "step": 3000
  },
  {
  "epoch": 3.0,
  "step": 3000,
  "total_flos": 1.262933065728e+16,
- "train_loss": 0.2591003138224284,
- "train_runtime": 651.1299,
- "train_samples_per_second": 73.718,
- "train_steps_per_second": 4.607
+ "train_loss": 0.2520793151855469,
+ "train_runtime": 430.4281,
+ "train_samples_per_second": 111.517,
+ "train_steps_per_second": 6.97
  }
  ],
  "max_steps": 3000,
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:23bdb167d6ed64d8a4fa58d65ab66a6eb55ed2e97a0cb401fb31212f13e2e697
- size 3643
+ oid sha256:d641973e448ee0f5cd30cee300ef688f8e2706b6569a9d4b8a510df14f066454
+ size 3579
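
training_args.bin is the serialized TrainingArguments object, which is why its size shifts (3643 → 3579 bytes) along with the configuration. The sketch below is a reconstruction consistent with the updated model card, not the repository's actual arguments: the linear schedule, 3 epochs, Adam defaults and Native AMP come from the card, while the learning rate, batch size and output directory are read off the model name (lr2e-5, bs16, fp16) and the per-epoch evaluation is inferred from the training-results table, so treat those as assumptions.

```python
# Hedged sketch of TrainingArguments matching the updated model card.
# Only values visible in (or inferred from) the card are set; everything else
# is left at the transformers defaults.
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="final-lr2e-5-bs16-fp16-2",  # assumed from the model name
    learning_rate=2e-5,                     # assumed from "lr2e-5" in the name
    per_device_train_batch_size=16,         # assumed from "bs16" in the name
    per_device_eval_batch_size=16,          # assumption
    num_train_epochs=3.0,
    lr_scheduler_type="linear",
    fp16=True,                              # "mixed_precision_training: Native AMP"
    evaluation_strategy="epoch",            # assumption: eval logs appear once per epoch
)
```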