End of training

Files changed (4) hide show

README.md CHANGED Viewed

@@ -18,10 +18,10 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [meta-llama/Llama-3.2-3B](https://huggingface.co/meta-llama/Llama-3.2-3B) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.3049
-- Model Preparation Time: 0.0047
-- Accuracy: 0.9102
-- F1 Macro: 0.9104
 ## Model description
@@ -56,10 +56,11 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Model Preparation Time | Accuracy | F1 Macro |
 |:-------------:|:-----:|:----:|:---------------:|:----------------------:|:--------:|:--------:|
-| 0.8579        | 1.0   | 270  | 0.7867          | 0.0047                 | 0.7167   | 0.7161   |
-| 0.2739        | 2.0   | 540  | 0.3057          | 0.0047                 | 0.9037   | 0.9036   |
-| 0.2183        | 3.0   | 810  | 0.2756          | 0.0047                 | 0.9111   | 0.9106   |
-| 0.1703        | 4.0   | 1080 | 0.2870          | 0.0047                 | 0.8972   | 0.8963   |
 ### Framework versions

 This model is a fine-tuned version of [meta-llama/Llama-3.2-3B](https://huggingface.co/meta-llama/Llama-3.2-3B) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.4414
+- Model Preparation Time: 0.0045
+- Accuracy: 0.8313
+- F1 Macro: 0.8354
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss | Model Preparation Time | Accuracy | F1 Macro |
 |:-------------:|:-----:|:----:|:---------------:|:----------------------:|:--------:|:--------:|
+| 0.9162        | 1.0   | 368  | 0.8379          | 0.0045                 | 0.6361   | 0.6334   |
+| 0.5201        | 2.0   | 736  | 0.5242          | 0.0045                 | 0.7782   | 0.7849   |
+| 0.3988        | 3.0   | 1104 | 0.4936          | 0.0045                 | 0.7993   | 0.8024   |
+| 0.3288        | 4.0   | 1472 | 0.4774          | 0.0045                 | 0.8007   | 0.8084   |
+| 0.3602        | 5.0   | 1840 | 0.4858          | 0.0045                 | 0.8034   | 0.8103   |
 ### Framework versions

adapter_config.json CHANGED Viewed

@@ -26,10 +26,10 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "q_proj",
-    "o_proj",
     "v_proj",
-    "k_proj"
   ],
   "task_type": "SEQ_CLS",
   "use_dora": false,

   "rank_pattern": {},
   "revision": null,
   "target_modules": [
+    "k_proj",
     "v_proj",
+    "q_proj",
+    "o_proj"
   ],
   "task_type": "SEQ_CLS",
   "use_dora": false,

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:cc84cc5602adf9457600966fa47be62790703dfe9530f3a2df2cb6706d5c9444
 size 36779480

 version https://git-lfs.github.com/spec/v1
+oid sha256:9b2aedb4a849cb0cdfe72064eef4e3fd9ac7cc5c4f2212eda363357d9a2d31e8
 size 36779480

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:752e7f8416a8d1fa5155aaff942ef919e03b9c84d61f885b2defd46334699fdb
 size 5304

 version https://git-lfs.github.com/spec/v1
+oid sha256:a9cb5b5055f7719f3fc7bcba3f495719c3c62406ed79bfa7817889c6af3dd0f2
 size 5304