llama3.1_8b_bwgenerator

Files changed (4) hide show

README.md CHANGED Viewed

@@ -18,7 +18,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [meta-llama/Meta-Llama-3.1-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B-Instruct) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.0982
 ## Model description
@@ -51,14 +51,13 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
-| 0.7155        | 0.1216 | 40   | 0.2546          |
-| 0.218         | 0.2433 | 80   | 0.1804          |
-| 0.1513        | 0.3649 | 120  | 0.1246          |
-| 0.1193        | 0.4865 | 160  | 0.1116          |
-| 0.1092        | 0.6081 | 200  | 0.1051          |
-| 0.1046        | 0.7298 | 240  | 0.1012          |
-| 0.1017        | 0.8514 | 280  | 0.0993          |
-| 0.0999        | 0.9730 | 320  | 0.0982          |
 ### Framework versions

 This model is a fine-tuned version of [meta-llama/Meta-Llama-3.1-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B-Instruct) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.1084
 ## Model description
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
+| 0.7243        | 0.1373 | 40   | 0.2700          |
+| 0.2306        | 0.2746 | 80   | 0.2057          |
+| 0.1693        | 0.4120 | 120  | 0.1377          |
+| 0.1284        | 0.5493 | 160  | 0.1213          |
+| 0.1176        | 0.6866 | 200  | 0.1148          |
+| 0.1127        | 0.8239 | 240  | 0.1100          |
+| 0.1098        | 0.9613 | 280  | 0.1084          |
 ### Framework versions

adapter_config.json CHANGED Viewed

@@ -22,8 +22,8 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "q_proj",
-    "v_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

   "rank_pattern": {},
   "revision": null,
   "target_modules": [
+    "v_proj",
+    "q_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:1de99ba3c69896469e24e31d640496d977ca9154e6e8ede2c9d8c58ee1c49a20
 size 6832728

 version https://git-lfs.github.com/spec/v1
+oid sha256:987586eaa7ace46f9b524d80439f7d9ac7b82c3ac62fd9de5c0a10543badf75e
 size 6832728

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d289afc35be50448ca012c8270b76abc4f7753d1d7bd83a50c3267c0533c498d
 size 5496

 version https://git-lfs.github.com/spec/v1
+oid sha256:70e45cc6a309dc82431d6beb2afc7335d77e080beeff9b6107b8b430968f6b24
 size 5496