End of training

Files changed (5) hide show

README.md CHANGED Viewed

@@ -105,7 +105,7 @@ xformers_attention: true
 This model is a fine-tuned version of [VAGOsolutions/Llama-3.1-SauerkrautLM-8b-Instruct](https://huggingface.co/VAGOsolutions/Llama-3.1-SauerkrautLM-8b-Instruct) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.6540
 ## Model description
@@ -137,7 +137,7 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
-| 0.4057        | 0.0009 | 10   | 0.6540          |
 ### Framework versions

 This model is a fine-tuned version of [VAGOsolutions/Llama-3.1-SauerkrautLM-8b-Instruct](https://huggingface.co/VAGOsolutions/Llama-3.1-SauerkrautLM-8b-Instruct) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.6444
 ## Model description
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
+| 0.49          | 0.0009 | 10   | 0.6444          |
 ### Framework versions

adapter_config.json CHANGED Viewed

@@ -20,13 +20,13 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "q_proj",
     "gate_proj",
-    "k_proj",
-    "o_proj",
     "down_proj",
     "v_proj",
-    "up_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

   "rank_pattern": {},
   "revision": null,
   "target_modules": [
     "gate_proj",
     "down_proj",
+    "up_proj",
+    "o_proj",
+    "k_proj",
     "v_proj",
+    "q_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

adapter_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:9d4d9f47c72103b77b08b72a2f2245620cd9ab4940370add92323bbcf81d568b
 size 167934026

 version https://git-lfs.github.com/spec/v1
+oid sha256:91449ae67b8632099d2b659d98eda2a02bb34154b01e0f786c080457f65e942a
 size 167934026

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:bbd064c624155f4cb578cb0a690bba8bdfcccbfedc897f77e08dfbed72993b0c
 size 167832240

 version https://git-lfs.github.com/spec/v1
+oid sha256:667caf91752ffe9db39f5b00742e477b4ce371b95d0e04943747f0695278f49d
 size 167832240

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:ecf9758ae99633e58b0d4a82d74cbc826c9b0dd8e8befb80c4f276ba231ad3b0
 size 6776

 version https://git-lfs.github.com/spec/v1
+oid sha256:f3f0f9fa537a4e1865c58c91bdacc639be08d5aede552f6054ce9954bbd6793c
 size 6776