NanQiangHF committed
Commit 7bd5125
1 Parent(s): c878da3

NanQiangHF/llama3_8b_instruct_BWRM
README.md CHANGED
@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [meta-llama/Meta-Llama-3.1-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B-Instruct) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.3034
+- Loss: 0.2710
 
 ## Model description
 
@@ -35,7 +35,7 @@ More information needed
 ### Training hyperparameters
 
 The following hyperparameters were used during training:
-- learning_rate: 0.0003
+- learning_rate: 0.0005
 - train_batch_size: 32
 - eval_batch_size: 32
 - seed: 42
@@ -49,22 +49,22 @@ The following hyperparameters were used during training:
 
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
-| 0.7965 | 0.1840 | 20 | 0.5969 |
-| 0.6001 | 0.3680 | 40 | 0.5778 |
-| 0.5739 | 0.5520 | 60 | 0.5419 |
-| 0.5411 | 0.7361 | 80 | 0.4995 |
-| 0.5147 | 0.9201 | 100 | 0.4758 |
-| 0.45 | 1.1041 | 120 | 0.4148 |
-| 0.4145 | 1.2881 | 140 | 0.4171 |
-| 0.4011 | 1.4721 | 160 | 0.3753 |
-| 0.371 | 1.6561 | 180 | 0.4154 |
-| 0.3702 | 1.8401 | 200 | 0.3424 |
-| 0.3438 | 2.0242 | 220 | 0.3332 |
-| 0.3298 | 2.2082 | 240 | 0.3231 |
-| 0.3185 | 2.3922 | 260 | 0.3174 |
-| 0.3127 | 2.5762 | 280 | 0.3130 |
-| 0.3073 | 2.7602 | 300 | 0.3060 |
-| 0.3033 | 2.9442 | 320 | 0.3034 |
+| 0.8058 | 0.1840 | 20 | 0.5918 |
+| 0.5986 | 0.3680 | 40 | 0.5643 |
+| 0.5513 | 0.5520 | 60 | 0.5113 |
+| 0.5039 | 0.7361 | 80 | 0.4433 |
+| 0.4539 | 0.9201 | 100 | 0.4424 |
+| 0.4083 | 1.1041 | 120 | 0.4024 |
+| 0.3823 | 1.2881 | 140 | 0.3805 |
+| 0.3644 | 1.4721 | 160 | 0.3400 |
+| 0.336 | 1.6561 | 180 | 0.3206 |
+| 0.3314 | 1.8401 | 200 | 0.3185 |
+| 0.3105 | 2.0242 | 220 | 0.3078 |
+| 0.2929 | 2.2082 | 240 | 0.2948 |
+| 0.2855 | 2.3922 | 260 | 0.2831 |
+| 0.2787 | 2.5762 | 280 | 0.2821 |
+| 0.2717 | 2.7602 | 300 | 0.2757 |
+| 0.2703 | 2.9442 | 320 | 0.2710 |
 
 
 ### Framework versions
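As a quick sanity check on the README's training table: the epoch column advances by roughly 0.1840 every 20 logged steps with a train batch size of 32. Assuming no gradient accumulation (an assumption — the hyperparameter list does not mention it), this lets us back out the approximate number of training examples:

```python
# Back out steps-per-epoch and dataset size from the README's log table.
# Assumes effective batch size == train_batch_size (no gradient
# accumulation), which the hyperparameter list does not state explicitly.
train_batch_size = 32          # from the hyperparameter list
steps_per_log = 20             # "Step" column increment
epoch_per_log = 0.1840         # "Epoch" column increment

steps_per_epoch = steps_per_log / epoch_per_log
approx_examples = steps_per_epoch * train_batch_size
print(round(steps_per_epoch))   # ~109 optimizer steps per epoch
print(round(approx_examples))   # ~3478 training examples
```

The same arithmetic applies to both the old and new runs, since only the learning rate changed between them.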
adapter_config.json CHANGED
@@ -25,8 +25,8 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "v_proj",
-    "q_proj"
+    "q_proj",
+    "v_proj"
   ],
   "task_type": null,
   "use_dora": false,
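Note that this adapter_config.json change only reorders `target_modules`. PEFT matches LoRA target modules by name, irrespective of list order, so both revisions inject adapters into the same `q_proj` and `v_proj` layers; the edit is cosmetic. A one-line check of that equivalence (pure Python, no PEFT required):

```python
# The diff swaps the order of target_modules; as an unordered selection
# of module names, the before and after configs are identical.
before = ["v_proj", "q_proj"]  # old adapter_config.json
after = ["q_proj", "v_proj"]   # new adapter_config.json
print(set(before) == set(after))  # True
```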
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d944ffb20ad72738fe6fb132bcbeb89f2037e49569828d98a760aa9c4faa1f40
+oid sha256:a723410b8b5283b2ca0dfd2497b87c4c0bbaa9f452f6faa5a6b01de93dcf2c64
 size 6849208
training_args.bin CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:7d5ac3b902f27c7b4b9913bc96bbb157a2688a50a705bb834dd6179b666b8445
-size 5304
+oid sha256:56e6500f2b35510d5ad2fde91222b70fdc3ba67e3b05a808223e14ee7b01385f
+size 5304