llama3.1_8b_bwgenerator

Files changed (3) hide show

README.md CHANGED Viewed

@@ -18,7 +18,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [meta-llama/Meta-Llama-3.1-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B-Instruct) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.1193
 ## Model description
@@ -51,19 +51,22 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
-| 1.2338        | 0.1456 | 20   | 0.3871          |
-| 0.3226        | 0.2911 | 40   | 0.2794          |
-| 0.2612        | 0.4367 | 60   | 0.2409          |
-| 0.2238        | 0.5822 | 80   | 0.2055          |
-| 0.1848        | 0.7278 | 100  | 0.1625          |
-| 0.1505        | 0.8733 | 120  | 0.1424          |
-| 0.1382        | 1.0189 | 140  | 0.1347          |
-| 0.1319        | 1.1644 | 160  | 0.1311          |
-| 0.1281        | 1.3100 | 180  | 0.1265          |
-| 0.1248        | 1.4555 | 200  | 0.1237          |
-| 0.1228        | 1.6011 | 220  | 0.1221          |
-| 0.1205        | 1.7466 | 240  | 0.1200          |
-| 0.1201        | 1.8922 | 260  | 0.1193          |
 ### Framework versions

 This model is a fine-tuned version of [meta-llama/Meta-Llama-3.1-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B-Instruct) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.1141
 ## Model description
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
+| 1.1896        | 0.1214 | 20   | 0.3793          |
+| 0.3219        | 0.2427 | 40   | 0.2798          |
+| 0.2583        | 0.3641 | 60   | 0.2367          |
+| 0.2204        | 0.4854 | 80   | 0.2002          |
+| 0.1785        | 0.6068 | 100  | 0.1566          |
+| 0.1488        | 0.7281 | 120  | 0.1404          |
+| 0.1391        | 0.8495 | 140  | 0.1348          |
+| 0.1332        | 0.9708 | 160  | 0.1310          |
+| 0.1281        | 1.0922 | 180  | 0.1254          |
+| 0.1246        | 1.2135 | 200  | 0.1229          |
+| 0.1229        | 1.3349 | 220  | 0.1200          |
+| 0.1202        | 1.4562 | 240  | 0.1179          |
+| 0.1185        | 1.5776 | 260  | 0.1164          |
+| 0.1166        | 1.6989 | 280  | 0.1154          |
+| 0.1165        | 1.8203 | 300  | 0.1143          |
+| 0.1155        | 1.9416 | 320  | 0.1141          |
 ### Framework versions

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4f69449384f7f9777816d641acffe5fc1721bd9cb8f35df65a278fd43fe70fa8
 size 6832728

 version https://git-lfs.github.com/spec/v1
+oid sha256:dc762beff544d34c8dd52c39eac6066cd7438f49e982c739329380881dc31607
 size 6832728

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:b369b6ad9bb7b80e5b5359a2eda8a6264f5ef7562c67f966d3aa7831507cce5c
 size 5560

 version https://git-lfs.github.com/spec/v1
+oid sha256:8e944f2220951ec046e769adf16573a894f011721ea6967d90e49a82f20f1b3a
 size 5560