NanQiangHF committed on
Commit
60bc38b
1 Parent(s): 3b4000d

llama3.1_8b_bwgenerator

README.md CHANGED
@@ -18,7 +18,7 @@ should probably proofread and complete it, then remove this comment. -->

  This model is a fine-tuned version of [meta-llama/Meta-Llama-3.1-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B-Instruct) on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.1528
+ - Loss: 0.1153

  ## Model description

@@ -51,14 +51,12 @@ The following hyperparameters were used during training:

  | Training Loss | Epoch | Step | Validation Loss |
  |:-------------:|:------:|:----:|:---------------:|
- | 1.2317 | 0.1246 | 20 | 0.3990 |
- | 0.3324 | 0.2492 | 40 | 0.2884 |
- | 0.2665 | 0.3738 | 60 | 0.2478 |
- | 0.2326 | 0.4983 | 80 | 0.2174 |
- | 0.2072 | 0.6229 | 100 | 0.1941 |
- | 0.1818 | 0.7475 | 120 | 0.1687 |
- | 0.1631 | 0.8721 | 140 | 0.1555 |
- | 0.1543 | 0.9967 | 160 | 0.1528 |
+ | 0.7197 | 0.1536 | 40 | 0.2706 |
+ | 0.233 | 0.3071 | 80 | 0.2020 |
+ | 0.1719 | 0.4607 | 120 | 0.1414 |
+ | 0.1317 | 0.6142 | 160 | 0.1239 |
+ | 0.1209 | 0.7678 | 200 | 0.1179 |
+ | 0.1168 | 0.9213 | 240 | 0.1153 |


  ### Framework versions
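The two results tables in the README hunk can be compared with a few lines of Python. This is a minimal sketch: the rows are transcribed from the diff, the helper names (`final_validation_loss`, `steps_per_epoch`, `relative_improvement`) are hypothetical, and the steps-per-epoch figure is only an estimate derived from the logged Epoch and Step columns.

```python
# Rows transcribed from the two results tables in the README diff:
# (training_loss, epoch, step, validation_loss)
OLD_RUN = [
    (1.2317, 0.1246, 20, 0.3990),
    (0.3324, 0.2492, 40, 0.2884),
    (0.2665, 0.3738, 60, 0.2478),
    (0.2326, 0.4983, 80, 0.2174),
    (0.2072, 0.6229, 100, 0.1941),
    (0.1818, 0.7475, 120, 0.1687),
    (0.1631, 0.8721, 140, 0.1555),
    (0.1543, 0.9967, 160, 0.1528),
]
NEW_RUN = [
    (0.7197, 0.1536, 40, 0.2706),
    (0.233, 0.3071, 80, 0.2020),
    (0.1719, 0.4607, 120, 0.1414),
    (0.1317, 0.6142, 160, 0.1239),
    (0.1209, 0.7678, 200, 0.1179),
    (0.1168, 0.9213, 240, 0.1153),
]


def final_validation_loss(rows):
    """Return the validation loss of the last logged evaluation."""
    return rows[-1][3]


def steps_per_epoch(rows):
    """Estimate optimizer steps per epoch as step / epoch of the last row."""
    _, epoch, step, _ = rows[-1]
    return step / epoch


def relative_improvement(old, new):
    """Fractional reduction in loss going from old to new."""
    return (old - new) / old


if __name__ == "__main__":
    old_loss = final_validation_loss(OLD_RUN)
    new_loss = final_validation_loss(NEW_RUN)
    print(f"final validation loss: {old_loss:.4f} -> {new_loss:.4f}")
    print(f"relative improvement: {relative_improvement(old_loss, new_loss):.1%}")
    print(
        "estimated steps per epoch: "
        f"{steps_per_epoch(OLD_RUN):.1f} -> {steps_per_epoch(NEW_RUN):.1f}"
    )
```

The estimated steps per epoch differ between the two runs, which suggests a change in batch size or dataset size. The diff does not say which, so the two loss values are directly comparable only if the evaluation set was unchanged.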
adapter_config.json CHANGED
@@ -1,7 +1,7 @@
  {
  "alpha_pattern": {},
  "auto_mapping": null,
- "base_model_name_or_path": null,
+ "base_model_name_or_path": "meta-llama/Meta-Llama-3.1-8B-Instruct",
  "bias": "none",
  "fan_in_fan_out": false,
  "inference_mode": true,
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:12ac22bba7490e37ed9784769714d9b6253b1e5b38f9536df4e96909f089dbab
- size 6834904
+ oid sha256:d19fd8b275141c2cd72d0fc45b7597ea6fe2c46506fdea60abe2a819a92d9a80
+ size 6832728
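The adapter weights are only usable on top of a base model, and the `adapter_config.json` change in this commit records which one. As far as I know, PEFT loaders read `base_model_name_or_path` to locate the base model, so a `null` value forces the caller to supply it by hand. The sketch below illustrates that check with the standard library only. `resolve_base_model` is a hypothetical helper, not part of PEFT, and the JSON strings contain only the fields visible in the diff hunk.

```python
import json

# Only the fields visible in the adapter_config.json hunk; the real file has more.
OLD_CONFIG = """{
  "alpha_pattern": {},
  "auto_mapping": null,
  "base_model_name_or_path": null,
  "bias": "none",
  "fan_in_fan_out": false,
  "inference_mode": true
}"""
NEW_CONFIG = OLD_CONFIG.replace(
    '"base_model_name_or_path": null',
    '"base_model_name_or_path": "meta-llama/Meta-Llama-3.1-8B-Instruct"',
)


def resolve_base_model(config_text, fallback=None):
    """Return the base model named in an adapter config, or a caller-supplied fallback.

    Hypothetical helper: raises ValueError when neither source names a model.
    """
    config = json.loads(config_text)
    base = config.get("base_model_name_or_path") or fallback
    if base is None:
        raise ValueError("adapter config does not name a base model")
    return base


if __name__ == "__main__":
    print(resolve_base_model(NEW_CONFIG))
    try:
        resolve_base_model(OLD_CONFIG)
    except ValueError as err:
        print(f"old config: {err}")
```

With the old config, a caller would have to pass the base model explicitly (the `fallback` argument here); with the new config the adapter is self-describing.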
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:b17beb214a9143f2befed0a63928bdeddafa1fe89cacf297ce36eb0396eee0a8
- size 5560
+ oid sha256:90bc6e73feded14695ff73155bb67b2938f7678a71a8c2897cd30f395075122e
+ size 5496
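Both `adapter_model.safetensors` and `training_args.bin` are stored as Git LFS pointer files, so the diff shows only a hash and a byte size rather than the binary contents. A minimal sketch of reading those pointers follows. `parse_lfs_pointer` is a hypothetical helper, and it assumes the simple `key value` line layout seen in the hunks above.

```python
def parse_lfs_pointer(text):
    """Parse a Git LFS pointer file into its version, hash algorithm, digest and size."""
    fields = dict(line.strip().split(" ", 1) for line in text.strip().splitlines())
    algorithm, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "hash_algorithm": algorithm,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Pointer contents transcribed from the adapter_model.safetensors hunk.
OLD_ADAPTER = """version https://git-lfs.github.com/spec/v1
oid sha256:12ac22bba7490e37ed9784769714d9b6253b1e5b38f9536df4e96909f089dbab
size 6834904
"""
NEW_ADAPTER = """version https://git-lfs.github.com/spec/v1
oid sha256:d19fd8b275141c2cd72d0fc45b7597ea6fe2c46506fdea60abe2a819a92d9a80
size 6832728
"""

if __name__ == "__main__":
    old = parse_lfs_pointer(OLD_ADAPTER)
    new = parse_lfs_pointer(NEW_ADAPTER)
    # A changed digest means the stored object itself changed.
    print("content changed:", old["digest"] != new["digest"])
    print("size delta in bytes:", new["size"] - old["size"])
```

The same function applies to the `training_args.bin` pointers, which differ only in their digest and size values.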