sedrickkeh committed on
Commit e030fa4 (1 parent: ec76b53)

Model save

README.md CHANGED
@@ -1,7 +1,7 @@
  ---
  library_name: transformers
- license: llama3
- base_model: meta-llama/Meta-Llama-3-8B
+ license: llama3.1
+ base_model: meta-llama/Llama-3.1-8B
  tags:
  - llama-factory
  - generated_from_trainer
@@ -15,9 +15,9 @@ should probably proofread and complete it, then remove this comment. -->
 
  # OH_DCFT_V3_wo_sharegpt
 
- This model is a fine-tuned version of [meta-llama/Meta-Llama-3-8B](https://huggingface.co/meta-llama/Meta-Llama-3-8B) on an unknown dataset.
+ This model is a fine-tuned version of [meta-llama/Llama-3.1-8B](https://huggingface.co/meta-llama/Llama-3.1-8B) on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.6457
+ - Loss: 0.6425
 
  ## Model description
 
@@ -55,9 +55,9 @@ The following hyperparameters were used during training:
 
  | Training Loss | Epoch | Step | Validation Loss |
  |:-------------:|:-----:|:----:|:---------------:|
- | 0.6491 | 1.0 | 422 | 0.6519 |
- | 0.608 | 2.0 | 844 | 0.6422 |
- | 0.5724 | 3.0 | 1266 | 0.6457 |
+ | 0.6485 | 1.0 | 422 | 0.6513 |
+ | 0.6114 | 2.0 | 844 | 0.6410 |
+ | 0.5796 | 3.0 | 1266 | 0.6425 |
 
 
  ### Framework versions
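Review note on the README.md change: in both the old and the new results table, the lowest validation loss is at epoch 2, while the headline "Loss" value matches the final epoch 3 row. The sketch below is only an illustration built from the numbers in this diff; the helper names (`best_epoch`, `final_epoch`, `val_loss_deltas`) are hypothetical and not part of the training code.

```python
# Rows copied from the README.md diff: (training_loss, epoch, step, validation_loss)
old_rows = [
    (0.6491, 1.0, 422, 0.6519),
    (0.608, 2.0, 844, 0.6422),
    (0.5724, 3.0, 1266, 0.6457),
]
new_rows = [
    (0.6485, 1.0, 422, 0.6513),
    (0.6114, 2.0, 844, 0.6410),
    (0.5796, 3.0, 1266, 0.6425),
]


def best_epoch(rows):
    """Return the row with the lowest validation loss."""
    return min(rows, key=lambda row: row[3])


def final_epoch(rows):
    """Return the row for the last epoch, which the model card reports as 'Loss'."""
    return max(rows, key=lambda row: row[1])


def val_loss_deltas(old, new):
    """Per-epoch change in validation loss (new minus old), rounded to 4 places."""
    return [round(n[3] - o[3], 4) for o, n in zip(old, new)]


if __name__ == "__main__":
    print("best epoch (new):", best_epoch(new_rows))
    print("final epoch (new):", final_epoch(new_rows))
    print("validation loss deltas:", val_loss_deltas(old_rows, new_rows))
```

Whether the saved weights correspond to the final epoch or to the best checkpoint is not stated in this commit, so the comparison above should be read as a question for the author and not as a finding.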
generation_config.json CHANGED
@@ -1,8 +1,8 @@
  {
+ "_from_model_config": true,
  "bos_token_id": 128000,
  "do_sample": true,
  "eos_token_id": 128001,
- "max_length": 4096,
  "temperature": 0.6,
  "top_p": 0.9,
  "transformers_version": "4.45.2"
model-00001-of-00004.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:7422c53f72bc75a6cbe5f2d0dff64d58a8dea0eb75b63dc80670e8cb4db08d6a
+ oid sha256:d45c05e7230cb37adeb639c93228d1e3a3e75ed30ff0ea32ec6d982d178f5ade
  size 4976698672
model-00002-of-00004.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:05ec7bf46e9c8a50fe79654d2ff2b31d85b2c6ae7dcf52b4673856dd99efb477
+ oid sha256:850a1b1872ab026fc6bb1f6d7b407f90e83cac181b3d8d60dbc91960d2bd6f76
  size 4999802720
model-00003-of-00004.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:d831975ab147cad4b6431f24726646c6a1386a6babd55a7497e7ab0f5591dc75
+ oid sha256:1b68f4588d702f11415f3fbdb61f302147a5e905fbe5908a244208a04140f0dd
  size 4915916176
model-00004-of-00004.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:095a5edd246c1a4f45d2a9a17a4d1102172a4ccf17919fca7b2d3efaedb3adff
+ oid sha256:a0706e522ad3ac4bf289c98ec2040f65fa2ce3f6d29c850a690183045cee2720
  size 1168138808
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:ce4573fa1af923688fa94c4857545674f3c7bca3ebdbe66e936d1a43e12cb314
+ oid sha256:ba233a36e3b0c621def697e2ccc35525403506f0b99318122ccbd1598aa64b54
  size 7160
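Review note on the binary files: every `.safetensors` shard and `training_args.bin` changes only its `oid sha256:` line while `size` stays the same, so the new weights have the same byte length but different contents. A minimal sketch of how a downloaded file could be checked against one of these Git LFS pointers is shown below. The function names (`parse_lfs_pointer`, `matches_pointer`, `file_matches_pointer`) are hypothetical helpers, not part of Git LFS or the Hub tooling; the pointer text is the new `training_args.bin` pointer from this commit.

```python
import hashlib

# New-side pointer for training_args.bin, copied from the diff above.
POINTER_TEXT = """\
version https://git-lfs.github.com/spec/v1
oid sha256:ba233a36e3b0c621def697e2ccc35525403506f0b99318122ccbd1598aa64b54
size 7160
"""


def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(pointer, data):
    """Check in-memory bytes against the pointer's size and digest."""
    if len(data) != pointer["size"]:
        return False
    return hashlib.new(pointer["algo"], data).hexdigest() == pointer["digest"]


def file_matches_pointer(pointer, path, chunk_size=1 << 20):
    """Same check for a file on disk, hashed in chunks to bound memory use."""
    hasher = hashlib.new(pointer["algo"])
    total = 0
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
            total += len(chunk)
    return total == pointer["size"] and hasher.hexdigest() == pointer["digest"]


if __name__ == "__main__":
    print(parse_lfs_pointer(POINTER_TEXT))
```

The chunked variant is the one to use for the multi-gigabyte shards; the in-memory variant is only convenient for small files such as `training_args.bin`.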