deepakkoli93 committed on
Commit
5c0220a
1 Parent(s): 912a9a3

Model save

README.md CHANGED
@@ -18,7 +18,7 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [princeton-nlp/Sheared-LLaMA-1.3B](https://huggingface.co/princeton-nlp/Sheared-LLaMA-1.3B) on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Loss: nan
+ - Loss: 3.8278
 
  ## Model description
 
@@ -37,7 +37,7 @@ More information needed
  ### Training hyperparameters
 
  The following hyperparameters were used during training:
- - learning_rate: 0.2
+ - learning_rate: 0.01
  - train_batch_size: 4
  - eval_batch_size: 8
  - seed: 42
@@ -46,17 +46,14 @@ The following hyperparameters were used during training:
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: cosine
  - lr_scheduler_warmup_ratio: 0.05
- - num_epochs: 4
- - mixed_precision_training: Native AMP
+ - num_epochs: 2
 
  ### Training results
 
  | Training Loss | Epoch | Step | Validation Loss |
  |:-------------:|:-----:|:----:|:---------------:|
- | 0.0 | 1.0 | 778 | nan |
- | 0.0 | 2.0 | 1557 | nan |
- | 0.0 | 3.0 | 2336 | nan |
- | 0.0 | 4.0 | 3112 | nan |
+ | 4.0872 | 1.0 | 778 | 4.0301 |
+ | 4.0179 | 2.0 | 1556 | 3.8278 |
 
 
  ### Framework versions
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:41fa8f41c892282fcbaa8ea776e07347713eb8378d1065ee86d888d7dfb057fd
+ oid sha256:6e8fd5691c34e0ed9899ecbf3cf542c43775cdf3472addaeae1d520526e0694b
  size 12595704
runs/Mar17_03-30-27_49ba99224e28/events.out.tfevents.1710646302.49ba99224e28.266.0 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:969930c79d13b3fbb7752a1f01b1b49babc8381d25e7e585d6260ee635d9a43d
- size 168598
+ oid sha256:ab4ed1990dda6327080cacd13142ffb2e99890da47815f1b5e906494bd83f879
+ size 333381
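
Note on the README change: the updated card lists `learning_rate: 0.01`, `lr_scheduler_type: cosine`, `lr_scheduler_warmup_ratio: 0.05`, and a final step count of 1556. The snippet below is a minimal sketch of what those settings imply for the per-step learning rate, assuming the common "linear warmup, then half-cosine decay to zero" formula. The function name `lr_at` and the rounding of the warmup length with `math.ceil` are assumptions made for illustration, and the training code that produced this commit may differ in detail.

```python
import math


def lr_at(step, base_lr=0.01, total_steps=1556, warmup_ratio=0.05):
    """Learning rate at a given optimizer step (hypothetical helper).

    Defaults mirror the values listed in the updated README; the schedule
    shape itself is an assumed linear-warmup + cosine-decay formula.
    """
    # Assumed: warmup length is the ratio of total steps, rounded up.
    warmup_steps = math.ceil(total_steps * warmup_ratio)
    if step < warmup_steps:
        # Linear ramp from 0 up to base_lr.
        return base_lr * step / max(1, warmup_steps)
    # Fraction of the post-warmup phase that has elapsed, in [0, 1].
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    # Half-cosine decay from base_lr down to 0.
    return base_lr * 0.5 * (1.0 + math.cos(math.pi * progress))


if __name__ == "__main__":
    # Steps 778 and 1556 are the two evaluation points in the results table.
    for s in (0, 39, 78, 778, 1556):
        print(s, lr_at(s))
```

Under these assumptions the rate peaks at the base value once warmup ends and decays to zero by the last step, so the value logged in the card is the peak rather than the rate in effect at each evaluation.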