ospanbatyr
/

llama-2-7b-chat-hf-ft-compact

Generated from Trainer

Model card Files Files and versions Community

ospanbatyr commited on Nov 18, 2023

Commit

017af1c

•

1 Parent(s): 9502879

End of training

Files changed (1) hide show

README.md +16 -0

README.md CHANGED Viewed

@@ -12,6 +12,8 @@ should probably proofread and complete it, then remove this comment. -->
 # llama-2-7b-chat-hf-ft-compact
 This model was trained from scratch on an unknown dataset.
 ## Model description
@@ -42,6 +44,20 @@ The following hyperparameters were used during training:
 - num_epochs: 3
 - mixed_precision_training: Native AMP
 ### Framework versions
 - Transformers 4.35.2

 # llama-2-7b-chat-hf-ft-compact
 This model was trained from scratch on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.8585
 ## Model description
 - num_epochs: 3
 - mixed_precision_training: Native AMP
+### Training results
+| Training Loss | Epoch | Step | Validation Loss |
+|:-------------:|:-----:|:----:|:---------------:|
+| 4.1287        | 0.36  | 25   | 2.8154          |
+| 1.5           | 0.73  | 50   | 1.3773          |
+| 1.1092        | 1.09  | 75   | 0.9817          |
+| 0.9247        | 1.45  | 100  | 0.9045          |
+| 0.8907        | 1.82  | 125  | 0.8791          |
+| 0.8572        | 2.18  | 150  | 0.8663          |
+| 0.8359        | 2.55  | 175  | 0.8608          |
+| 0.8156        | 2.91  | 200  | 0.8585          |
 ### Framework versions
 - Transformers 4.35.2