Update README.md
README.md CHANGED

@@ -30,10 +30,10 @@ Breaking it down further, each epoch took only 5.8 hours and cost a mere `$19.25
 - Total finetuning Cost: $57.75
 - Model Path: meta-llama/Llama-2-70b-hf
 - Dataset: databricks/databricks-dolly-15k
-- Learning rate:
+- Learning rate: 0.0002
 - Number of epochs: 3
-- Data split:
-- Gradient accumulation steps:
+- Data split: Training 90% / Validation 10%
+- Gradient accumulation steps: 4
 
 license: apache-2.0
 ---
@@ -52,3 +52,9 @@ Prompt Used:
 [response]
 ```
 
+Loss metrics
+
+Training loss (Blue) Validation Loss (orange):
+![training loss](train-loss.png "Training loss")
+
+
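The numbers this diff fills in can be sketched in plain Python to show what they imply in practice. This is a minimal sketch, not the repo's actual training code: the micro-batch size of 8 and the shuffle seed are illustrative assumptions (the diff specifies only the split, epochs, cost, and accumulation steps), while the 15,011-record count is the published size of databricks/databricks-dolly-15k.

```python
import random

# databricks-dolly-15k has 15,011 records; the diff states a
# 90% training / 10% validation split.
N_EXAMPLES = 15_011
TRAIN_FRAC = 0.90

indices = list(range(N_EXAMPLES))
random.Random(42).shuffle(indices)       # seed is an illustrative assumption
cut = int(N_EXAMPLES * TRAIN_FRAC)
train_idx, val_idx = indices[:cut], indices[cut:]

# Gradient accumulation steps: 4 (from the diff). The micro-batch
# size of 8 is a hypothetical value for illustration only.
micro_batch = 8
accum_steps = 4
effective_batch = micro_batch * accum_steps  # optimizer updates on 32 examples

# Total finetuning Cost: $57.75 over 3 epochs, matching the
# per-epoch figure quoted in the hunk header.
total_cost = 57.75
epochs = 3
cost_per_epoch = total_cost / epochs         # $19.25 per epoch
```

With gradient accumulation, each optimizer step aggregates gradients from `accum_steps` micro-batches before updating weights, so the effective batch size is the product of the two, without the memory cost of a single large batch.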