catallama
/

CataLlama-v0.1-Instruct-SFT

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

laurentiubp commited on May 26

Commit

352b05a

•

1 Parent(s): 05fc736

Update README.md

Files changed (1) hide show

README.md +3 -1

README.md CHANGED Viewed

@@ -32,7 +32,9 @@ The model shows improved proficiency with the Catalan language.
 The model achieves a loss rate of 0.8528 on the validation dataset after two epochs.
-**NOTE:** The model was trained for one epoch, then the `train` split of dataset was shuffled and the model was trained for another epoch
 **Model developers** [Laurentiu Petrea](https://www.linkedin.com/in/laurentiupetrea/) based on Llama-3 from Meta.

 The model achieves a loss rate of 0.8528 on the validation dataset after two epochs.
+**NOTE:** The model was trained for one epoch on the `train` split of dataset and after manual evaluation, I decided to go for another epoch.
+The first epoch logs every 100 steps while the second epoch logs every 200 steps, but I am pasting the train and eval losses for both epochs bellow.
+*The `train` split of the dataset was shuffled before the second epoch. The `test` split dataset is identical in both epochs without shuffling*
 **Model developers** [Laurentiu Petrea](https://www.linkedin.com/in/laurentiupetrea/) based on Llama-3 from Meta.