End of training

Files changed (2) hide show

README.md CHANGED Viewed

@@ -36,8 +36,8 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
-- train_batch_size: 16
-- eval_batch_size: 16
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
@@ -47,9 +47,9 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| No log        | 1.0   | 2    | nan             |
-| No log        | 2.0   | 4    | nan             |
-| No log        | 3.0   | 6    | nan             |
 ### Framework versions

 The following hyperparameters were used during training:
 - learning_rate: 2e-05
+- train_batch_size: 8
+- eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| No log        | 1.0   | 4    | nan             |
+| No log        | 2.0   | 8    | nan             |
+| No log        | 3.0   | 12   | nan             |
 ### Framework versions

runs/Dec30_18-23-51_Mazamessos-MacBook-Air.local/events.out.tfevents.1735601035.Mazamessos-MacBook-Air.local.14512.0 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:770ea70f5f232330e6beb3923acc9da8a53972f51aca4413dee1f1140a845e09
-size 5470

 version https://git-lfs.github.com/spec/v1
+oid sha256:b888234ac0b516d6a55522f6fced14f6d757aa1663c03e5977ca325db2d7bf5b
+size 6084