End of training
Browse files
README.md
CHANGED
@@ -36,8 +36,8 @@ More information needed
|
|
36 |
|
37 |
The following hyperparameters were used during training:
|
38 |
- learning_rate: 2e-05
|
39 |
-
- train_batch_size:
|
40 |
-
- eval_batch_size:
|
41 |
- seed: 42
|
42 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
43 |
- lr_scheduler_type: linear
|
@@ -47,9 +47,9 @@ The following hyperparameters were used during training:
|
|
47 |
|
48 |
| Training Loss | Epoch | Step | Validation Loss |
|
49 |
|:-------------:|:-----:|:----:|:---------------:|
|
50 |
-
| No log | 1.0 |
|
51 |
-
| No log | 2.0 |
|
52 |
-
| No log | 3.0 |
|
53 |
|
54 |
|
55 |
### Framework versions
|
|
|
36 |
|
37 |
The following hyperparameters were used during training:
|
38 |
- learning_rate: 2e-05
|
39 |
+
- train_batch_size: 8
|
40 |
+
- eval_batch_size: 8
|
41 |
- seed: 42
|
42 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
43 |
- lr_scheduler_type: linear
|
|
|
47 |
|
48 |
| Training Loss | Epoch | Step | Validation Loss |
|
49 |
|:-------------:|:-----:|:----:|:---------------:|
|
50 |
+
| No log | 1.0 | 4 | nan |
|
51 |
+
| No log | 2.0 | 8 | nan |
|
52 |
+
| No log | 3.0 | 12 | nan |
|
53 |
|
54 |
|
55 |
### Framework versions
|
runs/Dec30_18-23-51_Mazamessos-MacBook-Air.local/events.out.tfevents.1735601035.Mazamessos-MacBook-Air.local.14512.0
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:b888234ac0b516d6a55522f6fced14f6d757aa1663c03e5977ca325db2d7bf5b
|
3 |
+
size 6084
|