mjpsm committed on
Commit
ad1a448
verified
1 Parent(s): 2dab21a

End of training

README.md CHANGED
@@ -36,8 +36,8 @@ More information needed
 
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
-- train_batch_size: 16
-- eval_batch_size: 16
+- train_batch_size: 8
+- eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
@@ -47,9 +47,9 @@ The following hyperparameters were used during training:
 
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| No log | 1.0 | 2 | nan |
-| No log | 2.0 | 4 | nan |
-| No log | 3.0 | 6 | nan |
+| No log | 1.0 | 4 | nan |
+| No log | 2.0 | 8 | nan |
+| No log | 3.0 | 12 | nan |
 
 
 ### Framework versions
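
The Step column in the README diff tracks the batch-size change: halving the batch size from 16 to 8 doubles the steps per epoch from 2 to 4, so the cumulative counts over three epochs go from 2/4/6 to 4/8/12. The sketch below reproduces that arithmetic. It assumes a single device, no gradient accumulation, and that the final partial batch is kept; the commit states none of these, and the helper names are hypothetical.

```python
import math


def steps_per_epoch(num_examples: int, batch_size: int) -> int:
    """Optimizer steps in one epoch.

    Assumes one device, no gradient accumulation, and that the last
    partial batch is kept (assumptions, not stated in the commit).
    """
    return math.ceil(num_examples / batch_size)


def consistent_sizes(observations, max_size=1000):
    """Training-set sizes that reproduce every (batch_size, steps) pair."""
    return [
        n
        for n in range(1, max_size + 1)
        if all(steps_per_epoch(n, b) == s for b, s in observations)
    ]


# (batch_size, steps per epoch) before and after this commit
sizes = consistent_sizes([(16, 2), (8, 4)])
print(min(sizes), max(sizes))
```

Under those assumptions, both runs are consistent only with a training set of 25 to 32 examples.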
runs/Dec30_18-23-51_Mazamessos-MacBook-Air.local/events.out.tfevents.1735601035.Mazamessos-MacBook-Air.local.14512.0 CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:770ea70f5f232330e6beb3923acc9da8a53972f51aca4413dee1f1140a845e09
-size 5470
+oid sha256:b888234ac0b516d6a55522f6fced14f6d757aa1663c03e5977ca325db2d7bf5b
+size 6084
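
The TensorBoard event file is stored through Git LFS, so the diff shows only its pointer: a spec version line, the SHA-256 of the file contents, and the size in bytes. As a rough illustration of how those three lines relate to the underlying file, here is a minimal sketch; `lfs_pointer` is a hypothetical helper rather than part of any Git LFS tooling, and the sample bytes are made up.

```python
import hashlib


def lfs_pointer(data: bytes) -> str:
    """Build Git LFS pointer text for the given file contents.

    Hypothetical helper for illustration: the oid is the SHA-256 hex
    digest of the contents and the size is the length in bytes.
    """
    digest = hashlib.sha256(data).hexdigest()
    return (
        "version https://git-lfs.github.com/spec/v1\n"
        f"oid sha256:{digest}\n"
        f"size {len(data)}\n"
    )


# Made-up contents; the real event file is not available in this diff.
print(lfs_pointer(b"example event data"))
```

Any change to the event file, such as the extra logged steps in this commit, changes the `oid` and usually the `size` too, which is what the hunk above records.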