makhataei commited on
Commit
d3d1bdc
verified
1 Parent(s): 257cc1b

End of training

Browse files
README.md CHANGED
@@ -36,68 +36,28 @@ More information needed
36
  ### Training hyperparameters
37
 
38
  The following hyperparameters were used during training:
39
- - learning_rate: 3.125e-05
40
  - train_batch_size: 14
41
  - eval_batch_size: 14
42
  - seed: 42
43
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
44
  - lr_scheduler_type: linear
45
- - num_epochs: 50
46
 
47
  ### Training results
48
 
49
  | Training Loss | Epoch | Step | Validation Loss |
50
  |:-------------:|:-----:|:----:|:---------------:|
51
- | 4.8087 | 1.0 | 86 | 4.7698 |
52
- | 4.8168 | 2.0 | 172 | 4.7698 |
53
- | 4.8165 | 3.0 | 258 | 4.7698 |
54
- | 4.8191 | 4.0 | 344 | 4.7698 |
55
- | 4.819 | 5.0 | 430 | 4.7698 |
56
- | 4.8202 | 6.0 | 516 | 4.7698 |
57
- | 4.8197 | 7.0 | 602 | 4.7698 |
58
- | 4.8206 | 8.0 | 688 | 4.7698 |
59
- | 4.8115 | 9.0 | 774 | 4.7698 |
60
- | 4.8102 | 10.0 | 860 | 4.7698 |
61
- | 4.8133 | 11.0 | 946 | 4.7698 |
62
- | 4.8152 | 12.0 | 1032 | 4.7698 |
63
- | 4.8171 | 13.0 | 1118 | 4.7698 |
64
- | 4.8167 | 14.0 | 1204 | 4.7698 |
65
- | 4.8204 | 15.0 | 1290 | 4.7698 |
66
- | 4.8176 | 16.0 | 1376 | 4.7698 |
67
- | 4.8178 | 17.0 | 1462 | 4.7698 |
68
- | 4.819 | 18.0 | 1548 | 4.7698 |
69
- | 4.8212 | 19.0 | 1634 | 4.7698 |
70
- | 4.8204 | 20.0 | 1720 | 4.7698 |
71
- | 4.8235 | 21.0 | 1806 | 4.7698 |
72
- | 4.8184 | 22.0 | 1892 | 4.7698 |
73
- | 4.8246 | 23.0 | 1978 | 4.7698 |
74
- | 4.821 | 24.0 | 2064 | 4.7698 |
75
- | 4.8208 | 25.0 | 2150 | 4.7698 |
76
- | 4.8258 | 26.0 | 2236 | 4.7698 |
77
- | 4.8195 | 27.0 | 2322 | 4.7698 |
78
- | 4.8246 | 28.0 | 2408 | 4.7698 |
79
- | 4.8278 | 29.0 | 2494 | 4.7698 |
80
- | 4.828 | 30.0 | 2580 | 4.7698 |
81
- | 4.8249 | 31.0 | 2666 | 4.7698 |
82
- | 4.8275 | 32.0 | 2752 | 4.7698 |
83
- | 4.822 | 33.0 | 2838 | 4.7698 |
84
- | 4.8278 | 34.0 | 2924 | 4.7698 |
85
- | 4.8259 | 35.0 | 3010 | 4.7698 |
86
- | 4.8309 | 36.0 | 3096 | 4.7698 |
87
- | 4.8238 | 37.0 | 3182 | 4.7698 |
88
- | 4.8276 | 38.0 | 3268 | 4.7698 |
89
- | 4.8247 | 39.0 | 3354 | 4.7698 |
90
- | 4.8295 | 40.0 | 3440 | 4.7698 |
91
- | 4.8347 | 41.0 | 3526 | 4.7698 |
92
- | 4.8286 | 42.0 | 3612 | 4.7698 |
93
- | 4.8256 | 43.0 | 3698 | 4.7698 |
94
- | 4.8322 | 44.0 | 3784 | 4.7698 |
95
- | 4.8316 | 45.0 | 3870 | 4.7698 |
96
- | 4.8269 | 46.0 | 3956 | 4.7698 |
97
- | 4.8365 | 47.0 | 4042 | 4.7698 |
98
- | 4.8359 | 48.0 | 4128 | 4.7698 |
99
- | 4.8302 | 49.0 | 4214 | 4.7698 |
100
- | 4.8303 | 50.0 | 4300 | 4.7698 |
101
 
102
 
103
  ### Framework versions
 
36
  ### Training hyperparameters
37
 
38
  The following hyperparameters were used during training:
39
+ - learning_rate: 6.25e-05
40
  - train_batch_size: 14
41
  - eval_batch_size: 14
42
  - seed: 42
43
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
44
  - lr_scheduler_type: linear
45
+ - num_epochs: 10
46
 
47
  ### Training results
48
 
49
  | Training Loss | Epoch | Step | Validation Loss |
50
  |:-------------:|:-----:|:----:|:---------------:|
51
+ | 4.8071 | 1.0 | 86 | 4.7698 |
52
+ | 4.8157 | 2.0 | 172 | 4.7698 |
53
+ | 4.8171 | 3.0 | 258 | 4.7698 |
54
+ | 4.8177 | 4.0 | 344 | 4.7698 |
55
+ | 4.8177 | 5.0 | 430 | 4.7698 |
56
+ | 4.8188 | 6.0 | 516 | 4.7698 |
57
+ | 4.8203 | 7.0 | 602 | 4.7698 |
58
+ | 4.8213 | 8.0 | 688 | 4.7698 |
59
+ | 4.8093 | 9.0 | 774 | 4.7698 |
60
+ | 4.8085 | 10.0 | 860 | 4.7698 |
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
61
 
62
 
63
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:ee71fd5af984e7e240c60e4234de82c710442d3ea58172d1288091ecaad21f57
3
  size 1112905680
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:44b132acdcb1f5aa6128a9847f05ea5e3241ca72b9f3ab5b0eb4ec18e6c891aa
3
  size 1112905680
runs/Mar03_10-30-58_Software-AI/events.out.tfevents.1709449259.Software-AI.1548892.6 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:325537f251f8b933a20baa1dad6b16d5f5169dee06ede68b71c7be7c724b96eb
3
- size 11322
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ee7eb4df3ed336b8fc82b3fc45adcc6cce4265b1438e4dc1b9bd70ee1c9f216b
3
+ size 13816