Commit ac87fae by Augusto777
1 Parent(s): 82309f0

End of training

README.md CHANGED
@@ -1,107 +1,108 @@
- ---
- license: apache-2.0
- base_model: microsoft/swinv2-tiny-patch4-window8-256
- tags:
- - generated_from_trainer
- datasets:
- - imagefolder
- metrics:
- - accuracy
- model-index:
- - name: swinv2-tiny-patch4-window8-256-Ocular-Toxoplasmosis
-   results:
-   - task:
-       name: Image Classification
-       type: image-classification
-     dataset:
-       name: imagefolder
-       type: imagefolder
-       config: default
-       split: validation
-       args: default
-     metrics:
-     - name: Accuracy
-       type: accuracy
-       value: 0.08064516129032258
- ---
-
- <!-- This model card has been generated automatically according to the information the Trainer had access to. You
- should probably proofread and complete it, then remove this comment. -->
-
- # swinv2-tiny-patch4-window8-256-Ocular-Toxoplasmosis
-
- This model is a fine-tuned version of [microsoft/swinv2-tiny-patch4-window8-256](https://huggingface.co/microsoft/swinv2-tiny-patch4-window8-256) on the imagefolder dataset.
- It achieves the following results on the evaluation set:
- - Loss: 8.8834
- - Accuracy: 0.0806
-
- ## Model description
-
- More information needed
-
- ## Intended uses & limitations
-
- More information needed
-
- ## Training and evaluation data
-
- More information needed
-
- ## Training procedure
-
- ### Training hyperparameters
-
- The following hyperparameters were used during training:
- - learning_rate: 5e-05
- - train_batch_size: 32
- - eval_batch_size: 32
- - seed: 42
- - gradient_accumulation_steps: 4
- - total_train_batch_size: 128
- - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- - lr_scheduler_type: linear
- - lr_scheduler_warmup_ratio: 0.1
- - num_epochs: 40
-
- ### Training results
-
- | Training Loss | Epoch | Step | Validation Loss | Accuracy |
- |:-------------:|:-----:|:----:|:---------------:|:--------:|
- | No log | 0.73 | 2 | 8.8834 | 0.0806 |
- | No log | 1.82 | 5 | 8.8522 | 0.0806 |
- | No log | 2.91 | 8 | 8.7000 | 0.0806 |
- | 8.7803 | 4.0 | 11 | 8.2692 | 0.0806 |
- | 8.7803 | 4.73 | 13 | 7.8836 | 0.0806 |
- | 8.7803 | 5.82 | 16 | 7.3279 | 0.0806 |
- | 8.7803 | 6.91 | 19 | 6.7700 | 0.0806 |
- | 7.5847 | 8.0 | 22 | 6.1880 | 0.0806 |
- | 7.5847 | 8.73 | 24 | 5.7783 | 0.0806 |
- | 7.5847 | 9.82 | 27 | 5.2113 | 0.0806 |
- | 5.7442 | 10.91 | 30 | 4.7163 | 0.0806 |
- | 5.7442 | 12.0 | 33 | 4.2648 | 0.0806 |
- | 5.7442 | 12.73 | 35 | 3.9892 | 0.0806 |
- | 5.7442 | 13.82 | 38 | 3.6134 | 0.0806 |
- | 4.1747 | 14.91 | 41 | 3.2828 | 0.0806 |
- | 4.1747 | 16.0 | 44 | 2.9957 | 0.0806 |
- | 4.1747 | 16.73 | 46 | 2.8259 | 0.0806 |
- | 4.1747 | 17.82 | 49 | 2.5988 | 0.0806 |
- | 3.0458 | 18.91 | 52 | 2.4004 | 0.0806 |
- | 3.0458 | 20.0 | 55 | 2.2272 | 0.0806 |
- | 3.0458 | 20.73 | 57 | 2.1254 | 0.0806 |
- | 2.3301 | 21.82 | 60 | 1.9937 | 0.0806 |
- | 2.3301 | 22.91 | 63 | 1.8860 | 0.0806 |
- | 2.3301 | 24.0 | 66 | 1.8005 | 0.0806 |
- | 2.3301 | 24.73 | 68 | 1.7551 | 0.0806 |
- | 1.9107 | 25.82 | 71 | 1.7021 | 0.0806 |
- | 1.9107 | 26.91 | 74 | 1.6654 | 0.0806 |
- | 1.9107 | 28.0 | 77 | 1.6434 | 0.0806 |
- | 1.9107 | 28.73 | 79 | 1.6362 | 0.0806 |
- | 1.7061 | 29.09 | 80 | 1.6348 | 0.0806 |
-
-
- ### Framework versions
-
- - Transformers 4.36.2
- - Pytorch 2.1.2+cu118
- - Datasets 2.16.1
- - Tokenizers 0.15.0
+ ---
+ library_name: transformers
+ license: apache-2.0
+ base_model: microsoft/swinv2-tiny-patch4-window8-256
+ tags:
+ - generated_from_trainer
+ datasets:
+ - imagefolder
+ metrics:
+ - accuracy
+ model-index:
+ - name: swinv2-tiny-patch4-window8-256-Ocular-Toxoplasmosis
+   results:
+   - task:
+       name: Image Classification
+       type: image-classification
+     dataset:
+       name: imagefolder
+       type: imagefolder
+       config: default
+       split: validation
+       args: default
+     metrics:
+     - name: Accuracy
+       type: accuracy
+       value: 0.8387096774193549
+ ---
+
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
+ should probably proofread and complete it, then remove this comment. -->
+
+ # swinv2-tiny-patch4-window8-256-Ocular-Toxoplasmosis
+
+ This model is a fine-tuned version of [microsoft/swinv2-tiny-patch4-window8-256](https://huggingface.co/microsoft/swinv2-tiny-patch4-window8-256) on the imagefolder dataset.
+ It achieves the following results on the evaluation set:
+ - Loss: 0.5167
+ - Accuracy: 0.8387
+
+ ## Model description
+
+ More information needed
+
+ ## Intended uses & limitations
+
+ More information needed
+
+ ## Training and evaluation data
+
+ More information needed
+
+ ## Training procedure
+
+ ### Training hyperparameters
+
+ The following hyperparameters were used during training:
+ - learning_rate: 5e-05
+ - train_batch_size: 32
+ - eval_batch_size: 32
+ - seed: 42
+ - gradient_accumulation_steps: 4
+ - total_train_batch_size: 128
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+ - lr_scheduler_type: linear
+ - lr_scheduler_warmup_ratio: 0.1
+ - num_epochs: 40
+
+ ### Training results
+
+ | Training Loss | Epoch | Step | Validation Loss | Accuracy |
+ |:-------------:|:-------:|:----:|:---------------:|:--------:|
+ | No log | 0.7273 | 2 | 1.4057 | 0.2419 |
+ | No log | 1.8182 | 5 | 1.2100 | 0.4677 |
+ | No log | 2.9091 | 8 | 1.1808 | 0.4516 |
+ | 1.3062 | 4.0 | 11 | 1.0975 | 0.5968 |
+ | 1.3062 | 4.7273 | 13 | 1.0542 | 0.6613 |
+ | 1.3062 | 5.8182 | 16 | 0.9857 | 0.6613 |
+ | 1.3062 | 6.9091 | 19 | 0.9176 | 0.6774 |
+ | 1.0003 | 8.0 | 22 | 0.8761 | 0.6774 |
+ | 1.0003 | 8.7273 | 24 | 0.8540 | 0.6774 |
+ | 1.0003 | 9.8182 | 27 | 0.7777 | 0.6613 |
+ | 0.8096 | 10.9091 | 30 | 0.7498 | 0.6613 |
+ | 0.8096 | 12.0 | 33 | 0.7569 | 0.6613 |
+ | 0.8096 | 12.7273 | 35 | 0.7422 | 0.6774 |
+ | 0.8096 | 13.8182 | 38 | 0.7278 | 0.7097 |
+ | 0.6556 | 14.9091 | 41 | 0.6877 | 0.7258 |
+ | 0.6556 | 16.0 | 44 | 0.6433 | 0.7258 |
+ | 0.6556 | 16.7273 | 46 | 0.6324 | 0.7419 |
+ | 0.6556 | 17.8182 | 49 | 0.6390 | 0.7419 |
+ | 0.5725 | 18.9091 | 52 | 0.6504 | 0.7742 |
+ | 0.5725 | 20.0 | 55 | 0.6145 | 0.7581 |
+ | 0.5725 | 20.7273 | 57 | 0.5824 | 0.7903 |
+ | 0.5057 | 21.8182 | 60 | 0.5476 | 0.8226 |
+ | 0.5057 | 22.9091 | 63 | 0.5413 | 0.8226 |
+ | 0.5057 | 24.0 | 66 | 0.5335 | 0.8226 |
+ | 0.5057 | 24.7273 | 68 | 0.5302 | 0.8226 |
+ | 0.4945 | 25.8182 | 71 | 0.5231 | 0.8226 |
+ | 0.4945 | 26.9091 | 74 | 0.5167 | 0.8387 |
+ | 0.4945 | 28.0 | 77 | 0.5132 | 0.8387 |
+ | 0.4945 | 28.7273 | 79 | 0.5131 | 0.8387 |
+ | 0.4883 | 29.0909 | 80 | 0.5131 | 0.8387 |
+
+
+ ### Framework versions
+
+ - Transformers 4.44.2
+ - Pytorch 2.4.1+cu121
+ - Datasets 3.0.1
+ - Tokenizers 0.19.1
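
Reviewer note: the hyperparameters listed in the README can be cross-checked against the learning rates logged in trainer_state.json later in this commit. The sketch below assumes the `linear` scheduler means a linear warmup followed by a linear decay to zero, and that the warmup length is the ceiling of `lr_scheduler_warmup_ratio` times `max_steps` (8 of 80 steps). The helper name `linear_lr` is illustrative and not part of the commit.

```python
import math

# Values copied from the README and trainer_state.json in this commit.
BASE_LR = 5e-05
WARMUP_RATIO = 0.1
MAX_STEPS = 80
TRAIN_BATCH_SIZE = 32
GRAD_ACCUM_STEPS = 4

# Assumption: warmup steps are the ceiling of ratio * total steps.
WARMUP_STEPS = math.ceil(MAX_STEPS * WARMUP_RATIO)


def linear_lr(step: int) -> float:
    """Learning rate after `step` optimizer updates (hypothetical helper)."""
    if step < WARMUP_STEPS:
        # Linear ramp from 0 up to the base learning rate.
        return BASE_LR * step / max(1, WARMUP_STEPS)
    # Linear decay from the base learning rate down to 0 at MAX_STEPS.
    remaining = max(0, MAX_STEPS - step)
    return BASE_LR * remaining / max(1, MAX_STEPS - WARMUP_STEPS)


# Samples consumed per optimizer update on a single device.
effective_batch = TRAIN_BATCH_SIZE * GRAD_ACCUM_STEPS

if __name__ == "__main__":
    print("effective batch size:", effective_batch)
    for step in (10, 20, 40, 80):
        print(step, linear_lr(step))
```

Under these assumptions the effective batch size comes out to the 128 reported as `total_train_batch_size`, and the values at steps 10, 20, 40, and 80 agree with the `learning_rate` entries in `log_history`.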
all_results.json CHANGED
@@ -1,12 +1,13 @@
- {
- "epoch": 29.09,
- "eval_accuracy": 0.08064516129032258,
- "eval_loss": 8.883430480957031,
- "eval_runtime": 2.5622,
- "eval_samples_per_second": 24.198,
- "eval_steps_per_second": 0.781,
- "train_loss": 4.409568953514099,
- "train_runtime": 541.1993,
- "train_samples_per_second": 25.868,
- "train_steps_per_second": 0.148
  }
+ {
+ "epoch": 29.09090909090909,
+ "eval_accuracy": 0.8387096774193549,
+ "eval_loss": 0.5166797637939453,
+ "eval_runtime": 2.8799,
+ "eval_samples_per_second": 21.529,
+ "eval_steps_per_second": 0.694,
+ "total_flos": 3.312830060612813e+17,
+ "train_loss": 0.7290899336338044,
+ "train_runtime": 714.1327,
+ "train_samples_per_second": 19.604,
+ "train_steps_per_second": 0.112
  }
config.json CHANGED
@@ -1,58 +1,71 @@
- {
- "_name_or_path": "microsoft/swinv2-tiny-patch4-window8-256",
- "architectures": [
- "Swinv2ForImageClassification"
- ],
- "attention_probs_dropout_prob": 0.0,
- "depths": [
- 2,
- 2,
- 6,
- 2
- ],
- "drop_path_rate": 0.1,
- "embed_dim": 96,
- "encoder_stride": 32,
- "hidden_act": "gelu",
- "hidden_dropout_prob": 0.0,
- "hidden_size": 768,
- "id2label": {
- "0": "active",
- "1": "active-inactive",
- "2": "healthy",
- "3": "inactive"
- },
- "image_size": 256,
- "initializer_range": 0.02,
- "label2id": {
- "active": 0,
- "active-inactive": 1,
- "healthy": 2,
- "inactive": 3
- },
- "layer_norm_eps": 1e-05,
- "mlp_ratio": 4.0,
- "model_type": "swinv2",
- "num_channels": 3,
- "num_heads": [
- 3,
- 6,
- 12,
- 24
- ],
- "num_layers": 4,
- "patch_size": 4,
- "path_norm": true,
- "pretrained_window_sizes": [
- 0,
- 0,
- 0,
- 0
- ],
- "problem_type": "single_label_classification",
- "qkv_bias": true,
- "torch_dtype": "float32",
- "transformers_version": "4.36.2",
- "use_absolute_embeddings": false,
- "window_size": 8
- }
+ {
+ "_name_or_path": "microsoft/swinv2-tiny-patch4-window8-256",
+ "architectures": [
+ "Swinv2ForImageClassification"
+ ],
+ "attention_probs_dropout_prob": 0.0,
+ "depths": [
+ 2,
+ 2,
+ 6,
+ 2
+ ],
+ "drop_path_rate": 0.1,
+ "embed_dim": 96,
+ "encoder_stride": 32,
+ "hidden_act": "gelu",
+ "hidden_dropout_prob": 0.0,
+ "hidden_size": 768,
+ "id2label": {
+ "0": "active",
+ "1": "active-inactive",
+ "2": "healthy",
+ "3": "inactive"
+ },
+ "image_size": 256,
+ "initializer_range": 0.02,
+ "label2id": {
+ "active": 0,
+ "active-inactive": 1,
+ "healthy": 2,
+ "inactive": 3
+ },
+ "layer_norm_eps": 1e-05,
+ "mlp_ratio": 4.0,
+ "model_type": "swinv2",
+ "num_channels": 3,
+ "num_heads": [
+ 3,
+ 6,
+ 12,
+ 24
+ ],
+ "num_layers": 4,
+ "out_features": [
+ "stage4"
+ ],
+ "out_indices": [
+ 4
+ ],
+ "patch_size": 4,
+ "path_norm": true,
+ "pretrained_window_sizes": [
+ 0,
+ 0,
+ 0,
+ 0
+ ],
+ "problem_type": "single_label_classification",
+ "qkv_bias": true,
+ "stage_names": [
+ "stem",
+ "stage1",
+ "stage2",
+ "stage3",
+ "stage4"
+ ],
+ "torch_dtype": "float32",
+ "transformers_version": "4.44.2",
+ "use_absolute_embeddings": false,
+ "window_size": 8
+ }
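
Reviewer note: the `id2label` mapping in config.json is what turns a predicted class index into one of the four class names. A minimal sketch of that lookup follows. The logits are made-up values for illustration and were not produced by this model, and the helper names are hypothetical. Note that the JSON file stores the keys as strings, whereas the sketch uses integers for convenience.

```python
import math

# Mapping copied from config.json in this commit (keys converted to int).
ID2LABEL = {0: "active", 1: "active-inactive", 2: "healthy", 3: "inactive"}


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def predict_label(logits):
    """Return (label, probability) for the highest-scoring class."""
    probs = softmax(logits)
    best = max(range(len(probs)), key=probs.__getitem__)
    return ID2LABEL[best], probs[best]


# Hypothetical logits for one image, for illustration only.
example_logits = [0.2, -1.0, 2.5, 0.1]
label, prob = predict_label(example_logits)
```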
eval_results.json CHANGED
@@ -1,8 +1,8 @@
- {
- "epoch": 29.09,
- "eval_accuracy": 0.08064516129032258,
- "eval_loss": 8.883430480957031,
- "eval_runtime": 2.5622,
- "eval_samples_per_second": 24.198,
- "eval_steps_per_second": 0.781
  }
+ {
+ "epoch": 29.09090909090909,
+ "eval_accuracy": 0.8387096774193549,
+ "eval_loss": 0.5166797637939453,
+ "eval_runtime": 2.8799,
+ "eval_samples_per_second": 21.529,
+ "eval_steps_per_second": 0.694
  }
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:2046d4398c99900e190202d9adf3e1f7972aa1a911326e467b32625d0cee42f5
+ oid sha256:9088d5cdfd4e0544dc4cb2872f795548dc215c7272f7a2578464c3f1ac9e198c
  size 110356296
preprocessor_config.json CHANGED
@@ -1,22 +1,22 @@
- {
- "do_normalize": true,
- "do_rescale": true,
- "do_resize": true,
- "image_mean": [
- 0.485,
- 0.456,
- 0.406
- ],
- "image_processor_type": "ViTImageProcessor",
- "image_std": [
- 0.229,
- 0.224,
- 0.225
- ],
- "resample": 3,
- "rescale_factor": 0.00392156862745098,
- "size": {
- "height": 256,
- "width": 256
- }
- }
+ {
+ "do_normalize": true,
+ "do_rescale": true,
+ "do_resize": true,
+ "image_mean": [
+ 0.485,
+ 0.456,
+ 0.406
+ ],
+ "image_processor_type": "ViTImageProcessor",
+ "image_std": [
+ 0.229,
+ 0.224,
+ 0.225
+ ],
+ "resample": 3,
+ "rescale_factor": 0.00392156862745098,
+ "size": {
+ "height": 256,
+ "width": 256
+ }
+ }
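
Reviewer note: the preprocessor settings are textually unchanged by this commit. They describe a rescale by roughly 1/255 followed by per-channel normalization. Assuming the usual order (rescale first, then subtract the mean and divide by the standard deviation), a single RGB pixel would be transformed as sketched below. The resize to 256x256 is left out, and `normalize_pixel` is an illustrative name rather than a library function.

```python
# Values copied from preprocessor_config.json in this commit.
RESCALE_FACTOR = 0.00392156862745098  # approximately 1 / 255
IMAGE_MEAN = (0.485, 0.456, 0.406)
IMAGE_STD = (0.229, 0.224, 0.225)


def normalize_pixel(rgb):
    """Rescale 0-255 channel values to 0-1, then standardize each channel."""
    return tuple(
        (value * RESCALE_FACTOR - mean) / std
        for value, mean, std in zip(rgb, IMAGE_MEAN, IMAGE_STD)
    )


# Extremes of the input range, to show the span of normalized values.
white = normalize_pixel((255, 255, 255))
black = normalize_pixel((0, 0, 0))
```

With these constants a black pixel maps to negative values and a white pixel to positive ones in every channel, so the network sees inputs roughly centered around zero.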
runs/Oct13_15-01-08_9b69f8f7fe92/events.out.tfevents.1728831686.9b69f8f7fe92.1642.0 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:67e20f350dc036eac5e18af66e5effe677d91038def2cbec8e066bc3a47c286c
+ size 17079
runs/Oct13_15-01-08_9b69f8f7fe92/events.out.tfevents.1728832673.9b69f8f7fe92.1642.1 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:b056a3326bc87b591abc5b6b3d29054361e4ede4d40769bbc30de720bc87c957
+ size 405
train_results.json CHANGED
@@ -1,7 +1,8 @@
- {
- "epoch": 29.09,
- "train_loss": 4.409568953514099,
- "train_runtime": 541.1993,
- "train_samples_per_second": 25.868,
- "train_steps_per_second": 0.148
  }
+ {
+ "epoch": 29.09090909090909,
+ "total_flos": 3.312830060612813e+17,
+ "train_loss": 0.7290899336338044,
+ "train_runtime": 714.1327,
+ "train_samples_per_second": 19.604,
+ "train_steps_per_second": 0.112
  }
trainer_state.json CHANGED
@@ -1,348 +1,368 @@
1
- {
2
- "best_metric": 0.08064516129032258,
3
- "best_model_checkpoint": "swinv2-tiny-patch4-window8-256-Ocular-Toxoplasmosis\\checkpoint-2",
4
- "epoch": 29.09090909090909,
5
- "eval_steps": 500,
6
- "global_step": 80,
7
- "is_hyper_param_search": false,
8
- "is_local_process_zero": true,
9
- "is_world_process_zero": true,
10
- "log_history": [
11
- {
12
- "epoch": 0.73,
13
- "eval_accuracy": 0.08064516129032258,
14
- "eval_loss": 8.883430480957031,
15
- "eval_runtime": 2.2637,
16
- "eval_samples_per_second": 27.389,
17
- "eval_steps_per_second": 0.884,
18
- "step": 2
19
- },
20
- {
21
- "epoch": 1.82,
22
- "eval_accuracy": 0.08064516129032258,
23
- "eval_loss": 8.852208137512207,
24
- "eval_runtime": 2.3018,
25
- "eval_samples_per_second": 26.935,
26
- "eval_steps_per_second": 0.869,
27
- "step": 5
28
- },
29
- {
30
- "epoch": 2.91,
31
- "eval_accuracy": 0.08064516129032258,
32
- "eval_loss": 8.700010299682617,
33
- "eval_runtime": 2.5761,
34
- "eval_samples_per_second": 24.068,
35
- "eval_steps_per_second": 0.776,
36
- "step": 8
37
- },
38
- {
39
- "epoch": 3.64,
40
- "learning_rate": 4.8611111111111115e-05,
41
- "loss": 8.7803,
42
- "step": 10
43
- },
44
- {
45
- "epoch": 4.0,
46
- "eval_accuracy": 0.08064516129032258,
47
- "eval_loss": 8.269183158874512,
48
- "eval_runtime": 2.5329,
49
- "eval_samples_per_second": 24.478,
50
- "eval_steps_per_second": 0.79,
51
- "step": 11
52
- },
53
- {
54
- "epoch": 4.73,
55
- "eval_accuracy": 0.08064516129032258,
56
- "eval_loss": 7.88364839553833,
57
- "eval_runtime": 2.4337,
58
- "eval_samples_per_second": 25.475,
59
- "eval_steps_per_second": 0.822,
60
- "step": 13
61
- },
62
- {
63
- "epoch": 5.82,
64
- "eval_accuracy": 0.08064516129032258,
65
- "eval_loss": 7.327876091003418,
66
- "eval_runtime": 2.5074,
67
- "eval_samples_per_second": 24.727,
68
- "eval_steps_per_second": 0.798,
69
- "step": 16
70
- },
71
- {
72
- "epoch": 6.91,
73
- "eval_accuracy": 0.08064516129032258,
74
- "eval_loss": 6.769954204559326,
75
- "eval_runtime": 2.6471,
76
- "eval_samples_per_second": 23.422,
77
- "eval_steps_per_second": 0.756,
78
- "step": 19
79
- },
80
- {
81
- "epoch": 7.27,
82
- "learning_rate": 4.166666666666667e-05,
83
- "loss": 7.5847,
84
- "step": 20
85
- },
86
- {
87
- "epoch": 8.0,
88
- "eval_accuracy": 0.08064516129032258,
89
- "eval_loss": 6.1880202293396,
90
- "eval_runtime": 2.602,
91
- "eval_samples_per_second": 23.828,
92
- "eval_steps_per_second": 0.769,
93
- "step": 22
94
- },
95
- {
96
- "epoch": 8.73,
97
- "eval_accuracy": 0.08064516129032258,
98
- "eval_loss": 5.778294563293457,
99
- "eval_runtime": 2.4341,
100
- "eval_samples_per_second": 25.471,
101
- "eval_steps_per_second": 0.822,
102
- "step": 24
103
- },
104
- {
105
- "epoch": 9.82,
106
- "eval_accuracy": 0.08064516129032258,
107
- "eval_loss": 5.21131706237793,
108
- "eval_runtime": 2.3164,
109
- "eval_samples_per_second": 26.766,
110
- "eval_steps_per_second": 0.863,
111
- "step": 27
112
- },
113
- {
114
- "epoch": 10.91,
115
- "learning_rate": 3.472222222222222e-05,
116
- "loss": 5.7442,
117
- "step": 30
118
- },
119
- {
120
- "epoch": 10.91,
121
- "eval_accuracy": 0.08064516129032258,
122
- "eval_loss": 4.716261386871338,
123
- "eval_runtime": 2.4233,
124
- "eval_samples_per_second": 25.585,
125
- "eval_steps_per_second": 0.825,
126
- "step": 30
127
- },
128
- {
129
- "epoch": 12.0,
130
- "eval_accuracy": 0.08064516129032258,
131
- "eval_loss": 4.264786720275879,
132
- "eval_runtime": 2.513,
133
- "eval_samples_per_second": 24.671,
134
- "eval_steps_per_second": 0.796,
135
- "step": 33
136
- },
137
- {
138
- "epoch": 12.73,
139
- "eval_accuracy": 0.08064516129032258,
140
- "eval_loss": 3.989229202270508,
141
- "eval_runtime": 2.4651,
142
- "eval_samples_per_second": 25.151,
143
- "eval_steps_per_second": 0.811,
144
- "step": 35
145
- },
146
- {
147
- "epoch": 13.82,
148
- "eval_accuracy": 0.08064516129032258,
149
- "eval_loss": 3.6134493350982666,
150
- "eval_runtime": 2.6037,
151
- "eval_samples_per_second": 23.812,
152
- "eval_steps_per_second": 0.768,
153
- "step": 38
154
- },
155
- {
156
- "epoch": 14.55,
157
- "learning_rate": 2.777777777777778e-05,
158
- "loss": 4.1747,
159
- "step": 40
160
- },
161
- {
162
- "epoch": 14.91,
163
- "eval_accuracy": 0.08064516129032258,
164
- "eval_loss": 3.2827646732330322,
165
- "eval_runtime": 2.687,
166
- "eval_samples_per_second": 23.074,
167
- "eval_steps_per_second": 0.744,
168
- "step": 41
169
- },
170
- {
171
- "epoch": 16.0,
172
- "eval_accuracy": 0.08064516129032258,
173
- "eval_loss": 2.9957385063171387,
174
- "eval_runtime": 2.4174,
175
- "eval_samples_per_second": 25.647,
176
- "eval_steps_per_second": 0.827,
177
- "step": 44
178
- },
179
- {
180
- "epoch": 16.73,
181
- "eval_accuracy": 0.08064516129032258,
182
- "eval_loss": 2.825892686843872,
183
- "eval_runtime": 2.3083,
184
- "eval_samples_per_second": 26.86,
185
- "eval_steps_per_second": 0.866,
186
- "step": 46
187
- },
188
- {
189
- "epoch": 17.82,
190
- "eval_accuracy": 0.08064516129032258,
191
- "eval_loss": 2.5987932682037354,
192
- "eval_runtime": 2.4694,
193
- "eval_samples_per_second": 25.107,
194
- "eval_steps_per_second": 0.81,
195
- "step": 49
196
- },
197
- {
198
- "epoch": 18.18,
199
- "learning_rate": 2.0833333333333336e-05,
200
- "loss": 3.0458,
201
- "step": 50
202
- },
203
- {
204
- "epoch": 18.91,
205
- "eval_accuracy": 0.08064516129032258,
206
- "eval_loss": 2.400411367416382,
207
- "eval_runtime": 2.3426,
208
- "eval_samples_per_second": 26.467,
209
- "eval_steps_per_second": 0.854,
210
- "step": 52
211
- },
212
- {
213
- "epoch": 20.0,
214
- "eval_accuracy": 0.08064516129032258,
215
- "eval_loss": 2.227222204208374,
216
- "eval_runtime": 2.4914,
217
- "eval_samples_per_second": 24.885,
218
- "eval_steps_per_second": 0.803,
219
- "step": 55
220
- },
221
- {
222
- "epoch": 20.73,
223
- "eval_accuracy": 0.08064516129032258,
224
- "eval_loss": 2.125420331954956,
225
- "eval_runtime": 2.3746,
226
- "eval_samples_per_second": 26.11,
227
- "eval_steps_per_second": 0.842,
228
- "step": 57
229
- },
230
- {
231
- "epoch": 21.82,
232
- "learning_rate": 1.388888888888889e-05,
233
- "loss": 2.3301,
234
- "step": 60
235
- },
236
- {
237
- "epoch": 21.82,
238
- "eval_accuracy": 0.08064516129032258,
239
- "eval_loss": 1.9937151670455933,
240
- "eval_runtime": 2.4362,
241
- "eval_samples_per_second": 25.449,
242
- "eval_steps_per_second": 0.821,
243
- "step": 60
244
- },
245
- {
246
- "epoch": 22.91,
247
- "eval_accuracy": 0.08064516129032258,
248
- "eval_loss": 1.885993242263794,
249
- "eval_runtime": 2.4078,
250
- "eval_samples_per_second": 25.749,
251
- "eval_steps_per_second": 0.831,
252
- "step": 63
253
- },
254
- {
255
- "epoch": 24.0,
256
- "eval_accuracy": 0.08064516129032258,
257
- "eval_loss": 1.8005385398864746,
258
- "eval_runtime": 2.3561,
259
- "eval_samples_per_second": 26.314,
260
- "eval_steps_per_second": 0.849,
261
- "step": 66
262
- },
263
- {
264
- "epoch": 24.73,
265
- "eval_accuracy": 0.08064516129032258,
266
- "eval_loss": 1.7550740242004395,
267
- "eval_runtime": 2.3863,
268
- "eval_samples_per_second": 25.981,
269
- "eval_steps_per_second": 0.838,
270
- "step": 68
271
- },
272
- {
273
- "epoch": 25.45,
274
- "learning_rate": 6.944444444444445e-06,
275
- "loss": 1.9107,
276
- "step": 70
277
- },
278
- {
279
- "epoch": 25.82,
280
- "eval_accuracy": 0.08064516129032258,
281
- "eval_loss": 1.7021311521530151,
282
- "eval_runtime": 2.3225,
283
- "eval_samples_per_second": 26.696,
284
- "eval_steps_per_second": 0.861,
285
- "step": 71
286
- },
287
- {
288
- "epoch": 26.91,
289
- "eval_accuracy": 0.08064516129032258,
290
- "eval_loss": 1.6653900146484375,
291
- "eval_runtime": 2.59,
292
- "eval_samples_per_second": 23.939,
293
- "eval_steps_per_second": 0.772,
294
- "step": 74
295
- },
296
- {
297
- "epoch": 28.0,
298
- "eval_accuracy": 0.08064516129032258,
299
- "eval_loss": 1.6433522701263428,
300
- "eval_runtime": 2.5188,
301
- "eval_samples_per_second": 24.615,
302
- "eval_steps_per_second": 0.794,
303
- "step": 77
304
- },
305
- {
306
- "epoch": 28.73,
307
- "eval_accuracy": 0.08064516129032258,
308
- "eval_loss": 1.6361864805221558,
309
- "eval_runtime": 2.3834,
310
- "eval_samples_per_second": 26.013,
311
- "eval_steps_per_second": 0.839,
312
- "step": 79
313
- },
314
- {
315
- "epoch": 29.09,
316
- "learning_rate": 0.0,
317
- "loss": 1.7061,
318
- "step": 80
319
- },
320
- {
321
- "epoch": 29.09,
322
- "eval_accuracy": 0.08064516129032258,
323
- "eval_loss": 1.6347676515579224,
324
- "eval_runtime": 2.4175,
325
- "eval_samples_per_second": 25.646,
326
- "eval_steps_per_second": 0.827,
327
- "step": 80
328
- },
329
- {
330
- "epoch": 29.09,
331
- "step": 80,
332
- "total_flos": 3.312830060612813e+17,
333
- "train_loss": 4.409568953514099,
334
- "train_runtime": 541.1993,
335
- "train_samples_per_second": 25.868,
336
- "train_steps_per_second": 0.148
337
- }
338
- ],
339
- "logging_steps": 10,
340
- "max_steps": 80,
341
- "num_input_tokens_seen": 0,
342
- "num_train_epochs": 40,
343
- "save_steps": 500,
344
- "total_flos": 3.312830060612813e+17,
345
- "train_batch_size": 32,
346
- "trial_name": null,
347
- "trial_params": null
348
- }
1
+ {
2
+ "best_metric": 0.8387096774193549,
3
+ "best_model_checkpoint": "swinv2-tiny-patch4-window8-256-Ocular-Toxoplasmosis/checkpoint-74",
4
+ "epoch": 29.09090909090909,
5
+ "eval_steps": 500,
6
+ "global_step": 80,
7
+ "is_hyper_param_search": false,
8
+ "is_local_process_zero": true,
9
+ "is_world_process_zero": true,
10
+ "log_history": [
11
+ {
12
+ "epoch": 0.7272727272727273,
13
+ "eval_accuracy": 0.24193548387096775,
14
+ "eval_loss": 1.4057228565216064,
15
+ "eval_runtime": 3.9176,
16
+ "eval_samples_per_second": 15.826,
17
+ "eval_steps_per_second": 0.511,
18
+ "step": 2
19
+ },
20
+ {
21
+ "epoch": 1.8181818181818183,
22
+ "eval_accuracy": 0.46774193548387094,
23
+ "eval_loss": 1.2099871635437012,
24
+ "eval_runtime": 2.8828,
25
+ "eval_samples_per_second": 21.507,
26
+ "eval_steps_per_second": 0.694,
27
+ "step": 5
28
+ },
29
+ {
30
+ "epoch": 2.909090909090909,
31
+ "eval_accuracy": 0.45161290322580644,
32
+ "eval_loss": 1.18076491355896,
33
+ "eval_runtime": 2.8384,
34
+ "eval_samples_per_second": 21.843,
35
+ "eval_steps_per_second": 0.705,
36
+ "step": 8
37
+ },
38
+ {
39
+ "epoch": 3.6363636363636362,
40
+ "grad_norm": 10.30179214477539,
41
+ "learning_rate": 4.8611111111111115e-05,
42
+ "loss": 1.3062,
43
+ "step": 10
44
+ },
45
+ {
46
+ "epoch": 4.0,
47
+ "eval_accuracy": 0.5967741935483871,
48
+ "eval_loss": 1.0975382328033447,
49
+ "eval_runtime": 3.6135,
50
+ "eval_samples_per_second": 17.158,
51
+ "eval_steps_per_second": 0.553,
52
+ "step": 11
53
+ },
54
+ {
55
+ "epoch": 4.7272727272727275,
56
+ "eval_accuracy": 0.6612903225806451,
57
+ "eval_loss": 1.0542328357696533,
58
+ "eval_runtime": 3.5171,
59
+ "eval_samples_per_second": 17.628,
60
+ "eval_steps_per_second": 0.569,
61
+ "step": 13
62
+ },
63
+ {
64
+ "epoch": 5.818181818181818,
65
+ "eval_accuracy": 0.6612903225806451,
66
+ "eval_loss": 0.9857348799705505,
67
+ "eval_runtime": 2.887,
68
+ "eval_samples_per_second": 21.475,
69
+ "eval_steps_per_second": 0.693,
70
+ "step": 16
71
+ },
72
+ {
73
+ "epoch": 6.909090909090909,
74
+ "eval_accuracy": 0.6774193548387096,
75
+ "eval_loss": 0.9176284074783325,
76
+ "eval_runtime": 2.8754,
77
+ "eval_samples_per_second": 21.562,
78
+ "eval_steps_per_second": 0.696,
79
+ "step": 19
80
+ },
81
+ {
82
+ "epoch": 7.2727272727272725,
83
+ "grad_norm": 4.642858982086182,
84
+ "learning_rate": 4.166666666666667e-05,
85
+ "loss": 1.0003,
86
+ "step": 20
87
+ },
88
+ {
89
+ "epoch": 8.0,
90
+ "eval_accuracy": 0.6774193548387096,
91
+ "eval_loss": 0.8760596513748169,
92
+ "eval_runtime": 3.7173,
93
+ "eval_samples_per_second": 16.679,
94
+ "eval_steps_per_second": 0.538,
95
+ "step": 22
96
+ },
97
+ {
98
+ "epoch": 8.727272727272727,
99
+ "eval_accuracy": 0.6774193548387096,
100
+ "eval_loss": 0.8539677262306213,
101
+ "eval_runtime": 3.3041,
102
+ "eval_samples_per_second": 18.764,
103
+ "eval_steps_per_second": 0.605,
104
+ "step": 24
105
+ },
106
+ {
107
+ "epoch": 9.818181818181818,
108
+ "eval_accuracy": 0.6612903225806451,
109
+ "eval_loss": 0.7776592969894409,
110
+ "eval_runtime": 3.1776,
111
+ "eval_samples_per_second": 19.511,
112
+ "eval_steps_per_second": 0.629,
113
+ "step": 27
114
+ },
115
+ {
116
+ "epoch": 10.909090909090908,
117
+ "grad_norm": 5.499239921569824,
118
+ "learning_rate": 3.472222222222222e-05,
119
+ "loss": 0.8096,
120
+ "step": 30
121
+ },
122
+ {
123
+ "epoch": 10.909090909090908,
124
+ "eval_accuracy": 0.6612903225806451,
125
+ "eval_loss": 0.7497676014900208,
126
+ "eval_runtime": 3.1503,
127
+ "eval_samples_per_second": 19.68,
128
+ "eval_steps_per_second": 0.635,
129
+ "step": 30
130
+ },
131
+ {
132
+ "epoch": 12.0,
133
+ "eval_accuracy": 0.6612903225806451,
134
+ "eval_loss": 0.7568932175636292,
135
+ "eval_runtime": 3.7657,
136
+ "eval_samples_per_second": 16.465,
137
+ "eval_steps_per_second": 0.531,
138
+ "step": 33
139
+ },
140
+ {
141
+ "epoch": 12.727272727272727,
142
+ "eval_accuracy": 0.6774193548387096,
143
+ "eval_loss": 0.7422052025794983,
144
+ "eval_runtime": 3.1158,
145
+ "eval_samples_per_second": 19.898,
146
+ "eval_steps_per_second": 0.642,
147
+ "step": 35
148
+ },
149
+ {
150
+ "epoch": 13.818181818181818,
151
+ "eval_accuracy": 0.7096774193548387,
152
+ "eval_loss": 0.7278109788894653,
153
+ "eval_runtime": 2.8488,
154
+ "eval_samples_per_second": 21.763,
155
+ "eval_steps_per_second": 0.702,
156
+ "step": 38
157
+ },
158
+ {
159
+ "epoch": 14.545454545454545,
160
+ "grad_norm": 8.175309181213379,
161
+ "learning_rate": 2.777777777777778e-05,
162
+ "loss": 0.6556,
163
+ "step": 40
164
+ },
165
+ {
166
+ "epoch": 14.909090909090908,
167
+ "eval_accuracy": 0.7258064516129032,
168
+ "eval_loss": 0.687738835811615,
169
+ "eval_runtime": 2.8406,
170
+ "eval_samples_per_second": 21.827,
171
+ "eval_steps_per_second": 0.704,
172
+ "step": 41
173
+ },
174
+ {
175
+ "epoch": 16.0,
176
+ "eval_accuracy": 0.7258064516129032,
177
+ "eval_loss": 0.6433460116386414,
178
+ "eval_runtime": 4.134,
179
+ "eval_samples_per_second": 14.998,
180
+ "eval_steps_per_second": 0.484,
181
+ "step": 44
182
+ },
183
+ {
184
+ "epoch": 16.727272727272727,
185
+ "eval_accuracy": 0.7419354838709677,
186
+ "eval_loss": 0.6324245929718018,
187
+ "eval_runtime": 2.8555,
188
+ "eval_samples_per_second": 21.713,
189
+ "eval_steps_per_second": 0.7,
190
+ "step": 46
191
+ },
192
+ {
193
+ "epoch": 17.818181818181817,
194
+ "eval_accuracy": 0.7419354838709677,
195
+ "eval_loss": 0.6389685273170471,
196
+ "eval_runtime": 2.8092,
197
+ "eval_samples_per_second": 22.07,
198
+ "eval_steps_per_second": 0.712,
199
+ "step": 49
200
+ },
201
+ {
202
+ "epoch": 18.181818181818183,
203
+ "grad_norm": 5.849218845367432,
204
+ "learning_rate": 2.0833333333333336e-05,
205
+ "loss": 0.5725,
206
+ "step": 50
207
+ },
208
+ {
209
+ "epoch": 18.90909090909091,
210
+ "eval_accuracy": 0.7741935483870968,
211
+ "eval_loss": 0.6503620743751526,
212
+ "eval_runtime": 2.8945,
213
+ "eval_samples_per_second": 21.42,
214
+ "eval_steps_per_second": 0.691,
215
+ "step": 52
216
+ },
217
+ {
218
+ "epoch": 20.0,
219
+ "eval_accuracy": 0.7580645161290323,
220
+ "eval_loss": 0.6144644618034363,
221
+ "eval_runtime": 4.0673,
222
+ "eval_samples_per_second": 15.244,
223
+ "eval_steps_per_second": 0.492,
224
+ "step": 55
225
+ },
226
+ {
227
+ "epoch": 20.727272727272727,
228
+ "eval_accuracy": 0.7903225806451613,
229
+ "eval_loss": 0.5823854207992554,
230
+ "eval_runtime": 3.0464,
231
+ "eval_samples_per_second": 20.352,
232
+ "eval_steps_per_second": 0.657,
233
+ "step": 57
234
+ },
235
+ {
236
+ "epoch": 21.818181818181817,
237
+ "grad_norm": 6.505163669586182,
238
+ "learning_rate": 1.388888888888889e-05,
239
+ "loss": 0.5057,
240
+ "step": 60
241
+ },
242
+ {
243
+ "epoch": 21.818181818181817,
244
+ "eval_accuracy": 0.8225806451612904,
245
+ "eval_loss": 0.547602117061615,
246
+ "eval_runtime": 2.9115,
247
+ "eval_samples_per_second": 21.295,
248
+ "eval_steps_per_second": 0.687,
249
+ "step": 60
250
+ },
251
+ {
252
+ "epoch": 22.90909090909091,
253
+ "eval_accuracy": 0.8225806451612904,
254
+ "eval_loss": 0.5412537455558777,
255
+ "eval_runtime": 2.8707,
256
+ "eval_samples_per_second": 21.598,
257
+ "eval_steps_per_second": 0.697,
258
+ "step": 63
259
+ },
260
+ {
261
+ "epoch": 24.0,
262
+ "eval_accuracy": 0.8225806451612904,
263
+ "eval_loss": 0.5334817171096802,
264
+ "eval_runtime": 3.2898,
265
+ "eval_samples_per_second": 18.846,
266
+ "eval_steps_per_second": 0.608,
267
+ "step": 66
268
+ },
269
+ {
270
+ "epoch": 24.727272727272727,
271
+ "eval_accuracy": 0.8225806451612904,
272
+ "eval_loss": 0.5301870703697205,
273
+ "eval_runtime": 3.8383,
274
+ "eval_samples_per_second": 16.153,
275
+ "eval_steps_per_second": 0.521,
276
+ "step": 68
277
+ },
278
+ {
279
+ "epoch": 25.454545454545453,
280
+ "grad_norm": 8.245793342590332,
281
+ "learning_rate": 6.944444444444445e-06,
282
+ "loss": 0.4945,
283
+ "step": 70
284
+ },
285
+ {
286
+ "epoch": 25.818181818181817,
287
+ "eval_accuracy": 0.8225806451612904,
288
+ "eval_loss": 0.5231319665908813,
289
+ "eval_runtime": 3.1472,
290
+ "eval_samples_per_second": 19.7,
291
+ "eval_steps_per_second": 0.635,
292
+ "step": 71
293
+ },
294
+ {
295
+ "epoch": 26.90909090909091,
296
+ "eval_accuracy": 0.8387096774193549,
297
+ "eval_loss": 0.5166797637939453,
298
+ "eval_runtime": 3.151,
299
+ "eval_samples_per_second": 19.677,
300
+ "eval_steps_per_second": 0.635,
301
+ "step": 74
302
+ },
303
+ {
304
+ "epoch": 28.0,
305
+ "eval_accuracy": 0.8387096774193549,
306
+ "eval_loss": 0.5131666660308838,
307
+ "eval_runtime": 3.202,
308
+ "eval_samples_per_second": 19.363,
309
+ "eval_steps_per_second": 0.625,
310
+ "step": 77
311
+ },
312
+ {
313
+ "epoch": 28.727272727272727,
314
+ "eval_accuracy": 0.8387096774193549,
315
+ "eval_loss": 0.513070821762085,
316
+ "eval_runtime": 4.4164,
317
+ "eval_samples_per_second": 14.039,
318
+ "eval_steps_per_second": 0.453,
319
+ "step": 79
320
+ },
321
+ {
322
+ "epoch": 29.09090909090909,
323
+ "grad_norm": 9.397185325622559,
324
+ "learning_rate": 0.0,
325
+ "loss": 0.4883,
326
+ "step": 80
327
+ },
328
+ {
329
+ "epoch": 29.09090909090909,
330
+ "eval_accuracy": 0.8387096774193549,
331
+ "eval_loss": 0.5131446719169617,
332
+ "eval_runtime": 2.9088,
333
+ "eval_samples_per_second": 21.314,
334
+ "eval_steps_per_second": 0.688,
335
+ "step": 80
336
+ },
337
+ {
338
+ "epoch": 29.09090909090909,
339
+ "step": 80,
340
+ "total_flos": 3.312830060612813e+17,
341
+ "train_loss": 0.7290899336338044,
342
+ "train_runtime": 714.1327,
343
+ "train_samples_per_second": 19.604,
344
+ "train_steps_per_second": 0.112
345
+ }
346
+ ],
347
+ "logging_steps": 10,
348
+ "max_steps": 80,
349
+ "num_input_tokens_seen": 0,
350
+ "num_train_epochs": 40,
351
+ "save_steps": 500,
352
+ "stateful_callbacks": {
353
+ "TrainerControl": {
354
+ "args": {
355
+ "should_epoch_stop": false,
356
+ "should_evaluate": false,
357
+ "should_log": false,
358
+ "should_save": true,
359
+ "should_training_stop": true
360
+ },
361
+ "attributes": {}
362
+ }
363
+ },
364
+ "total_flos": 3.312830060612813e+17,
365
+ "train_batch_size": 32,
366
+ "trial_name": null,
367
+ "trial_params": null
368
+ }
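
Reviewer note: the accuracy values in the new log are all consistent with multiples of 1/62, which suggests a validation split of 62 images. This is an inference from the numbers, not something the commit states. The sketch below recovers that count from the final evaluation throughput and picks the first evaluation step that reaches the best accuracy, which is consistent with the recorded `checkpoint-74`. Only a few `log_history` entries are copied in, and the variable names are illustrative.

```python
# Final evaluation figures copied from eval_results.json in this commit.
EVAL_RUNTIME = 2.8799
EVAL_SAMPLES_PER_SECOND = 21.529
EVAL_ACCURACY = 0.8387096774193549

# A few (step, eval_accuracy) pairs copied from log_history.
EVAL_LOG = [
    (68, 0.8225806451612904),
    (71, 0.8225806451612904),
    (74, 0.8387096774193549),
    (77, 0.8387096774193549),
    (79, 0.8387096774193549),
    (80, 0.8387096774193549),
]

# Inferred sample count: throughput times runtime, rounded to an integer.
n_eval = round(EVAL_SAMPLES_PER_SECOND * EVAL_RUNTIME)
n_correct = round(EVAL_ACCURACY * n_eval)

# First step that attains the maximum accuracy (ties keep the earliest).
best_accuracy = max(acc for _, acc in EVAL_LOG)
best_step = next(step for step, acc in EVAL_LOG if acc == best_accuracy)
```

Read this way, the reported accuracy of 0.8387 corresponds to 52 of 62 validation images, and the loss of 0.5167 quoted in the README is the value logged at step 74 rather than at the final step.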
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:707b23c3a5fc468561b9ad9ec6c5cb53ee88b1b9a1f9cd003dd50ee2da9987b5
- size 4792
+ oid sha256:d721724455b9c9117091e3482c977b759172e83b66a5f194bff2b562307699ec
+ size 5304